Query lcl|NC_018861.1_cdsid_YP_006908137.1 [gene=D302_gp075] [protein=major capsid protein] [protein_id=YP_006908137.1] [location=complement(68394..69791)]
Match_columns 465
No_of_seqs 209 out of 446
Neff 5.7
Searched_HMMs 1612
Date Thu Nov 7 14:20:40 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_75 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_75_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:6601 Length: 528 # 100.0 8E-198 5E-201 1101.1 32.9 438 5-465 1-511 (528)
2 protein:vir:101039 Length: 529 100.0 6E-196 4E-199 1090.9 29.3 441 4-465 1-512 (529)
3 protein:vir:101811 Length: 529 100.0 2E-192 9E-196 1072.2 30.2 441 4-465 1-512 (529)
4 protein:vir:106286 Length: 534 100.0 7E-192 4E-195 1068.7 32.3 445 6-465 1-517 (534)
5 protein:vir:80986 Length: 528 100.0 2E-190 1E-193 1061.0 33.4 438 5-465 1-511 (528)
6 protein:vir:100603 Length: 529 100.0 6E-190 4E-193 1057.9 32.1 441 1-465 1-512 (529)
7 protein:vir:6901 Length: 522 # 100.0 2E-189 1E-192 1055.6 32.1 439 1-465 1-506 (522)
8 protein:vir:98143 Length: 524 100.0 3E-188 2E-191 1049.1 30.7 440 1-465 1-507 (524)
9 protein:vir:5942 Length: 523 # 100.0 1E-187 8E-191 1045.2 32.2 448 1-461 1-523 (523)
10 protein:vir:103463 Length: 521 100.0 3E-187 2E-190 1043.5 32.7 441 1-465 1-505 (521)
11 protein:vir:5670 Length: 514 # 100.0 4E-187 2E-190 1042.7 31.3 430 9-459 1-514 (514)
12 protein:vir:7214 Length: 521 # 100.0 8E-187 5E-190 1040.7 32.5 443 1-465 1-503 (521)
13 protein:vir:107947 Length: 519 100.0 5E-186 3E-189 1036.5 30.7 439 6-465 1-502 (519)
14 protein:vir:106998 Length: 468 100.0 1E-184 7E-188 1029.1 32.7 404 5-463 1-468 (468)
15 protein:vir:104915 Length: 470 100.0 1E-184 8E-188 1028.8 31.8 415 1-463 1-470 (470)
16 protein:vir:104549 Length: 462 100.0 2E-184 1E-187 1027.4 29.6 412 6-463 1-462 (462)
17 protein:vir:103181 Length: 457 100.0 1E-182 7E-186 1018.0 30.7 409 6-465 1-450 (457)
18 protein:vir:5942 Length: 523 # 99.7 2.2E-21 1.4E-24 133.8 4.1 390 45-465 1-506 (523)
19 protein:vir:107947 Length: 519 99.0 2.7E-12 1.7E-15 84.0 9.9 382 19-465 1-471 (519)
20 protein:vir:104915 Length: 470 98.8 2.6E-11 1.6E-14 78.6 9.1 369 16-465 1-438 (470)
21 protein:vir:6901 Length: 522 # 98.5 3E-09 1.8E-12 67.3 11.6 377 32-465 1-481 (522)
22 protein:vir:7214 Length: 521 # 98.3 3.1E-08 1.9E-11 61.8 12.2 375 16-465 1-480 (521)
23 protein:vir:5670 Length: 514 # 98.2 7.8E-11 4.8E-14 76.0 -4.1 387 33-465 1-469 (514)
24 protein:vir:106286 Length: 534 98.2 5.4E-08 3.4E-11 60.4 11.1 385 30-465 1-486 (534)
25 protein:vir:106998 Length: 468 98.2 3.7E-08 2.3E-11 61.3 9.8 352 18-465 1-448 (468)
26 protein:vir:103181 Length: 457 98.1 3.6E-07 2.2E-10 55.9 14.4 345 18-465 1-425 (457)
27 protein:vir:98143 Length: 524 98.1 8.4E-08 5.2E-11 59.4 10.2 375 18-465 1-502 (524)
28 protein:vir:103463 Length: 521 97.8 2.9E-06 1.8E-09 50.9 14.3 374 16-465 1-480 (521)
29 protein:vir:9704 Length: 394 # 97.1 0.00018 1.1E-07 41.1 19.7 301 1-465 49-393 (394)
30 protein:vir:8420 Length: 477 # 96.8 0.00036 2.2E-07 39.5 19.2 331 1-465 90-472 (477)
31 protein:vir:9820 Length: 272 # 96.5 0.0006 3.7E-07 38.2 15.9 267 136-465 1-270 (272)
32 protein:vir:3033 Length: 272 # 96.5 0.0006 3.7E-07 38.2 15.9 267 136-465 1-270 (272)
33 protein:vir:4830 Length: 397 # 96.4 0.00062 3.8E-07 38.2 16.2 304 1-465 29-388 (397)
34 protein:vir:1886 Length: 385 # 95.9 0.0013 7.9E-07 36.4 20.1 308 1-462 39-385 (385)
35 protein:vir:191 Length: 385 # 95.9 0.0013 7.9E-07 36.4 20.1 308 1-462 39-385 (385)
36 protein:vir:4600 Length: 415 # 95.7 0.0016 9.6E-07 36.0 21.2 309 1-465 44-406 (415)
37 protein:vir:4700 Length: 415 # 95.7 0.0016 9.6E-07 36.0 21.2 309 1-465 44-406 (415)
38 protein:vir:98339 Length: 415 95.7 0.0016 9.7E-07 36.0 20.2 315 1-465 40-407 (415)
39 protein:vir:79987 Length: 415 95.7 0.0016 9.7E-07 36.0 20.2 315 1-465 40-407 (415)
40 protein:vir:81100 Length: 415 95.7 0.0016 9.7E-07 36.0 20.2 315 1-465 40-407 (415)
41 protein:vir:9410 Length: 415 # 95.7 0.0016 9.8E-07 35.9 21.4 314 1-465 51-407 (415)
42 protein:vir:93742 Length: 274 95.4 0.0021 1.3E-06 35.3 15.3 268 136-461 1-274 (274)
43 protein:vir:96123 Length: 274 95.0 0.003 1.9E-06 34.4 14.3 265 136-455 1-274 (274)
44 protein:vir:4953 Length: 397 # 94.9 0.0032 2E-06 34.3 17.7 302 1-457 34-397 (397)
45 protein:vir:78523 Length: 338 94.5 0.0042 2.6E-06 33.6 17.5 300 21-456 1-338 (338)
46 protein:vir:3870 Length: 400 # 94.2 0.0052 3.2E-06 33.1 19.2 297 1-465 48-400 (400)
47 protein:vir:7771 Length: 330 # 93.9 0.0059 3.7E-06 32.8 15.0 272 154-451 1-330 (330)
48 protein:vir:94142 Length: 304 93.5 0.0074 4.6E-06 32.3 13.9 256 164-462 1-304 (304)
49 protein:vir:105905 Length: 304 93.5 0.0074 4.6E-06 32.3 13.9 256 164-462 1-304 (304)
50 protein:vir:95898 Length: 274 93.3 0.0081 5E-06 32.1 15.9 261 136-455 1-274 (274)
51 protein:vir:96262 Length: 274 93.3 0.0081 5E-06 32.1 15.9 261 136-455 1-274 (274)
52 protein:vir:94673 Length: 419 93.3 0.0082 5.1E-06 32.0 19.5 323 1-459 50-419 (419)
53 protein:vir:4339 Length: 395 # 93.0 0.009 5.6E-06 31.8 21.5 308 1-462 39-395 (395)
54 protein:vir:81160 Length: 371 92.9 0.0095 5.9E-06 31.7 17.3 300 1-462 16-371 (371)
55 protein:vir:10364 Length: 390 92.9 0.0097 6E-06 31.6 21.8 310 1-456 44-390 (390)
56 protein:vir:2344 Length: 397 # 92.3 0.012 7.6E-06 31.1 21.0 291 36-465 1-332 (397)
57 protein:vir:81227 Length: 413 92.1 0.013 7.8E-06 31.0 21.0 321 1-465 32-412 (413)
58 protein:vir:4997 Length: 397 # 91.9 0.014 8.5E-06 30.8 19.6 300 1-465 43-388 (397)
59 protein:vir:104256 Length: 458 91.8 0.014 8.7E-06 30.7 19.2 323 1-465 88-457 (458)
60 protein:vir:97053 Length: 390 91.8 0.014 8.9E-06 30.7 22.2 309 1-456 44-390 (390)
61 protein:vir:80930 Length: 278 91.7 0.014 9E-06 30.7 16.2 265 136-452 1-278 (278)
62 protein:vir:3613 Length: 272 # 91.7 0.015 9.2E-06 30.6 16.0 261 136-462 1-272 (272)
63 protein:vir:6212 Length: 434 # 91.0 0.018 1.1E-05 30.2 17.5 313 1-465 79-432 (434)
64 protein:vir:94494 Length: 274 90.7 0.019 1.2E-05 30.0 15.2 270 136-465 1-271 (274)
65 protein:vir:97433 Length: 274 90.7 0.019 1.2E-05 30.0 15.2 270 136-465 1-271 (274)
66 protein:vir:41 Length: 299 # N 89.3 0.027 1.7E-05 29.2 17.9 265 149-460 1-299 (299)
67 protein:vir:94622 Length: 341 88.5 0.032 2E-05 28.8 11.6 290 127-464 1-341 (341)
68 protein:vir:105038 Length: 428 88.5 0.032 2E-05 28.8 19.0 312 1-462 43-428 (428)
69 protein:vir:81070 Length: 390 88.1 0.034 2.1E-05 28.6 18.7 306 1-456 32-390 (390)
70 protein:vir:94711 Length: 347 87.0 0.042 2.6E-05 28.2 11.4 281 117-451 1-347 (347)
71 protein:vir:104085 Length: 320 87.0 0.042 2.6E-05 28.2 19.2 283 35-465 1-317 (320)
72 protein:vir:4856 Length: 293 # 86.9 0.043 2.6E-05 28.1 17.8 265 45-465 1-290 (293)
73 protein:vir:96392 Length: 324 86.6 0.044 2.8E-05 28.0 20.7 291 1-465 1-321 (324)
74 protein:vir:78830 Length: 324 86.6 0.044 2.8E-05 28.0 20.7 291 1-465 1-321 (324)
75 protein:vir:2504 Length: 305 # 85.9 0.049 3.1E-05 27.7 18.0 274 46-465 1-299 (305)
76 protein:vir:96833 Length: 275 85.6 0.051 3.2E-05 27.6 13.4 267 136-457 1-275 (275)
77 protein:vir:8885 Length: 347 # 85.4 0.053 3.3E-05 27.6 11.6 280 140-452 1-347 (347)
78 protein:vir:101607 Length: 379 83.7 0.066 4.1E-05 27.0 21.6 306 1-462 39-379 (379)
79 protein:vir:1268 Length: 397 # 83.3 0.07 4.3E-05 26.9 21.1 290 1-458 84-397 (397)
80 protein:vir:1239 Length: 274 # 83.1 0.072 4.4E-05 26.9 15.2 267 136-454 1-274 (274)
81 protein:vir:107593 Length: 392 82.6 0.075 4.7E-05 26.7 20.0 305 1-465 35-387 (392)
82 protein:vir:105004 Length: 392 82.6 0.075 4.7E-05 26.7 20.0 305 1-465 35-387 (392)
83 protein:vir:102082 Length: 392 82.6 0.075 4.7E-05 26.7 20.0 305 1-465 35-387 (392)
84 protein:vir:102873 Length: 392 82.6 0.075 4.7E-05 26.7 20.0 305 1-465 35-387 (392)
85 protein:vir:2430 Length: 318 # 80.6 0.093 5.8E-05 26.2 19.8 285 18-465 1-314 (318)
86 protein:vir:8187 Length: 311 # 80.4 0.096 5.9E-05 26.2 19.3 276 47-465 1-311 (311)
87 protein:vir:9759 Length: 303 # 80.1 0.098 6.1E-05 26.1 19.0 277 46-463 1-303 (303)
88 protein:vir:97148 Length: 324 80.0 0.099 6.1E-05 26.1 20.9 290 1-465 1-316 (324)
89 protein:vir:9574 Length: 300 # 79.7 0.1 6.3E-05 26.0 19.2 273 46-461 1-300 (300)
90 protein:vir:95107 Length: 270 78.5 0.11 7E-05 25.8 14.8 265 146-465 1-270 (270)
91 protein:vir:99749 Length: 324 77.8 0.12 7.5E-05 25.6 19.8 291 1-465 1-316 (324)
92 protein:vir:7409 Length: 408 # 77.7 0.12 7.6E-05 25.6 18.8 310 1-465 58-407 (408)
93 protein:vir:100884 Length: 389 75.9 0.14 8.8E-05 25.2 19.6 303 1-465 26-385 (389)
94 protein:vir:103955 Length: 324 74.8 0.15 9.6E-05 25.0 18.9 291 1-465 1-316 (324)
95 protein:vir:100135 Length: 418 74.5 0.16 9.8E-05 25.0 19.8 311 1-465 67-416 (418)
96 protein:vir:100247 Length: 425 74.3 0.16 9.9E-05 25.0 19.7 315 1-460 66-425 (425)
97 protein:vir:80684 Length: 315 73.6 0.17 0.0001 24.8 12.2 279 136-465 1-307 (315)
98 protein:vir:1383 Length: 421 # 71.1 0.2 0.00012 24.4 16.0 306 1-465 32-395 (421)
99 protein:vir:1084 Length: 437 # 70.0 0.21 0.00013 24.2 17.7 303 1-465 88-428 (437)
100 protein:vir:95763 Length: 297 70.0 0.21 0.00013 24.2 15.3 274 1-464 1-297 (297)
101 protein:vir:94424 Length: 387 68.3 0.24 0.00015 24.0 13.3 301 1-465 34-384 (387)
102 protein:vir:2685 Length: 387 # 68.3 0.24 0.00015 24.0 13.3 301 1-465 34-384 (387)
103 protein:vir:96978 Length: 387 68.3 0.24 0.00015 24.0 13.3 301 1-465 34-384 (387)
104 protein:vir:4092 Length: 390 # 67.9 0.25 0.00015 23.9 18.1 321 1-465 4-387 (390)
105 protein:vir:6242 Length: 390 # 66.1 0.27 0.00017 23.7 19.8 313 1-465 27-390 (390)
106 protein:vir:99675 Length: 324 64.1 0.31 0.00019 23.4 13.1 255 164-465 1-307 (324)
107 protein:vir:1433 Length: 435 # 63.6 0.31 0.00019 23.3 19.2 298 1-441 45-435 (435)
108 protein:vir:105334 Length: 276 62.9 0.33 0.0002 23.2 14.7 270 136-462 1-276 (276)
109 protein:vir:100172 Length: 394 62.4 0.34 0.00021 23.2 19.2 308 1-465 47-391 (394)
110 protein:vir:9309 Length: 324 # 60.7 0.37 0.00023 23.0 19.4 293 1-465 1-317 (324)
111 protein:vir:3845 Length: 395 # 60.0 0.38 0.00024 22.9 20.5 305 1-465 28-393 (395)
112 protein:vir:108211 Length: 318 59.0 0.4 0.00025 22.8 11.8 292 84-465 1-318 (318)
113 protein:vir:1781 Length: 221 # 56.7 0.45 0.00028 22.5 14.4 196 234-453 1-221 (221)
114 protein:vir:78223 Length: 333 56.5 0.45 0.00028 22.5 16.2 293 32-464 1-333 (333)
115 protein:vir:96223 Length: 324 51.9 0.57 0.00035 21.9 20.5 292 1-465 1-316 (324)
116 protein:vir:79078 Length: 307 51.5 0.58 0.00036 21.9 7.7 273 154-465 1-303 (307)
117 protein:vir:7855 Length: 497 # 51.3 0.59 0.00036 21.9 19.4 323 1-465 80-496 (497)
118 protein:vir:101650 Length: 497 51.3 0.59 0.00036 21.9 19.4 323 1-465 80-496 (497)
119 protein:vir:99920 Length: 311 50.9 0.6 0.00037 21.8 19.2 270 32-463 1-311 (311)
120 protein:vir:4511 Length: 409 # 50.8 0.6 0.00037 21.8 18.9 315 1-465 29-409 (409)
121 protein:vir:94771 Length: 298 50.8 0.6 0.00037 21.8 15.9 259 182-462 1-298 (298)
122 protein:vir:3364 Length: 347 # 50.5 0.61 0.00038 21.8 10.5 288 117-459 1-347 (347)
123 protein:vir:102119 Length: 404 50.5 0.61 0.00038 21.8 17.2 311 1-465 28-403 (404)
124 protein:vir:9361 Length: 402 # 50.1 0.62 0.00038 21.7 16.2 301 1-465 57-399 (402)
125 protein:vir:99888 Length: 309 47.6 0.7 0.00043 21.4 7.4 269 164-465 1-304 (309)
126 protein:vir:4226 Length: 326 # 42.6 0.88 0.00054 20.9 21.3 292 20-465 1-324 (326)
127 protein:vir:739 Length: 231 # 42.3 0.89 0.00055 20.9 15.0 216 192-462 1-231 (231)
128 protein:vir:94576 Length: 347 41.9 0.91 0.00056 20.8 12.9 284 117-456 1-347 (347)
129 protein:vir:1638 Length: 298 # 41.5 0.93 0.00057 20.8 20.2 272 46-462 1-298 (298)
130 protein:vir:1541 Length: 347 # 41.0 0.95 0.00059 20.7 14.3 287 117-459 1-347 (347)
131 protein:vir:1025 Length: 408 # 38.7 1.1 0.00065 20.5 18.9 307 1-465 53-405 (408)
132 protein:vir:7990 Length: 273 # 37.0 1.1 0.00071 20.3 14.8 264 146-462 1-273 (273)
133 protein:vir:5739 Length: 366 # 36.4 1.2 0.00073 20.2 19.5 313 1-462 1-366 (366)
134 protein:vir:8102 Length: 543 # 33.6 1.3 0.00083 19.9 17.1 311 1-465 188-543 (543)
135 protein:vir:102605 Length: 273 33.2 1.4 0.00085 19.8 16.4 264 146-462 1-273 (273)
136 protein:vir:105822 Length: 273 33.2 1.4 0.00085 19.8 16.4 264 146-462 1-273 (273)
137 protein:vir:107882 Length: 307 33.0 1.4 0.00086 19.8 6.6 275 154-465 1-303 (307)
138 protein:vir:96762 Length: 632 33.0 1.4 0.00086 19.8 17.9 307 1-465 269-632 (632)
139 protein:vir:93881 Length: 387 31.6 1.5 0.00092 19.6 17.2 305 1-465 42-384 (387)
140 protein:vir:1328 Length: 392 # 30.6 1.6 0.00097 19.5 22.0 317 1-465 44-392 (392)
141 protein:vir:80376 Length: 435 30.5 1.6 0.00097 19.5 20.5 302 1-441 52-435 (435)
142 protein:vir:79008 Length: 299 28.1 1.8 0.0011 19.2 10.1 267 154-465 1-292 (299)
143 protein:vir:6324 Length: 335 # 27.9 1.8 0.0011 19.2 12.5 289 154-465 1-335 (335)
144 protein:vir:78935 Length: 335 26.5 1.9 0.0012 19.0 13.2 290 154-465 1-335 (335)
145 protein:vir:3136 Length: 322 # 25.7 2 0.0013 18.9 7.0 282 129-455 1-322 (322)
146 protein:vir:80213 Length: 334 25.7 2 0.0013 18.9 14.7 284 117-464 1-334 (334)
147 protein:vir:9643 Length: 377 # 25.6 2 0.0013 18.9 15.7 311 1-462 1-377 (377)
148 protein:vir:78920 Length: 290 23.4 2.3 0.0014 18.6 13.6 265 136-465 1-286 (290)
149 protein:vir:10450 Length: 344 21.3 2.6 0.0016 18.3 13.2 289 117-462 1-344 (344)
150 protein:vir:485 Length: 407 # 20.1 2.8 0.0018 18.1 18.4 310 1-465 40-401 (407)

No 1
>protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275
Probab=100.00 E-value=8.2e-198 Score=1101.09 Aligned_cols=438 Identities=27% Similarity=0.370 Sum_probs=379.8

Q ss_pred chhhhHHHhhhhhhccccccCh----hhhhheehccccch--------------------------------hHHHhhhh
Q lcl|NC_018861.
5 YLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQG--------------------------------NEVKMLME 48 (465) Q Consensus 5 ~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~--------------------------------~~~~~i~e 48 (465) -.+.|+|+|||.|+|+|+++.| |||+|+|+|||||. +++++|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhccc Confidence 4466999999999999998865 99999999999993 34578999 Q ss_pred hhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccc Q lcl|NC_018861. 49 STVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFN 128 (465) Q Consensus 49 st~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~ 128 (465) |++|++|++|||+||+||||++|||||+|||||||||||||||||||++|.++.. ...++++||++.+.++.++..+ T Consensus 81 s~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~---~~~~~eAfh~~~g~ea~fsea~ 157 (528) T protein:vir:66 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPL---KSGAREAFHPMYAPDAFHSSLA 157 (528) T ss_pred cccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcc---cccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999977654 3467788888777665444332 Q ss_pred cccccccccccccccccccccccccccccchhhhheeeeeccC-ccccccccccccccccccCCccC-----------CC Q lcl|NC_018861. 129 YTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNS-TGSVAIGDEMDKAATFATKKATV-----------EA 196 (465) Q Consensus 129 ~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~-~g~ea~~~e~~t~~s~~~~~~~~-----------~~ 196 (465) . ......|||||||||+++|.++ .|++++|+|+++.+++....... .. 
T Consensus 158 t--------------------~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~ 217 (528) T protein:vir:66 158 A--------------------KEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESED 217 (528) T ss_pred c--------------------ccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccc Confidence 1 1123468999999999999776 58999999999999875432211 11 Q ss_pred cccccCccccccccccccccchhhhcc---C----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHH Q lcl|NC_018861. 197 VYTNEALWLKVLKNYTGPYATAAGEKL---G----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKE 269 (465) Q Consensus 197 ~~~~~a~~~~~~~~~~~~~~Ta~~E~l---g----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~E 269 (465) ............++++.+|+|+.+|.+ | +.|+||+|+||||+|||||||||||||||||||||||||||||+| T Consensus 218 ~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtE 297 (528) T protein:vir:66 218 EVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAE 297 (528) T ss_pred cccccccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHH Confidence 112223344667899999999999964 2 459999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeecc---------------C-CcccHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018861. 270 LADILSAEVALEIDRTIIEKANEVATVCTDFDVNS---------------A-DGRWFIEKARGLSMRISNEAREIGRQTR 333 (465) Q Consensus 270 L~niLstEImlEINreii~~l~~~at~~~~~~~~~---------------~-~~~~~~e~~~~L~~~i~~~a~~i~~~T~ 333 (465) |+||||+|||+|||||||++|+..+++++++++.. 
+ .+||.+||+|+|++|||+|||+|+|+|+ T Consensus 298 LsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~ 377 (528) T protein:vir:66 298 LNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTG 377 (528) T ss_pred HHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999999999999999999998876532 2 2499999999999999999999999999 Q ss_pred cccccEEEecHHHHHHHHhcCcccccCCccc--ccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeE Q lcl|NC_018861. 334 KGGGNKLIVSPKVATILDEIGSFVLSPAGSK--IDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIF 411 (465) Q Consensus 334 ~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~--~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glf 411 (465) ||+|||||||++||++|+|+|.+++.++++. ...+|+++..|+|+|+|||+||||||+++|||+|||||++++|+||| T Consensus 378 r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glf 457 (528) T protein:vir:66 378 RGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIY 457 (528) T ss_pred cccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCccccccee Confidence 9999999999999999999998888887653 45578888999999999999999999999999999999999999999 Q ss_pred EecccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 412 FAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 412 y~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) |||||||+|++++||+||||+|||||||||++|||+......-.+.+.+|.... 
T Consensus 458 yaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~ 511 (528) T protein:vir:66 458 YAPYVALTPLRATDPQSFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSK 511 (528) T ss_pred ecccccceeeEeeCCccccceeeeeeeeceeecCcccccCccccccccccchhh Confidence 999999999999999999999999999999999999877655566656665555 No 2 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=5.8e-196 Score=1090.92 Aligned_cols=441 Identities=28% Similarity=0.366 Sum_probs=382.9 Q ss_pred cchhhhHHHhhhhhhccccccCh----hhhhheehccccch--------------------------------hHHHhhh Q lcl|NC_018861. 4 KYLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQG--------------------------------NEVKMLM 47 (465) Q Consensus 4 ~~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~--------------------------------~~~~~i~ 47 (465) -.|..|+|+|||.|+|+|++++| |||+|+|+|||||. |++.+|+ T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~ 80 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIA 80 (529) T ss_pred CcccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhccccccccccccc Confidence 34667899999999999999875 99999999999993 2345678 Q ss_pred hhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 48 ESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 48 est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) ||++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++... ...+++||+...++....+. 
T Consensus 81 est~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~---~~~~eaf~~~y~Pda~~sga 157 (529) T protein:vir:10 81 AGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLA---AGAKEAFHPMYAPDAWHSSL 157 (529) T ss_pred cccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccc---cccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999876543 45677888888887666554 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC----------Cc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE----------AV 197 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~----------~~ 197 (465) .+.+. + ....++++++|++.++|+.+.|.|++|+|+++.|++...+.... .. T Consensus 158 ~~~ga---------------~---~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~ 219 (529) T protein:vir:10 158 ATKGA---------------T---TTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDK 219 (529) T ss_pred ccccc---------------c---cccCccccccccccccccccCcceeeeecccceecccccccccccCccccCccccc Confidence 44331 1 12356788999999999999999999999999998754332221 11 Q ss_pred ccccCccccccccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHH Q lcl|NC_018861. 
198 YTNEALWLKVLKNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKEL 270 (465) Q Consensus 198 ~~~~a~~~~~~~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL 270 (465) ....+...+..++++.+|+|+.+|.++ +.|+||+|+||||+|||||||||||||||||||||||||||||+|| T Consensus 220 ~~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtEL 299 (529) T protein:vir:10 220 LINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSEL 299 (529) T ss_pred ccccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHH Confidence 123344567789999999999999883 3599999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhHHHHhhhhheeeeeee------------eeeccC----CcccHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_018861. 271 ADILSAEVALEIDRTIIEKANEVATVCTD------------FDVNSA----DGRWFIEKARGLSMRISNEAREIGRQTRK 334 (465) Q Consensus 271 ~niLstEImlEINreii~~l~~~at~~~~------------~~~~~~----~~~~~~e~~~~L~~~i~~~a~~i~~~T~~ 334 (465) +||||+|||+||||||||+|++++++++. ||+.++ .+||.+||+|+|+++||+|||+|+|+|+| T Consensus 300 sNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~r 379 (529) T protein:vir:10 300 NGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGR 379 (529) T ss_pred HHHHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 99999999999999999999999988764 455432 47999999999999999999999999999 Q ss_pred ccccEEEecHHHHHHHHhcCcccccCCccccc--ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEE Q lcl|NC_018861. 335 GGGNKLIVSPKVATILDEIGSFVLSPAGSKID--AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFF 412 (465) Q Consensus 335 ~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~--~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy 412 (465) |+|||||||++||++|+|+|++++++++.... 
..++++..|+|+|+|||+||||||+++|||+|||||++++|+|||| T Consensus 380 g~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy 459 (529) T protein:vir:10 380 GAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYY 459 (529) T ss_pred ccceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceee Confidence 99999999999999999999999988654332 3578888999999999999999999999999999999999999999 Q ss_pred ecccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 413 APYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 413 ~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) ||||||+|+|++||+||||+|||||||||++|||+......-.+-..+|.... T Consensus 460 ~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~ 512 (529) T protein:vir:10 460 CPYVALTPLRGSDPKNFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGV 512 (529) T ss_pred ccccccccccccCCCcccceeeeeeeeceeecCccccccccccccccCCcchh Confidence 99999999999999999999999999999999999776666666555555544 No 3 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=1.5e-192 Score=1072.23 Aligned_cols=441 Identities=28% Similarity=0.360 Sum_probs=372.8 Q ss_pred cchhhhHHHhhhhhhccccccCh----hhhhheehccccchhH--------------------------------HHhhh Q lcl|NC_018861. 
4 KYLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGNE--------------------------------VKMLM 47 (465) Q Consensus 4 ~~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~~--------------------------------~~~i~ 47 (465) -.|..|+|+|||.|+|+|++++| |||+|+|+|||||.++ +.+|+ T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~ 80 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIA 80 (529) T ss_pred CccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhcccccccccccccc Confidence 24567899999999999999865 9999999999999322 34567 Q ss_pred hhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 48 ESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 48 est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) ||++|++|++|||+||+||||++|||||+||||||||||||||||||||||.++..+ ..++++|++...++...... T Consensus 81 ~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~---~~~~eaf~~~~~pda~~sga 157 (529) T protein:vir:10 81 AGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLA---AGAKEAFHPMYAPDAWHSSL 157 (529) T ss_pred cccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccc---cccccccccccccccccccc Confidence 889999999999999999999999999999999999999999999999999876543 45677888888777666666 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC----------Cc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE----------AV 197 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~----------~~ 197 (465) ++.+.... . .++++..+.....++.+.+.+++|+|+++.|++...+.... .. 
T Consensus 158 ~~~ga~t~---------------~---~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~ 219 (529) T protein:vir:10 158 ATKGATTT---------------T---DGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDK 219 (529) T ss_pred cccccccc---------------c---cccccccccccccccccccceeeecccCceeeccccccccccCccccCccccc Confidence 55443211 1 12233344556677888899999999999998654333221 11 Q ss_pred ccccCccccccccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHH Q lcl|NC_018861. 198 YTNEALWLKVLKNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKEL 270 (465) Q Consensus 198 ~~~~a~~~~~~~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL 270 (465) ....+...+..++++.+|+|+.+|.++ +.|+||+|+||||+|||||||||||||||||||||||||||||+|| T Consensus 220 ~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtEL 299 (529) T protein:vir:10 220 LINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSEL 299 (529) T ss_pred ccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHH Confidence 223344567789999999999999883 4599999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhHHHHhhhhheeeeeee------------eeeccC----CcccHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_018861. 271 ADILSAEVALEIDRTIIEKANEVATVCTD------------FDVNSA----DGRWFIEKARGLSMRISNEAREIGRQTRK 334 (465) Q Consensus 271 ~niLstEImlEINreii~~l~~~at~~~~------------~~~~~~----~~~~~~e~~~~L~~~i~~~a~~i~~~T~~ 334 (465) +||||+|||+||||||||+|++++++++. 
||+.++ .+||.+||+|+|+++||+|||+|+|+|+| T Consensus 300 sNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~r 379 (529) T protein:vir:10 300 NGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGR 379 (529) T ss_pred HHHHHHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 99999999999999999999999988764 455432 47999999999999999999999999999 Q ss_pred ccccEEEecHHHHHHHHhcCcccccCCccc--ccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEE Q lcl|NC_018861. 335 GGGNKLIVSPKVATILDEIGSFVLSPAGSK--IDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFF 412 (465) Q Consensus 335 ~~~~~~~~s~~va~~L~~~~~~~~~~~~~~--~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy 412 (465) |+|||||||++||++|+|+|++++++.+.. ....++++..|+|+|+|||+||||||+++|||+|||||++++|+|||| T Consensus 380 g~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy 459 (529) T protein:vir:10 380 GAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYY 459 (529) T ss_pred ccceEEEEchHHHHHHHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceee Confidence 999999999999999999998766554322 223678888999999999999999999999999999999999999999 Q ss_pred ecccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 413 APYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 413 ~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) ||||||+|+|++||+||||+|||||||||++|||+......-.+-..+|.... 
T Consensus 460 ~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~ 512 (529) T protein:vir:10 460 CPYVALTPLRGFDPKNFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGV 512 (529) T ss_pred ccccccccccccCCCcccceeeeeeeeceeecCccccccccccccccCCcchh Confidence 99999999999999999999999999999999999776666666555555544 No 4 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=6.6e-192 Score=1068.70 Aligned_cols=445 Identities=26% Similarity=0.359 Sum_probs=351.0 Q ss_pred hhhhHHHhhhhhhccccccCh----hhhhheehccccchhH--------------------------------------- Q lcl|NC_018861. 6 LLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGNE--------------------------------------- 42 (465) Q Consensus 6 ~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~~--------------------------------------- 42 (465) |+.|+|+|||.|+|++++++| |||+|+|+|||||.+| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~ 80 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDH 80 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccccc Confidence 899999999999999999875 9999999999999544 Q ss_pred ---HHhhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccc Q lcl|NC_018861. 43 ---VKMLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKT 119 (465) Q Consensus 43 ---~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~ 119 (465) +.+|+||++|++|++|||+||+||||++|||||+||||||||||||||||||||||.++.++. .+.++|++... 
T Consensus 81 g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~---s~~EAf~ne~~ 157 (534) T protein:vir:10 81 GYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDA---NAREAFHPTYG 157 (534) T ss_pred ccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCc---ccccccccccc Confidence 345789999999999999999999999999999999999999999999999999998876542 34455544433 Q ss_pred ccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCccc Q lcl|NC_018861. 120 ESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYT 199 (465) Q Consensus 120 a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~ 199 (465) +++.|||..................+...... +. .......|+.....-.........+.....+... T Consensus 158 -----adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~----~~---~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~ 225 (534) T protein:vir:10 158 -----PDADFSGRGAAQDIAVFVRGTAVASGAFAKLH----IE---AATGVQAGTKTVQFIKDYAVDALPADQTEAGLAY 225 (534) T ss_pred -----cccccccccccccccccccccccccccccccc----cc---ccccccccccccccccccccccccCCcccccccc Confidence 34444443222111100000000000000000 00 0000001110000000000000001111112222 Q ss_pred ccCccccccccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHH Q lcl|NC_018861. 
200 NEALWLKVLKNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELAD 272 (465) Q Consensus 200 ~~a~~~~~~~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~n 272 (465) .........++++.+|+|+.+|.++ +.|+||+|+||||+|+|||||||||||||||||||||||||||+||+| T Consensus 226 ~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsN 305 (534) T protein:vir:10 226 KWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSS 305 (534) T ss_pred ccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHH Confidence 3344556788999999999999874 459999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhHHHHhhhhheeeeeeeee------------ecc----CCcccHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_018861. 273 ILSAEVALEIDRTIIEKANEVATVCTDFD------------VNS----ADGRWFIEKARGLSMRISNEAREIGRQTRKGG 336 (465) Q Consensus 273 iLstEImlEINreii~~l~~~at~~~~~~------------~~~----~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~ 336 (465) |||+|||+||||||||+|++++++++..+ +.. ..+||.+||+|+|+++||+|||+|+|+|+||+ T Consensus 306 ILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~ 385 (534) T protein:vir:10 306 ILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQ 385 (534) T ss_pred HHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 99999999999999999999999987653 322 23899999999999999999999999999999 Q ss_pred ccEEEecHHHHHHHHhcCcccccCCcc--cccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEec Q lcl|NC_018861. 
337 GNKLIVSPKVATILDEIGSFVLSPAGS--KIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAP 414 (465) Q Consensus 337 ~~~~~~s~~va~~L~~~~~~~~~~~~~--~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~P 414 (465) |||||||++||++|+|+||++++|+.+ ..+.+++++..++|+|+|||+||||||+++|||+|||||++++|+|||||| T Consensus 386 ~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaP 465 (534) T protein:vir:10 386 GNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCP 465 (534) T ss_pred ccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecc Confidence 999999999999999999999999654 345577888899999999999999999999999999999999999999999 Q ss_pred ccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccce-eC Q lcl|NC_018861. 415 YNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYI-IS 465 (465) Q Consensus 415 Y~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~-~~ 465 (465) ||||++++.+||+||||+|||||||||++|||++..+.+.|+++.+|+- .+ T Consensus 466 Yv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~ 517 (534) T protein:vir:10 466 YVALTPLRGTDPKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHT 517 (534) T ss_pred ccccccccccCCccccceeeeeeeeceeecCcccccCCccccccccCCcchh Confidence 9999999999999999999999999999999999999999999998753 11 No 5 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=1.7e-190 Score=1061.02 Aligned_cols=438 Identities=26% Similarity=0.367 Sum_probs=352.2 Q ss_pred chhhhHHHhhhhhhccccccCh----hhhhheehccccch--------------------------------hHHHhhhh Q lcl|NC_018861. 5 YLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQG--------------------------------NEVKMLME 48 (465) Q Consensus 5 ~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~--------------------------------~~~~~i~e 48 (465) -.+.|+|+|||.|+|+|+++.| |||+|+|+|||||. 
+++.+|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCccccccc Confidence 4466999999999999998865 99999999999993 34578899 Q ss_pred hhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccc Q lcl|NC_018861. 49 STVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFN 128 (465) Q Consensus 49 st~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~ 128 (465) |++|++|++|||+||+||||++|||||+||||||||||||||||||||||.+++.. .+++++|+.+.+.+..++..+ T Consensus 81 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~---~~~~ea~~~~~~~da~fS~~~ 157 (528) T protein:vir:80 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLA---SQAKEAFHPMYAPDAFHSSLA 157 (528) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccc---ccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999887543 346667766665554443332 Q ss_pred cccccccccccccccccccccccccccccchhhhheeee-eccCccccccccccccccc-----------cccCCccCCC Q lcl|NC_018861. 129 YTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRL-ESNSTGSVAIGDEMDKAAT-----------FATKKATVEA 196 (465) Q Consensus 129 ~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~-y~~~~g~ea~~~e~~t~~s-----------~~~~~~~~~~ 196 (465) ....... ++++..|+..+. +..+.|+.+.+....+.+. ....+..... T Consensus 158 t~~~a~~--------------------~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~ 217 (528) T protein:vir:80 158 AKGAAVG--------------------SPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESED 217 (528) T ss_pred ccccccc--------------------cccccccccccccccccccceeccccccccccccccccccccCccccCCcccc Confidence 2111000 011111111111 1111111111111111110 0111111222 Q ss_pred cccccCccccccccccccccchhhhcc-------CCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHH Q lcl|NC_018861. 
197 VYTNEALWLKVLKNYTGPYATAAGEKL-------GKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKE 269 (465) Q Consensus 197 ~~~~~a~~~~~~~~~~~~~~Ta~~E~l-------g~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~E 269 (465) .........+..++++.+|+|+.+|.+ ++.|+||+|+||||+|||||||||||||||||||||||||||||+| T Consensus 218 ~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtE 297 (528) T protein:vir:80 218 EVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAE 297 (528) T ss_pred cccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHH Confidence 222333455677899999999999954 2459999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeecc------------C----CcccHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018861. 270 LADILSAEVALEIDRTIIEKANEVATVCTDFDVNS------------A----DGRWFIEKARGLSMRISNEAREIGRQTR 333 (465) Q Consensus 270 L~niLstEImlEINreii~~l~~~at~~~~~~~~~------------~----~~~~~~e~~~~L~~~i~~~a~~i~~~T~ 333 (465) |+||||+|||+|||||||++|+..+++++++++.. + .+||.+||+|+|++|||+|||+|+|+|+ T Consensus 298 LaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~ 377 (528) T protein:vir:80 298 LNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTG 377 (528) T ss_pred HHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999999999999999999998876521 1 2599999999999999999999999999 Q ss_pred cccccEEEecHHHHHHHHhcCcccccCCcc--cccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeE Q lcl|NC_018861. 
334 KGGGNKLIVSPKVATILDEIGSFVLSPAGS--KIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIF 411 (465) Q Consensus 334 ~~~~~~~~~s~~va~~L~~~~~~~~~~~~~--~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glf 411 (465) ||+|||||||++||++|+|+|.++++++++ ....+|+++..|+|+|+|||+||||||+++|||+|||||++++|+||| T Consensus 378 ~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glf 457 (528) T protein:vir:80 378 RGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIY 457 (528) T ss_pred cccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccccccee Confidence 999999999999999999999888888765 445578888999999999999999999999999999999999999999 Q ss_pred EecccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 412 FAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 412 y~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) |||||||+|++++||+||||+|||||||||++|||+......-.+...+|.... T Consensus 458 y~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~ 511 (528) T protein:vir:80 458 YAPYVALTPLRATDPQSFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSK 511 (528) T ss_pred ecccccceeeEeeCCccccceeeeeeeeceeecCcccccCCcccccccccchhh Confidence 999999999999999999999999999999999999777666666666666555 No 6 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=6.3e-190 Score=1057.86 Aligned_cols=441 Identities=27% Similarity=0.357 Sum_probs=365.3 Q ss_pred CCccchhhhHHHhhhhhhccccccCh----hhhhheehccccchhH--------------------------------HH Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGNE--------------------------------VK 44 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~~--------------------------------~~ 44 (465) |+ +..|+|+|||.|+|+++++.| |||+|+|+|||||.++ +. 
T Consensus 1 ~~---~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~ 77 (529) T protein:vir:10 1 MS---LKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPT 77 (529) T ss_pred Cc---cchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccc Confidence 54 456899999999999998854 9999999999999433 34 Q ss_pred hhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 45 MLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 45 ~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) +|+||++|++|++|||+||+||||++|||||+||||||||||||||||||||||.++... ..+.++|+.+.+++... T Consensus 78 ~ia~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~---~~g~eaf~~~~e~dt~~ 154 (529) T protein:vir:10 78 NIAAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLA---AGAKEAFHPMYAPDAWH 154 (529) T ss_pred cccccccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCC---Ccccccccccccccccc Confidence 578999999999999999999999999999999999999999999999999999877544 34566677766666555 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC--------- Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE--------- 195 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~--------- 195 (465) ++.++++......... .... ..........+.+.+|+|+++.++...+..... T Consensus 155 SG~~~~~~~~~~~~~~----------~~~~--------t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~ 216 (529) T protein:vir:10 155 SGLAAKGATTSSDGTP----------FAAL--------TAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAA 216 (529) T ss_pred cccccccccccccccc----------cccc--------cccceeeccccceeeecccccccccccccccccccccccCCc Confidence 5544433221111100 0001 112234455678999999999888654432211 Q ss_pred -CcccccCccccccccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHH Q lcl|NC_018861. 
196 -AVYTNEALWLKVLKNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAE 267 (465) Q Consensus 196 -~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe 267 (465) ......+...+..++++.+|+|+.+|.++ +.|+||+|+||||+||||||||||||||||||||||||||||| T Consensus 217 ~~~~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAE 296 (529) T protein:vir:10 217 LDALVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDAD 296 (529) T ss_pred cccccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChH Confidence 11222344556789999999999999873 4699999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhHHHHhhhhheeeeeeee------------eeccC----CcccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 268 KELADILSAEVALEIDRTIIEKANEVATVCTDF------------DVNSA----DGRWFIEKARGLSMRISNEAREIGRQ 331 (465) Q Consensus 268 ~EL~niLstEImlEINreii~~l~~~at~~~~~------------~~~~~----~~~~~~e~~~~L~~~i~~~a~~i~~~ 331 (465) +||+||||+|||+||||||||+|+.++++++.+ |+.++ .+||.+||+|+|+++||+|||+|+|+ T Consensus 297 tELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~ 376 (529) T protein:vir:10 297 SELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQ 376 (529) T ss_pred HHHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999887754 44333 35999999999999999999999999 Q ss_pred cccccccEEEecHHHHHHHHhcCcccccCCccccc--ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccce Q lcl|NC_018861. 332 TRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKID--AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAG 409 (465) Q Consensus 332 T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~--~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~g 409 (465) |+||+|||||||++||++|+|.+.++++++++... 
..|++...|+|+|+|||+||||||+++|||+|||||++++|+| T Consensus 377 T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~g 456 (529) T protein:vir:10 377 TGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAG 456 (529) T ss_pred hccccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccc Confidence 99999999999999999999999999998764333 3677888899999999999999999999999999999999999 Q ss_pred eEEecccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 410 IFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 410 lfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) |||||||||+|+|++||+||||+|||||||||++|||+......-++-+.+|.-.+ T Consensus 457 lfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~ 512 (529) T protein:vir:10 457 IYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIGVNPFAESRTQAPTSRISNGMPGA 512 (529) T ss_pred eeeccccccccccccCCCcccceeeeeeeeceeecCccccccccccccccCCcchh Confidence 99999999999999999999999999999999999999765554444444444433 No 7 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=1.6e-189 Score=1055.61 Aligned_cols=439 Identities=28% Similarity=0.402 Sum_probs=354.5 Q ss_pred CCccchhhhHHHhhhhhhccccccCh---hhhhheehccccch--------------------------------hHHHh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNE---SEKNIMRTVLENQG--------------------------------NEVKM 45 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~---~~~~~~~~l~~n~~--------------------------------~~~~~ 45 (465) |+++ ...|+|+|||.|+|+++++++ +||+|+|+|||||. 
+++.+ T Consensus 1 ~~~~-~~~e~l~~kw~p~l~~~~~~~~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~ 79 (522) T protein:vir:69 1 MTTI-KTKAQLVDKWKELLEGEGLPEIANSKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQN 79 (522) T ss_pred CCcc-chHHHHHHhhHHHhcCCCCCccccchhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCccc Confidence 8765 567999999999999988865 99999999999994 34578 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKD 125 (465) Q Consensus 46 i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea 125 (465) |+||++|++|++|||+||+||||++|||||+||||||||||||||||||||||.++... ..++++|+.+. ++ T Consensus 80 i~es~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~---~~~~eaf~~~n-----ea 151 (522) T protein:vir:69 80 IAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIA---AGAKEAFHPMY-----AP 151 (522) T ss_pred ccccccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccc---Ccccccccccc-----cc Confidence 99999999999999999999999999999999999999999999999999999876543 34556666544 44 Q ss_pred cccccccccccccc-----ccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccc Q lcl|NC_018861. 126 DFNYTGTPIEVSFK-----TATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTN 200 (465) Q Consensus 126 ~~~~Sg~~~~~s~~-----tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~ 200 (465) ++.|||......+. +.+..+....+.+. ..++..+...+.+................. 
T Consensus 152 dt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~-----------------~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~ 214 (522) T protein:vir:69 152 DAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQ-----------------ETGTVYLQASAQVTISSSADDAAKLDAEII 214 (522) T ss_pred ccccccccccccccccccccccccccccccccc-----------------cccceeeecccCCcCCCCCcccccccchhc Confidence 45555543322221 11111111111111 111111111111111111111111222233 Q ss_pred cCccccccccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHH Q lcl|NC_018861. 201 EALWLKVLKNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADI 273 (465) Q Consensus 201 ~a~~~~~~~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~ni 273 (465) .+...+..++++.+|+|+.+|.++ +.|+||+|+||||+|+|||||||||||||||||||||||||||+||+|| T Consensus 215 s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNI 294 (522) T protein:vir:69 215 KQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGI 294 (522) T ss_pred cccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHH Confidence 455667889999999999999753 3699999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhHHHHhhhhheeeeeee------------eeecc----CCcccHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_018861. 274 LSAEVALEIDRTIIEKANEVATVCTD------------FDVNS----ADGRWFIEKARGLSMRISNEAREIGRQTRKGGG 337 (465) Q Consensus 274 LstEImlEINreii~~l~~~at~~~~------------~~~~~----~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~ 337 (465) ||+|||+|||||||++|+..+++++. ||+.. 
..+||.+||+|+|++|||+|||+|+|+|+||+| T Consensus 295 LSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~ 374 (522) T protein:vir:69 295 LATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEG 374 (522) T ss_pred HHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 99999999999999999988888665 45532 248999999999999999999999999999999 Q ss_pred cEEEecHHHHHHHHhcCcccccCCcccc--cccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc Q lcl|NC_018861. 338 NKLIVSPKVATILDEIGSFVLSPAGSKI--DAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY 415 (465) Q Consensus 338 ~~~~~s~~va~~L~~~~~~~~~~~~~~~--~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY 415 (465) ||||||++||++|+|+|.+++.|+++.. .-.|+++..|+|+|+|||+||||||+++|||+|||||++++|+||||||| T Consensus 375 n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPY 454 (522) T protein:vir:69 375 NFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPY 454 (522) T ss_pred cEEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccc Confidence 9999999999999999999999987733 33567778899999999999999999999999999999999999999999 Q ss_pred cccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccce--eC Q lcl|NC_018861. 416 NITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYI--IS 465 (465) Q Consensus 416 ~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~--~~ 465 (465) |||+|+|++||+||||+|||||||||++|||+......-.+.+.+|.- .. 
T Consensus 455 v~l~~~~~~dp~sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~p~~~~ 506 (522) T protein:vir:69 455 VALTPLRGSDPKNFQPVMGFKTRYGIGVNPFAESSLQAPGARIQSGMPSILN 506 (522) T ss_pred cccccccccCCccccceeeeeeeeceeecCcccccCCcccceeecccchhhc Confidence 999999999999999999999999999999998776666666666662 11 No 8 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=2.5e-188 Score=1049.06 Aligned_cols=440 Identities=26% Similarity=0.359 Sum_probs=354.5 Q ss_pred CCccchhhhHHHhhhhhhccc-cccCh----hhhhheehccccchhHH-------------------------------- Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLY-PNLNE----SEKNIMRTVLENQGNEV-------------------------------- 43 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~-~~~~~----~~~~~~~~l~~n~~~~~-------------------------------- 43 (465) |+.| |+|+|||.|+|++ +++.| +||+|+|+|||||.+|. T Consensus 1 ~~~~----~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~ 76 (524) T protein:vir:98 1 MSKK----NELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQ 76 (524) T ss_pred Ccch----HHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccc Confidence 8887 7999999999976 66665 99999999999996542 Q ss_pred HhhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCc-ccccccccccCccccccc Q lcl|NC_018861. 44 KMLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNN-SVSPTKNAIVLKLKTESA 122 (465) Q Consensus 44 ~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~-~~~~~~~aaf~~~~~a~~ 122 (465) .+|+||++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++... ......+++|+++.+++. 
T Consensus 77 ~~i~~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt 156 (524) T protein:vir:98 77 TNIASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDT 156 (524) T ss_pred ccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCccccccccccccccccccc Confidence 2357889999999999999999999999999999999999999999999999999887543 223345677777766665 Q ss_pred cccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccc-cCCccC---CCcc Q lcl|NC_018861. 123 NKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFA-TKKATV---EAVY 198 (465) Q Consensus 123 ~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~-~~~~~~---~~~~ 198 (465) .+++.+................++...+.+...|.. .+.+ ..++. ....+. .... T Consensus 157 ~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~-----------------~~~~----~~~g~~~~tgt~p~~~~~a 215 (524) T protein:vir:98 157 MYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIA-----------------YFQN----VTSGNVTVTGADPAALDAA 215 (524) T ss_pred ccCCccccccccccccccccccccccccccccccce-----------------eccc----cccCccccccccccccccc Confidence 555444333333333322222222222222211110 0000 00000 000011 1111 Q ss_pred cccCccccccccccccccchhhhcc-------CCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHH Q lcl|NC_018861. 
199 TNEALWLKVLKNYTGPYATAAGEKL-------GKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELA 271 (465) Q Consensus 199 ~~~a~~~~~~~~~~~~~~Ta~~E~l-------g~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~ 271 (465) ...+...+..++++.+|+|+.+|.+ ++.|+||+|+||||+|||||||||||||||||||||||||||||+||+ T Consensus 216 ~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELs 295 (524) T protein:vir:98 216 VIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELS 295 (524) T ss_pred ccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHH Confidence 2234445677899999999999987 356999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhHHHHhhhhheeeeeeee------------eecc----CCcccHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_018861. 272 DILSAEVALEIDRTIIEKANEVATVCTDF------------DVNS----ADGRWFIEKARGLSMRISNEAREIGRQTRKG 335 (465) Q Consensus 272 niLstEImlEINreii~~l~~~at~~~~~------------~~~~----~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~ 335 (465) ||||+|||+|||||||++|+..+++++++ |+.+ .++||.+||+|+|+++||+|||+|+|+|+|| T Consensus 296 NILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg 375 (524) T protein:vir:98 296 AILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRG 375 (524) T ss_pred HHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccc Confidence 99999999999999999999888887664 3322 4589999999999999999999999999999 Q ss_pred cccEEEecHHHHHHHHh--cCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEe Q lcl|NC_018861. 
336 GGNKLIVSPKVATILDE--IGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFA 413 (465) Q Consensus 336 ~~~~~~~s~~va~~L~~--~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~ 413 (465) +|||||||++||++|+| .|++++++........|+++..|+|+|+|||+||||||+++|||+|||||++++|+||||| T Consensus 376 ~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfya 455 (524) T protein:vir:98 376 AGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYA 455 (524) T ss_pred cccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeec Confidence 99999999999999999 7888888877777778889999999999999999999999999999999999999999999 Q ss_pred cccccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 414 PYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 414 PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) |||||+|+|++||+||||+|||||||||++|||+......-..-+.+|.... T Consensus 456 PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~ri~~g~~~~ 507 (524) T protein:vir:98 456 PYVALTPLRGSDPKNFQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISK 507 (524) T ss_pred cccccccccccCCccccceeeeeeeeceeecCcccccCCccccccccCcchH Confidence 9999999999999999999999999999999999776665555555555543 No 9 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=1.3e-187 Score=1045.20 Aligned_cols=448 Identities=21% Similarity=0.236 Sum_probs=343.1 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchh-HHHhhhhhhhccccccccchhhhhhhhhhhhhhhhhhe Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGN-EVKMLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIA 79 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~-~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIw 79 (465) |++|. 
+.|+|+|||.|+|+.++ +||||+|||+|||||.+ +..+|.|+..+++|++|.| ||+||||++|||||+||| T Consensus 1 ~~~~~-~~e~l~~kw~p~l~~~~-~~~~~~~~a~llenq~~~~~~~l~e~~~~~~~~~~~~-~~~~v~r~~p~l~a~DIW 77 (523) T protein:vir:59 1 MSQPK-INEQLIEKWQPLLEGCR-NDWERHTLATLLENQYREAKKHLMETTQTTEVDGWNL-ALPIVRRVFANLRATDLV 77 (523) T ss_pred CCcch-hhHHHHHhhhhhhcccC-ChhHHHHHHHHhhhhhHHHHHhhhhhhhccccccccc-hhhhhhhHhhhhhhhhcc Confidence 99999 88999999999999888 88999999999999964 5789999999999999997 999999999999999999 Q ss_pred eeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc-----------------cccc Q lcl|NC_018861. 80 GVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF-----------------KTAT 142 (465) Q Consensus 80 GVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~-----------------~tat 142 (465) |||||||||||||||||||.++.+++...+..+..+...+..+++++..+++....... .++. T Consensus 78 GVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~ 157 (523) T protein:vir:59 78 SVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDENARLSRREYETTITVDLATAQQATMRDVGFDTGI 157 (523) T ss_pred ccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccccccccccccccccCccCCCcccccccccccccccccc Confidence 99999999999999999999987765433333333222333333333332221111000 0000 Q ss_pred c---cccc------ccccccccccchhhhheeeeeccCc------------------cccccccccccccccccCCccCC Q lcl|NC_018861. 143 T---VKGK------IVYSEKQAGTDNIVNVLLRLESNST------------------GSVAIGDEMDKAATFATKKATVE 195 (465) Q Consensus 143 t---~gga------it~~~~~TGPTgLifam~s~y~~~~------------------g~ea~~~e~~t~~s~~~~~~~~~ 195 (465) . .+.. .......++++.++|+++.++..+. .++++++|+....+......... 
T Consensus 158 A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~G 237 (523) T protein:vir:59 158 ASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGG 237 (523) T ss_pred hhhccccceeeeeccccccccccccccccccccccccccccccccchhhccccccccccccccccccccccccccccCCC Confidence 0 0000 0111112244444444444443332 23555555555444332221111 Q ss_pred CcccccCccccccccccccccchhhhccC---------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh-CCC Q lcl|NC_018861. 196 AVYTNEALWLKVLKNYTGPYATAAGEKLG---------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH-GIN 265 (465) Q Consensus 196 ~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg---------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GlD 265 (465) ..........++.+.+|+|+.+|.++ +.|+||+|+||||+|||||||||||||||||||||||| ||| T Consensus 238 ---tt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLD 314 (523) T protein:vir:59 238 ---TPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVD 314 (523) T ss_pred ---cccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCC Confidence 11112234457778888888888654 46999999999999999999999999999999999999 999 Q ss_pred HHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeee--------eec-cCCcccH--------HHHHHHHHHHHHHHHHHH Q lcl|NC_018861. 266 AEKELADILSAEVALEIDRTIIEKANEVATVCTDF--------DVN-SADGRWF--------IEKARGLSMRISNEAREI 328 (465) Q Consensus 266 Ae~EL~niLstEImlEINreii~~l~~~at~~~~~--------~~~-~~~~~~~--------~e~~~~L~~~i~~~a~~i 328 (465) ||+||+||||+||||||||||||+|+++++++|.+ |+. ..+++|. 
+||+|.|+++||+|+|+| T Consensus 315 AE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i 394 (523) T protein:vir:59 315 LENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRI 394 (523) T ss_pred hhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999988764 333 2344553 799999999999999999 Q ss_pred HHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecC-CCcc Q lcl|NC_018861. 329 GRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGA-SNFD 407 (465) Q Consensus 329 ~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~-~~~d 407 (465) +|+|+||+|||||||++||++|+++|||+..+.. .+++++.+++|+|+|||+||||||+++|||+|||||+ +++| T Consensus 395 ~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~----~~~~~~~~~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~ 470 (523) T protein:vir:59 395 QQKTAVAGANFLVTSPQVAALLESMPGFTPGNDN----RDGGTGIFYVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQ 470 (523) T ss_pred HHhcccccccEEEEchhHHHHHHhccccccCCcc----ccccccceeEEEecCceEEEecCCCCcceEEEEecccCCccc Confidence 9999999999999999999999999999765443 3556779999999999999999999999999999995 5999 Q ss_pred ceeEEecccccceeeee-CCCcccceeeeeeeeeeee-cCcccccccceEEEeecc Q lcl|NC_018861. 408 AGIFFAPYNITLQQNLT-DPVSGQPAMILNNRYDVVA-TPLHPEAFIRTFAVNLNN 461 (465) Q Consensus 408 ~glfy~PY~~~~~~~~~-dp~s~qp~~~~~tRY~l~~-nPf~~~~~~~~f~~~~~~ 461 (465) +|||||||||+.+++++ ||+||||+|||||||||++ |||+..+-+-. 
-|.- T Consensus 471 ~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~---~~~~ 523 (523) T protein:vir:59 471 TGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVK---LLQP 523 (523) T ss_pred ccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhh---hcCC Confidence 99999999999888865 9999999999999999975 99985543211 1111 No 10 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=2.6e-187 Score=1043.53 Aligned_cols=441 Identities=26% Similarity=0.363 Sum_probs=351.6 Q ss_pred CCccchhhhHHHhhhhhhccccccCh---hhhhheehccccch--------------------------------hHHHh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNE---SEKNIMRTVLENQG--------------------------------NEVKM 45 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~---~~~~~~~~l~~n~~--------------------------------~~~~~ 45 (465) |+-| +.|+|+|||+|+|+|+++.+ +||+|+|+|||||. +++.+ T Consensus 1 ~~~~--~~~~l~~kw~p~l~~~~~~~i~~~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ 78 (521) T protein:vir:10 1 MTIK--TKAELLNKWKPLLEGEGLPEIANSKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATN 78 (521) T ss_pred CCcc--hhHHHHHhhhhhhccCCCCccccchhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCcccccccc Confidence 7655 46999999999999988865 99999999999993 12345 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKD 125 (465) Q Consensus 46 i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea 125 (465) |+||++|++|++|||+||+||||++|||||+||||||||||||||||||||||.++... 
..++++|+...+++..++ T Consensus 79 i~es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~---~~g~eaf~~~~~ada~fS 155 (521) T protein:vir:10 79 IAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIA---AGAKEAFHPMYGPDAMFS 155 (521) T ss_pred ccccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccc---cccccccchhcccccccc Confidence 67889999999999999999999999999999999999999999999999999887543 235666766555544333 Q ss_pred ccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCc--cCCCcccccCc Q lcl|NC_018861. 126 DFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKA--TVEAVYTNEAL 203 (465) Q Consensus 126 ~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~--~~~~~~~~~a~ 203 (465) +.+...........+....+....+.+..+ + ..+..+....+..++.. ...+.....+. T Consensus 156 G~~~at~~s~~~~~~~~~~Gd~~~~~~~~~-----------------g--~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~ 216 (521) T protein:vir:10 156 GQGAAKKFAALAASTQTTVGDIYTHFFQDT-----------------G--TVYLQASAQVTISSTADDAAKLDAEIKKQM 216 (521) T ss_pred cccccccccccccccccccccccccccccc-----------------c--cceecccccccCCCcccccccccccccccc Confidence 332222111111222211221111111111 1 12222222222222211 12223344556 Q ss_pred cccccccccccccchhhhcc-------CCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHH Q lcl|NC_018861. 
204 WLKVLKNYTGPYATAAGEKL-------GKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSA 276 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~l-------g~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLst 276 (465) ..+..++++.+|+|+.+|.+ ++.|+||+|+||||+|+|||||||||||||||||||||||||||+||+||||+ T Consensus 217 ~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILST 296 (521) T protein:vir:10 217 EAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILAT 296 (521) T ss_pred cccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHH Confidence 67788999999999999977 34699999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhHHHHhhhhheeeeeeeeee------------ccC----CcccHHHHHHHHHHHHHHHHHHHHHhcccccccEE Q lcl|NC_018861. 277 EVALEIDRTIIEKANEVATVCTDFDV------------NSA----DGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKL 340 (465) Q Consensus 277 EImlEINreii~~l~~~at~~~~~~~------------~~~----~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~ 340 (465) |||+|||||||++|+..+++++.+++ .++ .+||.+||+|+|++|||+|||+|+|+|+||+|||| T Consensus 297 EImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~ 376 (521) T protein:vir:10 297 EIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFI 376 (521) T ss_pred HHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEE Confidence 99999999999999999999887554 222 26999999999999999999999999999999999 Q ss_pred EecHHHHHHHHhcCcccccCCcccc--cccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc Q lcl|NC_018861. 341 IVSPKVATILDEIGSFVLSPAGSKI--DAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT 418 (465) Q Consensus 341 ~~s~~va~~L~~~~~~~~~~~~~~~--~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~ 418 (465) |||++||++|+|+|.+++.|+++.. 
.-.|+++..|+|+|+|||+||||||+++|||+|||||++++|+|||||||||| T Consensus 377 i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l 456 (521) T protein:vir:10 377 IASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVAL 456 (521) T ss_pred EEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccc Confidence 9999999999999999999988633 33577888999999999999999999999999999999999999999999999 Q ss_pred ceeeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccce--eC Q lcl|NC_018861. 419 LQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYI--IS 465 (465) Q Consensus 419 ~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~--~~ 465 (465) +|+|++||+||||+|||||||||++|||+.........++-++.. .+ T Consensus 457 ~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~~~~~~~a 505 (521) T protein:vir:10 457 TPLRGSDPKNFQPVMGFKTRYGIGINPFAESAAQAPASRIQSGMPSILN 505 (521) T ss_pred ccccccCCccccceeeeeeeeceeecCcccccCCccceeecccchhhhc Confidence 999999999999999999999999999998776555543333321 11 No 11 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=3.7e-187 Score=1042.66 Aligned_cols=430 Identities=27% Similarity=0.400 Sum_probs=340.9 Q ss_pred hHHHhhhhhhcccccc--Ch----hhhhheehccccchhH--------------------------------HHhhhhhh Q lcl|NC_018861. 9 ESTKEKFITSNLYPNL--NE----SEKNIMRTVLENQGNE--------------------------------VKMLMEST 50 (465) Q Consensus 9 e~~~e~~~~~~~~~~~--~~----~~~~~~~~l~~n~~~~--------------------------------~~~i~est 50 (465) -+|+|||.|+|++|+. 
.| +||+|+|+|||||.+| +.+|+||+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 6899999999999973 23 8999999999999533 34578999 Q ss_pred hccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccc Q lcl|NC_018861. 51 VTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYT 130 (465) Q Consensus 51 ~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~S 130 (465) +|++|++|||+||+||||++|||||+|||||||||||||||||||+||.+++.. +.++|+++.+ +++.|| T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~t-----g~EAf~~~nE-----adt~fS 150 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT-----GAEAFHPTRQ-----ADASFS 150 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcc-----cccccccccc-----cCcCcc Confidence 999999999999999999999999999999999999999999999999876543 3466665444 445555 Q ss_pred cccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccc Q lcl|NC_018861. 131 GTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKN 210 (465) Q Consensus 131 g~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~ 210 (465) |......+......+ ........ 
....+...|+.+.+.................++....+...+..++ T Consensus 151 G~~~~~~~~~~~~~~-~~~~G~~~----------~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~ 219 (514) T protein:vir:56 151 GQAAASTIADFPTTG-AATDGTPY----------KAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVE 219 (514) T ss_pred ccccccccccccccc-cccccccc----------cccccccccccccccccccccccccccccccccccccccccchhhh Confidence 443322211111000 00000000 0000111111111111111111111122222233344556678899 Q ss_pred ccccccchhhhcc-------CCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_018861. 211 YTGPYATAAGEKL-------GKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEID 283 (465) Q Consensus 211 ~~~~~~Ta~~E~l-------g~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEIN 283 (465) ++.+|+|+.+|.+ ++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||| T Consensus 220 ~~~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEIN 299 (514) T protein:vir:56 220 IDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELN 299 (514) T ss_pred hhhhhhhhhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhh Confidence 9999999999974 346999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhheeeeeeeeeeccC---------------CcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHH Q lcl|NC_018861. 
284 RTIIEKANEVATVCTDFDVNSA---------------DGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVAT 348 (465) Q Consensus 284 reii~~l~~~at~~~~~~~~~~---------------~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~ 348 (465) ||||++|++.++|+++|+++++ .+||.+||+|+|+++||+|+|+|+|+|+||+|||||||++||+ T Consensus 300 Reii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~ 379 (514) T protein:vir:56 300 REIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVS 379 (514) T ss_pred HHHHHHHHhheeehhcccccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHH Confidence 9999999999999999876642 2599999999999999999999999999999999999999999 Q ss_pred HHHhcCcccccCCccc---ccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeC Q lcl|NC_018861. 349 ILDEIGSFVLSPAGSK---IDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTD 425 (465) Q Consensus 349 ~L~~~~~~~~~~~~~~---~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~d 425 (465) +|+|+||+++.++++. ....|+++..|+|+|+|||+||||||+++|||+|||||++++|+||||||||||.++|.+| T Consensus 380 ~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~d 459 (514) T protein:vir:56 380 ALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSD 459 (514) T ss_pred HHHhhhhhccccccCccccccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccC Confidence 9999999999887763 3556788889999999999999999999999999999999999999999999999999999 Q ss_pred CCcccceeeeeeeeeeeecCcccccc-----------------c----ceEEEee Q lcl|NC_018861. 426 PVSGQPAMILNNRYDVVATPLHPEAF-----------------I----RTFAVNL 459 (465) Q Consensus 426 p~s~qp~~~~~tRY~l~~nPf~~~~~-----------------~----~~f~~~~ 459 (465) |+||||+|||||||||++|||+++-. 
+ +-|+.|| T Consensus 460 p~sfqP~~g~~tRY~l~~NPy~~~~~~~~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 460 SKNFQPVIGFKTRYGVQVNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred CccccceeeeeeeeceeeCCCCCccccccccCCcchhhhcccccceeeeEEEecC Confidence 99999999999999999999975221 1 1222333 No 12 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=8.4e-187 Score=1040.73 Aligned_cols=443 Identities=27% Similarity=0.352 Sum_probs=350.7 Q ss_pred CCccchhhhHHHhhhhhhccccccCh---hhhhheehccccch--------------------------------hHHHh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNE---SEKNIMRTVLENQG--------------------------------NEVKM 45 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~---~~~~~~~~l~~n~~--------------------------------~~~~~ 45 (465) |+-| ..|+|+|||+|+|+|+++.+ +||+|+|+|||||. +++.+ T Consensus 1 ~~~~--~~~~l~~kw~p~l~~~~~~~i~~~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ 78 (521) T protein:vir:72 1 MTIK--TKAELLNKWKPLLEGEGLPEIANSKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATN 78 (521) T ss_pred CCcc--hhHHHHHhhhhhhccCCCCccccchhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCccc Confidence 7655 46999999999999988865 99999999999972 23445 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKD 125 (465) Q Consensus 46 i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea 125 (465) |+||++|++|++|||+||+||||++|||||+||||||||||||||||||||||.++... 
..++++|+...+++..++ T Consensus 79 iaes~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~---~~g~ea~~~e~~~da~fS 155 (521) T protein:vir:72 79 IAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVA---AGAKEAFHPMYGPDAMFS 155 (521) T ss_pred ccccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCC---cccccccchhcccccccc Confidence 78889999999999999999999999999999999999999999999999999887543 234556665555444333 Q ss_pred ccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccc Q lcl|NC_018861. 126 DFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWL 205 (465) Q Consensus 126 ~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~ 205 (465) +.+..........++....+....+.+..+|-. -......++.+++.+... ........+... T Consensus 156 G~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~----------------~~~~~~~~~~~~g~t~~~-~t~~~v~~~~~a 218 (521) T protein:vir:72 156 GQGAAKKFPALAASTQTTVGDIYTHFFQETGTV----------------YLQASVQVTIDAGATDAA-KLDAEIKKQMEA 218 (521) T ss_pred ccccccccccccccccccccccccccccccccc----------------ccccccccccCCCCCCcc-cccccccccccc Confidence 332222111111222222221111111111100 000111122222222111 111222344556 Q ss_pred cccccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHH Q lcl|NC_018861. 
206 KVLKNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEV 278 (465) Q Consensus 206 ~~~~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEI 278 (465) +..++++.+|+|+.+|.++ +.|+||+|+||||+|+|||||||||||||||||||||||||||+||+||||+|| T Consensus 219 ~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEI 298 (521) T protein:vir:72 219 GALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEI 298 (521) T ss_pred CceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHH Confidence 7789999999999999752 459999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhHHHHhhhhheeeeeeeeee------------ccC----CcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEe Q lcl|NC_018861. 279 ALEIDRTIIEKANEVATVCTDFDV------------NSA----DGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIV 342 (465) Q Consensus 279 mlEINreii~~l~~~at~~~~~~~------------~~~----~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~ 342 (465) |+|||||||++|+..+++++.+++ .++ .+||.+||+|+|++|||+|||+|+|+|+||+|||||| T Consensus 299 mlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~ 378 (521) T protein:vir:72 299 MLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIA 378 (521) T ss_pred HHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEE Confidence 999999999999999999887654 222 2699999999999999999999999999999999999 Q ss_pred cHHHHHHHHhcCcccccCCcccccc--cccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccce Q lcl|NC_018861. 
343 SPKVATILDEIGSFVLSPAGSKIDA--INSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQ 420 (465) Q Consensus 343 s~~va~~L~~~~~~~~~~~~~~~~~--~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~ 420 (465) |++||++|+|+|.+++.|+++...+ .|+++..|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+| T Consensus 379 S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~ 458 (521) T protein:vir:72 379 SRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTP 458 (521) T ss_pred chHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeecccccccc Confidence 9999999999999999998874443 67888899999999999999999999999999999999999999999999999 Q ss_pred eeeeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 421 QNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 421 ~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +|++||+||||+|||||||||++|||+.........+.-++.+-. T Consensus 459 ~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~a~~i~~~~~~~ 503 (521) T protein:vir:72 459 LRGSDPKNFQPVMGFKTRYGIGINPFAESAAQAPASRIQSGMPSI 503 (521) T ss_pred ccccCCccccceeeeeeeeceeecCcccccCcccceeecCcChhh Confidence 999999999999999999999999999877666555444333311 No 13 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=4.9e-186 Score=1036.51 Aligned_cols=439 Identities=28% Similarity=0.383 Sum_probs=351.5 Q ss_pred hhhhHHHhhhhhhccccccCh----hhhhheehccccchh--------------------------------HHHhhhhh Q lcl|NC_018861. 6 LLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGN--------------------------------EVKMLMES 49 (465) Q Consensus 6 ~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~--------------------------------~~~~i~es 49 (465) |+.|+|+|||+|+|+|+++.+ |||+|+++|||||+. 
.+.+|.|| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~ 80 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccc Confidence 888999999999999999865 999999999999853 23456788 Q ss_pred hhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNY 129 (465) Q Consensus 50 t~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~ 129 (465) ++|++|.+|||+||+|+||++|||||+||||||||||||||||||||||.++... ..+.++|+. ++++++.| T Consensus 81 ~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~---~~g~ea~~~-----~nEadt~f 152 (519) T protein:vir:10 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIA---AGAKEAFHP-----MYAPNAMF 152 (519) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccc---ccccccccc-----cccccccc Confidence 9999999999999999999999999999999999999999999999999877543 234444544 34556666 Q ss_pred ccccccccccccccccccccccccccccchhhhheeeeeccCcccc-ccccccccccccccCCccCCCcccccCcccccc Q lcl|NC_018861. 130 TGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSV-AIGDEMDKAATFATKKATVEAVYTNEALWLKVL 208 (465) Q Consensus 130 Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~e-a~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~ 208 (465) ||........+..... ... .|-+ .......+++. ....+..+.++++.... ........+...+.. 
T Consensus 153 SG~~~~~~~~~~~~~~-~~~--------~g~~---~~~~~~~s~~~~~~~~~~~t~~ag~t~~~-~~~~a~~~~~~~~~~ 219 (519) T protein:vir:10 153 SGQGAAETFEALAASK-VLE--------VGKI---YSHFFEATGSAHFQAVEAVTVDAGATDAA-KLDAAVTALVEAGQL 219 (519) T ss_pred Cccccccccccccccc-ccc--------cccc---ccccccccccceeccccccccCCCCcCcc-ccccccccccccccc Confidence 6654332221111000 000 0000 00001111111 11122233333332221 233334556677888 Q ss_pred ccccccccchhhhccC-------CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHH Q lcl|NC_018861. 209 KNYTGPYATAAGEKLG-------KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALE 281 (465) Q Consensus 209 ~~~~~~~~Ta~~E~lg-------~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlE 281 (465) ++++.+|+|+.+|.++ +.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+| T Consensus 220 ~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlE 299 (519) T protein:vir:10 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) T ss_pred cccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHH Confidence 9999999999999742 369999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHhhhhheeeeeeeeee------------cc----CCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHH Q lcl|NC_018861. 282 IDRTIIEKANEVATVCTDFDV------------NS----ADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPK 345 (465) Q Consensus 282 INreii~~l~~~at~~~~~~~------------~~----~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~ 345 (465) ||||||++|+..+++++.+.+ .. 
..+||.+||+|+|++|||+|||+|+|+|+||+|||||||++ T Consensus 300 INReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~ 379 (519) T protein:vir:10 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) T ss_pred hhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchH Confidence 999999999998888876433 22 34799999999999999999999999999999999999999 Q ss_pred HHHHHHhcCcccccCCcccc--cccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeee Q lcl|NC_018861. 346 VATILDEIGSFVLSPAGSKI--DAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNL 423 (465) Q Consensus 346 va~~L~~~~~~~~~~~~~~~--~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~ 423 (465) ||++|+|+|.+++.|+++.. ...|++...|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|+|+ T Consensus 380 Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~ 459 (519) T protein:vir:10 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRG 459 (519) T ss_pred HHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccc Confidence 99999999999999976543 3466777889999999999999999999999999999999999999999999999999 Q ss_pred eCCCcccceeeeeeeeeeeecCcccccccceEEEeeccce-eC Q lcl|NC_018861. 
424 TDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYI-IS 465 (465) Q Consensus 424 ~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~-~~ 465 (465) +||+||||+|||||||||++|||+......-.+++.||.- ++ T Consensus 460 ~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~a 502 (519) T protein:vir:10 460 SDPKNFQPVMGFKTRYGIGINPFADPAAQAPTKRIQNGMPDIV 502 (519) T ss_pred cCCccccceeeeeeeeceeecCcccccccCccceeccCchhhh Confidence 9999999999999999999999996665555665566521 11 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=1.1e-184 Score=1029.08 Aligned_cols=404 Identities=29% Similarity=0.463 Sum_probs=337.4 Q ss_pred chhhhHHHhhhhhhccccccCh----hhhhheehccccchhH----HH-------------------hhhhhhhcccccc Q lcl|NC_018861. 5 YLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGNE----VK-------------------MLMESTVTGDIAK 57 (465) Q Consensus 5 ~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~~----~~-------------------~i~est~t~~v~~ 57 (465) ..-.|+|+|||.|+|+|+++.| |||+|+|+|||||.++ +. +|.|+++|++|++ T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~t~~v~~ 80 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAG 80 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCCcccchhhhhhhhcccccccc Confidence 2334999999999999999865 8999999999999544 22 4777899999999 Q ss_pred ccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccc Q lcl|NC_018861. 58 FTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVS 137 (465) Q Consensus 58 ~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s 137 (465) |||+||+||||++|||||+|||||||||||||||||||+||.++.++ +++++++++++|+...... 
T Consensus 81 ~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~--------------EAf~nEadt~fSg~~~~~~ 146 (468) T protein:vir:10 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGE--------------EALFNEPDTGFTGGYDASQ 146 (468) T ss_pred cCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCc--------------cceeccccccccccccccc Confidence 99999999999999999999999999999999999999999887643 5567888888887533222 Q ss_pred ccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccc Q lcl|NC_018861. 138 FKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYAT 217 (465) Q Consensus 138 ~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~T 217 (465) .......+... .+ ..... ..........+.++++.+|+| T Consensus 147 ~~~~~~~~~~~-----------------------~~---------------~~~g~---~~~~~~~a~~~~~~~g~gMsT 185 (468) T protein:vir:10 147 GDYAVRTGAGV-----------------------GG---------------DSEGN---NPALLNDAAPGTYEVGSKMPR 185 (468) T ss_pred ccccccccccc-----------------------cc---------------CCCCC---cccccccccccccccccccch Confidence 11110000000 00 00000 000111223456788999999 Q ss_pred hhhhccC---CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhee Q lcl|NC_018861. 
218 AAGEKLG---KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVA 294 (465) Q Consensus 218 a~~E~lg---~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~a 294 (465) +.+|.+| +.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|++++ T Consensus 186 a~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va 265 (468) T protein:vir:10 186 EDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVA 265 (468) T ss_pred HHHhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhh Confidence 9999886 4599999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeee--------eeec-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccc Q lcl|NC_018861. 295 TVCTD--------FDVN-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKI 365 (465) Q Consensus 295 t~~~~--------~~~~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~ 365 (465) ++++. ||+. +.+|||.+|++|+|++|||+++|+|+++|+||+|||+|||++||++|+|+||+++.|+.+.. T Consensus 266 ~~~k~~g~~~~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~ 345 (468) T protein:vir:10 266 KKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGA 345 (468) T ss_pred hheecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceeccccccc Confidence 88774 5553 56899999999999999999999999999999999999999999999999999999987765 Q ss_pred c-----ccccccceEEEEecCceEEEEeCCC----CcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeee Q lcl|NC_018861. 366 D-----AINSGIKPNVGKFDNRYDVIVDNFA----EFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILN 436 (465) Q Consensus 366 ~-----~~~~~~~~~~G~l~~~~~vy~d~~~----~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~ 436 (465) . 
.+|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+|||||||||+.|++++||+||||+|||| T Consensus 346 ~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 425 (468) T protein:vir:10 346 GGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) T ss_pred ccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeee Confidence 4 4689999999999999999999986 589999999999999999999999999999999999999999999 Q ss_pred eeeeeeecCcccc---------------cccceEE-Eeeccce Q lcl|NC_018861. 437 NRYDVVATPLHPE---------------AFIRTFA-VNLNNYI 463 (465) Q Consensus 437 tRY~l~~nPf~~~---------------~~~~~f~-~~~~~~~ 463 (465) |||||++|||+.. +.|.+|- |=..|-. T Consensus 426 tRY~l~~NP~~~~~~~~~g~~~~~~~~~~~N~y~r~~~v~~l~ 468 (468) T protein:vir:10 426 TRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) T ss_pred eeeceeecccceeccccCCCcccccccccccceeeeEEEeccC Confidence 9999999999831 2334443 2222222 No 15 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1.2e-184 Score=1028.81 Aligned_cols=415 Identities=30% Similarity=0.494 Sum_probs=343.8 Q ss_pred CCccchhhhHHHhhhhhhccccccCh----hhhhheehccccchh---------------------HHHhhhhhhhcccc Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGN---------------------EVKMLMESTVTGDI 55 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~---------------------~~~~i~est~t~~v 55 (465) |.-| ..|+|+|||.|+|+++++.| |||+|+|+|||||.. 
++.+|+||++|++| T Consensus 1 ~~~~--~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~~l~e~~~~~~~~~~~~~~i~~st~t~~v 78 (470) T protein:vir:10 1 MQMF--NSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERNFLSEAPNVNTNSGATAGFSADATAAGPV 78 (470) T ss_pred CCcc--hhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccchhhhhhhccccccccccccccccccccc Confidence 6554 36999999999999999855 999999999999943 23467777999999 Q ss_pred ccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccc Q lcl|NC_018861. 56 AKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIE 135 (465) Q Consensus 56 ~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~ 135 (465) ++|||+||+||||++|||||+||||||||||||||||||||||.++.++ +++++++++.|||.... T Consensus 79 ~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~--------------EaffnEA~T~fSG~~~~ 144 (470) T protein:vir:10 79 AGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGT--------------EALFNEADTAFSGQPDG 144 (470) T ss_pred cccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCcc--------------ceeeecCCcccCccccc Confidence 9999999999999999999999999999999999999999999887543 56677888888876444 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) ....+.....+... .|...... . ...........+......++++.+| T Consensus 145 ~~~~~~~~~~~a~~----------------------~g~~~~~~------~----gt~~~~~~~~~~~a~~~~y~~~~GM 192 (470) T protein:vir:10 145 LDDTSGFTATGANN----------------------VGLGTTAQ------Q----GSNPGLLNSTAAQTNATDYNVGQGM 192 (470) T ss_pred cccccccccccccc----------------------cccccccc------c----ccccccccccccccccccccccccc Confidence 33222111111000 00000000 0 0000001111223344567889999 Q ss_pred cchhhhccC----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_018861. 
216 ATAAGEKLG----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKAN 291 (465) Q Consensus 216 ~Ta~~E~lg----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~ 291 (465) +|+.+|.+| ++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 193 sTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~ 272 (470) T protein:vir:10 193 RTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIY 272 (470) T ss_pred chHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHh Confidence 999999886 4599999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeeeeee--------eeec-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCc Q lcl|NC_018861. 292 EVATVCTD--------FDVN-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAG 362 (465) Q Consensus 292 ~~at~~~~--------~~~~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~ 362 (465) +++++++. ||+. +.+|||.+|++|+|++||++++|+|+++|+||+|||||||++||++|+++||+++.|+. T Consensus 273 ~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~ 352 (470) T protein:vir:10 273 NVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPAL 352 (470) T ss_pred hhhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhcccccccccc Confidence 99998876 4443 45789999999999999999999999999999999999999999999999999999998 Q ss_pred ccccccccccceEEEEecCceEEEEeCC------CCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeee Q lcl|NC_018861. 
363 SKIDAINSGIKPNVGKFDNRYDVIVDNF------AEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILN 436 (465) Q Consensus 363 ~~~~~~~~~~~~~~G~l~~~~~vy~d~~------~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~ 436 (465) +....+|+++..|+|+|+|||||||||| +++|||+|||||++++|+|||||||||+.+++++||+||||+|||| T Consensus 353 ~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~ 432 (470) T protein:vir:10 353 NANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFK 432 (470) T ss_pred ccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCccccceeeee Confidence 8888899999999999999999999997 8899999999999999999999999999999999999999999999 Q ss_pred eeeeeeecCccccccc----------ceEE-Eeeccce Q lcl|NC_018861. 437 NRYDVVATPLHPEAFI----------RTFA-VNLNNYI 463 (465) Q Consensus 437 tRY~l~~nPf~~~~~~----------~~f~-~~~~~~~ 463 (465) |||||++|||+..... .+|- |=..|-. T Consensus 433 tRY~l~~NP~~~~~~~~~~~i~~~~n~y~r~~~v~~l~ 470 (470) T protein:vir:10 433 TRYGLVENPFSQGTTQGLGTLTRNSNRYYRRVKVANLM 470 (470) T ss_pred eeeceeecCcccCCCcccccccCCCCceeeEEEeeccC Confidence 9999999999865443 2222 1111111 No 16 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=2.3e-184 Score=1027.40 Aligned_cols=412 Identities=31% Similarity=0.507 Sum_probs=335.5 Q ss_pred hhhhHHHhhhhhhccccccCh----hhhhheehccccchhH----HHhhhh----------hhhccccccccchhhhhhh Q lcl|NC_018861. 
6 LLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGNE----VKMLME----------STVTGDIAKFTPILVPVIR 67 (465) Q Consensus 6 ~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~~----~~~i~e----------st~t~~v~~~~P~l~~l~~ 67 (465) |+-|+|+|||.|+|+|+++.+ +||+|+++|||||.++ +.+|.| +++|+++++|||+||+||| T Consensus 1 ms~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~l~ea~~~~g~~~~~~~t~~~~~~~P~Li~l~R 80 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEGQVLNETLQTTGYTTGDTATGPVAGFDPVLISLIR 80 (462) T ss_pred CchHHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcccchhccccccCCCcCcccccccccccchhhhHHH Confidence 445899999999999999865 8999999999999654 455544 5789999999999999999 Q ss_pred hhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccccccccccccc Q lcl|NC_018861. 68 RALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGK 147 (465) Q Consensus 68 ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~gga 147 (465) |++|||||+|||||||||||||||||||+||.++..... ....+++++++++.+|+........+....++ T Consensus 81 ra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~n--------q~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~- 151 (462) T protein:vir:10 81 RSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPAN--------SDFREALFNEPNAGFSGGAGTGLSNYDPTASS- 151 (462) T ss_pred HHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccc--------cccchhhhccCCcCcccccccccccccccccc- Confidence 999999999999999999999999999999977643211 22246667888888877533322211100000 Q ss_pred ccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCC-- Q lcl|NC_018861. 148 IVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGK-- 225 (465) Q Consensus 148 it~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~-- 225 (465) ..+ ....+. 
......+.........+.+.+|+|+.+|.+|+ T Consensus 152 -----------~~~-------~~~~g~-------------------~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s 194 (462) T protein:vir:10 152 -----------SAV-------NDAEGA-------------------NPGLLNDSPAGTYEVTGDATGMATATAEALDDSS 194 (462) T ss_pred -----------ccc-------cccccc-------------------cceeecCCCccceecccccccccchhccccCCcc Confidence 000 000000 00000111111122355677999999999873 Q ss_pred ---chhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeee--- Q lcl|NC_018861. 226 ---DMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTD--- 299 (465) Q Consensus 226 ---~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~--- 299 (465) .|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|++++++++. T Consensus 195 ~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~ 274 (462) T protein:vir:10 195 ASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANT 274 (462) T ss_pred CCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccc Confidence 59999999999999999999999999999999999999999999999999999999999999999999999874 Q ss_pred -----eeec-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcc---ccccccc Q lcl|NC_018861. 300 -----FDVN-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGS---KIDAINS 370 (465) Q Consensus 300 -----~~~~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~---~~~~~~~ 370 (465) ||+. 
..+|||.+|++|+|++||++++|+|+++|+||+|||+|||++||++|+|+|||+++|+.+ ....+|| T Consensus 275 ~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~ 354 (462) T protein:vir:10 275 ATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDD 354 (462) T ss_pred cccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhcccccccccccccccc Confidence 5554 357899999999999999999999999999999999999999999999999999999633 2334788 Q ss_pred ccceEEEEecCceEEEEeCC----CCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecCc Q lcl|NC_018861. 371 GIKPNVGKFDNRYDVIVDNF----AEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPL 446 (465) Q Consensus 371 ~~~~~~G~l~~~~~vy~d~~----~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf 446 (465) ++.+++|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+||||+|||||||||++||| T Consensus 355 ~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~ 434 (462) T protein:vir:10 355 TSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVSNPF 434 (462) T ss_pred ccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeeeeecCC Confidence 99999999999999999998 68999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccc----------eEE-Eeeccce Q lcl|NC_018861. 447 HPEAFIR----------TFA-VNLNNYI 463 (465) Q Consensus 447 ~~~~~~~----------~f~-~~~~~~~ 463 (465) ++..+.. +|- |=..|-. 
T Consensus 435 t~~~~~~~~~~~~~~n~y~r~~~v~~l~ 462 (462) T protein:vir:10 435 SGGLTQGSGALTANANKYYRRVQVANLM 462 (462) T ss_pred CCCcCCccccccccCcceeeeEEeeccC Confidence 9877643 222 1111111 No 17 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=1.2e-182 Score=1017.98 Aligned_cols=409 Identities=33% Similarity=0.519 Sum_probs=338.2 Q ss_pred hhhhHHHhhhhhhccccccCh----hhhhheehccccchhH----HHhhhh----------hhhccccccccchhhhhhh Q lcl|NC_018861. 6 LLDESTKEKFITSNLYPNLNE----SEKNIMRTVLENQGNE----VKMLME----------STVTGDIAKFTPILVPVIR 67 (465) Q Consensus 6 ~~~e~~~e~~~~~~~~~~~~~----~~~~~~~~l~~n~~~~----~~~i~e----------st~t~~v~~~~P~l~~l~~ 67 (465) |+-|+|+|||.|+|+|++++| |||+|+++|||||.++ +++|.| |++|++|++|||+||+||| T Consensus 1 m~~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~~~l~ea~~~~g~~~~s~~t~~v~~~~P~Li~l~R 80 (457) T protein:vir:10 1 MSFQNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEGKILTETLQTTGYTGGDTVTGPVAGFDPVLISLIR 80 (457) T ss_pred CchHHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhccccccccccccCCCcccccccccccccchhhhhhH Confidence 455899999999999999976 8999999999999654 455666 5788999999999999999 Q ss_pred hhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccccccccccccc Q lcl|NC_018861. 68 RALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGK 147 (465) Q Consensus 68 ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~gga 147 (465) |++|||||+|||||||||||||||||||+||.++.+.... ...++++++++++||+.......+.... . 
T Consensus 81 ra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a--------~~~EAl~nEadt~fSg~~~~~~~~~~~~-~-- 149 (457) T protein:vir:10 81 RSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAA--------GYDEAFFNEPNAGFSGGPGAYDPGATGV-T-- 149 (457) T ss_pred HHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccc--------cccceeeeccCcccCccccccccccccc-c-- Confidence 9999999999999999999999999999999887654221 1245667788887776433322111000 0 Q ss_pred ccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCC-- Q lcl|NC_018861. 148 IVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGK-- 225 (465) Q Consensus 148 it~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~-- 225 (465) +. ..|+ .....+.........++++.+|+|+.+|.+|+ T Consensus 150 --~~-------------------~~gt-------------------~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~ 189 (457) T protein:vir:10 150 --ND-------------------AEGT-------------------NPALLNDSPAGTYEQADDATGMSTATVEALDDST 189 (457) T ss_pred --cc-------------------cccc-------------------cccccCccccccccccccccchhhhhhhccCCCC Confidence 00 0000 00011111122234567899999999999883 Q ss_pred ---chhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeee--- Q lcl|NC_018861. 226 ---DMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTD--- 299 (465) Q Consensus 226 ---~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~--- 299 (465) .|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|++++++++. T Consensus 190 ~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~ 269 (457) T protein:vir:10 190 ANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNT 269 (457) T ss_pred CccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeecccc Confidence 49999999999999999999999999999999999999999999999999999999999999999999998875 Q ss_pred -----eeec-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccc---ccc Q lcl|NC_018861. 
300 -----FDVN-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDA---INS 370 (465) Q Consensus 300 -----~~~~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~---~~~ 370 (465) ||+. +.+|||.+|++|+|++||++++|+|+++|+||+|||+|||++||++|+++||++++|+.+.... +|+ T Consensus 270 ~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~ 349 (457) T protein:vir:10 270 ATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDD 349 (457) T ss_pred ccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhcccccccccc Confidence 4443 5688999999999999999999999999999999999999999999999999999998665433 688 Q ss_pred ccceEEEEecCceEEEEeCCC----CcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecCc Q lcl|NC_018861. 371 GIKPNVGKFDNRYDVIVDNFA----EFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPL 446 (465) Q Consensus 371 ~~~~~~G~l~~~~~vy~d~~~----~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf 446 (465) ++.+++|+|+|||+||||||+ ++|||+|||||++++|+||||||||||.+++++||+||||+|||||||||++||| T Consensus 350 ~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~ 429 (457) T protein:vir:10 350 TSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGMVSNPF 429 (457) T ss_pred ccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCccccceeeeeeeeeeeeccc Confidence 999999999999999999886 6999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceEE--EeeccceeC Q lcl|NC_018861. 447 HPEAFIRTFA--VNLNNYIIS 465 (465) Q Consensus 447 ~~~~~~~~f~--~~~~~~~~~ 465 (465) +...+...=. .|.|.+-=. 
T Consensus 430 ~~~~~~~~~~~~~~~n~~~~r 450 (457) T protein:vir:10 430 AGGLTQGSGALTVNANKYYRR 450 (457) T ss_pred ccccccccccccccchhhcce Confidence 8766643211 122211100 No 18 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=99.71 E-value=2.2e-21 Score=133.78 Aligned_cols=390 Identities=12% Similarity=0.015 Sum_probs=166.7 Q ss_pred hhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 45 MLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 45 ~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) |=.-+++--=..+|.|.|=+. +-- | -|+.+. .| ..||..+.. .+ ..+..... T Consensus 1 ~~~~~~~e~l~~kw~p~l~~~-~~~---------~-~~~~~a---~l------lenq~~~~~------~~--l~e~~~~~ 52 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEGC-RND---------W-ERHTLA---TL------LENQYREAK------KH--LMETTQTT 52 (523) T ss_pred CCcchhhHHHHHhhhhhhccc-CCh---------h-HHHHHH---HH------hhhhhHHHH------Hh--hhhhhhcc Confidence 100001111234566666431 000 0 000000 00 011111100 00 01111111 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCccccc--- Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNE--- 201 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~--- 201 (465) +..++.....-.-.....-...+||++||||||||||||||+||.++.|+|++|++....++...+.........+. 
T Consensus 53 ~~~~~~~~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ean~~~s~~~~ 132 (523) T protein:vir:59 53 EVDGWNLALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDENARLSRREY 132 (523) T ss_pred ccccccchhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccccccccccccccc Confidence 11111111011112334456789999999999999999999999999999999998776655443322221111110 Q ss_pred -CccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhC-CCHHHHHHHHHHHHHH Q lcl|NC_018861. 202 -ALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHG-INAEKELADILSAEVA 279 (465) Q Consensus 202 -a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG-lDAe~EL~niLstEIm 279 (465) ..........+.+.....+-..+..+.+|+|+|+|..|++++|+++++|+.+.+++.+.++| .++.....+.+.+++. T Consensus 133 ~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~as 212 (523) T protein:vir:59 133 ETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIV 212 (523) T ss_pred cCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccchhhcccccc Confidence 00011111122222233333445679999999999999999999999999999999999986 4444444555555555 Q ss_pred HHhhHHHHhhhhhe-eee--------------------e--------------------------eeeeecc-------- Q lcl|NC_018861. 280 LEIDRTIIEKANEV-ATV--------------------C--------------------------TDFDVNS-------- 304 (465) Q Consensus 280 lEINreii~~l~~~-at~--------------------~--------------------------~~~~~~~-------- 304 (465) -+.++-.-...... ++. + -.|.++. T Consensus 213 tAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSR 292 (523) T protein:vir:59 213 GAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTR 292 (523) T ss_pred ccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecc Confidence 55443211100000 000 0 0011111 Q ss_pred -CCcccHHHHHHHHHH--H--------HHHHHHHHHHhcccccccEEEecHHHHHH--HHhcCcccccCCcccccccccc Q lcl|NC_018861. 
305 -ADGRWFIEKARGLSM--R--------ISNEAREIGRQTRKGGGNKLIVSPKVATI--LDEIGSFVLSPAGSKIDAINSG 371 (465) Q Consensus 305 -~~~~~~~e~~~~L~~--~--------i~~~a~~i~~~T~~~~~~~~~~s~~va~~--L~~~~~~~~~~~~~~~~~~~~~ 371 (465) -...|.+|.+++|-. . .+-+++||...=-|=-...|++.+++... +...|.++....++...... T Consensus 293 aLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~-- 370 (523) T protein:vir:59 293 KLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAG-- 370 (523) T ss_pred cccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhh-- Confidence 022455677777755 2 12233344332222222223333322111 12234443332221110000 Q ss_pred cceEE--EEecCceEEEEe---------CC-CCcceEEEEEe------------cC---CCccceeEEecccccceeeee Q lcl|NC_018861. 372 IKPNV--GKFDNRYDVIVD---------NF-AEFDYCTVAYK------------GA---SNFDAGIFFAPYNITLQQNLT 424 (465) Q Consensus 372 ~~~~~--G~l~~~~~vy~d---------~~-~~~dy~~vg~k------------g~---~~~d~glfy~PY~~~~~~~~~ 424 (465) ..++ +.-.-.+-++++ +- -.-.|+++..| +. ...+.+.+|+=-.-+-+++.+ T Consensus 371 -~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~ 449 (523) T protein:vir:59 371 -NFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRLYK 449 (523) T ss_pred -hhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEEEe Confidence 0000 000000001111 00 12234444332 11 123345544323345567777 Q ss_pred CCCcccceeeeeeee-------eeeecCcccccccc------eEE---EeeccceeC Q lcl|NC_018861. 425 DPVSGQPAMILNNRY-------DVVATPLHPEAFIR------TFA---VNLNNYIIS 465 (465) Q Consensus 425 dp~s~qp~~~~~tRY-------~l~~nPf~~~~~~~------~f~---~~~~~~~~~ 465 (465) ||.+-+..+-+=.+. 
||.=+||.|--..+ +|- --++-|=|+ T Consensus 450 d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~ 506 (523) T protein:vir:59 450 NIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALE 506 (523) T ss_pred cCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhhe Confidence 887777665443333 34557777632222 111 001111111 No 19 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=99.03 E-value=2.7e-12 Score=84.01 Aligned_cols=382 Identities=12% Similarity=0.013 Sum_probs=170.1 Q ss_pred ccccccCh-hhhhheehccccchhHHHhhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEEEEE Q lcl|NC_018861. 19 NLYPNLNE-SEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPH 97 (465) Q Consensus 19 ~~~~~~~~-~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSr 97 (465) +..+.|.| |. -|||+.+. ..|.. ..| -++++|.+-|.- .||.-+||+.-|+++-+-.+.. T Consensus 1 ~~~~~l~~kw~-----p~l~~~~~--~~i~~--------~~~---~~i~~~~~en~~-~~~~~~~~~~~~~~~~~~~~~l 61 (519) T protein:vir:10 1 MKKNALVQKWS-----ALLENEAL--PEIVG--------ASK---QAIIAKIFENQE-QDILTAPEYRDEKISEAFGSFL 61 (519) T ss_pred CchhHHHHHhH-----Hhhccccc--chhhh--------hhh---HHHHHHHHHHHH-HHhhhcccccchHHHHHHhhhc Confidence 33333322 32 24443321 11110 001 136788999988 9999999999999988755521 Q ss_pred ecC-CCCcccccccccccCcccccccccccccccccccccc---ccccccccccccccccccccchhhhheeeeeccCc- Q lcl|NC_018861. 98 YVG-DGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVS---FKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNST- 172 (465) Q Consensus 98 Y~~-~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s---~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~- 172 (465) =.. .++++.....+-+-+...+. ....++.-.. -.+.....-++|++|||+||||||||||++|.++. 
T Consensus 62 ~e~~~~~~~~~~~t~i~~~~~t~~-------v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~ 134 (519) T protein:vir:10 62 TEAEIGGDHGYDATNIAAGQTSGA-------VTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPI 134 (519) T ss_pred chhccCCccccCcccccccccccc-------ccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCcc Confidence 100 01111110000000000000 0011111111 12344567889999999999999999999999875 Q ss_pred ---cccc--cccccccccccccCCccCCCcccccCccccc-----c--------------ccccccccchhhhccCCch- Q lcl|NC_018861. 173 ---GSVA--IGDEMDKAATFATKKATVEAVYTNEALWLKV-----L--------------KNYTGPYATAAGEKLGKDM- 227 (465) Q Consensus 173 ---g~ea--~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~-----~--------------~~~~~~~~Ta~~E~lg~~f- 227 (465) +.|+ +++|+++.||+..............+...+. + ...+++..+.......... T Consensus 135 ~~~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~ 214 (519) T protein:vir:10 135 AAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALV 214 (519) T ss_pred ccccccccccccccccccCccccccccccccccccccccccccccccccccceeccccccccCCCCcCcccccccccccc Confidence 4444 4699999999875542211111100000000 0 0000000000000000000 Q ss_pred h-hcceEEEEEEEEeecceec------------ccchHH-----H-HHHHHhhhCCCHHHHHHHH--------HHHHHHH Q lcl|NC_018861. 228 K-EMGISVQRVLAEAKTRKVK------------GTYTIE-----M-LQDLKAQHGINAEKELADI--------LSAEVAL 280 (465) Q Consensus 228 ~-EM~FsIeK~tVtAKSRaLK------------AEYT~E-----L-AQDLkAiHGlDAe~EL~ni--------LstEIml 280 (465) . --.+.+-.--.|++--+|. ..++|| . ..=|||..-+..-+-|..| |++-+.. T Consensus 215 ~~~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILST 294 (519) T protein:vir:10 215 EAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILAT 294 (519) T ss_pred ccccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHH Confidence 0 0001111111222222210 111111 1 2458888777777777664 8999999 Q ss_pred HhhHHHHhhhhheeeeeeeeeeccCCccc-----HH--------HHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHH Q lcl|NC_018861. 
281 EIDRTIIEKANEVATVCTDFDVNSADGRW-----FI--------EKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVA 347 (465) Q Consensus 281 EINreii~~l~~~at~~~~~~~~~~~~~~-----~~--------e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va 347 (465) ||=.||=|-+...+++++++..++.+..| .. +.+|-...+...+--+|.+.- . T Consensus 295 EImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~a--------------n 360 (519) T protein:vir:10 295 EIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEA--------------A 360 (519) T ss_pred HHHHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHH--------------H Confidence 99999999999889999999998877532 11 123333333333333333321 1 Q ss_pred HHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCC-------CC-cceEEE-EEe--cCCCccceeEEeccc Q lcl|NC_018861. 348 TILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNF-------AE-FDYCTV-AYK--GASNFDAGIFFAPYN 416 (465) Q Consensus 348 ~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~-------~~-~dy~~v-g~k--g~~~~d~glfy~PY~ 416 (465) .+.+...-. .|+ .|.|-+. +. .||.-- |.. ...+...++||. -. T Consensus 361 ~I~~~T~r~-------------------~gn-----~ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G-~l 415 (519) T protein:vir:10 361 EIARQTGRG-------------------AGN-----FIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAG-VL 415 (519) T ss_pred HHHHhhccc-------------------ccc-----EEEEchHHHHHHhhccchhccccccccccccccCCCceEEE-Ee Confidence 122211111 111 2333332 00 111111 111 122333555554 22 Q ss_pred ccceeeeeCCCcccceeeeeee------eeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 
417 ITLQQNLTDPVSGQPAMILNNR------YDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 417 ~~~~~~~~dp~s~qp~~~~~tR------Y~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -+-+++.+||.+-+..+-+=.+ =||.=+||.|.-..+.-- .++.=.|-- T Consensus 416 ~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~ 471 (519) T protein:vir:10 416 GGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) T ss_pred cCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeee Confidence 2445666677766665332222 133445666544433210 111111111 No 20 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=98.85 E-value=2.6e-11 Score=78.61 Aligned_cols=369 Identities=13% Similarity=0.101 Sum_probs=146.2 Q ss_pred hhhccccccC-hhhhhheehccccchhHHHhhhhhhhccccccccchhhhhhhhhhhhhh----hhhheeeeccCCCcce Q lcl|NC_018861. 16 ITSNLYPNLN-ESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILVPVIRRALPSLI----GTEIAGVQALKTPTAY 90 (465) Q Consensus 16 ~~~~~~~~~~-~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI----~~DIwGVQPMTgPTGL 90 (465) -...+.|.|. +| +-|||..+. |.....-||.+---| ..|+-- -| T Consensus 1 ~~~~~~e~l~~kw-----~p~l~~~~~------------------~~i~~~~~~~v~a~l~enq~~~~~~-----~~--- 49 (470) T protein:vir:10 1 MQMFNSEYLQEKW-----APILDYDGL------------------DPIKDSHRRSVTAVLLENQEKELRE-----ER--- 49 (470) T ss_pred CCcchhHHHHHhh-----hhhhcCCcc------------------chhcchhhhhhhhhhhhhhHHHHhh-----cc--- Confidence 1112223331 12 234444321 000011111111100 000000 00 Q ss_pred EEEEEEEecCCCCcccccccccccCccccccccccccc---cccccc--cccc-cccccccccccccccccccchhhhhe Q lcl|NC_018861. 91 LYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFN---YTGTPI--EVSF-KTATTVKGKIVYSEKQAGTDNIVNVL 164 (465) Q Consensus 91 IFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~---~Sg~~~--~~s~-~tatt~ggait~~~~~TGPTgLifam 164 (465) .|...... -...|+....+..++.+- ....+. 
+..- .+..-...++|++||++||||||||| T Consensus 50 ------~~l~e~~~------~~~~~~~~~~~i~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAm 117 (470) T protein:vir:10 50 ------NFLSEAPN------VNTNSGATAGFSADATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAM 117 (470) T ss_pred ------chhhhhhh------ccccccccccccccccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEE Confidence 00000000 000011111111111000 001111 1111 23344577899999999999999999 Q ss_pred eeeeccCccccccccccccccccccCCccCCCccccc-Cccccccc----cccccccchhhhccCCchhhcceEEEEEEE Q lcl|NC_018861. 165 LRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNE-ALWLKVLK----NYTGPYATAAGEKLGKDMKEMGISVQRVLA 239 (465) Q Consensus 165 ~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~-a~~~~~~~----~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tV 239 (465) |++|.+++|+|+||+|+++.||+.............. +...+... .+..+.....+. +. ..-.+.+-+--. T Consensus 118 RsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~--~a--~~~~y~~~~GMs 193 (470) T protein:vir:10 118 RSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAA--QT--NATDYNVGQGMR 193 (470) T ss_pred EEEecCCCccceeeecCCcccCcccccccccccccccccccccccccccccccccccccccc--cc--cccccccccccc Confidence 9999999999999999999999866543322111100 00000000 000000000000 00 000112222233 Q ss_pred Eeecceeccc--------------chHHH-HHHHHhhhCCCHHHHHHH--------HHHHHHHHHhhHHHHhhhhh-eee Q lcl|NC_018861. 240 EAKTRKVKGT--------------YTIEM-LQDLKAQHGINAEKELAD--------ILSAEVALEIDRTIIEKANE-VAT 295 (465) Q Consensus 240 tAKSRaLKAE--------------YT~EL-AQDLkAiHGlDAe~EL~n--------iLstEImlEINreii~~l~~-~at 295 (465) ||.-.+|... +|+|. -.=|||..-+..-+-|.. .|++-+..||=.||=|-+.. 
..+ T Consensus 194 Ta~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~ 273 (470) T protein:vir:10 194 TDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYN 273 (470) T ss_pred hHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhh Confidence 3333344311 11111 235888777766666655 58999999999999888765 489 Q ss_pred eeeeeeeccCCcccHH-----HHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccc-cccc Q lcl|NC_018861. 296 VCTDFDVNSADGRWFI-----EKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKI-DAIN 369 (465) Q Consensus 296 ~~~~~~~~~~~~~~~~-----e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~-~~~~ 369 (465) +.++|++.+....+.. ..+|....+.+.+.-+|.+.-.. +.+...... +++. .+.+ T Consensus 274 ~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~--------------i~~~t~r~~----~n~~i~S~~ 335 (470) T protein:vir:10 274 VAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANA--------------IAQRTRRGK----GNMILCSAD 335 (470) T ss_pred hhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHH--------------HHHhhcccc----ceEEEEchh Confidence 9999999987655543 23566677777777666653321 212211110 1100 0000 Q ss_pred ccc-ceEEEEecCceEEEEeCCCCcceEEEEEecCCCcc--ceeEEecccccceeeeeCCC------cccc--eeeeeee Q lcl|NC_018861. 370 SGI-KPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFD--AGIFFAPYNITLQQNLTDPV------SGQP--AMILNNR 438 (465) Q Consensus 370 ~~~-~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d--~glfy~PY~~~~~~~~~dp~------s~qp--~~~~~tR 438 (465) .-. ....|.|. +.+ |..+..+.| +..||. -.-+-+++-+||= +-.+ .+|+|== T Consensus 336 Va~~La~sG~l~---------~~~------~~~~~~~~D~t~~~~~G-~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~ 399 (470) T protein:vir:10 336 VASALTMAGVLD---------YTP------ALNANLNVDDTGNTFAG-ILQGKYRVYIDPFSASGGAAATQYYVVGYKGS 399 (470) T ss_pred HHhHhhhccccc---------ccc------ccccccccCCCCceEEE-EecCceEEEeeccccccCcccccEEEEEEecC Confidence 000 00112221 000 000011111 222222 1122344455552 1112 2333310 Q ss_pred ----eeeeecCcccccccce-----EE---EeeccceeC Q lcl|NC_018861. 
439 ----YDVVATPLHPEAFIRT-----FA---VNLNNYIIS 465 (465) Q Consensus 439 ----Y~l~~nPf~~~~~~~~-----f~---~~~~~~~~~ 465 (465) =||.=+||.|--..+. |- --++-|=|+ T Consensus 400 ~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 438 (470) T protein:vir:10 400 SPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRYGLV 438 (470) T ss_pred cceecceeeccccccccCCCCCCccccceeeeeeeecee Confidence 1233355554322221 10 000111111 No 21 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=98.53 E-value=3e-09 Score=67.34 Aligned_cols=377 Identities=13% Similarity=0.050 Sum_probs=149.2 Q ss_pred eehccccc---hhHHHhhhhhhhccccccccchhhhhhhhhhhhh--------hhhhheeeeccCCCcceEEEEEEEecC Q lcl|NC_018861. 32 MRTVLENQ---GNEVKMLMESTVTGDIAKFTPILVPVIRRALPSL--------IGTEIAGVQALKTPTAYLYAMVPHYVG 100 (465) Q Consensus 32 ~~~l~~n~---~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~l--------I~~DIwGVQPMTgPTGLIFAMRSrY~~ 100 (465) |+++.-+. ..|.-+| |...--.+..++-.++ -|.+-|- .|.|=.-||-+. .+.. T Consensus 1 ~~~~~~~e~l~~kw~p~l-~~~~~~~~~~~~~~~~---a~l~enq~~~~~~~~~~~~~~~~~~~~-----------~~l~ 65 (522) T protein:vir:69 1 MTTIKTKAQLVDKWKELL-EGEGLPEIANSKQAII---AKIFENQEKDFEVSPEYKDEKIAQAFG-----------SFLT 65 (522) T ss_pred CCccchHHHHHHhhHHHh-cCCCCCccccchhhhh---hhhhhhhhHHhhcccccchhHHHHhhh-----------hhhh Confidence 33332221 0111111 1111111111222111 1122220 111111111110 0000 Q ss_pred CCCcccccccccccCccccccccccccccc---ccccccc---ccccccccccccccccccccchhhhheeeeeccCc-- Q lcl|NC_018861. 101 DGNNSVSPTKNAIVLKLKTESANKDDFNYT---GTPIEVS---FKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNST-- 172 (465) Q Consensus 101 ~~~~~~~~~~~aaf~~~~~a~~~ea~~~~S---g~~~~~s---~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~-- 172 (465) +. .-...|+.....-.++.+-.+ ..+.-.+ -.+..-..-++|++|||+||||||||||++|.++. 
T Consensus 66 ----ea---~~~~~~~~~~~~i~es~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~ 138 (522) T protein:vir:69 66 ----EA---EIGGDHGYNAQNIAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIA 138 (522) T ss_pred ----hh---ccccccCCCcccccccccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccc Confidence 00 000011111111111111000 1111111 12334457789999999999999999999999875 Q ss_pred --ccccc--ccccccccccccCCccCCCcccccCcccccc----------------ccccccccchhhhcc--------- Q lcl|NC_018861. 173 --GSVAI--GDEMDKAATFATKKATVEAVYTNEALWLKVL----------------KNYTGPYATAAGEKL--------- 223 (465) Q Consensus 173 --g~ea~--~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~----------------~~~~~~~~Ta~~E~l--------- 223 (465) +.|++ ++|+++.|++..............+...+.. .....+ .+...... T Consensus 139 ~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~-~t~~~~~~~~~ai~s~~ 217 (522) T protein:vir:69 139 AGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTIS-SSADDAAKLDAEIIKQM 217 (522) T ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCCcCC-CCCcccccccchhcccc Confidence 55666 4999999998654332111111110000000 000000 00000000 Q ss_pred -CCchhhcceEEEEEEEEeecceec----------cc--chHH-----H-HHHHHhhhCCCHHHHHHHH--------HHH Q lcl|NC_018861. 224 -GKDMKEMGISVQRVLAEAKTRKVK----------GT--YTIE-----M-LQDLKAQHGINAEKELADI--------LSA 276 (465) Q Consensus 224 -g~~f~EM~FsIeK~tVtAKSRaLK----------AE--YT~E-----L-AQDLkAiHGlDAe~EL~ni--------Lst 276 (465) ....-+++.-+. |++--+|. +| ++|| . ..=|||..-+..-+-|..| |++ T Consensus 218 ~~~~~y~~g~Gms----Ta~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaN 293 (522) T protein:vir:69 218 EAGALVEIAEGMA----TSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSG 293 (522) T ss_pred ccccceeeccccc----hhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHH Confidence 000111121221 11111110 11 1111 0 2357887777766776654 888 Q ss_pred HHHHHhhHHHHhhhhheeeeeeeeeeccCCccc-------H------HHHHHHHHHHHHHHHHHHHHhcccccccEEEec Q lcl|NC_018861. 
277 EVALEIDRTIIEKANEVATVCTDFDVNSADGRW-------F------IEKARGLSMRISNEAREIGRQTRKGGGNKLIVS 343 (465) Q Consensus 277 EImlEINreii~~l~~~at~~~~~~~~~~~~~~-------~------~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s 343 (465) -+.-||=.||=|-+...+++++++.+++..+-| . .+.+|-...+...+--+|.+.-. T Consensus 294 ILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an---------- 363 (522) T protein:vir:69 294 ILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAV---------- 363 (522) T ss_pred HHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHH---------- Confidence 899999999999999888889999998765333 1 13355555555555555544221 Q ss_pred HHHHHHHHhcCcccccCCcccc-ccccccc-ceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccccee Q lcl|NC_018861. 344 PKVATILDEIGSFVLSPAGSKI-DAINSGI-KPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQ 421 (465) Q Consensus 344 ~~va~~L~~~~~~~~~~~~~~~-~~~~~~~-~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~ 421 (465) .+.+...-. .+++. .+.+... ...+|++ +|.+.- -+-.|+ ..+...++||. -.-+-++ T Consensus 364 ----~i~~~T~rg----~~n~~i~S~~Va~~L~~~~~~-----~~~~~~----~~~~g~--~~d~~~~~~~G-~l~~~~~ 423 (522) T protein:vir:69 364 ----EIARQTGRG----EGNFIIASRNVVNVLASVDTG-----ISYAAQ----GLASGF--NTDTTKSVFAG-VLGGKYR 423 (522) T ss_pred ----HHHHhcccc----cccEEEEchhHHHHHhhcccc-----cccccc----cccccc--cccCCCceEEE-EecCceE Confidence 122211111 01110 0000000 0011221 110000 000111 11334556664 2234566 Q ss_pred eeeCCCcccceeeeeee------eeeeecCcccccccceEE-Eee-------ccceeC Q lcl|NC_018861. 
422 NLTDPVSGQPAMILNNR------YDVVATPLHPEAFIRTFA-VNL-------NNYIIS 465 (465) Q Consensus 422 ~~~dp~s~qp~~~~~tR------Y~l~~nPf~~~~~~~~f~-~~~-------~~~~~~ 465 (465) +.+||.+-+..+-+=.+ =||.=+||.|.-..+.-- .++ +=|=|+ T Consensus 424 vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 481 (522) T protein:vir:69 424 VYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG 481 (522) T ss_pred EEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeecee Confidence 67788776665433222 134456777654443210 111 111111 No 22 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=98.31 E-value=3.1e-08 Score=61.77 Aligned_cols=375 Identities=14% Similarity=0.100 Sum_probs=140.5 Q ss_pred hhhccccccC-hhhhhheehccccchhHHHhhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEE Q lcl|NC_018861. 16 ITSNLYPNLN-ESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAM 94 (465) Q Consensus 16 ~~~~~~~~~~-~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAM 94 (465) -...+.|.|. +|. -|||..+. -.|..++-.+|+ +.+-| |-+---..- -+ T Consensus 1 ~~~~~~~~l~~kw~-----p~l~~~~~-----------~~i~~~~~~~~a---~~~en---------q~~~~~~~~--~~ 50 (521) T protein:vir:72 1 MTIKTKAELLNKWK-----PLLEGEGL-----------PEIANSKQAIIA---KIFEN---------QEKDFQTAP--EY 50 (521) T ss_pred CCcchhHHHHHhhh-----hhhccCCC-----------Cccccchhhhhh---hhhhh---------hhhhhhhcc--cc Confidence 1111222221 122 23333221 111111111111 11111 110000000 00 Q ss_pred EE--------EecCC---CCcccccccccccCccccccccccccccccccccccc-cccccccccccccccccccchhhh Q lcl|NC_018861. 95 VP--------HYVGD---GNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF-KTATTVKGKIVYSEKQAGTDNIVN 162 (465) Q Consensus 95 RS--------rY~~~---~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~-~tatt~ggait~~~~~TGPTgLif 162 (465) |. .+... 
++++.....+- .+.-...+..++...--++.- .+..-..-++|++|||+||||||| T Consensus 51 ~~~~~~~~~~~~l~e~~~~~~~~~~~~~i-----aes~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIF 125 (521) T protein:vir:72 51 KDEKIAQAFGSFLTEAEIGGDHGYNATNI-----AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVF 125 (521) T ss_pred cchHHHHHHhhhhhhhcccCccccCcccc-----cccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhhe Confidence 00 00000 00000000000 000000111111111111111 244456788999999999999999 Q ss_pred heeeeeccCc----ccccccccc--ccccccccCCccCCCcccccCcccccc-----------c-----cc-cccccchh Q lcl|NC_018861. 163 VLLRLESNST----GSVAIGDEM--DKAATFATKKATVEAVYTNEALWLKVL-----------K-----NY-TGPYATAA 219 (465) Q Consensus 163 am~s~y~~~~----g~ea~~~e~--~t~~s~~~~~~~~~~~~~~~a~~~~~~-----------~-----~~-~~~~~Ta~ 219 (465) |||++|.++. |+|+++++. ++.||+..............+...+.. + .. ..+-.+ . T Consensus 126 AMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t-~ 204 (521) T protein:vir:72 126 ALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGAT-D 204 (521) T ss_pred eeeeeecCCCCCcccccccchhcccccccccccccccccccccccccccccccccccccccccccccccccccCCCCC-C Confidence 9999999875 678999874 566776544322111111110000000 0 00 000000 0 Q ss_pred hhccCC----chhh-cceEEEEEEEEeecceec----------cc--chHH-----H-HHHHHhhhCCCHHHHHHHH--- Q lcl|NC_018861. 220 GEKLGK----DMKE-MGISVQRVLAEAKTRKVK----------GT--YTIE-----M-LQDLKAQHGINAEKELADI--- 273 (465) Q Consensus 220 ~E~lg~----~f~E-M~FsIeK~tVtAKSRaLK----------AE--YT~E-----L-AQDLkAiHGlDAe~EL~ni--- 273 (465) .+.++. .+.. -.+.+-+.-.|+.--+|. +| ++|| . 
-.=|||..-+..-+-|..| T Consensus 205 ~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGL 284 (521) T protein:vir:72 205 AAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGM 284 (521) T ss_pred ccccccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCC Confidence 000000 0000 011122222222222211 01 1111 0 1346776666666666554 Q ss_pred -----HHHHHHHHhhHHHHhhhhheeeeeeeeeeccCC-------cccHH------HHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_018861. 274 -----LSAEVALEIDRTIIEKANEVATVCTDFDVNSAD-------GRWFI------EKARGLSMRISNEAREIGRQTRKG 335 (465) Q Consensus 274 -----LstEImlEINreii~~l~~~at~~~~~~~~~~~-------~~~~~------e~~~~L~~~i~~~a~~i~~~T~~~ 335 (465) |++-+..||=.||=|-+...+++++++.+++.. |..-. +.+|-...+...+--+|.+.-. T Consensus 285 DAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an-- 362 (521) T protein:vir:72 285 DADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAV-- 362 (521) T ss_pred ChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHH-- Confidence 788888888888888888778888888887663 32211 2344455554444555444221 Q ss_pred cccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC--------CcceEEE-EE-ec-CC Q lcl|NC_018861. 336 GGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA--------EFDYCTV-AY-KG-AS 404 (465) Q Consensus 336 ~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~--------~~dy~~v-g~-kg-~~ 404 (465) .+.+...-. -|+ .|.|-+.- +.||--- |. -| .. T Consensus 363 ------------~i~~~T~r~-------------------~~n-----~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~ 406 (521) T protein:vir:72 363 ------------EIARQTGRG-------------------EGN-----FIIASRNVVNVLASVDTGISYAAQGLATGFST 406 (521) T ss_pred ------------HHHHhcccc-------------------cce-----EEEEchHHHHHHhhcccccccccccccccccc Confidence 222222211 111 22222220 1111000 00 01 11 Q ss_pred CccceeEEecccccceeeeeCCCcccceeeeeee------eeeeecCcccccccceEE-Eee-------ccceeC Q lcl|NC_018861. 
405 NFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNR------YDVVATPLHPEAFIRTFA-VNL-------NNYIIS 465 (465) Q Consensus 405 ~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tR------Y~l~~nPf~~~~~~~~f~-~~~-------~~~~~~ 465 (465) +....+||. -.-+-+++.+||.+-+..+-+=.+ =||.=+||.|.-..+.-- .++ +=|=|+ T Consensus 407 d~~~~~~~G-~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 480 (521) T protein:vir:72 407 DTTKSVFAG-VLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG 480 (521) T ss_pred cCCCceEEE-EccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeecee Confidence 222334433 233556667777776665433222 134456776644433110 111 111111 No 23 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=98.21 E-value=7.8e-11 Score=76.01 Aligned_cols=387 Identities=15% Similarity=0.064 Sum_probs=125.7 Q ss_pred ehccccchhHHHhh-hhhhhccccccccchhhhhhhhhhhhhh----hhhheeeeccCCCcceEEEEEEEecCCCCcccc Q lcl|NC_018861. 33 RTVLENQGNEVKML-MESTVTGDIAKFTPILVPVIRRALPSLI----GTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVS 107 (465) Q Consensus 33 ~~l~~n~~~~~~~i-~est~t~~v~~~~P~l~~l~~ra~~~lI----~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~ 107 (465) -.|+| .|..+| .|+.. -|....+-||.+---| ..|+- --||---+-++=++-+. ..+.. T Consensus 1 ~~l~~---kw~p~l~~~~~~-------~~~i~~~~~~~~~~~l~enq~~~~~-~~~~~~~~~~~~~~~~~-----l~e~~ 64 (514) T protein:vir:56 1 MNLTE---KWKDLLEAEGAD-------MPEIATATKQKIMSKIFENQDRDIN-NDPMYRDPQLVEAFNAG-----LNEAV 64 (514) T ss_pred Cchhh---hhhHHhcccccc-------cccccchhhhhhhhhhhhhHHHHHh-cCCcccchhhhhhhhcc-----ccccc Confidence 11111 122222 11100 0222233344332211 11110 00110000000011000 00000 Q ss_pred cccccccCcccccccccccccc---cccccccc--c-cccccccccccccccccccchhhhheeeeeccC--ccccccc- Q lcl|NC_018861. 
108 PTKNAIVLKLKTESANKDDFNY---TGTPIEVS--F-KTATTVKGKIVYSEKQAGTDNIVNVLLRLESNS--TGSVAIG- 178 (465) Q Consensus 108 ~~~~aaf~~~~~a~~~ea~~~~---Sg~~~~~s--~-~tatt~ggait~~~~~TGPTgLifam~s~y~~~--~g~ea~~- 178 (465) ....|+.......++.+.. ..++.-.+ . .+..-...++|++|||+||||||||||++|.++ +++|||| T Consensus 65 ---~~~~~~~~~~~ia~s~~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~ 141 (514) T protein:vir:56 65 ---VNGDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHP 141 (514) T ss_pred ---ccccccccccccccccccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCccccccccc Confidence 0001111111111111100 00111111 1 233445778999999999999999999999987 6889999 Q ss_pred -cccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceE-EEEEEEEeecceecccchHHHHH Q lcl|NC_018861. 179 -DEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGIS-VQRVLAEAKTRKVKGTYTIEMLQ 256 (465) Q Consensus 179 -~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~Fs-IeK~tVtAKSRaLKAEYT~ELAQ 256 (465) +|+++.||+.....+........+...+...... . +. ..|+.+.. .|. +..+....-....-.+|+--++. T Consensus 142 ~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~--~-t~---~~gd~~~~-~~~~~~~~~~~~~~~~~~t~~~~~~a~ 214 (514) T protein:vir:56 142 TRQADASFSGQAAASTIADFPTTGAATDGTPYKAE--V-TT---SGGDVSMR-YFLALGAVTLAVAGQMTATEYTDGVAG 214 (514) T ss_pred ccccCcCcccccccccccccccccccccccccccc--c-cc---cccccccc-ccccccccccccccccccccccccccc Confidence 9999999986554332221111111111000000 0 00 00000000 000 00000000000000011100000 Q ss_pred HHHhh--hCC-CHHHHHHH-------HHHHHHHHHhhHHHHhhhhheeee-eeeeeeccCCcccHHHHHHHHHH------ Q lcl|NC_018861. 257 DLKAQ--HGI-NAEKELAD-------ILSAEVALEIDRTIIEKANEVATV-CTDFDVNSADGRWFIEKARGLSM------ 319 (465) Q Consensus 257 DLkAi--HGl-DAe~EL~n-------iLstEImlEINreii~~l~~~at~-~~~~~~~~~~~~~~~e~~~~L~~------ 319 (465) -.-.. .|+ -+..|+.. --=.||.--|+|-- ++. ++.+ ...|.+|.+++|-. 
T Consensus 215 ~~~y~~~~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~t-------VtAKSRaL-----KAEYTiELAQDLKAVHGLDA 282 (514) T protein:vir:56 215 GLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQV-------VEAKSRQL-----KAQYSIELAQDLRAVHGLDA 282 (514) T ss_pred chhhhhhhhhhhhhhhhcccCCCCcccccceeeeEEEEEE-------Eeeeccce-----eccccHHHHHHHHHhcCCCh Confidence 00000 011 11122100 00123333333311 111 2222 23444555554433 Q ss_pred -----------HHHHHHHHHH---HhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEE-------- Q lcl|NC_018861. 320 -----------RISNEAREIG---RQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVG-------- 377 (465) Q Consensus 320 -----------~i~~~a~~i~---~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G-------- 377 (465) .+.|.+.||- +.+..-+.+| .++. +...|.||++...+...... ....|-+ T Consensus 283 EtELsNILSTEImlEINReii~~l~~~atv~~~~-----~~~~-~~~~G~~d~~~~~d~~~~~~-~~e~~~~l~~~i~~~ 355 (514) T protein:vir:56 283 DAELSGILANEVMVELNREIVNLVNSQAQIGKSG-----WTQG-AGAAGVFDFSDAVDVKGARW-AGEAYKALLIQIEKE 355 (514) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcc-----cccc-cccccccccccccccccchH-HHHHHHHHHHHHHHH Confidence 1222333331 1111111122 1111 23356676664332111100 0000000 Q ss_pred -------Eec-CceEEEEeCCC----CcceEEEEEec--------CCCccceeEEecccccceeeeeCCCcccceeeeee Q lcl|NC_018861. 378 -------KFD-NRYDVIVDNFA----EFDYCTVAYKG--------ASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNN 437 (465) Q Consensus 378 -------~l~-~~~~vy~d~~~----~~dy~~vg~kg--------~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~t 437 (465) +.. .+=.|.|-+.- ..=.++++..+ ..+.+..+|+. -.-+-+++.+||.+-+..+-+=. T Consensus 356 an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG-~l~~~~~vy~D~y~~~dy~~vG~ 434 (514) T protein:vir:56 356 ANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAG-VLGGRFKVYIDQYAVNDYFTVGF 434 (514) T ss_pred HHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceEEE-EecCceEEEecCCCCcceEEEEE Confidence 000 12245565551 11222333322 22233456642 22356777888888777643333 Q ss_pred e------eeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 
438 R------YDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 438 R------Y~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) + =||.=+||.|.-..+.-- .++.=.|-- T Consensus 435 KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~ 469 (514) T protein:vir:56 435 KGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGF 469 (514) T ss_pred ecCcceecceeeccccccccccccCCccccceeee Confidence 3 133446776653332210 111111111 No 24 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=98.19 E-value=5.4e-08 Score=60.41 Aligned_cols=385 Identities=12% Similarity=0.045 Sum_probs=159.2 Q ss_pred hheehccccchhHHHhhh-hhhhccccccccchhhhhhhhhh-----hh----------hhhhhheeeeccCCCcceEEE Q lcl|NC_018861. 30 NIMRTVLENQGNEVKMLM-ESTVTGDIAKFTPILVPVIRRAL-----PS----------LIGTEIAGVQALKTPTAYLYA 93 (465) Q Consensus 30 ~~~~~l~~n~~~~~~~i~-est~t~~v~~~~P~l~~l~~ra~-----~~----------lI~~DIwGVQPMTgPTGLIFA 93 (465) -+.-.|+ ..|.-+|. |+ -|.....-||.+ -| -||.|=.-|+-|.+=-+++=. T Consensus 1 ~~~~~l~---~kw~p~l~~~~---------~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 68 (534) T protein:vir:10 1 MSKKSLL---KKWQPLVESEG---------MPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEE 68 (534) T ss_pred CchhHHH---HHhHHhhcCCc---------cccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhh Confidence 0000111 11222221 11 122222323322 11 233333333333322111110 Q ss_pred EEEEecCCCCcccccccccccCccccccccccccccccccccc--cc-cccccccccccccccccccchhhhheeeeecc Q lcl|NC_018861. 94 MVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV--SF-KTATTVKGKIVYSEKQAGTDNIVNVLLRLESN 170 (465) Q Consensus 94 MRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~--s~-~tatt~ggait~~~~~TGPTgLifam~s~y~~ 170 (465) -| ..+...++...+.+..-+....++......+.-. 
.- .+..-..-++|++|||+||||||||||++|.+ T Consensus 69 ~~-------l~ea~~~~~~g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n 141 (534) T protein:vir:10 69 AR-------LAEANIGGDHGYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGG 141 (534) T ss_pred cc-------ccccccccccccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecC Confidence 00 0000000111111000000000111111111111 11 23444578899999999999999999999987 Q ss_pred Cc----cccccccc--cccccccccCCccCCCcccccCcccc---------------ccccc----cccc--cchhhhcc Q lcl|NC_018861. 171 ST----GSVAIGDE--MDKAATFATKKATVEAVYTNEALWLK---------------VLKNY----TGPY--ATAAGEKL 223 (465) Q Consensus 171 ~~----g~ea~~~e--~~t~~s~~~~~~~~~~~~~~~a~~~~---------------~~~~~----~~~~--~Ta~~E~l 223 (465) +. +.|+||+| +++.|++..............+...+ +.... .... .+...+.. T Consensus 142 ~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~a 221 (534) T protein:vir:10 142 NSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEA 221 (534) T ss_pred CCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccc Confidence 75 57999999 88999876543221111100000000 00000 0000 00000000 Q ss_pred CCc----h-hhcceEEEEEEEEeecceec----------c--cchHH-----H-HHHHHhhhCCCHHHHHHHH------- Q lcl|NC_018861. 224 GKD----M-KEMGISVQRVLAEAKTRKVK----------G--TYTIE-----M-LQDLKAQHGINAEKELADI------- 273 (465) Q Consensus 224 g~~----f-~EM~FsIeK~tVtAKSRaLK----------A--EYT~E-----L-AQDLkAiHGlDAe~EL~ni------- 273 (465) |.. + +--.+.+..--.|+.--+|. + .++|| . 
..=|||..-+..-+-|..| T Consensus 222 g~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEt 301 (534) T protein:vir:10 222 GLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADS 301 (534) T ss_pred ccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHH Confidence 000 0 00011111222222222221 0 11111 1 2357887777777777665 Q ss_pred -HHHHHHHHhhHHHHhhhhh-eeeeeeeeeeccCCc------ccH------HHHHHHHHHHHHHHHHHHHHhcccccccE Q lcl|NC_018861. 274 -LSAEVALEIDRTIIEKANE-VATVCTDFDVNSADG------RWF------IEKARGLSMRISNEAREIGRQTRKGGGNK 339 (465) Q Consensus 274 -LstEImlEINreii~~l~~-~at~~~~~~~~~~~~------~~~------~e~~~~L~~~i~~~a~~i~~~T~~~~~~~ 339 (465) |++-++.||=.||=|-+.. ..++.++|++....+ .+. .+.+|-+..+.+.+..+|.+...+ T Consensus 302 ELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~----- 376 (534) T protein:vir:10 302 ELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANE----- 376 (534) T ss_pred HHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHH----- Confidence 8999999999999998765 489999999999643 221 256888889999999999886542 Q ss_pred EEecHHHHHHHHhcCcccccCCccc-cccccccc-ceEEEEecCceEEEEeCCCCcceEEEEEe--cCCCccceeEEecc Q lcl|NC_018861. 340 LIVSPKVATILDEIGSFVLSPAGSK-IDAINSGI-KPNVGKFDNRYDVIVDNFAEFDYCTVAYK--GASNFDAGIFFAPY 415 (465) Q Consensus 340 ~~~s~~va~~L~~~~~~~~~~~~~~-~~~~~~~~-~~~~G~l~~~~~vy~d~~~~~dy~~vg~k--g~~~~d~glfy~PY 415 (465) +.+...-+. +++ +.+.+.-. ...+|.|. + .+. .|.- -..+...++|+. - T Consensus 377 ---------i~~~T~rg~----~n~~v~S~~Va~~L~~~g~l~--~-------~~~----~~~~~~~~~d~~~~~~~G-~ 429 (534) T protein:vir:10 377 ---------IARQTGRGQ----GNFIICSRNVAAALGHTDMLM--T-------PAV----MGANTTMNTDTTSSLFAG-V 429 (534) T ss_pred ---------HHHhhcccc----ccEEEEchhHHHHHhhccchh--c-------ccc----ccccccccccCCCceEEE-E Confidence 222222111 110 00000000 01122220 0 000 0110 011222333433 2 Q ss_pred cccceeeeeCCCcccceeeeeee------eeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 
416 NITLQQNLTDPVSGQPAMILNNR------YDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 416 ~~~~~~~~~dp~s~qp~~~~~tR------Y~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) .-+-+++.+||.+-+..+-+=.+ =||.=+||.|....+.-- .++.=.|-- T Consensus 430 l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~ 486 (534) T protein:vir:10 430 LAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNFQPVLGF 486 (534) T ss_pred ecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeee Confidence 22445666677766665433222 134556777655544211 111111111 No 25 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=98.17 E-value=3.7e-08 Score=61.30 Aligned_cols=352 Identities=13% Similarity=0.074 Sum_probs=134.0 Q ss_pred hccccccCh-hhhhheehccccchhHHHhhhhhhhccccccccchhhhhhhhhhhhhh----hhhhe------eeeccC- Q lcl|NC_018861. 18 SNLYPNLNE-SEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILVPVIRRALPSLI----GTEIA------GVQALK- 85 (465) Q Consensus 18 ~~~~~~~~~-~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI----~~DIw------GVQPMT- 85 (465) ..+.|.|.| |. -|||..+. |....+-||.+.--| ..||- .-+|+. T Consensus 1 ~~~~e~l~~kW~-----plLe~~~~------------------~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~ 57 (468) T protein:vir:10 1 MFNAEHLQEKWS-----PVLNHGEA------------------PAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNS 57 (468) T ss_pred CcchHHHHHhhh-----HhhcCCcc------------------chhccchhhhhhhhhhhhHHHHHhccccccchhhHhh Confidence 333344422 32 24444321 111111122211000 00100 001110 Q ss_pred -CCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc-cccccccccccccccccccchhhhh Q lcl|NC_018861. 86 -TPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF-KTATTVKGKIVYSEKQAGTDNIVNV 163 (465) Q Consensus 86 -gPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~-~tatt~ggait~~~~~TGPTgLifa 163 (465) ++ ..+... +... 
+.-...+..++...--++.- .+..-...++|++||++|||||||| T Consensus 58 ~~~--------~~~~~~---------n~~~----~~~~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFA 116 (468) T protein:vir:10 58 LGA--------GTIAPA---------GSAL----GSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFA 116 (468) T ss_pred cCC--------cccchh---------hhhh----hhcccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeE Confidence 00 000000 0000 00000011111010111111 2334467789999999999999999 Q ss_pred eeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccc-ccchhhhccCCchhhcceEEEEEEEEee Q lcl|NC_018861. 164 LLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGP-YATAAGEKLGKDMKEMGISVQRVLAEAK 242 (465) Q Consensus 164 m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~-~~Ta~~E~lg~~f~EM~FsIeK~tVtAK 242 (465) ||++|.++.|+|+||+|+++.|++..................+.......+ ...+.+. . +.+-.--.||. T Consensus 117 mRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~----~-----~~~g~gMsTa~ 187 (468) T protein:vir:10 117 MRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPG----T-----YEVGSKMPRED 187 (468) T ss_pred EEEEecCCCCccceeccccccccccccccccccccccccccccCCCCCccccccccccc----c-----cccccccchHH Confidence 999999999999999999999998654332111100000000000000000 0000000 0 00001111111 Q ss_pred cceec--------ccc-----hHHH-HHHHHhhhCCCHHHHHHH--------HHHHHHHHHhhHHHHhhhhh-eeeeeee Q lcl|NC_018861. 243 TRKVK--------GTY-----TIEM-LQDLKAQHGINAEKELAD--------ILSAEVALEIDRTIIEKANE-VATVCTD 299 (465) Q Consensus 243 SRaLK--------AEY-----T~EL-AQDLkAiHGlDAe~EL~n--------iLstEImlEINreii~~l~~-~at~~~~ 299 (465) -.+|= ..+ |++. -.=|||..-+..-+-|.. .|++-+..||=.||=|-+.. 
..++.++ T Consensus 188 aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~ 267 (468) T protein:vir:10 188 LERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKK 267 (468) T ss_pred HhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhh Confidence 11110 011 1111 235888777766666655 58999999999999988765 4899999 Q ss_pred eeeccCCcccHHH--------HHHHHHHHH-HHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccccc Q lcl|NC_018861. 300 FDVNSADGRWFIE--------KARGLSMRI-SNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINS 370 (465) Q Consensus 300 ~~~~~~~~~~~~e--------~~~~L~~~i-~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~ 370 (465) |++.+.+..+..+ |..+.++-+ -++..++-+.=++ .... T Consensus 268 ~k~~g~~~~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~------------------T~rg-------------- 315 (468) T protein:vir:10 268 GAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQE------------------TRRG-------------- 315 (468) T ss_pred eecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHh------------------hccc-------------- Confidence 9999876655443 344444322 3333332221111 1111 Q ss_pred ccceEEEEecCceEEEEeCCC-----C---cceE--EEEEecCC--Ccc--ceeEEecccccceeeeeCCC--c--ccc- Q lcl|NC_018861. 371 GIKPNVGKFDNRYDVIVDNFA-----E---FDYC--TVAYKGAS--NFD--AGIFFAPYNITLQQNLTDPV--S--GQP- 431 (465) Q Consensus 371 ~~~~~~G~l~~~~~vy~d~~~-----~---~dy~--~vg~kg~~--~~d--~glfy~PY~~~~~~~~~dp~--s--~qp- 431 (465) -|+ .|.|-+.- - .||- .=++.+.+ +.| +.+||. -.-+-+++.+||- + =+. T Consensus 316 -----~gn-----~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G-~l~~r~~vy~D~Ya~~~s~~dY 384 (468) T protein:vir:10 316 -----KGN-----FLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVG-TINGRIKVFVDPYAANLSDKHY 384 (468) T ss_pred -----ccc-----EEEechhHHHHHhhcCcceecccccccccccccccccCcceEEE-EecCceEEEEccccccCCccce Confidence 011 12222220 0 0000 00111111 111 122222 2223344444432 1 111 Q ss_pred -eeeeeee----eeeeecCcccc--------------cccce--------EEE---eeccceeC Q lcl|NC_018861. 
432 -AMILNNR----YDVVATPLHPE--------------AFIRT--------FAV---NLNNYIIS 465 (465) Q Consensus 432 -~~~~~tR----Y~l~~nPf~~~--------------~~~~~--------f~~---~~~~~~~~ 465 (465) .+|+|== =||.=+||.|. ...+| |++ +.+|.+-. T Consensus 385 ~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~g~~~~ 448 (468) T protein:vir:10 385 YVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDG 448 (468) T ss_pred EEEEEecCcceeceeeeccccccccccccCCCcccceeeeeeeeceeecccceeccccCCCccc Confidence 1222200 02233444432 21111 110 00111110 No 26 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=98.13 E-value=3.6e-07 Score=55.90 Aligned_cols=345 Identities=13% Similarity=0.060 Sum_probs=142.8 Q ss_pred hccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhhhhhhhhhhhhhh----hhhe------eeeccCCC Q lcl|NC_018861. 18 SNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILVPVIRRALPSLIG----TEIA------GVQALKTP 87 (465) Q Consensus 18 ~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~----~DIw------GVQPMTgP 87 (465) ..+ +.|.+ --+-|||..+. |.....-||.+.--|. .||. +-. .|| T Consensus 1 m~~-~~l~~----~w~~~l~~~~~------------------~~i~~~~~~~~~~~~lenq~~~~~~~~~~l~ea--~~~ 55 (457) T protein:vir:10 1 MSF-QNLQE----KWAPVLEHDSL------------------PEIGDSYKKGVVAQLLENQEKAIAEEGKILTET--LQT 55 (457) T ss_pred Cch-HHHHH----HhhHhhccCcc------------------chhhhhHHHHHHHHHhhhHHHHHHhcccccccc--ccc Confidence 000 11111 01223333321 1112221222211111 1211 000 022 Q ss_pred cceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc-cccccccccccccccccccchhhhheee Q lcl|NC_018861. 88 TAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF-KTATTVKGKIVYSEKQAGTDNIVNVLLR 166 (465) Q Consensus 88 TGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~-~tatt~ggait~~~~~TGPTgLifam~s 166 (465) +|.+ +.+ . 
...+..++...--++.- .+..-...++|++||++||||||||||+ T Consensus 56 ~g~~--------~~s----------------~--~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRs 109 (457) T protein:vir:10 56 TGYT--------GGD----------------T--VTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRT 109 (457) T ss_pred cCCC--------ccc----------------c--cccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeee Confidence 2110 000 0 00011111111111122 2334467889999999999999999999 Q ss_pred eeccCcc------ccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEE Q lcl|NC_018861. 167 LESNSTG------SVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAE 240 (465) Q Consensus 167 ~y~~~~g------~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVt 240 (465) +|.++.+ +|+||+|+++.|++............. ....+..........+...+. +.+-+---| T Consensus 110 rY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~~~~~-~~~~gt~~~~~~~~~~~~~~~---------~~~~~gmsT 179 (457) T protein:vir:10 110 NYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGATGVT-NDAEGTNPALLNDSPAGTYEQ---------ADDATGMST 179 (457) T ss_pred eecCccccccccccceeeeccCcccCcccccccccccccc-cccccccccccCccccccccc---------cccccchhh Confidence 9999877 799999999999986654432221110 000000000000000000000 001111112 Q ss_pred eecceec--------cc--chHH-----H-HHHHHhhhCCCHHHHHHH--------HHHHHHHHHhhHHHHhhhhhe-ee Q lcl|NC_018861. 241 AKTRKVK--------GT--YTIE-----M-LQDLKAQHGINAEKELAD--------ILSAEVALEIDRTIIEKANEV-AT 295 (465) Q Consensus 241 AKSRaLK--------AE--YT~E-----L-AQDLkAiHGlDAe~EL~n--------iLstEImlEINreii~~l~~~-at 295 (465) |.-++|- +| ++|| . -.=|||..-+..-+-|.. .|++-+.-||=.||=|-+... 
.+ T Consensus 180 A~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~ 259 (457) T protein:vir:10 180 ATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYT 259 (457) T ss_pred hhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhh Confidence 2222221 11 1111 1 235888777766666655 589999999999999987654 89 Q ss_pred eeeeeeeccCCcccHHH--------HHHHHHHHH-HHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccc Q lcl|NC_018861. 296 VCTDFDVNSADGRWFIE--------KARGLSMRI-SNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKID 366 (465) Q Consensus 296 ~~~~~~~~~~~~~~~~e--------~~~~L~~~i-~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~ 366 (465) +.++|++.+....|..+ |..+..+-+ -++..++-+.=.+ ... T Consensus 260 ~a~~~~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~------------------T~r----------- 310 (457) T protein:vir:10 260 NAVAGAQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQ------------------TRR----------- 310 (457) T ss_pred hheeeeccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHh------------------hcc----------- Confidence 99999999887766543 333333322 3333332221111 111 Q ss_pred ccccccceEEEEecCceEEEEeCCC-----CcceE-----EEEEecCCCc-cceeEEecccccceeeeeCC----Ccccc Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFA-----EFDYC-----TVAYKGASNF-DAGIFFAPYNITLQQNLTDP----VSGQP 431 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~-----~~dy~-----~vg~kg~~~~-d~glfy~PY~~~~~~~~~dp----~s~qp 431 (465) +-|+ .|.|-+.- -.+++ .-|.-+.++. |-+-.|+=-.-+-+++.+|| +|=++ T Consensus 311 --------g~gn-----~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~d 377 (457) T protein:vir:10 311 --------GKGN-----ILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKH 377 (457) T ss_pred --------ccce-----EEEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccc Confidence 1122 23333331 01110 0011111221 12222221112334445554 22222 Q ss_pred e--eeeeee----eeeeecCcccccccceEE-Eeeccce-------eC Q lcl|NC_018861. 
432 A--MILNNR----YDVVATPLHPEAFIRTFA-VNLNNYI-------IS 465 (465) Q Consensus 432 ~--~~~~tR----Y~l~~nPf~~~~~~~~f~-~~~~~~~-------~~ 465 (465) . +|+|== =||.=+||.|.-..+..- .++.=.| |+ T Consensus 378 y~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 425 (457) T protein:vir:10 378 FYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGMV 425 (457) T ss_pred eEEEEEeCCcceecceeecccccccccCccCCccccceeeeeeeeeee Confidence 2 233311 134456777766655432 2211111 11 No 27 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=98.09 E-value=8.4e-08 Score=59.37 Aligned_cols=375 Identities=16% Similarity=0.126 Sum_probs=152.4 Q ss_pred hccccccCh-hhhhheehccccchh--H-HHhhhhhhhccccccccchhhhh-----hhhhhhhhhhhhheeeeccCCCc Q lcl|NC_018861. 18 SNLYPNLNE-SEKNIMRTVLENQGN--E-VKMLMESTVTGDIAKFTPILVPV-----IRRALPSLIGTEIAGVQALKTPT 88 (465) Q Consensus 18 ~~~~~~~~~-~~~~~~~~l~~n~~~--~-~~~i~est~t~~v~~~~P~l~~l-----~~ra~~~lI~~DIwGVQPMTgPT 88 (465) ..+.+.|.| | +-|||++.- | +..-++ .+++- -|-.-.+++|.|=.-|+-+. T Consensus 1 ~~~~~~l~~kw-----~p~l~~~~~~~~i~~~~~~------------~~~a~llenq~~~~~~~~~~~~~~~~~~~~--- 60 (524) T protein:vir:98 1 MSKKNELMEKW-----NDLLESQEGLPDIATKSKK------------QLVAAILEAQEKDAETDPVYRDEKIVESFG--- 60 (524) T ss_pred CcchHHHHHHh-----HHHhcCCcCcchhcchhhH------------HHHHHHHhhHHHHHhcCccccchHHHHhhh--- Confidence 222233321 3 235555420 0 000000 00000 02333444555555554441 Q ss_pred ceEEEEEEEecCCCCcccccccccccCccccccccc-----cccccccccccccc-cccccccccccccccccccchhhh Q lcl|NC_018861. 89 AYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK-----DDFNYTGTPIEVSF-KTATTVKGKIVYSEKQAGTDNIVN 162 (465) Q Consensus 89 GLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e-----a~~~~Sg~~~~~s~-~tatt~ggait~~~~~TGPTgLif 162 (465) .+..... 
....|+.......+ +..++...--++.- .+..-..-++|++|||+||||||| T Consensus 61 --------~~l~ea~-------~~~~~~~~~~~i~~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIF 125 (524) T protein:vir:98 61 --------GFLAEAE-------IAGDHNYDQTNIASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVF 125 (524) T ss_pred --------ccccccc-------cccccccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhh Confidence 1111110 00011111111111 11111111111111 244456788999999999999999 Q ss_pred heeeeeccC---cccccccccc-------ccccccccCCccCCCcccccCcccccc-------------ccccccccchh Q lcl|NC_018861. 163 VLLRLESNS---TGSVAIGDEM-------DKAATFATKKATVEAVYTNEALWLKVL-------------KNYTGPYATAA 219 (465) Q Consensus 163 am~s~y~~~---~g~ea~~~e~-------~t~~s~~~~~~~~~~~~~~~a~~~~~~-------------~~~~~~~~Ta~ 219 (465) |||++|.++ .|+|++|+|+ ++.||+..............+...+.. .+...+..+.. T Consensus 126 AmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~t 205 (524) T protein:vir:98 126 ALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVT 205 (524) T ss_pred hhheeecCCCCCcccccccccccccccccccccCCccccccccccccccccccccccccccccccceeccccccCccccc Confidence 999999987 4679999997 677776543322111111111110000 00000000000 Q ss_pred h--hc------cC----CchhhcceEEEEEEEEeecceec----------cc--chHH-----H-HHHHHhhhCCCHHHH Q lcl|NC_018861. 220 G--EK------LG----KDMKEMGISVQRVLAEAKTRKVK----------GT--YTIE-----M-LQDLKAQHGINAEKE 269 (465) Q Consensus 220 ~--E~------lg----~~f~EM~FsIeK~tVtAKSRaLK----------AE--YT~E-----L-AQDLkAiHGlDAe~E 269 (465) + +. ++ ....+.++-++ ||+--+|. +| ++|| . 
-.=|||..-+..-+- T Consensus 206 gt~p~~~~~a~~~~~~~g~~~~~~~Gms----TA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQD 281 (524) T protein:vir:98 206 GADPAALDAAVIAENEKGTLAEISVGMA----TSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQD 281 (524) T ss_pred ccccccccccccccccccceeecccccc----hhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHH Confidence 0 00 00 00111112221 22211220 11 1111 1 245888877777777 Q ss_pred HHHH--------HHHHHHHHhhHHHHhhhhheeeeeeeeeeccCC-------cccHH------HHHHHHHHHHHHHHHHH Q lcl|NC_018861. 270 LADI--------LSAEVALEIDRTIIEKANEVATVCTDFDVNSAD-------GRWFI------EKARGLSMRISNEAREI 328 (465) Q Consensus 270 L~ni--------LstEImlEINreii~~l~~~at~~~~~~~~~~~-------~~~~~------e~~~~L~~~i~~~a~~i 328 (465) |..| |++-++.||=.||=|-+...+++.+++++++.+ |.+.. +..|-...+...+..+| T Consensus 282 LKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i 361 (524) T protein:vir:98 282 LRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQI 361 (524) T ss_pred HHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHH Confidence 7665 899999999999999999889999999998632 32221 12333333333333333 Q ss_pred HHhcccccccEEEecHHHHHHHHhcC----cccccCCc--ccccccccccceEEEEecCceEEEEeCCCCcceEEEEEec Q lcl|NC_018861. 329 GRQTRKGGGNKLIVSPKVATILDEIG----SFVLSPAG--SKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKG 402 (465) Q Consensus 329 ~~~T~~~~~~~~~~s~~va~~L~~~~----~~~~~~~~--~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg 402 (465) .+.-. .+.+... .|.-.+.. ......+ .|.| ++++++. -+. T Consensus 362 ~~~an--------------~I~~~T~rg~~n~~i~S~~Va~~L~~~~------~g~~---------~~s~~~~----~~~ 408 (524) T protein:vir:98 362 DKEAN--------------EIARQTGRGAGNFIIASRNVVSALARID------SGIT---------PASQGLQ----KTL 408 (524) T ss_pred HHHHH--------------HHHHhhccccccEEEEchHHHHHHhhhh------cccc---------cccchhh----ccc Confidence 33211 1222111 11100000 0000000 0111 1111110 011 Q ss_pred CCCccceeEEecccccceeeeeCCCcccceeeeeee------eeeeecCcccccccc----------------------e Q lcl|NC_018861. 
403 ASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNR------YDVVATPLHPEAFIR----------------------T 454 (465) Q Consensus 403 ~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tR------Y~l~~nPf~~~~~~~----------------------~ 454 (465) ..+.-..+||. ..-+-+++.+||.+-+..+-+=.+ =||.=+||.|.-..+ - T Consensus 409 ~~d~~~~~~~G-~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP 487 (524) T protein:vir:98 409 NVDTTKAVFAG-VLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGINP 487 (524) T ss_pred ccCCccceEEE-EecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeeeeeeceeecC Confidence 11222445554 444556666677665554322222 123345555433322 2 Q ss_pred EEEeecc----ceeC Q lcl|NC_018861. 455 FAVNLNN----YIIS 465 (465) Q Consensus 455 f~~~~~~----~~~~ 465 (465) |+..+++ .|+. T Consensus 488 ~~~~~~~~~~~ri~~ 502 (524) T protein:vir:98 488 FANSRSQAPADRITS 502 (524) T ss_pred cccccCCcccccccc Confidence 3333332 2233 No 28 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=97.82 E-value=2.9e-06 Score=50.91 Aligned_cols=374 Identities=13% Similarity=0.075 Sum_probs=145.4 Q ss_pred hhhccccccC-hhhhhheehccccchhHHHhhhhhhhccccccccchhhhhhhhhhhhhhhhhheeeeccCCCcceEEEE Q lcl|NC_018861. 16 ITSNLYPNLN-ESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILVPVIRRALPSLIGTEIAGVQALKTPTAYLYAM 94 (465) Q Consensus 16 ~~~~~~~~~~-~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAM 94 (465) -...+.|.|. +|+ -|||..+. 
-.|..++-.+|+- .+-|- =.|+ -..| -+ T Consensus 1 ~~~~~~~~l~~kw~-----p~l~~~~~-----------~~i~~~~~~~~a~---~~enq-~~~~-----~~~~-----~~ 50 (521) T protein:vir:10 1 MTIKTKAELLNKWK-----PLLEGEGL-----------PEIANSKQAIIAK---IFENQ-EKDF-----QTAP-----EY 50 (521) T ss_pred CCcchhHHHHHhhh-----hhhccCCC-----------Cccccchhhhhhh---hhhhh-hhhh-----hhcc-----cc Confidence 1112222221 122 23333221 1111111111111 11110 0000 0001 00 Q ss_pred EE--------EecCC---CCcccccccccccCccccccccccccccccccccccc-cccccccccccccccccccchhhh Q lcl|NC_018861. 95 VP--------HYVGD---GNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF-KTATTVKGKIVYSEKQAGTDNIVN 162 (465) Q Consensus 95 RS--------rY~~~---~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~-~tatt~ggait~~~~~TGPTgLif 162 (465) |. .+... ++++.....+- .+.-...+..++...--++.- .+..-..-++|++|||+||||||| T Consensus 51 ~~~~~~~~~~~~l~e~~~~~~~~~~~~~i-----~es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIF 125 (521) T protein:vir:10 51 KDEKIAQAFGSFLTEAEIGGDHGYNATNI-----AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVF 125 (521) T ss_pred chhHHHHHHhhhhhhhcccCccccccccc-----cccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhhe Confidence 00 00000 00100000000 000000111111111111121 244456788999999999999999 Q ss_pred heeeeeccCc----cccccccc--cccccccccCCccCCCcccccCcccc-----------cccccc----ccccchhhh Q lcl|NC_018861. 163 VLLRLESNST----GSVAIGDE--MDKAATFATKKATVEAVYTNEALWLK-----------VLKNYT----GPYATAAGE 221 (465) Q Consensus 163 am~s~y~~~~----g~ea~~~e--~~t~~s~~~~~~~~~~~~~~~a~~~~-----------~~~~~~----~~~~Ta~~E 221 (465) |||++|.++. +.++++++ +++.|++..............+...+ ..+... ....+.... 
T Consensus 126 AMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~ 205 (521) T protein:vir:10 126 ALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDA 205 (521) T ss_pred eeeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCccccc Confidence 9999999875 56777765 77888876543221111110000000 000000 000000000 Q ss_pred ccCCc--h----hhcceEEEEEEEEeecceec----------cc--chHH-----H-HHHHHhhhCCCHHHHHHHH---- Q lcl|NC_018861. 222 KLGKD--M----KEMGISVQRVLAEAKTRKVK----------GT--YTIE-----M-LQDLKAQHGINAEKELADI---- 273 (465) Q Consensus 222 ~lg~~--f----~EM~FsIeK~tVtAKSRaLK----------AE--YT~E-----L-AQDLkAiHGlDAe~EL~ni---- 273 (465) ..++. . .--.+.+-.--.|++--+|. +| ++|| . ..=|||..-+..-+-|..| T Consensus 206 ~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLD 285 (521) T protein:vir:10 206 AKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMD 285 (521) T ss_pred ccccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCC Confidence 00000 0 00111122222233322220 11 1111 1 2347777666666666554 Q ss_pred ----HHHHHHHHhhHHHHhhhhheeeeeeeeeeccCC-------cccHH------HHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_018861. 274 ----LSAEVALEIDRTIIEKANEVATVCTDFDVNSAD-------GRWFI------EKARGLSMRISNEAREIGRQTRKGG 336 (465) Q Consensus 274 ----LstEImlEINreii~~l~~~at~~~~~~~~~~~-------~~~~~------e~~~~L~~~i~~~a~~i~~~T~~~~ 336 (465) |++-+.-||=.||=|-+...+++++++.+++.. |.... +.+|-...+...+--+|.+.-. T Consensus 286 AEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an--- 362 (521) T protein:vir:10 286 ADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAV--- 362 (521) T ss_pred hHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHH--- Confidence 888888999999999888888888898887763 32211 2345555555555555544221 Q ss_pred ccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC--------Ccce-----EEEEEecC Q lcl|NC_018861. 
337 GNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA--------EFDY-----CTVAYKGA 403 (465) Q Consensus 337 ~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~--------~~dy-----~~vg~kg~ 403 (465) .+.+...-.. |+ .|.|-+.- ..|| +-.|+ . T Consensus 363 -----------~i~~~T~r~~-------------------~n-----~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~--~ 405 (521) T protein:vir:10 363 -----------EIARQTGRGE-------------------GN-----FIIASRNVVNVLASVDTGISYAAQGLATGF--N 405 (521) T ss_pred -----------HHHHhccccc-------------------ce-----EEEEchHHHHHHhhcccccccccccccccc--c Confidence 2332222111 11 12222220 0011 00111 1 Q ss_pred CCccceeEEecccccceeeeeCCCcccceeeeeee------eeeeecCcccccccceEE-Eee-------ccceeC Q lcl|NC_018861. 404 SNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNR------YDVVATPLHPEAFIRTFA-VNL-------NNYIIS 465 (465) Q Consensus 404 ~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tR------Y~l~~nPf~~~~~~~~f~-~~~-------~~~~~~ 465 (465) .+...++||. -.-+-+++.+||.+-+..+-+=.+ =||.=+||.|.-..+.-- .++ +-|=|+ T Consensus 406 ~d~~~~~~~G-~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 480 (521) T protein:vir:10 406 TDTTKSVFAG-VLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG 480 (521) T ss_pred ccCCCceEEE-EecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeecee Confidence 1334556654 223446667777776665433222 134456776644333110 111 111111 No 29 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.07 E-value=0.00018 Score=41.06 Aligned_cols=301 Identities=12% Similarity=0.048 Sum_probs=133.2 Q ss_pred CCccc------hhh-hHHHhhhh--hhccccccCh---hhhhh----------eehccccchhH---HH----hhhh--- Q lcl|NC_018861. 1 MADKY------LLD-ESTKEKFI--TSNLYPNLNE---SEKNI----------MRTVLENQGNE---VK----MLME--- 48 (465) Q Consensus 1 ~~~~~------~~~-e~~~e~~~--~~~~~~~~~~---~~~~~----------~~~l~~n~~~~---~~----~i~e--- 48 (465) +.++. +-+ +...|.=. .....+..++ .++.. 
.......+..+ +. ...+ T Consensus 49 l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (394) T protein:vir:97 49 AKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQK 128 (394) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhc Confidence 10000 000 00000000 0000000000 00000 00000000000 00 0000 Q ss_pred -hhhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccc Q lcl|NC_018861. 49 -STVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKD 125 (465) Q Consensus 49 -st~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea 125 (465) +.++.+....-|.-+ .+++.+-+..+...++.|.||+++++-+--++. +.. T Consensus 129 ~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-----~~~--------------------- 182 (394) T protein:vir:97 129 DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQR-----ATT--------------------- 182 (394) T ss_pred cccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEec-----CCC--------------------- Confidence 111111111123333 356667777777889999999888754422220 000 Q ss_pred ccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccc Q lcl|NC_018861. 126 DFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWL 205 (465) Q Consensus 126 ~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~ 205 (465) . ..+.. T Consensus 183 --------------~------~~~v~------------------------------------------------------ 188 (394) T protein:vir:97 183 --------------K------MVTVA------------------------------------------------------ 188 (394) T ss_pred --------------c------cceec------------------------------------------------------ Confidence 0 00000 Q ss_pred cccccccccccchhhhccCCchhhc-ceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhH Q lcl|NC_018861. 
206 KVLKNYTGPYATAAGEKLGKDMKEM-GISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDR 284 (465) Q Consensus 206 ~~~~~~~~~~~Ta~~E~lg~~f~EM-~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINr 284 (465) | |...++. ...+++++..+|.-+-...+|-||++|- +.|.+++|.+-|+..|..-+|. T Consensus 189 ---------------E--~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la~~~~~~~~~ 247 (394) T protein:vir:97 189 ---------------E--LEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTND 247 (394) T ss_pred ---------------c--cccccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHH Confidence 0 0111111 1247778888888888889999999986 3477888999998888888888 Q ss_pred HHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccC Q lcl|NC_018861. 285 TIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSP 360 (465) Q Consensus 285 eii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~ 360 (465) .||..+...... +.. .+.+|...+.... ...+ . ..+|+++.+...|+.. |-..+.| T Consensus 248 ~i~~g~~~~~~~---------~~~----~~~~~~~~~~~~~-----~~~~-~-a~~v~n~~~~~~l~~lkd~~G~~i~~~ 307 (394) T protein:vir:97 248 AIAKVLKSFTTK---------TVK----NLDEIKALLNGGF-----DPAY-N-VSLIVSQSFYQTLDTLKDGNGRYLLQD 307 (394) T ss_pred HHhhcccccccc---------ccc----cHHHHHHHHHhhh-----hhhh-C-CEEEEcHHHHHHHHHhhccCCCeeeec Confidence 888765322111 111 1222322222111 1122 2 3467999999998875 3333332 Q ss_pred CcccccccccccceEEEEecCceEEEE--eCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeee Q lcl|NC_018861. 361 AGSKIDAINSGIKPNVGKFDNRYDVIV--DNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNR 438 (465) Q Consensus 361 ~~~~~~~~~~~~~~~~G~l~~~~~vy~--d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tR 438 (465) ... ...-++|. |++|++ |...+..-+++|-- ..+....+-..+.. 
...|...++..+-...| T Consensus 308 ~~~---------~~~~~~l~-G~pv~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r 371 (394) T protein:vir:97 308 DIT---------AVSGKVLL-GKPVFVLSDEVLGANKAFIGDF-----KRGVLFADRKDLGL-RWADNEIYGQYLQAVLR 371 (394) T ss_pred CcC---------CCCCceec-cceeEEecccccCCccEEEeec-----cccEEEEEecceEE-EEecccccceeEEEEEE Confidence 211 01124664 466665 44444444444420 01111222212222 23455566666666778 Q ss_pred eee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 439 YDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 439 Y~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ++. +.+| ..|+ +.++.+..- T Consensus 372 ~d~~v~~~-------~a~~~~~~~~~~~p 393 (394) T protein:vir:97 372 FGVSKVDD-------KAGYYVTFTPEPLP 393 (394) T ss_pred EccEEecc-------cceEEEEecccccC Confidence 887 4444 4565 666665555 No 30 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=96.75 E-value=0.00036 Score=39.46 Aligned_cols=331 Identities=13% Similarity=0.122 Sum_probs=137.8 Q ss_pred CCccchhh--------hHHHhhhhhhcccc---ccCh----hhhhh-eehccccch------hHHHhhhhhhhccccccc Q lcl|NC_018861. 1 MADKYLLD--------ESTKEKFITSNLYP---NLNE----SEKNI-MRTVLENQG------NEVKMLMESTVTGDIAKF 58 (465) Q Consensus 1 ~~~~~~~~--------e~~~e~~~~~~~~~---~~~~----~~~~~-~~~l~~n~~------~~~~~i~est~t~~v~~~ 58 (465) +..+...+ ....+....+.... .... .++.. ........+ +++..+..++++|.. .. T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~-lv 168 (477) T protein:vir:84 90 VRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGY-AV 168 (477) T ss_pred hcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcce-ee Confidence 11111100 00011100000000 0000 00000 000000000 111112111111111 11 Q ss_pred cchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccc Q lcl|NC_018861. 
59 TPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV 136 (465) Q Consensus 59 ~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~ 136 (465) -|..+ -++...-++.+-.+++++.||++.+|-+-=-|..- +.. T Consensus 169 ~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~---~~~-------------------------------- 213 (477) T protein:vir:84 169 PPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILT---GTS-------------------------------- 213 (477) T ss_pred ccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEec---Ccc-------------------------------- Confidence 23322 25566667777889999999999887653333110 000 Q ss_pred cccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccccc Q lcl|NC_018861. 137 SFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYA 216 (465) Q Consensus 137 s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 216 (465) .+ .|.... .. T Consensus 214 ---~a------~~~~Eg-------------------------------------------~~------------------ 223 (477) T protein:vir:84 214 ---TA------IQAADN-------------------------------------------AA------------------ 223 (477) T ss_pred ---ee------eeeccC-------------------------------------------cc------------------ Confidence 00 000000 00 Q ss_pred chhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe--- Q lcl|NC_018861. 217 TAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV--- 293 (465) Q Consensus 217 Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~--- 293 (465) ......++...+++.++..+|.-+-...+|-||.+|- ..|.++.|.+-|+..|..-+|+.+|.-=-.. 
T Consensus 224 -----~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p 294 (477) T protein:vir:84 224 -----LTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQA----AVSVDEFVFRDLAADYANKLNVQVISGTGSNNQV 294 (477) T ss_pred -----cccccccccccceeeEEEeeeeEEeeeHHHHHHHhcc----chhHHHHHHHHHHHHHHHHHHHHHhccCCCCCcc Confidence 0001233444567788888888888889999999984 3578999999999999999999998531000 Q ss_pred ---eeeeeeeeec--cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCccc Q lcl|NC_018861. 294 ---ATVCTDFDVN--SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSK 364 (465) Q Consensus 294 ---at~~~~~~~~--~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~ 364 (465) .+......+. .....|. ....++..|-...+.+....+ -.+..++.+++....|... |.-.+.|.... T Consensus 295 ~Gi~~~~~~~~~~~~~~~~t~~--~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~ 371 (477) T protein:vir:84 295 VGVRATAGITQVTATSAGSALE--KHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPG 371 (477) T ss_pred ceeeeccccccccccccccchh--hHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCccc Confidence 0000001111 1111221 222233333333333332223 2456788888877776554 33333333221 Q ss_pred cccc----ccccceEEEEecCceEEEEeCCCCcc--------eEEEEEecCCCccceeEEecccc--cceeeeeCCCccc Q lcl|NC_018861. 365 IDAI----NSGIKPNVGKFDNRYDVIVDNFAEFD--------YCTVAYKGASNFDAGIFFAPYNI--TLQQNLTDPVSGQ 430 (465) Q Consensus 365 ~~~~----~~~~~~~~G~l~~~~~vy~d~~~~~d--------y~~vg~kg~~~~d~glfy~PY~~--~~~~~~~dp~s~q 430 (465) .... ..-.....|+| .|++|+++++.|.+ .+++|.-. -|+. ..+...++|.++- T Consensus 372 ~~~~~~~~~~~~~~~~~~l-~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~-----------~~~i~~~~~~~~~~~~~~~ 439 (477) T protein:vir:84 372 FNNLGVLTEVASQRVVGQM-HGLPVVTDPTLPTTLGTGTDQDVIHVLRAS-----------DLALFESSVRMRALQETRA 439 (477) T ss_pred ccccccccccccccccchh-cccceEecCcccccccccCCcceEEEEEec-----------eEEEEeeceeEEecccccc Confidence 1110 00112244677 47899999988754 23333321 1111 1122233444332 Q ss_pred --ceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 
431 --PAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 431 --p~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) ..+.|. .||.... .+.-....|+ .+||+=+. T Consensus 440 ~~~~~~~~-v~~~~~~--~~~r~~~afv-~~t~~~~~ 472 (477) T protein:vir:84 440 ENLSVLLQ-VYGYLAF--TAARFPQSVV-EIGGTALT 472 (477) T ss_pred ccceeeee-ehhhhhh--hhhccccceE-Eeeccccc Confidence 222221 1332111 0111123333 33444333 No 31 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=96.46 E-value=0.0006 Score=38.24 Aligned_cols=267 Identities=11% Similarity=0.029 Sum_probs=122.8 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCcccccccccc--ccccccccCCccCCCcccccCccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEM--DKAATFATKKATVEAVYTNEALWLKVLKNYTG 213 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~--~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~ 213 (465) ++..+.....-.+.-... ..++..+... ..+..-+ +..+++.. +....... +. .. T Consensus 1 MA~~~T~~~~~~iPev~s----~~v~~~~~~~-------~~~~~~~~~~~~~~g~~------G~tv~iP~-----~~-~~ 57 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLA----DMIDAEVGKA-------IRFAPLAEVDTTLEGQP------GTTLTVPK-----WD-YI 57 (272) T ss_pred CCCccccchheechHHHH----HHHHHHHHHH-------hhhhccccccccccCCC------CCEEEEEE-----ec-CC Confidence 222111111100000000 0000000000 0000000 00000000 00000000 00 01 Q ss_pred cccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe Q lcl|NC_018861. 214 PYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV 293 (465) Q Consensus 214 ~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~ 293 (465) +-..-.+| |.++..=..+.+.++++.|.++-.-+.|=|++.+ -+-|.++++.+-|+..|..+|+.+++..+... 
T Consensus 58 ~~a~~v~e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a 131 (272) T protein:vir:98 58 GDAEDVAE--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKS 131 (272) T ss_pred CCcccccC--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 11112223 2334444456778888888887666777666533 25799999999999999999999999887432 Q ss_pred eeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccc Q lcl|NC_018861. 294 ATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIK 373 (465) Q Consensus 294 at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~ 373 (465) .. .+ .+.-..+....+..++.++ ....++++++|+++..|.......+......... .... T Consensus 132 ~~-----~~---~~~~t~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~--~~~~ 192 (272) T protein:vir:98 132 TQ-----TV---EATATVDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGATEVGAN--RVVS 192 (272) T ss_pred cc-----cc---ccccCHHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccccccccc--cccc Confidence 11 11 1111123333333333222 2356799999999999987765554332211111 1112 Q ss_pred eEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCccccccc Q lcl|NC_018861. 374 PNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFI 452 (465) Q Consensus 374 ~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~ 452 (465) ..+|++ .|++|+++++.|.+-+.+.-+|.- +++-.. +.....--|+.+++-.+-..-|||+ ..||=. .-. T Consensus 193 g~ig~i-~G~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~--~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~--vv~ 263 (272) T protein:vir:98 193 GVYGEV-LGVQIVRSRKCPKGTAYMVRKGAL----RIMLKR--NTMVETDRDITKAINQIVANKHYGVYLYKAEK--AVK 263 (272) T ss_pred ccchhh-cCeeEEEcCCCCcceEEEEcCCeE----EEEecC--CceeeeccccccceeEEEEEEEEEEEEEcCCc--eEE Confidence 346777 468999999998655444323311 111111 1112223378888888888889988 556520 000 Q ss_pred ceEEEeeccceeC Q lcl|NC_018861. 
453 RTFAVNLNNYIIS 465 (465) Q Consensus 453 ~~f~~~~~~~~~~ 465 (465) .+|+ --+ T Consensus 264 ~t~~------~a~ 270 (272) T protein:vir:98 264 ITLK------DAA 270 (272) T ss_pred EEec------ccc Confidence 0111 000 No 32 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=96.46 E-value=0.0006 Score=38.24 Aligned_cols=267 Identities=11% Similarity=0.029 Sum_probs=122.8 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCcccccccccc--ccccccccCCccCCCcccccCccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEM--DKAATFATKKATVEAVYTNEALWLKVLKNYTG 213 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~--~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~ 213 (465) ++..+.....-.+.-... ..++..+... ..+..-+ +..+++.. +....... +. .. T Consensus 1 MA~~~T~~~~~~iPev~s----~~v~~~~~~~-------~~~~~~~~~~~~~~g~~------G~tv~iP~-----~~-~~ 57 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLA----DMIDAEVGKA-------IRFAPLAEVDTTLEGQP------GTTLTVPK-----WD-YI 57 (272) T ss_pred CCCccccchheechHHHH----HHHHHHHHHH-------hhhhccccccccccCCC------CCEEEEEE-----ec-CC Confidence 222111111100000000 0000000000 0000000 00000000 00000000 00 01 Q ss_pred cccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe Q lcl|NC_018861. 214 PYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV 293 (465) Q Consensus 214 ~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~ 293 (465) +-..-.+| |.++..=..+.+.++++.|.++-.-+.|=|++.+ -+-|.++++.+-|+..|..+|+.+++..+... 
T Consensus 58 ~~a~~v~e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a 131 (272) T protein:vir:30 58 GDAEDVAE--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKS 131 (272) T ss_pred CCcccccC--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 11112223 2334444456778888888887666777666533 25799999999999999999999999887432 Q ss_pred eeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccc Q lcl|NC_018861. 294 ATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIK 373 (465) Q Consensus 294 at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~ 373 (465) .. .+ .+.-..+....+..++.++ ....++++++|+++..|.......+......... .... T Consensus 132 ~~-----~~---~~~~t~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~--~~~~ 192 (272) T protein:vir:30 132 TQ-----TV---EATATVDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGATEVGAN--RVVS 192 (272) T ss_pred cc-----cc---ccccCHHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccccccccc--cccc Confidence 11 11 1111123333333333222 2356799999999999987765554332211111 1112 Q ss_pred eEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCccccccc Q lcl|NC_018861. 374 PNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFI 452 (465) Q Consensus 374 ~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~ 452 (465) ..+|++ .|++|+++++.|.+-+.+.-+|.- +++-.. +.....--|+.+++-.+-..-|||+ ..||=. .-. T Consensus 193 g~ig~i-~G~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~--~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~--vv~ 263 (272) T protein:vir:30 193 GVYGEV-LGVQIVRSRKCPKGTAYMVRKGAL----RIMLKR--NTMVETDRDITKAINQIVANKHYGVYLYKAEK--AVK 263 (272) T ss_pred ccchhh-cCeeEEEcCCCCcceEEEEcCCeE----EEEecC--CceeeeccccccceeEEEEEEEEEEEEEcCCc--eEE Confidence 346777 468999999998655444323311 111111 1112223378888888888889988 556520 000 Q ss_pred ceEEEeeccceeC Q lcl|NC_018861. 
453 RTFAVNLNNYIIS 465 (465) Q Consensus 453 ~~f~~~~~~~~~~ 465 (465) .+|+ --+ T Consensus 264 ~t~~------~a~ 270 (272) T protein:vir:30 264 ITLK------DAA 270 (272) T ss_pred EEec------ccc Confidence 0111 000 No 33 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=96.45 E-value=0.00062 Score=38.18 Aligned_cols=304 Identities=12% Similarity=0.008 Sum_probs=118.0 Q ss_pred CCccchhhhHH----------Hhhhhhhccc---------cc----------------cChhhhhheehccccchhHHHh Q lcl|NC_018861. 1 MADKYLLDEST----------KEKFITSNLY---------PN----------------LNESEKNIMRTVLENQGNEVKM 45 (465) Q Consensus 1 ~~~~~~~~e~~----------~e~~~~~~~~---------~~----------------~~~~~~~~~~~l~~n~~~~~~~ 45 (465) +.+...+.|++ +++-.-.... .. .++++|....-|-+........ T Consensus 29 ~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (397) T protein:vir:48 29 MLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDS 108 (397) T ss_pred hcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccccccchhhHHHHHHHHHHHHHHhhhhhHHHHH Confidence 22222222111 1110000000 00 0001111111111100000000 Q ss_pred hhhhhhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 46 i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) ...++ +.+-...=|.-+ -+++..-++..-.+++.++||++++|-+--.+ ....... 
T Consensus 109 ~~~~t-~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~------------------- 166 (397) T protein:vir:48 109 KTDAS-GSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEK--WADITGL------------------- 166 (397) T ss_pred hhccC-CccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEe--ecCCCcc------------------- Confidence 11111 111111112222 44555566677788999999999998665444 1110000 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) + .+.. T Consensus 167 -----------------a------~~v~---------------------------------------------------- 171 (397) T protein:vir:48 167 -----------------A------KLDD---------------------------------------------------- 171 (397) T ss_pred -----------------e------eeec---------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccccccchhhhccCCchhhc-ceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 204 WLKVLKNYTGPYATAAGEKLGKDMKEM-GISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEI 282 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM-~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEI 282 (465) .|..+++- ..++++++..+|.-+-...+|-||.+|-. .|.+++|.+-|+..|..-+ T Consensus 172 -------------------E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~v~~~l~~~~~~~~ 228 (397) T protein:vir:48 172 -------------------EAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTR 228 (397) T ss_pred -------------------cccccccccccceeeEEeeheeeeeehhhHHHHHhhch----HHHHHHHHHHHHHHHHHHH Confidence 00111111 12355566666655666789999999842 5788999999999999999 Q ss_pred hHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCc Q lcl|NC_018861. 
283 DRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAG 362 (465) Q Consensus 283 Nreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~ 362 (465) |+.||.-.-....+ .+---.+....++..+ ... +.....++|++...+.|...- ...+ T Consensus 229 d~~il~G~g~~~~~---------~~~~~~d~i~~~~~~l-------~~~--~~~~a~~v~n~~~~~~L~~lk----d~~G 286 (397) T protein:vir:48 229 NKAILEAIATLPTK---------PTLTKWDDIIDLQAKV-------DPA--IKQTSFFLTNTSGFTALKKVK----NAFG 286 (397) T ss_pred HHHHhhcccccccc---------cccccHHHHHHHHHHh-------hhh--hcCCCEEEECHHHHHHHHHhh----cCCC Confidence 99998765221111 1111112233333333 221 224568899999999998652 1111 Q ss_pred ccccccccccceEEEEecCceEEEEeC--CC-----CcceEEEE---------EecCCCccceeEEecccccceeeeeCC Q lcl|NC_018861. 363 SKIDAINSGIKPNVGKFDNRYDVIVDN--FA-----EFDYCTVA---------YKGASNFDAGIFFAPYNITLQQNLTDP 426 (465) Q Consensus 363 ~~~~~~~~~~~~~~G~l~~~~~vy~d~--~~-----~~dy~~vg---------~kg~~~~d~glfy~PY~~~~~~~~~dp 426 (465) ..+..++ -...--++| .|++|++.. .. ...-+++| .++.-... ..++.- .+- T Consensus 287 ~~i~~~~-~~~~~~~~l-~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~----~~~~~~------~~~ 354 (397) T protein:vir:48 287 DYLMERD-VKSPTGYSI-DGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLL----STNIGG------GAF 354 (397) T ss_pred ceeeccC-cCCCCCcee-ccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEE----Eeccch------hhh Confidence 1111111 001112455 455555321 11 11112223 22111110 000000 001 Q ss_pred Ccccceeeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 427 VSGQPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 427 ~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ...+=.+-...|++. ..|| ..|+ +.++..-=. 
T Consensus 355 ~~~~~~~r~~~r~d~~~~~~-------~a~~~~~~~~~~~~ 388 (397) T protein:vir:48 355 ETDTTKIRVIDRFDVVATDT-------ESFVPASFKAIADQ 388 (397) T ss_pred hcCceeEEEEeeeccEEecc-------cceEEEEecccccC Confidence 111223333344443 2222 1111 111111100 No 34 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=95.90 E-value=0.0013 Score=36.45 Aligned_cols=308 Identities=14% Similarity=0.042 Sum_probs=129.4 Q ss_pred CCccch-----------hhhHHHhhhhhhcccccc---------Chhhhhheehccccc-hhHHHhhhhhhhcccccccc Q lcl|NC_018861. 1 MADKYL-----------LDESTKEKFITSNLYPNL---------NESEKNIMRTVLENQ-GNEVKMLMESTVTGDIAKFT 59 (465) Q Consensus 1 ~~~~~~-----------~~e~~~e~~~~~~~~~~~---------~~~~~~~~~~l~~n~-~~~~~~i~est~t~~v~~~~ 59 (465) +.++.- ..+++.++.......... ++.++.......... .+.++.+..+++++. .-.. T Consensus 39 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~ 117 (385) T protein:vir:18 39 LQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAG-SLIQ 117 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCC-ceec Confidence 000000 000011111100000000 000000000000000 011111111111111 1123 Q ss_pred chhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc Q lcl|NC_018861. 60 PILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF 138 (465) Q Consensus 60 P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~ 138 (465) |.++ .++.++..+..-.+++-++||+++..-+. | +...+.. 
T Consensus 118 ~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~~~---------------------------------- 159 (385) T protein:vir:18 118 PMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYV--R--EEVFTNN---------------------------------- 159 (385) T ss_pred chhhhHHHHHhhhccchhhhcceecccCcceEEE--E--EecCCcc---------------------------------- Confidence 4444 56677777778888999999987753221 1 1000000 Q ss_pred cccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccch Q lcl|NC_018861. 139 KTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATA 218 (465) Q Consensus 139 ~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta 218 (465) + .|. T Consensus 160 --a------~~v-------------------------------------------------------------------- 163 (385) T protein:vir:18 160 --A------DVV-------------------------------------------------------------------- 163 (385) T ss_pred --e------eee-------------------------------------------------------------------- Confidence 0 000 Q ss_pred hhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh--------- Q lcl|NC_018861. 219 AGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK--------- 289 (465) Q Consensus 219 ~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~--------- 289 (465) +| |..+++-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.+|.. T Consensus 164 -~E--~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~G 235 (385) T protein:vir:18 164 -AE--KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEG 235 (385) T ss_pred -cc--CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Confidence 00 122344444677788888888888899999999841 4678888888888888889888852 Q ss_pred hhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccc Q lcl|NC_018861. 
290 ANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAIN 369 (465) Q Consensus 290 l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~ 369 (465) |...+.+ ...... ..+.-..+....++ ..+. ..+...+.++++++....|....- ..+....... T Consensus 236 i~~~~~~-~~~~~~-~~~~~~~d~i~~~~-------~~l~--~~~~~~~~~~~~~~~~~~l~~lkd----~~G~~l~~~~ 300 (385) T protein:vir:18 236 LNKVATA-YDTSLN-ATGDTRADIIAHAI-------YQVT--ESEFSASGIVLNPRDWHNIALLKD----NEGRYIFGGP 300 (385) T ss_pred ccccccc-cccccc-ccccchHHHHHHHH-------Hhhc--cccCCCCEEEEcHHHHHHHHHhhc----CCCceeccCc Confidence 1111110 000010 11111112222222 2321 233467789999999999986431 1122211100 Q ss_pred cccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeC--C-Ccc-cceeeee--eeeee-e Q lcl|NC_018861. 370 SGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTD--P-VSG-QPAMILN--NRYDV-V 442 (465) Q Consensus 370 ~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~d--p-~s~-qp~~~~~--tRY~l-~ 442 (465) ...-.++|. |++|++++..|..-+++|--- .++.. +.-.-+.+-++ . +-| +..++|. .|++. + T Consensus 301 --~~~~~~~l~-G~pV~~~~~~p~~~~~~gd~~-----~~~~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v 370 (385) T protein:vir:18 301 --QAFTSNIMW-GLPVVPTKAQAAGTFTVGGFD-----MASQV--WDRMDATVEVSREDRDNFVKNMLTILCEERLALAH 370 (385) T ss_pred --ccCCCceec-ceeeEEcCcCCCCcEEEeecc-----cEEEE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 011235664 589999999887666665210 01111 11111111111 1 111 3344444 46665 3 Q ss_pred ecCcccccccc-eEEEeeccc Q lcl|NC_018861. 
443 ATPLHPEAFIR-TFAVNLNNY 462 (465) Q Consensus 443 ~nPf~~~~~~~-~f~~~~~~~ 462 (465) .+| .+..+ +|+ +.+ T Consensus 371 ~~~---~a~~~~~~~---aa~ 385 (385) T protein:vir:18 371 YRP---TAIIKGTFS---SGS 385 (385) T ss_pred ecc---cceEEEEec---cCC Confidence 333 22111 111 111 No 35 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=95.90 E-value=0.0013 Score=36.45 Aligned_cols=308 Identities=14% Similarity=0.042 Sum_probs=129.4 Q ss_pred CCccch-----------hhhHHHhhhhhhcccccc---------Chhhhhheehccccc-hhHHHhhhhhhhcccccccc Q lcl|NC_018861. 1 MADKYL-----------LDESTKEKFITSNLYPNL---------NESEKNIMRTVLENQ-GNEVKMLMESTVTGDIAKFT 59 (465) Q Consensus 1 ~~~~~~-----------~~e~~~e~~~~~~~~~~~---------~~~~~~~~~~l~~n~-~~~~~~i~est~t~~v~~~~ 59 (465) +.++.- ..+++.++.......... ++.++.......... .+.++.+..+++++. .-.. T Consensus 39 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~ 117 (385) T protein:vir:19 39 LQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAG-SLIQ 117 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCC-ceec Confidence 000000 000011111100000000 000000000000000 011111111111111 1123 Q ss_pred chhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc Q lcl|NC_018861. 60 PILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF 138 (465) Q Consensus 60 P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~ 138 (465) |.++ .++.++..+..-.+++-++||+++..-+. | +...+.. 
T Consensus 118 ~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~~~---------------------------------- 159 (385) T protein:vir:19 118 PMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYV--R--EEVFTNN---------------------------------- 159 (385) T ss_pred chhhhHHHHHhhhccchhhhcceecccCcceEEE--E--EecCCcc---------------------------------- Confidence 4444 56677777778888999999987753221 1 1000000 Q ss_pred cccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccch Q lcl|NC_018861. 139 KTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATA 218 (465) Q Consensus 139 ~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta 218 (465) + .|. T Consensus 160 --a------~~v-------------------------------------------------------------------- 163 (385) T protein:vir:19 160 --A------DVV-------------------------------------------------------------------- 163 (385) T ss_pred --e------eee-------------------------------------------------------------------- Confidence 0 000 Q ss_pred hhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh--------- Q lcl|NC_018861. 219 AGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK--------- 289 (465) Q Consensus 219 ~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~--------- 289 (465) +| |..+++-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.+|.. T Consensus 164 -~E--~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~G 235 (385) T protein:vir:19 164 -AE--KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEG 235 (385) T ss_pred -cc--CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Confidence 00 122344444677788888888888899999999841 4678888888888888889888852 Q ss_pred hhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccc Q lcl|NC_018861. 
290 ANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAIN 369 (465) Q Consensus 290 l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~ 369 (465) |...+.+ ...... ..+.-..+....++ ..+. ..+...+.++++++....|....- ..+....... T Consensus 236 i~~~~~~-~~~~~~-~~~~~~~d~i~~~~-------~~l~--~~~~~~~~~~~~~~~~~~l~~lkd----~~G~~l~~~~ 300 (385) T protein:vir:19 236 LNKVATA-YDTSLN-ATGDTRADIIAHAI-------YQVT--ESEFSASGIVLNPRDWHNIALLKD----NEGRYIFGGP 300 (385) T ss_pred ccccccc-cccccc-ccccchHHHHHHHH-------Hhhc--cccCCCCEEEEcHHHHHHHHHhhc----CCCceeccCc Confidence 1111110 000010 11111112222222 2321 233467789999999999986431 1122211100 Q ss_pred cccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeC--C-Ccc-cceeeee--eeeee-e Q lcl|NC_018861. 370 SGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTD--P-VSG-QPAMILN--NRYDV-V 442 (465) Q Consensus 370 ~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~d--p-~s~-qp~~~~~--tRY~l-~ 442 (465) ...-.++|. |++|++++..|..-+++|--- .++.. +.-.-+.+-++ . +-| +..++|. .|++. + T Consensus 301 --~~~~~~~l~-G~pV~~~~~~p~~~~~~gd~~-----~~~~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v 370 (385) T protein:vir:19 301 --QAFTSNIMW-GLPVVPTKAQAAGTFTVGGFD-----MASQV--WDRMDATVEVSREDRDNFVKNMLTILCEERLALAH 370 (385) T ss_pred --ccCCCceec-ceeeEEcCcCCCCcEEEeecc-----cEEEE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 011235664 589999999887666665210 01111 11111111111 1 111 3344444 46665 3 Q ss_pred ecCcccccccc-eEEEeeccc Q lcl|NC_018861. 
443 ATPLHPEAFIR-TFAVNLNNY 462 (465) Q Consensus 443 ~nPf~~~~~~~-~f~~~~~~~ 462 (465) .+| .+..+ +|+ +.+ T Consensus 371 ~~~---~a~~~~~~~---aa~ 385 (385) T protein:vir:19 371 YRP---TAIIKGTFS---SGS 385 (385) T ss_pred ecc---cceEEEEec---cCC Confidence 333 22111 111 111 No 36 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=95.73 E-value=0.0016 Score=35.99 Aligned_cols=309 Identities=13% Similarity=0.070 Sum_probs=138.5 Q ss_pred CCc--cchh-----hhHHHhhhhhh------cc-cccc---Chhhhhheehccccc---hhH-HHh---h------hhh- Q lcl|NC_018861. 1 MAD--KYLL-----DESTKEKFITS------NL-YPNL---NESEKNIMRTVLENQ---GNE-VKM---L------MES- 49 (465) Q Consensus 1 ~~~--~~~~-----~e~~~e~~~~~------~~-~~~~---~~~~~~~~~~l~~n~---~~~-~~~---i------~es- 49 (465) +.+ +.+. .+++.++=... .. .+.. ...++......+.+. ..+ +.+ + ... T Consensus 44 v~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (415) T protein:vir:46 44 ITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGS 123 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhcc Confidence 000 0000 00000000000 00 0000 001111111111111 011 111 0 111 Q ss_pred hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 50 t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) +++.+-..--|..+ .+++.+.+...-.+++.|.||+++++-+--.+. .. .. 
T Consensus 124 ~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~--~~----------------------- 176 (415) T protein:vir:46 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQ--SE--VA----------------------- 176 (415) T ss_pred ccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEe--cC--Cc----------------------- Confidence 11211111224333 466777888889999999999999876543331 00 00 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) . ..+. T Consensus 177 ------------~------~~~v--------------------------------------------------------- 181 (415) T protein:vir:46 177 ------------A------LEKV--------------------------------------------------------- 181 (415) T ss_pred ------------c------eeec--------------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_018861. 208 LKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTI 286 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINrei 286 (465) ..|...++.+ -++++++..++..+-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.| T Consensus 182 --------------~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~i 243 (415) T protein:vir:46 182 --------------EELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAI 243 (415) T ss_pred --------------ccccccccccccceeeEEeeeeeeEeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011222222 246777777777777789999999984 357889999999999999999999 Q ss_pred Hhhhhheeeeeeee-------eeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----Cc Q lcl|NC_018861. 
287 IEKANEVATVCTDF-------DVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GS 355 (465) Q Consensus 287 i~~l~~~at~~~~~-------~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~ 355 (465) |...-.....+.-. .+......++ +....|+..+. ..+.+...+|+++.....|+.. |- T Consensus 244 l~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~~~~---------~~~~~~~~~v~n~~~~~~L~~lkd~~G~ 313 (415) T protein:vir:46 244 IDVITKGSTGSTSSGFEKEGKKLEVKKAKSL-DDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) T ss_pred hhccccCCccccccccccccceeccccccch-HHHHHHHHhhh---------hhccCCCEEEEcHHHHHHHHHhhccCCC Confidence 98662211111000 0111111111 23333333332 1233567889999999999764 22 Q ss_pred ccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc--------cceeeeeCCC Q lcl|NC_018861. 356 FVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI--------TLQQNLTDPV 427 (465) Q Consensus 356 ~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~--------~~~~~~~dp~ 427 (465) ..+.|.. .....++| .|++|++.++.+. | ...+..++|+.+.- .......|-. T Consensus 314 ~i~~~~~---------~~~~~~~l-~G~pV~~~~~~~~-----~----~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:46 314 YLIQPDV---------KEKTQQRL-LGAKIEILPDEVL-----G----QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred eeeccCc---------CCCCCccc-cceeeEEeccccc-----c----CCCccEEEEEehhccEEEEeecceEEEeeccc Confidence 2222211 01122455 4667777665432 1 11112233333211 1111233556 Q ss_pred cccceeeeeeeeee-eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 428 SGQPAMILNNRYDV-VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 428 s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +++-.+-...|++. 
+.+| ..|++-.--+..+ T Consensus 375 ~~~~~~~~~~r~d~~v~~~-------~a~~~~~~~~~~~ 406 (415) T protein:vir:46 375 HFGECLMIAVRQDCRILDY-------KSAIVIEYDDSER 406 (415) T ss_pred cCceEEEEEEEeccEEecc-------ccEEEEEeeccCC Confidence 66667777788876 4444 3344211111111 No 37 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=95.73 E-value=0.0016 Score=35.99 Aligned_cols=309 Identities=13% Similarity=0.070 Sum_probs=138.5 Q ss_pred CCc--cchh-----hhHHHhhhhhh------cc-cccc---Chhhhhheehccccc---hhH-HHh---h------hhh- Q lcl|NC_018861. 1 MAD--KYLL-----DESTKEKFITS------NL-YPNL---NESEKNIMRTVLENQ---GNE-VKM---L------MES- 49 (465) Q Consensus 1 ~~~--~~~~-----~e~~~e~~~~~------~~-~~~~---~~~~~~~~~~l~~n~---~~~-~~~---i------~es- 49 (465) +.+ +.+. .+++.++=... .. .+.. ...++......+.+. ..+ +.+ + ... T Consensus 44 v~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (415) T protein:vir:47 44 ITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGS 123 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhcc Confidence 000 0000 00000000000 00 0000 001111111111111 011 111 0 111 Q ss_pred hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 50 t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) +++.+-..--|..+ .+++.+.+...-.+++.|.||+++++-+--.+. .. .. 
T Consensus 124 ~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~--~~----------------------- 176 (415) T protein:vir:47 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQ--SE--VA----------------------- 176 (415) T ss_pred ccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEe--cC--Cc----------------------- Confidence 11211111224333 466777888889999999999999876543331 00 00 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) . ..+. T Consensus 177 ------------~------~~~v--------------------------------------------------------- 181 (415) T protein:vir:47 177 ------------A------LEKV--------------------------------------------------------- 181 (415) T ss_pred ------------c------eeec--------------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_018861. 208 LKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTI 286 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINrei 286 (465) ..|...++.+ -++++++..++..+-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.| T Consensus 182 --------------~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~i 243 (415) T protein:vir:47 182 --------------EELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAI 243 (415) T ss_pred --------------ccccccccccccceeeEEeeeeeeEeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011222222 246777777777777789999999984 357889999999999999999999 Q ss_pred Hhhhhheeeeeeee-------eeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----Cc Q lcl|NC_018861. 
287 IEKANEVATVCTDF-------DVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GS 355 (465) Q Consensus 287 i~~l~~~at~~~~~-------~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~ 355 (465) |...-.....+.-. .+......++ +....|+..+. ..+.+...+|+++.....|+.. |- T Consensus 244 l~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~~~~---------~~~~~~~~~v~n~~~~~~L~~lkd~~G~ 313 (415) T protein:vir:47 244 IDVITKGSTGSTSSGFEKEGKKLEVKKAKSL-DDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) T ss_pred hhccccCCccccccccccccceeccccccch-HHHHHHHHhhh---------hhccCCCEEEEcHHHHHHHHHhhccCCC Confidence 98662211111000 0111111111 23333333332 1233567889999999999764 22 Q ss_pred ccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc--------cceeeeeCCC Q lcl|NC_018861. 356 FVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI--------TLQQNLTDPV 427 (465) Q Consensus 356 ~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~--------~~~~~~~dp~ 427 (465) ..+.|.. .....++| .|++|++.++.+. | ...+..++|+.+.- .......|-. T Consensus 314 ~i~~~~~---------~~~~~~~l-~G~pV~~~~~~~~-----~----~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:47 314 YLIQPDV---------KEKTQQRL-LGAKIEILPDEVL-----G----QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred eeeccCc---------CCCCCccc-cceeeEEeccccc-----c----CCCccEEEEEehhccEEEEeecceEEEeeccc Confidence 2222211 01122455 4667777665432 1 11112233333211 1111233556 Q ss_pred cccceeeeeeeeee-eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 428 SGQPAMILNNRYDV-VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 428 s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +++-.+-...|++. 
+.+| ..|++-.--+..+ T Consensus 375 ~~~~~~~~~~r~d~~v~~~-------~a~~~~~~~~~~~ 406 (415) T protein:vir:47 375 HFGECLMIAVRQDCRILDY-------KSAIVIEYDDSER 406 (415) T ss_pred cCceEEEEEEEeccEEecc-------ccEEEEEeeccCC Confidence 66667777788876 4444 3344211111111 No 38 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.72 E-value=0.0016 Score=35.97 Aligned_cols=315 Identities=12% Similarity=0.049 Sum_probs=135.2 Q ss_pred CCc------cchh-----hhHHHhhh---hhhc-----cccccChhhhh--heehccccc---hhHH-Hh---h------ Q lcl|NC_018861. 1 MAD------KYLL-----DESTKEKF---ITSN-----LYPNLNESEKN--IMRTVLENQ---GNEV-KM---L------ 46 (465) Q Consensus 1 ~~~------~~~~-----~e~~~e~~---~~~~-----~~~~~~~~~~~--~~~~l~~n~---~~~~-~~---i------ 46 (465) |.+ +.+. .+.+.++= .... ..+.....+++ .....+... ..+. .+ + T Consensus 40 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (415) T protein:vir:98 40 LEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI 119 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhh Confidence 000 0000 00000000 0000 00000000000 000111111 0011 00 1 Q ss_pred hhh-hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 47 MES-TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 47 ~es-t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) ... +++.+-...-|.-+ .+++++..+.+-.+++.|.||++..+-+--.| ..+.. . 
T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~-~------------------- 177 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEVA-A------------------- 177 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEe--ecCCc-c------------------- Confidence 011 11111111235443 46677788888899999999999887654443 11100 0 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) . .+.. T Consensus 178 -----------------~------~~v~---------------------------------------------------- 182 (415) T protein:vir:98 178 -----------------L------EKVE---------------------------------------------------- 182 (415) T ss_pred -----------------c------eeec---------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 204 WLKVLKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEI 282 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEI 282 (465) .|.+.++.+ -++++++...|..+-...+|-||.+|- ..|.+++|.+-|+..|..-+ T Consensus 183 -------------------E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:98 183 -------------------ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA----KVNVLQELKLWMARTIAATR 239 (415) T ss_pred -------------------cccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHH Confidence 000111111 135666666666666778999999984 35788999999999999999 Q ss_pred hHHHHhhhhheeeeee------eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcc Q lcl|NC_018861. 
283 DRTIIEKANEVATVCT------DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSF 356 (465) Q Consensus 283 Nreii~~l~~~at~~~------~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~ 356 (465) |+.||...-.....+. ........+. ..+..|...+..+ .. ...+...+|+++.....|+..- T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~----~~--~~~~~~~~v~n~~~~~~l~~lk-- 308 (415) T protein:vir:98 240 NKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA---KSLDDIKDAINLN----VK--PNYEHNVAIVSQTMFAKLDKMK-- 308 (415) T ss_pred HHHHhhccccCccccccccccccccccccccc---cchhHHHHHHHhh----hh--hccCCCEEEEcHHHHHHHHHhh-- Confidence 9999986622100000 0000001111 1233333322222 11 1235667899999999997641 Q ss_pred cccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc----cc----cceeeeeCCCc Q lcl|NC_018861. 357 VLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY----NI----TLQQNLTDPVS 428 (465) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY----~~----~~~~~~~dp~s 428 (465) ...++++..++ ......++| .|++|++.++.+.. - ..+..++|+.+ +. .......|-.. T Consensus 309 --d~~G~~l~~~~-~~~~~~~~l-~G~pV~~~~~~~~~-----~----~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:98 309 --DKLGNYLIQPD-VKEKTQQRL-LGAKIEILPDEVLG-----Q----KGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred --ccCCceeeccC-cCCCCCcee-cceeeEEecccccC-----C----CCccEEEEEehhccEEEEeecceEEEEecccc Confidence 11112221111 011222455 56688877665321 1 11122333322 11 11112335566 Q ss_pred ccceeeeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 429 GQPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 429 ~qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ++..+....|++..+ .+| ..|+ +.++-..-. 
T Consensus 376 ~~~~~~~~~r~d~~v--~~~----~a~~~~~~~~~~~~ 407 (415) T protein:vir:98 376 FGECLMIAVRQDCRI--LDY----KSAIVIEYDDSERG 407 (415) T ss_pred CceEEEEEEEeccEE--ecc----ccEEEEEEeccCCC Confidence 777777788887632 222 3344 222222111 No 39 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.72 E-value=0.0016 Score=35.97 Aligned_cols=315 Identities=12% Similarity=0.049 Sum_probs=135.2 Q ss_pred CCc------cchh-----hhHHHhhh---hhhc-----cccccChhhhh--heehccccc---hhHH-Hh---h------ Q lcl|NC_018861. 1 MAD------KYLL-----DESTKEKF---ITSN-----LYPNLNESEKN--IMRTVLENQ---GNEV-KM---L------ 46 (465) Q Consensus 1 ~~~------~~~~-----~e~~~e~~---~~~~-----~~~~~~~~~~~--~~~~l~~n~---~~~~-~~---i------ 46 (465) |.+ +.+. .+.+.++= .... ..+.....+++ .....+... ..+. .+ + T Consensus 40 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (415) T protein:vir:79 40 LEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI 119 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhh Confidence 000 0000 00000000 0000 00000000000 000111111 0011 00 1 Q ss_pred hhh-hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 47 MES-TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 47 ~es-t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) ... +++.+-...-|.-+ .+++++..+.+-.+++.|.||++..+-+--.| ..+.. . 
T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~-~------------------- 177 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEVA-A------------------- 177 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEe--ecCCc-c------------------- Confidence 011 11111111235443 46677788888899999999999887654443 11100 0 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) . .+.. T Consensus 178 -----------------~------~~v~---------------------------------------------------- 182 (415) T protein:vir:79 178 -----------------L------EKVE---------------------------------------------------- 182 (415) T ss_pred -----------------c------eeec---------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 204 WLKVLKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEI 282 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEI 282 (465) .|.+.++.+ -++++++...|..+-...+|-||.+|- ..|.+++|.+-|+..|..-+ T Consensus 183 -------------------E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:79 183 -------------------ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA----KVNVLQELKLWMARTIAATR 239 (415) T ss_pred -------------------cccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHH Confidence 000111111 135666666666666778999999984 35788999999999999999 Q ss_pred hHHHHhhhhheeeeee------eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcc Q lcl|NC_018861. 
283 DRTIIEKANEVATVCT------DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSF 356 (465) Q Consensus 283 Nreii~~l~~~at~~~------~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~ 356 (465) |+.||...-.....+. ........+. ..+..|...+..+ .. ...+...+|+++.....|+..- T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~----~~--~~~~~~~~v~n~~~~~~l~~lk-- 308 (415) T protein:vir:79 240 NKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA---KSLDDIKDAINLN----VK--PNYEHNVAIVSQTMFAKLDKMK-- 308 (415) T ss_pred HHHHhhccccCccccccccccccccccccccc---cchhHHHHHHHhh----hh--hccCCCEEEEcHHHHHHHHHhh-- Confidence 9999986622100000 0000001111 1233333322222 11 1235667899999999997641 Q ss_pred cccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc----cc----cceeeeeCCCc Q lcl|NC_018861. 357 VLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY----NI----TLQQNLTDPVS 428 (465) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY----~~----~~~~~~~dp~s 428 (465) ...++++..++ ......++| .|++|++.++.+.. - ..+..++|+.+ +. .......|-.. T Consensus 309 --d~~G~~l~~~~-~~~~~~~~l-~G~pV~~~~~~~~~-----~----~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:79 309 --DKLGNYLIQPD-VKEKTQQRL-LGAKIEILPDEVLG-----Q----KGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred --ccCCceeeccC-cCCCCCcee-cceeeEEecccccC-----C----CCccEEEEEehhccEEEEeecceEEEEecccc Confidence 11112221111 011222455 56688877665321 1 11122333322 11 11112335566 Q ss_pred ccceeeeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 429 GQPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 429 ~qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ++..+....|++..+ .+| ..|+ +.++-..-. 
T Consensus 376 ~~~~~~~~~r~d~~v--~~~----~a~~~~~~~~~~~~ 407 (415) T protein:vir:79 376 FGECLMIAVRQDCRI--LDY----KSAIVIEYDDSERG 407 (415) T ss_pred CceEEEEEEEeccEE--ecc----ccEEEEEEeccCCC Confidence 777777788887632 222 3344 222222111 No 40 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.72 E-value=0.0016 Score=35.97 Aligned_cols=315 Identities=12% Similarity=0.049 Sum_probs=135.2 Q ss_pred CCc------cchh-----hhHHHhhh---hhhc-----cccccChhhhh--heehccccc---hhHH-Hh---h------ Q lcl|NC_018861. 1 MAD------KYLL-----DESTKEKF---ITSN-----LYPNLNESEKN--IMRTVLENQ---GNEV-KM---L------ 46 (465) Q Consensus 1 ~~~------~~~~-----~e~~~e~~---~~~~-----~~~~~~~~~~~--~~~~l~~n~---~~~~-~~---i------ 46 (465) |.+ +.+. .+.+.++= .... ..+.....+++ .....+... ..+. .+ + T Consensus 40 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (415) T protein:vir:81 40 LEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI 119 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhh Confidence 000 0000 00000000 0000 00000000000 000111111 0011 00 1 Q ss_pred hhh-hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 47 MES-TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 47 ~es-t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) ... +++.+-...-|.-+ .+++++..+.+-.+++.|.||++..+-+--.| ..+.. . 
T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~-~------------------- 177 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEVA-A------------------- 177 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEe--ecCCc-c------------------- Confidence 011 11111111235443 46677788888899999999999887654443 11100 0 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) . .+.. T Consensus 178 -----------------~------~~v~---------------------------------------------------- 182 (415) T protein:vir:81 178 -----------------L------EKVE---------------------------------------------------- 182 (415) T ss_pred -----------------c------eeec---------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 204 WLKVLKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEI 282 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEI 282 (465) .|.+.++.+ -++++++...|..+-...+|-||.+|- ..|.+++|.+-|+..|..-+ T Consensus 183 -------------------E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:81 183 -------------------ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA----KVNVLQELKLWMARTIAATR 239 (415) T ss_pred -------------------cccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHH Confidence 000111111 135666666666666778999999984 35788999999999999999 Q ss_pred hHHHHhhhhheeeeee------eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcc Q lcl|NC_018861. 
283 DRTIIEKANEVATVCT------DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSF 356 (465) Q Consensus 283 Nreii~~l~~~at~~~------~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~ 356 (465) |+.||...-.....+. ........+. ..+..|...+..+ .. ...+...+|+++.....|+..- T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~----~~--~~~~~~~~v~n~~~~~~l~~lk-- 308 (415) T protein:vir:81 240 NKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA---KSLDDIKDAINLN----VK--PNYEHNVAIVSQTMFAKLDKMK-- 308 (415) T ss_pred HHHHhhccccCccccccccccccccccccccc---cchhHHHHHHHhh----hh--hccCCCEEEEcHHHHHHHHHhh-- Confidence 9999986622100000 0000001111 1233333322222 11 1235667899999999997641 Q ss_pred cccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc----cc----cceeeeeCCCc Q lcl|NC_018861. 357 VLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY----NI----TLQQNLTDPVS 428 (465) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY----~~----~~~~~~~dp~s 428 (465) ...++++..++ ......++| .|++|++.++.+.. - ..+..++|+.+ +. .......|-.. T Consensus 309 --d~~G~~l~~~~-~~~~~~~~l-~G~pV~~~~~~~~~-----~----~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:81 309 --DKLGNYLIQPD-VKEKTQQRL-LGAKIEILPDEVLG-----Q----KGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred --ccCCceeeccC-cCCCCCcee-cceeeEEecccccC-----C----CCccEEEEEehhccEEEEeecceEEEEecccc Confidence 11112221111 011222455 56688877665321 1 11122333322 11 11112335566 Q ss_pred ccceeeeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 429 GQPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 429 ~qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ++..+....|++..+ .+| ..|+ +.++-..-. 
T Consensus 376 ~~~~~~~~~r~d~~v--~~~----~a~~~~~~~~~~~~ 407 (415) T protein:vir:81 376 FGECLMIAVRQDCRI--LDY----KSAIVIEYDDSERG 407 (415) T ss_pred CceEEEEEEEeccEE--ecc----ccEEEEEEeccCCC Confidence 777777788887632 222 3344 222222111 No 41 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=95.71 E-value=0.0016 Score=35.94 Aligned_cols=314 Identities=12% Similarity=0.062 Sum_probs=132.8 Q ss_pred CCccchhhhHHHhh-------hhhhccccc-cChhh--hhheehccccc---hhHHHh----------hhhh-hhccccc Q lcl|NC_018861. 1 MADKYLLDESTKEK-------FITSNLYPN-LNESE--KNIMRTVLENQ---GNEVKM----------LMES-TVTGDIA 56 (465) Q Consensus 1 ~~~~~~~~e~~~e~-------~~~~~~~~~-~~~~~--~~~~~~l~~n~---~~~~~~----------i~es-t~t~~v~ 56 (465) +....=..+++.++ ..+....+. ..+.+ .+....-+.++ .++.+- ...+ +++++-. T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~ 130 (415) T protein:vir:94 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF 130 (415) T ss_pred HHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhcccccccc Confidence 00000000000000 000000000 00000 00000000000 011110 1111 1111111 Q ss_pred cccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccc Q lcl|NC_018861. 57 KFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPI 134 (465) Q Consensus 57 ~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~ 134 (465) ..-|.-+ .+++.+-+..+-.+++.|+||++..+-+--.+ ...... 
T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~------------------------------- 177 (415) T protein:vir:94 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEVAA------------------------------- 177 (415) T ss_pred ccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEe--ecCCcc------------------------------- Confidence 1224322 46677788888999999999998775543333 111000 Q ss_pred cccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccc Q lcl|NC_018861. 135 EVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGP 214 (465) Q Consensus 135 ~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 214 (465) ..+.. T Consensus 178 ------------~~~v~--------------------------------------------------------------- 182 (415) T protein:vir:94 178 ------------LEKVE--------------------------------------------------------------- 182 (415) T ss_pred ------------ceecc--------------------------------------------------------------- Confidence 00000 Q ss_pred ccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe Q lcl|NC_018861. 215 YATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV 293 (465) Q Consensus 215 ~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~ 293 (465) .|...++.+ -++++++...|..+-.-.+|-||.+|-- .|.+++|.+-|...|..-+|+.||...-.. T Consensus 183 --------Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g 250 (415) T protein:vir:94 183 --------ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKG 250 (415) T ss_pred --------ccccccccccccceeeEeeheeeeeechhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccC Confidence 001111211 1355555556655556679999999864 578899999999999999999999865221 Q ss_pred eeeeee------eeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccc Q lcl|NC_018861. 
294 ATVCTD------FDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDA 367 (465) Q Consensus 294 at~~~~------~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~ 367 (465) ...+.. .......+. ..+..|...+...- ..+.....+|+++.....|+..- ...+++... T Consensus 251 ~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~~------~~~~~~~~~vmn~~~~~~l~~lk----d~~G~~l~~ 317 (415) T protein:vir:94 251 STGSTSSGFEKEGKKLEVKKA---KSLDDIKDAINLNV------KPNYEHNVAIVSQTMFAKLDKMK----DKLGNYLIQ 317 (415) T ss_pred ccccccccccccccccccccc---cchHHHHHHHHhhh------hhccCCCEEEEcHHHHHHHHHhh----ccCCCeeec Confidence 100000 000011111 11223333232221 11235677899999999997641 111111111 Q ss_pred cccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc--------cceeeeeCCCcccceeeeeeee Q lcl|NC_018861. 368 INSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI--------TLQQNLTDPVSGQPAMILNNRY 439 (465) Q Consensus 368 ~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~--------~~~~~~~dp~s~qp~~~~~tRY 439 (465) ++ ......++| .|++|++.+..+.. -.| +..++|+.+-- .......|-.+++-.+-...|+ T Consensus 318 ~~-~~~~~~~~l-~G~pV~~~~~~~~~-----~~~----~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r~ 386 (415) T protein:vir:94 318 PD-VKEKTQQRL-LGAKIEILPDEVLG-----QKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQ 386 (415) T ss_pred cC-cCCCCCcee-cceeeEEecccccC-----CCC----ccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEe Confidence 11 011122455 46678877654321 111 11223332211 1112233555666667777888 Q ss_pred ee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 440 DV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 440 ~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) +. ..+| .-|+ +.++-+.-. 
T Consensus 387 d~~~~~~-------~a~~~~~~~~~~~~ 407 (415) T protein:vir:94 387 DCRILDY-------KSAIVIEYDDSERG 407 (415) T ss_pred ccEEecc-------ccEEEEEEeccCCC Confidence 76 3443 3444 333322222 No 42 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=268 Identities=14% Similarity=0.021 Sum_probs=119.6 Q ss_pred ccccccccccccccccccccccc----hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTD----NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPT----gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) ++.... ......-|+ .+...+..+. .+.+-+.......+. .+.......+ +. T Consensus 1 ma~~~T--------~~~~~iiPev~~~~v~~~~~~~~-------~~~~~~~~~~~l~g~----~G~tv~ip~~-----~~ 56 (274) T protein:vir:93 1 MPQGIT--------KTSNQIIPEVLAPMMQAQLEKKL-------RFASFAEVDSTLQGQ----PGDTLTFPAF-----VY 56 (274) T ss_pred CCccce--------ehhheechHHHHHHHHHHHHhhh-------hhcccccccccccCC----CCCEEEEEee-----cc Confidence 111111 001111111 0111111100 000000010000000 0000000000 00 Q ss_pred cccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKAN 291 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~ 291 (465) .+-.....|...-++.++. ....+++-|.|+-.=+++=| +.+.+ +-|.-.+..+-++..+...++++++..+. 
T Consensus 57 -~g~~~~~~eg~~i~~~~it--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~~~~~~~a~~~d~~~~~~~~ 129 (274) T protein:vir:93 57 -SGDAQVVAEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALM 129 (274) T ss_pred -CCCcccccCCCcccccccc--cceeEEEeeeecccccccHH--HHHhh--ccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 0111111222122344443 44555555666522223322 22222 57899999999999999999999999884 Q ss_pred heeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccc Q lcl|NC_018861. 292 EVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSG 371 (465) Q Consensus 292 ~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~ 371 (465) ... .++. ......+.+-.+..++... ....++++|+|++++.|.......+..+...... -. T Consensus 130 ~a~-----~~~~--~~~~~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~--~~ 191 (274) T protein:vir:93 130 GAK-----LTVN--ADITKLNGLQSAIDKFNDE---------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDD--II 191 (274) T ss_pred ccc-----cccc--ccccCHHHHHHHHHHhhhc---------cCCccEEEeCHHHHHHHHhhhhhccccccccccc--ce Confidence 321 1111 1122234444444444432 1257899999999999998765555443221111 11 Q ss_pred cceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCccccc Q lcl|NC_018861. 372 IKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEA 450 (465) Q Consensus 372 ~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~ 450 (465) ....+|++ .|++||+++..|.+-..+.-+|. +-|.--.+.....--|+.++.-.+-...|||+ ..||=. . T Consensus 192 ~~G~ig~~-~G~~Vi~s~~~p~~t~~l~~~ga------i~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~--~ 262 (274) T protein:vir:93 192 VKGAFGEA-LGAIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK--A 262 (274) T ss_pred eeccccee-cCeeEEEcCCCCcceEEEEeCCe------EEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCc--e Confidence 13367887 47899999998865443333332 11210111111223489999999999999998 566610 0 Q ss_pred ccceEE-Eeecc Q lcl|NC_018861. 
451 FIRTFA-VNLNN 461 (465) Q Consensus 451 ~~~~f~-~~~~~ 461 (465) =..+|+ =.|.= T Consensus 263 v~~t~~~~s~~~ 274 (274) T protein:vir:93 263 VKITKGSGSLEM 274 (274) T ss_pred EEEeeCccccCC Confidence 001111 00000 No 43 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=94.99 E-value=0.003 Score=34.40 Aligned_cols=265 Identities=14% Similarity=0.017 Sum_probs=117.9 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCcccccccccc--ccccccccCCccCCCcccccCccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEM--DKAATFATKKATVEAVYTNEALWLKVLKNYTG 213 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~--~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~ 213 (465) ++.... .....+ .|+---..+...+...- .+-+-+ +..+.+. .+....... ++. . T Consensus 1 ma~~~T-~~~d~i---~Pev~s~~v~~~~~~~~-------~~~~~~~~~~~l~g~------~G~tv~ip~-----~~~-~ 57 (274) T protein:vir:96 1 MAQGTT-KVSNLI---VPEVLAPMMQAELDKKL-------RFAQFADIDSTLVGQ------PGDTLTFPA-----FTY-S 57 (274) T ss_pred CCcccc-chhhhh---hhHHHHHHHHHHHHhhh-------hhcccccccccccCC------CCCEEEEEe-----ecc-C Confidence 111110 001100 00000000000010000 000000 0000100 000000000 000 0 Q ss_pred cccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHH-hhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_018861. 214 PYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLK-AQHGINAEKELADILSAEVALEIDRTIIEKANE 292 (465) Q Consensus 214 ~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLk-AiHGlDAe~EL~niLstEImlEINreii~~l~~ 292 (465) +-.....|....++.++.++= .+++-|.|+-.=+++ |+. +..+-|.-.+..+-++..+..+++++++..+.. 
T Consensus 58 g~~~~~~~g~~i~~~~it~~~--~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~ 130 (274) T protein:vir:96 58 GDAQVIAEGEKIPVDQIGTSK--REAKVRKIGKGTELT-----DEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG 130 (274) T ss_pred CCccccCCCCcCchhhcccce--eEEEEEeeeceeeec-----HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 111111222222355554443 344445554222333 332 223678999999999999999999999998844 Q ss_pred eeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccccccc Q lcl|NC_018861. 293 VATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGI 372 (465) Q Consensus 293 ~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~ 372 (465) ... .++ .+....+.+-.+..++.+.. ...++++|+|.+++.|.......|..+...... ... T Consensus 131 a~~-----~~~--~~~~~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~--~~~ 192 (274) T protein:vir:96 131 ATL-----TVE--ADITKLDGLQTAIDKFNDED---------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDN--IIV 192 (274) T ss_pred CCC-----CcC--cccccHHHHHHHHHHhcccC---------CCceEEEeCHHHHHHHHhccccccccccccccc--cee Confidence 211 111 11111244444444443321 256899999999999988765555433221111 111 Q ss_pred ceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecC-----c Q lcl|NC_018861. 373 KPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATP-----L 446 (465) Q Consensus 373 ~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP-----f 446 (465) ...+|++ .|++||+|+..|.+-..+-=+|.-. |+.. .+...-.--|+.+++-.|-...+||+ ..|| + T Consensus 193 ~g~ig~~-~G~~Vi~s~~~p~~t~~l~~~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~ 265 (274) T protein:vir:96 193 KGAFGEA-LGAVIVRSNKLNKGEALLAKKGAVK-----LITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKI 265 (274) T ss_pred eccccee-cCeeEEEcCCCCcceEEEEeCccee-----eeec-CCcccccccchhhcccEEEEeeEEEEEEEcCccEEEE Confidence 3357787 5789999999886442221122211 1110 11111223388899999888889998 6677 2 Q ss_pred ccccccceE Q lcl|NC_018861. 
447 HPEAFIRTF 455 (465) Q Consensus 447 ~~~~~~~~f 455 (465) +...-.+-. T Consensus 266 t~~~~~~~~ 274 (274) T protein:vir:96 266 TKGAGDEVM 274 (274) T ss_pred EcCcccccC Confidence 322222222 No 44 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=94.90 E-value=0.0032 Score=34.25 Aligned_cols=302 Identities=11% Similarity=0.025 Sum_probs=127.4 Q ss_pred CCccch---------------------hhhHHHhhhhhhccc---------cccChhhhhheehccccchhHHHhhhhhh Q lcl|NC_018861. 1 MADKYL---------------------LDESTKEKFITSNLY---------PNLNESEKNIMRTVLENQGNEVKMLMEST 50 (465) Q Consensus 1 ~~~~~~---------------------~~e~~~e~~~~~~~~---------~~~~~~~~~~~~~l~~n~~~~~~~i~est 50 (465) ++...+ .++...+.=...... +...++++.....|.++......-...++ T Consensus 34 ~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t 113 (397) T protein:vir:49 34 VSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDAS 113 (397) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccc Confidence 111111 110000000000000 00011333333333333322222222222 Q ss_pred hc-cccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 51 VT-GDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 51 ~t-~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) ++ |.+. -|.-+ .+++...++.+-.+++.++||++++|-+.-++ ....... 
T Consensus 114 ~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~----------------------- 166 (397) T protein:vir:49 114 GSDAGLT--IPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEK--WTDITGL----------------------- 166 (397) T ss_pred cccCccc--ccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEe--eccCCcc----------------------- Confidence 22 1111 13322 45566777778889999999999998654333 1110000 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) + .|. T Consensus 167 -------------a------~~v--------------------------------------------------------- 170 (397) T protein:vir:49 167 -------------A------NID--------------------------------------------------------- 170 (397) T ss_pred -------------e------eee--------------------------------------------------------- Confidence 0 000 Q ss_pred cccccccccchhhhccCCchhhc-ceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_018861. 208 LKNYTGPYATAAGEKLGKDMKEM-GISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTI 286 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM-~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINrei 286 (465) +.|..+++- ..++++++..+|.-+-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.| T Consensus 171 --------------~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~ai 232 (397) T protein:vir:49 171 --------------DEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAI 232 (397) T ss_pred --------------cCccccccccccceeeEEeeeeeEEeeehhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHH Confidence 001112221 1245566666666666678999999985 257889999999999999999999 Q ss_pred HhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccc Q lcl|NC_018861. 
287 IEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKID 366 (465) Q Consensus 287 i~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~ 366 (465) |.-.-.....+ +--..+....|+..|... +.....+|+++.....|+..- ...+.+.. T Consensus 233 ~~G~g~~~~~~---------~~~~~d~i~~~~~~l~~~---------~~~~a~~vmn~~~~~~l~~lk----d~~G~~l~ 290 (397) T protein:vir:49 233 LEAIAALPTKP---------TLTKWDDIIDLEAKVDPA---------IKQTSFFLTNTSGFTALKKVK----NALGDYLM 290 (397) T ss_pred Hhhcccccccc---------ccccHHHHHHHHHhhhhh---------hcCCCEEEEcHHHHHHHHHhh----cCCCceee Confidence 98763222111 111123344444444321 224467899999999998751 11112111 Q ss_pred ccccccceEEEEecCceEEEEeC--CCC-----cceEEEE---------EecCCCccceeEEecccccceeeeeCCCccc Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDN--FAE-----FDYCTVA---------YKGASNFDAGIFFAPYNITLQQNLTDPVSGQ 430 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~--~~~-----~dy~~vg---------~kg~~~~d~glfy~PY~~~~~~~~~dp~s~q 430 (465) .++ ......++| .|++|++.. ..+ ...+++| .++.-. +=+.+|.. .+-...+ T Consensus 291 ~~~-~~~~~~~~l-~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~ 358 (397) T protein:vir:49 291 ERD-VKSPTGYSI-DGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMS----LLSTNIGG------GAFETDT 358 (397) T ss_pred ccC-cCCCCCcee-cceeeEEecccccccccCCceeEEEeeccceEEEEeecceE----EEEecccc------chhhcCc Confidence 111 001122455 455776522 111 1112222 111111 11111110 0011122 Q ss_pred ceeeeeeeeee-eecC--c---------ccccccceEEE Q lcl|NC_018861. 431 PAMILNNRYDV-VATP--L---------HPEAFIRTFAV 457 (465) Q Consensus 431 p~~~~~tRY~l-~~nP--f---------~~~~~~~~f~~ 457 (465) =.+-...|++. 
..|| | .+....-+=|| T Consensus 359 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 359 TKVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred eeEEEEeeeCcEEecccceEEEEeecccCCCCCcccccC Confidence 23334445554 3333 1 01111111112 No 45 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=94.50 E-value=0.0042 Score=33.59 Aligned_cols=300 Identities=12% Similarity=0.028 Sum_probs=126.7 Q ss_pred ccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEec Q lcl|NC_018861. 21 YPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYV 99 (465) Q Consensus 21 ~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~ 99 (465) -..|+|-|.+.+..-.++ ..+++++ ...-+.+. -+++.+.+..+-..+|.+.||+++..-|.-.. T Consensus 1 ~~~~~e~~~~~~~~~~~~---------~~~~~~~-~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~---- 66 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQG---------RLAHVPS-DLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTV---- 66 (338) T ss_pred CcchHHhhhhhccccccc---------ceecccc-cccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe---- Confidence 122222222222111111 1111111 11222222 45567777778888999999998764443322 Q ss_pred CCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCcccccccc Q lcl|NC_018861. 100 GDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGD 179 (465) Q Consensus 100 ~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~ 179 (465) . +.. 
+...++ +..++ T Consensus 67 ~---~~~----------------------------------a~~v~~--------------------------~~~~~-- 81 (338) T protein:vir:78 67 K---RPE----------------------------------VGQVGV--------------------------GTSNE-- 81 (338) T ss_pred c---Ccc----------------------------------ceeecc--------------------------ccccc-- Confidence 0 000 000000 00000 Q ss_pred ccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHH Q lcl|NC_018861. 180 EMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLK 259 (465) Q Consensus 180 e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLk 259 (465) .+| |...++-.-+++.++...+..+-...+|-||.+|- T Consensus 82 ---------------------------------------~~E--g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds- 119 (338) T protein:vir:78 82 ---------------------------------------QRE--GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMN- 119 (338) T ss_pred ---------------------------------------ccc--cccccccccceeEEEEEEEEEEEeehhhHHHHhcC- Confidence 000 12233333345666666666666778999999983 Q ss_pred hhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe--------eeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 260 AQHGINAEKELADILSAEVALEIDRTIIEKANEV--------ATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQ 331 (465) Q Consensus 260 AiHGlDAe~EL~niLstEImlEINreii~~l~~~--------at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~ 331 (465) ..|.|++|.+-|+..|...||..+|..--.. .+....-........+. ....++..+..+...+... T Consensus 120 ---~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ 194 (338) T protein:vir:78 120 ---PSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTNVDYLQT--GTTPLLDRFLDGYDLVSAN 194 (338) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccccccccccc--cchhhHHHHHHHHHHhhhh Confidence 3688999999999999999999999633110 00000000001111111 1122233334443333322 Q ss_pred cccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcc---------eEEEE--- Q lcl|NC_018861. 
332 TRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD---------YCTVA--- 399 (465) Q Consensus 332 T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d---------y~~vg--- 399 (465) =. ...+.++++++....|...-.++.. .+.... ..+......++| .|++||++.+.|.+ -+++| T Consensus 195 ~~-~~~~~~~m~~~~~~~L~~~~~l~d~-~g~~l~-~~~~~~~~~~~l-~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs 270 (338) T protein:vir:78 195 TD-VDFNGWAADPRYRARLLRSQAYRDA-NGNVDP-TRINLAASAGDL-LGLPVQFGKAVGGDLGAATDSKVRVVGGDFS 270 (338) T ss_pred cc-ccceEEEEchHHHHHHHHHhhhccC-CCceee-cccccCCCCcee-eeeeEEEccccCccccccCCcccEEEEEecc Confidence 22 3667899999998888665433211 111110 111111223565 35699887764421 12222 Q ss_pred -----EecCCCccceeEEecccccceeeeeCCCc-----c-cceeee--eeeeee-eecC--cc-cccccceEE Q lcl|NC_018861. 400 -----YKGASNFDAGIFFAPYNITLQQNLTDPVS-----G-QPAMIL--NNRYDV-VATP--LH-PEAFIRTFA 456 (465) Q Consensus 400 -----~kg~~~~d~glfy~PY~~~~~~~~~dp~s-----~-qp~~~~--~tRY~l-~~nP--f~-~~~~~~~f~ 456 (465) ..+.-..+ .. .........||.. | +.-+++ ..|++. +.|| |. -..-..-|| T Consensus 271 ~~~~~~~~~~~i~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 271 QLKYGFADEIRVK----MS--DTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred eEEEEeecccEEE----Ee--ecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 21111100 00 0001111112211 1 112222 456664 4555 21 011111222 No 46 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=94.16 E-value=0.0052 Score=33.10 Aligned_cols=297 Identities=11% Similarity=0.044 Sum_probs=129.6 Q ss_pred CCcc-----chhh-hHHHhhhhhhc-------cccccC---hhhhhheehc--------------ccc-------c---h Q lcl|NC_018861. 
1 MADK-----YLLD-ESTKEKFITSN-------LYPNLN---ESEKNIMRTV--------------LEN-------Q---G 40 (465) Q Consensus 1 ~~~~-----~~~~-e~~~e~~~~~~-------~~~~~~---~~~~~~~~~l--------------~~n-------~---~ 40 (465) .++- .+.+ ++..+...... ...... ...+...... ..+ + . T Consensus 48 ~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 127 (400) T protein:vir:38 48 RAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPT 127 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhH Confidence 0000 0000 00000000000 000000 0000000000 000 0 0 Q ss_pred hHHHhhhhhhhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccc Q lcl|NC_018861. 41 NEVKMLMESTVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLK 118 (465) Q Consensus 41 ~~~~~i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~ 118 (465) +....+.++.++.+-...-|.-+ .++++.-++.+..+++.|.||++.++-+--++.. .+. T Consensus 128 ~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~-------------- 189 (400) T protein:vir:38 128 DASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANA----TTK-------------- 189 (400) T ss_pred HHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecC----CCc-------------- Confidence 01111222222222111123222 3556666777888999999999887644333310 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcc Q lcl|NC_018861. 119 TESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVY 198 (465) Q Consensus 119 ~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~ 198 (465) ..+.. 
| T Consensus 190 ----------------------------~~~~~----------------------------E------------------ 195 (400) T protein:vir:38 190 ----------------------------MVTVA----------------------------E------------------ 195 (400) T ss_pred ----------------------------ccccc----------------------------c------------------ Confidence 00000 0 Q ss_pred cccCccccccccccccccchhhhccCCchhhc-ceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHH Q lcl|NC_018861. 199 TNEALWLKVLKNYTGPYATAAGEKLGKDMKEM-GISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAE 277 (465) Q Consensus 199 ~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM-~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstE 277 (465) |...++. ..+++.++..++.-+-...+|-||.+|- ..|.+++|.+.|... T Consensus 196 -------------------------~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~ 246 (400) T protein:vir:38 196 -------------------------LEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDS----AIDLVGLIAQNGQQI 246 (400) T ss_pred -------------------------cccccccccccceeeEeehhheeeehhhHHHHHhhh----HHHHHHHHHHHHHHH Confidence 0001111 1235566677777777888999999985 357889999999999 Q ss_pred HHHHhhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc---- Q lcl|NC_018861. 278 VALEIDRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI---- 353 (465) Q Consensus 278 ImlEINreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~---- 353 (465) |...+|+-||........ .+-...+....++ .... ... .....|+++.....|+.. T Consensus 247 ~~~~~~~~i~~~~~~~~~----------~~~~~~~~~~~~~---~~~~-----~~~--~~a~~v~~~~~~~~l~~lkd~~ 306 (400) T protein:vir:38 247 KVNTTNGAVATLLKGFTA----------KTISSVDDLKHIN---NVDL-----DPA--YSRVIIASQSFYNFLDTVKDGN 306 (400) T ss_pred HHHHHHHhhhhccccccc----------cccccHHHHHHHH---Hhhh-----hhh--hCcEEEEcHHHHHHHHHhhccC Confidence 999999988876532111 1111112222221 1111 111 124567899999988764 Q ss_pred CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc--------cceeeeeC Q lcl|NC_018861. 
354 GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI--------TLQQNLTD 425 (465) Q Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~--------~~~~~~~d 425 (465) |...+.|+.. ....++| .|++|++..+.+.. - ..+.-++|+.+.- .......| T Consensus 307 G~~i~~~~~~---------~~~~~~l-~G~pv~~~~~~~~~-----~----~g~~~~~~gd~s~~~~~~~~~~~~~~~~~ 367 (400) T protein:vir:38 307 GRYLLQDSIL---------TPSGKSV-LGMPIAVVSDDTLG-----A----AGEAHAFLGDIKRAILFANRADFMVRWVD 367 (400) T ss_pred CCeeeecCcC---------CCCcccc-ccceeEEecccccC-----C----CCceEEEEEeccccEEEEeecceEEEEec Confidence 3333332211 1112455 56677776654321 1 1122334433321 22233456 Q ss_pred CCcccceeeeeeeeee-eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 426 PVSGQPAMILNNRYDV-VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 426 p~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) ...|+..+-...|++. +.+| ..|++ ++-+--+ T Consensus 368 ~~~~~~~~~~~~r~d~~~~~~-------~a~~~-l~~~~~a 400 (400) T protein:vir:38 368 DQIYGQFLQAGMRFGVSVADE-------KAGYF-LTYTPKA 400 (400) T ss_pred ccccceeEEEEEEeccEEecc-------cceEE-EEeecCC Confidence 6677777888889887 4555 22221 1111111 No 47 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=93.94 E-value=0.0059 Score=32.81 Aligned_cols=272 Identities=11% Similarity=-0.035 Sum_probs=108.1 Q ss_pred ccccchhhhheeeeeccC--cccccc----cccccc---ccccccCCccCCCcccccCccccccccccccccchhhhccC Q lcl|NC_018861. 154 QAGTDNIVNVLLRLESNS--TGSVAI----GDEMDK---AATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLG 224 (465) Q Consensus 154 ~TGPTgLifam~s~y~~~--~g~ea~----~~e~~t---~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg 224 (465) |++.+ ++...... .+...+ .++... ..+.-..-.....+... 
...+..-.+-..+.--..| T Consensus 1 m~~~~-----~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-----~~~~p~~~~~~~a~~v~Eg 70 (330) T protein:vir:77 1 MAGST-----VPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPT-----GISIPHWTGAVSASWTGEA 70 (330) T ss_pred Ccccc-----cchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCC-----ceEEEEEcCCcceeEecCC Confidence 33222 01110000 000000 011000 00000000000000000 0000000011111111235 Q ss_pred CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh---------hhheee Q lcl|NC_018861. 225 KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK---------ANEVAT 295 (465) Q Consensus 225 ~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~---------l~~~at 295 (465) ..+++-..++++++...|..+-+..+|-||.+|- ..|.|++|.+-|+..|...+|+.+|.- +...+. T Consensus 71 ~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~ 146 (330) T protein:vir:77 71 ERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETT 146 (330) T ss_pred CccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccc Confidence 6778888889999999999999999999999983 578999999999999999999999842 111111 Q ss_pred -eeeeeeeccCCc-ccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccc Q lcl|NC_018861. 296 -VCTDFDVNSADG-RWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAIN 369 (465) Q Consensus 296 -~~~~~~~~~~~~-~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~ 369 (465) ....-....... .+....+..|...+ ..+.+. ....+..+++++....|+.. |-..+.|..... 
T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~----~~~~~~--~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~---- 216 (330) T protein:vir:77 147 KVVSLADTNLTTASGPQGNAYLAVNNAL----SLLVNS--GKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTE---- 216 (330) T ss_pred ccceeecccccccccccchhHHHHHHHH----Hhhhhc--CCCccEEEEcHHHHHHHHHHhccCCceeecCccccc---- Confidence 011111111111 11112233333333 333322 23556789999999999764 211112111000 Q ss_pred cccceEEEEecCceEEEEeCCCCc--------------ceEEEEEecCCCc----cceeEEecccccceeeeeCCCccc- Q lcl|NC_018861. 370 SGIKPNVGKFDNRYDVIVDNFAEF--------------DYCTVAYKGASNF----DAGIFFAPYNITLQQNLTDPVSGQ- 430 (465) Q Consensus 370 ~~~~~~~G~l~~~~~vy~d~~~~~--------------dy~~vg~kg~~~~----d~glfy~PY~~~~~~~~~dp~s~q- 430 (465) +.....-++|. |++||++...+. .++++|-.+.... ++.+.+.- .........+-+-|+ T Consensus 217 ~~~~~~~~~l~-G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~-~~~~~~~~~~~~~f~~ 294 (330) T protein:vir:77 217 QVGAIREGRIL-GRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGE-EQGGVWVPKLISLWQH 294 (330) T ss_pred cccccCCceec-ceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecc-cccccccccccchhhc Confidence 00011224553 589998887542 2223333322211 11111110 000000000000010 Q ss_pred ceee--eeeeeee-eecC--c----------ccccc Q lcl|NC_018861. 431 PAMI--LNNRYDV-VATP--L----------HPEAF 451 (465) Q Consensus 431 p~~~--~~tRY~l-~~nP--f----------~~~~~ 451 (465) ..++ ...|++. ..+| | +|+-. T Consensus 295 ~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 295 NMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred CcEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 1111 1223333 2222 1 12222 No 48 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=93.48 E-value=0.0074 Score=32.26 Aligned_cols=256 Identities=11% Similarity=0.025 Sum_probs=110.3 Q ss_pred eeeeeccCccccccccccccccccccCCccC-C----CcccccCc--------ccccccccc--ccccchhhhccCCchh Q lcl|NC_018861. 
164 LLRLESNSTGSVAIGDEMDKAATFATKKATV-E----AVYTNEAL--------WLKVLKNYT--GPYATAAGEKLGKDMK 228 (465) Q Consensus 164 m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~-~----~~~~~~a~--------~~~~~~~~~--~~~~Ta~~E~lg~~f~ 228 (465) |....-.... -..+...+..-.... . ........ ..+...... .+-..+.--+.+..++ T Consensus 1 ma~~~~~~~~------~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~ 74 (304) T protein:vir:94 1 MATPTYTPGN------VILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQ 74 (304) T ss_pred Cccccccccc------ccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccc Confidence 3322111110 000100000000000 0 00000000 000000100 0111111112345677 Q ss_pred hcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe----------eeeee Q lcl|NC_018861. 229 EMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV----------ATVCT 298 (465) Q Consensus 229 EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~----------at~~~ 298 (465) +-.-++++++++.|..+-...+|-||.+|- .+|.|+.|.+-|...|...||+.+|.---.. ..... T Consensus 75 ~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 150 (304) T protein:vir:94 75 TSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAE 150 (304) T ss_pred cccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccc Confidence 888889999999999999999999999985 4788999999999999999999998632110 00000 Q ss_pred eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEE Q lcl|NC_018861. 299 DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGK 378 (465) Q Consensus 299 ~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 378 (465) ........+ ...+.-|+++-..+... 
+.....++|++.....|+..- ...+...-.+ ..|+ T Consensus 151 ~~~~~~~~~-------~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~lk----d~~G~~l~~~------~~~~ 211 (304) T protein:vir:94 151 EKGNVVTDT-------NNLYVDLSALMATIEDE--ELDPNGVLTTRSFRSKMRNAL----DANDRPLFDA------NGNE 211 (304) T ss_pred ccccccccc-------cchHHHHHHHHHHhhhc--cCCcCEEEEcHHHHHHHHHhh----ccCCcEeecC------CCcc Confidence 001111111 11222334444444332 234557899999999998641 1111111111 1256 Q ss_pred ecCceEEEEeCCCCcc------------eEEEEEecCCCccceeEEecccccce--eeeeCCC-----ccc-ceeee--e Q lcl|NC_018861. 379 FDNRYDVIVDNFAEFD------------YCTVAYKGASNFDAGIFFAPYNITLQ--QNLTDPV-----SGQ-PAMIL--N 436 (465) Q Consensus 379 l~~~~~vy~d~~~~~d------------y~~vg~kg~~~~d~glfy~PY~~~~~--~~~~dp~-----s~q-p~~~~--~ 436 (465) |. |++||++++.+.+ ++++|..+.-..+ ....... ..-.|++ -|+ .-++| . T Consensus 212 l~-G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~ 284 (304) T protein:vir:94 212 IM-GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRAT 284 (304) T ss_pred cc-ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEE Confidence 63 5799988876432 1223332221110 0000000 0111221 122 22333 4 Q ss_pred eeeee-eecCcccccccceEEEeeccc Q lcl|NC_018861. 437 NRYDV-VATPLHPEAFIRTFAVNLNNY 462 (465) Q Consensus 437 tRY~l-~~nPf~~~~~~~~f~~~~~~~ 462 (465) .||+. +.|| + .|++-..-- T Consensus 285 ~r~~~~v~~~---~----a~~~l~~a~ 304 (304) T protein:vir:94 285 MHIAYMNVKP---E----AFATLKPTE 304 (304) T ss_pred EEeccEeecc---c----ceEEEEecC Confidence 56666 3443 2 233111111 No 49 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=93.48 E-value=0.0074 Score=32.26 Aligned_cols=256 Identities=11% Similarity=0.025 Sum_probs=110.3 Q ss_pred eeeeeccCccccccccccccccccccCCccC-C----CcccccCc--------ccccccccc--ccccchhhhccCCchh Q lcl|NC_018861. 
164 LLRLESNSTGSVAIGDEMDKAATFATKKATV-E----AVYTNEAL--------WLKVLKNYT--GPYATAAGEKLGKDMK 228 (465) Q Consensus 164 m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~-~----~~~~~~a~--------~~~~~~~~~--~~~~Ta~~E~lg~~f~ 228 (465) |....-.... -..+...+..-.... . ........ ..+...... .+-..+.--+.+..++ T Consensus 1 ma~~~~~~~~------~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~ 74 (304) T protein:vir:10 1 MATPTYTPGN------VILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQ 74 (304) T ss_pred Cccccccccc------ccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccc Confidence 3322111110 000100000000000 0 00000000 000000100 0111111112345677 Q ss_pred hcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe----------eeeee Q lcl|NC_018861. 229 EMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV----------ATVCT 298 (465) Q Consensus 229 EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~----------at~~~ 298 (465) +-.-++++++++.|..+-...+|-||.+|- .+|.|+.|.+-|...|...||+.+|.---.. ..... T Consensus 75 ~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 150 (304) T protein:vir:10 75 TSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAE 150 (304) T ss_pred cccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccc Confidence 888889999999999999999999999985 4788999999999999999999998632110 00000 Q ss_pred eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEE Q lcl|NC_018861. 299 DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGK 378 (465) Q Consensus 299 ~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 378 (465) ........+ ...+.-|+++-..+... 
+.....++|++.....|+..- ...+...-.+ ..|+ T Consensus 151 ~~~~~~~~~-------~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~lk----d~~G~~l~~~------~~~~ 211 (304) T protein:vir:10 151 EKGNVVTDT-------NNLYVDLSALMATIEDE--ELDPNGVLTTRSFRSKMRNAL----DANDRPLFDA------NGNE 211 (304) T ss_pred ccccccccc-------cchHHHHHHHHHHhhhc--cCCcCEEEEcHHHHHHHHHhh----ccCCcEeecC------CCcc Confidence 001111111 11222334444444332 234557899999999998641 1111111111 1256 Q ss_pred ecCceEEEEeCCCCcc------------eEEEEEecCCCccceeEEecccccce--eeeeCCC-----ccc-ceeee--e Q lcl|NC_018861. 379 FDNRYDVIVDNFAEFD------------YCTVAYKGASNFDAGIFFAPYNITLQ--QNLTDPV-----SGQ-PAMIL--N 436 (465) Q Consensus 379 l~~~~~vy~d~~~~~d------------y~~vg~kg~~~~d~glfy~PY~~~~~--~~~~dp~-----s~q-p~~~~--~ 436 (465) |. |++||++++.+.+ ++++|..+.-..+ ....... ..-.|++ -|+ .-++| . T Consensus 212 l~-G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~ 284 (304) T protein:vir:10 212 IM-GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRAT 284 (304) T ss_pred cc-ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEE Confidence 63 5799988876432 1223332221110 0000000 0111221 122 22333 4 Q ss_pred eeeee-eecCcccccccceEEEeeccc Q lcl|NC_018861. 437 NRYDV-VATPLHPEAFIRTFAVNLNNY 462 (465) Q Consensus 437 tRY~l-~~nPf~~~~~~~~f~~~~~~~ 462 (465) .||+. +.|| + .|++-..-- T Consensus 285 ~r~~~~v~~~---~----a~~~l~~a~ 304 (304) T protein:vir:10 285 MHIAYMNVKP---E----AFATLKPTE 304 (304) T ss_pred EEeccEeecc---c----ceEEEEecC Confidence 56666 3443 2 233111111 No 50 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=93.29 E-value=0.0081 Score=32.05 Aligned_cols=261 Identities=14% Similarity=0.047 Sum_probs=118.6 Q ss_pred ccccccccccccccccccccccc----hhhhheeeeeccCccccccccc--cccccccccCCccCCCcccccCccccccc Q lcl|NC_018861. 
136 VSFKTATTVKGKIVYSEKQAGTD----NIVNVLLRLESNSTGSVAIGDE--MDKAATFATKKATVEAVYTNEALWLKVLK 209 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPT----gLifam~s~y~~~~g~ea~~~e--~~t~~s~~~~~~~~~~~~~~~a~~~~~~~ 209 (465) ++... +.....--|. .+...++.. ..+.+- .+..+++.. +....... + T Consensus 1 m~~~~--------T~l~d~i~Pev~~~~v~~~~~~~-------l~~~~~~~~~~~l~g~~------G~tv~iP~-----~ 54 (274) T protein:vir:95 1 MAQGM--------TKLTNQIVPEVLAPMMQAELEKK-------LRFASFAEIDNTLVGQP------GDTLTFPA-----F 54 (274) T ss_pred CCcce--------eehhheechHHHHHHHHHHHHhh-------hhccccceecccccCCC------CCEEEeee-----e Confidence 11100 1111111111 011001110 000000 011111100 00000000 0 Q ss_pred cccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhC-CCHHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_018861. 210 NYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHG-INAEKELADILSAEVALEIDRTIIE 288 (465) Q Consensus 210 ~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG-lDAe~EL~niLstEImlEINreii~ 288 (465) +.. +-.+...|..+-+..++..+= .+++-+-|+ |+ |.+ -|+-+..+ -|.-.|..+-++..+..+++++++. T Consensus 55 ~~i-g~a~~~~~g~~i~~~~lt~~~--~~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~ 126 (274) T protein:vir:95 55 IYS-GDAKVVAEGEKIPTDILETKK--REAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLE 126 (274) T ss_pred cCC-CccccccCCCccchhhcccce--eEEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 100 111111221122344444333 333334443 22 222 26666654 5899999999999999999999998 Q ss_pred hhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccc Q lcl|NC_018861. 289 KANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAI 368 (465) Q Consensus 289 ~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~ 368 (465) .+.... .++. ......+.+-....++..+. ..+++++++|++++.|...+...|..+..... 
T Consensus 127 ~l~~a~-----~~~~--~~~~~~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~-- 188 (274) T protein:vir:95 127 ALKSAK-----LTVE--ADITKLTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNFTRATELGD-- 188 (274) T ss_pred HHhccc-----cccc--ccccCHHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccccccccccc-- Confidence 884321 1111 11111233444444443321 25789999999999999987665544322110 Q ss_pred ccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecC-- Q lcl|NC_018861. 369 NSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATP-- 445 (465) Q Consensus 369 ~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP-- 445 (465) +--....+|++ .|++||+|...|..-..+--+|+- .||.. -+...-.--||.+++-.+-..-+||+ ..|| T Consensus 189 ~~~~~G~ig~~-~G~~Vi~s~~~~~~t~~l~~~gA~-----~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:95 189 DVIVKGAFGEA-LGAVIVRSNKLEAGTAILAKKGAV-----KLITK-RDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred cceecccccee-cCeEEEEeCCCCCceEEEEeccce-----eeeec-CCcccccccccccccCEEEEeEEEEEEEEcCCc Confidence 11113357887 579999999887543222112211 11111 11111112389999999988999998 6777 Q ss_pred ---cccccccceE Q lcl|NC_018861. 446 ---LHPEAFIRTF 455 (465) Q Consensus 446 ---f~~~~~~~~f 455 (465) ++.....+-. T Consensus 262 ~v~~tk~~~~~~~ 274 (274) T protein:vir:95 262 AVKITKGSGSLEM 274 (274) T ss_pred EEEEEcCCccccC Confidence 3333333332 No 51 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=93.29 E-value=0.0081 Score=32.05 Aligned_cols=261 Identities=14% Similarity=0.047 Sum_probs=118.6 Q ss_pred ccccccccccccccccccccccc----hhhhheeeeeccCccccccccc--cccccccccCCccCCCcccccCccccccc Q lcl|NC_018861. 
136 VSFKTATTVKGKIVYSEKQAGTD----NIVNVLLRLESNSTGSVAIGDE--MDKAATFATKKATVEAVYTNEALWLKVLK 209 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPT----gLifam~s~y~~~~g~ea~~~e--~~t~~s~~~~~~~~~~~~~~~a~~~~~~~ 209 (465) ++... +.....--|. .+...++.. ..+.+- .+..+++.. +....... + T Consensus 1 m~~~~--------T~l~d~i~Pev~~~~v~~~~~~~-------l~~~~~~~~~~~l~g~~------G~tv~iP~-----~ 54 (274) T protein:vir:96 1 MAQGM--------TKLTNQIVPEVLAPMMQAELEKK-------LRFASFAEIDNTLVGQP------GDTLTFPA-----F 54 (274) T ss_pred CCcce--------eehhheechHHHHHHHHHHHHhh-------hhccccceecccccCCC------CCEEEeee-----e Confidence 11100 1111111111 011001110 000000 011111100 00000000 0 Q ss_pred cccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhC-CCHHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_018861. 210 NYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHG-INAEKELADILSAEVALEIDRTIIE 288 (465) Q Consensus 210 ~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG-lDAe~EL~niLstEImlEINreii~ 288 (465) +.. +-.+...|..+-+..++..+= .+++-+-|+ |+ |.+ -|+-+..+ -|.-.|..+-++..+..+++++++. T Consensus 55 ~~i-g~a~~~~~g~~i~~~~lt~~~--~~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~ 126 (274) T protein:vir:96 55 IYS-GDAKVVAEGEKIPTDILETKK--REAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLE 126 (274) T ss_pred cCC-CccccccCCCccchhhcccce--eEEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 100 111111221122344444333 333334443 22 222 26666654 5899999999999999999999998 Q ss_pred hhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccc Q lcl|NC_018861. 289 KANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAI 368 (465) Q Consensus 289 ~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~ 368 (465) .+.... .++. ......+.+-....++..+. ..+++++++|++++.|...+...|..+..... 
T Consensus 127 ~l~~a~-----~~~~--~~~~~~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~-- 188 (274) T protein:vir:96 127 ALKSAK-----LTVE--ADITKLTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNFTRATELGD-- 188 (274) T ss_pred HHhccc-----cccc--ccccCHHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccccccccccc-- Confidence 884321 1111 11111233444444443321 25789999999999999987665544322110 Q ss_pred ccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecC-- Q lcl|NC_018861. 369 NSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATP-- 445 (465) Q Consensus 369 ~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP-- 445 (465) +--....+|++ .|++||+|...|..-..+--+|+- .||.. -+...-.--||.+++-.+-..-+||+ ..|| T Consensus 189 ~~~~~G~ig~~-~G~~Vi~s~~~~~~t~~l~~~gA~-----~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:96 189 DVIVKGAFGEA-LGAVIVRSNKLEAGTAILAKKGAV-----KLITK-RDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred cceecccccee-cCeEEEEeCCCCCceEEEEeccce-----eeeec-CCcccccccccccccCEEEEeEEEEEEEEcCCc Confidence 11113357887 579999999887543222112211 11111 11111112389999999988999998 6777 Q ss_pred ---cccccccceE Q lcl|NC_018861. 446 ---LHPEAFIRTF 455 (465) Q Consensus 446 ---f~~~~~~~~f 455 (465) ++.....+-. T Consensus 262 ~v~~tk~~~~~~~ 274 (274) T protein:vir:96 262 AVKITKGSGSLEM 274 (274) T ss_pred EEEEEcCCccccC Confidence 3333333332 No 52 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=93.26 E-value=0.0082 Score=32.02 Aligned_cols=323 Identities=13% Similarity=0.002 Sum_probs=127.2 Q ss_pred CCccchhhhHHHhhhhhhcc------ccccC------------hhhhhheehccccc-hhHHHhhhh----h------hh Q lcl|NC_018861. 
1 MADKYLLDESTKEKFITSNL------YPNLN------------ESEKNIMRTVLENQ-GNEVKMLME----S------TV 51 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~------~~~~~------------~~~~~~~~~l~~n~-~~~~~~i~e----s------t~ 51 (465) +.+..... .+.+.-.-... +.... +..+..+....... ..+...+.+ . +. T Consensus 50 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (419) T protein:vir:94 50 AARAALLR-TAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTI 128 (419) T ss_pred HHHHHHHH-HHHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccc Confidence 11111111 11110000000 00000 00111111111110 001111111 0 11 Q ss_pred ccccccccchhhh-h-hhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccc Q lcl|NC_018861. 52 TGDIAKFTPILVP-V-IRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNY 129 (465) Q Consensus 52 t~~v~~~~P~l~~-l-~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~ 129 (465) +......-|.+++ . ..+.-..++..+++.|.||++++. .-+| ... .+ . T Consensus 129 ~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~--~~~~--~~~--~~-~----------------------- 178 (419) T protein:vir:94 129 TNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVL--EYIR--DTS--GT-A----------------------- 178 (419) T ss_pred cCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCce--eeee--ecc--cc-c----------------------- Confidence 1111122344432 1 122223345678899999987652 2222 000 00 0 Q ss_pred ccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccc Q lcl|NC_018861. 130 TGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLK 209 (465) Q Consensus 130 Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~ 209 (465) .... .... 
T Consensus 179 -------~~~~--~~~~--------------------------------------------------------------- 186 (419) T protein:vir:94 179 -------GAGS--TWNK--------------------------------------------------------------- 186 (419) T ss_pred -------cccc--cCcc--------------------------------------------------------------- Confidence 0000 0000 Q ss_pred cccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh Q lcl|NC_018861. 210 NYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK 289 (465) Q Consensus 210 ~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~ 289 (465) +.-.+| |..+++...++++++..+|.=+-...+|-||.||.- +.+++|.+-|+..|...+|+.||.. T Consensus 187 ------a~~v~E--g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~aii~G 253 (419) T protein:vir:94 187 ------AAVVPE--GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLNG 253 (419) T ss_pred ------cceecC--CccccccccceeeEEeeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc Confidence 000011 234555556677777777777777889999999952 4689999999999999999999852 Q ss_pred hhh-----eeeeeeeeeeccCCccc---HHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCC Q lcl|NC_018861. 290 ANE-----VATVCTDFDVNSADGRW---FIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPA 361 (465) Q Consensus 290 l~~-----~at~~~~~~~~~~~~~~---~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~ 361 (465) =-. ..+...-.......+.. ....+..|...+..+.. .+...+.+|+++.....|...-- ... T Consensus 254 ~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~------~~~~~~~~v~n~~~~~~l~~~k~---~~~ 324 (419) T protein:vir:94 254 NGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEI------AGFPPDGVVVHPQDWESIELDQA---PGS 324 (419) T ss_pred cCcccccceecccccccccccccccccccchhHHHHHHHHHhhhh------ccCCCCEEEEcHHHHHHHHHHhh---cCC Confidence 100 00000000111111100 11223444443333322 23356789999999988864411 000 Q ss_pred cccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCc---c-cceee--e Q lcl|NC_018861. 
362 GSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVS---G-QPAMI--L 435 (465) Q Consensus 362 ~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s---~-qp~~~--~ 435 (465) +......+ -.....++|. |++|+++...|..-+++|--. -+|--+.-..+..-+++.. | +..++ + T Consensus 325 ~~~~~~~~-~~~~~~~~l~-G~pV~~~~~~~~~~~~~gd~~-------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~ 395 (419) T protein:vir:94 325 GVFRVIAN-VQGEATPRIW-GLNVVSTVAIAQGTALVGGFR-------QGATLWSRQGITVLMTDSHADFFTANTLVILA 395 (419) T ss_pred CceeecCC-cccCCCcccc-ceeeEEcCCCCCccEEEeecc-------ceEEEEEecceEEEEeccccchhhcCcEEEEE Confidence 00000000 0011224553 679999988776555554210 0000011001111112211 2 23333 4 Q ss_pred eeeeeeeecCccccccc-ceEE-Eee Q lcl|NC_018861. 436 NNRYDVVATPLHPEAFI-RTFA-VNL 459 (465) Q Consensus 436 ~tRY~l~~nPf~~~~~~-~~f~-~~~ 459 (465) ..|++.. |..+.+.. .+|+ +.+ T Consensus 396 ~~r~d~~--v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 396 EFRANLA--VYQPKAFVRVTFAAATT 419 (419) T ss_pred EEeeccE--EeccccEEEEEeccCCC Confidence 4566653 24444332 2343 222 No 53 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=93.04 E-value=0.009 Score=31.79 Aligned_cols=308 Identities=11% Similarity=0.018 Sum_probs=132.2 Q ss_pred CCccc------hh------hhHHHhhhhhhcccccc------C----------hhhhhheehccccchhHHHhhhhh-hh Q lcl|NC_018861. 1 MADKY------LL------DESTKEKFITSNLYPNL------N----------ESEKNIMRTVLENQGNEVKMLMES-TV 51 (465) Q Consensus 1 ~~~~~------~~------~e~~~e~~~~~~~~~~~------~----------~~~~~~~~~l~~n~~~~~~~i~es-t~ 51 (465) +++.. +. +.++.+.=......+.- . ++++.....+...+. ....+. 
.+ T Consensus 39 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 115 (395) T protein:vir:43 39 MNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHR---VSMPRSAIT 115 (395) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchhhhHHHHHHHHHHHHHHHHHhhhhhh---hhhhhhhhc Confidence 00000 00 00000000000000000 0 011111111111110 000001 11 Q ss_pred ccccc---cccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 52 TGDIA---KFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 52 t~~v~---~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) +.+.. ...|.+. .++.+.-+..+..+++.++||.+++.-+ .| +..... T Consensus 116 ~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~--~~--~~~~~~------------------------ 167 (395) T protein:vir:43 116 SIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEY--VR--ETGFVN------------------------ 167 (395) T ss_pred ccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEE--EE--EecCCC------------------------ Confidence 11111 1223333 5666677778888999999998765322 11 100000 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) .+ .|. T Consensus 168 ------------~a------~~v--------------------------------------------------------- 172 (395) T protein:vir:43 168 ------------NA------APV--------------------------------------------------------- 172 (395) T ss_pred ------------ce------eee--------------------------------------------------------- Confidence 00 000 Q ss_pred cccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_018861. 
208 LKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTII 287 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii 287 (465) +| |...++-..++++++...|.-+-...+|-||.||.- +.++.|.+-|+..+...+|+.|| T Consensus 173 ------------~E--~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~v~~~la~a~~~~~d~~~l 233 (395) T protein:vir:43 173 ------------SE--GTQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS-----ALQSYIDARARYGLMLVEECQLL 233 (395) T ss_pred ------------cC--CccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHH Confidence 00 112233334567777777777777889999999852 46799999999999999999988 Q ss_pred hh---------hhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccc Q lcl|NC_018861. 288 EK---------ANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVL 358 (465) Q Consensus 288 ~~---------l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~ 358 (465) .. |...+.+. .. ...+.. ....++..|..+...+. ..+.....+|+++.....|...- T Consensus 234 ~G~g~~~~~~Gi~~~~~~~---~~-~~~~~~---~~~~~~~~i~~~~~~~~--~~~~~~~~~vmn~~~~~~l~~lk---- 300 (395) T protein:vir:43 234 YGNGTGANLHGIIPQAQAY---AP-PSGVVV---TAEQRIDRIRLAILQAQ--LAEFPASGIVLNPIDWALIELNK---- 300 (395) T ss_pred hccCCCCcccccccccccc---cc-cccccc---ccchhHHHHHHHHHhhc--cccCCCcEEEEcHHHHHHHHHhh---- Confidence 53 21111100 00 000000 01112222333333332 23446678999999998886541 Q ss_pred cCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCC---cc-cceee Q lcl|NC_018861. 359 SPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPV---SG-QPAMI 434 (465) Q Consensus 359 ~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~---s~-qp~~~ 434 (465) ...+..... + -...-.++|. |++|+++++.|.+-+++|--.. ..+.. ...+... -+++. 
.| +..++ T Consensus 301 d~~G~~i~~-~-~~~~~~~~l~-G~pVv~~~~~~~~~~~~gd~~~-----~~~~~-~~~~~~i-~~~~~~~~~f~~~~~~ 370 (395) T protein:vir:43 301 DAENRYIIG-S-PQNGTTPTLW-RLPVVETQAITQDEFLTGAFSL-----GAQIF-DRMDIEV-LVSTENDKDFENNMVT 370 (395) T ss_pred ccCCceecc-c-cccCCCceec-ceeeEEcCCCCCCcEEEEeccc-----eEEEE-EecceEE-EEeccccchhhcCcEE Confidence 111121111 1 0011235664 5899999998876666543111 00000 0111111 12221 23 33444 Q ss_pred ee--eeeeeeecCcccccccceEE-Eeeccc Q lcl|NC_018861. 435 LN--NRYDVVATPLHPEAFIRTFA-VNLNNY 462 (465) Q Consensus 435 ~~--tRY~l~~nPf~~~~~~~~f~-~~~~~~ 462 (465) |. .|++..+ ..+. .|+ ++++.. T Consensus 371 ~r~~~r~d~~v--~~~~----a~~~~~~taa 395 (395) T protein:vir:43 371 IRAEERLAFAV--YRPE----AFVTGSLTAS 395 (395) T ss_pred EEEEEeeccEE--eccc----ceEEEEeccC Confidence 44 4666632 3333 355 555555 No 54 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=92.91 E-value=0.0095 Score=31.67 Aligned_cols=300 Identities=14% Similarity=0.082 Sum_probs=133.3 Q ss_pred CC-ccchhhhHHHhhhhhhccc-cccC-------------------------------hhhhhheehccccchhHHHhhh Q lcl|NC_018861. 1 MA-DKYLLDESTKEKFITSNLY-PNLN-------------------------------ESEKNIMRTVLENQGNEVKMLM 47 (465) Q Consensus 1 ~~-~~~~~~e~~~e~~~~~~~~-~~~~-------------------------------~~~~~~~~~l~~n~~~~~~~i~ 47 (465) .. -+.+.+|...|.|.-.... +.|+ +++++....|- +.+++-+. T Consensus 16 ~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~---~~~~~a~~ 92 (371) T protein:vir:81 16 KEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIR---TRFRNAMS 92 (371) T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHH---HHHHHhhc Confidence 00 0112223223333222111 0000 01111100000 11122233 Q ss_pred hhhhc-cccccccch-hh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 
48 ESTVT-GDIAKFTPI-LV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 48 est~t-~~v~~~~P~-l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) +++++ |.+ .=|. +. -+++.+-++.+-.+++.+.||++.++-+.-.+ .... .. T Consensus 93 ~~t~~~gg~--~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~~-~~-------------------- 147 (371) T protein:vir:81 93 EGSNQDGGY--TVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQQ-TG-------------------- 147 (371) T ss_pred cCCCccCce--eecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC-cc-------------------- Confidence 33221 111 1233 22 46677778888899999999988876654333 1100 00 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcc Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALW 204 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~ 204 (465) ..+.. T Consensus 148 ----------------------a~~v~----------------------------------------------------- 152 (371) T protein:vir:81 148 ----------------------FVEVA----------------------------------------------------- 152 (371) T ss_pred ----------------------eeeec----------------------------------------------------- Confidence 00000 Q ss_pred ccccccccccccchhhhccCCchhhc-ceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_018861. 205 LKVLKNYTGPYATAAGEKLGKDMKEM-GISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEID 283 (465) Q Consensus 205 ~~~~~~~~~~~~Ta~~E~lg~~f~EM-~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEIN 283 (465) | |...++- ..+++++++.+|.-+-...+|-||.+|-. 
.|.++.|.+.|...|..-+| T Consensus 153 ----------------E--g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~a~~~~~~ 210 (371) T protein:vir:81 153 ----------------E--GAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST----EAIVNTLVRWIGDESRVTRN 210 (371) T ss_pred ----------------c--ccccccccccceeeEEeeeeEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHH Confidence 0 1111211 12455566666666666789999999843 57889999999999999999 Q ss_pred HHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----Cccccc Q lcl|NC_018861. 284 RTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLS 359 (465) Q Consensus 284 reii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~ 359 (465) +.||......+. .+--..+....+ +... + ...+.....+++++.....|+.. |...+. T Consensus 211 ~~i~~g~g~~~~----------~~~~~~~~i~~~---~~~~---l--~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~ 272 (371) T protein:vir:81 211 GLIINVLNTKAK----------TAIADLDGLKQI---INVQ---L--DPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQ 272 (371) T ss_pred HHHHhhcccccc----------cccccHHHHHHH---HHhh---c--chhhhcCCEEEEcHHHHHHHHHhhccCCCeeee Confidence 998886632211 111111222222 1111 0 11222334688999999998764 222121 Q ss_pred CCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc-------cceeeeeCCCc---c Q lcl|NC_018861. 360 PAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI-------TLQQNLTDPVS---G 429 (465) Q Consensus 360 ~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~-------~~~~~~~dp~s---~ 429 (465) |.. .....|+| .|++||+..+.|...-.++ +...-..-++|+.+.. .-+...+++.. | T Consensus 273 ~~~---------~~~~~~~l-~G~pV~~~~~~~~~~~~~~--~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f 340 (371) T protein:vir:81 273 PSI---------SSPTGRQL-LGLPVVIVSNKVLANRVDG--GTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAF 340 (371) T ss_pred ccc---------CCCCCcee-cceeEEEecccccCccccc--cccCCcceEEEEehhceEEEEeecceEEEEeccccchh Confidence 111 11233677 4678888777654432221 1111223355554321 11112223332 2 Q ss_pred ---cceeeeeeeeee-eecCcccccccceEE-Eeeccc Q lcl|NC_018861. 
430 ---QPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNY 462 (465) Q Consensus 430 ---qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~ 462 (465) +=.+-...|++. +.+| ..|+ +.++-- T Consensus 341 ~~~~v~~~~~~r~d~~~~~~-------~a~~~~~~~~A 371 (371) T protein:vir:81 341 ETDATLWRAIERMDVKMRDD-------EAFVFGEVQLA 371 (371) T ss_pred hcCceEEEEEEeeccEEecc-------cceEEEEEecC Confidence 234444556665 3443 2232 222111 No 55 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=92.87 E-value=0.0097 Score=31.63 Aligned_cols=310 Identities=12% Similarity=0.017 Sum_probs=131.4 Q ss_pred CCccchhhhHH---Hhh----hhhhccc--c--cc------Chhhhhheehccccchh---HHHhhhh-hh---hccccc Q lcl|NC_018861. 1 MADKYLLDEST---KEK----FITSNLY--P--NL------NESEKNIMRTVLENQGN---EVKMLME-ST---VTGDIA 56 (465) Q Consensus 1 ~~~~~~~~e~~---~e~----~~~~~~~--~--~~------~~~~~~~~~~l~~n~~~---~~~~i~e-st---~t~~v~ 56 (465) +.+-.=.++++ .++ =.+.... + .. ++..+.......+..+. +.+.... .. ++.+-. T Consensus 44 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 123 (390) T protein:vir:10 44 FATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGA 123 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhccccccccc Confidence 00000000011 110 0000000 0 00 00111111111111110 1111111 11 111111 Q ss_pred cccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccc Q lcl|NC_018861. 57 KFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIE 135 (465) Q Consensus 57 ~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~ 135 (465) ..-|.++ .++.+.-.+..-.++|.|.||++++.-+. | ....... 
T Consensus 124 ~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~~~------------------------------- 168 (390) T protein:vir:10 124 LTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV--Q--ETGFVNN------------------------------- 168 (390) T ss_pred ccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE--E--EecCCcc------------------------------- Confidence 2334333 55666666667778899999887652221 1 1000000 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) + .|. T Consensus 169 -----a------~~v----------------------------------------------------------------- 172 (390) T protein:vir:10 169 -----A------AIV----------------------------------------------------------------- 172 (390) T ss_pred -----e------eee----------------------------------------------------------------- Confidence 0 000 Q ss_pred cchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhe-- Q lcl|NC_018861. 216 ATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEV-- 293 (465) Q Consensus 216 ~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~-- 293 (465) +| |...++-..+++++++.+|..+....+|-||.||- .|.++.|.+-|+..|...||+.||.-=-.. T Consensus 173 ----~E--g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-----~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~ 241 (390) T protein:vir:10 173 ----AE--GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-----PQLASYMNNRLIRGLKVKEDAEILRGTGANDG 241 (390) T ss_pred ----cC--CccccccccceeEEEEeeEEEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcc Confidence 00 11233334467788888888888899999999984 257799999999999999999998531000 Q ss_pred -----eeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccc Q lcl|NC_018861. 
294 -----ATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAI 368 (465) Q Consensus 294 -----at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~ 368 (465) ...+..-......+.-..+....++. .+. ..+...+.+|+++.....|...- ...+.+.... T Consensus 242 p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~l~--~~~~~~~~~v~n~~~~~~L~~lk----d~~g~~l~~~ 308 (390) T protein:vir:10 242 LLGLIPQATTYAAPTTIAGATRVDQLRLAML-------QAS--LAEYPASGIVINPIDWAAIELAK----DANNQYLIGN 308 (390) T ss_pred ccccccccccccccccccccchHHHHHHHHH-------hhc--cccCCCCEEEEcHHHHHHHHHhh----cCCCceeecC Confidence 00000000111111111122222222 221 22346678899999999887542 1111111111 Q ss_pred ccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcc-cceee--eeeeeee-eec Q lcl|NC_018861. 369 NSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSG-QPAMI--LNNRYDV-VAT 444 (465) Q Consensus 369 ~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~-qp~~~--~~tRY~l-~~n 444 (465) .- ..-.++| .|++|++++..|.+-+++|--- .+++.+...-.......+..-| +..+. ...|++. +.+ T Consensus 309 ~~--~~~~~~l-~G~pv~~~~~~p~~~~~~gdf~-----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~ 380 (390) T protein:vir:10 309 AR--GTLTPTL-WGLPVVATQAMAPGEFLVGAFD-----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYR 380 (390) T ss_pred Cc--CcCCcee-cceeeEEcCCCCCCcEEEEecc-----ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEec Confidence 00 0112345 5779999999887766665310 1122221111111111111111 23333 3457766 444 Q ss_pred Ccccccc-cceEE Q lcl|NC_018861. 445 PLHPEAF-IRTFA 456 (465) Q Consensus 445 Pf~~~~~-~~~f~ 456 (465) | .+. 
..+|| T Consensus 381 ~---~a~~~~~~a 390 (390) T protein:vir:10 381 P---EALISGSFA 390 (390) T ss_pred c---ccEEEEEeC Confidence 4 322 23444 No 56 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=92.25 E-value=0.012 Score=31.08 Aligned_cols=291 Identities=10% Similarity=0.036 Sum_probs=135.4 Q ss_pred cccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCccccccccccc Q lcl|NC_018861. 36 LENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIV 114 (465) Q Consensus 36 ~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf 114 (465) .+-..+.+.+...++ +..-...-|.++ .+++++..+.+-.+++-+.||++++.-|- +... +.. T Consensus 1 ~g~~~e~~~~~~~~t-~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~-~~~---------- 64 (397) T protein:vir:23 1 MGFSADHSQIAQTKD-TMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIP----HWTG-DVS---------- 64 (397) T ss_pred CCcCHHHHHHhhccC-CCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEcC-Ccc---------- Confidence 233333333333322 222222335444 44566666667777888888887652211 0000 000 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccC Q lcl|NC_018861. 115 LKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATV 194 (465) Q Consensus 115 ~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~ 194 (465) +.|. T Consensus 65 --------------------------------a~wv-------------------------------------------- 68 (397) T protein:vir:23 65 --------------------------------AQWI-------------------------------------------- 68 (397) T ss_pred --------------------------------eEEe-------------------------------------------- Confidence 0000 Q ss_pred CCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHH Q lcl|NC_018861. 
195 EAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADIL 274 (465) Q Consensus 195 ~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niL 274 (465) +| |..+++-..+++++++..|..+-.-.+|-||.+|-. .|.|++|.+.| T Consensus 69 -------------------------~E--g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l 117 (397) T protein:vir:23 69 -------------------------GE--GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKV 117 (397) T ss_pred -------------------------cC--CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHH Confidence 01 122333344577888888888888899999999863 67899999999 Q ss_pred HHHHHHHhhHHHHhhhhheeeeeeeee----eccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHH Q lcl|NC_018861. 275 SAEVALEIDRTIIEKANEVATVCTDFD----VNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATIL 350 (465) Q Consensus 275 stEImlEINreii~~l~~~at~~~~~~----~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L 350 (465) ...|...+|+.+|.--........-.. .....+....+....+ ...+.. .......++++++....| T Consensus 118 ~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~l~~--~~~~~a~~vmn~~~~~~L 188 (397) T protein:vir:23 118 ATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLGVSG-------LTKLVT--DGKKWTHTLLDDTVEPVL 188 (397) T ss_pred HHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhHHHHHH-------HHhhhh--cccCCCEEEEcHHHHHHH Confidence 999999999999964321100000000 0011111111111111 112221 233567789999999999 Q ss_pred Hhc----CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcce----------EEEEEecCCCcc----ce--- Q lcl|NC_018861. 351 DEI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDY----------CTVAYKGASNFD----AG--- 409 (465) Q Consensus 351 ~~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy----------~~vg~kg~~~~d----~g--- 409 (465) +.. |...+.|....... 
.....|+| .+++|++++..+.+- +++|..+.-..+ ++ T Consensus 189 ~~lkd~~G~~i~~~~~~~~~~----~~~~~~tl-~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~ 263 (397) T protein:vir:23 189 NGSVDANGRPLFVESTYESLT----TPFREGRI-LGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNL 263 (397) T ss_pred HHhhccCCceeeccccccccc----ccccCcee-eeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeee Confidence 865 33333332211111 11233566 578999988865432 223332221110 10 Q ss_pred ----------eEEecccccceee-----eeCCCcccceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 410 ----------IFFAPYNITLQQN-----LTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 410 ----------lfy~PY~~~~~~~-----~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +|=.-.+-..... +.||+.|...-. .--+.+...-.+....-+|-|.++|.--+ T Consensus 264 ~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (397) T protein:vir:23 264 GSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTF--DPVLTTYALDLDGASAGNFTLSLDGKTSA 332 (397) T ss_pred ccccccceeeeeeccceeEEEEeeeccceecccceEEEee--ccccceeeecccccCcceEEEEecCcccc Confidence 1100000000000 123333221110 00011111123444566677666665444 No 57 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=92.15 E-value=0.013 Score=30.99 Aligned_cols=321 Identities=12% Similarity=-0.004 Sum_probs=124.1 Q ss_pred CCccchhhh---------HHHhh----------------hhhhcc-----ccccCh-hhhhheehccccc-----hhHHH Q lcl|NC_018861. 1 MADKYLLDE---------STKEK----------------FITSNL-----YPNLNE-SEKNIMRTVLENQ-----GNEVK 44 (465) Q Consensus 1 ~~~~~~~~e---------~~~e~----------------~~~~~~-----~~~~~~-~~~~~~~~l~~n~-----~~~~~ 44 (465) ++.+.+.++ .+.++ ...... .+...+ .++.-.....+-. 
..+.+ T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (413) T protein:vir:81 32 DAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVK 111 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHH Confidence 111000000 00000 000000 000000 0000000000000 01111 Q ss_pred hhhhh----hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccc Q lcl|NC_018861. 45 MLMES----TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLK 118 (465) Q Consensus 45 ~i~es----t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~ 118 (465) .+.+. +++.+....=|..+ .++...-+..+-.+++.|+||++++.-+.-.. -.. ... T Consensus 112 ~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~--~~~-------------- 174 (413) T protein:vir:81 112 AASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEK-ANR--VVE-------------- 174 (413) T ss_pred hhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEec-ccc--ccc-------------- Confidence 11111 11122222224333 46677777888889999999999874322111 000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcc Q lcl|NC_018861. 119 TESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVY 198 (465) Q Consensus 119 ~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~ 198 (465) ..+.|. . T Consensus 175 --------------------------~~a~~v----------------------------~------------------- 181 (413) T protein:vir:81 175 --------------------------GGFKTV----------------------------A------------------- 181 (413) T ss_pred --------------------------ccccee----------------------------c------------------- Confidence 000000 0 Q ss_pred cccCccccccccccccccchhhhccCCchhhcce-EEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHH Q lcl|NC_018861. 
199 TNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGI-SVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAE 277 (465) Q Consensus 199 ~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~F-sIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstE 277 (465) | |...+|... .++.++..+|..+-....|-||.+|-- +.++.|.+-|+.. T Consensus 182 ----------------------E--g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-----~l~~~i~~~la~~ 232 (413) T protein:vir:81 182 ----------------------E--GGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-----FLVSYINARLLEE 232 (413) T ss_pred ----------------------C--cccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHH Confidence 0 111222222 244555555555556779999999862 2578888888888 Q ss_pred HHHHhhHHHHhhhhhe------eeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHH Q lcl|NC_018861. 278 VALEIDRTIIEKANEV------ATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILD 351 (465) Q Consensus 278 ImlEINreii~~l~~~------at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~ 351 (465) |..-+|+.||..--.. .....-..+....+.+ ++..|...-..+.....+ ..+.+|+++.....|. T Consensus 233 ~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~-------~~~~i~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~ 304 (413) T protein:vir:81 233 LAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDE-------LADSIYKAMTNISLATPF-QADALVINPLDYQELR 304 (413) T ss_pred HHHHHHHHHhccCCCCCcccccccccccccccccccch-------hHHHHHHHHHHhhhhccC-CCcEEEEcHHHHHHHH Confidence 8888888887531000 0000001111111111 122222222223222332 4456889999988886 Q ss_pred hc----CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCC- Q lcl|NC_018861. 352 EI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDP- 426 (465) Q Consensus 352 ~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp- 426 (465) .. |-..+.+..... ..+......++|. |++|+++...|..-+++|---. 
+|--+...-+..-+++ T Consensus 305 ~lkd~~G~~l~~~~~~~~--~~~~~~~~~~~l~-G~pv~~s~~~~~~~~~~gd~~~-------~~~~~~~~~~~v~~~~~ 374 (413) T protein:vir:81 305 LAKDANGQYYGGGVFQGQ--YGSGGIMLDPAPW-GLRTVQSQVVPVGKPVVGAFRS-------AASVLRKGGVRIDSTNT 374 (413) T ss_pred HhhccCCceecccccccc--ccccccccCceec-ceeeEEcCCCCcccEEEEeccc-------EEEEEEecceEEEEecc Confidence 54 222222111100 0001112234554 7799998887766666553110 0110111111111111 Q ss_pred -----CcccceeeeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 427 -----VSGQPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 427 -----~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) .+-+=.+-+..||+..+ .++ ..|+ +.+ .+..+ T Consensus 375 ~~~~~~~~~~~~r~~~r~d~~~--~~~----~a~~~l~~-~~~~~ 412 (413) T protein:vir:81 375 NVDDFENNLITVRAEERVGLMV--TFP----EAIVQLDV-AEVVT 412 (413) T ss_pred ccchhhcCcEEEEEEEeeccEE--ecc----cceEEEEe-cCCCC Confidence 12233444555666522 222 2333 221 11122 No 58 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=91.91 E-value=0.014 Score=30.81 Aligned_cols=300 Identities=12% Similarity=0.054 Sum_probs=124.4 Q ss_pred CCcc-------chhhhHH--Hhhhhhhc----cccc--------cChhhhhheehccccchhHHHhhhhhhhc-cccccc Q lcl|NC_018861. 1 MADK-------YLLDEST--KEKFITSN----LYPN--------LNESEKNIMRTVLENQGNEVKMLMESTVT-GDIAKF 58 (465) Q Consensus 1 ~~~~-------~~~~e~~--~e~~~~~~----~~~~--------~~~~~~~~~~~l~~n~~~~~~~i~est~t-~~v~~~ 58 (465) +.+- ...++++ .+...... .... ..++++.....|.++......-...++++ |.+. 
T Consensus 43 ~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~-- 120 (397) T protein:vir:49 43 KNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLT-- 120 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcce-- Confidence 0000 0000000 00000000 0000 01133333333333222211122222211 1111 Q ss_pred cchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccc Q lcl|NC_018861. 59 TPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV 136 (465) Q Consensus 59 ~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~ 136 (465) =|.-+ .++...-++..-.+++.|+||++.+|-+--.| ....... T Consensus 121 iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~-------------------------------- 166 (397) T protein:vir:49 121 IPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEK--WADITGL-------------------------------- 166 (397) T ss_pred ecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEe--eccCCcc-------------------------------- Confidence 13333 45667777778889999999999886543222 1000000 Q ss_pred cccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccccc Q lcl|NC_018861. 137 SFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYA 216 (465) Q Consensus 137 s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 216 (465) + .|.. T Consensus 167 ----a------~~v~----------------------------------------------------------------- 171 (397) T protein:vir:49 167 ----A------KLDD----------------------------------------------------------------- 171 (397) T ss_pred ----e------eeec----------------------------------------------------------------- Confidence 0 0000 Q ss_pred chhhhccCCchhhcce-EEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee Q lcl|NC_018861. 
217 TAAGEKLGKDMKEMGI-SVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVAT 295 (465) Q Consensus 217 Ta~~E~lg~~f~EM~F-sIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at 295 (465) | |..+++-.. +++.++..+|.-+-...+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-.... T Consensus 172 ----E--~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~ 241 (397) T protein:vir:49 172 ----E--GGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAILEAIGTLPN 241 (397) T ss_pred ----c--ccccccccccceeeeEeeeeeeEeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 0 011122111 35666666666666678999999985 357889999999999999999999876522111 Q ss_pred eeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccc Q lcl|NC_018861. 296 VCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSG 371 (465) Q Consensus 296 ~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~ 371 (465) ..+---.+....|...+. +.+.....+++++.....|+.. |...+.|.. T Consensus 242 ---------~~~~~~~d~i~~~~~~l~---------~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~--------- 294 (397) T protein:vir:49 242 ---------KPTLAKWDDIIDLQAKVD---------PAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDV--------- 294 (397) T ss_pred ---------cccccCHHHHHHHHHhhh---------hhhcCCCEEEEcHHHHHHHHHhhccCCceeecccc--------- Confidence 011001122233332222 2233556889999999999875 222122111 Q ss_pred cceEEEEecCceEEEEeC--CCCc-----ceEEEE---------EecCCCccceeEEecccccceeeeeCCCcccceeee Q lcl|NC_018861. 372 IKPNVGKFDNRYDVIVDN--FAEF-----DYCTVA---------YKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMIL 435 (465) Q Consensus 372 ~~~~~G~l~~~~~vy~d~--~~~~-----dy~~vg---------~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~ 435 (465) .....++|. |++|++.. ..+. .-+++| .++.-. +-..||.- .+-...+=.+-. 
T Consensus 295 ~~g~~~~l~-G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~~~ 363 (397) T protein:vir:49 295 KSPTGYSID-GFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIGG------GAFETDTTKVRV 363 (397) T ss_pred cCCCCceec-ceeeEEecccccccccCCceeEEEeeccceEEEEeecccE----EEEecccc------chhhcCeeeEEE Confidence 011124564 44555422 2111 112222 111110 11111110 001122333444 Q ss_pred eeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 436 NNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 436 ~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ..|++.. |..+ ..|+ ++++.+--. T Consensus 364 ~~r~d~~--~~~~----~a~~~~~~~~~~~~ 388 (397) T protein:vir:49 364 IDRFDVV--STDT----EAFVPASFKAIADQ 388 (397) T ss_pred EEeeccE--Eecc----cceEEEEecccccc Confidence 5566552 2222 2233 222221111 No 59 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=91.83 E-value=0.014 Score=30.74 Aligned_cols=323 Identities=12% Similarity=0.069 Sum_probs=129.1 Q ss_pred CCccc--hhh------------hHHH---hhhhhhcccccc---Ch-hhhhheehccccc--hhHHHh--hh---hhhhc Q lcl|NC_018861. 1 MADKY--LLD------------ESTK---EKFITSNLYPNL---NE-SEKNIMRTVLENQ--GNEVKM--LM---ESTVT 52 (465) Q Consensus 1 ~~~~~--~~~------------e~~~---e~~~~~~~~~~~---~~-~~~~~~~~l~~n~--~~~~~~--i~---est~t 52 (465) |..+. +.+ |+.. +.....+..... .+ .++.....+.+.. ..+... +. .+++. T Consensus 88 ~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 167 (458) T protein:vir:10 88 VEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSV 167 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccC Confidence 11100 000 0000 000000000000 00 1111111111110 011111 11 11111 Q ss_pred cccccccc-hhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccc Q lcl|NC_018861. 
53 GDIAKFTP-ILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYT 130 (465) Q Consensus 53 ~~v~~~~P-~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~S 130 (465) ......-| .+. .++.++.++.+..+++-|+||+++..-+. .. . +.. T Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-~~----~-~~~-------------------------- 215 (458) T protein:vir:10 168 EVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTML-VE----P-DAG-------------------------- 215 (458) T ss_pred ccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEE-Ee----c-CCc-------------------------- Confidence 11111122 222 45677777888899999999988652111 11 0 000 Q ss_pred cccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccc Q lcl|NC_018861. 131 GTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKN 210 (465) Q Consensus 131 g~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~ 210 (465) .+.|.......| T Consensus 216 ---------------~a~~v~e~~~~~----------------------------------------------------- 227 (458) T protein:vir:10 216 ---------------KATWVAASTYGT----------------------------------------------------- 227 (458) T ss_pred ---------------ceeecccccccc----------------------------------------------------- Confidence 000000000000 Q ss_pred ccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh- Q lcl|NC_018861. 211 YTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK- 289 (465) Q Consensus 211 ~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~- 289 (465) |. .....-.-+++++++.++.-+-...+|-||.+|-- .|.+++|.+-|..-|..-||+.+|.. 
T Consensus 228 ----------~~--~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~~~l~G~ 291 (458) T protein:vir:10 228 ----------DT--TTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAFMTGD 291 (458) T ss_pred ----------cc--cccccccccceeeEeeeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 00 00000111356666777777777889999988832 57889999999999999999999852 Q ss_pred -------hhheeeeeeeeeeccCCc-ccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----Cccc Q lcl|NC_018861. 290 -------ANEVATVCTDFDVNSADG-RWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFV 357 (465) Q Consensus 290 -------l~~~at~~~~~~~~~~~~-~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~ 357 (465) +...+......-+...++ .-..-.+..|...+.. +. ..+......|+++.....|+.. |... T Consensus 292 G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~----l~--~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i 365 (458) T protein:vir:10 292 GSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRK----LG--RHGLKLSKLVLIVSMDAYYDLLEDEEWQDV 365 (458) T ss_pred CCCccceeeecccccccceeecccccccccccHHHHHHHHHh----hh--hhhcCCCEEEEcHHHHHHHHhhcccCCcee Confidence 111111100000000000 0000123333332222 21 1222456789999999888754 2211 Q ss_pred ccCCcccccccccccceEEEEecCceEEEEeCCCCcc----eEEEEEecCCCccceeEEecccccceeeeeCCCccccee Q lcl|NC_018861. 358 LSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD----YCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAM 433 (465) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d----y~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~ 433 (465) +.|..... ....-.++|. |++|+++.+.|.. -+++|.-++ +.++. .-..+.+..||-+-.+.+ T Consensus 366 ~~~~~~~~-----~~~~~~~~l~-G~pv~~~~~~p~~~~~~~~~~~~f~~-----~~~~~--~~~~~~v~~d~~~~~~~~ 432 (458) T protein:vir:10 366 AQVGNDSV-----KLQGQVGRIY-GLPVVVSEYFPAKANSAEFAVIVYKD-----NFVMP--RQRAVTVERERQAGKQRD 432 (458) T ss_pred eccccccc-----cccCcCceec-ceeeEEccccccccCCcceEEEEecc-----cEEEE--EeeceEEEeecccCCCce Confidence 11111000 0011124564 6899999886542 112221111 11111 111223345666556667 Q ss_pred eeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 
434 ILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 434 ~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +|.+.--+-...+.|. .|++ ++.-+ T Consensus 433 ~~~~~~r~~~~v~~~~----a~v~---~~~aa 457 (458) T protein:vir:10 433 AYYVTQRVNLQRYFAN----GVVS---GTYAA 457 (458) T ss_pred EEEEEEEecceEeccc----ceEE---Eeecc Confidence 7765333333333443 3332 22222 No 60 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=91.78 E-value=0.014 Score=30.70 Aligned_cols=309 Identities=12% Similarity=0.017 Sum_probs=129.5 Q ss_pred CCccchhhhHHHh---hhhhhc--------------cccccChhhhhheehccccch----hHHHhhhhhhhccccc--- Q lcl|NC_018861. 1 MADKYLLDESTKE---KFITSN--------------LYPNLNESEKNIMRTVLENQG----NEVKMLMESTVTGDIA--- 56 (465) Q Consensus 1 ~~~~~~~~e~~~e---~~~~~~--------------~~~~~~~~~~~~~~~l~~n~~----~~~~~i~est~t~~v~--- 56 (465) ..+..=..+++.+ +-.-.- ....-.+..++.+..+....+ +....+....+++... T Consensus 44 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 123 (390) T protein:vir:97 44 FATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGA 123 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhccccccccc Confidence 1000000001100 000000 000001122222222222211 1111111111111111 Q ss_pred cccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccc Q lcl|NC_018861. 57 KFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIE 135 (465) Q Consensus 57 ~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~ 135 (465) -.-|.++ .++.++-.+.+-.+++.+-||++++.-+--.. .... 
T Consensus 124 lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~----~~~~-------------------------------- 167 (390) T protein:vir:97 124 LTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFVN-------------------------------- 167 (390) T ss_pred ccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEe----cCCc-------------------------------- Confidence 1122222 55666677777788888988887763221111 0000 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) . +.+. T Consensus 168 ----~------a~~v----------------------------------------------------------------- 172 (390) T protein:vir:97 168 ----N------AAIV----------------------------------------------------------------- 172 (390) T ss_pred ----c------eeee----------------------------------------------------------------- Confidence 0 0000 Q ss_pred cchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh------ Q lcl|NC_018861. 216 ATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK------ 289 (465) Q Consensus 216 ~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~------ 289 (465) +| |..+++-..++++++...|..+-...+|-||.+|- .+.++.|.+-|+..|...+|+.+|.. T Consensus 173 ----~E--g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-----~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~ 241 (390) T protein:vir:97 173 ----AE--GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-----PQLASYMNNRLIRGLKVKEDAEILRGTGANDG 241 (390) T ss_pred ----cC--CccccccccceeEEEEeeeeEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcc Confidence 00 11222223345666666666666788999999984 25789999999999999999988852 Q ss_pred ---hhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccc Q lcl|NC_018861. 
290 ---ANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKID 366 (465) Q Consensus 290 ---l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~ 366 (465) |...+. ..-......+.-..+. +...+..+ ...+...+.+|++++....|...- ...+.++. T Consensus 242 p~Gi~~~~~--~~~~~~~~~~~~~~d~---~~~~~~~~------~~~~~~~~~~v~n~~~~~~L~~lk----d~~G~~l~ 306 (390) T protein:vir:97 242 LLGLIPQAT--TYAAPTTIAGATRVDQ---LRLAMLQA------SLAEYPASGIVINPIDWAAIELAK----DANNQYLI 306 (390) T ss_pred ccceeeccc--cccccccccccchHHH---HHHHHHhh------ccccCCCCEEEEcHHHHHHHHHhh----cCCCceee Confidence 111111 0000111111111122 22222222 133346677899999999998542 11122211 Q ss_pred ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcc-cceee--eeeeeeeee Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSG-QPAMI--LNNRYDVVA 443 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~-qp~~~--~~tRY~l~~ 443 (465) ..... .--++| .|++|++++..|.+-+++|--. .++++....-.......+..-| +..++ ...||+. T Consensus 307 ~~~~~--~~~~~l-~G~pV~~~~~~~~~~~~~gd~~-----~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~-- 376 (390) T protein:vir:97 307 GNARG--TLTPTL-WGLPVVATQAMAPGEFLVGAFD-----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLAL-- 376 (390) T ss_pred cCccC--CCCcee-cceeeEEcCCCCCCcEEEEecc-----ceEEEEEecceEEEEeecccccccCcEEEEEEEeecc-- Confidence 11000 112355 5779999998887766665310 1111111111112211111112 23334 3456766 Q ss_pred cCccccccc-ceEE Q lcl|NC_018861. 444 TPLHPEAFI-RTFA 456 (465) Q Consensus 444 nPf~~~~~~-~~f~ 456 (465) .|+.|.+.. 
-+|| T Consensus 377 ~v~~~~a~v~~~~a 390 (390) T protein:vir:97 377 VVYRPEALITGSFA 390 (390) T ss_pred EEeccccEEEEEeC Confidence 233333322 2344 No 61 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=91.73 E-value=0.014 Score=30.67 Aligned_cols=265 Identities=15% Similarity=0.075 Sum_probs=118.6 Q ss_pred ccccccccccccccccccccccc----hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTD----NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPT----gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) ++..+ +......-|+ .+.-.+.... .+.+-+.......+ ..+....... ++. T Consensus 1 Ma~~~--------T~~~~~iiPev~s~~v~~~~~~~~-------v~~~~~~~~~~l~g----~~G~tv~ip~-----~~~ 56 (278) T protein:vir:80 1 MADLT--------TKLANLIDPEVMGPMISAKLPKAI-------KFGKIAPIDNSLEG----QPGSEITVPK-----YKY 56 (278) T ss_pred CCCcc--------eehhheecHHHHHHHHHHHHHHhh-------hhcccceecccccC----CCCCEEEEee-----ecc Confidence 11100 0000111111 0000011000 00000000000000 0000000000 000 Q ss_pred cccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh-CCCHHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH-GINAEKELADILSAEVALEIDRTIIEKA 290 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GlDAe~EL~niLstEImlEINreii~~l 290 (465) . +..+...| |.++..-..+..+++++-|.|+ |+ | + .-|+.+.. 
+-|.-.+..+-++.-+..+++++++..| T Consensus 57 ~-g~a~~~~~--g~~i~~~~lt~~~~~~~i~~~~-~a-~--~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l 128 (278) T protein:vir:80 57 I-GDAQDVAE--GAAIDYSALETESVKHGIKKAG-KG-V--K-LTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEA 128 (278) T ss_pred C-CcceeecC--CCcCcccccccceeeEeeehhh-cc-c--c-ccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 11111222 2233333445666666666665 22 2 2 34444443 6799999999999999999999999988 Q ss_pred hheeeeeeeeeecc-CCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccc Q lcl|NC_018861. 291 NEVATVCTDFDVNS-ADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAIN 369 (465) Q Consensus 291 ~~~at~~~~~~~~~-~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~ 369 (465) .... .++.. ....+.-..+..+.-...++ ...--. ...+++++|++.+.|...+...+......... T Consensus 129 ~~a~-----~~~~~~~t~~~~~~~~~~~~da~~~l----~~~~~~-~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~-- 196 (278) T protein:vir:80 129 LTTT-----LEVKGAINIGLIDKIENTFTDAPDAI----EDESIT-TTGVLFLNYKDTAKLREEAAGSWTKASQLGDD-- 196 (278) T ss_pred hccc-----cccccccccchhhhHHHHHHHHHHhh----cccCCC-cccEEEECHHHHHHHHhhhhhhcccccccccc-- Confidence 4321 11111 11111111122222222221 111111 23479999999999988876655533221111 Q ss_pred cccceEEEEecCceEEEEeCCCCcce-EEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecC-- Q lcl|NC_018861. 370 SGIKPNVGKFDNRYDVIVDNFAEFDY-CTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATP-- 445 (465) Q Consensus 370 ~~~~~~~G~l~~~~~vy~d~~~~~dy-~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP-- 445 (465) ......+|++ .|++||+++..|..- ++++ +|+ -.|+.. 
.+.....--|+..++-.|-...+||+ ..|| T Consensus 197 ~~~~G~ig~~-~G~~Vi~s~~~p~~t~~l~~-~gA-----i~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~ 268 (278) T protein:vir:80 197 LLVKGAFGEL-LGWEIVRTKKLADGNALAVK-AGA-----LKTFLK-RNLLAESGRDMDHKLTKFNADQHYAVALVDETK 268 (278) T ss_pred ceeeccceee-cceeEEEcCCCCcceEEEEe-ccc-----eeeeec-CCcccccccchhhccceeeeeeEEEEEEEcCcc Confidence 1113468887 578999999987532 1111 121 112211 11111112388899999988999999 6677 Q ss_pred ---ccccccc Q lcl|NC_018861. 446 ---LHPEAFI 452 (465) Q Consensus 446 ---f~~~~~~ 452 (465) ++..+.. T Consensus 269 ~v~it~~a~~ 278 (278) T protein:vir:80 269 AVKVVPVAGN 278 (278) T ss_pred eEEEeeccCC Confidence 4555555 No 62 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=91.65 E-value=0.015 Score=30.61 Aligned_cols=261 Identities=11% Similarity=0.013 Sum_probs=112.0 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCc--ccccccccc--ccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNST--GSVAIGDEM--DKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~--g~ea~~~e~--~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) ++.... .....--|. +|+ .|.... ....+.+-+ +...++. .+....... ++. T Consensus 1 ma~~~T--------~~~d~iiPe--v~~---~~v~~~~~~~~~~~~~~~~~~~l~g~------~G~ti~iP~-----~~~ 56 (272) T protein:vir:36 1 MSKQKT--------TLADLVNPE--VLA---PIVSYELNKALRFAPLAQVDTTLQGQ------PGNTLKFPA-----FTY 56 (272) T ss_pred CCCcce--------ehhhhhchH--HHH---HHHHHHHHhhhhhccccccccccccC------CCCEEEEee-----ecc Confidence 111110 000001111 000 000000 000000000 0000000 000000000 000 Q ss_pred cccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhh-hCCCHHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_018861. 
212 TGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQ-HGINAEKELADILSAEVALEIDRTIIEKA 290 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAi-HGlDAe~EL~niLstEImlEINreii~~l 290 (465) . +-.+...|...-+..++ +..+.+++-|-|+-.-++| |+-+. -+-|.-.|..+-++..+..+++++++..+ T Consensus 57 ~-gda~~~~eg~~i~~~~l--t~~~~~~~i~~~~k~~~vt-----D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l 128 (272) T protein:vir:36 57 I-GDAADVAEGGEISLDKI--GTTTKSVTIKKAAKGTEIT-----DEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAA 128 (272) T ss_pred C-ccccccCCCCccChhhc--CCcceeEeeehhhcccccc-----HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 0 11111222111123344 3455566666665322232 32222 25799999999999999999999999888 Q ss_pred hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccccc Q lcl|NC_018861. 291 NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINS 370 (465) Q Consensus 291 ~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~ 370 (465) .... .++ ...-..+....+..++.++. ...++++|+|.++..|.....+........... T Consensus 129 ~~~~-----~~~---~~~~~~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~--- 188 (272) T protein:vir:36 129 KTTS-----QTV---STKANVDGVQAALDIFNDED---------AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANA--- 188 (272) T ss_pred cccc-----ccc---cccccHHHHHHHHHHhhhcC---------CCceEEEEcHHHHHHHhcccccccccccccccc--- Confidence 4321 111 11111232333333332221 246799999999999987766654432111100 Q ss_pred ccceEEEEecCceEEEEeCCCCcc---eEEEEE-ecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecC Q lcl|NC_018861. 371 GIKPNVGKFDNRYDVIVDNFAEFD---YCTVAY-KGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATP 445 (465) Q Consensus 371 ~~~~~~G~l~~~~~vy~d~~~~~d---y~~vg~-kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP 445 (465) .....+|++ .|++|+++...|.+ |..+.. 
+|+- .+|..= ....-.--|+..++-.+--.-+||+ ..|| T Consensus 189 ~~~G~ig~~-~G~~Vv~s~~~p~~~~~~~~~~~~~gA~-----~~~~~~-~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~ 261 (272) T protein:vir:36 189 LINGTYADV-LGAQIVRSKKLAEGSALMFKIVSNSPAL-----KLVLKR-GVQVETDRDIVTKTTVITADEHYAAYLYDL 261 (272) T ss_pred eeeecccee-cCeeEEEeCCCCCCceeEEEEEecccce-----eeeecC-CcccccccchhhcCcEEEEEEEEEEEEEcC Confidence 112356777 45899999997643 111111 1211 111100 0011112288889988888888988 5565 Q ss_pred cccccccceEE-Eeeccc Q lcl|NC_018861. 446 LHPEAFIRTFA-VNLNNY 462 (465) Q Consensus 446 f~~~~~~~~f~-~~~~~~ 462 (465) ..++ ++..|- T Consensus 262 -------~~vv~~t~~g~ 272 (272) T protein:vir:36 262 -------TKVVNITFTGV 272 (272) T ss_pred -------ccEEEEeecCC Confidence 2222 222222 No 63 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=91.01 E-value=0.018 Score=30.16 Aligned_cols=313 Identities=13% Similarity=0.098 Sum_probs=124.6 Q ss_pred CCcc---------chhhhHHHhhhhhhccccc------c---ChhhhhheehccccchhHHHhhhhhhhccccccccchh Q lcl|NC_018861. 1 MADK---------YLLDESTKEKFITSNLYPN------L---NESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPIL 62 (465) Q Consensus 1 ~~~~---------~~~~e~~~e~~~~~~~~~~------~---~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l 62 (465) ..++ ....++....|...+.... . .+.|+.....|.++..+ ...-+-++++++-...=|.. T Consensus 79 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~-~e~~a~~~~t~~GG~lvP~~ 157 (434) T protein:vir:62 79 KEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDE-KEARALGLVTGNGSVTIPDF 157 (434) T ss_pred hcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccch-hhhhhhcccccccceecchh Confidence 0000 0000111111111110000 0 01111111111111100 00000011222100011333 Q ss_pred h--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccc Q lcl|NC_018861. 
63 V--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKT 140 (465) Q Consensus 63 ~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~t 140 (465) + .+++..-+..+...++-|.|+++..- |- + +.. +.. T Consensus 158 ~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~--~p-~--~~~-~~~------------------------------------ 195 (434) T protein:vir:62 158 LSKEIITYAQEENFLRRLGTGVKTKENIK--YP-V--LVK-KAE------------------------------------ 195 (434) T ss_pred hHHHHHHhhhhhhhhhhhcceeccCCceE--EE-E--Eec-CCc------------------------------------ Confidence 3 25566666666677777777654310 00 0 000 000 Q ss_pred cccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhh Q lcl|NC_018861. 141 ATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAG 220 (465) Q Consensus 141 att~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~ 220 (465) + .+. .. T Consensus 196 a------~~~--------------------------------------------------------------------~~ 201 (434) T protein:vir:62 196 A------QGH--------------------------------------------------------------------KN 201 (434) T ss_pred c------cce--------------------------------------------------------------------ec Confidence 0 000 00 Q ss_pred hccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhh-------e Q lcl|NC_018861. 221 EKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANE-------V 293 (465) Q Consensus 221 E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~-------~ 293 (465) +..|...++-..++++++..+|.-+-...+|-||.+|- .+|.+++|.+-|+..|..-+++.||..==. . 
T Consensus 202 ~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~ 277 (434) T protein:vir:62 202 ERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLART----GLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGAL 277 (434) T ss_pred ccccccccccccceeeEEeeheeeEeehhhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccee Confidence 00011122222357777888888888889999999995 467899999999999999999999952200 0 Q ss_pred eeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccccccc- Q lcl|NC_018861. 294 ATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGI- 372 (465) Q Consensus 294 at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~- 372 (465) +..+..+.. .... .+..|...+..+... +.+.-..|+++.....|...- ...+.+.-.+..+. T Consensus 278 ~~~~~~~~~--~~~~----~~d~l~~l~~~l~~~------~~~~a~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~~ 341 (434) T protein:vir:62 278 AKKAVEFKT--DEKN----LYDALVKMKNTPVKE------VRKKARWVLNTAALTKIETMK----TDDGFPLLRPFNQAE 341 (434) T ss_pred ecccccccc--cccc----hhhHHHHHHhhcchh------hhcCCEEEEcHHHHHHHHHhh----ccCCCEeeccCCCcc Confidence 111111111 1111 233333333332222 112234578999998887641 01112111111000 Q ss_pred ceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc---------cceeeeeCCCccccee--eeeeeeee Q lcl|NC_018861. 373 KPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI---------TLQQNLTDPVSGQPAM--ILNNRYDV 441 (465) Q Consensus 373 ~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~---------~~~~~~~dp~s~qp~~--~~~tRY~l 441 (465) ...-.+| .|++|+++.+.+.. -.|. ..-++|+.+.. ..+.+..+.-.-.-.+ ..+.|.+. T Consensus 342 ~g~~~tl-~G~pV~~~~~~~~~-----~~~~---~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dg 412 (434) T protein:vir:62 342 GGIGYTL-LGFPVEEEDAIDIP-----DSPD---TPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDA 412 (434) T ss_pred CCCCcee-cceeeEEecCccCc-----cCCC---ceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecc Confidence 0111244 46788887665421 1110 11133332211 1222233332223334 44466643 Q ss_pred -ee-cCcccccccceEEEeeccceeC Q lcl|NC_018861. 
442 -VA-TPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 442 -~~-nPf~~~~~~~~f~~~~~~~~~~ 465 (465) .+ .||.+. .+-+.+++-+ T Consensus 413 k~i~~~~~~~------~~~~~~~~~~ 432 (434) T protein:vir:62 413 QLIHSPFEVP------VYKYVLKAPT 432 (434) T ss_pred eeecCcccce------EEEEEeccCC Confidence 34 477765 2233344433 No 64 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=90.74 E-value=0.019 Score=29.99 Aligned_cols=270 Identities=14% Similarity=0.017 Sum_probs=116.1 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) ++.... .....+ .|+---..+...++++.. +.+-+.......+ ..+....... ++.. +- T Consensus 1 ma~~~T-~~~d~i---iPev~~~~v~~~~~~~l~-------~~~~~~~d~~l~g----~~G~tv~iP~-----~~~~-g~ 59 (274) T protein:vir:94 1 MPQGLT-KTSDQI---IPEVLAPMMQAQLEKKLR-------FASFAEVDSTLQG----QPGDTLTFPA-----FVYS-GD 59 (274) T ss_pred CCccce-ehhhee---chHHHHHHHHHhhhhhhh-------hcccceecccccC----CCCCEEEEee-----ecCC-Cc Confidence 111110 001100 000000001111111100 0000000000000 0000000000 0000 11 Q ss_pred cchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee Q lcl|NC_018861. 216 ATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVAT 295 (465) Q Consensus 216 ~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at 295 (465) .+...|...-+..++. ..+.+++.+-|+ |+ |.+.=-..+.+ +-|.-.+..+-++..|..+++.+++..+..... 
T Consensus 60 a~~~~~g~~i~~~~lt--~~~~~~~i~~~~-~~-~~i~D~~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~ 133 (274) T protein:vir:94 60 AQVVAEGEKIPTDILE--TKKREAKIRKIA-KG-TSITDEALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred cccccCCCcccccccc--cceeEEEeeeec-ce-ecccHHHHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Confidence 1112222122344443 334444445555 22 22221122223 468889999999999999999999998854322 Q ss_pred eeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceE Q lcl|NC_018861. 296 VCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPN 375 (465) Q Consensus 296 ~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (465) +++ ......+.+-.+..++..+. ...++++|+|++++.|...+...|..+...... -..... T Consensus 134 -----~~~--~~~~~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~--~~~~G~ 195 (274) T protein:vir:94 134 -----TVN--ADITKLNGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDASTNFTRATELGDD--IIVKGA 195 (274) T ss_pred -----ccc--ccccCHHHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhhhhhccccCccccc--ceeccc Confidence 111 12222344444444444321 256899999999999998765544433221111 011335 Q ss_pred EEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccccce Q lcl|NC_018861. 376 VGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRT 454 (465) Q Consensus 376 ~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~ 454 (465) +|++ .|++||+++..|.+-..+-=+| .+-|.---+...-.--|+..+.-.+-..-+||+ ..||= . T Consensus 196 ig~~-~G~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~-------~ 261 (274) T protein:vir:94 196 FGEA-LGAIIVRTNKLEAGTAILAKKG------AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDES-------K 261 (274) T ss_pred ccee-cCeeEEEcCCCCcceEEEEeCc------ceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCC-------c Confidence 7887 5789999999885442221122 222210111111112388889888888889988 55551 0 Q ss_pred EEEeeccceeC Q lcl|NC_018861. 
455 FAVNLNNYIIS 465 (465) Q Consensus 455 f~~~~~~~~~~ 465 (465) +|-++-..-| T Consensus 262 -vv~~t~~~~~ 271 (274) T protein:vir:94 262 -AVKITKGSGS 271 (274) T ss_pred -eEEEecCccc Confidence 1111111112 No 65 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=90.74 E-value=0.019 Score=29.99 Aligned_cols=270 Identities=14% Similarity=0.017 Sum_probs=116.1 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) ++.... .....+ .|+---..+...++++.. +.+-+.......+ ..+....... ++.. +- T Consensus 1 ma~~~T-~~~d~i---iPev~~~~v~~~~~~~l~-------~~~~~~~d~~l~g----~~G~tv~iP~-----~~~~-g~ 59 (274) T protein:vir:97 1 MPQGLT-KTSDQI---IPEVLAPMMQAQLEKKLR-------FASFAEVDSTLQG----QPGDTLTFPA-----FVYS-GD 59 (274) T ss_pred CCccce-ehhhee---chHHHHHHHHHhhhhhhh-------hcccceecccccC----CCCCEEEEee-----ecCC-Cc Confidence 111110 001100 000000001111111100 0000000000000 0000000000 0000 11 Q ss_pred cchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee Q lcl|NC_018861. 216 ATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVAT 295 (465) Q Consensus 216 ~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at 295 (465) .+...|...-+..++. ..+.+++.+-|+ |+ |.+.=-..+.+ +-|.-.+..+-++..|..+++.+++..+..... 
T Consensus 60 a~~~~~g~~i~~~~lt--~~~~~~~i~~~~-~~-~~i~D~~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~ 133 (274) T protein:vir:97 60 AQVVAEGEKIPTDILE--TKKREAKIRKIA-KG-TSITDEALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred cccccCCCcccccccc--cceeEEEeeeec-ce-ecccHHHHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Confidence 1112222122344443 334444445555 22 22221122223 468889999999999999999999998854322 Q ss_pred eeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceE Q lcl|NC_018861. 296 VCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPN 375 (465) Q Consensus 296 ~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (465) +++ ......+.+-.+..++..+. ...++++|+|++++.|...+...|..+...... -..... T Consensus 134 -----~~~--~~~~~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~--~~~~G~ 195 (274) T protein:vir:97 134 -----TVN--ADITKLNGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDASTNFTRATELGDD--IIVKGA 195 (274) T ss_pred -----ccc--ccccCHHHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhhhhhccccCccccc--ceeccc Confidence 111 12222344444444444321 256899999999999998765544433221111 011335 Q ss_pred EEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccccce Q lcl|NC_018861. 376 VGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRT 454 (465) Q Consensus 376 ~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~ 454 (465) +|++ .|++||+++..|.+-..+-=+| .+-|.---+...-.--|+..+.-.+-..-+||+ ..||= . T Consensus 196 ig~~-~G~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~-------~ 261 (274) T protein:vir:97 196 FGEA-LGAIIVRTNKLEAGTAILAKKG------AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDES-------K 261 (274) T ss_pred ccee-cCeeEEEcCCCCcceEEEEeCc------ceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCC-------c Confidence 7887 5789999999885442221122 222210111111112388889888888889988 55551 0 Q ss_pred EEEeeccceeC Q lcl|NC_018861. 
455 FAVNLNNYIIS 465 (465) Q Consensus 455 f~~~~~~~~~~ 465 (465) +|-++-..-| T Consensus 262 -vv~~t~~~~~ 271 (274) T protein:vir:97 262 -AVKITKGSGS 271 (274) T ss_pred -eEEEecCccc Confidence 1111111112 No 66 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=89.30 E-value=0.027 Score=29.17 Aligned_cols=265 Identities=13% Similarity=0.032 Sum_probs=111.4 Q ss_pred cccccccccchhhhheeeeeccCcc---cccccccccc---ccccccCCccCCCcccccCccccccccccccccchhhhc Q lcl|NC_018861. 149 VYSEKQAGTDNIVNVLLRLESNSTG---SVAIGDEMDK---AATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEK 222 (465) Q Consensus 149 t~~~~~TGPTgLifam~s~y~~~~g---~ea~~~e~~t---~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~ 222 (465) .+..++++-+ +...+ .+.+.++... ..+--..-.....+.... ....-.......-.+| T Consensus 1 ~g~~a~~~~~----------~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~----~~~~~~~~~~a~~v~E- 65 (299) T protein:vir:41 1 MGFNPDTTTM----------QSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPE----EEFTFMSGVGAFWVDE- 65 (299) T ss_pred CCcCCCcccc----------cCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCc----EEEEEEcCCceeeeec- Confidence 1111111100 00000 0000000000 000000000000000000 0000000111111223 Q ss_pred cCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh--------hhee Q lcl|NC_018861. 223 LGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA--------NEVA 294 (465) Q Consensus 223 lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l--------~~~a 294 (465) |.+++|..-++++++...|..+-...+|-||.+|-. 
.|.++.|.+.|...|...+|+.+|.-- ...+ T Consensus 66 -~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~ 140 (299) T protein:vir:41 66 -AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSA 140 (299) T ss_pred -CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccc Confidence 566788888889999999999999999999999754 467899999999999999999988522 1111 Q ss_pred eeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccce Q lcl|NC_018861. 295 TVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKP 374 (465) Q Consensus 295 t~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~ 374 (465) +...... . .+. ..+..| .++-..+.. .....+.++++++....|...-- ..+.....++- .. T Consensus 141 ~~~~~~~--~-~~~---~~~~~l----~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lkd----~~G~~l~~~~~--~~ 202 (299) T protein:vir:41 141 TDASNLV--E-ETA---NKYDDL----NEAIGLIEA--EDLEPNGIATIRKQRVKYRSTKD----GNGMPIFNTAT--SN 202 (299) T ss_pred cccceee--c-ccc---ccHHHH----HHHHHhhhc--ccCCcCEEEEcHHHHHHHHHhhc----cCCceeecCCc--CC Confidence 1010000 0 110 112222 233333332 23466789999999999987421 11121111110 01 Q ss_pred EEEEecCceEEEEeCCCCcce----EEEEEecCCCccceeEEecccccceeee--------eCCCc-----cc-ceeeee Q lcl|NC_018861. 375 NVGKFDNRYDVIVDNFAEFDY----CTVAYKGASNFDAGIFFAPYNITLQQNL--------TDPVS-----GQ-PAMILN 436 (465) Q Consensus 375 ~~G~l~~~~~vy~d~~~~~dy----~~vg~kg~~~~d~glfy~PY~~~~~~~~--------~dp~s-----~q-p~~~~~ 436 (465) -.++|. |++|++.+..+.+= +++|-- +.+++.........+. .|+.. || -.+.|. 
T Consensus 203 ~~~~l~-G~PV~~~~~~~~~~~~~~~~~gdf------s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 275 (299) T protein:vir:41 203 GVDDVL-GLPIAYTPKYTFGDKDISELVGDW------NQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIK 275 (299) T ss_pred CCceec-ceeeEEecccCCCCCceEEEEEec------ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 124664 58998888766441 222210 0111122222222211 12211 22 123333 Q ss_pred --eeeeeeecCcccccccceEEEeec Q lcl|NC_018861. 437 --NRYDVVATPLHPEAFIRTFAVNLN 460 (465) Q Consensus 437 --tRY~l~~nPf~~~~~~~~f~~~~~ 460 (465) .|++.. +.+|.+..+--.+.-| T Consensus 276 ~~~~~d~~--v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 276 ATFEVGFM--VVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEEeccE--EecccceEEEEeccCC Confidence 455542 2223333322222222 No 67 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=88.51 E-value=0.032 Score=28.79 Aligned_cols=290 Identities=11% Similarity=0.048 Sum_probs=115.0 Q ss_pred ccccccccccccccccc--cccccccccccc-ccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 127 FNYTGTPIEVSFKTATT--VKGKIVYSEKQA-GTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 127 ~~~Sg~~~~~s~~tatt--~ggait~~~~~T-GPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) ..+..+..+.+..+... .-..+|...-.. -...++|+- ....+++.... +.+..-.-. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~----------------~~~d~~~~~~~---Gdtv~ip~~ 61 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTS----------------VVKTWGAQVKK---GDTFHVPRI 61 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhh----------------ccccccccccC---CceEEEecc Confidence 11111111101000000 000000000000 000011110 00111100000 000000000 Q ss_pred cccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_018861. 
204 WLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEID 283 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEIN 283 (465) .......+..+.. -..+.+ .-.+.-++|||...-+-. + +-+|.. +. ..|...|+..-....++.+++ T Consensus 62 g~~~~~d~~~~~~-i~~~~~--~~~~~~itiD~~~~~~~~--i---~d~d~~---~~--~~d~~~~~~~~~~~aLA~~~D 128 (341) T protein:vir:94 62 SELGVEDKATDVP-VGVQPV--NDTDFVITVDTDRTTAVA--L---DDLLEI---QA--SYDLRAPYLEAMGYALAKDMT 128 (341) T ss_pred CcceeeeecCCCc-cccccc--cCceEEEEEeeeeeccee--e---chHHHH---hh--ccchHHHHHHHHHHHHHHHHH Confidence 0000001110000 001111 123555677665432210 0 122322 22 368889999999999999999 Q ss_pred HHHHhhhhheeeeeee-----eeeccC-Ccc-cHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcc Q lcl|NC_018861. 284 RTIIEKANEVATVCTD-----FDVNSA-DGR-WFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSF 356 (465) Q Consensus 284 reii~~l~~~at~~~~-----~~~~~~-~~~-~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~ 356 (465) +.|+..+...+..... ...... .+. ...+.+..+..++++.. ---.++++|++|++.+.|...+.| T Consensus 129 ~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~-------VP~~gR~lvv~P~~~~~Ll~~~~~ 201 (341) T protein:vir:94 129 GSILGLRAAVQNTASQNVFSSSNGAITGNGQAFSFAVFLAARRLLLEAD-------VPEEKIVLLISPGQESALFTIPQF 201 (341) T ss_pred HHHHHHhhhccccccCccccCccccccCchhhhhHHHHHHHHHHHhhcC-------CCccCCEEEeCHHHHHHHhhchhh Confidence 9999877432211100 111100 111 11233334444443321 112568999999999999888777 Q ss_pred cccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEE------------------E------EecCCCccceeEE Q lcl|NC_018861. 357 VLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTV------------------A------YKGASNFDAGIFF 412 (465) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~v------------------g------~kg~~~~d~glfy 412 (465) ......+. .+-....+|.+ -|+.||..++-|.+-... 
| +++......||++ T Consensus 202 ~~~~~~g~----~~l~~G~ig~i-~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~ 276 (341) T protein:vir:94 202 ISKDFINN----APIAQGQIGSL-MGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTG 276 (341) T ss_pred hhhhcccc----chhheeeeeeE-eceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEE Confidence 54432221 11223467776 589999988876543110 0 1122233356666 Q ss_pred ecccccceeeeeCCCcccce-----------------eeeeeeeeeeecCcccccccceEEEeecccee Q lcl|NC_018861. 413 APYNITLQQNLTDPVSGQPA-----------------MILNNRYDVVATPLHPEAFIRTFAVNLNNYII 464 (465) Q Consensus 413 ~PY~~~~~~~~~dp~s~qp~-----------------~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~ 464 (465) .+.--.. ...+||+.++.. -.+..||.+-+-++-|+.-..-.. .+--+ T Consensus 277 ~~~av~~-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~---~~~~~ 341 (341) T protein:vir:94 277 NSRPVHT-AVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHT---TGDTV 341 (341) T ss_pred ecccccc-eeeecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEec---CcCCC Confidence 5443222 225566655542 123467777776776665221111 11001 No 68 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=88.48 E-value=0.032 Score=28.78 Aligned_cols=312 Identities=14% Similarity=0.092 Sum_probs=107.6 Q ss_pred CCcc-chh--------hhHHH-hhhhhhc----------cccccCh----hhhhheehccccc--------------hh- Q lcl|NC_018861. 1 MADK-YLL--------DESTK-EKFITSN----------LYPNLNE----SEKNIMRTVLENQ--------------GN- 41 (465) Q Consensus 1 ~~~~-~~~--------~e~~~-e~~~~~~----------~~~~~~~----~~~~~~~~l~~n~--------------~~- 41 (465) +.+- .|. .|++. +.=.++. ..+.... ..+...+ +.... .. 
T Consensus 43 ~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 121 (428) T protein:vir:10 43 QQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMS-IAAAQGNLQDAAKFASDELNDQ 121 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHH-HHHhhhhHHHHHHHhhhhhhhh Confidence 1100 000 01000 0000000 0000000 0000000 00000 00 Q ss_pred -HHHhhhhhhhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccc Q lcl|NC_018861. 42 -EVKMLMESTVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLK 118 (465) Q Consensus 42 -~~~~i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~ 118 (465) ....+..++++|.+. =|.-+ -++.++-+..+..++ |+..+++++|-+--.| . T Consensus 122 ~~~~~~~~~~~~gg~l--iP~~~~~~ii~~l~~~~~l~~~-~~~~~~~~~g~~~~p~--~-------------------- 176 (428) T protein:vir:10 122 SVSMAISTAAGSGGVL--IPQNIHSEVIELLRDRTIVRKL-GARSIPLPNGNMSLPR--L-------------------- 176 (428) T ss_pred hHhhhhcccccCCccc--cchhHHHHHHHHHhhhchhhhh-cceeeecCCcceEEEE--E-------------------- Confidence 000111111111110 01110 111222222222222 1111111111110000 0 Q ss_pred cccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcc Q lcl|NC_018861. 119 TESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVY 198 (465) Q Consensus 119 ~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~ 198 (465) + .+..+ T Consensus 177 --------------------------------------------------~--~~~~a---------------------- 182 (428) T protein:vir:10 177 --------------------------------------------------A--GGATA---------------------- 182 (428) T ss_pred --------------------------------------------------e--CCcce---------------------- Confidence 0 00000 Q ss_pred cccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHH Q lcl|NC_018861. 
199 TNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEV 278 (465) Q Consensus 199 ~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEI 278 (465) .-.+| |...++...++++++...|.-+-...+|-||.+|- ..|.++.|.+.|...| T Consensus 183 ------------------~~v~E--g~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai 238 (428) T protein:vir:10 183 ------------------SYTGE--NQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAI 238 (428) T ss_pred ------------------eeecc--CccccccccceeeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHH Confidence 00001 23445555667778888887777899999999883 2567899999999999 Q ss_pred HHHhhHHHHhhh---------hheeeeeee-eeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHH Q lcl|NC_018861. 279 ALEIDRTIIEKA---------NEVATVCTD-FDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVAT 348 (465) Q Consensus 279 mlEINreii~~l---------~~~at~~~~-~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~ 348 (465) ...+|+.+|..= ...++.... .......+.. .+....+...+ .+.....+. +......|+++.... T Consensus 239 ~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~--~~~~~~~v~n~~~~~ 314 (428) T protein:vir:10 239 SVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVN-LDTIDTYLDSI-ILMSMDGNS--NMISSGWGMSNRTYM 314 (428) T ss_pred HHHHHHHHhccCCCCcccccccccccccccccccccccccc-HHHHHHHHHHH-HHhhhcccc--ccccCEEEEcHHHHH Confidence 999999988531 111111100 0000001111 12222222222 222222221 123455678999988 Q ss_pred HHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc------cceee Q lcl|NC_018861. 349 ILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI------TLQQN 422 (465) Q Consensus 349 ~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~------~~~~~ 422 (465) .|....- ..+..+.... .-|+| .|++||++.+.|.+- |. ..-.+-++|+.+.. ..... 
T Consensus 315 ~L~~lkd----~~G~~i~~~~-----~~g~l-~G~pv~~~~~~p~~~---~~---~~~~~~i~~gd~s~~~i~~~~~i~i 378 (428) T protein:vir:10 315 KLFGLRD----GNGNKVYPEM-----AQGML-KGYPIQRTSAIPANL---GE---GGKESEIYFADFNDVVIGEDGNMKV 378 (428) T ss_pred HHHHhhc----cCCceeccCC-----CCCee-eceeeEEeccccccc---cC---CCccceEEEEecceEEEEEecceEE Confidence 8876421 1111221111 11455 578999988765541 00 00011122222111 00011 Q ss_pred eeCCC------------cc---cceeeeeeeeeeeecCcccccccceEE-Eeeccc Q lcl|NC_018861. 423 LTDPV------------SG---QPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNY 462 (465) Q Consensus 423 ~~dp~------------s~---qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~ 462 (465) .+++. .| +=.+=...|+++.+ .+|++ |+ +.--++ T Consensus 379 ~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v--~~p~a----~~~~t~~~~ 428 (428) T protein:vir:10 379 DFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGF--RHPEG----LVLGTGVLF 428 (428) T ss_pred EeecccccccccccccchhhcchhheeeeeeeCcee--eccce----EEEEeccCC Confidence 11111 01 11122344555522 12222 22 111111 No 69 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=88.14 E-value=0.034 Score=28.62 Aligned_cols=306 Identities=13% Similarity=0.037 Sum_probs=127.2 Q ss_pred CCcc---c----------h------hhhHHHhhhhhhccc----ccc------ChhhhhheehccccchhH----HHhhh Q lcl|NC_018861. 1 MADK---Y----------L------LDESTKEKFITSNLY----PNL------NESEKNIMRTVLENQGNE----VKMLM 47 (465) Q Consensus 1 ~~~~---~----------~------~~e~~~e~~~~~~~~----~~~------~~~~~~~~~~l~~n~~~~----~~~i~ 47 (465) ++.. . | .++++.+.-...... ... .+..+......-...+.. ..... 
T Consensus 32 ~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (390) T protein:vir:81 32 LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALN 111 (390) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHH Confidence 1000 0 0 000111110000000 000 001111111111111100 01110 Q ss_pred h---hhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 48 E---STVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 48 e---st~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) - ++++..-....|..+ .++.+.-...+-.+++.+.||++++.-+.-.. .... T Consensus 112 ~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~~~~-------------------- 167 (390) T protein:vir:81 112 TASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFVN-------------------- 167 (390) T ss_pred hhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEe----cCCc-------------------- Confidence 0 111111112334444 56666777778888999999988763222111 0000 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) . ..|. T Consensus 168 ----------------~------a~~v----------------------------------------------------- 172 (390) T protein:vir:81 168 ----------------N------AAIV----------------------------------------------------- 172 (390) T ss_pred ----------------c------eeee----------------------------------------------------- Confidence 0 0000 Q ss_pred cccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_018861. 
204 WLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEID 283 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEIN 283 (465) +| |..+++-..++++++.++|.-+-...+|-||.+|- . +.++.|.+-|+..|...+| T Consensus 173 ----------------~E--g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d 229 (390) T protein:vir:81 173 ----------------AE--GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA--P---QLASYMNNRLIRGLKVKED 229 (390) T ss_pred ----------------cC--CcccccccceeeEEEEeeeEEEEeehhhHHHHHhH--H---HHHHHHHHHHHHHHHHHHH Confidence 00 11222223345666666666666677899999984 2 5789999999999999999 Q ss_pred HHHHhhh---------hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcC Q lcl|NC_018861. 284 RTIIEKA---------NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIG 354 (465) Q Consensus 284 reii~~l---------~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~ 354 (465) +.||..- ...+. ..-......+....+....++.++. ..+...+.+|+++.....|...- T Consensus 230 ~a~l~G~g~~~~~~Gi~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~v~~~~~~~~l~~lk 298 (390) T protein:vir:81 230 AEILRGTGANDGLLGLIPQAT--TYAAPTTIAGATRVDQLRLAMLQAS---------LAEYNPSGIVINPIDWAAIELAK 298 (390) T ss_pred HHHHhcCCCCCcccceeeccc--ccccccccccchhHHHHHHHHHhhc---------cccCCCCEEEEcHHHHHHHHHhh Confidence 9988531 10011 0011111122222233333332222 22345667899999999887542 Q ss_pred cccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceee-eeC-CCccc-c Q lcl|NC_018861. 355 SFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQN-LTD-PVSGQ-P 431 (465) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~-~~d-p~s~q-p 431 (465) ...+.+.-.. .. ..-.++| .|++|++.+..|.+-+++|---. ..+. 
+.-....+ ..+ +.-|+ - T Consensus 299 ----d~~G~~l~~~-~~-~~~~~~l-~G~pv~~~~~~p~~~~~~gd~~~-----~~~~--~~~~~~~v~~~~~~~~~~~~ 364 (390) T protein:vir:81 299 ----DANNQYLIGN-AR-GTLTPTL-WGLPVVATQAMAPGEFLVGAFDL-----AAQI--FDQWDARVEIGYVGEDFQRN 364 (390) T ss_pred ----cCCCceeecC-cc-cccCcee-cceeeEEcCCCCCCcEEEEehhc-----eEEE--EEecceEEEEecccchhhcC Confidence 1111111110 00 0112355 47799999998877666653210 0010 11111111 111 11121 2 Q ss_pred --eeeeeeeeee-eecCccccccc-ceEE Q lcl|NC_018861. 432 --AMILNNRYDV-VATPLHPEAFI-RTFA 456 (465) Q Consensus 432 --~~~~~tRY~l-~~nPf~~~~~~-~~f~ 456 (465) .+=...|++. +.+| .+.. .+|| T Consensus 365 ~v~~r~~~r~d~~v~~~---~a~v~~t~a 390 (390) T protein:vir:81 365 MITVLAEERLALVVYRP---EALISGSFA 390 (390) T ss_pred cEEEEEEEeeccEEecc---cceEEEEeC Confidence 2234556665 3333 2221 2333 No 70 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=87.01 E-value=0.042 Score=28.15 Aligned_cols=281 Identities=12% Similarity=0.025 Sum_probs=113.9 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccc------cccccccccccccccC Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGS------VAIGDEMDKAATFATK 190 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~------ea~~~e~~t~~s~~~~ 190 (465) |... +.+.- ++....+ ..+++ +-|..|+.+.|...+- T Consensus 1 m~~~-------~~~~~------~t~~g~~------------------------~~~~d~~al~ik~f~~eV~~~f~~~s~ 43 (347) T protein:vir:94 1 MANV-------PGQKI------GTDQGKG------------------------KSSSDALALFLKVFAGEVLTAFTRRSV 43 (347) T ss_pred CCCC-------Ccccc------ccccccC------------------------CccccHHHHHHHHHhHHHHHHHHHHHh Confidence 1111 11000 0000000 00111 1122233332221110 Q ss_pred CccCCCcccccCccccccccccccccch----hhhcc-C----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhh Q lcl|NC_018861. 
191 KATVEAVYTNEALWLKVLKNYTGPYATA----AGEKL-G----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQ 261 (465) Q Consensus 191 ~~~~~~~~~~~a~~~~~~~~~~~~~~Ta----~~E~l-g----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAi 261 (465) ...-...-... ..+..-.+. -|-.+. .++.+ + ..-.|+-++||++.+ +..-+.-.-|.++ T Consensus 44 ~~~~~~~r~i~-~G~sv~i~~-iG~~tv~~~t~G~~l~~~~~~~~~~e~~itID~~~~--------~~~~VddiD~~q~- 112 (347) T protein:vir:94 44 TADKHIVRTIQ-NGKSAQFPV-MGRTSGVYLAPGERLSDKRKGIKHTEKVITIDGLLT--------ADVMIFDIEDAMN- 112 (347) T ss_pred hhccccccccc-ccceEEEec-ccceeeeeecCCCCcCCCCCCCCcceEEEEecchhh--------hhHHhhhHHHHhc- Confidence 00000000000 000000000 000000 11221 1 123456667766422 2333333334444 Q ss_pred hCCCHHHHHHHHHHHHHHHHhhHHHHhhhhhee-eeee------------eeeeccCCccc-HHHHHHHHHHHHHHHHHH Q lcl|NC_018861. 262 HGINAEKELADILSAEVALEIDRTIIEKANEVA-TVCT------------DFDVNSADGRW-FIEKARGLSMRISNEARE 327 (465) Q Consensus 262 HGlDAe~EL~niLstEImlEINreii~~l~~~a-t~~~------------~~~~~~~~~~~-~~e~~~~L~~~i~~~a~~ 327 (465) | .|..+|+..-....+..++++-|++.+...+ ..+. .+.+....... ..+....++..|-+.... T Consensus 113 ~-~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~ 191 (347) T protein:vir:94 113 H-YDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAK 191 (347) T ss_pred C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHH Confidence 2 7899999999999999999999998774211 1111 11111011100 112233444444333333 Q ss_pred HHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCc----------ceEE Q lcl|NC_018861. 328 IGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEF----------DYCT 397 (465) Q Consensus 328 i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~----------dy~~ 397 (465) .-++=---.++|+|++|+.-.+|-....+....-. +..+.....+|.+ .|++||.-++.|. 
.|-+ T Consensus 192 Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~----~~~~~~~G~Vg~i-~G~~V~~Sn~lp~~~~t~~~~~~~~~~ 266 (347) T protein:vir:94 192 LTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYA----ALIDPETGNIRNV-MGFVVVEVPHLVQGGAGETRGDDGITI 266 (347) T ss_pred HhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcc----ccccccccceEEE-eceEEEecCcccccccccccccCccee Confidence 33322222478999999999999766555432111 1112223478888 7899999888764 2222 Q ss_pred E-E------------EecCCCccceeEEecccccc-------eeeeeCCCcccceeeeeeeeee-eecCcc------ccc Q lcl|NC_018861. 398 V-A------------YKGASNFDAGIFFAPYNITL-------QQNLTDPVSGQPAMILNNRYDV-VATPLH------PEA 450 (465) Q Consensus 398 v-g------------~kg~~~~d~glfy~PY~~~~-------~~~~~dp~s~qp~~~~~tRY~l-~~nPf~------~~~ 450 (465) + | |+++-.-..+|+|-|=--+. ...--|+..|-=.|==+..||. ..+|=+ +.+ T Consensus 267 ~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 267 ASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred cCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 2 1 22332333567776653322 1122244444443322333444 445511 111 Q ss_pred c Q lcl|NC_018861. 451 F 451 (465) Q Consensus 451 ~ 451 (465) . T Consensus 347 ~ 347 (347) T protein:vir:94 347 E 347 (347) T ss_pred C Confidence 1 No 71 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=87.00 E-value=0.042 Score=28.15 Aligned_cols=283 Identities=11% Similarity=0.053 Sum_probs=125.0 Q ss_pred ccccch---hHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCccccccc Q lcl|NC_018861. 35 VLENQG---NEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTK 110 (465) Q Consensus 35 l~~n~~---~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~ 110 (465) +-+.+. +.+.+...+++++. ...-|.+. 
.+++.+....+-.+++-+.||++.+.-|.- ..+ +.+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~-~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~----~~~-~~~------ 68 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFK-GYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPH----WIG-DVS------ 68 (320) T ss_pred CCCCccCCHHHHHhhcccccccc-ccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEE----EeC-Ccc------ Confidence 223332 22333222222211 12334444 455666666777888888888776522211 100 000 Q ss_pred ccccCccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccC Q lcl|NC_018861. 111 NAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATK 190 (465) Q Consensus 111 ~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~ 190 (465) +.|. T Consensus 69 ------------------------------------a~~v---------------------------------------- 72 (320) T protein:vir:10 69 ------------------------------------AQWI---------------------------------------- 72 (320) T ss_pred ------------------------------------eEEe---------------------------------------- Confidence 0000 Q ss_pred CccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHH Q lcl|NC_018861. 191 KATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKEL 270 (465) Q Consensus 191 ~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL 270 (465) +| |.++++-..++++++...|..+-...+|.||.+|-. .|.|+.| T Consensus 73 -----------------------------~E--~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i 117 (320) T protein:vir:10 73 -----------------------------GE--GDMKPITKGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTM 117 (320) T ss_pred -----------------------------cC--CccccccccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHH Confidence 00 122333333567778888888888899999999865 5788999 Q ss_pred HHHHHHHHHHHhhHHHHhhhhhe-----eeeeeeeeecc-----CCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEE Q lcl|NC_018861. 
271 ADILSAEVALEIDRTIIEKANEV-----ATVCTDFDVNS-----ADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKL 340 (465) Q Consensus 271 ~niLstEImlEINreii~~l~~~-----at~~~~~~~~~-----~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~ 340 (465) .+.|...|...+|+.+|..--.. +...+...+.. ..+-+. ...+ +..+...+. ..+...... T Consensus 118 ~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---~~~~~~~~~--~~~~~~~~~ 189 (320) T protein:vir:10 118 RTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTA---YDAV---AVNGLSLLV--NAKKKWTHT 189 (320) T ss_pred HHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceeccccccccccc---HHHH---HHHHHhhhh--cccCCCcEE Confidence 99999999999999998521100 00000000000 011111 1111 111111111 233356689 Q ss_pred EecHHHHHHHHhc----CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccc Q lcl|NC_018861. 341 IVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYN 416 (465) Q Consensus 341 ~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~ 416 (465) +++++....|+.. |...+.+..... ......-+++ .+++|++++..+.+-..+ +=|+- +.+++..+. T Consensus 190 v~n~~~~~~L~~lkd~~G~~l~~~~~~~~----~~~~~~~~~i-~g~pv~~~~~~~~~~~~~-~~gd~---~~~~~~~~~ 260 (320) T protein:vir:10 190 LLDDIVEPILNGAKDKNGRPLFIESTYTD----ENSPFRAGRI-VSRPTILSDHVADGTTVG-YMGDF---RNVIWGQVG 260 (320) T ss_pred EEcHHHHHHHHHhhccCCceeeccccccC----ccccccCcee-eeeeeEecCCCCCCceEE-EEeec---ceEEEEEec Confidence 9999999999763 222122111000 0001112333 477899888876543221 11111 112222222 Q ss_pred ccceeee--------eCCCc-----cc-ceeeee--eeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 417 ITLQQNL--------TDPVS-----GQ-PAMILN--NRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 417 ~~~~~~~--------~dp~s-----~q-p~~~~~--tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) ...+.+. .|+.. || -.++|. .|++.. +..+.+ |++. ++. 
-+ T Consensus 261 ~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~--v~~~~a----~~~l-~~~-~a 317 (320) T protein:vir:10 261 GLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFH--NNDKDA----FVKL-TNV-VT 317 (320) T ss_pred CeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccE--Eecccc----eEEE-Eec-cC Confidence 2111111 11111 11 123333 455542 233333 4322 111 12 No 72 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=86.86 E-value=0.043 Score=28.10 Aligned_cols=265 Identities=15% Similarity=0.112 Sum_probs=119.3 Q ss_pred hhhhhhhcccccccc---chhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccc Q lcl|NC_018861. 45 MLMESTVTGDIAKFT---PILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKT 119 (465) Q Consensus 45 ~i~est~t~~v~~~~---P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~ 119 (465) +| |+.+++.-+.+. |.-+ .++..+-++.+-.+++.|-||++.+|-+=-.+ ....+. T Consensus 1 ~l-~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~--~~~~~~---------------- 61 (293) T protein:vir:48 1 ML-DSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEK--WTDITG---------------- 61 (293) T ss_pred Cc-eeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEe--ecCCCc---------------- Confidence 22 222111111111 3332 35566666777777888888877664211111 000000 Q ss_pred ccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCccc Q lcl|NC_018861. 
120 ESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYT 199 (465) Q Consensus 120 a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~ 199 (465) .+ .+ T Consensus 62 --------------------~a------~~-------------------------------------------------- 65 (293) T protein:vir:48 62 --------------------LA------NI-------------------------------------------------- 65 (293) T ss_pred --------------------ce------ee-------------------------------------------------- Confidence 00 00 Q ss_pred ccCccccccccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHH Q lcl|NC_018861. 200 NEALWLKVLKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEV 278 (465) Q Consensus 200 ~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEI 278 (465) . +.|..++|.+ .++++++..+|.-+-...+|-||.+|. .+|.|++|.+-|+..| T Consensus 66 -------------------v--~Eg~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~ 120 (293) T protein:vir:48 66 -------------------D--DEAGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKV 120 (293) T ss_pred -------------------e--cCCcccccccccceeEEEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHH Confidence 0 0022344432 467788888888888889999999986 3688999999999999 Q ss_pred HHHhhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccc Q lcl|NC_018861. 279 ALEIDRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVL 358 (465) Q Consensus 279 mlEINreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~ 358 (465) ..-+|+.|+..+....+..... . .+....|+.++. .. 
+......++++.....|+..-- T Consensus 121 ~~~~~~~i~~g~~~~~~~~~~~-------~--~d~i~~~~~~l~-------~~--~~~~a~~vmn~~~~~~L~~lkd--- 179 (293) T protein:vir:48 121 VVTRNKAILGVVDKLPTKPTLT-------K--WDDIIDLEAKVD-------PA--IKQTSFFLTNTSGFTALKKVKN--- 179 (293) T ss_pred HHHHHhHHhhcccccccccccc-------C--HHHHHHHHHhhh-------hh--hcCCCEEEEcHHHHHHHHHhhc--- Confidence 9999999998764433321111 1 123333333332 11 1233467899999999876421 Q ss_pred cCCcccccccccccceEEEEecCceEEEE--eCCCCcceEEEEEecCCCccceeEEeccccc-------ceeeeeCC--- Q lcl|NC_018861. 359 SPAGSKIDAINSGIKPNVGKFDNRYDVIV--DNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-------LQQNLTDP--- 426 (465) Q Consensus 359 ~~~~~~~~~~~~~~~~~~G~l~~~~~vy~--d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-------~~~~~~dp--- 426 (465) ..+...-.++ -.....++| .|++|++ |.+.+.+ ...+..++|..+... .+...+++ T Consensus 180 -~~g~~l~~~~-~~~~~~~~l-~G~Pv~~~~~~~~~~~---------~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 247 (293) T protein:vir:48 180 -ALGDYLMERD-VKSPTGYSI-AGFAVKEISDRWLPNA---------SSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGG 247 (293) T ss_pred -cCCceEeecC-cCCCCCcee-cceeeEEecccccCCc---------cCCceEEEEEeccceEEEEEecceEEEEecccc Confidence 1111111111 001122455 3556664 2222210 000111222211100 00011111 Q ss_pred ---Ccccceeeeeeeeee-eecCccccccc-ceEE--EeeccceeC Q lcl|NC_018861. 427 ---VSGQPAMILNNRYDV-VATPLHPEAFI-RTFA--VNLNNYIIS 465 (465) Q Consensus 427 ---~s~qp~~~~~tRY~l-~~nPf~~~~~~-~~f~--~~~~~~~~~ 465 (465) .+-|=.+-...||+. ..+| .+.. .+|. 
..--++.-+ T Consensus 248 ~~~~~~~~~~r~~~r~d~~~~~~---~a~~~l~~~~~~~~~~~~~~ 290 (293) T protein:vir:48 248 GAFETDTTKVRVIDRFDVVATDT---EAFVPASFKAIADQKGNIGS 290 (293) T ss_pred hhhhcCeEEEEEEEeeCcEEecc---cceEEEEeeccccCCccccc Confidence 222344555566666 3333 1110 0111 011111111 No 73 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=86.59 E-value=0.044 Score=28.00 Aligned_cols=291 Identities=11% Similarity=0.004 Sum_probs=136.5 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh--hhhhhhhhhhhhhhh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV--PVIRRALPSLIGTEI 78 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DI 78 (465) |-+++..++.+++-|..+...+.++. ....- +.++.. .=|.-+ .++..+..+....++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a-----------------~~~~~-~~~~~~--~iP~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNP-----------------DNVMM-HEKKDG--TLMNEFTTPILQEVMENSKIMQL 60 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhcc-----------------ccccc-cCcCcc--ccchhHHHHHHHHHHhhchhhhh Confidence 99998888888766655544433321 11100 011111 112222 355666777777788 Q ss_pred eeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_018861. 79 AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTD 158 (465) Q Consensus 79 wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPT 158 (465) +-+-||++++--|.-.. +.+ . 
+.+ T Consensus 61 ~~~~~~~~~~~~~p~~~-------~~~----------------------------------~------a~~--------- 84 (324) T protein:vir:96 61 GKYEPMEGTEKKFTFWA-------DKP----------------------------------G------AYW--------- 84 (324) T ss_pred cceeeccCCceEEEEEe-------cCc----------------------------------c------eeE--------- Confidence 88888887652221110 000 0 000 Q ss_pred hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEE Q lcl|NC_018861. 159 NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVL 238 (465) Q Consensus 159 gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~t 238 (465) .+| |..+++...++++++ T Consensus 85 ------------------------------------------------------------v~E--g~~~~~~~~~~~~v~ 102 (324) T protein:vir:96 85 ------------------------------------------------------------VGE--GQKIETSKATWVNAT 102 (324) T ss_pred ------------------------------------------------------------ecC--CccccccccceeEEE Confidence 001 223344444667777 Q ss_pred EEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeee----e-ccCCcccHHHH Q lcl|NC_018861. 239 AEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFD----V-NSADGRWFIEK 313 (465) Q Consensus 239 VtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~----~-~~~~~~~~~e~ 313 (465) ++.|.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|.---.......-.. . ....+.. . T Consensus 103 ~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~---t 175 (324) T protein:vir:96 103 MRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF---T 175 (324) T ss_pred EeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccc---c Confidence 777777777789999999863 67899999999999999999999864311100000000 0 0011111 1 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC-- Q lcl|NC_018861. 
314 ARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA-- 391 (465) Q Consensus 314 ~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~-- 391 (465) +.. |.++...+.. .+.....++++++....|....-- .+..... .. ..++| .|++|++++.. T Consensus 176 ~~~----i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~----~G~~~~~-~~----~~~~l-~G~PV~~~~~~~~ 239 (324) T protein:vir:96 176 QDN----IIDLEALLED--DELEANAFISKTQNRSLLRKIVDP----ETKERIY-DR----NSDSL-DGLPVVNLKSSNL 239 (324) T ss_pred HHH----HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcc----CCCeeec-CC----CCCcc-cceeeEeeCCCCC Confidence 222 2233333322 334566799999999999765211 1111111 11 11344 35688877653 Q ss_pred CcceEEEEEecCCCccceeEEecccccceeeeeCC--------C-----cc-cceeeee--eeeeeeecCcccccccceE Q lcl|NC_018861. 392 EFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDP--------V-----SG-QPAMILN--NRYDVVATPLHPEAFIRTF 455 (465) Q Consensus 392 ~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp--------~-----s~-qp~~~~~--tRY~l~~nPf~~~~~~~~f 455 (465) +...+++|-. +.++++........+.-++ . -| +-.++|. .|++. .+.+|.+ | T Consensus 240 ~~~~~~~gd~------~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~--~v~~~~A----~ 307 (324) T protein:vir:96 240 KRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL--HIADDKA----F 307 (324) T ss_pred CcceEEEEec------ceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc--EEecccc----e Confidence 3223433311 1122222222222211111 0 01 1223433 34444 2333333 3 Q ss_pred EEeeccc-----eeC Q lcl|NC_018861. 456 AVNLNNY-----IIS 465 (465) Q Consensus 456 ~~~~~~~-----~~~ 465 (465) ++ |++- --. 
T Consensus 308 ~~-l~~a~~~~~~~~ 321 (324) T protein:vir:96 308 AK-LVPADKRTDSVP 321 (324) T ss_pred EE-EecccccCCCCC Confidence 32 1110 000 No 74 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=86.59 E-value=0.044 Score=28.00 Aligned_cols=291 Identities=11% Similarity=0.004 Sum_probs=136.5 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh--hhhhhhhhhhhhhhh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV--PVIRRALPSLIGTEI 78 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DI 78 (465) |-+++..++.+++-|..+...+.++. ....- +.++.. .=|.-+ .++..+..+....++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a-----------------~~~~~-~~~~~~--~iP~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNP-----------------DNVMM-HEKKDG--TLMNEFTTPILQEVMENSKIMQL 60 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhcc-----------------ccccc-cCcCcc--ccchhHHHHHHHHHHhhchhhhh Confidence 99998888888766655544433321 11100 011111 112222 355666777777788 Q ss_pred eeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_018861. 79 AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTD 158 (465) Q Consensus 79 wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPT 158 (465) +-+-||++++--|.-.. +.+ . +.+ T Consensus 61 ~~~~~~~~~~~~~p~~~-------~~~----------------------------------~------a~~--------- 84 (324) T protein:vir:78 61 GKYEPMEGTEKKFTFWA-------DKP----------------------------------G------AYW--------- 84 (324) T ss_pred cceeeccCCceEEEEEe-------cCc----------------------------------c------eeE--------- Confidence 88888887652221110 000 0 000 Q ss_pred hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEE Q lcl|NC_018861. 
159 NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVL 238 (465) Q Consensus 159 gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~t 238 (465) .+| |..+++...++++++ T Consensus 85 ------------------------------------------------------------v~E--g~~~~~~~~~~~~v~ 102 (324) T protein:vir:78 85 ------------------------------------------------------------VGE--GQKIETSKATWVNAT 102 (324) T ss_pred ------------------------------------------------------------ecC--CccccccccceeEEE Confidence 001 223344444667777 Q ss_pred EEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeee----e-ccCCcccHHHH Q lcl|NC_018861. 239 AEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFD----V-NSADGRWFIEK 313 (465) Q Consensus 239 VtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~----~-~~~~~~~~~e~ 313 (465) ++.|.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|.---.......-.. . ....+.. . T Consensus 103 ~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~---t 175 (324) T protein:vir:78 103 MRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF---T 175 (324) T ss_pred EeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccc---c Confidence 777777777789999999863 67899999999999999999999864311100000000 0 0011111 1 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC-- Q lcl|NC_018861. 314 ARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA-- 391 (465) Q Consensus 314 ~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~-- 391 (465) +.. |.++...+.. .+.....++++++....|....-- .+..... .. ..++| .|++|++++.. 
T Consensus 176 ~~~----i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~----~G~~~~~-~~----~~~~l-~G~PV~~~~~~~~ 239 (324) T protein:vir:78 176 QDN----IIDLEALLED--DELEANAFISKTQNRSLLRKIVDP----ETKERIY-DR----NSDSL-DGLPVVNLKSSNL 239 (324) T ss_pred HHH----HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcc----CCCeeec-CC----CCCcc-cceeeEeeCCCCC Confidence 222 2233333322 334566799999999999765211 1111111 11 11344 35688877653 Q ss_pred CcceEEEEEecCCCccceeEEecccccceeeeeCC--------C-----cc-cceeeee--eeeeeeecCcccccccceE Q lcl|NC_018861. 392 EFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDP--------V-----SG-QPAMILN--NRYDVVATPLHPEAFIRTF 455 (465) Q Consensus 392 ~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp--------~-----s~-qp~~~~~--tRY~l~~nPf~~~~~~~~f 455 (465) +...+++|-. +.++++........+.-++ . -| +-.++|. .|++. .+.+|.+ | T Consensus 240 ~~~~~~~gd~------~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~--~v~~~~A----~ 307 (324) T protein:vir:78 240 KRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL--HIADDKA----F 307 (324) T ss_pred CcceEEEEec------ceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc--EEecccc----e Confidence 3223433311 1122222222222211111 0 01 1223433 34444 2333333 3 Q ss_pred EEeeccc-----eeC Q lcl|NC_018861. 456 AVNLNNY-----IIS 465 (465) Q Consensus 456 ~~~~~~~-----~~~ 465 (465) ++ |++- --. T Consensus 308 ~~-l~~a~~~~~~~~ 321 (324) T protein:vir:78 308 AK-LVPADKRTDSVP 321 (324) T ss_pred EE-EecccccCCCCC Confidence 32 1110 000 No 75 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=85.91 E-value=0.049 Score=27.75 Aligned_cols=274 Identities=9% Similarity=0.037 Sum_probs=124.2 Q ss_pred hhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 
46 LMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 46 i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) +++.++++.-.-.-+.+. .+++++..+.+-.+++-|.||++++--|--.. . +.. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~----~-~~~-------------------- 55 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA----T-LPE-------------------- 55 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEe----C-Ccc-------------------- Confidence 444332221111222222 45677777777888899999987762221111 0 000 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcc Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALW 204 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~ 204 (465) +.|. .|.. T Consensus 56 ----------------------a~wv----------------------------~E~~---------------------- 63 (305) T protein:vir:25 56 ----------------------ADWV----------------------------GESA---------------------- 63 (305) T ss_pred ----------------------eEEe----------------------------eccc---------------------- Confidence 0000 0000 Q ss_pred ccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhH Q lcl|NC_018861. 
205 LKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDR 284 (465) Q Consensus 205 ~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINr 284 (465) +.....++.-.-++++++..++..+-.-.+|-||.+|- ..|.|++|.+-|+..|...+|+ T Consensus 64 ----------------~~~~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~a~~~d~ 123 (305) T protein:vir:25 64 ----------------TDPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQ 123 (305) T ss_pred ----------------ccccccccccccceeeEEeeeEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHhh Confidence 00000111112246667777777777788999999984 3688999999999999999999 Q ss_pred HHHhhhhhee-----ee--eeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCccc Q lcl|NC_018861. 285 TIIEKANEVA-----TV--CTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFV 357 (465) Q Consensus 285 eii~~l~~~a-----t~--~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~ 357 (465) .+|.--...- .. .....-+.+...........+...+.++...+...-. ..+-+++++.....|+.. + T Consensus 124 a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l---k 198 (305) T protein:vir:25 124 AVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANI---R 198 (305) T ss_pred hheeccCCCCCccccccccccccccccccccccchhhhHHHHHHHHHHHhhhhccc--ccceeEecHHHHHHHHHh---h Confidence 9985321100 00 0000000111111112233444445544444443322 345578899988888653 1 Q ss_pred ccCCcccccccccccceEEEEecCceEEEEeCCCCcc----eEEEEEecCCCccceeEEecccccceeee--------eC Q lcl|NC_018861. 358 LSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD----YCTVAYKGASNFDAGIFFAPYNITLQQNL--------TD 425 (465) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d----y~~vg~kg~~~~d~glfy~PY~~~~~~~~--------~d 425 (465) . ..+...-. -++| .+++|+|+.+.+.+ -+++|-. ..+++...-.....+. .. 
T Consensus 199 d-~~G~~i~~--------~~~l-~G~Pv~~~~~~~~~~~~~~~~~gd~------s~~~i~~~~~~~i~~~~~~~~~~~~~ 262 (305) T protein:vir:25 199 D-ANGNPVFR--------DDSF-AGFRTFFNRNGAWDADAAIEVIADS------SRVKIGVRQDITVKFLDQATLGTGEN 262 (305) T ss_pred c-cCCceeec--------CCcc-cccceEEcCccCCCCCccEEEEEec------ceEEEEEecCeEEEEeeeeeeecCCc Confidence 1 11111111 1345 45788887764332 1222210 0011111111111110 01 Q ss_pred CCc-cc-ceee--eeeeeee-eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 426 PVS-GQ-PAMI--LNNRYDV-VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 426 p~s-~q-p~~~--~~tRY~l-~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +.+ || ..++ ...|||+ +.||=. | +-++++-.+ T Consensus 263 ~~~~~~~~~~~~R~~~r~~~~v~~p~a-------~-v~~~~~~~~ 299 (305) T protein:vir:25 263 QINLAERDMVALRLKARFAYVLGVSAT-------A-QGANKTPVA 299 (305) T ss_pred eeeeeecCcEEEEEEEeecceeeCccc-------E-EEEcccccc Confidence 111 21 1222 3567886 567631 1 233333333 No 76 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=85.62 E-value=0.051 Score=27.65 Aligned_cols=267 Identities=12% Similarity=0.014 Sum_probs=114.9 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCcccccccccc--ccccccccCCccCCCcccccCccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEM--DKAATFATKKATVEAVYTNEALWLKVLKNYTG 213 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~--~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~ 213 (465) +.-.+.+.....+ .|+---..+...+.. ...+.+-+ +...++. .+....... ++.. 
T Consensus 1 ~~~~~~T~l~d~i---~PEv~~~~v~~~~~~-------~~~~~~~~~~~~~l~g~------~G~tv~iP~-----~~~i- 58 (275) T protein:vir:96 1 MALENMTKLANMV---NPEVLAPMMQAELDK-------KLKFAQFADIDNTLVGQ------PGNTITFPA-----FVYS- 58 (275) T ss_pred CCCcccchhhhhh---chHHHHHHHHHHHHH-------hhhhcccceecccccCC------CCCEEEeee-----eccC- Confidence 1000001111100 000000000000000 00010000 0111110 000000000 0100 Q ss_pred cccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh-CCCHHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_018861. 214 PYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH-GINAEKELADILSAEVALEIDRTIIEKANE 292 (465) Q Consensus 214 ~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GlDAe~EL~niLstEImlEINreii~~l~~ 292 (465) +-.+...|...-+..++.+ .+.+++-|.|.-.=+++ |+-+.. +-|.-.|..+-++..|..+++.+++..+.. T Consensus 59 g~a~~~~~g~~i~~~~lt~--~~~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~ 131 (275) T protein:vir:96 59 GDAKVVPEGEEIPIDLIET--KKRQATIRKIGKGTVLT-----DEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQG 131 (275) T ss_pred CccccccCCCCcchhhccc--ceeeEEeehhccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 1111122221223444443 44445555553333333 443333 468889999999999999999999988854 Q ss_pred eeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccccccc Q lcl|NC_018861. 293 VATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGI 372 (465) Q Consensus 293 ~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~ 372 (465) ... .+ ..+....+.+-....++.++. ...++++++|++++.|.......+..+..... +--. 
T Consensus 132 a~~-----~~--~~~~~~~d~i~dA~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~--~~~~ 193 (275) T protein:vir:96 132 ATL-----KV--EADITKLAGLQTAIDKFNDED---------LEPMVLFVNPLDAGKLRASATDNFTRATLLGD--NVIV 193 (275) T ss_pred ccc-----cc--cccccCHHHHHHHHHHhcccc---------CCccEEEeCHHHHHHHHhcccccccccccccc--ccee Confidence 211 11 112222233434333333221 25789999999999997775443333211111 1111 Q ss_pred ceEEEEecCceEEEEeCCCCcceE-EEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCc---c Q lcl|NC_018861. 373 KPNVGKFDNRYDVIVDNFAEFDYC-TVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPL---H 447 (465) Q Consensus 373 ~~~~G~l~~~~~vy~d~~~~~dy~-~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf---~ 447 (465) ...+|++ .|++||++...|..=. ++| +|+ -.|+.. -+...-.--|+.+++-.|-...+||+ ..||= . T Consensus 194 ~G~ig~~-~G~~Vi~s~~~p~~t~~i~~-~gA-----~~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~ 265 (275) T protein:vir:96 194 KGAFGEA-LGAIIVRSNKIKEGEAILAK-RGA-----VKLITK-RDFFLETERHASHKSTALFSDKHYVAYLYDESKVVK 265 (275) T ss_pred cccccee-cCeeEEEeCCCCcceEEEEe-ccc-----eeeeec-CCcccccccchhhcCcEEEEeEEEEEEEEcCccEEE Confidence 3357776 7889999998774321 222 121 111110 01111112388899999988899987 66661 1 Q ss_pred cccccceEEE Q lcl|NC_018861. 448 PEAFIRTFAV 457 (465) Q Consensus 448 ~~~~~~~f~~ 457 (465) -+...-+.+| T Consensus 266 ~t~~~~~~~~ 275 (275) T protein:vir:96 266 ITKSASGLGV 275 (275) T ss_pred EEecccccCC Confidence 1222222223 No 77 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=85.40 E-value=0.053 Score=27.57 Aligned_cols=280 Identities=13% Similarity=0.080 Sum_probs=119.4 Q ss_pred ccccccccccccccccccchhhhheeeeeccCccc------cccccccccccccccCCccC----CCcccccCccccccc Q lcl|NC_018861. 
140 TATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGS------VAIGDEMDKAATFATKKATV----EAVYTNEALWLKVLK 209 (465) Q Consensus 140 tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~------ea~~~e~~t~~s~~~~~~~~----~~~~~~~a~~~~~~~ 209 (465) ++...+ |.-...+......+++ |-+..|+.+.|...+-...- ....+.+. .+. T Consensus 1 ~a~~~~-------------~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv----~~~ 63 (347) T protein:vir:88 1 MANATG-------------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSA----SFP 63 (347) T ss_pred CCCccc-------------chhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceE----EEe Confidence 110000 0000111111111222 11222333333221110000 00000000 000 Q ss_pred ccccc--ccchhhhccC-----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh-CCCHHHHHHHHHHHHHHHH Q lcl|NC_018861. 210 NYTGP--YATAAGEKLG-----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH-GINAEKELADILSAEVALE 281 (465) Q Consensus 210 ~~~~~--~~Ta~~E~lg-----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GlDAe~EL~niLstEImlE 281 (465) ..+.. .-...++.+. .+..|.-++||+.. |.=.+.-|+-... -.|.-+|+..-....+..+ T Consensus 64 ~iG~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~-----------y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~ 132 (347) T protein:vir:88 64 VMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLL-----------TSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIA 132 (347) T ss_pred eecceeeeeeccccCCCCCCCCCccceEEEEEechh-----------hhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHH Confidence 00000 0001122221 13567777888642 3333333433322 2788999999999999999 Q ss_pred hhHHHHhhhhheeeee-----------eeeeeccCCccc---HHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHH Q lcl|NC_018861. 282 IDRTIIEKANEVATVC-----------TDFDVNSADGRW---FIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVA 347 (465) Q Consensus 282 INreii~~l~~~at~~-----------~~~~~~~~~~~~---~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va 347 (465) +++-|++.|...+... ....+....+.- .......++..|-+.....-.+=---.++|+|++|+.. 
T Consensus 133 ~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y 212 (347) T protein:vir:88 133 ADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDY 212 (347) T ss_pred HHHHHHHHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHH Confidence 9999998875432111 011111111100 00112333333333333333222222578999999998 Q ss_pred HHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCc---------ceE-EEE------------EecCCC Q lcl|NC_018861. 348 TILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEF---------DYC-TVA------------YKGASN 405 (465) Q Consensus 348 ~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~---------dy~-~vg------------~kg~~~ 405 (465) ..|-....+... + ..+..+.....+|.+ .+++||.-++.|. +++ ..+ |+++.. T Consensus 213 ~~Ll~~~~~~~~---~-~~~~~~~~~G~vg~i-~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~ 287 (347) T protein:vir:88 213 SAILSALMPNAA---N-YAALIDPETGNIRNV-MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQN 287 (347) T ss_pred HHHhcchhhhhh---h-hccccchhcceeeee-ccceEEEeecccccccccccccccccccccccccccccccccccccC Confidence 888665544311 1 111112223466776 6889998887663 111 111 223333 Q ss_pred ccceeEEeccccc-------ceeeeeCCCcccceeeeeeeeee-eecC-----ccccccc Q lcl|NC_018861. 406 FDAGIFFAPYNIT-------LQQNLTDPVSGQPAMILNNRYDV-VATP-----LHPEAFI 452 (465) Q Consensus 406 ~d~glfy~PY~~~-------~~~~~~dp~s~qp~~~~~tRY~l-~~nP-----f~~~~~~ 452 (465) -..+|||.|=--+ ...+--||..|.=.|==+..||. 
..+| ++....+ T Consensus 288 ~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 288 NVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred cEEEEEechhhhhheecccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 3456777765322 23334577777665544555555 5566 2222222 No 78 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=83.71 E-value=0.066 Score=27.05 Aligned_cols=306 Identities=11% Similarity=0.016 Sum_probs=128.9 Q ss_pred CCc-cchhhhHHHhhhhhhccc-ccc---------Chhhh-hheehccccch---hHHHh--h--hhh---hhccccccc Q lcl|NC_018861. 1 MAD-KYLLDESTKEKFITSNLY-PNL---------NESEK-NIMRTVLENQG---NEVKM--L--MES---TVTGDIAKF 58 (465) Q Consensus 1 ~~~-~~~~~e~~~e~~~~~~~~-~~~---------~~~~~-~~~~~l~~n~~---~~~~~--i--~es---t~t~~v~~~ 58 (465) |+. +....+++..+-..+... +.+ .+.+. .....+..+.+ +.+.. + .+. +++++.... T Consensus 39 ~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 118 (379) T protein:vir:10 39 MTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGA 118 (379) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccc Confidence 111 000000111111000000 000 00000 00000000000 00000 0 010 111221111 Q ss_pred cchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccc Q lcl|NC_018861. 59 TPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV 136 (465) Q Consensus 59 ~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~ 136 (465) =|.-+ .+++..-....-.|++.|.||++++.-|.-.. 
+.+ T Consensus 119 ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~------~~~-------------------------------- 160 (379) T protein:vir:10 119 QPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVREN------GAG-------------------------------- 160 (379) T ss_pred cchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEee------cCC-------------------------------- Confidence 12211 33444555556677788888877652221000 000 Q ss_pred cccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccccc Q lcl|NC_018861. 137 SFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYA 216 (465) Q Consensus 137 s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 216 (465) .+...| T Consensus 161 -------~~~~~~------------------------------------------------------------------- 166 (379) T protein:vir:10 161 -------EGAIGA------------------------------------------------------------------- 166 (379) T ss_pred -------Cccccc------------------------------------------------------------------- Confidence 000000 Q ss_pred chhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeee Q lcl|NC_018861. 217 TAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATV 296 (465) Q Consensus 217 Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~ 296 (465) .+| |...+++..++++++..+|.=+--...|-||.||-- +.++.|.+-|+..|..-+|..++.-+....+. T Consensus 167 --v~E--g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-----~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~ 237 (379) T protein:vir:10 167 --QVE--GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLP-----FLTSFIPNALRRDYAKAENAAFNAVLAANATA 237 (379) T ss_pred --ccC--CccccccccceeeeEeeeeeEEeeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 001 233455555666777777666666789999999952 36789999999999999999988776433221 Q ss_pred eeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCccccccccccc Q lcl|NC_018861. 
297 CTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGI 372 (465) Q Consensus 297 ~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~ 372 (465) + .....+...++....++.++. ..+...+.+|+++.....|... |.....|..... + + T Consensus 238 ~----~~~~~~~~~~d~i~~~~~~~~---------~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~---~--~ 299 (379) T protein:vir:10 238 S----TEIITNKNKVEMLINEIAKQE---------NLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQ---D--N 299 (379) T ss_pred c----cccccCcccHHHHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCC---C--C Confidence 1 111122122233333332222 1233566788999998888654 222222111100 0 0 Q ss_pred ceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc-ceeeeeCCC--cccceeee--eeeeee-eecCc Q lcl|NC_018861. 373 KPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-LQQNLTDPV--SGQPAMIL--NNRYDV-VATPL 446 (465) Q Consensus 373 ~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-~~~~~~dp~--s~qp~~~~--~tRY~l-~~nPf 446 (465) . .-+| .|++|+++++.+...+++|=-. ..-+++ ..+ ...+..++. --+..++| ..|+|+ +.+| T Consensus 300 ~--~~~l-~G~pvv~s~~~~ag~~~~gdf~----~~~~~~---~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p- 368 (379) T protein:vir:10 300 G--VLRI-NGIPLFRATWLAANKYYVGDWT----RVTKVT---TEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQP- 368 (379) T ss_pred C--ccee-cceeeEecCCCCCCceEEeecc----cEEEEE---EeceEEEEeecccccccCCcEEEEEEEEeccEEecC- Confidence 0 0134 3679999998877665553211 011111 111 111111221 12344444 368877 5555 Q ss_pred ccccccceEE-Eeeccc Q lcl|NC_018861. 
447 HPEAFIRTFA-VNLNNY 462 (465) Q Consensus 447 ~~~~~~~~f~-~~~~~~ 462 (465) ++ |+ ++++.- T Consensus 369 --~a----~v~~~~~~~ 379 (379) T protein:vir:10 369 --AA----LIFGDFTAV 379 (379) T ss_pred --cc----EEEEEecCC Confidence 33 44 455544 No 79 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=83.25 E-value=0.07 Score=26.92 Aligned_cols=290 Identities=11% Similarity=0.051 Sum_probs=121.1 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhh-hccccccccchhh--hhhhhhhhhhhhhh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMEST-VTGDIAKFTPILV--PVIRRALPSLIGTE 77 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est-~t~~v~~~~P~l~--~l~~ra~~~lI~~D 77 (465) ...+..........|...+.+..+.+-++..... .+ .+-+..++ ++|.+. =|.-+ .++..+.++.+-.+ T Consensus 84 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-----~~-~~a~~~~~~~~gg~l--vP~~~~~~ii~~~~~~~~l~~ 155 (397) T protein:vir:12 84 GQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDS-----PE-FRAMSGINDEDGGIL--IPEDIGRQIHEFKRQFEPLEQ 155 (397) T ss_pred cchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhh-----hh-hhhccccccccCccc--CchhHHHHHHHhhhhhhhHHh Confidence 0000000011111122222222221111111000 00 00111111 112211 12222 35566777778889 Q ss_pred heeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_018861. 78 IAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGT 157 (465) Q Consensus 78 IwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGP 157 (465) ++.+.||+++.|-+--.|.. +.+ . +.|... 
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~-----~~~----------------------------------~------a~~v~E----- 185 (397) T protein:vir:12 156 YVTVEPVTTRSGTRLLEKNA-----DMV----------------------------------P------FSPVEE----- 185 (397) T ss_pred hcceeeccCCceeEEEEEec-----CCc----------------------------------c------eeeecc----- Confidence 99999999988754322200 000 0 000000 Q ss_pred chhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcc-eEEEE Q lcl|NC_018861. 158 DNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMG-ISVQR 236 (465) Q Consensus 158 TgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK 236 (465) |...++-+ -++++ T Consensus 186 ------------------------------------------------------------------g~~~~~~~~~~~~~ 199 (397) T protein:vir:12 186 ------------------------------------------------------------------LGNLPEIDQPRFTK 199 (397) T ss_pred ------------------------------------------------------------------ccccccccccccee Confidence 00111111 13455 Q ss_pred EEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHH Q lcl|NC_018861. 237 VLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDVNSADGRWFIEKARG 316 (465) Q Consensus 237 ~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~ 316 (465) ++..++.-+-...+|-||.+|-- +|.++.|.+.|...|...+|+.||.-.-.. .+.|--..+.... T Consensus 200 v~~~~~k~~~~~~is~e~l~ds~----~~l~~~i~~~l~~~~~~~~d~~il~G~g~~----------~~~g~~~~~~i~~ 265 (397) T protein:vir:12 200 VSYSIIDYGGIMTLSNSMLNDSD----QAIMTYVAKWFAKKSVVTRNNLILAAIASL----------KKVDIDGLDGIKK 265 (397) T ss_pred EEeeheeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHhccccc----------cccccccHHHHHH Confidence 55555555555669999998853 577899999999999999999988765221 1122111222222 Q ss_pred HH-HHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccceEEEEecCceEEEEeCCC Q lcl|NC_018861. 
317 LS-MRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA 391 (465) Q Consensus 317 L~-~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~ 391 (465) ++ ..++ ..+..+..+++++...+.|+.. |-..+.|... ...-++| .|++|++.+.. T Consensus 266 ~~~~~l~---------~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~---------~g~~~~l-~G~pv~~~~~~ 326 (397) T protein:vir:12 266 ALNVTLD---------PMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPT---------NPTKKLL-DGRPVVPFTNR 326 (397) T ss_pred HHhhccc---------hhhhCCCEEEEcHHHHHHHHHhhccCCceeeccccc---------CCCCccc-cceeeEEeccc Confidence 11 1221 1223445688999999988764 3222222110 1122455 45577765431 Q ss_pred CcceEEEEEecCCCccceeEEecccc---------cceeeeeCCC----cccceeeeeeeeee-eecCcccccc-cceEE Q lcl|NC_018861. 392 EFDYCTVAYKGASNFDAGIFFAPYNI---------TLQQNLTDPV----SGQPAMILNNRYDV-VATPLHPEAF-IRTFA 456 (465) Q Consensus 392 ~~dy~~vg~kg~~~~d~glfy~PY~~---------~~~~~~~dp~----s~qp~~~~~tRY~l-~~nPf~~~~~-~~~f~ 456 (465) .. +. ...+.-++|+.+.- +.....-.+. +-+-.+-...|++. +.|| .+. ..+|+ T Consensus 327 ~~-----~~---~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~---~a~~~~~~t 395 (397) T protein:vir:12 327 VL-----KT---QKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDE---DAVVFGQIT 395 (397) T ss_pred cc-----cc---CCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc---cceEEEEEe Confidence 11 00 01111233333211 1111111111 22334445666766 3333 221 12222 Q ss_pred Ee Q lcl|NC_018861. 
457 VN 458 (465) Q Consensus 457 ~~ 458 (465) +- T Consensus 396 ~~ 397 (397) T protein:vir:12 396 VE 397 (397) T ss_pred eC Confidence 22 No 80 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=83.07 E-value=0.072 Score=26.86 Aligned_cols=267 Identities=13% Similarity=0.016 Sum_probs=113.6 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) ++... +.....+ .|+---..+...++.+ ..+.+-+.......+ ..+....... ++.. +- T Consensus 1 ma~~~-T~l~d~i---iPev~~~~v~~~~~~~-------l~~~~~~~~d~~l~g----~~G~tv~iP~-----~~~i-g~ 59 (274) T protein:vir:12 1 MAQGL-TKTSNQI---IPEVLAPMMQAQLEKK-------LRFASFAEVDSTLQG----QPGDTLTFPA-----FVYS-GD 59 (274) T ss_pred CCcce-eehhhhh---chHHHHHHHHHHHHhh-------hhhcccceecccccC----CCCCEEEEee-----ecCC-Cc Confidence 11100 0001100 0000000111111110 000000111000000 0000000000 0000 11 Q ss_pred cchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee Q lcl|NC_018861. 216 ATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVAT 295 (465) Q Consensus 216 ~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at 295 (465) .....|..+-+..++..+=. +++-+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|..+++.+++..+.... 
T Consensus 60 a~~~~~g~~i~~~~lt~~~~--~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~- 132 (274) T protein:vir:12 60 AQVVAEGEKIPTDILETKKR--EAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK- 132 (274) T ss_pred cccccCCCccchhhccccee--eEEeeeecceeeecHH--HHHhc--ccchHHHHHHHHHHHHHHHHHHHHHHHHhccc- Confidence 11122221223444444433 3333444322222211 12223 57888999999999999999999999885321 Q ss_pred eeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceE Q lcl|NC_018861. 296 VCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPN 375 (465) Q Consensus 296 ~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (465) .++. ......+.+-....++..+. ..+++++++|++++.|...+...|..+.... .+-..... T Consensus 133 ----~~~~--~~a~~~d~i~dA~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g--~~~~~~G~ 195 (274) T protein:vir:12 133 ----LTVN--ADITKLNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDASTNFTRATELG--DDIIVKGA 195 (274) T ss_pred ----cccc--ccccCHHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhhhhhcccccccc--ccceeccc Confidence 1111 11222233333333333321 2568999999999999988755544432211 01111346 Q ss_pred EEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecC-----cc-c Q lcl|NC_018861. 376 VGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATP-----LH-P 448 (465) Q Consensus 376 ~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP-----f~-~ 448 (465) +|++ .|++||+|...|..-..+--+|+-. ++. -.+...-.--||..++-.+-..-+||+ ..|| .+ + T Consensus 196 ig~~-~G~~Vi~s~~~p~~t~~l~~~gA~~-----~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 196 FGEA-LGAIIVRSNKLEAGTAILAKKGAVK-----LIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred ceee-cCeeEEEeCCCCcceEEEEecccee-----eee-cCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcC Confidence 8887 5789999998875332111112111 111 111111112388899998888888987 5566 11 1 Q ss_pred ccccce Q lcl|NC_018861. 
449 EAFIRT 454 (465) Q Consensus 449 ~~~~~~ 454 (465) +++-+- T Consensus 269 ~~~~~~ 274 (274) T protein:vir:12 269 SGSLEM 274 (274) T ss_pred CccccC Confidence 111111 No 81 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=82.65 E-value=0.075 Score=26.75 Aligned_cols=305 Identities=13% Similarity=0.069 Sum_probs=131.8 Q ss_pred CCc-----------cchhhhHHHhh-hhh-hccc--cccChhhhhheehccccc--hhHHHh---------hhhhhhc-c Q lcl|NC_018861. 1 MAD-----------KYLLDESTKEK-FIT-SNLY--PNLNESEKNIMRTVLENQ--GNEVKM---------LMESTVT-G 53 (465) Q Consensus 1 ~~~-----------~~~~~e~~~e~-~~~-~~~~--~~~~~~~~~~~~~l~~n~--~~~~~~---------i~est~t-~ 53 (465) +.+ +.+.++...++ -.+ .... ....++++....-|.... .+.+.. +...+++ | T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 100 00000000000 000 0000 000112222222221111 011111 1122211 1 Q ss_pred ccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccc Q lcl|NC_018861. 54 DIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTG 131 (465) Q Consensus 54 ~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg 131 (465) .. .. |.-+ .+++....+..-.++++|.||++++|-+.-.+ ..+ .. 
T Consensus 115 g~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~--------------------------- 161 (392) T protein:vir:10 115 GL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK--NSD--MI--------------------------- 161 (392) T ss_pred ce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecC--Cc--------------------------- Confidence 11 11 3222 34455666667778999999999886432111 100 00 Q ss_pred ccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 132 TPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 132 ~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) . +.|.. | T Consensus 162 --------~------a~~v~----------------------------E------------------------------- 168 (392) T protein:vir:10 162 --------P------FAEIT----------------------------E------------------------------- 168 (392) T ss_pred --------c------ceeec----------------------------c------------------------------- Confidence 0 00000 0 Q ss_pred cccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA 290 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l 290 (465) |...++-+ -++++++..++.-+-...+|-||.+|- ..|.+++|.+.|...|...+|..|+.-. T Consensus 169 ------------~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~ 232 (392) T protein:vir:10 169 ------------MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) T ss_pred ------------cccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 00111110 135566666666666678999999994 3678899999999999999999998765 Q ss_pred hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCccccc Q lcl|NC_018861. 
291 NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKID 366 (465) Q Consensus 291 ~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~ 366 (465) ..... ++... .+....++.. .+.+ .+-..-..|+++.....|+.. |-..+.|. T Consensus 233 g~~~~---------~~~~~-~d~i~~~~~~--~l~~------~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~----- 289 (392) T protein:vir:10 233 EKLTK---------QAIKS-LDDIKDVLNV--KLDP------AISPNAILLTNQDGFNYLDKLKDKDGKYILQSD----- 289 (392) T ss_pred ccccc---------cCccC-HHHHHHHHHH--hhhh------hhccCCEEEEcHHHHHHHHHhhccCCCeEeecC----- Confidence 32211 11111 1223332211 1122 222334578999999999775 22222221 Q ss_pred ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc-------ceeeeeCCC------ccccee Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-------LQQNLTDPV------SGQPAM 433 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-------~~~~~~dp~------s~qp~~ 433 (465) ......++|.|...|+++... .++.++...-+..++|+++..+ .+...+++. +.+=.+ T Consensus 290 ----~~~~~~~tllG~~~v~~~~~~-----~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 290 ----PTQKNKKLFAGTNPVVVVSNR-----FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred ----ccCCccccccCcccEEEeccc-----ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 111234677776677654432 1122333333444556554321 111122332 233445 Q ss_pred eeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 434 ILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 434 ~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -...|+|.. 
+.++ ..|+ ++++...-+ T Consensus 361 r~~~r~d~~--v~~~----~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 361 RAIQRDDVQ--MWDN----EAAVYGEIDLSAPV 387 (392) T ss_pred EEEEeeccE--Eecc----cceEEEEecccccc Confidence 666777752 2222 3344 344443333 No 82 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=82.65 E-value=0.075 Score=26.75 Aligned_cols=305 Identities=13% Similarity=0.069 Sum_probs=131.8 Q ss_pred CCc-----------cchhhhHHHhh-hhh-hccc--cccChhhhhheehccccc--hhHHHh---------hhhhhhc-c Q lcl|NC_018861. 1 MAD-----------KYLLDESTKEK-FIT-SNLY--PNLNESEKNIMRTVLENQ--GNEVKM---------LMESTVT-G 53 (465) Q Consensus 1 ~~~-----------~~~~~e~~~e~-~~~-~~~~--~~~~~~~~~~~~~l~~n~--~~~~~~---------i~est~t-~ 53 (465) +.+ +.+.++...++ -.+ .... ....++++....-|.... .+.+.. +...+++ | T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 100 00000000000 000 0000 000112222222221111 011111 1122211 1 Q ss_pred ccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccc Q lcl|NC_018861. 54 DIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTG 131 (465) Q Consensus 54 ~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg 131 (465) .. .. |.-+ .+++....+..-.++++|.||++++|-+.-.+ ..+ .. 
T Consensus 115 g~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~--------------------------- 161 (392) T protein:vir:10 115 GL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK--NSD--MI--------------------------- 161 (392) T ss_pred ce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecC--Cc--------------------------- Confidence 11 11 3222 34455666667778999999999886432111 100 00 Q ss_pred ccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 132 TPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 132 ~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) . +.|.. | T Consensus 162 --------~------a~~v~----------------------------E------------------------------- 168 (392) T protein:vir:10 162 --------P------FAEIT----------------------------E------------------------------- 168 (392) T ss_pred --------c------ceeec----------------------------c------------------------------- Confidence 0 00000 0 Q ss_pred cccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA 290 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l 290 (465) |...++-+ -++++++..++.-+-...+|-||.+|- ..|.+++|.+.|...|...+|..|+.-. T Consensus 169 ------------~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~ 232 (392) T protein:vir:10 169 ------------MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) T ss_pred ------------cccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 00111110 135566666666666678999999994 3678899999999999999999998765 Q ss_pred hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCccccc Q lcl|NC_018861. 
291 NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKID 366 (465) Q Consensus 291 ~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~ 366 (465) ..... ++... .+....++.. .+.+ .+-..-..|+++.....|+.. |-..+.|. T Consensus 233 g~~~~---------~~~~~-~d~i~~~~~~--~l~~------~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~----- 289 (392) T protein:vir:10 233 EKLTK---------QAIKS-LDDIKDVLNV--KLDP------AISPNAILLTNQDGFNYLDKLKDKDGKYILQSD----- 289 (392) T ss_pred ccccc---------cCccC-HHHHHHHHHH--hhhh------hhccCCEEEEcHHHHHHHHHhhccCCCeEeecC----- Confidence 32211 11111 1223332211 1122 222334578999999999775 22222221 Q ss_pred ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc-------ceeeeeCCC------ccccee Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-------LQQNLTDPV------SGQPAM 433 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-------~~~~~~dp~------s~qp~~ 433 (465) ......++|.|...|+++... .++.++...-+..++|+++..+ .+...+++. +.+=.+ T Consensus 290 ----~~~~~~~tllG~~~v~~~~~~-----~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 290 ----PTQKNKKLFAGTNPVVVVSNR-----FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred ----ccCCccccccCcccEEEeccc-----ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 111234677776677654432 1122333333444556554321 111122332 233445 Q ss_pred eeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 434 ILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 434 ~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -...|+|.. 
+.++ ..|+ ++++...-+ T Consensus 361 r~~~r~d~~--v~~~----~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 361 RAIQRDDVQ--MWDN----EAAVYGEIDLSAPV 387 (392) T ss_pred EEEEeeccE--Eecc----cceEEEEecccccc Confidence 666777752 2222 3344 344443333 No 83 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=82.65 E-value=0.075 Score=26.75 Aligned_cols=305 Identities=13% Similarity=0.069 Sum_probs=131.8 Q ss_pred CCc-----------cchhhhHHHhh-hhh-hccc--cccChhhhhheehccccc--hhHHHh---------hhhhhhc-c Q lcl|NC_018861. 1 MAD-----------KYLLDESTKEK-FIT-SNLY--PNLNESEKNIMRTVLENQ--GNEVKM---------LMESTVT-G 53 (465) Q Consensus 1 ~~~-----------~~~~~e~~~e~-~~~-~~~~--~~~~~~~~~~~~~l~~n~--~~~~~~---------i~est~t-~ 53 (465) +.+ +.+.++...++ -.+ .... ....++++....-|.... .+.+.. +...+++ | T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 100 00000000000 000 0000 000112222222221111 011111 1122211 1 Q ss_pred ccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccc Q lcl|NC_018861. 54 DIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTG 131 (465) Q Consensus 54 ~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg 131 (465) .. .. |.-+ .+++....+..-.++++|.||++++|-+.-.+ ..+ .. 
T Consensus 115 g~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~--------------------------- 161 (392) T protein:vir:10 115 GL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK--NSD--MI--------------------------- 161 (392) T ss_pred ce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecC--Cc--------------------------- Confidence 11 11 3222 34455666667778999999999886432111 100 00 Q ss_pred ccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 132 TPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 132 ~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) . +.|.. | T Consensus 162 --------~------a~~v~----------------------------E------------------------------- 168 (392) T protein:vir:10 162 --------P------FAEIT----------------------------E------------------------------- 168 (392) T ss_pred --------c------ceeec----------------------------c------------------------------- Confidence 0 00000 0 Q ss_pred cccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA 290 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l 290 (465) |...++-+ -++++++..++.-+-...+|-||.+|- ..|.+++|.+.|...|...+|..|+.-. T Consensus 169 ------------~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~ 232 (392) T protein:vir:10 169 ------------MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) T ss_pred ------------cccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 00111110 135566666666666678999999994 3678899999999999999999998765 Q ss_pred hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCccccc Q lcl|NC_018861. 
291 NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKID 366 (465) Q Consensus 291 ~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~ 366 (465) ..... ++... .+....++.. .+.+ .+-..-..|+++.....|+.. |-..+.|. T Consensus 233 g~~~~---------~~~~~-~d~i~~~~~~--~l~~------~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~----- 289 (392) T protein:vir:10 233 EKLTK---------QAIKS-LDDIKDVLNV--KLDP------AISPNAILLTNQDGFNYLDKLKDKDGKYILQSD----- 289 (392) T ss_pred ccccc---------cCccC-HHHHHHHHHH--hhhh------hhccCCEEEEcHHHHHHHHHhhccCCCeEeecC----- Confidence 32211 11111 1223332211 1122 222334578999999999775 22222221 Q ss_pred ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc-------ceeeeeCCC------ccccee Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-------LQQNLTDPV------SGQPAM 433 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-------~~~~~~dp~------s~qp~~ 433 (465) ......++|.|...|+++... .++.++...-+..++|+++..+ .+...+++. +.+=.+ T Consensus 290 ----~~~~~~~tllG~~~v~~~~~~-----~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 290 ----PTQKNKKLFAGTNPVVVVSNR-----FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred ----ccCCccccccCcccEEEeccc-----ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 111234677776677654432 1122333333444556554321 111122332 233445 Q ss_pred eeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 434 ILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 434 ~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -...|+|.. 
+.++ ..|+ ++++...-+ T Consensus 361 r~~~r~d~~--v~~~----~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 361 RAIQRDDVQ--MWDN----EAAVYGEIDLSAPV 387 (392) T ss_pred EEEEeeccE--Eecc----cceEEEEecccccc Confidence 666777752 2222 3344 344443333 No 84 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=82.65 E-value=0.075 Score=26.75 Aligned_cols=305 Identities=13% Similarity=0.069 Sum_probs=131.8 Q ss_pred CCc-----------cchhhhHHHhh-hhh-hccc--cccChhhhhheehccccc--hhHHHh---------hhhhhhc-c Q lcl|NC_018861. 1 MAD-----------KYLLDESTKEK-FIT-SNLY--PNLNESEKNIMRTVLENQ--GNEVKM---------LMESTVT-G 53 (465) Q Consensus 1 ~~~-----------~~~~~e~~~e~-~~~-~~~~--~~~~~~~~~~~~~l~~n~--~~~~~~---------i~est~t-~ 53 (465) +.+ +.+.++...++ -.+ .... ....++++....-|.... .+.+.. +...+++ | T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 100 00000000000 000 0000 000112222222221111 011111 1122211 1 Q ss_pred ccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccc Q lcl|NC_018861. 54 DIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTG 131 (465) Q Consensus 54 ~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg 131 (465) .. .. |.-+ .+++....+..-.++++|.||++++|-+.-.+ ..+ .. 
T Consensus 115 g~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~--------------------------- 161 (392) T protein:vir:10 115 GL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK--NSD--MI--------------------------- 161 (392) T ss_pred ce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecC--Cc--------------------------- Confidence 11 11 3222 34455666667778999999999886432111 100 00 Q ss_pred ccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 132 TPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 132 ~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) . +.|.. | T Consensus 162 --------~------a~~v~----------------------------E------------------------------- 168 (392) T protein:vir:10 162 --------P------FAEIT----------------------------E------------------------------- 168 (392) T ss_pred --------c------ceeec----------------------------c------------------------------- Confidence 0 00000 0 Q ss_pred cccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA 290 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l 290 (465) |...++-+ -++++++..++.-+-...+|-||.+|- ..|.+++|.+.|...|...+|..|+.-. T Consensus 169 ------------~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~ 232 (392) T protein:vir:10 169 ------------MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) T ss_pred ------------cccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 00111110 135566666666666678999999994 3678899999999999999999998765 Q ss_pred hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCccccc Q lcl|NC_018861. 
291 NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKID 366 (465) Q Consensus 291 ~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~ 366 (465) ..... ++... .+....++.. .+.+ .+-..-..|+++.....|+.. |-..+.|. T Consensus 233 g~~~~---------~~~~~-~d~i~~~~~~--~l~~------~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~----- 289 (392) T protein:vir:10 233 EKLTK---------QAIKS-LDDIKDVLNV--KLDP------AISPNAILLTNQDGFNYLDKLKDKDGKYILQSD----- 289 (392) T ss_pred ccccc---------cCccC-HHHHHHHHHH--hhhh------hhccCCEEEEcHHHHHHHHHhhccCCCeEeecC----- Confidence 32211 11111 1223332211 1122 222334578999999999775 22222221 Q ss_pred ccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc-------ceeeeeCCC------ccccee Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-------LQQNLTDPV------SGQPAM 433 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-------~~~~~~dp~------s~qp~~ 433 (465) ......++|.|...|+++... .++.++...-+..++|+++..+ .+...+++. +.+=.+ T Consensus 290 ----~~~~~~~tllG~~~v~~~~~~-----~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 290 ----PTQKNKKLFAGTNPVVVVSNR-----FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred ----ccCCccccccCcccEEEeccc-----ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 111234677776677654432 1122333333444556554321 111122332 233445 Q ss_pred eeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 434 ILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 434 ~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -...|+|.. 
+.++ ..|+ ++++...-+ T Consensus 361 r~~~r~d~~--v~~~----~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 361 RAIQRDDVQ--MWDN----EAAVYGEIDLSAPV 387 (392) T ss_pred EEEEeeccE--Eecc----cceEEEEecccccc Confidence 666777752 2222 3344 344443333 No 85 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=80.65 E-value=0.093 Score=26.24 Aligned_cols=285 Identities=12% Similarity=0.046 Sum_probs=126.2 Q ss_pred hccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEE Q lcl|NC_018861. 18 SNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVP 96 (465) Q Consensus 18 ~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRS 96 (465) +--....+ .+++.+...+++.+... .-|.+. .+++.+-+..+..+++.|.||++++.-|- T Consensus 1 ~~~~~~~~--------------~e~~~~~~~~~~~~~~~-ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip---- 61 (318) T protein:vir:24 1 MAAGTAFA--------------VDHAQIAQTGDTMFKGY-LEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIP---- 61 (318) T ss_pred CCCCCCCC--------------HHHHHhhcccCccccee-echhHHHHHHHHHHhhchhhhhcceeeccCCceEEE---- Confidence 11111111 23333333332222221 222222 35566667777888889999987653221 Q ss_pred EecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccc Q lcl|NC_018861. 97 HYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVA 176 (465) Q Consensus 97 rY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea 176 (465) +... +.. +.|. 
T Consensus 62 ~~~~-~~~------------------------------------------a~~v-------------------------- 72 (318) T protein:vir:24 62 HWVG-DVS------------------------------------------AQWI-------------------------- 72 (318) T ss_pred EEeC-Ccc------------------------------------------eEEe-------------------------- Confidence 1100 000 0000 Q ss_pred cccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHH Q lcl|NC_018861. 177 IGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQ 256 (465) Q Consensus 177 ~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQ 256 (465) +| |.++++...++++++.+.|..+-...+|-||.+ T Consensus 73 -------------------------------------------~E--g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ 107 (318) T protein:vir:24 73 -------------------------------------------GE--GDMKPITKGNMTSQTIAPHKIATIFVASAETVR 107 (318) T ss_pred -------------------------------------------cC--CccccccccceeEEEEeeEEEEEeehhhHHHhh Confidence 01 123344444577777777777778899999999 Q ss_pred HHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhh-------eeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018861. 257 DLKAQHGINAEKELADILSAEVALEIDRTIIEKANE-------VATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIG 329 (465) Q Consensus 257 DLkAiHGlDAe~EL~niLstEImlEINreii~~l~~-------~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~ 329 (465) |-. .|.+++|.+.|+..|...||+.+|..-.. ..+.............+.-+....++.. +. T Consensus 108 ds~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~ 176 (318) T protein:vir:24 108 ANP----ANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVAVNGLSL-------LV 176 (318) T ss_pred cCh----HHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchHHHHHHHHHHh-------hc Confidence 843 68999999999999999999999864311 0010000000011111221122222221 11 Q ss_pred HhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccceEE-EEecCceEEEEeCCCCcceEEEEEecCC Q lcl|NC_018861. 
330 RQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKPNV-GKFDNRYDVIVDNFAEFDYCTVAYKGAS 404 (465) Q Consensus 330 ~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~~~-G~l~~~~~vy~d~~~~~dy~~vg~kg~~ 404 (465) -.......++++++....|+.. |-..+.|... . .....+- +.+ .+++|++.+..+..-.++ +-|+- T Consensus 177 --~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~---~--~~~~~~~~~~i-~g~pv~~~~~~~~~~~~~-~~gdf 247 (318) T protein:vir:24 177 --NDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTY---G--EAASPFRSGRI-VARPTILSDHVVEGTTVG-FMGDF 247 (318) T ss_pred --cccCCCCEEEEcHHHHHHHHHhhccCCceeecCccc---c--CccccccCceE-EEEeeEEeCCCCCCccEE-EEeec Confidence 2233556789999999999864 1111111100 0 0111111 222 245777777654322111 11110 Q ss_pred CccceeEEecccccceeee--------eCCCc-----cc---ceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 405 NFDAGIFFAPYNITLQQNL--------TDPVS-----GQ---PAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 405 ~~d~glfy~PY~~~~~~~~--------~dp~s-----~q---p~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +.++|+..-.+...+. .|+.. || =.+=...|++..+ ..+.+ |++ |++-.-. T Consensus 248 ---s~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v--~~~~a----~~~-i~~~~a~ 314 (318) T protein:vir:24 248 ---SQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHC--NDAEA----FVA-LTNVVSG 314 (318) T ss_pred ---ceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEE--ecccc----eEE-EEeeccC Confidence 1133433322222111 11111 21 2222355666632 22322 544 2221111 No 86 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=80.37 E-value=0.096 Score=26.18 Aligned_cols=276 Identities=13% Similarity=0.098 Sum_probs=125.7 Q ss_pred hhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccc Q lcl|NC_018861. 
47 MESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKD 125 (465) Q Consensus 47 ~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea 125 (465) +-..++|.+. .-+.+. .++.++.++-+-.+++-|-||.+..- +|....+. T Consensus 1 mat~~~gg~l-vP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~-------~~p~~~~~--------------------- 51 (311) T protein:vir:81 1 MVALATGTFQ-LPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ-------QYMTLTAP--------------------- 51 (311) T ss_pred CceecCCceE-cchhHHHHHHHHHHhcchhhhhcceeecCCCce-------EEEEEeCC--------------------- Confidence 3333343331 112222 45677777778888898888865431 12100000 Q ss_pred ccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccc Q lcl|NC_018861. 126 DFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWL 205 (465) Q Consensus 126 ~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~ 205 (465) ..+.|. T Consensus 52 -------------------~~a~wv------------------------------------------------------- 57 (311) T protein:vir:81 52 -------------------PRGEVV------------------------------------------------------- 57 (311) T ss_pred -------------------ceeEEe------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHH Q lcl|NC_018861. 206 KVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRT 285 (465) Q Consensus 206 ~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINre 285 (465) +| |..+++...++++++..+|.=+-....|-||.|+-.. 
-.++.|++|.+-|+..|...|+.- T Consensus 58 --------------~E--g~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a 120 (311) T protein:vir:81 58 --------------GE--GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLI 120 (311) T ss_pred --------------ec--CcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHh Confidence 01 1223333345566666666555666899999875432 245677888888999999999888 Q ss_pred HHhhhhhee-e----------e-eeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc Q lcl|NC_018861. 286 IIEKANEVA-T----------V-CTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI 353 (465) Q Consensus 286 ii~~l~~~a-t----------~-~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~ 353 (465) +|.-..... + . ........ .+.. ....-|..+-..+.. .+...+..+.+++....|+.. T Consensus 121 ~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~-~~~~------~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~l 191 (311) T protein:vir:81 121 GIHGINPLTGAALSGSPAKILDTTNIVELTT-GTSA------TPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQ 191 (311) T ss_pred hhccccCCCCcccccccccccccceeeeecc-cccc------hHHHHHHHHHHHhhh--cCCCceEEEEcHHHHHHHHhh Confidence 886531100 0 0 00001100 1111 111223333333322 234677789999999999764 Q ss_pred CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEE------EEEecCCC-----cc-ceeEEeccccccee Q lcl|NC_018861. 354 GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCT------VAYKGASN-----FD-AGIFFAPYNITLQQ 421 (465) Q Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~------vg~kg~~~-----~d-~glfy~PY~~~~~~ 421 (465) . ..++...-.........|+|. |++|+++.+-+..-.. +...+... .| +.+++.......+. 
T Consensus 192 k-----d~~G~~l~~~~~~~~~~~tl~-G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~ 265 (311) T protein:vir:81 192 R-----DSQGRKLYPELGFGTDVASFA-GLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLE 265 (311) T ss_pred h-----ccCCCeeecCccccCCCceec-ceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEE Confidence 1 111111111111122346774 5788887765432211 11111111 11 22333333334444 Q ss_pred eeeCCCc------cc-ceeeee--eeeee-eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 422 NLTDPVS------GQ-PAMILN--NRYDV-VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 422 ~~~dp~s------~q-p~~~~~--tRY~l-~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) +..|.+. || -.++|. .|+|. +.+| .+ |++ |++-.-+ T Consensus 266 ~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~---~a----~~~-l~~a~~~ 311 (311) T protein:vir:81 266 LIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST---DA----FAV-VRDADES 311 (311) T ss_pred EeccCCCCcchhhhhcCcEEEEEEEEeccEeecc---cc----eEE-EEeeccC Confidence 4333222 21 124443 56664 4454 22 221 2232233 No 87 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=80.13 E-value=0.098 Score=26.12 Aligned_cols=277 Identities=12% Similarity=0.084 Sum_probs=132.8 Q ss_pred hhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 46 i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) ++-+ +++.+ ...|.+. .++.++.+..+..+++.|.||++.+.-|. ++.. +.. 
T Consensus 1 m~t~-t~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~-~~~-------------------- 53 (303) T protein:vir:97 1 MGTE-TSKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTL-DSD-------------------- 53 (303) T ss_pred Cccc-CCCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEec-Ccc-------------------- Confidence 3322 23322 2334444 56677778888999999999886554331 1110 000 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcc Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALW 204 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~ 204 (465) +.|. T Consensus 54 ----------------------a~wv------------------------------------------------------ 57 (303) T protein:vir:97 54 ----------------------IDVV------------------------------------------------------ 57 (303) T ss_pred ----------------------eEEe------------------------------------------------------ Confidence 0000 Q ss_pred ccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhH Q lcl|NC_018861. 205 LKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDR 284 (465) Q Consensus 205 ~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINr 284 (465) +| |..+++-..+++.++..+|.-+-....|-||.|.... ..++.+++|.+-|+..|...|+. T Consensus 58 ---------------~E--~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~ 119 (303) T protein:vir:97 58 ---------------AE--NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDL 119 (303) T ss_pred ---------------ec--CccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHh Confidence 00 1122233334566666666666677899999864322 24677899999999999999999 Q ss_pred HHHhhhhhe----eeeeeeeeeccCCcccHHH--HHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccc Q lcl|NC_018861. 
285 TIIEKANEV----ATVCTDFDVNSADGRWFIE--KARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVL 358 (465) Q Consensus 285 eii~~l~~~----at~~~~~~~~~~~~~~~~e--~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~ 358 (465) .+|.-.... ........+....+ ...+ ....++..|.++-+.+.. ..+..+.+|++++....|...- + T Consensus 120 a~l~G~~~~~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk--d- 193 (303) T protein:vir:97 120 MAMHGINPRTKKASDVIGTNHFDSKVT-QVVKFTESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTALAKVT--N- 193 (303) T ss_pred hhhcccccCCccccccccccccccccc-cccccccccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhh--c- Confidence 998754210 00000001110000 0000 011122233333333322 2245667999999999886431 0 Q ss_pred cCCcccccccccccceEEEEecCceEEEEeCCCCcc--------eEEEEEecCCCccceeEEecccccceeee--eCCCc Q lcl|NC_018861. 359 SPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD--------YCTVAYKGASNFDAGIFFAPYNITLQQNL--TDPVS 428 (465) Q Consensus 359 ~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d--------y~~vg~kg~~~~d~glfy~PY~~~~~~~~--~dp~s 428 (465) ..+.....++-....-.|+|. |++|+++.+-+.. .+++| + ....+.+........++. .|++. T Consensus 194 -~~g~~~~~~~~~~~~~~~~l~-G~Pv~~s~~v~~~~~~~~~~~~~~~G---d--f~~~~~~~~~~~~~~~~~~~~~~d~ 266 (303) T protein:vir:97 194 -GEMGPKMYPELAWGANPDSIN-GLKSSVNTTVGAGADEAESKDLVIIG---D--FESMFKWGYAKQIPMEIIKYGDPDN 266 (303) T ss_pred -cCCCeEEecCccCCCCCceec-ceeeEEecccCCccccCCCccEEEEe---e--ccccEEEEEecCcEEEEeeccCCCC Confidence 011111111101111235675 5899988775421 12222 1 111122332222222221 13221 Q ss_pred -----cc-ceeee--eeeeee-eecCcccccccceEEEeeccce Q lcl|NC_018861. 429 -----GQ-PAMIL--NNRYDV-VATPLHPEAFIRTFAVNLNNYI 463 (465) Q Consensus 429 -----~q-p~~~~--~tRY~l-~~nPf~~~~~~~~f~~~~~~~~ 463 (465) |+ -.++| ..||+. 
+.|| .-|++-.+..+ T Consensus 267 ~~~~~~~~n~~~~r~~~r~~~~v~~p-------~af~~l~~~~~ 303 (303) T protein:vir:97 267 SGKDLKGYNQIYLRAEAYIGWGILDA-------KSFARVTKGEV 303 (303) T ss_pred cchhhhhcCcEEEEEEEEeccEeecc-------cceEEeeCCCC Confidence 22 23445 567777 4454 55777677777 No 88 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=80.01 E-value=0.099 Score=26.09 Aligned_cols=290 Identities=11% Similarity=0.044 Sum_probs=133.7 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhhhccccccccch-hh-hhhhhhhhhhhhhhh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPI-LV-PVIRRALPSLIGTEI 78 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~-l~-~l~~ra~~~lI~~DI 78 (465) |-+++..++.+..-|....+.+.++ +.... .++++... =|. +. -++..+..+.+..++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~-----------------a~~~~-~~~~~~~~--iP~~~~~~ii~~~~~~s~l~~~ 60 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFN-----------------PDNVM-MHEKKDGT--LMNEFTTPILQEVMENSKIMQL 60 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhc-----------------ccccc-ccCCCcce--echhHHHHHHHHHHhhcchhhh Confidence 9888888888766554444433322 11110 11112111 122 22 355677777788888 Q ss_pred eeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_018861. 79 AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTD 158 (465) Q Consensus 79 wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPT 158 (465) +-+.||++.+--|- | +.. .. . +.|. 
T Consensus 61 ~~~~~~~~~~~~ip--~--~~~--~~-----------------------------------~------a~~v-------- 85 (324) T protein:vir:97 61 GKYEPMEGTEKKFT--F--WAD--KP-----------------------------------G------AYWV-------- 85 (324) T ss_pred cceeeccCCceEEE--E--Eec--Cc-----------------------------------c------eeEe-------- Confidence 88999887662221 1 100 00 0 0000 Q ss_pred hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEE Q lcl|NC_018861. 159 NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVL 238 (465) Q Consensus 159 gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~t 238 (465) +| |..+++...++++++ T Consensus 86 -------------------------------------------------------------~E--g~~~~~~~~~f~~v~ 102 (324) T protein:vir:97 86 -------------------------------------------------------------GE--GQKIETSKATWVNAT 102 (324) T ss_pred -------------------------------------------------------------cc--CccccccccceeEEE Confidence 01 122333344566666 Q ss_pred EEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeee-eec----cCCcccHHHH Q lcl|NC_018861. 239 AEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDF-DVN----SADGRWFIEK 313 (465) Q Consensus 239 VtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~-~~~----~~~~~~~~e~ 313 (465) .++|.-+--..+|-||.+|-. .|.+++|.+-|+..|...+++.+|..--.......-+ ... ...+... T Consensus 103 ~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~--- 175 (324) T protein:vir:97 103 MRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFT--- 175 (324) T ss_pred EeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCC--- Confidence 666666666779999999863 6789999999999999999999986431110000000 000 0011111 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCc Q lcl|NC_018861. 
314 ARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEF 393 (465) Q Consensus 314 ~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~ 393 (465) +. -|+++...+.. .+.....+++++.....|....- +.+..... . .-.|+| .|++|++.+..+. T Consensus 176 ~~----~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lkd----~~g~~~~~-~----~~~~tl-~G~PV~~~~~~~~ 239 (324) T protein:vir:97 176 QD----NIIDLEALLED--DELEANAFISKTQNRSLLRKIVD----PETKERIY-D----RNSDTL-DGLPVVNLKSSNL 239 (324) T ss_pred HH----HHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc----CCCceeec-C----CCCccc-cceeeEeecCCCC Confidence 12 23333333332 22345578999999999985421 11111110 0 112455 4567877665332 Q ss_pred --ceEEEEEecCCCccceeEEecccccceeeeeCCCc--------------cc-ceeee--eeeeeeeecCcccccccce Q lcl|NC_018861. 394 --DYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVS--------------GQ-PAMIL--NNRYDVVATPLHPEAFIRT 454 (465) Q Consensus 394 --dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s--------------~q-p~~~~--~tRY~l~~nPf~~~~~~~~ 454 (465) ..+++|-. +.++++........ ..|... || -.++| ..||+..+ ..+.+ T Consensus 240 ~~~~~~~gd~------~~~~i~~~~~~~i~-~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v--~~~~a---- 306 (324) T protein:vir:97 240 KRGELITGDF------DKLIYGIPQLIEYK-IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI--ADDKA---- 306 (324) T ss_pred CcceEEEEec------ccEEEEEecCcEEE-EeecccccccccccccchhhhhcCcEEEEEEEEeccEE--ecccc---- Confidence 22333311 11122211111111 111111 11 22332 35665522 12232 Q ss_pred EEEeeccceeC Q lcl|NC_018861. 455 FAVNLNNYIIS 465 (465) Q Consensus 455 f~~~~~~~~~~ 465 (465) |++ +++..-. 
T Consensus 307 ~~~-l~~~~~~ 316 (324) T protein:vir:97 307 FAK-LVPADKK 316 (324) T ss_pred eEE-EEeccCC Confidence 442 2222111 No 89 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=79.73 E-value=0.1 Score=26.03 Aligned_cols=273 Identities=13% Similarity=0.125 Sum_probs=128.4 Q ss_pred hhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 46 i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) .+++++++... .-|.+. .++.++.+..+-.+++.+.||.+-..-+. . +.. +.. T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p-~---~~~-~~~-------------------- 54 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREF-V---FDF-DSD-------------------- 54 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCceEEE-E---Eec-Ccc-------------------- Confidence 66666555443 344444 45555666667788999999876432111 1 100 000 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcc Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALW 204 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~ 204 (465) +.|. T Consensus 55 ----------------------a~wv------------------------------------------------------ 58 (300) T protein:vir:95 55 ----------------------IDIV------------------------------------------------------ 58 (300) T ss_pred ----------------------eEEe------------------------------------------------------ Confidence 0000 Q ss_pred ccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhH Q lcl|NC_018861. 
205 LKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDR 284 (465) Q Consensus 205 ~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINr 284 (465) +| |...++...++++++.++|.-+-....|-||.+.... ..+|.+++|.+-|...|...+++ T Consensus 59 ---------------~E--g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~ 120 (300) T protein:vir:95 59 ---------------AE--NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDI 120 (300) T ss_pred ---------------eC--CcccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHH Confidence 01 1233444445677777777777778899998753322 24678899999999999999999 Q ss_pred HHHhhhhhe-ee----eee-eee---eccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCc Q lcl|NC_018861. 285 TIIEKANEV-AT----VCT-DFD---VNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGS 355 (465) Q Consensus 285 eii~~l~~~-at----~~~-~~~---~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~ 355 (465) .+|.-.... .+ .+. .++ .....+.. ...+.-|.++-..+.. .++..+..|++++....|....- T Consensus 121 ~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lkd 193 (300) T protein:vir:95 121 MSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKD-----TNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSKMKN 193 (300) T ss_pred hhhhcccCCCCCCcccccccccccccceeecccc-----cchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHHhhc Confidence 999653210 00 000 000 00011111 1112223333332222 23466678999999998865421 Q ss_pred ccccCCcccccccccccceEEEEecCceEEEEeCCCCc------ceEEEEEecCCCccceeEEecccccceeeee--CCC Q lcl|NC_018861. 356 FVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEF------DYCTVAYKGASNFDAGIFFAPYNITLQQNLT--DPV 427 (465) Q Consensus 356 ~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~------dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~--dp~ 427 (465) - .+..+- .++......++| .|++|++++..+. +.+++|= +.-++++.......+++.. 
|++ T Consensus 194 ~----~G~~i~-~~~~~~~~~~~l-~G~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d 262 (300) T protein:vir:95 194 A----EGGKLY-PELAWGGVPDAI-NGLAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPD 262 (300) T ss_pred c----CCCeec-cCccccCCCcee-cceeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEeeccCCC Confidence 1 111110 111111234677 4579998887543 2233321 1112223322222333221 222 Q ss_pred c-----c-cceeee--eeeeee-eecCcccccccceEEEeecc Q lcl|NC_018861. 428 S-----G-QPAMIL--NNRYDV-VATPLHPEAFIRTFAVNLNN 461 (465) Q Consensus 428 s-----~-qp~~~~--~tRY~l-~~nPf~~~~~~~~f~~~~~~ 461 (465) . | +..++| ..|+|. +.+| .+..+ .+.--| T Consensus 263 ~~~~~~f~~~~v~~r~~~r~d~~v~~~---~a~~~--l~~~~g 300 (300) T protein:vir:95 263 NSGRDLKGYNQIYIRCEAYIGWGIMDA---ASFAR--IVKTGG 300 (300) T ss_pred CcchhhhhcCcEEEEEEEeecceeecc---cceEE--EecCCC Confidence 1 2 122443 446665 3454 21111 012222 No 90 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=78.55 E-value=0.11 Score=25.77 Aligned_cols=265 Identities=13% Similarity=0.007 Sum_probs=117.0 Q ss_pred ccccccccccccchhhhheeeeeccCc--cccccccccccccccccCCccCCCcccccCccccccccccccccchhhhcc Q lcl|NC_018861. 146 GKIVYSEKQAGTDNIVNVLLRLESNST--GSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKL 223 (465) Q Consensus 146 gait~~~~~TGPTgLifam~s~y~~~~--g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~l 223 (465) .+.+.....--|.- |+ .|.... ....+.+=+.......+ ..+....-. .++. .|-+....|.. 
T Consensus 1 Ma~T~~~d~I~Pev--~~---~~V~e~~~~~~~~~~~~~~d~~L~g----~~G~ti~~P-----~~~~-igdae~~~eg~ 65 (270) T protein:vir:95 1 MTQTKKANLINPEV--LA---NVVSAQMQNAIRFTPYAVTDDTLVG----QPGDTITRP-----KYAY-IGAAEDLQEGV 65 (270) T ss_pred CCceehhhhcchHH--HH---HHHHHHHHhHHhhccccccccccCC----CCCCEEEee-----eecC-CCccccccCCC Confidence 11111111111211 11 111000 00001010000000000 000000000 0111 11122222322 Q ss_pred CCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh-CCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee Q lcl|NC_018861. 224 GKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH-GINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV 302 (465) Q Consensus 224 g~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~ 302 (465) .-+..+++ ..+.+++.|-|+-.=++| ||.+.- |-|.-.|..+-++.-|..+++.++|..|... .... T Consensus 66 ~i~~~~lt--~~~~~a~i~~~gk~~~it-----D~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a-----~~~~ 133 (270) T protein:vir:95 66 AMDTTQMS--MTTTKVTVKETGKAVEVT-----QTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKS-----KQTA 133 (270) T ss_pred ccchhhcc--cchheeeeehhhCcceec-----HHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccc-----cccc Confidence 22345554 456666667776545555 444433 4699999999999999999999999988432 1111 Q ss_pred ccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCc Q lcl|NC_018861. 303 NSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNR 382 (465) Q Consensus 303 ~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~ 382 (465) +.. . .+..+...+..+..+ ...-++++|.|.+++.|+...++++...+++... 
...+|++ .| T Consensus 134 ~~~---~---t~~~~~dA~~~lgd~------~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~-----~G~ig~~-~G 195 (270) T protein:vir:95 134 TVS---A---DATGILDAIEVFNSE------NDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAIS-----KGDLVEI-VG 195 (270) T ss_pred ccc---c---CHHHHHHHHHHhccc------cCCCcEEEEcHHHHHHHHhhhcccccccccchhc-----cccccee-cc Confidence 110 0 122222222222221 2356789999999999999888876544443322 2357776 46 Q ss_pred eEEEEeCCCCcceEEEEEe-cCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccccceEEEeec Q lcl|NC_018861. 383 YDVIVDNFAEFDYCTVAYK-GASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFAVNLN 460 (465) Q Consensus 383 ~~vy~d~~~~~dy~~vg~k-g~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~~~~~ 460 (465) ++|++|...+.+|-..-++ |+-. |+-.=.+. .-.--|+..++-.+--..+|++ ..||= ..=..||+ -. T Consensus 196 ~~Viv~s~~~~~~~~~l~~~gAi~-----~~~~~~~~-vEtdRd~~~~~d~i~~~~~y~v~~~~~s--kvv~~t~~--~a 265 (270) T protein:vir:95 196 VSDIVKSKRVSENTAFLQRYGAME-----IVNKKKPE-AYTDFDILKRTHLLSTNYHYSVNLKDET--GVVKVTFK--PS 265 (270) T ss_pred eeEEEeCCCCCceeEEEEecccee-----eeecCCce-eeeccchhhcccEEEeeeEEEEEEEccc--eEEEEEec--CC Confidence 8999998887777333222 1111 11100001 0112277777777666666766 33321 00112332 11 Q ss_pred cceeC Q lcl|NC_018861. 461 NYIIS 465 (465) Q Consensus 461 ~~~~~ 465 (465) |+--- T Consensus 266 ~~~~~ 270 (270) T protein:vir:95 266 GSLEM 270 (270) T ss_pred CCcCC Confidence 11100 No 91 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=77.79 E-value=0.12 Score=25.61 Aligned_cols=291 Identities=12% Similarity=0.050 Sum_probs=132.5 Q ss_pred CCccchhhhHHHhhhhh-hccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhh Q lcl|NC_018861. 
1 MADKYLLDESTKEKFIT-SNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEI 78 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~-~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DI 78 (465) |-+++..+.++++ |.. ....+.++ +.... ++.+++. ..-+.+. .++..+..+.+-.++ T Consensus 1 ~~k~~~~~~~~~~-~~~~~~~~~~~~-----------------a~~~~-~~~~~~~-lip~~~~~~ii~~~~~~s~l~~~ 60 (324) T protein:vir:99 1 MEQTQKLKLNLQH-FASNNVKPQVFN-----------------PDNVM-MHEKKDG-TLLNDFTTPILQEVMENSKIMRL 60 (324) T ss_pred CCCchHhhHHHHH-HHHHhhhhhhcc-----------------cccee-ccCCCcc-eechhHHHHHHHHHHhhchhhhh Confidence 7777666655433 221 11111111 11111 1111111 1111122 344556666677788 Q ss_pred eeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_018861. 79 AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTD 158 (465) Q Consensus 79 wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPT 158 (465) +.+.||++.+.-|. ++.. +.. + .| T Consensus 61 ~~~~~~~~~~~~~p----~~~~-~~~------------------------------------a------~~--------- 84 (324) T protein:vir:99 61 GKYEPMEGTEKKFT----FWAD-KPG------------------------------------A------YW--------- 84 (324) T ss_pred cceeeccCCceEEE----EEec-Ccc------------------------------------e------eE--------- Confidence 88888887652221 1100 000 0 00 Q ss_pred hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEE Q lcl|NC_018861. 
159 NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVL 238 (465) Q Consensus 159 gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~t 238 (465) .+| |..+++...++++++ T Consensus 85 ------------------------------------------------------------v~E--g~~~~~~~~~~~~v~ 102 (324) T protein:vir:99 85 ------------------------------------------------------------VGE--GQKIETSKATWVNAT 102 (324) T ss_pred ------------------------------------------------------------ecc--CccccccccceeEEE Confidence 001 233445555677788 Q ss_pred EEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeee-----eeeeeccCCcccHHHH Q lcl|NC_018861. 239 AEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVC-----TDFDVNSADGRWFIEK 313 (465) Q Consensus 239 VtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~-----~~~~~~~~~~~~~~e~ 313 (465) ++.|.-+---..|-||.+|-. .|.+++|.+.|+..|...+++.+|..--...... .........+.- . T Consensus 103 ~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~---~ 175 (324) T protein:vir:99 103 MRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF---T 175 (324) T ss_pred EeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccC---C Confidence 888877777889999999974 5789999999999999999999985421100000 000000001111 1 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCc Q lcl|NC_018861. 314 ARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEF 393 (465) Q Consensus 314 ~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~ 393 (465) +.. |.++-..+. ..+.....+++++.....|+...- +.++... ... --++| .|++|++.+..+. 
T Consensus 176 ~~~----i~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~l~d----~~g~~~~-~~~----~~~~l-~G~PVv~~~~~~~ 239 (324) T protein:vir:99 176 QDN----IIDLEALLE--DDELEANAFISKTQNRSLLRKIVD----PETKERI-YDR----NSDTL-DGLPVVNLKSSNL 239 (324) T ss_pred HHH----HHHHHHhhh--hccCCCCEEEEcHHHHHHHHHhhc----CCCceee-cCC----CCccc-cceeEEeecCCCC Confidence 222 222333332 233466678999999999986421 1111111 000 11455 3468888776532 Q ss_pred --ceEEEEEecCCCccceeEEecccccceeeeeC--------CC----c-c-cceeee--eeeeeeeecCcccccccceE Q lcl|NC_018861. 394 --DYCTVAYKGASNFDAGIFFAPYNITLQQNLTD--------PV----S-G-QPAMIL--NNRYDVVATPLHPEAFIRTF 455 (465) Q Consensus 394 --dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~d--------p~----s-~-qp~~~~--~tRY~l~~nPf~~~~~~~~f 455 (465) ..+++|-. +.++++.-......+.-+ +. + | +-.+.| ..|++.. +..+.+ | T Consensus 240 ~~~~~i~gd~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~--v~~~~a----~ 307 (324) T protein:vir:99 240 KRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH--IADDKA----F 307 (324) T ss_pred CcceEEEEec------ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccE--Eecccc----e Confidence 23443321 111222212222221111 11 0 1 122333 3556542 223333 5 Q ss_pred EEeeccceeC Q lcl|NC_018861. 456 AVNLNNYIIS 465 (465) Q Consensus 456 ~~~~~~~~~~ 465 (465) ++ |++..-. T Consensus 308 ~~-lt~a~~~ 316 (324) T protein:vir:99 308 AK-LVPADKK 316 (324) T ss_pred EE-EEeccCC Confidence 43 3333333 No 92 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=77.68 E-value=0.12 Score=25.59 Aligned_cols=310 Identities=12% Similarity=0.060 Sum_probs=125.7 Q ss_pred CCccchhhhH--HHhhhhhhccccccC----h----hhhhheehccccc----hhHHHhhhhhh-hccccccccchhh-- Q lcl|NC_018861. 
1 MADKYLLDES--TKEKFITSNLYPNLN----E----SEKNIMRTVLENQ----GNEVKMLMEST-VTGDIAKFTPILV-- 63 (465) Q Consensus 1 ~~~~~~~~e~--~~e~~~~~~~~~~~~----~----~~~~~~~~l~~n~----~~~~~~i~est-~t~~v~~~~P~l~-- 63 (465) .-++.+.+++ ..+.+.+.- ....+ + ..+....-+..+. ..+.+-+..++ ..|.+. . |.-+ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~-v-P~~~~~ 134 (408) T protein:vir:74 58 ALREQLVEAQAEQVVNMREEE-KGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLT-I-PQDIRT 134 (408) T ss_pred HHHHHHHHHHHHHHhhccccc-cccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCcee-e-chhHhh Confidence 1111111110 001110000 00000 0 1111111111110 00111111111 111111 1 2222 Q ss_pred hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccccccccc Q lcl|NC_018861. 64 PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATT 143 (465) Q Consensus 64 ~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt 143 (465) .++..+-++....++++++||++.+|-+--.| ....+. .+ T Consensus 135 ~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~------------------------------------~~-- 174 (408) T protein:vir:74 135 MINTLVRQYDSLQQYVRVESVSTSSGSRVYEK--WTDVTP------------------------------------LK-- 174 (408) T ss_pred HHHHHHhhhcchhhhcceeeccCCcceEEEEe--ecCCcc------------------------------------cc-- Confidence 45556666777889999999999887654333 100000 00 Q ss_pred ccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhcc Q lcl|NC_018861. 144 VKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKL 223 (465) Q Consensus 144 ~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~l 223 (465) .+. .. 
T Consensus 175 ----~~v-----------------------------------------------------------------------~E 179 (408) T protein:vir:74 175 ----AMD-----------------------------------------------------------------------EE 179 (408) T ss_pred ----ccc-----------------------------------------------------------------------cc Confidence 000 00 Q ss_pred CCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee Q lcl|NC_018861. 224 GKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV 302 (465) Q Consensus 224 g~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~ 302 (465) |...++.+ -+++++++..+.-+-...+|-||.+| ..+|.++.|.+-|+..|..-+|+.||.-.-.... T Consensus 180 ~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d----s~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~------- 248 (408) T protein:vir:74 180 DGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD----TAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK------- 248 (408) T ss_pred ccccccccccceeeEEeeeeeEEeeehhHHHHHhh----chHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------- Confidence 01122221 24666777777777778899999998 2467889999999999999999998865411110 Q ss_pred ccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCc Q lcl|NC_018861. 303 NSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNR 382 (465) Q Consensus 303 ~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~ 382 (465) .... -.+.+|...+. ..+.. .+...-.+||++.....|...- ...+......+ .....-++| .| T Consensus 249 -~~~~----~~~~~i~~~~~---~~l~~--~~~~~a~~v~n~~~~~~l~~lk----d~~G~~l~~~~-~~~~~~~~l-~G 312 (408) T protein:vir:74 249 -KPTI----ANFDDVITMIN---TSVDP--AIIATSSLLTNQSGLNKLALVK----TAEGKYLLEPD-PTKPNSYLI-KG 312 (408) T ss_pred -cccc----ccHHHHHHHHH---Hhhhh--hhcCCCEEEEcHHHHHHHHHhh----cCCCceEeccC-cCCCCCcee-cc Confidence 0011 11233333222 12211 1223345789999999998641 11111111111 001112455 46 Q ss_pred eEEEEeCCCCcceEEEEEecCCCccceeEEeccccc-------ceeeeeCC------Ccccceeeeeeeeee-eecCccc Q lcl|NC_018861. 
383 YDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT-------LQQNLTDP------VSGQPAMILNNRYDV-VATPLHP 448 (465) Q Consensus 383 ~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~-------~~~~~~dp------~s~qp~~~~~tRY~l-~~nPf~~ 448 (465) ++||+-.+..+ +-.+.+ +.-+||+.+... -+...+++ ...+-.+-+..||+. ..+| T Consensus 313 ~pV~~~~~~~~-----~~~~~~--~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~--- 382 (408) T protein:vir:74 313 KQVIVVADRWL-----PNSGST--VYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS--- 382 (408) T ss_pred eeeEEecCccc-----ccccCC--cceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecc--- Confidence 67765332111 000111 111233322110 01111222 244555666677776 3444 Q ss_pred ccc--------cceEEEeeccceeC Q lcl|NC_018861. 449 EAF--------IRTFAVNLNNYIIS 465 (465) Q Consensus 449 ~~~--------~~~f~~~~~~~~~~ 465 (465) .+. ....+.+-+++.-+ T Consensus 383 ~a~~~~~~~~~~~~~~~~~~~~~~~ 407 (408) T protein:vir:74 383 EALVAGSFTAIADQVGNFKTTTSTA 407 (408) T ss_pred cceEEEEeecccCCCCCCCCCcccc Confidence 211 11111222222222 No 93 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=75.87 E-value=0.14 Score=25.24 Aligned_cols=303 Identities=12% Similarity=0.070 Sum_probs=130.7 Q ss_pred CCccchhhh---HHHhhhhhhc---------------------------cccc----cC----hhhhhheehccccchhH Q lcl|NC_018861. 1 MADKYLLDE---STKEKFITSN---------------------------LYPN----LN----ESEKNIMRTVLENQGNE 42 (465) Q Consensus 1 ~~~~~~~~e---~~~e~~~~~~---------------------------~~~~----~~----~~~~~~~~~l~~n~~~~ 42 (465) ..+.....| ++.+.+.-.. .... .+ ..++.-....+...++. 
T Consensus 26 ~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~ 105 (389) T protein:vir:10 26 LQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAKKKAINDFIHSHGKV 105 (389) T ss_pred HHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHHHHHHHHHhhcchhh Confidence 000000000 1111110000 0000 00 00011111111222223 Q ss_pred HHhhhhhhhc-cccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccc Q lcl|NC_018861. 43 VKMLMESTVT-GDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKT 119 (465) Q Consensus 43 ~~~i~est~t-~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~ 119 (465) ++.+.+++++ |.+. =|.-+ .++++.....+-.+++.|.||+++++-+--++ ... ... T Consensus 106 ~~~~~~~t~~~gg~~--vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~---~~~------------- 165 (389) T protein:vir:10 106 IDATSKVTSTEAGVL--IPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILK--RAT---DRF------------- 165 (389) T ss_pred hhhhcccccCCccee--ehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEe--cCC---Ccc------------- Confidence 3334444332 2111 13333 46777888888899999999998875543333 100 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCccc Q lcl|NC_018861. 120 ESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYT 199 (465) Q Consensus 120 a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~ 199 (465) .+.. | T Consensus 166 ----------------------------~~~~----------------------------E------------------- 170 (389) T protein:vir:10 166 ----------------------------SSVA----------------------------E------------------- 170 (389) T ss_pred ----------------------------cccc----------------------------c------------------- Confidence 0000 0 Q ss_pred ccCccccccccccccccchhhhccCCchh-hcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHH Q lcl|NC_018861. 
200 NEALWLKVLKNYTGPYATAAGEKLGKDMK-EMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEV 278 (465) Q Consensus 200 ~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~-EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEI 278 (465) +...+ .-..++++++..+|.-+--..+|-||.+|- ..|.+++|.+.|...+ T Consensus 171 ------------------------~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~ 222 (389) T protein:vir:10 171 ------------------------LAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADS----AVDLTALVGQSIKEKS 222 (389) T ss_pred ------------------------cccccccccccceeeeeeheeeEeeehhhHHHHhhh----hHHHHHHHHHHHHHHH Confidence 00000 001134555555555555677999999984 3478889999999999 Q ss_pred HHHhhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----C Q lcl|NC_018861. 279 ALEIDRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----G 354 (465) Q Consensus 279 mlEINreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~ 354 (465) ..-+|+.|+..+... ....+. .... .+....++.. . ....+ ...+++++.....|... | T Consensus 223 ~~~~~~~i~~g~~~~----~~~~~~--~~~~-~d~l~~~~~~------~--~~~~~--~a~~~~n~~~~~~L~~lkd~~G 285 (389) T protein:vir:10 223 VNTYNAMIAPVLQSF----TAKKTT--TDTL-VDSLKHILNV------D--LDPAY--SRALVVTQSLFNTLDTLKDKNG 285 (389) T ss_pred HHHHHHHHhhhhccc----cccccc--cccc-HHHHHHHHHh------h--hhhhh--CcEEEecHHHHHHHHHhhccCC Confidence 999999998766221 111111 1111 1223332221 0 01111 24678999999999864 3 Q ss_pred cccccCCcccccccccccceEEEEecCceEEEE-eCC-CCcceEEEEEecCCCccceeEEecccc--------cceeeee Q lcl|NC_018861. 355 SFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIV-DNF-AEFDYCTVAYKGASNFDAGIFFAPYNI--------TLQQNLT 424 (465) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~-d~~-~~~dy~~vg~kg~~~~d~glfy~PY~~--------~~~~~~~ 424 (465) ...+.|.... ......-++| .|++||+ +.. .+.+ ..+..++|+.+.- ....... 
T Consensus 286 ~~i~~~~~~~-----~~~~~~~~~l-~G~pV~~~~~~~~~~~----------~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 349 (389) T protein:vir:10 286 RYLLHDASDS-----ITDGTAKGTI-LGVPVYVVGDTLLGSL----------AGDQKAFVGDLKRGVLFTDRQQVTLAWE 349 (389) T ss_pred CeeeecCccc-----cccccccccc-ccceeEEecccccCCC----------CCceEEEEeeccccEEEEeecceEEEee Confidence 3333322211 1111233455 4567664 322 1111 1112233333211 1122233 Q ss_pred CCCcccceeeeeeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 425 DPVSGQPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 425 dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) |-..|.-.+-..-|++.. +.++.+ |. +.++.+--+ T Consensus 350 ~~~~~~~~~~~~~r~d~~--~~~~~a----~~~~~~~~~~~~ 385 (389) T protein:vir:10 350 DSKIYGKYLGAAFRFGVQ--KADSKA----GYFVTNTDVPGS 385 (389) T ss_pred ccccccceEEEEEEeccE--Eecccc----eEEEEeeccCCC Confidence 445566667777788874 233333 33 233333222 No 94 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=74.76 E-value=0.15 Score=25.03 Aligned_cols=291 Identities=11% Similarity=0.036 Sum_probs=131.6 Q ss_pred CCccchhhhHHHhhhhhh-ccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhh Q lcl|NC_018861. 1 MADKYLLDESTKEKFITS-NLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEI 78 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~-~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DI 78 (465) |-+++..+..+++ |..- ...+.++ +....- +.++.. ..-+.+. 
.++..+..+.+-.++ T Consensus 1 ~~~~~~~~~~~~~-f~~~~~~~~~~~-----------------a~~~~~-~~~~~~-liP~~~~~~ii~~~~~~s~l~~~ 60 (324) T protein:vir:10 1 MEQTQKLKLNLQH-FASNNVKPQVFN-----------------PDNVMM-HEKKDG-TLLNDFTTPILQEVMENSKIMQL 60 (324) T ss_pred CCCchHHHHHHHH-HHHHhhccceec-----------------ccceec-cCCCcc-eechhHHHHHHHHHHhhchhhhh Confidence 7666555544333 2211 1111111 111111 111111 0111122 344566666677888 Q ss_pred eeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_018861. 79 AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTD 158 (465) Q Consensus 79 wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPT 158 (465) +-+.||++.+.-|. | ... +.. + .| T Consensus 61 ~~~~~~~~~~~~~p--~--~~~-~~~------------------------------------a------~~--------- 84 (324) T protein:vir:10 61 GKYEPMEGTEKKFT--F--WAD-KPG------------------------------------A------YW--------- 84 (324) T ss_pred cceeeccCCceEEE--E--EeC-Ccc------------------------------------e------eE--------- Confidence 88888887652221 1 100 000 0 00 Q ss_pred hhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEE Q lcl|NC_018861. 159 NIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVL 238 (465) Q Consensus 159 gLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~t 238 (465) .+| |..+++...++++++ T Consensus 85 ------------------------------------------------------------v~E--g~~~~~~~~~~~~v~ 102 (324) T protein:vir:10 85 ------------------------------------------------------------VGE--GQKIETSKATWVNAT 102 (324) T ss_pred ------------------------------------------------------------ecc--CccccccccceeEEE Confidence 001 223444455678888 Q ss_pred EEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeee-----eeeeccCCcccHHHH Q lcl|NC_018861. 
239 AEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCT-----DFDVNSADGRWFIEK 313 (465) Q Consensus 239 VtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~-----~~~~~~~~~~~~~e~ 313 (465) +..|..+-.-..|-||.+|-. .|.+++|.+.|+..|...+++.+|..--....... ........+.-..+. T Consensus 103 ~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:10 103 MRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHH Confidence 888888888899999999864 57899999999999999999999854311100000 000000011111122 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCC- Q lcl|NC_018861. 314 ARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAE- 392 (465) Q Consensus 314 ~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~- 392 (465) ...+ ...+. ..+.....+++++.....|+...-- .+.... ... --++| .|++|++++..+ T Consensus 179 i~~~-------~~~l~--~~~~~~~~~v~n~~~~~~L~~l~d~----~g~~~~-~~~----~~~~l-~G~PV~~~~~~~~ 239 (324) T protein:vir:10 179 IIDL-------EALLE--DDELEANAFISKTQNRSLLRKIVDP----ETKERI-YDR----NSDTL-DGLPVVNLKSSNL 239 (324) T ss_pred HHHH-------HHhhh--hccCCCCEEEEcHHHHHHHHHhhcc----CCceee-cCC----CCccc-cceeEEeecCCCC Confidence 2222 22322 2334666789999999999864211 111111 111 11344 356888877643 Q ss_pred -cceEEEEEecCCCccceeEEecccccceeeeeCC-------Cc------c-cceeeee--eeeeeeecCcccccccceE Q lcl|NC_018861. 393 -FDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDP-------VS------G-QPAMILN--NRYDVVATPLHPEAFIRTF 455 (465) Q Consensus 393 -~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp-------~s------~-qp~~~~~--tRY~l~~nPf~~~~~~~~f 455 (465) ...+++|-. +.++++......+.+.-|. .. | +-.++|. .|+|. 
.+..+.+ | T Consensus 240 ~~~~~~~gd~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~--~v~~~~A----~ 307 (324) T protein:vir:10 240 KRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL--HIADDKA----F 307 (324) T ss_pred CcceEEEEec------ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc--EEecccc----e Confidence 223333321 1122222222222221111 01 1 1223443 45554 2333333 4 Q ss_pred EEeeccceeC Q lcl|NC_018861. 456 AVNLNNYIIS 465 (465) Q Consensus 456 ~~~~~~~~~~ 465 (465) ++ |++..-. T Consensus 308 ~~-l~~a~~~ 316 (324) T protein:vir:10 308 AK-LVPADKK 316 (324) T ss_pred EE-EEeccCC Confidence 43 2332222 No 95 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=74.47 E-value=0.16 Score=24.98 Aligned_cols=311 Identities=11% Similarity=0.044 Sum_probs=128.3 Q ss_pred CCccchhhhHH---Hhhhhhhcccccc------------Chhhhhheehccccc--hhHHHhhhh------hhhcccccc Q lcl|NC_018861. 1 MADKYLLDEST---KEKFITSNLYPNL------------NESEKNIMRTVLENQ--GNEVKMLME------STVTGDIAK 57 (465) Q Consensus 1 ~~~~~~~~e~~---~e~~~~~~~~~~~------------~~~~~~~~~~l~~n~--~~~~~~i~e------st~t~~v~~ 57 (465) +.+..-..+++ .++-..-...... .+..+.....+.+.. ..+...+.+ +++++.-.. T Consensus 67 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l 146 (418) T protein:vir:10 67 LIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSL 146 (418) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccc Confidence 11000000111 1110000000000 001111111111100 001111111 111111111 Q ss_pred ccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccc Q lcl|NC_018861. 58 FTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV 136 (465) Q Consensus 58 ~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~ 136 (465) .-|.+. 
.++....+..+-.+++.+-||++++.-+ .| ....+. T Consensus 147 vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~--~~--~~~~~~--------------------------------- 189 (418) T protein:vir:10 147 VVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEY--TV--ETGFTN--------------------------------- 189 (418) T ss_pred cchhHHHHHHHHHhhhhhHHhhcceeeccCCceeE--EE--EecCCC--------------------------------- Confidence 222222 4566777788888899999998775211 11 000000 Q ss_pred cccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccccc Q lcl|NC_018861. 137 SFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYA 216 (465) Q Consensus 137 s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 216 (465) .+ .|. T Consensus 190 ---~a------~~v------------------------------------------------------------------ 194 (418) T protein:vir:10 190 ---NA------AAV------------------------------------------------------------------ 194 (418) T ss_pred ---ce------eee------------------------------------------------------------------ Confidence 00 000 Q ss_pred chhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh------ Q lcl|NC_018861. 217 TAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA------ 290 (465) Q Consensus 217 Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l------ 290 (465) +| |...++-..++++++..+|.-+-...+|-||.||.- |.++.|.+-|+..|..-+|+-||..- T Consensus 195 ---~E--~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p 264 (418) T protein:vir:10 195 ---AE--GAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP-----ALQSYIDGRARYGLQLTEEGQILKGDGTGANI 264 (418) T ss_pred ---cc--CccccccccceeeEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhccCCCCccc Confidence 00 111222223567777788877777889999999852 57888999999999999998887521 Q ss_pred ---hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccc Q lcl|NC_018861. 
291 ---NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDA 367 (465) Q Consensus 291 ---~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~ 367 (465) ...+.+. .-.....+.. .+..|...+. .+. ..+...+.+||+++....|...- ...+.++. T Consensus 265 ~Gi~~~~~~~-~~~~~~~~~~----~~~~i~~~~~----~~~--~~~~~~~~~v~n~~~~~~L~~lk----d~~G~~i~- 328 (418) T protein:vir:10 265 LGILPQASAF-MPSITLANAT----PIDKIRLALL----QAV--LAEFPATGIVLNPIDWASIELTK----DSQGRYIV- 328 (418) T ss_pred cccccccccc-cccccccccc----cHHHHHHHHH----hhc--cccCCCCEEEEcHHHHHHHHHhh----cCCCceec- Confidence 0001000 0000000111 1222222222 221 23446677999999999997542 11111111 Q ss_pred cccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCc---c-cceeee--eeeeee Q lcl|NC_018861. 368 INSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVS---G-QPAMIL--NNRYDV 441 (465) Q Consensus 368 ~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s---~-qp~~~~--~tRY~l 441 (465) .+.. ..-.|+|. |++|+++++.|.+-+++|---. .++. +.-..+...+|+.. | .-.+.| ..|++. T Consensus 329 ~~~~-~~~~~~l~-G~pV~~~~~~p~~~~~~gd~s~-----~~~~--~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~ 399 (418) T protein:vir:10 329 GNPV-NGTTPRLW-NLPVVETQAMTANEFLVGAFSM-----AAQI--FDRMEIEVLLSTENVDDFEKNMVSIRAEERLAL 399 (418) T ss_pred cccc-cCCCceec-ceeeEEcCCCCCCcEEEeeccc-----eEEE--EEecceEEEEecccchhhhcCceEEEEEEeecc Confidence 1100 11235664 5899999998876666653110 0101 11111112223322 2 222333 456665 Q ss_pred eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 
442 VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 442 ~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) .+.+|.+ |++ ++-+.-+ T Consensus 400 --~~~~~~a----~~~-~~~~~~~ 416 (418) T protein:vir:10 400 --AVYRPES----FVT-GALVEQA 416 (418) T ss_pred --EEecccc----eEE-EEeccCC Confidence 2444433 432 1111111 No 96 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=74.31 E-value=0.16 Score=24.95 Aligned_cols=315 Identities=11% Similarity=0.041 Sum_probs=123.3 Q ss_pred CCc-------cchhh-hHHHhh---hhhhccccc--------c--ChhhhhheehccccchhHHHhhhhhhhcccccccc Q lcl|NC_018861. 1 MAD-------KYLLD-ESTKEK---FITSNLYPN--------L--NESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFT 59 (465) Q Consensus 1 ~~~-------~~~~~-e~~~e~---~~~~~~~~~--------~--~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~ 59 (465) ... |.-.+ +.+..+ ....+..+. + .++++.....|. +++..+-+.+++++..-.-.- T Consensus 66 ~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~--~~e~~~al~~~t~~~gG~lvP 143 (425) T protein:vir:10 66 PTSDALAKVDKVSADLEALQAAVDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVK--RGDVQAALNKGEDSEGGYLTP 143 (425) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccHHHHHHHHHHhh--hhhhHHHhhcCcCCCCceecc Confidence 000 00000 011111 000000000 0 112322222221 123333444443221111112 Q ss_pred chhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc Q lcl|NC_018861. 60 PILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF 138 (465) Q Consensus 60 P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~ 138 (465) +.+. -++..+-...+..+++.|.||+++..-+.-. . ++. 
T Consensus 144 ~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~-----~-~~~---------------------------------- 183 (425) T protein:vir:10 144 IEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFN-----M-GGT---------------------------------- 183 (425) T ss_pred HhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE-----c-CCc---------------------------------- Confidence 2222 3556666777788899999998776433210 0 000 Q ss_pred cccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccch Q lcl|NC_018861. 139 KTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATA 218 (465) Q Consensus 139 ~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta 218 (465) .+ .|...... T Consensus 184 -~a------~wv~E~~~--------------------------------------------------------------- 193 (425) T protein:vir:10 184 -TS------GWVGEASQ--------------------------------------------------------------- 193 (425) T ss_pred -ce------eeeccccc--------------------------------------------------------------- Confidence 00 00000000 Q ss_pred hhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh--------h Q lcl|NC_018861. 219 AGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK--------A 290 (465) Q Consensus 219 ~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~--------l 290 (465) ..|.....|.++.|+..|. +-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.+|.- + T Consensus 194 ~~~~~~~~f~~v~~~~~k~-------~~~i~iS~ell~ds----~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gi 262 (425) T protein:vir:10 194 RPQTNAATFQPLSFASGEI-------YANPAATQQILDDA----EIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGL 262 (425) T ss_pred cccccccccceeeeeheee-------EeehHhHHHHHhcc----hhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCccee Confidence 0000001355555555554 44567999999985 367889999999999999999999862 1 Q ss_pred hheeeeeeee------eeccC-CcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcc Q lcl|NC_018861. 
291 NEVATVCTDF------DVNSA-DGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGS 363 (465) Q Consensus 291 ~~~at~~~~~------~~~~~-~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~ 363 (465) ....+..... .+... .+.-..-.+..|...+..+... +.+.-..|+++.....|...- ...+. T Consensus 263 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~------~~~~a~~vmn~~~~~~L~~lk----D~~G~ 332 (425) T protein:vir:10 263 LTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSA------FTGNARFAMNRNTQRQVRKLK----DGQGN 332 (425) T ss_pred eeccccccccccccccccccccccccccccHHHHHHHHhhhhhh------hccCCEEEEchHHHHHHHHhh----cCCCc Confidence 1111111110 00000 0000011223333333222222 223345789999999987642 11111 Q ss_pred cccccccccceEEEEecCceEEEEeCCCCc-----ceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeee--e Q lcl|NC_018861. 364 KIDAINSGIKPNVGKFDNRYDVIVDNFAEF-----DYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMIL--N 436 (465) Q Consensus 364 ~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~-----dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~--~ 436 (465) +.-.++ ......++|. |++|+++.+.|. +-+++| +-. ...+. +.-..+.+..||-.-.-.++| . T Consensus 333 ~l~~~~-~~~g~~~~l~-G~PV~~~~~~p~~~~~~~~i~~G---d~~--~~~~i--~~~~~~~v~~d~~~~~~~~~~~~~ 403 (425) T protein:vir:10 333 YLWQPS-YVAGQPATLA-GYPVTEVPDMPDVAANSTPILFG---DFQ--QTYLI--IDRIGVRVLRDPYTAKPYVLFYTT 403 (425) T ss_pred eeeccC-ccCCCCceec-ceeeEEecCcCCccCCccEEEEE---ehh--ccEEE--EEecceEEEecccccCCcEEEEEE Confidence 111111 0011225664 578988877652 223332 100 00011 111112233344433333333 3 Q ss_pred eeeee-eecCcccccccceEEEeec Q lcl|NC_018861. 437 NRYDV-VATPLHPEAFIRTFAVNLN 460 (465) Q Consensus 437 tRY~l-~~nPf~~~~~~~~f~~~~~ 460 (465) .||+. +.+| .+...-...--. 
T Consensus 404 ~r~d~~v~~~---~A~~~l~~~as~ 425 (425) T protein:vir:10 404 KRVGGGLLNP---EPMRAMKVAASE 425 (425) T ss_pred EEeccEeecc---cceEEEEeeccC Confidence 45555 4444 221110000000 No 97 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=73.60 E-value=0.17 Score=24.83 Aligned_cols=279 Identities=12% Similarity=-0.005 Sum_probs=97.0 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPY 215 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 215 (465) ++.++....+..+... --.-+|-.++..-.- ..+.. +....+ ... ....-.+.+. T Consensus 1 Ma~~~~~~gg~~vP~~----~~~~ii~~l~~~s~i----~~l~~-~i~~~~--~~~--------------~ip~~~~~~~ 55 (315) T protein:vir:80 1 MADDFLSAGKLELPGS----MIGAVRDRAIDSGVL----AKLSP-EQPTIF--GPV--------------KGAVFSGVPR 55 (315) T ss_pred CCCCcCCcCceEcchH----HHHHHHHHHHhhchh----hhhcc-eeecCC--Cce--------------EEEEEeCCcc Confidence 2222222212111111 001111111110000 00000 000000 000 0000011111 Q ss_pred cchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee Q lcl|NC_018861. 216 ATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVAT 295 (465) Q Consensus 216 ~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at 295 (465) +.-.+| |..+++...++++++..+|.-+-....|-||.+|.. .|+..+|.++|..++...|.|.+-+.+..... 
T Consensus 56 a~wv~E--g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~----~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~ 129 (315) T protein:vir:80 56 AKIVGE--GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADA----DYRLGVLQDLISPALGASIGRAVDLIAFHGID 129 (315) T ss_pred eEEeeC--CccccccccceeeeEeeeeeEEeeehhhHHHhhcCc----hhHHHHHHHHHHHHHHHHHHHHHhhheeeccC Confidence 111223 455667777788888887777667789999998843 56666666666666666665555443321100 Q ss_pred e--eeeeee-c-c-CCcccHH----HHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccc Q lcl|NC_018861. 296 V--CTDFDV-N-S-ADGRWFI----EKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKID 366 (465) Q Consensus 296 ~--~~~~~~-~-~-~~~~~~~----e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~ 366 (465) . ++.... . . ....-.+ +.+..+...+. .+..... ...+..+++++....|+..--..-.+...... T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~----~~~~~~~-~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~ 204 (315) T protein:vir:80 130 PATGKAASAVHTSLNKTKNIVDATDSATADLVKAVG----LIAGAGL-QVPNGVALDPAFSFALSTEVYPKGSPLAGQPM 204 (315) T ss_pred CCCCccccccccccccccceeeccccchHHHHHHHH----HHhhccC-ccceEEEEcHHHHHHHHHHhhccCCccccccc Confidence 0 000000 0 0 0000000 11222222222 2222222 23456889999999997663222222111111 Q ss_pred ccccccceEEEEecCceEEEEeCCCCcc---------eEEEEEecCCCccceeEEecccccceeeee--CCC----c-cc Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFAEFD---------YCTVAYKGASNFDAGIFFAPYNITLQQNLT--DPV----S-GQ 430 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~~~d---------y~~vg~kg~~~~d~glfy~PY~~~~~~~~~--dp~----s-~q 430 (465) .++ ....-.|+|. 
|++|+++.+.+.+ .++.|- - +.++|...-...+.+.- |++ + || T Consensus 205 ~~~-~~~g~~~tl~-G~PV~~~~~~~~~~~~~~~~~~~~~~GD---f---s~~~~g~~~~~~i~i~~~~~~~~~~~~~~~ 276 (315) T protein:vir:80 205 YPA-AGFAGLDNWR-GLNVGASSTVSGAPEMSPASGVKAIVGD---F---SRVHWGFQRNFPIELIEYGDPDQTGRDLKG 276 (315) T ss_pred ccc-cccCCCceec-ceeeEecCcCCcccccccccccEEEEee---c---ccEEEEEecCeeEEEeccccccCcccchhh Confidence 111 0011125664 5899988776422 122120 0 00111111111111110 111 1 11 Q ss_pred -ceeeee--eeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 431 -PAMILN--NRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 431 -p~~~~~--tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) -.++|. .|+|. ++.++++ |++ |.+..-. T Consensus 277 ~~~v~~r~~~r~~~--~v~~~~a----~~~-l~~~~a~ 307 (315) T protein:vir:80 277 HNEVMVRAEAVLYV--AIESLDS----FAV-VKEKAAP 307 (315) T ss_pred cCcEEEEEEEEecc--eeecccc----eEE-EeeccCC Confidence 123333 23332 2222221 111 1111000 No 98 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=71.11 E-value=0.2 Score=24.42 Aligned_cols=306 Identities=12% Similarity=0.040 Sum_probs=125.9 Q ss_pred CCccchh------h--hHHHhhhhhhccc-cc----cCh-------------------------hhhhheehccccc--h Q lcl|NC_018861. 1 MADKYLL------D--ESTKEKFITSNLY-PN----LNE-------------------------SEKNIMRTVLENQ--G 40 (465) Q Consensus 1 ~~~~~~~------~--e~~~e~~~~~~~~-~~----~~~-------------------------~~~~~~~~l~~n~--~ 40 (465) ..++... . ++|.++..-.... +. +.+ +++.....+.... . 
T Consensus 32 ~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (421) T protein:vir:13 32 AKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSE 111 (421) T ss_pred hhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccchhHHHHHHHHHHHHHhhhccchhH Confidence 1111000 0 0111111100000 00 000 0000000000000 0 Q ss_pred hHHHhhhhhhhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccc Q lcl|NC_018861. 41 NEVKMLMESTVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLK 118 (465) Q Consensus 41 ~~~~~i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~ 118 (465) +.+. ..++++-..-=|.-+ .++..+.+...-.+++.+.||+++++-+--.. . ... T Consensus 112 ~~ra----~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~----~-~~~-------------- 168 (421) T protein:vir:13 112 EERD----IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRA----G-ASV-------------- 168 (421) T ss_pred HHhh----ccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEee----c-CCc-------------- Confidence 0011 111111111113222 34455556666778888889888765332111 0 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcc Q lcl|NC_018861. 119 TESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVY 198 (465) Q Consensus 119 ~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~ 198 (465) ....+ T Consensus 169 --------------------------~~~~~------------------------------------------------- 173 (421) T protein:vir:13 169 --------------------------DKLAN------------------------------------------------- 173 (421) T ss_pred --------------------------cceee------------------------------------------------- Confidence 00000 Q ss_pred cccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHH Q lcl|NC_018861. 
199 TNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEV 278 (465) Q Consensus 199 ~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEI 278 (465) ...|...++-..++++++..++.-+-...+|-||.+|- ..|.++.|.+-|+..+ T Consensus 174 ----------------------~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds----~~~l~~~i~~~la~~~ 227 (421) T protein:vir:13 174 ----------------------LAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDS----EINFLEFVNEEFAEFA 227 (421) T ss_pred ----------------------ccccccccccccceeEEEeeeeeeEeehhhhHHHHhhh----HHHHHHHHHHHHHHHH Confidence 00011222223345666666666666678999999984 2467889999999999 Q ss_pred HHHhhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccc Q lcl|NC_018861. 279 ALEIDRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVL 358 (465) Q Consensus 279 mlEINreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~ 358 (465) ..-+|..|+..+.-..+ ...... .+....++..+.. .+.....+|+++.....|...- T Consensus 228 ~~~~~~~i~~~~~g~~~--------~~~~~~-~d~i~~~~~~l~~---------~~~~~a~~v~n~~~~~~l~~lk---- 285 (421) T protein:vir:13 228 VNTENAEIVKQAKAVLA--------EETIND-YAGLVKTINSLVP---------NARKRAIIVTNSDGRAYLDGLM---- 285 (421) T ss_pred HHHhhhhHhhhhhhccc--------cccccc-hHHHHHHHHHhhh---------hhcCCCEEEEcHHHHHHHHHhh---- Confidence 99999998876532211 111111 2334444444321 1234567889999999887641 Q ss_pred cCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc---------cceeeeeCCCcc Q lcl|NC_018861. 359 SPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI---------TLQQNLTDPVSG 429 (465) Q Consensus 359 ~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~---------~~~~~~~dp~s~ 429 (465) ...+.+.-.... ..--++| .|++|++.++.+.. 
- .-+..++|+.+.- +...+ .+-..| T Consensus 286 d~~G~~i~~~~~--~~~~~tl-~G~pV~~~~~~~~~-----~----~~~~~~~~gd~~~~~~~~~~~~~~v~~-~~~~~f 352 (421) T protein:vir:13 286 DKQGRPLLKELS--DGGDLVF-KGRPVIELEESIFD-----V----GDETKFIVSDFKTLIKFMDRKQYLIDQ-SKEAGY 352 (421) T ss_pred cCCCceeecCcC--CCCCcee-cceeeEEecccccc-----C----CCceEEEEEeccccEEEEEecceEEEe-eccccc Confidence 011111111000 0111344 45677776654421 0 1122334443211 11121 122223 Q ss_pred c---ceeeeeeeeeee-ecCccc-ccccceEE--EeeccceeC Q lcl|NC_018861. 430 Q---PAMILNNRYDVV-ATPLHP-EAFIRTFA--VNLNNYIIS 465 (465) Q Consensus 430 q---p~~~~~tRY~l~-~nPf~~-~~~~~~f~--~~~~~~~~~ 465 (465) + =.+-+..|++.+ .+|=.. -....+|+ |.++.+.-+ T Consensus 353 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~ 395 (421) T protein:vir:13 353 TKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKS 395 (421) T ss_pred ccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCC Confidence 3 355567788774 333111 11122333 444444333 No 99 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=70.04 E-value=0.21 Score=24.25 Aligned_cols=303 Identities=14% Similarity=0.064 Sum_probs=113.0 Q ss_pred CCccc------hhh--hHHHhhhhhhccc------cccChhhhhheehccccchh-HHHh--------hhhhhhcccccc Q lcl|NC_018861. 1 MADKY------LLD--ESTKEKFITSNLY------PNLNESEKNIMRTVLENQGN-EVKM--------LMESTVTGDIAK 57 (465) Q Consensus 1 ~~~~~------~~~--e~~~e~~~~~~~~------~~~~~~~~~~~~~l~~n~~~-~~~~--------i~est~t~~v~~ 57 (465) -.++. +.+ ++..++....... ..+.+.+......+-..+.. .... ..+++ +.+... 
T Consensus 88 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~-~~~~g~ 166 (437) T protein:vir:10 88 SADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIA-LKDGKV 166 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcc-cccccc Confidence 00000 000 0000000000000 00000000000000000000 0001 11111 111100 Q ss_pred ccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccc Q lcl|NC_018861. 58 FTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV 136 (465) Q Consensus 58 ~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~ 136 (465) .-|.-+ ..+..........+++.|.||+.+.+-+--.+.. +. T Consensus 167 lvp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~-------------------------------- 209 (437) T protein:vir:10 167 IIPETILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNS-----TD-------------------------------- 209 (437) T ss_pred cchHHHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeecc-----cc-------------------------------- Confidence 011111 1112111222234556677766665433322200 00 Q ss_pred cccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccccc Q lcl|NC_018861. 137 SFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYA 216 (465) Q Consensus 137 s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 216 (465) .. ++..|. T Consensus 210 ---~~----------------------------------~~~~e~----------------------------------- 217 (437) T protein:vir:10 210 ---LL----------------------------------TAHTEY----------------------------------- 217 (437) T ss_pred ---cc----------------------------------cccccc----------------------------------- Confidence 00 000000 Q ss_pred chhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeee Q lcl|NC_018861. 
217 TAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATV 296 (465) Q Consensus 217 Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~ 296 (465) ....|.....|.++.|. ++.-+--..+|-||.+|- ..|.+++|.+.|+.-|..-+|..||........ T Consensus 218 ~~~~e~~~~~~~~v~~~-------~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~- 285 (437) T protein:vir:10 218 GQTTKNATPVITPILWD-------LKTYTGGYVFSQELISDS----SYDWQAELQSRLIELRDNTDDSLIITALTDGIK- 285 (437) T ss_pred ccccccccccceeeeee-------hhheeeehhhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc- Confidence 00000001234444444 444444578899999984 357889999999999999999999987632111 Q ss_pred eeeeeeccCCcccHHHHHHHHHHHHH-HHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccc Q lcl|NC_018861. 297 CTDFDVNSADGRWFIEKARGLSMRIS-NEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSG 371 (465) Q Consensus 297 ~~~~~~~~~~~~~~~e~~~~L~~~i~-~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~ 371 (465) ....+.+. .+|...+. .+... +...-..|+++.....|... |-..+.|... T Consensus 286 ------~~~~~~~~----~~~~~~~~~~l~~~------~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~-------- 341 (437) T protein:vir:10 286 ------KTTSTYLL----GDLKKVLNVTLKPQ------DSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVT-------- 341 (437) T ss_pred ------ccccccch----hhHHHHHHhhhhhh------hhcCCEEEEcHHHHHHHHHhhccCCCeeeccCcc-------- Confidence 11122222 22322222 11121 22233569999999988775 2222222111 Q ss_pred cceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc--------c-ceeeeeCCCcccceeeeeeeeeee Q lcl|NC_018861. 372 IKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI--------T-LQQNLTDPVSGQPAMILNNRYDVV 442 (465) Q Consensus 372 ~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~--------~-~~~~~~dp~s~qp~~~~~tRY~l~ 442 (465) ...-++|. |++||+.+..... ....-+..+||+.+.- . ......+-..+...+.+..||+.. 
T Consensus 342 -~~~~~~l~-G~pv~~~~~~~~~-------~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~ 412 (437) T protein:vir:10 342 -AATGYTLL-GKTVVIVDDKLFP-------SASAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVV 412 (437) T ss_pred -CCCCcccc-cceeEEecccccC-------CcCCCceEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccE Confidence 11124564 4676654332110 0011112244444321 1 111112345556667777898773 Q ss_pred ecCcccccccceEEEeeccceeC Q lcl|NC_018861. 443 ATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 443 ~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) + .+|.+ |+ .|++.+-+ T Consensus 413 ~--~~~~a----~~-~l~~~~~~ 428 (437) T protein:vir:10 413 Q--ASKDL----IV-NLTGKLKA 428 (437) T ss_pred E--ecccc----eE-EEEeeccc Confidence 2 12333 33 33444333 No 100 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=70.00 E-value=0.21 Score=24.24 Aligned_cols=274 Identities=11% Similarity=-0.009 Sum_probs=118.2 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhhe Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIA 79 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIw 79 (465) |+= |+-++.+..- ++++.. ..-+.+. .++..+.+.-+-..++ T Consensus 1 m~~-----------------------------------~~~~~~~~~~-t~~~~~-lvP~~~~~~ii~~~~~~s~l~~~~ 43 (297) T protein:vir:95 1 MTV-----------------------------------QTFNPENVLV-SQKKDG-TLHKEFTDIIMKEVAQNSLVMQLG 43 (297) T ss_pred CCc-----------------------------------cccccccccc-cCCCcc-eechhHHHHHHHHHHhhchhhhhc Confidence 110 0001111110 111111 1112222 3445555666777888 Q ss_pred eeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccch Q lcl|NC_018861. 
80 GVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDN 159 (465) Q Consensus 80 GVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTg 159 (465) .+.||++++...+-.. ... .. +.+ T Consensus 44 ~~~~~~~~~~~~~~~~--~~~--~~------------------------------------------a~~---------- 67 (297) T protein:vir:95 44 QYQEMEGEQEKTVYVQ--TDG--IS------------------------------------------AYW---------- 67 (297) T ss_pred ceeecCCCccEEEEEE--cCC--ce------------------------------------------eEE---------- Confidence 9999988876554322 100 00 000 Q ss_pred hhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEE Q lcl|NC_018861. 160 IVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLA 239 (465) Q Consensus 160 Lifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tV 239 (465) .+| |..+++-..++++++. T Consensus 68 -----------------------------------------------------------v~E--g~~~~~~~~~f~~v~l 86 (297) T protein:vir:95 68 -----------------------------------------------------------VNE--TEKIKTDKPEVVPVTL 86 (297) T ss_pred -----------------------------------------------------------eec--CccccccccceeEEEE Confidence 001 1223333345677778 Q ss_pred EeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeee---eeeeeccCCcccHHHHHHH Q lcl|NC_018861. 240 EAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVC---TDFDVNSADGRWFIEKARG 316 (465) Q Consensus 240 tAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~---~~~~~~~~~~~~~~e~~~~ 316 (465) ..|..+-...+|.||.+|-. .|.++.|.+.|+..|...+++.+|.--......+ ..-+......... -+.. 
T Consensus 87 ~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~--t~~~ 160 (297) T protein:vir:95 87 KAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPI--NYDN 160 (297) T ss_pred eeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceeccccc--CHHH Confidence 88888778889999999874 4788999999999999999999985321100000 0000000010000 1222 Q ss_pred HHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC--Ccc Q lcl|NC_018861. 317 LSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA--EFD 394 (465) Q Consensus 317 L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~--~~d 394 (465) |.++...+... +.....++++++....|+..-- ..+...... ..|+|. |++|+..+.. +.. T Consensus 161 ----i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~l~d----~~G~~i~~~------~~~~l~-G~Pv~~~~~~~~~~~ 223 (297) T protein:vir:95 161 ----ILKLQDALYDA--DVEPNAFVSKIQNRSALREARD----GNKVSIYDK------AANTID-GITTVDLKSARFEKG 223 (297) T ss_pred ----HHHHHHHhhhc--cCCcCEEEEcHHHHHHHHHhhc----cCCceeecC------CCCccc-ceeeEeecCCCCCCc Confidence 23333333332 2345678999999999986411 111111111 123443 4566654432 222 Q ss_pred eEEEEEecCCCccceeEEecccccceeee--------eCCC----c-cc-ceeeee--eeeee-eecCcccccccceEEE Q lcl|NC_018861. 395 YCTVAYKGASNFDAGIFFAPYNITLQQNL--------TDPV----S-GQ-PAMILN--NRYDV-VATPLHPEAFIRTFAV 457 (465) Q Consensus 395 y~~vg~kg~~~~d~glfy~PY~~~~~~~~--------~dp~----s-~q-p~~~~~--tRY~l-~~nPf~~~~~~~~f~~ 457 (465) -+++|=. ..+++...-...+.+. .|+. + || -.++|. .|++. +.|| . .|++ T Consensus 224 ~~~~gd~------s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~---~----a~~~ 290 (297) T protein:vir:95 224 DLLAGDF------DNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKT---D----AFAK 290 (297) T ss_pred eEEEEec------ccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecc---c----ceEE Confidence 2222210 0111222211111111 1111 1 22 223333 44554 2233 2 2331 Q ss_pred eecccee Q lcl|NC_018861. 
458 NLNNYII 464 (465) Q Consensus 458 ~~~~~~~ 464 (465) -.-=|-+ T Consensus 291 l~~at~~ 297 (297) T protein:vir:95 291 LTPAERV 297 (297) T ss_pred EeecCCC Confidence 1100001 No 101 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=68.29 E-value=0.24 Score=23.99 Aligned_cols=301 Identities=11% Similarity=0.045 Sum_probs=107.9 Q ss_pred CCccchhh-----hHHHhhhhhhc-----------------ccccc---Ch---------hhhhhe-----ehccccchh Q lcl|NC_018861. 1 MADKYLLD-----ESTKEKFITSN-----------------LYPNL---NE---------SEKNIM-----RTVLENQGN 41 (465) Q Consensus 1 ~~~~~~~~-----e~~~e~~~~~~-----------------~~~~~---~~---------~~~~~~-----~~l~~n~~~ 41 (465) .++..+.. +.|.+++.-+. ....- .+ .-|+.+ ...+.+..+ T Consensus 34 ~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~ 113 (387) T protein:vir:94 34 IDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQR 113 (387) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHH Confidence 11111000 11222221110 00000 00 001110 000000011 Q ss_pred HHHhhhhhhhccccccccchhhh------hhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccC Q lcl|NC_018861. 42 EVKMLMESTVTGDIAKFTPILVP------VIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVL 115 (465) Q Consensus 42 ~~~~i~est~t~~v~~~~P~l~~------l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~ 115 (465) .++-+.+++.+ .+ ..||+ ++.+.-..-.-.+++.|.|+++.+. . |-.+.. . 
T Consensus 114 ~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~--p--~~~~~~--~------------ 170 (387) T protein:vir:94 114 LLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI--P--RVSYTL--D------------ 170 (387) T ss_pred HHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchhhhhceeeecCCcee--e--eeeccC--C------------ Confidence 11112222211 11 23333 3344444445567777877764321 0 100100 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC Q lcl|NC_018861. 116 KLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE 195 (465) Q Consensus 116 ~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~ 195 (465) ++ .|. T Consensus 171 ------------------------~a------~~v--------------------------------------------- 175 (387) T protein:vir:94 171 ------------------------DD------DFI--------------------------------------------- 175 (387) T ss_pred ------------------------cc------ccc--------------------------------------------- Confidence 00 000 Q ss_pred CcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHH Q lcl|NC_018861. 196 AVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILS 275 (465) Q Consensus 196 ~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLs 275 (465) +| |...++...++++++..+|.-+-...+|-||.+|- ..|.|++|.+-|+ T Consensus 176 ------------------------~E--g~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds----~~~l~~~i~~~la 225 (387) T protein:vir:94 176 ------------------------TD--VETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS----DVDLVNWVENALQ 225 (387) T ss_pred ------------------------cc--cccccccccccceeeechheeeeechhhHHHHhhh----HHHHHHHHHHHHH Confidence 00 11122222234455555555555688999999984 4677899999999 Q ss_pred HHHHHHhhHHHHhhhhheeeee---eeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHh Q lcl|NC_018861. 
276 AEVALEIDRTIIEKANEVATVC---TDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDE 352 (465) Q Consensus 276 tEImlEINreii~~l~~~at~~---~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~ 352 (465) ..|..-.|..++-.-.-..... ..-.+..+.+.-. +..|.. +-+.+... -|..+.|++-+...+.+|.. T Consensus 226 ~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~---~d~i~~----~~~~l~~~-y~~na~~imn~~t~~~~~~~ 297 (387) T protein:vir:94 226 SGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADM---YDAIIN----ALADLHED-YRDNATIYMRYADYVKIISV 297 (387) T ss_pred HHHHHHHHHhHhhcCCCccccceeeeccccccccccch---HHHHHH----HHhccChh-hhcCCEEEEechHHHHHHHH Confidence 9888766666653322111100 0001111222111 222222 22222221 12355565544444444433 Q ss_pred cCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccce Q lcl|NC_018861. 353 IGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPA 432 (465) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~ 432 (465) . +. .+..... ..-++|. |++||+..+++. +++|- =+-||.-|......+..+..+.+-. T Consensus 298 ~---~~--~~~~~~~------~~~~~ll-G~PV~~~~~~~~--~~~GD-------f~~~~~~~~~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:94 298 L---SN--GTTNFFD------TPAEKVF-GKPVVFTDAAVK--PIVGD-------FNYFGINYDGTTYDTDKDVKKGEYL 356 (387) T ss_pred H---hc--CCCcccc------cCCcccc-ccceEEecCCCc--eeeec-------hhhhhhhhhhhhheecccccCCceE Confidence 2 10 1111110 0113565 569998877654 33331 1112222222222222333333333 Q ss_pred eeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 433 MILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 433 ~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) +-...|++. +++| ++ |. 
+....---+ T Consensus 357 ~~~~~r~Dg~v~~~---~A----~~~l~~ka~~~~ 384 (387) T protein:vir:94 357 FVLTAWYDQQRTLD---SA----FRIAKAKENTGP 384 (387) T ss_pred EEEEEEeCcEeech---hh----eEEEEeecCCCC Confidence 333446665 3333 22 22 111000000 No 102 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=68.29 E-value=0.24 Score=23.99 Aligned_cols=301 Identities=11% Similarity=0.045 Sum_probs=107.9 Q ss_pred CCccchhh-----hHHHhhhhhhc-----------------ccccc---Ch---------hhhhhe-----ehccccchh Q lcl|NC_018861. 1 MADKYLLD-----ESTKEKFITSN-----------------LYPNL---NE---------SEKNIM-----RTVLENQGN 41 (465) Q Consensus 1 ~~~~~~~~-----e~~~e~~~~~~-----------------~~~~~---~~---------~~~~~~-----~~l~~n~~~ 41 (465) .++..+.. +.|.+++.-+. ....- .+ .-|+.+ ...+.+..+ T Consensus 34 ~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~ 113 (387) T protein:vir:26 34 IDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQR 113 (387) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHH Confidence 11111000 11222221110 00000 00 001110 000000011 Q ss_pred HHHhhhhhhhccccccccchhhh------hhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccC Q lcl|NC_018861. 42 EVKMLMESTVTGDIAKFTPILVP------VIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVL 115 (465) Q Consensus 42 ~~~~i~est~t~~v~~~~P~l~~------l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~ 115 (465) .++-+.+++.+ .+ ..||+ ++.+.-..-.-.+++.|.|+++.+. . |-.+.. . 
T Consensus 114 ~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~--p--~~~~~~--~------------ 170 (387) T protein:vir:26 114 LLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI--P--RVSYTL--D------------ 170 (387) T ss_pred HHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchhhhhceeeecCCcee--e--eeeccC--C------------ Confidence 11112222211 11 23333 3344444445567777877764321 0 100100 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC Q lcl|NC_018861. 116 KLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE 195 (465) Q Consensus 116 ~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~ 195 (465) ++ .|. T Consensus 171 ------------------------~a------~~v--------------------------------------------- 175 (387) T protein:vir:26 171 ------------------------DD------DFI--------------------------------------------- 175 (387) T ss_pred ------------------------cc------ccc--------------------------------------------- Confidence 00 000 Q ss_pred CcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHH Q lcl|NC_018861. 196 AVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILS 275 (465) Q Consensus 196 ~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLs 275 (465) +| |...++...++++++..+|.-+-...+|-||.+|- ..|.|++|.+-|+ T Consensus 176 ------------------------~E--g~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds----~~~l~~~i~~~la 225 (387) T protein:vir:26 176 ------------------------TD--VETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS----DVDLVNWVENALQ 225 (387) T ss_pred ------------------------cc--cccccccccccceeeechheeeeechhhHHHHhhh----HHHHHHHHHHHHH Confidence 00 11122222234455555555555688999999984 4677899999999 Q ss_pred HHHHHHhhHHHHhhhhheeeee---eeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHh Q lcl|NC_018861. 
276 AEVALEIDRTIIEKANEVATVC---TDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDE 352 (465) Q Consensus 276 tEImlEINreii~~l~~~at~~---~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~ 352 (465) ..|..-.|..++-.-.-..... ..-.+..+.+.-. +..|.. +-+.+... -|..+.|++-+...+.+|.. T Consensus 226 ~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~---~d~i~~----~~~~l~~~-y~~na~~imn~~t~~~~~~~ 297 (387) T protein:vir:26 226 SGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADM---YDAIIN----ALADLHED-YRDNATIYMRYADYVKIISV 297 (387) T ss_pred HHHHHHHHHhHhhcCCCccccceeeeccccccccccch---HHHHHH----HHhccChh-hhcCCEEEEechHHHHHHHH Confidence 9888766666653322111100 0001111222111 222222 22222221 12355565544444444433 Q ss_pred cCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccce Q lcl|NC_018861. 353 IGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPA 432 (465) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~ 432 (465) . +. .+..... ..-++|. |++||+..+++. +++|- =+-||.-|......+..+..+.+-. T Consensus 298 ~---~~--~~~~~~~------~~~~~ll-G~PV~~~~~~~~--~~~GD-------f~~~~~~~~~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:26 298 L---SN--GTTNFFD------TPAEKVF-GKPVVFTDAAVK--PIVGD-------FNYFGINYDGTTYDTDKDVKKGEYL 356 (387) T ss_pred H---hc--CCCcccc------cCCcccc-ccceEEecCCCc--eeeec-------hhhhhhhhhhhhheecccccCCceE Confidence 2 10 1111110 0113565 569998877654 33331 1112222222222222333333333 Q ss_pred eeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 433 MILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 433 ~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) +-...|++. +++| ++ |. 
+....---+ T Consensus 357 ~~~~~r~Dg~v~~~---~A----~~~l~~ka~~~~ 384 (387) T protein:vir:26 357 FVLTAWYDQQRTLD---SA----FRIAKAKENTGP 384 (387) T ss_pred EEEEEEeCcEeech---hh----eEEEEeecCCCC Confidence 333446665 3333 22 22 111000000 No 103 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=68.29 E-value=0.24 Score=23.99 Aligned_cols=301 Identities=11% Similarity=0.045 Sum_probs=107.9 Q ss_pred CCccchhh-----hHHHhhhhhhc-----------------ccccc---Ch---------hhhhhe-----ehccccchh Q lcl|NC_018861. 1 MADKYLLD-----ESTKEKFITSN-----------------LYPNL---NE---------SEKNIM-----RTVLENQGN 41 (465) Q Consensus 1 ~~~~~~~~-----e~~~e~~~~~~-----------------~~~~~---~~---------~~~~~~-----~~l~~n~~~ 41 (465) .++..+.. +.|.+++.-+. ....- .+ .-|+.+ ...+.+..+ T Consensus 34 ~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~ 113 (387) T protein:vir:96 34 IDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQR 113 (387) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHH Confidence 11111000 11222221110 00000 00 001110 000000011 Q ss_pred HHHhhhhhhhccccccccchhhh------hhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccC Q lcl|NC_018861. 42 EVKMLMESTVTGDIAKFTPILVP------VIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVL 115 (465) Q Consensus 42 ~~~~i~est~t~~v~~~~P~l~~------l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~ 115 (465) .++-+.+++.+ .+ ..||+ ++.+.-..-.-.+++.|.|+++.+. . |-.+.. . 
T Consensus 114 ~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~--p--~~~~~~--~------------ 170 (387) T protein:vir:96 114 LLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI--P--RVSYTL--D------------ 170 (387) T ss_pred HHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchhhhhceeeecCCcee--e--eeeccC--C------------ Confidence 11112222211 11 23333 3344444445567777877764321 0 100100 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC Q lcl|NC_018861. 116 KLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE 195 (465) Q Consensus 116 ~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~ 195 (465) ++ .|. T Consensus 171 ------------------------~a------~~v--------------------------------------------- 175 (387) T protein:vir:96 171 ------------------------DD------DFI--------------------------------------------- 175 (387) T ss_pred ------------------------cc------ccc--------------------------------------------- Confidence 00 000 Q ss_pred CcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHH Q lcl|NC_018861. 196 AVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILS 275 (465) Q Consensus 196 ~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLs 275 (465) +| |...++...++++++..+|.-+-...+|-||.+|- ..|.|++|.+-|+ T Consensus 176 ------------------------~E--g~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds----~~~l~~~i~~~la 225 (387) T protein:vir:96 176 ------------------------TD--VETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS----DVDLVNWVENALQ 225 (387) T ss_pred ------------------------cc--cccccccccccceeeechheeeeechhhHHHHhhh----HHHHHHHHHHHHH Confidence 00 11122222234455555555555688999999984 4677899999999 Q ss_pred HHHHHHhhHHHHhhhhheeeee---eeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHh Q lcl|NC_018861. 
276 AEVALEIDRTIIEKANEVATVC---TDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDE 352 (465) Q Consensus 276 tEImlEINreii~~l~~~at~~---~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~ 352 (465) ..|..-.|..++-.-.-..... ..-.+..+.+.-. +..|.. +-+.+... -|..+.|++-+...+.+|.. T Consensus 226 ~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~---~d~i~~----~~~~l~~~-y~~na~~imn~~t~~~~~~~ 297 (387) T protein:vir:96 226 SGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADM---YDAIIN----ALADLHED-YRDNATIYMRYADYVKIISV 297 (387) T ss_pred HHHHHHHHHhHhhcCCCccccceeeeccccccccccch---HHHHHH----HHhccChh-hhcCCEEEEechHHHHHHHH Confidence 9888766666653322111100 0001111222111 222222 22222221 12355565544444444433 Q ss_pred cCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccce Q lcl|NC_018861. 353 IGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPA 432 (465) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~ 432 (465) . +. .+..... ..-++|. |++||+..+++. +++|- =+-||.-|......+..+..+.+-. T Consensus 298 ~---~~--~~~~~~~------~~~~~ll-G~PV~~~~~~~~--~~~GD-------f~~~~~~~~~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:96 298 L---SN--GTTNFFD------TPAEKVF-GKPVVFTDAAVK--PIVGD-------FNYFGINYDGTTYDTDKDVKKGEYL 356 (387) T ss_pred H---hc--CCCcccc------cCCcccc-ccceEEecCCCc--eeeec-------hhhhhhhhhhhhheecccccCCceE Confidence 2 10 1111110 0113565 569998877654 33331 1112222222222222333333333 Q ss_pred eeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 433 MILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 433 ~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) +-...|++. +++| ++ |. 
+....---+ T Consensus 357 ~~~~~r~Dg~v~~~---~A----~~~l~~ka~~~~ 384 (387) T protein:vir:96 357 FVLTAWYDQQRTLD---SA----FRIAKAKENTGP 384 (387) T ss_pred EEEEEEeCcEeech---hh----eEEEEeecCCCC Confidence 333446665 3333 22 22 111000000 No 104 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=67.86 E-value=0.25 Score=23.92 Aligned_cols=321 Identities=12% Similarity=0.054 Sum_probs=118.1 Q ss_pred CCccchhh-----------------hHHHhhhhhhcc---ccccC----hhhhhhe------ehccccc-hhHHHhhhhh Q lcl|NC_018861. 1 MADKYLLD-----------------ESTKEKFITSNL---YPNLN----ESEKNIM------RTVLENQ-GNEVKMLMES 49 (465) Q Consensus 1 ~~~~~~~~-----------------e~~~e~~~~~~~---~~~~~----~~~~~~~------~~l~~n~-~~~~~~i~es 49 (465) |.++.-.- ++..+++--..+ .+... +.+++.. ..-.+.. .++++.+.+. T Consensus 4 L~e~~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~ 83 (390) T protein:vir:40 4 LDKKDSETLNISTAFLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNEV 83 (390) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHH Confidence 00000000 001111100000 00000 0000000 0000000 1233333332 Q ss_pred hhccccccccchhhh------hhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILVP------VIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 50 t~t~~v~~~~P~l~~------l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) ...+..+.+ ..||+ ++..+-..-+-.+++-|.||++....|.. ... ... 
T Consensus 84 ~~~~~~~~g-g~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~----~~~-~~~------------------- 138 (390) T protein:vir:40 84 IAGNGFAGV-TALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIIS----VGD-VAT------------------- 138 (390) T ss_pred HhccCcccC-cccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEE----EcC-Ccc------------------- Confidence 222222211 22222 33444444456677888888875544331 100 000 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) +.|.. |. . T Consensus 139 -----------------------a~~~~----------------------------E~---------------~------ 146 (390) T protein:vir:40 139 -----------------------AWWGP----------------------------LC---------------A------ 146 (390) T ss_pred -----------------------eeeec----------------------------cc---------------c------ Confidence 00000 00 0 Q ss_pred cccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_018861. 204 WLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEID 283 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEIN 283 (465) ...+.....|.+..|++.|..+ ....|-||.+|-- .|.|++|.+.|+..|..-+| T Consensus 147 --------------~~~~~~~~~f~~i~l~~~k~~~-------~i~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~ 201 (390) T protein:vir:40 147 --------------EIKEVLDNGFDKIQTGMYKLSA-------YIPVCNAMLDLGP----SWLDQYVRTILGEAMALGLE 201 (390) T ss_pred --------------ccCccccccceeeEeeeeeEEE-------eehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHH Confidence 0000011246666666666554 3468889998853 47899999999999999999 Q ss_pred HHHHhh---------hhhee--eeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHh Q lcl|NC_018861. 
284 RTIIEK---------ANEVA--TVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDE 352 (465) Q Consensus 284 reii~~---------l~~~a--t~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~ 352 (465) +.+|.- |+..+ +.+...... .+....+....+...+...-..-..+-. +.+.|++-....+..|+. T Consensus 202 ~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~--~~~~t~~~~~~~~~~l~~~~~~~~~~~~-~~a~~i~n~~t~~~~l~~ 278 (390) T protein:vir:40 202 AGIVNGSGKDQPIGMMRDLNNVTAGEHPVKT--ATPLTDLTPATLATKVMLPLTDNGKKSV-SDAILVINPADYWSKIYA 278 (390) T ss_pred hhhhcccCCCccceeeecccccccccccccc--ccccchhhHHHHHHHHHHHhhcchhhhh-cCceEEEcchhHHHHHHH Confidence 999963 21111 111111100 0101111122222222222111111112 244454444455667765 Q ss_pred cCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcc--c Q lcl|NC_018861. 353 IGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSG--Q 430 (465) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~--q 430 (465) .-.+.. +.+.+. .+.+.-+++|+++++.|.+-++.|-- +.+ ++.-. ..+.+-++++.+ . T Consensus 279 ~~~~~d-~~G~~v----------~~~~~~g~pvv~~~~~p~~~i~~Gd~--s~~----~i~~~--~~~~v~~~~~~~f~~ 339 (390) T protein:vir:40 279 ATSYMT-PQGVWV----------TGILPVPLEIVQSVAVPVGKAVAGRA--KDY----FMGIG--SEQVIRTSTEYRLLD 339 (390) T ss_pred HhhccC-CCCccc----------cccCCCceeEEEcCCCCCCcEEEEee--ceE----EEEee--cceEEEecchhhhhc Confidence 444421 111111 12223478999999888766655421 001 11111 112222333322 2 Q ss_pred ceeeeeeeeeeeecCcccccc-------------cceEEEeeccceeC Q lcl|NC_018861. 431 PAMILNNRYDVVATPLHPEAF-------------IRTFAVNLNNYIIS 465 (465) Q Consensus 431 p~~~~~tRY~l~~nPf~~~~~-------------~~~f~~~~~~~~~~ 465 (465) ..++|..++=+-..|-.+++. 
.-.|+|+....-=- T Consensus 340 ~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 387 (390) T protein:vir:40 340 DETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNNATPSET 387 (390) T ss_pred CcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCcceeeCCCCCCC Confidence 334443333332233333321 11222222111110 No 105 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=66.05 E-value=0.27 Score=23.67 Aligned_cols=313 Identities=14% Similarity=0.074 Sum_probs=112.9 Q ss_pred CCccchhhhHHHhhhhhhccc-cccCh-hhhh---------heehc-------cc--cch--hHHHhhh----------- Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLY-PNLNE-SEKN---------IMRTV-------LE--NQG--NEVKMLM----------- 47 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~-~~~~~-~~~~---------~~~~l-------~~--n~~--~~~~~i~----------- 47 (465) .+++.+++|.. ++|.-+... +.|++ -++. +...+ .. .+. .+...++ T Consensus 27 ~~~~~lt~e~~-~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~ 105 (390) T protein:vir:62 27 FAGKEMTDEAR-EKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQRSADVDDDATLRAGNLGEARSFE 105 (390) T ss_pred hhcccccHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhcchHHHHHHhhhhhhhhHHHH Confidence 22222222211 111111000 00000 0000 00000 00 000 0000000 Q ss_pred ------hhhhccccccccchhh-hhhhhhh-hhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccc Q lcl|NC_018861. 48 ------ESTVTGDIAKFTPILV-PVIRRAL-PSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKT 119 (465) Q Consensus 48 ------est~t~~v~~~~P~l~-~l~~ra~-~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~ 119 (465) .++++++-...-|.+. -++.+.. ...+...++-|-||++...+-+-.. 
T Consensus 106 ~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~------------------------ 161 (390) T protein:vir:62 106 FAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVI------------------------ 161 (390) T ss_pred hhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE------------------------ Confidence 0000000000000000 0001100 0111122222222222111111100 Q ss_pred ccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCccc Q lcl|NC_018861. 120 ESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYT 199 (465) Q Consensus 120 a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~ 199 (465) ++ . T Consensus 162 ---------------------------------------------------------------~~---~----------- 164 (390) T protein:vir:62 162 ---------------------------------------------------------------TG---R----------- 164 (390) T ss_pred ---------------------------------------------------------------cC---C----------- Confidence 00 0 Q ss_pred ccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHH Q lcl|NC_018861. 200 NEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVA 279 (465) Q Consensus 200 ~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEIm 279 (465) ..+. -.+| |..+++-.-++++++..+|..+-....|-||.+|- .+|.+++|.+-|+..|. T Consensus 165 ~~a~--------------wv~E--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~ 224 (390) T protein:vir:62 165 SSAS--------------IVGE--TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIG 224 (390) T ss_pred ccee--------------eecc--cccccccccceeeeEeeeeeEEeehHHHHHHHhhh----hHHHHHHHHHHHHHHHH Confidence 0000 0011 22344444457788888888888889999999992 46889999999999999 Q ss_pred HHhhHHHHhh------hhheeee-eeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHh Q lcl|NC_018861. 
280 LEIDRTIIEK------ANEVATV-CTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDE 352 (465) Q Consensus 280 lEINreii~~------l~~~at~-~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~ 352 (465) .-+|..+|.- +...... ...+.......- .+..|...+. .+...-+ ..+ ..|+++.....|+. T Consensus 225 ~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~----~~~~l~~~~~----~l~~~~~-~~a-~~vmn~~~~~~L~~ 294 (390) T protein:vir:62 225 DAMGRHFITGTGQPRGILTDASPATATFLATDTDSK----VSDALIDLFH----EVPSAYR-ANA-KYVVNDLRAAQMRK 294 (390) T ss_pred HHHHhhhhccCCccccccccccccccceeccccccc----chHHHHHHHH----hhhhhhh-cCC-EEEEchHHHHHHHH Confidence 9999999863 2111111 111211111111 1233333222 2222122 222 46889998888876 Q ss_pred cCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccce Q lcl|NC_018861. 353 IGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPA 432 (465) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~ 432 (465) .- ...++++..++-. ...-++| .|++|+++++.|.+-+++|-- +. .+...--.....+..|+-.-... T Consensus 295 lk----d~~g~~l~~~~~~-~g~~~~l-~G~Pv~~~~~~p~~~i~~gd~--s~----~~i~~~~~~~v~~~~~~~~~~~~ 362 (390) T protein:vir:62 295 LK----DANGQYLWQSGLT-VGAPSLF-NGKVVETDDGMPADKILFADL--SK----YRVRFAGSLRVDRSVDAKFSTDQ 362 (390) T ss_pred hh----ccCCCeeecCCcC-CCcccee-cccceEEecCCCCccEEEeec--cc----eeEEeecceEEEeeccccccCCc Confidence 41 0111121111100 0111355 457999998887665544410 00 01110111122223343333333 Q ss_pred eeee--eeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 433 MILN--NRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 433 ~~~~--tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) ++|. .|++. .|.++.+ |+ +..+. 
-+ T Consensus 363 ~~~~~~~r~d~--~~~~~~A----~~~l~~~~--~a 390 (390) T protein:vir:62 363 IVYRFLQRADG--LLVDARG----AKVLTVTP--GA 390 (390) T ss_pred EEEEEEEEeCc--Eeechhh----eEEEEeec--CC Confidence 4433 34443 3333333 22 11111 11 No 106 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=64.14 E-value=0.31 Score=23.41 Aligned_cols=255 Identities=12% Similarity=0.091 Sum_probs=125.0 Q ss_pred eeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhcc-CC----chhhcceEEEEEE Q lcl|NC_018861. 164 LLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKL-GK----DMKEMGISVQRVL 238 (465) Q Consensus 164 m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~l-g~----~f~EM~FsIeK~t 238 (465) |...-. .|.-+-++-... . .... ...++.+ ++ .=.|.-++||+. T Consensus 1 ~vr~i~--~g~s~~~~~iG~-----~-----------------~~~~------~~~G~~l~~~~~~~~~~e~~itID~~- 49 (324) T protein:vir:99 1 MTRTIT--SGKSAQFPVMGR-----T-----------------KARY------LKQGQSLDDGREDIKHTEKVITIDGL- 49 (324) T ss_pred Ceeeee--cCceEEEeeeee-----e-----------------Eecc------ccCCCCcCCCcCCcCcccEEEEecch- Confidence 111111 111111111000 0 0000 0011111 11 124556667753 Q ss_pred EEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee----eee--ee------eeccCC Q lcl|NC_018861. 239 AEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVAT----VCT--DF------DVNSAD 306 (465) Q Consensus 239 VtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at----~~~--~~------~~~~~~ 306 (465) |-+..-|+-.-|.++ | .|...|...-...+++.++++-|++.+...+- .+. -+ -+.... 
T Consensus 50 -------l~~~~~VdDiD~~qa-~-~Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~ 120 (324) T protein:vir:99 50 -------LTTDVLIYDIEDAMN-H-YDVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITG 120 (324) T ss_pred -------hhhhhhhhhHHHHhc-C-ccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccc Confidence 334445555555555 4 89999999999999999999999888743211 000 00 011111 Q ss_pred cccH-HHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEE Q lcl|NC_018861. 307 GRWF-IEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDV 385 (465) Q Consensus 307 ~~~~-~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~v 385 (465) +.-. ......|+..|-..+...-++=---.++|+|++|+.-.+|.....+..... .+.++.-...||.+ .|++| T Consensus 121 ~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~----~~~~~~~~G~V~~i-~Gf~V 195 (324) T protein:vir:99 121 KKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPDTYSAILAALMPNAANY----AALIDPETGNIRNV-MGFEV 195 (324) T ss_pred cccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHhhccccccccc----ccccceecceEEEE-eceEE Confidence 1100 112333444444444444443333367999999999999988766654322 22334445688888 89999 Q ss_pred EEeCCCCcceEE--------------------E--EEecCCCccceeEEeccccccee-------eeeCCCcccceeeee Q lcl|NC_018861. 386 IVDNFAEFDYCT--------------------V--AYKGASNFDAGIFFAPYNITLQQ-------NLTDPVSGQPAMILN 436 (465) Q Consensus 386 y~d~~~~~dy~~--------------------v--g~kg~~~~d~glfy~PY~~~~~~-------~~~dp~s~qp~~~~~ 436 (465) |.-++-|.-..+ . =|+++..-..||||.|=.-+..+ ..-|+..|- -.+. T Consensus 196 ~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~--d~i~ 273 (324) T protein:vir:99 196 VETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQA--DQII 273 (324) T ss_pred EecCCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecceecceechhhHH--Hhhh Confidence 988887642211 1 14455555678888776433322 122444333 2346 Q ss_pred eeeeeeecCccccc-ccceEEEeec-ccee---C Q lcl|NC_018861. 
437 NRYDVVATPLHPEA-FIRTFAVNLN-NYII---S 465 (465) Q Consensus 437 tRY~l~~nPf~~~~-~~~~f~~~~~-~~~~---~ 465 (465) .+|.+-+=++-|+. -+-+|-...+ |.+- + T Consensus 274 ~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~~ 307 (324) T protein:vir:99 274 AKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVIT 307 (324) T ss_pred hhhhhcCcccccceEEEEEEccCccccccchhhh Confidence 66766666665542 2223332221 1110 0 No 107 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=63.61 E-value=0.31 Score=23.34 Aligned_cols=298 Identities=12% Similarity=0.077 Sum_probs=107.4 Q ss_pred CCc--cc---hhh-hHHHhhhhhhc---------------ccc-ccChhhhhheeh----ccccchh--H--HHhh---- Q lcl|NC_018861. 1 MAD--KY---LLD-ESTKEKFITSN---------------LYP-NLNESEKNIMRT----VLENQGN--E--VKML---- 46 (465) Q Consensus 1 ~~~--~~---~~~-e~~~e~~~~~~---------------~~~-~~~~~~~~~~~~----l~~n~~~--~--~~~i---- 46 (465) |.+ +. +-+ |++.++..... ... ...+.+..-+.. +...++. . ...+ T Consensus 45 i~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 124 (435) T protein:vir:14 45 FSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGF 124 (435) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhh Confidence 111 00 000 11111111000 000 000111110011 1111100 0 0000 Q ss_pred hhh---hhccccccccchhhh------hhhhhhhhhhhhhh-eeeeccCCCcceEEEEEEEecCCCCcccccccccccCc Q lcl|NC_018861. 47 MES---TVTGDIAKFTPILVP------VIRRALPSLIGTEI-AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLK 116 (465) Q Consensus 47 ~es---t~t~~v~~~~P~l~~------l~~ra~~~lI~~DI-wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~ 116 (465) .|. ..+......+..||+ ++.++.++.+..++ +=+.||+... +-+.. ... 
T Consensus 125 ~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~-~~~p~---~~~---------------- 184 (435) T protein:vir:14 125 GEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPR---LKG---------------- 184 (435) T ss_pred hhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCc-eEEEE---EeC---------------- Confidence 010 001111111222322 33333344333333 2122222111 00000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCC Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEA 196 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~ 196 (465) .+...+. T Consensus 185 -------------------------------------------------------~~~a~~v------------------ 191 (435) T protein:vir:14 185 -------------------------------------------------------GAIVGYI------------------ 191 (435) T ss_pred -------------------------------------------------------Ccceeee------------------ Confidence 0000000 Q ss_pred cccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHH Q lcl|NC_018861. 197 VYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSA 276 (465) Q Consensus 197 ~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLst 276 (465) +| |...++-.-++++++..++..+-....|-||.+| +....+.|+.|.+-|+. T Consensus 192 -----------------------~E--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~~~l~~~i~~~l~~ 244 (435) T protein:vir:14 192 -----------------------GA--DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKY--AGVNPNVDQIVVGDLTA 244 (435) T ss_pred -----------------------cc--CccccccccceeEEEeeeEEEEEeehhhHHHHHh--hccCHHHHHHHHHHHHH Confidence 01 1223333445777788888887788899999999 32234588999999999 Q ss_pred HHHHHhhHHHHhhh---------hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHH-hcccccccEEEecHHH Q lcl|NC_018861. 
277 EVALEIDRTIIEKA---------NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGR-QTRKGGGNKLIVSPKV 346 (465) Q Consensus 277 EImlEINreii~~l---------~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~-~T~~~~~~~~~~s~~v 346 (465) .|...+|+.||..- ...+.+....... .+. .+......|.++-..+.. ...+ .....|+++.. T Consensus 245 ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~--~~~----~~~~~~~~~~~l~~~~~~~~~~~-~~~~~v~n~~~ 317 (435) T protein:vir:14 245 AIGAREDKAFIRDDGTANTPKGLRFWALPSNVITAS--DAS----TLQKIETDLGKVILALENADANL-TQPGWIMAPRT 317 (435) T ss_pred HHHHHHHHHhhccCCCCccccceeecccccceeccc--ccc----chhhHHHHHHHHHHHhhhccccc-cCCEEEEcHHH Confidence 99999999888521 1111111111111 111 111111122222222221 1222 33457899999 Q ss_pred HHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcc----------------eEEEEEecCCCcccee Q lcl|NC_018861. 347 ATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD----------------YCTVAYKGASNFDAGI 410 (465) Q Consensus 347 a~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d----------------y~~vg~kg~~~~d~gl 410 (465) ...|...-- ..+.... +.. --|+|. |++|+++++.|.+ ++++|.++.-.. T Consensus 318 ~~~L~~lkd----~~G~~l~-~~~----~~g~l~-G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~---- 383 (435) T protein:vir:14 318 FRFLEGLRD----GNGNKVY-PEL----ANGMLK-GYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEI---- 383 (435) T ss_pred HHHHHHhhc----cCCceec-cCC----CCCeee-cceeEeeccccccccCCCccceEEEeecccEEEEEecccEE---- Confidence 999976521 1122211 111 125664 5788888765432 122333332221 Q ss_pred EEeccccc-------------------ce----eeeeCCCcccceeeeeeeeee Q lcl|NC_018861. 411 FFAPYNIT-------------------LQ----QNLTDPVSGQPAMILNNRYDV 441 (465) Q Consensus 411 fy~PY~~~-------------------~~----~~~~dp~s~qp~~~~~tRY~l 441 (465) -..||.-. .. 
..+.||+.|...-|+- ||- T Consensus 384 ~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~--~~~ 435 (435) T protein:vir:14 384 DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA--WGA 435 (435) T ss_pred EEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC--CCC Confidence 12222100 00 0123444444433321 222 No 108 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=62.90 E-value=0.33 Score=23.25 Aligned_cols=270 Identities=15% Similarity=0.037 Sum_probs=111.7 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccc--cccccccCCccCCCcccccCccccccccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMD--KAATFATKKATVEAVYTNEALWLKVLKNYTG 213 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~--t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~ 213 (465) ++... +.-...+ .|+---..+...+..+ ..+.+=+. ....+. .+...... .++.. T Consensus 1 Ma~~~-T~l~d~i---~Pev~~~~v~~~~~~~-------~~~~~~~~~~~~l~g~------~G~ti~iP-----~~~~i- 57 (276) T protein:vir:10 1 MAQGT-TTKSTQI---VPEVLAPMMQAELDKK-------LRFAQFADIDSTLVGQ------PGDTLTFP-----AFVYS- 57 (276) T ss_pred CCcce-eehhhhh---chHHHHHHHHHHHHhh-------hhhcccceecccccCC------CCCEEEee-----eecCC- Confidence 11100 0001000 0100000111111100 01101000 000000 00000000 00111 Q ss_pred cccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh-CCCHHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_018861. 214 PYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH-GINAEKELADILSAEVALEIDRTIIEKANE 292 (465) Q Consensus 214 ~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GlDAe~EL~niLstEImlEINreii~~l~~ 292 (465) +-++...| |.++..=..+..+.+++.|-|.-.=++| |+-+.. +.|.-.|..+-++.-|...++.+++..+.. 
T Consensus 58 gda~~~~e--g~~i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~ 130 (276) T protein:vir:10 58 GDATVVPE--GQKIPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRG 130 (276) T ss_pred CccccccC--CCccCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 11112222 2333222233455555555554333333 444433 589999999999999999999999988843 Q ss_pred eeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccccccc Q lcl|NC_018861. 293 VATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGI 372 (465) Q Consensus 293 ~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~ 372 (465) ... .+. .+....+.+-....++..+ -...++++++|++.+.|.......+..+... +.+-.. T Consensus 131 ~~~-----~~~--~~~~t~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~--g~~~~~ 192 (276) T protein:vir:10 131 TKL-----TVS--ADIGTLAGLEAAIDTFDDE---------DLEPMVLFINPKDAGKLRSSASDNFTRATEL--GDNIIV 192 (276) T ss_pred ccc-----ccc--ccccCHHHHHHHHHHhccc---------cCcccEEEEcHHHHHHHHHhccccccccccc--ccccee Confidence 211 111 1111122222222222221 1367899999999999976433332221110 001111 Q ss_pred ceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccc Q lcl|NC_018861. 373 KPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAF 451 (465) Q Consensus 373 ~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~ 451 (465) ...+|++ .|++|++|...|..-..+--+|+-.+ +.. -+...-.--|++.++-.|--.-+||+ ..||= ..= T Consensus 193 ~G~ig~~-~G~~Vi~s~~~p~~t~~l~~~gAi~~----~~~--~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~--~vv 263 (276) T protein:vir:10 193 KGAFGEA-LGAVIVRSKKLDEGEAILAKRGAVKL----ITK--RDFFLETDRDPSTKTTALYSDKHYVAYLYDES--KAV 263 (276) T ss_pred cccccee-cceeEEEcCCCCcceEEEEeccceee----eec--CCceeecccchhhcccEEEEeeEEEEEEEcCc--ceE Confidence 3467887 57899999998754332211222211 111 01111112278888888888888887 55551 000 Q ss_pred cceEE--Eeeccc Q lcl|NC_018861. 
452 IRTFA--VNLNNY 462 (465) Q Consensus 452 ~~~f~--~~~~~~ 462 (465) ..+|+ ..-+|- T Consensus 264 ~~t~~~~~~~~~~ 276 (276) T protein:vir:10 264 KVTKGAGTTDSGA 276 (276) T ss_pred EEecCCcCCcCCC Confidence 01111 111111 No 109 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=62.41 E-value=0.34 Score=23.18 Aligned_cols=308 Identities=12% Similarity=0.051 Sum_probs=127.0 Q ss_pred CCccchhhhHH--Hhhhhhhccccc-----------------cChhhhhheehccccchhHHHhhhhhh-hccccccccc Q lcl|NC_018861. 1 MADKYLLDEST--KEKFITSNLYPN-----------------LNESEKNIMRTVLENQGNEVKMLMEST-VTGDIAKFTP 60 (465) Q Consensus 1 ~~~~~~~~e~~--~e~~~~~~~~~~-----------------~~~~~~~~~~~l~~n~~~~~~~i~est-~t~~v~~~~P 60 (465) +.+..-.++++ .|++......+. ..+.++.+..-|-.+......-....+ +.|.+. =| T Consensus 47 ~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~--vP 124 (394) T protein:vir:10 47 KARRDAINDQIKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVL--IP 124 (394) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcccccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCcee--cc Confidence 11110111110 011111000000 001222222222222211111011111 111111 13 Q ss_pred hhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc Q lcl|NC_018861. 61 ILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF 138 (465) Q Consensus 61 ~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~ 138 (465) .-+ .++++..+..+-.+++.+.||+++++-+--.+ . .... 
T Consensus 125 ~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~---~~~~--------------------------------- 166 (394) T protein:vir:10 125 EEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILK--R---ATDR--------------------------------- 166 (394) T ss_pred HHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEe--c---CCCc--------------------------------- Confidence 222 46677777778889999999999876555433 0 0000 Q ss_pred cccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccch Q lcl|NC_018861. 139 KTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATA 218 (465) Q Consensus 139 ~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta 218 (465) ..+.. T Consensus 167 --------~~~~~------------------------------------------------------------------- 171 (394) T protein:vir:10 167 --------FSSVA------------------------------------------------------------------- 171 (394) T ss_pred --------ccccc------------------------------------------------------------------- Confidence 00000 Q ss_pred hhhccCCchhhc-ceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeee Q lcl|NC_018861. 219 AGEKLGKDMKEM-GISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVC 297 (465) Q Consensus 219 ~~E~lg~~f~EM-~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~ 297 (465) | +...++. ..++++++...|.-+-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||..... + T Consensus 172 --E--~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g~----~ 239 (394) T protein:vir:10 172 --E--LAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADS----AVDLTSLVGQSINEKSVNTYNAMIAPVLQS----F 239 (394) T ss_pred --c--cccccccccccceeEEeeeeeeEeeehhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhcccc----c Confidence 0 0111111 1235555555665566678999999984 357889999999999999999999876622 1 Q ss_pred eeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccc Q lcl|NC_018861. 
298 TDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIK 373 (465) Q Consensus 298 ~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~ 373 (465) ....+ .+..++ +....++ ...... .+ . ..+|+++.....|+.. |-..+.|.... .... T Consensus 240 ~~~~~--~~~~~~-d~l~~~~---~~~~~~-----~~-~-a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~-----~~~~ 301 (394) T protein:vir:10 240 TAKAT--TTDTLV-DSLKHIL---NVDLDP-----AY-S-RALVVTQSLFNTLDTLKDKNGRYLLHDASDS-----ITDG 301 (394) T ss_pred ccccc--cccccH-HHHHHHH---Hhhhhh-----hc-c-CEEEecHHHHHHHHHhhccCCCeeeeccccc-----cccC Confidence 11111 111111 2222222 111111 11 1 3578999999988865 22222222111 0111 Q ss_pred eEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc----c----ccceeeeeCCCcccceeeeeeeeee-eec Q lcl|NC_018861. 374 PNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY----N----ITLQQNLTDPVSGQPAMILNNRYDV-VAT 444 (465) Q Consensus 374 ~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY----~----~~~~~~~~dp~s~qp~~~~~tRY~l-~~n 444 (465) ...++| -|++|++...... +....+.-++|+.+ + ........|...|.-.+-...|++. +.| T Consensus 302 ~~~~~L-~G~PV~~~~~~~~--------~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~~~~ 372 (394) T protein:vir:10 302 TAKGTV-LGVPVYVVGDALL--------GSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYGRYLGAAFRFGVKQAD 372 (394) T ss_pred Cccccc-ccceeEEeccccc--------CCCCCceEEEEeeccccEEEEeecceEEEEecccccceeEEEEEEeccEEec Confidence 222455 4567765332100 00011111233221 1 1112223455566666666778876 333 Q ss_pred Ccccccc-cceEEEeeccceeC Q lcl|NC_018861. 445 PLHPEAF-IRTFAVNLNNYIIS 465 (465) Q Consensus 445 Pf~~~~~-~~~f~~~~~~~~~~ 465 (465) | .+. ..++.--..|..-. 
T Consensus 373 ~---~ai~~~~~~~~~~~~~~~ 391 (394) T protein:vir:10 373 S---NAGYFVTNTDAASGSTSG 391 (394) T ss_pred c---ccEEEEEeecccCCCCCC Confidence 3 221 11111111111111 No 110 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=60.65 E-value=0.37 Score=22.96 Aligned_cols=293 Identities=10% Similarity=0.010 Sum_probs=130.9 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhhe Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIA 79 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIw 79 (465) |-+++..+..+++-+......+.++ +.... +++++.. ..-+.+. -++..+..+.+..+++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~-----------------a~~~~-~~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~ 61 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFN-----------------PDNVM-MHEKKDG-TLLNDFTTPILQEVMENSKIMQLG 61 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcc-----------------ccccc-ccCCCcc-eechhHHHHHHHHHHhhchhhhhc Confidence 7777777766554322222222211 00100 0111111 1112233 4556667777888889 Q ss_pred eeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccch Q lcl|NC_018861. 80 GVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDN 159 (465) Q Consensus 80 GVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTg 159 (465) .+-||++++--|.- ... .. 
.+ .+ T Consensus 62 ~~~~~~~~~~~ip~----~~~--~~-----------------------------------~a------~~---------- 84 (324) T protein:vir:93 62 KYEPMEGTEKKFTF----WAD--KP-----------------------------------GA------YW---------- 84 (324) T ss_pred ceeeccCCceEEEE----Eec--Cc-----------------------------------ce------ee---------- Confidence 99999887632221 100 00 00 00 Q ss_pred hhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEE Q lcl|NC_018861. 160 IVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLA 239 (465) Q Consensus 160 Lifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tV 239 (465) .+| |..+++..-+++++++ T Consensus 85 -----------------------------------------------------------v~E--g~~~~~~~~~f~~i~~ 103 (324) T protein:vir:93 85 -----------------------------------------------------------VGE--GQKIETSKATWVNATM 103 (324) T ss_pred -----------------------------------------------------------ecC--CccccccccceeEEEE Confidence 001 1223333345677778 Q ss_pred EeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeee----e-ccCCcccHHHHH Q lcl|NC_018861. 240 EAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFD----V-NSADGRWFIEKA 314 (465) Q Consensus 240 tAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~----~-~~~~~~~~~e~~ 314 (465) +.|..+-....|-||.+|-. .|.+++|.+.|+..|...+++.+|..-........-+. . ....+.-..+.. T Consensus 104 ~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) T protein:vir:93 104 RAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) T ss_pred EeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHH Confidence 88877777889999999953 57889999999999999999999864311100000000 0 001111111223 Q ss_pred HHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC--C Q lcl|NC_018861. 
315 RGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA--E 392 (465) Q Consensus 315 ~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~--~ 392 (465) ..++.. +.. .+.....++|++.....|+...- +.++.... .. .-++| .|++|++.+.. . T Consensus 180 ~~~~~~-------l~~--~~~~~~~~v~n~~~~~~L~~l~d----~~G~~~~~-~~----~~~~l-~G~PVv~~~~~~~~ 240 (324) T protein:vir:93 180 IDLEAL-------LED--DELEANAFISKTQNRSLLRKIVD----PETKERIY-DR----NSDSL-DGLPVVNLKSSNLK 240 (324) T ss_pred HHHHHh-------hhh--ccCCCCEEEEcHHHHHHHHHhhC----CCCCeeec-CC----CCCcc-cceeeEeecCCCCC Confidence 333322 222 23456679999999999986411 11221111 11 12344 35688876653 2 Q ss_pred cceEEEEEecCCCccceeEEecccccceeeeeCC--------C-----cc---cceeeeeeeeeeeecCcccccccceEE Q lcl|NC_018861. 393 FDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDP--------V-----SG---QPAMILNNRYDVVATPLHPEAFIRTFA 456 (465) Q Consensus 393 ~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp--------~-----s~---qp~~~~~tRY~l~~nPf~~~~~~~~f~ 456 (465) ...+++|-. +-+++.........+.-+. . -| |=.+=...|||.. +..+. .|+ T Consensus 241 ~~~i~~gdf------s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~--v~~~~----a~~ 308 (324) T protein:vir:93 241 RGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH--IADDK----AFA 308 (324) T ss_pred cceEEEEec------ceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE--Eeccc----ceE Confidence 222333311 0011111111111111110 0 01 1223334566653 12222 233 Q ss_pred EeeccceeC Q lcl|NC_018861. 
457 VNLNNYIIS 465 (465) Q Consensus 457 ~~~~~~~~~ 465 (465) +-..=+.-+ T Consensus 309 ~l~~a~~~~ 317 (324) T protein:vir:93 309 KLVPADKRT 317 (324) T ss_pred EEecccccC Confidence 211111111 No 111 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=59.97 E-value=0.38 Score=22.88 Aligned_cols=305 Identities=11% Similarity=0.030 Sum_probs=121.0 Q ss_pred CCccch-hhhHHHhhhhhhc------------------------cccccCh----------hhhhheehccccchhHHHh Q lcl|NC_018861. 1 MADKYL-LDESTKEKFITSN------------------------LYPNLNE----------SEKNIMRTVLENQGNEVKM 45 (465) Q Consensus 1 ~~~~~~-~~e~~~e~~~~~~------------------------~~~~~~~----------~~~~~~~~l~~n~~~~~~~ 45 (465) ..++.. .++.+.|...-+. +....+. ..+.....++ ...... T Consensus 28 ~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 104 (395) T protein:vir:38 28 AIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVKDGKPDAQAMKNQFV---KDFKNL 104 (395) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhHHHHHHHHHHH---HHHHHH Confidence 000000 0001111110000 0000000 0000000000 000111 Q ss_pred hhhhhh-ccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccc Q lcl|NC_018861. 46 LMESTV-TGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESA 122 (465) Q Consensus 46 i~est~-t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~ 122 (465) .+++++ +++-...=|.-+ .+++......+..+++.++||++++|-+--.+ -.+.+ . 
T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~-~------------------ 163 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK--LADIT-P------------------ 163 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe--eccCC-c------------------ Confidence 222222 222111124333 35666777778888999999999997642221 00000 0 Q ss_pred cccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccC Q lcl|NC_018861. 123 NKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEA 202 (465) Q Consensus 123 ~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a 202 (465) . +.|.. T Consensus 164 -----------------~------a~~v~--------------------------------------------------- 169 (395) T protein:vir:38 164 -----------------L------KDLDD--------------------------------------------------- 169 (395) T ss_pred -----------------c------ccccc--------------------------------------------------- Confidence 0 00000 Q ss_pred ccccccccccccccchhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHH Q lcl|NC_018861. 203 LWLKVLKNYTGPYATAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALE 281 (465) Q Consensus 203 ~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlE 281 (465) .|..++|.. -++++++..++.-+-...+|-||.+|- +.|.++.|.+-|+..|..- T Consensus 170 --------------------E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~ 225 (395) T protein:vir:38 170 --------------------ESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDT----VDNIIQWLVNWAAKKDVVT 225 (395) T ss_pred --------------------cccccccccccceeeEEeeeeeeEeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHH Confidence 001111111 134555666666666667999999993 4577899999999999999 Q ss_pred hhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCC Q lcl|NC_018861. 
282 IDRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPA 361 (465) Q Consensus 282 INreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~ 361 (465) ||+.||.-.-.... ..+-...+....++.. .+...-+ ..-.++|++.....|...- . .. T Consensus 226 ~~~~il~g~g~~~~---------~~~~~~~~~i~~~~~~------~l~~~~~--~~a~~v~n~~~~~~L~~lk---d-~~ 284 (395) T protein:vir:38 226 RNAKILEVMGKAPK---------KPTISQFDNIKDLENN------TLDPAIE--STSSFITNQSGYNILSKVK---D-AD 284 (395) T ss_pred HHHHHhhccccccc---------ccccccHHHHHHHHHH------hhhhhhc--CCCEEEEcHHHHHHHHHhh---c-cC Confidence 99998875521111 1111111222222221 1111111 3346789999999997641 0 11 Q ss_pred cccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc---------cceeeeeCCC----c Q lcl|NC_018861. 362 GSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI---------TLQQNLTDPV----S 428 (465) Q Consensus 362 ~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~---------~~~~~~~dp~----s 428 (465) +.....++ -.....++| .|++|++....+.. ....+..++|+.+.- +.....-++. . T Consensus 285 G~~l~~~~-~~~~~~~~l-~G~pV~~~~~~~~~--------~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~ 354 (395) T protein:vir:38 285 GRYLMQPD-VTSPDKYLI-DGKPVIRIADKWLP--------DVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEH 354 (395) T ss_pred CceeeccC-cCCCCccee-ccceeEEecccccC--------cCCCcceEEEEeccccEEEEEecceEEEEeccccchhhc Confidence 11111110 001122355 36677765432111 000111233332211 1111111111 2 Q ss_pred ccceeeeeeeeee-eecCcccccccceEE-Eeeccce-------eC Q lcl|NC_018861. 429 GQPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNYI-------IS 465 (465) Q Consensus 429 ~qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~-------~~ 465 (465) -+=.+-+..||+. 
..+| ..|+ ++++..- .+ T Consensus 355 ~~~~~r~~~r~d~~~~~~-------~a~~~~~~~~~~~~~~~~~~~ 393 (395) T protein:vir:38 355 DTTKLRFIDRFDVQLIDD-------GAFAAASFKTVANQAQGTAGT 393 (395) T ss_pred CceEEEEEEeeccEEecc-------cceEEEEeecccCCCCCccCC Confidence 2345556677776 3333 2333 2221110 00 No 112 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=58.98 E-value=0.4 Score=22.76 Aligned_cols=292 Identities=11% Similarity=0.038 Sum_probs=131.9 Q ss_pred cCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccchhhhh Q lcl|NC_018861. 84 LKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNV 163 (465) Q Consensus 84 MTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifa 163 (465) ||.|||+|=+.. +..-+ ..+ .-.+++.-|+-+.-. T Consensus 1 ~~~~~~i~s~~~----~~~it-----v~~------------------------------------ll~~P~~I~~~i~e~ 35 (318) T protein:vir:10 1 MTAPTGIVSVSD----GPAIT-----VRE------------------------------------LVGNPLWIPTALKKM 35 (318) T ss_pred CCCCCcceeeec----CCcee-----hHH------------------------------------hhCCchhHHHHHHHH Confidence 999998875543 21100 000 001122333333333 Q ss_pred eeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEE-EEEEEEee Q lcl|NC_018861. 164 LLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISV-QRVLAEAK 242 (465) Q Consensus 164 m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsI-eK~tVtAK 242 (465) +...+- .+.+|.++....++.-.... ....+..+-.... +.|.+|+.-+-.. 
++....+| T Consensus 36 ~~~~~i----ad~lf~~~~a~~~~~v~f~~-------------~~p~~~~~d~e~V--aEggEiP~~~~~~G~~~ia~~~ 96 (318) T protein:vir:10 36 MVNQFI----SESLFRNGGANPNGVVAYNE-------------GNPSFLEDDVADV--AEFGEIPVSAGARGLPRTAFAV 96 (318) T ss_pred Hhccch----hhhhhhcccccccceeEEEe-------------cccccccCcHhhc--cCcccccccCCCCCchhhhhhe Confidence 322211 12222221100000000000 0000000000001 1122333333333 22222345 Q ss_pred cceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeccCCcccH------------ Q lcl|NC_018861. 243 TRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDVNSADGRWF------------ 310 (465) Q Consensus 243 SRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~~~~~~~~~------------ 310 (465) .+.||-++|=|.. .-+.+|+-.....-|++-|...+|+.+++.|...++..-. +++.|. T Consensus 97 K~G~~~~vS~Em~----~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~-----~s~~w~~~~~~~~d~~~A 167 (318) T protein:vir:10 97 KKALGVRVSKEMI----DENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLA-----VPTAWDNGGKVRTDIAIA 167 (318) T ss_pred hhccceeccHHHH----hhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----CCcCCCCcccccccchhh Confidence 7899999998864 3367899999999999999999999999998765543211 222232 Q ss_pred HHHHHHHH-HHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCc-ccccCCcccccc--cccccceEEEEecCceEEE Q lcl|NC_018861. 311 IEKARGLS-MRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGS-FVLSPAGSKIDA--INSGIKPNVGKFDNRYDVI 386 (465) Q Consensus 311 ~e~~~~L~-~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~-~~~~~~~~~~~~--~~~~~~~~~G~l~~~~~vy 386 (465) +|..+.-. ..+.+...+-.++=.| ..|.||.+|...+.|....- .++.+..++..+ ..-+ -.|.|.+. |++|. 
T Consensus 168 ~e~v~~a~~~~~~a~~~~~~~~~GY-~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~t-g~~~g~~l-Gl~vi 244 (318) T protein:vir:10 168 IEQISTAAPTAYPAGVGSSDEYFGF-IPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWT-GNFPGSVM-GLNVI 244 (318) T ss_pred hhhhhhhhhhhhhhhhhhhhhccCc-cceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccc-ccccceee-ceEEe Confidence 11111100 1111111111134454 78999999999999977744 333333322111 1101 12345443 49999 Q ss_pred EeCCCCcceEEEEEecCCCccceeEEecccccceeee----eCCCcccceeeeeeeee-----eeecCcccccccceEEE Q lcl|NC_018861. 387 VDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNL----TDPVSGQPAMILNNRYD-----VVATPLHPEAFIRTFAV 457 (465) Q Consensus 387 ~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~----~dp~s~qp~~~~~tRY~-----l~~nPf~~~~~~~~f~~ 457 (465) .+++.|.|-.+|==+|. -| ||+.=.|++.... -|| ..+|-..-..|+= -+..|+. ++ T Consensus 245 ~s~~~p~~~alvlq~g~----vG-~~~d~~pl~~t~~~~egg~~-~g~~~~s~~~~~~~~~~~~V~~PkA--------~~ 310 (318) T protein:vir:10 245 RSRTFPIDRVLIMERGT----VG-FYSDTRPLQFTALYPEGNGP-NGGPTESYRADASHKRALAVDQPKA--------AL 310 (318) T ss_pred ecCccCCCeeEEEecCC----cc-eeeccccceeeecccCCCCC-CCCcchhhheehheeeeeeeeCcce--------eE Confidence 99999988865533321 11 5655555543321 133 2334333332221 1233331 13 Q ss_pred eeccceeC Q lcl|NC_018861. 458 NLNNYIIS 465 (465) Q Consensus 458 ~~~~~~~~ 465 (465) =|||-|-- T Consensus 311 ~itgi~~~ 318 (318) T protein:vir:10 311 WLTGIVTP 318 (318) T ss_pred EEeeccCC Confidence 34442222 No 113 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=56.73 E-value=0.45 Score=22.49 Aligned_cols=196 Identities=16% Similarity=0.236 Sum_probs=103.3 Q ss_pred EEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeee----e---eeeccCC Q lcl|NC_018861. 
234 VQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCT----D---FDVNSAD 306 (465) Q Consensus 234 IeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~----~---~~~~~~~ 306 (465) ||=. |=|..=++-.-+-++ | .|...|...=+..+++.++++-|++.+...+.-.. . .+++... T Consensus 1 iD~l--------L~a~~~VdDiD~aqa-~-~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a 70 (221) T protein:vir:17 1 MDDL--------LVASQFVYDLDEILA-Q-WNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGA 70 (221) T ss_pred CCcc--------hhHHHHHHhHHHHHh-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccc Confidence 2211 223333333333344 3 88889999999999999999999988854332111 1 1111111 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHH-HHHHHHhc-CcccccCCcccccccccccceEEEEecCceE Q lcl|NC_018861. 307 GRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPK-VATILDEI-GSFVLSPAGSKIDAINSGIKPNVGKFDNRYD 384 (465) Q Consensus 307 ~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~-va~~L~~~-~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~ 384 (465) +. ......|+..|-+.+...-++=---.+.|+|++|+ ...+|+.. +-+.-...++.....+. -..+|.+. |++ T Consensus 71 ~~--t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~--g~~i~~v~-G~~ 145 (221) T protein:vir:17 71 GN--TNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNT--GKGLYVNA-GIR 145 (221) T ss_pred cc--cCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccc--cceeeeec-CcE Confidence 10 01233444444445555544443347889999995 77777643 33321111111111111 11466664 899 Q ss_pred EEEeCCCCc----ceEEE------------EEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecCccc Q lcl|NC_018861. 385 VIVDNFAEF----DYCTV------------AYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHP 448 (465) Q Consensus 385 vy~d~~~~~----dy~~v------------g~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~ 448 (465) ||.-++.|. ++... 
.|+|+-.-..||||.|---++-+ .+.|.|--|-+.-| |.+- .|= T Consensus 146 V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvk-l~~~~~~~~~~~~~--~~~~-~~~-- 219 (221) T protein:vir:17 146 IYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVE-VLLPPSRPPLVISM--FSIR-RPD-- 219 (221) T ss_pred EEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchheeeee-eecCCCCCceeeee--eecc-CCC-- Confidence 999999875 33211 33444445579999987666655 88888877764322 1110 111 Q ss_pred ccccc Q lcl|NC_018861. 449 EAFIR 453 (465) Q Consensus 449 ~~~~~ 453 (465) -| T Consensus 220 ---~~ 221 (221) T protein:vir:17 220 ---RR 221 (221) T ss_pred ---CC Confidence 11 No 114 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=56.55 E-value=0.45 Score=22.46 Aligned_cols=293 Identities=11% Similarity=-0.005 Sum_probs=113.9 Q ss_pred eehccccchhHHHhhhhh--hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccc Q lcl|NC_018861. 32 MRTVLENQGNEVKMLMES--TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVS 107 (465) Q Consensus 32 ~~~l~~n~~~~~~~i~es--t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~ 107 (465) ||.|-|-.......-.+. +..++ ... |.-+ -++..+.++.+..+++-+.||++-.--|.-.. +.+ T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~-~li-P~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~-------~~~-- 69 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPS-DLL-PKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTV-------KRP-- 69 (333) T ss_pred CchhHHhhhhcccccccCceecCCc-ccc-chhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe-------CCc-- Confidence 333322211100000000 00000 011 2222 35566667777788888888876432221111 000 Q ss_pred cccccccCccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCcccccccccccccccc Q lcl|NC_018861. 
108 PTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATF 187 (465) Q Consensus 108 ~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~ 187 (465) .+ .|... |... T Consensus 70 --------------------------------~a------~~v~e--------------------g~~~----------- 80 (333) T protein:vir:78 70 --------------------------------EV------GQVGV--------------------GTSN----------- 80 (333) T ss_pred --------------------------------ee------EeecC--------------------cccc----------- Confidence 00 00000 0000 Q ss_pred ccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHH Q lcl|NC_018861. 188 ATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAE 267 (465) Q Consensus 188 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe 267 (465) ..+| +...++-.-+++++++.+|--+--...|-||.+|-. .|.| T Consensus 81 ------------------------------~~~e--~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~----~~~~ 124 (333) T protein:vir:78 81 ------------------------------EQRE--GGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNP----SGLY 124 (333) T ss_pred ------------------------------cccc--cccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHH Confidence 0000 111222233445555544444445667888888754 5789 Q ss_pred HHHHHHHHHHHHHHhhHHHHhhhhhee--------eeeeeeeec--cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_018861. 268 KELADILSAEVALEIDRTIIEKANEVA--------TVCTDFDVN--SADGRWFIEKARGLSMRISNEAREIGRQTRKGGG 337 (465) Q Consensus 268 ~EL~niLstEImlEINreii~~l~~~a--------t~~~~~~~~--~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~ 337 (465) ++|.+.|...|...|+..+|.--.... +...-..+. ...+......+..|...+..+ ..+-. ... 
T Consensus 125 ~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~~~-~~~ 199 (333) T protein:vir:78 125 TKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLV----SANTD-VEF 199 (333) T ss_pred HHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccccccccccccccccccccchhHHHHHHHHHhh----ccccc-cCc Confidence 999999999999999999985331110 000000000 000111111233333333322 22223 366 Q ss_pred cEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcc---------eEEE--------EE Q lcl|NC_018861. 338 NKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD---------YCTV--------AY 400 (465) Q Consensus 338 ~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d---------y~~v--------g~ 400 (465) +..+++|+....|.....+.-. .+...-.. +....-.|+|. |++|+++.+.+.+ .+++ |. T Consensus 200 ~~~vmn~~~~~~L~~~~~~~d~-~G~~i~~~-~~~~~~~~~l~-G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~ 276 (333) T protein:vir:78 200 NGWAVDPRFRAHLLRAQAYRDA-NGNVDPSR-INLAAQTGDVL-GLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGF 276 (333) T ss_pred eEEEEcchHHHHHHHHhhhcCC-CCceeecC-ccccCCCceee-ceeeEEccccCCCccccCCCccEEEEEecccEEEEE Confidence 7788899888877654333211 01111000 00011226665 5699988776543 2222 32 Q ss_pred ecCCCccceeEEecccccceeeeeCCC-----ccc-ceeee--eeeeee-eecCcccccccceEEEeecccee Q lcl|NC_018861. 401 KGASNFDAGIFFAPYNITLQQNLTDPV-----SGQ-PAMIL--NNRYDV-VATPLHPEAFIRTFAVNLNNYII 464 (465) Q Consensus 401 kg~~~~d~glfy~PY~~~~~~~~~dp~-----s~q-p~~~~--~tRY~l-~~nPf~~~~~~~~f~~~~~~~~~ 464 (465) .+.-+.+- .+|.- + .|.. -|| ..++| ..|++. 
+.+| +-|++-...+== T Consensus 277 ~~~~~i~~----~~~~~--~---~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~-------~a~~~l~~~~a~ 333 (333) T protein:vir:78 277 ADEIRIKM----SDTAT--L---TDSGSATVSMWQTNQIAILIEVTFGWLLGDK-------QAFVKFVDDEQP 333 (333) T ss_pred eeccEEEE----ecccc--c---cccccceeehhhcCcEEEEEEEEEccEEecc-------cceEEEeccCCC Confidence 22211111 11100 0 0000 011 11122 235554 2333 112111000000 No 115 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=51.87 E-value=0.57 Score=21.92 Aligned_cols=292 Identities=10% Similarity=0.015 Sum_probs=128.7 Q ss_pred CCccchhhhHHHhhhhhhccccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhhe Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIA 79 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIw 79 (465) |-+++.....+++-+..+...+.++. ....-++..+. ..-|.+. -+++.+..+.+..+++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a-----------------~~~~~~~~~~~--lip~~~~~~ii~~~~~~s~l~~l~ 61 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNP-----------------DNVMMHEKKDG--TLLNDFTTPILQEVMENSKIMQLG 61 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhccc-----------------ccccccCCCcc--eechhHHHHHHHHHHhhchhhhhc Confidence 77776666666665544444333221 11100011111 1112222 3456666777788889 Q ss_pred eeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccch Q lcl|NC_018861. 80 GVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDN 159 (465) Q Consensus 80 GVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTg 159 (465) .+-||++++.-|.- +.. +.. + .+. 
T Consensus 62 ~~~~~~~~~~~~p~----~~~-~~~------------------------------------a------~~v--------- 85 (324) T protein:vir:96 62 KYEPMEGTEKKFTF----WAD-KPG------------------------------------A------YWV--------- 85 (324) T ss_pred ceeeccCCceEEEE----Eec-Ccc------------------------------------e------eee--------- Confidence 99999887633321 100 000 0 000 Q ss_pred hhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEE Q lcl|NC_018861. 160 IVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLA 239 (465) Q Consensus 160 Lifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tV 239 (465) +| |..+++..-+++++++ T Consensus 86 ------------------------------------------------------------~E--g~~~~~~~~~f~~v~~ 103 (324) T protein:vir:96 86 ------------------------------------------------------------GE--GQKIETSKATWVNATM 103 (324) T ss_pred ------------------------------------------------------------cC--CccccccccceeEEEE Confidence 00 1112222234555555 Q ss_pred EeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeee----ee-eeeeccCCcccHHHHH Q lcl|NC_018861. 240 EAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATV----CT-DFDVNSADGRWFIEKA 314 (465) Q Consensus 240 tAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~----~~-~~~~~~~~~~~~~e~~ 314 (465) ..|.-+-....|-||.+|-. .|.+++|.+.|...|...+|+.+|..--..... .. ........+.- .+ T Consensus 104 ~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~---~~ 176 (324) T protein:vir:96 104 RAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDF---TQ 176 (324) T ss_pred EeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceeccccc---ch Confidence 56555556669999999853 578899999999999999999998632110000 00 00000001100 12 Q ss_pred HHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCC--C Q lcl|NC_018861. 
315 RGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFA--E 392 (465) Q Consensus 315 ~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~--~ 392 (465) ..|. .+-..|.. .+...+.+++++.....|+...- +.+..... +.. .++| .|++|++++.. + T Consensus 177 ~~i~----~~~~~i~~--~~~~~~~~i~n~~~~~~L~~lkd----~~G~~~~~-~~~----~~~l-~G~PV~~~~~~~~~ 240 (324) T protein:vir:96 177 DNII----DLEALLED--DELEANAFISKTQNRSLLRKIVD----PETKERIY-DRN----SDSL-DGLPVVNLKSSNLK 240 (324) T ss_pred HHHH----HHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhC----CCCCeeec-CCC----CCcc-cceeeEeecCCCCC Confidence 2222 23333322 33466789999999999986521 11111111 111 1344 46788876653 2 Q ss_pred cceEEEEEecCCCccceeEEecccccceeeee--------CCCc-----cc-ceee--eeeeeeeeecCcccccccceEE Q lcl|NC_018861. 393 FDYCTVAYKGASNFDAGIFFAPYNITLQQNLT--------DPVS-----GQ-PAMI--LNNRYDVVATPLHPEAFIRTFA 456 (465) Q Consensus 393 ~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~--------dp~s-----~q-p~~~--~~tRY~l~~nPf~~~~~~~~f~ 456 (465) ...+++|=. +.+++.........+.- |+.. |+ -.+. ..-|++.. +..+.+ |+ T Consensus 241 ~~~~~~gd~------s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~--v~~~~a----~~ 308 (324) T protein:vir:96 241 RGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH--IADDKA----FA 308 (324) T ss_pred cceEEEEec------ceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE--Eecccc----eE Confidence 222333310 01112211111111111 1110 11 1222 33455552 233333 44 Q ss_pred EeeccceeC Q lcl|NC_018861. 457 VNLNNYIIS 465 (465) Q Consensus 457 ~~~~~~~~~ 465 (465) + |++.--. 
T Consensus 309 ~-l~~a~~~ 316 (324) T protein:vir:96 309 K-LVPADKR 316 (324) T ss_pred E-Eeccccc Confidence 2 1111111 No 116 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=51.54 E-value=0.58 Score=21.89 Aligned_cols=273 Identities=12% Similarity=0.074 Sum_probs=113.4 Q ss_pred ccccchh------hhheeeeeccCcc-ccccccccccccccccCCccCCCcccccCccccccccccccccchhhh-ccCC Q lcl|NC_018861. 154 QAGTDNI------VNVLLRLESNSTG-SVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGE-KLGK 225 (465) Q Consensus 154 ~TGPTgL------ifam~s~y~~~~g-~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E-~lg~ 225 (465) |..+..+ ..++.-.|.+..- .+.++--+.....+.. -..+. ..+.....-+ +.+. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k----------------~~~f~-~e~f~~~~t~ra~~~ 63 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGK----------------IPKFG-KESFRLYQTERALRA 63 (307) T ss_pred CCCCCCCcccCHHHHHHHhhccchhhhhhhcCCcccccccccc----------------eeeec-cccccccccccccCC Confidence 1111100 0011111111000 0000000000000000 00000 0001000000 2234 Q ss_pred chhhcce-EEEEEEEEeecceecccchHHHHHHHH--hhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee Q lcl|NC_018861. 
226 DMKEMGI-SVQRVLAEAKTRKVKGTYTIEMLQDLK--AQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV 302 (465) Q Consensus 226 ~f~EM~F-sIeK~tVtAKSRaLKAEYT~ELAQDLk--AiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~ 302 (465) ..+++-| .++..++..+-.+ +|..-|-+ +..++|.|+--..-|...|++..-.++-+.+...++....-++ T Consensus 64 ~~~~v~~~~~~~~~~~~~~~~------l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~ 137 (307) T protein:vir:79 64 KSNRMNPEDIDSVDVNLDEHD------LEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKK 137 (307) T ss_pred Ccceeeeeccccccccccccc------hhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceE Confidence 4555444 2333333333333 33333333 3446787887777787777776666666666544554333333 Q ss_pred c-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecC Q lcl|NC_018861. 303 N-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDN 381 (465) Q Consensus 303 ~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~ 381 (465) . ..+++|.- .-.+-+..|++.-..|...+.+ ..|.+|.|++|..+|..++-+...-........ +...+. .|.+ T Consensus 138 tLsgt~~Wsd-~~sDPi~di~~~~~ai~~~~g~-~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~i--t~~~la-~l~~ 212 (307) T protein:vir:79 138 QLSATEKFTA-ANSDPVGVIEDGKEAIRTKIGR-RPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIV--TVDLLK-EIFE 212 (307) T ss_pred EEccCcccCC-CCCCcHHHHHHHHHHHHHhhCC-ccceEEeCHHHHHHHhcCHHHHHHhcCcccccc--CHHHHH-HHhC Confidence 3 23667863 3455566677777778887775 999999999999999998877532211110000 000111 1112 Q ss_pred ceEEEE--eCCCCcceEEEEEecCC--CccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecCcccccc--cceE Q lcl|NC_018861. 382 RYDVIV--DNFAEFDYCTVAYKGAS--NFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAF--IRTF 455 (465) Q Consensus 382 ~~~vy~--d~~~~~dy~~vg~kg~~--~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~--~~~f 455 (465) --+|++ -.|... ++.- -+...+..+ |++-..- ..+++.+.|..|+..|++- +|+..... .+.. 
T Consensus 213 v~~V~vg~a~y~~~-------~~~~~~iw~~~~~l~-y~~~~~~-~~~~~~~~ps~Gyt~~~~g--~~~~d~~~~~~~~~ 281 (307) T protein:vir:79 213 VENIAVGEAIYADD-------KDRFTDIWGANIVLA-YVPLQRG-GQQRTPYEPSYGYTLRKKG--NPVVDTRIEDGKLE 281 (307) T ss_pred ceeEEEeeeeeecc-------cccchhcCCCceEEE-ecccccC-CCCCcccccccceeEEecC--ceEEecccCCCcee Confidence 112222 222110 1211 123345555 5543332 4567778888888888742 33321111 1111 Q ss_pred EE------------eeccceeC Q lcl|NC_018861. 456 AV------------NLNNYIIS 465 (465) Q Consensus 456 ~~------------~~~~~~~~ 465 (465) -+ ---|+.|. T Consensus 282 ~vrv~~~~~~~i~~~~~G~li~ 303 (307) T protein:vir:79 282 LVRATDIFRPYLLGADAGYLIS 303 (307) T ss_pred EEeecccccceeeccccchhhc Confidence 10 00111222 No 117 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=51.26 E-value=0.59 Score=21.85 Aligned_cols=323 Identities=11% Similarity=0.035 Sum_probs=131.2 Q ss_pred CCccchh------------hhHHHhhhhhhccccccCh-hhhhheehccccc----------hhHHHh-hhh---h-hhc Q lcl|NC_018861. 1 MADKYLL------------DESTKEKFITSNLYPNLNE-SEKNIMRTVLENQ----------GNEVKM-LME---S-TVT 52 (465) Q Consensus 1 ~~~~~~~------------~e~~~e~~~~~~~~~~~~~-~~~~~~~~l~~n~----------~~~~~~-i~e---s-t~t 52 (465) +..+... ++.+.. ....-....... .+.+-.+.-.+.. +..+.. ... + +++ T Consensus 80 ~e~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:78 80 VEVRNLKQIRKHLARAVIMNPELKN-ATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) T ss_pred HHhhhhhhHHHHHHHHHhhhHHHHh-hhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcc Confidence 1100000 011110 000000000000 0000000000000 000000 111 1 111 Q ss_pred cccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccc Q lcl|NC_018861. 
53 GDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTG 131 (465) Q Consensus 53 ~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg 131 (465) +.. ..-|.+. .++...-+..+..|++.|.||+++..- |-.. . +++. T Consensus 159 gg~-~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~-~~~~--~---~~~~-------------------------- 205 (497) T protein:vir:78 159 FAP-GILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS-YLTE--S---AAHN-------------------------- 205 (497) T ss_pred ccc-ccchhhhHHHHHHHHhhhhHHhhccccccCCCceE-EEEE--c---CCCC-------------------------- Confidence 111 1112222 344555566677899999999887521 1111 0 0000 Q ss_pred ccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 132 TPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 132 ~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) . +.|. T Consensus 206 --------~------a~wv------------------------------------------------------------- 210 (497) T protein:vir:78 206 --------N------AAAV------------------------------------------------------------- 210 (497) T ss_pred --------c------ceee------------------------------------------------------------- Confidence 0 0000 Q ss_pred cccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh-- Q lcl|NC_018861. 212 TGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK-- 289 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~-- 289 (465) +| |...+|...+++++++.+|.-+-...+|-||++|- . 
+.|+.|.+-|...|..-+|+.+|.- T Consensus 211 --------~E--~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~--~---~l~~~i~~~l~~~i~~~~d~~~l~G~G 275 (497) T protein:vir:78 211 --------AE--AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA--P---ELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) T ss_pred --------cc--CcccccccccceeeEeeeeeeEeecHhHHHHHHhH--H---HHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 01 12344445567888888888877889999999993 2 3679999999999999999999862 Q ss_pred ------hhheeeeee------e--------eeeccC---CcccHH-----HHHH-----------------------HHH Q lcl|NC_018861. 290 ------ANEVATVCT------D--------FDVNSA---DGRWFI-----EKAR-----------------------GLS 318 (465) Q Consensus 290 ------l~~~at~~~------~--------~~~~~~---~~~~~~-----e~~~-----------------------~L~ 318 (465) +...++... . +.+... ...|.+ ...+ .+. T Consensus 276 ~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (497) T protein:vir:78 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIA 355 (497) T ss_pred cccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhh Confidence 111111000 0 000000 011111 0000 111 Q ss_pred HHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcc Q lcl|NC_018861. 319 MRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD 394 (465) Q Consensus 319 ~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d 394 (465) ..+...-..+..... -.++..|.++.....|+.. |-..+.|....... .....-++|. 
|++|++.+..+.+ T Consensus 356 ~~~~~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~---~~~~~~~~l~-G~pV~~t~~~~~~ 430 (497) T protein:vir:78 356 ENVFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYG---NPVNGGKNIW-GVPVVTTPLIPLG 430 (497) T ss_pred hHHHHHHhhhhhhcc-cCCCeEEEchHHHHHHHHhhcCCCceeccCccccccc---ccccCCceee-ceeeEecCCCCCC Confidence 112222223333333 3667788999888887654 43333332211111 0111223554 5899988887766 Q ss_pred eEEEEEecCCCccceeEEecccccceeeeeCCC---cc---cceeeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 395 YCTVAYKGASNFDAGIFFAPYNITLQQNLTDPV---SG---QPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 395 y~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~---s~---qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -+++|--.. +.+... .-..+.+.+++. -| +=.+=+..|++. +.+| .-|+ +.+.-..-+ T Consensus 431 ~~~~Gd~~~----~~~~i~--~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p-------~A~~~l~~~~~~~~ 496 (497) T protein:vir:78 431 TILVGHFAP----SVIQTA--RREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP-------SAFQLIQLKKGATG 496 (497) T ss_pred ceEEeeccc----ceEEEE--EecccEEEeecccchhhhcCcEEEEEEEeecceeecc-------ccEEEEEecCCccC Confidence 555542110 000000 001111112221 12 223334567876 5555 2344 333322222 No 118 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=51.26 E-value=0.59 Score=21.85 Aligned_cols=323 Identities=11% Similarity=0.035 Sum_probs=131.2 Q ss_pred CCccchh------------hhHHHhhhhhhccccccCh-hhhhheehccccc----------hhHHHh-hhh---h-hhc Q lcl|NC_018861. 1 MADKYLL------------DESTKEKFITSNLYPNLNE-SEKNIMRTVLENQ----------GNEVKM-LME---S-TVT 52 (465) Q Consensus 1 ~~~~~~~------------~e~~~e~~~~~~~~~~~~~-~~~~~~~~l~~n~----------~~~~~~-i~e---s-t~t 52 (465) +..+... ++.+.. ....-....... .+.+-.+.-.+.. +..+.. ... 
+ +++ T Consensus 80 ~e~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:10 80 VEVRNLKQIRKHLARAVIMNPELKN-ATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) T ss_pred HHhhhhhhHHHHHHHHHhhhHHHHh-hhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcc Confidence 1100000 011110 000000000000 0000000000000 000000 111 1 111 Q ss_pred cccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccc Q lcl|NC_018861. 53 GDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTG 131 (465) Q Consensus 53 ~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg 131 (465) +.. ..-|.+. .++...-+..+..|++.|.||+++..- |-.. . +++. T Consensus 159 gg~-~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~-~~~~--~---~~~~-------------------------- 205 (497) T protein:vir:10 159 FAP-GILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS-YLTE--S---AAHN-------------------------- 205 (497) T ss_pred ccc-ccchhhhHHHHHHHHhhhhHHhhccccccCCCceE-EEEE--c---CCCC-------------------------- Confidence 111 1112222 344555566677899999999887521 1111 0 0000 Q ss_pred ccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccc Q lcl|NC_018861. 132 TPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNY 211 (465) Q Consensus 132 ~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~ 211 (465) . +.|. T Consensus 206 --------~------a~wv------------------------------------------------------------- 210 (497) T protein:vir:10 206 --------N------AAAV------------------------------------------------------------- 210 (497) T ss_pred --------c------ceee------------------------------------------------------------- Confidence 0 0000 Q ss_pred cccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh-- Q lcl|NC_018861. 
212 TGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK-- 289 (465) Q Consensus 212 ~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~-- 289 (465) +| |...+|...+++++++.+|.-+-...+|-||++|- . +.|+.|.+-|...|..-+|+.+|.- T Consensus 211 --------~E--~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~--~---~l~~~i~~~l~~~i~~~~d~~~l~G~G 275 (497) T protein:vir:10 211 --------AE--AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA--P---ELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) T ss_pred --------cc--CcccccccccceeeEeeeeeeEeecHhHHHHHHhH--H---HHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 01 12344445567888888888877889999999993 2 3679999999999999999999862 Q ss_pred ------hhheeeeee------e--------eeeccC---CcccHH-----HHHH-----------------------HHH Q lcl|NC_018861. 290 ------ANEVATVCT------D--------FDVNSA---DGRWFI-----EKAR-----------------------GLS 318 (465) Q Consensus 290 ------l~~~at~~~------~--------~~~~~~---~~~~~~-----e~~~-----------------------~L~ 318 (465) +...++... . +.+... ...|.+ ...+ .+. T Consensus 276 ~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (497) T protein:vir:10 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIA 355 (497) T ss_pred cccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhh Confidence 111111000 0 000000 011111 0000 111 Q ss_pred HHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcc Q lcl|NC_018861. 319 MRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD 394 (465) Q Consensus 319 ~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d 394 (465) ..+...-..+..... -.++..|.++.....|+.. |-..+.|....... .....-++|. 
|++|++.+..+.+ T Consensus 356 ~~~~~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~---~~~~~~~~l~-G~pV~~t~~~~~~ 430 (497) T protein:vir:10 356 ENVFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYG---NPVNGGKNIW-GVPVVTTPLIPLG 430 (497) T ss_pred hHHHHHHhhhhhhcc-cCCCeEEEchHHHHHHHHhhcCCCceeccCccccccc---ccccCCceee-ceeeEecCCCCCC Confidence 112222223333333 3667788999888887654 43333332211111 0111223554 5899988887766 Q ss_pred eEEEEEecCCCccceeEEecccccceeeeeCCC---cc---cceeeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 395 YCTVAYKGASNFDAGIFFAPYNITLQQNLTDPV---SG---QPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 395 y~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~---s~---qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) -+++|--.. +.+... .-..+.+.+++. -| +=.+=+..|++. +.+| .-|+ +.+.-..-+ T Consensus 431 ~~~~Gd~~~----~~~~i~--~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p-------~A~~~l~~~~~~~~ 496 (497) T protein:vir:10 431 TILVGHFAP----SVIQTA--RREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP-------SAFQLIQLKKGATG 496 (497) T ss_pred ceEEeeccc----ceEEEE--EecccEEEeecccchhhhcCcEEEEEEEeecceeecc-------ccEEEEEecCCccC Confidence 555542110 000000 001111112221 12 223334567876 5555 2344 333322222 No 119 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=50.89 E-value=0.6 Score=21.81 Aligned_cols=270 Identities=15% Similarity=0.126 Sum_probs=121.5 Q ss_pred eehccccchhHHHhhhhhhhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccc Q lcl|NC_018861. 32 MRTVLENQGNEVKMLMESTVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPT 109 (465) Q Consensus 32 ~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~ 109 (465) ||++=.+ +.. .. |.-+ .+++++-+..+..+++-+.||+....-|-- . .+. 
T Consensus 1 Mat~tt~--------------~g~-~v-P~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~----~---~~~----- 52 (311) T protein:vir:99 1 MATFGTG--------------NLK-NL-PRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIIT----F---NGR----- 52 (311) T ss_pred CceecCC--------------Cce-ec-cHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEE----E---eCC----- Confidence 2222111 111 11 2222 455666666677777888887754311100 0 000 Q ss_pred cccccCccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCcccccccccccccccccc Q lcl|NC_018861. 110 KNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFAT 189 (465) Q Consensus 110 ~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~ 189 (465) ..+ .| T Consensus 53 -----------------------------~~a------~w---------------------------------------- 57 (311) T protein:vir:99 53 -----------------------------PKA------EF---------------------------------------- 57 (311) T ss_pred -----------------------------cee------EE---------------------------------------- Confidence 000 00 Q ss_pred CCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHH Q lcl|NC_018861. 190 KKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKE 269 (465) Q Consensus 190 ~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~E 269 (465) .+| |..+++...++++++..+|.-+-....|-||.++-.- -..|-+++ T Consensus 58 -----------------------------v~E--g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~ 105 (311) T protein:vir:99 58 -----------------------------VGE--GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQT 105 (311) T ss_pred -----------------------------eec--CcccccccceeeEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHH Confidence 001 1233444445677777777777778899999764322 13677899 Q ss_pred HHHHHHHHHHHHhhHHHHhhhhhe-ee-e-ee-------eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccE Q lcl|NC_018861. 
270 LADILSAEVALEIDRTIIEKANEV-AT-V-CT-------DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNK 339 (465) Q Consensus 270 L~niLstEImlEINreii~~l~~~-at-~-~~-------~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~ 339 (465) |.+.|...|...|++.+|.-.... .+ + +. .-.+......+. .+..-|+.+-..+...-.+...+. T Consensus 106 i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~~~~~~~~~~~~~ 180 (311) T protein:vir:99 106 LSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIA-----NPDLAIEAAVGLLVANGHPTPVNG 180 (311) T ss_pred HHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccc-----hhHHHHHHHHHHHhhhccCCCccE Confidence 999999999999999999754210 00 0 00 001111111110 111122222222222222345566 Q ss_pred EEecHHHHHHHHhc----CcccccCCcccccccccccceEEEEecCceEEEEeCCCC----------------cceEEEE Q lcl|NC_018861. 340 LIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAE----------------FDYCTVA 399 (465) Q Consensus 340 ~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~----------------~dy~~vg 399 (465) .+++++....|... |-..++|... ....|+| .+++|++..+-+ .+++++| T Consensus 181 ~vmn~~~~~~L~~lkd~~G~~l~~~~~~---------~~~~~~l-~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~G 250 (311) T protein:vir:99 181 LALHPSIAWGLSTARYTDGRKKFPELGL---------GIGVSSF-EGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVG 250 (311) T ss_pred EEEcHHHHHHHHhhhccCCCeeecCccc---------CCCCcee-cceeeEeecccccccccccccchhhccCcceEEEe Confidence 89999999999764 1111221111 1112455 466788765432 2233332 Q ss_pred EecCCCccceeEEecccccceeeee--CCCcc-----cceeee--eeeeeeeecCcccccccceEEEeeccce Q lcl|NC_018861. 400 YKGASNFDAGIFFAPYNITLQQNLT--DPVSG-----QPAMIL--NNRYDVVATPLHPEAFIRTFAVNLNNYI 463 (465) Q Consensus 400 ~kg~~~~d~glfy~PY~~~~~~~~~--dp~s~-----qp~~~~--~tRY~l~~nPf~~~~~~~~f~~~~~~~~ 463 (465) = ...++.|.-.....+++.- |++.. 
..-++| ..|+|..+ .++ -|++-++.+= T Consensus 251 d-----f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v--~~~-----~~v~~~~~~A 311 (311) T protein:vir:99 251 D-----FANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYV--FTD-----RFVVIENAVA 311 (311) T ss_pred e-----ccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeeccee--cCh-----hHeeeecccC Confidence 1 1112333322223333221 23321 122444 57888743 222 1444333333 No 120 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=50.82 E-value=0.6 Score=21.81 Aligned_cols=315 Identities=14% Similarity=0.093 Sum_probs=116.0 Q ss_pred CCccc------------hhhhHHHh-----------------hhhhhccccc-c--Chhhhhheehcccc-----chhHH Q lcl|NC_018861. 1 MADKY------------LLDESTKE-----------------KFITSNLYPN-L--NESEKNIMRTVLEN-----QGNEV 43 (465) Q Consensus 1 ~~~~~------------~~~e~~~e-----------------~~~~~~~~~~-~--~~~~~~~~~~l~~n-----~~~~~ 43 (465) +++.. -+++++.+ .-.+.+..+. . ++.+++.....+.+ ..+++ T Consensus 29 ~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~ 108 (409) T protein:vir:45 29 WTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEER 108 (409) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHH Confidence 11111 00111100 0000000000 0 11111221222211 12344 Q ss_pred Hhhhhhhh--cccccccc---chhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCc Q lcl|NC_018861. 44 KMLMESTV--TGDIAKFT---PILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLK 116 (465) Q Consensus 44 ~~i~est~--t~~v~~~~---P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~ 116 (465) +.+.|... ++.-..++ |.-+ .+++..-+..+-.+++-|.||++.....+-..+ ... . 
T Consensus 109 ~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~-~------------ 172 (409) T protein:vir:45 109 KALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATAD---GTS-E------------ 172 (409) T ss_pred HHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeec---cCc-c------------ Confidence 44444311 11111111 2222 133444444445567777777654433321110 000 0 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCC Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEA 196 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~ 196 (465) .+ .+. T Consensus 173 -----------------------~~------~~v---------------------------------------------- 177 (409) T protein:vir:45 173 -----------------------VG------VLL---------------------------------------------- 177 (409) T ss_pred -----------------------cc------ccc---------------------------------------------- Confidence 00 000 Q ss_pred cccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecce-ecccchHHHHHHHHhhhCCCHHHHHHHHHH Q lcl|NC_018861. 197 VYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRK-VKGTYTIEMLQDLKAQHGINAEKELADILS 275 (465) Q Consensus 197 ~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRa-LKAEYT~ELAQDLkAiHGlDAe~EL~niLs 275 (465) +| |...++-...++.++..++..+ -=..+|-||.+|- .+|.+++|.+-|+ T Consensus 178 -----------------------~E--~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds----~~~l~~~i~~~la 228 (409) T protein:vir:45 178 -----------------------GE--NEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDS----AIDMEAYLARRIA 228 (409) T ss_pred -----------------------cc--cccccccccccceeeeeeeeeeeeehhhhHHHHhcc----HHHHHHHHHHHHH Confidence 00 0111222222333333332221 1135799999994 2688999999999 Q ss_pred HHHHHHhhHHHHhhhhh-------ee--eeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccE-EEecHH Q lcl|NC_018861. 
276 AEVALEIDRTIIEKANE-------VA--TVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNK-LIVSPK 345 (465) Q Consensus 276 tEImlEINreii~~l~~-------~a--t~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~-~~~s~~ 345 (465) ..|.+-+|+.||.-=-. .+ .+.... .....+.- .+..| +.+-+.+.-.=+ ..+.+ +++++. T Consensus 229 ~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~-~~~~~~~~---~~d~i----~~l~~~l~~~~~-~~a~~~~~~n~~ 299 (409) T protein:vir:45 229 ERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTT-QTAAANAV---KWQEI----LALKHSIDPAYR-RGPKFRLAFNDN 299 (409) T ss_pred HHHHHHHHHHhhccCCCCCccccceeeecccccc-cccccccc---chHHH----HHHHHhhhhhhc-cCCeEEEEECHH Confidence 99999999999862100 00 000000 00111111 12222 233333322222 24556 578998 Q ss_pred HHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc---c-----c Q lcl|NC_018861. 346 VATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY---N-----I 417 (465) Q Consensus 346 va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY---~-----~ 417 (465) ....|+..- ...+.....++ -...-.++|. |++|+++.+.|. +| . .+-.++|..+ + . T Consensus 300 ~~~~l~~lk----d~~G~~i~~~~-~~~~~~~~l~-G~PV~~~~~~p~----~~---~--~~~~i~~Gd~~~~~i~~~~~ 364 (409) T protein:vir:45 300 TLKLISEME----DGQGRPLWLPD-IVGVAPASVL-NVPYVIDQEIDD----IG---A--GKKFMFCGDFDRFIIRRVRY 364 (409) T ss_pred HHHHHHHhh----cCCCceeeccC-cCCCCCceec-ceeeEEecCcCC----cc---C--CccEEEEeehhhhheeeccc Confidence 888886541 11111111111 0011114564 469999887653 00 0 0011233222 1 1 Q ss_pred cceeeeeCCCcccceeeeee--eeeeeecCcccccccceEEE-eeccceeC Q lcl|NC_018861. 418 TLQQNLTDPVSGQPAMILNN--RYDVVATPLHPEAFIRTFAV-NLNNYIIS 465 (465) Q Consensus 418 ~~~~~~~dp~s~qp~~~~~t--RY~l~~nPf~~~~~~~~f~~-~~~~~~~~ 465 (465) ....+..|+-.=...++|.. ||+.. 
|..+++ |++ .....-=+ T Consensus 365 ~~~~~~~d~~~~~~~~~~~~~~r~d~~--~~~~~A----~~~l~~k~s~~~ 409 (409) T protein:vir:45 365 MILKRLVERYAEYDQTGFLAFHRFDCI--LEDTSA----IKALVGKGSVGG 409 (409) T ss_pred eEEEEeecccccCCcEEEEEEEEeccE--eechhh----eEEEEeccCCCC Confidence 22233445543334444444 55552 333332 331 11110001 No 121 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=50.76 E-value=0.6 Score=21.80 Aligned_cols=259 Identities=11% Similarity=-0.002 Sum_probs=104.8 Q ss_pred ccccccccCCcc-------------C-CCcccccCcc-ccccccccccccchhhhccCCchhhcceEEEEEEEEeeccee Q lcl|NC_018861. 182 DKAATFATKKAT-------------V-EAVYTNEALW-LKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKV 246 (465) Q Consensus 182 ~t~~s~~~~~~~-------------~-~~~~~~~a~~-~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaL 246 (465) .+...+...... . .......... .......-.+-..+.--..|.++++-..+++.++..+|.-+- T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 111111000000 0 0000000000 000000000111111112345677777778888887776666 Q ss_pred cccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee------ccCCcccH--HHHHHHHH Q lcl|NC_018861. 247 KGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV------NSADGRWF--IEKARGLS 318 (465) Q Consensus 247 KAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~------~~~~~~~~--~e~~~~L~ 318 (465) ....|-||.|+--. -..+-+++|.+-|+..|..+|+..+|.-.... + +..... ........ 
.......+ T Consensus 81 ~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~-~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:94 81 GARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPR-L-GTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred eeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-C-CcccccccccccccccccccccccccccHH Confidence 78899999865321 12445677777777778888887777542110 0 000000 00000000 00011112 Q ss_pred HHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCC------ Q lcl|NC_018861. 319 MRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAE------ 392 (465) Q Consensus 319 ~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~------ 392 (465) .-|..+-..+.. .+.+....+++++....|+...-- .+...-. ........|+|. |++|++++.-+ T Consensus 158 ~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lkd~----~G~~l~~-~~~~~~~~~tl~-G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:94 158 GAIENAVELLTG--VDADVTGIAINPSFRSALAKQKDL----QGNALFP-ELKWGATPDTIN-GLPVDVNKTVSDMSLTQ 229 (298) T ss_pred HHHHHHHHhhhh--cCCCccEEEEcHHHHHHHHHhhcc----CCCeeec-CcccCCCCceec-ceeeEEecccccccCCC Confidence 223333333222 123556799999999999764210 1111100 000012235664 57899887643 Q ss_pred cceEEEEEecCCCccceeEEecccccceeee--eCCCc-----cc-ceeee--eeeeeeeecCcccccccceEEEeeccc Q lcl|NC_018861. 393 FDYCTVAYKGASNFDAGIFFAPYNITLQQNL--TDPVS-----GQ-PAMIL--NNRYDVVATPLHPEAFIRTFAVNLNNY 462 (465) Q Consensus 393 ~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~--~dp~s-----~q-p~~~~--~tRY~l~~nPf~~~~~~~~f~~~~~~~ 462 (465) .+.+++|- ...++.|.......+.+. .|++. || -.++| ..|+|. .+.+|. 
.|++-..-| T Consensus 230 ~~~~~~Gd-----fs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~--~~~~~~----a~~~l~~~t 298 (298) T protein:vir:94 230 RDRAIIGD-----FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW--GILDAT----KFARVTEAN 298 (298) T ss_pred ccEEEEee-----ccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEecc--Eeeccc----ceEEEEecC Confidence 22232221 011223443333333321 12221 22 23555 345554 333333 355433333 No 122 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=50.54 E-value=0.61 Score=21.77 Aligned_cols=288 Identities=10% Similarity=0.047 Sum_probs=122.2 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCC----- Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKK----- 191 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~----- 191 (465) |... .+++.+.......|..+-.-++. -|-+..|+.+.|...+-. T Consensus 1 ~~~~----------------------~~~~~~~t~~g~~~~~~~~~al~--------ie~~~g~V~~~f~~~s~~~~~v~ 50 (347) T protein:vir:33 1 MANI----------------------QGGQQIGTNQGKGQSAADKLALF--------LKVFGGEVLTAFARTSVTMPRHM 50 (347) T ss_pred CCCC----------------------ccCcccccccccCCcccchHHHH--------HHHHHHHHHHHHHHHHhhhhhhc Confidence 0000 00000000000000000000000 011122222222111000 Q ss_pred -ccCC-C-cccccCccccccccccccccchhhhcc-C----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhC Q lcl|NC_018861. 192 -ATVE-A-VYTNEALWLKVLKNYTGPYATAAGEKL-G----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHG 263 (465) Q Consensus 192 -~~~~-~-~~~~~a~~~~~~~~~~~~~~Ta~~E~l-g----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG 263 (465) .+.. + ...-.-........+. 
.++.+ + ....|+-++||++- -+...|+-.-|.++ | T Consensus 51 ~r~~~~G~sv~i~~iG~~t~~~~~------~g~~l~~~~~~~~~~e~~ltiD~~~--------y~~~~VddiD~~q~-~- 114 (347) T protein:vir:33 51 LRSIASGKSAQFPVIGRTKAAYLK------PGENLDDKRKDIKHTEKVIHIDGLL--------TADVLIYDIEDAMN-H- 114 (347) T ss_pred cccccccceeEeeeccceeeeeec------CCCCCCCCCCCCccceEEEEechhh--------hhhHHHhhHHHHhc-C- Confidence 0000 0 0000000000000000 01111 1 22456666777542 23445665566666 3 Q ss_pred CCHHHHHHHHHHHHHHHHhhHHHHhhhhhe-----eeeeee--------eeeccC-Cc-ccHH-HHHHHHHHHHHHHHHH Q lcl|NC_018861. 264 INAEKELADILSAEVALEIDRTIIEKANEV-----ATVCTD--------FDVNSA-DG-RWFI-EKARGLSMRISNEARE 327 (465) Q Consensus 264 lDAe~EL~niLstEImlEINreii~~l~~~-----at~~~~--------~~~~~~-~~-~~~~-e~~~~L~~~i~~~a~~ 327 (465) .|..+|+..-....++..+++-|+..|... .++... +..... .| .|.. +....++..|-..... T Consensus 115 ~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~ 194 (347) T protein:vir:33 115 YDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARAS 194 (347) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHH Confidence 789999999999999999999998876321 011100 111111 11 2211 2234444444444444 Q ss_pred HHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEE-------EEE Q lcl|NC_018861. 328 IGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCT-------VAY 400 (465) Q Consensus 328 i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~-------vg~ 400 (465) .-++=---.++|+|++|+.-..|-.+.-+..... 
.+..+.....||.+ .|++||+-+.-|.-..+ .|- T Consensus 195 Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~----~~~~~~~~G~V~~i-~G~~V~~Sn~lp~~~~~~~~~~~~ag~ 269 (347) T protein:vir:33 195 LTKNYVPAADRTFYTTPDNYSAILAALMPNAANY----QALLDPERGTIRNV-MGFEVVEVPHLTAGGAGDTREDAPADQ 269 (347) T ss_pred HhhcCCCccCcEEEeCHHHHHHHhcccccccccc----ccccccccceeEEE-eceeEEEecccccCccccccccccccc Confidence 4443322257899999999999988877653321 12223334678887 78999999987664321 111 Q ss_pred e------------cCCCccceeEEecccccc-------eeeeeCCCcccceeeeeeeeee-eecCcccccccceEE---E Q lcl|NC_018861. 401 K------------GASNFDAGIFFAPYNITL-------QQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFA---V 457 (465) Q Consensus 401 k------------g~~~~d~glfy~PY~~~~-------~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~---~ 457 (465) + +.-.-..||||-|=-.+. .-+.-|+.+|-=.|=-+..||. +.+|=+ .=.|+ | T Consensus 270 ~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~----av~i~~~~~ 345 (347) T protein:vir:33 270 KHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEA----AGAIVLPKV 345 (347) T ss_pred cccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhhhHhhhhhhhcCCceecccc----eEEEecCCC Confidence 1 111112456665543332 2222366666655555556665 445521 11111 1 Q ss_pred ee Q lcl|NC_018861. 458 NL 459 (465) Q Consensus 458 ~~ 459 (465) +- T Consensus 346 ~~ 347 (347) T protein:vir:33 346 SE 347 (347) T ss_pred CC Confidence 11 No 123 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=50.50 E-value=0.61 Score=21.77 Aligned_cols=311 Identities=12% Similarity=0.019 Sum_probs=118.6 Q ss_pred CCccc---hhh--hHHHhhhh-----------------hhccc-cccC--hhhhhheehccccc------------hhHH Q lcl|NC_018861. 
1 MADKY---LLD--ESTKEKFI-----------------TSNLY-PNLN--ESEKNIMRTVLENQ------------GNEV 43 (465) Q Consensus 1 ~~~~~---~~~--e~~~e~~~-----------------~~~~~-~~~~--~~~~~~~~~l~~n~------------~~~~ 43 (465) .+... +.. +.|.++.. ..... .... +.++.....+.++. ..+. T Consensus 28 ~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 107 (404) T protein:vir:10 28 VTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEI 107 (404) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHH Confidence 00000 000 01111110 00000 0000 00111110111110 0112 Q ss_pred Hhhhhhhh-ccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccc Q lcl|NC_018861. 44 KMLMESTV-TGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTE 120 (465) Q Consensus 44 ~~i~est~-t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a 120 (465) +-+.+++. +|.+ .. |.-+ .++..+-......+++++.||+++.|-+--.| ... ... T Consensus 108 ~a~~~~~~~~gg~-~v-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~--~~~-~~~---------------- 166 (404) T protein:vir:10 108 NAISENIDEDGGY-AV-PEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEK--RSK-QKP---------------- 166 (404) T ss_pred hhhccccCCCCce-ee-chhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEE--ecC-Ccc---------------- Confidence 22223221 1111 11 3222 45556666677889999999999998654333 100 000 Q ss_pred cccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccc Q lcl|NC_018861. 
121 SANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTN 200 (465) Q Consensus 121 ~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~ 200 (465) ..+.......| T Consensus 167 --------------------------~~~v~e~~~~~------------------------------------------- 177 (404) T protein:vir:10 167 --------------------------MKPLSENQQIP------------------------------------------- 177 (404) T ss_pred --------------------------eeecccccccc------------------------------------------- Confidence 00000000000 Q ss_pred cCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHH Q lcl|NC_018861. 201 EALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVAL 280 (465) Q Consensus 201 ~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEIml 280 (465) + .....++++++.++|.-+-...+|-||.+|-. .+.++.|.+.|+..|.. T Consensus 178 --------------------~------~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~la~~~~~ 227 (404) T protein:vir:10 178 --------------------T------NGDNGKLERFNFKLKDLADFMSIPNDLLKFAD----KSLEDWIINWFVDKVRI 227 (404) T ss_pred --------------------c------cccccceeeeEeeheeeEeeehhhHHHHhhcH----HHHHHHHHHHHHHHHHH Confidence 0 00112345556666555556789999999843 46788999999999999 Q ss_pred HhhHHHHhhhhheeeee-----eeee-eccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc- Q lcl|NC_018861. 281 EIDRTIIEKANEVATVC-----TDFD-VNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI- 353 (465) Q Consensus 281 EINreii~~l~~~at~~-----~~~~-~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~- 353 (465) .+|+.||..--...... .... +...+...+ + .|...|. . .....+-..-.+|++++..+.|+.. 
T Consensus 228 ~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~-~---~~~~~~~----~-~l~~~~~~~~~~v~n~~~~~~L~~lk 298 (404) T protein:vir:10 228 TRNAEILYGAGGDEHATGIMTANKFKKITLPKSPAL-K---DFKKCKN----V-ELLNVFKATSSWIVNQDGFNYLDSLE 298 (404) T ss_pred HHHHHHhhcCCCCCcccceeeccccceeeccccccH-H---HHHHHHH----h-hhhccccCCCEEEEcHHHHHHHHHhh Confidence 99999986431100000 0000 001111111 2 2222121 0 1112222223478999999999875 Q ss_pred ---CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc---------ccee Q lcl|NC_018861. 354 ---GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI---------TLQQ 421 (465) Q Consensus 354 ---~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~---------~~~~ 421 (465) |-..+.|.. .....++|. |++|++.+.... .....+..++|+.+.- +... T Consensus 299 d~~G~~l~~~~~---------~~~~~~~l~-G~PV~~~~~~~~--------~~~~~~~~~~~gd~s~~~~~~~~~~~~i~ 360 (404) T protein:vir:10 299 DKTGRPYLQPDP---------KDPTQYRFL-GLPVIELPNDLL--------LSTESAIPVLLGDTKEAYKYVSDGAYELA 360 (404) T ss_pred ccCCceeeccCc---------CCCCCcccc-ceeeEEeccccc--------CCCCCccEEEEEeccccEEEEEecceEEE Confidence 222222211 011224553 467775332100 0011122233332211 1111 Q ss_pred eeeCCC----cccceeeeeeeeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 422 NLTDPV----SGQPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 422 ~~~dp~----s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) +..++. ..+=.+-...|++. +.+| ..|+ +.+.-.--. 
T Consensus 361 ~~~~~~~~~~~~~~~~~~~~r~d~~v~~~-------~a~~~~~~~~aa~~ 403 (404) T protein:vir:10 361 TTNIGAGAFETNTTKARIIMRIDGNVKDS-------EALLIAEIPVESVQ 403 (404) T ss_pred EeccccchhhcCceEEEEEEeeccEEecc-------cceEEEEeecccCC Confidence 111221 22233445566666 3333 1222 111111000 No 124 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=50.14 E-value=0.62 Score=21.73 Aligned_cols=301 Identities=13% Similarity=0.078 Sum_probs=107.4 Q ss_pred CC-------------ccchhh-hHHHh----hhhhhccccccChhhh----h-heehccccch--------hHHHhhhhh Q lcl|NC_018861. 1 MA-------------DKYLLD-ESTKE----KFITSNLYPNLNESEK----N-IMRTVLENQG--------NEVKMLMES 49 (465) Q Consensus 1 ~~-------------~~~~~~-e~~~e----~~~~~~~~~~~~~~~~----~-~~~~l~~n~~--------~~~~~i~es 49 (465) +. .+.+.. |+..+ .-.+....+...+... . +-+..+.+.. ...+-+.++ T Consensus 57 ~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~ 136 (402) T protein:vir:93 57 LETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG 136 (402) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccC Confidence 11 111110 00000 0000000000000000 0 0011111110 011112222 Q ss_pred hhccccccccchhhh------hhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILVP------VIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESAN 123 (465) Q Consensus 50 t~t~~v~~~~P~l~~------l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ 123 (465) +.+ .+ -.||+ ++......-+-.+++.|-|+++.+.- |-.+.. . 
T Consensus 137 t~~----~G-G~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~p----~~~~~~--~-------------------- 185 (402) T protein:vir:93 137 NDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIP----RVSYTL--D-------------------- 185 (402) T ss_pred CCc----CC-ccccchhHHHHHHHhHHhhhhhhhhceeeecCCceee----eeeccC--C-------------------- Confidence 111 11 22332 34444444455677777776543210 000100 0 Q ss_pred ccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCc Q lcl|NC_018861. 124 KDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEAL 203 (465) Q Consensus 124 ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~ 203 (465) ++ .|.. T Consensus 186 ----------------~a------~~v~---------------------------------------------------- 191 (402) T protein:vir:93 186 ----------------DD------DFIT---------------------------------------------------- 191 (402) T ss_pred ----------------cc------cccc---------------------------------------------------- Confidence 00 0000 Q ss_pred cccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_018861. 204 WLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEID 283 (465) Q Consensus 204 ~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEIN 283 (465) | |...++...++++++..++.-+-...+|-||.+|- ..|.|++|.+.|+..|..-.| T Consensus 192 -----------------E--g~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds----~~~l~~~i~~~la~~~~~~e~ 248 (402) T protein:vir:93 192 -----------------D--VETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS----DVDLVNWVENALQSGLAAKER 248 (402) T ss_pred -----------------c--cccccccccccceeeecceeeeeechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHH Confidence 0 01111111223444555555555578999999985 467789999999999887666 Q ss_pred HHHHhhhhheeeeeeee---eeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccC Q lcl|NC_018861. 
284 RTIIEKANEVATVCTDF---DVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSP 360 (465) Q Consensus 284 reii~~l~~~at~~~~~---~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~ 360 (465) ..++-.-.-...+.--+ .+..+.+.-..+....|+..+. .. -+..+.|++-+...+.++.-. + . T Consensus 249 ~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~~l~-------~~-y~~na~~imn~~t~~~~~~~~---~--d 315 (402) T protein:vir:93 249 KDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLH-------ED-YRDNATIYMRYADYVKIISVL---S--N 315 (402) T ss_pred HhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccC-------hh-hhcCCEEEEechHHHHHHHHH---h--c Confidence 66654332111110001 1111111111133333333222 11 123555655555445554432 1 1 Q ss_pred CcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeee Q lcl|NC_018861. 361 AGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYD 440 (465) Q Consensus 361 ~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~ 440 (465) .+..... ..-++|. |++||+...++. +++|- =+-||.=|....+.+..|+.+.+-.+-...|++ T Consensus 316 ~~~~~~~------~~~~~ll-G~PV~~t~~~~~--i~~GD-------f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~D 379 (402) T protein:vir:93 316 GTTNFFD------TPAEKVF-GKPVVFTDAAVK--PIVGD-------FNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYD 379 (402) T ss_pred CCCcccc------cCCcccc-ccceEEecCCCc--eeeec-------hhhhhhhhhhhhhhhhhcccCCceEEEEEEEeC Confidence 1111110 0113565 579999887654 33332 111222222222333344444333333344666 Q ss_pred e-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 441 V-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 441 l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) . +.|| ++ |. 
+.+..---+ T Consensus 380 g~v~~~---~A----~~~l~ik~~~~~ 399 (402) T protein:vir:93 380 QQRTLD---SA----FRIAKAKENTGP 399 (402) T ss_pred cEEech---hh----eEEEEeecCCCC Confidence 5 3343 22 11 111111001 No 125 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=47.60 E-value=0.7 Score=21.45 Aligned_cols=269 Identities=12% Similarity=0.077 Sum_probs=123.9 Q ss_pred eeeeeccCccccccccccccccccccC--CccCCCcccccCcccc--cccccc--ccccch-hhhccCCchhhcceEEEE Q lcl|NC_018861. 164 LLRLESNSTGSVAIGDEMDKAATFATK--KATVEAVYTNEALWLK--VLKNYT--GPYATA-AGEKLGKDMKEMGISVQR 236 (465) Q Consensus 164 m~s~y~~~~g~ea~~~e~~t~~s~~~~--~~~~~~~~~~~a~~~~--~~~~~~--~~~~Ta-~~E~lg~~f~EM~FsIeK 236 (465) |.++- ..-+.+-|..+-+-. ........-....... ...-++ .+.... .-.+.+..++++-|.-+. T Consensus 1 ~~~~~-------~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~ 73 (309) T protein:vir:99 1 MSNAP-------FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATD 73 (309) T ss_pred CCCCC-------cCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccC Confidence 33221 111222222221100 0000000000000000 000111 111111 112456678888888888 Q ss_pred EEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeec-cCCcccHHHHHH Q lcl|NC_018861. 237 VLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDVN-SADGRWFIEKAR 315 (465) Q Consensus 237 ~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~~-~~~~~~~~e~~~ 315 (465) -++..+-++|..-.--+-.++ |-++.|.|+.-.+-|..-|++..-+++-+.+...++.....++. 
..+++|.- .-- T Consensus 74 ~~~~~~~~~L~~~i~~~~~~~--a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd-~~S 150 (309) T protein:vir:99 74 ETGSTEDHGLDAPVPQADIDN--APTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSD-PTS 150 (309) T ss_pred ceeeecccceeecCCchhhhh--ccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCC-CCC Confidence 899999999987777775554 44689999998888877666544444444333334444433333 34667752 222 Q ss_pred HHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcce Q lcl|NC_018861. 316 GLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDY 395 (465) Q Consensus 316 ~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy 395 (465) +-...|+..-..+ ++ ..|.+|.+.+|..+|..++.+...-..+.... |.+. ... .-.+...|. T Consensus 151 DPi~~i~~~~~~~----g~-~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~---------g~it--~~~-la~l~~ve~ 213 (309) T protein:vir:99 151 NPLPVITDALDSV----IL-RPNIGVLGRRTATILRRHPKIVKAYNGSLGDE---------GMVP--MAF-LQELLELDA 213 (309) T ss_pred CcHHHHHHHHHhh----CC-CcceEEechHHHHHHhhCHHHHHHhcCCCccc---------cccC--HHH-HHHHhCcce Confidence 2333333332222 33 89999999999999999988753321111100 1110 000 011224456 Q ss_pred EEEEE-------ecCCC-----ccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecC--cccccccce-EE--Ee Q lcl|NC_018861. 396 CTVAY-------KGASN-----FDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATP--LHPEAFIRT-FA--VN 458 (465) Q Consensus 396 ~~vg~-------kg~~~-----~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nP--f~~~~~~~~-f~--~~ 458 (465) ++||- +|.+. ++..+++.++.++.- .-..|..|+.-+||.-.+. +.+....+. .. +. T Consensus 214 V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~------~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~ 287 (309) T protein:vir:99 214 IYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLAD------TRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVG 287 (309) T ss_pred EEeecceeeccccccccccccccCCcEEEEEcCCCCC------CcccccccceeecccccCCceeeeeeccCCceEEEEe Confidence 66652 24331 345555664444431 1235888888888775544 222111110 00 00 Q ss_pred e----------ccceeC Q lcl|NC_018861. 
459 L----------NNYIIS 465 (465) Q Consensus 459 ~----------~~~~~~ 465 (465) - -|+.|. T Consensus 288 ~~~k~~i~~~d~G~li~ 304 (309) T protein:vir:99 288 ESVKELVTAPDLGFFFE 304 (309) T ss_pred ccccchhcchhcchhhh Confidence 0 011111 No 126 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=42.62 E-value=0.88 Score=20.90 Aligned_cols=292 Identities=12% Similarity=0.039 Sum_probs=132.1 Q ss_pred cccccChhhhhheehccccchhHHHhhhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEe Q lcl|NC_018861. 20 LYPNLNESEKNIMRTVLENQGNEVKMLMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHY 98 (465) Q Consensus 20 ~~~~~~~~~~~~~~~l~~n~~~~~~~i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY 98 (465) .|- ||.|.. -.| ..++.+-+.++++++... .-|.+. .+++.+.+..+-..++-+.||++++.-+- +. T Consensus 1 ~~~--~~~r~~--~~~---~~~e~~a~~~~~~~~g~~-ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p----~~ 68 (326) T protein:vir:42 1 MAV--NPDRTT--PFL---GVNDPKVAQTGDSMFEGY-LEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIP----HW 68 (326) T ss_pred CCC--Cccchh--hhc---CcchhhheeccccCCcce-echhhHHHHHHHHHhcchhhhhcceeeccCCceEEE----EE Confidence 111 111100 000 011222223332222111 112222 45666666667777888888887652211 00 Q ss_pred cCCCCcccccccccccCccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccc Q lcl|NC_018861. 99 VGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIG 178 (465) Q Consensus 99 ~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~ 178 (465) . ++. . 
+.+ T Consensus 69 -~-~~~-----------------------------------~------a~~----------------------------- 76 (326) T protein:vir:42 69 -T-GDV-----------------------------------S------ASW----------------------------- 76 (326) T ss_pred -e-CCc-----------------------------------c------eEE----------------------------- Confidence 0 000 0 000 Q ss_pred cccccccccccCCccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHH Q lcl|NC_018861. 179 DEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDL 258 (465) Q Consensus 179 ~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDL 258 (465) .+| |..++|-..+++++++.+|...-.-.+|-||.+|- T Consensus 77 ----------------------------------------v~E--g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s 114 (326) T protein:vir:42 77 ----------------------------------------IGE--GDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN 114 (326) T ss_pred ----------------------------------------ecC--CccccccccceeEEEEeeEEEEEeehhhHHHHhcC Confidence 001 23344444567888888888888899999999984 Q ss_pred HhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh--------hheeeeeeeeeeccCCcccHHHHHHHH--HHHHHHHHHHH Q lcl|NC_018861. 259 KAQHGINAEKELADILSAEVALEIDRTIIEKA--------NEVATVCTDFDVNSADGRWFIEKARGL--SMRISNEAREI 328 (465) Q Consensus 259 kAiHGlDAe~EL~niLstEImlEINreii~~l--------~~~at~~~~~~~~~~~~~~~~e~~~~L--~~~i~~~a~~i 328 (465) ..|.++.|.+-|+..|...+|+.+|..- ..... ..........+.+.......+ ...+...+ T Consensus 115 ----~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 186 (326) T protein:vir:42 115 ----PANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTK-EVSLVDPDGTGSNADLTVYDAVAVNALSLLV--- 186 (326) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc-ccceeecccccccccchhHHHHHHHHHhhhh--- Confidence 3688999999999999999999998521 11110 000000011111110011111 11111122 Q ss_pred HHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccc-eEEEEecCceEEEEeCCCCcceEEEEEecC Q lcl|NC_018861. 
329 GRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIK-PNVGKFDNRYDVIVDNFAEFDYCTVAYKGA 403 (465) Q Consensus 329 ~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~-~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~ 403 (465) ..+......|+++.....|+.. |...+.+..- ..... ...|+| .+++|+++++.+.+-.+ ++-|+ T Consensus 187 ---~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~-----~~~~~~~~~~~l-~G~pv~~~~~~~~~~~~-~~~Gd 256 (326) T protein:vir:42 187 ---NAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTY-----TEENSPFRLGRI-VARPTILSDHVASGTVV-GYQGD 256 (326) T ss_pred ---hhccCccEEEEeHHHHHHHHHhhccCCceeeccccc-----cCccccccCcee-eeeeEEEcCCCCCCceE-EEEee Confidence 2234567789999999999864 2222222110 00111 123444 46899999887654322 22222 Q ss_pred CCccceeEEecccccceeeee--------CCCc-----cc---ceeeeeeeeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 404 SNFDAGIFFAPYNITLQQNLT--------DPVS-----GQ---PAMILNNRYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 404 ~~~d~glfy~PY~~~~~~~~~--------dp~s-----~q---p~~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) - .-++|.........+.. |+.. || =.+=...|++..+ .++.+ |++ |++.--+ T Consensus 257 ~---s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v--~~~~a----~~~-l~~~~~~ 324 (326) T protein:vir:42 257 F---RQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHC--NDKDA----FVK-LTNVDAT 324 (326) T ss_pred c---ceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE--ecccc----eEE-Eeecccc Confidence 1 11233333222222111 1111 22 2223456776632 33333 443 4443333 No 127 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=42.29 E-value=0.89 Score=20.86 Aligned_cols=216 Identities=13% Similarity=0.056 Sum_probs=107.9 Q ss_pred ccCCCcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHH Q lcl|NC_018861. 
192 ATVEAVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELA 271 (465) Q Consensus 192 ~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~ 271 (465) ++...+.... ....+ -|-++..+|..--+..+|+++=.+ ++.|-++=.=++|=|. .|.+ + =|.-.|.. T Consensus 1 ~~~~~~Gdti-----t~P~~-iGda~~v~eG~~i~~~~l~~t~~~--atIk~~gk~~~itD~a--~l~~-~-gDp~~ea~ 68 (231) T protein:vir:73 1 ENGINLANLC-----EYPND-IGDAADVAEGGEISLDKIGTTTKS--VTIKKAAKGTEITDEA--ALSG-Y-GDPIGESN 68 (231) T ss_pred CccccCCceE-----Eeccc-ccchhhhcCCCcCChhhcccccee--eeEeeeccceeeeHHH--Hhhc-c-CchHHHHH Confidence 1100000000 01122 233344445333345667655444 4445543333333222 2444 2 48889999 Q ss_pred HHHHHHHHHHhhHHHHhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHH Q lcl|NC_018861. 272 DILSAEVALEIDRTIIEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILD 351 (465) Q Consensus 272 niLstEImlEINreii~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~ 351 (465) +-|+..|...+|.+++..+..... .++ ..-| +..+...+...-.+ -...++++|+|+++..|+ T Consensus 69 ~Q~~~~iA~kvD~di~~~~~~a~l-----~~~--~~~t----~d~i~~A~~~fgde------~~~~~vivv~p~~~~~Lr 131 (231) T protein:vir:73 69 KQLGLSLANKVDDDLLKAAKTTSQ-----TVS--TKAN----VDGVQAALDIFNDE------DAQAYVLIVNPKDAAKIR 131 (231) T ss_pred HHHHHHHHHhhhHHHHHhhccccc-----ccc--cccc----HHHHHHHHHHhccc------cccceEEEEcchHHHhhh Confidence 999999999999999988843321 111 1112 22222222222222 135678999999999998 Q ss_pred hcCccccc--CCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccc--cc----ceeee Q lcl|NC_018861. 352 EIGSFVLS--PAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYN--IT----LQQNL 423 (465) Q Consensus 352 ~~~~~~~~--~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~--~~----~~~~~ 423 (465) ....+... ..++.... ...+|.+ .|++||++...+.+ ..++++|+ ++ ..++. 
T Consensus 132 k~~~~~~~~~~~g~~i~~-----~G~iG~i-~G~~Vi~S~~~~~~--------------~~~~~~~i~~~gAl~~~~k~~ 191 (231) T protein:vir:73 132 KDANAKNIGSEVGANALI-----NGTYADV-LGAQIVRSKKLAEG--------------SALMFKIVSNSPALKLVLKRG 191 (231) T ss_pred hccchhhhhhhhccceee-----ecccceE-cceEEEEcCCCCCC--------------ceeeeeEEeeccceeeeeccc Confidence 86544322 11222222 3367777 45899998876632 22344443 11 12221 Q ss_pred ------eCCCcccceeeeeeeeee-eecCcccccccceEEEeeccc Q lcl|NC_018861. 424 ------TDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFAVNLNNY 462 (465) Q Consensus 424 ------~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~~~~~~~ 462 (465) -|+....-.+--.-.|++ ..||= ..=..||+ |- T Consensus 192 ~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~--~vv~~t~~----g~ 231 (231) T protein:vir:73 192 VQVETDRDIVTKTTVITADEHYAAYLYDLT--KVVNITFT----GV 231 (231) T ss_pred ceeeccccccccccEEEEeEEEEEEEEcCc--cEEEEEee----cC Confidence 177777777777777776 34441 11112333 22 No 128 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=41.94 E-value=0.91 Score=20.82 Aligned_cols=284 Identities=12% Similarity=0.080 Sum_probs=120.4 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccc------cccccccccccccccC Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGS------VAIGDEMDKAATFATK 190 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~------ea~~~e~~t~~s~~~~ 190 (465) |. ++ ..++.+. 
.+...++.+|+ |-+..|+.+.|....- T Consensus 1 ma-------~~---------------~~~~~~~--------------t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~ 44 (347) T protein:vir:94 1 MA-------NM---------------NGGQQMG--------------KDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSV 44 (347) T ss_pred CC-------cc---------------ccccccc--------------cccccCCcccchHHHHHHHHhHHHHHHHHHHHh Confidence 00 00 0000000 01111111111 2233444443332211 Q ss_pred CccCCCcccccCccccccccccccccc----hhhhccC-----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhh Q lcl|NC_018861. 191 KATVEAVYTNEALWLKVLKNYTGPYAT----AAGEKLG-----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQ 261 (465) Q Consensus 191 ~~~~~~~~~~~a~~~~~~~~~~~~~~T----a~~E~lg-----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAi 261 (465) ...-...-+.. ..+. .....-|-.+ ..++.+. ....|.-.+||..- =+...|+-.-|.++ T Consensus 45 ~~~~~~~rti~-~G~s-v~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~--------y~~~~VddiD~~q~- 113 (347) T protein:vir:94 45 TMNKHLVRSIQ-SGKS-AQFPVLGRTKAAYLQPGENLDDKRKDMKHTEKTINIDGLL--------TADVLIYDIEDAMN- 113 (347) T ss_pred hhhhhhheecc-ccce-EEeeeccceeEeeeecCcCCCCCcCCccccceEEEEcchh--------hhhhhhhhHHHHhc- Confidence 00000000000 0000 0000001111 1222221 12456666666532 24445555555555 Q ss_pred hCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheee-ee----------eeeeecc-----CCcccHHHHHHHHHHHHHHHH Q lcl|NC_018861. 262 HGINAEKELADILSAEVALEIDRTIIEKANEVAT-VC----------TDFDVNS-----ADGRWFIEKARGLSMRISNEA 325 (465) Q Consensus 262 HGlDAe~EL~niLstEImlEINreii~~l~~~at-~~----------~~~~~~~-----~~~~~~~e~~~~L~~~i~~~a 325 (465) | .|.-+|++.-...++..++++-||+.|...+- .. ..+.+.. ..+.- .+-...++..|-+.. 
T Consensus 114 ~-~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~-~~~~~~~~d~i~~a~ 191 (347) T protein:vir:94 114 H-YDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQ-VKLGQAIIAQLTLAR 191 (347) T ss_pred C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCcceeEeeeccccccccc-cccHHHHHHHHHHHH Confidence 3 79999999999999999999999987743211 00 0111111 01110 012233333333333 Q ss_pred HHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEE------- Q lcl|NC_018861. 326 REIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTV------- 398 (465) Q Consensus 326 ~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~v------- 398 (465) ...-.+=---.++|+|++|+....|-..-...+. + ..+.++.....+|.+ .|++||.-++.|..-... T Consensus 192 ~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~---~-~~~~~~~~~G~V~~v-~G~~V~~Sn~~p~~~~~~~~~~~~~ 266 (347) T protein:vir:94 192 AKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAA---N-YQALIDPSTGSIRNV-MGFEVIEVPHLTAGGAGDNRAEEGV 266 (347) T ss_pred HHhhhcCCCCCCCEEEeChHHHHHHHHhhccccc---c-cccccccccceeEEe-eceEEEEcCccccccCccccccccc Confidence 3332222222478999999999777654222211 1 122233445688888 789999988876533211 Q ss_pred ---------------EEecCCCccceeEEecccc-------cceeeeeCCCcccceeeeeeeeeeeecCcccccc---cc Q lcl|NC_018861. 399 ---------------AYKGASNFDAGIFFAPYNI-------TLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAF---IR 453 (465) Q Consensus 399 ---------------g~kg~~~~d~glfy~PY~~-------~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~---~~ 453 (465) .|+++-.--.++||.|=-- .+..+-.|+..+--. +..+|++-+=|+-|+.- .. T Consensus 267 ~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~~~~~--i~~~~a~G~g~~rPe~a~~i~~ 344 (347) T protein:vir:94 267 APTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRANFQADQ--IIAKYAMGHGGLRPEACGALVF 344 (347) T ss_pred ccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechhhhhhh--hhhhhhhcCcccccceeEEEEe Confidence 1222222225777766522 223333355444443 35566665544444332 11 Q ss_pred eEE Q lcl|NC_018861. 
454 TFA 456 (465) Q Consensus 454 ~f~ 456 (465) +=| T Consensus 345 ~~a 347 (347) T protein:vir:94 345 KKA 347 (347) T ss_pred cCC Confidence 111 No 129 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=41.48 E-value=0.93 Score=20.77 Aligned_cols=272 Identities=13% Similarity=0.079 Sum_probs=125.6 Q ss_pred hhhhhhccccccccchhh-hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccc Q lcl|NC_018861. 46 LMESTVTGDIAKFTPILV-PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANK 124 (465) Q Consensus 46 i~est~t~~v~~~~P~l~-~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~e 124 (465) ++ ++.|.. .-|.+. -++..+-++.+-.+++.+.||++...-|. .. .. .. T Consensus 1 ma--~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip-~~---~~--~~-------------------- 50 (298) T protein:vir:16 1 MV--LNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVF-TF---TM--DS-------------------- 50 (298) T ss_pred Cc--ccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCceEEE-EE---ec--Cc-------------------- Confidence 22 222222 223333 44566667778899999999876432111 10 00 00 Q ss_pred cccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcc Q lcl|NC_018861. 125 DDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALW 204 (465) Q Consensus 125 a~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~ 204 (465) . +.| . T Consensus 51 ---------------~------a~~----------------------------v-------------------------- 55 (298) T protein:vir:16 51 ---------------E------IDV----------------------------V-------------------------- 55 (298) T ss_pred ---------------c------eEE----------------------------e-------------------------- Confidence 0 000 0 Q ss_pred ccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhH Q lcl|NC_018861. 
205 LKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDR 284 (465) Q Consensus 205 ~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINr 284 (465) +| |.++++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|+..|...|+. T Consensus 56 ---------------~E--~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~ 117 (298) T protein:vir:16 56 ---------------AE--SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDL 117 (298) T ss_pred ---------------cC--CccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHH Confidence 01 1234444445677777777777778899999876432 13567788999999999999999 Q ss_pred HHHhhhhheeeeeeee------eeccCCcccH--HHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcc Q lcl|NC_018861. 285 TIIEKANEVATVCTDF------DVNSADGRWF--IEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSF 356 (465) Q Consensus 285 eii~~l~~~at~~~~~------~~~~~~~~~~--~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~ 356 (465) .+|...... + ++.. .+........ .+....++..|..+...+.. .+.+....+++++....|....- T Consensus 118 ~~l~G~~~~-~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lkd- 192 (298) T protein:vir:16 118 MAFHGVNPR-L-GTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQKD- 192 (298) T ss_pred HhhccccCC-C-CcccccccccccccccccccccccccccHHHHHHHHHHHhhh--cCCCccEEEEcHHHHHHHHHhhc- Confidence 988653110 0 0000 0000000000 01111222333333333332 12355678999999998876421 Q ss_pred cccCCcccccccccccceEEEEecCceEEEEeCCCCc------ceEEEEEecCCCccceeEEecccccceee--eeCCCc Q lcl|NC_018861. 357 VLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEF------DYCTVAYKGASNFDAGIFFAPYNITLQQN--LTDPVS 428 (465) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~------dy~~vg~kg~~~~d~glfy~PY~~~~~~~--~~dp~s 428 (465) ..+...- .........|+|. |++|+++.+.+. +.+++|- ...++.|..--...+++ ..|++. 
T Consensus 193 ---~~G~~i~-~~~~~~~~~~~l~-G~PV~~~~~v~~~~~~~~~~~~~GD-----fs~~~~~~~~~~~~~~~~~~~~~~~ 262 (298) T protein:vir:16 193 ---LQDNALF-PELKWGATPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGD-----FANGFKWGYAKEVPLEVIQYGDPDN 262 (298) T ss_pred ---cCCCeee-cCcccCCCCceec-ceeeEEecccccccCCCccEEEEee-----ccceEEEEEecCceEEEeeccCCcC Confidence 1111111 1110111236774 569998877542 3344441 11112232222222222 113322 Q ss_pred -----cc-ceeee--eeeeee-eecCcccccccceEEEeeccc Q lcl|NC_018861. 429 -----GQ-PAMIL--NNRYDV-VATPLHPEAFIRTFAVNLNNY 462 (465) Q Consensus 429 -----~q-p~~~~--~tRY~l-~~nPf~~~~~~~~f~~~~~~~ 462 (465) || -.+++ ..|++. +.+| .-|++--.-| T Consensus 263 ~~~~~f~~~~v~~ra~~r~d~~v~~~-------~a~~~l~~at 298 (298) T protein:vir:16 263 SGLDLKGYNQVYIRAELFLGWGILDA-------TKFARVTEAN 298 (298) T ss_pred cchhhhhcCcEEEEEEEEEccEeecc-------cceEEEeecC Confidence 32 22444 346664 4444 2344222222 No 130 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=41.02 E-value=0.95 Score=20.72 Aligned_cols=287 Identities=10% Similarity=0.070 Sum_probs=124.4 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCc---- Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKA---- 192 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~---- 192 (465) |..... | ..-++....++.. +-.-++. =|-+..|+.+.|...+-.. 
T Consensus 1 ma~~~~--------~----~~~~t~~~~~~~~----------~~~~a~~--------ie~f~g~V~~~f~~~s~~~~~~~ 50 (347) T protein:vir:15 1 MANIQG--------G----QQIGTNQGKGQSA----------ADKLALF--------LKVFGGEVLTAFARTSVTMPRHM 50 (347) T ss_pred CCcccc--------C----CccccccccCCCc----------chHHHHH--------HHHHHHHHHHHHHHhhhhhhccc Confidence 111100 0 0000000000000 0000000 0111222222222111000 Q ss_pred --cCCC--cccccCccccccccccccccchhhhcc-C----CchhhcceEEEEEEEEeecceecccchHHHHHHHHhhh- Q lcl|NC_018861. 193 --TVEA--VYTNEALWLKVLKNYTGPYATAAGEKL-G----KDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH- 262 (465) Q Consensus 193 --~~~~--~~~~~a~~~~~~~~~~~~~~Ta~~E~l-g----~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH- 262 (465) +..+ ...-.-........+ ..++.+ + ....|+-++||++. |.=.+--||-... T Consensus 51 ~~~~~~G~sv~i~~ig~~t~~~~------~~g~~l~~~~~~~~~~e~~ltID~~~-----------~~~~~VddlD~~q~ 113 (347) T protein:vir:15 51 LRSIASGKSAQFPVIGRTKAAYL------KPGENLDDKRKDIKHTEKVIHIDGLL-----------TADVLIYDIEDAMN 113 (347) T ss_pred cccccccceeEeeeccceeeeee------ccCCCCCCCCCCCccceEEEEechhh-----------hhhHHhhhHHHHhc Confidence 0000 000000000000000 011111 1 23567777787642 3333333444433 Q ss_pred CCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeee---ee---------e-eeccC-Ccc-cH-HHHHHHHHHHHHHHHH Q lcl|NC_018861. 263 GINAEKELADILSAEVALEIDRTIIEKANEVATVC---TD---------F-DVNSA-DGR-WF-IEKARGLSMRISNEAR 326 (465) Q Consensus 263 GlDAe~EL~niLstEImlEINreii~~l~~~at~~---~~---------~-~~~~~-~~~-~~-~e~~~~L~~~i~~~a~ 326 (465) =.|..+|+..-....++..+++-|++.|...+.-. .. + ..... .+. .. ...+..++..+-+... T Consensus 114 ~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~ 193 (347) T protein:vir:15 114 HYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARA 193 (347) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHH Confidence 27899999999999999999999998874321100 00 0 00011 111 11 1123344444433333 Q ss_pred HHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceE-------EEE Q lcl|NC_018861. 
327 EIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYC-------TVA 399 (465) Q Consensus 327 ~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~-------~vg 399 (465) .+-++=---.++|+|++|+....|-...-+..... .+..+.....||.+ .|++||.-+.-|.... +.| T Consensus 194 ~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~----~~~~~~~~G~Vg~i-~G~~V~~Sn~lp~~~~t~~~~~~~~g 268 (347) T protein:vir:15 194 SLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY----QALIDHERGTIRNV-MGFEVVEVPHLTAGGAGDTREDAPAD 268 (347) T ss_pred HHhhcCCCccCCEEEeCHHHHHHHhcccccccccc----cccccccceEEEEE-eceEEEeccccccccccccccccccc Confidence 33333222357899999999999988876643221 22223446788998 5899999888764322 222 Q ss_pred EecC-----C-------CccceeEEeccccccee-------eeeCCCcccceeeeeeeeee-eecCcccccccceEE--- Q lcl|NC_018861. 400 YKGA-----S-------NFDAGIFFAPYNITLQQ-------NLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFA--- 456 (465) Q Consensus 400 ~kg~-----~-------~~d~glfy~PY~~~~~~-------~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~--- 456 (465) -+.. + .-..+|||.|.-.++.+ +.-|+..+-=.|=-+..||. +.+|=+ +-.|+ T Consensus 269 ~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~----av~~~~~~ 344 (347) T protein:vir:15 269 QKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEA----AGAIVLPK 344 (347) T ss_pred ccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhhhhhhehhhhcCCceecccc----EEEEecCC Confidence 2211 1 11257888887554433 12367777666655666666 455522 11122 Q ss_pred Eee Q lcl|NC_018861. 
457 VNL 459 (465) Q Consensus 457 ~~~ 459 (465) |+- T Consensus 345 ~~~ 347 (347) T protein:vir:15 345 VSE 347 (347) T ss_pred CCC Confidence 111 No 131 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=38.73 E-value=1.1 Score=20.46 Aligned_cols=307 Identities=15% Similarity=0.111 Sum_probs=123.9 Q ss_pred CCccchhhhHHHhhhhh---hccccc---cC--------hhhhhheehccccch----hHHHhhhhhhhc-cccccccch Q lcl|NC_018861. 1 MADKYLLDESTKEKFIT---SNLYPN---LN--------ESEKNIMRTVLENQG----NEVKMLMESTVT-GDIAKFTPI 61 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~---~~~~~~---~~--------~~~~~~~~~l~~n~~----~~~~~i~est~t-~~v~~~~P~ 61 (465) ..+..-..+++.+.=.. -...+. .+ ++.|.....+..... .+.+-+..++.+ |... =|. T Consensus 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~--vP~ 130 (408) T protein:vir:10 53 KVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLT--IPQ 130 (408) T ss_pred HHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCcee--ccH Confidence 11100011111111000 000000 00 022222222211111 111112222211 1110 133 Q ss_pred hh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccccc Q lcl|NC_018861. 62 LV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFK 139 (465) Q Consensus 62 l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~ 139 (465) -+ -+++.+.......+++.+.||+++.|-+--.| ...... 
T Consensus 131 ~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~------------------------------------ 172 (408) T protein:vir:10 131 DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEK--WTDVTP------------------------------------ 172 (408) T ss_pred hHHHHHHHHHHhhchhhhhcceeeccCCcceEEEee--cccccc------------------------------------ Confidence 22 35677777778889999999999998765444 100000 Q ss_pred ccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchh Q lcl|NC_018861. 140 TATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAA 219 (465) Q Consensus 140 tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~ 219 (465) .+ .+.. T Consensus 173 ~a------~~v~-------------------------------------------------------------------- 178 (408) T protein:vir:10 173 LT------VMDA-------------------------------------------------------------------- 178 (408) T ss_pred ce------eeec-------------------------------------------------------------------- Confidence 00 0000 Q ss_pred hhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeee Q lcl|NC_018861. 220 GEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCT 298 (465) Q Consensus 220 ~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~ 298 (465) | |...+|.+ -++++++..+|.-+-...+|-||.+|- .+|.+++|.+.|+..|..-+|+.||.-.-....+ T Consensus 179 -E--~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~-- 249 (408) T protein:vir:10 179 -E--DGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-- 249 (408) T ss_pred -C--ccccccccCcceeeEEeeeeeEEeeehhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-- Confidence 0 11122222 135555555555555677999999993 4678899999999999999999998766221110 Q ss_pred eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc----CcccccCCcccccccccccce Q lcl|NC_018861. 
299 DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI----GSFVLSPAGSKIDAINSGIKP 374 (465) Q Consensus 299 ~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~----~~~~~~~~~~~~~~~~~~~~~ 374 (465) .... .+..|...+.. .+ ...+-..-.++|++.....|... |...+.|.. ... T Consensus 250 ------~~~~----~~~~l~~~~~~---~~--~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~---------~~~ 305 (408) T protein:vir:10 250 ------PTIA----KFDDVITMINT---AV--DPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP---------TKP 305 (408) T ss_pred ------cccc----cHHHHHHHHHH---hh--hhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCc---------CCC Confidence 1111 12233332221 11 11111122578999999999765 222222211 011 Q ss_pred EEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc-------cceeeeeCCC------cccceeeeeeeeee Q lcl|NC_018861. 375 NVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI-------TLQQNLTDPV------SGQPAMILNNRYDV 441 (465) Q Consensus 375 ~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~-------~~~~~~~dp~------s~qp~~~~~tRY~l 441 (465) ..++| .|++|++..+. .++-.|++. ..+||+.+.. .-+....++. ..+=.+-+..|++. T Consensus 306 ~~~~l-~G~PV~~~~~~-----~~~~~~~~~--~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:10 306 NSYLI-KGKQVIVVADR-----WLPNTGSTV--YPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV 377 (408) T ss_pred CCcee-cceeeEEeccc-----ccCccCCCc--eEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeecc Confidence 11345 45666653321 112222221 1234443321 1111222322 12233344456665 Q ss_pred -eecCccccccc-ceEE-----EeeccceeC Q lcl|NC_018861. 442 -VATPLHPEAFI-RTFA-----VNLNNYIIS 465 (465) Q Consensus 442 -~~nPf~~~~~~-~~f~-----~~~~~~~~~ 465 (465) +.+| .+.. 
.+|+ +=.+++--+ T Consensus 378 ~v~~~---~a~~~~~~~~~~~~~~~~~~~~~ 405 (408) T protein:vir:10 378 KATDS---EALVAGSFSAIADQVGNFKTTTS 405 (408) T ss_pred EEecc---ccEEEEEeeccccCCCCCCCCCc Confidence 3333 2211 1111 001111111 No 132 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=36.99 E-value=1.1 Score=20.27 Aligned_cols=264 Identities=13% Similarity=0.042 Sum_probs=104.6 Q ss_pred ccccccccccccchhhhh--eeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccc-ccccchhhhc Q lcl|NC_018861. 146 GKIVYSEKQAGTDNIVNV--LLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYT-GPYATAAGEK 222 (465) Q Consensus 146 gait~~~~~TGPTgLifa--m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~Ta~~E~ 222 (465) .++... -|. +++ +...+... ..+.+-++..+.+.... +.+..-......+...+. .+... ..+. T Consensus 1 MA~~~~----~pe--i~~~~v~~~~~~~---lv~~~l~~~~~~~~~~~---GdTv~ip~~~~~~~~d~~~~~~~~-~~~~ 67 (273) T protein:vir:79 1 MAFNNF----IPE--LWSDMLLEEWTAQ---TVFANLVNREYEGIASK---GNVVHIAGVVAPTVKDYKAAGRQT-SADA 67 (273) T ss_pred Ccchhh----hHH--HHHHHHHHHHHhh---ccchhhhhccccccccC---CcEEEEeecCcccccccccCCCcc-Cccc Confidence 111000 010 000 00000000 00001111111111000 000000000000000000 00000 0011 Q ss_pred cCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee Q lcl|NC_018861. 223 LGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV 302 (465) Q Consensus 223 lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~ 302 (465) -...+.-++|+|...-+. + =+-+|..|+ | .|-+. ...=+...+..+++.+++..+.... ..... 
T Consensus 68 --~~~~~~~~tid~~~~~~~----~-i~d~d~~~~----~-~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~---~~~~~ 131 (273) T protein:vir:79 68 --ISDTGVDLLIDQEKSIDF----L-VDDIDRVQV----A-GSLEA-YTRAGATALATDTDKFIADMLVDNG---TALTG 131 (273) T ss_pred --cccceEEEEEeeecccce----e-eccHHHHhh----c-ccHHH-HHHHHHHHHHHHHHHHHHHHHhhcc---ccccc Confidence 123445556665322111 1 123344333 2 35554 4455667788999999998883322 11111 Q ss_pred cc-CCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecC Q lcl|NC_018861. 303 NS-ADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDN 381 (465) Q Consensus 303 ~~-~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~ 381 (465) .. .++.-..+.+..+..++++... --.++++|++|++.+.|..++.+...... ..+....-...+|.+. T Consensus 132 ~~~~~~~~~~~~i~~a~~~ld~~~v-------P~~~R~lvv~p~~~~~Ll~~~~~~~~~~~--~~~~~~l~~G~ig~~~- 201 (273) T protein:vir:79 132 SAPSDADDAFDLIASALKELTKANV-------PNVGRVVVVNAEMAFWLRSSGSKLTSADT--SGDAAGLRAGTIGNLL- 201 (273) T ss_pred ccccchhhHHHHHHHHHHHhhhccC-------CccCcEEEECHHHHHHHhhchhhhhhhhh--cccccceeeeEeeEEe- Confidence 11 1111222444444444433321 12467999999998877766543211111 1111111134678874 Q ss_pred ceEEEEeCCCCc--ce-EEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccccceEE- Q lcl|NC_018861. 382 RYDVIVDNFAEF--DY-CTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFA- 456 (465) Q Consensus 382 ~~~vy~d~~~~~--dy-~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~- 456 (465) |+.||+.++-|. ++ ++.+.|+. +-|+-.. ......-||++|--.|=-..+||. .++| +..+ T Consensus 202 G~~i~~s~~lp~~~~~~~~a~~~~A------~~~a~~~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~p-------~~vv~ 267 (273) T protein:vir:79 202 GARIVESNNLRDTDDEQFVAFHPSA------AAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRP-------TGVVV 267 (273) T ss_pred ceEEEecccccccCceEEEEEeccc------eeeeeeh-hhhhcccCcccceeeeeeeeeeeeEEecC-------ceEEE Confidence 589999877653 33 33343332 2223221 123334577777766666677887 4444 2444 Q ss_pred Eeeccc Q lcl|NC_018861. 
457 VNLNNY 462 (465) Q Consensus 457 ~~~~~~ 462 (465) ++.+|+ T Consensus 268 ~~~~g~ 273 (273) T protein:vir:79 268 FNKTGS 273 (273) T ss_pred EeccCC Confidence 556666 No 133 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=36.38 E-value=1.2 Score=20.20 Aligned_cols=313 Identities=15% Similarity=0.142 Sum_probs=114.0 Q ss_pred CCccchhhhHHHhhhhhhccccccChh-----hhhheehccccch--hHHHh-------------hhhhhhccccccccc Q lcl|NC_018861. 1 MADKYLLDESTKEKFITSNLYPNLNES-----EKNIMRTVLENQG--NEVKM-------------LMESTVTGDIAKFTP 60 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~~~~~~~~-----~~~~~~~l~~n~~--~~~~~-------------i~est~t~~v~~~~P 60 (465) |+-+....-+-.+.=......+++.++ .|.+.+....... ..+.. +..++.+|... =| T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~l--vP 78 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGAL--IP 78 (366) T ss_pred CcccccccccccccccccccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccc--cc Confidence 322221110000000000111111111 1112211111110 00111 11111111110 02 Q ss_pred hhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccccc Q lcl|NC_018861. 61 ILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSF 138 (465) Q Consensus 61 ~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~ 138 (465) .-+ .++.+..+..+...+ |++.+.+++|-+-=.| .. 
T Consensus 79 ~~~~~~ii~~l~~~s~l~~l-g~~~v~~~~g~~~~p~--~t--------------------------------------- 116 (366) T protein:vir:57 79 QNMQNEVIELLRDRTVVRIL-GARSIPLPNGNLSMPR--LS--------------------------------------- 116 (366) T ss_pred hhHHHHHHHHHhhhcchhhh-ceeeeecCCCceEEEE--Ee--------------------------------------- Confidence 211 122222222111111 2222211221100000 00 Q ss_pred cccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccch Q lcl|NC_018861. 139 KTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATA 218 (465) Q Consensus 139 ~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta 218 (465) .+..+. - T Consensus 117 ---------------------------------~~~~a~----------------------------------------w 123 (366) T protein:vir:57 117 ---------------------------------GGATAG----------------------------------------Y 123 (366) T ss_pred ---------------------------------CCccee----------------------------------------e Confidence 000000 0 Q ss_pred hhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhh-------- Q lcl|NC_018861. 219 AGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKA-------- 290 (465) Q Consensus 219 ~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l-------- 290 (465) .+| |..+++...+++++++..|.-+-...+|-||.+|-. .|.|+.|.+-|...|...+|+.+|.-= T Consensus 124 v~E--~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~G 197 (366) T protein:vir:57 124 VGE--GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG----FNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKG 197 (366) T ss_pred ecc--CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccc Confidence 011 233444455677888888877778889999998753 578899999999999999999998632 Q ss_pred -hheeeeeee-eeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccc Q lcl|NC_018861. 
291 -NEVATVCTD-FDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAI 368 (465) Q Consensus 291 -~~~at~~~~-~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~ 368 (465) ...++.... ....+....+. ....+ ++.+.........+......++++.....|.... ...+...-. T Consensus 198 i~~~~~~~~~~~~~~~t~~~~~--~~~~~---~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk----d~~G~~l~~- 267 (366) T protein:vir:57 198 MKAVATAANRLVAWTGTAINLT--TIDEY---LDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR----DGNGNKVYP- 267 (366) T ss_pred eeeccccccceeeccccccchh--hHHHH---HHHHHHhhhccccccccCEEEecHHHHHHHHhhh----ccCCceecc- Confidence 111111111 11111111111 11111 1112222222233334556789999999887642 111222111 Q ss_pred ccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEeccccc--------ceeeeeC-----CC-----cc- Q lcl|NC_018861. 369 NSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNIT--------LQQNLTD-----PV-----SG- 429 (465) Q Consensus 369 ~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~--------~~~~~~d-----p~-----s~- 429 (465) +. --|+| .||+|+++++.|.+- |...-..-++|+.+... ...+.-| +. .| T Consensus 268 ~~----~~g~l-~G~Pvv~s~~ip~~~------~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~ 336 (366) T protein:vir:57 268 EM----SQGIL-KGYPIQRTSAIPANL------GDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFA 336 (366) T ss_pred CC----CCCee-cceeeEEcccccccc------ccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhh Confidence 11 12566 568999988765431 11111122333333211 1111111 11 01 Q ss_pred cceeee--eeeeeeeecCcccccccceEEEeeccc Q lcl|NC_018861. 430 QPAMIL--NNRYDVVATPLHPEAFIRTFAVNLNNY 462 (465) Q Consensus 430 qp~~~~--~tRY~l~~nPf~~~~~~~~f~~~~~~~ 462 (465) +..++| ..|+++ .+.+|++...-=++ +. 
T Consensus 337 ~~~~~iR~~~~~d~--~v~~~~a~~~lt~~---~~ 366 (366) T protein:vir:57 337 RNQSLIRVVTEHDI--GFRHPEGLVLGTGV---IW 366 (366) T ss_pred cCceeEEeeeeeCc--EeeccccEEEEecc---cC Confidence 112333 334444 23444432221111 12 No 134 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=33.59 E-value=1.3 Score=19.88 Aligned_cols=311 Identities=12% Similarity=0.029 Sum_probs=110.5 Q ss_pred CCccchhhhHHHhhhhh---hcccccc----Chhhhhheehccccc-----hhHHHhhhhhhhccccccccchhh----- Q lcl|NC_018861. 1 MADKYLLDESTKEKFIT---SNLYPNL----NESEKNIMRTVLENQ-----GNEVKMLMESTVTGDIAKFTPILV----- 63 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~---~~~~~~~----~~~~~~~~~~l~~n~-----~~~~~~i~est~t~~v~~~~P~l~----- 63 (465) +.+..-...++++.... .+..... .+.++.....+.+.+ ..+...+.+....+........|| T Consensus 188 ~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~ 267 (543) T protein:vir:81 188 SDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLD 267 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhh Confidence 00000000011111100 0000000 011111111111111 111122222211111111111121 Q ss_pred -hhhhhhhhh-hhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccccccc Q lcl|NC_018861. 64 -PVIRRALPS-LIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVSFKTA 141 (465) Q Consensus 64 -~l~~ra~~~-lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s~~ta 141 (465) .++.+.... -+...++-|.|++|.. .-.+ .. 
T Consensus 268 ~~ii~~~~~~~~~l~~~~~~~~~~g~~---~~~~--~~------------------------------------------ 300 (543) T protein:vir:81 268 PTVIITSNGSLNDIRRFARQVVATGDV---WHGV--SS------------------------------------------ 300 (543) T ss_pred hHHHHHHHhhhchhhhhcccccCCcce---EEEE--ec------------------------------------------ Confidence 111111111 1112222222222110 0000 00 Q ss_pred ccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccchhhh Q lcl|NC_018861. 142 TTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAAGE 221 (465) Q Consensus 142 tt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~~E 221 (465) .+..+ .-.+| T Consensus 301 ------------------------------~~~~a----------------------------------------~~v~E 310 (543) T protein:vir:81 301 ------------------------------AAVQW----------------------------------------SWDAE 310 (543) T ss_pred ------------------------------CCcce----------------------------------------eeccc Confidence 00000 00001 Q ss_pred ccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh---------hhh Q lcl|NC_018861. 222 KLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK---------ANE 292 (465) Q Consensus 222 ~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~---------l~~ 292 (465) |..+++-..+++.+++++|.-+=...+|-||.+|- . |.++.|.+-|...|...+|+.||.. |.. T Consensus 311 --g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~ 383 (543) T protein:vir:81 311 --FEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE--A---NVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVT 383 (543) T ss_pred --CccccccccccceeeeeeeeeEeeehhhHHHHhcc--H---HHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchh Confidence 12233333456777788888777889999999873 2 7899999999999999999999852 111 Q ss_pred eeeeeeeeeec-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccc Q lcl|NC_018861. 
293 VATVCTDFDVN-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSG 371 (465) Q Consensus 293 ~at~~~~~~~~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~ 371 (465) ..... ...+. ...+....+....|+..+. ..+.....+++++.+...|...-- ..+.++-.+.. T Consensus 384 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~---------~~~~~~~~~v~n~~~~~~l~~lkd----~~G~~l~~~~~- 448 (543) T protein:vir:81 384 ALAGT-AAEIAPVTAETFALADVYAVYEQLA---------ARHRRQGAWLANNLIYNKIRQFDT----QGGAGLWTTIG- 448 (543) T ss_pred hcccc-cccccccccccccHHHHHHHHHhhh---------ccccCCcEEEEcHHHHHHHHHhhc----CCCceeccCcC- Confidence 11100 01111 1112222233333333332 222233357899999999976421 11111111100 Q ss_pred cceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecc---cc---cceeeeeCCCcc--------cceeeeee Q lcl|NC_018861. 372 IKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPY---NI---TLQQNLTDPVSG--------QPAMILNN 437 (465) Q Consensus 372 ~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY---~~---~~~~~~~dp~s~--------qp~~~~~t 437 (465) ...-++| .|++||+..+.|..-...+=.| +.-++|+.+ +. .-+...+||..+ +=.+-+.. T Consensus 449 -~g~~~~l-~G~pv~~~~~~~~~~~~~~~~~----~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 522 (543) T protein:vir:81 449 -NGEPSQL-LGRPVGEAEAMDANWNTSASAD----NFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYY 522 (543) T ss_pred -CCCCccc-cceeeEEeccccccccccccCC----cceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEE Confidence 0112455 4578888877543221100000 111222221 11 112223344332 22333345 Q ss_pred eeee-eecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 438 RYDV-VATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 438 RY~l-~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) |+|. +.|| .+ |+ +++. 
--| T Consensus 523 r~d~~v~~~---~A----~~~l~~~--~~a 543 (543) T protein:vir:81 523 RMGADVVNP---NA----FRLLNVE--TAS 543 (543) T ss_pred eeccEeecc---cc----eEEEEec--ccC Confidence 6665 3333 33 32 1111 111 No 135 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=33.22 E-value=1.4 Score=19.83 Aligned_cols=264 Identities=13% Similarity=0.059 Sum_probs=106.6 Q ss_pred ccccccccccccchhhhh--eeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccc-ccccchhhhc Q lcl|NC_018861. 146 GKIVYSEKQAGTDNIVNV--LLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYT-GPYATAAGEK 222 (465) Q Consensus 146 gait~~~~~TGPTgLifa--m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~Ta~~E~ 222 (465) .++... -|+ +++ +...+.. ..-+-+-++..+++.... +.+..-.-........+. .+... ..+. T Consensus 1 MA~~~~----~pe--~~~~~v~~~~~~---~lv~~~l~~~~~~~~~~~---Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~ 67 (273) T protein:vir:10 1 MAFNNF----IPE--LWSDMLLEEWTA---QTVFANLVNREYEGTASK---GNVVHIAGVVAPTVKDYKAAGRQT-SADA 67 (273) T ss_pred Ccchhh----hHH--HHHHHHHHHHHh---hhccchhhcccccccccc---CceEEEeecccccccccccCCCcc-Cccc Confidence 111000 010 000 0000000 000011111111111000 000000000000000000 00000 0011 Q ss_pred cCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee Q lcl|NC_018861. 223 LGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV 302 (465) Q Consensus 223 lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~ 302 (465) + .-.+..++|+|...-+ ++ =+-+|.+|+. .|-++ +..-....++.+++.+++..+...++ .... 
T Consensus 68 ~--~~~~~~~tid~~~~~~----~~-i~d~d~~~~~-----~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~---~~~~ 131 (273) T protein:vir:10 68 I--SDTGVDLLIDQEKSID----FL-VDDIDRVQVA-----GSLEA-YTRAGATALATDTDKFIADMLVDNGT---ALTG 131 (273) T ss_pred c--ccceEEEEEeeeeecc----eE-eecHHHhhhh-----ccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc---cccc Confidence 1 1234445666532111 11 1234444432 24444 34445678889999999988843322 1111 Q ss_pred cc-CCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecC Q lcl|NC_018861. 303 NS-ADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDN 381 (465) Q Consensus 303 ~~-~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~ 381 (465) .. .+..-..+.+..+..++++..-- -.++++|++|++.+.|..++.+.......... ...-...+|.+. T Consensus 132 ~~~~~~~~~~~~i~~a~~~ld~~~vP-------~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~i~- 201 (273) T protein:vir:10 132 SAPTDADDAFDLIAKALKELTKANVP-------NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLL- 201 (273) T ss_pred ccccchhHHHHHHHHHHHHhhhcCCC-------cCCCEEEECHHHHHHHhcchhhhhhhhccccc--cceeeeeeeEEe- Confidence 11 11111223344443333322211 24679999999999998887643222211111 111134678874 Q ss_pred ceEEEEeCCCCc---ceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccccceEE- Q lcl|NC_018861. 382 RYDVIVDNFAEF---DYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFA- 456 (465) Q Consensus 382 ~~~vy~d~~~~~---dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~- 456 (465) |+.||+.++-|. ..++.+.|+. +-|+ -........-||++|--.|=-+.+||. +.+| ...+ T Consensus 202 G~~v~~s~~lp~~~~~~~~~~~~~A------~~~a-~q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~-------~~~~~ 267 (273) T protein:vir:10 202 GARIVESNNLRDTDDEQFVAFHPSA------AAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRP-------TGVVV 267 (273) T ss_pred ceEEEEecccccCCccEEEEEeccc------eeee-eeeehhhcccCCCcceeeeeeeeeeeeeEecc-------ceEEE Confidence 589999877653 3355565532 2232 122233445688888655555666777 4444 2344 Q ss_pred Eeeccc Q lcl|NC_018861. 
457 VNLNNY 462 (465) Q Consensus 457 ~~~~~~ 462 (465) ++.+|+ T Consensus 268 l~~~g~ 273 (273) T protein:vir:10 268 FNKTGS 273 (273) T ss_pred EeccCC Confidence 556666 No 136 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=33.22 E-value=1.4 Score=19.83 Aligned_cols=264 Identities=13% Similarity=0.059 Sum_probs=106.6 Q ss_pred ccccccccccccchhhhh--eeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccc-ccccchhhhc Q lcl|NC_018861. 146 GKIVYSEKQAGTDNIVNV--LLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYT-GPYATAAGEK 222 (465) Q Consensus 146 gait~~~~~TGPTgLifa--m~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~Ta~~E~ 222 (465) .++... -|+ +++ +...+.. ..-+-+-++..+++.... +.+..-.-........+. .+... ..+. T Consensus 1 MA~~~~----~pe--~~~~~v~~~~~~---~lv~~~l~~~~~~~~~~~---Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~ 67 (273) T protein:vir:10 1 MAFNNF----IPE--LWSDMLLEEWTA---QTVFANLVNREYEGTASK---GNVVHIAGVVAPTVKDYKAAGRQT-SADA 67 (273) T ss_pred Ccchhh----hHH--HHHHHHHHHHHh---hhccchhhcccccccccc---CceEEEeecccccccccccCCCcc-Cccc Confidence 111000 010 000 0000000 000011111111111000 000000000000000000 00000 0011 Q ss_pred cCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeee Q lcl|NC_018861. 223 LGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDV 302 (465) Q Consensus 223 lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~ 302 (465) + .-.+..++|+|...-+ ++ =+-+|.+|+. .|-++ +..-....++.+++.+++..+...++ .... 
T Consensus 68 ~--~~~~~~~tid~~~~~~----~~-i~d~d~~~~~-----~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~---~~~~ 131 (273) T protein:vir:10 68 I--SDTGVDLLIDQEKSID----FL-VDDIDRVQVA-----GSLEA-YTRAGATALATDTDKFIADMLVDNGT---ALTG 131 (273) T ss_pred c--ccceEEEEEeeeeecc----eE-eecHHHhhhh-----ccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc---cccc Confidence 1 1234445666532111 11 1234444432 24444 34445678889999999988843322 1111 Q ss_pred cc-CCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecC Q lcl|NC_018861. 303 NS-ADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDN 381 (465) Q Consensus 303 ~~-~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~ 381 (465) .. .+..-..+.+..+..++++..-- -.++++|++|++.+.|..++.+.......... ...-...+|.+. T Consensus 132 ~~~~~~~~~~~~i~~a~~~ld~~~vP-------~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~i~- 201 (273) T protein:vir:10 132 SAPTDADDAFDLIAKALKELTKANVP-------NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLL- 201 (273) T ss_pred ccccchhHHHHHHHHHHHHhhhcCCC-------cCCCEEEECHHHHHHHhcchhhhhhhhccccc--cceeeeeeeEEe- Confidence 11 11111223344443333322211 24679999999999998887643222211111 111134678874 Q ss_pred ceEEEEeCCCCc---ceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeee-eecCcccccccceEE- Q lcl|NC_018861. 382 RYDVIVDNFAEF---DYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFA- 456 (465) Q Consensus 382 ~~~vy~d~~~~~---dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~- 456 (465) |+.||+.++-|. ..++.+.|+. +-|+ -........-||++|--.|=-+.+||. +.+| ...+ T Consensus 202 G~~v~~s~~lp~~~~~~~~~~~~~A------~~~a-~q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~-------~~~~~ 267 (273) T protein:vir:10 202 GARIVESNNLRDTDDEQFVAFHPSA------AAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRP-------TGVVV 267 (273) T ss_pred ceEEEEecccccCCccEEEEEeccc------eeee-eeeehhhcccCCCcceeeeeeeeeeeeeEecc-------ceEEE Confidence 589999877653 3355565532 2232 122233445688888655555666777 4444 2344 Q ss_pred Eeeccc Q lcl|NC_018861. 
457 VNLNNY 462 (465) Q Consensus 457 ~~~~~~ 462 (465) ++.+|+ T Consensus 268 l~~~g~ 273 (273) T protein:vir:10 268 FNKTGS 273 (273) T ss_pred EeccCC Confidence 556666 No 137 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=33.02 E-value=1.4 Score=19.81 Aligned_cols=275 Identities=12% Similarity=0.094 Sum_probs=103.5 Q ss_pred ccccchhhhheeeeeccCccccccccccccccccccC--CccCCCcccccCccccc--ccccc-ccccchhh-hccCCch Q lcl|NC_018861. 154 QAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATK--KATVEAVYTNEALWLKV--LKNYT-GPYATAAG-EKLGKDM 227 (465) Q Consensus 154 ~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~--~~~~~~~~~~~a~~~~~--~~~~~-~~~~Ta~~-E~lg~~f 227 (465) |.- ++ ....-+.+-|..+-+-. ........-.......+ ...++ .+.....- -+.+..+ T Consensus 1 m~~-------~~--------~~~~~dp~LT~~A~gy~n~~~ia~~l~P~vpv~~~~~k~~~f~~eaF~~~~t~r~~~~~~ 65 (307) T protein:vir:10 1 MGR-------LS--------KLRIVDPVLTNLAIGYTNAEFIGQSLMPVVEVEKEGGKIPKFGKESFRLYKTERALRARS 65 (307) T ss_pred CCC-------CC--------CCcccChhHHHHHHhhcchhhhhhhcCCcccccccccceeeECcccccchhhhcccCCCc Confidence 100 00 00011111111110000 00000000000000000 00010 11111111 1234456 Q ss_pred hhcceE-EEEEEEEeecceecccchHHHHHHHHh--hhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeec- Q lcl|NC_018861. 228 KEMGIS-VQRVLAEAKTRKVKGTYTIEMLQDLKA--QHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFDVN- 303 (465) Q Consensus 228 ~EM~Fs-IeK~tVtAKSRaLKAEYT~ELAQDLkA--iHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~~~- 303 (465) +++-|. +++.....+-..| |.+-|-++ ....|.|+--..-|...|++..-.++-+.+...+.....-++. 
T Consensus 66 ~~v~~~~~~~~~~~~~~~~L------~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tL 139 (307) T protein:vir:10 66 NRMNPEDLGSIDIVLDEHDL------EYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQL 139 (307) T ss_pred ceeecccccccccccccccc------cccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEe Confidence 666664 3443444443333 33333333 2356777777777777776554444433332223332222222 Q ss_pred cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCce Q lcl|NC_018861. 304 SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRY 383 (465) Q Consensus 304 ~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~ 383 (465) ..+++|.- .-.+-+..|++.-..|.+.+.+ ..|.+|.|.+|..+|..++-+...-........ +...++-.| +-- T Consensus 140 sGt~~Wsd-~~sDPi~di~~~~~ai~~~~g~-~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~i--t~~~la~ll-~v~ 214 (307) T protein:vir:10 140 SATEKFTA-AGSDPVGVIEDGKEAIRTKIGR-RPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIV--TVDLLKEIF-EVE 214 (307) T ss_pred ccccccCC-CCCCcHHHHHHHHHHHHhhhCC-ccceEEeCHHHHHHHhcCHHHHHHhCCcccccc--CHHHHHHHh-Cce Confidence 13567853 3445555666666777777775 999999999999999999877532111110000 001111111 111 Q ss_pred EEEEe--CCCCcceEEEEEecCCC--ccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecCccccccc--ceEEE Q lcl|NC_018861. 384 DVIVD--NFAEFDYCTVAYKGASN--FDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFI--RTFAV 457 (465) Q Consensus 384 ~vy~d--~~~~~dy~~vg~kg~~~--~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~--~~f~~ 457 (465) +|++. .|+.. ++.-. +...+..+ |+|-..- .-.+.-+.|..|+..|+ .-.|+...... 
+..-+ T Consensus 215 ~i~vg~a~~~~~-------~~~~~~iw~~~~vl~-yv~~~~~-~~~~~~~epsfGyT~~~--~g~~~~d~~~~~~~~~~~ 283 (307) T protein:vir:10 215 NIAVGEAIYADD-------KDRFTDIWGANIVLA-YVPLQRG-GQQRTPYEPSYGYTLRK--KGNPVVDTRIEDGKLELV 283 (307) T ss_pred eEEEeeeeeecc-------CCccceeCCCceEEE-ecccccC-CCCCcccccccceeEEE--cCCeEeeceecCCceeEE Confidence 22221 11100 11110 11222222 3322211 12344556777888773 22344322111 11100 Q ss_pred ------------eeccceeC Q lcl|NC_018861. 458 ------------NLNNYIIS 465 (465) Q Consensus 458 ------------~~~~~~~~ 465 (465) ---|+.|. T Consensus 284 r~~~~~~~~i~~~~~G~li~ 303 (307) T protein:vir:10 284 RSTDIFRPYLLGADAGYLIS 303 (307) T ss_pred eccccccceeecccccceec Confidence 00112222 No 138 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=33.01 E-value=1.4 Score=19.81 Aligned_cols=307 Identities=15% Similarity=0.109 Sum_probs=116.4 Q ss_pred CCc------------------------------cchhhhHHHhhhhhhccccccC-hhhhhheehccccchhHHH----- Q lcl|NC_018861. 1 MAD------------------------------KYLLDESTKEKFITSNLYPNLN-ESEKNIMRTVLENQGNEVK----- 44 (465) Q Consensus 1 ~~~------------------------------~~~~~e~~~e~~~~~~~~~~~~-~~~~~~~~~l~~n~~~~~~----- 44 (465) |.. +.+...++.......-+...-. ..++.+--.+.+-.+.++. T Consensus 269 l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~ 348 (632) T protein:vir:96 269 MNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMP 348 (632) T ss_pred HhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhh Confidence 111 1110000100000000000000 0000000000000011100 Q ss_pred ---hhhhhhhcccccccc----chhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccC Q lcl|NC_018861. 
45 ---MLMESTVTGDIAKFT----PILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVL 115 (465) Q Consensus 45 ---~i~est~t~~v~~~~----P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~ 115 (465) ++.-.-.++....+. |.++ .++...-++.|...+ |++.+++.+|-+ + +....+ T Consensus 349 ~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~~~~~~~~g~~---~--ip~~~~------------ 410 (632) T protein:vir:96 349 HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDV---D--IPKKTS------------ 410 (632) T ss_pred HHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh-cceEeecCCcce---E--EEEEeC------------ Confidence 000000000001111 1111 112222333444443 444443333211 0 100000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCC Q lcl|NC_018861. 116 KLKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVE 195 (465) Q Consensus 116 ~~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~ 195 (465) +|+ ..+ T Consensus 411 ----------------------------------------~~~----------------a~w------------------ 416 (632) T protein:vir:96 411 ----------------------------------------GAN----------------FYW------------------ 416 (632) T ss_pred ----------------------------------------Cce----------------eEe------------------ Confidence 000 000 Q ss_pred CcccccCccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHH Q lcl|NC_018861. 196 AVYTNEALWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILS 275 (465) Q Consensus 196 ~~~~~~a~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLs 275 (465) .+| |...++-..++++++..+|.=+-...+|-||..| -.+|+|++|.+-|. 
T Consensus 417 -----------------------v~E--~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d----s~~~~~~~i~~~l~ 467 (632) T protein:vir:96 417 -----------------------IGE--DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ----SSIHVENLIREDLI 467 (632) T ss_pred -----------------------ecC--CccccccccceeeEEeeeeEEEEehhhHHHHHhc----cchHHHHHHHHHHH Confidence 001 2234444557888888888888888899998776 26789999999999 Q ss_pred HHHHHHhhHHHHhhhhheeee-eee-----eeeccC-CcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHH Q lcl|NC_018861. 276 AEVALEIDRTIIEKANEVATV-CTD-----FDVNSA-DGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVAT 348 (465) Q Consensus 276 tEImlEINreii~~l~~~at~-~~~-----~~~~~~-~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~ 348 (465) ..|...+++.+|.---..-.+ |-. ..+... .+.. .+....|..+|.... ........++++.... T Consensus 468 ~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~-~~~i~~~~~~i~~~~-------~~~~~~~~~~~~~~~~ 539 (632) T protein:vir:96 468 EGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVD-WASVVDMETKISTFN-------ADAGRLAYLTSVTQRG 539 (632) T ss_pred HHHHHHHHHHhhcccCCCCccceeeecccccceecccccCC-HHHHHHHHHHHhhcc-------cccCccEEEEchhHHH Confidence 999999999998632100000 000 001000 1111 123334433332221 1123345678888888 Q ss_pred HHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCc Q lcl|NC_018861. 349 ILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVS 428 (465) Q Consensus 349 ~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s 428 (465) .|+.....+. .+..+.. -|+| .||+|++.++.+.+-+++|-- +-+|++-+- -+...+||.+ T Consensus 540 ~l~~~~l~d~--~G~~i~~--------~~~l-~G~pv~~s~~ip~~~~~~gd~------s~~~i~~~~--~~~i~~~~~~ 600 (632) T protein:vir:96 540 AAKKAQVFDN--TGERIWQ--------NNEV-NGYRAEASNQIPADTWIFGDW------SQIVIAMWG--VLDLKVDPYT 600 (632) T ss_pred HHHHHhccCC--CCceeec--------CCee-cccceEeccccccCcEEEeec------ceEEEEEec--ceEEEEcccc Confidence 8876443321 1222211 1456 478999998887765554421 011222111 1222334433 Q ss_pred ----ccceeeeeeeeee-eecCcccccccceEEEeeccceeC Q lcl|NC_018861. 
429 ----GQPAMILNNRYDV-VATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 429 ----~qp~~~~~tRY~l-~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) -+=.+=...|+++ +.+| ++ |++--.- + T Consensus 601 ~~~~~~v~~~~~~~~d~~v~~~---~a----f~~~k~~---A 632 (632) T protein:vir:96 601 KAASDGLVLRVFQDVDAGVRRK---EA----FCIAKKG---A 632 (632) T ss_pred ccccCceEEEEEeecCceeech---hh----hhheeec---C Confidence 2222333455555 3333 11 1110000 0 No 139 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=31.58 E-value=1.5 Score=19.64 Aligned_cols=305 Identities=13% Similarity=0.074 Sum_probs=106.8 Q ss_pred CC-------------ccchhhh--HHHhh---hhhhccccccCh-hhh----hheehccccc--------hhHHHhhhhh Q lcl|NC_018861. 1 MA-------------DKYLLDE--STKEK---FITSNLYPNLNE-SEK----NIMRTVLENQ--------GNEVKMLMES 49 (465) Q Consensus 1 ~~-------------~~~~~~e--~~~e~---~~~~~~~~~~~~-~~~----~~~~~l~~n~--------~~~~~~i~es 49 (465) +. ++.+... ...++ -.+....+.-.+ ..+ -+-+.++.++ .++.+-+.++ T Consensus 42 ~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~ 121 (387) T protein:vir:93 42 LETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQSLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG 121 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccC Confidence 10 1111110 00000 000000000000 000 0111111221 1122223333 Q ss_pred hhccccccccchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDF 127 (465) Q Consensus 50 t~t~~v~~~~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~ 127 (465) +.++.-. .=|.=+ .++.+.-..-+-.+++.|.|+++.+. . +-.+.. . 
T Consensus 122 t~s~gG~-~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~--p--~~~~~~---~----------------------- 170 (387) T protein:vir:93 122 NDSGGDK-LLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI--P--RVSYTL---D----------------------- 170 (387) T ss_pred cCCCCce-eechhHHHHHHHHHHhhchhhhheeeeecCCceE--E--EEeecC---C----------------------- Confidence 2221100 013211 24444444445567888877764321 1 000100 0 Q ss_pred ccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccc Q lcl|NC_018861. 128 NYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 128 ~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) + +.|.. T Consensus 171 ------------~------a~~v~-------------------------------------------------------- 176 (387) T protein:vir:93 171 ------------D------DDFIT-------------------------------------------------------- 176 (387) T ss_pred ------------c------ccccc-------------------------------------------------------- Confidence 0 00000 Q ss_pred cccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_018861. 208 LKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTII 287 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii 287 (465) | |...++...+++.++..+|.-+-...+|-||.+|- ..|.|++|.+-|+..|..-.|..++ T Consensus 177 -------------E--~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds----~~~l~~~i~~~la~~~~~~e~~~~~ 237 (387) T protein:vir:93 177 -------------D--VETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDAL 237 (387) T ss_pred -------------C--cccccccccccceeeeeheeeeeechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHh Confidence 0 01111112234445555555566688999999884 3567899999998888876677666 Q ss_pred hhhhheeeeeeee---eeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccc Q lcl|NC_018861. 
288 EKANEVATVCTDF---DVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSK 364 (465) Q Consensus 288 ~~l~~~at~~~~~---~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~ 364 (465) -.-.-...+.--+ .+..+.+.-..+....|+.. +...= |..+.|++-+.....+|.-. +.. .++. T Consensus 238 ~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~~~-------l~~~~-~~~a~~~mn~~t~~~~~~~~---~d~-~~~~ 305 (387) T protein:vir:93 238 AVSPKSGLDHMSFYNGSVKEVEGADMYDAIINALAD-------LHEDY-RDNATIYMRYADYVKIISVL---SNG-TTNF 305 (387) T ss_pred hcCCCccccceeeeccccccccccchHHHHHHHHhc-------cChhh-hcCCEEEEechHHHHHHHHH---hcC-CCcc Confidence 4332111100000 11111111111223333332 22211 23455654443334444332 100 0111 Q ss_pred ccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeee--eeeee Q lcl|NC_018861. 365 IDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNN--RYDVV 442 (465) Q Consensus 365 ~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~t--RY~l~ 442 (465) ... .-.+|. |++||+..+++. +++|-- +-||--|....+. .+.......++|.. |++. T Consensus 306 ~~~-------~~~~ll-G~PV~~~~~~~~--~~~GDf-------~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~r~d~- 365 (387) T protein:vir:93 306 FDT-------PAEKVF-GKPVVFTDAAVK--PIVGDF-------NYFGINYDGTTYD--TDKDVKKGEYLFVLTAWYDQ- 365 (387) T ss_pred ccc-------CCcccc-ccceEEecCCCc--eeeeeh-------hhhheehhhheee--ecccccCCceeEEEEeeeCc- Confidence 111 113565 568888776653 333421 1112112111111 12222344555655 4444 Q ss_pred ecCcccccccceEEEeeccceeC Q lcl|NC_018861. 443 ATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 443 ~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) .+..+++... 
+.+.--=-+ T Consensus 366 -~v~~~eA~~~---l~~k~~~~~ 384 (387) T protein:vir:93 366 -QRTLDSAFRI---AKAKENTGS 384 (387) T ss_pred -eeechhheEE---EEeecCCCC Confidence 2333333210 111100000 No 140 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=30.58 E-value=1.6 Score=19.52 Aligned_cols=317 Identities=12% Similarity=0.009 Sum_probs=119.5 Q ss_pred CCccchhhhHHHhhhhh----------hcccccc-Ch--------hhhhheehccccchhH--HHhhhhhhhcccccccc Q lcl|NC_018861. 1 MADKYLLDESTKEKFIT----------SNLYPNL-NE--------SEKNIMRTVLENQGNE--VKMLMESTVTGDIAKFT 59 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~----------~~~~~~~-~~--------~~~~~~~~l~~n~~~~--~~~i~est~t~~v~~~~ 59 (465) +++..-+++++.+.-.- ....+.. .+ .++.+-+-.+..++.. +.-...++++++-...- T Consensus 44 ~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~ 123 (392) T protein:vir:13 44 LTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLS 123 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCcccc Confidence 33333333333211000 0000000 00 0000000000110000 00011112222111111 Q ss_pred chhh-hhhhhhhhh-hhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccccccccccccccc Q lcl|NC_018861. 60 PILV-PVIRRALPS-LIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEVS 137 (465) Q Consensus 60 P~l~-~l~~ra~~~-lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~s 137 (465) |.+. .++.+.... .+..+++-|-|+++...+-+- + ... 
T Consensus 124 ~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~-~--~~~------------------------------------- 163 (392) T protein:vir:13 124 RTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFT-V--ITG------------------------------------- 163 (392) T ss_pred ccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEE-E--EcC------------------------------------- Confidence 2221 122222222 223333333333221111100 0 000 Q ss_pred ccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccccccccccccccc Q lcl|NC_018861. 138 FKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYAT 217 (465) Q Consensus 138 ~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~T 217 (465) .....+ T Consensus 164 ----------------------------------~~~a~~---------------------------------------- 169 (392) T protein:vir:13 164 ----------------------------------RATAGI---------------------------------------- 169 (392) T ss_pred ----------------------------------Ccceee---------------------------------------- Confidence 000000 Q ss_pred hhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh-------- Q lcl|NC_018861. 218 AAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK-------- 289 (465) Q Consensus 218 a~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~-------- 289 (465) .+| |..++|-...+++++...+..+-...+|-||.+|= ..|.++.|.+-|...|..-+|..+|.- T Consensus 170 -v~E--~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~G 242 (392) T protein:vir:13 170 -VGE--TAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRG 242 (392) T ss_pred -ecc--cccccccccceeeEEeeeeeEEeeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccc Confidence 001 22344444456667777777777788999999982 468889999999999999999999852 Q ss_pred hhheeeeee-eeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccccc Q lcl|NC_018861. 
290 ANEVATVCT-DFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAI 368 (465) Q Consensus 290 l~~~at~~~-~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~ 368 (465) +...++... .+.... .+.- .+..|...+. .+... +.+....|+++.....|... + ...+.....+ T Consensus 243 il~~~~~~~~~~~~~~-~~~~---~~d~l~~~~~----~l~~~--~~~~a~~v~n~~~~~~l~~l---k-d~~G~~l~~~ 308 (392) T protein:vir:13 243 ILTDATGANAAFGEAD-ADSK---VSDALIDLFH----EVPSA--YRKNAKFVVNDLRAAQMRKL---K-DANGQYLWQS 308 (392) T ss_pred cccccccccccccccc-cccc---cHHHHHHHHH----hhhhh--hhcCCEEEEcHHHHHHHHHh---h-ccCCceeecC Confidence 111111111 111111 1111 1222322222 22211 22334568899998888753 1 1111111111 Q ss_pred ccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccccceeeeeCCCcccceeeeeeeeeeeecCccc Q lcl|NC_018861. 369 NSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNITLQQNLTDPVSGQPAMILNNRYDVVATPLHP 448 (465) Q Consensus 369 ~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~ 448 (465) +.+ ..--++| .|++||++.+.|.+-|++|-- + -.++..-......+..|+-.-...++|...+=+-..+..| T Consensus 309 ~~~-~g~~~~l-~G~Pv~~~~~~~~~~i~~Gdf--~----~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 380 (392) T protein:vir:13 309 ALT-VGAPDTF-NGKVVETDDGMPADKVLFADL--S----KYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA 380 (392) T ss_pred CcC-CCCCcee-cceeeEEcCCCCCCcEEEeec--c----ceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecc Confidence 100 0111355 468999999988776665421 0 0111111122233333443333344444433332334443 Q ss_pred ccccceEEEeeccceeC Q lcl|NC_018861. 
449 EAFIRTFAVNLNNYIIS 465 (465) Q Consensus 449 ~~~~~~f~~~~~~~~~~ 465 (465) .+ |+ -++-+--+ T Consensus 381 ~A----~~-~~~~~~aa 392 (392) T protein:vir:13 381 RG----AK-VLTVTPAA 392 (392) T ss_pred cc----eE-EEEeeccC Confidence 33 22 12222222 No 141 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=30.54 E-value=1.6 Score=19.51 Aligned_cols=302 Identities=13% Similarity=0.073 Sum_probs=108.8 Q ss_pred CCccchhhhHHHh----------------hhhhhccccccChhhhhheeh----ccccch--hHHHh------hhhh--- Q lcl|NC_018861. 1 MADKYLLDESTKE----------------KFITSNLYPNLNESEKNIMRT----VLENQG--NEVKM------LMES--- 49 (465) Q Consensus 1 ~~~~~~~~e~~~e----------------~~~~~~~~~~~~~~~~~~~~~----l~~n~~--~~~~~------i~es--- 49 (465) +..-. ..|++.+ ...++...+...+.|..-.+. +...++ ..+.. +.|. T Consensus 52 i~~~e-~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (435) T protein:vir:80 52 IERAE-AAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAM 130 (435) T ss_pred HHHHH-HHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhh Confidence 00000 0000000 000110001101111111111 111110 00000 0000 Q ss_pred hhccccccccchhhh------hhhhhhhhhhhhhh-eeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccc Q lcl|NC_018861. 50 TVTGDIAKFTPILVP------VIRRALPSLIGTEI-AGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESA 122 (465) Q Consensus 50 t~t~~v~~~~P~l~~------l~~ra~~~lI~~DI-wGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~ 122 (465) ..+......+..||+ ++.++-++.+...+ +=+-||+.+. +-+.. .. 
+ T Consensus 131 ~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~~~p~---~~---~------------------- 184 (435) T protein:vir:80 131 SLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPR---LK---G------------------- 184 (435) T ss_pred hhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCc-eEEEE---Ee---C------------------- Confidence 001111112222332 22333333333333 1122332221 00000 00 0 Q ss_pred cccccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccC Q lcl|NC_018861. 123 NKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEA 202 (465) Q Consensus 123 ~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a 202 (465) +| ...+ T Consensus 185 ---------------------------------~~----------------~a~~------------------------- 190 (435) T protein:vir:80 185 ---------------------------------GA----------------IVGY------------------------- 190 (435) T ss_pred ---------------------------------Cc----------------ceee------------------------- Confidence 00 0000 Q ss_pred ccccccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHh Q lcl|NC_018861. 203 LWLKVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEI 282 (465) Q Consensus 203 ~~~~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEI 282 (465) .+| |...++...++++++...+.-+-....|-||.+|-.- +.|.|+.|.+-|+.-|...+ T Consensus 191 ----------------v~E--~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~a~~~~~ 250 (435) T protein:vir:80 191 ----------------IGA--DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIGARE 250 (435) T ss_pred ----------------ecc--CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHHHHHHHH Confidence 001 2234455556778888888887788899999999332 45788999999999999999 Q ss_pred hHHHHhhh---------hheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhc Q lcl|NC_018861. 
283 DRTIIEKA---------NEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEI 353 (465) Q Consensus 283 Nreii~~l---------~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~ 353 (465) ++.+|..= ...+....... ...+....+-...+...+..+.+. -.+......|+++.....|... T Consensus 251 d~a~l~G~G~~~~p~Gi~~~~~~~~~~~--~~~~~~~~~~~~d~~~~~~~~~~~----~~~~~~~~~vmn~~~~~~L~~l 324 (435) T protein:vir:80 251 DKAFIRDDGTANTPKGLRFWALPGNVIT--ASDGSTLQKIETDLGKAILALENA----DANLTQPGWIMAPRTFRFLEGL 324 (435) T ss_pred HHHhhccCCCCCcccceeecccccceee--cccccchhhHHHHHHHHHHHhhcc----ccccccCEEEEcHHHHHHHHhh Confidence 99888531 00011111100 111111111122233333322222 1122445678999999999775 Q ss_pred CcccccCCcccccccccccceEEEEecCceEEEEeCCCCcc----------------eEEEEEecCCCccceeE---Ee- Q lcl|NC_018861. 354 GSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFD----------------YCTVAYKGASNFDAGIF---FA- 413 (465) Q Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~d----------------y~~vg~kg~~~~d~glf---y~- 413 (465) .-- .+.... +... -|+| .|++||++.+.|.+ +++||..+.-..+-+-. .- T Consensus 325 kd~----~G~~l~-~~~~----~~~l-~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~ 394 (435) T protein:vir:80 325 RDG----NGNKVY-PELA----NGML-KGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDA 394 (435) T ss_pred hcc----CCceec-cCCC----CCeE-eeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEEEecccccccc Confidence 311 111111 1111 1455 45799988875432 12233333222111000 00 Q ss_pred --ccccc-----cee--------eeeCCCcccceeeeeeeeee Q lcl|NC_018861. 414 --PYNIT-----LQQ--------NLTDPVSGQPAMILNNRYDV 441 (465) Q Consensus 414 --PY~~~-----~~~--------~~~dp~s~qp~~~~~tRY~l 441 (465) .++.+ ... .+.+|++|+..-|+. 
||- T Consensus 395 ~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~--~~~ 435 (435) T protein:vir:80 395 DGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVA--WGA 435 (435) T ss_pred ccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccC--CCC Confidence 00000 000 122444444332221 111 No 142 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=28.12 E-value=1.8 Score=19.21 Aligned_cols=267 Identities=14% Similarity=0.043 Sum_probs=100.8 Q ss_pred ccccchhhhheeeeecc--------Ccccccccccc---ccccccccCCccCCCcccccCccccccccccccc-cchhhh Q lcl|NC_018861. 154 QAGTDNIVNVLLRLESN--------STGSVAIGDEM---DKAATFATKKATVEAVYTNEALWLKVLKNYTGPY-ATAAGE 221 (465) Q Consensus 154 ~TGPTgLifam~s~y~~--------~~g~ea~~~e~---~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~-~Ta~~E 221 (465) |+- +-++ ..|.. ..-..++.+.. ...+.++...-.+.- ...+...+.++- .... + T Consensus 1 MA~---~n~a--~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i-------~~~gl~DY~R~~~g~~~-g 67 (299) T protein:vir:79 1 MAA---LNYA--KEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTI-------STTGRVDSNRDTIAVAQ-R 67 (299) T ss_pred Ccc---chhH--HHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecc-------ccccccccccCCCcccc-c Confidence 110 0000 11110 00000111110 011111111111100 001111111110 0111 1 Q ss_pred ccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhC-CCHHHHHHHHHHHHHHHHhhHHHHhhhhheeee-eee Q lcl|NC_018861. 222 KLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHG-INAEKELADILSAEVALEIDRTIIEKANEVATV-CTD 299 (465) Q Consensus 222 ~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG-lDAe~EL~niLstEImlEINreii~~l~~~at~-~~~ 299 (465) ....+|.++-+.=||. |...-+ ..|.-.-++ +.+-.-+.-...+.++-||++-++++|...++. ++. 
T Consensus 68 ~~~~~~~t~~ldqdr~------~~f~vD-----~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~ 136 (299) T protein:vir:79 68 NYDNAWEPKVLTNQRK------WSTLVH-----PADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNT 136 (299) T ss_pred ccCcceeEEEeecccc------ceeccc-----hhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCc Confidence 1222333332222221 111100 001111011 111122233345566789999999999665532 221 Q ss_pred eeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEe Q lcl|NC_018861. 300 FDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKF 379 (465) Q Consensus 300 ~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l 379 (465) -+-..++..-..+..+.+..++++. =--..+.++++||.+-.+|..++-|.-......... .-...||.| T Consensus 137 ~~~~~~T~~n~y~~i~~~~~~lde~-------~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~---~~~g~Vg~i 206 (299) T protein:vir:79 137 ADTTVLTTTNVLEVFDKLMEKMTEA-------RVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGT---SLNRQTTDI 206 (299) T ss_pred ccccccCHHHHHHHHHHHHHHHHhc-------CCCCCCeEEEeCHHHHHHHhhchhhhcccccccccc---eeeeeeeee Confidence 1111123333445555555544432 222367899999999999999988765544322211 224688999 Q ss_pred cCceEEEEeCCC--Ccce-EEEEEe-cCCCcc-ceeEEecccc-----cceeeeeCCCcccceeeeeeeeeeeecCcccc Q lcl|NC_018861. 380 DNRYDVIVDNFA--EFDY-CTVAYK-GASNFD-AGIFFAPYNI-----TLQQNLTDPVSGQPAMILNNRYDVVATPLHPE 449 (465) Q Consensus 380 ~~~~~vy~d~~~--~~dy-~~vg~k-g~~~~d-~glfy~PY~~-----~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~ 449 (465) + ++.||.-|-. ...| ++-|++ |...-+ -=+...|-.+ ....++.+|...|-. + .+.. .+ T Consensus 207 d-G~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~in~ii~~~~a~~~~~K~~~~~~~~P~~~~~~-----~-~~~~----~r 275 (299) T protein:vir:79 207 D-TVKIIKVPSNLMKTAYDFTTGWKVGAGAKQIFMSLVHPSAIITPVSYQFSKLDEPTAVTEG-----K-YFYF----EE 275 (299) T ss_pred c-ceEEEEechhhcCccceeccCccccCcccccceEEEcCCeeeeeEeeeeEEeecCCCCCcc-----c-eeee----ee Confidence 7 6899975442 2222 344554 221111 1122233322 234445666666542 1 1111 12 Q ss_pred cccceEE-EeeccceeC Q lcl|NC_018861. 
450 AFIRTFA-VNLNNYIIS 465 (465) Q Consensus 450 ~~~~~f~-~~~~~~~~~ 465 (465) .|.--|+ -|--.-|.. T Consensus 276 ~y~d~~v~~nk~~~i~~ 292 (299) T protein:vir:79 276 SFEDVFILNKKADAIQF 292 (299) T ss_pred eeeeeeeeccccCeEEE Confidence 2333344 222222222 No 143 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=27.93 E-value=1.8 Score=19.19 Aligned_cols=289 Identities=13% Similarity=0.056 Sum_probs=140.7 Q ss_pred ccccchhhhheeeeeccCccc-----cccccccccccccccCCccCCCcccccCccccccccccccccchh----hhccC Q lcl|NC_018861. 154 QAGTDNIVNVLLRLESNSTGS-----VAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAA----GEKLG 224 (465) Q Consensus 154 ~TGPTgLifam~s~y~~~~g~-----ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~----~E~lg 224 (465) |+-|. .-.|..+....++ |-+..|+.++|....-...-...-+ .-.. ......-.|-.++. ++.+. T Consensus 1 ms~~~---~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rt-i~~g-~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLN---DLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRD-LRGS-NVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCCcc---cchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceee-eccc-eeEEEeeeeeeeeecccCCcCcC Confidence 22221 1112223333333 3344566665544221110000000 0000 00001111222222 22222 Q ss_pred -C--chhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeee Q lcl|NC_018861. 225 -K--DMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFD 301 (465) Q Consensus 225 -~--~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~ 301 (465) + ...|+-++||..- =+...|.-.-|.++ | .|..+|++.-+...++.+.++-+++.|..-+.....-. 
T Consensus 76 ~~~~~~~k~~itVD~ll--------~a~~~I~dlDe~~~-~-yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~ 145 (335) T protein:vir:63 76 RSRVVNDKWNLTVDTLL--------YLRHQFDHQDEWTQ-S-FDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVD 145 (335) T ss_pred CCCccccceEEEeccee--------echhhhhhHHHHhc-C-chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc Confidence 1 2456667777654 24445777777777 5 89999999999999999999999998865433211111 Q ss_pred ecc-----------CCcccHHHHHHHHHHHHHHHHHHHHHhccc--c-cccEEEecHHHHHHHHhcCcccccCCcccccc Q lcl|NC_018861. 302 VNS-----------ADGRWFIEKARGLSMRISNEAREIGRQTRK--G-GGNKLIVSPKVATILDEIGSFVLSPAGSKIDA 367 (465) Q Consensus 302 ~~~-----------~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~--~-~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~ 367 (465) +.. ..|.....-...|+..+...+...-.+=-- + ...|++++|++=.+|-..+-|.-..-+ +.++ T Consensus 146 ~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~-~s~~ 224 (335) T protein:vir:63 146 LEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQ-ATGA 224 (335) T ss_pred cCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccc-cccc Confidence 110 112111123445555555544444432211 1 348999999999999998766422111 1122 Q ss_pred cccccceEEEEecCceEEEEeCCCCcceEE--------EEEecCCCccceeEEeccc----cc---ceeeeeCCCcccce Q lcl|NC_018861. 368 INSGIKPNVGKFDNRYDVIVDNFAEFDYCT--------VAYKGASNFDAGIFFAPYN----IT---LQQNLTDPVSGQPA 432 (465) Q Consensus 368 ~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~--------vg~kg~~~~d~glfy~PY~----~~---~~~~~~dp~s~qp~ 432 (465) .++.....++.+ .+++||.-++-|.--++ =+|.|+...-.++||-|=. .+ +...--|+..+-- T Consensus 225 ~~~~~~g~v~~v-~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~- 302 (335) T protein:vir:63 225 TNDYVKSRVAIL-NGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSW- 302 (335) T ss_pred cccccCceeEEe-eceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhH- Confidence 333445677777 67889988886643222 1234444455788888742 22 2222334444432 Q ss_pred eeeeeeeeeeecCcccccccceEE-Eeeccce----eC Q lcl|NC_018861. 
433 MILNNRYDVVATPLHPEAFIRTFA-VNLNNYI----IS 465 (465) Q Consensus 433 ~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~~----~~ 465 (465) -+.++|+.-+=|+-|++ .+ +.|+|.= -+ T Consensus 303 -~i~~~~a~G~g~lRPe~----a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 303 -VLDTFQMYNIGARRPDT----AGAIELKGIGAFDITA 335 (335) T ss_pred -HhHHHHHcCCcccccce----EEEEEEcCCCceeecC Confidence 34556666555555543 22 3333321 11 No 144 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=26.53 E-value=1.9 Score=19.01 Aligned_cols=290 Identities=13% Similarity=0.063 Sum_probs=141.9 Q ss_pred ccccchhhhheeeeeccCccc-----cccccccccccccccCCccCCCcccccCccccccccccccccchh----hhcc- Q lcl|NC_018861. 154 QAGTDNIVNVLLRLESNSTGS-----VAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYATAA----GEKL- 223 (465) Q Consensus 154 ~TGPTgLifam~s~y~~~~g~-----ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~Ta~----~E~l- 223 (465) |+-|. .-.+..+.....+ |-+..|+.+.|....-...-...- +.-..+ .....-.|-.++. ++.+ T Consensus 1 ms~~~---~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~r-ti~~g~-s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLN---DLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIR-DLRGSN-VVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCccc---cccccccccccchhhhhhhhhhhHHHHHHHHhhhhcccccee-eeccce-eEEEeeeeeeeecccccCcccC Confidence 11111 0112222222222 334455555554322111000000 000000 0000011122222 2222 Q ss_pred CC--chhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeee Q lcl|NC_018861. 224 GK--DMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEKANEVATVCTDFD 301 (465) Q Consensus 224 g~--~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~~~~ 301 (465) +. ...|.-++||..- =+...|.-.-|.++ | .|..+|++.-+...++.+.++-+++.|..-+-....-. 
T Consensus 76 ~~~~~~~k~~itID~ll--------~a~~~VddlDe~~~-~-yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~ 145 (335) T protein:vir:78 76 RSRVVNDKWNLTVDTLL--------YLRHQFDHQDEWTQ-S-FDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVD 145 (335) T ss_pred CCCcccCCeEEEeccee--------echhhHhhHHHhhc-C-chhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 22 2355566666543 24455777777777 5 89999999999999999999999988855432111111 Q ss_pred e-----cc------CCcccHHHHHHHHHHHHHHHHHHHHHhcccc---cccEEEecHHHHHHHHhcCcccccCCcccccc Q lcl|NC_018861. 302 V-----NS------ADGRWFIEKARGLSMRISNEAREIGRQTRKG---GGNKLIVSPKVATILDEIGSFVLSPAGSKIDA 367 (465) Q Consensus 302 ~-----~~------~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~---~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~ 367 (465) . .+ ..|......+..|...+.+.+..+.++---. .+.|++++|++=.+|-..+-+.-..-+ ..++ T Consensus 146 ~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~-~s~~ 224 (335) T protein:vir:78 146 LEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQ-ATGA 224 (335) T ss_pred cCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccc-cccc Confidence 1 10 1122222346677777777777777665432 358999999999999988766432111 1122 Q ss_pred cccccceEEEEecCceEEEEeCCCCcceEE--------EEEecCCCccceeEEecccc-------cceeeeeCCCcccce Q lcl|NC_018861. 368 INSGIKPNVGKFDNRYDVIVDNFAEFDYCT--------VAYKGASNFDAGIFFAPYNI-------TLQQNLTDPVSGQPA 432 (465) Q Consensus 368 ~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~--------vg~kg~~~~d~glfy~PY~~-------~~~~~~~dp~s~qp~ 432 (465) .++.....++.+ .+++||.-++-|..-++ =+|+++-.--.++||-|=.- .++.+--|+..+-- T Consensus 225 ~~~~~~g~v~~v-~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~- 302 (335) T protein:vir:78 225 TNDYVKSRVAIL-NGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSW- 302 (335) T ss_pred ccccccceeEEe-eceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhH- Confidence 333445678887 67899998887754222 12333333336777766532 22333334444332 Q ss_pred eeeeeeeeeeecCcccccccceEEEeecccee----C Q lcl|NC_018861. 
433 MILNNRYDVVATPLHPEAFIRTFAVNLNNYII----S 465 (465) Q Consensus 433 ~~~~tRY~l~~nPf~~~~~~~~f~~~~~~~~~----~ 465 (465) -+.++|+.-+=|+-|++=. ++.|+|.-- + T Consensus 303 -~i~~~~a~G~g~lRPe~a~---~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 303 -VLDTFQMYNIGARRPDTAG---AIELKGIEAFDITA 335 (335) T ss_pred -hhhHHHHcCCcccCcceEE---EEEecCCCcccccC Confidence 3455666655555554321 123333211 1 No 145 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=25.75 E-value=2 Score=18.91 Aligned_cols=282 Identities=13% Similarity=0.051 Sum_probs=110.3 Q ss_pred cccccccccccccccccccccccccccccchhhhheeee-eccCccccccccccccccccccCCccCCCcccccCccccc Q lcl|NC_018861. 129 YTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRL-ESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 129 ~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~-y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) -+. +. .+..+....-.-+|.. .|....-+ .. ...+.+. ..+..+.+. .... ..... T Consensus 1 ~~~-~n-~ts~~qafi~~EiWsa--------~il~~l~~~Lv----~~~~~~~--~d~g~GDtV--~Ins-----Ig~~t 57 (322) T protein:vir:31 1 MST-GN-NTSNTQALIVSEIWAD--------EIEDILHEKLL----DVNIARV--VDFPDGDKL--TIPS-----VGTPV 57 (322) T ss_pred CCC-CC-CcccceEEeehhhhHH--------HHHHHhhhhhh----hhhhhcc--cccCCCCeE--Eecc-----ccccc Confidence 000 00 0000000111111110 00000000 00 0001110 011001100 0000 11111 Q ss_pred cccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_018861. 208 LKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTII 287 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii 287 (465) ...+...-... .+.+ +-.|+-+.|++. |--+++ .-|-++.-..|-...-....+..+..++++-+. 
T Consensus 58 V~dY~~~~~i~-~d~l--tt~~~~l~IDq~----KYfaf~-------VdDD~~Qa~~dl~~~~~~~aa~ala~~~D~fva 123 (322) T protein:vir:31 58 VRSRPEQGDFT-FDNL--DTGEISIILRDE----VYAGNA-------ISKKLRQDSRWISNVGAMLPAEQARAIMERYQT 123 (322) T ss_pred cccccCCCCcc-cccC--CCceEEEEEehh----hhhccc-------cchhHHHhhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 12221110000 0111 123444555541 211111 122222234444455555666666667777775 Q ss_pred hhhhhee----eeeeeeeecc------CCc---ccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcC Q lcl|NC_018861. 288 EKANEVA----TVCTDFDVNS------ADG---RWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIG 354 (465) Q Consensus 288 ~~l~~~a----t~~~~~~~~~------~~~---~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~ 354 (465) .-|...| .++..-.+++ +.| ....+.+..|..++++.+-= ..|+|+|+||++...|..++ T Consensus 124 ~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP-------~~gR~vVV~P~~~~~L~~i~ 196 (322) T protein:vir:31 124 DLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMP-------MGGMIGIIDPSVAHHLETIT 196 (322) T ss_pred HHHHHHhhhhhccCCcceecCCccceeccCCCchhhHHHHHHHHHHhccccCC-------CCCeEEEeCchhhhhhhhhh Confidence 5544322 1111111111 112 22335666676667665432 36799999999999887766 Q ss_pred cccccCCccc----ccccccccceEEEEecCceEEEEeCCCC-cce-EEEEEecCCCc--cceeEEecccccc------- Q lcl|NC_018861. 355 SFVLSPAGSK----IDAINSGIKPNVGKFDNRYDVIVDNFAE-FDY-CTVAYKGASNF--DAGIFFAPYNITL------- 419 (465) Q Consensus 355 ~~~~~~~~~~----~~~~~~~~~~~~G~l~~~~~vy~d~~~~-~dy-~~vg~kg~~~~--d~glfy~PY~~~~------- 419 (465) .+..-..... ..+--..+..++|.+ .|+.||+-+..+ ..| ++.|--|..-. .=.+|-|=-.++. T Consensus 197 ~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~-~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~ 275 (322) T protein:vir:31 197 NISNISNNPRWEGIVESGIAPDMQFVRSV-YGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAW 275 (322) T ss_pred hhhhhhccccccccccccchhhHHHHHHH-hceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHh Confidence 6521111100 011012233466766 578999877642 222 33344322211 0123333111111 Q ss_pred --e---eeeeCCCcccceeeeeeeeee-eecC-----cccccccceE Q lcl|NC_018861. 
420 --Q---QNLTDPVSGQPAMILNNRYDV-VATP-----LHPEAFIRTF 455 (465) Q Consensus 420 --~---~~~~dp~s~qp~~~~~tRY~l-~~nP-----f~~~~~~~~f 455 (465) + .--.|+++|.--+--+.|||- .+.| ....+.--|| T Consensus 276 ~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 276 KEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVCVLANADKVTF 322 (322) T ss_pred hhhhhhhcccCccccccceeeeeeecceeecccceEEEEeccccccC Confidence 1 224488888888888999997 4444 2344555566 No 146 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=25.72 E-value=2 Score=18.90 Aligned_cols=284 Identities=13% Similarity=0.052 Sum_probs=122.6 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccc-----cccccccccccccccCC Q lcl|NC_018861. 117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGS-----VAIGDEMDKAATFATKK 191 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~-----ea~~~e~~t~~s~~~~~ 191 (465) |.... + ...++| .++...+. |-+..|+.+.|....-. T Consensus 1 m~~~~-----------------------~------~~~t~~---------~~~~~~~~~~l~le~~~geV~~af~~~s~~ 42 (334) T protein:vir:80 1 MTYPA-----------------------A------NTHTRP---------GWGGANSDVSLHIEEHLGLVDASFMYSSKF 42 (334) T ss_pred CCCCc-----------------------C------CCcccc---------ccccccchheehhhhhhhHHHHHHHHhhhh Confidence 11110 0 000000 11111111 33344555544332110 Q ss_pred ccCC----CcccccCccccccccccccccch----hhhcc-CCc--hhhcceEEEEEEEEeecceecccchHHHHHHHHh Q lcl|NC_018861. 192 ATVE----AVYTNEALWLKVLKNYTGPYATA----AGEKL-GKD--MKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKA 260 (465) Q Consensus 192 ~~~~----~~~~~~a~~~~~~~~~~~~~~Ta----~~E~l-g~~--f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkA 260 (465) ..-. -..+.+.-.. .-|-.++ .++.+ +.. -.|+-++||-. 
|=+..-|+-.-|.++ T Consensus 43 ~~~~~~r~i~~G~s~~~~------~iG~~~~~~~~~g~~l~~~~~~~~~~~l~ID~~--------l~~~~~VddiD~~q~ 108 (334) T protein:vir:80 43 ASWMNVRSLRGTNQLRVD------RVGASTIAGRKAGEELVVQKNVSDKLNLTVDTV--------LYARHFFDKFDEWTS 108 (334) T ss_pred hccceeeeccccceEEEe------eecceeeeeecCCCCCCCCCcccCceEEEEeee--------eehhhhHhhHHHHhc Confidence 0000 0000000000 0011111 11111 111 23455555542 335666777777777 Q ss_pred hhCCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeeeee------------eeeeccCCccc--HHHHHHHHHHHHHHHHH Q lcl|NC_018861. 261 QHGINAEKELADILSAEVALEIDRTIIEKANEVATVCT------------DFDVNSADGRW--FIEKARGLSMRISNEAR 326 (465) Q Consensus 261 iHGlDAe~EL~niLstEImlEINreii~~l~~~at~~~------------~~~~~~~~~~~--~~e~~~~L~~~i~~~a~ 326 (465) | .|..+|++.-+..++..+.++-+++.|..-+.-.. .+.. ...|.- ..-....|+..+..... T Consensus 109 -~-~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~~~-~~~g~~~~~~~~~~~l~~a~~~a~~ 185 (334) T protein:vir:80 109 -N-LDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILLPS-TISGLAADAAADADVLVAAHRQGVE 185 (334) T ss_pred -C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcceee-cccccccchhhhHHHHHHHHHHHHH Confidence 5 89999999999999999999999988854331110 1111 112211 11124455555544444 Q ss_pred HHHHhccc---ccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEE--- Q lcl|NC_018861. 
327 EIGRQTRK---GGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAY--- 400 (465) Q Consensus 327 ~i~~~T~~---~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~--- 400 (465) ..-.+=-- -.++|+|++|+.-++|-..+-|.-..-++ .++.++.....++.+ +|++||.-+..|.--++..- T Consensus 186 ~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~-s~~~~~~~~g~i~~v-~G~~V~~Sn~~P~~~~t~~~~g~ 263 (334) T protein:vir:80 186 AMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGA-KEGGNSFVGGRIAML-NGVRVVETPRFPQSAITANALGA 263 (334) T ss_pred HHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecc-ccccccccceeEEEE-eceEEEeecCCCCcccccccccc Confidence 43333221 14689999999999999987765221010 111122234457776 78999999888865444221 Q ss_pred -----ecCCCccceeEEecccccce-------eeeeCCCcccceeeeeeeeee-eecCcccccccceEE-Eeecccee Q lcl|NC_018861. 401 -----KGASNFDAGIFFAPYNITLQ-------QNLTDPVSGQPAMILNNRYDV-VATPLHPEAFIRTFA-VNLNNYII 464 (465) Q Consensus 401 -----kg~~~~d~glfy~PY~~~~~-------~~~~dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~f~-~~~~~~~~ 464 (465) .|+-.-..++||-|=..+.- ..--|+..|--.|==+.=||. ..+| ++ .+ +.|+.+=- T Consensus 264 ~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g~lRP---ea----a~vv~~~~~~~ 334 (334) T protein:vir:80 264 DFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNIGQRRP---DA----VAVHDITVTNP 334 (334) T ss_pred ccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCCceecc---ce----EEEEEEeeecC Confidence 23333346788876543221 111133333221111111221 2222 10 01 11111111 No 147 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=311 Identities=13% Similarity=0.066 Sum_probs=111.1 Q ss_pred CCccchhhhHHHhhhhhhcc---c----ccc--------------------Chhhhhheehcccc--chhHHHhhhhhhh Q lcl|NC_018861. 
1 MADKYLLDESTKEKFITSNL---Y----PNL--------------------NESEKNIMRTVLEN--QGNEVKMLMESTV 51 (465) Q Consensus 1 ~~~~~~~~e~~~e~~~~~~~---~----~~~--------------------~~~~~~~~~~l~~n--~~~~~~~i~est~ 51 (465) |+-+.--.+++.|+=.-+++ . +.. .++++.+.++.-.+ ..++++.+.+... T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 44433222222222111110 0 000 01122221111000 1345555555433 Q ss_pred ccccccccchhh--hhhhhhhhhh----hhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCcccccccccc Q lcl|NC_018861. 52 TGDIAKFTPILV--PVIRRALPSL----IGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKD 125 (465) Q Consensus 52 t~~v~~~~P~l~--~l~~ra~~~l----I~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea 125 (465) .+..+.+. .|| -+..+.+-+| .-..+|-|+|+++++ |--+..... T Consensus 81 ~~~~~~gg-~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~------~i~~~~~~~---------------------- 131 (377) T protein:vir:96 81 NVGGKDKF-KLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRL------KALTAETSG---------------------- 131 (377) T ss_pred cCCCCCCc-eecCHHHHHHHHHHHHhhhhhhhhceeEecCCce------EEEEecCCc---------------------- Confidence 33333222 233 2334444333 334568888887653 211111000 Q ss_pred ccccccccccccccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCccc Q lcl|NC_018861. 126 DFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWL 205 (465) Q Consensus 126 ~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~ 205 (465) + +.|.. |. 
T Consensus 132 --------------~------a~wv~----------------------------e~------------------------ 139 (377) T protein:vir:96 132 --------------T------AVWGD----------------------------IF------------------------ 139 (377) T ss_pred --------------c------eeEee----------------------------cc------------------------ Confidence 0 00100 00 Q ss_pred cccccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHH Q lcl|NC_018861. 206 KVLKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRT 285 (465) Q Consensus 206 ~~~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINre 285 (465) . +..+.....|.++.|..-|... ....|-||.+| -.+|.|++|.+-|+..|..-+|+. T Consensus 140 -------~----~~~~~~~~~f~~i~l~~~kl~~-------~~~is~~ll~d----s~~~le~~i~~~l~~~~~~~~~~a 197 (377) T protein:vir:96 140 -------G----EIKGQLKQAFKEQDFSQFKLTA-------FVVIPKDALKF----GPKWLKQFITEQLKEAIAVALELA 197 (377) T ss_pred -------c----ccccccCccceeEeeeeeeEEe-------echhhHHHhhc----chhhHHHHHHHHHHHHHHHHHhhc Confidence 0 0000011235555555555543 24567777665 468899999999999999999999 Q ss_pred HHhh---------hhhe--eeeeee--------e-------eeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccE Q lcl|NC_018861. 286 IIEK---------ANEV--ATVCTD--------F-------DVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNK 339 (465) Q Consensus 286 ii~~---------l~~~--at~~~~--------~-------~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~ 339 (465) +|.- |+.. .++... + ++...+.....+....|...+-........... +++ . T Consensus 198 ~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~a-~ 275 (377) T protein:vir:96 198 IVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIA-GQV-K 275 (377) T ss_pred eEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhcccccccccccc-Cce-E Confidence 9862 2111 111100 0 000111111122222222222111111111111 233 2 Q ss_pred EEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCcc----ceeEEecc Q lcl|NC_018861. 
340 LIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFD----AGIFFAPY 415 (465) Q Consensus 340 ~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d----~glfy~PY 415 (465) .++.+..+.-+ .|.....++.+ .++-.|.=+++|..++..|.+-++.|..+. |. .++=...+ T Consensus 276 ~~mn~~t~~~~--~~~~~~~~~~G----------~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~--Y~i~~r~~~~i~~~ 341 (377) T protein:vir:96 276 LLLNPEDRWTL--EAKFTSRNQFG----------EYVTVLPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEY 341 (377) T ss_pred EEEchhhHHhc--cccccccCCCC----------CceeccCCCceEEecCCCCcccEEEEEcCc--EEEEEecccEEEee Confidence 44555544322 12222222111 112222114556666666655454444211 10 01111111 Q ss_pred cccceeeeeCCCcccceeeeeeeeeeeecCcccccccceEE-Eeeccc Q lcl|NC_018861. 416 NITLQQNLTDPVSGQPAMILNNRYDVVATPLHPEAFIRTFA-VNLNNY 462 (465) Q Consensus 416 ~~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~~~~~~f~-~~~~~~ 462 (465) ....+. .-|=.+=.+-|++- .|..++ .|+ +++++- T Consensus 342 ~~~~~~------~d~~~f~~~~r~dG--~~~d~~----a~~vl~l~~~ 377 (377) T protein:vir:96 342 DQTFAM------EDLQLYLTKNYFYG--KAKDNH----TAALLTLAGG 377 (377) T ss_pred hhhhhh------cCCeEEEEEEEEcC--EEecCC----cEEEEEEecC Confidence 100000 01111222333332 222222 244 555555 No 148 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=23.35 E-value=2.3 Score=18.58 Aligned_cols=265 Identities=14% Similarity=0.072 Sum_probs=110.3 Q ss_pred ccccccccccccccccccccccchhhhheeeeeccCccccccccccccc--------cccccCCccCCCcccccCccccc Q lcl|NC_018861. 136 VSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKA--------ATFATKKATVEAVYTNEALWLKV 207 (465) Q Consensus 136 ~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~--------~s~~~~~~~~~~~~~~~a~~~~~ 207 (465) ++-..+ .+|... =++.+-.+..++ +.++...-.+. 
....+ T Consensus 1 Main~a------------------------~~~~~~-Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~-------i~~~g 48 (290) T protein:vir:78 1 MAINYV------------------------DKYGKE-LDQKLVFGTYTNELETPNLLWLDAKTFKIQT-------ITTTG 48 (290) T ss_pred CchhHH------------------------HHHHHH-HHHHHHhhheeeeccccceeeccCCEEEEee-------eccCc Confidence 100000 011100 001111111111 11111110000 00011 Q ss_pred cccccccccchhhhccCCchhhcceEEEEEEEEeecceecccchHHHHHHHHhhhC-CCHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_018861. 208 LKNYTGPYATAAGEKLGKDMKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHG-INAEKELADILSAEVALEIDRTI 286 (465) Q Consensus 208 ~~~~~~~~~Ta~~E~lg~~f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG-lDAe~EL~niLstEImlEINrei 286 (465) ...+.+.-.-..++ ...+|.++-+.=||. |. ++|. ..|...-++ +.+...+.--.++.+.-||++-+ T Consensus 49 l~DY~R~~g~~~g~-v~~~~et~tl~qdR~------~~----F~vD-~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr 116 (290) T protein:vir:78 49 LKAHTRNKGYNEGS-ASNTNKSYTIDFDRD------VE----FFVD-VMDVDETGQALSAANVTKEFNSRHAGPEMDAYR 116 (290) T ss_pred ccccccCCCcccCc-cccceeeEEeecccc------ce----eecc-ccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHH Confidence 11121111111111 112333333332222 11 1111 112222121 23333344446677888999999 Q ss_pred HhhhhheeeeeeeeeeccCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCcccccCCccccc Q lcl|NC_018861. 287 IEKANEVATVCTDFDVNSADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKID 366 (465) Q Consensus 287 i~~l~~~at~~~~~~~~~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~ 366 (465) +++|...+.....-.-..++..-..+.++.+..++.+ . . ..+.++++||.+-.+|..++-|.......... 
T Consensus 117 ~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~lde----v----p-~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~ 187 (290) T protein:vir:78 117 FSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRKVKK----Y----G-TQNLVMYVSPDVMAALELSDDFVRAINVQNIG 187 (290) T ss_pred HHHHHhhhhccCcccccccCHHHHHHHHHHHHHHHHh----c----C-CCCeEEEECHHHHHHHhhChhhhccccccccc Confidence 9999766543221111122344455677777666653 1 2 36799999999999999998876443321111 Q ss_pred ccccccceEEEEecCceEEEEeCCC----CcceEEEEEecCC-Ccc-ceeEEecccc-----cceeeeeCCCcccceeee Q lcl|NC_018861. 367 AINSGIKPNVGKFDNRYDVIVDNFA----EFDYCTVAYKGAS-NFD-AGIFFAPYNI-----TLQQNLTDPVSGQPAMIL 435 (465) Q Consensus 367 ~~~~~~~~~~G~l~~~~~vy~d~~~----~~dy~~vg~kg~~-~~d-~glfy~PY~~-----~~~~~~~dp~s~qp~~~~ 435 (465) . ......||.|+ ++.||.-|-. ...-|+-|||... .-+ -=|.-.|=.+ ....++.+|...|-.-|. T Consensus 188 ~--~~i~~~V~~id-G~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~ 264 (290) T protein:vir:78 188 P--SSIETRITAID-GTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDGW 264 (290) T ss_pred c--ccccceeeeec-CcEEEEecccchhhhhhhhcccccccCCccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCccee Confidence 1 11245788886 5688876642 2233666887322 211 1122222222 234556678777765443 Q ss_pred eeeeeeeecCcccccccceEE-EeeccceeC Q lcl|NC_018861. 436 NNRYDVVATPLHPEAFIRTFA-VNLNNYIIS 465 (465) Q Consensus 436 ~tRY~l~~nPf~~~~~~~~f~-~~~~~~~~~ 465 (465) +.-| +.+.--|+ -|--.-|.. T Consensus 265 ~~~~---------r~y~d~~v~~nk~~~i~~ 286 (290) T protein:vir:78 265 LYQY---------RVYHDIFVLDQQKDGVIA 286 (290) T ss_pred eeee---------eeeeeeeeeccccCeeEE Confidence 3322 22333344 222222222 No 149 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=21.29 E-value=2.6 Score=18.28 Aligned_cols=289 Identities=13% Similarity=0.076 Sum_probs=121.3 Q ss_pred cccccccccccccccccccccccccccccccccccccccccchhhhheeeeeccCccc-----cccccccccccccccCC Q lcl|NC_018861. 
117 LKTESANKDDFNYTGTPIEVSFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGS-----VAIGDEMDKAATFATKK 191 (465) Q Consensus 117 ~~~a~~~ea~~~~Sg~~~~~s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~-----ea~~~e~~t~~s~~~~~ 191 (465) |. ..+..+. .... ++| .+.....+ |-+..|+.+.|....-. T Consensus 1 ma------~~~~~~~---------------~n~~----~~~---------~~~~~~~~~al~ie~~~geV~~~f~~~s~~ 46 (344) T protein:vir:10 1 MA------NMTGGQQ---------------LGTN----QGK---------DVMAAGDKLALFLKVFGGEVLTAFARTSVT 46 (344) T ss_pred Cc------ccccccc---------------CCcc----cCC---------ccCCccchhHHHHHHHHHHHHHHHHHHhhh Confidence 00 0000000 0000 000 00000011 22333444444322111 Q ss_pred ccCCCcccccCccccccccccccccchh----hhccCC---c--hhhcceEEEEEEEEeecceecccchHHHHHHHHhhh Q lcl|NC_018861. 192 ATVEAVYTNEALWLKVLKNYTGPYATAA----GEKLGK---D--MKEMGISVQRVLAEAKTRKVKGTYTIEMLQDLKAQH 262 (465) Q Consensus 192 ~~~~~~~~~~a~~~~~~~~~~~~~~Ta~----~E~lg~---~--f~EM~FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH 262 (465) ..-...- +.-..+. ....--|-.++. ++.+.. + -.|.-+.||+. |=+..-|.-.-|.++ | T Consensus 47 ~~~~~~r-~i~~g~s-~~~~~iG~~~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~--------~y~~~~VdDiD~~q~-~ 115 (344) T protein:vir:10 47 TSRHMVR-SISSGKS-AQFPVLGRTQAAYLAPGENLDDIRKDIKHTEKVITIDGL--------LTADVLIYDIEDAMN-H 115 (344) T ss_pred cccceee-eecccce-EEEEeeceeEEEeeecCCCCCCCCCCcccceEEEEEcch--------hhhhhhhhhHHHHhc-C Confidence 0000000 0000000 000001111111 222211 1 24555666652 223444444444444 3 Q ss_pred CCCHHHHHHHHHHHHHHHHhhHHHHhhhhheeee-e----------ee--eeec--cCCcccHHHHHHHHHHHHHHHHHH Q lcl|NC_018861. 263 GINAEKELADILSAEVALEIDRTIIEKANEVATV-C----------TD--FDVN--SADGRWFIEKARGLSMRISNEARE 327 (465) Q Consensus 263 GlDAe~EL~niLstEImlEINreii~~l~~~at~-~----------~~--~~~~--~~~~~~~~e~~~~L~~~i~~~a~~ 327 (465) .|.-+|++.-...++..++++-|++.|...+-. . .. .+.+ ...+.-.......++..|-+.+.. 
T Consensus 116 -~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~ 194 (344) T protein:vir:10 116 -YDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAA 194 (344) T ss_pred -cchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHH Confidence 799999999999999999999999888432110 0 00 1111 001101112234455555444444 Q ss_pred HHHhcccccccEEEecHHHHHHHHhcCcccccCCcccccccccccceEEEEecCceEEEEeCCCCcce------------ Q lcl|NC_018861. 328 IGRQTRKGGGNKLIVSPKVATILDEIGSFVLSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDY------------ 395 (465) Q Consensus 328 i~~~T~~~~~~~~~~s~~va~~L~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy------------ 395 (465) .-++=---.++|+|++|+.-.+|..++-+.....+ +..+.....||.+ .|++||.-++-|.-. T Consensus 195 Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~----~~~~~~~G~V~~v-~G~~V~~Sn~lp~~~~~~~~~~~tg~~ 269 (344) T protein:vir:10 195 LTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYA----ALIDPEKGSIRNV-MGFEVVEVPHLTAGGAGTSREGTTGQK 269 (344) T ss_pred HhhcCCCccCCEEEeChHHHHHHhhcccccccccc----cccceeeeEEEEE-eceEEEeccccccccCCcccccccCcc Confidence 44443334678999999999999998877543322 2233445688888 499999999865321 Q ss_pred -EEEEEec-----CCCccceeEEecccccc-------eeeeeCCCcccceeeeeeeeeeeecCcccc-cccceEEEeecc Q lcl|NC_018861. 396 -CTVAYKG-----ASNFDAGIFFAPYNITL-------QQNLTDPVSGQPAMILNNRYDVVATPLHPE-AFIRTFAVNLNN 461 (465) Q Consensus 396 -~~vg~kg-----~~~~d~glfy~PY~~~~-------~~~~~dp~s~qp~~~~~tRY~l~~nPf~~~-~~~~~f~~~~~~ 461 (465) ..-+..+ +-.-..||||-|=-.+. ....-|+..|-- .+..+|..-+=++-|+ +..--|. - T Consensus 270 ~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d--~i~g~~~~G~~vlRPe~a~~v~~~----~ 343 (344) T protein:vir:10 270 HAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQAD--QIIAKYAMGHGGLRPEAAGAVVFK----T 343 (344) T ss_pred ccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhHHHH--HHHHHhhcccceecccceEEEEee----c Confidence 1111111 11112467775543222 111124444432 2344444433333333 1111222 1 Q ss_pred c Q lcl|NC_018861. 462 Y 462 (465) Q Consensus 462 ~ 462 (465) . 
T Consensus 344 ~ 344 (344) T protein:vir:10 344 K 344 (344) T ss_pred C Confidence 1 No 150 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=20.13 E-value=2.8 Score=18.11 Aligned_cols=310 Identities=14% Similarity=0.082 Sum_probs=122.2 Q ss_pred CCc-----cchhh-h----HHHhhhhhhcccc------ccChhhhhheehccccc-----hhHHHhhhhhhh-ccccccc Q lcl|NC_018861. 1 MAD-----KYLLD-E----STKEKFITSNLYP------NLNESEKNIMRTVLENQ-----GNEVKMLMESTV-TGDIAKF 58 (465) Q Consensus 1 ~~~-----~~~~~-e----~~~e~~~~~~~~~------~~~~~~~~~~~~l~~n~-----~~~~~~i~est~-t~~v~~~ 58 (465) ..+ +.+.. | .+.++-....+.. .-.+++++...-|-... ..+.+-+..++. .|.+. T Consensus 40 ~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~-- 117 (407) T protein:vir:48 40 AGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYA-- 117 (407) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCccc-- Confidence 000 00000 0 0000100111110 00123333332222111 112223333322 11110 Q ss_pred cchhh--hhhhhhhhhhhhhhheeeeccCCCcceEEEEEEEecCCCCcccccccccccCccccccccccccccccccccc Q lcl|NC_018861. 59 TPILV--PVIRRALPSLIGTEIAGVQALKTPTAYLYAMVPHYVGDGNNSVSPTKNAIVLKLKTESANKDDFNYTGTPIEV 136 (465) Q Consensus 59 ~P~l~--~l~~ra~~~lI~~DIwGVQPMTgPTGLIFAMRSrY~~~~~~~~~~~~~aaf~~~~~a~~~ea~~~~Sg~~~~~ 136 (465) =|.-+ -++.++-...+-.+++.|-||++++.-+.-.. ++. 
T Consensus 118 iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~------~~~-------------------------------- 159 (407) T protein:vir:48 118 IPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNL------GGT-------------------------------- 159 (407) T ss_pred ccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEec------CCc-------------------------------- Confidence 12222 13344445556667788878777653322100 000 Q ss_pred cccccccccccccccccccccchhhhheeeeeccCccccccccccccccccccCCccCCCcccccCcccccccccccccc Q lcl|NC_018861. 137 SFKTATTVKGKIVYSEKQAGTDNIVNVLLRLESNSTGSVAIGDEMDKAATFATKKATVEAVYTNEALWLKVLKNYTGPYA 216 (465) Q Consensus 137 s~~tatt~ggait~~~~~TGPTgLifam~s~y~~~~g~ea~~~e~~t~~s~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 216 (465) . +.|. . T Consensus 160 ---~------a~~v----------------------------~------------------------------------- 165 (407) T protein:vir:48 160 ---T------SGWV----------------------------G------------------------------------- 165 (407) T ss_pred ---c------eeee----------------------------c------------------------------------- Confidence 0 0000 0 Q ss_pred chhhhccCCchhhcc-eEEEEEEEEeecceecccchHHHHHHHHhhhCCCHHHHHHHHHHHHHHHHhhHHHHhh------ Q lcl|NC_018861. 217 TAAGEKLGKDMKEMG-ISVQRVLAEAKTRKVKGTYTIEMLQDLKAQHGINAEKELADILSAEVALEIDRTIIEK------ 289 (465) Q Consensus 217 Ta~~E~lg~~f~EM~-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGlDAe~EL~niLstEImlEINreii~~------ 289 (465) | |...++.+ -.+++++...|.-+-...+|-||.+|- ..|.+++|.+-|+..|...+|+-+|.. T Consensus 166 ----E--~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p 235 (407) T protein:vir:48 166 ----E--TDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDA----FFNVEDWINSELALEFAEQEEIAFTSGDGSKKP 235 (407) T ss_pred ----c--cccccccccccceeEEeeeeeeEeehhhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhhhccCCCCcc Confidence 0 01111111 124555555555555668999999983 367889999999999999999988752 Q ss_pred --hhheeeeeee---e---e---ec-cCCcccHHHHHHHHHHHHHHHHHHHHHhcccccccEEEecHHHHHHHHhcCccc Q lcl|NC_018861. 
290 --ANEVATVCTD---F---D---VN-SADGRWFIEKARGLSMRISNEAREIGRQTRKGGGNKLIVSPKVATILDEIGSFV 357 (465) Q Consensus 290 --l~~~at~~~~---~---~---~~-~~~~~~~~e~~~~L~~~i~~~a~~i~~~T~~~~~~~~~~s~~va~~L~~~~~~~ 357 (465) +....++... . . +. ...+. -.+..|...+..+... +-..-..|+++.....|...-- T Consensus 236 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~d~i~~l~~~l~~~------~~~~a~~v~n~~~~~~L~~lkD-- 304 (407) T protein:vir:48 236 KGFLAYESTDEDDKTRAFGKLQHIASGAASG---VTADAIIKLIYTLRKA------HRSGAKFMMNNSSLFAIRLLKD-- 304 (407) T ss_pred ceeeecccccccccccccccccccccccccc---cChHHHHHHHHhhchh------hhcCCEEEEcHHHHHHHHHhhc-- Confidence 1111111100 0 0 00 01111 1133333333332222 1122346899999999876421 Q ss_pred ccCCcccccccccccceEEEEecCceEEEEeCCCCcceEEEEEecCCCccceeEEecccc-------cceeeeeCCCccc Q lcl|NC_018861. 358 LSPAGSKIDAINSGIKPNVGKFDNRYDVIVDNFAEFDYCTVAYKGASNFDAGIFFAPYNI-------TLQQNLTDPVSGQ 430 (465) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~G~l~~~~~vy~d~~~~~dy~~vg~kg~~~~d~glfy~PY~~-------~~~~~~~dp~s~q 430 (465) ..+..+-.++ ......++|. |++|+++.+.|. +| .. ..-++|..+.. ..+.+..||-.-+ T Consensus 305 --~~Gr~l~~~~-~~~g~~~~l~-G~PV~~~~~~p~----~~---~~--~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~ 371 (407) T protein:vir:48 305 --NDGNYLWRPG-IELGQPSSLA-GYGIVENEQMPD----IA---AD--AKAIAFGNFKRGYTIVDRIGTRILRDPYTNK 371 (407) T ss_pred --cCCceeeccC-cCCCCCceec-ceeeEEecCcCC----cc---CC--ccEEEEEeccccEEEEEeeceEEEeeccccC Confidence 1111111111 0011224664 679998887653 00 00 01122222211 1122234554434 Q ss_pred ceeeeee--eeeeeecCcccccccceEEEeeccceeC Q lcl|NC_018861. 431 PAMILNN--RYDVVATPLHPEAFIRTFAVNLNNYIIS 465 (465) Q Consensus 431 p~~~~~t--RY~l~~nPf~~~~~~~~f~~~~~~~~~~ 465 (465) ..++|.. |++.. +..+. -|++ |+..--+ T Consensus 372 ~~~~~~~~~r~d~~--v~~~~----a~~~-l~~~aa~ 401 (407) T protein:vir:48 372 PFVGFYTTKRTGGM--LVDSQ----AIKL-MKIGAAT 401 (407) T ss_pred CcEEEEEEEEeccE--Eeccc----ceEE-EEeeccC Confidence 5555554 66652 22222 2332 1111111 Done!