Query lcl|NC_020871.1_cdsid_YP_007676745.1 [gene=AG2_078] [protein=major capsid protein precursor] [protein_id=YP_007676745.1] [location=44643..46049] Match_columns 468 No_of_seqs 51 out of 55 Neff 5.0 Searched_HMMs 1612 Date Thu Nov 7 19:28:20 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_77 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_77_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:63741 Length: 468 100.0 4E-211 2E-214 1174.3 34.5 468 1-468 1-468 (468) 2 protein:vir:80491 Length: 467 100.0 2E-209 1E-212 1164.9 33.5 467 1-468 1-467 (467) 3 protein:vir:96666 Length: 462 100.0 1E-199 8E-203 1111.0 34.4 459 1-461 1-462 (462) 4 protein:vir:80835 Length: 464 100.0 3E-198 2E-201 1103.5 33.6 462 1-467 2-464 (464) 5 protein:vir:95603 Length: 463 100.0 7E-198 4E-201 1101.6 33.1 461 1-462 1-463 (463) 6 protein:vir:99311 Length: 463 100.0 7E-198 4E-201 1101.6 33.1 461 1-462 1-463 (463) 7 protein:vir:100851 Length: 514 100.0 4E-192 3E-195 1069.8 30.4 463 1-468 23-514 (514) 8 protein:vir:102823 Length: 470 100.0 2E-153 1E-156 858.2 27.7 437 1-463 1-470 (470) 9 protein:vir:8843 Length: 317 # 99.5 7.1E-16 4.4E-19 103.6 15.1 295 23-336 1-317 (317) 10 protein:vir:97255 Length: 310 99.3 2.1E-12 1.3E-15 84.6 19.3 301 1-459 1-310 (310) 11 protein:vir:104388 Length: 566 99.2 1.1E-12 7.1E-16 86.0 15.9 301 136-468 1-397 (566) 12 protein:vir:94933 Length: 330 99.2 4.6E-12 2.8E-15 82.8 18.1 324 1-460 1-330 (330) 13 protein:vir:9979 Length: 567 # 99.2 2.9E-12 1.8E-15 83.8 15.7 302 135-468 1-398 (567) 14 protein:vir:2792 Length: 567 # 99.2 2.9E-12 1.8E-15 83.8 15.7 302 135-468 1-398 (567) 15 protein:vir:10145 Length: 567 99.2 2.9E-12 1.8E-15 83.8 15.7 302 135-468 1-398 (567) 16 protein:vir:3306 Length: 567 # 99.2 2.9E-12 1.8E-15 83.8 15.7 302 135-468 1-398 (567) 17 protein:vir:827 Length: 567 # 99.2 3.4E-12 2.1E-15 83.5 15.5 302 135-468 1-398 (567) 18 protein:vir:93631 Length: 580 98.9 1.4E-10 8.8E-14 74.6 13.5 252 150-468 1-279 (580) 19 protein:vir:5120 Length: 615 # 98.6 7E-09 4.4E-12 65.3 14.9 283 163-468 1-379 (615) 20 protein:vir:105563 Length: 396 98.4 1.1E-09 6.8E-13 69.7 6.6 266 150-468 1-293 (396) 21 protein:vir:107802 Length: 681 98.3 2.3E-08 1.4E-11 62.4 11.7 302 108-468 1-373 (681) 22 protein:vir:98487 Length: 681 98.3 2.3E-08 1.4E-11 62.4 11.7 302 108-468 1-373 (681) 23 protein:vir:107423 Length: 681 98.3 2.3E-08 1.4E-11 62.4 11.7 302 108-468 1-373 (681) 24 protein:vir:96223 Length: 324 98.3 6.3E-07 3.9E-10 54.6 19.1 311 1-360 1-324 (324) 25 protein:vir:9309 Length: 324 # 98.2 1.1E-06 6.8E-10 53.3 18.9 317 1-360 1-324 (324) 26 protein:vir:78830 Length: 324 98.2 1.9E-06 1.2E-09 51.9 20.0 318 1-372 1-324 (324) 27 protein:vir:96392 Length: 324 98.2 1.9E-06 1.2E-09 51.9 20.0 318 1-372 1-324 (324) 28 protein:vir:103955 Length: 324 98.1 4.1E-06 2.5E-09 50.1 20.1 313 1-360 1-324 (324) 29 protein:vir:97148 Length: 324 98.1 3.6E-06 2.2E-09 50.5 19.0 317 1-360 1-324 (324) 30 protein:vir:100135 Length: 418 97.9 7.4E-06 4.6E-09 48.7 18.1 309 1-370 87-418 (418) 31 protein:vir:4953 Length: 397 # 97.9 9.9E-06 6.1E-09 48.0 17.9 295 1-338 72-397 (397) 32 protein:vir:8102 Length: 543 # 97.8 1.3E-05 8.1E-09 47.4 17.8 304 1-351 202-543 (543) 33 protein:vir:94142 Length: 304 97.8 1.8E-05 1.1E-08 46.7 19.6 291 26-368 1-304 (304) 34 protein:vir:105905 Length: 304 97.8 1.8E-05 1.1E-08 46.7 19.6 291 26-368 1-304 (304) 35 protein:vir:4339 Length: 395 # 97.8 1.7E-05 1E-08 46.8 18.1 304 1-339 68-395 (395) 36 protein:vir:99749 Length: 324 97.8 2.2E-05 1.4E-08 46.1 20.3 315 1-360 1-324 (324) 37 protein:vir:97053 Length: 390 97.7 2.4E-05 1.5E-08 45.9 20.4 303 1-337 64-390 (390) 38 protein:vir:7771 Length: 330 # 97.6 4E-05 2.5E-08 44.7 19.7 316 23-401 1-330 (330) 39 protein:vir:8187 Length: 311 # 97.5 5.6E-05 3.5E-08 43.9 19.5 294 34-369 1-311 (311) 40 protein:vir:95318 Length: 328 97.5 3.3E-06 2E-09 50.7 10.1 269 1-318 1-328 (328) 41 protein:vir:78523 Length: 338 97.4 6.6E-05 4.1E-08 43.5 21.0 318 17-370 1-338 (338) 42 protein:vir:1886 Length: 385 # 97.4 6.7E-05 4.1E-08 43.5 21.4 306 1-340 66-385 (385) 43 protein:vir:191 Length: 385 # 97.4 6.7E-05 4.1E-08 43.5 21.4 306 1-340 66-385 (385) 44 protein:vir:95763 Length: 297 97.4 8E-05 4.9E-08 43.1 18.8 291 21-352 1-297 (297) 45 protein:vir:104085 Length: 320 97.3 4.6E-05 2.9E-08 44.4 14.3 290 26-338 1-320 (320) 46 protein:vir:94673 Length: 419 97.3 0.00011 6.9E-08 42.3 18.7 311 1-341 71-419 (419) 47 protein:vir:103759 Length: 330 97.2 4.4E-05 2.7E-08 44.5 13.8 290 1-405 1-330 (330) 48 protein:vir:4226 Length: 326 # 97.2 6E-05 3.7E-08 43.7 14.5 296 11-338 1-326 (326) 49 protein:vir:80376 Length: 435 97.2 0.00014 8.6E-08 41.7 16.7 308 1-327 77-435 (435) 50 protein:vir:9574 Length: 300 # 97.2 0.00014 8.8E-08 41.7 16.5 281 33-339 1-300 (300) 51 protein:vir:81070 Length: 390 97.1 0.00016 9.6E-08 41.5 20.0 288 1-350 67-390 (390) 52 protein:vir:41 Length: 299 # N 97.1 0.00015 9.5E-08 41.5 15.4 281 28-330 1-299 (299) 53 protein:vir:2504 Length: 305 # 97.1 0.00015 9.4E-08 41.5 15.3 285 32-338 1-305 (305) 54 protein:vir:107388 Length: 331 97.0 4.7E-05 2.9E-08 44.3 12.1 236 1-294 1-331 (331) 55 protein:vir:98525 Length: 331 97.0 4.7E-05 2.9E-08 44.3 12.1 236 1-294 1-331 (331) 56 protein:vir:107826 Length: 331 97.0 4.7E-05 2.9E-08 44.3 12.1 236 1-294 1-331 (331) 57 protein:vir:10364 Length: 390 97.0 0.00023 1.4E-07 40.5 21.4 297 1-337 71-390 (390) 58 protein:vir:105038 Length: 428 96.9 0.00023 1.5E-07 40.5 15.1 311 1-373 71-428 (428) 59 protein:vir:94771 Length: 298 96.9 0.0002 1.2E-07 40.9 14.7 280 36-333 1-298 (298) 60 protein:vir:2430 Length: 318 # 96.9 0.00028 1.7E-07 40.1 18.1 301 4-373 1-318 (318) 61 protein:vir:1433 Length: 435 # 96.8 0.00031 1.9E-07 39.8 17.8 314 1-341 78-435 (435) 62 protein:vir:7324 Length: 335 # 96.8 0.00013 7.8E-08 42.0 12.6 253 1-295 1-335 (335) 63 protein:vir:3845 Length: 395 # 96.8 0.00035 2.2E-07 39.5 15.7 297 1-341 74-395 (395) 64 protein:vir:80684 Length: 315 96.8 0.00036 2.2E-07 39.5 17.8 305 26-394 1-315 (315) 65 protein:vir:4830 Length: 397 # 96.7 0.00037 2.3E-07 39.4 17.0 308 1-371 71-397 (397) 66 protein:vir:4997 Length: 397 # 96.7 0.00038 2.4E-07 39.3 18.2 306 1-355 71-397 (397) 67 protein:vir:78223 Length: 333 96.7 0.00041 2.6E-07 39.1 16.2 297 16-328 1-333 (333) 68 protein:vir:1638 Length: 298 # 96.5 0.00061 3.8E-07 38.2 18.1 283 36-367 1-298 (298) 69 protein:vir:103370 Length: 418 96.5 0.00035 2.1E-07 39.6 13.0 314 1-339 1-418 (418) 70 protein:vir:99920 Length: 311 96.4 0.00061 3.8E-07 38.2 19.3 284 32-367 1-311 (311) 71 protein:vir:3991 Length: 404 # 96.4 0.00011 7E-08 42.2 10.3 295 1-309 75-404 (404) 72 protein:vir:4856 Length: 293 # 96.4 0.00055 3.4E-07 38.5 13.7 280 22-380 1-293 (293) 73 protein:vir:2344 Length: 397 # 96.3 0.00073 4.5E-07 37.8 18.9 332 1-401 1-397 (397) 74 protein:vir:7409 Length: 408 # 96.2 0.0003 1.9E-07 39.9 11.3 296 1-332 70-408 (408) 75 protein:vir:1268 Length: 397 # 96.0 0.0011 7E-07 36.7 18.6 295 1-339 70-397 (397) 76 protein:vir:1025 Length: 408 # 95.9 0.00041 2.6E-07 39.1 11.0 299 1-332 70-408 (408) 77 protein:vir:9759 Length: 303 # 95.8 0.0015 9.1E-07 36.1 20.4 290 34-383 1-303 (303) 78 protein:vir:100247 Length: 425 95.6 0.0018 1.1E-06 35.7 17.6 310 1-368 96-425 (425) 79 protein:vir:96442 Length: 418 95.6 0.00064 4E-07 38.1 10.6 328 1-369 30-418 (418) 80 protein:vir:81160 Length: 371 95.3 0.0023 1.4E-06 35.0 18.3 305 1-368 59-371 (371) 81 protein:vir:4511 Length: 409 # 95.3 0.0024 1.5E-06 35.0 21.4 313 1-355 71-409 (409) 82 protein:vir:104256 Length: 458 94.4 0.0045 2.8E-06 33.5 15.1 311 1-369 114-458 (458) 83 protein:vir:9410 Length: 415 # 94.4 0.0046 2.8E-06 33.4 19.2 318 1-376 68-415 (415) 84 protein:vir:98339 Length: 415 94.0 0.0058 3.6E-06 32.8 20.5 319 1-376 68-415 (415) 85 protein:vir:81100 Length: 415 94.0 0.0058 3.6E-06 32.8 20.5 319 1-376 68-415 (415) 86 protein:vir:79987 Length: 415 94.0 0.0058 3.6E-06 32.8 20.5 319 1-376 68-415 (415) 87 protein:vir:4600 Length: 415 # 93.8 0.0064 3.9E-06 32.6 19.0 318 1-376 68-415 (415) 88 protein:vir:4700 Length: 415 # 93.8 0.0064 3.9E-06 32.6 19.0 318 1-376 68-415 (415) 89 protein:vir:485 Length: 407 # 93.7 0.0066 4.1E-06 32.5 19.9 310 1-385 69-407 (407) 90 protein:vir:81227 Length: 413 93.5 0.0073 4.5E-06 32.3 16.4 312 1-357 58-413 (413) 91 protein:vir:1328 Length: 392 # 93.2 0.0083 5.2E-06 32.0 17.0 306 1-351 69-392 (392) 92 protein:vir:4197 Length: 314 # 92.8 0.0099 6.2E-06 31.6 17.9 288 19-342 1-314 (314) 93 protein:vir:95376 Length: 425 92.6 0.011 6.6E-06 31.4 16.4 295 1-355 90-425 (425) 94 protein:vir:6212 Length: 434 # 92.4 0.012 7.2E-06 31.2 17.9 305 1-355 75-434 (434) 95 protein:vir:4092 Length: 390 # 92.2 0.012 7.7E-06 31.0 15.5 310 1-343 38-390 (390) 96 protein:vir:9820 Length: 272 # 92.1 0.013 7.8E-06 31.0 18.2 261 27-342 1-272 (272) 97 protein:vir:3033 Length: 272 # 92.1 0.013 7.8E-06 31.0 18.2 261 27-342 1-272 (272) 98 protein:vir:102119 Length: 404 91.7 0.015 9.1E-06 30.6 19.0 313 1-354 62-404 (404) 99 protein:vir:5739 Length: 366 # 90.1 0.023 1.4E-05 29.6 17.3 309 1-349 7-366 (366) 100 protein:vir:95963 Length: 395 89.7 0.025 1.5E-05 29.4 11.3 304 1-346 46-395 (395) 101 protein:vir:78350 Length: 383 89.5 0.026 1.6E-05 29.3 10.5 291 1-329 55-383 (383) 102 protein:vir:3870 Length: 400 # 89.3 0.027 1.7E-05 29.2 14.6 277 1-351 82-400 (400) 103 protein:vir:3158 Length: 321 # 88.4 0.032 2E-05 28.7 17.7 306 1-354 1-321 (321) 104 protein:vir:8420 Length: 477 # 87.7 0.037 2.3E-05 28.4 18.7 336 1-401 90-477 (477) 105 protein:vir:9643 Length: 377 # 86.2 0.047 2.9E-05 27.9 15.4 289 1-337 38-377 (377) 106 protein:vir:4456 Length: 401 # 85.7 0.051 3.1E-05 27.7 18.9 308 1-381 70-401 (401) 107 protein:vir:105004 Length: 392 84.1 0.063 3.9E-05 27.2 20.0 296 1-359 59-392 (392) 108 protein:vir:102082 Length: 392 84.1 0.063 3.9E-05 27.2 20.0 296 1-359 59-392 (392) 109 protein:vir:107593 Length: 392 84.1 0.063 3.9E-05 27.2 20.0 296 1-359 59-392 (392) 110 protein:vir:102873 Length: 392 84.1 0.063 3.9E-05 27.2 20.0 296 1-359 59-392 (392) 111 protein:vir:98635 Length: 377 83.3 0.069 4.3E-05 26.9 14.0 310 1-368 51-377 (377) 112 protein:vir:107687 Length: 319 82.4 0.077 4.8E-05 26.7 16.0 299 1-378 1-319 (319) 113 protein:vir:96762 Length: 632 82.4 0.078 4.8E-05 26.7 16.0 294 1-332 288-632 (632) 114 protein:vir:100884 Length: 389 81.8 0.082 5.1E-05 26.5 20.6 307 1-387 71-389 (389) 115 protein:vir:101607 Length: 379 80.0 0.099 6.2E-05 26.1 17.8 293 1-378 71-379 (379) 116 protein:vir:7855 Length: 497 # 78.8 0.11 6.9E-05 25.8 17.5 305 1-356 113-497 (497) 117 protein:vir:101650 Length: 497 78.8 0.11 6.9E-05 25.8 17.5 305 1-356 113-497 (497) 118 protein:vir:80128 Length: 466 77.5 0.12 7.7E-05 25.5 13.3 320 1-359 91-466 (466) 119 protein:vir:100172 Length: 394 75.4 0.15 9.1E-05 25.1 16.7 307 1-371 74-394 (394) 120 protein:vir:1084 Length: 437 # 68.4 0.24 0.00015 24.0 15.6 301 1-360 117-437 (437) 121 protein:vir:6242 Length: 390 # 68.0 0.24 0.00015 23.9 19.3 285 1-351 73-390 (390) 122 protein:vir:103285 Length: 296 63.6 0.31 0.0002 23.3 14.9 278 32-391 1-296 (296) 123 protein:vir:80068 Length: 301 59.4 0.39 0.00024 22.8 13.7 274 37-378 1-301 (301) 124 protein:vir:93616 Length: 645 48.9 0.66 0.00041 21.6 19.5 324 1-386 280-645 (645) 125 protein:vir:97397 Length: 517 48.6 0.67 0.00041 21.6 16.3 294 1-340 198-517 (517) 126 protein:vir:79642 Length: 329 47.7 0.69 0.00043 21.5 15.2 301 1-381 6-329 (329) 127 protein:vir:9704 Length: 394 # 45.6 0.76 0.00047 21.2 18.1 287 1-354 78-394 (394) 128 protein:vir:5255 Length: 304 # 43.7 0.83 0.00052 21.0 10.6 266 37-322 1-304 (304) 129 protein:vir:93881 Length: 387 43.6 0.84 0.00052 21.0 13.2 296 1-329 71-387 (387) 130 protein:vir:9361 Length: 402 # 43.2 0.85 0.00053 21.0 13.7 296 1-329 86-402 (402) 131 protein:vir:4159 Length: 315 # 40.2 0.98 0.00061 20.6 16.5 292 1-336 1-315 (315) 132 protein:vir:962 Length: 397 # 38.2 1.1 0.00067 20.4 14.7 286 1-350 84-397 (397) 133 protein:vir:93742 Length: 274 35.9 1.2 0.00075 20.1 17.2 261 1-347 1-274 (274) 134 protein:vir:94494 Length: 274 33.4 1.4 0.00084 19.9 17.6 259 1-347 1-274 (274) 135 protein:vir:97433 Length: 274 33.4 1.4 0.00084 19.9 17.6 259 1-347 1-274 (274) 136 protein:vir:100632 Length: 381 33.3 1.4 0.00085 19.8 10.8 304 1-359 1-381 (381) 137 protein:vir:104342 Length: 314 28.4 1.8 0.0011 19.2 14.3 293 1-391 1-314 (314) No 1 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=100.00 E-value=3.6e-211 Score=1174.32 Aligned_cols=468 Identities=100% Similarity=1.378 Sum_probs=465.2 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) ||++|||+++||+|++++||+++|||+|||+|+|++|+||+||||||||++|++|+|+++||+||++|+|++++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y 80 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) +++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||++|||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyG 160 (468) T protein:vir:63 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEe Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~ 240 (468) ||+|.++|++++|||||||++||++|||||+||++||+++|||||++|++|||++||+|||+++|++|++++|.+||+|+ T Consensus 161 ds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q~~v~ 240 (468) T protein:vir:63 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) T ss_pred ccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEE Confidence 99999899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEE Q lcl|NC_020871. 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVS 320 (468) Q Consensus 241 ~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtav 320 (468) ++|++.+.+|++|++|++++|+|+||||+||+++++|++.++..|++|+|++++||+..+++|++.+++.++|+|+|++| T Consensus 241 ~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~v 320 (468) T protein:vir:63 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVS 320 (468) T ss_pred cCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCCccceeeecccCCcccCCCcceEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEecCCCCCCCC Q lcl|NC_020871. 321 SDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPET 400 (468) Q Consensus 321 n~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N~~iPgt 400 (468) |++|||+||+++++|+++.+++++|+|||+++++++|+||+|||++.++|+||||+||+++++++++++|+|+|++|||| T Consensus 321 s~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a~~gt~tf~D~n~~iPgT 400 (468) T protein:vir:63 321 SDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPET 400 (468) T ss_pred CCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeecCCCeEEEEcCCcccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeeeecccccCC Q lcl|NC_020871. 401 VDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIPVKNVHSN 468 (468) Q Consensus 401 ~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~ 468 (468) ++.|||||+|+||+|+|||||||||||++|+.++|||+|||+|+|+|||||++||||+|||||||||| T Consensus 401 ~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~~~~~~~ 468 (468) T protein:vir:63 401 VDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPVKNVHSN 468 (468) T ss_pred cceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeeeccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=100.00 E-value=1.9e-209 Score=1164.85 Aligned_cols=467 Identities=100% Similarity=1.378 Sum_probs=463.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) ||++||+ ++||+|++++||+|+|||+|||+|+|++|+||+||||||||++|++|+|+++||+||++|+|++++|||+|| T Consensus 1 ~~~~~~~-~~~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y 79 (467) T protein:vir:80 1 MPKNNKE-EVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 79 (467) T ss_pred CCCcchh-hhhhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhh Confidence 9999987 589999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) +++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+|||+|||| T Consensus 80 ~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyG 159 (467) T protein:vir:80 80 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 159 (467) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEe Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~ 240 (468) ||+|.++|++++|||||||++||++|||||+||++||+++|||||++|++|||++||+|||+++|++|++++|.+||+|+ T Consensus 160 ds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v~ 239 (467) T protein:vir:80 160 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 239 (467) T ss_pred ccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEE Confidence 99999899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEE Q lcl|NC_020871. 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVS 320 (468) Q Consensus 241 ~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtav 320 (468) ++|++.+.+|++|++|++++|+|+||||+||+++++|++.++..|++|+|++++||+..+++|++.+++.++|+|+|++| T Consensus 240 ~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~v 319 (467) T protein:vir:80 240 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVS 319 (467) T ss_pred cCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCCccceeeecccCCcccCCCcceEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEecCCCCCCCC Q lcl|NC_020871. 321 SDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPET 400 (468) Q Consensus 321 n~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N~~iPgt 400 (468) |++|||+||+++++|+++.+++++|+|||+++++++|+||+|||++.++|+||||+||+++++++++++|+|+|++|||| T Consensus 320 s~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a~~gt~tf~D~n~~iPgT 399 (467) T protein:vir:80 320 SDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPET 399 (467) T ss_pred CCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeecCCCeEEEEcCCcccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeeeecccccCC Q lcl|NC_020871. 401 VDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIPVKNVHSN 468 (468) Q Consensus 401 ~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~ 468 (468) ++.|||||+|+||+|+|||||||||||++|+.++|||+|||+|+|+|||||++||||+|||||||||| T Consensus 400 ~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~~~~~~~ 467 (467) T protein:vir:80 400 VDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPVKNVHSN 467 (467) T ss_pred cceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeeeccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=100.00 E-value=1.3e-199 Score=1110.98 Aligned_cols=459 Identities=68% Similarity=1.060 Sum_probs=447.8 Q ss_pred CCCccc-chhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhc Q lcl|NC_020871. 1 MPKNNK-EEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK 79 (468) Q Consensus 1 ~~~~~~-~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~e 79 (468) ||...+ ...+||+|++.. |+++|||+|||+|+|++|++++||||||||++|++|+|+++||+|||+|+|++++|||+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~-e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~ 79 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKFQ-EEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQK 79 (462) T ss_pred Cccccccchhhhhhhchhh-HHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhh Confidence 886554 567899999996 999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020871. 80 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFF 159 (468) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~ 159 (468) |+++++||++||++|++|+|+++++||+|+||+++||||++++++|++++|||+++||+++|+++||+++||+|||+||| T Consensus 80 y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fy 159 (462) T protein:vir:96 80 YDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFY 159 (462) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEE Q lcl|NC_020871. 160 GDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQL 239 (468) Q Consensus 160 Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v 239 (468) ||++|++++. ++|||||||.+||+++|||||||++||+++|||||++|+++||+|||+|||+++|++|+|++|++|||+ T Consensus 160 gds~l~~~~~-~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~ 238 (462) T protein:vir:96 160 GDASLTADPT-GQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQL 238 (462) T ss_pred hhcccCCCcc-ccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEE Confidence 9999996654 469999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcc-cceeEEEEEE Q lcl|NC_020871. 240 VRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAE-DLAAHEYKVV 318 (468) Q Consensus 240 ~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~-~~~~y~YkVt 318 (468) |++|++++.+|++|++|++++|+|+||||++|+++..+++.....|++|+|+.+++|+.++.+|.|.+. |.++|+|+|+ T Consensus 239 ~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~ 318 (462) T protein:vir:96 239 MQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPATVKATVETGKKGLFTDEHDRAELTYKVV 318 (462) T ss_pred EcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCCceeEEEEeCCCCCCCCccCceeEEEEEE Confidence 999999999999999999999999999999999999999999999999999999999999989999887 7999999999 Q ss_pred EEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEeccccc-CCeeEEecCCCCC Q lcl|NC_020871. 319 VSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAE-NNVITFYDLNDSI 397 (468) Q Consensus 319 avn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~-~~t~tf~D~N~~i 397 (468) +||++|||+||++|++|+++.+++++|+|||+++++++|+||+|||+++++|.|++||||++++++ +++++|+|+|++| T Consensus 319 avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk~~~sg~y~li~rv~~~~~n~~gt~tf~D~n~~i 398 (462) T protein:vir:96 319 VNSDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQGRKTGDFYLIKRLGMKEVNDEGKLVFYDLNETI 398 (462) T ss_pred EECCCCccccceeeEeeeecccccceEEEEEcCCccccceEEEEEeecCCccccceeeeeeceeecCCcceeEeeccCCC Confidence 999999999999999999999999999999999999999999999999999999999999999986 7888999999999 Q ss_pred CCCccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeeee Q lcl|NC_020871. 398 PETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIP 461 (468) Q Consensus 398 Pgt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~ 461 (468) |||+++|||||+|++|+|+|||||||||||+.|++++|||+|||+|+|+|||||++||||+||- T Consensus 399 Pgt~~~fVge~~p~vi~~~qllpm~~~plA~~n~~~~waVl~yG~Lal~~Pk~~~~ikNv~~~~ 462 (462) T protein:vir:96 399 PETTDVFVGEMSPQVLHLFELLPMMKLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIV 462 (462) T ss_pred CCcccceeecCCchhhhhhhhhhhhhcCcccccchhhhhhhhhhHHHhhcccccEEEEEEEEeC Confidence 9999999999999999999999999999999999999999999999999999999999999997 No 4 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=100.00 E-value=3e-198 Score=1103.50 Aligned_cols=462 Identities=72% Similarity=1.131 Sum_probs=453.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) --+.|+++.+||+++ +++|||+|||+++|++|+||+||||||||++|++|+|+++||+|||+|+|++++|||+|| T Consensus 2 ~~~~n~~~~~~~~~e-----~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y 76 (464) T protein:vir:80 2 TEKKNTERQLTSVQE-----EVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKY 76 (464) T ss_pred CcchhhHhhcCcccH-----HHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 346789999999987 468999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) +++++||++||++|++|+|+++++||+|+||+++||||+++|++|++++|||++.||+.+|+++||+++||+|||+|||| T Consensus 77 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~a~FyG 156 (464) T protein:vir:80 77 DVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAKTIEWASFYG 156 (464) T ss_pred heeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEe Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~ 240 (468) |++|++.|++|+|||||||++||+++|||||||++||+++||+||++|+++||+|||+|||+++|++|+|++|++||+|+ T Consensus 157 ds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~ 236 (464) T protein:vir:80 157 DSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVI 236 (464) T ss_pred ccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEE Confidence 99999889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccc-eeEEEEEEE Q lcl|NC_020871. 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDL-AAHEYKVVV 319 (468) Q Consensus 241 ~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~-~~y~YkVta 319 (468) ++|++.+.+|++|++|+|++|+|+||||++|++++++++.+...|++|++|++++|+.++.+|+|.+++. +.|+|+|++ T Consensus 237 ~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~ 316 (464) T protein:vir:80 237 SDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVV 316 (464) T ss_pred cCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccceeEEEEEE Confidence 9999999999999999999999999999999999999999999999999999999999999999999885 569999999 Q ss_pred EcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEecCCCCCCC Q lcl|NC_020871. 320 SSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPE 399 (468) Q Consensus 320 vn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N~~iPg 399 (468) ||++|||+||+++++|+.+.+++++|+||+++++++.|+|++|||++.++|+||+|+||+++++++++++|+|+|++||| T Consensus 317 vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~~~g~f~~i~rv~~~~~~~gt~t~vD~n~~IPg 396 (464) T protein:vir:80 317 VSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGLETGLFYQIARVPASKAVEGVITFIDVNDEIPE 396 (464) T ss_pred ECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceEEEEeecCCCCceeEEEEEeeccccCCceEEEecccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeeeecccccC Q lcl|NC_020871. 400 TVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIPVKNVHS 467 (468) Q Consensus 400 t~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~ 467 (468) |++.|||||+|++|+|+|||||||||||+.|++++|||+|||+|+|+|||||+|||||+||+++++.. T Consensus 397 t~~vfVgems~~ti~l~ellPm~rlplA~~n~~~~waVl~YGaLal~aPk~~~~ikNv~~~~~~~~~~ 464 (464) T protein:vir:80 397 TADVFVGELTPSVVHLFELLPMMRLPLAQVNASVTFAVLWYGALALRAPKKWARIKNVKYIATGNVFN 464 (464) T ss_pred ceeEeeecCCchHHHHHHHHHhhhCCchhcccchhhhhhhhhHHhhhccccceEEEEEEEeecccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999987 No 5 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=100.00 E-value=6.7e-198 Score=1101.55 Aligned_cols=461 Identities=66% Similarity=1.035 Sum_probs=452.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) ||..+||+++...+-+...|+++|||+|||+++|++|+||+||||||||++|++|+|+++||+|||+|+|++++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y 80 (463) T protein:vir:95 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) T ss_pred CCcccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) +++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|+++||+++||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyG 160 (463) T protein:vir:95 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEe Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~ 240 (468) |++|++. +.++|||||||.+||++||||||||++||+++||+||++|+++||+|||+|||+++|++|+|++|++|||+| T Consensus 161 ds~l~~~-~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~ 239 (463) T protein:vir:95 161 DASLTSE-VEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) T ss_pred hhccCCC-cCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEE Confidence 9999965 555899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCC-CcCcccceeEEEEEEE Q lcl|NC_020871. 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKG-QFRAEDLAAHEYKVVV 319 (468) Q Consensus 241 ~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g-~~~~~~~~~y~YkVta 319 (468) ++|++++.+|++|++|++++|+|+||||++|++++.+++.+...|++|+||++++|+.+..++ +|.+++.+.|+|+|++ T Consensus 240 ~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~ 319 (463) T protein:vir:95 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVV 319 (463) T ss_pred cCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEE Confidence 999999999999999999999999999999999999999999999999999999998776655 5778999999999999 Q ss_pred EcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEeccccc-CCeeEEecCCCCCC Q lcl|NC_020871. 320 SSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAE-NNVITFYDLNDSIP 398 (468) Q Consensus 320 vn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~-~~t~tf~D~N~~iP 398 (468) ||++|||+||+++++|+++.++|++|+||++++++.+|+|++|||+++++|.|++|+||++++++ +++++|+|+|++|| T Consensus 320 ~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~D~n~~IP 399 (463) T protein:vir:95 320 NSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLP 399 (463) T ss_pred ECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEeecccccC Confidence 99999999999999999999999999999999999999999999999999999999999999985 68999999999999 Q ss_pred CCccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeeeec Q lcl|NC_020871. 399 ETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIPV 462 (468) Q Consensus 399 gt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~ 462 (468) ||++.|||||+|++|+|+|||||||||||+.|++++|||+|||+|+|+|||||++||||+|||| T Consensus 400 gt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:95 400 ETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred CceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=100.00 E-value=6.7e-198 Score=1101.55 Aligned_cols=461 Identities=66% Similarity=1.035 Sum_probs=452.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) ||..+||+++...+-+...|+++|||+|||+++|++|+||+||||||||++|++|+|+++||+|||+|+|++++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y 80 (463) T protein:vir:99 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) T ss_pred CCcccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) +++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|+++||+++||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyG 160 (463) T protein:vir:99 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEe Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~ 240 (468) |++|++. +.++|||||||.+||++||||||||++||+++||+||++|+++||+|||+|||+++|++|+|++|++|||+| T Consensus 161 ds~l~~~-~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~ 239 (463) T protein:vir:99 161 DASLTSE-VEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) T ss_pred hhccCCC-cCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEE Confidence 9999965 555899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCC-CcCcccceeEEEEEEE Q lcl|NC_020871. 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKG-QFRAEDLAAHEYKVVV 319 (468) Q Consensus 241 ~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g-~~~~~~~~~y~YkVta 319 (468) ++|++++.+|++|++|++++|+|+||||++|++++.+++.+...|++|+||++++|+.+..++ +|.+++.+.|+|+|++ T Consensus 240 ~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~ 319 (463) T protein:vir:99 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVV 319 (463) T ss_pred cCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEE Confidence 999999999999999999999999999999999999999999999999999999998776655 5778999999999999 Q ss_pred EcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEeccccc-CCeeEEecCCCCCC Q lcl|NC_020871. 320 SSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAE-NNVITFYDLNDSIP 398 (468) Q Consensus 320 vn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~-~~t~tf~D~N~~iP 398 (468) ||++|||+||+++++|+++.++|++|+||++++++.+|+|++|||+++++|.|++|+||++++++ +++++|+|+|++|| T Consensus 320 ~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~D~n~~IP 399 (463) T protein:vir:99 320 NSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLP 399 (463) T ss_pred ECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEeecccccC Confidence 99999999999999999999999999999999999999999999999999999999999999985 68999999999999 Q ss_pred CCccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeeeec Q lcl|NC_020871. 399 ETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIPV 462 (468) Q Consensus 399 gt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~ 462 (468) ||++.|||||+|++|+|+|||||||||||+.|++++|||+|||+|+|+|||||++||||+|||| T Consensus 400 gt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:99 400 ETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred CceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=100.00 E-value=4.2e-192 Score=1069.75 Aligned_cols=463 Identities=42% Similarity=0.631 Sum_probs=447.5 Q ss_pred CCCcccchhhcccChhhHHHHHHHH-hhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKS-FTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK 79 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ks-f~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~e 79 (468) --++|||++++..+|+. +.|| |+|||+|+|++|+||+||||||||++|++|+|+++||+||++|+|++++|||+| T Consensus 23 ~~~~~~~~~~~~~~~~~----~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~e 98 (514) T protein:vir:10 23 AFDTNKEDILNENLPEN----VKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLK 98 (514) T ss_pred eecCcHHHHHHHhcchh----hhhhhhccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhh Confidence 56899999999999875 5566 999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020871. 80 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFF 159 (468) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~ 159 (468) |+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||++.||+++++++||++|||+|||+||| T Consensus 99 y~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~~~dai~~ia~tiE~a~Fy 178 (514) T protein:vir:10 99 YTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQEYAAISTVIKTDEWAMFY 178 (514) T ss_pred hhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEE Q lcl|NC_020871. 160 GDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQL 239 (468) Q Consensus 160 Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v 239 (468) |||+|+ +++.++|||||||++||+++|||||||++||+++|||||++|+++||+|||+|||+++|++|+|+++++||++ T Consensus 179 GDs~L~-s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~ 257 (514) T protein:vir:10 179 GDADLT-SGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVM 257 (514) T ss_pred hcccCC-CccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEE Confidence 999999 6788899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccce-------- Q lcl|NC_020871. 240 VRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLA-------- 311 (468) Q Consensus 240 ~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~-------- 311 (468) |++|.+++++|+++++|++++|+|+||||+||+++|+|++.++..|+||+++++++++.+..+|.|.+++.+ T Consensus 258 ~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~ 337 (514) T protein:vir:10 258 LPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKGEVIL 337 (514) T ss_pred eecCccceeeeeeccceeEeccceeecCCeeecccccCccCCccCCcCCCCCcceEEEecCcccccCccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999877777777755552 Q ss_pred ------eEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecC--------------CCce Q lcl|NC_020871. 312 ------AHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGA--------------ETGL 371 (468) Q Consensus 312 ------~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~--------------~~G~ 371 (468) .|+|+|++||++|||.||+++++|+++++++++|+||++++++..|+|++|||++. ++|+ T Consensus 338 ~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~~p~yv~IYR~~~~~s~~~~~~~~~~~~tGd 417 (514) T protein:vir:10 338 NKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNVIPDYVAIYRKSNFDSDALEANTDASGNRGS 417 (514) T ss_pred ccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcccccceEEEEeccCCCcchhhhhccccccccc Confidence 68899999999999999999999999999999999999999999999999999964 5699 Q ss_pred eEEEEEEecccccCCeeEEecCCCCCCCCccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeecccee Q lcl|NC_020871. 372 FYLIARVPASKAENNVITFYDLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKW 451 (468) Q Consensus 372 f~~igrv~~s~~~~~t~tf~D~N~~iPgt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~ 451 (468) |++||||+++...+++++|+|+|++||||++.|||||+|+||+|+|||||||||||+.|+.++|||+|||+|+|+||||| T Consensus 418 f~li~rv~~~~~~~gttt~~D~n~~IPgT~~vfVgemspevi~l~ellPm~klpLA~~na~~~waVlwYGaLal~aPkr~ 497 (514) T protein:vir:10 418 YYLIGKVAVREQEGATITFVDTNARIAGCGDVFVIENRPETVALQEFIPLSKLNLAVTTTATSFVVLNYVALALYYPKRG 497 (514) T ss_pred eeEEEEEeeecCCCCeEEEeccccccCCcceeEEeeCchHHHHHHHHhhhhhcChhhhcchHHHHHHHHhHHHhhccccc Confidence 99999999988899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeeeeecccccCC Q lcl|NC_020871. 452 VRIKNVKYIPVKNVHSN 468 (468) Q Consensus 452 ~~ikNV~~~~~~~~~~~ 468 (468) ++||||+|+|||||.-. T Consensus 498 ~~IkNv~~~~v~~~~~~ 514 (514) T protein:vir:10 498 AVLENVVYSRVEDLELS 514 (514) T ss_pred eEEEeeeeeeccccccC Confidence 99999999999999877 No 8 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=100.00 E-value=1.7e-153 Score=858.15 Aligned_cols=437 Identities=20% Similarity=0.303 Sum_probs=368.8 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) ||-.. -+-.-|+.+|++++.- ..|+||||||||++|++|+|+++||+||++|+|++++|||+|| T Consensus 1 ~~~~~---------~~~~~~a~~~al~~a~-------~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey 64 (470) T protein:vir:10 1 MPYEH---------LKHLDEATLKALNAAG-------QVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEY 64 (470) T ss_pred CChhH---------hhhhhHHHHHHHHHhh-------hcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhh Confidence 66421 1223467888888833 2368899999999999999999999999999999999999999 Q ss_pred ceeee-eccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHh--hhcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 81 DVYMQ-HGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAG--LVNNIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 81 ~~~~~-hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~--lv~~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) +++++ ||+.||++| +|+|+++++||+|+||+++||||+++++||+++. ++|+++||+++++++||+++||+|||+| T Consensus 65 ~~~~~rhG~~g~s~~-~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~ 143 (470) T protein:vir:10 65 NVVTARHDKIGYAAF-REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLA 143 (470) T ss_pred hhhccccccccceee-cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhh Confidence 99886 899999866 9999999999999999999999999999999974 5688999999999999999999999999 Q ss_pred hhcccccccC-CCCCCCccccchhhhcC---ccceeeccCCCCCHHHHhhhhh--hhhhccCceEEEecCHHHHhhHHHh Q lcl|NC_020871. 158 FFGDSDLSDS-PEPQAGLEFDGLAKLIN---QDNVHDARGASLTESLLNQAAV--MISKGYGTPTDAYMPVGVQADFVNQ 231 (468) Q Consensus 158 f~Gd~~l~~~-~~~~~gleFDGl~~li~---~~nviDarG~~ls~~~l~~~a~--~i~~~fG~~td~~m~~~v~a~~~~~ 231 (468) ||||++|++. +++++|||||||.++|| |+|||||||++||+++||++|. +++++||+|||+|||+++|++|+|+ T Consensus 144 FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~ 223 (470) T protein:vir:10 144 FYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQAS 223 (470) T ss_pred hhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHh Confidence 9999999754 78899999999999998 6799999999999999999985 4479999999999999999999999 Q ss_pred hcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeeccc-----ccccccccccCCCCCC-------cceeEEecC Q lcl|NC_020871. 232 QLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENE-----QILDERILALPTAPQQ-------AKVTATQEA 299 (468) Q Consensus 232 ~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~-----n~l~~~~~~~p~ap~~-------~~vtat~~~ 299 (468) ++++|||+|++|++++++|++|++|+|++|+|+||||++|+++ +.|++.....+ +|+. ...++.... T Consensus 224 ~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~a-AP~~~~tv~~t~~~~a~~~~ 302 (470) T protein:vir:10 224 FYQISRVMTTADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVGDFA-APSNSWTVSTTDNFVTLPYN 302 (470) T ss_pred hcCceEEEEecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCcccCCcc-cCceeEEeecCCCceeeccc Confidence 9999999999999999999999999999999999999999964 45555433222 2221 112233333 Q ss_pred CCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEe Q lcl|NC_020871. 300 GKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVP 379 (468) Q Consensus 300 ~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~ 379 (468) ++.|.|.+++++.|.|+|+.++ ||| +|.++++|++++..+.++++++++..+ ++|++|||+++++|.|++++||+ T Consensus 303 sk~g~~~~~~v~sy~y~v~~~~--gds-~s~~v~vt~t~~~v~kgv~ltI~~~~~--v~yv~IYRk~~~s~~~~li~rv~ 377 (470) T protein:vir:10 303 SGLGDPANTTVYSYAFKAANFY--GES-AAKYIDVYIDSTEAGKGVRFQFHGLVN--VKWLDVYRKDPGSQEYKFYKRVK 377 (470) T ss_pred CCCCcccCcceeEEEEEEEEec--CCC-CcceEEEEEeeehhcceeEEEEecCCC--CcEEEEEeecCCCCceeEEEEEe Confidence 4455678777776666666555 777 455665555554444444445544444 68999999999999999999999 Q ss_pred cccccCCeeEEecCCCCCCCCccc----------eecCCChhhhhhhhhccc--hhccccccCccceeeeeeeeeheeec Q lcl|NC_020871. 380 ASKAENNVITFYDLNDSIPETVDV----------FVGEMSANVVHLFELLPM--MRLPLAQINASVTFAVLWYGALALRA 447 (468) Q Consensus 380 ~s~~~~~t~tf~D~N~~iPgt~~~----------fvge~np~vi~~~ellPm--~k~pla~~~~~~~~~V~~ygaL~l~a 447 (468) +++++++.++|+|.|++||+|++. |||||+|++++|++|+|| +|||++..+....|.| |+|+|+| T Consensus 378 v~~~ng~~~~~~D~~e~i~tt~~v~~~~~~Pgt~~Vgemsp~v~sl~~~l~m~l~klp~a~~~~~v~~~v---galal~a 454 (470) T protein:vir:10 378 VSTVNGDFTWIDDGHETVTTPSGVYRWKKIPGTGVVVGIDPNVTTMAVWIGMELYRLPPALTHDYVIWKV---ASVFSRA 454 (470) T ss_pred eeeccCCEEEEecccccCCCcceeeeecccCcceeccccCcchhhhhhhhhhhhhhcCHHHHHHHHHHHH---HHHHHhc Confidence 999999999999999999999975 999999999999999998 6666666665555655 9999999 Q ss_pred cceeEEEEEeeeeecc Q lcl|NC_020871. 448 PKKWVRIKNVKYIPVK 463 (468) Q Consensus 448 Pkk~~~ikNV~~~~~~ 463 (468) ||||++||||+||||- T Consensus 455 PKr~~~IkNV~~~~~~ 470 (470) T protein:vir:10 455 PEFNFLIVNVGQEPIV 470 (470) T ss_pred cccceEEEEeeeeecC Confidence 9999999999999998 No 9 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=99.51 E-value=7.1e-16 Score=103.63 Aligned_cols=295 Identities=13% Similarity=0.117 Sum_probs=184.8 Q ss_pred HHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeecccccccccccccccc Q lcl|NC_020871. 23 LKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP 102 (468) Q Consensus 23 ~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~ 102 (468) |-.-.-+| ++--+...+|+|..+|.++.-.+. .|+..|.|.++++|.++|....=..-. . .-..||+++. T Consensus 1 ma~~~~~~------~t~~~~g~~~dl~~~I~~isp~dT--Pf~S~i~~~~a~~~~~~W~~d~l~~~~-~-~~~~EG~da~ 70 (317) T protein:vir:88 1 MATPTNAV------STVEINGKREDLIDIIYNIAPYDT--PFMSAIGKGVATAITHEWQTDELRQPG-K-NTRVEGEDAT 70 (317) T ss_pred CCccccce------EeeeeeeeeechhhhheecCCccC--cceeeecCceecccEEEEEeeecCCcc-c-cccccCcccc Confidence 11111112 223566789999999999887766 556677889999999999975522111 1 1345886543 Q ss_pred ccC-cceEEEEEEEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccch Q lcl|NC_020871. 103 VSD-PNIRQKTVNMKFASDTKNISIAAGLVN--NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGL 179 (468) Q Consensus 103 ~~d-~~~~r~~~~~k~l~~~~~vs~~~~lv~--~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl 179 (468) ... ..-.|+...+--+.+..+||.-++.++ ++.|-++.|...++.-|..++|+++++|.+....+. ....=+.+|| T Consensus 71 ~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~-~t~~r~~~Gl 149 (317) T protein:vir:88 71 IKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRN-TTTPGQMANI 149 (317) T ss_pred cccccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCC-CccchhhhhH Confidence 332 333444455555667777777777764 678999999999999999999999999997753221 2223478999 Q ss_pred hhhcCccceeeccCC----------------CCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecC Q lcl|NC_020871. 180 AKLINQDNVHDARGA----------------SLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDN 243 (468) Q Consensus 180 ~~li~~~nviDarG~----------------~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n 243 (468) ...|+..++..+.|. .|+|+.|+++...+-...|.++.+|+++..|..|...+-++-..+- .. T Consensus 150 ~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~-~~ 228 (317) T protein:vir:88 150 FAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEIT-LD 228 (317) T ss_pred HHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEE-Ec Confidence 999988877766555 5999999999999988999999999999999999666544433332 23 Q ss_pred CCcceeeeeccceeecCCccccCCCEeeccccc--ccccccccC-CCCCCcceeEEecCCCCCCcCcccceeEEEEEEEE Q lcl|NC_020871. 244 GNNVSVGFNIQGFHSARGFIKLHGSTVMENEQI--LDERILALP-TAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVS 320 (468) Q Consensus 244 ~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~--l~~~~~~~p-~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtav 320 (468) .....+|..|..|.+..|.+++..+-.|..+.. +|...+... .-|.. ....+-++. ....--.-.|.+-+. T Consensus 229 ~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~----~e~laKtGd--~~k~~i~~E~tLe~~ 302 (317) T protein:vir:88 229 ASDNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFF----QHELAKTGD--SEKRQLLVEYTFRVN 302 (317) T ss_pred ccCeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccce----eeccCCCcc--cceeEEEEEEEEEEc Confidence 345579999999999999999877766644332 222222221 11111 111111111 000111233444444 Q ss_pred cccCCcccccceeeee Q lcl|NC_020871. 321 SDDAESIASEVATATV 336 (468) Q Consensus 321 n~~GES~aS~~vt~Tv 336 (468) |..+-..-.. .++++ T Consensus 303 N~~a~a~i~~-l~~~~ 317 (317) T protein:vir:88 303 NEKSGALIRD-VVAQL 317 (317) T ss_pred CccceeEEEE-ecccC Confidence 4432111110 11111 No 10 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.26 E-value=2.1e-12 Score=84.56 Aligned_cols=301 Identities=13% Similarity=0.150 Sum_probs=169.3 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) ||.. +.+.| +-|....|...+.- .-.++-.++..++=..++.....| T Consensus 1 mpal----------tLaea---------------------~k~~~d~l~~~ViE--~~~~~s~lL~~LpF~~veg~~~~y 47 (310) T protein:vir:97 1 MASV----------TLAES---------------------AKLAQDELVAGVIE--NIITVNRMFDVLPFDSIEGNSLAY 47 (310) T ss_pred Cccc----------chHHH---------------------hhcCcchHHHHHHH--HHhccchHHHhCCcccccCCccee Confidence 3311 12222 22222223322211 111222344555555566667889 Q ss_pred ceeeeeccccc----cccccccccccccCcceEEEEEEEEeeeehhhhhhh-Hhhh-cchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 81 DVYMQHGKVGH----TRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIA-AGLV-NNIQDPMQILTDDAIVNIAKTIE 154 (468) Q Consensus 81 ~~~~~hG~~g~----~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~-~~lv-~~~~Dp~~~~~~~ai~~~~~~~e 154 (468) +|....++.+- ..+..| |++ -+..++.++...++-++....|... +++. ++..|-.++|.+-.|..+...+| T Consensus 48 nR~~~~~~~~~~~v~~~~~~~-g~~-~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e 125 (310) T protein:vir:97 48 NRENVLGDVIMAGVGTTFSGA-GAG-KAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQ 125 (310) T ss_pred eEeeccCCcccccccccccCC-Ccc-ccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHH Confidence 98887666651 111122 222 2567789999999999999999875 6776 55899999999999999999999 Q ss_pred HHHhhcccccccCCCCCCCccccchhhhcCccceeec--cCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 155 WASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDA--RGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 155 ~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDa--rG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) +.+++||+.-+ |||||.+.+++.++||+ +|+.|+++.|.++-..+-+.=|.+.-++||+.+...+...- T Consensus 126 ~~lINGD~a~n---------~F~GL~~~~~~~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~ 196 (310) T protein:vir:97 126 DQLINGNGAGN---------EFAGLIQLCASGQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALL 196 (310) T ss_pred HHhhccccCCC---------cccchhhcCCccceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHH Confidence 99999998632 69999999999999998 77999999999977666555677888999998766663332 Q ss_pred cCC-ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccce Q lcl|NC_020871. 233 LSK-QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLA 311 (468) Q Consensus 233 ~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~ 311 (468) ..- .|-+-+.. . ...|-.|..| +|=-|+..+ + .|...++ .++ ++ . T Consensus 197 R~~~~~g~~~~~-~-~~~G~~v~~~---------~GiPi~~~d-~-------ip~~~~~--~~~---~g----------t 242 (310) T protein:vir:97 197 RALGGASINEVV-E-LPSGAEVPAY---------SGTPIFRND-Y-------IPTNQTK--GGT---TG----------C 242 (310) T ss_pred HHhcCCCCCCcc-c-cCCCCEEeee---------CCeEEEEeC-c-------cCCCccc--ccc---CC----------c Confidence 211 01111111 1 1244444433 222222211 1 0100000 000 00 0 Q ss_pred eEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEe Q lcl|NC_020871. 312 AHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFY 391 (468) Q Consensus 312 ~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~ 391 (468) +==|.|..-. . ....+.+|-- .. + T Consensus 243 TsIya~r~Ge----~------------------------------------------~~~~Gv~Gl~-~~--~------- 266 (310) T protein:vir:97 243 TTIFAGTLDD----G------------------------------------------SRTHGIAGLT-AT--Q------- 266 (310) T ss_pred eeEEEEeeCc----c------------------------------------------ccccceeccc-cC--C------- Confidence 0111111110 0 0011111200 00 0 Q ss_pred cCCCCCCCCccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeee Q lcl|NC_020871. 392 DLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKY 459 (468) Q Consensus 392 D~N~~iPgt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~ 459 (468) .||-.--|+|+. .-.+...|.|.||..+++.-|+...+++||-= T Consensus 267 -----~~glsVr~~G~~-------------------~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 267 -----AAGIQVVDVGES-------------------EDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred -----ccceeEEeCCcc-------------------cCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 011111111111 11256779999999999999999999999943 No 11 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=99.22 E-value=1.1e-12 Score=86.04 Aligned_cols=301 Identities=20% Similarity=0.199 Sum_probs=154.4 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc-hhhhc-Cc-cceeeccCCC--CCHHHHhh----hhh Q lcl|NC_020871. 136 DPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG-LAKLI-NQ-DNVHDARGAS--LTESLLNQ----AAV 206 (468) Q Consensus 136 Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG-l~~li-~~-~nviDarG~~--ls~~~l~~----~a~ 206 (468) =|.+| ..|+-+| .|-|.- -+|=| .+ =.+...+|+- |-+.+|-+ .|. T Consensus 1 ~~~~~------------------~~~~~~~-------~~~~~~~~~~~~~M~~i~i~~f~Ge~Pr~~p~lLP~~~a~~A~ 55 (566) T protein:vir:10 1 MPIAI------------------LANSIIN-------PLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPDHSAVLAE 55 (566) T ss_pred Cceee------------------ehhhhcc-------ceeecccccccceeeEEeecccccccccchhhhccccccceEE Confidence 11111 1111111 121100 00000 01 1456677776 44666653 335 Q ss_pred hhhhccCceEEEecCHHHHhhH----HHhhcCCceE-EeecCCCcceeeeecc-----ceeecCCccccCCCEeecc--- Q lcl|NC_020871. 207 MISKGYGTPTDAYMPVGVQADF----VNQQLSKQTQ-LVRDNGNNVSVGFNIQ-----GFHSARGFIKLHGSTVMEN--- 273 (468) Q Consensus 207 ~i~~~fG~~td~~m~~~v~a~~----~~~~~~~qr~-v~~~n~~~~~~G~~v~-----~~~s~~g~i~l~gs~i~~~--- 273 (468) ...-.+|.-+=...|..+..-| ...|+-+..+ +.=++...+.-+-..+ -|++..|..+.+...|.-. T Consensus 56 n~~~~~G~itP~~~~~~~~~~~~~~~kTif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tg~~~Pk~t~~diAt~g~~ 135 (566) T protein:vir:10 56 DCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPVAQDNYGRIYYTDGKFPKVTAAEIATKGEG 135 (566) T ss_pred eeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCccccCCcceEEEeeCCcceeeecceeecccc Confidence 5566688888887776664444 3344422222 1111222222232232 2666667666655544322 Q ss_pred cccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Ccccccc-eeeeeeccCcceEEEEEeec Q lcl|NC_020871. 274 EQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEV-ATATVTAKDDGVKLEIELAP 351 (468) Q Consensus 274 ~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~-vt~Tv~a~~~g~~ltIT~~~ 351 (468) -....-.++.+| +|+.+.+++.+..++.....+.+.++|.|++|+|++.| ||.||.+ ..+++.+.++.+.|++...+ T Consensus 136 ~~pa~~y~LgVP-aPs~apv~~~~~~sg~~~~~~~d~~tr~Yv~TfVt~~GeES~PS~~S~~v~v~~~gs~V~ltl~~~p 214 (566) T protein:vir:10 136 NFPAASYRLGIP-APTTAPVCTVQKGEGATDENPNDDETRFYTETFVSAYGEEGPPGPESLEVTVGIPDTPVQLTLSPVP 214 (566) T ss_pred ccccccccccCC-CCcccceeeccCCCcccCCCCcccceeEEEEEEEcCCCCcCCCccccceeEecCCCceEEEEecCCC Confidence 123444567777 44444455554444444466778899999999999999 8988864 34455554555666666666 Q ss_pred CCCcccceEEEEeecCCC--ceeEEEEEEecccccCCeeEEecC------CCCCC--------CC-------ccceecCC Q lcl|NC_020871. 352 MYSSRPQFVSIYRKGAET--GLFYLIARVPASKAENNVITFYDL------NDSIP--------ET-------VDVFVGEM 408 (468) Q Consensus 352 ~~ga~~~~y~IYR~~~~~--G~f~~igrv~~s~~~~~t~tf~D~------N~~iP--------gt-------~~~fvge~ 408 (468) ..+...+..+|||+..++ .+|+|+++.+.+ ..+|+|. ++.+| .. .++++-.| T Consensus 215 ~~~~~i~~~RIYRS~tg~~gtdy~lVael~as-----~~sf~Dd~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF 289 (566) T protein:vir:10 215 LQDANINRRRIYRSVSGGGEADFLLVAELEAS-----VLSYTDNIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGF 289 (566) T ss_pred cCcCCceeEEEEEecCCCCceeEEEEeeeccc-----ceeeeccccccccCcccccccccCcCcccceeeecccceEEee Confidence 566667889999997654 599999998655 4678776 33332 21 11111111 Q ss_pred C-----------h----h---------hhhh----------------------hhhccchhccccccCccceeeeeeeee Q lcl|NC_020871. 409 S-----------A----N---------VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYGA 442 (468) Q Consensus 409 n-----------p----~---------vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~yga 442 (468) . | + +|.+ -...-++||+..+.=-+.+.+|.+=|. T Consensus 290 ~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qaCvS~rsiV~~~g~ 369 (566) T protein:vir:10 290 AGNEVMFSEAYLPYAWPEVNRHTTAEDIVAVCPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRQSMVAMEGF 369 (566) T ss_pred cCCEEEEecCCCCcccchhhccCCCCCeEEEEeccceEEEEEcCceEEEEcCChhhccccccccccccccccceeeecce Confidence 1 1 0 1110 111112343433333466777777666 Q ss_pred heeeccceeEEEE-E--eeeeecccccCC Q lcl|NC_020871. 443 LALRAPKKWVRIK-N--VKYIPVKNVHSN 468 (468) Q Consensus 443 L~l~aPkk~~~ik-N--V~~~~~~~~~~~ 468 (468) ..--.|.-.|.|. | .+ +-=++|.+. T Consensus 370 v~Yas~dGLv~v~a~g~a~-vvT~~l~t~ 397 (566) T protein:vir:10 370 VLYAGTNGLVSVDANGNAA-LATEQIISP 397 (566) T ss_pred EEeecCCceEEEecCCChh-hhhhhhcCh Confidence 6666777777773 2 21 111222222 No 12 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.20 E-value=4.6e-12 Score=82.75 Aligned_cols=324 Identities=15% Similarity=0.136 Sum_probs=175.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) |-.. --|- -.-.|..+..-|= --.+..-+...++.|-...+...+.. .-.+.-.++..++=..+++....| T Consensus 1 ~~~~-----~~~~-~~~~~~~~~~~~p-~l~m~alTLaea~~l~~d~~~~~VIE--~l~~~s~iL~~lpf~~ve~~~~~~ 71 (330) T protein:vir:94 1 MVRI-----CTPP-LRGRWRTLTHQFP-ELKMPTVTLAESAKLSQDHLVSGLIE--TIVEVNPLYEMMPFTEIEGNALAY 71 (330) T ss_pred Ccee-----cCCc-cccceeehhcccc-ccchhhhhhhHHhhcCchhhHHHHHH--hhhccchHHhhcccccccCCccee Confidence 1000 0000 0112222221100 00011112333444444555444422 112222455666666678888999 Q ss_pred ceeeeecccccccccccc-ccccccCcceEEEEEEEEeeeehhhhhhhH-hhhcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREI-GVAPVSDPNIRQKTVNMKFASDTKNISIAA-GLVNNIQDPMQILTDDAIVNIAKTIEWASF 158 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~-g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~-~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f 158 (468) +|....++.. |.... +.++....++.|.+..++-++.-..|.... ++-++..|-+..|.+..|..+.+.+|+.+| T Consensus 72 ~r~~~lp~a~---~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~li 148 (330) T protein:vir:94 72 NRENVLGDVQ---FLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMI 148 (330) T ss_pred eeeecCCcce---eeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9999765543 53322 233334567899999999999988887765 466778999999999999999999999999 Q ss_pred hcccccccCCCCCCCccccchhhhcCccceeec--cCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcC-C Q lcl|NC_020871. 159 FGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDA--RGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS-K 235 (468) Q Consensus 159 ~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDa--rG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~-~ 235 (468) +||+.- -|||||.+.++++|+||+ +|+.|+++.|.++-..+-+--|.+.-++|+......+...-.. . T Consensus 149 nGDs~~---------~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~ 219 (330) T protein:vir:94 149 TGDGTG---------NSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALG 219 (330) T ss_pred ccCCCC---------ccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhcc Confidence 999652 289999999999999999 7899999999998877766667787788777766666332221 1 Q ss_pred ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEE Q lcl|NC_020871. 236 QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEY 315 (468) Q Consensus 236 qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~Y 315 (468) .|-+-+.. .. ..|-.|..| +|=-|+-.| + .|.. .+ + T Consensus 220 ~~~v~~~~-~~-~~G~~v~~~---------~GvPi~~~d-~----------ip~~----~~-------~----------- 255 (330) T protein:vir:94 220 GAAIGEVM-TL-PSGRQIPTY---------RGVPWFVND-F----------IPSN----MT-------Q----------- 255 (330) T ss_pred CCCCCCcc-cc-cCCCEEeee---------CCeEEEecc-c----------ccCC----CC-------c----------- Confidence 11111110 00 134444322 121122110 0 0000 00 0 Q ss_pred EEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecC-CCceeEEEEEEecccccCCeeEEecCC Q lcl|NC_020871. 316 KVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGA-ETGLFYLIARVPASKAENNVITFYDLN 394 (468) Q Consensus 316 kVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~-~~G~f~~igrv~~s~~~~~t~tf~D~N 394 (468) |++ .|.+.-|..=+.+.. .-|+-++... + T Consensus 256 --------~~~--------------------------~~ttsIyav~~G~~~~~qgV~Gl~~~------g---------- 285 (330) T protein:vir:94 256 --------GTA--------------------------TNATAIFAGTFDDGSNKYGIAGLTAR------G---------- 285 (330) T ss_pred --------ccC--------------------------CCceeEEEEeecccccccceEeecCC------C---------- Confidence 000 000000000000000 0011111110 0 Q ss_pred CCCCCCccceecCCChhhhhhhhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeeee Q lcl|NC_020871. 395 DSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYI 460 (468) Q Consensus 395 ~~iPgt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~ 460 (468) .||-.--|+|+.+ -.+...|.|.||..+++.-|+...+++||.-= T Consensus 286 --~~glsVr~~G~~~-------------------~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 286 --SAGLRVQNVGAKE-------------------NADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred --CCcceeeeCCCcc-------------------ccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 0221111111111 11457889999999999999999999999544 No 13 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=99.17 E-value=2.9e-12 Score=83.81 Aligned_cols=302 Identities=20% Similarity=0.219 Sum_probs=159.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc-hhhhc-C-ccceeeccCCC--CCHHHHhh----hh Q lcl|NC_020871. 135 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG-LAKLI-N-QDNVHDARGAS--LTESLLNQ----AA 205 (468) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG-l~~li-~-~~nviDarG~~--ls~~~l~~----~a 205 (468) +-|.+|+ .|+-+| .|-|.- -+|=| . .=.+...+|+- |-+.+|-+ .| T Consensus 1 ~~~~~~~------------------~~~~~~-------~~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:99 1 MMPIAIL------------------ANSIIN-------PLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhhh------------------hhhhcc-------ceeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 2222222 222221 121100 00101 1 11456677776 45666653 33 Q ss_pred hhhhhccCceEEEecCHHHHhhH----HHhhcCCce-EEeecCCCcceeeeecc-----ceeecCCccccCCCEeec--- Q lcl|NC_020871. 206 VMISKGYGTPTDAYMPVGVQADF----VNQQLSKQT-QLVRDNGNNVSVGFNIQ-----GFHSARGFIKLHGSTVME--- 272 (468) Q Consensus 206 ~~i~~~fG~~td~~m~~~v~a~~----~~~~~~~qr-~v~~~n~~~~~~G~~v~-----~~~s~~g~i~l~gs~i~~--- 272 (468) ....-.+|.-+=...|..+...| .+.|+=+.. -+.=++...+.-|-..+ -|++.+|..+.+...|.- T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~ 135 (567) T protein:vir:99 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGD 135 (567) T ss_pred EeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCC Confidence 55566688888877776664443 233332222 11112223332333333 377888888877777642 Q ss_pred ccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Ccccccce-eeeeeccCcceEEEEEee Q lcl|NC_020871. 273 NEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVA-TATVTAKDDGVKLEIELA 350 (468) Q Consensus 273 ~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~v-t~Tv~a~~~g~~ltIT~~ 350 (468) ...+....++.+|++.+++ +++++.+++.....+.|+.++.|++|+|++.| ||+||.+- .+++...+..+.|++... T Consensus 136 ~~~P~~~y~LgVpaps~aP-~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~ 214 (567) T protein:vir:99 136 GNHPTSSYRLGIPAPTTAP-VCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPV 214 (567) T ss_pred CCCCcchhhcccCCccccc-eeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccC Confidence 2333344577777444443 44555555555556778899999999999999 89888653 334433344455555555 Q ss_pred cCCCcccceEEEEeecCCC--ceeEEEEEEecccccCCeeEEecC------CCCCC--------CC-------ccceecC Q lcl|NC_020871. 351 PMYSSRPQFVSIYRKGAET--GLFYLIARVPASKAENNVITFYDL------NDSIP--------ET-------VDVFVGE 407 (468) Q Consensus 351 ~~~ga~~~~y~IYR~~~~~--G~f~~igrv~~s~~~~~t~tf~D~------N~~iP--------gt-------~~~fvge 407 (468) +..+..-+..+|||+..++ ++|+|+++++.+ ..+|+|. ++.+| -. .++++-. T Consensus 215 ~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as-----~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAg 289 (567) T protein:vir:99 215 PLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAG 289 (567) T ss_pred CccccccceEEEEEecCCCCceeeEEEEeeccc-----eeeeeeccchhhcccccccccccCcCcccceeeecccceEEe Confidence 5555556889999987664 599999998654 5678886 33332 11 1121111 Q ss_pred CC-----------h----h---------hhhh----------------------hhhccchhccccccCccceeeeeeee Q lcl|NC_020871. 408 MS-----------A----N---------VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYG 441 (468) Q Consensus 408 ~n-----------p----~---------vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~yg 441 (468) |. | + +|.+ -...-++||+..+.=-+.+.+|.+=| T Consensus 290 F~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~g 369 (567) T protein:vir:99 290 FAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRRSMVAMEG 369 (567) T ss_pred ecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeEecc Confidence 11 1 0 1110 11223344444444457777777777 Q ss_pred eheeeccceeEEEE-E--eeeeecccccCC Q lcl|NC_020871. 442 ALALRAPKKWVRIK-N--VKYIPVKNVHSN 468 (468) Q Consensus 442 aL~l~aPkk~~~ik-N--V~~~~~~~~~~~ 468 (468) +..--.|.-.|.|. | ++. -=++|.+. T Consensus 370 ~v~Yas~dGLv~i~a~G~a~v-vT~~l~t~ 398 (567) T protein:vir:99 370 FVLYAGTNGLVSVDANGNVAL-ATEQIVSP 398 (567) T ss_pred EEEeecCCcEEEEecCCchhh-hhhhccCh Confidence 76667788777774 3 211 11222222 No 14 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=99.17 E-value=2.9e-12 Score=83.81 Aligned_cols=302 Identities=20% Similarity=0.219 Sum_probs=159.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc-hhhhc-C-ccceeeccCCC--CCHHHHhh----hh Q lcl|NC_020871. 135 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG-LAKLI-N-QDNVHDARGAS--LTESLLNQ----AA 205 (468) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG-l~~li-~-~~nviDarG~~--ls~~~l~~----~a 205 (468) +-|.+|+ .|+-+| .|-|.- -+|=| . .=.+...+|+- |-+.+|-+ .| T Consensus 1 ~~~~~~~------------------~~~~~~-------~~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:27 1 MMPIAIL------------------ANSIIN-------PLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhhh------------------hhhhcc-------ceeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 2222222 222221 121100 00101 1 11456677776 45666653 33 Q ss_pred hhhhhccCceEEEecCHHHHhhH----HHhhcCCce-EEeecCCCcceeeeecc-----ceeecCCccccCCCEeec--- Q lcl|NC_020871. 206 VMISKGYGTPTDAYMPVGVQADF----VNQQLSKQT-QLVRDNGNNVSVGFNIQ-----GFHSARGFIKLHGSTVME--- 272 (468) Q Consensus 206 ~~i~~~fG~~td~~m~~~v~a~~----~~~~~~~qr-~v~~~n~~~~~~G~~v~-----~~~s~~g~i~l~gs~i~~--- 272 (468) ....-.+|.-+=...|..+...| .+.|+=+.. -+.=++...+.-|-..+ -|++.+|..+.+...|.- T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~ 135 (567) T protein:vir:27 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGD 135 (567) T ss_pred EeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCC Confidence 55566688888877776664443 233332222 11112223332333333 377888888877777642 Q ss_pred ccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Ccccccce-eeeeeccCcceEEEEEee Q lcl|NC_020871. 273 NEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVA-TATVTAKDDGVKLEIELA 350 (468) Q Consensus 273 ~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~v-t~Tv~a~~~g~~ltIT~~ 350 (468) ...+....++.+|++.+++ +++++.+++.....+.|+.++.|++|+|++.| ||+||.+- .+++...+..+.|++... T Consensus 136 ~~~P~~~y~LgVpaps~aP-~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~ 214 (567) T protein:vir:27 136 GNHPTSSYRLGIPAPTTAP-VCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPV 214 (567) T ss_pred CCCCcchhhcccCCccccc-eeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccC Confidence 2333344577777444443 44555555555556778899999999999999 89888653 334433344455555555 Q ss_pred cCCCcccceEEEEeecCCC--ceeEEEEEEecccccCCeeEEecC------CCCCC--------CC-------ccceecC Q lcl|NC_020871. 351 PMYSSRPQFVSIYRKGAET--GLFYLIARVPASKAENNVITFYDL------NDSIP--------ET-------VDVFVGE 407 (468) Q Consensus 351 ~~~ga~~~~y~IYR~~~~~--G~f~~igrv~~s~~~~~t~tf~D~------N~~iP--------gt-------~~~fvge 407 (468) +..+..-+..+|||+..++ ++|+|+++++.+ ..+|+|. ++.+| -. .++++-. T Consensus 215 ~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as-----~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAg 289 (567) T protein:vir:27 215 PLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAG 289 (567) T ss_pred CccccccceEEEEEecCCCCceeeEEEEeeccc-----eeeeeeccchhhcccccccccccCcCcccceeeecccceEEe Confidence 5555556889999987664 599999998654 5678886 33332 11 1121111 Q ss_pred CC-----------h----h---------hhhh----------------------hhhccchhccccccCccceeeeeeee Q lcl|NC_020871. 408 MS-----------A----N---------VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYG 441 (468) Q Consensus 408 ~n-----------p----~---------vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~yg 441 (468) |. | + +|.+ -...-++||+..+.=-+.+.+|.+=| T Consensus 290 F~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~g 369 (567) T protein:vir:27 290 FAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRRSMVAMEG 369 (567) T ss_pred ecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeEecc Confidence 11 1 0 1110 11223344444444457777777777 Q ss_pred eheeeccceeEEEE-E--eeeeecccccCC Q lcl|NC_020871. 442 ALALRAPKKWVRIK-N--VKYIPVKNVHSN 468 (468) Q Consensus 442 aL~l~aPkk~~~ik-N--V~~~~~~~~~~~ 468 (468) +..--.|.-.|.|. | ++. -=++|.+. T Consensus 370 ~v~Yas~dGLv~i~a~G~a~v-vT~~l~t~ 398 (567) T protein:vir:27 370 FVLYAGTNGLVSVDANGNVAL-ATEQIVSP 398 (567) T ss_pred EEEeecCCcEEEEecCCchhh-hhhhccCh Confidence 76667788777774 3 211 11222222 No 15 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=99.17 E-value=2.9e-12 Score=83.81 Aligned_cols=302 Identities=20% Similarity=0.219 Sum_probs=159.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc-hhhhc-C-ccceeeccCCC--CCHHHHhh----hh Q lcl|NC_020871. 135 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG-LAKLI-N-QDNVHDARGAS--LTESLLNQ----AA 205 (468) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG-l~~li-~-~~nviDarG~~--ls~~~l~~----~a 205 (468) +-|.+|+ .|+-+| .|-|.- -+|=| . .=.+...+|+- |-+.+|-+ .| T Consensus 1 ~~~~~~~------------------~~~~~~-------~~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:10 1 MMPIAIL------------------ANSIIN-------PLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhhh------------------hhhhcc-------ceeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 2222222 222221 121100 00101 1 11456677776 45666653 33 Q ss_pred hhhhhccCceEEEecCHHHHhhH----HHhhcCCce-EEeecCCCcceeeeecc-----ceeecCCccccCCCEeec--- Q lcl|NC_020871. 206 VMISKGYGTPTDAYMPVGVQADF----VNQQLSKQT-QLVRDNGNNVSVGFNIQ-----GFHSARGFIKLHGSTVME--- 272 (468) Q Consensus 206 ~~i~~~fG~~td~~m~~~v~a~~----~~~~~~~qr-~v~~~n~~~~~~G~~v~-----~~~s~~g~i~l~gs~i~~--- 272 (468) ....-.+|.-+=...|..+...| .+.|+=+.. -+.=++...+.-|-..+ -|++.+|..+.+...|.- T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~ 135 (567) T protein:vir:10 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGD 135 (567) T ss_pred EeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCC Confidence 55566688888877776664443 233332222 11112223332333333 377888888877777642 Q ss_pred ccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Ccccccce-eeeeeccCcceEEEEEee Q lcl|NC_020871. 273 NEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVA-TATVTAKDDGVKLEIELA 350 (468) Q Consensus 273 ~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~v-t~Tv~a~~~g~~ltIT~~ 350 (468) ...+....++.+|++.+++ +++++.+++.....+.|+.++.|++|+|++.| ||+||.+- .+++...+..+.|++... T Consensus 136 ~~~P~~~y~LgVpaps~aP-~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~ 214 (567) T protein:vir:10 136 GNHPTSSYRLGIPAPTTAP-VCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPV 214 (567) T ss_pred CCCCcchhhcccCCccccc-eeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccC Confidence 2333344577777444443 44555555555556778899999999999999 89888653 334433344455555555 Q ss_pred cCCCcccceEEEEeecCCC--ceeEEEEEEecccccCCeeEEecC------CCCCC--------CC-------ccceecC Q lcl|NC_020871. 351 PMYSSRPQFVSIYRKGAET--GLFYLIARVPASKAENNVITFYDL------NDSIP--------ET-------VDVFVGE 407 (468) Q Consensus 351 ~~~ga~~~~y~IYR~~~~~--G~f~~igrv~~s~~~~~t~tf~D~------N~~iP--------gt-------~~~fvge 407 (468) +..+..-+..+|||+..++ ++|+|+++++.+ ..+|+|. ++.+| -. .++++-. T Consensus 215 ~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as-----~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAg 289 (567) T protein:vir:10 215 PLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAG 289 (567) T ss_pred CccccccceEEEEEecCCCCceeeEEEEeeccc-----eeeeeeccchhhcccccccccccCcCcccceeeecccceEEe Confidence 5555556889999987664 599999998654 5678886 33332 11 1121111 Q ss_pred CC-----------h----h---------hhhh----------------------hhhccchhccccccCccceeeeeeee Q lcl|NC_020871. 408 MS-----------A----N---------VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYG 441 (468) Q Consensus 408 ~n-----------p----~---------vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~yg 441 (468) |. | + +|.+ -...-++||+..+.=-+.+.+|.+=| T Consensus 290 F~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~g 369 (567) T protein:vir:10 290 FAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRRSMVAMEG 369 (567) T ss_pred ecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeEecc Confidence 11 1 0 1110 11223344444444457777777777 Q ss_pred eheeeccceeEEEE-E--eeeeecccccCC Q lcl|NC_020871. 442 ALALRAPKKWVRIK-N--VKYIPVKNVHSN 468 (468) Q Consensus 442 aL~l~aPkk~~~ik-N--V~~~~~~~~~~~ 468 (468) +..--.|.-.|.|. | ++. -=++|.+. T Consensus 370 ~v~Yas~dGLv~i~a~G~a~v-vT~~l~t~ 398 (567) T protein:vir:10 370 FVLYAGTNGLVSVDANGNVAL-ATEQIVSP 398 (567) T ss_pred EEEeecCCcEEEEecCCchhh-hhhhccCh Confidence 76667788777774 3 211 11222222 No 16 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=99.17 E-value=2.9e-12 Score=83.81 Aligned_cols=302 Identities=20% Similarity=0.219 Sum_probs=159.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc-hhhhc-C-ccceeeccCCC--CCHHHHhh----hh Q lcl|NC_020871. 135 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG-LAKLI-N-QDNVHDARGAS--LTESLLNQ----AA 205 (468) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG-l~~li-~-~~nviDarG~~--ls~~~l~~----~a 205 (468) +-|.+|+ .|+-+| .|-|.- -+|=| . .=.+...+|+- |-+.+|-+ .| T Consensus 1 ~~~~~~~------------------~~~~~~-------~~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:33 1 MMPIAIL------------------ANSIIN-------PLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhhh------------------hhhhcc-------ceeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 2222222 222221 121100 00101 1 11456677776 45666653 33 Q ss_pred hhhhhccCceEEEecCHHHHhhH----HHhhcCCce-EEeecCCCcceeeeecc-----ceeecCCccccCCCEeec--- Q lcl|NC_020871. 206 VMISKGYGTPTDAYMPVGVQADF----VNQQLSKQT-QLVRDNGNNVSVGFNIQ-----GFHSARGFIKLHGSTVME--- 272 (468) Q Consensus 206 ~~i~~~fG~~td~~m~~~v~a~~----~~~~~~~qr-~v~~~n~~~~~~G~~v~-----~~~s~~g~i~l~gs~i~~--- 272 (468) ....-.+|.-+=...|..+...| .+.|+=+.. -+.=++...+.-|-..+ -|++.+|..+.+...|.- T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~ 135 (567) T protein:vir:33 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGD 135 (567) T ss_pred EeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCC Confidence 55566688888877776664443 233332222 11112223332333333 377888888877777642 Q ss_pred ccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Ccccccce-eeeeeccCcceEEEEEee Q lcl|NC_020871. 273 NEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVA-TATVTAKDDGVKLEIELA 350 (468) Q Consensus 273 ~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~v-t~Tv~a~~~g~~ltIT~~ 350 (468) ...+....++.+|++.+++ +++++.+++.....+.|+.++.|++|+|++.| ||+||.+- .+++...+..+.|++... T Consensus 136 ~~~P~~~y~LgVpaps~aP-~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~ 214 (567) T protein:vir:33 136 GNHPTSSYRLGIPAPTTAP-VCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPV 214 (567) T ss_pred CCCCcchhhcccCCccccc-eeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccC Confidence 2333344577777444443 44555555555556778899999999999999 89888653 334433344455555555 Q ss_pred cCCCcccceEEEEeecCCC--ceeEEEEEEecccccCCeeEEecC------CCCCC--------CC-------ccceecC Q lcl|NC_020871. 351 PMYSSRPQFVSIYRKGAET--GLFYLIARVPASKAENNVITFYDL------NDSIP--------ET-------VDVFVGE 407 (468) Q Consensus 351 ~~~ga~~~~y~IYR~~~~~--G~f~~igrv~~s~~~~~t~tf~D~------N~~iP--------gt-------~~~fvge 407 (468) +..+..-+..+|||+..++ ++|+|+++++.+ ..+|+|. ++.+| -. .++++-. T Consensus 215 ~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as-----~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAg 289 (567) T protein:vir:33 215 PLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAG 289 (567) T ss_pred CccccccceEEEEEecCCCCceeeEEEEeeccc-----eeeeeeccchhhcccccccccccCcCcccceeeecccceEEe Confidence 5555556889999987664 599999998654 5678886 33332 11 1121111 Q ss_pred CC-----------h----h---------hhhh----------------------hhhccchhccccccCccceeeeeeee Q lcl|NC_020871. 408 MS-----------A----N---------VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYG 441 (468) Q Consensus 408 ~n-----------p----~---------vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~yg 441 (468) |. | + +|.+ -...-++||+..+.=-+.+.+|.+=| T Consensus 290 F~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~g 369 (567) T protein:vir:33 290 FAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRRSMVAMEG 369 (567) T ss_pred ecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeEecc Confidence 11 1 0 1110 11223344444444457777777777 Q ss_pred eheeeccceeEEEE-E--eeeeecccccCC Q lcl|NC_020871. 442 ALALRAPKKWVRIK-N--VKYIPVKNVHSN 468 (468) Q Consensus 442 aL~l~aPkk~~~ik-N--V~~~~~~~~~~~ 468 (468) +..--.|.-.|.|. | ++. -=++|.+. T Consensus 370 ~v~Yas~dGLv~i~a~G~a~v-vT~~l~t~ 398 (567) T protein:vir:33 370 FVLYAGTNGLVSVDANGNVAL-ATEQIVSP 398 (567) T ss_pred EEEeecCCcEEEEecCCchhh-hhhhccCh Confidence 76667788777774 3 211 11222222 No 17 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=99.15 E-value=3.4e-12 Score=83.48 Aligned_cols=302 Identities=20% Similarity=0.219 Sum_probs=158.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc-hhhhc-C-ccceeeccCCC--CCHHHHhh----hh Q lcl|NC_020871. 135 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG-LAKLI-N-QDNVHDARGAS--LTESLLNQ----AA 205 (468) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG-l~~li-~-~~nviDarG~~--ls~~~l~~----~a 205 (468) +-|.+|+ .|+-+| .|-|.- -+|=| . .=.+...+|+- |-+.+|-+ .| T Consensus 1 ~~~~~~~------------------~~~~~~-------~~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:82 1 MMPIAIL------------------ANSIIN-------PLIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhhh------------------hhhhcc-------ceeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 2222222 222221 121100 00101 1 11456677776 45666653 33 Q ss_pred hhhhhccCceEEEecCHHHHhhH----HHhhcCCceE-EeecCCCcceeeeecc-----ceeecCCccccCCCEeec--- Q lcl|NC_020871. 206 VMISKGYGTPTDAYMPVGVQADF----VNQQLSKQTQ-LVRDNGNNVSVGFNIQ-----GFHSARGFIKLHGSTVME--- 272 (468) Q Consensus 206 ~~i~~~fG~~td~~m~~~v~a~~----~~~~~~~qr~-v~~~n~~~~~~G~~v~-----~~~s~~g~i~l~gs~i~~--- 272 (468) ....-.+|.-+=...|..+...| ...|+-+..+ +.=++...+.-+-..+ -|++.+|..+.+...|.- T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~ 135 (567) T protein:vir:82 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGD 135 (567) T ss_pred EeeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCccccCCcccEEEecCCcceeeeeeeeecCC Confidence 55566688888877776664443 3444422222 1112222232333333 377888888877777642 Q ss_pred ccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Ccccccce-eeeeeccCcceEEEEEee Q lcl|NC_020871. 273 NEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVA-TATVTAKDDGVKLEIELA 350 (468) Q Consensus 273 ~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~v-t~Tv~a~~~g~~ltIT~~ 350 (468) ...+....++.+|++.+++ +++++.+++...-.+.|+.++.|++|+|++.| ||+||.+- .+++...+..+.|++... T Consensus 136 ~~~P~~~y~LgVpaps~aP-~~a~~~~~~~~~~~p~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~ 214 (567) T protein:vir:82 136 GNHPTSSYRLGIPAPTTAP-VCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPV 214 (567) T ss_pred CCCCcchhhcccCCccccc-eeeecCCCCCCCCCCccccceEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccC Confidence 2333344577777444443 44555555555556678889999999999999 89888653 334433344455655555 Q ss_pred cCCCcccceEEEEeecCCC--ceeEEEEEEecccccCCeeEEecC------CCCCC--------CC-------ccceecC Q lcl|NC_020871. 351 PMYSSRPQFVSIYRKGAET--GLFYLIARVPASKAENNVITFYDL------NDSIP--------ET-------VDVFVGE 407 (468) Q Consensus 351 ~~~ga~~~~y~IYR~~~~~--G~f~~igrv~~s~~~~~t~tf~D~------N~~iP--------gt-------~~~fvge 407 (468) +..+..-+..+|||+..++ ++|+|+++++.+ ..+|+|. ++.+| -. .++++-. T Consensus 215 ~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as-----~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAg 289 (567) T protein:vir:82 215 PLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAG 289 (567) T ss_pred CccccccceEEEEEecCCCCceeeEEEEeeccc-----eeeeeeccchhhcccccccccccCcCcccceeeecccceEEe Confidence 5555556889999987664 599999998654 5678886 33332 11 1121111 Q ss_pred CC-----------h----h---------hhhh----------------------hhhccchhccccccCccceeeeeeee Q lcl|NC_020871. 408 MS-----------A----N---------VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYG 441 (468) Q Consensus 408 ~n-----------p----~---------vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~yg 441 (468) |. | + +|.+ -...-++||+..+.=-+.+.+|.+=| T Consensus 290 F~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~g 369 (567) T protein:vir:82 290 FAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLRTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRRSMVAMEG 369 (567) T ss_pred ecCCEEEEecCCCCcccchhhccCCCCCeEEEEecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeeecc Confidence 11 1 0 1110 11112234444333346677777777 Q ss_pred eheeeccceeEEEE-E--eeeeecccccCC Q lcl|NC_020871. 442 ALALRAPKKWVRIK-N--VKYIPVKNVHSN 468 (468) Q Consensus 442 aL~l~aPkk~~~ik-N--V~~~~~~~~~~~ 468 (468) +..--.|.-.|.|. | ++. -=++|.+. T Consensus 370 ~v~Yas~dGLv~i~a~G~a~v-vT~~l~t~ 398 (567) T protein:vir:82 370 FVLYAGTNGLVSVDANGNVAL-ATEQIVSP 398 (567) T ss_pred eEEeecCCcEEEEecCCchhh-hhhhccCh Confidence 66667777777773 3 211 11222222 No 18 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=98.86 E-value=1.4e-10 Score=74.57 Aligned_cols=252 Identities=18% Similarity=0.135 Sum_probs=100.7 Q ss_pred HHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCC--CCHHHHhh----hhhhhhhccCceEEEecCHH Q lcl|NC_020871. 150 AKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGAS--LTESLLNQ----AAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 150 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~--ls~~~l~~----~a~~i~~~fG~~td~~m~~~ 223 (468) |-.|.- ...+|+- |-+.+|-+ .|....-.+|.-+=..=|-. T Consensus 1 M~~i~i---------------------------------~~f~Ge~Prl~p~lLP~~~a~~a~n~~~~~G~i~P~~~~~~ 47 (580) T protein:vir:93 1 MTIIKI---------------------------------TGFSGEIPRLVPRLLPDTAAQNATNARLESGGLTPYRKPKF 47 (580) T ss_pred CeeEee---------------------------------cccccccccchhhhccccccceEEeeeccCCeeeeeeCchh Confidence 222222 3333333 22333321 11222222333332211100 Q ss_pred H-------HhhHHHhhcCCceEEeecC-CCcce---eeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcc Q lcl|NC_020871. 224 V-------QADFVNQQLSKQTQLVRDN-GNNVS---VGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAK 292 (468) Q Consensus 224 v-------~a~~~~~~~~~qr~v~~~n-~~~~~---~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~ 292 (468) + .++....|+. ++.-+.=+ ...+. +..+ .-|++..|..+++- .-...++.+| +|+.+. T Consensus 48 ~~~~~~i~~~~~~t~~~~-~~~W~~w~~~V~~i~~PvA~D-Rvy~Td~g~Pkvt~--------~g~sy~lgVp-aPs~Ap 116 (580) T protein:vir:93 48 ITRISTIPAGQIETIYRN-GETWMAWDKPVYAAPGPVAAD-RLYVMGDGAPKMIV--------GGTTYPLAVP-MPSAAL 116 (580) T ss_pred hccccccCcCcceEEEec-CceeEEeCCceeeecCccccc-eeEEcCCcccceec--------CCccccccCC-CcccCc Confidence 0 0011111111 11111111 01000 1111 34566666555421 1112234444 344444 Q ss_pred eeEEecCCCCCCcCcccceeEEEEEEEEcccC-CcccccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEeecCCC Q lcl|NC_020871. 293 VTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRKGAET 369 (468) Q Consensus 293 vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~~~~~ 369 (468) +++++.++ ..+.++|.|+++.|+++| ||.||......... .+.+|+|+..+...+. .+..+|||+..++ T Consensus 117 t~~~~g~g------~l~~~~y~Yv~TfVt~~GeES~PS~~S~~vtv~--~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~ 188 (580) T protein:vir:93 117 TAATSGTG------TGDVFSRVYVYTFVTGFGEESEPSAISNEVNWQ--AGQTVTLSGFQAAPAGRNITKQRIYRSQTSL 188 (580) T ss_pred eeeecCCC------CcCccceEEEEEEEcCCCCcCCCcccccceeeC--CCCeEEEEecCCCCCCCccceEEEEEeccCC Confidence 44443222 125578999999999999 99988653332222 3445666666555554 4668999987663 Q ss_pred --ceeEEEEEEecccccCCeeEEecCCCCCCCCccceecCCChhhhhhh----hhccchhccccccCccceeeeeeeeeh Q lcl|NC_020871. 370 --GLFYLIARVPASKAENNVITFYDLNDSIPETVDVFVGEMSANVVHLF----ELLPMMRLPLAQINASVTFAVLWYGAL 443 (468) Q Consensus 370 --G~f~~igrv~~s~~~~~t~tf~D~N~~iPgt~~~fvge~np~vi~~~----ellPm~k~pla~~~~~~~~~V~~ygaL 443 (468) ++|+|+++++.+ +.+|+|...... +|+.=| +..|. .|..+..||++-+ +-..+=..||-=. T Consensus 189 ~gtdy~lVAel~Ag-----~~sF~Dd~s~a~------Lge~Lp-s~~~~~PP~~m~gL~~m~nGi~-agF~Gnev~fsEp 255 (580) T protein:vir:93 189 SGTDLYFIAERDAS-----AANFVDNVPLSD------QNEPLP-SLEWNAPPDDLTGLISLPNGMM-AAFRGKELWLCEP 255 (580) T ss_pred CceeEEEEeeeccc-----eeeeeecccccc------cccccc-hhhccCcCCCcceEEeeccceE-EEEeCCEEEEecC Confidence 599999997543 578988653310 011111 11111 1122333333311 1111212222111 Q ss_pred eeeccceeEEEEEeee-eecccccCC Q lcl|NC_020871. 444 ALRAPKKWVRIKNVKY-IPVKNVHSN 468 (468) Q Consensus 444 ~l~aPkk~~~ikNV~~-~~~~~~~~~ 468 (468) +.|.-|-.==+... .||-.+-.- T Consensus 256 --y~P~AWP~~yr~t~~~~Ivaia~~ 279 (580) T protein:vir:93 256 --WRPHAWPQKYVLTMDYNIVALGAY 279 (580) T ss_pred --CCCccchhhcCCCCCCCceeEeee Confidence 44444422001100 011111111 No 19 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=98.58 E-value=7e-09 Score=65.28 Aligned_cols=283 Identities=17% Similarity=0.128 Sum_probs=123.2 Q ss_pred ccccCCCCCCCccc----cchhhhcCcc-------ceeeccCCCCC--HHHHhh----hhhhhhhccCceEEEecCHHHH Q lcl|NC_020871. 163 DLSDSPEPQAGLEF----DGLAKLINQD-------NVHDARGASLT--ESLLNQ----AAVMISKGYGTPTDAYMPVGVQ 225 (468) Q Consensus 163 ~l~~~~~~~~gleF----DGl~~li~~~-------nviDarG~~ls--~~~l~~----~a~~i~~~fG~~td~~m~~~v~ 225 (468) -+ +.+.--|.-- .-|+--.+.. .+...+|+-+- +.+|-+ .|.+..-..|.-+=...|-.+. T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~M~~I~i~~f~Ge~Prl~P~lLP~~~A~~A~N~~~~~G~ltP~~~~~~~~ 78 (615) T protein:vir:51 1 MV--STGTRRGTLRSRAPSRLHCYLKQGYLGMVAIKISAFAGEQPMLLPRLLPETGATAAMNVRLNDGGLTPINKPIEVA 78 (615) T ss_pred Cc--ccccccceecccCcceeeeeeecCceeeEEEeecccccccccchhhhccCcccceEEeeeecCCeeeeecCccccc Confidence 11 2222222200 0011111111 35556676633 445543 2233444456665554444332 Q ss_pred hhHH----HhhcCCceEEeecCCCcce---eeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEec Q lcl|NC_020871. 226 ADFV----NQQLSKQTQLVRDNGNNVS---VGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQE 298 (468) Q Consensus 226 a~~~----~~~~~~qr~v~~~n~~~~~---~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~ 298 (468) ..+. ..|+-.---+.-+....+. +..+ .-|++.+|..+++ +... ..++.+| +|+.+.+++++. T Consensus 79 ~~~~~~~~Tif~~~~~W~~w~~~V~av~sPvA~D-Rvy~tgdg~Pkv~----~~~~----sY~LgVp-aPs~ap~~~~~g 148 (615) T protein:vir:51 79 TIATASQKTIYRHQGSWLSWPNVVNAVPGPVAQD-RLYFTGDGAPKVK----IGGV----DYALKVP-RPTGALTAALSG 148 (615) T ss_pred ccccccceeeeeecCceeccCCceeEccCCcccc-eeEEcCCCcceEe----eccc----Ccccccc-CCCccceEEecC Confidence 2221 1111110011111111111 1111 3466666665533 1111 1345555 344444444432 Q ss_pred CCCCCCcCcccceeEEEEEEEEcccC-CcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCC--CceeEEE Q lcl|NC_020871. 299 AGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAE--TGLFYLI 375 (468) Q Consensus 299 ~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~--~G~f~~i 375 (468) + ++ .|+.++.|++|.|+++| ||+||++.....-..+..++|..-..+..+...+..+|||+..+ +++|+|+ T Consensus 149 ~--g~----~d~etr~Yv~TfVt~~GeES~PSp~S~~v~v~~g~tVtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~lV 222 (615) T protein:vir:51 149 T--GS----GDIQSRTYVYTWVTSFGEESAPCPASIIVDWKPGQTVTLSGFAATPGGRSITTQRIYRSQTGKTGTGLYLI 222 (615) T ss_pred C--CC----ccccceEEEEEEEcCCCCcCCCCccceeeEecCCCeEEEeeccCCcCCCceeeEEEEEeccCCCceeeEEE Confidence 2 21 25678999999999988 89999654433323344444444444444444567899998766 4699999 Q ss_pred EEEecccccCCeeEEecCC------CCCC--------CC-------ccceecCCC-----------h----h-------- Q lcl|NC_020871. 376 ARVPASKAENNVITFYDLN------DSIP--------ET-------VDVFVGEMS-----------A----N-------- 411 (468) Q Consensus 376 grv~~s~~~~~t~tf~D~N------~~iP--------gt-------~~~fvge~n-----------p----~-------- 411 (468) ++.+.+ ..+|+|.. +.+| -. .++++-.|. | + T Consensus 223 Ael~as-----~~sf~D~~~~~~Lg~~Lps~~w~~PP~~l~GL~~m~NGimAgF~GneV~FsEpy~PyAWP~~Yr~t~d~ 297 (615) T protein:vir:51 223 AERAAS-----AGNFTDNIAVDQFQEPLPSADWNEPPDGLAGLAEMPNGMMAAFVGRSIYFCEPYRPHAWPEKYSRNVGS 297 (615) T ss_pred eeeccc-----ceeeeeccchhhcCcccccccccCcCcchhhhhccccceEEeecCCEEEEecCCCCcccchhcccCcCC Confidence 997544 56788873 3333 11 111111111 1 0 Q ss_pred -hhhh----------------------hhhccchhccccccCccceeeeeeeeeheeeccceeEEEEEeee--eeccccc Q lcl|NC_020871. 412 -VVHL----------------------FELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKY--IPVKNVH 466 (468) Q Consensus 412 -vi~~----------------------~ellPm~k~pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~--~~~~~~~ 466 (468) +|.+ -...-++||+..+-=-+.+.+|.+=|+..--.|.-.|.|.+=.= +--++|. T Consensus 298 dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~~~v~Yas~dGLV~v~~~G~a~vvT~~l~ 377 (615) T protein:vir:51 298 DIVGIAALGSILVVVTKGKPYLLAGTHPDSMQQQQLEENLPCINARSIVDLGHAVCYASNDGLVAVRGDGSIRLVTEQLL 377 (615) T ss_pred CeeEEEecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeEecceEEeecCCceEEEecCCchhhhhhhcc Confidence 1110 11112333333333335555665555555555665555532210 1112222 Q ss_pred CC Q lcl|NC_020871. 467 SN 468 (468) Q Consensus 467 ~~ 468 (468) +. T Consensus 378 t~ 379 (615) T protein:vir:51 378 SR 379 (615) T ss_pred Ch Confidence 21 No 20 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=98.42 E-value=1.1e-09 Score=69.71 Aligned_cols=266 Identities=14% Similarity=0.063 Sum_probs=107.2 Q ss_pred HHHHHHHHhhcccccccCCCCCCCccccchhhhcCccce-eeccCCC--------CCHHHHhh--hhhhhhhccCceEEE Q lcl|NC_020871. 150 AKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNV-HDARGAS--------LTESLLNQ--AAVMISKGYGTPTDA 218 (468) Q Consensus 150 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nv-iDarG~~--------ls~~~l~~--~a~~i~~~fG~~td~ 218 (468) |-+.--.-|-|=.++.....-+.|-|=|++ .+-|+.|| ||+.|+. |+..-|.- .+-.-.+.|+ T Consensus 1 ~~~~~~~~~~ginnv~~e~~l~~~~~~~~~-~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~~~~~~~~~~~~~----- 74 (396) T protein:vir:10 1 MATTSLVPLAGINNVAEDAALQRGGESPRL-YVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFG----- 74 (396) T ss_pred CcceeeeeeecccccccccccccCCCcccc-eeeeeeeecccCCCchhhhccCcccCCceecccccCccccceee----- Confidence 111111222222222111100111111111 13345564 7777765 23222210 1111112221 Q ss_pred ecCHHHHhhHHHhhcCCceEEeecCCCcceeeeec---cceeecCCccccCCCEeecccccccccccccCCCCCCcceeE Q lcl|NC_020871. 219 YMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNI---QGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTA 295 (468) Q Consensus 219 ~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v---~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vta 295 (468) ++..+--.+++... ++....+.+..-+.++. .-|++..+.+.++ ......++.+|++ +++.+.+ T Consensus 75 -~~~~tl~~~~~~~w---~~~~~v~v~~~pva~d~~~~Rvy~t~~~~p~~~--------~~~~~y~L~vp~P-~~a~~~a 141 (396) T protein:vir:10 75 -ALGDQWGKVDPHSW---TFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTY--------DGAQAERLTLDTP-APPLLVA 141 (396) T ss_pred -eCCceEEEEeCCeE---EEEeeeeeccCchhccccCCeEEEEcCCCceee--------eCCcceecCcCCC-ccccccc Confidence 11111111111111 12221211111111111 1233444443332 1234455666633 3333322 Q ss_pred EecCCCCCCcCcccceeEEEEEEEEcccC-CcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEE Q lcl|NC_020871. 296 TQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYL 374 (468) Q Consensus 296 t~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~ 374 (468) . .|. -+.++|.|.++.|+..| ||+||. ++..++ ...+..|+|+ +-.++..+.++|||++++|+.|++ T Consensus 142 ~-----~Gs---l~~~~~~Y~~t~V~~~gEEs~p~~-~S~~v~-~~gg~~vtl~--~~~~~~i~~~RiYrS~~~G~~~~l 209 (396) T protein:vir:10 142 G-----AGS---LSQGTYGAAVAWLRGPQESAPSLI-AFAEVT-DAGALEVTFP--LCLDASVTGARLYLTRANGGELLL 209 (396) T ss_pred c-----cCc---cCCceEEEEEEEEecCCCcCcccc-cccccC-CCCCcEEEEE--cccCCCcceEEEEEeCCChhhhhh Confidence 1 111 13468999999999999 565554 444444 3445555544 545555678999999999999999 Q ss_pred EEEEecccccCCeeEEecCCCCCCCCccceecCCChhhhhhhhhccchhccccccCcccee-------eeeeeeeheeec Q lcl|NC_020871. 375 IARVPASKAENNVITFYDLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTF-------AVLWYGALALRA 447 (468) Q Consensus 375 igrv~~s~~~~~t~tf~D~N~~iPgt~~~fvge~np~vi~~~ellPm~k~pla~~~~~~~~-------~V~~ygaL~l~a 447 (468) +++.+.+ +.+|++-. +| .++.|. .+..|.|| |.+.+-+-..+ -.+||.-..+ T Consensus 210 ~aE~~a~-----~~s~vlPs--~~-------w~gpP~--~~~gL~pm---P~G~~~A~faGRi~~A~Gn~V~FSEp~~-- 268 (396) T protein:vir:10 210 AGDYPLG-----AATVILPT--LP-------ELGRPA--QFRHLSPM---PTGKHLAYWRGRLLIARANVLRFSEALA-- 268 (396) T ss_pred eehhccc-----eeeeeeec--CC-------CCCCCc--cccccccC---chhHhhhhhcceEEEEeCCEEEEecCCC-- Confidence 9988755 23444311 11 122232 24556665 66655444433 3333333222 Q ss_pred cceeEEEEEeeeeeccccc-----CC Q lcl|NC_020871. 448 PKKWVRIKNVKYIPVKNVH-----SN 468 (468) Q Consensus 448 Pkk~~~ikNV~~~~~~~~~-----~~ 468 (468) |.-|-.-++-.. ..+++- .+ T Consensus 269 Ph~~~~~~~~~~-~~~~Iv~lapv~~ 293 (396) T protein:vir:10 269 YHLHDERYGFVQ-MPQRITFVQPVDG 293 (396) T ss_pred CceecchhccCC-CCCceEEEEEecC Confidence 311111122111 111111 11 No 21 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=98.32 E-value=2.3e-08 Score=62.43 Aligned_cols=302 Identities=16% Similarity=0.126 Sum_probs=157.4 Q ss_pred eEEEEEEEEeeeehhhhhhhHhhhcchhhHH--HHHHHHHHHHHHHHHHHHH--hhcccccccCCCCCCCccccchhh-- Q lcl|NC_020871. 108 IRQKTVNMKFASDTKNISIAAGLVNNIQDPM--QILTDDAIVNIAKTIEWAS--FFGDSDLSDSPEPQAGLEFDGLAK-- 181 (468) Q Consensus 108 ~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~--~~~~~~ai~~~~~~~e~a~--f~Gd~~l~~~~~~~~gleFDGl~~-- 181 (468) +.|.+.- +-.-..+.-+|+ .+--.+.-.+-++.+|.++ -+|-..- --|++|=|-.+ T Consensus 1 m~~~~~~------------~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~------R~g~~~~~~~~~~ 62 (681) T protein:vir:10 1 MSNVRVL------------QRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAEN------RAGFAFVREVKDS 62 (681) T ss_pred CcceeEe------------eeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCcee------cChhHhhhhcCCC Confidence 1111110 111112333444 3333444445555555543 2332211 12444433222 Q ss_pred -----hcC----ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeee Q lcl|NC_020871. 182 -----LIN----QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFN 252 (468) Q Consensus 182 -----li~----~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~ 252 (468) ||. .++.+.++=..--.+ +-+.-|...+--+|..+..-|....|+.-++.|.++. T Consensus 63 ~~~~rlipf~~~~~~~~~l~~g~~~~r--------~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~-------- 126 (681) T protein:vir:10 63 AKKVRLIPFTYSVTQTMVIELGAGYFR--------FHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADV-------- 126 (681) T ss_pred CCcEEEEEEEeCCCceEEEEEeCCeEE--------EEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCE-------- Confidence 332 233333321111111 1122222222223443444454455566666664441 Q ss_pred ccceeecCCccccCCCEe--ecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC--Cccc Q lcl|NC_020871. 253 IQGFHSARGFIKLHGSTV--MENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA--ESIA 328 (468) Q Consensus 253 v~~~~s~~g~i~l~gs~i--~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G--ES~a 328 (468) ..-+|.... +..+ ...++|..+...+.+.+..|..++++...+. ...+|+|.|++++..+ ||.+ T Consensus 127 ---~~i~h~~~~--p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~~~~~-------~~~t~~~~v~avda~t~~~s~~ 194 (681) T protein:vir:10 127 ---LTLVHPNYA--PRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKG-------TDYTYRYVVTALDAEGKTESAP 194 (681) T ss_pred ---EEEECCCCc--ceEEEEccCCceEEEEEEeccccccceeeeeeccCCc-------cceeEeEEEEEeecccceeecC Confidence 122333333 2344 5778899998888876666666666543322 2357999999999887 8889 Q ss_pred ccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEe------cCCCCCCCCcc Q lcl|NC_020871. 329 SEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFY------DLNDSIPETVD 402 (468) Q Consensus 329 S~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~------D~N~~iPgt~~ 402 (468) +..++++....+.+...+++|.++.|+ .+|+|||. .++.+++++... .+.+. |.+.++|.... T Consensus 195 ~~~~tvt~~~~~~~~~~t~~w~a~~g~--~~~~V~~~--~~gi~g~ig~~~-------~~~~~~~~~~~~~~~t~~~~~~ 263 (681) T protein:vir:10 195 SSAGTCTNNLFTNGGANTIAWSASSGA--SRYNVYKE--QGGLYGYIGQTT-------GTSLVDDNIAPDLSVTPPIYDA 263 (681) T ss_pred CcceEEeeeeecCCcceeEEEEecCCc--eeeeeccc--ceeEEEEeeccc-------eeeeeecccccCcccccccccc Confidence 998999988888888889999999998 46999995 568888887422 22333 44445566655 Q ss_pred ceecCCC-hhhhhhhhhccchhc-------------------------c----------cccc-Cccceeeeeeeeehee Q lcl|NC_020871. 403 VFVGEMS-ANVVHLFELLPMMRL-------------------------P----------LAQI-NASVTFAVLWYGALAL 445 (468) Q Consensus 403 ~fvge~n-p~vi~~~ellPm~k~-------------------------p----------la~~-~~~~~~~V~~ygaL~l 445 (468) +|=.... |+++.|+|.| +.| + .+.. ...+.|+|..=..|.+ T Consensus 264 ~~~~~~gyP~~v~f~q~R--L~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~ 341 (681) T protein:vir:10 264 VFNAAGDYPAAVSYFEQR--RCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLL 341 (681) T ss_pred ccccCCCceEEEEEEcce--EEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEE Confidence 5544333 8888888877 222 1 1111 1245666665444444 Q ss_pred eccceeEE-------E--EEeeeeecccccCC Q lcl|NC_020871. 446 RAPKKWVR-------I--KNVKYIPVKNVHSN 468 (468) Q Consensus 446 ~aPkk~~~-------i--kNV~~~~~~~~~~~ 468 (468) .+=.-|.+ | +|++..++-..-++ T Consensus 342 t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~ 373 (681) T protein:vir:10 342 TSSGEWRVASVNSDAVTPTTISVRPQSYVGAT 373 (681) T ss_pred EcCcEEEEecCCCccccceeEEEEEeeeeccc Confidence 44444443 2 35544444333333 No 22 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=98.32 E-value=2.3e-08 Score=62.43 Aligned_cols=302 Identities=16% Similarity=0.126 Sum_probs=157.4 Q ss_pred eEEEEEEEEeeeehhhhhhhHhhhcchhhHH--HHHHHHHHHHHHHHHHHHH--hhcccccccCCCCCCCccccchhh-- Q lcl|NC_020871. 108 IRQKTVNMKFASDTKNISIAAGLVNNIQDPM--QILTDDAIVNIAKTIEWAS--FFGDSDLSDSPEPQAGLEFDGLAK-- 181 (468) Q Consensus 108 ~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~--~~~~~~ai~~~~~~~e~a~--f~Gd~~l~~~~~~~~gleFDGl~~-- 181 (468) +.|.+.- +-.-..+.-+|+ .+--.+.-.+-++.+|.++ -+|-..- --|++|=|-.+ T Consensus 1 m~~~~~~------------~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~------R~g~~~~~~~~~~ 62 (681) T protein:vir:98 1 MSNVRVL------------QRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAEN------RAGFAFVREVKDS 62 (681) T ss_pred CcceeEe------------eeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCcee------cChhHhhhhcCCC Confidence 1111110 111112333444 3333444445555555543 2332211 12444433222 Q ss_pred -----hcC----ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeee Q lcl|NC_020871. 182 -----LIN----QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFN 252 (468) Q Consensus 182 -----li~----~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~ 252 (468) ||. .++.+.++=..--.+ +-+.-|...+--+|..+..-|....|+.-++.|.++. T Consensus 63 ~~~~rlipf~~~~~~~~~l~~g~~~~r--------~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~-------- 126 (681) T protein:vir:98 63 AKKVRLIPFTYSVTQTMVIELGAGYFR--------FHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADV-------- 126 (681) T ss_pred CCcEEEEEEEeCCCceEEEEEeCCeEE--------EEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCE-------- Confidence 332 233333321111111 1122222222223443444454455566666664441 Q ss_pred ccceeecCCccccCCCEe--ecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC--Cccc Q lcl|NC_020871. 253 IQGFHSARGFIKLHGSTV--MENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA--ESIA 328 (468) Q Consensus 253 v~~~~s~~g~i~l~gs~i--~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G--ES~a 328 (468) ..-+|.... +..+ ...++|..+...+.+.+..|..++++...+. ...+|+|.|++++..+ ||.+ T Consensus 127 ---~~i~h~~~~--p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~~~~~-------~~~t~~~~v~avda~t~~~s~~ 194 (681) T protein:vir:98 127 ---LTLVHPNYA--PRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKG-------TDYTYRYVVTALDAEGKTESAP 194 (681) T ss_pred ---EEEECCCCc--ceEEEEccCCceEEEEEEeccccccceeeeeeccCCc-------cceeEeEEEEEeecccceeecC Confidence 122333333 2344 5778899998888876666666666543322 2357999999999887 8889 Q ss_pred ccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEe------cCCCCCCCCcc Q lcl|NC_020871. 329 SEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFY------DLNDSIPETVD 402 (468) Q Consensus 329 S~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~------D~N~~iPgt~~ 402 (468) +..++++....+.+...+++|.++.|+ .+|+|||. .++.+++++... .+.+. |.+.++|.... T Consensus 195 ~~~~tvt~~~~~~~~~~t~~w~a~~g~--~~~~V~~~--~~gi~g~ig~~~-------~~~~~~~~~~~~~~~t~~~~~~ 263 (681) T protein:vir:98 195 SSAGTCTNNLFTNGGANTIAWSASSGA--SRYNVYKE--QGGLYGYIGQTT-------GTSLVDDNIAPDLSVTPPIYDA 263 (681) T ss_pred CcceEEeeeeecCCcceeEEEEecCCc--eeeeeccc--ceeEEEEeeccc-------eeeeeecccccCcccccccccc Confidence 998999988888888889999999998 46999995 568888887422 22333 44445566655 Q ss_pred ceecCCC-hhhhhhhhhccchhc-------------------------c----------cccc-Cccceeeeeeeeehee Q lcl|NC_020871. 403 VFVGEMS-ANVVHLFELLPMMRL-------------------------P----------LAQI-NASVTFAVLWYGALAL 445 (468) Q Consensus 403 ~fvge~n-p~vi~~~ellPm~k~-------------------------p----------la~~-~~~~~~~V~~ygaL~l 445 (468) +|=.... |+++.|+|.| +.| + .+.. ...+.|+|..=..|.+ T Consensus 264 ~~~~~~gyP~~v~f~q~R--L~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~ 341 (681) T protein:vir:98 264 VFNAAGDYPAAVSYFEQR--RCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLL 341 (681) T ss_pred ccccCCCceEEEEEEcce--EEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEE Confidence 5544333 8888888877 222 1 1111 1245666665444444 Q ss_pred eccceeEE-------E--EEeeeeecccccCC Q lcl|NC_020871. 446 RAPKKWVR-------I--KNVKYIPVKNVHSN 468 (468) Q Consensus 446 ~aPkk~~~-------i--kNV~~~~~~~~~~~ 468 (468) .+=.-|.+ | +|++..++-..-++ T Consensus 342 t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~ 373 (681) T protein:vir:98 342 TSSGEWRVASVNSDAVTPTTISVRPQSYVGAT 373 (681) T ss_pred EcCcEEEEecCCCccccceeEEEEEeeeeccc Confidence 44444443 2 35544444333333 No 23 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=98.32 E-value=2.3e-08 Score=62.43 Aligned_cols=302 Identities=16% Similarity=0.126 Sum_probs=157.4 Q ss_pred eEEEEEEEEeeeehhhhhhhHhhhcchhhHH--HHHHHHHHHHHHHHHHHHH--hhcccccccCCCCCCCccccchhh-- Q lcl|NC_020871. 108 IRQKTVNMKFASDTKNISIAAGLVNNIQDPM--QILTDDAIVNIAKTIEWAS--FFGDSDLSDSPEPQAGLEFDGLAK-- 181 (468) Q Consensus 108 ~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~--~~~~~~ai~~~~~~~e~a~--f~Gd~~l~~~~~~~~gleFDGl~~-- 181 (468) +.|.+.- +-.-..+.-+|+ .+--.+.-.+-++.+|.++ -+|-..- --|++|=|-.+ T Consensus 1 m~~~~~~------------~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~------R~g~~~~~~~~~~ 62 (681) T protein:vir:10 1 MSNVRVL------------QRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAEN------RAGFAFVREVKDS 62 (681) T ss_pred CcceeEe------------eeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCcee------cChhHhhhhcCCC Confidence 1111110 111112333444 3333444445555555543 2332211 12444433222 Q ss_pred -----hcC----ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeee Q lcl|NC_020871. 182 -----LIN----QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFN 252 (468) Q Consensus 182 -----li~----~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~ 252 (468) ||. .++.+.++=..--.+ +-+.-|...+--+|..+..-|....|+.-++.|.++. T Consensus 63 ~~~~rlipf~~~~~~~~~l~~g~~~~r--------~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~-------- 126 (681) T protein:vir:10 63 AKKVRLIPFTYSVTQTMVIELGAGYFR--------FHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADV-------- 126 (681) T ss_pred CCcEEEEEEEeCCCceEEEEEeCCeEE--------EEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCE-------- Confidence 332 233333321111111 1122222222223443444454455566666664441 Q ss_pred ccceeecCCccccCCCEe--ecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC--Cccc Q lcl|NC_020871. 253 IQGFHSARGFIKLHGSTV--MENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA--ESIA 328 (468) Q Consensus 253 v~~~~s~~g~i~l~gs~i--~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G--ES~a 328 (468) ..-+|.... +..+ ...++|..+...+.+.+..|..++++...+. ...+|+|.|++++..+ ||.+ T Consensus 127 ---~~i~h~~~~--p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~~~~~-------~~~t~~~~v~avda~t~~~s~~ 194 (681) T protein:vir:10 127 ---LTLVHPNYA--PRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKG-------TDYTYRYVVTALDAEGKTESAP 194 (681) T ss_pred ---EEEECCCCc--ceEEEEccCCceEEEEEEeccccccceeeeeeccCCc-------cceeEeEEEEEeecccceeecC Confidence 122333333 2344 5778899998888876666666666543322 2357999999999887 8889 Q ss_pred ccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEe------cCCCCCCCCcc Q lcl|NC_020871. 329 SEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFY------DLNDSIPETVD 402 (468) Q Consensus 329 S~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~------D~N~~iPgt~~ 402 (468) +..++++....+.+...+++|.++.|+ .+|+|||. .++.+++++... .+.+. |.+.++|.... T Consensus 195 ~~~~tvt~~~~~~~~~~t~~w~a~~g~--~~~~V~~~--~~gi~g~ig~~~-------~~~~~~~~~~~~~~~t~~~~~~ 263 (681) T protein:vir:10 195 SSAGTCTNNLFTNGGANTIAWSASSGA--SRYNVYKE--QGGLYGYIGQTT-------GTSLVDDNIAPDLSVTPPIYDA 263 (681) T ss_pred CcceEEeeeeecCCcceeEEEEecCCc--eeeeeccc--ceeEEEEeeccc-------eeeeeecccccCcccccccccc Confidence 998999988888888889999999998 46999995 568888887422 22333 44445566655 Q ss_pred ceecCCC-hhhhhhhhhccchhc-------------------------c----------cccc-Cccceeeeeeeeehee Q lcl|NC_020871. 403 VFVGEMS-ANVVHLFELLPMMRL-------------------------P----------LAQI-NASVTFAVLWYGALAL 445 (468) Q Consensus 403 ~fvge~n-p~vi~~~ellPm~k~-------------------------p----------la~~-~~~~~~~V~~ygaL~l 445 (468) +|=.... |+++.|+|.| +.| + .+.. ...+.|+|..=..|.+ T Consensus 264 ~~~~~~gyP~~v~f~q~R--L~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~ 341 (681) T protein:vir:10 264 VFNAAGDYPAAVSYFEQR--RCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLL 341 (681) T ss_pred ccccCCCceEEEEEEcce--EEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEE Confidence 5544333 8888888877 222 1 1111 1245666665444444 Q ss_pred eccceeEE-------E--EEeeeeecccccCC Q lcl|NC_020871. 446 RAPKKWVR-------I--KNVKYIPVKNVHSN 468 (468) Q Consensus 446 ~aPkk~~~-------i--kNV~~~~~~~~~~~ 468 (468) .+=.-|.+ | +|++..++-..-++ T Consensus 342 t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~ 373 (681) T protein:vir:10 342 TSSGEWRVASVNSDAVTPTTISVRPQSYVGAT 373 (681) T ss_pred EcCcEEEEecCCCccccceeEEEEEeeeeccc Confidence 44444443 2 35544444333333 No 24 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.30 E-value=6.3e-07 Score=54.59 Aligned_cols=311 Identities=10% Similarity=0.024 Sum_probs=154.2 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCc---ccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITP---DTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTV 77 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p---~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv 77 (468) |-+..|+. -+... |.+....+-..++ ....+++.|-++.+..+|..+... ...+...+.+.++.+.- T Consensus 1 ~~~~~~~~----~~~~~----f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~--~s~l~~l~~~~~~~~~~ 70 (324) T protein:vir:96 1 MEQTQKLK----LNLQH----FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVME--NSKIMQLGKYEPMEGTE 70 (324) T ss_pred CCcchhhh----HHHHH----HHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHh--hchhhhhcceeeccCCc Confidence 54443322 12222 3333332222222 122345567788887777554433 33456666666666655 Q ss_pred hccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) .+|.++... +...+++|++..+..++++.+.....+=++-.-.+|.-+- .++..|.+....+.-...+++.+|.++ T Consensus 71 ~~~p~~~~~---~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~l~~aia~~~d~~~ 146 (324) T protein:vir:96 71 KKFTFWADK---PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred eEEEEEecC---cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHh Confidence 668777643 3456899999999999999999999999998888887432 245577888888888899999999999 Q ss_pred hhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCce Q lcl|NC_020871. 158 FFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQT 237 (468) Q Consensus 158 f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr 237 (468) |+|+-+- .+-.|+...+...+.... ..++.+.|..+...+...+..++-+.|++.+.+.+...--..-| T Consensus 147 l~G~g~~---------~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~ 215 (324) T protein:vir:96 147 ILNQGNN---------PFGKSIAQSIKKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred hhcCCCC---------CcCccccccccccceecc--cccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCC Confidence 9998431 122345554444443322 23455666666656677788888899999999998544322233 Q ss_pred EEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeE------EecCCCCCC----cCc Q lcl|NC_020871. 238 QLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTA------TQEAGKKGQ----FRA 307 (468) Q Consensus 238 ~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vta------t~~~~~~g~----~~~ 307 (468) .+.++.....-.|.+|- ++...... .+..++.+..-+. .... ....+-. +......+. |.. T Consensus 216 ~~~~~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd~s~~~---~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 286 (324) T protein:vir:96 216 ERIYDRNSDSLDGLPVV--NLKSSNLK-RGELITGDFDKLI---YGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) T ss_pred eeecCCCCCcccceeeE--eecCCCCC-cceEEEEecceEE---EEEe---cCcEEEEeecccccccccccccchhhhhc Confidence 33333333333666552 11111111 1223333222110 0000 0000000 000000100 100 Q ss_pred ccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceE Q lcl|NC_020871. 308 EDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 308 ~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y 360 (468) ..-.+++...=+-+=-.+...+-++.+..++.. | |-.+ T Consensus 287 ---n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~----~--------~~~~ 324 (324) T protein:vir:96 287 ---DMVALRATMHVALHIADDKAFAKLVPADKRTDS----V--------PGEV 324 (324) T ss_pred ---CcEEEEEEEEeccEEecccceEEEecccccCCC----C--------CCCC Confidence 011122221111110001111222211111100 1 1001 No 25 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.23 E-value=1.1e-06 Score=53.27 Aligned_cols=317 Identities=9% Similarity=0.016 Sum_probs=156.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) |-+..|++ .|.. ..+.+....+.|.|...+. ..+++.|-++.+..+|..+..... .+.+...+.+..+...+| T Consensus 1 ~~~~~~~~-~~~~-~f~~~~~~~~~~~a~~~~~---~~~~~~liP~~~~~~ii~~~~~~s--~l~~l~~~~~~~~~~~~i 73 (324) T protein:vir:93 1 MEQTQKLK-LNLQ-HFASNNVKPQVFNPDNVMM---HEKKDGTLLNDFTTPILQEVMENS--KIMQLGKYEPMEGTEKKF 73 (324) T ss_pred CchhHHHH-HHHH-HHHHhhhhhhhcccccccc---cCCCcceechhHHHHHHHHHHhhc--hhhhhcceeeccCCceEE Confidence 65544433 2222 2233334445555544322 223455778888887755544333 455555555666655567 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) .++.. .....+++|++..+..++++.+.....+=++..-.+|.-+- .++..|.+....+.--..+++.+|.++++| T Consensus 74 p~~~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~aia~~~d~a~l~G 149 (324) T protein:vir:93 74 TFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEec---CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 77763 33466999999999999999999999999998888877332 245567888888888889999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEe Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~ 240 (468) +.. +.+..|+...+...+.... | .++.+.|.++.-.+..+++....+.|++.+.+.+...--..-|.+. T Consensus 150 ~g~---------~~~~~~~~~~~~~~~~~~~-~-~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~ 218 (324) T protein:vir:93 150 QGN---------NPFGKSIAQSIEKTNKVIK-G-DFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred CCC---------CCcCccccccccccceecc-c-cccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeee Confidence 743 1223355555444444332 2 3556666666666777788888899999999999543222223333 Q ss_pred ecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeE------EecCCCCCC-cCcccceeE Q lcl|NC_020871. 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTA------TQEAGKKGQ-FRAEDLAAH 313 (468) Q Consensus 241 ~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vta------t~~~~~~g~-~~~~~~~~y 313 (468) ++.....-.|.+|- .+...... .+..++.+.+-+. .... ....+-. +......+. +.--....- T Consensus 219 ~~~~~~~l~G~PVv--~~~~~~~~-~~~i~~gdfs~~~---~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~ 289 (324) T protein:vir:93 219 YDRNSDSLDGLPVV--NLKSSNLK-RGELITGDFDKLI---YGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) T ss_pred cCCCCCcccceeeE--eecCCCCC-cceEEEEecceEE---EEEe---cCcEEEEeecccccccccccccchhhhhcCcE Confidence 33333333565552 12111111 1122233222110 0000 0000000 000000110 000000011 Q ss_pred EEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceE Q lcl|NC_020871. 314 EYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 314 ~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y 360 (468) .+++..--+-+=--+...+-.+.+..++. +.|..+ T Consensus 290 ~~r~~~r~d~~v~~~~a~~~l~~a~~~~~------------~~~~~~ 324 (324) T protein:vir:93 290 ALRATMHVALHIADDKAFAKLVPADKRTD------------SVPGEV 324 (324) T ss_pred EEEEEEEeccEEecccceEEEecccccCC------------CCCCCC Confidence 11221111111000111111111111110 011111 No 26 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.22 E-value=1.9e-06 Score=51.92 Aligned_cols=318 Identities=10% Similarity=0.026 Sum_probs=155.8 Q ss_pred CCCcccchhhcccChhhHHHHH--HHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDA--LKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~--~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ 78 (468) |-+.+|+. -+-..-|... .+.+++. +.....+++.|-++.+...|.... .+...++..+.+.++.+.-. T Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~~a~---~~~~~~~~~~~iP~~~~~~ii~~~--~~~s~l~~l~~~~~~~~~~~ 71 (324) T protein:vir:78 1 MEQTQKLK----LNLQHFASNNVKPQVFNPD---NVMMHEKKDGTLMNEFTTPILQEV--MENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CCcchhhh----HHHHHHHHHhhhhhhhccc---cccccCcCccccchhHHHHHHHHH--HhhchhhhhcceeeccCCce Confidence 54443322 2223333322 2234433 223334567788888888775544 33345666666777766556 Q ss_pred ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASF 158 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f 158 (468) +|.++... +...+++|++..+..++.+.+.....+=++---.+|.-+- .++..|.+....+.--..++..+|.++| T Consensus 72 ~~p~~~~~---~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~d~a~l 147 (324) T protein:vir:78 72 KFTFWADK---PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred EEEEEecC---cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 67777643 3456899999999999999999999999988888887432 2455678888888888999999999999 Q ss_pred hcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceE Q lcl|NC_020871. 159 FGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQ 238 (468) Q Consensus 159 ~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~ 238 (468) +|+..- -+..|+.+.+...+... ...++.+.|.++.-.+..++..+.-+.|++.+.+.+...--.--|. T Consensus 148 ~G~g~~---------~~~~gi~~~~~~~~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~ 216 (324) T protein:vir:78 148 LNQGNN---------PFGKSIAQSIEKTNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred ccCCCC---------CcCccccccccccceec--cccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCe Confidence 998431 12345666665544433 2345677777766667778888888999999999885433221222 Q ss_pred EeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCC----cCcccceeEE Q lcl|NC_020871. 239 LVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQ----FRAEDLAAHE 314 (468) Q Consensus 239 v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~----~~~~~~~~y~ 314 (468) +.++.....-.|.+|- .+....+. .+..++.+..-..--.......-.-...+.+......++ |.. .... T Consensus 217 ~~~~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~---d~~~ 290 (324) T protein:vir:78 217 RIYDRNSDSLDGLPVV--NLKSSNLK-RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ---DMVA 290 (324) T ss_pred eecCCCCCcccceeeE--eeCCCCCC-cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhc---CcEE Confidence 3322222223444442 11111111 112222221110000000000000000000000000000 000 0111 Q ss_pred EEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCcee Q lcl|NC_020871. 315 YKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLF 372 (468) Q Consensus 315 YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f 372 (468) |++..--+-+=-.|...+-.+....++. ++| |+. T Consensus 291 ~r~~~r~d~~v~~~~A~~~l~~a~~~~~------------~~~------------~~~ 324 (324) T protein:vir:78 291 LRATMHVALHIADDKAFAKLVPADKRTD------------SVP------------GEV 324 (324) T ss_pred EEEEEEEccEEecccceEEEecccccCC------------CCC------------CCC Confidence 1111111110000111111111100000 000 000 No 27 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.22 E-value=1.9e-06 Score=51.92 Aligned_cols=318 Identities=10% Similarity=0.026 Sum_probs=155.8 Q ss_pred CCCcccchhhcccChhhHHHHH--HHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDA--LKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~--~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ 78 (468) |-+.+|+. -+-..-|... .+.+++. +.....+++.|-++.+...|.... .+...++..+.+.++.+.-. T Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~~a~---~~~~~~~~~~~iP~~~~~~ii~~~--~~~s~l~~l~~~~~~~~~~~ 71 (324) T protein:vir:96 1 MEQTQKLK----LNLQHFASNNVKPQVFNPD---NVMMHEKKDGTLMNEFTTPILQEV--MENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CCcchhhh----HHHHHHHHHhhhhhhhccc---cccccCcCccccchhHHHHHHHHH--HhhchhhhhcceeeccCCce Confidence 54443322 2223333322 2234433 223334567788888888775544 33345666666777766556 Q ss_pred ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASF 158 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f 158 (468) +|.++... +...+++|++..+..++.+.+.....+=++---.+|.-+- .++..|.+....+.--..++..+|.++| T Consensus 72 ~~p~~~~~---~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~d~a~l 147 (324) T protein:vir:96 72 KFTFWADK---PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred EEEEEecC---cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 67777643 3456899999999999999999999999988888887432 2455678888888888999999999999 Q ss_pred hcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceE Q lcl|NC_020871. 159 FGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQ 238 (468) Q Consensus 159 ~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~ 238 (468) +|+..- -+..|+.+.+...+... ...++.+.|.++.-.+..++..+.-+.|++.+.+.+...--.--|. T Consensus 148 ~G~g~~---------~~~~gi~~~~~~~~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~ 216 (324) T protein:vir:96 148 LNQGNN---------PFGKSIAQSIEKTNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred ccCCCC---------CcCccccccccccceec--cccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCe Confidence 998431 12345666665544433 2345677777766667778888888999999999885433221222 Q ss_pred EeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCC----cCcccceeEE Q lcl|NC_020871. 239 LVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQ----FRAEDLAAHE 314 (468) Q Consensus 239 v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~----~~~~~~~~y~ 314 (468) +.++.....-.|.+|- .+....+. .+..++.+..-..--.......-.-...+.+......++ |.. .... T Consensus 217 ~~~~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~---d~~~ 290 (324) T protein:vir:96 217 RIYDRNSDSLDGLPVV--NLKSSNLK-RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ---DMVA 290 (324) T ss_pred eecCCCCCcccceeeE--eeCCCCCC-cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhc---CcEE Confidence 3322222223444442 11111111 112222221110000000000000000000000000000 000 0111 Q ss_pred EEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCcee Q lcl|NC_020871. 315 YKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLF 372 (468) Q Consensus 315 YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f 372 (468) |++..--+-+=-.|...+-.+....++. ++| |+. T Consensus 291 ~r~~~r~d~~v~~~~A~~~l~~a~~~~~------------~~~------------~~~ 324 (324) T protein:vir:96 291 LRATMHVALHIADDKAFAKLVPADKRTD------------SVP------------GEV 324 (324) T ss_pred EEEEEEEccEEecccceEEEecccccCC------------CCC------------CCC Confidence 1111111110000111111111100000 000 000 No 28 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.12 E-value=4.1e-06 Score=50.14 Aligned_cols=313 Identities=9% Similarity=0.012 Sum_probs=152.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCc---ccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITP---DTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTV 77 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p---~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv 77 (468) |-+..|+. ..--.|.+.+..|...++ ....+++.|-++.+...|....- +...+.......++.+.- T Consensus 1 ~~~~~~~~--------~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~--~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:10 1 MEQTQKLK--------LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVM--ENSKIMQLGKYEPMEGTE 70 (324) T ss_pred CCCchHHH--------HHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHH--hhchhhhhcceeeccCCc Confidence 54443322 222234455444433222 22334555778877777744333 333556656666666554 Q ss_pred hccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) .+|.++. +.+...+++|++..+..++.+.......+=++..-.+|.-+- .++..|.+....+.-...+++.+|.++ T Consensus 71 ~~~p~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~d~a~ 146 (324) T protein:vir:10 71 KKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred eEEEEEe---CCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHh Confidence 5677665 234567999999999999999999999999998888887432 245567888888888899999999999 Q ss_pred hhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCc- Q lcl|NC_020871. 158 FFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ- 236 (468) Q Consensus 158 f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~q- 236 (468) |+|+-. +.+--|+.+.+...+.... ..++.+.|..+...+..++..+.-+.|++.+.+.+... .+.+ T Consensus 147 l~G~g~---------~~~~~~i~~~~~~~~~~~~--~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~g 214 (324) T protein:vir:10 147 ILNQGN---------NPFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPET 214 (324) T ss_pred hhcCCC---------CccCccccccccccceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hccCC Confidence 999843 1122355665555554432 34667777777777777888888899999999998543 2222 Q ss_pred eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeE------EecCCCCCC-cCccc Q lcl|NC_020871. 237 TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTA------TQEAGKKGQ-FRAED 309 (468) Q Consensus 237 r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vta------t~~~~~~g~-~~~~~ 309 (468) |.+.++.....-.|.+|- ++...... .+..++.+..-. ..... ....+-. +...+..+. +.--. T Consensus 215 ~~~~~~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd~~~~---~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 285 (324) T protein:vir:10 215 KERIYDRNSDTLDGLPVV--NLKSSNLK-RGELITGDFDKL---IYGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred ceeecCCCCccccceeEE--eecCCCCC-cceEEEEecccE---EEEEe---cCcEEEEeecccccccccccccchhhhh Confidence 222222222222444431 11100000 111112111100 00000 0000000 000000000 00000 Q ss_pred ceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceE Q lcl|NC_020871. 310 LAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 310 ~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y 360 (468) .....+++...-+.+=--++..+-.+.....+.. |+.. + T Consensus 286 ~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~----~~~~--------~ 324 (324) T protein:vir:10 286 QDMVALRATMHVALHIADDKAFAKLVPADKKTDS----VPGE--------V 324 (324) T ss_pred cCcEEEEEEEEEccEEecccceEEEEeccCCCCC----CCCC--------C Confidence 0011111111111000001111111111000000 0011 1 No 29 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.08 E-value=3.6e-06 Score=50.46 Aligned_cols=317 Identities=11% Similarity=0.039 Sum_probs=153.8 Q ss_pred CCCcccchhhcccChhhHHHHHHHH--hhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKS--FTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ks--f~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ 78 (468) |-+..|+. + +...-|....+. +.+... ...++|+.|-++.+..+|.... .+...+.+...+.+..+--. T Consensus 1 ~~~~~~~~-~---~~~~f~~~~~~~~~~~a~~~---~~~~~~~~~iP~~~~~~ii~~~--~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQKLK-L---NLQHFASNNVKPQVFNPDNV---MMHEKKDGTLMNEFTTPILQEV--MENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CccchhHH-H---HHHHHHHhhhhhhhhccccc---cccCCCcceechhHHHHHHHHH--HhhcchhhhcceeeccCCce Confidence 54443222 1 122223222222 222221 2234467788888877774443 33335666666666665445 Q ss_pred ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASF 158 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f 158 (468) +|.++.. .+...+++|++..+..++.+.......+=++---.+|.-+ +.++..|.+....+.-...++..+|.++| T Consensus 72 ~ip~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~l~~~i~~~l~~aia~~~d~a~l 147 (324) T protein:vir:97 72 KFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred EEEEEec---CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 6666653 3345689999999999999999999999999888888732 23455688888889999999999999999 Q ss_pred hcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cCCce Q lcl|NC_020871. 159 FGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LSKQT 237 (468) Q Consensus 159 ~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~~qr 237 (468) .|+.. +.+.-|+.+.+...+.... ..++.+.|.++...+..++..+.-+.|++.+.+.|...- -.++. T Consensus 148 ~G~g~---------~~~~~gi~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~ 216 (324) T protein:vir:97 148 LNQGN---------NPFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred ccCCC---------CccCccccccccccceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 99843 2234566666666665433 335677777777777788888888999999999985432 12233 Q ss_pred EEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCC----cCcccceeE Q lcl|NC_020871. 238 QLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQ----FRAEDLAAH 313 (468) Q Consensus 238 ~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~----~~~~~~~~y 313 (468) .++ +...+.-.|.+| +++....+. .+..++.+..-+---...-...-.-.....+...+.+++ |.. | .- T Consensus 217 ~~~-~~~~~tl~G~PV--~~~~~~~~~-~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-d--~~ 289 (324) T protein:vir:97 217 RIY-DRNSDTLDGLPV--VNLKSSNLK-RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D--MV 289 (324) T ss_pred eec-CCCCccccceee--EeecCCCCC-cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhc-C--cE Confidence 332 222222244443 111111111 111222221110000000000000000000000001110 100 0 01 Q ss_pred EEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceE Q lcl|NC_020871. 314 EYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 314 ~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y 360 (468) .+++..--+-+=--++..+- |++.- +..-++|-.+ T Consensus 290 ~~r~~~r~d~~v~~~~a~~~-----------l~~~~-~~~~~~~~~~ 324 (324) T protein:vir:97 290 ALRATMHVALHIADDKAFAK-----------LVPAD-KKTDSVPGEV 324 (324) T ss_pred EEEEEEEeccEEecccceEE-----------EEecc-CCCCCCCCCC Confidence 11111100000000010111 11110 0111111111 No 30 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.93 E-value=7.4e-06 Score=48.72 Aligned_cols=309 Identities=12% Similarity=0.006 Sum_probs=141.9 Q ss_pred CCCcccchhhcccChh----hHHHHHHHHhhcccc-------------cCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLN----SVQEDALKSFTTGYG-------------ITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~----~~~e~~~Ksf~agy~-------------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) +++..+....++.... ..-..+.+.+..+.. ....+-.+++.|-++.+..+|..+... +-. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~--~~~ 164 (418) T protein:vir:10 87 GGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQR--KMT 164 (418) T ss_pred cccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhh--hhh Confidence 2222222111111111 111122222222211 111233456678888888777554433 345 Q ss_pred chhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) +++.+...++.+.-.+|.+.... .....+++|++..+.+++.+......++-++..-.+|.- +.+...|.+....+ T Consensus 165 l~~~~~~~~~~~~~~~~~~~~~~--~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~e--ll~ds~~l~~~i~~ 240 (418) T protein:vir:10 165 IRDLLMPGQTSSSSIEYTVETGF--TNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQ--ILDDAPALQSYIDG 240 (418) T ss_pred HHhhcceeeccCCceeEEEEecC--CCceeeeccCccccccccceeeEEEeeeeEEEeehhhHH--HHHhHHHHHHHHHH Confidence 66666666665443455555432 334568999999999999999999999999987777775 44444688888889 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .-...+++.++.++|+|+.. +-+.-||.+.......--.-....+.+.|..+.-.+...++..+-++|++. T Consensus 241 ~l~~a~~~~~d~a~l~G~g~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 311 (418) T protein:vir:10 241 RARYGLQLTEEGQILKGDGT---------GANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPI 311 (418) T ss_pred HHHHHHHHHHHHHHhccCCC---------CccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHH Confidence 99999999999999999743 112356666543322211112223445555555555677788888999999 Q ss_pred HHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeeccccccccccccc-----CCCCCCcceeEEe Q lcl|NC_020871. 224 VQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILAL-----PTAPQQAKVTATQ 297 (468) Q Consensus 224 v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~-----p~ap~~~~vtat~ 297 (468) +.+.+.. ..+.+ |.+.+ +..... .-.|.|..++.++..-.....+- -....-..++-.. T Consensus 312 ~~~~L~~-lkd~~G~~i~~-~~~~~~-------------~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~ 376 (418) T protein:vir:10 312 DWASIEL-TKDSQGRYIVG-NPVNGT-------------TPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLL 376 (418) T ss_pred HHHHHHH-hhcCCCceecc-ccccCC-------------CceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEE Confidence 9988843 33331 22222 111000 01122332222211100000000 0000000000000 Q ss_pred cCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCc Q lcl|NC_020871. 298 EAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETG 370 (468) Q Consensus 298 ~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G 370 (468) .......|.. ....|++...=+-+---|...+-.+++ +.. +| T Consensus 377 ~~~~~~~f~~---~~~~~r~~~~~d~~~~~~~a~~~~~~~-------------~~~---------------~g 418 (418) T protein:vir:10 377 STENVDDFEK---NMVSIRAEERLALAVYRPESFVTGALV-------------EQA---------------GG 418 (418) T ss_pred ecccchhhhc---CceEEEEEEeeccEEecccceEEEEec-------------cCC---------------CC Confidence 0000000100 111121111100000001111111111 111 11 No 31 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.87 E-value=9.9e-06 Score=48.03 Aligned_cols=295 Identities=11% Similarity=0.033 Sum_probs=139.7 Q ss_pred CCCcccchhhcccChhhHHH-HHHHHhhcccc-----cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQE-DALKSFTTGYG-----ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPAT 74 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e-~~~Ksf~agy~-----~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~ 74 (468) .+..++....++....+.|. +|.+.+..+.. .+-.+.++|+.|.++.+..+|..+..... .+++.+...++. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~~~~~ 149 (397) T protein:vir:49 72 SEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYD--SLQEYVNVENVT 149 (397) T ss_pred ccccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhh--hHHhhhceeecc Confidence 11111111111111122221 23333332321 12234466888999999888866555444 455555555555 Q ss_pred hhhhccceeeeeccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 75 STVAKYDVYMQHGKVGHTRFTREIGVA-PVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTI 153 (468) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~ 153 (468) +....|.......+.+...+++|++.. +.+++.+......++-++.-..+|.-+ +.++..|.+....+.-...++..+ T Consensus 150 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~~~~~~ 228 (397) T protein:vir:49 150 TLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSL-LADSAENILAWLSGWIAKKVVVTR 228 (397) T ss_pred cCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHH-HhhhHHHHHHHHHHHHHHHHHHHH Confidence 433344332223344556799999885 578999999999999999888888754 245667888888899999999999 Q ss_pred HHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh- Q lcl|NC_020871. 154 EWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ- 232 (468) Q Consensus 154 e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~- 232 (468) +.+++.|+..-... . ...+.+.|.++-.-+..+|...+..+|++.+.+.+...- T Consensus 229 d~ai~~G~g~~~~~---~----------------------~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd 283 (397) T protein:vir:49 229 NKAILEAIAALPTK---P----------------------TLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKN 283 (397) T ss_pred HHHHHhhccccccc---c----------------------ccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhc Confidence 99999998653211 1 111223333333334455666678899999998884432 Q ss_pred cCCceEEeecCC----Ccceeeeecc----ceeecCCccccCCCEeecccc--cccccccccCCCCCCcceeEEecCCCC Q lcl|NC_020871. 233 LSKQTQLVRDNG----NNVSVGFNIQ----GFHSARGFIKLHGSTVMENEQ--ILDERILALPTAPQQAKVTATQEAGKK 302 (468) Q Consensus 233 ~~~qr~v~~~n~----~~~~~G~~v~----~~~s~~g~i~l~gs~i~~~~n--~l~~~~~~~p~ap~~~~vtat~~~~~~ 302 (468) -.+++.++ ++. +..-.|.+|- .++...+.- ....++.+.. .+...+.... + ....... T Consensus 284 ~~G~~l~~-~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~--~~~i~~gd~~~~~~~~~~~~~~-------i--~~~~~~~ 351 (397) T protein:vir:49 284 ALGDYLME-RDVKSPTGYSIDGFAVKEVADRWLANGTGG--AMPLYFGDLKQAVTLFDRQHMS-------L--LSTNIGG 351 (397) T ss_pred CCCceeec-cCcCCCCCceecceeeEEecccccccccCC--ceeEEEeeccceEEEEeecceE-------E--EEecccc Confidence 23444443 322 2233555541 111111100 0012222111 1110000000 0 0000000 Q ss_pred CCcCcc----------cceeEEE-EEEEEc--ccCCcccccceeeeeec Q lcl|NC_020871. 303 GQFRAE----------DLAAHEY-KVVVSS--DDAESIASEVATATVTA 338 (468) Q Consensus 303 g~~~~~----------~~~~y~Y-kVtavn--~~GES~aS~~vt~Tv~a 338 (468) ..|... |...+.- .++.+. .-.++.+.. .+++. T Consensus 352 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~---~~~~~ 397 (397) T protein:vir:49 352 GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNL---GSTAV 397 (397) T ss_pred chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCc---ccccC Confidence 001100 0000000 111111 111222221 11111 No 32 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.82 E-value=1.3e-05 Score=47.36 Aligned_cols=304 Identities=11% Similarity=-0.005 Sum_probs=138.9 Q ss_pred CCC-----cccchhhcccChhhHHHHHH-------------HHhhcccccCcccccCccccchhhhhhHh-hhhhhcccc Q lcl|NC_020871. 1 MPK-----NNKEEEVKEVNLNSVQEDAL-------------KSFTTGYGITPDTQTDAGALRREFLDDQI-SMLTWTEND 61 (468) Q Consensus 1 ~~~-----~~~~~~~~~~n~~~~~e~~~-------------Ksf~agy~~~p~~~~~gaALr~esld~~i-~~L~~~~~~ 61 (468) +-+ .++....+.......+...+ +++...-. ...+.++|+.|.++.+..++ ..+. ++ T Consensus 202 ~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~-~~~t~~~gg~lip~~~~~~ii~~~~---~~ 277 (543) T protein:vir:81 202 FDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRA-MGLTKADGGYLVPFQLDPTVIITSN---GS 277 (543) T ss_pred HHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhh-cccccccCcccCchhhhhHHHHHHH---hh Confidence 000 00000000001111111111 12211111 11234457778777766554 2222 22 Q ss_pred ccchhhhc-ccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHH Q lcl|NC_020871. 62 LTFYKDIA-KKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQI 140 (468) Q Consensus 62 f~~~~~i~-k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~ 140 (468) ...+..+. .......+ .|.+.. +.....+++|++..+.+++.+......++-++.-..+|.-+- .++ .|.+.. T Consensus 278 ~~~l~~~~~~~~~~g~~-~~~~~~---~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell-~d~-~~~~~~ 351 (543) T protein:vir:81 278 LNDIRRFARQVVATGDV-WHGVSS---AAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEAL-QDE-ANVTET 351 (543) T ss_pred hchhhhhcccccCCcce-EEEEec---CCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHH-hcc-HHHHHH Confidence 22222222 22222322 222222 334577899999999999999999999999999988988542 345 589999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc--cceeeccCCCCCHHHHhhhhhhhhhccCceEEE Q lcl|NC_020871. 141 LTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDARGASLTESLLNQAAVMISKGYGTPTDA 218 (468) Q Consensus 141 ~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~--~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~ 218 (468) ..+.-...++..++.++|+||-. |-++.||.+.-.. ..+..+.+..++.+.+..+...+-.+|.....+ T Consensus 352 i~~~l~~~~~~~~d~ail~G~Gt---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 422 (543) T protein:vir:81 352 VALLFAEGKDELEAVTLTTGTGQ---------GNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAW 422 (543) T ss_pred HHHHHHHHHHHHHHHHHhccCCC---------CcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEE Confidence 99999999999999999999832 2367788774432 245566666677777777666666778777779 Q ss_pred ecCHHHHhhHHHhhcCC--ceEEeecCCCcceeeeeccceeecCCccccCCCEeeccccccccccccc------------ Q lcl|NC_020871. 219 YMPVGVQADFVNQQLSK--QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILAL------------ 284 (468) Q Consensus 219 ~m~~~v~a~~~~~~~~~--qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~------------ 284 (468) +|++.+.+.+.. .-+. ...+.+...+.. -.|.|.-++..++.-....+.. T Consensus 423 v~n~~~~~~l~~-lkd~~G~~l~~~~~~g~~---------------~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~ 486 (543) T protein:vir:81 423 LANNLIYNKIRQ-FDTQGGAGLWTTIGNGEP---------------SQLLGRPVGEAEAMDANWNTSASADNFVLLYGNF 486 (543) T ss_pred EEcHHHHHHHHH-hhcCCCceeccCcCCCCC---------------ccccceeeEEeccccccccccccCCcceEEEeec Confidence 999999999854 3332 233322111110 1122332222111100000000 Q ss_pred --CCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeec Q lcl|NC_020871. 285 --PTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAP 351 (468) Q Consensus 285 --p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~ 351 (468) -....-..++-......... ..-..+...|++..--+.+---+...+.+++ +-++ T Consensus 487 ~~~~i~~~~~~~i~~~~~~~~~-~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~-----------~~~a 543 (543) T protein:vir:81 487 QNYVIADRIGMTVEFIPHLFGT-NRRPNGSRGWFAYYRMGADVVNPNAFRLLNV-----------ETAS 543 (543) T ss_pred cceeEEeecccEEEEecccccc-chhhcCceEEEEEEeeccEeecccceEEEEe-----------cccC Confidence 00000000000000000000 0000011122221111110000111111111 1111 No 33 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=291 Identities=12% Similarity=0.052 Sum_probs=142.1 Q ss_pred hhccc--ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccc Q lcl|NC_020871. 26 FTTGY--GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPV 103 (468) Q Consensus 26 f~agy--~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~ 103 (468) |..+. ..+..+-.+|++|-++.+.+++....... -.+.+...+.+..+--.+|.++. +.....+++|++..+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~--~~l~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~~~~ 75 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMAN--SAIMKLAKNEPMTAQKKKFTYLA---KGVGAYWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhc--cchhhhcceeeccCCceEEEEEe---CCcceEEeecCccccc Confidence 32221 11222234566788888877775444333 34666666666665445566655 3335668999999999 Q ss_pred cCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhc Q lcl|NC_020871. 104 SDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI 183 (468) Q Consensus 104 ~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li 183 (468) .++.+.......+=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+..-. ..|..-+|+..-. T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~-----~~~~~~~~~~~~~ 149 (304) T protein:vir:94 76 SKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY-----NTSTSGKPLVEGA 149 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCc-----ccccccccccccc Confidence 9999999999999999888888754 345677888888888889999999999999986532 1233223333322 Q ss_pred CccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecCCc Q lcl|NC_020871. 184 NQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSARGF 262 (468) Q Consensus 184 ~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~ 262 (468) .. ......+....-+.|.++...+..++....-+.|++.+.+.|.. ..+. -|.+.+++.+. -.|.+| +.+..-. T Consensus 150 ~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~-lkd~~G~~l~~~~~~~-l~G~PV--~~~~~~~ 224 (304) T protein:vir:94 150 EE-KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN-ALDANDRPLFDANGNE-IMGLPL--SYTGADV 224 (304) T ss_pred cc-cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH-hhccCCcEeecCCCcc-ccceee--EEecccc Confidence 22 22333344445555555555566667777789999999999954 3322 23344343221 234443 1111111 Q ss_pred cccCCC-EeecccccccccccccCCCCCCcceeEEec--------CCCCCCcC-cccceeEEEEEEEEcccCCcccccce Q lcl|NC_020871. 263 IKLHGS-TVMENEQILDERILALPTAPQQAKVTATQE--------AGKKGQFR-AEDLAAHEYKVVVSSDDAESIASEVA 332 (468) Q Consensus 263 i~l~gs-~i~~~~n~l~~~~~~~p~ap~~~~vtat~~--------~~~~g~~~-~~~~~~y~YkVtavn~~GES~aS~~v 332 (468) ...... .++.+.+-+ ....- - ...+.-... ....|+.. .--...-.|++...- T Consensus 225 ~~~~~~~~~~gd~~~~---~~~~~-~--~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~----------- 287 (304) T protein:vir:94 225 YDKKKSLALMGDWDYA---RYGIL-Q--GIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHI----------- 287 (304) T ss_pred cCCCCcEEEEEehhhE---EEEEe-c--ceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEe----------- Confidence 110011 111111000 00000 0 000000000 00000000 000001111211111 Q ss_pred eeeeeccCcceEEEEEeecCCCcccceEEEEeecCC Q lcl|NC_020871. 333 TATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAE 368 (468) Q Consensus 333 t~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~ 368 (468) .|.... |+-+.+-.. ++ T Consensus 288 ---------------~~~v~~---~~a~~~l~~-a~ 304 (304) T protein:vir:94 288 ---------------AYMNVK---PEAFATLKP-TE 304 (304) T ss_pred ---------------ccEeec---ccceEEEEe-cC Confidence 111100 011111111 11 No 34 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=291 Identities=12% Similarity=0.052 Sum_probs=142.1 Q ss_pred hhccc--ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccc Q lcl|NC_020871. 26 FTTGY--GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPV 103 (468) Q Consensus 26 f~agy--~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~ 103 (468) |..+. ..+..+-.+|++|-++.+.+++....... -.+.+...+.+..+--.+|.++. +.....+++|++..+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~--~~l~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~~~~ 75 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMAN--SAIMKLAKNEPMTAQKKKFTYLA---KGVGAYWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhc--cchhhhcceeeccCCceEEEEEe---CCcceEEeecCccccc Confidence 32221 11222234566788888877775444333 34666666666665445566655 3335668999999999 Q ss_pred cCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhc Q lcl|NC_020871. 104 SDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI 183 (468) Q Consensus 104 ~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li 183 (468) .++.+.......+=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+..-. ..|..-+|+..-. T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~-----~~~~~~~~~~~~~ 149 (304) T protein:vir:10 76 SKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY-----NTSTSGKPLVEGA 149 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCc-----ccccccccccccc Confidence 9999999999999999888888754 345677888888888889999999999999986532 1233223333322 Q ss_pred CccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecCCc Q lcl|NC_020871. 184 NQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSARGF 262 (468) Q Consensus 184 ~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~ 262 (468) .. ......+....-+.|.++...+..++....-+.|++.+.+.|.. ..+. -|.+.+++.+. -.|.+| +.+..-. T Consensus 150 ~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~-lkd~~G~~l~~~~~~~-l~G~PV--~~~~~~~ 224 (304) T protein:vir:10 150 EE-KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN-ALDANDRPLFDANGNE-IMGLPL--SYTGADV 224 (304) T ss_pred cc-cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH-hhccCCcEeecCCCcc-ccceee--EEecccc Confidence 22 22333344445555555555566667777789999999999954 3322 23344343221 234443 1111111 Q ss_pred cccCCC-EeecccccccccccccCCCCCCcceeEEec--------CCCCCCcC-cccceeEEEEEEEEcccCCcccccce Q lcl|NC_020871. 263 IKLHGS-TVMENEQILDERILALPTAPQQAKVTATQE--------AGKKGQFR-AEDLAAHEYKVVVSSDDAESIASEVA 332 (468) Q Consensus 263 i~l~gs-~i~~~~n~l~~~~~~~p~ap~~~~vtat~~--------~~~~g~~~-~~~~~~y~YkVtavn~~GES~aS~~v 332 (468) ...... .++.+.+-+ ....- - ...+.-... ....|+.. .--...-.|++...- T Consensus 225 ~~~~~~~~~~gd~~~~---~~~~~-~--~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~----------- 287 (304) T protein:vir:10 225 YDKKKSLALMGDWDYA---RYGIL-Q--GIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHI----------- 287 (304) T ss_pred cCCCCcEEEEEehhhE---EEEEe-c--ceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEe----------- Confidence 110011 111111000 00000 0 000000000 00000000 000001111211111 Q ss_pred eeeeeccCcceEEEEEeecCCCcccceEEEEeecCC Q lcl|NC_020871. 333 TATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAE 368 (468) Q Consensus 333 t~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~ 368 (468) .|.... |+-+.+-.. ++ T Consensus 288 ---------------~~~v~~---~~a~~~l~~-a~ 304 (304) T protein:vir:10 288 ---------------AYMNVK---PEAFATLKP-TE 304 (304) T ss_pred ---------------ccEeec---ccceEEEEe-cC Confidence 111100 011111111 11 No 35 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=97.80 E-value=1.7e-05 Score=46.76 Aligned_cols=304 Identities=13% Similarity=0.042 Sum_probs=142.6 Q ss_pred CCCccc---chhhcccC-----hhhHHHHHHHHhhcccc-------cCcccccCccccchhhhhhHhhhhhhccccccch Q lcl|NC_020871. 1 MPKNNK---EEEVKEVN-----LNSVQEDALKSFTTGYG-------ITPDTQTDAGALRREFLDDQISMLTWTENDLTFY 65 (468) Q Consensus 1 ~~~~~~---~~~~~~~n-----~~~~~e~~~Ksf~agy~-------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~ 65 (468) +....+ .+...+.. ....-+.+.+.+..+.. ++..+.. ++.|-+..+..+|..+.. +...++ T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~vp~~~~~~ii~~~~--~~~~l~ 144 (395) T protein:vir:43 68 MLANEKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGS-GGALVAPDRRPGVVAAPQ--RRLTIR 144 (395) T ss_pred HHhhhccccccchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCC-CccccchhhHHHHHHHHH--hhhhHH Confidence 111100 00000000 00111122222222221 1222333 444555556666644433 344577 Q ss_pred hhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHH Q lcl|NC_020871. 66 KDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDA 145 (468) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~a 145 (468) +.+.+.++.+...+|.+.... .+...+++|++..+.+++.+......++=++-...+|.-+ .+...+.+....+.- T Consensus 145 ~l~~~~~~~~~~~~~~~~~~~--~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l~d~~~l~~~v~~~l 220 (395) T protein:vir:43 145 DLVAPGTTESNSVEYVRETGF--VNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI--LDDASALQSYIDARA 220 (395) T ss_pred hhccceecCCCceEEEEEecC--CCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH--HHhHHHHHHHHHHHH Confidence 777777777766677776633 3345689999999999999999999999999988888764 343457778888888 Q ss_pred HHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccce--eeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 146 IVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNV--HDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 146 i~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nv--iDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) ...++..++.++++|+.. +-.+.||.+......+ ...-......+.|..+...+..+++...-+.|++. T Consensus 221 a~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~ 291 (395) T protein:vir:43 221 RYGLMLVEECQLLYGNGT---------GANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPI 291 (395) T ss_pred HHHHHHHHHHHHHhccCC---------CCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHH Confidence 889999999999999732 2245677664432211 11112223455555565666677888888999999 Q ss_pred HHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccC------CCCCCcceeEE Q lcl|NC_020871. 224 VQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALP------TAPQQAKVTAT 296 (468) Q Consensus 224 v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p------~ap~~~~vtat 296 (468) +.+.|.. ..+.+ |.+.++ ..+.. .-.|.|..|+.++.. .......- ....-..++-. T Consensus 292 ~~~~l~~-lkd~~G~~i~~~-~~~~~-------------~~~l~G~pVv~~~~~-~~~~~~~gd~~~~~~~~~~~~~~i~ 355 (395) T protein:vir:43 292 DWALIEL-NKDAENRYIIGS-PQNGT-------------TPTLWRLPVVETQAI-TQDEFLTGAFSLGAQIFDRMDIEVL 355 (395) T ss_pred HHHHHHH-hhccCCceeccc-cccCC-------------CceecceeeEEcCCC-CCCcEEEEeccceEEEEEecceEEE Confidence 9988843 33332 222221 11111 011223222222111 00000000 00000000000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeecc Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAK 339 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~ 339 (468) ........|. .....|++..--+-+---|...+.+++++. T Consensus 356 ~~~~~~~~f~---~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 356 VSTENDKDFE---NNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred Eeccccchhh---cCcEEEEEEEeeccEEecccceEEEEeccC Confidence 0000000011 011222222111111111112222232221 No 36 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.75 E-value=2.2e-05 Score=46.10 Aligned_cols=315 Identities=10% Similarity=0.015 Sum_probs=147.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) |-+..|+. .+. --.+.+......|.+... ....+++.|-++.+...|..+..... .+.....+.+..+.-.+| T Consensus 1 ~~k~~~~~-~~~-~~~~~~~~~~~~~~a~~~---~~~~~~~~lip~~~~~~ii~~~~~~s--~l~~~~~~~~~~~~~~~~ 73 (324) T protein:vir:99 1 MEQTQKLK-LNL-QHFASNNVKPQVFNPDNV---MMHEKKDGTLLNDFTTPILQEVMENS--KIMRLGKYEPMEGTEKKF 73 (324) T ss_pred CCCchHhh-HHH-HHHHHHhhhhhhccccce---eccCCCcceechhHHHHHHHHHHhhc--hhhhhcceeeccCCceEE Confidence 65442222 111 001111111112333222 22234555778888777755443333 455555555555443456 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) .++. +.+...+++|++..+..++.+.+....++=++..-.+|.-+- .++..|.+....+.-..++++.+|.++++| T Consensus 74 p~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G 149 (324) T protein:vir:99 74 TFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEe---cCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 6655 234567999999999999999999999999998888887432 245568888888899999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cCCceEE Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LSKQTQL 239 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~~qr~v 239 (468) +-. +.+.-|+.+.+...+.... ..++.+.|.++...+..++..+.-+.|++.+.+.+...- -.++..+ T Consensus 150 ~g~---------~~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~ 218 (324) T protein:vir:99 150 QGN---------NPFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred CCC---------CccCccccccccccceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 843 1223355555555544332 235666677766677778888888999999999985432 1222222 Q ss_pred eecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccc--------e Q lcl|NC_020871. 240 VRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDL--------A 311 (468) Q Consensus 240 ~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~--------~ 311 (468) +......-.|.+|- ++...... .+..++.+..-. ..... ....+-....+.. ..+...+. . T Consensus 219 -~~~~~~~l~G~PVv--~~~~~~~~-~~~~i~gd~~~~---~~~~~---~~~~i~~~~~~~~-~~~~~~~~~~~~~f~~~ 287 (324) T protein:vir:99 219 -YDRNSDTLDGLPVV--NLKSSNLK-RGELITGDFDKL---IYGIP---QLIEYKIDETAQL-STVKNEDGTPVNLFEQD 287 (324) T ss_pred -cCCCCccccceeEE--eecCCCCC-cceEEEEecccE---EEEEe---cCcEEEEeecccc-cccccccccchhhhhcC Confidence 22212222444431 11100000 111222211100 00000 0000000000000 00000000 0 Q ss_pred eEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceE Q lcl|NC_020871. 312 AHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 312 ~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y 360 (468) ...+++...=+.+=--++..+-.|.+...++. ++.. + T Consensus 288 ~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~----~~~~--------~ 324 (324) T protein:vir:99 288 MVALRATMHVALHIADDKAFAKLVPADKKTDS----VPGE--------V 324 (324) T ss_pred cEEEEEEEEEccEEecccceEEEEeccCCCCC----CCCC--------C Confidence 11111111100000001111111111111100 0001 1 No 37 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=97.73 E-value=2.4e-05 Score=45.91 Aligned_cols=303 Identities=12% Similarity=0.021 Sum_probs=145.4 Q ss_pred CCCcccchhh--cc-cChhhHH---HHHHHHhhcccc------------cCcccccCccccchhhhhhHhhhhhhccccc Q lcl|NC_020871. 1 MPKNNKEEEV--KE-VNLNSVQ---EDALKSFTTGYG------------ITPDTQTDAGALRREFLDDQISMLTWTENDL 62 (468) Q Consensus 1 ~~~~~~~~~~--~~-~n~~~~~---e~~~Ksf~agy~------------~~p~~~~~gaALr~esld~~i~~L~~~~~~f 62 (468) +-..++.... ++ ....... +++.+.+.-+.. ....+-.+++.|-++.+.+.|..+.... . T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~--~ 141 (390) T protein:vir:97 64 LEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDAR--L 141 (390) T ss_pred HHhcccccccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhh--h Confidence 1100000000 00 0010111 122222111100 0112234455566666666675544333 3 Q ss_pred cchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_020871. 63 TFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILT 142 (468) Q Consensus 63 ~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~ 142 (468) .+++.+...++.+....|.+.... .+...+++|++..+.+++.+.+....++-++..-.+|.-+ ++...+.+.... T Consensus 142 ~i~~~~~~~~~~~~~~~~~~~~~~--~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l~ds~~l~~~i~ 217 (390) T protein:vir:97 142 TVRDLIGSGRTDSALIEYVQETGF--VNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI--LSDAPQLASYMN 217 (390) T ss_pred hhHhhcceeeccCCceEEEEEecC--CcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH--HHhHHHHHHHHH Confidence 566666776776665667776633 3345789999999999999999999999999988888853 333357889999 Q ss_pred HHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCH Q lcl|NC_020871. 143 DDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPV 222 (468) Q Consensus 143 ~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~ 222 (468) +.-...+.+.++.++|+|+-. +-++.||.+........-........+.|..+-..+...|...+-++|++ T Consensus 218 ~~la~a~~~~~d~a~l~G~g~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~ 288 (390) T protein:vir:97 218 NRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINP 288 (390) T ss_pred HHHHHHHHHHHHHHHhhcCCC---------CccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcH Confidence 999999999999999999732 12457777655444433333444555666665556666677777899999 Q ss_pred HHHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccC-----CCCCCcceeEE Q lcl|NC_020871. 223 GVQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALP-----TAPQQAKVTAT 296 (468) Q Consensus 223 ~v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p-----~ap~~~~vtat 296 (468) .+.+.|.. .-+.+ |.+.++..+ . +.-.|.|..+...+..-.....+-. .......++-. T Consensus 289 ~~~~~L~~-lkd~~G~~l~~~~~~-~-------------~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 353 (390) T protein:vir:97 289 IDWAAIEL-AKDANNQYLIGNARG-T-------------LTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVE 353 (390) T ss_pred HHHHHHHH-hhcCCCceeecCccC-C-------------CCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEE Confidence 99998853 33221 333222111 0 0011223333222211000000000 00000000000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeee Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVT 337 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~ 337 (468) . ......|.. +...|++..--+-+---|...+-++.+ T Consensus 354 ~-~~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 354 I-GYVNDDFQR---NMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred E-eeccccccc---CcEEEEEEEeeccEEeccccEEEEEeC Confidence 0 000000110 011122211111111111112222222 No 38 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.60 E-value=4e-05 Score=44.71 Aligned_cols=316 Identities=11% Similarity=0.041 Sum_probs=147.3 Q ss_pred HHHh--hcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeecccccccccccccc Q lcl|NC_020871. 23 LKSF--TTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV 100 (468) Q Consensus 23 ~Ksf--~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~ 100 (468) |... .+...+ .+..+|+.+..+..++-+..|. +...+.+.+.+.+..+.-.+|.+... .....+++|++. T Consensus 1 m~~~~~~a~~~~--~t~~~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVA--LTGDFSAFLTPEQSQDYFAEIE---KTSIVQRIARKVPMGPTGISIPHWTG---AVSASWTGEAER 72 (330) T ss_pred Ccccccchhhcc--ccCCCcceechhHHHHHHHHHH---hccchhhhcceeeccCCceEEEEEcC---CcceeEecCCCc Confidence 1111 111111 1333466677776554443332 23356666666676665566777663 334568999999 Q ss_pred ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchh Q lcl|NC_020871. 101 APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLA 180 (468) Q Consensus 101 ~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~ 180 (468) .+.+++++.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+ |-+++|+. T Consensus 73 ~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~---------~~~~~g~~ 142 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKITTIFAESAEV-VRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDK---------PSAFKGYL 142 (330) T ss_pred cccccceeeEEEEeEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------CCcccccc Confidence 9999999999999999999888888842 2356778889999999999999999999999843 34567887 Q ss_pred hhcCccc-eeec---cCCCCC---HHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCC-ceEEeecCCCcce-eee Q lcl|NC_020871. 181 KLINQDN-VHDA---RGASLT---ESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-QTQLVRDNGNNVS-VGF 251 (468) Q Consensus 181 ~li~~~n-viDa---rG~~ls---~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~-~G~ 251 (468) +.+...+ +.+. .+...+ .+.|.++-..+.+.+...+-.+|++.+.+.+.. ..+. .|.+.+++.+... .+. T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~~ 221 (330) T protein:vir:77 143 AETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNT-AVDGNGRPLFVESTYTEQVGAI 221 (330) T ss_pred ccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH-HhccCCceeecCcccccccccc Confidence 7654332 2211 122223 233444444556678888889999999999854 3332 3444444322211 110 Q ss_pred eccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC-Cccccc Q lcl|NC_020871. 252 NIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASE 330 (468) Q Consensus 252 ~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~ 330 (468) .- ..|.|.-++..++. |+. +.++-..--.+.+++.+ .....| +-..|. T Consensus 222 ~~---------~~l~G~PV~~~~~~--------p~~-------------~~~~~~~~~~gd~s~~~-i~~~~~~~i~~~~ 270 (330) T protein:vir:77 222 RE---------GRILGRPTYVADNV--------VNG-------------TVGNRVVGVMGDFSQVI-WGQIGGLSFDVTD 270 (330) T ss_pred CC---------ceecceeeEEeccc--------cCC-------------CCCCccEEEEEecceEE-EEEecCcEEEEee Confidence 00 01222222222211 110 00000000112222222 112222 111111 Q ss_pred ceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccC-CeeEEecCCC-CCCCCc Q lcl|NC_020871. 331 VATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAEN-NVITFYDLND-SIPETV 401 (468) Q Consensus 331 ~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~-~t~tf~D~N~-~iPgt~ 401 (468) ....+......+.. .+. .+..|.+. --.|+-..|+...-... .....+.... ..|+-. T Consensus 271 e~~~~~~~~~~~~~---~~~--------~~~~f~~~--~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 271 QATLDFGEEQGGVW---VPK--------LISLWQHN--MVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred cceeeecccccccc---ccc--------ccchhhcC--cEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 11111000000000 000 11111110 01111111111110000 0001111110 012221 No 39 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=294 Identities=11% Similarity=0.059 Sum_probs=146.5 Q ss_pred cccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEE Q lcl|NC_020871. 34 PDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTV 113 (468) Q Consensus 34 p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~ 113 (468) =.+.+.|+.|-++.+.++|..+.... ..+.+.....+..+---+|.++. +.....+++|++..+.+++.+...+. T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~--s~i~~~~~~i~~~~~~~~~p~~~---~~~~a~wv~Eg~~~~~~~~~f~~v~l 75 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQ--SVLARLSMAEPQEFGEQQYMTLT---APPRGEVVGEGAQKSESTATFAPVTA 75 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhc--chhhhhcceeecCCCceEEEEEe---CCceeEEeecCcccccccceeeEEEE Confidence 33444567788888888885544333 34444444444443334455554 34456789999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhh--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcC-ccceee Q lcl|NC_020871. 114 NMKFASDTKNISIAAGLV--NNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLIN-QDNVHD 190 (468) Q Consensus 114 ~~k~l~~~~~vs~~~~lv--~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~-~~nviD 190 (468) ..+=++.--.+|.-+-.. ....+.+....+.....+++.++.++++|+.+- .|..+.|+.+.+- ..+++. T Consensus 76 ~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~-------~~~~~~gi~~~~~~~~~~~~ 148 (311) T protein:vir:81 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-------TGAALSGSPAKILDTTNIVE 148 (311) T ss_pred eeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCC-------CCcccccccccccccceeee Confidence 999888777777664332 234567788888888999999999999998432 3556788888664 446665 Q ss_pred ccCCCCC--HHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcC--CceEEeecCCCcceeeeeccceeecCCccccC Q lcl|NC_020871. 191 ARGASLT--ESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLH 266 (468) Q Consensus 191 arG~~ls--~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~--~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~ 266 (468) ..+.... ...|..+-..+..+.+.++-..||+.+...+.. ..+ ++.... +.... ...-.|. T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~-~~~~~-------------~~~~tl~ 213 (311) T protein:vir:81 149 LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLAT-QRDSQGRKLYP-ELGFG-------------TDVASFA 213 (311) T ss_pred ecccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHh-hhccCCCeeec-Ccccc-------------CCCceec Confidence 5444322 334555555556667788889999999998843 332 233221 11110 0111123 Q ss_pred CCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEE-----EEcccCCcccccceeeeee-ccC Q lcl|NC_020871. 267 GSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVV-----VSSDDAESIASEVATATVT-AKD 340 (468) Q Consensus 267 gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVt-----avn~~GES~aS~~vt~Tv~-a~~ 340 (468) |.-+..++ ..+..|.+..+...... ........--+|-..|.|.+. -+++++... -++. ... T Consensus 214 G~Pv~~~~-----~i~~~~~~~~~~~~~~~-~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~------~~~~~~~~ 281 (311) T protein:vir:81 214 GLNAAVSD-----TVRGGPEAVTASTGVYR-TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPD------GLGDLKRQ 281 (311) T ss_pred ceeEEecc-----cccccccccccccchhc-ccCCccEEEEEecccEEEEEeccceEEEeccCCCC------cchhhhhc Confidence 33333222 12222212222111110 011110000011111211111 011222100 0000 000 Q ss_pred cceEEE----EEeecCCCcccceEEEEeecCCC Q lcl|NC_020871. 341 DGVKLE----IELAPMYSSRPQFVSIYRKGAET 369 (468) Q Consensus 341 ~g~~lt----IT~~~~~ga~~~~y~IYR~~~~~ 369 (468) +.+.+. +.|.... |+.+..-+..... T Consensus 282 ~~v~~r~~~r~d~~v~~---~~a~~~l~~a~~~ 311 (311) T protein:vir:81 282 NQIAIRAEVVYGIGIMS---TDAFAVVRDADES 311 (311) T ss_pred CcEEEEEEEEeccEeec---ccceEEEEeeccC Confidence 111111 1111111 1223333333222 No 40 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=97.47 E-value=3.3e-06 Score=50.67 Aligned_cols=269 Identities=19% Similarity=0.192 Sum_probs=136.5 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccC-ccccchhhhhhHh-hhhhhccccccchhhhcccchh-hhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTD-AGALRREFLDDQI-SMLTWTENDLTFYKDIAKKPAT-STV 77 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~-gaALr~esld~~i-~~L~~~~~~f~~~~~i~k~~~~-stv 77 (468) ||... ..+ -+... +.-|-+..+...| ..|+..+ .++.+++=..++ .|= T Consensus 1 m~~~~-----~~~---------------------~TL~e~Akr~~~d~~~~~VIE~l~~~n---~IL~~lpf~e~n~gt~ 51 (328) T protein:vir:95 1 MAVKG-----LTA---------------------LTLADWGKRVDPNGKVDKIIELLGQTN---PILQDMPFVEGNLPTG 51 (328) T ss_pred CCccc-----ccc---------------------ccHHHHHhhhCcchhHHHHHHHHhccc---hhHhhcceeecccCCc Confidence 44321 001 11111 0012222222222 1233222 356666666665 566 Q ss_pred hccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc-chhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWA 156 (468) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~-~~~Dp~~~~~~~ai~~~~~~~e~a 156 (468) |.|+++..--+.+ |..=....+-+.++..|++..++-|..-..|.+...-.+ +..+-+++|.+.-|..+.+.++.. T Consensus 52 ~~~~v~~~LP~~~---fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~ 128 (328) T protein:vir:95 52 HRTTIRSGLPSAT---WRLLNYGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQT 128 (328) T ss_pred ceeeEeeccCCce---eeecCCccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 8899888655554 422233455677899999999999999999999776665 477888999999999999999999 Q ss_pred HhhcccccccCCCCCCCccccchhhhcC------ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHH Q lcl|NC_020871. 157 SFFGDSDLSDSPEPQAGLEFDGLAKLIN------QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVN 230 (468) Q Consensus 157 ~f~Gd~~l~~~~~~~~gleFDGl~~li~------~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~ 230 (468) +||||+..+ +-+||||.+... .+|+||+.|..-+.-.|+ ++.=+=....=+| |-+-++-|+- T Consensus 129 ~iyGdsa~~-------p~~F~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~----~v~~g~~~~~giy-PkG~~~Gl~~ 196 (328) T protein:vir:95 129 LFYGDSSVN-------PQQFMGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIW----LVVWGENTVHGIF-PKGKKAGIQM 196 (328) T ss_pred HhcCCccCC-------hhhhcchhhhcCccccccccceeecccCCCCceEEE----EEEEcCCeEEEec-ccccccCcee Confidence 999998874 349999999764 459999999875543332 1211122333355 8888888766 Q ss_pred hhcCCceEEeecCCCcc-e--------eeeecccee--ecCCccc---cC------------------------CCE-ee Q lcl|NC_020871. 231 QQLSKQTQLVRDNGNNV-S--------VGFNIQGFH--SARGFIK---LH------------------------GST-VM 271 (468) Q Consensus 231 ~~~~~qr~v~~~n~~~~-~--------~G~~v~~~~--s~~g~i~---l~------------------------gs~-i~ 271 (468) .-+..|.... ++++-. . .|..|...- ..-.+|+ |. |+. || T Consensus 197 ~d~g~~~~~~-~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y 275 (328) T protein:vir:95 197 EDKGQVTLED-ANGGKYEGYRTHYKWDNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFY 275 (328) T ss_pred eecCceeeec-CCCCeeeEEEEEEEeeeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceee Confidence 6666666652 222221 1 222222111 1122221 11 111 22 Q ss_pred cc---cccccccccccCCCCCCcceeEEecCCCC-CCcC-----cccce-eEEEEEE Q lcl|NC_020871. 272 EN---EQILDERILALPTAPQQAKVTATQEAGKK-GQFR-----AEDLA-AHEYKVV 318 (468) Q Consensus 272 ~~---~n~l~~~~~~~p~ap~~~~vtat~~~~~~-g~~~-----~~~~~-~y~YkVt 318 (468) -+ -++|+......-+. .++-+-..+.. -.|. --|+- .=.=+|+ T Consensus 276 ~n~~v~~~L~~q~~~~~n~----~~~~~~~~g~~~t~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 276 MNRTVGQALDLQSLEKTSL----AISVKETEGEWWTSFRGVPIRETDALLETEARVV 328 (328) T ss_pred hhHHHHHHHHHHHhcCcce----eeeeeccCCcceeEECCeEEEEEeeeecCccccC Confidence 21 11221111111100 00000001110 0010 00000 0000111 No 41 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.45 E-value=6.6e-05 Score=43.51 Aligned_cols=318 Identities=14% Similarity=0.020 Sum_probs=152.0 Q ss_pred hHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeee-----eccccc Q lcl|NC_020871. 17 SVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQ-----HGKVGH 91 (468) Q Consensus 17 ~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~-----hG~~g~ 91 (468) -.|=+-+++..+|........+.+++|-++.+..+|..+... ...+.+...+.+..+.-.+|.++.. |-+.+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~--~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~ 78 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQE--SSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGT 78 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHh--hchhhhhcceeeccCCceEEEEEecCccceeecccc Confidence 222334455666655444555567779999888888555443 3456666677777766566666552 223345 Q ss_pred cccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCC Q lcl|NC_020871. 92 TRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQ 171 (468) Q Consensus 92 ~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~ 171 (468) ..+++|++..+..++++.......+=++--..+|.-+ +.++..|.+....+.-...+.+.+|.++++|+..- T Consensus 79 ~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~------- 150 (338) T protein:vir:78 79 SNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEF-ARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPL------- 150 (338) T ss_pred cccccccccccccccceeEEEEEEEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC------- Confidence 7789999999999999999999998888777777632 23466888888999999999999999999999643 Q ss_pred CCccccchhhhcCcc--ceeeccCC--CCCHHHHhh-hhhhhhhccCceEEEecCHHHHhhHHHhh--cCCc-eEEeecC Q lcl|NC_020871. 172 AGLEFDGLAKLINQD--NVHDARGA--SLTESLLNQ-AAVMISKGYGTPTDAYMPVGVQADFVNQQ--LSKQ-TQLVRDN 243 (468) Q Consensus 172 ~gleFDGl~~li~~~--nviDarG~--~ls~~~l~~-~a~~i~~~fG~~td~~m~~~v~a~~~~~~--~~~q-r~v~~~n 243 (468) .+.++.|+.+..... ...|.-+. ....+.|.. .+.++......++-.+|++.+.+.|...- .+.+ |.+.++. T Consensus 151 ~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~ 230 (338) T protein:vir:78 151 TGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRI 230 (338) T ss_pred ccccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeeccc Confidence 234567777644332 22332222 222344443 44555556778888999999998885543 2332 3333322 Q ss_pred CCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEccc Q lcl|NC_020871. 244 GNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDD 323 (468) Q Consensus 244 ~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~ 323 (468) .... +.-.|.|--++.+++. +....++..+... ..-+..+.+.-.+...+.+ .++++ T Consensus 231 ~~~~-------------~~~~l~G~PV~~~~~i-----p~~~~~~~~~~~~--~~~gdfs~~~~~~~~~~~i---~~~~~ 287 (338) T protein:vir:78 231 NLAA-------------SAGDLLGLPVQFGKAV-----GGDLGAATDSKVR--VVGGDFSQLKYGFADEIRV---KMSDT 287 (338) T ss_pred ccCC-------------CCceeeeeeEEEcccc-----CccccccCCcccE--EEEEecceEEEEeecccEE---EEeec Confidence 1111 0111122222211111 0000000000000 0000000000000000000 01111 Q ss_pred C--Ccccccceeeeeecc-CcceEEE----EEeecCCCcccceEEEEeecCCCc Q lcl|NC_020871. 324 A--ESIASEVATATVTAK-DDGVKLE----IELAPMYSSRPQFVSIYRKGAETG 370 (468) Q Consensus 324 G--ES~aS~~vt~Tv~a~-~~g~~lt----IT~~~~~ga~~~~y~IYR~~~~~G 370 (468) + .-...+ ....+... .+-+.+. +-|..+... -..+|=.-+++.- T Consensus 288 ~~~~~~~~~-~~~~~~~~~~~~~~~r~~~r~d~~v~~~~--a~~~l~~~~~~~~ 338 (338) T protein:vir:78 288 ATLTDNTSP-TPQTVSMWQTNQIAILIEVTFGWLLGDKQ--AFVKFVDDEDPDA 338 (338) T ss_pred ccccccccc-cccchhhhhcCcEEEEEEEEeccEeeccc--ceEEEecccCCCC Confidence 0 000000 00000000 0000000 111111000 0111111111100 No 42 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.44 E-value=6.7e-05 Score=43.48 Aligned_cols=306 Identities=12% Similarity=0.088 Sum_probs=142.8 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc----------ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY----------GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAK 70 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy----------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k 70 (468) ....++.... ...+..-+++.|.+...- .+...+..+|..+..+ +...|..+.. ....+++.++. T Consensus 66 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-~~~~ii~~~~--~~~~l~~~~~~ 140 (385) T protein:vir:18 66 SGAENPGEKK--SFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPM-QIPGIIMPGL--RRLTIRDLLAQ 140 (385) T ss_pred ccccccchhh--hhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecch-hhhHHHHHhh--hccchhhhcce Confidence 1111111100 011122235555543210 1122233334445444 5555544333 33456776777 Q ss_pred cchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHH Q lcl|NC_020871. 71 KPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIA 150 (468) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~ 150 (468) .++.+.-.+|.+... ..+...+++|++..+..++.+.+....++=++....+|.- +.+...+.+....+.-...+. T Consensus 141 ~~~~~~~~~~~~~~~--~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e--ll~d~~~l~~~i~~~la~a~~ 216 (385) T protein:vir:18 141 GRTSSNALEYVREEV--FTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ--VMDDAPMLQSYINNRLMYGLA 216 (385) T ss_pred ecccCcceEEEEEec--CCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH--HHhhHHHHHHHHHHHHHHHHH Confidence 666654445666553 2334568999999999999999999999999998888875 344445677888888889999 Q ss_pred HHHHHHHhhcccccccCCCCCCCccccchhhhcCcc-ceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHH Q lcl|NC_020871. 151 KTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFV 229 (468) Q Consensus 151 ~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~ 229 (468) ..++.++++|+-. |-.+.||.+..... ...... .....+.|-.+...+...++..+-++||+.+.+.+. T Consensus 217 ~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~ 286 (385) T protein:vir:18 217 LKEEGQLLNGDGT---------GDNLEGLNKVATAYDTSLNAT-GDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIA 286 (385) T ss_pred HHHHHHHHhccCC---------CCccccccccccccccccccc-ccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHH Confidence 9999999999732 23467777654322 222222 224556666666666778888899999999998885 Q ss_pred Hhh-cCCceEEeecCCC--cceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcC Q lcl|NC_020871. 230 NQQ-LSKQTQLVRDNGN--NVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFR 306 (468) Q Consensus 230 ~~~-~~~qr~v~~~n~~--~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~ 306 (468) ..- -.++..+..+..+ ..-.|.+| +++..-. -+..++.+..- ...... ...++-.........|. T Consensus 287 ~lkd~~G~~l~~~~~~~~~~~l~G~pV--~~~~~~p---~~~~~~gd~~~---~~~~~~----~~~~~v~~~~~~~~~~~ 354 (385) T protein:vir:18 287 LLKDNEGRYIFGGPQAFTSNIMWGLPV--VPTKAQA---AGTFTVGGFDM---ASQVWD----RMDATVEVSREDRDNFV 354 (385) T ss_pred HhhcCCCceeccCcccCCCceecceee--EEcCcCC---CCcEEEeeccc---EEEEEE----ecceEEEEeccccchhh Confidence 432 1223332211111 11122222 1111000 01111111100 000000 00000000000000000 Q ss_pred cccceeEEEEEEEEcccCCcccccceeeeeeccC Q lcl|NC_020871. 307 AEDLAAHEYKVVVSSDDAESIASEVATATVTAKD 340 (468) Q Consensus 307 ~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~ 340 (468) .....|++..--+-.---|...+-+++++.. T Consensus 355 ---~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 355 ---KNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred ---cCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 0111222211111110111112222222111 No 43 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.44 E-value=6.7e-05 Score=43.48 Aligned_cols=306 Identities=12% Similarity=0.088 Sum_probs=142.8 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc----------ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY----------GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAK 70 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy----------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k 70 (468) ....++.... ...+..-+++.|.+...- .+...+..+|..+..+ +...|..+.. ....+++.++. T Consensus 66 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-~~~~ii~~~~--~~~~l~~~~~~ 140 (385) T protein:vir:19 66 SGAENPGEKK--SFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPM-QIPGIIMPGL--RRLTIRDLLAQ 140 (385) T ss_pred ccccccchhh--hhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecch-hhhHHHHHhh--hccchhhhcce Confidence 1111111100 011122235555543210 1122233334445444 5555544333 33456776777 Q ss_pred cchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHH Q lcl|NC_020871. 71 KPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIA 150 (468) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~ 150 (468) .++.+.-.+|.+... ..+...+++|++..+..++.+.+....++=++....+|.- +.+...+.+....+.-...+. T Consensus 141 ~~~~~~~~~~~~~~~--~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e--ll~d~~~l~~~i~~~la~a~~ 216 (385) T protein:vir:19 141 GRTSSNALEYVREEV--FTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ--VMDDAPMLQSYINNRLMYGLA 216 (385) T ss_pred ecccCcceEEEEEec--CCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH--HHhhHHHHHHHHHHHHHHHHH Confidence 666654445666553 2334568999999999999999999999999998888875 344445677888888889999 Q ss_pred HHHHHHHhhcccccccCCCCCCCccccchhhhcCcc-ceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHH Q lcl|NC_020871. 151 KTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFV 229 (468) Q Consensus 151 ~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~ 229 (468) ..++.++++|+-. |-.+.||.+..... ...... .....+.|-.+...+...++..+-++||+.+.+.+. T Consensus 217 ~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~ 286 (385) T protein:vir:19 217 LKEEGQLLNGDGT---------GDNLEGLNKVATAYDTSLNAT-GDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIA 286 (385) T ss_pred HHHHHHHHhccCC---------CCccccccccccccccccccc-ccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHH Confidence 9999999999732 23467777654322 222222 224556666666666778888899999999998885 Q ss_pred Hhh-cCCceEEeecCCC--cceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcC Q lcl|NC_020871. 230 NQQ-LSKQTQLVRDNGN--NVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFR 306 (468) Q Consensus 230 ~~~-~~~qr~v~~~n~~--~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~ 306 (468) ..- -.++..+..+..+ ..-.|.+| +++..-. -+..++.+..- ...... ...++-.........|. T Consensus 287 ~lkd~~G~~l~~~~~~~~~~~l~G~pV--~~~~~~p---~~~~~~gd~~~---~~~~~~----~~~~~v~~~~~~~~~~~ 354 (385) T protein:vir:19 287 LLKDNEGRYIFGGPQAFTSNIMWGLPV--VPTKAQA---AGTFTVGGFDM---ASQVWD----RMDATVEVSREDRDNFV 354 (385) T ss_pred HhhcCCCceeccCcccCCCceecceee--EEcCcCC---CCcEEEeeccc---EEEEEE----ecceEEEEeccccchhh Confidence 432 1223332211111 11122222 1111000 01111111100 000000 00000000000000000 Q ss_pred cccceeEEEEEEEEcccCCcccccceeeeeeccC Q lcl|NC_020871. 307 AEDLAAHEYKVVVSSDDAESIASEVATATVTAKD 340 (468) Q Consensus 307 ~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~ 340 (468) .....|++..--+-.---|...+-+++++.. T Consensus 355 ---~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 355 ---KNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred ---cCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 0111222211111110111112222222111 No 44 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.38 E-value=8e-05 Score=43.05 Aligned_cols=291 Identities=9% Similarity=-0.001 Sum_probs=141.0 Q ss_pred HHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhh-hccceeeeeccccccccccccc Q lcl|NC_020871. 21 DALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTV-AKYDVYMQHGKVGHTRFTREIG 99 (468) Q Consensus 21 ~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv-~ey~~~~~hG~~g~~~fv~E~g 99 (468) .-.+.|.+. +..+.+++++|-++.+.++|..+..... .+.+...+.+..+.- ..+.+.. +.....+++|++ T Consensus 1 m~~~~~~~~---~~~~t~~~~~lvP~~~~~~ii~~~~~~s--~l~~~~~~~~~~~~~~~~~~~~~---~~~~a~~v~Eg~ 72 (297) T protein:vir:95 1 MTVQTFNPE---NVLVSQKKDGTLHKEFTDIIMKEVAQNS--LVMQLGQYQEMEGEQEKTVYVQT---DGISAYWVNETE 72 (297) T ss_pred CCccccccc---cccccCCCcceechhHHHHHHHHHHhhc--hhhhhcceeecCCCccEEEEEEc---CCceeEEeecCc Confidence 111112111 1223345667888888777765544333 455555555554322 2233333 222456899999 Q ss_pred cccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccch Q lcl|NC_020871. 100 VAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGL 179 (468) Q Consensus 100 ~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl 179 (468) ..+..++++.......+=++-.-.+|..+ +.++..|.+....+.--..+.+.+|.++++|+.+- .+ .|+ T Consensus 73 ~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~-----~~-----~gi 141 (297) T protein:vir:95 73 KIKTDKPEVVPVTLKAHKLGIILVTSREA-LNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTP-----FA-----NSV 141 (297) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCc-----cc-----ccc Confidence 99999999999999999999888888732 23466788888888889999999999999998431 11 345 Q ss_pred hhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceee Q lcl|NC_020871. 180 AKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHS 258 (468) Q Consensus 180 ~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s 258 (468) .+.+...+.... ..++.+.|-++...+..++...+-+.|++.+.+.|.. ..+. -|.+.++..+ .-.|.++- .+ T Consensus 142 ~~~~~~~~~~~~--~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~-l~d~~G~~i~~~~~~-~l~G~Pv~--~~ 215 (297) T protein:vir:95 142 AKAAKDANKVIG--GPINYDNILKLQDALYDADVEPNAFVSKIQNRSALRE-ARDGNKVSIYDKAAN-TIDGITTV--DL 215 (297) T ss_pred cccccccceecc--cccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH-hhccCCceeecCCCC-cccceeeE--ee Confidence 555544444332 3356555555555566777788889999999999954 3333 2333333322 22444441 11 Q ss_pred cCCccccCCCEeecccccccccccc---cCCCCCCcceeEEecCCCCCC-cCcccceeEEEEEEEEcccCCcccccceee Q lcl|NC_020871. 259 ARGFIKLHGSTVMENEQILDERILA---LPTAPQQAKVTATQEAGKKGQ-FRAEDLAAHEYKVVVSSDDAESIASEVATA 334 (468) Q Consensus 259 ~~g~i~l~gs~i~~~~n~l~~~~~~---~p~ap~~~~vtat~~~~~~g~-~~~~~~~~y~YkVtavn~~GES~aS~~vt~ 334 (468) ...... .+..++.+..-+.--... .-.... .+........++ ++--....-.+|+...-+-+---|...+.. T Consensus 216 ~~~~~~-~~~~~~gd~s~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l 291 (297) T protein:vir:95 216 KSARFE-KGDLLAGDFDNLIYGVPYNITYKISEE---GQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKL 291 (297) T ss_pred cCCCCC-CceEEEEecccEEEEEecCeEEEEeec---cccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEE Confidence 111100 112222221110000000 000000 000000000000 000000112222222211110001111111 Q ss_pred eeeccCcceEEEEEeecC Q lcl|NC_020871. 335 TVTAKDDGVKLEIELAPM 352 (468) Q Consensus 335 Tv~a~~~g~~ltIT~~~~ 352 (468) +.+. ++ T Consensus 292 ~~at------------~~ 297 (297) T protein:vir:95 292 TPAE------------RV 297 (297) T ss_pred eecC------------CC Confidence 1110 00 No 45 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=97.27 E-value=4.6e-05 Score=44.36 Aligned_cols=290 Identities=13% Similarity=0.037 Sum_probs=134.4 Q ss_pred hhcccccCcc------c-ccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeecccccccccccc Q lcl|NC_020871. 26 FTTGYGITPD------T-QTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREI 98 (468) Q Consensus 26 f~agy~~~p~------~-~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~ 98 (468) |-+|...+++ + -++++.+-++.+..++-.+.. +...+.+.+...+..+.-.+|.+... .....+++|+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~--~~s~l~~~~~~~~~~~~~~~~p~~~~---~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAE--KTSIVQQFAQKVPMGTTGQKIPHWIG---DVSAQWIGEG 75 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHH--hccchhhhcceeeccCCceEEEEEeC---CcceEEecCC Confidence 4444333321 1 122333444444455533322 33356666666666654455666652 3345699999 Q ss_pred ccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccc Q lcl|NC_020871. 99 GVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDG 178 (468) Q Consensus 99 g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG 178 (468) +..+.+++++.+....++=++..-.+|+-+-. ++..|.+....+.-...+++.+|.++|.|+.+- .++.+.| T Consensus 76 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~-------~~~~~~~ 147 (320) T protein:vir:10 76 DMKPITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYLGTMRTKVATAFAMAFDSAALNGTDSP-------FPTYLAQ 147 (320) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-------CCccccc Confidence 99999999999999999999988888876433 455688888888889999999999999999542 2233334 Q ss_pred hhhhcCccceeeccCCC---C--CHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCc-eEEeecCCCc------ Q lcl|NC_020871. 179 LAKLINQDNVHDARGAS---L--TESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQLVRDNGNN------ 246 (468) Q Consensus 179 l~~li~~~nviDarG~~---l--s~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~------ 246 (468) +.+ ..++....|.- + -.+.+-.+...+..++....-+.||+.+.+.|. ..-+.+ |.+.++.... T Consensus 148 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~-~lkd~~G~~l~~~~~~~~~~~~~ 223 (320) T protein:vir:10 148 TTK---SVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILN-GAKDKNGRPLFIESTYTDENSPF 223 (320) T ss_pred ccc---cccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHH-HhhccCCceeeccccccCccccc Confidence 333 22222222222 1 123344444555566777888999999999995 333321 2222221111 Q ss_pred ---ceeeeeccceeecCCccccCCCEe-ecccccc-c--ccccccCCCCCCcceeEEecCCCC-CCc--C-cccceeEEE Q lcl|NC_020871. 247 ---VSVGFNIQGFHSARGFIKLHGSTV-MENEQIL-D--ERILALPTAPQQAKVTATQEAGKK-GQF--R-AEDLAAHEY 315 (468) Q Consensus 247 ---~~~G~~v~~~~s~~g~i~l~gs~i-~~~~n~l-~--~~~~~~p~ap~~~~vtat~~~~~~-g~~--~-~~~~~~y~Y 315 (468) .-.|.+| +.+.. +.-....+ +.+..-+ . .....+...........+...+.. .-| + -.-....++ T Consensus 224 ~~~~i~g~pv--~~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~ 299 (320) T protein:vir:10 224 RAGRIVSRPT--ILSDH--VADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEY 299 (320) T ss_pred cCceeeeeee--EecCC--CCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEee Confidence 1122222 11111 00000011 1111100 0 000000000000000000000000 000 0 001112333 Q ss_pred EEEEEcccCCcccccceeeeeec Q lcl|NC_020871. 316 KVVVSSDDAESIASEVATATVTA 338 (468) Q Consensus 316 kVtavn~~GES~aS~~vt~Tv~a 338 (468) -+..++...-.....+ + ++.+ T Consensus 300 d~~v~~~~a~~~l~~~-~-ap~~ 320 (320) T protein:vir:10 300 AFHNNDKDAFVKLTNV-V-TPDA 320 (320) T ss_pred ccEEecccceEEEEec-c-CCCC Confidence 3333333221111111 1 1111 No 46 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=97.27 E-value=0.00011 Score=42.28 Aligned_cols=311 Identities=10% Similarity=0.022 Sum_probs=140.0 Q ss_pred CCCcccchh-hcccCh-hhHHHHHHHHhh--------------------cc-cccCcccccCccccchhhhhhHhhhhhh Q lcl|NC_020871. 1 MPKNNKEEE-VKEVNL-NSVQEDALKSFT--------------------TG-YGITPDTQTDAGALRREFLDDQISMLTW 57 (468) Q Consensus 1 ~~~~~~~~~-~~~~n~-~~~~e~~~Ksf~--------------------ag-y~~~p~~~~~gaALr~esld~~i~~L~~ 57 (468) .+..+.... .++... .... +..|++. .+ ........+++..+..+.+...+..+.. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~ 149 (419) T protein:vir:94 71 TPLTPAEAGTFRSLAQRFADS-DGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPD 149 (419) T ss_pred ccccccccccccchhhhhhhH-HHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHh Confidence 111111000 000000 0000 1111111 00 1111123345666677777776655543 Q ss_pred ccccccchhhhcccchhhhhhccceeeee-----ccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc Q lcl|NC_020871. 58 TENDLTFYKDIAKKPATSTVAKYDVYMQH-----GKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN 132 (468) Q Consensus 58 ~~~~f~~~~~i~k~~~~stv~ey~~~~~h-----G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~ 132 (468) ..- .+.+.+...+..+-...|.+.+.+ ++.+.+.+++|++..+.+++.+.+.+..++-++.--.+|.. +.+ T Consensus 150 ~~~--~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e--ll~ 225 (419) T protein:vir:94 150 LPL--LVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQ--AAD 225 (419) T ss_pred hhh--hhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHH--HHH Confidence 332 223333334444444455555432 22334678999999999999999999999999988888875 344 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc--ceeeccCCC---CCHHHHhhhhhh Q lcl|NC_020871. 133 NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD--NVHDARGAS---LTESLLNQAAVM 207 (468) Q Consensus 133 ~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~--nviDarG~~---ls~~~l~~~a~~ 207 (468) ...+.+....+.--..++..++.++++||-. + +.-||.+.-... ++-...... ...+.|.++-.. T Consensus 226 d~~~l~~~i~~~la~a~~~~~d~aii~G~G~-----~-----~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~ 295 (419) T protein:vir:94 226 DNSQLMGYIQGRLTYGLRFLRDRQLLNGNGS-----T-----EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTV 295 (419) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-----c-----cccceecccccccccccccccccccchhHHHHHHHHHh Confidence 4467788888889999999999999999854 1 233555532211 111111111 223444444444 Q ss_pred hhhccCceEEEecCHHHHhhHHHhhcC-CceEEeecCCCcce----eeeeccceeecCCccccCCCEeeccccccccccc Q lcl|NC_020871. 208 ISKGYGTPTDAYMPVGVQADFVNQQLS-KQTQLVRDNGNNVS----VGFNIQGFHSARGFIKLHGSTVMENEQILDERIL 282 (468) Q Consensus 208 i~~~fG~~td~~m~~~v~a~~~~~~~~-~qr~v~~~n~~~~~----~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~ 282 (468) +...+..++-.+|++.+...+...--. +.+.+.+++..+.. .|.+| +++.. ++ .+..++.+..-. .. T Consensus 296 ~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV--~~~~~--~~-~~~~~~gd~~~~---~~ 367 (419) T protein:vir:94 296 AEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNV--VSTVA--IA-QGTALVGGFRQG---AT 367 (419) T ss_pred hhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceee--EEcCC--CC-CccEEEeeccce---EE Confidence 445666777899999999998655443 34445555433221 22222 11110 00 011112111100 00 Q ss_pred ccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCc Q lcl|NC_020871. 283 ALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDD 341 (468) Q Consensus 283 ~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~ 341 (468) .....+ ++-........-|. .....|++..--+.+=-.|+..+-++.++.++ T Consensus 368 ~~~~~~----~~v~~~~~~~~~~~---~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 368 LWSRQG----ITVLMTDSHADFFT---ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEEecc----eEEEEeccccchhh---cCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 000000 00000000000011 01222333222222211122233333333222 No 47 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=97.23 E-value=4.4e-05 Score=44.45 Aligned_cols=290 Identities=17% Similarity=0.156 Sum_probs=137.2 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchh-hhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPAT-STVAK 79 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~-stv~e 79 (468) ||.-. +++-+..+ ++|-+ +++... +-..|. |+..+ .++.+++=...+ .|=|. T Consensus 1 m~~~~-----~~a~TL~e---~AKr~------~~d~~~---~~IIE~-------l~~tn---~IL~~lpf~e~N~~tg~~ 53 (330) T protein:vir:10 1 MATLS-----TNNPTMAD---VAKRL------DPNGKV---DIIVEM-------LNQTN---PVLQDMTAIEGNLPTGHR 53 (330) T ss_pred CCcCC-----CCcccHHH---HHhhc------CcchhH---HHHHHH-------HhcCc---hHHhhcchhhccCCcccc Confidence 55321 11112222 11211 111111 111222 22222 123333333333 23344 Q ss_pred cceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc-chhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020871. 80 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWASF 158 (468) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~-~~~Dp~~~~~~~ai~~~~~~~e~a~f 158 (468) +.+++.-.+.+ |-.=.....-+.++..|++..++.|..-..|-+...-.+ +..+-+++|.+.-|..+.++++..+| T Consensus 54 t~vrt~LP~~~---fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~i 130 (330) T protein:vir:10 54 TSVRTGLPTPT---WRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLF 130 (330) T ss_pred eeEEeecCCch---hhhcCCccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44445444332 422222334456999999999999999999988765554 67888899999999999999999999 Q ss_pred hcccccccCCCCCCCccccchhhhcC------ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 159 FGDSDLSDSPEPQAGLEFDGLAKLIN------QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 159 ~Gd~~l~~~~~~~~gleFDGl~~li~------~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) |||++.+ +-+||||.+... ++|+||+.|..-..-.|+- +.=+=..+.=+| |-+-|+-|+-.- T Consensus 131 yGD~a~~-------p~~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~~----v~wg~~~~~giy-PkG~kaGl~~~d 198 (330) T protein:vir:10 131 YGNDGIA-------PAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWL----VVWGPNTCHSIY-PKGSKAGLSVED 198 (330) T ss_pred cCCCCCC-------hhhccchhhhcCCCCCCchhheeeccccccCceEEEE----EEEcCCeEEEEc-ccCccccceeee Confidence 9998874 359999999774 4699999998765433331 111112222344 888888886555 Q ss_pred cCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCccccee Q lcl|NC_020871. 233 LSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAA 312 (468) Q Consensus 233 ~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~ 312 (468) +..|+..-.+..|+.=-| -...|...-|. -..- T Consensus 199 ~g~~~~~~~dg~gg~y~~-~~~~~~w~~Gl----------------------------------------------~i~d 231 (330) T protein:vir:10 199 KGQVTIENADGNGGRMEG-YRTHYKWDIGL----------------------------------------------TLRD 231 (330) T ss_pred ccceeeecccCCCCceeE-Eeeeeeeeeee----------------------------------------------EEeC Confidence 665555432222211111 11111111010 0112 Q ss_pred EEEEEEEEcccCCcccccce---------e--eeeecc---------------------CcceEEEEEeecCCCcccceE Q lcl|NC_020871. 313 HEYKVVVSSDDAESIASEVA---------T--ATVTAK---------------------DDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 313 y~YkVtavn~~GES~aS~~v---------t--~Tv~a~---------------------~~g~~ltIT~~~~~ga~~~~y 360 (468) +.|.|..||=+.-.+..+.- . -.++.- .+..++.+|+....| ..+ T Consensus 232 ~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g---~~~ 308 (330) T protein:vir:10 232 WRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG---ERV 308 (330) T ss_pred cccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCC---eee Confidence 45555555533211111100 0 000100 122334555544333 223 Q ss_pred EEEeecCCCceeEEEEEEecccccCCeeEEecCCCCCCCCcccee Q lcl|NC_020871. 361 SIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPETVDVFV 405 (468) Q Consensus 361 ~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N~~iPgt~~~fv 405 (468) .-||. -.++...- |=+|-..+| T Consensus 309 t~~~g----ipir~~Da-------------------il~tE~~vv 330 (330) T protein:vir:10 309 MTFDG----IPVQRTDA-------------------LLNTESRVV 330 (330) T ss_pred EEECC----eEEEEEee-------------------eecCccccC Confidence 33331 11111111 112222222 No 48 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.23 E-value=6e-05 Score=43.73 Aligned_cols=296 Identities=13% Similarity=0.044 Sum_probs=132.5 Q ss_pred cccChhhHHH----HHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeee Q lcl|NC_020871. 11 KEVNLNSVQE----DALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQH 86 (468) Q Consensus 11 ~~~n~~~~~e----~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~h 86 (468) =..|++..-| +-.|+++++... ++.|-++.+..+|..... +.-.+.+...+.+..+.-.+|.+.. T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~-------~g~~ip~~~~~~ii~~~~--~~s~i~~~~~~~~~~~~~~~~p~~~-- 69 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSM-------FEGYLEPEQAQDYFAEAE--KISIVQQFAQKIPMGTTGQKIPHWT-- 69 (326) T ss_pred CCCCccchhhhcCcchhhheeccccC-------CcceechhhHHHHHHHHH--hcchhhhhcceeeccCCceEEEEEe-- Confidence 1122222211 124556554322 333455555555533322 2234555555555554444566655 Q ss_pred ccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_020871. 87 GKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSD 166 (468) Q Consensus 87 G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~ 166 (468) +.....|++|++..+.+++.+......++=++..-.+|.-+ +.++..|.+....+.-...++..+|.++|+|+.+- T Consensus 70 -~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~el-l~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~-- 145 (326) T protein:vir:42 70 -GDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDNAAINGTDSP-- 145 (326) T ss_pred -CCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-- Confidence 33445699999999999999999999999999998888854 34567888898899999999999999999998541 Q ss_pred CCCCCCCccccchhhhcCccceeeccCCC-----CCHHH-HhhhhhhhhhccCceEEEecCHHHHhhHHHhhcC--CceE Q lcl|NC_020871. 167 SPEPQAGLEFDGLAKLINQDNVHDARGAS-----LTESL-LNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS--KQTQ 238 (468) Q Consensus 167 ~~~~~~gleFDGl~~li~~~nviDarG~~-----ls~~~-l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~--~qr~ 238 (468) .| .|+.+...........+.. ...+. +-.+.......+...+...|++.+.+.|.. .-+ ++.. T Consensus 146 ~p--------~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l 216 (326) T protein:vir:42 146 FP--------TFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNG-AKDKSGRPL 216 (326) T ss_pred cc--------ccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHH-hhccCCcee Confidence 11 2344333222222222221 22222 222333344556666778899999999953 332 3333 Q ss_pred EeecCCCcc--------eeeeeccceeecCCccccCCCEe--ecccccc---cccccccCCCCCCcceeEEecCCC---- Q lcl|NC_020871. 239 LVRDNGNNV--------SVGFNIQGFHSARGFIKLHGSTV--MENEQIL---DERILALPTAPQQAKVTATQEAGK---- 301 (468) Q Consensus 239 v~~~n~~~~--------~~G~~v~~~~s~~g~i~l~gs~i--~~~~n~l---~~~~~~~p~ap~~~~vtat~~~~~---- 301 (468) +++...++. -.|.++ +++. .+. .+..+ +.+..-+ +.....+-..-.. ...-.....+ T Consensus 217 ~~~~~~~~~~~~~~~~~l~G~pv--~~~~--~~~-~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~-~~~~~~~~~~~~~~ 290 (326) T protein:vir:42 217 FIESTYTEENSPFRLGRIVARPT--ILSD--HVA-SGTVVGYQGDFRQLVWGQVGGLSFDVTDQA-TLNLGTPQAPNFVS 290 (326) T ss_pred eccccccCccccccCceeeeeeE--EEcC--CCC-CCceEEEEeecceEEEEEecceEEEEeecc-eeeecccccccchh Confidence 333221111 123333 1111 111 11111 1111100 0000000000000 0000000000 Q ss_pred -CCCcCcccceeEEEEEEEEcccCCcccccceeeeeec Q lcl|NC_020871. 302 -KGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTA 338 (468) Q Consensus 302 -~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a 338 (468) ..+.--.-....++-+..++...-......+.+- + T Consensus 291 ~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~--~ 326 (326) T protein:vir:42 291 LWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATE--A 326 (326) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEeeccccC--C Confidence 0000000001122222222221110000000000 0 No 49 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.18 E-value=0.00014 Score=41.73 Aligned_cols=308 Identities=12% Similarity=0.067 Sum_probs=136.8 Q ss_pred CCCcccchhh---cccChhhHHHHHHHHhhcccc---------------------cCcccccCccccchhhhhhHhhhhh Q lcl|NC_020871. 1 MPKNNKEEEV---KEVNLNSVQEDALKSFTTGYG---------------------ITPDTQTDAGALRREFLDDQISMLT 56 (468) Q Consensus 1 ~~~~~~~~~~---~~~n~~~~~e~~~Ksf~agy~---------------------~~p~~~~~gaALr~esld~~i~~L~ 56 (468) .....+.... .+.........+.|++..+-+ ++..+...|+.|-++.+..+|..+. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l 156 (435) T protein:vir:80 77 TASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELL 156 (435) T ss_pred ccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHH Confidence 0000000000 111111222334444432210 2223334567788888888876544 Q ss_pred hccccccchhhhccc--chhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc- Q lcl|NC_020871. 57 WTENDLTFYKDIAKK--PATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNN- 133 (468) Q Consensus 57 ~~~~~f~~~~~i~k~--~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~- 133 (468) ... ..+..+..+ +..+--.+|.++. +.....|++|++..+..++.+.+....++=++....+|.-+ +.++ T Consensus 157 ~~~---~~i~~~~~~~v~~~~~~~~~p~~~---~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~ 229 (435) T protein:vir:80 157 RPK---SVVRKLGARTLPLSNGNITIPRLK---GGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDL-IKYAG 229 (435) T ss_pred hhh---chhhhccceeeecCCCceEEEEEe---CCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHH-HHhhc Confidence 322 222233222 2223334566665 33346689999999999999999999999888888888765 3333 Q ss_pred h-hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCC-CCC--HHHHhhhhhhhh Q lcl|NC_020871. 134 I-QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGA-SLT--ESLLNQAAVMIS 209 (468) Q Consensus 134 ~-~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~-~ls--~~~l~~~a~~i~ 209 (468) + .+.+....+.-...+...+|.++|+|+.. +-+..||.+.....++...-.. ... ...+.++-..+. T Consensus 230 ~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 300 (435) T protein:vir:80 230 VNPNVDQIVVGDLTAAIGAREDKAFIRDDGT---------ANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALE 300 (435) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------CCcccceeecccccceeecccccchhhHHHHHHHHHHHhh Confidence 2 35678889999999999999999999732 1234577665555555444332 222 112223222222 Q ss_pred hc--cCceEEEecCHHHHhhHHHhh-cCCceEEeecCCCcceeeeeccceeecCCcccc----CCCEe-ecccc--cc-c Q lcl|NC_020871. 210 KG--YGTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKL----HGSTV-MENEQ--IL-D 278 (468) Q Consensus 210 ~~--fG~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l----~gs~i-~~~~n--~l-~ 278 (468) .+ +-...-..|++.+.+.+...- -.+++. .+...++.-.|.+| +.+..-+..+ ....| +.+.. ++ + T Consensus 301 ~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l-~~~~~~~~l~G~pv--~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~ 377 (435) T protein:vir:80 301 NADANLTQPGWIMAPRTFRFLEGLRDGNGNKV-YPELANGMLKGYPV--GKTTQVPINLGEAGKESEIYFTDFGDVFIGE 377 (435) T ss_pred ccccccccCEEEEcHHHHHHHHhhhccCCcee-ccCCCCCeEeeeee--EEeccccccccCCCCcceEEEEEcccEEEEe Confidence 22 222334679999998884422 233443 33333333456554 1111110000 11112 22111 00 0 Q ss_pred ccccccCCCCCCcceeEE-ecCCCCCCcCcccceeEEEE--------EEEEcccCCcc Q lcl|NC_020871. 279 ERILALPTAPQQAKVTAT-QEAGKKGQFRAEDLAAHEYK--------VVVSSDDAESI 327 (468) Q Consensus 279 ~~~~~~p~ap~~~~vtat-~~~~~~g~~~~~~~~~y~Yk--------Vtavn~~GES~ 327 (468) .....+-......-.... ..-+.--...-.-....++- |+.++.-+..+ T Consensus 378 ~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 378 EETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred ecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 000000000000000000 00000000000000122221 22222222222 No 50 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.17 E-value=0.00014 Score=41.68 Aligned_cols=281 Identities=10% Similarity=-0.032 Sum_probs=142.3 Q ss_pred CcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEE Q lcl|NC_020871. 33 TPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKT 112 (468) Q Consensus 33 ~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~ 112 (468) =.++.++++.|-++.+..+|-...... ..+.+.....+..+.--.|.++. +.+...+++|++..+.+++.+.+.. T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~--s~i~~l~~~~~~~~~~~~~p~~~---~~~~a~wv~Eg~~~~~s~~~f~~v~ 75 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGH--SSIAKLSPQKPIPFNGQREFVFD---FDSDIDIVAENGKKTHGGVSLDPVT 75 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhh--hhhhhhcceeeccCCceEEEEEe---cCcceEEeeCCcccccccccceeeE Confidence 122223344455555555553322222 22322223334443323455555 3346679999999999999999999 Q ss_pred EEEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcC-cccee Q lcl|NC_020871. 113 VNMKFASDTKNISIAAGLVN--NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLIN-QDNVH 189 (468) Q Consensus 113 ~~~k~l~~~~~vs~~~~lv~--~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~-~~nvi 189 (468) ...+=++---.+|.-+-... ...|.+....+.-...+++.++.++|+|+.+- ++.+....|....-. ..++. T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~-----~g~~~~~~~~~~~~~~~~~~~ 150 (300) T protein:vir:95 76 IVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPR-----TKQASTIIGDNCFDKKVTQTV 150 (300) T ss_pred eeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-----CCCCcccccccccccccceee Confidence 99988888888888765443 45777888888899999999999999997432 223333333322211 12444 Q ss_pred eccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cCCceEEeecCC----CcceeeeeccceeecCCccc Q lcl|NC_020871. 190 DARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNG----NNVSVGFNIQGFHSARGFIK 264 (468) Q Consensus 190 DarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~----~~~~~G~~v~~~~s~~g~i~ 264 (468) ...|..+ -+.|..+...+...++.++-..||+.+.+.+...- -+++... ++.. ..--.|.+| +++..-... T Consensus 151 ~~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~-~~~~~~~~~~~l~G~Pv--~~s~~v~~~ 226 (300) T protein:vir:95 151 PFKDTNP-DESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLY-PELAWGGVPDAINGLAV--DKNRTVSYS 226 (300) T ss_pred cccccch-HHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeec-cCccccCCCceecceee--EEecCCCCC Confidence 4444443 45666666566677888888999999999884433 2334433 2221 111245444 111110000 Q ss_pred cC--CCEe-ecccc----cccccccccCCCCCCcceeEEecCCCCCC----cCcccceeEEEEEEEEcccCCccccccee Q lcl|NC_020871. 265 LH--GSTV-MENEQ----ILDERILALPTAPQQAKVTATQEAGKKGQ----FRAEDLAAHEYKVVVSSDDAESIASEVAT 333 (468) Q Consensus 265 l~--gs~i-~~~~n----~l~~~~~~~p~ap~~~~vtat~~~~~~g~----~~~~~~~~y~YkVtavn~~GES~aS~~vt 333 (468) .. ...+ +.+.. |..+.....- + ...++..++ |.. ....+++..--+-+=--|+..+- T Consensus 227 ~~~~~~~~~~GDf~~~~~~~~~~~~~~~-------v--~~~~~~d~~~~~~f~~---~~v~~r~~~r~d~~v~~~~a~~~ 294 (300) T protein:vir:95 227 QTDPKNTAIVGDFETMFKWGYAKEVPME-------I--IKYGDPDNSGRDLKGY---NQIYIRCEAYIGWGIMDAASFAR 294 (300) T ss_pred CCCCccEEEEeeccceEEEEEecccEEE-------E--eeccCCCCcchhhhhc---CcEEEEEEEeecceeecccceEE Confidence 00 1112 22221 1111111000 0 000011111 111 12444554433322222443444 Q ss_pred eeeecc Q lcl|NC_020871. 334 ATVTAK 339 (468) Q Consensus 334 ~Tv~a~ 339 (468) ++-.+. T Consensus 295 l~~~~g 300 (300) T protein:vir:95 295 IVKTGG 300 (300) T ss_pred EecCCC Confidence 444333 No 51 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=97.14 E-value=0.00016 Score=41.46 Aligned_cols=288 Identities=11% Similarity=0.035 Sum_probs=138.7 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhc----cc-------------ccCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTT----GY-------------GITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~a----gy-------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) -....+.....+.+... ..+..|.+.. +. ..+..+..+|+.+..| +.+.|..+. .+... T Consensus 67 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~ii~~~--~~~~~ 142 (390) T protein:vir:81 67 NGAGGDVQHVSVGDMFV-ASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPN-RLPGFITPP--DARLT 142 (390) T ss_pred cccccccccccchhhhh-hhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechh-hhHHHHHHH--hhhhh Confidence 11111111111111111 1122233211 10 1111223334445554 445554433 33345 Q ss_pred chhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) +.+.+...+..+--.+|.++.. ..+...+++|++..+..++.+......++-++---.+|.-+ +.++ .+.+....+ T Consensus 143 l~~~~~~~~~~~~~~~~~~~~~--~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~~~~~i~~ 218 (390) T protein:vir:81 143 VRDLIGSGRTDSALIEYVQETG--FVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLASYMNN 218 (390) T ss_pred hhhhcceeeccCCceEEEEEec--CCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH-HHhH-HHHHHHHHH Confidence 6666666666665556666653 33345689999999999999999999999999888888853 3344 478888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .-...+...++.++++|+-. |-.+.||.+......+--.-+.....+.|..+.-.+...+...+-++||+. T Consensus 219 ~l~~~~~~~~d~a~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 289 (390) T protein:vir:81 219 RLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPI 289 (390) T ss_pred HHHHHHHHHHHHHHHhcCCC---------CCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHH Confidence 89999999999999999733 224577776544333222223334445555555555566777778999999 Q ss_pred HHhhHHHhhcCC--ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCC Q lcl|NC_020871. 224 VQADFVNQQLSK--QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGK 301 (468) Q Consensus 224 v~a~~~~~~~~~--qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~ 301 (468) +.+.|.. ..+. ++.++++..+ +.-.|.|..++..+.. | ..+.+ T Consensus 290 ~~~~l~~-lkd~~G~~l~~~~~~~---------------~~~~l~G~pv~~~~~~--------p---~~~~~-------- 334 (390) T protein:vir:81 290 DWAAIEL-AKDANNQYLIGNARGT---------------LTPTLWGLPVVATQAM--------A---PGEFL-------- 334 (390) T ss_pred HHHHHHH-hhcCCCceeecCcccc---------------cCceecceeeEEcCCC--------C---CCcEE-------- Confidence 9998853 3222 3333221111 1112334444332221 1 00000 Q ss_pred CCCcCcccceeEEEEEEEEcccCCcc---------cccceeeeee--------ccCcceEEEEEee Q lcl|NC_020871. 302 KGQFRAEDLAAHEYKVVVSSDDAESI---------ASEVATATVT--------AKDDGVKLEIELA 350 (468) Q Consensus 302 ~g~~~~~~~~~y~YkVtavn~~GES~---------aS~~vt~Tv~--------a~~~g~~ltIT~~ 350 (468) -|.|. +.+..+.+.+-+. -+..+...+. ....- +.||+. T Consensus 335 ~gd~~--------~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~--v~~t~a 390 (390) T protein:vir:81 335 VGAFD--------LAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEAL--ISGSFA 390 (390) T ss_pred EEehh--------ceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccce--EEEEeC Confidence 01111 0000111111000 0010110000 00001 112222 No 52 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.08 E-value=0.00015 Score=41.49 Aligned_cols=281 Identities=13% Similarity=0.055 Sum_probs=140.7 Q ss_pred ccccc-CcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCc Q lcl|NC_020871. 28 TGYGI-TPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDP 106 (468) Q Consensus 28 agy~~-~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~ 106 (468) -|+.- ...+.++|+.|-++.+.++|....... ..+.+.....+..+...++.+.. + ....|++|++..+..++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~--s~l~~~~~~~~~~~~~~~~~~~~---~-~~a~~v~E~~~~~~~~~ 74 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNG--SAAMKLAKAVPMTKPEEEFTFMS---G-VGAFWVDEAERIQTSKP 74 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhc--chhhhhceeeecCCCcEEEEEEc---C-CceeeeecCcccccccc Confidence 44331 223344567788888888885544333 34556666666666666665543 2 23568999999999999 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhh-hcCc Q lcl|NC_020871. 107 NIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAK-LINQ 185 (468) Q Consensus 107 ~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~-li~~ 185 (468) .+.......|-++---.+|.-+- .++..|.+....+.-...+++.+|.++++|+.+- .+. |+.+ .... T Consensus 75 ~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~-----~~~-----gil~~~~~~ 143 (299) T protein:vir:41 75 TFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESP-----YNW-----NILKSATDA 143 (299) T ss_pred ceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCc-----ccc-----ccccccccc Confidence 99999999999999888887432 2466788888889999999999999999998431 112 4444 3334 Q ss_pred cceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCc-eEEeecCCC---cceeeeeccceeecCC Q lcl|NC_020871. 186 DNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQLVRDNGN---NVSVGFNIQGFHSARG 261 (468) Q Consensus 186 ~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~q-r~v~~~n~~---~~~~G~~v~~~~s~~g 261 (468) .+.... ...+.+.|.++...+...+...+-+.|++.+.+.|.. ..+.+ |.+.+++.. ..-.|.+| +.+..- T Consensus 144 ~~~~~~--~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~l~G~PV--~~~~~~ 218 (299) T protein:vir:41 144 SNLVEE--TANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRS-TKDGNGMPIFNTATSNGVDDVLGLPI--AYTPKY 218 (299) T ss_pred ceeecc--ccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHH-hhccCCceeecCCcCCCCceecceee--EEeccc Confidence 444322 2344455555555566777777889999999999854 43332 444433321 11244443 222211 Q ss_pred ccccCCCEe-ecccc---cccccccccCCCCCCcceeEEecCCC----CCCcCcccceeEEEEEEEEcccCC----cccc Q lcl|NC_020871. 262 FIKLHGSTV-MENEQ---ILDERILALPTAPQQAKVTATQEAGK----KGQFRAEDLAAHEYKVVVSSDDAE----SIAS 329 (468) Q Consensus 262 ~i~l~gs~i-~~~~n---~l~~~~~~~p~ap~~~~vtat~~~~~----~g~~~~~~~~~y~YkVtavn~~GE----S~aS 329 (468) ...-....+ +.+.. |.+......-..-........-..+. ..+.--.-....++=+...+...- -.++ T Consensus 219 ~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 219 TFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred CCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 100000011 11110 00000000000000000000000000 000000000011111111111110 0011 Q ss_pred c Q lcl|NC_020871. 330 E 330 (468) Q Consensus 330 ~ 330 (468) + T Consensus 299 ~ 299 (299) T protein:vir:41 299 N 299 (299) T ss_pred C Confidence 1 No 53 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=97.07 E-value=0.00015 Score=41.51 Aligned_cols=285 Identities=13% Similarity=0.123 Sum_probs=141.8 Q ss_pred cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeecccccccccccccc-----ccccCc Q lcl|NC_020871. 32 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-----APVSDP 106 (468) Q Consensus 32 ~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-----~~~~d~ 106 (468) +...+-++|+.|-++.+.++|....... -.+.+.....+..+.-..|.+... .....+++|++. .+.+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~--s~l~~l~~~~~~~~~~~~~p~~~~---~~~a~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQG--STVLSAFQNVNMGTKTTHLPVLAT---LPEADWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhh--chhhhhcceeeccCCcEEEEEEeC---CcceEEeeccccccccccccccc Confidence 5555566688899999988885544433 356666666666655455666663 334558999874 456799 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhh-hcCc Q lcl|NC_020871. 107 NIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAK-LINQ 185 (468) Q Consensus 107 ~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~-li~~ 185 (468) .+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+- .|++==++.. .... T Consensus 76 ~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~-------~~~~~~~~~~~~~~~ 147 (305) T protein:vir:25 76 TWANRTLVAEEIAVIIPVHENV-IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKP-------ASWVSPALIPAAVTA 147 (305) T ss_pred ceeeEEeeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCC-------CCccccccccccccc Confidence 9999999999888888888732 23467888999999999999999999999998531 1221011111 1122 Q ss_pred cceeeccCCCCC-H---HHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecC Q lcl|NC_020871. 186 DNVHDARGASLT-E---SLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSAR 260 (468) Q Consensus 186 ~nviDarG~~ls-~---~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~ 260 (468) .+.....+.... . +.+..+...+...+..++.++||+...+.+.. ..+. .|.+.+++ .-.|.++ +++.. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~-lkd~~G~~i~~~~---~l~G~Pv--~~~~~ 221 (305) T protein:vir:25 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN-IRDANGNPVFRDD---SFAGFRT--FFNRN 221 (305) T ss_pred cccccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHH-hhccCCceeecCC---cccccce--EEcCc Confidence 333333333333 2 22334445556667777889999999988843 3322 23333332 2244433 22221 Q ss_pred CccccCC-CEeecccccc---cccccccCCCCCCcceeEEecCCCCCCc-Cc--ccceeEEEEEEEEcccCCcc--cccc Q lcl|NC_020871. 261 GFIKLHG-STVMENEQIL---DERILALPTAPQQAKVTATQEAGKKGQF-RA--EDLAAHEYKVVVSSDDAESI--ASEV 331 (468) Q Consensus 261 g~i~l~g-s~i~~~~n~l---~~~~~~~p~ap~~~~vtat~~~~~~g~~-~~--~~~~~y~YkVtavn~~GES~--aS~~ 331 (468) ....... ..++.+..-. +......-.... .+.....+.-.-| .. .-...-+|-+..++...--. -.+. T Consensus 222 ~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 222 GAWDADAAIEVIADSSRVKIGVRQDITVKFLDQ---ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred cCCCCCccEEEEEecceEEEEEecCeEEEEeee---eeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 1111111 1222221100 000000000000 0000000000000 00 00112333333444333100 0011 Q ss_pred eeeeeec Q lcl|NC_020871. 332 ATATVTA 338 (468) Q Consensus 332 vt~Tv~a 338 (468) ..+|.++ T Consensus 299 ~~~~pa~ 305 (305) T protein:vir:25 299 AVVAPAA 305 (305) T ss_pred cccCCCC Confidence 1222222 No 54 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.02 E-value=4.7e-05 Score=44.29 Aligned_cols=236 Identities=21% Similarity=0.138 Sum_probs=111.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHh-hhhhhccccccchhhhcccchh-hhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQI-SMLTWTENDLTFYKDIAKKPAT-STVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i-~~L~~~~~~f~~~~~i~k~~~~-stv~ 78 (468) ||.-. .++-+..+| +|= +++ ..-|+..| ..|+..+ .++.+++=...+ .|=| T Consensus 1 m~~~~-----~~~~TL~e~---Ak~------~~~----------~~~l~~~IIE~l~~tn---~IL~~lpf~e~N~~t~~ 53 (331) T protein:vir:10 1 MPTLS-----TTNPTLADV---AAR------MTP----------DGKIDPQIVEMLNETN---EILDDMTVIEANGFTEH 53 (331) T ss_pred CCccc-----cCcccHHHH---HHh------cCc----------chhHHHHHHHHHhcCc---hHHhhceeeeccCCccc Confidence 54321 111111111 110 000 01122111 1122222 234444444454 3447 Q ss_pred ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc-chhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~-~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) .|.+++.--+. .|-.=...-+-+.++..|++..++.|..-..|.+...-.+ +..+-.++|.+.-|..+.++++..+ T Consensus 54 ~~~vrt~LP~~---~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~ 130 (331) T protein:vir:10 54 KTTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTL 130 (331) T ss_pred eeeEEeccCCc---hhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777744443 3522233445678899999999999999999999766654 5777789999999999999999999 Q ss_pred hhcccccccCCCCCCCccccchhhhcC------ccceeeccCCCCCHHHHh----------------------------- Q lcl|NC_020871. 158 FFGDSDLSDSPEPQAGLEFDGLAKLIN------QDNVHDARGASLTESLLN----------------------------- 202 (468) Q Consensus 158 f~Gd~~l~~~~~~~~gleFDGl~~li~------~~nviDarG~~ls~~~l~----------------------------- 202 (468) ||||++.+ +-+||||.+..+ .+|+||+.|..-..-.|+ T Consensus 131 iyGD~a~~-------p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~ 203 (331) T protein:vir:10 131 FYGDSSID-------AEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGED 203 (331) T ss_pred hcCCcccC-------hhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCce Confidence 99998874 349999999774 458999988763221111 Q ss_pred ---hhhhhhhhccCc-----------------------------------------------------e-EEEecCHHHH Q lcl|NC_020871. 203 ---QAAVMISKGYGT-----------------------------------------------------P-TDAYMPVGVQ 225 (468) Q Consensus 203 ---~~a~~i~~~fG~-----------------------------------------------------~-td~~m~~~v~ 225 (468) .+..-.-++|++ . +-+||+-.++ T Consensus 204 ~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~ 283 (331) T protein:vir:10 204 TLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIR 283 (331) T ss_pred eeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHH Confidence 000000011111 0 1122222222 Q ss_pred hhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCccee Q lcl|NC_020871. 226 ADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVT 294 (468) Q Consensus 226 a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vt 294 (468) +.|+-+-..+-.+. -.+....-...++.-+.|+ |.+-|-.++.. +.|+ T Consensus 284 ~~L~~q~~~~~~~~------~~~~~~~~g~~~t~~~gip-----ir~~dai~~tE----------~~Vv 331 (331) T protein:vir:10 284 SFLRRQITNKVAAS------TLTMEEIAGKKVVAFDGIP-----CRRTDALLLTE----------ARVV 331 (331) T ss_pred HHHHHHHhhcccee------eeeeeecCCcceeEECCee-----EEEeeeeecCc----------cccC Confidence 22211111110000 0000000111111111111 11222221111 1111 No 55 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.02 E-value=4.7e-05 Score=44.29 Aligned_cols=236 Identities=21% Similarity=0.138 Sum_probs=111.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHh-hhhhhccccccchhhhcccchh-hhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQI-SMLTWTENDLTFYKDIAKKPAT-STVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i-~~L~~~~~~f~~~~~i~k~~~~-stv~ 78 (468) ||.-. .++-+..+| +|= +++ ..-|+..| ..|+..+ .++.+++=...+ .|=| T Consensus 1 m~~~~-----~~~~TL~e~---Ak~------~~~----------~~~l~~~IIE~l~~tn---~IL~~lpf~e~N~~t~~ 53 (331) T protein:vir:98 1 MPTLS-----TTNPTLADV---AAR------MTP----------DGKIDPQIVEMLNETN---EILDDMTVIEANGFTEH 53 (331) T ss_pred CCccc-----cCcccHHHH---HHh------cCc----------chhHHHHHHHHHhcCc---hHHhhceeeeccCCccc Confidence 54321 111111111 110 000 01122111 1122222 234444444454 3447 Q ss_pred ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc-chhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~-~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) .|.+++.--+. .|-.=...-+-+.++..|++..++.|..-..|.+...-.+ +..+-.++|.+.-|..+.++++..+ T Consensus 54 ~~~vrt~LP~~---~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~ 130 (331) T protein:vir:98 54 KTTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTL 130 (331) T ss_pred eeeEEeccCCc---hhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777744443 3522233445678899999999999999999999766654 5777789999999999999999999 Q ss_pred hhcccccccCCCCCCCccccchhhhcC------ccceeeccCCCCCHHHHh----------------------------- Q lcl|NC_020871. 158 FFGDSDLSDSPEPQAGLEFDGLAKLIN------QDNVHDARGASLTESLLN----------------------------- 202 (468) Q Consensus 158 f~Gd~~l~~~~~~~~gleFDGl~~li~------~~nviDarG~~ls~~~l~----------------------------- 202 (468) ||||++.+ +-+||||.+..+ .+|+||+.|..-..-.|+ T Consensus 131 iyGD~a~~-------p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~ 203 (331) T protein:vir:98 131 FYGDSSID-------AEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGED 203 (331) T ss_pred hcCCcccC-------hhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCce Confidence 99998874 349999999774 458999988763221111 Q ss_pred ---hhhhhhhhccCc-----------------------------------------------------e-EEEecCHHHH Q lcl|NC_020871. 203 ---QAAVMISKGYGT-----------------------------------------------------P-TDAYMPVGVQ 225 (468) Q Consensus 203 ---~~a~~i~~~fG~-----------------------------------------------------~-td~~m~~~v~ 225 (468) .+..-.-++|++ . +-+||+-.++ T Consensus 204 ~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~ 283 (331) T protein:vir:98 204 TLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIR 283 (331) T ss_pred eeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHH Confidence 000000011111 0 1122222222 Q ss_pred hhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCccee Q lcl|NC_020871. 226 ADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVT 294 (468) Q Consensus 226 a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vt 294 (468) +.|+-+-..+-.+. -.+....-...++.-+.|+ |.+-|-.++.. +.|+ T Consensus 284 ~~L~~q~~~~~~~~------~~~~~~~~g~~~t~~~gip-----ir~~dai~~tE----------~~Vv 331 (331) T protein:vir:98 284 SFLRRQITNKVAAS------TLTMEEIAGKKVVAFDGIP-----CRRTDALLLTE----------ARVV 331 (331) T ss_pred HHHHHHHhhcccee------eeeeeecCCcceeEECCee-----EEEeeeeecCc----------cccC Confidence 22211111110000 0000000111111111111 11222221111 1111 No 56 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.02 E-value=4.7e-05 Score=44.29 Aligned_cols=236 Identities=21% Similarity=0.138 Sum_probs=111.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHh-hhhhhccccccchhhhcccchh-hhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQI-SMLTWTENDLTFYKDIAKKPAT-STVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i-~~L~~~~~~f~~~~~i~k~~~~-stv~ 78 (468) ||.-. .++-+..+| +|= +++ ..-|+..| ..|+..+ .++.+++=...+ .|=| T Consensus 1 m~~~~-----~~~~TL~e~---Ak~------~~~----------~~~l~~~IIE~l~~tn---~IL~~lpf~e~N~~t~~ 53 (331) T protein:vir:10 1 MPTLS-----TTNPTLADV---AAR------MTP----------DGKIDPQIVEMLNETN---EILDDMTVIEANGFTEH 53 (331) T ss_pred CCccc-----cCcccHHHH---HHh------cCc----------chhHHHHHHHHHhcCc---hHHhhceeeeccCCccc Confidence 54321 111111111 110 000 01122111 1122222 234444444454 3447 Q ss_pred ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc-chhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~-~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) .|.+++.--+. .|-.=...-+-+.++..|++..++.|..-..|.+...-.+ +..+-.++|.+.-|..+.++++..+ T Consensus 54 ~~~vrt~LP~~---~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~ 130 (331) T protein:vir:10 54 KTTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTL 130 (331) T ss_pred eeeEEeccCCc---hhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777744443 3522233445678899999999999999999999766654 5777789999999999999999999 Q ss_pred hhcccccccCCCCCCCccccchhhhcC------ccceeeccCCCCCHHHHh----------------------------- Q lcl|NC_020871. 158 FFGDSDLSDSPEPQAGLEFDGLAKLIN------QDNVHDARGASLTESLLN----------------------------- 202 (468) Q Consensus 158 f~Gd~~l~~~~~~~~gleFDGl~~li~------~~nviDarG~~ls~~~l~----------------------------- 202 (468) ||||++.+ +-+||||.+..+ .+|+||+.|..-..-.|+ T Consensus 131 iyGD~a~~-------p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~ 203 (331) T protein:vir:10 131 FYGDSSID-------AEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGED 203 (331) T ss_pred hcCCcccC-------hhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCce Confidence 99998874 349999999774 458999988763221111 Q ss_pred ---hhhhhhhhccCc-----------------------------------------------------e-EEEecCHHHH Q lcl|NC_020871. 203 ---QAAVMISKGYGT-----------------------------------------------------P-TDAYMPVGVQ 225 (468) Q Consensus 203 ---~~a~~i~~~fG~-----------------------------------------------------~-td~~m~~~v~ 225 (468) .+..-.-++|++ . +-+||+-.++ T Consensus 204 ~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~ 283 (331) T protein:vir:10 204 TLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIR 283 (331) T ss_pred eeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHH Confidence 000000011111 0 1122222222 Q ss_pred hhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCccee Q lcl|NC_020871. 226 ADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVT 294 (468) Q Consensus 226 a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vt 294 (468) +.|+-+-..+-.+. -.+....-...++.-+.|+ |.+-|-.++.. +.|+ T Consensus 284 ~~L~~q~~~~~~~~------~~~~~~~~g~~~t~~~gip-----ir~~dai~~tE----------~~Vv 331 (331) T protein:vir:10 284 SFLRRQITNKVAAS------TLTMEEIAGKKVVAFDGIP-----CRRTDALLLTE----------ARVV 331 (331) T ss_pred HHHHHHHhhcccee------eeeeeecCCcceeEECCee-----EEEeeeeecCc----------cccC Confidence 22211111110000 0000000111111111111 11222221111 1111 No 57 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=96.97 E-value=0.00023 Score=40.50 Aligned_cols=297 Identities=12% Similarity=0.024 Sum_probs=138.0 Q ss_pred CCCcccchhhcccChhhHHH---HHHHHhhccc-------------ccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQE---DALKSFTTGY-------------GITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e---~~~Ksf~agy-------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) .+...+. +.......+ ++......+. ..+..+..+|+-+-.+.+. .+..+. .+...+ T Consensus 71 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~ii~~~--~~~~~l 143 (390) T protein:vir:10 71 GDVQHVS----VGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLP-GFITQP--DARLTV 143 (390) T ss_pred ccccccc----hhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHH-HHHHHH--Hhhchh Confidence 1111110 000111111 1111111110 1112233344455555554 443333 223356 Q ss_pred hhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDD 144 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ 144 (468) ++.+...++.+.-.+|.+... ..+...+++|++..+..|+++......++-++-...+|.-+ +.++ .+......+. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~--~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~l~~~i~~~ 219 (390) T protein:vir:10 144 RDLIGSGRTDSALIEYVQETG--FVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLASYMNNR 219 (390) T ss_pred hhhcceeeccCCceEEEEEec--CCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHH-HHhH-HHHHHHHHHH Confidence 666666666655455665553 33445689999999999999999999999999988888864 3344 4778888888 Q ss_pred HHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc-ceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 145 AIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 145 ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) -...++..++.+++.|+-. +-++.||.+..... .+....|.. ..+.|..+-..+..+|...+-++|++. T Consensus 220 l~~~~~~~~~~~il~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~v~n~~ 289 (390) T protein:vir:10 220 LIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIAGAT-RVDQLRLAMLQASLAEYPASGIVINPI 289 (390) T ss_pred HHHHHHHHHHHHHhhcCCC---------Cccccccccccccccccccccccc-hHHHHHHHHHhhccccCCCCEEEEcHH Confidence 8889999999999999732 22567877755433 233333433 344455444455567778888999999 Q ss_pred HHhhHHHhh-cCCceEEeecCCCcceeeeeccceeecCCccccCCCEeeccccccccccc----cc-CCCCCCcceeEEe Q lcl|NC_020871. 224 VQADFVNQQ-LSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERIL----AL-PTAPQQAKVTATQ 297 (468) Q Consensus 224 v~a~~~~~~-~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~----~~-p~ap~~~~vtat~ 297 (468) +.+.+...- -.+++.++++..+. .-.|.|..++.++..-..... .. -.......++-.. T Consensus 290 ~~~~L~~lkd~~g~~l~~~~~~~~---------------~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~ 354 (390) T protein:vir:10 290 DWAAIELAKDANNQYLIGNARGTL---------------TPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEI 354 (390) T ss_pred HHHHHHHhhcCCCceeecCCcCcC---------------CceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEE Confidence 998885433 12233333222111 012233333332221000000 00 0000000000000 Q ss_pred cCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeee Q lcl|NC_020871. 298 EAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVT 337 (468) Q Consensus 298 ~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~ 337 (468) .. ....|.. ....|++..--+-.=--|...+-+|.+ T Consensus 355 ~~-~~~~~~~---~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 355 GY-VNDDFQR---NMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ee-ccccccc---CcEEEEEEEeeccEEeccccEEEEEeC Confidence 00 0000100 011111111100000111112222222 No 58 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=96.92 E-value=0.00023 Score=40.48 Aligned_cols=311 Identities=13% Similarity=0.090 Sum_probs=133.8 Q ss_pred CC-Ccccc--hhhcccChh-hHHHHHHHHhh--------------ccccc------CcccccCccccchhhhhhHhhhhh Q lcl|NC_020871. 1 MP-KNNKE--EEVKEVNLN-SVQEDALKSFT--------------TGYGI------TPDTQTDAGALRREFLDDQISMLT 56 (468) Q Consensus 1 ~~-~~~~~--~~~~~~n~~-~~~e~~~Ksf~--------------agy~~------~p~~~~~gaALr~esld~~i~~L~ 56 (468) .+ ...+. ....|.... +.-....+++. -.+.. ...+-..|+.|-++.+..+|..+. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l 150 (428) T protein:vir:10 71 KATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELL 150 (428) T ss_pred hchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHH Confidence 00 00000 000010000 00000111110 00000 011122467788888877775544 Q ss_pred hccccccchhhhcccc--hhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch Q lcl|NC_020871. 57 WTENDLTFYKDIAKKP--ATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNI 134 (468) Q Consensus 57 ~~~~~f~~~~~i~k~~--~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~ 134 (468) .. ...+..+.-+. ..+--.+|.++.. .+...+++|++..+.+++.+.+.+...+=++.-..+|..+ +.++. T Consensus 151 ~~---~~~l~~~~~~~~~~~~g~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~el-l~ds~ 223 (428) T protein:vir:10 151 RD---RTIVRKLGARSIPLPNGNMSLPRLAG---GATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNAL-IGRAG 223 (428) T ss_pred hh---hchhhhhcceeeecCCcceEEEEEeC---CcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHH-Hhhhh Confidence 32 22233332121 2222245666653 3346689999999999999999999999888887777765 33566 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc-ceee-ccCCCCCHHHHhhhhh------ Q lcl|NC_020871. 135 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHD-ARGASLTESLLNQAAV------ 206 (468) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviD-arG~~ls~~~l~~~a~------ 206 (468) .|.+....+.-...+.+.+|.++++||.. |-+++||.+..... .++. ..+...+.+.+..... T Consensus 224 ~~l~~~i~~~l~~ai~~~~d~~~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (428) T protein:vir:10 224 FNVEQLVLQDILTAISVREDKAFMRDDGT---------GDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMS 294 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCC---------CccccccccccccccccccccccccccHHHHHHHHHHHHHhh Confidence 77888889999999999999999999842 34678888755433 3332 3345555554443221 Q ss_pred hhhhccCceEEEecCHHHHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccC Q lcl|NC_020871. 207 MISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALP 285 (468) Q Consensus 207 ~i~~~fG~~td~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p 285 (468) .....+-...-.+|++.+...+.. .-+.+ |.+.++..++. |.|.-++..+.. | T Consensus 295 ~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~i~~~~~~g~-----------------l~G~pv~~~~~~--------p 348 (428) T protein:vir:10 295 MDGNSNMISSGWGMSNRTYMKLFG-LRDGNGNKVYPEMAQGM-----------------LKGYPIQRTSAI--------P 348 (428) T ss_pred hccccccccCEEEEcHHHHHHHHH-hhccCCceeccCCCCCe-----------------eeceeeEEeccc--------c Confidence 112223333456899999988843 33332 33333322221 223322221111 1 Q ss_pred CCCCCcceeEEecCCCCCCcCcccceeEEEEE--------EEEcccCCcccccceeeeeeccCcceEEE----EEeecCC Q lcl|NC_020871. 286 TAPQQAKVTATQEAGKKGQFRAEDLAAHEYKV--------VVSSDDAESIASEVATATVTAKDDGVKLE----IELAPMY 353 (468) Q Consensus 286 ~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkV--------tavn~~GES~aS~~vt~Tv~a~~~g~~lt----IT~~~~~ 353 (468) ..... ++..+.+- .+-+++.+ +.+++++...-+... ....-..+-+.+. +.|.... T Consensus 349 ---~~~~~-----~~~~~~i~---~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~-~~~~f~~~~~~~R~~~r~d~~v~~ 416 (428) T protein:vir:10 349 ---ANLGE-----GGKESEIY---FADFNDVVIGEDGNMKVDFSKEASYIDTDGK-LVSAFSRNQSLIRVVTEHDIGFRH 416 (428) T ss_pred ---ccccC-----CCccceEE---EEecceEEEEEecceEEEeeccccccccccc-ccchhhcchhheeeeeeeCceeec Confidence 00000 00000000 01111111 112222211111000 0000000011111 1111000 Q ss_pred CcccceEEEEeecCCCceeE Q lcl|NC_020871. 354 SSRPQFVSIYRKGAETGLFY 373 (468) Q Consensus 354 ga~~~~y~IYR~~~~~G~f~ 373 (468) |+-+.+=+ |+=+ T Consensus 417 ---p~a~~~~t-----~~~~ 428 (428) T protein:vir:10 417 ---PEGLVLGT-----GVLF 428 (428) T ss_pred ---cceEEEEe-----ccCC Confidence 01111111 0000 No 59 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=96.92 E-value=0.0002 Score=40.89 Aligned_cols=280 Identities=11% Similarity=0.011 Sum_probs=132.9 Q ss_pred cccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEE Q lcl|NC_020871. 36 TQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNM 115 (468) Q Consensus 36 ~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 115 (468) =-++|+.|-++-+..+|..+... ...+.+.....+..+.-.+|.++...+ ...+++|++..+.+++.+.+..... T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~--~s~i~~~~~~~~~~~~~~~~p~~~~~~---~a~~v~Eg~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAG--KSSIARLSAQKPIPFNGEKVFTFTMDS---EIDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CeeccccccChhHHHHHHHHHHh--hchhhhhcceeeccCCceEEEEEecCc---ceEEeeCCccccccccceeEEEEee Confidence 22345566666676666444332 234455455555555444677776443 3458999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-cce--ee Q lcl|NC_020871. 116 KFASDTKNISIAAGLVN--NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-DNV--HD 190 (468) Q Consensus 116 k~l~~~~~vs~~~~lv~--~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-~nv--iD 190 (468) +=++.--.+|.-+-..+ ...+.++...++-...+++.+|.++++|...-+ +.....-|+...... .+. .+ T Consensus 76 ~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~-----g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL-----GTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-----Ccccccccccccccccccccccc Confidence 99988888887753222 345677788888999999999999999953321 122222333322221 122 22 Q ss_pred ccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCc-eEEeecCC--Cc--ceeeeecc--ceeecCCcc Q lcl|NC_020871. 191 ARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQLVRDNG--NN--VSVGFNIQ--GFHSARGFI 263 (468) Q Consensus 191 arG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~q-r~v~~~n~--~~--~~~G~~v~--~~~s~~g~i 263 (468) ..+..+ .+.|..+...+..++..+.-..|++.+.+.+.. ..+.+ |.+.++.. +. --.|.+|- ..+...+. T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~- 227 (298) T protein:vir:94 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAK-QKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL- 227 (298) T ss_pred cccccH-HHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH-hhccCCCeeecCcccCCCCceecceeeEEecccccccC- Confidence 222222 334555555566667778889999999999844 33333 33333322 11 12455541 11111000 Q ss_pred ccCCCEe-ecccc----cccccccccCCCCCCcceeEEe-cCCCCCCcCcccceeEEEEEEEEcccCCccccccee Q lcl|NC_020871. 264 KLHGSTV-MENEQ----ILDERILALPTAPQQAKVTATQ-EAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVAT 333 (468) Q Consensus 264 ~l~gs~i-~~~~n----~l~~~~~~~p~ap~~~~vtat~-~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt 333 (468) -....+ +.+.. |..+......-.+.. ....+. .....+. -......++-+...+...-..... +| T Consensus 228 -~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~-~~d~~~~~~f~~~~--v~~r~~~r~~~~~~~~~a~~~l~~-~t 298 (298) T protein:vir:94 228 -TQRDRAIIGDFANGFKWGYAKEVPLEVIQYG-DPDNSGLDLKGYNQ--VYIRAELFLGWGILDATKFARVTE-AN 298 (298) T ss_pred -CCccEEEEeeccceEEEEEecCceEEEeecC-CCcCcchhhhhcCc--EEEEEEEEeccEeecccceEEEEe-cC Confidence 001112 22221 101111110000000 000000 0000000 000011111111112111111111 11 No 60 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=96.88 E-value=0.00028 Score=40.05 Aligned_cols=301 Identities=9% Similarity=0.006 Sum_probs=141.3 Q ss_pred cccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcccee Q lcl|NC_020871. 4 NNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVY 83 (468) Q Consensus 4 ~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~ 83 (468) .+--.+.+| |+.. ..+++ -++++.+-++.+..++..+... +..+.+.....+..+.-.+|.+. T Consensus 1 ~~~~~~~~~------e~~~--~~~~~-------~~~~~~~ip~~~~~~ii~~~~~--~~~l~~~~~~~~~~~~~~~ip~~ 63 (318) T protein:vir:24 1 MAAGTAFAV------DHAQ--IAQTG-------DTMFKGYLEPEQAKDYFAEAEK--TSIVQQFAQKVPMGTTGQKIPHW 63 (318) T ss_pred CCCCCCCCH------HHHH--hhccc-------CcccceeechhHHHHHHHHHHh--hchhhhhcceeeccCCceEEEEE Confidence 122222222 2211 12222 1335556666677776444333 33566666666666655667776 Q ss_pred eeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020871. 84 MQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSD 163 (468) Q Consensus 84 ~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~ 163 (468) .. .+...+++|++..+.+++.+.......+=++-.-.+|.-+ +.++..|.+....+.-...+++.+|.++++|+.+ T Consensus 64 ~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~ 139 (318) T protein:vir:24 64 VG---DVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDGAAMHGTDS 139 (318) T ss_pred eC---CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-hhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCC Confidence 63 4456799999999999999999999999998877777732 2346678889999999999999999999999853 Q ss_pred cccCCCCCCCccccchhhhcCccceeeccCC-CCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCc-eEEee Q lcl|NC_020871. 164 LSDSPEPQAGLEFDGLAKLINQDNVHDARGA-SLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQLVR 241 (468) Q Consensus 164 l~~~~~~~~gleFDGl~~li~~~nviDarG~-~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~q-r~v~~ 241 (468) - .+ .|+......-..-...+. ....+.+.++...+...+....-..|++.+.+.+.. ..+.+ |.+.+ T Consensus 140 ~-----~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~l~~ 208 (318) T protein:vir:24 140 P-----FP-----TYIGQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNG-AKDQNGRPLFI 208 (318) T ss_pred C-----CC-----cccccccccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH-hhccCCceeec Confidence 1 11 233333322211111111 222334444555566667777789999999999953 33321 22222 Q ss_pred cCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEc Q lcl|NC_020871. 242 DNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSS 321 (468) Q Consensus 242 ~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn 321 (468) ++..+. +...+.+..++....... ++.|....... - +-+++.+ ... T Consensus 209 ~~~~~~-------------~~~~~~~~~i~g~pv~~~------~~~~~~~~~~~------~--------gdfs~~~-~~~ 254 (318) T protein:vir:24 209 ESTYGE-------------AASPFRSGRIVARPTILS------DHVVEGTTVGF------M--------GDFSQLI-WGQ 254 (318) T ss_pred CccccC-------------ccccccCceEEEEeeEEe------CCCCCCccEEE------E--------eecceEE-EEE Confidence 222111 111112222222111100 11111111000 0 1111111 111 Q ss_pred ccC-Ccccccceeeeeec---------c-CcceEEE----EEeecCCCcccceEEEEeecCCCceeE Q lcl|NC_020871. 322 DDA-ESIASEVATATVTA---------K-DDGVKLE----IELAPMYSSRPQFVSIYRKGAETGLFY 373 (468) Q Consensus 322 ~~G-ES~aS~~vt~Tv~a---------~-~~g~~lt----IT~~~~~ga~~~~y~IYR~~~~~G~f~ 373 (468) ..+ +-..+...+.+.-. . .+-+.++ +.|.... |.-+..-.....+|.++ T Consensus 255 ~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~---~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 255 IGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCND---AEAFVALTNVVSGGGEG 318 (318) T ss_pred ecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEec---ccceEEEEeeccCCCCC Confidence 111 10011111100000 0 0111111 1111111 12233333322233333 No 61 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=96.83 E-value=0.00031 Score=39.83 Aligned_cols=314 Identities=11% Similarity=0.056 Sum_probs=146.4 Q ss_pred CCCcccchh--hcccChhhHHHHHHHHhhcc---------------c------ccCcccccCccccchhhhhhHhhhhhh Q lcl|NC_020871. 1 MPKNNKEEE--VKEVNLNSVQEDALKSFTTG---------------Y------GITPDTQTDAGALRREFLDDQISMLTW 57 (468) Q Consensus 1 ~~~~~~~~~--~~~~n~~~~~e~~~Ksf~ag---------------y------~~~p~~~~~gaALr~esld~~i~~L~~ 57 (468) -+..++... .........-..+.|++..+ + ..+..+...|+.|-++.+..+|..+.. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~ 157 (435) T protein:vir:14 78 APAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLR 157 (435) T ss_pred hccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHh Confidence 000000000 00011111122344443321 1 122223344677888888888765443 Q ss_pred ccccccchhhhcccch--hhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_020871. 58 TENDLTFYKDIAKKPA--TSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQ 135 (468) Q Consensus 58 ~~~~f~~~~~i~k~~~--~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 135 (468) .. ..+..+..+.+ .+---+|.++.. .+...+++|++..+..|+.+.+.+..++=++....+|.-+ +.++.. T Consensus 158 ~~---~~i~~~~~~~~~~~~~~~~~p~~~~---~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~ 230 (435) T protein:vir:14 158 PK---SVVRKLGARTLPLSNGNITIPRLKG---GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDL-IKYAGV 230 (435) T ss_pred hh---chhhhhcceeeecCCCceEEEEEeC---CcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHH-HHhhcc Confidence 32 23333322222 222335666663 2345689999999999999999999999888888888655 233322 Q ss_pred --hHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCC-CC--HHHHhhhhhhhhh Q lcl|NC_020871. 136 --DPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGAS-LT--ESLLNQAAVMISK 210 (468) Q Consensus 136 --Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~-ls--~~~l~~~a~~i~~ 210 (468) +.+....+.-...+.+.++.++++|+.. +-+..||.+...+.++...-+.. .+ ...|.++...+.. T Consensus 231 ~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 301 (435) T protein:vir:14 231 NPNVDQIVVGDLTAAIGAREDKAFIRDDGT---------ANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALEN 301 (435) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------CccccceeecccccceeccccccchhhHHHHHHHHHHHhhh Confidence 3557778888889999999999999832 23567777655555444433222 22 2223332222222 Q ss_pred ---ccCceEEEecCHHHHhhHHHhh-cCCceEEeecCCCcceeeeecc--ceeecCCccc-cCCCEeecccccccccccc Q lcl|NC_020871. 211 ---GYGTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNVSVGFNIQ--GFHSARGFIK-LHGSTVMENEQILDERILA 283 (468) Q Consensus 211 ---~fG~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~~~G~~v~--~~~s~~g~i~-l~gs~i~~~~n~l~~~~~~ 283 (468) ++..+ -..|++.+.+.+...- -.+++ +.+...++.-.|.+|- .++-...... -.+..++.+..-. ..+ T Consensus 302 ~~~~~~~~-~~v~n~~~~~~L~~lkd~~G~~-l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~---~i~ 376 (435) T protein:vir:14 302 ADANLTQP-GWIMAPRTFRFLEGLRDGNGNK-VYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDV---FIG 376 (435) T ss_pred ccccccCC-EEEEcHHHHHHHHHhhccCCce-eccCCCCCeeecceeEeeccccccccCCCccceEEEeecccE---EEE Confidence 23332 3789999998884332 23333 3343333333554441 1111100000 0111223321110 000 Q ss_pred cCCCCCCcceeEEe---cCCCC----CCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCc Q lcl|NC_020871. 284 LPTAPQQAKVTATQ---EAGKK----GQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDD 341 (468) Q Consensus 284 ~p~ap~~~~vtat~---~~~~~----g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~ 341 (468) . - ..-.+--.. ..... .-|.. ....+++...-+.+--.|+..+.++-.+++. T Consensus 377 ~--~-~~~~~~~~~~~~~~~~~~~~~~~f~~---~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 377 E--E-ETLEIDYSKEATYKDADGHMVSAFQR---DQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred E--e-cccEEEEeccccccccccchhhhhhc---ChhheeeeeeeCceeecccceEEEecCCCCC Confidence 0 0 000000000 00000 01211 2344555544444444455555555554444 No 62 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=96.77 E-value=0.00013 Score=41.97 Aligned_cols=253 Identities=17% Similarity=0.134 Sum_probs=106.6 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchh-hhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPAT-STVAK 79 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~-stv~e 79 (468) ||.-. +++-+..+ ++|-+- ++.. .+-..|. |+..+ .++.+++=...+ .|=|. T Consensus 1 m~~~~-----~~a~TL~E---~Akr~~------~d~~---~~~IIE~-------l~~tn---eIL~~lpf~e~N~~tg~~ 53 (335) T protein:vir:73 1 MALIG-----QTLPSLLD---IYNRTD------KNGR---IARIVEQ-------LAKTN---DILTDAIYVPCNDGSKHK 53 (335) T ss_pred CCcCC-----CCchhHHH---HHhhcC------cchh---HHHHHHH-------HhcCc---hHHhhcchhcccCCcccc Confidence 55321 11212221 111110 0000 0001222 22222 123333333333 23344 Q ss_pred cceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc-chhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020871. 80 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWASF 158 (468) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~-~~~Dp~~~~~~~ai~~~~~~~e~a~f 158 (468) +.+++.-.+.+ |-.=.....-+.++..|++..++.|..-..|-+...-.+ +..+-+++|.+.-|..+.++++..+| T Consensus 54 ~~vrt~LP~~~---fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~i 130 (335) T protein:vir:73 54 TTIRAGIPEPV---WRRYNQGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSI 130 (335) T ss_pred eeEEEecCCch---hhhcCCccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44445444333 422222334456999999999999999999998776665 57888999999999999999999999 Q ss_pred hcccccccCCCCCCCccccchhhhcC---------ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHH Q lcl|NC_020871. 159 FGDSDLSDSPEPQAGLEFDGLAKLIN---------QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFV 229 (468) Q Consensus 159 ~Gd~~l~~~~~~~~gleFDGl~~li~---------~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~ 229 (468) |||++.+ +-+||||.+..+ .+|+||+.|..-..-.|+--. .++ ..+ ..+-|-+-|+-|+ T Consensus 131 yGDsa~~-------p~~FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~--wg~--~~~-~giyPkG~kaGl~ 198 (335) T protein:vir:73 131 YGNTDAE-------PEAFMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMS--WGE--NTA-HMIYPEGMVAGFQ 198 (335) T ss_pred cCCcCCC-------hhhccchhhhhcCccccccCcccceeeccccccCceEEEEEE--EcC--Cee-EEEcccCccccce Confidence 9998874 349999999652 469999988764332222100 000 000 0011222222111 Q ss_pred HhhcCCceEEeecCC-----------------------------C------cceeee-------------eccceee--- Q lcl|NC_020871. 230 NQQLSKQTQLVRDNG-----------------------------N------NVSVGF-------------NIQGFHS--- 258 (468) Q Consensus 230 ~~~~~~qr~v~~~n~-----------------------------~------~~~~G~-------------~v~~~~s--- 258 (468) -.-+..|.......+ . ..+.|. .||..-. T Consensus 199 ~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~ 278 (335) T protein:vir:73 199 HEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKE 278 (335) T ss_pred eeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCce Confidence 111111110000000 0 000000 0111000 Q ss_pred ---c----CCcccc----CCCEeecccccccc---cccccC------CCCCCcceeE Q lcl|NC_020871. 259 ---A----RGFIKL----HGSTVMENEQILDE---RILALP------TAPQQAKVTA 295 (468) Q Consensus 259 ---~----~g~i~l----~gs~i~~~~n~l~~---~~~~~p------~ap~~~~vta 295 (468) . +...++ ..++-+..++.-.. +-..+| -.-+.+.|+| T Consensus 279 ~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 279 VIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred EEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCeEEEEEeeeecCcccccC Confidence 0 000000 00000000000000 000111 0112222333 No 63 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=96.77 E-value=0.00035 Score=39.51 Aligned_cols=297 Identities=12% Similarity=0.024 Sum_probs=130.2 Q ss_pred CCCcccchhhcccCh--hhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNL--NSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~--~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ 78 (468) ++.+........... .+-...+.|++...-.....+-.+|+.|-++.+.++|..+..... .+.+.+...++.+... T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~~~~~~~~~ 151 (395) T protein:vir:38 74 EPVNKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFT--SLESLANVENVTTSHG 151 (395) T ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhc--chhhhcceeeccCCcc Confidence 333322222222222 233345556554322222233345888999999888855444333 4555555555554444 Q ss_pred ccc--eeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 79 KYD--VYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEW 155 (468) Q Consensus 79 ey~--~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~ 155 (468) .|. +..+++ +...+++|++..+ ..++.+.+.....+-++.-..+|.-+= .++..|.+....+.-...+...++. T Consensus 152 ~~~~~~~~~~~--~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~ 228 (395) T protein:vir:38 152 SRVYEKLADIT--PLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLL-KDTVDNIIQWLVNWAAKKDVVTRNA 228 (395) T ss_pred eEEEEeeccCC--ccccccccccccccccccceeeEEeeeeeeEeehhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHHH Confidence 432 222322 2355899998865 567999999999999988888777432 2355677788888888999999999 Q ss_pred HHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhc-C Q lcl|NC_020871. 156 ASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQL-S 234 (468) Q Consensus 156 a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~-~ 234 (468) ++++|+..-.. .....-+|.|.++++ ..+...|....-.+|++.+.+.+...-- . T Consensus 229 ~il~g~g~~~~---~~~~~~~~~i~~~~~---------------------~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~ 284 (395) T protein:vir:38 229 KILEVMGKAPK---KPTISQFDNIKDLEN---------------------NTLDPAIESTSSFITNQSGYNILSKVKDAD 284 (395) T ss_pred HHhhccccccc---ccccccHHHHHHHHH---------------------HhhhhhhcCCCEEEEcHHHHHHHHHhhccC Confidence 99999865321 112233344433321 1122223333346788888777743321 2 Q ss_pred CceEEeecCCC----cceeeeeccceeecCCccc-cCCC--Eeecccc--cccccccc--cCCCCCCc--------ceeE Q lcl|NC_020871. 235 KQTQLVRDNGN----NVSVGFNIQGFHSARGFIK-LHGS--TVMENEQ--ILDERILA--LPTAPQQA--------KVTA 295 (468) Q Consensus 235 ~qr~v~~~n~~----~~~~G~~v~~~~s~~g~i~-l~gs--~i~~~~n--~l~~~~~~--~p~ap~~~--------~vta 295 (468) ++.. .+++.. ..-.|.+|- ++...... ..+. .++.+.. .+...+.. .-...... ..-+ T Consensus 285 G~~l-~~~~~~~~~~~~l~G~pV~--~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~ 361 (395) T protein:vir:38 285 GRYL-MQPDVTSPDKYLIDGKPVI--RIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRF 361 (395) T ss_pred Ccee-eccCcCCCCcceeccceeE--EecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEE Confidence 3333 222211 122444441 11110000 0011 1122111 00000000 00000000 0000 Q ss_pred EecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCc Q lcl|NC_020871. 296 TQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDD 341 (468) Q Consensus 296 t~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~ 341 (468) ...-+.. ...+. +--..++++++ ..+++. ..+. . T Consensus 362 ~~r~d~~-~~~~~--a~~~~~~~~~~---~~~~~~----~~~~--~ 395 (395) T protein:vir:38 362 IDRFDVQ-LIDDG--AFAAASFKTVA---NQAQGT----AGTG--K 395 (395) T ss_pred EEeeccE-Eeccc--ceEEEEeeccc---CCCCCc----cCCC--C Confidence 0000000 00011 01111111111 111111 0111 1 No 64 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=96.76 E-value=0.00036 Score=39.49 Aligned_cols=305 Identities=13% Similarity=0.118 Sum_probs=134.6 Q ss_pred hhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccC Q lcl|NC_020871. 26 FTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSD 105 (468) Q Consensus 26 f~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d 105 (468) |.. .+-+.|+.|-++.+..+|...... ...+.+.....+..+.-.+|.++. +.+...+++|++..+.++ T Consensus 1 Ma~------~~~~~gg~~vP~~~~~~ii~~l~~--~s~i~~l~~~i~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~ 69 (315) T protein:vir:80 1 MAD------DFLSAGKLELPGSMIGAVRDRAID--SGVLAKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSAS 69 (315) T ss_pred CCC------CcCCcCceEcchHHHHHHHHHHHh--hchhhhhcceeecCCCceEEEEEe---CCcceEEeeCCccccccc Confidence 322 223346666677776666443322 223433333334443333455554 444567999999999999 Q ss_pred cceEEEEEEEEeeeehhhhhhhHhhhcc---hhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhh Q lcl|NC_020871. 106 PNIRQKTVNMKFASDTKNISIAAGLVNN---IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKL 182 (468) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lv~~---~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~l 182 (468) +.+.......|=|+.--.+|..+-..+. +...+....++-...+++.+|.++|+|+...+ |-...|+... T Consensus 70 ~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~-------~~~~~~~~~~ 142 (315) T protein:vir:80 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT-------GKAASAVHTS 142 (315) T ss_pred cceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCC-------Cccccccccc Confidence 9999999999988888788776443322 33356777888888999999999999985422 2234666665 Q ss_pred cCc-cceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCC Q lcl|NC_020871. 183 INQ-DNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARG 261 (468) Q Consensus 183 i~~-~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g 261 (468) +.. .+.+++-+.. ..++++.-+.+....+...+-..|++.+.+.+... .+.+- ....|..+..-....+ T Consensus 143 ~~~~~~~~~~~~~~-~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l-~~~~g--------~~~~g~~~~~~~~~g~ 212 (315) T protein:vir:80 143 LNKTKNIVDATDSA-TADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTE-VYPKG--------SPLAGQPMYPAAGFAG 212 (315) T ss_pred cccccceeeccccc-hHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHH-hhccC--------CcccccccccccccCC Confidence 543 4677776654 34555544444455555556689999999998433 22211 1111111110001111 Q ss_pred ccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEE-----EEcccCCcccccceeeee Q lcl|NC_020871. 262 FIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVV-----VSSDDAESIASEVATATV 336 (468) Q Consensus 262 ~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVt-----avn~~GES~aS~~vt~Tv 336 (468) .-.|.|.-++..++. +..+.....+.+.+ . -|.|. .|.|.+. -+++++....+ .. T Consensus 213 ~~tl~G~PV~~~~~~-----~~~~~~~~~~~~~~--~---~GDfs-----~~~~g~~~~~~i~i~~~~~~~~~-----~~ 272 (315) T protein:vir:80 213 LDNWRGLNVGASSTV-----SGAPEMSPASGVKA--I---VGDFS-----RVHWGFQRNFPIELIEYGDPDQT-----GR 272 (315) T ss_pred CceecceeeEecCcC-----CcccccccccccEE--E---Eeecc-----cEEEEEecCeeEEEeccccccCc-----cc Confidence 112233322221111 11100000000000 0 01111 1111110 01111110000 00 Q ss_pred ec-cCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEecCC Q lcl|NC_020871. 337 TA-KDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLN 394 (468) Q Consensus 337 ~a-~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N 394 (468) .. ..+.+.+..+.- +.++ |-| ..-|-.+...+. ++-+..-.| T Consensus 273 ~~~~~~~v~~r~~~r-~~~~------v~~----~~a~~~l~~~~a-----~~~~~~~~~ 315 (315) T protein:vir:80 273 DLKGHNEVMVRAEAV-LYVA------IES----LDSFAVVKEKAA-----PKPNPPAEN 315 (315) T ss_pred chhhcCcEEEEEEEE-ecce------eec----ccceEEEeeccC-----CCCCCCCCC Confidence 00 000011110000 1111 000 001111111000 111111111 No 65 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=96.74 E-value=0.00037 Score=39.39 Aligned_cols=308 Identities=10% Similarity=0.067 Sum_probs=132.1 Q ss_pred CCCcccchh-hcccChhh-HHHHHHHHhhccc-----ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch Q lcl|NC_020871. 1 MPKNNKEEE-VKEVNLNS-VQEDALKSFTTGY-----GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA 73 (468) Q Consensus 1 ~~~~~~~~~-~~~~n~~~-~~e~~~Ksf~agy-----~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~ 73 (468) +++.++.+. ....+..+ ...++.+.+..+. ..+-.+.++|+.|.++.+..+|..+..... .+++.+...++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~--~l~~~~~~~~~ 148 (397) T protein:vir:48 71 MSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYD--SLQEYVNVENV 148 (397) T ss_pred hhhhccccccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHH--HHHhhhceeec Confidence 111111111 11111111 1112222222221 111223456888999999888855544433 56666666666 Q ss_pred hhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020871. 74 TSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKT 152 (468) Q Consensus 74 ~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~ 152 (468) .+...++.....-...+...+++|++..+ ..++.+.+.+..++-++.-..+|.-+ +.++..|.+....+.--..++.. T Consensus 149 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~v~~~l~~~~~~~ 227 (397) T protein:vir:48 149 TTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSL-LADSAENILAWLSGWIAKKVVVT 227 (397) T ss_pred cCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHH-HhhchHHHHHHHHHHHHHHHHHH Confidence 65444443332223334466899998865 55799999999999999888888754 33466778888888888999999 Q ss_pred HHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 153 IEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 153 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) ++.+++.|+..-+.. .. ++ +-+.|..+-..+...|......+|++.+.+.|...= T Consensus 228 ~d~~il~G~g~~~~~---~~---------~~-------------~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lk 282 (397) T protein:vir:48 228 RNKAILEAIATLPTK---PT---------LT-------------KWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVK 282 (397) T ss_pred HHHHHhhcccccccc---cc---------cc-------------cHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhh Confidence 999999998664311 11 11 112222222233445666678899999999885422 Q ss_pred -cCCceEEeecCCCc----ceeeeeccc----eeecCCccccCCCEeecccc--cccccccccCCCCCCcceeEEecCCC Q lcl|NC_020871. 233 -LSKQTQLVRDNGNN----VSVGFNIQG----FHSARGFIKLHGSTVMENEQ--ILDERILALPTAPQQAKVTATQEAGK 301 (468) Q Consensus 233 -~~~qr~v~~~n~~~----~~~G~~v~~----~~s~~g~i~l~gs~i~~~~n--~l~~~~~~~p~ap~~~~vtat~~~~~ 301 (468) -.+++ +.+++... .-.|.+|-- ++...+.- ....++.+.. .+...+. .+.-...... T Consensus 283 d~~G~~-i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~--~~~~~~gd~~~~~~~~~~~---------~~~i~~~~~~ 350 (397) T protein:vir:48 283 NAFGDY-LMERDVKSPTGYSIDGFAVKEVADRWLANASSG--AMPLYFGDLKQAVTLFDRQ---------QMSLLSTNIG 350 (397) T ss_pred cCCCce-eeccCcCCCCCceeccceeEEecccccCCcCCC--ceEEEEEeccceEEEEeec---------ceEEEEeccc Confidence 12233 33333211 113333310 00000000 0001111100 0000000 0000000000 Q ss_pred CCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCce Q lcl|NC_020871. 302 KGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGL 371 (468) Q Consensus 302 ~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~ 371 (468) ...|.. +...|++..--+-.---|...+. |+..++...+++. .-.++ T Consensus 351 ~~~~~~---~~~~~r~~~r~d~~~~~~~a~~~-------------~~~~~~~~~~~~~-------~~~~~ 397 (397) T protein:vir:48 351 GGAFET---DTTKIRVIDRFDVVATDTESFVP-------------ASFKAIADQKGNL-------GSTAV 397 (397) T ss_pred hhhhhc---CceeEEEEeeeccEEecccceEE-------------EEecccccCCCCc-------cccCC Confidence 000000 01111111110000000111111 1111221111100 00001 No 66 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=96.73 E-value=0.00038 Score=39.33 Aligned_cols=306 Identities=11% Similarity=0.063 Sum_probs=132.3 Q ss_pred CCCcccchhhcc-cChhhHH-HHHHHHhhccc-----ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch Q lcl|NC_020871. 1 MPKNNKEEEVKE-VNLNSVQ-EDALKSFTTGY-----GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA 73 (468) Q Consensus 1 ~~~~~~~~~~~~-~n~~~~~-e~~~Ksf~agy-----~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~ 73 (468) ++...+.+.... ....+.+ ..|.+.+..+- .....+.++|+.|.++.+...|..+..... .+++.+...++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~--~l~~~~~~~~~ 148 (397) T protein:vir:49 71 MSEEEKKPLTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFD--SLQEYVNVENV 148 (397) T ss_pred ccccccccccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhh--hHhhhcceeec Confidence 222111111111 0111111 12333222221 122234556788889998888855544433 45555555555 Q ss_pred hhhhhc--cceeeeeccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHH Q lcl|NC_020871. 74 TSTVAK--YDVYMQHGKVGHTRFTREIGVA-PVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIA 150 (468) Q Consensus 74 ~stv~e--y~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~ 150 (468) ..---. |.+.. +..+.+.+++|++.. +...+.+...+..++-++....+|.-+- .++..|.+....+.....++ T Consensus 149 ~~~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~ 225 (397) T protein:vir:49 149 TTLTGSRVYEKWA--DITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVV 225 (397) T ss_pred cCCcceEEEEeec--cCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHH-hhhhHHHHHHHHHHHHHHHH Confidence 433222 22222 222346689999975 4566899999999999998888876432 34567888889999999999 Q ss_pred HHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHH Q lcl|NC_020871. 151 KTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVN 230 (468) Q Consensus 151 ~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~ 230 (468) +.++.++++|+..-.. ....+ +-+.|.++...+...|......+|++.+.+.+.. T Consensus 226 ~~~d~ail~G~g~~~~---~~~~~----------------------~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~ 280 (397) T protein:vir:49 226 VTRNKAILEAIGTLPN---KPTLA----------------------KWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKK 280 (397) T ss_pred HHHHHHHHhccccccc---ccccc----------------------CHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH Confidence 9999999999865321 11112 2223333333344566677789999999988843 Q ss_pred hhcCC-ceEEeecCCCc----ceeeeeccceeecCCccc-c---CCCEeecccc--cccccccccCCCCCCcceeEEecC Q lcl|NC_020871. 231 QQLSK-QTQLVRDNGNN----VSVGFNIQGFHSARGFIK-L---HGSTVMENEQ--ILDERILALPTAPQQAKVTATQEA 299 (468) Q Consensus 231 ~~~~~-qr~v~~~n~~~----~~~G~~v~~~~s~~g~i~-l---~gs~i~~~~n--~l~~~~~~~p~ap~~~~vtat~~~ 299 (468) .-+. -|.+.+++... .-.|.+|-- +...... . ....++.+.. .+...+.. ++-.... T Consensus 281 -lkd~~g~~l~~~~~~~g~~~~l~G~pV~~--~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~---------~~i~~~~ 348 (397) T protein:vir:49 281 -VKNAMGDYLMERDVKSPTGYSIDGFVVKE--ISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQH---------LSLLSTN 348 (397) T ss_pred -hhccCCceeecccccCCCCceecceeeEE--ecccccccccCCceeEEEeeccceEEEEeecc---------cEEEEec Confidence 3322 23333332211 112222210 0000000 0 0001111100 00000000 0000000 Q ss_pred CCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCc Q lcl|NC_020871. 300 GKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSS 355 (468) Q Consensus 300 ~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga 355 (468) .....|. .+...|++..--+.+---+...+-+++++..+.... +...|| T Consensus 349 ~~~~~~~---~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~----~~~~~~ 397 (397) T protein:vir:49 349 IGGGAFE---TDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAK----LSTAGA 397 (397) T ss_pred cccchhh---cCeeeEEEEEeeccEEecccceEEEEecccccccCc----ccccCC Confidence 0000011 011122222111111001111222222111110000 011122 No 67 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=96.68 E-value=0.00041 Score=39.14 Aligned_cols=297 Identities=11% Similarity=0.007 Sum_probs=143.9 Q ss_pred hhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeee-----cccc Q lcl|NC_020871. 16 NSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQH-----GKVG 90 (468) Q Consensus 16 ~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~h-----G~~g 90 (468) -|.-.+ +.+..+|........+.+++|-++.+..+|..+..... .+.+...+.+..+--.+|.+.... .+.| T Consensus 1 ~a~l~e-l~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s--~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg 77 (333) T protein:vir:78 1 MATLNE-LLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESS--LVLRMGEQIPISYGETIIPTTVKRPEVGQVGVG 77 (333) T ss_pred CchhHH-hhhhcccccccCceecCCccccchhHHHHHHHHHHhhc--hhhhhcceeeccCCceEEEEEeCCceeEeecCc Confidence 111111 12233444444444555667888888777755444333 455555566666544556666533 2334 Q ss_pred ccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCC Q lcl|NC_020871. 91 HTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLV-NNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPE 169 (468) Q Consensus 91 ~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~ 169 (468) ...++.|++..+..++.+.+.....+=++.--.+|. ++. ++..|.+....+.-...+++.+|.++|+|+-.. T Consensus 78 ~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~--ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~----- 150 (333) T protein:vir:78 78 TSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSE--EFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPL----- 150 (333) T ss_pred ccccccccccccccccceeEEEEeeEEEEEeehhhH--HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC----- Confidence 566778888899999999999999999998888887 444 466788888889999999999999999999653 Q ss_pred CCCCccccchhhhcC--ccceee--ccCCCCCH-HHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhc--CCc-eEEee Q lcl|NC_020871. 170 PQAGLEFDGLAKLIN--QDNVHD--ARGASLTE-SLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQL--SKQ-TQLVR 241 (468) Q Consensus 170 ~~~gleFDGl~~li~--~~nviD--arG~~ls~-~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~--~~q-r~v~~ 241 (468) .+..+.|+.+... .-..++ ..+..+.. ++++.-..+...++..++-..|++...+.|.+... +.+ +.+.+ T Consensus 151 --~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~ 228 (333) T protein:vir:78 151 --TGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPS 228 (333) T ss_pred --CCcccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeec Confidence 2345677765221 111112 22223344 44444344455567778889999999888855442 221 33333 Q ss_pred cCCC----cceeeeeccceeecCCccc----c--CCCEeeccccc---ccccccccCCCCCC--cceeEEec-CCCCCC- Q lcl|NC_020871. 242 DNGN----NVSVGFNIQGFHSARGFIK----L--HGSTVMENEQI---LDERILALPTAPQQ--AKVTATQE-AGKKGQ- 304 (468) Q Consensus 242 ~n~~----~~~~G~~v~~~~s~~g~i~----l--~gs~i~~~~n~---l~~~~~~~p~ap~~--~~vtat~~-~~~~g~- 304 (468) +... ..-.|.+| +++..=..+ . ....++.+..- .++.....-..... .....+.. ....+. T Consensus 229 ~~~~~~~~~~l~G~Pv--~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v 306 (333) T protein:vir:78 229 RINLAAQTGDVLGLPA--QFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQI 306 (333) T ss_pred CccccCCCceeeceee--EEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcE Confidence 3221 11245544 111100000 0 00112221110 00000000000000 00000000 000000 Q ss_pred -cCc---ccceeEEE-EEEEEcccCCccc Q lcl|NC_020871. 305 -FRA---EDLAAHEY-KVVVSSDDAESIA 328 (468) Q Consensus 305 -~~~---~~~~~y~Y-kVtavn~~GES~a 328 (468) +.. .|.....- .++.+ -...+| T Consensus 307 ~~r~~~r~d~~v~~~~a~~~l--~~~~a~ 333 (333) T protein:vir:78 307 AILIEVTFGWLLGDKQAFVKF--VDDEQP 333 (333) T ss_pred EEEEEEEEccEEecccceEEE--eccCCC Confidence 000 01111110 01111 111112 No 68 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=96.46 E-value=0.00061 Score=38.23 Aligned_cols=283 Identities=12% Similarity=0.042 Sum_probs=137.4 Q ss_pred cccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEE Q lcl|NC_020871. 36 TQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNM 115 (468) Q Consensus 36 ~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 115 (468) =.+.|+.|-++.+..+|..+... ...+.+...+.+..+.-.+|.++.. .+...+++|++..+.+|+.+.+..... T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~--~s~i~~l~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAG--KSSIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CcccCcceechhHHHHHHHHHHh--hhhhhhhcceeeccCCceEEEEEec---CcceEEecCCccccccccceeEEEEee Confidence 12335566666666666544433 3345555566666655556776663 345669999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-cce--ee Q lcl|NC_020871. 116 KFASDTKNISIAAGLVN--NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-DNV--HD 190 (468) Q Consensus 116 k~l~~~~~vs~~~~lv~--~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-~nv--iD 190 (468) +=++..-.+|.-+-..+ ...|.+....+.-...+++.+|.++|+|...- .+.+..+-|+...... .+. .+ T Consensus 76 ~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~-----~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 76 IKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPR-----LGTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCC-----CCcccccccccccccccccccccc Confidence 99998888888764433 35677777888888999999999999996332 2223333343333321 121 22 Q ss_pred ccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cCCceEEeecCCCcceeeeeccceeecCCccccCCCE Q lcl|NC_020871. 191 ARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGST 269 (468) Q Consensus 191 arG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~ 269 (468) ..+..+ .+.|..+...+..++..+.-..|++.+.+.+...- -.+++. .++....... -.|.|-- T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i-~~~~~~~~~~-------------~~l~G~P 215 (298) T protein:vir:16 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNAL-FPELKWGATP-------------DTINGLP 215 (298) T ss_pred cccccH-HHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCee-ecCcccCCCC-------------ceeccee Confidence 222222 23444555555667778888999999999884422 122222 2222111100 0112222 Q ss_pred eecccccccccccccCCCCCCcceeEEecCCCCCCcCcc----cceeEEEEEEEEcccCCcccccceeeeeecc-CcceE Q lcl|NC_020871. 270 VMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAE----DLAAHEYKVVVSSDDAESIASEVATATVTAK-DDGVK 344 (468) Q Consensus 270 i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~----~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~-~~g~~ 344 (468) +..+++ +|.....+... .. -|.|... ....+++++ ++++.+..+ +++.. .+-+. T Consensus 216 V~~~~~--------v~~~~~~~~~~--~~---~GDfs~~~~~~~~~~~~~~~---~~~~~~~~~-----~~~~f~~~~v~ 274 (298) T protein:vir:16 216 VDVNKT--------VSDMSLTQRDR--AI---IGDFANGFKWGYAKEVPLEV---IQYGDPDNS-----GLDLKGYNQVY 274 (298) T ss_pred eEEecc--------cccccCCCccE--EE---EeeccceEEEEEecCceEEE---eeccCCcCc-----chhhhhcCcEE Confidence 221111 11110111100 00 0112110 011122222 222221111 00000 01111 Q ss_pred EE----EEeecCCCcccceEEEEeecC Q lcl|NC_020871. 345 LE----IELAPMYSSRPQFVSIYRKGA 367 (468) Q Consensus 345 lt----IT~~~~~ga~~~~y~IYR~~~ 367 (468) +. +.|.... |+.+...+.+. T Consensus 275 ~ra~~r~d~~v~~---~~a~~~l~~at 298 (298) T protein:vir:16 275 IRAELFLGWGILD---ATKFARVTEAN 298 (298) T ss_pred EEEEEEEccEeec---ccceEEEeecC Confidence 11 1111111 12233333322 No 69 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=96.45 E-value=0.00035 Score=39.56 Aligned_cols=314 Identities=17% Similarity=0.140 Sum_probs=136.5 Q ss_pred CCCcccc--hhhcccChhhHHHHHHHHh------------------------------hccccc----------Cccccc Q lcl|NC_020871. 1 MPKNNKE--EEVKEVNLNSVQEDALKSF------------------------------TTGYGI----------TPDTQT 38 (468) Q Consensus 1 ~~~~~~~--~~~~~~n~~~~~e~~~Ksf------------------------------~agy~~----------~p~~~~ 38 (468) |.--.-. .-.||+.- =+||| +++|.- +..... T Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~PN~~~pll~li~~g~~~ta~ast~~w~~d~~~~~~~~~ta~a~a 74 (418) T protein:vir:10 1 MSVYAGIFNTTLNPQEL------NMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEAAA 74 (418) T ss_pred CceeccccccCCChhhh------chhhhhhhhhhhcCCcchhhhhhhhcccccccceeEEEEEEEEEeeeeEEEEEEEec Confidence 2110000 11222211 23444 222210 000001 Q ss_pred CccccchhhhhhHhh-hhhhccccccch--hhhcccchhhhhhccceeeeeccccccc------------cccccccccc Q lcl|NC_020871. 39 DAGALRREFLDDQIS-MLTWTENDLTFY--KDIAKKPATSTVAKYDVYMQHGKVGHTR------------FTREIGVAPV 103 (468) Q Consensus 39 ~gaALr~esld~~i~-~L~~~~~~f~~~--~~i~k~~~~stv~ey~~~~~hG~~g~~~------------fv~E~g~~~~ 103 (468) ++.-|..|+=+---+ +|.|.+..+..+ ..|+ =..-.+....|++-+.+ -..||.+... T Consensus 75 ~~T~l~ve~~~~f~~~~l~~~~~~~Evirv~sVn-------g~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~t 147 (418) T protein:vir:10 75 DATVLTVENSDGLTKGMIFYNEATGENMRLELVN-------GLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPT 147 (418) T ss_pred CceEEEEcCcceeccccEEEEccCCeEEEEEEEe-------CCEEEEEEecCCeeEEEEecCceEEEeccccccccccCC Confidence 111111111111000 011111111000 0110 01112222222222221 2367666554 Q ss_pred cCcceEEEEEEE-Ee---eeehhhhhhhHhhh---cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCC--c Q lcl|NC_020871. 104 SDPNIRQKTVNM-KF---ASDTKNISIAAGLV---NNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAG--L 174 (468) Q Consensus 104 ~d~~~~r~~~~~-k~---l~~~~~vs~~~~lv---~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~g--l 174 (468) .. .+.+ +.+ +| +.+..++|.-++.+ -+++|+...+ .+.+.-.+.+||+++|+|-... +.+..| - T Consensus 148 a~--~~k~-~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese-~drk~~~av~iEkalI~G~~~~---~~~~~g~~R 220 (418) T protein:vir:10 148 AR--SIQP-VYVPNFTQIFRNAWALTDTARASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFM---GTYNGQPLH 220 (418) T ss_pred cc--eecc-eeccchhhhhhhhhhhhhhhhhccccccCchHHHHH-HHHHHHHHHHHHHHHhcccccC---CCcCCcchh Confidence 43 2222 222 33 34455566655542 2678887666 4555555669999999997553 333345 3 Q ss_pred cccchhhhcC---ccceeeccCC-CCCHHHHhhhhhhhhh---ccCceEE-----EecCHHHHhhHHHhhcCCceEEeec Q lcl|NC_020871. 175 EFDGLAKLIN---QDNVHDARGA-SLTESLLNQAAVMISK---GYGTPTD-----AYMPVGVQADFVNQQLSKQTQLVRD 242 (468) Q Consensus 175 eFDGl~~li~---~~nviDarG~-~ls~~~l~~~a~~i~~---~fG~~td-----~~m~~~v~a~~~~~~~~~qr~v~~~ 242 (468) .++||..++. ++||+|+.+. .++.+.|.++...+.+ +-|..++ +++|...|.++... +. |-. - T Consensus 221 ~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~-~~-~I~---~ 295 (418) T protein:vir:10 221 TTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRF-FG-EVT---V 295 (418) T ss_pred hHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhh-hh-hee---e Confidence 7899998886 6899999999 5999999988766543 4577766 55699999999544 44 422 2 Q ss_pred CCCcceeeeeccceeecCCccccCCCEe-----ecccccccccccccCC--------CCCCcceeE------EecCCC-C Q lcl|NC_020871. 243 NGNNVSVGFNIQGFHSARGFIKLHGSTV-----MENEQILDERILALPT--------APQQAKVTA------TQEAGK-K 302 (468) Q Consensus 243 n~~~~~~G~~v~~~~s~~g~i~l~gs~i-----~~~~n~l~~~~~~~p~--------ap~~~~vta------t~~~~~-~ 302 (468) ..+....|..+.++...+|.|.|+.+-+ |..+..|.-+...+-. ++..-.-++ ++..+. - T Consensus 296 ~~~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~ 375 (418) T protein:vir:10 296 TQRETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGH 375 (418) T ss_pred cccceeeeEEEEEEEcceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCccccccccccccc Confidence 3455579999999999999997766522 2222222222211111 111100000 000000 0 Q ss_pred CCcCcccceeEEEEEEEEcccCCcc------cccceeeeeecc Q lcl|NC_020871. 303 GQFRAEDLAAHEYKVVVSSDDAESI------ASEVATATVTAK 339 (468) Q Consensus 303 g~~~~~~~~~y~YkVtavn~~GES~------aS~~vt~Tv~a~ 339 (468) |-+.-.+--.=.|.+-..|..+-.. +.+.+..|.++- T Consensus 376 ~~D~~kG~iv~E~tLe~~N~~a~avitgl~~~~~~~~~t~p~~ 418 (418) T protein:vir:10 376 GVDAQGGSLTSEWALELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred ccccccceEEEEeeeeeecccceEEeeccceecccccCCCCCC Confidence 0000000011222333334332111 111122222211 No 70 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=96.45 E-value=0.00061 Score=38.19 Aligned_cols=284 Identities=11% Similarity=0.068 Sum_probs=135.7 Q ss_pred cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEE Q lcl|NC_020871. 32 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQK 111 (468) Q Consensus 32 ~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~ 111 (468) +- ...++|+.|-++.+..+|..+.... ..+.+...+.+..+--.+|.++.+ .....+++|++..+.+++++... T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~--s~l~~~~~~i~~~~~~~~~p~~~~---~~~a~wv~Eg~~~~~~~~~f~~v 74 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQG--STVAVLSARKPQRFGNEDIITFNG---RPKAEFVGEGQQKSSTTGEFDFV 74 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhh--chhhhhcceeeccCCceEEEEEeC---CceeEEeecCcccccccceeeEE Confidence 21 1124466677777777775544333 345555555556554446666653 33567999999999999999999 Q ss_pred EEEEEeeeehhhhhhhHhhh--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-cce Q lcl|NC_020871. 112 TVNMKFASDTKNISIAAGLV--NNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-DNV 188 (468) Q Consensus 112 ~~~~k~l~~~~~vs~~~~lv--~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-~nv 188 (468) ....|=++.--.+|.-+-.. +...|.+....+.-...+++.+|.++|+|+..- .|..+-|+...+.. .+. T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~-------~g~~~~g~~~~~~~~~~~ 147 (311) T protein:vir:99 75 TSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPL-------TGTVIPGWSNYLGAASKR 147 (311) T ss_pred EEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc-------cCccccccccccccccce Confidence 99999888888888776333 346778899999999999999999999998532 23334556665543 355 Q ss_pred eeccCCCCC--HHHHhhhhhhhhhc--cCceEEEecCHHHHhhHHHhhcCCc-eEEeecCCCcc----eeeeeccceeec Q lcl|NC_020871. 189 HDARGASLT--ESLLNQAAVMISKG--YGTPTDAYMPVGVQADFVNQQLSKQ-TQLVRDNGNNV----SVGFNIQGFHSA 259 (468) Q Consensus 189 iDarG~~ls--~~~l~~~a~~i~~~--fG~~td~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~~----~~G~~v~~~~s~ 259 (468) +...+.... ...|..+...+..+ ...++-+.||+.+.+.|.. .-+.+ |-+.++..... -.|.++ T Consensus 148 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~~l~G~Pv------ 220 (311) T protein:vir:99 148 VELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLST-ARYTDGRKKFPELGLGIGVSSFEGIDA------ 220 (311) T ss_pred eeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHh-hhccCCCeeecCcccCCCCceecceee------ Confidence 555444433 23444333322222 3455668999999999943 43332 22333322111 122222 Q ss_pred CCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCC-----CCcCcccceeEEEEEE-----EEcccCCcccc Q lcl|NC_020871. 260 RGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKK-----GQFRAEDLAAHEYKVV-----VSSDDAESIAS 329 (468) Q Consensus 260 ~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~-----g~~~~~~~~~y~YkVt-----avn~~GES~aS 329 (468) ...++. +..+...... .......+ |.|.. .+.|.+. -++++++. T Consensus 221 -----------~~s~~i-----~~~~~~~~~~---~~~~~~~~~~~~~Gdf~~----~~~~~~~~~~~~~~~~~~~~--- 274 (311) T protein:vir:99 221 -----------SVSDTV-----NGGDEADPDD---EDLDAARAVRGIVGDFAN----GIHWGVQRDIPVELIKYGDP--- 274 (311) T ss_pred -----------Eeeccc-----cccccccccc---chhhccCcceEEEeeccc----cEEEEEecCceEEEeecCCC--- Confidence 111110 0000000000 00000000 00100 0111110 01111110 Q ss_pred cceeeeeec-cCcceEE----EEEeecCCCcccceEEEEeecC Q lcl|NC_020871. 330 EVATATVTA-KDDGVKL----EIELAPMYSSRPQFVSIYRKGA 367 (468) Q Consensus 330 ~~vt~Tv~a-~~~g~~l----tIT~~~~~ga~~~~y~IYR~~~ 367 (468) ..++.. ..+-..+ .+.|.-. - |.++++=..++ T Consensus 275 ---~~~~~~~~~d~~~~r~~~r~d~~v~-~--~~~v~~~~~~A 311 (311) T protein:vir:99 275 ---DGQGDLKRHNQIALRLEIVYGWYVF-T--DRFVVIENAVA 311 (311) T ss_pred ---CcchhhhhcCcEEEEEEEeecceec-C--hhHeeeecccC Confidence 000000 0000111 1111100 0 11122111112 No 71 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=96.43 E-value=0.00011 Score=42.21 Aligned_cols=295 Identities=14% Similarity=0.060 Sum_probs=137.8 Q ss_pred CCCcccchhhcccChh-hHHHHHHHHhhcccc---------cCcccccCccccchhhhhhHhhhhhhccccccchhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLN-SVQEDALKSFTTGYG---------ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAK 70 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~-~~~e~~~Ksf~agy~---------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k 70 (468) .++.+........+.. +.-++|.+.+..+-. ....+.++|+.|-++.+...|..+..... .+...+.. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~--~l~~~~~~ 152 (404) T protein:vir:39 75 REEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYD--SLQQYVRV 152 (404) T ss_pred ccccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhh--hHHhhcce Confidence 1111111111111111 111223222222211 22344566788889999888855544433 56666666 Q ss_pred cchhhhhhccceeeeeccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHH Q lcl|NC_020871. 71 KPATSTVAKYDVYMQHGKVGHTRFTREIGVA-PVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNI 149 (468) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~ 149 (468) .+..+-...|.....-+..+...+++|++.. +.+++.+.+....++-++.-..+|.-+= .++..|.+....+.-...+ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~ 231 (404) T protein:vir:39 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENILAWLSSWIAKKV 231 (404) T ss_pred eeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHH-hhchHHHHHHHHHHHHHHH Confidence 6665544444433322333456789999875 5789999999999999998888887432 3456778888888999999 Q ss_pred HHHHHHHHhhcccccccCCCCCCCccccchhhhcCc--cceeeccCCC-CCHHHHhhhhhhhhhccCceEEEecCHHHHh Q lcl|NC_020871. 150 AKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDARGAS-LTESLLNQAAVMISKGYGTPTDAYMPVGVQA 226 (468) Q Consensus 150 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~--~nviDarG~~-ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a 226 (468) ...++.++++|+..-. .....+-+|+|..++.. +..+...+.. ++...++....+. ..-|. .+|.|.. .. T Consensus 232 ~~~~d~~il~g~g~~~---~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lk-d~~G~--~l~~~~~-~~ 304 (404) T protein:vir:39 232 VVTRNQAIIAAMGTVP---KKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVK-TAEGK--YLLEPDP-TK 304 (404) T ss_pred HHHHHHHHHhcccccc---cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccCCc--eeeccCc-CC Confidence 9999999999997743 34456778998887642 2222222222 4455544444321 12232 3444321 22 Q ss_pred hHHHhhcCCceEEeecCCCc---------ceeeeecccee-ecCCcccc-----------CCCEeecccccccccccccC Q lcl|NC_020871. 227 DFVNQQLSKQTQLVRDNGNN---------VSVGFNIQGFH-SARGFIKL-----------HGSTVMENEQILDERILALP 285 (468) Q Consensus 227 ~~~~~~~~~qr~v~~~n~~~---------~~~G~~v~~~~-s~~g~i~l-----------~gs~i~~~~n~l~~~~~~~p 285 (468) .-... +-+-.++...+..- .-.|.-=..++ ..++.+.+ ++.+.++-...++-. ...| T Consensus 305 ~~~~~-l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~-~~~~ 382 (404) T protein:vir:39 305 PNSYL-IKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK-TTDS 382 (404) T ss_pred CCcce-ecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccE-Eecc Confidence 22122 23333333221110 11111000111 11121111 111112211111111 1111 Q ss_pred CCCCCcceeEEecCCCCCCcCccc Q lcl|NC_020871. 286 TAPQQAKVTATQEAGKKGQFRAED 309 (468) Q Consensus 286 ~ap~~~~vtat~~~~~~g~~~~~~ 309 (468) .+=. .++.++.++..|+...+. T Consensus 383 ~a~~--~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 383 EALV--AGSFTAIADQVGNFTAGK 404 (404) T ss_pred cceE--EEEeeccccCCCCCCCCC Confidence 1111 122222222222222111 No 72 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=96.38 E-value=0.00055 Score=38.45 Aligned_cols=280 Identities=11% Similarity=0.052 Sum_probs=125.2 Q ss_pred HHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeec-ccccccccccccc Q lcl|NC_020871. 22 ALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHG-KVGHTRFTREIGV 100 (468) Q Consensus 22 ~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG-~~g~~~fv~E~g~ 100 (468) ++++++++.. ++|++|-++-+..+|..+..... .+.+.....+.....-++.. ..+. +.+...+++|++. T Consensus 1 ~l~~~~~~t~------~~gg~liP~~~~~~Ii~~~~~~~--~l~~~~~~~~~~~~~g~~~~-~~~~~~~~~a~~v~Eg~~ 71 (293) T protein:vir:48 1 MLDSKTDHSG------SDAGLTIPQDIRTAINTLVRQYD--SLQEYVNVENVTTLTGSRVY-EKWTDITGLANIDDEAGK 71 (293) T ss_pred Cceeeccccc------CcCceEechhHHHHHHHHHHhhh--hhhhhceeeeccCCcceEEE-EeecCCCcceeeecCCcc Confidence 7777776554 35778888888888755444333 34444444444433333322 2232 3345679999987 Q ss_pred cc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccch Q lcl|NC_020871. 101 AP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGL 179 (468) Q Consensus 101 ~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl 179 (468) .. .+++.+......+|-++....+|.-+- .++..|.+....+..-..++..++.++|.|...... ...-+ T Consensus 72 ~~~~~~~~~~~i~l~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~---~~~~~----- 142 (293) T protein:vir:48 72 IADIDDPKLSLIKYTIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT---KPTLT----- 142 (293) T ss_pred cccccccceeEEEEeeeEEEEeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc---ccccc----- Confidence 65 688999999999999998877776432 344567777788888888899999999988765321 11112 Q ss_pred hhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cCCceEEeecCCCcc----eeeeecc Q lcl|NC_020871. 180 AKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNV----SVGFNIQ 254 (468) Q Consensus 180 ~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~----~~G~~v~ 254 (468) +-+.|..+---+..+|......+||+.+.+.+...- -.+ |.+.+++.... -.|.+|- T Consensus 143 -----------------~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g-~~l~~~~~~~~~~~~l~G~Pv~ 204 (293) T protein:vir:48 143 -----------------KWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALG-DYLMERDVKSPTGYSIAGFAVK 204 (293) T ss_pred -----------------CHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCC-ceEeecCcCCCCCceecceeeE Confidence 222222222222334555567899999999885432 233 33444432221 1333321 Q ss_pred ceeecCCcccc--CCC--Eeecccc--cccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCccc Q lcl|NC_020871. 255 GFHSARGFIKL--HGS--TVMENEQ--ILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIA 328 (468) Q Consensus 255 ~~~s~~g~i~l--~gs--~i~~~~n--~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~a 328 (468) +........ -+. .++.+.. .....+.. ++-.........|.. ....|++..--+-.---+ T Consensus 205 --~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~---------~~i~~~~~~~~~~~~---~~~~~r~~~r~d~~~~~~ 270 (293) T protein:vir:48 205 --EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQ---------MSLLSTNIGGGAFET---DTTKVRVIDRFDVVATDT 270 (293) T ss_pred --EecccccCCccCCceEEEEEeccceEEEEEecc---------eEEEEecccchhhhc---CeEEEEEEEeeCcEEecc Confidence 111000000 000 1111100 00000000 000000000000110 011122111111000001 Q ss_pred ccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEec Q lcl|NC_020871. 329 SEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPA 380 (468) Q Consensus 329 S~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~ 380 (468) ...+..+.+ +.... ..-.+..++ T Consensus 271 ~a~~~l~~~-------------~~~~~----------------~~~~~~~~~ 293 (293) T protein:vir:48 271 EAFVPASFK-------------AIADQ----------------KGNIGSTAV 293 (293) T ss_pred cceEEEEee-------------ccccC----------------CccccccCC Confidence 111111111 00000 000000000 No 73 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=96.34 E-value=0.00073 Score=37.79 Aligned_cols=332 Identities=12% Similarity=0.071 Sum_probs=146.4 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) |- -||+.. ++.+++ +..+|+.|-.|...+-|..|. +...+.+...+.+..+.-.+| T Consensus 1 ~g----------~~~e~~-----~~~~~~------t~~~~g~l~~~~~~~ii~~l~---~~s~i~~l~~~~~~~~~~~~i 56 (397) T protein:vir:23 1 MG----------FSADHS-----QIAQTK------DTMFTGYLDPVQAKDYFAEAE---KTSIVQRVAQKIPMGATGIVI 56 (397) T ss_pred CC----------cCHHHH-----HHhhcc------CCCCccccchhHHHHHHHHHH---hccchhhhcceeeccCCceEE Confidence 11 111111 111111 112245566666555554443 334556656666666554556 Q ss_pred ceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~G 160 (468) .++.. .....|++|++..+.+++.+......+|=++-...+|.-+-. ++..|.+....+.-...+++.+|.++++| T Consensus 57 p~~~~---~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~l~~aia~~~d~a~l~G 132 (397) T protein:vir:23 57 PHWTG---DVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTKVATAIAMAFDNAALHG 132 (397) T ss_pred EEEcC---CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 66663 334568999999999999999999999999888888875432 45678899999999999999999999999 Q ss_pred ccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cCCceEE Q lcl|NC_020871. 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LSKQTQL 239 (468) Q Consensus 161 d~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~~qr~v 239 (468) +.. +....|+.......+ -..+.......++ +...+...+....-..|++.+.+.|...- -.++..+ T Consensus 133 ~gt---------~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~ 200 (397) T protein:vir:23 133 TNA---------PSAFQGYLDQSNKTQ--SISPNAYQGLGVS-GLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLF 200 (397) T ss_pred ccC---------Cccccccccccccee--eecccchhHHHHH-HHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceee Confidence 854 122344443322222 2223333333443 44445567778888999999999885422 1222322 Q ss_pred eecCCCc--------ceeeeecc------------------c-eeecCCccccCC-----------------------CE Q lcl|NC_020871. 240 VRDNGNN--------VSVGFNIQ------------------G-FHSARGFIKLHG-----------------------ST 269 (468) Q Consensus 240 ~~~n~~~--------~~~G~~v~------------------~-~~s~~g~i~l~g-----------------------s~ 269 (468) +++...+ .-.|.++- . ++..++.+.+.- .+ T Consensus 201 ~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v 280 (397) T protein:vir:23 201 VESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLV 280 (397) T ss_pred cccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccce Confidence 2221111 01222211 0 011111111110 00 Q ss_pred eeccccccccccccc------CCCCCCcceeEEecCCCCCCc----CcccceeEEEEEEEEcccC--Ccccc--cceeee Q lcl|NC_020871. 270 VMENEQILDERILAL------PTAPQQAKVTATQEAGKKGQF----RAEDLAAHEYKVVVSSDDA--ESIAS--EVATAT 335 (468) Q Consensus 270 i~~~~n~l~~~~~~~------p~ap~~~~vtat~~~~~~g~~----~~~~~~~y~YkVtavn~~G--ES~aS--~~vt~T 335 (468) -++-..+++-..... ...+...+.+.+..+.++|+| .+.-.....|-.++.+-.+ |.+.. ...+++ T Consensus 281 ~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 360 (397) T protein:vir:23 281 AVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLDGASAGNFTLSLDGKTSANIAYNASTATVKSAIVAIDDGVSADDVT 360 (397) T ss_pred eEEEEeeeccceecccceEEEeeccccceeeecccccCcceEEEEecCccccCcccccchhhhHHHhhhcccccccceee Confidence 011111111111000 001111111111112222222 2222222333333332222 22211 111112 Q ss_pred eeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEecCCCCCCCCc Q lcl|NC_020871. 336 VTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPETV 401 (468) Q Consensus 336 v~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N~~iPgt~ 401 (468) ++. ++...+|+......+. .-.+++- ... ..++-.+- T Consensus 361 ~~~--~~~~~~~~~~~~~~~~--------------~~~~~~~------~~~-------~~~~~~~~ 397 (397) T protein:vir:23 361 VTG--SAGDYTITVPGTLTAD--------------FSGLTDG------EGA-------SISVVSVG 397 (397) T ss_pred eec--CCceeEEEeccccccC--------------ccccccC------ccc-------cceeeecC Confidence 222 2333344443221110 0001110 000 00011110 No 74 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=96.17 E-value=0.0003 Score=39.88 Aligned_cols=296 Identities=14% Similarity=0.071 Sum_probs=130.6 Q ss_pred CCCcccchhhcc--cChhhHHHHHHHHhhc----cc---------ccCcccccCccccchhhhhhHhhhhhhccccccch Q lcl|NC_020871. 1 MPKNNKEEEVKE--VNLNSVQEDALKSFTT----GY---------GITPDTQTDAGALRREFLDDQISMLTWTENDLTFY 65 (468) Q Consensus 1 ~~~~~~~~~~~~--~n~~~~~e~~~Ksf~a----gy---------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~ 65 (468) +....+..+.+| ......-+++.|+|.. +- .....+..+|+.|-++.+...|..+... .-.+. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~--~~~l~ 147 (408) T protein:vir:74 70 QVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ--YDSLQ 147 (408) T ss_pred HHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhh--hcchh Confidence 111111111111 1111122334444421 11 0112334557788899888888554443 33567 Q ss_pred hhhcccchhhhhhccceeeeeccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHH Q lcl|NC_020871. 66 KDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVA-PVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDD 144 (468) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ 144 (468) +.+...+..+....|......+......+++|++.. +.+++.+.+....++-++.--.+|.-+= .++..|.+....+. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~ 226 (408) T protein:vir:74 148 QYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENILAWLSSW 226 (408) T ss_pred hhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHH-hhchHHHHHHHHHH Confidence 777777766544444322211222234578998875 4788999999999999998888887532 34666788888888 Q ss_pred HHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc--cceeeccCCC-CCHHHHhhhhhhhhhccCceEEEecC Q lcl|NC_020871. 145 AIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDARGAS-LTESLLNQAAVMISKGYGTPTDAYMP 221 (468) Q Consensus 145 ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~--~nviDarG~~-ls~~~l~~~a~~i~~~fG~~td~~m~ 221 (468) --..+...++.+++.|+..-. ..+..+-+|+|..++.. +..+-..... .+...+.....+. .+-|. .+|.| T Consensus 227 l~~~~~~~~d~~il~G~G~~~---~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lk-d~~G~--~l~~~ 300 (408) T protein:vir:74 227 IAKKVVVTRNQAIIAAMGTVP---KKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVK-TAEGK--YLLEP 300 (408) T ss_pred HHHHHHHHHHHHHhhcccccc---cccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-cCCCc--eEecc Confidence 889999999999999997643 33456778888876531 1111011111 3444444333221 12232 34444 Q ss_pred HHHHhhHHHhhcCCceEEeecCCCcceeeeecccee----------ecCCcccc-----------CCCEeeccccccccc Q lcl|NC_020871. 222 VGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFH----------SARGFIKL-----------HGSTVMENEQILDER 280 (468) Q Consensus 222 ~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~----------s~~g~i~l-----------~gs~i~~~~n~l~~~ 280 (468) .... ... .-+-+-.++..++..-.+.|.....++ ..++.+.+ ++.+.++-..+++-. T Consensus 301 ~~~~-~~~-~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) T protein:vir:74 301 DPTK-PNS-YLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) T ss_pred CcCC-CCC-ceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcE Confidence 3211 111 123333444333221111111111010 11111111 111111111111111 Q ss_pred ccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC---Ccccccce Q lcl|NC_020871. 281 ILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA---ESIASEVA 332 (468) Q Consensus 281 ~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G---ES~aS~~v 332 (468) ... |. +--..+++++...- .+....+| T Consensus 379 ~~~-~~------------------------a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 379 ATD-SE------------------------ALVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred Eec-cc------------------------ceEEEEeecccCCCCCCCCCccccC Confidence 000 00 00000111111000 00000000 No 75 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=96.00 E-value=0.0011 Score=36.73 Aligned_cols=295 Identities=14% Similarity=0.075 Sum_probs=135.1 Q ss_pred CCC-------cccchhhcccChhhHHHHHHHHhhcccc---------------cCcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPK-------NNKEEEVKEVNLNSVQEDALKSFTTGYG---------------ITPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~-------~~~~~~~~~~n~~~~~e~~~Ksf~agy~---------------~~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) .+. .+...........+.-.+|.|++..+.. .+..+-.+|+.|-++.+.+.|..+... T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~ 149 (397) T protein:vir:12 70 VPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQ 149 (397) T ss_pred hhhhhhhhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhh Confidence 110 0111111111222222345555543321 112234557888899998888666554 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+++.+...++.+...+|......++. ...+++|++..+ .+++.+.......+-++.--.+|.-+- .++..|. T Consensus 150 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l-~ds~~~l 225 (397) T protein:vir:12 150 FE--PLEQYVTVEPVTTRSGTRLLEKNADMV-PFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSML-NDSDQAI 225 (397) T ss_pred hh--hHHhhcceeeccCCceeEEEEEecCCc-ceeeecccccccccccccceeEEeeheeeEeeehhhHHHH-hhchHHH Confidence 33 466666666666544455444444443 456899998754 678999999999998887766666432 2355678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.-...+++.++.++++|+..-. |.+ -+-+|+|.+++. ..+-.+|..... T Consensus 226 ~~~i~~~l~~~~~~~~d~~il~G~g~~~--~~g--~~~~~~i~~~~~---------------------~~l~~~~~~~a~ 280 (397) T protein:vir:12 226 MTYVAKWFAKKSVVTRNNLILAAIASLK--KVD--IDGLDGIKKALN---------------------VTLDPMVAPGSI 280 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccc--ccc--cccHHHHHHHHh---------------------hccchhhhCCCE Confidence 8888889999999999999999996632 221 122344433322 112223444445 Q ss_pred EecCHHHHhhHHHhh-cCCceEEeecCCCc----ceeeeeccceeecCCcccc-CCC--Eeecccc--cccccccccCCC Q lcl|NC_020871. 218 AYMPVGVQADFVNQQ-LSKQTQLVRDNGNN----VSVGFNIQGFHSARGFIKL-HGS--TVMENEQ--ILDERILALPTA 287 (468) Q Consensus 218 ~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~----~~~G~~v~~~~s~~g~i~l-~gs--~i~~~~n--~l~~~~~~~p~a 287 (468) .+|++.+.+.|...- -.+++.++ ++... .-.|.+|- ++.+..... .+. .++.+.. ++...+ T Consensus 281 ~~~n~~~~~~L~~lkd~~G~~l~~-~~~~~g~~~~l~G~pv~--~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~------ 351 (397) T protein:vir:12 281 VLTNQDGYDWLDTLKDGTGRYLLQ-PDPTNPTKKLLDGRPVV--PFTNRVLKTQKGKAPLIIGNLKEAIVLFDR------ 351 (397) T ss_pred EEEcHHHHHHHHHhhccCCceeec-ccccCCCCccccceeeE--EecccccccCCCccEEEEEehhceEEEEee------ Confidence 789999988884432 23444332 22111 11233321 010000000 000 1111100 000000 Q ss_pred CCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeecc Q lcl|NC_020871. 288 PQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAK 339 (468) Q Consensus 288 p~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~ 339 (468) .. ++-.........|.. +...|++..--+-+---|...+-++++++ T Consensus 352 -~~--~~i~~~~~~~~~f~~---~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 352 -EQ--QSIASTDTGAGAFET---NSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred -cc--eEEEEeccccchhhc---CceEEEEEEeeccEEecccceEEEEEeeC Confidence 00 000000000000110 11222222111111111222333333333 No 76 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=95.93 E-value=0.00041 Score=39.13 Aligned_cols=299 Identities=15% Similarity=0.109 Sum_probs=135.6 Q ss_pred CCCcccchhhcccC--hhhHHHHHHHHhh----cccc---------cCcccccCccccchhhhhhHhhhhhhccccccch Q lcl|NC_020871. 1 MPKNNKEEEVKEVN--LNSVQEDALKSFT----TGYG---------ITPDTQTDAGALRREFLDDQISMLTWTENDLTFY 65 (468) Q Consensus 1 ~~~~~~~~~~~~~n--~~~~~e~~~Ksf~----agy~---------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~ 65 (468) .....+....+|.+ ....-+...|+|. .+.+ ....+..+|+.|-++.+..+|..+..... .+. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~--~l~ 147 (408) T protein:vir:10 70 QVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYD--SLQ 147 (408) T ss_pred HHhccccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhc--hhh Confidence 11111222222211 1111223344432 1110 12233456788889999888855544433 566 Q ss_pred hhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHH Q lcl|NC_020871. 66 KDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDD 144 (468) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ 144 (468) +.+...++.+..-.+......++.+...+++|++..+ .+++.+.......+-++.-..+|.-+ +.++..|......+. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~ 226 (408) T protein:vir:10 148 QYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSW 226 (408) T ss_pred hhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHH-HhhchHHHHHHHHHH Confidence 6666666665555554444444556678999998765 67799999999999999887777653 234566778888888 Q ss_pred HHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc--cceeeccCCC-CCHHHHhhhhhhhhhccCceEEEecC Q lcl|NC_020871. 145 AIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDARGAS-LTESLLNQAAVMISKGYGTPTDAYMP 221 (468) Q Consensus 145 ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~--~nviDarG~~-ls~~~l~~~a~~i~~~fG~~td~~m~ 221 (468) -...+...++.+++.|+..-.. .....-+|.|..++.. +.-+...+.. .+...++....+ ...-|. .+|.| T Consensus 227 l~~~~~~~~~~~il~g~g~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~l-kd~~G~--~i~~~ 300 (408) T protein:vir:10 227 IAKKVVVTRNQAIIEVMKAAPK---KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALV-KTAEGK--YLLEP 300 (408) T ss_pred HHHHHHHHHHHHHhhccccccc---ccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHh-hccCCc--eEecc Confidence 8899999999999999976432 2334557777665532 1222222222 345554443332 122232 34444 Q ss_pred HHHHhhHHHhhcCCceEEeecCCCcceeee--------eccc-e-eecCCcccc-----------CCCEeeccccccccc Q lcl|NC_020871. 222 VGVQADFVNQQLSKQTQLVRDNGNNVSVGF--------NIQG-F-HSARGFIKL-----------HGSTVMENEQILDER 280 (468) Q Consensus 222 ~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~--------~v~~-~-~s~~g~i~l-----------~gs~i~~~~n~l~~~ 280 (468) .- .+.-.. -|-+-.++...+..-+..|. +... + +..++.+.+ ++.+.++-...++-. T Consensus 301 ~~-~~~~~~-~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) T protein:vir:10 301 DP-TKPNSY-LIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) T ss_pred Cc-CCCCCc-eecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccE Confidence 31 111111 23333444322211111111 1111 1 111222221 122222222222211 Q ss_pred ccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccce Q lcl|NC_020871. 281 ILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVA 332 (468) Q Consensus 281 ~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~v 332 (468) ... |.+-. .++-++.+...+.. +-+.+|. | T Consensus 379 v~~-~~a~~--~~~~~~~~~~~~~~------------------~~~~~~~-~ 408 (408) T protein:vir:10 379 ATD-SEALV--AGSFSAIADQVGNF------------------KTTTSTA-V 408 (408) T ss_pred Eec-cccEE--EEEeeccccCCCCC------------------CCCCccc-C Confidence 111 00000 00000000000000 0000000 0 No 77 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=95.78 E-value=0.0015 Score=36.12 Aligned_cols=290 Identities=12% Similarity=0.056 Sum_probs=138.1 Q ss_pred cccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEE Q lcl|NC_020871. 34 PDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTV 113 (468) Q Consensus 34 p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~ 113 (468) =.+.+.|+.|-++.+..+|..+. .+...+.+...+.+..+---+|.++.. .+.+.+++|++..+.+++.+.+... T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l--~~~s~i~~l~~~~~~~~~~~~ip~~~~---~~~a~wv~E~~~~~~s~~~f~~v~l 75 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKV--KGHSSLAKLSSQKPIPFNGSKEFTFTL---DSDIDVVAENGKKTHGGLSLEPVTI 75 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHH--HhhchhhhhcceeecCCCceEEEEEec---CcceEEeecCccccccccceeeEEe Confidence 22334567788888888874433 334455555555556543345666653 3456799999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-cceee Q lcl|NC_020871. 114 NMKFASDTKNISIAAGLVN--NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-DNVHD 190 (468) Q Consensus 114 ~~k~l~~~~~vs~~~~lv~--~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-~nviD 190 (468) ..+=++.--.+|.-+-.++ ...+.+....+..-..+++.+|.++++|+.+-.. .+..--|+...... .++.- T Consensus 76 ~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g-----~~~~~~~~~~~~~~~~~~~~ 150 (303) T protein:vir:97 76 VPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK-----KASDVIGTNHFDSKVTQVVK 150 (303) T ss_pred eeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc-----cccccccccccccccccccc Confidence 9999998888887654433 3466778888999999999999999999743221 11111111111111 12222 Q ss_pred ccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCC--ceEEeecCCCcceeeeeccceeecCCccccCCC Q lcl|NC_020871. 191 ARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK--QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGS 268 (468) Q Consensus 191 arG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~--qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs 268 (468) .-+....-+.|.++...+..+++.++.+.||+.+.+.+.. ..+. .+.++ ++.+ .|.... .|.|- T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~-lkd~~g~~~~~-~~~~---~~~~~~---------~l~G~ 216 (303) T protein:vir:97 151 FTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAK-VTNGEMGPKMY-PELA---WGANPD---------SINGL 216 (303) T ss_pred cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH-hhccCCCeEEe-cCcc---CCCCCc---------eecce Confidence 2222233445555555566678888889999999998843 3333 33332 2211 111111 12232 Q ss_pred EeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEE--- Q lcl|NC_020871. 269 TVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKL--- 345 (468) Q Consensus 269 ~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~l--- 345 (468) -+..+++. +....+..+..+ . ..+-+++.+....+.+-+. ........++..+ T Consensus 217 Pv~~s~~v-----~~~~~~~~~~~~---~-----------~~Gdf~~~~~~~~~~~~~~-----~~~~~~~~d~~~~~~~ 272 (303) T protein:vir:97 217 KSSVNTTV-----GAGADEAESKDL---V-----------IIGDFESMFKWGYAKQIPM-----EIIKYGDPDNSGKDLK 272 (303) T ss_pred eeEEeccc-----CCccccCCCccE---E-----------EEeeccccEEEEEecCcEE-----EEeeccCCCCcchhhh Confidence 22222211 100000000000 0 0011222111111111000 0000000011000 Q ss_pred -----EEEeecCCCcccceEEEEeecCCCceeEEEEEEecccc Q lcl|NC_020871. 346 -----EIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA 383 (468) Q Consensus 346 -----tIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~ 383 (468) .+-...-.+ ..|.+. .- +.++...+. T Consensus 273 ~~n~~~~r~~~r~~-----~~v~~p----~a---f~~l~~~~~ 303 (303) T protein:vir:97 273 GYNQIYLRAEAYIG-----WGILDA----KS---FARVTKGEV 303 (303) T ss_pred hcCcEEEEEEEEec-----cEeecc----cc---eEEeeCCCC Confidence 000000000 111111 11 122211211 No 78 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=95.60 E-value=0.0018 Score=35.68 Aligned_cols=310 Identities=12% Similarity=0.039 Sum_probs=141.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc---ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY---GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTV 77 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy---~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv 77 (468) .-.........+.+..+..++|..-+..|- ..+-.+..+|+.|-++.+..+|..+... ...+.+.....++.+.- T Consensus 96 ~~~~~~~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~--~s~l~~l~~~~~~~~~~ 173 (425) T protein:vir:10 96 AAAQMGANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVL--ISPMRQLCRVQPVSKAG 173 (425) T ss_pred HhhhcccccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHh--hhhhhhhceeeeccCCc Confidence 000011122233333332332222111111 1112234567888899998888655433 33555555556666554 Q ss_pred hccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWA 156 (468) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a 156 (468) ..|.+.... ....+++|++. |+...+.+.+....++=++.-..+|.-+ +.++..|.+....+.-...++..++.+ T Consensus 174 ~~~~~~~~~---~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~la~ai~~~~d~~ 249 (425) T protein:vir:10 174 FSKLFNMGG---TTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQI-LDDAEIDLESWLATEVQTEFAKQEGKA 249 (425) T ss_pred eEEEEEcCC---cceeeeccccccccccccccceeeeeheeeEeehHhHHHH-HhcchhHHHHHHHHHHHHHHHHHHHhh Confidence 555555532 24568999987 4566689999988888777766666642 234566788899999999999999999 Q ss_pred HhhcccccccCCCCCCCccccchhhhcCcc------------ceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHH Q lcl|NC_020871. 157 SFFGDSDLSDSPEPQAGLEFDGLAKLINQD------------NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGV 224 (468) Q Consensus 157 ~f~Gd~~l~~~~~~~~gleFDGl~~li~~~------------nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v 224 (468) +++||-. + +-.||.+.+... .+.......++.+.|-.+...+...|-...-..|++.+ T Consensus 250 ~l~G~G~-~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~ 319 (425) T protein:vir:10 250 FLAGDGT-N---------KPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNT 319 (425) T ss_pred hhcccCC-C---------CcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHH Confidence 9999842 1 234666644311 11122223344444433333333445444457899999 Q ss_pred HhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCC Q lcl|NC_020871. 225 QADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKG 303 (468) Q Consensus 225 ~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g 303 (468) .+.+.. .-+.+ |.+.+++.... ..-.|.|.-++..++ +|...+... +.. T Consensus 320 ~~~L~~-lkD~~G~~l~~~~~~~g-------------~~~~l~G~PV~~~~~--------~p~~~~~~~----~i~---- 369 (425) T protein:vir:10 320 QRQVRK-LKDGQGNYLWQPSYVAG-------------QPATLAGYPVTEVPD--------MPDVAANST----PIL---- 369 (425) T ss_pred HHHHHH-hhcCCCceeeccCccCC-------------CCceecceeeEEecC--------cCCccCCcc----EEE---- Confidence 988843 33443 33333332111 001223333332222 111111100 000 Q ss_pred CcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEee-cCC Q lcl|NC_020871. 304 QFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRK-GAE 368 (468) Q Consensus 304 ~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~-~~~ 368 (468) .|.+++.+..+++.|-..-. ..-...+-+.+. .+.-+.++. |.-+++.+- ++. T Consensus 370 ------~Gd~~~~~~i~~~~~~~v~~-----d~~~~~~~~~~~-~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 370 ------FGDFQQTYLIIDRIGVRVLR-----DPYTAKPYVLFY-TTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ------EEehhccEEEEEecceEEEe-----cccccCCcEEEE-EEEEeccEeecccceEEEEeeccC Confidence 01122111222322211000 000000111111 111122221 122222222 222 No 79 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=95.57 E-value=0.00064 Score=38.10 Aligned_cols=328 Identities=11% Similarity=0.068 Sum_probs=140.0 Q ss_pred CCCcccchhhcc----cChhhHHHH---HHHHhhccccc-Ccc--------cccCccccchhhhhhHhhhhhhcccccc- Q lcl|NC_020871. 1 MPKNNKEEEVKE----VNLNSVQED---ALKSFTTGYGI-TPD--------TQTDAGALRREFLDDQISMLTWTENDLT- 63 (468) Q Consensus 1 ~~~~~~~~~~~~----~n~~~~~e~---~~Ksf~agy~~-~p~--------~~~~gaALr~esld~~i~~L~~~~~~f~- 63 (468) +|..-- +.++- ..+.+.|-. +.|.++.-..+ +.+ +-.++..|+.-.| .|.+..+. T Consensus 30 ~PN~~~-p~l~~i~~g~~~~~~~~t~~w~~d~l~~~~~~~ta~~~a~~T~i~V~~~~~f~~~~l-------~~~~~~~Ev 101 (418) T protein:vir:96 30 VPNGSA-PLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEALADATVLTVENSDGLTKGMI-------FYNEATGEN 101 (418) T ss_pred cCCccc-chhhhhcccCccccceeEEEEEeeEeeeeeEEEEEEEecCceEEEecCCcccccccE-------EEEecCCeE Confidence 443111 01110 001111111 22232211111 111 1122233444332 22110000 Q ss_pred -chhhhcccchhhhhhccceee--eeccccc----cccccccccccccCcceE--EEEEEEEeeeehhhhhhhHhh-h-- Q lcl|NC_020871. 64 -FYKDIAKKPATSTVAKYDVYM--QHGKVGH----TRFTREIGVAPVSDPNIR--QKTVNMKFASDTKNISIAAGL-V-- 131 (468) Q Consensus 64 -~~~~i~k~~~~stv~ey~~~~--~hG~~g~----~~fv~E~g~~~~~d~~~~--r~~~~~k~l~~~~~vs~~~~l-v-- 131 (468) -..+|+-.+. ..+.-|.... .|--... +--+.||.+...+. .+. ++...+--+.+..+||.-++. + T Consensus 102 irVtsVng~~l-TV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~-~~k~~~vsN~tQIf~e~vsVSgTAqA~v~q 179 (418) T protein:vir:96 102 MRLELVNGLNL-TVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTAR-SIQPVYVPNFTQIFRNAWALTDTARASYAE 179 (418) T ss_pred EEEEEEeCCEE-EEEEccCCeeeeeeecCceEEEeecCcccccccCCcc-eecceeccchhheehhhhhhhhhhhhhhhh Confidence 0011110000 0111122211 1111110 11247777765554 122 222223334566677766544 2 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCC---CccccchhhhcCccceeeccCC-CCCHHHHhhhhhh Q lcl|NC_020871. 132 NNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQA---GLEFDGLAKLINQDNVHDARGA-SLTESLLNQAAVM 207 (468) Q Consensus 132 ~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~---gleFDGl~~li~~~nviDarG~-~ls~~~l~~~a~~ 207 (468) -++.|....+ .|+|.-.+..+|.++++|.+.+..+..... -=.-|||-.-+ +.||+++.+. .++++.|..+... T Consensus 180 aGvsn~~~~e-~d~l~~~kv~iE~ali~g~~~~~~~ng~p~~~t~R~m~gI~~f~-~~Nvi~ag~~~~~t~d~L~~~~~~ 257 (418) T protein:vir:96 180 AGYSNITESR-RDCMDFHATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAIRQYA-PDNVNAMPNPTAVTYDDVVDATID 257 (418) T ss_pred cCcchhHHHH-HHHHHHHHHHHHHhhhccccccCCCCCcccccccchhHHHHhhc-cccccccCCCCcCCHHHHHHHHHH Confidence 2677777666 699999999999999999987742211100 00125555544 7899999988 4999999988776 Q ss_pred hhh---ccCceEE-----EecCHHHHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEe--------- Q lcl|NC_020871. 208 ISK---GYGTPTD-----AYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTV--------- 270 (468) Q Consensus 208 i~~---~fG~~td-----~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i--------- 270 (468) +.+ +-|..++ ++++...|..+.. +...-|. ......+|..|..|.+..|-+++--+-- T Consensus 258 a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k-~~~~I~~----~~~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~~g 332 (418) T protein:vir:96 258 AFKWSVNVGDNTQRVMFCDTVGMRTMQDIGR-FFGEVTV----TQRETSYGMVFTEWKFFKGRLIIKEHPLFSAIGISPG 332 (418) T ss_pred HHhhcCCCCCcccceEEEEEeChHHHHHHhh-hhceeEe----ccccceeceEEEEEEeeccEEEEEecCCCCccccCcc Confidence 655 3577766 5669999999954 4443333 2344479999999999999888733332 Q ss_pred ----ecccccccccccccCCCCCCcceeE------EecCCCC-CCcCcccceeEEEEEEEEcccCCcccccceeeeeecc Q lcl|NC_020871. 271 ----MENEQILDERILALPTAPQQAKVTA------TQEAGKK-GQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAK 339 (468) Q Consensus 271 ----~~~~n~l~~~~~~~p~ap~~~~vta------t~~~~~~-g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~ 339 (468) ++-+.+-.......+..+..-.-++ ++..+.+ |-+.-.+.-.=.|.+-..|..+-. -++ T Consensus 333 ~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~~G~l~~Eltle~~N~~a~a--------~it-- 402 (418) T protein:vir:96 333 FAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCA--------VIT-- 402 (418) T ss_pred eEEEEecCceEEEEecCCCccchhcccCCCcccccccccccccccccccCEEEEEEEEEeecccccE--------Eee-- Confidence 3333332222211221221111000 0000000 001100001112222223322210 011 Q ss_pred CcceEEEEEeecCCCcccceEEEEeecCCC Q lcl|NC_020871. 340 DDGVKLEIELAPMYSSRPQFVSIYRKGAET 369 (468) Q Consensus 340 ~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~ 369 (468) .+.-|-| +||-..+-. T Consensus 403 -----------gl~~~~~---~~~~~~~~~ 418 (418) T protein:vir:96 403 -----------GLQKAKE---RVYLTAPAP 418 (418) T ss_pred -----------ccccccc---ccccCCCCC Confidence 1111111 122111100 No 80 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=95.30 E-value=0.0023 Score=35.01 Aligned_cols=305 Identities=13% Similarity=0.026 Sum_probs=131.7 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccc--cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYG--ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVA 78 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~--~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ 78 (468) .+.........+... +.-.+|.|.+..|.. .+-.+-.+|+.+-++.+.++|..+.. +...+++.+...++.+... T Consensus 59 ~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~--~~s~i~~~~~~~~~~~~~~ 135 (371) T protein:vir:81 59 IEDKEPLKPTVQVKE-NEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRE--SKDALQNLITVEPVTTLSG 135 (371) T ss_pred hccccccccchhhHH-HHHHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHH--hhhhhhhhceeeeccCCce Confidence 111111111111111 111233333222210 11112245788888888888744443 3335666677777766555 Q ss_pred ccceeeeeccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 79 KYDVYMQHGKVGHTRFTREIGVA-PVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWAS 157 (468) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~ 157 (468) +|......++ +...+++|++.. +.+++++.+.+...+-++....+|.-+ +.++..|.+....+.-...++..++.++ T Consensus 136 ~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~a~~~~~~~~i 213 (371) T protein:vir:81 136 SRVFKKRSQQ-TGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNEL-LNDSTEAIVNTLVRWIGDESRVTRNGLI 213 (371) T ss_pred eEEEEeecCC-cceeeeccccccccccccceeeEEeeeeEEEEeehhhHHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 5555444333 346689999875 578999999999999999988888765 3445567778888888888999999999 Q ss_pred hhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcC--C Q lcl|NC_020871. 158 FFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS--K 235 (468) Q Consensus 158 f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~--~ 235 (468) +.|+..... . ..+-.|++..++. ......|....-.+|++.+.+.+.. .-+ + T Consensus 214 ~~g~g~~~~--~--~~~~~~~i~~~~~---------------------~~l~~~~~~~a~~vmn~~~~~~L~~-lkd~~g 267 (371) T protein:vir:81 214 INVLNTKAK--T--AIADLDGLKQIIN---------------------VQLDPVFRSTSSVIVNQDAFNWLDT-LKDQNG 267 (371) T ss_pred Hhhcccccc--c--ccccHHHHHHHHH---------------------hhcchhhhcCCEEEEcHHHHHHHHH-hhccCC Confidence 999865421 1 1222333332221 1122233334458999999998843 322 2 Q ss_pred ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEE Q lcl|NC_020871. 236 QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEY 315 (468) Q Consensus 236 qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~Y 315 (468) ++.+ +++.... ..-.|.|.-++-.++. |.+.....+..+ . ...--.|.++. T Consensus 268 ~~l~-~~~~~~~-------------~~~~l~G~pV~~~~~~-----------~~~~~~~~~~~~-~---~~~i~~Gd~~~ 318 (371) T protein:vir:81 268 QYLL-QPSISSP-------------TGRQLLGLPVVIVSNK-----------VLANRVDGGTGA-Q---FAPIIVGDLKE 318 (371) T ss_pred Ceee-ecccCCC-------------CCceecceeEEEeccc-----------ccCccccccccC-C---cceEEEEehhc Confidence 3333 2221111 1111222222221111 000000000000 0 00000111221 Q ss_pred EEEEEcccCCcc-cccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEeecCC Q lcl|NC_020871. 316 KVVVSSDDAESI-ASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRKGAE 368 (468) Q Consensus 316 kVtavn~~GES~-aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~~~~ 368 (468) -+...++.|-+. .+.... .....+.+.+....- ..+.. |+.+++.+-+.- T Consensus 319 ~~~~~~~~~~~i~~~~~~~--~~f~~~~v~~~~~~r-~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 319 AVVMFDRQRTEIMSSNVAM--DAFETDATLWRAIER-MDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred eEEEEeecceEEEEecccc--chhhcCceEEEEEEe-eccEEecccceEEEEEecC Confidence 111111111000 000000 000000011110000 00000 111111111100 No 81 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=95.28 E-value=0.0024 Score=34.97 Aligned_cols=313 Identities=8% Similarity=-0.064 Sum_probs=129.4 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcc---------------cccCcccccCccccchhhhhhHhhhhhhccccccch Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTG---------------YGITPDTQTDAGALRREFLDDQISMLTWTENDLTFY 65 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~ag---------------y~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~ 65 (468) .+......+.+....+...++|.+.+..| -.....+..+|+.|-++-+..+|..+..... .+. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~--~l~ 148 (409) T protein:vir:45 71 EQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYG--GIA 148 (409) T ss_pred hhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhh--hhh Confidence 11111111111111222222333222211 1111123345777888888888866654333 344 Q ss_pred hhhcccchhhhhhccceeeeecc-ccccccccccccccccCcceEEEEEEE-EeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 66 KDIAKKPATSTVAKYDVYMQHGK-VGHTRFTREIGVAPVSDPNIRQKTVNM-KFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~-~g~~~fv~E~g~~~~~d~~~~r~~~~~-k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) +.+...+..+- .+..+...++ ...+.+++|++..+..++.+....... |+.+.--.+|.-+ +.++..|.+....+ T Consensus 149 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~el-l~ds~~~l~~~i~~ 225 (409) T protein:vir:45 149 SVAQILTTSDG--RTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNEL-LQDSAIDMEAYLAR 225 (409) T ss_pred hhceeeecCCC--ceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHH-HhccHHHHHHHHHH Confidence 43333343321 1222222222 334568999999999999999887654 5544323344332 13355677788888 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhh-hhhhhhccCceEE-EecC Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQA-AVMISKGYGTPTD-AYMP 221 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~-a~~i~~~fG~~td-~~m~ 221 (468) .--..+...++.++++|+..- ...+..||.+.....+....-+ .++.+.|-.+ ..+-..+-..+.- ++|+ T Consensus 226 ~la~a~~~~~~~a~l~G~G~~-------~~~~p~Gil~~~~~~~~~~~~~-~~~~d~i~~l~~~l~~~~~~~a~~~~~~n 297 (409) T protein:vir:45 226 RIAERIGRGEARYLIQGTGAG-------TPKQPKGLAASVTGTTQTAAAN-AVKWQEILALKHSIDPAYRRGPKFRLAFN 297 (409) T ss_pred HHHHHHHHHHHHHhhccCCCC-------Cccccceeeecccccccccccc-ccchHHHHHHHHhhhhhhccCCeEEEEEC Confidence 888889999999999999542 1245567776555444443333 3454444433 3333333344444 4679 Q ss_pred HHHHhhHHHhhcCC-ceEEeecCCCc----ceeeeeccceeecCCc-cccCCCEe-ecccccccccccccCCCCCCccee Q lcl|NC_020871. 222 VGVQADFVNQQLSK-QTQLVRDNGNN----VSVGFNIQGFHSARGF-IKLHGSTV-MENEQILDERILALPTAPQQAKVT 294 (468) Q Consensus 222 ~~v~a~~~~~~~~~-qr~v~~~n~~~----~~~G~~v~~~~s~~g~-i~l~gs~i-~~~~n~l~~~~~~~p~ap~~~~vt 294 (468) +.+.+.+.. .-+. -|.+.+++... .-.|.+| +++..-. +.-...+| +.+..- ... ..-...+ T Consensus 298 ~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~l~G~PV--~~~~~~p~~~~~~~~i~~Gd~~~-----~~i---~~~~~~~ 366 (409) T protein:vir:45 298 DNTLKLISE-MEDGQGRPLWLPDIVGVAPASVLNVPY--VIDQEIDDIGAGKKFMFCGDFDR-----FII---RRVRYMI 366 (409) T ss_pred HHHHHHHHH-hhcCCCceeeccCcCCCCCceecceee--EEecCcCCccCCccEEEEeehhh-----hhe---eeccceE Confidence 999888743 3222 23333332211 1122222 1110000 00000111 111000 000 0000000 Q ss_pred EEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCc Q lcl|NC_020871. 295 ATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSS 355 (468) Q Consensus 295 at~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga 355 (468) ....... |.. .+...|++..-=+.+=-.+...+. |+..+..|+ T Consensus 367 ~~~~~d~---~~~--~~~~~~~~~~r~d~~~~~~~A~~~-------------l~~k~s~~~ 409 (409) T protein:vir:45 367 LKRLVER---YAE--YDQTGFLAFHRFDCILEDTSAIKA-------------LVGKGSVGG 409 (409) T ss_pred EEEeecc---ccc--CCcEEEEEEEEeccEeechhheEE-------------EEeccCCCC Confidence 0000000 000 001111111100000000111111 222223332 No 82 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=94.41 E-value=0.0045 Score=33.45 Aligned_cols=311 Identities=13% Similarity=0.041 Sum_probs=132.5 Q ss_pred CC--CcccchhhcccChhhHHH--HHHHHhhccc-------------ccCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MP--KNNKEEEVKEVNLNSVQE--DALKSFTTGY-------------GITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~--~~~~~~~~~~~n~~~~~e--~~~Ksf~agy-------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) +- ...+.......+.....+ .+.+-+.-+. .....+..+|+.+-++.+.++|..+..... . T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~--~ 191 (458) T protein:vir:10 114 FVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKEL--V 191 (458) T ss_pred hhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhh--h Confidence 00 000000001111111111 1111111000 000112235777888888888866554433 4 Q ss_pred chhhhcccchhhhhhccceeeeecccccccccccccccc------ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP------VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~------~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) +.+.....++.+....|.+.... +.+.+++|++..+ .+++.+.+.....+=++.--.+|.-+ +.++..|. T Consensus 192 l~~~~~~~~~~~~~~~~~~~~~~---~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~el-l~ds~~~~ 267 (458) T protein:vir:10 192 VGALFEELPMSSKILTMLVEPDA---GKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDET-EEDAIFSL 267 (458) T ss_pred HHhhcceeecCCcceEEEEecCC---cceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHH-HhcchHHH Confidence 55555666666666666665533 3355677776543 56788888888877777766677653 33466788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc--ce-eeccC---CCCCHHHHhhhhhhhhhc Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD--NV-HDARG---ASLTESLLNQAAVMISKG 211 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~--nv-iDarG---~~ls~~~l~~~a~~i~~~ 211 (468) +....+.....++..++.++|+||-. + +.-||.+-..-. ++ .+.-+ ..++.+.|-++-..+..+ T Consensus 268 ~~~i~~~l~~~i~~~~d~~~l~G~G~-----~-----~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~ 337 (458) T protein:vir:10 268 LPLLRKRLIEAHAVSIEEAFMTGDGS-----G-----KPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRH 337 (458) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCC-----C-----ccceeeecccccccceeecccccccccccHHHHHHHHHhhhhh Confidence 88889999999999999999999842 1 334665533211 21 12222 224444444444444556 Q ss_pred cCceEEEecCHHHHhhHHHhhcC--CceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCC Q lcl|NC_020871. 212 YGTPTDAYMPVGVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQ 289 (468) Q Consensus 212 fG~~td~~m~~~v~a~~~~~~~~--~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~ 289 (468) |......+||+.+.+.|.. +-+ ++....+........|. .-.|.|.-|+..+.. |.... T Consensus 338 ~~~~~~~v~~~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~~----------~~~l~G~pv~~~~~~--------p~~~~ 398 (458) T protein:vir:10 338 GLKLSKLVLIVSMDAYYDL-LEDEEWQDVAQVGNDSVKLQGQ----------VGRIYGLPVVVSEYF--------PAKAN 398 (458) T ss_pred hcCCCEEEEcHHHHHHHHh-hcccCCceeeccccccccccCc----------CceecceeeEEcccc--------ccccC Confidence 6667779999999888843 322 23332221111110110 112334333332211 11100 Q ss_pred CcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEE-EeecCCCc--ccceEEEEeec Q lcl|NC_020871. 290 QAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEI-ELAPMYSS--RPQFVSIYRKG 366 (468) Q Consensus 290 ~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltI-T~~~~~ga--~~~~y~IYR~~ 366 (468) ...+ .. +-|.. .|++ ++..|-+.- .-...+...+.+ +..-+.++ .|+.+.. -+- T Consensus 399 ~~~~---~~----~~f~~------~~~~--~~~~~~~v~-------~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~-~~~ 455 (458) T protein:vir:10 399 SAEF---AV----IVYKD------NFVM--PRQRAVTVE-------RERQAGKQRDAYYVTQRVNLQRYFANGVVS-GTY 455 (458) T ss_pred Ccce---EE----EEecc------cEEE--EEeeceEEE-------eecccCCCceEEEEEEEecceEecccceEE-Eee Confidence 0000 00 00100 0111 111110000 000000000000 00001110 0111100 000 Q ss_pred CCC Q lcl|NC_020871. 367 AET 369 (468) Q Consensus 367 ~~~ 369 (468) +-+ T Consensus 456 aa~ 458 (458) T protein:vir:10 456 AAS 458 (458) T ss_pred ccC Confidence 000 No 83 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=94.38 E-value=0.0046 Score=33.41 Aligned_cols=318 Identities=16% Similarity=0.052 Sum_probs=138.5 Q ss_pred CCCcccchhhccc----------------ChhhHHHHHHHHh----hcccc--cCcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEV----------------NLNSVQEDALKSF----TTGYG--ITPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~~~~~~~~~~~----------------n~~~~~e~~~Ksf----~agy~--~~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) .....+..+.++. .......+..++| ..+.. ....+..+|+.+.++.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:94 68 SENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHh Confidence 1111100000000 0001111222333 22211 111234568888999888888665444 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+.+.+...++.+.-..|...... +.+...+++|++..+ ..++.+.+....++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ds~~~~ 223 (415) T protein:vir:94 148 EF--NLDKYVTVKRVTNGSGKYPVVRQS-EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhhcceeeccCCceeEEEEeec-CCccceeccccccccccccccceeeEeeheeeeeechhhHHH-HhhchHHH Confidence 33 455556666665544555555433 334566899998865 67799999999999999887887753 33466788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.-...++..++.+++.|+..-.. .+ ++.......+.....+...-.++++ +-..+...+...+- T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~-----~~----~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~~~~~~ 293 (415) T protein:vir:94 224 LQELKLWMARTIAATRNKAIIDVITKGST-----GS----TSSGFEKEGKKLEVKKAKSLDDIKD-AINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCcc-----cc----ccccccccccccccccccchHHHHH-HHHhhhhhccCCCE Confidence 88888999999999999999999854211 11 1111222223444444433333443 33333445555677 Q ss_pred EecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+||+.+.+.|.. .-+. -|.+.+++.... ..-.|.|.-+...++. +.+++...+ T Consensus 294 ~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~-------------~~~~l~G~pV~~~~~~------~~~~~~~~~----- 348 (415) T protein:vir:94 294 AIVSQTMFAKLDK-MKDKLGNYLIQPDVKEK-------------TQQRLLGAKIEILPDE------VLGQKGNNT----- 348 (415) T ss_pred EEEcHHHHHHHHH-hhccCCCeeeccCcCCC-------------CCceecceeeEEeccc------ccCCCCccE----- Confidence 9999999998844 3222 122222221110 0112223322111100 000000000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCccc-ccceeeeeeccCcceEEEEEeecCCCcc--cceEEE--Eee-cCCCc Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIA-SEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSI--YRK-GAETG 370 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~a-S~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~I--YR~-~~~~G 370 (468) --.+-++-.+..+.+.|-+.. +... .....+.... -+.++. |+-+.+ |.. ..++| T Consensus 349 -----------i~~gd~~~~~~~~~~~~~~v~~~~~~-------~~~~~~r~~~-r~d~~~~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:94 349 -----------LIIGNLKDAIVLFDRSQYQASWTDYM-------HFGECLMIAV-RQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred -----------EEEEehhccEEEEeecceEEEEeccc-------cCceEEEEEE-EeccEEeccccEEEEEEeccCCCCC Confidence 000111100111121111100 0000 0000011000 001100 000000 000 01223 Q ss_pred eeEEEE Q lcl|NC_020871. 371 LFYLIA 376 (468) Q Consensus 371 ~f~~ig 376 (468) +.++-. T Consensus 410 ~~~~~~ 415 (415) T protein:vir:94 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 332222 No 84 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=93.96 E-value=0.0058 Score=32.84 Aligned_cols=319 Identities=15% Similarity=0.010 Sum_probs=140.0 Q ss_pred CCCccc----chhhcccC----------------hhhHHHHHHHHhhccccc--CcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPKNNK----EEEVKEVN----------------LNSVQEDALKSFTTGYGI--TPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~~~~----~~~~~~~n----------------~~~~~e~~~Ksf~agy~~--~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) +....+ .++....+ ..+....|.+.+..+... ...+-.+|+.|.++.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:98 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 111111 01000000 111122233333233221 11233468889999998888665554 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+.+.+...+..+.-..|......++ .-..+++|++..+ ..++.+......++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l 223 (415) T protein:vir:98 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHH Confidence 43 45555666666655556655554443 3456899988765 67799999999999999888888764 23566678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.-...+++.++.+++.|+-.-+ + .+. +.......+.....+.. +.+.|-.+-..+...|....- T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~--~---~~~----~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~ 293 (415) T protein:vir:98 224 LQELKLWMARTIAATRNKAIIDVITKGS--T---GST----SSGFEKEGKKLEVKKAK-SLDDIKDAINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCc--c---ccc----ccccccccccccccccc-chhHHHHHHHhhhhhccCCCE Confidence 8888888889999999999999984411 1 111 11111222344444443 333333333333445555667 Q ss_pred EecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+||+.+.+.|.. .-+. -|.+.+++.... .+. .|.|.-+...++. +.+++.. ++. T Consensus 294 ~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~-~~~------------~l~G~pV~~~~~~------~~~~~~~---~~~- 349 (415) T protein:vir:98 294 AIVSQTMFAKLDK-MKDKLGNYLIQPDVKEK-TQQ------------RLLGAKIEILPDE------VLGQKGN---NTL- 349 (415) T ss_pred EEEcHHHHHHHHH-hhccCCceeeccCcCCC-CCc------------eecceeeEEeccc------ccCCCCc---cEE- Confidence 9999999998853 3322 233433332111 011 2223222111110 0000000 000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEee---cCCCce Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRK---GAETGL 371 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~---~~~~G~ 371 (468) ..|.++-.+..+++.+-+.- .+ .-......+.... -+.+.. |+-+.+..- ..++|+ T Consensus 350 ------------~~Gd~~~~~~~~~~~~~~v~-----~~-~~~~~~~~~~~~~-r~d~~v~~~~a~~~~~~~~~~~~~~~ 410 (415) T protein:vir:98 350 ------------IIGNLKDAIVLFDRSQYQAS-----WT-DYMHFGECLMIAV-RQDCRILDYKSAIVIEYDDSERGEGD 410 (415) T ss_pred ------------EEEehhccEEEEeecceEEE-----Ee-ccccCceEEEEEE-EeccEEeccccEEEEEEeccCCCCCc Confidence 00111100111111110000 00 0000000010000 011110 000100000 011233 Q ss_pred eEEEE Q lcl|NC_020871. 372 FYLIA 376 (468) Q Consensus 372 f~~ig 376 (468) .++-. T Consensus 411 ~~~~~ 415 (415) T protein:vir:98 411 LGLEA 415 (415) T ss_pred cccCC Confidence 22221 No 85 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=93.96 E-value=0.0058 Score=32.84 Aligned_cols=319 Identities=15% Similarity=0.010 Sum_probs=140.0 Q ss_pred CCCccc----chhhcccC----------------hhhHHHHHHHHhhccccc--CcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPKNNK----EEEVKEVN----------------LNSVQEDALKSFTTGYGI--TPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~~~~----~~~~~~~n----------------~~~~~e~~~Ksf~agy~~--~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) +....+ .++....+ ..+....|.+.+..+... ...+-.+|+.|.++.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:81 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 111111 01000000 111122233333233221 11233468889999998888665554 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+.+.+...+..+.-..|......++ .-..+++|++..+ ..++.+......++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l 223 (415) T protein:vir:81 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHH Confidence 43 45555666666655556655554443 3456899988765 67799999999999999888888764 23566678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.-...+++.++.+++.|+-.-+ + .+. +.......+.....+.. +.+.|-.+-..+...|....- T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~--~---~~~----~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~ 293 (415) T protein:vir:81 224 LQELKLWMARTIAATRNKAIIDVITKGS--T---GST----SSGFEKEGKKLEVKKAK-SLDDIKDAINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCc--c---ccc----ccccccccccccccccc-chhHHHHHHHhhhhhccCCCE Confidence 8888888889999999999999984411 1 111 11111222344444443 333333333333445555667 Q ss_pred EecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+||+.+.+.|.. .-+. -|.+.+++.... .+. .|.|.-+...++. +.+++.. ++. T Consensus 294 ~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~-~~~------------~l~G~pV~~~~~~------~~~~~~~---~~~- 349 (415) T protein:vir:81 294 AIVSQTMFAKLDK-MKDKLGNYLIQPDVKEK-TQQ------------RLLGAKIEILPDE------VLGQKGN---NTL- 349 (415) T ss_pred EEEcHHHHHHHHH-hhccCCceeeccCcCCC-CCc------------eecceeeEEeccc------ccCCCCc---cEE- Confidence 9999999998853 3322 233433332111 011 2223222111110 0000000 000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEee---cCCCce Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRK---GAETGL 371 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~---~~~~G~ 371 (468) ..|.++-.+..+++.+-+.- .+ .-......+.... -+.+.. |+-+.+..- ..++|+ T Consensus 350 ------------~~Gd~~~~~~~~~~~~~~v~-----~~-~~~~~~~~~~~~~-r~d~~v~~~~a~~~~~~~~~~~~~~~ 410 (415) T protein:vir:81 350 ------------IIGNLKDAIVLFDRSQYQAS-----WT-DYMHFGECLMIAV-RQDCRILDYKSAIVIEYDDSERGEGD 410 (415) T ss_pred ------------EEEehhccEEEEeecceEEE-----Ee-ccccCceEEEEEE-EeccEEeccccEEEEEEeccCCCCCc Confidence 00111100111111110000 00 0000000010000 011110 000100000 011233 Q ss_pred eEEEE Q lcl|NC_020871. 372 FYLIA 376 (468) Q Consensus 372 f~~ig 376 (468) .++-. T Consensus 411 ~~~~~ 415 (415) T protein:vir:81 411 LGLEA 415 (415) T ss_pred cccCC Confidence 22221 No 86 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=93.96 E-value=0.0058 Score=32.84 Aligned_cols=319 Identities=15% Similarity=0.010 Sum_probs=140.0 Q ss_pred CCCccc----chhhcccC----------------hhhHHHHHHHHhhccccc--CcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPKNNK----EEEVKEVN----------------LNSVQEDALKSFTTGYGI--TPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~~~~----~~~~~~~n----------------~~~~~e~~~Ksf~agy~~--~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) +....+ .++....+ ..+....|.+.+..+... ...+-.+|+.|.++.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:79 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 111111 01000000 111122233333233221 11233468889999998888665554 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+.+.+...+..+.-..|......++ .-..+++|++..+ ..++.+......++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l 223 (415) T protein:vir:79 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHH Confidence 43 45555666666655556655554443 3456899988765 67799999999999999888888764 23566678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.-...+++.++.+++.|+-.-+ + .+. +.......+.....+.. +.+.|-.+-..+...|....- T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~--~---~~~----~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~ 293 (415) T protein:vir:79 224 LQELKLWMARTIAATRNKAIIDVITKGS--T---GST----SSGFEKEGKKLEVKKAK-SLDDIKDAINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCc--c---ccc----ccccccccccccccccc-chhHHHHHHHhhhhhccCCCE Confidence 8888888889999999999999984411 1 111 11111222344444443 333333333333445555667 Q ss_pred EecCHHHHhhHHHhhcCC-ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSK-QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~-qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+||+.+.+.|.. .-+. -|.+.+++.... .+. .|.|.-+...++. +.+++.. ++. T Consensus 294 ~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~-~~~------------~l~G~pV~~~~~~------~~~~~~~---~~~- 349 (415) T protein:vir:79 294 AIVSQTMFAKLDK-MKDKLGNYLIQPDVKEK-TQQ------------RLLGAKIEILPDE------VLGQKGN---NTL- 349 (415) T ss_pred EEEcHHHHHHHHH-hhccCCceeeccCcCCC-CCc------------eecceeeEEeccc------ccCCCCc---cEE- Confidence 9999999998853 3322 233433332111 011 2223222111110 0000000 000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEee---cCCCce Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRK---GAETGL 371 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~---~~~~G~ 371 (468) ..|.++-.+..+++.+-+.- .+ .-......+.... -+.+.. |+-+.+..- ..++|+ T Consensus 350 ------------~~Gd~~~~~~~~~~~~~~v~-----~~-~~~~~~~~~~~~~-r~d~~v~~~~a~~~~~~~~~~~~~~~ 410 (415) T protein:vir:79 350 ------------IIGNLKDAIVLFDRSQYQAS-----WT-DYMHFGECLMIAV-RQDCRILDYKSAIVIEYDDSERGEGD 410 (415) T ss_pred ------------EEEehhccEEEEeecceEEE-----Ee-ccccCceEEEEEE-EeccEEeccccEEEEEEeccCCCCCc Confidence 00111100111111110000 00 0000000010000 011110 000100000 011233 Q ss_pred eEEEE Q lcl|NC_020871. 372 FYLIA 376 (468) Q Consensus 372 f~~ig 376 (468) .++-. T Consensus 411 ~~~~~ 415 (415) T protein:vir:79 411 LGLEA 415 (415) T ss_pred cccCC Confidence 22221 No 87 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=93.79 E-value=0.0064 Score=32.63 Aligned_cols=318 Identities=15% Similarity=0.014 Sum_probs=145.0 Q ss_pred CCCcccchhhccc--------------------ChhhHHHHHHHHhhcccc--cCcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEV--------------------NLNSVQEDALKSFTTGYG--ITPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~~~~~~~~~~~--------------------n~~~~~e~~~Ksf~agy~--~~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) +-..++..+.+.. ...+...+|.+....+.. ....+-.+|+.+.++.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:46 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHh Confidence 0000000000000 000111122222222211 111223468889999999988655444 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+++.+...++.+.-..|......++ ....+++|++..+ .+++.+.......+-++.-..+|.-+- .++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l 223 (415) T protein:vir:46 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNV 223 (415) T ss_pred hh--hhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhHHHH-hhchHHH Confidence 33 56666666666665556665554443 3456899998776 678999999999999998877776432 3456788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.....+...++.+++.|+-.=. + .+ ++.......+.....+...-.++++ +-..+...+....- T Consensus 224 ~~~i~~~l~~~i~~~~d~~il~g~g~g~--~---~~----~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~~~~~~ 293 (415) T protein:vir:46 224 LQELKLWMARTIAATRNKAIIDVITKGS--T---GS----TSSGFEKEGKKLEVKKAKSLDDIKD-AINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCC--c---cc----cccccccccceeccccccchHHHHH-HHHhhhhhccCCCE Confidence 8889999999999999999999974311 1 11 1222222334444444443334443 32223344455667 Q ss_pred EecCHHHHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+||+.+.+.|.. .-+.+ |.+.+++..+.. .-.|.|..+...++. +.+++... . T Consensus 294 ~v~n~~~~~~L~~-lkd~~G~~i~~~~~~~~~-------------~~~l~G~pV~~~~~~------~~~~~~~~---~-- 348 (415) T protein:vir:46 294 AIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKT-------------QQRLLGAKIEILPDE------VLGQKGNN---T-- 348 (415) T ss_pred EEEcHHHHHHHHH-hhccCCCeeeccCcCCCC-------------CccccceeeEEeccc------cccCCCcc---E-- Confidence 8999999998843 33332 333333321110 002223322211110 00000000 0 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCccc-ccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEe---ecCCCc Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIA-SEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYR---KGAETG 370 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~a-S~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR---~~~~~G 370 (468) --.+.++..+...++.+-+.- +.-. .....+.... -..+.. |+.+.+-. .+.+.| T Consensus 349 -----------~~~gd~~~~~~~~~~~~~~v~~~~~~-------~~~~~~~~~~-r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:46 349 -----------LIIGNLKDAIVLFDRSQYQASWTDYM-------HFGECLMIAV-RQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred -----------EEEEehhccEEEEeecceEEEeeccc-------cCceEEEEEE-EeccEEeccccEEEEEeeccCCCCC Confidence 001122211222222221110 0000 0011111111 111111 11111111 122446 Q ss_pred eeEEEE Q lcl|NC_020871. 371 LFYLIA 376 (468) Q Consensus 371 ~f~~ig 376 (468) +.++-. T Consensus 410 ~~~~~~ 415 (415) T protein:vir:46 410 DLGLEA 415 (415) T ss_pred CccCCC Confidence 665544 No 88 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=93.79 E-value=0.0064 Score=32.63 Aligned_cols=318 Identities=15% Similarity=0.014 Sum_probs=145.0 Q ss_pred CCCcccchhhccc--------------------ChhhHHHHHHHHhhcccc--cCcccccCccccchhhhhhHhhhhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEV--------------------NLNSVQEDALKSFTTGYG--ITPDTQTDAGALRREFLDDQISMLTWT 58 (468) Q Consensus 1 ~~~~~~~~~~~~~--------------------n~~~~~e~~~Ksf~agy~--~~p~~~~~gaALr~esld~~i~~L~~~ 58 (468) +-..++..+.+.. ...+...+|.+....+.. ....+-.+|+.+.++.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:47 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHh Confidence 0000000000000 000111122222222211 111223468889999999988655444 Q ss_pred cccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_020871. 59 ENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDP 137 (468) Q Consensus 59 ~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp 137 (468) .. .+++.+...++.+.-..|......++ ....+++|++..+ .+++.+.......+-++.-..+|.-+- .++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l 223 (415) T protein:vir:47 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNV 223 (415) T ss_pred hh--hhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhHHHH-hhchHHH Confidence 33 56666666666665556665554443 3456899998776 678999999999999998877776432 3456788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 138 MQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 138 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) +....+.....+...++.+++.|+-.=. + .+ ++.......+.....+...-.++++ +-..+...+....- T Consensus 224 ~~~i~~~l~~~i~~~~d~~il~g~g~g~--~---~~----~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~~~~~~ 293 (415) T protein:vir:47 224 LQELKLWMARTIAATRNKAIIDVITKGS--T---GS----TSSGFEKEGKKLEVKKAKSLDDIKD-AINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCC--c---cc----cccccccccceeccccccchHHHHH-HHHhhhhhccCCCE Confidence 8889999999999999999999974311 1 11 1222222334444444443334443 32223344455667 Q ss_pred EecCHHHHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+||+.+.+.|.. .-+.+ |.+.+++..+.. .-.|.|..+...++. +.+++... . T Consensus 294 ~v~n~~~~~~L~~-lkd~~G~~i~~~~~~~~~-------------~~~l~G~pV~~~~~~------~~~~~~~~---~-- 348 (415) T protein:vir:47 294 AIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKT-------------QQRLLGAKIEILPDE------VLGQKGNN---T-- 348 (415) T ss_pred EEEcHHHHHHHHH-hhccCCCeeeccCcCCCC-------------CccccceeeEEeccc------cccCCCcc---E-- Confidence 8999999998843 33332 333333321110 002223322211110 00000000 0 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCccc-ccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEe---ecCCCc Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIA-SEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYR---KGAETG 370 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~a-S~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR---~~~~~G 370 (468) --.+.++..+...++.+-+.- +.-. .....+.... -..+.. |+.+.+-. .+.+.| T Consensus 349 -----------~~~gd~~~~~~~~~~~~~~v~~~~~~-------~~~~~~~~~~-r~d~~v~~~~a~~~~~~~~~~~~~~ 409 (415) T protein:vir:47 349 -----------LIIGNLKDAIVLFDRSQYQASWTDYM-------HFGECLMIAV-RQDCRILDYKSAIVIEYDDSERGEG 409 (415) T ss_pred -----------EEEEehhccEEEEeecceEEEeeccc-------cCceEEEEEE-EeccEEeccccEEEEEeeccCCCCC Confidence 001122211222222221110 0000 0011111111 111111 11111111 122446 Q ss_pred eeEEEE Q lcl|NC_020871. 371 LFYLIA 376 (468) Q Consensus 371 ~f~~ig 376 (468) +.++-. T Consensus 410 ~~~~~~ 415 (415) T protein:vir:47 410 DLGLEA 415 (415) T ss_pred CccCCC Confidence 665544 No 89 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=93.71 E-value=0.0066 Score=32.52 Aligned_cols=310 Identities=14% Similarity=0.093 Sum_probs=140.8 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccc----------cCcccccCccccchhhhhhHhhhhhhccccccchhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYG----------ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAK 70 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~----------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k 70 (468) .|...+. ++ ...+.-++|.+.+-.|-. ..-.+..+|+.|-++.+.++|..+.... ..+++.+.. T Consensus 69 ~~~~~~~---~~-~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~--~~l~~~~~~ 142 (407) T protein:vir:48 69 RPAGGTQ---NK-VASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDE--VVMRQEATV 142 (407) T ss_pred ccccccc---cc-hhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhh--hhhhhhcee Confidence 1111111 11 111122233333322211 1112334578899999999886655443 345555555 Q ss_pred cchhhhhhccceeeeeccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHH Q lcl|NC_020871. 71 KPATSTVAKYDVYMQHGKVGHTRFTREIGVA-PVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNI 149 (468) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~ 149 (468) .+..+.-..|-+..... ...+++|++.. +..++.+......++=++.-..+|.-+ +.++..|.+....+.-...+ T Consensus 143 ~~~~~~~~~~~~~~~~~---~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~i 218 (407) T protein:vir:48 143 ITLGGSDYKKLVNLGGT---TSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEDWINSELALEF 218 (407) T ss_pred eecCCCceEEEEecCCc---ceeeecccccccccccccceeEEeeeeeeEeehhhHHHH-HhcchHHHHHHHHHHHHHHH Confidence 55555444444444322 35589999975 567799999999998777777777653 23466788888888888889 Q ss_pred HHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc--ce----------eeccCCCCCHHHHhhhhhhhhhccCceEE Q lcl|NC_020871. 150 AKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD--NV----------HDARGASLTESLLNQAAVMISKGYGTPTD 217 (468) Q Consensus 150 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~--nv----------iDarG~~ls~~~l~~~a~~i~~~fG~~td 217 (468) ...+|.++++||-. + +..||.+..... +. .-..-..++.+.|-.+...+..+|-.... T Consensus 219 ~~~~~~a~l~G~G~-~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~ 288 (407) T protein:vir:48 219 AEQEEIAFTSGDGS-K---------KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAK 288 (407) T ss_pred HHHHHhhhhccCCC-C---------ccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCE Confidence 99999999999843 1 345666543211 11 11122234545444433333444444445 Q ss_pred EecCHHHHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEE Q lcl|NC_020871. 218 AYMPVGVQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTAT 296 (468) Q Consensus 218 ~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat 296 (468) .+|++.+.+.|. ..-+.+ |-+.+++.... . .-.|.|.-++..++. |...+...+ T Consensus 289 ~v~n~~~~~~L~-~lkD~~Gr~l~~~~~~~g---~----------~~~l~G~PV~~~~~~--------p~~~~~~~~--- 343 (407) T protein:vir:48 289 FMMNNSSLFAIR-LLKDNDGNYLWRPGIELG---Q----------PSSLAGYGIVENEQM--------PDIAADAKA--- 343 (407) T ss_pred EEEcHHHHHHHH-HhhccCCceeeccCcCCC---C----------CceecceeeEEecCc--------CCccCCccE--- Confidence 899999998884 343432 33433332211 0 001223333332221 110000000 Q ss_pred ecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeec-----CCCce Q lcl|NC_020871. 297 QEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKG-----AETGL 371 (468) Q Consensus 297 ~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~-----~~~G~ 371 (468) . ..|-+++.+..+++.|-.. ..........+ .++...|=. ++.-. T Consensus 344 -i----------~~Gd~~~~~~i~~~~~~~i-------~~d~~~~~~~~------------~~~~~~r~d~~v~~~~a~~ 393 (407) T protein:vir:48 344 -I----------AFGNFKRGYTIVDRIGTRI-------LRDPYTNKPFV------------GFYTTKRTGGMLVDSQAIK 393 (407) T ss_pred -E----------EEEeccccEEEEEeeceEE-------EeeccccCCcE------------EEEEEEEeccEEecccceE Confidence 0 0112222222222222110 00000000001 111112211 01112 Q ss_pred eEEEEEEecccccC Q lcl|NC_020871. 372 FYLIARVPASKAEN 385 (468) Q Consensus 372 f~~igrv~~s~~~~ 385 (468) ...++..+.+.+.. T Consensus 394 ~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 394 LMKIGAATRQKAAA 407 (407) T ss_pred EEEeeccCCCCCCC Confidence 22222222222111 No 90 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=93.51 E-value=0.0073 Score=32.30 Aligned_cols=312 Identities=11% Similarity=0.021 Sum_probs=130.2 Q ss_pred CCC----------------cccchhhc------ccChhhHHHHH--------HHHhhcccccCcccccCccccchhhhhh Q lcl|NC_020871. 1 MPK----------------NNKEEEVK------EVNLNSVQEDA--------LKSFTTGYGITPDTQTDAGALRREFLDD 50 (468) Q Consensus 1 ~~~----------------~~~~~~~~------~~n~~~~~e~~--------~Ksf~agy~~~p~~~~~gaALr~esld~ 50 (468) +.+ ..+..... ...-...++.. .+++. ......-+.++++.+-++.+.+ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~vp~~~~~ 136 (413) T protein:vir:81 58 SVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAAS-DPASTATLTDEFQGGYGTTWNR 136 (413) T ss_pred HHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhh-hhhhhcccccccccccchhhHH Confidence 000 00000000 00000000000 01110 0011112235677777888888 Q ss_pred Hhhhhhhccccccchhhhcccchhhhhhccceeeeec-cccccccccccccccccC-cceEEEEEEEEeeeehhhhhhhH Q lcl|NC_020871. 51 QISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHG-KVGHTRFTREIGVAPVSD-PNIRQKTVNMKFASDTKNISIAA 128 (468) Q Consensus 51 ~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG-~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~ 128 (468) +|..+..... .+.+.+...+..+.-.+|.+..... ..+...+++|++....++ +.+.+....++=++.-..+|..+ T Consensus 137 ~ii~~~~~~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el 214 (413) T protein:vir:81 137 NIIYRRREKL--VVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEM 214 (413) T ss_pred HHHHHHhhhh--hHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHH Confidence 8755544433 4555566666665555666655332 234567899998876665 78999999998888777888753 Q ss_pred hhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhh Q lcl|NC_020871. 129 GLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMI 208 (468) Q Consensus 129 ~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i 208 (468) |.++ .+.+....+.-...+++.+|.++++|+-. +-.+.||.+......+ -..+..-..+.|.++...+ T Consensus 215 -l~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~---------~~~~~Gi~~~~~~~~~-~~~~~~~~~~~i~~~~~~~ 282 (413) T protein:vir:81 215 -IEDY-DFLVSYINARLLEELAIEEERQLLLGDGT---------GNNLTGLLKRDGIQTL-AVSNKDELADSIYKAMTNI 282 (413) T ss_pred -HHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCC---------CCcccccccccccccc-cccccchhHHHHHHHHHHh Confidence 3333 45667777777889999999999999721 2235677665433211 1111112244454444333 Q ss_pred h-hccCceEEEecCHHHHhhHHHhh-cCCceEEeecCCCcc----------eeeeeccceeecCCccccCCCEeeccccc Q lcl|NC_020871. 209 S-KGYGTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNV----------SVGFNIQGFHSARGFIKLHGSTVMENEQI 276 (468) Q Consensus 209 ~-~~fG~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~----------~~G~~v~~~~s~~g~i~l~gs~i~~~~n~ 276 (468) . .....++-++|++.+.+.+...- -.++..+.++-.+.. -.|.+| +++.. ++ .+..++.+..- T Consensus 283 ~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv--~~s~~--~~-~~~~~~gd~~~ 357 (413) T protein:vir:81 283 SLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRT--VQSQV--VP-VGKPVVGAFRS 357 (413) T ss_pred hhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceee--EEcCC--CC-cccEEEEeccc Confidence 3 33345666999999998884322 122333322211100 012221 11100 00 11111111110 Q ss_pred ccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcc Q lcl|NC_020871. 277 LDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSR 356 (468) Q Consensus 277 l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~ 356 (468) .....-. ....+-..... ..-|.. ....|++..--+-.---++..+ .|+++-++. T Consensus 358 ---~~~~~~~--~~~~v~~~~~~--~~~~~~---~~~~~r~~~r~d~~~~~~~a~~-----------~l~~~~~~~---- 412 (413) T protein:vir:81 358 ---AASVLRK--GGVRIDSTNTN--VDDFEN---NLITVRAEERVGLMVTFPEAIV-----------QLDVAEVVT---- 412 (413) T ss_pred ---EEEEEEe--cceEEEEeccc--cchhhc---CcEEEEEEEeeccEEecccceE-----------EEEecCCCC---- Confidence 0000000 00000000000 000000 1112222111110000011111 111111111 Q ss_pred c Q lcl|NC_020871. 357 P 357 (468) Q Consensus 357 ~ 357 (468) | T Consensus 413 p 413 (413) T protein:vir:81 413 P 413 (413) T ss_pred C Confidence 1 No 91 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=93.22 E-value=0.0083 Score=31.99 Aligned_cols=306 Identities=12% Similarity=0.043 Sum_probs=126.4 Q ss_pred CCCcccchhhcccChhhH--HHHHHHHhhccc---------ccCcccccCccccchhhhhhHhhhhhhccccccchhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSV--QEDALKSFTTGY---------GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIA 69 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~--~e~~~Ksf~agy---------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~ 69 (468) -...+.+....+++.... -+.+.++...+. .....+..+|+.+-.+..++.|..+. +.+..++.+. T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~---~~~~~l~~~~ 145 (392) T protein:vir:13 69 SLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAV---ERSAIMRGGA 145 (392) T ss_pred HHhcccCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHH---hhhhhhhhcc Confidence 000001111111111111 112233322221 11112223344555554444443332 2334444443 Q ss_pred ccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHH Q lcl|NC_020871. 70 KKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNI 149 (468) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~ 149 (468) ...-.+.-..|.....-| .....+++|++..+.+++.+.+....++=++.--.+|.-+ |.++..|.+....+.-...+ T Consensus 146 ~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~i 223 (392) T protein:vir:13 146 STFTTSDANPMDFTVITG-RATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEF-ATDQVLDLVGFLVSDAGPAI 223 (392) T ss_pred eeeecCCCceeEEEEEcC-CcceeeecccccccccccceeeEEeeeeeEEeeehhHHHH-HhcchHHHHHHHHHHHHHHH Confidence 322222223333222222 2345689999999999999999999998888777777653 23456678888888888999 Q ss_pred HHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-c-ceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhh Q lcl|NC_020871. 150 AKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-D-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQAD 227 (468) Q Consensus 150 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-~-nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~ 227 (468) ++.++.++|+||-. + +--||.+.... . .+..+....++-+.|.++-.-+..+|....-..|++.+.+. T Consensus 224 ~~~~d~~~l~G~Gt-----~-----~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~ 293 (392) T protein:vir:13 224 GDAMGRHFLTGTGT-----G-----QPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQ 293 (392) T ss_pred HHHHHHHHhcccCC-----c-----cccccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHH Confidence 99999999999842 1 12355553321 1 12223334455444444333334456555568999999988 Q ss_pred HHHhhcCC-ceEEeecCCCcc----eeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCC Q lcl|NC_020871. 228 FVNQQLSK-QTQLVRDNGNNV----SVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKK 302 (468) Q Consensus 228 ~~~~~~~~-qr~v~~~n~~~~----~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~ 302 (468) |.. ..+. -|.+.+++.... -.|.+| +++..-. .+..++.+..- .. ...-..++-..... T Consensus 294 l~~-lkd~~G~~l~~~~~~~g~~~~l~G~Pv--~~~~~~~---~~~i~~Gdf~~-----~~---i~~~~~~~i~~~~~-- 357 (392) T protein:vir:13 294 MRK-LKDANGQYLWQSALTVGAPDTFNGKVV--ETDDGMP---ADKVLFADLSK-----YR---VRFAGSLRVDRSVD-- 357 (392) T ss_pred HHH-hhccCCceeecCCcCCCCCceecceee--EEcCCCC---CCcEEEeeccc-----ee---EEeecceEEEeecc-- Confidence 844 3333 233333322111 123222 1111000 01111111000 00 00000000000000 Q ss_pred CCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeec Q lcl|NC_020871. 303 GQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAP 351 (468) Q Consensus 303 g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~ 351 (468) ..|.. +...|++..--+..---|.. -+-++++..+ T Consensus 358 ~~~~~---~~~~~r~~~r~d~~~~~~~A-----------~~~~~~~~aa 392 (392) T protein:vir:13 358 AKFST---DQIVYRFLQRADGLLVDARG-----------AKVLTVTPAA 392 (392) T ss_pred ccccC---CcEEEEEEEEeccEEecccc-----------eEEEEeeccC Confidence 00000 01111111100000000111 1112222222 No 92 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=92.80 E-value=0.0099 Score=31.57 Aligned_cols=288 Identities=15% Similarity=0.113 Sum_probs=133.6 Q ss_pred HHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch-hhhhhccceeeeecccc-c-cccc Q lcl|NC_020871. 19 QEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA-TSTVAKYDVYMQHGKVG-H-TRFT 95 (468) Q Consensus 19 ~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~-~stv~ey~~~~~hG~~g-~-~~fv 95 (468) -|++.|.|+.=-.++-.+. +|+-|.+|-+++-|..|.... .|.+.+...+. .|.-.+..++ ++|+.- . ..-. T Consensus 1 ~~~~~~~~~~~k~it~~d~-~gG~L~P~~~~~~i~~l~e~s---~i~~~a~vi~t~~s~~~~i~~i-~~g~~~~~~~~~~ 75 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDL-GKGILAVQRFGEFVREVRENS---AIIKDARVLNALKSYEVDISRI-SLGVELEPGRNTS 75 (314) T ss_pred CchhhhHHHhhcccccccC-CCceeChHHHHHHHHHHHhcc---chhhheeeecccCccceeeccc-ccCcccccccccc Confidence 5666666664434444333 467899999987665554322 23333322111 2222223322 333321 1 1123 Q ss_pred cccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchh--hHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCC Q lcl|NC_020871. 96 REIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQ--DPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAG 173 (468) Q Consensus 96 ~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~--Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~g 173 (468) +|......+|+++.+....+|=|+.--.+|.- -|.++.. |-+....+.=..++....|.+.|-||.+...+ .+ T Consensus 76 ~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e-~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~----~~ 150 (314) T protein:vir:41 76 GTKVAPTADEVTVSTNTLEMKELVTKVVLEDE-ALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTG----RE 150 (314) T ss_pred cCCccCCcccccccceeeeeEEEEEeecccHH-HHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCc----cc Confidence 45555677889999888888877764444332 1234543 77777777778899999999999999765322 12 Q ss_pred c--cccchhhhcCccceeeccCC--CCCHHHHhhhhhhhhhcc-CceE--EEecCHHHHhhHHHhhcCCceEEeecCCCc Q lcl|NC_020871. 174 L--EFDGLAKLINQDNVHDARGA--SLTESLLNQAAVMISKGY-GTPT--DAYMPVGVQADFVNQQLSKQTQLVRDNGNN 246 (468) Q Consensus 174 l--eFDGl~~li~~~nviDarG~--~ls~~~l~~~a~~i~~~f-G~~t--d~~m~~~v~a~~~~~~~~~qr~v~~~n~~~ 246 (468) | +.||+.+... ..+.+..+. ..+.+.|..+-.-+-..| -..+ -.+|++.+...+....-++++-+-++. T Consensus 151 ~~~~p~G~l~~a~-~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~--- 226 (314) T protein:vir:41 151 LYRINDGWMKLAG-NQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSA--- 226 (314) T ss_pred chhcchhhhhhcc-cceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchh--- Confidence 3 7899988653 235554433 366777765544444434 2232 478999999888554444433321110 Q ss_pred ceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCC------------CcCcccceeEE Q lcl|NC_020871. 247 VSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKG------------QFRAEDLAAHE 314 (468) Q Consensus 247 ~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g------------~~~~~~~~~y~ 314 (468) +...+...+.|.-+.....+ |.+..+..... -++-+. .........+. T Consensus 227 ----------~~~~~~~~l~G~PV~~~~~~--------~~~~~~~~~i~--fgd~~nlv~~~~~~ir~~~~~~a~~~~~~ 286 (314) T protein:vir:41 227 ----------LIGATGLQYDGIPIQYVPAL--------DALGDDKARAL--LTVPTNLVYGFWRNIRIEPKRDAAMRRTE 286 (314) T ss_pred ----------hhCCCCceecceeeEecccc--------cccCCCCceEE--EechhheEEEeeceeEEeecccCcCCeEE Confidence 01111222222222221111 11111100000 000000 00000011222 Q ss_pred EEEEEEcccC--CcccccceeeeeeccCcc Q lcl|NC_020871. 315 YKVVVSSDDA--ESIASEVATATVTAKDDG 342 (468) Q Consensus 315 YkVtavn~~G--ES~aS~~vt~Tv~a~~~g 342 (468) |..+.-=+.+ ++.+ ++-+++...+.| T Consensus 287 ~~~~~r~d~~~~~~~a--a~~~~~~~~~~~ 314 (314) T protein:vir:41 287 YIASLRADCNYEDENA--AVAAVIDMSSGG 314 (314) T ss_pred EEEEEEeceEEEEcCc--EEEEEeeccCCC Confidence 3222211111 1111 112222222222 No 93 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=92.63 E-value=0.011 Score=31.41 Aligned_cols=295 Identities=15% Similarity=0.089 Sum_probs=130.1 Q ss_pred CCCcccchhhcccChhhHHH-------HHHHHhhc---ccc-------cCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQE-------DALKSFTT---GYG-------ITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e-------~~~Ksf~a---gy~-------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) -+..+...+....+.....+ ..++.... +-. ..-.+-++|+.|-++.+.++|........ . T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~--~ 167 (425) T protein:vir:95 90 KQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYT--T 167 (425) T ss_pred hccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhh--h Confidence 00000000000000000000 11111110 000 01122356888999999988855444332 4 Q ss_pred chhhhcccchhhhhhccceeeeeccccccccccccccccccC-cceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSD-PNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILT 142 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~ 142 (468) +++.+...+....+ ++.+. ++.+.+.|+.|++..+..+ +.+.......+=++.-..+|.-+ +.++..|.+.... T Consensus 168 i~~~~~~~~~~g~~-~ip~~---~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~ 242 (425) T protein:vir:95 168 LYPLVDKIRVKGTT-RILVD---TDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYL-LQDSIINLDDYVT 242 (425) T ss_pred HHHhhceeecCcee-EEEEe---cCCccccccccccccccccccccceeeeeheeeeeeehhhHHH-HhccHHHHHHHHH Confidence 55555554544433 44444 3445567999999865555 78988888888777655555532 2335567788888 Q ss_pred HHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEE--Eec Q lcl|NC_020871. 143 DDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTD--AYM 220 (468) Q Consensus 143 ~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td--~~m 220 (468) +.-...+++.+|.++|+|+..-+ -++.||.+-+...+.+...+..++.+.|..+...+..++..... .+| T Consensus 243 ~~l~~~i~~~~d~~il~G~G~~~--------~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 314 (425) T protein:vir:95 243 KKIARAIAKALDLAIVKGTGAAN--------KQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVM 314 (425) T ss_pred HHHHHHHHHHHHHHhhccCCCCc--------cccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEE Confidence 88888999999999999984422 14567776554433333444555666666655555666654433 457 Q ss_pred CHHHH-hhHH--Hhhc--CCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeE Q lcl|NC_020871. 221 PVGVQ-ADFV--NQQL--SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTA 295 (468) Q Consensus 221 ~~~v~-a~~~--~~~~--~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vta 295 (468) ++.+. +.+. ...- .++++.++++.+.. .|-|.-++.++.. |..+.+ T Consensus 315 ~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~----------------~l~G~pvv~~~~~-----------~~~~i~-- 365 (425) T protein:vir:95 315 KRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTP----------------DLLGLRVVFNNFL-----------DDDTVL-- 365 (425) T ss_pred eChHHHHHHHHHHhhcCCCCceeeccCCCCCc----------------cccceeeEEcCcC-----------CCccEE-- Confidence 76652 2221 1111 22333332322211 1222222221111 000000 Q ss_pred EecCCCCCCcCcccceeEEEEEEEEcccCCcc-ccccee---------------eeeeccCcceEEEEEeecCCCc Q lcl|NC_020871. 296 TQEAGKKGQFRAEDLAAHEYKVVVSSDDAESI-ASEVAT---------------ATVTAKDDGVKLEIELAPMYSS 355 (468) Q Consensus 296 t~~~~~~g~~~~~~~~~y~YkVtavn~~GES~-aS~~vt---------------~Tv~a~~~g~~ltIT~~~~~ga 355 (468) .|-++|.+. ..+.+-+. -|..+. .-+.-...-+.++||- |+.|| T Consensus 366 --------------~Gd~~~~~~-~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~-~~~g~ 425 (425) T protein:vir:95 366 --------------FGEFEQYTL-VERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITD-PVQGA 425 (425) T ss_pred --------------EEecccEEE-EeecceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEecC-cCCCC Confidence 011111110 11111000 000000 0000001112222321 23333 No 94 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=92.40 E-value=0.012 Score=31.20 Aligned_cols=305 Identities=14% Similarity=0.134 Sum_probs=134.1 Q ss_pred CCCcccchhhcc---------------------------cChhhHHHHHHHHhh---ccc-------ccCcccccCcccc Q lcl|NC_020871. 1 MPKNNKEEEVKE---------------------------VNLNSVQEDALKSFT---TGY-------GITPDTQTDAGAL 43 (468) Q Consensus 1 ~~~~~~~~~~~~---------------------------~n~~~~~e~~~Ksf~---agy-------~~~p~~~~~gaAL 43 (468) .|+..+.+...+ .+....-.++.++|. +|. ..+. +-.+|+.| T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~-~t~~GG~l 153 (434) T protein:vir:62 75 DPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGL-VTGNGSVT 153 (434) T ss_pred hhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcc-ccccccee Confidence 111110000000 011111112333332 110 0011 11347788 Q ss_pred chhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhh Q lcl|NC_020871. 44 RREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKN 123 (468) Q Consensus 44 r~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~ 123 (468) -++.+...|..+..... .+.+...+.+..+. ..|.++...+.........|++..+..|+.+.+.....+=++.-.. T Consensus 154 vP~~~~~~Ii~~l~~~~--~i~~~~~~~~~~~~-~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~ 230 (434) T protein:vir:62 154 IPDFLSKEIITYAQEEN--FLRRLGTGVKTKEN-IKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALAT 230 (434) T ss_pred cchhhHHHHHHhhhhhh--hhhhhcceeccCCc-eEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehh Confidence 99999888866544322 33333333333333 4577776555443334567788888999999999999998888777 Q ss_pred hhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhh Q lcl|NC_020871. 124 ISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQ 203 (468) Q Consensus 124 vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~ 203 (468) +|.-+ |.++..|.+....+.-...+++.++.+++.|+-.=+ ...|+.. .+.......+...-.++++. T Consensus 231 iS~el-l~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~---------~~~g~~~--~~~~~~~~~~~~~~d~l~~l 298 (434) T protein:vir:62 231 VTKKL-LARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANN---------INDGALA--KKAVEFKTDEKNLYDALVKM 298 (434) T ss_pred hHHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc---------cccceee--cccccccccccchhhHHHHH Confidence 77653 234556788888889999999999999999984311 1234332 12222333333333444443 Q ss_pred hhhhhhhccCceEEEecCHHHHhhHHHhhcCC--ceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccc Q lcl|NC_020871. 204 AAVMISKGYGTPTDAYMPVGVQADFVNQQLSK--QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERI 281 (468) Q Consensus 204 ~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~--qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~ 281 (468) -..+. ..|..-...+|++.+.+.+.. .-+. ++.+++.+... .|.. -.|.|.-++..+.. T Consensus 299 ~~~l~-~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~--~g~~----------~tl~G~pV~~~~~~----- 359 (434) T protein:vir:62 299 KNTPV-KEVRKKARWVLNTAALTKIET-MKTDDGFPLLRPFNQAE--GGIG----------YTLLGFPVEEEDAI----- 359 (434) T ss_pred Hhhcc-hhhhcCCEEEEcHHHHHHHHH-hhccCCCEeeccCCCcc--CCCC----------ceecceeeEEecCc----- Confidence 33333 334333357899999888843 4333 44444322110 1100 01222222211110 Q ss_pred cccCCCCCCcceeEEecCCCCCCcC-cccc---------------eeEEEEEEEEcccCCcccccceeeeeeccCcceEE Q lcl|NC_020871. 282 LALPTAPQQAKVTATQEAGKKGQFR-AEDL---------------AAHEYKVVVSSDDAESIASEVATATVTAKDDGVKL 345 (468) Q Consensus 282 ~~~p~ap~~~~vtat~~~~~~g~~~-~~~~---------------~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~l 345 (468) +.+.++..+.+ + -+.-+.+. .... ...-|++..- .+|.-.=|+...+ -.++ T Consensus 360 -~~~~~~~~~~i-~---~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r-~Dgk~i~~~~~~~-------~~~~ 426 (434) T protein:vir:62 360 -DIPDSPDTPVF-Y---FGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNL-LDAQLIHSPFEVP-------VYKY 426 (434) T ss_pred -cCccCCCceEE-E---EeeccceEEEEeeceeEEEeehhhhcccCceEEEEEee-ecceeecCcccce-------EEEE Confidence 00111100000 0 00000000 0000 0011111111 1332211111111 1122 Q ss_pred EEEeecCCCc Q lcl|NC_020871. 346 EIELAPMYSS 355 (468) Q Consensus 346 tIT~~~~~ga 355 (468) +++ +..++ T Consensus 427 ~~~--~~~~~ 434 (434) T protein:vir:62 427 VLK--APTGA 434 (434) T ss_pred Eec--cCCCC Confidence 222 22232 No 95 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=92.19 E-value=0.012 Score=31.03 Aligned_cols=310 Identities=13% Similarity=0.027 Sum_probs=131.9 Q ss_pred CCCcccchhhccc----Ch-------------hhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEV----NL-------------NSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~----n~-------------~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) |-+-..++..+.+ +. ...-++..|.+++- ....+-++|+.|-++.+..+|..+..... . T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~--~~~~~~~~gg~lvP~~~~~~I~~~~~~~s--~ 113 (390) T protein:vir:40 38 MAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNEV--IAGNGFAGVTALLPPTVFERVFEDLTVEH--P 113 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHH--HhccCcccCcccccHHHHHHHHHHHHhhh--h Confidence 0000000000000 00 00001112222211 01112346888999999888865544433 4 Q ss_pred chhhhcccchhhhhhccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILT 142 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~ 142 (468) +++.+...++.+.-..+.+. .+.+...++.|++. ++..++.+.+....++=++.-..+|.-+ +.++..|.+.... T Consensus 114 i~~~~~~~~~~~~~~~i~~~---~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~el-l~ds~~~l~~~i~ 189 (390) T protein:vir:40 114 LLSKINFVNTTATTEWIISV---GDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAM-LDLGPSWLDQYVR 189 (390) T ss_pred hhhhceeeecCCceeEEEEE---cCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHHHHH Confidence 56666666665533333333 34455668899776 4578999999999999888777777433 2346678889999 Q ss_pred HHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc---ceeeccCCCCCHHHHhhhhhhhh--------hc Q lcl|NC_020871. 143 DDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD---NVHDARGASLTESLLNQAAVMIS--------KG 211 (468) Q Consensus 143 ~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~---nviDarG~~ls~~~l~~~a~~i~--------~~ 211 (468) +.-...++..++.++++|+-. + +--||.+..... -..+.....++-..+......+. +. T Consensus 190 ~~la~~i~~~~~~a~l~G~G~-----~-----~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~ 259 (390) T protein:vir:40 190 TILGEAMALGLEAGIVNGSGK-----D-----QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS 259 (390) T ss_pred HHHHHHHHHHHHhhhhcccCC-----C-----ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhh Confidence 999999999999999999842 1 123555533211 11111122233222111111111 12 Q ss_pred cCceEEEecCHHHHhhHHH---hhcCC--ceEEeecCCCcceeeeeccceeecCCccccCCCEeeccccc-cccccccc- Q lcl|NC_020871. 212 YGTPTDAYMPVGVQADFVN---QQLSK--QTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQI-LDERILAL- 284 (468) Q Consensus 212 fG~~td~~m~~~v~a~~~~---~~~~~--qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~-l~~~~~~~- 284 (468) ++.+. ++|++.+...+-. .+.+. +.+. +. .-.|..| +.+..- + .+..++.+..- ....+..+ T Consensus 260 ~~~a~-~i~n~~t~~~~l~~~~~~~d~~G~~v~--~~---~~~g~pv--v~~~~~--p-~~~i~~Gd~s~~~i~~~~~~~ 328 (390) T protein:vir:40 260 VSDAI-LVINPADYWSKIYAATSYMTPQGVWVT--GI---LPVPLEI--VQSVAV--P-VGKAVAGRAKDYFMGIGSEQV 328 (390) T ss_pred hcCce-EEEcchhHHHHHHHHhhccCCCCcccc--cc---CCCceeE--EEcCCC--C-CCcEEEEeeceEEEEeecceE Confidence 33222 4677766433212 11111 1110 00 0012222 111110 0 12233322110 00000000 Q ss_pred -CCCCC----C--cceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcce Q lcl|NC_020871. 285 -PTAPQ----Q--AKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGV 343 (468) Q Consensus 285 -p~ap~----~--~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~ 343 (468) ..... . ....+..--++.=. . ..+-...++++.+.+. .++-.+++++.++.+.+- T Consensus 329 v~~~~~~~f~~~~~~~r~~~r~dg~v~-~--~~A~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:40 329 IRTSTEYRLLDDETLYYAKQYANGRPK-D--NSSFLVFDITGLEGSP-AIDVNVVNNATPSETPAE 390 (390) T ss_pred EEecchhhhhcCcEEEEEEEEeCCEEe-c--ccceEEEEeeccCCCC-CCCcceeeCCCCCCCCCC Confidence 00000 0 00111111111000 0 0123334445444222 333445656666555555 No 96 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=92.14 E-value=0.013 Score=30.99 Aligned_cols=261 Identities=16% Similarity=0.128 Sum_probs=127.0 Q ss_pred hcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhc--c--ceeeeecccccccccccccccc Q lcl|NC_020871. 27 TTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK--Y--DVYMQHGKVGHTRFTREIGVAP 102 (468) Q Consensus 27 ~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~e--y--~~~~~hG~~g~~~fv~E~g~~~ 102 (468) =|.+ +-+.+..+.+|.+.+.+....-. -..| ..+.. .+.++.. - -.+-.++..|...++.|++... T Consensus 1 MA~~-----~T~~~~~~iPev~s~~v~~~~~~--~~~~-~~~~~--~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~ 70 (272) T protein:vir:98 1 MAVG-----TTKMAQMLDPEVLADMIDAEVGK--AIRF-APLAE--VDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP 70 (272) T ss_pred CCCc-----cccchheechHHHHHHHHHHHHH--Hhhh-hcccc--ccccccCCCCCEEEEEEecCCCCcccccCCCccc Confidence 1111 11235577777777766332111 1111 11111 0111111 0 0122344556677899999999 Q ss_pred ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhh Q lcl|NC_020871. 103 VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKL 182 (468) Q Consensus 103 ~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~l 182 (468) ..+.+......+++-++....+|..+.+ ++..|++....+.....+++.++-.+|= .++.. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d~~i~~---~~~~a--------------- 131 (272) T protein:vir:98 71 MTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVDADVLD---ALSKS--------------- 131 (272) T ss_pred ccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHHHHHHH---Hhccc--------------- Confidence 9999999999999999999999988754 4567999999999999999999977662 22211 Q ss_pred cCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCc Q lcl|NC_020871. 183 INQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGF 262 (468) Q Consensus 183 i~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~ 262 (468) .+.++ ...+.+.|..|.....+.....+-++||+.+.+.|...-+. +++.... .|..+ -..|. T Consensus 132 ---~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~--~~~~~~~-----~~~~~----~~~g~ 194 (272) T protein:vir:98 132 ---TQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAK--EWLGATE-----VGANR----VVSGV 194 (272) T ss_pred ---ccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccc--ccccccc-----ccccc----ccccc Confidence 11111 12345666666666667777778899999999888433222 1111111 11111 01111 Q ss_pred c-ccCCCEeecccccccccccccCCCCCCcceeEE-----ecCCC-CCCcCcccceeEEEEEEEEcccCCcccccceeee Q lcl|NC_020871. 263 I-KLHGSTVMENEQILDERILALPTAPQQAKVTAT-----QEAGK-KGQFRAEDLAAHEYKVVVSSDDAESIASEVATAT 335 (468) Q Consensus 263 i-~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat-----~~~~~-~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~T 335 (468) + .+.|-.++.+++.-..+..... ..+-.... ..+.- ..++...-.+.+.|-+..++ |+.++..| T Consensus 195 ig~i~G~~Vi~s~~~p~~t~~~~~---~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~------~~~vv~~t 265 (272) T protein:vir:98 195 YGEVLGVQIVRSRKCPKGTAYMVR---KGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYK------AEKAVKIT 265 (272) T ss_pred chhhcCeeEEEcCCCCcceEEEEc---CCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEc------CCceEEEE Confidence 1 3456655555443111111000 00000000 00000 00011111122333333332 23344444 Q ss_pred eeccCcc Q lcl|NC_020871. 336 VTAKDDG 342 (468) Q Consensus 336 v~a~~~g 342 (468) +.+.... T Consensus 266 ~~~a~~~ 272 (272) T protein:vir:98 266 LKDAAKK 272 (272) T ss_pred ecccccC Confidence 4432222 No 97 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=92.14 E-value=0.013 Score=30.99 Aligned_cols=261 Identities=16% Similarity=0.128 Sum_probs=127.0 Q ss_pred hcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhc--c--ceeeeecccccccccccccccc Q lcl|NC_020871. 27 TTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK--Y--DVYMQHGKVGHTRFTREIGVAP 102 (468) Q Consensus 27 ~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~e--y--~~~~~hG~~g~~~fv~E~g~~~ 102 (468) =|.+ +-+.+..+.+|.+.+.+....-. -..| ..+.. .+.++.. - -.+-.++..|...++.|++... T Consensus 1 MA~~-----~T~~~~~~iPev~s~~v~~~~~~--~~~~-~~~~~--~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~ 70 (272) T protein:vir:30 1 MAVG-----TTKMAQMLDPEVLADMIDAEVGK--AIRF-APLAE--VDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP 70 (272) T ss_pred CCCc-----cccchheechHHHHHHHHHHHHH--Hhhh-hcccc--ccccccCCCCCEEEEEEecCCCCcccccCCCccc Confidence 1111 11235577777777766332111 1111 11111 0111111 0 0122344556677899999999 Q ss_pred ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhh Q lcl|NC_020871. 103 VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKL 182 (468) Q Consensus 103 ~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~l 182 (468) ..+.+......+++-++....+|..+.+ ++..|++....+.....+++.++-.+|= .++.. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d~~i~~---~~~~a--------------- 131 (272) T protein:vir:30 71 MTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVDADVLD---ALSKS--------------- 131 (272) T ss_pred ccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHHHHHHH---Hhccc--------------- Confidence 9999999999999999999999988754 4567999999999999999999977662 22211 Q ss_pred cCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCc Q lcl|NC_020871. 183 INQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGF 262 (468) Q Consensus 183 i~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~ 262 (468) .+.++ ...+.+.|..|.....+.....+-++||+.+.+.|...-+. +++.... .|..+ -..|. T Consensus 132 ---~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~--~~~~~~~-----~~~~~----~~~g~ 194 (272) T protein:vir:30 132 ---TQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAK--EWLGATE-----VGANR----VVSGV 194 (272) T ss_pred ---ccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccc--ccccccc-----ccccc----ccccc Confidence 11111 12345666666666667777778899999999888433222 1111111 11111 01111 Q ss_pred c-ccCCCEeecccccccccccccCCCCCCcceeEE-----ecCCC-CCCcCcccceeEEEEEEEEcccCCcccccceeee Q lcl|NC_020871. 263 I-KLHGSTVMENEQILDERILALPTAPQQAKVTAT-----QEAGK-KGQFRAEDLAAHEYKVVVSSDDAESIASEVATAT 335 (468) Q Consensus 263 i-~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat-----~~~~~-~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~T 335 (468) + .+.|-.++.+++.-..+..... ..+-.... ..+.- ..++...-.+.+.|-+..++ |+.++..| T Consensus 195 ig~i~G~~Vi~s~~~p~~t~~~~~---~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~------~~~vv~~t 265 (272) T protein:vir:30 195 YGEVLGVQIVRSRKCPKGTAYMVR---KGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYK------AEKAVKIT 265 (272) T ss_pred chhhcCeeEEEcCCCCcceEEEEc---CCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEc------CCceEEEE Confidence 1 3456655555443111111000 00000000 00000 00011111122333333332 23344444 Q ss_pred eeccCcc Q lcl|NC_020871. 336 VTAKDDG 342 (468) Q Consensus 336 v~a~~~g 342 (468) +.+.... T Consensus 266 ~~~a~~~ 272 (272) T protein:vir:30 266 LKDAAKK 272 (272) T ss_pred ecccccC Confidence 4432222 No 98 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=91.69 E-value=0.015 Score=30.63 Aligned_cols=313 Identities=10% Similarity=0.044 Sum_probs=134.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhh--------cc-cc--------cCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFT--------TG-YG--------ITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~--------ag-y~--------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) ..........+.......-..+.|++- .. .. .+..+..+|+.+-++.+..+|..+..... . T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~--~ 139 (404) T protein:vir:10 62 EDNVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTT--D 139 (404) T ss_pred hhhccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhh--h Confidence 000000000001111111112223321 11 11 11123356778888888888865554433 5 Q ss_pred chhhhcccchhhhhhc--cceeeeeccccccccccccccccc--cCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAK--YDVYMQHGKVGHTRFTREIGVAPV--SDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQ 139 (468) Q Consensus 64 ~~~~i~k~~~~stv~e--y~~~~~hG~~g~~~fv~E~g~~~~--~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~ 139 (468) +++.+.+.++...... |.+.. +.....++.|++..+. .++.+.......+=++.-..+|.-+ +.++..+.+. T Consensus 140 l~~l~~~~~~~~~~g~~~~~~~~---~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~ 215 (404) T protein:vir:10 140 LYNMVDYEPVFTRSGSRTYEKRS---KQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDL-LKFADKSLED 215 (404) T ss_pred HhhhhceeeccCCccceEEEEec---CCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHH-HhhcHHHHHH Confidence 6666666666543333 44433 2334558999988655 3688999999998888888888743 3455667888 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhh-hhhhhhccCceEEE Q lcl|NC_020871. 140 ILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQA-AVMISKGYGTPTDA 218 (468) Q Consensus 140 ~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~-a~~i~~~fG~~td~ 218 (468) ...+.-...+...+|.++++|+.. |-.+.||...-.. +.+-..+. ...+.|..+ +..+-.+|...-.+ T Consensus 216 ~i~~~la~~~~~~~~~~il~G~g~---------~~~~~gi~~~~~~-~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~ 284 (404) T protein:vir:10 216 WIINWFVDKVRITRNAEILYGAGG---------DEHATGIMTANKF-KKITLPKS-PALKDFKKCKNVELLNVFKATSSW 284 (404) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCC---------CCcccceeecccc-ceeecccc-ccHHHHHHHHHhhhhccccCCCEE Confidence 888889999999999999999742 2234555543222 22323333 344444433 33444555544457 Q ss_pred ecCHHHHhhHHHhhcCC-ceEEeecCCCc----ceeeeeccceeecCCccccCCC-Eeecccc--cccccccccCCCCCC Q lcl|NC_020871. 219 YMPVGVQADFVNQQLSK-QTQLVRDNGNN----VSVGFNIQGFHSARGFIKLHGS-TVMENEQ--ILDERILALPTAPQQ 290 (468) Q Consensus 219 ~m~~~v~a~~~~~~~~~-qr~v~~~n~~~----~~~G~~v~~~~s~~g~i~l~gs-~i~~~~n--~l~~~~~~~p~ap~~ 290 (468) +|++.+.+.|.. .-+. -|.+.+++... .-.|.+|--.-+.-........ .++.+.. .....+. T Consensus 285 v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~-------- 355 (404) T protein:vir:10 285 IVNQDGFNYLDS-LEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDG-------- 355 (404) T ss_pred EEcHHHHHHHHH-hhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEec-------- Confidence 899999988854 3222 23343333211 1123322100000000000000 0111000 0000000 Q ss_pred cceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCC Q lcl|NC_020871. 291 AKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYS 354 (468) Q Consensus 291 ~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~g 354 (468) .++-.........|. .+...|++...=+.+ +.....-+.++++-++..+ T Consensus 356 -~~~i~~~~~~~~~~~---~~~~~~~~~~r~d~~-----------v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 356 -AYELATTNIGAGAFE---TNTTKARIIMRIDGN-----------VKDSEALLIAEIPVESVQA 404 (404) T ss_pred -ceEEEEeccccchhh---cCceEEEEEEeeccE-----------EecccceEEEEeecccCCC Confidence 000000000000000 011112221111111 0000011112222222222 No 99 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=90.08 E-value=0.023 Score=29.59 Aligned_cols=309 Identities=11% Similarity=0.047 Sum_probs=132.5 Q ss_pred CCCcccch------hhc-ccChhhHHHHHHHHhhcc--------------cc-------cCcccccCccccchhhhhhHh Q lcl|NC_020871. 1 MPKNNKEE------EVK-EVNLNSVQEDALKSFTTG--------------YG-------ITPDTQTDAGALRREFLDDQI 52 (468) Q Consensus 1 ~~~~~~~~------~~~-~~n~~~~~e~~~Ksf~ag--------------y~-------~~p~~~~~gaALr~esld~~i 52 (468) +|.-.++. +.+ +....+.-..+++++.++ ++ +. .+-..|+.|-++.+..+| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~-~~~~~Gg~lvP~~~~~~i 85 (366) T protein:vir:57 7 VPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAIS-TAAGSGGALIPQNMQNEV 85 (366) T ss_pred ccccccccccccccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhcc-ccccCCccccchhHHHHH Confidence 22111100 000 000001111122221110 00 11 112247777788888877 Q ss_pred hhhhhccccccchhhhcccchh--hhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhh Q lcl|NC_020871. 53 SMLTWTENDLTFYKDIAKKPAT--STVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGL 130 (468) Q Consensus 53 ~~L~~~~~~f~~~~~i~k~~~~--stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l 130 (468) ..+.... ..+..+.-+.+. +---+|.+++. .....+++|++..+.+++.+.+.....+=++.--.+|.-+- T Consensus 86 i~~l~~~---s~l~~lg~~~v~~~~g~~~~p~~t~---~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell- 158 (366) T protein:vir:57 86 IELLRDR---TVVRILGARSIPLPNGNLSMPRLSG---GATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLI- 158 (366) T ss_pred HHHHhhh---cchhhhceeeeecCCCceEEEEEeC---CcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHH- Confidence 6554432 233333222222 21224555552 23455899999999999999999999999987777765432 Q ss_pred hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-cceeeccCCCCCHHHHhhhhhhhh Q lcl|NC_020871. 131 VNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-DNVHDARGASLTESLLNQAAVMIS 209 (468) Q Consensus 131 v~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-~nviDarG~~ls~~~l~~~a~~i~ 209 (468) .++.-|.+....++-...+++.++.++++||.. +-+..||.+.... ....+.-|..++...+......+. T Consensus 159 ~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~ 229 (366) T protein:vir:57 159 GRAGFNVEQLLLGDILSAIATREDKAFLRDDGT---------GDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLI 229 (366) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------CccccceeeccccccceeeccccccchhhHHHHHHHHH Confidence 244557888888999999999999999999832 2355777775554 346666677776555543322222 Q ss_pred ------hccCceEEEecCHHHHhhHHHhhcC--CceEEeecCCCcceeeeeccceeecCCc----cccCCCEe-eccccc Q lcl|NC_020871. 210 ------KGYGTPTDAYMPVGVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQGFHSARGF----IKLHGSTV-MENEQI 276 (468) Q Consensus 210 ------~~fG~~td~~m~~~v~a~~~~~~~~--~qr~v~~~n~~~~~~G~~v~~~~s~~g~----i~l~gs~i-~~~~n~ 276 (468) ..+....-..|++.+.+.+.. ..+ +++.+ ++..++.-.|.+|- .+..=. ..-....| +.+..- T Consensus 230 ~~~~~~~~~~~~a~~vmn~~~~~~L~~-lkd~~G~~l~-~~~~~g~l~G~Pvv--~s~~ip~~~~~~~~~~~i~~gdfs~ 305 (366) T protein:vir:57 230 LKHMDSNSNMIRCGWGLSNRTYMTLFG-LRDGNGNKVY-PEMSQGILKGYPIQ--RTSAIPANLGDDGNESEIYFCDFND 305 (366) T ss_pred HhhhccccccccCEEEecHHHHHHHHh-hhccCCceec-cCCCCCeecceeeE--EccccccccccCCCccEEEEEecce Confidence 223334446799999988854 322 23332 22222212233221 000000 00000011 111000 Q ss_pred ccccccccCCCCCCcceeEEe---cCCCCC----CcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEe Q lcl|NC_020871. 277 LDERILALPTAPQQAKVTATQ---EAGKKG----QFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIEL 349 (468) Q Consensus 277 l~~~~~~~p~ap~~~~vtat~---~~~~~g----~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~ 349 (468) +.-..+... .+--.. .....+ -|.. ...-+++...-+-+---|...+-.|- ++| T Consensus 306 ~~i~~~~~i------~i~~~~ea~~~~~~g~~~~~f~~---~~~~iR~~~~~d~~v~~~~a~~~lt~----------~~~ 366 (366) T protein:vir:57 306 VVIGEDGMM------KVDFSTEATYKDADGQLVSAFAR---NQSLIRVVTEHDIGFRHPEGLVLGTG----------VIW 366 (366) T ss_pred EEEEEecce------EEEEeeccccccccccchhhhhc---CceeEEeeeeeCcEeeccccEEEEec----------ccC Confidence 000000000 000000 000000 0000 00111111111111000111111111 233 No 100 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=89.74 E-value=0.025 Score=29.40 Aligned_cols=304 Identities=14% Similarity=0.035 Sum_probs=124.5 Q ss_pred CCCcccchhhcccChh----------------hHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLN----------------SVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~----------------~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) ..+..++ ....++.+ +.-..|..+++ ..+..+|+.|-++.+.+.|..... +...+ T Consensus 46 ~~~~~~~-~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~------~~t~~~gG~liP~~~~~~Ii~~l~--~~s~i 116 (395) T protein:vir:95 46 SNDLQEE-ITAEINNRVVDNGILAKRSQDPLTSEERKFFNDIN------YDVGYTDEKILPETVVERVFDDLQ--KDHPL 116 (395) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHhhcCccccchHHHHHHHHHh------hccCCCCceeccHHHHHHHHHHHH--hhhhh Confidence 0000000 00000000 00011112222 234556888888888887754322 22355 Q ss_pred hhhhcccchhhhhhccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) ++.+...++..++ .+....+.+.+.++.|.+. ...+++.+.+.....+=|+.-..+|..+ |.++..|.+....+ T Consensus 117 ~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el-l~ds~~~ie~~i~~ 191 (395) T protein:vir:95 117 LSKINFQNAGIKT----RVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL-STFGPAWIERFVRT 191 (395) T ss_pred hhhceeEecCCce----EEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHH-HhcchhHHHHHHHH Confidence 5555555555443 2333334445557777665 4578999999999999998777777665 45678889999999 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccce--eecc-CCCCC-------HHHH----hhhhhh-h Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNV--HDAR-GASLT-------ESLL----NQAAVM-I 208 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nv--iDar-G~~ls-------~~~l----~~~a~~-i 208 (468) .--.++++.++.+++.|+-.-+ + |=-||.+-....+. .+.. ...++ ...| ..++.. . T Consensus 192 ~la~~ia~~~~~a~i~G~G~~~---~-----qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~ 263 (395) T protein:vir:95 192 QIQEAISVALESAIINGGGAAK---T-----QPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEK 263 (395) T ss_pred HHHHHHHHHHhhheeeccCCCC---c-----CceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccc Confidence 9999999999999999984311 0 11245443322110 0000 00011 1111 111111 1 Q ss_pred hhc--c-CceEEEecCHHHHhhHHHhhcCCceEEeecCCCcce--eeeeccceeecCCccccCCCEeecccc---ccccc Q lcl|NC_020871. 209 SKG--Y-GTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVS--VGFNIQGFHSARGFIKLHGSTVMENEQ---ILDER 280 (468) Q Consensus 209 ~~~--f-G~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~--~G~~v~~~~s~~g~i~l~gs~i~~~~n---~l~~~ 280 (468) .+. | |.. .+.|++.+..++. +++..++.+ |... .|+.++-+.+..-. -+..++.+.. +.++. T Consensus 264 ~~~~~~~~~~-~~~mn~~t~~~~~-----g~~~~~~~~-G~~~~~lg~g~~v~~~~~~p---~~~i~fgdfs~y~i~~r~ 333 (395) T protein:vir:95 264 GKELKIDGKV-ALVVNPRDSWDVQ-----ARYTYLTAN-GGFVTVLPYNVTIITSEFVP---EGKLVAFVTDRYNAVRGG 333 (395) T ss_pred cchhhhcCce-EEEEcchhhhhcC-----CcceeccCC-CcceeccCCcceEEEcCCCC---CCcEEEEecccEEEEEec Confidence 111 1 122 2467777766653 334444333 2222 23333222222111 1223332222 00000 Q ss_pred ccccCCCC------CCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEE Q lcl|NC_020871. 281 ILALPTAP------QQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLE 346 (468) Q Consensus 281 ~~~~p~ap------~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~lt 346 (468) ...+-... -.....+..--+++- ..+ ... +|.-+...-+ ++ .+.++.+..++.--- T Consensus 334 ~~~i~~~~~~~~~~d~~~f~~~~r~dg~~-~~~---~A~--~~l~i~~~~~-~~---~~~~~~~~~~~~~~~ 395 (395) T protein:vir:95 334 GLTVKKFDQTLALEDAVLFTAKTFAYGQP-DDN---KAS--AVYDLKVASA-PR---RQTSAGGTTDGIAEA 395 (395) T ss_pred ceEEEeccchhhhCCcEEEEEEEEECCEE-ecc---ccE--EEEEeeccCC-CC---CCCCCCCCCCccccC Confidence 00000000 001111222221111 111 111 2222211100 00 011111111111000 No 101 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=89.45 E-value=0.026 Score=29.25 Aligned_cols=291 Identities=13% Similarity=0.031 Sum_probs=114.3 Q ss_pred CCCcccchhhcccChhhHHH---HHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQE---DALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTV 77 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e---~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv 77 (468) ++..++....+.-....... .+.+++.++ +-++|+.|-++.+.+.|..... +.-.+++.+...++.+.+ T Consensus 55 ~~~~~~~~~~~~~g~~~lt~~e~~~~~~~~~~------~~~~gg~lvP~~~~~~I~~~l~--~~s~l~~~~~v~~~~~~~ 126 (383) T protein:vir:78 55 ARQEADAYISASRTDKNITNEEIKFFNDINKE------VGYKEETLLPQTVVDEIFEDLT--TEHPFLASIGMRTTGLRT 126 (383) T ss_pred HHHHHHHHHHhcCChhhhhHHHHHHHHHHhcc------CCCCCccccCHHHHHHHHHHHH--hhccceeeeeeEecCCce Confidence 11110000000000000111 122233333 3456788888888887754222 223455555555554442 Q ss_pred hccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWA 156 (468) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a 156 (468) ++.+.. +.+.+.+++|.+. +...++.+.+.....+=|+.-..+|..+ |.++..|.+....+.--.++++.++.+ T Consensus 127 -~i~~~~---~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~el-l~Ds~~~ie~~i~~~l~~~~a~~~~~a 201 (383) T protein:vir:78 127 -KFLKSE---TSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDL-EKFGPAWVKRFVVTQIEEAFAVALESA 201 (383) T ss_pred -EEEEEc---CCcceEEeecccccccccCcceeeEeecceeeEeeccchHHH-hhccHHHHHHHHHHHHHHHHHHHHhhh Confidence 444433 4444558899775 4678999999999999998766666554 455788999999999999999999999 Q ss_pred HhhcccccccCCCCCCCccccchhhhcCcc-ceee---ccCC---CCCHH----HHhhhhhhhhhc-c----------Cc Q lcl|NC_020871. 157 SFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHD---ARGA---SLTES----LLNQAAVMISKG-Y----------GT 214 (468) Q Consensus 157 ~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviD---arG~---~ls~~----~l~~~a~~i~~~-f----------G~ 214 (468) ++.||-. +.+ =||.+-+... .+.. ..+. .++-. ..+.....+... | |. T Consensus 202 ~i~G~G~-----~qP-----~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 271 (383) T protein:vir:78 202 YIVGDGN-----DKP-----IGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGK 271 (383) T ss_pred eEeccCC-----CCc-----eeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCc Confidence 9999842 112 2444433211 1111 0000 01111 111111111100 0 11 Q ss_pred eEEEecCHHHHhhHHHhhcCCceEEeecCCCcce--eeeeccceeecCCccccCCCEeecccc----------ccccccc Q lcl|NC_020871. 215 PTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVS--VGFNIQGFHSARGFIKLHGSTVMENEQ----------ILDERIL 282 (468) Q Consensus 215 ~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~--~G~~v~~~~s~~g~i~l~gs~i~~~~n----------~l~~~~~ 282 (468) + ...|.+....+. .++.-.+.++ |... .|+.++-+.+.... .++.++.+.. .+.... T Consensus 272 ~-~~~~n~~~~~~~-----~~~~~~~~~~-G~~~t~l~~~~~iv~s~~~p---~~~iifgdfs~Y~i~~r~~~~i~~~~- 340 (383) T protein:vir:78 272 V-TLLVNPTDAWDV-----KKQYTSLNAN-GVYVTALPFNLNIIESLFVP---EKKAISYVAERYDALIGGPLDIGTYD- 340 (383) T ss_pred e-EEEEcCcchhhh-----ccchhccCCC-CceeeecCCCceEEecCCCC---cccEEEeeccceEEEecccceEEecc- Confidence 1 122333222111 1111111111 1111 12221111111000 1111111100 000000 Q ss_pred ccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccc Q lcl|NC_020871. 283 ALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIAS 329 (468) Q Consensus 283 ~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS 329 (468) .....--.....+..-.+++= .++. +---+.++ +......++- T Consensus 341 ~~~f~~d~~~f~~~~r~dG~~-~~~~--A~~vl~~~-~~~~~~~~~~ 383 (383) T protein:vir:78 341 QTLAIEDLNLYAAKQFAYGKA-KDDK--AAAVWTLN-INPAEQTPEG 383 (383) T ss_pred hhhhhcCceEEEEEEEEcCEE-ecCC--eEEEEEEE-ecCCCCCCCC Confidence 000000011112222121111 1111 11122222 2222222222 No 102 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=89.28 E-value=0.027 Score=29.16 Aligned_cols=277 Identities=12% Similarity=0.047 Sum_probs=127.5 Q ss_pred CCCcccchh-----hcccChhhHH---------------------HHHHHHhhcccccCcccccCccccchhhhhhHhhh Q lcl|NC_020871. 1 MPKNNKEEE-----VKEVNLNSVQ---------------------EDALKSFTTGYGITPDTQTDAGALRREFLDDQISM 54 (468) Q Consensus 1 ~~~~~~~~~-----~~~~n~~~~~---------------------e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~ 54 (468) .|...+... .+........ +++..++++| -+..+|+.|.++.+...|.. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~gg~~vP~~~~~~ii~ 156 (400) T protein:vir:38 82 KPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAG-----VKAADAASTIPETISNTPQR 156 (400) T ss_pred cccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhc-----ccccCCcccccHHHHHHHHH Confidence 111111110 0000000010 0111111111 23456888999999888855 Q ss_pred hhhccccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhh-c Q lcl|NC_020871. 55 LTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLV-N 132 (468) Q Consensus 55 L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-~ 132 (468) +.... ..+.+.+...++.+.--+|.+.... .+...+++|++... .+++.+.+....++-++.-..+|.- |. + T Consensus 157 ~~~~~--~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~e--ll~d 230 (400) T protein:vir:38 157 ELQTV--VDLKPFTNVFQASTQKGTYPTVANA--TTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQE--SIDD 230 (400) T ss_pred HHHhh--hhhhhcceeEeccCcceEEEEEecC--CCccccccccccccccccccceeeEeehhheeeehhhHHH--HHhh Confidence 55433 3556666666665544456555533 34455788877665 6899999999999888877777763 43 4 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhcc Q lcl|NC_020871. 133 NIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGY 212 (468) Q Consensus 133 ~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~f 212 (468) +..|.+....+.....+..+++.++++|...... .....+|+|..++... +|.. + T Consensus 231 s~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~----~~~~~~~~~~~~~~~~--~~~~-------------------~ 285 (400) T protein:vir:38 231 SAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTA----KTISSVDDLKHINNVD--LDPA-------------------Y 285 (400) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc----cccccHHHHHHHHHhh--hhhh-------------------h Confidence 5677888888888899999999999999876532 1233455555443311 1111 1 Q ss_pred CceEEEecCHHHHhhHHHhh-cCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCc Q lcl|NC_020871. 213 GTPTDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQA 291 (468) Q Consensus 213 G~~td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~ 291 (468) ..-..|++.+.+.|...- -+++..++ ++.... +.-.|.|.-+.-.++. |. +... T Consensus 286 --~a~~v~~~~~~~~l~~lkd~~G~~i~~-~~~~~~-------------~~~~l~G~pv~~~~~~--------~~-~~~g 340 (400) T protein:vir:38 286 --SRVIIASQSFYNFLDTVKDGNGRYLLQ-DSILTP-------------SGKSVLGMPIAVVSDD--------TL-GAAG 340 (400) T ss_pred --CcEEEEcHHHHHHHHHhhccCCCeeee-cCcCCC-------------CccccccceeEEeccc--------cc-CCCC Confidence 124789999988884321 13344433 222111 1112334433222211 10 0000 Q ss_pred ceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcc-cccc------------eeeeeeccCcceEEEEEeec Q lcl|NC_020871. 292 KVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESI-ASEV------------ATATVTAKDDGVKLEIELAP 351 (468) Q Consensus 292 ~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~-aS~~------------vt~Tv~a~~~g~~ltIT~~~ 351 (468) ....- .|-++-.+..+++.+-+. .+.. ..+.+.....-+.|+++..+ T Consensus 341 ~~~~~-------------~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 341 EAHAF-------------LGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ceEEE-------------EEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceEEEEeecCC Confidence 00000 011111111121111000 0000 00001111111112222211 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=88.41 E-value=0.032 Score=28.74 Aligned_cols=306 Identities=14% Similarity=0.106 Sum_probs=140.3 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhcc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey 80 (468) |++ ..-.+.+..+.-.-.++..+..+|..+..+.-.+-+..+.... .|++.+.-.++++--.+ T Consensus 1 ~~~-------------k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s---~~l~~i~v~~v~~~~~~- 63 (321) T protein:vir:31 1 MAS-------------RTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEET---PLLDAIRTETVGAKKTR- 63 (321) T ss_pred Cch-------------HHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhh---hhhhhceeeeccCccee- Confidence 322 1111222222112245555666677787776665555544332 46777766665543322 Q ss_pred ceeeeecccccccccc-c-cccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 81 DVYMQHGKVGHTRFTR-E-IGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNI--QDPMQILTDDAIVNIAKTIEWA 156 (468) Q Consensus 81 ~~~~~hG~~g~~~fv~-E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~--~Dp~~~~~~~ai~~~~~~~e~a 156 (468) +...|-.+...-++ | .+....+++.+.+....++=+..--.+|.-+ |-++. .|-+....+.-..+++.+++.+ T Consensus 64 --i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~-L~d~a~~~d~e~~i~~~ia~~~a~~~~~~ 140 (321) T protein:vir:31 64 --IPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREV-VQENPEGEALADRILNLMTDAWSADVEDL 140 (321) T ss_pred --eeeeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHH-HHhhhcchhHHHHHHHHHHHHHHHHHHhh Confidence 22222111111222 3 3345567888887777776666555555432 22332 4777888888888999999999 Q ss_pred HhhcccccccCCCCCCCccccchhhhcCcc-ceeeccCCCCCHHHHhhhhhhhhhccCc-eE-EEecCHHHHhhHHHhhc Q lcl|NC_020871. 157 SFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGT-PT-DAYMPVGVQADFVNQQL 233 (468) Q Consensus 157 ~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviDarG~~ls~~~l~~~a~~i~~~fG~-~t-d~~m~~~v~a~~~~~~~ 233 (468) .|+||..-.+. .-...+|+.+++... +.++..+..++.+.|..+-..+-..|-. .+ -++|+..+.+.+..... T Consensus 141 ~~nGd~~~~~~----~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~ 216 (321) T protein:vir:31 141 AANGDEDAEDS----FENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLT 216 (321) T ss_pred eeeccccCCCc----ccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHh Confidence 99999763321 112558999987644 7789999999988888776667666643 22 26799998887766555 Q ss_pred CCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCC-c---Cc-c Q lcl|NC_020871. 234 SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQ-F---RA-E 308 (468) Q Consensus 234 ~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~-~---~~-~ 308 (468) +++.-. +... +.......|-|--++..+.+=+.... ........-.-.-+..-. . .+ . T Consensus 217 ~~~~~~----------~~~~---l~~~~~~tl~G~pvv~~~~mP~~~il----~t~~~nl~~~~~~~~~~~~~~~~~~~~ 279 (321) T protein:vir:31 217 DRDTPL----------GDNV---IMGEADVNPFSFPIIGSGLWPDDKAM----FTDPQNLIYALYRDLEIDVLTESDKVS 279 (321) T ss_pred cCCCcc----------ccch---hhccccccccceeEEEcCCCCCCcEE----EeccccEEEEEeeccEEEEeecCcccc Confidence 554321 1111 11112222334433332221000000 001111110000000000 0 00 0 Q ss_pred cceeEEEEEEEEcccC--CcccccceeeeeeccCcceEEEE-EeecCCC Q lcl|NC_020871. 309 DLAAHEYKVVVSSDDA--ESIASEVATATVTAKDDGVKLEI-ELAPMYS 354 (468) Q Consensus 309 ~~~~y~YkVtavn~~G--ES~aS~~vt~Tv~a~~~g~~ltI-T~~~~~g 354 (468) +.....|.....+.+. |- .. ..|-+. ++...+ ++.+..+ T Consensus 280 ~~~~~~~~~~~~~~~~~ve~-~~--a~a~~~----~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 280 ERDLHARYFMRGDDDFAIEN-TE--AVVLAE----GLGDPLEHLEEETS 321 (321) T ss_pred ccceeeEeeeeeecceeEec-cc--cEEEEe----cCCcchhcccCCCC Confidence 0111111111122111 11 00 000111 111100 0001111 No 104 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=87.69 E-value=0.037 Score=28.43 Aligned_cols=336 Identities=13% Similarity=0.050 Sum_probs=135.9 Q ss_pred CCCcccchhhcccChhhHHH--------------------HHHH----------------HhhcccccCcccccCccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQE--------------------DALK----------------SFTTGYGITPDTQTDAGALR 44 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e--------------------~~~K----------------sf~agy~~~p~~~~~gaALr 44 (468) +.....+.+.++.+....-. .+.+ ....--..+..+..||.... T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~ 169 (477) T protein:vir:84 90 VRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVP 169 (477) T ss_pred hcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeec Confidence 11111111111111110000 1111 00111112223334555566 Q ss_pred hhhhhhHhhhhhhccccccchhhhcccchhhhhhc--cceeeeeccccccccccccc-----cccccCcceEEEEEEEEe Q lcl|NC_020871. 45 REFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK--YDVYMQHGKVGHTRFTREIG-----VAPVSDPNIRQKTVNMKF 117 (468) Q Consensus 45 ~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~e--y~~~~~hG~~g~~~fv~E~g-----~~~~~d~~~~r~~~~~k~ 117 (468) .|-+...|..+... ...+.+.+...+...+-.. |.+... +..-+..++|++ ..+.+|+.+.+....+|= T Consensus 170 ~~~~~~~ii~~l~~--~~~i~~~~~~~~~~~~~~~~~ip~~~~--~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k 245 (477) T protein:vir:84 170 PLWMMNRFIELARA--GRTYANLCPTEPLPGGTSSINIPKILT--GTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKT 245 (477) T ss_pred cchhHHHHHHHhhh--cchHHHhhceeeecCCcceeEEEEEec--CcceeeeeccCcccccccccccccceeeEEEeeee Confidence 66555555433322 2234444555555554443 333332 221233578875 346788999999999988 Q ss_pred eeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCC Q lcl|NC_020871. 118 ASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLT 197 (468) Q Consensus 118 l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls 197 (468) ++.--.+|..+= .++.-|.+....+.-...+++.++.++++|+-. +-+..||.+.... +.+++-+...+ T Consensus 246 ~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt---------~~~p~Gi~~~~~~-~~~~~~~~~~t 314 (477) T protein:vir:84 246 IAGQQGIAIQLL-DQAAVSVDEFVFRDLAADYANKLNVQVISGTGS---------NNQVVGVRATAGI-TQVTATSAGSA 314 (477) T ss_pred EEeeeHHHHHHH-hccchhHHHHHHHHHHHHHHHHHHHHHhccCCC---------CCccceeeecccc-ccccccccccc Confidence 888877776542 344557788888899999999999999999832 1134566654322 33444444444 Q ss_pred HHHHh-------hhhhhhhhccCce-EEEecCHHHHhhHHHhh-cCCceEEeecCCCcceeeeeccceeecCCccccCCC Q lcl|NC_020871. 198 ESLLN-------QAAVMISKGYGTP-TDAYMPVGVQADFVNQQ-LSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGS 268 (468) Q Consensus 198 ~~~l~-------~~a~~i~~~fG~~-td~~m~~~v~a~~~~~~-~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs 268 (468) ...+. .+..-+..+|+.. .-.+|++.+.+.+...- -+++..++++..+....+.....+ .....-.|.|. T Consensus 315 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~-~~~~~~~l~G~ 393 (477) T protein:vir:84 315 LEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVA-SQRVVGQMHGL 393 (477) T ss_pred hhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccc-cccccchhccc Confidence 32222 2222233455544 44778999988874433 245666655543322222221111 11111123333 Q ss_pred EeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEE Q lcl|NC_020871. 269 TVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIE 348 (468) Q Consensus 269 ~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT 348 (468) -++.++.. |. .+ +..+.+..-..+.+++.+.. ++ +..+.+. T Consensus 394 pVv~s~~~-----------p~--~~------~~~~d~~~i~~gd~~~~~i~-----~~---------------~~~~~~~ 434 (477) T protein:vir:84 394 PVVTDPTL-----------PT--TL------GTGTDQDVIHVLRASDLALF-----ES---------------SVRMRAL 434 (477) T ss_pred ceEecCcc-----------cc--cc------cccCCcceEEEEEeceEEEE-----ee---------------ceeEEec Confidence 33222111 11 00 00111111111223332221 11 1111111 Q ss_pred eecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEecCCCCCCCCc Q lcl|NC_020871. 349 LAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFYDLNDSIPETV 401 (468) Q Consensus 349 ~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~D~N~~iPgt~ 401 (468) .....+..-..|+||.--. | ...|.+.+= -.+|.+++ +-|--. T Consensus 435 ~~~~~~~~~~~~~v~~~~~----~-~~~r~~~af---v~~t~~~~--~~~~~~ 477 (477) T protein:vir:84 435 QETRAENLSVLLQVYGYLA----F-TAARFPQSV---VEIGGTAL--TAPTFA 477 (477) T ss_pred cccccccceeeeeehhhhh----h-hhhccccce---EEeecccc--cccccC Confidence 1100110012233332100 0 011111110 01111111 111111 No 105 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=86.22 E-value=0.047 Score=27.86 Aligned_cols=289 Identities=12% Similarity=-0.046 Sum_probs=129.0 Q ss_pred CC-------------CcccchhhcccC---hhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MP-------------KNNKEEEVKEVN---LNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~-------------~~~~~~~~~~~n---~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) |- +.+..--.++.+ ..+.-+.+.+..+. .+-.+|+.|.++.+.+.|.... .+...+ T Consensus 38 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~------~~~~~gg~lvP~~~~~~I~~~l--~~~s~i 109 (377) T protein:vir:96 38 AFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDL--VAEHPL 109 (377) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhc------CCCCCCceecCHHHHHHHHHHH--Hhhhhh Confidence 00 000000001111 11111122222221 2345688888888877774422 223355 Q ss_pred hhhhcccchhhhhhccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) ++.+...++.+.+ ++.+ ..+.+.+.+++|.+. ++..++.+.+.....+=|+.--.+|..+ |.++..|.+....+ T Consensus 110 ~~~~~v~~~~~~~-~i~~---~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~l-l~ds~~~le~~i~~ 184 (377) T protein:vir:96 110 LKVINFKNTSLRL-KALT---AETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWLKQFITE 184 (377) T ss_pred hhhceeEecCCce-EEEE---ecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHH-hhcchhhHHHHHHH Confidence 5555555554432 2332 334445668999876 5678999999999999998777777655 45688899999999 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccc-----------eeec---cCC--CCCHHHH-hhhhh Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDN-----------VHDA---RGA--SLTESLL-NQAAV 206 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~n-----------viDa---rG~--~ls~~~l-~~~a~ 206 (468) .--.++++.++.+++.||-. + |--||.+-+.... +++. -|. .++.+.+ +.... T Consensus 185 ~l~~~~~~~~~~a~i~G~G~-----~-----~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (377) T protein:vir:96 185 QLKEAIAVALELAIVKGNGL-----L-----QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVP 254 (377) T ss_pred HHHHHHHHHHhhceEeccCC-----C-----cceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHH Confidence 99999999999999999942 1 2225554332111 1111 111 1222222 22121 Q ss_pred h---h-hhccCceEE------EecCHHHHhhHHHhhcCCceEEeecCCCcce--eeeeccceeecCCccccCCCEeeccc Q lcl|NC_020871. 207 M---I-SKGYGTPTD------AYMPVGVQADFVNQQLSKQTQLVRDNGNNVS--VGFNIQGFHSARGFIKLHGSTVMENE 274 (468) Q Consensus 207 ~---i-~~~fG~~td------~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~--~G~~v~~~~s~~g~i~l~gs~i~~~~ 274 (468) + . ..+.|.+.. +.|++.+..++ .+++..++++ |... +|+++.-..+.... .++.++.+. T Consensus 255 l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~-----~~~~~~~~~~-G~~~~~l~~p~~v~~s~~~p---~~~i~fgdf 325 (377) T protein:vir:96 255 VMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL-----EAKFTSRNQF-GEYVTVLPHGITILESLAVE---TGKAIAFVA 325 (377) T ss_pred HHHhhccccccccccccCceEEEEchhhHHhc-----cccccccCCC-CCceeccCCCceEEecCCCC---cccEEEEEc Confidence 1 1 112333332 55888776554 3444444333 3222 22222111111100 122222211 Q ss_pred c-cccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcc----cCCcccccceeeeee Q lcl|NC_020871. 275 Q-ILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSD----DAESIASEVATATVT 337 (468) Q Consensus 275 n-~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~----~GES~aS~~vt~Tv~ 337 (468) . .+...+ ..++-.... .-.|. . ..--|++..--+ +.|+. .+.++|+- T Consensus 326 ~~Y~i~~r---------~~~~i~~~~--~~~~~-~--d~~~f~~~~r~dG~~~d~~a~--~vl~l~~~ 377 (377) T protein:vir:96 326 NRYDAFMA---------TASTIEEYD--QTFAM-E--DLQLYLTKNYFYGKAKDNHTA--ALLTLAGG 377 (377) T ss_pred CcEEEEEe---------cccEEEeeh--hhhhh-c--CCeEEEEEEEEcCEEecCCcE--EEEEEecC Confidence 0 000000 000000000 00011 0 112233333322 33432 23344444 No 106 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=85.71 E-value=0.051 Score=27.68 Aligned_cols=308 Identities=15% Similarity=0.102 Sum_probs=139.5 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc-------ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY-------GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA 73 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy-------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~ 73 (468) .|....+.... ......+..++|+....- ...-.+..+|+.|-++.+.++|..+... ...+.+.....++ T Consensus 70 ~~~~~~~~~~~-~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~--~~~l~~~~~~~~~ 146 (401) T protein:vir:44 70 RPARGAQNKVA-AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKD--EVVMRQEATVITV 146 (401) T ss_pred ccccccccchh-HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHh--hhhhhhhceeeec Confidence 11111111100 001111122222110000 0011122456778888888888654433 2245555555666 Q ss_pred hhhhhccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020871. 74 TSTVAKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKT 152 (468) Q Consensus 74 ~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~ 152 (468) .+....|.+.. ++. ...+++|++. ++...+.+.+....++=++.--.+|.-+ +.++..|.+....+.-...++.. T Consensus 147 ~~~~~~~~~~~--~~~-~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~ai~~~ 222 (401) T protein:vir:44 147 GGSDYKKLVNL--GGT-ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEAWINSELATEFAEQ 222 (401) T ss_pred CCCceEEEEec--CCc-cceeeccccccCccccccceeeeeehhheeeehhhhHHH-HhcchHHHHHHHHHHHHHHHHHH Confidence 66555555544 222 2447999985 5677789999988888777766666642 23456788888888888899999 Q ss_pred HHHHHhhcccccccCCCCCCCccccchhhhcC------------ccceeeccCCCCCHHHHhhhhhhhhhccCceEEEec Q lcl|NC_020871. 153 IEWASFFGDSDLSDSPEPQAGLEFDGLAKLIN------------QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYM 220 (468) Q Consensus 153 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~------------~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m 220 (468) ++.++++||-. + +-.||.+... .+.+.......++.+.|-.+.-.+...|....-.+| T Consensus 223 ~~~~~l~G~G~-~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~ 292 (401) T protein:vir:44 223 EEIAFTTGDGT-K---------KPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMM 292 (401) T ss_pred HHhhhhccCCC-C---------ccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEE Confidence 99999999843 2 2234444332 123343444445555554443334445544445889 Q ss_pred CHHHHhhHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecC Q lcl|NC_020871. 221 PVGVQADFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEA 299 (468) Q Consensus 221 ~~~v~a~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~ 299 (468) ++...+.+. ..-+.+ |.+.+++.... . .-.|.|.-++..++. |...+...+.. T Consensus 293 n~~~~~~L~-~lkd~~G~~l~~~~~~~g---~----------~~~l~G~PVv~~~~~--------p~~~~~~~~i~---- 346 (401) T protein:vir:44 293 NNNSLFAIR-LLKDTEGNYLWRPGLELG---Q----------PSSLAGYGIAENEQM--------PDIAADAKAIA---- 346 (401) T ss_pred cHHHHHHHH-HhhccCCceeecCCcCCC---C----------CceecceeeEEecCc--------CCccCCccEEE---- Confidence 999988884 344443 44433332211 0 001233333322221 11111100000 Q ss_pred CCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecC---CCceeEEEE Q lcl|NC_020871. 300 GKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGA---ETGLFYLIA 376 (468) Q Consensus 300 ~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~---~~G~f~~ig 376 (468) .|-+++.+..+++.|-+.. ......... ..++.+.|=.- ...-|.+ - T Consensus 347 ----------~Gd~~~~~~i~~~~~~~~~-------~~~~~~~~~------------v~~~a~~r~d~~~~~~~a~~~-l 396 (401) T protein:vir:44 347 ----------FGNFKRGYTIVDRIGTRIL-------RDPYTNKPF------------VGFYTTKRTGGMLVDSQAIKL-L 396 (401) T ss_pred ----------EeehhccEEEEEecceEEe-------eeccccCCc------------EEEEEEEEeccEEecccceEE-E Confidence 0122222223333331110 000000000 11122222110 0111222 2 Q ss_pred EEecc Q lcl|NC_020871. 377 RVPAS 381 (468) Q Consensus 377 rv~~s 381 (468) +++.+ T Consensus 397 ~~~aa 401 (401) T protein:vir:44 397 KIAAA 401 (401) T ss_pred EeecC Confidence 22222 No 107 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=84.14 E-value=0.063 Score=27.17 Aligned_cols=296 Identities=12% Similarity=0.046 Sum_probs=130.7 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc----------------ccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY----------------GITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy----------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) .+...+.+........+.-+.+.|++.-+. ..+..+-++|+.|-++.+...|..+.... ..+ T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~--s~l 136 (392) T protein:vir:10 59 RNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DAL 136 (392) T ss_pred hhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhh--hhh Confidence 111111111111122222234444443221 11112335678888988888885544443 356 Q ss_pred hhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) .+.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++--..+|.-+ +.++..|.+....+ T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~ 214 (392) T protein:vir:10 137 EQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTK 214 (392) T ss_pred hhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHH Confidence 66666666665444444333222 22456899999876 56799999999999998888888753 23456678888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .--..+.+.++.+++.|+..... ...+-+|.|.+++. ..+..+|-.....+||+. T Consensus 215 ~l~~~i~~~~d~~~~~g~g~~~~----~~~~~~d~i~~~~~---------------------~~l~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 215 WLGKKSKVTRNVLILGVIEKLTK----QAIKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQD 269 (392) T ss_pred HHHHHHHHHHHHHHhhccccccc----cCccCHHHHHHHHH---------------------HhhhhhhccCCEEEEcHH Confidence 88999999999999999876432 12233333333221 112223333344899999 Q ss_pred HHhhHHHhhc--CCceEEeecCCCcceeeeeccceeecCCccccCCC-EeecccccccccccccCCCCCCc--------- Q lcl|NC_020871. 224 VQADFVNQQL--SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGS-TVMENEQILDERILALPTAPQQA--------- 291 (468) Q Consensus 224 v~a~~~~~~~--~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs-~i~~~~n~l~~~~~~~p~ap~~~--------- 291 (468) +.+.+.. .- .++.. .+++......+ .|.|. .++..++- .+..+...... T Consensus 270 ~~~~L~~-lkd~~G~~l-~~~~~~~~~~~-------------tllG~~~v~~~~~~----~~~~~~~~~~~~~~~~gdfs 330 (392) T protein:vir:10 270 GFNYLDK-LKDKDGKYI-LQSDPTQKNKK-------------LFAGTNPVVVVSNR----FLKSKGTTAKKAPLIIGDLK 330 (392) T ss_pred HHHHHHH-hhccCCCeE-eecCccCCccc-------------cccCcccEEEeccc----ccCCCcccCCceEEEEEehh Confidence 9999843 32 33333 33332111100 11111 11100000 00000000000 Q ss_pred ---------ceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccce Q lcl|NC_020871. 292 ---------KVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQF 359 (468) Q Consensus 292 ---------~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~ 359 (468) .++-.........|.. ....|++..--+-.--.+..+ +.++++..+.. .+|++ T Consensus 331 ~~~~i~~~~~~~~~~~~~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~-----------~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 331 EAIVLFKREDMELASTDVGGKAFTR---NTLDLRAIQRDDVQMWDNEAA-----------VYGEIDLSAPV-EQPQG 392 (392) T ss_pred ceEEEEeecceEEEEeccccchhhc---CceEEEEEEeeccEEecccce-----------EEEEecccccc-cCCCC Confidence 0000000000000100 011122221111000001111 11222111111 01211 No 108 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=84.14 E-value=0.063 Score=27.17 Aligned_cols=296 Identities=12% Similarity=0.046 Sum_probs=130.7 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc----------------ccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY----------------GITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy----------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) .+...+.+........+.-+.+.|++.-+. ..+..+-++|+.|-++.+...|..+.... ..+ T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~--s~l 136 (392) T protein:vir:10 59 RNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DAL 136 (392) T ss_pred hhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhh--hhh Confidence 111111111111122222234444443221 11112335678888988888885544443 356 Q ss_pred hhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) .+.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++--..+|.-+ +.++..|.+....+ T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~ 214 (392) T protein:vir:10 137 EQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTK 214 (392) T ss_pred hhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHH Confidence 66666666665444444333222 22456899999876 56799999999999998888888753 23456678888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .--..+.+.++.+++.|+..... ...+-+|.|.+++. ..+..+|-.....+||+. T Consensus 215 ~l~~~i~~~~d~~~~~g~g~~~~----~~~~~~d~i~~~~~---------------------~~l~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 215 WLGKKSKVTRNVLILGVIEKLTK----QAIKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQD 269 (392) T ss_pred HHHHHHHHHHHHHHhhccccccc----cCccCHHHHHHHHH---------------------HhhhhhhccCCEEEEcHH Confidence 88999999999999999876432 12233333333221 112223333344899999 Q ss_pred HHhhHHHhhc--CCceEEeecCCCcceeeeeccceeecCCccccCCC-EeecccccccccccccCCCCCCc--------- Q lcl|NC_020871. 224 VQADFVNQQL--SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGS-TVMENEQILDERILALPTAPQQA--------- 291 (468) Q Consensus 224 v~a~~~~~~~--~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs-~i~~~~n~l~~~~~~~p~ap~~~--------- 291 (468) +.+.+.. .- .++.. .+++......+ .|.|. .++..++- .+..+...... T Consensus 270 ~~~~L~~-lkd~~G~~l-~~~~~~~~~~~-------------tllG~~~v~~~~~~----~~~~~~~~~~~~~~~~gdfs 330 (392) T protein:vir:10 270 GFNYLDK-LKDKDGKYI-LQSDPTQKNKK-------------LFAGTNPVVVVSNR----FLKSKGTTAKKAPLIIGDLK 330 (392) T ss_pred HHHHHHH-hhccCCCeE-eecCccCCccc-------------cccCcccEEEeccc----ccCCCcccCCceEEEEEehh Confidence 9999843 32 33333 33332111100 11111 11100000 00000000000 Q ss_pred ---------ceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccce Q lcl|NC_020871. 292 ---------KVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQF 359 (468) Q Consensus 292 ---------~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~ 359 (468) .++-.........|.. ....|++..--+-.--.+..+ +.++++..+.. .+|++ T Consensus 331 ~~~~i~~~~~~~~~~~~~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~-----------~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 331 EAIVLFKREDMELASTDVGGKAFTR---NTLDLRAIQRDDVQMWDNEAA-----------VYGEIDLSAPV-EQPQG 392 (392) T ss_pred ceEEEEeecceEEEEeccccchhhc---CceEEEEEEeeccEEecccce-----------EEEEecccccc-cCCCC Confidence 0000000000000100 011122221111000001111 11222111111 01211 No 109 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=84.14 E-value=0.063 Score=27.17 Aligned_cols=296 Identities=12% Similarity=0.046 Sum_probs=130.7 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc----------------ccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY----------------GITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy----------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) .+...+.+........+.-+.+.|++.-+. ..+..+-++|+.|-++.+...|..+.... ..+ T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~--s~l 136 (392) T protein:vir:10 59 RNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DAL 136 (392) T ss_pred hhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhh--hhh Confidence 111111111111122222234444443221 11112335678888988888885544443 356 Q ss_pred hhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) .+.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++--..+|.-+ +.++..|.+....+ T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~ 214 (392) T protein:vir:10 137 EQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTK 214 (392) T ss_pred hhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHH Confidence 66666666665444444333222 22456899999876 56799999999999998888888753 23456678888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .--..+.+.++.+++.|+..... ...+-+|.|.+++. ..+..+|-.....+||+. T Consensus 215 ~l~~~i~~~~d~~~~~g~g~~~~----~~~~~~d~i~~~~~---------------------~~l~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 215 WLGKKSKVTRNVLILGVIEKLTK----QAIKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQD 269 (392) T ss_pred HHHHHHHHHHHHHHhhccccccc----cCccCHHHHHHHHH---------------------HhhhhhhccCCEEEEcHH Confidence 88999999999999999876432 12233333333221 112223333344899999 Q ss_pred HHhhHHHhhc--CCceEEeecCCCcceeeeeccceeecCCccccCCC-EeecccccccccccccCCCCCCc--------- Q lcl|NC_020871. 224 VQADFVNQQL--SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGS-TVMENEQILDERILALPTAPQQA--------- 291 (468) Q Consensus 224 v~a~~~~~~~--~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs-~i~~~~n~l~~~~~~~p~ap~~~--------- 291 (468) +.+.+.. .- .++.. .+++......+ .|.|. .++..++- .+..+...... T Consensus 270 ~~~~L~~-lkd~~G~~l-~~~~~~~~~~~-------------tllG~~~v~~~~~~----~~~~~~~~~~~~~~~~gdfs 330 (392) T protein:vir:10 270 GFNYLDK-LKDKDGKYI-LQSDPTQKNKK-------------LFAGTNPVVVVSNR----FLKSKGTTAKKAPLIIGDLK 330 (392) T ss_pred HHHHHHH-hhccCCCeE-eecCccCCccc-------------cccCcccEEEeccc----ccCCCcccCCceEEEEEehh Confidence 9999843 32 33333 33332111100 11111 11100000 00000000000 Q ss_pred ---------ceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccce Q lcl|NC_020871. 292 ---------KVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQF 359 (468) Q Consensus 292 ---------~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~ 359 (468) .++-.........|.. ....|++..--+-.--.+..+ +.++++..+.. .+|++ T Consensus 331 ~~~~i~~~~~~~~~~~~~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~-----------~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 331 EAIVLFKREDMELASTDVGGKAFTR---NTLDLRAIQRDDVQMWDNEAA-----------VYGEIDLSAPV-EQPQG 392 (392) T ss_pred ceEEEEeecceEEEEeccccchhhc---CceEEEEEEeeccEEecccce-----------EEEEecccccc-cCCCC Confidence 0000000000000100 011122221111000001111 11222111111 01211 No 110 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=84.14 E-value=0.063 Score=27.17 Aligned_cols=296 Identities=12% Similarity=0.046 Sum_probs=130.7 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc----------------ccCcccccCccccchhhhhhHhhhhhhccccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY----------------GITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy----------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~ 64 (468) .+...+.+........+.-+.+.|++.-+. ..+..+-++|+.|-++.+...|..+.... ..+ T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~--s~l 136 (392) T protein:vir:10 59 RNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DAL 136 (392) T ss_pred hhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhh--hhh Confidence 111111111111122222234444443221 11112335678888988888885544443 356 Q ss_pred hhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) .+.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++--..+|.-+ +.++..|.+....+ T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~ 214 (392) T protein:vir:10 137 EQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTK 214 (392) T ss_pred hhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHH Confidence 66666666665444444333222 22456899999876 56799999999999998888888753 23456678888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .--..+.+.++.+++.|+..... ...+-+|.|.+++. ..+..+|-.....+||+. T Consensus 215 ~l~~~i~~~~d~~~~~g~g~~~~----~~~~~~d~i~~~~~---------------------~~l~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 215 WLGKKSKVTRNVLILGVIEKLTK----QAIKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQD 269 (392) T ss_pred HHHHHHHHHHHHHHhhccccccc----cCccCHHHHHHHHH---------------------HhhhhhhccCCEEEEcHH Confidence 88999999999999999876432 12233333333221 112223333344899999 Q ss_pred HHhhHHHhhc--CCceEEeecCCCcceeeeeccceeecCCccccCCC-EeecccccccccccccCCCCCCc--------- Q lcl|NC_020871. 224 VQADFVNQQL--SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGS-TVMENEQILDERILALPTAPQQA--------- 291 (468) Q Consensus 224 v~a~~~~~~~--~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs-~i~~~~n~l~~~~~~~p~ap~~~--------- 291 (468) +.+.+.. .- .++.. .+++......+ .|.|. .++..++- .+..+...... T Consensus 270 ~~~~L~~-lkd~~G~~l-~~~~~~~~~~~-------------tllG~~~v~~~~~~----~~~~~~~~~~~~~~~~gdfs 330 (392) T protein:vir:10 270 GFNYLDK-LKDKDGKYI-LQSDPTQKNKK-------------LFAGTNPVVVVSNR----FLKSKGTTAKKAPLIIGDLK 330 (392) T ss_pred HHHHHHH-hhccCCCeE-eecCccCCccc-------------cccCcccEEEeccc----ccCCCcccCCceEEEEEehh Confidence 9999843 32 33333 33332111100 11111 11100000 00000000000 Q ss_pred ---------ceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccce Q lcl|NC_020871. 292 ---------KVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQF 359 (468) Q Consensus 292 ---------~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~ 359 (468) .++-.........|.. ....|++..--+-.--.+..+ +.++++..+.. .+|++ T Consensus 331 ~~~~i~~~~~~~~~~~~~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~-----------~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 331 EAIVLFKREDMELASTDVGGKAFTR---NTLDLRAIQRDDVQMWDNEAA-----------VYGEIDLSAPV-EQPQG 392 (392) T ss_pred ceEEEEeecceEEEEeccccchhhc---CceEEEEEEeeccEEecccce-----------EEEEecccccc-cCCCC Confidence 0000000000000100 011122221111000001111 11222111111 01211 No 111 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=83.34 E-value=0.069 Score=26.94 Aligned_cols=310 Identities=14% Similarity=0.011 Sum_probs=126.3 Q ss_pred CCCcccchhh---cccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhh Q lcl|NC_020871. 1 MPKNNKEEEV---KEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTV 77 (468) Q Consensus 1 ~~~~~~~~~~---~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv 77 (468) ..+.+..-.. ++.-..+..+.+.+..++| +-++|+.|-++.+.+.|.... .+...+++.+...++.+.+ T Consensus 51 ~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~------~~~~gg~~vP~~~~~~I~~~l--~~~s~i~~~~~v~~~~~~~ 122 (377) T protein:vir:98 51 EEEMERMFDLRDKNRELTAEEIKFFNDIDKNV------GGKDKFKLLPEETMVQVFDDL--VAEHPLLKVINFKNTSLRL 122 (377) T ss_pred HHHHHHHHHhccCCcccCHHHHHHHHHHHhcc------CCCCCccccCHHHHHHHHHHH--HHhhhhhhheeeEecCcce Confidence 0000000000 0111122233444444433 234677788888777764322 2223555555555554433 Q ss_pred hccceeeeecccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDVYMQHGKVGHTRFTREIGV-APVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWA 156 (468) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a 156 (468) ++.+.. +.+.+.+++|.+. ++..++.+.+.....+=|+.--.+|.-+ |.++..|.+....+.--.++++.++.+ T Consensus 123 -~~~~~~---~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~el-L~ds~~~ie~~i~~~la~~~a~~~~~a 197 (377) T protein:vir:98 123 -KALTAE---TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWIKQFITEQLKEAIAVALELA 197 (377) T ss_pred -EEEEec---CCcceeEeecccccCcccCccceeEeecceeEEeeecccHHh-hhccHhHHHHHHHHHHHHHHHHHHhhc Confidence 344333 3444557899875 4578999999999998888776666554 456888999999999999999999999 Q ss_pred HhhcccccccCCCCCCCccccchhhhcCc-----cceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHh Q lcl|NC_020871. 157 SFFGDSDLSDSPEPQAGLEFDGLAKLINQ-----DNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQ 231 (468) Q Consensus 157 ~f~Gd~~l~~~~~~~~gleFDGl~~li~~-----~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~ 231 (468) ++.||-. + |--||.+-+.. ....++-+.....+.|-.+.-.....|..--...|...+.+..... T Consensus 198 ~i~G~G~-----~-----qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~kl 267 (377) T protein:vir:98 198 IVKGDGL-----L-----QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRP 267 (377) T ss_pred eEeccCC-----C-----cceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhh Confidence 9999942 1 22345553321 1222333333322222221111111111111123333333333111 Q ss_pred h-cCCceEEeecCCCcceeeeeccceeecCCc-ccc--CCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCc Q lcl|NC_020871. 232 Q-LSKQTQLVRDNGNNVSVGFNIQGFHSARGF-IKL--HGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRA 307 (468) Q Consensus 232 ~-~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~-i~l--~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~ 307 (468) - ..++.+.+ .|....-.-........+.|. ..+ +|-.+..++. +| .- . ..- T Consensus 268 kd~~G~~i~~-~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~--------~p---~~-~----i~f-------- 322 (377) T protein:vir:98 268 LKIAGQVKLI-LNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLA--------VE---TG-K----AIA-------- 322 (377) T ss_pred hccCCceEEE-ecccchhhccccccccCCCCccccccCCCceEEecCC--------CC---cc-c----EEE-------- Confidence 1 12222222 221110000000000011111 000 1112222111 11 00 0 000 Q ss_pred ccceeEE-EEEEEEcccCCcc-cccceeeeeeccCcceEEEEEeecCCCcc--cceEEEEeecCC Q lcl|NC_020871. 308 EDLAAHE-YKVVVSSDDAESI-ASEVATATVTAKDDGVKLEIELAPMYSSR--PQFVSIYRKGAE 368 (468) Q Consensus 308 ~~~~~y~-YkVtavn~~GES~-aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~--~~~y~IYR~~~~ 368 (468) +-++ |.+ +.+.|-+. .|...- +.. ...+-..+.=+.|.+ ++-.+|+-=+.| T Consensus 323 ---gdf~~Y~i--~~r~~~~i~~~~~~~----~~~-d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 323 ---FVANRYDA--FMATASTIEEYDQTF----AME-DLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ---EEecceeE--EeecceEEEeechhh----hhc-CceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 1122 322 33333211 111111 000 111111222222222 122444432211 No 112 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=82.38 E-value=0.077 Score=26.68 Aligned_cols=299 Identities=12% Similarity=0.155 Sum_probs=132.9 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccch---hhhhhHhhhhhhccccc-cchhhhcccchhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRR---EFLDDQISMLTWTENDL-TFYKDIAKKPATST 76 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~---esld~~i~~L~~~~~~f-~~~~~i~k~~~~st 76 (468) |-. -.-|.+.- .......--.++.-|+..+.+.+-. |-+|+.+....+..-.+ .|+.-...-.+-.. T Consensus 1 ~~~----~~~~~~~~-----~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~ 71 (319) T protein:vir:10 1 MTT----KKFDEADK-----SNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDK 71 (319) T ss_pred CCC----cchhHHhh-----HHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceE Confidence 321 11111111 1111221123366677777778877 55666665544433211 34432223333322 Q ss_pred hhccceeeeeccccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 77 VAKYDVYMQHGKVGHTRFTRE-IGVAPVSDPNIRQKTVNMKFASDTKNISIAAGL--VNNIQDPMQILTDDAIVNIAKTI 153 (468) Q Consensus 77 v~ey~~~~~hG~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--v~~~~Dp~~~~~~~ai~~~~~~~ 153 (468) ...|..+.. .|....++. ....+..|.++.|++..+..++..+.++..--. ...=.+..+..-..|.+.+.+.. T Consensus 72 ~~~~~~~~~---~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~ 148 (319) T protein:vir:10 72 TFEYMTFDK---VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLV 148 (319) T ss_pred EEEeeeecc---ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh Confidence 333444443 344444444 444578899999999999999999999863222 22234555677777888888888 Q ss_pred HHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCC---CC-H---HHHhhhh---hhhhhccCceEEEecCHH Q lcl|NC_020871. 154 EWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGAS---LT-E---SLLNQAA---VMISKGYGTPTDAYMPVG 223 (468) Q Consensus 154 e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~---ls-~---~~l~~~a---~~i~~~fG~~td~~m~~~ 223 (468) ...+||||+.++ +-||.+-=+-...-...+.. -+ + +.|+.+- ...++++-.++.+.||+. T Consensus 149 n~i~f~G~~~~g----------~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~ 218 (319) T protein:vir:10 149 NRLVFKGSAPHK----------IVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPS 218 (319) T ss_pred ceEEEeeccccc----------ceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHH Confidence 899999997753 34554421111111122222 12 2 2344332 223557788999999999 Q ss_pred HHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCC Q lcl|NC_020871. 224 VQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKG 303 (468) Q Consensus 224 v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g 303 (468) ....+.. .. ++ +|..+-.++...+. +-.| ...| . .. + +++.| T Consensus 219 ~~~~L~~-------~~--~~-----~~~t~l~~lk~~~~----~l~I-----------~~~p---e---l~-~--ag~~g 260 (319) T protein:vir:10 219 MRKVLAI-------RM--PE-----TTMSYLDYFKSQNS----GIEI-----------DSIA---E---LE-D--IDGAG 260 (319) T ss_pred HHHhhhc-------cc--CC-----CCeeHHHHHHHhcC----CceE-----------EEee---e---ec-c--cCCCc Confidence 8877721 11 11 23444333222110 0001 0010 0 00 0 11112 Q ss_pred CcCcccceeEEEEEEEEcccCCcccccceee---eeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEE Q lcl|NC_020871. 304 QFRAEDLAAHEYKVVVSSDDAESIASEVATA---TVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARV 378 (468) Q Consensus 304 ~~~~~~~~~y~YkVtavn~~GES~aS~~vt~---Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv 378 (468) + -.. |+. .++-+. .+-++.. ..++...+-..++....-.+ .+.||| +..+.+..| | T Consensus 261 ~--------~~~-v~y-~~~~~~-~~~~v~~~~~~~~~e~~~l~~~~~~~~r~~----Gv~i~~---P~ai~~~dG-I 319 (319) T protein:vir:10 261 T--------KGV-LVY-EKNPMN-MSIEIPEAFNMLPAQPKDLHFKVPCTSKCT----GLTIYR---PMTIVLITG-V 319 (319) T ss_pred c--------eEE-EEE-ecCCce-EEEecCcceeeeeeeecCceEEEeeeeeeE----EEEEEc---cceeEeeec-C Confidence 1 111 111 111111 1101100 11111122222222222122 245555 333444444 1 No 113 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=82.35 E-value=0.078 Score=26.67 Aligned_cols=294 Identities=10% Similarity=0.039 Sum_probs=133.9 Q ss_pred CCCcccchhh--------------------c-----ccChhhHHHHH-------------------HHHhhcccccCccc Q lcl|NC_020871. 1 MPKNNKEEEV--------------------K-----EVNLNSVQEDA-------------------LKSFTTGYGITPDT 36 (468) Q Consensus 1 ~~~~~~~~~~--------------------~-----~~n~~~~~e~~-------------------~Ksf~agy~~~p~~ 36 (468) +|........ + .....+.-.++ .+++.++. T Consensus 288 ~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t------ 361 (632) T protein:vir:96 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKT------ 361 (632) T ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhccc------ Confidence 1110000000 0 00001111111 22333322 Q ss_pred ccCccccch-hhhhhHhhhhhhccccccchhhhcccchhhhh--hccceeeeeccccccccccccccccccCcceEEEEE Q lcl|NC_020871. 37 QTDAGALRR-EFLDDQISMLTWTENDLTFYKDIAKKPATSTV--AKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTV 113 (468) Q Consensus 37 ~~~gaALr~-esld~~i~~L~~~~~~f~~~~~i~k~~~~stv--~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~ 113 (468) ..+|+.|-. +.+..++..+-.. -..+..+.-+.+.... ..|.+++ +.+...+++|++..+.+++.+.+.+. T Consensus 362 ~~~gg~lvp~~~~~~~iie~lr~---~s~i~~l~~~~~~~~~g~~~ip~~~---~~~~a~wv~E~~~~~~s~~~f~~i~l 435 (632) T protein:vir:96 362 AGKGGELVATELLSEEFIDILRN---KAIIGQMGARMLPGLVGDVDIPKKT---SGANFYWIGEDEDVQDSDFDFTTLSF 435 (632) T ss_pred ccccccccccccchHHHHHHHhh---cchhhhhcceEeecCCcceEEEEEe---CCceeEeecCCccccccccceeeEEe Confidence 223555555 3344444332211 1222223222222111 2344555 33345689999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccC Q lcl|NC_020871. 114 NMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARG 193 (468) Q Consensus 114 ~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG 193 (468) ..|=++.-..+|..+= .++.-|.+....++-...++..++.++++|+..-+ +--||.+.... +.+...+ T Consensus 436 ~~~k~~~~v~iS~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~---------~p~Gi~~~~~~-~~~~~~~ 504 (632) T protein:vir:96 436 SPKTIAGAVPVTRKLR-KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAN---------DPVGLLNMTGV-PALTYPA 504 (632) T ss_pred eeeEEEEehhhHHHHH-hccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCC---------ccceeeecccc-cceeccc Confidence 9999998878877652 24455778888899999999999999999974211 22355543332 2344445 Q ss_pred CCCCHHHHhhhhhhhhhccCceE--EEecCHHHHhhHHHh-hcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCE Q lcl|NC_020871. 194 ASLTESLLNQAAVMISKGYGTPT--DAYMPVGVQADFVNQ-QLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGST 269 (468) Q Consensus 194 ~~ls~~~l~~~a~~i~~~fG~~t--d~~m~~~v~a~~~~~-~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~ 269 (468) ..++-+.|..+...+...++... -..|++.+...+..- ..+.+ +.+.+++ .-.|.++ +.+.. +. .+.. T Consensus 505 ~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~---~l~G~pv--~~s~~--ip-~~~~ 576 (632) T protein:vir:96 505 GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN---EVNGYRA--EASNQ--IP-ADTW 576 (632) T ss_pred ccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecCC---eecccce--Eeccc--cc-cCcE Confidence 55666666555555555565543 246788877766432 23332 3333322 2244444 11111 11 2344 Q ss_pred eecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccce Q lcl|NC_020871. 270 VMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVA 332 (468) Q Consensus 270 i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~v 332 (468) ++.+..-..-..+.-... ....-....++. -.-.....+-+.+....+-......+ T Consensus 577 ~~gd~s~~~i~~~~~~~i-----~~~~~~~~~~~~--v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 577 IFGDWSQIVIAMWGVLDL-----KVDPYTKAASDG--LVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EEeecceEEEEEecceEE-----EEccccccccCc--eEEEEEeecCceeechhhhhheeecC Confidence 554432211111110000 000000000110 00112344444444444322222111 No 114 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=81.85 E-value=0.082 Score=26.54 Aligned_cols=307 Identities=12% Similarity=0.109 Sum_probs=130.7 Q ss_pred CCCcccchhhcccC-h--hhHHHHHHHHhhccc----ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch Q lcl|NC_020871. 1 MPKNNKEEEVKEVN-L--NSVQEDALKSFTTGY----GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA 73 (468) Q Consensus 1 ~~~~~~~~~~~~~n-~--~~~~e~~~Ksf~agy----~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~ 73 (468) +|..+.+......+ . ....++|.+.+..+- .+...+-.+|+.+-++-+..+|..+..... .+.+.+...++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~--~l~~~~~~~~~ 148 (389) T protein:vir:10 71 EPKDDGSKKGTDLSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVV--DLSTLVTKTPV 148 (389) T ss_pred cccccccccccccchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhh--hHHhhcceeec Confidence 33322221111111 1 111122222222221 122233456788888888888755544433 45555666666 Q ss_pred hhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020871. 74 TSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKT 152 (468) Q Consensus 74 ~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~ 152 (468) .+.--+|.+....+ +...+++|++... .+++.+......++-++.-..+|.-+ +.++..|.+....+.-...+... T Consensus 149 ~~~~~~~~~~~~~~--~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~~~ 225 (389) T protein:vir:10 149 TTPKGTYPILKRAT--DRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEA-IADSAVDLTALVGQSIKEKSVNT 225 (389) T ss_pred cCCeeEEEEEecCC--CccccccccccccccccccceeeeeeheeeEeeehhhHHH-HhhhhHHHHHHHHHHHHHHHHHH Confidence 66655666555322 3345788887554 78999999999999998888888754 34566778888888888889999 Q ss_pred HHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 153 IEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 153 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) ++.++..|.......... ...-.|-|..+ .+.....+|+ .-++||+.+.+.+.. . T Consensus 226 ~~~~i~~g~~~~~~~~~~-~~~~~d~l~~~---------------------~~~~~~~~~~--a~~~~n~~~~~~L~~-l 280 (389) T protein:vir:10 226 YNAMIAPVLQSFTAKKTT-TDTLVDSLKHI---------------------LNVDLDPAYS--RALVVTQSLFNTLDT-L 280 (389) T ss_pred HHHHHhhhhccccccccc-ccccHHHHHHH---------------------HHhhhhhhhC--cEEEecHHHHHHHHH-h Confidence 999998887654211100 00111111111 1111222332 347899999888843 3 Q ss_pred cC--CceEEeecCCCcceeeeeccceeecCCccccCCCEee-cccccccccccccCCCCCCcceeEEecCCCCCCcCccc Q lcl|NC_020871. 233 LS--KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVM-ENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAED 309 (468) Q Consensus 233 ~~--~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~-~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~ 309 (468) -+ ++..++ ++..+...+ .+.-.|.|.-|+ -++.+ .|.. .++ ..-- T Consensus 281 kd~~G~~i~~-~~~~~~~~~---------~~~~~l~G~pV~~~~~~~----------~~~~-----------~~~-~~~~ 328 (389) T protein:vir:10 281 KDKNGRYLLH-DASDSITDG---------TAKGTILGVPVYVVGDTL----------LGSL-----------AGD-QKAF 328 (389) T ss_pred hccCCCeeee-cCccccccc---------ccccccccceeEEecccc----------cCCC-----------CCc-eEEE Confidence 33 333333 332221111 111123333221 11111 0000 000 0000 Q ss_pred ceeEEEEEEEEcccCCcc-cccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCe Q lcl|NC_020871. 310 LAAHEYKVVVSSDDAESI-ASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNV 387 (468) Q Consensus 310 ~~~y~YkVtavn~~GES~-aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t 387 (468) .|.++..+..+++.|-+. -+....-+ ..+.+.. -+. ..+.+. ....+.-+..++.++. +. T Consensus 329 ~gd~~~~~~~~~~~~~~i~~~~~~~~~-------~~~~~~~-r~d------~~~~~~--~a~~~~~~~~~~~~~~--~~ 389 (389) T protein:vir:10 329 VGDLKRGVLFTDRQQVTLAWEDSKIYG-------KYLGAAF-RFG------VQKADS--KAGYFVTNTDVPGSAL--GK 389 (389) T ss_pred EeeccccEEEEeecceEEEeecccccc-------ceEEEEE-Eec------cEEecc--cceEEEEeeccCCCCC--CC Confidence 011221111111111000 00000000 0000000 000 111111 1122222222222211 10 No 115 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=79.96 E-value=0.099 Score=26.08 Aligned_cols=293 Identities=10% Similarity=-0.025 Sum_probs=124.7 Q ss_pred CCCcccchhhcccChhhHHHHH-----HHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDA-----LKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATS 75 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~-----~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~s 75 (468) ....+.........-...+..+ .++..+...-+..+.++++.+-++.+...|..+. .+...+.+-+...+..+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~--~~~~~i~~~~~~~~~~~ 148 (379) T protein:vir:10 71 KAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNP--SQMLNVSDIVGAVSISG 148 (379) T ss_pred cccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhH--HhhhhHHhhceeeeccC Confidence 0000000000000000001100 1111111111112233444455666666653322 22334544455555555 Q ss_pred hhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 76 TVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEW 155 (468) Q Consensus 76 tv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~ 155 (468) .--+|.+.+..++. ...++.|++..+..++.+.+....++=++.-..+|.-+ +.+. .+.+....+.-...+++.++. T Consensus 149 ~~~~~~~~~~~~~~-~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~D~-~~l~~~i~~~la~~~~~~~~~ 225 (379) T protein:vir:10 149 GTYTFVRENGAGEG-AIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKM-ANNL-PFLTSFIPNALRRDYAKAENA 225 (379) T ss_pred CceEEEEeecCCCc-ccccccCCccccccccceeeeEeeeeeEEeeehhhHHH-HhhH-HHHHHHHHHHHHHHHHHHHHH Confidence 44566666644332 24479999999999999999999999999888888765 3333 456666666677788888888 Q ss_pred HHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh-cC Q lcl|NC_020871. 156 ASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ-LS 234 (468) Q Consensus 156 a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~-~~ 234 (468) +++-|+..-+. . + ..... ...+.+.|..+...+...+-.++-+.|++.+.+.+...- -. T Consensus 226 ~~~~g~~~~~~--~---~-----~~~~~----------~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~ 285 (379) T protein:vir:10 226 AFNAVLAANAT--A---S-----TEIIT----------NKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSV 285 (379) T ss_pred HHhcccccccc--c---c-----ccccc----------CcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccC Confidence 88877643110 0 1 11111 112344555555455566777778999999888774332 12 Q ss_pred CceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEE Q lcl|NC_020871. 235 KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHE 314 (468) Q Consensus 235 ~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~ 314 (468) ++...+ ++... ...+...|.|--++.++.. | . |+. -.+-++ T Consensus 286 G~~l~~-~~~~~-----------~~~~~~~l~G~pvv~s~~~--------~---a-------------g~~---~~gdf~ 326 (379) T protein:vir:10 286 GAGYGL-PGVVT-----------QDNGVLRINGIPLFRATWL--------A---A-------------NKY---YVGDWT 326 (379) T ss_pred Cceecc-CCccC-----------CCCCcceecceeeEecCCC--------C---C-------------Cce---EEeecc Confidence 233322 22110 0111123333333322110 0 0 000 001111 Q ss_pred EEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcc---cce-------EEEEeecCCCceeEEEEEE Q lcl|NC_020871. 315 YKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSR---PQF-------VSIYRKGAETGLFYLIARV 378 (468) Q Consensus 315 YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~---~~~-------y~IYR~~~~~G~f~~igrv 378 (468) +.+... +. +.+|.+++.....+. ..+ ..|.| .+.-.+.-+.-| T Consensus 327 ~~~~~~-~~------------------~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~--p~a~v~~~~~~~ 379 (379) T protein:vir:10 327 RVTKVT-TE------------------GLSLEFSEVEGTNFVKNNITARIEAQVALAVEQ--PAALIFGDFTAV 379 (379) T ss_pred cEEEEE-Ee------------------ceEEEEeecccccccCCcEEEEEEEEeccEEec--CccEEEEEecCC Confidence 111111 11 111222111110010 000 01111 011111111111 No 116 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=78.79 E-value=0.11 Score=25.82 Aligned_cols=305 Identities=14% Similarity=0.066 Sum_probs=127.9 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccc-------cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYG-------ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA 73 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~-------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~ 73 (468) .+.........++.......+..+.+..+-. ..-.+-.+|+.|-++.+..+|..+. .+...+.+-+...+. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~--~~~~~i~~l~~~~~~ 190 (497) T protein:vir:78 113 KFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQL--FYELSLADLISSRPV 190 (497) T ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHH--HhhhhHHhhcccccc Confidence 0000000000011111111111111111110 0001223466677777777775543 334466666666666 Q ss_pred hhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 74 TSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTI 153 (468) Q Consensus 74 ~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~ 153 (468) .+---.|.+.+ ++.+...+|+|++..+.+|+.+.......+=++.--.+|.-+ |.++ .+.+....+.-...+++.+ T Consensus 191 ~~~~~~~~~~~--~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~el-l~d~-~~l~~~i~~~l~~~i~~~~ 266 (497) T protein:vir:78 191 TSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRLLEGIQRKE 266 (497) T ss_pred CCCceEEEEEc--CCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHH-HHhH-HHHHHHHHHHHHHHHHHHH Confidence 65433455444 333456689999999999999999999999999988888764 2333 4677888888889999999 Q ss_pred HHHHhhcccccccCCCCCCCccccchhhhcCcc----------------ceeec-----cCCCCC--------------- Q lcl|NC_020871. 154 EWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD----------------NVHDA-----RGASLT--------------- 197 (468) Q Consensus 154 e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~----------------nviDa-----rG~~ls--------------- 197 (468) +.++++||-.- +..||.+..... +.++. .+.... T Consensus 267 d~~~l~G~G~~----------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (497) T protein:vir:78 267 EVQLLAGGGYP----------GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTG 336 (497) T ss_pred HHHhhcCCCcc----------cccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhh Confidence 99999998321 123333321110 00000 000000 Q ss_pred ------------------HHHHhhhhhhhhhc-cCceEEEecCHHHHhhHHHhhcC--CceEEeecCCCcceeeeeccce Q lcl|NC_020871. 198 ------------------ESLLNQAAVMISKG-YGTPTDAYMPVGVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQGF 256 (468) Q Consensus 198 ------------------~~~l~~~a~~i~~~-fG~~td~~m~~~v~a~~~~~~~~--~qr~v~~~n~~~~~~G~~v~~~ 256 (468) ...+..+-..+... +-.++-..|++.+.+.+. ..-+ +++..+++.++ ..|..+ T Consensus 337 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~-~lkd~~G~~i~~~~~~~--~~~~~~--- 410 (497) T protein:vir:78 337 AAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR-LTKDANGQYMGGNFFGN--AYGNPV--- 410 (497) T ss_pred hhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHH-HhhcCCCceeccCcccc--cccccc--- Confidence 11122222222223 334444667887777773 3322 23333222111 111111 Q ss_pred eecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcc--------- Q lcl|NC_020871. 257 HSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESI--------- 327 (468) Q Consensus 257 ~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~--------- 327 (468) .+...|-|.-++..+. .|.. +.. -|.|+ ...|.+ +++.+-+. T Consensus 411 ---~~~~~l~G~pV~~t~~-----------~~~~-----~~~---~Gd~~-----~~~~~i--~~r~~~~v~~~~~~~~~ 461 (497) T protein:vir:78 411 ---NGGKNIWGVPVVTTPL-----------IPLG-----TIL---VGHFA-----PSVIQT--ARREGVTMQMTNSNGTD 461 (497) T ss_pred ---cCCceeeceeeEecCC-----------CCCC-----ceE---Eeecc-----cceEEE--EEecccEEEeecccchh Confidence 0011111211111110 0000 000 01111 111211 11111110 Q ss_pred -cccceeeeee------ccCcceEEEEEeecCCCcc Q lcl|NC_020871. 328 -ASEVATATVT------AKDDGVKLEIELAPMYSSR 356 (468) Q Consensus 328 -aS~~vt~Tv~------a~~~g~~ltIT~~~~~ga~ 356 (468) -++-+.+..- ......=+.|++.+...++ T Consensus 462 f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 462 FVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred hhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 0111111100 0001111233444444432 No 117 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=78.79 E-value=0.11 Score=25.82 Aligned_cols=305 Identities=14% Similarity=0.066 Sum_probs=127.9 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccc-------cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccch Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYG-------ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA 73 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~-------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~ 73 (468) .+.........++.......+..+.+..+-. ..-.+-.+|+.|-++.+..+|..+. .+...+.+-+...+. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~--~~~~~i~~l~~~~~~ 190 (497) T protein:vir:10 113 KFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQL--FYELSLADLISSRPV 190 (497) T ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHH--HhhhhHHhhcccccc Confidence 0000000000011111111111111111110 0001223466677777777775543 334466666666666 Q ss_pred hhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 74 TSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTI 153 (468) Q Consensus 74 ~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~ 153 (468) .+---.|.+.+ ++.+...+|+|++..+.+|+.+.......+=++.--.+|.-+ |.++ .+.+....+.-...+++.+ T Consensus 191 ~~~~~~~~~~~--~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~el-l~d~-~~l~~~i~~~l~~~i~~~~ 266 (497) T protein:vir:10 191 TSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRLLEGIQRKE 266 (497) T ss_pred CCCceEEEEEc--CCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHH-HHhH-HHHHHHHHHHHHHHHHHHH Confidence 65433455444 333456689999999999999999999999999988888764 2333 4677888888889999999 Q ss_pred HHHHhhcccccccCCCCCCCccccchhhhcCcc----------------ceeec-----cCCCCC--------------- Q lcl|NC_020871. 154 EWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD----------------NVHDA-----RGASLT--------------- 197 (468) Q Consensus 154 e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~----------------nviDa-----rG~~ls--------------- 197 (468) +.++++||-.- +..||.+..... +.++. .+.... T Consensus 267 d~~~l~G~G~~----------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (497) T protein:vir:10 267 EVQLLAGGGYP----------GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTG 336 (497) T ss_pred HHHhhcCCCcc----------cccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhh Confidence 99999998321 123333321110 00000 000000 Q ss_pred ------------------HHHHhhhhhhhhhc-cCceEEEecCHHHHhhHHHhhcC--CceEEeecCCCcceeeeeccce Q lcl|NC_020871. 198 ------------------ESLLNQAAVMISKG-YGTPTDAYMPVGVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQGF 256 (468) Q Consensus 198 ------------------~~~l~~~a~~i~~~-fG~~td~~m~~~v~a~~~~~~~~--~qr~v~~~n~~~~~~G~~v~~~ 256 (468) ...+..+-..+... +-.++-..|++.+.+.+. ..-+ +++..+++.++ ..|..+ T Consensus 337 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~-~lkd~~G~~i~~~~~~~--~~~~~~--- 410 (497) T protein:vir:10 337 AAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR-LTKDANGQYMGGNFFGN--AYGNPV--- 410 (497) T ss_pred hhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHH-HhhcCCCceeccCcccc--cccccc--- Confidence 11122222222223 334444667887777773 3322 23333222111 111111 Q ss_pred eecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcc--------- Q lcl|NC_020871. 257 HSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESI--------- 327 (468) Q Consensus 257 ~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~--------- 327 (468) .+...|-|.-++..+. .|.. +.. -|.|+ ...|.+ +++.+-+. T Consensus 411 ---~~~~~l~G~pV~~t~~-----------~~~~-----~~~---~Gd~~-----~~~~~i--~~r~~~~v~~~~~~~~~ 461 (497) T protein:vir:10 411 ---NGGKNIWGVPVVTTPL-----------IPLG-----TIL---VGHFA-----PSVIQT--ARREGVTMQMTNSNGTD 461 (497) T ss_pred ---cCCceeeceeeEecCC-----------CCCC-----ceE---Eeecc-----cceEEE--EEecccEEEeecccchh Confidence 0011111211111110 0000 000 01111 111211 11111110 Q ss_pred -cccceeeeee------ccCcceEEEEEeecCCCcc Q lcl|NC_020871. 328 -ASEVATATVT------AKDDGVKLEIELAPMYSSR 356 (468) Q Consensus 328 -aS~~vt~Tv~------a~~~g~~ltIT~~~~~ga~ 356 (468) -++-+.+..- ......=+.|++.+...++ T Consensus 462 f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 462 FVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred hhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 0111111100 0001111233444444432 No 118 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=77.48 E-value=0.12 Score=25.55 Aligned_cols=320 Identities=12% Similarity=0.077 Sum_probs=133.7 Q ss_pred CCCcccchhhcccCh---hhH-HH---HHHHHh--------------hcc------cccCcccccCccccchhhhhhHhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNL---NSV-QE---DALKSF--------------TTG------YGITPDTQTDAGALRREFLDDQIS 53 (468) Q Consensus 1 ~~~~~~~~~~~~~n~---~~~-~e---~~~Ksf--------------~ag------y~~~p~~~~~gaALr~esld~~i~ 53 (468) .+..+.++....... ... .+ .+.|.+ ... -.....+..||+++-+|.+-..|. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~ 170 (466) T protein:vir:80 91 EPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLR 170 (466) T ss_pred hhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHH Confidence 232222211111100 000 00 001100 000 011223345667788887766664 Q ss_pred hhhhccccccchhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_020871. 54 MLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNN 133 (468) Q Consensus 54 ~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 133 (468) .... +...+.+.+...++..++ .+.+ ++....+.+++|++..+..|+.+.+....++=++.--.+|.-+- .++ T Consensus 171 ~~l~--~~~~l~~~~~v~~~~g~~-~~~~---~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds 243 (466) T protein:vir:80 171 DNMH--RYSKLISKVRLRPLKGTA-RQNI---AGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTL-EDS 243 (466) T ss_pred Hhhh--hhhhhhhheeeeecCcee-Eeee---ecCCcceeecccccccccccccccceeecceeeeeehhhhHHHH-hcc Confidence 4322 222455655555554432 3333 33444566899999999999999998888887776655555432 346 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcC-------------------ccceeeccC- Q lcl|NC_020871. 134 IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLIN-------------------QDNVHDARG- 193 (468) Q Consensus 134 ~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~-------------------~~nviDarG- 193 (468) ..|.+....+.-...++..++.+++.||-. +. + -||.+-.. ....+++.. T Consensus 244 ~~~l~~~i~~~la~~~~~~~~~ail~G~G~-----~~--P---~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (466) T protein:vir:80 244 DLNLADEILDAIGQAIGFALDKAILYGTGT-----KM--P---VGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPT 313 (466) T ss_pred hHHHHHHHHHHHHHHHHHHHhhheeeccCC-----CC--c---ceeeecccccccccccccccccccccchhhhhhhhhh Confidence 678889999999999999999999999842 11 1 14443221 111111111 Q ss_pred -CCCC--HHHHhhhhhhhhhccCceEEEecCHHH-HhhHHHhhc----CCceEEeecCCCcceeeeeccceeecCCcccc Q lcl|NC_020871. 194 -ASLT--ESLLNQAAVMISKGYGTPTDAYMPVGV-QADFVNQQL----SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKL 265 (468) Q Consensus 194 -~~ls--~~~l~~~a~~i~~~fG~~td~~m~~~v-~a~~~~~~~----~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l 265 (468) ..-. -..+..+.......++.+.++|++... ...+....+ .++.+..+ +++..-.|.+|- .+. .+ . T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~-~~~~~i~G~pvv--~s~--~~-~ 387 (466) T protein:vir:80 314 GKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASL-NNTMPIVGGDIV--ILD--FI-P 387 (466) T ss_pred ccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccC-CCccccccccee--ecC--cc-C Confidence 0000 000111122234457888888776543 333322211 11111111 111111222220 000 00 0 Q ss_pred CCCEeeccccc-ccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceE Q lcl|NC_020871. 266 HGSTVMENEQI-LDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVK 344 (468) Q Consensus 266 ~gs~i~~~~n~-l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ 344 (468) .|..++++..- +...+ . .++-... ....|.. +..-|++..--+..=-.+...+.++++....+++ T Consensus 388 ~~~~~~g~~~~y~i~~r-------~--~~~i~~~--~~~~f~~---d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~ 453 (466) T protein:vir:80 388 DNDIIGGYGSLYLLAER-------A--DIKLAQS--EHVRFIE---DQTVFKGTARYDGKPVFGEGFVAVNIANANPTTS 453 (466) T ss_pred ccceeeeccccEEEEee-------c--ceEEEec--hhhhhhc---CcEEEEEEEEEccEEeccCceEEEEecCCCcccc Confidence 11111111100 00000 0 0000100 0011111 1222444332221111233345555554444444 Q ss_pred EEEEeecCCCcccce Q lcl|NC_020871. 345 LEIELAPMYSSRPQF 359 (468) Q Consensus 345 ltIT~~~~~ga~~~~ 359 (468) + +-.|..+..|+- T Consensus 454 ~--~~~~~~~~~~~~ 466 (466) T protein:vir:80 454 I--TFAPDEANVPEV 466 (466) T ss_pred e--eeecCcCcCCCC Confidence 4 333555554432 No 119 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=75.39 E-value=0.15 Score=25.14 Aligned_cols=307 Identities=13% Similarity=0.091 Sum_probs=127.8 Q ss_pred CCCcccch-hhcccChhhHHHHHHHHhhccc-----ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchh Q lcl|NC_020871. 1 MPKNNKEE-EVKEVNLNSVQEDALKSFTTGY-----GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPAT 74 (468) Q Consensus 1 ~~~~~~~~-~~~~~n~~~~~e~~~Ksf~agy-----~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~ 74 (468) .+...+.. ...........++|.+-+..+- .....+..+|+.|-++.+...|..+..... .+.+.+...++. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~~~~~ 151 (394) T protein:vir:10 74 VDNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVV--DLSTLVTKTPVT 151 (394) T ss_pred hhhhcccccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhh--hhhhhceeeecc Confidence 11111110 0111111112222322221111 011123455778888888888755444433 455555556665 Q ss_pred hhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 75 STVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTI 153 (468) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~ 153 (468) +.--+|..... +.+...+++|++... .+++.+.+....++=++.-..+|.-+ |.++..|.+....+.-...++..+ T Consensus 152 ~~~~~~~~~~~--~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~~~~ 228 (394) T protein:vir:10 152 TPKGTYPILKR--ATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEA-IADSAVDLTSLVGQSINEKSVNTY 228 (394) T ss_pred CCceEEEEEec--CCCccccccccccccccccccceeEEeeeeeeEeeehhHHHH-HhhhhHHHHHHHHHHHHHHHHHHH Confidence 54444554432 234456899987755 68899999999999888776666643 234566777888888888999999 Q ss_pred HHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhh-hhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 154 EWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQ-AAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 154 e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~-~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) +.++..|...-.. . +..+.. +.+.|.. .......+|. .-++||+.+.+.|.. . T Consensus 229 ~~~il~g~g~~~~--~--------~~~~~~-------------~~d~l~~~~~~~~~~~~~--a~~vmn~~~~~~l~~-l 282 (394) T protein:vir:10 229 NAMIAPVLQSFTA--K--------ATTTDT-------------LVDSLKHILNVDLDPAYS--RALVVTQSLFNTLDT-L 282 (394) T ss_pred HHHHhhccccccc--c--------cccccc-------------cHHHHHHHHHhhhhhhcc--CEEEecHHHHHHHHH-h Confidence 9999998854211 0 011111 1122222 1122223332 348999999888843 3 Q ss_pred cCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccc- Q lcl|NC_020871. 233 LSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDL- 310 (468) Q Consensus 233 ~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~- 310 (468) -+.+ |.+.+++..+...+ .+.-.|.|--++-.++- ..|.......+ .. |.|..... T Consensus 283 kd~~G~~i~~~~~~~~~~~---------~~~~~L~G~PV~~~~~~------~~~~~~~~~~i---~~----gd~s~~~~~ 340 (394) T protein:vir:10 283 KDKNGRYLLHDASDSITDG---------TAKGTVLGVPVYVVGDA------LLGSAAGDQKA---FV----GDLKRGVLF 340 (394) T ss_pred hccCCCeeeeccccccccC---------CcccccccceeEEeccc------ccCCCCCceEE---EE----eeccccEEE Confidence 3332 33333333221111 01112223322111100 00000000000 00 00000000 Q ss_pred e-eEEEEEEEEcccCCcc---cccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCce Q lcl|NC_020871. 311 A-AHEYKVVVSSDDAESI---ASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGL 371 (468) Q Consensus 311 ~-~y~YkVtavn~~GES~---aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~ 371 (468) . --.+.|-..+.....- +-.-+.+.+.....-+ .||.++...+++ ++.|. T Consensus 341 ~~~~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai~--~~~~~~~~~~~~---------~~~~~ 394 (394) T protein:vir:10 341 ADRQQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAGY--FVTNTDAASGST---------SGTGK 394 (394) T ss_pred EeecceEEEEecccccceeEEEEEEeccEEeccccEE--EEEeecccCCCC---------CCCCC Confidence 0 0000000000000000 0000011111111111 233334444422 23333 No 120 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=68.38 E-value=0.24 Score=24.00 Aligned_cols=301 Identities=13% Similarity=0.031 Sum_probs=121.4 Q ss_pred CC--CcccchhhcccC----hhhHHHHHHHHhhcccc--cCcccccCccccchhhhhhHhhhhhhccccccchhhhcccc Q lcl|NC_020871. 1 MP--KNNKEEEVKEVN----LNSVQEDALKSFTTGYG--ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKP 72 (468) Q Consensus 1 ~~--~~~~~~~~~~~n----~~~~~e~~~Ksf~agy~--~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~ 72 (468) .. ...+..+....- ....-..+.+.+..+-. ....+..+|+.|.++.+...|..+.. .-.+...+...+ T Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~~~---~~~l~~~~~~~~ 193 (437) T protein:vir:10 117 EEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEVHQ---FPRLGSLVRTES 193 (437) T ss_pred HHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHhhh---hhhhhhcceeEe Confidence 00 000000000000 00001122222222211 11123455777888888777655422 123444444445 Q ss_pred hhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHH Q lcl|NC_020871. 73 ATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAK 151 (468) Q Consensus 73 ~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~ 151 (468) ..+---+|......+ +-..+++|++... .+++.+.+.+..++=++.-..+|.-+ |.++..|......+.-...+.. T Consensus 194 ~~~~~~~~~~~~~~~--~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~~~ 270 (437) T protein:vir:10 194 VTTTTGKLPIFNNST--DLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQEL-ISDSSYDWQAELQSRLIELRDN 270 (437) T ss_pred eccCceeeEEeeccc--cccccccccccccccccccceeeeeehhheeeehhhhHHH-HhhhHHHHHHHHHHHHHHHHHH Confidence 544444455554333 2345788887665 78899999999888787766666643 3456667788888888899999 Q ss_pred HHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHh Q lcl|NC_020871. 152 TIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQ 231 (468) Q Consensus 152 ~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~ 231 (468) .++.+++.|+.+-. +........|.|..+++. .+..+|....-.+||+.+.+.|... T Consensus 271 ~~~~~i~~g~g~~~--~~~~~~~~~~~~~~~~~~---------------------~l~~~~~~~~~~~~~~~~~~~l~~l 327 (437) T protein:vir:10 271 TDDSLIITALTDGI--KKTTSTYLLGDLKKVLNV---------------------TLKPQDSAAASIVMSQSAYNLFDMA 327 (437) T ss_pred HHHHHHhhhhcccc--cccccccchhhHHHHHHh---------------------hhhhhhhcCCEEEEcHHHHHHHHHh Confidence 99999999986532 122122223333332211 1122232223478899998888443 Q ss_pred h-cCCceEEeecCCCc----ceeeeeccceeecC-Ccccc-CCC--Eeecccc--cccccccccCCCCCCcceeEEecCC Q lcl|NC_020871. 232 Q-LSKQTQLVRDNGNN----VSVGFNIQGFHSAR-GFIKL-HGS--TVMENEQ--ILDERILALPTAPQQAKVTATQEAG 300 (468) Q Consensus 232 ~-~~~qr~v~~~n~~~----~~~G~~v~~~~s~~-g~i~l-~gs--~i~~~~n--~l~~~~~~~p~ap~~~~vtat~~~~ 300 (468) - -.++..+ +++.+. .-.|.+|- ++.. ..... .|+ .++.+.. .+...+... .+ .... T Consensus 328 kd~~g~~~~-~~~~~~~~~~~l~G~pv~--~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~-------~~--~~~~- 394 (437) T protein:vir:10 328 TDAMGRPLL-QPNVTAATGYTLLGKTVV--IVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEI-------TG--QFQD- 394 (437) T ss_pred hccCCCeee-ccCccCCCCcccccceeE--EecccccCCcCCCceEEEEeeccccEEEEeeece-------EE--EEec- Confidence 1 2334443 333211 11333221 0000 00000 001 1122111 000000000 00 0000 Q ss_pred CCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCCcccceE Q lcl|NC_020871. 301 KKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) Q Consensus 301 ~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y 360 (468) ..-.|........+|-+..++..+ .+- |+.+.++..-.+++-+ T Consensus 395 ~~~~~~~~~~~~~r~d~~~~~~~a------~~~-----------l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 395 TYDIWYKQLGIFLRQNVVQASKDL------IVN-----------LTGKLKAVTVVQSTAV 437 (437) T ss_pred ccccccceeeEEEEEccEEecccc------eEE-----------EEeeccccccCCCCCC Confidence 000000000111222222222111 111 1111111110000001 No 121 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=67.96 E-value=0.24 Score=23.94 Aligned_cols=285 Identities=12% Similarity=0.079 Sum_probs=126.0 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc---------ccCcccccCccccchhhhhhHhhhhhhccccccchhhhccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY---------GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKK 71 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy---------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~ 71 (468) -++..+.....+.+.+. +++++++..+. ........+|+.+-.+..++.|..+... ..+++.+... T Consensus 73 ~~~~~~~~~~~~~~~~~--~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~---~~~l~~~~~~ 147 (390) T protein:vir:62 73 GLQGSGSGAQRSADVDD--DATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER---SAIMRGGATT 147 (390) T ss_pred hcccccccchhhcchHH--HHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhh---hhhhhhccee Confidence 11111111112221111 23333332221 1122233345666666666666554332 3344433322 Q ss_pred chhhhhh--ccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHH Q lcl|NC_020871. 72 PATSTVA--KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNI 149 (468) Q Consensus 72 ~~~stv~--ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~ 149 (468) .-.+.-. .+.+.. +.....+++|++..+.+++.+......++=++.-..+|.-+= .++..|.+....+.--..+ T Consensus 148 ~~~~~~~~~~~p~~~---~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i 223 (390) T protein:vir:62 148 FTTSDANPLDFTVIT---GRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFA-TDQVLDLVGFLVSDAGPAI 223 (390) T ss_pred eecCCCceeEEEEEc---CCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHH-hhhhHHHHHHHHHHHHHHH Confidence 2122222 233333 333566899999999999999999999988887777775442 3455677788888888999 Q ss_pred HHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc-ceeec-cCCCCCHHH-HhhhhhhhhhccCceEEEecCHHHHh Q lcl|NC_020871. 150 AKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDA-RGASLTESL-LNQAAVMISKGYGTPTDAYMPVGVQA 226 (468) Q Consensus 150 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nviDa-rG~~ls~~~-l~~~a~~i~~~fG~~td~~m~~~v~a 226 (468) ++.++.++++|+- ++ -||.+..... +.+.. ....++.+. ++.-..+ ..+|-.---.+|++.+.+ T Consensus 224 ~~~~d~~~l~G~G-------~p-----~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l-~~~~~~~a~~vmn~~~~~ 290 (390) T protein:vir:62 224 GDAMGRHFITGTG-------QP-----RGILTDASPATATFLATDTDSKVSDALIDLFHEV-PSAYRANAKYVVNDLRAA 290 (390) T ss_pred HHHHHhhhhccCC-------cc-----ccccccccccccceecccccccchHHHHHHHHhh-hhhhhcCCEEEEchHHHH Confidence 9999999999972 11 2555544322 22222 222344333 3322222 233432224799999998 Q ss_pred hHHHhhcCCc-eEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCc Q lcl|NC_020871. 227 DFVNQQLSKQ-TQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQF 305 (468) Q Consensus 227 ~~~~~~~~~q-r~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~ 305 (468) .|.. .-+.+ |.+.+++.... . .-.|.|.-++..++ .|..+-+- |.| T Consensus 291 ~L~~-lkd~~g~~l~~~~~~~g---~----------~~~l~G~Pv~~~~~-----------~p~~~i~~--------gd~ 337 (390) T protein:vir:62 291 QMRK-LKDANGQYLWQSGLTVG---A----------PSLFNGKVVETDDG-----------MPADKILF--------ADL 337 (390) T ss_pred HHHH-hhccCCCeeecCCcCCC---c----------cceecccceEEecC-----------CCCccEEE--------eec Confidence 8853 33332 33333332111 0 00122332222111 11110000 001 Q ss_pred Cc---c---------------cceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeec Q lcl|NC_020871. 306 RA---E---------------DLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAP 351 (468) Q Consensus 306 ~~---~---------------~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~ 351 (468) .. . ..+...|++..-= +|. +.-...-+-|++++.+ T Consensus 338 s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~-d~~----------~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 338 SKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRA-DGL----------LVDARGAKVLTVTPGA 390 (390) T ss_pred cceeEEeecceEEEeeccccccCCcEEEEEEEEe-CcE----------eechhheEEEEeecCC Confidence 00 0 0000111111000 010 0000111223333332 No 122 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=63.56 E-value=0.31 Score=23.33 Aligned_cols=278 Identities=14% Similarity=0.119 Sum_probs=124.8 Q ss_pred cCcccccCccccch---hhhhhHhhhhhhccccc-cchhhhcccchhhhhhccceeeeecccccccccc-ccccccccCc Q lcl|NC_020871. 32 ITPDTQTDAGALRR---EFLDDQISMLTWTENDL-TFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTR-EIGVAPVSDP 106 (468) Q Consensus 32 ~~p~~~~~gaALr~---esld~~i~~L~~~~~~f-~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~-E~g~~~~~d~ 106 (468) ++-|...+++++-. |.+|+.+....+..-.+ +|+.-...-.+-...-.|..+. +.|....++ +....+..|. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~---~~G~a~~~~~~~~dip~v~~ 77 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFD---GVGIAQIVADYTDDLPLVDA 77 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeee---ccCceeEeCCCccccceeec Confidence 66665666677766 45566665433322111 2332211112211122244443 344433333 3455678899 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhh---hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhc Q lcl|NC_020871. 107 NIRQKTVNMKFASDTKNISIAAGL---VNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI 183 (468) Q Consensus 107 ~~~r~~~~~k~l~~~~~vs~~~~l---v~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li 183 (468) ++.|++..+..++..+.++.. ++ ...-.+..+.....|.+.+.+.....+||||+.++ +-||.+-= T Consensus 78 ~~~~~~~~i~~~~~~~~~~~~-El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g----------~~GLlN~p 146 (296) T protein:vir:10 78 LATERQGKVFRFGNAFLISID-EIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHG----------IPSVFDYP 146 (296) T ss_pred cceeEEEEEEEEEeeeeecHH-HHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc----------ceeEeecC Confidence 999999999999999988743 33 22334666777788889999999999999997763 23443311 Q ss_pred CccceeeccCCC--CC--HHHHhhhh---hhhhhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeeeccce Q lcl|NC_020871. 184 NQDNVHDARGAS--LT--ESLLNQAA---VMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGF 256 (468) Q Consensus 184 ~~~nviDarG~~--ls--~~~l~~~a---~~i~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~ 256 (468) +- ...-+.|.- .+ .+.|+++- ...++++=.++.+.||+.....+. |.+ ++ +|..+-.+ T Consensus 147 ~v-~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~-------~~~--~~-----~~~t~l~~ 211 (296) T protein:vir:10 147 NI-NNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQ-------NLV--PG-----TSVSYGEF 211 (296) T ss_pred CC-ccccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHh-------hcc--CC-----CCccHHHH Confidence 10 112222211 11 33344333 334557778899999998887772 221 11 23333222 Q ss_pred eecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCccccccee--- Q lcl|NC_020871. 257 HSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVAT--- 333 (468) Q Consensus 257 ~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt--- 333 (468) +..... +-.|. .+| . .. .+++.|+. .. |+.. .+-+ -.+-++. T Consensus 212 ik~~~~----~l~i~-----------~~~---~---l~---~a~~~g~~--------~~-v~~~-~~~~-~~~~~v~~~~ 256 (296) T protein:vir:10 212 FRQNNS----GVTVE-----------FVQ---Y---LN---DYNGTGTS--------AA-IAYE-KDPN-NMAIEIPEAT 256 (296) T ss_pred HHHhcC----CceEE-----------Eee---e---ec---cCCCCcce--------EE-EEEE-cCCc-eEEEEcCcce Confidence 211000 00110 010 0 00 01111110 01 1111 1111 0110000 Q ss_pred eeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEecccccCCeeEEe Q lcl|NC_020871. 334 ATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKAENNVITFY 391 (468) Q Consensus 334 ~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~~s~~~~~t~tf~ 391 (468) .+.+.....-..++.+....+ .+.||| +..+.++.| +||. T Consensus 257 ~~~~~e~~~l~~~~~~~~~~~----Gv~i~~---P~ai~~~dG-----------I~~~ 296 (296) T protein:vir:10 257 NALPAQPKDLHFKIPVTSKAT----GLIVYR---PLTMAVMKG-----------ITFA 296 (296) T ss_pred eeecccccCceEEEeeEeeEE----EEEEEC---CceeEEEee-----------eecC Confidence 011111122222222222222 255555 233333333 4555 No 123 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=59.43 E-value=0.39 Score=22.81 Aligned_cols=274 Identities=10% Similarity=0.070 Sum_probs=120.7 Q ss_pred ccC--ccccch---hhhhhHhhhhhhccccc-cchhhhcccchhhhhhccceeeeecccccccccccc-ccccccCcceE Q lcl|NC_020871. 37 QTD--AGALRR---EFLDDQISMLTWTENDL-TFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREI-GVAPVSDPNIR 109 (468) Q Consensus 37 ~~~--gaALr~---esld~~i~~L~~~~~~f-~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~-g~~~~~d~~~~ 109 (468) +++ .+++-. |-+|+.+....+..-.+ .|+.-..+-.+-.....|...... |....++.. ...+..|.++. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~---G~~~~~~~~~~dip~~~~~~~ 77 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRS---GAAKIIANGADDLPLVDVDMV 77 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccc---eeEEEecCcccccccccccce Confidence 333 344444 44566665544433222 233333333343333445544433 333344443 33477899999 Q ss_pred EEEEEEEeeeehhhhhhhH--hhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCc-- Q lcl|NC_020871. 110 QKTVNMKFASDTKNISIAA--GLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ-- 185 (468) Q Consensus 110 r~~~~~k~l~~~~~vs~~~--~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~-- 185 (468) |++..+.-+...+.++..- .....=.+..+.....|.+.+.+.....+||||+.++ +-||.+-=+- T Consensus 78 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g----------~~GLlN~p~~~~ 147 (301) T protein:vir:80 78 RKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYA----------IKGAFEATGIQI 147 (301) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccccc----------ceeeecCCCccc Confidence 9999999999988877542 2233445666777888888999999999999998763 3455442211 Q ss_pred -cceeeccCCC-------CC--HHHHhhhhhhh---hhccCceEEEecCHHHHhhHHHhhcCCceEEeecCCCcceeeee Q lcl|NC_020871. 186 -DNVHDARGAS-------LT--ESLLNQAAVMI---SKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFN 252 (468) Q Consensus 186 -~nviDarG~~-------ls--~~~l~~~a~~i---~~~fG~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~ 252 (468) ...-+..|+. +. .+.|+++-..+ ++++=.+..+.||+.....+.. ...+++ .|.. T Consensus 148 ~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~-------~~~~~~-----~~~t 215 (301) T protein:vir:80 148 DVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINK-------KRYSNE-----DSRS 215 (301) T ss_pred ccccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhh-------ccccCC-----CCee Confidence 1111222222 11 24455544333 2344468899999999888832 221122 2333 Q ss_pred ccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccce Q lcl|NC_020871. 253 IQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVA 332 (468) Q Consensus 253 v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~v 332 (468) +-.++.-. ++. -.|. ..| ... + +++.|+ -.. |..++ +-+- .+-.+ T Consensus 216 vl~~l~~~-~~~---~~I~-----------~~p------~L~-~--~g~~g~--------~~~-v~~~~-~~d~-~~~~v 260 (301) T protein:vir:80 216 VLKVLQDN-AWF---SAIV-----------RVP------DLA-G--MGTAGS--------DSF-AVIHD-SNET-AELII 260 (301) T ss_pred HHHHHHHH-cCc---ceEE-----------Ecc------eec-c--CCCCcc--------cEE-EEEec-CCcE-EEEEe Confidence 33322110 000 0110 000 000 0 111121 111 11111 1110 00000 Q ss_pred ee---eeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEE Q lcl|NC_020871. 333 TA---TVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARV 378 (468) Q Consensus 333 t~---Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv 378 (468) .. +.++...+-...+....-.+ .+.||| +..+.+..| | T Consensus 261 ~~~~~~~~~e~~~~~~~~~~~~r~~----Gv~i~~---P~ai~~~~G-I 301 (301) T protein:vir:80 261 PMDITRHPEEYSFPRTKVPFEERTA----GVVVRF---PAAIVRVDG-I 301 (301) T ss_pred cCceeeecceecCceeEeeeeeeeE----EEEEEc---cceEEEEec-C Confidence 00 01111111111111111111 244554 223333333 1 No 124 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=48.87 E-value=0.66 Score=21.59 Aligned_cols=324 Identities=8% Similarity=0.018 Sum_probs=124.6 Q ss_pred CCCcccchhh------------------cccChhhH----------HH-HHHHHhhcccccCcccccCccccchhhhhhH Q lcl|NC_020871. 1 MPKNNKEEEV------------------KEVNLNSV----------QE-DALKSFTTGYGITPDTQTDAGALRREFLDDQ 51 (468) Q Consensus 1 ~~~~~~~~~~------------------~~~n~~~~----------~e-~~~Ksf~agy~~~p~~~~~gaALr~esld~~ 51 (468) .|....+++. ++.++.+. .. .+.+++.+|..++ ...+|+-+-++.+..+ T Consensus 280 ~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~--~~~~Gg~~vp~~~~~~ 357 (645) T protein:vir:93 280 APVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTD--PQWAGSLSEYQEYAQD 357 (645) T ss_pred cccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhcccccc--ccccCCccCchhhHHH Confidence 1111111111 11111110 11 1233444444333 3345666777777776 Q ss_pred hhhhhhccccccchhhhcccchhhhhh-ccceee-eeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHh Q lcl|NC_020871. 52 ISMLTWTENDLTFYKDIAKKPATSTVA-KYDVYM-QHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAG 129 (468) Q Consensus 52 i~~L~~~~~~f~~~~~i~k~~~~stv~-ey~~~~-~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~ 129 (468) |..+.... ..+..+..+....... .++... ..-+.+.+.+++|++..+.+++.+...+...|=|+.--.+|.-+= T Consensus 358 ii~~l~~~---svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell 434 (645) T protein:vir:93 358 FIDYLRPQ---TIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELI 434 (645) T ss_pred HHHhhhhh---hhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHH Confidence 65433221 2222222222221111 122211 111223467899999999999999999999999888777776542 Q ss_pred hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhh Q lcl|NC_020871. 130 LVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMIS 209 (468) Q Consensus 130 lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~ 209 (468) . ++.-|.+....++-...+++.++.++|.|+..-. .+..+.|+- .|.. .+.+.| ....++.+..+.... T Consensus 435 ~-ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~-~~~~p~gi~-~~~~-------~~~~~~-~~~~d~~~~~~~~~~ 503 (645) T protein:vir:93 435 R-FSSPAADALVRNALAEAVVARLDTDFVDPKKAAV-ADVSPASIT-HDVK-------GTASSG-NPDADAEAAFGQFVA 503 (645) T ss_pred h-hchHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc-CCcccccee-cccc-------cccccc-chHHHHHHHHHHHHh Confidence 2 3455677888889999999999999999885432 112223331 1111 111112 122344444344444 Q ss_pred hccCceEE-EecCHHHHhhHHHhhc-CCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCC Q lcl|NC_020871. 210 KGYGTPTD-AYMPVGVQADFVNQQL-SKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTA 287 (468) Q Consensus 210 ~~fG~~td-~~m~~~v~a~~~~~~~-~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~a 287 (468) .++...+- ..|++.+++.+...-- .++..+ + +. +..|. .|.|.-++..++. | T Consensus 504 a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~-~-~~-------------~~~~~-tL~G~PV~~s~~v--------p-- 557 (645) T protein:vir:93 504 ANLQPTGAVWLMSSTNALALSMRKNALGQKEY-P-DM-------------TLLGG-SFQGLPVIVSQYV--------G-- 557 (645) T ss_pred cCCCccccEEEEcHHHHHHHHhccccCCceee-c-CC-------------CCCCc-eeeceeeEEeccC--------C-- Confidence 44444444 4589999988833211 122222 1 10 00000 2223222222111 0 Q ss_pred CCCcceeEEecCCCCCCcCcccceeE--------EEEEEEEcccCCcccccceeeeeeccC-cceEEEEEeecCCCcccc Q lcl|NC_020871. 288 PQQAKVTATQEAGKKGQFRAEDLAAH--------EYKVVVSSDDAESIASEVATATVTAKD-DGVKLEIELAPMYSSRPQ 358 (468) Q Consensus 288 p~~~~vtat~~~~~~g~~~~~~~~~y--------~YkVtavn~~GES~aS~~vt~Tv~a~~-~g~~ltIT~~~~~ga~~~ 358 (468) ..+... ..+...-.+.+.. .|++.... ++-.... .....+.... +-+.+....- .+ T Consensus 558 ---~~~~~g----d~s~~~ig~~~~v~i~~s~~a~~~~~~~~-~~~~~~~-~~~~~v~lf~~d~vaira~~r--~d---- 622 (645) T protein:vir:93 558 ---DQLVLV----NAPDIYLADDGGVAVDMSREASLEMQSEP-TGDSTTP-SPVELVSMFQTGSVAIRAERW--IN---- 622 (645) T ss_pred ---cceeEe----ccccEEEEEecceEEEeecceeEEEeecc-ccccccc-ccccchhHhhcCceEEEEEEE--Ec---- Confidence 011100 0000000000001 11111000 0000000 0000000000 0011111100 00 Q ss_pred eEEEEeecCCCceeEEEEEEecccccCC Q lcl|NC_020871. 359 FVSIYRKGAETGLFYLIARVPASKAENN 386 (468) Q Consensus 359 ~y~IYR~~~~~G~f~~igrv~~s~~~~~ 386 (468) +.++|.. ..-.+.| |.-..+.++ T Consensus 623 -~~~~~p~---a~~~lt~-~~~g~~~~~ 645 (645) T protein:vir:93 623 -WRRRRTA---AVAVITG-VNYGSASGG 645 (645) T ss_pred -ceeeCcc---ceEEEec-ccCCcccCC Confidence 1111110 0011111 000000011 No 125 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=48.58 E-value=0.67 Score=21.55 Aligned_cols=294 Identities=11% Similarity=0.038 Sum_probs=108.8 Q ss_pred CCC-cccchhh-cccChhhHHHHH---HHHhhcccc-----cCcccccCccccchhhhhhHhhhhhhccccccchhhhcc Q lcl|NC_020871. 1 MPK-NNKEEEV-KEVNLNSVQEDA---LKSFTTGYG-----ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAK 70 (468) Q Consensus 1 ~~~-~~~~~~~-~~~n~~~~~e~~---~Ksf~agy~-----~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k 70 (468) -+. ..+..+. ...+....++.- .+...++.+ .+.....-+..+.+..+...+..+..... .+.+-+.. T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~--~i~~~~~~ 275 (517) T protein:vir:97 198 KILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEG--SLLPFIRH 275 (517) T ss_pred hhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhc--cceeeeee Confidence 000 0000000 000011111111 111111111 01011111223333333333322221111 11111111 Q ss_pred cchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh----HHHHHHHHHH Q lcl|NC_020871. 71 KPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQD----PMQILTDDAI 146 (468) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~D----p~~~~~~~ai 146 (468) ..+ .+.+....-......++.|+...+.+|..+..++..++-++.-..+|..+ +.++.-| .+....+.-. T Consensus 276 ~~i-----~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~ql-l~Ds~~dd~~~l~s~i~~~l~ 349 (517) T protein:vir:97 276 ENL-----PTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIV-MNSNATDIAGAILTYVMNRLP 349 (517) T ss_pred ccc-----cceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHH-HHHhhhccHHHHHHHHHHHHH Confidence 110 01111100011123467899999999999999999998888877777643 2233333 5666777788 Q ss_pred HHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHh Q lcl|NC_020871. 147 VNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQA 226 (468) Q Consensus 147 ~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a 226 (468) ..++...|.++..||-. |....|+..+........+.+..--.+++.....-..+.++ .-+.|++.+.+ T Consensus 350 ~~l~~~ee~a~l~GdGt---------g~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~a~~--a~~vmn~~t~~ 418 (517) T protein:vir:97 350 DMVIMAVNRAIIMGGVT---------GVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAAD--STLVIHRNDLA 418 (517) T ss_pred HHHHHHHHHHHhcccCC---------CcccccccccccccccccccccchHHHHHHHHHHHhhhccC--CEEEECHHHHH Confidence 88999999999999831 22333444443222222222222223344332222222222 23789999998 Q ss_pred hHHHhhc--CCceEEeecCCCc-ceeeeeccceeecCC-----ccccCCCEeecccccccccccccCCCCCCcceeEEec Q lcl|NC_020871. 227 DFVNQQL--SKQTQLVRDNGNN-VSVGFNIQGFHSARG-----FIKLHGSTVMENEQILDERILALPTAPQQAKVTATQE 298 (468) Q Consensus 227 ~~~~~~~--~~qr~v~~~n~~~-~~~G~~v~~~~s~~g-----~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~ 298 (468) .+ ...- ++++.+.+.-.+. ....+.+...+..-. ...+.|++|+..-..... T Consensus 419 ~I-~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~~~~~y~i~~~~g~~~~------------------- 478 (517) T protein:vir:97 419 AI-RFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFE------------------- 478 (517) T ss_pred HH-HHhhcCCCCeeccCcCCcccccccCCccccccccccCceeEeeccccEEEeecceeee------------------- Confidence 88 4443 4566653321111 111111111110000 001122222211100000 Q ss_pred CCCCCCcCcccceeEEEEE---EEEcccC-CcccccceeeeeeccC Q lcl|NC_020871. 299 AGKKGQFRAEDLAAHEYKV---VVSSDDA-ESIASEVATATVTAKD 340 (468) Q Consensus 299 ~~~~g~~~~~~~~~y~YkV---tavn~~G-ES~aS~~vt~Tv~a~~ 340 (468) -+..-.+ ....|.+ +..+-.. |..+= .+.++++.+ T Consensus 479 ~~fd~~~-----n~~~f~~~~~~~g~i~~~~r~a~--~~~~p~~~~ 517 (517) T protein:vir:97 479 QGTILVE-----NNKEYLFEMPISGSLEYKGTTAY--GTYTPPVAG 517 (517) T ss_pred eeeeccc-----CceeEeeeeeeccccccccceEE--EEEcCCCCC Confidence 0000000 0111211 1111122 22221 233333322 No 126 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=47.72 E-value=0.69 Score=21.46 Aligned_cols=301 Identities=13% Similarity=0.138 Sum_probs=130.4 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhccc-ccCcccccCccccch---hhhhhHhhhhhhccccc-cchhhhcccchhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGY-GITPDTQTDAGALRR---EFLDDQISMLTWTENDL-TFYKDIAKKPATS 75 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy-~~~p~~~~~gaALr~---esld~~i~~L~~~~~~f-~~~~~i~k~~~~s 75 (468) |-+.-+-++ -|....++-+.. ..+.+ ..+++++-- |-+|+++....+..-.. .|+.-..+-++-. T Consensus 6 ~~~~~~~d~---------~~~~~~a~~~~~~~~~~~-~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~ 75 (329) T protein:vir:79 6 MSKEMKYDE---------FEANVIANHMQLRGAKND-ASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTD 75 (329) T ss_pred hhhhhccch---------hhhhhHhhhcccccceec-cchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCce Confidence 322222111 122333343432 22233 333444544 44677776644433211 3443333333333 Q ss_pred hhhccceeeeeccccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020871. 76 TVAKYDVYMQHGKVGHTRFTRE-IGVAPVSDPNIRQKTVNMKFASDTKNISIAAGL--VNNIQDPMQILTDDAIVNIAKT 152 (468) Q Consensus 76 tv~ey~~~~~hG~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--v~~~~Dp~~~~~~~ai~~~~~~ 152 (468) ....|..+... |....++. ....+..|.++.|++..+.-++..+.++..--. ...=.+..+.....|.+.+.+. T Consensus 76 ~~~t~~~~~~~---G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~ 152 (329) T protein:vir:79 76 KTFEYQTFDKV---GHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQL 152 (329) T ss_pred eEEEeeeeecc---eeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHh Confidence 33445555543 44344443 445678899999999999999999888754222 2222355577777888888889 Q ss_pred HHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCC-------CC--HHHHhhhh-hhhh--hccCceEEEec Q lcl|NC_020871. 153 IEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGAS-------LT--ESLLNQAA-VMIS--KGYGTPTDAYM 220 (468) Q Consensus 153 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~-------ls--~~~l~~~a-~~i~--~~fG~~td~~m 220 (468) ....+||||+.++ +-||.+-=+-..+.-..+.. +. .+.|+++- .+.. ++.-.++.+.| T Consensus 153 ~n~i~f~G~~~~g----------~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~L 222 (329) T protein:vir:79 153 VNHLVFKGSKPHK----------IISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILI 222 (329) T ss_pred hccEEEeeccccc----------ceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEe Confidence 9999999997653 34444321111111112211 11 23455443 2222 34455788999 Q ss_pred CHHHHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCC Q lcl|NC_020871. 221 PVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAG 300 (468) Q Consensus 221 ~~~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~ 300 (468) |+.....+. + +. + ..|..+-.++-... +..-+.. +| .... ++ T Consensus 223 pp~~~~~L~-----~-~~---~-----~~~~tvl~~lk~~~-----~~l~I~~-------------~~---el~~---ag 264 (329) T protein:vir:79 223 PPSMRKVLM-----V-RM---P-----ETTMSYLDYFKQQN-----GGITIES-------------IS---ELED---ID 264 (329) T ss_pred cHHHHHHhh-----c-cc---C-----CCCccHHHHHHHhC-----CCcEEEE-------------cc---cccc---cC Confidence 998766551 1 11 1 12444443332111 0000000 00 0100 11 Q ss_pred CCCCcCcccceeEEEEEEEEc-ccC-Cc-ccccceeeeeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEE Q lcl|NC_020871. 301 KKGQFRAEDLAAHEYKVVVSS-DDA-ES-IASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIAR 377 (468) Q Consensus 301 ~~g~~~~~~~~~y~YkVtavn-~~G-ES-~aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igr 377 (468) +.|+ -.. |..++ .+. +- .|..-. +.+.....-..++....-.+ .+.||| +..+.+..|= T Consensus 265 ~~g~--------~~~-v~y~~~~~~~~~~vp~~~~--~l~~q~~~~~~~v~~~~r~~----Gv~i~~---P~ai~~~dGI 326 (329) T protein:vir:79 265 GAGT--------KAA-LVYEKDPMNMSIEIPEAFN--MLTAQPKDLHFKVPCTSKCT----GLTIYR---PLTLVLIKGL 326 (329) T ss_pred CCCc--------eEE-EEEecCCceEEEecCccee--eeeceecCceEEEceeeeEE----EEEEEC---cceeeeeeee Confidence 1121 111 11111 111 10 011111 11111122222222222222 256666 4455555552 Q ss_pred Eecc Q lcl|NC_020871. 378 VPAS 381 (468) Q Consensus 378 v~~s 381 (468) | +. T Consensus 327 ~-~~ 329 (329) T protein:vir:79 327 V-VG 329 (329) T ss_pred e-eC Confidence 2 22 No 127 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=45.64 E-value=0.76 Score=21.23 Aligned_cols=287 Identities=11% Similarity=0.063 Sum_probs=125.4 Q ss_pred CCCcccchhhcccChhhHHHHHHHHh--------------------hcc----cccCcccccCccccchhhhhhHhhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSF--------------------TTG----YGITPDTQTDAGALRREFLDDQISMLT 56 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf--------------------~ag----y~~~p~~~~~gaALr~esld~~i~~L~ 56 (468) .++..++.+... .-+.+++++ ... -..+.-+..+|+.|.++.+...|..+. T Consensus 78 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~ 152 (394) T protein:vir:97 78 KEVTQEEKTYRE-----SVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREV 152 (394) T ss_pred cccchhhHHHHH-----HHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHh Confidence 111111110000 000111110 000 011223456688999999988886655 Q ss_pred hccccccchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_020871. 57 WTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQ 135 (468) Q Consensus 57 ~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 135 (468) .... .+.+.+...+..+.-.+|.... . +.+...+++|++... .+++.+...+...+-++.-..+|.-+ +.++.. T Consensus 153 ~~~~--~l~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~el-l~ds~~ 227 (394) T protein:vir:97 153 KTVV--DLKPFTTVYQAKKASGKYPVLQ-R-ATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADV 227 (394) T ss_pred hhhh--hhhhhceeeeccCcceEEEEEe-c-CCCccceecccccccccccccceeEEeehhheeeehhhHHHH-HhhhhH Confidence 5443 3444444444444434555443 2 223455799998765 67899999999999888777777642 234556 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCce Q lcl|NC_020871. 136 DPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTP 215 (468) Q Consensus 136 Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~ 215 (468) |.+....+.-...+..+++.++..|..... +....-+|+|..+++.. ...+++ T Consensus 228 ~~~~~i~~~la~~~~~~~~~~i~~g~~~~~----~~~~~~~~~~~~~~~~~---------------------~~~~~~-- 280 (394) T protein:vir:97 228 DLVGIVSESISQIKVNTTNDAIAKVLKSFT----TKTVKNLDEIKALLNGG---------------------FDPAYN-- 280 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccc----ccccccHHHHHHHHHhh---------------------hhhhhC-- Confidence 777778888888888999999999876542 22233455555444211 111111 Q ss_pred EEEecCHHHHhhHHHhhcCCc-eEEeecCCCc----ceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCC Q lcl|NC_020871. 216 TDAYMPVGVQADFVNQQLSKQ-TQLVRDNGNN----VSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQ 290 (468) Q Consensus 216 td~~m~~~v~a~~~~~~~~~q-r~v~~~n~~~----~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~ 290 (468) ..+.|++.+.+.+.. .-+.+ |.+.+++... .-.|.+|- ++..-... .+..++.+..- ......... T Consensus 281 a~~v~n~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~l~G~pv~--~~~~~~~~-~~~~~~gd~~~---~~~~~~~~~-- 351 (394) T protein:vir:97 281 VSLIVSQSFYQTLDT-LKDGNGRYLLQDDITAVSGKVLLGKPVF--VLSDEVLG-ANKAFIGDFKR---GVLFADRKD-- 351 (394) T ss_pred CEEEEcHHHHHHHHH-hhccCCCeeeecCcCCCCCceeccceeE--EecccccC-CccEEEeeccc---cEEEEEecc-- Confidence 247899999988844 33332 3333333211 12343331 10000000 00011111100 000000000 Q ss_pred cceeEEecCCCCCCcCcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEEEeecCCC Q lcl|NC_020871. 291 AKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYS 354 (468) Q Consensus 291 ~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltIT~~~~~g 354 (468) ++-..... ..|. + .+++..- +.+.+.-...-+.|++|..+.+= T Consensus 352 --~~~~~~~~--~~~~-----~-~~~~~~r-----------~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 352 --LGLRWADN--EIYG-----Q-YLQAVLR-----------FGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred --eEEEEecc--cccc-----e-eEEEEEE-----------EccEEecccceEEEEecccccCC Confidence 00000000 0000 0 1111110 01111111122233333222111 No 128 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=43.72 E-value=0.83 Score=21.02 Aligned_cols=266 Identities=11% Similarity=0.009 Sum_probs=114.4 Q ss_pred ccCccccchh--hhhhHhhhhhhccccccchhhhc---ccchhhhhhccceeeeeccccccccccccccccccCcceEEE Q lcl|NC_020871. 37 QTDAGALRRE--FLDDQISMLTWTENDLTFYKDIA---KKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQK 111 (468) Q Consensus 37 ~~~gaALr~e--sld~~i~~L~~~~~~f~~~~~i~---k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~ 111 (468) +++.|=|-+| .+|+++...-+ .+++.-+.|+ .-++..+.-.|..+..+|+.-++-.-...++-+.-|.++.++ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~--~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~ 78 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKY--PEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPT 78 (304) T ss_pred CchHHHHHHHHHHHhhhhhcccc--ccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeeccccee Confidence 5555544442 23333332111 2233333333 222222233344555555553221234556778899999999 Q ss_pred EEEEEeeeehhhhhhhHhhh---cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccce Q lcl|NC_020871. 112 TVNMKFASDTKNISIAAGLV---NNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNV 188 (468) Q Consensus 112 ~~~~k~l~~~~~vs~~~~lv---~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nv 188 (468) +..|.-.+.++.+|+. ++. ..=.+..+..-+.|.+.+-+.+-.-.||||.... .+=||.+-=+- .+ T Consensus 79 ~~~i~~~~~~~~y~~~-El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~---------g~~GllN~p~v-~~ 147 (304) T protein:vir:52 79 RSYIVPWAKSVTWTKP-ELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDS---------RLTGLLNNKSV-EV 147 (304) T ss_pred EEEEEEEeeeeeecHH-HHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeecccc---------ceEEEEeCCCc-ce Confidence 9999999999999743 442 1122555677777778888899999999986531 12345432111 11 Q ss_pred ee----ccCCCC---C-H---HHHhhhhhhh---hhccCceEEEecCHHHHhhHHHhhcC--C---ceEEeecCCCccee Q lcl|NC_020871. 189 HD----ARGASL---T-E---SLLNQAAVMI---SKGYGTPTDAYMPVGVQADFVNQQLS--K---QTQLVRDNGNNVSV 249 (468) Q Consensus 189 iD----arG~~l---s-~---~~l~~~a~~i---~~~fG~~td~~m~~~v~a~~~~~~~~--~---qr~v~~~n~~~~~~ 249 (468) +. .-|... + + +.|+.+-.-+ +.+.-.++.+.||+.....+..-... . -.+++.++. ... T Consensus 148 ~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~--~~~ 225 (304) T protein:vir:52 148 YAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLS--AAA 225 (304) T ss_pred eeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcc--ccc Confidence 22 222211 1 2 3344444333 33345788999999988777321000 0 000111110 001 Q ss_pred ee--eccc---eeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCCC---CcCcccce-e--EEEEEE Q lcl|NC_020871. 250 GF--NIQG---FHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKKG---QFRAEDLA-A--HEYKVV 318 (468) Q Consensus 250 G~--~v~~---~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~g---~~~~~~~~-~--y~YkVt 318 (468) |- +|.. .....|.-.-...++|+++.--..-..++| -...+...-+... .+.....| . |-+.++ T Consensus 226 g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p-----~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~ 300 (304) T protein:vir:52 226 GRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMS-----PTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSAL 300 (304) T ss_pred CCcceEEEecccccccCCCCceEEEEEecChhheEEecCcc-----ccccchhhcCCceEEecceeeeeeEEEEccceee Confidence 11 1111 111112111122355655443222222222 1111110000000 00000001 1 222222 Q ss_pred EEcc Q lcl|NC_020871. 319 VSSD 322 (468) Q Consensus 319 avn~ 322 (468) -+|- T Consensus 301 y~D~ 304 (304) T protein:vir:52 301 YVDY 304 (304) T ss_pred eecC Confidence 2222 No 129 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=43.65 E-value=0.84 Score=21.01 Aligned_cols=296 Identities=9% Similarity=0.064 Sum_probs=123.0 Q ss_pred CCCcccchhhcccChhhH---HHHHHHHhhcccc--------------cCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSV---QEDALKSFTTGYG--------------ITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~---~e~~~Ksf~agy~--------------~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) +++.++.+. ++...+.. ..++.+++..+.. .+..+-++|+.|.++.+..+|..+...... T Consensus 71 ~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~-- 147 (387) T protein:vir:93 71 VKDTGEAYQ-SLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQ-- 147 (387) T ss_pred hhhccccCC-CcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhch-- Confidence 222221111 11111111 2233444432211 112244567889999998888665554432 Q ss_pred chhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) +.+.+...++.+ + ++.+... ..+...+++|++..+.+++.+......++=++.-..+|.- -|.++..|.+....+ T Consensus 148 l~~~~~v~~~~~-~-~~p~~~~--~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e-ll~Ds~~~l~~~i~~ 222 (387) T protein:vir:93 148 LREKARLTNIKG-L-EIPRVSY--TLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVEN 222 (387) T ss_pred hhhheeeeecCC-c-eEEEEee--cCCccccccCcccccccccccceeeeeheeeeeechhhHH-HHhhhHHHHHHHHHH Confidence 333333333322 2 2322221 1223558999999999999999988888877765556633 123466777777777 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .-..++.+..+..+|-+..+. | +..|+.. ++ .+-...|..+-.++++--. -+..+|-...-.+|+.. T Consensus 223 ~la~~~~~~e~~~~~~~g~g~--------g-~p~g~l~--~~-~~~~v~~~~~~d~i~~~~~-~l~~~~~~~a~~~mn~~ 289 (387) T protein:vir:93 223 ALQSGLAAKERKDALAVSPKS--------G-LDHMSFY--NG-SVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYA 289 (387) T ss_pred HHHHHHHHHHHHhHhhcCCCc--------c-ccceeee--cc-ccccccccchHHHHHHHHh-ccChhhhcCCEEEEech Confidence 777777776555555433221 1 2223221 00 0111122222233333222 23334444345678877 Q ss_pred HHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeeccccc--ccccc-cccC-CCCCCcceeEEecC Q lcl|NC_020871. 224 VQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQI--LDERI-LALP-TAPQQAKVTATQEA 299 (468) Q Consensus 224 v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~--l~~~~-~~~p-~ap~~~~vtat~~~ 299 (468) +...+....-+.++-++...+.. -.|.+| +++.... ..++.+..- +.... ...+ +-.....+.-.... T Consensus 290 t~~~~~~~~~d~~~~~~~~~~~~-llG~PV--~~~~~~~-----~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (387) T protein:vir:93 290 DYVKIISVLSNGTTNFFDTPAEK-VFGKPV--VFTDAAV-----KPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTA 361 (387) T ss_pred HHHHHHHHHhcCCCcccccCCcc-ccccce--EEecCCC-----ceeeeehhhhheehhhheeeecccccCCceeEEEEe Confidence 76655444444444444333322 245555 2222211 122222111 00000 0000 00000000000000 Q ss_pred CCCCCcCcccceeEEEEEEEEcccCCcccc Q lcl|NC_020871. 300 GKKGQFRAEDLAAHEYKVVVSSDDAESIAS 329 (468) Q Consensus 300 ~~~g~~~~~~~~~y~YkVtavn~~GES~aS 329 (468) --+++ .-+ .-..++.-+-...-|.|| T Consensus 362 r~d~~--v~~--~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 362 WYDQQ--RTL--DSAFRIAKAKENTGSLPS 387 (387) T ss_pred eeCce--eec--hhheEEEEeecCCCCCCC Confidence 00000 000 001111222223345566 No 130 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=43.21 E-value=0.85 Score=20.96 Aligned_cols=296 Identities=9% Similarity=0.059 Sum_probs=123.7 Q ss_pred CCCcccchhhccc---ChhhHHHHHHHHhhccc--------------ccCcccccCccccchhhhhhHhhhhhhcccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEV---NLNSVQEDALKSFTTGY--------------GITPDTQTDAGALRREFLDDQISMLTWTENDLT 63 (468) Q Consensus 1 ~~~~~~~~~~~~~---n~~~~~e~~~Ksf~agy--------------~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~ 63 (468) ++..++. ..++. +....+.++.|++..+- ...-.+-++|+.|-++.+..+|..+..... . T Consensus 86 ~~~~~~~-~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~--~ 162 (402) T protein:vir:93 86 VKDKGEA-YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKN--Q 162 (402) T ss_pred hhhcccc-CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhh--h Confidence 2211111 11111 12222334444432210 011123345788999999888865544433 2 Q ss_pred chhhhcccchhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_020871. 64 FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) Q Consensus 64 ~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~ 143 (468) +.+.+...++.+ + .+.++. .+ .+...+++|++..+..++++......++=++.-..+|.- -|.++..|.+....+ T Consensus 163 l~~~~~v~~~~~-~-~~p~~~-~~-~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~e-ll~Ds~~~l~~~i~~ 237 (402) T protein:vir:93 163 LREKARLTNIKG-L-EIPRVS-YT-LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVEN 237 (402) T ss_pred hhhhceeeecCC-c-eeeeee-cc-CCccccccccccccccccccceeeecceeeeeechhhHH-HHhhhHHHHHHHHHH Confidence 333333333332 1 233322 22 123568999999999999999988888888766556643 123456666666666 Q ss_pred HHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHH Q lcl|NC_020871. 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) Q Consensus 144 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~ 223 (468) .-...++...+..+|-+..+. | +..|+.. ++ .+--..|..+-.++++.-. -+..+|-.-.-.+|+.. T Consensus 238 ~la~~~~~~e~~~~~~~g~g~--------g-~p~g~~~--~~-~~~~~~~~~~~d~l~~~~~-~l~~~y~~na~~imn~~ 304 (402) T protein:vir:93 238 ALQSGLAAKERKDALAVSPKS--------G-LEHMSFY--NG-SVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYA 304 (402) T ss_pred HHHHHHHHHHHHhHhhcCCCc--------c-ccceeee--cc-ccccccccchHHHHHHHHh-ccChhhhcCCEEEEech Confidence 666666665444455432221 1 1223221 00 0111112222233333222 23334443334678877 Q ss_pred HHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccc--cccccccccCC--CCCCcceeEEecC Q lcl|NC_020871. 224 VQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQ--ILDERILALPT--APQQAKVTATQEA 299 (468) Q Consensus 224 v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n--~l~~~~~~~p~--ap~~~~vtat~~~ 299 (468) +...+....-+..+-++...+.. -.|.+| +++... +..++.+.. |........-. -+....+.--... T Consensus 305 t~~~~~~~~~d~~~~~~~~~~~~-llG~PV--~~t~~~-----~~i~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (402) T protein:vir:93 305 DYVKIISVLSNGTTNFFDTPAEK-VFGKPV--VFTDAA-----VKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTA 376 (402) T ss_pred HHHHHHHHHhcCCCcccccCCcc-ccccce--EEecCC-----CceeeechhhhhhhhhhhhhhhhhcccCCceEEEEEE Confidence 76665444444444444333222 256655 223221 122333222 22211111100 0000001000000 Q ss_pred CCCCCcCcccceeEEEEEEEEcccCCcccc Q lcl|NC_020871. 300 GKKGQFRAEDLAAHEYKVVVSSDDAESIAS 329 (468) Q Consensus 300 ~~~g~~~~~~~~~y~YkVtavn~~GES~aS 329 (468) =-+|+- -+... .++.-+-..+.|.|| T Consensus 377 r~Dg~v--~~~~A--~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 377 WYDQQR--TLDSA--FRIAKAKENTGPLPS 402 (402) T ss_pred EeCcEE--echhh--eEEEEeecCCCCCCC Confidence 001110 00011 122223334556677 No 131 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=40.24 E-value=0.98 Score=20.63 Aligned_cols=292 Identities=15% Similarity=0.099 Sum_probs=128.1 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccc-hhhhhhc Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKP-ATSTVAK 79 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~-~~stv~e 79 (468) |--.+-.- -.+ -..+.|++++ ++ .+|+.|.+|.+++-|..+... . .|.+.+.-.+ ..+--.+ T Consensus 1 ~~~~~~~~---~~~----~~~~~k~~t~-----~d--~~Gg~l~P~~~~~~i~~~~e~-s--~~l~~~~vi~~~~~~~~~ 63 (315) T protein:vir:41 1 MLTIEDIR---GGK----PFEIVPKIDV-----PD--LGRGVLSVDRFGEFVKAVRDS-A--VIIPEARIDNALKSYEKD 63 (315) T ss_pred Ccccchhh---cCC----hhhhhhhcCC-----cC--CCCceechHHHHHHHHHHHhh-h--hhhhhceeeecccccccc Confidence 11110000 011 1234566542 22 268889999998866554443 2 3444332211 1111111 Q ss_pred cceeeeeccc-c-ccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 80 YDVYMQHGKV-G-HTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNI--QDPMQILTDDAIVNIAKTIEW 155 (468) Q Consensus 80 y~~~~~hG~~-g-~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~--~Dp~~~~~~~ai~~~~~~~e~ 155 (468) ... ...|.. . ...-.+|.+.+..++|.+.+....++-+..--.+|.-+ |.++. .|.+......--.+++...|. T Consensus 64 i~~-~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~el-L~D~~~~~~~e~~l~~~~a~~~a~~~~~ 141 (315) T protein:vir:41 64 ISR-LSLVLDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDA-IEDNIEGKAFEQKIVTLLGEGISYVLEK 141 (315) T ss_pred ccc-cccCcccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHH-HHhhhccccHHHHHHHHHHHHHHHHHHH Confidence 111 111110 0 01234666777778899999998888877654443322 22343 378888888888999999999 Q ss_pred HHhhcccccccCCCCCCCccccchhhhcCcc---ceeeccCCCCCHHHHhhhhhhhhhcc-Cce--EEEecCHHHHhhHH Q lcl|NC_020871. 156 ASFFGDSDLSDSPEPQAGLEFDGLAKLINQD---NVHDARGASLTESLLNQAAVMISKGY-GTP--TDAYMPVGVQADFV 229 (468) Q Consensus 156 a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~---nviDarG~~ls~~~l~~~a~~i~~~f-G~~--td~~m~~~v~a~~~ 229 (468) +.|.||..-. ++ .--+.||+.+.+... ...|.....++.+.|..+..-+-..| -.. --++|+..+.+.+. T Consensus 142 ~~~nGdg~s~---~p-~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~r 217 (315) T protein:vir:41 142 YYLHGDTSSS---DP-LLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYR 217 (315) T ss_pred HhhccCCcCc---Cc-cccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHH Confidence 9999997522 11 012679998876532 34555555567776665443333333 222 24789999998884 Q ss_pred HhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCccee-EEe--------cCC Q lcl|NC_020871. 230 NQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVT-ATQ--------EAG 300 (468) Q Consensus 230 ~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vt-at~--------~~~ 300 (468) ...-...+-+-++. +...+...|.|.-+...+ .+|....+.... -+- .-+ T Consensus 218 klk~~~g~~lw~~~-------------~~~g~~~tl~G~PV~~~~--------~m~~~~~~~~~ilf~d~~nl~~~~~~~ 276 (315) T protein:vir:41 218 DALKGRETGLGDQA-------------LTGANSILYDGRPVQYVP--------ALEALNDGKSRALFVVPTQLVYGFWRN 276 (315) T ss_pred HHhccCCCccccch-------------hhcCCCceecccceEecc--------cccccCCCCccEEEecccceEEEeccc Confidence 43322222111110 011111122222221111 111111110000 000 000 Q ss_pred CC-CCcCcccceeEEEEEEEE-c-ccCCcccccceeeee Q lcl|NC_020871. 301 KK-GQFRAEDLAAHEYKVVVS-S-DDAESIASEVATATV 336 (468) Q Consensus 301 ~~-g~~~~~~~~~y~YkVtav-n-~~GES~aS~~vt~Tv 336 (468) -. -.+.......+.|..+.- + ..+.+-+..+...+| T Consensus 277 i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 277 IKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred cEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 00 000000011233333211 1 112222221111122 No 132 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=38.15 E-value=1.1 Score=20.40 Aligned_cols=286 Identities=14% Similarity=0.119 Sum_probs=113.5 Q ss_pred CCCcccchhhccc------------------ChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccc Q lcl|NC_020871. 1 MPKNNKEEEVKEV------------------NLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDL 62 (468) Q Consensus 1 ~~~~~~~~~~~~~------------------n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f 62 (468) .....+.....+. ......+.++++... -.....+..+++.+-++.+...|..+. +.. T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vp~~~~~~i~~~~---~~~ 159 (397) T protein:vir:96 84 LAKAADPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGA-EKRDGFTSVEGGALIPQELLQPQLEPK---DIV 159 (397) T ss_pred HHhhhhhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhh-hhhhcccccccccchhHHHHHHHHHhh---hhh Confidence 0000000000000 001111111111110 011222344566777777777776542 222 Q ss_pred cchhhhcccchhhhhhccceeeeecccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHH Q lcl|NC_020871. 63 TFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQIL 141 (468) Q Consensus 63 ~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~ 141 (468) .+.+.+...++.+.-.+|.....++ +...++.|++... .+++.+.+....++=++.--.+|.-+ +.++..|.+... T Consensus 160 ~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~el-l~ds~~~l~~~i 236 (397) T protein:vir:96 160 DLSKYVRSVPVNSASGKFPVISKSG--SKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEM-IDDASYDVTGLI 236 (397) T ss_pred hHHHhhhhccccccceeEEEEeccC--CccccccccccccccccccccceeecHhHhhcchhhHHHH-HhhhHHHHHHHH Confidence 3444444444444444454444332 3455788888665 78999999999988777655555532 234556667777 Q ss_pred HHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecC Q lcl|NC_020871. 142 TDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMP 221 (468) Q Consensus 142 ~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~ 221 (468) .+.--..+..+++.+++.|+..-... ..+-+|+|.++|. .....+++ + -.+|| T Consensus 237 ~~~l~~~~~~~~~~~i~~g~g~~~~~----~~~~~d~~~~~~~---------------------~~~~~~~~-a-~~v~n 289 (397) T protein:vir:96 237 ADEIQDQSLNTKNADIAAVLKTATAK----SVVGVDGLKDLIN---------------------KEIKKVYD-V-KLFIS 289 (397) T ss_pred HHHHHHHHHHHHHHHHhhcccccccc----cccchHHHHHHHH---------------------HhhhhhcC-c-EEEEc Confidence 77777888889999999887553211 1223444433321 11222232 2 38999 Q ss_pred HHHHhhHHHhhcCCc-eEEeecCCCc----ceeeeeccceeecCCcccc-CCC--EeecccccccccccccCCCCCCcce Q lcl|NC_020871. 222 VGVQADFVNQQLSKQ-TQLVRDNGNN----VSVGFNIQGFHSARGFIKL-HGS--TVMENEQILDERILALPTAPQQAKV 293 (468) Q Consensus 222 ~~v~a~~~~~~~~~q-r~v~~~n~~~----~~~G~~v~~~~s~~g~i~l-~gs--~i~~~~n~l~~~~~~~p~ap~~~~v 293 (468) +.+.+.+.. .-+.+ |.+.+++... .-.|.+|-- +....... .|. .++.+..- ...... -..+ T Consensus 290 ~~~~~~l~~-lkd~~G~~~~~~~~~~~~~~~l~G~pv~~--~~~~~~~~~~~~~~~~~gd~~~---~~~~~~----~~~~ 359 (397) T protein:vir:96 290 ASMYSELDK-LKDKNGRYLLQDSITAASGKQLLGKEVVV--LDDDVIGKSVGNVVGFIGDAKA---FASFFD----RKQV 359 (397) T ss_pred HHHHHHHHH-hhccCCCeEeccCccCCCcccccccceEE--ecccccCCCCCceEEEEeehhc---ceEeEe----ecce Confidence 999998844 32332 3333332211 112332210 00000000 000 11111000 000000 0000 Q ss_pred eEEecCCCCCCcCcccceeEEEEEEEEcccC-CcccccceeeeeeccCcceEEEEEee Q lcl|NC_020871. 294 TATQEAGKKGQFRAEDLAAHEYKVVVSSDDA-ESIASEVATATVTAKDDGVKLEIELA 350 (468) Q Consensus 294 tat~~~~~~g~~~~~~~~~y~YkVtavn~~G-ES~aS~~vt~Tv~a~~~g~~ltIT~~ 350 (468) +-...... .| ...+++.. --+| =-.|...+. |++|.. T Consensus 360 ~~~~~~~~--~~------~~~~~~~~-r~d~~~~~~~a~~~-----------~~~~~a 397 (397) T protein:vir:96 360 SVSWVDNN--IY------GQLLAGII-RYDVKATDKKAGFY-----------VTFTIG 397 (397) T ss_pred EEEEeccc--cc------ceeEEEEE-EEccEEecccceEE-----------EEeecC Confidence 00000000 00 00111111 0011 001111111 222211 No 133 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=35.91 E-value=1.2 Score=20.14 Aligned_cols=261 Identities=13% Similarity=0.125 Sum_probs=119.5 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccch------hhhcccchh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFY------KDIAKKPAT 74 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~------~~i~k~~~~ 74 (468) |+..-- .=+.-+.+|.+.+.+. ..-.+...|. .++..++ - T Consensus 1 ma~~~T-------------------------------~~~~~iiPev~~~~v~--~~~~~~~~~~~~~~~~~~l~g~~-G 46 (274) T protein:vir:93 1 MPQGIT-------------------------------KTSNQIIPEVLAPMMQ--AQLEKKLRFASFAEVDSTLQGQP-G 46 (274) T ss_pred CCccce-------------------------------ehhheechHHHHHHHH--HHHHhhhhhcccccccccccCCC-C Confidence 332110 0011233333333331 1111111111 1111111 2 Q ss_pred hhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020871. 75 STVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIE 154 (468) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e 154 (468) +|| .+-.|+..|....+.|++.....+.+......+++..+-.+.+++...++ +..||+....+.....++..++ T Consensus 47 ~tv----~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~~~~~a~~~d 121 (274) T protein:vir:93 47 DTL----TFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAHANKVD 121 (274) T ss_pred CEE----EEEeeccCCCcccccCCCcccccccccceeEEEeeeecccccccHHHHHh-hccchHHHHHHHHHHHHHHHHH Confidence 233 12234445666677899999999999999999999999899999986655 4688998888888888888888 Q ss_pred HHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhhcC Q lcl|NC_020871. 155 WASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS 234 (468) Q Consensus 155 ~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~~~ 234 (468) ..++= .++... + +..+..++.+.|..|.......-...+-++||+.+.+.|...-. T Consensus 122 ~~~~~---~~~~a~-----~---------------~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~- 177 (274) T protein:vir:93 122 NDVLE---ALMGAK-----L---------------TVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAS- 177 (274) T ss_pred HHHHH---HHhccc-----c---------------cccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhh- Confidence 77662 222111 1 11223345566666665555555567789999999998853211 Q ss_pred CceEEeecCCCcceeeeeccceeecCCcc-ccCCCEeecccccccccccccCCCCCCcceeEEe-----cCCC-CCCcCc Q lcl|NC_020871. 235 KQTQLVRDNGNNVSVGFNIQGFHSARGFI-KLHGSTVMENEQILDERILALPTAPQQAKVTATQ-----EAGK-KGQFRA 307 (468) Q Consensus 235 ~qr~v~~~n~~~~~~G~~v~~~~s~~g~i-~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~-----~~~~-~g~~~~ 307 (468) .+++-.++. |..+- ..|.| .+.|-.|+.+++.-.-... . ....+-..... .+.- ..++.. T Consensus 178 -~~f~~~s~~-----g~~~~----~~G~ig~~~G~~Vi~s~~~p~~t~~-l--~~~gai~~~~~~~~~vE~~Rd~~~~~d 244 (274) T protein:vir:93 178 -TNFTRATEL-----GDDII----VKGAFGEALGAIIVRTNKLEAGTAI-L--AKKGAVKLILKRDFFLEVARDASTKTT 244 (274) T ss_pred -hcccccccc-----cccce----eecccceecCeeEEEcCCCCcceEE-E--EeCCeEEEEecCCcccccccchhhccc Confidence 233322221 11110 01111 2345555554442000000 0 00000000000 0000 001111 Q ss_pred ccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEE Q lcl|NC_020871. 308 EDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEI 347 (468) Q Consensus 308 ~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltI 347 (468) .-.+.+.|.+..++..+ ++..|.+ +.+++. T Consensus 245 ~i~~~~~y~~~~~~~~~------~v~~t~~----~~s~~~ 274 (274) T protein:vir:93 245 ALYSDKHYVAYLYDESK------AVKITKG----SGSLEM 274 (274) T ss_pred EEEEEEEEEEEEEcCCc------eEEEeeC----ccccCC Confidence 11133444444444322 2222322 111221 No 134 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=33.42 E-value=1.4 Score=19.86 Aligned_cols=259 Identities=13% Similarity=0.128 Sum_probs=116.3 Q ss_pred CCC--cccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccch------hhhcccc Q lcl|NC_020871. 1 MPK--NNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFY------KDIAKKP 72 (468) Q Consensus 1 ~~~--~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~------~~i~k~~ 72 (468) |+. +...+.. .+|-+.+.+ +....+.+.|- .++.-++ T Consensus 1 ma~~~T~~~d~i---------------------------------iPev~~~~v--~~~~~~~l~~~~~~~~d~~l~g~~ 45 (274) T protein:vir:94 1 MPQGLTKTSDQI---------------------------------IPEVLAPMM--QAQLEKKLRFASFAEVDSTLQGQP 45 (274) T ss_pred CCccceehhhee---------------------------------chHHHHHHH--HHhhhhhhhhcccceecccccCCC Confidence 443 1222223 333333332 11111111111 1111111 Q ss_pred hhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020871. 73 ATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKT 152 (468) Q Consensus 73 ~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~ 152 (468) =.||+ +-.|+..|...-+.|+........+......+++..+-.+.+++...++ +..||+....+..-..++.. T Consensus 46 -G~tv~----iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~-~~~dp~~~~~~~~a~a~a~~ 119 (274) T protein:vir:94 46 -GDTLT----FPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred -CCEEE----EeeecCCCccccccCCCcccccccccceeEEEeeeecceecccHHHHHh-ccchHHHHHHHHHHHHHHHH Confidence 11221 1123334444445777777888888888899999988899999987655 56889877777777777777 Q ss_pred HHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 153 IEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 153 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) ++..++ .-++.+ .+ +..+..++.+.|..|....+..-...+-++||+.+.+.|...- T Consensus 120 vd~~~~---~~l~~a-----~~---------------~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~ 176 (274) T protein:vir:94 120 VDNDVL---EALMGA-----KL---------------TVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDA 176 (274) T ss_pred HHHHHH---HHHhcc-----Cc---------------cccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhh Confidence 776654 122211 11 1123345566666666665555556677999999999885422 Q ss_pred cCCceEEeecCCCcceeeeeccceeecCCcc-ccCCCEeecccccccccccccCCCCCCcceeEE-----ecCCCC-CCc Q lcl|NC_020871. 233 LSKQTQLVRDNGNNVSVGFNIQGFHSARGFI-KLHGSTVMENEQILDERILALPTAPQQAKVTAT-----QEAGKK-GQF 305 (468) Q Consensus 233 ~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i-~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat-----~~~~~~-g~~ 305 (468) . .+++..++.+ ..+- ..|.| .+.|-.|+.+++. ....... ..+.+-.... ..+.-. .++ T Consensus 177 ~--~~f~~~s~~g-----~~~~----~~G~ig~~~G~~Vi~s~~~-p~~t~~l--~~~gA~~~~~~~~~~vE~~Rd~~~~ 242 (274) T protein:vir:94 177 S--TNFTRATELG-----DDII----VKGAFGEALGAIIVRTNKL-EAGTAIL--AKKGAVKLILKRDFFLEVARDASTK 242 (274) T ss_pred h--hhccccCccc-----ccce----eccccceecCeeEEEcCCC-CcceEEE--EeCcceEeeecCCceeccccchhhc Confidence 1 2333222211 1110 01111 2345666555442 1111110 0000000000 000000 011 Q ss_pred CcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEE Q lcl|NC_020871. 306 RAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEI 347 (468) Q Consensus 306 ~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltI 347 (468) ...-.+-+.|.|..++..+ ++.+|-+ +.+++. T Consensus 243 ~d~i~~~~~y~~~~~~~~~------vv~~t~~----~~~~~~ 274 (274) T protein:vir:94 243 TTALYSDKHYVAYLYDESK------AVKITKG----SGSLEM 274 (274) T ss_pred ccEEEEEEEEEEEEEcCCc------eEEEecC----cccccC Confidence 1111234556555555433 2333322 112222 No 135 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=33.42 E-value=1.4 Score=19.86 Aligned_cols=259 Identities=13% Similarity=0.128 Sum_probs=116.3 Q ss_pred CCC--cccchhhcccChhhHHHHHHHHhhcccccCcccccCccccchhhhhhHhhhhhhccccccch------hhhcccc Q lcl|NC_020871. 1 MPK--NNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFY------KDIAKKP 72 (468) Q Consensus 1 ~~~--~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~------~~i~k~~ 72 (468) |+. +...+.. .+|-+.+.+ +....+.+.|- .++.-++ T Consensus 1 ma~~~T~~~d~i---------------------------------iPev~~~~v--~~~~~~~l~~~~~~~~d~~l~g~~ 45 (274) T protein:vir:97 1 MPQGLTKTSDQI---------------------------------IPEVLAPMM--QAQLEKKLRFASFAEVDSTLQGQP 45 (274) T ss_pred CCccceehhhee---------------------------------chHHHHHHH--HHhhhhhhhhcccceecccccCCC Confidence 443 1222223 333333332 11111111111 1111111 Q ss_pred hhhhhhccceeeeeccccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020871. 73 ATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKT 152 (468) Q Consensus 73 ~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~ 152 (468) =.||+ +-.|+..|...-+.|+........+......+++..+-.+.+++...++ +..||+....+..-..++.. T Consensus 46 -G~tv~----iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~-~~~dp~~~~~~~~a~a~a~~ 119 (274) T protein:vir:97 46 -GDTLT----FPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred -CCEEE----EeeecCCCccccccCCCcccccccccceeEEEeeeecceecccHHHHHh-ccchHHHHHHHHHHHHHHHH Confidence 11221 1123334444445777777888888888899999988899999987655 56889877777777777777 Q ss_pred HHHHHhhcccccccCCCCCCCccccchhhhcCccceeeccCCCCCHHHHhhhhhhhhhccCceEEEecCHHHHhhHHHhh Q lcl|NC_020871. 153 IEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) Q Consensus 153 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nviDarG~~ls~~~l~~~a~~i~~~fG~~td~~m~~~v~a~~~~~~ 232 (468) ++..++ .-++.+ .+ +..+..++.+.|..|....+..-...+-++||+.+.+.|...- T Consensus 120 vd~~~~---~~l~~a-----~~---------------~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~ 176 (274) T protein:vir:97 120 VDNDVL---EALMGA-----KL---------------TVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDA 176 (274) T ss_pred HHHHHH---HHHhcc-----Cc---------------cccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhh Confidence 776654 122211 11 1123345566666666665555556677999999999885422 Q ss_pred cCCceEEeecCCCcceeeeeccceeecCCcc-ccCCCEeecccccccccccccCCCCCCcceeEE-----ecCCCC-CCc Q lcl|NC_020871. 233 LSKQTQLVRDNGNNVSVGFNIQGFHSARGFI-KLHGSTVMENEQILDERILALPTAPQQAKVTAT-----QEAGKK-GQF 305 (468) Q Consensus 233 ~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i-~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat-----~~~~~~-g~~ 305 (468) . .+++..++.+ ..+- ..|.| .+.|-.|+.+++. ....... ..+.+-.... ..+.-. .++ T Consensus 177 ~--~~f~~~s~~g-----~~~~----~~G~ig~~~G~~Vi~s~~~-p~~t~~l--~~~gA~~~~~~~~~~vE~~Rd~~~~ 242 (274) T protein:vir:97 177 S--TNFTRATELG-----DDII----VKGAFGEALGAIIVRTNKL-EAGTAIL--AKKGAVKLILKRDFFLEVARDASTK 242 (274) T ss_pred h--hhccccCccc-----ccce----eccccceecCeeEEEcCCC-CcceEEE--EeCcceEeeecCCceeccccchhhc Confidence 1 2333222211 1110 01111 2345666555442 1111110 0000000000 000000 011 Q ss_pred CcccceeEEEEEEEEcccCCcccccceeeeeeccCcceEEEE Q lcl|NC_020871. 306 RAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEI 347 (468) Q Consensus 306 ~~~~~~~y~YkVtavn~~GES~aS~~vt~Tv~a~~~g~~ltI 347 (468) ...-.+-+.|.|..++..+ ++.+|-+ +.+++. T Consensus 243 ~d~i~~~~~y~~~~~~~~~------vv~~t~~----~~~~~~ 274 (274) T protein:vir:97 243 TTALYSDKHYVAYLYDESK------AVKITKG----SGSLEM 274 (274) T ss_pred ccEEEEEEEEEEEEEcCCc------eEEEecC----cccccC Confidence 1111234556555555433 2333322 112222 No 136 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=33.29 E-value=1.4 Score=19.84 Aligned_cols=304 Identities=13% Similarity=0.027 Sum_probs=115.4 Q ss_pred CCCcccchhhc-------------------------------c---cChhhHHH----------------HHHHHhhccc Q lcl|NC_020871. 1 MPKNNKEEEVK-------------------------------E---VNLNSVQE----------------DALKSFTTGY 30 (468) Q Consensus 1 ~~~~~~~~~~~-------------------------------~---~n~~~~~e----------------~~~Ksf~agy 30 (468) |+..++++..+ + .+.++..+ .+..+++ T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~--- 77 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDIN--- 77 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHh--- Confidence 11111110000 0 00000001 1111222 Q ss_pred ccCcccccCccccchhhhhhHhhhhhhccccccchhhhcccchhhhhhccceeeeecccccccccccccc-ccccCcceE Q lcl|NC_020871. 31 GITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-APVSDPNIR 109 (468) Q Consensus 31 ~~~p~~~~~gaALr~esld~~i~~L~~~~~~f~~~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~ 109 (468) ..+..+|+.|-++.+.+.|....... ..+++.+...++.+. .++.+.+ +.+.+.++.|.+. +...++.+. T Consensus 78 ---~~t~~~Gg~lvP~~~~~~I~~~l~~~--spir~~a~v~~~~~~-~~i~~~~---~~~~a~W~~e~~~~~~~~~~~f~ 148 (381) T protein:vir:10 78 ---KSVGYKEEKLLPEETIDRIFEDLTTN--HPLLADLGIKNAGLR-LKFLKSE---TSGVAVWGKIYGEIKGQLDAAFS 148 (381) T ss_pred ---hcCCCCCceecCHHHHHHHHHHHHhh--cceeeeeeeEecCcc-eEEEeec---CCcceEEeecccccccccCccce Confidence 23345677888888888775422222 244444444444333 2333332 3344557788765 567899999 Q ss_pred EEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccccCCCCCCCccccchhhhcCcc-ce Q lcl|NC_020871. 110 QKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NV 188 (468) Q Consensus 110 r~~~~~k~l~~~~~vs~~~~lv~~~~Dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~-nv 188 (468) +.....+=|+.--.+|..+ |.++..|.+....+.--.++++.++.++..||-. +.+ -||.+.+... ++ T Consensus 149 ~i~l~~~kl~a~i~is~el-L~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~-----~qP-----~Gil~~~~~~~~~ 217 (381) T protein:vir:10 149 EETAIQNKLTAFVVLPKDL-NDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-----DQP-----IGLNRQVQKGVSV 217 (381) T ss_pred eEeecceeEEeeccccHHH-HhccHHHHHHHHHHHHHHHHHHHhhceeEecccC-----CCc-----eeeeecCCccccc Confidence 9999999888766666554 4567889999999999999999999999999943 112 2454433322 21 Q ss_pred eecc-------CC--CCCHH-HHhh-------hhhh---hhhcc-CceEEEecCHHHHhhHHHhhcCCceEEeecCCCcc Q lcl|NC_020871. 189 HDAR-------GA--SLTES-LLNQ-------AAVM---ISKGY-GTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNV 247 (468) Q Consensus 189 iDar-------G~--~ls~~-~l~~-------~a~~---i~~~f-G~~td~~m~~~v~a~~~~~~~~~qr~v~~~n~~~~ 247 (468) .+.- |- .++.. .++. .+.. ....| |.+ -+.|.+.+...++-.. ..+.++ |.. T Consensus 218 ~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~t~~~l~~~~-----~~~~~~-G~~ 290 (381) T protein:vir:10 218 TDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV-TMVVNPSDAFEVQAQY-----THLNAN-GVY 290 (381) T ss_pred cccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCce-EEEEchhhHHhhcccc-----ccCCCC-Cce Confidence 1110 00 01111 1111 1110 00111 111 2346666655542111 111111 111 Q ss_pred eee--eeccceeecCCccccCCCEeecccc-cccccccccCCCCCCcceeEEecCCCCCCcCcccceeEEEEEEEEcccC Q lcl|NC_020871. 248 SVG--FNIQGFHSARGFIKLHGSTVMENEQ-ILDERILALPTAPQQAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDA 324 (468) Q Consensus 248 ~~G--~~v~~~~s~~g~i~l~gs~i~~~~n-~l~~~~~~~p~ap~~~~vtat~~~~~~g~~~~~~~~~y~YkVtavn~~G 324 (468) ... +..+-+.+.. ++ -+..++.+.. .+...+..+ .+ .. +..-.|.. +.--|++..-- +| T Consensus 291 v~~lp~g~~vv~~~~--~p-~~~i~fGDfs~Y~i~~r~~~-------~i--~~--~~~~~~~~---d~~~f~a~~r~-dG 352 (381) T protein:vir:10 291 VTALPFNLNVIESTV--QE-AGKVLTYVKGLYDGYLAGGI-------NV--QK--FKETLALD---DMDLYTAKQFA-YG 352 (381) T ss_pred eecCCCCceeEEcCC--CC-cCcEEEEEcccEEEEEeccc-------EE--Ee--echhhhhc---CceEEEEEEEE-cC Confidence 110 0000000000 00 0111221111 000000000 00 00 00000110 11122222221 12 Q ss_pred Ccc-cccceeeeeeccCcceEEEEEeecCCCcccce Q lcl|NC_020871. 325 ESI-ASEVATATVTAKDDGVKLEIELAPMYSSRPQF 359 (468) Q Consensus 325 ES~-aS~~vt~Tv~a~~~g~~ltIT~~~~~ga~~~~ 359 (468) .-. +...+..++...+ +.+++....++. T Consensus 353 ~~~~~~A~~v~~l~~~~-------~~~~~~~~~~~~ 381 (381) T protein:vir:10 353 KAKDNKVAAVWKLDLKG-------HKPALEDTEETL 381 (381) T ss_pred EEecCCcEEEEEEeecC-------CccccccccccC Confidence 111 1111111211111 111122211111 No 137 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=28.41 E-value=1.8 Score=19.25 Aligned_cols=293 Identities=11% Similarity=0.034 Sum_probs=121.4 Q ss_pred CCCcccchhhcccChhhHHHHHHHHhhcccccCcccccCcccc-ch--hhhhhHhhhhhhccccccchhhhcccchhhhh Q lcl|NC_020871. 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGAL-RR--EFLDDQISMLTWTENDLTFYKDIAKKPATSTV 77 (468) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~e~~~Ksf~agy~~~p~~~~~gaAL-r~--esld~~i~~L~~~~~~f~~~~~i~k~~~~stv 77 (468) |.- .=.++.......++ ..+..+...++++ -+ |.+|+++....+. +++.-+.|+ +.+.+ T Consensus 1 ~~~------~~~~~~~~~~~~~~-------~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~--~~~~~~~i~---v~~~~ 62 (314) T protein:vir:10 1 MAI------KFDAEQAKITTHLE-------QMGVEKADAAGIWAVSQLTAALNRAYEKEYA--ENSVVNIFP---VTNEI 62 (314) T ss_pred Ccc------chHHHHHHHHHHHH-------hhcccchhhhHHHHHHHHHHHHHHHhhhhcc--ccccceeec---cccCC Confidence 110 00112222222222 2222223333333 33 4666666543322 222222222 12222 Q ss_pred hccce---eeeeccccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHHHHH Q lcl|NC_020871. 78 AKYDV---YMQHGKVGHTRFTRE-IGVAPVSDPNIRQKTVNMKFASDTKNISIAAGL--VNNIQDPMQILTDDAIVNIAK 151 (468) Q Consensus 78 ~ey~~---~~~hG~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--v~~~~Dp~~~~~~~ai~~~~~ 151 (468) .+|.+ +......|....++. .+..+..|.++.|++..+..++..+.++..--. ...=.+..+.....|.+.+.+ T Consensus 63 ~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~ 142 (314) T protein:vir:10 63 PGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDN 142 (314) T ss_pred CCceeEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHH Confidence 22221 112234455445554 344678899999999999999999999754222 222235567777888888888 Q ss_pred HHHHHHhhcccccccCCCCCCCccccchhhhcCccce--eeccCCCCC----HHHHhhhh---hhhhhccCceEEEecCH Q lcl|NC_020871. 152 TIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNV--HDARGASLT----ESLLNQAA---VMISKGYGTPTDAYMPV 222 (468) Q Consensus 152 ~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~li~~~nv--iDarG~~ls----~~~l~~~a---~~i~~~fG~~td~~m~~ 222 (468) .+-..+||||+.++ +-||.+- .+| .=+.+.--+ .+.|+++- ...+.++-.|+.+.||+ T Consensus 143 ~~n~i~f~G~~~~g----------~~GLlN~---p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp 209 (314) T protein:vir:10 143 LLDKLVWSGSAPHG----------IVSVFDQ---PNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPA 209 (314) T ss_pred hhceEEEeeccccc----------ceeEeec---CCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecH Confidence 99999999997653 2344331 111 000001111 23344322 33345677889999999 Q ss_pred HHHhhHHHhhcCCceEEeecCCCcceeeeeccceeecCCccccCCCEeecccccccccccccCCCCCCcceeEEecCCCC Q lcl|NC_020871. 223 GVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQQAKVTATQEAGKK 302 (468) Q Consensus 223 ~v~a~~~~~~~~~qr~v~~~n~~~~~~G~~v~~~~s~~g~i~l~gs~i~~~~n~l~~~~~~~p~ap~~~~vtat~~~~~~ 302 (468) .-.+.+ +|.. ++ .|..+-.++-.++ ++ -.| -. +|.- . + +++. T Consensus 210 ~~~~~L-------~~~~--~~-----~~~tvl~~l~~n~-~~---l~I-----------~~---~~el---~-~--ag~~ 251 (314) T protein:vir:10 210 SARRVM-------QGLV--PQ-----TNLSYGELFTRNN-PG---LTI-----------RF---LQFL---D-N--YDGA 251 (314) T ss_pred HHHHhh-------cccc--cC-----CCccHHHHHHHhC-CC---cEE-----------EE---cccc---c-c--cCCC Confidence 866544 1221 11 2344444432211 00 011 01 1110 0 0 1111 Q ss_pred CCcCcccceeEEEEEEEEcccCCcccccceee---eeeccCcceEEEEEeecCCCcccceEEEEeecCCCceeEEEEEEe Q lcl|NC_020871. 303 GQFRAEDLAAHEYKVVVSSDDAESIASEVATA---TVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVP 379 (468) Q Consensus 303 g~~~~~~~~~y~YkVtavn~~GES~aS~~vt~---Tv~a~~~g~~ltIT~~~~~ga~~~~y~IYR~~~~~G~f~~igrv~ 379 (468) |+ -.+.+. . .+-+. .+-.+.. ..+....+-..++....-.+ .+.||| +..+.+.. T Consensus 252 g~--------~~~v~y-~-~~~~~-~~~~vp~~~~~l~~e~~~~~~~~~~~~r~~----Gv~i~~---P~ai~~~d---- 309 (314) T protein:vir:10 252 GG--------KAALAF-E-KSPLN-MSIEIPEVTNVLPAQPKDLHFRYPVTSKAT----GLIVYR---PLTMAVIK---- 309 (314) T ss_pred cc--------eEEEEE-e-cCCcE-EEEecCccceeecceecCceEEEcceeeeE----EEEEEC---cceeEeee---- Confidence 11 122211 1 11111 1100000 01111111111111111111 244555 22233333 Q ss_pred cccccCCeeEEe Q lcl|NC_020871. 380 ASKAENNVITFY 391 (468) Q Consensus 380 ~s~~~~~t~tf~ 391 (468) ++||. T Consensus 310 -------GI~~~ 314 (314) T protein:vir:10 310 -------GITFA 314 (314) T ss_pred -------eeecC Confidence 35555 Done!