Query lcl|Aclame:protein:vir:103759|NCBI_annot:hypothetical protein|genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083
Match_columns 330
No_of_seqs 94 out of 102
Neff 5.8
Searched_HMMs 1612
Date Sat Nov 30 23:14:39 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:103759 Length: 330 100.0 5E-154 3E-157 861.1 33.6 330 1-330 1-330 (330)
2 protein:vir:7324 Length: 335 # 100.0 3E-149 2E-152 834.6 33.4 328 1-330 1-334 (335)
3 protein:vir:95318 Length: 328 100.0 2E-147 1E-150 824.5 33.7 328 1-330 1-328 (328)
4 protein:vir:107388 Length: 331 100.0 2E-146 1E-149 819.1 33.7 328 1-330 1-331 (331)
5 protein:vir:98525 Length: 331 100.0 2E-146 1E-149 819.1 33.7 328 1-330 1-331 (331)
6 protein:vir:107826 Length: 331 100.0 2E-146 1E-149 819.1 33.7 328 1-330 1-331 (331)
7 protein:vir:94933 Length: 330 100.0 8.4E-57 5.2E-60 328.0 19.7 230 1-244 20-330 (330)
8 protein:vir:97255 Length: 310 100.0 3.4E-49 2.1E-52 286.3 21.5 230 1-330 1-238 (310)
9 protein:vir:8187 Length: 311 # 99.3 1.2E-12 7.4E-16 86.0 19.5 230 1-330 1-247 (311)
10 protein:vir:7771 Length: 330 # 99.2 1.9E-12 1.2E-15 84.8 18.0 238 1-330 1-249 (330)
11 protein:vir:104085 Length: 320 99.2 3E-12 1.9E-15 83.8 18.6 234 1-330 1-247 (320)
12 protein:vir:99920 Length: 311 99.2 5.8E-12 3.6E-15 82.2 19.9 233 1-330 1-240 (311)
13 protein:vir:97053 Length: 390 99.2 1.3E-12 7.9E-16 85.8 15.3 229 1-330 107-336 (390)
14 protein:vir:4339 Length: 395 # 99.2 2.6E-12 1.6E-15 84.1 16.7 231 1-330 106-338 (395)
15 protein:vir:96223 Length: 324 99.2 2.5E-12 1.5E-15 84.2 16.4 225 1-330 21-245 (324)
16 protein:vir:94771 Length: 298 99.2 6.5E-12 4E-15 81.9 18.6 229 1-330 1-234 (298)
17 protein:vir:105905 Length: 304 99.2 3.7E-12 2.3E-15 83.3 17.0 229 1-330 1-233 (304)
18 protein:vir:94142 Length: 304 99.2 3.7E-12 2.3E-15 83.3 17.0 229 1-330 1-233 (304)
19 protein:vir:1886 Length: 385 # 99.2 9.6E-13 5.9E-16 86.5 13.7 229 1-330 96-327 (385)
20 protein:vir:191 Length: 385 # 99.2 9.6E-13 5.9E-16 86.5 13.7 229 1-330 96-327 (385)
21 protein:vir:100247 Length: 425 99.1 1.7E-11 1E-14 79.6 17.8 238 1-330 127-369 (425)
22 protein:vir:99749 Length: 324 99.1 8.4E-12 5.2E-15 81.3 16.0 225 1-330 21-245 (324)
23 protein:vir:103955 Length: 324 99.1 8.9E-12 5.5E-15 81.2 16.0 225 1-330 21-245 (324)
24 protein:vir:8102 Length: 543 # 99.1 7.1E-12 4.4E-15 81.7 15.3 229 1-330 243-482 (543)
25 protein:vir:9309 Length: 324 # 99.1 1.3E-11 8.1E-15 80.3 16.1 224 1-330 21-245 (324)
26 protein:vir:10364 Length: 390 99.1 7.4E-12 4.6E-15 81.6 14.7 229 1-330 107-336 (390)
27 protein:vir:100135 Length: 418 99.1 1.1E-11 6.9E-15 80.6 15.6 226 1-330 132-358 (418)
28 protein:vir:81070 Length: 390 99.1 1.3E-11 7.9E-15 80.3 14.8 229 1-330 95-336 (390)
29 protein:vir:41 Length: 299 # N 99.0 4.5E-11 2.8E-14 77.3 17.3 226 1-330 1-226 (299)
30 protein:vir:9574 Length: 300 # 99.0 8.5E-11 5.3E-14 75.8 18.8 230 1-330 1-235 (300)
31 protein:vir:4456 Length: 401 # 99.0 1.4E-11 8.6E-15 80.1 14.4 233 1-330 107-345 (401)
32 protein:vir:2344 Length: 397 # 99.0 7.1E-11 4.4E-14 76.2 18.2 224 4-330 1-236 (397)
33 protein:vir:78523 Length: 338 99.0 1.5E-10 9.2E-14 74.5 19.8 230 1-330 1-265 (338)
34 protein:vir:1638 Length: 298 # 99.0 6E-11 3.7E-14 76.6 17.3 229 1-330 1-234 (298)
35 protein:vir:96392 Length: 324 99.0 3.3E-11 2.1E-14 78.0 15.9 225 1-330 21-245 (324)
36 protein:vir:78830 Length: 324 99.0 3.3E-11 2.1E-14 78.0 15.9 225 1-330 21-245 (324)
37 protein:vir:2430 Length: 318 # 99.0 7.8E-11 4.8E-14 76.0 17.6 225 1-330 14-243 (318)
38 protein:vir:97148 Length: 324 99.0 4.2E-11 2.6E-14 77.5 15.8 224 1-330 21-245 (324)
39 protein:vir:4226 Length: 326 # 99.0 1.2E-10 7.6E-14 74.9 17.7 236 1-330 1-253 (326)
40 protein:vir:485 Length: 407 # 99.0 1.1E-10 7E-14 75.1 17.4 241 1-330 90-344 (407)
41 protein:vir:9759 Length: 303 # 99.0 2.6E-10 1.6E-13 73.1 18.9 230 1-330 1-238 (303)
42 protein:vir:1328 Length: 392 # 99.0 1.1E-10 6.5E-14 75.3 16.5 230 1-330 104-336 (392)
43 protein:vir:94673 Length: 419 99.0 1.2E-10 7.2E-14 75.1 16.5 231 1-330 121-360 (419)
44 protein:vir:104256 Length: 458 98.9 6.6E-11 4.1E-14 76.4 14.6 233 1-330 158-403 (458)
45 protein:vir:78223 Length: 333 98.9 5.5E-10 3.4E-13 71.4 19.4 230 1-330 1-265 (333)
46 protein:vir:1433 Length: 435 # 98.9 2.4E-10 1.5E-13 73.3 16.8 231 1-330 103-366 (435)
47 protein:vir:4197 Length: 314 # 98.9 1.1E-10 6.5E-14 75.3 14.7 235 1-330 1-257 (314)
48 protein:vir:80376 Length: 435 98.9 4E-10 2.5E-13 72.1 17.5 232 1-330 105-366 (435)
49 protein:vir:7855 Length: 497 # 98.9 3.3E-11 2E-14 78.1 11.4 274 1-330 151-435 (497)
50 protein:vir:101650 Length: 497 98.9 3.3E-11 2E-14 78.1 11.4 274 1-330 151-435 (497)
51 protein:vir:80684 Length: 315 98.9 3E-10 1.9E-13 72.8 16.2 228 1-330 1-242 (315)
52 protein:vir:5739 Length: 366 # 98.9 5.6E-10 3.5E-13 71.3 17.3 233 1-330 52-299 (366)
53 protein:vir:105038 Length: 428 98.9 1.3E-09 7.9E-13 69.4 19.0 234 1-330 113-361 (428)
54 protein:vir:6242 Length: 390 # 98.8 2.4E-10 1.5E-13 73.3 14.4 229 1-330 97-335 (390)
55 protein:vir:4600 Length: 415 # 98.8 3.9E-10 2.4E-13 72.2 15.2 230 1-330 116-350 (415)
56 protein:vir:4700 Length: 415 # 98.8 3.9E-10 2.4E-13 72.2 15.2 230 1-330 116-350 (415)
57 protein:vir:81100 Length: 415 98.8 8.2E-10 5.1E-13 70.4 15.9 230 1-330 116-350 (415)
58 protein:vir:79987 Length: 415 98.8 8.2E-10 5.1E-13 70.4 15.9 230 1-330 116-350 (415)
59 protein:vir:98339 Length: 415 98.8 8.2E-10 5.1E-13 70.4 15.9 230 1-330 116-350 (415)
60 protein:vir:1268 Length: 397 # 98.8 6.4E-10 4E-13 71.0 15.1 209 1-330 123-338 (397)
61 protein:vir:4159 Length: 315 # 98.8 2.1E-10 1.3E-13 73.7 12.2 236 1-330 1-262 (315)
62 protein:vir:2504 Length: 305 # 98.8 2.3E-09 1.4E-12 68.0 17.2 224 1-330 1-232 (305)
63 protein:vir:95763 Length: 297 98.8 1.1E-09 6.5E-13 69.8 15.3 224 1-330 1-226 (297)
64 protein:vir:4953 Length: 397 # 98.7 2.4E-09 1.5E-12 67.8 16.7 209 1-330 109-326 (397)
65 protein:vir:102119 Length: 404 98.7 5.4E-09 3.4E-12 65.9 18.6 230 1-330 92-341 (404)
66 protein:vir:81160 Length: 371 98.7 4.3E-09 2.7E-12 66.5 18.0 209 1-330 91-312 (371)
67 protein:vir:6212 Length: 434 # 98.7 4.9E-09 3E-12 66.1 17.7 226 1-330 131-370 (434)
68 protein:vir:4997 Length: 397 # 98.7 6.1E-09 3.8E-12 65.6 17.7 210 1-330 109-326 (397)
69 protein:vir:3991 Length: 404 # 98.7 4.5E-09 2.8E-12 66.3 16.7 217 1-330 98-334 (404)
70 protein:vir:9410 Length: 415 # 98.7 3.2E-09 2E-12 67.2 15.1 230 1-330 116-350 (415)
71 protein:vir:4830 Length: 397 # 98.7 8.1E-09 5E-12 65.0 17.3 210 1-330 109-326 (397)
72 protein:vir:7409 Length: 408 # 98.6 6.1E-09 3.8E-12 65.6 15.9 216 1-330 100-334 (408)
73 protein:vir:1025 Length: 408 # 98.6 1.4E-08 8.5E-12 63.7 17.8 216 1-330 100-334 (408)
74 protein:vir:3845 Length: 395 # 98.6 5.7E-09 3.5E-12 65.8 15.6 215 1-330 102-324 (395)
75 protein:vir:81227 Length: 413 98.6 6.6E-09 4.1E-12 65.4 15.8 229 1-330 113-353 (413)
76 protein:vir:3158 Length: 321 # 98.6 3.1E-09 1.9E-12 67.3 13.8 234 1-330 1-254 (321)
77 protein:vir:4511 Length: 409 # 98.5 6.5E-08 4E-11 60.0 18.9 233 1-330 93-350 (409)
78 protein:vir:4856 Length: 293 # 98.5 2.3E-08 1.4E-11 62.4 15.9 209 1-330 5-222 (293)
79 protein:vir:9704 Length: 394 # 98.4 4E-08 2.5E-11 61.2 15.7 211 1-330 121-336 (394)
80 protein:vir:105004 Length: 392 98.4 6.5E-08 4E-11 60.0 15.7 209 1-330 106-325 (392)
81 protein:vir:102873 Length: 392 98.4 6.5E-08 4E-11 60.0 15.7 209 1-330 106-325 (392)
82 protein:vir:107593 Length: 392 98.4 6.5E-08 4E-11 60.0 15.7 209 1-330 106-325 (392)
83 protein:vir:102082 Length: 392 98.4 6.5E-08 4E-11 60.0 15.7 209 1-330 106-325 (392)
84 protein:vir:101607 Length: 379 98.4 5.1E-08 3.1E-11 60.6 14.3 217 1-330 101-322 (379)
85 protein:vir:4092 Length: 390 # 98.3 2E-07 1.2E-10 57.3 16.9 231 1-330 64-312 (390)
86 protein:vir:9643 Length: 377 # 98.3 1.4E-07 8.7E-11 58.2 15.5 244 1-330 59-352 (377)
87 protein:vir:93616 Length: 645 98.3 2.9E-07 1.8E-10 56.4 16.7 229 1-330 331-569 (645)
88 protein:vir:8420 Length: 477 # 98.2 9.3E-07 5.7E-10 53.7 17.5 236 1-330 145-415 (477)
89 protein:vir:3870 Length: 400 # 98.2 2.2E-07 1.4E-10 57.1 13.9 211 1-330 120-345 (400)
90 protein:vir:95376 Length: 425 98.2 6.3E-07 3.9E-10 54.6 16.3 226 1-330 119-365 (425)
91 protein:vir:100172 Length: 394 98.1 7.3E-07 4.5E-10 54.2 15.9 215 1-330 103-330 (394)
92 protein:vir:1084 Length: 437 # 98.1 8.6E-07 5.3E-10 53.8 15.3 214 1-330 141-372 (437)
93 protein:vir:96762 Length: 632 98.1 2.4E-06 1.5E-09 51.4 17.7 223 1-330 316-579 (632)
94 protein:vir:100884 Length: 389 98.0 1.5E-06 9E-10 52.6 15.1 212 1-330 106-328 (389)
95 protein:vir:80128 Length: 466 98.0 6.2E-07 3.8E-10 54.6 12.9 249 1-330 131-393 (466)
96 protein:vir:1383 Length: 421 # 98.0 1.4E-06 8.6E-10 52.7 14.8 212 1-330 109-326 (421)
97 protein:vir:97255 Length: 310 98.0 9.2E-08 5.7E-11 59.2 8.2 210 1-243 74-310 (310)
98 protein:vir:101291 Length: 381 97.9 9.8E-07 6.1E-10 53.5 12.7 246 1-330 57-343 (381)
99 protein:vir:9509 Length: 381 # 97.9 9.8E-07 6.1E-10 53.5 12.7 246 1-330 57-343 (381)
100 protein:vir:95963 Length: 395 97.8 3E-06 1.9E-09 50.8 14.7 232 1-330 67-320 (395)
101 protein:vir:962 Length: 397 # 97.8 2E-06 1.3E-09 51.8 13.2 210 1-330 127-343 (397)
102 protein:vir:93881 Length: 387 97.8 1.6E-06 9.7E-10 52.4 12.6 215 1-330 100-329 (387)
103 protein:vir:95603 Length: 463 97.8 1.2E-06 7.2E-10 53.1 11.4 231 1-330 1-283 (463)
104 protein:vir:99311 Length: 463 97.8 1.2E-06 7.2E-10 53.1 11.4 231 1-330 1-283 (463)
105 protein:vir:78350 Length: 383 97.8 9.8E-07 6.1E-10 53.5 10.8 267 1-330 64-350 (383)
106 protein:vir:78640 Length: 352 97.8 6.5E-06 4E-09 49.0 15.2 215 1-330 64-293 (352)
107 protein:vir:9361 Length: 402 # 97.7 7E-06 4.4E-09 48.8 14.4 211 1-330 130-343 (402)
108 protein:vir:9820 Length: 272 # 97.6 9.1E-06 5.7E-09 48.2 14.6 209 1-330 1-217 (272)
109 protein:vir:3033 Length: 272 # 97.6 9.1E-06 5.7E-09 48.2 14.6 209 1-330 1-217 (272)
110 protein:vir:96666 Length: 462 97.6 2.5E-06 1.5E-09 51.3 10.7 233 1-330 1-283 (462)
111 protein:vir:2685 Length: 387 # 97.5 1.7E-05 1.1E-08 46.7 14.9 210 1-330 115-328 (387)
112 protein:vir:94424 Length: 387 97.5 1.7E-05 1.1E-08 46.7 14.9 210 1-330 115-328 (387)
113 protein:vir:96978 Length: 387 97.5 1.7E-05 1.1E-08 46.7 14.9 210 1-330 115-328 (387)
114 protein:vir:98635 Length: 377 97.5 2E-06 1.3E-09 51.8 9.7 256 1-330 59-352 (377)
115 protein:vir:100632 Length: 381 97.5 1.3E-05 7.8E-09 47.4 13.3 250 1-330 57-343 (381)
116 protein:vir:80835 Length: 464 97.1 2.8E-06 1.8E-09 51.0 6.4 296 1-330 1-402 (464)
117 protein:vir:100851 Length: 514 97.1 6.6E-06 4.1E-09 49.0 7.8 231 1-330 45-301 (514)
118 protein:vir:102823 Length: 470 96.5 0.00024 1.5E-07 40.4 12.6 238 4-330 1-281 (470)
119 protein:vir:63741 Length: 468 96.4 3.5E-05 2.1E-08 45.0 7.5 305 1-330 1-361 (468)
120 protein:vir:97397 Length: 517 96.4 0.00052 3.2E-07 38.6 13.8 229 1-330 226-500 (517)
121 protein:vir:80491 Length: 467 96.4 4.1E-05 2.5E-08 44.7 7.6 305 1-330 1-360 (467)
122 protein:vir:99424 Length: 360 96.2 0.00033 2E-07 39.7 11.7 255 1-330 15-298 (360)
123 protein:vir:8843 Length: 317 # 94.9 0.0031 1.9E-06 34.3 13.4 241 1-330 1-264 (317)
124 protein:vir:93742 Length: 274 92.4 0.012 7.2E-06 31.2 14.7 212 1-330 1-218 (274)
125 protein:vir:96123 Length: 274 89.0 0.029 1.8E-05 29.0 14.6 211 1-330 1-218 (274)
126 protein:vir:4074 Length: 480 # 88.3 0.033 2.1E-05 28.7 11.7 216 1-330 171-425 (480)
127 protein:vir:80930 Length: 278 86.8 0.043 2.7E-05 28.1 14.9 218 1-330 1-225 (278)
128 protein:vir:96833 Length: 275 86.7 0.044 2.7E-05 28.0 15.3 213 1-330 1-219 (275)
129 protein:vir:3613 Length: 272 # 76.3 0.14 8.5E-05 25.3 13.2 208 1-330 1-219 (272)
130 protein:vir:97433 Length: 274 73.2 0.17 0.00011 24.8 15.1 211 1-330 1-218 (274)
131 protein:vir:94494 Length: 274 73.2 0.17 0.00011 24.8 15.1 211 1-330 1-218 (274)
132 protein:vir:1239 Length: 274 # 62.2 0.34
0.00021 23.2 15.5 209 1-330 1-218 (274) 133 protein:vir:105334 Length: 276 56.5 0.46 0.00028 22.5 15.2 212 1-330 1-218 (276) 134 protein:vir:95898 Length: 274 27.4 1.8 0.0011 19.1 14.9 211 1-330 1-218 (274) 135 protein:vir:96262 Length: 274 27.4 1.8 0.0011 19.1 14.9 211 1-330 1-218 (274) No 1 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=100.00 E-value=4.8e-154 Score=861.11 Aligned_cols=330 Identities=100% Similarity=1.468 Sum_probs=328.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) ||+|++++|||+|+||++++|++++.|||+|+|+||||++|||+|||++++|++++|++||+++||++|+||++++++++ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) |++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||+|||+++|+.++.|+| T Consensus 81 qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qvI 160 (330) T protein:vir:10 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 
(330) T ss_pred EEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) ||||||+++||||+|+||++++|||||||+|+||+|+|+|++++++.||+||+|+||++||+|++||+|+|||||+|||| T Consensus 161 daGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~N 240 (330) T protein:vir:10 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) T ss_pred eccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) ||+|+|+++++.+|||++|+.|++|||++.+|+++||||++|+++||+|+++|+|++++.++++|++||+|+|||||||| T Consensus 241 Idvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~~D 320 (330) T protein:vir:10 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) T ss_pred cccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) ||||||++|| T Consensus 321 ail~tE~~vv 330 (330) T protein:vir:10 321 ALLNTESRVV 330 (330) T ss_pred eeecCccccC Confidence 
9999999999 No 2 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=100.00 E-value=3.4e-149 Score=834.55 Aligned_cols=328 Identities=44% Similarity=0.783 Sum_probs=321.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) ||+|++++|||+|+||++++|++++.|||+|+++||||++|||+|||++++|++++|++||+++||++|+||++++++++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQTV 80 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccc---cCCcc Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLS---AENKD 157 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t---~~~~~ 157 (330) |++++|+||||++||||+|++++||.++||++|+++|+|||+|+|+++|||||++++|++|+||++||++++ +.++. 
T Consensus 81 qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a~ 160 (335) T protein:vir:73 81 PVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASAE 160 (335) T ss_pred EEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCccc Confidence 999999999999999999999999999999999999999999999999999999999999999999999887 66789 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) |+|||||||+++||||+|+||++++|||||||+|+||+|+|+|++++. |++|++|+||++||+|++||+|+|||||+| T Consensus 161 ~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~--d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 161 NVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVS--DGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred ceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeee--cCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence 999999999999999999999999999999999999999999999985 778999999999999999999999999999 Q ss_pred eecccccccccchhH-HHHHHHHHHHH--HhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCe Q lcl|Aclame:pro 238 VCNIDVSDLATSANA-QALIKYMIMAA--ERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGI 314 (330) Q Consensus 238 I~NId~~~l~~~~~~-~~l~~~m~~a~--~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gv 314 (330) |||||+|+|+++++. 
.+|+++|++|+ ++||++++|+++||||++|+++|++|+++|+|++++.++++|+++|+|+|| T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gi 318 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGI 318 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCe Confidence 999999999999865 68999999998 699999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeeccCCccccC Q lcl|Aclame:pro 315 PVQRTDALLNTESRVV 330 (330) Q Consensus 315 pir~~dal~~tE~~Vv 330 (330) ||||||||||||++|| T Consensus 319 pir~~Dail~tE~~v~ 334 (335) T protein:vir:73 319 PIRRVDAILNTESAVT 334 (335) T ss_pred EEEEEeeeecCccccc Confidence 9999999999999999 No 3 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=100.00 E-value=2.3e-147 Score=824.53 Aligned_cols=328 Identities=62% Similarity=1.017 Sum_probs=324.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) ||++++++|||+|+||++++|+++++|||+|+++||||++|||+|+|+|++|.|++|++||+++||++|+||++++++++ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~tt~ 80 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKSTTV 80 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 
QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) |++++|+||||++||||+|++++||.++|||+|+++|+|+|+++|+++|||||++++|++|+||++||+++++.++.|+| T Consensus 81 q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qii 160 (328) T protein:vir:95 81 QVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNII 160 (328) T ss_pred EEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcccccccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) ||||||+++||||+|+||++++|||||||+|+||+|+|+|++++. 
|++|++|+||++||+|++||||+|||||||||| T Consensus 161 daGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~--~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~N 238 (328) T protein:vir:95 161 DAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLE--DANGGKYEGYRTHYKWDNGLALRDWRYVVRIAN 238 (328) T ss_pred ecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeee--cCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 999999999999999999999999999999999999999999986 788999999999999999999999999999999 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) ||+|+|+++++..+|+++|++|+++||++.+|+++||||++|+++|++|+++|+|+++++++++|+++|+|+|||||+|| T Consensus 239 Id~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~d 318 (328) T protein:vir:95 239 IDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETD 318 (328) T ss_pred CcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) |||+||++|| T Consensus 319 ai~~tE~~vv 328 (328) T protein:vir:95 319 ALLETEARVV 328 (328) T ss_pred eeecCccccC Confidence 9999999999 No 4 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=100.00 E-value=2.2e-146 Score=819.10 Aligned_cols=328 Identities=65% Similarity=1.018 Sum_probs=320.0 Q ss_pred CCccccccccHHHHHhhcCcccc-hHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGK-VDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 
M~~~~~~a~TL~E~Ak~~~~d~~-~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ||++.+++|||+|+||+++.+++ .++|||+|+|+||||++|||+|||++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 99999999999999999987776 46899999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||++||+++++.++.|+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |||||||+++||||+|+||++++|||||||+|+||+|+|+|++++. 
|++|++|+||++||+|++||+|+||||||||| T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~--~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLI--DAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeee--cCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 9999999999999999999999999999999999999999999986 78899999999999999999999999999999 Q ss_pred cccccccccchhH-HHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccc-eeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 240 NIDVSDLATSANA-QALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIAN-NLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 240 NId~~~l~~~~~~-~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~-~l~~~~~~g~~v~~~~gvpir 317 (330) |||+|+|+++++. .||+++|++|+++||++.+|+++||||++|+++|++|+++|+|+ +++.++++|++||+|+||||| T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 9999999998766 69999999999999999999999999999999999999999885 599999999999999999999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) +|||||+||++|| T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:10 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 9999999999999 No 5 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=100.00 E-value=2.2e-146 Score=819.10 Aligned_cols=328 Identities=65% Similarity=1.018 Sum_probs=320.0 Q ss_pred CCccccccccHHHHHhhcCcccc-hHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 
MATLSTNNPTMADVAKRLDPNGK-VDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~-~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ||++.+++|||+|+||+++.+++ .++|||+|+|+||||++|||+|||++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 99999999999999999987776 46899999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||++||+++++.++.|+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:98 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |||||||+++||||+|+||++++|||||||+|+||+|+|+|++++. 
|++|++|+||++||+|++||+|+||||||||| T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~--~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLI--DAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeee--cCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 9999999999999999999999999999999999999999999986 78899999999999999999999999999999 Q ss_pred cccccccccchhH-HHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccc-eeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 240 NIDVSDLATSANA-QALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIAN-NLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 240 NId~~~l~~~~~~-~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~-~l~~~~~~g~~v~~~~gvpir 317 (330) |||+|+|+++++. .||+++|++|+++||++.+|+++||||++|+++|++|+++|+|+ +++.++++|++||+|+||||| T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 9999999998766 69999999999999999999999999999999999999999885 599999999999999999999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) +|||||+||++|| T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:98 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 9999999999999 No 6 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=100.00 E-value=2.2e-146 Score=819.10 Aligned_cols=328 Identities=65% Similarity=1.018 Sum_probs=320.0 Q ss_pred CCccccccccHHHHHhhcCcccc-hHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 
MATLSTNNPTMADVAKRLDPNGK-VDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~-~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ||++.+++|||+|+||+++.+++ .++|||+|+|+||||++|||+|||++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 99999999999999999987776 46899999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||++||+++++.++.|+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |||||||+++||||+|+||++++|||||||+|+||+|+|+|++++. 
|++|++|+||++||+|++||+|+||||||||| T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~--~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLI--DAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeee--cCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 9999999999999999999999999999999999999999999986 78899999999999999999999999999999 Q ss_pred cccccccccchhH-HHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccc-eeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 240 NIDVSDLATSANA-QALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIAN-NLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 240 NId~~~l~~~~~~-~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~-~l~~~~~~g~~v~~~~gvpir 317 (330) |||+|+|+++++. .||+++|++|+++||++.+|+++||||++|+++|++|+++|+|+ +++.++++|++||+|+||||| T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 9999999998766 69999999999999999999999999999999999999999885 599999999999999999999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) +|||||+||++|| T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:10 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 9999999999999 No 7 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=100.00 E-value=8.4e-57 Score=328.02 Aligned_cols=230 Identities=17% Similarity=0.191 Sum_probs=197.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccc-eE Q lcl|Aclame:pro 1 
MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKS-ST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~-t~ 79 (330) .|.+++.++||+|++ ++.+|.+..+|||+|+++|+||++|||++++++ .|.|+|++++|+++||++|++|+++++ |+ T Consensus 20 ~p~l~m~alTLaea~-~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~-~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf 97 (330) T protein:vir:94 20 FPELKMPTVTLAESA-KLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGN-ALAYNRENVLGDVQFLAVGGTITAKNPATF 97 (330) T ss_pred ccccchhhhhhhHHh-hcCchhhHHHHHHhhhccchHHhhcccccccCC-cceeeeeecCCcceeeeccccccccCccee Confidence 799999999999955 588999999999999999999999999998755 699999999999999999999999875 67 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccc------- Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLS------- 152 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t------- 152 (330) +|++++|++++|+++||++|++++|++.++|++|.++|+|+|+++|+++|||||++. ++|+||.+|+.... 
T Consensus 98 ~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~--~~F~GL~~~~~~~q~i~tg~~ 175 (330) T protein:vir:94 98 TKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTG--NSFQGMMGLVAASQTISAGAN 175 (330) T ss_pred eeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC--ccccchhhcCCcccEEecCCC Confidence 999999999999999999999999999999999999999999999999999999774 58999999884110 Q ss_pred --------------------c---------------------------------CCccee-------------e--ecC- Q lcl|Aclame:pro 153 --------------------A---------------------------------ENKDNV-------------I--DAG- 163 (330) Q Consensus 153 --------------------~---------------------------------~~~~~v-------------i--dAG- 163 (330) + ..+.+| | +.+ T Consensus 176 gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~ 255 (330) T protein:vir:94 176 GGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQ 255 (330) T ss_pred CCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCc Confidence 0 001111 1 111 Q ss_pred CCCCCceEEEEEEeCCC----cEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 164 GTGSDNASAWLVVWGPN----TCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 164 gtg~~~tSi~~V~~g~~----~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) ++++++||||+|+||++ .|||++++| ..||++++.|+. ++. .+++.+++||||++|.+.++++++. T Consensus 256 ~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g-~~glsVr~~G~~--~~k-------~v~~~~v~~y~~~av~~~~a~~~L~ 325 (330) T protein:vir:94 256 GTATNATAIFAGTFDDGSNKYGIAGLTARG-SAGLRVQNVGAK--ENA-------DETITRVKMYCGFANFSQLGLAAIK 325 (330) T ss_pred ccCCCceeEEEEeecccccccceEeecCCC-CCcceeeeCCCc--ccc-------ceeeEEEEEeeeeEEechhheeeec Confidence 35678899999999975 899999887 579999999852 222 2357789999999999999999999 Q ss_pred ccccc Q lcl|Aclame:pro 240 NIDVS 244 (330) Q Consensus 240 NId~~ 244 (330) ||.+. 
T Consensus 326 ~V~~g 330 (330) T protein:vir:94 326 GLIPG 330 (330) T ss_pred cccCC Confidence 99987 No 8 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=100.00 E-value=3.4e-49 Score=286.29 Aligned_cols=230 Identities=17% Similarity=0.127 Sum_probs=188.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecC-----CccCcc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLY-----GGVLPN 75 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN-----~g~~~s 75 (330) || ++||+|++| +.+|.+..+|||+|+++|+||++|||++++++ .|.|+|++++|+++||++| +|++++ T Consensus 1 mp-----altLaea~k-~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~-~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~ 73 (310) T protein:vir:97 1 MA-----SVTLAESAK-LAQDELVAGVIENIITVNRMFDVLPFDSIEGN-SLAYNRENVLGDVIMAGVGTTFSGAGAGKA 73 (310) T ss_pred Cc-----ccchHHHhh-cCcchHHHHHHHHHhccchHHHhCCcccccCC-cceeeEeeccCCcccccccccccCCCcccc Confidence 65 569999765 99999999999999999999999999998744 7999999999999999876 777899 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccC Q lcl|Aclame:pro 76 KSSTAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAE 154 (330) Q Consensus 76 ~~t~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~ 154 (330) ++|++|++++|++++|+++||++|++++ +++.++++.|.++++|+++++|+++|||||++.| +|+||.+|+ T Consensus 74 ~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n--~F~GL~~~~------ 145 (310) T protein:vir:97 74 AATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGN--EFAGLIQLC------ 145 (310) T ss_pred ccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCC--cccchhhcC------ Confidence 9999999999999999999999999996 7899999999999999999999999999999987 499999986 Q ss_pred 
CcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEecccc Q lcl|Aclame:pro 155 NKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRY 234 (330) Q Consensus 155 ~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~ 234 (330) .+.|.||+|++|+.+| T Consensus 146 ~~~q~i~~~~~gg~~t---------------------------------------------------------------- 161 (310) T protein:vir:97 146 ASGQKATTGATGSAIS---------------------------------------------------------------- 161 (310) T ss_pred CccceeecCCCCCCCC---------------------------------------------------------------- Confidence 4678999887774431 Q ss_pred EEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccc--cceeeecccCCcceEEEC Q lcl|Aclame:pro 235 VARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKI--ANNLTWETVSGERVMTFD 312 (330) Q Consensus 235 v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~--~~~l~~~~~~g~~v~~~~ 312 (330) .+ .|.+.+.++|....+..+|+||++.+..++--..... 
.+.-...+..|++|..|+ T Consensus 162 -----------------~d----~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~ 220 (310) T protein:vir:97 162 -----------------FA----ILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYS 220 (310) T ss_pred -----------------HH----HHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeC Confidence 11 2222234455555566699999986555554333332 233344568899999999 Q ss_pred CeEEEEEeeccCCccccC Q lcl|Aclame:pro 313 GIPVQRTDALLNTESRVV 330 (330) Q Consensus 313 gvpir~~dal~~tE~~Vv 330 (330) ||||..||-|..+|..+= T Consensus 221 GiPi~~~d~ip~~~~~~~ 238 (310) T protein:vir:97 221 GTPIFRNDYIPTNQTKGG 238 (310) T ss_pred CeEEEEeCccCCCccccc Confidence 999999999998775533 No 9 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.29 E-value=1.2e-12 Score=85.96 Aligned_cols=230 Identities=11% Similarity=0.046 Sum_probs=155.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |++..++..++. ......|||.+.+.|+|++..+.+....+ ...+.+.++-|+++|..=++.+++++.++. 
T Consensus 1 mat~~~gg~lvP--------~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~ 71 (311) T protein:vir:81 1 MVALATGTFQLP--------KHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQKSESTATFA 71 (311) T ss_pred CceecCCceEcc--------hhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCcccccccceee Confidence 999877654433 33456799999999999999998876544 467888899999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGN-TAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~-~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|-+.+.+...+ ..++...-.....+++++.+...|+||+.+..+..+.|+-... ..+.++ T Consensus 72 ~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~-----~~~~~~ 146 (311) T protein:vir:81 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKI-----LDTTNI 146 (311) T ss_pred EEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccc-----ccccee Confidence 9999999999999999988876553 3345555556678999999999999997555444444442210 000000 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) +..+ T Consensus 147 ~~~~---------------------------------------------------------------------------- 150 (311) T protein:vir:81 147 VELT---------------------------------------------------------------------------- 150 (311) T ss_pred eeec---------------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 
NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) ..+ .....++++++. .++...+.....|.||.+...+|++ .++.....+......+..+-.+.|.||..+ T Consensus 151 ~~~------~~~~~~~i~~~~---~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~ 220 (311) T protein:vir:81 151 TGT------SATPDLAVEAAV---GLVLGDNLSPDGVALDNTFSFMLAT-QRDSQGRKLYPELGFGTDVASFAGLNAAVS 220 (311) T ss_pred ccc------cchHHHHHHHHH---HHhhhcCCCceEEEEcHHHHHHHHh-hhccCCCeeecCccccCCCceecceeEEec Confidence 000 001122232222 2222222111359999999999986 466666666655555666778999999999 Q ss_pred eeccCCcccc----------------C Q lcl|Aclame:pro 320 DALLNTESRV----------------V 330 (330) Q Consensus 320 dal~~tE~~V----------------v 330 (330) ++|......+ + T Consensus 221 ~~i~~~~~~~~~~~~~~~~~~~~~~~~ 247 (311) T protein:vir:81 221 DTVRGGPEAVTASTGVYRTTNPNVKAI 247 (311) T ss_pred ccccccccccccccchhcccCCccEEE Confidence 9997543211 1 No 10 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.24 E-value=1.9e-12 Score=84.78 Aligned_cols=238 Identities=15% Similarity=0.049 Sum_probs=150.0 Q ss_pred CCcccc--ccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MATLST--NNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~~~~--~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 78 (330) |+.--. 
...+...-+..+-+......||+.+.+.++|++..+.+...++ .+.+.+.++-|.+.|..=++.+++++.+ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~ 79 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPT-GISIPHWTGAVSASWTGEAERKPITKGS 79 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEcCCcceeEecCCCccccccce Confidence 554211 1111111122233344567799999999999999999876543 5889999999999999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-.++-+++.+.|.+.+.+... -++.+.-.....++++++++..||+||... ..+.|+.+-. .... T Consensus 80 f~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~--~~~~g~~~~~------~~~~ 149 (330) T protein:vir:77 80 FGKQELEPVKITTIFAESAEVVRLNP--LNYLNTMRTKIAEAIALKFDAAAIHGIDKP--SAFKGYLAET------TKVV 149 (330) T ss_pred eeEEEEeEEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCCC--Cccccccccc------cccc Confidence 99999999999999999998877643 233344445677999999999999998532 2333332200 0000 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) .. . ... 
T Consensus 150 ~~-----~-----------------------------------------~~~---------------------------- 155 (330) T protein:vir:77 150 SL-----A-----------------------------------------DTN---------------------------- 155 (330) T ss_pred ee-----e-----------------------------------------ccc---------------------------- Confidence 00 0 000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCc-----ceEEECC Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGE-----RVMTFDG 313 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~-----~v~~~~g 313 (330) .++ ..+...++++.+..++..+.........|+||++.+..|+. .++.....+-.....+. ....+.| T Consensus 156 -~~~-----~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~~~~~~l~G 228 (330) T protein:vir:77 156 -LTT-----ASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNT-AVDGNGRPLFVESTYTEQVGAIREGRILG 228 (330) T ss_pred -ccc-----cccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH-HhccCCceeecCccccccccccCCceecc Confidence 000 01111223333344444444445555689999999999995 45555555544333222 2347889 Q ss_pred eEEEEEeeccCCc----cccC Q lcl|Aclame:pro 314 IPVQRTDALLNTE----SRVV 330 (330) Q Consensus 314 vpir~~dal~~tE----~~Vv 330 (330) +||..++++.... 
..++ T Consensus 229 ~PV~~~~~~p~~~~~~~~~~~ 249 (330) T protein:vir:77 229 RPTYVADNVVNGTVGNRVVGV 249 (330) T ss_pred eeeEEeccccCCCCCCccEEE Confidence 9999999998632 1222 No 11 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.23 E-value=3e-12 Score=83.76 Aligned_cols=234 Identities=12% Similarity=0.013 Sum_probs=150.0 Q ss_pred CCccccccccHHHHHhh-------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccC Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKR-------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVL 73 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~-------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~ 73 (330) |+--....+.-...+.. +-+......||+.+.+.++|++.++.+...++ .+.+.+.++-|.++|..=++.++ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT-GQKIPHWIGDVSAQWIGEGDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEeCCcceEEecCCcccc Confidence 43311111122222111 22333567799999999999999999887543 57888888999999999999999 Q ss_pred cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccc Q lcl|Aclame:pro 74 PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLS 152 (330) Q Consensus 74 ~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t 152 (330) +++.++.+++-.++-+++.+.|.+.+.+... 
+..++ =.+...+++++.+.++|++|+-+..|..+.|+..- T Consensus 80 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~---i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~----- 151 (320) T protein:vir:10 80 ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGT---MRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKS----- 151 (320) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHH---HHHHHHHHHHHHHHHHhhcccCCCCCccccccccc----- Confidence 9999999999999999999999998877633 33333 33456699999999999999854433222221100 Q ss_pred cCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEecc Q lcl|Aclame:pro 153 AENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDW 232 (330) Q Consensus 153 ~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~ 232 (330) ..+... + T Consensus 152 ----~~~~~~-------------------------------------~-------------------------------- 158 (320) T protein:vir:10 152 ----VSLADP-------------------------------------G-------------------------------- 158 (320) T ss_pred ----ccceec-------------------------------------c-------------------------------- Confidence 000000 0 Q ss_pred ccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcce---- Q lcl|Aclame:pro 233 RYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERV---- 308 (330) Q Consensus 233 r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v---- 308 (330) .+ +.+...++.+.+..+...+++......+|+||++....|+. .+++....+-.....+... 
T Consensus 159 -------~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~~~~ 224 (320) T protein:vir:10 159 -------GA------TASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNG-AKDKNGRPLFIESTYTDENSPFR 224 (320) T ss_pred -------cc------cccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHH-hhccCCceeeccccccCcccccc Confidence 00 00001111122334445555566677899999999999984 5666555554433321111 Q ss_pred -EEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 309 -MTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 309 -~~~~gvpir~~dal~~tE~~Vv 330 (330) ..+.|+||..++++...+..++ T Consensus 225 ~~~i~g~pv~~~~~~~~~~~~~~ 247 (320) T protein:vir:10 225 AGRIVSRPTILSDHVADGTTVGY 247 (320) T ss_pred CceeeeeeeEecCCCCCCceEEE Confidence 3578999999999987766544 No 12 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.22 E-value=5.8e-12 Score=82.20 Aligned_cols=233 Identities=13% Similarity=0.063 Sum_probs=151.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |+++.+..- -+-+......|||.+.+.++|++..+.+..+.+ ..++.+.++-|.++|..=++.+++++.++. 
T Consensus 1 Mat~tt~~g-------~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~ 72 (311) T protein:vir:99 1 MATFGTGNL-------KNLPRNIADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAEFVGEGQQKSSTTGEFD 72 (311) T ss_pred CceecCCCc-------eeccHHHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEEeecCcccccccceee Confidence 998755421 122334556799999999999998888776543 468889999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGN-TAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~-~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|-+.|.+...+ ..++...-.....+++++++.+++|||+....+..+.|+..... ...+. T Consensus 73 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~-----~~~~~ 147 (311) T protein:vir:99 73 FVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLG-----AASKR 147 (311) T ss_pred EEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccc-----cccce Confidence 9999999999999999999876543 33455555566789999999999999986555555544422110 00000 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) +..++++ . . 
T Consensus 148 ~~~~~~~-------------------------------------~----------------------------------~ 156 (311) T protein:vir:99 148 VELTADT-------------------------------------I----------------------------------A 156 (311) T ss_pred eeccccc-------------------------------------c----------------------------------c Confidence 0000000 0 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) .. +....+++.+...+.. ......|.||.+....|+. .++.....+-.....+..+-.+.|+|+..+ T Consensus 157 ~~-------~~~i~~~~~~~~~~~~-----~~~~~~~vmn~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s 223 (311) T protein:vir:99 157 NP-------DLAIEAAVGLLVANGH-----PTPVNGLALHPSIAWGLST-ARYTDGRKKFPELGLGIGVSSFEGIDASVS 223 (311) T ss_pred hh-------HHHHHHHHHHHhhhcc-----CCCccEEEEcHHHHHHHHh-hhccCCCeeecCcccCCCCceecceeeEee Confidence 00 0112223222222111 1112359999999999985 566665666555455555668999999999 Q ss_pred eeccCCcc------ccC Q lcl|Aclame:pro 320 DALLNTES------RVV 330 (330) Q Consensus 320 dal~~tE~------~Vv 330 (330) +++..+-. 
.++ T Consensus 224 ~~i~~~~~~~~~~~~~~ 240 (311) T protein:vir:99 224 DTVNGGDEADPDDEDLD 240 (311) T ss_pred cccccccccccccchhh Confidence 98863211 111 No 13 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.20 E-value=1.3e-12 Score=85.80 Aligned_cols=229 Identities=14% Similarity=0.057 Sum_probs=144.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t~ 79 (330) ........-+-...+..+-+......||+.+.+.++|++.++.....++ .+.|.++++ -|.+.|..=++.++++..++ T Consensus 107 ~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 185 (390) T protein:vir:97 107 KAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVAEGALKPESSLKF 185 (390) T ss_pred HHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCC-ceEEEEEecCCcceeeecCCccccccccce Confidence 0000000000011111122334567899999999999999999887644 467787766 47899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) .+++-..+-+++.+.|.+.+.+...+ +...-.....+++++++.+.||+||...+ .+.||-. 
T Consensus 186 ~~i~~~~~k~~~~~~is~ell~ds~~---l~~~i~~~la~a~~~~~d~a~l~G~g~~~--~p~Gi~~------------- 247 (390) T protein:vir:97 186 AKKTDTTHVIAHTMKATRQILSDAPQ---LASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIP------------- 247 (390) T ss_pred eEEEEeeeeEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHhhcCCCCc--cccceee------------- Confidence 99999999999999999998875443 44444556789999999999999963321 2333310 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) .++.+. T Consensus 248 -~~~~~~------------------------------------------------------------------------- 253 (390) T protein:vir:97 248 -QATTYA------------------------------------------------------------------------- 253 (390) T ss_pred -cccccc------------------------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) . . ......+..+.+..++..+........+|+||++....|+. .++.....+-.+...+ ....+.|+||..+ T Consensus 254 -~-~----~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~-~~~~l~G~pV~~~ 325 (390) T protein:vir:97 254 -A-P----TTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL-AKDANNQYLIGNARGT-LTPTLWGLPVVAT 325 (390) T ss_pred -c-c----ccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH-hhcCCCceeecCccCC-CCceecceeeEEc Confidence 0 0 00011111222333344444444445689999999999995 4666555554443332 2347899999999 Q ss_pred eeccCCccccC Q lcl|Aclame:pro 320 DALLNTESRVV 330 (330) Q Consensus 320 dal~~tE~~Vv 330 (330) |++..++..+. 
T Consensus 326 ~~~~~~~~~~g 336 (390) T protein:vir:97 326 QAMAPGEFLVG 336 (390) T ss_pred CCCCCCcEEEE Confidence 99987653322 No 14 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.20 E-value=2.6e-12 Score=84.14 Aligned_cols=231 Identities=13% Similarity=0.026 Sum_probs=147.1 Q ss_pred CCccccccccHHH-HHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MATLSTNNPTMAD-VAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E-~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t 78 (330) +..+...+.|... .+-.+-+......||+.+.+.++|++.++.....++ .+.+.+.++ -|.++|..=++..++++.+ T Consensus 106 ~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 184 (395) T protein:vir:43 106 RVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESN-SVEYVRETGFVNNAAPVSEGTQKPYSDLT 184 (395) T ss_pred hhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCC-ceEEEEEecCCCceeeecCCccccccccc Confidence 1111111111000 011122333556799999999999999999877543 577888766 4789999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-.++-+++.+.|.+.+.+-.++ +...-.....++++..+..+||+|+...+| +.||-. 
T Consensus 185 ~~~i~~~~~k~~~~~~is~ell~d~~~---l~~~v~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~------------ 247 (395) T protein:vir:43 185 FELENAPVRTIAHLFKASRQILDDASA---LQSYIDARARYGLMLVEECQLLYGNGTGAN--LHGIIP------------ 247 (395) T ss_pred eeEEEEeeeeEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccc------------ Confidence 999999999999999999998875443 334444567799999999999999743321 333311 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) . .+. T Consensus 248 -----------------------------~---~~~-------------------------------------------- 251 (395) T protein:vir:43 248 -----------------------------Q---AQA-------------------------------------------- 251 (395) T ss_pred -----------------------------c---ccc-------------------------------------------- Confidence 0 000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .+...+ ......+.++.+..++..++.......+|+||++....|+. .++.....+-.+...+. ...+.|+||.. T Consensus 252 ~~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~i~~~~~~~~-~~~l~G~pVv~ 326 (395) T protein:vir:43 252 YAPPSG---VVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIEL-NKDAENRYIIGSPQNGT-TPTLWRLPVVE 326 (395) T ss_pred cccccc---cccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHH-hhccCCceeccccccCC-CceecceeeEE Confidence 000000 00111223444445556665555556789999999999985 45665555654433333 33578999999 Q ss_pred EeeccCCccccC Q lcl|Aclame:pro 319 TDALLNTESRVV 330 (330) Q Consensus 319 ~dal~~tE~~Vv 330 (330) +|.+..++..+. 
T Consensus 327 ~~~~~~~~~~~g 338 (395) T protein:vir:43 327 TQAITQDEFLTG 338 (395) T ss_pred cCCCCCCcEEEE Confidence 999987653322 No 15 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.19 E-value=2.5e-12 Score=84.23 Aligned_cols=225 Identities=14% Similarity=0.104 Sum_probs=144.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) +.+......+..+.+..+-+......|++.+.+.++|++.++.+...++ .++|.++++.|.+.|.+=++..++++.++. T Consensus 21 ~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~ 99 (324) T protein:vir:96 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred hhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeeecCCcccccccccee Confidence 2222111112122222234555677899999999999999999887644 578999999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++-..+-+++.+.|.+.+.+... .++...-.....++++..+...+|+|+.+.. + +. 
T Consensus 100 ~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~---~--------------~~--- 157 (324) T protein:vir:96 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP---F--------------GK--- 157 (324) T ss_pred EEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCC---c--------------Cc--- Confidence 999999999999999998877642 1233333455679999999999999973221 0 00 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) ++... .+ .. + + T Consensus 158 -----------------------~~~~~-~~-------~~---------~-----------------------------~ 168 (324) T protein:vir:96 158 -----------------------SIAQS-IK-------KT---------N-----------------------------K 168 (324) T ss_pred -----------------------ccccc-cc-------cc---------c-----------------------------e Confidence 00000 00 00 0 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) ... +...-++|++ ++..|+........|+||++.+..|+. .++.....+-. +...-.+.|+||..+. 
T Consensus 169 ~~~----~~~~~~~i~~----~~~~i~~~~~~~~~~i~n~~~~~~L~~-lkd~~G~~~~~----~~~~~~l~G~PV~~~~ 235 (324) T protein:vir:96 169 VIK----GDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRK-IVDPETKERIY----DRNSDSLDGLPVVNLK 235 (324) T ss_pred ecc----cccchHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHH-hhCCCCCeeec----CCCCCcccceeeEeec Confidence 000 0001123332 334444333444579999999999995 45554443322 2233458999999988 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) +...++..++ T Consensus 236 ~~~~~~~~~~ 245 (324) T protein:vir:96 236 SSNLKRGELI 245 (324) T ss_pred CCCCCcceEE Confidence 8877776666 No 16 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.19 E-value=6.5e-12 Score=81.91 Aligned_cols=229 Identities=14% Similarity=0.017 Sum_probs=149.5 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |++= .+. + -+......|||.+.+.|+|++.++.+...++ .+++.+.++-|.++|..=++.++++..++. 
T Consensus 1 ma~~-gG~-l--------ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~ 69 (298) T protein:vir:94 1 MVLN-KGT-L--------FDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLA 69 (298) T ss_pred Ceec-ccc-c--------cChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccccccee Confidence 7762 222 2 2233455799999999999999998876543 578889999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|.+.+.+... +..++...-.....++++..+..+++||....+ T Consensus 70 ~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~---------------------- 127 (298) T protein:vir:94 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL---------------------- 127 (298) T ss_pred EEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC---------------------- Confidence 999999999999999998876544 445566656677889999999999999931100 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |+.... .++ . . ..+. .. 
T Consensus 128 ----g~~~~~-------------~~~-----~----~----------~~~~---------------------------~~ 144 (298) T protein:vir:94 128 ----GTASAV-------------IGT-----N----H----------FDSK---------------------------VT 144 (298) T ss_pred ----Cccccc-------------ccc-----c----c----------cccc---------------------------cc Confidence 111000 000 0 0 0000 00 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) |+... .+...++.+.+..++..++.......+|+||.+....|++ .++.....+-.+...+..+-.+.|+||..+ T Consensus 145 ~~~~~----~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~ 219 (298) T protein:vir:94 145 QKVEA----PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK-QKDLQGNALFPELKWGATPDTINGLPVDVN 219 (298) T ss_pred ccccc----ccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH-hhccCCCeeecCcccCCCCceecceeeEEe Confidence 00000 0111122333444455554434444579999999999986 456655556555444555567899999999 Q ss_pred eeccC--C--ccccC Q lcl|Aclame:pro 320 DALLN--T--ESRVV 330 (330) Q Consensus 320 dal~~--t--E~~Vv 330 (330) +++.. 
+ +..++ T Consensus 220 ~~v~~~~~~~~~~~~ 234 (298) T protein:vir:94 220 KTVSDMSLTQRDRAI 234 (298) T ss_pred cccccccCCCccEEE Confidence 99874 1 22222 No 17 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.18 E-value=3.7e-12 Score=83.26 Aligned_cols=229 Identities=14% Similarity=0.128 Sum_probs=146.3 Q ss_pred CCccc--cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MATLS--TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~~~--~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 78 (330) |+.=. ....|..+...-+-+......||+.+.+.++|++..+.+...++ .+.+.+.++-+.+.|..=|+.+++++.+ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQ-KKKFTYLAKGVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEeCCcceEEeecCcccccccce Confidence 66522 12233333333345566777899999999999999988876543 4778889999999999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....++|||-...|.. .. 
T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~---~~------------- 141 (304) T protein:vir:10 80 YAQAEMEAKKIGVIIPLSKEFLKWTA--KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTS---TS------------- 141 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhheeccCCCcccc---cc------------- Confidence 99999999999999999998877643 1233333345679999999999999974432111 00 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) . .++++...+ T Consensus 142 -------~----------------~~~~~~~~~----------------------------------------------- 151 (304) T protein:vir:10 142 -------G----------------KPLVEGAEE----------------------------------------------- 151 (304) T ss_pred -------c----------------ccccccccc----------------------------------------------- Confidence 0 000000000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .+.... .+....++|+ .++..++........|+||++.+..|+. .++.....+- .. .+..+.|+||.. T Consensus 152 ~~~~~~--~~~~~~~~i~----~~~~~l~~~~~~~~~~v~~~~~~~~L~~-lkd~~G~~l~-~~----~~~~l~G~PV~~ 219 (304) T protein:vir:10 152 KGNVVT--DTNNLYVDLS----ALMATIEDEELDPNGVLTTRSFRSKMRN-ALDANDRPLF-DA----NGNEIMGLPLSY 219 (304) T ss_pred cccccc--cccchHHHHH----HHHHHhhhccCCcCEEEEcHHHHHHHHH-hhccCCcEee-cC----CCccccceeeEE Confidence 000000 0111123343 3445554444455689999999999985 4554433332 21 224579999999 Q ss_pred EeeccCC--ccccC Q lcl|Aclame:pro 319 TDALLNT--ESRVV 330 (330) Q Consensus 319 ~dal~~t--E~~Vv 330 (330) ++++... 
+..++ T Consensus 220 ~~~~~~~~~~~~~~ 233 (304) T protein:vir:10 220 TGADVYDKKKSLAL 233 (304) T ss_pred ecccccCCCCcEEE Confidence 9999752 22222 No 18 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.18 E-value=3.7e-12 Score=83.26 Aligned_cols=229 Identities=14% Similarity=0.128 Sum_probs=146.3 Q ss_pred CCccc--cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MATLS--TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~~~--~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 78 (330) |+.=. ....|..+...-+-+......||+.+.+.++|++..+.+...++ .+.+.+.++-+.+.|..=|+.+++++.+ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQ-KKKFTYLAKGVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEeCCcceEEeecCcccccccce Confidence 66522 12233333333345566777899999999999999988876543 4778889999999999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....++|||-...|.. .. 
T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~---~~------------- 141 (304) T protein:vir:94 80 YAQAEMEAKKIGVIIPLSKEFLKWTA--KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTS---TS------------- 141 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhheeccCCCcccc---cc------------- Confidence 99999999999999999998877643 1233333345679999999999999974432111 00 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) . .++++...+ T Consensus 142 -------~----------------~~~~~~~~~----------------------------------------------- 151 (304) T protein:vir:94 142 -------G----------------KPLVEGAEE----------------------------------------------- 151 (304) T ss_pred -------c----------------ccccccccc----------------------------------------------- Confidence 0 000000000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .+.... .+....++|+ .++..++........|+||++.+..|+. .++.....+- .. .+..+.|+||.. T Consensus 152 ~~~~~~--~~~~~~~~i~----~~~~~l~~~~~~~~~~v~~~~~~~~L~~-lkd~~G~~l~-~~----~~~~l~G~PV~~ 219 (304) T protein:vir:94 152 KGNVVT--DTNNLYVDLS----ALMATIEDEELDPNGVLTTRSFRSKMRN-ALDANDRPLF-DA----NGNEIMGLPLSY 219 (304) T ss_pred cccccc--cccchHHHHH----HHHHHhhhccCCcCEEEEcHHHHHHHHH-hhccCCcEee-cC----CCccccceeeEE Confidence 000000 0111123343 3445554444455689999999999985 4554433332 21 224579999999 Q ss_pred EeeccCC--ccccC Q lcl|Aclame:pro 319 TDALLNT--ESRVV 330 (330) Q Consensus 319 ~dal~~t--E~~Vv 330 (330) ++++... 
+..++ T Consensus 220 ~~~~~~~~~~~~~~ 233 (304) T protein:vir:94 220 TGADVYDKKKSLAL 233 (304) T ss_pred ecccccCCCCcEEE Confidence 9999752 22222 No 19 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.18 E-value=9.6e-13 Score=86.48 Aligned_cols=229 Identities=11% Similarity=0.014 Sum_probs=144.5 Q ss_pred CCcc--ccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MATL--STNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 M~~~--~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~ 77 (330) +... .....+..+.+-.+-+......||+.+.+.++|++.++.....++ .+.|.+.++ -+.+.|..=++.+++++. T Consensus 96 ~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 174 (385) T protein:vir:18 96 FGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSN-ALEYVREEVFTNNADVVAEKALKPESDI 174 (385) T ss_pred chhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCc-ceEEEEEecCCcceeeeccCcccccccc Confidence 0000 000000011111122333567799999999999999999876543 577888765 578899999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) ++.+++-..+-+++.+.|.+.+.+-.++ +...=.....++++..+...|++||...+| +.||.. 
T Consensus 175 ~~~~~~~~~~k~~~~~~is~ell~d~~~---l~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~----------- 238 (385) T protein:vir:18 175 TFSKQTANVKTIAHWVQASRQVMDDAPM---LQSYINNRLMYGLALKEEGQLLNGDGTGDN--LEGLNK----------- 238 (385) T ss_pred ceeEEEEeeeeEEEeehhhHHHHhhHHH---HHHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccc----------- Confidence 9999999999999999999998775433 334445667899999999999999744332 333311 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) .+++ T Consensus 239 ---~~~~------------------------------------------------------------------------- 242 (385) T protein:vir:18 239 ---VATA------------------------------------------------------------------------- 242 (385) T ss_pred ---cccc------------------------------------------------------------------------- Confidence 0000 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ..+... .......++|+ .++..|+.......+|+||++....|+. .++.....+-.... +.....+.|+||. T Consensus 243 -~~~~~~-~~~~~~~d~i~----~~~~~l~~~~~~~~~~~~~~~~~~~l~~-lkd~~G~~l~~~~~-~~~~~~l~G~pV~ 314 (385) T protein:vir:18 243 -YDTSLN-ATGDTRADIIA----HAIYQVTESEFSASGIVLNPRDWHNIAL-LKDNEGRYIFGGPQ-AFTSNIMWGLPVV 314 (385) T ss_pred -cccccc-ccccchHHHHH----HHHHhhccccCCCCEEEEcHHHHHHHHH-hhcCCCceeccCcc-cCCCceecceeeE Confidence 000000 00011223344 3445555555556789999999999986 45555555543332 2334567999999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) .++.+..++..+. 
T Consensus 315 ~~~~~p~~~~~~g 327 (385) T protein:vir:18 315 PTKAQAAGTFTVG 327 (385) T ss_pred EcCcCCCCcEEEe Confidence 9999986643222 No 20 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.18 E-value=9.6e-13 Score=86.48 Aligned_cols=229 Identities=11% Similarity=0.014 Sum_probs=144.5 Q ss_pred CCcc--ccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MATL--STNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 M~~~--~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~ 77 (330) +... .....+..+.+-.+-+......||+.+.+.++|++.++.....++ .+.|.+.++ -+.+.|..=++.+++++. T Consensus 96 ~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 174 (385) T protein:vir:19 96 FGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSN-ALEYVREEVFTNNADVVAEKALKPESDI 174 (385) T ss_pred chhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCc-ceEEEEEecCCcceeeeccCcccccccc Confidence 0000 000000011111122333567799999999999999999876543 577888765 578899999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) ++.+++-..+-+++.+.|.+.+.+-.++ +...=.....++++..+...|++||...+| +.||.. 
T Consensus 175 ~~~~~~~~~~k~~~~~~is~ell~d~~~---l~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~----------- 238 (385) T protein:vir:19 175 TFSKQTANVKTIAHWVQASRQVMDDAPM---LQSYINNRLMYGLALKEEGQLLNGDGTGDN--LEGLNK----------- 238 (385) T ss_pred ceeEEEEeeeeEEEeehhhHHHHhhHHH---HHHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccc----------- Confidence 9999999999999999999998775433 334445667899999999999999744332 333311 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) .+++ T Consensus 239 ---~~~~------------------------------------------------------------------------- 242 (385) T protein:vir:19 239 ---VATA------------------------------------------------------------------------- 242 (385) T ss_pred ---cccc------------------------------------------------------------------------- Confidence 0000 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ..+... .......++|+ .++..|+.......+|+||++....|+. .++.....+-.... +.....+.|+||. T Consensus 243 -~~~~~~-~~~~~~~d~i~----~~~~~l~~~~~~~~~~~~~~~~~~~l~~-lkd~~G~~l~~~~~-~~~~~~l~G~pV~ 314 (385) T protein:vir:19 243 -YDTSLN-ATGDTRADIIA----HAIYQVTESEFSASGIVLNPRDWHNIAL-LKDNEGRYIFGGPQ-AFTSNIMWGLPVV 314 (385) T ss_pred -cccccc-ccccchHHHHH----HHHHhhccccCCCCEEEEcHHHHHHHHH-hhcCCCceeccCcc-cCCCceecceeeE Confidence 000000 00011223344 3445555555556789999999999986 45555555543332 2334567999999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) .++.+..++..+. 
T Consensus 315 ~~~~~p~~~~~~g 327 (385) T protein:vir:19 315 PTKAQAAGTFTVG 327 (385) T ss_pred EcCcCCCCcEEEe Confidence 9999986643222 No 21 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.12 E-value=1.7e-11 Score=79.64 Aligned_cols=238 Identities=16% Similarity=0.184 Sum_probs=147.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCccc-ceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK-SST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~-~t~ 79 (330) ...+..+ |-.+ ..-+-|......||+.+.+.++|++..+++..+++ .+.+.+.++-|.+.|..=++.++++. .++ T Consensus 127 ~~al~~~--t~~~-gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f 202 (425) T protein:vir:10 127 QAALNKG--EDSE-GGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKA-GFSKLFNMGGTTSGWVGEASQRPQTNAATF 202 (425) T ss_pred HHHhhcC--cCCC-CceeccHhHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEcCCcceeeecccccccccccccc Confidence 0000000 0000 00022344556799999999999999998877644 57888899999999999998888876 589 Q ss_pred EEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) .+++-.++-+++.+.|.+.+.+.. .+..++ -.....++++..+...|+|||-...|.+ +-.. 
.+ T Consensus 203 ~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~---i~~~la~ai~~~~d~~~l~G~G~~~p~G---il~~---~~------ 267 (425) T protein:vir:10 203 QPLSFASGEIYANPAATQQILDDAEIDLESW---LATEVQTEFAKQEGKAFLAGDGTNKPNG---LLTY---IA------ 267 (425) T ss_pred ceeeeeheeeEeehHhHHHHHhcchhHHHHH---HHHHHHHHHHHHHHhhhhcccCCCCcce---eeec---cc------ Confidence 999999999999999999998874 344444 3355679999999999999986544443 3110 00 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +++... ....+.++. T Consensus 268 ----~~~~~~-----------~~~~~~~~~-------------------------------------------------- 282 (425) T protein:vir:10 268 ----GGANAA-----------KHPFGAIEV-------------------------------------------------- 282 (425) T ss_pred ----cccccc-----------ccccccccc-------------------------------------------------- Confidence 000000 000000000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .+.. -.+....++|++++ +.++....++.+|+||++...+|+. .+++...++-.........-.+.|.||.. T Consensus 283 ~~~~---~~~~~~~d~l~~l~----~~l~~~~~~~a~~vmn~~~~~~L~~-lkD~~G~~l~~~~~~~g~~~~l~G~PV~~ 354 (425) T protein:vir:10 283 VNSG---AAADITSDGIIDLV----YDLPSAFTGNARFAMNRNTQRQVRK-LKDGQGNYLWQPSYVAGQPATLAGYPVTE 354 (425) T ss_pred cccc---ccccccHHHHHHHH----hhhhhhhccCCEEEEchHHHHHHHH-hhcCCCceeeccCccCCCCceecceeeEE Confidence 0000 00011234555543 3444445567789999999999995 56766555544443333345788999999 Q ss_pred EeeccCCcc--c-cC Q lcl|Aclame:pro 319 TDALLNTES--R-VV 330 (330) Q Consensus 319 ~dal~~tE~--~-Vv 330 (330) +|++....+ . 
|+ T Consensus 355 ~~~~p~~~~~~~~i~ 369 (425) T protein:vir:10 355 VPDMPDVAANSTPIL 369 (425) T ss_pred ecCcCCccCCccEEE Confidence 999885332 2 22 No 22 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.11 E-value=8.4e-12 Score=81.31 Aligned_cols=225 Identities=11% Similarity=0.054 Sum_probs=144.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) +.++.....+..+.+..+-+......||+.+.+.++|++.++.+..+++ .+.+.+.++-|.+.|.+-++.+++++.++. T Consensus 21 ~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:99 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred hhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeEeccCcccccccccee Confidence 2222222222223222244555677899999999999999999887654 477888999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++-.++-+++.+.|-+.+.+... .++...-.....+++++.+...+|+|+.... 
...| T Consensus 100 ~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~--~~~~----------------- 158 (324) T protein:vir:99 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKS----------------- 158 (324) T ss_pred EEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCcc----------------- Confidence 999999999999999998887653 2333334455679999999999999963211 0000 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) ++. +.+.+ . . T Consensus 159 ------------------------~~~-~~~~~-------~-~------------------------------------- 168 (324) T protein:vir:99 159 ------------------------IAQ-SIEKT-------N-K------------------------------------- 168 (324) T ss_pred ------------------------ccc-ccccc-------c-e------------------------------------- Confidence 000 00000 0 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) .. .+...-++|++ ++..|+........|+||++.+..|+.. .+.....+-. 
+...-.+.|+||..++ T Consensus 169 ~~----~~~~~~~~i~~----~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~g~~~~~----~~~~~~l~G~PVv~~~ 235 (324) T protein:vir:99 169 VI----KGDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY----DRNSDTLDGLPVVNLK 235 (324) T ss_pred ec----cccCCHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHHh-hcCCCceeec----CCCCccccceeEEeec Confidence 00 00001233333 3344444344445799999999999853 4444333322 2222358999999999 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) +...+...++ T Consensus 236 ~~~~~~~~~i 245 (324) T protein:vir:99 236 SSNLKRGELI 245 (324) T ss_pred CCCCCcceEE Confidence 8877766666 No 23 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.11 E-value=8.9e-12 Score=81.18 Aligned_cols=225 Identities=12% Similarity=0.065 Sum_probs=143.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) +.++.....+..+-+..+-+......||+.+.+.++|++..+.+...++ .+.+.+.++.|.+.|.+-++..++++.++. 
T Consensus 21 ~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:10 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred cceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCcceeEeccCcccccccccee Confidence 2222111122222222244555677899999999999999999887644 478888999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++-..+-+++.+.|.+.+.+... .++...-.....+++++.+...+|+|+.... ...|+ . T Consensus 100 ~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~--~~~~i---------------~ 160 (324) T protein:vir:10 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSI---------------A 160 (324) T ss_pred EEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCccc---------------c Confidence 999999999999999998887643 2333333455679999999999999963311 00000 0 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) ++.+ .+ + . 
T Consensus 161 ~~~~----------------------------------~~---------~-----------------------------~ 168 (324) T protein:vir:10 161 QSIE----------------------------------KT---------N-----------------------------K 168 (324) T ss_pred cccc----------------------------------cc---------c-----------------------------e Confidence 0000 00 0 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) ... +....++|++ ++..|+.......+|+||++....|+.. .+.....+-.. ...-.+.|+||..++ T Consensus 169 ~~~----~~~t~~~i~~----~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~g~~~~~~----~~~~~l~G~PV~~~~ 235 (324) T protein:vir:10 169 VIK----GDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIYD----RNSDTLDGLPVVNLK 235 (324) T ss_pred ecc----ccCCHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHHh-hccCCceeecC----CCCccccceeEEeec Confidence 000 0011233333 2334443334445799999999999853 44433333222 222358999999998 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) +...++..++ T Consensus 236 ~~~~~~~~~~ 245 (324) T protein:vir:10 236 SSNLKRGELI 245 (324) T ss_pred CCCCCcceEE Confidence 8777776666 No 24 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.11 E-value=7.1e-12 Score=81.71 Aligned_cols=229 Identities=14% Similarity=0.104 Sum_probs=142.0 Q ss_pred CCc--cccccccHHHHHhhcCcccchHHHH-HHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MAT--LSTNNPTMADVAKRLDPNGKVDIIV-EMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 M~~--~~~~a~TL~E~Ak~~~~d~~~~~VI-E~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 77 (330) ++. ......|..+-. 
.+-+......|| +.+.+.++|....++...+ + .+.+.+.++-|.+.|..=++.++.++. T Consensus 243 ~~~~~~~~~~~t~~~gg-~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~-g-~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 319 (543) T protein:vir:81 243 RAINEVRAMGLTKADGG-YLVPFQLDPTVIITSNGSLNDIRRFARQVVAT-G-DVWHGVSSAAVQWSWDAEFEEVSDDSP 319 (543) T ss_pred hhhhhhhhcccccccCc-ccCchhhhhHHHHHHHhhhchhhhhcccccCC-c-ceEEEEecCCcceeecccCcccccccc Confidence 000 000111222211 122334455555 6567778888887776553 3 466778899999999999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) ++.+++-..+-+++.+.|.+.+.+... ++...-.....++++..+...|||||.. +..+.|+.... T Consensus 320 ~~~~i~~~~~k~~~~~~is~ell~d~~---~~~~~i~~~l~~~~~~~~d~ail~G~Gt--~~~p~Gi~~~~--------- 385 (543) T protein:vir:81 320 EFGQPEIPVKKAQGFVPISIEALQDEA---NVTETVALLFAEGKDELEAVTLTTGTGQ--GNQPTGIVTAL--------- 385 (543) T ss_pred ccceeeeeeeeeEeeehhhHHHHhccH---HHHHHHHHHHHHHHHHHHHHHHhccCCC--Ccccccchhhc--------- Confidence 999999999999999999999887543 5556666777899999999999999743 23566653210 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) +++... + . .+. 
T Consensus 386 -----~~~~~~------~-----------~--------------------~~~--------------------------- 396 (543) T protein:vir:81 386 -----AGTAAE------I-----------A--------------------PVT--------------------------- 396 (543) T ss_pred -----cccccc------c-----------c--------------------ccc--------------------------- Confidence 000000 0 0 000 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) - +....+++++ ++..++.....+.+|+||++....|+. ..+.....+-.....| ....+.|.||. T Consensus 397 ---~------~~~~~~~~~~----~~~~l~~~~~~~~~~v~n~~~~~~l~~-lkd~~G~~l~~~~~~g-~~~~l~G~pv~ 461 (543) T protein:vir:81 397 ---A------ETFALADVYA----VYEQLAARHRRQGAWLANNLIYNKIRQ-FDTQGGAGLWTTIGNG-EPSQLLGRPVG 461 (543) T ss_pred ---c------ccccHHHHHH----HHHhhhccccCCcEEEEcHHHHHHHHH-hhcCCCceeccCcCCC-CCccccceeeE Confidence 0 0001223333 334455455566789999999999995 4555444443332222 34468999999 Q ss_pred EEeeccCCc-------c-ccC Q lcl|Aclame:pro 318 RTDALLNTE-------S-RVV 330 (330) Q Consensus 318 ~~dal~~tE-------~-~Vv 330 (330) .+|++.... . 
.|+ T Consensus 462 ~~~~~~~~~~~~~~~~~~~i~ 482 (543) T protein:vir:81 462 EAEAMDANWNTSASADNFVLL 482 (543) T ss_pred EeccccccccccccCCcceEE Confidence 999986432 1 122 No 25 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.09 E-value=1.3e-11 Score=80.27 Aligned_cols=224 Identities=11% Similarity=0.050 Sum_probs=144.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) +..+.....|..+.+..+-+......||+.+.+.++|++.++.+...++ ..++.+.++.|.++|.+=++..++++.++. T Consensus 21 ~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~ 99 (324) T protein:vir:93 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred hhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeeecCCcccccccccee Confidence 2222211223233333345666778899999999999999998876543 477889999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|-+.+.+.. .+..++ -.....+++++.+..++|+|+.+.. 
...|+ T Consensus 100 ~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~aia~~~d~a~l~G~g~~~--~~~~~--------------- 159 (324) T protein:vir:93 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEE---MKPMIAEAFYKKFDEAGILNQGNNP--FGKSI--------------- 159 (324) T ss_pred EEEEEeEEEEEeehhhHHHHhcchHHHHHH---HHHHHHHHHHHHHHHHHhcCCCCCC--cCccc--------------- Confidence 99999999999999998877764 233333 3344568999999999999963211 00000 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) .+..+.+ + T Consensus 160 ~~~~~~~-------------------------------------------~----------------------------- 167 (324) T protein:vir:93 160 AQSIEKT-------------------------------------------N----------------------------- 167 (324) T ss_pred ccccccc-------------------------------------------c----------------------------- Confidence 0000000 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) ... . +....++|++ ++..|+........|+||++.+..|+.. ++.....+-. 
+.....+.|+||..+ T Consensus 168 ~~~-~---~~~~~~~i~~----~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~G~~~~~----~~~~~~l~G~PVv~~ 234 (324) T protein:vir:93 168 KVI-K---GDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY----DRNSDSLDGLPVVNL 234 (324) T ss_pred eec-c---ccccHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHHh-hCCCCCeeec----CCCCCcccceeeEee Confidence 000 0 0011223333 3344444344445899999999999854 5544333322 223346899999998 Q ss_pred eeccCCccccC Q lcl|Aclame:pro 320 DALLNTESRVV 330 (330) Q Consensus 320 dal~~tE~~Vv 330 (330) .+...++..++ T Consensus 235 ~~~~~~~~~i~ 245 (324) T protein:vir:93 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 88776666555 No 26 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.09 E-value=7.4e-12 Score=81.61 Aligned_cols=229 Identities=15% Similarity=0.061 Sum_probs=141.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEecc-CCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGL-PTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~l-P~~~fR~lN~g~~~s~~t~ 79 (330) ++.......+....+-.+-+......||+.+.+.++|++.+++....++ .+.+.+.++- +.+.|..=++..+++..++ T Consensus 107 ~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 185 (390) T protein:vir:10 107 KAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVAEGALKPESSLKF 185 (390) T ss_pred HHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCCcceeeecCCccccccccce Confidence 0000000000000001112223456799999999999999999877544 4677777765 6899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 
~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) .+++-..+-+++.+.|.+.+.+..++ +-..-.....++++....+.||+||... +.+.||-.- T Consensus 186 ~~i~~~~~k~~~~~~is~ell~d~~~---l~~~i~~~l~~~~~~~~~~~il~G~G~~--~~p~Gi~~~------------ 248 (390) T protein:vir:10 186 AKKTDTTHVIAHTMKATRQILSDAPQ---LASYMNNRLIRGLKVKEDAEILRGTGAN--DGLLGLIPQ------------ 248 (390) T ss_pred eEEEEeeEEEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHhhcCCCC--ccccccccc------------ Confidence 99999999999999999998776543 3333445567899999999999997432 234444110 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) ++.++ T Consensus 249 --~~~~~------------------------------------------------------------------------- 253 (390) T protein:vir:10 249 --ATTYA------------------------------------------------------------------------- 253 (390) T ss_pred --ccccc------------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) + .+........+++++ ++..+........+|+||++....|+. .++....++-.....+. 
.-.+.|+||..+ T Consensus 254 -~-~~~~~~~~~~~~~~~----~~~~l~~~~~~~~~~v~n~~~~~~L~~-lkd~~g~~l~~~~~~~~-~~~l~G~pv~~~ 325 (390) T protein:vir:10 254 -A-PTTIAGATRVDQLRL----AMLQASLAEYPASGIVINPIDWAAIEL-AKDANNQYLIGNARGTL-TPTLWGLPVVAT 325 (390) T ss_pred -c-cccccccchHHHHHH----HHHhhccccCCCCEEEEcHHHHHHHHH-hhcCCCceeecCCcCcC-CceecceeeEEc Confidence 0 000001111233333 334443344445689999999999985 45655555544433332 235899999999 Q ss_pred eeccCCccccC Q lcl|Aclame:pro 320 DALLNTESRVV 330 (330) Q Consensus 320 dal~~tE~~Vv 330 (330) +.+..+...+. T Consensus 326 ~~~p~~~~~~g 336 (390) T protein:vir:10 326 QAMAPGEFLVG 336 (390) T ss_pred CCCCCCcEEEE Confidence 99986543222 No 27 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.09 E-value=1.1e-11 Score=80.65 Aligned_cols=226 Identities=12% Similarity=0.047 Sum_probs=144.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t~ 79 (330) +...... |... 
+..+-+......||+.+.+.++|++.+++....++ .+.+.++++ -|.++|..=++.++.++.++ T Consensus 132 ~~~~~~~--~~~~-~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f 207 (418) T protein:vir:10 132 VPATVGS--GVSG-SNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSS-SIEYTVETGFTNNAAAVAEGAQKPTSDLKF 207 (418) T ss_pred hhhhccC--CCCC-CccccchhHHHHHHHHHhhhhhHHhhcceeeccCC-ceeEEEEecCCCceeeeccCccccccccce Confidence 0000000 0001 11122334556799999999999999999877544 456777766 58899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) .+++-.++-+++.+.|.+.+.+..++ +...-.....+++++.+...||+||...+ .+.||..- T Consensus 208 ~~v~~~~~k~~~~~~is~ell~ds~~---l~~~i~~~l~~a~~~~~d~a~l~G~g~~~--~p~Gi~~~------------ 270 (418) T protein:vir:10 208 NLKNQPVRTIAHLFKASRQILDDAPA---LQSYIDGRARYGLQLTEEGQILKGDGTGA--NILGILPQ------------ 270 (418) T ss_pred eeEEEeeeeEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccccc------------ Confidence 99999999999999999999876543 44445566789999999999999974321 12233110 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) ++... . . 
T Consensus 271 --~~~~~-------------------------------------~---~------------------------------- 277 (418) T protein:vir:10 271 --ASAFM-------------------------------------P---S------------------------------- 277 (418) T ss_pred --ccccc-------------------------------------c---c------------------------------- Confidence 00000 0 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) . + ..+....++|+++ +..+........+|+||++....|+. ..+.....+-.+...+ ..-.+.|+||..+ T Consensus 278 -~--~-~~~~~~~~~i~~~----~~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~i~~~~~~~-~~~~l~G~pV~~~ 347 (418) T protein:vir:10 278 -I--T-LANATPIDKIRLA----LLQAVLAEFPATGIVLNPIDWASIEL-TKDSQGRYIVGNPVNG-TTPRLWNLPVVET 347 (418) T ss_pred -c--c-ccccccHHHHHHH----HHhhccccCCCCEEEEcHHHHHHHHH-hhcCCCceeccccccC-CCceecceeeEEc Confidence 0 0 0001122344443 34444444555689999999999985 4565555555443333 2346899999999 Q ss_pred eeccCCccccC Q lcl|Aclame:pro 320 DALLNTESRVV 330 (330) Q Consensus 320 dal~~tE~~Vv 330 (330) +++..++..+. T Consensus 348 ~~~p~~~~~~g 358 (418) T protein:vir:10 348 QAMTANEFLVG 358 (418) T ss_pred CCCCCCcEEEe Confidence 99987653222 No 28 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.06 E-value=1.3e-11 Score=80.31 Aligned_cols=229 Identities=13% Similarity=0.035 Sum_probs=142.8 Q ss_pred CC-ccccccccHHHH-----------HhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEecc-CCcceee Q lcl|Aclame:pro 1 MA-TLSTNNPTMADV-----------AKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGL-PTPTWRK 67 (330) Q Consensus 1 M~-~~~~~a~TL~E~-----------Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~l-P~~~fR~ 67 (330) +. -.+...+-+... 
+-.+-+......||+.+.+.++|++.++.....++ .+.+.+.++- +.+.|.. T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:81 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVA 173 (390) T ss_pred HhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCC-ceEEEEEecCCcceeeec Confidence 00 000000001000 11122233556799999999999999998877544 5777777764 6899999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhh Q lcl|Aclame:pro 68 LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPR 147 (330) Q Consensus 68 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R 147 (330) =++..++++.++.+++-..+-+++.+.|.+.+.+..++ +...-.....+++++.+.+.|++||... +.+.||.. T Consensus 174 Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~---~~~~i~~~l~~~~~~~~d~a~l~G~g~~--~~~~Gi~~- 247 (390) T protein:vir:81 174 EGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQ---LASYMNNRLIRGLKVKEDAEILRGTGAN--DGLLGLIP- 247 (390) T ss_pred CCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCC--Ccccceee- Confidence 99999999999999999999999999999998886543 4444455677999999999999997432 22333311 Q ss_pred hcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeee Q lcl|Aclame:pro 148 YNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGL 227 (330) Q Consensus 148 ~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl 227 (330) .++..+ T Consensus 248 -------------~~~~~~------------------------------------------------------------- 253 (390) T protein:vir:81 248 -------------QATTYA------------------------------------------------------------- 253 (390) T ss_pred -------------cccccc------------------------------------------------------------- Confidence 000000 Q ss_pred EEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcc Q lcl|Aclame:pro 228 
TLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGER 307 (330) Q Consensus 228 ~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~ 307 (330) + ..........++|+++ +..+.........|+||++....|+. .++.....+-.....+. T Consensus 254 -------------~-~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~v~~~~~~~~l~~-lkd~~G~~l~~~~~~~~- 313 (390) T protein:vir:81 254 -------------A-PTTIAGATRVDQLRLA----MLQASLAEYNPSGIVINPIDWAAIEL-AKDANNQYLIGNARGTL- 313 (390) T ss_pred -------------c-ccccccchhHHHHHHH----HHhhccccCCCCEEEEcHHHHHHHHH-hhcCCCceeecCccccc- Confidence 0 0000011122344443 33443334444589999999999985 45555555544333222 Q ss_pred eEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 308 VMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 308 v~~~~gvpir~~dal~~tE~~Vv 330 (330) ...++|+||..++++..+...+. T Consensus 314 ~~~l~G~pv~~~~~~p~~~~~~g 336 (390) T protein:vir:81 314 TPTLWGLPVVATQAMAPGEFLVG 336 (390) T ss_pred CceecceeeEEcCCCCCCcEEEE Confidence 23789999999999986653322 No 29 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.05 E-value=4.5e-11 Score=77.31 Aligned_cols=226 Identities=12% Similarity=0.050 Sum_probs=144.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |..=+.+-.|..+ +..+-+......|||.+.+.++|++..+.+...++ .+.+.+.+ -|.++|..=++.+++++.++. 
T Consensus 1 ~g~~a~~~~~~~~-~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSA-KTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKP-EEEFTFMS-GVGAFWVDEAERIQTSKPTFT 77 (299) T ss_pred CCcCCCcccccCC-CceecchhHHHHHHHHHHhcchhhhhceeeecCCC-cEEEEEEc-CCceeeeecCcccccccccee Confidence 3332222222222 22344566777899999999999999998877544 45666655 589999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++-..+-+++.+.|-+.+.+... .++...=.....+++++.+...||+||....|. |+-... T Consensus 78 ~v~l~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~---gil~~~------------ 140 (299) T protein:vir:41 78 KAKMRSKKMGVIIPTTKENLNYSV--TNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNW---NILKSA------------ 140 (299) T ss_pred EEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhhcccCcccc---cccccc------------ Confidence 999999999999999999988643 234444456678999999999999998543322 110000 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) + .+. ... . . 
T Consensus 141 --~-~~~----------------------------------~~~---~-~------------------------------ 149 (299) T protein:vir:41 141 --T-DAS----------------------------------NLV---E-E------------------------------ 149 (299) T ss_pred --c-ccc----------------------------------eee---c-c------------------------------ Confidence 0 000 000 0 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) +....++|+++ +..++.......+|+||++....|+. .++.....+-.....+.. ..+.|+||..+| T Consensus 150 -------~~~~~~~l~~~----~~~l~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~-~~l~G~PV~~~~ 216 (299) T protein:vir:41 150 -------TANKYDDLNEA----IGLIEAEDLEPNGIATIRKQRVKYRS-TKDGNGMPIFNTATSNGV-DDVLGLPIAYTP 216 (299) T ss_pred -------ccccHHHHHHH----HHhhhcccCCcCEEEEcHHHHHHHHH-hhccCCceeecCCcCCCC-ceecceeeEEec Confidence 00012344443 34444333344679999999999996 455554554444333322 257899999999 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) ++......++ T Consensus 217 ~~~~~~~~~~ 226 (299) T protein:vir:41 217 KYTFGDKDIS 226 (299) T ss_pred ccCCCCCceE Confidence 9986432222 No 30 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.05 E-value=8.5e-11 Score=75.79 Aligned_cols=230 Identities=11% Similarity=0.003 Sum_probs=146.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) ||.-.+..=+ +-+......|||.+.+.|+|++..+.+....+ 
...+.+.++-|.++|..=++.+++++.++. T Consensus 1 ma~~t~~~G~-------lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~ 72 (300) T protein:vir:95 1 MSEAQLSKGN-------LFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKKTHGGVSLD 72 (300) T ss_pred CcccccCCcc-------eechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccccccce Confidence 7764333211 22344667899999999999999888765433 467888899999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|-+.+.+... +..++...-.....++++..+..++|||+...+ T Consensus 73 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~---------------------- 130 (300) T protein:vir:95 73 PVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRT---------------------- 130 (300) T ss_pred eeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCC---------------------- Confidence 999999999999999998876543 333444444556789999999999999962211 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |++... . | ... .++. . 
T Consensus 131 ----g~~~~~----------------~--~----~~~----------~~~~--~-------------------------- 146 (300) T protein:vir:95 131 ----KQASTI----------------I--G----DNC----------FDKK--V-------------------------- 146 (300) T ss_pred ----CCCccc----------------c--c----ccc----------cccc--c-------------------------- Confidence 111000 0 0 000 0000 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) ..+.........++|.+ ++..+.....-..+|.||++....|++ .++.....+..+...+..+-.+.|+||..+ T Consensus 147 -~~~~~~~~~~~~~~i~~----~~~~~~~~~~~~~~~vmn~~~~~~L~~-lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s 220 (300) T protein:vir:95 147 -TQTVPFKDTNPDESMED----AVGMIDGSERDITGAILDPIFTTALSK-MKNAEGGKLYPELAWGGVPDAINGLAVDKN 220 (300) T ss_pred -ceeecccccchHHHHHH----HHHHhhhcCCCccEEEECHHHHHHHHH-hhccCCCeeccCccccCCCceecceeeEEe Confidence 00000000001122322 223332222222379999999999985 466666666655555556678999999999 Q ss_pred eeccCCc--cc--cC Q lcl|Aclame:pro 320 DALLNTE--SR--VV 330 (330) Q Consensus 320 dal~~tE--~~--Vv 330 (330) +++.... .. 
++ T Consensus 221 ~~v~~~~~~~~~~~~ 235 (300) T protein:vir:95 221 RTVSYSQTDPKNTAI 235 (300) T ss_pred cCCCCCCCCCccEEE Confidence 9986422 22 22 No 31 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.04 E-value=1.4e-11 Score=80.12 Aligned_cols=233 Identities=16% Similarity=0.159 Sum_probs=144.2 Q ss_pred CCccc--cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MATLS--TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPN-KS 77 (330) Q Consensus 1 M~~~~--~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s-~~ 77 (330) |.+-. .+-. +-|......|||.+.+.++|++.++.+...++ .+.+.+.++-+.++|..=++..+++ .. T Consensus 107 ~~~~~~~~GG~--------~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~wv~E~~~~~~~~~~ 177 (401) T protein:vir:44 107 LQVGTDEDGGY--------AVPEELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTASGWVGETDTRSQTATS 177 (401) T ss_pred hhcCCCCCCce--------eccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccceeeccccccCccccc Confidence 22110 0001 12334566799999999999999998876543 5788888899999999888888765 47 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+++-..+-+.+.+.|-+.+.+... +..++-. ....++++..+..+|++||-...|.++..... . 
T Consensus 178 ~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~---------~ 245 (401) T protein:vir:44 178 RLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWIN---SELATEFAEQEEIAFTTGDGTKKPKGFLAYES---------T 245 (401) T ss_pred cceeeeeehhheeeehhhhHHHHhcchHHHHHHHH---HHHHHHHHHHHHhhhhccCCCCccceeecccc---------c Confidence 999999999999999999999988754 4444433 45678999999999999986655443221100 0 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) .....+. .|+. . T Consensus 246 ~~~~~~~------------~~~~------------------------~-------------------------------- 257 (401) T protein:vir:44 246 EESDKAR------------AFGK------------------------L-------------------------------- 257 (401) T ss_pred ccccccc------------cccc------------------------c-------------------------------- Confidence 0000000 0000 0 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) ..+- ..-.+....++|++++ +.++.....+.+|+||++.+.+|+. .+++....+-.........-.+.|.|| T Consensus 258 --~~~~-t~~~~~~~~d~i~~~~----~~l~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~g~~~~l~G~PV 329 (401) T protein:vir:44 258 --QHIV-SGEATAVTADAIIKLI----YTLRKAHRTGAKFMMNNNSLFAIRL-LKDTEGNYLWRPGLELGQPSSLAGYGI 329 (401) T ss_pred --cccc-cccccccCHHHHHHHH----HhcchhhhcCCEEEEcHHHHHHHHH-hhccCCceeecCCcCCCCCceecceee Confidence 0000 0000011134454432 3333333456789999999999995 566665555444433333456899999 Q ss_pred EEEeeccC--CccccC Q lcl|Aclame:pro 317 QRTDALLN--TESRVV 330 (330) Q Consensus 317 r~~dal~~--tE~~Vv 330 (330) ..+|++.. 
+...+| T Consensus 330 v~~~~~p~~~~~~~~i 345 (401) T protein:vir:44 330 AENEQMPDIAADAKAI 345 (401) T ss_pred EEecCcCCccCCccEE Confidence 99999875 333333 No 32 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.04 E-value=7.1e-11 Score=76.22 Aligned_cols=224 Identities=12% Similarity=0.035 Sum_probs=142.0 Q ss_pred cccccccH--HH-----HHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 4 LSTNNPTM--AD-----VAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 4 ~~~~a~TL--~E-----~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) |+.++=.- ++ .+..+ +......||+.+.+.++|++..+.++..++ .+.+.++++-|.+.|..=++.+++++ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l-~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~ 78 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYL-DPVQAKDYFAEAEKTSIVQRVAQKIPMGAT-GIVIPHWTGDVSAQWIGEGDMKPITK 78 (397) T ss_pred CCcCHHHHHHhhccCCCCcccc-chhHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEcCCcceEEecCCccccccc Confidence 22221111 11 01112 233466799999999999999999887543 57888999999999999999999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) .++.+++-..+-+++.+.|-+.+.+... .++...-.+...+++++++.+.||||+.. |+...|+.. 
T Consensus 79 ~~f~~v~l~~~k~~~~v~iS~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~gt--~~~~~~~~~---------- 144 (397) T protein:vir:23 79 GNMTKRDVHPAKIATIFVASAETVRANP--ANYLGTMRTKVATAIAMAFDNAALHGTNA--PSAFQGYLD---------- 144 (397) T ss_pred cceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccC--Ccccccccc---------- Confidence 9999999999999999999999887643 22333334556799999999999999732 111111100 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) .. + .. T Consensus 145 -----~~--~---------------------------------------------~~----------------------- 149 (397) T protein:vir:23 145 -----QS--N---------------------------------------------KT----------------------- 149 (397) T ss_pred -----cc--c---------------------------------------------ce----------------------- Confidence 00 0 00 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcc-----eEEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGER-----VMTF 311 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~-----v~~~ 311 (330) .. .......+++++.+ ..+.........|+||++.+..|+. .++.....+-.....+.. 
...+ T Consensus 150 ----~~---~~~~~~~~~~~~~~----~~l~~~~~~~a~~vmn~~~~~~L~~-lkd~~G~~i~~~~~~~~~~~~~~~~tl 217 (397) T protein:vir:23 150 ----QS---ISPNAYQGLGVSGL----TKLVTDGKKWTHTLLDDTVEPVLNG-SVDANGRPLFVESTYESLTTPFREGRI 217 (397) T ss_pred ----ee---ecccchhHHHHHHH----HhhhhcccCCCEEEEcHHHHHHHHH-hhccCCceeecccccccccccccCcee Confidence 00 00001112222222 2222223334689999999999995 555555554444332221 2368 Q ss_pred CCeEEEEEeeccCCccccC Q lcl|Aclame:pro 312 DGIPVQRTDALLNTESRVV 330 (330) Q Consensus 312 ~gvpir~~dal~~tE~~Vv 330 (330) .|+|+..++++...+..++ T Consensus 218 ~G~Pv~~s~~~~~g~~~~~ 236 (397) T protein:vir:23 218 LGRPTILSDHVAEGDVVGY 236 (397) T ss_pred eeeeEEEeCCCCCCceEEE Confidence 9999999999987665444 No 33 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.04 E-value=1.5e-10 Score=74.47 Aligned_cols=230 Identities=16% Similarity=0.121 Sum_probs=146.6 Q ss_pred CCccccccccHHHHHhh----------------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKR----------------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPT 64 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~----------------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~ 64 (330) |++| .|++.. +-+......|||.+.+.|+|++..+.+... 
+..+++.+.++-|.++ T Consensus 1 ~~~~-------~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~ 72 (338) T protein:vir:78 1 MATL-------NELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVG 72 (338) T ss_pred Ccch-------HHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccce Confidence 5543 222221 234445577999999999999999998765 3468888888887766 Q ss_pred eee--------cCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Q lcl|Aclame:pro 65 WRK--------LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDG 135 (330) Q Consensus 65 fR~--------lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~ 135 (330) |.. =++..++++.++.+++-.++-+++.+.|-+.+.+... +..++ -.....++++..+...||+||.. T Consensus 73 ~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~---i~~~la~a~~~~~d~~~l~G~g~ 149 (338) T protein:vir:78 73 QVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTK---LQADLAYAIGRGIDLAVFHGKSP 149 (338) T ss_pred eecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHH---HHHHHHHHHHHHHHHHhhcccCC Confidence 654 3566888999999999999999999999888776643 33333 33567899999999999999876 Q ss_pred cChhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCcee Q lcl|Aclame:pro 136 IAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRME 215 (330) Q Consensus 136 ~~p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~ 215 (330) ..+.++.|+.. +++..+. 
T Consensus 150 ~~~~~~~gi~~--------------~~~~~~~------------------------------------------------ 167 (338) T protein:vir:78 150 LTGSALQGIDT--------------NNVIVNT------------------------------------------------ 167 (338) T ss_pred Ccccccccccc--------------ccccccc------------------------------------------------ Confidence 65555544421 0000000 Q ss_pred EEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhcc-CCCCCCEEEEeChHHHHHHHHHh--hc Q lcl|Aclame:pro 216 GYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIP-QLGMGRAVWYMNRNLREKLRLGI--VD 292 (330) Q Consensus 216 ~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip-~~~~g~~~~y~n~~v~~~L~~q~--~~ 292 (330) ...+... +....+++.+..++..++ +......+|.||++....|.... ++ T Consensus 168 -----------------------~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d 220 (338) T protein:vir:78 168 -----------------------TNVDYLQ----TGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRD 220 (338) T ss_pred -----------------------ccccccc----ccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhcc Confidence 0000000 011222333333444442 33344568999999999886432 34 Q ss_pred cccceeeecccCCcceEEECCeEEEEEeeccCC-------ccccC Q lcl|Aclame:pro 293 KIANNLTWETVSGERVMTFDGIPVQRTDALLNT-------ESRVV 330 (330) Q Consensus 293 ~~~~~l~~~~~~g~~v~~~~gvpir~~dal~~t-------E~~Vv 330 (330) .....+..+...+...-.+.|+||..+++|... 
...++ T Consensus 221 ~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~ 265 (338) T protein:vir:78 221 ANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVV 265 (338) T ss_pred CCCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEE Confidence 445566555555555678999999999998642 12222 No 34 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.03 E-value=6e-11 Score=76.62 Aligned_cols=229 Identities=14% Similarity=0.033 Sum_probs=143.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |++= .+.+... .....|||.+.+.+.|++..+.+...++ ...+.+.++-|.++|..=++.++++..++. T Consensus 1 ma~~-gG~lvp~---------~~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~ 69 (298) T protein:vir:16 1 MVLN-KGTLFDP---------TLVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLA 69 (298) T ss_pred Cccc-Ccceech---------hHHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEecCCcccccccccee Confidence 7753 3333222 3445799999999999999998876543 467888999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|.+.+.+... 
+..++...-.....+++++.+..+++||....+ T Consensus 70 ~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~---------------------- 127 (298) T protein:vir:16 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRL---------------------- 127 (298) T ss_pred EEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC---------------------- Confidence 999999999999999998876543 334454444566789999999999999942111 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |+.... . |+...... . . T Consensus 128 ----g~~~~~-------------~---------~~~~~~~~---~----------------------------------~ 144 (298) T protein:vir:16 128 ----GTASAV-------------I---------GTNHFDSK---V----------------------------------T 144 (298) T ss_pred ----Cccccc-------------c---------cccccccc---c----------------------------------c Confidence 111000 0 00000000 0 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) +.....-.....-.++.+ ++..+.........|+||++.+..|++ .++.....+-.+...+..+-.++|+||..+ T Consensus 145 ~~~~~~~~~~~~~~~i~~----~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~ 219 (298) T protein:vir:16 145 QKVEAPRGIADPNGAIEN----AVELLTGVDADVTGIAINPSFRSALAK-QKDLQDNALFPELKWGATPDTINGLPVDVN 219 (298) T ss_pred cccccccccccHHHHHHH----HHHHhhhcCCCccEEEEcHHHHHHHHH-hhccCCCeeecCcccCCCCceecceeeEEe Confidence 000000000001122222 222222222223479999999999986 466666666554444445567899999999 Q ss_pred eeccCC----ccccC Q lcl|Aclame:pro 320 DALLNT----ESRVV 330 (330) Q Consensus 320 dal~~t----E~~Vv 330 (330) +++... 
+..++ T Consensus 220 ~~v~~~~~~~~~~~~ 234 (298) T protein:vir:16 220 KTVSDMSLTQRDRAI 234 (298) T ss_pred cccccccCCCccEEE Confidence 998752 22233 No 35 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.03 E-value=3.3e-11 Score=78.04 Aligned_cols=225 Identities=12% Similarity=0.069 Sum_probs=142.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) ..+......+..+-...+-+......||+.+.+.|+|++.++.+..+++ .+++.+.++.|.++|.+=++.+++++.++. T Consensus 21 ~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:96 21 PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred hhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeEecCCcccccccccee Confidence 0000000111111122234555667899999999999999999887643 588999999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++...+-+++.+.|.+.+.+... .++...-.....++++..+...+|||+...+ ...|+. 
T Consensus 100 ~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~--------------- 160 (324) T protein:vir:96 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIA--------------- 160 (324) T ss_pred EEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCcccc--------------- Confidence 999999999999999998877643 1233333355679999999999999963221 000000 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) +..+. . + . T Consensus 161 ~~~~~----------------------------------~---------~-----------------------------~ 168 (324) T protein:vir:96 161 QSIEK----------------------------------T---------N-----------------------------K 168 (324) T ss_pred ccccc----------------------------------c---------c-----------------------------e Confidence 00000 0 0 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) ... +....++|++ ++..|+.......+|+||++....|+. .++.....+-. +.....+.|+||..+. 
T Consensus 169 ~~~----~~~t~~~i~~----~~~~l~~~~~~~~~~vmn~~~~~~L~~-l~d~~G~~~~~----~~~~~~l~G~PV~~~~ 235 (324) T protein:vir:96 169 VIK----GDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRK-IVDPETKERIY----DRNSDSLDGLPVVNLK 235 (324) T ss_pred ecc----ccccHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHH-hhccCCCeeec----CCCCCcccceeeEeeC Confidence 000 0011233333 333344333444579999999999985 44544433322 2223458999999988 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) +..-++..++ T Consensus 236 ~~~~~~~~~~ 245 (324) T protein:vir:96 236 SSNLKRGELI 245 (324) T ss_pred CCCCCcceEE Confidence 8776766665 No 36 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.03 E-value=3.3e-11 Score=78.04 Aligned_cols=225 Identities=12% Similarity=0.069 Sum_probs=142.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) ..+......+..+-...+-+......||+.+.+.|+|++.++.+..+++ .+++.+.++.|.++|.+=++.+++++.++. 
T Consensus 21 ~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:78 21 PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred hhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeEecCCcccccccccee Confidence 0000000111111122234555667899999999999999999887643 588999999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++...+-+++.+.|.+.+.+... .++...-.....++++..+...+|||+...+ ...|+. T Consensus 100 ~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~--------------- 160 (324) T protein:vir:78 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIA--------------- 160 (324) T ss_pred EEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCcccc--------------- Confidence 999999999999999998877643 1233333355679999999999999963221 000000 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) +..+. . + . 
T Consensus 161 ~~~~~----------------------------------~---------~-----------------------------~ 168 (324) T protein:vir:78 161 QSIEK----------------------------------T---------N-----------------------------K 168 (324) T ss_pred ccccc----------------------------------c---------c-----------------------------e Confidence 00000 0 0 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEEe Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~d 320 (330) ... +....++|++ ++..|+.......+|+||++....|+. .++.....+-. +.....+.|+||..+. T Consensus 169 ~~~----~~~t~~~i~~----~~~~l~~~~~~~~~~vmn~~~~~~L~~-l~d~~G~~~~~----~~~~~~l~G~PV~~~~ 235 (324) T protein:vir:78 169 VIK----GDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRK-IVDPETKERIY----DRNSDSLDGLPVVNLK 235 (324) T ss_pred ecc----ccccHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHH-hhccCCCeeec----CCCCCcccceeeEeeC Confidence 000 0011233333 333344333444579999999999985 44544433322 2223458999999988 Q ss_pred eccCCccccC Q lcl|Aclame:pro 321 ALLNTESRVV 330 (330) Q Consensus 321 al~~tE~~Vv 330 (330) +..-++..++ T Consensus 236 ~~~~~~~~~~ 245 (324) T protein:vir:78 236 SSNLKRGELI 245 (324) T ss_pred CCCCCcceEE Confidence 8776766665 No 37 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.02 E-value=7.8e-11 Score=76.01 Aligned_cols=225 Identities=13% Similarity=0.034 Sum_probs=146.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |+...++ + 
+..+-+......||+.+.+.++|++.++.+...++ .+++.+.++-|.++|..=++.+++++.++. T Consensus 14 ~~~~~~~-----~-~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~ 86 (318) T protein:vir:24 14 IAQTGDT-----M-FKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT-GQKIPHWVGDVSAQWIGEGDMKPITKGNMT 86 (318) T ss_pred hhcccCc-----c-cceeechhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCcceEEecCCcccccccccee Confidence 2222111 1 11123445667899999999999999999887543 578889999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vi 160 (330) +++-.++-+++...+-+.+.+... .++...-.....++++.++...|++|+....|..+. . T Consensus 87 ~i~~~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~---~-------------- 147 (318) T protein:vir:24 87 SQTIAPHKIATIFVASAETVRANP--ANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIG---Q-------------- 147 (318) T ss_pred EEEEeeEEEEEeehhhHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccc---c-------------- Confidence 999999999999999998777533 133444445678999999999999997433221110 0 Q ss_pred ecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeec Q lcl|Aclame:pro 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) Q Consensus 161 dAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~N 240 (330) .+. + . ......+ . 
T Consensus 148 ---~~~----------------------~-------~-----~~~~~~~------------------------------~ 160 (318) T protein:vir:24 148 ---TTK----------------------A-------I-----SIADTTG------------------------------A 160 (318) T ss_pred ---ccc----------------------c-------c-----ccccccc------------------------------c Confidence 000 0 0 0000000 0 Q ss_pred ccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcce-----EEECCeE Q lcl|Aclame:pro 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERV-----MTFDGIP 315 (330) Q Consensus 241 Id~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v-----~~~~gvp 315 (330) +.....++.+ +...++.......+|+||++....|+. .++.....+-.....+... ..+.|+| T Consensus 161 -------~~~~~~~~~~----~~~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~p 228 (318) T protein:vir:24 161 -------TTVYDQVAVN----GLSLLVNDGKKWTHTLLDDITEPILNG-AKDQNGRPLFIESTYGEAASPFRSGRIVARP 228 (318) T ss_pred -------cchHHHHHHH----HHHhhccccCCCCEEEEcHHHHHHHHH-hhccCCceeecCccccCccccccCceEEEEe Confidence 0000122222 334445556666799999999999985 5666555554444332222 2567999 Q ss_pred EEEEeeccCCccccC Q lcl|Aclame:pro 316 VQRTDALLNTESRVV 330 (330) Q Consensus 316 ir~~dal~~tE~~Vv 330 (330) +..++++..+...++ T Consensus 229 v~~~~~~~~~~~~~~ 243 (318) T protein:vir:24 229 TILSDHVVEGTTVGF 243 (318) T ss_pred eEEeCCCCCCccEEE Confidence 999999987666544 No 38 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.01 E-value=4.2e-11 Score=77.49 Aligned_cols=224 Identities=12% Similarity=0.074 Sum_probs=141.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) 
-..+.....+..+-+..+-+......|||.+.+.++|++.++.+..+++ .+.+.+.++.|.+.|.+=++.+++++.++. T Consensus 21 ~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~ 99 (324) T protein:vir:97 21 PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) T ss_pred hhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCC-ceEEEEEecCcceeEeccCcccccccccee Confidence 0000000111112122233455667899999999999999999887644 478888999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-.++-+++.+.|-+.+.+... +.. ..-.....++++...++.+|+|+.... ...|+.. T Consensus 100 ~v~~~~~k~~~~~~is~ell~ds~~~l~---~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~gi~~------------- 161 (324) T protein:vir:97 100 NATMRAFKLGVILPVTKEFLNYTYSQFF---EEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQ------------- 161 (324) T ss_pred EEEEeeEEEEEeehhhHHHHhcchHHHH---HHHHHHHHHHHHHHHHHHhhccCCCCc--cCccccc------------- Confidence 999999999999999998887643 333 333355678999999999999974321 1111100 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) + .... 
+ T Consensus 162 --~----------------------------------~~~~---------~----------------------------- 167 (324) T protein:vir:97 162 --S----------------------------------IEKT---------N----------------------------- 167 (324) T ss_pred --c----------------------------------cccc---------c----------------------------- Confidence 0 0000 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~ 319 (330) ++. .+....++|++ ++..|+.......+|+||...+..|+. ..+.....+-.. ...-.+.|.||..+ T Consensus 168 ~~~----~~~~~~~~i~~----~~~~l~~~~~~~~~~v~n~~~~~~L~~-lkd~~g~~~~~~----~~~~tl~G~PV~~~ 234 (324) T protein:vir:97 168 KVI----KGDFTQDNIID----LEALLEDDELEANAFISKTQNRSLLRK-IVDPETKERIYD----RNSDTLDGLPVVNL 234 (324) T ss_pred eec----cccCCHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHH-hhcCCCceeecC----CCCccccceeeEee Confidence 000 00011223332 233333333334579999999999985 444443333222 22235799999999 Q ss_pred eeccCCccccC Q lcl|Aclame:pro 320 DALLNTESRVV 330 (330) Q Consensus 320 dal~~tE~~Vv 330 (330) ++...+...++ T Consensus 235 ~~~~~~~~~~~ 245 (324) T protein:vir:97 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 98777766655 No 39 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.99 E-value=1.2e-10 Score=74.91 Aligned_cols=236 Identities=12% Similarity=0.026 Sum_probs=145.9 Q ss_pred CCcccc---ccccHHHH---------HhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeec Q lcl|Aclame:pro 1 MATLST---NNPTMADV---------AKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKL 68 (330) Q Consensus 1 M~~~~~---~a~TL~E~---------Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~l 68 
(330) |++=+. ..+...|. +..+-+......|||.+.+.++|++..+++...++ .+++.+.++-|+++|..= T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E 79 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTT-GQKIPHWTGDVSASWIGE 79 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCC-ceEEEEEeCCcceEEecC Confidence 222110 00111111 11123444667799999999999999999876533 578889999999999999 Q ss_pred CCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhh Q lcl|Aclame:pro 69 YGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRY 148 (330) Q Consensus 69 N~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~ 148 (330) ++.+++++.++.+++-..+-+++.+.|-+.+.+... .++...-.+...+++++.+.+.+|+||.+..|..+.... T Consensus 80 g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~--~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~--- 154 (326) T protein:vir:42 80 GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP--ANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTT--- 154 (326) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc--- Confidence 999999999999999999999999999998877643 233333445567899999999999998644332211000 Q ss_pred cccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeE Q lcl|Aclame:pro 149 NSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLT 228 (330) Q Consensus 149 ~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~ 228 (330) .....+...+ T Consensus 155 ------~~~~~~~~~~---------------------------------------------------------------- 164 (326) T protein:vir:42 155 ------KEVSLVDPDG---------------------------------------------------------------- 164 (326) T ss_pred ------cccceeeccc---------------------------------------------------------------- Confidence 0000000000 Q ss_pred EeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccC-Ccc Q 
lcl|Aclame:pro 229 LRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVS-GER 307 (330) Q Consensus 229 v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~-g~~ 307 (330) .+.+ .+....++. +..++..+........+|+||++.+..|+. .++.....+-..... |.. T Consensus 165 ----------~~~~-----~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~~ 226 (326) T protein:vir:42 165 ----------TGSN-----ADLTVYDAV--AVNALSLLVNAGKKWTHTLLDDITEPILNG-AKDKSGRPLFIESTYTEEN 226 (326) T ss_pred ----------cccc-----ccchhHHHH--HHHHHhhhhhhccCccEEEEeHHHHHHHHH-hhccCCceeeccccccCcc Confidence 0000 000001110 111122223334455689999999999995 566655555444332 221 Q ss_pred ----eEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 308 ----VMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 308 ----v~~~~gvpir~~dal~~tE~~Vv 330 (330) -..+.|+||..++++..++..++ T Consensus 227 ~~~~~~~l~G~pv~~~~~~~~~~~~~~ 253 (326) T protein:vir:42 227 SPFRLGRIVARPTILSDHVASGTVVGY 253 (326) T ss_pred ccccCceeeeeeEEEcCCCCCCceEEE Confidence 23689999999999988776554 No 40 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.99 E-value=1.1e-10 Score=75.12 Aligned_cols=241 Identities=15% Similarity=0.138 Sum_probs=146.1 Q ss_pred CCccccccccHHHH---Hh-------hcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCC Q lcl|Aclame:pro 1 MATLSTNNPTMADV---AK-------RLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYG 70 (330) Q Consensus 1 M~~~~~~a~TL~E~---Ak-------~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~ 70 (330) |.......++-.|. .. 
-+-|......|++.+.+.++|++..+.+...++ .+.+.+.++-+.++|..=++ T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~~ 168 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTSGWVGETD 168 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcceeeecccc Confidence 11100011111110 00 022334566799999999999999988777544 57888889999999999888 Q ss_pred ccCccc-ceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhh Q lcl|Aclame:pro 71 GVLPNK-SSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRY 148 (330) Q Consensus 71 g~~~s~-~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~ 148 (330) ..++++ +++.+++-..+-+++.+.|-+.+.+... +..++-. ....+++++.+..+|++||-...| .||-.. T Consensus 169 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~i~~~~~~a~l~G~G~~~p---~Gil~~- 241 (407) T protein:vir:48 169 ARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWIN---SELALEFAEQEEIAFTSGDGSKKP---KGFLAY- 241 (407) T ss_pred cccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHH---HHHHHHHHHHHHhhhhccCCCCcc---ceeeec- Confidence 888765 7999999999999999999999988744 4444433 456788999999999999865443 343110 Q ss_pred cccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeE Q lcl|Aclame:pro 149 NSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLT 228 (330) Q Consensus 149 ~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~ 228 (330) ........+. .|+. 
T Consensus 242 -----~~~~~~~~~~------------~~~~------------------------------------------------- 255 (407) T protein:vir:48 242 -----ESTDEDDKTR------------AFGK------------------------------------------------- 255 (407) T ss_pred -----cccccccccc------------cccc------------------------------------------------- Confidence 0000000000 0000 Q ss_pred EeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcce Q lcl|Aclame:pro 229 LRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERV 308 (330) Q Consensus 229 v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v 308 (330) +..+... .......++|++++ +.++.....+.+|+||++....|+. .++.....+-........+ T Consensus 256 ---------~~~~~~~-~~~~~~~d~i~~l~----~~l~~~~~~~a~~v~n~~~~~~L~~-lkD~~Gr~l~~~~~~~g~~ 320 (407) T protein:vir:48 256 ---------LQHIASG-AASGVTADAIIKLI----YTLRKAHRSGAKFMMNNSSLFAIRL-LKDNDGNYLWRPGIELGQP 320 (407) T ss_pred ---------ccccccc-cccccChHHHHHHH----HhhchhhhcCCEEEEcHHHHHHHHH-hhccCCceeeccCcCCCCC Confidence 0000000 00011234455433 3344334456789999999999985 5666655554344443444 Q ss_pred EEECCeEEEEEeeccCCc--cccC Q lcl|Aclame:pro 309 MTFDGIPVQRTDALLNTE--SRVV 330 (330) Q Consensus 309 ~~~~gvpir~~dal~~tE--~~Vv 330 (330) -.+.|.||..+|.+.... 
..+| T Consensus 321 ~~l~G~PV~~~~~~p~~~~~~~~i 344 (407) T protein:vir:48 321 SSLAGYGIVENEQMPDIAADAKAI 344 (407) T ss_pred ceecceeeEEecCcCCccCCccEE Confidence 578899999999987522 2222 No 41 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.98 E-value=2.6e-10 Score=73.14 Aligned_cols=230 Identities=9% Similarity=-0.006 Sum_probs=147.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |++-.++..++- ......||+.+.+.|+|++..+.+...++ ..++.+.++-|.+.|..=++.+++++.++. T Consensus 1 m~t~t~gg~liP--------~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~ 71 (303) T protein:vir:97 1 MGTETSKASLFD--------KHLVSDLINKVKGHSSLAKLSSQKPIPFN-GSKEFTFTLDSDIDVVAENGKKTHGGLSLE 71 (303) T ss_pred CcccCCCCeEcc--------hhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEecCcceEEeecCcccccccccee Confidence 888666554433 33456799999999999999988776544 457788889999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) +++-..+-+++.+.|-+.+.+... 
+.-++...=.....+++++.+...+++|+...+ T Consensus 72 ~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~---------------------- 129 (303) T protein:vir:97 72 PVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRT---------------------- 129 (303) T ss_pred eEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCC---------------------- Confidence 999999999999999998875543 333444445566789999999999999952221 Q ss_pred eecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEee Q lcl|Aclame:pro 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) Q Consensus 160 idAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~ 239 (330) |++.....+ .++. +.. . T Consensus 130 ----g~~~~~~~~-----------~~~~-----~~~---~---------------------------------------- 146 (303) T protein:vir:97 130 ----KKASDVIGT-----------NHFD-----SKV---T---------------------------------------- 146 (303) T ss_pred ----ccccccccc-----------cccc-----ccc---c---------------------------------------- Confidence 111110000 0000 000 0 Q ss_pred cccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeeccc-CCcceEEECCeEEEE Q lcl|Aclame:pro 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETV-SGERVMTFDGIPVQR 318 (330) Q Consensus 240 NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~-~g~~v~~~~gvpir~ 318 (330) +. ++.-......+++.++ +..+.........|.||++....|++ .++.....+-..+. .+.....+.|+||.. 
T Consensus 147 ~~-~~~~~~~~~~~~i~~~----~~~~~~~~~~~~~~vmn~~~~~~L~~-lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~ 220 (303) T protein:vir:97 147 QV-VKFTESEDADANIEAA----VNLIQGAEGVVTGLAMDTEFSTALAK-VTNGEMGPKMYPELAWGANPDSINGLKSSV 220 (303) T ss_pred cc-cccccccchHHHHHHH----HHHHhhcCCCccEEEEcHHHHHHHHH-hhccCCCeEEecCccCCCCCceecceeeEE Confidence 00 0000000112233332 33332222233469999999999985 46666555544443 344556789999999 Q ss_pred EeeccCCc------cccC Q lcl|Aclame:pro 319 TDALLNTE------SRVV 330 (330) Q Consensus 319 ~dal~~tE------~~Vv 330 (330) ++++.... ..++ T Consensus 221 s~~v~~~~~~~~~~~~~~ 238 (303) T protein:vir:97 221 NTTVGAGADEAESKDLVI 238 (303) T ss_pred ecccCCccccCCCccEEE Confidence 99987422 2222 No 42 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.97 E-value=1.1e-10 Score=75.28 Aligned_cols=230 Identities=14% Similarity=0.105 Sum_probs=139.5 Q ss_pred CCc-cccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcce-eeccCCccceeEEEeccCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MAT-LSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTA-IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~-~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf-~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 78 (330) +.. ......|...-.. 
+-+......+|+.+.+.+.+|..+.- ...+++..+...+.++-|.++|..=++.+++++.+ T Consensus 104 ~~~~~~~~~~t~~~~g~-~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 182 (392) T protein:vir:13 104 FEFAPEKRDGTKAGNPN-VLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPA 182 (392) T ss_pred HHhhhhhhcccccCCCc-cccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccc Confidence 000 0000001000011 11223445567777777777777654 44444445677788889999999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-..+-+.+.+.|.+.+.+... +..++-. ....++++..+..+|||||-...|+++. . T Consensus 183 f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~i~~~~d~~~l~G~Gt~~p~Gil---~----------- 245 (392) T protein:vir:13 183 TTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLV---SDAGPAIGDAMGRHFLTGTGTGQPRGIL---T----------- 245 (392) T ss_pred eeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHH---HHHHHHHHHHHHHHHhcccCCccccccc---c----------- Confidence 99999999999999999999988754 3444333 4467899999999999998544333221 0 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) ++++.... + . .+. 
T Consensus 246 ---~~~~~~~~------~-----------------------------~--~~~--------------------------- 258 (392) T protein:vir:13 246 ---DATGANAA------F-----------------------------G--EAD--------------------------- 258 (392) T ss_pred ---cccccccc------c-----------------------------c--ccc--------------------------- Confidence 00000000 0 0 000 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) .+....++|+++ +..++.....+.+|+||++.+..|+. .++.....+-.....+...-.+.|+||. T Consensus 259 ---------~~~~~~d~l~~~----~~~l~~~~~~~a~~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~g~~~~l~G~Pv~ 324 (392) T protein:vir:13 259 ---------ADSKVSDALIDL----FHEVPSAYRKNAKFVVNDLRAAQMRK-LKDANGQYLWQSALTVGAPDTFNGKVVE 324 (392) T ss_pred ---------cccccHHHHHHH----HHhhhhhhhcCCEEEEcHHHHHHHHH-hhccCCceeecCCcCCCCCceecceeeE Confidence 001112344443 23333333445689999999999996 4666555555445444455688999999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) .+|++...+- ++ T Consensus 325 ~~~~~~~~~i-~~ 336 (392) T protein:vir:13 325 TDDGMPADKV-LF 336 (392) T ss_pred EcCCCCCCcE-EE Confidence 9999986541 22 No 43 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.96 E-value=1.2e-10 Score=75.05 Aligned_cols=231 Identities=13% Similarity=0.011 Sum_probs=130.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHH-hccchhHhhcceeeccCCccceeEEEec--------cCCcceeecCCc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEML-NQTNPVLQDMTAIEGNLPTGHRTSVRTG--------LPTPTWRKLYGG 71 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l-~~~s~iL~~lpf~e~n~g~~~~~~~~~~--------lP~~~fR~lN~g 71 (330) +..- .+ 
++.+--....+. ....+|..+ ...+.|.+.++.....++ .+.|.++++ -+.++|..=++. T Consensus 121 ~~~~-~~--~~~~~~~~~~p~-~~~~~i~~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 195 (419) T protein:vir:94 121 RDAP-AG--TITNPNVPHLPQ-LVPGIVPTTPDLPLLVADLLDQQNADYN-VLEYIRDTSGTAGAGSTWNKAAVVPEGTA 195 (419) T ss_pred cccc-cc--cccCCcccccch-hhhHHHHHHHhhhhhhhhcceeeeccCC-ceeeeeeccccccccccCcccceecCCcc Confidence 0000 00 000000001112 233444444 444455666776655433 355555544 346789998999 Q ss_pred cCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccc Q lcl|Aclame:pro 72 VLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSL 151 (330) Q Consensus 72 ~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~ 151 (330) .++++.++.+++-.++-+++.+.|.+.+.+-.+ ++.+.-.....++++..+...|||||...+|++ +-. T Consensus 196 ~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~G---i~~----- 264 (419) T protein:vir:94 196 KPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQG---ILT----- 264 (419) T ss_pred ccccccceeeEEeeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHHHhccCcccccc---eec----- Confidence 999999999999999999999999999887543 344444455789999999999999986554433 211 Q ss_pred ccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEec Q lcl|Aclame:pro 152 SAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRD 231 (330) Q Consensus 152 t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d 231 (330) . .+...+.. +++ . 
T Consensus 265 -~-~~~~~~~~------------------------~~~--------------------~--------------------- 277 (419) T protein:vir:94 265 -T-PGIGTYQQ------------------------PKP--------------------T--------------------- 277 (419) T ss_pred -c-cccccccc------------------------ccc--------------------c--------------------- Confidence 0 00000000 000 0 Q ss_pred cccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEE Q lcl|Aclame:pro 232 WRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTF 311 (330) Q Consensus 232 ~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~ 311 (330) .........++|++ ++..+....-...+|+||++....|+..-.+.....+-.....+...-.+ T Consensus 278 ------------~~~t~~~~~~~l~~----~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l 341 (419) T protein:vir:94 278 ------------APATDEPPLVDIRR----AKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRI 341 (419) T ss_pred ------------cccccchhHHHHHH----HHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccc Confidence 00000011233333 33444333334458999999999998653333444444444445555678 Q ss_pred CCeEEEEEeeccCCccccC Q lcl|Aclame:pro 312 DGIPVQRTDALLNTESRVV 330 (330) Q Consensus 312 ~gvpir~~dal~~tE~~Vv 330 (330) +|+||..++.+..++..+. 
T Consensus 342 ~G~pV~~~~~~~~~~~~~g 360 (419) T protein:vir:94 342 WGLNVVSTVAIAQGTALVG 360 (419) T ss_pred cceeeEEcCCCCCccEEEe Confidence 9999999999987653322 No 44 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.95 E-value=6.6e-11 Score=76.40 Aligned_cols=233 Identities=12% Similarity=0.069 Sum_probs=139.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcc----- Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPN----- 75 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s----- 75 (330) ++.+.++ .|... ...+-+......||+.+.+.++|++.++.+..+++ .+.+.++++-|.+.|..-+...+++ T Consensus 158 ~~a~~~~-~~~~~-g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~ 234 (458) T protein:vir:10 158 LKAVNQS-SSVEV-SSESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWVAASTYGTDTTTGEE 234 (458) T ss_pred hhhhhhc-ccCcc-ccceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeeccccccccccccccc Confidence 1111000 00001 11122344566799999999999999998877644 5778889999999999988877754 Q ss_pred -cceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccccc Q lcl|Aclame:pro 76 -KSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSA 153 (330) Q Consensus 76 -~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~ 153 (330) +.++.+++-..+-+++.+.|.+.+.+... +..++-. ....++++......|||||-...|. |+. 
T Consensus 235 ~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~---~~l~~~i~~~~d~~~l~G~G~~~p~---Gi~-------- 300 (458) T protein:vir:10 235 VKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLR---KRLIEAHAVSIEEAFMTGDGSGKPK---GLL-------- 300 (458) T ss_pred ccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHH---HHHHHHHHHHHHHHhhcCCCCCccc---eee-------- Confidence 56899999999999999999999887643 4444433 4467999999999999997443222 220 Q ss_pred CCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccc Q lcl|Aclame:pro 154 ENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWR 233 (330) Q Consensus 154 ~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r 233 (330) +..+.++. T Consensus 301 -------~~~~~~~~----------------------------------------------------------------- 308 (458) T protein:vir:10 301 -------TLASEDSA----------------------------------------------------------------- 308 (458) T ss_pred -------eccccccc----------------------------------------------------------------- Confidence 00000000 Q ss_pred cEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeeccc----CCcceE Q lcl|Aclame:pro 234 YVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETV----SGERVM 309 (330) Q Consensus 234 ~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~----~g~~v~ 309 (330) .-+.... ....+....++|++++ ..++......+.|+||++....|+. ..+.....+-...+ ....+- T Consensus 309 --~~~~~~~-~~~~~~~~~~~i~~~~----~~l~~~~~~~~~~v~~~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~~~~ 380 (458) T protein:vir:10 309 --KVVTEAK-ADGSVLVTAKTISKLR----RKLGRHGLKLSKLVLIVSMDAYYDL-LEDEEWQDVAQVGNDSVKLQGQVG 380 (458) T ss_pred --ceeeccc-ccccccccHHHHHHHH----HhhhhhhcCCCEEEEcHHHHHHHHh-hcccCCceeeccccccccccCcCc Confidence 0000000 0000111234555543 3445455566889999999999884 44444333322111 112233 Q ss_pred EECCeEEEEEeeccC--CccccC Q lcl|Aclame:pro 310 TFDGIPVQRTDALLN--TESRVV 330 (330) Q Consensus 310 ~~~gvpir~~dal~~--tE~~Vv 330 (330) .++|+||..+|++.. 
+...++ T Consensus 381 ~l~G~pv~~~~~~p~~~~~~~~~ 403 (458) T protein:vir:10 381 RIYGLPVVVSEYFPAKANSAEFA 403 (458) T ss_pred eecceeeEEccccccccCCcceE Confidence 678999999999976 233333 No 45 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.94 E-value=5.5e-10 Score=71.36 Aligned_cols=230 Identities=14% Similarity=0.107 Sum_probs=144.1 Q ss_pred CCccccccccHHHHHhh----------------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKR----------------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPT 64 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~----------------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~ 64 (330) ||.| .|+... +-+......|||.+.+.++|++..+.+...+ ..+++.+.++-|.++ T Consensus 1 ~a~l-------~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~ 72 (333) T protein:vir:78 1 MATL-------NELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVG 72 (333) T ss_pred Cchh-------HHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeE Confidence 5543 232211 2244456779999999999999999987754 357889999999998 Q ss_pred eeec--------CCccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Q lcl|Aclame:pro 65 WRKL--------YGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDG 135 (330) Q Consensus 65 fR~l--------N~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~ 135 (330) |-.= ++..+.++.++.+++-..+-+++.+.|-+.+.+... +..++-. ....++++..+...|||||.. 
T Consensus 73 ~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~---~~la~ai~~~~d~~~l~G~g~ 149 (333) T protein:vir:78 73 QVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQ---GDLAYAIGRGIDLAVFHGKSP 149 (333) T ss_pred eecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHH---HHHHHHHHHHHHHHHhcccCC Confidence 8543 234678889999999999999999999999887644 4444433 456799999999999999866 Q ss_pred cChhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCcee Q lcl|Aclame:pro 136 IAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRME 215 (330) Q Consensus 136 ~~p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~ 215 (330) ..|..|.|+..- ++.. .. T Consensus 150 ~~~~~~~g~~~~--------------~~~~--~~---------------------------------------------- 167 (333) T protein:vir:78 150 LTGSALQGIDTD--------------NVIA--NT---------------------------------------------- 167 (333) T ss_pred CCCccccccccc--------------cccc--cc---------------------------------------------- Confidence 655555444210 0000 00 Q ss_pred EEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhcc-CCCCCCEEEEeChHHHHHHHHHhh--c Q lcl|Aclame:pro 216 GYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIP-QLGMGRAVWYMNRNLREKLRLGIV--D 292 (330) Q Consensus 216 ~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip-~~~~g~~~~y~n~~v~~~L~~q~~--~ 292 (330) .+++...-.++...++|++ ++..++ +......+|.||.+....|+.... 
+ T Consensus 168 -----------------------~~~~~~~~~~~~~~~~i~~----~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d 220 (333) T protein:vir:78 168 -----------------------TNVDYLQETGDPLLDRLLD----GYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRD 220 (333) T ss_pred -----------------------ccccccccccchhHHHHHH----HHHhhccccccCceEEEEcchHHHHHHHHhhhcC Confidence 0000000001111233333 333332 222233479999999988876443 3 Q ss_pred cccceeeecccCCcceEEECCeEEEEEeeccCC-cc------ccC Q lcl|Aclame:pro 293 KIANNLTWETVSGERVMTFDGIPVQRTDALLNT-ES------RVV 330 (330) Q Consensus 293 ~~~~~l~~~~~~g~~v~~~~gvpir~~dal~~t-E~------~Vv 330 (330) .....+......+...-.+.|+||..+++|..+ .+ .++ T Consensus 221 ~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~ 265 (333) T protein:vir:78 221 ANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRII 265 (333) T ss_pred CCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEE Confidence 444555544444444567899999999999853 21 222 No 46 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.92 E-value=2.4e-10 Score=73.30 Aligned_cols=231 Identities=15% Similarity=0.131 Sum_probs=134.7 Q ss_pred CC---------------------c-cc--cccccHHHHHhhcCcccchHHHHHHHhccchhHhh-cceeeccCCccceeE Q lcl|Aclame:pro 1 MA---------------------T-LS--TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQD-MTAIEGNLPTGHRTS 55 (330) Q Consensus 1 M~---------------------~-~~--~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~-lpf~e~n~g~~~~~~ 55 (330) ++ . .+ .+..|-.+ .-.+-|......||+.+.+.++|++. ...+....+ ...+. 
T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~-gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~p 180 (435) T protein:vir:14 103 RALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGA-GGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIP 180 (435) T ss_pred HHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCC-CccccchhHHHHHHHHHhhhchhhhhcceeeecCCC-ceEEE Confidence 00 0 00 00000000 00122444456799999998988875 445544433 46788 Q ss_pred EEeccCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|Aclame:pro 56 VRTGLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGND 134 (330) Q Consensus 56 ~~~~lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~ 134 (330) +.++-|.++|..=++.++++..++.+++-.++-+++.+.|-+.+.+..+ +.+ +...-.....+++++++.+.|++||. T Consensus 181 ~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~-l~~~i~~~l~~ai~~~~d~a~l~G~G 259 (435) T protein:vir:14 181 RLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN-VDQIVVGDLTAAIGAREDKAFIRDDG 259 (435) T ss_pred EEeCCcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHH-HHHHHHHHHHHHHHHHHHHHhhccCC Confidence 8889999999999999999999999999999999999999988877654 322 22333345679999999999999974 Q ss_pred CcChhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCce Q lcl|Aclame:pro 135 GIAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRM 214 (330) Q Consensus 135 ~~~p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~ 214 (330) .. +.+.||.. ...+. . +. .. 
T Consensus 260 ~~--~~p~Gi~~-------------------------------------~~~~~----~----------~~--~~----- 279 (435) T protein:vir:14 260 TA--NTPKGLRF-------------------------------------WALPS----N----------VI--TA----- 279 (435) T ss_pred CC--ccccceee-------------------------------------ccccc----c----------ee--cc----- Confidence 32 12333310 00000 0 00 00 Q ss_pred eEEEEEEeeeeeeEEeccccEEEeecccccccccchh-HHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc Q lcl|Aclame:pro 215 EGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSAN-AQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK 293 (330) Q Consensus 215 ~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~-~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~ 293 (330) ++..+... ..++. .++.++...+. ...+.+|+||++....|+. .++. T Consensus 280 -----------------------------~~~~~~~~~~~~~~-~l~~~~~~~~~-~~~~~~~v~n~~~~~~L~~-lkd~ 327 (435) T protein:vir:14 280 -----------------------------SDASTLQKIETDLG-KVILALENADA-NLTQPGWIMAPRTFRFLEG-LRDG 327 (435) T ss_pred -----------------------------ccccchhhHHHHHH-HHHHHhhhccc-cccCCEEEEcHHHHHHHHH-hhcc Confidence 00000011 11222 22222222211 2234579999999999985 5565 Q ss_pred ccceeeecccCCcceEEECCeEEEEEeeccC------CccccC Q lcl|Aclame:pro 294 IANNLTWETVSGERVMTFDGIPVQRTDALLN------TESRVV 330 (330) Q Consensus 294 ~~~~l~~~~~~g~~v~~~~gvpir~~dal~~------tE~~Vv 330 (330) ....+-.+...| .++|+||..++.+.. 
++..|+ T Consensus 328 ~G~~l~~~~~~g----~l~G~Pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:14 328 NGNKVYPELANG----MLKGYPVGKTTQVPINLGETGKESEIY 366 (435) T ss_pred CCceeccCCCCC----eeecceeEeeccccccccCCCccceEE Confidence 555554433333 579999999999864 222333 No 47 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.92 E-value=1.1e-10 Score=75.28 Aligned_cols=235 Identities=13% Similarity=0.146 Sum_probs=136.9 Q ss_pred CCccc-----cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE------EeccCCcceeecC Q lcl|Aclame:pro 1 MATLS-----TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV------RTGLPTPTWRKLY 69 (330) Q Consensus 1 M~~~~-----~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~------~~~lP~~~fR~lN 69 (330) |=.|. +..+|+.+.....-.......+|+.+.++|+||..+..+...+ +|.... ++..++..|..-. 
T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~--s~~~~i~~i~~g~~~~~~~~~~~~~ 78 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRFGEFVREVRENSAIIKDARVLNALK--SYEVDISRISLGVELEPGRNTSGTK 78 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHHHHHHHHHHhccchhhheeeecccC--ccceeecccccCcccccccccccCC Confidence 32221 2234444433333232334568999999999999999875421 122111 1223455555555 Q ss_pred CccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcCh-----hhccCh Q lcl|Aclame:pro 70 GGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAP-----AEFTGL 144 (330) Q Consensus 70 ~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p-----~~F~GL 144 (330) ...+++..++.++.-.++-|...+.|-..+.+-+--..++...-..++.+.++...+..|||||.+..+ +.++|+ T Consensus 79 ~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 79 VAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred ccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 556789999999999999999999996666554322234556666778899999999999999964321 133443 Q ss_pred hhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeee Q lcl|Aclame:pro 145 SPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWD 224 (330) Q Consensus 145 ~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~ 224 (330) -+. + +..+++ ..+. 
T Consensus 159 l~~------a-~~~~~~---------------------------------------------~~~~-------------- 172 (314) T protein:vir:41 159 MKL------A-GNQYTD---------------------------------------------AEPE-------------- 172 (314) T ss_pred hhh------c-ccceee---------------------------------------------cCcc-------------- Confidence 220 0 000000 0000 Q ss_pred eeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccC---CCCCCEEEEeChHHHHHHHHHhhccccceeeec Q lcl|Aclame:pro 225 IGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQ---LGMGRAVWYMNRNLREKLRLGIVDKIANNLTWE 301 (330) Q Consensus 225 ~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~---~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~ 301 (330) +.+...+++.. ++..+|. -+.++.+||||++....+++...++... +... T Consensus 173 -----------------------~~~~~~~~~~~---l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~-l~~~ 225 (314) T protein:vir:41 173 -----------------------DENWPLNLFDG---MMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETG-LGDS 225 (314) T ss_pred -----------------------ccccHHHHHHH---HHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCc-ccch Confidence 00111222222 2333343 2456789999999999988765555433 3333 Q ss_pred ccCCcceEEECCeEEEEEeeccC---CccccC Q lcl|Aclame:pro 302 TVSGERVMTFDGIPVQRTDALLN---TESRVV 330 (330) Q Consensus 302 ~~~g~~v~~~~gvpir~~dal~~---tE~~Vv 330 (330) ...+.....+.|+||..+..+.. .+..+. 
T Consensus 226 ~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~ 257 (314) T protein:vir:41 226 ALIGATGLQYDGIPIQYVPALDALGDDKARAL 257 (314) T ss_pred hhhCCCCceecceeeEecccccccCCCCceEE Confidence 34455677789999999988753 233333 No 48 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.91 E-value=4e-10 Score=72.10 Aligned_cols=232 Identities=15% Similarity=0.115 Sum_probs=136.9 Q ss_pred CCccc-------------------cccccHHHH--HhhcCcccchHHHHHHHhccchhHhh-cceeeccCCccceeEEEe Q lcl|Aclame:pro 1 MATLS-------------------TNNPTMADV--AKRLDPNGKVDIIVEMLNQTNPVLQD-MTAIEGNLPTGHRTSVRT 58 (330) Q Consensus 1 M~~~~-------------------~~a~TL~E~--Ak~~~~d~~~~~VIE~l~~~s~iL~~-lpf~e~n~g~~~~~~~~~ 58 (330) |..-. ...++.... ...+-|+.....|||.+.+.++|+.. ..++....+ ...+.+.+ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~p~~~ 183 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLK 183 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceEEEEEe Confidence 00000 000000000 00122444556799999999998875 445555544 47888999 Q ss_pred ccCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcC Q lcl|Aclame:pro 59 GLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIA 137 (330) Q Consensus 59 ~lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~ 137 (330) +-|.+.|..=++..++++.++.+++-..+-+++.+.|.+.+.+..+ +.+ +...-.....++++..+..+||+||...+ T Consensus 184 ~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~-l~~~i~~~l~~a~~~~~d~a~l~G~G~~~ 262 (435) T protein:vir:80 184 GGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN-VDQIVVGDLTAAIGAREDKAFIRDDGTAN 262 (435) T ss_pred CCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCC 
Confidence 9999999999999999999999999999999999999888776643 322 33334455689999999999999974321 Q ss_pred hhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEE Q lcl|Aclame:pro 138 PAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGY 217 (330) Q Consensus 138 p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y 217 (330) +..||... + +.++ + T Consensus 263 --~p~Gi~~~--------------~-~~~~------------------------------------~------------- 276 (435) T protein:vir:80 263 --TPKGLRFW--------------A-LPGN------------------------------------V------------- 276 (435) T ss_pred --cccceeec--------------c-cccc------------------------------------e------------- Confidence 12222110 0 0000 0 Q ss_pred EEEEeeeeeeEEeccccEEEeecccccccccc-hhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccc Q lcl|Aclame:pro 218 RTHYKWDIGLTLRDWRYVARVCNIDVSDLATS-ANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIAN 296 (330) Q Consensus 218 ~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~-~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~ 296 (330) +..+...+. ....++.+++..-... .....+.+|+||++...+|+. .+++... T Consensus 277 -----------------------~~~~~~~~~~~~~~d~~~~~~~~~~~--~~~~~~~~~vmn~~~~~~L~~-lkd~~G~ 330 (435) T protein:vir:80 277 -----------------------ITASDGSTLQKIETDLGKAILALENA--DANLTQPGWIMAPRTFRFLEG-LRDGNGN 330 (435) T ss_pred -----------------------eecccccchhhHHHHHHHHHHHhhcc--ccccccCEEEEcHHHHHHHHh-hhccCCc Confidence 000000010 1112333333222221 122334689999999999986 5566555 Q ss_pred eeeecccCCcceEEECCeEEEEEeeccC------CccccC Q lcl|Aclame:pro 297 NLTWETVSGERVMTFDGIPVQRTDALLN------TESRVV 330 (330) Q Consensus 297 ~l~~~~~~g~~v~~~~gvpir~~dal~~------tE~~Vv 330 (330) .+-.+...| .+.|+||..++.+.. 
++..++ T Consensus 331 ~l~~~~~~~----~l~G~pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:80 331 KVYPELANG----MLKGYPVGKTTQVPINLGEAGKESEIY 366 (435) T ss_pred eeccCCCCC----eEeeeeeEEeccccccccCCCCcceEE Confidence 554433333 489999999999864 222233 No 49 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.90 E-value=3.3e-11 Score=78.07 Aligned_cols=274 Identities=14% Similarity=-0.007 Sum_probs=151.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t~ 79 (330) |++-.++ +.. .+-+......||+.+.+.++|++.++......+ .+.|.++++ -|+++|..=++.++++..++ T Consensus 151 ~~~~~~~-----~gg-~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f 223 (497) T protein:vir:78 151 NPFGSTG-----TFA-PGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) T ss_pred hhcccCc-----ccc-cccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcccccccccc Confidence 1110000 000 122334556799999999999999988777554 467777765 57899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) .+++...+-+.+.+.|.+.+.+-.++..+ .-.....++++......|+|||-..+ ..||-..- .... 
T Consensus 224 ~~i~~~~~k~a~~~~iS~ell~d~~~l~~---~i~~~l~~~i~~~~d~~~l~G~G~~~---p~Gil~~~-------~~~~ 290 (497) T protein:vir:78 224 ARVYEQVGKVANALTITDEGLRDAPELFN---FVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRS-------TGFT 290 (497) T ss_pred eeeEeeeeeeEeecHhHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHhhcCCCccc---cccccccc-------cccc Confidence 99999999999999999999876544333 33455679999999999999997655 44553211 1111 Q ss_pred eecCCCCCCceEEE---EEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 160 IDAGGTGSDNASAW---LVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 160 idAGgtg~~~tSi~---~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) +..+.+....++.+ +..+..+....+ .+....+..+.. .+.++-. .|. T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~---~~~~~~~-----------------~~~--- 341 (497) T protein:vir:78 291 ASSASSLFGATSATVSNVKFPADGTNGAF------VGQDTVASLKYG---RVVTGAA-----------------GSG--- 341 (497) T ss_pred ccccccchhhhhhhhhhhhhhcccccchh------hhhhHHHHHHHH---Hhhhhhh-----------------hhc--- Confidence 22211111111110 000100000000 000000000000 0000000 000 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCC-CCCEEEEeChHHHHHHHHHhhccccceeeecccCC---cc---eE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLG-MGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG---ER---VM 309 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~-~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g---~~---v~ 309 (330) .++ +.......++.+.+..++..++... ....+|.||++-...|++ .++....++-.....+ .. .. 
T Consensus 342 --~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~~~~~~ 414 (497) T protein:vir:78 342 --SGV----AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL-TKDANGQYMGGNFFGNAYGNPVNGGK 414 (497) T ss_pred --cch----hccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHH-hhcCCCceeccCcccccccccccCCc Confidence 001 0111122345555555655554322 222379999999999984 5666555555443322 22 23 Q ss_pred EECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 310 TFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 310 ~~~gvpir~~dal~~tE~~Vv 330 (330) .++|+||..++++..+...|- T Consensus 415 ~l~G~pV~~t~~~~~~~~~~G 435 (497) T protein:vir:78 415 NIWGVPVVTTPLIPLGTILVG 435 (497) T ss_pred eeeceeeEecCCCCCCceEEe Confidence 678999999999986553221 No 50 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.90 E-value=3.3e-11 Score=78.07 Aligned_cols=274 Identities=14% Similarity=-0.007 Sum_probs=151.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t~ 79 (330) |++-.++ +.. 
.+-+......||+.+.+.++|++.++......+ .+.|.++++ -|+++|..=++.++++..++ T Consensus 151 ~~~~~~~-----~gg-~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f 223 (497) T protein:vir:10 151 NPFGSTG-----TFA-PGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) T ss_pred hhcccCc-----ccc-cccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcccccccccc Confidence 1110000 000 122334556799999999999999988777554 467777765 57899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCccee Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~v 159 (330) .+++...+-+.+.+.|.+.+.+-.++..+ .-.....++++......|+|||-..+ ..||-..- .... T Consensus 224 ~~i~~~~~k~a~~~~iS~ell~d~~~l~~---~i~~~l~~~i~~~~d~~~l~G~G~~~---p~Gil~~~-------~~~~ 290 (497) T protein:vir:10 224 ARVYEQVGKVANALTITDEGLRDAPELFN---FVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRS-------TGFT 290 (497) T ss_pred eeeEeeeeeeEeecHhHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHhhcCCCccc---cccccccc-------cccc Confidence 99999999999999999999876544333 33455679999999999999997655 44553211 1111 Q ss_pred eecCCCCCCceEEE---EEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 160 IDAGGTGSDNASAW---LVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 160 idAGgtg~~~tSi~---~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) +..+.+....++.+ +..+..+....+ .+....+..+.. .+.++-. .|. 
T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~---~~~~~~~-----------------~~~--- 341 (497) T protein:vir:10 291 ASSASSLFGATSATVSNVKFPADGTNGAF------VGQDTVASLKYG---RVVTGAA-----------------GSG--- 341 (497) T ss_pred ccccccchhhhhhhhhhhhhhcccccchh------hhhhHHHHHHHH---Hhhhhhh-----------------hhc--- Confidence 22211111111110 000100000000 000000000000 0000000 000 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCC-CCCEEEEeChHHHHHHHHHhhccccceeeecccCC---cc---eE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLG-MGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG---ER---VM 309 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~-~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g---~~---v~ 309 (330) .++ +.......++.+.+..++..++... ....+|.||++-...|++ .++....++-.....+ .. .. T Consensus 342 --~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~~~~~~ 414 (497) T protein:vir:10 342 --SGV----AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL-TKDANGQYMGGNFFGNAYGNPVNGGK 414 (497) T ss_pred --cch----hccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHH-hhcCCCceeccCcccccccccccCCc Confidence 001 0111122345555555655554322 222379999999999984 5666555555443322 22 23 Q ss_pred EECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 310 TFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 310 ~~~gvpir~~dal~~tE~~Vv 330 (330) .++|+||..++++..+...|- T Consensus 415 ~l~G~pV~~t~~~~~~~~~~G 435 (497) T protein:vir:10 415 NIWGVPVVTTPLIPLGTILVG 435 (497) T ss_pred eeeceeeEecCCCCCCceEEe Confidence 678999999999986553221 No 51 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.89 E-value=3e-10 Score=72.80 Aligned_cols=228 Identities=10% Similarity=-0.013 Sum_probs=138.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccceEE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) Q Consensus 1 
M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~ 80 (330) |++-.+.. ..-+-+......||+.+.+.|+|++..+.+....+ .+++.+.++-|+++|..=++.+++++.++. T Consensus 1 Ma~~~~~~------gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~ 73 (315) T protein:vir:80 1 MADDFLSA------GKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASVDVS 73 (315) T ss_pred CCCCcCCc------CceEcchHHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeCCcccccccccee Confidence 77532110 11223445567899999999999999988876533 578899999999999999999999999999 Q ss_pred EEEEEEEEecchhhhhHHHHHhCCC--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 81 QVTDNCGMLEAYAEVDKALADLNGN--TAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 81 ~~~~~l~ilgg~~eVDk~la~~~g~--~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +++-..+-+++.+.|-+.+.+.... .+.++..-.....++++..+..++|||+....+....|+.. T Consensus 74 ~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~------------ 141 (315) T protein:vir:80 74 AFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHT------------ 141 (315) T ss_pred eeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccc------------ Confidence 9999999999999998888766443 22345555567789999999999999963221111111100 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) ..+. 
..+ T Consensus 142 ----~~~~----------------------~~~----------------------------------------------- 148 (315) T protein:vir:80 142 ----SLNK----------------------TKN----------------------------------------------- 148 (315) T ss_pred ----cccc----------------------ccc----------------------------------------------- Confidence 0000 000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccc-----cceeeecccCCcceEEECC Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKI-----ANNLTWETVSGERVMTFDG 313 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~-----~~~l~~~~~~g~~v~~~~g 313 (330) .++.+ .....++.+++ .+.. ........+|.||++.+..|++. ++.. ...+..+-..| .+-.+.| T Consensus 149 -~~~~~----~~~~~d~~~~~-~~~~--~~~~~~~~~~imn~~~~~~L~~l-~~~~g~~~~g~~~~~~~~~g-~~~tl~G 218 (315) T protein:vir:80 149 -IVDAT----DSATADLVKAV-GLIA--GAGLQVPNGVALDPAFSFALSTE-VYPKGSPLAGQPMYPAAGFA-GLDNWRG 218 (315) T ss_pred -eeecc----ccchHHHHHHH-HHHh--hccCccceEEEEcHHHHHHHHHH-hhccCCcccccccccccccC-CCceecc Confidence 00000 00112333222 1111 11111223699999999999864 2221 12222222222 2357899 Q ss_pred eEEEEEeeccCCcc-----cc--C Q lcl|Aclame:pro 314 IPVQRTDALLNTES-----RV--V 330 (330) Q Consensus 314 vpir~~dal~~tE~-----~V--v 330 (330) .||..++++..... 
.+ + T Consensus 219 ~PV~~~~~~~~~~~~~~~~~~~~~ 242 (315) T protein:vir:80 219 LNVGASSTVSGAPEMSPASGVKAI 242 (315) T ss_pred eeeEecCcCCcccccccccccEEE Confidence 99999999864321 11 1 No 52 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.88 E-value=5.6e-10 Score=71.32 Aligned_cols=233 Identities=18% Similarity=0.188 Sum_probs=140.9 Q ss_pred CCc--cccc----cccH-HHHHhhcCcccchHHHHHHHhccchhHhh-cceeeccCCccceeEEEeccCCcceeecCCcc Q lcl|Aclame:pro 1 MAT--LSTN----NPTM-ADVAKRLDPNGKVDIIVEMLNQTNPVLQD-MTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGV 72 (330) Q Consensus 1 M~~--~~~~----a~TL-~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~-lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~ 72 (330) |+. +... +++. .....-+-+......|||.+.+.+++... ...+....+ ...+.+.++-|.++|..=++.+ T Consensus 52 ~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g-~~~~p~~t~~~~a~wv~E~~~~ 130 (366) T protein:vir:57 52 FAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNG-NLSMPRLSGGATAGYVGEGKDV 130 (366) T ss_pred HHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCC-ceEEEEEeCCcceeeeccCccc Confidence 000 0000 0000 00011112444556799999999988765 444444333 4678888999999999999999 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccc Q lcl|Aclame:pro 73 LPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSL 151 (330) Q Consensus 73 ~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~ 151 (330) ++++.++.+++-..+-+++.+.|-+.+.+... +..++ -.....++++..+..+|++||... 
+++.||..- T Consensus 131 ~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~---i~~~l~~a~~~~~d~a~l~G~G~~--~~p~Gi~~~---- 201 (366) T protein:vir:57 131 VATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQL---LLGDILSAIATREDKAFLRDDGTG--DTPKGMKAV---- 201 (366) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHhhccCCCC--ccccceeec---- Confidence 99999999999999999999999999887643 44444 334567999999999999997432 123343110 Q ss_pred ccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEec Q lcl|Aclame:pro 152 SAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRD 231 (330) Q Consensus 152 t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d 231 (330) ++. ++. .+.+ .+ T Consensus 202 ----------~~~-~~~-----~~~~--------------------------------~~-------------------- 213 (366) T protein:vir:57 202 ----------ATA-ANR-----LVAW--------------------------------TG-------------------- 213 (366) T ss_pred ----------ccc-ccc-----eeec--------------------------------cc-------------------- Confidence 000 000 0000 00 Q ss_pred cccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEE Q lcl|Aclame:pro 232 WRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTF 311 (330) Q Consensus 232 ~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~ 311 (330) ...+ .+..+.+++++. +....++.......|+||++....|+. 
.++.....+-.....| .+ T Consensus 214 -------t~~~------~~~~~~~~~~~~-~~~~~~~~~~~~a~~vmn~~~~~~L~~-lkd~~G~~l~~~~~~g----~l 274 (366) T protein:vir:57 214 -------TAIN------LTTIDEYLDSLI-LKHMDSNSNMIRCGWGLSNRTYMTLFG-LRDGNGNKVYPEMSQG----IL 274 (366) T ss_pred -------cccc------hhhHHHHHHHHH-HhhhccccccccCEEEecHHHHHHHHh-hhccCCceeccCCCCC----ee Confidence 0011 012233444432 334455666677889999999999996 4555444554333233 57 Q ss_pred CCeEEEEEeeccCC------ccccC Q lcl|Aclame:pro 312 DGIPVQRTDALLNT------ESRVV 330 (330) Q Consensus 312 ~gvpir~~dal~~t------E~~Vv 330 (330) .|+||..++++..+ +..++ T Consensus 275 ~G~Pvv~s~~ip~~~~~~~~~~~i~ 299 (366) T protein:vir:57 275 KGYPIQRTSAIPANLGDDGNESEIY 299 (366) T ss_pred cceeeEEccccccccccCCCccEEE Confidence 99999999999752 22232 No 53 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.87 E-value=1.3e-09 Score=69.35 Aligned_cols=234 Identities=16% Similarity=0.137 Sum_probs=141.2 Q ss_pred CCcc--ccccc-----cHHHHHhhcCcccchHHHHHHHhccchhHhh-cceeeccCCccceeEEEeccCCcceeecCCcc Q lcl|Aclame:pro 1 MATL--STNNP-----TMADVAKRLDPNGKVDIIVEMLNQTNPVLQD-MTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGV 72 (330) Q Consensus 1 M~~~--~~~a~-----TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~-lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~ 72 (330) |+.- +.... +.....-.+-+......|||.+.+.++|++. ..++...++ .+.+.+.++-|.++|..=++.. 
T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g-~~~~p~~~~~~~a~~v~Eg~~~ 191 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNG-NMSLPRLAGGATASYTGENQDA 191 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCc-ceEEEEEeCCcceeeeccCccc Confidence 1100 00000 0000001122445567899999999998776 344444333 3778888999999999999999 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccc Q lcl|Aclame:pro 73 LPNKSSTAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSL 151 (330) Q Consensus 73 ~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~ 151 (330) ++++.++.+++-..+-+++.+.|-+.+.+.. .+..++ -.....++++.+..++|++||... ..+.||.... T Consensus 192 ~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~---i~~~l~~ai~~~~d~~~l~G~G~~--~~p~Gi~~~~--- 263 (428) T protein:vir:10 192 KVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQL---VLQDILTAISVREDKAFMRDDGTG--DTPIGMKARA--- 263 (428) T ss_pred cccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHHhccCCCC--cccccccccc--- Confidence 9999999999999999999999999987753 344433 345567999999999999997432 2444553210 Q ss_pred ccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEec Q lcl|Aclame:pro 152 SAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRD 231 (330) Q Consensus 152 t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d 231 (330) + .+ +.+.+-+..+ T Consensus 264 -----------~-~~----------------~~~~~~~~~~--------------------------------------- 276 (428) T protein:vir:10 264 -----------T-QW----------------NRLLPWAADA--------------------------------------- 276 (428) T ss_pred -----------c-cc----------------cccccccccc--------------------------------------- Confidence 0 00 0000000000 Q ss_pred cccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEE Q lcl|Aclame:pro 232 
WRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTF 311 (330) Q Consensus 232 ~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~ 311 (330) ..++ ....++++.+ ......++....+..|+||.+...+|+. .++.....+-.....| .+ T Consensus 277 ------~~~~--------~~~~~~~~~~-~~~~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~i~~~~~~g----~l 336 (428) T protein:vir:10 277 ------AVNL--------DTIDTYLDSI-ILMSMDGNSNMISSGWGMSNRTYMKLFG-LRDGNGNKVYPEMAQG----ML 336 (428) T ss_pred ------cccH--------HHHHHHHHHH-HHhhhccccccccCEEEEcHHHHHHHHH-hhccCCceeccCCCCC----ee Confidence 0011 1112233322 2234445555666789999999999986 4555545554433333 48 Q ss_pred CCeEEEEEeeccCC------ccccC Q lcl|Aclame:pro 312 DGIPVQRTDALLNT------ESRVV 330 (330) Q Consensus 312 ~gvpir~~dal~~t------E~~Vv 330 (330) .|+||..+|++... +..++ T Consensus 337 ~G~pv~~~~~~p~~~~~~~~~~~i~ 361 (428) T protein:vir:10 337 KGYPIQRTSAIPANLGEGGKESEIY 361 (428) T ss_pred eceeeEEeccccccccCCCccceEE Confidence 99999999998642 22333 No 54 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.85 E-value=2.4e-10 Score=73.27 Aligned_cols=229 Identities=14% Similarity=0.118 Sum_probs=135.4 Q ss_pred CCc------cc--cccccHHHHHhhcCcccchHHHHHHHhccchhHhh-cceeeccCCccceeEEEeccCCcceeecCCc Q lcl|Aclame:pro 1 MAT------LS--TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQD-MTAIEGNLPTGHRTSVRTGLPTPTWRKLYGG 71 (330) Q Consensus 1 M~~------~~--~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~-lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g 71 (330) +.- .. ....|...-.-.+ +......+|..+.+.+.+|.. ......+++..+.+.+.++-|.+.|..=++. 
T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g~~~-~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~ 175 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNPNVL-SRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAE 175 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCCccc-cccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeeccccc Confidence 000 00 0000111100011 122333455555555666654 4555555555577889999999999999999 Q ss_pred cCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcc Q lcl|Aclame:pro 72 VLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNS 150 (330) Q Consensus 72 ~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~ 150 (330) ++++..++.+++-..+-+++.+.|-+.+.+... +..++-. ....++++....+.|+|||. .|+ |+-. T Consensus 176 ~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~i~~~~d~~~l~G~G--~p~---Gi~~---- 243 (390) T protein:vir:62 176 IPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLV---SDAGPAIGDAMGRHFITGTG--QPR---GILT---- 243 (390) T ss_pred ccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHH---HHHHHHHHHHHHhhhhccCC--ccc---cccc---- Confidence 999999999999999999999999999988744 4444333 34568999999999999963 222 1100 Q ss_pred cccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEe Q lcl|Aclame:pro 151 LSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLR 230 (330) Q Consensus 151 ~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~ 230 (330) .-+.+. .. . 
T Consensus 244 -----------~~~~~~------------------------~~----------~-------------------------- 252 (390) T protein:vir:62 244 -----------DASPAT------------------------AT----------F-------------------------- 252 (390) T ss_pred -----------cccccc------------------------cc----------e-------------------------- Confidence 000000 00 0 Q ss_pred ccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEE Q lcl|Aclame:pro 231 DWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMT 310 (330) Q Consensus 231 d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~ 310 (330) ..... +....++|+++. +.++.....+.+|+||++...+|+. .+++...++-........... T Consensus 253 --------~~~~~----~~~~~~~l~~~~----~~l~~~~~~~a~~vmn~~~~~~L~~-lkd~~g~~l~~~~~~~g~~~~ 315 (390) T protein:vir:62 253 --------LATDT----DSKVSDALIDLF----HEVPSAYRANAKYVVNDLRAAQMRK-LKDANGQYLWQSGLTVGAPSL 315 (390) T ss_pred --------ecccc----cccchHHHHHHH----HhhhhhhhcCCEEEEchHHHHHHHH-hhccCCCeeecCCcCCCccce Confidence 00000 001123444332 2232223345689999999999995 567666666555555445568 Q ss_pred ECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 311 FDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 311 ~~gvpir~~dal~~tE~~Vv 330 (330) +.|.||..+|++..++-..- T Consensus 316 l~G~Pv~~~~~~p~~~i~~g 335 (390) T protein:vir:62 316 FNGKVVETDDGMPADKILFA 335 (390) T ss_pred ecccceEEecCCCCccEEEe Confidence 99999999999987542111 No 55 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.84 E-value=3.9e-10 Score=72.17 Aligned_cols=230 Identities=7% Similarity=-0.008 Sum_probs=138.2 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 
M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) ...+.....|... ...+-|......||+.+.+.++|++.+.++...++.+ +.+...++.+.+.|..=++..++ +..+ T Consensus 116 ~~~~~~~~~~t~~-g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~ 194 (415) T protein:vir:46 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) T ss_pred hhhhhhccccccC-CcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccc Confidence 0000001111111 2223444566789999999999999999887765532 33334456677788887777886 5689 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-..+-+++.+.|.+.+.+...- ++...-.....++++..+...|++|+....+..+. T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~---------------- 256 (415) T protein:vir:46 195 FFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS---------------- 256 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCCccccc---------------- Confidence 999999999999999999999887441 23333345567899999999999997432211100 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +.....+ . . 
T Consensus 257 -~~~~~~~-------------------------------------------~--~------------------------- 265 (415) T protein:vir:46 257 -SGFEKEG-------------------------------------------K--K------------------------- 265 (415) T ss_pred -ccccccc-------------------------------------------c--e------------------------- Confidence 0000000 0 0 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .. ..+....++|++++ ..++.......+|+||++....|+. ..+....++-.....+...-.++|.||+. T Consensus 266 ~~-----~~~~~~~~~i~~~~----~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~i~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:46 266 LE-----VKKAKSLDDIKDAI----NLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred ec-----cccccchHHHHHHH----HhhhhhccCCCEEEEcHHHHHHHHH-hhccCCCeeeccCcCCCCCccccceeeEE Confidence 00 00111233444433 2222233345689999999999986 56665555544444444456789999999 Q ss_pred EeeccC-Cc-cc-cC Q lcl|Aclame:pro 319 TDALLN-TE-SR-VV 330 (330) Q Consensus 319 ~dal~~-tE-~~-Vv 330 (330) ++++.. +. .. 
++ T Consensus 336 ~~~~~~~~~~~~~~~ 350 (415) T protein:vir:46 336 LPDEVLGQKGNNTLI 350 (415) T ss_pred eccccccCCCccEEE Confidence 998874 22 21 22 No 56 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.84 E-value=3.9e-10 Score=72.17 Aligned_cols=230 Identities=7% Similarity=-0.008 Sum_probs=138.2 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) ...+.....|... ...+-|......||+.+.+.++|++.+.++...++.+ +.+...++.+.+.|..=++..++ +..+ T Consensus 116 ~~~~~~~~~~t~~-g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~ 194 (415) T protein:vir:47 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) T ss_pred hhhhhhccccccC-CcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccc Confidence 0000001111111 2223444566789999999999999999887765532 33334456677788887777886 5689 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-..+-+++.+.|.+.+.+...- ++...-.....++++..+...|++|+....+..+. 
T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~---------------- 256 (415) T protein:vir:47 195 FFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS---------------- 256 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCCccccc---------------- Confidence 999999999999999999999887441 23333345567899999999999997432211100 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +.....+ . . T Consensus 257 -~~~~~~~-------------------------------------------~--~------------------------- 265 (415) T protein:vir:47 257 -SGFEKEG-------------------------------------------K--K------------------------- 265 (415) T ss_pred -ccccccc-------------------------------------------c--e------------------------- Confidence 0000000 0 0 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .. ..+....++|++++ ..++.......+|+||++....|+. ..+....++-.....+...-.++|.||+. T Consensus 266 ~~-----~~~~~~~~~i~~~~----~~~~~~~~~~~~~v~n~~~~~~L~~-lkd~~G~~i~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:47 266 LE-----VKKAKSLDDIKDAI----NLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred ec-----cccccchHHHHHHH----HhhhhhccCCCEEEEcHHHHHHHHH-hhccCCCeeeccCcCCCCCccccceeeEE Confidence 00 00111233444433 2222233345689999999999986 56665555544444444456789999999 Q ss_pred EeeccC-Cc-cc-cC Q lcl|Aclame:pro 319 TDALLN-TE-SR-VV 330 (330) Q Consensus 319 ~dal~~-tE-~~-Vv 330 (330) ++++.. +. .. 
++ T Consensus 336 ~~~~~~~~~~~~~~~ 350 (415) T protein:vir:47 336 LPDEVLGQKGNNTLI 350 (415) T ss_pred eccccccCCCccEEE Confidence 998874 22 21 22 No 57 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.80 E-value=8.2e-10 Score=70.40 Aligned_cols=230 Identities=8% Similarity=0.015 Sum_probs=138.2 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) .........|... ..-+-|......||+.+.+.++|++.+..+...++.+ +.+...++.+.+.|..=++.+++ +..+ T Consensus 116 ~~~~~~~~~~~~~-gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~ 194 (415) T protein:vir:81 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) T ss_pred hhhhhhccccccc-cccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccc Confidence 0000001111111 1123344566779999999999999999887654432 33445556677888877777876 4578 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-.++-+++.+.|.+.+.+...- ++...-.....+++.......|++|+....+... 
T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~----------------- 255 (415) T protein:vir:81 195 FFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGST----------------- 255 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc----------------- Confidence 999999999999999999999876431 2333344556789999999999999744322110 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +.+.. ..+. T Consensus 256 -----~~~~~------------------~~~~------------------------------------------------ 264 (415) T protein:vir:81 256 -----SSGFE------------------KEGK------------------------------------------------ 264 (415) T ss_pred -----ccccc------------------cccc------------------------------------------------ Confidence 00000 0000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) ... ..+....++|++++ ..++.......+|+||++....|+. .++.....+-.....+...-.++|.||+. T Consensus 265 -~~~---~~~~~~~~~i~~~~----~~~~~~~~~~~~~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:81 265 -KLE---VKKAKSLDDIKDAI----NLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred -ccc---cccccchhHHHHHH----HhhhhhccCCCEEEEcHHHHHHHHH-hhccCCceeeccCcCCCCCceecceeeEE Confidence 000 00011123444432 3333333445689999999999995 56665555544444444556899999999 Q ss_pred EeeccC-Cccc--cC Q lcl|Aclame:pro 319 TDALLN-TESR--VV 330 (330) Q Consensus 319 ~dal~~-tE~~--Vv 330 (330) ++++.. +... 
++ T Consensus 336 ~~~~~~~~~~~~~~~ 350 (415) T protein:vir:81 336 LPDEVLGQKGNNTLI 350 (415) T ss_pred ecccccCCCCccEEE Confidence 998874 2221 22 No 58 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.80 E-value=8.2e-10 Score=70.40 Aligned_cols=230 Identities=8% Similarity=0.015 Sum_probs=138.2 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) .........|... ..-+-|......||+.+.+.++|++.+..+...++.+ +.+...++.+.+.|..=++.+++ +..+ T Consensus 116 ~~~~~~~~~~~~~-gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~ 194 (415) T protein:vir:79 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) T ss_pred hhhhhhccccccc-cccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccc Confidence 0000001111111 1123344566779999999999999999887654432 33445556677888877777876 4578 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-.++-+++.+.|.+.+.+...- ++...-.....+++.......|++|+....+... 
T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~----------------- 255 (415) T protein:vir:79 195 FFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGST----------------- 255 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc----------------- Confidence 999999999999999999999876431 2333344556789999999999999744322110 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +.+.. ..+. T Consensus 256 -----~~~~~------------------~~~~------------------------------------------------ 264 (415) T protein:vir:79 256 -----SSGFE------------------KEGK------------------------------------------------ 264 (415) T ss_pred -----ccccc------------------cccc------------------------------------------------ Confidence 00000 0000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) ... ..+....++|++++ ..++.......+|+||++....|+. .++.....+-.....+...-.++|.||+. T Consensus 265 -~~~---~~~~~~~~~i~~~~----~~~~~~~~~~~~~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:79 265 -KLE---VKKAKSLDDIKDAI----NLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred -ccc---cccccchhHHHHHH----HhhhhhccCCCEEEEcHHHHHHHHH-hhccCCceeeccCcCCCCCceecceeeEE Confidence 000 00011123444432 3333333445689999999999995 56665555544444444556899999999 Q ss_pred EeeccC-Cccc--cC Q lcl|Aclame:pro 319 TDALLN-TESR--VV 330 (330) Q Consensus 319 ~dal~~-tE~~--Vv 330 (330) ++++.. +... 
++ T Consensus 336 ~~~~~~~~~~~~~~~ 350 (415) T protein:vir:79 336 LPDEVLGQKGNNTLI 350 (415) T ss_pred ecccccCCCCccEEE Confidence 998874 2221 22 No 59 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.80 E-value=8.2e-10 Score=70.40 Aligned_cols=230 Identities=8% Similarity=0.015 Sum_probs=138.2 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) .........|... ..-+-|......||+.+.+.++|++.+..+...++.+ +.+...++.+.+.|..=++.+++ +..+ T Consensus 116 ~~~~~~~~~~~~~-gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~ 194 (415) T protein:vir:98 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) T ss_pred hhhhhhccccccc-cccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccc Confidence 0000001111111 1123344566779999999999999999887654432 33445556677888877777876 4578 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-.++-+++.+.|.+.+.+...- ++...-.....+++.......|++|+....+... 
T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~----------------- 255 (415) T protein:vir:98 195 FFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGST----------------- 255 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc----------------- Confidence 999999999999999999999876431 2333344556789999999999999744322110 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +.+.. ..+. T Consensus 256 -----~~~~~------------------~~~~------------------------------------------------ 264 (415) T protein:vir:98 256 -----SSGFE------------------KEGK------------------------------------------------ 264 (415) T ss_pred -----ccccc------------------cccc------------------------------------------------ Confidence 00000 0000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) ... ..+....++|++++ ..++.......+|+||++....|+. .++.....+-.....+...-.++|.||+. T Consensus 265 -~~~---~~~~~~~~~i~~~~----~~~~~~~~~~~~~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:98 265 -KLE---VKKAKSLDDIKDAI----NLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred -ccc---cccccchhHHHHHH----HhhhhhccCCCEEEEcHHHHHHHHH-hhccCCceeeccCcCCCCCceecceeeEE Confidence 000 00011123444432 3333333445689999999999995 56665555544444444556899999999 Q ss_pred EeeccC-Cccc--cC Q lcl|Aclame:pro 319 TDALLN-TESR--VV 330 (330) Q Consensus 319 ~dal~~-tE~~--Vv 330 (330) ++++.. +... 
++ T Consensus 336 ~~~~~~~~~~~~~~~ 350 (415) T protein:vir:98 336 LPDEVLGQKGNNTLI 350 (415) T ss_pred ecccccCCCCccEEE Confidence 998874 2221 22 No 60 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.79 E-value=6.4e-10 Score=70.98 Aligned_cols=209 Identities=11% Similarity=0.059 Sum_probs=135.5 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCc-cceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPT-GHRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~-~~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) |.+. |..+ ...+-|......||+.+.+.++|++.++.+...++. .+.+.+.++-|.++|..=++..++ +..+ T Consensus 123 ~~~~-----~~~~-gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 196 (397) T protein:vir:12 123 MSGI-----NDED-GGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPR 196 (397) T ss_pred cccc-----cccc-CcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeeccccccccccccc Confidence 2111 1111 011223445567999999999999998887765443 355677788899999999988886 5689 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-.++-+++.+.|.+.+.+..+ +..++-. ....++++......|++|+.... 
T Consensus 197 ~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~---~~l~~~~~~~~d~~il~G~g~~~-------------------- 253 (397) T protein:vir:12 197 FTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVA---KWFAKKSVVTRNNLILAAIASLK-------------------- 253 (397) T ss_pred ceeEEeeheeeEeeehhhHHHHhhchHHHHHHHH---HHHHHHHHHHHHHHHHhcccccc-------------------- Confidence 99999999999999999999887655 3444433 44678899999999999963221 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) |+| + T Consensus 254 -----------------------------~~g----~------------------------------------------- 257 (397) T protein:vir:12 254 -----------------------------KVD----I------------------------------------------- 257 (397) T ss_pred -----------------------------ccc----c------------------------------------------- Confidence 111 0 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ...++|++++.. .++.......+|+||++...+|+. .++....++-..+..+...-.+.|+||. T Consensus 258 ------------~~~~~i~~~~~~---~l~~~~~~~a~~~~n~~~~~~L~~-lkd~~G~~l~~~~~~~g~~~~l~G~pv~ 321 (397) T protein:vir:12 258 ------------DGLDGIKKALNV---TLDPMVAPGSIVLTNQDGYDWLDT-LKDGTGRYLLQPDPTNPTKKLLDGRPVV 321 (397) T ss_pred ------------ccHHHHHHHHhh---ccchhhhCCCEEEEcHHHHHHHHH-hhccCCceeecccccCCCCccccceeeE Confidence 001233332211 222233445789999999999985 4565555554444444445688999998 Q ss_pred EEeec-cCC---ccccC Q lcl|Aclame:pro 318 RTDAL-LNT---ESRVV 330 (330) Q Consensus 318 ~~dal-~~t---E~~Vv 330 (330) .+++. +.. 
...++ T Consensus 322 ~~~~~~~~~~~~~~~~~ 338 (397) T protein:vir:12 322 PFTNRVLKTQKGKAPLI 338 (397) T ss_pred EecccccccCCCccEEE Confidence 77653 332 22233 No 61 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.79 E-value=2.1e-10 Score=73.69 Aligned_cols=236 Identities=13% Similarity=0.103 Sum_probs=131.3 Q ss_pred CCccc----------cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE----EeccCCccee Q lcl|Aclame:pro 1 MATLS----------TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV----RTGLPTPTWR 66 (330) Q Consensus 1 M~~~~----------~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~----~~~lP~~~fR 66 (330) |-|+. ..++|..+.....-.......+|+.+.++|+||+.+..+.......+.... ....++..|. T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~ 80 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDET 80 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccc Confidence 22211 123344443333322334456899999999999999875432111111111 1123444555 Q ss_pred ecCCccCcccceEEEEEEEEEEecchhhhhHHHH-HhC--CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc-Ch--hh Q lcl|Aclame:pro 67 KLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALA-DLN--GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGI-AP--AE 140 (330) Q Consensus 67 ~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la-~~~--g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~-~p--~~ 140 (330) .=....++++.++.++.-.++-+...+.|-..+. +-. .|.+++ -...+.++++...++.|||||.+. +| +. 
T Consensus 81 ~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~---l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~ 157 (315) T protein:vir:41 81 GQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQK---IVTLLGEGISYVLEKYYLHGDTSSSDPLLRM 157 (315) T ss_pred cCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHH---HHHHHHHHHHHHHHHHhhccCCcCcCccccc Confidence 5555667788999999999999999999955554 432 354444 445677999999999999998643 22 22 Q ss_pred ccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEE Q lcl|Aclame:pro 141 FTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTH 220 (330) Q Consensus 141 F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~ 220 (330) ++|+-+. |++. ....+ T Consensus 158 ~~G~l~~--------------a~~~--------------------------------------~~~~~------------ 173 (315) T protein:vir:41 158 SDGWLKL--------------ASEK--------------------------------------LTESD------------ 173 (315) T ss_pred cccceec--------------cccc--------------------------------------ccccc------------ Confidence 3333110 0000 00000 Q ss_pred EeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccC---CCCCCEEEEeChHHHHHHHHHhhccccce Q lcl|Aclame:pro 221 YKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQ---LGMGRAVWYMNRNLREKLRLGIVDKIANN 297 (330) Q Consensus 221 ~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~---~~~g~~~~y~n~~v~~~L~~q~~~~~~~~ 297 (330) ++... .+...+.|+++ +..||. -+.++.+|+||++.+..+++...+ .... 
T Consensus 174 --------------------~~~~a--~~~~~d~l~~l----~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~-~g~~ 226 (315) T protein:vir:41 174 --------------------VDPEA--EDWPMNLFDTM----IESLPTPYRNNLPNMKFYVTWDIYRAYRDALKG-RETG 226 (315) T ss_pred --------------------ccccc--ccccHHHHHHH----HHhcChHHhhcCCceEEEEcHHHHHHHHHHhcc-CCCc Confidence 00000 00011223332 233332 134578999999999999875433 3344 Q ss_pred eeecccCCcceEEECCeEEEEEeeccCC---ccccC Q lcl|Aclame:pro 298 LTWETVSGERVMTFDGIPVQRTDALLNT---ESRVV 330 (330) Q Consensus 298 l~~~~~~g~~v~~~~gvpir~~dal~~t---E~~Vv 330 (330) +-.....+..+..+.|.||..++++... +..+. T Consensus 227 lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~il 262 (315) T protein:vir:41 227 LGDQALTGANSILYDGRPVQYVPALEALNDGKSRAL 262 (315) T ss_pred cccchhhcCCCceecccceEecccccccCCCCccEE Confidence 4333444556678889999999998752 22222 No 62 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.77 E-value=2.3e-09 Score=67.98 Aligned_cols=224 Identities=18% Similarity=0.180 Sum_probs=137.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCc-----cCcc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGG-----VLPN 75 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g-----~~~s 75 (330) |+.+.+ .+. .-+-+......|+|.+.+.++|++..+++...++ .+.+.++++-|.+.|..=++. 
++.+ T Consensus 1 ma~~t~-----~~g-g~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s 73 (305) T protein:vir:25 1 MADISR-----AEV-ASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTK-TTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) T ss_pred CCCccC-----Ccc-ceecCHHHHHHHHHHHHhhchhhhhcceeeccCC-cEEEEEEeCCcceEEeeccccccccccccc Confidence 777643 222 2244555678899999999999999999887544 477888999999999766654 4557 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccC Q lcl|Aclame:pro 76 KSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAE 154 (330) Q Consensus 76 ~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~ 154 (330) +.++.+++-..+-+++.+.|-+.+.+... +..++ -.....+++++.+++.|||||.... .+.+ T Consensus 74 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~---i~~~l~~~~a~~~d~a~~~G~g~~~--~~~~----------- 137 (305) T protein:vir:25 74 KVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTE---VAELGGQAIGKKLDQAVIFGTDKPA--SWVS----------- 137 (305) T ss_pred ccceeeEEeeeEEEEEeehhhHHHHhcchHHHHHH---HHHHHHHHHHHHHhhhheeccCCCC--Cccc----------- Confidence 88999999999999999999999988743 33333 3344569999999999999973211 1000 Q ss_pred CcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEecccc Q lcl|Aclame:pro 155 NKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRY 234 (330) Q Consensus 155 ~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~ 234 (330) .++.|....++ + T Consensus 138 ----------------------------~~~~~~~~~~~----------------~------------------------ 149 (305) T protein:vir:25 138 ----------------------------PALIPAAVTAG----------------Q------------------------ 149 (305) T ss_pred ----------------------------ccccccccccc----------------c------------------------ Confidence 00111000000 0 Q ss_pred EEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCe Q lcl|Aclame:pro 235 
VARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGI 314 (330) Q Consensus 235 v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gv 314 (330) .+... .+.....++.+.+..+...+-...-....|+||+.....|++ .+++....+-.. -.+.|. T Consensus 150 -----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~-lkd~~G~~i~~~-------~~l~G~ 214 (305) T protein:vir:25 150 -----AVEVV--GGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN-IRDANGNPVFRD-------DSFAGF 214 (305) T ss_pred -----ccccc--ccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHH-hhccCCceeecC-------Cccccc Confidence 00000 000112334444444443332222222359999999999985 355444443321 257899 Q ss_pred EEEEEeeccCC--ccccC Q lcl|Aclame:pro 315 PVQRTDALLNT--ESRVV 330 (330) Q Consensus 315 pir~~dal~~t--E~~Vv 330 (330) |+..++++... +..++ T Consensus 215 Pv~~~~~~~~~~~~~~~~ 232 (305) T protein:vir:25 215 RTFFNRNGAWDADAAIEV 232 (305) T ss_pred ceEEcCccCCCCCccEEE Confidence 99988887642 22222 No 63 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.76 E-value=1.1e-09 Score=69.80 Aligned_cols=224 Identities=15% Similarity=0.108 Sum_probs=141.7 Q ss_pred CCcccc--ccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MATLST--NNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~~~~--~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 78 (330) |-+-.. 
.+.|...-+..+-+......|+|.+.+.++|++..+.....++....+.+.++-|.++|.+=++..++++.+ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 80 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPE 80 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccc Confidence 333111 111112222234455667789999999999999999987765555566778888999999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-..+-+++.+.|.+.+.+... .++...-.....++++......+|+|+.+..|. T Consensus 81 f~~v~l~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~------------------- 139 (297) T protein:vir:95 81 VVPVTLKAHKLGIILVTSREALNYTW--KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFAN------------------- 139 (297) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc------------------- Confidence 99999999999999999999888643 123333345567999999999999997432210 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) +|+... 
+ ..+ T Consensus 140 -------------------------gi~~~~---~--------------~~~---------------------------- 149 (297) T protein:vir:95 140 -------------------------SVAKAA---K--------------DAN---------------------------- 149 (297) T ss_pred -------------------------cccccc---c--------------ccc---------------------------- Confidence 010000 0 000 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) ++... ....++|+++ +..|+.......+|+||++....|+. .++.....+. .. ....+.|+||.. T Consensus 150 -~~~~~----~~t~~~i~~~----~~~l~~~~~~~~~~v~~~~~~~~L~~-l~d~~G~~i~-~~----~~~~l~G~Pv~~ 214 (297) T protein:vir:95 150 -KVIGG----PINYDNILKL----QDALYDADVEPNAFVSKIQNRSALRE-ARDGNKVSIY-DK----AANTIDGITTVD 214 (297) T ss_pred -eeccc----ccCHHHHHHH----HHHhhhccCCcCEEEEcHHHHHHHHH-hhccCCceee-cC----CCCcccceeeEe Confidence 00000 0012334432 33333333344689999999999985 3444433332 22 223578999988 Q ss_pred EeeccCCccccC Q lcl|Aclame:pro 319 TDALLNTESRVV 330 (330) Q Consensus 319 ~dal~~tE~~Vv 330 (330) +.+...+...++ T Consensus 215 ~~~~~~~~~~~~ 226 (297) T protein:vir:95 215 LKSARFEKGDLL 226 (297) T ss_pred ecCCCCCCceEE Confidence 777666666655 No 64 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.75 E-value=2.4e-09 Score=67.84 Aligned_cols=209 Identities=11% Similarity=0.078 Sum_probs=134.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEec-cCCcceeecCCccCc-ccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTG-LPTPTWRKLYGGVLP-NKS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~-lP~~~fR~lN~g~~~-s~~ 77 (330) |.+. 
|..+ ..-+-|......||+.+.+.++|++.+......++.+ +.+.+..+ -+.++|.+=++.+++ +.. T Consensus 109 ~~~~-----t~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 182 (397) T protein:vir:49 109 KTDA-----SGSD-AGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDP 182 (397) T ss_pred hhcc-----cccc-CcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCcccccccccc Confidence 2211 1111 1112233455679999999999999988876544333 44544443 478899999999986 678 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+++..++-+++.+.|-+.+.+... +..++ -.....++++......|++|+....+ T Consensus 183 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~~~~~~~d~ai~~G~g~~~~------------------ 241 (397) T protein:vir:49 183 KLSLIKYTIKRYAGISTVTNSLLADSAENILAW---LSGWIAKKVVVTRNKAILEAIAALPT------------------ 241 (397) T ss_pred ceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHH---HHHHHHHHHHHHHHHHHHhhcccccc------------------ Confidence 999999999999999999998887643 33444 34456799999999999999643211 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) + .++ T Consensus 242 -------------------------------~---~~~------------------------------------------ 245 (397) T protein:vir:49 242 -------------------------------K---PTL------------------------------------------ 245 (397) T ss_pred -------------------------------c---ccc------------------------------------------ Confidence 0 000 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 
RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) ...+++++++ ..|+.......+||||++....|+. .++.....+-..+..+...-.+.|+|| T Consensus 246 -------------~~~d~i~~~~----~~l~~~~~~~a~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G~PV 307 (397) T protein:vir:49 246 -------------TKWDDIIDLE----AKVDPAIKQTSFFLTNTSGFTALKK-VKNALGDYLMERDVKSPTGYSIDGFAV 307 (397) T ss_pred -------------ccHHHHHHHH----HhhhhhhcCCCEEEEcHHHHHHHHH-hhcCCCceeeccCcCCCCCceecceee Confidence 0113344332 2233233445789999999999996 455555555555555555568999999 Q ss_pred EEEeec--cCCc-cc--cC Q lcl|Aclame:pro 317 QRTDAL--LNTE-SR--VV 330 (330) Q Consensus 317 r~~dal--~~tE-~~--Vv 330 (330) ..+++- .+.. .. ++ T Consensus 308 ~~~~~~~~~~~~~~~~~i~ 326 (397) T protein:vir:49 308 KEVADRWLANGTGGAMPLY 326 (397) T ss_pred EEecccccccccCCceeEE Confidence 987652 2211 11 22 No 65 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.74 E-value=5.4e-09 Score=65.89 Aligned_cols=230 Identities=14% Similarity=0.096 Sum_probs=137.3 Q ss_pred CCcccccc--ccHHHHHh----------hcCcccchHHHHHHHhccchhHhhcceeeccCCc-cceeEEEeccCCcceee Q lcl|Aclame:pro 1 MATLSTNN--PTMADVAK----------RLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPT-GHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~~~a--~TL~E~Ak----------~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~-~~~~~~~~~lP~~~fR~ 67 (330) +....... .+..|... .+-+......||+.+.+.++|++.++.+....+. .+.|.+.++.+++.|.. 
T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~ 171 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLS 171 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecc Confidence 00000000 11111110 0113345667999999999999999998765443 36678889999999999 Q ss_pred cCCccCccc--ceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccCh Q lcl|Aclame:pro 68 LYGGVLPNK--SSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGL 144 (330) Q Consensus 68 lN~g~~~s~--~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL 144 (330) -++..+.+. .++.+++-+.+-+++.+.|-+.+.+... +..++ =.....++++..+...|++|+...+ .+.|| T Consensus 172 e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~la~~~~~~~~~~il~G~g~~~--~~~gi 246 (404) T protein:vir:10 172 ENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDW---IINWFVDKVRITRNAEILYGAGGDE--HATGI 246 (404) T ss_pred ccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHH---HHHHHHHHHHHHHHHHHhhcCCCCC--cccce Confidence 999888764 6799999999999999999998877643 33333 3345679999999999999974332 23333 Q ss_pred hhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeee Q lcl|Aclame:pro 145 SPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWD 224 (330) Q Consensus 145 ~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~ 224 (330) ... . ++. +. 
T Consensus 247 ~~~---------------~-~~~-----------------------------------~~-------------------- 255 (404) T protein:vir:10 247 MTA---------------N-KFK-----------------------------------KI-------------------- 255 (404) T ss_pred eec---------------c-ccc-----------------------------------ee-------------------- Confidence 110 0 000 00 Q ss_pred eeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccC Q lcl|Aclame:pro 225 IGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVS 304 (330) Q Consensus 225 ~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~ 304 (330) - ..+....+++.+.+... ++.....+.+||||++....|+.. ++.....+-..... T Consensus 256 -----------------~---~~~~~~~~~~~~~~~~~---l~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~ 311 (404) T protein:vir:10 256 -----------------T---LPKSPALKDFKKCKNVE---LLNVFKATSSWIVNQDGFNYLDSL-EDKTGRPYLQPDPK 311 (404) T ss_pred -----------------e---ccccccHHHHHHHHHhh---hhccccCCCEEEEcHHHHHHHHHh-hccCCceeeccCcC Confidence 0 00111123443333221 222334456899999999999964 55444444333444 Q ss_pred CcceEEECCeEEEEEee-ccC-Cccc--cC Q lcl|Aclame:pro 305 GERVMTFDGIPVQRTDA-LLN-TESR--VV 330 (330) Q Consensus 305 g~~v~~~~gvpir~~da-l~~-tE~~--Vv 330 (330) +...-.++|.||..++. ++. +... 
++ T Consensus 312 ~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~ 341 (404) T protein:vir:10 312 DPTQYRFLGLPVIELPNDLLLSTESAIPVL 341 (404) T ss_pred CCCCccccceeeEEecccccCCCCCccEEE Confidence 44555789999986654 332 3222 22 No 66 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.74 E-value=4.3e-09 Score=66.46 Aligned_cols=209 Identities=10% Similarity=-0.003 Sum_probs=134.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCc-cceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPT-GHRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~-~~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) |.+- |..+ ...+-|......||+.+.+.++|++.++.+...++. .+.+.+..+-|.+.|.+=++..++ +..+ T Consensus 91 ~~~~-----t~~~-gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 164 (371) T protein:vir:81 91 MSEG-----SNQD-GGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQ 164 (371) T ss_pred hccC-----CCcc-CceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccc Confidence 2221 0000 111223345567999999999999999887764432 244555667788999988888875 6789 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-.++-+++.+.|-+.+.+.. 
.+..++ -.....+++++.....|++|+....| T Consensus 165 f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~a~~~~~~~~i~~g~g~~~~------------------- 222 (371) T protein:vir:81 165 FTLLQYQVKKYAGFFRVTNELLNDSTEAIVNT---LVRWIGDESRVTRNGLIINVLNTKAK------------------- 222 (371) T ss_pred eeeEEeeeeEEEEeehhhHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHHHhhcccccc------------------- Confidence 9999999999999999999887764 344444 34456789999999999998642110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) +| + T Consensus 223 ------------------------------~~----~------------------------------------------- 225 (371) T protein:vir:81 223 ------------------------------TA----I------------------------------------------- 225 (371) T ss_pred ------------------------------cc----c------------------------------------------- Confidence 00 0 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ...+++++++.. .++.......+|+||++....|+. .++.....+-.....+...-.+.|.||. T Consensus 226 ------------~~~~~i~~~~~~---~l~~~~~~~a~~vmn~~~~~~L~~-lkd~~g~~l~~~~~~~~~~~~l~G~pV~ 289 (371) T protein:vir:81 226 ------------ADLDGLKQIINV---QLDPVFRSTSSVIVNQDAFNWLDT-LKDQNGQYLLQPSISSPTGRQLLGLPVV 289 (371) T ss_pred ------------ccHHHHHHHHHh---hcchhhhcCCEEEEcHHHHHHHHH-hhccCCCeeeecccCCCCCceecceeEE Confidence 001222222211 112222344689999999999996 4555555665545555555788999999 Q ss_pred EEeeccCC----------ccccC Q lcl|Aclame:pro 318 RTDALLNT----------ESRVV 330 (330) Q Consensus 318 ~~dal~~t----------E~~Vv 330 (330) .+|++... 
+..++ T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~i~ 312 (371) T protein:vir:81 290 IVSNKVLANRVDGGTGAQFAPII 312 (371) T ss_pred EecccccCccccccccCCcceEE Confidence 99988631 11122 No 67 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.72 E-value=4.9e-09 Score=66.15 Aligned_cols=226 Identities=15% Similarity=0.108 Sum_probs=137.3 Q ss_pred CCc-c-c--cccccHHHH-HhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeec---CCcc Q lcl|Aclame:pro 1 MAT-L-S--TNNPTMADV-AKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKL---YGGV 72 (330) Q Consensus 1 M~~-~-~--~~a~TL~E~-Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~l---N~g~ 72 (330) |.. . . ..+++.... ..-+-|......||+.+.+.++|......+..+ ..+.+.+.+.-+++.|... +... T Consensus 131 l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~--~~~~~p~~~~~~~a~~~~~~~e~~~~ 208 (434) T protein:vir:62 131 IVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK--ENIKYPVLVKKAEAQGHKNERTNNEM 208 (434) T ss_pred hccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC--CceEEEEEecCCcccceecccccccc Confidence 000 0 0 000000000 001224445667999999999998887765543 2466777778888888654 5677 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccc Q lcl|Aclame:pro 73 LPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSL 151 (330) Q Consensus 73 ~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~ 151 (330) +.+..++.+++-..+-+++.+.|.+.+.+..+ +..++-. 
....++++..+...||+||-..+|- .| T Consensus 209 ~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~la~~~~~~~d~~~l~G~G~~~~~--~g-------- 275 (434) T protein:vir:62 209 PETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVM---DELKKAYVRKETQYMVNGDEANNIN--DG-------- 275 (434) T ss_pred cccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHH---HHHHHHHHHHHHHHHhccCCCCccc--cc-------- Confidence 88889999999999999999999999988754 5544433 4567999999999999997332210 00 Q ss_pred ccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEec Q lcl|Aclame:pro 152 SAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRD 231 (330) Q Consensus 152 t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d 231 (330) +.. +.++ . T Consensus 276 ---------------------------------~~~---~~~~---------------------------------~--- 283 (434) T protein:vir:62 276 ---------------------------------ALA---KKAV---------------------------------E--- 283 (434) T ss_pred ---------------------------------eee---cccc---------------------------------c--- Confidence 000 0000 0 Q ss_pred cccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeec--ccCCcceE Q lcl|Aclame:pro 232 WRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWE--TVSGERVM 309 (330) Q Consensus 232 ~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~--~~~g~~v~ 309 (330) .. ..+....++|++ ++..++.....+.+|+||++...+|++ .++....++-.. 
...+...- T Consensus 284 ---------~~---~~~~~~~d~l~~----l~~~l~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~g~~~ 346 (434) T protein:vir:62 284 ---------FK---TDEKNLYDALVK----MKNTPVKEVRKKARWVLNTAALTKIET-MKTDDGFPLLRPFNQAEGGIGY 346 (434) T ss_pred ---------cc---ccccchhhHHHH----HHhhcchhhhcCCEEEEcHHHHHHHHH-hhccCCCEeeccCCCccCCCCc Confidence 00 000111234443 344555444566789999999999996 466655555332 22334445 Q ss_pred EECCeEEEEEeeccCCcc---ccC Q lcl|Aclame:pro 310 TFDGIPVQRTDALLNTES---RVV 330 (330) Q Consensus 310 ~~~gvpir~~dal~~tE~---~Vv 330 (330) .++|.||..+|.+...++ .++ T Consensus 347 tl~G~pV~~~~~~~~~~~~~~~~i 370 (434) T protein:vir:62 347 TLLGFPVEEEDAIDIPDSPDTPVF 370 (434) T ss_pred eecceeeEEecCccCccCCCceEE Confidence 799999999999864332 222 No 68 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.70 E-value=6.1e-09 Score=65.62 Aligned_cols=210 Identities=10% Similarity=0.023 Sum_probs=130.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEe-ccCCcceeecCCccCccc-c Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRT-GLPTPTWRKLYGGVLPNK-S 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~-~lP~~~fR~lN~g~~~s~-~ 77 (330) |... |..+ ...+-|......||+.+.+.++|++.++......+.+ +.+.+.. ..|.+.|.+=++.++++. . 
T Consensus 109 ~~~~-----t~~~-gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 182 (397) T protein:vir:49 109 KTDG-----SGSD-AGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDP 182 (397) T ss_pred hhcc-----CCcc-CcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeecccccccccccc Confidence 2211 1111 0111233345679999999999998887766654433 4454443 357889999999998876 6 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) ++.+++-.++-+++.+.|.+.+.+... -++...-.....++++......|++|+....| T Consensus 183 ~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~------------------- 241 (397) T protein:vir:49 183 KLSLIRYAIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIGTLPN------------------- 241 (397) T ss_pred ceeeeEeeeeeeEeehhhHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhccccccc------------------- Confidence 899999999999999999998887644 12333334556789999999999999632110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) ++ + T Consensus 242 ------------------------------~~---~-------------------------------------------- 244 (397) T protein:vir:49 242 ------------------------------KP---T-------------------------------------------- 244 (397) T ss_pred ------------------------------cc---c-------------------------------------------- Confidence 00 0 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q 
Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ....++|+++ ...++........|+||++....|+. .++.....+-..+..+...-.++|.||+ T Consensus 245 -----------~~~~d~i~~~----~~~l~~~~~~~a~~v~n~~~~~~l~~-lkd~~g~~l~~~~~~~g~~~~l~G~pV~ 308 (397) T protein:vir:49 245 -----------LAKWDDIIDL----QAKVDPAIKQTSLFLTNTSGFTALKK-VKNAMGDYLMERDVKSPTGYSIDGFVVK 308 (397) T ss_pred -----------ccCHHHHHHH----HHhhhhhhcCCCEEEEcHHHHHHHHH-hhccCCceeecccccCCCCceecceeeE Confidence 0011234332 22333333445689999999999996 4555555554334444444579999999 Q ss_pred EEeecc--CC---ccccC Q lcl|Aclame:pro 318 RTDALL--NT---ESRVV 330 (330) Q Consensus 318 ~~dal~--~t---E~~Vv 330 (330) .|+... +. +..++ T Consensus 309 ~~~~~~~~~~~~~~~~~~ 326 (397) T protein:vir:49 309 EISDRFLPNGTGGAMPLY 326 (397) T ss_pred EecccccccccCCceeEE Confidence 876422 21 11222 No 69 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.69 E-value=4.5e-09 Score=66.33 Aligned_cols=217 Identities=10% Similarity=-0.014 Sum_probs=133.4 Q ss_pred CCcc--ccccccHHHHH----------hhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEe-ccCCccee Q lcl|Aclame:pro 1 MATL--STNNPTMADVA----------KRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRT-GLPTPTWR 66 (330) Q Consensus 1 M~~~--~~~a~TL~E~A----------k~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~-~lP~~~fR 66 (330) +.-+ ....++-.|.. -.+-|......||+.+.+.++|++.+.......+.+ +.+.... .-|.+.|. 
T Consensus 98 ~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 177 (404) T protein:vir:39 98 VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (404) T ss_pred HHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeee Confidence 0000 00000000100 001244456789999999999999998877655543 3344333 34778999 Q ss_pred ecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 67 KLYGGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 67 ~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) .=++..++ ++.++.+++-.++-+++.+.|-+.+.+... .++...-.....++++......+++|+.... T Consensus 178 ~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~-------- 247 (404) T protein:vir:39 178 AEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTA--ENILAWLSSWIAKKVVVTRNQAIIAAMGTVP-------- 247 (404) T ss_pred cCccccccccccceeeEEeeeeeEEeeehhHHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Confidence 99999986 679999999999999999999998887643 2333334455779999999999999862210 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) |++ T Consensus 248 -----------------------------------------~~~------------------------------------ 250 (404) T protein:vir:39 248 -----------------------------------------KKP------------------------------------ 250 (404) T ss_pred -----------------------------------------ccc------------------------------------ Confidence 000 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCC Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG 305 
(330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g 305 (330) +....+++++++.... +.......+|+||++....|+. .++.....+-.....+ T Consensus 251 ----------------------~~~~~~~i~~~~~~~~---~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~ 304 (404) T protein:vir:39 251 ----------------------TIAKFDDVITMINTSV---DPAIIATSSLLTNQSGLNKLAL-VKTAEGKYLLEPDPTK 304 (404) T ss_pred ----------------------ccccHHHHHHHHHHhh---hhhhccCCEEEEcHHHHHHHHH-hhccCCceeeccCcCC Confidence 0011233444332211 1112234689999999999995 5666655665555555 Q ss_pred cceEEECCeEEEEEeeccCCc-----cccC Q lcl|Aclame:pro 306 ERVMTFDGIPVQRTDALLNTE-----SRVV 330 (330) Q Consensus 306 ~~v~~~~gvpir~~dal~~tE-----~~Vv 330 (330) ...-.+.|.||..||+..... ..++ T Consensus 305 ~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~ 334 (404) T protein:vir:39 305 PNSYLIKGKKVIVVADRWLPNSGSTVYPLY 334 (404) T ss_pred CCcceecceeEEEecccccCccCCCccEEE Confidence 555689999999987633211 1122 No 70 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.66 E-value=3.2e-09 Score=67.16 Aligned_cols=230 Identities=8% Similarity=0.012 Sum_probs=137.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~-s~~t 78 (330) ......+..+... 
...+-|......||+.+.+.++|++.+.++...++.+ +.+...++.+.+.|..=++.+++ +..+ T Consensus 116 ~~~~~~~~~~~~~-g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~ 194 (415) T protein:vir:94 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) T ss_pred hhhhhhhcccccc-ccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceecccccccccccccc Confidence 0000000011111 1112344466789999999999999999888754432 33445566778888877777775 4678 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcce Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~ 158 (330) +.+++-.++-+++.+.|.+.+.+... .++...-.....+++.......|++|+....+..+. T Consensus 195 ~~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~---------------- 256 (415) T protein:vir:94 195 FFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS---------------- 256 (415) T ss_pred ceeeEeeheeeeeechhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccc---------------- Confidence 99999999999999999999887643 233333445577899999999999997443221100 Q ss_pred eeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEEe Q lcl|Aclame:pro 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) Q Consensus 159 vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI 238 (330) .+ .. +.+. . 
T Consensus 257 ------~~-~~-----------------~~~~------------~----------------------------------- 265 (415) T protein:vir:94 257 ------SG-FE-----------------KEGK------------K----------------------------------- 265 (415) T ss_pred ------cc-cc-----------------cccc------------c----------------------------------- Confidence 00 00 0000 0 Q ss_pred ecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEE Q lcl|Aclame:pro 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) Q Consensus 239 ~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~ 318 (330) .. ..+....++|++++ +.+........+|+||++....|+. .++.....+-.....+...-.++|.||+. T Consensus 266 --~~---~~~~~~~~~i~~~~----~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:94 266 --LE---VKKAKSLDDIKDAI----NLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred --cc---cccccchHHHHHHH----HhhhhhccCCCEEEEcHHHHHHHHH-hhccCCCeeeccCcCCCCCceecceeeEE Confidence 00 00111223444433 2222233345689999999999986 46665555544444444556899999999 Q ss_pred EeeccCCcc---ccC Q lcl|Aclame:pro 319 TDALLNTES---RVV 330 (330) Q Consensus 319 ~dal~~tE~---~Vv 330 (330) ++++..... 
.++ T Consensus 336 ~~~~~~~~~~~~~i~ 350 (415) T protein:vir:94 336 LPDEVLGQKGNNTLI 350 (415) T ss_pred ecccccCCCCccEEE Confidence 998875221 122 No 71 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.66 E-value=8.1e-09 Score=64.95 Aligned_cols=210 Identities=10% Similarity=0.023 Sum_probs=132.5 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEE-eccCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVR-TGLPTPTWRKLYGGVLPN-KS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~-~~lP~~~fR~lN~g~~~s-~~ 77 (330) |..- |..+ +..+-|......||+.+.+.++|++.++.+...++.+ +.+... +.-+.+.|..=++.++++ +. T Consensus 109 ~~~~-----t~~~-gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 182 (397) T protein:vir:48 109 KTDA-----SGSD-AGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDP 182 (397) T ss_pred hhcc-----CCcc-ccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeecccccccccccc Confidence 2110 1111 1112234455679999999999999998877654433 223332 345678899999999877 47 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) ++.+++-..+-+++.+.|-+.+.+... 
.++...-.....++++......|++|+....+ T Consensus 183 ~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~------------------- 241 (397) T protein:vir:48 183 KLYPIRYAIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIATLPT------------------- 241 (397) T ss_pred ceeeEEeeheeeeeehhhHHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 999999999999999999998887643 12333334457799999999999998632110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) .+ ++ T Consensus 242 ------------------------------~~---~~------------------------------------------- 245 (397) T protein:vir:48 242 ------------------------------KP---TL------------------------------------------- 245 (397) T ss_pred ------------------------------cc---cc------------------------------------------- Confidence 00 00 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ...++|++++ ..++.......+|+||++.+..|+. .++.....+-..+..+...-.++|.||. T Consensus 246 ------------~~~d~i~~~~----~~l~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~i~~~~~~~~~~~~l~G~PV~ 308 (397) T protein:vir:48 246 ------------TKWDDIIDLQ----AKVDPAIKQTSFFLTNTSGFTALKK-VKNAFGDYLMERDVKSPTGYSIDGFAVK 308 (397) T ss_pred ------------ccHHHHHHHH----HHhhhhhcCCCEEEECHHHHHHHHH-hhcCCCceeeccCcCCCCCceeccceeE Confidence 0012333322 2233233345789999999999996 4555555555555555555689999999 Q ss_pred EEeeccC--C---ccccC Q lcl|Aclame:pro 318 RTDALLN--T---ESRVV 330 (330) Q Consensus 318 ~~dal~~--t---E~~Vv 330 (330) .+|+... . 
+..++ T Consensus 309 ~~~~~~~~~~~~~~~~~~ 326 (397) T protein:vir:48 309 EVADRWLANASSGAMPLY 326 (397) T ss_pred EecccccCCcCCCceEEE Confidence 8875332 1 22222 No 72 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.64 E-value=6.1e-09 Score=65.64 Aligned_cols=216 Identities=12% Similarity=0.031 Sum_probs=132.7 Q ss_pred CCccccccccHHHHH----------hhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEec-cCCcceeec Q lcl|Aclame:pro 1 MATLSTNNPTMADVA----------KRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTG-LPTPTWRKL 68 (330) Q Consensus 1 M~~~~~~a~TL~E~A----------k~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~-lP~~~fR~l 68 (330) +.......+...|.. -.+-|......||+.+.+.++|++.++......+.+ +.+.+..+ -+.+.|..= T Consensus 100 ~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 179 (408) T protein:vir:74 100 MVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEE 179 (408) T ss_pred HHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCccccccccc Confidence 000000000111100 011233455789999999999999998877654432 34444443 356679999 Q ss_pred CCccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhh Q lcl|Aclame:pro 69 YGGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSP 146 (330) Q Consensus 69 N~g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~ 146 (330) ++.+++ ++.++.+++-.++-+++.+.|-+.+.+... +...+-. 
....++++......|++||....| T Consensus 180 ~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~~~~~~d~~il~G~G~~~~-------- 248 (408) T protein:vir:74 180 DGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLS---SWIAKKVVVTRNQAIIAAMGTVPK-------- 248 (408) T ss_pred ccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHH---HHHHHHHHHHHHHHHhhccccccc-------- Confidence 999987 678999999999999999999999887644 3444433 446789999999999998632110 Q ss_pred hhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeee Q lcl|Aclame:pro 147 RYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIG 226 (330) Q Consensus 147 R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~G 226 (330) ++ + T Consensus 249 -----------------------------------------~~---~--------------------------------- 251 (408) T protein:vir:74 249 -----------------------------------------KP---T--------------------------------- 251 (408) T ss_pred -----------------------------------------cc---c--------------------------------- Confidence 00 0 Q ss_pred eEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCc Q lcl|Aclame:pro 227 LTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGE 306 (330) Q Consensus 227 l~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~ 306 (330) ....++|++++.. .++.......+|+||++....|+. .++.....+-..+..+. T Consensus 252 ----------------------~~~~~~i~~~~~~---~l~~~~~~~a~~v~n~~~~~~l~~-lkd~~G~~l~~~~~~~~ 305 (408) T protein:vir:74 252 ----------------------IANFDDVITMINT---SVDPAIIATSSLLTNQSGLNKLAL-VKTAEGKYLLEPDPTKP 305 (408) T ss_pred ----------------------cccHHHHHHHHHH---hhhhhhcCCCEEEEcHHHHHHHHH-hhcCCCceEeccCcCCC Confidence 0012344443322 222223345689999999999996 45665666654455555 Q ss_pred ceEEECCeEEEEEee-cc-CC---ccccC Q lcl|Aclame:pro 307 RVMTFDGIPVQRTDA-LL-NT---ESRVV 330 (330) Q Consensus 307 ~v~~~~gvpir~~da-l~-~t---E~~Vv 330 (330) ..-.+.|.||..++. .+ .. 
+..++ T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~~~~~~i~ 334 (408) T protein:vir:74 306 NSYLIKGKQVIVVADRWLPNSGSTVYPLY 334 (408) T ss_pred CCceecceeeEEecCcccccccCCcceEE Confidence 556899999998864 22 22 22222 No 73 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.63 E-value=1.4e-08 Score=63.70 Aligned_cols=216 Identities=13% Similarity=0.008 Sum_probs=133.2 Q ss_pred CCccccccccHHHH----------HhhcCcccchHHHHHHHhccchhHhhcceeeccCCccc-eeEEEe-ccCCcceeec Q lcl|Aclame:pro 1 MATLSTNNPTMADV----------AKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGH-RTSVRT-GLPTPTWRKL 68 (330) Q Consensus 1 M~~~~~~a~TL~E~----------Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~-~~~~~~-~lP~~~fR~l 68 (330) +.--+...+...|. ..-+-|......||+.+.+.++|++.+.+.....+.+. .+.... .-+.+.|..= T Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 179 (408) T protein:vir:10 100 MVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAE 179 (408) T ss_pred HhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecC Confidence 00000000011110 01112344567799999999999999999887555443 233333 3477889999 Q ss_pred CCccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhh Q lcl|Aclame:pro 69 YGGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSP 146 (330) Q Consensus 69 N~g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~ 146 (330) ++.+++ +..++.+++-.++-+++.+.|-+.+.+..+ +..++-. 
....++++......|++|+....+ T Consensus 180 ~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~~~~~~~~~il~g~g~~~~-------- 248 (408) T protein:vir:10 180 DGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLS---SWIAKKVVVTRNQAIIEVMKAAPK-------- 248 (408) T ss_pred ccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHH---HHHHHHHHHHHHHHHhhccccccc-------- Confidence 999986 568999999999999999999999887643 4555443 345789999999999988632100 Q ss_pred hhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeee Q lcl|Aclame:pro 147 RYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIG 226 (330) Q Consensus 147 R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~G 226 (330) + . T Consensus 249 -----------------------------------------~---~---------------------------------- 250 (408) T protein:vir:10 249 -----------------------------------------K---P---------------------------------- 250 (408) T ss_pred -----------------------------------------c---c---------------------------------- Confidence 0 0 Q ss_pred eEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCc Q lcl|Aclame:pro 227 LTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGE 306 (330) Q Consensus 227 l~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~ 306 (330) +....++|++++...+ +....+..+|+||++...+|+. .++.....+-....... T Consensus 251 ---------------------~~~~~~~l~~~~~~~~---~~~~~~~a~~v~n~~~~~~l~~-lkd~~G~~i~~~~~~~~ 305 (408) T protein:vir:10 251 ---------------------TIAKFDDVITMINTAV---DPAIIATSSLLTNQSGLNKLAL-VKTAEGKYLLEPDPTKP 305 (408) T ss_pred ---------------------ccccHHHHHHHHHHhh---hhhhccCCEEEEcHHHHHHHHH-hhccCCceEeccCcCCC Confidence 0012345555443322 2222345689999999999996 45655555543344444 Q ss_pred ceEEECCeEEEEEeec--cCCcc---ccC Q lcl|Aclame:pro 307 RVMTFDGIPVQRTDAL--LNTES---RVV 330 (330) Q Consensus 307 ~v~~~~gvpir~~dal--~~tE~---~Vv 330 (330) ..-.+.|.||..+++. 
....+ .++ T Consensus 306 ~~~~l~G~PV~~~~~~~~~~~~~~~~~i~ 334 (408) T protein:vir:10 306 NSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) T ss_pred CCceecceeeEEecccccCccCCCceEEE Confidence 4458899999998642 22221 122 No 74 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.63 E-value=5.7e-09 Score=65.78 Aligned_cols=215 Identities=9% Similarity=-0.040 Sum_probs=133.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEec-cCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTG-LPTPTWRKLYGGVLPN-KS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~-lP~~~fR~lN~g~~~s-~~ 77 (330) +-.+.....|..+ ..-+-|......||+.+.+.++|++.++......+.+ +.+....+ -|.+.|..-++.++++ +. T Consensus 102 ~~~~~~~~~~~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 180 (395) T protein:vir:38 102 KNLVTSGTTGTGN-AGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDP 180 (395) T ss_pred HHHHhhccCccCC-CceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccccccccccccc Confidence 0000001111111 1112234455679999999999999988876644433 34444444 3677899999999876 58 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+++-.++-+++.+.|.+.+.+..+ +..++ -.....++++......|++|+.... 
T Consensus 181 ~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~la~~~~~~~~~~il~g~g~~~------------------- 238 (395) T protein:vir:38 181 ELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQW---LVNWAAKKDVVTRNAKILEVMGKAP------------------- 238 (395) T ss_pred ceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHH---HHHHHHHHHHHHHHHHHhhcccccc------------------- Confidence 999999999999999999999887643 34444 3455678999999999999863211 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) .. .+ T Consensus 239 -----------~~----------------------~~------------------------------------------- 242 (395) T protein:vir:38 239 -----------KK----------------------PT------------------------------------------- 242 (395) T ss_pred -----------cc----------------------cc------------------------------------------- Confidence 00 00 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) ....+++++++... ++.......+|+||++....|+. .++.....+-.....+...-.+.|.|| T Consensus 243 ------------~~~~~~i~~~~~~~---l~~~~~~~a~~v~n~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~~l~G~pV 306 (395) T protein:vir:38 243 ------------ISQFDNIKDLENNT---LDPAIESTSSFITNQSGYNILSK-VKDADGRYLMQPDVTSPDKYLIDGKPV 306 (395) T ss_pred ------------cccHHHHHHHHHHh---hhhhhcCCCEEEEcHHHHHHHHH-hhccCCceeeccCcCCCCcceecccee Confidence 00123344433221 22222345689999999999985 566666666544555555567899999 Q ss_pred EEEeeccCC----ccccC Q lcl|Aclame:pro 317 QRTDALLNT----ESRVV 330 (330) Q Consensus 317 r~~dal~~t----E~~Vv 330 (330) ..+|+.... 
+..++ T Consensus 307 ~~~~~~~~~~~~~~~~i~ 324 (395) T protein:vir:38 307 IRIADKWLPDVSGSHPLY 324 (395) T ss_pred EEecccccCcCCCcceEE Confidence 999864321 22223 No 75 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.63 E-value=6.6e-09 Score=65.43 Aligned_cols=229 Identities=12% Similarity=0.004 Sum_probs=134.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccC----CcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLP----TPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP----~~~fR~lN~g~~~s~ 76 (330) +.......-+..+ +..+-+......||+.+.+.++|++.+++....++ ...|.+++..+ .+.|..=++..+++. T Consensus 113 ~~~~~~~~~~~~~-~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 190 (413) T protein:vir:81 113 ASDPASTATLTDE-FQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNT-TIKYLMEKANRVVEGGFKTVAEGGKKPYMR 190 (413) T ss_pred hhhhhhhcccccc-cccccchhhHHHHHHHHhhhhhHHhhcceeeccCC-ceeEEEeccccccccccceecCcccccccC Confidence 0000000000011 11122444667799999999999999998776543 45677776654 457887777787776 Q ss_pred -ceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 -SSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 -~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) +++.+++...+-+++.+.|.+.+.+..++. ...-.....++++....+.||||+.... 
.+.||..- T Consensus 191 ~~~f~~i~~~~~k~~~~~~iS~ell~ds~~l---~~~i~~~la~~~~~~~d~~~l~G~G~~~--~~~Gi~~~-------- 257 (413) T protein:vir:81 191 FADFDIVTESLSKIAGLTKITDEMIEDYDFL---VSYINARLLEELAIEEERQLLLGDGTGN--NLTGLLKR-------- 257 (413) T ss_pred cccceeeEeeeeeEEEeehhhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhccCCCCC--cccccccc-------- Confidence 689999999999999999999988765433 3333445678999999999999974332 24444220 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +|.. + T Consensus 258 ------------------------------------~~~~-------~-------------------------------- 262 (413) T protein:vir:81 258 ------------------------------------DGIQ-------T-------------------------------- 262 (413) T ss_pred ------------------------------------cccc-------c-------------------------------- Confidence 0000 0 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCC-------cce Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG-------ERV 308 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g-------~~v 308 (330) +-++ ..+...+++.+.+..+. .+... ...+|+||+.....|+ ..++.....+-.....+ ... T Consensus 263 -----~~~~--~~~~~~~~i~~~~~~~~--~~~~~-~~~~~vmn~~~~~~l~-~lkd~~G~~l~~~~~~~~~~~~~~~~~ 331 (413) T protein:vir:81 263 -----LAVS--NKDELADSIYKAMTNIS--LATPF-QADALVINPLDYQELR-LAKDANGQYYGGGVFQGQYGSGGIMLD 331 (413) T ss_pred -----cccc--ccchhHHHHHHHHHHhh--hhccC-CCcEEEEcHHHHHHHH-HhhccCCceeccccccccccccccccC Confidence 0000 00011223333332221 11111 1236999999999998 45566555554433321 122 Q ss_pred EEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 309 MTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 309 ~~~~gvpir~~dal~~tE~~Vv 330 (330) ..+.|.||..+|++..+...+. 
T Consensus 332 ~~l~G~pv~~s~~~~~~~~~~g 353 (413) T protein:vir:81 332 PAPWGLRTVQSQVVPVGKPVVG 353 (413) T ss_pred ceecceeeEEcCCCCcccEEEE Confidence 3678999999999986643322 No 76 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.62 E-value=3.1e-09 Score=67.26 Aligned_cols=234 Identities=11% Similarity=0.082 Sum_probs=132.0 Q ss_pred CCccccccccHHHHHhhc------------CcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccC-Ccceee Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRL------------DPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLP-TPTWRK 67 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~------------~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP-~~~fR~ 67 (330) ||-.-..+ -|.+++|.. -+......+++.+.+.+++|..+..+...+..+ .. ...+.+ .+.|.. T Consensus 1 ~~~k~~~~-~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~-~i-~~~~~~~~~~~~~ 77 (321) T protein:vir:31 1 MASRTINN-DLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKT-RI-PTLNIGERHRRPQ 77 (321) T ss_pred CchHHHHH-HHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcce-ee-eeeccCCcccccc Confidence 55421111 123333221 112345678999999999999999876543322 11 122233 344544 Q ss_pred c--CCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChh---hcc Q lcl|Aclame:pro 68 L--YGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPA---EFT 142 (330) Q Consensus 68 l--N~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~---~F~ 142 (330) - +...++++.++.+++-.|+-+...+.|...+.+..-...++...-...+.++++..+.+.+||||....|. 
..+ T Consensus 78 ~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~ 157 (321) T protein:vir:31 78 DEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQND 157 (321) T ss_pred cccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccch Confidence 2 22344566789999999999999999999888764211245555556778999999999999998654431 011 Q ss_pred ChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEe Q lcl|Aclame:pro 143 GLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYK 222 (330) Q Consensus 143 GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~ 222 (330) |+-+. +++ + . ...+..+ T Consensus 158 G~l~~---------------------------------------a~~---~------~----~~~~~~~----------- 174 (321) T protein:vir:31 158 GFITV---------------------------------------AEG---D------V----ETIDAAD----------- 174 (321) T ss_pred hhhhh---------------------------------------hcc---c------c----ccccccc----------- Confidence 11100 000 0 0 0000000 Q ss_pred eeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccC--CCCCCEEEEeChHHHHHHHHHhhccccceeee Q lcl|Aclame:pro 223 WDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQ--LGMGRAVWYMNRNLREKLRLGIVDKIANNLTW 300 (330) Q Consensus 223 w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~--~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~ 300 (330) .++ ..+.|++ .+..||. .+.++.+||||++....++....++... +-. 
T Consensus 175 ----------------~~~---------~~d~l~~----l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~-~~~ 224 (321) T protein:vir:31 175 ----------------DIL---------DNDLVIR----TIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTP-LGD 224 (321) T ss_pred ----------------ccc---------CHHHHHH----HHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCc-ccc Confidence 011 1122222 2334443 2456789999999988777665555432 222 Q ss_pred cccCCcceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 301 ETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 301 ~~~~g~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ....+.....+.|+||..++.+......+. T Consensus 225 ~~l~~~~~~tl~G~pvv~~~~mP~~~il~t 254 (321) T protein:vir:31 225 NVIMGEADVNPFSFPIIGSGLWPDDKAMFT 254 (321) T ss_pred chhhccccccccceeEEEcCCCCCCcEEEe Confidence 223344555789999999999886543333 No 77 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.53 E-value=6.5e-08 Score=60.00 Aligned_cols=233 Identities=11% Similarity=0.046 Sum_probs=135.9 Q ss_pred CCccc--cccccHHHHHh----------------hcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEec-cC Q lcl|Aclame:pro 1 MATLS--TNNPTMADVAK----------------RLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LP 61 (330) Q Consensus 1 M~~~~--~~a~TL~E~Ak----------------~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~-lP 61 (330) +..+. ...++-.|... 
.+-+......||+.+.+.++|+.....+...++....+.+..+ .+ T Consensus 93 ~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (409) T protein:vir:45 93 DKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSE 172 (409) T ss_pred HHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcc Confidence 00000 00111111100 0113334567999999999999988877665443344444443 35 Q ss_pred CcceeecCCccCcccceEEEEEEEE-EEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhh Q lcl|Aclame:pro 62 TPTWRKLYGGVLPNKSSTAQVTDNC-GMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAE 140 (330) Q Consensus 62 ~~~fR~lN~g~~~s~~t~~~~~~~l-~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~ 140 (330) .+.|..=++..+++..++.+++-.. ++.+..+.|-+.+.+... -++...-.....++++.+..+.|+|||-...+.+ T Consensus 173 ~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~--~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~ 250 (409) T protein:vir:45 173 VGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSA--IDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQ 250 (409) T ss_pred ccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccc Confidence 6679999999999999999988665 555678889998877642 1333334455679999999999999986554445 Q ss_pred ccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEE Q lcl|Aclame:pro 141 FTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTH 220 (330) Q Consensus 141 F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~ 220 (330) +.|+-... ++.. 
T Consensus 251 p~Gil~~~----------------~~~~---------------------------------------------------- 262 (409) T protein:vir:45 251 PKGLAASV----------------TGTT---------------------------------------------------- 262 (409) T ss_pred cceeeecc----------------cccc---------------------------------------------------- Confidence 55542100 0000 Q ss_pred EeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCC--CCCCEEEEeChHHHHHHHHHhhcccccee Q lcl|Aclame:pro 221 YKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQL--GMGRAVWYMNRNLREKLRLGIVDKIANNL 298 (330) Q Consensus 221 ~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~--~~g~~~~y~n~~v~~~L~~q~~~~~~~~l 298 (330) +.... .....++|++++ ..|+.. .....+||||+.....|++ .++....++ T Consensus 263 -------------------~~~~~---~~~~~d~i~~l~----~~l~~~~~~~a~~~~~~n~~~~~~l~~-lkd~~G~~i 315 (409) T protein:vir:45 263 -------------------QTAAA---NAVKWQEILALK----HSIDPAYRRGPKFRLAFNDNTLKLISE-MEDGQGRPL 315 (409) T ss_pred -------------------ccccc---cccchHHHHHHH----HhhhhhhccCCeEEEEECHHHHHHHHH-hhcCCCcee Confidence 00000 001123344332 222221 2334578999999999985 456666666 Q ss_pred eecccCCcceEEECCeEEEEEeeccC---CccccC Q lcl|Aclame:pro 299 TWETVSGERVMTFDGIPVQRTDALLN---TESRVV 330 (330) Q Consensus 299 ~~~~~~g~~v~~~~gvpir~~dal~~---tE~~Vv 330 (330) -.....+.....+.|.||..+|.+.. 
+...|+ T Consensus 316 ~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~ 350 (409) T protein:vir:45 316 WLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMF 350 (409) T ss_pred eccCcCCCCCceecceeeEEecCcCCccCCccEEE Confidence 55444444556789999999999874 222222 No 78 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.51 E-value=2.3e-08 Score=62.45 Aligned_cols=209 Identities=11% Similarity=0.040 Sum_probs=131.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEe-ccCCcceeecCCccCc-ccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRT-GLPTPTWRKLYGGVLP-NKS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~-~lP~~~fR~lN~g~~~-s~~ 77 (330) |.+- |..+ ...+-+......|||.+.+.++|++....+......+ +.+.... .-+.+.|..=++.+++ ++. T Consensus 5 ~~~~-----t~~~-gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 78 (293) T protein:vir:48 5 KTDH-----SGSD-AGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDP 78 (293) T ss_pred eccc-----ccCc-CceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCccccccccc Confidence 3321 2222 1123344556679999999999988877665543332 4444443 4578899999999987 678 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+++-.++-+++.+.|.+.+.+... +..++-. ....++++....+.|++|+... 
T Consensus 79 ~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~la~~~~~~~~~~i~~g~~~~-------------------- 135 (293) T protein:vir:48 79 KLSLIKYTIKRYAGISTVTNSLLADSAENILAWLS---GWIAKKVVVTRNKAILGVVDKL-------------------- 135 (293) T ss_pred ceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHH---HHHHHHHHHHHHhHHhhccccc-------------------- Confidence 999999999999999999998877643 4444433 3356788888777777664210 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) ++. . T Consensus 136 -------~~~-------------------------~-------------------------------------------- 139 (293) T protein:vir:48 136 -------PTK-------------------------P-------------------------------------------- 139 (293) T ss_pred -------ccc-------------------------c-------------------------------------------- Confidence 000 0 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) +....++|++++ ..++.......+|+||++....|+. .++.....+-.....+...-.+.|.|| T Consensus 140 -----------~~~~~d~i~~~~----~~l~~~~~~~a~~vmn~~~~~~L~~-lkd~~g~~l~~~~~~~~~~~~l~G~Pv 203 (293) T protein:vir:48 140 -----------TLTKWDDIIDLE----AKVDPAIKQTSFFLTNTSGFTALKK-VKNALGDYLMERDVKSPTGYSIAGFAV 203 (293) T ss_pred -----------cccCHHHHHHHH----HhhhhhhcCCCEEEEcHHHHHHHHH-hhccCCceEeecCcCCCCCceecceee Confidence 000123444433 3333333445789999999999995 456555555555555555568999999 Q ss_pred EEEeeccC--Cccc---cC Q lcl|Aclame:pro 317 QRTDALLN--TESR---VV 330 (330) Q Consensus 317 r~~dal~~--tE~~---Vv 330 (330) +.|+.... 
..+- ++ T Consensus 204 ~~~~~~~~~~~~~~~~~~~ 222 (293) T protein:vir:48 204 KEISDRWLPNASSGVMPLY 222 (293) T ss_pred EEecccccCCccCCceEEE Confidence 98865432 2221 11 No 79 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.45 E-value=4e-08 Score=61.15 Aligned_cols=211 Identities=9% Similarity=0.001 Sum_probs=127.2 Q ss_pred CCccc--cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEe-ccCCcceeecCCccCc-cc Q lcl|Aclame:pro 1 MATLS--TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRT-GLPTPTWRKLYGGVLP-NK 76 (330) Q Consensus 1 M~~~~--~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~-~lP~~~fR~lN~g~~~-s~ 76 (330) +.... ....|..+ ..-+-|......||+.+.+.++|+..+++....++. +.+.+.. +-+.++|..=++..++ +. T Consensus 121 ~~~~~~~~~~~t~~~-gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~~~~~~~ 198 (394) T protein:vir:97 121 TTPVEPQKDGIKKEN-AKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPALAK 198 (394) T ss_pred hhhhhhhcccccccc-ccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceeccccccccccc Confidence 00000 00011111 111233445567999999999999999988766554 4445443 4567788877777876 56 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) .++.+++-.++-+++.+.|-+.+.+..+ +..++- .....++++....+.|++|..+. 
T Consensus 199 ~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i---~~~la~~~~~~~~~~i~~g~~~~------------------- 256 (394) T protein:vir:97 199 PDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIV---SESISQIKVNTTNDAIAKVLKSF------------------- 256 (394) T ss_pred ccceeEEeehhheeeehhhHHHHHhhhhHHHHHHH---HHHHHHHHHHHHHHHHhhccccc------------------- Confidence 8999999999999999999998888644 344443 34467888888888887763210 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) + |+ + T Consensus 257 ---------~---------------------~~----~------------------------------------------ 260 (394) T protein:vir:97 257 ---------T---------------------TK----T------------------------------------------ 260 (394) T ss_pred ---------c---------------------cc----c------------------------------------------ Confidence 0 00 0 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeE Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIP 315 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvp 315 (330) ....++|++++...+. |. .+..|+||++....|+. 
.++.....+-.....+...-.++|.| T Consensus 261 -------------~~~~~~~~~~~~~~~~--~~---~~a~~v~n~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~l~G~p 321 (394) T protein:vir:97 261 -------------VKNLDEIKALLNGGFD--PA---YNVSLIVSQSFYQTLDT-LKDGNGRYLLQDDITAVSGKVLLGKP 321 (394) T ss_pred -------------cccHHHHHHHHHhhhh--hh---hCCEEEEcHHHHHHHHH-hhccCCCeeeecCcCCCCCceeccce Confidence 0012334433322111 11 23579999999999985 55555444433334344445889999 Q ss_pred EEEEeeccCCccccC Q lcl|Aclame:pro 316 VQRTDALLNTESRVV 330 (330) Q Consensus 316 ir~~dal~~tE~~Vv 330 (330) |..+++.......++ T Consensus 322 v~~~~~~~~~~~~~~ 336 (394) T protein:vir:97 322 VFVLSDEVLGANKAF 336 (394) T ss_pred eEEecccccCCccEE Confidence 999876554333333 No 80 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.39 E-value=6.5e-08 Score=59.99 Aligned_cols=209 Identities=10% Similarity=0.028 Sum_probs=130.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCcc-cce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLPN-KSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~t 78 (330) |... 
|-.+ ..-+-+......||+.+.+.++|++.++.....++.+ +.+.+.++-++++|..=++..+++ ..+ T Consensus 106 ~~~~-----t~~~-gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~ 179 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) T ss_pred cccc-----ccCC-CceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccccccc Confidence 2211 1111 1112333455679999999999999988877654433 445566778899999988998876 589 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-..+-+++.+.|.+.+.+.. .+..++- .....+++++.....|++|+....+ T Consensus 180 ~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 180 FSNVQYAVKDRAGILPLSRSLLQDSDQNILKYV---TKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred ceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 9999999999999999999887653 3444443 3445688888888888887632110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 
VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ....++|++++.. .++.....+.+|+||++....|++ .++.....+-.....+...-.+.|.|++ T Consensus 240 -----------~~~~d~i~~~~~~---~l~~~~~~~a~~vm~~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 -----------IKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDK-LKDKDGKYILQSDPTQKNKKLFAGTNPV 304 (392) T ss_pred -----------ccCHHHHHHHHHH---hhhhhhccCCEEEEcHHHHHHHHH-hhccCCCeEeecCccCCccccccCcccE Confidence 0112344443321 122233345689999999999985 5666656665444444444567888655 Q ss_pred E-Eeecc-------CCccccC Q lcl|Aclame:pro 318 R-TDALL-------NTESRVV 330 (330) Q Consensus 318 ~-~dal~-------~tE~~Vv 330 (330) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 4 33321 1122222 No 81 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.39 E-value=6.5e-08 Score=59.99 Aligned_cols=209 Identities=10% Similarity=0.028 Sum_probs=130.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCcc-cce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLPN-KSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~t 78 (330) |... 
|-.+ ..-+-+......||+.+.+.++|++.++.....++.+ +.+.+.++-++++|..=++..+++ ..+ T Consensus 106 ~~~~-----t~~~-gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~ 179 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) T ss_pred cccc-----ccCC-CceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccccccc Confidence 2211 1111 1112333455679999999999999988877654433 445566778899999988998876 589 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-..+-+++.+.|.+.+.+.. .+..++- .....+++++.....|++|+....+ T Consensus 180 ~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 180 FSNVQYAVKDRAGILPLSRSLLQDSDQNILKYV---TKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred ceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 9999999999999999999887653 3444443 3445688888888888887632110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 
VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ....++|++++.. .++.....+.+|+||++....|++ .++.....+-.....+...-.+.|.|++ T Consensus 240 -----------~~~~d~i~~~~~~---~l~~~~~~~a~~vm~~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 -----------IKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDK-LKDKDGKYILQSDPTQKNKKLFAGTNPV 304 (392) T ss_pred -----------ccCHHHHHHHHHH---hhhhhhccCCEEEEcHHHHHHHHH-hhccCCCeEeecCccCCccccccCcccE Confidence 0112344443321 122233345689999999999985 5666656665444444444567888655 Q ss_pred E-Eeecc-------CCccccC Q lcl|Aclame:pro 318 R-TDALL-------NTESRVV 330 (330) Q Consensus 318 ~-~dal~-------~tE~~Vv 330 (330) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 4 33321 1122222 No 82 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.39 E-value=6.5e-08 Score=59.99 Aligned_cols=209 Identities=10% Similarity=0.028 Sum_probs=130.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCcc-cce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLPN-KSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~t 78 (330) |... 
|-.+ ..-+-+......||+.+.+.++|++.++.....++.+ +.+.+.++-++++|..=++..+++ ..+ T Consensus 106 ~~~~-----t~~~-gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~ 179 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) T ss_pred cccc-----ccCC-CceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccccccc Confidence 2211 1111 1112333455679999999999999988877654433 445566778899999988998876 589 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-..+-+++.+.|.+.+.+.. .+..++- .....+++++.....|++|+....+ T Consensus 180 ~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 180 FSNVQYAVKDRAGILPLSRSLLQDSDQNILKYV---TKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred ceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 9999999999999999999887653 3444443 3445688888888888887632110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 
VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ....++|++++.. .++.....+.+|+||++....|++ .++.....+-.....+...-.+.|.|++ T Consensus 240 -----------~~~~d~i~~~~~~---~l~~~~~~~a~~vm~~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 -----------IKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDK-LKDKDGKYILQSDPTQKNKKLFAGTNPV 304 (392) T ss_pred -----------ccCHHHHHHHHHH---hhhhhhccCCEEEEcHHHHHHHHH-hhccCCCeEeecCccCCccccccCcccE Confidence 0112344443321 122233345689999999999985 5666656665444444444567888655 Q ss_pred E-Eeecc-------CCccccC Q lcl|Aclame:pro 318 R-TDALL-------NTESRVV 330 (330) Q Consensus 318 ~-~dal~-------~tE~~Vv 330 (330) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 4 33321 1122222 No 83 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.39 E-value=6.5e-08 Score=59.99 Aligned_cols=209 Identities=10% Similarity=0.028 Sum_probs=130.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCCcceeecCCccCcc-cce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPTPTWRKLYGGVLPN-KSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~t 78 (330) |... 
|-.+ ..-+-+......||+.+.+.++|++.++.....++.+ +.+.+.++-++++|..=++..+++ ..+ T Consensus 106 ~~~~-----t~~~-gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~ 179 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) T ss_pred cccc-----ccCC-CceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccccccc Confidence 2211 1111 1112333455679999999999999988877654433 445566778899999988998876 589 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-..+-+++.+.|.+.+.+.. .+..++- .....+++++.....|++|+....+ T Consensus 180 ~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 180 FSNVQYAVKDRAGILPLSRSLLQDSDQNILKYV---TKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred ceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 9999999999999999999887653 3444443 3445688888888888887632110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 
VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ....++|++++.. .++.....+.+|+||++....|++ .++.....+-.....+...-.+.|.|++ T Consensus 240 -----------~~~~d~i~~~~~~---~l~~~~~~~a~~vm~~~~~~~L~~-lkd~~G~~l~~~~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 -----------IKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDK-LKDKDGKYILQSDPTQKNKKLFAGTNPV 304 (392) T ss_pred -----------ccCHHHHHHHHHH---hhhhhhccCCEEEEcHHHHHHHHH-hhccCCCeEeecCccCCccccccCcccE Confidence 0112344443321 122233345689999999999985 5666656665444444444567888655 Q ss_pred E-Eeecc-------CCccccC Q lcl|Aclame:pro 318 R-TDALL-------NTESRVV 330 (330) Q Consensus 318 ~-~dal~-------~tE~~Vv 330 (330) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 4 33321 1122222 No 84 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=98.36 E-value=5.1e-08 Score=60.59 Aligned_cols=217 Identities=12% Similarity=0.055 Sum_probs=128.0 Q ss_pred CCccccccccH-HHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcce--eecCCccCcccc Q lcl|Aclame:pro 1 MATLSTNNPTM-ADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTW--RKLYGGVLPNKS 77 (330) Q Consensus 1 M~~~~~~a~TL-~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~f--R~lN~g~~~s~~ 77 (330) +.+.....+|. ...+ .+-+......||+.+.+.++|.+.++.....++ .+.|.+.++.+++.| ..=++..++++. 
T Consensus 101 ~~~~~~~~~~~~~~~~-~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~ 178 (379) T protein:vir:10 101 IQVKAVGDMTLPVNLT-GAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGG-TYTFVRENGAGEGAIGAQVEGATKGQKDY 178 (379) T ss_pred hhhhhhcccccCCCCc-cccchhhhhHHHHhHHhhhhHHhhceeeeccCC-ceEEEEeecCCCcccccccCCcccccccc Confidence 11111111111 1111 122344567799999999999999988777544 578888887766665 444556788899 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) ++.+++-..+-+++.+.|.+.+.+-.++..++-. ....++++......|++|+.+. T Consensus 179 ~f~~i~~~~~k~~~~~~iS~ell~D~~~l~~~i~---~~la~~~~~~~~~~~~~g~~~~--------------------- 234 (379) T protein:vir:10 179 DISMIDVNTDFIAGFTRYSKKMANNLPFLTSFIP---NALRRDYAKAENAAFNAVLAAN--------------------- 234 (379) T ss_pred ceeeeEeeeeeEEeeehhhHHHHhhHHHHHHHHH---HHHHHHHHHHHHHHHhcccccc--------------------- Confidence 9999999999999999999998776554443333 3456777877777776664210 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) |+++.. .. 
T Consensus 235 ------~~~~~~---------------------------------~~--------------------------------- 242 (379) T protein:vir:10 235 ------ATASTE---------------------------------II--------------------------------- 242 (379) T ss_pred ------cccccc---------------------------------cc--------------------------------- Confidence 000000 00 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccC--CcceEEECCeE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVS--GERVMTFDGIP 315 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~--g~~v~~~~gvp 315 (330) ++....+++++.+ ..+....-...+|+||+.....|++ .++....++-..... +...-.+.|+| T Consensus 243 ---------~~~~~~d~i~~~~----~~~~~~~~~~~~~vmn~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~~~l~G~p 308 (379) T protein:vir:10 243 ---------TNKNKVEMLINEI----AKQENLDFPVTAIVLRPTDYYDILV-TQKSVGAGYGLPGVVTQDNGVLRINGIP 308 (379) T ss_pred ---------cCcccHHHHHHHH----HhhhhccCCCCEEEEcHHHHHHHHH-hhccCCceeccCCccCCCCCcceeccee Confidence 0001123344332 2222222223479999999999985 455554444332221 22334788999 Q ss_pred EEEEeeccCCccccC Q lcl|Aclame:pro 316 VQRTDALLNTESRVV 330 (330) Q Consensus 316 ir~~dal~~tE~~Vv 330 (330) |..++++..+. 
.++ T Consensus 309 vv~s~~~~ag~-~~~ 322 (379) T protein:vir:10 309 LFRATWLAANK-YYV 322 (379) T ss_pred eEecCCCCCCc-eEE Confidence 99999886543 222 No 85 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.33 E-value=2e-07 Score=57.31 Aligned_cols=231 Identities=16% Similarity=0.149 Sum_probs=131.9 Q ss_pred CCccccccccHHH------H--------HhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCccee Q lcl|Aclame:pro 1 MATLSTNNPTMAD------V--------AKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWR 66 (330) Q Consensus 1 M~~~~~~a~TL~E------~--------Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR 66 (330) +.......+|-.| + .-.+-|......|++.+.+.++|++.+.++...++ ...+.++++-|.+.|. T Consensus 64 ~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~-~~~i~~~~~~~~a~~~ 142 (390) T protein:vir:40 64 LASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTAT-TEWIISVGDVATAWWG 142 (390) T ss_pred HHhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eeEEEEEcCCcceeee Confidence 0000000111111 0 00112333466799999999999999998877543 3446788899999999 Q ss_pred ecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 67 KLYGGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 67 ~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) .-++.+++ ++.++.+++-.++-+.+.+.|.+.+.+...- ++-+.-.....++++..+.+.|++||-...|. 
|+- T Consensus 143 ~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~--~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~---Gil 217 (390) T protein:vir:40 143 PLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPS--WLDQYVRTILGEAMALGLEAGIVNGSGKDQPI---GMM 217 (390) T ss_pred ccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchH--HHHHHHHHHHHHHHHHHHHhhhhcccCCCccc---eee Confidence 99988875 5789999999999999999999999887542 23333445577999999999999997543332 221 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) .. .+ +.+. +..+ +.+++ T Consensus 218 ~~---~~----------~~~~-----------------~~~~-~~~~~-------------------------------- 234 (390) T protein:vir:40 218 RD---LN----------NVTA-----------------GEHP-VKTAT-------------------------------- 234 (390) T ss_pred ec---cc----------cccc-----------------cccc-ccccc-------------------------------- Confidence 10 00 0000 0000 00000 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChH-HHHHHHHH--hhccccceeeecc Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRN-LREKLRLG--IVDKIANNLTWET 302 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~-v~~~L~~q--~~~~~~~~l~~~~ 302 (330) .+ +.....++++.+..++..-+....++.+|+||+. ....|... ..++....+.. T Consensus 235 --------------~~------t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~-- 292 (390) T protein:vir:40 235 --------------PL------TDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTG-- 292 (390) T ss_pred --------------cc------chhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccccc-- Confidence 00 0111233343344444433333456778999965 34444422 12222222221 Q ss_pred cCCcceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 303 VSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 303 ~~g~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ..+.|+||..++++.... 
|+ T Consensus 293 ------~~~~g~pvv~~~~~p~~~--i~ 312 (390) T protein:vir:40 293 ------ILPVPLEIVQSVAVPVGK--AV 312 (390) T ss_pred ------cCCCceeEEEcCCCCCCc--EE Confidence 123689999999886543 44 No 86 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.30 E-value=1.4e-07 Score=58.16 Aligned_cols=244 Identities=16% Similarity=0.174 Sum_probs=131.4 Q ss_pred CCccccccccHHH------HHh--------hcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCccee Q lcl|Aclame:pro 1 MATLSTNNPTMAD------VAK--------RLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWR 66 (330) Q Consensus 1 M~~~~~~a~TL~E------~Ak--------~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR 66 (330) ........+|-.| +.+ -+-|......|+|.+.+.+||+..+.+.... ..+...+.++-|+++|. T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~wv 136 (377) T protein:vir:96 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAVWG 136 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEecCCcceeEe Confidence 1100011111111 000 0122335567899999999999999887653 34678888899999999 Q ss_pred ecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccCh Q lcl|Aclame:pro 67 KLYGGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGL 144 (330) Q Consensus 67 ~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL 144 (330) ..++..++ +..++.+++-.++-|.+.+.|.+.|.+..+ +..+|-.. 
...++++....+.||+||-...|.+ | T Consensus 137 ~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~---~l~~~~~~~~~~a~i~G~G~~~P~G---i 210 (377) T protein:vir:96 137 DIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITE---QLKEAIAVALELAIVKGNGLLQPVG---L 210 (377) T ss_pred ecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHH---HHHHHHHHHHhhceEeccCCCccee---e Confidence 99988865 679999999999999999999999988755 56666554 4568999999999999997655544 3 Q ss_pred hhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeee Q lcl|Aclame:pro 145 SPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWD 224 (330) Q Consensus 145 ~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~ 224 (330) -.. .+ ....++. ++.. ..++|+.....| . T Consensus 211 l~~---~~----~~~~~~~-------------~~~~-~~~~~~~~~~~~-------------------~----------- 239 (377) T protein:vir:96 211 LKD---LS----QPTVDQS-------------TGRD-ITTYKTDKEAIA-------------------D----------- 239 (377) T ss_pred eec---cc----ccccccc-------------cccc-ccceeecccccc-------------------c----------- Confidence 211 10 0011100 0000 011222110000 0 Q ss_pred eeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHH----hccCCCCCCEEEEeChHHHHHHHHHhhccccceeee Q lcl|Aclame:pro 225 IGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAE----RIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTW 300 (330) Q Consensus 225 ~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~----~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~ 300 (330) +..++ .....+++..|..++. .-|....++.+|+||+.....+..+.. . 
T Consensus 240 -------------~~~~~------~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~--------~ 292 (377) T protein:vir:96 240 -------------LSDLD------PDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT--------S 292 (377) T ss_pred -------------cccCC------hhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccccc--------c Confidence 00111 1122233333444442 234455677889999876443322111 1 Q ss_pred cccCCcceEEE-CCeEEEEEeecc-----------------------------CCccccC Q lcl|Aclame:pro 301 ETVSGERVMTF-DGIPVQRTDALL-----------------------------NTESRVV 330 (330) Q Consensus 301 ~~~~g~~v~~~-~gvpir~~dal~-----------------------------~tE~~Vv 330 (330) ....|..++-+ .|++|..++++. ..+..++ T Consensus 293 ~~~~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:96 293 RNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) T ss_pred cCCCCCceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeE Confidence 11223333322 133344444333 3332222 No 87 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.28 E-value=2.9e-07 Score=56.43 Aligned_cols=229 Identities=12% Similarity=0.058 Sum_probs=128.2 Q ss_pred CCccccccccHHHHH-hhcCcccchHHHHHHHhccchhHhhcce-eeccC--CccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVA-KRLDPNGKVDIIVEMLNQTNPVLQDMTA-IEGNL--PTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~A-k~~~~d~~~~~VIE~l~~~s~iL~~lpf-~e~n~--g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ...+.....|-.+.+ ..+-+......|||.+.+.+.+.+..+- +.... 
+......+.++-|.++|..=++..+.++ T Consensus 331 ~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~ 410 (645) T protein:vir:93 331 KSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTK 410 (645) T ss_pred hhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccc Confidence 001111110111101 0112233456799999988877655322 11111 2235667778889999999999999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) .++.+++-..+-|++.+.|.+.|.+.. .+..++.. ....++++..+..+||+|+... T Consensus 411 ~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~---~~l~~aia~~~d~a~l~g~g~~------------------- 468 (645) T protein:vir:93 411 FDFESITFSHAKVSAIAVLTEELIRFSSPAADALVR---NALAEAVVARLDTDFVDPKKAA------------------- 468 (645) T ss_pred cceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHH---HHHHHHHHHHHHHHhhcCCCcc------------------- Confidence 999999999999999999999987764 45555443 4467999999999999986211 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +++.. |+|...+. . .. -.. 
T Consensus 469 --------~~~~~------------------p~gi~~~~----~---~~---~~~------------------------- 487 (645) T protein:vir:93 469 --------VADVS------------------PASITHDV----K---GT---ASS------------------------- 487 (645) T ss_pred --------cCCcc------------------ccceeccc----c---cc---ccc------------------------- Confidence 11111 11100000 0 00 000 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeE Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIP 315 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvp 315 (330) .+. ..|+..++ .++.. .+..+...+|+||++....|+.. ++.....+.++ ..... -.+.|.| T Consensus 488 ---~~~----------~~d~~~~~-~~~~~-a~~~~~~a~~vmn~~~~~~L~~l-kd~~G~~~~~~-~~~~~-~tL~G~P 549 (645) T protein:vir:93 488 ---GNP----------DADAEAAF-GQFVA-ANLQPTGAVWLMSSTNALALSMR-KNALGQKEYPD-MTLLG-GSFQGLP 549 (645) T ss_pred ---cch----------HHHHHHHH-HHHHh-cCCCccccEEEEcHHHHHHHHhc-cccCCceeecC-CCCCC-ceeecee Confidence 000 01222211 11110 22344557899999999999864 44433444332 11111 2689999 Q ss_pred EEEEeeccCCcc-----ccC Q lcl|Aclame:pro 316 VQRTDALLNTES-----RVV 330 (330) Q Consensus 316 ir~~dal~~tE~-----~Vv 330 (330) |..++++.++-. .+. 
T Consensus 550 V~~s~~vp~~~~~gd~s~~~ 569 (645) T protein:vir:93 550 VIVSQYVGDQLVLVNAPDIY 569 (645) T ss_pred eEEeccCCcceeEeccccEE Confidence 999998764211 111 No 88 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.18 E-value=9.3e-07 Score=53.66 Aligned_cols=236 Identities=11% Similarity=0.035 Sum_probs=130.2 Q ss_pred CCcc----ccccccHHHHH-hhcCccc-chHHHHHHHhccchhHhhcceeeccCCcc-ceeEEEeccCC-cceeecCC-- Q lcl|Aclame:pro 1 MATL----STNNPTMADVA-KRLDPNG-KVDIIVEMLNQTNPVLQDMTAIEGNLPTG-HRTSVRTGLPT-PTWRKLYG-- 70 (330) Q Consensus 1 M~~~----~~~a~TL~E~A-k~~~~d~-~~~~VIE~l~~~s~iL~~lpf~e~n~g~~-~~~~~~~~lP~-~~fR~lN~-- 70 (330) .... ...+++-..-+ -.+-+.. ....|||.+.+.++|++.+..+...++.+ +.+.+..+-+. +.|..=+. T Consensus 145 ~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~ 224 (477) T protein:vir:84 145 RKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAAL 224 (477) T ss_pred HHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCccc Confidence 0000 00000000000 0011111 23469999999899988777765544433 45555444443 44554332 Q ss_pred ---ccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhh Q lcl|Aclame:pro 71 ---GVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSP 146 (330) Q Consensus 71 ---g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~ 146 (330) ..+.++.++.+++-..+-+++.+.|.+.+.+... +..++-. ....++++.....+|+|||-..+ +..||-. 
T Consensus 225 ~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~~~~~~d~~~l~G~Gt~~--~p~Gi~~ 299 (477) T protein:vir:84 225 TAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVF---RDLAADYANKLNVQVISGTGSNN--QVVGVRA 299 (477) T ss_pred ccccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHH---HHHHHHHHHHHHHHHhccCCCCC--ccceeee Confidence 3467778899999999999999999999888755 6655544 44668999999999999974321 1223311 Q ss_pred hhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeee Q lcl|Aclame:pro 147 RYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIG 226 (330) Q Consensus 147 R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~G 226 (330) ..|.+ T Consensus 300 ---------------~~~~~------------------------------------------------------------ 304 (477) T protein:vir:84 300 ---------------TAGIT------------------------------------------------------------ 304 (477) T ss_pred ---------------ccccc------------------------------------------------------------ Confidence 00000 Q ss_pred eEEeccccEEEeecccccc-cccchhHHHHHHHHHHHHHhccC-CCCCCEEEEeChHHHHHHHHHhhccccceeeecc-- Q lcl|Aclame:pro 227 LTLRDWRYVARVCNIDVSD-LATSANAQALIKYMIMAAERIPQ-LGMGRAVWYMNRNLREKLRLGIVDKIANNLTWET-- 302 (330) Q Consensus 227 l~v~d~r~v~RI~NId~~~-l~~~~~~~~l~~~m~~a~~~ip~-~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~-- 302 (330) ++.... .++.+....+.+.++.+.+.+.. ......+|+||.+....|+.. ++.....+-.-. 
T Consensus 305 -------------~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~ 370 (477) T protein:vir:84 305 -------------QVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAI-FAGDDRPLIVPSGP 370 (477) T ss_pred -------------cccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHh-hccCCCeeeecCcc Confidence 000000 00111223444555556665543 233345899999999999863 555444332211 Q ss_pred -----------cCCcceEEECCeEEEEEeeccCC------ccccC Q lcl|Aclame:pro 303 -----------VSGERVMTFDGIPVQRTDALLNT------ESRVV 330 (330) Q Consensus 303 -----------~~g~~v~~~~gvpir~~dal~~t------E~~Vv 330 (330) ......-.++|+||..++++..+ +..++ T Consensus 371 ~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~ 415 (477) T protein:vir:84 371 GFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIH 415 (477) T ss_pred cccccccccccccccccchhcccceEecCcccccccccCCcceEE Confidence 11122236789999999999743 33344 No 89 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.16 E-value=2.2e-07 Score=57.07 Aligned_cols=211 Identities=9% Similarity=-0.031 Sum_probs=127.0 Q ss_pred CCccc---------cccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEe-ccCCcceeecCC Q lcl|Aclame:pro 1 MATLS---------TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRT-GLPTPTWRKLYG 70 (330) Q Consensus 1 M~~~~---------~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~-~lP~~~fR~lN~ 70 (330) ..... ....|..+ ..-+-|......||+.+.+.++|++.++++...++. ..+.+.. +-+.++|..=+. 
T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~ 197 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAAD-AASTIPETISNTPQRELQTVVDLKPFTNVFQASTQK-GTYPTVANATTKMVTVAELE 197 (400) T ss_pred hhhhhhhHHHHHHHhhcccccC-CcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcc-eEEEEEecCCCccccccccc Confidence 00000 00011111 111223345677999999999999999998775543 3455544 446778877677 Q ss_pred ccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhh Q lcl|Aclame:pro 71 GVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRY 148 (330) Q Consensus 71 g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~ 148 (330) ..++ +..++.+++...+-+++.+.|.+.+.+... +..++-. ....+++.......+++|.... T Consensus 198 ~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~---~~l~~~~~~~~~~~i~~~~~~~------------ 262 (400) T protein:vir:38 198 KNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIA---QNGQQIKVNTTNGAVATLLKGF------------ 262 (400) T ss_pred cccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHH---HHHHHHHHHHHHHhhhhccccc------------ Confidence 7775 578999999999999999999998887643 4444433 3456778888888887774210 Q ss_pred cccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeE Q lcl|Aclame:pro 149 NSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLT 228 (330) Q Consensus 149 ~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~ 228 (330) ++ . 
++ T Consensus 263 ----------------~~---------------------~----~~---------------------------------- 267 (400) T protein:vir:38 263 ----------------TA---------------------K----TI---------------------------------- 267 (400) T ss_pred ----------------cc---------------------c----cc---------------------------------- Confidence 00 0 00 Q ss_pred EeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcce Q lcl|Aclame:pro 229 LRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERV 308 (330) Q Consensus 229 v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v 308 (330) ...++|++++.... + ...+.+|+||++....|+. .++.....+-..+..+... T Consensus 268 ---------------------~~~~~~~~~~~~~~---~--~~~~a~~v~~~~~~~~l~~-lkd~~G~~i~~~~~~~~~~ 320 (400) T protein:vir:38 268 ---------------------SSVDDLKHINNVDL---D--PAYSRVIIASQSFYNFLDT-VKDGNGRYLLQDSILTPSG 320 (400) T ss_pred ---------------------ccHHHHHHHHHhhh---h--hhhCcEEEEcHHHHHHHHH-hhccCCCeeeecCcCCCCc Confidence 00112222211111 1 1124689999999999994 5666555554444544445 Q ss_pred EEECCeEEEEEeeccC-Cccc--cC Q lcl|Aclame:pro 309 MTFDGIPVQRTDALLN-TESR--VV 330 (330) Q Consensus 309 ~~~~gvpir~~dal~~-tE~~--Vv 330 (330) -.+.|.||+.+|++.. +... 
++ T Consensus 321 ~~l~G~pv~~~~~~~~~~~g~~~~~ 345 (400) T protein:vir:38 321 KSVLGMPIAVVSDDTLGAAGEAHAF 345 (400) T ss_pred cccccceeEEecccccCCCCceEEE Confidence 5799999999998774 2222 22 No 90 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.16 E-value=6.3e-07 Score=54.57 Aligned_cols=226 Identities=16% Similarity=0.139 Sum_probs=131.0 Q ss_pred CCc----cccccc-cHHHH--------HhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MAT----LSTNNP-TMADV--------AKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~----~~~~a~-TL~E~--------Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) |-. ...... ...+. .-.+-|......|++.+.+.++|++.......+ + .+.+.+.++-|.+.|.+ T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g-~~~ip~~~~~~~a~~v~ 196 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-G-TTRILVDTDTSPATWIE 196 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-c-eeEEEEecCCccccccc Confidence 000 000000 00000 000223445667999999999999998877654 3 47788999999999999 Q ss_pred cCCccCccc-ceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVLPNK-SSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~~s~-~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) =++.++++. .++.+++-..+-+.+.+.|-+.+.+... +..++-. 
....++++.+..+.||+||-..+ .++.|+- T Consensus 197 E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~i~~~~d~~il~G~G~~~-~~p~Gil 272 (425) T protein:vir:95 197 QSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVT---KKIARAIAKALDLAIVKGTGAAN-KQPLGII 272 (425) T ss_pred cccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHH---HHHHHHHHHHHHHHhhccCCCCc-cccceee Confidence 999998877 5899999999999999999999888755 4444433 55678999999999999973321 1222331 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) +. ++. +. T Consensus 273 ~~---~~~-----------~~----------------------------------------------------------- 279 (425) T protein:vir:95 273 PS---LPP-----------EN----------------------------------------------------------- 279 (425) T ss_pred cc---ccc-----------cc----------------------------------------------------------- Confidence 10 000 00 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHH---HHHHHHhccCCCCCCEEEEeChHH-HHHHHHH--hhccccceee Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKY---MIMAAERIPQLGMGRAVWYMNRNL-REKLRLG--IVDKIANNLT 299 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~---m~~a~~~ip~~~~g~~~~y~n~~v-~~~L~~q--~~~~~~~~l~ 299 (330) ++.. .......++++++ +..+. ...+..+|+||+.. ...|... ..++...++. T Consensus 280 --------------~~~~--~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~ 338 (425) T protein:vir:95 280 --------------QVTV--EADNNLLKNLVKQIGLIDTGD-----DSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVG 338 (425) T ss_pred --------------cccc--ccccchHHHHHHHHHhhhhhc-----cccCceEEEEeChHHHHHHHHHHhhcCCCCceee Confidence 0000 0000011223322 21111 23456788999764 3333211 1344444443 Q ss_pred ecccCCcceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 300 WETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 300 ~~~~~g~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ... 
.+... .+.|.||..+|++.... |+ T Consensus 339 ~~~-~~~~~-~l~G~pvv~~~~~~~~~--i~ 365 (425) T protein:vir:95 339 KLP-NLRTP-DLLGLRVVFNNFLDDDT--VL 365 (425) T ss_pred ccC-CCCCc-cccceeeEEcCcCCCcc--EE Confidence 322 22222 46799999999998653 33 No 91 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.12 E-value=7.3e-07 Score=54.23 Aligned_cols=215 Identities=12% Similarity=0.064 Sum_probs=123.4 Q ss_pred CCc---cccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccCc-cc Q lcl|Aclame:pro 1 MAT---LSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLP-NK 76 (330) Q Consensus 1 M~~---~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~-s~ 76 (330) +-. .+....|..+ .-.+-|......||+.+.+.++|++.++.....++.++......+-+.+.|..=+...++ +. T Consensus 103 ~~~~~~~~~~~~t~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 181 (394) T protein:vir:10 103 HGKVIDNAAGHVTSTE-AGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAE 181 (394) T ss_pred cchhhhhhhccccccc-CceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCcccccccccccccccc Confidence 000 0000011111 001223345667999999999999999887765554332223334467788877777775 67 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) .++.+++-..+-+++.+.|-+.+.+... +..++- .....+++.......+++|+.... 
T Consensus 182 ~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~la~~~~~~~~~~il~g~g~~~------------------ 240 (394) T protein:vir:10 182 PEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLV---GQSINEKSVNTYNAMIAPVLQSFT------------------ 240 (394) T ss_pred ccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHH---HHHHHHHHHHHHHHHHhhcccccc------------------ Confidence 8999999999999999999999888643 444443 334567788888877777752110 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) |++ T Consensus 241 -------------------------------~~~---------------------------------------------- 243 (394) T protein:vir:10 241 -------------------------------AKA---------------------------------------------- 243 (394) T ss_pred -------------------------------ccc---------------------------------------------- Confidence 000 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeeccc----CCcceEEE Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETV----SGERVMTF 311 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~----~g~~v~~~ 311 (330) ..+....++|++++-.... +. .+.+|+||++....|+. .++.....+-...+ .+...-.+ T Consensus 244 ----------~~~~~~~d~l~~~~~~~~~--~~---~~a~~vmn~~~~~~l~~-lkd~~G~~i~~~~~~~~~~~~~~~~L 307 (394) T protein:vir:10 244 ----------TTTDTLVDSLKHILNVDLD--PA---YSRALVVTQSLFNTLDT-LKDKNGRYLLHDASDSITDGTAKGTV 307 (394) T ss_pred ----------ccccccHHHHHHHHHhhhh--hh---ccCEEEecHHHHHHHHH-hhccCCCeeeeccccccccCCccccc Confidence 0001122344433211111 11 24689999999999996 45554333322222 23333578 Q ss_pred CCeEEEEEeec-cCC---ccccC Q lcl|Aclame:pro 312 DGIPVQRTDAL-LNT---ESRVV 330 (330) Q Consensus 312 ~gvpir~~dal-~~t---E~~Vv 330 (330) .|+||..+|+. 
+.+ +..++ T Consensus 308 ~G~PV~~~~~~~~~~~~~~~~i~ 330 (394) T protein:vir:10 308 LGVPVYVVGDALLGSAAGDQKAF 330 (394) T ss_pred ccceeEEecccccCCCCCceEEE Confidence 99999988643 322 22223 No 92 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.07 E-value=8.6e-07 Score=53.84 Aligned_cols=214 Identities=12% Similarity=0.037 Sum_probs=123.9 Q ss_pred CCc----------cccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEE-eccCCcceeecC Q lcl|Aclame:pro 1 MAT----------LSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVR-TGLPTPTWRKLY 69 (330) Q Consensus 1 M~~----------~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~-~~lP~~~fR~lN 69 (330) +.. ......+..+ ...+-+.... .+|..+.+.++|...+..+....+. ..+.++ .+-+.++|..=+ T Consensus 141 ~~~~~~~~~~~e~~~~~~~~~~~-~g~lvp~~~~-~~i~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e~ 217 (437) T protein:vir:10 141 VTAFADYLKTGEVRDVTGIALKD-GKVIIPETIL-TPEKEVHQFPRLGSLVRTESVTTTT-GKLPIFNNSTDLLTAHTEY 217 (437) T ss_pred hhhhHHHHHhhhhhhhhhccccc-ccccchHHHH-HHHHHhhhhhhhhhcceeEeeccCc-eeeEEeecccccccccccc Confidence 000 0000011111 0011122222 3455667888888888877665553 445555 455788998888 Q ss_pred CccCc-ccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhh Q lcl|Aclame:pro 70 GGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPR 147 (330) Q Consensus 70 ~g~~~-s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R 147 (330) +..++ +..++.+++-..+-+++.+.|-+.+.+... +..++ =.....++++......|++|+.. 
T Consensus 218 ~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~---i~~~l~~~~~~~~~~~i~~g~g~------------ 282 (437) T protein:vir:10 218 GQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAE---LQSRLIELRDNTDDSLIITALTD------------ 282 (437) T ss_pred ccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHH---HHHHHHHHHHHHHHHHHhhhhcc------------ Confidence 88875 558999999999999999999998877644 34443 33445688888888889888621 Q ss_pred hcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeee Q lcl|Aclame:pro 148 YNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGL 227 (330) Q Consensus 148 ~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl 227 (330) |. |+ + T Consensus 283 ------------------~~-------------------~~----~---------------------------------- 287 (437) T protein:vir:10 283 ------------------GI-------------------KK----T---------------------------------- 287 (437) T ss_pred ------------------cc-------------------cc----c---------------------------------- Confidence 00 00 0 Q ss_pred EEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcc Q lcl|Aclame:pro 228 TLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGER 307 (330) Q Consensus 228 ~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~ 307 (330) ......++|++++.. .++.....+.+|+||++....|+. .++....++-...+.+.. T Consensus 288 -------------------~~~~~~~~~~~~~~~---~l~~~~~~~~~~~~~~~~~~~l~~-lkd~~g~~~~~~~~~~~~ 344 (437) T protein:vir:10 288 -------------------TSTYLLGDLKKVLNV---TLKPQDSAAASIVMSQSAYNLFDM-ATDAMGRPLLQPNVTAAT 344 (437) T ss_pred -------------------ccccchhhHHHHHHh---hhhhhhhcCCEEEEcHHHHHHHHH-hhccCCCeeeccCccCCC Confidence 000011233332221 122222345689999999999986 456555555444444444 Q ss_pred eEEECCeEEEEEeec--cC---CccccC Q lcl|Aclame:pro 308 VMTFDGIPVQRTDAL--LN---TESRVV 330 (330) Q Consensus 308 v~~~~gvpir~~dal--~~---tE~~Vv 330 (330) .-.+.|.||..+++. 
.+ +...++ T Consensus 345 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 372 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFPSASAGDVNIV 372 (437) T ss_pred CcccccceeEEecccccCCcCCCceEEE Confidence 557999999999864 22 122223 No 93 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.06 E-value=2.4e-06 Score=51.39 Aligned_cols=223 Identities=12% Similarity=0.118 Sum_probs=124.8 Q ss_pred CCc----------------------------cccccccHHHHH--h-------hcCccc-chHHHHHHHhccchhHhh-c Q lcl|Aclame:pro 1 MAT----------------------------LSTNNPTMADVA--K-------RLDPNG-KVDIIVEMLNQTNPVLQD-M 41 (330) Q Consensus 1 M~~----------------------------~~~~a~TL~E~A--k-------~~~~d~-~~~~VIE~l~~~s~iL~~-l 41 (330) ++. ++.. .|..-| . -+-+.. ....|||.+.+.+.+.+. . T Consensus 316 ~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~--~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~ 393 (632) T protein:vir:96 316 AATGDWSKAGFEREVSLAIADASGKEARGFYMPHE--VLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGA 393 (632) T ss_pred hhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHH--HHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcc Confidence 000 0000 000000 0 011222 245789999886666553 2 Q ss_pred ceeeccCCccceeEEEeccCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHH Q lcl|Aclame:pro 42 TAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEG 120 (330) Q Consensus 42 pf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika 120 (330) ..+....+ .+.+.+.++-|+++|..=++.+++++.++.+++-..+-+++.+.|-+.+.+... 
+..++- .....++ T Consensus 394 ~~~~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i---~~~l~~a 469 (632) T protein:vir:96 394 RMLPGLVG-DVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLI---REDLIEG 469 (632) T ss_pred eEeecCCc-ceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHH---HHHHHHH Confidence 33444433 477888999999999999999999999999999999999999999999877643 443332 3446799 Q ss_pred HHHHHHHhhccCCCCcChhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceecccc Q lcl|Aclame:pro 121 MNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKG 200 (330) Q Consensus 121 ~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g 200 (330) ++......||||+...+ +..|+- +..+.+ T Consensus 470 ~~~~~d~a~l~G~G~~~--~p~Gi~---------------~~~~~~---------------------------------- 498 (632) T protein:vir:96 470 IGVALDLAMLTGTGLAN--DPVGLL---------------NMTGVP---------------------------------- 498 (632) T ss_pred HHHHHHHHhhcccCCCC--ccceee---------------eccccc---------------------------------- Confidence 99999999999973211 111220 000000 Q ss_pred ccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeCh Q lcl|Aclame:pro 201 QVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNR 280 (330) Q Consensus 201 ~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~ 280 (330) .+. ..+ ..++ -....++..++.. .+.+.++.+|+||. 
T Consensus 499 --~~~-~~~-----------------------------~~~~------~~~i~~~~~~i~~-----~~~~~~~~~~~~~~ 535 (632) T protein:vir:96 499 --ALT-YPA-----------------------------GGVD------WASVVDMETKIST-----FNADAGRLAYLTSV 535 (632) T ss_pred --cee-ccc-----------------------------ccCC------HHHHHHHHHHHhh-----cccccCccEEEEch Confidence 000 000 0000 0011122222211 12334567899999 Q ss_pred HHHHHHHHHh-hccccceeeecccCCcceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 281 NLREKLRLGI-VDKIANNLTWETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 281 ~v~~~L~~q~-~~~~~~~l~~~~~~g~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) .....|.... .+.....+... + .++|.|+..++++...-.... T Consensus 536 ~~~~~l~~~~l~d~~G~~i~~~---~----~l~G~pv~~s~~ip~~~~~~g 579 (632) T protein:vir:96 536 TQRGAAKKAQVFDNTGERIWQN---N----EVNGYRAEASNQIPADTWIFG 579 (632) T ss_pred hHHHHHHHHhccCCCCceeecC---C----eecccceEeccccccCcEEEe Confidence 8888888643 23333333221 1 468999999988875422111 No 94 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.98 E-value=1.5e-06 Score=52.59 Aligned_cols=212 Identities=13% Similarity=0.072 Sum_probs=122.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEe-ccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRT-GLPTPTWRKLYGGVLP-NKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~-~lP~~~fR~lN~g~~~-s~~t 78 (330) +..+. ..|..+ ...+-|......|++.+.+.++|++.++.+...++.+ .|.+.. 
+-+.++|..=+...++ +..+ T Consensus 106 ~~~~~--~~t~~~-gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 181 (389) T protein:vir:10 106 IDATS--KVTSTE-AGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKG-TYPILKRATDRFSSVAELAENPKLAEPE 181 (389) T ss_pred hhhhc--ccccCC-cceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCee-EEEEEecCCCcccccccccccccccccc Confidence 11110 011111 0011222345679999999999999999887765543 344333 3455567666667774 7889 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-.++-+.+.+.|-+.+.+... +..++ -.....++++......|++|+... T Consensus 182 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~la~~~~~~~~~~i~~g~~~~--------------------- 237 (389) T protein:vir:10 182 FNKVDWSVATYRGAIPLSEEAIADSAVDLTAL---VGQSIKEKSVNTYNAMIAPVLQSF--------------------- 237 (389) T ss_pred ceeeeeeheeeEeeehhhHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHHhhhhccc--------------------- Confidence 99999999999999999999887643 33333 334456777777777776653210 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) + |++ T Consensus 238 -------~---------------------~~~------------------------------------------------ 241 (389) T protein:vir:10 238 -------T---------------------AKK------------------------------------------------ 241 (389) T ss_pred -------c---------------------ccc------------------------------------------------ Confidence 0 000 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeeccc----CCcceEEECC Q lcl|Aclame:pro 238 
VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETV----SGERVMTFDG 313 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~----~g~~v~~~~g 313 (330) .++....++|++++...+. |. .+.+|+||++....|+. .++.....+-.... .+...-.+.| T Consensus 242 --------~~~~~~~d~l~~~~~~~~~--~~---~~a~~~~n~~~~~~L~~-lkd~~G~~i~~~~~~~~~~~~~~~~l~G 307 (389) T protein:vir:10 242 --------TTTDTLVDSLKHILNVDLD--PA---YSRALVVTQSLFNTLDT-LKDKNGRYLLHDASDSITDGTAKGTILG 307 (389) T ss_pred --------ccccccHHHHHHHHHhhhh--hh---hCcEEEecHHHHHHHHH-hhccCCCeeeecCccccccccccccccc Confidence 0111223455544432221 11 24689999999999996 45544334332222 2333457999 Q ss_pred eEEEEEeec-cCC---ccccC Q lcl|Aclame:pro 314 IPVQRTDAL-LNT---ESRVV 330 (330) Q Consensus 314 vpir~~dal-~~t---E~~Vv 330 (330) +||+.+++. +.+ +..++ T Consensus 308 ~pV~~~~~~~~~~~~~~~~~~ 328 (389) T protein:vir:10 308 VPVYVVGDTLLGSLAGDQKAF 328 (389) T ss_pred ceeEEecccccCCCCCceEEE Confidence 999887653 322 22233 No 95 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.97 E-value=6.2e-07 Score=54.62 Aligned_cols=249 Identities=13% Similarity=0.182 Sum_probs=134.7 Q ss_pred CC-------------ccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MA-------------TLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~-------------~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) +. ......-+... ...+-|+.....|++.+.+.++|+........+ + ...+.+.+..|.+.|.. 
T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~-g-~~~~~~~~~~~~a~wv~ 207 (466) T protein:vir:80 131 LIARSEVKEFLAQVRTLAQQKRAVSG-AELTIPDVMLELLRDNMHRYSKLISKVRLRPLK-G-TARQNIAGAIPEGVWTE 207 (466) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhcc-ccccccHHHHHHHHHhhhhhhhhhhheeeeecC-c-eeEeeeecCCcceeecc Confidence 00 00000000000 001223344556889999999999988876654 2 35677888999999999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhh Q lcl|Aclame:pro 68 LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSP 146 (330) Q Consensus 68 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~ 146 (330) -++.++++..++.+++-.++-+.+.+.|-+.+.+..+ +..++-. ....++++.....+||+||-...|. ||-. T Consensus 208 E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~la~~~~~~~~~ail~G~G~~~P~---Gil~ 281 (466) T protein:vir:80 208 AVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEIL---DAIGQAIGFALDKAILYGTGTKMPV---GIVT 281 (466) T ss_pred cccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHH---HHHHHHHHHHHhhheeeccCCCCcc---eeee Confidence 9999999999999999999999999999999988754 4555443 3467899999999999998665554 4422 Q ss_pred hhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeee Q lcl|Aclame:pro 147 RYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIG 226 (330) Q Consensus 147 R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~G 226 (330) . +.. .+....++. ++. 
...+.-...+ T Consensus 282 ~---~~~----------~~~~~~~~~------------------~~~-~~~~~~~~~~---------------------- 307 (466) T protein:vir:80 282 R---LAQ----------TTQPPNWGT------------------KAP-AWTNLSTTNL---------------------- 307 (466) T ss_pred c---ccc----------ccccccccc------------------ccc-cccccchhhh---------------------- Confidence 1 110 000000000 000 0000000000 Q ss_pred eEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCc Q lcl|Aclame:pro 227 LTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGE 306 (330) Q Consensus 227 l~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~ 306 (330) .+++.. ...+...+.+++.....+..+...+..+|.||......|....... +.........+. T Consensus 308 ------------~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~-~~~g~~~~~~~~ 371 (466) T protein:vir:80 308 ------------LKIDPT---GKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITF-NSAGALVASLNN 371 (466) T ss_pred ------------hhhhhh---ccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccc-cCCccccccCCC Confidence 000000 0001111112222222223445566778999999988876432111 111011000111 Q ss_pred ceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 307 RVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 307 ~v~~~~gvpir~~dal~~tE~~Vv 330 (330) . 
..+.|.||..++++...+ .++ T Consensus 372 ~-~~i~G~pvv~s~~~~~~~-~~~ 393 (466) T protein:vir:80 372 T-MPIVGGDIVILDFIPDND-IIG 393 (466) T ss_pred c-ccccccceeecCccCccc-eee Confidence 1 136789999998876543 222 No 96 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.97 E-value=1.4e-06 Score=52.71 Aligned_cols=212 Identities=16% Similarity=0.113 Sum_probs=119.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcce--eecCCccCcccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTW--RKLYGGVLPNKSS 78 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~f--R~lN~g~~~s~~t 78 (330) +.......+|..+ ...+-|......||+.+.+.++|++.++.+...++ ...|.+.+.-+..+| ..=+...+++..+ T Consensus 109 ~~~~~ra~~t~~~-gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~ 186 (421) T protein:vir:13 109 LSEEERDIMSSTN-NGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKDTELVKAMLK 186 (421) T ss_pred hhHHHhhccccCC-cceecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeeccccccccccccc Confidence 1000001111111 11122344567799999999999999998877544 456666676666555 5556678888899 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++-..+-+.+.+.|.+.+.+..+ +..++-. 
....+++.......+++ T Consensus 187 f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~---~~la~~~~~~~~~~i~~-------------------------- 237 (421) T protein:vir:13 187 TQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVN---EEFAEFAVNTENAEIVK-------------------------- 237 (421) T ss_pred eeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHH---HHHHHHHHHHhhhhHhh-------------------------- Confidence 99999999999999999998876643 2333322 22233333322111110 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) -|+|..+. . T Consensus 238 ----------------------------~~~g~~~~----------------~--------------------------- 246 (421) T protein:vir:13 238 ----------------------------QAKAVLAE----------------E--------------------------- 246 (421) T ss_pred ----------------------------hhhhcccc----------------c--------------------------- Confidence 01111000 0 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) +....+++++++. .++........|+||++....|+. .++....++-.+ ......-.++|.||. T Consensus 247 ----------~~~~~d~i~~~~~----~l~~~~~~~a~~v~n~~~~~~l~~-lkd~~G~~i~~~-~~~~~~~tl~G~pV~ 310 (421) T protein:vir:13 247 ----------TINDYAGLVKTIN----SLVPNARKRAIIVTNSDGRAYLDG-LMDKQGRPLLKE-LSDGGDLVFKGRPVI 310 (421) T ss_pred ----------cccchHHHHHHHH----HhhhhhcCCCEEEEcHHHHHHHHH-hhcCCCceeecC-cCCCCCceecceeeE Confidence 0001234444332 222222334689999999999995 455554555433 333334579999999 Q ss_pred EEeeccCCc---cccC Q lcl|Aclame:pro 318 RTDALLNTE---SRVV 330 (330) Q Consensus 318 ~~dal~~tE---~~Vv 330 (330) .+|++.... 
..++ T Consensus 311 ~~~~~~~~~~~~~~~~ 326 (421) T protein:vir:13 311 ELEESIFDVGDETKFI 326 (421) T ss_pred EeccccccCCCceEEE Confidence 999987421 1222 No 97 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=97.96 E-value=9.2e-08 Score=59.16 Aligned_cols=210 Identities=16% Similarity=0.154 Sum_probs=98.3 Q ss_pred CCccc------cccccHHHHHhhcC---cccchHHH---HHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeec Q lcl|Aclame:pro 1 MATLS------TNNPTMADVAKRLD---PNGKVDII---VEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKL 68 (330) Q Consensus 1 M~~~~------~~a~TL~E~Ak~~~---~d~~~~~V---IE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~l 68 (330) =++.. ..+-|.+|+=+++. .+....++ +++.++.=---..-.|+.+++. .+. |-.| T Consensus 74 ~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a-~n~-----------F~GL 141 (310) T protein:vir:97 74 AATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA-GNE-----------FAGL 141 (310) T ss_pred ccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC-CCc-----------ccch Confidence 00000 01123333322211 01111112 2221111111112233444321 111 2111 Q ss_pred CCccCcccceEEEEEEE---EEEecchhhhhHHHHHhC---CCHHHHHHH-HHHHHHHHHHHHHHHhhccC-CCCcC--h Q lcl|Aclame:pro 69 YGGVLPNKSSTAQVTDN---CGMLEAYAEVDKALADLN---GNTAAFRLS-EDRAQIEGMNQEVAQTLFYG-NDGIA--P 138 (330) Q Consensus 69 N~g~~~s~~t~~~~~~~---l~ilgg~~eVDk~la~~~---g~~~~~ra~-e~~~~ika~~~~~~~~~iyG-D~~~~--p 138 (330) =.-+.+ .|+.+. =+.+ .--+.|+-|.... |.+.-+.+. .+..+|++++++....-+|= +.... 
+ T Consensus 142 ~~~~~~-----~q~i~~~~~gg~~-t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~ 215 (310) T protein:vir:97 142 IQLCAS-----GQKATTGATGSAI-SFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAE 215 (310) T ss_pred hhcCCc-----cceeecCCCCCCC-CHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCE Confidence 111111 111110 0111 1247888787652 333333332 23346777777664332322 11111 1 Q ss_pred -hhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCC----cEEEEccccccccceeccccccccccccccCCc Q lcl|Aclame:pro 139 -AEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPN----TCHSIYPKGSKAGLSVEDKGQVTIENADGNGGR 213 (330) Q Consensus 139 -~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~----~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~ 213 (330) ..+.|+.-.-.+.-+.+ ...++++++||||+|++|++ .+||++..| +.||+++++|+. ++.+ T Consensus 216 v~~~~GiPi~~~d~ip~~-----~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~-~~glsVr~~G~~--~~~~----- 282 (310) T protein:vir:97 216 VPAYSGTPIFRNDYIPTN-----QTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQ-AAGIQVVDVGES--EDSD----- 282 (310) T ss_pred EeeeCCeEEEEeCccCCC-----ccccccCCceeEEEEeeCccccccceeccccCC-ccceeEEeCCcc--cCCc----- Confidence 24556543322222221 12356777999999999985 578887543 679999999952 2222 Q ss_pred eeEEEEEEeeeeeeEEeccccEEEeecccc Q lcl|Aclame:pro 214 MEGYRTHYKWDIGLTLRDWRYVARVCNIDV 243 (330) Q Consensus 214 ~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~ 243 (330) +++++++||||++|.+.++++++.||-- T Consensus 283 --v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 283 --EHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred --ceeEEEEEeeeEEEecccceeeeccccC Confidence 3477889999999999999999999942 No 98 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.89 E-value=9.8e-07 Score=53.51 Aligned_cols=246 Identities=17% Similarity=0.177 Sum_probs=123.6 Q ss_pred CCccccccccHHH------HHhh-------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 
MATLSTNNPTMAD------VAKR-------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~~~a~TL~E------~Ak~-------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) +.......+|-.| +.+. +-|......|+|.+.+.++|+..+.+.... ......+.++-|.++|-. T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~w~~ 134 (381) T protein:vir:10 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC--cceEEEEecCCcceeeec Confidence 1111112222222 1111 123445678999999999999999887653 346788888999999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) .+++++ ++..++.+++-..+-|.+.+.|-+.|.+... +..+|-.. ...++++....+.|++||-...|.+ |- T Consensus 135 e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~---~la~~~a~~~~~a~i~G~G~~qP~G---il 208 (381) T protein:vir:10 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV---QIEEAFAVALETAFLKGTGKDQPIG---LN 208 (381) T ss_pred ccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHH---HHHHHHHHHhhheeEeccCCCCcee---ee Confidence 999887 4578999999999999999999999988754 55555443 4568999999999999997665544 31 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) .- + .+.... ++| .+|.....+ ...+. 
T Consensus 209 ~~---~---~~~~~~-~~g--------------------~~~~~~~~~-t~t~~-------------------------- 234 (381) T protein:vir:10 209 RQ---V---QKGVSV-TEG--------------------AYPEKEEQG-TLTFA-------------------------- 234 (381) T ss_pred ec---c---Cccccc-ccc--------------------ccccccccc-ccccc-------------------------- Confidence 10 1 111000 111 111110000 00000 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHH----hccCCCCCCEEEEeChHHHHHHHHHhhcc--cccee- Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAE----RIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNL- 298 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~----~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l- 298 (330) ++ ....+.|.+ +..+.- .-+....++.+|.||+.....|+.+.... ....+ T Consensus 235 --------------~~-------~~~~~~l~~-~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~ 292 (381) T protein:vir:10 235 --------------NP-------RATVNELTQ-VFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT 292 (381) T ss_pred --------------cc-------hhhHHHHHH-HHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceee Confidence 00 000000000 000000 00011122333444444333332211000 00000 Q ss_pred ---------eecccC------Cc----ceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 299 ---------TWETVS------GE----RVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 299 ---------~~~~~~------g~----~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ..+..+ |. .+....|+-|+++|.....+..+. 
T Consensus 293 ~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~ 343 (381) T protein:vir:10 293 ALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDL 343 (381) T ss_pred cCCCCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeE Confidence 000000 11 112334566666666555555555 No 99 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.89 E-value=9.8e-07 Score=53.51 Aligned_cols=246 Identities=17% Similarity=0.177 Sum_probs=123.6 Q ss_pred CCccccccccHHH------HHhh-------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MATLSTNNPTMAD------VAKR-------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~~~a~TL~E------~Ak~-------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) +.......+|-.| +.+. +-|......|+|.+.+.++|+..+.+.... ......+.++-|.++|-. T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~w~~ 134 (381) T protein:vir:95 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC--cceEEEEecCCcceeeec Confidence 1111112222222 1111 123445678999999999999999887653 346788888999999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) .+++++ ++..++.+++-..+-|.+.+.|-+.|.+... +..+|-.. 
...++++....+.|++||-...|.+ |- T Consensus 135 e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~---~la~~~a~~~~~a~i~G~G~~qP~G---il 208 (381) T protein:vir:95 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV---QIEEAFAVALETAFLKGTGKDQPIG---LN 208 (381) T ss_pred ccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHH---HHHHHHHHHhhheeEeccCCCCcee---ee Confidence 999887 4578999999999999999999999988754 55555443 4568999999999999997665544 31 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) .- + .+.... ++| .+|.....+ ...+. T Consensus 209 ~~---~---~~~~~~-~~g--------------------~~~~~~~~~-t~t~~-------------------------- 234 (381) T protein:vir:95 209 RQ---V---QKGVSV-TEG--------------------AYPEKEEQG-TLTFA-------------------------- 234 (381) T ss_pred ec---c---Cccccc-ccc--------------------ccccccccc-ccccc-------------------------- Confidence 10 1 111000 111 111110000 00000 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHH----hccCCCCCCEEEEeChHHHHHHHHHhhcc--cccee- Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAE----RIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNL- 298 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~----~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l- 298 (330) ++ ....+.|.+ +..+.- .-+....++.+|.||+.....|+.+.... 
....+ T Consensus 235 --------------~~-------~~~~~~l~~-~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~ 292 (381) T protein:vir:95 235 --------------NP-------RATVNELTQ-VFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT 292 (381) T ss_pred --------------cc-------hhhHHHHHH-HHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceee Confidence 00 000000000 000000 00011122333444444333332211000 00000 Q ss_pred ---------eecccC------Cc----ceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 299 ---------TWETVS------GE----RVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 299 ---------~~~~~~------g~----~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ..+..+ |. .+....|+-|+++|.....+..+. T Consensus 293 ~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~ 343 (381) T protein:vir:95 293 ALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDL 343 (381) T ss_pred cCCCCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeE Confidence 000000 11 112334566666666555555555 No 100 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.84 E-value=3e-06 Score=50.84 Aligned_cols=232 Identities=12% Similarity=0.148 Sum_probs=129.5 Q ss_pred CCccccccccHHHHH------hhc-------CcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MATLSTNNPTMADVA------KRL-------DPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~~~a~TL~E~A------k~~-------~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) +.......+|..|-+ +.. -|......||+.+.+.|+|++...+.... + .....+.++-|.+.|.. 
T Consensus 67 ~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a~w~~ 144 (395) T protein:vir:95 67 LAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAG-I-KTRVIKADPAGQAVWGK 144 (395) T ss_pred HhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCCcceEEee Confidence 222222233332211 111 23335677999999999999999887664 3 35677788889999998 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--ChhhccC Q lcl|Aclame:pro 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGI--APAEFTG 143 (330) Q Consensus 68 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~--~p~~F~G 143 (330) .+...+ ++.+++.+++-.++-|.+.+.|.+.|.+..+ |..+|-.. ...++++....+.|++||-.. .|.++ T Consensus 145 e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~---~la~~ia~~~~~a~i~G~G~~~~qP~Gi-- 219 (395) T protein:vir:95 145 VFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRT---QIQEAISVALESAIINGGGAAKTQPVGL-- 219 (395) T ss_pred cccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHH---HHHHHHHHHHhhheeeccCCCCcCceee-- Confidence 877775 5789999999999999999999999988754 56666543 456899999999999998543 14432 Q ss_pred hhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEee Q lcl|Aclame:pro 144 LSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKW 223 (330) Q Consensus 144 L~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w 223 (330) -.. . ++... -++.+..++.... 
T Consensus 220 -l~~---~-----------~~~~~-----------------~~~~~~~~~~~t~-------------------------- 241 (395) T protein:vir:95 220 -MKD---V-----------NTNSG-----------------AVTDKASSGTLTF-------------------------- 241 (395) T ss_pred -eec---c-----------ccccc-----------------ccccccccchhhh-------------------------- Confidence 110 0 00000 0000111110000 Q ss_pred eeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHh----ccCCCCCCEEEEeChHHHHHHHHHhhccccceee Q lcl|Aclame:pro 224 DIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAER----IPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLT 299 (330) Q Consensus 224 ~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~----ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~ 299 (330) .+++ .....|.++. ..... -.....++..|.||++-+.-+. .... T Consensus 242 ---------------~~~~-------~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~--------g~~~ 290 (395) T protein:vir:95 242 ---------------ADAD-------TTILELNDVL-KNLSVDEKGKELKIDGKVALVVNPRDSWDVQ--------ARYT 290 (395) T ss_pred ---------------hhhH-------hhHHHHHHHH-HhhccccccchhhhcCceEEEEcchhhhhcC--------Ccce Confidence 0111 0001111100 00000 0112234567889976543211 1222 Q ss_pred ecccCCcceEEE-CCeEEEEEeeccCCccccC Q lcl|Aclame:pro 300 WETVSGERVMTF-DGIPVQRTDALLNTESRVV 330 (330) Q Consensus 300 ~~~~~g~~v~~~-~gvpir~~dal~~tE~~Vv 330 (330) +.+..|..++-+ .|+||..++++...+ |+ T Consensus 291 ~~~~~G~~~~~lg~g~~v~~~~~~p~~~--i~ 320 (395) T protein:vir:95 291 YLTANGGFVTVLPYNVTIITSEFVPEGK--LV 320 (395) T ss_pred eccCCCcceeccCCcceEEEcCCCCCCc--EE Confidence 223455555443 578888888887544 33 No 101 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.81 E-value=2e-06 Score=51.78 Aligned_cols=210 Identities=8% Similarity=0.031 Sum_probs=121.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEE-eccCCcceeecCCccCc-ccce Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVR-TGLPTPTWRKLYGGVLP-NKSS 78 (330) Q 
Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~-~~lP~~~fR~lN~g~~~-s~~t 78 (330) +..-.....+..+ ....-+......|++ +.+.++|+..+..+....+. +.+.+. .+-..++|..=++..++ +..+ T Consensus 127 ~~~~~~~~~~~~~-~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 203 (397) T protein:vir:96 127 KGAEKRDGFTSVE-GGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSAS-GKFPVISKSGSKMATVQQLEKNPQLANPK 203 (397) T ss_pred hhhhhhhcccccc-cccchhHHHHHHHHH-hhhhhhHHHhhhhccccccc-eeEEEEeccCCcccccccccccccccccc Confidence 1000001111111 111223334556776 56778888888887654443 333332 23355667666666775 6799 Q ss_pred EEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 79 TAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 79 ~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) +.+++..++-+.+.+.|.+.+.+... +..++- .....++++......|++|+.... T Consensus 204 ~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i---~~~l~~~~~~~~~~~i~~g~g~~~-------------------- 260 (397) T protein:vir:96 204 MVEIDYSVATRRGYIPISQEMIDDASYDVTGLI---ADEIQDQSLNTKNADIAAVLKTAT-------------------- 260 (397) T ss_pred ccceeecHhHhhcchhhHHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHHHhhcccccc-------------------- Confidence 99999999999999999998888754 344443 344568888888888887753211 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) |.| + T Consensus 261 -----------------------------~~~----~------------------------------------------- 264 (397) T protein:vir:96 261 -----------------------------AKS----V------------------------------------------- 264 (397) T ss_pred -----------------------------ccc----c------------------------------------------- Confidence 000 0 Q ss_pred 
eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ...++|++++..+. +.. .+.+|+||.+....|+. .++....++-...+.+...-.+.|.||. T Consensus 265 ------------~~~d~~~~~~~~~~---~~~--~~a~~v~n~~~~~~l~~-lkd~~G~~~~~~~~~~~~~~~l~G~pv~ 326 (397) T protein:vir:96 265 ------------VGVDGLKDLINKEI---KKV--YDVKLFISASMYSELDK-LKDKNGRYLLQDSITAASGKQLLGKEVV 326 (397) T ss_pred ------------cchHHHHHHHHHhh---hhh--cCcEEEEcHHHHHHHHH-hhccCCCeEeccCccCCCcccccccceE Confidence 01234444443222 221 24689999999999986 4566555554334444445579999999 Q ss_pred EEeec-cCCccc---cC Q lcl|Aclame:pro 318 RTDAL-LNTESR---VV 330 (330) Q Consensus 318 ~~dal-~~tE~~---Vv 330 (330) .++.. +.++.- ++ T Consensus 327 ~~~~~~~~~~~~~~~~~ 343 (397) T protein:vir:96 327 VLDDDVIGKSVGNVVGF 343 (397) T ss_pred EecccccCCCCCceEEE Confidence 87654 333322 12 No 102 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=97.81 E-value=1.6e-06 Score=52.42 Aligned_cols=215 Identities=13% Similarity=0.080 Sum_probs=117.7 Q ss_pred CCc-ccc-ccccHHHHH----------hhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE-EeccCCcceee Q lcl|Aclame:pro 1 MAT-LST-NNPTMADVA----------KRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV-RTGLPTPTWRK 67 (330) Q Consensus 1 M~~-~~~-~a~TL~E~A----------k~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~-~~~lP~~~fR~ 67 (330) ..- ... ...+..+.+ --+-|......||+.+.+.++|.+........+ ..+.+ ..+.++++|.. 
T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~ 176 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFIT 176 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC---ceEEEEeecCCcccccc Confidence 000 000 000000000 001233345679999999999998888765432 22332 34567899999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQ-TLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~-~~iyGD~~~~p~~F~GL~ 145 (330) =++..++++.++.+++-.++-+.+.+.|.+.|.+-. .+..+|-.. ...++++.+-.. .|.+|+. T Consensus 177 E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~---~la~~~~~~e~~~~~~~g~g----------- 242 (387) T protein:vir:93 177 DVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVEN---ALQSGLAAKERKDALAVSPK----------- 242 (387) T ss_pred CcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHH---HHHHHHHHHHHHhHhhcCCC----------- Confidence 999999999999999999999999999999887753 455555443 344555554222 2333321 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) +|-. 
+|+.- ++++ T Consensus 243 -------------------~g~p--------------~g~l~---~~~~------------------------------- 255 (387) T protein:vir:93 243 -------------------SGLD--------------HMSFY---NGSV------------------------------- 255 (387) T ss_pred -------------------cccc--------------ceeee---cccc------------------------------- Confidence 1100 01100 0000 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCC Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG 305 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g 305 (330) ..+ ...++.|.++.+++.++.....+.+|+||+.....+.....+.....+. | T Consensus 256 -------------~~v---------~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~-----~ 308 (387) T protein:vir:93 256 -------------KEV---------EGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----T 308 (387) T ss_pred -------------ccc---------cccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----c Confidence 000 0111223334455666655566778999987765555444444333321 1 Q ss_pred cceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 306 ERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 306 ~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ....+.|.||..+|.... 
.++ T Consensus 309 -~~~~llG~PV~~~~~~~~---~~~ 329 (387) T protein:vir:93 309 -PAEKVFGKPVVFTDAAVK---PIV 329 (387) T ss_pred -CCccccccceEEecCCCc---eee Confidence 223577999999986431 122 No 103 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=97.77 E-value=1.2e-06 Score=53.11 Aligned_cols=231 Identities=17% Similarity=0.140 Sum_probs=121.5 Q ss_pred CCccccccccHHHHHhhc----------------------------CcccchHHHHHHHhccc---hhHhhcceeeccCC Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRL----------------------------DPNGKVDIIVEMLNQTN---PVLQDMTAIEGNLP 49 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~----------------------------~~d~~~~~VIE~l~~~s---~iL~~lpf~e~n~g 49 (330) ||. ...++-++ ++.+ ....+.. -|-.|+..+ -++.+++-..+ .. T Consensus 1 ~~~--~~~~~~~~-~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~-~i~~Lt~~~~~f~~~~~i~k~~a-~S 75 (463) T protein:vir:95 1 MTI--EKNLSDVQ-QKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDD-QITMLTWTNEDLIFYRDISRRPA-QS 75 (463) T ss_pred CCc--ccccchHH-HHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhh-hhheeeecccchhhhhhcCCchh-hh Confidence 332 11122111 0000 0000110 011111111 12333333222 12 Q ss_pred ccceeEEEeccCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 50 TGHRTSVRTGLPT---PTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVA 126 (330) Q Consensus 50 ~~~~~~~~~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~ 126 (330) |-|+|.+...--+ ..|-.=-.-.+-+.+++.+++..++.++.--.|-...-..++ ..+-.+.+.+..|..+.++++ T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~-~~d~~~~~~~dai~~ia~tiE 154 (463) T protein:vir:95 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN-IADPSQILTEDAIAVVAKTIE 154 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcc-cccHHHHHHHHHHHHHHHHHH Confidence 3466665554444 344222223455678999999999999999999886655444 446678888889999999999 Q 
ss_pred HhhccCCCCcChh------hccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceecccc Q lcl|Aclame:pro 127 QTLFYGNDGIAPA------EFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKG 200 (330) Q Consensus 127 ~~~iyGD~~~~p~------~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g 200 (330) ..+||||+...|+ +||||.+.. ++.|+|||.|.- .|+-++-|.. T Consensus 155 ~a~FyGds~l~~~~~~~gleFDGl~~lI------d~enviDarG~~---Ls~~~ln~Aa--------------------- 204 (463) T protein:vir:95 155 WASFYGDASLTSEVEGEGLEFDGLAKLI------DKNNVINAKGNQ---LTEKHLNEAA--------------------- 204 (463) T ss_pred HHHhhhhhccCCCcCccccchhhhhhhc------CCCCeeecCCCc---ccHHHHhhhh--------------------- Confidence 9999999999996 899997765 678999998543 1111111110 Q ss_pred ccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeCh Q lcl|Aclame:pro 201 QVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNR 280 (330) Q Consensus 201 ~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~ 280 (330) .++.. +|+ --+-+||+- T Consensus 205 -~~i~~---------~fG-----------------------------------------------------t~TD~~lp~ 221 (463) T protein:vir:95 205 -VRIGK---------GFG-----------------------------------------------------TATDAYMPI 221 (463) T ss_pred -hhhhc---------ccC-----------------------------------------------------Chhheecch Confidence 00000 000 012355666 Q ss_pred HHHHHHHHHhhccccceeeeccc-------CCcceEEECCeEEE-----EEeeccCCccccC Q lcl|Aclame:pro 281 NLREKLRLGIVDKIANNLTWETV-------SGERVMTFDGIPVQ-----RTDALLNTESRVV 330 (330) Q Consensus 281 ~v~~~L~~q~~~~~~~~l~~~~~-------~g~~v~~~~gvpir-----~~dal~~tE~~Vv 330 (330) -+...|.-+...+..+.+..+.- ..+.+++-+.|.+. 
..+++++-|...+ T Consensus 222 ~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~ 283 (463) T protein:vir:95 222 GVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPL 283 (463) T ss_pred HHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcC Confidence 66666665555554444433322 12334433334333 6677777664433 No 104 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=97.77 E-value=1.2e-06 Score=53.11 Aligned_cols=231 Identities=17% Similarity=0.140 Sum_probs=121.5 Q ss_pred CCccccccccHHHHHhhc----------------------------CcccchHHHHHHHhccc---hhHhhcceeeccCC Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRL----------------------------DPNGKVDIIVEMLNQTN---PVLQDMTAIEGNLP 49 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~----------------------------~~d~~~~~VIE~l~~~s---~iL~~lpf~e~n~g 49 (330) ||. ...++-++ ++.+ ....+.. -|-.|+..+ -++.+++-..+ .. 
T Consensus 1 ~~~--~~~~~~~~-~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~-~i~~Lt~~~~~f~~~~~i~k~~a-~S 75 (463) T protein:vir:99 1 MTI--EKNLSDVQ-QKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDD-QITMLTWTNEDLIFYRDISRRPA-QS 75 (463) T ss_pred CCc--ccccchHH-HHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhh-hhheeeecccchhhhhhcCCchh-hh Confidence 332 11122111 0000 0000110 011111111 12333333222 12 Q ss_pred ccceeEEEeccCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 50 TGHRTSVRTGLPT---PTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVA 126 (330) Q Consensus 50 ~~~~~~~~~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~ 126 (330) |-|+|.+...--+ ..|-.=-.-.+-+.+++.+++..++.++.--.|-...-..++ ..+-.+.+.+..|..+.++++ T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~-~~d~~~~~~~dai~~ia~tiE 154 (463) T protein:vir:99 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN-IADPSQILTEDAIAVVAKTIE 154 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcc-cccHHHHHHHHHHHHHHHHHH Confidence 3466665554444 344222223455678999999999999999999886655444 446678888889999999999 Q ss_pred HhhccCCCCcChh------hccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceecccc Q lcl|Aclame:pro 127 QTLFYGNDGIAPA------EFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKG 200 (330) Q Consensus 127 ~~~iyGD~~~~p~------~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g 200 (330) ..+||||+...|+ +||||.+.. ++.|+|||.|.- .|+-++-|.. 
T Consensus 155 ~a~FyGds~l~~~~~~~gleFDGl~~lI------d~enviDarG~~---Ls~~~ln~Aa--------------------- 204 (463) T protein:vir:99 155 WASFYGDASLTSEVEGEGLEFDGLAKLI------DKNNVINAKGNQ---LTEKHLNEAA--------------------- 204 (463) T ss_pred HHHhhhhhccCCCcCccccchhhhhhhc------CCCCeeecCCCc---ccHHHHhhhh--------------------- Confidence 9999999999996 899997765 678999998543 1111111110 Q ss_pred ccccccccccCCceeEEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeCh Q lcl|Aclame:pro 201 QVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNR 280 (330) Q Consensus 201 ~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~ 280 (330) .++.. +|+ --+-+||+- T Consensus 205 -~~i~~---------~fG-----------------------------------------------------t~TD~~lp~ 221 (463) T protein:vir:99 205 -VRIGK---------GFG-----------------------------------------------------TATDAYMPI 221 (463) T ss_pred -hhhhc---------ccC-----------------------------------------------------Chhheecch Confidence 00000 000 012355666 Q ss_pred HHHHHHHHHhhccccceeeeccc-------CCcceEEECCeEEE-----EEeeccCCccccC Q lcl|Aclame:pro 281 NLREKLRLGIVDKIANNLTWETV-------SGERVMTFDGIPVQ-----RTDALLNTESRVV 330 (330) Q Consensus 281 ~v~~~L~~q~~~~~~~~l~~~~~-------~g~~v~~~~gvpir-----~~dal~~tE~~Vv 330 (330) -+...|.-+...+..+.+..+.- ..+.+++-+.|.+. 
..+++++-|...+ T Consensus 222 ~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~ 283 (463) T protein:vir:99 222 GVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPL 283 (463) T ss_pred HHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcC Confidence 66666665555554444433322 12334433334333 6677777664433 No 105 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.76 E-value=9.8e-07 Score=53.54 Aligned_cols=267 Identities=14% Similarity=0.112 Sum_probs=126.1 Q ss_pred CCccccccccHHH------HHhh-------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MATLSTNNPTMAD------VAKR-------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~~~a~TL~E------~Ak~-------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) +.......+|-.| +.+. +-|......|+|.|.+.|+|+..+.+.... + .+...+.++-|.++|.. T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~-~-~~~i~~~~~~~~a~w~~ 141 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTG-L-RTKFLKSETSGVAVWGK 141 (383) T ss_pred HhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecC-C-ceEEEEEcCCcceEEee Confidence 1111112222222 0111 123346678999999999999999887653 3 46788899999999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) ...+.+ .+.+++.+++-.++-|.+.+.|.+.|.+..+ +..+|-.. ...++++....+.||+||-...|.+|. 
T Consensus 142 e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~---~l~~~~a~~~~~a~i~G~G~~qP~Gil--- 215 (383) T protein:vir:78 142 IFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVT---QIEEAFAVALESAYIVGDGNDKPIGLN--- 215 (383) T ss_pred cccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHH---HHHHHHHHHHhhheEeccCCCCceeee--- Confidence 988875 5689999999999999999999999988755 55555554 456899999999999999766555541 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) . +.++++ ..+++.+|.....+....+.-...+. .+.+|.....|.. T Consensus 216 ~--------------~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~-------~l~~~~~~~~~~~ 261 (383) T protein:vir:78 216 R--------------KVGKGS-------------TVVDGVYAEKAATGTLTFANPKTTVN-------ELTDVYKYHSVKE 261 (383) T ss_pred e--------------ccCCcc-------------cccccccccccccchhhhhhhHHHHH-------HHHHHHhccchhc Confidence 1 111111 01223333332222111110000000 0000000000000 Q ss_pred eeEEeccccEE---EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeC--hHHHHHHHHHhhccccceeee Q lcl|Aclame:pro 226 GLTLRDWRYVA---RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMN--RNLREKLRLGIVDKIANNLTW 300 (330) Q Consensus 226 Gl~v~d~r~v~---RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n--~~v~~~L~~q~~~~~~~~l~~ 300 (330) .-..++++. =+.|= .+..+.+.... .....|.++-.++ ..+. .-..... +. .-. 
T Consensus 262 --~~~~~~~~~~~~~~~n~-----------~~~~~~~~~~~---~~~~~G~~~t~l~~~~~iv---~s~~~p~-~~-iif 320 (383) T protein:vir:78 262 --NGHPLNVAGKVTLLVNP-----------TDAWDVKKQYT---SLNANGVYVTALPFNLNII---ESLFVPE-KK-AIS 320 (383) T ss_pred --ccchhhhcCceEEEEcC-----------cchhhhccchh---ccCCCCceeeecCCCceEE---ecCCCCc-cc-EEE Confidence 000011111 11110 00000000000 0011222221111 0000 0000000 00 001 Q ss_pred cccCCcceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 301 ETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 301 ~~~~g~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) .++.-=.+....|+-|.++|.....+..++ T Consensus 321 gdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~ 350 (383) T protein:vir:78 321 YVAERYDALIGGPLDIGTYDQTLAIEDLNL 350 (383) T ss_pred eeccceEEEecccceEEecchhhhhcCceE Confidence 111111123345677777777666665555 No 106 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=97.75 E-value=6.5e-06 Score=49.02 Aligned_cols=215 Identities=13% Similarity=0.076 Sum_probs=119.2 Q ss_pred CCccc--cccccHHHHH-----------hhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MATLS--TNNPTMADVA-----------KRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~--~~a~TL~E~A-----------k~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) |.... .......+.. -.+-|......||+.+.+.++|.+...++...+. ....+..+.+++.|.. 
T Consensus 64 ~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~--~~p~~~~~~~~a~~v~ 141 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EIPRVSYTLDDDDFIT 141 (352) T ss_pred hhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCc--eEEEEecCCCcccccc Confidence 00000 0000000000 0022334556799999999999998888765322 2223344568899999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHH-HHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQE-VAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~-~~~~~iyGD~~~~p~~F~GL~ 145 (330) =++..++++.++.+++-..+-+++.+.|.+.|.+. ..+..++-..+. .++++.. .+..|..|+. T Consensus 142 E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~l---a~~~~~~e~~~~~~~g~g----------- 207 (352) T protein:vir:78 142 DVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENAL---QSGLAAKERKDALAVSPK----------- 207 (352) T ss_pred cccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHH---HHHHHHHHHHhhhhcCCC----------- Confidence 99999999999999999999999999999998776 446666655443 3444433 2222333321 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) +|.... 
+++-+ ++ T Consensus 208 -------------------~~~~~g-------------~l~~~----~~------------------------------- 220 (352) T protein:vir:78 208 -------------------SGLEHM-------------SFYNG----SV------------------------------- 220 (352) T ss_pred -------------------Cccccc-------------ceecc----cc------------------------------- Confidence 110000 00000 00 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCC Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG 305 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g 305 (330) .........++|++ +++.++.....+.+|+||+.....|.....+.....+. | T Consensus 221 ------------------~~~t~~~~~d~i~~----~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~-----~ 273 (352) T protein:vir:78 221 ------------------KEVEGANMYDAIIN----ALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----T 273 (352) T ss_pred ------------------ccccccchHHHHHH----HHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccc-----c Confidence 00011112234444 44555555455678999999888877654443333321 2 Q ss_pred cceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 306 ERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 306 ~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ....+.|.||..+|... 
.++ T Consensus 274 -~~~~llG~PV~~~~~~~----~~~ 293 (352) T protein:vir:78 274 -PAEKVFGKPVVFTDAAV----KPI 293 (352) T ss_pred -CCccccccceEEecCCC----cee Confidence 22346699999998543 233 No 107 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=97.68 E-value=7e-06 Score=48.83 Aligned_cols=211 Identities=14% Similarity=0.088 Sum_probs=117.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE-EeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV-RTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~-~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ...+..+..+ + .-.+-|......||+.+.+.++|.+.+.++...+ ..+.+ -.+.++++|..=++..++++.++ T Consensus 130 ~~a~~~~t~~--~-GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f 203 (402) T protein:vir:93 130 LHALPTGNDS--G-GDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVETAKELKAKG 203 (402) T ss_pred HhhhccCCCc--C-CccccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCcccccccccccccccccc Confidence 0000000000 0 0001123344569999999999999888865532 22333 33567899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQ-TLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~-~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) .+++-..+-+.+.+.|.+.+.+- ..+..+|-..+ ..++++..-.. 
.|..|+ T Consensus 204 ~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~------------------------ 256 (402) T protein:vir:93 204 DTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP------------------------ 256 (402) T ss_pred ceeeecceeeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC------------------------ Confidence 99999999999999999987775 34665554433 33455543222 222232 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~R 237 (330) |+|.. +|+.- .+++ T Consensus 257 ------g~g~p--------------~g~~~---~~~~------------------------------------------- 270 (402) T protein:vir:93 257 ------KSGLE--------------HMSFY---NGSV------------------------------------------- 270 (402) T ss_pred ------Ccccc--------------ceeee---cccc------------------------------------------- Confidence 11100 11100 0000 Q ss_pred eecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEE Q lcl|Aclame:pro 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) Q Consensus 238 I~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir 317 (330) ..+ ......++|+ .+++.++.....+.+|+||+.....|.....+... .+.. | ....+.|.||. T Consensus 271 -~~~-----~~~~~~d~l~----~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~-~~~~----~-~~~~llG~PV~ 334 (402) T protein:vir:93 271 -KEV-----EGADMYDAII----NALADLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFFD----T-PAEKVFGKPVV 334 (402) T ss_pred -ccc-----cccchHHHHH----HHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC-cccc----c-CCccccccceE Confidence 000 0011123444 44556655555677899998887666655444433 2221 1 22356799999 Q ss_pred EEeeccCCccccC Q lcl|Aclame:pro 318 RTDALLNTESRVV 330 (330) Q Consensus 318 ~~dal~~tE~~Vv 330 (330) .+|... 
.++ T Consensus 335 ~t~~~~----~i~ 343 (402) T protein:vir:93 335 FTDAAV----KPI 343 (402) T ss_pred EecCCC----cee Confidence 998643 233 No 108 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=97.64 E-value=9.1e-06 Score=48.22 Aligned_cols=209 Identities=15% Similarity=0.108 Sum_probs=120.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhccee----eccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAI----EGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~----e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||.- .-++.+ .+-|.-..+.|+|.+.+.+.+ ..+... ++..|......+....+++.|..=++..++++ T Consensus 1 MA~~---~T~~~~---~~iPev~s~~v~~~~~~~~~~-~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~ 73 (272) T protein:vir:98 1 MAVG---TTKMAQ---MLDPEVLADMIDAEVGKAIRF-APLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQ 73 (272) T ss_pred CCCc---cccchh---eechHHHHHHHHHHHHHHhhh-hccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccc Confidence 6642 224444 344555556677777665543 333322 23334345566667789999999899999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) .++.+++..++-++..+.|+.....+ .++..+.. .++..+++++++...+|.- +.. 
T Consensus 74 ~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~---~~~~~~~~a~~~d~~i~~~------------------~~~-- 130 (272) T protein:vir:98 74 LGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQA---AKQIVEAIDHKVDADVLDA------------------LSK-- 130 (272) T ss_pred cccceEEEEeeeeeeeeeecHHHHhhccccHHHHH---HHHHHHHHHHHHHHHHHHH------------------hcc-- Confidence 99999999999999999999877655 45544333 3445677777777665521 000 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) ..+.+ .+ T Consensus 131 a~~~~-----------------------------------------------~~-------------------------- 137 (272) T protein:vir:98 131 STQTV-----------------------------------------------EA-------------------------- 137 (272) T ss_pred ccccc-----------------------------------------------cc-------------------------- Confidence 00000 00 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc---ccceeeecccCCcceEEEC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK---IANNLTWETVSGERVMTFD 312 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~---~~~~l~~~~~~g~~v~~~~ 312 (330) ....+++.++ ...+........+|+||..+...|++..... .+.... .....-.+-.+. 
T Consensus 138 -------------~~t~d~i~da----~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~-~~~~~g~ig~i~ 199 (272) T protein:vir:98 138 -------------TATVDGVSKA----LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGA-NRVVSGVYGEVL 199 (272) T ss_pred -------------ccCHHHHHHH----HHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccc-cccccccchhhc Confidence 0001223332 2223222333458999999999998653221 111111 111112345789 Q ss_pred CeEEEEEeeccCCccccC Q lcl|Aclame:pro 313 GIPVQRTDALLNTESRVV 330 (330) Q Consensus 313 gvpir~~dal~~tE~~Vv 330 (330) |+||..++++...-.-++ T Consensus 200 G~~Vi~s~~~p~~t~~~~ 217 (272) T protein:vir:98 200 GVQIVRSRKCPKGTAYMV 217 (272) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999999975443333 No 109 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=97.64 E-value=9.1e-06 Score=48.22 Aligned_cols=209 Identities=15% Similarity=0.108 Sum_probs=120.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhccee----eccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAI----EGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~----e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||.- .-++.+ .+-|.-..+.|+|.+.+.+.+ ..+... 
++..|......+....+++.|..=++..++++ T Consensus 1 MA~~---~T~~~~---~~iPev~s~~v~~~~~~~~~~-~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~ 73 (272) T protein:vir:30 1 MAVG---TTKMAQ---MLDPEVLADMIDAEVGKAIRF-APLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQ 73 (272) T ss_pred CCCc---cccchh---eechHHHHHHHHHHHHHHhhh-hccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccc Confidence 6642 224444 344555556677777665543 333322 23334345566667789999999899999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) .++.+++..++-++..+.|+.....+ .++..+.. .++..+++++++...+|.- +.. T Consensus 74 ~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~---~~~~~~~~a~~~d~~i~~~------------------~~~-- 130 (272) T protein:vir:30 74 LGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQA---AKQIVEAIDHKVDADVLDA------------------LSK-- 130 (272) T ss_pred cccceEEEEeeeeeeeeeecHHHHhhccccHHHHH---HHHHHHHHHHHHHHHHHHH------------------hcc-- Confidence 99999999999999999999877655 45544333 3445677777777665521 000 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) ..+.+ .+ T Consensus 131 a~~~~-----------------------------------------------~~-------------------------- 137 (272) T protein:vir:30 131 STQTV-----------------------------------------------EA-------------------------- 137 (272) T ss_pred ccccc-----------------------------------------------cc-------------------------- Confidence 00000 00 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc---ccceeeecccCCcceEEEC Q lcl|Aclame:pro 236 
ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK---IANNLTWETVSGERVMTFD 312 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~---~~~~l~~~~~~g~~v~~~~ 312 (330) ....+++.++ ...+........+|+||..+...|++..... .+.... .....-.+-.+. T Consensus 138 -------------~~t~d~i~da----~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~-~~~~~g~ig~i~ 199 (272) T protein:vir:30 138 -------------TATVDGVSKA----LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGA-NRVVSGVYGEVL 199 (272) T ss_pred -------------ccCHHHHHHH----HHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccc-cccccccchhhc Confidence 0001223332 2223222333458999999999998653221 111111 111112345789 Q ss_pred CeEEEEEeeccCCccccC Q lcl|Aclame:pro 313 GIPVQRTDALLNTESRVV 330 (330) Q Consensus 313 gvpir~~dal~~tE~~Vv 330 (330) |+||..++++...-.-++ T Consensus 200 G~~Vi~s~~~p~~t~~~~ 217 (272) T protein:vir:30 200 GVQIVRSRKCPKGTAYMV 217 (272) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999999975443333 No 110 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=97.58 E-value=2.5e-06 Score=51.30 Aligned_cols=233 Identities=16% Similarity=0.137 Sum_probs=120.2 Q ss_pred CCccccccccHHH----------HHhhcCc------ccch------H----HHHHHHhccch---hHhhcceeeccCCcc Q lcl|Aclame:pro 1 MATLSTNNPTMAD----------VAKRLDP------NGKV------D----IIVEMLNQTNP---VLQDMTAIEGNLPTG 51 (330) Q Consensus 1 M~~~~~~a~TL~E----------~Ak~~~~------d~~~------~----~VIE~l~~~s~---iL~~lpf~e~n~g~~ 51 (330) |+- .+.+++.+ +.|.+.. ++.. . 
.-|-.|+..++ ++.+++-..+ ..|- T Consensus 1 ~~~--~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a-~sTv 77 (462) T protein:vir:96 1 MHK--DTNLTAEQNKYADKFQEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPA-QSTV 77 (462) T ss_pred Ccc--ccccchhhhhhhchhhHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchh-hhhh Confidence 442 12233322 1111111 1100 0 00111111111 1223332222 1234 Q ss_pred ceeEEEeccCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 52 HRTSVRTGLPT---PTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQT 128 (330) Q Consensus 52 ~~~~~~~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~ 128 (330) |+|.+...--+ ..|-.=-.-.+-+.+++.+++..++.|+..-.|+-+.--.++ ..+-.++|.+..|..+.++++.. T Consensus 78 ~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~-~~d~~~~~~~dai~~~a~tiE~a 156 (462) T protein:vir:96 78 QKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNN-IQDPMQILTEDAIAVVAKTIEWA 156 (462) T ss_pred hhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccc-hhhHHHHHHHHHHHHHHHHHHHH Confidence 66655554443 444222223456678999999999999998888876544444 55556889999999999999999 Q ss_pred hccCCCCcCh------hhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceecccccc Q lcl|Aclame:pro 129 LFYGNDGIAP------AEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQV 202 (330) Q Consensus 129 ~iyGD~~~~p------~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~ 202 (330) .||||+...| -+||||.+.. +++|||||.|. ..|+-++-|.. 
T Consensus 157 ~Fygds~l~~~~~~~gleFDGl~~lI------~~~NViDarG~---~Ls~~~ln~aa----------------------- 204 (462) T protein:vir:96 157 SFYGDASLTADPTGQGLEFDGLAKLI------DKDNVIDAKGE---SLTETLLNRSA----------------------- 204 (462) T ss_pred HhhhhcccCCCccccccchhhhhhhc------CCCceeecCCC---CccHHHHhhhh----------------------- Confidence 9999999999 7899998865 68899999972 11211111111 Q ss_pred ccccccccCCceeEEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHH Q lcl|Aclame:pro 203 TIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNL 282 (330) Q Consensus 203 ~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v 282 (330) +||- .--.--+-+||+--+ T Consensus 205 ---------------------------------~~i~----------------------------~~fGt~TD~~~p~~v 223 (462) T protein:vir:96 205 ---------------------------------VLIG----------------------------KSFGTATDAYMPIGV 223 (462) T ss_pred ---------------------------------hhcc----------------------------cccCChhheecchHH Confidence 1110 000001235555555 Q ss_pred HHHHHHHhhccccceeeeccc-------CCcceEEECCeEEE-----EEeeccCCccccC Q lcl|Aclame:pro 283 REKLRLGIVDKIANNLTWETV-------SGERVMTFDGIPVQ-----RTDALLNTESRVV 330 (330) Q Consensus 283 ~~~L~~q~~~~~~~~l~~~~~-------~g~~v~~~~gvpir-----~~dal~~tE~~Vv 330 (330) ...|.-+...+..+.+..+.- ..+.+++-+.|.+. .-+++++-|.... 
T Consensus 224 ~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~ 283 (462) T protein:vir:96 224 HADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPL 283 (462) T ss_pred HHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccC Confidence 555555555444433333322 12233333333333 4566665555433 No 111 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=97.55 E-value=1.7e-05 Score=46.72 Aligned_cols=210 Identities=14% Similarity=0.093 Sum_probs=117.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE-EeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV-RTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~-~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ...+..+..+ + .--+-|......||+.+.+.++|++.+.+....+ ..+.+ -.+.++++|..=++..++++.++ T Consensus 115 ~~a~~~~~~~--~-gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f 188 (387) T protein:vir:26 115 LHALPTGNDS--G-GDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVETAKELKAKG 188 (387) T ss_pred HhhhccCCCC--C-CceeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcccccccccccccccccc Confidence 1111000000 0 0001223345569999999999999888865532 22333 33567899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQ-TLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~-~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) .+++-..+-+++.+.|.+.|.+.. .+..++-..+ ..++++..-.. 
.|..|+ T Consensus 189 ~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~------------------------ 241 (387) T protein:vir:26 189 DTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP------------------------ 241 (387) T ss_pred ceeeechheeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC------------------------ Confidence 999999999999999999988764 4666554433 33455443222 222222 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEE-ccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSI-YPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~i-ypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) |+|.. +++ +.. ++ T Consensus 242 ------g~g~~--------------~g~~~~~----~~------------------------------------------ 255 (387) T protein:vir:26 242 ------KSGLE--------------HMSFYNG----SV------------------------------------------ 255 (387) T ss_pred ------Ccccc--------------ceeeecc----cc------------------------------------------ Confidence 11100 111 000 00 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) ..+ . ..+.++.++.+++.++.....+.+||||+.....|.....+.. ..+.. | ....+.|.|| T Consensus 256 --~~~-----~----~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~-~~~~~----~-~~~~llG~PV 318 (387) T protein:vir:26 256 --KEV-----E----GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT-TNFFD----T-PAEKVFGKPV 318 (387) T ss_pred --ccc-----c----ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCC-Ccccc----c-CCccccccce Confidence 000 0 0112233344456666555567789999887766665544433 33321 1 2235679999 Q ss_pred EEEeeccCCccccC Q lcl|Aclame:pro 317 QRTDALLNTESRVV 330 (330) Q Consensus 317 r~~dal~~tE~~Vv 330 (330) ..+|... 
.+| T Consensus 319 ~~~~~~~----~~~ 328 (387) T protein:vir:26 319 VFTDAAV----KPI 328 (387) T ss_pred EEecCCC----cee Confidence 9998643 233 No 112 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=97.55 E-value=1.7e-05 Score=46.72 Aligned_cols=210 Identities=14% Similarity=0.093 Sum_probs=117.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE-EeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV-RTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~-~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ...+..+..+ + .--+-|......||+.+.+.++|++.+.+....+ ..+.+ -.+.++++|..=++..++++.++ T Consensus 115 ~~a~~~~~~~--~-gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f 188 (387) T protein:vir:94 115 LHALPTGNDS--G-GDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVETAKELKAKG 188 (387) T ss_pred HhhhccCCCC--C-CceeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcccccccccccccccccc Confidence 1111000000 0 0001223345569999999999999888865532 22333 33567899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQ-TLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~-~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) .+++-..+-+++.+.|.+.|.+.. .+..++-..+ ..++++..-.. 
.|..|+ T Consensus 189 ~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~------------------------ 241 (387) T protein:vir:94 189 DTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP------------------------ 241 (387) T ss_pred ceeeechheeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC------------------------ Confidence 999999999999999999988764 4666554433 33455443222 222222 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEE-ccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSI-YPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~i-ypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) |+|.. +++ +.. ++ T Consensus 242 ------g~g~~--------------~g~~~~~----~~------------------------------------------ 255 (387) T protein:vir:94 242 ------KSGLE--------------HMSFYNG----SV------------------------------------------ 255 (387) T ss_pred ------Ccccc--------------ceeeecc----cc------------------------------------------ Confidence 11100 111 000 00 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) ..+ . ..+.++.++.+++.++.....+.+||||+.....|.....+.. ..+.. | ....+.|.|| T Consensus 256 --~~~-----~----~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~-~~~~~----~-~~~~llG~PV 318 (387) T protein:vir:94 256 --KEV-----E----GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT-TNFFD----T-PAEKVFGKPV 318 (387) T ss_pred --ccc-----c----ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCC-Ccccc----c-CCccccccce Confidence 000 0 0112233344456666555567789999887766665544433 33321 1 2235679999 Q ss_pred EEEeeccCCccccC Q lcl|Aclame:pro 317 QRTDALLNTESRVV 330 (330) Q Consensus 317 r~~dal~~tE~~Vv 330 (330) ..+|... 
.+| T Consensus 319 ~~~~~~~----~~~ 328 (387) T protein:vir:94 319 VFTDAAV----KPI 328 (387) T ss_pred EEecCCC----cee Confidence 9998643 233 No 113 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=97.55 E-value=1.7e-05 Score=46.72 Aligned_cols=210 Identities=14% Similarity=0.093 Sum_probs=117.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEE-EeccCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSV-RTGLPTPTWRKLYGGVLPNKSST 79 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~-~~~lP~~~fR~lN~g~~~s~~t~ 79 (330) ...+..+..+ + .--+-|......||+.+.+.++|++.+.+....+ ..+.+ -.+.++++|..=++..++++.++ T Consensus 115 ~~a~~~~~~~--~-gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f 188 (387) T protein:vir:96 115 LHALPTGNDS--G-GDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVETAKELKAKG 188 (387) T ss_pred HhhhccCCCC--C-CceeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcccccccccccccccccc Confidence 1111000000 0 0001223345569999999999999888865532 22333 33567899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHHhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCcChhhccChhhhhcccccCCcc Q lcl|Aclame:pro 80 AQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQ-TLFYGNDGIAPAEFTGLSPRYNSLSAENKD 157 (330) Q Consensus 80 ~~~~~~l~ilgg~~eVDk~la~~~-g~~~~~ra~e~~~~ika~~~~~~~-~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~ 157 (330) .+++-..+-+++.+.|.+.|.+.. .+..++-..+ ..++++..-.. 
.|..|+ T Consensus 189 ~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~------------------------ 241 (387) T protein:vir:96 189 DTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP------------------------ 241 (387) T ss_pred ceeeechheeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC------------------------ Confidence 999999999999999999988764 4666554433 33455443222 222222 Q ss_pred eeeecCCCCCCceEEEEEEeCCCcEEEE-ccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 158 NVIDAGGTGSDNASAWLVVWGPNTCHSI-YPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 158 ~vidAGgtg~~~tSi~~V~~g~~~~~~i-ypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) |+|.. +++ +.. ++ T Consensus 242 ------g~g~~--------------~g~~~~~----~~------------------------------------------ 255 (387) T protein:vir:96 242 ------KSGLE--------------HMSFYNG----SV------------------------------------------ 255 (387) T ss_pred ------Ccccc--------------ceeeecc----cc------------------------------------------ Confidence 11100 111 000 00 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEECCeEE Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPV 316 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpi 316 (330) ..+ . ..+.++.++.+++.++.....+.+||||+.....|.....+.. ..+.. | ....+.|.|| T Consensus 256 --~~~-----~----~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~-~~~~~----~-~~~~llG~PV 318 (387) T protein:vir:96 256 --KEV-----E----GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT-TNFFD----T-PAEKVFGKPV 318 (387) T ss_pred --ccc-----c----ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCC-Ccccc----c-CCccccccce Confidence 000 0 0112233344456666555567789999887766665544433 33321 1 2235679999 Q ss_pred EEEeeccCCccccC Q lcl|Aclame:pro 317 QRTDALLNTESRVV 330 (330) Q Consensus 317 r~~dal~~tE~~Vv 330 (330) ..+|... 
.+| T Consensus 319 ~~~~~~~----~~~ 328 (387) T protein:vir:96 319 VFTDAAV----KPI 328 (387) T ss_pred EEecCCC----cee Confidence 9998643 233 No 114 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.54 E-value=2e-06 Score=51.80 Aligned_cols=256 Identities=13% Similarity=0.123 Sum_probs=131.2 Q ss_pred CCccccccccHHHHH--------------hhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCccee Q lcl|Aclame:pro 1 MATLSTNNPTMADVA--------------KRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWR 66 (330) Q Consensus 1 M~~~~~~a~TL~E~A--------------k~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR 66 (330) ++......+|-.|.. --+-|......|+|.+.+.++|+..+.+.... + ..++.+.++-|++.|. T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~~~~~~~~~~a~w~ 136 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAVWG 136 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-c-ceEEEEecCCcceeEe Confidence 111111111111100 00122335567999999999999999876653 3 3678888999999999 Q ss_pred ecCCccC-cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccCh Q lcl|Aclame:pro 67 KLYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGL 144 (330) Q Consensus 67 ~lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL 144 (330) ...+..+ ++++++.+++-..+-|.+...|.+.|.+..+ |..+|-..+ ..++++......||+||-...|.++. 
T Consensus 137 ~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~---la~~~a~~~~~a~i~G~G~~qP~Gil-- 211 (377) T protein:vir:98 137 DIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQ---LKEAIAVALELAIVKGDGLLQPVGLL-- 211 (377) T ss_pred ecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHH---HHHHHHHHHhhceEeccCCCcceeee-- Confidence 9988876 4678999999999999999999999988755 666665544 55899999999999999766666542 Q ss_pred hhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeee Q lcl|Aclame:pro 145 SPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWD 224 (330) Q Consensus 145 ~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~ 224 (330) . .++ ..+..+. ++. ...+.+|- .+.-+..+.+... T Consensus 212 -~---~~~----~~~~~~~-~~~-------------~~~~~~~~------------------~~~~~~l~~~~~~----- 246 (377) T protein:vir:98 212 -K---DLS----QPTVDQS-TGR-------------DITTYKTD------------------KEAIADLSDLTPD----- 246 (377) T ss_pred -e---ccc----ccccccc-ccc-------------ccccccch------------------hhhHhhhhhhchh----- Confidence 1 110 0000000 000 00011110 0000000000000 Q ss_pred eeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccc-c-------- Q lcl|Aclame:pro 225 IGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKI-A-------- 295 (330) Q Consensus 225 ~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~-~-------- 295 (330) .| .....-++..+....-+-++-..|+++|.||.+-.-.+.-+..... 
+ T Consensus 247 --------~~--------------~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg 304 (377) T protein:vir:98 247 --------NA--------------PKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLP 304 (377) T ss_pred --------HH--------------HHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccC Confidence 00 0001111111111112224567889999999864332221110000 0 Q ss_pred ---ceeeecccCCcce----------EEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 296 ---NNLTWETVSGERV----------MTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 296 ---~~l~~~~~~g~~v----------~~~~gvpir~~dal~~tE~~Vv 330 (330) ..+..+..+...+ ....|+-|.++|.....|..++ T Consensus 305 ~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:98 305 HGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) T ss_pred CCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhcCceE Confidence 0001111111111 1223555555555555454444 No 115 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=97.47 E-value=1.3e-05 Score=47.44 Aligned_cols=250 Identities=15% Similarity=0.160 Sum_probs=125.9 Q ss_pred CCccccccccHHHHH------hh-------cCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee Q lcl|Aclame:pro 1 MATLSTNNPTMADVA------KR-------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) Q Consensus 1 M~~~~~~a~TL~E~A------k~-------~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~ 67 (330) +.......+|-.|.. +. +-|......|+|.|.+.|||+..+.++... ..+...+.++-|.+.|.. 
T Consensus 57 ~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~--~~~~i~~~~~~~~a~W~~ 134 (381) T protein:vir:10 57 SLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) T ss_pred HhcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecC--cceEEEeecCCcceEEee Confidence 111112222332211 11 123345678999999999999999887653 346778888899999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChh Q lcl|Aclame:pro 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) Q Consensus 68 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~ 145 (330) ...+.+ ++.+++.+++-.++-|.+.+.|.+.|.+..+ +.++|-..+ ..++++......|++||-...|.+| - T Consensus 135 e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~---la~~~a~~~~~afi~GdG~~qP~Gi---l 208 (381) T protein:vir:10 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQ---IEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) T ss_pred cccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHH---HHHHHHHHhhceeEecccCCCceee---e Confidence 988876 5578999999999999999999999988755 666665544 5589999999999999976666544 1 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) . ...+....-.|.+. ..++.-.+.+- |. 
...+ + T Consensus 209 ~------~~~~~~~~~~g~~~-~~~~~~~~t~~-------------------~~-~~~~---~----------------- 241 (381) T protein:vir:10 209 R------QVQKGVSVTDGAYP-EKEEQGTLTFA-------------------NP-RATV---N----------------- 241 (381) T ss_pred e------cCCccccccccccc-ccccccccccc-------------------ch-hhHH---H----------------- Confidence 1 11111111111110 10111000000 00 0000 0 Q ss_pred eeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhh--ccccceee---- Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIV--DKIANNLT---- 299 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~--~~~~~~l~---- 299 (330) ...+++..+..-...-+....++..|.||+.-.-.|+.... +....+++ T Consensus 242 -------------------------~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp~ 296 (381) T protein:vir:10 242 -------------------------ELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPF 296 (381) T ss_pred -------------------------HHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCCC Confidence 00000000000000000011122333344333322221110 00000000 Q ss_pred ------eccc------CCc----ceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 300 ------WETV------SGE----RVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 300 ------~~~~------~g~----~v~~~~gvpir~~dal~~tE~~Vv 330 (330) .+.. .|. 
.+....|+-|+++|....+|..++ T Consensus 297 g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~ 343 (381) T protein:vir:10 297 NLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDL 343 (381) T ss_pred CceeEEcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceE Confidence 0000 111 123345677777777777776666 No 116 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=97.14 E-value=2.8e-06 Score=50.99 Aligned_cols=296 Identities=19% Similarity=0.191 Sum_probs=124.7 Q ss_pred CCccccccccH----HHHHhhcC-----------------cccchHHHHHHHhccch---hHhhcceeeccCCccceeEE Q lcl|Aclame:pro 1 MATLSTNNPTM----ADVAKRLD-----------------PNGKVDIIVEMLNQTNP---VLQDMTAIEGNLPTGHRTSV 56 (330) Q Consensus 1 M~~~~~~a~TL----~E~Ak~~~-----------------~d~~~~~VIE~l~~~s~---iL~~lpf~e~n~g~~~~~~~ 56 (330) |..--+.+.+| .|+.|.+. ...+.. -|-.|+..+. ++.+++-..+ ..|-|+|.+ T Consensus 1 ~~~~~n~~~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~-~i~~Lt~~~~~f~f~~di~k~~a-~STV~~y~~ 78 (464) T protein:vir:80 1 MTEKKNTERQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDD-QITMLTWADGDLSFYRDITKRPA-TSTVAKYDV 78 (464) T ss_pred CCcchhhHhhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhh-hhheeeecccchhhhhhcCCchh-hhhhhhhhe Confidence 32221111111 11111111 001111 1111222111 2333433222 123455555 Q ss_pred EeccCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|Aclame:pro 57 RTGLPT---PTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGN 133 (330) Q Consensus 57 ~~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD 133 (330) ...--+ ..|-.=-.-.+-+.+++.+++..++.|.+-=.|+.+.--.+. 
..+-..+|.+..|..+.++++...|||| T Consensus 79 ~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~-~~d~~~~~~~dai~~va~tiE~a~FyGd 157 (464) T protein:vir:80 79 YLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNN-IEDPMRILTDDAISVVAKTIEWASFYGD 157 (464) T ss_pred eeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcc-hhhHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 444433 444222223455678999999999999998888766533333 3455568888899999999999999999 Q ss_pred CCcChh-------hccChhhhhcccccCCcceeeecCCCCCCceEEE----EEEeCCCcEEEEc-cccccccc--eeccc Q lcl|Aclame:pro 134 DGIAPA-------EFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAW----LVVWGPNTCHSIY-PKGSKAGL--SVEDK 199 (330) Q Consensus 134 ~~~~p~-------~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~----~V~~g~~~~~~iy-pkg~kagl--~~~D~ 199 (330) +...|. +||||.+.. ++.|||||.|..-.-.-|+ .|.-+-++..-+| |-|-|+-+ ++.+. T Consensus 158 s~l~~~~~~~~gleFDGl~~lI------~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~ 231 (464) T protein:vir:80 158 SDLSENPDAGSGLEFDGLAKLI------DKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDR 231 (464) T ss_pred cccCCCCCCccccchhhhHhhc------CCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCc Confidence 999875 899999866 6889999998662211121 2333444444455 66666554 33333 Q ss_pred cccccccccccCCceeEE-----EEE---Ee----------------------------------------eeeeeEEec Q lcl|Aclame:pro 200 GQVTIENADGNGGRMEGY-----RTH---YK----------------------------------------WDIGLTLRD 231 (330) Q Consensus 200 g~~~~~~~d~~gg~~~~y-----~~~---~~----------------------------------------w~~Gl~v~d 231 (330) ..... .|-+++.-.|| .+. 
++ |+-..-=.+ T Consensus 232 q~~~~--~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~ 309 (464) T protein:vir:80 232 QVQVI--SDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTID 309 (464) T ss_pred eeEEE--cCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccce Confidence 32222 12111112222 111 00 000000011 Q ss_pred cccEEEeecccccccccc------hhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCC Q lcl|Aclame:pro 232 WRYVARVCNIDVSDLATS------ANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG 305 (330) Q Consensus 232 ~r~v~RI~NId~~~l~~~------~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g 305 (330) ..|.++.||=|-...+.+ ++.++.+++-+ -++.+-..+| .-+.++.+..++ T Consensus 310 ~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~i----t~~~~~~~~p-------------------~yv~IYR~~~~~ 366 (464) T protein:vir:80 310 TEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLEI----TINNMYQARP-------------------QYVAIYRKGLET 366 (464) T ss_pred eEEEEEEECCCCccccceeeeeeecCcccEEEEEE----EeCCcccccc-------------------ceEEEEeecCCC Confidence 123333332221111100 00000000000 0000000000 001111111111 Q ss_pred cceEEECCeEEE-----------EEeeccCCccccC Q lcl|Aclame:pro 306 ERVMTFDGIPVQ-----------RTDALLNTESRVV 330 (330) Q Consensus 306 ~~v~~~~gvpir-----------~~dal~~tE~~Vv 330 (330) ...-.+..||+. 
.++-|..|-..+| T Consensus 367 g~f~~i~rv~~~~~~~gt~t~vD~n~~IPgt~~vfV 402 (464) T protein:vir:80 367 GLFYQIARVPASKAVEGVITFIDVNDEIPETADVFV 402 (464) T ss_pred CceeEEEEEeeccccCCceEEEecccccCCceeEee Confidence 111111222222 2334555555555 No 117 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=97.07 E-value=6.6e-06 Score=48.97 Aligned_cols=231 Identities=16% Similarity=0.081 Sum_probs=111.0 Q ss_pred CCc-cccccccHHHHHhhcCcccchHHHHHHHhccc---hhHhhcceeeccCCccceeEEEeccCCc---ce-eecCCcc Q lcl|Aclame:pro 1 MAT-LSTNNPTMADVAKRLDPNGKVDIIVEMLNQTN---PVLQDMTAIEGNLPTGHRTSVRTGLPTP---TW-RKLYGGV 72 (330) Q Consensus 1 M~~-~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s---~iL~~lpf~e~n~g~~~~~~~~~~lP~~---~f-R~lN~g~ 72 (330) |-+ -.+.--+|...|. +.-..+.+. +-.|+..+ .++.+|+-..+ ..|-|+|.+...--+. .| +.+. =. T Consensus 45 ~t~gy~~~~~~~t~gaA-lR~EsLd~~-l~~Lt~~~~~ftf~~~i~k~~a-~STV~ey~~~~~~G~~G~~~f~~E~g-i~ 120 (514) T protein:vir:10 45 FTAGHSITPDTQTDGAA-NRIESLNRD-LKVTTWGERDFTLYNDIAKQPV-DNTVLKYTQYYSHGRTGHSLFQPEIG-IG 120 (514) T ss_pred hccccccCCccccCccc-hhhhhhccc-eeEeeecCcchhhhhhcCCchh-hHHHhhhhhhcccCcccccccccccc-cC Confidence 000 0000001111010 000011110 11111111 12344443333 2334555554444333 33 2222 22 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcCh------hhccChhh Q lcl|Aclame:pro 73 LPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAP------AEFTGLSP 146 (330) Q Consensus 73 ~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p------~~F~GL~~ 146 (330) +-+.++..+++...+.+.....|-..+--.++ ..+-.+.+.+..|..+.++++..+||||+...| -+||||.+ T Consensus 121 ~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~-i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~ 199 (514) T protein:vir:10 121 DVNNPNERQRTINIKYIVDTHVTSIALQRANT-IVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFK 199 (514) T ss_pred 
cCCCcceEEEEEeeeeeeeeeeeeehhhhccc-hhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHH Confidence 34568899999999999999999888776664 445556666888999999999999999999888 78999998 Q ss_pred hhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeee Q lcl|Aclame:pro 147 RYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIG 226 (330) Q Consensus 147 R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~G 226 (330) .+ ++.|+|||.|.- .|+-++-|.. +.+. .+.|.+ T Consensus 200 lI------~~~NvIDarG~~---Ls~~~ln~aA----------------------~~i~--~gfGt~------------- 233 (514) T protein:vir:10 200 LI------APENHIDLRGGR---LSPAALNMAA----------------------RKIG--EGFGTP------------- 233 (514) T ss_pred hh------cCCCeEecCCCC---ccHHHHhhhh----------------------hhhh--cccCCh------------- Confidence 76 588999998762 2222221111 0000 000000 Q ss_pred eEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeec----- Q lcl|Aclame:pro 227 LTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWE----- 301 (330) Q Consensus 227 l~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~----- 301 (330) +-+||+--+...|.-+..++..+.+.-+ T Consensus 234 -----------------------------------------------TD~ylp~~vka~f~~~~~~~qRV~~~~n~~~~~ 266 (514) T protein:vir:10 234 -----------------------------------------------TDAYMPIGIKADFVNQHLNGQRVMLPGQTGGMT 266 (514) T ss_pred -----------------------------------------------hheeCchHHHHHHhhcccCcceEEeecCcccee Confidence 0122222222222222222111111111 Q ss_pred --ccCCcceE-----EECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 302 --TVSGERVM-----TFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 302 --~~~g~~v~-----~~~gvpir~~dal~~tE~~Vv 330 (330) -.-.+.++ .++|=-|+..+++|+ |.+.| T Consensus 267 ~G~~v~~f~s~~G~I~L~gs~im~~~n~L~-~~~~~ 301 (514) T protein:vir:10 267 TGLDIDKFLSAHGSIRIQGSTIMDSDNKLD-FDRPV 301 (514) T ss_pred eeeeccceeEeccceeecCCeeecccccCc-cCCcc Confidence 00111111 345555777788887 55555 No 
118 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=96.53 E-value=0.00024 Score=40.43 Aligned_cols=238 Identities=13% Similarity=0.077 Sum_probs=118.6 Q ss_pred cccccccHHHHHh--hcCcccchHHH--HHHHh-------ccc---hhHhhcceeeccCCccceeEEEecc-CCcceeec Q lcl|Aclame:pro 4 LSTNNPTMADVAK--RLDPNGKVDII--VEMLN-------QTN---PVLQDMTAIEGNLPTGHRTSVRTGL-PTPTWRKL 68 (330) Q Consensus 4 ~~~~a~TL~E~Ak--~~~~d~~~~~V--IE~l~-------~~s---~iL~~lpf~e~n~g~~~~~~~~~~l-P~~~fR~l 68 (330) ||...+--+|-|- -+...+...+- +|-|. ..+ .++.+|+-..+ ..|-|+|.+...- -.+++-.+ T Consensus 1 ~~~~~~~~~~~a~~~al~~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a-~STV~ey~~~~~rhG~~g~s~~ 79 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNAAGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKA-KAYEHEYNVVTARHDKIGYAAF 79 (470) T ss_pred CChhHhhhhhHHHHHHHHHhhhcchhhhhhhhccceeEeeecCccchhhhhcCCchh-hhHhhhhhhhccccccccceee Confidence 4454443332221 11111111111 11111 111 12233332222 2345677654432 22232222 Q ss_pred CCc--cCcccceEEEEEEEEEEecchhhhhHH-HHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--------C Q lcl|Aclame:pro 69 YGG--VLPNKSSTAQVTDNCGMLEAYAEVDKA-LADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGI--------A 137 (330) Q Consensus 69 N~g--~~~s~~t~~~~~~~l~ilgg~~eVDk~-la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~--------~ 137 (330) +++ .+-+.++..+++..++.|+.-.+|... +..+..+..+....+.+..|-.+.++++..+||||+.. 
+ T Consensus 80 ~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~ 159 (470) T protein:vir:10 80 REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPN 159 (470) T ss_pred cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccC Confidence 332 344578999999999999999999876 44555666677788888999999999999999999844 4 Q ss_pred hhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEE Q lcl|Aclame:pro 138 PAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGY 217 (330) Q Consensus 138 p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y 217 (330) +-+||||.+-. ...++.|+|||.|.- .|+-++.|+...+ - ..++| T Consensus 160 gleFDGl~~lI---d~~~~~NViDarG~~---Ls~~~L~~aa~~I----------------------~-------~~~~f 204 (470) T protein:vir:10 160 NLQQDGIINII---KRGAPQNVLDAGGRP---LSIDLLWEAESRV----------------------V-------STQAF 204 (470) T ss_pred ceeccchhhhc---cCCCCccccccCCCC---ccHHHHHHHHhhh----------------------c-------ccccc Confidence 45899998754 234577899998543 2332222222111 0 00001 Q ss_pred EEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccce Q lcl|Aclame:pro 218 RTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANN 297 (330) Q Consensus 218 ~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~ 297 (330) ++ -|-+||+--+...|.-+...+..+. 
T Consensus 205 Gt-----------------------------------------------------~TD~~lp~~vka~f~~~~~~~qRv~ 231 (470) T protein:vir:10 205 AN-----------------------------------------------------PTAVFISYVDKLNLQASFYQISRVM 231 (470) T ss_pred cC-----------------------------------------------------hhhhccchhHHHHHHHhhcCceEEE Confidence 00 0224555555555555555544444 Q ss_pred eeecccC-------CcceEEE-----CCeEEEE-----EeeccCCccccC Q lcl|Aclame:pro 298 LTWETVS-------GERVMTF-----DGIPVQR-----TDALLNTESRVV 330 (330) Q Consensus 298 l~~~~~~-------g~~v~~~-----~gvpir~-----~dal~~tE~~Vv 330 (330) +..+.-. .+.+++- +|=-+.. -.++++.|..=+ T Consensus 232 ~~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~ 281 (470) T protein:vir:10 232 TTADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVGDF 281 (470) T ss_pred EecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCcccCCc Confidence 4433221 2222222 2222222 134444442211 No 119 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=96.44 E-value=3.5e-05 Score=45.04 Aligned_cols=305 Identities=17% Similarity=0.162 Sum_probs=118.9 Q ss_pred CCcc-------ccccccHHHHHhh------------------cCcccchHHHHHHHhccch---hHhhcceeeccCCccc Q lcl|Aclame:pro 1 MATL-------STNNPTMADVAKR------------------LDPNGKVDIIVEMLNQTNP---VLQDMTAIEGNLPTGH 52 (330) Q Consensus 1 M~~~-------~~~a~TL~E~Ak~------------------~~~d~~~~~VIE~l~~~s~---iL~~lpf~e~n~g~~~ 52 (330) ||-- |...=...|.+.+ +....+... |-.|+..+. 
++.+++-..+ ..|-| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~-i~~L~~~~~~f~~~~di~k~~a-~stv~ 78 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQ-ISMLTWTENDLTFYKDIAKKPA-TSTVA 78 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhh-hheeeecccchhhhhhcccchh-hhhhh Confidence 3310 0000111111110 111111111 111111111 2333333223 23446 Q ss_pred eeEEEeccCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 53 RTSVRTGLPT---PTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTL 129 (330) Q Consensus 53 ~~~~~~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~ 129 (330) +|.+...--+ ..|-.=-.-.+-+.+++.+++..++.|+.--+|--+.-..+ +..+-.++|.+..|..+.++++..+ T Consensus 79 ~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n-~i~d~~~~~~~~ai~~~a~tiE~a~ 157 (468) T protein:vir:63 79 KYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWAS 157 (468) T ss_pred hheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhc-chhhHHHHHHHHHHHHHHHHHHHHh Confidence 6665554444 34422222345567899999999999999888776655543 3556668999999999999999999 Q ss_pred ccCCCCcChh-------hccChhhhhcccccCCcceeeecCCCCCCceEEE----EEEeCCCcEEEEc-cccccccceec Q lcl|Aclame:pro 130 FYGNDGIAPA-------EFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAW----LVVWGPNTCHSIY-PKGSKAGLSVE 197 (330) Q Consensus 130 iyGD~~~~p~-------~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~----~V~~g~~~~~~iy-pkg~kagl~~~ 197 (330) ||||+..++. +||||.+-. ++.|+||+.|..-...-|+ +++-|-+...-+| |-|-|+-|+-. 
T Consensus 158 FyGds~l~~s~~~~~glqfDGi~~li------~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~ 231 (468) T protein:vir:63 158 FFGDSDLSDSPEPQAGLEFDGLAKLI------NQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQ 231 (468) T ss_pred hhcccccccCCCccccccccceeEEe------cCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhh Confidence 9999988553 899998754 5679999998763322221 2333334444444 77777666333 Q ss_pred cccccccccccccCCceeEEEE-EEeeeeeeEEeccccEE-Eeecc---cccccccchhHHHHHHHHHHHHHhccCCCCC Q lcl|Aclame:pro 198 DKGQVTIENADGNGGRMEGYRT-HYKWDIGLTLRDWRYVA-RVCNI---DVSDLATSANAQALIKYMIMAAERIPQLGMG 272 (330) Q Consensus 198 D~g~~~~~~~d~~gg~~~~y~~-~~~w~~Gl~v~d~r~v~-RI~NI---d~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g 272 (330) -+.++..--.+-++....|+.. .|..--|. |+-.+++. +=.|| |+...++..+..++..- ...-.-+....+ T Consensus 232 ~L~~q~~v~~~n~~~~~~G~~v~g~~sa~G~-I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT--~~~~~~g~~~~~ 308 (468) T protein:vir:63 232 QLSKQTQLVRDNGNNVSVGFNIQGFHSARGF-IKLHGSTVMENEQILDERILALPTAPQPAKVTAT--QEAGKKGQFRAE 308 (468) T ss_pred hcCceEEEEcCCCCceeeeecccceecceee-eeecCceeeccccCCCcccccccccccCCcccee--eecccCCcccCC Confidence 3333222112211222222211 01111110 01011111 11111 00000000000000000 000000000000 Q ss_pred C--EEEE----eChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE-eeccC-CccccC Q lcl|Aclame:pro 273 R--AVWY----MNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT-DALLN-TESRVV 330 (330) Q Consensus 273 ~--~~~y----~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~-dal~~-tE~~Vv 330 (330) - .+-| ++..= +-..+. .+..+.... -+|+-+..+ .++.. +..-|. 
T Consensus 309 ~~a~y~Y~v~~vs~~G----ES~pS~--~vtvTVaa~-------~dg~~ltIt~~~~~~~~p~yv~ 361 (468) T protein:vir:63 309 DLAAHEYKVVVSSDDA----ESIASE--VATATVTAK-------DDGVKLEIELAPMYSSRPQFVS 361 (468) T ss_pred CcceEEEEEEEECCCC----cccccc--ceEEEecCc-------ccceeEEEEecCCCCCcceEEE Confidence 0 0111 11100 000000 000010000 011111111 22222 111111 No 120 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=96.41 E-value=0.00052 Score=38.61 Aligned_cols=229 Identities=17% Similarity=0.166 Sum_probs=110.8 Q ss_pred CCcc---cccc-c-cHHH--HHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceeecCCccC Q lcl|Aclame:pro 1 MATL---STNN-P-TMAD--VAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVL 73 (330) Q Consensus 1 M~~~---~~~a-~-TL~E--~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~ 73 (330) +... +... . .+.+ +-.-..+......+...+.+.+++++..+.... + .......+.-..+.|..-+...| T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i--~-~~~~~~~~~~~~a~~~~eG~~kp 302 (517) T protein:vir:97 226 SASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL--P-TLVVGGDNALTQGTGHTTGTDKT 302 (517) T ss_pred HhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeeccc--c-ceeeecccccceeeeeecCCccc Confidence 0000 0000 0 0000 000011222233455666666777666654221 1 11111222223456777777788 Q ss_pred cccceEEEEEEEEEEecchhhhhHHHHHhCC--CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccc Q lcl|Aclame:pro 74 PNKSSTAQVTDNCGMLEAYAEVDKALADLNG--NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSL 151 (330) Q Consensus 74 ~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g--~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~ 151 (330) ++..++.+++...+.+.+.+.+.+.+.+... 
+..++...-......+++.+.+.+|++||- T Consensus 303 ~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdG----------------- 365 (517) T protein:vir:97 303 ESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV----------------- 365 (517) T ss_pred ccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccC----------------- Confidence 8899999999999999999999998765432 222344444455678899999999999973 Q ss_pred ccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEec Q lcl|Aclame:pro 152 SAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRD 231 (330) Q Consensus 152 t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d 231 (330) +|.+.+ +++|... +.. . T Consensus 366 -------------tg~~~~-------------gi~~~a~-----------------~~~-----------------~--- 382 (517) T protein:vir:97 366 -------------TGVSET-------------QIYPVVG-----------------DAW-----------------A--- 382 (517) T ss_pred -------------CCcccc-------------ccccccc-----------------ccc-----------------c--- Confidence 111111 1111100 000 0 Q ss_pred cccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCCcceEEE Q lcl|Aclame:pro 232 WRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTF 311 (330) Q Consensus 232 ~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~ 311 (330) ..........|++..+..|+..-+ ...|+||+.....|++ .+|....++-..-..+..+... 
T Consensus 383 ------------~~~~~~~~~~d~i~~l~~a~~~a~-----~a~~vmn~~t~~~I~k-lKD~~G~Yl~~~~~~~~~~~~l 444 (517) T protein:vir:97 383 ------------TNVTGTTNIQELLEKLSVATPKAA-----DSTLVIHRNDLAAIRF-LKDKNGNYVFPVGVSNQTIATH 444 (517) T ss_pred ------------ccccccchHHHHHHHHHHHhhhcc-----CCEEEECHHHHHHHHH-hhcCCCCeeccCcCCccccccc Confidence 000001123456666666654322 3569999999999996 4666556665443333333222 Q ss_pred CCe----EEEEEeec-------------------------cC-----Ccccc---C Q lcl|Aclame:pro 312 DGI----PVQRTDAL-------------------------LN-----TESRV---V 330 (330) Q Consensus 312 ~gv----pir~~dal-------------------------~~-----tE~~V---v 330 (330) .|+ |..-.++. .| .|.++ | T Consensus 445 ~G~~~~~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i 500 (517) T protein:vir:97 445 FGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSL 500 (517) T ss_pred CCccccccccccCceeEeeccccEEEeecceeeeeeeecccCceeEeeeeeecccc Confidence 231 11111100 00 11111 1 No 121 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=96.39 E-value=4.1e-05 Score=44.66 Aligned_cols=305 Identities=17% Similarity=0.162 Sum_probs=118.8 Q ss_pred CCcc------cccc-ccHHHHHhhc-----------------CcccchHHHHHHHhccch---hHhhcceeeccCCccce Q lcl|Aclame:pro 1 MATL------STNN-PTMADVAKRL-----------------DPNGKVDIIVEMLNQTNP---VLQDMTAIEGNLPTGHR 53 (330) Q Consensus 1 M~~~------~~~a-~TL~E~Ak~~-----------------~~d~~~~~VIE~l~~~s~---iL~~lpf~e~n~g~~~~ 53 (330) ||-- |-.. -|--.+.|.+ ....+... |-.|+..+. 
++.+++-..+ ..|-|+ T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~-i~~Lt~~~~~f~~~~di~k~~a-~stv~~ 78 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQ-ISMLTWTENDLTFYKDIAKKPA-TSTVAK 78 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhh-hheeeccccchhhhhhcccchh-hhhhhh Confidence 3321 0000 0111111111 00111111 111111111 2333333223 234466 Q ss_pred eEEEeccCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 54 TSVRTGLPT---PTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLF 130 (330) Q Consensus 54 ~~~~~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~i 130 (330) |.+...--+ ..|-.=-.-.+-+.+++.+++..++.|+.--+|--+.-..+ +..+-.++|.+..|..+.++++..+| T Consensus 79 y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n-~i~d~~~~~~~~ai~~~a~tiE~a~F 157 (467) T protein:vir:80 79 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVN-NIQDPMQILTDDAIVNIAKTIEWASF 157 (467) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhc-chhhHHHHHHHHHHHHHHHHHHHHhh Confidence 665554444 34422222345567899999999999999888776655543 35566689999999999999999999 Q ss_pred cCCCCcChh-------hccChhhhhcccccCCcceeeecCCCCCCceEEE----EEEeCCCcEEEEc-cccccccceecc Q lcl|Aclame:pro 131 YGNDGIAPA-------EFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAW----LVVWGPNTCHSIY-PKGSKAGLSVED 198 (330) Q Consensus 131 yGD~~~~p~-------~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~----~V~~g~~~~~~iy-pkg~kagl~~~D 198 (330) |||+..++. +||||.+-. 
++.|+||+.|..-...-|+ +++-|-+...-+| |-|-|+-|+-.- T Consensus 158 yGds~l~~s~~~~~glqfDGi~~li------~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~ 231 (467) T protein:vir:80 158 FGDSDLSDSPEPQAGLEFDGLAKLI------NQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 231 (467) T ss_pred hcccccccCCCccccccccceeEEe------cCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhh Confidence 999988553 899998754 5679999998763322221 2333334444444 777776663333 Q ss_pred ccccccccccccCCceeEEEE-EEeeeeeeEEeccccEE-Eeecc---cccccccchhHHHHHHHHHHHHHhccCCCCCC Q lcl|Aclame:pro 199 KGQVTIENADGNGGRMEGYRT-HYKWDIGLTLRDWRYVA-RVCNI---DVSDLATSANAQALIKYMIMAAERIPQLGMGR 273 (330) Q Consensus 199 ~g~~~~~~~d~~gg~~~~y~~-~~~w~~Gl~v~d~r~v~-RI~NI---d~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~ 273 (330) +.++..--.+-++....|+.. .|..--|. |+-.+++. +=.|| |+...++..+..++..- ...-.-+....+- T Consensus 232 L~~q~~v~~~n~~~~~~G~~v~g~~sa~G~-I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT--~~~~~~g~~~~~~ 308 (467) T protein:vir:80 232 LSKQTQLVRDNGNNVSVGFNIQGFHSARGF-IKLHGSTVMENEQILDERILALPTAPQPAKVTAT--QEAGKKGQFRAED 308 (467) T ss_pred cCceEEEEcCCCCceeeeecccceecceee-eeecCceeeccccCCCcccccccccccCCcccee--eecccCCcccCCC Confidence 333222112211222222211 01111110 01011111 11111 00000000000000000 0000000000000 Q ss_pred --EEEE----eChHHHHHHHHHhhccccceeeecccCCcceEEECCeEEEEE-eeccC-CccccC Q lcl|Aclame:pro 274 --AVWY----MNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT-DALLN-TESRVV 330 (330) Q Consensus 274 --~~~y----~n~~v~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~-dal~~-tE~~Vv 330 (330) .+-| ++..= +-..+. .+..+.... -+|+-+..+ .++.. +..-|. 
T Consensus 309 ~a~y~Y~v~~vs~~G----ES~pS~--~vtvTVaa~-------~dg~~ltIt~~~~~~~~p~yv~ 360 (467) T protein:vir:80 309 LAAHEYKVVVSSDDA----ESIASE--VATATVTAK-------DDGVKLEIELAPMYSSRPQFVS 360 (467) T ss_pred cceEEEEEEEECCCC----cccccc--ceEEEecCc-------ccceeEEEEecCCCCCcceEEE Confidence 0111 11100 000000 000010000 011111111 22222 111111 No 122 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=96.22 E-value=0.00033 Score=39.70 Aligned_cols=255 Identities=13% Similarity=0.102 Sum_probs=122.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcceeeccCCccceeEEEeccCCcceee------------- Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK------------- 67 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~------------- 67 (330) +..+.-..+|+.++-+..-+.......++...+.++||+.+.++..-+.++. ++..++-. T Consensus 15 ~~~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~e-------i~kig~G~r~~r~~~e~~~~~ 87 (360) T protein:vir:99 15 MNSLSQKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEME-------VPQFGVPRLSGHTRDEEGSRT 87 (360) T ss_pred HHHHHhhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeeccccccc-------ccccccceeeccccccCCCCC Confidence 3444344466666655555555667788999999999999999754333221 22222211 Q ss_pred cCCccCcccceEE---EEEEEEEEecchhhhhHHHHH-hCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcC------ Q lcl|Aclame:pro 68 LYGGVLPNKSSTA---QVTDNCGMLEAYAEVDKALAD-LNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIA------ 137 (330) Q Consensus 68 lN~g~~~s~~t~~---~~~~~l~ilgg~~eVDk~la~-~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~------ 137 (330) =+.+.+.+...+. .+...+.+ +...+.+ .+-...++--.-+.++.++.+..++...|+||+.+. 
T Consensus 88 ~~~~~~~~~v~~~~~~~~~~~~~i------~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~ 161 (360) T protein:vir:99 88 ENSEAESGSVKFNATDKSYYILVE------PKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIG 161 (360) T ss_pred cCCcCccccCccccccceeeEeec------hHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCc Confidence 1122222222221 22222222 2333322 111111222223467889999999999999997652 Q ss_pred -hhh----ccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCC Q lcl|Aclame:pro 138 -PAE----FTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGG 212 (330) Q Consensus 138 -p~~----F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg 212 (330) .+. .+|.-+.+ .+..+.||..|--.. +..+|..- ...| T Consensus 162 ~~d~fl~~~dGwlKka-----~~~~~~id~a~d~t~-------------------------~~~~~~~~---~~~~---- 204 (360) T protein:vir:99 162 GAAELDNTFKGWIARA-----EGDAQSVDDAGDSTR-------------------------IGLEDTAT---ADAD---- 204 (360) T ss_pred ccchhhhhhHHHHHHh-----hcccchhhccccccc-------------------------cccccccc---cccc---- Confidence 122 35553432 122344543321110 01111110 0001 Q ss_pred ceeEEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHH-HhccCCCCCCEEEEeChHHHHHHHHHhh Q lcl|Aclame:pro 213 RMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAA-ERIPQLGMGRAVWYMNRNLREKLRLGIV 291 (330) Q Consensus 213 ~~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~-~~ip~~~~g~~~~y~n~~v~~~L~~q~~ 291 (330) +..-++|-+...-+ .+....|.+.|++++ .+.-+-.+...+|||+......-+.+.. 
T Consensus 205 ---------------------~~~~~~~~~g~~~~-~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~ 262 (360) T protein:vir:99 205 ---------------------SMPSIANTDGSGNP-QPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLT 262 (360) T ss_pred ---------------------cchhhhcccccccc-ccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHh Confidence 01111233211100 011234544554444 1111122224589999998888887776 Q ss_pred ccccceeeecccCCcceEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 292 DKIANNLTWETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 292 ~~~~~~l~~~~~~g~~v~~~~gvpir~~dal~~tE~~Vv 330 (330) ++.- -|--.-..|+-...+.|+||..|..+.+. .|. T Consensus 263 ~R~t-~LGd~~l~g~~~~~~~Gipi~~v~~~pd~--~~m 298 (360) T protein:vir:99 263 ERED-PLGSAVIFGDSDITPFSYDLVGVNGFPDE--YMM 298 (360) T ss_pred ccCc-ccchhheecccccccceeeeEEcCCCCCC--ceE Confidence 6543 22222334445566889999999988753 233 No 123 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=94.95 E-value=0.0031 Score=34.33 Aligned_cols=241 Identities=13% Similarity=0.159 Sum_probs=126.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccch----hHhhcceeeccCCccceeEEEeccCCcceeecCCcc---C Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNP----VLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGV---L 73 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~----iL~~lpf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~---~ 73 (330) ||+-+++.-| ....+...-+.+++.+-+| +|..+-=..+ ..+-|.|.. .+|-.+.=..-.||- . 
T Consensus 1 ma~~~~~~~t-------~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a-~~~~~~W~~-d~l~~~~~~~~~EG~da~~ 71 (317) T protein:vir:88 1 MATPTNAVST-------VEINGKREDLIDIIYNIAPYDTPFMSAIGKGVA-TAITHEWQT-DELRQPGKNTRVEGEDATI 71 (317) T ss_pred CCccccceEe-------eeeeeeeechhhhheecCCccCcceeeecCcee-cccEEEEEe-eecCCccccccccCccccc Confidence 8876666555 2222333334444444444 3433221112 122344432 445444333333442 2 Q ss_pred cccceEEEEEEEEEEecchhhhhHHHHHh--CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCC------CcChhhccChh Q lcl|Aclame:pro 74 PNKSSTAQVTDNCGMLEAYAEVDKALADL--NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGND------GIAPAEFTGLS 145 (330) Q Consensus 74 ~s~~t~~~~~~~l~ilgg~~eVDk~la~~--~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~------~~~p~~F~GL~ 145 (330) .....+....-.|-|+.-.+.|-.-.... .| ..+..+.|.+.+++.+...++..||+|.- +..|....||. T Consensus 72 ~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G-~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~ 150 (317) T protein:vir:88 72 KAGSFTTMLNNYCQISDETLQVTGTADRVKKAG-RKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIF 150 (317) T ss_pred ccccCCEEeccEEEEEEeEEEEeehhhhhhhcC-ccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHH Confidence 22245667777788888888888755444 33 34677889999999999999999999963 33356677765 Q ss_pred hhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeee Q lcl|Aclame:pro 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) Q Consensus 146 ~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~ 225 (330) .-++ ..+++.++|..+. .|... .. 
-++.+ T Consensus 151 ~~i~------t~~~~~~~g~~~~----------------------------~~~~~-~~--t~~t~-------------- 179 (317) T protein:vir:88 151 AYYK------TNGSLGANGVAPV----------------------------GDGSN-TG--TAGDL-------------- 179 (317) T ss_pred HHhc------cCceeccCccccc----------------------------cCCCc-cc--ccccc-------------- Confidence 4331 1122222221100 00000 00 01110 Q ss_pred eeEEeccccEEEeecccccccccchhHHH-HHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeee---c Q lcl|Aclame:pro 226 GLTLRDWRYVARVCNIDVSDLATSANAQA-LIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTW---E 301 (330) Q Consensus 226 Gl~v~d~r~v~RI~NId~~~l~~~~~~~~-l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~---~ 301 (330) ....++ |.+.+.+.|+ -+.....||||...+.++.....+. ..++.. + T Consensus 180 -----------------------~~lte~~l~~~l~~i~~----~Gg~~~~i~v~a~~k~~i~~~~~~~-~~~i~~~~~~ 231 (317) T protein:vir:88 180 -----------------------RLLTEDMLLNASESIWR----NGGQANSIQTSSSIKKAISKNMKGR-ATEITLDASD 231 (317) T ss_pred -----------------------ccccHHHHHHHHHHHHh----cCCCCCEEEeChHHHHHHHHHhcCC-ceeEEEcccC Confidence 012243 4555555565 2233347899999999998765432 223221 2 Q ss_pred ccCCcc----eEEECCeEEEEEeeccCCccccC Q lcl|Aclame:pro 302 TVSGER----VMTFDGIPVQRTDALLNTESRVV 330 (330) Q Consensus 302 ~~~g~~----v~~~~gvpir~~dal~~tE~~Vv 330 (330) ...|-. 
++.|.-|.|+-.=.|...+.-++ T Consensus 232 ~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~ 264 (317) T protein:vir:88 232 NRIAQTVDVYESDFGKYTIRANRWFHENTLFVF 264 (317) T ss_pred eEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEE Confidence 233333 35677777777766665655555 No 124 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=92.38 E-value=0.012 Score=31.18 Aligned_cols=212 Identities=10% Similarity=-0.017 Sum_probs=109.8 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcce---eeccCCccceeEEEeccCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTA---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 77 (330) ||. +.=.|.+ .+-|.-..+-|.|.+.+..-+....+- .++..|.--.......++.+.+..=++++++++- T Consensus 1 ma~---~~T~~~~---~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~i 74 (274) T protein:vir:93 1 MPQ---GITKTSN---QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDIL 74 (274) T ss_pred CCc---cceehhh---eechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccccccc Confidence 655 2223433 233444444555555443222122211 1232232233444455778888888899999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+.+..+.-.+..+.|+...+.+ .+++ .....++...+++.++...++.. 
| .+ T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~~~~~-----------~----------~~ 130 (274) T protein:vir:93 75 ETKKREAKIRKIAKGTSITDEALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEA-----------L----------MG 130 (274) T ss_pred ccceeEEEeeeecccccccHHHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHH-----------H----------hc Confidence 9999999998888778777655444 3443 34444555677777666544411 0 00 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) +.+ ..+. T Consensus 131 ----------a~~--------------------------------------~~~~------------------------- 137 (274) T protein:vir:93 131 ----------AKL--------------------------------------TVNA------------------------- 137 (274) T ss_pred ----------ccc--------------------------------------cccc------------------------- Confidence 000 0000 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECCe Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDGI 314 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~gv 314 (330) + ....+.+++++.. +.. +...+ .+++||..+...|+++.... 
....+...-...-.+-.+.|+ T Consensus 138 -------~----~~~~d~i~dA~~~-l~d--~~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:93 138 -------D----ITKLNGLQSAIDK-FND--EDLEP-MVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) T ss_pred -------c----ccCHHHHHHHHHH-hhh--ccCCc-cEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCe Confidence 0 0011233333221 111 11223 47999999999998653221 111110000111134578999 Q ss_pred EEEEEeeccCCccccC Q lcl|Aclame:pro 315 PVQRTDALLNTESRVV 330 (330) Q Consensus 315 pir~~dal~~tE~~Vv 330 (330) ||..+|.+.....-++ T Consensus 203 ~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:93 203 IIVRTNKLEAGTAILA 218 (274) T ss_pred eEEEcCCCCcceEEEE Confidence 9999999987665555 No 125 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=88.96 E-value=0.029 Score=29.00 Aligned_cols=211 Identities=13% Similarity=0.087 Sum_probs=103.7 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhccee----eccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAI----EGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~----e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||.. .-+|.++ +.|.-..+-|.|.|.+.. ++..+.-. 
++..|..-.......++.+....=+.++++++ T Consensus 1 ma~~---~T~~~d~---i~Pev~s~~v~~~~~~~~-~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQG---TTKVSNL---IVPEVLAPMMQAELDKKL-RFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) T ss_pred CCcc---ccchhhh---hhhHHHHHHHHHHHHhhh-hhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhh Confidence 6643 3355553 334444455566554332 22222111 22223223333333456777666677888888 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) -++...+..+.-.+-.+.++-.-+.+ .+++- ..-.++...+++.++...++.- | + T Consensus 74 it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~---~~~~~~~~~~~a~~~d~~i~~~-----------l----------~ 129 (274) T protein:vir:96 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQ---GEAVRQHGLAIANKVDNDVLEA-----------L----------K 129 (274) T ss_pred cccceeEEEEEeeeceeeecHHHHHhhcchHH---HHHHHHHHHHHHHHHHHHHHHH-----------H----------h Confidence 88888887777766666766544333 34443 3333445566666666554410 0 0 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) |++ . .+ +... 
T Consensus 130 -------~a~---~---------------------------------~~-----~~~~---------------------- 139 (274) T protein:vir:96 130 -------GAT---L---------------------------------TV-----EADI---------------------- 139 (274) T ss_pred -------cCC---C---------------------------------Cc-----Cccc---------------------- Confidence 000 0 00 0000 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcccc--ceeeecccCCcceEEECC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIA--NNLTWETVSGERVMTFDG 313 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~--~~l~~~~~~g~~v~~~~g 313 (330) ...+.+++++ ..+-..+.--.+++||..+...|+++....-. ......-...-.+-.+.| T Consensus 140 --------------~~~d~i~dA~----~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G 201 (274) T protein:vir:96 140 --------------TKLDGLQTAI----DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG 201 (274) T ss_pred --------------ccHHHHHHHH----HHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecC Confidence 0012223222 11111111225799999999999976432110 000000001113568899 Q ss_pred eEEEEEeeccCCccccC Q lcl|Aclame:pro 314 IPVQRTDALLNTESRVV 330 (330) Q Consensus 314 vpir~~dal~~tE~~Vv 330 (330) ++|..+|++..+.+-++ T Consensus 202 ~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:96 202 AVIVRSNKLNKGEALLA 218 (274) T ss_pred eeEEEcCCCCcceEEEE Confidence 99999999987665555 No 126 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=88.26 E-value=0.033 Score=28.67 Aligned_cols=216 Identities=15% Similarity=0.065 Sum_probs=85.4 Q ss_pred CCccc-cc------cccHHHHHhhcCcccchHHH---------------------------HHHHhccchhHhhcceeec Q lcl|Aclame:pro 1 MATLS-TN------NPTMADVAKRLDPNGKVDII---------------------------VEMLNQTNPVLQDMTAIEG 46 (330) Q Consensus 1 M~~~~-~~------a~TL~E~Ak~~~~d~~~~~V---------------------------IE~l~~~s~iL~~lpf~e~ 
46 (330) |+.-. .. ...+.+...... ....... ++...+.+++...++..+ T Consensus 171 ~~~~~~~~~~~~~~~~e~r~~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 248 (480) T protein:vir:40 171 REASIPSEKPEDAERKFMRELGSKMA-EMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLAE- 248 (480) T ss_pred hhhhccccchhhhhhHHHHHHHHHhc-cchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcceeee- Confidence 11000 00 000001000000 0000000 000011111111111110 Q ss_pred cCCccceeEEEeccCCcceee----cCCccCcccceEEEEEEEEEEecchhhhhHHHHHhCCCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 47 NLPTGHRTSVRTGLPTPTWRK----LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMN 122 (330) Q Consensus 47 n~g~~~~~~~~~~lP~~~fR~----lN~g~~~s~~t~~~~~~~l~ilgg~~eVDk~la~~~g~~~~~ra~e~~~~ika~~ 122 (330) .+.....|.. -++.-+++. .......-.+.+....+.++..++..+..++..--.....++++ T Consensus 249 -----------~g~~~~~~~~e~~~~~~~~~~~~--~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~ 315 (480) T protein:vir:40 249 -----------DGVDDTFISGTFKAGTDKNKSQT--ATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNRVI 315 (480) T ss_pred -----------ccccceeeeeeeecccccccccc--cccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHHHH Confidence 0111112222 111111111 11111111123455566666655544433333333344568888 Q ss_pred HHHHHhhccCCCCcChhhccChhhhhcccccCCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceecccccc Q lcl|Aclame:pro 123 QEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQV 202 (330) Q Consensus 123 ~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~ 202 (330) .+.+.+|++||... .+.|.|+ |.. . 
T Consensus 316 ~~ee~a~l~G~g~g-~~~~~g~------------------------------------------~~~-------~----- 340 (480) T protein:vir:40 316 QKVEYNMILGSVDG-SNGFYGL------------------------------------------KTA-------T----- 340 (480) T ss_pred HHHHHHhhccCCCC-ccccccc------------------------------------------eee-------c----- Confidence 99999999996211 1111111 000 0 Q ss_pred ccccccccCCceeEEEEEEeeeeeeEEeccccEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHH Q lcl|Aclame:pro 203 TIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNL 282 (330) Q Consensus 203 ~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v 282 (330) ++ ++ .+ ....|+++.|+.|+. +....+.+.|+||++. T Consensus 341 -----~~-----------------~~----------~~---------~~~~d~id~L~~al~--~~y~~~a~~~vmn~~t 377 (480) T protein:vir:40 341 -----DG-----------------WT----------KQ---------IEYTDLFEGITDAVA--ECSISDAITIVMSPQT 377 (480) T ss_pred -----cc-----------------cc----------cc---------chhHHHHHHHHHhhh--HHhhCCCCEEEECHHH Confidence 00 00 00 011344544444443 2223445579999999 Q ss_pred HHHHHHHhhccccceeeecccCCcceEEECCeEEEEEeecc-CCccccC Q lcl|Aclame:pro 283 REKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTDALL-NTESRVV 330 (330) Q Consensus 283 ~~~L~~q~~~~~~~~l~~~~~~g~~v~~~~gvpir~~dal~-~tE~~Vv 330 (330) ...|++ .+|....++-........+-...|.||..++... ..+..|. 
T Consensus 378 ~~~I~k-lKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~ 425 (480) T protein:vir:40 378 FAELRK-AKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVY 425 (480) T ss_pred HHHHHH-hhcCCCCeeccCcccccCcceecccceeeeeccccCCcceee Confidence 999996 4666666666555555556666789987665443 3454444 No 127 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=86.78 E-value=0.043 Score=28.07 Aligned_cols=218 Identities=15% Similarity=0.078 Sum_probs=108.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcce----eeccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTA----IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf----~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||.+.| +|.+ .+-|.-..+-|.|.|.+ +-++..+.- .++..|.--.......+..+.+..=+.++++++ T Consensus 1 Ma~~~T---~~~~---~iiPev~s~~v~~~~~~-~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~ 73 (278) T protein:vir:80 1 MADLTT---KLAN---LIDPEVMGPMISAKLPK-AIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSA 73 (278) T ss_pred CCCcce---ehhh---eecHHHHHHHHHHHHHH-hhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccc Confidence 776422 2322 23344444555666644 222222221 123223222333444456677777678888999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~~g-~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) -++.+.+..+.-.+-.++|+...+.+.+ +.- ..-.+++-.+++.++...++.. 
+.| T Consensus 74 lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~l~~~--------l~~------------ 130 (278) T protein:vir:80 74 LETESVKHGIKKAGKGVKLTDESVLSGYGDPV---EEAQKQIRMAIASKVDNDILEE--------ALT------------ 130 (278) T ss_pred cccceeeEeeehhhccccccHHHHhhccccHH---HHHHHHHHHHHHHHHHHHHHHH--------Hhc------------ Confidence 8888888888887778888886655543 443 3344455677777777555421 100 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +.+.++ +.+ +. + T Consensus 131 a~~~~~---------------------------~~~-----------t~---~--------------------------- 142 (278) T protein:vir:80 131 TTLEVK---------------------------GAI-----------NI---G--------------------------- 142 (278) T ss_pred cccccc---------------------------ccc-----------cc---c--------------------------- Confidence 000000 000 00 0 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDG 313 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~g 313 (330) .+|.+ -....|+.+++.. +.+|. . -+++||..+...|++..... 
+...+...-.-.-.+-++.| T Consensus 143 ----~~~~~----~~~~~da~~~l~~--~~~~~--~--~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G 208 (278) T protein:vir:80 143 ----LIDKI----ENTFTDAPDAIED--ESITT--T--GVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLG 208 (278) T ss_pred ----hhhhH----HHHHHHHHHhhcc--cCCCc--c--cEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecc Confidence 00000 0011122222211 23332 1 25889999999998653211 11111111011123558899 Q ss_pred eEEEEEeeccCCccccC Q lcl|Aclame:pro 314 IPVQRTDALLNTESRVV 330 (330) Q Consensus 314 vpir~~dal~~tE~~Vv 330 (330) ++|..+|++.+...-++ T Consensus 209 ~~Vi~s~~~p~~t~~l~ 225 (278) T protein:vir:80 209 WEIVRTKKLADGNALAV 225 (278) T ss_pred eeEEEcCCCCcceEEEE Confidence 99999999987655555 No 128 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=86.72 E-value=0.044 Score=28.04 Aligned_cols=213 Identities=13% Similarity=0.045 Sum_probs=105.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcce---eeccCCccceeEEEeccCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTA---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 77 (330) || |++. =+|++ ..-|.-..+-|.|.+.+..-+-..... 
.++..|.--....-..++.+....=+..+++.+- T Consensus 1 ~~-~~~~-T~l~d---~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~l 75 (275) T protein:vir:96 1 MA-LENM-TKLAN---MVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLI 75 (275) T ss_pred CC-Cccc-chhhh---hhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhc Confidence 54 2221 24444 234555556677777654333222221 1222221122222234556666666678888888 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+.+..+.-.+-.+.++-.-+.+ .+|+-...+ ++.-.+++.++...++. .+ ++ T Consensus 76 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~---~~~a~~~a~~~d~~ll~--------~l-------------~~ 131 (275) T protein:vir:96 76 ETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAV---RQHGLAIANKVDNDVLE--------AL-------------QG 131 (275) T ss_pred ccceeeEEeehhcccccccHHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHH--------HH-------------hc Confidence 8888888887777778886654444 455543333 33445566555544330 00 00 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) ++. ..+. 
T Consensus 132 -------a~~-----------------------------------------~~~~------------------------- 138 (275) T protein:vir:96 132 -------ATL-----------------------------------------KVEA------------------------- 138 (275) T ss_pred -------ccc-----------------------------------------cccc------------------------- Confidence 000 0000 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECCe Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDGI 314 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~gv 314 (330) + . ...+.+.+ |...+-.....-.+++||..+...|+++.... ....+.......-.+-.+.|+ T Consensus 139 -----~--~----~~~d~i~d----A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~ 203 (275) T protein:vir:96 139 -----D--I----TKLAGLQT----AIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGA 203 (275) T ss_pred -----c--c----cCHHHHHH----HHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCe Confidence 0 0 01122222 22222111112247999999999998764221 111111111111235679999 Q ss_pred EEEEEeeccCCccccC Q lcl|Aclame:pro 315 PVQRTDALLNTESRVV 330 (330) Q Consensus 315 pir~~dal~~tE~~Vv 330 (330) +|.++|.+....+-++ T Consensus 204 ~Vi~s~~~p~~t~~i~ 219 (275) T protein:vir:96 204 IIVRSNKIKEGEAILA 219 (275) T ss_pred eEEEeCCCCcceEEEE Confidence 9999999987776665 No 129 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=76.31 E-value=0.14 Score=25.32 Aligned_cols=208 Identities=16% Similarity=0.136 Sum_probs=98.6 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhc---ceeeccCCccceeEEEeccCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDM---TAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~l---pf~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 77 (330) ||. 
+.=+|+++ ..|.-..+-|.|.|.+..-+-... .-.++..|.--....-..+..+.+..=+..+++++- T Consensus 1 ma~---~~T~~~d~---iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~l 74 (272) T protein:vir:36 1 MSK---QKTTLADL---VNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKI 74 (272) T ss_pred CCC---cceehhhh---hchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhc Confidence 553 33345442 334444455666665432222211 112333222222233333444555444566888888 Q ss_pred eEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+.+..+.-.+-.+.|+-..+.+ .+++-. .-.+++..+++.++...++.. | . T Consensus 75 t~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~---~~~~~~a~~~a~~~d~~i~~~-----------l----------~- 129 (272) T protein:vir:36 75 GTTTKSVTIKKAAKGTEITDEAALSGYGDPIG---ESNKQLGLSLANKVDDDLLSA-----------A----------K- 129 (272) T ss_pred CCcceeEeeehhhccccccHHHHhhccchHHH---HHHHHHHHHHHHHHHHHHHHH-----------h----------c- Confidence 8888888888888888887644444 344433 333444455666655443310 0 0 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) |..++ + .+ . 
T Consensus 130 ---------~~~~~---------------------------------~---~~---~----------------------- 138 (272) T protein:vir:36 130 ---------TTSQT---------------------------------V---ST---K----------------------- 138 (272) T ss_pred ---------ccccc---------------------------------c---cc---c----------------------- Confidence 00000 0 00 0 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhccccceeeecccCC----cceEEEC Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG----ERVMTFD 312 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~~~~~l~~~~~~g----~~v~~~~ 312 (330) .+.| ...++..+|-.+ ..+ -.+++||..+...|+++... .........+ -.+-.+. T Consensus 139 --~~~d--------~i~~A~~~lgd~------~~~-~~~ivv~p~~~~~L~k~~~~---~~~~~~~~~~~~~~G~ig~~~ 198 (272) T protein:vir:36 139 --ANVD--------GVQAALDIFNDE------DAQ-AYVLIVNPKDAAKIRKDANA---KNIGSEVGANALINGTYADVL 198 (272) T ss_pred --ccHH--------HHHHHHHHhhhc------CCC-ceEEEEcHHHHHHHhccccc---ccccccccccceeeeccceec Confidence 0000 111222222111 112 25899999999999864321 1111111000 1245789 Q ss_pred CeEEEEEeeccCCccc---cC Q lcl|Aclame:pro 313 GIPVQRTDALLNTESR---VV 330 (330) Q Consensus 313 gvpir~~dal~~tE~~---Vv 330 (330) |++|..+|++..+-.. 
++ T Consensus 199 G~~Vv~s~~~p~~~~~~~~~~ 219 (272) T protein:vir:36 199 GAQIVRSKKLAEGSALMFKIV 219 (272) T ss_pred CeeEEEeCCCCCCceeEEEEE Confidence 9999999999864332 22 No 130 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=73.21 E-value=0.17 Score=24.76 Aligned_cols=211 Identities=11% Similarity=0.063 Sum_probs=100.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhc-ce---eeccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDM-TA---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~l-pf---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||. +.=+|++ .+-|.-..+-|.|.+.+.. ++..+ .. .++..|.--.......+..+.-..=+.++++.+ T Consensus 1 ma~---~~T~~~d---~iiPev~~~~v~~~~~~~l-~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~ 73 (274) T protein:vir:97 1 MPQ---GLTKTSD---QIIPEVLAPMMQAQLEKKL-RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCc---cceehhh---eechHHHHHHHHHhhhhhh-hhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccc Confidence 554 3224444 2334444444555553321 11111 11 122222212222223345555555567888888 Q ss_pred ceEEEEEEEEEEecchhhhhHHH-HHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKAL-ADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~l-a~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) -++.+.+..+.-.+-.++|+-.- +.-.+++- ....++...++..++...++.- + . 
T Consensus 74 lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~---~~~~~~~a~a~a~~vd~~~~~~--------l-------------~ 129 (274) T protein:vir:97 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA--------L-------------M 129 (274) T ss_pred cccceeEEEeeeecceecccHHHHHhccchHH---HHHHHHHHHHHHHHHHHHHHHH--------H-------------h Confidence 88888888877777566665533 33345543 3334445566666665444310 0 0 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +++ +. + ++. . T Consensus 130 --------~a~---------------------------~~--------~---~~~--~---------------------- 139 (274) T protein:vir:97 130 --------GAK---------------------------LT--------V---NAD--I---------------------- 139 (274) T ss_pred --------ccC---------------------------cc--------c---ccc--c---------------------- Confidence 000 00 0 000 0 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDG 313 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~g 313 (330) ...+.+++++ ..+-....--.+++||..+...|+++.... 
....+...-.-.-.+-.+.| T Consensus 140 --------------~~~d~i~dA~----~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G 201 (274) T protein:vir:97 140 --------------TKLNGLQSAI----DKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG 201 (274) T ss_pred --------------cCHHHHHHHH----HHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC Confidence 0012233322 222111222257999999999999753221 11111000011112457899 Q ss_pred eEEEEEeeccCCccccC Q lcl|Aclame:pro 314 IPVQRTDALLNTESRVV 330 (330) Q Consensus 314 vpir~~dal~~tE~~Vv 330 (330) ++|..+|.+.....-++ T Consensus 202 ~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:97 202 AIIVRTNKLEAGTAILA 218 (274) T ss_pred eeEEEcCCCCcceEEEE Confidence 99999999987766655 No 131 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=73.21 E-value=0.17 Score=24.76 Aligned_cols=211 Identities=11% Similarity=0.063 Sum_probs=100.4 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhc-ce---eeccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDM-TA---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~l-pf---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||. +.=+|++ .+-|.-..+-|.|.+.+.. ++..+ .. 
.++..|.--.......+..+.-..=+.++++.+ T Consensus 1 ma~---~~T~~~d---~iiPev~~~~v~~~~~~~l-~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~ 73 (274) T protein:vir:94 1 MPQ---GLTKTSD---QIIPEVLAPMMQAQLEKKL-RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCc---cceehhh---eechHHHHHHHHHhhhhhh-hhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccc Confidence 554 3224444 2334444444555553321 11111 11 122222212222223345555555567888888 Q ss_pred ceEEEEEEEEEEecchhhhhHHH-HHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKAL-ADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~l-a~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) -++.+.+..+.-.+-.++|+-.- +.-.+++- ....++...++..++...++.- + . T Consensus 74 lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~---~~~~~~~a~a~a~~vd~~~~~~--------l-------------~ 129 (274) T protein:vir:94 74 LETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA--------L-------------M 129 (274) T ss_pred cccceeEEEeeeecceecccHHHHHhccchHH---HHHHHHHHHHHHHHHHHHHHHH--------H-------------h Confidence 88888888877777566665533 33345543 3334445566666665444310 0 0 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +++ +. + ++. . 
T Consensus 130 --------~a~---------------------------~~--------~---~~~--~---------------------- 139 (274) T protein:vir:94 130 --------GAK---------------------------LT--------V---NAD--I---------------------- 139 (274) T ss_pred --------ccC---------------------------cc--------c---ccc--c---------------------- Confidence 000 00 0 000 0 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDG 313 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~g 313 (330) ...+.+++++ ..+-....--.+++||..+...|+++.... ....+...-.-.-.+-.+.| T Consensus 140 --------------~~~d~i~dA~----~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G 201 (274) T protein:vir:94 140 --------------TKLNGLQSAI----DKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG 201 (274) T ss_pred --------------cCHHHHHHHH----HHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC Confidence 0012233322 222111222257999999999999753221 11111000011112457899 Q ss_pred eEEEEEeeccCCccccC Q lcl|Aclame:pro 314 IPVQRTDALLNTESRVV 330 (330) Q Consensus 314 vpir~~dal~~tE~~Vv 330 (330) ++|..+|.+.....-++ T Consensus 202 ~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:94 202 AIIVRTNKLEAGTAILA 218 (274) T ss_pred eeEEEcCCCCcceEEEE Confidence 99999999987766655 No 132 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=62.20 E-value=0.34 Score=23.16 Aligned_cols=209 Identities=12% Similarity=0.083 Sum_probs=99.9 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhccee----eccCCccceeEE--EeccCCcceeecCCccCc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAI----EGNLPTGHRTSV--RTGLPTPTWRKLYGGVLP 74 (330) Q Consensus 1 
M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf~----e~n~g~~~~~~~--~~~lP~~~fR~lN~g~~~ 74 (330) ||. +.=+|+++ .-|.-..+-|.|.|.+. -++..+.-. ++.. |...++ -..++.+.-..=+.++++ T Consensus 1 ma~---~~T~l~d~---iiPev~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~--G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:12 1 MAQ---GLTKTSNQ---IIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQP--GDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CCc---ceeehhhh---hchHHHHHHHHHHHHhh-hhhcccceecccccCCC--CCEEEEeeecCCCccccccCCCccch Confidence 554 33244442 33444445555555432 112222111 2221 222221 123445554455577888 Q ss_pred ccceEEEEEEEEEEecchhhhhH-HHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhccccc Q lcl|Aclame:pro 75 NKSSTAQVTDNCGMLEAYAEVDK-ALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSA 153 (330) Q Consensus 75 s~~t~~~~~~~l~ilgg~~eVDk-~la~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~ 153 (330) ++-++.+.+..+.-.+-.++|+- +.+.-.+|+-+.. .++.-.+++.++...++. .+ T Consensus 72 ~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~---~~q~~~~~a~~vd~~~l~--------~~------------ 128 (274) T protein:vir:12 72 DILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQ---VRQHGLAHANKVDNDVLE--------AL------------ 128 (274) T ss_pred hhcccceeeEEeeeecceeeecHHHHHhcccchHHHH---HHHHHHHHHHHHHHHHHH--------HH------------ Confidence 88888888888777777788844 4455456654333 334445556555543331 00 Q ss_pred CCcceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccc Q lcl|Aclame:pro 154 ENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWR 233 (330) Q Consensus 154 ~~~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r 233 (330) .+ ++ + . .+.+ . 
T Consensus 129 -~~-------a~----------------------------~--------~---~~~~--a-------------------- 139 (274) T protein:vir:12 129 -MG-------AK----------------------------L--------T---VNAD--I-------------------- 139 (274) T ss_pred -hc-------cc----------------------------c--------c---cccc--c-------------------- Confidence 00 00 0 0 0000 0 Q ss_pred cEEEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEE Q lcl|Aclame:pro 234 YVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTF 311 (330) Q Consensus 234 ~v~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~ 311 (330) ...+.+++ |...+-.....-.+++||..+...|+++.... .......+-.-.-.+-.+ T Consensus 140 ----------------~~~d~i~d----A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:12 140 ----------------TKLNGLQS----AIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred ----------------cCHHHHHH----HHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceee Confidence 01122222 22222211122247999999999999764221 111111111111234578 Q ss_pred CCeEEEEEeeccCCccccC Q lcl|Aclame:pro 312 DGIPVQRTDALLNTESRVV 330 (330) Q Consensus 312 ~gvpir~~dal~~tE~~Vv 330 (330) .|++|..+|.+....+-++ T Consensus 200 ~G~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:12 200 LGAIIVRSNKLEAGTAILA 218 (274) T ss_pred cCeeEEEeCCCCcceEEEE Confidence 9999999999987766555 No 133 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=56.46 E-value=0.46 Score=22.45 Aligned_cols=212 Identities=13% Similarity=0.056 Sum_probs=100.0 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcce---eeccCCccceeEEEeccCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTA---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKS 77 (330) Q Consensus 1 
M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lpf---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 77 (330) ||. +.=+|.++ +-|.-..+-|.|.+.+.+-+-....- .++..|.--....-..+..+....=+..+++.+- T Consensus 1 Ma~---~~T~l~d~---i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~l 74 (276) T protein:vir:10 1 MAQ---GTTTKSTQ---IVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKI 74 (276) T ss_pred CCc---ceeehhhh---hchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcccc Confidence 553 22255553 34555556677776554433222222 1222221111222233344444334456677877 Q ss_pred eEEEEEEEEEEecchhhhhHHH-HHhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCCc Q lcl|Aclame:pro 78 STAQVTDNCGMLEAYAEVDKAL-ADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENK 156 (330) Q Consensus 78 t~~~~~~~l~ilgg~~eVDk~l-a~~~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~~ 156 (330) ++.+.+..+.-.+-.+.++-.- +...+|+-...+ ++.-.+++.++...++. .+ . . T Consensus 75 t~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~---~~~~~~~a~~~d~~~~~--------~l----------~--~- 130 (276) T protein:vir:10 75 ETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAV---RQHGLAIANKVDNDVLE--------AL----------R--G- 130 (276) T ss_pred ccceeeEEeehccccccccHHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHH--------HH----------h--c- Confidence 7777777777777777776543 444566544444 44445566655544330 00 0 0 Q ss_pred ceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccEE Q lcl|Aclame:pro 157 DNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVA 236 (330) Q Consensus 157 ~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v~ 236 (330) ++.. + +. 
T Consensus 131 -------~~~~------------------------------------~-----~~------------------------- 137 (276) T protein:vir:10 131 -------TKLT------------------------------------V-----SA------------------------- 137 (276) T ss_pred -------cccc------------------------------------c-----cc------------------------- Confidence 0000 0 00 Q ss_pred EeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc-c-cceeeecccCCcceEEECCe Q lcl|Aclame:pro 237 RVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK-I-ANNLTWETVSGERVMTFDGI 314 (330) Q Consensus 237 RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~-~-~~~l~~~~~~g~~v~~~~gv 314 (330) .. ...+.+ ..|+..+-..+---.+++||..+...|+++.... . ...+.......-.+-.+.|+ T Consensus 138 -------~~----~t~d~i----~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (276) T protein:vir:10 138 -------DI----GTLAGL----EAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGA 202 (276) T ss_pred -------cc----cCHHHH----HHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecce Confidence 00 000111 1222222111112257899999999999753221 0 11111111111224578999 Q ss_pred EEEEEeeccCCccccC Q lcl|Aclame:pro 315 PVQRTDALLNTESRVV 330 (330) Q Consensus 315 pir~~dal~~tE~~Vv 330 (330) +|..+|.+....+-++ T Consensus 203 ~Vi~s~~~p~~t~~l~ 218 (276) T protein:vir:10 203 VIVRSKKLDEGEAILA 218 (276) T ss_pred eEEEcCCCCcceEEEE Confidence 9999999876555444 No 134 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=27.44 E-value=1.8 Score=19.13 Aligned_cols=211 Identities=9% Similarity=0.000 Sum_probs=95.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcc-e---eeccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMT-A---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lp-f---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||. 
..=+|.+ ...|.-..+-|.|.+.+.. ++..+- . .++..|.--.......+..+.-..=+.++++.+ T Consensus 1 m~~---~~T~l~d---~i~Pev~~~~v~~~~~~~l-~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~ 73 (274) T protein:vir:95 1 MAQ---GMTKLTN---QIVPEVLAPMMQAELEKKL-RFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDI 73 (274) T ss_pred CCc---ceeehhh---eechHHHHHHHHHHHHhhh-hccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhh Confidence 554 3234544 2334444444555554322 121111 1 122222111111222234454444456777777 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) -++.+.+..+.-.+-.+.++-.-+.+ .+|+-+.. .++.-.+++.++...++. .+ . + T Consensus 74 lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~---~~~~~~~~a~~vd~~i~~--------~l----------~--~ 130 (274) T protein:vir:95 74 LETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQ---VRQHGLAHANKVDDDVLE--------AL----------K--S 130 (274) T ss_pred cccceeEEEeeeeecceeehHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHHH--------HH----------h--c Confidence 77777777776665566666433323 34544333 334445666655543330 00 0 0 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +.+.+ +. 
T Consensus 131 a~~~~-----------------------------------------------~~-------------------------- 137 (274) T protein:vir:95 131 AKLTV-----------------------------------------------EA-------------------------- 137 (274) T ss_pred ccccc-----------------------------------------------cc-------------------------- Confidence 00000 00 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDG 313 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~g 313 (330) .. ...+.+.+.+.+ +.. ....+ .+++||..+...|+++.... ....+..+-..--.+-.+.| T Consensus 138 --------~~----~~~d~i~~A~~~-lgd--~~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G 201 (274) T protein:vir:95 138 --------DI----TKLTGLQTAIDK-FND--EDLEP-MVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG 201 (274) T ss_pred --------cc----cCHHHHHHHHHH-hcc--ccccc-cEEEeCHHHHHHHHhhccccccccccccccceeccccceecC Confidence 00 011222222211 111 11233 47999999999999753221 11111111111112557899 Q ss_pred eEEEEEeeccCCccccC Q lcl|Aclame:pro 314 IPVQRTDALLNTESRVV 330 (330) Q Consensus 314 vpir~~dal~~tE~~Vv 330 (330) ++|..+|++....+-++ T Consensus 202 ~~Vi~s~~~~~~t~~l~ 218 (274) T protein:vir:95 202 AVIVRSNKLEAGTAILA 218 (274) T ss_pred eEEEEeCCCCCceEEEE Confidence 99999999987666555 No 135 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=27.44 E-value=1.8 Score=19.13 Aligned_cols=211 Identities=9% Similarity=0.000 Sum_probs=95.1 Q ss_pred CCccccccccHHHHHhhcCcccchHHHHHHHhccchhHhhcc-e---eeccCCccceeEEEeccCCcceeecCCccCccc Q lcl|Aclame:pro 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMT-A---IEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNK 76 (330) Q Consensus 1 M~~~~~~a~TL~E~Ak~~~~d~~~~~VIE~l~~~s~iL~~lp-f---~e~n~g~~~~~~~~~~lP~~~fR~lN~g~~~s~ 76 (330) ||. 
..=+|.+ ...|.-..+-|.|.+.+.. ++..+- . .++..|.--.......+..+.-..=+.++++.+ T Consensus 1 m~~---~~T~l~d---~i~Pev~~~~v~~~~~~~l-~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQ---GMTKLTN---QIVPEVLAPMMQAELEKKL-RFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDI 73 (274) T ss_pred CCc---ceeehhh---eechHHHHHHHHHHHHhhh-hccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhh Confidence 554 3234544 2334444444555554322 121111 1 122222111111222234454444456777777 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHHh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcChhhccChhhhhcccccCC Q lcl|Aclame:pro 77 SSTAQVTDNCGMLEAYAEVDKALADL-NGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) Q Consensus 77 ~t~~~~~~~l~ilgg~~eVDk~la~~-~g~~~~~ra~e~~~~ika~~~~~~~~~iyGD~~~~p~~F~GL~~R~~~~t~~~ 155 (330) -++.+.+..+.-.+-.+.++-.-+.+ .+|+-+.. .++.-.+++.++...++. .+ . + T Consensus 74 lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~---~~~~~~~~a~~vd~~i~~--------~l----------~--~ 130 (274) T protein:vir:96 74 LETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQ---VRQHGLAHANKVDDDVLE--------AL----------K--S 130 (274) T ss_pred cccceeEEEeeeeecceeehHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHHH--------HH----------h--c Confidence 77777777776665566666433323 34544333 334445666655543330 00 0 0 Q ss_pred cceeeecCCCCCCceEEEEEEeCCCcEEEEccccccccceeccccccccccccccCCceeEEEEEEeeeeeeEEeccccE Q lcl|Aclame:pro 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) Q Consensus 156 ~~~vidAGgtg~~~tSi~~V~~g~~~~~~iypkg~kagl~~~D~g~~~~~~~d~~gg~~~~y~~~~~w~~Gl~v~d~r~v 235 (330) +.+.+ +. 
T Consensus 131 a~~~~-----------------------------------------------~~-------------------------- 137 (274) T protein:vir:96 131 AKLTV-----------------------------------------------EA-------------------------- 137 (274) T ss_pred ccccc-----------------------------------------------cc-------------------------- Confidence 00000 00 Q ss_pred EEeecccccccccchhHHHHHHHHHHHHHhccCCCCCCEEEEeChHHHHHHHHHhhcc--ccceeeecccCCcceEEECC Q lcl|Aclame:pro 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK--IANNLTWETVSGERVMTFDG 313 (330) Q Consensus 236 ~RI~NId~~~l~~~~~~~~l~~~m~~a~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~~--~~~~l~~~~~~g~~v~~~~g 313 (330) .. ...+.+.+.+.+ +.. ....+ .+++||..+...|+++.... ....+..+-..--.+-.+.| T Consensus 138 --------~~----~~~d~i~~A~~~-lgd--~~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G 201 (274) T protein:vir:96 138 --------DI----TKLTGLQTAIDK-FND--EDLEP-MVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG 201 (274) T ss_pred --------cc----cCHHHHHHHHHH-hcc--ccccc-cEEEeCHHHHHHHHhhccccccccccccccceeccccceecC Confidence 00 011222222211 111 11233 47999999999999753221 11111111111112557899 Q ss_pred eEEEEEeeccCCccccC Q lcl|Aclame:pro 314 IPVQRTDALLNTESRVV 330 (330) Q Consensus 314 vpir~~dal~~tE~~Vv 330 (330) ++|..+|++....+-++ T Consensus 202 ~~Vi~s~~~~~~t~~l~ 218 (274) T protein:vir:96 202 AVIVRSNKLEAGTAILA 218 (274) T ss_pred eEEEEeCCCCCceEEEE Confidence 99999999987666555 Done!