Query lcl|Aclame:protein:vir:98525|NCBI_annot:hypothetical protein predicted by GeneMark|genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853
Match_columns 331
No_of_seqs 96 out of 105
Neff 5.8
Searched_HMMs 1612
Date Sun Dec 1 09:16:37 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:107826 Length: 331 100.0 4E-153 3E-156 856.0 34.8 331 1-331 1-331 (331)
2 protein:vir:98525 Length: 331 100.0 4E-153 3E-156 856.0 34.8 331 1-331 1-331 (331)
3 protein:vir:107388 Length: 331 100.0 4E-153 3E-156 856.0 34.8 331 1-331 1-331 (331)
4 protein:vir:103759 Length: 330 100.0 8E-152 5E-155 848.8 34.2 328 1-331 1-330 (330)
5 protein:vir:7324 Length: 335 # 100.0 1E-151 9E-155 847.6 34.4 329 1-331 1-334 (335)
6 protein:vir:95318 Length: 328 100.0 4E-149 3E-152 834.1 34.5 328 1-331 1-328 (328)
7 protein:vir:94933 Length: 330 100.0 2.6E-59 1.6E-62 341.8 19.8 224 1-243 20-330 (330)
8 protein:vir:97255 Length: 310 100.0 7.4E-53 4.6E-56 306.4 20.2 218 1-242 1-310 (310)
9 protein:vir:8187 Length: 311 # 99.3 8.3E-13 5.2E-16 86.8 19.7 230 1-331 1-247 (311)
10 protein:vir:104085 Length: 320 99.3 1.4E-12 8.9E-16 85.5 19.8 236 1-331 1-247 (320)
11 protein:vir:96223 Length: 324 99.2 1.2E-12 7.2E-16 86.0 16.4 225 1-331 21-245 (324)
12 protein:vir:1886 Length: 385 # 99.2 4.9E-13 3.1E-16 88.0 13.9 229 1-331 96-327 (385)
13 protein:vir:191 Length: 385 # 99.2 4.9E-13 3.1E-16 88.0 13.9 229 1-331 96-327 (385)
14 protein:vir:97053 Length: 390 99.2 1E-12 6.4E-16 86.3 15.5 229 1-331 96-336 (390)
15 protein:vir:105905 Length: 304 99.2 3.4E-12 2.1E-15 83.4 17.5 229 1-331 1-233 (304)
16 protein:vir:94142 Length: 304 99.2 3.4E-12 2.1E-15 83.4 17.5 229 1-331 1-233 (304)
17 protein:vir:7771 Length: 330 # 99.2 3.6E-12 2.3E-15 83.3 17.5 239 1-331 1-249 (330)
18 protein:vir:94771 Length: 298 99.2 5.4E-12 3.3E-15 82.4 18.4 229 1-331 1-234 (298)
19 protein:vir:4339 Length: 395 # 99.2 1.4E-12 8.7E-16 85.6 15.0 231 1-331 106-338 (395)
20 protein:vir:9759 Length: 303 # 99.2 1.7E-11 1E-14 79.7 20.3 231 1-331 1-238 (303)
21 protein:vir:103955 Length: 324 99.2 3.8E-12 2.3E-15 83.2 16.6 225 1-331 21-245 (324)
22 protein:vir:99749 Length: 324 99.2 3.8E-12 2.4E-15 83.2 16.6 225 1-331 21-245 (324)
23 protein:vir:81070 Length: 390 99.2 2.4E-12 1.5E-15 84.2 14.8 229 1-331 96-336 (390)
24 protein:vir:99920 Length: 311 99.1 1.7E-11 1E-14 79.7 19.1 230 1-331 1-240 (311)
25 protein:vir:100247 Length: 425 99.1 1.4E-11 8.9E-15 80.0 18.1 240 1-331 118-369 (425)
26 protein:vir:9309 Length: 324 # 99.1 9.1E-12 5.6E-15 81.1 16.9 225 1-331 21-245 (324)
27 protein:vir:2430 Length: 318 # 99.1 3.6E-11 2.2E-14 77.8 19.7 228 1-331 1-243 (318)
28 protein:vir:8102 Length: 543 # 99.1 5.9E-12 3.7E-15 82.2 15.0 229 1-331 243-482 (543)
29 protein:vir:41 Length: 299 # N 99.1 1.8E-11 1.1E-14 79.5 17.3 226 1-331 1-228 (299)
30 protein:vir:4226 Length: 326 # 99.1 2E-11 1.2E-14 79.2 17.3 236 1-331 1-253 (326)
31 protein:vir:2344 Length: 397 # 99.1 1.2E-11 7.3E-15 80.5 15.9 226 1-331 1-236 (397)
32 protein:vir:97148 Length: 324 99.1 1.6E-11 9.7E-15 79.8 16.4 225 1-331 21-245 (324)
33 protein:vir:10364 Length: 390 99.1 3.4E-11 2.1E-14 78.0 18.1 229 1-331 96-336 (390)
34 protein:vir:100135 Length: 418 99.1 1.3E-11 8.3E-15 80.2 15.8 227 1-331 121-358 (418)
35 protein:vir:96392 Length: 324 99.1 1.8E-11 1.1E-14 79.5 16.3 225 1-331 1-245 (324)
36 protein:vir:78830 Length: 324 99.1 1.8E-11 1.1E-14 79.5 16.3 225 1-331 1-245 (324)
37 protein:vir:4456 Length: 401 # 99.0 4.9E-11 3.1E-14 77.1 17.2 233 1-331 107-345 (401)
38 protein:vir:1638 Length: 298 # 99.0 1.4E-10 8.5E-14 74.7 18.9 229 1-331 1-234 (298)
39 protein:vir:4197 Length: 314 # 99.0 4.1E-11 2.5E-14 77.5 15.1 235 1-331 1-257 (314)
40 protein:vir:9574 Length: 300 # 99.0 2.4E-10 1.5E-13 73.3 19.2 230 1-331 1-235 (300)
41 protein:vir:78523 Length: 338 99.0 4.5E-10 2.8E-13 71.8 20.4 231 1-331 1-265 (338)
42 protein:vir:485 Length: 407 # 99.0 2.3E-10 1.5E-13 73.4 18.3 241 1-331 90-344 (407)
43 protein:vir:1328 Length: 392 # 98.9 3.1E-10 1.9E-13 72.7 17.8 230 1-331 97-336 (392)
44 protein:vir:6212 Length: 434 # 98.9 1.7E-10 1.1E-13 74.1 16.2 227 1-331 131-370 (434)
45 protein:vir:80684 Length: 315 98.9 2.4E-10 1.5E-13 73.4 16.8 227 1-331 1-242 (315)
46 protein:vir:94673 Length: 419 98.9 3.9E-10 2.4E-13 72.1 17.9 229 1-331 123-360 (419)
47 protein:vir:1433 Length: 435 # 98.9 3.4E-10 2.1E-13 72.5 17.4 230 1-331 124-366 (435)
48 protein:vir:78223 Length: 333 98.9 1.1E-09 6.6E-13 69.8 20.0 231 1-331 1-265 (333)
49 protein:vir:104256 Length: 458 98.9 3E-10 1.9E-13 72.8 16.5 234 1-331 158-403 (458)
50 protein:vir:80376 Length: 435 98.9 6.3E-10 3.9E-13 71.0 17.8 230 1-331 105-366 (435)
51 protein:vir:4159 Length: 315 # 98.9 5.4E-11 3.4E-14 76.9 11.7 236 1-331 1-262 (315)
52 protein:vir:95763 Length: 297 98.9 3.8E-10 2.4E-13 72.2 16.3 224 1-331 1-226 (297)
53 protein:vir:102119 Length: 404 98.8 9.1E-10 5.7E-13 70.1 17.4 232 1-331 92-341 (404)
54 protein:vir:98339 Length: 415 98.8 7.1E-10 4.4E-13 70.7 16.4 230 1-331 116-350 (415)
55 protein:vir:79987 Length: 415 98.8 7.1E-10 4.4E-13 70.7 16.4 230 1-331 116-350 (415)
56 protein:vir:81100 Length: 415 98.8 7.1E-10 4.4E-13 70.7 16.4 230 1-331 116-350 (415)
57 protein:vir:2504 Length: 305 # 98.8 1E-09 6.3E-13 69.9 16.8 225 1-331 1-232 (305)
58 protein:vir:4700 Length: 415 # 98.8 6.3E-10 3.9E-13 71.0 15.6 229 1-331 116-350 (415)
59 protein:vir:4600 Length: 415 # 98.8 6.3E-10 3.9E-13 71.0 15.6 229 1-331 116-350 (415)
60 protein:vir:105038 Length: 428 98.8 2.3E-09 1.4E-12 68.0 18.3 234 1-331 113-361 (428)
61 protein:vir:4997 Length: 397 # 98.8 2.2E-09 1.3E-12 68.1 17.7 208 1-331 109-326 (397)
62 protein:vir:101650 Length: 497 98.8 3.2E-10 2E-13 72.7 12.9 275 1-331 151-435 (497)
63 protein:vir:7855 Length: 497 # 98.8 3.2E-10 2E-13 72.7 12.9 275 1-331 151-435 (497)
64 protein:vir:4830 Length: 397 # 98.8 3.4E-09 2.1E-12 67.0 18.2 210 1-331 109-326 (397)
65 protein:vir:4953 Length: 397 # 98.8 3.8E-09 2.3E-12 66.8 18.1 210 1-331 109-326 (397)
66 protein:vir:6242 Length: 390 # 98.8 2.6E-09 1.6E-12 67.7 17.2 227 1-331 97-333 (390)
67 protein:vir:5739 Length: 366 # 98.7 3.3E-09 2.1E-12 67.1 17.3 233 1-331 52-299 (366)
68 protein:vir:1268 Length: 397 # 98.7 2.4E-09 1.5E-12 67.8 15.7 208 1-331 123-338 (397)
69 protein:vir:81227 Length: 413 98.7 2E-09 1.3E-12 68.2 15.2 229 1-331 110-353 (413)
70 protein:vir:9410 Length: 415 # 98.7 5.7E-09 3.5E-12 65.8 17.4 230 1-331 110-350 (415)
71 protein:vir:81160 Length: 371 98.7 8.9E-09 5.5E-12 64.7 17.9 207 1-331 91-312 (371)
72 protein:vir:3991 Length: 404 # 98.7 1.6E-08 9.7E-12 63.4 18.8 217 1-331 98-334 (404)
73 protein:vir:3158 Length: 321 # 98.6 2.1E-09 1.3E-12 68.2 13.1 235 1-331 1-254 (321)
74 protein:vir:7409 Length: 408 # 98.6 1.8E-08 1.1E-11 63.0 18.1 216 1-331 98-334 (408)
75 protein:vir:1025 Length: 408 # 98.6 3.6E-08 2.2E-11 61.4 18.3 216 1-331 100-334 (408)
76 protein:vir:4856 Length: 293 # 98.6 1.8E-08 1.1E-11 63.1 16.5 209 1-331 5-222 (293)
77 protein:vir:101607 Length: 379 98.6 9.2E-09 5.7E-12 64.6 14.8 218 1-331 101-322 (379)
78 protein:vir:4092 Length: 390 # 98.5 2.9E-08 1.8E-11 61.9 17.2 231 1-331 64-312 (390)
79 protein:vir:3845 Length: 395 # 98.5 5.8E-08 3.6E-11 60.3 18.0 213 1-331 105-324 (395)
80 protein:vir:4511 Length: 409 # 98.5 5.9E-08 3.6E-11 60.2 17.9 233 1-331 93-350 (409)
81 protein:vir:8420 Length: 477 # 98.5 6.6E-08 4.1E-11 60.0 17.0 239 1-331 145-415 (477)
82 protein:vir:93616 Length: 645 98.4 5.3E-08 3.3E-11 60.5 16.4 230 1-331 331-569 (645)
83 protein:vir:9704 Length: 394 # 98.3 9.1E-08 5.7E-11 59.2 15.5 211 1-331 121-336 (394)
84 protein:vir:96762 Length: 632 98.3 4.7E-07 2.9E-10 55.3 18.4 219 1-331 347-579 (632)
85 protein:vir:102082 Length: 392 98.3 4.2E-07 2.6E-10 55.5 17.2 210 1-331 106-325 (392)
86 protein:vir:107593 Length: 392 98.3 4.2E-07 2.6E-10 55.5 17.2 210 1-331 106-325 (392)
87 protein:vir:102873 Length: 392 98.3 4.2E-07 2.6E-10 55.5 17.2 210 1-331 106-325 (392)
88 protein:vir:105004 Length: 392 98.3 4.2E-07 2.6E-10 55.5 17.2 210 1-331 106-325 (392)
89 protein:vir:9643 Length: 377 # 98.2 2.7E-07 1.7E-10 56.6 15.3 244 1-331 59-352 (377)
90 protein:vir:1383 Length: 421 # 98.2 5.7E-07 3.5E-10 54.8 16.9 213 1-331 104-326 (421)
91 protein:vir:80128 Length: 466 98.2 1.6E-07 9.9E-11 57.8 13.5 250 1-331 123-393 (466)
92 protein:vir:100172 Length: 394 98.2 9.3E-07 5.8E-10 53.6 17.3 214 1-331 103-330 (394)
93 protein:vir:1084 Length: 437 # 98.2 3.9E-07 2.4E-10 55.7 15.1 215 1-331 141-372 (437)
94 protein:vir:95376 Length: 425 98.1 1.3E-06 7.9E-10 52.9 16.8 226 1-331 123-365 (425)
95 protein:vir:100884 Length: 389 98.1 1.6E-06 9.9E-10 52.4 16.8 211 1-331 109-328 (389)
96 protein:vir:3870 Length: 400 # 98.1 1.6E-06 9.7E-10 52.4 16.7 211 1-331 120-345 (400)
97 protein:vir:78640 Length: 352 98.0 1.6E-06 9.9E-10 52.4 15.3 215 1-331 64-293 (352)
98 protein:vir:9509 Length: 381 # 97.9 1.3E-06 8E-10 52.9 13.4 236 1-331 57-343 (381)
99 protein:vir:101291 Length: 381 97.9 1.3E-06 8E-10 52.9 13.4 236 1-331 57-343 (381)
100 protein:vir:98635 Length: 377 97.9 4E-07 2.5E-10 55.7 10.0 248 1-331 59-352 (377)
101 protein:vir:93881 Length: 387 97.8 3.7E-06 2.3E-09 50.4 15.0 215 1-331 100-329 (387)
102 protein:vir:9361 Length: 402 # 97.8 3E-06 1.9E-09 50.9 14.1 214 1-331 115-343 (402)
103 protein:vir:94424 Length: 387 97.7 7.3E-06 4.5E-09 48.7 15.4 213 1-331 104-328 (387)
104 protein:vir:2685 Length: 387 # 97.7 7.3E-06 4.5E-09 48.7 15.4 213 1-331 104-328 (387)
105 protein:vir:96978 Length: 387 97.7 7.3E-06 4.5E-09 48.7 15.4 213 1-331 104-328 (387)
106 protein:vir:95603 Length: 463 97.7 2.1E-07 1.3E-10 57.2 6.3 292 1-331 1-340 (463)
107 protein:vir:99311 Length: 463 97.7 2.1E-07 1.3E-10 57.2 6.3 292 1-331 1-340 (463)
108 protein:vir:962 Length: 397 # 97.7 4E-06 2.5E-09 50.2 13.0 210 1-331 127-343 (397)
109 protein:vir:95963 Length: 395 97.7 6.1E-06 3.8E-09 49.2 13.8 230 1-331 67-320 (395)
110 protein:vir:9820 Length: 272 # 97.6 8.5E-06 5.2E-09 48.4 14.4 210 1-331 1-217 (272)
111 protein:vir:3033 Length: 272 # 97.6 8.5E-06 5.2E-09 48.4 14.4 210 1-331 1-217 (272)
112 protein:vir:96666 Length: 462 97.6 3E-06 1.9E-09 50.8 11.1 233 1-331 1-283 (462)
113 protein:vir:102823 Length: 470 97.0 5.5E-06 3.4E-09 49.4 6.9 286 1-331 1-332 (470)
114 protein:vir:100632 Length: 381 97.0 9.9E-05 6.1E-08 42.5 13.7 245 1-255 57-381 (381)
115 protein:vir:80835 Length: 464 96.8 1E-05 6.2E-09 48.0 6.5 295 1-331 1-341 (464)
116 protein:vir:100851 Length: 514 96.7 4.8E-06 3E-09 49.8 4.6 263 1-331 36-336 (514)
117 protein:vir:97397 Length: 517 96.7 0.00025 1.6E-07 40.3 13.7 224 1-331 237-500 (517)
118 protein:vir:78350 Length: 383 96.7 0.00044 2.7E-07 39.0 17.0 240 1-252 64-383 (383)
119 protein:vir:97255 Length: 310 95.8 0.00019 1.2E-07 41.0 8.5 154 170-331 1-238 (310)
120 protein:vir:80491 Length: 467 95.6 0.00014 8.7E-08 41.7 7.1 278 1-331 1-328 (467)
121 protein:vir:63741 Length: 468 95.6 0.00014 8.4E-08 41.8 6.9 278 1-331 1-329 (468)
122 protein:vir:8843 Length: 317 # 95.4 0.0011 7E-07 36.7 11.4 245 1-331 1-264 (317)
123 protein:vir:99424 Length: 360 94.3 0.0031 1.9E-06 34.4 11.0 252 1-331 15-298 (360)
124 protein:vir:94933 Length: 330 93.9 0.00012 7.7E-08 42.0 2.6 219 68-331 1-265 (330)
125 protein:vir:93742 Length: 274 93.1 0.0089 5.5E-06 31.8 15.2 213 1-331 1-218 (274)
126 protein:vir:96123 Length: 274 89.9 0.024 1.5E-05 29.5 15.4 211 1-331 1-218 (274)
127 protein:vir:3613 Length: 272 # 81.9 0.082 5.1E-05 26.5 13.9 210 1-331 1-219 (272)
128 protein:vir:4074 Length: 480 # 79.3 0.11 6.6E-05 25.9 10.2 218 1-331 171-425 (480)
129 protein:vir:96833 Length: 275 76.8 0.13 8.2E-05 25.4 15.6 212 1-331 1-219 (275)
130 protein:vir:1239 Length: 274 # 76.2 0.14 8.6E-05 25.3 15.9 210 1-331 1-218 (274)
131 protein:vir:97433 Length: 274 73.4 0.17 0.00011 24.8 16.1 211 1-331 1-218 (274)
132
protein:vir:94494 Length: 274 73.4 0.17 0.00011 24.8 16.1 211 1-331 1-218 (274) 133 protein:vir:105334 Length: 276 69.0 0.23 0.00014 24.1 14.7 213 1-331 1-218 (276) 134 protein:vir:80930 Length: 278 46.3 0.74 0.00046 21.3 14.8 217 1-331 1-225 (278) 135 protein:vir:739 Length: 231 # 36.6 1.2 0.00072 20.2 11.5 171 45-331 1-178 (231) 136 protein:vir:95898 Length: 274 36.5 1.2 0.00073 20.2 15.3 206 1-331 1-218 (274) 137 protein:vir:96262 Length: 274 36.5 1.2 0.00073 20.2 15.3 206 1-331 1-218 (274) No 1 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=100.00 E-value=4.1e-153 Score=856.01 Aligned_cols=331 Identities=100% Similarity=1.440 Sum_probs=328.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) ||+|.+++|||+|+||+++++++++++|||+|+|+||||++|||+|||++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 99999999999999999999999988999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) 
+|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||+|||++.+++++.|+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) |||||||+++||||+|+||++++|||||||+++||+|+|+|+++++|++|++|+||++||+|++||+|+|||||+||||| T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:10 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) |+|+|++++++++||++||++|+++||+.++|+++|||||+|+++||+|+++|.|+++++++|++|++||+|+|||||+| T Consensus 241 dvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~ 320 (331) T protein:vir:10 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) T ss_pred chhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe Confidence 
99999999999999999999999999999999999999999999999999999999899999999999999999999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) |||++||++|| T Consensus 321 dai~~tE~~Vv 331 (331) T protein:vir:10 321 DALLLTEARVV 331 (331) T ss_pred eeeecCccccC Confidence 99999999999 No 2 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=100.00 E-value=4.1e-153 Score=856.01 Aligned_cols=331 Identities=100% Similarity=1.440 Sum_probs=328.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) ||+|.+++|||+|+||+++++++++++|||+|+|+||||++|||+|||++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 99999999999999999999999988999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) +|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||+|||++.+++++.|+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:98 81 
VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) |||||||+++||||+|+||++++|||||||+++||+|+|+|+++++|++|++|+||++||+|++||+|+|||||+||||| T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:98 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) |+|+|++++++++||++||++|+++||+.++|+++|||||+|+++||+|+++|.|+++++++|++|++||+|+|||||+| T Consensus 241 dvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~ 320 (331) T protein:vir:98 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) T ss_pred chhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe Confidence 99999999999999999999999999999999999999999999999999999999899999999999999999999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) |||++||++|| T Consensus 321 
dai~~tE~~Vv 331 (331) T protein:vir:98 321 DALLLTEARVV 331 (331) T ss_pred eeeecCccccC Confidence 99999999999 No 3 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=100.00 E-value=4.1e-153 Score=856.01 Aligned_cols=331 Identities=100% Similarity=1.440 Sum_probs=328.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) ||+|.+++|||+|+||+++++++++++|||+|+|+||||++|||+|||++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 99999999999999999999999988999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) +|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||+|||++.+++++.|+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q 
ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) |||||||+++||||+|+||++++|||||||+++||+|+|+|+++++|++|++|+||++||+|++||+|+|||||+||||| T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:10 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) |+|+|++++++++||++||++|+++||+.++|+++|||||+|+++||+|+++|.|+++++++|++|++||+|+|||||+| T Consensus 241 dvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~ 320 (331) T protein:vir:10 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) T ss_pred chhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe Confidence 99999999999999999999999999999999999999999999999999999999899999999999999999999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) |||++||++|| T Consensus 321 dai~~tE~~Vv 331 (331) T protein:vir:10 321 DALLLTEARVV 331 (331) T ss_pred eeeecCccccC Confidence 99999999999 No 4 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: 
genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=100.00 E-value=8.4e-152 Score=848.84 Aligned_cols=328 Identities=65% Similarity=1.023 Sum_probs=321.0 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) ||+|++++|||+|+||++++|+++ ++|||+|+++||||++|||+|||++++|++++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~-~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~tt 79 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKV-DIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSST 79 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhH-HHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccceE Confidence 999999999999999999999886 5899999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) +|++++|+||||++||||+|++++||.++||++|+++|+|+|+|+|+++|||||++++|++|+||+|||+++++.++.|+ T Consensus 80 ~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qv 159 (330) T protein:vir:10 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) T ss_pred EEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhhe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeecc--CCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDA--AGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 161 
idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~--~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) |||||||+++||||+|+||++++|||||||+||||+|+|+|+++++|+ +||+|+||++||+|++||+|+|||||+||| T Consensus 160 IdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~ 239 (330) T protein:vir:10 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) T ss_pred eeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEe Confidence 999999999999999999999999999999999999999999999855 778999999999999999999999999999 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) |||+|+|.+++.+. +|++||++|++|||++++|+++|||||+|+++||+|+++|+|+ +++.+|++|++|++|+||||| T Consensus 240 NIdvs~l~~~~~~~-~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~-~l~~~~~~g~~~t~~~gipir 317 (330) T protein:vir:10 240 NIDVSDLATSANAQ-ALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIAN-NLTWETVSGERVMTFDGIPVQ 317 (330) T ss_pred ecccccCCCCccHH-HHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccc-eeeeeecCCeeeEEECCeEEE Confidence 99999999987776 8999999999999999999999999999999999999999887 589999999999999999999 Q ss_pred EEEeccCCCcccC Q lcl|Aclame:pro 319 RTDALLLTEARVV 331 (331) Q Consensus 319 ~~dai~~tE~~Vv 331 (331) |||||||||++|| T Consensus 318 ~~Dail~tE~~vv 330 (330) T protein:vir:10 318 RTDALLNTESRVV 330 (330) T ss_pred EEeeeecCccccC Confidence 9999999999999 No 5 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=100.00 E-value=1.4e-151 Score=847.56 Aligned_cols=329 Identities=47% Similarity=0.816 Sum_probs=322.5 Q ss_pred 
CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) ||+|++++|||+|+||++++|+++ +.|||+|+++||||++|||+|||++++|++++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~-~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~tt 79 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRI-ARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQT 79 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhH-HHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccceE Confidence 999999999999999999999876 5799999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc---cccc Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS---AENG 157 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~---~~~~ 157 (331) +|++++|+||||++||||+|++++||.++||++|+++|+|||+|+|+++|||||++++|++|+||+|||++++ +.++ T Consensus 80 ~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a 159 (335) T protein:vir:73 80 VPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASA 159 (335) T ss_pred EEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCcc Confidence 9999999999999999999999999999999999999999999999999999999999999999999998877 5568 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) .|+|||||||+++||||+|+||++++|||||||++|||+|+|+|+++++|++|++|+||++||+|++||+|+|||||+|| T Consensus 160 
~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI 239 (335) T protein:vir:73 160 ENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRI 239 (335) T ss_pred cceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccccCCCCccchhhHHHHHHHHH--HHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAV--ELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~--~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv 315 (331) ||||+|+|+++++++++|++||++|+ ++||++++|+++|||||+|+++||+|+++|+|+ +++.+|++|+++++|+|| T Consensus 240 ~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~-~l~~~~~~g~~~t~~~gi 318 (335) T protein:vir:73 240 CNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNV-NLTIEEYGGKKIVSFLGI 318 (335) T ss_pred eecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCce-eeeeeccCCceeEEECCe Confidence 99999999999999999999999998 589999999999999999999999999999987 599999999999999999 Q ss_pred EEEEEEeccCCCcccC Q lcl|Aclame:pro 316 PCRRTDALLLTEARVV 331 (331) Q Consensus 316 pir~~dai~~tE~~Vv 331 (331) ||||||||||||++|| T Consensus 319 pir~~Dail~tE~~v~ 334 (335) T protein:vir:73 319 PIRRVDAILNTESAVT 334 (335) T ss_pred EEEEEeeeecCccccc Confidence 9999999999999999 No 6 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=100.00 E-value=4.1e-149 Score=834.08 Aligned_cols=328 Identities=61% Similarity=0.977 Sum_probs=322.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 
M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) ||+|++++|||+|+||++++|++ .++|||+|+++||||++|||+|+|++++|.|++|++||+++||++|+||+++++++ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~-~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~tt 79 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGK-VDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKSTT 79 (328) T ss_pred CCccccccccHHHHHhhhCcchh-HHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCccccee Confidence 99999999999999999988875 57899999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) +|++++|+||||++||||+|++++||.++|||+|+++|+|+|+++|+++|||||++++|++|+||++||++.+++++.|+ T Consensus 80 ~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qi 159 (328) T protein:vir:95 80 VQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNI 159 (328) T ss_pred EEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) |||||||+++||||+|+||++++|||||||+++||+|+|+|+++++|++|++|+||++||+|++||+|+|||||+||||| T Consensus 160 idaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NI 239 (328) T protein:vir:95 160 IDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANI 239 (328) T ss_pred 
eecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) |+|+|++++. +.+|++||++|+++||++++|+++|||||+|+++||+|+++|+|++ ++++|++|+++|+|+|||||+| T Consensus 240 d~~~l~~~~~-~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~-~~~~~~~g~~~t~~~gipir~~ 317 (328) T protein:vir:95 240 DVSNLSEPSS-AANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLA-ISVKETEGEWWTSFRGVPIRET 317 (328) T ss_pred cccccccccC-hhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCccee-eeeeccCCcceeEECCeEEEEE Confidence 9999998864 8899999999999999999999999999999999999999999985 9999999999999999999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) |||++||++|| T Consensus 318 dai~~tE~~vv 328 (328) T protein:vir:95 318 DALLETEARVV 328 (328) T ss_pred eeeecCccccC Confidence 99999999999 No 7 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=100.00 E-value=2.6e-59 Score=341.77 Aligned_cols=224 Identities=17% Similarity=0.222 Sum_probs=198.1 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccc-e Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS-R 79 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~-t 79 (331) .|.|++.++||+|++| +.+| .+..+|||+|+++|+||++|||++++++ .|.|++++++|+++||++|++|+++++ | T Consensus 20 
~p~l~m~alTLaea~~-l~~d-~~~~~VIE~l~~~s~iL~~lpf~~ve~~-~~~~~r~~~lp~a~~r~~n~~~~~~~~~T 96 (330) T protein:vir:94 20 FPELKMPTVTLAESAK-LSQD-HLVSGLIETIVEVNPLYEMMPFTEIEGN-ALAYNRENVLGDVQFLAVGGTITAKNPAT 96 (330) T ss_pred ccccchhhhhhhHHhh-cCch-hhHHHHHHhhhccchHHhhcccccccCC-cceeeeeecCCcceeeeccccccccCcce Confidence 7999999999999765 5555 4568899999999999999999998866 688999999999999999999999875 6 Q ss_pred EEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 80 ~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) ++|++++|++++|+++||++|++++|++.++|++|.++|+|+|+++|+++|||||++. ++|+||.+|+. +.| T Consensus 97 f~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~--~~F~GL~~~~~------~~q 168 (330) T protein:vir:94 97 FTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTG--NSFQGMMGLVA------ASQ 168 (330) T ss_pred eeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC--ccccchhhcCC------ccc Confidence 7999999999999999999999999999999999999999999999999999999874 58999999983 344 Q ss_pred eeccCCCC------------------------------------------------------------------------ Q lcl|Aclame:pro 160 IIDAGGTG------------------------------------------------------------------------ 167 (331) Q Consensus 160 vidaGgtG------------------------------------------------------------------------ 167 (331) +||+|++| T Consensus 169 ~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ 248 (330) T protein:vir:94 169 TISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDF 248 (330) T ss_pred EEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEeccc Confidence 44444443 Q ss_pred ----------CCceEEEEEEeCCC----cEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 168 
----------SDNASIWLTVWGPN----TLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 168 ----------~~~tSI~~V~~g~~----~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) ++.||||+|+||++ .|||++++| ..||+++++|+.+.. .+++.+++||||++|.+.++ T Consensus 249 ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g-~~glsVr~~G~~~~k-------~v~~~~v~~y~~~av~~~~a 320 (330) T protein:vir:94 249 IPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARG-SAGLRVQNVGAKENA-------DETITRVKMYCGFANFSQLG 320 (330) T ss_pred ccCCCCcccCCCceeEEEEeecccccccceEeecCCC-CCcceeeeCCCcccc-------ceeeEEEEEeeeeEEechhh Confidence 34699999999975 899999887 579999999986655 45788999999999999999 Q ss_pred eeeeeccccc Q lcl|Aclame:pro 234 VVRIANVDVS 243 (331) Q Consensus 234 v~RI~NId~s 243 (331) ++++.||.+- T Consensus 321 ~~~L~~V~~g 330 (330) T protein:vir:94 321 LAAIKGLIPG 330 (330) T ss_pred eeeeccccCC Confidence 9999999875 No 8 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=100.00 E-value=7.4e-53 Score=306.37 Aligned_cols=218 Identities=17% Similarity=0.194 Sum_probs=187.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecC-----CccCc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLN-----YGVQP 75 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN-----~g~~~ 75 (331) || ++||+|++| +. 
++.+..+|||+|+++|+||++|||++++++ .|.|++++++|+++||++| +|+++ T Consensus 1 mp-----altLaea~k-~~-~d~l~~~ViE~~~~~s~lL~~LpF~~veg~-~~~ynR~~~~~~~~~~~v~~~~~~~g~~~ 72 (310) T protein:vir:97 1 MA-----SVTLAESAK-LA-QDELVAGVIENIITVNRMFDVLPFDSIEGN-SLAYNRENVLGDVIMAGVGTTFSGAGAGK 72 (310) T ss_pred Cc-----ccchHHHhh-cC-cchHHHHHHHHHhccchHHHhCCcccccCC-cceeeEeeccCCcccccccccccCCCccc Confidence 76 569999986 44 455678999999999999999999998855 6999999999999999875 77789 Q ss_pred ccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccc Q lcl|Aclame:pro 76 EKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) Q Consensus 76 s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~ 154 (331) +++|++|++++|++++|+++||++|++++ ++..+++++|.++++|+++++|+++|||||++.+ +|+||.+|+.. T Consensus 73 ~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n--~F~GL~~~~~~--- 147 (310) T protein:vir:97 73 AAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGN--EFAGLIQLCAS--- 147 (310) T ss_pred cccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCC--cccchhhcCCc--- Confidence 99999999999999999999999999996 7799999999999999999999999999999987 59999999732 Q ss_pred cccceeeccC---------------------------------------------------------------------- Q lcl|Aclame:pro 155 ENGQNIIDAG---------------------------------------------------------------------- 164 (331) Q Consensus 155 ~~~~~vidaG---------------------------------------------------------------------- 164 (331) +|.||+| T Consensus 148 ---~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi 224 (310) T protein:vir:97 148 ---GQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPI 224 (310) T ss_pred ---cceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEE Confidence 2222222 Q ss_pred ------------CCCCCceEEEEEEeCCC----cEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 165 
------------GTGSDNASIWLTVWGPN----TLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 165 ------------gtG~~~tSI~~V~~g~~----~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) +++++.||||+|++|++ .+||++..| +.||+++++|+.+.. .+++++++||||++| T Consensus 225 ~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~-~~glsVr~~G~~~~~-------~v~~~~V~~Y~~~av 296 (310) T protein:vir:97 225 FRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQ-AAGIQVVDVGESEDS-------DEHIWRVKWYCGLAL 296 (310) T ss_pred EEeCccCCCccccccCCceeEEEEeeCccccccceeccccCC-ccceeEEeCCcccCC-------cceeEEEEEeeeEEE Confidence 33455899999999985 688888533 679999999986655 356888999999999 Q ss_pred ecccceeeeecccc Q lcl|Aclame:pro 229 RDWRYVVRIANVDV 242 (331) Q Consensus 229 ~d~r~v~RI~NId~ 242 (331) .++++++++.||-. T Consensus 297 ~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 297 FSEKGLACADGITN 310 (310) T ss_pred ecccceeeeccccC Confidence 99999999999853 No 9 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.31 E-value=8.3e-13 Score=86.81 Aligned_cols=230 Identities=11% Similarity=0.050 Sum_probs=155.1 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) |++..++..++.+- +...|||.+.+.|+|++..+.+...++ ...+.+.++-|+++|..=++.+++++.++ T Consensus 1 mat~~~gg~lvP~~---------~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f 70 (311) T protein:vir:81 1 MVALATGTFQLPKH---------LVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQKSESTATF 70 (311) T ss_pred CceecCCceEcchh---------HHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccccccccee Confidence 99887766544432 245799999999999999998876544 47788889999999999999999999999 Q ss_pred 
EEEEEEEEEecchhhhhHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGN-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn-~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) .+++-..+-+++.+.|-+.+.+...+ ..++...-.....+++++.+...|+||+.+..+..+.|+-.... .+.+ T Consensus 71 ~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~-----~~~~ 145 (311) T protein:vir:81 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKIL-----DTTN 145 (311) T ss_pred eEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccccccccc-----ccce Confidence 99999999999999999988876654 33455555566889999999999999975444444444422110 0000 Q ss_pred eeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeec Q lcl|Aclame:pro 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) Q Consensus 160 vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~N 239 (331) .+.. T Consensus 146 ~~~~---------------------------------------------------------------------------- 149 (311) T protein:vir:81 146 IVEL---------------------------------------------------------------------------- 149 (311) T ss_pred eeee---------------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEE Q lcl|Aclame:pro 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) Q Consensus 240 Id~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~ 319 (331) .+.+......++..++.++......+..|.||.+...+|+.. .+..+.. +-+....+...-.+.|.||.. 
T Consensus 150 --------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~-l~~~~~~~~~~~tl~G~Pv~~ 219 (311) T protein:vir:81 150 --------TTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQGRK-LYPELGFGTDVASFAGLNAAV 219 (311) T ss_pred --------cccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhh-hccCCCe-eecCccccCCCceecceeEEe Confidence 000001122334445555544444445699999999999865 4444443 223334445567799999999 Q ss_pred EEeccCCCcc----------------cC Q lcl|Aclame:pro 320 TDALLLTEAR----------------VV 331 (331) Q Consensus 320 ~dai~~tE~~----------------Vv 331 (331) +++|...... ++ T Consensus 220 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 247 (311) T protein:vir:81 220 SDTVRGGPEAVTASTGVYRTTNPNVKAI 247 (311) T ss_pred cccccccccccccccchhcccCCccEEE Confidence 9988643311 11 No 10 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.29 E-value=1.4e-12 Score=85.51 Aligned_cols=236 Identities=9% Similarity=-0.014 Sum_probs=153.2 Q ss_pred CCcCccccccHHHHHHh-------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAAR-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGV 73 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~-------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~ 73 (331) |+.-..-.+.-...+.. +-+. 
.+...||+.+.+.++|++.++.+...++ .+.+.+..+-|.++|..=++.+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEP-EQAKDYFAEAEKTSIVQQFAQKVPMGTT-GQKIPHWIGDVSAQWIGEGDMK 78 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccH-HHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEeCCcceEEecCCccc Confidence 44311111112222111 1121 2456799999999999999999886544 5778888899999999999999 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc Q lcl|Aclame:pro 74 QPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) Q Consensus 74 ~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~ 153 (331) ++++.++.+++-.++-+++.+.|.+.+.+... .++...-.+...+++++.+..+|++|+-+..|..+.|+.+ T Consensus 79 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~------ 150 (320) T protein:vir:10 79 PITKGNMTSQNIAPHKIATIFVASAETVRANP--ANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTK------ 150 (320) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcCh--HHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccc------ Confidence 99999999999999999999999998877543 1233333345779999999999999985443322222211 Q ss_pred ccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 154 ~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) + T Consensus 151 -------------~------------------------------------------------------------------ 151 (320) T protein:vir:10 151 -------------S------------------------------------------------------------------ 151 (320) T ss_pred -------------c------------------------------------------------------------------ Confidence 0 Q ss_pred eeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCc----ee Q lcl|Aclame:pro 234 
VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGK----KV 309 (331) Q Consensus 234 v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~----~v 309 (331) +++-..... .+..-.++.+++++++..+++....+.+|+||++....|+.. .++.....+.+.-..|. .- T Consensus 152 ----~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~~~~~~~~~~ 225 (320) T protein:vir:10 152 ----VSLADPGGA-TASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGA-KDKNGRPLFIESTYTDENSPFRA 225 (320) T ss_pred ----ccceecccc-cccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHh-hccCCceeeccccccCccccccC Confidence 000000000 011112345668888888988888899999999999999854 44444332221111111 11 Q ss_pred EEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 310 VAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 310 ~~~~gvpir~~dai~~tE~~Vv 331 (331) ..+.|+||..++++-..+..++ T Consensus 226 ~~i~g~pv~~~~~~~~~~~~~~ 247 (320) T protein:vir:10 226 GRIVSRPTILSDHVADGTTVGY 247 (320) T ss_pred ceeeeeeeEecCCCCCCceEEE Confidence 2578999999999876665543 No 11 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.23 E-value=1.2e-12 Score=86.03 Aligned_cols=225 Identities=15% Similarity=0.170 Sum_probs=148.0 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) +........+..+.+..+-+. 
.+...|++.+.+.++|++.++.+...++ .+++.++++.|.+.|.+=++..++++.++ T Consensus 21 ~~~~~a~~~~~~~~~~~lip~-~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f 98 (324) T protein:vir:96 21 PQVFNPDNVMMHEKKDGTLLN-DFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) T ss_pred hhhcccccccccCCCcceech-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeeecCCccccccccce Confidence 111111111111111112233 2446799999999999999999887654 57899999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) .+++-..+-+++.+.|.+.+.+... .++...-.....+++++.+...+|+|+.+.. + .. T Consensus 99 ~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~---~--------------~~-- 157 (324) T protein:vir:96 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP---F--------------GK-- 157 (324) T ss_pred eEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCC---c--------------Cc-- Confidence 9999999999999999998777643 2333444455789999999999999974321 0 00 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) ++... .. ... +. 
T Consensus 158 ------------------------~~~~~-~~---------~~~----------------------------------~~ 169 (324) T protein:vir:96 158 ------------------------SIAQS-IK---------KTN----------------------------------KV 169 (324) T ss_pred ------------------------ccccc-cc---------ccc----------------------------------ee Confidence 00000 00 000 00 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) ..+....+.|.+++.+++.....+..|+||++.+..|+.. .+......+ . +...-.+.|+||..+ T Consensus 170 ---------~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~l-kd~~G~~~~-~----~~~~~~l~G~PV~~~ 234 (324) T protein:vir:96 170 ---------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERI-Y----DRNSDSLDGLPVVNL 234 (324) T ss_pred ---------cccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hCCCCCeee-c----CCCCCcccceeeEee Confidence 0001113446677777776667778999999999999965 455444322 1 122345899999998 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) .+...++..++ T Consensus 235 ~~~~~~~~~~~ 245 (324) T protein:vir:96 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 88877776665 No 12 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.22 E-value=4.9e-13 Score=88.05 Aligned_cols=229 Identities=10% Similarity=0.026 Sum_probs=148.5 Q ss_pred CCcC--ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceeecCCccCccc Q lcl|Aclame:pro 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 M~~l--~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~lN~g~~~s~ 77 (331) +... 
.....+-.+.+-.+.+. .+...||+.+.+.++|++.++.....++ .+.+.+.++ -+.+.|..=++.+++++ T Consensus 96 ~~~~~~~~~~~~~~~~~g~~i~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~ 173 (385) T protein:vir:18 96 FGAKTFNKSLGSDADSAGSLIQP-MQIPGIIMPGLRRLTIRDLLAQGRTSSN-ALEYVREEVFTNNADVVAEKALKPESD 173 (385) T ss_pred chhhHHHhhhccccccCCceecc-hhhhHHHHHhhhccchhhhcceecccCc-ceEEEEEecCCcceeeeccCccccccc Confidence 0000 00000000101011121 2346799999999999999999876544 577887765 57889999999999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|.+.+.+-.+ .+...-.....++++..+...|++||.+.+| +.||..- T Consensus 174 ~~~~~~~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~~--------- 239 (385) T protein:vir:18 174 ITFSKQTANVKTIAHWVQASRQVMDDAP---MLQSYINNRLMYGLALKEEGQLLNGDGTGDN--LEGLNKV--------- 239 (385) T ss_pred cceeEEEEeeeeEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--ccccccc--------- Confidence 9999999999999999999999877543 3444455668899999999999999744331 3333110 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) T Consensus 240 -------------------------------------------------------------------------------- 239 (385) T protein:vir:18 240 -------------------------------------------------------------------------------- 239 (385) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 
ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) ++..... ...++....+.+++++..+......+.+|+||++....|+.. .+..+...+ + +..+.....+.|+|| T Consensus 240 ~~~~~~~---~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l-kd~~G~~l~-~-~~~~~~~~~l~G~pV 313 (385) T protein:vir:18 240 ATAYDTS---LNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALL-KDNEGRYIF-G-GPQAFTSNIMWGLPV 313 (385) T ss_pred ccccccc---ccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hcCCCceec-c-CcccCCCceecceee Confidence 0000000 111223346778888888887778889999999999999865 344443322 2 222223346789999 Q ss_pred EEEEeccCCCcccC Q lcl|Aclame:pro 318 RRTDALLLTEARVV 331 (331) Q Consensus 318 r~~dai~~tE~~Vv 331 (331) ..++.+-.++..+. T Consensus 314 ~~~~~~p~~~~~~g 327 (385) T protein:vir:18 314 VPTKAQAAGTFTVG 327 (385) T ss_pred EEcCcCCCCcEEEe Confidence 99999865432222 No 13 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.22 E-value=4.9e-13 Score=88.05 Aligned_cols=229 Identities=10% Similarity=0.026 Sum_probs=148.5 Q ss_pred CCcC--ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceeecCCccCccc Q lcl|Aclame:pro 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 M~~l--~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~lN~g~~~s~ 77 (331) +... .....+-.+.+-.+.+. 
.+...||+.+.+.++|++.++.....++ .+.+.+.++ -+.+.|..=++.+++++ T Consensus 96 ~~~~~~~~~~~~~~~~~g~~i~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~ 173 (385) T protein:vir:19 96 FGAKTFNKSLGSDADSAGSLIQP-MQIPGIIMPGLRRLTIRDLLAQGRTSSN-ALEYVREEVFTNNADVVAEKALKPESD 173 (385) T ss_pred chhhHHHhhhccccccCCceecc-hhhhHHHHHhhhccchhhhcceecccCc-ceEEEEEecCCcceeeeccCccccccc Confidence 0000 00000000101011121 2346799999999999999999876544 577887765 57889999999999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|.+.+.+-.+ .+...-.....++++..+...|++||.+.+| +.||..- T Consensus 174 ~~~~~~~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~~--------- 239 (385) T protein:vir:19 174 ITFSKQTANVKTIAHWVQASRQVMDDAP---MLQSYINNRLMYGLALKEEGQLLNGDGTGDN--LEGLNKV--------- 239 (385) T ss_pred cceeEEEEeeeeEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--ccccccc--------- Confidence 9999999999999999999999877543 3444455668899999999999999744331 3333110 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) T Consensus 240 -------------------------------------------------------------------------------- 239 (385) T protein:vir:19 240 -------------------------------------------------------------------------------- 239 (385) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 
ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) ++..... ...++....+.+++++..+......+.+|+||++....|+.. .+..+...+ + +..+.....+.|+|| T Consensus 240 ~~~~~~~---~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l-kd~~G~~l~-~-~~~~~~~~~l~G~pV 313 (385) T protein:vir:19 240 ATAYDTS---LNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALL-KDNEGRYIF-G-GPQAFTSNIMWGLPV 313 (385) T ss_pred ccccccc---ccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hcCCCceec-c-CcccCCCceecceee Confidence 0000000 111223346778888888887778889999999999999865 344443322 2 222223346789999 Q ss_pred EEEEeccCCCcccC Q lcl|Aclame:pro 318 RRTDALLLTEARVV 331 (331) Q Consensus 318 r~~dai~~tE~~Vv 331 (331) ..++.+-.++..+. T Consensus 314 ~~~~~~p~~~~~~g 327 (385) T protein:vir:19 314 VPTKAQAAGTFTVG 327 (385) T ss_pred EEcCcCCCCcEEEe Confidence 99999865432222 No 14 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.21 E-value=1e-12 Score=86.28 Aligned_cols=229 Identities=11% Similarity=0.071 Sum_probs=149.2 Q ss_pred CCcCccccccHHH-----------HHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLAD-----------VAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E-----------~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~ 68 (331) +--.....+.+.. .+..+.+. .+...||+.+.+.++|++.+++....++ .+.+.++++ -|.+.|.. 
T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~-~~~~~ii~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:97 96 NDRSARATMNIKAALNTASTDAAGSAGALTTP-NRLPGFITPPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVA 173 (390) T ss_pred hhhhhhhhhHHHHHHHhhhcccccccccccch-hhhHHHHHHHhhhhhhHhhcceeeccCC-ceEEEEEecCCcceeeec Confidence 0000000000000 01111122 2346799999999999999999887654 467887766 47889999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 69 LNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 69 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) =++.++++..++.+++-..+-+++.+.|.+.+.+... ++...-.....+++++++.+.||+||...+ .+.||-.- T Consensus 174 Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~---~l~~~i~~~la~a~~~~~d~a~l~G~g~~~--~p~Gi~~~ 248 (390) T protein:vir:97 174 EGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP---QLASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIPQ 248 (390) T ss_pred CCccccccccceeEEEEeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCCc--cccceeec Confidence 9999999999999999999999999999999887544 344444556899999999999999963321 23333110 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) ++.+ T Consensus 249 --------------~~~~-------------------------------------------------------------- 252 (390) T protein:vir:97 249 --------------ATTY-------------------------------------------------------------- 252 (390) T ss_pred --------------cccc-------------------------------------------------------------- Confidence 0000 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 
RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) ++ ..+.++....+.+++++..+......+..|+||++....|+.. .+......+. +...+ . T Consensus 253 ----------~~------~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~~-~~~~~-~ 313 (390) T protein:vir:97 253 ----------AA------PTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELA-KDANNQYLIG-NARGT-L 313 (390) T ss_pred ----------cc------cccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hcCCCceeec-CccCC-C Confidence 00 0122334456678888888887777788999999999999854 4555443322 22222 2 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) ...+.|+||..+|++-.++..+. T Consensus 314 ~~~l~G~pV~~~~~~~~~~~~~g 336 (390) T protein:vir:97 314 TPTLWGLPVVATQAMAPGEFLVG 336 (390) T ss_pred CceecceeeEEcCCCCCCcEEEE Confidence 24688999999999865542222 No 15 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.20 E-value=3.4e-12 Score=83.44 Aligned_cols=229 Identities=12% Similarity=0.091 Sum_probs=150.8 Q ss_pred CCcCc--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MPTLS--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) Q Consensus 1 M~~l~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 78 (331) |+.=- ....|..+...-+-+. .+...||+.+.+.++|++..+.....++ .+++.+.++-+.+.|..=|+.+++++. 
T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPA-EQGTLIMKDIMANSAIMKLAKNEPMTAQ-KKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecch-hHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEeCCcceEEeecCcccccccc Confidence 66421 1122222222223333 2456799999999999999988876543 477888899999999999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....|+|||-+..|.+- T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~---------------- 140 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTA--KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTST---------------- 140 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhheeccCCCccccc---------------- Confidence 999999999999999999998877654 22333344557899999999999999744321100 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) . ..++++. .. 
-+ T Consensus 141 ----------~-------------~~~~~~~-----~~----------------------------------------~~ 152 (304) T protein:vir:10 141 ----------S-------------GKPLVEG-----AE----------------------------------------EK 152 (304) T ss_pred ----------c-------------ccccccc-----cc----------------------------------------cc Confidence 0 0000000 00 00 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) .. ..+.+....+.+.+++.+++.....+..|+||++.+..|+.. .+....+.+. .. ...+.|+||. T Consensus 153 ~~-------~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-kd~~G~~l~~--~~----~~~l~G~PV~ 218 (304) T protein:vir:10 153 GN-------VVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNA-LDANDRPLFD--AN----GNEIMGLPLS 218 (304) T ss_pred cc-------ccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHh-hccCCcEeec--CC----CccccceeeE Confidence 11 111223345667788888887777788999999999999854 4554443221 11 2357899999 Q ss_pred EEEeccC--CCcccC Q lcl|Aclame:pro 319 RTDALLL--TEARVV 331 (331) Q Consensus 319 ~~dai~~--tE~~Vv 331 (331) .++++.. 
++..++ T Consensus 219 ~~~~~~~~~~~~~~~ 233 (304) T protein:vir:10 219 YTGADVYDKKKSLAL 233 (304) T ss_pred EecccccCCCCcEEE Confidence 9999843 233322 No 16 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.20 E-value=3.4e-12 Score=83.44 Aligned_cols=229 Identities=12% Similarity=0.091 Sum_probs=150.8 Q ss_pred CCcCc--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MPTLS--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) Q Consensus 1 M~~l~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 78 (331) |+.=- ....|..+...-+-+. .+...||+.+.+.++|++..+.....++ .+++.+.++-+.+.|..=|+.+++++. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPA-EQGTLIMKDIMANSAIMKLAKNEPMTAQ-KKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecch-hHHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEeCCcceEEeecCcccccccc Confidence 66421 1122222222223333 2456799999999999999988876543 477888899999999999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... 
-++...-.....+++++.....|+|||-+..|.+- T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~---------------- 140 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTA--KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTST---------------- 140 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhheeccCCCccccc---------------- Confidence 999999999999999999998877654 22333344557899999999999999744321100 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) . ..++++. .. -+ T Consensus 141 ----------~-------------~~~~~~~-----~~----------------------------------------~~ 152 (304) T protein:vir:94 141 ----------S-------------GKPLVEG-----AE----------------------------------------EK 152 (304) T ss_pred ----------c-------------ccccccc-----cc----------------------------------------cc Confidence 0 0000000 00 00 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) .. ..+.+....+.+.+++.+++.....+..|+||++.+..|+.. .+....+.+. .. ...+.|+||. T Consensus 153 ~~-------~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-kd~~G~~l~~--~~----~~~l~G~PV~ 218 (304) T protein:vir:94 153 GN-------VVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNA-LDANDRPLFD--AN----GNEIMGLPLS 218 (304) T ss_pred cc-------ccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHh-hccCCcEeec--CC----CccccceeeE Confidence 11 111223345667788888887777788999999999999854 4554443221 11 2357899999 Q ss_pred EEEeccC--CCcccC Q lcl|Aclame:pro 319 RTDALLL--TEARVV 331 (331) Q Consensus 319 ~~dai~~--tE~~Vv 331 (331) .++++.. 
++..++ T Consensus 219 ~~~~~~~~~~~~~~~ 233 (304) T protein:vir:94 219 YTGADVYDKKKSLAL 233 (304) T ss_pred EecccccCCCCcEEE Confidence 9999843 233322 No 17 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.19 E-value=3.6e-12 Score=83.30 Aligned_cols=239 Identities=11% Similarity=0.042 Sum_probs=152.8 Q ss_pred CCcCcc--ccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MPTLST--TNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) Q Consensus 1 M~~l~~--~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 78 (331) |+.--. ...+.-..+..+-+.. +...||+.+.+.++|++..+.....++ .+.+.+.++-|.+.|..=++.+++++. T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~-~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 78 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPE-QSQDYFAEIEKTSIVQRIARKVPMGPT-GISIPHWTGAVSASWTGEAERKPITKG 78 (330) T ss_pred CcccccchhhccccCCCcceechh-HHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEcCCcceeEecCCCccccccc Confidence 443211 1111111111222222 345699999999999999999886543 588999999999999999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-.++-+++.+.|.+.+.+... -++.+.-.....+++++++...||+||.+. ..+.|+.+-. 
T Consensus 79 ~f~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~--~~~~g~~~~~--------- 145 (330) T protein:vir:77 79 SFGKQELEPVKITTIFAESAEVVRLNP--LNYLNTMRTKIAEAIALKFDAAAIHGIDKP--SAFKGYLAET--------- 145 (330) T ss_pred eeeEEEEeEEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCCC--Cccccccccc--------- Confidence 999999999999999999998877643 233444445688999999999999998542 2334442210 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) ++.. . ..+.+ T Consensus 146 -------~~~~------------~----------------------~~~~~----------------------------- 155 (330) T protein:vir:77 146 -------TKVV------------S----------------------LADTN----------------------------- 155 (330) T ss_pred -------cccc------------e----------------------eeccc----------------------------- Confidence 0000 0 00000 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccC----CceeEEEcC Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIA----GKKVVAFDG 314 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~----g~~v~~~~g 314 (331) .+ +.+....++.+-|.+++.++......+..|+||++.+..|+.. ++......+.+.... +..-..+.| T Consensus 156 ~~------~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~~~~~~l~G 228 (330) T protein:vir:77 156 LT------TASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTA-VDGNGRPLFVESTYTEQVGAIREGRILG 228 (330) T ss_pred cc------ccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHH-hccCCceeecCccccccccccCCceecc Confidence 00 0112223345567777777777777778999999999999964 444443323222222 223346889 Q ss_pred eEEEEEEeccCCC----cccC Q lcl|Aclame:pro 315 IPCRRTDALLLTE----ARVV 331 (331) Q Consensus 315 vpir~~dai~~tE----~~Vv 331 (331) +||..++++.... 
..++ T Consensus 229 ~PV~~~~~~p~~~~~~~~~~~ 249 (330) T protein:vir:77 229 RPTYVADNVVNGTVGNRVVGV 249 (330) T ss_pred eeeEEeccccCCCCCCccEEE Confidence 9999999985422 2222 No 18 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.19 E-value=5.4e-12 Score=82.36 Aligned_cols=229 Identities=11% Similarity=0.017 Sum_probs=152.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) |++= .+. ++.+ .+...|||.+.+.|+|++.++.+...++ .+++.+.++-|.++|..=++.++++..++ T Consensus 1 ma~~-gG~-lip~---------~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f 68 (298) T protein:vir:94 1 MVLN-KGT-LFDP---------ELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTL 68 (298) T ss_pred Ceec-ccc-ccCh---------hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccccccccce Confidence 7762 222 2211 2345699999999999999998876543 57888899999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) .+++-..+-+++.+.|.+.+.+... +..++...-.....+++++.+..+++||... 
T Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~----------------------- 125 (298) T protein:vir:94 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNP----------------------- 125 (298) T ss_pred eEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------------------- Confidence 9999999999999999998876654 3456666666778999999999999999311 Q ss_pred eeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeec Q lcl|Aclame:pro 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) Q Consensus 160 vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~N 239 (331) ++|... ...++. + . ..-..| T Consensus 126 -----~~g~~~-----------~~~~~~--~--------------~----------------------------~~~~~~ 145 (298) T protein:vir:94 126 -----RLGTAS-----------AVIGTN--H--------------F----------------------------DSKVTQ 145 (298) T ss_pred -----CCCccc-----------cccccc--c--------------c----------------------------cccccc Confidence 111000 000000 0 0 000001 Q ss_pred cccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEE Q lcl|Aclame:pro 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) Q Consensus 240 Id~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~ 319 (331) +. ...++..+.++.+.+++.++......+..|+||.+....|+.. .+..+...+ .+...+...-.++|+||.. T Consensus 146 ~~-----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~l~-~~~~~~~~~~tl~G~PV~~ 218 (298) T protein:vir:94 146 KV-----EAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQ-KDLQGNALF-PELKWGATPDTINGLPVDV 218 (298) T ss_pred cc-----ccccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHh-hccCCCeee-cCcccCCCCceecceeeEE Confidence 10 0011223456678888888876666678899999999999875 444444322 2333333445789999999 Q ss_pred EEeccC----CCcccC Q lcl|Aclame:pro 320 TDALLL----TEARVV 331 (331) Q Consensus 320 ~dai~~----tE~~Vv 331 (331) ++++-. 
++..++ T Consensus 219 ~~~v~~~~~~~~~~~~ 234 (298) T protein:vir:94 219 NKTVSDMSLTQRDRAI 234 (298) T ss_pred ecccccccCCCccEEE Confidence 998842 222222 No 19 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.19 E-value=1.4e-12 Score=85.56 Aligned_cols=231 Identities=10% Similarity=0.043 Sum_probs=148.7 Q ss_pred CCcCccccccHHHH-HHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADV-AARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~-Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~lN~g~~~s~~ 78 (331) +..+...+.|.... +-.+-+. .+...||+.+.+.++|++.++.....++ .+.+.+.++ -|.++|..=++..++++. T Consensus 106 ~~~~~~~~~~~~~~~~g~~vp~-~~~~~ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 183 (395) T protein:vir:43 106 RVSMPRSAITSIDGSGGALVAP-DRRPGVVAAPQRRLTIRDLVAPGTTESN-SVEYVRETGFVNNAAPVSEGTQKPYSDL 183 (395) T ss_pred hhhhhhhhhcccCCCCccccch-hhHHHHHHHHHhhhhHHhhccceecCCC-ceEEEEEecCCCceeeecCCcccccccc Confidence 11000001000000 0001122 1346799999999999999999886543 577887766 478899999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-.++-+++.+.|.+.+.+-.++ +...-.....++++..+...||+|+...+| +.||-.- T Consensus 184 ~~~~i~~~~~k~~~~~~is~ell~d~~~---l~~~v~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~~---------- 248 (395) T protein:vir:43 184 TFELENAPVRTIAHLFKASRQILDDASA---LQSYIDARARYGLMLVEECQLLYGNGTGAN--LHGIIPQ---------- 248 (395) T ss_pred ceeEEEEeeeeEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHHhccCCCCc--ccccccc---------- Confidence 9999999999999999999998875443 
444445668899999999999999743321 3333110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) +-+. T Consensus 249 ----------------------------------------------------------------------------~~~~ 252 (395) T protein:vir:43 249 ----------------------------------------------------------------------------AQAY 252 (395) T ss_pred ----------------------------------------------------------------------------cccc Confidence 0000 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) +...+ .+.+.....+.+++++..++.....+.+|+||++....|+.. .+..+.. +-+.-..|. ...+.|+||. T Consensus 253 ~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~-i~~~~~~~~-~~~l~G~pVv 325 (395) T protein:vir:43 253 APPSG----VVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELN-KDAENRY-IIGSPQNGT-TPTLWRLPVV 325 (395) T ss_pred ccccc----cccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHh-hccCCce-eccccccCC-CceecceeeE Confidence 00000 112223456778888888887777788999999999999865 3444443 222222222 2357899999 Q ss_pred EEEeccCCCcccC Q lcl|Aclame:pro 319 RTDALLLTEARVV 331 (331) Q Consensus 319 ~~dai~~tE~~Vv 331 (331) .++.+..++..+. 
T Consensus 326 ~~~~~~~~~~~~g 338 (395) T protein:vir:43 326 ETQAITQDEFLTG 338 (395) T ss_pred EcCCCCCCcEEEE Confidence 9999876553222 No 20 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.18 E-value=1.7e-11 Score=79.68 Aligned_cols=231 Identities=11% Similarity=-0.012 Sum_probs=154.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) |++-.++..++-+. +...||+.+.+.|+|++..+.+...++ ..++.+.++-|.+.|..=++.+++++.++ T Consensus 1 m~t~t~gg~liP~~---------~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f 70 (303) T protein:vir:97 1 MGTETSKASLFDKH---------LVSDLINKVKGHSSLAKLSSQKPIPFN-GSKEFTFTLDSDIDVVAENGKKTHGGLSL 70 (303) T ss_pred CcccCCCCeEcchh---------HHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEecCcceEEeecCccccccccce Confidence 88765555444332 345799999999999999988776544 46778888899999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGN-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn-~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) .+++-..+-+++.+.|-+.+.....+ ..++...-.....+++++.+...+++|+...+ T Consensus 71 ~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~--------------------- 129 (303) T protein:vir:97 71 EPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRT--------------------- 129 (303) T ss_pred eeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCC--------------------- Confidence 99999999999999999988765443 33455555566889999999999999952211 Q ss_pred 
eeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeec Q lcl|Aclame:pro 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) Q Consensus 160 vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~N 239 (331) |++.....+ +++.. .. .+ T Consensus 130 -----g~~~~~~~~-----------~~~~~------------------------------------~~----------~~ 147 (303) T protein:vir:97 130 -----KKASDVIGT-----------NHFDS------------------------------------KV----------TQ 147 (303) T ss_pred -----ccccccccc-----------ccccc------------------------------------cc----------cc Confidence 111110000 00000 00 00 Q ss_pred cccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEE Q lcl|Aclame:pro 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) Q Consensus 240 Id~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~ 319 (331) . . ..++...+.+-|.+++.++......+..|.||++....|+.. .+..+...+.++-..|.....++|+||.. T Consensus 148 ~--~----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l-kd~~g~~~~~~~~~~~~~~~~l~G~Pv~~ 220 (303) T protein:vir:97 148 V--V----KFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKV-TNGEMGPKMYPELAWGANPDSINGLKSSV 220 (303) T ss_pred c--c----ccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCCCeEEecCccCCCCCceecceeeEE Confidence 0 0 001112234456677777765566677899999999999854 56655544444434444556789999999 Q ss_pred EEeccCC------CcccC Q lcl|Aclame:pro 320 TDALLLT------EARVV 331 (331) Q Consensus 320 ~dai~~t------E~~Vv 331 (331) ++++... 
+..++ T Consensus 221 s~~v~~~~~~~~~~~~~~ 238 (303) T protein:vir:97 221 NTTVGAGADEAESKDLVI 238 (303) T ss_pred ecccCCccccCCCccEEE Confidence 9988432 22222 No 21 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.17 E-value=3.8e-12 Score=83.20 Aligned_cols=225 Identities=14% Similarity=0.138 Sum_probs=147.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) +.++.....+..+-+..+-+. .+...||+.+.+.++|++..+.+...++ .+.+.+.++.|.+.|.+-++..++++.++ T Consensus 21 ~~~~~a~~~~~~~~~~~liP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) T protein:vir:10 21 PQVFNPDNVMMHEKKDGTLLN-DFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) T ss_pred cceecccceeccCCCcceech-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCcceeEeccCccccccccce Confidence 111111111111211112233 2456799999999999999999887655 47788899999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) .+++-..+-+++.+.|.+.+.+... .++...-.....+++++.+...+|+|+.... ...|+. 
T Consensus 99 ~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~--~~~~i~-------------- 160 (324) T protein:vir:10 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIA-------------- 160 (324) T ss_pred eEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCcccc-------------- Confidence 9999999999999999998887653 2333444455789999999999999963311 000000 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) ++ + +.+. .. T Consensus 161 -~~----------------------~-~~~~-----------------------------------------------~~ 169 (324) T protein:vir:10 161 -QS----------------------I-EKTN-----------------------------------------------KV 169 (324) T ss_pred -cc----------------------c-cccc-----------------------------------------------ee Confidence 00 0 0000 00 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) . ++..-.+.+.+++..++.....+..|+||++.+..|+.. .+..+...+. . 
...-.+.|+||..+ T Consensus 170 ~---------~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~g~~~~~----~-~~~~~l~G~PV~~~ 234 (324) T protein:vir:10 170 I---------KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY----D-RNSDTLDGLPVVNL 234 (324) T ss_pred c---------cccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hccCCceeec----C-CCCccccceeEEee Confidence 0 000113446677777776667778999999999999854 4444333221 1 12235899999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) ++...++..++ T Consensus 235 ~~~~~~~~~~~ 245 (324) T protein:vir:10 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 88776666665 No 22 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.17 E-value=3.8e-12 Score=83.18 Aligned_cols=225 Identities=13% Similarity=0.126 Sum_probs=148.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) +.++.....+..+.+..+-+. 
.+...||+.+.+.++|++.++.+...++ .+.+.+.++-|.+.|.+-++.+++++.++ T Consensus 21 ~~~~~a~~~~~~~~~~~lip~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) T protein:vir:99 21 PQVFNPDNVMMHEKKDGTLLN-DFTTPILQEVMENSKIMRLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) T ss_pred hhhccccceeccCCCcceech-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeEeccCccccccccce Confidence 222211112222222222233 2456799999999999999999887655 47788889999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) .+++-.++-+++.+.|-+.+.+... .++...-.....+++++.+...+|+|+.... ...| T Consensus 99 ~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~--~~~~---------------- 158 (324) T protein:vir:99 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKS---------------- 158 (324) T ss_pred eEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCcc---------------- Confidence 9999999999999999998887653 2333444455889999999999999963211 0000 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) |+. + ..... . 
T Consensus 159 -------------------------~~~-~---------~~~~~----------------------------------~- 168 (324) T protein:vir:99 159 -------------------------IAQ-S---------IEKTN----------------------------------K- 168 (324) T ss_pred -------------------------ccc-c---------ccccc----------------------------------e- Confidence 000 0 00000 0 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) .+++....+.+.+++..+......+..|+||++.+..|+.. .+..+...+. +...-.+.|+||..+ T Consensus 169 --------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~g~~~~~-----~~~~~~l~G~PVv~~ 234 (324) T protein:vir:99 169 --------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY-----DRNSDTLDGLPVVNL 234 (324) T ss_pred --------eccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hcCCCceeec-----CCCCccccceeEEee Confidence 00001113446677777776667778999999999999854 4444433221 112235899999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) ++...+...++ T Consensus 235 ~~~~~~~~~~i 245 (324) T protein:vir:99 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 98877766666 No 23 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.15 E-value=2.4e-12 Score=84.25 Aligned_cols=229 Identities=10% Similarity=0.067 Sum_probs=146.7 Q ss_pred CCcCccccccHHHHHHh-----------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeec-CCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAAR-----------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGL-PTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~-----------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~l-P~~~fR~ 68 (331) ....+...+-+...... +.+. 
.....||+.+.+.++|++.++.....++ .+.+.+.++- +++.|.. T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:81 96 NDRSARATMNIKAALNTASTDAAGSAGALTTP-NRLPGFITPPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVA 173 (390) T ss_pred hhhhhhhhhHHHHHHHhhccccccCCcceech-hhhHHHHHHHhhhhhhhhhcceeeccCC-ceEEEEEecCCcceeeec Confidence 00000000111111100 1111 1346799999999999999998876544 5777777764 6889999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 69 LNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 69 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) =++..++++.++.+++-..+-+++.+.|.+.+.+..++ +...-.....+++++.+...||+||...+ .+.||..- T Consensus 174 Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~---~~~~i~~~l~~~~~~~~d~a~l~G~g~~~--~~~Gi~~~ 248 (390) T protein:vir:81 174 EGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQ---LASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIPQ 248 (390) T ss_pred CCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cccceeec Confidence 99999999999999999999999999999998876543 44445556889999999999999974322 23333210 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) ++.. 
T Consensus 249 --------------~~~~-------------------------------------------------------------- 252 (390) T protein:vir:81 249 --------------ATTY-------------------------------------------------------------- 252 (390) T ss_pred --------------cccc-------------------------------------------------------------- Confidence 0000 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) ++ ..+.+.....+.+++++..+.........|+||++....|+.. .+..+...+.+ ...+ . T Consensus 253 ----------~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~-~~~~-~ 313 (390) T protein:vir:81 253 ----------AA------PTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELA-KDANNQYLIGN-ARGT-L 313 (390) T ss_pred ----------cc------ccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hcCCCceeecC-cccc-c Confidence 00 0011122334557777888877777778999999999999854 45544432222 1222 1 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) ...++|+||..++++-.+...+. 
T Consensus 314 ~~~l~G~pv~~~~~~p~~~~~~g 336 (390) T protein:vir:81 314 TPTLWGLPVVATQAMAPGEFLVG 336 (390) T ss_pred CceecceeeEEcCCCCCCcEEEE Confidence 23679999999999865442222 No 24 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.15 E-value=1.7e-11 Score=79.69 Aligned_cols=230 Identities=13% Similarity=0.048 Sum_probs=150.0 Q ss_pred CCcCcccc-ccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MPTLSTTN-PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSR 79 (331) Q Consensus 1 M~~l~~~a-~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 79 (331) |+++.+.. .++ +. .+...|||.+.+.+.|++..+.+...++ ..++.+.++-|.++|..=++.+++++.+ T Consensus 1 Mat~tt~~g~~v--------P~-~~~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~~~~~ 70 (311) T protein:vir:99 1 MATFGTGNLKNL--------PR-NIADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAEFVGEGQQKSSTTGE 70 (311) T ss_pred CceecCCCceec--------cH-HHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEEeecCcccccccce Confidence 88764432 122 22 2345799999999999998888776543 4688899999999999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 80 TVQVKDSMGMLETYAEVDKALADLNGN-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 80 ~~~~~~~l~ilgg~~eVDr~la~~~gn-~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) +.+++-..+-+++.+.|-+.|.+...+ ..++...-.....+++++++.+++|||+....+..+.|+...... .. 
T Consensus 71 f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~-----~~ 145 (311) T protein:vir:99 71 FDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGA-----AS 145 (311) T ss_pred eeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccc-----cc Confidence 999999999999999999999876654 334555555678999999999999999865555555554332100 00 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) +.+..+ T Consensus 146 ~~~~~~-------------------------------------------------------------------------- 151 (311) T protein:vir:99 146 KRVELT-------------------------------------------------------------------------- 151 (311) T ss_pred ceeecc-------------------------------------------------------------------------- Confidence 000000 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcC--CCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNV--GMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIP 316 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~--~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvp 316 (331) +++...+...+..++.++... ...+..|.||++....|+.. 
.++.+...+ .....+...-.+.|+| T Consensus 152 ----------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l-kd~~G~~l~-~~~~~~~~~~~l~G~P 219 (311) T protein:vir:99 152 ----------ADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTA-RYTDGRKKF-PELGLGIGVSSFEGID 219 (311) T ss_pred ----------ccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhh-hccCCCeee-cCcccCCCCceeccee Confidence 000011222233333333221 22234699999999999864 455544433 3333344456799999 Q ss_pred EEEEEeccCCC------cccC Q lcl|Aclame:pro 317 CRRTDALLLTE------ARVV 331 (331) Q Consensus 317 ir~~dai~~tE------~~Vv 331 (331) +..++++..+- ..++ T Consensus 220 v~~s~~i~~~~~~~~~~~~~~ 240 (311) T protein:vir:99 220 ASVSDTVNGGDEADPDDEDLD 240 (311) T ss_pred eEeecccccccccccccchhh Confidence 99999874211 1111 No 25 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.13 E-value=1.4e-11 Score=80.04 Aligned_cols=240 Identities=13% Similarity=0.146 Sum_probs=148.4 Q ss_pred CCcCccccccHHHHHHh-------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAAR-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGV 73 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~-------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~ 73 (331) ..-+..+.. -..+.+. +-+. 
.+...||+.+.+.++|++..++...+++ .+.+.+.++-+.+.|..=++.+ T Consensus 118 ~~~l~~~e~-~~al~~~t~~~gG~lvP~-~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~~~~~~~~~a~wv~E~~~~ 194 (425) T protein:vir:10 118 KAHVKRGDV-QAALNKGEDSEGGYLTPI-EWDRTITNKLVLISPMRQLCRVQPVSKA-GFSKLFNMGGTTSGWVGEASQR 194 (425) T ss_pred HHHhhhhhh-HHHhhcCcCCCCceeccH-hHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEcCCcceeeecccccc Confidence 000000000 0000000 1122 2346799999999999999998877654 5778889999999999999888 Q ss_pred Cccc-ceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcc Q lcl|Aclame:pro 74 QPEK-SRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNS 151 (331) Q Consensus 74 ~~s~-~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~ 151 (331) +++. .++.+++-.++-+++.+.|.+.+.+... +..++ -...+.++++.++...|+|||-...|.+ +-... T Consensus 195 ~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~---i~~~la~ai~~~~d~~~l~G~G~~~p~G---il~~~-- 266 (425) T protein:vir:10 195 PQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESW---LATEVQTEFAKQEGKAFLAGDGTNKPNG---LLTYI-- 266 (425) T ss_pred ccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHH---HHHHHHHHHHHHHHhhhhcccCCCCcce---eeecc-- Confidence 8876 5899999999999999999999988653 44443 4455889999999999999986544443 32211 Q ss_pred ccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecc Q lcl|Aclame:pro 152 LSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDW 231 (331) Q Consensus 152 ~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~ 231 (331) + +++... 
....+.++ T Consensus 267 -~----------~~~~~~-----------~~~~~~~~------------------------------------------- 281 (425) T protein:vir:10 267 -A----------GGANAA-----------KHPFGAIE------------------------------------------- 281 (425) T ss_pred -c----------cccccc-----------cccccccc------------------------------------------- Confidence 0 000000 00000000 Q ss_pred cceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEE Q lcl|Aclame:pro 232 RYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVA 311 (331) Q Consensus 232 r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~ 311 (331) . +.. .++.....+-|++++..++....++.+|+||++...+|+.. .++.+.+.+.+.-..|. ... T Consensus 282 -------~-----~~~-~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~l-kD~~G~~l~~~~~~~g~-~~~ 346 (425) T protein:vir:10 282 -------V-----VNS-GAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKL-KDGQGNYLWQPSYVAGQ-PAT 346 (425) T ss_pred -------c-----ccc-cccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHh-hcCCCceeeccCccCCC-Cce Confidence 0 000 01111113335566667776677789999999999999964 56665544444333443 356 Q ss_pred EcCeEEEEEEeccCCC--cc-cC Q lcl|Aclame:pro 312 FDGIPCRRTDALLLTE--AR-VV 331 (331) Q Consensus 312 ~~gvpir~~dai~~tE--~~-Vv 331 (331) +.|.||..+|++.... .. 
|+ T Consensus 347 l~G~PV~~~~~~p~~~~~~~~i~ 369 (425) T protein:vir:10 347 LAGYPVTEVPDMPDVAANSTPIL 369 (425) T ss_pred ecceeeEEecCcCCccCCccEEE Confidence 8899999999985322 22 22 No 26 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.13 E-value=9.1e-12 Score=81.12 Aligned_cols=225 Identities=14% Similarity=0.134 Sum_probs=147.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) +-.+.....|..+.+..+-++. +...|++.+.+.|+|++.++.+...++ ..++.+.++.|.++|.+=++..++++.++ T Consensus 21 ~~~~~a~~~~~~~~~~~liP~~-~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f 98 (324) T protein:vir:93 21 PQVFNPDNVMMHEKKDGTLLND-FTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) T ss_pred hhhcccccccccCCCcceechh-HHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceeeecCCccccccccce Confidence 2222111122222222233443 456799999999999999998876544 47788999999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) .+++-..+-+++.+.|-+.+.+... .++...-.....+++++.+..++|+|+.+.. 
...| T Consensus 99 ~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~~---------------- 158 (324) T protein:vir:93 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKS---------------- 158 (324) T ss_pred eEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--cCcc---------------- Confidence 9999999999999999987777643 1233333344679999999999999963211 0000 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) ++.. + +... .. T Consensus 159 -------------------------~~~~-----~-----~~~~----------------------------------~~ 169 (324) T protein:vir:93 159 -------------------------IAQS-----I-----EKTN----------------------------------KV 169 (324) T ss_pred -------------------------cccc-----c-----cccc----------------------------------ee Confidence 0000 0 0000 00 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) +++....+-+.+++..++.....+..|+||++.+..|+.. .+......+ . 
+.....+.|+||..+ T Consensus 170 ---------~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-~d~~G~~~~-~----~~~~~~l~G~PVv~~ 234 (324) T protein:vir:93 170 ---------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERI-Y----DRNSDSLDGLPVVNL 234 (324) T ss_pred ---------ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hCCCCCeee-c----CCCCCcccceeeEee Confidence 0001113446677777776667778999999999999865 444443322 1 122346899999998 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) .+...++..++ T Consensus 235 ~~~~~~~~~i~ 245 (324) T protein:vir:93 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 88766665555 No 27 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.12 E-value=3.6e-11 Score=77.84 Aligned_cols=228 Identities=11% Similarity=0.016 Sum_probs=147.8 Q ss_pred CCcCccccccHHHHHH----------hcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAA----------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLN 70 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak----------~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN 70 (331) |. 
-+.-+-+|-++ .+-++ .+...||+.+.+.++|++.++.+...++ .+++.+.++-|.++|..=+ T Consensus 1 ~~---~~~~~~~e~~~~~~~~~~~~~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg 75 (318) T protein:vir:24 1 MA---AGTAFAVDHAQIAQTGDTMFKGYLEP-EQAKDYFAEAEKTSIVQQFAQKVPMGTT-GQKIPHWVGDVSAQWIGEG 75 (318) T ss_pred CC---CCCCCCHHHHHhhcccCcccceeech-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCcceEEecCC Confidence 11 11111111111 11222 2456799999999999999999886544 5788899999999999999 Q ss_pred CccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhc Q lcl|Aclame:pro 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFN 150 (331) Q Consensus 71 ~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~ 150 (331) +.+++++.++.+++-.++-+++...+-+.+.+... .++...-.....++++.++...|++|+.+..|..+. . T Consensus 76 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~---~--- 147 (318) T protein:vir:24 76 DMKPITKGNMTSQTIAPHKIATIFVASAETVRANP--ANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIG---Q--- 147 (318) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccc---c--- Confidence 99999999999999999999999999998776543 234444456689999999999999997432221110 0 Q ss_pred cccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEec Q lcl|Aclame:pro 151 SLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRD 230 (331) Q Consensus 151 ~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d 230 (331) . 
T Consensus 148 ----------------------------------~--------------------------------------------- 148 (318) T protein:vir:24 148 ----------------------------------T--------------------------------------------- 148 (318) T ss_pred ----------------------------------c--------------------------------------------- Confidence 0 Q ss_pred ccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCC-ce- Q lcl|Aclame:pro 231 WRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG-KK- 308 (331) Q Consensus 231 ~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g-~~- 308 (331) +..+..+... .......+.+.+++..+.+....+.+|+||++....|+.. .+..+...+ .....+ .. T Consensus 149 ------~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~-~~~~~~~~~~ 217 (318) T protein:vir:24 149 ------TKAISIADTT---GATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGA-KDQNGRPLF-IESTYGEAAS 217 (318) T ss_pred ------cccccccccc---cccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hccCCceee-cCccccCccc Confidence 0000000000 0001123446667777888888889999999999999854 455444322 222211 11 Q ss_pred ---eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 ---VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 ---v~~~~gvpir~~dai~~tE~~Vv 331 (331) -..+.|+|+..++++-.+...++ T Consensus 218 ~~~~~~i~g~pv~~~~~~~~~~~~~~ 243 (318) T protein:vir:24 218 PFRSGRIVARPTILSDHVVEGTTVGF 243 (318) T ss_pred cccCceEEEEeeEEeCCCCCCccEEE Confidence 12467899999999876665443 No 28 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.11 E-value=5.9e-12 Score=82.15 Aligned_cols=229 Identities=15% Similarity=0.133 Sum_probs=144.5 Q ss_pred CC--cCccccccHHHHHHhcCCccchhHHHH-HHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCccc Q lcl|Aclame:pro 1 MP--TLSTTNPTLADVAARMTPDGKIDPQIV-EMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 
M~--~l~~~a~TL~E~Ak~~~~~~~~~~~VI-E~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~ 77 (331) ++ .......|..+... +-+. .+...|| +.+.+.++|.....+.... ..+.+.+.++-|.+.|..=++.++.++ T Consensus 243 ~~~~~~~~~~~t~~~gg~-lip~-~~~~~ii~~~~~~~~~l~~~~~~~~~~--g~~~~~~~~~~~~a~~v~Eg~~~~~~~ 318 (543) T protein:vir:81 243 RAINEVRAMGLTKADGGY-LVPF-QLDPTVIITSNGSLNDIRRFARQVVAT--GDVWHGVSSAAVQWSWDAEFEEVSDDS 318 (543) T ss_pred hhhhhhhhcccccccCcc-cCch-hhhhHHHHHHHhhhchhhhhcccccCC--cceEEEEecCCcceeecccCccccccc Confidence 00 00001111111111 1122 2344555 6677778888887776553 246677888999999999999999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|.+.+.+... ++...-.....++++..+...|||||... ..+.|+.... T Consensus 319 ~~~~~i~~~~~k~~~~~~is~ell~d~~---~~~~~i~~~l~~~~~~~~d~ail~G~Gt~--~~p~Gi~~~~-------- 385 (543) T protein:vir:81 319 PEFGQPEIPVKKAQGFVPISIEALQDEA---NVTETVALLFAEGKDELEAVTLTTGTGQG--NQPTGIVTAL-------- 385 (543) T ss_pred cccceeeeeeeeeEeeehhhHHHHhccH---HHHHHHHHHHHHHHHHHHHHHHhccCCCC--cccccchhhc-------- Confidence 9999999999999999999999887543 56666667789999999999999997432 3566663211 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) +++... 
T Consensus 386 ------~~~~~~-------------------------------------------------------------------- 391 (543) T protein:vir:81 386 ------AGTAAE-------------------------------------------------------------------- 391 (543) T ss_pred ------cccccc-------------------------------------------------------------------- Confidence 000000 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) + + .+++.....+.+++++..++.....+.+|+||++.+..|+.. .+..+.+.+.+ -..| ....+.|.|| T Consensus 392 --~-----~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~l-kd~~G~~l~~~-~~~g-~~~~l~G~pv 460 (543) T protein:vir:81 392 --I-----A-PVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQF-DTQGGAGLWTT-IGNG-EPSQLLGRPV 460 (543) T ss_pred --c-----c-ccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHh-hcCCCceeccC-cCCC-CCccccceee Confidence 0 0 000001124456777777877777778999999999999965 34443332222 1222 2346899999 Q ss_pred EEEEeccCC-------Cc-ccC Q lcl|Aclame:pro 318 RRTDALLLT-------EA-RVV 331 (331) Q Consensus 318 r~~dai~~t-------E~-~Vv 331 (331) ..+|+|-.. .. 
.|+ T Consensus 461 ~~~~~~~~~~~~~~~~~~~~i~ 482 (543) T protein:vir:81 461 GEAEAMDANWNTSASADNFVLL 482 (543) T ss_pred EEeccccccccccccCCcceEE Confidence 999997422 11 122 No 29 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.10 E-value=1.8e-11 Score=79.46 Aligned_cols=226 Identities=15% Similarity=0.097 Sum_probs=147.4 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) |+.=+.+-.|..+.. .+-+. .+...|||.+.+.++|++..+.....++ .+.+.+.+ -|+++|..=++.+++++.++ T Consensus 1 ~g~~a~~~~~~~~~~-~~iP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~~~~~-~~~a~~v~E~~~~~~~~~~f 76 (299) T protein:vir:41 1 MGFNPDTTTMQSAKT-GSIPI-NISEQIITGVKNGSAAMKLAKAVPMTKP-EEEFTFMS-GVGAFWVDEAERIQTSKPTF 76 (299) T ss_pred CCcCCCcccccCCCc-eecch-hHHHHHHHHHHhcchhhhhceeeecCCC-cEEEEEEc-CCceeeeecCccccccccce Confidence 433222222222222 12233 2456799999999999999998886544 45666554 58899999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) .+++-..+-+++.+.|-+.+.+... .++...-.....+++++.+...||+||.+..|. |+-... 
T Consensus 77 ~~v~l~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~---gil~~~----------- 140 (299) T protein:vir:41 77 TKAKMRSKKMGVIIPTTKENLNYSV--TNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNW---NILKSA----------- 140 (299) T ss_pred eEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhhcccCcccc---cccccc----------- Confidence 9999999999999999999988654 234444556689999999999999998543322 111100 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) +.+. +. T Consensus 141 ----~~~~----------------------------------------------------------------------~~ 146 (299) T protein:vir:41 141 ----TDAS----------------------------------------------------------------------NL 146 (299) T ss_pred ----cccc----------------------------------------------------------------------ee Confidence 0000 00 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) .+++..-.+-+++++.+++.....+.+|+||++...+|+.. .+..+...+.+.-..|. ..+.|+||..+ T Consensus 147 --------~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~~~~--~~l~G~PV~~~ 215 (299) T protein:vir:41 147 --------VEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRST-KDGNGMPIFNTATSNGV--DDVLGLPIAYT 215 (299) T ss_pred --------eccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHh-hccCCceeecCCcCCCC--ceecceeeEEe Confidence 00001112346677777776667778999999999999965 44444433333323333 25789999999 Q ss_pred EeccCCCcc--cC Q lcl|Aclame:pro 321 DALLLTEAR--VV 331 (331) Q Consensus 321 dai~~tE~~--Vv 331 (331) |++-..... 
++ T Consensus 216 ~~~~~~~~~~~~~ 228 (299) T protein:vir:41 216 PKYTFGDKDISEL 228 (299) T ss_pred cccCCCCCceEEE Confidence 998643322 21 No 30 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.09 E-value=2e-11 Score=79.25 Aligned_cols=236 Identities=10% Similarity=0.006 Sum_probs=148.0 Q ss_pred CCcCccc---cccHHHHH---------HhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTT---NPTLADVA---------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~---a~TL~E~A---------k~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~ 68 (331) |.+=+.. -+...|.+ ..+-+. .+...|||.+.+.++|++..++....++ .+++.+.++-|+++|.. T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~-~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEP-EQAQDYFAEAEKISIVQQFAQKIPMGTT-GQKIPHWTGDVSASWIG 78 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceech-hhHHHHHHHHHhcchhhhhcceeeccCC-ceEEEEEeCCcceEEec Confidence 2221100 01111111 001122 2346799999999999999999876543 57888999999999999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 69 LNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 69 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) =++.+++++.++.+++-..+-+++.+.|-+.+.+... .++...-.+...+++++.+.+.+|+||.+..|.++.... 
T Consensus 79 Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~--~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~-- 154 (326) T protein:vir:42 79 EGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP--ANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTT-- 154 (326) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc-- Confidence 9999999999999999999999999999998877643 234444445578999999999999998644432221100 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) ++.+...++ T Consensus 155 ---------------~~~~~~~~~-------------------------------------------------------- 163 (326) T protein:vir:42 155 ---------------KEVSLVDPD-------------------------------------------------------- 163 (326) T ss_pred ---------------cccceeecc-------------------------------------------------------- Confidence 000000000 Q ss_pred ecccceeeeeccccccCCCCccchhhHHH-HHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCc Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLID-LMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGK 307 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~-lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~ 307 (331) .+...+.....+ .+..++..+.+......+|+||++.+..|+.. .+..+...+.+....|. 
T Consensus 164 -----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~~~~ 225 (326) T protein:vir:42 164 -----------------GTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGA-KDKSGRPLFIESTYTEE 225 (326) T ss_pred -----------------cccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHh-hccCCceeeccccccCc Confidence 000011111111 23444555555566678999999999999964 45544443333332222 Q ss_pred ----eeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 308 ----KVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 308 ----~v~~~~gvpir~~dai~~tE~~Vv 331 (331) .-..+.|+||..++++-.++..++ T Consensus 226 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 253 (326) T protein:vir:42 226 NSPFRLGRIVARPTILSDHVASGTVVGY 253 (326) T ss_pred cccccCceeeeeeEEEcCCCCCCceEEE Confidence 223688999999999877665544 No 31 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.09 E-value=1.2e-11 Score=80.49 Aligned_cols=226 Identities=8% Similarity=-0.021 Sum_probs=145.8 Q ss_pred CCcCccccccHHHHHH------hcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAA------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak------~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~ 74 (331) |. ..+=.-.-+.+ .+-+.. 
+...||+.+.+.++|++..+.++..++ .+.+.++++-|.+.|..=++.++ T Consensus 1 ~g---~~~e~~~~~~~~t~~~~g~l~~~-~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~ 75 (397) T protein:vir:23 1 MG---FSADHSQIAQTKDTMFTGYLDPV-QAKDYFAEAEKTSIVQRVAQKIPMGAT-GIVIPHWTGDVSAQWIGEGDMKP 75 (397) T ss_pred CC---cCHHHHHHhhccCCCCccccchh-HHHHHHHHHHhccchhhhcceeeccCC-ceEEEEEcCCcceEEecCCcccc Confidence 22 22111111100 011222 345799999999999999999887544 57788999999999999999999 Q ss_pred cccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccc Q lcl|Aclame:pro 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) Q Consensus 75 ~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~ 154 (331) +++.++.+++-..+-+++.+.|-+.+.+... .++...-.+...+++++++.+.||||+.. |+...|+.. T Consensus 76 ~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~gt--~~~~~~~~~------- 144 (397) T protein:vir:23 76 ITKGNMTKRDVHPAKIATIFVASAETVRANP--ANYLGTMRTKVATAIAMAFDNAALHGTNA--PSAFQGYLD------- 144 (397) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccC--Ccccccccc------- Confidence 9999999999999999999999999887653 23344444558899999999999999732 111111100 Q ss_pred cccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccce Q lcl|Aclame:pro 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) Q Consensus 155 ~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v 234 (331) .+ +. 
T Consensus 145 ----------~~-------------------------------------------~~----------------------- 148 (397) T protein:vir:23 145 ----------QS-------------------------------------------NK----------------------- 148 (397) T ss_pred ----------cc-------------------------------------------cc----------------------- Confidence 00 00 Q ss_pred eeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce----eE Q lcl|Aclame:pro 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK----VV 310 (331) Q Consensus 235 ~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~----v~ 310 (331) ... .+.....+.+++++..+......+..|+||++.+..|+.. ++......+.+....+.. .. T Consensus 149 ----~~~--------~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~l-kd~~G~~i~~~~~~~~~~~~~~~~ 215 (397) T protein:vir:23 149 ----TQS--------ISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGS-VDANGRPLFVESTYESLTTPFREG 215 (397) T ss_pred ----eee--------ecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHh-hccCCceeecccccccccccccCc Confidence 000 0000112234455555655566678999999999999964 455444433333333322 23 Q ss_pred EEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~tE~~Vv 331 (331) .+.|+|+..++++-..+..++ T Consensus 216 tl~G~Pv~~s~~~~~g~~~~~ 236 (397) T protein:vir:23 216 RILGRPTILSDHVAEGDVVGY 236 (397) T ss_pred eeeeeeEEEeCCCCCCceEEE Confidence 689999999999876554433 No 32 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.09 E-value=1.6e-11 Score=79.83 Aligned_cols=225 Identities=14% Similarity=0.139 Sum_probs=146.0 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 
M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) -..+.....+..+-..-+-+. .+...|+|.+.+.++|++.++.+..+++ .+++.+.++.|.+.|.+=++.+++++.++ T Consensus 21 ~~~~~a~~~~~~~~~~~~iP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f 98 (324) T protein:vir:97 21 PQVFNPDNVMMHEKKDGTLMN-EFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) T ss_pred hhhhccccccccCCCcceech-hHHHHHHHHHHhhcchhhhcceeeccCC-ceEEEEEecCcceeEeccCccccccccce Confidence 000000001111111112232 2456799999999999999999887654 47788899999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccccee Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~v 160 (331) .+++-.++-+++.+.|-+.+.+... -++...-.....++++...++.+|+|+.+.. ...|+.. T Consensus 99 ~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~gi~~------------- 161 (324) T protein:vir:97 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQ------------- 161 (324) T ss_pred eEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--cCccccc------------- Confidence 9999999999999999998887654 1233333455789999999999999974321 0111100 Q ss_pred eccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 161 idaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) .+ ++ .+. 
T Consensus 162 ------------------------~~-~~------------------------------------------------~~~ 168 (324) T protein:vir:97 162 ------------------------SI-EK------------------------------------------------TNK 168 (324) T ss_pred ------------------------cc-cc------------------------------------------------cce Confidence 00 00 000 Q ss_pred ccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEE Q lcl|Aclame:pro 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) Q Consensus 241 d~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~ 320 (331) .+.+....+.|.+++..+......+..|+||+..+..|+.. .+......+. +...-.+.|.||..+ T Consensus 169 --------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-kd~~g~~~~~-----~~~~~tl~G~PV~~~ 234 (324) T protein:vir:97 169 --------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY-----DRNSDTLDGLPVVNL 234 (324) T ss_pred --------eccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hcCCCceeec-----CCCCccccceeeEee Confidence 00001113446677777777777778999999999999854 3444333221 112235899999999 Q ss_pred EeccCCCcccC Q lcl|Aclame:pro 321 DALLLTEARVV 331 (331) Q Consensus 321 dai~~tE~~Vv 331 (331) ++...+...++ T Consensus 235 ~~~~~~~~~~~ 245 (324) T protein:vir:97 235 KSSNLKRGELI 245 (324) T ss_pred cCCCCCcceEE Confidence 98776666555 No 33 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.08 E-value=3.4e-11 Score=77.98 Aligned_cols=229 Identities=11% Similarity=0.072 Sum_probs=147.0 Q ss_pred CCcCccccccHHHH-----------HHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeec-CCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADV-----------AARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGL-PTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~-----------Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~l-P~~~fR~ 68 (331) 
+.-.....+.+... +-.+.+. .+...||+.+.+.++|++.+++....++ .+.+.+.++- +.+.|.. T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:10 96 NDRSARATMNIKAALNTASTDAAGSAGALTTP-NRLPGFITQPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVA 173 (390) T ss_pred hhhhhhhhhHHHHHHHhhhcccccccccccch-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCCcceeeec Confidence 00000000000000 0001111 1345799999999999999999887554 4677777764 6889999 Q ss_pred cCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 69 LNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 69 lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) =++..+++..++.+++-..+-+++.+.|.+.+.+...+ +-..-.....++++....+.||+|+...+ .+.||-.- T Consensus 174 Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~---l~~~i~~~l~~~~~~~~~~~il~G~G~~~--~p~Gi~~~ 248 (390) T protein:vir:10 174 EGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQ---LASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIPQ 248 (390) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHhhcCCCCc--cccccccc Confidence 99999999999999999999999999999998775543 33444555788999999999999974322 34444210 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) ++.+ T Consensus 249 --------------~~~~-------------------------------------------------------------- 252 (390) T protein:vir:10 249 --------------ATTY-------------------------------------------------------------- 252 (390) T ss_pred --------------cccc-------------------------------------------------------------- Confidence 0000 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 
RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) ++ ..+.++.++.+.+++++..+......+..|+||++....|+.. .+..+...+.+ ...+- T Consensus 253 ----------~~------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l-kd~~g~~l~~~-~~~~~- 313 (390) T protein:vir:10 253 ----------AA------PTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELA-KDANNQYLIGN-ARGTL- 313 (390) T ss_pred ----------cc------cccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hcCCCceeecC-CcCcC- Confidence 00 0112334456667888888877777788999999999999864 34444432222 22222 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) .-.++|+||..++.+-.+...+. T Consensus 314 ~~~l~G~pv~~~~~~p~~~~~~g 336 (390) T protein:vir:10 314 TPTLWGLPVVATQAMAPGEFLVG 336 (390) T ss_pred CceecceeeEEcCCCCCCcEEEE Confidence 23579999999999865432222 No 34 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.08 E-value=1.3e-11 Score=80.20 Aligned_cols=227 Identities=13% Similarity=0.070 Sum_probs=145.3 Q ss_pred CCcCccccccHHHHHHh----------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceeec Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAAR----------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKL 69 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~----------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~l 69 (331) |.. .....+.+.... +-+. 
.+...||+.+.+.++|++.+++....++ .+.+.+.++ -|.+.|..= T Consensus 121 ~~~--~~~~~~~~~~~~~~~~~~~~g~lvp~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E 196 (418) T protein:vir:10 121 RVR--VDRKSIMNVPATVGSGVSGSNSLVVA-DRQAGIIAPPQRKMTIRDLLMPGQTSSS-SIEYTVETGFTNNAAAVAE 196 (418) T ss_pred hhh--hHHHHHHHhhhhccCCCCCCccccch-hHHHHHHHHHhhhhhHHhhcceeeccCC-ceeEEEEecCCCceeeecc Confidence 000 000000000000 1111 2445799999999999999999877544 456777666 588899999 Q ss_pred CCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhh Q lcl|Aclame:pro 70 NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRF 149 (331) Q Consensus 70 N~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~ 149 (331) ++.++.++.++.+++-.++-+++.+.|.+.+.+..+ ++...-.....+++++.+...||+||...+ .+.||..- T Consensus 197 ~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~---~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~--~p~Gi~~~- 270 (418) T protein:vir:10 197 GAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP---ALQSYIDGRARYGLQLTEEGQILKGDGTGA--NILGILPQ- 270 (418) T ss_pred CccccccccceeeEEEeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccccc- Confidence 999999999999999999999999999999987654 344555566899999999999999974321 12333210 Q ss_pred ccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEe Q lcl|Aclame:pro 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLR 229 (331) Q Consensus 150 ~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~ 229 (331) ++... 
T Consensus 271 -------------~~~~~-------------------------------------------------------------- 275 (418) T protein:vir:10 271 -------------ASAFM-------------------------------------------------------------- 275 (418) T ss_pred -------------ccccc-------------------------------------------------------------- Confidence 00000 Q ss_pred cccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCcee Q lcl|Aclame:pro 230 DWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKV 309 (331) Q Consensus 230 d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v 309 (331) . . .+.+..+..+-+++++..+......+..|+||++....|+.. .+..+...+ +....| .. T Consensus 276 ----------~---~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~i~-~~~~~~-~~ 336 (418) T protein:vir:10 276 ----------P---S---ITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELT-KDSQGRYIV-GNPVNG-TT 336 (418) T ss_pred ----------c---c---ccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh-hcCCCceec-cccccC-CC Confidence 0 0 000111223446677777777777778999999999999864 344444322 222222 23 Q ss_pred EEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 310 VAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 310 ~~~~gvpir~~dai~~tE~~Vv 331 (331) -.++|+||..++++..++..+. 
T Consensus 337 ~~l~G~pV~~~~~~p~~~~~~g 358 (418) T protein:vir:10 337 PRLWNLPVVETQAMTANEFLVG 358 (418) T ss_pred ceecceeeEEcCCCCCCcEEEe Confidence 3689999999999875542222 No 35 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.08 E-value=1.8e-11 Score=79.54 Aligned_cols=225 Identities=14% Similarity=0.129 Sum_probs=146.6 Q ss_pred CCc--------------------CccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee Q lcl|Aclame:pro 1 MPT--------------------LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG 60 (331) Q Consensus 1 M~~--------------------l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~ 60 (331) |=- ......+..+....+-++ .+...||+.+.+.|+|++.++.+...++ .+++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~-~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~ 78 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMN-EFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWAD 78 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccch-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEec Confidence 000 000000111111112233 2456799999999999999999887644 588999999 Q ss_pred cCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChh Q lcl|Aclame:pro 61 LPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAE 140 (331) Q Consensus 61 lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~ 140 (331) .|.++|.+=++.+++++.++.+++...+-+++.+.|.+.+.+... 
.++...-.....++++..+...+|||+.+.+ T Consensus 79 ~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~-- 154 (324) T protein:vir:96 79 KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP-- 154 (324) T ss_pred CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-- Confidence 999999999999999999999999999999999999998877653 1333334455789999999999999963221 Q ss_pred hhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEE Q lcl|Aclame:pro 141 KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHY 220 (331) Q Consensus 141 ~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~ 220 (331) ...|+.. ..+.+ T Consensus 155 ~~~gi~~---------------~~~~~----------------------------------------------------- 166 (324) T protein:vir:96 155 FGKSIAQ---------------SIEKT----------------------------------------------------- 166 (324) T ss_pred cCccccc---------------ccccc----------------------------------------------------- Confidence 0001000 00000 Q ss_pred EeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeee Q lcl|Aclame:pro 221 KWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLT 300 (331) Q Consensus 221 ~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~ 300 (331) +- .+.+....+.+.+++..++.....+.+|+||++.+..|+.. .+..+...+. 
T Consensus 167 ------------------~~--------~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l-~d~~G~~~~~ 219 (324) T protein:vir:96 167 ------------------NK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY 219 (324) T ss_pred ------------------ce--------eccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hccCCCeeec Confidence 00 00001113446677777777777778999999999999865 4444433221 Q ss_pred ecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 301 MEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 301 ~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) +.....+.|+||..+.+..-++..++ T Consensus 220 -----~~~~~~l~G~PV~~~~~~~~~~~~~~ 245 (324) T protein:vir:96 220 -----DRNSDSLDGLPVVNLKSSNLKRGELI 245 (324) T ss_pred -----CCCCCcccceeeEeeCCCCCCcceEE Confidence 12234589999999888766666655 No 36 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.08 E-value=1.8e-11 Score=79.54 Aligned_cols=225 Identities=14% Similarity=0.129 Sum_probs=146.6 Q ss_pred CCc--------------------CccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee Q lcl|Aclame:pro 1 MPT--------------------LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG 60 (331) Q Consensus 1 M~~--------------------l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~ 60 (331) |=- ......+..+....+-++ .+...||+.+.+.|+|++.++.+...++ .+++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~-~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~ 78 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMN-EFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWAD 78 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccch-hHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEec Confidence 000 000000111111112233 2456799999999999999999887644 588999999 Q ss_pred cCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChh Q lcl|Aclame:pro 61 
LPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAE 140 (331) Q Consensus 61 lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~ 140 (331) .|.++|.+=++.+++++.++.+++...+-+++.+.|.+.+.+... .++...-.....++++..+...+|||+.+.+ T Consensus 79 ~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~-- 154 (324) T protein:vir:78 79 KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP-- 154 (324) T ss_pred CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-- Confidence 999999999999999999999999999999999999998877653 1333334455789999999999999963221 Q ss_pred hhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEE Q lcl|Aclame:pro 141 KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHY 220 (331) Q Consensus 141 ~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~ 220 (331) ...|+.. ..+.+ T Consensus 155 ~~~gi~~---------------~~~~~----------------------------------------------------- 166 (324) T protein:vir:78 155 FGKSIAQ---------------SIEKT----------------------------------------------------- 166 (324) T ss_pred cCccccc---------------ccccc----------------------------------------------------- Confidence 0001000 00000 Q ss_pred EeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeee Q lcl|Aclame:pro 221 KWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLT 300 (331) Q Consensus 221 ~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~ 300 (331) +- .+.+....+.+.+++..++.....+.+|+||++.+..|+.. .+..+...+. 
T Consensus 167 ------------------~~--------~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l-~d~~G~~~~~ 219 (324) T protein:vir:78 167 ------------------NK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKERIY 219 (324) T ss_pred ------------------ce--------eccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hccCCCeeec Confidence 00 00001113446677777777777778999999999999865 4444433221 Q ss_pred ecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 301 MEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 301 ~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) +.....+.|+||..+.+..-++..++ T Consensus 220 -----~~~~~~l~G~PV~~~~~~~~~~~~~~ 245 (324) T protein:vir:78 220 -----DRNSDSLDGLPVVNLKSSNLKRGELI 245 (324) T ss_pred -----CCCCCcccceeeEeeCCCCCCcceEE Confidence 12234589999999888766666655 No 37 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.04 E-value=4.9e-11 Score=77.10 Aligned_cols=233 Identities=13% Similarity=0.195 Sum_probs=144.4 Q ss_pred CCcCc--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcc-c Q lcl|Aclame:pro 1 MPTLS--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPE-K 77 (331) Q Consensus 1 M~~l~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s-~ 77 (331) |.+-. .+-. +-+. .+...|||.+.+.++|++.++.....++ .+.+.+.++-+.++|..=++..+++ . 
T Consensus 107 ~~~~~~~~GG~--------~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~wv~E~~~~~~~~~ 176 (401) T protein:vir:44 107 LQVGTDEDGGY--------AVPE-ELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTASGWVGETDTRSQTAT 176 (401) T ss_pred hhcCCCCCCce--------eccH-hHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccceeeccccccCcccc Confidence 11100 0001 1121 2456799999999999999998876544 5778888888999999888888765 4 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) .++.+++-..+-+.+.+.|-+.+.+... +..++ -.....++++..+..+|++||-...|+++..... T Consensus 177 ~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~--------- 244 (401) T protein:vir:44 177 SRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAW---INSELATEFAEQEEIAFTTGDGTKKPKGFLAYES--------- 244 (401) T ss_pred ccceeeeeehhheeeehhhhHHHHhcchHHHHHH---HHHHHHHHHHHHHHhhhhccCCCCccceeecccc--------- Confidence 6999999999999999999999988754 33433 3445788999999999999986655444321110 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) ...... ...|+. 
T Consensus 245 ~~~~~~------------~~~~~~-------------------------------------------------------- 256 (401) T protein:vir:44 245 TEESDK------------ARAFGK-------------------------------------------------------- 256 (401) T ss_pred cccccc------------cccccc-------------------------------------------------------- Confidence 000000 000000 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeE Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIP 316 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvp 316 (331) +..+ .....+... .+-++++++.++.....+.+|+||++.+.+|+.. .++.+.+.+.+.-..|. .-.+.|.| T Consensus 257 ~~~~-----~t~~~~~~~-~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~~g~-~~~l~G~P 328 (401) T protein:vir:44 257 LQHI-----VSGEATAVT-ADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLL-KDTEGNYLWRPGLELGQ-PSSLAGYG 328 (401) T ss_pred cccc-----ccccccccC-HHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHh-hccCCceeecCCcCCCC-Cceeccee Confidence 0000 000000000 2335566666665556678999999999999964 55555443333322332 34688999 Q ss_pred EEEEEeccC--CCcccC Q lcl|Aclame:pro 317 CRRTDALLL--TEARVV 331 (331) Q Consensus 317 ir~~dai~~--tE~~Vv 331 (331) |..+|++-. 
+...+| T Consensus 329 Vv~~~~~p~~~~~~~~i 345 (401) T protein:vir:44 329 IAENEQMPDIAADAKAI 345 (401) T ss_pred eEEecCcCCccCCccEE Confidence 999998742 333333 No 38 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.02 E-value=1.4e-10 Score=74.66 Aligned_cols=229 Identities=12% Similarity=0.013 Sum_probs=149.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) |++= .+.+.-.| +...|||.+.+.+.|++..+.++..++ ...+.+.++-|.++|..=++.++++..++ T Consensus 1 ma~~-gG~lvp~~----------~~~~ii~~~~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~f 68 (298) T protein:vir:16 1 MVLN-KGTLFDPT----------LVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTL 68 (298) T ss_pred Cccc-Ccceechh----------HHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEecCCccccccccce Confidence 7752 23333233 235699999999999999998876543 46788899999999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGN-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn-~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) .+++-..+-+++.+.|.+.+.+...+ ..++...-.....+++++.+..+|+||....+ T Consensus 69 ~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~--------------------- 127 (298) T protein:vir:16 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRL--------------------- 127 (298) T ss_pred eEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC--------------------- Confidence 99999999999999999988766543 34555555566899999999999999942111 Q ss_pred 
eeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeec Q lcl|Aclame:pro 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) Q Consensus 160 vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~N 239 (331) |+... ..++ .. .. + -..+ T Consensus 128 -----g~~~~-------------~~~~---------~~--------~~-----------------~----------~~~~ 145 (298) T protein:vir:16 128 -----GTASA-------------VIGT---------NH--------FD-----------------S----------KVTQ 145 (298) T ss_pred -----Ccccc-------------cccc---------cc--------cc-----------------c----------cccc Confidence 00000 0000 00 00 0 0000 Q ss_pred cccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEE Q lcl|Aclame:pro 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) Q Consensus 240 Id~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~ 319 (331) . ....++..++.+.+.+++.++......+..|+||++.+..|+.+ .+..+... -+....+...-.++|+||.. T Consensus 146 ~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~i-~~~~~~~~~~~~l~G~PV~~ 218 (298) T protein:vir:16 146 K-----VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQ-KDLQDNAL-FPELKWGATPDTINGLPVDV 218 (298) T ss_pred c-----cccccccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHh-hccCCCee-ecCcccCCCCceecceeeEE Confidence 0 00011123344556677777765556667899999999999865 45544432 23332333345789999999 Q ss_pred EEeccC----CCcccC Q lcl|Aclame:pro 320 TDALLL----TEARVV 331 (331) Q Consensus 320 ~dai~~----tE~~Vv 331 (331) ++++.. 
++..++ T Consensus 219 ~~~v~~~~~~~~~~~~ 234 (298) T protein:vir:16 219 NKTVSDMSLTQRDRAI 234 (298) T ss_pred ecccccccCCCccEEE Confidence 998853 222333 No 39 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.99 E-value=4.1e-11 Score=77.54 Aligned_cols=235 Identities=17% Similarity=0.184 Sum_probs=142.9 Q ss_pred CCcCc-----cccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE------EeecCCcceeec Q lcl|Aclame:pro 1 MPTLS-----TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV------RSGLPTGTWRKL 69 (331) Q Consensus 1 M~~l~-----~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~------~~~lP~~~fR~l 69 (331) |-.|. ...+|+.+........... ...|+.+.++|+||..+..+...+. |.... ++..++..|..- T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s--~~~~i~~i~~g~~~~~~~~~~~~ 77 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKS--YEVDISRISLGVELEPGRNTSGT 77 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCc--cceeecccccCcccccccccccC Confidence 43321 2234555544333333333 4688999999999999998754221 22211 122344455554 Q ss_pred CCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCC---h--hhhcC Q lcl|Aclame:pro 70 NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSID---A--EKFMG 144 (331) Q Consensus 70 N~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~---~--~~F~G 144 (331) ....+++..++.++.-.++-+...+.|-..+.+-+--..++...-..++.+.++...+..|||||.+.. 
| +.++| T Consensus 78 ~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G 157 (314) T protein:vir:41 78 KVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDG 157 (314) T ss_pred CccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchh Confidence 455678899999999999999999999666654432112455666677999999999999999996431 1 23444 Q ss_pred chhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeee Q lcl|Aclame:pro 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDI 224 (331) Q Consensus 145 L~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~ 224 (331) +-+.- +..++ +. T Consensus 158 ~l~~a-------~~~~~---------------------------------------------~~---------------- 169 (314) T protein:vir:41 158 WMKLA-------GNQYT---------------------------------------------DA---------------- 169 (314) T ss_pred hhhhc-------cccee---------------------------------------------ec---------------- Confidence 32210 00000 00 Q ss_pred ceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC---CCCceEEEeCHHHHHHHHHHhhcCCcceeeee Q lcl|Aclame:pro 225 GLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV---GMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) Q Consensus 225 Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~---~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~ 301 (331) .+.+.....+++++++..+|.. ..++.+||||++....++.+..++... +-. 
T Consensus 170 -----------------------~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~--l~~ 224 (314) T protein:vir:41 170 -----------------------EPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETG--LGD 224 (314) T ss_pred -----------------------CccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCc--ccc Confidence 0011122356677777888763 346789999999999998876666543 222 Q ss_pred cccCCceeEEEcCeEEEEEEeccC---CCcccC Q lcl|Aclame:pro 302 EEIAGKKVVAFDGIPCRRTDALLL---TEARVV 331 (331) Q Consensus 302 ~~~~g~~v~~~~gvpir~~dai~~---tE~~Vv 331 (331) .-..|.....+.|+||..|..+.. .+..+. T Consensus 225 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~ 257 (314) T protein:vir:41 225 SALIGATGLQYDGIPIQYVPALDALGDDKARAL 257 (314) T ss_pred hhhhCCCCceecceeeEecccccccCCCCceEE Confidence 233344566788999999988732 333333 No 40 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.99 E-value=2.4e-10 Score=73.30 Aligned_cols=230 Identities=11% Similarity=-0.004 Sum_probs=150.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccceE Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t~ 80 (331) |+.-.+..=+| -+. 
.+...|||.+.+.|.|++..+.+...++ ..++.+.++-|.++|..=++.+++++.++ T Consensus 1 ma~~t~~~G~l-------ip~-~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f 71 (300) T protein:vir:95 1 MSEAQLSKGNL-------FNP-ELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKKTHGGVSL 71 (300) T ss_pred CcccccCCcce-------ech-hhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccccccc Confidence 77643322111 122 2456799999999999999888765433 46788888899999999999999999999 Q ss_pred EEEEEEEEEecchhhhhHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 81 VQVKDSMGMLETYAEVDKALADLNGN-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 81 ~~~~~~l~ilgg~~eVDr~la~~~gn-~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) .+++-..+-+++.+.|-+.+.+...+ ..++-..-.....++++..+..+||||+...+ T Consensus 72 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~--------------------- 130 (300) T protein:vir:95 72 DPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRT--------------------- 130 (300) T ss_pred eeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCC--------------------- Confidence 99999999999999999988765433 33454555566899999999999999963211 Q ss_pred eeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeec Q lcl|Aclame:pro 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) Q Consensus 160 vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~N 239 (331) |++.. +.+.. ..+.. . 
.+ T Consensus 131 -----g~~~~----------------~~~~~--------------~~~~~-----------------~----------~~ 148 (300) T protein:vir:95 131 -----KQAST----------------IIGDN--------------CFDKK-----------------V----------TQ 148 (300) T ss_pred -----CCCcc----------------ccccc--------------ccccc-----------------c----------ce Confidence 11100 00000 00000 0 00 Q ss_pred cccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEE Q lcl|Aclame:pro 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) Q Consensus 240 Id~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~ 319 (331) + .+.++.+..+.+.+++.++.....-+.+|.||++....|+.. .+..+.. +-+....+...-.+.|+||.. T Consensus 149 --~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l-kd~~G~~-i~~~~~~~~~~~~l~G~Pv~~ 219 (300) T protein:vir:95 149 --T-----VPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKM-KNAEGGK-LYPELAWGGVPDAINGLAVDK 219 (300) T ss_pred --e-----ecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHh-hccCCCe-eccCccccCCCceecceeeEE Confidence 0 001122334567777777765555567899999999999865 3444443 333444444556799999999 Q ss_pred EEeccCC----CcccC Q lcl|Aclame:pro 320 TDALLLT----EARVV 331 (331) Q Consensus 320 ~dai~~t----E~~Vv 331 (331) ++++... 
+..++ T Consensus 220 s~~v~~~~~~~~~~~~ 235 (300) T protein:vir:95 220 NRTVSYSQTDPKNTAI 235 (300) T ss_pred ecCCCCCCCCCccEEE Confidence 9998432 22222 No 41 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.98 E-value=4.5e-10 Score=71.82 Aligned_cols=231 Identities=16% Similarity=0.137 Sum_probs=146.7 Q ss_pred CCcCccccccHHHHHHhc----------------CCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARM----------------TPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTG 64 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~----------------~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~ 64 (331) |+ ||.|++... -+. .+...|||.+.+.|+|+...+.+... +..+++.+.++-|.+ T Consensus 1 ~~-------~~~e~~~~~~~~~~~~~~~~~~~~liP~-~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a 71 (338) T protein:vir:78 1 MA-------TLNELAPNTAGSNHQGRLAHVPSDLLPK-EIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEV 71 (338) T ss_pred Cc-------chHHhhhhhcccccccceecccccccch-HHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccc Confidence 55 455554331 121 13357999999999999999998765 346888888887776 Q ss_pred ceee--------cCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Q lcl|Aclame:pro 65 TWRK--------LNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSS 136 (331) Q Consensus 65 ~fR~--------lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~ 136 (331) +|.. =++..++++.++.+++-..+-+++.+.|-+.+.+... .++...-...+.++++..+...||+||.. 
T Consensus 72 ~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~a~~~~~d~~~l~G~g~ 149 (338) T protein:vir:78 72 GQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP--SGLYTKLQADLAYAIGRGIDLAVFHGKSP 149 (338) T ss_pred eeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 6553 4566888999999999999999999999888776643 22333333568999999999999999876 Q ss_pred CChhhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEE Q lcl|Aclame:pro 137 IDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGY 216 (331) Q Consensus 137 ~~~~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~ 216 (331) ..++++.|+..= ++..+ T Consensus 150 ~~~~~~~gi~~~--------------~~~~~------------------------------------------------- 166 (338) T protein:vir:78 150 LTGSALQGIDTN--------------NVIVN------------------------------------------------- 166 (338) T ss_pred Cccccccccccc--------------ccccc------------------------------------------------- Confidence 655555544210 00000 Q ss_pred EEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhh-cCCCCceEEEeCHHHHHHHHHH--hhcC Q lcl|Aclame:pro 217 RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP-NVGMGRPAFYMPRKIRSFLRRQ--ITNK 293 (331) Q Consensus 217 ~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip-~~~~g~~~~y~n~~v~~~L~~~--~~~~ 293 (331) ....+. . .+....+.+.+.+++.++. +....+.+|.||++....|... ..+. 
T Consensus 167 --------------------~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~ 221 (338) T protein:vir:78 167 --------------------TTNVDY--L---QTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDA 221 (338) T ss_pred --------------------cccccc--c---cccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccC Confidence 000000 0 0111234566777777664 3444567899999999988643 2344 Q ss_pred CcceeeeecccCCceeEEEcCeEEEEEEeccC-------CCcccC Q lcl|Aclame:pro 294 VAASTLTMEEIAGKKVVAFDGIPCRRTDALLL-------TEARVV 331 (331) Q Consensus 294 ~~~~~~~~~~~~g~~v~~~~gvpir~~dai~~-------tE~~Vv 331 (331) .+...+ ++...+...-.+.|+||..+++|-. +...++ T Consensus 222 ~g~~l~-~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~ 265 (338) T protein:vir:78 222 NGNVDP-TRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVV 265 (338) T ss_pred CCceee-cccccCCCCceeeeeeEEEccccCccccccCCcccEEE Confidence 444322 3333333356799999999998742 122222 No 42 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.97 E-value=2.3e-10 Score=73.38 Aligned_cols=241 Identities=15% Similarity=0.187 Sum_probs=146.4 Q ss_pred CCcCccccccHHHH---HHh-------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecC Q lcl|Aclame:pro 1 MPTLSTTNPTLADV---AAR-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLN 70 (331) Q Consensus 1 M~~l~~~a~TL~E~---Ak~-------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN 70 (331) |..-....++-.|. ... +-+. 
.+...|++.+.+.++|++..+.+...++ .+.+.+.++-+.++|..=+ T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~-~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~ 167 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPE-ELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTSGWVGET 167 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccH-hHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcceeeeccc Confidence 21100011111111 000 1121 2456799999999999999888776544 5788888999999999988 Q ss_pred CccCccc-ceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 71 YGVQPEK-SRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 71 ~g~~~s~-~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) +..++++ +++.+++-..+-+++.+.|-+.+.+... +..++- .....+++++.+..+|++||-...| .||-.. T Consensus 168 ~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~l~~~i~~~~~~a~l~G~G~~~p---~Gil~~ 241 (407) T protein:vir:48 168 DARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWI---NSELALEFAEQEEIAFTSGDGSKKP---KGFLAY 241 (407) T ss_pred ccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHH---HHHHHHHHHHHHHhhhhccCCCCcc---ceeeec Confidence 8888765 7999999999999999999999988754 444443 3457889999999999999865443 344211 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) - + . .+. +.+ ..||. 
T Consensus 242 ~---~---~-------~~~-~~~----~~~~~------------------------------------------------ 255 (407) T protein:vir:48 242 E---S---T-------DED-DKT----RAFGK------------------------------------------------ 255 (407) T ss_pred c---c---c-------ccc-ccc----ccccc------------------------------------------------ Confidence 0 0 0 000 000 00000 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) +..+.. .+.+..+ .+-|+++++.++.....+..|+||++.+..|+.. .++.+.+.+.+.-..|. T Consensus 256 --------~~~~~~-----~~~~~~~-~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~l-kD~~Gr~l~~~~~~~g~- 319 (407) T protein:vir:48 256 --------LQHIAS-----GAASGVT-ADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLL-KDNDGNYLWRPGIELGQ- 319 (407) T ss_pred --------cccccc-----ccccccC-hHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHh-hccCCceeeccCcCCCC- Confidence 001100 0000011 2335566666766666778999999999999864 45555443434323333 Q ss_pred eEEEcCeEEEEEEeccC--CCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLL--TEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~--tE~~Vv 331 (331) .-.+.|.||..+|.|-. 
+...+| T Consensus 320 ~~~l~G~PV~~~~~~p~~~~~~~~i 344 (407) T protein:vir:48 320 PSSLAGYGIVENEQMPDIAADAKAI 344 (407) T ss_pred CceecceeeEEecCcCCccCCccEE Confidence 34678999999998743 222222 No 43 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.93 E-value=3.1e-10 Score=72.73 Aligned_cols=230 Identities=12% Similarity=0.066 Sum_probs=138.4 Q ss_pred CCc------Cc--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcce-ecccCCccceeEEEeecCCcceeecCC Q lcl|Aclame:pro 1 MPT------LS--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTV-IEANGFTEHKTTVRSGLPTGTWRKLNY 71 (331) Q Consensus 1 M~~------l~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf-~e~n~~~~~~~~~~~~lP~~~fR~lN~ 71 (331) +.- .. ....|...-.. +.+. .+...+|+.+.+.+.+|..+.- ....++..+...+.++-|.++|..=++ T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~-~~~~-~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPN-VLSR-TLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETA 174 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCc-cccc-cchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccc Confidence 000 00 00000000000 1111 1234567766777777776654 344444456677788889999999999 Q ss_pred ccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhc Q lcl|Aclame:pro 72 GVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFN 150 (331) Q Consensus 72 g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~ 150 (331) .+++++.++.+++-..+-+.+.+.|.+.+.+... +..++ -.....++++..+..+|||||-...|+++ -. 
T Consensus 175 ~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gi---l~--- 245 (392) T protein:vir:13 175 EIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGF---LVSDAGPAIGDAMGRHFLTGTGTGQPRGI---LT--- 245 (392) T ss_pred cccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHH---HHHHHHHHHHHHHHHHHhcccCCcccccc---cc--- Confidence 9999999999999999999999999999988754 33333 33457899999999999999854433332 10 Q ss_pred cccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEec Q lcl|Aclame:pro 151 SLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRD 230 (331) Q Consensus 151 ~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d 230 (331) .. ++.+ ..+ T Consensus 246 ~~-------------~~~~-~~~--------------------------------------------------------- 254 (392) T protein:vir:13 246 DA-------------TGAN-AAF--------------------------------------------------------- 254 (392) T ss_pred cc-------------cccc-ccc--------------------------------------------------------- Confidence 00 0000 000 Q ss_pred ccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeE Q lcl|Aclame:pro 231 WRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVV 310 (331) Q Consensus 231 ~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~ 310 (331) . .+++.....+-|++++..++...+.+.+|+||++.+.+|+.. 
.+..+...+.+.-..| ..- T Consensus 255 ---------------~-~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~l-kd~~G~~l~~~~~~~g-~~~ 316 (392) T protein:vir:13 255 ---------------G-EADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKL-KDANGQYLWQSALTVG-APD 316 (392) T ss_pred ---------------c-ccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHh-hccCCceeecCCcCCC-CCc Confidence 0 000111113335556666665556678999999999999864 5555544343433333 335 Q ss_pred EEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~tE~~Vv 331 (331) .+.|+||..+|++-..+ .++ T Consensus 317 ~l~G~Pv~~~~~~~~~~-i~~ 336 (392) T protein:vir:13 317 TFNGKVVETDDGMPADK-VLF 336 (392) T ss_pred eecceeeEEcCCCCCCc-EEE Confidence 78999999999985433 122 No 44 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.93 E-value=1.7e-10 Score=74.13 Aligned_cols=227 Identities=13% Similarity=0.099 Sum_probs=139.9 Q ss_pred CCc-C-c--cccccHHHHH-HhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeec---CCc Q lcl|Aclame:pro 1 MPT-L-S--TTNPTLADVA-ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKL---NYG 72 (331) Q Consensus 1 M~~-l-~--~~a~TL~E~A-k~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~l---N~g 72 (331) |.. . . ..+++..... --+-+. .+...||+.+.+.++|......+..+ ..+.+.+...-+++.|... +.. 
T Consensus 131 l~~~~~~~e~~a~~~~t~~GG~lvP~-~~~~~Ii~~l~~~~~i~~~~~~~~~~--~~~~~p~~~~~~~a~~~~~~~e~~~ 207 (434) T protein:vir:62 131 IVGNIDEKEARALGLVTGNGSVTIPD-FLSKEIITYAQEENFLRRLGTGVKTK--ENIKYPVLVKKAEAQGHKNERTNNE 207 (434) T ss_pred hccccchhhhhhhcccccccceecch-hhHHHHHHhhhhhhhhhhhcceeccC--CceEEEEEecCCcccceeccccccc Confidence 000 0 0 0000000000 001122 23456999999999998887765543 2466777777888888644 567 Q ss_pred cCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcc Q lcl|Aclame:pro 73 VQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNS 151 (331) Q Consensus 73 ~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~ 151 (331) .+.+..++.+++-..+-+++.+.|.+.+.+... +..++- .....++++..+...||+||-..+| T Consensus 208 ~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~la~~~~~~~d~~~l~G~G~~~~------------ 272 (434) T protein:vir:62 208 MPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIV---MDELKKAYVRKETQYMVNGDEANNI------------ 272 (434) T ss_pred ccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHH---HHHHHHHHHHHHHHHHhccCCCCcc------------ Confidence 788889999999999999999999999988755 444443 3457899999999999999733221 Q ss_pred ccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecc Q lcl|Aclame:pro 152 LSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDW 231 (331) Q Consensus 152 ~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~ 231 (331) ++ |+.. .. 
+ T Consensus 273 --------------~~-----------------g~~~---~~-------------------------------~------ 281 (434) T protein:vir:62 273 --------------ND-----------------GALA---KK-------------------------------A------ 281 (434) T ss_pred --------------cc-----------------ceee---cc-------------------------------c------ Confidence 00 0000 00 0 Q ss_pred cceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeee-cccCCceeE Q lcl|Aclame:pro 232 RYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM-EEIAGKKVV 310 (331) Q Consensus 232 r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~-~~~~g~~v~ 310 (331) +. .+++.....+.+++++..++.....+.+|+||+....+|+.. .+..+.+.+.+ ....+-... T Consensus 282 --------~~------~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~~~~g~~~ 346 (434) T protein:vir:62 282 --------VE------FKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETM-KTDDGFPLLRPFNQAEGGIGY 346 (434) T ss_pred --------cc------ccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHh-hccCCCEeeccCCCccCCCCc Confidence 00 011112234556677777776667788999999999999975 45554443332 223333345 Q ss_pred EEcCeEEEEEEeccCCCc---ccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLLTEA---RVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~tE~---~Vv 331 (331) .++|.||..++.+...++ .++ T Consensus 347 tl~G~pV~~~~~~~~~~~~~~~~i 370 (434) T protein:vir:62 347 TLLGFPVEEEDAIDIPDSPDTPVF 370 (434) T ss_pred eecceeeEEecCccCccCCCceEE Confidence 689999999999843221 222 No 45 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.92 E-value=2.4e-10 Score=73.35 Aligned_cols=227 Identities=13% Similarity=0.014 Sum_probs=139.7 Q ss_pred CCcCc--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MPTLS--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) Q Consensus 1 
M~~l~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 78 (331) |++-. .+-.++ +. .+...||+.+.+.|+|++..+.+...++ ..++.+.++-|+++|..=++.++.++. T Consensus 1 Ma~~~~~~gg~~v--------P~-~~~~~ii~~l~~~s~i~~l~~~i~~~~~-~~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) T protein:vir:80 1 MADDFLSAGKLEL--------PG-SMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) T ss_pred CCCCcCCcCceEc--------ch-HHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeCCcccccccc Confidence 77532 122222 22 2456799999999999999988876543 578899999999999999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCH--HHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNS--AAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~--~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) ++.+++-..+-+++.+.|-+.+.+..... ..++..-...+.++++..+...+|||+....+....|+.. T Consensus 71 ~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~--------- 141 (315) T protein:vir:80 71 DVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHT--------- 141 (315) T ss_pred ceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccc--------- Confidence 99999999999999999988887665432 2355555667899999999999999963222111111100 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) ..+. . 
T Consensus 142 -------~~~~----------------------~---------------------------------------------- 146 (315) T protein:vir:80 142 -------SLNK----------------------T---------------------------------------------- 146 (315) T ss_pred -------cccc----------------------c---------------------------------------------- Confidence 0000 0 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhh-cCCCCceEEEeCHHHHHHHHHHhhcCC---cceeeeecccCCceeEEE Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIP-NVGMGRPAFYMPRKIRSFLRRQITNKV---AASTLTMEEIAGKKVVAF 312 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip-~~~~g~~~~y~n~~v~~~L~~~~~~~~---~~~~~~~~~~~g~~v~~~ 312 (331) ...++. . ..+-.+|. +++.++. +....+..|.||++.+..|+....... +...+-++-..| -.-.+ T Consensus 147 ~~~~~~---~--~~~~~d~~----~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g-~~~tl 216 (315) T protein:vir:80 147 KNIVDA---T--DSATADLV----KAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFA-GLDNW 216 (315) T ss_pred cceeec---c--ccchHHHH----HHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccC-CCcee Confidence 000000 0 01112232 3333332 222334579999999999986532221 111112222222 22478 Q ss_pred cCeEEEEEEeccCCC-----ccc--C Q lcl|Aclame:pro 313 DGIPCRRTDALLLTE-----ARV--V 331 (331) Q Consensus 313 ~gvpir~~dai~~tE-----~~V--v 331 (331) .|.||..++++.... 
..+ + T Consensus 217 ~G~PV~~~~~~~~~~~~~~~~~~~~~ 242 (315) T protein:vir:80 217 RGLNVGASSTVSGAPEMSPASGVKAI 242 (315) T ss_pred cceeeEecCcCCcccccccccccEEE Confidence 999999999984321 111 1 No 46 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.92 E-value=3.9e-10 Score=72.14 Aligned_cols=229 Identities=13% Similarity=0.096 Sum_probs=137.5 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHH-hccchhHhhcceecccCCccceeEEEee--------cCCcceeecCC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEML-NETNEILDDMTVIEANGFTEHKTTVRSG--------LPTGTWRKLNY 71 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l-~~~s~iL~~lpf~e~n~~~~~~~~~~~~--------lP~~~fR~lN~ 71 (331) ++.-.+...+. .+.+ .....+|..+ ...+.|.+.+......++ .+.|.++++ -+.++|..=++ T Consensus 123 ~~~~~~~~~~~-----~~~p--~~~~~~i~~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) T protein:vir:94 123 APAGTITNPNV-----PHLP--QLVPGIVPTTPDLPLLVADLLDQQNADYN-VLEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) T ss_pred cccccccCCcc-----cccc--hhhhHHHHHHHhhhhhhhhcceeeeccCC-ceeeeeeccccccccccCcccceecCCc Confidence 11100000000 0011 1223444444 444455666776665433 355655544 34577999899 Q ss_pred ccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcc Q lcl|Aclame:pro 72 GVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNS 151 (331) Q Consensus 72 g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~ 151 (331) ..++++.++.+++-.++-+++.+.|.+.+.+-.+ .+.+.-.....++++..+...|||||...+|+ |+-.- T Consensus 195 ~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~---Gi~~~--- 265 (419) T protein:vir:94 195 AKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQ---GILTT--- 265 (419) T ss_pred cccccccceeeEEeeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHHHhccCccccc---ceecc--- Confidence 
9999999999999999999999999999887443 34444455589999999999999998654443 22110 Q ss_pred ccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecc Q lcl|Aclame:pro 152 LSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDW 231 (331) Q Consensus 152 ~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~ 231 (331) .+...+.. + T Consensus 266 ----~~~~~~~~------------------------~------------------------------------------- 274 (419) T protein:vir:94 266 ----PGIGTYQQ------------------------P------------------------------------------- 274 (419) T ss_pred ----cccccccc------------------------c------------------------------------------- Confidence 00000000 0 Q ss_pred cceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEE Q lcl|Aclame:pro 232 RYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVA 311 (331) Q Consensus 232 r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~ 311 (331) ... ...+.....+-+.+++..+......+.+|+||++....|+......... .+.+....+...-. T Consensus 275 ------------~~~-~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~-~~~~~~~~~~~~~~ 340 (419) T protein:vir:94 275 ------------KPT-APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGV-FRVIANVQGEATPR 340 (419) T ss_pred ------------ccc-cccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCc-eeecCCcccCCCcc Confidence 000 0111223456678888887766677789999999999998664333332 23444444444557 Q ss_pred EcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 312 FDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 312 ~~gvpir~~dai~~tE~~Vv 331 (331) ++|+||..++.+-.++..+. 
T Consensus 341 l~G~pV~~~~~~~~~~~~~g 360 (419) T protein:vir:94 341 IWGLNVVSTVAIAQGTALVG 360 (419) T ss_pred ccceeeEEcCCCCCccEEEe Confidence 89999999999865542222 No 47 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.91 E-value=3.4e-10 Score=72.46 Aligned_cols=230 Identities=11% Similarity=0.163 Sum_probs=133.8 Q ss_pred CCc-Cc--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhh-cceecccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPT-LS--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDD-MTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~-l~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~-lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 76 (331) +.. .+ .+..|-.+ .-.+-++ .+...||+.+.+.++|++. ...+...++ ...+.+.++-|.+.|..=++.++++ T Consensus 124 ~~~~~~~~~~~~t~~~-gg~~vP~-~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~ 200 (435) T protein:vir:14 124 FGEEVAMSLNTLSPGA-GGVLVPE-NLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLKGGAIVGYIGADTDIPTT 200 (435) T ss_pred hhhhhhhhcccCCcCC-Cccccch-hHHHHHHHHHhhhchhhhhcceeeecCCC-ceEEEEEeCCcceeeeccCcccccc Confidence 000 00 00000000 0001122 2345699999998988875 445454433 4678888899999999999999999 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) ..++.+++-.++-+++.+.|-+.+.+..+ +++ +...-.....+++++++...|++||...+ .+.||... 
T Consensus 201 ~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~-l~~~i~~~l~~ai~~~~d~a~l~G~G~~~--~p~Gi~~~------- 270 (435) T protein:vir:14 201 QQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN-VDQIVVGDLTAAIGAREDKAFIRDDGTAN--TPKGLRFW------- 270 (435) T ss_pred ccceeEEEeeeEEEEEeehhhHHHHHhhccCHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCc--cccceeec------- Confidence 99999999999999999999988877654 322 33333445789999999999999974322 23333100 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) .... T Consensus 271 ----------------------------------------------------------------------------~~~~ 274 (435) T protein:vir:14 271 ----------------------------------------------------------------------------ALPS 274 (435) T ss_pred ----------------------------------------------------------------------------cccc Confidence 0000 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcC--CCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEc Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNV--GMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFD 313 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~--~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~ 313 (331) -+... + ++.+...+...+.+.+..+... ...+.+|+||++....|+.. .+......+ ++...| .++ T Consensus 275 ~~~~~--~----~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~-~~~~~g----~l~ 342 (435) T protein:vir:14 275 NVITA--S----DASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGL-RDGNGNKVY-PELANG----MLK 342 (435) T ss_pred ceecc--c----cccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHh-hccCCceec-cCCCCC----eee Confidence 00000 0 0011111122233333333322 33457899999999999865 344444322 333333 578 Q ss_pred CeEEEEEEeccC------CCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLL------TEARVV 331 (331) Q Consensus 314 gvpir~~dai~~------tE~~Vv 331 (331) |+||..++.+-. 
++..|+ T Consensus 343 G~Pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:14 343 GYPVGKTTQVPINLGETGKESEIY 366 (435) T ss_pred cceeEeeccccccccCCCccceEE Confidence 999999998732 233333 No 48 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.91 E-value=1.1e-09 Score=69.79 Aligned_cols=231 Identities=15% Similarity=0.090 Sum_probs=145.7 Q ss_pred CCcCccccccHHHHHHhc-C--C------------ccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARM-T--P------------DGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGT 65 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~-~--~------------~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~ 65 (331) |+ +|.|+.... + . ...+...|||.+.+.++|++..+.+...+ ..+++.+.++-|.++ T Consensus 1 ~a-------~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~ 72 (333) T protein:vir:78 1 MA-------TLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVG 72 (333) T ss_pred Cc-------hhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeE Confidence 66 355553211 0 0 11234669999999999999999988654 357889999999998 Q ss_pred eeec--------CCccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Q lcl|Aclame:pro 66 WRKL--------NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSS 136 (331) Q Consensus 66 fR~l--------N~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~ 136 (331) |-.= ++..+.++.++.+++-..+-+++.+.|-+.+.+... +..++-. ....+++++.+...|||||.. 
T Consensus 73 ~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~---~~la~ai~~~~d~~~l~G~g~ 149 (333) T protein:vir:78 73 QVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQ---GDLAYAIGRGIDLAVFHGKSP 149 (333) T ss_pred eecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHH---HHHHHHHHHHHHHHHhcccCC Confidence 8543 244678889999999999999999999999887644 3333433 458899999999999999876 Q ss_pred CChhhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEE Q lcl|Aclame:pro 137 IDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGY 216 (331) Q Consensus 137 ~~~~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~ 216 (331) ..+.+|.|+..-- +. ...+ T Consensus 150 ~~~~~~~g~~~~~--------------~~--~~~~--------------------------------------------- 168 (333) T protein:vir:78 150 LTGSALQGIDTDN--------------VI--ANTT--------------------------------------------- 168 (333) T ss_pred CCCcccccccccc--------------cc--cccc--------------------------------------------- Confidence 6655555542200 00 0000 Q ss_pred EEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhc-CCCCceEEEeCHHHHHHHHHHhh--cC Q lcl|Aclame:pro 217 RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPN-VGMGRPAFYMPRKIRSFLRRQIT--NK 293 (331) Q Consensus 217 ~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~-~~~g~~~~y~n~~v~~~L~~~~~--~~ 293 (331) +++ +.++++....+.+.+++..++. ......+|.||.+....|+.... |. 
T Consensus 169 ----------------------~~~-----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~ 221 (333) T protein:vir:78 169 ----------------------NVD-----YLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDA 221 (333) T ss_pred ----------------------ccc-----ccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCC Confidence 000 0011112234446666666642 23445689999999988875432 33 Q ss_pred CcceeeeecccCCceeEEEcCeEEEEEEeccC-CCc------ccC Q lcl|Aclame:pro 294 VAASTLTMEEIAGKKVVAFDGIPCRRTDALLL-TEA------RVV 331 (331) Q Consensus 294 ~~~~~~~~~~~~g~~v~~~~gvpir~~dai~~-tE~------~Vv 331 (331) .... +-+....+-..-.+.|+||..+++|-. ..+ .++ T Consensus 222 ~G~~-i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~ 265 (333) T protein:vir:78 222 NGNV-DPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRII 265 (333) T ss_pred CCce-eecCccccCCCceeeceeeEEccccCCCccccCCCccEEE Confidence 3333 223233333345788999999999853 222 222 No 49 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.90 E-value=3e-10 Score=72.78 Aligned_cols=234 Identities=9% Similarity=0.051 Sum_probs=139.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcc---- Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPE---- 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s---- 76 (331) ++.+.+. .|... ...+-+. 
.+...||+.+.+.++|++.++.+..+++ .+.+.++++-|.+.|..-+...+++ T Consensus 158 ~~a~~~~-~~~~~-g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~e~~~~~~~~~~~ 233 (458) T protein:vir:10 158 LKAVNQS-SSVEV-SSESYET-IFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWVAASTYGTDTTTGE 233 (458) T ss_pred hhhhhhc-ccCcc-ccceehh-hHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeecccccccccccccc Confidence 1111000 00000 0001121 2446799999999999999988877654 5778889999999999988877654 Q ss_pred --cceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc Q lcl|Aclame:pro 77 --KSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) Q Consensus 77 --~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~ 153 (331) +.++.+++-..+-+++.+.|.+.+.+... +...+- .....++++......|||||-...| .|+... T Consensus 234 ~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i---~~~l~~~i~~~~d~~~l~G~G~~~p---~Gi~~~----- 302 (458) T protein:vir:10 234 EVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLL---RKRLIEAHAVSIEEAFMTGDGSGKP---KGLLTL----- 302 (458) T ss_pred cccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHH---HHHHHHHHHHHHHHHhhcCCCCCcc---ceeeec----- Confidence 56899999999999999999999877653 444443 3447899999999999999743222 222110 Q ss_pred ccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 154 ~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) ++++ . 
T Consensus 303 ---------~~~~------------------------------------------------------------------~ 307 (458) T protein:vir:10 303 ---------ASED------------------------------------------------------------------S 307 (458) T ss_pred ---------cccc------------------------------------------------------------------c Confidence 0000 0 Q ss_pred eeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeee---cccCCceeE Q lcl|Aclame:pro 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM---EEIAGKKVV 310 (331) Q Consensus 234 v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~---~~~~g~~v~ 310 (331) ..-+...... .....+ .+-|++++..++.....+..|+||++....|+.. .+........+ .....-..- T Consensus 308 ~~~~~~~~~~--~~~~~~----~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~i~~~~~~~~~~~~~~~ 380 (458) T protein:vir:10 308 AKVVTEAKAD--GSVLVT----AKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLL-EDEEWQDVAQVGNDSVKLQGQVG 380 (458) T ss_pred cceeeccccc--cccccc----HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhh-cccCCceeeccccccccccCcCc Confidence 0000000000 000011 2335566677777777889999999999999853 33333222111 111112223 Q ss_pred EEcCeEEEEEEeccC--CCcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLL--TEARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~--tE~~Vv 331 (331) .++|+||..+|++-. 
+...++ T Consensus 381 ~l~G~pv~~~~~~p~~~~~~~~~ 403 (458) T protein:vir:10 381 RIYGLPVVVSEYFPAKANSAEFA 403 (458) T ss_pred eecceeeEEccccccccCCcceE Confidence 678999999999843 233333 No 50 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.88 E-value=6.3e-10 Score=71.01 Aligned_cols=230 Identities=12% Similarity=0.176 Sum_probs=136.1 Q ss_pred CCcCcccccc-----------HHHHHH-----------hcCCccchhHHHHHHHhccchhHhh-cceecccCCccceeEE Q lcl|Aclame:pro 1 MPTLSTTNPT-----------LADVAA-----------RMTPDGKIDPQIVEMLNETNEILDD-MTVIEANGFTEHKTTV 57 (331) Q Consensus 1 M~~l~~~a~T-----------L~E~Ak-----------~~~~~~~~~~~VIE~l~~~s~iL~~-lpf~e~n~~~~~~~~~ 57 (331) |..- ..... -.+.+. .+-++ .+...|||.+.+.++|+.. ..++....+ ..++.+ T Consensus 105 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~-~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~p~ 181 (435) T protein:vir:80 105 LAAA-RGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPE-NLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPR 181 (435) T ss_pred HHhc-cchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccch-hHHHHHHHHHhhhchhhhccceeeecCCC-ceEEEE Confidence 0000 00000 000000 01122 2445699999999988875 445555544 478888 Q ss_pred EeecCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Q lcl|Aclame:pro 58 RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSS 136 (331) Q Consensus 58 ~~~lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~ 136 (331) .++-|.+.|..=++.++++..++.+++-..+-+++.+.|.+.+.+..+ +.+ +...-.....++++..+..+||+||.. 
T Consensus 182 ~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~-l~~~i~~~l~~a~~~~~d~a~l~G~G~ 260 (435) T protein:vir:80 182 LKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN-VDQIVVGDLTAAIGAREDKAFIRDDGT 260 (435) T ss_pred EeCCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 999999999999999999999999999999999999999888776654 322 333344558999999999999999743 Q ss_pred CChhhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEE Q lcl|Aclame:pro 137 IDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGY 216 (331) Q Consensus 137 ~~~~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~ 216 (331) .+ ...||...- +.++ T Consensus 261 ~~--~p~Gi~~~~---------------~~~~------------------------------------------------ 275 (435) T protein:vir:80 261 AN--TPKGLRFWA---------------LPGN------------------------------------------------ 275 (435) T ss_pred CC--cccceeecc---------------cccc------------------------------------------------ Confidence 22 122321100 0000 Q ss_pred EEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC--CCCceEEEeCHHHHHHHHHHhhcCC Q lcl|Aclame:pro 217 RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV--GMGRPAFYMPRKIRSFLRRQITNKV 294 (331) Q Consensus 217 ~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~--~~g~~~~y~n~~v~~~L~~~~~~~~ 294 (331) .+..+. ..+...+...|.+++..+.+. ...+.+|+||++...+|+.. .++. 
T Consensus 276 ----------------------~~~~~~----~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l-kd~~ 328 (435) T protein:vir:80 276 ----------------------VITASD----GSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGL-RDGN 328 (435) T ss_pred ----------------------eeeccc----ccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhh-hccC Confidence 000000 011111222244444444322 34467899999999999864 3444 Q ss_pred cceeeeecccCCceeEEEcCeEEEEEEeccC------CCcccC Q lcl|Aclame:pro 295 AASTLTMEEIAGKKVVAFDGIPCRRTDALLL------TEARVV 331 (331) Q Consensus 295 ~~~~~~~~~~~g~~v~~~~gvpir~~dai~~------tE~~Vv 331 (331) +.. +-++...| .+.|+||..++.+-. ++..++ T Consensus 329 G~~-l~~~~~~~----~l~G~pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:80 329 GNK-VYPELANG----MLKGYPVGKTTQVPINLGEAGKESEIY 366 (435) T ss_pred Cce-eccCCCCC----eEeeeeeEEeccccccccCCCCcceEE Confidence 443 22333333 488999999999732 222333 No 51 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.87 E-value=5.4e-11 Score=76.87 Aligned_cols=236 Identities=16% Similarity=0.152 Sum_probs=134.1 Q ss_pred CCcC----------ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE----EeecCCcce Q lcl|Aclame:pro 1 MPTL----------STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV----RSGLPTGTW 66 (331) Q Consensus 1 M~~l----------~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~----~~~lP~~~f 66 (331) |=++ ...++|..+........... ..+|+.+.++|+||+.+..+...+...+.... 
....++..| T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~ 79 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDE 79 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCccccccccc Confidence 1111 01334444433333233333 45889999999999999875432221111111 112234445 Q ss_pred eecCCccCcccceEEEEEEEEEEecchhhhhHHHH-hhC--CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-Ch--h Q lcl|Aclame:pro 67 RKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALA-DLN--GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSI-DA--E 140 (331) Q Consensus 67 R~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la-~~~--gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~-~~--~ 140 (331) ..=....++++.++.++.-.++-+...+.|-..+. +-. .|.+++ -...+.++++...+..|||||.+. +| + T Consensus 80 ~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~---l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~ 156 (315) T protein:vir:41 80 TGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQK---IVTLLGEGISYVLEKYYLHGDTSSSDPLLR 156 (315) T ss_pred ccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHH---HHHHHHHHHHHHHHHHhhccCCcCcCcccc Confidence 55555667788999999999999999999955554 432 344444 445588999999999999998753 22 2 Q ss_pred hhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEE Q lcl|Aclame:pro 141 KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHY 220 (331) Q Consensus 141 ~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~ 220 (331) .++|+-+. |++. +. 
T Consensus 157 ~~~G~l~~--------------a~~~--------------------------------------~~-------------- 170 (315) T protein:vir:41 157 MSDGWLKL--------------ASEK--------------------------------------LT-------------- 170 (315) T ss_pred ccccceec--------------cccc--------------------------------------cc-------------- Confidence 33443210 0000 00 Q ss_pred EeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC---CCCceEEEeCHHHHHHHHHHhhcCCcce Q lcl|Aclame:pro 221 KWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV---GMGRPAFYMPRKIRSFLRRQITNKVAAS 297 (331) Q Consensus 221 ~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~---~~g~~~~y~n~~v~~~L~~~~~~~~~~~ 297 (331) ...++ ..+...-.+++++.+..||.. .+.+.+|+||++.+..++....++.+.. T Consensus 171 ----------------~~~~~-------~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~l 227 (315) T protein:vir:41 171 ----------------ESDVD-------PEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGL 227 (315) T ss_pred ----------------ccccc-------cccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCcc Confidence 00011 011111134556666666642 3457899999999999998766655432 Q ss_pred eeeecccCCceeEEEcCeEEEEEEeccC---CCcccC Q lcl|Aclame:pro 298 TLTMEEIAGKKVVAFDGIPCRRTDALLL---TEARVV 331 (331) Q Consensus 298 ~~~~~~~~g~~v~~~~gvpir~~dai~~---tE~~Vv 331 (331) -..-..+.....+.|+||..+++|-. .+..+. 
T Consensus 228 --w~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~il 262 (315) T protein:vir:41 228 --GDQALTGANSILYDGRPVQYVPALEALNDGKSRAL 262 (315) T ss_pred --ccchhhcCCCceecccceEecccccccCCCCccEE Confidence 22222233456788999999999843 222222 No 52 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.87 E-value=3.8e-10 Score=72.20 Aligned_cols=224 Identities=13% Similarity=0.113 Sum_probs=144.0 Q ss_pred CCcCcccc--ccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccc Q lcl|Aclame:pro 1 MPTLSTTN--PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) Q Consensus 1 M~~l~~~a--~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~ 78 (331) |-+-...+ .|...-+.-+-+. .+...|+|.+.+.++|++..+.....+.+...+.+.++-|.++|.+=++..++++. T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 79 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHK-EFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKP 79 (297) T ss_pred CCccccccccccccCCCcceech-hHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcccccccc Confidence 33211111 1111111112232 24467999999999999999998766555556667788889999999999999999 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... .++...-.....++++......+|+|+.+..|. 
T Consensus 80 ~f~~v~l~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~------------------ 139 (297) T protein:vir:95 80 EVVPVTLKAHKLGIILVTSREALNYTW--KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFAN------------------ 139 (297) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc------------------ Confidence 999999999999999999999888653 223333345578999999999999997432210 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) ||... .... . T Consensus 140 --------------------------gi~~~---------------~~~~-----------------------------~ 149 (297) T protein:vir:95 140 --------------------------SVAKA---------------AKDA-----------------------------N 149 (297) T ss_pred --------------------------ccccc---------------cccc-----------------------------c Confidence 11100 0000 0 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) ++.. +. .+ .+-|++++.++......+.+|+||++....|+.. .+..+...+ .... ..+.|+||. 
T Consensus 150 ~~~~-----~~---~t-~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-~d~~G~~i~--~~~~----~~l~G~Pv~ 213 (297) T protein:vir:95 150 KVIG-----GP---IN-YDNILKLQDALYDADVEPNAFVSKIQNRSALREA-RDGNKVSIY--DKAA----NTIDGITTV 213 (297) T ss_pred eecc-----cc---cC-HHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHh-hccCCceee--cCCC----CcccceeeE Confidence 0000 00 11 2335667777776666778999999999999864 444433222 2222 357899998 Q ss_pred EEEeccCCCcccC Q lcl|Aclame:pro 319 RTDALLLTEARVV 331 (331) Q Consensus 319 ~~dai~~tE~~Vv 331 (331) .+.+...+...++ T Consensus 214 ~~~~~~~~~~~~~ 226 (297) T protein:vir:95 214 DLKSARFEKGDLL 226 (297) T ss_pred eecCCCCCCceEE Confidence 8777665666655 No 53 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.84 E-value=9.1e-10 Score=70.15 Aligned_cols=232 Identities=14% Similarity=0.149 Sum_probs=137.7 Q ss_pred CCcCccc--cccHHHHHH---hcCCc------cchhHHHHHHHhccchhHhhcceecccCCc-cceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTT--NPTLADVAA---RMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFT-EHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~--a~TL~E~Ak---~~~~~------~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~-~~~~~~~~~lP~~~fR~ 68 (331) +...... ..+..|... ..+.+ ..+...|++.+.+.++|++.++......+. .+.|.+.++.+++.|.. 
T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~ 171 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLS 171 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecc Confidence 0000000 011111110 00111 124467999999999999999998765443 35677888999999999 Q ss_pred cCCccCccc--ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCch Q lcl|Aclame:pro 69 LNYGVQPEK--SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) Q Consensus 69 lN~g~~~s~--~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~ 146 (331) -++..+.+. .++.+++-+.+-+++.+.|-+.+.+... .++...-.....++++..+...|++|+...+ .+.||. T Consensus 172 e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~la~~~~~~~~~~il~G~g~~~--~~~gi~ 247 (404) T protein:vir:10 172 ENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFAD--KSLEDWIINWFVDKVRITRNAEILYGAGGDE--HATGIM 247 (404) T ss_pred ccccccccccccceeeeEeeheeeEeeehhhHHHHhhcH--HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC--ccccee Confidence 999888764 6799999999999999999998877643 1233333455789999999999999975332 233332 Q ss_pred hhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeece Q lcl|Aclame:pro 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) Q Consensus 147 ~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl 226 (331) .. .++. 
T Consensus 248 ~~----------------~~~~---------------------------------------------------------- 253 (404) T protein:vir:10 248 TA----------------NKFK---------------------------------------------------------- 253 (404) T ss_pred ec----------------cccc---------------------------------------------------------- Confidence 10 0000 Q ss_pred EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCC Q lcl|Aclame:pro 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG 306 (331) Q Consensus 227 ~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g 306 (331) .+ ... .+.+-.+|.+++.. .+++....+.+||||++....|+.. .+..+...+.+ ++.+ T Consensus 254 ------------~~---~~~-~~~~~~~~~~~~~~---~l~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~l~~~-~~~~ 312 (404) T protein:vir:10 254 ------------KI---TLP-KSPALKDFKKCKNV---ELLNVFKATSSWIVNQDGFNYLDSL-EDKTGRPYLQP-DPKD 312 (404) T ss_pred ------------ee---ecc-ccccHHHHHHHHHh---hhhccccCCCEEEEcHHHHHHHHHh-hccCCceeecc-CcCC Confidence 00 000 11111222222221 3445556678899999999999975 44444433333 3444 Q ss_pred ceeEEEcCeEEEEEEe-cc-CCCcc--cC Q lcl|Aclame:pro 307 KKVVAFDGIPCRRTDA-LL-LTEAR--VV 331 (331) Q Consensus 307 ~~v~~~~gvpir~~da-i~-~tE~~--Vv 331 (331) ...-.++|.||..++. +. .+... 
++ T Consensus 313 ~~~~~l~G~PV~~~~~~~~~~~~~~~~~~ 341 (404) T protein:vir:10 313 PTQYRFLGLPVIELPNDLLLSTESAIPVL 341 (404) T ss_pred CCCccccceeeEEecccccCCCCCccEEE Confidence 4445789999987654 32 22222 22 No 54 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.83 E-value=7.1e-10 Score=70.75 Aligned_cols=230 Identities=9% Similarity=0.027 Sum_probs=139.0 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) .-.......|... ..-+-+. .+...|++.+.+.++|++.+..+...++.+ +.+...++.+.+.|..=++.+++. .. T Consensus 116 ~~~~~~~~~~~~~-gg~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~ 193 (415) T protein:vir:98 116 RNDIQGGSLKTDS-GFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) T ss_pred hhhhhhccccccc-cccccch-HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCccccc Confidence 0000000111111 0111222 345679999999999999999887654432 333344566677888777778764 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-.++-+++.+.|.+.+.+...- ++...-.....++++......|++|+.+..+... 
T Consensus 194 ~~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~---------------- 255 (415) T protein:vir:98 194 PFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGST---------------- 255 (415) T ss_pred ceeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc---------------- Confidence 8999999999999999999999876441 3444444557889999999999999743322110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) +++.. ..+ . T Consensus 256 ------~~~~~------------------~~~-----------------------------------------------~ 264 (415) T protein:vir:98 256 ------SSGFE------------------KEG-----------------------------------------------K 264 (415) T ss_pred ------ccccc------------------ccc-----------------------------------------------c Confidence 00000 000 0 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) .. +.+....++-+++++..++.....+..|+||++....|+.. .+..+...+.+ ++.+...-.++|.||+ T Consensus 265 ~~--------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~G~~l~~~-~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:98 265 KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQP-DVKEKTQQRLLGAKIE 334 (415) T ss_pred cc--------ccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHh-hccCCceeecc-CcCCCCCceecceeeE Confidence 00 01111123445667777766666778999999999999864 55555443333 3444445589999999 Q ss_pred EEEeccC-CCcc--cC Q lcl|Aclame:pro 319 RTDALLL-TEAR--VV 331 (331) Q Consensus 319 ~~dai~~-tE~~--Vv 331 (331) .++++.. +... 
++ T Consensus 335 ~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:98 335 ILPDEVLGQKGNNTLI 350 (415) T ss_pred EecccccCCCCccEEE Confidence 9998742 2221 22 No 55 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.83 E-value=7.1e-10 Score=70.75 Aligned_cols=230 Identities=9% Similarity=0.027 Sum_probs=139.0 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) .-.......|... ..-+-+. .+...|++.+.+.++|++.+..+...++.+ +.+...++.+.+.|..=++.+++. .. T Consensus 116 ~~~~~~~~~~~~~-gg~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~ 193 (415) T protein:vir:79 116 RNDIQGGSLKTDS-GFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) T ss_pred hhhhhhccccccc-cccccch-HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCccccc Confidence 0000000111111 0111222 345679999999999999999887654432 333344566677888777778764 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-.++-+++.+.|.+.+.+...- ++...-.....++++......|++|+.+..+... 
T Consensus 194 ~~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~---------------- 255 (415) T protein:vir:79 194 PFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGST---------------- 255 (415) T ss_pred ceeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc---------------- Confidence 8999999999999999999999876441 3444444557889999999999999743322110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) +++.. ..+ . T Consensus 256 ------~~~~~------------------~~~-----------------------------------------------~ 264 (415) T protein:vir:79 256 ------SSGFE------------------KEG-----------------------------------------------K 264 (415) T ss_pred ------ccccc------------------ccc-----------------------------------------------c Confidence 00000 000 0 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) .. +.+....++-+++++..++.....+..|+||++....|+.. .+..+...+.+ ++.+...-.++|.||+ T Consensus 265 ~~--------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~G~~l~~~-~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:79 265 KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQP-DVKEKTQQRLLGAKIE 334 (415) T ss_pred cc--------ccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHh-hccCCceeecc-CcCCCCCceecceeeE Confidence 00 01111123445667777766666778999999999999864 55555443333 3444445589999999 Q ss_pred EEEeccC-CCcc--cC Q lcl|Aclame:pro 319 RTDALLL-TEAR--VV 331 (331) Q Consensus 319 ~~dai~~-tE~~--Vv 331 (331) .++++.. +... 
++ T Consensus 335 ~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:79 335 ILPDEVLGQKGNNTLI 350 (415) T ss_pred EecccccCCCCccEEE Confidence 9998742 2221 22 No 56 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.83 E-value=7.1e-10 Score=70.75 Aligned_cols=230 Identities=9% Similarity=0.027 Sum_probs=139.0 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) .-.......|... ..-+-+. .+...|++.+.+.++|++.+..+...++.+ +.+...++.+.+.|..=++.+++. .. T Consensus 116 ~~~~~~~~~~~~~-gg~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~ 193 (415) T protein:vir:81 116 RNDIQGGSLKTDS-GFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) T ss_pred hhhhhhccccccc-cccccch-HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCccccc Confidence 0000000111111 0111222 345679999999999999999887654432 333344566677888777778764 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-.++-+++.+.|.+.+.+...- ++...-.....++++......|++|+.+..+... 
T Consensus 194 ~~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~---------------- 255 (415) T protein:vir:81 194 PFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGST---------------- 255 (415) T ss_pred ceeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc---------------- Confidence 8999999999999999999999876441 3444444557889999999999999743322110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) +++.. ..+ . T Consensus 256 ------~~~~~------------------~~~-----------------------------------------------~ 264 (415) T protein:vir:81 256 ------SSGFE------------------KEG-----------------------------------------------K 264 (415) T ss_pred ------ccccc------------------ccc-----------------------------------------------c Confidence 00000 000 0 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) .. +.+....++-+++++..++.....+..|+||++....|+.. .+..+...+.+ ++.+...-.++|.||+ T Consensus 265 ~~--------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~G~~l~~~-~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:81 265 KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQP-DVKEKTQQRLLGAKIE 334 (415) T ss_pred cc--------ccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHh-hccCCceeecc-CcCCCCCceecceeeE Confidence 00 01111123445667777766666778999999999999864 55555443333 3444445589999999 Q ss_pred EEEeccC-CCcc--cC Q lcl|Aclame:pro 319 RTDALLL-TEAR--VV 331 (331) Q Consensus 319 ~~dai~~-tE~~--Vv 331 (331) .++++.. +... 
++ T Consensus 335 ~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:81 335 ILPDEVLGQKGNNTLI 350 (415) T ss_pred EecccccCCCCccEEE Confidence 9998742 2221 22 No 57 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.82 E-value=1e-09 Score=69.88 Aligned_cols=225 Identities=12% Similarity=0.144 Sum_probs=141.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCc-----cCc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYG-----VQP 75 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g-----~~~ 75 (331) |+.+.+ .+..- +-+. .+...|+|.+.+.++|++..+++...++ .+.+.++++-|.+.|..=++. ++. T Consensus 1 ma~~t~-----~~gg~-liP~-~~~~~Ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~E~~~~~~~~~~~ 72 (305) T protein:vir:25 1 MADISR-----AEVAS-LIQE-AYSDTLLAAAKQGSTVLSAFQNVNMGTK-TTHLPVLATLPEADWVGESATDPKGVKPT 72 (305) T ss_pred CCCccC-----Cccce-ecCH-HHHHHHHHHHHhhchhhhhcceeeccCC-cEEEEEEeCCcceEEeecccccccccccc Confidence 777633 22221 2232 2456799999999999999999887544 477888999999999766654 455 Q ss_pred ccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 76 s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) ++.++.+++-..+-+++.+.|-+.+.+...- ++...-.....+++++.+++.|||||.+.. .+. 
T Consensus 73 s~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~--~~~------------ 136 (305) T protein:vir:25 73 SKVTWANRTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPA--SWV------------ 136 (305) T ss_pred cccceeeEEeeeEEEEEeehhhHHHHhcchH--HHHHHHHHHHHHHHHHHHhhhheeccCCCC--Ccc------------ Confidence 6889999999999999999999999876541 233333345789999999999999973211 110 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) .+++.|....++- T Consensus 137 ---------------------------~~~~~~~~~~~~~---------------------------------------- 149 (305) T protein:vir:25 137 ---------------------------SPALIPAAVTAGQ---------------------------------------- 149 (305) T ss_pred ---------------------------ccccccccccccc---------------------------------------- Confidence 0001110000000 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv 315 (331) .+.. ........++.+.+.+++..+-........|+||+.....|+.. .++.....+.+ ..+.|. T Consensus 150 ---~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~i~~~--------~~l~G~ 214 (305) T protein:vir:25 150 ---AVEV---VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-RDANGNPVFRD--------DSFAGF 214 (305) T ss_pred ---cccc---cccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHh-hccCCceeecC--------Cccccc Confidence 0000 01122334566667777666655444445699999999999864 45444432222 257888 Q ss_pred EEEEEEeccC--CCcccC Q lcl|Aclame:pro 316 PCRRTDALLL--TEARVV 331 (331) Q Consensus 316 pir~~dai~~--tE~~Vv 331 (331) |+..++++.. 
++..++ T Consensus 215 Pv~~~~~~~~~~~~~~~~ 232 (305) T protein:vir:25 215 RTFFNRNGAWDADAAIEV 232 (305) T ss_pred ceEEcCccCCCCCccEEE Confidence 9888888743 222232 No 58 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.81 E-value=6.3e-10 Score=71.03 Aligned_cols=229 Identities=10% Similarity=0.038 Sum_probs=137.5 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEE--eecCCcceeecCCccCc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR--SGLPTGTWRKLNYGVQP-EK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~--~~lP~~~fR~lN~g~~~-s~ 77 (331) .........|... ...+-+. .+...|++.+.+.++|++.+.++...++. +.+.+. ++.+.+.|..=++..++ +. T Consensus 116 ~~~~~~~~~~t~~-g~~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~Eg~~~~~~~~ 192 (415) T protein:vir:47 116 RNDIQGGSLKTDS-GFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAV 192 (415) T ss_pred hhhhhhccccccC-CcccccH-HHHHHHHHHHHhhhhhhhhcceeeccCCc-eeEEEEEecCCcceeecccccccccccc Confidence 0000000001111 1111222 23467999999999999999988776553 344443 55667788887777886 56 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|.+.+.+...- ++...-.....++++..+...|++|+.+..+..+. 
T Consensus 193 ~~~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~-------------- 256 (415) T protein:vir:47 193 KPFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS-------------- 256 (415) T ss_pred cceeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCCccccc-------------- Confidence 89999999999999999999999886542 33344445578899999999999997432211100 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) .+.. +.+. . T Consensus 257 --------~~~~------------------~~~~---------------------------------------------~ 265 (415) T protein:vir:47 257 --------SGFE------------------KEGK---------------------------------------------K 265 (415) T ss_pred --------cccc------------------cccc---------------------------------------------e Confidence 0000 0000 0 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) .. ...+. ..+-+++++..++.....+.+|+||++...+|+.. .+..+...+.+ ++.+...-.++|+|| T Consensus 266 ~~------~~~~~----~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~i~~~-~~~~~~~~~l~G~pV 333 (415) T protein:vir:47 266 LE------VKKAK----SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQP-DVKEKTQQRLLGAKI 333 (415) T ss_pred ec------ccccc----chHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHh-hccCCCeeecc-CcCCCCCccccceee Confidence 00 00011 12234566666666666778999999999999864 45555443322 333334457899999 Q ss_pred EEEEeccC-CC-cc-cC Q lcl|Aclame:pro 318 RRTDALLL-TE-AR-VV 331 (331) Q Consensus 318 r~~dai~~-tE-~~-Vv 331 (331) +.++++.. +. .. 
++ T Consensus 334 ~~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:47 334 EILPDEVLGQKGNNTLI 350 (415) T ss_pred EEeccccccCCCccEEE Confidence 99998742 22 11 22 No 59 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.81 E-value=6.3e-10 Score=71.03 Aligned_cols=229 Identities=10% Similarity=0.038 Sum_probs=137.5 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEE--eecCCcceeecCCccCc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR--SGLPTGTWRKLNYGVQP-EK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~--~~lP~~~fR~lN~g~~~-s~ 77 (331) .........|... ...+-+. .+...|++.+.+.++|++.+.++...++. +.+.+. ++.+.+.|..=++..++ +. T Consensus 116 ~~~~~~~~~~t~~-g~~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~Eg~~~~~~~~ 192 (415) T protein:vir:46 116 RNDIQGGSLKTDS-GFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAV 192 (415) T ss_pred hhhhhhccccccC-CcccccH-HHHHHHHHHHHhhhhhhhhcceeeccCCc-eeEEEEEecCCcceeecccccccccccc Confidence 0000000001111 1111222 23467999999999999999988776553 344443 55667788887777886 56 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|.+.+.+...- ++...-.....++++..+...|++|+.+..+..+. 
T Consensus 193 ~~~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~-------------- 256 (415) T protein:vir:46 193 KPFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS-------------- 256 (415) T ss_pred cceeeEEeeeeeeEeeehhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCCccccc-------------- Confidence 89999999999999999999999886542 33344445578899999999999997432211100 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) .+.. +.+. . T Consensus 257 --------~~~~------------------~~~~---------------------------------------------~ 265 (415) T protein:vir:46 257 --------SGFE------------------KEGK---------------------------------------------K 265 (415) T ss_pred --------cccc------------------cccc---------------------------------------------e Confidence 0000 0000 0 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) .. ...+. ..+-+++++..++.....+.+|+||++...+|+.. .+..+...+.+ ++.+...-.++|+|| T Consensus 266 ~~------~~~~~----~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~i~~~-~~~~~~~~~l~G~pV 333 (415) T protein:vir:46 266 LE------VKKAK----SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQP-DVKEKTQQRLLGAKI 333 (415) T ss_pred ec------ccccc----chHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHh-hccCCCeeecc-CcCCCCCccccceee Confidence 00 00011 12234566666666666778999999999999864 45555443322 333334457899999 Q ss_pred EEEEeccC-CC-cc-cC Q lcl|Aclame:pro 318 RRTDALLL-TE-AR-VV 331 (331) Q Consensus 318 r~~dai~~-tE-~~-Vv 331 (331) +.++++.. +. .. 
++ T Consensus 334 ~~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:46 334 EILPDEVLGQKGNNTLI 350 (415) T ss_pred EEeccccccCCCccEEE Confidence 99998742 22 11 22 No 60 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.80 E-value=2.3e-09 Score=67.96 Aligned_cols=234 Identities=15% Similarity=0.156 Sum_probs=138.6 Q ss_pred CCcCc--cccc-----cHHHHHHhcCCccchhHHHHHHHhccchhHhh-cceecccCCccceeEEEeecCCcceeecCCc Q lcl|Aclame:pro 1 MPTLS--TTNP-----TLADVAARMTPDGKIDPQIVEMLNETNEILDD-MTVIEANGFTEHKTTVRSGLPTGTWRKLNYG 72 (331) Q Consensus 1 M~~l~--~~a~-----TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~-lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g 72 (331) |+.-. .... |.....--+-+. .+...|||.+.+.++|++. ..+....++ .+.+.+.++-|.++|..=++. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg~liP~-~~~~~ii~~l~~~~~l~~~~~~~~~~~~g-~~~~p~~~~~~~a~~v~Eg~~ 190 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGGVLIPQ-NIHSEVIELLRDRTIVRKLGARSIPLPNG-NMSLPRLAGGATASYTGENQD 190 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCccccch-hHHHHHHHHHhhhchhhhhcceeeecCCc-ceEEEEEeCCcceeeeccCcc Confidence 11000 0000 000000001122 3446799999999998776 344444333 377888899999999999999 Q ss_pred cCcccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcc Q lcl|Aclame:pro 73 VQPEKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNS 151 (331) Q Consensus 73 ~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~ 151 (331) .++++.++.+++-..+-+++.+.|-+.+.+.. .+.. ..-.....++++.+....|++||... 
..+.||...- T Consensus 191 ~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~---~~i~~~l~~ai~~~~d~~~l~G~G~~--~~p~Gi~~~~-- 263 (428) T protein:vir:10 191 AKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVE---QLVLQDILTAISVREDKAFMRDDGTG--DTPIGMKARA-- 263 (428) T ss_pred ccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHH---HHHHHHHHHHHHHHHHHHHhccCCCC--cccccccccc-- Confidence 99999999999999999999999999987754 3333 33445588999999999999997432 2344553210 Q ss_pred ccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecc Q lcl|Aclame:pro 152 LSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDW 231 (331) Q Consensus 152 ~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~ 231 (331) + . + +.+.+-+. T Consensus 264 -~-----------~-~----------------~~~~~~~~---------------------------------------- 274 (428) T protein:vir:10 264 -T-----------Q-W----------------NRLLPWAA---------------------------------------- 274 (428) T ss_pred -c-----------c-c----------------cccccccc---------------------------------------- Confidence 0 0 0 00000000 Q ss_pred cceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEE Q lcl|Aclame:pro 232 RYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVA 311 (331) Q Consensus 232 r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~ 311 (331) .+.. ......++++++ ..+...++....+..|+||++...+|+.. .++.+...+ +....| . T Consensus 275 -----~~~~-------~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~n~~~~~~L~~l-kd~~G~~i~-~~~~~g----~ 335 (428) T protein:vir:10 275 -----DAAV-------NLDTIDTYLDSI-ILMSMDGNSNMISSGWGMSNRTYMKLFGL-RDGNGNKVY-PEMAQG----M 335 (428) T ss_pred -----cccc-------cHHHHHHHHHHH-HHhhhccccccccCEEEEcHHHHHHHHHh-hccCCceec-cCCCCC----e Confidence 0000 001111122222 22344455556678999999999999875 445444433 223333 4 Q ss_pred EcCeEEEEEEeccC------CCcccC Q lcl|Aclame:pro 312 FDGIPCRRTDALLL------TEARVV 331 (331) Q Consensus 312 ~~gvpir~~dai~~------tE~~Vv 331 (331) ++|+||..+|++-. 
++..++ T Consensus 336 l~G~pv~~~~~~p~~~~~~~~~~~i~ 361 (428) T protein:vir:10 336 LKGYPIQRTSAIPANLGEGGKESEIY 361 (428) T ss_pred eeceeeEEeccccccccCCCccceEE Confidence 89999999998832 233333 No 61 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.79 E-value=2.2e-09 Score=68.10 Aligned_cols=208 Identities=9% Similarity=0.037 Sum_probs=132.2 Q ss_pred CCc--CccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEe-ecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPT--LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRS-GLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~--l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~-~lP~~~fR~lN~g~~~s 76 (331) |.. ...+-.++ +. .+...|++.+.+.++|++.+.......+.+ +.+.+.. ..|.+.|..=++.++++ T Consensus 109 ~~~~t~~~gg~~i--------P~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 179 (397) T protein:vir:49 109 KTDGSGSDAGLTI--------PQ-DIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQN 179 (397) T ss_pred hhccCCccCccee--------cH-HHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccc Confidence 111 11111111 11 234579999999999998887766554433 3444443 34788999999999887 Q ss_pred c-ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 K-SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~-~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) . .++.+++-.++-+++.+.|.+.+.+... 
.++...-.....++++......|++|+....| T Consensus 180 ~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~---------------- 241 (397) T protein:vir:49 180 DDPKLSLIRYAIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIGTLPN---------------- 241 (397) T ss_pred cccceeeeEeeeeeeEeehhhHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhccccccc---------------- Confidence 6 6899999999999999999998887644 23334444557899999999999999632110 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) ++ T Consensus 242 ---------------------------------~~--------------------------------------------- 243 (397) T protein:vir:49 242 ---------------------------------KP--------------------------------------------- 243 (397) T ss_pred ---------------------------------cc--------------------------------------------- Confidence 00 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv 315 (331) ...+ .+-+++++..++.....+..|+||++....|+.. .+..+...+.+ ++.+...-.++|. T Consensus 244 ------------~~~~----~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~l-kd~~g~~l~~~-~~~~g~~~~l~G~ 305 (397) T protein:vir:49 244 ------------TLAK----WDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKV-KNAMGDYLMER-DVKSPTGYSIDGF 305 (397) T ss_pred ------------cccC----HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHh-hccCCceeecc-cccCCCCceecce Confidence 0000 1124455666665666778999999999999975 45544443333 3333333569999 Q ss_pred EEEEEEec-c-C---CCcccC Q lcl|Aclame:pro 316 PCRRTDAL-L-L---TEARVV 331 (331) Q Consensus 316 pir~~dai-~-~---tE~~Vv 331 (331) ||+.|+.. . . 
++..++ T Consensus 306 pV~~~~~~~~~~~~~~~~~~~ 326 (397) T protein:vir:49 306 VVKEISDRFLPNGTGGAMPLY 326 (397) T ss_pred eeEEecccccccccCCceeEE Confidence 99987642 1 1 111222 No 62 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.78 E-value=3.2e-10 Score=72.66 Aligned_cols=275 Identities=16% Similarity=0.041 Sum_probs=150.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKSR 79 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t 79 (331) |++-.++ +.. .+-+. .+...||+.+.+.++|++.++.....++ ...|.++++ -|.++|..=++.++++..+ T Consensus 151 ~~~~~~~-----~gg-~~vp~-~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~~~~s~~~ 222 (497) T protein:vir:10 151 NPFGSTG-----TFA-PGILP-TFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) T ss_pred hhcccCc-----ccc-cccch-hhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCccccccccc Confidence 1111000 000 01121 2356799999999999999988776554 467777765 5789999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 80 ~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) +.+++...+-+.+.+.|.+.+.+-.++ +...-.....++++......|||||-..+ ..||-..- +.. 
T Consensus 223 f~~i~~~~~k~a~~~~iS~ell~d~~~---l~~~i~~~l~~~i~~~~d~~~l~G~G~~~---p~Gil~~~-------~~~ 289 (497) T protein:vir:10 223 FARVYEQVGKVANALTITDEGLRDAPE---LFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRS-------TGF 289 (497) T ss_pred ceeeEeeeeeeEeecHhHHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHhhcCCCccc---cccccccc-------ccc Confidence 999999999999999999999876544 33444455889999999999999986655 44553311 111 Q ss_pred eeccCCCCCCceEEE---EEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 160 IIDAGGTGSDNASIW---LTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 160 vidaGgtG~~~tSI~---~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) .+..+.+....++.+ +..+.. |......+...+....-+.. ..|... |. T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~---------~~~~~~--~~---- 341 (497) T protein:vir:10 290 TASSASSLFGATSATVSNVKFPAD-------------GTNGAFVGQDTVASLKYGRV---------VTGAAG--SG---- 341 (497) T ss_pred cccccccchhhhhhhhhhhhhhcc-------------cccchhhhhhHHHHHHHHHh---------hhhhhh--hc---- Confidence 111111111101000 000000 00000000000000000000 000000 00 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcC-CCCceEEEeCHHHHHHHHHHhhcCCcceeeee--cccCCcee---E Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNV-GMGRPAFYMPRKIRSFLRRQITNKVAASTLTM--EEIAGKKV---V 310 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~-~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~--~~~~g~~v---~ 310 (331) .++ .....+..++.+.+..++..++.. ...+.+|.||++-...|++. .+..+...+.+ ....|.++ . 
T Consensus 342 -~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~i~~~~~~~~~~~~~~~~~ 414 (497) T protein:vir:10 342 -SGV-----AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT-KDANGQYMGGNFFGNAYGNPVNGGK 414 (497) T ss_pred -cch-----hccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHh-hcCCCceeccCcccccccccccCCc Confidence 000 001122344566677777776533 34456899999999999865 44444332211 11122222 2 Q ss_pred EEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~tE~~Vv 331 (331) .++|+||..++++-.+...|- T Consensus 415 ~l~G~pV~~t~~~~~~~~~~G 435 (497) T protein:vir:10 415 NIWGVPVVTTPLIPLGTILVG 435 (497) T ss_pred eeeceeeEecCCCCCCceEEe Confidence 678999999999865543221 No 63 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.78 E-value=3.2e-10 Score=72.66 Aligned_cols=275 Identities=16% Similarity=0.041 Sum_probs=150.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-cCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKSR 79 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-lP~~~fR~lN~g~~~s~~t 79 (331) |++-.++ +.. .+-+. 
.+...||+.+.+.++|++.++.....++ ...|.++++ -|.++|..=++.++++..+ T Consensus 151 ~~~~~~~-----~gg-~~vp~-~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~~~~s~~~ 222 (497) T protein:vir:78 151 NPFGSTG-----TFA-PGILP-TFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) T ss_pred hhcccCc-----ccc-cccch-hhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCccccccccc Confidence 1111000 000 01121 2356799999999999999988776554 467777765 5789999999999999999 Q ss_pred EEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccce Q lcl|Aclame:pro 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) Q Consensus 80 ~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~ 159 (331) +.+++...+-+.+.+.|.+.+.+-.++ +...-.....++++......|||||-..+ ..||-..- +.. T Consensus 223 f~~i~~~~~k~a~~~~iS~ell~d~~~---l~~~i~~~l~~~i~~~~d~~~l~G~G~~~---p~Gil~~~-------~~~ 289 (497) T protein:vir:78 223 FARVYEQVGKVANALTITDEGLRDAPE---LFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRS-------TGF 289 (497) T ss_pred ceeeEeeeeeeEeecHhHHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHhhcCCCccc---cccccccc-------ccc Confidence 999999999999999999999876544 33444455889999999999999986655 44553311 111 Q ss_pred eeccCCCCCCceEEE---EEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 160 IIDAGGTGSDNASIW---LTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 160 vidaGgtG~~~tSI~---~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) .+..+.+....++.+ +..+.. |......+...+....-+.. ..|... |. 
T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~---------~~~~~~--~~---- 341 (497) T protein:vir:78 290 TASSASSLFGATSATVSNVKFPAD-------------GTNGAFVGQDTVASLKYGRV---------VTGAAG--SG---- 341 (497) T ss_pred cccccccchhhhhhhhhhhhhhcc-------------cccchhhhhhHHHHHHHHHh---------hhhhhh--hc---- Confidence 111111111101000 000000 00000000000000000000 000000 00 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcC-CCCceEEEeCHHHHHHHHHHhhcCCcceeeee--cccCCcee---E Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNV-GMGRPAFYMPRKIRSFLRRQITNKVAASTLTM--EEIAGKKV---V 310 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~-~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~--~~~~g~~v---~ 310 (331) .++ .....+..++.+.+..++..++.. ...+.+|.||++-...|++. .+..+...+.+ ....|.++ . T Consensus 342 -~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~i~~~~~~~~~~~~~~~~~ 414 (497) T protein:vir:78 342 -SGV-----AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT-KDANGQYMGGNFFGNAYGNPVNGGK 414 (497) T ss_pred -cch-----hccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHh-hcCCCceeccCcccccccccccCCc Confidence 000 001122344566677777776533 34456899999999999865 44444332211 11122222 2 Q ss_pred EEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~tE~~Vv 331 (331) .++|+||..++++-.+...|- T Consensus 415 ~l~G~pV~~t~~~~~~~~~~G 435 (497) T protein:vir:78 415 NIWGVPVVTTPLIPLGTILVG 435 (497) T ss_pred eeeceeeEecCCCCCCceEEe Confidence 678999999999865543221 No 64 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.77 E-value=3.4e-09 Score=67.03 Aligned_cols=210 Identities=8% Similarity=0.043 Sum_probs=132.5 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccc-eeEEE-eecCCcceeecCCccCcc-c Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEH-KTTVR-SGLPTGTWRKLNYGVQPE-K 77 (331) Q Consensus 1 
M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~-~~~~~-~~lP~~~fR~lN~g~~~s-~ 77 (331) |..- |..+. ..+-+. .+...||+.+.+.++|++.++.....++.+. .+... ..-+.+.|..=++.++++ + T Consensus 109 ~~~~-----t~~~g-g~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 181 (397) T protein:vir:48 109 KTDA-----SGSDA-GLTIPQ-DIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDD 181 (397) T ss_pred hhcc-----CCccc-cccccH-HHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccc Confidence 1110 00010 001122 2346799999999999999888776544332 22222 344678899999999877 4 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|-+.+.+...- ++...-...+.++++......|++|+....+ T Consensus 182 ~~~~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~------------------ 241 (397) T protein:vir:48 182 PKLYPIRYAIKRYAGISTVTNSLLADSAE--NILAWLSGWIAKKVVVTRNKAILEAIATLPT------------------ 241 (397) T ss_pred cceeeEEeeheeeeeehhhHHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------ Confidence 79999999999999999999988876431 3333344458899999999999998632110 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) .+ + T Consensus 242 -------------------------------~~---~------------------------------------------- 244 (397) T protein:vir:48 242 -------------------------------KP---T------------------------------------------- 244 (397) T ss_pred -------------------------------cc---c------------------------------------------- Confidence 00 0 Q ss_pred 
eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) ..+ .+-+++++..++.....+..|+||++.+..|+.. .+..+...+. .++.+...-.++|.|| T Consensus 245 -----------~~~----~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~l-kd~~G~~i~~-~~~~~~~~~~l~G~PV 307 (397) T protein:vir:48 245 -----------LTK----WDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKV-KNAFGDYLME-RDVKSPTGYSIDGFAV 307 (397) T ss_pred -----------ccc----HHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHh-hcCCCceeec-cCcCCCCCceecccee Confidence 000 1124445555665556678999999999999975 3444444333 3444444568899999 Q ss_pred EEEEec-cC----CCcccC Q lcl|Aclame:pro 318 RRTDAL-LL----TEARVV 331 (331) Q Consensus 318 r~~dai-~~----tE~~Vv 331 (331) ..+|+. +. ++..++ T Consensus 308 ~~~~~~~~~~~~~~~~~~~ 326 (397) T protein:vir:48 308 KEVADRWLANASSGAMPLY 326 (397) T ss_pred EEecccccCCcCCCceEEE Confidence 998763 21 122222 No 65 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.76 E-value=3.8e-09 Score=66.76 Aligned_cols=210 Identities=9% Similarity=0.046 Sum_probs=133.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEe-ecCCcceeecCCccCc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRS-GLPTGTWRKLNYGVQP-EK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~-~lP~~~fR~lN~g~~~-s~ 77 (331) |... |..+-. -+-+. .+...||+.+.+.++|++.+......+..+ +.+.+.. .-+.++|.+=++.+++ +. 
T Consensus 109 ~~~~-----t~~~gg-~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 181 (397) T protein:vir:49 109 KTDA-----SGSDAG-LTIPQ-DIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDD 181 (397) T ss_pred hhcc-----ccccCc-ccccH-hHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccc Confidence 2211 111100 01122 234679999999999999888866544332 3444444 3477899999999986 67 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-.++-+++.+.|-+.+.+... -++...-.....++++......||+|+....+ T Consensus 182 ~~~~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~------------------ 241 (397) T protein:vir:49 182 PKLSLIKYTIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIAALPT------------------ 241 (397) T ss_pred cceeeEEeeeeeEEeeehhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------ Confidence 8999999999999999999998887643 12333344557899999999999999643211 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) ++ |+ T Consensus 242 -------------------------------~~---~~------------------------------------------ 245 (397) T protein:vir:49 242 -------------------------------KP---TL------------------------------------------ 245 (397) T ss_pred -------------------------------cc---cc------------------------------------------ Confidence 00 00 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q 
Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) .. .+-+++++..++.....+.+||||++....|+.. .+..+...+ ..++.+...-.+.|+|| T Consensus 246 ------------~~----~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~l-kd~~G~~l~-~~~~~~~~~~~l~G~PV 307 (397) T protein:vir:49 246 ------------TK----WDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKV-KNALGDYLM-ERDVKSPTGYSIDGFAV 307 (397) T ss_pred ------------cc----HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHh-hcCCCceee-ccCcCCCCCceecceee Confidence 00 1224455555655556678999999999999975 444444433 33444444567899999 Q ss_pred EEEEec--cC-CCc--ccC Q lcl|Aclame:pro 318 RRTDAL--LL-TEA--RVV 331 (331) Q Consensus 318 r~~dai--~~-tE~--~Vv 331 (331) ..+++. -+ +-. .++ T Consensus 308 ~~~~~~~~~~~~~~~~~i~ 326 (397) T protein:vir:49 308 KEVADRWLANGTGGAMPLY 326 (397) T ss_pred EEecccccccccCCceeEE Confidence 987752 12 111 122 No 66 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.75 E-value=2.6e-09 Score=67.67 Aligned_cols=227 Identities=10% Similarity=0.055 Sum_probs=133.3 Q ss_pred CCc-C-----c--cccccHHHHHHhcCCccchhHHHHHHHhccchhHhh-cceecccCCccceeEEEeecCCcceeecCC Q lcl|Aclame:pro 1 MPT-L-----S--TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDD-MTVIEANGFTEHKTTVRSGLPTGTWRKLNY 71 (331) Q Consensus 1 M~~-l-----~--~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~-lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~ 71 (331) +.- . . ....|...-.- +.+... ...+|..+.+.+.+|.. 
......+++..+.+.+.++-|.+.|..=++ T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g~-~~~~~~-~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~ 174 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNPN-VLSRTL-YGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETA 174 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCCc-cccccc-hHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccc Confidence 000 0 0 00001110000 111111 23445444455666654 455555545456788899999999999999 Q ss_pred ccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhc Q lcl|Aclame:pro 72 GVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFN 150 (331) Q Consensus 72 g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~ 150 (331) .++++..++.+++-..+-+++.+.|-+.+.+... +..++- .....++++......|+|||. .|+ ||-.- T Consensus 175 ~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i---~~~l~~~i~~~~d~~~l~G~G--~p~---Gi~~~-- 244 (390) T protein:vir:62 175 EIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFL---VSDAGPAIGDAMGRHFITGTG--QPR---GILTD-- 244 (390) T ss_pred cccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHH---HHHHHHHHHHHHHhhhhccCC--ccc---ccccc-- Confidence 9999999999999999999999999999988754 334333 344778999999999999963 222 21100 Q ss_pred cccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEec Q lcl|Aclame:pro 151 SLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRD 230 (331) Q Consensus 151 ~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d 230 (331) . +.++. 
T Consensus 245 -~----------~~~~~--------------------------------------------------------------- 250 (390) T protein:vir:62 245 -A----------SPATA--------------------------------------------------------------- 250 (390) T ss_pred -c----------ccccc--------------------------------------------------------------- Confidence 0 00000 Q ss_pred ccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeE Q lcl|Aclame:pro 231 WRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVV 310 (331) Q Consensus 231 ~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~ 310 (331) .+.. +.+...-.+.|+++++.++.....+.+|+||++...+|+.. .++.+...+.+.-..| ... T Consensus 251 --------~~~~------~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~l-kd~~g~~l~~~~~~~g-~~~ 314 (390) T protein:vir:62 251 --------TFLA------TDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKL-KDANGQYLWQSGLTVG-APS 314 (390) T ss_pred --------ceec------ccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHh-hccCCCeeecCCcCCC-ccc Confidence 0000 00000012224455555554445567899999999999864 4555544333333333 336 Q ss_pred EEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai~~tE~~Vv 331 (331) .++|.||..+|++-.++ |+ T Consensus 315 ~l~G~Pv~~~~~~p~~~--i~ 333 (390) T protein:vir:62 315 LFNGKVVETDDGMPADK--IL 333 (390) T ss_pred eecccceEEecCCCCcc--EE Confidence 79999999999986543 33 No 67 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.74 E-value=3.3e-09 Score=67.06 Aligned_cols=233 Identities=16% Similarity=0.190 Sum_probs=138.6 Q ss_pred CCc--CccccccH-----HHHHHhcCCccchhHHHHHHHhccchhHhh-cceecccCCccceeEEEeecCCcceeecCCc Q lcl|Aclame:pro 1 MPT--LSTTNPTL-----ADVAARMTPDGKIDPQIVEMLNETNEILDD-MTVIEANGFTEHKTTVRSGLPTGTWRKLNYG 72 (331) Q Consensus 1 
M~~--l~~~a~TL-----~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~-lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g 72 (331) |+. +....+.. ....--+-+. .+...|||.+++.+++... ...+....+ ...+.+.++-|.++|..=++. T Consensus 52 ~a~~~~~~~~~~~a~~~~~~~Gg~lvP~-~~~~~ii~~l~~~s~l~~lg~~~v~~~~g-~~~~p~~t~~~~a~wv~E~~~ 129 (366) T protein:vir:57 52 FAATELGDTGLSMAISTAAGSGGALIPQ-NMQNEVIELLRDRTVVRILGARSIPLPNG-NLSMPRLSGGATAGYVGEGKD 129 (366) T ss_pred HHHHhhcchhhhhhccccccCCccccch-hHHHHHHHHHhhhcchhhhceeeeecCCC-ceEEEEEeCCcceeeeccCcc Confidence 000 00000000 0000001122 2445699999998988665 444444333 467888889999999999999 Q ss_pred cCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcc Q lcl|Aclame:pro 73 VQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNS 151 (331) Q Consensus 73 ~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~ 151 (331) +++++.++.+++-..+-+++.+.|-+.+.+... +.+++ -.....++++..+..+|++||...+ ++.||..-- T Consensus 130 ~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~---i~~~l~~a~~~~~d~a~l~G~G~~~--~p~Gi~~~~-- 202 (366) T protein:vir:57 130 VVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQL---LLGDILSAIATREDKAFLRDDGTGD--TPKGMKAVA-- 202 (366) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHhhccCCCCc--cccceeecc-- Confidence 999999999999999999999999999887643 44443 3345789999999999999974321 233432100 Q ss_pred ccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecc Q lcl|Aclame:pro 152 LSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDW 231 (331) Q Consensus 152 ~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~ 231 (331) + . ++. .+.+ . 
| T Consensus 203 -~-----------~-~~~-----~~~~------------------------------~---------------~------ 213 (366) T protein:vir:57 203 -T-----------A-ANR-----LVAW------------------------------T---------------G------ 213 (366) T ss_pred -c-----------c-ccc-----eeec------------------------------c---------------c------ Confidence 0 0 000 0000 0 0 Q ss_pred cceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEE Q lcl|Aclame:pro 232 RYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVA 311 (331) Q Consensus 232 r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~ 311 (331) ..++ ......+.++| .+....++.......|+||++....|+.. .+......+ +....| . T Consensus 214 ------t~~~-------~~~~~~~~~~~-~~~~~~~~~~~~~a~~vmn~~~~~~L~~l-kd~~G~~l~-~~~~~g----~ 273 (366) T protein:vir:57 214 ------TAIN-------LTTIDEYLDSL-ILKHMDSNSNMIRCGWGLSNRTYMTLFGL-RDGNGNKVY-PEMSQG----I 273 (366) T ss_pred ------cccc-------hhhHHHHHHHH-HHhhhccccccccCEEEecHHHHHHHHhh-hccCCceec-cCCCCC----e Confidence 0011 01111233333 34455666677789999999999999965 344443322 222233 5 Q ss_pred EcCeEEEEEEeccC------CCcccC Q lcl|Aclame:pro 312 FDGIPCRRTDALLL------TEARVV 331 (331) Q Consensus 312 ~~gvpir~~dai~~------tE~~Vv 331 (331) ++|+||..+++|-. 
++..++ T Consensus 274 l~G~Pvv~s~~ip~~~~~~~~~~~i~ 299 (366) T protein:vir:57 274 LKGYPIQRTSAIPANLGDDGNESEIY 299 (366) T ss_pred ecceeeEEccccccccccCCCccEEE Confidence 89999999999843 222233 No 68 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.71 E-value=2.4e-09 Score=67.85 Aligned_cols=208 Identities=12% Similarity=0.059 Sum_probs=132.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCc-cceeEEEeecCCcceeecCCccCc-ccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFT-EHKTTVRSGLPTGTWRKLNYGVQP-EKS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~-~~~~~~~~~lP~~~fR~lN~g~~~-s~~ 78 (331) |... |-.+-. -+-+ ..+...||+.+.+.++|++.++.....++. .+.+.+.++-|.++|..=++..++ +.. T Consensus 123 ~~~~-----~~~~gg-~lvP-~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 195 (397) T protein:vir:12 123 MSGI-----NDEDGG-ILIP-EDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQP 195 (397) T ss_pred cccc-----ccccCc-ccCc-hhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccc Confidence 2111 000000 0111 123467999999999999998887765443 355667788889999999988886 568 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) ++.+++-.++-+++.+.|.+.+.+..+ +..++- .....++++......|++|+.+.. 
T Consensus 196 ~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i---~~~l~~~~~~~~d~~il~G~g~~~------------------- 253 (397) T protein:vir:12 196 RFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYV---AKWFAKKSVVTRNNLILAAIASLK------------------- 253 (397) T ss_pred cceeEEeeheeeEeeehhhHHHHhhchHHHHHHH---HHHHHHHHHHHHHHHHHhcccccc------------------- Confidence 999999999999999999999887655 334443 344788899999999999963321 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) |+|. T Consensus 254 ------------------------------~~g~---------------------------------------------- 257 (397) T protein:vir:12 254 ------------------------------KVDI---------------------------------------------- 257 (397) T ss_pred ------------------------------cccc---------------------------------------------- Confidence 1110 Q ss_pred eccccccCCCCccchhhHHHHHHHHHH-HhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVE-LIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIP 316 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~-~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvp 316 (331) .+..+| +++++ .++.......+|+||++...+|+.. .+..+...+.+ ++.+-..-.++|+| T Consensus 258 ------------~~~~~i----~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~l-kd~~G~~l~~~-~~~~g~~~~l~G~p 319 (397) T protein:vir:12 258 ------------DGLDGI----KKALNVTLDPMVAPGSIVLTNQDGYDWLDTL-KDGTGRYLLQP-DPTNPTKKLLDGRP 319 (397) T ss_pred ------------ccHHHH----HHHHhhccchhhhCCCEEEEcHHHHHHHHHh-hccCCceeecc-cccCCCCcccccee Confidence 001122 33332 3444445668999999999999864 45545443333 33333345789999 Q ss_pred EEEEEec-cCC---CcccC Q lcl|Aclame:pro 317 CRRTDAL-LLT---EARVV 331 (331) Q Consensus 317 ir~~dai-~~t---E~~Vv 331 (331) |..+++. 
..+ ...++ T Consensus 320 v~~~~~~~~~~~~~~~~~~ 338 (397) T protein:vir:12 320 VVPFTNRVLKTQKGKAPLI 338 (397) T ss_pred eEEecccccccCCCccEEE Confidence 9877653 332 22233 No 69 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.70 E-value=2e-09 Score=68.23 Aligned_cols=229 Identities=11% Similarity=0.004 Sum_probs=134.5 Q ss_pred CCcC---ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecC----CcceeecCCcc Q lcl|Aclame:pro 1 MPTL---STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLP----TGTWRKLNYGV 73 (331) Q Consensus 1 M~~l---~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP----~~~fR~lN~g~ 73 (331) +..+ ....-+..+.. .+-+. .+...||+.+.+.++|++.+++....++ ...|.+++..+ .+.|..=++.. T Consensus 110 ~~~~~~~~~~~~~~~~~~-~~vp~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~Eg~~~ 186 (413) T protein:vir:81 110 VKAASDPASTATLTDEFQ-GGYGT-TWNRNIIYRRREKLVVADLMDNLTMTNT-TIKYLMEKANRVVEGGFKTVAEGGKK 186 (413) T ss_pred HHhhhhhhhhcccccccc-cccch-hhHHHHHHHHhhhhhHHhhcceeeccCC-ceeEEEeccccccccccceecCcccc Confidence 0000 00000000100 11122 2456799999999999999998776544 45677776654 45788777778 Q ss_pred Cccc-ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccc Q lcl|Aclame:pro 74 QPEK-SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSL 152 (331) Q Consensus 74 ~~s~-~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~ 152 (331) +++. 
+++.+++...+-+++.+.|.+.+.+..++ +...-.....++++......||||+...+ .+.||..- T Consensus 187 ~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~---l~~~i~~~la~~~~~~~d~~~l~G~G~~~--~~~Gi~~~---- 257 (413) T protein:vir:81 187 PYMRFADFDIVTESLSKIAGLTKITDEMIEDYDF---LVSYINARLLEELAIEEERQLLLGDGTGN--NLTGLLKR---- 257 (413) T ss_pred cccCcccceeeEeeeeeEEEeehhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhccCCCCC--cccccccc---- Confidence 7776 68999999999999999999998876543 33334455788999999999999974333 24444220 Q ss_pred cccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccc Q lcl|Aclame:pro 153 SAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWR 232 (331) Q Consensus 153 ~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r 232 (331) T Consensus 258 -------------------------------------------------------------------------------- 257 (413) T protein:vir:81 258 -------------------------------------------------------------------------------- 257 (413) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred ceeeeeccccccCCCCccchhhHHHHHHHHHHHhh-cCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc------C Q lcl|Aclame:pro 233 YVVRIANVDVSELTKNASAGADLIDLMTQAVELIP-NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI------A 305 (331) Q Consensus 233 ~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip-~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~------~ 305 (331) .++... ...+..++.+.+.+++..+. +....+.+|+||+.....|+.. .+..+...+.+... . 
T Consensus 258 -----~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~~ 327 (413) T protein:vir:81 258 -----DGIQTL----AVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLA-KDANGQYYGGGVFQGQYGSGG 327 (413) T ss_pred -----cccccc----cccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHh-hccCCceeccccccccccccc Confidence 000000 00111223444555554432 1222345799999999999854 34444332221111 1 Q ss_pred CceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 306 GKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 306 g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) +.....+.|.||..+|++..+...+. T Consensus 328 ~~~~~~l~G~pv~~s~~~~~~~~~~g 353 (413) T protein:vir:81 328 IMLDPAPWGLRTVQSQVVPVGKPVVG 353 (413) T ss_pred cccCceecceeeEEcCCCCcccEEEE Confidence 11223578999999999865542222 No 70 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.70 E-value=5.7e-09 Score=65.80 Aligned_cols=230 Identities=9% Similarity=0.032 Sum_probs=136.6 Q ss_pred CC------cCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCcc Q lcl|Aclame:pro 1 MP------TLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGV 73 (331) Q Consensus 1 M~------~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~ 73 (331) +- .......+... ...+-+. 
.+...||+.+.+.++|++.+.++...++.+ +.+...++.+.+.|..=++.+ T Consensus 110 ~~~~~~~~~~~~~~~~~~~-g~~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~ 187 (415) T protein:vir:94 110 TEYLETRNDIQGGSLKTDS-GFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEEN 187 (415) T ss_pred HHHhhhhhhhhhhcccccc-ccccCcH-HHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccc Confidence 00 00000000000 0011122 245679999999999999999888654432 233444566777888777777 Q ss_pred Cc-ccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccc Q lcl|Aclame:pro 74 QP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSL 152 (331) Q Consensus 74 ~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~ 152 (331) ++ +..++.+++-.++-+++.+.|.+.+.+... .++...-.....++++......|++|+....+..+. T Consensus 188 ~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~--------- 256 (415) T protein:vir:94 188 PELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS--------- 256 (415) T ss_pred cccccccceeeEeeheeeeeechhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccc--------- Confidence 75 457899999999999999999999887643 234444455588899999999999997443321110 Q ss_pred cccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccc Q lcl|Aclame:pro 153 SAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWR 232 (331) Q Consensus 153 ~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r 232 (331) ++ .. +++. 
T Consensus 257 -------------~~-~~-----------------~~~~----------------------------------------- 264 (415) T protein:vir:94 257 -------------SG-FE-----------------KEGK----------------------------------------- 264 (415) T ss_pred -------------cc-cc-----------------cccc----------------------------------------- Confidence 00 00 0000 Q ss_pred ceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEE Q lcl|Aclame:pro 233 YVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAF 312 (331) Q Consensus 233 ~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~ 312 (331) .. +.+.....+-++++++.+......+.+|+||++...+|+.. .+..+...+. ..+.+...-.| T Consensus 265 ------~~--------~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~l~~-~~~~~~~~~~l 328 (415) T protein:vir:94 265 ------KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQ-PDVKEKTQQRL 328 (415) T ss_pred ------cc--------ccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHh-hccCCCeeec-cCcCCCCCcee Confidence 00 00001112334556666665566678999999999999864 5555544332 23333344579 Q ss_pred cCeEEEEEEeccC-CCc--ccC Q lcl|Aclame:pro 313 DGIPCRRTDALLL-TEA--RVV 331 (331) Q Consensus 313 ~gvpir~~dai~~-tE~--~Vv 331 (331) +|.||+.++++.. +.. 
.++ T Consensus 329 ~G~pV~~~~~~~~~~~~~~~i~ 350 (415) T protein:vir:94 329 LGAKIEILPDEVLGQKGNNTLI 350 (415) T ss_pred cceeeEEecccccCCCCccEEE Confidence 9999999998742 221 122 No 71 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.68 E-value=8.9e-09 Score=64.73 Aligned_cols=207 Identities=11% Similarity=0.044 Sum_probs=130.1 Q ss_pred CCcC--ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCc-cceeEEEeecCCcceeecCCccCc-c Q lcl|Aclame:pro 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFT-EHKTTVRSGLPTGTWRKLNYGVQP-E 76 (331) Q Consensus 1 M~~l--~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~-~~~~~~~~~lP~~~fR~lN~g~~~-s 76 (331) |.+- +.+..+ -+. .+...||+.+.+.++|++.++.+...++. .+.+.+..+-|.+.|.+=++..++ + T Consensus 91 ~~~~t~~~gg~~--------vP~-~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 161 (371) T protein:vir:81 91 MSEGSNQDGGYT--------VPQ-DIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKA 161 (371) T ss_pred hccCCCccCcee--------ecH-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeecccccccccc Confidence 2211 001111 121 24567999999999999999887764432 234455566788899988888875 6 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) ..++.+++-.++-+++.+.|-+.+.+.. 
.+..++ -.....+++++.....|++|+....| T Consensus 162 ~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~a~~~~~~~~i~~g~g~~~~---------------- 222 (371) T protein:vir:81 162 TPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNT---LVRWIGDESRVTRNGLIINVLNTKAK---------------- 222 (371) T ss_pred ccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHHHhhcccccc---------------- Confidence 7899999999999999999999887754 333444 44557889999999999998642110 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) +|. T Consensus 223 ---------------------------------~~~-------------------------------------------- 225 (371) T protein:vir:81 223 ---------------------------------TAI-------------------------------------------- 225 (371) T ss_pred ---------------------------------ccc-------------------------------------------- Confidence 000 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv 315 (331) .+..++.+++. ..++.....+.+|+||++....|+.. .+..+...+ .....+...-.++|. T Consensus 226 --------------~~~~~i~~~~~---~~l~~~~~~~a~~vmn~~~~~~L~~l-kd~~g~~l~-~~~~~~~~~~~l~G~ 286 (371) T protein:vir:81 226 --------------ADLDGLKQIIN---VQLDPVFRSTSSVIVNQDAFNWLDTL-KDQNGQYLL-QPSISSPTGRQLLGL 286 (371) T ss_pred --------------ccHHHHHHHHH---hhcchhhhcCCEEEEcHHHHHHHHHh-hccCCCeee-ecccCCCCCceecce Confidence 00011222221 12333344567999999999999975 344443323 223333444678999 Q ss_pred EEEEEEeccC----------CCcccC Q lcl|Aclame:pro 316 PCRRTDALLL----------TEARVV 331 (331) Q Consensus 316 pir~~dai~~----------tE~~Vv 331 (331) ||..+|++.. 
.+..++ T Consensus 287 pV~~~~~~~~~~~~~~~~~~~~~~i~ 312 (371) T protein:vir:81 287 PVVIVSNKVLANRVDGGTGAQFAPII 312 (371) T ss_pred eEEEecccccCccccccccCCcceEE Confidence 9999998731 111222 No 72 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.66 E-value=1.6e-08 Score=63.38 Aligned_cols=217 Identities=7% Similarity=-0.006 Sum_probs=130.9 Q ss_pred CCcC--ccccccHHHHHH---h-------cCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEe-ecCCcce Q lcl|Aclame:pro 1 MPTL--STTNPTLADVAA---R-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRS-GLPTGTW 66 (331) Q Consensus 1 M~~l--~~~a~TL~E~Ak---~-------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~-~lP~~~f 66 (331) +.-+ ....++-.|... . +-+. .+...||+.+.+.++|++.+......++.+ +.+.... .-|.+.| T Consensus 98 ~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 176 (404) T protein:vir:39 98 VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQ-DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVM 176 (404) T ss_pred HHHHhcchhhhhhhhhhhhhcccccCCceeccH-HHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceee Confidence 0000 000001111000 0 0121 345679999999999999998877655433 3333333 3377899 Q ss_pred eecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCc Q lcl|Aclame:pro 67 RKLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGL 145 (331) Q Consensus 67 R~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL 145 (331) ..=++..++ ++.++.+++-.++-+++.+.|-+.+.+... .++...-.....++++......+++|+.... 
T Consensus 177 v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~------- 247 (404) T protein:vir:39 177 DAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTA--ENILAWLSSWIAKKVVVTRNQAIIAAMGTVP------- 247 (404) T ss_pred ecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 999999986 679999999999999999999998887643 2334444455889999999999999862210 Q ss_pred hhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeec Q lcl|Aclame:pro 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) Q Consensus 146 ~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~G 225 (331) |++ T Consensus 248 ------------------------------------------~~~----------------------------------- 250 (404) T protein:vir:39 248 ------------------------------------------KKP----------------------------------- 250 (404) T ss_pred ------------------------------------------ccc----------------------------------- Confidence 000 Q ss_pred eEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccC Q lcl|Aclame:pro 226 LTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIA 305 (331) Q Consensus 226 l~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~ 305 (331) ...+..+|.+++.. .++.......+|+||++.+..|+.. .+..+...+ ...+. T Consensus 251 ----------------------~~~~~~~i~~~~~~---~~~~~~~~~a~~v~n~~~~~~L~~l-kd~~G~~l~-~~~~~ 303 (404) T protein:vir:39 251 ----------------------TIAKFDDVITMINT---SVDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLL-EPDPT 303 (404) T ss_pred ----------------------ccccHHHHHHHHHH---hhhhhhccCCEEEEcHHHHHHHHHh-hccCCceee-ccCcC Confidence 00111233333322 2323334457899999999999964 455554433 33444 Q ss_pred CceeEEEcCeEEEEEEeccCC-----CcccC Q lcl|Aclame:pro 306 GKKVVAFDGIPCRRTDALLLT-----EARVV 331 (331) Q Consensus 306 g~~v~~~~gvpir~~dai~~t-----E~~Vv 331 (331) +...-.+.|.||..+|+.... 
...++ T Consensus 304 ~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~ 334 (404) T protein:vir:39 304 KPNSYLIKGKKVIVVADRWLPNSGSTVYPLY 334 (404) T ss_pred CCCcceecceeEEEecccccCccCCCccEEE Confidence 445568999999999863211 11222 No 73 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.63 E-value=2.1e-09 Score=68.20 Aligned_cols=235 Identities=11% Similarity=0.091 Sum_probs=132.9 Q ss_pred CCcCccccccHHHHHHhcCC-----------ccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecC-Ccceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTP-----------DGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLP-TGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~-----------~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP-~~~fR~ 68 (331) ||-.-..+ -|.++++.... ...+...+++.+.+.+++|..+..+...+.. ... ...+.+ .+.|.. T Consensus 1 ~~~k~~~~-~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~-~~i-~~~~~~~~~~~~~ 77 (321) T protein:vir:31 1 MASRTINN-DLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKK-TRI-PTLNIGERHRRPQ 77 (321) T ss_pred CchHHHHH-HHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcc-eee-eeeccCCcccccc Confidence 55432111 12333332110 1123456899999999999999987654332 221 122333 344544 Q ss_pred c--CCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChh---hhc Q lcl|Aclame:pro 69 L--NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAE---KFM 143 (331) Q Consensus 69 l--N~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~---~F~ 143 (331) - +...++++.++.+++-.|+-+...+.|...+.+-.-...++...-...+.++++..+.+.+||||....|. 
..+ T Consensus 78 ~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~ 157 (321) T protein:vir:31 78 DEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQND 157 (321) T ss_pred cccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccch Confidence 2 22344566788999999999999999999888764321245555666788999999999999998654431 111 Q ss_pred CchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEee Q lcl|Aclame:pro 144 GLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWD 223 (331) Q Consensus 144 GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~ 223 (331) |+-+. ++. + T Consensus 158 G~l~~---------------------------------------a~~---~----------------------------- 166 (321) T protein:vir:31 158 GFITV---------------------------------------AEG---D----------------------------- 166 (321) T ss_pred hhhhh---------------------------------------hcc---c----------------------------- Confidence 21110 000 0 Q ss_pred eceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC--CCCceEEEeCHHHHHHHHHHhhcCCcceeeee Q lcl|Aclame:pro 224 IGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV--GMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) Q Consensus 224 ~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~--~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~ 301 (331) ...++. ..+. .+ .+.+++++..||.. ..++.+||||++....++....++... +-. 
T Consensus 167 -------------~~~~~~---~~~~---~~-~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~--~~~ 224 (321) T protein:vir:31 167 -------------VETIDA---ADDI---LD-NDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTP--LGD 224 (321) T ss_pred -------------cccccc---cccc---cC-HHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCc--ccc Confidence 000110 0000 11 23456666667643 345689999999988777655555432 222 Q ss_pred cccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 302 EEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 302 ~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) .-..+.....+.|+||..++.+-.....+. T Consensus 225 ~~l~~~~~~tl~G~pvv~~~~mP~~~il~t 254 (321) T protein:vir:31 225 NVIMGEADVNPFSFPIIGSGLWPDDKAMFT 254 (321) T ss_pred chhhccccccccceeEEEcCCCCCCcEEEe Confidence 223333445688999999988754322222 No 74 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.62 E-value=1.8e-08 Score=63.03 Aligned_cols=216 Identities=9% Similarity=0.040 Sum_probs=130.1 Q ss_pred CCc--CccccccHHHHHH----------hcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEee-cCCcce Q lcl|Aclame:pro 1 MPT--LSTTNPTLADVAA----------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSG-LPTGTW 66 (331) Q Consensus 1 M~~--l~~~a~TL~E~Ak----------~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~-lP~~~f 66 (331) +.. .....+...+... .+-+. 
.+...||+.+.+.++|++.++.....++.+ +.+.+..+ -+.+.| T Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (408) T protein:vir:74 98 VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQ-DIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAM 176 (408) T ss_pred HHHHhcchhhhhhhhhhhhcccccCCCceeech-hHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccc Confidence 000 0000001111100 01121 245679999999999999998877644432 33444433 356679 Q ss_pred eecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcC Q lcl|Aclame:pro 67 RKLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMG 144 (331) Q Consensus 67 R~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~G 144 (331) ..=++.+++ ++.++.+++-.++-+++.+.|-+.+.+... +...+ -.....++++......|++|+....| T Consensus 177 v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~~~~~~~d~~il~G~G~~~~----- 248 (408) T protein:vir:74 177 DEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAW---LSSWIAKKVVVTRNQAIIAAMGTVPK----- 248 (408) T ss_pred cccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHH---HHHHHHHHHHHHHHHHHhhccccccc----- Confidence 999999987 678999999999999999999999887644 34443 33457899999999999999632110 Q ss_pred chhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeee Q lcl|Aclame:pro 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDI 224 (331) Q Consensus 145 L~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~ 224 (331) ++ T Consensus 249 --------------------------------------------~~---------------------------------- 250 (408) T protein:vir:74 249 --------------------------------------------KP---------------------------------- 250 (408) T ss_pred --------------------------------------------cc---------------------------------- Confidence 00 Q ss_pred ceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc Q lcl|Aclame:pro 225 
GLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI 304 (331) Q Consensus 225 Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~ 304 (331) ...+..+|.+++. ..++.....+.+|+||++....|+.. .+..+...+.+ +. T Consensus 251 -----------------------~~~~~~~i~~~~~---~~l~~~~~~~a~~v~n~~~~~~l~~l-kd~~G~~l~~~-~~ 302 (408) T protein:vir:74 251 -----------------------TIANFDDVITMIN---TSVDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEP-DP 302 (408) T ss_pred -----------------------ccccHHHHHHHHH---HhhhhhhcCCCEEEEcHHHHHHHHHh-hcCCCceEecc-Cc Confidence 0001122333222 24444445668999999999999975 44545443333 33 Q ss_pred CCceeEEEcCeEEEEEEe-cc----CCCcccC Q lcl|Aclame:pro 305 AGKKVVAFDGIPCRRTDA-LL----LTEARVV 331 (331) Q Consensus 305 ~g~~v~~~~gvpir~~da-i~----~tE~~Vv 331 (331) .+...-.+.|.||..++. .+ .++..++ T Consensus 303 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~ 334 (408) T protein:vir:74 303 TKPNSYLIKGKQVIVVADRWLPNSGSTVYPLY 334 (408) T ss_pred CCCCCceecceeeEEecCcccccccCCcceEE Confidence 333345789999998864 22 1222223 No 75 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.57 E-value=3.6e-08 Score=61.41 Aligned_cols=216 Identities=8% Similarity=0.004 Sum_probs=131.0 Q ss_pred CCcCccccccHHHHH---Hh-------cCCccchhHHHHHHHhccchhHhhcceecccCCccc-eeEEE-eecCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA---AR-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEH-KTTVR-SGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~A---k~-------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~-~~~~~-~~lP~~~fR~ 68 (331) +.--+...+...+.. .. +-+. .+...||+.+.+.++|++.+.+....+..+. .+... +.-+.+.|.. 
T Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 178 (408) T protein:vir:10 100 MVRNPMAFMNTVSSKTETSGSDSAAGLTIPQ-DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDA 178 (408) T ss_pred HhhcchhhhhhhhhhhhhcccccCCceeccH-hHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeec Confidence 000000011111110 00 1122 2446799999999999999998887655443 23333 3347788999 Q ss_pred cCCccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCch Q lcl|Aclame:pro 69 LNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) Q Consensus 69 lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~ 146 (331) =++.+++ +..++.+++-..+-+++.+.|-+.+.+... +..++-. ....++++......|++|+....+ T Consensus 179 E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~l~~~~~~~~~~~il~g~g~~~~------- 248 (408) T protein:vir:10 179 EDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLS---SWIAKKVVVTRNQAIIEVMKAAPK------- 248 (408) T ss_pred CccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHH---HHHHHHHHHHHHHHHhhccccccc------- Confidence 9999986 568999999999999999999999887644 4444433 447889999999999988632110 Q ss_pred hhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeece Q lcl|Aclame:pro 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) Q Consensus 147 ~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl 226 (331) ++ T Consensus 249 ------------------------------------------~~------------------------------------ 250 (408) T protein:vir:10 249 ------------------------------------------KP------------------------------------ 250 (408) T ss_pred ------------------------------------------cc------------------------------------ Confidence 00 Q ss_pred EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCC Q lcl|Aclame:pro 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG 
306 (331) Q Consensus 227 ~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g 306 (331) ...+..+|.++|.. .++....++.+|+||++...+|+.. .+..+...+.+ ++.. T Consensus 251 ---------------------~~~~~~~l~~~~~~---~~~~~~~~~a~~v~n~~~~~~l~~l-kd~~G~~i~~~-~~~~ 304 (408) T protein:vir:10 251 ---------------------TIAKFDDVITMINT---AVDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEP-DPTK 304 (408) T ss_pred ---------------------ccccHHHHHHHHHH---hhhhhhccCCEEEEcHHHHHHHHHh-hccCCceEecc-CcCC Confidence 00112233333322 3444445668999999999999975 45555444433 3332 Q ss_pred ceeEEEcCeEEEEEEec-cCC-Cc---ccC Q lcl|Aclame:pro 307 KKVVAFDGIPCRRTDAL-LLT-EA---RVV 331 (331) Q Consensus 307 ~~v~~~~gvpir~~dai-~~t-E~---~Vv 331 (331) ...-.+.|.||..+++. +.+ .+ .++ T Consensus 305 ~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~ 334 (408) T protein:vir:10 305 PNSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) T ss_pred CCCceecceeeEEecccccCccCCCceEEE Confidence 33357899999998752 221 11 122 No 76 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.56 E-value=1.8e-08 Score=63.10 Aligned_cols=209 Identities=9% Similarity=0.059 Sum_probs=129.9 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEe-ecCCcceeecCCccCc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRS-GLPTGTWRKLNYGVQP-EK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~-~lP~~~fR~lN~g~~~-s~ 77 (331) |..- |..+- ..+-+. .+...|||.+.+.++|++....+......+ +.+.... 
.-+.++|..=++.+++ ++ T Consensus 5 ~~~~-----t~~~g-g~liP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 77 (293) T protein:vir:48 5 KTDH-----SGSDA-GLTIPQ-DIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDD 77 (293) T ss_pred eccc-----ccCcC-ceEech-hHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccc Confidence 3321 11111 112222 244679999999999988877655443322 4444443 4578899999999987 67 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) .++.+++-.++-+++.+.|.+.+.+... +..++-. ....++++......|++|+... T Consensus 78 ~~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~la~~~~~~~~~~i~~g~~~~------------------- 135 (293) T protein:vir:48 78 PKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLS---GWIAKKVVVTRNKAILGVVDKL------------------- 135 (293) T ss_pred cceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHH---HHHHHHHHHHHHhHHhhccccc------------------- Confidence 8999999999999999999998877643 3444433 3467888888887777764210 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) ++.. 
T Consensus 136 --------~~~~-------------------------------------------------------------------- 139 (293) T protein:vir:48 136 --------PTKP-------------------------------------------------------------------- 139 (293) T ss_pred --------cccc-------------------------------------------------------------------- Confidence 0000 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeE Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIP 316 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvp 316 (331) ...+-.+ +++++.++......+.+|+||++.+..|+.+ .+..+...+ ..++.+-..-.+.|.| T Consensus 140 -----------~~~~~d~----i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~l-kd~~g~~l~-~~~~~~~~~~~l~G~P 202 (293) T protein:vir:48 140 -----------TLTKWDD----IIDLEAKVDPAIKQTSFFLTNTSGFTALKKV-KNALGDYLM-ERDVKSPTGYSIAGFA 202 (293) T ss_pred -----------cccCHHH----HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHh-hccCCceEe-ecCcCCCCCceeccee Confidence 0001122 4445556655566678999999999999965 444444323 3334434445799999 Q ss_pred EEEEEecc--CCCcc---cC Q lcl|Aclame:pro 317 CRRTDALL--LTEAR---VV 331 (331) Q Consensus 317 ir~~dai~--~tE~~---Vv 331 (331) |+.|+... 
+..+- ++ T Consensus 203 v~~~~~~~~~~~~~~~~~~~ 222 (293) T protein:vir:48 203 VKEISDRWLPNASSGVMPLY 222 (293) T ss_pred eEEecccccCCccCCceEEE Confidence 99887532 22221 11 No 77 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=98.56 E-value=9.2e-09 Score=64.64 Aligned_cols=218 Identities=11% Similarity=0.090 Sum_probs=130.1 Q ss_pred CCcCccccccH-HHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcce--eecCCccCccc Q lcl|Aclame:pro 1 MPTLSTTNPTL-ADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW--RKLNYGVQPEK 77 (331) Q Consensus 1 M~~l~~~a~TL-~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~f--R~lN~g~~~s~ 77 (331) +.+.....+|. .+.+- +-+. .+...||+.+.+.++|++.++.....++ .+.|.+.++.+++.| ..=++..++++ T Consensus 101 ~~~~~~~~~~~~~~~~~-~ip~-~~~~~ii~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~Eg~~~~~~~ 177 (379) T protein:vir:10 101 IQVKAVGDMTLPVNLTG-AQPK-DYNFDVVLNPSQMLNVSDIVGAVSISGG-TYTFVRENGAGEGAIGAQVEGATKGQKD 177 (379) T ss_pred hhhhhhcccccCCCCcc-ccch-hhhhHHHHhHHhhhhHHhhceeeeccCC-ceEEEEeecCCCcccccccCCccccccc Confidence 11100111110 00000 1122 2356799999999999999988776544 578888887766655 44456678889 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-..+-+++.+.|.+.+.+-.++..+|- .....++++......|++|+.+. 
T Consensus 178 ~~f~~i~~~~~k~~~~~~iS~ell~D~~~l~~~i---~~~la~~~~~~~~~~~~~g~~~~-------------------- 234 (379) T protein:vir:10 178 YDISMIDVNTDFIAGFTRYSKKMANNLPFLTSFI---PNALRRDYAKAENAAFNAVLAAN-------------------- 234 (379) T ss_pred cceeeeEeeeeeEEeeehhhHHHHhhHHHHHHHH---HHHHHHHHHHHHHHHHhcccccc-------------------- Confidence 9999999999999999999999877654433333 34467788887777777664210 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) ++++.. + T Consensus 235 -------~~~~~~-------------------~----------------------------------------------- 241 (379) T protein:vir:10 235 -------ATASTE-------------------I----------------------------------------------- 241 (379) T ss_pred -------cccccc-------------------c----------------------------------------------- Confidence 000000 0 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc-CCceeEEEcCeE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-AGKKVVAFDGIP 316 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~-~g~~v~~~~gvp 316 (331) . ++.+ .++-+++++..+......+.+|+||+.....|+.. .+..+...+.+.-. .+.....++|+| T Consensus 242 -------~-~~~~----~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~~~l~G~p 308 (379) T protein:vir:10 242 -------I-TNKN----KVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVT-QKSVGAGYGLPGVVTQDNGVLRINGIP 308 (379) T ss_pred -------c-cCcc----cHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh-hccCCceeccCCccCCCCCcceeccee Confidence 0 0001 12335566666655555667899999999999865 34444332222111 112223688999 Q ss_pred EEEEEeccCCCcccC Q lcl|Aclame:pro 317 CRRTDALLLTEARVV 331 (331) Q Consensus 317 ir~~dai~~tE~~Vv 331 (331) |..++++-.+. 
.++ T Consensus 309 vv~s~~~~ag~-~~~ 322 (379) T protein:vir:10 309 LFRATWLAANK-YYV 322 (379) T ss_pred eEecCCCCCCc-eEE Confidence 99999885432 222 No 78 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.54 E-value=2.9e-08 Score=61.92 Aligned_cols=231 Identities=12% Similarity=0.099 Sum_probs=132.1 Q ss_pred CCcCccccccHHH-----HH-Hh--------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcce Q lcl|Aclame:pro 1 MPTLSTTNPTLAD-----VA-AR--------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW 66 (331) Q Consensus 1 M~~l~~~a~TL~E-----~A-k~--------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~f 66 (331) +..-....+|-.+ .+ +. +-+. .+...|++.+.+.++|++.+.++...++ ...+.++++-|.+.| T Consensus 64 ~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~~~~~s~i~~~~~~~~~~~~-~~~i~~~~~~~~a~~ 141 (390) T protein:vir:40 64 LASRGANALTSDESKYYNEVIAGNGFAGVTALLPP-TVFERVFEDLTVEHPLLSKINFVNTTAT-TEWIISVGDVATAWW 141 (390) T ss_pred HHhcCchhccHHHHHHHHHHHhccCcccCcccccH-HHHHHHHHHHHhhhhhhhhceeeecCCc-eeEEEEEcCCcceee Confidence 0000001111111 00 00 1111 2345699999999999999998876544 344677889999999 Q ss_pred eecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCc Q lcl|Aclame:pro 67 RKLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGL 145 (331) Q Consensus 67 R~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL 145 (331) ..-++.+++ ++.++.+++-.++-+.+.+.|.+.+.+...- ++-+.-...+.++++..+.+.|++||-...|. 
|+ T Consensus 142 ~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~--~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~---Gi 216 (390) T protein:vir:40 142 GPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPS--WLDQYVRTILGEAMALGLEAGIVNGSGKDQPI---GM 216 (390) T ss_pred eccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchH--HHHHHHHHHHHHHHHHHHHhhhhcccCCCccc---ee Confidence 999988875 5789999999999999999999999887652 23344445588999999999999997543332 22 Q ss_pred hhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeec Q lcl|Aclame:pro 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) Q Consensus 146 ~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~G 225 (331) -... . +.+. +..+ +.+ T Consensus 217 l~~~---~----------~~~~-----------------~~~~-~~~--------------------------------- 232 (390) T protein:vir:40 217 MRDL---N----------NVTA-----------------GEHP-VKT--------------------------------- 232 (390) T ss_pred eecc---c----------cccc-----------------cccc-ccc--------------------------------- Confidence 1100 0 0000 0000 000 Q ss_pred eEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCH-HHHHHHHHH--hhcCCcceeeeec Q lcl|Aclame:pro 226 LTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPR-KIRSFLRRQ--ITNKVAASTLTME 302 (331) Q Consensus 226 l~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~-~v~~~L~~~--~~~~~~~~~~~~~ 302 (331) +.-+... ...++.+-+..++..-+....++.+|+||+ +....|... ..++.... +.. 
T Consensus 233 --------~~~~t~~----------~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~-v~~- 292 (390) T protein:vir:40 233 --------ATPLTDL----------TPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVW-VTG- 292 (390) T ss_pred --------ccccchh----------hHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcc-ccc- Confidence 0000011 112344445555544444456678999995 445555422 22222221 111 Q ss_pred ccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 303 EIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 303 ~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) ..+.|+||..++++-.. .|+ T Consensus 293 -------~~~~g~pvv~~~~~p~~--~i~ 312 (390) T protein:vir:40 293 -------ILPVPLEIVQSVAVPVG--KAV 312 (390) T ss_pred -------cCCCceeEEEcCCCCCC--cEE Confidence 12358999999888432 244 No 79 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.51 E-value=5.8e-08 Score=60.26 Aligned_cols=213 Identities=9% Similarity=-0.005 Sum_probs=130.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEee-cCCcceeecCCccCcc-c Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSG-LPTGTWRKLNYGVQPE-K 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~-lP~~~fR~lN~g~~~s-~ 77 (331) |.. ...|..+.. -+-+. .+...||+.+.+.++|++.++.....++.+ +.+....+ -|.+.|..-++.++++ . 
T Consensus 105 ~~~---~~~~~~~gg-~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 179 (395) T protein:vir:38 105 VTS---GTTGTGNAG-LTIPE-DIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDD 179 (395) T ss_pred Hhh---ccCccCCCc-eecch-hHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccc Confidence 111 011111100 01122 234679999999999999988876544433 33444443 3677899999999876 5 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) .++.+++-.++-+++.+.|.+.+.+.... ++...-...+.++++......|++|+....+ T Consensus 180 ~~f~~v~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~la~~~~~~~~~~il~g~g~~~~------------------ 239 (395) T protein:vir:38 180 PELTVVKYLIHRYAGITTVTNTLLKDTVD--NIIQWLVNWAAKKDVVTRNAKILEVMGKAPK------------------ 239 (395) T ss_pred cceeeEEeeeeeeEeehhhHHHHHhhhHH--HHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------ Confidence 89999999999999999999998876442 2333344558899999999999998632110 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) . 
+ T Consensus 240 ------------~-------------------~----------------------------------------------- 241 (395) T protein:vir:38 240 ------------K-------------------P----------------------------------------------- 241 (395) T ss_pred ------------c-------------------c----------------------------------------------- Confidence 0 0 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) ......+|.+++.. .++.......+|+||+.....|+.. .+..+...+.+ .+.+...-.+.|.|| T Consensus 242 ----------~~~~~~~i~~~~~~---~l~~~~~~~a~~v~n~~~~~~L~~l-kd~~G~~l~~~-~~~~~~~~~l~G~pV 306 (395) T protein:vir:38 242 ----------TISQFDNIKDLENN---TLDPAIESTSSFITNQSGYNILSKV-KDADGRYLMQP-DVTSPDKYLIDGKPV 306 (395) T ss_pred ----------ccccHHHHHHHHHH---hhhhhhcCCCEEEEcHHHHHHHHHh-hccCCceeecc-CcCCCCcceecccee Confidence 00011233333332 2333445568999999999999864 45555443333 333334457889999 Q ss_pred EEEEecc-C---CCcccC Q lcl|Aclame:pro 318 RRTDALL-L---TEARVV 331 (331) Q Consensus 318 r~~dai~-~---tE~~Vv 331 (331) ..+|+.. . .+..++ T Consensus 307 ~~~~~~~~~~~~~~~~i~ 324 (395) T protein:vir:38 307 IRIADKWLPDVSGSHPLY 324 (395) T ss_pred EEecccccCcCCCcceEE Confidence 9998642 1 222233 No 80 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.50 E-value=5.9e-08 Score=60.23 Aligned_cols=233 Identities=12% Similarity=0.065 Sum_probs=133.5 Q ss_pred CCcCc--cccccHHHHHH-----h-----------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee-c Q lcl|Aclame:pro 1 MPTLS--TTNPTLADVAA-----R-----------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-L 61 (331) Q Consensus 1 M~~l~--~~a~TL~E~Ak-----~-----------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~-l 61 (331) +..+. 
...++-.|... . +-+. .+...||+.+.+.++|+.....+...++....+.+..+ . T Consensus 93 ~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 171 (409) T protein:vir:45 93 DKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPE-TFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTS 171 (409) T ss_pred HHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccH-hHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCc Confidence 00000 00111111110 0 0111 23456999999999999988877665443333444433 3 Q ss_pred CCcceeecCCccCcccceEEEEEEEE-EEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChh Q lcl|Aclame:pro 62 PTGTWRKLNYGVQPEKSRTVQVKDSM-GMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAE 140 (331) Q Consensus 62 P~~~fR~lN~g~~~s~~t~~~~~~~l-~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~ 140 (331) +.+.|..=++..+++..++.+++-.. ++.+..+.|-+.+.+... -++...-.....++++.+..+.|+|||-+..+. T Consensus 172 ~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~--~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~ 249 (409) T protein:vir:45 172 EVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSA--IDMEAYLARRIAERIGRGEARYLIQGTGAGTPK 249 (409) T ss_pred cccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcc Confidence 55679999999999999999988665 555678889988877642 134444445678999999999999998665444 Q ss_pred hhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEE Q lcl|Aclame:pro 141 KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHY 220 (331) Q Consensus 141 ~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~ 220 (331) ++.|+-... ++.. 
T Consensus 250 ~p~Gil~~~----------------~~~~--------------------------------------------------- 262 (409) T protein:vir:45 250 QPKGLAASV----------------TGTT--------------------------------------------------- 262 (409) T ss_pred ccceeeecc----------------cccc--------------------------------------------------- Confidence 555552100 0000 Q ss_pred EeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC--CCCceEEEeCHHHHHHHHHHhhcCCccee Q lcl|Aclame:pro 221 KWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV--GMGRPAFYMPRKIRSFLRRQITNKVAAST 298 (331) Q Consensus 221 ~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~--~~g~~~~y~n~~v~~~L~~~~~~~~~~~~ 298 (331) +.. ..+..+ .+.+++++..|+.. .....+||||++....|+.. .+..+... T Consensus 263 ------------------~~~----~~~~~~----~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~l-kd~~G~~i 315 (409) T protein:vir:45 263 ------------------QTA----AANAVK----WQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEM-EDGQGRPL 315 (409) T ss_pred ------------------ccc----cccccc----hHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHh-hcCCCcee Confidence 000 000111 12234444455432 23345789999999999864 45555443 Q ss_pred eeecccCCceeEEEcCeEEEEEEeccC---CCcccC Q lcl|Aclame:pro 299 LTMEEIAGKKVVAFDGIPCRRTDALLL---TEARVV 331 (331) Q Consensus 299 ~~~~~~~g~~v~~~~gvpir~~dai~~---tE~~Vv 331 (331) +.+ .+.+.....+.|.||..+|.+-. 
+...|+ T Consensus 316 ~~~-~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~ 350 (409) T protein:vir:45 316 WLP-DIVGVAPASVLNVPYVIDQEIDDIGAGKKFMF 350 (409) T ss_pred ecc-CcCCCCCceecceeeEEecCcCCccCCccEEE Confidence 333 33323335789999999998742 222222 No 81 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.45 E-value=6.6e-08 Score=59.95 Aligned_cols=239 Identities=8% Similarity=0.027 Sum_probs=132.0 Q ss_pred CCcC----ccccccHH-HHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCC-cceeecCC-- Q lcl|Aclame:pro 1 MPTL----STTNPTLA-DVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPT-GTWRKLNY-- 71 (331) Q Consensus 1 M~~l----~~~a~TL~-E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~-~~fR~lN~-- 71 (331) ...+ ...+++-. ...-.+-+...+...|||.+.+.++|++.+..+...++.+ +.+.+..+-+. +.|-.=+. T Consensus 145 ~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~ 224 (477) T protein:vir:84 145 RKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAAL 224 (477) T ss_pred HHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCccc Confidence 0000 00000000 0000011222223459999999899988777665544433 44554444443 44554332 Q ss_pred ---ccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchh Q lcl|Aclame:pro 72 ---GVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTP 147 (331) Q Consensus 72 ---g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~ 147 (331) ..+.++.++.+++-..+-+++.+.|.+.+.+... +..++-.. ...++++.....+|+|||-..+ ...||-. 
T Consensus 225 ~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~---~l~~~~~~~~d~~~l~G~Gt~~--~p~Gi~~ 299 (477) T protein:vir:84 225 TAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFR---DLAADYANKLNVQVISGTGSNN--QVVGVRA 299 (477) T ss_pred ccccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHH---HHHHHHHHHHHHHHhccCCCCC--ccceeee Confidence 3467778899999999999999999999988855 55555444 4789999999999999974321 1233321 Q ss_pred hhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceE Q lcl|Aclame:pro 148 RFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLT 227 (331) Q Consensus 148 R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~ 227 (331) . .+.+ T Consensus 300 ~---------------~~~~------------------------------------------------------------ 304 (477) T protein:vir:84 300 T---------------AGIT------------------------------------------------------------ 304 (477) T ss_pred c---------------cccc------------------------------------------------------------ Confidence 0 0000 Q ss_pred EecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC-CCCceEEEeCHHHHHHHHHHhhcCCcceeeeecc--- Q lcl|Aclame:pro 228 LRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV-GMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEE--- 303 (331) Q Consensus 228 v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~-~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~--- 303 (331) ++....-......-..+.+.+++++..+... .....+|+||.+...+|+.. .+..+.+...+.. 
T Consensus 305 -----------~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~~~ 372 (477) T protein:vir:84 305 -----------QVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAI-FAGDDRPLIVPSGPGF 372 (477) T ss_pred -----------cccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHh-hccCCCeeeecCcccc Confidence 0110000000011123455678888776533 33446899999999999875 3444443332221 Q ss_pred ---------cCCceeEEEcCeEEEEEEeccC------CCcccC Q lcl|Aclame:pro 304 ---------IAGKKVVAFDGIPCRRTDALLL------TEARVV 331 (331) Q Consensus 304 ---------~~g~~v~~~~gvpir~~dai~~------tE~~Vv 331 (331) +.....-.++|+||..+++|-. .+..++ T Consensus 373 ~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~ 415 (477) T protein:vir:84 373 NNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIH 415 (477) T ss_pred cccccccccccccccchhcccceEecCcccccccccCCcceEE Confidence 1122233688999999998832 233444 No 82 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.45 E-value=5.3e-08 Score=60.46 Aligned_cols=230 Identities=12% Similarity=0.063 Sum_probs=126.1 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcce-ecccC--CccceeEEEeecCCcceeecCCccCccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTV-IEANG--FTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf-~e~n~--~~~~~~~~~~~lP~~~fR~lN~g~~~s~ 77 (331) +..+..+..|=.+.+-.+.-...+...|||.+.+.+.+....+- +.... 
+......+.++-|.++|..=++..+.++ T Consensus 331 ~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~ 410 (645) T protein:vir:93 331 KSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTK 410 (645) T ss_pred hhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccc Confidence 00000000000010100111112345699999888877655322 11111 2235566778889999999999999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) .++.+++-..+-|++.+.|.+.|.+... +.+++-. ....++++..+..+||+|+.. T Consensus 411 ~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~---~~l~~aia~~~d~a~l~g~g~-------------------- 467 (645) T protein:vir:93 411 FDFESITFSHAKVSAIAVLTEELIRFSSPAADALVR---NALAEAVVARLDTDFVDPKKA-------------------- 467 (645) T ss_pred cceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHH---HHHHHHHHHHHHHHhhcCCCc-------------------- Confidence 9999999999999999999999877653 5554433 447899999999999988622 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) |+++.. |+|-..|. . 
T Consensus 468 -------~~~~~~------------------p~gi~~~~------------------~---------------------- 482 (645) T protein:vir:93 468 -------AVADVS------------------PASITHDV------------------K---------------------- 482 (645) T ss_pred -------ccCCcc------------------ccceeccc------------------c---------------------- Confidence 111111 11100000 0 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeE Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIP 316 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvp 316 (331) .+. ..+....++..++.. +.. .+..+...+|+||++....|+.+ .+......+..-...| -.+.|.| T Consensus 483 --~~~-----~~~~~~~d~~~~~~~-~~~-a~~~~~~a~~vmn~~~~~~L~~l-kd~~G~~~~~~~~~~~---~tL~G~P 549 (645) T protein:vir:93 483 --GTA-----SSGNPDADAEAAFGQ-FVA-ANLQPTGAVWLMSSTNALALSMR-KNALGQKEYPDMTLLG---GSFQGLP 549 (645) T ss_pred --ccc-----cccchHHHHHHHHHH-HHh-cCCCccccEEEEcHHHHHHHHhc-cccCCceeecCCCCCC---ceeecee Confidence 000 001111233332222 111 12234457899999999999865 3333333221111222 2589999 Q ss_pred EEEEEeccCC-----CcccC Q lcl|Aclame:pro 317 CRRTDALLLT-----EARVV 331 (331) Q Consensus 317 ir~~dai~~t-----E~~Vv 331 (331) |..++++-.+ -..+. 
T Consensus 550 V~~s~~vp~~~~~gd~s~~~ 569 (645) T protein:vir:93 550 VIVSQYVGDQLVLVNAPDIY 569 (645) T ss_pred eEEeccCCcceeEeccccEE Confidence 9999886321 11111 No 83 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.35 E-value=9.1e-08 Score=59.18 Aligned_cols=211 Identities=9% Similarity=-0.016 Sum_probs=124.6 Q ss_pred CCcC--ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEe-ecCCcceeecCCccCc-c Q lcl|Aclame:pro 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRS-GLPTGTWRKLNYGVQP-E 76 (331) Q Consensus 1 M~~l--~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~-~lP~~~fR~lN~g~~~-s 76 (331) +... .....|..+- .-+-+. .+...||+.+.+.++|+..+++....++. +.+.+.. +-++++|..=++..++ + T Consensus 121 ~~~~~~~~~~~t~~~g-g~liP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~~~~~~ 197 (394) T protein:vir:97 121 TTPVEPQKDGIKKENA-KPVSSE-EILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPALA 197 (394) T ss_pred hhhhhhhccccccccc-cccChH-HHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceecccccccccc Confidence 0000 0000011110 001122 23456999999999999999988765554 4455443 4467788877777876 5 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) ..++.+++-.++-+++.+.|-+.+.+... +..++ -.....++++....+.|++|..+. 
T Consensus 198 ~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~---i~~~la~~~~~~~~~~i~~g~~~~------------------ 256 (394) T protein:vir:97 198 KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGI---VSESISQIKVNTTNDAIAKVLKSF------------------ 256 (394) T ss_pred cccceeEEeehhheeeehhhHHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHHhhccccc------------------ Confidence 68999999999999999999998888654 34443 344477888888888888764210 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) + |+ T Consensus 257 ----------~---------------------~~---------------------------------------------- 259 (394) T protein:vir:97 257 ----------T---------------------TK---------------------------------------------- 259 (394) T ss_pred ----------c---------------------cc---------------------------------------------- Confidence 0 00 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv 315 (331) .+.+..+|.+++... ++. ..+..|+||++....|+.. .+..+...+.+ ++.+...-.++|. 
T Consensus 260 ------------~~~~~~~~~~~~~~~---~~~--~~~a~~v~n~~~~~~l~~l-kd~~G~~i~~~-~~~~~~~~~l~G~ 320 (394) T protein:vir:97 260 ------------TVKNLDEIKALLNGG---FDP--AYNVSLIVSQSFYQTLDTL-KDGNGRYLLQD-DITAVSGKVLLGK 320 (394) T ss_pred ------------ccccHHHHHHHHHhh---hhh--hhCCEEEEcHHHHHHHHHh-hccCCCeeeec-CcCCCCCceeccc Confidence 001112233333221 211 2246899999999999864 45544443322 3333333478999 Q ss_pred EEEEEEeccCCCcccC Q lcl|Aclame:pro 316 PCRRTDALLLTEARVV 331 (331) Q Consensus 316 pir~~dai~~tE~~Vv 331 (331) ||..+++.......++ T Consensus 321 pv~~~~~~~~~~~~~~ 336 (394) T protein:vir:97 321 PVFVLSDEVLGANKAF 336 (394) T ss_pred eeEEecccccCCccEE Confidence 9999887543332232 No 84 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.30 E-value=4.7e-07 Score=55.25 Aligned_cols=219 Identities=13% Similarity=0.138 Sum_probs=125.9 Q ss_pred CCcCccccccHHHHHHh---------cCCccchhHHHHHHHhccchhHhh-cceecccCCccceeEEEeecCCcceeecC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAAR---------MTPDGKIDPQIVEMLNETNEILDD-MTVIEANGFTEHKTTVRSGLPTGTWRKLN 70 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~---------~~~~~~~~~~VIE~l~~~s~iL~~-lpf~e~n~~~~~~~~~~~~lP~~~fR~lN 70 (331) ||. . .|..-|.. +-+...+...+||.+.+.+.+... 
...+....+ .+.+.+.++-|+++|..=+ T Consensus 347 ~~~---~--~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g-~~~ip~~~~~~~a~wv~E~ 420 (632) T protein:vir:96 347 MPH---E--VLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTSGANFYWIGED 420 (632) T ss_pred hhH---H--HHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCc-ceEEEEEeCCceeEeecCC Confidence 221 0 01111100 112222335689999886665553 233444433 4778888899999999999 Q ss_pred CccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhh Q lcl|Aclame:pro 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRF 149 (331) Q Consensus 71 ~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~ 149 (331) +.++.++.++.+++-..+-+++.+.|-+.+.+... +..++ -.....++++......||||+...+ +..|+-. T Consensus 421 ~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~---i~~~l~~a~~~~~d~a~l~G~G~~~--~p~Gi~~-- 493 (632) T protein:vir:96 421 EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENL---IREDLIEGIGVALDLAMLTGTGLAN--DPVGLLN-- 493 (632) T ss_pred ccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHH---HHHHHHHHHHHHHHHHhhcccCCCC--ccceeee-- Confidence 99999999999999999999999999999877643 33333 3345789999999999999963211 1112100 Q ss_pred ccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEe Q lcl|Aclame:pro 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLR 229 (331) Q Consensus 150 ~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~ 229 (331) .+| + T Consensus 494 ---------------~~~---------------------------~---------------------------------- 497 (632) T protein:vir:96 494 ---------------MTG---------------------------V---------------------------------- 497 (632) T ss_pred ---------------ccc---------------------------c---------------------------------- Confidence 000 0 Q ss_pred cccceeeeeccccccCCCCccchhhHHHHHHHHHHHhh--cCCCCceEEEeCHHHHHHHHHHh-hcCCcceeeeecccCC Q lcl|Aclame:pro 230 
DWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP--NVGMGRPAFYMPRKIRSFLRRQI-TNKVAASTLTMEEIAG 306 (331) Q Consensus 230 d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip--~~~~g~~~~y~n~~v~~~L~~~~-~~~~~~~~~~~~~~~g 306 (331) .++.. ...+.+-.++.+ ++.++. +...++.+|+||.....+|.... .+......+. .| T Consensus 498 --------~~~~~---~~~~~~~~~i~~----~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~----~~ 558 (632) T protein:vir:96 498 --------PALTY---PAGGVDWASVVD----METKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ----NN 558 (632) T ss_pred --------cceec---ccccCCHHHHHH----HHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeec----CC Confidence 00000 000111122222 223332 23455688999999998888643 2333332221 11 Q ss_pred ceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 307 KKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 307 ~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) .++|.|+..+++|-..-.... T Consensus 559 ----~l~G~pv~~s~~ip~~~~~~g 579 (632) T protein:vir:96 559 ----EVNGYRAEASNQIPADTWIFG 579 (632) T ss_pred ----eecccceEeccccccCcEEEe Confidence 467889988888743321111 No 85 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.25 E-value=4.2e-07 Score=55.52 Aligned_cols=210 Identities=11% Similarity=0.036 Sum_probs=127.4 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) |... |-.+ ..-+-+. .+...||+.+.+.++|++.+......++.+ +.+.+.++-++++|..=++..+++ .. 
T Consensus 106 ~~~~-----t~~~-gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred cccc-----ccCC-Cceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 1000 0001122 244679999999999999988877654433 345556777889999988898876 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....|++|+.+..+ T Consensus 179 ~~~~v~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred cceeEEeeeeeEEEeehhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 999999999999999999998876532 12333344557788899998888888643110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q 
Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) ..+..+|.+++. ..++.....+..|+||++.+..|+.. .+..+...+.+ ++.+...-.+.|.|++ T Consensus 240 ----------~~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~l-kd~~G~~l~~~-~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQS-DPTQKNKKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHh-hccCCCeEeec-CccCCccccccCcccE Confidence 000122333222 13444555668899999999999864 55555443322 3333333467888655 Q ss_pred E-EEecc-------CCCcccC Q lcl|Aclame:pro 319 R-TDALL-------LTEARVV 331 (331) Q Consensus 319 ~-~dai~-------~tE~~Vv 331 (331) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 5 33321 1222222 No 86 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.25 E-value=4.2e-07 Score=55.52 Aligned_cols=210 Identities=11% Similarity=0.036 Sum_probs=127.4 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) |... |-.+ ..-+-+. .+...||+.+.+.++|++.+......++.+ +.+.+.++-++++|..=++..+++ .. 
T Consensus 106 ~~~~-----t~~~-gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred cccc-----ccCC-Cceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 1000 0001122 244679999999999999988877654433 345556777889999988898876 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....|++|+.+..+ T Consensus 179 ~~~~v~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred cceeEEeeeeeEEEeehhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 999999999999999999998876532 12333344557788899998888888643110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q 
Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) ..+..+|.+++. ..++.....+..|+||++.+..|+.. .+..+...+.+ ++.+...-.+.|.|++ T Consensus 240 ----------~~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~l-kd~~G~~l~~~-~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQS-DPTQKNKKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHh-hccCCCeEeec-CccCCccccccCcccE Confidence 000122333222 13444555668899999999999864 55555443322 3333333467888655 Q ss_pred E-EEecc-------CCCcccC Q lcl|Aclame:pro 319 R-TDALL-------LTEARVV 331 (331) Q Consensus 319 ~-~dai~-------~tE~~Vv 331 (331) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 5 33321 1222222 No 87 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.25 E-value=4.2e-07 Score=55.52 Aligned_cols=210 Identities=11% Similarity=0.036 Sum_probs=127.4 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) |... |-.+ ..-+-+. .+...||+.+.+.++|++.+......++.+ +.+.+.++-++++|..=++..+++ .. 
T Consensus 106 ~~~~-----t~~~-gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred cccc-----ccCC-Cceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 1000 0001122 244679999999999999988877654433 345556777889999988898876 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....|++|+.+..+ T Consensus 179 ~~~~v~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred cceeEEeeeeeEEEeehhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 999999999999999999998876532 12333344557788899998888888643110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q 
Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) ..+..+|.+++. ..++.....+..|+||++.+..|+.. .+..+...+.+ ++.+...-.+.|.|++ T Consensus 240 ----------~~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~l-kd~~G~~l~~~-~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQS-DPTQKNKKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHh-hccCCCeEeec-CccCCccccccCcccE Confidence 000122333222 13444555668899999999999864 55555443322 3333333467888655 Q ss_pred E-EEecc-------CCCcccC Q lcl|Aclame:pro 319 R-TDALL-------LTEARVV 331 (331) Q Consensus 319 ~-~dai~-------~tE~~Vv 331 (331) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 5 33321 1222222 No 88 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.25 E-value=4.2e-07 Score=55.52 Aligned_cols=210 Identities=11% Similarity=0.036 Sum_probs=127.4 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCcc-ceeEEEeecCCcceeecCCccCcc-cc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTE-HKTTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~-~~~~~~~~lP~~~fR~lN~g~~~s-~~ 78 (331) |... |-.+ ..-+-+. .+...||+.+.+.++|++.+......++.+ +.+.+.++-++++|..=++..+++ .. 
T Consensus 106 ~~~~-----t~~~-gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGL-----TGED-GGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred cccc-----ccCC-Cceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 1000 0001122 244679999999999999988877654433 345556777889999988898876 57 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-..+-+++.+.|.+.+.+... -++...-.....+++++.....|++|+.+..+ T Consensus 179 ~~~~v~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------------------- 237 (392) T protein:vir:10 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------------------- 237 (392) T ss_pred cceeEEeeeeeEEEeehhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Confidence 999999999999999999998876532 12333344557788899998888888643110 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) ++ T Consensus 238 ------------------------------~~------------------------------------------------ 239 (392) T protein:vir:10 238 ------------------------------QA------------------------------------------------ 239 (392) T ss_pred ------------------------------cC------------------------------------------------ Confidence 00 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEE Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) Q 
Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir 318 (331) ..+..+|.+++. ..++.....+..|+||++.+..|+.. .+..+...+.+ ++.+...-.+.|.|++ T Consensus 240 ----------~~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~l-kd~~G~~l~~~-~~~~~~~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQS-DPTQKNKKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHh-hccCCCeEeec-CccCCccccccCcccE Confidence 000122333222 13444555668899999999999864 55555443322 3333333467888655 Q ss_pred E-EEecc-------CCCcccC Q lcl|Aclame:pro 319 R-TDALL-------LTEARVV 331 (331) Q Consensus 319 ~-~dai~-------~tE~~Vv 331 (331) . +|+.. .++..++ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~ 325 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLI 325 (392) T ss_pred EEecccccCCCcccCCceEEE Confidence 5 33321 1222222 No 89 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.22 E-value=2.7e-07 Score=56.61 Aligned_cols=244 Identities=13% Similarity=0.137 Sum_probs=130.4 Q ss_pred CCcCccccccHHH------HHHh--------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcce Q lcl|Aclame:pro 1 MPTLSTTNPTLAD------VAAR--------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW 66 (331) Q Consensus 1 M~~l~~~a~TL~E------~Ak~--------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~f 66 (331) ...-....+|=.| +.+. +-+. .+...|+|.+.+.+||+..+.+.... 
..++..+.++-|+++| T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~w 135 (377) T protein:vir:96 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPE-ETMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAVW 135 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhcCCCCCCceecCH-HHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEecCCcceeE Confidence 0000011111111 1010 1111 24457999999999999999887653 3467888888999999 Q ss_pred eecCCccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcC Q lcl|Aclame:pro 67 RKLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMG 144 (331) Q Consensus 67 R~lN~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~G 144 (331) ...++..++ +..++.+++-.++-|.+.+.|.+.|.+..+ +..+|-..+ ..++++....+.||+||-...|. | T Consensus 136 v~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~---l~~~~~~~~~~a~i~G~G~~~P~---G 209 (377) T protein:vir:96 136 GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQ---LKEAIAVALELAIVKGNGLLQPV---G 209 (377) T ss_pred eecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHH---HHHHHHHHHhhceEeccCCCcce---e Confidence 999988865 678999999999999999999999988766 566665544 78999999999999999765555 4 Q ss_pred chhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeee Q lcl|Aclame:pro 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDI 224 (331) Q Consensus 145 L~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~ 224 (331) |-+.. +. . ..+. .++. ...++|+..... 
T Consensus 210 il~~~---~~----~-----~~~~--------~~~~-~~~~~~~~~~~~------------------------------- 237 (377) T protein:vir:96 210 LLKDL---SQ----P-----TVDQ--------STGR-DITTYKTDKEAI------------------------------- 237 (377) T ss_pred eeecc---cc----c-----cccc--------cccc-cccceeeccccc------------------------------- Confidence 42211 10 0 0000 0111 111122211000 Q ss_pred ceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHH----hhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeee Q lcl|Aclame:pro 225 GLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVEL----IPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLT 300 (331) Q Consensus 225 Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~----ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~ 300 (331) .-+..+ ++.+..+|..-|..++.. .|+...++.+|+||+.-...+..+ +. T Consensus 238 ----------~~~~~~-------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~---------~~ 291 (377) T protein:vir:96 238 ----------ADLSDL-------DPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK---------FT 291 (377) T ss_pred ----------cccccC-------ChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccc---------cc Confidence 000011 122222333334444432 234445778999998754333211 11 Q ss_pred ecccCCceeEEEc-CeEEEEEEec-----------------------------cCCCcccC Q lcl|Aclame:pro 301 MEEIAGKKVVAFD-GIPCRRTDAL-----------------------------LLTEARVV 331 (331) Q Consensus 301 ~~~~~g~~v~~~~-gvpir~~dai-----------------------------~~tE~~Vv 331 (331) .....|..++-+. 
|++|..++++ ...+..++ T Consensus 292 ~~~~~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:96 292 SRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) T ss_pred ccCCCCCceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeE Confidence 1123344443221 3333333333 22222222 No 90 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.21 E-value=5.7e-07 Score=54.83 Aligned_cols=213 Identities=13% Similarity=0.143 Sum_probs=120.3 Q ss_pred CCcC-----ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcce--eecCCcc Q lcl|Aclame:pro 1 MPTL-----STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW--RKLNYGV 73 (331) Q Consensus 1 M~~l-----~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~f--R~lN~g~ 73 (331) |... ....+|..+-. -+-+. .+...||+.+.+.++|++.+.++...++ ...|.+.+.-+..+| ..=+... T Consensus 104 ~~~~~~~~~~ra~~t~~~gg-~liP~-~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~~~ 180 (421) T protein:vir:13 104 IRGIQLSEEERDIMSSTNNG-AVIPQ-EFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKDTEL 180 (421) T ss_pred hhccchhHHHhhccccCCcc-eecch-hhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeeccccccc Confidence 1000 00011111100 01122 2446799999999999999998876554 456777777766555 5556678 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc Q lcl|Aclame:pro 74 QPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) Q Consensus 74 ~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~ 153 (331) +++..++.+++-..+-+.+.+.|.+.+.+... 
.++...-.....+++.......+++ T Consensus 181 ~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~--~~l~~~i~~~la~~~~~~~~~~i~~--------------------- 237 (421) T protein:vir:13 181 VKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE--INFLEFVNEEFAEFAVNTENAEIVK--------------------- 237 (421) T ss_pred cccccceeEEEeeeeeeEeehhhhHHHHhhhH--HHHHHHHHHHHHHHHHHHhhhhHhh--------------------- Confidence 88889999999999999999999998876543 1232222222334444322211110 Q ss_pred ccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 154 ~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) -|+|..+ T Consensus 238 ---------------------------------~~~g~~~---------------------------------------- 244 (421) T protein:vir:13 238 ---------------------------------QAKAVLA---------------------------------------- 244 (421) T ss_pred ---------------------------------hhhhccc---------------------------------------- Confidence 0111000 Q ss_pred eeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEc Q lcl|Aclame:pro 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFD 313 (331) Q Consensus 234 v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~ 313 (331) ...... .+-++++++.+......+..|+||++....|+.. .+......+ .. ...-..-.++ T Consensus 245 ------------~~~~~~----~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~l-kd~~G~~i~-~~-~~~~~~~tl~ 305 (421) T protein:vir:13 245 ------------EETIND----YAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGL-MDKQGRPLL-KE-LSDGGDLVFK 305 (421) T ss_pred ------------cccccc----hHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHh-hcCCCceee-cC-cCCCCCceec Confidence 000001 1224455555555555568999999999999865 455544333 22 2222235799 Q ss_pred CeEEEEEEeccCC---CcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLT---EARVV 331 (331) Q Consensus 314 gvpir~~dai~~t---E~~Vv 331 (331) |.||..+|++... 
...++ T Consensus 306 G~pV~~~~~~~~~~~~~~~~~ 326 (421) T protein:vir:13 306 GRPVIELEESIFDVGDETKFI 326 (421) T ss_pred ceeeEEeccccccCCCceEEE Confidence 9999999987532 12222 No 91 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.18 E-value=1.6e-07 Score=57.85 Aligned_cols=250 Identities=14% Similarity=0.156 Sum_probs=135.7 Q ss_pred CC--------c---CccccccHHHHHH---------hcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEee Q lcl|Aclame:pro 1 MP--------T---LSTTNPTLADVAA---------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG 60 (331) Q Consensus 1 M~--------~---l~~~a~TL~E~Ak---------~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~ 60 (331) |. . .....-.+.+.+. .+-++ .+...|++.+.+.++|+........+ + ..++.+.+. T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~-~~~~~i~~~l~~~~~l~~~~~v~~~~-g-~~~~~~~~~ 199 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPD-VMLELLRDNMHRYSKLISKVRLRPLK-G-TARQNIAGA 199 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccH-HHHHHHHHhhhhhhhhhhheeeeecC-c-eeEeeeecC Confidence 00 0 0000000000000 01122 23455889999999999988876654 2 356778889 Q ss_pred cCCcceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCh Q lcl|Aclame:pro 61 LPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDA 139 (331) Q Consensus 61 lP~~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~ 139 (331) .|.+.|..-++.++++..++.+++-.++-+.+.+.|-+.+.+..+ +..++-. 
....++++.....+||+||-...| T Consensus 200 ~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~---~~la~~~~~~~~~ail~G~G~~~P 276 (466) T protein:vir:80 200 IPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEIL---DAIGQAIGFALDKAILYGTGTKMP 276 (466) T ss_pred CcceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHH---HHHHHHHHHHHhhheeeccCCCCc Confidence 999999999999999999999999999999999999999988765 4444433 457899999999999999866555 Q ss_pred hhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEE Q lcl|Aclame:pro 140 EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTH 219 (331) Q Consensus 140 ~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~ 219 (331) . ||-... . +.+....++.-.-.+-. +...++ T Consensus 277 ~---Gil~~~---~----------~~~~~~~~~~~~~~~~~--------------~~~~~~------------------- 307 (466) T protein:vir:80 277 V---GIVTRL---A----------QTTQPPNWGTKAPAWTN--------------LSTTNL------------------- 307 (466) T ss_pred c---eeeecc---c----------ccccccccccccccccc--------------cchhhh------------------- Confidence 4 442211 1 01110000000000000 000000 Q ss_pred EEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceee Q lcl|Aclame:pro 220 YKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTL 299 (331) Q Consensus 220 ~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~ 299 (331) .+++ ... 
..+...+.+++.....+..+...+..+|.||......|..........-.+ T Consensus 308 ------------------~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~ 365 (466) T protein:vir:80 308 ------------------LKID--PTG--KSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGAL 365 (466) T ss_pred ------------------hhhh--hhc--cchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccc Confidence 0000 000 111122334433333334455667788999999888775332111110001 Q ss_pred eecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 300 TMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 300 ~~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) -.....+ ..+.|.||..++++-..+ .++ T Consensus 366 ~~~~~~~---~~i~G~pvv~s~~~~~~~-~~~ 393 (466) T protein:vir:80 366 VASLNNT---MPIVGGDIVILDFIPDND-IIG 393 (466) T ss_pred cccCCCc---ccccccceeecCccCccc-eee Confidence 0000011 126788998888774322 122 No 92 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.16 E-value=9.3e-07 Score=53.65 Aligned_cols=214 Identities=10% Similarity=0.001 Sum_probs=122.2 Q ss_pred CCc---CccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEE-eecCCcceeecCCccCc- Q lcl|Aclame:pro 1 MPT---LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR-SGLPTGTWRKLNYGVQP- 75 (331) Q Consensus 1 M~~---l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~-~~lP~~~fR~lN~g~~~- 75 (331) +-. -.....|..+-. -+-+. .+...||+.+.+.++|++.++.....++.+ .+.+. 
.+-+.+.|..=+...++ T Consensus 103 ~~~~~~~~~~~~t~~~gg-~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~E~~~~~~~ 179 (394) T protein:vir:10 103 HGKVIDNAAGHVTSTEAG-VLIPE-EIIYDPTAEVNSVVDLSTLVTKTPVTTPKG-TYPILKRATDRFSSVAELAENPAL 179 (394) T ss_pred cchhhhhhhcccccccCc-eeccH-HHHHHHHHHHHhhhhhhhhceeeeccCCce-EEEEEecCCCcccccccccccccc Confidence 000 000001111100 01122 245679999999999999998877655543 34333 34467778777777775 Q ss_pred ccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccc Q lcl|Aclame:pro 76 EKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) Q Consensus 76 s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~ 154 (331) +..++.+++-..+-+++.+.|-+.+.+... +..++ -.....++++......+++|+.... T Consensus 180 ~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~la~~~~~~~~~~il~g~g~~~---------------- 240 (394) T protein:vir:10 180 AEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSL---VGQSINEKSVNTYNAMIAPVLQSFT---------------- 240 (394) T ss_pred ccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHH---HHHHHHHHHHHHHHHHHhhcccccc---------------- Confidence 668999999999999999999999888653 33333 3344677888888888887752110 Q ss_pred cccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccce Q lcl|Aclame:pro 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) Q Consensus 155 ~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v 234 (331) |++. 
T Consensus 241 ---------------------------------~~~~------------------------------------------- 244 (394) T protein:vir:10 241 ---------------------------------AKAT------------------------------------------- 244 (394) T ss_pred ---------------------------------cccc------------------------------------------- Confidence 0000 Q ss_pred eeeeccccccCCCCccchhhHHHHHHHHHHH-hhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeec---ccCCceeE Q lcl|Aclame:pro 235 VRIANVDVSELTKNASAGADLIDLMTQAVEL-IPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTME---EIAGKKVV 310 (331) Q Consensus 235 ~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~-ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~---~~~g~~v~ 310 (331) + ...+..+|. ++++. ++. ..+.+|+||++....|+.. .+....+.+.+. ...+...- T Consensus 245 -----------~-~~~~~d~l~----~~~~~~~~~--~~~a~~vmn~~~~~~l~~l-kd~~G~~i~~~~~~~~~~~~~~~ 305 (394) T protein:vir:10 245 -----------T-TDTLVDSLK----HILNVDLDP--AYSRALVVTQSLFNTLDTL-KDKNGRYLLHDASDSITDGTAKG 305 (394) T ss_pred -----------c-ccccHHHHH----HHHHhhhhh--hccCEEEecHHHHHHHHHh-hccCCCeeeeccccccccCCccc Confidence 0 001111222 22221 221 2347899999999999975 344443333222 11223334 Q ss_pred EEcCeEEEEEEec-cCC---CcccC Q lcl|Aclame:pro 311 AFDGIPCRRTDAL-LLT---EARVV 331 (331) Q Consensus 311 ~~~gvpir~~dai-~~t---E~~Vv 331 (331) .+.|+||+.+|+. 
+.+ +..++ T Consensus 306 ~L~G~PV~~~~~~~~~~~~~~~~i~ 330 (394) T protein:vir:10 306 TVLGVPVYVVGDALLGSAAGDQKAF 330 (394) T ss_pred ccccceeEEecccccCCCCCceEEE Confidence 6899999988753 332 22233 No 93 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.16 E-value=3.9e-07 Score=55.74 Aligned_cols=215 Identities=11% Similarity=0.042 Sum_probs=124.2 Q ss_pred CCcC----------ccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEE-eecCCcceeec Q lcl|Aclame:pro 1 MPTL----------STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR-SGLPTGTWRKL 69 (331) Q Consensus 1 M~~l----------~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~-~~lP~~~fR~l 69 (331) +..+ .....|..+-. .+-+. .+ ..+|..+.+.++|...+......++. ..+.++ .+-+.++|..= T Consensus 141 ~~~~~~~~~~~e~~~~~~~~~~~~g-~lvp~-~~-~~~i~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e 216 (437) T protein:vir:10 141 VTAFADYLKTGEVRDVTGIALKDGK-VIIPE-TI-LTPEKEVHQFPRLGSLVRTESVTTTT-GKLPIFNNSTDLLTAHTE 216 (437) T ss_pred hhhhHHHHHhhhhhhhhhccccccc-ccchH-HH-HHHHHHhhhhhhhhhcceeEeeccCc-eeeEEeeccccccccccc Confidence 0000 00000111100 01111 12 23455667888888888777665553 445555 45578889888 Q ss_pred CCccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 70 NYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 70 N~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) ++..++ +..++.+++-..+-+++.+.|-+.+.+...- ++...-.....++++......|++|+.+.. 
T Consensus 217 ~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~--~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~---------- 284 (437) T protein:vir:10 217 YGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSY--DWQAELQSRLIELRDNTDDSLIITALTDGI---------- 284 (437) T ss_pred cccccccccccceeeeeehhheeeehhhhHHHHhhhHH--HHHHHHHHHHHHHHHHHHHHHHhhhhcccc---------- Confidence 888875 5579999999999999999999988776441 233333445778888888899998862100 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) | T Consensus 285 ---------------------------------------~---------------------------------------- 285 (437) T protein:vir:10 285 ---------------------------------------K---------------------------------------- 285 (437) T ss_pred ---------------------------------------c---------------------------------------- Confidence 0 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) ....+.+..+|.+++.- .++..-..+.+|+||++....|+.. .+..+.+.+.+ ++.+-. T Consensus 286 ----------------~~~~~~~~~~~~~~~~~---~l~~~~~~~~~~~~~~~~~~~l~~l-kd~~g~~~~~~-~~~~~~ 344 (437) T protein:vir:10 286 ----------------KTTSTYLLGDLKKVLNV---TLKPQDSAAASIVMSQSAYNLFDMA-TDAMGRPLLQP-NVTAAT 344 (437) T ss_pred ----------------ccccccchhhHHHHHHh---hhhhhhhcCCEEEEcHHHHHHHHHh-hccCCCeeecc-CccCCC Confidence 00001111233333221 3444445567999999999999875 44544443433 333333 Q ss_pred eEEEcCeEEEEEEecc--C---CCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALL--L---TEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~--~---tE~~Vv 331 (331) .-.+.|.||..+++.. 
+ +...++ T Consensus 345 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 372 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFPSASAGDVNIV 372 (437) T ss_pred CcccccceeEEecccccCCcCCCceEEE Confidence 4579999999998642 1 122223 No 94 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.10 E-value=1.3e-06 Score=52.91 Aligned_cols=226 Identities=12% Similarity=0.136 Sum_probs=129.5 Q ss_pred CCcCccccc-cHHHHHH-h-------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCC Q lcl|Aclame:pro 1 MPTLSTTNP-TLADVAA-R-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNY 71 (331) Q Consensus 1 M~~l~~~a~-TL~E~Ak-~-------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~ 71 (331) +........ ...++.. . +-+. .+...|++.+.+.++|++.......+ ..+.+.+.++.|.+.|.+=++ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~-~~~~~Ii~~l~~~~~i~~~~~~~~~~--g~~~ip~~~~~~~a~~v~E~~ 199 (425) T protein:vir:95 123 EYYKRSEVVEFYEKFRNLRAVAGGELTIPE-VVVNRIMDIMGDYTTLYPLVDKIRVK--GTTRILVDTDTSPATWIEQSG 199 (425) T ss_pred hhhhhhHHHHHHHHHHhhcccccCceeccH-HHHHHHHHHHHhhhhHHHhhceeecC--ceeEEEEecCCcccccccccc Confidence 000000000 0000000 0 1122 24556999999999999998877653 247788999999999999999 Q ss_pred ccCccc-ceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhh Q lcl|Aclame:pro 72 GVQPEK-SRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRF 149 (331) Q Consensus 72 g~~~s~-~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~ 149 (331) .++++. .++.+++-..+-+.+.+.|-+.+.+... +..++ -.....++++.+....||+||-..+ .++.|+-+.. 
T Consensus 200 ~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~---i~~~l~~~i~~~~d~~il~G~G~~~-~~p~Gil~~~ 275 (425) T protein:vir:95 200 ALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDY---VTKKIARAIAKALDLAIVKGTGAAN-KQPLGIIPSL 275 (425) T ss_pred ccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHH---HHHHHHHHHHHHHHHHhhccCCCCc-cccceeeccc Confidence 998877 5899999999999999999999888765 44444 3455789999999999999973211 1222221100 Q ss_pred ccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEe Q lcl|Aclame:pro 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLR 229 (331) Q Consensus 150 ~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~ 229 (331) + +.. T Consensus 276 ---~-------------~~~------------------------------------------------------------ 279 (425) T protein:vir:95 276 ---P-------------PEN------------------------------------------------------------ 279 (425) T ss_pred ---c-------------ccc------------------------------------------------------------ Confidence 0 000 Q ss_pred cccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhc--CCCCceEEEeCHH-HHHHH---HHHhhcCCcceeeeecc Q lcl|Aclame:pro 230 DWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPN--VGMGRPAFYMPRK-IRSFL---RRQITNKVAASTLTMEE 303 (331) Q Consensus 230 d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~--~~~g~~~~y~n~~-v~~~L---~~~~~~~~~~~~~~~~~ 303 (331) ++. ...++. ..+-+.+++..+.. ...+..+|+||+. ....| +.+ .++.+.. +.... T Consensus 280 ---------~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~-kd~~g~~-i~~~~ 341 (425) T protein:vir:95 280 ---------QVT---VEADNN----LLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQ-VDSNGNV-VGKLP 341 (425) T ss_pred ---------ccc---cccccc----hHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhh-cCCCCce-eeccC Confidence 000 000011 12223334443332 2345678999965 34433 322 2333332 22211 Q ss_pred cCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 304 IAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 304 ~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) .+. .-.+.|.||..+|++-... 
|+ T Consensus 342 -~~~-~~~l~G~pvv~~~~~~~~~--i~ 365 (425) T protein:vir:95 342 -NLR-TPDLLGLRVVFNNFLDDDT--VL 365 (425) T ss_pred -CCC-CccccceeeEEcCcCCCcc--EE Confidence 222 2247799999999996442 33 No 95 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=98.07 E-value=1.6e-06 Score=52.36 Aligned_cols=211 Identities=9% Similarity=-0.017 Sum_probs=120.9 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEe-ecCCcceeecCCccCc-ccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRS-GLPTGTWRKLNYGVQP-EKS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~-~lP~~~fR~lN~g~~~-s~~ 78 (331) |... |..+-. -+-+. .+...|++.+.+.++|++.++.+...++.+ .|.+.. +-..+.|..=+...++ +.. T Consensus 109 ~~~~-----t~~~gg-~~vP~-~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~E~~~~~~~~~~ 180 (389) T protein:vir:10 109 TSKV-----TSTEAG-VLIPE-EIIYDPTAEVNSVVDLSTLVTKTPVTTPKG-TYPILKRATDRFSSVAELAENPKLAEP 180 (389) T ss_pred hccc-----ccCCcc-eeehH-HHHHHHHHHHHhhhhHHhhcceeeccCCee-EEEEEecCCCccccccccccccccccc Confidence 2211 111100 01111 234679999999999999999887765543 344443 3345567666666774 788 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~ 158 (331) ++.+++-.++-+.+.+.|-+.+.+...- ++...-.....++++......|++|+.... 
T Consensus 181 ~~~~i~~~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-------------------- 238 (389) T protein:vir:10 181 EFNKVDWSVATYRGAIPLSEEAIADSAV--DLTALVGQSIKEKSVNTYNAMIAPVLQSFT-------------------- 238 (389) T ss_pred cceeeeeeheeeEeeehhhHHHHhhhhH--HHHHHHHHHHHHHHHHHHHHHHhhhhcccc-------------------- Confidence 9999999999999999999998876432 233334445677888777777776642100 Q ss_pred eeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeeee Q lcl|Aclame:pro 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) Q Consensus 159 ~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~ 238 (331) |++ T Consensus 239 -----------------------------~~~------------------------------------------------ 241 (389) T protein:vir:10 239 -----------------------------AKK------------------------------------------------ 241 (389) T ss_pred -----------------------------ccc------------------------------------------------ Confidence 000 Q ss_pred ccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeec---ccCCceeEEEcCe Q lcl|Aclame:pro 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTME---EIAGKKVVAFDGI 315 (331) Q Consensus 239 NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~---~~~g~~v~~~~gv 315 (331) . ....+-.+|.+++.. .++. ..+.+|+||++....|+.. .+..+.+.+.+. ...+...-.++|+ T Consensus 242 ------~-~~~~~~d~l~~~~~~---~~~~--~~~a~~~~n~~~~~~L~~l-kd~~G~~i~~~~~~~~~~~~~~~~l~G~ 308 (389) T protein:vir:10 242 ------T-TTDTLVDSLKHILNV---DLDP--AYSRALVVTQSLFNTLDTL-KDKNGRYLLHDASDSITDGTAKGTILGV 308 (389) T ss_pred ------c-cccccHHHHHHHHHh---hhhh--hhCcEEEecHHHHHHHHHh-hccCCCeeeecCcccccccccccccccc Confidence 0 001111223333321 2222 2247899999999999975 344433333221 1122333479999 Q ss_pred EEEEEEec-cCC---CcccC Q lcl|Aclame:pro 316 PCRRTDAL-LLT---EARVV 331 (331) Q Consensus 316 pir~~dai-~~t---E~~Vv 331 (331) ||+.+++. 
..+ +..++ T Consensus 309 pV~~~~~~~~~~~~~~~~~~ 328 (389) T protein:vir:10 309 PVYVVGDTLLGSLAGDQKAF 328 (389) T ss_pred eeEEecccccCCCCCceEEE Confidence 99987653 322 22233 No 96 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.06 E-value=1.6e-06 Score=52.41 Aligned_cols=211 Identities=12% Similarity=-0.039 Sum_probs=123.7 Q ss_pred CCcCc--c-------ccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEe-ecCCcceeecC Q lcl|Aclame:pro 1 MPTLS--T-------TNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRS-GLPTGTWRKLN 70 (331) Q Consensus 1 M~~l~--~-------~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~-~lP~~~fR~lN 70 (331) ..... . ...|..+- .-+-+. .+...||+.+.+.+.|++.+++....++. ..+.+.. +-+.++|..=+ T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~g-g~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~ 196 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAADA-ASTIPE-TISNTPQRELQTVVDLKPFTNVFQASTQK-GTYPTVANATTKMVTVAEL 196 (400) T ss_pred hhhhhhhHHHHHHHhhcccccCC-cccccH-HHHHHHHHHHHhhhhhhhcceeEeccCcc-eEEEEEecCCCcccccccc Confidence 00000 0 00011100 001111 23567999999999999999988765543 3455544 44667887767 Q ss_pred CccCc-ccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhh Q lcl|Aclame:pro 71 YGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) Q Consensus 71 ~g~~~-s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R 148 (331) ...++ +..++.+++...+-+++.+.|.+.+.+... +..++- .....+++.......+++|..+. 
T Consensus 197 ~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i---~~~l~~~~~~~~~~~i~~~~~~~----------- 262 (400) T protein:vir:38 197 EKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLI---AQNGQQIKVNTTNGAVATLLKGF----------- 262 (400) T ss_pred ccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHhhhhccccc----------- Confidence 77775 578999999999999999999998887643 344443 34467788888888888774210 Q ss_pred hccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 149 ~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) +. .+ T Consensus 263 -----------------~~---------------------~~-------------------------------------- 266 (400) T protein:vir:38 263 -----------------TA---------------------KT-------------------------------------- 266 (400) T ss_pred -----------------cc---------------------cc-------------------------------------- Confidence 00 00 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) ..+..+|.+ ++.... ....+.+|+||++....|+.. .+..+.+.+.+ ++.+.. T Consensus 267 --------------------~~~~~~~~~----~~~~~~-~~~~~a~~v~~~~~~~~l~~l-kd~~G~~i~~~-~~~~~~ 319 (400) T protein:vir:38 267 --------------------ISSVDDLKH----INNVDL-DPAYSRVIIASQSFYNFLDTV-KDGNGRYLLQD-SILTPS 319 (400) T ss_pred --------------------cccHHHHHH----HHHhhh-hhhhCcEEEEcHHHHHHHHHh-hccCCCeeeec-CcCCCC Confidence 000111222 222111 112247899999999999864 45554443333 444444 Q ss_pred eEEEcCeEEEEEEecc-CCCcc--cC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALL-LTEAR--VV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~-~tE~~--Vv 331 (331) .-.+.|.||+.+|++. .+... 
++ T Consensus 320 ~~~l~G~pv~~~~~~~~~~~g~~~~~ 345 (400) T protein:vir:38 320 GKSVLGMPIAVVSDDTLGAAGEAHAF 345 (400) T ss_pred ccccccceeEEecccccCCCCceEEE Confidence 4579999999999874 22222 22 No 97 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=97.98 E-value=1.6e-06 Score=52.36 Aligned_cols=215 Identities=14% Similarity=0.171 Sum_probs=122.4 Q ss_pred CCcC--ccccccHHHHHHh-----------cCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCccee Q lcl|Aclame:pro 1 MPTL--STTNPTLADVAAR-----------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) Q Consensus 1 M~~l--~~~a~TL~E~Ak~-----------~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR 67 (331) |... ........+..+. +-+. .+...||+.+.+.++|.+...+....+. ....+..+.+++.|. T Consensus 64 ~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~-~~~~~Ii~~l~~~s~l~~~~~v~~~~~~--~~p~~~~~~~~a~~v 140 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK-TLSKEIVSEPFAKNQLREKARLTNIKGL--EIPRVSYTLDDDDFI 140 (352) T ss_pred hhhhHHHHHHhhHHHHHHHhccCCCCCCceeccH-hHHHHHHHHHHhhcchhhheeeEecCCc--eEEEEecCCCccccc Confidence 0000 0000000000000 1122 2456799999999999998887665322 222234456889999 Q ss_pred ecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHH-HHHhhccCCCCCChhhhcCc Q lcl|Aclame:pro 68 KLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQT-QATTLFYGDSSIDAEKFMGL 145 (331) Q Consensus 68 ~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~-f~~~~iyGD~~~~~~~F~GL 145 (331) .=++..++++.++.+++-..+-+++.+.|.+.+.+.. .|..++-..+ ..++++.. .+..|..|+. 
T Consensus 141 ~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~g---------- 207 (352) T protein:vir:78 141 TDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSPK---------- 207 (352) T ss_pred ccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHH---HHHHHHHHHHHhhhhcCCC---------- Confidence 9999999999999999999999999999999987764 4666665544 44455543 2223333321 Q ss_pred hhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeec Q lcl|Aclame:pro 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) Q Consensus 146 ~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~G 225 (331) +|....+ ++-+ ++. T Consensus 208 --------------------~~~~~g~-------------l~~~----~~~----------------------------- 221 (352) T protein:vir:78 208 --------------------SGLEHMS-------------FYNG----SVK----------------------------- 221 (352) T ss_pred --------------------Ccccccc-------------eecc----ccc----------------------------- Confidence 1100000 0000 000 Q ss_pred eEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccC Q lcl|Aclame:pro 226 LTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIA 305 (331) Q Consensus 226 l~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~ 305 (331) . .++.++.|-++++++.++.....+.+|+||+.....|.....+..+. .+ . 
T Consensus 222 ------------------~-----~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~-~~-----~ 272 (352) T protein:vir:78 222 ------------------E-----VEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTN-FF-----D 272 (352) T ss_pred ------------------c-----ccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCc-cc-----c Confidence 0 01122345567777778766677789999998887776554443332 22 1 Q ss_pred CceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 306 GKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 306 g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) |.+ ..+.|.||..+|..- .++ T Consensus 273 ~~~-~~llG~PV~~~~~~~----~~~ 293 (352) T protein:vir:78 273 TPA-EKVFGKPVVFTDAAV----KPI 293 (352) T ss_pred cCC-ccccccceEEecCCC----cee Confidence 222 246699999988542 233 No 98 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.89 E-value=1.3e-06 Score=52.86 Aligned_cols=236 Identities=14% Similarity=0.118 Sum_probs=125.0 Q ss_pred CCcCccccccHHH------HHHhcCCc------cchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLAD------VAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E------~Ak~~~~~------~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~ 68 (331) +.......+|-.| +.+..+.+ ..+...|+|.+.+.|+|+..+.+.... ......+.++-|.++|-. 
T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~w~~ 134 (381) T protein:vir:95 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC--cceEEEEecCCcceeeec Confidence 1111111222222 11111111 234567999999999999999887653 346778888899999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCch Q lcl|Aclame:pro 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) Q Consensus 69 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~ 146 (331) .+++++ ++..++.+++-..+-|.+.+.|-+.|.+... +.++|-.. ...++++....+.|++||-...|.++ - T Consensus 135 e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~---~la~~~a~~~~~a~i~G~G~~qP~Gi---l 208 (381) T protein:vir:95 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV---QIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) T ss_pred ccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHH---HHHHHHHHHhhheeEeccCCCCceee---e Confidence 999887 4578999999999999999999999987754 55555444 47789999999999999876555443 2 Q ss_pred hhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeece Q lcl|Aclame:pro 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) Q Consensus 147 ~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl 226 (331) +-. + +.. 
....|.+|.....| ++.+.+ T Consensus 209 ~~~---~-----------~~~-------------~~~~g~~~~~~~~~---------t~t~~~----------------- 235 (381) T protein:vir:95 209 RQV---Q-----------KGV-------------SVTEGAYPEKEEQG---------TLTFAN----------------- 235 (381) T ss_pred ecc---C-----------ccc-------------cccccccccccccc---------cccccc----------------- Confidence 211 0 000 00111111110000 000000 Q ss_pred EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhh-------cCCCCceEEEeCHHHHHHHHHHhhcCCcceee Q lcl|Aclame:pro 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP-------NVGMGRPAFYMPRKIRSFLRRQITNKVAASTL 299 (331) Q Consensus 227 ~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip-------~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~ 299 (331) . ..+++.|...+..++ ..-.++.+|.||++-...|+.+.... T Consensus 236 ----------------------~---~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~------ 284 (381) T protein:vir:95 236 ----------------------P---RATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL------ 284 (381) T ss_pred ----------------------c---hhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccC------ Confidence 0 011122222222221 12234556667765544444221110 Q ss_pred eecccCCcee------------------------------EEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 300 TMEEIAGKKV------------------------------VAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 300 ~~~~~~g~~v------------------------------~~~~gvpir~~dai~~tE~~Vv 331 (331) +..|..+ ....|+-|+++|.....+..+. 
T Consensus 285 ---~~~G~~v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~ 343 (381) T protein:vir:95 285 ---NANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDL 343 (381) T ss_pred ---CCCCceeecCCCCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeE Confidence 1122222 2223344444444443333333 No 99 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.89 E-value=1.3e-06 Score=52.86 Aligned_cols=236 Identities=14% Similarity=0.118 Sum_probs=125.0 Q ss_pred CCcCccccccHHH------HHHhcCCc------cchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLAD------VAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E------~Ak~~~~~------~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~ 68 (331) +.......+|-.| +.+..+.+ ..+...|+|.+.+.|+|+..+.+.... ......+.++-|.++|-. T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~w~~ 134 (381) T protein:vir:10 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC--cceEEEEecCCcceeeec Confidence 1111111222222 11111111 234567999999999999999887653 346778888899999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCch Q lcl|Aclame:pro 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) Q Consensus 69 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~ 146 (331) .+++++ ++..++.+++-..+-|.+.+.|-+.|.+... +.++|-.. 
...++++....+.|++||-...|.++ - T Consensus 135 e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~---~la~~~a~~~~~a~i~G~G~~qP~Gi---l 208 (381) T protein:vir:10 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV---QIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) T ss_pred ccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHH---HHHHHHHHHhhheeEeccCCCCceee---e Confidence 999887 4578999999999999999999999987754 55555444 47789999999999999876555443 2 Q ss_pred hhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeece Q lcl|Aclame:pro 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) Q Consensus 147 ~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl 226 (331) +-. + +.. ....|.+|.....| ++.+.+ T Consensus 209 ~~~---~-----------~~~-------------~~~~g~~~~~~~~~---------t~t~~~----------------- 235 (381) T protein:vir:10 209 RQV---Q-----------KGV-------------SVTEGAYPEKEEQG---------TLTFAN----------------- 235 (381) T ss_pred ecc---C-----------ccc-------------cccccccccccccc---------cccccc----------------- Confidence 211 0 000 00111111110000 000000 Q ss_pred EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhh-------cCCCCceEEEeCHHHHHHHHHHhhcCCcceee Q lcl|Aclame:pro 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP-------NVGMGRPAFYMPRKIRSFLRRQITNKVAASTL 299 (331) Q Consensus 227 ~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip-------~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~ 299 (331) . ..+++.|...+..++ ..-.++.+|.||++-...|+.+.... 
T Consensus 236 ----------------------~---~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~------ 284 (381) T protein:vir:10 236 ----------------------P---RATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL------ 284 (381) T ss_pred ----------------------c---hhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccC------ Confidence 0 011122222222221 12234556667765544444221110 Q ss_pred eecccCCcee------------------------------EEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 300 TMEEIAGKKV------------------------------VAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 300 ~~~~~~g~~v------------------------------~~~~gvpir~~dai~~tE~~Vv 331 (331) +..|..+ ....|+-|+++|.....+..+. T Consensus 285 ---~~~G~~v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~ 343 (381) T protein:vir:10 285 ---NANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDL 343 (381) T ss_pred ---CCCCceeecCCCCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeE Confidence 1122222 2223344444444443333333 No 100 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.85 E-value=4e-07 Score=55.67 Aligned_cols=248 Identities=14% Similarity=0.111 Sum_probs=129.3 Q ss_pred CCcCccccccHHHH------HH-hcCCc------cchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCccee Q lcl|Aclame:pro 1 MPTLSTTNPTLADV------AA-RMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) Q Consensus 1 M~~l~~~a~TL~E~------Ak-~~~~~------~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR 67 (331) +.......+|=.|. .+ ...++ ..+...|+|.+.+.++|+..+.+.... + ..++.+.++-|++.|. 
T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~~~~~~~~~~a~w~ 136 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAVWG 136 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-c-ceEEEEecCCcceeEe Confidence 11000111111111 00 00111 124457999999999999998876653 2 3678888899999999 Q ss_pred ecCCccC-cccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCc Q lcl|Aclame:pro 68 KLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGL 145 (331) Q Consensus 68 ~lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL 145 (331) ...+..+ +++.++.+++-..+-|.+...|.+.|.+..+ |.++|-..+ ..++++......||+||-...|.++. T Consensus 137 ~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~---la~~~a~~~~~a~i~G~G~~qP~Gil-- 211 (377) T protein:vir:98 137 DIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQ---LKEAIAVALELAIVKGDGLLQPVGLL-- 211 (377) T ss_pred ecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHH---HHHHHHHHHhhceEeccCCCcceeee-- Confidence 9988876 4678999999999999999999999987755 566665544 67999999999999999766666553 Q ss_pred hhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeec Q lcl|Aclame:pro 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) Q Consensus 146 ~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~G 225 (331) + ..+. .+..+ .++. ...+.||-. ..++ ++..+. 
T Consensus 212 -~---~~~~----~~~~~-~~~~-------------~~~~~~~~~-------~~~~--~l~~~~---------------- 244 (377) T protein:vir:98 212 -K---DLSQ----PTVDQ-STGR-------------DITTYKTDK-------EAIA--DLSDLT---------------- 244 (377) T ss_pred -e---cccc----ccccc-cccc-------------ccccccchh-------hhHh--hhhhhc---------------- Confidence 1 1100 00000 0000 010111100 0000 000000 Q ss_pred eEEecccceeeeeccccccCCCCccchhhHHHHH-HHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc Q lcl|Aclame:pro 226 LTLRDWRYVVRIANVDVSELTKNASAGADLIDLM-TQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI 304 (331) Q Consensus 226 l~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm-~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~ 304 (331) .. .| .+.+ .-+..-+ ..++. -+....|+++|+||++-.-.+-.+ ++.... T Consensus 245 -~~---~~------------~~~a---~~~m~~~t~~~~~-klkd~~G~~i~~~n~~~~~~~~p~---------~~~~~~ 295 (377) T protein:vir:98 245 -PD---NA------------PKKL---VPVMKHLSVNDKK-RPLKIAGQVKLILNPEDRWALEAQ---------FTSRNQ 295 (377) T ss_pred -hh---HH------------HHHH---HHHHHHHHHHHHh-hhhccCCceEEEecccchhhcccc---------ccccCC Confidence 00 00 0000 1121112 22333 346788899999997643222111 111112 Q ss_pred CCceeE------------------------------EEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 305 AGKKVV------------------------------AFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 305 ~g~~v~------------------------------~~~gvpir~~dai~~tE~~Vv 331 (331) .|..++ ...|+-|.++|.....|..++ T Consensus 296 ~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:98 296 FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) T ss_pred CCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhcCceE Confidence 222222 223444444444444343333 No 101 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=97.83 E-value=3.7e-06 Score=50.37 Aligned_cols=215 Identities=15% Similarity=0.177 Sum_probs=120.9 Q ss_pred 
CCcC-cc-ccccHHHHHH----------hcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE-EeecCCccee Q lcl|Aclame:pro 1 MPTL-ST-TNPTLADVAA----------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV-RSGLPTGTWR 67 (331) Q Consensus 1 M~~l-~~-~a~TL~E~Ak----------~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~-~~~lP~~~fR 67 (331) ...- .. ...+..+.+. -+-+. .+...||+.+.+.++|.+........+ ..+.+ ..+.++++|. T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~-~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v 175 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK-TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFI 175 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCceeech-hHHHHHHHHHHhhchhhhheeeeecCC---ceEEEEeecCCccccc Confidence 0000 00 0000001000 01122 234569999999999988888765432 22332 3456789999 Q ss_pred ecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCCChhhhcCc Q lcl|Aclame:pro 68 KLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQAT-TLFYGDSSIDAEKFMGL 145 (331) Q Consensus 68 ~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~-~~iyGD~~~~~~~F~GL 145 (331) .=++..++++.++.+++-.++-+++.+.|.+.|.+-. .|..+|-..+ +.++++.+-.. .|.+|+ T Consensus 176 ~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~----------- 241 (387) T protein:vir:93 176 TDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP----------- 241 (387) T ss_pred cCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC----------- Confidence 9999999999999999999999999999999887653 4555554433 55566654222 233332 Q ss_pred hhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeec Q lcl|Aclame:pro 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) Q Consensus 146 ~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~G 225 (331) |+|.. 
+|+.- +++ T Consensus 242 -------------------g~g~p--------------~g~l~---~~~------------------------------- 254 (387) T protein:vir:93 242 -------------------KSGLD--------------HMSFY---NGS------------------------------- 254 (387) T ss_pred -------------------Ccccc--------------ceeee---ccc------------------------------- Confidence 11100 01100 000 Q ss_pred eEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccC Q lcl|Aclame:pro 226 LTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIA 305 (331) Q Consensus 226 l~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~ 305 (331) +.. .++.++.+-++++++.++..-..+.+|+||+.....|.....+..... + . T Consensus 255 -----------~~~----------v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~-~-----~ 307 (387) T protein:vir:93 255 -----------VKE----------VEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNF-F-----D 307 (387) T ss_pred -----------ccc----------ccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcc-c-----c Confidence 000 112234566778888887766778899999876655544444443322 2 1 Q ss_pred CceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 306 GKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 306 g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) |.+ ..+.|.||..+|..-. 
.++ T Consensus 308 ~~~-~~llG~PV~~~~~~~~---~~~ 329 (387) T protein:vir:93 308 TPA-EKVFGKPVVFTDAAVK---PIV 329 (387) T ss_pred cCC-ccccccceEEecCCCc---eee Confidence 221 3567999999886421 122 No 102 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=97.80 E-value=3e-06 Score=50.86 Aligned_cols=214 Identities=14% Similarity=0.183 Sum_probs=120.6 Q ss_pred CC----cCccccccHHHHH-H-------hcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE-EeecCCccee Q lcl|Aclame:pro 1 MP----TLSTTNPTLADVA-A-------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV-RSGLPTGTWR 67 (331) Q Consensus 1 M~----~l~~~a~TL~E~A-k-------~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~-~~~lP~~~fR 67 (331) +. ..-........+. . .+-+. .+...||+.+.+.++|.+.+.+....+ ..+.+ ..+.++++|. T Consensus 115 ~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~-~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v 190 (402) T protein:vir:93 115 LPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK-TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFI 190 (402) T ss_pred hhhhHHHHHHhHHHHHhhhccCCCcCCccccch-hHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCccccc Confidence 00 0000000000000 0 01121 134569999999999998888765432 22333 3356788999 Q ss_pred ecCCccCcccceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCCChhhhcCc Q lcl|Aclame:pro 68 KLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQAT-TLFYGDSSIDAEKFMGL 145 (331) Q Consensus 68 ~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~-~~iyGD~~~~~~~F~GL 145 (331) .=++..++++.++.+++-..+-+.+.+.|.+.+.+. ..+..+|-..+ ..++++..-.. 
.|..|+ T Consensus 191 ~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~----------- 256 (402) T protein:vir:93 191 TDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP----------- 256 (402) T ss_pred cccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC----------- Confidence 999999999999999999999999999999987775 34555554443 45555543222 222232 Q ss_pred hhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeec Q lcl|Aclame:pro 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) Q Consensus 146 ~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~G 225 (331) |+|. -+|+.- .++ T Consensus 257 -------------------g~g~--------------p~g~~~---~~~------------------------------- 269 (402) T protein:vir:93 257 -------------------KSGL--------------EHMSFY---NGS------------------------------- 269 (402) T ss_pred -------------------Cccc--------------cceeee---ccc------------------------------- Confidence 1110 011110 000 Q ss_pred eEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccC Q lcl|Aclame:pro 226 LTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIA 305 (331) Q Consensus 226 l~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~ 305 (331) +. +.++.++.+-|+++++.++..-..+.+|+||+.....|.....+..... + . T Consensus 270 -----------~~----------~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~-~-----~ 322 (402) T protein:vir:93 270 -----------VK----------EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNF-F-----D 322 (402) T ss_pred -----------cc----------cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcc-c-----c Confidence 00 0122334566778888887666678899999887666655544443321 2 1 Q ss_pred CceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 306 GKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 306 g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) |.+ ..+.|.||..+|... 
.++ T Consensus 323 ~~~-~~llG~PV~~t~~~~----~i~ 343 (402) T protein:vir:93 323 TPA-EKVFGKPVVFTDAAV----KPI 343 (402) T ss_pred cCC-ccccccceEEecCCC----cee Confidence 221 346799999988642 233 No 103 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=97.75 E-value=7.3e-06 Score=48.74 Aligned_cols=213 Identities=14% Similarity=0.169 Sum_probs=121.1 Q ss_pred CCcCccccccHHHHH--------HhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE-EeecCCcceeecCC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA--------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV-RSGLPTGTWRKLNY 71 (331) Q Consensus 1 M~~l~~~a~TL~E~A--------k~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~-~~~lP~~~fR~lN~ 71 (331) .............+. -.+-+. .+...||+.+.+.++|++.+.+....+ ..+.+ ..+.++++|..=++ T Consensus 104 ~~~~~~~~~~~~~a~~~~~~~~gG~lIP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg~ 179 (387) T protein:vir:94 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPK-TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVE 179 (387) T ss_pred HHHHHHHHHHHHhhhccCCCCCCceeech-hHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCccccccccc Confidence 000000000000000 001121 134569999999999999888765432 22332 33567899999999 Q ss_pred ccCcccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCCChhhhcCchhhh Q lcl|Aclame:pro 72 GVQPEKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQAT-TLFYGDSSIDAEKFMGLTPRF 149 (331) Q Consensus 72 g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~-~~iyGD~~~~~~~F~GL~~R~ 149 (331) ..++++.++.+++-..+-+++.+.|.+.|.+.. .|..+|-..+ ..++++..-.. 
.|.+|+ T Consensus 180 ~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~--------------- 241 (387) T protein:vir:94 180 TAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP--------------- 241 (387) T ss_pred cccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC--------------- Confidence 999999999999999999999999999987764 4555554433 44555543222 222222 Q ss_pred ccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEc-cCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIY-PKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 150 ~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giy-pkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) |+|.. +|+. .++ + T Consensus 242 ---------------g~g~~--------------~g~~~~~~----~--------------------------------- 255 (387) T protein:vir:94 242 ---------------KSGLE--------------HMSFYNGS----V--------------------------------- 255 (387) T ss_pred ---------------Ccccc--------------ceeeeccc----c--------------------------------- Confidence 11110 1111 000 0 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) . ..++.++.+-++++++.++..-..+..||||+.....|.....+.... .+ . |.+ T Consensus 256 ---------~----------~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~-~~-~----~~~ 310 (387) T protein:vir:94 256 ---------K----------EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTN-FF-D----TPA 310 (387) T ss_pred ---------c----------cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCc-cc-c----cCC Confidence 0 011223456677888888776667889999987766665554444332 12 1 222 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) ..+.|.||..+|... 
.+| T Consensus 311 -~~llG~PV~~~~~~~----~~~ 328 (387) T protein:vir:94 311 -EKVFGKPVVFTDAAV----KPI 328 (387) T ss_pred -ccccccceEEecCCC----cee Confidence 346799999998642 233 No 104 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=97.75 E-value=7.3e-06 Score=48.74 Aligned_cols=213 Identities=14% Similarity=0.169 Sum_probs=121.1 Q ss_pred CCcCccccccHHHHH--------HhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE-EeecCCcceeecCC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA--------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV-RSGLPTGTWRKLNY 71 (331) Q Consensus 1 M~~l~~~a~TL~E~A--------k~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~-~~~lP~~~fR~lN~ 71 (331) .............+. -.+-+. .+...||+.+.+.++|++.+.+....+ ..+.+ ..+.++++|..=++ T Consensus 104 ~~~~~~~~~~~~~a~~~~~~~~gG~lIP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg~ 179 (387) T protein:vir:26 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPK-TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVE 179 (387) T ss_pred HHHHHHHHHHHHhhhccCCCCCCceeech-hHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCccccccccc Confidence 000000000000000 001121 134569999999999999888765432 22332 33567899999999 Q ss_pred ccCcccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCCChhhhcCchhhh Q lcl|Aclame:pro 72 GVQPEKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQAT-TLFYGDSSIDAEKFMGLTPRF 149 (331) Q Consensus 72 g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~-~~iyGD~~~~~~~F~GL~~R~ 149 (331) ..++++.++.+++-..+-+++.+.|.+.|.+.. .|..+|-..+ ..++++..-.. 
.|.+|+ T Consensus 180 ~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~--------------- 241 (387) T protein:vir:26 180 TAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP--------------- 241 (387) T ss_pred cccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC--------------- Confidence 999999999999999999999999999987764 4555554433 44555543222 222222 Q ss_pred ccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEc-cCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIY-PKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 150 ~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giy-pkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) |+|.. +|+. .++ + T Consensus 242 ---------------g~g~~--------------~g~~~~~~----~--------------------------------- 255 (387) T protein:vir:26 242 ---------------KSGLE--------------HMSFYNGS----V--------------------------------- 255 (387) T ss_pred ---------------Ccccc--------------ceeeeccc----c--------------------------------- Confidence 11110 1111 000 0 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) . ..++.++.+-++++++.++..-..+..||||+.....|.....+.... .+ . |.+ T Consensus 256 ---------~----------~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~-~~-~----~~~ 310 (387) T protein:vir:26 256 ---------K----------EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTN-FF-D----TPA 310 (387) T ss_pred ---------c----------cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCc-cc-c----cCC Confidence 0 011223456677888888776667889999987766665554444332 12 1 222 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) ..+.|.||..+|... 
.+| T Consensus 311 -~~llG~PV~~~~~~~----~~~ 328 (387) T protein:vir:26 311 -EKVFGKPVVFTDAAV----KPI 328 (387) T ss_pred -ccccccceEEecCCC----cee Confidence 346799999998642 233 No 105 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=97.75 E-value=7.3e-06 Score=48.74 Aligned_cols=213 Identities=14% Similarity=0.169 Sum_probs=121.1 Q ss_pred CCcCccccccHHHHH--------HhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEE-EeecCCcceeecCC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA--------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTV-RSGLPTGTWRKLNY 71 (331) Q Consensus 1 M~~l~~~a~TL~E~A--------k~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~-~~~lP~~~fR~lN~ 71 (331) .............+. -.+-+. .+...||+.+.+.++|++.+.+....+ ..+.+ ..+.++++|..=++ T Consensus 104 ~~~~~~~~~~~~~a~~~~~~~~gG~lIP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg~ 179 (387) T protein:vir:96 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPK-TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVE 179 (387) T ss_pred HHHHHHHHHHHHhhhccCCCCCCceeech-hHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCccccccccc Confidence 000000000000000 001121 134569999999999999888765432 22332 33567899999999 Q ss_pred ccCcccceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHH-hhccCCCCCChhhhcCchhhh Q lcl|Aclame:pro 72 GVQPEKSRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQAT-TLFYGDSSIDAEKFMGLTPRF 149 (331) Q Consensus 72 g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~-~~iyGD~~~~~~~F~GL~~R~ 149 (331) ..++++.++.+++-..+-+++.+.|.+.|.+.. .|..+|-..+ ..++++..-.. 
.|.+|+ T Consensus 180 ~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~---la~~~~~~e~~~~~~~g~--------------- 241 (387) T protein:vir:96 180 TAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENA---LQSGLAAKERKDALAVSP--------------- 241 (387) T ss_pred cccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHH---HHHHHHHHHHHhHhhcCC--------------- Confidence 999999999999999999999999999987764 4555554433 44555543222 222222 Q ss_pred ccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEc-cCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEE Q lcl|Aclame:pro 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIY-PKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) Q Consensus 150 ~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giy-pkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v 228 (331) |+|.. +|+. .++ + T Consensus 242 ---------------g~g~~--------------~g~~~~~~----~--------------------------------- 255 (387) T protein:vir:96 242 ---------------KSGLE--------------HMSFYNGS----V--------------------------------- 255 (387) T ss_pred ---------------Ccccc--------------ceeeeccc----c--------------------------------- Confidence 11110 1111 000 0 Q ss_pred ecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCce Q lcl|Aclame:pro 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) Q Consensus 229 ~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~ 308 (331) . ..++.++.+-++++++.++..-..+..||||+.....|.....+.... .+ . |.+ T Consensus 256 ---------~----------~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~-~~-~----~~~ 310 (387) T protein:vir:96 256 ---------K----------EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTN-FF-D----TPA 310 (387) T ss_pred ---------c----------cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCc-cc-c----cCC Confidence 0 011223456677888888776667889999987766665554444332 12 1 222 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) ..+.|.||..+|... 
.+| T Consensus 311 -~~llG~PV~~~~~~~----~~~ 328 (387) T protein:vir:96 311 -EKVFGKPVVFTDAAV----KPI 328 (387) T ss_pred -ccccccceEEecCCC----cee Confidence 346799999998642 233 No 106 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=97.70 E-value=2.1e-07 Score=57.21 Aligned_cols=292 Identities=20% Similarity=0.180 Sum_probs=132.9 Q ss_pred CCcCccccccHHH---------HH-HhcC------CccchhHH--HHHHH-------hccc---hhHhhcceecccCCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLAD---------VA-ARMT------PDGKIDPQ--IVEML-------NETN---EILDDMTVIEANGFTE 52 (331) Q Consensus 1 M~~l~~~a~TL~E---------~A-k~~~------~~~~~~~~--VIE~l-------~~~s---~iL~~lpf~e~n~~~~ 52 (331) ||. ...+|-++ .. |.+. ++++...+ =.|-| +..+ -++.+++-..+ ..|- T Consensus 1 ~~~--~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a-~STV 77 (463) T protein:vir:95 1 MTI--EKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPA-QSTV 77 (463) T ss_pred CCc--ccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchh-hhhh Confidence 442 11111111 00 1110 11111000 01111 1111 11233332222 2345 Q ss_pred ceeEEEeecCCc---ceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 53 HKTTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATT 129 (331) Q Consensus 53 ~~~~~~~~lP~~---~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~ 129 (331) |+|.+...--+. .|-.=-.-.+-+.+++.+++..++.++.--.|-...-..++ ..+-.+.+.+..|..+.++++.. 
T Consensus 78 ~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~-~~d~~~~~~~dai~~ia~tiE~a 156 (463) T protein:vir:95 78 VKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN-IADPSQILTEDAIAVVAKTIEWA 156 (463) T ss_pred hhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcc-cccHHHHHHHHHHHHHHHHHHHH Confidence 677766655443 34222223455678999999999999999999886655444 44566778888999999999999 Q ss_pred hccCCCCCChh------hhcCchhhhccccccccceeeccCCCCCCceEEE----EEEeCCCcEEEEc-cCCCccceeec Q lcl|Aclame:pro 130 LFYGDSSIDAE------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAGLQSR 198 (331) Q Consensus 130 ~iyGD~~~~~~------~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~----~V~~g~~~~~giy-pkg~~aGl~~~ 198 (331) +||||+...|+ +||||.+.. +..|+|||.|.--.-..|| .+.-+-+...-+| |-|.++-|+-. T Consensus 157 ~FyGds~l~~~~~~~gleFDGl~~lI------d~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~ 230 (463) T protein:vir:95 157 SFYGDASLTSEVEGEGLEFDGLAKLI------DKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNS 230 (463) T ss_pred HhhhhhccCCCcCccccchhhhhhhc------CCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHH Confidence 99999999985 999998876 5789999988663333333 2333445666677 88888888866 Q ss_pred cccceeee-ccCCCeeE-EEEE-EEEeeece-EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC-CCCc Q lcl|Aclame:pro 199 DLGEDTLI-DAAGGRYQ-GYRT-HYKWDIGL-TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV-GMGR 273 (331) Q Consensus 199 d~g~~~~~-d~~g~~~~-~~~t-~~~w~~Gl-~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~-~~g~ 273 (331) -++.+++. .+|++.++ ||.- .|...-|. .|...+-.-.-..+|-+ ..+.|+. .+.. 
T Consensus 231 ~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~-------------------~~~~p~ap~~~~ 291 (463) T protein:vir:95 231 ILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDES-------------------LQPLPNAPQPAK 291 (463) T ss_pred hcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccch-------------------hhcCCCCccCce Confidence 66666555 56655432 2211 11111121 12211111111111111 1112211 1111 Q ss_pred eEEEeCHHHHHHHHHHhhcCCc-ceeeeecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 274 PAFYMPRKIRSFLRRQITNKVA-ASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 274 ~~~y~n~~v~~~L~~~~~~~~~-~~~~~~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) +.+=+-.+=...+.... +... ...-......|+ ++|... +..|++.|. T Consensus 292 ~tatv~~~~~~~~~~~~-~~a~~~Y~vv~~s~~ge------S~pS~i---vtaT~a~~~ 340 (463) T protein:vir:95 292 VTATVETKQKGAFENEE-DRAGLSYKVVVNSDDAQ------SAPSEE---VTATVSNVD 340 (463) T ss_pred eEEEEeeccCCCCCCcc-cccceEEEEEEECCCCC------cccchh---eeeeeeecc Confidence 11100000000000000 0000 000000011111 111111 112222222 No 107 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=97.70 E-value=2.1e-07 Score=57.21 Aligned_cols=292 Identities=20% Similarity=0.180 Sum_probs=132.9 Q ss_pred CCcCccccccHHH---------HH-HhcC------CccchhHH--HHHHH-------hccc---hhHhhcceecccCCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLAD---------VA-ARMT------PDGKIDPQ--IVEML-------NETN---EILDDMTVIEANGFTE 52 (331) Q Consensus 1 M~~l~~~a~TL~E---------~A-k~~~------~~~~~~~~--VIE~l-------~~~s---~iL~~lpf~e~n~~~~ 52 (331) ||. ...+|-++ .. |.+. 
++++...+ =.|-| +..+ -++.+++-..+ ..|- T Consensus 1 ~~~--~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a-~STV 77 (463) T protein:vir:99 1 MTI--EKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPA-QSTV 77 (463) T ss_pred CCc--ccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchh-hhhh Confidence 442 11111111 00 1110 11111000 01111 1111 11233332222 2345 Q ss_pred ceeEEEeecCCc---ceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 53 HKTTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATT 129 (331) Q Consensus 53 ~~~~~~~~lP~~---~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~ 129 (331) |+|.+...--+. .|-.=-.-.+-+.+++.+++..++.++.--.|-...-..++ ..+-.+.+.+..|..+.++++.. T Consensus 78 ~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~-~~d~~~~~~~dai~~ia~tiE~a 156 (463) T protein:vir:99 78 VKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN-IADPSQILTEDAIAVVAKTIEWA 156 (463) T ss_pred hhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcc-cccHHHHHHHHHHHHHHHHHHHH Confidence 677766655443 34222223455678999999999999999999886655444 44566778888999999999999 Q ss_pred hccCCCCCChh------hhcCchhhhccccccccceeeccCCCCCCceEEE----EEEeCCCcEEEEc-cCCCccceeec Q lcl|Aclame:pro 130 LFYGDSSIDAE------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAGLQSR 198 (331) Q Consensus 130 ~iyGD~~~~~~------~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~----~V~~g~~~~~giy-pkg~~aGl~~~ 198 (331) +||||+...|+ +||||.+.. +..|+|||.|.--.-..|| .+.-+-+...-+| |-|.++-|+-. 
T Consensus 157 ~FyGds~l~~~~~~~gleFDGl~~lI------d~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~ 230 (463) T protein:vir:99 157 SFYGDASLTSEVEGEGLEFDGLAKLI------DKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNS 230 (463) T ss_pred HhhhhhccCCCcCccccchhhhhhhc------CCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHH Confidence 99999999985 999998876 5789999988663333333 2333445666677 88888888866 Q ss_pred cccceeee-ccCCCeeE-EEEE-EEEeeece-EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcC-CCCc Q lcl|Aclame:pro 199 DLGEDTLI-DAAGGRYQ-GYRT-HYKWDIGL-TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNV-GMGR 273 (331) Q Consensus 199 d~g~~~~~-d~~g~~~~-~~~t-~~~w~~Gl-~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~-~~g~ 273 (331) -++.+++. .+|++.++ ||.- .|...-|. .|...+-.-.-..+|-+ ..+.|+. .+.. T Consensus 231 ~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~-------------------~~~~p~ap~~~~ 291 (463) T protein:vir:99 231 ILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDES-------------------LQPLPNAPQPAK 291 (463) T ss_pred hcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccch-------------------hhcCCCCccCce Confidence 66666555 56655432 2211 11111121 12211111111111111 1112211 1111 Q ss_pred eEEEeCHHHHHHHHHHhhcCCc-ceeeeecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 274 PAFYMPRKIRSFLRRQITNKVA-ASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 274 ~~~y~n~~v~~~L~~~~~~~~~-~~~~~~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) +.+=+-.+=...+.... +... ...-......|+ ++|... +..|++.|. 
T Consensus 292 ~tatv~~~~~~~~~~~~-~~a~~~Y~vv~~s~~ge------S~pS~i---vtaT~a~~~ 340 (463) T protein:vir:99 292 VTATVETKQKGAFENEE-DRAGLSYKVVVNSDDAQ------SAPSEE---VTATVSNVD 340 (463) T ss_pred eEEEEeeccCCCCCCcc-cccceEEEEEEECCCCC------cccchh---eeeeeeecc Confidence 11100000000000000 0000 000000011111 111111 112222222 No 108 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.67 E-value=4e-06 Score=50.15 Aligned_cols=210 Identities=11% Similarity=0.045 Sum_probs=119.4 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEE-eecCCcceeecCCccCc-ccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR-SGLPTGTWRKLNYGVQP-EKS 78 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~-~~lP~~~fR~lN~g~~~-s~~ 78 (331) +..-.....+..+.. ...+. .+...|++ +.+.++|+..+......++. +.+.+. .+-..++|..=++..++ +.. T Consensus 127 ~~~~~~~~~~~~~~~-~~vp~-~~~~~i~~-~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~ 202 (397) T protein:vir:96 127 KGAEKRDGFTSVEGG-ALIPQ-ELLQPQLE-PKDIVDLSKYVRSVPVNSAS-GKFPVISKSGSKMATVQQLEKNPQLANP 202 (397) T ss_pred hhhhhhhcccccccc-cchhH-HHHHHHHH-hhhhhhHHHhhhhccccccc-eeEEEEeccCCccccccccccccccccc Confidence 000000011111111 01111 23345776 46778888888876654443 333332 23355667666666775 679 Q ss_pred eEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 79 RTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 79 t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) ++.+++-.++-+.+.+.|.+.+.+... +..++- .....++++......|++|+.... 
T Consensus 203 ~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i---~~~l~~~~~~~~~~~i~~g~g~~~------------------- 260 (397) T protein:vir:96 203 KMVEIDYSVATRRGYIPISQEMIDDASYDVTGLI---ADEIQDQSLNTKNADIAAVLKTAT------------------- 260 (397) T ss_pred cccceeecHhHhhcchhhHHHHHhhhHHHHHHHH---HHHHHHHHHHHHHHHHhhcccccc------------------- Confidence 999999999999999999998888754 333433 344678888888888888753211 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) |.| T Consensus 261 ------------------------------~~~----------------------------------------------- 263 (397) T protein:vir:96 261 ------------------------------AKS----------------------------------------------- 263 (397) T ss_pred ------------------------------ccc----------------------------------------------- Confidence 000 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCeEE Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpi 317 (331) ..+-.+|.+++..+ ++. ..+.+|+||++....|+.. .+..+.+.+.+ .+.+-..-.++|.|| T Consensus 264 -----------~~~~d~~~~~~~~~---~~~--~~~a~~v~n~~~~~~l~~l-kd~~G~~~~~~-~~~~~~~~~l~G~pv 325 (397) T protein:vir:96 264 -----------VVGVDGLKDLINKE---IKK--VYDVKLFISASMYSELDKL-KDKNGRYLLQD-SITAASGKQLLGKEV 325 (397) T ss_pred -----------ccchHHHHHHHHHh---hhh--hcCcEEEEcHHHHHHHHHh-hccCCCeEecc-CccCCCcccccccce Confidence 00112233333222 222 2247899999999999875 45555443333 343334457999999 Q ss_pred EEEEec-cCCCcc---cC Q lcl|Aclame:pro 318 RRTDAL-LLTEAR---VV 331 (331) Q Consensus 318 r~~dai-~~tE~~---Vv 331 (331) ..++.. 
..++.- ++ T Consensus 326 ~~~~~~~~~~~~~~~~~~ 343 (397) T protein:vir:96 326 VVLDDDVIGKSVGNVVGF 343 (397) T ss_pred EEecccccCCCCCceEEE Confidence 987653 444432 22 No 109 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.66 E-value=6.1e-06 Score=49.17 Aligned_cols=230 Identities=9% Similarity=0.071 Sum_probs=130.5 Q ss_pred CCcCccccccHHHHH------HhcCC------ccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA------ARMTP------DGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~A------k~~~~------~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~ 68 (331) +..-....+|..|-+ +..+. ...+...|++.+.+.|+|++...+....+ .....+.++-|.+.|.. T Consensus 67 ~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~ 144 (395) T protein:vir:95 67 LAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI--KTRVIKADPAGQAVWGK 144 (395) T ss_pred HhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEee Confidence 222122233433311 11111 12245679999999999999998877642 35677788889999998 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCC--hhhhcC Q lcl|Aclame:pro 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSID--AEKFMG 144 (331) Q Consensus 69 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~--~~~F~G 144 (331) .+.+.+ ++.+++.+++-.++-|.+.+.|.+.|.+..+ |.++|-.. ...++++....+.|++||-... 
|.++ T Consensus 145 e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~---~la~~ia~~~~~a~i~G~G~~~~qP~Gi-- 219 (395) T protein:vir:95 145 VFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRT---QIQEAISVALESAIINGGGAAKTQPVGL-- 219 (395) T ss_pred cccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHH---HHHHHHHHHHhhheeeccCCCCcCceee-- Confidence 877775 5789999999999999999999999988765 55655444 4789999999999999985431 4433 Q ss_pred chhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeee Q lcl|Aclame:pro 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDI 224 (331) Q Consensus 145 L~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~ 224 (331) -+.. . ++... +.+ +..++... T Consensus 220 -l~~~---~-----------~~~~~------~~~-----------~~~~~~~t--------------------------- 240 (395) T protein:vir:95 220 -MKDV---N-----------TNSGA------VTD-----------KASSGTLT--------------------------- 240 (395) T ss_pred -eecc---c-----------ccccc------ccc-----------ccccchhh--------------------------- Confidence 1110 0 00000 000 00000000 Q ss_pred ceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHh-------hcCCCCceEEEeCHHHHHHHHHHhhcCCcce Q lcl|Aclame:pro 225 GLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELI-------PNVGMGRPAFYMPRKIRSFLRRQITNKVAAS 297 (331) Q Consensus 225 Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~i-------p~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~ 297 (331) ..+|+ .+.+.+.+++..+ .....++..|.||++-+.-+. 
.+ T Consensus 241 ------------~~~~~------------~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~------g~-- 288 (395) T protein:vir:95 241 ------------FADAD------------TTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQ------AR-- 288 (395) T ss_pred ------------hhhhH------------hhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcC------Cc-- Confidence 00110 0112222222211 112245678999976443111 11 Q ss_pred eeeecccCCceeEEE-cCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 298 TLTMEEIAGKKVVAF-DGIPCRRTDALLLTEARVV 331 (331) Q Consensus 298 ~~~~~~~~g~~v~~~-~gvpir~~dai~~tE~~Vv 331 (331) .......|.+++-+ .|+||..++++-..+ |+ T Consensus 289 -~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~--i~ 320 (395) T protein:vir:95 289 -YTYLTANGGFVTVLPYNVTIITSEFVPEGK--LV 320 (395) T ss_pred -ceeccCCCcceeccCCcceEEEcCCCCCCc--EE Confidence 22223566666544 588888888875433 33 No 110 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=97.64 E-value=8.5e-06 Score=48.39 Aligned_cols=210 Identities=11% Similarity=0.104 Sum_probs=122.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhccee----cccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~----e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 76 (331) |+.. .-++.++ +.+. .....|+|.+.+.+.+ ..+... 
++..|......+....+++.|..=++..+++ T Consensus 1 MA~~---~T~~~~~---~iPe-v~s~~v~~~~~~~~~~-~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~ 72 (272) T protein:vir:98 1 MAVG---TTKMAQM---LDPE-VLADMIDAEVGKAIRF-APLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMT 72 (272) T ss_pred CCCc---cccchhe---echH-HHHHHHHHHHHHHhhh-hccccccccccCCCCCEEEEEEecCCCCcccccCCCccccc Confidence 7742 2345554 2232 2344566776654433 333322 2333333455666778999999989999999 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) +.++.+++..++-++..+.|+.....+ .+|..+. -.++..+++++++...+|.- | .. T Consensus 73 ~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~---~~~~~~~~~a~~~d~~i~~~-----------~-------~~- 130 (272) T protein:vir:98 73 QLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQ---AAKQIVEAIDHKVDADVLDA-----------L-------SK- 130 (272) T ss_pred ccccceEEEEeeeeeeeeeecHHHHhhccccHHHH---HHHHHHHHHHHHHHHHHHHH-----------h-------cc- Confidence 999999999999999999999877655 3444333 34446777777777666521 0 00 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) +.+.+ T Consensus 131 -a~~~~-------------------------------------------------------------------------- 135 (272) T protein:vir:98 131 -STQTV-------------------------------------------------------------------------- 135 (272) T ss_pred -ccccc-------------------------------------------------------------------------- Confidence 00000 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCC-cceee-eecccCCceeEEEc Q lcl|Aclame:pro 236 
RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV-AASTL-TMEEIAGKKVVAFD 313 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~-~~~~~-~~~~~~g~~v~~~~ 313 (331) + .+.+ .+.+.+|+.++........+|+||..+...|++...... ..... ...-..| .+-.+. T Consensus 136 ----------~-~~~t----~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g-~ig~i~ 199 (272) T protein:vir:98 136 ----------E-ATAT----VDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSG-VYGEVL 199 (272) T ss_pred ----------c-cccC----HHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccc-cchhhc Confidence 0 0000 233566667776555667899999999999986532211 10000 1111122 245789 Q ss_pred CeEEEEEEeccCCCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLTEARVV 331 (331) Q Consensus 314 gvpir~~dai~~tE~~Vv 331 (331) |+||..++++-..-.-++ T Consensus 200 G~~Vi~s~~~p~~t~~~~ 217 (272) T protein:vir:98 200 GVQIVRSRKCPKGTAYMV 217 (272) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999999854333233 No 111 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=97.64 E-value=8.5e-06 Score=48.39 Aligned_cols=210 Identities=11% Similarity=0.104 Sum_probs=122.8 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhccee----cccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~----e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 76 (331) |+.. .-++.++ +.+. .....|+|.+.+.+.+ ..+... 
++..|......+....+++.|..=++..+++ T Consensus 1 MA~~---~T~~~~~---~iPe-v~s~~v~~~~~~~~~~-~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~ 72 (272) T protein:vir:30 1 MAVG---TTKMAQM---LDPE-VLADMIDAEVGKAIRF-APLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMT 72 (272) T ss_pred CCCc---cccchhe---echH-HHHHHHHHHHHHHhhh-hccccccccccCCCCCEEEEEEecCCCCcccccCCCccccc Confidence 7742 2345554 2232 2344566776654433 333322 2333333455666778999999989999999 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) +.++.+++..++-++..+.|+.....+ .+|..+. -.++..+++++++...+|.- | .. T Consensus 73 ~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~---~~~~~~~~~a~~~d~~i~~~-----------~-------~~- 130 (272) T protein:vir:30 73 QLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQ---AAKQIVEAIDHKVDADVLDA-----------L-------SK- 130 (272) T ss_pred ccccceEEEEeeeeeeeeeecHHHHhhccccHHHH---HHHHHHHHHHHHHHHHHHHH-----------h-------cc- Confidence 999999999999999999999877655 3444333 34446777777777666521 0 00 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) +.+.+ T Consensus 131 -a~~~~-------------------------------------------------------------------------- 135 (272) T protein:vir:30 131 -STQTV-------------------------------------------------------------------------- 135 (272) T ss_pred -ccccc-------------------------------------------------------------------------- Confidence 00000 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCC-cceee-eecccCCceeEEEc Q lcl|Aclame:pro 236 
RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV-AASTL-TMEEIAGKKVVAFD 313 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~-~~~~~-~~~~~~g~~v~~~~ 313 (331) + .+.+ .+.+.+|+.++........+|+||..+...|++...... ..... ...-..| .+-.+. T Consensus 136 ----------~-~~~t----~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g-~ig~i~ 199 (272) T protein:vir:30 136 ----------E-ATAT----VDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSG-VYGEVL 199 (272) T ss_pred ----------c-cccC----HHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccc-cchhhc Confidence 0 0000 233566667776555667899999999999986532211 10000 1111122 245789 Q ss_pred CeEEEEEEeccCCCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLTEARVV 331 (331) Q Consensus 314 gvpir~~dai~~tE~~Vv 331 (331) |+||..++++-..-.-++ T Consensus 200 G~~Vi~s~~~p~~t~~~~ 217 (272) T protein:vir:30 200 GVQIVRSRKCPKGTAYMV 217 (272) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999999854333233 No 112 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=97.57 E-value=3e-06 Score=50.82 Aligned_cols=233 Identities=18% Similarity=0.102 Sum_probs=123.2 Q ss_pred CCcCccccccHH---------HHH-HhcCC------ccchhHHH--HHHH-------hccch---hHhhcceecccCCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLA---------DVA-ARMTP------DGKIDPQI--VEML-------NETNE---ILDDMTVIEANGFTE 52 (331) Q Consensus 1 M~~l~~~a~TL~---------E~A-k~~~~------~~~~~~~V--IE~l-------~~~s~---iL~~lpf~e~n~~~~ 52 (331) ||.. +.+++. |.. |.+.. 
++++..+- .|-| +..++ ++.+++-..+ ..|- T Consensus 1 ~~~~--~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a-~sTv 77 (462) T protein:vir:96 1 MHKD--TNLTAEQNKYADKFQEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPA-QSTV 77 (462) T ss_pred Cccc--cccchhhhhhhchhhHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchh-hhhh Confidence 5532 222222 111 22211 22221110 1222 11111 1222222222 2344 Q ss_pred ceeEEEeecCCc---ceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 53 HKTTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATT 129 (331) Q Consensus 53 ~~~~~~~~lP~~---~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~ 129 (331) |+|.+...--+. .|-.=-.-.+-+.+++.+++..++.|+..-.|+-..--.++ ..+-.+.|.+..|..+.++++.. T Consensus 78 ~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~-~~d~~~~~~~dai~~~a~tiE~a 156 (462) T protein:vir:96 78 QKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNN-IQDPMQILTEDAIAVVAKTIEWA 156 (462) T ss_pred hhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccc-hhhHHHHHHHHHHHHHHHHHHHH Confidence 666666555443 34222223456678999999999999998888876544444 44555888889999999999999 Q ss_pred hccCCCCCCh------hhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccce Q lcl|Aclame:pro 130 LFYGDSSIDA------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGED 203 (331) Q Consensus 130 ~iyGD~~~~~------~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~ 203 (331) .||||+...| -+||||.+.. ++.|+|||.|.- .|+-++. 
T Consensus 157 ~Fygds~l~~~~~~~gleFDGl~~lI------~~~NViDarG~~---Ls~~~ln-------------------------- 201 (462) T protein:vir:96 157 SFYGDASLTADPTGQGLEFDGLAKLI------DKDNVIDAKGES---LTETLLN-------------------------- 201 (462) T ss_pred HhhhhcccCCCccccccchhhhhhhc------CCCceeecCCCC---ccHHHHh-------------------------- Confidence 9999999998 8999998876 478999998721 1111111 Q ss_pred eeeccCCCeeEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHH Q lcl|Aclame:pro 204 TLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIR 283 (331) Q Consensus 204 ~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~ 283 (331) .+++|| -....-++-+||+--++ T Consensus 202 ----------------------------~aa~~i-----------------------------~~~fGt~TD~~~p~~v~ 224 (462) T protein:vir:96 202 ----------------------------RSAVLI-----------------------------GKSFGTATDAYMPIGVH 224 (462) T ss_pred ----------------------------hhhhhc-----------------------------ccccCChhheecchHHH Confidence 111222 11122245667777777 Q ss_pred HHHHHHhhcCCcceeeeecccC--------CceeEEEcCeEEE-----EEEeccCCCcccC Q lcl|Aclame:pro 284 SFLRRQITNKVAASTLTMEEIA--------GKKVVAFDGIPCR-----RTDALLLTEARVV 331 (331) Q Consensus 284 ~~L~~~~~~~~~~~~~~~~~~~--------g~~v~~~~gvpir-----~~dai~~tE~~Vv 331 (331) ..|+-+...+.-+. .+.+.. .+.+++-..|-+. .-+++++-|.... 
T Consensus 225 a~f~~~~l~~qrv~--~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~ 283 (462) T protein:vir:96 225 ADFVNSVLGRQMQL--MQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPL 283 (462) T ss_pred HHHHHhhcCceEEE--EcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccC Confidence 77765555554332 222221 1233333333333 3444544444322 No 113 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=97.01 E-value=5.5e-06 Score=49.44 Aligned_cols=286 Identities=14% Similarity=0.097 Sum_probs=132.9 Q ss_pred CCcCccccccHHHHH--HhcCCccchhHHHHHHHhcc--chhHhhcce-------------ecccCCccceeEEEeec-C Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA--ARMTPDGKIDPQIVEMLNET--NEILDDMTV-------------IEANGFTEHKTTVRSGL-P 62 (331) Q Consensus 1 M~~l~~~a~TL~E~A--k~~~~~~~~~~~VIE~l~~~--s~iL~~lpf-------------~e~n~~~~~~~~~~~~l-P 62 (331) || ...+--+|-| |.+...+... +.++.+ .+-|..+.| ..+ ..|-|+|++...- - T Consensus 1 ~~---~~~~~~~~~a~~~al~~a~~~g----~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a-~STV~ey~~~~~rhG 72 (470) T protein:vir:10 1 MP---YEHLKHLDEATLKALNAAGQVA----ESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKA-KAYEHEYNVVTARHD 72 (470) T ss_pred CC---hhHhhhhhHHHHHHHHHhhhcc----hhhhhhhhccceeEeeecCccchhhhhcCCchh-hhHhhhhhhhccccc Confidence 55 4444332222 1121112111 112221 122233333 222 2345777664442 2 Q ss_pred CcceeecCCc--cCcccceEEEEEEEEEEecchhhhhHH-HHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-- Q lcl|Aclame:pro 63 TGTWRKLNYG--VQPEKSRTVQVKDSMGMLETYAEVDKA-LADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSI-- 137 (331) Q Consensus 63 ~~~fR~lN~g--~~~s~~t~~~~~~~l~ilgg~~eVDr~-la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~-- 137 (331) .+++-.++++ .+-+.++..+++..++.|+.-.+|... +..+..+..+....+.+..|-.+.++++..+||||+.. 
T Consensus 73 ~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s 152 (470) T protein:vir:10 73 KIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGD 152 (470) T ss_pred cccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhcccccc Confidence 2333233333 344578999999999999999999876 44555566677788888999999999999999999844 Q ss_pred ------ChhhhcCchhhhccccccccceeeccCCCCCCceEEEEEEe---------CCCcEEEEc-cCCCccceeecccc Q lcl|Aclame:pro 138 ------DAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVW---------GPNTLHTIY-PKGSQAGLQSRDLG 201 (331) Q Consensus 138 ------~~~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~---------g~~~~~giy-pkg~~aGl~~~d~g 201 (331) ++-+||||.+-. +...+.|+|||.|.-- |+-++.| +-++..-+| |-|.++-|+-.-++ T Consensus 153 ~~~g~~~gleFDGl~~lI---d~~~~~NViDarG~~L---s~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~ 226 (470) T protein:vir:10 153 DVPGSPNNLQQDGIINII---KRGAPQNVLDAGGRPL---SIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQ 226 (470) T ss_pred ccCcccCceeccchhhhc---cCCCCccccccCCCCc---cHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcC Confidence 455899998854 3345789999987653 4432222 224445566 87888887777777 Q ss_pred ceeee-ccCCCeeEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchh---hHHHHHHHHHHHhhcCCCCceEEE Q lcl|Aclame:pro 202 EDTLI-DAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGA---DLIDLMTQAVELIPNVGMGRPAFY 277 (331) Q Consensus 202 ~~~~~-d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~---~l~~lm~~a~~~ip~~~~g~~~~y 277 (331) .+++. .+|++.++ ++..+....+- --+|+. .++.-+. ++..+..++ ..++.. .| .+. 
T Consensus 227 ~qRv~~~~N~~~~~-~G~~v~~f~sa----------~G~I~L----~~s~~m~~~~k~~p~~l~~--~v~~~a-AP-~~~ 287 (470) T protein:vir:10 227 ISRVMTTADRRAGL-LGADAQSYIGV----------RGEHSL----YPSQFLGDFHKFNPARFGA--EVGDFA-AP-SNS 287 (470) T ss_pred ceEEEEecCCCcee-eeeeccceeee----------eeeeee----cccccccchhhcCcccCCc--ccCCcc-cC-cee Confidence 77666 44665543 33333332222 122221 0111000 000011111 011100 11 011 Q ss_pred eC--HHH-HHHHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 278 MP--RKI-RSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 278 ~n--~~v-~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) .- .+. ..++... .|.. .+.++.|-.+. .-++.++. ...+..| T Consensus 288 ~tv~~t~~~~a~~~~--sk~g-------~~~~~~v~sy~-y~v~~~~g--ds~s~~v 332 (470) T protein:vir:10 288 WTVSTTDNFVTLPYN--SGLG-------DPANTTVYSYA-FKAANFYG--ESAAKYI 332 (470) T ss_pred EEeecCCCceeeccc--CCCC-------cccCcceeEEE-EEEEEecC--CCCcceE Confidence 10 000 0011110 0000 01112111110 01111110 0111111 No 114 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=97.00 E-value=9.9e-05 Score=42.54 Aligned_cols=245 Identities=13% Similarity=0.074 Sum_probs=129.8 Q ss_pred CCcCccccccHHHHH------HhcCCc------cchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADVA------ARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~A------k~~~~~------~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~ 68 (331) +.......+|-.|.. +..+.+ ..+...|+|.+.+.|||+..+.++... ..+...+.++-|.+.|.. 
T Consensus 57 ~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~--~~~~i~~~~~~~~a~W~~ 134 (381) T protein:vir:10 57 SLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) T ss_pred HhcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecC--cceEEEeecCCcceEEee Confidence 111111222332211 111111 124567999999999999999887653 346778888889999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCch Q lcl|Aclame:pro 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) Q Consensus 69 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~ 146 (331) ...+.+ ++..++.+++-.++-|.+.+.|.+.|.+... |.++|-..+ ..++++......|++||-...|.+|. T Consensus 135 e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~---la~~~a~~~~~afi~GdG~~qP~Gil--- 208 (381) T protein:vir:10 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQ---IEEAFAVALETAFLKGTGKDQPIGLN--- 208 (381) T ss_pred cccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHH---HHHHHHHHhhceeEecccCCCceeee--- Confidence 988876 5578999999999999999999999988755 566665554 67899999999999999877776652 Q ss_pred hhhccc-ccccc---c-----ee--------------------eccCCCC--CCceEEEEEE------------eCC--C Q lcl|Aclame:pro 147 PRFNSL-SAENG---Q-----NI--------------------IDAGGTG--SDNASIWLTV------------WGP--N 181 (331) Q Consensus 147 ~R~~~~-~~~~~---~-----~v--------------------idaGgtG--~~~tSI~~V~------------~g~--~ 181 (331) +-.+.. ....+ . .+ .+..+.. ...-.+|++. +.. 
+ T Consensus 209 ~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G 288 (381) T protein:vir:10 209 RQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANG 288 (381) T ss_pred ecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCC Confidence 100000 00000 0 00 0000000 0001122211 000 0 Q ss_pred cEEEEccCC--------Cccc-eeeccccceeeeccCCCeeEEE----------EEEEEeeeceEEecccceeeeecccc Q lcl|Aclame:pro 182 TLHTIYPKG--------SQAG-LQSRDLGEDTLIDAAGGRYQGY----------RTHYKWDIGLTLRDWRYVVRIANVDV 242 (331) Q Consensus 182 ~~~giypkg--------~~aG-l~~~d~g~~~~~d~~g~~~~~~----------~t~~~w~~Gl~v~d~r~v~RI~NId~ 242 (331) ..-+..|-| -.+| +-..|.....+.|..|-.+.-+ .-+..++++-.+.|..+++=+ -|.+ T Consensus 289 ~~v~~lp~g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~-~l~~ 367 (381) T protein:vir:10 289 VYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVW-KLDL 367 (381) T ss_pred ceeecCCCCceeEEcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEE-EEee Confidence 000000101 0011 3344444444445544333221 233456667777777765543 3344 Q ss_pred ccCCCCccchh--hH Q lcl|Aclame:pro 243 SELTKNASAGA--DL 255 (331) Q Consensus 243 s~l~~~~~~~~--~l 255 (331) +..+ ++...+ .| T Consensus 368 ~~~~-~~~~~~~~~~ 381 (381) T protein:vir:10 368 KGHK-PALEDTEETL 381 (381) T ss_pred cCCc-cccccccccC Confidence 4422 221111 12 No 115 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=96.77 E-value=1e-05 Score=47.99 Aligned_cols=295 Identities=19% Similarity=0.178 Sum_probs=128.0 Q ss_pred CCcCccccccHH----HHHHhc------CCccchhHH--HHHH-------Hhccch---hHhhcceecccCCccceeEEE Q lcl|Aclame:pro 1 MPTLSTTNPTLA----DVAARM------TPDGKIDPQ--IVEM-------LNETNE---ILDDMTVIEANGFTEHKTTVR 58 (331) Q Consensus 1 
M~~l~~~a~TL~----E~Ak~~------~~~~~~~~~--VIE~-------l~~~s~---iL~~lpf~e~n~~~~~~~~~~ 58 (331) |-.-.+.+.+|. |+-|.+ +++++...+ =.|- |+..+. ++.+++-..+ ..|-|+|.+. T Consensus 1 ~~~~~n~~~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a-~STV~~y~~~ 79 (464) T protein:vir:80 1 MTEKKNTERQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPA-TSTVAKYDVY 79 (464) T ss_pred CCcchhhHhhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchh-hhhhhhhhee Confidence 332222111111 111211 111111100 0111 111111 2333333222 2344666665 Q ss_pred eecCC---cceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|Aclame:pro 59 SGLPT---GTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDS 135 (331) Q Consensus 59 ~~lP~---~~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~ 135 (331) ..--+ ..|-.=-.-.+-+.+++.+++..++.|.+-=.|+.+.--.+. ..+-..+|.+..|..+.++++...||||+ T Consensus 80 ~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~-~~d~~~~~~~dai~~va~tiE~a~FyGds 158 (464) T protein:vir:80 80 LAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNN-IEDPMRILTDDAISVVAKTIEWASFYGDS 158 (464) T ss_pred eccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcc-hhhHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 55544 334222223456678999999999999998888766533333 33445577788999999999999999999 Q ss_pred CCChh-------hhcCchhhhccccccccceeeccCCCCCCceEEE----EEEeCCCcEEEEc-cCCCccceeeccccce Q lcl|Aclame:pro 136 SIDAE-------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAGLQSRDLGED 203 (331) Q Consensus 136 ~~~~~-------~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~----~V~~g~~~~~giy-pkg~~aGl~~~d~g~~ 203 (331) ...|. +||||.+.. 
++.|+|||.|..-.-.=|| .+.-+-++..-+| |-|-++-+.-.-+..+ T Consensus 159 ~l~~~~~~~~gleFDGl~~lI------~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q 232 (464) T protein:vir:80 159 DLSENPDAGSGLEFDGLAKLI------DKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQ 232 (464) T ss_pred ccCCCCCCccccchhhhHhhc------CCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCce Confidence 88875 899999876 5789999999872211111 2233445556666 7777766533333333 Q ss_pred -eeeccCCCeeEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHH Q lcl|Aclame:pro 204 -TLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKI 282 (331) Q Consensus 204 -~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v 282 (331) .+.-+||+.++ ++..+....+ + --+|+.- .+...++...+..+..+.|+.- .++.+=. ++ T Consensus 233 ~~~~~~n~~~~~-~G~~v~~f~s--------a--~G~i~L~-----~s~~m~~~~~ld~~~~~~~~ap-aapsvt~--tv 293 (464) T protein:vir:80 233 VQVISDNGQNAT-MGFNVKGFNS--------A--RGFIRLH-----GSTVMELEQILDENRMQLPNAP-QKATVKA--TL 293 (464) T ss_pred eEEEcCCCCcce-eeeecccccc--------c--ccceecc-----CccccCcccccccccccCCCCc-CCceeEE--Ee Confidence 33345555432 2222222111 1 1122110 1111111111222223333211 1111111 00 Q ss_pred HHHHHHHhhcCCcceeeeecccCCc---eeE--EEc--CeEEEEEEe-ccCCCcccC Q lcl|Aclame:pro 283 RSFLRRQITNKVAASTLTMEEIAGK---KVV--AFD--GIPCRRTDA-LLLTEARVV 331 (331) Q Consensus 283 ~~~L~~~~~~~~~~~~~~~~~~~g~---~v~--~~~--gvpir~~da-i~~tE~~Vv 331 (331) +...+ ..+-.++.+|. +|. 
..+ .+|....++ +...+..|= T Consensus 294 -------~~~~~--g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~ 341 (464) T protein:vir:80 294 -------EAGTK--GKFRDEDLTIDTEYKVVVVSDDAESAPSDVASVVIDDKKKQVK 341 (464) T ss_pred -------cCCcc--cCCccccccceeEEEEEEECCCCccccceeeeeeecCcccEEE Confidence 00000 00111111110 000 000 122221111 111111110 No 116 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=96.75 E-value=4.8e-06 Score=49.75 Aligned_cols=263 Identities=17% Similarity=0.175 Sum_probs=126.4 Q ss_pred CCc------Ccccc----ccHHHHHHhcCCccchhHHHHHHHhccc---hhHhhcceecccCCccceeEEEeecCCcc-- Q lcl|Aclame:pro 1 MPT------LSTTN----PTLADVAARMTPDGKIDPQIVEMLNETN---EILDDMTVIEANGFTEHKTTVRSGLPTGT-- 65 (331) Q Consensus 1 M~~------l~~~a----~TL~E~Ak~~~~~~~~~~~VIE~l~~~s---~iL~~lpf~e~n~~~~~~~~~~~~lP~~~-- 65 (331) -|- +-.++ -+|-..|. +... .+++.+. .|+..+ -++.+|+-..+ ..|-|+|.+...--+.+ T Consensus 36 ~~~~~~k~a~t~gy~~~~~~~t~gaA-lR~E-sLd~~l~-~Lt~~~~~ftf~~~i~k~~a-~STV~ey~~~~~~G~~G~~ 111 (514) T protein:vir:10 36 LPENVKKSAFTAGHSITPDTQTDGAA-NRIE-SLNRDLK-VTTWGERDFTLYNDIAKQPV-DNTVLKYTQYYSHGRTGHS 111 (514) T ss_pred cchhhhhhhhccccccCCccccCccc-hhhh-hhcccee-EeeecCcchhhhhhcCCchh-hHHHhhhhhhcccCccccc Confidence 000 00000 00111000 0000 0111110 111111 12444443333 23456666655544442 Q ss_pred -e-eecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCh---- Q lcl|Aclame:pro 66 -W-RKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDA---- 139 (331) Q Consensus 66 -f-R~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~---- 139 (331) | +.+. =.+-+..+..+++...+.++....|-..+--.++-. 
+-...+.+..|..+.++++..+||||+...| T Consensus 112 ~f~~E~g-i~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~-d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~ 189 (514) T protein:vir:10 112 LFQPEIG-IGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIV-DSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKG 189 (514) T ss_pred ccccccc-cCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchh-hHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccc Confidence 3 2222 223456889999999999999999998877766444 4455566888999999999999999998887 Q ss_pred --hhhcCchhhhccccccccceeeccCCCCCCceEE----EEEEeCCCcEEEEc-cCCCccceeeccccceeee-ccCCC Q lcl|Aclame:pro 140 --EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASI----WLTVWGPNTLHTIY-PKGSQAGLQSRDLGEDTLI-DAAGG 211 (331) Q Consensus 140 --~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI----~~V~~g~~~~~giy-pkg~~aGl~~~d~g~~~~~-d~~g~ 211 (331) -+||||.+.+ +..|+|||.|.--.-.=| -+++-|-++..-+| |-|.++-+.-..++.+++. ..|++ T Consensus 190 ~gleFDGl~~lI------~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~~n~~ 263 (514) T protein:vir:10 190 EGLQFDGLFKLI------APENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLPGQTG 263 (514) T ss_pred CcchhhhHHHhh------cCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEeecCcc Confidence 8999999987 478999999874211111 23444445666677 8788887777777777665 33444 Q ss_pred eeEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhh Q lcl|Aclame:pro 212 RYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQIT 291 (331) Q Consensus 212 ~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~ 291 (331) .++ .|+.+. +.+.---+|+. . |++ +|.+ .+-|+-++. 
T Consensus 264 ~~~---------~G~~v~--~f~s~~G~I~L---~-------------------------gs~--im~~--~n~L~~~~~ 300 (514) T protein:vir:10 264 GMT---------TGLDID--KFLSAHGSIRI---Q-------------------------GST--IMDS--DNKLDFDRP 300 (514) T ss_pred cee---------eeeecc--ceeEeccceee---c-------------------------CCe--eecc--cccCccCCc Confidence 332 233332 22222233321 0 001 1110 011110000 Q ss_pred cCCcceeeeecccCCceeEEEcCeEEEE-EEe--------ccCCCcccC Q lcl|Aclame:pro 292 NKVAASTLTMEEIAGKKVVAFDGIPCRR-TDA--------LLLTEARVV 331 (331) Q Consensus 292 ~~~~~~~~~~~~~~g~~v~~~~gvpir~-~da--------i~~tE~~Vv 331 (331) -..+ .+.+ + -+++-. .|+ ..++-..|. T Consensus 301 ~~~~--Ap~~----~-------~va~svT~~~~g~~~~ad~t~~~g~~~ 336 (514) T protein:vir:10 301 VSPT--APTA----P-------QLSATVTPDGGGLWHEADKTDSKGEVI 336 (514) T ss_pred cCCc--CCCC----C-------cceEEEecCcccccCcccccccccccc Confidence 0000 0000 0 011111 111 011111122 No 117 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=96.69 E-value=0.00025 Score=40.33 Aligned_cols=224 Identities=13% Similarity=0.111 Sum_probs=109.0 Q ss_pred CCcCccccccHHHH-HHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceeecCCccCcccce Q lcl|Aclame:pro 1 MPTLSTTNPTLADV-AARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSR 79 (331) Q Consensus 1 M~~l~~~a~TL~E~-Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~~t 79 (331) ++.- +.+- .........+...+...+.+.+++++..+.... 
+ .......+.-..+.|..-+...+++..+ T Consensus 237 ~~~~------~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i--~-~~~~~~~~~~~~a~~~~eG~~kp~s~~t 307 (517) T protein:vir:97 237 WTAE------LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL--P-TLVVGGDNALTQGTGHTTGTDKTESNIT 307 (517) T ss_pred eeee------cccccccccccchHHHHHHHHhhhhhccceeeeeeccc--c-ceeeecccccceeeeeecCCcccccccc Confidence 1100 0000 000000111223355566666666666554221 1 1111122222245577777778888899 Q ss_pred EEEEEEEEEEecchhhhhHHHHhhCC-C-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccccc Q lcl|Aclame:pro 80 TVQVKDSMGMLETYAEVDKALADLNG-N-SAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENG 157 (331) Q Consensus 80 ~~~~~~~l~ilgg~~eVDr~la~~~g-n-~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~ 157 (331) +.+++...+-+.+.+.+.+.+.+... | ..+....-......+++.+.+.+|++||- T Consensus 308 f~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdG---------------------- 365 (517) T protein:vir:97 308 LQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV---------------------- 365 (517) T ss_pred eeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccC---------------------- Confidence 99999999999999999998765432 2 23344444555788899999999999973 Q ss_pred ceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceeee Q lcl|Aclame:pro 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) Q Consensus 158 ~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI 237 (331) +|.+.+ +|+|.... .. . 
T Consensus 366 --------tg~~~~-------------gi~~~a~~---------------~~-----------------~---------- 382 (517) T protein:vir:97 366 --------TGVSET-------------QIYPVVGD---------------AW-----------------A---------- 382 (517) T ss_pred --------CCcccc-------------cccccccc---------------cc-----------------c---------- Confidence 121111 11111000 00 0 Q ss_pred eccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe-- Q lcl|Aclame:pro 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI-- 315 (331) Q Consensus 238 ~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv-- 315 (331) .. . ..+.+..++++.+..|+...+ +..|+||+.....|++. +|..+.. +-+.-..+..+....|+ T Consensus 383 ~~-----~-~~~~~~~d~i~~l~~a~~~a~-----~a~~vmn~~t~~~I~kl-KD~~G~Y-l~~~~~~~~~~~~l~G~~~ 449 (517) T protein:vir:97 383 TN-----V-TGTTNIQELLEKLSVATPKAA-----DSTLVIHRNDLAAIRFL-KDKNGNY-VFPVGVSNQTIATHFGFNR 449 (517) T ss_pred cc-----c-cccchHHHHHHHHHHHhhhcc-----CCEEEECHHHHHHHHHh-hcCCCCe-eccCcCCcccccccCCccc Confidence 00 0 011223455566666665433 36799999999999987 3443333 22332333333322231 Q ss_pred --EEEEEEec-------------------------c-----CCCccc---C Q lcl|Aclame:pro 316 --PCRRTDAL-------------------------L-----LTEARV---V 331 (331) Q Consensus 316 --pir~~dai-------------------------~-----~tE~~V---v 331 (331) |..-.++. . 
-.|.++ | T Consensus 450 ~~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i 500 (517) T protein:vir:97 450 LVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSL 500 (517) T ss_pred cccccccCceeEeeccccEEEeecceeeeeeeecccCceeEeeeeeecccc Confidence 11111100 0 011111 1 No 118 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=96.65 E-value=0.00044 Score=39.00 Aligned_cols=240 Identities=11% Similarity=0.062 Sum_probs=120.5 Q ss_pred CCcCccccccHHHH------HHhcCCc------cchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee Q lcl|Aclame:pro 1 MPTLSTTNPTLADV------AARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) Q Consensus 1 M~~l~~~a~TL~E~------Ak~~~~~------~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~ 68 (331) +.......+|-.|- .+..+.+ ..+...|+|.+.+.|+|+..+.+.... + .++..+..+-|.++|.. T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~-~-~~~i~~~~~~~~a~w~~ 141 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTG-L-RTKFLKSETSGVAVWGK 141 (383) T ss_pred HhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecC-C-ceEEEEEcCCcceEEee Confidence 11111122222221 1111111 124567999999999999999887653 3 46788889999999999 Q ss_pred cCCccC-cccceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCch Q lcl|Aclame:pro 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) Q Consensus 69 lN~g~~-~s~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~ 146 (331) ...+.+ .+.+++.+++-.++-|.+.+.|.+.|.+..+ |.++|-..+ ..++++......||+||-...|.+|. 
T Consensus 142 e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~---l~~~~a~~~~~a~i~G~G~~qP~Gil--- 215 (383) T protein:vir:78 142 IFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQ---IEEAFAVALESAYIVGDGNDKPIGLN--- 215 (383) T ss_pred cccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHH---HHHHHHHHHhhheEeccCCCCceeee--- Confidence 988875 5689999999999999999999999988765 455555544 77899999999999999766666652 Q ss_pred hhhccccccccc-----e------eec--------------cC--CCCCCceE----EEEEEeCCCcEEEEccC---CCc Q lcl|Aclame:pro 147 PRFNSLSAENGQ-----N------IID--------------AG--GTGSDNAS----IWLTVWGPNTLHTIYPK---GSQ 192 (331) Q Consensus 147 ~R~~~~~~~~~~-----~------vid--------------aG--gtG~~~tS----I~~V~~g~~~~~giypk---g~~ 192 (331) .-.+........ . ..| +. .++....+ .|++. +....-++|. ... T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n--~~~~~~~~~~~~~~~~ 293 (383) T protein:vir:78 216 RKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVN--PTDAWDVKKQYTSLNA 293 (383) T ss_pred eccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEc--CcchhhhccchhccCC Confidence 111000000000 0 000 00 00000000 12221 1000011111 001 Q ss_pred cc----------------------eeeccccceeeeccCCCeeEEE----------EEEEEeeeceEEecccceeeeecc Q lcl|Aclame:pro 193 AG----------------------LQSRDLGEDTLIDAAGGRYQGY----------RTHYKWDIGLTLRDWRYVVRIANV 240 (331) Q Consensus 193 aG----------------------l~~~d~g~~~~~d~~g~~~~~~----------~t~~~w~~Gl~v~d~r~v~RI~NI 240 (331) +| +-..|.....+.|..|-.+.-+ .-+...++|-.+.|..+++=+ -| T Consensus 294 ~G~~~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl-~~ 372 (383) T protein:vir:78 294 NGVYVTALPFNLNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVW-TL 372 (383) T ss_pred CCceeeecCCCceEEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEE-EE Confidence 11 1112222222223322222110 112334555566666553322 12 Q ss_pred ccccCCCCccch Q lcl|Aclame:pro 241 DVSELTKNASAG 252 (331) Q Consensus 241 d~s~l~~~~~~~ 252 (331) .+..-. 
..-+| T Consensus 373 ~~~~~~-~~~~~ 383 (383) T protein:vir:78 373 NINPAE-QTPEG 383 (383) T ss_pred EecCCC-CCCCC Confidence 221111 11122 No 119 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=95.76 E-value=0.00019 Score=40.96 Aligned_cols=154 Identities=14% Similarity=0.107 Sum_probs=64.7 Q ss_pred ceEEEEEEeCCCcEEEEccCCCccceeecccc-ceee------eccCCCeeEEEEEEEEeeeceEEecccc--------- Q lcl|Aclame:pro 170 NASIWLTVWGPNTLHTIYPKGSQAGLQSRDLG-EDTL------IDAAGGRYQGYRTHYKWDIGLTLRDWRY--------- 233 (331) Q Consensus 170 ~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g-~~~~------~d~~g~~~~~~~t~~~w~~Gl~v~d~r~--------- 233 (331) ..+|-+..|...... .-..|+ ++-+- +..+ .+.+|+-|..-|..-.-..++.=-+|++ T Consensus 1 mpaltLaea~k~~~d-----~l~~~V-iE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~ 74 (310) T protein:vir:97 1 MASVTLAESAKLAQD-----ELVAGV-IENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAA 74 (310) T ss_pred CcccchHHHhhcCcc-----hHHHHH-HHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCccccc Confidence 333422222210000 000000 00000 0000 0112222211111100000000000000 Q ss_pred ----------------------e------------------------------------------eeeeccccccCCC-C Q lcl|Aclame:pro 234 ----------------------V------------------------------------------VRIANVDVSELTK-N 248 (331) Q Consensus 234 ----------------------v------------------------------------------~RI~NId~s~l~~-~ 248 (331) + .=.+|++.+.... . 
T Consensus 75 ~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~ 154 (310) T protein:vir:97 75 ATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTG 154 (310) T ss_pred cccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecC Confidence 0 0011222211111 1 Q ss_pred ccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHH---HHhhcCCcceeeeecccCCceeEEEcCeEEEEEEeccC Q lcl|Aclame:pro 249 ASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLR---RQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLL 325 (331) Q Consensus 249 ~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~---~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~dai~~ 325 (331) +.++.-=.+.|.+.+++++....++.+|+||++.+.+++ ++...+. +. ....+..|++|..|+||||..||-|-. T Consensus 155 ~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g-~~-~~~~~~~G~~v~~~~GiPi~~~d~ip~ 232 (310) T protein:vir:97 155 ATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGAS-IN-EVVELPSGAEVPAYSGTPIFRNDYIPT 232 (310) T ss_pred CCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCC-CC-CccccCCCCEEeeeCCeEEEEeCccCC Confidence 111221246777778888877788999999987544333 3333222 22 345678999999999999999999977 Q ss_pred CCcccC Q lcl|Aclame:pro 326 TEARVV 331 (331) Q Consensus 326 tE~~Vv 331 (331) +|..+= T Consensus 233 ~~~~~~ 238 (310) T protein:vir:97 233 NQTKGG 238 (310) T ss_pred Cccccc Confidence 665422 No 120 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=95.58 E-value=0.00014 Score=41.70 Aligned_cols=278 Identities=20% Similarity=0.204 Sum_probs=126.1 Q ss_pred CCcC------ccccccHHHHH-HhcC------CccchhHH--HHHHH-------hccch---hHhhcceecccCCcccee Q lcl|Aclame:pro 1 MPTL------STTNPTLADVA-ARMT------PDGKIDPQ--IVEML-------NETNE---ILDDMTVIEANGFTEHKT 55 (331) Q Consensus 1 M~~l------~~~a~TL~E~A-k~~~------~~~~~~~~--VIE~l-------~~~s~---iL~~lpf~e~n~~~~~~~ 55 (331) ||-- |-..-...|.. |.+. ++++...+ =.|-| +..+. 
++.+++-..+ ..|-|+| T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a-~stv~~y 79 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA-TSTVAKY 79 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchh-hhhhhhh Confidence 5531 11111122221 2211 11111000 01111 11111 1333332222 2345777 Q ss_pred EEEeecCCc---ceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 56 TVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFY 132 (331) Q Consensus 56 ~~~~~lP~~---~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iy 132 (331) .+...--+. .|-.=-.-.+-+.+++.+++..++.|+.--+|--+.-..++ ..+-.+.|.+..|..+.++++..+|| T Consensus 80 ~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~-i~d~~~~~~~~ai~~~a~tiE~a~Fy 158 (467) T protein:vir:80 80 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNN-IQDPMQILTDDAIVNIAKTIEWASFF 158 (467) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcc-hhhHHHHHHHHHHHHHHHHHHHHhhh Confidence 766655543 34222223455678999999999999998887766554443 55556888899999999999999999 Q ss_pred CCCCCChh-------hhcCchhhhccccccccceeeccCCCCCCceEEE----EEEeCCCcEEEEc-cCCCccceeeccc Q lcl|Aclame:pro 133 GDSSIDAE-------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAGLQSRDL 200 (331) Q Consensus 133 GD~~~~~~-------~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~----~V~~g~~~~~giy-pkg~~aGl~~~d~ 200 (331) ||+...+. +||||.+-. 
++.|+||+.|..-...=|+ +++-|-+...=+| |-|.++-|+-.-+ T Consensus 159 Gds~l~~s~~~~~glqfDGi~~li------~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L 232 (467) T protein:vir:80 159 GDSDLSDSPEPQAGLEFDGLAKLI------NQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQL 232 (467) T ss_pred cccccccCCCccccccccceeEEe------cCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhc Confidence 99987543 899998765 4579999999873322221 2233334444556 7777777766666 Q ss_pred ccee-eeccCCCeeE-EEEEE-EEeeeceEEeccccee-eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEE Q lcl|Aclame:pro 201 GEDT-LIDAAGGRYQ-GYRTH-YKWDIGLTLRDWRYVV-RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAF 276 (331) Q Consensus 201 g~~~-~~d~~g~~~~-~~~t~-~~w~~Gl~v~d~r~v~-RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~ 276 (331) ..+. +.-+|+...+ |+... |..--|. |.-.+++. +=.|| |..+....|+. T Consensus 233 ~~q~~v~~~n~~~~~~G~~v~g~~sa~G~-I~l~gs~il~~~~~------------------l~~~~~~~~~A------- 286 (467) T protein:vir:80 233 SKQTQLVRDNGNNVSVGFNIQGFHSARGF-IKLHGSTVMENEQI------------------LDERILALPTA------- 286 (467) T ss_pred CceEEEEcCCCCceeeeecccceecceee-eeecCceeeccccC------------------CCccccccccc------- Confidence 6533 3333443332 21110 1011110 00011110 00010 00000001100 Q ss_pred EeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe------EEEEEEeccCCCcccC Q lcl|Aclame:pro 277 YMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI------PCRRTDALLLTEARVV 331 (331) Q Consensus 277 y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv------pir~~dai~~tE~~Vv 331 (331) -.+.+ ++.+-..++.-..-.|. .|+.||+ ..|++.. 
T Consensus 287 psp~~-----------------vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~--~GES~pS 328 (467) T protein:vir:80 287 PQPAK-----------------VTATQEAGKKGQFRAEDLAAHEYKVVVSSD--DAESIAS 328 (467) T ss_pred ccCCc-----------------cceeeecccCCcccCCCcceEEEEEEEECC--CCccccc Confidence 00000 00000011110011111 2233333 3455543 No 121 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=95.56 E-value=0.00014 Score=41.78 Aligned_cols=278 Identities=20% Similarity=0.206 Sum_probs=125.9 Q ss_pred CCcC-------ccccccHHHHHHh-c------CCccchhHH--HHHHH-------hccch---hHhhcceecccCCccce Q lcl|Aclame:pro 1 MPTL-------STTNPTLADVAAR-M------TPDGKIDPQ--IVEML-------NETNE---ILDDMTVIEANGFTEHK 54 (331) Q Consensus 1 M~~l-------~~~a~TL~E~Ak~-~------~~~~~~~~~--VIE~l-------~~~s~---iL~~lpf~e~n~~~~~~ 54 (331) ||-- |...=...|.+++ + +++++...+ =.|-| +..+. ++.+++-..+ ..|-|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a-~stv~~ 79 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPA-TSTVAK 79 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchh-hhhhhh Confidence 4420 1111122232211 1 111111000 01111 11111 2333332222 234577 Q ss_pred eEEEeecCCc---ceeecCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 55 TTVRSGLPTG---TWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLF 131 (331) Q Consensus 55 ~~~~~~lP~~---~fR~lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~i 131 (331) |.+...--+. 
.|-.=-.-.+-+.+++.+++..++.|+.--+|--+.-..++ ..+-.+.|.+..|..+.++++..+| T Consensus 80 y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~-i~d~~~~~~~~ai~~~a~tiE~a~F 158 (468) T protein:vir:63 80 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNN-IQDPMQILTDDAIVNIAKTIEWASF 158 (468) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcc-hhhHHHHHHHHHHHHHHHHHHHHhh Confidence 7766655443 34222223455678999999999999998887766554443 5555688889999999999999999 Q ss_pred cCCCCCChh-------hhcCchhhhccccccccceeeccCCCCCCceEEE----EEEeCCCcEEEEc-cCCCccceeecc Q lcl|Aclame:pro 132 YGDSSIDAE-------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAGLQSRD 199 (331) Q Consensus 132 yGD~~~~~~-------~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~----~V~~g~~~~~giy-pkg~~aGl~~~d 199 (331) |||+...+. +||||.+-. ++.|+||+.|..-...=|+ +++-|-+...=+| |-|.++-|+-.- T Consensus 159 yGds~l~~s~~~~~glqfDGi~~li------~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~ 232 (468) T protein:vir:63 159 FGDSDLSDSPEPQAGLEFDGLAKLI------NQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQ 232 (468) T ss_pred hcccccccCCCccccccccceeEEe------cCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhh Confidence 999987543 899998765 4579999999873322221 2233334444556 777777776666 Q ss_pred cccee-eeccCCCeeE-EEEEE-EEeeeceEEeccccee-eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceE Q lcl|Aclame:pro 200 LGEDT-LIDAAGGRYQ-GYRTH-YKWDIGLTLRDWRYVV-RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPA 275 (331) Q Consensus 200 ~g~~~-~~d~~g~~~~-~~~t~-~~w~~Gl~v~d~r~v~-RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~ 275 (331) +..+. +.-+|+...+ |+... |..--|. |.-.+++. +=.|| |..+....|+. 
T Consensus 233 L~~q~~v~~~n~~~~~~G~~v~g~~sa~G~-I~l~gs~il~~~~~------------------l~~~~~~~~~A------ 287 (468) T protein:vir:63 233 LSKQTQLVRDNGNNVSVGFNIQGFHSARGF-IKLHGSTVMENEQI------------------LDERILALPTA------ 287 (468) T ss_pred cCceEEEEcCCCCceeeeecccceecceee-eeecCceeeccccC------------------CCccccccccc------ Confidence 66533 3333443332 21110 1011110 00011110 00011 00000001100 Q ss_pred EEeCHHHHHHHHHHhhcCCcceeeeecccCCceeEEEcCe------EEEEEEeccCCCcccC Q lcl|Aclame:pro 276 FYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI------PCRRTDALLLTEARVV 331 (331) Q Consensus 276 ~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gv------pir~~dai~~tE~~Vv 331 (331) -.+.+ ++.+-..++.-..-.|. .|+.||+ ..|++.. T Consensus 288 -psp~~-----------------vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~--~GES~pS 329 (468) T protein:vir:63 288 -PQPAK-----------------VTATQEAGKKGQFRAEDLAAHEYKVVVSSD--DAESIAS 329 (468) T ss_pred -ccCCc-----------------cceeeecccCCcccCCCcceEEEEEEEECC--CCccccc Confidence 00000 00000011110011111 2233333 3455543 No 122 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=95.38 E-value=0.0011 Score=36.73 Aligned_cols=245 Identities=11% Similarity=0.048 Sum_probs=123.7 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccch----hHhhcceecccCCccceeEEEeecCCcceeecCCcc--- Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNE----ILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGV--- 73 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~----iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~lN~g~--- 73 (331) |++..++..| -+. .+.. ..+.+++.+-+| +|..+-=..+ ..+-|.+.. 
.+|-.+.=..-.||- T Consensus 1 ma~~~~~~~t-~~~---~g~~----~dl~~~I~~isp~dTPf~S~i~~~~a-~~~~~~W~~-d~l~~~~~~~~~EG~da~ 70 (317) T protein:vir:88 1 MATPTNAVST-VEI---NGKR----EDLIDIIYNIAPYDTPFMSAIGKGVA-TAITHEWQT-DELRQPGKNTRVEGEDAT 70 (317) T ss_pred CCccccceEe-eee---eeee----echhhhheecCCccCcceeeecCcee-cccEEEEEe-eecCCccccccccCcccc Confidence 8876555555 221 2221 124444444444 3332221122 122344432 445444333333442 Q ss_pred CcccceEEEEEEEEEEecchhhhhHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHhhccCCC------CCChhhhcCch Q lcl|Aclame:pro 74 QPEKSRTVQVKDSMGMLETYAEVDKALADLNGN-SAAWRLSEDRAFIEGMNQTQATTLFYGDS------SIDAEKFMGLT 146 (331) Q Consensus 74 ~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn-~~~~ra~q~~~~ika~~~~f~~~~iyGD~------~~~~~~F~GL~ 146 (331) ......+....-.|-|+.-.+.|-.-.....-. ..+..+.|.+.+++.+...++..||+|.- +..|..+.||. T Consensus 71 ~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~ 150 (317) T protein:vir:88 71 IKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIF 150 (317) T ss_pred cccccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHH Confidence 222245667777788888888887755444221 34667889999999999999999999963 23356777775 Q ss_pred hhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeece Q lcl|Aclame:pro 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) Q Consensus 147 ~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl 226 (331) .-+. ..+++.++|..+. 
+....++ .+..+.+ T Consensus 151 ~~i~------t~~~~~~~g~~~~---------~~~~~~~--------------------t~~t~~~-------------- 181 (317) T protein:vir:88 151 AYYK------TNGSLGANGVAPV---------GDGSNTG--------------------TAGDLRL-------------- 181 (317) T ss_pred HHhc------cCceeccCccccc---------cCCCccc--------------------ccccccc-------------- Confidence 5432 1122222211100 0000000 0000000 Q ss_pred EEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCccee-eeecccC Q lcl|Aclame:pro 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAAST-LTMEEIA 305 (331) Q Consensus 227 ~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~-~~~~~~~ 305 (331) |+ .+++.+++.+|=..+.++..||||-+.+..|.....+.....+ ...+... T Consensus 182 ------------------lt---------e~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~ 234 (317) T protein:vir:88 182 ------------------LT---------EDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRI 234 (317) T ss_pred ------------------cc---------HHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEE Confidence 11 2234444444444566778899999999999876654332221 1122222 Q ss_pred C----ceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 306 G----KKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 306 g----~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) | ..++.|.-|.|+-.=.|-..+.-++ T Consensus 235 g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~ 264 (317) T protein:vir:88 235 AQTVDVYESDFGKYTIRANRWFHENTLFVF 264 (317) T ss_pred EEEEEEEEeCCeEEEEEeCCCCCCCeEEEE Confidence 2 3445666666665555544444444 No 123 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=94.32 E-value=0.0031 Score=34.37 Aligned_cols=252 Identities=14% Similarity=0.099 Sum_probs=121.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcceecccCCccceeEEEeecCCcceee------------ Q lcl|Aclame:pro 1 
MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK------------ 68 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~e~n~~~~~~~~~~~~lP~~~fR~------------ 68 (331) +..|.-..+|+.++-+.+-+.. .....++...+.++||+.+.++..-+.... ++..++-. T Consensus 15 ~~~i~k~~it~~~l~~g~L~p~-~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~e-------i~kig~G~r~~r~~~e~~~~ 86 (360) T protein:vir:99 15 MNSLSQKDIGLAELDGFQLPVD-VTEEFLERMQKGVQILGMADTMTLARLEME-------VPQFGVPRLSGHTRDEEGSR 86 (360) T ss_pred HHHHHhhhccccccCceeecHH-HHHHHHHHHhhccchhhhcceeeccccccc-------ccccccceeeccccccCCCC Confidence 4444444466666554443333 235678888899999999999764333222 22222211 Q ss_pred -cCCccCcccceEE---EEEEEEEEecchhhhhHHHHh-hCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCC----- Q lcl|Aclame:pro 69 -LNYGVQPEKSRTV---QVKDSMGMLETYAEVDKALAD-LNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSID----- 138 (331) Q Consensus 69 -lN~g~~~s~~t~~---~~~~~l~ilgg~~eVDr~la~-~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~----- 138 (331) =+.+.+.+...+. .+...+.+ +...+.+ .+-...++--.-+.++.++.+..++...|+||+++. T Consensus 87 ~~~~~~~~~~v~~~~~~~~~~~~~i------~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~ 160 (360) T protein:vir:99 87 TENSEAESGSVKFNATDKSYYILVE------PKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSI 160 (360) T ss_pred CcCCcCccccCccccccceeeEeec------hHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccC Confidence 1122222222221 22222222 2333222 111111222223477899999999999999997642 Q ss_pred ----h--hhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCe Q lcl|Aclame:pro 139 ----A--EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGR 212 (331) Q Consensus 139 ----~--~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~ 212 (331) | +..+|.-|.. .+..+.||..|-... +..+|+ . 
T Consensus 161 ~~~d~fl~~~dGwlKka-----~~~~~~id~a~d~t~-------------------------~~~~~~-----------~ 199 (360) T protein:vir:99 161 GGAAELDNTFKGWIARA-----EGDAQSVDDAGDSTR-------------------------IGLEDT-----------A 199 (360) T ss_pred cccchhhhhhHHHHHHh-----hcccchhhccccccc-------------------------cccccc-----------c Confidence 1 3345554433 112233332221100 001111 0 Q ss_pred eEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCC----ceEEEeCHHHHHHHHH Q lcl|Aclame:pro 213 YQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMG----RPAFYMPRKIRSFLRR 288 (331) Q Consensus 213 ~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g----~~~~y~n~~v~~~L~~ 288 (331) + .+--+..-++|-+.+ .+. .+....|..+++..||..-+. ..+|||+......-+. T Consensus 200 ~---------------~~~~~~~~~~~~~g~---~~~--~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~ 259 (360) T protein:vir:99 200 T---------------ADADSMPSIANTDGS---GNP--QPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTM 259 (360) T ss_pred c---------------cccccchhhhccccc---ccc--ccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHH Confidence 0 001112223343211 111 122344455666667744432 4489999888777777 Q ss_pred HhhcCCcceeeeecccCCceeEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 289 QITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 289 ~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~dai~~tE~~Vv 331 (331) +..++.-. +-..-..|.-...+.|+||..|..+-.. .|. 
T Consensus 260 ~L~~R~t~--LGd~~l~g~~~~~~~Gipi~~v~~~pd~--~~m 298 (360) T protein:vir:99 260 SLTEREDP--LGSAVIFGDSDITPFSYDLVGVNGFPDE--YMM 298 (360) T ss_pred HHhccCcc--cchhheecccccccceeeeEEcCCCCCC--ceE Confidence 77666521 1112223344466889999999988532 233 No 124 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=93.92 E-value=0.00012 Score=42.01 Aligned_cols=219 Identities=14% Similarity=0.175 Sum_probs=77.5 Q ss_pred ecCCccCcccceEEEEE---EEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHHH---HHHhh---ccCCCCCC Q lcl|Aclame:pro 68 KLNYGVQPEKSRTVQVK---DSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQT---QATTL---FYGDSSID 138 (331) Q Consensus 68 ~lN~g~~~s~~t~~~~~---~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~~---f~~~~---iyGD~~~~ 138 (331) -+---.|+-...+...+ ..|+|- +.--++ -+++..+ ...++.++.+.++ +..-- +.|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~-alTLae--a~~l~~d------~~~~~VIE~l~~~s~iL~~lpf~~ve~~~--- 68 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMP-TVTLAE--SAKLSQD------HLVSGLIETIVEVNPLYEMMPFTEIEGNA--- 68 (330) T ss_pred CceecCCccccceeehhccccccchh-hhhhhH--HhhcCch------hhHHHHHHhhhccchHHhhcccccccCCc--- Confidence 11111122223222222 222222 111111 1111111 1123344444432 11110 12211 Q ss_pred hhhhcCchhhhccccccccceeec--cCCCCCCceEEEEEEeCCCcEEE-------------EccC--CCcc---ceeec Q lcl|Aclame:pro 139 AEKFMGLTPRFNSLSAENGQNIID--AGGTGSDNASIWLTVWGPNTLHT-------------IYPK--GSQA---GLQSR 198 (331) Q Consensus 139 ~~~F~GL~~R~~~~~~~~~~~vid--aGgtG~~~tSI~~V~~g~~~~~g-------------iypk--g~~a---Gl~~~ 198 (331) .+|+-.....+....+ .|-+.+. +..+++.-+. ++. +|+. 
...+ .+-++ T Consensus 69 --------~~~~r~~~lp~a~~r~~n~~~~~~~--~~Tf~q~t~~-l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ie 137 (330) T protein:vir:94 69 --------LAYNRENVLGDVQFLAVGGTITAKN--PATFTKVTSE-LTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAK 137 (330) T ss_pred --------ceeeeeecCCcceeeeccccccccC--cceeeeeeec-hhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHH Confidence 1222111111112222 2212211 1112222111 000 1100 0000 00000 Q ss_pred cccc----eeee-ccCCCeeEEEEEEEEeeeceEEecccceeeeeccccccCCC-Ccc-chhhHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 199 DLGE----DTLI-DAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTK-NAS-AGADLIDLMTQAVELIPNVGM 271 (331) Q Consensus 199 d~g~----~~~~-d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~-~~~-~~~~l~~lm~~a~~~ip~~~~ 271 (331) -+++ .-++ |..++.|.|+-.++ +-+...+ +++ +..+ .+.|-+.+.+++.... T Consensus 138 al~~~~e~~linGDs~~~~F~GL~~~~--------------------~~~q~i~tg~~gg~~T-~d~LDeLl~~v~~~~g 196 (330) T protein:vir:94 138 SIGRQYQASMITGDGTGNSFQGMMGLV--------------------AASQTISAGANGGTLT-FELLDQLLDLVKDKDG 196 (330) T ss_pred HHHHHHHHHhhccCCCCccccchhhcC--------------------CcccEEecCCCCCCCC-HHHHHHHHHHhcCCCC Confidence 0000 0111 11123344432222 1111111 111 1122 3446666667766566 Q ss_pred CceEEEeCHHHHHHHHHHhh--cCCcceeeeecccCCceeEEEcCeEEEEEEeccCCCccc--------C Q lcl|Aclame:pro 272 GRPAFYMPRKIRSFLRRQIT--NKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARV--------V 331 (331) Q Consensus 272 g~~~~y~n~~v~~~L~~~~~--~~~~~~~~~~~~~~g~~v~~~~gvpir~~dai~~tE~~V--------v 331 (331) .+.+|+||+.....++.-.- ++..+. ....+..|++|..|+||||..||-|-.+|... . 
T Consensus 197 ~~~~~l~n~a~~r~I~a~~R~~~~~~v~-~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIy 265 (330) T protein:vir:94 197 QVDYLMSSFAMRRKYFSLLRALGGAAIG-EVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIF 265 (330) T ss_pred CCcEEEechhHHHHHHHHHHhccCCCCC-CcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEE Confidence 78999999775555542221 222221 24567899999999999999999887665431 1 No 125 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=93.07 E-value=0.0089 Score=31.83 Aligned_cols=213 Identities=10% Similarity=0.072 Sum_probs=110.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcce---ecccCCccceeEEEeecCCcceeecCCccCccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTV---IEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf---~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~ 77 (331) ||. +.=.|.++ +.+. -..+.|.|.+.+..-+....+- .++..|..-.......++++.+..=++++++++ T Consensus 1 ma~---~~T~~~~~---iiPe-v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~ 73 (274) T protein:vir:93 1 MPQ---GITKTSNQ---IIPE-VLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCc---cceehhhe---echH-HHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccc Confidence 664 33344443 2222 1223345554332211111111 123222223344445567888887789999999 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhhC-CCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~~-gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) -++.+.+..+.-.+..+.|+...+.+. +++ .....++...+++.++...++.. +. 
T Consensus 74 it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~~~~~--------~~------------- 129 (274) T protein:vir:93 74 LETKKREAKIRKIAKGTSITDEALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEA--------LM------------- 129 (274) T ss_pred cccceeEEEeeeecccccccHHHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHH--------Hh------------- Confidence 999999999988887787776554444 444 34444556677777766554411 00 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) + + .++ T Consensus 130 ~-------a---~~~----------------------------------------------------------------- 134 (274) T protein:vir:93 130 G-------A---KLT----------------------------------------------------------------- 134 (274) T ss_pred c-------c---ccc----------------------------------------------------------------- Confidence 0 0 000 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc-CCceeEEEcCe Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-AGKKVVAFDGI 315 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~-~g~~v~~~~gv 315 (331) .+. ++.+ .+.+.+|+.++-.......+++||..+...|++.....--...-..+.. 
..-.+-.+.|+ T Consensus 135 ---~~~-----~~~~----~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:93 135 ---VNA-----DITK----LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) T ss_pred ---ccc-----cccC----HHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCe Confidence 000 0000 1234555555544344557999999999999864322110000000110 00124578999 Q ss_pred EEEEEEeccCCCcccC Q lcl|Aclame:pro 316 PCRRTDALLLTEARVV 331 (331) Q Consensus 316 pir~~dai~~tE~~Vv 331 (331) ||..+|.+-....-++ T Consensus 203 ~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:93 203 IIVRTNKLEAGTAILA 218 (274) T ss_pred eEEEcCCCCcceEEEE Confidence 9999999865554444 No 126 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=89.87 E-value=0.024 Score=29.47 Aligned_cols=211 Identities=12% Similarity=0.088 Sum_probs=105.7 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhccee----cccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~----e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 76 (331) ||. ..-+|.|+- .+. -..+-|.|.|.+. -++..+.-. 
++..|..-.......++++....=+.+++++ T Consensus 1 ma~---~~T~~~d~i---~Pe-v~s~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MAQ---GTTKVSNLI---VPE-VLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD 72 (274) T ss_pred CCc---cccchhhhh---hhH-HHHHHHHHHHHhh-hhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchh Confidence 664 333566652 221 1233455555332 222222111 2222222223333334667666667788888 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) +-++...+..+.-.+-.+.++-.-+.+ .+++ ...-.++...+++.++...++.- +. T Consensus 73 ~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~i~~~--------l~------------ 129 (274) T protein:vir:96 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDP---QGEAVRQHGLAIANKVDNDVLEA--------LK------------ 129 (274) T ss_pred hcccceeEEEEEeeeceeeecHHHHHhhcchH---HHHHHHHHHHHHHHHHHHHHHHH--------Hh------------ Confidence 888888888777766667776544333 3444 33334445666776666555421 00 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) |++- . 
T Consensus 130 --------~a~~--------------~----------------------------------------------------- 134 (274) T protein:vir:96 130 --------GATL--------------T----------------------------------------------------- 134 (274) T ss_pred --------cCCC--------------C----------------------------------------------------- Confidence 0000 0 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc--CCceeEEEc Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI--AGKKVVAFD 313 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~--~g~~v~~~~ 313 (331) .+ .++.+ .+.+.+|..++-...-...+++||..+...|+++....-....-..+.. .| .+-.+. T Consensus 135 ----~~-----~~~~~----~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g-~ig~~~ 200 (274) T protein:vir:96 135 ----VE-----ADITK----LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKG-AFGEAL 200 (274) T ss_pred ----cC-----ccccc----HHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeec-ccceec Confidence 00 00000 2334555555543333457899999999999875322111000000010 11 356789 Q ss_pred CeEEEEEEeccCCCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLTEARVV 331 (331) Q Consensus 314 gvpir~~dai~~tE~~Vv 331 (331) |++|..+|++-.+.+-++ T Consensus 201 G~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:96 201 GAVIVRSNKLNKGEALLA 218 (274) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999999865554444 No 127 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=81.88 E-value=0.082 Score=26.55 Aligned_cols=210 Identities=14% Similarity=0.105 Sum_probs=100.5 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcc---eecccCCccceeEEEeecCCcceeecCCccCccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMT---VIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lp---f~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~ 77 (331) 
|+. +.=+|+++ +.|. -..+-|.|.|.+..-+-...+ -.++..|..-....-..+..+.+..=+..+++++ T Consensus 1 ma~---~~T~~~d~---iiPe-v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~ 73 (272) T protein:vir:36 1 MSK---QKTTLADL---VNPE-VLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDK 73 (272) T ss_pred CCC---cceehhhh---hchH-HHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhh Confidence 653 34455554 2222 122335555543322211111 1223222222222223333455544456688888 Q ss_pred ceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) -++.+.+..++-.+-.+.|+-..+.+ .+|+- ..-.++...+++.++...++.. +. T Consensus 74 lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~i~~~--------l~------------- 129 (272) T protein:vir:36 74 IGTTTKSVTIKKAAKGTEITDEAALSGYGDPI---GESNKQLGLSLANKVDDDLLSA--------AK------------- 129 (272) T ss_pred cCCcceeEeeehhhccccccHHHHhhccchHH---HHHHHHHHHHHHHHHHHHHHHH--------hc------------- Confidence 88888888888888888887644444 34443 3333345556666665444310 00 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) |..++ T Consensus 130 ----------~~~~~----------------------------------------------------------------- 134 (272) T protein:vir:36 130 ----------TTSQT----------------------------------------------------------------- 134 (272) T ss_pred ----------ccccc----------------------------------------------------------------- Confidence 00000 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecc--cCCceeEEEcC Q lcl|Aclame:pro 237 
IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEE--IAGKKVVAFDG 314 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~--~~g~~v~~~~g 314 (331) + +. . .-.+.+.+|..++-...--..+++||..+...|+++........ ....+ ..| .+-.+.| T Consensus 135 ---~-----~~-~----~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~-~~~~~~~~~G-~ig~~~G 199 (272) T protein:vir:36 135 ---V-----ST-K----ANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGS-EVGANALING-TYADVLG 199 (272) T ss_pred ---c-----cc-c----ccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccc-cccccceeee-ccceecC Confidence 0 00 0 00223445555554333345799999999999985422111000 00000 011 2457899 Q ss_pred eEEEEEEeccCCCcc---cC Q lcl|Aclame:pro 315 IPCRRTDALLLTEAR---VV 331 (331) Q Consensus 315 vpir~~dai~~tE~~---Vv 331 (331) ++|..+|++-.+-.. ++ T Consensus 200 ~~Vv~s~~~p~~~~~~~~~~ 219 (272) T protein:vir:36 200 AQIVRSKKLAEGSALMFKIV 219 (272) T ss_pred eeEEEeCCCCCCceeEEEEE Confidence 999999998543221 22 No 128 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=79.30 E-value=0.11 Score=25.93 Aligned_cols=218 Identities=16% Similarity=0.096 Sum_probs=82.2 Q ss_pred CCcCcc-c------cccHHHHHHhcCCc---------------------cchhHH----HHHHHhccchhHhhcceeccc Q lcl|Aclame:pro 1 MPTLST-T------NPTLADVAARMTPD---------------------GKIDPQ----IVEMLNETNEILDDMTVIEAN 48 (331) Q Consensus 1 M~~l~~-~------a~TL~E~Ak~~~~~---------------------~~~~~~----VIE~l~~~s~iL~~lpf~e~n 48 (331) |+.-.. . .-.+.+........ +.+-+. +++...+.+++...++.. 
T Consensus 171 ~~~~~~~~~~~~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 247 (480) T protein:vir:40 171 REASIPSEKPEDAERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLA--- 247 (480) T ss_pred hhhhccccchhhhhhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcceee--- Confidence 111000 0 00000100000000 000000 000000001111111110 Q ss_pred CCccceeEEEeecCCcceee----cCCccCcccceEEEEEEEEEEecchhhhhHHHHhhCCCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 49 GFTEHKTTVRSGLPTGTWRK----LNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQ 124 (331) Q Consensus 49 ~~~~~~~~~~~~lP~~~fR~----lN~g~~~s~~t~~~~~~~l~ilgg~~eVDr~la~~~gn~~~~ra~q~~~~ika~~~ 124 (331) ..+.....|.. -++.-+++. .......-.+.+....+.++..++--+..++...-.....++++. T Consensus 248 ---------~~g~~~~~~~~e~~~~~~~~~~~~--~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~ 316 (480) T protein:vir:40 248 ---------EDGVDDTFISGTFKAGTDKNKSQT--ATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNRVIQ 316 (480) T ss_pred ---------eccccceeeeeeeecccccccccc--cccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 00111112221 111111111 111111111234555566666555444333444344456788999 Q ss_pred HHHHhhccCCCCCChhhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeecccccee Q lcl|Aclame:pro 125 TQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDT 204 (331) Q Consensus 125 ~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~ 204 (331) +-+.+|++||.. ..++|.|+.. 
.+++ T Consensus 317 ~ee~a~l~G~g~-g~~~~~g~~~----------------~~~~------------------------------------- 342 (480) T protein:vir:40 317 KVEYNMILGSVD-GSNGFYGLKT----------------ATDG------------------------------------- 342 (480) T ss_pred HHHHHhhccCCC-Ccccccccee----------------eccc------------------------------------- Confidence 999999999621 1122222200 0000 Q ss_pred eeccCCCeeEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHH Q lcl|Aclame:pro 205 LIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRS 284 (331) Q Consensus 205 ~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~ 284 (331) ++. ..++.++++.|+.|+..-- ..+.+.|+||++... T Consensus 343 ---------------------~~~--------------------~~~~~d~id~L~~al~~~y--~~~a~~~vmn~~t~~ 379 (480) T protein:vir:40 343 ---------------------WTK--------------------QIEYTDLFEGITDAVAECS--ISDAITIVMSPQTFA 379 (480) T ss_pred ---------------------ccc--------------------cchhHHHHHHHHHhhhHHh--hCCCCEEEECHHHHH Confidence 000 0011345555555554221 233457999999999 Q ss_pred HHHHHhhcCCcceeeeecccCCceeEEEcCeEEEEEEec-cCCCcccC Q lcl|Aclame:pro 285 FLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDAL-LLTEARVV 331 (331) Q Consensus 285 ~L~~~~~~~~~~~~~~~~~~~g~~v~~~~gvpir~~dai-~~tE~~Vv 331 (331) .|++. ++..+.. +-..-.....+-...|.||..++.. ...+..|. 
T Consensus 380 ~I~kl-KD~~G~Y-i~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~ 425 (480) T protein:vir:40 380 ELRKA-KGTDGHS-RFNELATKEQIAQSFGAVNLETRVWMPKDEVAVY 425 (480) T ss_pred HHHHh-hcCCCCe-eccCcccccCcceecccceeeeeccccCCcceee Confidence 99986 3444433 2222333344455668998766543 33444444 No 129 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=76.76 E-value=0.13 Score=25.40 Aligned_cols=212 Identities=11% Similarity=0.080 Sum_probs=107.4 Q ss_pred CCcCcccc-ccHHHHHHhcCCccchhHHHHHHHhccchhHhhccee----cccCCccceeEEEeecCCcceeecCCccCc Q lcl|Aclame:pro 1 MPTLSTTN-PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKTTVRSGLPTGTWRKLNYGVQP 75 (331) Q Consensus 1 M~~l~~~a-~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~----e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~ 75 (331) |+ +.+ =+|+++ +.+. -..+-|.|.+.+..-+ ..+--+ ++..|..-....-..++.+....=+..+++ T Consensus 1 ~~---~~~~T~l~d~---i~PE-v~~~~v~~~~~~~~~~-~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~ 72 (275) T protein:vir:96 1 MA---LENMTKLANM---VNPE-VLAPMMQAELDKKLKF-AQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPI 72 (275) T ss_pred CC---Ccccchhhhh---hchH-HHHHHHHHHHHHhhhh-cccceecccccCCCCCEEEeeeeccCCccccccCCCCcch Confidence 44 322 345554 3232 1234466666543333 222211 222121112222233456666666678888 Q ss_pred ccceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccc Q lcl|Aclame:pro 76 EKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) Q Consensus 76 s~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~ 154 (331) .+-++.+.+..+.-.+-.+.++-.-+.+ .+|+-...+ ++.-.+++.++...++. .+. 
T Consensus 73 ~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~---~~~a~~~a~~~d~~ll~--------~l~----------- 130 (275) T protein:vir:96 73 DLIETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAV---RQHGLAIANKVDNDVLE--------ALQ----------- 130 (275) T ss_pred hhcccceeeEEeehhcccccccHHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHH--------HHh----------- Confidence 8888888888887778788887654444 455543333 33555566655554430 000 Q ss_pred cccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccce Q lcl|Aclame:pro 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) Q Consensus 155 ~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v 234 (331) + ++. . T Consensus 131 --~-------a~~-----------------------------------------------------------~------- 135 (275) T protein:vir:96 131 --G-------ATL-----------------------------------------------------------K------- 135 (275) T ss_pred --c-------ccc-----------------------------------------------------------c------- Confidence 0 000 0 Q ss_pred eeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeeccc-CCceeEEEc Q lcl|Aclame:pro 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-AGKKVVAFD 313 (331) Q Consensus 235 ~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~-~g~~v~~~~ 313 (331) ++.+ ..+ .+.+.+|..++-.......+++||..+...|+++....--......++. .--.+-.+. 
T Consensus 136 -----~~~~-----~~~----~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~ 201 (275) T protein:vir:96 136 -----VEAD-----ITK----LAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEAL 201 (275) T ss_pred -----cccc-----ccC----HHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceec Confidence 0000 000 3345666666643333457899999999999876422110000001111 011355789 Q ss_pred CeEEEEEEeccCCCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLTEARVV 331 (331) Q Consensus 314 gvpir~~dai~~tE~~Vv 331 (331) |++|.++|.+-...+-++ T Consensus 202 G~~Vi~s~~~p~~t~~i~ 219 (275) T protein:vir:96 202 GAIIVRSNKIKEGEAILA 219 (275) T ss_pred CeeEEEeCCCCcceEEEE Confidence 999999999876666555 No 130 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=76.20 E-value=0.14 Score=25.30 Aligned_cols=210 Identities=11% Similarity=0.086 Sum_probs=102.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhccee----cccCCccceeEE--EeecCCcceeecCCccC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKTTV--RSGLPTGTWRKLNYGVQ 74 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~----e~n~~~~~~~~~--~~~lP~~~fR~lN~g~~ 74 (331) |+. ..=+|.|+ +.+. -..+.|.|.|.+. -++..+.-. ++.. 
|....+ -..++.+.--.=+.+++ T Consensus 1 ma~---~~T~l~d~---iiPe-v~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~--G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:12 1 MAQ---GLTKTSNQ---IIPE-VLAPMMQAQLEKK-LRFASFAEVDSTLQGQP--GDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CCc---ceeehhhh---hchH-HHHHHHHHHHHhh-hhhcccceecccccCCC--CCEEEEeeecCCCccccccCCCccc Confidence 654 33355554 3232 1223344444321 111121111 2221 222222 12234555444456788 Q ss_pred cccceEEEEEEEEEEecchhhhhH-HHHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc Q lcl|Aclame:pro 75 PEKSRTVQVKDSMGMLETYAEVDK-ALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) Q Consensus 75 ~s~~t~~~~~~~l~ilgg~~eVDr-~la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~ 153 (331) +++-++.+.+..+.-.+-.++|+- +.+.-.+|+-... .++...+++.++...++.- +. T Consensus 71 ~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~---~~q~~~~~a~~vd~~~l~~--------~~---------- 129 (274) T protein:vir:12 71 TDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQ---VRQHGLAHANKVDNDVLEA--------LM---------- 129 (274) T ss_pred hhhcccceeeEEeeeecceeeecHHHHHhcccchHHHH---HHHHHHHHHHHHHHHHHHH--------Hh---------- Confidence 888888888888877787888844 5555556654333 3334556666555444310 00 Q ss_pred ccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 154 ~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) + ++. . 
T Consensus 130 ---~-------a~~-----------------------------------------------------------~------ 134 (274) T protein:vir:12 130 ---G-------AKL-----------------------------------------------------------T------ 134 (274) T ss_pred ---c-------ccc-----------------------------------------------------------c------ Confidence 0 000 0 Q ss_pred eeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcC-CcceeeeecccCCceeEEE Q lcl|Aclame:pro 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNK-VAASTLTMEEIAGKKVVAF 312 (331) Q Consensus 234 v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~-~~~~~~~~~~~~g~~v~~~ 312 (331) ++. ++ .-.+.+.+|..++-.......+++||..+...|+++.... .........-...-.+-.+ T Consensus 135 ------~~~-----~a----~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:12 135 ------VNA-----DI----TKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred ------ccc-----cc----cCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceee Confidence 000 00 0134466677666543345578999999999998753211 1100000000001124468 Q ss_pred cCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 313 DGIPCRRTDALLLTEARVV 331 (331) Q Consensus 313 ~gvpir~~dai~~tE~~Vv 331 (331) .|++|..+|.+-...+-++ T Consensus 200 ~G~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:12 200 LGAIIVRSNKLEAGTAILA 218 (274) T ss_pred cCeeEEEeCCCCcceEEEE Confidence 9999999998866555444 No 131 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=73.39 E-value=0.17 Score=24.79 Aligned_cols=211 Identities=11% Similarity=0.071 Sum_probs=102.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhc-cee---cccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDM-TVI---EANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~l-pf~---e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 
76 (331) ||. +.=+|.|+ +.+. -..+-|.|.+.+. -++..+ ..+ ++..|..-.......+..+.-..=+.++++. T Consensus 1 ma~---~~T~~~d~---iiPe-v~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:97 1 MPQ---GLTKTSDQ---IIPE-VLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCc---cceehhhe---echH-HHHHHHHHhhhhh-hhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc Confidence 664 33355554 3222 1223344444321 111111 111 2222211122222334455545556778888 Q ss_pred cceEEEEEEEEEEecchhhhhHH-HHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKA-LADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~-la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) +-++.+.+..+.-.+-.++|+-. .+.-.+|+- ....++...+++.++...++.- +. T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~---~~~~~~~a~a~a~~vd~~~~~~---------------l~----- 129 (274) T protein:vir:97 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA---------------LM----- 129 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHH---HHHHHHHHHHHHHHHHHHHHHH---------------Hh----- Confidence 88888888888777756666553 344445543 3344445566666666544410 00 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) +++. 
.+ T Consensus 130 ---------~a~~---------------------------~~-------------------------------------- 135 (274) T protein:vir:97 130 ---------GAKL---------------------------TV-------------------------------------- 135 (274) T ss_pred ---------ccCc---------------------------cc-------------------------------------- Confidence 0000 00 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCC-cceeeeeccc-CCceeEEEc Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV-AASTLTMEEI-AGKKVVAFD 313 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~-~~~~~~~~~~-~g~~v~~~~ 313 (331) +.+ +.+ .+.+.+|..++-.......+++||..+...|+++....- ..... .+.. ..-.+-.+. T Consensus 136 -----~~~-----~~~----~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~ 200 (274) T protein:vir:97 136 -----NAD-----ITK----LNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEAL 200 (274) T ss_pred -----ccc-----ccC----HHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcc-cccceeccccceec Confidence 000 000 233555666655444456899999999999986532211 00000 0000 001245789 Q ss_pred CeEEEEEEeccCCCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLTEARVV 331 (331) Q Consensus 314 gvpir~~dai~~tE~~Vv 331 (331) |++|..+|.+-....-++ T Consensus 201 G~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:97 201 GAIIVRTNKLEAGTAILA 218 (274) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999998866555544 No 132 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=73.39 E-value=0.17 Score=24.79 Aligned_cols=211 Identities=11% Similarity=0.071 Sum_probs=102.3 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhc-cee---cccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDM-TVI---EANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~l-pf~---e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 76 
(331) ||. +.=+|.|+ +.+. -..+-|.|.+.+. -++..+ ..+ ++..|..-.......+..+.-..=+.++++. T Consensus 1 ma~---~~T~~~d~---iiPe-v~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:94 1 MPQ---GLTKTSDQ---IIPE-VLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCc---cceehhhe---echH-HHHHHHHHhhhhh-hhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc Confidence 664 33355554 3222 1223344444321 111111 111 2222211122222334455545556778888 Q ss_pred cceEEEEEEEEEEecchhhhhHH-HHhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKA-LADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~-la~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) +-++.+.+..+.-.+-.++|+-. .+.-.+|+- ....++...+++.++...++.- +. T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~---~~~~~~~a~a~a~~vd~~~~~~---------------l~----- 129 (274) T protein:vir:94 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQ---GEQVRQHGLAHANKVDNDVLEA---------------LM----- 129 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHH---HHHHHHHHHHHHHHHHHHHHHH---------------Hh----- Confidence 88888888888777756666553 344445543 3344445566666666544410 00 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) +++. 
.+ T Consensus 130 ---------~a~~---------------------------~~-------------------------------------- 135 (274) T protein:vir:94 130 ---------GAKL---------------------------TV-------------------------------------- 135 (274) T ss_pred ---------ccCc---------------------------cc-------------------------------------- Confidence 0000 00 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCC-cceeeeeccc-CCceeEEEc Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV-AASTLTMEEI-AGKKVVAFD 313 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~-~~~~~~~~~~-~g~~v~~~~ 313 (331) +.+ +.+ .+.+.+|..++-.......+++||..+...|+++....- ..... .+.. ..-.+-.+. T Consensus 136 -----~~~-----~~~----~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~ 200 (274) T protein:vir:94 136 -----NAD-----ITK----LNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEAL 200 (274) T ss_pred -----ccc-----ccC----HHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcc-cccceeccccceec Confidence 000 000 233555666655444456899999999999986532211 00000 0000 001245789 Q ss_pred CeEEEEEEeccCCCcccC Q lcl|Aclame:pro 314 GIPCRRTDALLLTEARVV 331 (331) Q Consensus 314 gvpir~~dai~~tE~~Vv 331 (331) |++|..+|.+-....-++ T Consensus 201 G~~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:94 201 GAIIVRTNKLEAGTAILA 218 (274) T ss_pred CeeEEEcCCCCcceEEEE Confidence 999999998866555544 No 133 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=68.96 E-value=0.23 Score=24.09 Aligned_cols=213 Identities=11% Similarity=0.065 Sum_probs=101.7 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcce---ecccCCccceeEEEeecCCcceeecCCccCccc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTV---IEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) Q Consensus 1 
M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf---~e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s~ 77 (331) |+. +.=+|.++ +.+. -..+.|.|.+.+.+-+-....- .++..|..-....-..+..+....=+..+++.+ T Consensus 1 Ma~---~~T~l~d~---i~Pe-v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~ 73 (276) T protein:vir:10 1 MAQ---GTTTKSTQ---IVPE-VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDK 73 (276) T ss_pred CCc---ceeehhhh---hchH-HHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccc Confidence 663 33366665 3232 2234566666443333222221 122211111111112333444433345667777 Q ss_pred ceEEEEEEEEEEecchhhhhHHH-HhhCCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhccccccc Q lcl|Aclame:pro 78 SRTVQVKDSMGMLETYAEVDKAL-ADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) Q Consensus 78 ~t~~~~~~~l~ilgg~~eVDr~l-a~~~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~ 156 (331) -++.+.+..+.-.+-.+.++-.- +...+|+-...+ ++.-.+++.++...++. .+. T Consensus 74 lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~---~~~~~~~a~~~d~~~~~--------~l~------------- 129 (276) T protein:vir:10 74 IETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAV---RQHGLAIANKVDNDVLE--------ALR------------- 129 (276) T ss_pred cccceeeEEeehccccccccHHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHH--------HHh------------- Confidence 77777777777777777776543 444466544444 34555666666544431 000 Q ss_pred cceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccceee Q lcl|Aclame:pro 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) Q Consensus 157 ~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~R 236 (331) +++.. 
T Consensus 130 -------~~~~~-------------------------------------------------------------------- 134 (276) T protein:vir:10 130 -------GTKLT-------------------------------------------------------------------- 134 (276) T ss_pred -------ccccc-------------------------------------------------------------------- Confidence 00000 Q ss_pred eeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecc-cCCceeEEEcCe Q lcl|Aclame:pro 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEE-IAGKKVVAFDGI 315 (331) Q Consensus 237 I~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~-~~g~~v~~~~gv 315 (331) +..+ ..+ .+.+.+|..++-...-...+++||..+...|+++....-....-..++ ...-.+-.+.|+ T Consensus 135 ---~~~~-----~~t----~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (276) T protein:vir:10 135 ---VSAD-----IGT----LAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGA 202 (276) T ss_pred ---cccc-----ccC----HHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecce Confidence 0000 000 233555666553222345789999999999986532221100000011 001124578999 Q ss_pred EEEEEEeccCCCcccC Q lcl|Aclame:pro 316 PCRRTDALLLTEARVV 331 (331) Q Consensus 316 pir~~dai~~tE~~Vv 331 (331) +|..+|.+-...+-++ T Consensus 203 ~Vi~s~~~p~~t~~l~ 218 (276) T protein:vir:10 203 VIVRSKKLDEGEAILA 218 (276) T ss_pred eEEEcCCCCcceEEEE Confidence 9999999855444433 No 134 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=46.34 E-value=0.74 Score=21.31 Aligned_cols=217 Identities=14% Similarity=0.102 Sum_probs=106.6 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhccee----cccCCccceeEEEeecCCcceeecCCccCcc Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVI----EANGFTEHKTTVRSGLPTGTWRKLNYGVQPE 76 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lpf~----e~n~~~~~~~~~~~~lP~~~fR~lN~g~~~s 76 (331) ||.+ 
.=+|.++ +.+. ...+.|.|.|.+ +-++..+.-+ ++..|..-.......+..+.+..=+.+++++ T Consensus 1 Ma~~---~T~~~~~---iiPe-v~s~~v~~~~~~-~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~ 72 (278) T protein:vir:80 1 MADL---TTKLANL---IDPE-VMGPMISAKLPK-AIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYS 72 (278) T ss_pred CCCc---ceehhhe---ecHH-HHHHHHHHHHHH-hhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCccc Confidence 7743 1133332 2221 123345555543 2222222211 2222222223333445567776667888888 Q ss_pred cceEEEEEEEEEEecchhhhhHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccccc Q lcl|Aclame:pro 77 KSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) Q Consensus 77 ~~t~~~~~~~l~ilgg~~eVDr~la~~~g-n~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~ 155 (331) +-++.+.+..+.-.+-.++|+...+.+.+ +.- ..-.+++..++++++...++.. +.| T Consensus 73 ~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~---~~~~~~~a~~~a~~~d~~l~~~--------l~~----------- 130 (278) T protein:vir:80 73 ALETESVKHGIKKAGKGVKLTDESVLSGYGDPV---EEAQKQIRMAIASKVDNDILEE--------ALT----------- 130 (278) T ss_pred ccccceeeEeeehhhccccccHHHHhhccccHH---HHHHHHHHHHHHHHHHHHHHHH--------Hhc----------- Confidence 98888888888887888888886655544 443 4444456777777777655521 110 Q ss_pred ccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEeccccee Q lcl|Aclame:pro 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) Q Consensus 156 ~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~ 235 (331) +.+.++ + . 
T Consensus 131 -a~~~~~----~-----------------------~-------------------------------------------- 138 (278) T protein:vir:80 131 -TTLEVK----G-----------------------A-------------------------------------------- 138 (278) T ss_pred -cccccc----c-----------------------c-------------------------------------------- Confidence 000000 0 0 Q ss_pred eeeccccccCCCCccchhhHHHHHHHHHHHhhcC-CCCceEEEeCHHHHHHHHHHhhc-CCcceeeeeccc-CCceeEEE Q lcl|Aclame:pro 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNV-GMGRPAFYMPRKIRSFLRRQITN-KVAASTLTMEEI-AGKKVVAF 312 (331) Q Consensus 236 RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~-~~g~~~~y~n~~v~~~L~~~~~~-~~~~~~~~~~~~-~g~~v~~~ 312 (331) .+.+ ...+..+++.+|..++-.. .+-.-+++||..+...|++.... ......+ .+.. .--.+-.+ T Consensus 139 --~t~~---------~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~-g~~~~~~G~ig~~ 206 (278) T protein:vir:80 139 --INIG---------LIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQL-GDDLLVKGAFGEL 206 (278) T ss_pred --cccc---------hhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccc-cccceeeccceee Confidence 0000 0001123445555444211 11123699999999999854221 1111000 0110 00125578 Q ss_pred cCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 313 DGIPCRRTDALLLTEARVV 331 (331) Q Consensus 313 ~gvpir~~dai~~tE~~Vv 331 (331) .|++|..+|++-+...-++ T Consensus 207 ~G~~Vi~s~~~p~~t~~l~ 225 (278) T protein:vir:80 207 LGWEIVRTKKLADGNALAV 225 (278) T ss_pred cceeEEEcCCCCcceEEEE Confidence 9999999999865444444 No 135 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=36.57 E-value=1.2 Score=20.22 Aligned_cols=171 Identities=13% Similarity=0.087 Sum_probs=88.3 Q ss_pred ecccCCccceeEEEeecCCcceeecCCc--cCcccceEEEEEEEEEEecchhhh-hHHHHhhCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 45 IEANGFTEHKTTVRSGLPTGTWRKLNYG--VQPEKSRTVQVKDSMGMLETYAEV-DKALADLNGNSAAWRLSEDRAFIEG 121 (331) Q Consensus 45 
~e~n~~~~~~~~~~~~lP~~~fR~lN~g--~~~s~~t~~~~~~~l~ilgg~~eV-Dr~la~~~gn~~~~ra~q~~~~ika 121 (331) .++.+. |-.-++..-+ .....+.+| +++.+=++.+.+..++-.+-.++| |.+...-.||+-...++| +-.+ T Consensus 1 ~~~~~~-Gdtit~P~~i--Gda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q---~~~~ 74 (231) T protein:vir:73 1 ENGINL-ANLCEYPNDI--GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQ---LGLS 74 (231) T ss_pred CccccC-CceEEecccc--cchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHH---HHHH Confidence 222222 2222222111 133445444 455556667777777777777776 445566677775444443 5566 Q ss_pred HHHHHHHhhccCCCCCChhhhcCchhhhccccccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeecccc Q lcl|Aclame:pro 122 MNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLG 201 (331) Q Consensus 122 ~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g 201 (331) |+.++..-++. +.. ++.+ T Consensus 75 iA~kvD~di~~------------------------------~~~-~a~l------------------------------- 92 (231) T protein:vir:73 75 LANKVDDDLLK------------------------------AAK-TTSQ------------------------------- 92 (231) T ss_pred HHHhhhHHHHH------------------------------hhc-cccc------------------------------- Confidence 66655543330 000 0000 Q ss_pred ceeeeccCCCeeEEEEEEEEeeeceEEecccceeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHH Q lcl|Aclame:pro 202 EDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRK 281 (331) Q Consensus 202 ~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~ 281 (331) + ++. ..-.+.+.+|+..+-.....+.+++||.. 
T Consensus 93 -------------------------~------------~~~----------~~t~d~i~~A~~~fgde~~~~~vivv~p~ 125 (231) T protein:vir:73 93 -------------------------T------------VST----------KANVDGVQAALDIFNDEDAQAYVLIVNPK 125 (231) T ss_pred -------------------------c------------ccc----------cccHHHHHHHHHHhccccccceEEEEcch Confidence 0 000 00145577788877766667789999999 Q ss_pred HHHHHHHHhhcCCcceeeeeccc-CCceeEEEcCeEEEEEEeccCCCccc---C Q lcl|Aclame:pro 282 IRSFLRRQITNKVAASTLTMEEI-AGKKVVAFDGIPCRRTDALLLTEARV---V 331 (331) Q Consensus 282 v~~~L~~~~~~~~~~~~~~~~~~-~g~~v~~~~gvpir~~dai~~tE~~V---v 331 (331) ....||+-. +..........++ .-=.+-.+.|+||.+++.+-.+...- + T Consensus 126 ~~~~Lrk~~-~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i 178 (231) T protein:vir:73 126 DAAKIRKDA-NAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIV 178 (231) T ss_pred HHHhhhhcc-chhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeeeeEE Confidence 999998632 1111000000111 01134588999999999997655432 2 No 136 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=36.49 E-value=1.2 Score=20.21 Aligned_cols=206 Identities=9% Similarity=0.062 Sum_probs=97.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcc-e---ecccCCccceeEE--EeecCCcceeecCCccC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMT-V---IEANGFTEHKTTV--RSGLPTGTWRKLNYGVQ 74 (331) Q Consensus 1 M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lp-f---~e~n~~~~~~~~~--~~~lP~~~fR~lN~g~~ 74 (331) |+. ..=+|.++ +.+. -..+-|.|.+.+ .-++..+- . .++.. 
|....+ ...+.++.-..=+.+++ T Consensus 1 m~~---~~T~l~d~---i~Pe-v~~~~v~~~~~~-~l~~~~~~~~~~~l~g~~--G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:95 1 MAQ---GMTKLTNQ---IVPE-VLAPMMQAELEK-KLRFASFAEIDNTLVGQP--GDTLTFPAFIYSGDAKVVAEGEKIP 70 (274) T ss_pred CCc---ceeehhhe---echH-HHHHHHHHHHHh-hhhccccceecccccCCC--CCEEEeeeecCCCccccccCCCccc Confidence 654 33355554 3232 122334444432 21121111 1 12221 222222 12234454444456777 Q ss_pred cccceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc Q lcl|Aclame:pro 75 PEKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) Q Consensus 75 ~s~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~ 153 (331) +.+-++.+.+..+.-.+-.+.++-.-+.+ .+|+-+.. .++...+++.++...++. .+. T Consensus 71 ~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~---~~~~~~~~a~~vd~~i~~--------~l~---------- 129 (274) T protein:vir:95 71 TDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQ---VRQHGLAHANKVDDDVLE--------ALK---------- 129 (274) T ss_pred hhhcccceeEEEeeeeecceeehHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHHH--------HHh---------- Confidence 77777777777776666666776433333 34543333 334555666665544330 000 Q ss_pred ccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 154 ~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) + ++. 
++ T Consensus 130 ---~-------a~~----~~------------------------------------------------------------ 135 (274) T protein:vir:95 130 ---S-------AKL----TV------------------------------------------------------------ 135 (274) T ss_pred ---c-------ccc----cc------------------------------------------------------------ Confidence 0 000 00 Q ss_pred eeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCC-----ce Q lcl|Aclame:pro 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG-----KK 308 (331) Q Consensus 234 v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g-----~~ 308 (331) +.+ +.+ .+.+.+|+.++-.......+++||..+...|+.+....- ....+.+. -. T Consensus 136 -------~~~-----~~~----~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f----~~~s~~g~~~~~~G~ 195 (274) T protein:vir:95 136 -------EAD-----ITK----LTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNF----TRATELGDDVIVKGA 195 (274) T ss_pred -------ccc-----ccC----HHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccc----cccccccccceeccc Confidence 000 000 233455555554233345789999999999986532210 01111111 12 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) +-.+.|++|..+|++-...+-++ T Consensus 196 ig~~~G~~Vi~s~~~~~~t~~l~ 218 (274) T protein:vir:95 196 FGEALGAVIVRSNKLEAGTAILA 218 (274) T ss_pred cceecCeEEEEeCCCCCceEEEE Confidence 55789999999999866555554 No 137 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=36.49 E-value=1.2 Score=20.21 Aligned_cols=206 Identities=9% Similarity=0.062 Sum_probs=97.2 Q ss_pred CCcCccccccHHHHHHhcCCccchhHHHHHHHhccchhHhhcc-e---ecccCCccceeEE--EeecCCcceeecCCccC Q lcl|Aclame:pro 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMT-V---IEANGFTEHKTTV--RSGLPTGTWRKLNYGVQ 74 (331) Q Consensus 1 
M~~l~~~a~TL~E~Ak~~~~~~~~~~~VIE~l~~~s~iL~~lp-f---~e~n~~~~~~~~~--~~~lP~~~fR~lN~g~~ 74 (331) |+. ..=+|.++ +.+. -..+-|.|.+.+ .-++..+- . .++.. |....+ ...+.++.-..=+.+++ T Consensus 1 m~~---~~T~l~d~---i~Pe-v~~~~v~~~~~~-~l~~~~~~~~~~~l~g~~--G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:96 1 MAQ---GMTKLTNQ---IVPE-VLAPMMQAELEK-KLRFASFAEIDNTLVGQP--GDTLTFPAFIYSGDAKVVAEGEKIP 70 (274) T ss_pred CCc---ceeehhhe---echH-HHHHHHHHHHHh-hhhccccceecccccCCC--CCEEEeeeecCCCccccccCCCccc Confidence 654 33355554 3232 122334444432 21121111 1 12221 222222 12234454444456777 Q ss_pred cccceEEEEEEEEEEecchhhhhHHHHhh-CCCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCChhhhcCchhhhcccc Q lcl|Aclame:pro 75 PEKSRTVQVKDSMGMLETYAEVDKALADL-NGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLS 153 (331) Q Consensus 75 ~s~~t~~~~~~~l~ilgg~~eVDr~la~~-~gn~~~~ra~q~~~~ika~~~~f~~~~iyGD~~~~~~~F~GL~~R~~~~~ 153 (331) +.+-++.+.+..+.-.+-.+.++-.-+.+ .+|+-+.. .++...+++.++...++. .+. T Consensus 71 ~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~---~~~~~~~~a~~vd~~i~~--------~l~---------- 129 (274) T protein:vir:96 71 TDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQ---VRQHGLAHANKVDDDVLE--------ALK---------- 129 (274) T ss_pred hhhcccceeEEEeeeeecceeehHHHHhhccchHHHHH---HHHHHHHHHHHHHHHHHH--------HHh---------- Confidence 77777777777776666666776433333 34543333 334555666665544330 000 Q ss_pred ccccceeeccCCCCCCceEEEEEEeCCCcEEEEccCCCccceeeccccceeeeccCCCeeEEEEEEEEeeeceEEecccc Q lcl|Aclame:pro 154 AENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRY 233 (331) Q Consensus 154 ~~~~~~vidaGgtG~~~tSI~~V~~g~~~~~giypkg~~aGl~~~d~g~~~~~d~~g~~~~~~~t~~~w~~Gl~v~d~r~ 233 (331) + ++. 
++ T Consensus 130 ---~-------a~~----~~------------------------------------------------------------ 135 (274) T protein:vir:96 130 ---S-------AKL----TV------------------------------------------------------------ 135 (274) T ss_pred ---c-------ccc----cc------------------------------------------------------------ Confidence 0 000 00 Q ss_pred eeeeeccccccCCCCccchhhHHHHHHHHHHHhhcCCCCceEEEeCHHHHHHHHHHhhcCCcceeeeecccCC-----ce Q lcl|Aclame:pro 234 VVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG-----KK 308 (331) Q Consensus 234 v~RI~NId~s~l~~~~~~~~~l~~lm~~a~~~ip~~~~g~~~~y~n~~v~~~L~~~~~~~~~~~~~~~~~~~g-----~~ 308 (331) +.+ +.+ .+.+.+|+.++-.......+++||..+...|+.+....- ....+.+. -. T Consensus 136 -------~~~-----~~~----~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f----~~~s~~g~~~~~~G~ 195 (274) T protein:vir:96 136 -------EAD-----ITK----LTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNF----TRATELGDDVIVKGA 195 (274) T ss_pred -------ccc-----ccC----HHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccc----cccccccccceeccc Confidence 000 000 233455555554233345789999999999986532210 01111111 12 Q ss_pred eEEEcCeEEEEEEeccCCCcccC Q lcl|Aclame:pro 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) Q Consensus 309 v~~~~gvpir~~dai~~tE~~Vv 331 (331) +-.+.|++|..+|++-...+-++ T Consensus 196 ig~~~G~~Vi~s~~~~~~t~~l~ 218 (274) T protein:vir:96 196 FGEALGAVIVRSNKLEAGTAILA 218 (274) T ss_pred cceecCeEEEEeCCCCCceEEEE Confidence 55789999999999866555554 Done!