Query lcl|Aclame:protein:vir:98143|NCBI_annot:gp23 precursor of major head subunit|genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Match_columns 524 No_of_seqs 161 out of 438 Neff 5.2 Searched_HMMs 1612 Date Sun Dec 1 15:51:01 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_135 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_135_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:98143 Length: 524 100.0 2E-268 2E-271 1487.7 39.3 524 1-524 1-524 (524) 2 protein:vir:80986 Length: 528 100.0 4E-252 2E-255 1398.9 38.0 519 1-524 1-528 (528) 3 protein:vir:6901 Length: 522 # 100.0 3E-250 2E-253 1388.2 36.9 518 1-524 4-522 (522) 4 protein:vir:100603 Length: 529 100.0 2E-248 2E-251 1378.1 37.6 519 1-524 1-529 (529) 5 protein:vir:6601 Length: 528 # 100.0 8E-248 5E-251 1375.1 37.8 517 1-524 1-528 (528) 6 protein:vir:103463 Length: 521 100.0 2E-247 2E-250 1372.6 37.4 517 1-524 3-521 (521) 7 protein:vir:107947 Length: 519 100.0 3E-246 2E-249 1367.0 37.9 518 1-524 1-519 (519) 8 protein:vir:101811 Length: 529 100.0 5E-246 3E-249 1365.5 37.6 518 1-524 1-529 (529) 9 protein:vir:7214 Length: 521 # 100.0 8E-246 5E-249 1364.1 37.7 517 1-524 3-521 (521) 10 protein:vir:101039 Length: 529 100.0 1E-245 9E-249 1362.9 36.8 517 1-524 3-529 (529) 11 protein:vir:106286 Length: 534 100.0 4E-243 3E-246 1349.3 37.7 516 1-524 1-534 (534) 12 protein:vir:5670 Length: 514 # 100.0 2E-233 1E-236 1296.7 35.5 508 5-524 1-514 (514) 13 protein:vir:104915 Length: 470 100.0 1E-221 8E-225 1231.6 34.0 460 1-524 3-469 (470) 14 protein:vir:106998 Length: 468 100.0 3E-218 2E-221 1212.9 34.5 457 1-524 1-467 (468) 15 protein:vir:104549 Length: 462 100.0 1E-215 9E-219 1198.5 34.9 451 1-524 1-461 (462) 16 protein:vir:103181 Length: 457 100.0 3E-212 2E-215 1180.5 35.0 449 1-524 1-456 (457) 17 protein:vir:5942 Length: 523 # 100.0 4E-192 3E-195 1069.6 32.1 442 1-524 1-521 (523) 18 protein:vir:6601 Length: 528 # 98.0 1.3E-06 8.1E-10 52.8 15.4 407 1-480 32-528 (528) 19 protein:vir:1886 Length: 385 # 97.3 0.00011 6.8E-08 42.3 20.9 346 1-504 1-385 (385) 20 protein:vir:191 Length: 385 # 97.3 0.00011 6.8E-08 42.3 20.9 346 1-504 1-385 (385) 21 protein:vir:4953 Length: 397 # 97.2 0.00013 8.3E-08 41.8 17.9 332 1-521 1-397 (397) 22 protein:vir:78523 Length: 338 96.4 0.00069 4.3E-07 37.9 16.1 309 33-508 1-338 (338) 23 protein:vir:100135 Length: 418 95.7 0.0016 1E-06 35.9 15.9 354 1-511 21-418 (418) 24 protein:vir:4997 Length: 397 # 95.6 0.0018 1.1E-06 35.7 17.5 332 1-512 1-397 (397) 25 protein:vir:81070 Length: 390 95.6 0.0018 1.1E-06 35.6 19.5 344 1-510 1-390 (390) 26 protein:vir:81100 Length: 415 95.6 0.0018 1.1E-06 35.6 14.2 360 1-510 1-415 (415) 27 protein:vir:79987 Length: 415 95.6 0.0018 1.1E-06 35.6 14.2 360 1-510 1-415 (415) 28 protein:vir:98339 Length: 415 95.6 0.0018 1.1E-06 35.6 14.2 360 1-510 1-415 (415) 29 protein:vir:41 Length: 299 # N 95.3 0.0024 1.5E-06 34.9 17.6 275 72-511 1-299 (299) 30 protein:vir:81227 Length: 413 95.2 0.0025 1.6E-06 34.8 18.7 356 1-524 1-410 (413) 31 protein:vir:81160 Length: 371 95.1 0.0028 1.7E-06 34.6 16.6 335 1-510 1-371 (371) 32 protein:vir:4856 Length: 293 # 94.8 0.0035 2.2E-06 34.0 15.8 259 63-507 1-293 (293) 33 protein:vir:4830 Length: 397 # 94.4 0.0045 2.8E-06 33.4 15.6 337 1-512 1-397 (397) 34 protein:vir:10364 Length: 390 94.1 0.0055 3.4E-06 33.0 20.9 345 1-510 30-390 (390) 35 protein:vir:97053 Length: 390 93.6 0.0069 4.3E-06 32.4 18.8 347 1-510 1-390 (390) 36 protein:vir:7409 Length: 408 # 93.3 0.0081 5E-06 32.0 15.9 343 1-510 4-408 (408) 37 protein:vir:8420 Length: 477 # 93.0 0.009 5.6E-06 31.8 19.4 367 1-510 60-477 (477) 38 protein:vir:4339 Length: 395 # 92.4 0.012 7.2E-06 31.2 18.7 354 1-510 1-395 (395) 39 protein:vir:101607 Length: 379 91.2 0.017 1.1E-05 30.3 20.3 342 1-524 1-379 (379) 40 protein:vir:7771 Length: 330 # 91.1 0.017 1.1E-05 30.2 17.5 298 64-514 1-330 (330) 41 protein:vir:9410 Length: 415 # 90.6 0.02 1.3E-05 29.9 15.5 366 1-510 1-415 (415) 42 protein:vir:2504 Length: 305 # 90.4 0.021 1.3E-05 29.8 17.1 285 79-504 1-305 (305) 43 protein:vir:9759 Length: 303 # 89.8 0.024 1.5E-05 29.4 18.1 286 79-505 1-303 (303) 44 protein:vir:3870 Length: 400 # 89.5 0.026 1.6E-05 29.3 15.8 332 1-507 1-400 (400) 45 protein:vir:2344 Length: 397 # 89.3 0.027 1.7E-05 29.2 18.0 303 72-524 1-329 (397) 46 protein:vir:80376 Length: 435 89.1 0.028 1.8E-05 29.1 20.3 378 1-508 1-435 (435) 47 protein:vir:4700 Length: 415 # 88.1 0.034 2.1E-05 28.6 15.4 359 1-510 1-415 (415) 48 protein:vir:4600 Length: 415 # 88.1 0.034 2.1E-05 28.6 15.4 359 1-510 1-415 (415) 49 protein:vir:78223 Length: 333 88.0 0.035 2.2E-05 28.5 15.2 307 60-496 1-333 (333) 50 protein:vir:9820 Length: 272 # 87.9 0.036 2.2E-05 28.5 15.4 270 151-513 1-272 (272) 51 protein:vir:3033 Length: 272 # 87.9 0.036 2.2E-05 28.5 15.4 270 151-513 1-272 (272) 52 protein:vir:104085 Length: 320 87.0 0.042 2.6E-05 28.1 12.1 296 151-513 1-320 (320) 53 protein:vir:96123 Length: 274 86.2 0.048 2.9E-05 27.8 15.3 270 131-501 1-274 (274) 54 protein:vir:93742 Length: 274 86.1 0.048 3E-05 27.8 14.2 269 170-515 1-274 (274) 55 protein:vir:104256 Length: 458 85.4 0.053 3.3E-05 27.6 17.1 346 1-512 81-458 (458) 56 protein:vir:1638 Length: 298 # 85.1 0.055 3.4E-05 27.5 15.0 280 81-504 1-298 (298) 57 protein:vir:96762 Length: 632 83.8 0.065 4E-05 27.1 18.3 336 1-495 238-632 (632) 58 protein:vir:1268 Length: 397 # 82.0 0.081 5E-05 26.6 14.7 337 1-522 5-397 (397) 59 protein:vir:101650 Length: 497 81.1 0.088 5.5E-05 26.4 21.1 371 1-512 53-497 (497) 60 protein:vir:7855 Length: 497 # 81.1 0.088 5.5E-05 26.4 21.1 371 1-512 53-497 (497) 61 protein:vir:2685 Length: 387 # 80.1 0.098 6.1E-05 26.1 13.9 338 1-524 1-381 (387) 62 protein:vir:94424 Length: 387 80.1 0.098 6.1E-05 26.1 13.9 338 1-524 1-381 (387) 63 protein:vir:96978 Length: 387 80.1 0.098 6.1E-05 26.1 13.9 338 1-524 1-381 (387) 64 protein:vir:105905 Length: 304 80.0 0.099 6.2E-05 26.1 18.4 280 69-504 1-304 (304) 65 protein:vir:94142 Length: 304 80.0 0.099 6.2E-05 26.1 18.4 280 69-504 1-304 (304) 66 protein:vir:9574 Length: 300 # 77.9 0.12 7.4E-05 25.6 17.6 281 79-523 1-300 (300) 67 protein:vir:9704 Length: 394 # 77.0 0.13 8E-05 25.5 17.2 331 1-524 31-390 (394) 68 protein:vir:94673 Length: 419 75.4 0.15 9.1E-05 25.1 20.8 363 1-524 1-417 (419) 69 protein:vir:6242 Length: 390 # 73.8 0.17 0.0001 24.9 12.8 353 1-507 4-390 (390) 70 protein:vir:1025 Length: 408 # 73.2 0.17 0.00011 24.8 14.4 339 1-521 4-408 (408) 71 protein:vir:102119 Length: 404 71.9 0.19 0.00012 24.5 14.0 348 1-498 1-404 (404) 72 protein:vir:95898 Length: 274 71.6 0.19 0.00012 24.5 15.8 270 131-509 1-274 (274) 73 protein:vir:96262 Length: 274 71.6 0.19 0.00012 24.5 15.8 270 131-509 1-274 (274) 74 protein:vir:99920 Length: 311 70.8 0.2 0.00013 24.4 13.9 282 164-510 1-311 (311) 75 protein:vir:9309 Length: 324 # 70.6 0.21 0.00013 24.3 18.2 303 36-513 1-324 (324) 76 protein:vir:3845 Length: 395 # 70.5 0.21 0.00013 24.3 17.2 339 1-512 1-395 (395) 77 protein:vir:3991 Length: 404 # 69.9 0.22 0.00013 24.2 16.8 344 1-512 5-404 (404) 78 protein:vir:96223 Length: 324 68.8 0.23 0.00014 24.1 17.3 302 24-513 1-324 (324) 79 protein:vir:94494 Length: 274 66.0 0.27 0.00017 23.7 15.3 272 131-515 1-274 (274) 80 protein:vir:97433 Length: 274 66.0 0.27 0.00017 23.7 15.3 272 131-515 1-274 (274) 81 protein:vir:1433 Length: 435 # 64.9 0.29 0.00018 23.5 19.2 347 1-508 30-435 (435) 82 protein:vir:4226 Length: 326 # 62.3 0.34 0.00021 23.2 18.2 303 44-513 1-326 (326) 83 protein:vir:1383 Length: 421 # 61.8 0.35 0.00022 23.1 14.5 340 1-524 4-415 (421) 84 protein:vir:8187 Length: 311 # 59.9 0.38 0.00024 22.9 17.5 288 79-506 1-311 (311) 85 protein:vir:100172 Length: 394 55.7 0.47 0.00029 22.4 17.9 348 1-518 1-394 (394) 86 protein:vir:1781 Length: 221 # 54.5 0.5 0.00031 22.2 15.2 204 258-499 1-221 (221) 87 protein:vir:100247 Length: 425 52.9 0.54 0.00034 22.0 19.0 334 1-499 50-425 (425) 88 protein:vir:78830 Length: 324 48.8 0.66 0.00041 21.6 17.5 300 15-508 1-324 (324) 89 protein:vir:96392 Length: 324 48.8 0.66 0.00041 21.6 17.5 300 15-508 1-324 (324) 90 protein:vir:95763 Length: 297 47.4 0.7 0.00044 21.4 16.8 277 64-506 1-297 (297) 91 protein:vir:100884 Length: 389 47.0 0.71 0.00044 21.4 18.3 340 1-512 1-389 (389) 92 protein:vir:3613 Length: 272 # 44.0 0.82 0.00051 21.0 13.3 267 160-524 1-272 (272) 93 protein:vir:9361 Length: 402 # 43.4 0.84 0.00052 21.0 14.6 337 1-524 16-396 (402) 94 protein:vir:105004 Length: 392 42.3 0.89 0.00055 20.9 18.5 326 1-511 35-392 (392) 95 protein:vir:102082 Length: 392 42.3 0.89 0.00055 20.9 18.5 326 1-511 35-392 (392) 96 protein:vir:107593 Length: 392 42.3 0.89 0.00055 20.9 18.5 326 1-511 35-392 (392) 97 protein:vir:102873 Length: 392 42.3 0.89 0.00055 20.9 18.5 326 1-511 35-392 (392) 98 protein:vir:6212 Length: 434 # 42.2 0.89 0.00055 20.9 19.1 347 1-511 31-434 (434) 99 protein:vir:94771 Length: 298 40.4 0.97 0.0006 20.7 15.2 279 81-504 1-298 (298) 100 protein:vir:80684 Length: 315 38.1 1.1 0.00067 20.4 14.5 281 170-513 1-315 (315) 101 protein:vir:105334 Length: 276 37.8 1.1 0.00068 20.4 14.1 273 151-512 1-276 (276) 102 protein:vir:97148 Length: 324 37.7 1.1 0.00068 20.3 16.4 294 1-500 1-324 (324) 103 protein:vir:93881 Length: 387 37.5 1.1 0.00069 20.3 13.9 334 1-524 1-381 (387) 104 protein:vir:2430 Length: 318 # 37.5 1.1 0.00069 20.3 18.0 290 44-511 1-318 (318) 105 protein:vir:99749 Length: 324 36.2 1.2 0.00074 20.2 18.1 296 21-500 1-324 (324) 106 protein:vir:4092 Length: 390 # 35.8 1.2 0.00075 20.1 16.0 352 1-504 1-390 (390) 107 protein:vir:739 Length: 231 # 32.9 1.4 0.00087 19.8 13.6 219 195-524 1-231 (231) 108 protein:vir:105038 Length: 428 31.0 1.5 0.00095 19.6 18.5 335 1-506 53-428 (428) 109 protein:vir:1239 Length: 274 # 30.1 1.6 0.00099 19.5 15.3 272 131-509 1-274 (274) 110 protein:vir:3364 Length: 347 # 26.2 2 0.0012 19.0 11.9 311 131-524 1-345 (347) 111 protein:vir:9643 Length: 377 # 23.8 2.3 0.0014 18.6 15.6 344 1-511 1-377 (377) 112 protein:vir:103955 Length: 324 23.3 2.3 0.0014 18.6 19.3 294 1-500 1-324 (324) 113 protein:vir:1541 Length: 347 # 21.4 2.6 0.0016 18.3 17.6 292 180-502 1-347 (347) 114 protein:vir:5739 Length: 366 # 20.4 2.8 0.0017 18.1 15.0 333 1-506 1-366 (366) No 1 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=2.5e-268 Score=1487.70 Aligned_cols=524 Identities=100% Similarity=1.420 Sum_probs=519.8 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) ||++|+|+|||+||||++||+|+|++.+||+|+++|||||+||++++++|||++++++|+.+|+||++.|||||+++||+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 80 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) +|++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++.++++|++|+|||+++++||++||| T Consensus 81 ~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG 160 (524) T protein:vir:98 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) T ss_pred ccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCC Confidence 99999999999999999999999999999999999999999999999999999999899999999999999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) Q Consensus 161 ~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal 240 (524) .++.++++..+.+.+...+++.++.+..++..+.++..+++.+.++++|..++.........+..++++.||+|+.+|+| T Consensus 161 ~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL 240 (524) T protein:vir:98 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) T ss_pred ccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999989999999999999999999 Q ss_pred cccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|Aclame:pro 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) Q Consensus 241 ~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~ 320 (524) ++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++||+ T Consensus 241 ~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~ 320 (524) T protein:vir:98 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) T ss_pred ccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccccc Q lcl|Aclame:pro 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) Q Consensus 321 ~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~ 400 (524) +++|||+++++++|+|||++++|++++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+|+++||+++ T Consensus 321 ~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~ 400 (524) T protein:vir:98 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) T ss_pred ceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|Aclame:pro 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) Q Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tR 480 (524) |+|++.++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||| T Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tR 480 (524) T protein:vir:98 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) T ss_pred cchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 481 YGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 481 Y~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |||++|||+++.++++++|||+|+||+++||+|+|||+|+|||| T Consensus 481 Y~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 481 YGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred eceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 99999999999999999999999999999999999999999999 No 2 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=3.9e-252 Score=1398.88 Aligned_cols=519 Identities=72% Similarity=1.129 Sum_probs=476.0 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |+++|+|+|||+||||| ||+|+|++.|||+|+++|||||||+++++|+|||++++++|+.+|+||++.|||||+++||+ T Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 79 (528) T protein:vir:80 1 MKTTKELMEKWSPLLEN-EKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIA 79 (528) T ss_pred CcchHHHHHhhhHhhcC-CccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCcccccc Confidence 99999999999999997 88999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++...+ -+||||+++++|+.||+ T Consensus 80 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~----~~ea~~~~~~~da~fS~ 155 (528) T protein:vir:80 80 AGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQ----AKEAFHPMYAPDAFHSS 155 (528) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccc----ccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999875433 37999999999999998 Q ss_pred cccc---------ccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 161 EGAH---------TAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVG 231 (524) Q Consensus 161 ~g~~---------~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~G 231 (524) ..+. +.++..+.+.+...|+...+.+...+....+++.......+.......+.........+..++++.| T Consensus 156 ~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~G 235 (528) T protein:vir:80 156 LAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFG 235 (528) T ss_pred ccccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccc Confidence 6443 3345555555666666666655555555544443222221111111222333344556778999999 Q ss_pred ccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 232 MATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIV 311 (524) Q Consensus 232 mtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii 311 (524) |+|+.+|+++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||| T Consensus 236 m~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii 315 (528) T protein:vir:80 236 MATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIV 315 (528) T ss_pred cchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhh Q lcl|Aclame:pro 312 DLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALA 391 (524) Q Consensus 312 ~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~ 391 (524) ++|+++|++++++|+..+++++|+|||++++|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+ T Consensus 316 ~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~ 395 (528) T protein:vir:80 316 DVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILA 395 (528) T ss_pred hhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccc Q lcl|Aclame:pro 392 RIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNF 471 (524) Q Consensus 392 ~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~ 471 (524) |+|.+++++.++.+..+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+|| T Consensus 396 ~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sf 475 (528) T protein:vir:80 396 SADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSF 475 (528) T ss_pred hccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 472 QPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 472 qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) ||+|||||||||++|||+++.+|++++|||||+||+++||+|+|||||+|||| T Consensus 476 qP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 476 HPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cceeeeeeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 99999999999999999999999999999999999999999999999999999 No 3 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=3.5e-250 Score=1388.22 Aligned_cols=518 Identities=71% Similarity=1.133 Sum_probs=497.5 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |+++|+|+|||+||||| ||+|+|++ +||+|+++|||||||+++++|.|||++++++|+.||+||+|.|||||+++||+ T Consensus 4 ~~~~e~l~~kw~p~l~~-~~~~~~~~-~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 81 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEG-EGLPEIAN-SKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIA 81 (522) T ss_pred cchHHHHHHhhHHHhcC-CCCCcccc-chhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCccccc Confidence 77889999999999997 88999987 59999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++.+..+. .|||++++++|++||| T Consensus 82 es~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~----~eaf~~~neadt~fSG 157 (522) T protein:vir:69 82 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGA----KEAFHPMYAPDAMFSG 157 (522) T ss_pred ccccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCcc----cccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998765433 6889999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) Q Consensus 161 ~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal 240 (524) .+..+.++..+.+++...|+...+.+...+.+............+++++..++.+.......+.+|+++.||+|+.+|++ T Consensus 158 ~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal 237 (522) T protein:vir:69 158 QGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQ 237 (522) T ss_pred ccccccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhc Confidence 99999999999999999999888888888888888887777788888888888888888888899999999999999999 Q ss_pred cccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|Aclame:pro 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) Q Consensus 241 ~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~ 320 (524) +.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++||+ T Consensus 238 ~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 317 (522) T protein:vir:69 238 EGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQV 317 (522) T ss_pred ccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccccc Q lcl|Aclame:pro 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) Q Consensus 321 ~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~ 400 (524) +++|++....+++|+|||++++|+.++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|++.++.++ T Consensus 318 ~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~ 397 (522) T protein:vir:69 318 GKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYA 397 (522) T ss_pred eccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred chhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|Aclame:pro 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) Q Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tR 480 (524) ++..+.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||| T Consensus 398 ~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tR 477 (522) T protein:vir:69 398 AQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 477 (522) T ss_pred cccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeee Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccEecCcccccCCCccccccccch-HHhhccchhhhhhhhcccC Q lcl|Aclame:pro 481 YGIGINPFANSRSQAPADRITSGMI-SKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 481 Y~l~~nP~~~~~~~~~~~~i~~~~~-~~~~a~~~~~~~~~~V~~~ 524 (524) |||++|||++..+|++++|||||+| |.+.+|+|.|||||+|||| T Consensus 478 Y~l~vNP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 478 YGIGVNPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eceeecCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 9999999999999999999999995 6699999999999999999 No 4 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=2.4e-248 Score=1378.08 Aligned_cols=519 Identities=76% Similarity=1.177 Sum_probs=479.5 Q ss_pred CC-chHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCcccc Q lcl|Aclame:pro 1 MS-KKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNI 79 (524) Q Consensus 1 m~-~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~ 79 (524) |+ +.|+|+|||+||||| ||+|+|++.|||+|+++|||||||+++|+|+|||..+.++++++|+|+++.|+|||+++|| T Consensus 1 ~~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKTKEILNKWTPLLEG-EGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred CccchHHHHHHhhHhhcC-CccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccc Confidence 54 446899999999997 8899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccc Q lcl|Aclame:pro 80 ASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYS 159 (524) Q Consensus 80 ~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fS 159 (524) ++|++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++.+.. ...|||+++++||+.|| T Consensus 80 a~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~----~g~eaf~~~~e~dt~~S 155 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAA----GAKEAFHPMYAPDAWHS 155 (529) T ss_pred cccccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCC----ccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999987543 23789999999999999 Q ss_pred ccccc--------ccccccccccccccccccccccccccccccccccccCCccccc-Ccccccccccccccccccccccc Q lcl|Aclame:pro 160 GEGAH--------TAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGA-DPAALDAAVIAENEKGTLAEISV 230 (524) Q Consensus 160 G~g~~--------~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt-~p~~~~~~~~~~~~~g~~~~~~~ 230 (524) |.+.. +.++..+.+.+...+++..+++.+.+..+...+..+......+ .....+.........+..+++++ T Consensus 156 G~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~ 235 (529) T protein:vir:10 156 GLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAE 235 (529) T ss_pred ccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCcccccccccccccccccccc Confidence 97654 3455666677777788888888887777776655433322111 11223333344556677899999 Q ss_pred cccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 231 GMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREI 310 (524) Q Consensus 231 GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINrei 310 (524) ||+|+.+|+|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||| T Consensus 236 gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINRei 315 (529) T protein:vir:10 236 GMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREV 315 (529) T ss_pred ccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhh Q lcl|Aclame:pro 311 VDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 390 (524) Q Consensus 311 i~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L 390 (524) |++|+++||++++||++++++++|+|||+++.|++++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++| T Consensus 316 i~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L 395 (529) T protein:vir:10 316 IDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 395 (529) T ss_pred HHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCcc Q lcl|Aclame:pro 391 ARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKN 470 (524) Q Consensus 391 ~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s 470 (524) +|++.++.+.....+.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+| T Consensus 396 ~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s 475 (529) T protein:vir:10 396 ALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKN 475 (529) T ss_pred hhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCc Confidence 99888888887777888899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 471 FQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 471 ~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |||+|||||||||++|||+++.++++.+||+||+||++++|||+|||+|+|||| T Consensus 476 fqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 476 FQPVMGFKTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=8.4e-248 Score=1375.13 Aligned_cols=517 Identities=72% Similarity=1.134 Sum_probs=466.0 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |+++|+|+|||+||||| ||+|+|++.|||+|+++|||||||+++++|+|||++++++|+.+|+|+++.|||||++.+|+ T Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 79 (528) T protein:vir:66 1 MKTTKELMEKWSPLLEN-EKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIA 79 (528) T ss_pred CcchHHHHHHhHHhhcC-CCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhcc Confidence 99999999999999997 88999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) +|++|++|++|||+||+|||||+|||||+|||||||||||||||||||++|+++++.+++ .||||+++.+++.||+ T Consensus 80 es~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~----~eAfh~~~g~ea~fse 155 (528) T protein:vir:66 80 AGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGA----REAFHPMYAPDAFHSS 155 (528) T ss_pred ccccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcccccc----cccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999877655 7899999999999998 Q ss_pred ccccccccccccc---------cccccccccccccccccccccccccccCC--cccccCccccccccccccccccccccc Q lcl|Aclame:pro 161 EGAHTAFAKITTG---------TAIATGAIVYHIFQETGIAYFQNVTSGNV--TVTGADPAALDAAVIAENEKGTLAEIS 229 (524) Q Consensus 161 ~g~~~~~s~~~~g---------ta~~~g~~~~~~~~~~~~~~~~~~~~g~~--~~tgt~p~~~~~~~~~~~~~g~~~~~~ 229 (524) ....-.....+++ .+...++.+.+....++............ ...++.+ .+.........+..++++ T Consensus 156 a~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~--g~~~~~~~~a~~~~~~~~ 233 (528) T protein:vir:66 156 LAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSES--EDEVVMKLIEEGKLAEIA 233 (528) T ss_pred cccccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccc--cccccccccccccceecc Confidence 6443222222221 12222333332222222222222211111 1111111 122333445556789999 Q ss_pred ccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 230 VGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINRE 309 (524) Q Consensus 230 ~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINre 309 (524) .||+|+.+|+++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||| T Consensus 234 ~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINRE 313 (528) T protein:vir:66 234 FGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINRE 313 (528) T ss_pred cccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhh Q lcl|Aclame:pro 310 IVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSA 389 (524) Q Consensus 310 ii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~ 389 (524) ||++|+++|++++++|+..+++++|+|||++++|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++ T Consensus 314 ii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~ 393 (528) T protein:vir:66 314 IVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNI 393 (528) T ss_pred HHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCc Q lcl|Aclame:pro 390 LARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPK 469 (524) Q Consensus 390 L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~ 469 (524) |+|+|.+++++.++.+..+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+ T Consensus 394 L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 473 (528) T protein:vir:66 394 LASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) T ss_pred HhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 470 NFQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 470 s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) ||||+|||||||||++|||+++.+|++++|||||+||+++||||+|||+|+|||| T Consensus 474 sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 474 SFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cccceeeeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=2.5e-247 Score=1372.58 Aligned_cols=517 Identities=71% Similarity=1.121 Sum_probs=496.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |+++|+|+|||+||||| ||+|+|++ +||+|+|+|||||||+++++|+|||++++++|+.+|+|+++.|+|+++++||+ T Consensus 3 ~~~~~~l~~kw~p~l~~-~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~ 80 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEG-EGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIA 80 (521) T ss_pred cchhHHHHHhhhhhhcc-CCCCcccc-chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCcccccccccc Confidence 99999999999999998 89999987 59999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++...+ .+|||++++++|+.||| T Consensus 81 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~----g~eaf~~~~~ada~fSG 156 (521) T protein:vir:10 81 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAG----AKEAFHPMYGPDAMFSG 156 (521) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccc----cccccchhccccccccc Confidence 99999999999999999999999999999999999999999999999999999976543 26888888999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) Q Consensus 161 ~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal 240 (524) +++.+.++....+++...|+...+.+...+.++..........++++++..++.........+..|+++.||+|+.+|+| T Consensus 157 ~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal 236 (521) T protein:vir:10 157 QGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQ 236 (521) T ss_pred cccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhh Confidence 99999999999999999999999888888888888887777778888888888888888888999999999999999999 Q ss_pred cccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|Aclame:pro 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) Q Consensus 241 ~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~ 320 (524) +.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|++ T Consensus 237 ~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 316 (521) T protein:vir:10 237 ESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQV 316 (521) T ss_pred ccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccccc Q lcl|Aclame:pro 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) Q Consensus 321 ~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~ 400 (524) |++||+.++++++|+|||+++.|+.++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|++.++.++ T Consensus 317 ~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~ 396 (521) T protein:vir:10 317 GKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYA 396 (521) T ss_pred eeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred chhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|Aclame:pro 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) Q Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tR 480 (524) ++.++.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||| T Consensus 397 ~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tR 476 (521) T protein:vir:10 397 AQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 476 (521) T ss_pred cccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeee Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccEecCcccccCCCccccccccchHHhhc--cchhhhhhhhcccC Q lcl|Aclame:pro 481 YGIGINPFANSRSQAPADRITSGMISKEMC--GKNAYFRKVWVKGL 524 (524) Q Consensus 481 Y~l~~nP~~~~~~~~~~~~i~~~~~~~~~a--~~~~~~~~~~V~~~ 524 (524) |||++|||+++.+|+++ |+|++++|++++ ++|.|||||+|||| T Consensus 477 Y~l~~NP~~~~~~~~~~-~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 477 YGIGINPFAESAAQAPA-SRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eceeecCcccccCCccc-eeecccchhhhccccccceeeeeeecCC Confidence 99999999999999887 889999999877 66789999999999 No 7 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=2.6e-246 Score=1366.98 Aligned_cols=518 Identities=70% Similarity=1.115 Sum_probs=496.6 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |+|. +|+|||+||||| ||+|+|++.|||+|+++||||||||++++++|||++++++|+.||+|+++.|+|||+++||+ T Consensus 1 ~~~~-~l~~kw~p~l~~-~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~ 78 (519) T protein:vir:10 1 MKKN-ALVQKWSALLEN-EALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIA 78 (519) T ss_pred Cchh-HHHHHhHHhhcc-cccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccc Confidence 8776 799999999997 88999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ++++|++|++|+|+||+|+||++|||||+|||||||||||||||||||+||++++... ...|||++++++|+.||| T Consensus 79 ~~~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~----~g~ea~~~~nEadt~fSG 154 (519) T protein:vir:10 79 AGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAA----GAKEAFHPMYAPNAMFSG 154 (519) T ss_pred cccccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCcccc----ccccccccccccccccCc Confidence 9999999999999999999999999999999999999999999999999999987543 237889999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) Q Consensus 161 ~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal 240 (524) +++.+..+.++.+.....++...+.+...+.+.............++++..++.........+..++++.||+|+.+|++ T Consensus 155 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal 234 (519) T protein:vir:10 155 QGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQ 234 (519) T ss_pred cccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhcc Confidence 99999999999998889999988888888888877777777777777777777777778888899999999999999999 Q ss_pred cccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|Aclame:pro 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) Q Consensus 241 ~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~ 320 (524) +.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+||++|||+ T Consensus 235 ~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 314 (519) T protein:vir:10 235 EGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQV 314 (519) T ss_pred ccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccccc Q lcl|Aclame:pro 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) Q Consensus 321 ~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~ 400 (524) +++|+++++++++|+|||++++|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|++.++.++ T Consensus 315 ~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~ 394 (519) T protein:vir:10 315 GKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYA 394 (519) T ss_pred ceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998889999 Q ss_pred chhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|Aclame:pro 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) Q Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tR 480 (524) +.+++.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||| T Consensus 395 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tR 474 (519) T protein:vir:10 395 AQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 474 (519) T ss_pred cccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccEecCcccccCCCccccccccch-HHhhccchhhhhhhhcccC Q lcl|Aclame:pro 481 YGIGINPFANSRSQAPADRITSGMI-SKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 481 Y~l~~nP~~~~~~~~~~~~i~~~~~-~~~~a~~~~~~~~~~V~~~ 524 (524) |||++|||++..+|++++||+||+| |.+.+++|.|||||+|||| T Consensus 475 Y~l~~NP~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 475 YGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred eceeecCcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 9999999999999999999999977 7888899999999999999 No 8 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=4.8e-246 Score=1365.48 Aligned_cols=518 Identities=73% Similarity=1.150 Sum_probs=474.7 Q ss_pred CC-chHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCcccc Q lcl|Aclame:pro 1 MS-KKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNI 79 (524) Q Consensus 1 m~-~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~ 79 (524) |+ +.|+|+|||+||||| ||+|+|++.|||+|+++|||||||++++++.|||+++.++++.+|+|++|.|+|||+++|| T Consensus 1 ~~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEG-EGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred CccchHHHHHHhhHhhcC-CccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccc Confidence 44 235799999999997 8899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccc Q lcl|Aclame:pro 80 ASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYS 159 (524) Q Consensus 80 ~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fS 159 (524) ++|++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||+++++..+ .+||||+++.||+.|| T Consensus 80 ~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~----~~eaf~~~~~pda~~s 155 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAG----AKEAFHPMYAPDAWHS 155 (529) T ss_pred ccccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccc----cccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999876533 3788999999999999 Q ss_pred cccccc--------cccccccccccccccccccccccccccccccccccCCcccccCcc--ccccccccccccccccccc Q lcl|Aclame:pro 160 GEGAHT--------AFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA--ALDAAVIAENEKGTLAEIS 229 (524) Q Consensus 160 G~g~~~--------~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~--~~~~~~~~~~~~g~~~~~~ 229 (524) |.+... ++...+.+...+.+++..++|.+.+..+...+..... ..++++. ..+.........+..++++ T Consensus 156 ga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~-~~g~~~t~~~~~~~~~~~~a~~~~~~~~ 234 (529) T protein:vir:10 156 SLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASV-TVGTNETGEALDKLINAAIGEGKLAEIA 234 (529) T ss_pred ccccccccccccccccccccccccccccccceeeecccCceeecccccccc-ccCccccCcccccccccccccccccccc Confidence 975432 3444455566677778888888888887766543222 2222222 1233344455677899999 Q ss_pred ccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 230 VGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINRE 309 (524) Q Consensus 230 ~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINre 309 (524) +||+|+.+|+|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||| T Consensus 235 ~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINRe 314 (529) T protein:vir:10 235 EGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINRE 314 (529) T ss_pred cchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhh Q lcl|Aclame:pro 310 IVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSA 389 (524) Q Consensus 310 ii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~ 389 (524) ||++|+.+|++++.+|+.++++++|+|||+++++++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++ T Consensus 315 ii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~ 394 (529) T protein:vir:10 315 VIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSA 394 (529) T ss_pred HHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCc Q lcl|Aclame:pro 390 LARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPK 469 (524) Q Consensus 390 L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~ 469 (524) |+|++....++-++...+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+ T Consensus 395 L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 474 (529) T protein:vir:10 395 LALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPK 474 (529) T ss_pred HHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCC Confidence 99988777666666677778999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 470 NFQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 470 s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) ||||+|||||||||++|||+++.++++++|||||+||++++|+|+|||||+|||| T Consensus 475 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 475 NFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred cccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999999999 No 9 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=8.5e-246 Score=1364.15 Aligned_cols=517 Identities=70% Similarity=1.115 Sum_probs=483.5 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |+++|+|+|||+||||| ||+|+|++ +||+|+|+|||||||+++++++|||++++++|+.+|+|+++.|+|+++++||+ T Consensus 3 ~~~~~~l~~kw~p~l~~-~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia 80 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEG-EGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIA 80 (521) T ss_pred cchhHHHHHhhhhhhcc-CCCCcccc-chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCccccc Confidence 99999999999999998 89999997 59999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++...+ -.|||++++++|+.||| T Consensus 81 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~----g~ea~~~e~~~da~fSG 156 (521) T protein:vir:72 81 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAG----AKEAFHPMYGPDAMFSG 156 (521) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcc----cccccchhccccccccc Confidence 99999999999999999999999999999999999999999999999999999875432 25677777889999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) Q Consensus 161 ~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal 240 (524) .++.+.++....++..+.|+...+.+...+.++.............+++...+.........+..++++.||+|+.+|++ T Consensus 157 ~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal 236 (521) T protein:vir:72 157 QGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQ 236 (521) T ss_pred ccccccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhh Confidence 99999888888888888888888888777766655554444433444444555555666777789999999999999999 Q ss_pred cccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|Aclame:pro 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) Q Consensus 241 ~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~ 320 (524) +++|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++||+ T Consensus 237 ~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 316 (521) T protein:vir:72 237 EGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQV 316 (521) T ss_pred cccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccccc Q lcl|Aclame:pro 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) Q Consensus 321 ~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~ 400 (524) |++||+.++++++|+|||+++.|+.++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|++.++.++ T Consensus 317 g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~ 396 (521) T protein:vir:72 317 GKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYA 396 (521) T ss_pred eeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred chhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|Aclame:pro 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) Q Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tR 480 (524) +..++.+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||| T Consensus 397 ~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tR 476 (521) T protein:vir:72 397 AQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 476 (521) T ss_pred cccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeee Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccEecCcccccCCCccccccccchHHhhc--cchhhhhhhhcccC Q lcl|Aclame:pro 481 YGIGINPFANSRSQAPADRITSGMISKEMC--GKNAYFRKVWVKGL 524 (524) Q Consensus 481 Y~l~~nP~~~~~~~~~~~~i~~~~~~~~~a--~~~~~~~~~~V~~~ 524 (524) |||++|||+++.+|+++ |+|++++|++++ ++|.|||+|+|||| T Consensus 477 Y~l~~NP~~~~~~~~~a-~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 477 YGIGINPFAESAAQAPA-SRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eceeecCcccccCcccc-eeecCcChhhhcCccccceeeeeeecCC Confidence 99999999999999776 889999999877 56779999999999 No 10 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=1.4e-245 Score=1362.88 Aligned_cols=517 Identities=73% Similarity=1.152 Sum_probs=473.0 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |++. +|+|||+||||| ||+|+|++.|||+|+++|||||||++++++.|||+++.++++.+|+|++|.|+|||++++|+ T Consensus 3 ~~~~-~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~ 80 (529) T protein:vir:10 3 LKNK-EILNKWTPLLEG-EGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIA 80 (529) T ss_pred ccHH-HHHHHhHHHhcC-CccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhccccccccccccc Confidence 5555 799999999997 88999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||+++++..+ .+||||++++||+.||| T Consensus 81 est~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~----~~eaf~~~y~Pda~~sg 156 (529) T protein:vir:10 81 AGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAG----AKEAFHPMYAPDAWHSS 156 (529) T ss_pred cccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccc----ccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999876533 37899999999999999 Q ss_pred cccccc--------ccccccccccccccccccccccccccccccccccCCcccccCcc--cccccccccccccccccccc Q lcl|Aclame:pro 161 EGAHTA--------FAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA--ALDAAVIAENEKGTLAEISV 230 (524) Q Consensus 161 ~g~~~~--------~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~--~~~~~~~~~~~~g~~~~~~~ 230 (524) ...... ...++.....+.+++..++|.+.+..+..+..... ...++++. ..+.........+..++++. T Consensus 157 a~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~-~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~ 235 (529) T protein:vir:10 157 LATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGAS-VTVGTNETGEALDKLINAAIGEGKLAEIAE 235 (529) T ss_pred ccccccccccCccccccccccccccccCcceeeeecccceecccccccc-cccCccccCccccccccccccccccccccc Confidence 654322 33344455556677777788888887766543222 22222221 12233344556678899999 Q ss_pred cccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 231 GMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREI 310 (524) Q Consensus 231 GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINrei 310 (524) ||+|+.+|+|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||| T Consensus 236 Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINRei 315 (529) T protein:vir:10 236 GMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREV 315 (529) T ss_pred ccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhh Q lcl|Aclame:pro 311 VDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 390 (524) Q Consensus 311 i~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L 390 (524) |++|+++|++++.+|+.++++++|+|||+++++++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++| T Consensus 316 i~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L 395 (529) T protein:vir:10 316 IDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 395 (529) T ss_pred HHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCcc Q lcl|Aclame:pro 391 ARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKN 470 (524) Q Consensus 391 ~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s 470 (524) +|++....++.++...+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+| T Consensus 396 ~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s 475 (529) T protein:vir:10 396 ALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKN 475 (529) T ss_pred HhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCc Confidence 99877776666677777889999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 471 FQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 471 ~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |||+|||||||||++|||+++.++++++|||||+||++++|+|+|||||+|||| T Consensus 476 fqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 476 FQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 999999999999999999999999999999999999999999999999999999 No 11 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=4.4e-243 Score=1349.26 Aligned_cols=516 Identities=59% Similarity=0.934 Sum_probs=460.0 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhcc--ccccchhhhhhhccc--------cccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETD--PVYRDEKIVESFGGF--------LAEAEIAG 70 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~--~~~~~~~~~~~~~~~--------l~ea~~~g 70 (524) |++. +|+|||+||||| ||+|+|++.|||+|+++|||||+|||+++ ++|||++++++|+.| |+|+++.| T Consensus 1 ~~~~-~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~ 78 (534) T protein:vir:10 1 MSKK-SLLKKWQPLVES-EGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGG 78 (534) T ss_pred Cchh-HHHHHhHHhhcC-CccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccc Confidence 7765 799999999997 88999999999999999999999999776 699999999999887 99999999 Q ss_pred ccccCccccccccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcc Q lcl|Aclame:pro 71 DHNYDQTNIASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHP 150 (524) Q Consensus 71 ~~~~~~~~~~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~ 150 (524) ||||+++||+||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++... +.+.||||+ T Consensus 79 ~~g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~----~s~~EAf~n 154 (534) T protein:vir:10 79 DHGYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQD----ANAREAFHP 154 (534) T ss_pred ccccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCC----ccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999987543 223466666 Q ss_pred cccccccccccccccccccccccccccccccccccc-----ccccccccccccccCCcccccCccccccccccccccccc Q lcl|Aclame:pro 151 MFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIF-----QETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTL 225 (524) Q Consensus 151 ~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~-----~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~ 225 (524) .+.+|++|||+++....+.+..+.+...+....... ..++....+..........-+++.............+.. T Consensus 155 e~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~ 234 (534) T protein:vir:10 155 TYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYA 234 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccc Confidence 677999999998888777776666665554433221 222222222222222222222222222223334445678 Q ss_pred ccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHH Q lcl|Aclame:pro 226 AEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLE 305 (524) Q Consensus 226 ~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~E 305 (524) ++++.||+|+.+|+|+.|+++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+| T Consensus 235 y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlE 314 (534) T protein:vir:10 235 VETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHE 314 (534) T ss_pred eecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchh Q lcl|Aclame:pro 306 INREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRN 385 (524) Q Consensus 306 INreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~ 385 (524) ||||||++|+++|++++.+++..+++++|+|||+++.|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+ T Consensus 315 INReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~ 394 (534) T protein:vir:10 315 INREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRN 394 (534) T ss_pred hhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhcccccccchh--hhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccc Q lcl|Aclame:pro 386 VVSALARIDSGITPASQG--LQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPL 463 (524) Q Consensus 386 va~~L~~~~~g~~~~s~~--~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~ 463 (524) ||++|+| +|++++.+. .+.++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|+ T Consensus 395 Va~~L~~--~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~ 472 (534) T protein:vir:10 395 VAAALGH--TDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPL 472 (534) T ss_pred HHHHHhh--ccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccc Confidence 9999999 677766654 466789999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCccccceeeeeeeeccEecCcccccCCCccccccccch-HHhhccchhhhhhhhcccC Q lcl|Aclame:pro 464 RGSDPKNFQPVMGFKTRYGIGINPFANSRSQAPADRITSGMI-SKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 464 ~~~dp~s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~-~~~~a~~~~~~~~~~V~~~ 524 (524) +++||+||||+|||||||||++|||++..++++.+||+||++ |++++|+|+|||||+|||| T Consensus 473 ~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 473 RGTDPKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cccCCccccceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 999999999999999999999999999999999999999975 9999999999999999999 No 12 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=1.7e-233 Score=1296.72 Aligned_cols=508 Identities=60% Similarity=0.945 Sum_probs=438.7 Q ss_pred HHHHHHhhHhhccccc--chhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccccc Q lcl|Aclame:pro 5 NELMEKWNDLLESQEG--LPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASG 82 (524) Q Consensus 5 ~~l~~kw~p~l~~~~~--~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~s 82 (524) -+|+|||+||||| || +|+|++.+||+|+++|||||+||++++++|||++++++|+.+|+|+++.|||||++.||++| T Consensus 1 ~~l~~kw~p~l~~-~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s 79 (514) T protein:vir:56 1 MNLTEKWKDLLEA-EGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQG 79 (514) T ss_pred CchhhhhhHHhcc-cccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhcccccccccccccccccccccc Confidence 4799999999997 67 89999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccc Q lcl|Aclame:pro 83 KSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEG 162 (524) Q Consensus 83 t~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g 162 (524) ++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++.+ +.||||+++++|++|||++ T Consensus 80 ~~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~t------g~EAf~~~nEadt~fSG~~ 153 (514) T protein:vir:56 80 VTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT------GAEAFHPTRQADASFSGQA 153 (514) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcc------cccccccccccCcCccccc Confidence 9999999999999999999999999999999999999999999999999998643 2588889999999999998 Q ss_pred ccccccccccccccccccccccccccccccccccccc-cCCcccccCcccccccccccccccccccccccccchhhhhcc Q lcl|Aclame:pro 163 AHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTS-GNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQE 241 (524) Q Consensus 163 ~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~-g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~ 241 (524) +...++..+.......|..........+......... .......................+..++++.||+|+.+|+++ T Consensus 154 ~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~ 233 (514) T protein:vir:56 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQE 233 (514) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcc Confidence 7777666555444444433322221111111111000 000000011111111223344556789999999999999999 Q ss_pred ccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeee Q lcl|Aclame:pro 242 NFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVG 321 (524) Q Consensus 242 ~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~ 321 (524) +||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+++ T Consensus 234 ~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~ 313 (514) T protein:vir:56 234 NFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIG 313 (514) T ss_pred cCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeeh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccc Q lcl|Aclame:pro 322 KSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAS 401 (524) Q Consensus 322 ~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s 401 (524) +.+|++++. .+|+|||+++.|+.++||++||||+|++|||||+|+|+|+|+||+||||||||+||++|+| +|+++++ T Consensus 314 ~~~~~~~~~-~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~--sg~l~~~ 390 (514) T protein:vir:56 314 KSGWTQGAG-AAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM--TDTLVGP 390 (514) T ss_pred hcccccccc-cccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHh--hhhhccc Confidence 999988875 4799999999999999999999999999999999999999999999999999999999999 6666543 Q ss_pred hh---hhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeee Q lcl|Aclame:pro 402 QG---LQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 478 (524) Q Consensus 402 ~~---~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~ 478 (524) +. ...++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+|||| T Consensus 391 ~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 470 (514) T protein:vir:56 391 AAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFK 470 (514) T ss_pred cccCccccccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccCCccccceeeee Confidence 33 345789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 479 TRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 479 tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |||||++|||++..++. .++.|+++-.+..++|.|||+|+|||| T Consensus 471 tRY~l~~NPy~~~~~~~--~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 471 TRYGVQVNPFADPTASA--TKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred eeeceeeCCCCCccccc--cccCCcchhhhcccccceeeeEEEecC Confidence 99999999998544432 234444444444469999999999999 No 13 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1.3e-221 Score=1231.59 Aligned_cols=460 Identities=40% Similarity=0.699 Sum_probs=413.8 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhccccccc-ccccccccCcccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEA-EIAGDHNYDQTNI 79 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea-~~~g~~~~~~~~~ 79 (524) |+++|+|+|||+||||| ||+|+|++.|||+|+++|||||+|+|+|++ .+|+|+ ++.++||+++++| T Consensus 3 ~~~~e~l~~kw~p~l~~-~~~~~i~~~~~~~v~a~l~enq~~~~~~~~------------~~l~e~~~~~~~~~~~~~~i 69 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDY-DGLDPIKDSHRRSVTAVLLENQEKELREER------------NFLSEAPNVNTNSGATAGFS 69 (470) T ss_pred cchhHHHHHhhhhhhcC-CccchhcchhhhhhhhhhhhhhHHHHhhcc------------chhhhhhhcccccccccccc Confidence 99999999999999997 889999999999999999999999999999 469999 5788999999999 Q ss_pred ccccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccc Q lcl|Aclame:pro 80 ASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYS 159 (524) Q Consensus 80 ~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fS 159 (524) +||++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++ +|+|++|+|| |++|| T Consensus 70 ~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~---sG~EaffnEA-------~T~fS 139 (470) T protein:vir:10 70 ADATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQ---SGTEALFNEA-------DTAFS 139 (470) T ss_pred ccccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCC---CccceeeecC-------CcccC Confidence 9999999999999999999999999999999999999999999999999999987 5788999885 89999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 160 GEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAEL 239 (524) Q Consensus 160 G~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEa 239 (524) |.++............. .....+....++++|..++... ....+..++++.||+|+.+|. T Consensus 140 G~~~~~~~~~~~~~~~a------------------~~~g~~~~~~~gt~~~~~~~~~--~~a~~~~y~~~~GMsTa~aE~ 199 (470) T protein:vir:10 140 GQPDGLDDTSGFTATGA------------------NNVGLGTTAQQGSNPGLLNSTA--AQTNATDYNVGQGMRTDSAED 199 (470) T ss_pred ccccccccccccccccc------------------cccccccccccccccccccccc--ccccccccccccccchHHhhh Confidence 98665443222111100 1111223334455554443332 233456789999999999997 Q ss_pred ccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhee Q lcl|Aclame:pro 240 QENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQ 319 (524) Q Consensus 240 l~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~ 319 (524) | |++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+ T Consensus 200 l---g~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~ 276 (470) T protein:vir:10 200 L---GDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAE 276 (470) T ss_pred c---CCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhh Confidence 6 5788899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccc Q lcl|Aclame:pro 320 VGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 399 (524) Q Consensus 320 ~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~ 399 (524) +++..++ .++|+|||+++.+ +||++|+||+|++||++++|+|+|+|+||+||||||||+||++|+| +||++ T Consensus 277 ~~k~~~~----~~~Gv~Dl~~~~~---gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~--sG~l~ 347 (470) T protein:vir:10 277 PGAQANV----AAAGTFDLDTDSN---GRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTM--AGVLD 347 (470) T ss_pred hceeccc----cccceEEeecccc---hhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhh--ccccc Confidence 9988654 4479999999865 9999999999999999999999999999999999999999999999 89999 Q ss_pred cchhhhcccccccccceeEEEecCcEEEEecCC------CCcceEEEEEecCCCccceeEeecccccccccccCCccccc Q lcl|Aclame:pro 400 ASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQY------ARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQP 473 (524) Q Consensus 400 ~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y------~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP 473 (524) +.|+++..+++|+++++|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+|||| T Consensus 348 ~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP 427 (470) T protein:vir:10 348 YTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQP 427 (470) T ss_pred cccccccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCccccc Confidence 999999999999999999999999999999997 78899999999999999999999999999999999999999 Q ss_pred eeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 474 VMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 474 ~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) +|||||||||++|||+...++.++ ||++ |+|.|||||+|||| T Consensus 428 ~~g~~tRY~l~~NP~~~~~~~~~~-~i~~--------~~n~y~r~~~v~~l 469 (470) T protein:vir:10 428 KIGFKTRYGLVENPFSQGTTQGLG-TLTR--------NSNRYYRRVKVANL 469 (470) T ss_pred eeeeeeeeceeecCcccCCCcccc-cccC--------CCCceeeEEEeecc Confidence 999999999999999999998655 7776 88999999999999 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=3.3e-218 Score=1212.88 Aligned_cols=457 Identities=37% Similarity=0.635 Sum_probs=398.2 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccc-c Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTN-I 79 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~-~ 79 (524) |+|+|+|+|||+||||| ||+|+|++.|||+|+++||||||||+++++.|++|.+.++|+ +|+....| + T Consensus 1 ~~~~e~l~~kW~plLe~-~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~----------~~~~~~~n~~ 69 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNH-GEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLG----------AGTIAPAGSA 69 (468) T ss_pred CcchHHHHHhhhHhhcC-CccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcC----------Ccccchhhhh Confidence 99999999999999997 889999999999999999999999999999999999887774 44444444 5 Q ss_pred ccccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccc Q lcl|Aclame:pro 80 ASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYS 159 (524) Q Consensus 80 ~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fS 159 (524) +++++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++ .++|++|+|| |++|| T Consensus 70 ~~~~~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~---~g~EAf~nEa-------dt~fS 139 (468) T protein:vir:10 70 LGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ---AGEEALFNEP-------DTGFT 139 (468) T ss_pred hhhcccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCC---CCccceeccc-------ccccc Confidence 5789999999999999999999999999999999999999999999999999998 4778888775 89999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 160 GEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAEL 239 (524) Q Consensus 160 G~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEa 239 (524) |.+..+....... ..........+++|...+.+ .+..++++.||+|+.+|. T Consensus 140 g~~~~~~~~~~~~-----------------------~~~~~~~~~~g~~~~~~~~a------~~~~~~~g~gMsTa~aE~ 190 (468) T protein:vir:10 140 GGYDASQGDYAVR-----------------------TGAGVGGDSEGNNPALLNDA------APGTYEVGSKMPREDLER 190 (468) T ss_pred ccccccccccccc-----------------------cccccccCCCCCcccccccc------cccccccccccchHHHhh Confidence 9754432111100 00111223344444443322 345688999999999998 Q ss_pred ccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhee Q lcl|Aclame:pro 240 QENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQ 319 (524) Q Consensus 240 l~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~ 319 (524) ++ +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+ T Consensus 191 lG----~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~ 266 (468) T protein:vir:10 191 MG----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK 266 (468) T ss_pred cC----CCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhh Confidence 84 45678999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred eeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccc Q lcl|Aclame:pro 320 VGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 399 (524) Q Consensus 320 ~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~ 399 (524) ++++ ..++++|+|||+++.+ +||++|++|+|++|||+|+|+|+|+|+||+||||||||+||++|+| +||++ T Consensus 267 ~~k~----~g~~~~Gv~d~~~~~~---~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~--sG~l~ 337 (468) T protein:vir:10 267 KGAQ----NNVANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAM--AGVLD 337 (468) T ss_pred heec----cccccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhh--cCcce Confidence 8876 2356789999999855 9999999999999999999999999999999999999999999998 99999 Q ss_pred cchhhhcc-----cccccccceeEEEecCcEEEEecCCCC----cceEEEEEecCCCccceeEeecccccccccccCCcc Q lcl|Aclame:pro 400 ASQGLQKT-----LNVDTTKAVFAGVLGGTYKVYIDQYAR----QDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKN 470 (524) Q Consensus 400 ~s~~~~~~-----~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s 470 (524) +++.++.. +++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+| T Consensus 338 ~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s 417 (468) T protein:vir:10 338 YSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT 417 (468) T ss_pred ecccccccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCc Confidence 99887643 479999999999999999999999964 899999999999999999999999999999999999 Q ss_pred ccceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 471 FQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 471 ~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |||+|||||||||++|||+...+...+ .+++.+|. +|+|.|||||+|||| T Consensus 418 fqP~~g~~tRY~l~~NP~~~~~~~~~g--~~~~~~~~--~~~N~y~r~~~v~~l 467 (468) T protein:vir:10 418 FQPKIGFKTRYGMVSNPFVTTNGLYNG--TPDGEALT--PNANMYYRRVQVTNL 467 (468) T ss_pred ccceeeeeeeeceeecccceeccccCC--Cccccccc--ccccceeeeEEEecc Confidence 999999999999999999965432222 35555655 489999999999999 No 15 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=1.4e-215 Score=1198.52 Aligned_cols=451 Identities=39% Similarity=0.680 Sum_probs=394.8 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) ||+ |+|+|||.||||| ||+|+|++.+||+|+++|||||||+|++++ ++|+|+ .|+||+++. T Consensus 1 ms~-~~l~~~w~~~l~~-~~~~~i~~~~~~~~~~~~~enq~~~~~~~~------------~~l~ea--~~~~g~~~~--- 61 (462) T protein:vir:10 1 MSI-QQLQEKWAPVLNH-ESVPEIKDSYKKGVVAQLLENQENAIREEG------------QVLNET--LQTTGYTTG--- 61 (462) T ss_pred Cch-HHHHHHhhhhhcc-cccchhhhhhHHHHHHHHhhhHHHHHHhcc------------cchhcc--ccccCCCcC--- Confidence 998 5899999999997 789999999999999999999999999887 689999 499999965 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCC---cccccchhhhhccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLA---GGTPADVREAFHPMFAPDTM 157 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~---~gteA~~nEAf~~~~~~dt~ 157 (524) +++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++.+ +++||+|+| +|+. T Consensus 62 -~~~t~~~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnE-------adt~ 133 (462) T protein:vir:10 62 -DTATGPVAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNE-------PNAG 133 (462) T ss_pred -cccccccccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhcc-------CCcC Confidence 57799999999999999999999999999999999999999999999999987643 467777776 4999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhh Q lcl|Aclame:pro 158 YSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVA 237 (524) Q Consensus 158 fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~a 237 (524) |||.++............. ......+++|...+.... .....+.++.||+|+.+ T Consensus 134 fSg~~~~~~~~~~~~~~~~-----------------------~~~~~~g~~~~~~~~~~~---g~~~~~~~~~GM~Ta~a 187 (462) T protein:vir:10 134 FSGGAGTGLSNYDPTASSS-----------------------AVNDAEGANPGLLNDSPA---GTYEVTGDATGMATATA 187 (462) T ss_pred ccccccccccccccccccc-----------------------cccccccccceeecCCCc---cceecccccccccchhc Confidence 9987655432221111100 011122333333332221 12234567889999999 Q ss_pred hhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Q lcl|Aclame:pro 238 ELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYT 317 (524) Q Consensus 238 Eal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~ 317 (524) |+|++ ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++ T Consensus 188 E~lg~--~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~ 265 (462) T protein:vir:10 188 EALDD--SSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVN 265 (462) T ss_pred cccCC--ccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhh Confidence 99973 5667899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccc Q lcl|Aclame:pro 318 AQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 397 (524) Q Consensus 318 a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~ 397 (524) |++++.+++ .++|+|||+++. .+||++|++|+|++||+++||+|+|+|+||+||||||||+||++|+| +|| T Consensus 266 a~~~k~~~~----~~~Gv~dl~~~~---~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~--sG~ 336 (462) T protein:vir:10 266 AVKGAIANT----ATDGIFDLDVDS---NGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGM--AGV 336 (462) T ss_pred heeeecccc----cccceeeecccc---chHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhh--ccc Confidence 999988765 347999998885 49999999999999999999999999999999999999999999998 899 Q ss_pred cccchhhhc---ccccccccceeEEEecCcEEEEecCC----CCcceEEEEEecCCCccceeEeecccccccccccCCcc Q lcl|Aclame:pro 398 TPASQGLQK---TLNVDTTKAVFAGVLGGTYKVYIDQY----ARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKN 470 (524) Q Consensus 398 ~~~s~~~~~---~~~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s 470 (524) +++.|+++. ..++||++.+|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+| T Consensus 337 l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s 416 (462) T protein:vir:10 337 LDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNT 416 (462) T ss_pred hhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCcc Confidence 988887653 45799999999999999999999998 67999999999999999999999999999999999999 Q ss_pred ccceeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 471 FQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 471 ~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |||+|||||||||++|||+.+.+++++ |++ +|+|.|||||+|||| T Consensus 417 fqP~~g~~tRY~l~~NP~t~~~~~~~~-~~~--------~~~n~y~r~~~v~~l 461 (462) T protein:vir:10 417 FQPKIGFKTRYGMVSNPFSGGLTQGSG-ALT--------ANANKYYRRVQVANL 461 (462) T ss_pred ccceeeeeeeeeeeecCCCCCcCCccc-ccc--------ccCcceeeeEEeecc Confidence 999999999999999999999999775 555 489999999999999 No 16 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=2.7e-212 Score=1180.51 Aligned_cols=449 Identities=41% Similarity=0.697 Sum_probs=392.7 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) ||+ |+|+|||.||||| ||+|+|++.|||+|+++|||||||+|++++ ++|+|| .|+||+++++ T Consensus 1 m~~-~~l~~~w~~~l~~-~~~~~i~~~~~~~~~~~~lenq~~~~~~~~------------~~l~ea--~~~~g~~~~s-- 62 (457) T protein:vir:10 1 MSF-QNLQEKWAPVLEH-DSLPEIGDSYKKGVVAQLLENQEKAIAEEG------------KILTET--LQTTGYTGGD-- 62 (457) T ss_pred Cch-HHHHHHhhHhhcc-CccchhhhhHHHHHHHHHhhhHHHHHHhcc------------cccccc--ccccCCCccc-- Confidence 998 5799999999997 889999999999999999999999999887 689999 4999999875 Q ss_pred cccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) Q Consensus 81 ~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG 160 (524) ++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++.... .+.+.||| ++++|+.||| T Consensus 63 --~~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~--~a~~~EAl--~nEadt~fSg 136 (457) T protein:vir:10 63 --TVTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPA--AAGYDEAF--FNEPNAGFSG 136 (457) T ss_pred --ccccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccc--ccccccee--eeccCcccCc Confidence 57899999999999999999999999999999999999999999999999886431 12334444 2346999998 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) Q Consensus 161 ~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal 240 (524) ..+....... .......+++|...+....+ ..+.++++.||+|+.+|.| T Consensus 137 ~~~~~~~~~~----------------------------~~~~~~~gt~~~~~~~~~~~---~~~~~~~~~gmsTA~aE~l 185 (457) T protein:vir:10 137 GPGAYDPGAT----------------------------GVTNDAEGTNPALLNDSPAG---TYEQADDATGMSTATVEAL 185 (457) T ss_pred cccccccccc----------------------------ccccccccccccccCccccc---cccccccccchhhhhhhcc Confidence 7544321110 00112234444444433322 3356789999999999999 Q ss_pred cccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|Aclame:pro 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) Q Consensus 241 ~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~ 320 (524) ++ +++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|++ T Consensus 186 gd--~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~ 263 (457) T protein:vir:10 186 DD--STANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVA 263 (457) T ss_pred CC--CCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhhee Confidence 63 5677899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccccc Q lcl|Aclame:pro 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) Q Consensus 321 ~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~ 400 (524) ++.+++.+ +|+|||+++.+ +||++|++|+|++||++++|+|+|+|+||+||||||||+||++|+| +||+.+ T Consensus 264 ~~~~~~~~----~gv~dl~~~~~---g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~--sg~l~~ 334 (457) T protein:vir:10 264 GAQNNTAT----AGVFDLDVDSN---GRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGM--AGVLDY 334 (457) T ss_pred eecccccc----ceeeeeecccc---chhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhh--cccccc Confidence 99877644 79999988854 9999999999999999999999999999999999999999999999 888888 Q ss_pred chhhhc---ccccccccceeEEEecCcEEEEecCCC----CcceEEEEEecCCCccceeEeecccccccccccCCccccc Q lcl|Aclame:pro 401 SQGLQK---TLNVDTTKAVFAGVLGGTYKVYIDQYA----RQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQP 473 (524) Q Consensus 401 s~~~~~---~~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP 473 (524) +|+++. ..++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||++++++||+|||| T Consensus 335 ~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP 414 (457) T protein:vir:10 335 TPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQP 414 (457) T ss_pred cchhhccccccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCccccc Confidence 887754 467899999999999999999999987 4899999999999999999999999999999999999999 Q ss_pred eeeeeeeeccEecCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 474 VMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 474 ~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) +|||||||||++|||+...+|+++ |++. |.|.|+||+.|+|| T Consensus 415 ~~g~~tRY~l~~NP~~~~~~~~~~-~~~~--------~~n~~~~rs~vs~l 456 (457) T protein:vir:10 415 KIGFKTRYGMVSNPFAGGLTQGSG-ALTV--------NANKYYRRVQVANL 456 (457) T ss_pred eeeeeeeeeeeecccccccccccc-cccc--------cchhhcceeeeeec Confidence 999999999999999999999876 5554 56789999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=4.5e-192 Score=1069.63 Aligned_cols=442 Identities=24% Similarity=0.337 Sum_probs=344.0 Q ss_pred CCch---HHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCcc Q lcl|Aclame:pro 1 MSKK---NELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQT 77 (524) Q Consensus 1 m~~~---~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~ 77 (524) ||++ |+|+|||+||||+ |++.|||+|+|+|||||+|| ++ ++|+|++ T Consensus 1 ~~~~~~~e~l~~kw~p~l~~------~~~~~~~~~~a~llenq~~~---~~------------~~l~e~~---------- 49 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEG------CRNDWERHTLATLLENQYRE---AK------------KHLMETT---------- 49 (523) T ss_pred CCcchhhHHHHHhhhhhhcc------cCChhHHHHHHHHhhhhhHH---HH------------Hhhhhhh---------- Confidence 9987 8999999999984 66779999999999999874 12 4788873 Q ss_pred ccccccccccccccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhh-------hhcc Q lcl|Aclame:pro 78 NIASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVRE-------AFHP 150 (524) Q Consensus 78 ~~~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nE-------Af~~ 150 (524) .+++|++|+| ||+||||++|||||+||||||||||||||||||||||.++. ++|++|+| +..+ T Consensus 50 ------~~~~~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~---gteA~yg~~~~~~~~a~~~ 119 (523) T protein:vir:59 50 ------QTTEVDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELP---GNGSVYGGTGLTTDTATGG 119 (523) T ss_pred ------hccccccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCC---CcccccCccccCccccccc Confidence 4778999996 99999999999999999999999999999999999999984 56676654 4455 Q ss_pred cccccccccccccccccccccc--------------ccccccccccc--c---ccccccccc------------------ Q lcl|Aclame:pro 151 MFAPDTMYSGEGAHTAFAKITT--------------GTAIATGAIVY--H---IFQETGIAY------------------ 193 (524) Q Consensus 151 ~~~~dt~fSG~g~~~~~s~~~~--------------gta~~~g~~~~--~---~~~~~~~~~------------------ 193 (524) +.++++.|++.+.+........ +.+...+.... . .....+... T Consensus 120 ~~ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga 199 (523) T protein:vir:59 120 LYDENARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPEN 199 (523) T ss_pred ccccccccccccccCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccc Confidence 6677777777554433211110 00000000000 0 000000000 Q ss_pred --------cc-------ccccc-CCcccccCcccccccccccccccccccccccccchhhhhccccC--CCCCcccccce Q lcl|Aclame:pro 194 --------FQ-------NVTSG-NVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFN--GSSANPWNEMA 255 (524) Q Consensus 194 --------~~-------~~~~g-~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~g--gss~~~f~EMs 255 (524) .. ....+ .....++++................++.+.||+|+.+|.++.++ ++++++|+||+ T Consensus 200 ~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~ 279 (523) T protein:vir:59 200 TVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEIN 279 (523) T ss_pred cccchhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhcccccccccccccccccccee Confidence 00 00000 00001111111111111222233468889999999999998755 57788999999 Q ss_pred eEEEEEEEEeecccccccccHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccce Q lcl|Aclame:pro 256 FRIDKQVIEARSRQLKAQYSVELAQDLRAVH-GMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAG 334 (524) Q Consensus 256 FsIEK~tVtAKSRALKAEYT~ELAQDLkAiH-GLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G 334 (524) |+||||+|||||||||||||||||||||||| |||||+||+||||||||+|||||||++|+++|++++.+++. ++| T Consensus 280 FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~----~~g 355 (523) T protein:vir:59 280 LELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFW----SEV 355 (523) T ss_pred eEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeecccc----ccc Confidence 9999999999999999999999999999999 99999999999999999999999999999999999886543 479 Q ss_pred eecccccccccccchH--------HHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhc Q lcl|Aclame:pro 335 SFDFQDPVDIRGARWA--------GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQK 406 (524) Q Consensus 335 ~fdl~~~~~~~~~~~~--------~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~ 406 (524) +|||.++.| ++|. +||+|+|+++||+|+|+|+|+|+||+||||||||+||++|++ +||++. T Consensus 356 ~~~~~~~~~---~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~--------~~~~~~ 424 (523) T protein:vir:59 356 VGEYYDETS---GNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLES--------MPGFTP 424 (523) T ss_pred eeeeccccc---chhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHh--------cccccc Confidence 999999865 4443 899999999999999999999999999999999999999985 666652 Q ss_pred --ccccccccceeEEEecCcEEEEecCCCCcceEEEEEecC-CCccceeEeeccccccccccc-CCccccceeeeeeeec Q lcl|Aclame:pro 407 --TLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGD-NEMDAGIYYAPYVALTPLRGS-DPKNFQPVMGFKTRYG 482 (524) Q Consensus 407 --~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~-~~~~~~~fyaPYv~~~~~~~~-dp~s~qP~~~~~tRY~ 482 (524) ....|++..+|+|+|+|||+||||||+++|||+|||||. +++|+|||||||||+.+++++ ||+||||+|||||||| T Consensus 425 ~~~~~~~~~~~~~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~ 504 (523) T protein:vir:59 425 GNDNRDGGTGIFYVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYA 504 (523) T ss_pred CCccccccccceeEEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehh Confidence 345677888999999999999999999999999999995 599999999999999999985 9999999999999999 Q ss_pred cEe-cCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 483 IGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 483 l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) |++ |||+.+.-- ||=| T Consensus 505 l~v~nP~~~~~~~--------------------------~~~~ 521 (523) T protein:vir:59 505 LEVVRPEFYGLLY--------------------------VKLL 521 (523) T ss_pred heecchhHhhhhh--------------------------hhhc Confidence 986 999865421 1111 No 18 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=98.01 E-value=1.3e-06 Score=52.83 Aligned_cols=407 Identities=15% Similarity=0.090 Sum_probs=129.1 Q ss_pred CCch----------------HHHHHHhhHhh---------------cccccc---------hhhcchhHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSKK----------------NELMEKWNDLL---------------ESQEGL---------PDIATKSKKQLVAAILEAQ 40 (524) Q Consensus 1 m~~~----------------~~l~~kw~p~l---------------~~~~~~---------~~i~~~~~~~~~~~l~enq 40 (524) |.+- +.+.|-....| -. ||- |..-.+-||.+-.-| T Consensus 32 ~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~-es~~t~~v~~~~P~Li~lvRRa~p~LI---- 106 (528) T protein:vir:66 32 VAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIA-AGQTTGAITNVGPAVIGMVRRAIPNLI---- 106 (528) T ss_pred hhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhcc-ccccccccccCchhHHHHHHHHHHhhh---- Confidence 1111 11221111111 11 110 111122333332222 Q ss_pred HHHHhccccccchhhhhhhcccc-----cccccccccccCccccccccccccccccCchhhhHHHHHHhhhhhhheeeee Q lcl|Aclame:pro 41 EKDAETDPVYRDEKIVESFGGFL-----AEAEIAGDHNYDQTNIASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQ 115 (524) Q Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~l-----~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQ 115 (524) |.|-..++-+-.-. .-+--..+.+..++..+....-++-+.|.+.- .-+ T Consensus 107 ---------a~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~-----------------t~~ 160 (528) T protein:vir:66 107 ---------AFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLA-----------------AKE 160 (528) T ss_pred ---------hhhhheeecCCchhhhheeeeeeecCCcccccccccccccccccccccccc-----------------ccc Confidence 33333333321100 00000001111111111111112222222211 111 Q ss_pred c-CCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 116 P-MTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 116 P-mTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) . ..||||||||||++|.++. .+++++|+|| |++|||..........+.......+..........+ .. T Consensus 161 a~~gGpTGliFAm~s~y~s~~--~g~ea~~nea-------~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~--~~ 229 (528) T protein:vir:66 161 ATVGSPTGTAFAKLTLSQAIT--AGDIVYHTFA-------ETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEG--KL 229 (528) T ss_pred ccccCCccceeeccccccccc--ccceeeeccc-------ccceeeeccccccccccCcccccccccccccccccc--cc Confidence 1 2689999999999998874 4788888886 888988654443322222211111111000000000 00 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .+...+-.+..+.....+.... .....+.+.-+.-.+.++-.. .-=+| ++||= -.-|||=- T Consensus 230 ~~~~~Gm~Ta~aEale~lg~~s-----~~~f~EMaFsIeK~tVtAKSR------aLKAE--YTiEL------AQDLKAIH 290 (528) T protein:vir:66 230 AEIAFGMATSIAEIQEGFNGSS-----NNPWAEMSMRIDKQVVEAKSR------QLKAR--YSIEV------AQDLRAVH 290 (528) T ss_pred eecccccchhhhhhhcccCCCc-----ccchhhcceEEEeEEEEeecc------ceecc--ccHHH------HHHHHHhc Confidence 0000011110000000000000 000111111111111111000 00001 11110 12356643 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeeccccccc-ccccchHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVD-IRGARWAGES 353 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~-~~~~~~~~e~ 353 (524) =+..-.-| ..-|++-+.-||-.||-+-|-.+..+.++...+++..+.+-. ..-|-.+... .-.++|.-.. T Consensus 291 GLDAEtEL--------sNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~-dl~~~~d~~g~rw~~e~~k~L 361 (528) T protein:vir:66 291 GMDADAEL--------NAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVF-DLQDPIDTRGARWAGESFKSL 361 (528) T ss_pred CCChHHHH--------HHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeecccccccee-ecccccccccchhHHHHHHHH Confidence 34333334 345888889999999997654667777776543332221100 1111112212 2345677776 Q ss_pred HHHHHHHHHHHHHHHHHhc---cccCCCE--EEEchh---------hhhhhhhhcccccccchhhhcccc--cccc---c Q lcl|Aclame:pro 354 YKALLIQIDKEANEIARQT---GRGAGNF--IIASRN---------VVSALARIDSGITPASQGLQKTLN--VDTT---K 414 (524) Q Consensus 354 ~r~L~~~i~~~a~~I~~~T---~~g~gn~--~v~S~~---------va~~L~~~~~g~~~~s~~~~~~~~--~d~~---~ 414 (524) ++.+-....++..+-.|-- --++.++ +++|.. ....+..=..+-+ +.+-++..+. .|.. . T Consensus 362 ~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~-~~G~l~~~~~vy~D~y~~~d 440 (528) T protein:vir:66 362 IYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAV-FAGVLAGKYKVFIDQYARQD 440 (528) T ss_pred HHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCce-eEEEecCceEEEecCCCCcc Confidence 6666666655555544411 0000010 111111 1111110000000 0011111100 0110 1 Q ss_pred ceeEEEecCcEE----EEecCCCCcceEEEEEecCCCcc-------ceeEeecccccc-cc---c---------ccCCcc Q lcl|Aclame:pro 415 AVFAGVLGGTYK----VYIDQYARQDYFTVGFKGDNEMD-------AGIYYAPYVALT-PL---R---------GSDPKN 470 (524) Q Consensus 415 ~~~~G~l~~~~~----vy~D~y~~~dy~~vG~KG~~~~~-------~~~fyaPYv~~~-~~---~---------~~dp~s 470 (524) ..-+|+= |-.. +|-=||.+..+. +++-=.+.++ =||.=-||+-.. .- | ....+- T Consensus 441 y~~vG~K-G~~~~~~glfyaPYv~l~~~-~~~dp~sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~ 518 (528) T protein:vir:66 441 YFTVGYK-GDNEMDAGIYYAPYVALTPL-RATDPQSFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNA 518 (528) T ss_pred eEEEEEe-CCcccccceeecccccceee-EeeCCccccceeeeeeeeceeecCcccccCccccccccccchhhhhcCccc Confidence 1111221 1111 122344443332 2222122111 123333443211 00 0 012222 Q ss_pred ccceeeeeee Q lcl|Aclame:pro 471 FQPVMGFKTR 480 (524) Q Consensus 471 ~qP~~~~~tR 480 (524) |--++++|.= T Consensus 519 ~~r~~~Vk~~ 528 (528) T protein:vir:66 519 YFRRVWVKGC 528 (528) T ss_pred eeEEeeeccC Confidence 3333333322 No 19 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.27 E-value=0.00011 Score=42.31 Aligned_cols=346 Identities=13% Similarity=0.072 Sum_probs=146.0 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHH-----HHHHHHHHHHHH------------------hccccc------c Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQL-----VAAILEAQEKDA------------------ETDPVY------R 51 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~-----~~~l~enq~~~~------------------~~~~~~------~ 51 (524) |++.++|.++...+.+ | +-++.+..+..+ ..+=|+++.+.+ ...+.. . T Consensus 1 M~~l~el~~~~~~~~~--e-~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSELALIQKAIEESQQ--K-MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHH--H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 9999999999877765 2 222322211110 001111211111 100000 0 Q ss_pred chhhhhhhcccccccccccccccCcccccccccccc-ccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeee Q lcl|Aclame:pro 52 DEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGA-ITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRA 129 (524) Q Consensus 52 ~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~-v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRs 129 (524) .++..+.+...+...... ......+.+-.+++++ -....|.++ .+++++..+..-.++|-++||++++.-+ . T Consensus 78 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~----~ 151 (385) T protein:vir:18 78 SERAAEELIKSWDGKQGT--FGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY----V 151 (385) T ss_pred HHHHHHHHHHHHHHhhcc--chhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE----E Confidence 011111111111111000 0000000000111111 001223333 4555556677788889999988765211 0 Q ss_pred eecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCc Q lcl|Aclame:pro 130 VYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADP 209 (524) Q Consensus 130 rY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p 209 (524) +..... . . ..| T Consensus 152 ~~~~~~---~------~---------a~~--------------------------------------------------- 162 (385) T protein:vir:18 152 REEVFT---N------N---------ADV--------------------------------------------------- 162 (385) T ss_pred EEecCC---c------c---------eee--------------------------------------------------- Confidence 110000 0 0 000 Q ss_pred ccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCC Q lcl|Aclame:pro 210 AALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMD 289 (524) Q Consensus 210 ~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLD 289 (524) .+| +..+++-..++++++.+.|.-+-...+|.||.||-- + T Consensus 163 --------------------------v~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~ 202 (385) T protein:vir:18 163 --------------------------VAE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----M 202 (385) T ss_pred --------------------------ecc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----H Confidence 001 012334445566677777777777889999999852 3 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 290 ADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIA 369 (524) Q Consensus 290 AEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~ 369 (524) .++.|.+-|+..|..-+|+.||.- . | ...++.|++......... ..... -..+..|..+...|. T Consensus 203 l~~~i~~~la~a~~~~~d~~~l~G--------~-g---~~~~~~Gi~~~~~~~~~~-~~~~~---~~~~d~i~~~~~~l~ 266 (385) T protein:vir:18 203 LQSYINNRLMYGLALKEEGQLLNG--------D-G---TGDNLEGLNKVATAYDTS-LNATG---DTRADIIAHAIYQVT 266 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc--------c-C---CCCccccccccccccccc-ccccc---cchHHHHHHHHHhhc Confidence 478888888888888888888841 0 0 011223443322111100 00000 112223333334442 Q ss_pred HhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCcc Q lcl|Aclame:pro 370 RQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMD 449 (524) Q Consensus 370 ~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 449 (524) ..+...+.+||||+....|...... .+. .+..+.. ...-++|.| ++|+++++.|..-+++|--- T Consensus 267 --~~~~~~~~~~~~~~~~~~l~~lkd~----~G~---~l~~~~~-~~~~~~l~G-~pV~~~~~~p~~~~~~gd~~----- 330 (385) T protein:vir:18 267 --ESEFSASGIVLNPRDWHNIALLKDN----EGR---YIFGGPQ-AFTSNIMWG-LPVVPTKAQAAGTFTVGGFD----- 330 (385) T ss_pred --cccCCCCEEEEcHHHHHHHHHhhcC----CCc---eeccCcc-cCCCceecc-eeeEEcCcCCCCcEEEeecc----- Confidence 3344678899999999988754321 110 0111111 111357777 79999999987666655210 Q ss_pred ceeEeecccc-ccccc----ccCC-ccccceeeeeeeecc-EecCccc-ccCCCccccccccc Q lcl|Aclame:pro 450 AGIYYAPYVA-LTPLR----GSDP-KNFQPVMGFKTRYGI-GINPFAN-SRSQAPADRITSGM 504 (524) Q Consensus 450 ~~~fyaPYv~-~~~~~----~~dp-~s~qP~~~~~tRY~l-~~nP~~~-~~~~~~~~~i~~~~ 504 (524) .++.. +.. ...+. ..|+ ..-+=.+-...||+. +.+|=+. ..+-+.+ . T Consensus 331 ~~~~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa------~ 385 (385) T protein:vir:18 331 MASQV--WDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG------S 385 (385) T ss_pred cEEEE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC------C Confidence 01111 111 00011 1111 111223334457776 3444111 1111111 1 No 20 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.27 E-value=0.00011 Score=42.31 Aligned_cols=346 Identities=13% Similarity=0.072 Sum_probs=146.0 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHH-----HHHHHHHHHHHH------------------hccccc------c Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQL-----VAAILEAQEKDA------------------ETDPVY------R 51 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~-----~~~l~enq~~~~------------------~~~~~~------~ 51 (524) |++.++|.++...+.+ | +-++.+..+..+ ..+=|+++.+.+ ...+.. . T Consensus 1 M~~l~el~~~~~~~~~--e-~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSELALIQKAIEESQQ--K-MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHH--H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 9999999999877765 2 222322211110 001111211111 100000 0 Q ss_pred chhhhhhhcccccccccccccccCcccccccccccc-ccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeee Q lcl|Aclame:pro 52 DEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGA-ITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRA 129 (524) Q Consensus 52 ~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~-v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRs 129 (524) .++..+.+...+...... ......+.+-.+++++ -....|.++ .+++++..+..-.++|-++||++++.-+ . T Consensus 78 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~----~ 151 (385) T protein:vir:19 78 SERAAEELIKSWDGKQGT--FGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY----V 151 (385) T ss_pred HHHHHHHHHHHHHHhhcc--chhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE----E Confidence 011111111111111000 0000000000111111 001223333 4555556677788889999988765211 0 Q ss_pred eecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCc Q lcl|Aclame:pro 130 VYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADP 209 (524) Q Consensus 130 rY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p 209 (524) +..... . . ..| T Consensus 152 ~~~~~~---~------~---------a~~--------------------------------------------------- 162 (385) T protein:vir:19 152 REEVFT---N------N---------ADV--------------------------------------------------- 162 (385) T ss_pred EEecCC---c------c---------eee--------------------------------------------------- Confidence 110000 0 0 000 Q ss_pred ccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCC Q lcl|Aclame:pro 210 AALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMD 289 (524) Q Consensus 210 ~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLD 289 (524) .+| +..+++-..++++++.+.|.-+-...+|.||.||-- + T Consensus 163 --------------------------v~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~ 202 (385) T protein:vir:19 163 --------------------------VAE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----M 202 (385) T ss_pred --------------------------ecc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----H Confidence 001 012334445566677777777777889999999852 3 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 290 ADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIA 369 (524) Q Consensus 290 AEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~ 369 (524) .++.|.+-|+..|..-+|+.||.- . | ...++.|++......... ..... -..+..|..+...|. T Consensus 203 l~~~i~~~la~a~~~~~d~~~l~G--------~-g---~~~~~~Gi~~~~~~~~~~-~~~~~---~~~~d~i~~~~~~l~ 266 (385) T protein:vir:19 203 LQSYINNRLMYGLALKEEGQLLNG--------D-G---TGDNLEGLNKVATAYDTS-LNATG---DTRADIIAHAIYQVT 266 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc--------c-C---CCCccccccccccccccc-ccccc---cchHHHHHHHHHhhc Confidence 478888888888888888888841 0 0 011223443322111100 00000 112223333334442 Q ss_pred HhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCcc Q lcl|Aclame:pro 370 RQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMD 449 (524) Q Consensus 370 ~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 449 (524) ..+...+.+||||+....|...... .+. .+..+.. ...-++|.| ++|+++++.|..-+++|--- T Consensus 267 --~~~~~~~~~~~~~~~~~~l~~lkd~----~G~---~l~~~~~-~~~~~~l~G-~pV~~~~~~p~~~~~~gd~~----- 330 (385) T protein:vir:19 267 --ESEFSASGIVLNPRDWHNIALLKDN----EGR---YIFGGPQ-AFTSNIMWG-LPVVPTKAQAAGTFTVGGFD----- 330 (385) T ss_pred --cccCCCCEEEEcHHHHHHHHHhhcC----CCc---eeccCcc-cCCCceecc-eeeEEcCcCCCCcEEEeecc----- Confidence 3344678899999999988754321 110 0111111 111357777 79999999987666655210 Q ss_pred ceeEeecccc-ccccc----ccCC-ccccceeeeeeeecc-EecCccc-ccCCCccccccccc Q lcl|Aclame:pro 450 AGIYYAPYVA-LTPLR----GSDP-KNFQPVMGFKTRYGI-GINPFAN-SRSQAPADRITSGM 504 (524) Q Consensus 450 ~~~fyaPYv~-~~~~~----~~dp-~s~qP~~~~~tRY~l-~~nP~~~-~~~~~~~~~i~~~~ 504 (524) .++.. +.. ...+. ..|+ ..-+=.+-...||+. +.+|=+. ..+-+.+ . T Consensus 331 ~~~~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa------~ 385 (385) T protein:vir:19 331 MASQV--WDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG------S 385 (385) T ss_pred cEEEE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC------C Confidence 01111 111 00011 1111 111223334457776 3444111 1111111 1 No 21 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.20 E-value=0.00013 Score=41.83 Aligned_cols=332 Identities=13% Similarity=0.138 Sum_probs=132.2 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhH----------------HHHHHHHH---HHHHHHHhccc-------ccc--- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSK----------------KQLVAAIL---EAQEKDAETDP-------VYR--- 51 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~----------------~~~~~~l~---enq~~~~~~~~-------~~~--- 51 (524) |.+.++|.++|..+-+. +-++...-+ +.-+..+. |.+.+.+.+.. .-+ T Consensus 1 Mk~~~el~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDK---VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 99999998888765432 111111000 00011110 11111111100 000 Q ss_pred -----chhhh----hhhcccccccccccccccCccc--ccccc-ccccccccCch-hh-hHHHHHHhhhhhhheeeeecC Q lcl|Aclame:pro 52 -----DEKIV----ESFGGFLAEAEIAGDHNYDQTN--IASGK-SSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPM 117 (524) Q Consensus 52 -----~~~~~----~~~~~~l~ea~~~g~~~~~~~~--~~~st-~sg~v~~~~P~-li-~l~Rra~~nLIa~DI~GVQPm 117 (524) ++... .+|..+|.. +...-. ...++ +.|.+. -|. +. .+++.+.++.+-.++|.++|| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~l~~-------~~~~~~~~~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 148 (397) T protein:vir:49 78 PLTKSEEEVKAGFVKDFKNLVRG-------RYQNLLDSKTDASGSDAGLT--IPQDIQTAIHTLVSQYDSLQEYVNVENV 148 (397) T ss_pred ccccchhHHHHHHHHHHHHHHhc-------chhHHHHHhhccccccCccc--ccHhHHHHHHHHHHhhhhHHhhhceeec Confidence 00000 001111111 100000 00111 112221 122 11 455555567788889999999 Q ss_pred CchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 118 TGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNV 197 (524) Q Consensus 118 TgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~ 197 (524) ++++|-+.-++- .+.. +. +.|-+ T Consensus 149 ~~~~~~~~~~~~--~~~~---~~---------------a~~v~------------------------------------- 171 (397) T protein:vir:49 149 TTLTGSRVYEKW--TDIT---GL---------------ANIDD------------------------------------- 171 (397) T ss_pred ccCccceEEEee--ccCC---cc---------------eeeec------------------------------------- Confidence 998874321111 1100 00 00000 Q ss_pred cccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHH Q lcl|Aclame:pro 198 TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVE 277 (524) Q Consensus 198 ~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~E 277 (524) +| +.. ...+...|.++.|++.|. +-...+|-| T Consensus 172 --------------------------------E~------~~~---~~~~~~~~~~i~~~~~k~-------~~~~~iS~e 203 (397) T protein:vir:49 172 --------------------------------EA------GKI---ADVDDPKLSLIKYTIKRY-------AGISTVTNS 203 (397) T ss_pred --------------------------------Cc------ccc---ccccccceeeEEeeeeeE-------EeeehhHHH Confidence 00 000 000112344444544444 445679999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHH Q lcl|Aclame:pro 278 LAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKAL 357 (524) Q Consensus 278 LAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L 357 (524) |.+|-. .|.+++|.+-|+..|..-+|+.|+.-.-... ...|+.++ +-...| T Consensus 204 ll~ds~----~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~------------~~~~~~~~-------------d~i~~~ 254 (397) T protein:vir:49 204 LLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIAALP------------TKPTLTKW-------------DDIIDL 254 (397) T ss_pred HHhhhH----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------cccccccH-------------HHHHHH Confidence 999852 5679999999999999999999986221111 12233332 223444 Q ss_pred HHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe--cCCCCc Q lcl|Aclame:pro 358 LIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI--DQYARQ 435 (524) Q Consensus 358 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~ 435 (524) +..|... +.....+|++|.....|..+...- + +-....+.+ ....++|.| ++|++ +...+. T Consensus 255 ~~~l~~~---------~~~~a~~vmn~~~~~~l~~lkd~~----G--~~l~~~~~~-~~~~~~l~G-~PV~~~~~~~~~~ 317 (397) T protein:vir:49 255 EAKVDPA---------IKQTSFFLTNTSGFTALKKVKNAL----G--DYLMERDVK-SPTGYSIDG-FAVKEVADRWLAN 317 (397) T ss_pred HHhhhhh---------hcCCCEEEEcHHHHHHHHHhhcCC----C--ceeeccCcC-CCCCceecc-eeeEEeccccccc Confidence 4444321 224578899999999987642111 1 001111111 112357877 57775 222211 Q ss_pred ----c-eEEEE---------EecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cC--ccc---ccCCC Q lcl|Aclame:pro 436 ----D-YFTVG---------FKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NP--FAN---SRSQA 495 (524) Q Consensus 436 ----d-y~~vG---------~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP--~~~---~~~~~ 495 (524) + -+++| .++..+ +=+.+|.. .+-...+-.+-...|++..+ || |.. ...-+ T Consensus 318 ~~~~~~~i~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 387 (397) T protein:vir:49 318 GTGGAMPLYFGDLKQAVTLFDRQHMS----LLSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIAD 387 (397) T ss_pred ccCCceeEEEeeccceEEEEeecceE----EEEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeecccC Confidence 1 12222 221111 11122211 01122233344445555432 22 100 00000 Q ss_pred ccccccccchHHhhccchhhhhhhhc Q lcl|Aclame:pro 496 PADRITSGMISKEMCGKNAYFRKVWV 521 (524) Q Consensus 496 ~~~~i~~~~~~~~~a~~~~~~~~~~V 521 (524) +.. ..++ +.| T Consensus 388 ~~~--~~~~--------------~~~ 397 (397) T protein:vir:49 388 QKG--NLGS--------------TAV 397 (397) T ss_pred CCC--Cccc--------------ccC Confidence 000 0000 000 No 22 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=96.37 E-value=0.00069 Score=37.92 Aligned_cols=309 Identities=15% Similarity=0.058 Sum_probs=128.5 Q ss_pred HHHHHHHHHHHHhccccccchhhhhhhccccccccc--ccccccCccccccccccccccccCchhh-hHHHHHHhhhhhh Q lcl|Aclame:pro 33 VAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEI--AGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAF 109 (524) Q Consensus 33 ~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~--~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~ 109 (524) +|.| +|-.. .|....+ ...++.++ ..-+.+. .+++.+.+..+-. T Consensus 1 ~~~~---------------------------~e~~~~~~~~~~~~---~~~~~~~~---liP~~~~~~ii~~~~~~s~l~ 47 (338) T protein:vir:78 1 MATL---------------------------NELAPNTAGSNHQG---RLAHVPSD---LLPKEIVGPIFDKAQESSLVL 47 (338) T ss_pred Ccch---------------------------HHhhhhhccccccc---ceeccccc---ccchHHHHHHHHHHHhhchhh Confidence 2222 22211 1111110 01111111 2222222 4566666778888 Q ss_pred heeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 110 DICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQET 189 (524) Q Consensus 110 DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~ 189 (524) .+|.+.||+++..-|.-. .... .+.+-+.+. T Consensus 48 ~l~~~~~~~~~~~~ip~~----~~~~-------------------~a~~v~~~~-------------------------- 78 (338) T protein:vir:78 48 RLGENIPISYGETIIPTT----VKRP-------------------EVGQVGVGT-------------------------- 78 (338) T ss_pred hhcceeeccCCceEEEEE----ecCc-------------------cceeecccc-------------------------- Confidence 999999998864332211 1110 011100000 Q ss_pred cccccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeeccc Q lcl|Aclame:pro 190 GIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQ 269 (524) Q Consensus 190 ~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRA 269 (524) ....+| +...++-.-+++.++...+..+ T Consensus 79 -------------------------------------------~~~~~E---------g~~~~~~~~~f~~v~l~~~k~~ 106 (338) T protein:vir:78 79 -------------------------------------------SNEQRE---------GGTKPLSGTAWDTRSVAPIKLA 106 (338) T ss_pred -------------------------------------------cccccc---------cccccccccceeEEEEEEEEEE Confidence 000011 0112333334455555555555 Q ss_pred ccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccch Q lcl|Aclame:pro 270 LKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARW 349 (524) Q Consensus 270 LKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~ 349 (524) -...+|-||.+|- ..|.+++|.+-|+..|...||..||.=.-...-.+..++..... ..+.... ..-+ T Consensus 107 ~~~~is~ell~ds----~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~-~~~~~~~-------~~~~ 174 (338) T protein:vir:78 107 TIVTVSEEFARMN----PSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNV-IVNTTNV-------DYLQ 174 (338) T ss_pred EeehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccc-ccccccc-------cccc Confidence 6677899999983 36789999999999999999999985211100000011110000 0000000 0001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe Q lcl|Aclame:pro 350 AGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI 429 (524) Q Consensus 350 ~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 429 (524) .. ...++..+.++...|...=.+ ..+.+|++|+....|..+. ...+..+. ....++....-.++|.| ++||+ T Consensus 175 ~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~m~~~~~~~L~~~~-~l~d~~g~---~l~~~~~~~~~~~~l~G-~PV~~ 246 (338) T protein:vir:78 175 TG--TTPLLDRFLDGYDLVSANTDV-DFNGWAADPRYRARLLRSQ-AYRDANGN---VDPTRINLAASAGDLLG-LPVQF 246 (338) T ss_pred cc--chhhHHHHHHHHHHhhhhccc-cceEEEEchHHHHHHHHHh-hhccCCCc---eeecccccCCCCceeee-eeEEE Confidence 10 123344444444444332222 5578999999888775431 11111110 00001111111357787 59998 Q ss_pred cCCCCcc---------eEEEE--------EecCCCccceeEeecccccccccccCCcc-----cc---ceeeeeeeecc- Q lcl|Aclame:pro 430 DQYARQD---------YFTVG--------FKGDNEMDAGIYYAPYVALTPLRGSDPKN-----FQ---PVMGFKTRYGI- 483 (524) Q Consensus 430 D~y~~~d---------y~~vG--------~KG~~~~~~~~fyaPYv~~~~~~~~dp~s-----~q---P~~~~~tRY~l- 483 (524) +.+.+.+ -+++| ..+.-.. =..+| .......||.. || =.+=...|++. T Consensus 247 ~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~ 320 (338) T protein:vir:78 247 GKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRV----KMSDT--ATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWL 320 (338) T ss_pred ccccCccccccCCcccEEEEEecceEEEEeecccEE----EEeec--ccccccccccccchhhhhcCcEEEEEEEEeccE Confidence 8775421 13333 2211110 00011 11111223321 11 12223568874 Q ss_pred EecCcccccCCCccccccccchHHh Q lcl|Aclame:pro 484 GINPFANSRSQAPADRITSGMISKE 508 (524) Q Consensus 484 ~~nP~~~~~~~~~~~~i~~~~~~~~ 508 (524) ..||= ...++.++..-++ T Consensus 321 v~~~~-------a~~~l~~~~~~~~ 338 (338) T protein:vir:78 321 LGDKQ-------AFVKFVDDEDPDA 338 (338) T ss_pred eeccc-------ceEEEecccCCCC Confidence 34441 1224444433333 No 23 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=95.69 E-value=0.0016 Score=35.89 Aligned_cols=354 Identities=11% Similarity=0.010 Sum_probs=132.6 Q ss_pred CC--------------chHHHHHHhhHhhccccc-chhhcchhHHHHHHH--HHHHHHHHHhccc-------cccchh-- Q lcl|Aclame:pro 1 MS--------------KKNELMEKWNDLLESQEG-LPDIATKSKKQLVAA--ILEAQEKDAETDP-------VYRDEK-- 54 (524) Q Consensus 1 m~--------------~~~~l~~kw~p~l~~~~~-~~~i~~~~~~~~~~~--l~enq~~~~~~~~-------~~~~~~-- 54 (524) +- +.+++.++...-++.-+. ..+++.. ...+.+. -|+.+.+++.... ....++ T Consensus 21 ~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~-~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~ 99 (418) T protein:vir:10 21 EQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKAT-VDELLIKQGELQARLLEAEQKLARGGGSAELETPKTL 99 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Confidence 11 112222222211110000 1111110 0000000 0111111111110 000000 Q ss_pred --------hhhhhcccccccccccc---cccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhh Q lcl|Aclame:pro 55 --------IVESFGGFLAEAEIAGD---HNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (524) Q Consensus 55 --------~~~~~~~~l~ea~~~g~---~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTG 122 (524) -...+..++.+....-. .-..-.....+++++.-...-|.+. .+++.+.+..+-.++|.+-||++++. T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 179 (418) T protein:vir:10 100 GQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSI 179 (418) T ss_pred hHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCce Confidence 00111111111100000 0000000011111111111222222 45555667778888899999887642 Q ss_pred hheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|Aclame:pro 123 QVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNV 202 (524) Q Consensus 123 LIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~ 202 (524) - |.-... .+. .+.| T Consensus 180 ~-------~~~~~~-~~~--------------~a~~-------------------------------------------- 193 (418) T protein:vir:10 180 E-------YTVETG-FTN--------------NAAA-------------------------------------------- 193 (418) T ss_pred e-------EEEEec-CCC--------------ceee-------------------------------------------- Confidence 1 111000 000 0000 Q ss_pred cccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHH Q lcl|Aclame:pro 203 TVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDL 282 (524) Q Consensus 203 ~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDL 282 (524) .+| +...++-..++++++..+|.-+-...+|-||.||. T Consensus 194 ---------------------------------v~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds 231 (418) T protein:vir:10 194 ---------------------------------VAE---------GAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA 231 (418) T ss_pred ---------------------------------ecc---------CccccccccceeeEEEeeeeEEEeehhhHHHHHhH Confidence 001 00122223455667777777677788999999986 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHH Q lcl|Aclame:pro 283 RAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQID 362 (524) Q Consensus 283 kAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~ 362 (524) - |.++.|.+-|+..|..-+|+-||.=-- +...+.|++..........+--. ...+..|. T Consensus 232 ~-----~l~~~i~~~l~~a~~~~~d~a~l~G~g------------~~~~p~Gi~~~~~~~~~~~~~~~----~~~~~~i~ 290 (418) T protein:vir:10 232 P-----ALQSYIDGRARYGLQLTEEGQILKGDG------------TGANILGILPQASAFMPSITLAN----ATPIDKIR 290 (418) T ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHhccCC------------CCccccccccccccccccccccc----cccHHHHH Confidence 2 468889999999999988888874100 00112343322211100000000 01122223 Q ss_pred HHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEE Q lcl|Aclame:pro 363 KEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGF 442 (524) Q Consensus 363 ~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 442 (524) .+...+. ..+...+.+||+|.....|...... .+. ....+.+. .-.|+|.| ++|+++++.+.+-+++|- T Consensus 291 ~~~~~~~--~~~~~~~~~v~n~~~~~~L~~lkd~----~G~---~i~~~~~~-~~~~~l~G-~pV~~~~~~p~~~~~~gd 359 (418) T protein:vir:10 291 LALLQAV--LAEFPATGIVLNPIDWASIELTKDS----QGR---YIVGNPVN-GTTPRLWN-LPVVETQAMTANEFLVGA 359 (418) T ss_pred HHHHhhc--cccCCCCEEEEcHHHHHHHHHhhcC----CCc---eecccccc-CCCceecc-eeeEEcCCCCCCcEEEee Confidence 3333332 2344667899999999888754221 110 01111111 11367877 799999998876565552 Q ss_pred ecCC-----CccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 443 KGDN-----EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 443 KG~~-----~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) --.. ..+-.+=..||... +-...+=.+=+..|++..+ +|=+ +.-+.-....+| T Consensus 360 ~s~~~~~~~~~~~~i~~~~~~~~------~f~~~~~~~r~~~~~d~~~~~~~a----------~~~~~~~~~~~g 418 (418) T protein:vir:10 360 FSMAAQIFDRMEIEVLLSTENVD------DFEKNMVSIRAEERLALAVYRPES----------FVTGALVEQAGG 418 (418) T ss_pred ccceEEEEEecceEEEEecccch------hhhcCceEEEEEEeeccEEecccc----------eEEEEeccCCCC Confidence 1100 00000111111110 0112222333455776543 2311 111111122233 No 24 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=95.61 E-value=0.0018 Score=35.69 Aligned_cols=332 Identities=15% Similarity=0.170 Sum_probs=135.3 Q ss_pred CCchHHHHHHhhHhhcccccchhhcch-------------hHHHHHHHH---------HHHHHHHHhcccc--------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATK-------------SKKQLVAAI---------LEAQEKDAETDPV--------- 49 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~-------------~~~~~~~~l---------~enq~~~~~~~~~--------- 49 (524) |.+.++|.+.|..+.+. +.++... .-+++.+.| ++.+.++.++... T Consensus 1 Mk~~~eL~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDK---VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 99999999999888763 2222111 001111111 1111111111110 Q ss_pred --ccchh-----hhhhhcccccccccccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCch Q lcl|Aclame:pro 50 --YRDEK-----IVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGP 120 (524) Q Consensus 50 --~~~~~-----~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgP 120 (524) .+.+. -...|..+|... ...........+++.|.+. . |.-+ .+++.+-++..-.+++.|+||++. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~l~~~----~~~~~~~~~~~t~~~gg~~-i-P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 151 (397) T protein:vir:49 78 PLTKNEEEVKANFVKDFKNLVRGR----YQNLLDSKTDGSGSDAGLT-I-PQDIRTAINTLVRQFDSLQEYVNVENVTTL 151 (397) T ss_pred cccchhhHHHHHHHHHHHHHhhcc----hhhHHHhhhccCCccCcce-e-cHHHHHHHHHHHHhhhhHhhhcceeeccCC Confidence 00000 001111111110 0000000000011112111 1 2222 355555667778899999999987 Q ss_pred hhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 121 TGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSG 200 (524) Q Consensus 121 TGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g 200 (524) +|-+- |.......+ . +.|-+ T Consensus 152 ~~~~~-----~~~~~~~~~------~---------a~~v~---------------------------------------- 171 (397) T protein:vir:49 152 TGSRV-----YEKWADITG------L---------AKLDD---------------------------------------- 171 (397) T ss_pred cceEE-----EEeeccCCc------c---------eeeec---------------------------------------- Confidence 75421 111100000 0 00000 Q ss_pred CCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccce-eEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 201 NVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMA-FRIDKQVIEARSRQLKAQYSVELA 279 (524) Q Consensus 201 ~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMs-FsIEK~tVtAKSRALKAEYT~ELA 279 (524) | +..+++-. -+++.++..++.-+-...+|-||. T Consensus 172 -------------------------------------E---------~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 205 (397) T protein:vir:49 172 -------------------------------------E---------GGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLL 205 (397) T ss_pred -------------------------------------c---------ccccccccccceeeeEeeeeeeEeehhhHHHHH Confidence 0 00011111 123444444444455577999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHH Q lcl|Aclame:pro 280 QDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLI 359 (524) Q Consensus 280 QDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~ 359 (524) +|-. +|.+++|.+-|+..|..-+|+.||.=. | +..+..+++++ +-...|+. T Consensus 206 ~ds~----~~l~~~i~~~l~~~~~~~~d~ail~G~---------g---~~~~~~~~~~~-------------d~i~~~~~ 256 (397) T protein:vir:49 206 ADSA----ENILAWLSGWIAKKVVVTRNKAILEAI---------G---TLPNKPTLAKW-------------DDIIDLQA 256 (397) T ss_pred hhhh----HHHHHHHHHHHHHHHHHHHHHHHHhcc---------c---cccccccccCH-------------HHHHHHHH Confidence 9853 567999999999999999999998511 0 11112233322 11233333 Q ss_pred HHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe--cCCCCc-- Q lcl|Aclame:pro 360 QIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI--DQYARQ-- 435 (524) Q Consensus 360 ~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~-- 435 (524) .+. +.+.....+|++|.....|..+...-..+ ....+.+ ....++|.| ++|++ |...+. T Consensus 257 ~l~---------~~~~~~a~~v~n~~~~~~l~~lkd~~g~~------l~~~~~~-~g~~~~l~G-~pV~~~~~~~~~~~~ 319 (397) T protein:vir:49 257 KVD---------PAIKQTSLFLTNTSGFTALKKVKNAMGDY------LMERDVK-SPTGYSIDG-FVVKEISDRFLPNGT 319 (397) T ss_pred hhh---------hhhcCCCEEEEcHHHHHHHHHhhccCCce------eeccccc-CCCCceecc-eeeEEeccccccccc Confidence 332 22335578999999999887643221100 0001111 111257887 46664 222221 Q ss_pred ------------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cC-------cccccCCC Q lcl|Aclame:pro 436 ------------DYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NP-------FANSRSQA 495 (524) Q Consensus 436 ------------dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP-------~~~~~~~~ 495 (524) +|++++..+.-. +-..||.. .+-...+=.+-...|++..+ +| ++...++. T Consensus 320 ~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 320 GGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQK 389 (397) T ss_pred CCceeEEEeeccceEEEEeecccE----EEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEEeccccccc Confidence 122222222111 11222211 01122333344445555432 22 11122222 Q ss_pred ccccccccchHHhhccc Q lcl|Aclame:pro 496 PADRITSGMISKEMCGK 512 (524) Q Consensus 496 ~~~~i~~~~~~~~~a~~ 512 (524) +..+. .|. T Consensus 390 ~~~~~---------~~~ 397 (397) T protein:vir:49 390 AKLST---------AGA 397 (397) T ss_pred Ccccc---------cCC Confidence 22111 111 No 25 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=95.58 E-value=0.0018 Score=35.62 Aligned_cols=344 Identities=13% Similarity=0.051 Sum_probs=137.2 Q ss_pred CCch-HHHHHHhhHhhcccccchhhcch----------hHH---HHHHHH--HHHHHHHHh----ccc--cccchhhhhh Q lcl|Aclame:pro 1 MSKK-NELMEKWNDLLESQEGLPDIATK----------SKK---QLVAAI--LEAQEKDAE----TDP--VYRDEKIVES 58 (524) Q Consensus 1 m~~~-~~l~~kw~p~l~~~~~~~~i~~~----------~~~---~~~~~l--~enq~~~~~----~~~--~~~~~~~~~~ 58 (524) |++. ++|++.+..+.+. +.++... -++ .+.+.+ |+.+.+++. +.. .-..+.-... T Consensus 1 m~~l~~~l~~~~~~~~~~---~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:81 1 MTDITSKLEATLANVTDS---LRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHHHHHHHHHHHHHHH---HHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 9988 4477777777653 2222111 011 111111 112211111 100 0000000000 Q ss_pred hcccccccc----c--cccccc--------Cccccc-cccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhh Q lcl|Aclame:pro 59 FGGFLAEAE----I--AGDHNY--------DQTNIA-SGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (524) Q Consensus 59 ~~~~l~ea~----~--~g~~~~--------~~~~~~-~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTG 122 (524) .+....+.+ . .+.... ...+.. .++++.+-....|..+ .++++.-+..+-.++|.+.||++++. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 157 (390) T protein:vir:81 78 VGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALI 157 (390) T ss_pred chhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCce Confidence 011110100 0 000000 000000 0011111112233333 45555556677778888888887652 Q ss_pred hheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|Aclame:pro 123 QVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNV 202 (524) Q Consensus 123 LIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~ 202 (524) -+ .......+ + ..| T Consensus 158 ~~-------~~~~~~~~------~---------a~~-------------------------------------------- 171 (390) T protein:vir:81 158 EY-------VQETGFVN------N---------AAI-------------------------------------------- 171 (390) T ss_pred EE-------EEEecCCc------c---------eee-------------------------------------------- Confidence 11 11100000 0 000 Q ss_pred cccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHH Q lcl|Aclame:pro 203 TVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDL 282 (524) Q Consensus 203 ~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDL 282 (524) .+| +..+++-..++++++.+.|.-+-...+|-||.+|- T Consensus 172 ---------------------------------v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~ 209 (390) T protein:vir:81 172 ---------------------------------VAE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA 209 (390) T ss_pred ---------------------------------ecC---------CcccccccceeeEEEEeeeEEEEeehhhHHHHHhH Confidence 000 00122222334444444444445567899999984 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccc---cccchHHHHHHHHHH Q lcl|Aclame:pro 283 RAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI---RGARWAGESYKALLI 359 (524) Q Consensus 283 kAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~---~~~~~~~e~~r~L~~ 359 (524) . +.++.|.+-|+..|...+|+.||.- . | ++..+.|++........ ..+....+....++. T Consensus 210 --~---~~~~~i~~~l~~~~~~~~d~a~l~G--------~-g---~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (390) T protein:vir:81 210 --P---QLASYMNNRLIRGLKVKEDAEILRG--------T-G---ANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAML 272 (390) T ss_pred --H---HHHHHHHHHHHHHHHHHHHHHHHhc--------C-C---CCCcccceeecccccccccccccchhHHHHHHHHH Confidence 2 4699999999999999999988841 0 0 01123344433221111 111122233333333 Q ss_pred HHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEE Q lcl|Aclame:pro 360 QIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFT 439 (524) Q Consensus 360 ~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 439 (524) ++. ..+...+.+|++|.....|.....+ .+. ....+... .-.++|.| ++|++.+..|.+-++ T Consensus 273 ~~~---------~~~~~~~~~v~~~~~~~~l~~lkd~----~G~---~l~~~~~~-~~~~~l~G-~pv~~~~~~p~~~~~ 334 (390) T protein:vir:81 273 QAS---------LAEYNPSGIVINPIDWAAIELAKDA----NNQ---YLIGNARG-TLTPTLWG-LPVVATQAMAPGEFL 334 (390) T ss_pred hhc---------cccCCCCEEEEcHHHHHHHHHhhcC----CCc---eeecCccc-ccCceecc-eeeEEcCCCCCCcEE Confidence 322 2233567889999998888753211 110 01111111 11246776 699999998876666 Q ss_pred EEEecCCCccceeEeecccccccccccC-C---ccccceeeeeeeecc-EecCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 440 VGFKGDNEMDAGIYYAPYVALTPLRGSD-P---KNFQPVMGFKTRYGI-GINPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 440 vG~KG~~~~~~~~fyaPYv~~~~~~~~d-p---~s~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) +|---. .++.. ......+...+ + .+-+=.+=...|++. +.+|=+ ..+++= | T Consensus 335 ~gd~~~-----~~~~~-~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a-------~v~~t~-------a 390 (390) T protein:vir:81 335 VGAFDL-----AAQIF-DQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEA-------LISGSF-------A 390 (390) T ss_pred EEehhc-----eEEEE-EecceEEEEecccchhhcCcEEEEEEEeeccEEecccc-------eEEEEe-------C Confidence 553210 00100 00011111111 1 112223335566665 333311 111110 1 No 26 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.56 E-value=0.0018 Score=35.57 Aligned_cols=360 Identities=15% Similarity=0.124 Sum_probs=142.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhH-----------HHHHHHH--HHHHHHHHhccc------------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSK-----------KQLVAAI--LEAQEKDAETDP------------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~-----------~~~~~~l--~enq~~~~~~~~------------------- 48 (524) |...++|.++=..+++.. .+++..-+ +.+...+ |+.|.+.+.+.. T Consensus 1 mk~~~el~~~l~el~~~~---~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQI---DLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CchHHHHHHHHHHHHHHH---HHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 888777777766664421 11111100 0111111 111111111100 Q ss_pred ----cccchhhhhhhccccccccccc-------ccccCccccc-cccccccccccCchhh--hHHHHHHhhhhhhheeee Q lcl|Aclame:pro 49 ----VYRDEKIVESFGGFLAEAEIAG-------DHNYDQTNIA-SGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGV 114 (524) Q Consensus 49 ----~~~~~~~~~~~~~~l~ea~~~g-------~~~~~~~~~~-~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GV 114 (524) ..++..-...+...+.+-...+ .......... .++++.+-...-|.-+ .+++++..+..-.+++.| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~ 157 (415) T protein:vir:81 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) T ss_pred chhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheee Confidence 0000000000011111100000 0000000000 0111111111124333 455666677888999999 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) +||++..+-+--.+. ... . + ..|-+. T Consensus 158 ~~~~~~~~~~~~~~~--~~~-----~-----~---------~~~v~E--------------------------------- 183 (415) T protein:vir:81 158 KRVTNGSGKYPVVRQ--SEV-----A-----A---------LEKVEE--------------------------------- 183 (415) T ss_pred eeccCCceeEEEEee--cCC-----c-----c---------ceeecc--------------------------------- Confidence 999887764322211 000 0 0 000000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .+ +. ...+...|.+..|.+.|. +-...+ T Consensus 184 ----------~~-------------------------------~~----~~~~~~~~~~v~~~~~k~-------~~~~~i 211 (415) T protein:vir:81 184 ----------LE-------------------------------EN----PELAVKPFFQLAYDINTH-------RGYFRI 211 (415) T ss_pred ----------cc-------------------------------cc----CcccccceeeEEeeeeee-------Eeeehh Confidence 00 00 000011244444544444 445669 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |-||.+|- ..|.+++|.+-|+..|..-+|+.|+.-.-...-.+ +..... ..++ ..... +.-..+.. T Consensus 212 S~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~~~~~--~~~~-----~~~~~-~~~~~~~i 277 (415) T protein:vir:81 212 SREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TSSGFE--KEGK-----KLEVK-KAKSLDDI 277 (415) T ss_pred hHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--cccccc--cccc-----ccccc-cccchhHH Confidence 99999984 35679999999999999999999996432211111 000000 0000 00000 00011222 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR 434 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 434 (524) ..++.. +.. .+-+.+.+||+|.....|..+...-..+ ....+.+ ....++|.| ++|++.++.+ T Consensus 278 ~~~~~~-------~~~--~~~~~~~~v~n~~~~~~l~~lkd~~G~~------l~~~~~~-~~~~~~l~G-~pV~~~~~~~ 340 (415) T protein:vir:81 278 KDAINL-------NVK--PNYEHNVAIVSQTMFAKLDKMKDKLGNY------LIQPDVK-EKTQQRLLG-AKIEILPDEV 340 (415) T ss_pred HHHHHh-------hhh--hccCCCEEEEcHHHHHHHHHhhccCCce------eeccCcC-CCCCceecc-eeeEEecccc Confidence 333333 222 1225678899999988887542221100 0001111 112357877 6888876654 Q ss_pred cceEEEEEecCCCccceeEeec----cc----ccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccch Q lcl|Aclame:pro 435 QDYFTVGFKGDNEMDAGIYYAP----YV----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMI 505 (524) Q Consensus 435 ~dy~~vG~KG~~~~~~~~fyaP----Yv----~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~ 505 (524) .. -.|+ ..++|+- |+ ....+...|-.+++..+....|++..+ +|=+...-.-.. -.--.++ T Consensus 341 ~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~-~~~~~~~ 410 (415) T protein:vir:81 341 LG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDD-SERGEGD 410 (415) T ss_pred cC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEec-cCCCCCc Confidence 21 0111 1122221 11 111122345567777888888998643 442110000000 0001122 Q ss_pred HHhhc Q lcl|Aclame:pro 506 SKEMC 510 (524) Q Consensus 506 ~~~~a 510 (524) ...-+ T Consensus 411 ~~~~~ 415 (415) T protein:vir:81 411 LGLEA 415 (415) T ss_pred cccCC Confidence 22222 No 27 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.56 E-value=0.0018 Score=35.57 Aligned_cols=360 Identities=15% Similarity=0.124 Sum_probs=142.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhH-----------HHHHHHH--HHHHHHHHhccc------------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSK-----------KQLVAAI--LEAQEKDAETDP------------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~-----------~~~~~~l--~enq~~~~~~~~------------------- 48 (524) |...++|.++=..+++.. .+++..-+ +.+...+ |+.|.+.+.+.. T Consensus 1 mk~~~el~~~l~el~~~~---~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQI---DLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CchHHHHHHHHHHHHHHH---HHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 888777777766664421 11111100 0111111 111111111100 Q ss_pred ----cccchhhhhhhccccccccccc-------ccccCccccc-cccccccccccCchhh--hHHHHHHhhhhhhheeee Q lcl|Aclame:pro 49 ----VYRDEKIVESFGGFLAEAEIAG-------DHNYDQTNIA-SGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGV 114 (524) Q Consensus 49 ----~~~~~~~~~~~~~~l~ea~~~g-------~~~~~~~~~~-~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GV 114 (524) ..++..-...+...+.+-...+ .......... .++++.+-...-|.-+ .+++++..+..-.+++.| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~ 157 (415) T protein:vir:79 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) T ss_pred chhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheee Confidence 0000000000011111100000 0000000000 0111111111124333 455666677888999999 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) +||++..+-+--.+. ... . + ..|-+. T Consensus 158 ~~~~~~~~~~~~~~~--~~~-----~-----~---------~~~v~E--------------------------------- 183 (415) T protein:vir:79 158 KRVTNGSGKYPVVRQ--SEV-----A-----A---------LEKVEE--------------------------------- 183 (415) T ss_pred eeccCCceeEEEEee--cCC-----c-----c---------ceeecc--------------------------------- Confidence 999887764322211 000 0 0 000000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .+ +. ...+...|.+..|.+.|. +-...+ T Consensus 184 ----------~~-------------------------------~~----~~~~~~~~~~v~~~~~k~-------~~~~~i 211 (415) T protein:vir:79 184 ----------LE-------------------------------EN----PELAVKPFFQLAYDINTH-------RGYFRI 211 (415) T ss_pred ----------cc-------------------------------cc----CcccccceeeEEeeeeee-------Eeeehh Confidence 00 00 000011244444544444 445669 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |-||.+|- ..|.+++|.+-|+..|..-+|+.|+.-.-...-.+ +..... ..++ ..... +.-..+.. T Consensus 212 S~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~~~~~--~~~~-----~~~~~-~~~~~~~i 277 (415) T protein:vir:79 212 SREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TSSGFE--KEGK-----KLEVK-KAKSLDDI 277 (415) T ss_pred hHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--cccccc--cccc-----ccccc-cccchhHH Confidence 99999984 35679999999999999999999996432211111 000000 0000 00000 00011222 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR 434 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 434 (524) ..++.. +.. .+-+.+.+||+|.....|..+...-..+ ....+.+ ....++|.| ++|++.++.+ T Consensus 278 ~~~~~~-------~~~--~~~~~~~~v~n~~~~~~l~~lkd~~G~~------l~~~~~~-~~~~~~l~G-~pV~~~~~~~ 340 (415) T protein:vir:79 278 KDAINL-------NVK--PNYEHNVAIVSQTMFAKLDKMKDKLGNY------LIQPDVK-EKTQQRLLG-AKIEILPDEV 340 (415) T ss_pred HHHHHh-------hhh--hccCCCEEEEcHHHHHHHHHhhccCCce------eeccCcC-CCCCceecc-eeeEEecccc Confidence 333333 222 1225678899999988887542221100 0001111 112357877 6888876654 Q ss_pred cceEEEEEecCCCccceeEeec----cc----ccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccch Q lcl|Aclame:pro 435 QDYFTVGFKGDNEMDAGIYYAP----YV----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMI 505 (524) Q Consensus 435 ~dy~~vG~KG~~~~~~~~fyaP----Yv----~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~ 505 (524) .. -.|+ ..++|+- |+ ....+...|-.+++..+....|++..+ +|=+...-.-.. -.--.++ T Consensus 341 ~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~-~~~~~~~ 410 (415) T protein:vir:79 341 LG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDD-SERGEGD 410 (415) T ss_pred cC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEec-cCCCCCc Confidence 21 0111 1122221 11 111122345567777888888998643 442110000000 0001122 Q ss_pred HHhhc Q lcl|Aclame:pro 506 SKEMC 510 (524) Q Consensus 506 ~~~~a 510 (524) ...-+ T Consensus 411 ~~~~~ 415 (415) T protein:vir:79 411 LGLEA 415 (415) T ss_pred cccCC Confidence 22222 No 28 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.56 E-value=0.0018 Score=35.57 Aligned_cols=360 Identities=15% Similarity=0.124 Sum_probs=142.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhH-----------HHHHHHH--HHHHHHHHhccc------------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSK-----------KQLVAAI--LEAQEKDAETDP------------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~-----------~~~~~~l--~enq~~~~~~~~------------------- 48 (524) |...++|.++=..+++.. .+++..-+ +.+...+ |+.|.+.+.+.. T Consensus 1 mk~~~el~~~l~el~~~~---~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQI---DLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CchHHHHHHHHHHHHHHH---HHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 888777777766664421 11111100 0111111 111111111100 Q ss_pred ----cccchhhhhhhccccccccccc-------ccccCccccc-cccccccccccCchhh--hHHHHHHhhhhhhheeee Q lcl|Aclame:pro 49 ----VYRDEKIVESFGGFLAEAEIAG-------DHNYDQTNIA-SGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGV 114 (524) Q Consensus 49 ----~~~~~~~~~~~~~~l~ea~~~g-------~~~~~~~~~~-~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GV 114 (524) ..++..-...+...+.+-...+ .......... .++++.+-...-|.-+ .+++++..+..-.+++.| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~ 157 (415) T protein:vir:98 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) T ss_pred chhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheee Confidence 0000000000011111100000 0000000000 0111111111124333 455666677888999999 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) +||++..+-+--.+. ... . + ..|-+. T Consensus 158 ~~~~~~~~~~~~~~~--~~~-----~-----~---------~~~v~E--------------------------------- 183 (415) T protein:vir:98 158 KRVTNGSGKYPVVRQ--SEV-----A-----A---------LEKVEE--------------------------------- 183 (415) T ss_pred eeccCCceeEEEEee--cCC-----c-----c---------ceeecc--------------------------------- Confidence 999887764322211 000 0 0 000000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .+ +. ...+...|.+..|.+.|. +-...+ T Consensus 184 ----------~~-------------------------------~~----~~~~~~~~~~v~~~~~k~-------~~~~~i 211 (415) T protein:vir:98 184 ----------LE-------------------------------EN----PELAVKPFFQLAYDINTH-------RGYFRI 211 (415) T ss_pred ----------cc-------------------------------cc----CcccccceeeEEeeeeee-------Eeeehh Confidence 00 00 000011244444544444 445669 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |-||.+|- ..|.+++|.+-|+..|..-+|+.|+.-.-...-.+ +..... ..++ ..... +.-..+.. T Consensus 212 S~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~~~~~--~~~~-----~~~~~-~~~~~~~i 277 (415) T protein:vir:98 212 SREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TSSGFE--KEGK-----KLEVK-KAKSLDDI 277 (415) T ss_pred hHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--cccccc--cccc-----ccccc-cccchhHH Confidence 99999984 35679999999999999999999996432211111 000000 0000 00000 00011222 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR 434 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 434 (524) ..++.. +.. .+-+.+.+||+|.....|..+...-..+ ....+.+ ....++|.| ++|++.++.+ T Consensus 278 ~~~~~~-------~~~--~~~~~~~~v~n~~~~~~l~~lkd~~G~~------l~~~~~~-~~~~~~l~G-~pV~~~~~~~ 340 (415) T protein:vir:98 278 KDAINL-------NVK--PNYEHNVAIVSQTMFAKLDKMKDKLGNY------LIQPDVK-EKTQQRLLG-AKIEILPDEV 340 (415) T ss_pred HHHHHh-------hhh--hccCCCEEEEcHHHHHHHHHhhccCCce------eeccCcC-CCCCceecc-eeeEEecccc Confidence 333333 222 1225678899999988887542221100 0001111 112357877 6888876654 Q ss_pred cceEEEEEecCCCccceeEeec----cc----ccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccch Q lcl|Aclame:pro 435 QDYFTVGFKGDNEMDAGIYYAP----YV----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMI 505 (524) Q Consensus 435 ~dy~~vG~KG~~~~~~~~fyaP----Yv----~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~ 505 (524) .. -.|+ ..++|+- |+ ....+...|-.+++..+....|++..+ +|=+...-.-.. -.--.++ T Consensus 341 ~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~-~~~~~~~ 410 (415) T protein:vir:98 341 LG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDD-SERGEGD 410 (415) T ss_pred cC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEec-cCCCCCc Confidence 21 0111 1122221 11 111122345567777888888998643 442110000000 0001122 Q ss_pred HHhhc Q lcl|Aclame:pro 506 SKEMC 510 (524) Q Consensus 506 ~~~~a 510 (524) ...-+ T Consensus 411 ~~~~~ 415 (415) T protein:vir:98 411 LGLEA 415 (415) T ss_pred cccCC Confidence 22222 No 29 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=95.27 E-value=0.0024 Score=34.94 Aligned_cols=275 Identities=12% Similarity=0.070 Sum_probs=129.8 Q ss_pred cccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhc Q lcl|Aclame:pro 72 HNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFH 149 (524) Q Consensus 72 ~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~ 149 (524) .|+++-+... ++++.. .. |.-+ .+++++..+.+-.++|-+-||++.+.-+ ...+. .++ T Consensus 1 ~g~~a~~~~~-~~~~~~-~i-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-----~~~~~-----~~a------- 60 (299) T protein:vir:41 1 MGFNPDTTTM-QSAKTG-SI-PINISEQIITGVKNGSAAMKLAKAVPMTKPEEEF-----TFMSG-----VGA------- 60 (299) T ss_pred CCcCCCcccc-cCCCce-ec-chhHHHHHHHHHHhcchhhhhceeeecCCCcEEE-----EEEcC-----Cce------- Confidence 4555433111 111111 12 3322 6777778888899999999998765211 11000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccccc Q lcl|Aclame:pro 150 PMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEIS 229 (524) Q Consensus 150 ~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~ 229 (524) .| T Consensus 61 -------~~----------------------------------------------------------------------- 62 (299) T protein:vir:41 61 -------FW----------------------------------------------------------------------- 62 (299) T ss_pred -------ee----------------------------------------------------------------------- Confidence 00 Q ss_pred ccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 230 VGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINRE 309 (524) Q Consensus 230 ~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINre 309 (524) .+| +.+++|...++++++...|..+-...+|-||.+|-. .|.++.|.+.|...|...+++. T Consensus 63 ------v~E---------~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a 123 (299) T protein:vir:41 63 ------VDE---------AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQA 123 (299) T ss_pred ------eec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHH Confidence 001 112344445667888888888888899999999854 4568999999999999999998 Q ss_pred HHhhhhhheeeeeeccccccCccceeeccccc-ccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhh Q lcl|Aclame:pro 310 IVDLINYTAQVGKSGFTQTVGSKAGSFDFQDP-VDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVS 388 (524) Q Consensus 310 ii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~-~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~ 388 (524) |+.=--. ..+.|++..... .... ......+.-|.++.+.+... +.+++.+||+|+... T Consensus 124 ~l~G~g~-------------~~~~gil~~~~~~~~~~------~~~~~~~~~l~~~~~~l~~~--~~~~~~~v~n~~~~~ 182 (299) T protein:vir:41 124 VFTGVES-------------PYNWNILKSATDASNLV------EETANKYDDLNEAIGLIEAE--DLEPNGIATIRKQRV 182 (299) T ss_pred HhhcccC-------------cccccccccccccceee------ccccccHHHHHHHHHhhhcc--cCCcCEEEEcHHHHH Confidence 8841000 011122111000 0000 00001122234444444432 336678999999999 Q ss_pred hhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcce----EEEEEecCCCccceeEeeccccccc-- Q lcl|Aclame:pro 389 ALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDY----FTVGFKGDNEMDAGIYYAPYVALTP-- 462 (524) Q Consensus 389 ~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----~~vG~KG~~~~~~~~fyaPYv~~~~-- 462 (524) .|.....+ .+. .....+.+.. .++|.| ++|++.++.+.+= +++|-- +..++...-.... T Consensus 183 ~L~~lkd~----~G~--~l~~~~~~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gdf------s~~~i~~~~~~~i~~ 247 (299) T protein:vir:41 183 KYRSTKDG----NGM--PIFNTATSNG--VDDVLG-LPIAYTPKYTFGDKDISELVGDW------NQAYYGILRGVEYEI 247 (299) T ss_pred HHHHhhcc----CCc--eeecCCcCCC--Cceecc-eeeEEecccCCCCCceEEEEEec------ccEEEEEecCcEEEE Confidence 98753211 110 0011111111 246776 7999888876541 222211 0011111111111 Q ss_pred ------ccccCCcc-----ccc-eee--eeeeeccEe-cCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 463 ------LRGSDPKN-----FQP-VMG--FKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 463 ------~~~~dp~s-----~qP-~~~--~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) ....|++. ||- .+. ...|++..+ ||=+.. ++. . +.+| T Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~-------~l~-~----~aa~ 299 (299) T protein:vir:41 248 LTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFS-------AVQ-P----KAGN 299 (299) T ss_pred eecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE-------EEE-e----ccCC Confidence 11122221 222 233 345777654 331111 111 0 1112 No 30 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=95.22 E-value=0.0025 Score=34.84 Aligned_cols=356 Identities=14% Similarity=0.067 Sum_probs=125.6 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchh----------------HHHHHHHHHHHHHH------HHhccc-cccchhhhh Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKS----------------KKQLVAAILEAQEK------DAETDP-VYRDEKIVE 57 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~----------------~~~~~~~l~enq~~------~~~~~~-~~~~~~~~~ 57 (524) |-+- ...++|.....- ..+++... .+.....+ +..++ +..... ..+...... T Consensus 1 ~~ke-~~~~~~~~~~~~---~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (413) T protein:vir:81 1 MVKE-AGDAPTNAQVAE---IAEVKSMVEQFKADEDAKRERAKSVKANQDFL-RELQEATAGSVDSEKSGELTRKGEGYK 75 (413) T ss_pred Chhh-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhHHhHHHhhhHhhhhhhhh Confidence 3222 122223221110 01111100 00000000 00000 000000 000000000 Q ss_pred hhccccccc-------------------cccc--ccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeee Q lcl|Aclame:pro 58 SFGGFLAEA-------------------EIAG--DHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGV 114 (524) Q Consensus 58 ~~~~~l~ea-------------------~~~g--~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GV 114 (524) .++..+.+. +... ...........++++ +....=|..+ .+++.+-+..+..++|.| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~ 154 (413) T protein:vir:81 76 SIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTD-EFQGGYGTTWNRNIIYRRREKLVVADLMDN 154 (413) T ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhccccc-ccccccchhhHHHHHHHHhhhhhHHhhcce Confidence 000000000 0000 000000000011111 1111113222 345555567778899999 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) +||++++.-+.-.+ .... ... ++.| T Consensus 155 ~~~~~~~~~~~~~~----~~~~-~~~--------------~a~~------------------------------------ 179 (413) T protein:vir:81 155 LTMTNTTIKYLMEK----ANRV-VEG--------------GFKT------------------------------------ 179 (413) T ss_pred eeccCCceeEEEec----cccc-ccc--------------ccce------------------------------------ Confidence 99998764221111 0000 000 0000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) +++|-. ..| +....|.+..|.+.|. +-...+ T Consensus 180 ---------------------------------v~Eg~~--~~~-------~~~~~f~~i~~~~~k~-------~~~~~i 210 (413) T protein:vir:81 180 ---------------------------------VAEGGK--KPY-------MRFADFDIVTESLSKI-------AGLTKI 210 (413) T ss_pred ---------------------------------ecCccc--ccc-------cCcccceeeEeeeeeE-------EEeehh Confidence 000000 000 0011244444555444 445678 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |-||.+|--+ .++.|.+-|+..|..-+|+.||. |. | +...+.|++..........+ .. T Consensus 211 S~ell~ds~~-----l~~~i~~~la~~~~~~~d~~~l~--------G~-G---~~~~~~Gi~~~~~~~~~~~~-----~~ 268 (413) T protein:vir:81 211 TDEMIEDYDF-----LVSYINARLLEELAIEEERQLLL--------GD-G---TGNNLTGLLKRDGIQTLAVS-----NK 268 (413) T ss_pred hHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhc--------cC-C---CCCccccccccccccccccc-----cc Confidence 9999998632 48888888888899988888874 11 1 11123455443222111000 01 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhh--cccccccccceeEEEecCcEEEEecCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQ--KTLNVDTTKAVFAGVLGGTYKVYIDQY 432 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~--~~~~~d~~~~~~~G~l~~~~~vy~D~y 432 (524) ..++.-|.+....+.....+ ..+.+|++|.....|......-..+- |.. .....+. .....++|.| ++|+++.. T Consensus 269 ~~~~~~i~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~l-~~~~~~~~~~~~-~~~~~~~l~G-~pv~~s~~ 344 (413) T protein:vir:81 269 DELADSIYKAMTNISLATPF-QADALVINPLDYQELRLAKDANGQYY-GGGVFQGQYGSG-GIMLDPAPWG-LRTVQSQV 344 (413) T ss_pred chhHHHHHHHHHHhhhhccC-CCcEEEEcHHHHHHHHHhhccCCcee-cccccccccccc-ccccCceecc-eeeEEcCC Confidence 12222233333334333444 34668899998888764321110000 000 0000000 0111246776 69999998 Q ss_pred CCcceEEEEEecCC--Cc---cceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchH Q lcl|Aclame:pro 433 ARQDYFTVGFKGDN--EM---DAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMIS 506 (524) Q Consensus 433 ~~~dy~~vG~KG~~--~~---~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~ 506 (524) .+..-+++|---.. -. .-.+=..+|.. -+-.+-+=.+-+..||++.+ +|= T Consensus 345 ~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~------------------ 400 (413) T protein:vir:81 345 VPVGKPVVGAFRSAASVLRKGGVRIDSTNTNV------DDFENNLITVRAEERVGLMVTFPE------------------ 400 (413) T ss_pred CCcccEEEEecccEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEeccc------------------ Confidence 77665555532100 00 00011111110 01123344555556776543 220 Q ss_pred HhhccchhhhhhhhcccC Q lcl|Aclame:pro 507 KEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 507 ~~~a~~~~~~~~~~V~~~ 524 (524) .|+++-++.. T Consensus 401 --------a~~~l~~~~~ 410 (413) T protein:vir:81 401 --------AIVQLDVAEV 410 (413) T ss_pred --------ceEEEEecCC Confidence 0111111111 No 31 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=95.07 E-value=0.0028 Score=34.56 Aligned_cols=335 Identities=15% Similarity=0.110 Sum_probs=134.2 Q ss_pred CCch-HHHHHHhhHhhcc------cccchhhcchhH--H------HHHHHHHHHHHHHHhcccccc-----chhhhhhhc Q lcl|Aclame:pro 1 MSKK-NELMEKWNDLLES------QEGLPDIATKSK--K------QLVAAILEAQEKDAETDPVYR-----DEKIVESFG 60 (524) Q Consensus 1 m~~~-~~l~~kw~p~l~~------~~~~~~i~~~~~--~------~~~~~l~enq~~~~~~~~~~~-----~~~~~~~~~ 60 (524) |++. ++|.|+=..+.+. ++-..++..+.. + +....++|.+.+.......-+ .......|. T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFV 80 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHH Confidence 8854 4444443222221 111222221100 0 111111111111111111000 000111222 Q ss_pred ccccccccccccccCccccccccccccccccCch-hh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCc Q lcl|Aclame:pro 61 GFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAG 138 (524) Q Consensus 61 ~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~-li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~ 138 (524) .+|-.. .........+.+|.+. =|. +. .+++.+.++.+-.+++.+.||++.++-+.-.+ .... T Consensus 81 ~~l~~~-------~~~a~~~~t~~~gg~~--vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~~---- 145 (371) T protein:vir:81 81 NHIRTR-------FRNAMSEGSNQDGGYT--VPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQQ---- 145 (371) T ss_pred HHHHHH-------HHHhhccCCCccCcee--ecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC---- Confidence 222110 0000001112222211 132 22 46666667888999999999988765543221 1110 Q ss_pred ccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccc Q lcl|Aclame:pro 139 GTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIA 218 (524) Q Consensus 139 gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~ 218 (524) . ++ .|- T Consensus 146 -~-----~a---------~~v----------------------------------------------------------- 151 (371) T protein:vir:81 146 -T-----GF---------VEV----------------------------------------------------------- 151 (371) T ss_pred -c-----ce---------eee----------------------------------------------------------- Confidence 0 00 000 Q ss_pred cccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHH Q lcl|Aclame:pro 219 ENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAIL 298 (524) Q Consensus 219 ~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnIL 298 (524) +|.. .....+...|.+..+...|.. -...+|-||.+|-. .|.++.|.+.| T Consensus 152 ------------------~Eg~-~~~~~~~~~f~~i~~~~~k~~-------~~~~iS~ell~ds~----~~l~~~i~~~l 201 (371) T protein:vir:81 152 ------------------AEGA-AIGEKATPQFTLLQYQVKKYA-------GFFRVTNELLNDST----EAIVNTLVRWI 201 (371) T ss_pred ------------------cccc-ccccccccceeeEEeeeeEEE-------EeehhhHHHHhhhh----HHHHHHHHHHH Confidence 0000 000001123555555555555 44579999999853 46689999999 Q ss_pred HHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCC Q lcl|Aclame:pro 299 ATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 378 (524) Q Consensus 299 StEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn 378 (524) ...|..-+|+.|+.-.... .+.|+..++ ....++... ....+.... T Consensus 202 ~~a~~~~~~~~i~~g~g~~-------------~~~~~~~~~-------------~i~~~~~~~--------l~~~~~~~a 247 (371) T protein:vir:81 202 GDESRVTRNGLIINVLNTK-------------AKTAIADLD-------------GLKQIINVQ--------LDPVFRSTS 247 (371) T ss_pred HHHHHHHHHHHHHhhcccc-------------cccccccHH-------------HHHHHHHhh--------cchhhhcCC Confidence 9999999999888632211 112332211 112221111 111222345 Q ss_pred EEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeeccc Q lcl|Aclame:pro 379 FIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYV 458 (524) Q Consensus 379 ~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv 458 (524) .+|++|.....|.....+-.. + ....+.+ ....|+|.| ++||+..+.+...-.++--+. -...++|+.+- T Consensus 248 ~~vmn~~~~~~L~~lkd~~g~--~----l~~~~~~-~~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~--~~~~i~~Gd~~ 317 (371) T protein:vir:81 248 SVIVNQDAFNWLDTLKDQNGQ--Y----LLQPSIS-SPTGRQLLG-LPVVIVSNKVLANRVDGGTGA--QFAPIIVGDLK 317 (371) T ss_pred EEEEcHHHHHHHHHhhccCCC--e----eeecccC-CCCCceecc-eeEEEecccccCccccccccC--CcceEEEEehh Confidence 789999998888754221100 0 0000111 112468887 699988776643322111111 11223444321 Q ss_pred cc-------ccccccCCc------cccceeeeeeeeccE-ecCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 459 AL-------TPLRGSDPK------NFQPVMGFKTRYGIG-INPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 459 ~~-------~~~~~~dp~------s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) .+ .+.-.+++. +-+=.+-...||+.. .||=+...-.-.. | T Consensus 318 ~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~------------A 371 (371) T protein:vir:81 318 EAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQL------------A 371 (371) T ss_pred ceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEec------------C Confidence 10 011112222 223455556666653 3331110000000 0 No 32 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=94.78 E-value=0.0035 Score=34.04 Aligned_cols=259 Identities=14% Similarity=0.109 Sum_probs=113.8 Q ss_pred ccccccccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccc Q lcl|Aclame:pro 63 LAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGT 140 (524) Q Consensus 63 l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gt 140 (524) +-|+-..++ ++.|.+ .. |.-+ .+++.+-++.+-.++|.+-||++.+|- ..+.......+ T Consensus 1 ~l~~~~~~t-----------~~~gg~-li-P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~-----~~~~~~~~~~~- 61 (293) T protein:vir:48 1 MLDSKTDHS-----------GSDAGL-TI-PQDIRTAINTLVRQYDSLQEYVNVENVTTLTGS-----RVYEKWTDITG- 61 (293) T ss_pred Cceeecccc-----------cCcCce-Ee-chhHHHHHHHHHHhhhhhhhhceeeeccCCcce-----EEEEeecCCCc- Confidence 112111110 111111 11 2222 345555566777788888888775541 11111100000 Q ss_pred ccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccc Q lcl|Aclame:pro 141 PADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAEN 220 (524) Q Consensus 141 eA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~ 220 (524) .+. T Consensus 62 --------------~a~--------------------------------------------------------------- 64 (293) T protein:vir:48 62 --------------LAN--------------------------------------------------------------- 64 (293) T ss_pred --------------cee--------------------------------------------------------------- Confidence 000 Q ss_pred cccccccccccccchhhhhccccCCCCCcccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHH Q lcl|Aclame:pro 221 EKGTLAEISVGMATSVAELQENFNGSSANPWNEMA-FRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILA 299 (524) Q Consensus 221 ~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMs-FsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILS 299 (524) -.+| +..++|.+ .++++++..+|.-+-...+|-||.+|. .+|.+++|.+-|+ T Consensus 65 --------------~v~E---------g~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la 117 (293) T protein:vir:48 65 --------------IDDE---------AGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIA 117 (293) T ss_pred --------------eecC---------CcccccccccceeEEEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHHH Confidence 0011 11233332 456666667777777788999999986 3678999999999 Q ss_pred HHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCE Q lcl|Aclame:pro 300 TEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNF 379 (524) Q Consensus 300 tEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~ 379 (524) ..|..-+|+.|+.-+...+. ..+.+++ +....|+.++ ... +..... T Consensus 118 ~~~~~~~~~~i~~g~~~~~~------------~~~~~~~-------------d~i~~~~~~l-------~~~--~~~~a~ 163 (293) T protein:vir:48 118 KKVVVTRNKAILGVVDKLPT------------KPTLTKW-------------DDIIDLEAKV-------DPA--IKQTSF 163 (293) T ss_pred HHHHHHHHhHHhhccccccc------------cccccCH-------------HHHHHHHHhh-------hhh--hcCCCE Confidence 99999999999864332111 1122221 2233344433 222 223457 Q ss_pred EEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe--cCCCCc----c----------eEEEEEe Q lcl|Aclame:pro 380 IIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI--DQYARQ----D----------YFTVGFK 443 (524) Q Consensus 380 ~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~----d----------y~~vG~K 443 (524) .+|+|.....|..+...- + .-...++......++|.| ++|++ |.+.+. + ++.++.+ T Consensus 164 ~vmn~~~~~~L~~lkd~~----g---~~l~~~~~~~~~~~~l~G-~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~ 235 (293) T protein:vir:48 164 FLTNTSGFTALKKVKNAL----G---DYLMERDVKSPTGYSIAG-FAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDR 235 (293) T ss_pred EEEcHHHHHHHHHhhccC----C---ceEeecCcCCCCCceecc-eeeEEecccccCCccCCceEEEEEeccceEEEEEe Confidence 789999988886542221 1 001111111112357777 57765 333221 1 2222222 Q ss_pred cCCCccceeEeecccccccccccCCccccceeeeeeeecc---------------EecCcccccCCCccccccccchHH Q lcl|Aclame:pro 444 GDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGI---------------GINPFANSRSQAPADRITSGMISK 507 (524) Q Consensus 444 G~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l---------------~~nP~~~~~~~~~~~~i~~~~~~~ 507 (524) +.-.. -..++.. .+-.+-|=.+-...||+. .+-|......- .. T Consensus 236 ~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~-----------~~ 293 (293) T protein:vir:48 236 QQMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGST-----------AV 293 (293) T ss_pred cceEE----EEecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCcccccc-----------CC Confidence 21111 1111100 011122333444445543 33332221111 11 No 33 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=94.39 E-value=0.0045 Score=33.43 Aligned_cols=337 Identities=13% Similarity=0.122 Sum_probs=127.4 Q ss_pred CCchHHHHHHhhHhhcc----cccchhhcc---hhH---HHHHHHH---------HHHHHHHHhccccc----------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLES----QEGLPDIAT---KSK---KQLVAAI---------LEAQEKDAETDPVY----------- 50 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~----~~~~~~i~~---~~~---~~~~~~l---------~enq~~~~~~~~~~----------- 50 (524) |.+.++|.+.|..+=+. .+-+.+... ..+ +++-+.| ++.++++.+..+.. T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 99999988777655210 000000000 000 0111111 11111111111100 Q ss_pred -cchh----hhhhhcccccccccccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhh Q lcl|Aclame:pro 51 -RDEK----IVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQV 124 (524) Q Consensus 51 -~~~~----~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLI 124 (524) ..+. ....+..++.+... ..........++.|.+. .-+.+. .+++.+.++..-.++|.++||++++|-+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 155 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGRYQ----NLLDSKTDASGSDAGLT-IPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSR 155 (397) T ss_pred chhhHHHHHHHHHHHHHHhhhhh----HHHHHhhccCCcccccc-ccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceE Confidence 0000 01111122211100 00000000011112111 111111 3444445566778889999999988754 Q ss_pred eeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcc Q lcl|Aclame:pro 125 FALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTV 204 (524) Q Consensus 125 FAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~ 204 (524) --++. .... +. +.|-+ T Consensus 156 ~~~~~--~~~~---~~---------------a~~v~-------------------------------------------- 171 (397) T protein:vir:48 156 VYEKW--ADIT---GL---------------AKLDD-------------------------------------------- 171 (397) T ss_pred EEEee--cCCC---cc---------------eeeec-------------------------------------------- Confidence 32221 1110 00 00000 Q ss_pred cccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHh Q lcl|Aclame:pro 205 TGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRA 284 (524) Q Consensus 205 tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkA 284 (524) +| +.. ..+....|.++.|++.|.. -...+|-||.+|-. T Consensus 172 -------------------------E~------~~~---~~~~~~~~~~v~~~~~k~~-------~~~~iS~ell~ds~- 209 (397) T protein:vir:48 172 -------------------------EA------GSI---GTNDDPKLYPIRYAIKRYA-------GISTVTNSLLADSA- 209 (397) T ss_pred -------------------------cc------ccc---ccccccceeeEEeeheeee-------eehhhHHHHHhhch- Confidence 00 000 0001123555555555554 44679999999843 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 285 VHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKE 364 (524) Q Consensus 285 iHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~ 364 (524) .|.+++|.+-|+..|..-+|+.|+.-.-.. ....++.++ +-...++.. T Consensus 210 ---~~l~~~v~~~l~~~~~~~~d~~il~G~g~~------------~~~~~~~~~-------------d~i~~~~~~---- 257 (397) T protein:vir:48 210 ---ENILAWLSGWIAKKVVVTRNKAILEAIATL------------PTKPTLTKW-------------DDIIDLQAK---- 257 (397) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhhccccc------------ccccccccH-------------HHHHHHHHH---- Confidence 577999999999999999999998521110 011122211 122333333 Q ss_pred HHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe-c-CCCC-----c-- Q lcl|Aclame:pro 365 ANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI-D-QYAR-----Q-- 435 (524) Q Consensus 365 a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~-D-~y~~-----~-- 435 (524) +... +..+..+||+|.....|..+..+-..+ ....|.+. .-.++|.| ++|++ | ...+ . T Consensus 258 ---l~~~--~~~~a~~v~n~~~~~~L~~lkd~~G~~------i~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~~~~~~~~ 324 (397) T protein:vir:48 258 ---VDPA--IKQTSFFLTNTSGFTALKKVKNAFGDY------LMERDVKS-PTGYSIDG-FAVKEVADRWLANASSGAMP 324 (397) T ss_pred ---hhhh--hcCCCEEEECHHHHHHHHHhhcCCCce------eeccCcCC-CCCceecc-ceeEEecccccCCcCCCceE Confidence 3322 224578899999999997643221110 00111111 11257877 57764 2 1211 1 Q ss_pred -------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccE-ecC--c-----ccccCCCccccc Q lcl|Aclame:pro 436 -------DYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INP--F-----ANSRSQAPADRI 500 (524) Q Consensus 436 -------dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~-~nP--~-----~~~~~~~~~~~i 500 (524) +|++++..+.-... ..++.. .+-.+.+=.+-...||+.. .|| | +....+.+. T Consensus 325 ~~~gd~~~~~~~~~~~~~~i~----~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~--- 391 (397) T protein:vir:48 325 LYFGDLKQAVTLFDRQQMSLL----STNIGG------GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGN--- 391 (397) T ss_pred EEEEeccceEEEEeecceEEE----Eeccch------hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCC--- Confidence 12333333221111 111100 0111222233333333322 122 0 111111000 Q ss_pred cccchHHhhccc Q lcl|Aclame:pro 501 TSGMISKEMCGK 512 (524) Q Consensus 501 ~~~~~~~~~a~~ 512 (524) .+.- +- T Consensus 392 -~~~~-----~~ 397 (397) T protein:vir:48 392 -LGST-----AV 397 (397) T ss_pred -cccc-----CC Confidence 0000 00 No 34 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=94.08 E-value=0.0055 Score=32.99 Aligned_cols=345 Identities=12% Similarity=0.041 Sum_probs=136.6 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccc-ccc-------chhhhhhhccccccccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDP-VYR-------DEKIVESFGGFLAEAEIAGDH 72 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~-~~~-------~~~~~~~~~~~l~ea~~~g~~ 72 (524) -...++..++|..+.. | +.+++..- +.+- ..++.-++...... .-+ .......+-.+..+....... T Consensus 30 ~~~~~e~~~~~~~~~~--e-~~~l~~~i-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (390) T protein:vir:10 30 GELNASARSKVDELFA--T-VGNLSAEV-QAAR-QRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATM 104 (390) T ss_pred cccCHHHHHHHHHHHH--H-HHHHHHHH-HHHH-HHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhh Confidence 1122445566665543 2 11111100 0111 11111111111100 000 000000000011010000000 Q ss_pred c-cCcccccc-ccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhc Q lcl|Aclame:pro 73 N-YDQTNIAS-GKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFH 149 (524) Q Consensus 73 ~-~~~~~~~~-st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~ 149 (524) - ....+.+. ++++.+-...-|.++ .++.++-.+..-.++|.+.||++++.-+. +..... + +| T Consensus 105 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~----~~~~~~---~------~a-- 169 (390) T protein:vir:10 105 NIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV----QETGFV---N------NA-- 169 (390) T ss_pred HHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEecCC---c------ce-- Confidence 0 00000000 011111111223333 44444555666778899999876542111 000000 0 00 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccccc Q lcl|Aclame:pro 150 PMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEIS 229 (524) Q Consensus 150 ~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~ 229 (524) .| T Consensus 170 -------~~----------------------------------------------------------------------- 171 (390) T protein:vir:10 170 -------AI----------------------------------------------------------------------- 171 (390) T ss_pred -------ee----------------------------------------------------------------------- Confidence 00 Q ss_pred ccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 230 VGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINRE 309 (524) Q Consensus 230 ~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINre 309 (524) .+| +...++-..+++++++.+|..+....+|-||.||-- |.++.|.+-|+..|...||+. T Consensus 172 ------v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~l~~~~~~~~~~~ 231 (390) T protein:vir:10 172 ------VAE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAE 231 (390) T ss_pred ------ecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHH Confidence 001 011233345566777777777778899999999852 468999999999999999999 Q ss_pred HHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhh Q lcl|Aclame:pro 310 IVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSA 389 (524) Q Consensus 310 ii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~ 389 (524) ||. |. | .+..+.|++..........+. +. ..++..+..+...+. ..+...+.+|++|..... T Consensus 232 il~--------G~-G---~~~~p~Gi~~~~~~~~~~~~~-~~---~~~~~~~~~~~~~l~--~~~~~~~~~v~n~~~~~~ 293 (390) T protein:vir:10 232 ILR--------GT-G---ANDGLLGLIPQATTYAAPTTI-AG---ATRVDQLRLAMLQAS--LAEYPASGIVINPIDWAA 293 (390) T ss_pred Hhh--------cC-C---CCccccccccccccccccccc-cc---cchHHHHHHHHHhhc--cccCCCCEEEEcHHHHHH Confidence 885 10 1 011234554432211110000 00 111122222223332 223366789999999888 Q ss_pred hhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccC-- Q lcl|Aclame:pro 390 LARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSD-- 467 (524) Q Consensus 390 L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~d-- 467 (524) |.....+-..+ .-.++.. .-.++|.| ++|++++..|.+-+++|--- .+++.+.. ....+...+ T Consensus 294 L~~lkd~~g~~-------l~~~~~~-~~~~~l~G-~pv~~~~~~p~~~~~~gdf~-----~~~~~~~~-~~~~i~~~~~~ 358 (390) T protein:vir:10 294 IELAKDANNQY-------LIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQ-WDARVEIGYVN 358 (390) T ss_pred HHHhhcCCCce-------eecCCcC-cCCceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEe-cceEEEEeecc Confidence 87533221110 0011110 01246766 69999999887666655210 11111111 111111111 Q ss_pred --CccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 468 --PKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 468 --p~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) -.+-+=.+-...||+..+ +|=+ ..++. +| T Consensus 359 ~~~~~~~~~~r~~~r~d~~v~~~~a-------~~~~~-------~a 390 (390) T protein:vir:10 359 DDFQRNMVTVLAEERLALVVYRPEA-------LISGS-------FA 390 (390) T ss_pred cccccCcEEEEEEEeeccEEecccc-------EEEEE-------eC Confidence 122222333445776543 2210 11111 11 No 35 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=93.62 E-value=0.0069 Score=32.42 Aligned_cols=347 Identities=13% Similarity=0.057 Sum_probs=137.7 Q ss_pred CCch-HHHHHHhhHhhcccccchhhcch---------hHHHHHHHH------HHHHHHHHhcccc-------ccch--h- Q lcl|Aclame:pro 1 MSKK-NELMEKWNDLLESQEGLPDIATK---------SKKQLVAAI------LEAQEKDAETDPV-------YRDE--K- 54 (524) Q Consensus 1 m~~~-~~l~~kw~p~l~~~~~~~~i~~~---------~~~~~~~~l------~enq~~~~~~~~~-------~~~~--~- 54 (524) |++. ++|.+.+..+++.-+ ++.+. -.+.-+..| |+.|.+++++... ..+. + T Consensus 1 m~~~~~~l~~~~~~~~~~~~---~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLK---AFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHHHHHHHHHHHHHHHHH---HHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 8887 468888988887422 22111 011001111 1111111111000 0000 0 Q ss_pred hh---------hhhcccccccccccc-cccCccc--cccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchh Q lcl|Aclame:pro 55 IV---------ESFGGFLAEAEIAGD-HNYDQTN--IASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPT 121 (524) Q Consensus 55 ~~---------~~~~~~l~ea~~~g~-~~~~~~~--~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPT 121 (524) .. ..+...+.+...... ..-..-+ ...+++++.. ..-|.++ .+++++-++.+-.++|.+-||++++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~ 156 (390) T protein:vir:97 78 VGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGA-LTTPNRLPGFITPPDARLTVRDLIGSGRTDSAL 156 (390) T ss_pred chhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhccccccccc-ccchhhhHHHHHHHhhhhhhHhhcceeeccCCc Confidence 00 000000000000000 0000000 0001111111 1111222 4455555566677788888887655 Q ss_pred hhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccC Q lcl|Aclame:pro 122 GQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGN 201 (524) Q Consensus 122 GLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~ 201 (524) .-+ .-.. +++. . ..| T Consensus 157 ~~~-------~~~~--~~~~----~---------a~~------------------------------------------- 171 (390) T protein:vir:97 157 IEY-------VQET--GFVN----N---------AAI------------------------------------------- 171 (390) T ss_pred eEE-------EEEe--cCCc----c---------eee------------------------------------------- Confidence 321 1100 0000 0 000 Q ss_pred CcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 202 VTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQD 281 (524) Q Consensus 202 ~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQD 281 (524) .+| +..+++-..++++++...|..+-...+|-||.+| T Consensus 172 ----------------------------------v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d 208 (390) T protein:vir:97 172 ----------------------------------VAE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSD 208 (390) T ss_pred ----------------------------------ecC---------CccccccccceeEEEEeeeeEEEeehhhHHHHHh Confidence 001 0012222233444555555555567899999998 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 282 LRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQI 361 (524) Q Consensus 282 LkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i 361 (524) -- +.++.|.+-|+..|...||+.|+.- .| +...+.|++..........+.-....+- .| T Consensus 209 s~-----~l~~~i~~~la~a~~~~~d~a~l~G---------~g---~~~~p~Gi~~~~~~~~~~~~~~~~~~~d----~~ 267 (390) T protein:vir:97 209 AP-----QLASYMNNRLIRGLKVKEDAEILRG---------TG---ANDGLLGLIPQATTYAAPTTIAGATRVD----QL 267 (390) T ss_pred HH-----HHHHHHHHHHHHHHHHHHHHHHhhc---------CC---CCccccceeeccccccccccccccchHH----HH Confidence 52 4689999999999999999988841 00 0112334443221111100000001111 12 Q ss_pred HHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEE Q lcl|Aclame:pro 362 DKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVG 441 (524) Q Consensus 362 ~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG 441 (524) ..+...+ ...+...+.+|++|.....|......- + .....+... .--++|.| ++|++++..+.+-+++| T Consensus 268 ~~~~~~~--~~~~~~~~~~v~n~~~~~~L~~lkd~~----G---~~l~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~g 336 (390) T protein:vir:97 268 RLAMLQA--SLAEYPASGIVINPIDWAAIELAKDAN----N---QYLIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVG 336 (390) T ss_pred HHHHHhh--ccccCCCCEEEEcHHHHHHHHHhhcCC----C---ceeecCccC-CCCceecc-eeeEEcCCCCCCcEEEE Confidence 2222222 233346678899999988887542111 1 011111111 11246776 79999999887766665 Q ss_pred EecCCCccceeEeecccccccccccCC---ccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 442 FKGDNEMDAGIYYAPYVALTPLRGSDP---KNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 442 ~KG~~~~~~~~fyaPYv~~~~~~~~dp---~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) --- ..+++...-.++.....+. .+-+=.+-+..||++.+ +|=+ ..+|. +| T Consensus 337 d~~-----~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~v~~~-------~a 390 (390) T protein:vir:97 337 AFD-----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEA-------LITGS-------FA 390 (390) T ss_pred ecc-----ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEecccc-------EEEEE-------eC Confidence 211 0111111111111111111 12232344556887654 2311 11111 11 No 36 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=93.28 E-value=0.0081 Score=32.04 Aligned_cols=343 Identities=13% Similarity=0.098 Sum_probs=129.9 Q ss_pred CCchHHHHHHhhHhhcccccch-hhcch----h--H---HHHHHHH---------HHHHHHHHhcccc------------ Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLP-DIATK----S--K---KQLVAAI---------LEAQEKDAETDPV------------ 49 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~-~i~~~----~--~---~~~~~~l---------~enq~~~~~~~~~------------ 49 (524) |-+.++|.++|..+.+.-+.+- ++... . . +.+.+.+ +++|.++....+. T Consensus 4 ~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 3366788888888765322211 11110 0 0 1111111 1111111110000 Q ss_pred ccchhh----hhhhcccccccccccccccCccccc-----cccccccccccCchhh--hHHHHHHhhhhhhheeeeecCC Q lcl|Aclame:pro 50 YRDEKI----VESFGGFLAEAEIAGDHNYDQTNIA-----SGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMT 118 (524) Q Consensus 50 ~~~~~~----~~~~~~~l~ea~~~g~~~~~~~~~~-----~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmT 118 (524) ...... ...|..++- +....-..... .++..|.+.- |--+ .+++.+-++....++|.++||+ T Consensus 84 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~~~~~gg~~v--P~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 156 (408) T protein:vir:74 84 KSENELKDKFVKDFVNMVR-----NPMAFLNTVSSKTETSGSDSAAGLTI--PQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) T ss_pred chhhhhHHHHHHHHHHHHh-----cchhhhhhhhhhhhcccccCCCceee--chhHhhHHHHHHhhhcchhhhcceeecc Confidence 000000 001111100 00000000000 0111121111 1111 3444445666778999999999 Q ss_pred chhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 GPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVT 198 (524) Q Consensus 119 gPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~ 198 (524) +.+|-+--.+- ... +..+ .|- T Consensus 157 ~~~~~~~~~~~--~~~----~~~~--------------~~v--------------------------------------- 177 (408) T protein:vir:74 157 TSSGSRVYEKW--TDV----TPLK--------------AMD--------------------------------------- 177 (408) T ss_pred CCcceEEEEee--cCC----cccc--------------ccc--------------------------------------- Confidence 88765422221 010 0000 000 Q ss_pred ccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccce-eEEEEEEEEeecccccccccHH Q lcl|Aclame:pro 199 SGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMA-FRIDKQVIEARSRQLKAQYSVE 277 (524) Q Consensus 199 ~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMs-FsIEK~tVtAKSRALKAEYT~E 277 (524) +| +...++.+ .+++++++..+..+-...+|-| T Consensus 178 --------------------------------------~E---------~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~e 210 (408) T protein:vir:74 178 --------------------------------------EE---------DGKIPDLDNPRLTIIKYLIKRYAGIITATNT 210 (408) T ss_pred --------------------------------------cc---------ccccccccccceeeEEeeeeeEEeeehhHHH Confidence 00 00112211 3344455555555555679999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHH Q lcl|Aclame:pro 278 LAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKAL 357 (524) Q Consensus 278 LAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L 357 (524) |.+|- .+|.+++|.+-|+..|..-+|+.||. |. | ++....|+.+++ .| T Consensus 211 ll~ds----~~~l~~~i~~~l~~~~~~~~d~~il~--------G~-G---~~~~~~~~~~~~----------------~i 258 (408) T protein:vir:74 211 LLKDT----AENILAWLSSWIAKKVVVTRNQAIIA--------AM-G---TVPKKPTIANFD----------------DV 258 (408) T ss_pred HHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhh--------cc-c---ccccccccccHH----------------HH Confidence 99983 45779999999999999999999885 11 1 111122333221 11 Q ss_pred HHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCC--CCc Q lcl|Aclame:pro 358 LIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQY--ARQ 435 (524) Q Consensus 358 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~ 435 (524) ...+ ...+.. .+...-.+||+|.....|..+..+-.. ....++......++|.| ++||+-.+ .+. T Consensus 259 ~~~~---~~~l~~--~~~~~a~~v~n~~~~~~l~~lkd~~G~-------~l~~~~~~~~~~~~l~G-~pV~~~~~~~~~~ 325 (408) T protein:vir:74 259 ITMI---NTSVDP--AIIATSSLLTNQSGLNKLALVKTAEGK-------YLLEPDPTKPNSYLIKG-KQVIVVADRWLPN 325 (408) T ss_pred HHHH---HHhhhh--hhcCCCEEEEcHHHHHHHHHhhcCCCc-------eEeccCcCCCCCceecc-eeeEEecCccccc Confidence 1111 112221 222335688999999998864322210 11111111112257877 57775322 221 Q ss_pred ----ce-EEEE-----EecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCccc------ccCCCccc Q lcl|Aclame:pro 436 ----DY-FTVG-----FKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAN------SRSQAPAD 498 (524) Q Consensus 436 ----dy-~~vG-----~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~------~~~~~~~~ 498 (524) ++ +++| |..-....-.+=+.||.- .+-...+-.+-+..||+..+ +|=+. ....++++ T Consensus 326 ~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~ 399 (408) T protein:vir:74 326 SGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGN 399 (408) T ss_pred ccCCcceEEEEehhccEEEEEecceEEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCC Confidence 11 2222 100000001111222211 01123444444555555432 22000 00000000 Q ss_pred cccccchHHhhc Q lcl|Aclame:pro 499 RITSGMISKEMC 510 (524) Q Consensus 499 ~i~~~~~~~~~a 510 (524) .+....... T Consensus 400 ---~~~~~~~~~ 408 (408) T protein:vir:74 400 ---FKTTTSTAV 408 (408) T ss_pred ---CCCCccccC Confidence 000111100 No 37 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=93.03 E-value=0.009 Score=31.79 Aligned_cols=367 Identities=13% Similarity=0.164 Sum_probs=136.9 Q ss_pred CCchHHHHHH---hhHhhcccccchhh----------cc----hhHHHHHHHHHH---HHHHHHhccccccchhhhhhhc Q lcl|Aclame:pro 1 MSKKNELMEK---WNDLLESQEGLPDI----------AT----KSKKQLVAAILE---AQEKDAETDPVYRDEKIVESFG 60 (524) Q Consensus 1 m~~~~~l~~k---w~p~l~~~~~~~~i----------~~----~~~~~~~~~l~e---nq~~~~~~~~~~~~~~~~~~~~ 60 (524) +..-++++++ ....++. +...+. .. ..++.-....+. ++++...... ..+ +...... T Consensus 60 i~~le~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~ 136 (477) T protein:vir:84 60 LDKVEDLDEQIRELESEIER-SGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEP-AKE-RLRRHMV 136 (477) T ss_pred HHHHHHHHHHHHHHHHHHHH-hhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhH-HHH-HHHHHHh Confidence 1111122211 1111100 000000 00 000000000000 0000000000 000 0000000 Q ss_pred c-ccc-ccccccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCC Q lcl|Aclame:pro 61 G-FLA-EAEIAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPL 136 (524) Q Consensus 61 ~-~l~-ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~ 136 (524) . ... +-......+.....+..++++|.. ..-|-.+ .++...-++.+-.++|++.||++.+|-+--.|.. . T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~--~--- 210 (477) T protein:vir:84 137 DVESDKEIRKIAKVGEEYRDLDRNGGTGGY-AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL--T--- 210 (477) T ss_pred hhhhhhhHHHHHHhhhhhccccccCCCcce-eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe--c--- Confidence 0 000 000000000000011111111111 1112222 2455455677788999999999988754322211 0 Q ss_pred CcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccc Q lcl|Aclame:pro 137 AGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAV 216 (524) Q Consensus 137 ~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~ 216 (524) +. ..+ .+.+.+. T Consensus 211 --~~----~~a---------~~~~Eg~----------------------------------------------------- 222 (477) T protein:vir:84 211 --GT----STA---------IQAADNA----------------------------------------------------- 222 (477) T ss_pred --Cc----cee---------eeeccCc----------------------------------------------------- Confidence 00 000 0000000 Q ss_pred cccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHH Q lcl|Aclame:pro 217 IAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSA 296 (524) Q Consensus 217 ~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsn 296 (524) . ......++...+++.+++.+|.-+-...+|-||.+|-. .|.++.|.+ T Consensus 223 ----------------~------------~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~ 270 (477) T protein:vir:84 223 ----------------A------------LTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA----VSVDEFVFR 270 (477) T ss_pred ----------------c------------cccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc----hhHHHHHHH Confidence 0 00112344456677788888888888899999999943 567999999 Q ss_pred HHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccc----cccchHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 297 ILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI----RGARWAGESYKALLIQIDKEANEIARQT 372 (524) Q Consensus 297 ILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~----~~~~~~~e~~r~L~~~i~~~a~~I~~~T 372 (524) -|+..|..-|++.||. |. | +...+.|++.......+ ....|. ....++..|-...+.+.... T Consensus 271 ~l~~~~~~~~d~~~l~--------G~-G---t~~~p~Gi~~~~~~~~~~~~~~~~t~~--~~~~~~~~i~~~~~~~~~~~ 336 (477) T protein:vir:84 271 DLAADYANKLNVQVIS--------GT-G---SNNQVVGVRATAGITQVTATSAGSALE--KHQIIYQKIADAIQRVHTSR 336 (477) T ss_pred HHHHHHHHHHHHHHhc--------cC-C---CCCccceeeeccccccccccccccchh--hHHHHHHHHHHHHhhccccc Confidence 9999999999999885 11 1 01123466544322111 111121 12233444444444444333 Q ss_pred cccCCCEEEEchhhhhhhhhhccccc-----ccchhhhc-ccccccccceeEEEecCcEEEEecCCCCcc--------eE Q lcl|Aclame:pro 373 GRGAGNFIIASRNVVSALARIDSGIT-----PASQGLQK-TLNVDTTKAVFAGVLGGTYKVYIDQYARQD--------YF 438 (524) Q Consensus 373 ~~g~gn~~v~S~~va~~L~~~~~g~~-----~~s~~~~~-~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~ 438 (524) +. .+..+|++|.....|..+..+-. +.-+..+. ....+.-.....|+|.| ++|+++++.|.+ -| T Consensus 337 ~~-~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G-~pVv~s~~~p~~~~~~~d~~~i 414 (477) T protein:vir:84 337 FL-EPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG-LPVVTDPTLPTTLGTGTDQDVI 414 (477) T ss_pred cC-CccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc-cceEecCcccccccccCCcceE Confidence 32 34678888887777654322211 00000000 01111112223467876 799999988753 34 Q ss_pred EEEEecCCCccceeEeecccccccccccCCcccc--ceeeeeeeec-----cEecC--cccccCCCccccccccchHHhh Q lcl|Aclame:pro 439 TVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQ--PVMGFKTRYG-----IGINP--FANSRSQAPADRITSGMISKEM 509 (524) Q Consensus 439 ~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~q--P~~~~~tRY~-----l~~nP--~~~~~~~~~~~~i~~~~~~~~~ 509 (524) ++|--.+.- .- ..+..+ .++|.++. ....|.. |+ .+.+| |.. .+..... .-.+ T Consensus 415 ~~gd~~~~~------i~--~~~~~~-~~~~~~~~~~~~~~~~v-~~~~~~~~~r~~~afv~-~t~~~~~-------~~~~ 476 (477) T protein:vir:84 415 HVLRASDLA------LF--ESSVRM-RALQETRAENLSVLLQV-YGYLAFTAARFPQSVVE-IGGTALT-------APTF 476 (477) T ss_pred EEEEeceEE------EE--eeceeE-Eeccccccccceeeeee-hhhhhhhhhccccceEE-eeccccc-------cccc Confidence 444332110 00 001001 12222211 2222211 22 12244 221 1110000 0012 Q ss_pred c Q lcl|Aclame:pro 510 C 510 (524) Q Consensus 510 a 510 (524) + T Consensus 477 ~ 477 (477) T protein:vir:84 477 A 477 (477) T ss_pred C Confidence 2 No 38 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=92.38 E-value=0.012 Score=31.18 Aligned_cols=354 Identities=14% Similarity=0.069 Sum_probs=137.1 Q ss_pred CCch----HHHHHHhhHhhccccc-ch----hhcch------hHHH----------HHHHHHHHHHHHHhccccccchhh Q lcl|Aclame:pro 1 MSKK----NELMEKWNDLLESQEG-LP----DIATK------SKKQ----------LVAAILEAQEKDAETDPVYRDEKI 55 (524) Q Consensus 1 m~~~----~~l~~kw~p~l~~~~~-~~----~i~~~------~~~~----------~~~~l~enq~~~~~~~~~~~~~~~ 55 (524) |++. ++|++++..+-+.-+. .. +++.. .++. +-+++.+.+.+....+..-..+.. T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEA 80 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccch Confidence 8875 4555555544321000 01 11100 0110 001111100000000000000000 Q ss_pred hhhhcccccccc--------cccccccCcccccccccccc-ccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhhe Q lcl|Aclame:pro 56 VESFGGFLAEAE--------IAGDHNYDQTNIASGKSSGA-ITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVF 125 (524) Q Consensus 56 ~~~~~~~l~ea~--------~~g~~~~~~~~~~~st~sg~-v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIF 125 (524) .........+.. ..+........-+..+++++ -...-|.++ .++++.-+..+..++|.++||.+++.-+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~- 159 (395) T protein:vir:43 81 PKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEY- 159 (395) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEE- Confidence 000000000000 00000000000000011111 011223222 4555556677788889999887653211 Q ss_pred eeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCccc Q lcl|Aclame:pro 126 ALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVT 205 (524) Q Consensus 126 AMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~t 205 (524) .| ..... .. ..| T Consensus 160 -~~--~~~~~---~~---------------a~~----------------------------------------------- 171 (395) T protein:vir:43 160 -VR--ETGFV---NN---------------AAP----------------------------------------------- 171 (395) T ss_pred -EE--EecCC---Cc---------------eee----------------------------------------------- Confidence 00 00000 00 000 Q ss_pred ccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhh Q lcl|Aclame:pro 206 GADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAV 285 (524) Q Consensus 206 gt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAi 285 (524) .+| +...++-..+++++++..+.-+-...+|-||.||.- T Consensus 172 ------------------------------v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-- 210 (395) T protein:vir:43 172 ------------------------------VSE---------GTQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS-- 210 (395) T ss_pred ------------------------------ecC---------CccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-- Confidence 001 001233334455566666666666789999999863 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 286 HGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEA 365 (524) Q Consensus 286 HGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a 365 (524) +.++.|.+-|+..|...+|+.||. |. | ....+.|++..........+ ... ....++..|.++. T Consensus 211 ---~l~~~v~~~la~a~~~~~d~~~l~--------G~-g---~~~~~~Gi~~~~~~~~~~~~-~~~-~~~~~~~~i~~~~ 273 (395) T protein:vir:43 211 ---ALQSYIDARARYGLMLVEECQLLY--------GN-G---TGANLHGIIPQAQAYAPPSG-VVV-TAEQRIDRIRLAI 273 (395) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHh--------cc-C---CCCccccccccccccccccc-ccc-ccchhHHHHHHHH Confidence 358899999999999999999884 10 0 01112344332211110000 000 0112333444444 Q ss_pred HHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecC Q lcl|Aclame:pro 366 NEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGD 445 (524) Q Consensus 366 ~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~ 445 (524) +.+.. .+..+..+|++|.....|..+.. ..+. ....+.. ..-.++|.| ++|+++++.+.+=+++|--.. T Consensus 274 ~~~~~--~~~~~~~~vmn~~~~~~l~~lkd----~~G~---~i~~~~~-~~~~~~l~G-~pVv~~~~~~~~~~~~gd~~~ 342 (395) T protein:vir:43 274 LQAQL--AEFPASGIVLNPIDWALIELNKD----AENR---YIIGSPQ-NGTTPTLWR-LPVVETQAITQDEFLTGAFSL 342 (395) T ss_pred Hhhcc--ccCCCcEEEEcHHHHHHHHHhhc----cCCc---eeccccc-cCCCceecc-eeeEEcCCCCCCcEEEEeccc Confidence 44433 34466789999999888865321 1111 1111111 111356776 799999998776555543211 Q ss_pred CCccceeEeecccccccccccC-C-cccc---ceeeeeeeeccEe-cCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 446 NEMDAGIYYAPYVALTPLRGSD-P-KNFQ---PVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 446 ~~~~~~~fyaPYv~~~~~~~~d-p-~s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) . -+++ .-....+...+ . ..|+ =.+-+..|++..+ +|=+ ..++. -- .+ T Consensus 343 ~----~~~~--~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~~~---~t--aa 395 (395) T protein:vir:43 343 G----AQIF--DRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEA-------FVTGS---LT--AS 395 (395) T ss_pred e----EEEE--EecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc-------eEEEE---ec--cC Confidence 0 0000 00111111111 1 1232 2333445777654 2311 00100 00 00 No 39 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=91.15 E-value=0.017 Score=30.25 Aligned_cols=342 Identities=10% Similarity=0.030 Sum_probs=127.7 Q ss_pred CCchHHHHHHhhHhhccccc-----chhhcch-----------hH---HHH---HHHHH---HHHHHHHhcccc--ccch Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG-----LPDIATK-----------SK---KQL---VAAIL---EAQEKDAETDPV--YRDE 53 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~-----~~~i~~~-----------~~---~~~---~~~l~---enq~~~~~~~~~--~~~~ 53 (524) |.-+| ++++=..+++..+. ..+++.. .+ ..+ +..|. +..++....... .... T Consensus 1 m~~~e-~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 79 (379) T protein:vir:10 1 MEALE-IKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKSD 79 (379) T ss_pred CCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccch Confidence 55442 44333333221000 0000000 00 000 01110 111111111110 0000 Q ss_pred hhhhhhcccccccccccccccCccccccc-cccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeee Q lcl|Aclame:pro 54 KIVESFGGFLAEAEIAGDHNYDQTNIASG-KSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAV 130 (524) Q Consensus 54 ~~~~~~~~~l~ea~~~g~~~~~~~~~~~s-t~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsr 130 (524) .....+........-+..........+.+ +++++....=|.-+ .+++..-....-.++|.|.||++++.-| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~------ 153 (379) T protein:vir:10 80 SLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTF------ 153 (379) T ss_pred hHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEE------ Confidence 00111110000000000000001111111 11111111112211 2333333455666788888887664211 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) .-.. ++++. . T Consensus 154 -~~~~---------------------~~~~~------------------------------------------------~ 163 (379) T protein:vir:10 154 -VREN---------------------GAGEG------------------------------------------------A 163 (379) T ss_pred -EEee---------------------cCCCc------------------------------------------------c Confidence 1110 00000 0 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCh Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 290 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDA 290 (524) ....+| +...+++..++++++..+|.=+-...+|-||.||--. . T Consensus 164 ----------------------~~~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~-----l 207 (379) T protein:vir:10 164 ----------------------IGAQVE---------GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPF-----L 207 (379) T ss_pred ----------------------cccccC---------CccccccccceeeeEeeeeeEEeeehhhHHHHhhHHH-----H Confidence 000011 1123444555556666665555567899999999632 5 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 291 DAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIAR 370 (524) Q Consensus 291 EaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~ 370 (524) ++.|.+-|+..|..-+|..++.-+...+..+..+ .. +....+..+.++.++. . T Consensus 208 ~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~----------~~----------~~~~~d~i~~~~~~~~-------~ 260 (379) T protein:vir:10 208 TSFIPNALRRDYAKAENAAFNAVLAANATASTEI----------IT----------NKNKVEMLINEIAKQE-------N 260 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccccc----------cc----------CcccHHHHHHHHHhhh-------h Confidence 8899999999999999998886544332221111 00 0011222333333332 1 Q ss_pred hccccCCCEEEEchhhhhhhhhhcccccccchhhhccccc-ccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCcc Q lcl|Aclame:pro 371 QTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNV-DTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMD 449 (524) Q Consensus 371 ~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~-d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 449 (524) .+-..+.+|++|.....|......-..+- .+..... +.+. -+|.| ++|+++++.+...+++|=-.. T Consensus 261 --~~~~~~~~vmn~~~~~~l~~lkd~~G~~l--~~~~~~~~~~~~----~~l~G-~pvv~s~~~~ag~~~~gdf~~---- 327 (379) T protein:vir:10 261 --LDFPVTAIVLRPTDYYDILVTQKSVGAGY--GLPGVVTQDNGV----LRING-IPLFRATWLAANKYYVGDWTR---- 327 (379) T ss_pred --ccCCCCEEEEcHHHHHHHHHhhccCCcee--ccCCccCCCCCc----ceecc-eeeEecCCCCCCceEEeeccc---- Confidence 22356779999999888865422221110 0101000 1111 15666 799999998776555542211 Q ss_pred ceeEeecccccccccc-cC----CccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhhhhhccc Q lcl|Aclame:pro 450 AGIYYAPYVALTPLRG-SD----PKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKG 523 (524) Q Consensus 450 ~~~fyaPYv~~~~~~~-~d----p~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~ 523 (524) .-+++- ....+.. .+ -.+-+=.+=+..|+|+.+ +|=+ |.++-+.. T Consensus 328 ~~~~~~---~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a--------------------------~v~~~~~~ 378 (379) T protein:vir:10 328 VTKVTT---EGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAA--------------------------LIFGDFTA 378 (379) T ss_pred EEEEEE---eceEEEEeecccccccCCcEEEEEEEEeccEEecCcc--------------------------EEEEEecC Confidence 111111 1111110 11 112222222345776543 3411 11111111 Q ss_pred C Q lcl|Aclame:pro 524 L 524 (524) Q Consensus 524 ~ 524 (524) | T Consensus 379 ~ 379 (379) T protein:vir:10 379 V 379 (379) T ss_pred C Confidence 1 No 40 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=91.12 E-value=0.017 Score=30.23 Aligned_cols=298 Identities=11% Similarity=0.051 Sum_probs=124.3 Q ss_pred cccccccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccccc Q lcl|Aclame:pro 64 AEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPA 142 (524) Q Consensus 64 ~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA 142 (524) +.. ....+.+.......|.+ .-|.++ .+++++.++.+-.+++-+.||+++.- +|.-.. .+.++ T Consensus 1 m~~-----~~~~a~~~~~t~~~g~~--i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-------~~p~~~--~~~~a 64 (330) T protein:vir:77 1 MAG-----STVPSTQVALTGDFSAF--LTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGI-------SIPHWT--GAVSA 64 (330) T ss_pred Ccc-----cccchhhccccCCCcce--echhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEc--CCcce Confidence 111 11111111111111111 224444 56677778888889999999887542 111100 00000 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccc Q lcl|Aclame:pro 143 DVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEK 222 (524) Q Consensus 143 ~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~ 222 (524) .|- T Consensus 65 --------------~~v--------------------------------------------------------------- 67 (330) T protein:vir:77 65 --------------SWT--------------------------------------------------------------- 67 (330) T ss_pred --------------eEe--------------------------------------------------------------- Confidence 000 Q ss_pred cccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|Aclame:pro 223 GTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEI 302 (524) Q Consensus 223 g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI 302 (524) +| +.++++-..+++++++..|..+-+..+|-||.+|- ..|.|++|.+-|+..| T Consensus 68 --------------~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai 120 (330) T protein:vir:77 68 --------------GE---------AERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAI 120 (330) T ss_pred --------------cC---------CCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHH Confidence 01 11233444556777777777777788999999983 5678999999999999 Q ss_pred HHHhhHHHHhhhhhheeeeeeccccccCccceeeccc----ccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCC Q lcl|Aclame:pro 303 MLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQ----DPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 378 (524) Q Consensus 303 ~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~----~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn 378 (524) ...||+.||. |... ...+.|++... ...+......+ .....++..+.++-..+.+. ....+ T Consensus 121 ~~~~~~~~l~--------G~g~----~~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~--~~~~~ 185 (330) T protein:vir:77 121 ALKFDAAAIH--------GIDK----PSAFKGYLAETTKVVSLADTNLTTAS-GPQGNAYLAVNNALSLLVNS--GKKWT 185 (330) T ss_pred HHHHHHHhhc--------ccCC----CCccccccccccccceeecccccccc-cccchhHHHHHHHHHhhhhc--CCCcc Confidence 9999999984 1000 00001111000 00000000000 01122334444444444443 23556 Q ss_pred EEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCc--------------ceEEEEEec Q lcl|Aclame:pro 379 FIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ--------------DYFTVGFKG 444 (524) Q Consensus 379 ~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--------------dy~~vG~KG 444 (524) .+||+|.....|......-. .+-.+.....+......-++|.| ++||+..+.+. .++++|-.+ T Consensus 186 ~~vmn~~~~~~l~~lkd~~G--~~l~~~~~~~~~~~~~~~~~l~G-~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~ 262 (330) T protein:vir:77 186 GTLLDNVTEPILNTAVDGNG--RPLFVESTYTEQVGAIREGRILG-RPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIG 262 (330) T ss_pred EEEEcHHHHHHHHHHhccCC--ceeecCccccccccccCCceecc-eeeEEeccccCCCCCCccEEEEEecceEEEEEec Confidence 78999999988875321110 00000000000111112356776 79999988653 122334333 Q ss_pred CCCc----cceeEeecccccccccccCCccc---cceeeeeeeeccEe-cC--cc---cccCCCccccccccchHHhhcc Q lcl|Aclame:pro 445 DNEM----DAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGI-NP--FA---NSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 445 ~~~~----~~~~fyaPYv~~~~~~~~dp~s~---qP~~~~~tRY~l~~-nP--~~---~~~~~~~~~~i~~~~~~~~~a~ 511 (524) ..+. ++.+.+.- .........+-+-| +=.+=...|++..+ +| |. .....+++-. T Consensus 263 ~~~i~~~~e~~~~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~------------ 329 (330) T protein:vir:77 263 GLSFDVTDQATLDFGE-EQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEE------------ 329 (330) T ss_pred CcEEEEeecceeeecc-cccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCCcCCCC------------ Confidence 2221 11111100 00000000000001 11112223444322 22 10 0011111100 Q ss_pred chh Q lcl|Aclame:pro 512 KNA 514 (524) Q Consensus 512 ~~~ 514 (524) . T Consensus 330 --~ 330 (330) T protein:vir:77 330 --E 330 (330) T ss_pred --C Confidence 0 No 41 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=90.56 E-value=0.02 Score=29.88 Aligned_cols=366 Identities=16% Similarity=0.140 Sum_probs=137.5 Q ss_pred CCchHHHHHHhhHhhccccc-chhhcch-------hHHHHHHHH--HHHHHHHHhcc---------------c------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG-LPDIATK-------SKKQLVAAI--LEAQEKDAETD---------------P------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~-~~~i~~~-------~~~~~~~~l--~enq~~~~~~~---------------~------- 48 (524) |...++|.++=..+++..+. ..++... -.+.+...+ |++|++.+.+. + T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 65555554443333221000 0000000 001111111 11111111110 0 Q ss_pred -cccchhhhhhhccccccccccc-------ccccCccccc-cccccccccccCchhh--hHHHHHHhhhhhhheeeeecC Q lcl|Aclame:pro 49 -VYRDEKIVESFGGFLAEAEIAG-------DHNYDQTNIA-SGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPM 117 (524) Q Consensus 49 -~~~~~~~~~~~~~~l~ea~~~g-------~~~~~~~~~~-~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPm 117 (524) ..........+...+.+....+ .......... .++++++-...-|.-+ .+++.+-+..+-.++|.++|| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeec Confidence 0000000011111111111000 0000000001 1111111111124222 456666678888999999999 Q ss_pred CchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 118 TGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNV 197 (524) Q Consensus 118 TgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~ 197 (524) ++..+-+--.+ .... . + ..|-+.+ T Consensus 161 ~~~~~~~~~~~--~~~~-----~-----~---------~~~v~Eg----------------------------------- 184 (415) T protein:vir:94 161 TNGSGKYPVVR--QSEV-----A-----A---------LEKVEEL----------------------------------- 184 (415) T ss_pred cCCceeEEEEe--ecCC-----c-----c---------ceecccc----------------------------------- Confidence 87654322111 1110 0 0 0000000 Q ss_pred cccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHH Q lcl|Aclame:pro 198 TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVE 277 (524) Q Consensus 198 ~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~E 277 (524) + +.. ..+...|.+..|.+.|.. -.-.+|-| T Consensus 185 --------~-------------------------------~~~----~~~~~~~~~i~~~~~k~~-------~~~~is~e 214 (415) T protein:vir:94 185 --------E-------------------------------ENP----ELAVKPFFQLAYDINTHR-------GYFRISRE 214 (415) T ss_pred --------c-------------------------------ccc----ccccccceeeEeeheeee-------eechhhHH Confidence 0 000 000112455555555554 44569999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHH Q lcl|Aclame:pro 278 LAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKAL 357 (524) Q Consensus 278 LAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L 357 (524) |.+|.- .|.+++|.+-|...|..-+|+.|+.-.-...-.+... ... ..++ .....+.. ..+....+ T Consensus 215 ll~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~--~~~--~~~~-----~~~~~~~~-~~~~i~~~ 280 (415) T protein:vir:94 215 AIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--GFE--KEGK-----KLEVKKAK-SLDDIKDA 280 (415) T ss_pred HHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccc--ccc--cccc-----cccccccc-chHHHHHH Confidence 999864 4679999999999999999999996432221111000 000 0000 00000000 11222333 Q ss_pred HHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCc-- Q lcl|Aclame:pro 358 LIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ-- 435 (524) Q Consensus 358 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-- 435 (524) +..+ .. ..+ +.+.+|++|.....|..+...- +. -....+.+ ....++|.| ++|++.+..+. T Consensus 281 ~~~~-------~~-~~~-~~~~~vmn~~~~~~l~~lkd~~----G~--~l~~~~~~-~~~~~~l~G-~pV~~~~~~~~~~ 343 (415) T protein:vir:94 281 INLN-------VK-PNY-EHNVAIVSQTMFAKLDKMKDKL----GN--YLIQPDVK-EKTQQRLLG-AKIEILPDEVLGQ 343 (415) T ss_pred HHhh-------hh-hcc-CCCEEEEcHHHHHHHHHhhccC----CC--eeeccCcC-CCCCceecc-eeeEEecccccCC Confidence 3322 21 222 5678999999988887532211 10 00011111 112357777 58888776542 Q ss_pred --ce-EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 436 --DY-FTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 436 --dy-~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) +. +++|--.. . +..... ....+...|-.+++-.+-...|+++.+ +|=+...-.-.. ...-.++...-+ T Consensus 344 ~~~~~i~~gd~~~----~-~~~~~~-~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~-~~~~~~~~~~~~ 415 (415) T protein:vir:94 344 KGNNTLIIGNLKD----A-IVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDD-SERGEGDLGLEA 415 (415) T ss_pred CCccEEEEEehhc----c-EEEEee-cceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEec-cCCCCCccccCC Confidence 11 23331000 0 000000 111222345556677777788888653 441110000000 000112222222 No 42 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=90.41 E-value=0.021 Score=29.79 Aligned_cols=285 Identities=12% Similarity=0.105 Sum_probs=121.8 Q ss_pred cccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccc Q lcl|Aclame:pro 79 IASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTM 157 (524) Q Consensus 79 ~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~ 157 (524) .+..++++.=...-+.+. .+++++.++.+..+++-+.||++++--|--. ... . .+. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~----~~~-----~--------------~a~ 57 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL----ATL-----P--------------EAD 57 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEE----eCC-----c--------------ceE Confidence 122222111111222222 5666677777888889999998765211110 000 0 011 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhh Q lcl|Aclame:pro 158 YSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVA 237 (524) Q Consensus 158 fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~a 237 (524) |-+.+.. T Consensus 58 wv~E~~~------------------------------------------------------------------------- 64 (305) T protein:vir:25 58 WVGESAT------------------------------------------------------------------------- 64 (305) T ss_pred Eeecccc------------------------------------------------------------------------- Confidence 1110000 Q ss_pred hhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Q lcl|Aclame:pro 238 ELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYT 317 (524) Q Consensus 238 Eal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~ 317 (524) . ....++.-..++++++..++..+-...+|-||.+|-. .|.|++|.+-|+..|...++..++.=.-.. T Consensus 65 --~------~~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~ 132 (305) T protein:vir:25 65 --D------PKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIFGTDKP 132 (305) T ss_pred --c------ccccccccccceeeEEeeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHhhhheeccCCC Confidence 0 0001111123445555555556666789999999843 578999999999999999999998410000 Q ss_pred eeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccc Q lcl|Aclame:pro 318 AQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 397 (524) Q Consensus 318 a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~ 397 (524) .++-..........--.... .--.......++.-+.++...+...-. ..+-+|++|.....|..+. T Consensus 133 -----~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~lk--- 198 (305) T protein:vir:25 133 -----ASWVSPALIPAAVTAGQAVE----VVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR--- 198 (305) T ss_pred -----CCcccccccccccccccccc----ccccchhhhHHHHHHHHHHHhhhhccc--ccceeEecHHHHHHHHHhh--- Confidence 00000000000000000000 000111223344444445444444322 3345788999888876321 Q ss_pred cccchhhhcccccccccceeE-EEecCcEEEEecCCCCcc----eEEEE--------EecCCCccceeEeeccccccccc Q lcl|Aclame:pro 398 TPASQGLQKTLNVDTTKAVFA-GVLGGTYKVYIDQYARQD----YFTVG--------FKGDNEMDAGIYYAPYVALTPLR 464 (524) Q Consensus 398 ~~~s~~~~~~~~~d~~~~~~~-G~l~~~~~vy~D~y~~~d----y~~vG--------~KG~~~~~~~~fyaPYv~~~~~~ 464 (524) +..+ ...|. ++|.| ++|+|..+.+.+ -+++| ..+.-+.+- ..+.-+. . T Consensus 199 -d~~G-----------~~i~~~~~l~G-~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~~~~--~ 259 (305) T protein:vir:25 199 -DANG-----------NPVFRDDSFAG-FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKF----LDQATLG--T 259 (305) T ss_pred -ccCC-----------ceeecCCcccc-cceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEE----eeeeeee--c Confidence 1111 11111 46766 689888775432 12222 221111110 0000000 0 Q ss_pred ccCCcc-cc-ceee--eeeeecc-EecCccc-ccCCCccccccccc Q lcl|Aclame:pro 465 GSDPKN-FQ-PVMG--FKTRYGI-GINPFAN-SRSQAPADRITSGM 504 (524) Q Consensus 465 ~~dp~s-~q-P~~~--~~tRY~l-~~nP~~~-~~~~~~~~~i~~~~ 504 (524) .-.+.+ || ..++ ...|||+ +.||=+- ..+..+.+-|+-.. T Consensus 260 ~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 260 GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred CCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 011111 22 1222 4668996 5587332 22222222222111 No 43 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=89.81 E-value=0.024 Score=29.44 Aligned_cols=286 Identities=14% Similarity=0.140 Sum_probs=125.4 Q ss_pred cccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccc Q lcl|Aclame:pro 79 IASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTM 157 (524) Q Consensus 79 ~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~ 157 (524) .+..++. .+ ...|.+. .+++++.+..+..++|.+.||++.+.-|. ++... ..+ . T Consensus 1 m~t~t~g-g~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~~-----~~a--------------~ 55 (303) T protein:vir:97 1 MGTETSK-AS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTLD-----SDI--------------D 55 (303) T ss_pred CcccCCC-Ce-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEecC-----cce--------------E Confidence 2212222 21 2334443 56666777888999999999875433221 11110 000 0 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhh Q lcl|Aclame:pro 158 YSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVA 237 (524) Q Consensus 158 fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~a 237 (524) | .+ T Consensus 56 w-----------------------------------------------------------------------------v~ 58 (303) T protein:vir:97 56 V-----------------------------------------------------------------------------VA 58 (303) T ss_pred E-----------------------------------------------------------------------------ee Confidence 0 00 Q ss_pred hhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Q lcl|Aclame:pro 238 ELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYT 317 (524) Q Consensus 238 Eal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~ 317 (524) | +.++++-..+++.++..+|.-+-...+|-||.|.... ..++-+++|.+-|+..|...|+..++.=.... T Consensus 59 E---------~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~ 128 (303) T protein:vir:97 59 E---------NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGINPR 128 (303) T ss_pred c---------CccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccccC Confidence 1 0112222333455555555555667899999863322 24667899999999999999999988522111 Q ss_pred eeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccc Q lcl|Aclame:pro 318 AQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 397 (524) Q Consensus 318 a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~ 397 (524) .. .+....|...+.......-... ....++.-|.++-+.+.. ..+..+.+|++|.....|..+.. T Consensus 129 ~g--------~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lkd-- 193 (303) T protein:vir:97 129 TK--------KASDVIGTNHFDSKVTQVVKFT---ESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTALAKVTN-- 193 (303) T ss_pred Cc--------cccccccccccccccccccccc---cccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhhc-- Confidence 10 1111111111111000000000 001223344444444432 23455779999999888864321 Q ss_pred cccchhhhcccccccccceeEEEecCcEEEEecCCCCcce-----EEEEEecCCCccceeEeeccc--ccccccccCCcc Q lcl|Aclame:pro 398 TPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDY-----FTVGFKGDNEMDAGIYYAPYV--ALTPLRGSDPKN 470 (524) Q Consensus 398 ~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-----~~vG~KG~~~~~~~~fyaPYv--~~~~~~~~dp~s 470 (524) ..+. .....+.....-.|+|.| ++|+++.+.+... -.+.+-|+- ...+.+...- ++...+..|++. T Consensus 194 --~~g~--~~~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~Gdf--~~~~~~~~~~~~~~~~~~~~~~d~ 266 (303) T protein:vir:97 194 --GEMG--PKMYPELAWGANPDSING-LKSSVNTTVGAGADEAESKDLVIIGDF--ESMFKWGYAKQIPMEIIKYGDPDN 266 (303) T ss_pred --cCCC--eEEecCccCCCCCceecc-eeeEEecccCCccccCCCccEEEEeec--cccEEEEEecCcEEEEeeccCCCC Confidence 1110 001111111112357887 7999988754311 011122221 1111122111 122222223322 Q ss_pred -----ccc-eeee--eeeeccE-ecCcccccCCCccccccccch Q lcl|Aclame:pro 471 -----FQP-VMGF--KTRYGIG-INPFANSRSQAPADRITSGMI 505 (524) Q Consensus 471 -----~qP-~~~~--~tRY~l~-~nP~~~~~~~~~~~~i~~~~~ 505 (524) |+- .++| ..||+.. .||= -..++.++.- T Consensus 267 ~~~~~~~~n~~~~r~~~r~~~~v~~p~-------af~~l~~~~~ 303 (303) T protein:vir:97 267 SGKDLKGYNQIYLRAEAYIGWGILDAK-------SFARVTKGEV 303 (303) T ss_pred cchhhhhcCcEEEEEEEEeccEeeccc-------ceEEeeCCCC Confidence 221 2344 5677754 3441 1123333222 No 44 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=89.54 E-value=0.026 Score=29.30 Aligned_cols=332 Identities=14% Similarity=0.106 Sum_probs=123.9 Q ss_pred CCchH--------------HHHHHhhHh---hccccc-------------c----hhhcchh-HHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MSKKN--------------ELMEKWNDL---LESQEG-------------L----PDIATKS-KKQLVAAILEAQEKDAE 45 (524) Q Consensus 1 m~~~~--------------~l~~kw~p~---l~~~~~-------------~----~~i~~~~-~~~~~~~l~enq~~~~~ 45 (524) |.-.| ++.++...+ ++.... + .+|+.+. +......+.+..++... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~ 80 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSG 80 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 11111 111111111 110000 0 0011100 00000111110000000 Q ss_pred cccc-ccchhhhhhhccccccc-------c----ccc------ccccCccc-cccc--cccccccccCch--hhhHHHHH Q lcl|Aclame:pro 46 TDPV-YRDEKIVESFGGFLAEA-------E----IAG------DHNYDQTN-IASG--KSSGAITNIGPA--VIGMVRRA 102 (524) Q Consensus 46 ~~~~-~~~~~~~~~~~~~l~ea-------~----~~g------~~~~~~~~-~~~s--t~sg~v~~~~P~--li~l~Rra 102 (524) ..+. .......+.+....... . ..+ ........ ...+ +..|.+ .-|. .-.++++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~--~vP~~~~~~ii~~~ 158 (400) T protein:vir:38 81 KKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAAS--TIPETISNTPQREL 158 (400) T ss_pred ccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcc--cccHHHHHHHHHHH Confidence 0000 00000000000000000 0 000 00000000 0011 111111 1122 11344445 Q ss_pred HhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 103 IPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIV 182 (524) Q Consensus 103 ~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~ 182 (524) -++.+..+++.+.||++.++-+--++. . .+. ..|-+. T Consensus 159 ~~~~~l~~~~~~~~~~~~~~~~~~~~~----~---~~~---------------~~~~~E--------------------- 195 (400) T protein:vir:38 159 QTVVDLKPFTNVFQASTQKGTYPTVAN----A---TTK---------------MVTVAE--------------------- 195 (400) T ss_pred HhhhhhhhcceeEeccCcceEEEEEec----C---CCc---------------cccccc--------------------- Confidence 567788899999999887653322211 0 000 000000 Q ss_pred ccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccc-eeEEEEE Q lcl|Aclame:pro 183 YHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEM-AFRIDKQ 261 (524) Q Consensus 183 ~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EM-sFsIEK~ 261 (524) .+. .++. ...++.+ T Consensus 196 ----------------------~~~-------------------------------------------~~~~~~~~f~~i 210 (400) T protein:vir:38 196 ----------------------LEK-------------------------------------------NPAMAKPEFKPV 210 (400) T ss_pred ----------------------ccc-------------------------------------------ccccccccceee Confidence 000 0111 1233445 Q ss_pred EEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeeccccc Q lcl|Aclame:pro 262 VIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDP 341 (524) Q Consensus 262 tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~ 341 (524) +...+.-+-...+|-||.+|- ..|.+++|.+.|+..|...+|+-|+.-... . ...|+..+ T Consensus 211 ~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~--------~-----~~~~~~~~--- 270 (400) T protein:vir:38 211 NWSVETYRQALPVSQESIDDS----AIDLVGLIAQNGQQIKVNTTNGAVATLLKG--------F-----TAKTISSV--- 270 (400) T ss_pred EeehhheeeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc--------c-----cccccccH--- Confidence 555555555778999999985 346788999999999999999988853221 1 11122111 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEe Q lcl|Aclame:pro 342 VDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVL 421 (524) Q Consensus 342 ~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l 421 (524) .....++.... ...+ . ...|++|.....|..+...-..+ ....+.+. ...++| T Consensus 271 ----------~~~~~~~~~~~--------~~~~-~-a~~v~~~~~~~~l~~lkd~~G~~------i~~~~~~~-~~~~~l 323 (400) T protein:vir:38 271 ----------DDLKHINNVDL--------DPAY-S-RVIIASQSFYNFLDTVKDGNGRY------LLQDSILT-PSGKSV 323 (400) T ss_pred ----------HHHHHHHHhhh--------hhhh-C-cEEEEcHHHHHHHHHhhccCCCe------eeecCcCC-CCcccc Confidence 11122222111 1111 2 45788999988887542111000 00111111 112578 Q ss_pred cCcEEEEecCCCCcceEEEEEecCCCccceeEeeccc--------ccccccccCCccccceeeeeeeeccEe-cCccccc Q lcl|Aclame:pro 422 GGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYV--------ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSR 492 (524) Q Consensus 422 ~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv--------~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~ 492 (524) .| ++|++..+.+.. -.| +.-++|+.+- ....++..|-..|+..+-...||+..+ +|-+. T Consensus 324 ~G-~pv~~~~~~~~~-----~~g----~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~-- 391 (400) T protein:vir:38 324 LG-MPIAVVSDDTLG-----AAG----EAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAG-- 391 (400) T ss_pred cc-ceeEEecccccC-----CCC----ceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccce-- Confidence 87 588877664431 111 1112222211 122233456667777888888998654 33111 Q ss_pred CCCccccccccchHH Q lcl|Aclame:pro 493 SQAPADRITSGMISK 507 (524) Q Consensus 493 ~~~~~~~i~~~~~~~ 507 (524) +.+.-.+.. T Consensus 392 ------~~l~~~~~a 400 (400) T protein:vir:38 392 ------YFLTYTPKA 400 (400) T ss_pred ------EEEEeecCC Confidence 111111111 No 45 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=89.34 E-value=0.027 Score=29.19 Aligned_cols=303 Identities=11% Similarity=0.060 Sum_probs=125.0 Q ss_pred cccCccccccccc-ccccc-ccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhh Q lcl|Aclame:pro 72 HNYDQTNIASGKS-SGAIT-NIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAF 148 (524) Q Consensus 72 ~~~~~~~~~~st~-sg~v~-~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf 148 (524) -|+++-+.....+ +.+.. ..-|.++ .+++++..+.+-.+++-+.||++++. +...-.. +.+ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-----~ip~~~~----~~~------- 64 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGI-----VIPHWTG----DVS------- 64 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCce-----EEEEEcC----Ccc------- Confidence 4455544433311 11111 1234443 44555555666777888888876541 1111010 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccc Q lcl|Aclame:pro 149 HPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEI 228 (524) Q Consensus 149 ~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~ 228 (524) +.| T Consensus 65 -------a~w---------------------------------------------------------------------- 67 (397) T protein:vir:23 65 -------AQW---------------------------------------------------------------------- 67 (397) T ss_pred -------eEE---------------------------------------------------------------------- Confidence 000 Q ss_pred cccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 229 SVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINR 308 (524) Q Consensus 229 ~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINr 308 (524) .+| +.++++-..+++++++..|..+-.-.+|-||.+|-. .|.+++|.+.|...|...|++ T Consensus 68 -------v~E---------g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~ 127 (397) T protein:vir:23 68 -------IGE---------GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDN 127 (397) T ss_pred -------ecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHH Confidence 001 012333344566777777777777889999999863 667999999999999999999 Q ss_pred HHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhh Q lcl|Aclame:pro 309 EIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVS 388 (524) Q Consensus 309 eii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~ 388 (524) .+|.=-..-. ...+..+..... .-+... ..+..+..+...+.. .+...+.+|++|+... T Consensus 128 a~l~G~gt~~------------~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~l~~--~~~~~a~~vmn~~~~~ 186 (397) T protein:vir:23 128 AALHGTNAPS------------AFQGYLDQSNKT----QSISPN---AYQGLGVSGLTKLVT--DGKKWTHTLLDDTVEP 186 (397) T ss_pred HHhhcccCCc------------ccccccccccce----eeeccc---chhHHHHHHHHhhhh--cccCCCEEEEcHHHHH Confidence 9984111000 001111110000 000000 011112222222322 2346678999999999 Q ss_pred hhhhhccccccc--chhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccc- Q lcl|Aclame:pro 389 ALARIDSGITPA--SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRG- 465 (524) Q Consensus 389 ~L~~~~~g~~~~--s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~- 465 (524) .|..+...-..+ .+..... .......|+|.| ++|+++++.+.+-+ +++.|+-. .+||.-. ....++. T Consensus 187 ~L~~lkd~~G~~i~~~~~~~~----~~~~~~~~tl~G-~Pv~~s~~~~~g~~-~~~~gDfs---~~~i~~~-~~i~i~~~ 256 (397) T protein:vir:23 187 VLNGSVDANGRPLFVESTYES----LTTPFREGRILG-RPTILSDHVAEGDV-VGYAGDFS---QIIWGQV-GGLSFDVT 256 (397) T ss_pred HHHHhhccCCceeeccccccc----ccccccCceeee-eeEEEeCCCCCCce-EEEEeecc---eEEEEEE-eceEEEEe Confidence 988643221100 0100100 001112367876 79999998765321 11222210 1111100 0001111 Q ss_pred --------cCCcc-----c---cceeeeeeeeccE-ecC--cccccC-CCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 466 --------SDPKN-----F---QPVMGFKTRYGIG-INP--FANSRS-QAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 466 --------~dp~s-----~---qP~~~~~tRY~l~-~nP--~~~~~~-~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) .|+.. | |=.+=+..|++.. .+| |..-.. ..+...+. ...+......+|-+++= T Consensus 257 ~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~ 329 (397) T protein:vir:23 257 DQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYAL------DLDGASAGNFTLSLDGK 329 (397) T ss_pred eeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeee------cccccCcceEEEEecCc Confidence 01100 0 1122223344431 122 110000 00000000 00011122222222222 No 46 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=89.08 E-value=0.028 Score=29.06 Aligned_cols=378 Identities=15% Similarity=0.144 Sum_probs=131.6 Q ss_pred CCchHHHHHHhhHhhcccccchhhcch----------hHH---HHHH--HHHHHHHHHHhccccccchhhhhhhcccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATK----------SKK---QLVA--AILEAQEKDAETDPVYRDEKIVESFGGFLAE 65 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~----------~~~---~~~~--~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e 65 (524) |.- ++|++|++.+++. +-+|.+. .++ .+.+ +=|+.|++.+++... ..........+ T Consensus 1 M~l-~eL~~~r~~~~~~---~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~-----~~~~~~~~~~~ 71 (435) T protein:vir:80 1 MNV-NELRRERAAVNQR---VQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAER-----MAAAAAVPVDP 71 (435) T ss_pred CCH-HHHHHHHHHHHHH---HHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhcccccc Confidence 765 4699998888763 2223211 011 1111 112334443332110 00000000000 Q ss_pred cc-----cccccccCcccccccccc--c----cc--ccc--CchhhhHHHHHHhhhhhhheee--------eecCCchhh Q lcl|Aclame:pro 66 AE-----IAGDHNYDQTNIASGKSS--G----AI--TNI--GPAVIGMVRRAIPNLIAFDICG--------VQPMTGPTG 122 (524) Q Consensus 66 a~-----~~g~~~~~~~~~~~st~s--g----~v--~~~--~P~li~l~Rra~~nLIa~DI~G--------VQPmTgPTG 122 (524) .. -.+.......+..+.... + ++ ... +....-..||....-+...+-. +-|.+-.+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ 151 (435) T protein:vir:80 72 NPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSE 151 (435) T ss_pred hhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHH Confidence 00 000000000000000000 0 00 000 0000000111111111111100 001110111 Q ss_pred hheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|Aclame:pro 123 QVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNV 202 (524) Q Consensus 123 LIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~ 202 (524) +|=.+|... ++. .+. .. ..+..+ +...... T Consensus 152 ii~~l~~~~------------------~i~----~~~---~~----~v~~~~----~~~~~p~----------------- 181 (435) T protein:vir:80 152 VIELLRPKS------------------VVR----KLG---AR----TLPLSN----GNITIPR----------------- 181 (435) T ss_pred HHHHHhhhc------------------hhh----hcc---ce----eeecCC----CceEEEE----------------- Confidence 111111100 000 000 00 000000 0000000 Q ss_pred cccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHH Q lcl|Aclame:pro 203 TVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDL 282 (524) Q Consensus 203 ~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDL 282 (524) .++ .+ . ..-.+| +..+++...++++++...+.-+-....|.||.+|- T Consensus 182 -~~~-~~--------------~--------a~~v~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds 228 (435) T protein:vir:80 182 -LKG-GA--------------I--------VGYIGA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYA 228 (435) T ss_pred -EeC-Cc--------------c--------eeeecc---------CccccccccceeeEEEeeEEEEEeehhhHHHHHhh Confidence 000 00 0 000112 12345556677777777777777888999999994 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHH Q lcl|Aclame:pro 283 RAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQID 362 (524) Q Consensus 283 kAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~ 362 (524) .- +.|.|+.|.+-|+..|...+++-|+. | .|. ...+.|++.......+.... .+.....+...+. T Consensus 229 ~~--~~~l~~~i~~~l~~a~~~~~d~a~l~--------G-~G~---~~~p~Gi~~~~~~~~~~~~~-~~~~~~~~~~d~~ 293 (435) T protein:vir:80 229 GV--NPNVDQIVVGDLTAAIGAREDKAFIR--------D-DGT---ANTPKGLRFWALPGNVITAS-DGSTLQKIETDLG 293 (435) T ss_pred cc--cHHHHHHHHHHHHHHHHHHHHHHhhc--------c-CCC---CCcccceeecccccceeecc-cccchhhHHHHHH Confidence 32 45678899999999999999988875 1 010 01233544332221111110 1111122222233 Q ss_pred HHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc------ Q lcl|Aclame:pro 363 KEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD------ 436 (524) Q Consensus 363 ~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------ 436 (524) +.-..+.....+-.....|++|.....|.....+- +. ....+.+ -|+|.| ++||++.+.|.+ T Consensus 294 ~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~----G~---~l~~~~~----~~~l~G-~pv~~~~~~p~~~~~~~~ 361 (435) T protein:vir:80 294 KAILALENADANLTQPGWIMAPRTFRFLEGLRDGN----GN---KVYPELA----NGMLKG-YPVGKTTQVPINLGEAGK 361 (435) T ss_pred HHHHHhhccccccccCEEEEcHHHHHHHHhhhccC----Cc---eeccCCC----CCeEee-eeeEEeccccccccCCCC Confidence 32222222221224466799999999987643221 10 1111222 256776 699998886532 Q ss_pred --eEE--------EEEecCCCccceeEeecccccccccccCCccc---cceeeeeeeeccEecCcccccCCCcccccccc Q lcl|Aclame:pro 437 --YFT--------VGFKGDNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGINPFANSRSQAPADRITSG 503 (524) Q Consensus 437 --y~~--------vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~---qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~~ 503 (524) -++ ||-.+.-..+ ..+|.-+......--..| +=.+=+.-|+++.+. +.++-.+++| T Consensus 362 ~~~i~~gd~s~~~i~~~~~~~i~----~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~-------~~~a~~~l~~ 430 (435) T protein:vir:80 362 ESEIYFTDFGDVFIGEEETLEID----YSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPR-------HVESIAVLSG 430 (435) T ss_pred cceEEEEEcccEEEEeecceEEE----EeccccccccccchhhhhhcCcceeeeeeeeCcEee-------cccceEEEec Confidence 122 2322222111 111111000000000001 122234456664441 1122234455 Q ss_pred chHHh Q lcl|Aclame:pro 504 MISKE 508 (524) Q Consensus 504 ~~~~~ 508 (524) -.|.. T Consensus 431 ~~~~~ 435 (435) T protein:vir:80 431 VAWGA 435 (435) T ss_pred cCCCC Confidence 55554 No 47 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=88.10 E-value=0.034 Score=28.61 Aligned_cols=359 Identities=15% Similarity=0.112 Sum_probs=142.1 Q ss_pred CCchHHHHHHhhHhhccccc-chhhcch------h-HHHHHHHH--HHHHHHHHhccc---------------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG-LPDIATK------S-KKQLVAAI--LEAQEKDAETDP---------------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~-~~~i~~~------~-~~~~~~~l--~enq~~~~~~~~---------------------- 48 (524) |-..++|.++=..+.+..+. +-+++.. . .+.+...+ |+.|.+.+.... T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 65555444443333221000 0000000 0 00111000 112211111100 Q ss_pred -cccchhhhhhhccccccccccc----------ccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeee Q lcl|Aclame:pro 49 -VYRDEKIVESFGGFLAEAEIAG----------DHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQ 115 (524) Q Consensus 49 -~~~~~~~~~~~~~~l~ea~~~g----------~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQ 115 (524) ..++.......+..+.+..... ..+... .+.++++.+-..-=|..+ .+++.+.+...-.++|.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI--QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh--hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 0000000000011110000000 000000 001111111111124332 4666677788889999999 Q ss_pred cCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 116 PMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQ 195 (524) Q Consensus 116 PmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~ 195 (524) ||+++++-+.-.+ .. .+. + ..|- T Consensus 159 ~~~~~~~~~~~~~-----~~--~~~-----~---------~~~v------------------------------------ 181 (415) T protein:vir:47 159 RVTNGSGKYPVVR-----QS--EVA-----A---------LEKV------------------------------------ 181 (415) T ss_pred eccCCceeEEEEE-----ec--CCc-----c---------eeec------------------------------------ Confidence 9999876432221 10 000 0 0000 Q ss_pred cccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccce-eEEEEEEEEeecccccccc Q lcl|Aclame:pro 196 NVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMA-FRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 196 ~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMs-FsIEK~tVtAKSRALKAEY 274 (524) +| +..+++.+ -++++++..++..+-...+ T Consensus 182 -----------------------------------------~E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~i 211 (415) T protein:vir:47 182 -----------------------------------------EE---------LEENPELAVKPFFQLAYDINTHRGYFRI 211 (415) T ss_pred -----------------------------------------cc---------ccccccccccceeeEEeeeeeeEeeehh Confidence 00 01122222 2455566666666666789 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |-||.+|-. .|.+++|.+-|+..|..-+|+.|+.-.-.....+... .... ....+.-. +.. ..+-. T Consensus 212 S~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~--~~~~-~~~~~~~~------~~~-~~~~i 277 (415) T protein:vir:47 212 SREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--GFEK-EGKKLEVK------KAK-SLDDI 277 (415) T ss_pred hHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccc--cccc-ccceeccc------ccc-chHHH Confidence 999999843 5679999999999999999999996432211111100 0000 00111000 001 11223 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR 434 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 434 (524) ..|+..+.. .+.+.+.+|++|.....|..+...- +. -....+.+. ...++|.| ++|++.++.+ T Consensus 278 ~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~lkd~~----G~--~i~~~~~~~-~~~~~l~G-~pV~~~~~~~ 340 (415) T protein:vir:47 278 KDAINLNVK---------PNYEHNVAIVSQTMFAKLDKMKDKL----GN--YLIQPDVKE-KTQQRLLG-AKIEILPDEV 340 (415) T ss_pred HHHHHhhhh---------hccCCCEEEEcHHHHHHHHHhhccC----CC--eeeccCcCC-CCCccccc-eeeEEecccc Confidence 334333332 2235678999999988887532211 11 000111111 11357777 5888766554 Q ss_pred cceEEEEEecCCCccceeEeeccc--------ccccccccCCccccceeeeeeeeccEe-cCccc-ccCCCccccccccc Q lcl|Aclame:pro 435 QDYFTVGFKGDNEMDAGIYYAPYV--------ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAN-SRSQAPADRITSGM 504 (524) Q Consensus 435 ~dy~~vG~KG~~~~~~~~fyaPYv--------~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~-~~~~~~~~~i~~~~ 504 (524) . |-.| +..++|+.|- ....+...|-.+++-.+-...|++..+ +|=+. ..+-... .--.+ T Consensus 341 ~-----~~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~--~~~~~ 409 (415) T protein:vir:47 341 L-----GQKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS--ERGEG 409 (415) T ss_pred c-----cCCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEeecc--CCCCC Confidence 2 1001 1112222211 111122234566677777888988643 44111 0000000 00112 Q ss_pred hHHhhc Q lcl|Aclame:pro 505 ISKEMC 510 (524) Q Consensus 505 ~~~~~a 510 (524) +...-+ T Consensus 410 ~~~~~~ 415 (415) T protein:vir:47 410 DLGLEA 415 (415) T ss_pred CccCCC Confidence 222222 No 48 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=88.10 E-value=0.034 Score=28.61 Aligned_cols=359 Identities=15% Similarity=0.112 Sum_probs=142.1 Q ss_pred CCchHHHHHHhhHhhccccc-chhhcch------h-HHHHHHHH--HHHHHHHHhccc---------------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG-LPDIATK------S-KKQLVAAI--LEAQEKDAETDP---------------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~-~~~i~~~------~-~~~~~~~l--~enq~~~~~~~~---------------------- 48 (524) |-..++|.++=..+.+..+. +-+++.. . .+.+...+ |+.|.+.+.... T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 65555444443333221000 0000000 0 00111000 112211111100 Q ss_pred -cccchhhhhhhccccccccccc----------ccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeee Q lcl|Aclame:pro 49 -VYRDEKIVESFGGFLAEAEIAG----------DHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQ 115 (524) Q Consensus 49 -~~~~~~~~~~~~~~l~ea~~~g----------~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQ 115 (524) ..++.......+..+.+..... ..+... .+.++++.+-..-=|..+ .+++.+.+...-.++|.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI--QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh--hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 0000000000011110000000 000000 001111111111124332 4666677788889999999 Q ss_pred cCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 116 PMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQ 195 (524) Q Consensus 116 PmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~ 195 (524) ||+++++-+.-.+ .. .+. + ..|- T Consensus 159 ~~~~~~~~~~~~~-----~~--~~~-----~---------~~~v------------------------------------ 181 (415) T protein:vir:46 159 RVTNGSGKYPVVR-----QS--EVA-----A---------LEKV------------------------------------ 181 (415) T ss_pred eccCCceeEEEEE-----ec--CCc-----c---------eeec------------------------------------ Confidence 9999876432221 10 000 0 0000 Q ss_pred cccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccce-eEEEEEEEEeecccccccc Q lcl|Aclame:pro 196 NVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMA-FRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 196 ~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMs-FsIEK~tVtAKSRALKAEY 274 (524) +| +..+++.+ -++++++..++..+-...+ T Consensus 182 -----------------------------------------~E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~i 211 (415) T protein:vir:46 182 -----------------------------------------EE---------LEENPELAVKPFFQLAYDINTHRGYFRI 211 (415) T ss_pred -----------------------------------------cc---------ccccccccccceeeEEeeeeeeEeeehh Confidence 00 01122222 2455566666666666789 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |-||.+|-. .|.+++|.+-|+..|..-+|+.|+.-.-.....+... .... ....+.-. +.. ..+-. T Consensus 212 S~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~--~~~~-~~~~~~~~------~~~-~~~~i 277 (415) T protein:vir:46 212 SREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--GFEK-EGKKLEVK------KAK-SLDDI 277 (415) T ss_pred hHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccc--cccc-ccceeccc------ccc-chHHH Confidence 999999843 5679999999999999999999996432211111100 0000 00111000 001 11223 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR 434 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 434 (524) ..|+..+.. .+.+.+.+|++|.....|..+...- +. -....+.+. ...++|.| ++|++.++.+ T Consensus 278 ~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~lkd~~----G~--~i~~~~~~~-~~~~~l~G-~pV~~~~~~~ 340 (415) T protein:vir:46 278 KDAINLNVK---------PNYEHNVAIVSQTMFAKLDKMKDKL----GN--YLIQPDVKE-KTQQRLLG-AKIEILPDEV 340 (415) T ss_pred HHHHHhhhh---------hccCCCEEEEcHHHHHHHHHhhccC----CC--eeeccCcCC-CCCccccc-eeeEEecccc Confidence 334333332 2235678999999988887532211 11 000111111 11357777 5888766554 Q ss_pred cceEEEEEecCCCccceeEeeccc--------ccccccccCCccccceeeeeeeeccEe-cCccc-ccCCCccccccccc Q lcl|Aclame:pro 435 QDYFTVGFKGDNEMDAGIYYAPYV--------ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAN-SRSQAPADRITSGM 504 (524) Q Consensus 435 ~dy~~vG~KG~~~~~~~~fyaPYv--------~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~-~~~~~~~~~i~~~~ 504 (524) . |-.| +..++|+.|- ....+...|-.+++-.+-...|++..+ +|=+. ..+-... .--.+ T Consensus 341 ~-----~~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~--~~~~~ 409 (415) T protein:vir:46 341 L-----GQKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS--ERGEG 409 (415) T ss_pred c-----cCCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEeecc--CCCCC Confidence 2 1001 1112222211 111122234566677777888988643 44111 0000000 00112 Q ss_pred hHHhhc Q lcl|Aclame:pro 505 ISKEMC 510 (524) Q Consensus 505 ~~~~~a 510 (524) +...-+ T Consensus 410 ~~~~~~ 415 (415) T protein:vir:46 410 DLGLEA 415 (415) T ss_pred CccCCC Confidence 222222 No 49 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=87.96 E-value=0.035 Score=28.54 Aligned_cols=307 Identities=14% Similarity=0.069 Sum_probs=118.1 Q ss_pred cccccccccccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCC Q lcl|Aclame:pro 60 GGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLA 137 (524) Q Consensus 60 ~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~ 137 (524) =.-|+|-... ..|.+..+-..+..++ .. |--+ .+++.+.++.+..+++-+.||++..- ++..... T Consensus 1 ~a~l~el~~~-~~~~~~~g~~~~~~~~---li-P~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~-------~~p~~~~- 67 (333) T protein:vir:78 1 MATLNELLPN-SAGSNHQGRLAHVPSD---LL-PKEIVGPIFDKAQESSLVLRMGEQIPISYGET-------IIPTTVK- 67 (333) T ss_pred CchhHHhhhh-cccccccCceecCCcc---cc-chhHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEeC- Confidence 0122222100 0011111111111111 11 3322 45666667778888899999875221 1111100 Q ss_pred cccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccc Q lcl|Aclame:pro 138 GGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVI 217 (524) Q Consensus 138 ~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~ 217 (524) . +.+.|-+.+ T Consensus 68 -~--------------~~a~~v~eg------------------------------------------------------- 77 (333) T protein:vir:78 68 -R--------------PEVGQVGVG------------------------------------------------------- 77 (333) T ss_pred -C--------------ceeEeecCc------------------------------------------------------- Confidence 0 011111100 Q ss_pred ccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHH Q lcl|Aclame:pro 218 AENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAI 297 (524) Q Consensus 218 ~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnI 297 (524) -....+|.-. -..+...|.+..++..|..+ -...|-||.+|-. .|.+++|.+. T Consensus 78 --------------~~~~~~e~~~--~~~~~~~f~~i~l~~~kl~~-------~~~is~ell~~s~----~~~~~~i~~~ 130 (333) T protein:vir:78 78 --------------TSNEQREGGL--KPLSGTAWDTRSVSPIKLAT-------IVTVSEEFARMNP----SGLYTKLQGD 130 (333) T ss_pred --------------cccccccccc--ccccccceeEEEEeeEEEEE-------eehhhHHHHhcCH----HHHHHHHHHH Confidence 0000011000 00112235555555555544 4557888888754 4679999999 Q ss_pred HHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCC Q lcl|Aclame:pro 298 LATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAG 377 (524) Q Consensus 298 LStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~g 377 (524) |...|...|+..||.=-......+..|+.+. .++... .............+..|.++-..+...-.+ .. T Consensus 131 la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~----~~~~~~------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~~ 199 (333) T protein:vir:78 131 LAYAIGRGIDLAVFHGKSPLTGSALQGIDTD----NVIANT------TNVDYLQETGDPLLDRLLDGYDLVSANTDV-EF 199 (333) T ss_pred HHHHHHHHHHHHHhcccCCCCCccccccccc----cccccc------ccccccccccchhHHHHHHHHHhhcccccc-Cc Confidence 9999999999999852111111111111111 011000 000000011111222233333333333333 56 Q ss_pred CEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc---------eEEEE------- Q lcl|Aclame:pro 378 NFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD---------YFTVG------- 441 (524) Q Consensus 378 n~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG------- 441 (524) +.+|+.|.....|..+.. .-+..+. .....+....-.|+|.| ++|+++.+.+.+ .+++| T Consensus 200 ~~~vmn~~~~~~L~~~~~-~~d~~G~---~i~~~~~~~~~~~~l~G-~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~ 274 (333) T protein:vir:78 200 NGWAVDPRFRAHLLRAQA-YRDANGN---VDPSRINLAAQTGDVLG-LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKF 274 (333) T ss_pred eEEEEcchHHHHHHHHhh-hcCCCCc---eeecCccccCCCceeec-eeeEEccccCCCccccCCCccEEEEEecccEEE Confidence 788899988777653210 0000000 00000000111267887 699998876544 23333 Q ss_pred -EecCCCccceeEeecccccccccccCCcccc-cee--eeeeeeccE-ecC--cccc-cCCCc Q lcl|Aclame:pro 442 -FKGDNEMDAGIYYAPYVALTPLRGSDPKNFQ-PVM--GFKTRYGIG-INP--FANS-RSQAP 496 (524) Q Consensus 442 -~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~q-P~~--~~~tRY~l~-~nP--~~~~-~~~~~ 496 (524) ..+..+. -..+|.-.......--.-|| -.+ =...|++.. .+| |..- ...+| T Consensus 275 g~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 275 GFADEIRI----KMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEeeccEE----EEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 2222111 11222110000000000111 112 234577744 555 3221 11122 No 50 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=87.87 E-value=0.036 Score=28.50 Aligned_cols=270 Identities=13% Similarity=0.030 Sum_probs=117.1 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccc Q lcl|Aclame:pro 151 MFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISV 230 (524) Q Consensus 151 ~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~ 230 (524) |- ++. +..+.+-...-.+. .+. ..... ..+..+......+-.. . .|....+.. T Consensus 1 MA--~~~-------T~~~~~~iPev~s~-~v~-~~~~~------~~~~~~~~~~~~~~~g---------~-~G~tv~iP~ 53 (272) T protein:vir:98 1 MA--VGT-------TKMAQMLDPEVLAD-MID-AEVGK------AIRFAPLAEVDTTLEG---------Q-PGTTLTVPK 53 (272) T ss_pred CC--Ccc-------ccchheechHHHHH-HHH-HHHHH------HhhhhccccccccccC---------C-CCCEEEEEE Confidence 10 000 00000000000000 000 00000 0000000100000000 0 000000000 Q ss_pred cccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 231 GMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREI 310 (524) Q Consensus 231 GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINrei 310 (524) --....++-.. . +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|..+|+++| T Consensus 54 ~~~~~~a~~v~---e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i 124 (272) T protein:vir:98 54 WDYIGDAEDVA---E--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADV 124 (272) T ss_pred ecCCCCccccc---C--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHH Confidence 00001111110 0 1223333455777788888887666777666533 2478999999999999999999999 Q ss_pred HhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhh Q lcl|Aclame:pro 311 VDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 390 (524) Q Consensus 311 i~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L 390 (524) +..+...... .. +... .+-+-.+..++.++ ....+++||+|.++..| T Consensus 125 ~~~~~~a~~~-~~----------~~~t-------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L 171 (272) T protein:vir:98 125 LDALSKSTQT-VE----------ATAT-------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTL 171 (272) T ss_pred HHHhcccccc-cc----------cccC-------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHH Confidence 9765332110 00 1000 12223333333322 23568999999999988 Q ss_pred hhhc-ccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCc Q lcl|Aclame:pro 391 ARID-SGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPK 469 (524) Q Consensus 391 ~~~~-~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~ 469 (524) .... .-+..++.... +.......|.+.| ++|+++++.+.+=+++.-+|.- +++-.. +.......|+. T Consensus 172 ~k~~~~~~~~~~~~~~-----~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~--~~~ve~~r~~~ 239 (272) T protein:vir:98 172 RLDAAKEWLGATEVGA-----NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKR--NTMVETDRDIT 239 (272) T ss_pred HHhccccccccccccc-----cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecC--Cceeeeccccc Confidence 6421 11111211111 1111223577877 7999999998655444333311 111111 11222235888 Q ss_pred cccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccch Q lcl|Aclame:pro 470 NFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKN 513 (524) Q Consensus 470 s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~ 513 (524) +++=.+-..-|||+.+ ||- ...+++- +.|+|- T Consensus 240 ~~~~~i~~~~~~~~~v~~~~-------~vv~~t~-----~~a~~~ 272 (272) T protein:vir:98 240 KAINQIVANKHYGVYLYKAE-------KAVKITL-----KDAAKK 272 (272) T ss_pred cceeEEEEEEEEEEEEEcCC-------ceEEEEe-----cccccC Confidence 8988888888999753 331 0111111 123333 No 51 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=87.87 E-value=0.036 Score=28.50 Aligned_cols=270 Identities=13% Similarity=0.030 Sum_probs=117.1 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccc Q lcl|Aclame:pro 151 MFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISV 230 (524) Q Consensus 151 ~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~ 230 (524) |- ++. +..+.+-...-.+. .+. ..... ..+..+......+-.. . .|....+.. T Consensus 1 MA--~~~-------T~~~~~~iPev~s~-~v~-~~~~~------~~~~~~~~~~~~~~~g---------~-~G~tv~iP~ 53 (272) T protein:vir:30 1 MA--VGT-------TKMAQMLDPEVLAD-MID-AEVGK------AIRFAPLAEVDTTLEG---------Q-PGTTLTVPK 53 (272) T ss_pred CC--Ccc-------ccchheechHHHHH-HHH-HHHHH------HhhhhccccccccccC---------C-CCCEEEEEE Confidence 10 000 00000000000000 000 00000 0000000100000000 0 000000000 Q ss_pred cccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 231 GMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREI 310 (524) Q Consensus 231 GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINrei 310 (524) --....++-.. . +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|..+|+++| T Consensus 54 ~~~~~~a~~v~---e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i 124 (272) T protein:vir:30 54 WDYIGDAEDVA---E--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADV 124 (272) T ss_pred ecCCCCccccc---C--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHH Confidence 00001111110 0 1223333455777788888887666777666533 2478999999999999999999999 Q ss_pred HhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhh Q lcl|Aclame:pro 311 VDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 390 (524) Q Consensus 311 i~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L 390 (524) +..+...... .. +... .+-+-.+..++.++ ....+++||+|.++..| T Consensus 125 ~~~~~~a~~~-~~----------~~~t-------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L 171 (272) T protein:vir:30 125 LDALSKSTQT-VE----------ATAT-------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTL 171 (272) T ss_pred HHHhcccccc-cc----------cccC-------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHH Confidence 9765332110 00 1000 12223333333322 23568999999999988 Q ss_pred hhhc-ccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCc Q lcl|Aclame:pro 391 ARID-SGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPK 469 (524) Q Consensus 391 ~~~~-~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~ 469 (524) .... .-+..++.... +.......|.+.| ++|+++++.+.+=+++.-+|.- +++-.. +.......|+. T Consensus 172 ~k~~~~~~~~~~~~~~-----~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~--~~~ve~~r~~~ 239 (272) T protein:vir:30 172 RLDAAKEWLGATEVGA-----NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKR--NTMVETDRDIT 239 (272) T ss_pred HHhccccccccccccc-----cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecC--Cceeeeccccc Confidence 6421 11111211111 1111223577877 7999999998655444333311 111111 11222235888 Q ss_pred cccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccch Q lcl|Aclame:pro 470 NFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKN 513 (524) Q Consensus 470 s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~ 513 (524) +++=.+-..-|||+.+ ||- ...+++- +.|+|- T Consensus 240 ~~~~~i~~~~~~~~~v~~~~-------~vv~~t~-----~~a~~~ 272 (272) T protein:vir:30 240 KAINQIVANKHYGVYLYKAE-------KAVKITL-----KDAAKK 272 (272) T ss_pred cceeEEEEEEEEEEEEEcCC-------ceEEEEe-----cccccC Confidence 8988888888999753 331 0111111 123333 No 52 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=86.96 E-value=0.042 Score=28.14 Aligned_cols=296 Identities=13% Similarity=0.044 Sum_probs=106.5 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccc---cccc-ccCCcccccCcccccccccccccccc-c Q lcl|Aclame:pro 151 MFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF---QNVT-SGNVTVTGADPAALDAAVIAENEKGT-L 225 (524) Q Consensus 151 ~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~---~~~~-~g~~~~tgt~p~~~~~~~~~~~~~g~-~ 225 (524) |.+ .+.|.-. ...... .++..+.+-+...... .-...+ .... .......... ... . T Consensus 1 ~~~-~~~~~~~--~~~~~~--t~~~~~~~~ip~~~~~-~ii~~~~~~s~l~~~~~~~~~~~~-------------~~~~p 61 (320) T protein:vir:10 1 MAA-GTAFQVD--HAQIAQ--TGDTMFKGYLEPEQAK-DYFAEAEKTSIVQQFAQKVPMGTT-------------GQKIP 61 (320) T ss_pred CCC-CccCCHH--HHHhhc--cccccccccccHHHHH-HHHHHHHhccchhhhcceeeccCC-------------ceEEE Confidence 211 1111100 000000 0000000000000000 000000 0000 0000000000 000 0 Q ss_pred ccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHH Q lcl|Aclame:pro 226 AEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLE 305 (524) Q Consensus 226 ~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~E 305 (524) ...+.+-....+| +..+++-..+++++++..|..+-...+|.||.+|-. .|.++.|.+.|...|... T Consensus 62 ~~~~~~~a~~v~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~a~a~~ 128 (320) T protein:vir:10 62 HWIGDVSAQWIGE---------GDMKPITKGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMA 128 (320) T ss_pred EEeCCcceEEecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHHHHHHHHHHHHH Confidence 0011111122223 234667677778888999999999999999999865 567999999999999999 Q ss_pred hhHHHHh-hhhhheeeeeeccccc-cCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEc Q lcl|Aclame:pro 306 INREIVD-LINYTAQVGKSGFTQT-VGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIAS 383 (524) Q Consensus 306 INreii~-~i~~~a~~~~~g~~~~-~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S 383 (524) +|+.|+. .=..... +-.+..+. +....+.... ..-+..+ .+ +..+...+. ..+.....+||+ T Consensus 129 ~d~a~l~G~g~~~~~-~~~~~~~~~~~~~~~~~~~-------~~~~~~~---~~---~~~~~~~~~--~~~~~~~~~v~n 192 (320) T protein:vir:10 129 FDSAALNGTDSPFPT-YLAQTTKSVSLADPGGATA-------SDLTAYD---AV---AVNGLSLLV--NAKKKWTHTLLD 192 (320) T ss_pred HHHHhhcccCCCCCc-ccccccccccceecccccc-------cccccHH---HH---HHHHHhhhh--cccCCCcEEEEc Confidence 9999874 1000000 00000000 0000111000 0011111 11 112222222 233355789999 Q ss_pred hhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccc Q lcl|Aclame:pro 384 RNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPL 463 (524) Q Consensus 384 ~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~ 463 (524) |.....|......-.. +-..............-++|.| ++|++++..+.+=.. ++-|+-. .+++.-+-..... T Consensus 193 ~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~~~~~~~i~g-~pv~~~~~~~~~~~~-~~~gd~~---~~~~~~~~~~~i~ 265 (320) T protein:vir:10 193 DIVEPILNGAKDKNGR--PLFIESTYTDENSPFRAGRIVS-RPTILSDHVADGTTV-GYMGDFR---NVIWGQVGGLSFD 265 (320) T ss_pred HHHHHHHHHhhccCCc--eeeccccccCccccccCceeee-eeeEecCCCCCCceE-EEEeecc---eEEEEEecCeEEE Confidence 9999998753222100 0000000011111222355655 799999887654211 1112111 1122211111110 Q ss_pred c--------ccCCcc-----cc---ceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccch Q lcl|Aclame:pro 464 R--------GSDPKN-----FQ---PVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKN 513 (524) Q Consensus 464 ~--------~~dp~s-----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~ 513 (524) . ..|+.. || =.+=...|+++.+ +|=+ ..++..-. |-+. T Consensus 266 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a-------~~~l~~~~-----ap~~ 320 (320) T protein:vir:10 266 VTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDA-------FVKLTNVV-----TPDA 320 (320) T ss_pred EeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccc-------eEEEEecc-----CCCC Confidence 0 011111 11 1122335666433 2311 11111100 0001 No 53 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=86.16 E-value=0.048 Score=27.84 Aligned_cols=270 Identities=10% Similarity=0.016 Sum_probs=114.8 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -++.. +.-..-.-.|-+.++..... - ......+.......-+. T Consensus 1 ma~~~-T~~~d~i~Pev~s~~v~~~~--~----------------------------------~~~~~~~~~~~~~~l~g 43 (274) T protein:vir:96 1 MAQGT-TKVSNLIVPEVLAPMMQAEL--D----------------------------------KKLRFAQFADIDSTLVG 43 (274) T ss_pred CCccc-cchhhhhhhHHHHHHHHHHH--H----------------------------------hhhhhcccccccccccC Confidence 00000 00000011121111100000 0 00000000000000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCh Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 290 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDA 290 (524) ..|....+..-=.+..+|.. .....-++.++.+. ..+++.+-|+-.-+++=|. ++..+-|. T Consensus 44 ----------~~G~tv~ip~~~~~g~~~~~---~~g~~i~~~~it~~--~~~~~i~~~~~~~~i~D~~----~~~~~~d~ 104 (274) T protein:vir:96 44 ----------QPGDTLTFPAFTYSGDAQVI---AEGEKIPVDQIGTS--KREAKVRKIGKGTELTDEA----VLSGFGDP 104 (274) T ss_pred ----------CCCCEEEEEeeccCCCcccc---CCCCcCchhhcccc--eeEEEEEeeeceeeecHHH----HHhhcchH Confidence 00111111100001111111 11112234444433 3344445454222333222 12236788 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 291 DAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIAR 370 (524) Q Consensus 291 EaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~ 370 (524) -.+..+-++..++.+++++++..+....... .+.. ...+.+-.+..++.++. T Consensus 105 ~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~----------~~~~-------------~~~d~i~dA~~~l~d~~----- 156 (274) T protein:vir:96 105 QGEAVRQHGLAIANKVDNDVLEALKGATLTV----------EADI-------------TKLDGLQTAIDKFNDED----- 156 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc----------Cccc-------------ccHHHHHHHHHHhcccC----- Confidence 8999999999999999999998764422110 0011 11333444444444322 Q ss_pred hccccCCCEEEEchhhhhhhhhhcc-cccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE-EEEEecCCCc Q lcl|Aclame:pro 371 QTGRGAGNFIIASRNVVSALARIDS-GITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF-TVGFKGDNEM 448 (524) Q Consensus 371 ~T~~g~gn~~v~S~~va~~L~~~~~-g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~-~vG~KG~~~~ 448 (524) ..+++++|+|.+++.|..... -+.+++...+ ........|.+.| ++||+|...|..=. ++| +|.-. T Consensus 157 ----~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~-----~~~~~g~ig~~~G-~~Vi~s~~~p~~t~~l~~-~gA~~- 224 (274) T protein:vir:96 157 ----LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGD-----NIIVKGAFGEALG-AVIVRSNKLNKGEALLAK-KGAVK- 224 (274) T ss_pred ----CCceEEEeCHHHHHHHHhcccccccccccccc-----cceeecccceecC-eeEEEcCCCCcceEEEEe-Cccee- Confidence 256899999999999975321 2323332211 1111224678876 89999999886432 222 22211 Q ss_pred cceeEeecccccccccccCCccccceeeeeeeeccEe-cC-cccccCCCcccccc Q lcl|Aclame:pro 449 DAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NP-FANSRSQAPADRIT 501 (524) Q Consensus 449 ~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP-~~~~~~~~~~~~i~ 501 (524) |+.. -+...-...|+..++-.+-...+||+.+ || =....+.+.+.++. T Consensus 225 ----~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 225 ----LITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred ----eeec-CCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 1111 1122222469999999999999999865 55 11112222222222 No 54 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=86.08 E-value=0.048 Score=27.81 Aligned_cols=269 Identities=10% Similarity=-0.006 Sum_probs=117.8 Q ss_pred ccccccccccccccc-ccc-ccccc-ccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCC Q lcl|Aclame:pro 170 ITTGTAIATGAIVYH-IFQ-ETGIA-YFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGS 246 (524) Q Consensus 170 ~~~gta~~~g~~~~~-~~~-~~~~~-~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggs 246 (524) .++.. +.-.+.... .+. ..... .......+.......-+ . ..|....+..-=.+..++.. ... T Consensus 1 ma~~~-T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~---------g-~~G~tv~ip~~~~~g~~~~~---~eg 66 (274) T protein:vir:93 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ---------G-QPGDTLTFPAFVYSGDAQVV---AEG 66 (274) T ss_pred CCccc-eehhheechHHHHHHHHHHHHhhhhhccccccccccc---------C-CCCCEEEEEeeccCCCcccc---cCC Confidence 11100 000000000 000 00000 00001111111110000 0 01111111110011122221 111 Q ss_pred CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccc Q lcl|Aclame:pro 247 SANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFT 326 (524) Q Consensus 247 s~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~ 326 (524) ..-++.++. ..+.+++-|-|+-.=+++=| +.+.+ +-|.-.+..+-++..+...++++++..+...... + T Consensus 67 ~~i~~~~it--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~-~---- 135 (274) T protein:vir:93 67 EKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-V---- 135 (274) T ss_pred Ccccccccc--cceeEEEeeeecccccccHH--HHHhh--ccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-c---- Confidence 122344444 44555555666532233333 22333 5788999999999999999999999766432211 0 Q ss_pred cccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhc-ccccccchhhh Q lcl|Aclame:pro 327 QTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID-SGITPASQGLQ 405 (524) Q Consensus 327 ~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~-~g~~~~s~~~~ 405 (524) . +.. ...+-+-.+..++.++. ..+++++|+|.+++.|..-. -.+.+++...+ T Consensus 136 ---~--~~~-------------~~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~ 188 (274) T protein:vir:93 136 ---N--ADI-------------TKLNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDASTNFTRATELGD 188 (274) T ss_pred ---c--ccc-------------cCHHHHHHHHHHhhhcc---------CCccEEEeCHHHHHHHHhhhhhcccccccccc Confidence 0 011 12233444444444321 25689999999999997410 12333332111 Q ss_pred cccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe Q lcl|Aclame:pro 406 KTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI 485 (524) Q Consensus 406 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~ 485 (524) +...+...|.+.| ++||+|+..|..-..+.-+|. +-|.---+.......|++++.=.+-...|||+.+ T Consensus 189 -----~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~ga------i~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~ 256 (274) T protein:vir:93 189 -----DIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYVAYL 256 (274) T ss_pred -----cceeecccceecC-eeEEEcCCCCcceEEEEeCCe------EEEEecCCcccccccchhhcccEEEEEEEEEEEE Confidence 1122334678876 899999998865433333332 1121111222223469999999999999999764 Q ss_pred -cCcccccCCCccccccccchHHhhccchhh Q lcl|Aclame:pro 486 -NPFANSRSQAPADRITSGMISKEMCGKNAY 515 (524) Q Consensus 486 -nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~ 515 (524) || ..-..+.. .+++=.| T Consensus 257 ~~~-------~~~v~~t~------~~~s~~~ 274 (274) T protein:vir:93 257 YDE-------SKAVKITK------GSGSLEM 274 (274) T ss_pred EcC-------CceEEEee------CccccCC Confidence 33 01111111 1122222 No 55 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=85.38 E-value=0.053 Score=27.57 Aligned_cols=346 Identities=12% Similarity=0.124 Sum_probs=124.4 Q ss_pred CCchHHHHHHhhHhhc-----------ccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhccccccccc- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLE-----------SQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEI- 68 (524) Q Consensus 1 m~~~~~l~~kw~p~l~-----------~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~- 68 (524) .-......+|...-+. ..+..+.+... ....+... .+....+... ..+..++.+... T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~----~~~~~~~~-~~~~~~~~e~------~~~~~~~~~~~~~ 149 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDS----VAKALYGT-QENFEDEVEK------LVLLSYVMEKGVF 149 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh----hhccchhh-hhhHHHHHHH------HHHHHHHHhhccc Confidence 0001111222211111 00000000000 00000000 0000000000 000000000000 Q ss_pred ccccc---cCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccch Q lcl|Aclame:pro 69 AGDHN---YDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADV 144 (524) Q Consensus 69 ~g~~~---~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~ 144 (524) -.... ..+.+.+.+.+.|.. ..-|.+. .++.++.++.+..++|-++||+++..-++ -.. .+. T Consensus 150 ~~~~~~~~~~a~~~~~~~~~g~~-~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-------~~~--~~~---- 215 (458) T protein:vir:10 150 ETEHGQRHLKAVNQSSSVEVSSE-SYETIFSQRIIRDLQKELVVGALFEELPMSSKILTML-------VEP--DAG---- 215 (458) T ss_pred hhhhhhhhhhhhhhcccCccccc-eehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEE-------Eec--CCc---- Confidence 00000 000000001111111 1112222 44555667778899999999987642111 110 000 Q ss_pred hhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccc Q lcl|Aclame:pro 145 REAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGT 224 (524) Q Consensus 145 nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~ 224 (524) .+.|-+.+.. .+. T Consensus 216 ----------~a~~v~e~~~-------------------------------------------~~~-------------- 228 (458) T protein:vir:10 216 ----------KATWVAASTY-------------------------------------------GTD-------------- 228 (458) T ss_pred ----------ceeecccccc-------------------------------------------ccc-------------- Confidence 0111100000 000 Q ss_pred cccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|Aclame:pro 225 LAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIML 304 (524) Q Consensus 225 ~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~ 304 (524) +.. ...-..+++++++.++.-+....+|-||.+|-- .|.+++|.+-|...|.. T Consensus 229 ---------~~~--------------~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i~~ 281 (458) T protein:vir:10 229 ---------TTT--------------GEEVKGALKEIHFSTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAV 281 (458) T ss_pred ---------ccc--------------cccccccceeeEeeeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHHHH Confidence 000 000112234555555555666789999988832 46789999999999999 Q ss_pred HhhHHHHhhhhhheeeeeeccccccCccceeeccccccc------ccccchHHHHHHHHHHHHHHHHHHHHHhccccCCC Q lcl|Aclame:pro 305 EINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVD------IRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 378 (524) Q Consensus 305 EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~------~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn 378 (524) -||+.||. |. | ++.+.|++......+ ...+.-..-.+..| +++-+.+.. .+.... T Consensus 282 ~~d~~~l~--------G~-G----~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i----~~~~~~l~~--~~~~~~ 342 (458) T protein:vir:10 282 SIEEAFMT--------GD-G----SGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTI----SKLRRKLGR--HGLKLS 342 (458) T ss_pred HHHHHhhc--------CC-C----CCccceeeecccccccceeecccccccccccHHHH----HHHHHhhhh--hhcCCC Confidence 99999985 11 1 112334433221110 00000000012222 222222221 222456 Q ss_pred EEEEchhhhhhhhhhccccccc--chhhhcccccccccceeEEEecCcEEEEecCCCCc-----ceEEEEEecCCCccce Q lcl|Aclame:pro 379 FIIASRNVVSALARIDSGITPA--SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ-----DYFTVGFKGDNEMDAG 451 (524) Q Consensus 379 ~~v~S~~va~~L~~~~~g~~~~--s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~~~~ 451 (524) ..||+|.....|......-..+ .+... ....+.+ -++|.| ++|+++.+.|. +.++..++ + + T Consensus 343 ~~v~~~~~~~~l~~lkd~~G~~i~~~~~~-~~~~~~~----~~~l~G-~pv~~~~~~p~~~~~~~~~~~~f~-~-----~ 410 (458) T protein:vir:10 343 KLVLIVSMDAYYDLLEDEEWQDVAQVGND-SVKLQGQ----VGRIYG-LPVVVSEYFPAKANSAEFAVIVYK-D-----N 410 (458) T ss_pred EEEEcHHHHHHHHhhcccCCceeeccccc-cccccCc----Cceecc-eeeEEccccccccCCcceEEEEec-c-----c Confidence 7899999988886532211000 00000 0001111 136776 79999988654 22222221 1 0 Q ss_pred eEeecccccccccccCCccccceeeee--eeeccE-ecCcccccCCCccccccccchHHhhccc Q lcl|Aclame:pro 452 IYYAPYVALTPLRGSDPKNFQPVMGFK--TRYGIG-INPFANSRSQAPADRITSGMISKEMCGK 512 (524) Q Consensus 452 ~fyaPYv~~~~~~~~dp~s~qP~~~~~--tRY~l~-~nP~~~~~~~~~~~~i~~~~~~~~~a~~ 512 (524) .++... ..+....||-+-...++|. .|.|+. .+|=+. +.+.-. .. T Consensus 411 ~~~~~~--~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~----------v~~~~a----a~ 458 (458) T protein:vir:10 411 FVMPRQ--RAVTVERERQAGKQRDAYYVTQRVNLQRYFANGV----------VSGTYA----AS 458 (458) T ss_pred EEEEEe--eceEEEeecccCCCceEEEEEEEecceEecccce----------EEEeec----cC Confidence 111101 1111123554445556665 466543 344111 111111 11 No 56 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=85.10 E-value=0.055 Score=27.48 Aligned_cols=280 Identities=13% Similarity=0.032 Sum_probs=125.1 Q ss_pred cccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYS 159 (524) Q Consensus 81 ~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fS 159 (524) -.+++|.+ .-|.+. .+++.+-++.+-.++|.+.||++... +|+-.. ++. +| .| T Consensus 1 ma~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~-------~ip~~~--~~~-----~a---------~~- 54 (298) T protein:vir:16 1 MVLNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGE-------KVFTFT--MDS-----EI---------DV- 54 (298) T ss_pred CcccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEe--cCc-----ce---------EE- Confidence 12222322 223333 45555666788899999999875321 111100 000 00 00 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 160 GEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAEL 239 (524) Q Consensus 160 G~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEa 239 (524) .+| T Consensus 55 ----------------------------------------------------------------------------v~E- 57 (298) T protein:vir:16 55 ----------------------------------------------------------------------------VAE- 57 (298) T ss_pred ----------------------------------------------------------------------------ecC- Confidence 001 Q ss_pred ccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhee Q lcl|Aclame:pro 240 QENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQ 319 (524) Q Consensus 240 l~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~ 319 (524) +.++++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|+..|...|+..++.-.... . T Consensus 58 --------~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~-~ 127 (298) T protein:vir:16 58 --------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFHGVNPR-L 127 (298) T ss_pred --------CccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccCC-C Confidence 0123343445566666666666678899999876432 12556888999999999999998888521100 0 Q ss_pred eeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccc Q lcl|Aclame:pro 320 VGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 399 (524) Q Consensus 320 ~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~ 399 (524) -...++... .++-....... -.......++..|.++...+.. .+.+...+|++|.....|...... T Consensus 128 g~~~~~~~~----~~~~~~~~~~~-----~~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lkd~--- 193 (298) T protein:vir:16 128 GTASAVIGT----NHFDSKVTQKV-----EAPRGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQKDL--- 193 (298) T ss_pred Ccccccccc----ccccccccccc-----ccccccccHHHHHHHHHHHhhh--cCCCccEEEEcHHHHHHHHHhhcc--- Confidence 000000000 00000000000 0001112233344455444443 123556789999999888753211 Q ss_pred cchhhhcccccccccceeEEEecCcEEEEecCCCCc------ceEEEEEecCCCccceeEeeccc--ccccccccCCcc- Q lcl|Aclame:pro 400 ASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ------DYFTVGFKGDNEMDAGIYYAPYV--ALTPLRGSDPKN- 470 (524) Q Consensus 400 ~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~~~fyaPYv--~~~~~~~~dp~s- 470 (524) .+. .....+.+. .-.|+|.| ++|+++.+.+. +.+++|- - ..++.|..-- ++...+..|+++ T Consensus 194 -~G~--~i~~~~~~~-~~~~~l~G-~PV~~~~~v~~~~~~~~~~~~~GD---f--s~~~~~~~~~~~~~~~~~~~~~~~~ 263 (298) T protein:vir:16 194 -QDN--ALFPELKWG-ATPDTING-LPVDVNKTVSDMSLTQRDRAIIGD---F--ANGFKWGYAKEVPLEVIQYGDPDNS 263 (298) T ss_pred -CCC--eeecCcccC-CCCceecc-eeeEEecccccccCCCccEEEEee---c--cceEEEEEecCceEEEeeccCCcCc Confidence 110 000001111 11267888 59999887542 3444441 1 0111222111 122222234432 Q ss_pred ----cc-ceeee--eeeecc-EecCcccccCCCccccccccc Q lcl|Aclame:pro 471 ----FQ-PVMGF--KTRYGI-GINPFANSRSQAPADRITSGM 504 (524) Q Consensus 471 ----~q-P~~~~--~tRY~l-~~nP~~~~~~~~~~~~i~~~~ 504 (524) || =.++| ..|++. ..+| +...++.+.. T Consensus 264 ~~~~f~~~~v~~ra~~r~d~~v~~~-------~a~~~l~~at 298 (298) T protein:vir:16 264 GLDLKGYNQVYIRAELFLGWGILDA-------TKFARVTEAN 298 (298) T ss_pred chhhhhcCcEEEEEEEEEccEeecc-------cceEEEeecC Confidence 22 11333 557773 3344 1223444444 No 57 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=83.83 E-value=0.065 Score=27.08 Aligned_cols=336 Identities=15% Similarity=0.177 Sum_probs=120.6 Q ss_pred CCchH---HHHHHhh-----------Hhhcc----------------cccchhhcch-----hHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MSKKN---ELMEKWN-----------DLLES----------------QEGLPDIATK-----SKKQLVAAILEAQEKDAE 45 (524) Q Consensus 1 m~~~~---~l~~kw~-----------p~l~~----------------~~~~~~i~~~-----~~~~~~~~l~enq~~~~~ 45 (524) |.+.. .+..+|. .+||. ..+.+++.+. ..+.+..--|.+..+.+. T Consensus 238 l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a 317 (632) T protein:vir:96 238 IGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAA 317 (632) T ss_pred HHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhh Confidence 11111 1111110 00100 0001111111 001111000111111111 Q ss_pred ccccccchhhhhhhcccccccccccc--ccc--------CccccccccccccccccCchhh-hHHHHHHhhhhhhheeee Q lcl|Aclame:pro 46 TDPVYRDEKIVESFGGFLAEAEIAGD--HNY--------DQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGV 114 (524) Q Consensus 46 ~~~~~~~~~~~~~~~~~l~ea~~~g~--~~~--------~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GV 114 (524) ..+. ........+...+.+. .|. .|. .......+.++|...--...+- .++...-|..|...+ |+ T Consensus 318 ~~~~-~~a~~~~e~a~~~a~~--~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~ 393 (632) T protein:vir:96 318 TGDW-SKAGFEREVSLAIADA--SGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GA 393 (632) T ss_pred ccch-hhhhhhhHHHHHHHHh--hhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh-cc Confidence 1110 0000000000000000 000 000 0000000011111100000010 122222234444433 33 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) +.+++.+|- .+++... ++. T Consensus 394 ~~~~~~~g~-----~~ip~~~--~~~------------------------------------------------------ 412 (632) T protein:vir:96 394 RMLPGLVGD-----VDIPKKT--SGA------------------------------------------------------ 412 (632) T ss_pred eEeecCCcc-----eEEEEEe--CCc------------------------------------------------------ Confidence 333333221 0111100 000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .. .-.+| +...++-..+++++++.+|+=+-...+ T Consensus 413 -----------------------------~a--------~wv~E---------~~~~~~s~~~f~~i~l~~~k~~~~v~i 446 (632) T protein:vir:96 413 -----------------------------NF--------YWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPV 446 (632) T ss_pred -----------------------------ee--------EeecC---------CccccccccceeeEEeeeeEEEEehhh Confidence 00 00011 112444456777888888877778888 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccc----cccchH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI----RGARWA 350 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~----~~~~~~ 350 (524) |-||..| -.+|.|++|.+-|...|...+++.+|. |. | ....+.|++.......+ ....| T Consensus 447 S~ell~d----s~~~~~~~i~~~l~~a~~~~~d~a~l~--------G~-G---~~~~p~Gi~~~~~~~~~~~~~~~~~~- 509 (632) T protein:vir:96 447 TRKLRKQ----SSIHVENLIREDLIEGIGVALDLAMLT--------GT-G---LANDPVGLLNMTGVPALTYPAGGVDW- 509 (632) T ss_pred HHHHHhc----cchHHHHHHHHHHHHHHHHHHHHHhhc--------cc-C---CCCccceeeecccccceecccccCCH- Confidence 9998776 257899999999999999999999985 11 1 01123455443322111 01112 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeE--EEecCcEEEE Q lcl|Aclame:pro 351 GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFA--GVLGGTYKVY 428 (524) Q Consensus 351 ~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~--G~l~~~~~vy 428 (524) +.+..|...| ...-........||+|.....|... .. .|.++..-+ |+|.| |+|+ T Consensus 510 -~~i~~~~~~i-------~~~~~~~~~~~~~~~~~~~~~l~~~--~l------------~d~~G~~i~~~~~l~G-~pv~ 566 (632) T protein:vir:96 510 -ASVVDMETKI-------STFNADAGRLAYLTSVTQRGAAKKA--QV------------FDNTGERIWQNNEVNG-YRAE 566 (632) T ss_pred -HHHHHHHHHH-------hhcccccCccEEEEchhHHHHHHHH--hc------------cCCCCceeecCCeecc-cceE Confidence 2233333333 2221111234578898877666531 11 122221111 57776 7999 Q ss_pred ecCCCCcceEEEEEecCCCccceeEeecccccccccccCC----ccccceeeeeeeeccEe-cC--cccccCCC Q lcl|Aclame:pro 429 IDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDP----KNFQPVMGFKTRYGIGI-NP--FANSRSQA 495 (524) Q Consensus 429 ~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp----~s~qP~~~~~tRY~l~~-nP--~~~~~~~~ 495 (524) +.++.+.+-+++|--. -+|+.-+-.+. -.+|| .+-+=.+=...|+++.+ +| |......+ T Consensus 567 ~s~~ip~~~~~~gd~s------~~~i~~~~~~~--i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 567 ASNQIPADTWIFGDWS------QIVIAMWGVLD--LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred eccccccCcEEEeecc------eEEEEEecceE--EEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 9988877655544210 11111110000 01233 33333444566666532 33 22111111 No 58 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=82.00 E-value=0.081 Score=26.58 Aligned_cols=337 Identities=15% Similarity=0.076 Sum_probs=122.3 Q ss_pred CCch-HHHHHH----hh---Hhhcccccch-------hhcchhH-HHHHHHHHHHHHHHHhccc---------------- Q lcl|Aclame:pro 1 MSKK-NELMEK----WN---DLLESQEGLP-------DIATKSK-KQLVAAILEAQEKDAETDP---------------- 48 (524) Q Consensus 1 m~~~-~~l~~k----w~---p~l~~~~~~~-------~i~~~~~-~~~~~~l~enq~~~~~~~~---------------- 48 (524) |++. ++|.++ +. .+++..+ .. +++.+.+ .+-+.+..|.+.+++.... T Consensus 5 m~k~l~el~~~~~~~~~~~~~~~~~~~-~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (397) T protein:vir:12 5 MSKKEIALRQQFTEKKQQADKALQEGN-TDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQ 83 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccc Confidence 4332 122222 22 2222110 11 1111100 0000011111111111000 Q ss_pred cccchh----hhhhh-----cccccccccc-cccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeec Q lcl|Aclame:pro 49 VYRDEK----IVESF-----GGFLAEAEIA-GDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQP 116 (524) Q Consensus 49 ~~~~~~----~~~~~-----~~~l~ea~~~-g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQP 116 (524) ...++. ...+| +..+.+.+.. .............+++|.+.- |.-+ .+++.+.++.+-.++|.+.| T Consensus 84 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lv--P~~~~~~ii~~~~~~~~l~~~~~~~~ 161 (397) T protein:vir:12 84 GQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILI--PEDIGRQIHEFKRQFEPLEQYVTVEP 161 (397) T ss_pred cchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccC--chhHHHHHHHhhhhhhhHHhhcceee Confidence 000000 00111 1111111100 000000000001112222211 2221 35555566777889999999 Q ss_pred CCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 117 MTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQN 196 (524) Q Consensus 117 mTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~ 196 (524) |+++.|-+- +..+. +... +.|-+.+ T Consensus 162 ~~~~~~~~~-----~~~~~--~~~~--------------a~~v~Eg---------------------------------- 186 (397) T protein:vir:12 162 VTTRSGTRL-----LEKNA--DMVP--------------FSPVEEL---------------------------------- 186 (397) T ss_pred ccCCceeEE-----EEEec--CCcc--------------eeeeccc---------------------------------- Confidence 998877432 11110 0000 0000000 Q ss_pred ccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccH Q lcl|Aclame:pro 197 VTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSV 276 (524) Q Consensus 197 ~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ 276 (524) +..| .++...|.++.|+..|..+- ..+|- T Consensus 187 ---------~~~~-----------------------------------~~~~~~~~~v~~~~~k~~~~-------~~is~ 215 (397) T protein:vir:12 187 ---------GNLP-----------------------------------EIDQPRFTKVSYSIIDYGGI-------MTLSN 215 (397) T ss_pred ---------cccc-----------------------------------ccccccceeEEeeheeeEee-------ehhhH Confidence 0000 00112355555666555554 55999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHH Q lcl|Aclame:pro 277 ELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKA 356 (524) Q Consensus 277 ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~ 356 (524) ||.+|-- +|.++.|.+.|...|...+|..|+.-.- .+.+.|+..+++ ... T Consensus 216 e~l~ds~----~~l~~~i~~~l~~~~~~~~d~~il~G~g-------------~~~~~g~~~~~~-------------i~~ 265 (397) T protein:vir:12 216 SMLNDSD----QAIMTYVAKWFAKKSVVTRNNLILAAIA-------------SLKKVDIDGLDG-------------IKK 265 (397) T ss_pred HHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHhccc-------------cccccccccHHH-------------HHH Confidence 9998854 5678999999999999999999885211 112335433211 122 Q ss_pred -HHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCc Q lcl|Aclame:pro 357 -LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ 435 (524) Q Consensus 357 -L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 435 (524) ++..++ ..+..+..+||+|.....|..+..+-..+ ....+.+. ..-++|.| ++|++.+.... T Consensus 266 ~~~~~l~---------~~~~~~a~~~~n~~~~~~L~~lkd~~G~~------l~~~~~~~-g~~~~l~G-~pv~~~~~~~~ 328 (397) T protein:vir:12 266 ALNVTLD---------PMVAPGSIVLTNQDGYDWLDTLKDGTGRY------LLQPDPTN-PTKKLLDG-RPVVPFTNRVL 328 (397) T ss_pred HHhhccc---------hhhhCCCEEEEcHHHHHHHHHhhccCCce------eecccccC-CCCccccc-eeeEEeccccc Confidence 222222 12234567899999988887542211100 00011111 11256777 58876443211 Q ss_pred -----ce-EEEEEecCCCccceeEeecccccccccc----cCCccccceeeeeeeeccEe-cCcccccCCCccccccccc Q lcl|Aclame:pro 436 -----DY-FTVGFKGDNEMDAGIYYAPYVALTPLRG----SDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGM 504 (524) Q Consensus 436 -----dy-~~vG~KG~~~~~~~~fyaPYv~~~~~~~----~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~ 504 (524) +. +++|-- .....++.=-.+..... .+-.+-+-.+-...|++..+ ||=+... . T Consensus 329 ~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~--------~--- 392 (397) T protein:vir:12 329 KTQKGKAPLIIGNL-----KEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVF--------G--- 392 (397) T ss_pred ccCCCccEEEEEeh-----hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEE--------E--- Confidence 11 222210 00000000000000000 01112334555666666533 3311100 0 Q ss_pred hHHhhccchhhhhhhhcc Q lcl|Aclame:pro 505 ISKEMCGKNAYFRKVWVK 522 (524) Q Consensus 505 ~~~~~a~~~~~~~~~~V~ 522 (524) .+-+| T Consensus 393 -------------~~t~~ 397 (397) T protein:vir:12 393 -------------QITVE 397 (397) T ss_pred -------------EEeeC Confidence 00000 No 59 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=81.15 E-value=0.088 Score=26.36 Aligned_cols=371 Identities=12% Similarity=0.024 Sum_probs=131.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHH---HHHHhccccccchhhhhhhcc-----cc--c------ Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQ---EKDAETDPVYRDEKIVESFGG-----FL--A------ 64 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq---~~~~~~~~~~~~~~~~~~~~~-----~l--~------ 64 (524) ..+.+++.+++..++... .++.....+.-...+.+.. .+..+.++..+.......... .. . T Consensus 53 ~~~~~~~~~~~~~~~a~~---~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (497) T protein:vir:10 53 HERAQEMLKSLGGADAAK---DGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT 129 (497) T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Confidence 111112222333332210 1110000000000000000 000000000000000000000 00 0 Q ss_pred ---ccccccccccCc-----cccccccccccc---cccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecC Q lcl|Aclame:pro 65 ---EAEIAGDHNYDQ-----TNIASGKSSGAI---TNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGK 133 (524) Q Consensus 65 ---ea~~~g~~~~~~-----~~~~~st~sg~v---~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~ 133 (524) |.......+... .+...+++++.. ..+.+-+ ++..-+..+..+++.+.||+++.. .|.- T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~i---i~~~~~~~~i~~l~~~~~~~~~~~-------~~~~ 199 (497) T protein:vir:10 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGI---VEQLFYELSLADLISSRPVTSPNL-------SYLT 199 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHH---HHHHHhhhhHHhhccccccCCCce-------EEEE Confidence 000000000000 000111222222 1222233 333344566678888888876531 1111 Q ss_pred CCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccc Q lcl|Aclame:pro 134 DPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALD 213 (524) Q Consensus 134 ~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~ 213 (524) .. +++ .+ +. T Consensus 200 ~~--~~~----~~---------a~-------------------------------------------------------- 208 (497) T protein:vir:10 200 ES--AAH----NN---------AA-------------------------------------------------------- 208 (497) T ss_pred Ec--CCC----Cc---------ce-------------------------------------------------------- Confidence 10 000 00 00 Q ss_pred ccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHH Q lcl|Aclame:pro 214 AAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAE 293 (524) Q Consensus 214 ~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaE 293 (524) ..+| +..++|...+++++++.+|.-+-...+|-||++|-- +.++. T Consensus 209 ---------------------wv~E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~ 253 (497) T protein:vir:10 209 ---------------------AVAE---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNF 253 (497) T ss_pred ---------------------eecc---------CcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHH Confidence 0011 112344555677888888887778899999999942 25899 Q ss_pred HHHHHHHHHHHHhhHHHHh---------hhhhheeeeeec-ccccc--CccceeecccccccccccchHH-----HHH-- Q lcl|Aclame:pro 294 LSAILATEIMLEINREIVD---------LINYTAQVGKSG-FTQTV--GSKAGSFDFQDPVDIRGARWAG-----ESY-- 354 (524) Q Consensus 294 LsnILStEI~~EINreii~---------~i~~~a~~~~~g-~~~~~--~~~~G~fdl~~~~~~~~~~~~~-----e~~-- 354 (524) |.+-|+..|..-+|..||. +++...-..... ..... ....+..++... ....|.+ ... T Consensus 254 i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 330 (497) T protein:vir:10 254 VQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD---GTNGAFVGQDTVASLKY 330 (497) T ss_pred HHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc---cccchhhhhhHHHHHHH Confidence 9999999999999999985 011110000000 00000 000000000000 0000111 000 Q ss_pred ---------------------HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccc Q lcl|Aclame:pro 355 ---------------------KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTT 413 (524) Q Consensus 355 ---------------------r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~ 413 (524) ..+...+...-..+.+...+ .++.+|.+|....+|.....+-..+-- .......... T Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~-~~~~~~~~~~ 408 (497) T protein:vir:10 331 GRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMG-GNFFGNAYGN 408 (497) T ss_pred HHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceec-cCcccccccc Confidence 11222233333444444444 567788899887777653222111100 0000000000 Q ss_pred cceeEEEecCcEEEEecCCCCcceEEEEEecCC------CccceeEeecccccccccccCCccccceeeeeeeecc-Eec Q lcl|Aclame:pro 414 KAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDN------EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGI-GIN 486 (524) Q Consensus 414 ~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~------~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l-~~n 486 (524) ....-++|.| ++|++.+..+.+=+++|--... ..+-.+-..||.. .+=.+-+=.+=+..|+++ +.+ T Consensus 409 ~~~~~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~n~v~~r~~~r~~~~v~~ 481 (497) T protein:vir:10 409 PVNGGKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVDGKVTVRAEERLGLLVYR 481 (497) T ss_pred cccCCceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhcCcEEEEEEEeecceeec Confidence 0001135666 7999988887655555422110 0011111222210 011222334445678876 567 Q ss_pred CcccccCCCccccccccchHHhhccc Q lcl|Aclame:pro 487 PFANSRSQAPADRITSGMISKEMCGK 512 (524) Q Consensus 487 P~~~~~~~~~~~~i~~~~~~~~~a~~ 512 (524) |=+.-.-+-. ....+. T Consensus 482 p~A~~~l~~~----------~~~~~~ 497 (497) T protein:vir:10 482 PSAFQLIQLK----------KGATGS 497 (497) T ss_pred cccEEEEEec----------CCccCC Confidence 7322111000 000111 No 60 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=81.15 E-value=0.088 Score=26.36 Aligned_cols=371 Identities=12% Similarity=0.024 Sum_probs=131.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHH---HHHHhccccccchhhhhhhcc-----cc--c------ Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQ---EKDAETDPVYRDEKIVESFGG-----FL--A------ 64 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq---~~~~~~~~~~~~~~~~~~~~~-----~l--~------ 64 (524) ..+.+++.+++..++... .++.....+.-...+.+.. .+..+.++..+.......... .. . T Consensus 53 ~~~~~~~~~~~~~~~a~~---~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (497) T protein:vir:78 53 HERAQEMLKSLGGADAAK---DGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT 129 (497) T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Confidence 111112222333332210 1110000000000000000 000000000000000000000 00 0 Q ss_pred ---ccccccccccCc-----cccccccccccc---cccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecC Q lcl|Aclame:pro 65 ---EAEIAGDHNYDQ-----TNIASGKSSGAI---TNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGK 133 (524) Q Consensus 65 ---ea~~~g~~~~~~-----~~~~~st~sg~v---~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~ 133 (524) |.......+... .+...+++++.. ..+.+-+ ++..-+..+..+++.+.||+++.. .|.- T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~i---i~~~~~~~~i~~l~~~~~~~~~~~-------~~~~ 199 (497) T protein:vir:78 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGI---VEQLFYELSLADLISSRPVTSPNL-------SYLT 199 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHH---HHHHHhhhhHHhhccccccCCCce-------EEEE Confidence 000000000000 000111222222 1222233 333344566678888888876531 1111 Q ss_pred CCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccc Q lcl|Aclame:pro 134 DPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALD 213 (524) Q Consensus 134 ~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~ 213 (524) .. +++ .+ +. T Consensus 200 ~~--~~~----~~---------a~-------------------------------------------------------- 208 (497) T protein:vir:78 200 ES--AAH----NN---------AA-------------------------------------------------------- 208 (497) T ss_pred Ec--CCC----Cc---------ce-------------------------------------------------------- Confidence 10 000 00 00 Q ss_pred ccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHH Q lcl|Aclame:pro 214 AAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAE 293 (524) Q Consensus 214 ~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaE 293 (524) ..+| +..++|...+++++++.+|.-+-...+|-||++|-- +.++. T Consensus 209 ---------------------wv~E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~ 253 (497) T protein:vir:78 209 ---------------------AVAE---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNF 253 (497) T ss_pred ---------------------eecc---------CcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHH Confidence 0011 112344555677888888887778899999999942 25899 Q ss_pred HHHHHHHHHHHHhhHHHHh---------hhhhheeeeeec-ccccc--CccceeecccccccccccchHH-----HHH-- Q lcl|Aclame:pro 294 LSAILATEIMLEINREIVD---------LINYTAQVGKSG-FTQTV--GSKAGSFDFQDPVDIRGARWAG-----ESY-- 354 (524) Q Consensus 294 LsnILStEI~~EINreii~---------~i~~~a~~~~~g-~~~~~--~~~~G~fdl~~~~~~~~~~~~~-----e~~-- 354 (524) |.+-|+..|..-+|..||. +++...-..... ..... ....+..++... ....|.+ ... T Consensus 254 i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 330 (497) T protein:vir:78 254 VQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD---GTNGAFVGQDTVASLKY 330 (497) T ss_pred HHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc---cccchhhhhhHHHHHHH Confidence 9999999999999999985 011110000000 00000 000000000000 0000111 000 Q ss_pred ---------------------HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccc Q lcl|Aclame:pro 355 ---------------------KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTT 413 (524) Q Consensus 355 ---------------------r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~ 413 (524) ..+...+...-..+.+...+ .++.+|.+|....+|.....+-..+-- .......... T Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~-~~~~~~~~~~ 408 (497) T protein:vir:78 331 GRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMG-GNFFGNAYGN 408 (497) T ss_pred HHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceec-cCcccccccc Confidence 11222233333444444444 567788899887777653222111100 0000000000 Q ss_pred cceeEEEecCcEEEEecCCCCcceEEEEEecCC------CccceeEeecccccccccccCCccccceeeeeeeecc-Eec Q lcl|Aclame:pro 414 KAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDN------EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGI-GIN 486 (524) Q Consensus 414 ~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~------~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l-~~n 486 (524) ....-++|.| ++|++.+..+.+=+++|--... ..+-.+-..||.. .+=.+-+=.+=+..|+++ +.+ T Consensus 409 ~~~~~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~n~v~~r~~~r~~~~v~~ 481 (497) T protein:vir:78 409 PVNGGKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVDGKVTVRAEERLGLLVYR 481 (497) T ss_pred cccCCceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhcCcEEEEEEEeecceeec Confidence 0001135666 7999988887655555422110 0011111222210 011222334445678876 567 Q ss_pred CcccccCCCccccccccchHHhhccc Q lcl|Aclame:pro 487 PFANSRSQAPADRITSGMISKEMCGK 512 (524) Q Consensus 487 P~~~~~~~~~~~~i~~~~~~~~~a~~ 512 (524) |=+.-.-+-. ....+. T Consensus 482 p~A~~~l~~~----------~~~~~~ 497 (497) T protein:vir:78 482 PSAFQLIQLK----------KGATGS 497 (497) T ss_pred cccEEEEEec----------CCccCC Confidence 7322111000 000111 No 61 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=80.05 E-value=0.098 Score=26.10 Aligned_cols=338 Identities=15% Similarity=0.141 Sum_probs=117.8 Q ss_pred CCchHHHHHHhhHhhccccc---------------chhhcchhH--HHHHHHH--HHHHHHHHhccc---------c--- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG---------------LPDIATKSK--KQLVAAI--LEAQEKDAETDP---------V--- 49 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~---------------~~~i~~~~~--~~~~~~l--~enq~~~~~~~~---------~--- 49 (524) |.+.++|+++|..+.+.-+. .++|....+ ..+.+++ |+.|.+.+..+. . T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99998898888776542100 012221111 0111111 222222221100 0 Q ss_pred -ccchhhhhhhcccccccccccc----cccCccccccccccccccccCchhhh------HHHHHHhhhhhhheeeeecCC Q lcl|Aclame:pro 50 -YRDEKIVESFGGFLAEAEIAGD----HNYDQTNIASGKSSGAITNIGPAVIG------MVRRAIPNLIAFDICGVQPMT 118 (524) Q Consensus 50 -~~~~~~~~~~~~~l~ea~~~g~----~~~~~~~~~~st~sg~v~~~~P~li~------l~Rra~~nLIa~DI~GVQPmT 118 (524) -..++...++..++.... .+. .....-....+-+++.-+.+ ..||+ ++++....-.-.+++.|.|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 158 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSGG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIK 158 (387) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHHHHHHHhhhccCCCCCC-ceeechhHHHHHHHHHHhhchhhhhceeeecC Confidence 001111111111110000 000 00000000000001110111 11222 222233333446777777765 Q ss_pred chhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 GPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVT 198 (524) Q Consensus 119 gPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~ 198 (524) +.+.- |-.+... ...| T Consensus 159 ~~~~p----~~~~~~~--------------------~a~~---------------------------------------- 174 (387) T protein:vir:26 159 GLEIP----RVSYTLD--------------------DDDF---------------------------------------- 174 (387) T ss_pred Cceee----eeeccCC--------------------cccc---------------------------------------- Confidence 42210 0000000 0000 Q ss_pred ccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHH Q lcl|Aclame:pro 199 SGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVEL 278 (524) Q Consensus 199 ~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~EL 278 (524) .+| +...++...++++++..+|.-+-...+|-|| T Consensus 175 -------------------------------------v~E---------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~el 208 (387) T protein:vir:26 175 -------------------------------------ITD---------VETAKELKAKGDTVKFTTNKFKVFAAISDTV 208 (387) T ss_pred -------------------------------------ccc---------cccccccccccceeeechheeeeechhhHHH Confidence 001 0011222223344455555555567899999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHH Q lcl|Aclame:pro 279 AQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALL 358 (524) Q Consensus 279 AQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~ 358 (524) .+|- ..|.+++|.+-|+..|..-.|..++-.-+-+.+ +.|++.=.....+. +. .++ T Consensus 209 l~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~------------~~g~~~~~~~~~~~-~~-------~~~ 264 (387) T protein:vir:26 209 IHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSGL------------EHMSFYNGSVKEVE-GA-------DMY 264 (387) T ss_pred Hhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccc------------cceeeecccccccc-cc-------chH Confidence 9985 355688999999888877656655522111111 11221110000000 11 112 Q ss_pred HHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE Q lcl|Aclame:pro 359 IQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF 438 (524) Q Consensus 359 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 438 (524) -.|..+-+.+...= +..+.|++-+...+.++...+.+ ++ .+ .+.. -++|.| ++||+..+++. + T Consensus 265 d~i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~~~-----~~---~~-~~~~----~~~llG-~PV~~~~~~~~--~ 327 (387) T protein:vir:26 265 DAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNG-----TT---NF-FDTP----AEKVFG-KPVVFTDAAVK--P 327 (387) T ss_pred HHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcC-----CC---cc-cccC----Cccccc-cceEEecCCCc--e Confidence 23333333333321 23555554333334444332111 00 00 0111 135776 59998877653 3 Q ss_pred EEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhh Q lcl|Aclame:pro 439 TVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFR 517 (524) Q Consensus 439 ~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~ 517 (524) ++| +- +-||.=|......+..+..+.+-.+-...||+..+ +| .-|+ T Consensus 328 ~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~--------------------------~A~~ 374 (387) T protein:vir:26 328 IVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD--------------------------SAFR 374 (387) T ss_pred eee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech--------------------------hheE Confidence 343 11 11222121111111123333333333344665432 23 1111 Q ss_pred hhhcccC Q lcl|Aclame:pro 518 KVWVKGL 524 (524) Q Consensus 518 ~~~V~~~ 524 (524) .+.||-= T Consensus 375 ~l~~ka~ 381 (387) T protein:vir:26 375 IAKAKEN 381 (387) T ss_pred EEEeecC Confidence 2222111 No 62 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=80.05 E-value=0.098 Score=26.10 Aligned_cols=338 Identities=15% Similarity=0.141 Sum_probs=117.8 Q ss_pred CCchHHHHHHhhHhhccccc---------------chhhcchhH--HHHHHHH--HHHHHHHHhccc---------c--- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG---------------LPDIATKSK--KQLVAAI--LEAQEKDAETDP---------V--- 49 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~---------------~~~i~~~~~--~~~~~~l--~enq~~~~~~~~---------~--- 49 (524) |.+.++|+++|..+.+.-+. .++|....+ ..+.+++ |+.|.+.+..+. . T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99998898888776542100 012221111 0111111 222222221100 0 Q ss_pred -ccchhhhhhhcccccccccccc----cccCccccccccccccccccCchhhh------HHHHHHhhhhhhheeeeecCC Q lcl|Aclame:pro 50 -YRDEKIVESFGGFLAEAEIAGD----HNYDQTNIASGKSSGAITNIGPAVIG------MVRRAIPNLIAFDICGVQPMT 118 (524) Q Consensus 50 -~~~~~~~~~~~~~l~ea~~~g~----~~~~~~~~~~st~sg~v~~~~P~li~------l~Rra~~nLIa~DI~GVQPmT 118 (524) -..++...++..++.... .+. .....-....+-+++.-+.+ ..||+ ++++....-.-.+++.|.|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 158 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSGG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIK 158 (387) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHHHHHHHhhhccCCCCCC-ceeechhHHHHHHHHHHhhchhhhhceeeecC Confidence 001111111111110000 000 00000000000001110111 11222 222233333446777777765 Q ss_pred chhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 GPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVT 198 (524) Q Consensus 119 gPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~ 198 (524) +.+.- |-.+... ...| T Consensus 159 ~~~~p----~~~~~~~--------------------~a~~---------------------------------------- 174 (387) T protein:vir:94 159 GLEIP----RVSYTLD--------------------DDDF---------------------------------------- 174 (387) T ss_pred Cceee----eeeccCC--------------------cccc---------------------------------------- Confidence 42210 0000000 0000 Q ss_pred ccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHH Q lcl|Aclame:pro 199 SGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVEL 278 (524) Q Consensus 199 ~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~EL 278 (524) .+| +...++...++++++..+|.-+-...+|-|| T Consensus 175 -------------------------------------v~E---------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~el 208 (387) T protein:vir:94 175 -------------------------------------ITD---------VETAKELKAKGDTVKFTTNKFKVFAAISDTV 208 (387) T ss_pred -------------------------------------ccc---------cccccccccccceeeechheeeeechhhHHH Confidence 001 0011222223344455555555567899999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHH Q lcl|Aclame:pro 279 AQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALL 358 (524) Q Consensus 279 AQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~ 358 (524) .+|- ..|.+++|.+-|+..|..-.|..++-.-+-+.+ +.|++.=.....+. +. .++ T Consensus 209 l~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~------------~~g~~~~~~~~~~~-~~-------~~~ 264 (387) T protein:vir:94 209 IHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSGL------------EHMSFYNGSVKEVE-GA-------DMY 264 (387) T ss_pred Hhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccc------------cceeeecccccccc-cc-------chH Confidence 9985 355688999999888877656655522111111 11221110000000 11 112 Q ss_pred HHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE Q lcl|Aclame:pro 359 IQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF 438 (524) Q Consensus 359 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 438 (524) -.|..+-+.+...= +..+.|++-+...+.++...+.+ ++ .+ .+.. -++|.| ++||+..+++. + T Consensus 265 d~i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~~~-----~~---~~-~~~~----~~~llG-~PV~~~~~~~~--~ 327 (387) T protein:vir:94 265 DAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNG-----TT---NF-FDTP----AEKVFG-KPVVFTDAAVK--P 327 (387) T ss_pred HHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcC-----CC---cc-cccC----Cccccc-cceEEecCCCc--e Confidence 23333333333321 23555554333334444332111 00 00 0111 135776 59998877653 3 Q ss_pred EEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhh Q lcl|Aclame:pro 439 TVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFR 517 (524) Q Consensus 439 ~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~ 517 (524) ++| +- +-||.=|......+..+..+.+-.+-...||+..+ +| .-|+ T Consensus 328 ~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~--------------------------~A~~ 374 (387) T protein:vir:94 328 IVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD--------------------------SAFR 374 (387) T ss_pred eee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech--------------------------hheE Confidence 343 11 11222121111111123333333333344665432 23 1111 Q ss_pred hhhcccC Q lcl|Aclame:pro 518 KVWVKGL 524 (524) Q Consensus 518 ~~~V~~~ 524 (524) .+.||-= T Consensus 375 ~l~~ka~ 381 (387) T protein:vir:94 375 IAKAKEN 381 (387) T ss_pred EEEeecC Confidence 2222111 No 63 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=80.05 E-value=0.098 Score=26.10 Aligned_cols=338 Identities=15% Similarity=0.141 Sum_probs=117.8 Q ss_pred CCchHHHHHHhhHhhccccc---------------chhhcchhH--HHHHHHH--HHHHHHHHhccc---------c--- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG---------------LPDIATKSK--KQLVAAI--LEAQEKDAETDP---------V--- 49 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~---------------~~~i~~~~~--~~~~~~l--~enq~~~~~~~~---------~--- 49 (524) |.+.++|+++|..+.+.-+. .++|....+ ..+.+++ |+.|.+.+..+. . T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99998898888776542100 012221111 0111111 222222221100 0 Q ss_pred -ccchhhhhhhcccccccccccc----cccCccccccccccccccccCchhhh------HHHHHHhhhhhhheeeeecCC Q lcl|Aclame:pro 50 -YRDEKIVESFGGFLAEAEIAGD----HNYDQTNIASGKSSGAITNIGPAVIG------MVRRAIPNLIAFDICGVQPMT 118 (524) Q Consensus 50 -~~~~~~~~~~~~~l~ea~~~g~----~~~~~~~~~~st~sg~v~~~~P~li~------l~Rra~~nLIa~DI~GVQPmT 118 (524) -..++...++..++.... .+. .....-....+-+++.-+.+ ..||+ ++++....-.-.+++.|.|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 158 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSGG-DKLLPKTLSKEIVSEPFAKNQLREKARLTNIK 158 (387) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHHHHHHHhhhccCCCCCC-ceeechhHHHHHHHHHHhhchhhhhceeeecC Confidence 001111111111110000 000 00000000000001110111 11222 222233333446777777765 Q ss_pred chhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 GPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVT 198 (524) Q Consensus 119 gPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~ 198 (524) +.+.- |-.+... ...| T Consensus 159 ~~~~p----~~~~~~~--------------------~a~~---------------------------------------- 174 (387) T protein:vir:96 159 GLEIP----RVSYTLD--------------------DDDF---------------------------------------- 174 (387) T ss_pred Cceee----eeeccCC--------------------cccc---------------------------------------- Confidence 42210 0000000 0000 Q ss_pred ccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHH Q lcl|Aclame:pro 199 SGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVEL 278 (524) Q Consensus 199 ~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~EL 278 (524) .+| +...++...++++++..+|.-+-...+|-|| T Consensus 175 -------------------------------------v~E---------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~el 208 (387) T protein:vir:96 175 -------------------------------------ITD---------VETAKELKAKGDTVKFTTNKFKVFAAISDTV 208 (387) T ss_pred -------------------------------------ccc---------cccccccccccceeeechheeeeechhhHHH Confidence 001 0011222223344455555555567899999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHH Q lcl|Aclame:pro 279 AQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALL 358 (524) Q Consensus 279 AQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~ 358 (524) .+|- ..|.+++|.+-|+..|..-.|..++-.-+-+.+ +.|++.=.....+. +. .++ T Consensus 209 l~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~------------~~g~~~~~~~~~~~-~~-------~~~ 264 (387) T protein:vir:96 209 IHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSGL------------EHMSFYNGSVKEVE-GA-------DMY 264 (387) T ss_pred Hhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccc------------cceeeecccccccc-cc-------chH Confidence 9985 355688999999888877656655522111111 11221110000000 11 112 Q ss_pred HHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE Q lcl|Aclame:pro 359 IQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF 438 (524) Q Consensus 359 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 438 (524) -.|..+-+.+...= +..+.|++-+...+.++...+.+ ++ .+ .+.. -++|.| ++||+..+++. + T Consensus 265 d~i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~~~-----~~---~~-~~~~----~~~llG-~PV~~~~~~~~--~ 327 (387) T protein:vir:96 265 DAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNG-----TT---NF-FDTP----AEKVFG-KPVVFTDAAVK--P 327 (387) T ss_pred HHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcC-----CC---cc-cccC----Cccccc-cceEEecCCCc--e Confidence 23333333333321 23555554333334444332111 00 00 0111 135776 59998877653 3 Q ss_pred EEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhh Q lcl|Aclame:pro 439 TVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFR 517 (524) Q Consensus 439 ~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~ 517 (524) ++| +- +-||.=|......+..+..+.+-.+-...||+..+ +| .-|+ T Consensus 328 ~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~--------------------------~A~~ 374 (387) T protein:vir:96 328 IVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD--------------------------SAFR 374 (387) T ss_pred eee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech--------------------------hheE Confidence 343 11 11222121111111123333333333344665432 23 1111 Q ss_pred hhhcccC Q lcl|Aclame:pro 518 KVWVKGL 524 (524) Q Consensus 518 ~~~V~~~ 524 (524) .+.||-= T Consensus 375 ~l~~ka~ 381 (387) T protein:vir:96 375 IAKAKEN 381 (387) T ss_pred EEEeecC Confidence 2222111 No 64 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=79.98 E-value=0.099 Score=26.09 Aligned_cols=280 Identities=13% Similarity=0.080 Sum_probs=126.6 Q ss_pred ccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhh Q lcl|Aclame:pro 69 AGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREA 147 (524) Q Consensus 69 ~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEA 147 (524) +=-...++.+.. .++.|.. ..-+.+. .+++++.++.+..++|-+-||++.+- +|.-.. ++. + T Consensus 1 ma~~~~~~~~~~-~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~ip~~~--~~~-----~- 63 (304) T protein:vir:10 1 MATPTYTPGNVI-LSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-------KFTYLA--KGV-----G- 63 (304) T ss_pred Cccccccccccc-ccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEe--CCc-----c- Confidence 111112222221 1122221 1222232 56666667777888888888876432 111110 000 0 Q ss_pred hccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccc Q lcl|Aclame:pro 148 FHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAE 227 (524) Q Consensus 148 f~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~ 227 (524) +.| T Consensus 64 --------a~~--------------------------------------------------------------------- 66 (304) T protein:vir:10 64 --------AYW--------------------------------------------------------------------- 66 (304) T ss_pred --------eEE--------------------------------------------------------------------- Confidence 000 Q ss_pred ccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 228 ISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEIN 307 (524) Q Consensus 228 ~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EIN 307 (524) .+| +.++++-.-+++++++..|..+-...+|-||.+|- .+|.++.|.+-|...|...|| T Consensus 67 --------v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d 125 (304) T protein:vir:10 67 --------VSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFD 125 (304) T ss_pred --------eec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHH Confidence 001 01133333455667777777777888999999875 477899999999999999999 Q ss_pred HHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhh Q lcl|Aclame:pro 308 REIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVV 387 (524) Q Consensus 308 reii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va 387 (524) +.++.=--...-. + ....+++.-..... . ........+..|+++.+.+... +.....+||+|... T Consensus 126 ~~~l~G~g~~~~~---~-----~~~~~~~~~~~~~~----~-~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~ 190 (304) T protein:vir:10 126 QAVIFGTKSPYNT---S-----TSGKPLVEGAEEKG----N-VVTDTNNLYVDLSALMATIEDE--ELDPNGVLTTRSFR 190 (304) T ss_pred hhheeccCCCccc---c-----cccccccccccccc----c-ccccccchHHHHHHHHHHhhhc--cCCcCEEEEcHHHH Confidence 9998511000000 0 00011111000000 0 0000112233455555555442 23456789999999 Q ss_pred hhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc------------eEEEEEecCCCccceeEee Q lcl|Aclame:pro 388 SALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD------------YFTVGFKGDNEMDAGIYYA 455 (524) Q Consensus 388 ~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~~~~~fya 455 (524) ..|..... ..+. -... . ..|+|.| ++||++++.+.+ ++++|..+..+.+ T Consensus 191 ~~L~~lkd----~~G~--~l~~--~----~~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------ 251 (304) T protein:vir:10 191 SKMRNALD----ANDR--PLFD--A----NGNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA------ 251 (304) T ss_pred HHHHHhhc----cCCc--Eeec--C----CCccccc-eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE------ Confidence 88875321 1110 0000 0 1256776 699988886542 1233333222110 Q ss_pred cccccc--cccccCCc-----ccc---ceeeeeeeeccEe-cCcccccCCCccccccccc Q lcl|Aclame:pro 456 PYVALT--PLRGSDPK-----NFQ---PVMGFKTRYGIGI-NPFANSRSQAPADRITSGM 504 (524) Q Consensus 456 PYv~~~--~~~~~dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~ 504 (524) ...+.. +....|++ -|+ =.+=+..||++.+ || ....++...+ T Consensus 252 ~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~-------~a~~~l~~a~ 304 (304) T protein:vir:10 252 ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP-------EAFATLKPTE 304 (304) T ss_pred EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc-------cceEEEEecC Confidence 000111 11112222 122 2333456787654 23 1222344333 No 65 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=79.98 E-value=0.099 Score=26.09 Aligned_cols=280 Identities=13% Similarity=0.080 Sum_probs=126.6 Q ss_pred ccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhh Q lcl|Aclame:pro 69 AGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREA 147 (524) Q Consensus 69 ~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEA 147 (524) +=-...++.+.. .++.|.. ..-+.+. .+++++.++.+..++|-+-||++.+- +|.-.. ++. + T Consensus 1 ma~~~~~~~~~~-~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~ip~~~--~~~-----~- 63 (304) T protein:vir:94 1 MATPTYTPGNVI-LSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-------KFTYLA--KGV-----G- 63 (304) T ss_pred Cccccccccccc-ccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEe--CCc-----c- Confidence 111112222221 1122221 1222232 56666667777888888888876432 111110 000 0 Q ss_pred hccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccc Q lcl|Aclame:pro 148 FHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAE 227 (524) Q Consensus 148 f~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~ 227 (524) +.| T Consensus 64 --------a~~--------------------------------------------------------------------- 66 (304) T protein:vir:94 64 --------AYW--------------------------------------------------------------------- 66 (304) T ss_pred --------eEE--------------------------------------------------------------------- Confidence 000 Q ss_pred ccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 228 ISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEIN 307 (524) Q Consensus 228 ~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EIN 307 (524) .+| +.++++-.-+++++++..|..+-...+|-||.+|- .+|.++.|.+-|...|...|| T Consensus 67 --------v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d 125 (304) T protein:vir:94 67 --------VSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFD 125 (304) T ss_pred --------eec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHH Confidence 001 01133333455667777777777888999999875 477899999999999999999 Q ss_pred HHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhh Q lcl|Aclame:pro 308 REIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVV 387 (524) Q Consensus 308 reii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va 387 (524) +.++.=--...-. + ....+++.-..... . ........+..|+++.+.+... +.....+||+|... T Consensus 126 ~~~l~G~g~~~~~---~-----~~~~~~~~~~~~~~----~-~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~ 190 (304) T protein:vir:94 126 QAVIFGTKSPYNT---S-----TSGKPLVEGAEEKG----N-VVTDTNNLYVDLSALMATIEDE--ELDPNGVLTTRSFR 190 (304) T ss_pred hhheeccCCCccc---c-----cccccccccccccc----c-ccccccchHHHHHHHHHHhhhc--cCCcCEEEEcHHHH Confidence 9998511000000 0 00011111000000 0 0000112233455555555442 23456789999999 Q ss_pred hhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc------------eEEEEEecCCCccceeEee Q lcl|Aclame:pro 388 SALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD------------YFTVGFKGDNEMDAGIYYA 455 (524) Q Consensus 388 ~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~~~~~fya 455 (524) ..|..... ..+. -... . ..|+|.| ++||++++.+.+ ++++|..+..+.+ T Consensus 191 ~~L~~lkd----~~G~--~l~~--~----~~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------ 251 (304) T protein:vir:94 191 SKMRNALD----ANDR--PLFD--A----NGNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA------ 251 (304) T ss_pred HHHHHhhc----cCCc--Eeec--C----CCccccc-eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE------ Confidence 88875321 1110 0000 0 1256776 699988886542 1233333222110 Q ss_pred cccccc--cccccCCc-----ccc---ceeeeeeeeccEe-cCcccccCCCccccccccc Q lcl|Aclame:pro 456 PYVALT--PLRGSDPK-----NFQ---PVMGFKTRYGIGI-NPFANSRSQAPADRITSGM 504 (524) Q Consensus 456 PYv~~~--~~~~~dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~ 504 (524) ...+.. +....|++ -|+ =.+=+..||++.+ || ....++...+ T Consensus 252 ~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~-------~a~~~l~~a~ 304 (304) T protein:vir:94 252 ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP-------EAFATLKPTE 304 (304) T ss_pred EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc-------cceEEEEecC Confidence 000111 11112222 122 2333456787654 23 1222344333 No 66 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=77.91 E-value=0.12 Score=25.64 Aligned_cols=281 Identities=14% Similarity=0.078 Sum_probs=123.8 Q ss_pred cccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccc Q lcl|Aclame:pro 79 IASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTM 157 (524) Q Consensus 79 ~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~ 157 (524) -+++++++... .-|.+. .++.++.+..+..++|.+.||++-.. +|.-.. ++. .+. T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-------~~p~~~--~~~--------------~a~ 56 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ-------REFVFD--FDS--------------DID 56 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEe--cCc--------------ceE Confidence 34445554442 233333 34444455666678899999876321 111110 000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhh Q lcl|Aclame:pro 158 YSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVA 237 (524) Q Consensus 158 fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~a 237 (524) | .+ T Consensus 57 w-----------------------------------------------------------------------------v~ 59 (300) T protein:vir:95 57 I-----------------------------------------------------------------------------VA 59 (300) T ss_pred E-----------------------------------------------------------------------------ee Confidence 0 00 Q ss_pred hhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Q lcl|Aclame:pro 238 ELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYT 317 (524) Q Consensus 238 Eal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~ 317 (524) | +.+.++...+++.+++.+|.-+-...+|-||.+-... ..+|-+++|.+-|...|...++..++.=... T Consensus 60 E---------g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~- 128 (300) T protein:vir:95 60 E---------NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINP- 128 (300) T ss_pred C---------CcccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccC- Confidence 1 1123444455566666666666677889998753322 2466788999999999999999999841100 Q ss_pred eeeeeeccccccCccceeecccccc-cccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccc Q lcl|Aclame:pro 318 AQVGKSGFTQTVGSKAGSFDFQDPV-DIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSG 396 (524) Q Consensus 318 a~~~~~g~~~~~~~~~G~fdl~~~~-~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g 396 (524) .+.+.....|........ ....+. ....+.-|.++...+.. .+.+.+.+|++|.....|...... T Consensus 129 -------~~g~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lkd~ 194 (300) T protein:vir:95 129 -------RTKQASTIIGDNCFDKKVTQTVPFK-----DTNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSKMKNA 194 (300) T ss_pred -------CCCCCcccccccccccccceeeccc-----ccchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHHhhcc Confidence 000000001110000000 000000 01223334444443332 234667789999998888653221 Q ss_pred ccccchhhhcccccccccceeEEEecCcEEEEecCCCCc------ceEEEEEecCCCccceeEeecccc--cccccccCC Q lcl|Aclame:pro 397 ITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ------DYFTVGFKGDNEMDAGIYYAPYVA--LTPLRGSDP 468 (524) Q Consensus 397 ~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~~~fyaPYv~--~~~~~~~dp 468 (524) - +. .....+.+ ....++|.| ++|+++...+. +.+++|= +..+++|..... +...+-.|+ T Consensus 195 ~----G~--~i~~~~~~-~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~ 261 (300) T protein:vir:95 195 E----GG--KLYPELAW-GGVPDAING-LAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDP 261 (300) T ss_pred C----CC--eeccCccc-cCCCceecc-eeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEeeccCC Confidence 1 10 00011111 112467888 69999888643 2233331 111122222111 111111233 Q ss_pred cc-----cc---ceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhhhhhccc Q lcl|Aclame:pro 469 KN-----FQ---PVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKG 523 (524) Q Consensus 469 ~s-----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~ 523 (524) ++ || =.+=+..|+++.+ +|=+ +.+..++.| T Consensus 262 d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a-------------------------~~~l~~~~g 300 (300) T protein:vir:95 262 DNSGRDLKGYNQIYIRCEAYIGWGIMDAAS-------------------------FARIVKTGG 300 (300) T ss_pred CCcchhhhhcCcEEEEEEEeecceeecccc-------------------------eEEEecCCC Confidence 21 21 2233455777544 5511 111111111 No 67 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=77.01 E-value=0.13 Score=25.45 Aligned_cols=331 Identities=15% Similarity=0.117 Sum_probs=128.0 Q ss_pred CCchHHHHHHhhHhhcccccchhh----cchhH-HHHHHHHHHHHHHHHhccc-------cccchh--hhhhhccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDI----ATKSK-KQLVAAILEAQEKDAETDP-------VYRDEK--IVESFGGFLAEA 66 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i----~~~~~-~~~~~~l~enq~~~~~~~~-------~~~~~~--~~~~~~~~l~ea 66 (524) ++. +-.++|..+.. | +.++ ....+ ..-.-...|.+.+...... .+++.+ ............ T Consensus 31 ~~~--~~~~~~~~l~~--e-ie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (394) T protein:vir:97 31 LES--DDLEAARSIKA--E-VEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDS 105 (394) T ss_pred hch--hhHHHHHHHHH--H-HHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhh Confidence 222 12344554442 1 1111 11100 0000001111110000000 000000 000000000000 Q ss_pred c---------cccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCC Q lcl|Aclame:pro 67 E---------IAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDP 135 (524) Q Consensus 67 ~---------~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~ 135 (524) . ..+.........+.+.++.+-...-|.-+ .+++.+-+......+|.+.||+++++-+--++ .. T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-----~~ 180 (394) T protein:vir:97 106 LRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQ-----RA 180 (394) T ss_pred hhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEe-----cC Confidence 0 00000000000011111111111123322 35555556677788899999887754321111 00 Q ss_pred CCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccc Q lcl|Aclame:pro 136 LAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAA 215 (524) Q Consensus 136 ~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~ 215 (524) +++ ..| T Consensus 181 --~~~---------------~~~--------------------------------------------------------- 186 (394) T protein:vir:97 181 --TTK---------------MVT--------------------------------------------------------- 186 (394) T ss_pred --CCc---------------cce--------------------------------------------------------- Confidence 000 000 Q ss_pred ccccccccccccccccccchhhhhccccCCCCCcccccc-eeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHH Q lcl|Aclame:pro 216 VIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEM-AFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAEL 294 (524) Q Consensus 216 ~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EM-sFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaEL 294 (524) .+|. ...++. ...++++++.++.-+-...+|-||++|- ..|.+++| T Consensus 187 --------------------v~E~---------~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i 233 (394) T protein:vir:97 187 --------------------VAEL---------EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIV 233 (394) T ss_pred --------------------eccc---------ccccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHH Confidence 0000 001111 1345666667777777788999999986 34678889 Q ss_pred HHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 295 SAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGR 374 (524) Q Consensus 295 snILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~ 374 (524) .+-|+..|..-+|..||.-+... ...+...+ +....++... ....+ T Consensus 234 ~~~la~~~~~~~~~~i~~g~~~~-------------~~~~~~~~-------------~~~~~~~~~~--------~~~~~ 279 (394) T protein:vir:97 234 SESISQIKVNTTNDAIAKVLKSF-------------TTKTVKNL-------------DEIKALLNGG--------FDPAY 279 (394) T ss_pred HHHHHHHHHHHHHHHHhhccccc-------------cccccccH-------------HHHHHHHHhh--------hhhhh Confidence 99999999998888888632211 11122111 1112222111 11222 Q ss_pred cCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe--cCCCCcceEEEEEecCCCcccee Q lcl|Aclame:pro 375 GAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI--DQYARQDYFTVGFKGDNEMDAGI 452 (524) Q Consensus 375 g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~dy~~vG~KG~~~~~~~~ 452 (524) . .-+||+|.+...|..+..+- +. -....+.+. ..-++|.| ++|++ |...+..-+++|-- ..+. T Consensus 280 -~-a~~v~n~~~~~~l~~lkd~~----G~--~i~~~~~~~-~~~~~l~G-~pv~~~~~~~~~~~~~~~gd~-----~~~~ 344 (394) T protein:vir:97 280 -N-VSLIVSQSFYQTLDTLKDGN----GR--YLLQDDITA-VSGKVLLG-KPVFVLSDEVLGANKAFIGDF-----KRGV 344 (394) T ss_pred -C-CEEEEcHHHHHHHHHhhccC----CC--eeeecCcCC-CCCceecc-ceeEEecccccCCccEEEeec-----cccE Confidence 2 34679999988887642211 10 000011111 11257888 57766 44445444444421 0111 Q ss_pred EeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 453 YYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 453 fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) ++..- ....+...|...++..+-...||+..+ +|= -|..+-++.. T Consensus 345 ~~~~~-~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~--------------------------a~~~~~~~~~ 390 (394) T protein:vir:97 345 LFADR-KDLGLRWADNEIYGQYLQAVLRFGVSKVDDK--------------------------AGYYVTFTPE 390 (394) T ss_pred EEEEe-cceEEEEecccccceeEEEEEEEccEEeccc--------------------------ceEEEEeccc Confidence 11111 111222334455555556667777543 331 1111111111 No 68 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=75.41 E-value=0.15 Score=25.15 Aligned_cols=363 Identities=13% Similarity=0.052 Sum_probs=131.1 Q ss_pred CCchH---HHHHHhhHhhccccc-chhhcch--hHHHHHHHH------HHHHHHHHhcccc----------ccc---hhh Q lcl|Aclame:pro 1 MSKKN---ELMEKWNDLLESQEG-LPDIATK--SKKQLVAAI------LEAQEKDAETDPV----------YRD---EKI 55 (524) Q Consensus 1 m~~~~---~l~~kw~p~l~~~~~-~~~i~~~--~~~~~~~~l------~enq~~~~~~~~~----------~~~---~~~ 55 (524) |..-. ++...|....+..+. ..+.... -.++....| ++.+++.+++... -++ +.. T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 54443 333334433321100 0001000 000111111 1111111111000 000 000 Q ss_pred hhhhcccccccc----ccccc--ccC-------------ccccccccccccccccCchhhh-HH-HHHHhhhhhhheeee Q lcl|Aclame:pro 56 VESFGGFLAEAE----IAGDH--NYD-------------QTNIASGKSSGAITNIGPAVIG-MV-RRAIPNLIAFDICGV 114 (524) Q Consensus 56 ~~~~~~~l~ea~----~~g~~--~~~-------------~~~~~~st~sg~v~~~~P~li~-l~-Rra~~nLIa~DI~GV 114 (524) ....+....+.+ ..+-+ +.- ......+..+.+....-|.+++ ++ .+.-..++..++|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~ 160 (419) T protein:vir:94 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) T ss_pred ccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhccee Confidence 000000000000 00000 000 0000011111111112233331 11 111223455778888 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) .||++++.-+ +| ... .+.+ ...+ T Consensus 161 ~~~~~~~~~~--~~-----~~~--~~~~--------------~~~~---------------------------------- 183 (419) T protein:vir:94 161 QNADYNVLEY--IR-----DTS--GTAG--------------AGST---------------------------------- 183 (419) T ss_pred eeccCCceee--ee-----ecc--cccc--------------cccc---------------------------------- Confidence 8887664211 11 100 0000 0000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) . + -+...+| +..+++...++++++..+|.=+-...+ T Consensus 184 -----------~-~-----------------------~a~~v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~i 219 (419) T protein:vir:94 184 -----------W-N-----------------------KAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPI 219 (419) T ss_pred -----------C-c-----------------------ccceecC---------CccccccccceeeEEeeeeeEEEeehh Confidence 0 0 0001112 113455555666666666666666789 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeeccccccccc-ccchHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIR-GARWAGES 353 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~-~~~~~~e~ 353 (524) |-||.||.- +.+++|.+-|+..|...+|+.||. |. ..+.+.|++-......+. ..-+.... T Consensus 220 s~ell~d~~-----~l~~~i~~~la~a~~~~~d~aii~--------G~-----G~~~p~Gi~~~~~~~~~~~~~~~~~~t 281 (419) T protein:vir:94 220 TRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLN--------GN-----GSTEMQGILTTPGIGTYQQPKPTAPAT 281 (419) T ss_pred hHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHh--------cc-----Ccccccceecccccccccccccccccc Confidence 999999963 358999999999999999999984 10 001122332211100000 00001111 Q ss_pred HHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcc-cccccchhhhcccccccccceeEEEecCcEEEEecCC Q lcl|Aclame:pro 354 YKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS-GITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQY 432 (524) Q Consensus 354 ~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~-g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y 432 (524) .-..+..|.++-+.+.. .+...+.+||+|.....|..+.. +-.++ ..+. +. .....++|.| ++|+++.. T Consensus 282 ~~~~~~~l~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~~k~~~~~~~--~~~~----~~-~~~~~~~l~G-~pV~~~~~ 351 (419) T protein:vir:94 282 DEPPLVDIRRAKTVAEI--AGFPPDGVVVHPQDWESIELDQAPGSGVF--RVIA----NV-QGEATPRIWG-LNVVSTVA 351 (419) T ss_pred cchhHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHHhhcCCCce--eecC----Cc-ccCCCccccc-eeeEEcCC Confidence 12233344444444432 23356789999999888764311 11000 0111 10 0111346776 69999998 Q ss_pred CCcceEEEEEecCC-----CccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchH Q lcl|Aclame:pro 433 ARQDYFTVGFKGDN-----EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMIS 506 (524) Q Consensus 433 ~~~dy~~vG~KG~~-----~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~ 506 (524) .+..-+++|--... ..+-.+-..++.... -..-+=.+=+..||++.+ +|=+ T Consensus 352 ~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~------~~~~~~~~r~~~r~d~~v~~~~a----------------- 408 (419) T protein:vir:94 352 IAQGTALVGGFRQGATLWSRQGITVLMTDSHADF------FTANTLVILAEFRANLAVYQPKA----------------- 408 (419) T ss_pred CCCccEEEeeccceEEEEEecceEEEEeccccch------hhcCcEEEEEEEeeccEEecccc----------------- Confidence 77654555421100 000111111111000 011222334455666443 2210 Q ss_pred HhhccchhhhhhhhcccC Q lcl|Aclame:pro 507 KEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 507 ~~~a~~~~~~~~~~V~~~ 524 (524) |.++-++-. T Consensus 409 ---------~~~~~~~aa 417 (419) T protein:vir:94 409 ---------FVRVTFAAA 417 (419) T ss_pred ---------EEEEEeccC Confidence 111111111 No 69 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=73.80 E-value=0.17 Score=24.86 Aligned_cols=353 Identities=12% Similarity=0.047 Sum_probs=119.5 Q ss_pred CCch---HHHHHHhh---Hhhccccc---chhhcchhHH-HHHHHHHHHHHHHHhccccccchh---------hhh--hh Q lcl|Aclame:pro 1 MSKK---NELMEKWN---DLLESQEG---LPDIATKSKK-QLVAAILEAQEKDAETDPVYRDEK---------IVE--SF 59 (524) Q Consensus 1 m~~~---~~l~~kw~---p~l~~~~~---~~~i~~~~~~-~~~~~l~enq~~~~~~~~~~~~~~---------~~~--~~ 59 (524) |.=+ |+..++|. .|++.... -++.+..... ..--.-|+.|++...+...-.++. ... .. T Consensus 4 ~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (390) T protein:vir:62 4 TTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQR 83 (390) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Confidence 2211 11222222 12221000 0111111000 000001111111111110000000 000 00 Q ss_pred ccccccccc--ccccccC-----ccccccccccccccccCchhh-hHHHHHH-hhhhhhheeeeecCCchhhhheeeeee Q lcl|Aclame:pro 60 GGFLAEAEI--AGDHNYD-----QTNIASGKSSGAITNIGPAVI-GMVRRAI-PNLIAFDICGVQPMTGPTGQVFALRAV 130 (524) Q Consensus 60 ~~~l~ea~~--~g~~~~~-----~~~~~~st~sg~v~~~~P~li-~l~Rra~-~nLIa~DI~GVQPmTgPTGLIFAMRsr 130 (524) .....+... -|..+.. ......++++++-...-|.+. .++..+. ...+...+|-|-||++...+-+... T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~-- 161 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVI-- 161 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE-- Confidence 000000000 0000000 000011111111101111111 1111111 1223344444444433222111100 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) .. +. + T Consensus 162 -~~-----~~-----~---------------------------------------------------------------- 166 (390) T protein:vir:62 162 -TG-----RS-----S---------------------------------------------------------------- 166 (390) T ss_pred -cC-----Cc-----c---------------------------------------------------------------- Confidence 00 00 0 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCh Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 290 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDA 290 (524) +...+| +..+++-.-++++++..+|..+-...+|-||.+|- .+|. T Consensus 167 ----------------------a~wv~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l 211 (390) T protein:vir:62 167 ----------------------ASIVGE---------TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDL 211 (390) T ss_pred ----------------------eeeecc---------cccccccccceeeeEeeeeeEEeehHHHHHHHhhh----hHHH Confidence 000111 11233334455667777777777789999999992 4678 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 291 DAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIAR 370 (524) Q Consensus 291 EaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~ 370 (524) +++|.+-|+..|..-+|..||. | .+.+.|++......... ...... -.--+..|+.+-+.+.. T Consensus 212 ~~~i~~~l~~~i~~~~d~~~l~--------G-------~G~p~Gi~~~~~~~~~~-~~~~~~-~~~~~~~l~~~~~~l~~ 274 (390) T protein:vir:62 212 VGFLVSDAGPAIGDAMGRHFIT--------G-------TGQPRGILTDASPATAT-FLATDT-DSKVSDALIDLFHEVPS 274 (390) T ss_pred HHHHHHHHHHHHHHHHHhhhhc--------c-------CCccccccccccccccc-eecccc-cccchHHHHHHHHhhhh Confidence 9999999999999999999885 1 11223444332111000 000000 00011122333334433 Q ss_pred hccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccc Q lcl|Aclame:pro 371 QTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDA 450 (524) Q Consensus 371 ~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~ 450 (524) .-+. . -..||+|.....|..+..+- +. -....+.+. ..-++|.| ++|+++.+.+.+=|++|-- . T Consensus 275 ~~~~-~-a~~vmn~~~~~~L~~lkd~~----g~--~l~~~~~~~-g~~~~l~G-~Pv~~~~~~p~~~i~~gd~---s--- 338 (390) T protein:vir:62 275 AYRA-N-AKYVVNDLRAAQMRKLKDAN----GQ--YLWQSGLTV-GAPSLFNG-KVVETDDGMPADKILFADL---S--- 338 (390) T ss_pred hhhc-C-CEEEEchHHHHHHHHhhccC----CC--eeecCCcCC-Cccceecc-cceEEecCCCCccEEEeec---c--- Confidence 2221 2 25688999888886542211 10 000011110 11246787 6999999988765554411 0 Q ss_pred eeEeecccc-cccccccCCcc--ccceeeeeeeeccE-ecCcccccCCCccccccccchHH Q lcl|Aclame:pro 451 GIYYAPYVA-LTPLRGSDPKN--FQPVMGFKTRYGIG-INPFANSRSQAPADRITSGMISK 507 (524) Q Consensus 451 ~~fyaPYv~-~~~~~~~dp~s--~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~~~~~~ 507 (524) -|+-.... .......|+-. -+=.+=+..|++.. .|| ++-+++...... T Consensus 339 -~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~--------~A~~~l~~~~~a 390 (390) T protein:vir:62 339 -KYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA--------RGAKVLTVTPGA 390 (390) T ss_pred -ceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeech--------hheEEEEeecCC Confidence 01110000 11111112211 12223344566533 222 222333221111 No 70 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=73.20 E-value=0.17 Score=24.76 Aligned_cols=339 Identities=14% Similarity=0.123 Sum_probs=125.7 Q ss_pred CCchHHHHHHhhHhhccccc---------------chhhcchhHH--HHHHH--HHHHHHHHHhcccc------ccc--- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG---------------LPDIATKSKK--QLVAA--ILEAQEKDAETDPV------YRD--- 52 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~---------------~~~i~~~~~~--~~~~~--l~enq~~~~~~~~~------~~~--- 52 (524) |-+.++|.++|..+.+.-+. ..+|....++ ....+ -|+.|..+...... -++ T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 33567788888655431110 1122111100 00110 01222222211110 000 Q ss_pred -------hhhhhhhcccccccccccccccCccccc----ccc-ccccccccCchhh--hHHHHHHhhhhhhheeeeecCC Q lcl|Aclame:pro 53 -------EKIVESFGGFLAEAEIAGDHNYDQTNIA----SGK-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMT 118 (524) Q Consensus 53 -------~~~~~~~~~~l~ea~~~g~~~~~~~~~~----~st-~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmT 118 (524) +....+|..++- +.++....... .++ ..|... . |.-+ .+++.+.......++|.+.||+ T Consensus 84 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~t~~~gg~~-v-P~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 156 (408) T protein:vir:10 84 KSENELKDKFVKDFVNMVR-----NPMAFMNTVSSKTETSGSDSAAGLT-I-PQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) T ss_pred cchhhhHHHHHHHHHHHhh-----cchhhhhhhhhhhhhcccccCCcee-c-cHhHHHHHHHHHHhhchhhhhcceeecc Confidence 000111111110 11111100000 011 111111 1 3222 3555566677788999999999 Q ss_pred chhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 GPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVT 198 (524) Q Consensus 119 gPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~ 198 (524) ++.|-+--.+-. +. .+ .+.|-+. T Consensus 157 ~~~~~~~~~~~~--~~---~~---------------~a~~v~E------------------------------------- 179 (408) T protein:vir:10 157 TSNGSRVYEKWT--DV---TP---------------LTVMDAE------------------------------------- 179 (408) T ss_pred CCcceEEEeecc--cc---cc---------------ceeeecC------------------------------------- Confidence 888765432210 00 00 0000000 Q ss_pred ccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHH Q lcl|Aclame:pro 199 SGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVEL 278 (524) Q Consensus 199 ~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~EL 278 (524) | ++.. ..+...|.++.|.+.|..+ ...+|-|| T Consensus 180 --------------------------------~-----~~~~----~~~~~~~~~i~~~~~k~~~-------~~~iS~el 211 (408) T protein:vir:10 180 --------------------------------D-----GKIP----DLDNPQLTIIKYLIKRYAG-------IITATNTS 211 (408) T ss_pred --------------------------------c-----cccc----cccCcceeeEEeeeeeEEe-------eehhHHHH Confidence 0 0000 0011235555565555554 45699999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHH Q lcl|Aclame:pro 279 AQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALL 358 (524) Q Consensus 279 AQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~ 358 (524) .+|- .+|.+++|.+.|+..|..-+|+.|+.-.-... ...|+.++ +....++ T Consensus 212 l~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~------------~~~~~~~~-------------~~l~~~~ 262 (408) T protein:vir:10 212 LKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------KKPTIAKF-------------DDVITMI 262 (408) T ss_pred Hhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------cccccccH-------------HHHHHHH Confidence 9994 45679999999999999999999885221110 11222221 1112211 Q ss_pred -HHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCC--CCc Q lcl|Aclame:pro 359 -IQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQY--ARQ 435 (524) Q Consensus 359 -~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~ 435 (524) ..+. ..+-..-.+||+|.....|..+...-..+- .+ .+.+. ...++|.| ++|++-.+ .+. T Consensus 263 ~~~~~---------~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i--~~----~~~~~-~~~~~l~G-~PV~~~~~~~~~~ 325 (408) T protein:vir:10 263 NTAVD---------PAIIATSSLLTNQSGLNKLALVKTAEGKYL--LE----PDPTK-PNSYLIKG-KQVIVVADRWLPN 325 (408) T ss_pred HHhhh---------hhhccCCEEEEcHHHHHHHHHhhccCCceE--ec----cCcCC-CCCceecc-eeeEEecccccCc Confidence 1111 121122357899999888876432211100 01 11111 11246777 57766322 111 Q ss_pred --------------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCccc------ccCC Q lcl|Aclame:pro 436 --------------DYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAN------SRSQ 494 (524) Q Consensus 436 --------------dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~------~~~~ 494 (524) ++++++.++... +=+.++.- .+-.+.+=.+-+..||++.+ +|=+. .... T Consensus 326 ~~~~~~~i~~gd~~~~~~~~~~~~~~----v~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~ 395 (408) T protein:vir:10 326 TGSTVYPLYYGDMSQAITLFDRENMS----LLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) T ss_pred cCCCceEEEEEehhccEEEEEecceE----EEEccccc------chhhcCceEEEEEEeeccEEeccccEEEEEeecccc Confidence 112222221111 00111100 00112233344445555432 22000 0000 Q ss_pred CccccccccchHHhhccchhhhhhhhc Q lcl|Aclame:pro 495 APADRITSGMISKEMCGKNAYFRKVWV 521 (524) Q Consensus 495 ~~~~~i~~~~~~~~~a~~~~~~~~~~V 521 (524) ..+ ..+....- + | T Consensus 396 ~~~---~~~~~~~~----~-------~ 408 (408) T protein:vir:10 396 QVG---NFKTTTST----A-------V 408 (408) T ss_pred CCC---CCCCCCcc----c-------C Confidence 000 00000000 0 0 No 71 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=71.93 E-value=0.19 Score=24.55 Aligned_cols=348 Identities=13% Similarity=0.077 Sum_probs=127.2 Q ss_pred CCch-HHHHHHhhH-------hhccccc-chhhcchhHHHHHHHHHHHHHHHHhccc---------------cccchhhh Q lcl|Aclame:pro 1 MSKK-NELMEKWND-------LLESQEG-LPDIATKSKKQLVAAILEAQEKDAETDP---------------VYRDEKIV 56 (524) Q Consensus 1 m~~~-~~l~~kw~p-------~l~~~~~-~~~i~~~~~~~~~~~l~enq~~~~~~~~---------------~~~~~~~~ 56 (524) |++. ++|+++=.. +++..+. ..+++...++ +.. |+.|.+..++.. .-++.... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e--~~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNE--IDI-LQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVI 77 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHH--HHH-HHHHHHHHHHHHHHHHHHhhhhccccccccchhhH Confidence 9973 355554433 3332121 1223221111 111 122221111000 00000000 Q ss_pred hhh--------cccccccccccc-cccCccc-cccc-cccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhh Q lcl|Aclame:pro 57 ESF--------GGFLAEAEIAGD-HNYDQTN-IASG-KSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQ 123 (524) Q Consensus 57 ~~~--------~~~l~ea~~~g~-~~~~~~~-~~~s-t~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGL 123 (524) ... ..++.+.+..+- ....-.. ...+ +++|.+. . |.-+ .+++.+-.+....+++++.||+++.|- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~-v-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~ 155 (404) T protein:vir:10 78 YNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYA-V-PEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGS 155 (404) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCcee-e-chhHHHHHHHHHhhhhhHhhhhceeeccCCccc Confidence 000 111111110000 0000000 0011 1222221 1 2222 345555566778899999999999875 Q ss_pred heeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCc Q lcl|Aclame:pro 124 VFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVT 203 (524) Q Consensus 124 IFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~ 203 (524) +--.|. ... . ...|-+.+ T Consensus 156 ~~~~~~--~~~-----~--------------~~~~v~e~----------------------------------------- 173 (404) T protein:vir:10 156 RTYEKR--SKQ-----K--------------PMKPLSEN----------------------------------------- 173 (404) T ss_pred eEEEEe--cCC-----c--------------ceeecccc----------------------------------------- Confidence 332210 010 0 00000000 Q ss_pred ccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 204 VTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLR 283 (524) Q Consensus 204 ~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLk 283 (524) +..+ + .....++++++.+.|.-+-...+|-||.+|-. T Consensus 174 --~~~~----------------------------~-------------~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 210 (404) T protein:vir:10 174 --QQIP----------------------------T-------------NGDNGKLERFNFKLKDLADFMSIPNDLLKFAD 210 (404) T ss_pred --cccc----------------------------c-------------cccccceeeeEeeheeeEeeehhhHHHHhhcH Confidence 0000 0 00112234444444444455679999999843 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 284 AVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDK 363 (524) Q Consensus 284 AiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~ 363 (524) .+.+++|.+.|+..|...+|+.||.=-- +...+.|+......... . +.. ...+..++. T Consensus 211 ----~~l~~~i~~~la~~~~~~~~~~il~G~g------------~~~~~~gi~~~~~~~~~-~--~~~---~~~~~~~~~ 268 (404) T protein:vir:10 211 ----KSLEDWIINWFVDKVRITRNAEILYGAG------------GDEHATGIMTANKFKKI-T--LPK---SPALKDFKK 268 (404) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHhhcCC------------CCCcccceeecccccee-e--ccc---cccHHHHHH Confidence 3568889999999999999998884111 01122344332211100 0 000 001111221 Q ss_pred HHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCC-CCcceEEEEE Q lcl|Aclame:pro 364 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQY-ARQDYFTVGF 442 (524) Q Consensus 364 ~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y-~~~dy~~vG~ 442 (524) .-+. .....+...-.+||+|+....|.....+-..+ ....+.+ ...-++|.| ++|++.+. .+.. T Consensus 269 ~~~~-~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~------l~~~~~~-~~~~~~l~G-~PV~~~~~~~~~~------ 333 (404) T protein:vir:10 269 CKNV-ELLNVFKATSSWIVNQDGFNYLDSLEDKTGRP------YLQPDPK-DPTQYRFLG-LPVIELPNDLLLS------ 333 (404) T ss_pred HHHh-hhhccccCCCEEEEcHHHHHHHHHhhccCCce------eeccCcC-CCCCccccc-eeeEEecccccCC------ Confidence 1111 11233323335799999999887642211100 0011111 111246777 57775322 1110 Q ss_pred ecCCCccceeEeeccc---------ccccccccCC----ccccceeeeeeeeccEe-cC--ccc--ccCCCccc Q lcl|Aclame:pro 443 KGDNEMDAGIYYAPYV---------ALTPLRGSDP----KNFQPVMGFKTRYGIGI-NP--FAN--SRSQAPAD 498 (524) Q Consensus 443 KG~~~~~~~~fyaPYv---------~~~~~~~~dp----~s~qP~~~~~tRY~l~~-nP--~~~--~~~~~~~~ 498 (524) ...+..++|+.+- .+......++ ...+=.+-...|+++.+ +| |.. -..-+.++ T Consensus 334 ---~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 334 ---TESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ---CCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 0001111222111 0111111122 23334455666776543 33 210 01111111 No 72 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=71.62 E-value=0.19 Score=24.50 Aligned_cols=270 Identities=14% Similarity=0.054 Sum_probs=112.4 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -++.. +.-..-.-.|-+.++-.. .+ ......++.......-.. T Consensus 1 m~~~~-T~l~d~i~Pev~~~~v~~--~~----------------------------------~~~l~~~~~~~~~~~l~g 43 (274) T protein:vir:95 1 MAQGM-TKLTNQIVPEVLAPMMQA--EL----------------------------------EKKLRFASFAEIDNTLVG 43 (274) T ss_pred CCcce-eehhheechHHHHHHHHH--HH----------------------------------HhhhhccccceecccccC Confidence 00000 000000001111100000 00 000000000000000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC-CC Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHG-MD 289 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHG-LD 289 (524) ..|...++..--....+|... ....-...++..+ +++++.+-|. |+ |.+. |+.+..+ -| T Consensus 44 ----------~~G~tv~iP~~~~ig~a~~~~---~g~~i~~~~lt~~--~~~~~i~~~~-~a-~~i~---D~~~~~~~~d 103 (274) T protein:vir:95 44 ----------QPGDTLTFPAFIYSGDAKVVA---EGEKIPTDILETK--KREAKIRKIA-KG-TSIS---DEALLSGYGD 103 (274) T ss_pred ----------CCCCEEEeeeecCCCcccccc---CCCccchhhcccc--eeEEEeeeee-cc-eeeh---HHHHhhccch Confidence 001111111000011222111 1111223343333 3333334443 22 2222 5555553 47 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 290 ADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIA 369 (524) Q Consensus 290 AEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~ 369 (524) --.|..+-++..++.+++++++..+..... .+ ....+ ..+.+-....++.++.. T Consensus 104 ~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-~~---------~~~~~-------------~~d~i~~A~~~lgd~~~--- 157 (274) T protein:vir:95 104 PQGEQVRQHGLAHANKVDDDVLEALKSAKL-TV---------EADIT-------------KLTGLQTAIDKFNDEDL--- 157 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cc---------ccccc-------------CHHHHHHHHHHhccccc--- Confidence 889999999999999999999976643211 10 01111 12333444444443321 Q ss_pred HhccccCCCEEEEchhhhhhhhhhcc-cccccchhhhcccccccccceeEEEecCcEEEEecCCCCcce-EEEEEecCCC Q lcl|Aclame:pro 370 RQTGRGAGNFIIASRNVVSALARIDS-GITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDY-FTVGFKGDNE 447 (524) Q Consensus 370 ~~T~~g~gn~~v~S~~va~~L~~~~~-g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~ 447 (524) .++++||+|.+++.|..... -+..++.... .-..+...|.+.| ++||+|...+..- +++| +|. T Consensus 158 ------~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA-- 222 (274) T protein:vir:95 158 ------EPMVLFISPLDAGKLRGDATTNFTRATELGD-----DVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGA-- 222 (274) T ss_pred ------cccEEEeCHHHHHHHHhhccccccccccccc-----cceeccccceecC-eEEEEeCCCCCceEEEEe-ccc-- Confidence 56899999999999975210 1112221110 1112224678876 8999999887433 2222 221 Q ss_pred ccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhh Q lcl|Aclame:pro 448 MDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEM 509 (524) Q Consensus 448 ~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~ 509 (524) -.||.. -+...-...||++++=.+-..-+||+.+ || ....+++.|+-.-.| T Consensus 223 ---~~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-------~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 223 ---VKLITK-RDFFLETDRDPSTKTTALYSDKHYVAYLYDE-------SKAVKITKGSGSLEM 274 (274) T ss_pred ---eeeeec-CCcccccccccccccCEEEEeEEEEEEEEcC-------CcEEEEEcCCccccC Confidence 122221 1122222369999999999999999764 33 222344433322233 No 73 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=71.62 E-value=0.19 Score=24.50 Aligned_cols=270 Identities=14% Similarity=0.054 Sum_probs=112.4 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -++.. +.-..-.-.|-+.++-.. .+ ......++.......-.. T Consensus 1 m~~~~-T~l~d~i~Pev~~~~v~~--~~----------------------------------~~~l~~~~~~~~~~~l~g 43 (274) T protein:vir:96 1 MAQGM-TKLTNQIVPEVLAPMMQA--EL----------------------------------EKKLRFASFAEIDNTLVG 43 (274) T ss_pred CCcce-eehhheechHHHHHHHHH--HH----------------------------------HhhhhccccceecccccC Confidence 00000 000000001111100000 00 000000000000000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC-CC Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHG-MD 289 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHG-LD 289 (524) ..|...++..--....+|... ....-...++..+ +++++.+-|. |+ |.+. |+.+..+ -| T Consensus 44 ----------~~G~tv~iP~~~~ig~a~~~~---~g~~i~~~~lt~~--~~~~~i~~~~-~a-~~i~---D~~~~~~~~d 103 (274) T protein:vir:96 44 ----------QPGDTLTFPAFIYSGDAKVVA---EGEKIPTDILETK--KREAKIRKIA-KG-TSIS---DEALLSGYGD 103 (274) T ss_pred ----------CCCCEEEeeeecCCCcccccc---CCCccchhhcccc--eeEEEeeeee-cc-eeeh---HHHHhhccch Confidence 001111111000011222111 1111223343333 3333334443 22 2222 5555553 47 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 290 ADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIA 369 (524) Q Consensus 290 AEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~ 369 (524) --.|..+-++..++.+++++++..+..... .+ ....+ ..+.+-....++.++.. T Consensus 104 ~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-~~---------~~~~~-------------~~d~i~~A~~~lgd~~~--- 157 (274) T protein:vir:96 104 PQGEQVRQHGLAHANKVDDDVLEALKSAKL-TV---------EADIT-------------KLTGLQTAIDKFNDEDL--- 157 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cc---------ccccc-------------CHHHHHHHHHHhccccc--- Confidence 889999999999999999999976643211 10 01111 12333444444443321 Q ss_pred HhccccCCCEEEEchhhhhhhhhhcc-cccccchhhhcccccccccceeEEEecCcEEEEecCCCCcce-EEEEEecCCC Q lcl|Aclame:pro 370 RQTGRGAGNFIIASRNVVSALARIDS-GITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDY-FTVGFKGDNE 447 (524) Q Consensus 370 ~~T~~g~gn~~v~S~~va~~L~~~~~-g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~ 447 (524) .++++||+|.+++.|..... -+..++.... .-..+...|.+.| ++||+|...+..- +++| +|. T Consensus 158 ------~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA-- 222 (274) T protein:vir:96 158 ------EPMVLFISPLDAGKLRGDATTNFTRATELGD-----DVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGA-- 222 (274) T ss_pred ------cccEEEeCHHHHHHHHhhccccccccccccc-----cceeccccceecC-eEEEEeCCCCCceEEEEe-ccc-- Confidence 56899999999999975210 1112221110 1112224678876 8999999887433 2222 221 Q ss_pred ccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhh Q lcl|Aclame:pro 448 MDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEM 509 (524) Q Consensus 448 ~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~ 509 (524) -.||.. -+...-...||++++=.+-..-+||+.+ || ....+++.|+-.-.| T Consensus 223 ---~~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-------~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 223 ---VKLITK-RDFFLETDRDPSTKTTALYSDKHYVAYLYDE-------SKAVKITKGSGSLEM 274 (274) T ss_pred ---eeeeec-CCcccccccccccccCEEEEeEEEEEEEEcC-------CcEEEEEcCCccccC Confidence 122221 1122222369999999999999999764 33 222344433322233 No 74 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=70.83 E-value=0.2 Score=24.37 Aligned_cols=282 Identities=10% Similarity=0.030 Sum_probs=106.8 Q ss_pred cccccccccccccccccccccccccccccccc----cccccCCcccccCcccccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 164 HTAFAKITTGTAIATGAIVYHIFQETGIAYFQ----NVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAEL 239 (524) Q Consensus 164 ~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~----~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEa 239 (524) .. +.+.+ .|..........-..... ....+.......+.. ......+.+-..-.+| T Consensus 1 Ma---t~tt~----~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~------------~~p~~~~~~~a~wv~E- 60 (311) T protein:vir:99 1 MA---TFGTG----NLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNE------------DIITFNGRPKAEFVGE- 60 (311) T ss_pred Cc---eecCC----CceeccHHHHHHHHHHHHhhchhhhhcceeeccCCce------------EEEEEeCCceeEEeec- Confidence 11 11100 000000000000000000 000000000000000 0001111111112233 Q ss_pred ccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhee Q lcl|Aclame:pro 240 QENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQ 319 (524) Q Consensus 240 l~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~ 319 (524) +.++++...++++++..+|.-+-....|-||.|+-.- -..|-+++|.+.|...|+..|++.++.-.....- T Consensus 61 --------g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g 131 (311) T protein:vir:99 61 --------GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQTLSEAGAEALARALDLGLYHRINPLTG 131 (311) T ss_pred --------CcccccccceeeEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccC Confidence 2346666777788888888888888999999763321 1355688999999999999999998852211000 Q ss_pred eeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccc Q lcl|Aclame:pro 320 VGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 399 (524) Q Consensus 320 ~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~ 399 (524) -+..+...-.....+...+... .+ -.+..-|+.+...+...-.+...+-.|++|+....|...... T Consensus 132 ~~~~g~~~~~~~~~~~~~~~~~------~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~--- 197 (311) T protein:vir:99 132 TVIPGWSNYLGAASKRVELTAD------TI-----ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYT--- 197 (311) T ss_pred ccccccccccccccceeecccc------cc-----chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhcc--- Confidence 0001110000000011111110 00 111122233333332222223446689999999888643211 Q ss_pred cchhhhcccccccccceeEEEecCcEEEEecCCCC----------------cceEEEEEecCCCccceeEeecccccc-- Q lcl|Aclame:pro 400 ASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR----------------QDYFTVGFKGDNEMDAGIYYAPYVALT-- 461 (524) Q Consensus 400 ~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~----------------~dy~~vG~KG~~~~~~~~fyaPYv~~~-- 461 (524) .+ +.....+.+ ....|+|.| ++|++..+-+ .+++++|=- ..++.|.-.-... T Consensus 198 -~G--~~l~~~~~~-~~~~~~l~G-~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf-----~~~~~~~~~~~~~~~ 267 (311) T protein:vir:99 198 -DG--RKKFPELGL-GIGVSSFEG-IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDF-----ANGIHWGVQRDIPVE 267 (311) T ss_pred -CC--CeeecCccc-CCCCceecc-eeeEeecccccccccccccchhhccCcceEEEeec-----cccEEEEEecCceEE Confidence 11 000000111 111257777 5888876532 233333311 1122222111111 Q ss_pred cccccCCcccc-----ceeee--eeeeccEecCcccccCCCccccccccchHHhhc Q lcl|Aclame:pro 462 PLRGSDPKNFQ-----PVMGF--KTRYGIGINPFANSRSQAPADRITSGMISKEMC 510 (524) Q Consensus 462 ~~~~~dp~s~q-----P~~~~--~tRY~l~~nP~~~~~~~~~~~~i~~~~~~~~~a 510 (524) ..+.-|++... --++| ..|||..+-+ ....++.++. | T Consensus 268 ~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~-------~~~v~~~~~~-----A 311 (311) T protein:vir:99 268 LIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFT-------DRFVVIENAV-----A 311 (311) T ss_pred EeecCCCCcchhhhhcCcEEEEEEEeecceecC-------hhHeeeeccc-----C Confidence 11111333211 12333 6788865422 1122333211 1 No 75 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=70.58 E-value=0.21 Score=24.33 Aligned_cols=303 Identities=10% Similarity=0.026 Sum_probs=123.1 Q ss_pred HHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeee Q lcl|Aclame:pro 36 ILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGV 114 (524) Q Consensus 36 l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GV 114 (524) ..|+|+.+.+-.. |...+-+.+..+ +.+.. ++++++. ..-+.+. .+++.+..+.+..++|-+ T Consensus 1 ~~~~~~~~~~~~~----------f~~~~~~~~~~~-----a~~~~-~~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:93 1 MEQTQKLKLNLQH----------FASNNVKPQVFN-----PDNVM-MHEKKDG-TLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CchhHHHHHHHHH----------HHHhhhhhhhcc-----ccccc-ccCCCcc-eechhHHHHHHHHHHhhchhhhhcce Confidence 1111111111000 000011100000 00000 1111111 1122233 456666678888899999 Q ss_pred ecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) Q Consensus 115 QPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~ 194 (524) -||++++--| +-.. ++. + +.| T Consensus 64 ~~~~~~~~~i-------p~~~--~~~-----~---------a~~------------------------------------ 84 (324) T protein:vir:93 64 EPMEGTEKKF-------TFWA--DKP-----G---------AYW------------------------------------ 84 (324) T ss_pred eeccCCceEE-------EEEe--cCc-----c---------eee------------------------------------ Confidence 9998765322 1110 000 0 000 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .+| +..+++..-++++++++.+..+-.... T Consensus 85 -----------------------------------------v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~i 114 (324) T protein:vir:93 85 -----------------------------------------VGE---------GQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred -----------------------------------------ecC---------CccccccccceeEEEEEeEEEEEeehh Confidence 001 012333334456666666666667789 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccc-cccchHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI-RGARWAGES 353 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~-~~~~~~~e~ 353 (524) |-||.+|-. .|.+++|.+.|+..|...+++.+|.--... ....|+++....... ..+. T Consensus 115 S~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~------------~~~~~~~~~~~~~~~~~~~~----- 173 (324) T protein:vir:93 115 TKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFGKSIAQSIEKTNKVIKGD----- 173 (324) T ss_pred hHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCCC------------CcCccccccccccceecccc----- Confidence 999999953 467999999999999999999998521100 011233222111110 0010 Q ss_pred HHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCC Q lcl|Aclame:pro 354 YKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYA 433 (524) Q Consensus 354 ~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~ 433 (524) ..+..|.++.+.|.. .+.....+||+|.....|..+... .+. ....+.. .+.|.| ++|++.+.. T Consensus 174 --~~~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d~----~G~---~~~~~~~----~~~l~G-~PVv~~~~~ 237 (324) T protein:vir:93 174 --FTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDP----ETK---ERIYDRN----SDSLDG-LPVVNLKSS 237 (324) T ss_pred --ccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhCC----CCC---eeecCCC----CCcccc-eeeEeecCC Confidence 112223333333333 234567899999999988753211 110 0111111 245766 688886653 Q ss_pred --CcceEE--------EEEecCCCccceeEeecccccccccccCC------ccccceeeeeeeeccEe-cCcccccCCCc Q lcl|Aclame:pro 434 --RQDYFT--------VGFKGDNEMDAGIYYAPYVALTPLRGSDP------KNFQPVMGFKTRYGIGI-NPFANSRSQAP 496 (524) Q Consensus 434 --~~dy~~--------vG~KG~~~~~~~~fyaPYv~~~~~~~~dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~ 496 (524) +...++ +|..++-+.+ ...+..+......|. ..-|=.+=+..||++.+ +|= - T Consensus 238 ~~~~~~i~~gdfs~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~-------a 306 (324) T protein:vir:93 238 NLKRGELITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK-------A 306 (324) T ss_pred CCCcceEEEEecceEEEEEecCcEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc-------c Confidence 222333 3333222111 001100111000010 01122334445666543 220 0 Q ss_pred cccccccchHHh--hccch Q lcl|Aclame:pro 497 ADRITSGMISKE--MCGKN 513 (524) Q Consensus 497 ~~~i~~~~~~~~--~a~~~ 513 (524) ..++. +-++.. ..|+- T Consensus 307 ~~~l~-~a~~~~~~~~~~~ 324 (324) T protein:vir:93 307 FAKLV-PADKRTDSVPGEV 324 (324) T ss_pred eEEEe-cccccCCCCCCCC Confidence 00111 011110 01111 No 76 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=70.47 E-value=0.21 Score=24.32 Aligned_cols=339 Identities=13% Similarity=0.078 Sum_probs=127.8 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHH-------HHHH------HHHHHHHhccc-----cccchh---hhhhh Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLV-------AAIL------EAQEKDAETDP-----VYRDEK---IVESF 59 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~-------~~l~------enq~~~~~~~~-----~~~~~~---~~~~~ 59 (524) |.- ++|+++|..+.+. +.++.+..++... ...+ .++.+.+++.. ...++. -.... T Consensus 1 M~~-~eL~~~~~~~~~~---~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (395) T protein:vir:38 1 MNI-NQLKDAFDMAGQK---VQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPV 76 (395) T ss_pred CCH-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 765 4588888777542 2333222111110 0000 01111111100 000000 00000 Q ss_pred ccccccccccc----------ccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheee Q lcl|Aclame:pro 60 GGFLAEAEIAG----------DHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFAL 127 (524) Q Consensus 60 ~~~l~ea~~~g----------~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAM 127 (524) .....+..... -.+....-.+..+++++-...=|.-+ .+++.+....+..++|.++||++++|-+-- T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~- 155 (395) T protein:vir:38 77 NKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVY- 155 (395) T ss_pred cccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEE- Confidence 00000000000 00000000001111111111113222 355555567778889999999998875311 Q ss_pred eeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccc Q lcl|Aclame:pro 128 RAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGA 207 (524) Q Consensus 128 RsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt 207 (524) ...... .. ...|- T Consensus 156 --~~~~~~---~~--------------~a~~v------------------------------------------------ 168 (395) T protein:vir:38 156 --EKLADI---TP--------------LKDLD------------------------------------------------ 168 (395) T ss_pred --EeeccC---Cc--------------ccccc------------------------------------------------ Confidence 100000 00 00000 Q ss_pred CcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC Q lcl|Aclame:pro 208 DPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHG 287 (524) Q Consensus 208 ~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHG 287 (524) +.| +.. ..+....|.+..|...|.. -...+|-||.+|- . T Consensus 169 ---------------------~E~------~~~---~~~~~~~f~~v~~~~~k~~-------~~~~iS~ell~ds----~ 207 (395) T protein:vir:38 169 ---------------------DES------ALI---GDNDDPELTVVKYLIHRYA-------GITTVTNTLLKDT----V 207 (395) T ss_pred ---------------------ccc------ccc---ccccccceeeEEeeeeeeE-------eehhhHHHHHhhh----H Confidence 000 000 0001122455555555544 4456999999993 3 Q ss_pred CChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 288 MDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANE 367 (524) Q Consensus 288 LDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~ 367 (524) .|-++.|.+-|+..|..-||..|+.-.-.. ....|..++ +....++.... T Consensus 208 ~~l~~~i~~~la~~~~~~~~~~il~g~g~~------------~~~~~~~~~-------------~~i~~~~~~~l----- 257 (395) T protein:vir:38 208 DNIIQWLVNWAAKKDVVTRNAKILEVMGKA------------PKKPTISQF-------------DNIKDLENNTL----- 257 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccc------------ccccccccH-------------HHHHHHHHHhh----- Confidence 566899999999999999999888511100 011122111 11223322211 Q ss_pred HHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCc-----ce-EEEE Q lcl|Aclame:pro 368 IARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ-----DY-FTVG 441 (524) Q Consensus 368 I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy-~~vG 441 (524) .. .+.....+||+|.....|......- + ..+...+......++|.| ++|++....+. +. +++| T Consensus 258 -~~--~~~~~a~~v~n~~~~~~L~~lkd~~----G---~~l~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~i~~g 326 (395) T protein:vir:38 258 -DP--AIESTSSFITNQSGYNILSKVKDAD----G---RYLMQPDVTSPDKYLIDG-KPVIRIADKWLPDVSGSHPLYFG 326 (395) T ss_pred -hh--hhcCCCEEEEcHHHHHHHHHhhccC----C---ceeeccCcCCCCcceecc-ceeEEecccccCcCCCcceEEEE Confidence 11 1113456899999988887532111 1 001011111112246776 58877543211 11 2222 Q ss_pred ---------EecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cC-------cccccCCCccccccccc Q lcl|Aclame:pro 442 ---------FKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NP-------FANSRSQAPADRITSGM 504 (524) Q Consensus 442 ---------~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP-------~~~~~~~~~~~~i~~~~ 504 (524) .+.. ..+=+.++. ..+-..-+=.+-+..||+..+ +| +....+++++. T Consensus 327 d~~~~~~i~~~~~----~~i~~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~------ 390 (395) T protein:vir:38 327 DLKQGITLFDRQQ----MQIDTTNVG------AGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQGT------ 390 (395) T ss_pred eccccEEEEEecc----eEEEEeccc------cchhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCCc------ Confidence 1111 011111110 001122234455556666543 23 11112222221 Q ss_pred hHHhhccc Q lcl|Aclame:pro 505 ISKEMCGK 512 (524) Q Consensus 505 ~~~~~a~~ 512 (524) --.|| T Consensus 391 ---~~~~~ 395 (395) T protein:vir:38 391 ---AGTGK 395 (395) T ss_pred ---cCCCC Confidence 12244 No 77 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=69.93 E-value=0.22 Score=24.23 Aligned_cols=344 Identities=15% Similarity=0.119 Sum_probs=131.3 Q ss_pred CCchHHHHHHhhHhhccccc-chhhcch---------hHHHHHHHH---------HHHHHHHHhccc---------ccc- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG-LPDIATK---------SKKQLVAAI---------LEAQEKDAETDP---------VYR- 51 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~-~~~i~~~---------~~~~~~~~l---------~enq~~~~~~~~---------~~~- 51 (524) | +.++|.++|..+.+.-+. ..++... ..+.+.+.+ +++|.++..... ... T Consensus 5 m-~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (404) T protein:vir:39 5 L-TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (404) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 4 446788888777542111 0111000 001111111 111111111100 000 Q ss_pred ------chhhhhhhcccccccc-cccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhh Q lcl|Aclame:pro 52 ------DEKIVESFGGFLAEAE-IAGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQ 123 (524) Q Consensus 52 ------~~~~~~~~~~~l~ea~-~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGL 123 (524) .+....+|..++.... ........+-. ..++++|.+. .-+.+. .+++.+-++....++|.++||+++++- T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~-~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (404) T protein:vir:39 84 KSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTET-SGSDSAAGLT-IPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (404) T ss_pred cchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhh-cccccCCcee-ccHHHHHHHHHHHHhhhhHHhhcceeeccCCcce Confidence 0001111111111000 00000000000 0111122211 111221 344445567778889999999988765 Q ss_pred heeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCc Q lcl|Aclame:pro 124 VFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVT 203 (524) Q Consensus 124 IFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~ 203 (524) +--.|- .... + .+.|-+. T Consensus 162 ~~~~~~--~~~~---~---------------~a~~v~E------------------------------------------ 179 (404) T protein:vir:39 162 RVYEKW--TDVT---P---------------LTVMDAE------------------------------------------ 179 (404) T ss_pred EEEEee--cCCc---c---------------ceeeecC------------------------------------------ Confidence 432211 0100 0 0000000 Q ss_pred ccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 204 VTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLR 283 (524) Q Consensus 204 ~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLk 283 (524) | ++. ...+...|.++.|++.|..+- ..+|-||.+|- T Consensus 180 ---------------------------g-----~~~----~~~~~~~f~~i~~~~~k~~~~-------~~iS~ell~ds- 215 (404) T protein:vir:39 180 ---------------------------D-----GKI----PDLDNPRLTIIKYLIKRYAGI-------ITATNTLLKDT- 215 (404) T ss_pred ---------------------------c-----ccc----ccccccceeeEEeeeeeEEee-------ehhHHHHHhhc- Confidence 0 000 000122466777777776655 34999999984 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 284 AVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDK 363 (524) Q Consensus 284 AiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~ 363 (524) ..|.+++|.+-|+..|..-+|..||.-. | ......+..++++ ...++... T Consensus 216 ---~~~l~~~i~~~l~~~~~~~~d~~il~g~---------g---~~~~~~~~~~~~~-------------i~~~~~~~-- 265 (404) T protein:vir:39 216 ---AENILAWLSSWIAKKVVVTRNQAIIAAM---------G---TVPKKPTIAKFDD-------------VITMINTS-- 265 (404) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHHHhcc---------c---ccccccccccHHH-------------HHHHHHHh-- Confidence 2567999999999999999999998521 1 0111223333211 11111110 Q ss_pred HHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCC--C----cce Q lcl|Aclame:pro 364 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYA--R----QDY 437 (524) Q Consensus 364 ~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~----~dy 437 (524) + ...+.....+||+|.....|..+...-.. + ....+.+. ...++|.| ++|++-.+. + .++ T Consensus 266 ----~--~~~~~~~a~~v~n~~~~~~L~~lkd~~G~--~----l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~ 331 (404) T protein:vir:39 266 ----V--DPAIIATSSLLTNQSGLNKLALVKTAEGK--Y----LLEPDPTK-PNSYLIKG-KKVIVVADRWLPNSGSTVY 331 (404) T ss_pred ----h--hhhhccCCEEEEcHHHHHHHHHhhccCCc--e----eeccCcCC-CCcceecc-eeEEEecccccCccCCCcc Confidence 0 11122345789999999888864222100 0 00001111 11246777 577653221 1 111 Q ss_pred -EEEE-Eec----CCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCccc------ccCCCccccccccc Q lcl|Aclame:pro 438 -FTVG-FKG----DNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAN------SRSQAPADRITSGM 504 (524) Q Consensus 438 -~~vG-~KG----~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~------~~~~~~~~~i~~~~ 504 (524) +++| ++. ....+-.+=..+|+.. +-...+=.+-...||+..+ +|-+. ....+.+ T Consensus 332 ~~~~gd~~~~~~~~~~~~~~i~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~------- 398 (404) T protein:vir:39 332 PLYYGDMSQAITLFDRENMSLLPTNIGAG------AFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVG------- 398 (404) T ss_pred EEEEEeccccEEEEeecceEEEEeccchh------hhhhceeeEEEEeeeccEEecccceEEEEeeccccCCC------- Confidence 2222 110 0000011111122110 0112334455566776543 34110 0111111 Q ss_pred hHHhhccc Q lcl|Aclame:pro 505 ISKEMCGK 512 (524) Q Consensus 505 ~~~~~a~~ 512 (524) ..-+|| T Consensus 399 --~~~~~~ 404 (404) T protein:vir:39 399 --NFTAGK 404 (404) T ss_pred --CCCCCC Confidence 122455 No 78 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=68.82 E-value=0.23 Score=24.07 Aligned_cols=302 Identities=10% Similarity=0.021 Sum_probs=118.9 Q ss_pred hcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccccccccccccccCchhh-hHHHHH Q lcl|Aclame:pro 24 IATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRA 102 (524) Q Consensus 24 i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra 102 (524) |+...|.+.-.+-+.+- +-+....+ +.+...++..+.+ .-|.+. .+++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~----------------------~~~~~~~~-----a~~~~~~~~~~~l--ip~~~~~~ii~~~ 51 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASN----------------------NVKPQVFN-----PDNVMMHEKKDGT--LLNDFTTPILQEV 51 (324) T ss_pred CCcchhhhHHHHHHHHh----------------------hhhhhhcc-----cccccccCCCcce--echhHHHHHHHHH Confidence 32222222111111111 00000000 0010001111111 223333 455666 Q ss_pred HhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 103 IPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIV 182 (524) Q Consensus 103 ~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~ 182 (524) ..+.+..+++.+-||++++.-|. -.. +.. + +.|-+ T Consensus 52 ~~~s~l~~l~~~~~~~~~~~~~p-------~~~--~~~-----~---------a~~v~---------------------- 86 (324) T protein:vir:96 52 MENSKIMQLGKYEPMEGTEKKFT-------FWA--DKP-----G---------AYWVG---------------------- 86 (324) T ss_pred HhhchhhhhcceeeccCCceEEE-------EEe--cCc-----c---------eeeec---------------------- Confidence 67788889999999987653221 110 000 0 00000 Q ss_pred ccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEE Q lcl|Aclame:pro 183 YHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQV 262 (524) Q Consensus 183 ~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~t 262 (524) + ++.. ..+...|.+..+.+.|.. T Consensus 87 -----------------------------------------------E------g~~~----~~~~~~f~~v~~~~~k~~ 109 (324) T protein:vir:96 87 -----------------------------------------------E------GQKI----ETSKATWVNATMRAFKLG 109 (324) T ss_pred -----------------------------------------------C------Cccc----cccccceeEEEEEeEEEE Confidence 0 0000 001122555555555544 Q ss_pred EEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccc Q lcl|Aclame:pro 263 IEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPV 342 (524) Q Consensus 263 VtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~ 342 (524) + ....|-||.+|-. .|.+++|.+.|...|...+++.||.--.. ...+.|++...... T Consensus 110 ~-------~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g~------------~~~~~~~~~~~~~~ 166 (324) T protein:vir:96 110 V-------ILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN------------NPFGKSIAQSIKKT 166 (324) T ss_pred E-------eehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCC------------CCcCcccccccccc Confidence 4 4558999999853 56799999999999999999998852100 01112332221111 Q ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEec Q lcl|Aclame:pro 343 DIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLG 422 (524) Q Consensus 343 ~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~ 422 (524) ..... ....+..|.++.+.|.. .+...+.+||+|.....|..+... .+. ....+.. .++|. T Consensus 167 ~~~~~------~~~~~~~i~~~~~~i~~--~~~~~~~~i~n~~~~~~L~~lkd~----~G~---~~~~~~~----~~~l~ 227 (324) T protein:vir:96 167 NKVIK------GDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDP----ETK---ERIYDRN----SDSLD 227 (324) T ss_pred ceecc------cccchHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhCC----CCC---eeecCCC----CCccc Confidence 00000 00112223344444433 334667899999998888754211 111 0011111 24566 Q ss_pred CcEEEEecCCCC--cceEEE--------EEecCCCccceeEeecccccccccccCCcc-----c---cceeeeeeeecc- Q lcl|Aclame:pro 423 GTYKVYIDQYAR--QDYFTV--------GFKGDNEMDAGIYYAPYVALTPLRGSDPKN-----F---QPVMGFKTRYGI- 483 (524) Q Consensus 423 ~~~~vy~D~y~~--~dy~~v--------G~KG~~~~~~~~fyaPYv~~~~~~~~dp~s-----~---qP~~~~~tRY~l- 483 (524) | ++|++++..+ ...+++ |..+.-..+. ..+ .......|+.. | +=.+=+.-||++ T Consensus 228 G-~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~----~~~--~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~ 300 (324) T protein:vir:96 228 G-LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DET--AQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) T ss_pred c-eeeEeecCCCCCcceEEEEecceEEEEEecCcEEEE----eec--ccccccccccccchhhhhcCcEEEEEEEEeccE Confidence 6 6888866542 222333 3332211100 000 00000011110 1 122334456665 Q ss_pred EecCcccccCCCccccccccchHH--hhccch Q lcl|Aclame:pro 484 GINPFANSRSQAPADRITSGMISK--EMCGKN 513 (524) Q Consensus 484 ~~nP~~~~~~~~~~~~i~~~~~~~--~~a~~~ 513 (524) ..+|=+ ..++. +-++. ...|+- T Consensus 301 v~~~~a-------~~~l~-~a~~~~~~~~~~~ 324 (324) T protein:vir:96 301 IADDKA-------FAKLV-PADKRTDSVPGEV 324 (324) T ss_pred Eecccc-------eEEEe-cccccCCCCCCCC Confidence 333410 00111 00000 011111 No 79 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=66.01 E-value=0.27 Score=23.66 Aligned_cols=272 Identities=11% Similarity=0.017 Sum_probs=112.7 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -++.. +.-..-.-.|-+.++-.. .+- .....++.......-+. T Consensus 1 ma~~~-T~~~d~iiPev~~~~v~~--~~~----------------------------------~~l~~~~~~~~d~~l~g 43 (274) T protein:vir:94 1 MPQGL-TKTSDQIIPEVLAPMMQA--QLE----------------------------------KKLRFASFAEVDSTLQG 43 (274) T ss_pred CCccc-eehhheechHHHHHHHHH--hhh----------------------------------hhhhhcccceecccccC Confidence 00000 000000111211111000 000 00000000000000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCh Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 290 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDA 290 (524) ..|...++..-=.+..+|.. .....-+..++. ..+.+++.+-|+-.=+++=| ..+.+ +-|. T Consensus 44 ----------~~G~tv~iP~~~~~g~a~~~---~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp 104 (274) T protein:vir:94 44 ----------QPGDTLTFPAFVYSGDAQVV---AEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDP 104 (274) T ss_pred ----------CCCCEEEEeeecCCCccccc---cCCCcccccccc--cceeEEEeeeecceecccHH--HHHhc--cchH Confidence 00111111000001111211 011112233443 33344444555522222222 23333 4578 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 291 DAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIAR 370 (524) Q Consensus 291 EaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~ 370 (524) -.|..+-++..|..+++.+++..+...+.. +. +..+ ..+-+-.+..++.++. T Consensus 105 ~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~-~~---------~~~~-------------~~d~i~dA~~~l~d~~----- 156 (274) T protein:vir:94 105 QGEQVRQHGLAHANKVDNDVLEALMGAKLT-VN---------ADIT-------------KLNGLQSAIDKFNDED----- 156 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cc---------cccc-------------CHHHHHHHHHHhhccC----- Confidence 888999999999999999999876543321 10 0111 1233444444444432 Q ss_pred hccccCCCEEEEchhhhhhhhhhc-ccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCcc Q lcl|Aclame:pro 371 QTGRGAGNFIIASRNVVSALARID-SGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMD 449 (524) Q Consensus 371 ~T~~g~gn~~v~S~~va~~L~~~~-~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 449 (524) ..+++++|+|.+++.|..-. -.+.+++...+ ....+...|.+.| ++||+|+..|..-..+--+| T Consensus 157 ----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g----- 221 (274) T protein:vir:94 157 ----LEPMVLFVNPLDAGKLRGDASTNFTRATELGD-----DIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG----- 221 (274) T ss_pred ----CCceEEEeCHHHHHHHHhhhhhhccccCcccc-----cceeccccceecC-eeEEEcCCCCcceEEEEeCc----- Confidence 25689999999999987410 12333333111 1112234678876 79999999885432222122 Q ss_pred ceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhh Q lcl|Aclame:pro 450 AGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAY 515 (524) Q Consensus 450 ~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~ 515 (524) .+-|.---+.......|+..+.=.+-..-+||+.+ || ..-.+++.+. ++-.| T Consensus 222 -A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-------~~vv~~t~~~------~~~~~ 274 (274) T protein:vir:94 222 -AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDE-------SKAVKITKGS------GSLEM 274 (274) T ss_pred -ceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcC-------CceEEEecCc------ccccC Confidence 22221111222222469999999999999999754 33 1111222111 11112 No 80 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=66.01 E-value=0.27 Score=23.66 Aligned_cols=272 Identities=11% Similarity=0.017 Sum_probs=112.7 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -++.. +.-..-.-.|-+.++-.. .+- .....++.......-+. T Consensus 1 ma~~~-T~~~d~iiPev~~~~v~~--~~~----------------------------------~~l~~~~~~~~d~~l~g 43 (274) T protein:vir:97 1 MPQGL-TKTSDQIIPEVLAPMMQA--QLE----------------------------------KKLRFASFAEVDSTLQG 43 (274) T ss_pred CCccc-eehhheechHHHHHHHHH--hhh----------------------------------hhhhhcccceecccccC Confidence 00000 000000111211111000 000 00000000000000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCh Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 290 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDA 290 (524) ..|...++..-=.+..+|.. .....-+..++. ..+.+++.+-|+-.=+++=| ..+.+ +-|. T Consensus 44 ----------~~G~tv~iP~~~~~g~a~~~---~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp 104 (274) T protein:vir:97 44 ----------QPGDTLTFPAFVYSGDAQVV---AEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDP 104 (274) T ss_pred ----------CCCCEEEEeeecCCCccccc---cCCCcccccccc--cceeEEEeeeecceecccHH--HHHhc--cchH Confidence 00111111000001111211 011112233443 33344444555522222222 23333 4578 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 291 DAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIAR 370 (524) Q Consensus 291 EaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~ 370 (524) -.|..+-++..|..+++.+++..+...+.. +. +..+ ..+-+-.+..++.++. T Consensus 105 ~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~-~~---------~~~~-------------~~d~i~dA~~~l~d~~----- 156 (274) T protein:vir:97 105 QGEQVRQHGLAHANKVDNDVLEALMGAKLT-VN---------ADIT-------------KLNGLQSAIDKFNDED----- 156 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cc---------cccc-------------CHHHHHHHHHHhhccC----- Confidence 888999999999999999999876543321 10 0111 1233444444444432 Q ss_pred hccccCCCEEEEchhhhhhhhhhc-ccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCcc Q lcl|Aclame:pro 371 QTGRGAGNFIIASRNVVSALARID-SGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMD 449 (524) Q Consensus 371 ~T~~g~gn~~v~S~~va~~L~~~~-~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 449 (524) ..+++++|+|.+++.|..-. -.+.+++...+ ....+...|.+.| ++||+|+..|..-..+--+| T Consensus 157 ----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g----- 221 (274) T protein:vir:97 157 ----LEPMVLFVNPLDAGKLRGDASTNFTRATELGD-----DIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG----- 221 (274) T ss_pred ----CCceEEEeCHHHHHHHHhhhhhhccccCcccc-----cceeccccceecC-eeEEEcCCCCcceEEEEeCc----- Confidence 25689999999999987410 12333333111 1112234678876 79999999885432222122 Q ss_pred ceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchhh Q lcl|Aclame:pro 450 AGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAY 515 (524) Q Consensus 450 ~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~ 515 (524) .+-|.---+.......|+..+.=.+-..-+||+.+ || ..-.+++.+. ++-.| T Consensus 222 -A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-------~~vv~~t~~~------~~~~~ 274 (274) T protein:vir:97 222 -AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDE-------SKAVKITKGS------GSLEM 274 (274) T ss_pred -ceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcC-------CceEEEecCc------ccccC Confidence 22221111222222469999999999999999754 33 1111222111 11112 No 81 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=64.90 E-value=0.29 Score=23.51 Aligned_cols=347 Identities=12% Similarity=0.117 Sum_probs=120.2 Q ss_pred CCc------------hHHHHHHhhHh-------------hccccc------chhhc-----chhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSK------------KNELMEKWNDL-------------LESQEG------LPDIA-----TKSKKQLVAAILEAQEKDA 44 (524) Q Consensus 1 m~~------------~~~l~~kw~p~-------------l~~~~~------~~~i~-----~~~~~~~~~~l~enq~~~~ 44 (524) ++. -++|.++..-+ .+..+. .+... ...|..-..+.+ +.+ T Consensus 30 lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 105 (435) T protein:vir:14 30 LSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMV----RAL 105 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHH----HHH Confidence 111 11222221111 000000 00000 000000001110 000 Q ss_pred hccccccchhhhhhhcccccccccccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhhe-eeeecCCchh Q lcl|Aclame:pro 45 ETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDI-CGVQPMTGPT 121 (524) Q Consensus 45 ~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI-~GVQPmTgPT 121 (524) ...........-......+.|... ...+ ..+...|.+. =|.-+ .+++++.++.+..++ +-+.||+... T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~------~~~~-~~t~~~gg~~--vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~ 176 (435) T protein:vir:14 106 AAARGDAQLASKLAIERGFGEEVA------MSLN-TLSPGAGGVL--VPENLSSEVIELLRPKSVVRKLGARTLPLSNGN 176 (435) T ss_pred HhhcchhhHHHHHHHhhhhhhhhh------hhcc-cCCcCCCccc--cchhHHHHHHHHHhhhchhhhhcceeeecCCCc Confidence 000000000000000000000000 0000 0001111110 02111 233333344444443 2233332110 Q ss_pred hhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccC Q lcl|Aclame:pro 122 GQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGN 201 (524) Q Consensus 122 GLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~ 201 (524) + +|+..+ ++. . T Consensus 177 -~------~~p~~~--~~~--------------~---------------------------------------------- 187 (435) T protein:vir:14 177 -I------TIPRLK--GGA--------------I---------------------------------------------- 187 (435) T ss_pred -e------EEEEEe--CCc--------------c---------------------------------------------- Confidence 0 000000 000 0 Q ss_pred CcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 202 VTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQD 281 (524) Q Consensus 202 ~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQD 281 (524) .+- .+| +..+++..-++++++..++..+-....|-||.+| T Consensus 188 -----------------------a~~--------v~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d 227 (435) T protein:vir:14 188 -----------------------VGY--------IGA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKY 227 (435) T ss_pred -----------------------eee--------ecc---------CccccccccceeEEEeeeEEEEEeehhhHHHHHh Confidence 000 011 0123333445666666777777778899999999 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 282 LRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQI 361 (524) Q Consensus 282 LkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i 361 (524) -. -..+.|+.|.+.|+..|...+|+-|+. |. | +...+.|++....+..+...-.. .-+..+...+ T Consensus 228 s~--~~~~l~~~i~~~l~~ai~~~~d~a~l~--------G~-G---~~~~p~Gi~~~~~~~~~~~~~~~-~~~~~~~~~~ 292 (435) T protein:vir:14 228 AG--VNPNVDQIVVGDLTAAIGAREDKAFIR--------DD-G---TANTPKGLRFWALPSNVITASDA-STLQKIETDL 292 (435) T ss_pred hc--cCHHHHHHHHHHHHHHHHHHHHHHhhc--------cC-C---CCccccceeecccccceeccccc-cchhhHHHHH Confidence 32 133478889999999999999988874 11 1 00123455443222111111000 0111222223 Q ss_pred HHHHHHHHH-hccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc---- Q lcl|Aclame:pro 362 DKEANEIAR-QTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD---- 436 (524) Q Consensus 362 ~~~a~~I~~-~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---- 436 (524) .++-..+.. ...+ .....|++|.....|..+..+-.. ....+.+ .|+|.| ++||++++.|.+ T Consensus 293 ~~l~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~-------~l~~~~~----~g~l~G-~Pv~~~~~~p~~~~~~ 359 (435) T protein:vir:14 293 GKVILALENADANL-TQPGWIMAPRTFRFLEGLRDGNGN-------KVYPELA----NGMLKG-YPVGKTTQVPINLGET 359 (435) T ss_pred HHHHHHhhhccccc-cCCEEEEcHHHHHHHHHhhccCCc-------eeccCCC----CCeeec-ceeEeeccccccccCC Confidence 333322222 1233 235679999999998764322211 1111221 367877 699998775432 Q ss_pred ----eEEE--------EEecCCCccceeEeecccccccccccCCccc---cceeeeeeeeccEecCcccccCCCcccccc Q lcl|Aclame:pro 437 ----YFTV--------GFKGDNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGINPFANSRSQAPADRIT 501 (524) Q Consensus 437 ----y~~v--------G~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~---qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~ 501 (524) -+++ |..+.-. +-..||..........-..| |=.+=+..|++..+ -+.++-.++ T Consensus 360 ~~~~~i~~gd~s~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~-------~~~~a~~~l 428 (435) T protein:vir:14 360 GKESEIYFTDFGDVFIGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGP-------RHVESIAVL 428 (435) T ss_pred CccceEEEeecccEEEEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCcee-------ecccceEEE Confidence 1222 3222222 22333321111100000001 12333455666432 111222344 Q ss_pred ccchHHh Q lcl|Aclame:pro 502 SGMISKE 508 (524) Q Consensus 502 ~~~~~~~ 508 (524) .|-+|.. T Consensus 429 ~~~~~~~ 435 (435) T protein:vir:14 429 AGVAWGA 435 (435) T ss_pred ecCCCCC Confidence 5555554 No 82 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=62.25 E-value=0.34 Score=23.16 Aligned_cols=303 Identities=10% Similarity=0.043 Sum_probs=124.2 Q ss_pred HhccccccchhhhhhhcccccccccccccccCcccccccccc-ccccccCchhhhHHHHHHhhhhhhheeeeecCCchhh Q lcl|Aclame:pro 44 AETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSS-GAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTG 122 (524) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~s-g~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTG 122 (524) +.-++-=..+++ ...|...+. .++++ |.+ --.+.+=.+++.+.+..+-..+|-+.||++++. T Consensus 1 ~~~~~~r~~~~~------~~~e~~a~~----------~~~~~~g~~-ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~ 63 (326) T protein:vir:42 1 MAVNPDRTTPFL------GVNDPKVAQ----------TGDSMFEGY-LEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQ 63 (326) T ss_pred CCCCccchhhhc------Ccchhhhee----------ccccCCcce-echhhHHHHHHHHHhcchhhhhcceeeccCCce Confidence 111110001110 011111100 00111 111 111121145555555666777888888876542 Q ss_pred hheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|Aclame:pro 123 QVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNV 202 (524) Q Consensus 123 LIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~ 202 (524) - |+-.. ++.+ ..| T Consensus 64 ~-------~p~~~--~~~~--------------a~~-------------------------------------------- 76 (326) T protein:vir:42 64 K-------IPHWT--GDVS--------------ASW-------------------------------------------- 76 (326) T ss_pred E-------EEEEe--CCcc--------------eEE-------------------------------------------- Confidence 1 11100 0000 000 Q ss_pred cccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHH Q lcl|Aclame:pro 203 TVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDL 282 (524) Q Consensus 203 ~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDL 282 (524) .+| +..++|-..+++++++.+|...-.-.+|-||.+|- T Consensus 77 ---------------------------------v~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s 114 (326) T protein:vir:42 77 ---------------------------------IGE---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN 114 (326) T ss_pred ---------------------------------ecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcC Confidence 001 11234444566777777887777888999999984 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccc----cccchHHHHHHHHH Q lcl|Aclame:pro 283 RAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI----RGARWAGESYKALL 358 (524) Q Consensus 283 kAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~----~~~~~~~e~~r~L~ 358 (524) ..|.++.|.+-|+..|...+++.++. |.. .+.+.|+......... ..+-+.......+. T Consensus 115 ----~~~~~~~i~~~l~~a~~~~~d~a~l~--------G~g-----s~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (326) T protein:vir:42 115 ----PANYLGTMRTKVATAFAMAFDNAAIN--------GTD-----SPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAV 177 (326) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHhhc--------ccC-----CCccccccccccccceeecccccccccchhHHHH Confidence 36789999999999999999999984 100 0001111111000000 00000000111111 Q ss_pred HHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE Q lcl|Aclame:pro 359 IQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF 438 (524) Q Consensus 359 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 438 (524) +..+.... ...+...+..|++|.....|......-. .+-.+...-.........|+|.| ++|+++++.+.+=. T Consensus 178 --~~~~~~~~--~~~~~~~a~~v~n~~~~~~L~~lkd~~G--~~l~~~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~ 250 (326) T protein:vir:42 178 --AVNALSLL--VNAGKKWTHTLLDDITEPILNGAKDKSG--RPLFIESTYTEENSPFRLGRIVA-RPTILSDHVASGTV 250 (326) T ss_pred --HHHHHhhh--hhhccCccEEEEeHHHHHHHHHhhccCC--ceeeccccccCccccccCceeee-eeEEEcCCCCCCce Confidence 11111111 1223356788999999999875322110 00000000000111122356777 79999998765422 Q ss_pred EEEEecCCCccceeEeecccccccccc---------cCCcc-----cc---ceeeeeeeeccEe-cCcccccCCCccccc Q lcl|Aclame:pro 439 TVGFKGDNEMDAGIYYAPYVALTPLRG---------SDPKN-----FQ---PVMGFKTRYGIGI-NPFANSRSQAPADRI 500 (524) Q Consensus 439 ~vG~KG~~~~~~~~fyaPYv~~~~~~~---------~dp~s-----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i 500 (524) +++-|+-. -+||...-.+ .++. .|+.. || =.+=...|++..+ +| ...+++ T Consensus 251 -~~~~Gd~s---~~~~~~~~~~-~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~-------~a~~~l 318 (326) T protein:vir:42 251 -VGYQGDFR---QLVWGQVGGL-SFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDK-------DAFVKL 318 (326) T ss_pred -EEEEeecc---eEEEEEecce-EEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc-------cceEEE Confidence 22223211 1223222111 1111 11111 22 3334566777543 22 111122 Q ss_pred cccchHHhhccch Q lcl|Aclame:pro 501 TSGMISKEMCGKN 513 (524) Q Consensus 501 ~~~~~~~~~a~~~ 513 (524) .. ..++++ T Consensus 319 ~~-----~~~~~~ 326 (326) T protein:vir:42 319 TN-----VDATEA 326 (326) T ss_pred ee-----ccccCC Confidence 21 122223 No 83 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=61.76 E-value=0.35 Score=23.10 Aligned_cols=340 Identities=12% Similarity=0.064 Sum_probs=120.6 Q ss_pred CCchHHHHHH--------------hhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 1 MSKKNELMEK--------------WNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVY---------------- 50 (524) Q Consensus 1 m~~~~~l~~k--------------w~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~---------------- 50 (524) .-+-++|++| =..+++..+ ..++.... .=+.. |+++.+.+++...- T Consensus 4 ~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~-~~e~~~~~--~e~~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (421) T protein:vir:13 4 FERLKELRAKKKELEEKRCGIVEEIRSLAKEKK-EEEARSKA--LEREK-IEARMEIIEEEIESVMTAIDEERKNTNFTG 79 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-hHHHHHHH--HHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 1111112222 222222111 11111110 00111 11111111111000 Q ss_pred ------cc---hh---hhhhhcccccccccccccccCccccccccccccc---cccCchhhhHHHHHHhhhhhhheeeee Q lcl|Aclame:pro 51 ------RD---EK---IVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAI---TNIGPAVIGMVRRAIPNLIAFDICGVQ 115 (524) Q Consensus 51 ------~~---~~---~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v---~~~~P~li~l~Rra~~nLIa~DI~GVQ 115 (524) .+ ++ ....|..++- |..........-+++.|.+ ..+.+.++.+. .+...-.++|-+. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~---~~~~~l~~l~~~~ 151 (421) T protein:vir:13 80 GRVIINGDSKEEKRSLQLSAMSKTIR-----GIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLK---EGYPSLKEHCHVI 151 (421) T ss_pred cccccccchhHHHHHHHHHHHHHhhh-----ccchhHHHhhccccCCcceecchhhHHHHHHHH---Hhhhhhhhhceee Confidence 00 00 0001111110 0000000000111111211 11222333344 4455667888899 Q ss_pred cCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 116 PMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQ 195 (524) Q Consensus 116 PmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~ 195 (524) ||+++++-+- +.... ..+ + +. T Consensus 152 ~~~~~~~~~~-----~~~~~--~~~-----~-----------~~------------------------------------ 172 (421) T protein:vir:13 152 PVNRNAGKMP-----VRAGA--SVD-----K-----------LA------------------------------------ 172 (421) T ss_pred eccCCceEEE-----EeecC--Ccc-----c-----------ee------------------------------------ Confidence 9887664221 11110 000 0 00 Q ss_pred cccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeeccccccccc Q lcl|Aclame:pro 196 NVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYS 275 (524) Q Consensus 196 ~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT 275 (524) . .+| +...++-..++++++...+.-+-...+| T Consensus 173 -------------------------------~--------~~E---------~~~~~~s~~~f~~i~~~~~k~~~~v~iS 204 (421) T protein:vir:13 173 -------------------------------N--------LAK---------DTELVKAMLKTQPMAYDIDDYGLLAPID 204 (421) T ss_pred -------------------------------e--------ccc---------cccccccccceeEEEeeeeeeEeehhhh Confidence 0 000 0012222233445555555555557799 Q ss_pred HHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHH Q lcl|Aclame:pro 276 VELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYK 355 (524) Q Consensus 276 ~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r 355 (524) -||.+|-- .|.++.|.+-|+..+..-+|..|+..+ +|+.+. +++.++ +..+ T Consensus 205 ~ell~ds~----~~l~~~i~~~la~~~~~~~~~~i~~~~--------~g~~~~----~~~~~~-------------d~i~ 255 (421) T protein:vir:13 205 NSLLEDSE----INFLEFVNEEFAEFAVNTENAEIVKQA--------KAVLAE----ETINDY-------------AGLV 255 (421) T ss_pred HHHHhhhH----HHHHHHHHHHHHHHHHHHhhhhHhhhh--------hhcccc----ccccch-------------HHHH Confidence 99999853 456888888888888888888887532 122111 122111 2344 Q ss_pred HHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCc Q lcl|Aclame:pro 356 ALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ 435 (524) Q Consensus 356 ~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 435 (524) .++..+.. .+.....+|++|.....|.....+ .+. -...+... .--++|.| ++|++..+.+. T Consensus 256 ~~~~~l~~---------~~~~~a~~v~n~~~~~~l~~lkd~----~G~---~i~~~~~~-~~~~tl~G-~pV~~~~~~~~ 317 (421) T protein:vir:13 256 KTINSLVP---------NARKRAIIVTNSDGRAYLDGLMDK----QGR---PLLKELSD-GGDLVFKG-RPVIELEESIF 317 (421) T ss_pred HHHHHhhh---------hhcCCCEEEEcHHHHHHHHHhhcC----CCc---eeecCcCC-CCCceecc-eeeEEeccccc Confidence 45544432 223556789999998888753222 111 01111110 01246777 58887766542 Q ss_pred c--------------eEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-----------cCcc- Q lcl|Aclame:pro 436 D--------------YFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-----------NPFA- 489 (524) Q Consensus 436 d--------------y~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-----------nP~~- 489 (524) . |+.++.++.-..+.+- + .+-..-+=.+-+..||++++ .++. T Consensus 318 ~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~----~--------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a 385 (421) T protein:vir:13 318 DVGDETKFIVSDFKTLIKFMDRKQYLIDQSK----E--------AGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGV 385 (421) T ss_pred cCCCceEEEEEeccccEEEEEecceEEEeec----c--------cccccCeeEEEEEeeecceeecchhhheeeecccce Confidence 1 2222222221111100 0 01112222344455554332 1111 Q ss_pred -cccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 490 -NSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 490 -~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) ...++.+.+-...+..-+ +||-+ +.|.+= T Consensus 386 ~v~~~~~~~~~~~~~~~~~--~~~~~----~~~~~~ 415 (421) T protein:vir:13 386 IVKLQEVLKSSPRSGKNKN--ESKEE----IKEEGE 415 (421) T ss_pred eeccccccCCCCcCCCCcc--ccchh----eeeccc Confidence 111111221111111100 12211 111111 No 84 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=59.89 E-value=0.38 Score=22.87 Aligned_cols=288 Identities=11% Similarity=0.081 Sum_probs=120.1 Q ss_pred cccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccc Q lcl|Aclame:pro 79 IASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTM 157 (524) Q Consensus 79 ~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~ 157 (524) -+ .+++|.+. .-+.+. .+++++-++-+-.++|-|-||++.. .+|+-.+ ++. .+. T Consensus 1 ma-t~~~gg~l-vP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-------~~~p~~~--~~~--------------~a~ 55 (311) T protein:vir:81 1 MV-ALATGTFQ-LPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-------QQYMTLT--APP--------------RGE 55 (311) T ss_pred Cc-eecCCceE-cchhHHHHHHHHHHhcchhhhhcceeecCCCc-------eEEEEEe--CCc--------------eeE Confidence 01 12222221 112222 4666666788888999999886532 1221110 000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhh Q lcl|Aclame:pro 158 YSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVA 237 (524) Q Consensus 158 fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~a 237 (524) | .+ T Consensus 56 w-----------------------------------------------------------------------------v~ 58 (311) T protein:vir:81 56 V-----------------------------------------------------------------------------VG 58 (311) T ss_pred E-----------------------------------------------------------------------------ee Confidence 0 00 Q ss_pred hhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Q lcl|Aclame:pro 238 ELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYT 317 (524) Q Consensus 238 Eal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~ 317 (524) | +..+++...++++++..+|.-+-....|-||.|+-.. -.++-+++|.+-|+..|...|+.-++.=.... T Consensus 59 E---------g~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~ 128 (311) T protein:vir:81 59 E---------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPL 128 (311) T ss_pred c---------CcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 1 0112333334455555555555556789999875332 13456888888888888888888887521100 Q ss_pred eeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccc Q lcl|Aclame:pro 318 AQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 397 (524) Q Consensus 318 a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~ 397 (524) .-....++.+............. .. ...++.-|.++-..+.. .+...+-+|++|.....|.....+ T Consensus 129 ~~~~~~gi~~~~~~~~~~~~~~~------~~-----~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lkd~- 194 (311) T protein:vir:81 129 TGAALSGSPAKILDTTNIVELTT------GT-----SATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQRDS- 194 (311) T ss_pred CCcccccccccccccceeeeecc------cc-----cchHHHHHHHHHHHhhh--cCCCceEEEEcHHHHHHHHhhhcc- Confidence 00000111110000001111110 00 01122234444444432 234667789999998888653211 Q ss_pred cccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE------EEEEecCCCc-----c-ceeEeecccccccc-- Q lcl|Aclame:pro 398 TPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF------TVGFKGDNEM-----D-AGIYYAPYVALTPL-- 463 (524) Q Consensus 398 ~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~------~vG~KG~~~~-----~-~~~fyaPYv~~~~~-- 463 (524) .+. ..-.+.......|+|.| ++|+++.+.+..-. .+...+.... | +.+++...-+.... T Consensus 195 ---~G~---~l~~~~~~~~~~~tl~G-~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~ 267 (311) T protein:vir:81 195 ---QGR---KLYPELGFGTDVASFAG-LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELI 267 (311) T ss_pred ---CCC---eeecCccccCCCceecc-eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEe Confidence 110 00011111112467887 69998876543211 1111111110 1 12233322222221 Q ss_pred cccCCcc----ccc-eeee--eeeeccE-ecCcccccCCCccccccccchH Q lcl|Aclame:pro 464 RGSDPKN----FQP-VMGF--KTRYGIG-INPFANSRSQAPADRITSGMIS 506 (524) Q Consensus 464 ~~~dp~s----~qP-~~~~--~tRY~l~-~nP~~~~~~~~~~~~i~~~~~~ 506 (524) +-.|+.. ||- .++| ..|++.. .+|=+ ..++.+.... T Consensus 268 ~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a-------~~~l~~a~~~ 311 (311) T protein:vir:81 268 EFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA-------FAVVRDADES 311 (311) T ss_pred ccCCCCcchhhhhcCcEEEEEEEEeccEeecccc-------eEEEEeeccC Confidence 1123221 222 1333 4678744 55511 1122222221 No 85 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=55.68 E-value=0.47 Score=22.36 Aligned_cols=348 Identities=15% Similarity=0.136 Sum_probs=133.8 Q ss_pred CCchHHHHHHhhHhhcccc------------cchhhcch---h-HHHHHHHHHHHHHHHHhcccc--------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQE------------GLPDIATK---S-KKQLVAAILEAQEKDAETDPV--------------- 49 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~------------~~~~i~~~---~-~~~~~~~l~enq~~~~~~~~~--------------- 49 (524) |++-++|.++=...++.-. ...++... . +..--..-|+.|.+++..... T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 8886666555433322100 01111111 0 000011123333333322210 Q ss_pred ccchh------hhhhhcccccccccccccccCccccccc--cccccccccCchhhhHHHHHHhhhhhhheeeeecCCchh Q lcl|Aclame:pro 50 YRDEK------IVESFGGFLAEAEIAGDHNYDQTNIASG--KSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPT 121 (524) Q Consensus 50 ~~~~~------~~~~~~~~l~ea~~~g~~~~~~~~~~~s--t~sg~v~~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPT 121 (524) ..+.+ --.++..+|-. .+....+.+.+ ++.|.+.--.+..-.++++..+..+-.++|.+.||++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~------~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 154 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHS------HGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPK 154 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhc------cchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCc Confidence 00000 00112222111 11111111111 111222111111224666666777788999999998876 Q ss_pred hhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccC Q lcl|Aclame:pro 122 GQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGN 201 (524) Q Consensus 122 GLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~ 201 (524) +-+--.+ .. .+. ..|-+. T Consensus 155 ~~~~~~~-----~~--~~~---------------~~~~~E---------------------------------------- 172 (394) T protein:vir:10 155 GTYPILK-----RA--TDR---------------FSSVAE---------------------------------------- 172 (394) T ss_pred eEEEEEe-----cC--CCc---------------cccccc---------------------------------------- Confidence 4433222 10 000 000000 Q ss_pred CcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 202 VTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQD 281 (524) Q Consensus 202 ~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQD 281 (524) .+..| ..+...|.+..|.+.|. +-...+|-||.+| T Consensus 173 ---~~~~~-----------------------------------~~~~~~~~~v~l~~~k~-------~~~~~iS~ell~d 207 (394) T protein:vir:10 173 ---LAENP-----------------------------------ALAEPEFEQVDWSVSTY-------RGAIPLSEEAIAD 207 (394) T ss_pred ---ccccc-----------------------------------ccccccceeEEeeeeee-------EeeehhHHHHHhh Confidence 00000 00112345555555544 4456799999998 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 282 LRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQI 361 (524) Q Consensus 282 LkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i 361 (524) - ..|.+++|.+-|+..|..-+|+.|+.-..... ..++.... ..+....++... T Consensus 208 s----~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~-------------~~~~~~~~----------~~d~l~~~~~~~ 260 (394) T protein:vir:10 208 S----AVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT-------------AKATTTDT----------LVDSLKHILNVD 260 (394) T ss_pred h----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------cccccccc----------cHHHHHHHHHhh Confidence 4 25679999999999999999999986432110 01111100 111222222211 Q ss_pred HHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecC--CCCc---c Q lcl|Aclame:pro 362 DKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQ--YARQ---D 436 (524) Q Consensus 362 ~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~~---d 436 (524) ... .+ . ..+|++|.....|..+..+-..+ -.+.. ..+.+.....++|.| ++||+.. +.+. + T Consensus 261 ~~~--------~~-~-a~~vmn~~~~~~l~~lkd~~G~~--i~~~~-~~~~~~~~~~~~L~G-~PV~~~~~~~~~~~~~~ 326 (394) T protein:vir:10 261 LDP--------AY-S-RALVVTQSLFNTLDTLKDKNGRY--LLHDA-SDSITDGTAKGTVLG-VPVYVVGDALLGSAAGD 326 (394) T ss_pred hhh--------hc-c-CEEEecHHHHHHHHHhhccCCCe--eeecc-ccccccCCccccccc-ceeEEecccccCCCCCc Confidence 111 11 2 35789999988887543221100 00000 011112223357887 5877632 2221 1 Q ss_pred e-EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccchh Q lcl|Aclame:pro 437 Y-FTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNA 514 (524) Q Consensus 437 y-~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~ 514 (524) . +++|---. .+....- ....+...|...|.-.+-...|++..+ ||-+. +++...+. ..|.-. T Consensus 327 ~~i~~gd~s~-----~~~~~~~-~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai--------~~~~~~~~--~~~~~~ 390 (394) T protein:vir:10 327 QKAFVGDLKR-----GVLFADR-QQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAG--------YFVTNTDA--ASGSTS 390 (394) T ss_pred eEEEEeeccc-----cEEEEee-cceEEEEecccccceeEEEEEEeccEEeccccE--------EEEEeecc--cCCCCC Confidence 1 22221000 0000000 111122234455555666677887543 33111 01000000 001111 Q ss_pred hhhh Q lcl|Aclame:pro 515 YFRK 518 (524) Q Consensus 515 ~~~~ 518 (524) +--+ T Consensus 391 ~~~~ 394 (394) T protein:vir:10 391 GTGK 394 (394) T ss_pred CCCC Confidence 1111 No 86 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=54.54 E-value=0.5 Score=22.23 Aligned_cols=204 Identities=15% Similarity=0.165 Sum_probs=104.7 Q ss_pred EEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeec Q lcl|Aclame:pro 258 IDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFD 337 (524) Q Consensus 258 IEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fd 337 (524) ||= =|-|..=++-.-+-++ | .|--.|.+.-+..+++.++++-|++.+...|+-.. .++..++ ..+ T Consensus 1 iD~--------lL~a~~~VdDiD~aqa-~-~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~-p~~~~~~----g~~ 65 (221) T protein:vir:17 1 MDD--------LLVASQFVYDLDEILA-Q-WNTRSEISKQIGEALAIHYDERIARVLASASIAAA-PVTGQDG----GFS 65 (221) T ss_pred CCc--------chhHHHHHHhHHHHHh-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcC-ccccccc----Ccc Confidence 221 1223333444444455 4 78889999999999999999999988776665322 2222222 111 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchh-hhhhhhhhcccccccchhhhcccccccccce Q lcl|Aclame:pro 338 FQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRN-VVSALARIDSGITPASQGLQKTLNVDTTKAV 416 (524) Q Consensus 338 l~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~-va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~ 416 (524) .... .+..-. ...|+..|-+.+...-.+----.|-|+|++|+ ...+|+.-+..+... .+.......+. .. T Consensus 66 ~~~~---a~~t~~---~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~-d~~~s~g~~~~--g~ 136 (221) T protein:vir:17 66 VNIG---AGNTNN---AQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNR-EIGNTQGDMNT--GK 136 (221) T ss_pred eecc---ccccCC---HHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeee-ecccccccccc--cc Confidence 1100 000001 12333333333333333333346789999996 566665333444322 11111111111 11 Q ss_pred eEEEecCcEEEEecCCCCc----ceEE------------EEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|Aclame:pro 417 FAGVLGGTYKVYIDQYARQ----DYFT------------VGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) Q Consensus 417 ~~G~l~~~~~vy~D~y~~~----dy~~------------vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tR 480 (524) .+|.+.| ++||.=++.|+ +|.. =.|.|+-.-..||||.|=. +--++.+.|-|--|.+.-| T Consensus 137 ~i~~v~G-~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~A-vgtvkl~~~~~~~~~~~~~-- 212 (221) T protein:vir:17 137 GLYVNAG-IRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEA-ADTVEVLLPPSRPPLVISM-- 212 (221) T ss_pred eeeeecC-cEEEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchh-eeeeeeecCCCCCceeeee-- Confidence 3677885 89999999876 3321 1344555455789998863 3334556777766654322 Q ss_pred eccEecCcccccCCCcccc Q lcl|Aclame:pro 481 YGIGINPFANSRSQAPADR 499 (524) Q Consensus 481 Y~l~~nP~~~~~~~~~~~~ 499 (524) |.-. .+..| T Consensus 213 -------~~~~---~~~~~ 221 (221) T protein:vir:17 213 -------FSIR---RPDRR 221 (221) T ss_pred -------eecc---CCCCC Confidence 2111 12222 No 87 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=52.88 E-value=0.54 Score=22.04 Aligned_cols=334 Identities=16% Similarity=0.193 Sum_probs=129.2 Q ss_pred CCc--------------hHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHH---Hhccccccchhhhhhhcccc Q lcl|Aclame:pro 1 MSK--------------KNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKD---AETDPVYRDEKIVESFGGFL 63 (524) Q Consensus 1 m~~--------------~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~---~~~~~~~~~~~~~~~~~~~l 63 (524) +.. ..+++++=.- ++ .+|..+ ++-+....+.+.+. .++....+++....+|..+| T Consensus 50 ~k~~~~~~~~~~~~~~~~~e~~~~~~~-~~-----~ei~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l 121 (425) T protein:vir:10 50 FKAEHTKQLDAVKAGLPTSDALAKVDK-VS-----ADLEAL--QAAVDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHV 121 (425) T ss_pred HHHHHHHHHHHHHhhhccHHHHHHHHH-HH-----HHHHHH--HHHHHHHHHHHHhhhcccccccccccHHHHHHHHHHh Confidence 111 1111111100 11 011111 11111111111111 01111122222233444444 Q ss_pred cccccccccccCcccccccc-ccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccc Q lcl|Aclame:pro 64 AEAEIAGDHNYDQTNIASGK-SSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTP 141 (524) Q Consensus 64 ~ea~~~g~~~~~~~~~~~st-~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gte 141 (524) ...++-. .+..++ +.|.+ ..-+.+. .+++.+-...+..++|.|.||+++..-+. -.. ++. T Consensus 122 ~~~e~~~-------al~~~t~~~gG~-lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~-------~~~--~~~- 183 (425) T protein:vir:10 122 KRGDVQA-------ALNKGEDSEGGY-LTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKL-------FNM--GGT- 183 (425) T ss_pred hhhhhHH-------HhhcCcCCCCce-eccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEE-------EEc--CCc- Confidence 3332110 000111 11111 1112222 25555556777888999999987654222 110 000 Q ss_pred cchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccc Q lcl|Aclame:pro 142 ADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENE 221 (524) Q Consensus 142 A~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~ 221 (524) .+.|-+. T Consensus 184 -------------~a~wv~E------------------------------------------------------------ 190 (425) T protein:vir:10 184 -------------TSGWVGE------------------------------------------------------------ 190 (425) T ss_pred -------------ceeeecc------------------------------------------------------------ Confidence 0000000 Q ss_pred ccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|Aclame:pro 222 KGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATE 301 (524) Q Consensus 222 ~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStE 301 (524) ++.. ..+....|.++.|.+.|..+ ...+|-||.+|- ..|.+++|.+-|+.. T Consensus 191 ---------------~~~~---~~~~~~~f~~v~~~~~k~~~-------~i~iS~ell~ds----~~~l~~~i~~~la~a 241 (425) T protein:vir:10 191 ---------------ASQR---PQTNAATFQPLSFASGEIYA-------NPAATQQILDDA----EIDLESWLATEVQTE 241 (425) T ss_pred ---------------cccc---ccccccccceeeeeheeeEe-------ehHhHHHHHhcc----hhHHHHHHHHHHHHH Confidence 0000 00001136666666666654 566999999985 356799999999999 Q ss_pred HHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccc---------------cccccchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 302 IMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPV---------------DIRGARWAGESYKALLIQIDKEAN 366 (524) Q Consensus 302 I~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~---------------~~~~~~~~~e~~r~L~~~i~~~a~ 366 (524) |..-+|+.||. | .| ...+.|++...... ....+.-..+....|+..+.. T Consensus 242 i~~~~d~~~l~--------G-~G----~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~--- 305 (425) T protein:vir:10 242 FAKQEGKAFLA--------G-DG----TNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPS--- 305 (425) T ss_pred HHHHHHhhhhc--------c-cC----CCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhh--- Confidence 99999999885 1 00 00122332211100 000111111223334433221 Q ss_pred HHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCc-----ceEEEE Q lcl|Aclame:pro 367 EIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ-----DYFTVG 441 (524) Q Consensus 367 ~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG 441 (524) .+-+....|++|.....|..+..+ .+. .+..++......++|.| ++|+++.+.+. +-+++| T Consensus 306 ------~~~~~a~~vmn~~~~~~L~~lkD~----~G~---~l~~~~~~~g~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G 371 (425) T protein:vir:10 306 ------AFTGNARFAMNRNTQRQVRKLKDG----QGN---YLWQPSYVAGQPATLAG-YPVTEVPDMPDVAANSTPILFG 371 (425) T ss_pred ------hhccCCEEEEchHHHHHHHHhhcC----CCc---eeeccCccCCCCceecc-eeeEEecCcCCccCCccEEEEE Confidence 222334678999998888753221 110 00011111111257887 69999887653 334443 Q ss_pred EecCCCccceeEeecccccccccccCCccccc--eeeeeeeeccE-ecCcccccCCCcccc Q lcl|Aclame:pro 442 FKGDNEMDAGIYYAPYVALTPLRGSDPKNFQP--VMGFKTRYGIG-INPFANSRSQAPADR 499 (524) Q Consensus 442 ~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP--~~~~~tRY~l~-~nP~~~~~~~~~~~~ 499 (524) +-.. ..+...= ..+....|+-.-.- .+-...||+.. .+|-+...-+-..++ T Consensus 372 ---d~~~--~~~i~~~--~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 372 ---DFQQ--TYLIIDR--IGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ---ehhc--cEEEEEe--cceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 1110 0111110 11111123322222 23344567653 344332221111111 No 88 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=48.77 E-value=0.66 Score=21.58 Aligned_cols=300 Identities=11% Similarity=0.034 Sum_probs=120.1 Q ss_pred hcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccccccccccccccCch Q lcl|Aclame:pro 15 LESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPA 94 (524) Q Consensus 15 l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~ 94 (524) .+ + ++....+++.....+.+-+ .+ ++.+.. .+.++.. .=|. T Consensus 1 ~~--~--~~~~~~~~~~~~~~~~~~~---------------------~~-----------~a~~~~-~~~~~~~--~iP~ 41 (324) T protein:vir:78 1 ME--Q--TQKLKLNLQHFASNNVKPQ---------------------VF-----------NPDNVM-MHEKKDG--TLMN 41 (324) T ss_pred CC--c--chhhhHHHHHHHHHhhhhh---------------------hh-----------cccccc-ccCcCcc--ccch Confidence 11 0 1111111221111111110 00 000000 0111111 1122 Q ss_pred hh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 95 VI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITT 172 (524) Q Consensus 95 li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~ 172 (524) -+ .+++.+..+....+++-+-||++++-- |.-.. ++. .+.| T Consensus 42 ~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-------~p~~~--~~~--------------~a~~-------------- 84 (324) T protein:vir:78 42 EFTTPILQEVMENSKIMQLGKYEPMEGTEKK-------FTFWA--DKP--------------GAYW-------------- 84 (324) T ss_pred hHHHHHHHHHHhhchhhhhcceeeccCCceE-------EEEEe--cCc--------------ceeE-------------- Confidence 22 355556667777888888888765421 11110 000 0000 Q ss_pred ccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccc Q lcl|Aclame:pro 173 GTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWN 252 (524) Q Consensus 173 gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~ 252 (524) .+| +..++ T Consensus 85 ---------------------------------------------------------------v~E---------g~~~~ 92 (324) T protein:vir:78 85 ---------------------------------------------------------------VGE---------GQKIE 92 (324) T ss_pred ---------------------------------------------------------------ecC---------Ccccc Confidence 001 01233 Q ss_pred cceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCcc Q lcl|Aclame:pro 253 EMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSK 332 (524) Q Consensus 253 EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~ 332 (524) +...+++++++..+.-+.-..+|-||.+|-. .|.+++|.+.|+..|...|++.+|.=--.+ ..+ T Consensus 93 ~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~------------~~~ 156 (324) T protein:vir:78 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFG 156 (324) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC------------CcC Confidence 4444555566666666666679999999864 567999999999999999999998521111 112 Q ss_pred ceeecccccccc-cccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccc Q lcl|Aclame:pro 333 AGSFDFQDPVDI-RGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVD 411 (524) Q Consensus 333 ~G~fdl~~~~~~-~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d 411 (524) .|+......... ..+. ..+..|.++.+.+.. .+...+.+|++|.....|..+..+- +. ....+ T Consensus 157 ~gi~~~~~~~~~~~~~~-------~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~~----G~---~~~~~ 220 (324) T protein:vir:78 157 KSIAQSIEKTNKVIKGD-------FTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE----TK---ERIYD 220 (324) T ss_pred ccccccccccceecccc-------ccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhccC----CC---eeecC Confidence 233332211110 0111 112233444444433 3345677999999999887542221 10 01111 Q ss_pred cccceeEEEecCcEEEEecCCCC--cceEEEE--------EecCCCccceeEeecccccccccccCCc-----cc---cc Q lcl|Aclame:pro 412 TTKAVFAGVLGGTYKVYIDQYAR--QDYFTVG--------FKGDNEMDAGIYYAPYVALTPLRGSDPK-----NF---QP 473 (524) Q Consensus 412 ~~~~~~~G~l~~~~~vy~D~y~~--~dy~~vG--------~KG~~~~~~~~fyaPYv~~~~~~~~dp~-----s~---qP 473 (524) .. .++|.| ++|++++... ...+++| ..+.-...- ..+..... ..|+. -| += T Consensus 221 ~~----~~~l~G-~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~ 289 (324) T protein:vir:78 221 RN----SDSLDG-LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMV 289 (324) T ss_pred CC----CCcccc-eeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcE Confidence 11 245666 6888877643 2234333 322211100 00000000 00110 01 11 Q ss_pred eeeeeeeeccEe-cC--cccccCCCccccccccchHHh Q lcl|Aclame:pro 474 VMGFKTRYGIGI-NP--FANSRSQAPADRITSGMISKE 508 (524) Q Consensus 474 ~~~~~tRY~l~~-nP--~~~~~~~~~~~~i~~~~~~~~ 508 (524) .+=...||+..+ +| |.. .+.+.. .....+.+- T Consensus 290 ~~r~~~r~d~~v~~~~A~~~-l~~a~~--~~~~~~~~~ 324 (324) T protein:vir:78 290 ALRATMHVALHIADDKAFAK-LVPADK--RTDSVPGEV 324 (324) T ss_pred EEEEEEEEccEEecccceEE-Eecccc--cCCCCCCCC Confidence 222234555432 22 110 000000 000111111 No 89 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=48.77 E-value=0.66 Score=21.58 Aligned_cols=300 Identities=11% Similarity=0.034 Sum_probs=120.1 Q ss_pred hcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccccccccccccccCch Q lcl|Aclame:pro 15 LESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPA 94 (524) Q Consensus 15 l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~ 94 (524) .+ + ++....+++.....+.+-+ .+ ++.+.. .+.++.. .=|. T Consensus 1 ~~--~--~~~~~~~~~~~~~~~~~~~---------------------~~-----------~a~~~~-~~~~~~~--~iP~ 41 (324) T protein:vir:96 1 ME--Q--TQKLKLNLQHFASNNVKPQ---------------------VF-----------NPDNVM-MHEKKDG--TLMN 41 (324) T ss_pred CC--c--chhhhHHHHHHHHHhhhhh---------------------hh-----------cccccc-ccCcCcc--ccch Confidence 11 0 1111111221111111110 00 000000 0111111 1122 Q ss_pred hh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 95 VI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITT 172 (524) Q Consensus 95 li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~ 172 (524) -+ .+++.+..+....+++-+-||++++-- |.-.. ++. .+.| T Consensus 42 ~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-------~p~~~--~~~--------------~a~~-------------- 84 (324) T protein:vir:96 42 EFTTPILQEVMENSKIMQLGKYEPMEGTEKK-------FTFWA--DKP--------------GAYW-------------- 84 (324) T ss_pred hHHHHHHHHHHhhchhhhhcceeeccCCceE-------EEEEe--cCc--------------ceeE-------------- Confidence 22 355556667777888888888765421 11110 000 0000 Q ss_pred ccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccc Q lcl|Aclame:pro 173 GTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWN 252 (524) Q Consensus 173 gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~ 252 (524) .+| +..++ T Consensus 85 ---------------------------------------------------------------v~E---------g~~~~ 92 (324) T protein:vir:96 85 ---------------------------------------------------------------VGE---------GQKIE 92 (324) T ss_pred ---------------------------------------------------------------ecC---------Ccccc Confidence 001 01233 Q ss_pred cceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCcc Q lcl|Aclame:pro 253 EMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSK 332 (524) Q Consensus 253 EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~ 332 (524) +...+++++++..+.-+.-..+|-||.+|-. .|.+++|.+.|+..|...|++.+|.=--.+ ..+ T Consensus 93 ~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~------------~~~ 156 (324) T protein:vir:96 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFG 156 (324) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC------------CcC Confidence 4444555566666666666679999999864 567999999999999999999998521111 112 Q ss_pred ceeecccccccc-cccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccc Q lcl|Aclame:pro 333 AGSFDFQDPVDI-RGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVD 411 (524) Q Consensus 333 ~G~fdl~~~~~~-~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d 411 (524) .|+......... ..+. ..+..|.++.+.+.. .+...+.+|++|.....|..+..+- +. ....+ T Consensus 157 ~gi~~~~~~~~~~~~~~-------~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~~----G~---~~~~~ 220 (324) T protein:vir:96 157 KSIAQSIEKTNKVIKGD-------FTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE----TK---ERIYD 220 (324) T ss_pred ccccccccccceecccc-------ccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhccC----CC---eeecC Confidence 233332211110 0111 112233444444433 3345677999999999887542221 10 01111 Q ss_pred cccceeEEEecCcEEEEecCCCC--cceEEEE--------EecCCCccceeEeecccccccccccCCc-----cc---cc Q lcl|Aclame:pro 412 TTKAVFAGVLGGTYKVYIDQYAR--QDYFTVG--------FKGDNEMDAGIYYAPYVALTPLRGSDPK-----NF---QP 473 (524) Q Consensus 412 ~~~~~~~G~l~~~~~vy~D~y~~--~dy~~vG--------~KG~~~~~~~~fyaPYv~~~~~~~~dp~-----s~---qP 473 (524) .. .++|.| ++|++++... ...+++| ..+.-...- ..+..... ..|+. -| += T Consensus 221 ~~----~~~l~G-~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~ 289 (324) T protein:vir:96 221 RN----SDSLDG-LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMV 289 (324) T ss_pred CC----CCcccc-eeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcE Confidence 11 245666 6888877643 2234333 322211100 00000000 00110 01 11 Q ss_pred eeeeeeeeccEe-cC--cccccCCCccccccccchHHh Q lcl|Aclame:pro 474 VMGFKTRYGIGI-NP--FANSRSQAPADRITSGMISKE 508 (524) Q Consensus 474 ~~~~~tRY~l~~-nP--~~~~~~~~~~~~i~~~~~~~~ 508 (524) .+=...||+..+ +| |.. .+.+.. .....+.+- T Consensus 290 ~~r~~~r~d~~v~~~~A~~~-l~~a~~--~~~~~~~~~ 324 (324) T protein:vir:96 290 ALRATMHVALHIADDKAFAK-LVPADK--RTDSVPGEV 324 (324) T ss_pred EEEEEEEEccEEecccceEE-Eecccc--cCCCCCCCC Confidence 222234555432 22 110 000000 000111111 No 90 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=47.44 E-value=0.7 Score=21.43 Aligned_cols=277 Identities=10% Similarity=0.042 Sum_probs=122.3 Q ss_pred cccccccccccCccccccccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccccc Q lcl|Aclame:pro 64 AEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPA 142 (524) Q Consensus 64 ~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA 142 (524) |. --.+++-|...+ +++.. ..-+.+. .+++.+.+.-+-..+|.+.||++++...+-... . +. T Consensus 1 m~-----~~~~~~~~~~~t-~~~~~-lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~----~----~~-- 63 (297) T protein:vir:95 1 MT-----VQTFNPENVLVS-QKKDG-TLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQT----D----GI-- 63 (297) T ss_pred CC-----cccccccccccc-CCCcc-eechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEc----C----Cc-- Confidence 11 122233332211 12221 1222222 455556667778888999998887654432110 0 00 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccc Q lcl|Aclame:pro 143 DVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEK 222 (524) Q Consensus 143 ~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~ 222 (524) + ..| T Consensus 64 ---~---------a~~---------------------------------------------------------------- 67 (297) T protein:vir:95 64 ---S---------AYW---------------------------------------------------------------- 67 (297) T ss_pred ---e---------eEE---------------------------------------------------------------- Confidence 0 000 Q ss_pred cccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|Aclame:pro 223 GTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEI 302 (524) Q Consensus 223 g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI 302 (524) .+| +..+++-..++++++...|..+-...+|.||.+|-. .|.+.+|.+.|+..| T Consensus 68 -------------v~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai 121 (297) T protein:vir:95 68 -------------VNE---------TEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAF 121 (297) T ss_pred -------------eec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHH Confidence 001 011333334456666666666667779999999875 467999999999999 Q ss_pred HHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEE Q lcl|Aclame:pro 303 MLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIA 382 (524) Q Consensus 303 ~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 382 (524) ...+++.||. |.. ...+.|++..........+. .-.+..|.++...|... +...+.+|| T Consensus 122 ~~~~d~a~l~--------G~g-----~~~~~gi~~~~~~~~~~~~~------~~t~~~i~~~~~~l~~~--~~~~~~~v~ 180 (297) T protein:vir:95 122 YKKIDEAGLL--------GHD-----TPFANSVAKAAKDANKVIGG------PINYDNILKLQDALYDA--DVEPNAFVS 180 (297) T ss_pred HHHHHHHHhc--------ccC-----Ccccccccccccccceeccc------ccCHHHHHHHHHHhhhc--cCCcCEEEE Confidence 9999999984 110 00112333221111110000 01122344455555443 234567899 Q ss_pred chhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCC--CcceEEE--------EEecCCCcccee Q lcl|Aclame:pro 383 SRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYA--RQDYFTV--------GFKGDNEMDAGI 452 (524) Q Consensus 383 S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy~~v--------G~KG~~~~~~~~ 452 (524) +|+....|...... .+. -.. .. ..++|.| ++|++-+.. +..-+++ |..+.-+.+- T Consensus 181 ~~~~~~~L~~l~d~----~G~--~i~--~~----~~~~l~G-~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~-- 245 (297) T protein:vir:95 181 KIQNRSALREARDG----NKV--SIY--DK----AANTIDG-ITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKI-- 245 (297) T ss_pred cHHHHHHHHHhhcc----CCc--eee--cC----CCCcccc-eeeEeecCCCCCCceEEEEecccEEEEEecCeEEEE-- Confidence 99999888753211 110 000 11 1245665 577754432 2222332 2222211100 Q ss_pred EeecccccccccccCCc-----ccc---ceeeeeeeeccEe-cCcccccCCCccccccccchH Q lcl|Aclame:pro 453 YYAPYVALTPLRGSDPK-----NFQ---PVMGFKTRYGIGI-NPFANSRSQAPADRITSGMIS 506 (524) Q Consensus 453 fyaPYv~~~~~~~~dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~ 506 (524) .. +.......|+. -|| =.+=...|++..+ ||= ...++....+. T Consensus 246 --~~--~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~-------a~~~l~~at~~ 297 (297) T protein:vir:95 246 --SE--EGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTD-------AFAKLTPAERV 297 (297) T ss_pred --ee--ccccccccccCccchhhhhcCcEEEEEEEEeccEeeccc-------ceEEEeecCCC Confidence 00 00000111211 122 1222335666543 331 11233333333 No 91 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=47.04 E-value=0.71 Score=21.38 Aligned_cols=340 Identities=16% Similarity=0.182 Sum_probs=131.0 Q ss_pred CCchHHHHHHhhHh-------hcc-----cccchhhcchhH--HHHHHH--HHHHHHHHHhcccc--------ccchh-- Q lcl|Aclame:pro 1 MSKKNELMEKWNDL-------LES-----QEGLPDIATKSK--KQLVAA--ILEAQEKDAETDPV--------YRDEK-- 54 (524) Q Consensus 1 m~~~~~l~~kw~p~-------l~~-----~~~~~~i~~~~~--~~~~~~--l~enq~~~~~~~~~--------~~~~~-- 54 (524) |.+.+++.+.=... |+. ...+.++..+.. ....++ =|+.|++++..... ...++ T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKG 80 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 66543333322111 110 001122211110 001111 12334333322110 00000 Q ss_pred ------hh----hhhcccccccccccccccCcccccccc-ccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchh Q lcl|Aclame:pro 55 ------IV----ESFGGFLAEAEIAGDHNYDQTNIASGK-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPT 121 (524) Q Consensus 55 ------~~----~~~~~~l~ea~~~g~~~~~~~~~~~st-~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPT 121 (524) .. .++..+|- + .+.-.-...+++ +.|.+.- |--+ .++++..+..+..++|.|.||++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~lr-----~-~~~~~~~~~~~t~~~gg~~v--P~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~ 152 (389) T protein:vir:10 81 TDLSKKPIDAKKKAINDFIH-----S-HGKVIDATSKVTSTEAGVLI--PEEIIYDPTAEVNSVVDLSTLVTKTPVTTPK 152 (389) T ss_pred cccchhHHHHHHHHHHHHhh-----c-chhhhhhhcccccCCcceee--hHHHHHHHHHHHHhhhhHHhhcceeeccCCe Confidence 00 00111110 0 000000011111 2222211 3222 4566666777888999999999876 Q ss_pred hhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccC Q lcl|Aclame:pro 122 GQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGN 201 (524) Q Consensus 122 GLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~ 201 (524) +-+--++. ... ..+...| T Consensus 153 ~~~~~~~~--~~~-----~~~~~~E------------------------------------------------------- 170 (389) T protein:vir:10 153 GTYPILKR--ATD-----RFSSVAE------------------------------------------------------- 170 (389) T ss_pred eEEEEEec--CCC-----ccccccc------------------------------------------------------- Confidence 43222211 000 0000000 Q ss_pred CcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 202 VTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQD 281 (524) Q Consensus 202 ~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQD 281 (524) .+ +.. ..+...|.+..+++.|. +--..+|-||.+| T Consensus 171 ---~~-------------------------------~~~----~~~~~~~~~i~~~~~k~-------~~~~~iS~ell~d 205 (389) T protein:vir:10 171 ---LA-------------------------------ENP----KLAEPEFNKVDWSVATY-------RGAIPLSEEAIAD 205 (389) T ss_pred ---cc-------------------------------ccc----ccccccceeeeeeheee-------EeeehhhHHHHhh Confidence 00 000 00112355555555555 4446789999998 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 282 LRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQI 361 (524) Q Consensus 282 LkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i 361 (524) - ..|.+++|.+-|...+..-+|..|+.-+....-.+ ..+... .+.+..++... T Consensus 206 s----~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~----------~~~~~~-------------~d~l~~~~~~~ 258 (389) T protein:vir:10 206 S----AVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKK----------TTTDTL-------------VDSLKHILNVD 258 (389) T ss_pred h----hHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc----------cccccc-------------HHHHHHHHHhh Confidence 4 34678899999999999999999986433211100 011100 11222322211 Q ss_pred HHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEe-cCC-CCc---c Q lcl|Aclame:pro 362 DKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYI-DQY-ARQ---D 436 (524) Q Consensus 362 ~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~-D~y-~~~---d 436 (524) . ...+ ...+||+|.....|..+...-. .+-.+.. ..+.+...+-++|.| ++||+ |.. .+. | T Consensus 259 ~--------~~~~--~a~~~~n~~~~~~L~~lkd~~G--~~i~~~~-~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~ 324 (389) T protein:vir:10 259 L--------DPAY--SRALVVTQSLFNTLDTLKDKNG--RYLLHDA-SDSITDGTAKGTILG-VPVYVVGDTLLGSLAGD 324 (389) T ss_pred h--------hhhh--CcEEEecHHHHHHHHHhhccCC--CeeeecC-ccccccccccccccc-ceeEEecccccCCCCCc Confidence 1 1122 2467899999888886432211 0000000 011112223357888 57765 322 221 1 Q ss_pred e-EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccEe-cCcc--c-ccCCCccccccccchHHhhcc Q lcl|Aclame:pro 437 Y-FTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFA--N-SRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 437 y-~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~--~-~~~~~~~~~i~~~~~~~~~a~ 511 (524) . +++|= +..+..+... ....+...|-..|.-.+-..-|++..+ ||=+ . ..+..++ ..++ T Consensus 325 ~~~~~gd-----~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~----------~~~~ 388 (389) T protein:vir:10 325 QKAFVGD-----LKRGVLFTDR-QQVTLAWEDSKIYGKYLGAAFRFGVQKADSKAGYFVTNTDVPG----------SALG 388 (389) T ss_pred eEEEEee-----ccccEEEEee-cceEEEeeccccccceEEEEEEeccEEecccceEEEEeeccCC----------CCCC Confidence 1 33330 0000000000 111222334455666777778988653 3311 0 0011111 1123 Q ss_pred c Q lcl|Aclame:pro 512 K 512 (524) Q Consensus 512 ~ 512 (524) | T Consensus 389 ~ 389 (389) T protein:vir:10 389 K 389 (389) T ss_pred C Confidence 3 No 92 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=43.95 E-value=0.82 Score=21.04 Aligned_cols=267 Identities=10% Similarity=0.038 Sum_probs=110.1 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 160 GEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAEL 239 (524) Q Consensus 160 G~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEa 239 (524) -....+..+..-.+.- ....+.-.+ ......++....... +. + . .|....+..-=....+|. T Consensus 1 ma~~~T~~~d~iiPev-~~~~v~~~~-------~~~~~~~~~~~~~~~----l~----g-~-~G~ti~iP~~~~~gda~~ 62 (272) T protein:vir:36 1 MSKQKTTLADLVNPEV-LAPIVSYEL-------NKALRFAPLAQVDTT----LQ----G-Q-PGNTLKFPAFTYIGDAAD 62 (272) T ss_pred CCCcceehhhhhchHH-HHHHHHHHH-------Hhhhhhccccccccc----cc----c-C-CCCEEEEeeeccCccccc Confidence 0000000000000000 000000000 000011111111000 00 0 0 112221111101122222 Q ss_pred ccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhee Q lcl|Aclame:pro 240 QENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQ 319 (524) Q Consensus 240 l~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~ 319 (524) +. ....-+..++ +..+.+++-|-|+-.-++|=|. ++.-+-|.-.|..+-++..++.+++++|+..+..... T Consensus 63 ~~---eg~~i~~~~l--t~~~~~~~i~~~~k~~~vtD~~----~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~ 133 (272) T protein:vir:36 63 VA---EGGEISLDKI--GTTTKSVTIKKAAKGTEITDEA----ALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ 133 (272) T ss_pred cC---CCCccChhhc--CCcceeEeeehhhccccccHHH----HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 21 1111223344 3455566666665322333222 1223578999999999999999999999976532111 Q ss_pred eeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccc Q lcl|Aclame:pro 320 VGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 399 (524) Q Consensus 320 ~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~ 399 (524) +. .+.+. .+.+-.+..++.++. ...++++|+|.++..|..-...... T Consensus 134 --------~~---~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~~~ 180 (272) T protein:vir:36 134 --------TV---STKAN-------------VDGVQAALDIFNDED---------AQAYVLIVNPKDAAKIRKDANAKNI 180 (272) T ss_pred --------cc---ccccc-------------HHHHHHHHHHhhhcC---------CCceEEEEcHHHHHHHhcccccccc Confidence 10 01111 122223333333322 2457999999999998642222211 Q ss_pred cchhhhcccccccccceeEEEecCcEEEEecCCCCcc---eEEEEEe-cCCCccceeEeecccccccccccCCcccccee Q lcl|Aclame:pro 400 ASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD---YFTVGFK-GDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 475 (524) Q Consensus 400 ~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---y~~vG~K-G~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~ 475 (524) .+.. ... ...+...|.+.| ++|++|...|.+ |..+.+. |. -.+|..= ....-...|+..++=.+ T Consensus 181 ~~~~---~~~--~~~~G~ig~~~G-~~Vv~s~~~p~~~~~~~~~~~~~gA-----~~~~~~~-~~~vE~~R~~~~~~d~i 248 (272) T protein:vir:36 181 GSEV---GAN--ALINGTYADVLG-AQIVRSKKLAEGSALMFKIVSNSPA-----LKLVLKR-GVQVETDRDIVTKTTVI 248 (272) T ss_pred cccc---ccc--ceeeeccceecC-eeEEEeCCCCCCceeEEEEEecccc-----eeeeecC-CcccccccchhhcCcEE Confidence 1110 000 011123577777 899999998764 2222221 21 1122211 11111236999999998 Q ss_pred eeeeeeccEe-cCcccccCCCccccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 476 GFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 476 ~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) --.-+||+.+ || +. ...+-.||| T Consensus 249 ~~~~~y~~~v~~~-------~~-------------------vv~~t~~g~ 272 (272) T protein:vir:36 249 TADEHYAAYLYDL-------TK-------------------VVNITFTGV 272 (272) T ss_pred EEEEEEEEEEEcC-------cc-------------------EEEEeecCC Confidence 8889998765 33 11 112223333 No 93 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=43.44 E-value=0.84 Score=20.99 Aligned_cols=337 Identities=16% Similarity=0.146 Sum_probs=116.5 Q ss_pred CCchHHHHHHhhHhhccc----cc-----------chhhcchhH--HHHHHHH--HHHHHHHHhccc------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQ----EG-----------LPDIATKSK--KQLVAAI--LEAQEKDAETDP------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~----~~-----------~~~i~~~~~--~~~~~~l--~enq~~~~~~~~------------- 48 (524) |.+.++|+++|..+.+.- +. .++|....+ ..+-+++ |+.|.+.+..+. T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 95 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 95 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 877777877766554210 00 122222111 0111111 222222221110 Q ss_pred cccchhhhhhhccccccccccccc-------ccCccc-ccccccc-ccccccCchhh--hHHHHHHhhhhhhheeeeecC Q lcl|Aclame:pro 49 VYRDEKIVESFGGFLAEAEIAGDH-------NYDQTN-IASGKSS-GAITNIGPAVI--GMVRRAIPNLIAFDICGVQPM 117 (524) Q Consensus 49 ~~~~~~~~~~~~~~l~ea~~~g~~-------~~~~~~-~~~st~s-g~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPm 117 (524) .-..++...++..++.... .+.. .....+ ...++.+ |.+. =|.=+ .+++.....-+-.++|.|.|+ T Consensus 96 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~t~~~GG~l--IP~~~~~~Ii~~~~~~~~l~~~~~v~~~ 172 (402) T protein:vir:93 96 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSGGDKL--LPKTLSKEIVSEPFAKNQLREKARLTNI 172 (402) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHhHHHHHhhhccCCCcCCccc--cchhHHHHHHHhHHhhhhhhhhceeeec Confidence 0011111111111111000 0000 000000 0001111 1100 02111 123333333444677777766 Q ss_pred CchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 118 TGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNV 197 (524) Q Consensus 118 TgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~ 197 (524) ++.+.- |-.+... +..| T Consensus 173 ~~~~~p----~~~~~~~--------------------~a~~--------------------------------------- 189 (402) T protein:vir:93 173 KGLEIP----RVSYTLD--------------------DDDF--------------------------------------- 189 (402) T ss_pred CCceee----eeeccCC--------------------cccc--------------------------------------- Confidence 532210 0000000 0000 Q ss_pred cccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHH Q lcl|Aclame:pro 198 TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVE 277 (524) Q Consensus 198 ~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~E 277 (524) .++++.. ..+...|.+..|.+ +.-+-...+|-| T Consensus 190 ------------------------------------v~Eg~~~----~~~~~~f~~i~~~~-------~k~~~~i~iS~e 222 (402) T protein:vir:93 190 ------------------------------------ITDVETA----KELKAKGDTVKFTT-------NKFKVFAAISDT 222 (402) T ss_pred ------------------------------------ccccccc----cccccccceeeecc-------eeeeeechhhHH Confidence 0000000 00112244444444 444445789999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHH Q lcl|Aclame:pro 278 LAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKAL 357 (524) Q Consensus 278 LAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L 357 (524) |.+|- ..|.+++|.+-|+..|..-.|..++-.-+-+.+ +.|++.-.....+.+... .+....| T Consensus 223 ll~Ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~------------p~g~~~~~~~~~~~~~~~-~d~l~~~ 285 (402) T protein:vir:93 223 VIHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSGL------------EHMSFYNGSVKEVEGADM-YDAIINA 285 (402) T ss_pred HHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccc------------cceeeeccccccccccch-HHHHHHH Confidence 99985 355689999999988887666655532111111 123222111111111111 1223333 Q ss_pred HHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcce Q lcl|Aclame:pro 358 LIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDY 437 (524) Q Consensus 358 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy 437 (524) +. .+...= +..+.|++-+.....++...+.+ ++ .. ....+ ++|.| ++||+..+++. T Consensus 286 ~~-------~l~~~y-~~na~~imn~~t~~~~~~~~~d~-----~~---~~-~~~~~----~~llG-~PV~~t~~~~~-- 341 (402) T protein:vir:93 286 LA-------DLHEDY-RDNATIYMRYADYVKIISVLSNG-----TT---NF-FDTPA----EKVFG-KPVVFTDAAVK-- 341 (402) T ss_pred Hh-------ccChhh-hcCCEEEEechHHHHHHHHHhcC-----CC---cc-cccCC----ccccc-cceEEecCCCc-- Confidence 33 332221 23555544444434444322111 00 00 01111 35776 69999877654 Q ss_pred EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccE-ecCcccccCCCccccccccchHHhhccchhhh Q lcl|Aclame:pro 438 FTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INPFANSRSQAPADRITSGMISKEMCGKNAYF 516 (524) Q Consensus 438 ~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~~~~~~~~a~~~~~~ 516 (524) +++|-- +-||.=|.....-+..|+.+.+-.+-...|++.. +||=+ | T Consensus 342 i~~GDf-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A--------------------------~ 388 (402) T protein:vir:93 342 PIVGDF-------NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSA--------------------------F 388 (402) T ss_pred eeeech-------hhhhhhhhhhhhhhhhcccCCceEEEEEEEeCcEEechhh--------------------------e Confidence 334421 1122212111111223444433333334466533 23311 2 Q ss_pred hhhhcccC Q lcl|Aclame:pro 517 RKVWVKGL 524 (524) Q Consensus 517 ~~~~V~~~ 524 (524) +.+.||.- T Consensus 389 ~~l~ik~~ 396 (402) T protein:vir:93 389 RIAKAKEN 396 (402) T ss_pred EEEEeecC Confidence 22222222 No 94 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=42.27 E-value=0.89 Score=20.86 Aligned_cols=326 Identities=14% Similarity=0.084 Sum_probs=122.1 Q ss_pred CCchHHHHHHhhHhhcccc-------cch-------hhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQE-------GLP-------DIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEA 66 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~-------~~~-------~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (524) +..-+.|+++....-+-.+ .-+ +-...+|+. ..+.|.++ ....+++ . +.....|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~-------~~~~~~~--~-~~~~~~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNK-------PLNAEER--E-FLEDDLEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcc-------cccHHHH--H-HHhhhhhh Confidence 3333344444332111000 000 000112221 22222211 1000000 0 00000010 Q ss_pred ccccccccCcccccccc-cccccc---ccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccccc Q lcl|Aclame:pro 67 EIAGDHNYDQTNIASGK-SSGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPA 142 (524) Q Consensus 67 ~~~g~~~~~~~~~~~st-~sg~v~---~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA 142 (524) .. ...++ +.|.+. .+.+- +++.+..+..-.++|.+.||++++|-+. ..+..+. .. T Consensus 104 ~~----------~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~-----~~- 162 (392) T protein:vir:10 104 RA----------MSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDM-----IP- 162 (392) T ss_pred hh----------ccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCC-----cc- Confidence 00 00111 112111 12233 3444445666678999999998876422 1111110 00 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccc Q lcl|Aclame:pro 143 DVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEK 222 (524) Q Consensus 143 ~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~ 222 (524) ..|-+. T Consensus 163 -------------a~~v~E------------------------------------------------------------- 168 (392) T protein:vir:10 163 -------------FAEITE------------------------------------------------------------- 168 (392) T ss_pred -------------ceeecc------------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|Aclame:pro 223 GTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEI 302 (524) Q Consensus 223 g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI 302 (524) | ++. ..+....|.++.|...| -+-...+|-||.+|- ..|.+++|.+.|...| T Consensus 169 --------~-----~~~----~~~~~~~~~~v~l~~~k-------~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i 220 (392) T protein:vir:10 169 --------M-----GEI----PETDNPKFSNVQYAVKD-------RAGILPLSRSLLQDS----DQNILKYVTKWLGKKS 220 (392) T ss_pred --------c-----ccc----cccccccceeEEeeeee-------EEEeehhhHHHHhhh----HHHHHHHHHHHHHHHH Confidence 0 000 00011224444454444 444567999999994 2567999999999999 Q ss_pred HHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEE Q lcl|Aclame:pro 303 MLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIA 382 (524) Q Consensus 303 ~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 382 (524) ...++..|+.-.... .+.|+..+ +....++... . ...+-..-..|+ T Consensus 221 ~~~~d~~~~~g~g~~-------------~~~~~~~~-------------d~i~~~~~~~--l------~~~~~~~a~~vm 266 (392) T protein:vir:10 221 KVTRNVLILGVIEKL-------------TKQAIKSL-------------DDIKDVLNVK--L------DPAISPNAILLT 266 (392) T ss_pred HHHHHHHHhhccccc-------------cccCccCH-------------HHHHHHHHHh--h------hhhhccCCEEEE Confidence 999999988522211 11232222 1222222111 1 112223355789 Q ss_pred chhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeeccccc-- Q lcl|Aclame:pro 383 SRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVAL-- 460 (524) Q Consensus 383 S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~-- 460 (524) +|.....|..+...- + ...-..+......++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 267 ~~~~~~~L~~lkd~~----G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 267 NQDGFNYLDKLKDKD----G---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cHHHHHHHHHhhccC----C---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEE Confidence 999999887542111 1 00001111112235777765666543221 111111112222333322110 Q ss_pred -----ccccccCC------ccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 461 -----TPLRGSDP------KNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 461 -----~~~~~~dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) .+.-.+++ .+.+=.+-...|++..+ +|=+...-. +.-..+...-+| T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 00001122 23344566667776543 331110000 000011111122 No 95 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=42.27 E-value=0.89 Score=20.86 Aligned_cols=326 Identities=14% Similarity=0.084 Sum_probs=122.1 Q ss_pred CCchHHHHHHhhHhhcccc-------cch-------hhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQE-------GLP-------DIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEA 66 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~-------~~~-------~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (524) +..-+.|+++....-+-.+ .-+ +-...+|+. ..+.|.++ ....+++ . +.....|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~-------~~~~~~~--~-~~~~~~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNK-------PLNAEER--E-FLEDDLEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcc-------cccHHHH--H-HHhhhhhh Confidence 3333344444332111000 000 000112221 22222211 1000000 0 00000010 Q ss_pred ccccccccCcccccccc-cccccc---ccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccccc Q lcl|Aclame:pro 67 EIAGDHNYDQTNIASGK-SSGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPA 142 (524) Q Consensus 67 ~~~g~~~~~~~~~~~st-~sg~v~---~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA 142 (524) .. ...++ +.|.+. .+.+- +++.+..+..-.++|.+.||++++|-+. ..+..+. .. T Consensus 104 ~~----------~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~-----~~- 162 (392) T protein:vir:10 104 RA----------MSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDM-----IP- 162 (392) T ss_pred hh----------ccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCC-----cc- Confidence 00 00111 112111 12233 3444445666678999999998876422 1111110 00 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccc Q lcl|Aclame:pro 143 DVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEK 222 (524) Q Consensus 143 ~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~ 222 (524) ..|-+. T Consensus 163 -------------a~~v~E------------------------------------------------------------- 168 (392) T protein:vir:10 163 -------------FAEITE------------------------------------------------------------- 168 (392) T ss_pred -------------ceeecc------------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|Aclame:pro 223 GTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEI 302 (524) Q Consensus 223 g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI 302 (524) | ++. ..+....|.++.|...| -+-...+|-||.+|- ..|.+++|.+.|...| T Consensus 169 --------~-----~~~----~~~~~~~~~~v~l~~~k-------~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i 220 (392) T protein:vir:10 169 --------M-----GEI----PETDNPKFSNVQYAVKD-------RAGILPLSRSLLQDS----DQNILKYVTKWLGKKS 220 (392) T ss_pred --------c-----ccc----cccccccceeEEeeeee-------EEEeehhhHHHHhhh----HHHHHHHHHHHHHHHH Confidence 0 000 00011224444454444 444567999999994 2567999999999999 Q ss_pred HHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEE Q lcl|Aclame:pro 303 MLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIA 382 (524) Q Consensus 303 ~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 382 (524) ...++..|+.-.... .+.|+..+ +....++... . ...+-..-..|+ T Consensus 221 ~~~~d~~~~~g~g~~-------------~~~~~~~~-------------d~i~~~~~~~--l------~~~~~~~a~~vm 266 (392) T protein:vir:10 221 KVTRNVLILGVIEKL-------------TKQAIKSL-------------DDIKDVLNVK--L------DPAISPNAILLT 266 (392) T ss_pred HHHHHHHHhhccccc-------------cccCccCH-------------HHHHHHHHHh--h------hhhhccCCEEEE Confidence 999999988522211 11232222 1222222111 1 112223355789 Q ss_pred chhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeeccccc-- Q lcl|Aclame:pro 383 SRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVAL-- 460 (524) Q Consensus 383 S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~-- 460 (524) +|.....|..+...- + ...-..+......++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 267 ~~~~~~~L~~lkd~~----G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 267 NQDGFNYLDKLKDKD----G---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cHHHHHHHHHhhccC----C---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEE Confidence 999999887542111 1 00001111112235777765666543221 111111112222333322110 Q ss_pred -----ccccccCC------ccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 461 -----TPLRGSDP------KNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 461 -----~~~~~~dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) .+.-.+++ .+.+=.+-...|++..+ +|=+...-. +.-..+...-+| T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 00001122 23344566667776543 331110000 000011111122 No 96 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=42.27 E-value=0.89 Score=20.86 Aligned_cols=326 Identities=14% Similarity=0.084 Sum_probs=122.1 Q ss_pred CCchHHHHHHhhHhhcccc-------cch-------hhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQE-------GLP-------DIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEA 66 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~-------~~~-------~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (524) +..-+.|+++....-+-.+ .-+ +-...+|+. ..+.|.++ ....+++ . +.....|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~-------~~~~~~~--~-~~~~~~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNK-------PLNAEER--E-FLEDDLEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcc-------cccHHHH--H-HHhhhhhh Confidence 3333344444332111000 000 000112221 22222211 1000000 0 00000010 Q ss_pred ccccccccCcccccccc-cccccc---ccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccccc Q lcl|Aclame:pro 67 EIAGDHNYDQTNIASGK-SSGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPA 142 (524) Q Consensus 67 ~~~g~~~~~~~~~~~st-~sg~v~---~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA 142 (524) .. ...++ +.|.+. .+.+- +++.+..+..-.++|.+.||++++|-+. ..+..+. .. T Consensus 104 ~~----------~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~-----~~- 162 (392) T protein:vir:10 104 RA----------MSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDM-----IP- 162 (392) T ss_pred hh----------ccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCC-----cc- Confidence 00 00111 112111 12233 3444445666678999999998876422 1111110 00 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccc Q lcl|Aclame:pro 143 DVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEK 222 (524) Q Consensus 143 ~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~ 222 (524) ..|-+. T Consensus 163 -------------a~~v~E------------------------------------------------------------- 168 (392) T protein:vir:10 163 -------------FAEITE------------------------------------------------------------- 168 (392) T ss_pred -------------ceeecc------------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|Aclame:pro 223 GTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEI 302 (524) Q Consensus 223 g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI 302 (524) | ++. ..+....|.++.|...| -+-...+|-||.+|- ..|.+++|.+.|...| T Consensus 169 --------~-----~~~----~~~~~~~~~~v~l~~~k-------~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i 220 (392) T protein:vir:10 169 --------M-----GEI----PETDNPKFSNVQYAVKD-------RAGILPLSRSLLQDS----DQNILKYVTKWLGKKS 220 (392) T ss_pred --------c-----ccc----cccccccceeEEeeeee-------EEEeehhhHHHHhhh----HHHHHHHHHHHHHHHH Confidence 0 000 00011224444454444 444567999999994 2567999999999999 Q ss_pred HHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEE Q lcl|Aclame:pro 303 MLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIA 382 (524) Q Consensus 303 ~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 382 (524) ...++..|+.-.... .+.|+..+ +....++... . ...+-..-..|+ T Consensus 221 ~~~~d~~~~~g~g~~-------------~~~~~~~~-------------d~i~~~~~~~--l------~~~~~~~a~~vm 266 (392) T protein:vir:10 221 KVTRNVLILGVIEKL-------------TKQAIKSL-------------DDIKDVLNVK--L------DPAISPNAILLT 266 (392) T ss_pred HHHHHHHHhhccccc-------------cccCccCH-------------HHHHHHHHHh--h------hhhhccCCEEEE Confidence 999999988522211 11232222 1222222111 1 112223355789 Q ss_pred chhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeeccccc-- Q lcl|Aclame:pro 383 SRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVAL-- 460 (524) Q Consensus 383 S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~-- 460 (524) +|.....|..+...- + ...-..+......++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 267 ~~~~~~~L~~lkd~~----G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 267 NQDGFNYLDKLKDKD----G---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cHHHHHHHHHhhccC----C---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEE Confidence 999999887542111 1 00001111112235777765666543221 111111112222333322110 Q ss_pred -----ccccccCC------ccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 461 -----TPLRGSDP------KNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 461 -----~~~~~~dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) .+.-.+++ .+.+=.+-...|++..+ +|=+...-. +.-..+...-+| T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 00001122 23344566667776543 331110000 000011111122 No 97 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=42.27 E-value=0.89 Score=20.86 Aligned_cols=326 Identities=14% Similarity=0.084 Sum_probs=122.1 Q ss_pred CCchHHHHHHhhHhhcccc-------cch-------hhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQE-------GLP-------DIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEA 66 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~-------~~~-------~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (524) +..-+.|+++....-+-.+ .-+ +-...+|+. ..+.|.++ ....+++ . +.....|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~-------~~~~~~~--~-~~~~~~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNK-------PLNAEER--E-FLEDDLEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcc-------cccHHHH--H-HHhhhhhh Confidence 3333344444332111000 000 000112221 22222211 1000000 0 00000010 Q ss_pred ccccccccCcccccccc-cccccc---ccCchhhhHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCccccc Q lcl|Aclame:pro 67 EIAGDHNYDQTNIASGK-SSGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPA 142 (524) Q Consensus 67 ~~~g~~~~~~~~~~~st-~sg~v~---~~~P~li~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA 142 (524) .. ...++ +.|.+. .+.+- +++.+..+..-.++|.+.||++++|-+. ..+..+. .. T Consensus 104 ~~----------~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~-----~~- 162 (392) T protein:vir:10 104 RA----------MSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDM-----IP- 162 (392) T ss_pred hh----------ccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCC-----cc- Confidence 00 00111 112111 12233 3444445666678999999998876422 1111110 00 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccc Q lcl|Aclame:pro 143 DVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEK 222 (524) Q Consensus 143 ~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~ 222 (524) ..|-+. T Consensus 163 -------------a~~v~E------------------------------------------------------------- 168 (392) T protein:vir:10 163 -------------FAEITE------------------------------------------------------------- 168 (392) T ss_pred -------------ceeecc------------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|Aclame:pro 223 GTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEI 302 (524) Q Consensus 223 g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI 302 (524) | ++. ..+....|.++.|...| -+-...+|-||.+|- ..|.+++|.+.|...| T Consensus 169 --------~-----~~~----~~~~~~~~~~v~l~~~k-------~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i 220 (392) T protein:vir:10 169 --------M-----GEI----PETDNPKFSNVQYAVKD-------RAGILPLSRSLLQDS----DQNILKYVTKWLGKKS 220 (392) T ss_pred --------c-----ccc----cccccccceeEEeeeee-------EEEeehhhHHHHhhh----HHHHHHHHHHHHHHHH Confidence 0 000 00011224444454444 444567999999994 2567999999999999 Q ss_pred HHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEE Q lcl|Aclame:pro 303 MLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIA 382 (524) Q Consensus 303 ~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 382 (524) ...++..|+.-.... .+.|+..+ +....++... . ...+-..-..|+ T Consensus 221 ~~~~d~~~~~g~g~~-------------~~~~~~~~-------------d~i~~~~~~~--l------~~~~~~~a~~vm 266 (392) T protein:vir:10 221 KVTRNVLILGVIEKL-------------TKQAIKSL-------------DDIKDVLNVK--L------DPAISPNAILLT 266 (392) T ss_pred HHHHHHHHhhccccc-------------cccCccCH-------------HHHHHHHHHh--h------hhhhccCCEEEE Confidence 999999988522211 11232222 1222222111 1 112223355789 Q ss_pred chhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeeccccc-- Q lcl|Aclame:pro 383 SRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVAL-- 460 (524) Q Consensus 383 S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~-- 460 (524) +|.....|..+...- + ...-..+......++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 267 ~~~~~~~L~~lkd~~----G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 267 NQDGFNYLDKLKDKD----G---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cHHHHHHHHHhhccC----C---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEE Confidence 999999887542111 1 00001111112235777765666543221 111111112222333322110 Q ss_pred -----ccccccCC------ccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 461 -----TPLRGSDP------KNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 461 -----~~~~~~dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) .+.-.+++ .+.+=.+-...|++..+ +|=+...-. +.-..+...-+| T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 00001122 23344566667776543 331110000 000011111122 No 98 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=42.21 E-value=0.89 Score=20.85 Aligned_cols=347 Identities=12% Similarity=0.098 Sum_probs=124.9 Q ss_pred CC--------chHHHHHHhhHhhcccccchhhcch----------------------------------hHHHHHHHHHH Q lcl|Aclame:pro 1 MS--------KKNELMEKWNDLLESQEGLPDIATK----------------------------------SKKQLVAAILE 38 (524) Q Consensus 1 m~--------~~~~l~~kw~p~l~~~~~~~~i~~~----------------------------------~~~~~~~~l~e 38 (524) +. ..++|.++...+-+. +.++... ..|+...+.+. T Consensus 31 ~~ee~~~~~~e~~~l~~~~~~l~~~---i~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 107 (434) T protein:vir:62 31 RSEELAAVKAEVEQLTKEIQTISEE---LAKLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIA 107 (434) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHH Confidence 11 112233333322110 0011000 00000000000 Q ss_pred HHHHHHhccccccchhhhhhhcccccccccccccccCccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeec Q lcl|Aclame:pro 39 AQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQP 116 (524) Q Consensus 39 nq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQP 116 (524) +..+.. ..........-.+|..+|... ....-. -+-++++++-.-.=|.-+ .+++..-++.+...++-|.| T Consensus 108 ~~~~~~-~~~~~~~~e~r~a~~~~l~~~-----~~~~e~-~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~ 180 (434) T protein:vir:62 108 AALSTK-GHRTNKETEIRSVFANYIVGN-----IDEKEA-RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVK 180 (434) T ss_pred hhhhhc-cccchHHHHHHHHHHHHhccc-----cchhhh-hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceec Confidence 000000 000000001111122222110 000000 001111211000013332 25555556667777787777 Q ss_pred CCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 117 MTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQN 196 (524) Q Consensus 117 mTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~ 196 (524) +++..- | .++.... + ..+- T Consensus 181 ~~~~~~--~---p~~~~~~----~---------------a~~~------------------------------------- 199 (434) T protein:vir:62 181 TKENIK--Y---PVLVKKA----E---------------AQGH------------------------------------- 199 (434) T ss_pred cCCceE--E---EEEecCC----c---------------ccce------------------------------------- Confidence 654210 0 0000000 0 0000 Q ss_pred ccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccH Q lcl|Aclame:pro 197 VTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSV 276 (524) Q Consensus 197 ~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ 276 (524) ....| +...++-..++++++..+|.-+-...+|- T Consensus 200 -------------------------------------~~~~e---------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ 233 (434) T protein:vir:62 200 -------------------------------------KNERT---------NNEMPETDIEFDEIELSPTEFDALATVTK 233 (434) T ss_pred -------------------------------------ecccc---------cccccccccceeeEEeeheeeEeehhhHH Confidence 00000 01112222355666677777777788999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHH Q lcl|Aclame:pro 277 ELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKA 356 (524) Q Consensus 277 ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~ 356 (524) ||.+|- .+|.+++|.+-|+..|..-+++.||. |. | .+..+.|++.-..... .+..+ -.+.. T Consensus 234 ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~--------G~-G---~~~~~~g~~~~~~~~~--~~~~~-~~~d~ 294 (434) T protein:vir:62 234 KLLART----GLPIEQIVMDELKKAYVRKETQYMVN--------GD-E---ANNINDGALAKKAVEF--KTDEK-NLYDA 294 (434) T ss_pred HHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhc--------cC-C---CCccccceeecccccc--ccccc-chhhH Confidence 999995 45679999999999999999999994 10 0 0001112211100000 00000 01222 Q ss_pred HHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc Q lcl|Aclame:pro 357 LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD 436 (524) Q Consensus 357 L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 436 (524) | -++-..+... +.+.-..|++|.....|..+..+-. .+-.+...... ...-.+|.| ++|+++.+.+.. T Consensus 295 l----~~l~~~l~~~--~~~~a~~v~n~~~~~~L~~lkd~~G--~~l~~~~~~~~---~g~~~tl~G-~pV~~~~~~~~~ 362 (434) T protein:vir:62 295 L----VKMKNTPVKE--VRKKARWVLNTAALTKIETMKTDDG--FPLLRPFNQAE---GGIGYTLLG-FPVEEEDAIDIP 362 (434) T ss_pred H----HHHHhhcchh--hhcCCEEEEcHHHHHHHHHhhccCC--CEeeccCCCcc---CCCCceecc-eeeEEecCccCc Confidence 2 2233333222 2233356889999888865322211 00000000000 001125777 699888765421 Q ss_pred eEEEEEecCCCccceeEe---ecccc------cccccccCC--ccccceeeeeeeeccE-e-cCcccccCCCcccccccc Q lcl|Aclame:pro 437 YFTVGFKGDNEMDAGIYY---APYVA------LTPLRGSDP--KNFQPVMGFKTRYGIG-I-NPFANSRSQAPADRITSG 503 (524) Q Consensus 437 y~~vG~KG~~~~~~~~fy---aPYv~------~~~~~~~dp--~s~qP~~~~~tRY~l~-~-nP~~~~~~~~~~~~i~~~ 503 (524) - .|.. .-++| +-|.- ....+..+. .+-|=.+..+.|++-. + .|++... +.. T Consensus 363 ~-----~~~~---~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~--------~~~ 426 (434) T protein:vir:62 363 D-----SPDT---PVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPV--------YKY 426 (434) T ss_pred c-----CCCc---eEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceE--------EEE Confidence 1 1100 01111 11111 111111222 2223335566777533 4 3776432 222 Q ss_pred chHHhhcc Q lcl|Aclame:pro 504 MISKEMCG 511 (524) Q Consensus 504 ~~~~~~a~ 511 (524) .-.....+ T Consensus 427 ~~~~~~~~ 434 (434) T protein:vir:62 427 VLKAPTGA 434 (434) T ss_pred EeccCCCC Confidence 21222222 No 99 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=40.43 E-value=0.97 Score=20.65 Aligned_cols=279 Identities=12% Similarity=0.065 Sum_probs=113.5 Q ss_pred cccccccccccCchhh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYS 159 (524) Q Consensus 81 ~st~sg~v~~~~P~li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fS 159 (524) -.+++|.+ .-|.+. .+++.+.++.+-.++|.+.||++.. + +|+-.. ++.++ .|- T Consensus 1 ma~~gG~l--ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-----~--~~p~~~--~~~~a--------------~~v 55 (298) T protein:vir:94 1 MVLNKGTL--FDPELVTDLISKVAGKSSIARLSAQKPIPFNG-----E--KVFTFT--MDSEI--------------DVV 55 (298) T ss_pred Ceeccccc--cChhHHHHHHHHHHhhchhhhhcceeeccCCc-----e--EEEEEe--cCcce--------------EEe Confidence 22222322 234443 4666666778888889998886532 1 111110 00000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 160 GEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAEL 239 (524) Q Consensus 160 G~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEa 239 (524) ++ +|. T Consensus 56 ---------------------------------------------------------------------~E------g~~ 60 (298) T protein:vir:94 56 ---------------------------------------------------------------------AE------SGK 60 (298) T ss_pred ---------------------------------------------------------------------eC------Ccc Confidence 00 000 Q ss_pred ccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhee Q lcl|Aclame:pro 240 QENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQ 319 (524) Q Consensus 240 l~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~ 319 (524) . ..+...|.++.|...|..+ ....|-||.|+--. -..+-+++|.+-|+..|..+|+.-++.-..... T Consensus 61 ~----~~~~~~f~~v~l~~~k~~~-------~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~- 127 (298) T protein:vir:94 61 K----THGGVTLAPQTMVPIKVEY-------GARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPRL- 127 (298) T ss_pred c----cccccceeEEEEeeeEEEE-------eeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC- Confidence 0 0011235555555555543 56788898764321 013346677777777777777777774211000 Q ss_pred eeeeccccccCccceeecccc-cccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhccccc Q lcl|Aclame:pro 320 VGKSGFTQTVGSKAGSFDFQD-PVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGIT 398 (524) Q Consensus 320 ~~~~g~~~~~~~~~G~fdl~~-~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~ 398 (524) + .+....|.--+.. ..+... .......++.-+.++...+.. .+.+...+|++|.....|..... T Consensus 128 -g------~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lkd--- 192 (298) T protein:vir:94 128 -G------TASAVIGTNHFDSKVTQKVE---APRGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQKD--- 192 (298) T ss_pred -C------cccccccccccccccccccc---cccccccHHHHHHHHHHhhhh--cCCCccEEEEcHHHHHHHHHhhc--- Confidence 0 0000001000000 000000 000011222334444443333 12355679999999988875321 Q ss_pred ccchhhhcccccccccceeEEEecCcEEEEecCCCCc------ceEEEEEecCCCccceeEeecccccc--cccccCCcc Q lcl|Aclame:pro 399 PASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ------DYFTVGFKGDNEMDAGIYYAPYVALT--PLRGSDPKN 470 (524) Q Consensus 399 ~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~~~fyaPYv~~~--~~~~~dp~s 470 (524) ..+. .....+.+. -..|+|.| ++|++++.-+. +.+++| +-. .++.|...-.+. ..+..||+. T Consensus 193 -~~G~--~l~~~~~~~-~~~~tl~G-~PV~~~~~v~~~~~~~~~~~~~G---dfs--~~~~~~~~~~~~~~~~~~~~~d~ 262 (298) T protein:vir:94 193 -LQGN--ALFPELKWG-ATPDTING-LPVDVNKTVSDMSLTQRDRAIIG---DFA--NGFKWGYAKEVPLEVIQYGDPDN 262 (298) T ss_pred -cCCC--eeecCcccC-CCCceecc-eeeEEecccccccCCCccEEEEe---ecc--ceEEEEEecCceEEEeecCCCcC Confidence 1110 000011111 11367887 69998886542 223333 111 112233221111 111123321 Q ss_pred -----cc-ceeee--eeeeccE-ecCcccccCCCccccccccc Q lcl|Aclame:pro 471 -----FQ-PVMGF--KTRYGIG-INPFANSRSQAPADRITSGM 504 (524) Q Consensus 471 -----~q-P~~~~--~tRY~l~-~nP~~~~~~~~~~~~i~~~~ 504 (524) || =.++| ..|+++. .+| +...++.+.. T Consensus 263 ~~~~~f~~~~v~~r~~~r~~~~~~~~-------~a~~~l~~~t 298 (298) T protein:vir:94 263 SGLDLKGYNQVYIRAELFLGWGILDA-------TKFARVTEAN 298 (298) T ss_pred cchhhhhcCcEEEEEEEEeccEeecc-------cceEEEEecC Confidence 22 12334 5577754 333 1223444444 No 100 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=38.09 E-value=1.1 Score=20.39 Aligned_cols=281 Identities=11% Similarity=-0.049 Sum_probs=97.2 Q ss_pred cccccccccccccccccccccccccccc----cccCCcccccCcccccccccccccccccccccccccchhhhhccccCC Q lcl|Aclame:pro 170 ITTGTAIATGAIVYHIFQETGIAYFQNV----TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNG 245 (524) Q Consensus 170 ~~~gta~~~g~~~~~~~~~~~~~~~~~~----~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~gg 245 (524) +..++....|......+...-...+... ..........+.. ......+.+-+...+| T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~------------~ip~~~~~~~a~wv~E------- 61 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPV------------KGAVFSGVPRAKIVGE------- 61 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCce------------EEEEEeCCcceEEeeC------- Confidence 1111111111111111100000000000 0000000000000 0001112222222334 Q ss_pred CCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeecc Q lcl|Aclame:pro 246 SSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGF 325 (524) Q Consensus 246 ss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~ 325 (524) +..+++...+++++++.+|.-+-....|-||.+|.. .|+..+|.++|..++...|.|.+=..+.. |.... T Consensus 62 --g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~----~~~~~~l~~~i~~~la~ai~~~~d~a~~~----G~~~~ 131 (315) T protein:vir:80 62 --GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADA----DYRLGVLQDLISPALGASIGRAVDLIAFH----GIDPA 131 (315) T ss_pred --CccccccccceeeeEeeeeeEEeeehhhHHHhhcCc----hhHHHHHHHHHHHHHHHHHHHHHhhheee----ccCCC Confidence 234566667777777777776667789999998843 45566666666666666665555433221 11000 Q ss_pred ccccCccceeeccc-ccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhh Q lcl|Aclame:pro 326 TQTVGSKAGSFDFQ-DPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGL 404 (524) Q Consensus 326 ~~~~~~~~G~fdl~-~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~ 404 (524) +.. ...|+-... ...+ .++..-..+.-+.++...+.....+ ..+-.|++|+....|..+..+-. .+.. T Consensus 132 ~~~--~~~~~~~~~~~~~~------~~~~~~~~~~d~~~~~~~~~~~~~~-~~~~~imn~~~~~~L~~l~~~~g--~~~~ 200 (315) T protein:vir:80 132 TGK--AASAVHTSLNKTKN------IVDATDSATADLVKAVGLIAGAGLQ-VPNGVALDPAFSFALSTEVYPKG--SPLA 200 (315) T ss_pred CCc--cccccccccccccc------eeeccccchHHHHHHHHHHhhccCc-cceEEEEcHHHHHHHHHHhhccC--Cccc Confidence 000 011111110 0000 0000001111222333233222222 34568899999988875422111 0100 Q ss_pred hcccccccccceeEEEecCcEEEEecCCCCcc---------eE--------EEEEecCCCccceeEeecccccccccccC Q lcl|Aclame:pro 405 QKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD---------YF--------TVGFKGDNEMDAGIYYAPYVALTPLRGSD 467 (524) Q Consensus 405 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~--------~vG~KG~~~~~~~~fyaPYv~~~~~~~~d 467 (524) ......+... .-.++|.| ++|+++.+.+.+ .+ .+|+.+... +-..+|. | T Consensus 201 g~~~~~~~~~-g~~~tl~G-~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~----i~i~~~~--------~ 266 (315) T protein:vir:80 201 GQPMYPAAGF-AGLDNWRG-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP----IELIEYG--------D 266 (315) T ss_pred cccccccccc-CCCceecc-eeeEecCcCCcccccccccccEEEEeecccEEEEEecCee----EEEeccc--------c Confidence 0000001100 01257888 699998886532 12 222222111 1111221 1 Q ss_pred Cc----c-ccc-eeeee--eeeccE-ecC--cccccCCCccccccccc-hHHhhccch Q lcl|Aclame:pro 468 PK----N-FQP-VMGFK--TRYGIG-INP--FANSRSQAPADRITSGM-ISKEMCGKN 513 (524) Q Consensus 468 p~----s-~qP-~~~~~--tRY~l~-~nP--~~~~~~~~~~~~i~~~~-~~~~~a~~~ 513 (524) ++ + ||. .++|. .|+|.. .+| |.. +.+.. +-..-.+.| T Consensus 267 ~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~---------l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 267 PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAV---------VKEKAAPKPNPPAEN 315 (315) T ss_pred ccCcccchhhcCcEEEEEEEEecceeecccceEE---------EeeccCCCCCCCCCC Confidence 11 1 211 13332 345432 333 111 00000 001111112 No 101 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=37.76 E-value=1.1 Score=20.35 Aligned_cols=273 Identities=11% Similarity=0.022 Sum_probs=113.2 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccc Q lcl|Aclame:pro 151 MFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISV 230 (524) Q Consensus 151 ~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~ 230 (524) |. . ..+..+.+-.+.-.+ ..+. ... . ......+....... +. . ..|...++.. T Consensus 1 Ma--------~-~~T~l~d~i~Pev~~-~~v~-~~~--~----~~~~~~~~~~~~~~----l~-----g-~~G~ti~iP~ 53 (276) T protein:vir:10 1 MA--------Q-GTTTKSTQIVPEVLA-PMMQ-AEL--D----KKLRFAQFADIDST----LV-----G-QPGDTLTFPA 53 (276) T ss_pred CC--------c-ceeehhhhhchHHHH-HHHH-HHH--H----hhhhhcccceeccc----cc-----C-CCCCEEEeee Confidence 10 0 001111100000000 0000 000 0 00000111110000 00 0 0111111111 Q ss_pred cccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 231 GMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVH-GMDADAELSAILATEIMLEINRE 309 (524) Q Consensus 231 GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiH-GLDAEaELsnILStEI~~EINre 309 (524) -=....+|... . +.++..=..+..+.+++.+-|.-.=++| |+-+.. +.|.-.|..+-++..|+..++.+ T Consensus 54 ~~~igda~~~~---e--g~~i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~ 123 (276) T protein:vir:10 54 FVYSGDATVVP---E--GQKIPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDND 123 (276) T ss_pred ecCCCcccccc---C--CCccCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHH Confidence 00111222221 1 1122222233445555555554333333 332222 57999999999999999999999 Q ss_pred HHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhh Q lcl|Aclame:pro 310 IVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSA 389 (524) Q Consensus 310 ii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~ 389 (524) ++..+.....- . .++.+. .+.+-....++.++. ...+++||+|.+++. T Consensus 124 ~~~~l~~~~~~--------~--~~~~~t-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~ 171 (276) T protein:vir:10 124 VLEALRGTKLT--------V--SADIGT-------------LAGLEAAIDTFDDED---------LEPMVLFINPKDAGK 171 (276) T ss_pred HHHHHhccccc--------c--cccccC-------------HHHHHHHHHHhcccc---------CcccEEEEcHHHHHH Confidence 99765432211 0 011111 122222233333221 257899999999999 Q ss_pred hhhh-cccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCccceeEeecccccccccccCC Q lcl|Aclame:pro 390 LARI-DSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDP 468 (524) Q Consensus 390 L~~~-~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp 468 (524) |.-. ...|..++... .+...+..+|.+.| ++|++|...|..-..+--+|.-. |+.. -+.......|+ T Consensus 172 L~k~~~~~f~~~s~~g-----~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gAi~-----~~~~-~~~~vE~dRd~ 239 (276) T protein:vir:10 172 LRSSASDNFTRATELG-----DNIIVKGAFGEALG-AVIVRSKKLDEGEAILAKRGAVK-----LITK-RDFFLETDRDP 239 (276) T ss_pred HHHhcccccccccccc-----ccceeccccceecc-eeEEEcCCCCcceEEEEecccee-----eeec-CCceeecccch Confidence 8531 13343333211 11122334678876 89999999875433222122221 1111 11222223699 Q ss_pred ccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhhccc Q lcl|Aclame:pro 469 KNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGK 512 (524) Q Consensus 469 ~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~a~~ 512 (524) +.++=.|-...+||+.. || ..-.++..+. |..-+|. T Consensus 240 ~~~~d~i~~~~~y~~~~~~~-------~~vv~~t~~~-~~~~~~~ 276 (276) T protein:vir:10 240 STKTTALYSDKHYVAYLYDE-------SKAVKVTKGA-GTTDSGA 276 (276) T ss_pred hhcccEEEEeeEEEEEEEcC-------cceEEEecCC-cCCcCCC Confidence 99999999999998753 33 1111222222 2212222 No 102 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=37.71 E-value=1.1 Score=20.35 Aligned_cols=294 Identities=10% Similarity=0.059 Sum_probs=118.2 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |-++++ .+.-.+-+.+-..... .+ .+.+.. T Consensus 1 ~~~~~~-----------------------~~~~~~~f~~~~~~~~----------------~~-----------~a~~~~ 30 (324) T protein:vir:97 1 MEQTQK-----------------------LKLNLQHFASNNVKPQ----------------VF-----------NPDNVM 30 (324) T ss_pred Cccchh-----------------------HHHHHHHHHHhhhhhh----------------hh-----------cccccc Confidence 222211 1111111111100000 00 000000 Q ss_pred cccccccccccCch-hh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMY 158 (524) Q Consensus 81 ~st~sg~v~~~~P~-li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~f 158 (524) .++++... . |. +. .+++.+..+.+..+++-+.||++.+--| .-.. +.. + +.| T Consensus 31 -~~~~~~~~-i-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i-------p~~~--~~~-----~---------a~~ 84 (324) T protein:vir:97 31 -MHEKKDGT-L-MNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF-------TFWA--DKP-----G---------AYW 84 (324) T ss_pred -ccCCCcce-e-chhHHHHHHHHHHhhcchhhhcceeeccCCceEE-------EEEe--cCc-----c---------eeE Confidence 11111111 1 22 22 3555666778888899999988755211 1110 000 0 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 159 SGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAE 238 (524) Q Consensus 159 SG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aE 238 (524) - +| T Consensus 85 v-----------------------------------------------------------------------------~E 87 (324) T protein:vir:97 85 V-----------------------------------------------------------------------------GE 87 (324) T ss_pred e-----------------------------------------------------------------------------cc Confidence 0 01 Q ss_pred hccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhe Q lcl|Aclame:pro 239 LQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTA 318 (524) Q Consensus 239 al~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a 318 (524) +..+++...++++++...|.-+.-..+|-||.+|-. .|.+++|.+-|+..|...+++.||.---.. T Consensus 88 ---------g~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~- 153 (324) T protein:vir:97 88 ---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN- 153 (324) T ss_pred ---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhccCCCC- Confidence 011223333444555555555555569999999863 567999999999999999999999521100 Q ss_pred eeeeeccccccCccceeeccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccc Q lcl|Aclame:pro 319 QVGKSGFTQTVGSKAGSFDFQDPVD-IRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 397 (524) Q Consensus 319 ~~~~~g~~~~~~~~~G~fdl~~~~~-~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~ 397 (524) ..+.|++....... ...+.. .+..|+++.+.|.. .+.....+||+|.....|..+...- T Consensus 154 -----------~~~~gi~~~~~~~~~~~~~~~-------~~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lkd~~ 213 (324) T protein:vir:97 154 -----------PFGKSIAQSIEKTNKVIKGDF-------TQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE 213 (324) T ss_pred -----------ccCccccccccccceeccccC-------CHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcCC Confidence 01123322111100 000111 12234444444443 2334567899999998887542221 Q ss_pred cccchhhhcccccccccceeEEEecCcEEEEecCCCCc--ceEEEEEecCCCccceeEeecccccccccccCCc------ Q lcl|Aclame:pro 398 TPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ--DYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPK------ 469 (524) Q Consensus 398 ~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~------ 469 (524) .. ....+.. .++|.| ++|++.+..+. ..+++|-. +.+++... ....++..|.. T Consensus 214 g~-------~~~~~~~----~~tl~G-~PV~~~~~~~~~~~~~~~gd~------~~~~i~~~-~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:97 214 TK-------ERIYDRN----SDTLDG-LPVVNLKSSNLKRGELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVK 274 (324) T ss_pred Cc-------eeecCCC----Cccccc-eeeEeecCCCCCcceEEEEec------ccEEEEEe-cCcEEEEeecccccccc Confidence 10 1111111 246777 58887665432 22333311 00111100 11001110000 Q ss_pred --c------c---cceeeeeeeecc-EecC--ccc-----ccCCCccccc Q lcl|Aclame:pro 470 --N------F---QPVMGFKTRYGI-GINP--FAN-----SRSQAPADRI 500 (524) Q Consensus 470 --s------~---qP~~~~~tRY~l-~~nP--~~~-----~~~~~~~~~i 500 (524) . | +=.+=+..||+. ..|| |.. ....+.++++ T Consensus 275 ~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred cccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 0 1 111222345553 2233 110 1111222233 No 103 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=37.52 E-value=1.1 Score=20.33 Aligned_cols=334 Identities=16% Similarity=0.143 Sum_probs=119.2 Q ss_pred CCchHHHHHHhhHhhccccc---------------chhhcchhH--HHHHHHH--HHHHHHHHhccc------------- Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEG---------------LPDIATKSK--KQLVAAI--LEAQEKDAETDP------------- 48 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~---------------~~~i~~~~~--~~~~~~l--~enq~~~~~~~~------------- 48 (524) |.+.++|+++|..+.+.-+. .++|....+ ..+-+++ |+.|.+++..+. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 99888888777665442110 011211100 0011111 223322221110 Q ss_pred cccchhhhhhhcccccccccccccccCccc--------ccccccc-ccccccCchhh--hHHHHHHhhhhhhheeeeecC Q lcl|Aclame:pro 49 VYRDEKIVESFGGFLAEAEIAGDHNYDQTN--------IASGKSS-GAITNIGPAVI--GMVRRAIPNLIAFDICGVQPM 117 (524) Q Consensus 49 ~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~--------~~~st~s-g~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPm 117 (524) .-..++...++..++... +.+........ +..++.+ |.+ .=|.=+ .++++....-+-.++|.|.|+ T Consensus 81 ~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~~~~~~al~~~t~s~gG~--~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~ 157 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHA-ILPNEFEKPSMEAQRLLHALPTGNDSGGDK--LLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred cchhhHHHHHHHHHHHHH-hhhhhhhhhhhhhHHHHHhhccCcCCCCce--eechhHHHHHHHHHHhhchhhhheeeeec Confidence 001111222222222111 11111100000 0011111 111 012211 233333334445677888776 Q ss_pred CchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 118 TGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNV 197 (524) Q Consensus 118 TgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~ 197 (524) ++.+. . +-.++.. + ..| T Consensus 158 ~~~~~--p--~~~~~~~-----------~---------a~~--------------------------------------- 174 (387) T protein:vir:93 158 KGLEI--P--RVSYTLD-----------D---------DDF--------------------------------------- 174 (387) T ss_pred CCceE--E--EEeecCC-----------c---------ccc--------------------------------------- Confidence 54220 0 0000000 0 000 Q ss_pred cccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHH Q lcl|Aclame:pro 198 TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVE 277 (524) Q Consensus 198 ~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~E 277 (524) .+|. ...++...+++.++..++.-+-...+|-| T Consensus 175 --------------------------------------v~E~---------~~~~~~~~~f~~v~~~~~k~~~~~~iS~e 207 (387) T protein:vir:93 175 --------------------------------------ITDV---------ETAKELKLKGDTVKFTTNKFKVFAAISDT 207 (387) T ss_pred --------------------------------------ccCc---------ccccccccccceeeeeheeeeeechhhHH Confidence 0010 00111122333444555555556889999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHH Q lcl|Aclame:pro 278 LAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKAL 357 (524) Q Consensus 278 LAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L 357 (524) |.||- ..|.|++|.+-|+..|..-.|..++-.-+-+.+ +.|++.-.....+.+.. + T Consensus 208 ll~Ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~------------p~g~l~~~~~~~v~~~~--------~ 263 (387) T protein:vir:93 208 VIHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSGL------------DHMSFYNGSVKEVEGAD--------M 263 (387) T ss_pred HHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccc------------cceeeeccccccccccc--------h Confidence 99984 345688999988888886666666532111111 12322211111111111 1 Q ss_pred HHHHHHHHHHHHHhccccCCCEEEEchhh-hhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc Q lcl|Aclame:pro 358 LIQIDKEANEIARQTGRGAGNFIIASRNV-VSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD 436 (524) Q Consensus 358 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~v-a~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 436 (524) +-.|..+-+.+...=+ ..+.| |+++.. ..+|...+.+-.++ | ...+ .+|.| ++||+..+++. T Consensus 264 ~d~i~~~~~~l~~~~~-~~a~~-~mn~~t~~~~~~~~~d~~~~~--~-------~~~~----~~llG-~PV~~~~~~~~- 326 (387) T protein:vir:93 264 YDAIINALADLHEDYR-DNATI-YMRYADYVKIISVLSNGTTNF--F-------DTPA----EKVFG-KPVVFTDAAVK- 326 (387) T ss_pred HHHHHHHHhccChhhh-cCCEE-EEechHHHHHHHHHhcCCCcc--c-------ccCC----ccccc-cceEEecCCCc- Confidence 2233333333433222 24455 555544 44444322111000 0 0011 35776 68888776643 Q ss_pred eEEEEEecCCCccceeEeecccccccccccCCccccceeeeee--eeccE-ecCcccccCCCccccccccchHHhhccch Q lcl|Aclame:pro 437 YFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKT--RYGIG-INPFANSRSQAPADRITSGMISKEMCGKN 513 (524) Q Consensus 437 y~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~t--RY~l~-~nP~~~~~~~~~~~~i~~~~~~~~~a~~~ 513 (524) +++|-- +-||-=|.. +....+.......++|.. |++.. ++|=+ T Consensus 327 -~~~GDf-------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA------------------------ 372 (387) T protein:vir:93 327 -PIVGDF-------NYFGINYDG--TTYDTDKDVKKGEYLFVLTAWYDQQRTLDSA------------------------ 372 (387) T ss_pred -eeeeeh-------hhhheehhh--heeeecccccCCceeEEEEeeeCceeechhh------------------------ Confidence 334421 111111111 111122223345566655 55432 23311 Q ss_pred hhhhhhhcccC Q lcl|Aclame:pro 514 AYFRKVWVKGL 524 (524) Q Consensus 514 ~~~~~~~V~~~ 524 (524) |+.+.||-= T Consensus 373 --~~~l~~k~~ 381 (387) T protein:vir:93 373 --FRIAKAKEN 381 (387) T ss_pred --eEEEEeecC Confidence 111111111 No 104 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=37.51 E-value=1.1 Score=20.33 Aligned_cols=290 Identities=9% Similarity=0.014 Sum_probs=120.6 Q ss_pred HhccccccchhhhhhhcccccccccccccccCccccccc-cccccccccCc-hhh-hHHHHHHhhhhhhheeeeecCCch Q lcl|Aclame:pro 44 AETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASG-KSSGAITNIGP-AVI-GMVRRAIPNLIAFDICGVQPMTGP 120 (524) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~s-t~sg~v~~~~P-~li-~l~Rra~~nLIa~DI~GVQPmTgP 120 (524) ++.+. .+++-+.... +++.+-...=| .+. .+++.+-+..+..++|.+.||+++ T Consensus 1 ~~~~~------------------------~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 56 (318) T protein:vir:24 1 MAAGT------------------------AFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT 56 (318) T ss_pred CCCCC------------------------CCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 22222 1222111111 11111111112 221 344555667788888999998875 Q ss_pred hhhheeeeeeecCCCCCcccccchhhhhcccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 121 TGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSG 200 (524) Q Consensus 121 TGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g 200 (524) +.- |.-.. .+. +| .| T Consensus 57 ~~~-------ip~~~--~~~-----~a---------~~------------------------------------------ 71 (318) T protein:vir:24 57 GQK-------IPHWV--GDV-----SA---------QW------------------------------------------ 71 (318) T ss_pred ceE-------EEEEe--CCc-----ce---------EE------------------------------------------ Confidence 422 11110 000 00 00 Q ss_pred CCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 201 NVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQ 280 (524) Q Consensus 201 ~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQ 280 (524) .+| +.++++...++++++.+.|..+-...+|-||.+ T Consensus 72 -----------------------------------v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ 107 (318) T protein:vir:24 72 -----------------------------------IGE---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVR 107 (318) T ss_pred -----------------------------------ecC---------CccccccccceeEEEEeeEEEEEeehhhHHHhh Confidence 001 012333344556666666666667789999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccc----cccchHHHHHHH Q lcl|Aclame:pro 281 DLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI----RGARWAGESYKA 356 (524) Q Consensus 281 DLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~----~~~~~~~e~~r~ 356 (524) |-. .|.+++|.+.|+..|...|+..++.-... ..+.|++........ ...-+..+.... T Consensus 108 ds~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (318) T protein:vir:24 108 ANP----ANYLGTMRTKVATAFAMAFDGAAMHGTDS-------------PFPTYIGQTTKAISIADTTGATTVYDQVAVN 170 (318) T ss_pred cCh----HHHHHHHHHHHHHHHHHHHHHhhhcccCC-------------CCCcccccccccccccccccccchHHHHHHH Confidence 843 57899999999999999999999842111 011122221111100 011111111222 Q ss_pred HHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhccccccccccee-EEEecCcEEEEecCCCCc Q lcl|Aclame:pro 357 LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVF-AGVLGGTYKVYIDQYARQ 435 (524) Q Consensus 357 L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~-~G~l~~~~~vy~D~y~~~ 435 (524) ++ ..+. -.......+||+|.....|......-..+- |..... ......+ -+.+.| ++|++.+..+. T Consensus 171 ~~-------~~~~--~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l-~~~~~~--~~~~~~~~~~~i~g-~pv~~~~~~~~ 237 (318) T protein:vir:24 171 GL-------SLLV--NDGKKWTHTLLDDITEPILNGAKDQNGRPL-FIESTY--GEAASPFRSGRIVA-RPTILSDHVVE 237 (318) T ss_pred HH-------Hhhc--cccCCCCEEEEcHHHHHHHHHhhccCCcee-ecCccc--cCccccccCceEEE-EeeEEeCCCCC Confidence 22 2222 223355788999999999875422211000 000000 0011111 123443 57777777643 Q ss_pred c--eEEEEEecCCCccceeEeecccccccccc---------cCCc----c-c---cceeeeeeeeccEe-cCcccccCCC Q lcl|Aclame:pro 436 D--YFTVGFKGDNEMDAGIYYAPYVALTPLRG---------SDPK----N-F---QPVMGFKTRYGIGI-NPFANSRSQA 495 (524) Q Consensus 436 d--y~~vG~KG~~~~~~~~fyaPYv~~~~~~~---------~dp~----s-~---qP~~~~~tRY~l~~-nP~~~~~~~~ 495 (524) . .+++| +- +.++|+-.-.+ .++. .|+. + | |=.+=...||+..+ +|=+ T Consensus 238 ~~~~~~~g---df---s~~~~~~~~~l-~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a------ 304 (318) T protein:vir:24 238 GTTVGFMG---DF---SQLIWGQIGGL-SFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEA------ 304 (318) T ss_pred CccEEEEe---ec---ceEEEEEecCe-EEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccc------ Confidence 2 11221 11 11233322111 1111 1111 1 2 23334456777653 3311 Q ss_pred ccccccccchHHhhcc Q lcl|Aclame:pro 496 PADRITSGMISKEMCG 511 (524) Q Consensus 496 ~~~~i~~~~~~~~~a~ 511 (524) .++|. +--+.-..| T Consensus 305 -~~~i~-~~~a~~~~~ 318 (318) T protein:vir:24 305 -FVALT-NVVSGGGEG 318 (318) T ss_pred -eEEEE-eeccCCCCC Confidence 11111 111111111 No 105 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=36.21 E-value=1.2 Score=20.18 Aligned_cols=296 Identities=9% Similarity=0.042 Sum_probs=121.5 Q ss_pred chhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccccccccccccccCchhh-hHH Q lcl|Aclame:pro 21 LPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-GMV 99 (524) Q Consensus 21 ~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~~st~sg~v~~~~P~li-~l~ 99 (524) |.+. +.++...|...... -+.+..+. .+. .++.+++. ..-+.+. .++ T Consensus 1 ~~k~----------~~~~~~~~~~~~~~---------------~~~~~~~a-----~~~-~~~~~~~~-lip~~~~~~ii 48 (324) T protein:vir:99 1 MEQT----------QKLKLNLQHFASNN---------------VKPQVFNP-----DNV-MMHEKKDG-TLLNDFTTPIL 48 (324) T ss_pred CCCc----------hHhhHHHHHHHHHh---------------hhhhhccc-----cce-eccCCCcc-eechhHHHHHH Confidence 1111 01111111111111 00000000 000 01111111 1112222 344 Q ss_pred HHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 100 RRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATG 179 (524) Q Consensus 100 Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g 179 (524) +.+..+.+-.++|.+.||++.+.-|. +.... . + +.| T Consensus 49 ~~~~~~s~l~~~~~~~~~~~~~~~~p----~~~~~-----~-----~---------a~~--------------------- 84 (324) T protein:vir:99 49 QEVMENSKIMRLGKYEPMEGTEKKFT----FWADK-----P-----G---------AYW--------------------- 84 (324) T ss_pred HHHHhhchhhhhcceeeccCCceEEE----EEecC-----c-----c---------eeE--------------------- Confidence 55556777788888888887542111 00000 0 0 000 Q ss_pred cccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEE Q lcl|Aclame:pro 180 AIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRID 259 (524) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIE 259 (524) .+| +..+++...+++ T Consensus 85 --------------------------------------------------------v~E---------g~~~~~~~~~~~ 99 (324) T protein:vir:99 85 --------------------------------------------------------VGE---------GQKIETSKATWV 99 (324) T ss_pred --------------------------------------------------------ecc---------Ccccccccccee Confidence 001 112344445566 Q ss_pred EEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeeccc Q lcl|Aclame:pro 260 KQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQ 339 (524) Q Consensus 260 K~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~ 339 (524) ++++..|.-+---..|-||.+|-. .|.+++|.+.|+..|...+++.||.--... ..+.|++... T Consensus 100 ~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~------------~~~~~~~~~~ 163 (324) T protein:vir:99 100 NATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFGKSIAQSI 163 (324) T ss_pred EEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC------------ccCccccccc Confidence 666666666667789999999974 467999999999999999999998521100 0111222111 Q ss_pred cccc-ccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeE Q lcl|Aclame:pro 340 DPVD-IRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFA 418 (524) Q Consensus 340 ~~~~-~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~ 418 (524) .... ...+.-. +..|.++.+.|.. .+...+.+|++|.....|......- + .....+.. . T Consensus 164 ~~~~~~~~~~~~-------~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d~~----g---~~~~~~~~----~ 223 (324) T protein:vir:99 164 EKTNKVIKGDFT-------QDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE----T---KERIYDRN----S 223 (324) T ss_pred cccceeccccCC-------HHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcCC----C---ceeecCCC----C Confidence 1000 0001111 2233344444432 3345677899999999887642221 1 01111111 2 Q ss_pred EEecCcEEEEecCCCCc--ceEEEEEecCCCccceeEeecccc--------cccccccCCc--------cccceeeeeee Q lcl|Aclame:pro 419 GVLGGTYKVYIDQYARQ--DYFTVGFKGDNEMDAGIYYAPYVA--------LTPLRGSDPK--------NFQPVMGFKTR 480 (524) Q Consensus 419 G~l~~~~~vy~D~y~~~--dy~~vG~KG~~~~~~~~fyaPYv~--------~~~~~~~dp~--------s~qP~~~~~tR 480 (524) ++|.| ++|+|.+..+. ..+++|-.. .+++..--. .......|+. +-+=.+=...| T Consensus 224 ~~l~G-~PVv~~~~~~~~~~~~i~gd~~------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r 296 (324) T protein:vir:99 224 DTLDG-LPVVNLKSSNLKRGELITGDFD------KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred ccccc-eeEEeecCCCCCcceEEEEecc------cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 46777 68888776533 234433221 011110000 0000001111 11122223356 Q ss_pred eccE-ecC--ccc-----ccCCCccccc Q lcl|Aclame:pro 481 YGIG-INP--FAN-----SRSQAPADRI 500 (524) Q Consensus 481 Y~l~-~nP--~~~-----~~~~~~~~~i 500 (524) |+.. .|| |.. ..+....+.| T Consensus 297 ~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 297 VALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred EccEEecccceEEEEeccCCCCCCCCCC Confidence 6633 233 110 1111112222 No 106 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=35.77 E-value=1.2 Score=20.13 Aligned_cols=352 Identities=10% Similarity=0.072 Sum_probs=122.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccc----------cccchhhh-hhhcccccccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDP----------VYRDEKIV-ESFGGFLAEAEIA 69 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~----------~~~~~~~~-~~~~~~l~ea~~~ 69 (524) |-+-+++.+|...+-.. + +..+....+..-..+.+++..+.+.++. ..++.... ......|.+.+- T Consensus 1 ik~L~e~~~e~~e~~~~-~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r- 77 (390) T protein:vir:40 1 MNNLDKKDSETLNISTA-F-LNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDES- 77 (390) T ss_pred CchHHHHHHHHHHHHHH-H-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHH- Confidence 77766666665443221 1 1222222211111111121111111110 00000000 000011111100 Q ss_pred cccccCccccccccccccccccCchh-h-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhh Q lcl|Aclame:pro 70 GDHNYDQTNIASGKSSGAITNIGPAV-I-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREA 147 (524) Q Consensus 70 g~~~~~~~~~~~st~sg~v~~~~P~l-i-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEA 147 (524) .. +.......+++.|.. .=|.- . .+.+.+-..-+-.++|-+.||++....|. +.... . + T Consensus 78 -~~-~~~~~~~~~~~~gg~--lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~----~~~~~-----~-----~- 138 (390) T protein:vir:40 78 -KY-YNEVIAGNGFAGVTA--LLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWII----SVGDV-----A-----T- 138 (390) T ss_pred -HH-HHHHHhccCcccCcc--cccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEE----EEcCC-----c-----c- Confidence 00 000000001111111 11221 1 23333333445567888888877443332 10000 0 0 Q ss_pred hccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccc Q lcl|Aclame:pro 148 FHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAE 227 (524) Q Consensus 148 f~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~ 227 (524) +.|-+. .+ T Consensus 139 --------a~~~~E-------------------------------------------~~--------------------- 146 (390) T protein:vir:40 139 --------AWWGPL-------------------------------------------CA--------------------- 146 (390) T ss_pred --------eeeecc-------------------------------------------cc--------------------- Confidence 000000 00 Q ss_pred ccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 228 ISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEIN 307 (524) Q Consensus 228 ~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EIN 307 (524) +. -..+...|.+..|++.|..+- ...|-||.+|-- .|.|++|.+.|+..|..-+| T Consensus 147 ----------~~----~~~~~~~f~~i~l~~~k~~~~-------i~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~ 201 (390) T protein:vir:40 147 ----------EI----KEVLDNGFDKIQTGMYKLSAY-------IPVCNAMLDLGP----SWLDQYVRTILGEAMALGLE 201 (390) T ss_pred ----------cc----CccccccceeeEeeeeeEEEe-------ehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHH Confidence 00 001123477777777776643 458889999863 46799999999999999999 Q ss_pred HHHHhhhhhheeeeeeccccccCccceeeccccccc------ccccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEE Q lcl|Aclame:pro 308 REIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVD------IRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFII 381 (524) Q Consensus 308 reii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~------~~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 381 (524) +.||. |. | .+.+.|++--..... ....-...+-.-.++..+......-.... ++. -..| T Consensus 202 ~a~l~--------G~-G----~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~-~~~-a~~i 266 (390) T protein:vir:40 202 AGIVN--------GS-G----KDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS-VSD-AILV 266 (390) T ss_pred hhhhc--------cc-C----CCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhh-hcC-ceEE Confidence 99995 10 0 011223321100000 00000000111222222222111111111 123 3346 Q ss_pred Echhh-hhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEe--------cCCCcccee Q lcl|Aclame:pro 382 ASRNV-VSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFK--------GDNEMDAGI 452 (524) Q Consensus 382 ~S~~v-a~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K--------G~~~~~~~~ 452 (524) |+|.- +..|... . .+ .|..+....+.+.-+++|+++++.+.+-++.|-- +....+.+- T Consensus 267 ~n~~t~~~~l~~~-----------~-~~-~d~~G~~v~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~ 333 (390) T protein:vir:40 267 INPADYWSKIYAA-----------T-SY-MTPQGVWVTGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTST 333 (390) T ss_pred EcchhHHHHHHHH-----------h-hc-cCCCCccccccCCCceeEEEcCCCCCCcEEEEeeceEEEEeecceEEEecc Confidence 66654 3333210 0 00 1122222222333457999999887755554422 221111100 Q ss_pred --Eeecc--c-----ccccccccCCccccceeeeeeee-ccEecCcccccCCCccccccccc Q lcl|Aclame:pro 453 --YYAPY--V-----ALTPLRGSDPKNFQPVMGFKTRY-GIGINPFANSRSQAPADRITSGM 504 (524) Q Consensus 453 --fyaPY--v-----~~~~~~~~dp~s~qP~~~~~tRY-~l~~nP~~~~~~~~~~~~i~~~~ 504 (524) +|. + + .-.....+||+.|. ++=++.== .-.+.||....+..+. ..+. T Consensus 334 ~~~f~-~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 390 (390) T protein:vir:40 334 EYRLL-DDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAIDVNVVNNATPSE---TPAE 390 (390) T ss_pred hhhhh-cCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCCcceeeCCCCCC---CCCC Confidence 000 0 0 00011123455443 00011000 0122333332221111 1111 No 107 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=32.86 E-value=1.4 Score=19.79 Aligned_cols=219 Identities=11% Similarity=0.059 Sum_probs=100.7 Q ss_pred ccccccCCcccccCcccccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccc Q lcl|Aclame:pro 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) Q Consensus 195 ~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEY 274 (524) .+ +.+.+.+-+-| .+ ...+|.+.. ...-+..+|+++ .++++.|-+.=.=++ T Consensus 1 ~~--~~~~Gdtit~P-----------------~~-----iGda~~v~e---G~~i~~~~l~~t--~~~atIk~~gk~~~i 51 (231) T protein:vir:73 1 EN--GINLANLCEYP-----------------ND-----IGDAADVAE---GGEISLDKIGTT--TKSVTIKKAAKGTEI 51 (231) T ss_pred Cc--cccCCceEEec-----------------cc-----ccchhhhcC---CCcCChhhcccc--ceeeeEeeeccceee Confidence 00 00011111111 11 122232211 111234455544 444444544333344 Q ss_pred cHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHH Q lcl|Aclame:pro 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) Q Consensus 275 T~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~ 354 (524) |=|- .|.+ +| |--.|..+-|+..|+..++.|++..+..++.-. . ..+. .-+-. T Consensus 52 tD~a--~l~~-~g-Dp~~ea~~Q~~~~iA~kvD~di~~~~~~a~l~~-~----------~~~t------------~d~i~ 104 (231) T protein:vir:73 52 TDEA--ALSG-YG-DPIGESNKQLGLSLANKVDDDLLKAAKTTSQTV-S----------TKAN------------VDGVQ 104 (231) T ss_pred eHHH--Hhhc-cC-chHHHHHHHHHHHHHHhhhHHHHHhhccccccc-c----------cccc------------HHHHH Confidence 3322 2555 33 888999999999999999999997554333211 0 1111 11111 Q ss_pred HHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCC Q lcl|Aclame:pro 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYAR 434 (524) Q Consensus 355 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 434 (524) +.| ..+.++ -....++||+|+++..|...-..+...+.. +.+.- .+-.+|.+.| ++|+++...+ T Consensus 105 ~A~-~~fgde---------~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~---g~~i~--~~G~iG~i~G-~~Vi~S~~~~ 168 (231) T protein:vir:73 105 AAL-DIFNDE---------DAQAYVLIVNPKDAAKIRKDANAKNIGSEV---GANAL--INGTYADVLG-AQIVRSKKLA 168 (231) T ss_pred HHH-HHhccc---------cccceEEEEcchHHHhhhhccchhhhhhhh---cccee--eecccceEcc-eEEEEcCCCC Confidence 111 112221 135689999999999886421111101110 11111 1123577766 8999988776 Q ss_pred cceEEEEEecCCCccceeEeecccc------------cccccccCCccccceeeeeeeeccEecCcccccCCCccccccc Q lcl|Aclame:pro 435 QDYFTVGFKGDNEMDAGIYYAPYVA------------LTPLRGSDPKNFQPVMGFKTRYGIGINPFANSRSQAPADRITS 502 (524) Q Consensus 435 ~dy~~vG~KG~~~~~~~~fyaPYv~------------~~~~~~~dp~s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~ 502 (524) .+ +.++++|+. ...-...|+..+.-.+--.-.|++.. .+.+. T Consensus 169 ~~--------------~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l------~~~~~------ 222 (231) T protein:vir:73 169 EG--------------SALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYL------YDLTK------ 222 (231) T ss_pred CC--------------ceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEE------EcCcc------ Confidence 42 223333321 01111257777777777777777554 11111 Q ss_pred cchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 503 GMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 503 ~~~~~~~a~~~~~~~~~~V~~~ 524 (524) ..++.+||+ T Consensus 223 -------------vv~~t~~g~ 231 (231) T protein:vir:73 223 -------------VVNITFTGV 231 (231) T ss_pred -------------EEEEEeecC Confidence 123444555 No 108 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=31.01 E-value=1.5 Score=19.57 Aligned_cols=335 Identities=16% Similarity=0.177 Sum_probs=117.5 Q ss_pred CCch---HHHHHH-hhHhhccccc-chhhc---chhHHHHHHH----HHHHHHHHHhccccccchhhhhhhccccccccc Q lcl|Aclame:pro 1 MSKK---NELMEK-WNDLLESQEG-LPDIA---TKSKKQLVAA----ILEAQEKDAETDPVYRDEKIVESFGGFLAEAEI 68 (524) Q Consensus 1 m~~~---~~l~~k-w~p~l~~~~~-~~~i~---~~~~~~~~~~----l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~ 68 (524) +... +++.+. =.|+-...++ .+..+ ...+..-..+ +.+..-. +...... ... ....+... T Consensus 53 i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-----~~~--~~~~~~~~ 124 (428) T protein:vir:10 53 MDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGN-LQDAAKF-----ASD--ELNDQSVS 124 (428) T ss_pred HHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhh-HHHHHHH-----hhh--hhhhhhHh Confidence 2211 111111 0011100000 01010 0111111111 1111100 0000000 000 00000000 Q ss_pred ccccccCcccccccccccccc---ccCchhhhHHHHHHhhhhhhhe-eeeecCCchhhhheeeeeeecCCCCCcccccch Q lcl|Aclame:pro 69 AGDHNYDQTNIASGKSSGAIT---NIGPAVIGMVRRAIPNLIAFDI-CGVQPMTGPTGQVFALRAVYGKDPLAGGTPADV 144 (524) Q Consensus 69 ~g~~~~~~~~~~~st~sg~v~---~~~P~li~l~Rra~~nLIa~DI-~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~ 144 (524) . ....++++|.+. .+.+-+|.+.| +..+..++ +-+-|| ++|-+ +++-.. ++. T Consensus 125 ~--------~~~~~~~~gg~liP~~~~~~ii~~l~---~~~~l~~~~~~~~~~--~~g~~-----~~p~~~--~~~---- 180 (428) T protein:vir:10 125 M--------AISTAAGSGGVLIPQNIHSEVIELLR---DRTIVRKLGARSIPL--PNGNM-----SLPRLA--GGA---- 180 (428) T ss_pred h--------hhcccccCCccccchhHHHHHHHHHh---hhchhhhhcceeeec--CCcce-----EEEEEe--CCc---- Confidence 0 000111122211 11122223332 33333443 112222 22211 000000 000 Q ss_pred hhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccc Q lcl|Aclame:pro 145 REAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGT 224 (524) Q Consensus 145 nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~ 224 (524) . T Consensus 181 ----------~--------------------------------------------------------------------- 181 (428) T protein:vir:10 181 ----------T--------------------------------------------------------------------- 181 (428) T ss_pred ----------c--------------------------------------------------------------------- Confidence 0 Q ss_pred cccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|Aclame:pro 225 LAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIML 304 (524) Q Consensus 225 ~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~ 304 (524) . .-.+| +..+++...++++++...|.-+-...+|-||.+|- ..|.++.|.+.|...|.. T Consensus 182 a--------~~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~ 240 (428) T protein:vir:10 182 A--------SYTGE---------NQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAISV 240 (428) T ss_pred e--------eeecc---------CccccccccceeeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHH Confidence 0 00011 11244455566666777777777789999999884 245689999999999999 Q ss_pred HhhHHHHhhhhhheeeeeeccccccCccceeecccccccc-----cccchHHHHHHHHHHHHHHHHHHHHHhccccCCCE Q lcl|Aclame:pro 305 EINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDI-----RGARWAGESYKALLIQIDKEANEIARQTGRGAGNF 379 (524) Q Consensus 305 EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~-----~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~ 379 (524) .+++.||. |. | ....+.|++......+. ...--.......+. .+..+...+.+.-. .... T Consensus 241 ~~d~~~l~--------G~-G---~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~~ 305 (428) T protein:vir:10 241 REDKAFMR--------DD-G---TGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYL-DSIILMSMDGNSNM--ISSG 305 (428) T ss_pred HHHHHHhc--------cC-C---CCccccccccccccccccccccccccccHHHHHHHH-HHHHHhhhcccccc--ccCE Confidence 99998884 10 0 00112243321110000 00000011222222 22223333333222 3355 Q ss_pred EEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc----------------eEEEEEe Q lcl|Aclame:pro 380 IIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD----------------YFTVGFK 443 (524) Q Consensus 380 ~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~vG~K 443 (524) .|++|.....|..+..+ .+. ....+. .-|+|.| ++||++.+.|.+ ++++|.. T Consensus 306 ~v~n~~~~~~L~~lkd~----~G~---~i~~~~----~~g~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~ 373 (428) T protein:vir:10 306 WGMSNRTYMKLFGLRDG----NGN---KVYPEM----AQGMLKG-YPIQRTSAIPANLGEGGKESEIYFADFNDVVIGED 373 (428) T ss_pred EEEcHHHHHHHHHhhcc----CCc---eeccCC----CCCeeec-eeeEEeccccccccCCCccceEEEEecceEEEEEe Confidence 67899988888653211 111 111111 1257877 699998876543 1223333 Q ss_pred cCCCccceeEeecccccccccccCCccc---cceeeeeeeeccEec-CcccccCCCccccccccchH Q lcl|Aclame:pro 444 GDNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGIN-PFANSRSQAPADRITSGMIS 506 (524) Q Consensus 444 G~~~~~~~~fyaPYv~~~~~~~~dp~s~---qP~~~~~tRY~l~~n-P~~~~~~~~~~~~i~~~~~~ 506 (524) +.-+.+ ..+|..........-..| +=.+=...|+++.+. | ++-.+..+-.| T Consensus 374 ~~i~i~----~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p--------~a~~~~t~~~~ 428 (428) T protein:vir:10 374 GNMKVD----FSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHP--------EGLVLGTGVLF 428 (428) T ss_pred cceEEE----eecccccccccccccchhhcchhheeeeeeeCceeecc--------ceEEEEeccCC Confidence 222211 122211111000000011 122335567765552 3 11233444555 No 109 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=30.11 E-value=1.6 Score=19.46 Aligned_cols=272 Identities=11% Similarity=0.007 Sum_probs=110.3 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -++.. +.-..-.-.|-+.++-. ..+-. ....++.......-.. T Consensus 1 ma~~~-T~l~d~iiPev~~~~v~--~~~~~----------------------------------~l~~~~~~~~d~~l~g 43 (274) T protein:vir:12 1 MAQGL-TKTSNQIIPEVLAPMMQ--AQLEK----------------------------------KLRFASFAEVDSTLQG 43 (274) T ss_pred CCcce-eehhhhhchHHHHHHHH--HHHHh----------------------------------hhhhcccceecccccC Confidence 11100 00000011111111100 00000 0000000000000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCh Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 290 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDA 290 (524) ..|....+..-=....+|.. .....-...++..+=. +++-+-|+-.=+++=| ..+.+ +-|- T Consensus 44 ----------~~G~tv~iP~~~~ig~a~~~---~~g~~i~~~~lt~~~~--~~~i~~~~~~~~i~D~--~~~~~--~~d~ 104 (274) T protein:vir:12 44 ----------QPGDTLTFPAFVYSGDAQVV---AEGEKIPTDILETKKR--EAKIRKIAKGTSITDE--ALLSG--YGDP 104 (274) T ss_pred ----------CCCCEEEEeeecCCCccccc---cCCCccchhhccccee--eEEeeeecceeeecHH--HHHhc--ccch Confidence 00111111000001111211 1111122344443333 3333444322222221 22333 4688 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhheeeeeeccccccCccceeecccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 291 DAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIAR 370 (524) Q Consensus 291 EaELsnILStEI~~EINreii~~i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~~~~~~~~e~~r~L~~~i~~~a~~I~~ 370 (524) -.|..+-++..|..+++.+++..+..+..- +. ...+ ..+-+-....++.++. T Consensus 105 ~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~-~~---------~~a~-------------~~d~i~dA~~~lgd~~----- 156 (274) T protein:vir:12 105 QGEQVRQHGLAHANKVDNDVLEALMGAKLT-VN---------ADIT-------------KLNGLQSAIDKFNDED----- 156 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccc-cc---------cccc-------------CHHHHHHHHHHhcccc----- Confidence 889999999999999999999766543211 10 0111 1233333344444332 Q ss_pred hccccCCCEEEEchhhhhhhhhhc-ccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCCcc Q lcl|Aclame:pro 371 QTGRGAGNFIIASRNVVSALARID-SGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMD 449 (524) Q Consensus 371 ~T~~g~gn~~v~S~~va~~L~~~~-~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 449 (524) ..++++||+|.|++.|..-. .-|.+++.... .-..+...|.+.| ++||+|...|..-..+--+|. T Consensus 157 ----~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~-----~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gA---- 222 (274) T protein:vir:12 157 ----LEPMVLFINPLDAGKLRGDASTNFTRATELGD-----DIIVKGAFGEALG-AIIVRSNKLEAGTAILAKKGA---- 222 (274) T ss_pred ----ccccEEEeCHHHHHHHHhhhhhhccccccccc-----cceecccceeecC-eeEEEeCCCCcceEEEEeccc---- Confidence 15689999999999987521 01222222111 1112234688876 899999988753221111121 Q ss_pred ceeEeecccccccccccCCccccceeeeeeeeccEe-cCcccccCCCccccccccchHHhh Q lcl|Aclame:pro 450 AGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEM 509 (524) Q Consensus 450 ~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~~~~ 509 (524) -.||. --+...-...||..++-.+-..-+||+.+ || ..-.+++.++-.-.| T Consensus 223 -~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-------~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 223 -VKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDE-------SKAVKITKGSGSLEM 274 (274) T ss_pred -eeeee-cCCceeccccchhhcccEEEeeeEEEEEEEcC-------CceEEEEcCCccccC Confidence 11222 11222222469999999999999999654 44 111122221111122 No 110 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=26.23 E-value=2 Score=18.97 Aligned_cols=311 Identities=14% Similarity=0.079 Sum_probs=114.4 Q ss_pred ecCCCCCcccccchhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCcc Q lcl|Aclame:pro 131 YGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPA 210 (524) Q Consensus 131 Y~~~~~~~gteA~~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~ 210 (524) -+++. +|. .. .-..+++|..+..- +. ..-...+.+.. .|+... +....+..- +..+ T Consensus 1 ~~~~~--~~~-----~~-----~t~~g~~~~~~~~~-al---~ie~~~g~V~~-~f~~~s-~~~~~v~~r--~~~~---- 56 (347) T protein:vir:33 1 MANIQ--GGQ-----QI-----GTNQGKGQSAADKL-AL---FLKVFGGEVLT-AFARTS-VTMPRHMLR--SIAS---- 56 (347) T ss_pred CCCCc--cCc-----cc-----ccccccCCcccchH-HH---HHHHHHHHHHH-HHHHHH-hhhhhhccc--cccc---- Confidence 11110 000 00 00112222111100 00 00000111110 111110 111111000 0000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCC-CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCC Q lcl|Aclame:pro 211 ALDAAVIAENEKGTLAEISVGMATSVAELQENFNGS-SANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMD 289 (524) Q Consensus 211 ~~~~~~~~~~~~g~~~~~~~GmtTs~aEal~~~ggs-s~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLD 289 (524) .+..-.......+...+.. ++.+ .++ .+.+..|+-++||++- -+...++-.-+.++ | .| T Consensus 57 -G~sv~i~~iG~~t~~~~~~------g~~l---~~~~~~~~~~e~~ltiD~~~--------y~~~~VddiD~~q~-~-~D 116 (347) T protein:vir:33 57 -GKSAQFPVIGRTKAAYLKP------GENL---DDKRKDIKHTEKVIHIDGLL--------TADVLIYDIEDAMN-H-YD 116 (347) T ss_pred -cceeEeeeccceeeeeecC------CCCC---CCCCCCCccceEEEEechhh--------hhhHHHhhHHHHhc-C-Cc Confidence 0111111111111111111 1111 111 1234567778888653 34556776677777 4 78 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhheeeeee--ccccccCccceeecccccccccccchHHH-HHHHHHHHHHHHHH Q lcl|Aclame:pro 290 ADAELSAILATEIMLEINREIVDLINYTAQVGKS--GFTQTVGSKAGSFDFQDPVDIRGARWAGE-SYKALLIQIDKEAN 366 (524) Q Consensus 290 AEaELsnILStEI~~EINreii~~i~~~a~~~~~--g~~~~~~~~~G~fdl~~~~~~~~~~~~~e-~~r~L~~~i~~~a~ 366 (524) -..|++.-....++..+++-|+..|..-...-.. ...+.. ...+.+..... ..+.-|..+ -...+|..|.+... T Consensus 117 ~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~--~tg~~~d~~~~a~~i~~~i~~a~~ 193 (347) T protein:vir:33 117 VRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGL-GKPTVLTLVKP--TTGSLTDPVELGKAIIAQLTIARA 193 (347) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc-ccccccccccc--ccccccchhhhHHHHHHHHHHHHH Confidence 8999999999999999999998655321111000 000000 01111111111 011122222 22333433333333 Q ss_pred HHHHhccccCCCEEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCC Q lcl|Aclame:pro 367 EIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDN 446 (524) Q Consensus 367 ~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~ 446 (524) ..-.+=---.|-|+|++|+.-.+|-.....+..... ..++.....+|.+.| ++||.-+..|.-.++- ... T Consensus 194 ~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~------~~~~~~~G~V~~i~G-~~V~~Sn~lp~~~~~~---~~~ 263 (347) T protein:vir:33 194 SLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQ------ALLDPERGTIRNVMG-FEVVEVPHLTAGGAGD---TRE 263 (347) T ss_pred HHhhcCCCccCcEEEeCHHHHHHHhccccccccccc------cccccccceeEEEec-eeEEEecccccCcccc---ccc Confidence 322222222568999999999998765433311111 122344456788877 9999999877643220 111 Q ss_pred Cccc----------------------eeEeecccc----c---ccccccCCccccceeeeeeeeccEe-cCcccccCCCc Q lcl|Aclame:pro 447 EMDA----------------------GIYYAPYVA----L---TPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAP 496 (524) Q Consensus 447 ~~~~----------------------~~fyaPYv~----~---~~~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~ 496 (524) ...+ ||||.|=.. + ..-+..|++.|-=.|=-+..||..+ +|=.- T Consensus 264 ~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~a------ 337 (347) T protein:vir:33 264 DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAA------ 337 (347) T ss_pred cccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhhhHhhhhhhhcCCceecccce------ Confidence 1112 333332211 1 1111233433333333333333211 12000 Q ss_pred cccccccchHHhhccchhhhhhhhcccC Q lcl|Aclame:pro 497 ADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) Q Consensus 497 ~~~i~~~~~~~~~a~~~~~~~~~~V~~~ 524 (524) .-+..|.| T Consensus 338 --------------------v~i~~~~~ 345 (347) T protein:vir:33 338 --------------------GAIVLPKV 345 (347) T ss_pred --------------------EEEecCCC Confidence 00000111 No 111 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=23.77 E-value=2.3 Score=18.64 Aligned_cols=344 Identities=9% Similarity=0.023 Sum_probs=112.2 Q ss_pred CCch----HHHHHHhhHhhcccc-cchhhcchhHHHHHHHHHHHHHHHHhccc--cccchhhhhhhc--------ccccc Q lcl|Aclame:pro 1 MSKK----NELMEKWNDLLESQE-GLPDIATKSKKQLVAAILEAQEKDAETDP--VYRDEKIVESFG--------GFLAE 65 (524) Q Consensus 1 m~~~----~~l~~kw~p~l~~~~-~~~~i~~~~~~~~~~~l~enq~~~~~~~~--~~~~~~~~~~~~--------~~l~e 65 (524) |.-+ +++.||=..+++..+ +..+. ....-...++++.++++.... .+++........ .++++ T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e---~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~ 77 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPE---EQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFND 77 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHH Confidence 4433 345555444444211 01110 011111112222222211110 111111111001 11111 Q ss_pred cccccccccCccccccccccccccccCch-hh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccc Q lcl|Aclame:pro 66 AEIAGDHNYDQTNIASGKSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPAD 143 (524) Q Consensus 66 a~~~g~~~~~~~~~~~st~sg~v~~~~P~-li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~ 143 (524) ... .++.++. .-.=|. ++ .+++.....=.-..+|-|+|+++++ |-.+.... + T Consensus 78 ~~~------------~~~~~~g-g~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~------~i~~~~~~---~---- 131 (377) T protein:vir:96 78 IDK------------NVGGKDK-FKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRL------KALTAETS---G---- 131 (377) T ss_pred HHh------------cCCCCCC-ceecCHHHHHHHHHHHHhhhhhhhhceeEecCCce------EEEEecCC---c---- Confidence 110 0011110 001132 22 2222222223445578888887652 22221110 0 Q ss_pred hhhhhccccccccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccc Q lcl|Aclame:pro 144 VREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKG 223 (524) Q Consensus 144 ~nEAf~~~~~~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g 223 (524) .+.|.+. . T Consensus 132 -----------~a~wv~e-------------------------------------------~------------------ 139 (377) T protein:vir:96 132 -----------TAVWGDI-------------------------------------------F------------------ 139 (377) T ss_pred -----------ceeEeec-------------------------------------------c------------------ Confidence 0011000 0 Q ss_pred ccccccccccchhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|Aclame:pro 224 TLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIM 303 (524) Q Consensus 224 ~~~~~~~GmtTs~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~ 303 (524) +|. -.+....|.++.|..-|... ....|-||.+| -.+|.|++|.+-|+..|. T Consensus 140 -------------~~~----~~~~~~~f~~i~l~~~kl~~-------~~~is~~ll~d----s~~~le~~i~~~l~~~~~ 191 (377) T protein:vir:96 140 -------------GEI----KGQLKQAFKEQDFSQFKLTA-------FVVIPKDALKF----GPKWLKQFITEQLKEAIA 191 (377) T ss_pred -------------ccc----ccccCccceeEeeeeeeEEe-------echhhHHHhhc----chhhHHHHHHHHHHHHHH Confidence 000 00112346777777666654 23467777666 467889999999999999 Q ss_pred HHhhHHHHh---------hhhhheeeeeeccccccCccceeecccc-cc--cccccchHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 304 LEINREIVD---------LINYTAQVGKSGFTQTVGSKAGSFDFQD-PV--DIRGARWAGESYKALLIQIDKEANEIARQ 371 (524) Q Consensus 304 ~EINreii~---------~i~~~a~~~~~g~~~~~~~~~G~fdl~~-~~--~~~~~~~~~e~~r~L~~~i~~~a~~I~~~ 371 (524) .-+++.||. .++..+...+..- ....+ .++.+... .. ...+.....+.+..|+..+-.... .. T Consensus 192 ~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~---~~ 266 (377) T protein:vir:96 192 VALELAIVKGNGLLQPVGLLKDLSQPTVDQS-TGRDI-TTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDK---KH 266 (377) T ss_pred HHHhhceEeccCCCcceeeeecccccccccc-ccccc-cceeeccccccccccCChhHHHHHHHHHHHhhccccc---cc Confidence 999999985 2222111111100 00000 01111000 00 001111122222222222111111 11 Q ss_pred ccccCCC-EEEEchhhhhhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceEEEEEecCCC--c Q lcl|Aclame:pro 372 TGRGAGN-FIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNE--M 448 (524) Q Consensus 372 T~~g~gn-~~v~S~~va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~--~ 448 (524) ..+..|+ +.++.|....-+. .. .-|.+..+ .+.-.|.=.++|..++..|.+-++.|..+..- . T Consensus 267 ~~~~~~~a~~~mn~~t~~~~~---~~----~~~~~~~G-------~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~ 332 (377) T protein:vir:96 267 PLKIAGQVKLLLNPEDRWTLE---AK----FTSRNQFG-------EYVTVLPHGITILESLAVETGKAIAFVANRYDAFM 332 (377) T ss_pred cccccCceEEEEchhhHHhcc---cc----ccccCCCC-------CceeccCCCceEEecCCCCcccEEEEEcCcEEEEE Confidence 1111122 3556665432221 11 11111111 11222222456666666666555555432210 0 Q ss_pred cceeEeecccccccccccCCccccceeeeeeeec-cEecCcccccCCCccccccccchHHhhcc Q lcl|Aclame:pro 449 DAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG-IGINPFANSRSQAPADRITSGMISKEMCG 511 (524) Q Consensus 449 ~~~~fyaPYv~~~~~~~~dp~s~qP~~~~~tRY~-l~~nP~~~~~~~~~~~~i~~~~~~~~~a~ 511 (524) ..++=...+.+..+.. -|=.+=.+.|++ ..++| ++..|.+= ..| T Consensus 333 r~~~~i~~~~~~~~~~------d~~~f~~~~r~dG~~~d~--------~a~~vl~l-----~~~ 377 (377) T protein:vir:96 333 ATASTIEEYDQTFAME------DLQLYLTKNYFYGKAKDN--------HTAALLTL-----AGG 377 (377) T ss_pred ecccEEEeehhhhhhc------CCeEEEEEEEEcCEEecC--------CcEEEEEE-----ecC Confidence 0111112221111111 111222333332 11222 11111110 001 No 112 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=23.27 E-value=2.3 Score=18.57 Aligned_cols=294 Identities=10% Similarity=0.044 Sum_probs=124.1 Q ss_pred CCchHHHHHHhhHhhcccccchhhcchhHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccCccccc Q lcl|Aclame:pro 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) Q Consensus 1 m~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~~~~~~ 80 (524) |-+. .|.+.-.+-+.+- .. .++.+. +. +. T Consensus 1 ~~~~-----------------------~~~~~~~~~f~~~------~~----------~~~~~~-a~----------~~- 29 (324) T protein:vir:10 1 MEQT-----------------------QKLKLNLQHFASN------NV----------KPQVFN-PD----------NV- 29 (324) T ss_pred CCCc-----------------------hHHHHHHHHHHHH------hh----------ccceec-cc----------ce- Confidence 2211 1211111111100 00 000110 10 00 Q ss_pred cccccccccccCch-hh-hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhcccccccccc Q lcl|Aclame:pro 81 SGKSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMY 158 (524) Q Consensus 81 ~st~sg~v~~~~P~-li-~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~~dt~f 158 (524) .++.+++. .. |. +. .+++.+..+.+..++|-+.||++.+.-| .-.. ++. + +.| T Consensus 30 ~~~~~~~~-li-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-------p~~~--~~~-----~---------a~~ 84 (324) T protein:vir:10 30 MMHEKKDG-TL-LNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF-------TFWA--DKP-----G---------AYW 84 (324) T ss_pred eccCCCcc-ee-chhHHHHHHHHHHhhchhhhhcceeeccCCceEE-------EEEe--CCc-----c---------eeE Confidence 01111111 01 22 22 3455555677778888888888754221 1100 000 0 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCCcccccCcccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 159 SGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAE 238 (524) Q Consensus 159 SG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~GmtTs~aE 238 (524) .+| T Consensus 85 -----------------------------------------------------------------------------v~E 87 (324) T protein:vir:10 85 -----------------------------------------------------------------------------VGE 87 (324) T ss_pred -----------------------------------------------------------------------------ecc Confidence 001 Q ss_pred hccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhe Q lcl|Aclame:pro 239 LQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTA 318 (524) Q Consensus 239 al~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~~a 318 (524) +.++++...+++++++..|..+..-..|-||.+|-. .|.+++|.+.|+..|...+++.+|.--.... T Consensus 88 ---------g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~ 154 (324) T protein:vir:10 88 ---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) T ss_pred ---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc Confidence 112344445667777777777777889999999864 4679999999999999999999985211110 Q ss_pred eeeeeccccccCccceeecccccccc-cccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhhhhcccc Q lcl|Aclame:pro 319 QVGKSGFTQTVGSKAGSFDFQDPVDI-RGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 397 (524) Q Consensus 319 ~~~~~g~~~~~~~~~G~fdl~~~~~~-~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~~~g~ 397 (524) .+.|++........ ..+.-. +..|.++.+.|.. .+...+.+|++|.....|..+...- T Consensus 155 ------------~~~~i~~~~~~~~~~~~~~~t-------~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d~~ 213 (324) T protein:vir:10 155 ------------FGKSIAQSIEKTNKVIKGDFT-------QDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE 213 (324) T ss_pred ------------cCccccccccccceeccccCC-------HHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhccC Confidence 11122221111000 001111 2223334444432 3346677899999998887542221 Q ss_pred cccchhhhcccccccccceeEEEecCcEEEEecCCCCc--ceEEEEEecCCCccceeEeecccccccccc---------c Q lcl|Aclame:pro 398 TPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ--DYFTVGFKGDNEMDAGIYYAPYVALTPLRG---------S 466 (524) Q Consensus 398 ~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~---------~ 466 (524) + .....+.. .++|.| ++|++.+..+. ..+++|-. +.+++... ....++. . T Consensus 214 ----g---~~~~~~~~----~~~l~G-~PV~~~~~~~~~~~~~~~gd~------~~~~~~~~-~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:10 214 ----T---KERIYDRN----SDTLDG-LPVVNLKSSNLKRGELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVK 274 (324) T ss_pred ----C---ceeecCCC----Cccccc-eeEEeecCCCCCcceEEEEec------ccEEEEEe-cCcEEEEeecccccccc Confidence 1 01111111 245776 58888776532 23443321 01111111 0000000 1 Q ss_pred CCc--------cccceeeeeeeeccE-ecC--ccc-----ccCCCccccc Q lcl|Aclame:pro 467 DPK--------NFQPVMGFKTRYGIG-INP--FAN-----SRSQAPADRI 500 (524) Q Consensus 467 dp~--------s~qP~~~~~tRY~l~-~nP--~~~-----~~~~~~~~~i 500 (524) |+. +-+=.+=...||+.. .|| |.. ..+.+.+++| T Consensus 275 ~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred cccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 111 112233334567653 334 111 0111112222 No 113 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=21.42 E-value=2.6 Score=18.30 Aligned_cols=292 Identities=14% Similarity=0.082 Sum_probs=107.2 Q ss_pred cccccccccccccccccccccCCcccccCccccccccc-----ccc-------ccccccccccc-------ccchhhhhc Q lcl|Aclame:pro 180 AIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVI-----AEN-------EKGTLAEISVG-------MATSVAELQ 240 (524) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~-----~~~-------~~g~~~~~~~G-------mtTs~aEal 240 (524) .... +..+....+...++ ...+...+..... ... .....+++..| +...+++.. T Consensus 1 ma~~---~~~~~~~t~~~~~~----~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~ 73 (347) T protein:vir:15 1 MANI---QGGQQIGTNQGKGQ----SAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYL 73 (347) T ss_pred CCcc---ccCCccccccccCC----CcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeee Confidence 1000 00000000000000 0000000000000 000 00000000000 001111110 Q ss_pred --c-ccCCC-CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|Aclame:pro 241 --E-NFNGS-SANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINY 316 (524) Q Consensus 241 --~-~~ggs-s~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~i~~ 316 (524) + ...++ ......|+-+.||.+.. +..-|+-.-+.++ | .|-..|+..-....++..+++-|+..|.. T Consensus 74 ~~g~~l~~~~~~~~~~e~~ltID~~~~--------~~~~VddlD~~q~-~-~D~~~~~~~~~g~aLA~~~D~~i~~~l~~ 143 (347) T protein:vir:15 74 KPGENLDDKRKDIKHTEKVIHIDGLLT--------ADVLIYDIEDAMN-H-YDVRAEYTAQLGESLAMAADGAVLAELAG 143 (347) T ss_pred ccCCCCCCCCCCCccceEEEEechhhh--------hhHHhhhHHHHhc-C-CcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 00111 01234556666665321 2233333333333 3 58888888888999999999999976532 Q ss_pred heee---eeeccccccCccceeecccccccccccc-hHHH-HHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhhhhhh Q lcl|Aclame:pro 317 TAQV---GKSGFTQTVGSKAGSFDFQDPVDIRGAR-WAGE-SYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALA 391 (524) Q Consensus 317 ~a~~---~~~g~~~~~~~~~G~fdl~~~~~~~~~~-~~~e-~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~ 391 (524) -+.. .+.+ ....+. .++...... ..+. ..++ .+..++..+-+.....-.+=---.|-|+|++|+...+|- T Consensus 144 ~~~~~~~~~~~-~~~~g~-~~~~~~~~~---~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL 218 (347) T protein:vir:15 144 LVNLPDASNEN-IEGLGK-PTVLTLVKP---TTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAIL 218 (347) T ss_pred Hhhcccccccc-ccccCc-ccccccccc---ccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHh Confidence 2111 1111 000000 111111111 0111 1112 223333333222222222222225789999999999987 Q ss_pred hhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcceE-------EEEEecCC------------Ccccee Q lcl|Aclame:pro 392 RIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYF-------TVGFKGDN------------EMDAGI 452 (524) Q Consensus 392 ~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~-------~vG~KG~~------------~~~~~~ 452 (524) .....+ .... ....+.....+|.|.| ++||.-+.-|.... +.|-+... .-..+| T Consensus 219 ~~~~~~-~~d~-----~~~~~~~~G~Vg~i~G-~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l 291 (347) T protein:vir:15 219 AALMPN-AANY-----QALIDHERGTIRNVMG-FEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGL 291 (347) T ss_pred cccccc-cccc-----cccccccceEEEEEec-eEEEecccccccccccccccccccccccccccccceeeeccccceee Confidence 643332 1111 1112234456788876 99999888765321 22222110 012456 Q ss_pred Eeecccc----ccc---ccccCCccccceeeeeeeeccEe-cCcccccCCCccccccc Q lcl|Aclame:pro 453 YYAPYVA----LTP---LRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITS 502 (524) Q Consensus 453 fyaPYv~----~~~---~~~~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~ 502 (524) ||.|... ++. .+..|+..|-=.|=-+..||..+ +|=.-..= .--||-. T Consensus 292 ~~h~~A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~--~~~~~~~ 347 (347) T protein:vir:15 292 FQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAI--VLPKVSE 347 (347) T ss_pred eeccceeeeeEeeceeeeecccchhhhhhhehhhhcCCceeccccEEEE--ecCCCCC Confidence 6666532 221 12245555555444444555322 22100000 0001111 No 114 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=20.38 E-value=2.8 Score=18.15 Aligned_cols=333 Identities=14% Similarity=0.155 Sum_probs=114.4 Q ss_pred CCch--HHHHHHhhH--hhcccccchhhcchh-HHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccC Q lcl|Aclame:pro 1 MSKK--NELMEKWND--LLESQEGLPDIATKS-KKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYD 75 (524) Q Consensus 1 m~~~--~~l~~kw~p--~l~~~~~~~~i~~~~-~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~g~~~~~ 75 (524) |.-+ +..++++.+ .+...| .++-+..- .|.+. .|...+ ....+..++.. ..+.+... . T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~-~~~~kg~~~~~~~~-a~a~~~------g~~~~a~~~a~---~~~~~~~~------~ 63 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEE-LQQYKGAGMTRMVM-SIAAGK------GNLADAAKFAA---TELGDTGL------S 63 (366) T ss_pred Ccccccccccccccccccccccc-cccccchhHHHHHH-HHHhcc------cchhHHHHHHH---Hhhcchhh------h Confidence 2111 111112210 000000 01100000 01111 111110 00000000000 00111100 0 Q ss_pred ccccccccccccccccCchhh--hHHHHHHhhhhhhheeeeecCCchhhhheeeeeeecCCCCCcccccchhhhhccccc Q lcl|Aclame:pro 76 QTNIASGKSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFA 153 (524) Q Consensus 76 ~~~~~~st~sg~v~~~~P~li--~l~Rra~~nLIa~DI~GVQPmTgPTGLIFAMRsrY~~~~~~~gteA~~nEAf~~~~~ 153 (524) ..+..++++|.+. =|.-+ .+++++-+..+...+ |++.+.+++|-+ +|+-.. ++. T Consensus 64 -~a~~~~~~~Gg~l--vP~~~~~~ii~~l~~~s~l~~l-g~~~v~~~~g~~-----~~p~~t--~~~------------- 119 (366) T protein:vir:57 64 -MAISTAAGSGGAL--IPQNMQNEVIELLRDRTVVRIL-GARSIPLPNGNL-----SMPRLS--GGA------------- 119 (366) T ss_pred -hhccccccCCccc--cchhHHHHHHHHHhhhcchhhh-ceeeeecCCCce-----EEEEEe--CCc------------- Confidence 0011111122210 02211 122221122222211 111111111100 000000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccCCcccccCccccccccccccccccccccccccc Q lcl|Aclame:pro 154 PDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMA 233 (524) Q Consensus 154 ~dt~fSG~g~~~~~s~~~~gta~~~g~~~~~~~~~~~~~~~~~~~~g~~~~tgt~p~~~~~~~~~~~~~g~~~~~~~Gmt 233 (524) ..+ T Consensus 120 ----------------------------------------------------------------------~a~------- 122 (366) T protein:vir:57 120 ----------------------------------------------------------------------TAG------- 122 (366) T ss_pred ----------------------------------------------------------------------cee------- Confidence 000 Q ss_pred chhhhhccccCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh Q lcl|Aclame:pro 234 TSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDL 313 (524) Q Consensus 234 Ts~aEal~~~ggss~~~f~EMsFsIEK~tVtAKSRALKAEYT~ELAQDLkAiHGLDAEaELsnILStEI~~EINreii~~ 313 (524) -.+| +..+++...+++++++..|.-+-...+|-||.+|-. .|.|+.|.+-|+..|...+++.||.= T Consensus 123 -wv~E---------~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G 188 (366) T protein:vir:57 123 -YVGE---------GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG----FNVEQLLLGDILSAIATREDKAFLRD 188 (366) T ss_pred -eecc---------CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 0012 112344445566777777777777789999998853 46799999999999999999998851 Q ss_pred hhhheeeeeeccccccCccceeecccccccc------cccchHHHHHHHHHHHHHHHHHHHHHhccccCCCEEEEchhhh Q lcl|Aclame:pro 314 INYTAQVGKSGFTQTVGSKAGSFDFQDPVDI------RGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVV 387 (524) Q Consensus 314 i~~~a~~~~~g~~~~~~~~~G~fdl~~~~~~------~~~~~~~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va 387 (524) . |- ...+.|++......+. ....|. ....+ ++.+.........+......|++|... T Consensus 189 --------~-G~---~~~p~Gi~~~~~~~~~~~~~~~t~~~~~--~~~~~---~~~~~~~~~~~~~~~~~a~~vmn~~~~ 251 (366) T protein:vir:57 189 --------D-GT---GDTPKGMKAVATAANRLVAWTGTAINLT--TIDEY---LDSLILKHMDSNSNMIRCGWGLSNRTY 251 (366) T ss_pred --------C-CC---Cccccceeeccccccceeeccccccchh--hHHHH---HHHHHHhhhccccccccCEEEecHHHH Confidence 0 00 0012233322211100 001111 11111 111112222222333456678999998 Q ss_pred hhhhhhcccccccchhhhcccccccccceeEEEecCcEEEEecCCCCcc----------------eEEEEEecCCCccce Q lcl|Aclame:pro 388 SALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD----------------YFTVGFKGDNEMDAG 451 (524) Q Consensus 388 ~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~vG~KG~~~~~~~ 451 (524) ..|..... ..+. ..-.+.+ -|+|.| |+|+++.+.|.+ ++++|-.+..+.+ T Consensus 252 ~~L~~lkd----~~G~---~l~~~~~----~g~l~G-~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~-- 317 (366) T protein:vir:57 252 MTLFGLRD----GNGN---KVYPEMS----QGILKG-YPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVD-- 317 (366) T ss_pred HHHHhhhc----cCCc---eeccCCC----CCeecc-eeeEEccccccccccCCCccEEEEEecceEEEEEecceEEE-- Confidence 88865321 1110 1111221 257877 799998876542 1222222222211 Q ss_pred eEeecccccccccc---cCCccccceeeeeeeeccEe-cCcccccCCCccccccccchH Q lcl|Aclame:pro 452 IYYAPYVALTPLRG---SDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMIS 506 (524) Q Consensus 452 ~fyaPYv~~~~~~~---~dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~~~~~ 506 (524) ..++.....-.. ..=.+-+=.+=...||++.+ +| ++--+..|-.| T Consensus 318 --~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~--------~a~~~lt~~~~ 366 (366) T protein:vir:57 318 --FSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHP--------EGLVLGTGVIW 366 (366) T ss_pred --EeeccccccccccchhhhhcCceeEEeeeeeCcEeecc--------ccEEEEecccC Confidence 011100000000 00001112333455666544 12 22234455566 Done!