Query lcl|Aclame:protein:vir:107947|NCBI_annot:gp23 major head protein|genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Match_columns 519 No_of_seqs 163 out of 436 Neff 5.1 Searched_HMMs 1612 Date Mon Dec 2 18:32:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_46 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_46_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107947 Length: 519 100.0 3E-261 2E-264 1449.1 37.3 519 1-519 1-519 (519) 2 protein:vir:6901 Length: 522 # 100.0 2E-257 1E-260 1428.0 37.2 518 1-519 5-522 (522) 3 protein:vir:103463 Length: 521 100.0 3E-254 2E-257 1410.4 37.2 518 1-519 4-521 (521) 4 protein:vir:80986 Length: 528 100.0 3E-254 2E-257 1410.4 36.1 518 1-519 2-528 (528) 5 protein:vir:7214 Length: 521 # 100.0 1E-253 8E-257 1407.0 37.2 518 1-519 4-521 (521) 6 protein:vir:100603 Length: 529 100.0 1E-252 9E-256 1401.3 35.9 518 1-519 3-529 (529) 7 protein:vir:98143 Length: 524 100.0 1E-250 7E-254 1391.1 36.4 518 1-519 2-524 (524) 8 protein:vir:6601 Length: 528 # 100.0 3E-250 2E-253 1388.6 36.3 518 1-519 2-528 (528) 9 protein:vir:101039 Length: 529 100.0 5E-249 3E-252 1382.1 36.6 518 1-519 3-529 (529) 10 protein:vir:101811 Length: 529 100.0 7E-249 4E-252 1381.0 37.2 518 1-519 3-529 (529) 11 protein:vir:106286 Length: 534 100.0 3E-247 2E-250 1372.2 37.6 519 1-519 1-534 (534) 12 protein:vir:5670 Length: 514 # 100.0 3E-239 2E-242 1328.2 34.3 510 4-519 1-514 (514) 13 protein:vir:104915 Length: 470 100.0 1E-221 7E-225 1232.1 33.8 459 1-519 4-469 (470) 14 protein:vir:106998 Length: 468 100.0 9E-221 6E-224 1226.8 34.5 457 1-519 2-467 (468) 15 protein:vir:104549 Length: 462 100.0 4E-218 3E-221 1212.4 33.5 456 1-519 1-461 (462) 16 protein:vir:103181 Length: 457 100.0 9E-215 6E-218 1194.1 34.0 451 1-519 1-456 (457) 17 protein:vir:5942 Length: 523 # 100.0 1E-193 9E-197 1077.7 32.4 447 1-499 1-523 (523) 18 protein:vir:6601 Length: 528 # 98.1 1.3E-07 7.8E-11 58.4 10.6 420 1-474 26-528 (528) 19 protein:vir:4953 Length: 397 # 96.8 0.00034 2.1E-07 39.6 16.0 333 1-505 1-397 (397) 20 protein:vir:81227 Length: 413 96.2 0.00091 5.6E-07 37.3 18.7 358 1-505 1-413 (413) 21 protein:vir:41 Length: 299 # N 96.1 0.00097 6E-07 37.1 16.3 278 70-506 1-299 (299) 22 protein:vir:1886 Length: 385 # 95.9 0.0014 8.4E-07 36.3 19.4 345 1-505 1-385 (385) 23 protein:vir:191 Length: 385 # 95.9 0.0014 8.4E-07 36.3 19.4 345 1-505 1-385 (385) 24 protein:vir:8420 Length: 477 # 95.7 0.0015 9.5E-07 36.0 21.4 364 1-502 66-477 (477) 25 protein:vir:1268 Length: 397 # 95.6 0.0017 1.1E-06 35.8 15.4 328 1-517 40-397 (397) 26 protein:vir:4830 Length: 397 # 94.9 0.0032 2E-06 34.3 16.5 333 1-505 1-397 (397) 27 protein:vir:10364 Length: 390 94.9 0.0033 2E-06 34.2 20.2 347 1-502 1-390 (390) 28 protein:vir:1433 Length: 435 # 94.7 0.0036 2.2E-06 34.0 23.5 353 1-502 1-435 (435) 29 protein:vir:4997 Length: 397 # 94.5 0.0043 2.7E-06 33.6 16.8 330 1-507 1-397 (397) 30 protein:vir:78523 Length: 338 93.8 0.0063 3.9E-06 32.6 19.1 311 50-502 1-338 (338) 31 protein:vir:104256 Length: 458 93.6 0.0071 4.4E-06 32.4 19.5 352 1-504 36-458 (458) 32 protein:vir:81160 Length: 371 93.0 0.0091 5.6E-06 31.8 17.7 334 1-505 1-371 (371) 33 protein:vir:100135 Length: 418 92.2 0.012 7.6E-06 31.1 21.1 351 1-506 27-418 (418) 34 protein:vir:79987 Length: 415 92.1 0.013 8.1E-06 30.9 18.1 352 1-509 7-415 (415) 35 protein:vir:98339 Length: 415 92.1 0.013 8.1E-06 30.9 18.1 352 1-509 7-415 (415) 36 protein:vir:81100 Length: 415 92.1 0.013 8.1E-06 30.9 18.1 352 1-509 7-415 (415) 37 protein:vir:3033 Length: 272 # 91.9 0.014 8.6E-06 30.8 17.7 269 164-507 1-272 (272) 38 protein:vir:9820 Length: 272 # 91.9 0.014 8.6E-06 30.8 17.7 269 164-507 1-272 (272) 39 protein:vir:78223 Length: 333 91.6 0.015 9.3E-06 30.6 16.3 310 50-499 1-333 (333) 40 protein:vir:7409 Length: 408 # 91.4 0.016 9.9E-06 30.4 20.4 339 1-509 5-408 (408) 41 protein:vir:96392 Length: 324 90.4 0.021 1.3E-05 29.8 17.9 304 35-508 1-324 (324) 42 protein:vir:78830 Length: 324 90.4 0.021 1.3E-05 29.8 17.9 304 35-508 1-324 (324) 43 protein:vir:4856 Length: 293 # 90.2 0.022 1.4E-05 29.6 16.5 268 77-514 1-293 (293) 44 protein:vir:96262 Length: 274 89.5 0.026 1.6E-05 29.3 13.7 271 164-507 1-274 (274) 45 protein:vir:95898 Length: 274 89.5 0.026 1.6E-05 29.3 13.7 271 164-507 1-274 (274) 46 protein:vir:4700 Length: 415 # 88.1 0.034 2.1E-05 28.6 17.9 352 1-509 29-415 (415) 47 protein:vir:4600 Length: 415 # 88.1 0.034 2.1E-05 28.6 17.9 352 1-509 29-415 (415) 48 protein:vir:96762 Length: 632 88.0 0.035 2.2E-05 28.6 16.7 331 1-492 242-632 (632) 49 protein:vir:96223 Length: 324 87.2 0.04 2.5E-05 28.2 17.7 304 35-508 1-324 (324) 50 protein:vir:100247 Length: 425 87.2 0.04 2.5E-05 28.2 17.7 333 1-506 50-425 (425) 51 protein:vir:97148 Length: 324 86.9 0.043 2.6E-05 28.1 17.9 307 35-508 1-324 (324) 52 protein:vir:80376 Length: 435 86.1 0.048 3E-05 27.8 21.3 355 1-502 41-435 (435) 53 protein:vir:102119 Length: 404 85.9 0.049 3E-05 27.8 15.1 355 1-507 1-404 (404) 54 protein:vir:93742 Length: 274 85.9 0.05 3.1E-05 27.7 18.0 273 164-507 1-274 (274) 55 protein:vir:96123 Length: 274 85.8 0.05 3.1E-05 27.7 15.5 271 164-515 1-274 (274) 56 protein:vir:4339 Length: 395 # 85.1 0.055 3.4E-05 27.5 19.4 349 1-504 1-395 (395) 57 protein:vir:9309 Length: 324 # 84.4 0.06 3.7E-05 27.3 15.9 307 35-508 1-324 (324) 58 protein:vir:2430 Length: 318 # 84.2 0.062 3.9E-05 27.2 15.7 288 42-506 1-318 (318) 59 protein:vir:3845 Length: 395 # 83.8 0.066 4.1E-05 27.1 20.0 341 1-507 1-395 (395) 60 protein:vir:3870 Length: 400 # 83.7 0.066 4.1E-05 27.0 16.5 331 1-502 10-400 (400) 61 protein:vir:9410 Length: 415 # 83.6 0.067 4.2E-05 27.0 18.9 357 1-509 13-415 (415) 62 protein:vir:95763 Length: 297 83.6 0.067 4.2E-05 27.0 14.9 278 67-500 1-297 (297) 63 protein:vir:104085 Length: 320 82.4 0.077 4.8E-05 26.7 15.2 292 47-502 1-320 (320) 64 protein:vir:6212 Length: 434 # 82.4 0.077 4.8E-05 26.7 21.4 336 1-506 30-434 (434) 65 protein:vir:101607 Length: 379 81.3 0.087 5.4E-05 26.4 21.6 333 1-501 1-379 (379) 66 protein:vir:81070 Length: 390 80.7 0.092 5.7E-05 26.3 22.1 333 1-497 32-390 (390) 67 protein:vir:7771 Length: 330 # 80.6 0.093 5.8E-05 26.2 13.4 299 151-509 1-330 (330) 68 protein:vir:9704 Length: 394 # 80.0 0.099 6.1E-05 26.1 17.9 331 1-510 31-394 (394) 69 protein:vir:1025 Length: 408 # 78.8 0.11 6.9E-05 25.8 17.2 334 1-501 5-408 (408) 70 protein:vir:105038 Length: 428 78.0 0.12 7.4E-05 25.6 20.2 348 1-500 31-428 (428) 71 protein:vir:1638 Length: 298 # 77.7 0.12 7.6E-05 25.6 15.0 277 77-498 1-298 (298) 72 protein:vir:8187 Length: 311 # 77.4 0.13 7.8E-05 25.5 18.4 289 78-500 1-311 (311) 73 protein:vir:101650 Length: 497 75.0 0.15 9.4E-05 25.1 21.2 352 1-508 54-497 (497) 74 protein:vir:7855 Length: 497 # 75.0 0.15 9.4E-05 25.1 21.2 352 1-508 54-497 (497) 75 protein:vir:9574 Length: 300 # 74.7 0.16 9.7E-05 25.0 17.1 283 77-503 1-300 (300) 76 protein:vir:2504 Length: 305 # 74.4 0.16 9.8E-05 25.0 16.4 285 77-505 1-305 (305) 77 protein:vir:99749 Length: 324 72.9 0.18 0.00011 24.7 18.6 305 35-508 1-324 (324) 78 protein:vir:2344 Length: 397 # 71.9 0.19 0.00012 24.5 19.6 309 70-519 1-354 (397) 79 protein:vir:97053 Length: 390 71.3 0.2 0.00012 24.4 24.5 338 1-502 32-390 (390) 80 protein:vir:3991 Length: 404 # 71.1 0.2 0.00012 24.4 20.1 337 1-507 5-404 (404) 81 protein:vir:9759 Length: 303 # 69.8 0.22 0.00014 24.2 16.4 284 77-499 1-303 (303) 82 protein:vir:99920 Length: 311 69.6 0.22 0.00014 24.2 17.5 286 77-502 1-311 (311) 83 protein:vir:94142 Length: 304 66.6 0.27 0.00016 23.7 16.5 279 151-498 1-304 (304) 84 protein:vir:105905 Length: 304 66.6 0.27 0.00016 23.7 16.5 279 151-498 1-304 (304) 85 protein:vir:103955 Length: 324 64.2 0.3 0.00019 23.4 16.0 305 35-508 1-324 (324) 86 protein:vir:97433 Length: 274 63.5 0.32 0.0002 23.3 17.3 274 164-507 1-274 (274) 87 protein:vir:94494 Length: 274 63.5 0.32 0.0002 23.3 17.3 274 164-507 1-274 (274) 88 protein:vir:4226 Length: 326 # 61.4 0.35 0.00022 23.1 18.4 300 34-508 1-326 (326) 89 protein:vir:94673 Length: 419 61.2 0.36 0.00022 23.0 21.8 351 1-500 21-419 (419) 90 protein:vir:5739 Length: 366 # 57.2 0.44 0.00027 22.5 19.6 334 14-500 1-366 (366) 91 protein:vir:94771 Length: 298 52.9 0.54 0.00034 22.0 11.5 278 162-498 1-298 (298) 92 protein:vir:1781 Length: 221 # 43.2 0.85 0.00053 21.0 16.1 204 223-493 1-221 (221) 93 protein:vir:105334 Length: 276 40.1 0.99 0.00061 20.6 15.6 268 164-502 1-276 (276) 94 protein:vir:100172 Length: 394 37.3 1.1 0.0007 20.3 18.2 344 1-513 1-394 (394) 95 protein:vir:80180 Length: 381 37.1 1.1 0.0007 20.3 17.7 321 63-519 1-361 (381) 96 protein:vir:4456 Length: 401 # 37.0 1.1 0.00071 20.3 18.0 344 1-519 3-401 (401) 97 protein:vir:100884 Length: 389 36.8 1.2 0.00072 20.2 21.0 337 1-507 18-389 (389) 98 protein:vir:4511 Length: 409 # 33.4 1.4 0.00084 19.9 20.2 348 1-507 1-409 (409) 99 protein:vir:107593 Length: 392 33.3 1.4 0.00085 19.8 17.2 322 1-506 34-392 (392) 100 protein:vir:102082 Length: 392 33.3 1.4 0.00085 19.8 17.2 322 1-506 34-392 (392) 101 protein:vir:105004 Length: 392 33.3 1.4 0.00085 19.8 17.2 322 1-506 34-392 (392) 102 protein:vir:102873 Length: 392 33.3 1.4 0.00085 19.8 17.2 322 1-506 34-392 (392) 103 protein:vir:739 Length: 231 # 33.0 1.4 0.00086 19.8 15.7 219 193-519 1-231 (231) 104 protein:vir:80684 Length: 315 31.2 1.5 0.00094 19.6 15.0 288 145-508 1-315 (315) 105 protein:vir:94622 Length: 341 29.9 1.6 0.001 19.4 16.7 297 70-501 1-341 (341) 106 protein:vir:1239 Length: 274 # 29.6 1.6 0.001 19.4 16.3 273 164-507 1-274 (274) 107 protein:vir:8102 Length: 543 # 28.0 1.8 0.0011 19.2 17.1 341 1-505 159-543 (543) 108 protein:vir:80930 Length: 278 27.8 1.8 0.0011 19.2 16.3 271 164-504 1-278 (278) 109 protein:vir:1383 Length: 421 # 27.0 1.9 0.0012 19.1 20.8 345 1-519 1-413 (421) 110 protein:vir:94711 Length: 347 26.7 1.9 0.0012 19.0 13.8 310 115-506 1-347 (347) 111 protein:vir:4092 Length: 390 # 24.6 2.1 0.0013 18.8 17.0 350 1-509 1-390 (390) 112 protein:vir:3364 Length: 347 # 24.0 2.2 0.0014 18.7 14.9 312 115-519 1-345 (347) 113 protein:vir:1084 Length: 437 # 20.7 2.7 0.0017 18.2 17.7 343 1-509 48-437 (437) No 1 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=2.7e-261 Score=1449.09 Aligned_cols=519 Identities=100% Similarity=1.427 Sum_probs=514.1 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) |++|+|+|||+|||||||+|||++.|||+|+++|||||||||.++++||+|++.++++.||+||+++++|||++++|+++ T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~ 80 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|+|+||+|+||++|||||+||||||||||||||||||||||+++++++++.|+||+|||+|++|||+++... T Consensus 81 ~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~ 160 (519) T protein:vir:10 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 161 ~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) ..+++.+.....++...+.+..+++++...+...+++++++++..++.......+.+.+|++++||+|+.+|+++++|++ T Consensus 161 ~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggs 240 (519) T protein:vir:10 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) T ss_pred cccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+|||+|||+|++|+| T Consensus 241 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 320 (519) T protein:vir:10 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) T ss_pred cccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) +++++++|||||++++|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+++++++++.++ T Consensus 321 ~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~ 400 (519) T protein:vir:10 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) T ss_pred cCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec Q lcl|Aclame:pro 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) Q Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n 480 (519) ++++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++| T Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~N 480 (519) T protein:vir:10 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) T ss_pred cccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeeeeeeeceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||+++.+|+++.||+||||.+|++++||.|||||+|||| T Consensus 481 P~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 481 PFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred CcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 999999999999999999999999999999999999999 No 2 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=1.9e-257 Score=1427.97 Aligned_cols=518 Identities=84% Similarity=1.272 Sum_probs=506.3 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) +++|+|+|||+|||||||+|+|++ +||+|+|+|||||||+++++++||++++.++|+.||+||+++|||||++++|+|| T Consensus 5 ~~~e~l~~kw~p~l~~~~~~~~~~-~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es 83 (522) T protein:vir:69 5 KTKAQLVDKWKELLEGEGLPEIAN-SKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAAG 83 (522) T ss_pred chHHHHHHhhHHHhcCCCCCcccc-chhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCccccccc Confidence 899999999999999999999987 5999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+|++|||+|++|||++.... T Consensus 84 ~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~ 163 (522) T protein:vir:69 84 QTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKK 163 (522) T ss_pred ccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCcccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 161 ~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) ....+..+....++...+.+...++++.......+.....+++..++.++......+.+|++++||+|+.+|+|+++|++ T Consensus 164 ~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggs 243 (522) T protein:vir:69 164 FPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGS 243 (522) T ss_pred ccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCCC Confidence 99999999999999999999999999998888877777777777788888889999999999999999999999999999 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+|||+|||+|++|++ T Consensus 244 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 323 (522) T protein:vir:69 244 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 323 (522) T ss_pred cccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) .+....+|+|||+++.|+.++||++||||+|++|||+|+|+|+|+|+||+||||||||+|+++|+|+|+++++++++.+. T Consensus 324 ~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 403 (522) T protein:vir:69 324 NIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAS 403 (522) T ss_pred cccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccccccc Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec Q lcl|Aclame:pro 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) Q Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n 480 (519) ++++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++| T Consensus 404 g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~vN 483 (522) T protein:vir:69 404 GFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGVN 483 (522) T ss_pred cccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||+...+|+++.||+||||.+++..++|.|||||+|||| T Consensus 484 P~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 484 PFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred CcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 999999999999999999999999999999999999999 No 3 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=3.2e-254 Score=1410.36 Aligned_cols=518 Identities=84% Similarity=1.269 Sum_probs=504.3 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) +++|+|+|||+|||||||||+|++ +||+|+|+|||||||+++|+++||+++++++|+.+|+||+++++|||++++|+|| T Consensus 4 ~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~es 82 (521) T protein:vir:10 4 KTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAG 82 (521) T ss_pred chhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCcccccccccccc Confidence 999999999999999999999987 5999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++++++|+|++++++|+.|||+++... T Consensus 83 ~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~ 162 (521) T protein:vir:10 83 QTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKK 162 (521) T ss_pred ccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 161 ~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) ++.....+....++...+.+...++++.......+.....+++...+.........+.+|++++||+|+++|+|+++|++ T Consensus 163 ~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~s 242 (521) T protein:vir:10 163 FAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGS 242 (521) T ss_pred cccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccCCCC Confidence 99999999999999999999999999998888888888888888888889999999999999999999999999999999 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+||++|||+|++++| T Consensus 243 s~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 322 (521) T protein:vir:10 243 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 322 (521) T ss_pred ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) .++++.+|+|||++++|+.++||++||||+|++|||+|+|+|+|+|+||+||||||||+||++|+|+|+++++++++.+. T Consensus 323 ~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 402 (521) T protein:vir:10 323 LTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAT 402 (521) T ss_pred eccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec Q lcl|Aclame:pro 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) Q Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n 480 (519) ++++|+|+++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++| T Consensus 403 g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~N 482 (521) T protein:vir:10 403 GFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 482 (521) T ss_pred cccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||+...+|.+.++|.++++...+..++|.|||||+|||| T Consensus 483 P~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 483 PFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred CcccccCCccceeecccchhhhccccccceeeeeeecCC Confidence 999999999988888886555557788999999999999 No 4 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=3.1e-254 Score=1410.40 Aligned_cols=518 Identities=68% Similarity=1.085 Sum_probs=489.6 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) +++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++++++||++++.|+|+.+|.||+++|||||++++|+|| T Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es 81 (528) T protein:vir:80 2 KTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAAG 81 (528) T ss_pred cchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||+++++.++++||||+|+++|+.||+..+... T Consensus 82 ~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~ 161 (528) T protein:vir:80 82 QTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGA 161 (528) T ss_pred ccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCcccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998765433 Q ss_pred ---------cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhh Q lcl|Aclame:pro 161 ---------FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) Q Consensus 161 ---------~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~A 231 (519) ++..........|+...+.+..++......+......+....+...+.........+.+++++.||+|+.+ T Consensus 162 a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~A 241 (528) T protein:vir:80 162 AVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIA 241 (528) T ss_pred ccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchhhh Confidence 33334445556677777777777777777666655555555555555556666778889999999999999 Q ss_pred hhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhh Q lcl|Aclame:pro 232 ELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYS 311 (519) Q Consensus 232 Eal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~ 311 (519) |.++++|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+++ T Consensus 242 E~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~ 321 (528) T protein:vir:80 242 EIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFT 321 (528) T ss_pred hhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccc Q lcl|Aclame:pro 312 AQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSV 391 (519) Q Consensus 312 a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~ 391 (519) |++|+++|+.+.+.++|+|||++++|++|+||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+++ T Consensus 322 a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~ 401 (528) T protein:vir:80 322 AQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGI 401 (528) T ss_pred eeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccc Confidence 99999999988888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeee Q lcl|Aclame:pro 392 SYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) Q Consensus 392 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~ 471 (519) +.++++.++++++|+|+++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 402 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 481 (528) T protein:vir:80 402 SLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGF 481 (528) T ss_pred cccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 472 KTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 472 ~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||||||++|||+++.+|.++.||++|+|| .+.+++|.|||||+|||| T Consensus 482 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~-~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 482 KTRYGIGINPFADSKSQAPSARITSGMLS-KDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeceeecCcccccCCcccccccccchh-hhhcCccceeEEeeeccC Confidence 99999999999999999999999999999 578999999999999999 No 5 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=1.3e-253 Score=1407.04 Aligned_cols=518 Identities=85% Similarity=1.279 Sum_probs=505.8 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) +++|+|+|||+|||||||+|+|++ +||+|+|+|||||||+++|+++||++++.++++.+|+|++++++|||++++|+|| T Consensus 4 ~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iaes 82 (521) T protein:vir:72 4 KTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAG 82 (521) T ss_pred chhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCccccccc Confidence 999999999999999999999997 5999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++++++|||+.++++|++|||+++... T Consensus 83 ~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~ 162 (521) T protein:vir:72 83 QTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKK 162 (521) T ss_pred ccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 161 ~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) +.....++....|+...+.+...|+++.......+.+++.+++...+..+......+.+|+++.||+|+.+|+++++|++ T Consensus 163 ~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~s 242 (521) T protein:vir:72 163 FPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGS 242 (521) T ss_pred ccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+||++|||+|++++| T Consensus 243 s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t 322 (521) T protein:vir:72 243 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 322 (521) T ss_pred ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) .++++.+|+|||++++|+.++||++||||+|++|||+|+|+|+|+|+||+||||||||+||++|+|+|+++++++++.+. T Consensus 323 ~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 402 (521) T protein:vir:72 323 LTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAT 402 (521) T ss_pred eccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec Q lcl|Aclame:pro 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) Q Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n 480 (519) ++++|+|+++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++| T Consensus 403 g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~N 482 (521) T protein:vir:72 403 GFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 482 (521) T ss_pred cccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||+...+|.++++|.++++...+..++|.|||||+|||| T Consensus 483 P~~~~~~~~~a~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 483 PFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred CcccccCcccceeecCcChhhhcCccccceeeeeeecCC Confidence 999999999988888776555557788999999999999 No 6 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=1.4e-252 Score=1401.35 Aligned_cols=518 Identities=72% Similarity=1.149 Sum_probs=486.4 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) |++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++|++.|||++++|+++.+|+|++++|+|||++.+|+|| T Consensus 3 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~ia~s 82 (529) T protein:vir:10 3 LKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAG 82 (529) T ss_pred cchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.+++.||||+|+|+|++|||.+.... T Consensus 83 ~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~ 162 (529) T protein:vir:10 83 QSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKGA 162 (529) T ss_pred ccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCcccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999876543 Q ss_pred --------cccccccccccccccccccccccccccccccccccCCCCCCCc-cccccccccccccccceecccccchhhh Q lcl|Aclame:pro 161 --------FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDA-AKLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) Q Consensus 161 --------~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~-~~~~~~~~~~~~~g~~~~~~~GmsTa~A 231 (519) ....+.+.....++.....+...+..+.....+.....+.+.+ ...+.......+.+.++++++||+|+.+ T Consensus 163 ~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~a 242 (529) T protein:vir:10 163 TTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATSIA 242 (529) T ss_pred cccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccchhhh Confidence 2333334444455555666666666666666655555544433 2344556667788889999999999999 Q ss_pred hhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhh Q lcl|Aclame:pro 232 ELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYS 311 (519) Q Consensus 232 Eal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~ 311 (519) |+|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||||||++ T Consensus 243 Eal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~~ 322 (529) T protein:vir:10 243 ELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYT 322 (529) T ss_pred hccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccc Q lcl|Aclame:pro 312 AQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSV 391 (519) Q Consensus 312 a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~ 391 (519) ||+|++||+.+++..+|+|||+++.|++|+||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+|.|+++ T Consensus 323 a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~ 402 (529) T protein:vir:10 323 AQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDAGI 402 (529) T ss_pred ceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeee Q lcl|Aclame:pro 392 SYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) Q Consensus 392 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~ 471 (519) ++++++.+.++++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 403 ~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 482 (529) T protein:vir:10 403 TPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGF 482 (529) T ss_pred ccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 472 KTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 472 ~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||||||++|||+.+.+|.++.||+||+|| ++.+++|.|||||+|||| T Consensus 483 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~-~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 483 KTRYAIGVNPFAESRTQAPTSRISNGMPG-AHSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeceeecCccccccccccccccCCcch-hhhcCccceeeEeeeccC Confidence 99999999999999999889999999997 789999999999999999 No 7 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=1.1e-250 Score=1391.05 Aligned_cols=518 Identities=71% Similarity=1.121 Sum_probs=499.8 Q ss_pred CChHHHHHhhhhhhCC-CccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhcc Q lcl|Aclame:pro 1 MKKNALVQKWSALLEN-EALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAA 79 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~-~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~e 79 (519) +++|+|+|||+||||+ ||||||++.+||+|+|+|||||||+++++++|||+++.|+|+.+|.||+++|+|||++.+|+| T Consensus 2 ~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~~ 81 (524) T protein:vir:98 2 SKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIAS 81 (524) T ss_pred cchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhccccccccccccccccccccc Confidence 8899999999999996 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCc----ccccccccccccccCcc Q lcl|Aclame:pro 80 GQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGA----KEAFHPMYAPNAMFSGQ 155 (519) Q Consensus 80 st~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~----~eA~~~fnEadt~fSG~ 155 (519) |++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++++++. +|||++++++|+.|||. T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~ 161 (524) T protein:vir:98 82 GKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGE 161 (524) T ss_pred cccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCCc Confidence 99999999999999999999999999999999999999999999999999999766544 78999999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcc Q lcl|Aclame:pro 156 GAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQE 235 (519) Q Consensus 156 ~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~ 235 (519) ++..+.+..+.++....++.....+...|..+............++++..++.........+..++++.||+|+.+|+|+ T Consensus 162 g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~ 241 (524) T protein:vir:98 162 GAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQE 241 (524) T ss_pred cccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhhhhc Confidence 99999999999999999999999999999999998888888888888888888888888999999999999999999999 Q ss_pred cCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhh Q lcl|Aclame:pro 236 GFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVG 315 (519) Q Consensus 236 ~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~ 315 (519) ++|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+||+++||+| T Consensus 242 ~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~ 321 (524) T protein:vir:98 242 NFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVG 321 (524) T ss_pred cCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccc Q lcl|Aclame:pro 316 KSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAA 395 (519) Q Consensus 316 ~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~ 395 (519) ++|||+++++.+|+|||+++.|.+++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+|+|....+++ T Consensus 322 ~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s 401 (524) T protein:vir:98 322 KSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAS 401 (524) T ss_pred eeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999766666666 Q ss_pred ccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeee Q lcl|Aclame:pro 396 QGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRY 475 (519) Q Consensus 396 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY 475 (519) ++.+..++.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||||||| T Consensus 402 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY 481 (524) T protein:vir:98 402 QGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRY 481 (524) T ss_pred chhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeeeeee Confidence 67788899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 476 GIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 476 ~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||++|||+++.+|.+..||++|+|| .+.+++|.|||||+|||| T Consensus 482 ~l~~NP~~~~~~~~~~~ri~~g~~~-~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 482 GIGINPFANSRSQAPADRITSGMIS-KEMCGKNAYFRKVWVKGL 524 (524) T ss_pred ceeecCcccccCCccccccccCcch-HhhcCccceeeEeeeccC Confidence 9999999999999888899999999 468899999999999999 No 8 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=2.9e-250 Score=1388.63 Aligned_cols=518 Identities=67% Similarity=1.077 Sum_probs=474.9 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) +++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++++++||+++++|+|+.+|+||+++++|||++++|+|| T Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es 81 (528) T protein:vir:66 2 KTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAAG 81 (528) T ss_pred cchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+|||||||||||||||||||++|+++++++++.+|||+.+.+++.||+...... T Consensus 82 ~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a 161 (528) T protein:vir:66 82 QTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEA 161 (528) T ss_pred ccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998765544 Q ss_pred ccccccc---------ccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhh Q lcl|Aclame:pro 161 FEALAAS---------KVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) Q Consensus 161 ~~~~~~~---------t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~A 231 (519) ...++.+ .....++.+.+.+..++.+...........+........+.........+..++++.||+|+.+ T Consensus 162 ~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~a 241 (528) T protein:vir:66 162 TVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIA 241 (528) T ss_pred cccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchhhh Confidence 3222221 1222233333333334433333333333333333333334445556667788999999999999 Q ss_pred hhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhh Q lcl|Aclame:pro 232 ELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYS 311 (519) Q Consensus 232 Eal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~ 311 (519) |+++++|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++ T Consensus 242 Eale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~ 321 (528) T protein:vir:66 242 EIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFT 321 (528) T ss_pred hhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccc Q lcl|Aclame:pro 312 AQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSV 391 (519) Q Consensus 312 a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~ 391 (519) |++|+++|+.+.+.++|+|||++++|+.|+||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+++ T Consensus 322 a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~ 401 (528) T protein:vir:66 322 AQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGI 401 (528) T ss_pred eeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccc Confidence 99999999988888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeee Q lcl|Aclame:pro 392 SYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) Q Consensus 392 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~ 471 (519) +.++++.++++++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 402 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~ 481 (528) T protein:vir:66 402 SLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGF 481 (528) T ss_pred cccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 472 KTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 472 ~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||||||++|||+++.+|+++.||++|+|| .+.+++|.|||||+|||| T Consensus 482 ~tRY~l~vNP~~~~~~~~~~~ri~~g~~~-~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 482 KTRYGIGINPFADSKSQEPSARITSGMLS-KDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeceeecCcccccCccccccccccchh-hhhcCccceeEEeeeccC Confidence 99999999999999999999999999999 578999999999999999 No 9 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=4.6e-249 Score=1382.07 Aligned_cols=518 Identities=70% Similarity=1.131 Sum_probs=484.7 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) |++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++++++|||++++|+++.+|+|++++|+|||++++|+|| T Consensus 3 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~es 82 (529) T protein:vir:10 3 LKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAG 82 (529) T ss_pred ccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++++++++||||+++.|++.|||...... T Consensus 83 t~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga 162 (529) T protein:vir:10 83 QSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGA 162 (529) T ss_pred cccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCcccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999876544 Q ss_pred ccc--------ccccccccccccccccccccccccccccccccCCCCCCCcc-ccccccccccccccceecccccchhhh Q lcl|Aclame:pro 161 FEA--------LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAA-KLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) Q Consensus 161 ~~~--------~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~-~~~~~~~~~~~~g~~~~~~~GmsTa~A 231 (519) ... ++.......++...+.|...++++.....+.....+.+... ..+.........+.++++++||+|+++ T Consensus 163 ~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~a 242 (529) T protein:vir:10 163 TTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIA 242 (529) T ss_pred ccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccchhhh Confidence 322 22233344455556677777777766655555555444332 234445666778899999999999999 Q ss_pred hhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhh Q lcl|Aclame:pro 232 ELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYS 311 (519) Q Consensus 232 Eal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~ 311 (519) |+|+++|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+|+.+ T Consensus 243 EaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~ 322 (529) T protein:vir:10 243 ELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYT 322 (529) T ss_pred hccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccc Q lcl|Aclame:pro 312 AQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSV 391 (519) Q Consensus 312 a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~ 391 (519) |++++.+|+.+++..+|+|||++++|++|+||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+++ T Consensus 323 a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~ 402 (529) T protein:vir:10 323 AQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNI 402 (529) T ss_pred hhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeee Q lcl|Aclame:pro 392 SYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) Q Consensus 392 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~ 471 (519) ++++++...++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 403 ~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 482 (529) T protein:vir:10 403 SPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGF 482 (529) T ss_pred cccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 472 KTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 472 ~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||||||++|||+.+.+|.++.||+||+|| .+.+|+|.|||||+|||| T Consensus 483 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~-~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 483 KTRYAIGVNPFAESRTQAPQGRITSGMPG-VNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeceeecCccccccccccccccCCcch-hhhcCccceeEEeeeccC Confidence 99999999999999999999999999997 678999999999999999 No 10 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=7.1e-249 Score=1381.03 Aligned_cols=518 Identities=69% Similarity=1.128 Sum_probs=486.3 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) |++|+|+|||+|||||||+|||++.|||+|+|+|||||||++++++.|||++++|+++.+|+|++++|+|||++++|+|| T Consensus 3 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~~s 82 (529) T protein:vir:10 3 LKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAG 82 (529) T ss_pred cchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhcccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++++++++||||..+.+++.|||++.... T Consensus 83 t~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~ga 162 (529) T protein:vir:10 83 QSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGA 162 (529) T ss_pred cccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCcccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999876544 Q ss_pred cc--------cccccccccccccccccccccccccccccccccCCCCCCCcc-ccccccccccccccceecccccchhhh Q lcl|Aclame:pro 161 FE--------ALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAA-KLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) Q Consensus 161 ~~--------~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~-~~~~~~~~~~~~g~~~~~~~GmsTa~A 231 (519) .. ..+.......++...+.|...++++.....+.....+.+... ..+.........+.++++++||+|+.+ T Consensus 163 ~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~a 242 (529) T protein:vir:10 163 TTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIA 242 (529) T ss_pred cccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhhhhh Confidence 32 222333445555666777777777776666655555544432 244556667788999999999999999 Q ss_pred hhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhh Q lcl|Aclame:pro 232 ELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYS 311 (519) Q Consensus 232 Eal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~ 311 (519) |+|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+|+.+ T Consensus 243 EaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~ 322 (529) T protein:vir:10 243 ELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYT 322 (529) T ss_pred hccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccc Q lcl|Aclame:pro 312 AQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSV 391 (519) Q Consensus 312 a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~ 391 (519) |++++.+|+.+++..+|+|||++++|++|+||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|.+. T Consensus 323 a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~ 402 (529) T protein:vir:10 323 AQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNI 402 (529) T ss_pred hhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeee Q lcl|Aclame:pro 392 SYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) Q Consensus 392 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~ 471 (519) +++.++...++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 403 ~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 482 (529) T protein:vir:10 403 SPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPVMGF 482 (529) T ss_pred cccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 472 KTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 472 ~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||||||++|||+.+.+|.++.||+||+|| .+.+|+|.|||||+|||| T Consensus 483 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~-~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 483 KTRYAIGVNPFAESRTQAPQGRITSGMPG-VNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeceeecCccccccccccccccCCcch-hhhcCccceeEEeeeccC Confidence 99999999999999999999999999997 678999999999999999 No 11 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=2.9e-247 Score=1372.22 Aligned_cols=519 Identities=59% Similarity=0.954 Sum_probs=485.1 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhh--hhccchhhhhhhhhh--------hhhhhhcccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTA--PEYRDEKISEAFGSF--------LTEAEIGGDH 70 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~--~~~~~~~~~~~~~~~--------~~~~~~~~~~ 70 (519) |++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++++ +.|||+++.++++.| |.|++++++| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~ 80 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDH 80 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccccc Confidence 999999999999999999999999999999999999999999776 899999999999887 9999999999 Q ss_pred ccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccccccccccccc Q lcl|Aclame:pro 71 GYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNA 150 (519) Q Consensus 71 g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt 150 (519) ||++++|+||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++++..++.|||+..+.+|+ T Consensus 81 g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt 160 (534) T protein:vir:10 81 GYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDA 160 (534) T ss_pred ccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999888788888875556999 Q ss_pred ccCcccccccccccccccccccccccccc-----ccccccccccccccccCCCCCCCccccccccccccccccceecccc Q lcl|Aclame:pro 151 MFSGQGAAETFEALAASKVLEVGKIYSHF-----FEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEG 225 (519) Q Consensus 151 ~fSG~~~~~~~~~~~~~t~~~~g~~~~~~-----~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~G 225 (519) +|||+++......+...++...++..... +...|...........++...++....+.........+..++++.| T Consensus 161 ~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~g 240 (534) T protein:vir:10 161 DFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSA 240 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccc Confidence 99999988887777777776666655433 3345555555555545555444544445555666677889999999 Q ss_pred cchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 226 MATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVI 305 (519) Q Consensus 226 msTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii 305 (519) |+|+.+|+|+.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||| T Consensus 241 m~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii 320 (534) T protein:vir:10 241 MATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMV 320 (534) T ss_pred cchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHH Q lcl|Aclame:pro 306 DWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLA 385 (519) Q Consensus 306 ~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~ 385 (519) |+|+.+|++++.+++.+++..+|+|||+++.|+.++||++||+|+|+++||+|||+|+|+|+||+||||||||+||++|+ T Consensus 321 ~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~ 400 (534) T protein:vir:10 321 LWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALG 400 (534) T ss_pred HHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccc Q lcl|Aclame:pro 386 AVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNF 465 (519) Q Consensus 386 ~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~ 465 (519) |+|.++++|+++.+.++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+|| T Consensus 401 ~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sf 480 (534) T protein:vir:10 401 HTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNF 480 (534) T ss_pred hccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeeeeeeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 466 QPVMGFKTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 466 qP~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||+|||||||||++|||+...+|.+..+|+||||.+++.+++|.|||||+|||| T Consensus 481 qP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 481 QPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 999999999999999999999999889999999999999999999999999999 No 12 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=3.1e-239 Score=1328.16 Aligned_cols=510 Identities=61% Similarity=0.967 Sum_probs=459.4 Q ss_pred HHHHHhhhhhhCCCc--cccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhcccc Q lcl|Aclame:pro 4 NALVQKWSALLENEA--LPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQ 81 (519) Q Consensus 4 ~~l~~kw~p~l~~~~--~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est 81 (519) -+|+|||+||||||| +|||++.|||+|+|+|||||||++++++.|||+++.|+|..+|.|++++|+|||++.+|+||+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 689999999999998 999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccc Q lcl|Aclame:pro 82 TSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETF 161 (519) Q Consensus 82 ~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~ 161 (519) +|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++.+ +.|||+++||+|++|||+.+.... T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~t--g~EAf~~~nEadt~fSG~~~~~~~ 158 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcc--cccccccccccCcCcccccccccc Confidence 999999999999999999999999999999999999999999999999998764 469999999999999999988887 Q ss_pred cccccccccccccccccccccccccccccccc-ccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 162 EALAASKVLEVGKIYSHFFEATGSAHFQAVEA-VTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 162 ~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~-~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) ...+...+...++.....+...+......... .......+...............+.+|++++||+|+.+|+++++|++ T Consensus 159 ~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs 238 (514) T protein:vir:56 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCC Confidence 77776666666666554443333322211110 00011111111222334456677889999999999999999999999 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+||+..+++++.+|+ T Consensus 239 ~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~ 318 (514) T protein:vir:56 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) T ss_pred cccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) +++.. +|+|||++++|+.|+||++||||+|+++||||+|+|+|+|+||+||||||||+||++|+|+|.++++++++... T Consensus 319 ~~~~~-~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~ 397 (514) T protein:vir:56 319 QGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD 397 (514) T ss_pred ccccc-ccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccc Confidence 87755 79999999999999999999999999999999999999999999999999999999999999999999887655 Q ss_pred -cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee Q lcl|Aclame:pro 401 -GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI 479 (519) Q Consensus 401 -~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~ 479 (519) .+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+|||||||||++ T Consensus 398 ~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~ 477 (514) T protein:vir:56 398 GSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV 477 (514) T ss_pred cccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccCCccccceeeeeeeeceee Confidence 4899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 480 NPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 480 nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) |||++..++ ..++.|+|+.+++ .++|.|||||+|||| T Consensus 478 NPy~~~~~~--~~~~~~~~~~~a~-~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 478 NPFADPTAS--ATKVGNGAPVAAS-MGKNAYFRRVFVKGL 514 (514) T ss_pred CCCCCcccc--ccccCCcchhhhc-ccccceeeeEEEecC Confidence 999975544 3567888887764 588999999999999 No 13 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1.1e-221 Score=1232.05 Aligned_cols=459 Identities=38% Similarity=0.641 Sum_probs=402.7 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhh-hhccccccchhhhcc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEA-EIGGDHGYDATNIAA 79 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~~~~~~e 79 (519) +++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++|++.++ +|+ +++++||+++.+|+| T Consensus 4 ~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~~l------------~e~~~~~~~~~~~~~~i~~ 71 (470) T protein:vir:10 4 FNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERNFL------------SEAPNVNTNSGATAGFSAD 71 (470) T ss_pred chhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccchh------------hhhhhcccccccccccccc Confidence 9999999999999999999999999999999999999999999988764 444 689999999999999 Q ss_pred ccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccc Q lcl|Aclame:pro 80 GQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAE 159 (519) Q Consensus 80 st~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~ 159 (519) |++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++.+ +| ++|+|+|+.|||.+++. T Consensus 72 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG----~E--affnEA~T~fSG~~~~~ 145 (470) T protein:vir:10 72 ATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSG----TE--ALFNEADTAFSGQPDGL 145 (470) T ss_pred ccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCc----cc--eeeecCCcccCcccccc Confidence 9999999999999999999999999999999999999999999999999998853 34 55899999999988776 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCC Q lcl|Aclame:pro 160 TFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNG 239 (519) Q Consensus 160 ~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~gg 239 (519) ........+...... .++ .... ++++..++. ....+.+..++++.||+|+.+|.| |+ T Consensus 146 ~~~~~~~~~~a~~~g--------~~~---------~~~~-gt~~~~~~~--~~~~a~~~~y~~~~GMsTa~aE~l---g~ 202 (470) T protein:vir:10 146 DDTSGFTATGANNVG--------LGT---------TAQQ-GSNPGLLNS--TAAQTNATDYNVGQGMRTDSAEDL---GD 202 (470) T ss_pred ccccccccccccccc--------ccc---------cccc-ccccccccc--ccccccccccccccccchHHhhhc---CC Confidence 543322221111000 000 0001 111111111 122234557889999999999976 67 Q ss_pred CCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcc Q lcl|Aclame:pro 240 STDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGM 319 (519) Q Consensus 240 s~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~ 319 (519) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+|+.+|++++..+ T Consensus 203 s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~ 282 (470) T protein:vir:10 203 GTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQAN 282 (470) T ss_pred CCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceecc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999988876 Q ss_pred cccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccc Q lcl|Aclame:pro 320 TNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLG 399 (519) Q Consensus 320 t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~ 399 (519) +. .+|+|||+++.| +||++|+||+|++||++++|+|+|+|+||+||||||||+||++|+|+|.+.+.|+. + T Consensus 283 ~~----~~Gv~Dl~~~~~---gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~--~ 353 (470) T protein:vir:10 283 VA----AAGTFDLDTDSN---GRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPAL--N 353 (470) T ss_pred cc----ccceEEeecccc---hhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhcccccccccc--c Confidence 64 489999998766 89999999999999999999999999999999999999999999999999998765 4 Q ss_pred ccccccCCCceEEEEecCcEEEEecCC------CccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeee Q lcl|Aclame:pro 400 QGFNVDTTKAVFAGVLGGKYRVYIDQY------ARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKT 473 (519) Q Consensus 400 ~~~~~d~~~~~~~G~l~~~~~vy~D~y------~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~t 473 (519) ..+++|+|+++|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+||||+||||| T Consensus 354 ~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~t 433 (470) T protein:vir:10 354 ANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKT 433 (470) T ss_pred cccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCccccceeeeee Confidence 568999999999999999999999997 77899999999999999999999999999999999999999999999 Q ss_pred eeceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 474 RYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 474 RY~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||||++|||+..++|. +++|++ ++|.|||||+|||| T Consensus 434 RY~l~~NP~~~~~~~~-~~~i~~---------~~n~y~r~~~v~~l 469 (470) T protein:vir:10 434 RYGLVENPFSQGTTQG-LGTLTR---------NSNRYYRRVKVANL 469 (470) T ss_pred eeceeecCcccCCCcc-cccccC---------CCCceeeEEEeecc Confidence 9999999999888864 455653 67889999999999 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=9.4e-221 Score=1226.84 Aligned_cols=457 Identities=36% Similarity=0.617 Sum_probs=395.2 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccc-cchhhhcc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHG-YDATNIAA 79 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~e 79 (519) |++|+|+|||+|||||||+|||++.|||+|+|+|||||||++++++.||+|+++++++ +|+ ...-++++ T Consensus 2 ~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~----------~~~~~~~n~~~~ 71 (468) T protein:vir:10 2 FNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLG----------AGTIAPAGSALG 71 (468) T ss_pred cchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcC----------Ccccchhhhhhh Confidence 9999999999999999999999999999999999999999999999999999998873 333 33446778 Q ss_pred ccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccc Q lcl|Aclame:pro 80 GQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAE 159 (519) Q Consensus 80 st~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~ 159 (519) +++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++.+ +||| |||||++|||++... T Consensus 72 ~~~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g----~EAf--~nEadt~fSg~~~~~ 145 (468) T protein:vir:10 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG----EEAL--FNEPDTGFTGGYDAS 145 (468) T ss_pred hcccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCC----ccce--ecccccccccccccc Confidence 8999999999999999999999999999999999999999999999999998853 4555 799999999976543 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCC Q lcl|Aclame:pro 160 TFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNG 239 (519) Q Consensus 160 ~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~gg 239 (519) ......... ....+ ...++++...+ ...+..++++.||+|+.+|.|+ T Consensus 146 ~~~~~~~~~----------------------~~~~~-~~~g~~~~~~~------~a~~~~~~~g~gMsTa~aE~lG---- 192 (468) T protein:vir:10 146 QGDYAVRTG----------------------AGVGG-DSEGNNPALLN------DAAPGTYEVGSKMPREDLERMG---- 192 (468) T ss_pred ccccccccc----------------------ccccc-CCCCCcccccc------cccccccccccccchHHHhhcC---- Confidence 322111000 00001 11112222211 2234567899999999999984 Q ss_pred CCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcc Q lcl|Aclame:pro 240 STDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGM 319 (519) Q Consensus 240 s~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~ 319 (519) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+|+.+|+++++. T Consensus 193 ~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~- 271 (468) T protein:vir:10 193 EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQN- 271 (468) T ss_pred CCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecc- Confidence 4567899999999999999999999999999999999999999999999999999999999999999988888877752 Q ss_pred cccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccc Q lcl|Aclame:pro 320 TNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLG 399 (519) Q Consensus 320 t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~ 399 (519) ....+|+|||+++.| +||++|+||+|++|||+++|+|+|+|+||+||||||||+||++|+|+|.++++|+...+ T Consensus 272 ---g~~~~Gv~d~~~~~~---~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~ 345 (468) T protein:vir:10 272 ---NVANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGA 345 (468) T ss_pred ---ccccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceeccccccc Confidence 234589999998866 89999999999999999999999999999999999999999999999999999987766 Q ss_pred cc---ccccCCCceEEEEecCcEEEEecCCCc----cceEEEEEecCCCccceeEeecccccccccccCcccccceeeee Q lcl|Aclame:pro 400 QG---FNVDTTKAVFAGVLGGKYRVYIDQYAR----SDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 472 (519) Q Consensus 400 ~~---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~ 472 (519) .+ +++|+++++|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+||||+|||| T Consensus 346 ~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 425 (468) T protein:vir:10 346 GGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) T ss_pred ccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeee Confidence 55 479999999999999999999999975 79999999999999999999999999999999999999999999 Q ss_pred eeeceeecCcccccccCCcceeecCCchh-hhcccchhhhhhhhhcCC Q lcl|Aclame:pro 473 TRYGIGINPFADPAAQAPTKRIQNGMPDI-VNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 473 tRY~l~~nP~~~~~~~~~~~~i~~~~d~~-a~~~~~~~y~r~v~v~~~ 519 (519) |||||++|||+... .|.||++.. +...++|.|||||+|||| T Consensus 426 tRY~l~~NP~~~~~------~~~~g~~~~~~~~~~~N~y~r~~~v~~l 467 (468) T protein:vir:10 426 TRYGMVSNPFVTTN------GLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) T ss_pred eeeceeecccceec------cccCCCcccccccccccceeeeEEEecc Confidence 99999999999522 345554332 225689999999999999 No 15 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=4.1e-218 Score=1212.39 Aligned_cols=456 Identities=38% Similarity=0.653 Sum_probs=394.2 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) |++|+|+|||+|||||||+|+|++.|||+|+++|||||||++++++.+ |+|+. ++|||++. + T Consensus 1 ms~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~------------l~ea~--~~~g~~~~----~ 62 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEGQV------------LNETL--QTTGYTTG----D 62 (462) T ss_pred CchHHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcccc------------hhccc--cccCCCcC----c Confidence 999999999999999999999999999999999999999999886655 55553 89999864 5 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|+++++|||+||+|||||+|||||+|||||||||||||||||||+||++++.+.+....+++|||+|+.|||..+... T Consensus 63 ~~t~~~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~ 142 (462) T protein:vir:10 63 TATGPVAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGL 142 (462) T ss_pred ccccccccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCCcCccccccccc Confidence 66999999999999999999999999999999999999999999999999887655433345558999999999876543 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 161 ~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) ........... . ....++++...+.+. ......++++.||+|+.+|+|+. ++ T Consensus 143 ~~~~~~~~~~~----------------------~-~~~~g~~~~~~~~~~---~g~~~~~~~~~GM~Ta~aE~lg~--~s 194 (462) T protein:vir:10 143 SNYDPTASSSA----------------------V-NDAEGANPGLLNDSP---AGTYEVTGDATGMATATAEALDD--SS 194 (462) T ss_pred ccccccccccc----------------------c-cccccccceeecCCC---ccceecccccccccchhccccCC--cc Confidence 22111110000 0 000011111111111 11223456788999999999963 56 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+|+.+|++|+.+++ T Consensus 195 ~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~ 274 (462) T protein:vir:10 195 ASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANT 274 (462) T ss_pred CCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccc Confidence 68899999999999999999999999999999999999999999999999999999999999999999999999988777 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) . .+|||||+++.+ +||++|++|+|++||++++|+|+|+|+||+||||||||+||++|+|+|.+.+.|+...+. T Consensus 275 ~----~~Gv~dl~~~~~---gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~ 347 (462) T protein:vir:10 275 A----TDGIFDLDVDSN---GRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNS 347 (462) T ss_pred c----ccceeeeccccc---hHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccccccccc Confidence 4 489999987755 899999999999999999999999999999999999999999999999999999877776 Q ss_pred cc-cccCCCceEEEEecCcEEEEecCC----CccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeee Q lcl|Aclame:pro 401 GF-NVDTTKAVFAGVLGGKYRVYIDQY----ARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRY 475 (519) Q Consensus 401 ~~-~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY 475 (519) .+ ++|+++.+|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+||||+||||||| T Consensus 348 ~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY 427 (462) T protein:vir:10 348 ALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRY 427 (462) T ss_pred cccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeee Confidence 65 799999999999999999999998 6689999999999999999999999999999999999999999999999 Q ss_pred ceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 476 GIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 476 ~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||++|||+.+.+|.+. ++ ..++|.|||||+|||| T Consensus 428 ~l~~NP~t~~~~~~~~-~~---------~~~~n~y~r~~~v~~l 461 (462) T protein:vir:10 428 GMVSNPFSGGLTQGSG-AL---------TANANKYYRRVQVANL 461 (462) T ss_pred eeeecCCCCCcCCccc-cc---------cccCcceeeeEEeecc Confidence 9999999998887653 33 3577899999999999 No 16 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=8.9e-215 Score=1194.08 Aligned_cols=451 Identities=41% Similarity=0.686 Sum_probs=394.2 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~es 80 (519) |++|+|+|||+|||||||||||++.|||+|+++|||||||++.+++.+ |+||. ++|||++.. T Consensus 1 m~~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~~~------------l~ea~--~~~g~~~~s---- 62 (457) T protein:vir:10 1 MSFQNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEGKI------------LTETL--QTTGYTGGD---- 62 (457) T ss_pred CchHHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhcccc------------ccccc--cccCCCccc---- Confidence 999999999999999999999999999999999999999998876654 55553 999998765 Q ss_pred cccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccc Q lcl|Aclame:pro 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) Q Consensus 81 t~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~ 160 (519) ++|++|++|||+||+||||++|||||+|||||||||||||||||||+||+++.+.......|++|||||+.|||..+... T Consensus 63 ~~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~ 142 (457) T protein:vir:10 63 TVTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYD 142 (457) T ss_pred ccccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeeeccCcccCccccccc Confidence 56899999999999999999999999999999999999999999999999887654333345558999999999765533 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCC Q lcl|Aclame:pro 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) Q Consensus 161 ~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs 240 (519) ..... ......++++...+.... .....++++.||+|+++|+|+. ++ T Consensus 143 ~~~~~----------------------------~~~~~~gt~~~~~~~~~~---~~~~~~~~~~gmsTA~aE~lgd--~~ 189 (457) T protein:vir:10 143 PGATG----------------------------VTNDAEGTNPALLNDSPA---GTYEQADDATGMSTATVEALDD--ST 189 (457) T ss_pred ccccc----------------------------cccccccccccccCcccc---ccccccccccchhhhhhhccCC--CC Confidence 21100 000011112222222222 2334678899999999999963 56 Q ss_pred CccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 241 ~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) +++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+|+.+|+.++.+++ T Consensus 190 ~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~ 269 (457) T protein:vir:10 190 ANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNT 269 (457) T ss_pred CccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeecccc Confidence 77899999999999999999999999999999999999999999999999999999999999999999999999998887 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) .+ +|||||+++.| +||++|+||+|++||++++|+|+|+|+||+||||||||+||++|+|+|.+.++|+..... T Consensus 270 ~~----~gv~dl~~~~~---g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~ 342 (457) T protein:vir:10 270 AT----AGVFDLDVDSN---GRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNN 342 (457) T ss_pred cc----ceeeeeecccc---chhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhccc Confidence 54 89999987765 899999999999999999999999999999999999999999999999999999877776 Q ss_pred cc-cccCCCceEEEEecCcEEEEecCCCc----cceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeee Q lcl|Aclame:pro 401 GF-NVDTTKAVFAGVLGGKYRVYIDQYAR----SDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRY 475 (519) Q Consensus 401 ~~-~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY 475 (519) +. ++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||++++++||+||||+||||||| T Consensus 343 ~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY 422 (457) T protein:vir:10 343 GLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRY 422 (457) T ss_pred cccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCccccceeeeeeee Confidence 64 68999999999999999999998874 79999999999999999999999999999999999999999999999 Q ss_pred ceeecCcccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 476 GIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 476 ~l~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) ||++|||+.+.+|.++ +++ .|+|.||||++|+|| T Consensus 423 ~l~~NP~~~~~~~~~~-~~~---------~~~n~~~~rs~vs~l 456 (457) T protein:vir:10 423 GMVSNPFAGGLTQGSG-ALT---------VNANKYYRRVQVANL 456 (457) T ss_pred eeeecccccccccccc-ccc---------ccchhhcceeeeeec Confidence 9999999998887654 343 356789999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=1.5e-193 Score=1077.74 Aligned_cols=447 Identities=23% Similarity=0.316 Sum_probs=344.9 Q ss_pred CC----hHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhh Q lcl|Aclame:pro 1 MK----KNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATN 76 (519) Q Consensus 1 ~~----~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (519) |+ +|+|+|||+||||+ |++.|||+|+|+|||||||+ .+ ++ T Consensus 1 ~~~~~~~e~l~~kw~p~l~~-----~~~~~~~~~~a~llenq~~~---~~----------------------------~~ 44 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEG-----CRNDWERHTLATLLENQYRE---AK----------------------------KH 44 (523) T ss_pred CCcchhhHHHHHhhhhhhcc-----cCChhHHHHHHHHhhhhhHH---HH----------------------------Hh Confidence 55 56799999999997 66779999999999999873 11 24 Q ss_pred hccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCC--------Cccccccccccc Q lcl|Aclame:pro 77 IAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAA--------GAKEAFHPMYAP 148 (519) Q Consensus 77 ~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~--------~~~eA~~~fnEa 148 (519) |+|++.+++|++|+| ||+||||++|||||+||||||||||||||||||||||.++.++. .+.+++.+++++ T Consensus 45 l~e~~~~~~~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ea 123 (523) T protein:vir:59 45 LMETTQTTEVDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDE 123 (523) T ss_pred hhhhhhccccccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCccccccccccc Confidence 566777999999996 99999999999999999999999999999999999999986542 122345566788 Q ss_pred ccccCcccccccccccccccc----cccccccccccc---------------ccccccc--------------------- Q lcl|Aclame:pro 149 NAMFSGQGAAETFEALAASKV----LEVGKIYSHFFE---------------ATGSAHF--------------------- 188 (519) Q Consensus 149 dt~fSG~~~~~~~~~~~~~t~----~~~g~~~~~~~~---------------~~G~~~~--------------------- 188 (519) ++.||+.............+. ...++.....+. ..+...+ T Consensus 124 n~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~ 203 (523) T protein:vir:59 124 NARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAY 203 (523) T ss_pred ccccccccccCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccc Confidence 888887654433221111000 000000000000 0000000 Q ss_pred ----------cccc---cccCCCCCCCccccccccccccccccceecccccchhhhhhcccCC--CCCccccccceeEEE Q lcl|Aclame:pro 189 ----------QAVE---AVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFN--GSTDNPWNEMGFRID 253 (519) Q Consensus 189 ----------~~~~---~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~g--gs~~~~f~EMsFsIE 253 (519) .... ........++..............+..++.+.||+|+.+|.++..+ ++.+++|+||+|+|| T Consensus 204 s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIe 283 (523) T protein:vir:59 204 PLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELR 283 (523) T ss_pred hhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEE Confidence 0000 0000000011111111111222334568889999999999987655 577899999999999 Q ss_pred EEEEEeecccccccccHHHHHHHHhhc-CCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeec Q lcl|Aclame:pro 254 KQVIEAKSRQLKASYSIELAQDLRAVH-GMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDF 332 (519) Q Consensus 254 K~TVtAKSRALKAEYTmELAQDLKAiH-GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl 332 (519) ||+|||||||||||||||||||||||| |||||+||+||||||||+||||||||+|+.+|++++.+++.+ +||||| T Consensus 284 K~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~----~g~~~~ 359 (523) T protein:vir:59 284 SRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWS----EVVGEY 359 (523) T ss_pred eEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccc----cceeee Confidence 999999999999999999999999999 999999999999999999999999999999999988876643 899999 Q ss_pred ccccc---ccccchH--HHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCC Q lcl|Aclame:pro 333 QDPID---IRGARWA--GESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTT 407 (519) Q Consensus 333 ~~~~d---~~~~~~a--~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~ 407 (519) .++.| +.|.+|. +||+|.||++||||+|+|+|+|+||+||||||||+||++|+++|.+.. ......|++ T Consensus 360 ~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~------~~~~~~~~~ 433 (523) T protein:vir:59 360 YDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTP------GNDNRDGGT 433 (523) T ss_pred cccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhcccccc------CCccccccc Confidence 98776 2333343 899999999999999999999999999999999999999999997654 244678899 Q ss_pred CceEEEEecCcEEEEecCCCccceEEEEEecC-CCccceeEeeccccccccccc-Ccccccceeeeeeeeceee-cCccc Q lcl|Aclame:pro 408 KAVFAGVLGGKYRVYIDQYARSDYFTIGYKGS-NEMDAGIYYAPYVALTPLRGS-DPKNFQPVMGFKTRYGIGI-NPFAD 484 (519) Q Consensus 408 ~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~-~~~~~~~fyaPYv~~~~~~~~-dp~s~qP~~g~~tRY~l~~-nP~~~ 484 (519) +.+|+|+|+|||+||||||+++|||+|||||. +++|+|||||||||+.+++.+ ||+||||+|||||||||++ |||+. T Consensus 434 ~~~~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~ 513 (523) T protein:vir:59 434 GIFYVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFY 513 (523) T ss_pred cceeEEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHh Confidence 99999999999999999999999999999995 599999999999999999996 9999999999999999986 99986 Q ss_pred ccccCCcceeecCCc Q lcl|Aclame:pro 485 PAAQAPTKRIQNGMP 499 (519) Q Consensus 485 ~~~~~~~~~i~~~~d 499 (519) +.-- .++.. | T Consensus 514 ~~~~---~~~~~--~ 523 (523) T protein:vir:59 514 GLLY---VKLLQ--P 523 (523) T ss_pred hhhh---hhhcC--C Confidence 4331 11100 0 No 18 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=98.06 E-value=1.3e-07 Score=58.41 Aligned_cols=420 Identities=16% Similarity=0.072 Sum_probs=146.5 Q ss_pred CChHHHHHhhhhhhCCCccccccc-cchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhhh-hhccccccchhhhc Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVG-ASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEA-EIGGDHGYDATNIA 78 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~~~~~~ 78 (519) ..++++. +-|||++.-.--.+ -||.+.+ .|--...|.|....-+.. ..+.++.|. ..+.-.++.+.+|. T Consensus 26 ~~~~~~~---a~l~enq~~~~~~~~~~~~~~~---~~~~~~~l~ea~~~~~~~---~~~~~i~es~~t~~v~~~~P~Li~ 96 (528) T protein:vir:66 26 ASKQKLV---AKILESQEADFAVDPIYKDEKV---VEAFGGFIAEAEVAGDHG---YDASQIAAGQTTGAITNVGPAVIG 96 (528) T ss_pred hhhhhhh---hhhhhhhHHHhhcccchhhHHH---HHhhhhhhhhhccccccc---ccchhccccccccccccCchhHHH Confidence 3344433 45788754211111 1554432 222223333322111000 012233333 34445566666654 Q ss_pred ccc------ccccccccCceehh--h----------------HHHHH-----hhhhhhhceeecc-CCccchhheeeeee Q lcl|Aclame:pro 79 AGQ------TSGAVTQIGPAVMG--M----------------VRRAI-----PHLIAFDICGVQP-LNNPTGQVFALRAV 128 (519) Q Consensus 79 est------~tg~v~~~~P~L~~--l----------------~Rra~-----p~LIa~DI~GVQP-mTGPTGLIFAMRsr 128 (519) --. ..-+|-+..|-=.| | ++-++ |.--+...-.=+. ..||||||||||++ T Consensus 97 lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~ 176 (528) T protein:vir:66 97 MVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTL 176 (528) T ss_pred HHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcccccccccccccccccccccccccccccccCCccceeecccc Confidence 322 12345555554332 0 11111 1111222222222 24699999999999 Q ss_pred ecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccc Q lcl|Aclame:pro 129 YGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDA 208 (519) Q Consensus 129 Y~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~ 208 (519) |.++.. ++++ +|+|+|+.|||..........+.......++.........+..+.......+..+.. ...++. T Consensus 177 y~s~~~---g~ea--~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEa--le~lg~ 249 (528) T protein:vir:66 177 SQAITA---GDIV--YHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEI--QEGFNG 249 (528) T ss_pred cccccc---ccee--eecccccceeeeccccccccccCcccccccccccccccccccceecccccchhhhhh--hcccCC Confidence 987653 4555 489999999986555433222222111111111100000010110000000000000 000000 Q ss_pred cc---ccccccccceecccccchh-hhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCH Q lcl|Aclame:pro 209 AV---TALVEAGQLAEIAEGMATS-IAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDA 284 (519) Q Consensus 209 ~~---~~~~~~g~~~~~~~GmsTa-~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDA 284 (519) +. ...+.-..-.-+.++-+++ +|| +|+|- -.-|||=--+..-.-| T Consensus 250 ~s~~~f~EMaFsIeK~tVtAKSRaLKAE----------------------YTiEL-AQDLKAIHGLDAEtEL-------- 298 (528) T protein:vir:66 250 SSNNPWAEMSMRIDKQVVEAKSRQLKAR----------------------YSIEV-AQDLRAVHGMDADAEL-------- 298 (528) T ss_pred CcccchhhcceEEEeEEEEeeccceecc----------------------ccHHH-HHHHHHhcCCChHHHH-------- Confidence 00 0000000000011111111 111 11110 1235554333333334 Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeecccccccc-ccchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 285 DAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIR-GARWAGESFKALLFQIDKEAAEIA 363 (519) Q Consensus 285 EaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~-~~~~a~e~~r~L~~~i~~~a~~I~ 363 (519) ..-|++-+.-||-.||-|-|-.+..+.++...++..++.+.. ..-|-.+....+ -+.|.-..++.+-....++..+-. T Consensus 299 sNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~-dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~ 377 (528) T protein:vir:66 299 NAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVF-DLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTG 377 (528) T ss_pred HHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeecccccccee-ecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhc Confidence 445888899999999997543466766665443222211111 011111111111 134666655555555555555444 Q ss_pred hhc-------cccCCCEEEEchH---------HHHHHHhcCcccccccccccccccc--c---CCCceEEEEe-----cC Q lcl|Aclame:pro 364 RQT-------GRGAGNFIIASRN---------VVNVLAAVDTSVSYAAQGLGQGFNV--D---TTKAVFAGVL-----GG 417 (519) Q Consensus 364 ~~T-------~rg~gn~~v~S~~---------va~~L~~~g~~~~~~~~~~~~~~~~--d---~~~~~~~G~l-----~~ 417 (519) |-- ++-+ -+++|.. ....+.. +.-....+...+..+.+ | .....-+|.= ++ T Consensus 378 r~~gn~vi~S~~Va--~~L~~~g~~~~~~~~~~~~~~~~-d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~ 454 (528) T protein:vir:66 378 RGAGNFVIASRNVV--NILASADQGISLAMQGAAKGLNT-DTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDA 454 (528) T ss_pred cccccEEEEchHHH--HHHhhcccccccccccccccccc-CCCCceeEEEecCceEEEecCCCCcceEEEEEeCCccccc Confidence 311 1100 0111111 1111110 00000000000111111 1 0111112221 11 Q ss_pred cEEEEecCCCccceEEEEEecCCCcc-------ceeEeecccccc-cc---c---------ccCcccccceeeeeee Q lcl|Aclame:pro 418 KYRVYIDQYARSDYFTIGYKGSNEMD-------AGIYYAPYVALT-PL---R---------GSDPKNFQPVMGFKTR 474 (519) Q Consensus 418 ~~~vy~D~y~~~dy~~vG~KG~~~~~-------~~~fyaPYv~~~-~~---~---------~~dp~s~qP~~g~~tR 474 (519) + +|-=||.+..+. +++-=.+.++ =||.=-||+-.. .- | ....+-|--++++|.= T Consensus 455 g--lfyaPYv~l~~~-~~~dp~sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 455 G--IYYAPYVALTPL-RATDPQSFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred c--eeecccccceee-EeeCCccccceeeeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 1 222355554433 3332222222 233334554211 10 0 0122333444444433 No 19 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=96.78 E-value=0.00034 Score=39.59 Aligned_cols=333 Identities=13% Similarity=0.063 Sum_probs=129.1 Q ss_pred CC-hHHHHHhhhhhhCCCccccccccc--------------------hhhhhhhhh---hhHHHHHhhhhhccc------ Q lcl|Aclame:pro 1 MK-KNALVQKWSALLENEALPEIVGAS--------------------KQAIIAKIF---ENQEQDILTAPEYRD------ 50 (519) Q Consensus 1 ~~-~~~l~~kw~p~l~~~~~~~~~~~~--------------------~~~~~~~~~---enq~~~~~~~~~~~~------ 50 (519) |+ .++|.++|..+-+. |++.. +..| ..+. |.+++.+.+...... T Consensus 1 Mk~~~el~~~~~~~~~~-----~~~l~~~~~~~~~~~~~~~ee~~~~~~~i-~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 74 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDK-----VENLNEKLNVAMLDDSVSAEELQAIKNER-DTAKMKRDMFKEQYTEARANEVANMSEE 74 (397) T ss_pred CchHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhhhcCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 77 55666666544322 11100 0000 1111 111111111000000 Q ss_pred ---------hhhhhhhhhhhhhhhhccccccchhhhcccccc-ccccccCceehhhHHHHHhhhhhhhceeeccCCccch Q lcl|Aclame:pro 51 ---------EKISEAFGSFLTEAEIGGDHGYDATNIAAGQTS-GAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTG 120 (519) Q Consensus 51 ---------~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~t-g~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTG 120 (519) +.....+..-+......+..... .....++++ |.+.--..+.-.+++.+.+..+-.++|.++||++++| T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 75 EKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLL-DSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred cccccccchhHHHHHHHHHHHHHHhcchhHHH-HHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCcc Confidence 00000000001011111111000 011112211 2221111112234444556777888999999999887 Q ss_pred hheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCC Q lcl|Aclame:pro 121 QVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGA 200 (519) Q Consensus 121 LIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~ 200 (519) -+.-++- .+... .+.| T Consensus 154 ~~~~~~~--~~~~~--------------~a~~------------------------------------------------ 169 (397) T protein:vir:49 154 SRVYEKW--TDITG--------------LANI------------------------------------------------ 169 (397) T ss_pred ceEEEee--ccCCc--------------ceee------------------------------------------------ Confidence 4321111 00000 0000 Q ss_pred CCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhc Q lcl|Aclame:pro 201 TDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVH 280 (519) Q Consensus 201 t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiH 280 (519) +++|-.. ...+...|.++.|++.|.. -...+|-||.+|- T Consensus 170 ---------------------v~E~~~~---------~~~~~~~~~~i~~~~~k~~-------~~~~iS~ell~ds---- 208 (397) T protein:vir:49 170 ---------------------DDEAGKI---------ADVDDPKLSLIKYTIKRYA-------GISTVTNSLLADS---- 208 (397) T ss_pred ---------------------ecCcccc---------ccccccceeeEEeeeeeEE-------eeehhHHHHHhhh---- Confidence 0111000 0001233555555555544 4557999999985 Q ss_pred CCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 GMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAA 360 (519) Q Consensus 281 GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~ 360 (519) ..|.+++|.+-|+..|..-+|+.||.-.- +.....|+++++ -...|+..|... T Consensus 209 ~~~l~~~i~~~l~~~~~~~~d~ai~~G~g------------~~~~~~~~~~~d-------------~i~~~~~~l~~~-- 261 (397) T protein:vir:49 209 AENILAWLSGWIAKKVVVTRNKAILEAIA------------ALPTKPTLTKWD-------------DIIDLEAKVDPA-- 261 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------ccccccccccHH-------------HHHHHHHhhhhh-- Confidence 25679999999999999999999886211 111223443332 224444444321 Q ss_pred HHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEe--cCCCcc--------- Q lcl|Aclame:pro 361 EIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYI--DQYARS--------- 429 (519) Q Consensus 361 ~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------- 429 (519) +.....+|++|.....|...= + + .+...+..+.+. -..++|.| ++|++ |...+. T Consensus 262 -------~~~~a~~vmn~~~~~~l~~lk--d-~---~G~~l~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~~~~~~~~i~ 326 (397) T protein:vir:49 262 -------IKQTSFFLTNTSGFTALKKVK--N-A---LGDYLMERDVKS-PTGYSIDG-FAVKEVADRWLANGTGGAMPLY 326 (397) T ss_pred -------hcCCCEEEEcHHHHHHHHHhh--c-C---CCceeeccCcCC-CCCceecc-eeeEEecccccccccCCceeEE Confidence 224478899999999887541 1 1 011112222221 11257877 57775 322221 Q ss_pred -----ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-ecCc-------ccccccCCcceeec Q lcl|Aclame:pro 430 -----DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INPF-------ADPAAQAPTKRIQN 496 (519) Q Consensus 430 -----dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~-~nP~-------~~~~~~~~~~~i~~ 496 (519) +|++++.++..+. =+.||.. .+-...+-.+-...|++.. .||- +...+..+.. T Consensus 327 ~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~---- 392 (397) T protein:vir:49 327 FGDLKQAVTLFDRQHMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNL---- 392 (397) T ss_pred EeeccceEEEEeecceEE----EEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCc---- Confidence 1333333322221 1222211 0111222233334444432 2221 1111111100 Q ss_pred CCchhhhcc Q lcl|Aclame:pro 497 GMPDIVNSL 505 (519) Q Consensus 497 ~~d~~a~~~ 505 (519) ..-| . T Consensus 393 --~~~~--~ 397 (397) T protein:vir:49 393 --GSTA--V 397 (397) T ss_pred --cccc--C Confidence 0001 1 No 20 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=358 Identities=14% Similarity=0.039 Sum_probs=129.4 Q ss_pred CChHHHHHhhhhhhC-CCccccccccchhh--hhhhh---hhhHHHHHhhhhhcc----chh--------------hh-- Q lcl|Aclame:pro 1 MKKNALVQKWSALLE-NEALPEIVGASKQA--IIAKI---FENQEQDILTAPEYR----DEK--------------IS-- 54 (519) Q Consensus 1 ~~~~~l~~kw~p~l~-~~~~~~~~~~~~~~--~~~~~---~enq~~~~~~~~~~~----~~~--------------~~-- 54 (519) |=+|....+|..... .+.+-+..+..+.. -...+ +++..+.+.+..+.+ +.. .. T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEF 80 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhh Confidence 555555555543221 11110000100000 00000 000000000000000 000 00 Q ss_pred --hhhhhhhhh-------------hhhccccccchhhhccccccccc----cccCceehhhHHHHHhhhhhhhceeeccC Q lcl|Aclame:pro 55 --EAFGSFLTE-------------AEIGGDHGYDATNIAAGQTSGAV----TQIGPAVMGMVRRAIPHLIAFDICGVQPL 115 (519) Q Consensus 55 --~~~~~~~~~-------------~~~~~~~g~~~~~~~est~tg~v----~~~~P~L~~l~Rra~p~LIa~DI~GVQPm 115 (519) +.......+ ................++++..- ..+.+- +++.+-+..+..+++.|+|| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~---ii~~~~~~~~l~~~~~~~~~ 157 (413) T protein:vir:81 81 FAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRN---IIYRRREKLVVADLMDNLTM 157 (413) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHH---HHHHHhhhhhHHhhcceeec Confidence 000000000 00000000001111111111111 112223 34444456777899999999 Q ss_pred CccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 116 NNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVT 195 (519) Q Consensus 116 TGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~ 195 (519) ++++.-+.-.+. .... ..++ T Consensus 158 ~~~~~~~~~~~~-~~~~--------------~~~a--------------------------------------------- 177 (413) T protein:vir:81 158 TNTTIKYLMEKA-NRVV--------------EGGF--------------------------------------------- 177 (413) T ss_pred cCCceeEEEecc-cccc--------------cccc--------------------------------------------- Confidence 997642211110 0000 0000 Q ss_pred CCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 196 VDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQD 275 (519) Q Consensus 196 ~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQD 275 (519) ..+++|-.. .| +....|.+..|.+.|.. -...+|-||.+| T Consensus 178 ------------------------~~v~Eg~~~--~~-------~~~~~f~~i~~~~~k~~-------~~~~iS~ell~d 217 (413) T protein:vir:81 178 ------------------------KTVAEGGKK--PY-------MRFADFDIVTESLSKIA-------GLTKITDEMIED 217 (413) T ss_pred ------------------------ceecCcccc--cc-------cCcccceeeEeeeeeEE-------EeehhhHHHHHH Confidence 001111000 00 01123454555554444 455789999998 Q ss_pred HHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 276 LRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQI 355 (519) Q Consensus 276 LKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i 355 (519) --+ .++.|.+.|+..|..-+|+.||. |. .+...+.|+++........ ....-.++.-| T Consensus 218 s~~-----l~~~i~~~la~~~~~~~d~~~l~--------G~----G~~~~~~Gi~~~~~~~~~~-----~~~~~~~~~~i 275 (413) T protein:vir:81 218 YDF-----LVSYINARLLEELAIEEERQLLL--------GD----GTGNNLTGLLKRDGIQTLA-----VSNKDELADSI 275 (413) T ss_pred HHH-----HHHHHHHHHHHHHHHHHHHHHhc--------cC----CCCCccccccccccccccc-----ccccchhHHHH Confidence 632 47888888888888888887774 10 1122244555433221110 01112233333 Q ss_pred HHHHHHHHhhccccCCCEEEEchHHHHHHHhc----CcccccccccccccccccCCCceEEEEecCcEEEEecCCCccce Q lcl|Aclame:pro 356 DKEAAEIARQTGRGAGNFIIASRNVVNVLAAV----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDY 431 (519) Q Consensus 356 ~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy 431 (519) .+....+.....+. .+.+|++|.....|... |...+.+......+ + -.....++|.| ++|+++...+..- T Consensus 276 ~~~~~~~~~~~~~~-~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~---~-~~~~~~~~l~G-~pv~~s~~~~~~~ 349 (413) T protein:vir:81 276 YKAMTNISLATPFQ-ADALVINPLDYQELRLAKDANGQYYGGGVFQGQYG---S-GGIMLDPAPWG-LRTVQSQVVPVGK 349 (413) T ss_pred HHHHHHhhhhccCC-CcEEEEcHHHHHHHHHhhccCCceecccccccccc---c-cccccCceecc-eeeEEcCCCCccc Confidence 33334444444443 36688999988777532 21111111000000 0 00011235666 6999998877654 Q ss_pred EEEEEecCC--Cc---cceeEeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcc Q lcl|Aclame:pro 432 FTIGYKGSN--EM---DAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) Q Consensus 432 ~~vG~KG~~--~~---~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~ 505 (519) +++|---.. -. .-.+=..+|... +-.+-+=.+-+..||++.+ +|= ...++.-. .-.+ . T Consensus 350 ~~~gd~~~~~~~~~~~~~~v~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~~-------a~~~l~~~-~~~~--p 413 (413) T protein:vir:81 350 PVVGAFRSAASVLRKGGVRIDSTNTNVD------DFENNLITVRAEERVGLMVTFPE-------AIVQLDVA-EVVT--P 413 (413) T ss_pred EEEEecccEEEEEEecceEEEEeccccc------hhhcCcEEEEEEEeeccEEeccc-------ceEEEEec-CCCC--C Confidence 444421100 00 000111111100 1123344555666777544 111 11111111 0000 0 No 21 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=96.13 E-value=0.00097 Score=37.10 Aligned_cols=278 Identities=9% Similarity=0.006 Sum_probs=133.8 Q ss_pred cccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccc Q lcl|Aclame:pro 70 HGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPN 149 (519) Q Consensus 70 ~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEad 149 (519) .|+++.....+...+.. --....-.+++++..+.+-.+++-+-||++.+.- ....+. . . T Consensus 1 ~g~~a~~~~~~~~~~~~-iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-----~~~~~~--~-------------~ 59 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGS-IPINISEQIITGVKNGSAAMKLAKAVPMTKPEEE-----FTFMSG--V-------------G 59 (299) T ss_pred CCcCCCcccccCCCcee-cchhHHHHHHHHHHhcchhhhhceeeecCCCcEE-----EEEEcC--C-------------c Confidence 67777664433322221 1111223466667778888899999999876521 111000 0 0 Q ss_pred cccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchh Q lcl|Aclame:pro 150 AMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATS 229 (519) Q Consensus 150 t~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa 229 (519) +. -++ T Consensus 60 a~---------------------------------------------------------------------~v~------ 64 (299) T protein:vir:41 60 AF---------------------------------------------------------------------WVD------ 64 (299) T ss_pred ee---------------------------------------------------------------------eee------ Confidence 00 000 Q ss_pred hhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHH Q lcl|Aclame:pro 230 IAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWIN 309 (519) Q Consensus 230 ~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~ 309 (519) | +.+++|...++++++...|..+-...+|-||.+|-. .|.++.|.+.|...|...+++.||.=-. T Consensus 65 --E---------~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g 129 (299) T protein:vir:41 65 --E---------AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVE 129 (299) T ss_pred --c---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 1 122344445567888888888888899999999743 4568999999999999999988885100 Q ss_pred hhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCc Q lcl|Aclame:pro 310 YSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDT 389 (519) Q Consensus 310 ~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~ 389 (519) ...+.|++........ ........+.-|+++-+.+... ..+++.+||+|+....|...- T Consensus 130 -------------~~~~~gil~~~~~~~~-----~~~~~~~~~~~l~~~~~~l~~~--~~~~~~~v~n~~~~~~L~~lk- 188 (299) T protein:vir:41 130 -------------SPYNWNILKSATDASN-----LVEETANKYDDLNEAIGLIEAE--DLEPNGIATIRKQRVKYRSTK- 188 (299) T ss_pred -------------Ccccccccccccccce-----eeccccccHHHHHHHHHhhhcc--cCCcCEEEEcHHHHHHHHHhh- Confidence 0111232211100000 0000011123344444555442 335678999999999988532 Q ss_pred ccccccccccccccccCCCceEEEEecCcEEEEecCCCccc----eEEEEEecCCCccceeEeeccccccc--------c Q lcl|Aclame:pro 390 SVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD----YFTIGYKGSNEMDAGIYYAPYVALTP--------L 457 (519) Q Consensus 390 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~vG~KG~~~~~~~~fyaPYv~~~~--------~ 457 (519) +. .+...+..+.+.. .++|.| ++|++.++.+.+ .+++|-- +..++...-.... . T Consensus 189 -d~----~G~~l~~~~~~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gdf------s~~~i~~~~~~~i~~~~~~~~~ 254 (299) T protein:vir:41 189 -DG----NGMPIFNTATSNG--VDDVLG-LPIAYTPKYTFGDKDISELVGDW------NQAYYGILRGVEYEILTEATLT 254 (299) T ss_pred -cc----CCceeecCCcCCC--Cceecc-eeeEEecccCCCCCceEEEEEec------ccEEEEEecCcEEEEeeccccc Confidence 11 0111122222222 146776 799988887753 1222211 0011111111111 1 Q ss_pred cccCccc-----ccc-eee--eeeeeceee-cCcccccccCCcceeecCCchhhhccc Q lcl|Aclame:pro 458 RGSDPKN-----FQP-VMG--FKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLG 506 (519) Q Consensus 458 ~~~dp~s-----~qP-~~g--~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~ 506 (519) ...|++. ||- .+. ...|++..+ ||=+ -.++.. ..+ + T Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A-------~~~l~~---~aa---~ 299 (299) T protein:vir:41 255 TVADETGKPLNLAERDMAAIKATFEVGFMVVKDEA-------FSAVQP---KAG---N 299 (299) T ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccc-------eEEEEe---ccC---C Confidence 1112221 222 233 345777654 2211 223321 111 1 No 22 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=95.85 E-value=0.0014 Score=36.30 Aligned_cols=345 Identities=12% Similarity=0.071 Sum_probs=132.1 Q ss_pred CCh-HHHHHhhhhhhCCCccccccccchhhh-----hhhhhhhHHHHHhhhhhccchhhhhh------------------ Q lcl|Aclame:pro 1 MKK-NALVQKWSALLENEALPEIVGASKQAI-----IAKIFENQEQDILTAPEYRDEKISEA------------------ 56 (519) Q Consensus 1 ~~~-~~l~~kw~p~l~~~~~~~~~~~~~~~~-----~~~~~enq~~~~~~~~~~~~~~~~~~------------------ 56 (519) |++ ++|.++..-+.+. +-++.+.-+..+ ...=||++-+.+..+-+-+.+.+.+. T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:18 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 654 3444444333221 111111111100 00001111111100000001111100 Q ss_pred ------hhhhhhhhhhccccccchhhhccccccc-cc--cccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeee Q lcl|Aclame:pro 57 ------FGSFLTEAEIGGDHGYDATNIAAGQTSG-AV--TQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRA 127 (519) Q Consensus 57 ------~~~~~~~~~~~~~~g~~~~~~~est~tg-~v--~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRs 127 (519) +...+..........-..+.+..+++++ .+ ..+.+ .+++++.++..-.++|-++||++++.-+ . T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~---~ii~~~~~~~~l~~~~~~~~~~~~~~~~----~ 151 (385) T protein:vir:18 79 ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIP---GIIMPGLRRLTIRDLLAQGRTSSNALEY----V 151 (385) T ss_pred HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhh---HHHHHhhhccchhhhcceecccCcceEE----E Confidence 0011100000000000001111111111 11 11222 2344444566677888888887765211 0 Q ss_pred eecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccc Q lcl|Aclame:pro 128 VYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLD 207 (519) Q Consensus 128 rY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~ 207 (519) +...... .+.| T Consensus 152 ~~~~~~~--------------~a~~------------------------------------------------------- 162 (385) T protein:vir:18 152 REEVFTN--------------NADV------------------------------------------------------- 162 (385) T ss_pred EEecCCc--------------ceee------------------------------------------------------- Confidence 1100000 0000 Q ss_pred cccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHH Q lcl|Aclame:pro 208 AAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAE 287 (519) Q Consensus 208 ~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaE 287 (519) ++ | +..+++-..++++++.+.|.-+-...+|.||.||-- +.++. T Consensus 163 --------------v~--------E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~ 206 (385) T protein:vir:18 163 --------------VA--------E---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSY 206 (385) T ss_pred --------------ec--------c---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHH Confidence 00 1 112233344556667777777777889999999852 24777 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 288 LSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTG 367 (519) Q Consensus 288 LsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~ 367 (519) |.+-|+..|..-+|+.||.= . .+...+.|++.......... -... -..+..|..+...|. .. T Consensus 207 i~~~la~a~~~~~d~~~l~G--------~----g~~~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~~ 268 (385) T protein:vir:18 207 INNRLMYGLALKEEGQLLNG--------D----GTGDNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--ES 268 (385) T ss_pred HHHHHHHHHHHHHHHHHHhc--------c----CCCCcccccccccccccccc-cccc---cchHHHHHHHHHhhc--cc Confidence 88888888888888777741 0 01112345543322111000 0000 112333444444443 23 Q ss_pred ccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeE Q lcl|Aclame:pro 368 RGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIY 447 (519) Q Consensus 368 rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~f 447 (519) +..++.+||||+....|...- +.. ++..+.....+ --++|.| ++|+++++.|..=+++|-- +-+ T Consensus 269 ~~~~~~~~~~~~~~~~l~~lk--d~~----G~~l~~~~~~~--~~~~l~G-~pV~~~~~~p~~~~~~gd~-------~~~ 332 (385) T protein:vir:18 269 EFSASGIVLNPRDWHNIALLK--DNE----GRYIFGGPQAF--TSNIMWG-LPVVPTKAQAAGTFTVGGF-------DMA 332 (385) T ss_pred cCCCCEEEEcHHHHHHHHHhh--cCC----CceeccCcccC--CCceecc-eeeEEcCcCCCCcEEEeec-------ccE Confidence 446678999999998887532 110 01111111111 1256776 7999999999765555421 001 Q ss_pred eeccccccc-cccc--Ccccc-cceee--eeeeeceee-cCcccccccCCcceeecCCchhhhcc Q lcl|Aclame:pro 448 YAPYVALTP-LRGS--DPKNF-QPVMG--FKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) Q Consensus 448 yaPYv~~~~-~~~~--dp~s~-qP~~g--~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~ 505 (519) |--+....+ +... +..-| +..++ ...||+..+ +|=+ -.++. ++..+ T Consensus 333 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a-------~~~~~-----~~aa~ 385 (385) T protein:vir:18 333 SQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTA-------IIKGT-----FSSGS 385 (385) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc-------eEEEE-----eccCC Confidence 110111000 0000 00112 22334 445777643 2211 12221 11111 No 23 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=95.85 E-value=0.0014 Score=36.30 Aligned_cols=345 Identities=12% Similarity=0.071 Sum_probs=132.1 Q ss_pred CCh-HHHHHhhhhhhCCCccccccccchhhh-----hhhhhhhHHHHHhhhhhccchhhhhh------------------ Q lcl|Aclame:pro 1 MKK-NALVQKWSALLENEALPEIVGASKQAI-----IAKIFENQEQDILTAPEYRDEKISEA------------------ 56 (519) Q Consensus 1 ~~~-~~l~~kw~p~l~~~~~~~~~~~~~~~~-----~~~~~enq~~~~~~~~~~~~~~~~~~------------------ 56 (519) |++ ++|.++..-+.+. +-++.+.-+..+ ...=||++-+.+..+-+-+.+.+.+. T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:19 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 654 3444444333221 111111111100 00001111111100000001111100 Q ss_pred ------hhhhhhhhhhccccccchhhhccccccc-cc--cccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeee Q lcl|Aclame:pro 57 ------FGSFLTEAEIGGDHGYDATNIAAGQTSG-AV--TQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRA 127 (519) Q Consensus 57 ------~~~~~~~~~~~~~~g~~~~~~~est~tg-~v--~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRs 127 (519) +...+..........-..+.+..+++++ .+ ..+.+ .+++++.++..-.++|-++||++++.-+ . T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~---~ii~~~~~~~~l~~~~~~~~~~~~~~~~----~ 151 (385) T protein:vir:19 79 ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIP---GIIMPGLRRLTIRDLLAQGRTSSNALEY----V 151 (385) T ss_pred HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhh---HHHHHhhhccchhhhcceecccCcceEE----E Confidence 0011100000000000001111111111 11 11222 2344444566677888888887765211 0 Q ss_pred eecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccc Q lcl|Aclame:pro 128 VYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLD 207 (519) Q Consensus 128 rY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~ 207 (519) +...... .+.| T Consensus 152 ~~~~~~~--------------~a~~------------------------------------------------------- 162 (385) T protein:vir:19 152 REEVFTN--------------NADV------------------------------------------------------- 162 (385) T ss_pred EEecCCc--------------ceee------------------------------------------------------- Confidence 1100000 0000 Q ss_pred cccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHH Q lcl|Aclame:pro 208 AAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAE 287 (519) Q Consensus 208 ~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaE 287 (519) ++ | +..+++-..++++++.+.|.-+-...+|.||.||-- +.++. T Consensus 163 --------------v~--------E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~ 206 (385) T protein:vir:19 163 --------------VA--------E---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSY 206 (385) T ss_pred --------------ec--------c---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHH Confidence 00 1 112233344556667777777777889999999852 24777 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 288 LSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTG 367 (519) Q Consensus 288 LsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~ 367 (519) |.+-|+..|..-+|+.||.= . .+...+.|++.......... -... -..+..|..+...|. .. T Consensus 207 i~~~la~a~~~~~d~~~l~G--------~----g~~~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~~ 268 (385) T protein:vir:19 207 INNRLMYGLALKEEGQLLNG--------D----GTGDNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--ES 268 (385) T ss_pred HHHHHHHHHHHHHHHHHHhc--------c----CCCCcccccccccccccccc-cccc---cchHHHHHHHHHhhc--cc Confidence 88888888888888777741 0 01112345543322111000 0000 112333444444443 23 Q ss_pred ccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeE Q lcl|Aclame:pro 368 RGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIY 447 (519) Q Consensus 368 rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~f 447 (519) +..++.+||||+....|...- +.. ++..+.....+ --++|.| ++|+++++.|..=+++|-- +-+ T Consensus 269 ~~~~~~~~~~~~~~~~l~~lk--d~~----G~~l~~~~~~~--~~~~l~G-~pV~~~~~~p~~~~~~gd~-------~~~ 332 (385) T protein:vir:19 269 EFSASGIVLNPRDWHNIALLK--DNE----GRYIFGGPQAF--TSNIMWG-LPVVPTKAQAAGTFTVGGF-------DMA 332 (385) T ss_pred cCCCCEEEEcHHHHHHHHHhh--cCC----CceeccCcccC--CCceecc-eeeEEcCcCCCCcEEEeec-------ccE Confidence 446678999999998887532 110 01111111111 1256776 7999999999765555421 001 Q ss_pred eeccccccc-cccc--Ccccc-cceee--eeeeeceee-cCcccccccCCcceeecCCchhhhcc Q lcl|Aclame:pro 448 YAPYVALTP-LRGS--DPKNF-QPVMG--FKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) Q Consensus 448 yaPYv~~~~-~~~~--dp~s~-qP~~g--~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~ 505 (519) |--+....+ +... +..-| +..++ ...||+..+ +|=+ -.++. ++..+ T Consensus 333 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a-------~~~~~-----~~aa~ 385 (385) T protein:vir:19 333 SQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTA-------IIKGT-----FSSGS 385 (385) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc-------eEEEE-----eccCC Confidence 110111000 0000 00112 22334 445777643 2211 12221 11111 No 24 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=95.74 E-value=0.0015 Score=36.02 Aligned_cols=364 Identities=14% Similarity=0.086 Sum_probs=141.4 Q ss_pred CCh--HHHHHhhhhhhCCC-------ccc---cccccchhhhhhhhhhhHHHHHh-------hhhhccchhhhhhhhhhh Q lcl|Aclame:pro 1 MKK--NALVQKWSALLENE-------ALP---EIVGASKQAIIAKIFENQEQDIL-------TAPEYRDEKISEAFGSFL 61 (519) Q Consensus 1 ~~~--~~l~~kw~p~l~~~-------~~~---~~~~~~~~~~~~~~~enq~~~~~-------~~~~~~~~~~~~~~~~~~ 61 (519) +.. ++|.++=......+ +.+ +.....++.-....+.+..+..+ ..+..+.....+..... T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 144 (477) T protein:vir:84 66 LDEQIRELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEI- 144 (477) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhH- Confidence 110 11111111000000 000 00000111000011110000000 00000000000000000 Q ss_pred hhhhhccccccchhhhccccccccccccCceeh--hhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcc Q lcl|Aclame:pro 62 TEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVM--GMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAK 139 (519) Q Consensus 62 ~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~--~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~ 139 (519) + .....+.....+..++++|.. ..-|-.+ .++...-+..+..++|++.||++.+|-+-=.|..-+ + . T Consensus 145 --~-~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~----~---~ 213 (477) T protein:vir:84 145 --R-KIAKVGEEYRDLDRNGGTGGY-AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTG----T---S 213 (477) T ss_pred --H-HHHHhhhhhccccccCCCcce-eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecC----c---c Confidence 0 000011111222211222111 1122221 255555567778899999999998875422221100 0 0 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc Q lcl|Aclame:pro 140 EAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL 219 (519) Q Consensus 140 eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~ 219 (519) .+. + T Consensus 214 ~a~---------~------------------------------------------------------------------- 217 (477) T protein:vir:84 214 TAI---------Q------------------------------------------------------------------- 217 (477) T ss_pred eee---------e------------------------------------------------------------------- Confidence 000 0 Q ss_pred eecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) Q Consensus 220 ~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlE 299 (519) +++|-. ......++...+++.+++.+|.-+-...+|-||.+|- ..|.++.|.+-|+..|..- T Consensus 218 --~~Eg~~------------~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~ 279 (477) T protein:vir:84 218 --AADNAA------------LTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQA----AVSVDEFVFRDLAADYANK 279 (477) T ss_pred --eccCcc------------cccccccccccceeeEEEeeeeEEeeeHHHHHHHhcc----chhHHHHHHHHHHHHHHHH Confidence 001000 0011234445566778888888888889999999994 3567999999999999999 Q ss_pred hhHHHHHHHHhhhhhhhhcccccccccceeeeccccccc--cccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEc Q lcl|Aclame:pro 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI--RGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIAS 377 (519) Q Consensus 300 INReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~--~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S 377 (519) |++.||. | +-+...+.|++.......+ .++--.......++..|-...+.+....+. .+..+|++ T Consensus 280 ~d~~~l~--------G----~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~v~~ 346 (477) T protein:vir:84 280 LNVQVIS--------G----TGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFL-EPEVIVMH 346 (477) T ss_pred HHHHHhc--------c----CCCCCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhccccccC-CccEEEEc Confidence 9998885 1 1112235677654321110 000001122233444444444444443333 33678888 Q ss_pred hHHHHHHHhc----Ccccccccccc--cccccccCCCceEEEEecCcEEEEecCCCccc--------eEEEEEecCCCcc Q lcl|Aclame:pro 378 RNVVNVLAAV----DTSVSYAAQGL--GQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD--------YFTIGYKGSNEMD 443 (519) Q Consensus 378 ~~va~~L~~~----g~~~~~~~~~~--~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG~KG~~~~~ 443 (519) |.....|... |...+.|.... ...+..+.-.....|+|.| ++|+++++.|.+ -|++|--.+. T Consensus 347 ~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G-~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~--- 422 (477) T protein:vir:84 347 PRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG-LPVVTDPTLPTTLGTGTDQDVIHVLRASDL--- 422 (477) T ss_pred HHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc-cceEecCcccccccccCCcceEEEEEeceE--- Confidence 8776665432 21111111000 0011112222233467876 699999998753 3444433211 Q ss_pred ceeEeecccccccccccCccccc--ceeeeeeeecee-----ecCcccccccCCcceeecCC----chhh Q lcl|Aclame:pro 444 AGIYYAPYVALTPLRGSDPKNFQ--PVMGFKTRYGIG-----INPFADPAAQAPTKRIQNGM----PDIV 502 (519) Q Consensus 444 ~~~fyaPYv~~~~~~~~dp~s~q--P~~g~~tRY~l~-----~nP~~~~~~~~~~~~i~~~~----d~~a 502 (519) +.- +..+.-.++|.++. ....|.. ||+. .+|=+ .. ++-|. |-++ T Consensus 423 ---~i~---~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~~~r~~~a-------fv-~~t~~~~~~~~~~ 477 (477) T protein:vir:84 423 ---ALF---ESSVRMRALQETRAENLSVLLQV-YGYLAFTAARFPQS-------VV-EIGGTALTAPTFA 477 (477) T ss_pred ---EEE---eeceeEEeccccccccceeeeee-hhhhhhhhhccccc-------eE-EeecccccccccC Confidence 000 00001112333222 2222211 2211 12221 11 22232 3333 No 25 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=95.64 E-value=0.0017 Score=35.76 Aligned_cols=328 Identities=11% Similarity=0.006 Sum_probs=119.9 Q ss_pred CChHHHHHhhhhhhCC--------------CccccccccchhhhhhhhhhhHHHHHhhh-hhccchhhhhhhhhhhhhhh Q lcl|Aclame:pro 1 MKKNALVQKWSALLEN--------------EALPEIVGASKQAIIAKIFENQEQDILTA-PEYRDEKISEAFGSFLTEAE 65 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~--------------~~~~~~~~~~~~~~~~~~~enq~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 65 (519) =.-+.|.++...+-+. ...++......+......-+.+....+.. +..+...+.+.....+... T Consensus 40 ~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~- 118 (397) T protein:vir:12 40 DEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSP- 118 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhh- Confidence 0011122222211110 00000000000000000000000000000 0000000110000111000 Q ss_pred hccccccchhhhcccc-ccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccccccc Q lcl|Aclame:pro 66 IGGDHGYDATNIAAGQ-TSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHP 144 (519) Q Consensus 66 ~~~~~g~~~~~~~est-~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~ 144 (519) ....+..++ ++|.+.--....-.+++.+.++.+-.+++.+.||+++.|-+--.|.. ..+ T Consensus 119 -------~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~~-------- 178 (397) T protein:vir:12 119 -------EFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNA-----DMV-------- 178 (397) T ss_pred -------hhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEec-----CCc-------- Confidence 011111111 12222111111122444455677778999999999988743211111 000 Q ss_pred ccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceeccc Q lcl|Aclame:pro 145 MYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAE 224 (519) Q Consensus 145 fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~ 224 (519) .+.| +++ T Consensus 179 ----~a~~---------------------------------------------------------------------v~E 185 (397) T protein:vir:12 179 ----PFSP---------------------------------------------------------------------VEE 185 (397) T ss_pred ----ceee---------------------------------------------------------------------ecc Confidence 0000 000 Q ss_pred ccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 225 GMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREV 304 (519) Q Consensus 225 GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINRei 304 (519) |- +. ...+...|.++.|+..|..+- ..+|-||.+|-- +|.++.|.+.|...|...+|..| T Consensus 186 g~-----~~----~~~~~~~~~~v~~~~~k~~~~-------~~is~e~l~ds~----~~l~~~i~~~l~~~~~~~~d~~i 245 (397) T protein:vir:12 186 LG-----NL----PEIDQPRFTKVSYSIIDYGGI-------MTLSNSMLNDSD----QAIMTYVAKWFAKKSVVTRNNLI 245 (397) T ss_pred cc-----cc----cccccccceeEEeeheeeEee-------ehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHH Confidence 00 00 001122355555555555554 459999998853 56788999999999999999888 Q ss_pred HHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHH Q lcl|Aclame:pro 305 IDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVL 384 (519) Q Consensus 305 i~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L 384 (519) +.-. ..+.+.|+..+++-. +.++..++ ..+..+..++|+|.....| T Consensus 246 l~G~-------------g~~~~~g~~~~~~i~------------~~~~~~l~---------~~~~~~a~~~~n~~~~~~L 291 (397) T protein:vir:12 246 LAAI-------------ASLKKVDIDGLDGIK------------KALNVTLD---------PMVAPGSIVLTNQDGYDWL 291 (397) T ss_pred Hhcc-------------ccccccccccHHHHH------------HHHhhccc---------hhhhCCCEEEEcHHHHHHH Confidence 7511 112345664433211 11222222 1233456789999998888 Q ss_pred HhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccc--------ccc Q lcl|Aclame:pro 385 AAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA--------LTP 456 (519) Q Consensus 385 ~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~--------~~~ 456 (519) ...= + + .+...+..+.+.. .-++|.| ++|++.+..... . ..-+.-++|+.|-. ... T Consensus 292 ~~lk--d-~---~G~~l~~~~~~~g-~~~~l~G-~pv~~~~~~~~~-----~---~~~~~~~~~gd~~~~~~~~~~~~~~ 355 (397) T protein:vir:12 292 DTLK--D-G---TGRYLLQPDPTNP-TKKLLDG-RPVVPFTNRVLK-----T---QKGKAPLIIGNLKEAIVLFDREQQS 355 (397) T ss_pred HHhh--c-c---CCceeecccccCC-CCccccc-eeeEEecccccc-----c---CCCccEEEEEehhceEEEEeecceE Confidence 6531 0 0 0011111121111 1246777 588764432110 0 00001122222110 000 Q ss_pred cccc-----Ccccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccchhhhhhhhhc Q lcl|Aclame:pro 457 LRGS-----DPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVK 517 (519) Q Consensus 457 ~~~~-----dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~ 517 (519) +... +-.+-+-.+-...|++..+ ||-+ ...+. +.+| T Consensus 356 i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a-------~~~~~------------------~t~~ 397 (397) T protein:vir:12 356 IASTDTGAGAFETNSTKVRGIEREDVRKWDEDA-------VVFGQ------------------ITVE 397 (397) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEE------------------EeeC Confidence 0000 0112344555666776543 2211 11110 0111 No 26 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=94.93 E-value=0.0032 Score=34.30 Aligned_cols=333 Identities=14% Similarity=0.094 Sum_probs=121.5 Q ss_pred CC-hHHHHHhhhhhhCC----C-------ccccccccchhhhhhhhhhhHHH---------HHhhhhhc----------- Q lcl|Aclame:pro 1 MK-KNALVQKWSALLEN----E-------ALPEIVGASKQAIIAKIFENQEQ---------DILTAPEY----------- 48 (519) Q Consensus 1 ~~-~~~l~~kw~p~l~~----~-------~~~~~~~~~~~~~~~~~~enq~~---------~~~~~~~~----------- 48 (519) |+ .++|.+.|..+=+. + ...+....-.+++-+.|-+.+++ ..+..... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 33 23344434333110 0 00000000001111111111110 00000000 Q ss_pred -cch----hhhhhhhhhhhhhhhccccccchhhhcccccccccc---ccCceehhhHHHHHhhhhhhhceeeccCCccch Q lcl|Aclame:pro 49 -RDE----KISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTG 120 (519) Q Consensus 49 -~~~----~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTG 120 (519) ..+ .....+..++... ...-.......+++.|.+. .+.+- +++.+.+...-.+++.++||++++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~t~~~gg~~iP~~~~~~---ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGR----YQNLLDSKTDASGSDAGLTIPQDIQTA---IHTLVRQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred chhhHHHHHHHHHHHHHHhhh----hhHHHHHhhccCCccccccccHHHHHH---HHHHHHHHHHHHhhhceeeccCCcc Confidence 000 0000111111111 0000001111111112211 22233 3344445666788899999999887 Q ss_pred hheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCC Q lcl|Aclame:pro 121 QVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGA 200 (519) Q Consensus 121 LIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~ 200 (519) -+--++. ..... .+. T Consensus 154 ~~~~~~~--~~~~~--------------~a~------------------------------------------------- 168 (397) T protein:vir:48 154 SRVYEKW--ADITG--------------LAK------------------------------------------------- 168 (397) T ss_pred eEEEEee--cCCCc--------------cee------------------------------------------------- Confidence 5542221 10000 000 Q ss_pred CCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhc Q lcl|Aclame:pro 201 TDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVH 280 (519) Q Consensus 201 t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiH 280 (519) .+++|- + . ..+....|.++.|++.|.. -...+|-||.+|-. T Consensus 169 --------------------~v~E~~-----~-~---~~~~~~~~~~v~~~~~k~~-------~~~~iS~ell~ds~--- 209 (397) T protein:vir:48 169 --------------------LDDEAG-----S-I---GTNDDPKLYPIRYAIKRYA-------GISTVTNSLLADSA--- 209 (397) T ss_pred --------------------eecccc-----c-c---ccccccceeeEEeeheeee-------eehhhHHHHHhhch--- Confidence 000000 0 0 0011223555555555554 44679999999843 Q ss_pred CCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 GMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAA 360 (519) Q Consensus 281 GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~ 360 (519) .|.+++|.+-|+..|..-+|+.|+.-.- +.....++.+++ -...++ + T Consensus 210 -~~l~~~v~~~l~~~~~~~~d~~il~G~g------------~~~~~~~~~~~d-------------~i~~~~-------~ 256 (397) T protein:vir:48 210 -ENILAWLSGWIAKKVVVTRNKAILEAIA------------TLPTKPTLTKWD-------------DIIDLQ-------A 256 (397) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhhccc------------ccccccccccHH-------------HHHHHH-------H Confidence 5779999999999999999999986211 111122332221 122333 3 Q ss_pred HHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEe--cCCCcc--------- Q lcl|Aclame:pro 361 EIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYI--DQYARS--------- 429 (519) Q Consensus 361 ~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------- 429 (519) .+... +..+..+||+|.....|...= +.. +...+..+.+.. --++|.| ++|++ |...+. T Consensus 257 ~l~~~--~~~~a~~v~n~~~~~~L~~lk--d~~----G~~i~~~~~~~~-~~~~l~G-~PV~~~~~~~~~~~~~~~~~~~ 326 (397) T protein:vir:48 257 KVDPA--IKQTSFFLTNTSGFTALKKVK--NAF----GDYLMERDVKSP-TGYSIDG-FAVKEVADRWLANASSGAMPLY 326 (397) T ss_pred Hhhhh--hcCCCEEEECHHHHHHHHHhh--cCC----CceeeccCcCCC-CCceecc-ceeEEecccccCCcCCCceEEE Confidence 33332 224578899999999997531 100 011111221111 1246777 57664 212111 Q ss_pred -----ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCc-------ccccccCCcceeec Q lcl|Aclame:pro 430 -----DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPF-------ADPAAQAPTKRIQN 496 (519) Q Consensus 430 -----dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~-------~~~~~~~~~~~i~~ 496 (519) +|++++..+.-...- .++.. .+-.+.+=.+-...||+..+ ||- +...++.+ T Consensus 327 ~gd~~~~~~~~~~~~~~i~~----~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~------ 390 (397) T protein:vir:48 327 FGDLKQAVTLFDRQQMSLLS----TNIGG------GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKG------ 390 (397) T ss_pred EEeccceEEEEeecceEEEE----eccch------hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCC------ Confidence 133333332221111 11100 01112222333334443321 221 11111111 Q ss_pred CCchhhhcc Q lcl|Aclame:pro 497 GMPDIVNSL 505 (519) Q Consensus 497 ~~d~~a~~~ 505 (519) +.-+... T Consensus 391 --~~~~~~~ 397 (397) T protein:vir:48 391 --NLGSTAV 397 (397) T ss_pred --CccccCC Confidence 1100001 No 27 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=94.88 E-value=0.0033 Score=34.21 Aligned_cols=347 Identities=10% Similarity=0.021 Sum_probs=131.7 Q ss_pred CC--hHHHHHhhhhhhCCC-----------cc-ccccccchhhhhhhh------hhhHHHHHhh--hhhccc-------- Q lcl|Aclame:pro 1 MK--KNALVQKWSALLENE-----------AL-PEIVGASKQAIIAKI------FENQEQDILT--APEYRD-------- 50 (519) Q Consensus 1 ~~--~~~l~~kw~p~l~~~-----------~~-~~~~~~~~~~~~~~~------~enq~~~~~~--~~~~~~-------- 50 (519) |+ .++|.+++.-+.+.- .+ .+....+. ++.+.+ +|..++.+.+ ...... T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~-~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVD-ELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchh Confidence 22 344454454444310 00 11111110 000000 0110010000 000000 Q ss_pred hh--hhhhhhhhhhhhhhc-cccccchhhhcc---ccccccc-cccCc-eehhhHHHHHhhhhhhhceeeccCCccchhh Q lcl|Aclame:pro 51 EK--ISEAFGSFLTEAEIG-GDHGYDATNIAA---GQTSGAV-TQIGP-AVMGMVRRAIPHLIAFDICGVQPLNNPTGQV 122 (519) Q Consensus 51 ~~--~~~~~~~~~~~~~~~-~~~g~~~~~~~e---st~tg~v-~~~~P-~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLI 122 (519) +. -.+.+..++...... +......+.... +.++++- .-.-| .+-.+++++-+.....++|.+.||++++.-+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:10 80 DLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 00 000000111000000 000000001000 0111100 01111 1122333334455566788888887654211 Q ss_pred eeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCC Q lcl|Aclame:pro 123 FALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATD 202 (519) Q Consensus 123 FAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~ 202 (519) . +...... ++ .| T Consensus 160 ~----~~~~~~~-----~a---------~~-------------------------------------------------- 171 (390) T protein:vir:10 160 V----QETGFVN-----NA---------AI-------------------------------------------------- 171 (390) T ss_pred E----EEecCCc-----ce---------ee-------------------------------------------------- Confidence 1 0000000 00 00 Q ss_pred ccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCC Q lcl|Aclame:pro 203 AAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGM 282 (519) Q Consensus 203 ~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGL 282 (519) ++ | +...++-..+++++++.+|..+....+|-||.||-- T Consensus 172 -------------------v~--------E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~----- 210 (390) T protein:vir:10 172 -------------------VA--------E---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP----- 210 (390) T ss_pred -------------------ec--------C---------CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH----- Confidence 00 1 011233334556677777777778899999999852 Q ss_pred CHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 283 DADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEI 362 (519) Q Consensus 283 DAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I 362 (519) |.++.|.+-|+..|...||+.||. | +-++..+.|++........... -....++..+..+...+ T Consensus 211 ~l~~~i~~~l~~~~~~~~~~~il~--------G----~G~~~~p~Gi~~~~~~~~~~~~----~~~~~~~~~~~~~~~~l 274 (390) T protein:vir:10 211 QLASYMNNRLIRGLKVKEDAEILR--------G----TGANDGLLGLIPQATTYAAPTT----IAGATRVDQLRLAMLQA 274 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh--------c----CCCCcccccccccccccccccc----ccccchHHHHHHHHHhh Confidence 468999999999999999998885 1 0011224555543221110000 00011222233333333 Q ss_pred HhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCc Q lcl|Aclame:pro 363 ARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEM 442 (519) Q Consensus 363 ~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 442 (519) .. .+..++.+|++|.....|...- +.. +...+..+.... .++|.| ++|++++..|.+-+++|--- T Consensus 275 ~~--~~~~~~~~v~n~~~~~~L~~lk--d~~----g~~l~~~~~~~~--~~~l~G-~pv~~~~~~p~~~~~~gdf~---- 339 (390) T protein:vir:10 275 SL--AEYPASGIVINPIDWAAIELAK--DAN----NQYLIGNARGTL--TPTLWG-LPVVATQAMAPGEFLVGAFD---- 339 (390) T ss_pred cc--ccCCCCEEEEcHHHHHHHHHhh--cCC----CceeecCCcCcC--Cceecc-eeeEEcCCCCCCcEEEEecc---- Confidence 32 2335678999999988887422 100 011111111111 245766 69999999887655555210 Q ss_pred cceeEeecccccccccccC----cccccceeeeeeeeceee-cCcccccccCCcceeecCCchhh Q lcl|Aclame:pro 443 DAGIYYAPYVALTPLRGSD----PKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIV 502 (519) Q Consensus 443 ~~~~fyaPYv~~~~~~~~d----p~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a 502 (519) .+++.+.. ....+...+ -.+-+=.+-...||+..+ +|= ....+. +| T Consensus 340 -~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~-------a~~~~~-----~a 390 (390) T protein:vir:10 340 -LAAQIFDQ-WDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE-------ALISGS-----FA 390 (390) T ss_pred -ceEEEEEe-cceEEEEeecccccccCcEEEEEEEeeccEEeccc-------cEEEEE-----eC Confidence 11112111 111111111 122222333446777543 121 112222 12 No 28 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=94.74 E-value=0.0036 Score=33.97 Aligned_cols=353 Identities=15% Similarity=0.165 Sum_probs=126.6 Q ss_pred CChHHHHHhhhhhhCCCccccccccc--hhhhhhhh-------------hhhHHHHHhhhhhccchh------------- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGAS--KQAIIAKI-------------FENQEQDILTAPEYRDEK------------- 52 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~--~~~~~~~~-------------~enq~~~~~~~~~~~~~~------------- 52 (519) |+-++|.|+++.+++.- -++.+.. .|.+.+.- |++|-+.+.+ .+.+..+ T Consensus 1 M~i~eL~e~r~~~~~~~--~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~-~~~~~~~~~~~~~~~~~~~~ 77 (435) T protein:vir:14 1 MNVNELRRERAAVNQRV--QALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEA-AERMAAAAAVPVDPNPTAVA 77 (435) T ss_pred CCHHHHHHHHHHHHHHH--HHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccchhhhhh Confidence 99999999999987631 1111100 01111100 0111110000 0000000 Q ss_pred ----------------hhhhhhhhhhhhh-hccc----------cccchhhhccccccccccccCceehh------hHHH Q lcl|Aclame:pro 53 ----------------ISEAFGSFLTEAE-IGGD----------HGYDATNIAAGQTSGAVTQIGPAVMG------MVRR 99 (519) Q Consensus 53 ----------------~~~~~~~~~~~~~-~~~~----------~g~~~~~~~est~tg~v~~~~P~L~~------l~Rr 99 (519) ....++.++.... ..++ .++. ...+...+++. ......|+| ++++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~t-~~~gg~~vP~~~~~~ii~~ 155 (435) T protein:vir:14 78 APAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFG-EEVAMSLNTLS-PGAGGVLVPENLSSEVIEL 155 (435) T ss_pred hccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhh-hhhhhhcccCC-cCCCccccchhHHHHHHHH Confidence 0000000000000 0000 0000 00000000000 000111211 1111 Q ss_pred HHhhhhhhhc-eeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccc Q lcl|Aclame:pro 100 AIPHLIAFDI-CGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSH 178 (519) Q Consensus 100 a~p~LIa~DI-~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~ 178 (519) +.++.+..++ +=+-||+... + +|+. +++ T Consensus 156 l~~~~~i~~~~~~~~~~~~~~-~------~~p~--------------------~~~------------------------ 184 (435) T protein:vir:14 156 LRPKSVVRKLGARTLPLSNGN-I------TIPR--------------------LKG------------------------ 184 (435) T ss_pred HhhhchhhhhcceeeecCCCc-e------EEEE--------------------EeC------------------------ Confidence 1122222221 1011111000 0 0000 000 Q ss_pred cccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEE Q lcl|Aclame:pro 179 FFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIE 258 (519) Q Consensus 179 ~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVt 258 (519) .....-++ | +..+++..-++++++.. T Consensus 185 -------------------------------------~~~a~~v~--------E---------~~~~~~~~~~f~~i~~~ 210 (435) T protein:vir:14 185 -------------------------------------GAIVGYIG--------A---------DTDIPTTQQQFDDLKLT 210 (435) T ss_pred -------------------------------------Ccceeeec--------c---------CccccccccceeEEEee Confidence 00000011 1 11233344456666666 Q ss_pred eecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccc Q lcl|Aclame:pro 259 AKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI 338 (519) Q Consensus 259 AKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~ 338 (519) ++..+-....|-||.+| +....+.|+.|.+.|+..|...+|+-|+. | +-+...+.|++.......+ T Consensus 211 ~~k~~~~~~iS~ell~d--s~~~~~l~~~i~~~l~~ai~~~~d~a~l~--------G----~G~~~~p~Gi~~~~~~~~~ 276 (435) T protein:vir:14 211 AKKMAALVPIANDLIKY--AGVNPNVDQIVVGDLTAAIGAREDKAFIR--------D----DGTANTPKGLRFWALPSNV 276 (435) T ss_pred eEEEEEeehhhHHHHHh--hccCHHHHHHHHHHHHHHHHHHHHHHhhc--------c----CCCCccccceeecccccce Confidence 77677777899999999 32233478888888888888888887773 1 0011234565443211110 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHh-hccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecC Q lcl|Aclame:pro 339 RGARWAGESFKALLFQIDKEAAEIAR-QTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGG 417 (519) Q Consensus 339 ~~~~~a~e~~r~L~~~i~~~a~~I~~-~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~ 417 (519) ...- ....+..+...+.++-..+.. ...+.. ...|++|.....|...- + .. ...+-.+.+. |+|.| T Consensus 277 ~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~v~n~~~~~~L~~lk--d-~~----G~~l~~~~~~----g~l~G 343 (435) T protein:vir:14 277 ITAS-DASTLQKIETDLGKVILALENADANLTQ-PGWIMAPRTFRFLEGLR--D-GN----GNKVYPELAN----GMLKG 343 (435) T ss_pred eccc-cccchhhHHHHHHHHHHHhhhccccccC-CEEEEcHHHHHHHHHhh--c-cC----CceeccCCCC----Ceeec Confidence 0000 001112222233333333332 123323 56799999998887532 1 00 0111122222 57777 Q ss_pred cEEEEecCCCccc--------eEE--------EEEecCCCccceeEeecccccccccccCcccc---cceeeeeeeecee Q lcl|Aclame:pro 418 KYRVYIDQYARSD--------YFT--------IGYKGSNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIG 478 (519) Q Consensus 418 ~~~vy~D~y~~~d--------y~~--------vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~---qP~~g~~tRY~l~ 478 (519) ++||++++.|.+ -++ ||..+.-. +-..||..........-..| |=.+=...|++.. T Consensus 344 -~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~ 418 (435) T protein:vir:14 344 -YPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFG 418 (435) T ss_pred -ceeEeeccccccccCCCccceEEEeecccEEEEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCce Confidence 699998876532 122 23222222 22333321111100000001 1233345566643 Q ss_pred ecCcccccccCCcceeecCCchhh Q lcl|Aclame:pro 479 INPFADPAAQAPTKRIQNGMPDIV 502 (519) Q Consensus 479 ~nP~~~~~~~~~~~~i~~~~d~~a 502 (519) + .+...-....|-.|-| T Consensus 419 ~-------~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 419 P-------RHVESIAVLAGVAWGA 435 (435) T ss_pred e-------ecccceEEEecCCCCC Confidence 3 1111122334445543 No 29 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=330 Identities=15% Similarity=0.109 Sum_probs=129.5 Q ss_pred CC-hHHHHHhhhhhhCCCcccccccc----------------chhhhhhhhhh------hHHHHHhhhhhc--------- Q lcl|Aclame:pro 1 MK-KNALVQKWSALLENEALPEIVGA----------------SKQAIIAKIFE------NQEQDILTAPEY--------- 48 (519) Q Consensus 1 ~~-~~~l~~kw~p~l~~~~~~~~~~~----------------~~~~~~~~~~e------nq~~~~~~~~~~--------- 48 (519) |+ .++|.+.|..+.+. +.+. --+++.+.|-+ -+++.+.+.+.. T Consensus 1 Mk~~~eL~~~~~~~~~~-----~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDK-----VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEE 75 (397) T ss_pred CchHHHHHHHHHHHHHH-----HHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 65 67787777776653 1111 00111111111 000000000000 Q ss_pred -----cchh-----hhhhhhhhhhhhhhccccccchhhhccccc-cccccccCceehhhHHHHHhhhhhhhceeeccCCc Q lcl|Aclame:pro 49 -----RDEK-----ISEAFGSFLTEAEIGGDHGYDATNIAAGQT-SGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNN 117 (519) Q Consensus 49 -----~~~~-----~~~~~~~~~~~~~~~~~~g~~~~~~~est~-tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTG 117 (519) ..+. ...++..+| . ++....... ...+++ .|.+.--....-.+++.+-+...-.+++.|+||++ T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~l---~-~~~~~~~~~-~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 150 (397) T protein:vir:49 76 KKPLTKNEEEVKANFVKDFKNLV---R-GRYQNLLDS-KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTT 150 (397) T ss_pred cccccchhhHHHHHHHHHHHHHh---h-cchhhHHHh-hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccC Confidence 0000 000011111 0 000000000 111111 11111101111234455556777789999999998 Q ss_pred cchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCC Q lcl|Aclame:pro 118 PTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVD 197 (519) Q Consensus 118 PTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ 197 (519) .+|-+- |....... +.+.|- T Consensus 151 ~~~~~~-----~~~~~~~~-----------~~a~~v-------------------------------------------- 170 (397) T protein:vir:49 151 LTGSRV-----YEKWADIT-----------GLAKLD-------------------------------------------- 170 (397) T ss_pred CcceEE-----EEeeccCC-----------cceeee-------------------------------------------- Confidence 876421 11110000 000000 Q ss_pred CCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 198 AGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLR 277 (519) Q Consensus 198 ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLK 277 (519) ++|-.. ..+....|.++ +..++.-+-...+|-||.+|-. T Consensus 171 -------------------------~E~~~~---------~~~~~~~~~~v-------~~~~~k~~~~~~iS~ell~ds~ 209 (397) T protein:vir:49 171 -------------------------DEGGQI---------GQNDDPKLSLI-------RYAIKRYAGISTVTNSLLADSA 209 (397) T ss_pred -------------------------cccccc---------ccccccceeee-------EeeeeeeEeehhhHHHHHhhhh Confidence 000000 00011224444 4444444455679999999853 Q ss_pred hhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 278 AVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDK 357 (519) Q Consensus 278 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~ 357 (519) +|.+++|.+-|+..|..-+|+.||.= +.+..+..++++++ -...|+..+ T Consensus 210 ----~~l~~~i~~~l~~~~~~~~d~ail~G------------~g~~~~~~~~~~~d-------------~i~~~~~~l-- 258 (397) T protein:vir:49 210 ----ENILAWLSGWIAKKVVVTRNKAILEA------------IGTLPNKPTLAKWD-------------DIIDLQAKV-- 258 (397) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHhc------------cccccccccccCHH-------------HHHHHHHhh-- Confidence 57799999999999999999998851 11122233444332 122333333 Q ss_pred HHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEe--cCCCcc------ Q lcl|Aclame:pro 358 EAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYI--DQYARS------ 429 (519) Q Consensus 358 ~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~------ 429 (519) . +.+.....+|++|.....|...= +.. +...+..+.+.. ..++|.| ++|++ |...+. T Consensus 259 -----~--~~~~~~a~~v~n~~~~~~l~~lk--d~~----g~~l~~~~~~~g-~~~~l~G-~pV~~~~~~~~~~~~~~~~ 323 (397) T protein:vir:49 259 -----D--PAIKQTSLFLTNTSGFTALKKVK--NAM----GDYLMERDVKSP-TGYSIDG-FVVKEISDRFLPNGTGGAM 323 (397) T ss_pred -----h--hhhcCCCEEEEcHHHHHHHHHhh--ccC----CceeecccccCC-CCceecc-eeeEEecccccccccCCce Confidence 2 22335578999999998887542 100 011111111111 1246877 46664 222221 Q ss_pred --------ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCcc-------cccccCCcce Q lcl|Aclame:pro 430 --------DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFA-------DPAAQAPTKR 493 (519) Q Consensus 430 --------dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~-------~~~~~~~~~~ 493 (519) +|++++..+.-. +-..||.. .+-...+-.+-...|++..+ +|-+ ...++.+ T Consensus 324 ~~~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~--- 390 (397) T protein:vir:49 324 PLYFGDLKQAVTLFDRQHLS----LLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKA--- 390 (397) T ss_pred eEEEeeccceEEEEeecccE----EEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEEecccccccC--- Confidence 122222222111 11222211 11123334444556665443 2211 1111111 Q ss_pred eecCCchhhhcccc Q lcl|Aclame:pro 494 IQNGMPDIVNSLGL 507 (519) Q Consensus 494 i~~~~d~~a~~~~~ 507 (519) ..-+++. T Consensus 391 -------~~~~~~~ 397 (397) T protein:vir:49 391 -------KLSTAGA 397 (397) T ss_pred -------cccccCC Confidence 1111222 No 30 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=93.80 E-value=0.0063 Score=32.64 Aligned_cols=311 Identities=13% Similarity=0.015 Sum_probs=128.8 Q ss_pred chhhhhhhhhhhhhhhhccccccchhhhccccccccccccCcee-hhhHHHHHhhhhhhhceeeccCCccchhheeeeee Q lcl|Aclame:pro 50 DEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAV-MGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAV 128 (519) Q Consensus 50 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L-~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsr 128 (519) -..++|..+.-+ |-+...-. +++++. -.-+.+ -.+++.+.+..+-..+|.+.||+++..-|.-. T Consensus 1 ~~~~~e~~~~~~---------~~~~~~~~--~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~--- 65 (338) T protein:vir:78 1 MATLNELAPNTA---------GSNHQGRL--AHVPSD-LLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTT--- 65 (338) T ss_pred CcchHHhhhhhc---------ccccccce--eccccc-ccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEE--- Confidence 011111111111 11111101 111111 111111 23455566677788999999998864333221 Q ss_pred ecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccc Q lcl|Aclame:pro 129 YGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDA 208 (519) Q Consensus 129 Y~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~ 208 (519) ... +.+.+-|... T Consensus 66 -~~~---------------~~a~~v~~~~--------------------------------------------------- 78 (338) T protein:vir:78 66 -VKR---------------PEVGQVGVGT--------------------------------------------------- 78 (338) T ss_pred -ecC---------------ccceeecccc--------------------------------------------------- Confidence 110 0110100000 Q ss_pred ccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHH Q lcl|Aclame:pro 209 AVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAEL 288 (519) Q Consensus 209 ~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaEL 288 (519) ...++ | +...++-.-+++.++...+..+-...+|-||.+|- ..|.|++| T Consensus 79 ----------~~~~~--------E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds----~~~~~~~i 127 (338) T protein:vir:78 79 ----------SNEQR--------E---------GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMN----PSGLYTKL 127 (338) T ss_pred ----------ccccc--------c---------cccccccccceeEEEEEEEEEEEeehhhHHHHhcC----HHHHHHHH Confidence 00001 1 11122222334455555555555667899999983 36789999 Q ss_pred HHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 289 SGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGR 368 (519) Q Consensus 289 sNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~r 368 (519) .+-|+..|...||..||.=.-...--+..++... ....+....+. .++ ....++..+.++...|...=.+ T Consensus 128 ~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~-~~~~~~~~~~~-------~~~--~~~~~~~~~~~~~~~~~~~~~~ 197 (338) T protein:vir:78 128 QADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTN-NVIVNTTNVDY-------LQT--GTTPLLDRFLDGYDLVSANTDV 197 (338) T ss_pred HHHHHHHHHHHHHHHhhcccCCCccccccccccc-ccccccccccc-------ccc--cchhhHHHHHHHHHHhhhhccc Confidence 9999999999999988851110000000011100 00001100000 000 1123444555555555433333 Q ss_pred cCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc---------eEEEE---- Q lcl|Aclame:pro 369 GAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD---------YFTIG---- 435 (519) Q Consensus 369 g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG---- 435 (519) ..+.+|++|+....|...-.+....+ ...+..+.+.. -.++|.| ++||++.+.+.+ -+++| T Consensus 198 -~~~~~~m~~~~~~~L~~~~~l~d~~g---~~l~~~~~~~~-~~~~l~G-~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~ 271 (338) T protein:vir:78 198 -DFNGWAADPRYRARLLRSQAYRDANG---NVDPTRINLAA-SAGDLLG-LPVQFGKAVGGDLGAATDSKVRVVGGDFSQ 271 (338) T ss_pred -cceEEEEchHHHHHHHHHhhhccCCC---ceeecccccCC-CCceeee-eeEEEccccCccccccCCcccEEEEEecce Confidence 44789999999887754321111100 00111111111 1257777 599998775521 22223 Q ss_pred ----EecCCCccceeEeecccccccccccCccc-----cc-ceee--eeeeecee-ecCcccccccCCcceeecCCchhh Q lcl|Aclame:pro 436 ----YKGSNEMDAGIYYAPYVALTPLRGSDPKN-----FQ-PVMG--FKTRYGIG-INPFADPAAQAPTKRIQNGMPDIV 502 (519) Q Consensus 436 ----~KG~~~~~~~~fyaPYv~~~~~~~~dp~s-----~q-P~~g--~~tRY~l~-~nP~~~~~~~~~~~~i~~~~d~~a 502 (519) ..+.-.+ =..+| .......||.. || --++ ...|++.. .||=+ -.++.++....| T Consensus 272 ~~~~~~~~~~i----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a-------~~~l~~~~~~~~ 338 (338) T protein:vir:78 272 LKYGFADEIRV----KMSDT--ATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQA-------FVKFVDDEDPDA 338 (338) T ss_pred EEEEeecccEE----EEeec--ccccccccccccchhhhhcCcEEEEEEEEeccEeecccc-------eEEEecccCCCC Confidence 2221110 00011 11111223321 11 1133 35688844 34322 356676655444 No 31 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=93.58 E-value=0.0071 Score=32.37 Aligned_cols=352 Identities=13% Similarity=0.129 Sum_probs=125.6 Q ss_pred CChHH----------HHHhhh-----hhhCCCccccccccc-----hhhhhhhhhhhHHHHHhhhhhccchhhhhhh--- Q lcl|Aclame:pro 1 MKKNA----------LVQKWS-----ALLENEALPEIVGAS-----KQAIIAKIFENQEQDILTAPEYRDEKISEAF--- 57 (519) Q Consensus 1 ~~~~~----------l~~kw~-----p~l~~~~~~~~~~~~-----~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~--- 57 (519) +++++ +.+|.. .+-+. +-++++.- .....+..+|.+++.+.+...-+...+.... T Consensus 36 ~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~--~ee~k~l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~ 113 (458) T protein:vir:10 36 MRKEQEEKELARMNDLVSKAVGEDRKRLEEA--LELVKSLDEKSKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRS 113 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 11111 111110 00000 00010000 0011111111111111111000000000000 Q ss_pred --------------------------hhhhhhhh-hccccccc-hhhhccccccccccccCc-ee-hhhHHHHHhhhhhh Q lcl|Aclame:pro 58 --------------------------GSFLTEAE-IGGDHGYD-ATNIAAGQTSGAVTQIGP-AV-MGMVRRAIPHLIAF 107 (519) Q Consensus 58 --------------------------~~~~~~~~-~~~~~g~~-~~~~~est~tg~v~~~~P-~L-~~l~Rra~p~LIa~ 107 (519) ...+.+.. .....+.. -+....+++.......-| .+ -.++.++.+..+.. T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~ 193 (458) T protein:vir:10 114 FVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVG 193 (458) T ss_pred hhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHH Confidence 00000000 00000000 000111111111110111 11 12444555677888 Q ss_pred hceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 108 DICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAH 187 (519) Q Consensus 108 DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~ 187 (519) ++|-++||+++..-++ . .... +.+.|-+.+... T Consensus 194 ~~~~~~~~~~~~~~~~-~------~~~~------------~~a~~v~e~~~~---------------------------- 226 (458) T protein:vir:10 194 ALFEELPMSSKILTML-V------EPDA------------GKATWVAASTYG---------------------------- 226 (458) T ss_pred hhcceeecCCcceEEE-E------ecCC------------cceeeccccccc---------------------------- Confidence 9999999988642111 0 0000 000111000000 Q ss_pred ccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeeccccccc Q lcl|Aclame:pro 188 FQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAS 267 (519) Q Consensus 188 ~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAE 267 (519) + .+.. .......| +++++.++.-+.... T Consensus 227 ----------~----------------------------~~~~-------~~~~~~~~-------~~i~~~~~k~~~~v~ 254 (458) T protein:vir:10 227 ----------T----------------------------DTTT-------GEEVKGAL-------KEIHFSTYKLAAKSF 254 (458) T ss_pred ----------c----------------------------cccc-------cccccccc-------eeeEeeeeeEEeeeh Confidence 0 0000 00111223 455555555556678 Q ss_pred ccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeecccccc------cccc Q lcl|Aclame:pro 268 YSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPID------IRGA 341 (519) Q Consensus 268 YTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d------~~~~ 341 (519) +|-||.+|-- .|.+++|.+-|...|..-||+.||. | +..+.+.|++......+ ..+. T Consensus 255 is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~~~l~--------G-----~G~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) T protein:vir:10 255 ITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAFMT--------G-----DGSGKPKGLLTLASEDSAKVVTEAKAD 317 (458) T ss_pred hhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhc--------C-----CCCCccceeeecccccccceeeccccc Confidence 9999988832 4678999999999999999998875 1 01123445544321111 0000 Q ss_pred chHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhc----CcccccccccccccccccCCCceEEEEecC Q lcl|Aclame:pro 342 RWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGG 417 (519) Q Consensus 342 ~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~ 417 (519) .-..-.+..| +++-+.+.. .+......||+|.....|... |...+.+.. .....+.++ ++|.| T Consensus 318 ~~~~~~~~~i----~~~~~~l~~--~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~---~~~~~~~~~----~~l~G 384 (458) T protein:vir:10 318 GSVLVTAKTI----SKLRRKLGR--HGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGN---DSVKLQGQV----GRIYG 384 (458) T ss_pred ccccccHHHH----HHHHHhhhh--hhcCCCEEEEcHHHHHHHHhhcccCCceeecccc---ccccccCcC----ceecc Confidence 0000011222 222223322 222446789999998888643 111111000 000111111 35776 Q ss_pred cEEEEecCCCcc-----ceEEEEEecCCCccceeEeecccccccccccCcccccceeeee--eeeceee-cCcccccccC Q lcl|Aclame:pro 418 KYRVYIDQYARS-----DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK--TRYGIGI-NPFADPAAQA 489 (519) Q Consensus 418 ~~~vy~D~y~~~-----dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~--tRY~l~~-nP~~~~~~~~ 489 (519) ++|+++.+.|. +.++..++ + +.++.. -..+....||-+-...++|. .|+|+.+ +| T Consensus 385 -~pv~~~~~~p~~~~~~~~~~~~f~-~-----~~~~~~--~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~-------- 447 (458) T protein:vir:10 385 -LPVVVSEYFPAKANSAEFAVIVYK-D-----NFVMPR--QRAVTVERERQAGKQRDAYYVTQRVNLQRYFA-------- 447 (458) T ss_pred -eeeEEccccccccCCcceEEEEec-c-----cEEEEE--eeceEEEeecccCCCceEEEEEEEecceEecc-------- Confidence 79999988764 22222221 1 011110 11111123555445556665 4665432 33 Q ss_pred CcceeecCCchhhhc Q lcl|Aclame:pro 490 PTKRIQNGMPDIVNS 504 (519) Q Consensus 490 ~~~~i~~~~d~~a~~ 504 (519) .+-|. ..+|.+ T Consensus 448 -~a~v~---~~~aa~ 458 (458) T protein:vir:10 448 -NGVVS---GTYAAS 458 (458) T ss_pred -cceEE---EeeccC Confidence 22222 123322 No 32 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=93.02 E-value=0.0091 Score=31.77 Aligned_cols=334 Identities=16% Similarity=0.097 Sum_probs=131.1 Q ss_pred CChHHHHH-------hhhh---hhCCCccccccccchh---hhhhhhhhhHHHHHhhhhh-ccch--------hhhhhhh Q lcl|Aclame:pro 1 MKKNALVQ-------KWSA---LLENEALPEIVGASKQ---AIIAKIFENQEQDILTAPE-YRDE--------KISEAFG 58 (519) Q Consensus 1 ~~~~~l~~-------kw~p---~l~~~~~~~~~~~~~~---~~~~~~~enq~~~~~~~~~-~~~~--------~~~~~~~ 58 (519) |+++ |++ ++.. +++.+.+-++.. .+. .+-++ ++.+++.+.+.+. ...+ ...+... T Consensus 1 M~k~-l~~l~e~~~~~~~e~~~~~~~~~~e~~~~-~~~ei~~l~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (371) T protein:vir:81 1 MPKE-LRELLEQINNKKEEARKLLAENKIEEAKK-LKEEIVALQEK-FDVAKELYEEQKQTIEDKEPLKPTVQVKENEVE 77 (371) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhhHHHHHHHHH-HHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccchhhHHHHHH Confidence 6642 222 2221 111111111110 000 00000 0111111100000 0000 0000001 Q ss_pred hhhhhhhhccccccchhhhcccccc-ccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCC Q lcl|Aclame:pro 59 SFLTEAEIGGDHGYDATNIAAGQTS-GAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAG 137 (519) Q Consensus 59 ~~~~~~~~~~~~g~~~~~~~est~t-g~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~ 137 (519) .|+.... +-....+..++++ |.+.--....-.+++++.++....+++.+.||++.++-+.-.+. .... T Consensus 78 ~~~~~l~-----~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~--~~~~---- 146 (371) T protein:vir:81 78 AFVNHIR-----TRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKR--SQQT---- 146 (371) T ss_pred HHHHHHH-----HHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee--cCCc---- Confidence 1111100 1111222222222 22111111112355666678888899999999887655432221 1000 Q ss_pred cccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccc Q lcl|Aclame:pro 138 AKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAG 217 (519) Q Consensus 138 ~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g 217 (519) ++ . T Consensus 147 --~a---------~------------------------------------------------------------------ 149 (371) T protein:vir:81 147 --GF---------V------------------------------------------------------------------ 149 (371) T ss_pred --ce---------e------------------------------------------------------------------ Confidence 00 0 Q ss_pred cceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 218 QLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIM 297 (519) Q Consensus 218 ~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEIm 297 (519) -+++|-. . ...+...|.+..++..|..+ ...+|-||.+|-. .|.++.|.+.|...|. T Consensus 150 ---~v~Eg~~--~-------~~~~~~~f~~i~~~~~k~~~-------~~~iS~ell~ds~----~~l~~~i~~~l~~a~~ 206 (371) T protein:vir:81 150 ---EVAEGAA--I-------GEKATPQFTLLQYQVKKYAG-------FFRVTNELLNDST----EAIVNTLVRWIGDESR 206 (371) T ss_pred ---eeccccc--c-------ccccccceeeEEeeeeEEEE-------eehhhHHHHhhhh----HHHHHHHHHHHHHHHH Confidence 0011100 0 00112335555555555554 4579999999853 4668899999999999 Q ss_pred HHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEc Q lcl|Aclame:pro 298 LEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIAS 377 (519) Q Consensus 298 lEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S 377 (519) .-+|+.|+.-... ..+.|+.++++ ...++... + ...+.....+|++ T Consensus 207 ~~~~~~i~~g~g~-------------~~~~~~~~~~~-------------i~~~~~~~------l--~~~~~~~a~~vmn 252 (371) T protein:vir:81 207 VTRNGLIINVLNT-------------KAKTAIADLDG-------------LKQIINVQ------L--DPVFRSTSSVIVN 252 (371) T ss_pred HHHHHHHHhhccc-------------ccccccccHHH-------------HHHHHHhh------c--chhhhcCCEEEEc Confidence 9888888772211 12234433321 12221110 1 1122234578999 Q ss_pred hHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccc---- Q lcl|Aclame:pro 378 RNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA---- 453 (519) Q Consensus 378 ~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~---- 453 (519) |.....|...= +.. ++..+..+.+. -..|+|.| ++||+..+.+...-.++--+. -...++|+.+-. T Consensus 253 ~~~~~~L~~lk--d~~----g~~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~--~~~~i~~Gd~~~~~~~ 322 (371) T protein:vir:81 253 QDAFNWLDTLK--DQN----GQYLLQPSISS-PTGRQLLG-LPVVIVSNKVLANRVDGGTGA--QFAPIIVGDLKEAVVM 322 (371) T ss_pred HHHHHHHHHhh--ccC----CCeeeecccCC-CCCceecc-eeEEEecccccCccccccccC--CcceEEEEehhceEEE Confidence 99998887531 100 01111112121 12367887 699988777643222111111 112244443221 Q ss_pred ---cccccccCcc------cccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcc Q lcl|Aclame:pro 454 ---LTPLRGSDPK------NFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) Q Consensus 454 ---~~~~~~~dp~------s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~ 505 (519) ..+.-.+++. +-+=.+-...||+..+ ||-+ ...+. ++. + T Consensus 323 ~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a-------~~~~~-----~~~-A 371 (371) T protein:vir:81 323 FDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEA-------FVFGE-----VQL-A 371 (371) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEE-----Eec-C Confidence 0111112322 2334555666776533 2211 11111 111 1 No 33 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=92.25 E-value=0.012 Score=31.07 Aligned_cols=351 Identities=10% Similarity=0.003 Sum_probs=127.0 Q ss_pred CCh---------HHHHHhhhhhhCC-Cc-cccccccchhhhhhhh--hhhHHHHHhhhhhc----cchhh---------- Q lcl|Aclame:pro 1 MKK---------NALVQKWSALLEN-EA-LPEIVGASKQAIIAKI--FENQEQDILTAPEY----RDEKI---------- 53 (519) Q Consensus 1 ~~~---------~~l~~kw~p~l~~-~~-~~~~~~~~~~~~~~~~--~enq~~~~~~~~~~----~~~~~---------- 53 (519) +.. +.+.++..--++. ++ ..|+..... ...+.+ ++.+.+++...... ..... T Consensus 27 ~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~-~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 105 (418) T protein:vir:10 27 VTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVD-ELLIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTE 105 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhh Confidence 110 1112222111100 00 111111100 000000 00110000000000 00000 Q ss_pred ---hhhhhhhhhhhhhccc---cccchhhhccccccccccccCceeh-hhHHHHHhhhhhhhceeeccCCccchhheeee Q lcl|Aclame:pro 54 ---SEAFGSFLTEAEIGGD---HGYDATNIAAGQTSGAVTQIGPAVM-GMVRRAIPHLIAFDICGVQPLNNPTGQVFALR 126 (519) Q Consensus 54 ---~~~~~~~~~~~~~~~~---~g~~~~~~~est~tg~v~~~~P~L~-~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMR 126 (519) ...+...+.+...... .-.+......+++++.-.-.-|.+. .+++.+.+..+..++|.+-||++++.- T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~----- 180 (418) T protein:vir:10 106 SEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIE----- 180 (418) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCcee----- Confidence 0000000000000000 0000011111111111111122221 344555667778888999999876521 Q ss_pred eeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccc Q lcl|Aclame:pro 127 AVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKL 206 (519) Q Consensus 127 srY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~ 206 (519) |.-.... .+.+.| T Consensus 181 --~~~~~~~-----------~~~a~~------------------------------------------------------ 193 (418) T protein:vir:10 181 --YTVETGF-----------TNNAAA------------------------------------------------------ 193 (418) T ss_pred --EEEEecC-----------CCceee------------------------------------------------------ Confidence 1100000 000000 Q ss_pred ccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHH Q lcl|Aclame:pro 207 DAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADA 286 (519) Q Consensus 207 ~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEa 286 (519) +++| ...++-..++++++..+|.-+-...+|-||.||.- |.++ T Consensus 194 ---------------v~E~-----------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~l~~ 236 (418) T protein:vir:10 194 ---------------VAEG-----------------AQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP-----ALQS 236 (418) T ss_pred ---------------eccC-----------------ccccccccceeeEEEeeeeEEEeehhhHHHHHhHH-----HHHH Confidence 0010 01122223455666666666667789999999852 4688 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 287 ELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQT 366 (519) Q Consensus 287 ELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T 366 (519) .|.+-|+..|..-+|+-||. | +-+...+.|++...........--. ...+..|..+-..+. . T Consensus 237 ~i~~~l~~a~~~~~d~a~l~--------G----~g~~~~p~Gi~~~~~~~~~~~~~~~----~~~~~~i~~~~~~~~--~ 298 (418) T protein:vir:10 237 YIDGRARYGLQLTEEGQILK--------G----DGTGANILGILPQASAFMPSITLAN----ATPIDKIRLALLQAV--L 298 (418) T ss_pred HHHHHHHHHHHHHHHHHHhc--------c----CCCCccccccccccccccccccccc----cccHHHHHHHHHhhc--c Confidence 88888888888888887764 1 0011123455433221111000000 011223333333332 2 Q ss_pred cccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCcccee Q lcl|Aclame:pro 367 GRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGI 446 (519) Q Consensus 367 ~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~ 446 (519) .+...+.+||+|.....|...= +.. +...+ .+.+.. -.|+|.| ++|+++++.+.+=+++|---. .. T Consensus 299 ~~~~~~~~v~n~~~~~~L~~lk--d~~----G~~i~-~~~~~~-~~~~l~G-~pV~~~~~~p~~~~~~gd~s~-----~~ 364 (418) T protein:vir:10 299 AEFPATGIVLNPIDWASIELTK--DSQ----GRYIV-GNPVNG-TTPRLWN-LPVVETQAMTANEFLVGAFSM-----AA 364 (418) T ss_pred ccCCCCEEEEcHHHHHHHHHhh--cCC----Cceec-cccccC-CCceecc-eeeEEcCCCCCCcEEEeeccc-----eE Confidence 3445678999999998886421 110 01111 111111 1257777 799999998865444442100 00 Q ss_pred EeecccccccccccCccc---cc---ceeeeeeeeceee-cCcccccccCCcceeecCCchhhhccc Q lcl|Aclame:pro 447 YYAPYVALTPLRGSDPKN---FQ---PVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLG 506 (519) Q Consensus 447 fyaPYv~~~~~~~~dp~s---~q---P~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~ 506 (519) +. +.-..+.-.+|+.. |+ =.+=+..|++..+ +|=+ ..++ ....-. +| T Consensus 365 ~~--~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a-------~~~~-~~~~~~---~g 418 (418) T protein:vir:10 365 QI--FDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPES-------FVTG-ALVEQA---GG 418 (418) T ss_pred EE--EEecceEEEEecccchhhhcCceEEEEEEeeccEEecccc-------eEEE-EeccCC---CC Confidence 00 10001101112211 22 2333455676543 1211 1111 111111 22 No 34 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=92.06 E-value=0.013 Score=30.92 Aligned_cols=352 Identities=18% Similarity=0.132 Sum_probs=130.8 Q ss_pred CC------hHHHHHhhhhh---hCCCccc-------cccccchhhhhhhhhhhHHH--HHh----------------hhh Q lcl|Aclame:pro 1 MK------KNALVQKWSAL---LENEALP-------EIVGASKQAIIAKIFENQEQ--DIL----------------TAP 46 (519) Q Consensus 1 ~~------~~~l~~kw~p~---l~~~~~~-------~~~~~~~~~~~~~~~enq~~--~~~----------------~~~ 46 (519) |. .+++.++++.. |..+..- |+.+ +-++|-+.|++ .+. +++ T Consensus 7 l~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~-----l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (415) T protein:vir:79 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITD-----LRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEAR 81 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhhhhhcccccccchhh Confidence 00 11122222111 1111110 1111 11111111110 000 000 Q ss_pred hccch-----hhhhhhhhhhhhhhh-----ccccccchhhhccccccccccccCcee--hhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 47 EYRDE-----KISEAFGSFLTEAEI-----GGDHGYDATNIAAGQTSGAVTQIGPAV--MGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 47 ~~~~~-----~~~~~~~~~~~~~~~-----~~~~g~~~~~~~est~tg~v~~~~P~L--~~l~Rra~p~LIa~DI~GVQP 114 (519) ..+.. .........+.+++. ....+.......-++..|.. .-|.- -.++|++.+...-.+++.|.| T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~--~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:79 82 TYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhcccccccccc--ccchHHHHHHHHHHHhhhhhhhheeeee Confidence 00000 000000000000000 00000011111111111211 12221 224555566777889999999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++..+-+--.|.. ... .+ .| T Consensus 160 ~~~~~~~~~~~~~~--~~~------~~---------~~------------------------------------------ 180 (415) T protein:vir:79 160 VTNGSGKYPVVRQS--EVA------AL---------EK------------------------------------------ 180 (415) T ss_pred ccCCceeEEEEeec--CCc------cc---------ee------------------------------------------ Confidence 99877643222111 000 00 00 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) +++|- +. ...+...|.+..|++.| .+-...+|-||.+ T Consensus 181 ---------------------------v~E~~-----~~----~~~~~~~~~~v~~~~~k-------~~~~~~iS~ell~ 217 (415) T protein:vir:79 181 ---------------------------VEELE-----EN----PELAVKPFFQLAYDINT-------HRGYFRISREAIE 217 (415) T ss_pred ---------------------------ecccc-----cc----CcccccceeeEEeeeee-------eEeeehhhHHHHh Confidence 00000 00 00011234444444444 4455679999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccccccccccee-eeccccccccccchHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGV-FDFQDPIDIRGARWAGESFKALLF 353 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~-fDl~~~~d~~~~~~a~e~~r~L~~ 353 (519) |- ..|.+++|.+-|+..|..-+|+.|+.-.....-.+. .... ...++ ...... -..+. T Consensus 218 ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~--~~~~--~~~~~~~~~~~~-------~~~~~------ 276 (415) T protein:vir:79 218 DA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST--SSGF--EKEGKKLEVKKA-------KSLDD------ 276 (415) T ss_pred hc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc--cccc--cccccccccccc-------cchhH------ Confidence 84 357899999999999999999999873322111110 0000 00000 000000 01122 Q ss_pred HHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEE Q lcl|Aclame:pro 354 QIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFT 433 (519) Q Consensus 354 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 433 (519) |..+-+.+.. .+-+++.+||+|.....|...- +.. +.-.+..+.+.. ..++|.| ++|++.++.+.. T Consensus 277 -i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~lk--d~~----G~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~--- 342 (415) T protein:vir:79 277 -IKDAINLNVK--PNYEHNVAIVSQTMFAKLDKMK--DKL----GNYLIQPDVKEK-TQQRLLG-AKIEILPDEVLG--- 342 (415) T ss_pred -HHHHHHhhhh--hccCCCEEEEcHHHHHHHHHhh--ccC----CceeeccCcCCC-CCceecc-eeeEEecccccC--- Confidence 2233333322 1224578899999998887531 100 011111222211 2247777 688887765521 Q ss_pred EEEecCCCccceeEeec----cc----ccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecC-Cchhhh Q lcl|Aclame:pro 434 IGYKGSNEMDAGIYYAP----YV----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-MPDIVN 503 (519) Q Consensus 434 vG~KG~~~~~~~~fyaP----Yv----~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~d~~a~ 503 (519) -.|+ ..++|+- |+ ....+...|-.+++..+....|++..+ +|-+- ..+.-. ...-.| T Consensus 343 --~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~-------~~~~~~~~~~~~~ 409 (415) T protein:vir:79 343 --QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSA-------IVIEYDDSERGEG 409 (415) T ss_pred --CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccE-------EEEEEeccCCCCC Confidence 1111 1122221 11 111122235567778888888998654 22221 111100 000011 Q ss_pred cccchh Q lcl|Aclame:pro 504 SLGLNG 509 (519) Q Consensus 504 ~~~~~~ 509 (519) ..+.-. T Consensus 410 ~~~~~~ 415 (415) T protein:vir:79 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 111111 No 35 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=92.06 E-value=0.013 Score=30.92 Aligned_cols=352 Identities=18% Similarity=0.132 Sum_probs=130.8 Q ss_pred CC------hHHHHHhhhhh---hCCCccc-------cccccchhhhhhhhhhhHHH--HHh----------------hhh Q lcl|Aclame:pro 1 MK------KNALVQKWSAL---LENEALP-------EIVGASKQAIIAKIFENQEQ--DIL----------------TAP 46 (519) Q Consensus 1 ~~------~~~l~~kw~p~---l~~~~~~-------~~~~~~~~~~~~~~~enq~~--~~~----------------~~~ 46 (519) |. .+++.++++.. |..+..- |+.+ +-++|-+.|++ .+. +++ T Consensus 7 l~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~-----l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (415) T protein:vir:98 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITD-----LRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEAR 81 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhhhhhcccccccchhh Confidence 00 11122222111 1111110 1111 11111111110 000 000 Q ss_pred hccch-----hhhhhhhhhhhhhhh-----ccccccchhhhccccccccccccCcee--hhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 47 EYRDE-----KISEAFGSFLTEAEI-----GGDHGYDATNIAAGQTSGAVTQIGPAV--MGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 47 ~~~~~-----~~~~~~~~~~~~~~~-----~~~~g~~~~~~~est~tg~v~~~~P~L--~~l~Rra~p~LIa~DI~GVQP 114 (519) ..+.. .........+.+++. ....+.......-++..|.. .-|.- -.++|++.+...-.+++.|.| T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~--~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:98 82 TYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhcccccccccc--ccchHHHHHHHHHHHhhhhhhhheeeee Confidence 00000 000000000000000 00000011111111111211 12221 224555566777889999999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++..+-+--.|.. ... .+ .| T Consensus 160 ~~~~~~~~~~~~~~--~~~------~~---------~~------------------------------------------ 180 (415) T protein:vir:98 160 VTNGSGKYPVVRQS--EVA------AL---------EK------------------------------------------ 180 (415) T ss_pred ccCCceeEEEEeec--CCc------cc---------ee------------------------------------------ Confidence 99877643222111 000 00 00 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) +++|- +. ...+...|.+..|++.| .+-...+|-||.+ T Consensus 181 ---------------------------v~E~~-----~~----~~~~~~~~~~v~~~~~k-------~~~~~~iS~ell~ 217 (415) T protein:vir:98 181 ---------------------------VEELE-----EN----PELAVKPFFQLAYDINT-------HRGYFRISREAIE 217 (415) T ss_pred ---------------------------ecccc-----cc----CcccccceeeEEeeeee-------eEeeehhhHHHHh Confidence 00000 00 00011234444444444 4455679999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccccccccccee-eeccccccccccchHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGV-FDFQDPIDIRGARWAGESFKALLF 353 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~-fDl~~~~d~~~~~~a~e~~r~L~~ 353 (519) |- ..|.+++|.+-|+..|..-+|+.|+.-.....-.+. .... ...++ ...... -..+. T Consensus 218 ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~--~~~~--~~~~~~~~~~~~-------~~~~~------ 276 (415) T protein:vir:98 218 DA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST--SSGF--EKEGKKLEVKKA-------KSLDD------ 276 (415) T ss_pred hc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc--cccc--cccccccccccc-------cchhH------ Confidence 84 357899999999999999999999873322111110 0000 00000 000000 01122 Q ss_pred HHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEE Q lcl|Aclame:pro 354 QIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFT 433 (519) Q Consensus 354 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 433 (519) |..+-+.+.. .+-+++.+||+|.....|...- +.. +.-.+..+.+.. ..++|.| ++|++.++.+.. T Consensus 277 -i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~lk--d~~----G~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~--- 342 (415) T protein:vir:98 277 -IKDAINLNVK--PNYEHNVAIVSQTMFAKLDKMK--DKL----GNYLIQPDVKEK-TQQRLLG-AKIEILPDEVLG--- 342 (415) T ss_pred -HHHHHHhhhh--hccCCCEEEEcHHHHHHHHHhh--ccC----CceeeccCcCCC-CCceecc-eeeEEecccccC--- Confidence 2233333322 1224578899999998887531 100 011111222211 2247777 688887765521 Q ss_pred EEEecCCCccceeEeec----cc----ccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecC-Cchhhh Q lcl|Aclame:pro 434 IGYKGSNEMDAGIYYAP----YV----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-MPDIVN 503 (519) Q Consensus 434 vG~KG~~~~~~~~fyaP----Yv----~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~d~~a~ 503 (519) -.|+ ..++|+- |+ ....+...|-.+++..+....|++..+ +|-+- ..+.-. ...-.| T Consensus 343 --~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~-------~~~~~~~~~~~~~ 409 (415) T protein:vir:98 343 --QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSA-------IVIEYDDSERGEG 409 (415) T ss_pred --CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccE-------EEEEEeccCCCCC Confidence 1111 1122221 11 111122235567778888888998654 22221 111100 000011 Q ss_pred cccchh Q lcl|Aclame:pro 504 SLGLNG 509 (519) Q Consensus 504 ~~~~~~ 509 (519) ..+.-. T Consensus 410 ~~~~~~ 415 (415) T protein:vir:98 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 111111 No 36 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=92.06 E-value=0.013 Score=30.92 Aligned_cols=352 Identities=18% Similarity=0.132 Sum_probs=130.8 Q ss_pred CC------hHHHHHhhhhh---hCCCccc-------cccccchhhhhhhhhhhHHH--HHh----------------hhh Q lcl|Aclame:pro 1 MK------KNALVQKWSAL---LENEALP-------EIVGASKQAIIAKIFENQEQ--DIL----------------TAP 46 (519) Q Consensus 1 ~~------~~~l~~kw~p~---l~~~~~~-------~~~~~~~~~~~~~~~enq~~--~~~----------------~~~ 46 (519) |. .+++.++++.. |..+..- |+.+ +-++|-+.|++ .+. +++ T Consensus 7 l~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~-----l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (415) T protein:vir:81 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITD-----LRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEAR 81 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhhhhhcccccccchhh Confidence 00 11122222111 1111110 1111 11111111110 000 000 Q ss_pred hccch-----hhhhhhhhhhhhhhh-----ccccccchhhhccccccccccccCcee--hhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 47 EYRDE-----KISEAFGSFLTEAEI-----GGDHGYDATNIAAGQTSGAVTQIGPAV--MGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 47 ~~~~~-----~~~~~~~~~~~~~~~-----~~~~g~~~~~~~est~tg~v~~~~P~L--~~l~Rra~p~LIa~DI~GVQP 114 (519) ..+.. .........+.+++. ....+.......-++..|.. .-|.- -.++|++.+...-.+++.|.| T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~--~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:81 82 TYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhcccccccccc--ccchHHHHHHHHHHHhhhhhhhheeeee Confidence 00000 000000000000000 00000011111111111211 12221 224555566777889999999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++..+-+--.|.. ... .+ .| T Consensus 160 ~~~~~~~~~~~~~~--~~~------~~---------~~------------------------------------------ 180 (415) T protein:vir:81 160 VTNGSGKYPVVRQS--EVA------AL---------EK------------------------------------------ 180 (415) T ss_pred ccCCceeEEEEeec--CCc------cc---------ee------------------------------------------ Confidence 99877643222111 000 00 00 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) +++|- +. ...+...|.+..|++.| .+-...+|-||.+ T Consensus 181 ---------------------------v~E~~-----~~----~~~~~~~~~~v~~~~~k-------~~~~~~iS~ell~ 217 (415) T protein:vir:81 181 ---------------------------VEELE-----EN----PELAVKPFFQLAYDINT-------HRGYFRISREAIE 217 (415) T ss_pred ---------------------------ecccc-----cc----CcccccceeeEEeeeee-------eEeeehhhHHHHh Confidence 00000 00 00011234444444444 4455679999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccccccccccee-eeccccccccccchHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGV-FDFQDPIDIRGARWAGESFKALLF 353 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~-fDl~~~~d~~~~~~a~e~~r~L~~ 353 (519) |- ..|.+++|.+-|+..|..-+|+.|+.-.....-.+. .... ...++ ...... -..+. T Consensus 218 ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~--~~~~--~~~~~~~~~~~~-------~~~~~------ 276 (415) T protein:vir:81 218 DA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST--SSGF--EKEGKKLEVKKA-------KSLDD------ 276 (415) T ss_pred hc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc--cccc--cccccccccccc-------cchhH------ Confidence 84 357899999999999999999999873322111110 0000 00000 000000 01122 Q ss_pred HHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEE Q lcl|Aclame:pro 354 QIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFT 433 (519) Q Consensus 354 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 433 (519) |..+-+.+.. .+-+++.+||+|.....|...- +.. +.-.+..+.+.. ..++|.| ++|++.++.+.. T Consensus 277 -i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~lk--d~~----G~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~--- 342 (415) T protein:vir:81 277 -IKDAINLNVK--PNYEHNVAIVSQTMFAKLDKMK--DKL----GNYLIQPDVKEK-TQQRLLG-AKIEILPDEVLG--- 342 (415) T ss_pred -HHHHHHhhhh--hccCCCEEEEcHHHHHHHHHhh--ccC----CceeeccCcCCC-CCceecc-eeeEEecccccC--- Confidence 2233333322 1224578899999998887531 100 011111222211 2247777 688887765521 Q ss_pred EEEecCCCccceeEeec----cc----ccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecC-Cchhhh Q lcl|Aclame:pro 434 IGYKGSNEMDAGIYYAP----YV----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-MPDIVN 503 (519) Q Consensus 434 vG~KG~~~~~~~~fyaP----Yv----~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~d~~a~ 503 (519) -.|+ ..++|+- |+ ....+...|-.+++..+....|++..+ +|-+- ..+.-. ...-.| T Consensus 343 --~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~-------~~~~~~~~~~~~~ 409 (415) T protein:vir:81 343 --QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSA-------IVIEYDDSERGEG 409 (415) T ss_pred --CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccE-------EEEEEeccCCCCC Confidence 1111 1122221 11 111122235567778888888998654 22221 111100 000011 Q ss_pred cccchh Q lcl|Aclame:pro 504 SLGLNG 509 (519) Q Consensus 504 ~~~~~~ 509 (519) ..+.-. T Consensus 410 ~~~~~~ 415 (415) T protein:vir:81 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 111111 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=91.87 E-value=0.014 Score=30.77 Aligned_cols=269 Identities=13% Similarity=0.036 Sum_probs=119.8 Q ss_pred ccccccccccccccccccccccccccccccccCCC--CCCCccccccccccccccccceecccccchhhhhhcccCCCCC Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDA--GATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGST 241 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~a--g~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~ 241 (519) .+..+ +...+...+.... .-............ ...+.. +.. ..|....++.=-....++-. +. T Consensus 1 MA~~~-T~~~~~~iPev~s--~~v~~~~~~~~~~~~~~~~~~~-~~g------~~G~tv~iP~~~~~~~a~~v---~e-- 65 (272) T protein:vir:30 1 MAVGT-TKMAQMLDPEVLA--DMIDAEVGKAIRFAPLAEVDTT-LEG------QPGTTLTVPKWDYIGDAEDV---AE-- 65 (272) T ss_pred CCCcc-ccchheechHHHH--HHHHHHHHHHhhhhcccccccc-ccC------CCCCEEEEEEecCCCCcccc---cC-- Confidence 11111 0111111110000 00000000000000 000000 000 01111111100001111111 00 Q ss_pred ccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccc Q lcl|Aclame:pro 242 DNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTN 321 (519) Q Consensus 242 ~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~ 321 (519) +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|..+|+.+|+..+..... +. T Consensus 66 g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~------~~ 135 (272) T protein:vir:30 66 GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ------TV 135 (272) T ss_pred CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cc Confidence 1233344455777888888887667777666543 2579999999999999999999999985532111 00 Q ss_pred cccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccc Q lcl|Aclame:pro 322 TVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQG 401 (519) Q Consensus 322 ~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~ 401 (519) .+... .+-+-.+..++.++ ....+++||+|.++..|.......+..+. .. T Consensus 136 -----~~~~t-------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~---~~ 185 (272) T protein:vir:30 136 -----EATAT-------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGAT---EV 185 (272) T ss_pred -----ccccC-------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhccccccccc---cc Confidence 11111 12222233333222 23468999999999999766544332211 11 Q ss_pred ccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-c Q lcl|Aclame:pro 402 FNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-N 480 (519) Q Consensus 402 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-n 480 (519) . .+...+-.+|.+.| ++|+++++.+.+=+++.-+|.- +++-..- .......|+.+++=.+-..-|||+.+ | T Consensus 186 ~-~~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~--~~ve~~r~~~~~~~~i~~~~~~~~~v~~ 257 (272) T protein:vir:30 186 G-ANRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKRN--TMVETDRDITKAINQIVANKHYGVYLYK 257 (272) T ss_pred c-ccccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecCC--ceeeeccccccceeEEEEEEEEEEEEEc Confidence 1 11112223567776 7999999998654333323311 1111111 12222358888988888888999753 2 Q ss_pred CcccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) |- .-.++.- +-+..| T Consensus 258 ~~-------~vv~~t~-----~~a~~~ 272 (272) T protein:vir:30 258 AE-------KAVKITL-----KDAAKK 272 (272) T ss_pred CC-------ceEEEEe-----cccccC Confidence 21 1122221 111222 No 38 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=91.87 E-value=0.014 Score=30.77 Aligned_cols=269 Identities=13% Similarity=0.036 Sum_probs=119.8 Q ss_pred ccccccccccccccccccccccccccccccccCCC--CCCCccccccccccccccccceecccccchhhhhhcccCCCCC Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDA--GATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGST 241 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~a--g~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~ 241 (519) .+..+ +...+...+.... .-............ ...+.. +.. ..|....++.=-....++-. +. T Consensus 1 MA~~~-T~~~~~~iPev~s--~~v~~~~~~~~~~~~~~~~~~~-~~g------~~G~tv~iP~~~~~~~a~~v---~e-- 65 (272) T protein:vir:98 1 MAVGT-TKMAQMLDPEVLA--DMIDAEVGKAIRFAPLAEVDTT-LEG------QPGTTLTVPKWDYIGDAEDV---AE-- 65 (272) T ss_pred CCCcc-ccchheechHHHH--HHHHHHHHHHhhhhcccccccc-ccC------CCCCEEEEEEecCCCCcccc---cC-- Confidence 11111 0111111110000 00000000000000 000000 000 01111111100001111111 00 Q ss_pred ccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccc Q lcl|Aclame:pro 242 DNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTN 321 (519) Q Consensus 242 ~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~ 321 (519) +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|..+|+.+|+..+..... +. T Consensus 66 g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~------~~ 135 (272) T protein:vir:98 66 GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ------TV 135 (272) T ss_pred CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cc Confidence 1233344455777888888887667777666543 2579999999999999999999999985532111 00 Q ss_pred cccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccc Q lcl|Aclame:pro 322 TVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQG 401 (519) Q Consensus 322 ~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~ 401 (519) .+... .+-+-.+..++.++ ....+++||+|.++..|.......+..+. .. T Consensus 136 -----~~~~t-------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~---~~ 185 (272) T protein:vir:98 136 -----EATAT-------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGAT---EV 185 (272) T ss_pred -----ccccC-------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhccccccccc---cc Confidence 11111 12222233333222 23468999999999999766544332211 11 Q ss_pred ccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-c Q lcl|Aclame:pro 402 FNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-N 480 (519) Q Consensus 402 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-n 480 (519) . .+...+-.+|.+.| ++|+++++.+.+=+++.-+|.- +++-..- .......|+.+++=.+-..-|||+.+ | T Consensus 186 ~-~~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~--~~ve~~r~~~~~~~~i~~~~~~~~~v~~ 257 (272) T protein:vir:98 186 G-ANRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKRN--TMVETDRDITKAINQIVANKHYGVYLYK 257 (272) T ss_pred c-ccccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecCC--ceeeeccccccceeEEEEEEEEEEEEEc Confidence 1 11112223567776 7999999998654333323311 1111111 12222358888988888888999753 2 Q ss_pred CcccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) |- .-.++.- +-+..| T Consensus 258 ~~-------~vv~~t~-----~~a~~~ 272 (272) T protein:vir:98 258 AE-------KAVKITL-----KDAAKK 272 (272) T ss_pred CC-------ceEEEEe-----cccccC Confidence 21 1122221 111222 No 39 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=91.61 E-value=0.015 Score=30.57 Aligned_cols=310 Identities=13% Similarity=0.051 Sum_probs=118.7 Q ss_pred chhhhhhhhhhhhhhhhccccccchhhhccccccccccccCc-eehhhHHHHHhhhhhhhceeeccCCccchhheeeeee Q lcl|Aclame:pro 50 DEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGP-AVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAV 128 (519) Q Consensus 50 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P-~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsr 128 (519) =+.++|..+... |-+...-..++.++ ..-+ +.-.+++.+.+..+..+++-+.||++..- + T Consensus 1 ~a~l~el~~~~~---------~~~~~g~~~~~~~~---liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~-------~ 61 (333) T protein:vir:78 1 MATLNELLPNSA---------GSNHQGRLAHVPSD---LLPKEIVGPIFDKAQESSLVLRMGEQIPISYGET-------I 61 (333) T ss_pred CchhHHhhhhcc---------cccccCceecCCcc---ccchhHHHHHHHHHHhhchhhhhcceeeccCCce-------E Confidence 111111111100 00000000001010 1111 11224555556777888888999875322 1 Q ss_pred ecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccc Q lcl|Aclame:pro 129 YGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDA 208 (519) Q Consensus 129 Y~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~ 208 (519) +...... +.+.|-+. T Consensus 62 ~p~~~~~------------~~a~~v~e----------------------------------------------------- 76 (333) T protein:vir:78 62 IPTTVKR------------PEVGQVGV----------------------------------------------------- 76 (333) T ss_pred EEEEeCC------------ceeEeecC----------------------------------------------------- Confidence 1111000 00111110 Q ss_pred ccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHH Q lcl|Aclame:pro 209 AVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAEL 288 (519) Q Consensus 209 ~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaEL 288 (519) |-.....|.- .-..++..|.+..++..|..+ -...|-||.+|-. .|.|++| T Consensus 77 ----------------g~~~~~~e~~--~~~~~~~~f~~i~l~~~kl~~-------~~~is~ell~~s~----~~~~~~i 127 (333) T protein:vir:78 77 ----------------GTSNEQREGG--LKPLSGTAWDTRSVSPIKLAT-------IVTVSEEFARMNP----SGLYTKL 127 (333) T ss_pred ----------------cccccccccc--cccccccceeEEEEeeEEEEE-------eehhhHHHHhcCH----HHHHHHH Confidence 0000011100 001123445555555555554 4467888887644 4679999 Q ss_pred HHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 289 SGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGR 368 (519) Q Consensus 289 sNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~r 368 (519) .+.|...|...|+..+|.=-......+..++... .++..... ...........+..|.++-..+...-.+ T Consensus 128 ~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~----~~~~~~~~------~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 197 (333) T protein:vir:78 128 QGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTD----NVIANTTN------VDYLQETGDPLLDRLLDGYDLVSANTDV 197 (333) T ss_pred HHHHHHHHHHHHHHHHhcccCCCCCccccccccc----cccccccc------ccccccccchhHHHHHHHHHhhcccccc Confidence 9999999999999999851111111111111100 01100000 0000111111222333333333333333 Q ss_pred cCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc---------eEEEE---- Q lcl|Aclame:pro 369 GAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD---------YFTIG---- 435 (519) Q Consensus 369 g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG---- 435 (519) ..+.+|+.|.-...|.....+.... +...+..+... .-.|+|.| ++|+++.+.+.+ .+++| T Consensus 198 -~~~~~vmn~~~~~~L~~~~~~~d~~---G~~i~~~~~~~-~~~~~l~G-~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~ 271 (333) T protein:vir:78 198 -EFNGWAVDPRFRAHLLRAQAYRDAN---GNVDPSRINLA-AQTGDVLG-LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ 271 (333) T ss_pred -CceEEEEcchHHHHHHHHhhhcCCC---CceeecCcccc-CCCceeec-eeeEEccccCCCccccCCCccEEEEEeccc Confidence 4478889998877765432211110 00011111111 01257777 699998876643 23333 Q ss_pred ----EecCCCccceeEeecccccccccccCccccc-ceee--eeeeecee-ecCcccccccCCcceeecC-Cc Q lcl|Aclame:pro 436 ----YKGSNEMDAGIYYAPYVALTPLRGSDPKNFQ-PVMG--FKTRYGIG-INPFADPAAQAPTKRIQNG-MP 499 (519) Q Consensus 436 ----~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~q-P~~g--~~tRY~l~-~nP~~~~~~~~~~~~i~~~-~d 499 (519) ..+..+. -..+|.-.......--.-|| -.++ ...|++.. .+|-+ -.+|... .| T Consensus 272 ~~~g~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a-------~~~l~~~~a~ 333 (333) T protein:vir:78 272 LKFGFADEIRI----KMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQA-------FVKFVDDEQP 333 (333) T ss_pred EEEEEeeccEE----EEeccccccccccceeehhhcCcEEEEEEEEEccEEecccc-------eEEEeccCCC Confidence 2222111 11222110000000000111 1122 34577754 34421 2334433 13 No 40 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=91.40 E-value=0.016 Score=30.43 Aligned_cols=339 Identities=14% Similarity=0.069 Sum_probs=130.0 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhh---------hhhHHHHHh---hhhhccchhhhhhh----------- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKI---------FENQEQDIL---TAPEYRDEKISEAF----------- 57 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~---------~enq~~~~~---~~~~~~~~~~~~~~----------- 57 (519) |+.++|.++|..+.+. ++...+ .+-..+ ++...+++. +..+-+.+++.+.+ T Consensus 5 m~i~el~~~~~~~~~~-----~~~~~~-e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (408) T protein:vir:74 5 LTVNQLNEAWIASGDK-----VTDFND-QINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEE 78 (408) T ss_pred hhHHHHHHHHHHHHHH-----HHHHHH-HHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 8889999999888654 322111 110000 001111110 00000001111100 Q ss_pred ----------------hhhhhhhhhcccc---ccchhhhccccc-ccccc---ccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 58 ----------------GSFLTEAEIGGDH---GYDATNIAAGQT-SGAVT---QIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 58 ----------------~~~~~~~~~~~~~---g~~~~~~~est~-tg~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) ..|+... -.+.+ .-....+..++. .|.+. .+.+ .+++.+.+.....++++++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~---~Ii~~~~~~~~l~~~~~~~~ 154 (408) T protein:vir:74 79 KGPLNKSENELKDKFVKDFVNMV-RNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRT---MINTLVRQYDSLQQYVRVES 154 (408) T ss_pred cccccchhhhhHHHHHHHHHHHH-hcchhhhhhhhhhhhcccccCCCceeechhHhh---HHHHHHhhhcchhhhcceee Confidence 0011000 00000 000111111111 11111 1222 23344445666788899999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++.+|-+--.+-. .... ...| T Consensus 155 ~~~~~~~~~~~~~~--~~~~--------------~~~~------------------------------------------ 176 (408) T protein:vir:74 155 VSTSSGSRVYEKWT--DVTP--------------LKAM------------------------------------------ 176 (408) T ss_pred ccCCcceEEEEeec--CCcc--------------cccc------------------------------------------ Confidence 99887654222110 0000 0000 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccce-eEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMG-FRIDKQVIEAKSRQLKASYSIELA 273 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYTmELA 273 (519) +++| ...++.+ .+++++++..+..+-...+|-||. T Consensus 177 ---------------------------v~E~-----------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell 212 (408) T protein:vir:74 177 ---------------------------DEED-----------------GKIPDLDNPRLTIIKYLIKRYAGIITATNTLL 212 (408) T ss_pred ---------------------------cccc-----------------cccccccccceeeEEeeeeeEEeeehhHHHHH Confidence 0000 0111111 233445555555555567999999 Q ss_pred HHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHH Q lcl|Aclame:pro 274 QDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLF 353 (519) Q Consensus 274 QDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~ 353 (519) +|- .+|.+++|.+-|+..|..-+|+.||. | +.++....++.++++ |.. T Consensus 213 ~ds----~~~l~~~i~~~l~~~~~~~~d~~il~--------G----~G~~~~~~~~~~~~~----------------i~~ 260 (408) T protein:vir:74 213 KDT----AENILAWLSSWIAKKVVVTRNQAIIA--------A----MGTVPKKPTIANFDD----------------VIT 260 (408) T ss_pred hhc----hHHHHHHHHHHHHHHHHHHHHHHHhh--------c----ccccccccccccHHH----------------HHH Confidence 982 45779999999999999999998875 1 112222334433321 111 Q ss_pred HHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCC--Ccc-- Q lcl|Aclame:pro 354 QIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQY--ARS-- 429 (519) Q Consensus 354 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~-- 429 (519) .+ ...+.. .+...-.+||+|.....|...= + +. +...+..+.+.. ..++|.| ++||+-.+ .+. T Consensus 261 ~~---~~~l~~--~~~~~a~~v~n~~~~~~l~~lk--d-~~---G~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~ 327 (408) T protein:vir:74 261 MI---NTSVDP--AIIATSSLLTNQSGLNKLALVK--T-AE---GKYLLEPDPTKP-NSYLIKG-KQVIVVADRWLPNSG 327 (408) T ss_pred HH---HHhhhh--hhcCCCEEEEcHHHHHHHHHhh--c-CC---CceEeccCcCCC-CCceecc-eeeEEecCccccccc Confidence 11 112222 2223356889999999988531 1 00 111122222221 1246777 57775322 221 Q ss_pred --ce-EEE---------EEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeec Q lcl|Aclame:pro 430 --DY-FTI---------GYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQN 496 (519) Q Consensus 430 --dy-~~v---------G~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~ 496 (519) ++ +++ +-++.. .+=+.||.- .+-...+-.+-+..||+..+ +|-+-..-. ...+.. T Consensus 328 ~~~~~i~~gd~~~~~~~~~~~~~----~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~--~~~~~~ 395 (408) T protein:vir:74 328 STVYPLYYGDMSQAITLFDRENM----SLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGS--FTAIAD 395 (408) T ss_pred CCcceEEEEehhccEEEEEecce----EEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEE--eecccC Confidence 11 222 211111 111222211 01123445555556665432 221100000 000000 Q ss_pred C-Cchhhhcccchh Q lcl|Aclame:pro 497 G-MPDIVNSLGLNG 509 (519) Q Consensus 497 ~-~d~~a~~~~~~~ 509 (519) . -+..+..+. ++ T Consensus 396 ~~~~~~~~~~~-~~ 408 (408) T protein:vir:74 396 QVGNFKTTTST-AV 408 (408) T ss_pred CCCCCCCCccc-cC Confidence 0 000011111 11 No 41 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=90.36 E-value=0.021 Score=29.75 Aligned_cols=304 Identities=9% Similarity=-0.000 Sum_probs=121.2 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) ||-. +.+.+.+ +.|..... +..-+++.....+. ++...--....-.+++.+.......+++-+-| T Consensus 1 ~~~~--------~~~~~~~----~~~~~~~~--~~~~~~a~~~~~~~-~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:96 1 MEQT--------QKLKLNL----QHFASNNV--KPQVFNPDNVMMHE-KKDGTLMNEFTTPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCcc--------hhhhHHH----HHHHHHhh--hhhhhccccccccC-cCccccchhHHHHHHHHHHhhchhhhhcceee Confidence 2211 1111111 11111110 11112222222111 12111111122235555666777788888888 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++++-- |.-.... .+| T Consensus 66 ~~~~~~~-------~p~~~~~---~~a----------------------------------------------------- 82 (324) T protein:vir:96 66 MEGTEKK-------FTFWADK---PGA----------------------------------------------------- 82 (324) T ss_pred ccCCceE-------EEEEecC---cce----------------------------------------------------- Confidence 8776421 1110000 000 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) .-++ | +..+++...++++++++.+.-+.-..+|-||.+ T Consensus 83 -------------------------~~v~--------E---------g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ 120 (324) T protein:vir:96 83 -------------------------YWVG--------E---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred -------------------------eEec--------C---------CccccccccceeEEEEeeEEEEEeehhhHHHHh Confidence 0001 1 112233334455555555655666679999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |-. .|.+++|.+.|+..|...|++.+|.=-- ++..+.|+.......... ......+.. T Consensus 121 ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g------------~~~~~~gi~~~~~~~~~~------~~~~~t~~~ 178 (324) T protein:vir:96 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------------NNPFGKSIAQSIEKTNKV------IKGDFTQDN 178 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHHhccCC------------CCCcCcccccccccccee------ccccccHHH Confidence 863 5679999999999999999998886110 111223433321111100 000112333 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc--ceE Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS--DYF 432 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~ 432 (519) |+++.+.+.. .+...+.+|++|.....|...-.-. .+.+-.+... ++|.| ++|++++.... .-+ T Consensus 179 i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~~-------G~~~~~~~~~----~~l~G-~PV~~~~~~~~~~~~~ 244 (324) T protein:vir:96 179 IIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNS----DSLDG-LPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhccC-------CCeeecCCCC----Ccccc-eeeEeeCCCCCCcceE Confidence 4444444433 3335567999999999887542111 0111112222 35666 68888766442 223 Q ss_pred E--------EEEecCCCccceeEeecccccccccccCcc-----cc---cceeeeeeeeceee-cCcccccccCCcceee Q lcl|Aclame:pro 433 T--------IGYKGSNEMDAGIYYAPYVALTPLRGSDPK-----NF---QPVMGFKTRYGIGI-NPFADPAAQAPTKRIQ 495 (519) Q Consensus 433 ~--------vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~-----s~---qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~ 495 (519) + +|..+.-...- ..+..... ..|+. -| +=.+=...||+..+ +|=+ ..++. T Consensus 245 ~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A-------~~~l~ 311 (324) T protein:vir:96 245 ITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLV 311 (324) T ss_pred EEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEe Confidence 3 33332211100 00000000 00110 01 11222234555433 1211 11121 Q ss_pred cCCchhh-hcccch Q lcl|Aclame:pro 496 NGMPDIV-NSLGLN 508 (519) Q Consensus 496 ~~~d~~a-~~~~~~ 508 (519) .. ++.. -..+.- T Consensus 312 ~a-~~~~~~~~~~~ 324 (324) T protein:vir:96 312 PA-DKRTDSVPGEV 324 (324) T ss_pred cc-cccCCCCCCCC Confidence 11 0000 000000 No 42 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=90.36 E-value=0.021 Score=29.75 Aligned_cols=304 Identities=9% Similarity=-0.000 Sum_probs=121.2 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) ||-. +.+.+.+ +.|..... +..-+++.....+. ++...--....-.+++.+.......+++-+-| T Consensus 1 ~~~~--------~~~~~~~----~~~~~~~~--~~~~~~a~~~~~~~-~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:78 1 MEQT--------QKLKLNL----QHFASNNV--KPQVFNPDNVMMHE-KKDGTLMNEFTTPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCcc--------hhhhHHH----HHHHHHhh--hhhhhccccccccC-cCccccchhHHHHHHHHHHhhchhhhhcceee Confidence 2211 1111111 11111110 11112222222111 12111111122235555666777788888888 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++++-- |.-.... .+| T Consensus 66 ~~~~~~~-------~p~~~~~---~~a----------------------------------------------------- 82 (324) T protein:vir:78 66 MEGTEKK-------FTFWADK---PGA----------------------------------------------------- 82 (324) T ss_pred ccCCceE-------EEEEecC---cce----------------------------------------------------- Confidence 8776421 1110000 000 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) .-++ | +..+++...++++++++.+.-+.-..+|-||.+ T Consensus 83 -------------------------~~v~--------E---------g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ 120 (324) T protein:vir:78 83 -------------------------YWVG--------E---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred -------------------------eEec--------C---------CccccccccceeEEEEeeEEEEEeehhhHHHHh Confidence 0001 1 112233334455555555655666679999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |-. .|.+++|.+.|+..|...|++.+|.=-- ++..+.|+.......... ......+.. T Consensus 121 ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g------------~~~~~~gi~~~~~~~~~~------~~~~~t~~~ 178 (324) T protein:vir:78 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------------NNPFGKSIAQSIEKTNKV------IKGDFTQDN 178 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHHhccCC------------CCCcCcccccccccccee------ccccccHHH Confidence 863 5679999999999999999998886110 111223433321111100 000112333 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc--ceE Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS--DYF 432 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~ 432 (519) |+++.+.+.. .+...+.+|++|.....|...-.-. .+.+-.+... ++|.| ++|++++.... .-+ T Consensus 179 i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~~-------G~~~~~~~~~----~~l~G-~PV~~~~~~~~~~~~~ 244 (324) T protein:vir:78 179 IIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNS----DSLDG-LPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhccC-------CCeeecCCCC----Ccccc-eeeEeeCCCCCCcceE Confidence 4444444433 3335567999999999887542111 0111112222 35666 68888766442 223 Q ss_pred E--------EEEecCCCccceeEeecccccccccccCcc-----cc---cceeeeeeeeceee-cCcccccccCCcceee Q lcl|Aclame:pro 433 T--------IGYKGSNEMDAGIYYAPYVALTPLRGSDPK-----NF---QPVMGFKTRYGIGI-NPFADPAAQAPTKRIQ 495 (519) Q Consensus 433 ~--------vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~-----s~---qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~ 495 (519) + +|..+.-...- ..+..... ..|+. -| +=.+=...||+..+ +|=+ ..++. T Consensus 245 ~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A-------~~~l~ 311 (324) T protein:vir:78 245 ITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLV 311 (324) T ss_pred EEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEe Confidence 3 33332211100 00000000 00110 01 11222234555433 1211 11121 Q ss_pred cCCchhh-hcccch Q lcl|Aclame:pro 496 NGMPDIV-NSLGLN 508 (519) Q Consensus 496 ~~~d~~a-~~~~~~ 508 (519) .. ++.. -..+.- T Consensus 312 ~a-~~~~~~~~~~~ 324 (324) T protein:vir:78 312 PA-DKRTDSVPGEV 324 (324) T ss_pred cc-cccCCCCCCCC Confidence 11 0000 000000 No 43 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=90.18 E-value=0.022 Score=29.65 Aligned_cols=268 Identities=12% Similarity=0.072 Sum_probs=111.8 Q ss_pred hcccccccccccc----Ccee-hhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccc Q lcl|Aclame:pro 77 IAAGQTSGAVTQI----GPAV-MGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAM 151 (519) Q Consensus 77 ~~est~tg~v~~~----~P~L-~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~ 151 (519) +.++.+++.-+.+ -+.+ -.+++.+-+..+-.+++.+=||++.+| +..+...... T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g-----~~~~~~~~~~---------------- 59 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTG-----SRVYEKWTDI---------------- 59 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcc-----eEEEEeecCC---------------- Confidence 1111111111111 1111 124444445666677788878777654 1111110000 Q ss_pred cCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhh Q lcl|Aclame:pro 152 FSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) Q Consensus 152 fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~A 231 (519) ++ ...-+++| T Consensus 60 -~~---------------------------------------------------------------~a~~v~Eg------ 69 (293) T protein:vir:48 60 -TG---------------------------------------------------------------LANIDDEA------ 69 (293) T ss_pred -Cc---------------------------------------------------------------ceeeecCC------ Confidence 00 00001111 Q ss_pred hhcccCCCCCccccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHh Q lcl|Aclame:pro 232 ELQEGFNGSTDNPWNEMG-FRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINY 310 (519) Q Consensus 232 Eal~~~ggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~ 310 (519) ..++|.+ .++++++..+|.-+-...+|-||.+|. .+|.|++|.+-|+..|..-+|+.|+.-+.. T Consensus 70 -----------~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~i~~g~~~ 134 (293) T protein:vir:48 70 -----------GKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAILGVVDK 134 (293) T ss_pred -----------cccccccccceeEEEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhHHhhcccc Confidence 1223332 345566666666677778999999986 367899999999999999999888863221 Q ss_pred hhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcc Q lcl|Aclame:pro 311 SAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTS 390 (519) Q Consensus 311 ~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~ 390 (519) . ....+.+++ +....|+.++ ... +......+|+|.....|...- T Consensus 135 ~------------~~~~~~~~~-------------d~i~~~~~~l-------~~~--~~~~a~~vmn~~~~~~L~~lk-- 178 (293) T protein:vir:48 135 L------------PTKPTLTKW-------------DDIIDLEAKV-------DPA--IKQTSFFLTNTSGFTALKKVK-- 178 (293) T ss_pred c------------cccccccCH-------------HHHHHHHHhh-------hhh--hcCCCEEEEcHHHHHHHHHhh-- Confidence 1 111222222 2223333333 322 223357889999988886431 Q ss_pred cccccccccccccccCCCceEEEEecCcEEEEe--cCCCcc--------------ceEEEEEecCCCccceeEeeccccc Q lcl|Aclame:pro 391 VSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYI--DQYARS--------------DYFTIGYKGSNEMDAGIYYAPYVAL 454 (519) Q Consensus 391 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------------dy~~vG~KG~~~~~~~~fyaPYv~~ 454 (519) +.. +.-.+..+.+.. ..++|.| ++|++ |.+.+. +++.++.++.-.. -..++.. T Consensus 179 d~~----g~~l~~~~~~~~-~~~~l~G-~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i----~~~~~~~- 247 (293) T protein:vir:48 179 NAL----GDYLMERDVKSP-TGYSIAG-FAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSL----LSTNIGG- 247 (293) T ss_pred ccC----CceEeecCcCCC-CCceecc-eeeEEecccccCCccCCceEEEEEeccceEEEEEecceEE----EEecccc- Confidence 110 011112221111 1246777 57765 333221 1222222221111 1111100 Q ss_pred ccccccCcccccceeeeeeeeceee-cCcccccccCCccee--ecCCchhhhcccchhhhhhh Q lcl|Aclame:pro 455 TPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRI--QNGMPDIVNSLGLNGYFRRV 514 (519) Q Consensus 455 ~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i--~~~~d~~a~~~~~~~y~r~v 514 (519) .+-.+-|=.+-...||+..+ +|-+ ...+ .......+ .++. +-| T Consensus 248 -----~~~~~~~~~~r~~~r~d~~~~~~~a-------~~~l~~~~~~~~~~-~~~~----~~~ 293 (293) T protein:vir:48 248 -----GAFETDTTKVRVIDRFDVVATDTEA-------FVPASFKAIADQKG-NIGS----TAV 293 (293) T ss_pred -----hhhhcCeEEEEEEEeeCcEEecccc-------eEEEEeeccccCCc-cccc----cCC Confidence 01122334444555555432 2211 0101 00000000 0000 011 No 44 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=89.51 E-value=0.026 Score=29.28 Aligned_cols=271 Identities=13% Similarity=0.056 Sum_probs=117.7 Q ss_pred ccccccccccccccccccccccccccccccccCCCCC--CCccccccccccccccccceecccccchhhhhhcccCCCCC Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGA--TDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGST 241 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~--t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~ 241 (519) .+.... .-.+...+-.-. .. ...........++. .+.. +.. ..|.+.+++.=-.+..+|.+. ... T Consensus 1 m~~~~T-~l~d~i~Pev~~-~~-v~~~~~~~l~~~~~~~~~~~-l~g------~~G~tv~iP~~~~ig~a~~~~---~g~ 67 (274) T protein:vir:96 1 MAQGMT-KLTNQIVPEVLA-PM-MQAELEKKLRFASFAEIDNT-LVG------QPGDTLTFPAFIYSGDAKVVA---EGE 67 (274) T ss_pred CCccee-ehhheechHHHH-HH-HHHHHHhhhhccccceeccc-ccC------CCCCEEEeeeecCCCcccccc---CCC Confidence 111110 111111110000 00 00000000000000 0000 000 011222221100111222221 111 Q ss_pred ccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC-CCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 242 DNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHG-MDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 242 ~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHG-LDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) .-...+++.+ +.+++-+-|. |+ |.+ -|+-+..+ -|.-.|..+-++..+..+++.+++..+...... T Consensus 68 ~i~~~~lt~~--~~~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~------ 134 (274) T protein:vir:96 68 KIPTDILETK--KREAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT------ 134 (274) T ss_pred ccchhhcccc--eeEEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------ Confidence 2233344333 3333334443 22 222 26655553 588999999999999999999999866422110 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) .....++ .+.+-....++.++. ..++++||+|.+++.|.......|..+. T Consensus 135 ----~~~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s---- 184 (274) T protein:vir:96 135 ----VEADITK-------------LTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNFTRAT---- 184 (274) T ss_pred ----ccccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccccccc---- Confidence 0011122 233333334443322 2568999999999999986644443211 Q ss_pred cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec Q lcl|Aclame:pro 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) Q Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n 480 (519) ........+-.+|.+.| ++||+|...+..-..+--+|. -.||.. -+...-...||++++=.+-..-+||+.+ T Consensus 185 ~~g~~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~~gA-----~~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~- 256 (274) T protein:vir:96 185 ELGDDVIVKGAFGEALG-AVIVRSNKLEAGTAILAKKGA-----VKLITK-RDFFLETDRDPSTKTTALYSDKHYVAYL- 256 (274) T ss_pred cccccceeccccceecC-eEEEEeCCCCCceEEEEeccc-----eeeeec-CCcccccccccccccCEEEEeEEEEEEE- Confidence 00111222334678876 899999988743222111221 122221 1222222359999999999999999765 Q ss_pred CcccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) . ..++-.++.-+ +++..| T Consensus 257 --~---~~~~~v~~tk~----~~~~~~ 274 (274) T protein:vir:96 257 --Y---DESKAVKITKG----SGSLEM 274 (274) T ss_pred --E---cCCcEEEEEcC----CccccC Confidence 1 22223344433 123333 No 45 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=89.51 E-value=0.026 Score=29.28 Aligned_cols=271 Identities=13% Similarity=0.056 Sum_probs=117.7 Q ss_pred ccccccccccccccccccccccccccccccccCCCCC--CCccccccccccccccccceecccccchhhhhhcccCCCCC Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGA--TDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGST 241 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~--t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~ 241 (519) .+.... .-.+...+-.-. .. ...........++. .+.. +.. ..|.+.+++.=-.+..+|.+. ... T Consensus 1 m~~~~T-~l~d~i~Pev~~-~~-v~~~~~~~l~~~~~~~~~~~-l~g------~~G~tv~iP~~~~ig~a~~~~---~g~ 67 (274) T protein:vir:95 1 MAQGMT-KLTNQIVPEVLA-PM-MQAELEKKLRFASFAEIDNT-LVG------QPGDTLTFPAFIYSGDAKVVA---EGE 67 (274) T ss_pred CCccee-ehhheechHHHH-HH-HHHHHHhhhhccccceeccc-ccC------CCCCEEEeeeecCCCcccccc---CCC Confidence 111110 111111110000 00 00000000000000 0000 000 011222221100111222221 111 Q ss_pred ccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC-CCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 242 DNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHG-MDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 242 ~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHG-LDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) .-...+++.+ +.+++-+-|. |+ |.+ -|+-+..+ -|.-.|..+-++..+..+++.+++..+...... T Consensus 68 ~i~~~~lt~~--~~~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~------ 134 (274) T protein:vir:95 68 KIPTDILETK--KREAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT------ 134 (274) T ss_pred ccchhhcccc--eeEEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------ Confidence 2233344333 3333334443 22 222 26655553 588999999999999999999999866422110 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~ 400 (519) .....++ .+.+-....++.++. ..++++||+|.+++.|.......|..+. T Consensus 135 ----~~~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s---- 184 (274) T protein:vir:95 135 ----VEADITK-------------LTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNFTRAT---- 184 (274) T ss_pred ----ccccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccccccc---- Confidence 0011122 233333334443322 2568999999999999986644443211 Q ss_pred cccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec Q lcl|Aclame:pro 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) Q Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n 480 (519) ........+-.+|.+.| ++||+|...+..-..+--+|. -.||.. -+...-...||++++=.+-..-+||+.+ T Consensus 185 ~~g~~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~~gA-----~~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~y~~~~- 256 (274) T protein:vir:95 185 ELGDDVIVKGAFGEALG-AVIVRSNKLEAGTAILAKKGA-----VKLITK-RDFFLETDRDPSTKTTALYSDKHYVAYL- 256 (274) T ss_pred cccccceeccccceecC-eEEEEeCCCCCceEEEEeccc-----eeeeec-CCcccccccccccccCEEEEeEEEEEEE- Confidence 00111222334678876 899999988743222111221 122221 1222222359999999999999999765 Q ss_pred CcccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) . ..++-.++.-+ +++..| T Consensus 257 --~---~~~~~v~~tk~----~~~~~~ 274 (274) T protein:vir:95 257 --Y---DESKAVKITKG----SGSLEM 274 (274) T ss_pred --E---cCCcEEEEEcC----CccccC Confidence 1 22223344433 123333 No 46 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=88.15 E-value=0.034 Score=28.63 Aligned_cols=352 Identities=16% Similarity=0.100 Sum_probs=134.4 Q ss_pred CChHHHHHhhhhhhCCCccccccccch--hhhhhhhhhhHHHHH--------hhhhhccch--------------hhhhh Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASK--QAIIAKIFENQEQDI--------LTAPEYRDE--------------KISEA 56 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~--~~~~~~~~enq~~~~--------~~~~~~~~~--------------~~~~~ 56 (519) ++.+.+.+-- .+.+ .+-++....+ ++-...+-|.++... .+.+..+.. ...+. T Consensus 29 ~~~~~~e~~~-~~~~--ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (415) T protein:vir:47 29 LNNDELEKAE-KLEQ--EITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQE 105 (415) T ss_pred hchhhHHHHH-HHHH--HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHH Confidence 3332221110 0000 0000000000 000011111000000 000000000 00000 Q ss_pred hhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCC Q lcl|Aclame:pro 57 FGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAA 136 (519) Q Consensus 57 ~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~ 136 (519) ...+..... .+.+......++..|...--....-.+++.+.+...-.+++.+.||+++++-+.-.+.. .. T Consensus 106 ~~~~~~~~~----~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~- 175 (415) T protein:vir:47 106 VRDFTEYLE----TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS-----EV- 175 (415) T ss_pred HHHHHHHHh----hhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec-----CC- Confidence 001110000 00011111111112221111111123566666778889999999999987643222110 00 Q ss_pred CcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccccccccccc Q lcl|Aclame:pro 137 GAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEA 216 (519) Q Consensus 137 ~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~ 216 (519) .++ . T Consensus 176 --~~~---------~----------------------------------------------------------------- 179 (415) T protein:vir:47 176 --AAL---------E----------------------------------------------------------------- 179 (415) T ss_pred --cce---------e----------------------------------------------------------------- Confidence 000 0 Q ss_pred ccceecccccchhhhhhcccCCCCCccccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 217 GQLAEIAEGMATSIAELQEGFNGSTDNPWNEMG-FRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATE 295 (519) Q Consensus 217 g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTE 295 (519) .+++| ...++.+ -++++++..++..+-...+|-||.+|-. .|.+++|.+-|+.. T Consensus 180 ----~v~Eg-----------------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~ 234 (415) T protein:vir:47 180 ----KVEEL-----------------EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMART 234 (415) T ss_pred ----ecccc-----------------cccccccccceeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHH Confidence 00000 1122222 2344566666666666689999999843 57799999999999 Q ss_pred HHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEE Q lcl|Aclame:pro 296 IMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFII 375 (519) Q Consensus 296 ImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v 375 (519) |..-+|+.|+.-.-...-.+.. ..+.. ....+..... . ..+-...|+..+.. .+.+++.+| T Consensus 235 i~~~~d~~il~g~g~g~~~~~~--~~~~~-~~~~~~~~~~------~-~~~~i~~~~~~~~~---------~~~~~~~~v 295 (415) T protein:vir:47 235 IAATRNKAIIDVITKGSTGSTS--SGFEK-EGKKLEVKKA------K-SLDDIKDAINLNVK---------PNYEHNVAI 295 (415) T ss_pred HHHHHHHHHhhccccCCccccc--ccccc-ccceeccccc------c-chHHHHHHHHhhhh---------hccCCCEEE Confidence 9999999998732221111110 00000 0011110000 0 11223344333332 223567899 Q ss_pred EchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeeccc--- Q lcl|Aclame:pro 376 ASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYV--- 452 (519) Q Consensus 376 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv--- 452 (519) ++|.....|...- + +. +...+..+.+.. ..++|.| ++|++.++.+. |-.| +..++|+.|- T Consensus 296 ~n~~~~~~L~~lk--d-~~---G~~i~~~~~~~~-~~~~l~G-~pV~~~~~~~~-----~~~~----~~~~~~gd~~~~~ 358 (415) T protein:vir:47 296 VSQTMFAKLDKMK--D-KL---GNYLIQPDVKEK-TQQRLLG-AKIEILPDEVL-----GQKG----NNTLIIGNLKDAI 358 (415) T ss_pred EcHHHHHHHHHhh--c-cC---CCeeeccCcCCC-CCccccc-eeeEEeccccc-----cCCC----ccEEEEEehhccE Confidence 9999998887531 1 00 011111221111 1256777 58887766552 1111 1112222211 Q ss_pred -----ccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecC-Cchhhhcccchh Q lcl|Aclame:pro 453 -----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-MPDIVNSLGLNG 509 (519) Q Consensus 453 -----~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~d~~a~~~~~~~ 509 (519) ....+...|-.+++-.+-...|++..+ +|-+ ...+.-. --.-.|..+.-. T Consensus 359 ~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a-------~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 359 VLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKS-------AIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEeecceEEEeeccccCceEEEEEEEeccEEecccc-------EEEEEeeccCCCCCCccCCC Confidence 111122235566677777888988654 3321 1111100 000011111111 No 47 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=88.15 E-value=0.034 Score=28.63 Aligned_cols=352 Identities=16% Similarity=0.100 Sum_probs=134.4 Q ss_pred CChHHHHHhhhhhhCCCccccccccch--hhhhhhhhhhHHHHH--------hhhhhccch--------------hhhhh Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASK--QAIIAKIFENQEQDI--------LTAPEYRDE--------------KISEA 56 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~--~~~~~~~~enq~~~~--------~~~~~~~~~--------------~~~~~ 56 (519) ++.+.+.+-- .+.+ .+-++....+ ++-...+-|.++... .+.+..+.. ...+. T Consensus 29 ~~~~~~e~~~-~~~~--ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (415) T protein:vir:46 29 LNNDELEKAE-KLEQ--EITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQE 105 (415) T ss_pred hchhhHHHHH-HHHH--HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHH Confidence 3332221110 0000 0000000000 000011111000000 000000000 00000 Q ss_pred hhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCC Q lcl|Aclame:pro 57 FGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAA 136 (519) Q Consensus 57 ~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~ 136 (519) ...+..... .+.+......++..|...--....-.+++.+.+...-.+++.+.||+++++-+.-.+.. .. T Consensus 106 ~~~~~~~~~----~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~- 175 (415) T protein:vir:46 106 VRDFTEYLE----TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS-----EV- 175 (415) T ss_pred HHHHHHHHh----hhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec-----CC- Confidence 001110000 00011111111112221111111123566666778889999999999987643222110 00 Q ss_pred CcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccccccccccc Q lcl|Aclame:pro 137 GAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEA 216 (519) Q Consensus 137 ~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~ 216 (519) .++ . T Consensus 176 --~~~---------~----------------------------------------------------------------- 179 (415) T protein:vir:46 176 --AAL---------E----------------------------------------------------------------- 179 (415) T ss_pred --cce---------e----------------------------------------------------------------- Confidence 000 0 Q ss_pred ccceecccccchhhhhhcccCCCCCccccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 217 GQLAEIAEGMATSIAELQEGFNGSTDNPWNEMG-FRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATE 295 (519) Q Consensus 217 g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTE 295 (519) .+++| ...++.+ -++++++..++..+-...+|-||.+|-. .|.+++|.+-|+.. T Consensus 180 ----~v~Eg-----------------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~ 234 (415) T protein:vir:46 180 ----KVEEL-----------------EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMART 234 (415) T ss_pred ----ecccc-----------------cccccccccceeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHH Confidence 00000 1122222 2344566666666666689999999843 57799999999999 Q ss_pred HHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEE Q lcl|Aclame:pro 296 IMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFII 375 (519) Q Consensus 296 ImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v 375 (519) |..-+|+.|+.-.-...-.+.. ..+.. ....+..... . ..+-...|+..+.. .+.+++.+| T Consensus 235 i~~~~d~~il~g~g~g~~~~~~--~~~~~-~~~~~~~~~~------~-~~~~i~~~~~~~~~---------~~~~~~~~v 295 (415) T protein:vir:46 235 IAATRNKAIIDVITKGSTGSTS--SGFEK-EGKKLEVKKA------K-SLDDIKDAINLNVK---------PNYEHNVAI 295 (415) T ss_pred HHHHHHHHHhhccccCCccccc--ccccc-ccceeccccc------c-chHHHHHHHHhhhh---------hccCCCEEE Confidence 9999999998732221111110 00000 0011110000 0 11223344333332 223567899 Q ss_pred EchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeeccc--- Q lcl|Aclame:pro 376 ASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYV--- 452 (519) Q Consensus 376 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv--- 452 (519) ++|.....|...- + +. +...+..+.+.. ..++|.| ++|++.++.+. |-.| +..++|+.|- T Consensus 296 ~n~~~~~~L~~lk--d-~~---G~~i~~~~~~~~-~~~~l~G-~pV~~~~~~~~-----~~~~----~~~~~~gd~~~~~ 358 (415) T protein:vir:46 296 VSQTMFAKLDKMK--D-KL---GNYLIQPDVKEK-TQQRLLG-AKIEILPDEVL-----GQKG----NNTLIIGNLKDAI 358 (415) T ss_pred EcHHHHHHHHHhh--c-cC---CCeeeccCcCCC-CCccccc-eeeEEeccccc-----cCCC----ccEEEEEehhccE Confidence 9999998887531 1 00 011111221111 1256777 58887766552 1111 1112222211 Q ss_pred -----ccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecC-Cchhhhcccchh Q lcl|Aclame:pro 453 -----ALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-MPDIVNSLGLNG 509 (519) Q Consensus 453 -----~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~d~~a~~~~~~~ 509 (519) ....+...|-.+++-.+-...|++..+ +|-+ ...+.-. --.-.|..+.-. T Consensus 359 ~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a-------~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 359 VLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKS-------AIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEeecceEEEeeccccCceEEEEEEEeccEEecccc-------EEEEEeeccCCCCCCccCCC Confidence 111122235566677777888988654 3321 1111100 000011111111 No 48 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=87.99 E-value=0.035 Score=28.56 Aligned_cols=331 Identities=14% Similarity=0.160 Sum_probs=124.5 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHH--------------h-------hhhhccchhhhhhhh- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDI--------------L-------TAPEYRDEKISEAFG- 58 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~--------------~-------~~~~~~~~~~~~~~~- 58 (519) +..+.+..+|.- ..+..+..|+.++..+-..+.+.. . ..++.+...+..... T Consensus 242 ~~~~~~~~~ai~------~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a 315 (632) T protein:vir:96 242 FSQRSLAQEAIQ------KGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINA 315 (632) T ss_pred hhhhhhHHHHHh------ccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHh Confidence 222222222210 001112233332222210000000 0 000000000000000 Q ss_pred ---------hhhhhh--hhccccccchh------------hhcc-ccccccccccCceeh-hhHHHHHhhhhhhhceeec Q lcl|Aclame:pro 59 ---------SFLTEA--EIGGDHGYDAT------------NIAA-GQTSGAVTQIGPAVM-GMVRRAIPHLIAFDICGVQ 113 (519) Q Consensus 59 ---------~~~~~~--~~~~~~g~~~~------------~~~e-st~tg~v~~~~P~L~-~l~Rra~p~LIa~DI~GVQ 113 (519) .+.-|. .+....|.+.. .+.. +.++|...--...+- .++.+..|..|...+ |++ T Consensus 316 ~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~~ 394 (632) T protein:vir:96 316 AATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GAR 394 (632) T ss_pred hhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh-cce Confidence 000000 00001111100 0000 000111000000110 122222234444443 444 Q ss_pred cCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 114 PLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEA 193 (519) Q Consensus 114 PmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~ 193 (519) .+++.+|- .+++.+.+. T Consensus 395 ~~~~~~g~-----~~ip~~~~~---------------------------------------------------------- 411 (632) T protein:vir:96 395 MLPGLVGD-----VDIPKKTSG---------------------------------------------------------- 411 (632) T ss_pred EeecCCcc-----eEEEEEeCC---------------------------------------------------------- Confidence 44433331 111110000 Q ss_pred ccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 194 VTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELA 273 (519) Q Consensus 194 ~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELA 273 (519) ...+-++ | +...++-..+++++++.+|+=+-...+|-||. T Consensus 412 -----------------------~~a~wv~--------E---------~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell 451 (632) T protein:vir:96 412 -----------------------ANFYWIG--------E---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLR 451 (632) T ss_pred -----------------------ceeEeec--------C---------CccccccccceeeEEeeeeEEEEehhhHHHHH Confidence 0000011 1 12234444566777888887777788899987 Q ss_pred HHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccc----cccchHHHHHH Q lcl|Aclame:pro 274 QDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI----RGARWAGESFK 349 (519) Q Consensus 274 QDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~----~~~~~a~e~~r 349 (519) .| -.+|.|++|.+-|...|...+++.+|. | .| +...+.|++.......+ .+..| .... T Consensus 452 ~d----s~~~~~~~i~~~l~~a~~~~~d~a~l~--------G-~G---~~~~p~Gi~~~~~~~~~~~~~~~~~~--~~i~ 513 (632) T protein:vir:96 452 KQ----SSIHVENLIREDLIEGIGVALDLAMLT--------G-TG---LANDPVGLLNMTGVPALTYPAGGVDW--ASVV 513 (632) T ss_pred hc----cchHHHHHHHHHHHHHHHHHHHHHhhc--------c-cC---CCCccceeeecccccceecccccCCH--HHHH Confidence 76 267899999999999999999999885 1 01 11234566543322111 11111 2233 Q ss_pred HHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc Q lcl|Aclame:pro 350 ALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS 429 (519) Q Consensus 350 ~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 429 (519) .|+. .|...-........||+|.....|...-..+.. +...+. + |+|.| |+|++.++.+. T Consensus 514 ~~~~-------~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~----G~~i~~----~----~~l~G-~pv~~s~~ip~ 573 (632) T protein:vir:96 514 DMET-------KISTFNADAGRLAYLTSVTQRGAAKKAQVFDNT----GERIWQ----N----NEVNG-YRAEASNQIPA 573 (632) T ss_pred HHHH-------HHhhcccccCccEEEEchhHHHHHHHHhccCCC----Cceeec----C----Ceecc-cceEecccccc Confidence 3333 332222112234578999887777643221111 011111 1 46776 79999999886 Q ss_pred ceEEEE--------EecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcc Q lcl|Aclame:pro 430 DYFTIG--------YKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTK 492 (519) Q Consensus 430 dy~~vG--------~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~ 492 (519) +=+++| -.|.-.+ -..||. +..+-+=.+=...|+++.+ +|=.-..-.. .+ T Consensus 574 ~~~~~gd~s~~~i~~~~~~~i----~~~~~~--------~~~~~~v~~~~~~~~d~~v~~~~af~~~k~-~A 632 (632) T protein:vir:96 574 DTWIFGDWSQIVIAMWGVLDL----KVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKK-GA 632 (632) T ss_pred CcEEEeecceEEEEEecceEE----EEcccc--------ccccCceEEEEEeecCceeechhhhhheee-cC Confidence 544433 2222111 112321 2233333444566666533 3322111111 11 No 49 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=87.20 E-value=0.04 Score=28.23 Aligned_cols=304 Identities=10% Similarity=0.042 Sum_probs=119.9 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCcee-hhhHHHHHhhhhhhhceeec Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAV-MGMVRRAIPHLIAFDICGVQ 113 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L-~~l~Rra~p~LIa~DI~GVQ 113 (519) +|..+|--.+.+. |..-.. .++. +++.....++..+.+ .-|.+ -.+++.+..+.+..+++.+- T Consensus 1 ~~~~~~~~~~~~~------------f~~~~~-~~~~-~~a~~~~~~~~~~~l--ip~~~~~~ii~~~~~~s~l~~l~~~~ 64 (324) T protein:vir:96 1 MEQTQKLKLNLQH------------FASNNV-KPQV-FNPDNVMMHEKKDGT--LLNDFTTPILQEVMENSKIMQLGKYE 64 (324) T ss_pred CCcchhhhHHHHH------------HHHhhh-hhhh-cccccccccCCCcce--echhHHHHHHHHHHhhchhhhhccee Confidence 3322221111111 111000 0000 111111111111211 11222 22445556677788899999 Q ss_pred cCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 114 PLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEA 193 (519) Q Consensus 114 PmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~ 193 (519) ||++++.-|. ++... .++ .| T Consensus 65 ~~~~~~~~~p----~~~~~------~~a---------~~----------------------------------------- 84 (324) T protein:vir:96 65 PMEGTEKKFT----FWADK------PGA---------YW----------------------------------------- 84 (324) T ss_pred eccCCceEEE----EEecC------cce---------ee----------------------------------------- Confidence 9987653221 01000 000 00 Q ss_pred ccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 194 VTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELA 273 (519) Q Consensus 194 ~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELA 273 (519) +++| +.. ......|.+..+.+.|..+ ....|-||. T Consensus 85 ----------------------------v~Eg------~~~----~~~~~~f~~v~~~~~k~~~-------~~~is~ell 119 (324) T protein:vir:96 85 ----------------------------VGEG------QKI----ETSKATWVNATMRAFKLGV-------ILPVTKEFL 119 (324) T ss_pred ----------------------------ecCC------ccc----cccccceeEEEEEeEEEEE-------eehhhHHHH Confidence 0111 000 0012335555555555544 445899999 Q ss_pred HHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHH Q lcl|Aclame:pro 274 QDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLF 353 (519) Q Consensus 274 QDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~ 353 (519) +|-. .|.+++|.+.|...|...+++.||.--. +...+.|++........ +. .....+. T Consensus 120 ~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g------------~~~~~~~~~~~~~~~~~----~~--~~~~~~~ 177 (324) T protein:vir:96 120 NYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------------NNPFGKSIAQSIKKTNK----VI--KGDFTQD 177 (324) T ss_pred hcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC------------CCCcCccccccccccce----ec--ccccchH Confidence 9853 5678999999999999999998885100 11112233322111000 00 0011122 Q ss_pred HHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc--c- Q lcl|Aclame:pro 354 QIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS--D- 430 (519) Q Consensus 354 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--d- 430 (519) .|.++-+.|.. .+...+.+||+|.....|...---. .+.+-.+... ++|.| ++|++++.... . T Consensus 178 ~i~~~~~~i~~--~~~~~~~~i~n~~~~~~L~~lkd~~-------G~~~~~~~~~----~~l~G-~PV~~~~~~~~~~~~ 243 (324) T protein:vir:96 178 NIIDLEALLED--DELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNS----DSLDG-LPVVNLKSSNLKRGE 243 (324) T ss_pred HHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhCCC-------CCeeecCCCC----Ccccc-eeeEeecCCCCCcce Confidence 33444444433 2345578999999999887542100 0111112222 35666 68888665442 1 Q ss_pred -------eEEEEEecCCCccceeEeecccccccccccCccc-----c---cceeeeeeeecee-ecCcccccccCCccee Q lcl|Aclame:pro 431 -------YFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKN-----F---QPVMGFKTRYGIG-INPFADPAAQAPTKRI 494 (519) Q Consensus 431 -------y~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s-----~---qP~~g~~tRY~l~-~nP~~~~~~~~~~~~i 494 (519) ++++|..+.-..+. ..+ .......|+.. | +=.+=..-||++. .+|=+ ..++ T Consensus 244 ~~~gd~s~~~~~~~~~~~i~~----~~~--~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a-------~~~l 310 (324) T protein:vir:96 244 LITGDFDKLIYGIPQLIEYKI----DET--AQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKL 310 (324) T ss_pred EEEEecceEEEEEecCcEEEE----eec--ccccccccccccchhhhhcCcEEEEEEEEeccEEecccc-------eEEE Confidence 23333333221100 000 00000011110 1 1223344566653 23311 1222 Q ss_pred ecCCchhhhcccch Q lcl|Aclame:pro 495 QNGMPDIVNSLGLN 508 (519) Q Consensus 495 ~~~~d~~a~~~~~~ 508 (519) ......-.-..++- T Consensus 311 ~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 311 VPADKRTDSVPGEV 324 (324) T ss_pred ecccccCCCCCCCC Confidence 21111101111211 No 50 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=87.19 E-value=0.04 Score=28.23 Aligned_cols=333 Identities=13% Similarity=0.161 Sum_probs=128.7 Q ss_pred CChH---------------HHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHH---Hhhhhhccchhhhhhhhhhhh Q lcl|Aclame:pro 1 MKKN---------------ALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQD---ILTAPEYRDEKISEAFGSFLT 62 (519) Q Consensus 1 ~~~~---------------~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~---~~~~~~~~~~~~~~~~~~~~~ 62 (519) ++.+ +++++=.-+-. +|.. -+++ +....+.+.+. ..+....+.+....++..++. T Consensus 50 ~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~-----ei~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~ 122 (425) T protein:vir:10 50 FKAEHTKQLDAVKAGLPTSDALAKVDKVSA-----DLEA-LQAA-VDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVK 122 (425) T ss_pred HHHHHHHHHHHHHhhhccHHHHHHHHHHHH-----HHHH-HHHH-HHHHHHHHHhhhcccccccccccHHHHHHHHHHhh Confidence 2111 11111111000 1110 0011 11111100000 011122222223334444442 Q ss_pred hhhhccccccchhhhcccccc-ccccccCceeh-hhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccc Q lcl|Aclame:pro 63 EAEIGGDHGYDATNIAAGQTS-GAVTQIGPAVM-GMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKE 140 (519) Q Consensus 63 ~~~~~~~~g~~~~~~~est~t-g~v~~~~P~L~-~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~e 140 (519) ... ....+..++++ |.+. .-+.+. .+++++-+..+..++|.|-||+++..-+. + .... T Consensus 123 ~~e-------~~~al~~~t~~~gG~l-vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~-----~--~~~~----- 182 (425) T protein:vir:10 123 RGD-------VQAALNKGEDSEGGYL-TPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKL-----F--NMGG----- 182 (425) T ss_pred hhh-------hHHHhhcCcCCCCcee-ccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEE-----E--EcCC----- Confidence 111 11122222221 1111 112221 24454555667788999999987654221 1 1000 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccce Q lcl|Aclame:pro 141 AFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLA 220 (519) Q Consensus 141 A~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~ 220 (519) +.+.| T Consensus 183 -------~~a~w-------------------------------------------------------------------- 187 (425) T protein:vir:10 183 -------TTSGW-------------------------------------------------------------------- 187 (425) T ss_pred -------cceee-------------------------------------------------------------------- Confidence 00000 Q ss_pred ecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 221 EIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEI 300 (519) Q Consensus 221 ~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEI 300 (519) +++| +.. ..+....|.++.|++.|..+ ...+|-||.+|- .+|.+++|.+-|+..|..-+ T Consensus 188 -v~E~------~~~---~~~~~~~f~~v~~~~~k~~~-------~i~iS~ell~ds----~~~l~~~i~~~la~ai~~~~ 246 (425) T protein:vir:10 188 -VGEA------SQR---PQTNAATFQPLSFASGEIYA-------NPAATQQILDDA----EIDLESWLATEVQTEFAKQE 246 (425) T ss_pred -eccc------ccc---ccccccccceeeeeheeeEe-------ehHhHHHHHhcc----hhHHHHHHHHHHHHHHHHHH Confidence 0000 000 00111236666666666654 556999999985 35679999999999999999 Q ss_pred hHHHHHHHHhhhhhhhhcccccccccceeeecccccc---------------ccccchHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 301 NREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPID---------------IRGARWAGESFKALLFQIDKEAAEIARQ 365 (519) Q Consensus 301 NReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d---------------~~~~~~a~e~~r~L~~~i~~~a~~I~~~ 365 (519) |+.||. | .| .+.+.|++....... .....-..+....|+..+. . T Consensus 247 d~~~l~--------G-~G----~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~-------~- 305 (425) T protein:vir:10 247 GKAFLA--------G-DG----TNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLP-------S- 305 (425) T ss_pred Hhhhhc--------c-cC----CCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhh-------h- Confidence 998885 1 00 112334433211000 0000001122233333222 1 Q ss_pred ccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc-----ceEEEEEecCC Q lcl|Aclame:pro 366 TGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS-----DYFTIGYKGSN 440 (519) Q Consensus 366 T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~ 440 (519) .+-+....|++|.....|...= +.. +...+..+.+.. ..++|.| ++|+++.+.+. +.+++| +- T Consensus 306 -~~~~~a~~vmn~~~~~~L~~lk--D~~----G~~l~~~~~~~g-~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G---d~ 373 (425) T protein:vir:10 306 -AFTGNARFAMNRNTQRQVRKLK--DGQ----GNYLWQPSYVAG-QPATLAG-YPVTEVPDMPDVAANSTPILFG---DF 373 (425) T ss_pred -hhccCCEEEEchHHHHHHHHhh--cCC----CceeeccCccCC-CCceecc-eeeEEecCcCCccCCccEEEEE---eh Confidence 2223356789999988887421 110 011122222211 1257877 69999888763 334443 11 Q ss_pred CccceeEeecccccccccccCcccccc--eeeeeeeeceee-cCcccccccCCcceeecCCchhhhccc Q lcl|Aclame:pro 441 EMDAGIYYAPYVALTPLRGSDPKNFQP--VMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLG 506 (519) Q Consensus 441 ~~~~~~fyaPYv~~~~~~~~dp~s~qP--~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~ 506 (519) . ...+... ...+....||-.-.- .+-...||+..+ +|-+- ..+. ++ .+. T Consensus 374 ~--~~~~i~~--~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~-------~~l~-----~~-as~ 425 (425) T protein:vir:10 374 Q--QTYLIID--RIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPM-------RAMK-----VA-ASE 425 (425) T ss_pred h--ccEEEEE--ecceEEEecccccCCcEEEEEEEEeccEeecccce-------EEEE-----ee-ccC Confidence 1 0011111 111111123332222 233445776543 33321 1111 11 011 No 51 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=86.87 E-value=0.043 Score=28.10 Aligned_cols=307 Identities=9% Similarity=0.002 Sum_probs=119.0 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) ||-- .+.+..+ +.|.....-. .-+++.... .++++...--..+.-.+++.+....+..+++-+.| T Consensus 1 ~~~~--------~~~~~~~----~~f~~~~~~~--~~~~a~~~~-~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~ 65 (324) T protein:vir:97 1 MEQT--------QKLKLNL----QHFASNNVKP--QVFNPDNVM-MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred Cccc--------hhHHHHH----HHHHHhhhhh--hhhcccccc-ccCCCcceechhHHHHHHHHHHhhcchhhhcceee Confidence 2211 1111111 0110000000 011122211 11122221111122234555666778888898999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++.+--| ....+. ..| . T Consensus 66 ~~~~~~~i-------p~~~~~---~~a---------~------------------------------------------- 83 (324) T protein:vir:97 66 MEGTEKKF-------TFWADK---PGA---------Y------------------------------------------- 83 (324) T ss_pred ccCCceEE-------EEEecC---cce---------e------------------------------------------- Confidence 88765211 110000 000 0 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) -+++| ..+++...++++++.+.|.-+.-..+|-||.+ T Consensus 84 --------------------------~v~Eg-----------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ 120 (324) T protein:vir:97 84 --------------------------WVGEG-----------------QKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred --------------------------EeccC-----------------ccccccccceeEEEEeeEEEEEeehhhHHHHh Confidence 00110 11223333344444445544455569999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |-. .|.+++|.+-|+..|...+++.||.--- ....+.|++........ . ....-.+.. T Consensus 121 ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g------------~~~~~~gi~~~~~~~~~-----~-~~~~~~~~~ 178 (324) T protein:vir:97 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------------NNPFGKSIAQSIEKTNK-----V-IKGDFTQDN 178 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHhhccCC------------CCccCccccccccccce-----e-ccccCCHHH Confidence 863 6679999999999999999999986111 11112233321111000 0 000111233 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc--eE Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD--YF 432 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--y~ 432 (519) |+++.+.|.. .+.....+||+|.....|...- + +- .+....+... ++|.| ++|++.+..... .+ T Consensus 179 i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lk--d-~~----g~~~~~~~~~----~tl~G-~PV~~~~~~~~~~~~~ 244 (324) T protein:vir:97 179 IIDLEALLED--DELEANAFISKTQNRSLLRKIV--D-PE----TKERIYDRNS----DTLDG-LPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh--c-CC----CceeecCCCC----ccccc-eeeEeecCCCCCcceE Confidence 4455555543 2234467899999998887532 1 10 0111112222 45777 588876654421 23 Q ss_pred EE--------EEecCCCccceeEeecccccccccccCcc---ccc---ceeeeeeeecee-ecCcccccccCCcceeecC Q lcl|Aclame:pro 433 TI--------GYKGSNEMDAGIYYAPYVALTPLRGSDPK---NFQ---PVMGFKTRYGIG-INPFADPAAQAPTKRIQNG 497 (519) Q Consensus 433 ~v--------G~KG~~~~~~~~fyaPYv~~~~~~~~dp~---s~q---P~~g~~tRY~l~-~nP~~~~~~~~~~~~i~~~ 497 (519) ++ |..+.-..+- ..+.-+.....-|.. -|| =.+=+..||+.. .||=+ .++|..- T Consensus 245 ~~gd~~~~~i~~~~~~~i~~----~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a-------~~~l~~~ 313 (324) T protein:vir:97 245 ITGDFDKLIYGIPQLIEYKI----DETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLVPA 313 (324) T ss_pred EEEecccEEEEEecCcEEEE----eecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc-------eEEEEec Confidence 33 3332211100 000000000000000 011 122233566642 22221 1111111 Q ss_pred Cchhhhcccch Q lcl|Aclame:pro 498 MPDIVNSLGLN 508 (519) Q Consensus 498 ~d~~a~~~~~~ 508 (519) ...-.....+- T Consensus 314 ~~~~~~~~~~~ 324 (324) T protein:vir:97 314 DKKTDSVPGEV 324 (324) T ss_pred cCCCCCCCCCC Confidence 10000011100 No 52 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=86.12 E-value=0.048 Score=27.82 Aligned_cols=355 Identities=15% Similarity=0.140 Sum_probs=119.1 Q ss_pred CC--hHHHHHhhhhhhCCCc----cccccccchhh---hhhhhhhhHHH-----HHhhhhhccchhhhhhhhhhhhhhhh Q lcl|Aclame:pro 1 MK--KNALVQKWSALLENEA----LPEIVGASKQA---IIAKIFENQEQ-----DILTAPEYRDEKISEAFGSFLTEAEI 66 (519) Q Consensus 1 ~~--~~~l~~kw~p~l~~~~----~~~~~~~~~~~---~~~~~~enq~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (519) |+ -+.|.+...-+-+-+. +..-.+..+.. -......+|++ ...-.+..+ .+....+........ T Consensus 41 l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 118 (435) T protein:vir:80 41 LSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVR--ALAAARGDAQLASKL 118 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHH--HHHhccchhHHHHHH Confidence 11 1223333322211000 00000000000 00000011110 000000000 000000000000000 Q ss_pred ccccccchhhhccccccccccccCceehh------hHHHHHhhhhhhhc-eeeccCCccchhheeeeeeecCCCCCCCcc Q lcl|Aclame:pro 67 GGDHGYDATNIAAGQTSGAVTQIGPAVMG------MVRRAIPHLIAFDI-CGVQPLNNPTGQVFALRAVYGKDPIAAGAK 139 (519) Q Consensus 67 ~~~~g~~~~~~~est~tg~v~~~~P~L~~------l~Rra~p~LIa~DI-~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~ 139 (519) .-..++. ...+...+++. ......|+| +++++-+..+...+ +=+-||+.+. -+|+-. T Consensus 119 ~~~~~~~-~~~~~~~~~~~-~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-------~~~p~~------- 182 (435) T protein:vir:80 119 AIERGFG-EEVAMSLNTLS-PGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-------ITIPRL------- 182 (435) T ss_pred HHhhhhh-hhhhhhhcccC-CCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCc-------eEEEEE------- Confidence 0000000 00000011111 112223333 22222233333333 1122222211 011000 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc Q lcl|Aclame:pro 140 EAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL 219 (519) Q Consensus 140 eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~ 219 (519) ++. .. . T Consensus 183 -------------~~~---------------------------------------------~~----------------a 188 (435) T protein:vir:80 183 -------------KGG---------------------------------------------AI----------------V 188 (435) T ss_pred -------------eCC---------------------------------------------cc----------------e Confidence 000 00 0 Q ss_pred eecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) Q Consensus 220 ~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlE 299 (519) .-++ | +..+++...++++++...+.-+-....|.||.+|-.- +.|.|+.|.+-|+..|... T Consensus 189 ~~v~--------E---------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~a~~~~ 249 (435) T protein:vir:80 189 GYIG--------A---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIGAR 249 (435) T ss_pred eeec--------c---------CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHHHHHHH Confidence 0011 1 1123444455666666666666777899999999432 4567888888888888888 Q ss_pred hhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 300 INReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) +++-|+.= .| +...+.|++.......+.. -.++.....+...+.+.-..+.....+-.....|++|. T Consensus 250 ~d~a~l~G---------~G---~~~~p~Gi~~~~~~~~~~~-~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~ 316 (435) T protein:vir:80 250 EDKAFIRD---------DG---TANTPKGLRFWALPGNVIT-ASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPR 316 (435) T ss_pred HHHHhhcc---------CC---CCCcccceeecccccceee-cccccchhhHHHHHHHHHHHhhccccccccCEEEEcHH Confidence 88877751 01 1112345543221111000 00111112222223332222222221223466799999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc--------eEE--------EEEecCCCcc Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD--------YFT--------IGYKGSNEMD 443 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~--------vG~KG~~~~~ 443 (519) ....|...- +.. +...+ .+.++ |+|.| ++||++.+.|.+ -++ ||-.+.-.. T Consensus 317 ~~~~L~~lk--d~~----G~~l~-~~~~~----~~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i- 383 (435) T protein:vir:80 317 TFRFLEGLR--DGN----GNKVY-PELAN----GMLKG-YPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEI- 383 (435) T ss_pred HHHHHHhhh--ccC----Cceec-cCCCC----CeEee-eeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEE- Confidence 999887542 110 01111 22333 46766 699998886532 122 332222211 Q ss_pred ceeEeecccccccccccCcccc---cceeeeeeeeceeecCcccccccCCcceeecCCchhh Q lcl|Aclame:pro 444 AGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGINPFADPAAQAPTKRIQNGMPDIV 502 (519) Q Consensus 444 ~~~fyaPYv~~~~~~~~dp~s~---qP~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a 502 (519) -..+|.-+..-...--..| +=.+=+.-|+++.+. +...-.+..|-.|-| T Consensus 384 ---~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~-------~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 384 ---DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPR-------HVESIAVLSGVAWGA 435 (435) T ss_pred ---EEeccccccccccchhhhhhcCcceeeeeeeeCcEee-------cccceEEEeccCCCC Confidence 1111111100000000001 122334556665441 111223344555544 No 53 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=85.95 E-value=0.049 Score=27.76 Aligned_cols=355 Identities=11% Similarity=0.020 Sum_probs=127.6 Q ss_pred CChH--HHHHhhhhh-------hCCCccc--cccccch--hhhhhhhhhhHHHH-----Hhhh-----hhccchhhhhhh Q lcl|Aclame:pro 1 MKKN--ALVQKWSAL-------LENEALP--EIVGASK--QAIIAKIFENQEQD-----ILTA-----PEYRDEKISEAF 57 (519) Q Consensus 1 ~~~~--~l~~kw~p~-------l~~~~~~--~~~~~~~--~~~~~~~~enq~~~-----~~~~-----~~~~~~~~~~~~ 57 (519) |++| +|+++=+-+ ++..... ++....+ ..+.++|=+-++.. +.+. ..-..+...+.. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNG 80 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHH Confidence 8876 555544433 3322221 2211110 01111110000000 0000 000000000011 Q ss_pred h--------hhhhhhhhcc-ccccc-hhhhcccc-ccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeee Q lcl|Aclame:pro 58 G--------SFLTEAEIGG-DHGYD-ATNIAAGQ-TSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALR 126 (519) Q Consensus 58 ~--------~~~~~~~~~~-~~g~~-~~~~~est-~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMR 126 (519) . .++.+....+ ..... ...+..++ ++|.+.--..+.-.+++.+.......+++++.||+++.|-+--.| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~ 160 (404) T protein:vir:10 81 ALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEK 160 (404) T ss_pred HHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEE Confidence 1 1111111010 11111 11111222 112211101111124444445667788999999999988532211 Q ss_pred eeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccc Q lcl|Aclame:pro 127 AVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKL 206 (519) Q Consensus 127 srY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~ 206 (519) ....+ ...|- T Consensus 161 -----~~~~~------------~~~~v----------------------------------------------------- 170 (404) T protein:vir:10 161 -----RSKQK------------PMKPL----------------------------------------------------- 170 (404) T ss_pred -----ecCCc------------ceeec----------------------------------------------------- Confidence 10000 00000 Q ss_pred ccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHH Q lcl|Aclame:pro 207 DAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADA 286 (519) Q Consensus 207 ~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEa 286 (519) ++| |.. ..+ ....++++++.+.|.-+-...+|-||.+|-. .+.++ T Consensus 171 ----------------~e~------~~~---~~~------~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~ 215 (404) T protein:vir:10 171 ----------------SEN------QQI---PTN------GDNGKLERFNFKLKDLADFMSIPNDLLKFAD----KSLED 215 (404) T ss_pred ----------------ccc------ccc---ccc------ccccceeeeEeeheeeEeeehhhHHHHhhcH----HHHHH Confidence 000 000 000 0112233444444444455689999999843 35688 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 287 ELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQT 366 (519) Q Consensus 287 ELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T 366 (519) .|.+.|+..|...+|+.||.=- .+...+.|+.......... +.. ...+..++..-+. .... T Consensus 216 ~i~~~la~~~~~~~~~~il~G~------------g~~~~~~gi~~~~~~~~~~---~~~---~~~~~~~~~~~~~-~l~~ 276 (404) T protein:vir:10 216 WIINWFVDKVRITRNAEILYGA------------GGDEHATGIMTANKFKKIT---LPK---SPALKDFKKCKNV-ELLN 276 (404) T ss_pred HHHHHHHHHHHHHHHHHHhhcC------------CCCCcccceeeccccceee---ccc---cccHHHHHHHHHh-hhhc Confidence 8888888888888888887410 1112234444322111000 000 0011122221111 1223 Q ss_pred cccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEec-CCCccceEEEEEecCCCccce Q lcl|Aclame:pro 367 GRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYID-QYARSDYFTIGYKGSNEMDAG 445 (519) Q Consensus 367 ~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D-~y~~~dy~~vG~KG~~~~~~~ 445 (519) .+...-.+||+|+....|...- +.. +.-.+..+.+. ..-++|.| ++|++. ...+.. ...+.. T Consensus 277 ~~~~~~~~v~n~~~~~~L~~lk--d~~----G~~l~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~---------~~~~~~ 339 (404) T protein:vir:10 277 VFKATSSWIVNQDGFNYLDSLE--DKT----GRPYLQPDPKD-PTQYRFLG-LPVIELPNDLLLS---------TESAIP 339 (404) T ss_pred cccCCCEEEEcHHHHHHHHHhh--ccC----CceeeccCcCC-CCCccccc-eeeEEecccccCC---------CCCccE Confidence 3333345799999999887642 100 01111112111 11246777 577753 221110 000111 Q ss_pred eEeeccc---------ccccccccCc----ccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 446 IYYAPYV---------ALTPLRGSDP----KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 446 ~fyaPYv---------~~~~~~~~dp----~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) ++|+.+- .+......++ ...+=.+-...|+++.+ +|-+ ...+.-. .+ .+.. T Consensus 340 ~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a-------~~~~~~~---~a-a~~~ 404 (404) T protein:vir:10 340 VLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEA-------LLIAEIP---VE-SVQA 404 (404) T ss_pred EEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEEee---cc-cCCC Confidence 2222111 0111111122 23344566777887643 2211 1111111 00 0010 No 54 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=85.86 E-value=0.05 Score=27.73 Aligned_cols=273 Identities=11% Similarity=0.013 Sum_probs=119.5 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCcc Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDN 243 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~ 243 (519) .+.... ...+...+-... ... ...........+.... +....+ ..|.+..++.=-.+..+|.+ .....- T Consensus 1 ma~~~T-~~~~~iiPev~~-~~v-~~~~~~~~~~~~~~~~---~~~l~g--~~G~tv~ip~~~~~g~~~~~---~eg~~i 69 (274) T protein:vir:93 1 MPQGIT-KTSNQIIPEVLA-PMM-QAQLEKKLRFASFAEV---DSTLQG--QPGDTLTFPAFVYSGDAQVV---AEGEKI 69 (274) T ss_pred CCccce-ehhheechHHHH-HHH-HHHHHhhhhhcccccc---cccccC--CCCCEEEEEeeccCCCcccc---cCCCcc Confidence 111110 111111110000 000 0000000000000000 000000 01112222110001122221 111122 Q ss_pred ccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccc Q lcl|Aclame:pro 244 PWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTV 323 (519) Q Consensus 244 ~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~ 323 (519) ...++. ....+++-|-|+-.=+++=| +.+.+ +-|.-.+..+-++..+...++++++..+...... T Consensus 70 ~~~~it--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~--------- 134 (274) T protein:vir:93 70 PTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------- 134 (274) T ss_pred cccccc--cceeEEEeeeecccccccHH--HHHhh--ccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 334443 44555555666532233332 22223 5789999999999999999999999865432110 Q ss_pred cccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccccc Q lcl|Aclame:pro 324 GAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFN 403 (519) Q Consensus 324 ~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~ 403 (519) ..+..+ ..+-+-.+..++.++. ..+++++|+|.+++.|.....+.|..+. ..- T Consensus 135 -~~~~~~-------------~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s----~~g 187 (274) T protein:vir:93 135 -VNADIT-------------KLNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDASTNFTRAT----ELG 187 (274) T ss_pred -cccccc-------------CHHHHHHHHHHhhhcc---------CCccEEEeCHHHHHHHHhhhhhcccccc----ccc Confidence 001111 2333334444443321 2568999999999999875544332211 111 Q ss_pred ccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeecCcc Q lcl|Aclame:pro 404 VDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGINPFA 483 (519) Q Consensus 404 ~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~nP~~ 483 (519) .+...+-.+|.+.| ++||+|+..|..-..+.-+|. +-|.---+.......|++++.=.+-...|||+.+ T Consensus 188 ~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~ga------i~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~---- 256 (274) T protein:vir:93 188 DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYVAYL---- 256 (274) T ss_pred ccceeecccceecC-eeEEEcCCCCcceEEEEeCCe------EEEEecCCcccccccchhhcccEEEEEEEEEEEE---- Confidence 12222335678876 899999998865333332331 1121111222223359999999999999999765 Q ss_pred cccccC-CcceeecCCchhhhcccc Q lcl|Aclame:pro 484 DPAAQA-PTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 484 ~~~~~~-~~~~i~~~~d~~a~~~~~ 507 (519) .+. +-..+... +++-.| T Consensus 257 ---~~~~~~v~~t~~----~~s~~~ 274 (274) T protein:vir:93 257 ---YDESKAVKITKG----SGSLEM 274 (274) T ss_pred ---EcCCceEEEeeC----ccccCC Confidence 111 11222211 223333 No 55 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=85.84 E-value=0.05 Score=27.72 Aligned_cols=271 Identities=11% Similarity=0.034 Sum_probs=118.5 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCCCc-cccccccccccccccceecccccchhhhhhcccCCCCCc Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDA-AKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTD 242 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~-~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~ 242 (519) .+..+. ...+...+-.-. .. ............+.... ..+.. ..|.+..++.=-.+..+|.. ..... T Consensus 1 ma~~~T-~~~d~i~Pev~s-~~-v~~~~~~~~~~~~~~~~~~~l~g------~~G~tv~ip~~~~~g~~~~~---~~g~~ 68 (274) T protein:vir:96 1 MAQGTT-KVSNLIVPEVLA-PM-MQAELDKKLRFAQFADIDSTLVG------QPGDTLTFPAFTYSGDAQVI---AEGEK 68 (274) T ss_pred CCcccc-chhhhhhhHHHH-HH-HHHHHHhhhhhcccccccccccC------CCCCEEEEEeeccCCCcccc---CCCCc Confidence 111111 111111111000 00 00000000000000000 00000 01122222110011122211 11123 Q ss_pred cccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccccc Q lcl|Aclame:pro 243 NPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNT 322 (519) Q Consensus 243 ~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~ 322 (519) -.+.++.++ ..+++.|-|+-.=+++=|. ++..+-|.-.+..+-++..++.+++++++..+...... T Consensus 69 i~~~~it~~--~~~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~-------- 134 (274) T protein:vir:96 69 IPVDQIGTS--KREAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------- 134 (274) T ss_pred Cchhhcccc--eeEEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-------- Confidence 344455444 3444445554322333222 12346789999999999999999999999866432110 Q ss_pred ccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccc Q lcl|Aclame:pro 323 VGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGF 402 (519) Q Consensus 323 ~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~ 402 (519) ..+..+ .++.+-.+..++.++. ..+++++|+|.+++.|.......|..+... + T Consensus 135 --~~~~~~-------------~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~--g- 187 (274) T protein:vir:96 135 --VEADIT-------------KLDGLQTAIDKFNDED---------LEPMVLFVNPLDAGGLRTSASDNFTRPTQL--G- 187 (274) T ss_pred --cCcccc-------------cHHHHHHHHHHhcccC---------CCceEEEeCHHHHHHHHhcccccccccccc--c- Confidence 001111 2333344444444321 256899999999999987654444322110 0 Q ss_pred cccCCCceEEEEecCcEEEEecCCCccce-EEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-c Q lcl|Aclame:pro 403 NVDTTKAVFAGVLGGKYRVYIDQYARSDY-FTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-N 480 (519) Q Consensus 403 ~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-n 480 (519) .....+-.+|.+.| ++||+|...|..= +++| +|.-. |+.. -+...-...|+..++-.|-...+||+.+ | T Consensus 188 -~~~~~~g~ig~~~G-~~Vi~s~~~p~~t~~l~~-~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~ 258 (274) T protein:vir:96 188 -DNIIVKGAFGEALG-AVIVRSNKLNKGEALLAK-KGAVK-----LITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYD 258 (274) T ss_pred -ccceeecccceecC-eeEEEcCCCCcceEEEEe-Cccee-----eeec-CCcccccccchhhcccEEEEeeEEEEEEEc Confidence 11222334678876 8999999998642 2222 22211 1111 1122222369999999999999999765 2 Q ss_pred CcccccccCCcceeecCCchhhhcccchhhhhhhh Q lcl|Aclame:pro 481 PFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVY 515 (519) Q Consensus 481 P~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~ 515 (519) | ++-.++..+. +++ |+ T Consensus 259 ~-------~~vv~~t~~~------~~~------~~ 274 (274) T protein:vir:96 259 E-------SKVVKITKGA------GDE------VM 274 (274) T ss_pred C-------ccEEEEEcCc------ccc------cC Confidence 2 1223333321 111 11 No 56 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=85.13 E-value=0.055 Score=27.49 Aligned_cols=349 Identities=12% Similarity=0.044 Sum_probs=131.4 Q ss_pred CCh-----HHHHHhhhhhhCCCccccccccchhhh--hhhhhhhHHHHHhhh---hhccchhhhhhh------------- Q lcl|Aclame:pro 1 MKK-----NALVQKWSALLENEALPEIVGASKQAI--IAKIFENQEQDILTA---PEYRDEKISEAF------------- 57 (519) Q Consensus 1 ~~~-----~~l~~kw~p~l~~~~~~~~~~~~~~~~--~~~~~enq~~~~~~~---~~~~~~~~~~~~------------- 57 (519) |+. ++|.+++.-+-+. +-++.+.-+..+ +..+.+++++.+.+. ..-++..+.+.. T Consensus 1 m~~~~k~l~el~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQ--IKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGE 78 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 332 2333333333221 000000000000 011222222221100 000000111000 Q ss_pred ------hhhhhhhh-----hccccccch-----hhhccccccccccccCc-eehhhHHHHHhhhhhhhceeeccCCccch Q lcl|Aclame:pro 58 ------GSFLTEAE-----IGGDHGYDA-----TNIAAGQTSGAVTQIGP-AVMGMVRRAIPHLIAFDICGVQPLNNPTG 120 (519) Q Consensus 58 ------~~~~~~~~-----~~~~~g~~~-----~~~~est~tg~v~~~~P-~L~~l~Rra~p~LIa~DI~GVQPmTGPTG 120 (519) .....+.. .....+... +.+...+.++.. -.-| ..-.++++..+..+..++|.++||.+++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~ 157 (395) T protein:vir:43 79 EAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGA-LVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSV 157 (395) T ss_pred chhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCcc-ccchhhHHHHHHHHHhhhhHHhhccceecCCCce Confidence 00000000 000000000 111111111110 0111 12234455556777888899999877642 Q ss_pred hheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCC Q lcl|Aclame:pro 121 QVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGA 200 (519) Q Consensus 121 LIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~ 200 (519) -+ . +...... .+ T Consensus 158 ~~--~--~~~~~~~--------------~a-------------------------------------------------- 169 (395) T protein:vir:43 158 EY--V--RETGFVN--------------NA-------------------------------------------------- 169 (395) T ss_pred EE--E--EEecCCC--------------ce-------------------------------------------------- Confidence 11 0 1000000 00 Q ss_pred CCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhc Q lcl|Aclame:pro 201 TDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVH 280 (519) Q Consensus 201 t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiH 280 (519) .-+++| ...++-..+++++++..+.-+-...+|-||.||.- T Consensus 170 -------------------~~v~E~-----------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~--- 210 (395) T protein:vir:43 170 -------------------APVSEG-----------------TQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS--- 210 (395) T ss_pred -------------------eeecCC-----------------ccccccccceeEEEEeeeeEEEeehhhHHHHHhHH--- Confidence 000111 11222233444555555555566789999999853 Q ss_pred CCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 GMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAA 360 (519) Q Consensus 281 GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~ 360 (519) +.++.|.+-|+..|...+|+.||. | +-+...+.|++......... .-... ....++..|.++.. T Consensus 211 --~l~~~v~~~la~a~~~~~d~~~l~--------G----~g~~~~~~Gi~~~~~~~~~~-~~~~~-~~~~~~~~i~~~~~ 274 (395) T protein:vir:43 211 --ALQSYIDARARYGLMLVEECQLLY--------G----NGTGANLHGIIPQAQAYAPP-SGVVV-TAEQRIDRIRLAIL 274 (395) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHh--------c----cCCCCccccccccccccccc-ccccc-ccchhHHHHHHHHH Confidence 358889999999999999988874 1 00111234544322111000 00000 01123444444444 Q ss_pred HHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCC Q lcl|Aclame:pro 361 EIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSN 440 (519) Q Consensus 361 ~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~ 440 (519) .+.. .+..+..+|++|.....|...= +.. ++ .+..+... .-.++|.| ++|+++++.+.+=+++|--.. T Consensus 275 ~~~~--~~~~~~~~vmn~~~~~~l~~lk--d~~----G~-~i~~~~~~-~~~~~l~G-~pVv~~~~~~~~~~~~gd~~~- 342 (395) T protein:vir:43 275 QAQL--AEFPASGIVLNPIDWALIELNK--DAE----NR-YIIGSPQN-GTTPTLWR-LPVVETQAITQDEFLTGAFSL- 342 (395) T ss_pred hhcc--ccCCCcEEEEcHHHHHHHHHhh--ccC----Cc-eecccccc-CCCceecc-eeeEEcCCCCCCcEEEEeccc- Confidence 5543 3345578999999988875321 100 01 11111111 01246776 799999998865444442110 Q ss_pred CccceeEeecccccccccccC-c-cccc---ceeeeeeeeceee-cCcccccccCCcceeecCCchhhhc Q lcl|Aclame:pro 441 EMDAGIYYAPYVALTPLRGSD-P-KNFQ---PVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNS 504 (519) Q Consensus 441 ~~~~~~fyaPYv~~~~~~~~d-p-~s~q---P~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~ 504 (519) ..+.. .-....+...+ . ..|+ =.+-+..|++..+ +|= ...++. ++.+ T Consensus 343 ----~~~~~-~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~-------a~~~~~-----~taa 395 (395) T protein:vir:43 343 ----GAQIF-DRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPE-------AFVTGS-----LTAS 395 (395) T ss_pred ----eEEEE-EecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc-------ceEEEE-----eccC Confidence 00000 00111111111 1 1232 2333445777654 111 111221 1111 No 57 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=84.44 E-value=0.06 Score=27.27 Aligned_cols=307 Identities=9% Similarity=0.005 Sum_probs=127.1 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) +|-+.+--.+.+.+....+. ... +.+-....++ ++...--....-.+++.+..+.+..+++.+-| T Consensus 1 ~~~~~~~~~~~~~f~~~~~~---~~~-----------~~a~~~~~~~-~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVK---PQV-----------FNPDNVMMHE-KKDGTLLNDFTTPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CchhHHHHHHHHHHHHhhhh---hhh-----------cccccccccC-CCcceechhHHHHHHHHHHhhchhhhhcceee Confidence 33333322222222111110 000 0111111111 11111111222235555666778888999999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++++-- |+-.... .+| T Consensus 66 ~~~~~~~-------ip~~~~~---~~a----------------------------------------------------- 82 (324) T protein:vir:93 66 MEGTEKK-------FTFWADK---PGA----------------------------------------------------- 82 (324) T ss_pred ccCCceE-------EEEEecC---cce----------------------------------------------------- Confidence 9876532 2110000 000 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) ..++ | +..+++..-++++++++.+..+-....|-||.+ T Consensus 83 -------------------------~~v~--------E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ 120 (324) T protein:vir:93 83 -------------------------YWVG--------E---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred -------------------------eeec--------C---------CccccccccceeEEEEEeEEEEEeehhhHHHHh Confidence 0001 1 111233333445666666666667789999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |-. .|.+++|.+.|+..|...+++.+|.=-. .+..+.|+++........ ......+.. T Consensus 121 ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g------------~~~~~~~~~~~~~~~~~~------~~~~~~~~~ 178 (324) T protein:vir:93 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------------NNPFGKSIAQSIEKTNKV------IKGDFTQDN 178 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHHhcCCC------------CCCcCcccccccccccee------ccccccHHH Confidence 953 4678999999999999999998875210 011122333221110000 000111223 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCc--cce- Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYAR--SDY- 431 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy- 431 (519) |.++-+.|.. .+...+.+||+|.....|...- + + . .+..-.+..+ +.|.| ++|++.+... ... T Consensus 179 i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~--d-~--~--G~~~~~~~~~----~~l~G-~PVv~~~~~~~~~~~i 244 (324) T protein:vir:93 179 IIDLEALLED--DELEANAFISKTQNRSLLRKIV--D-P--E--TKERIYDRNS----DSLDG-LPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh--C-C--C--CCeeecCCCC----Ccccc-eeeEeecCCCCCcceE Confidence 3344444433 2335568999999999997542 1 0 0 0111112222 45666 6888766533 222 Q ss_pred -------EEEEEecCCCccceeEeecccccccccccCc------ccccceeeeeeeeceee-cCcccccccCCcceeecC Q lcl|Aclame:pro 432 -------FTIGYKGSNEMDAGIYYAPYVALTPLRGSDP------KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG 497 (519) Q Consensus 432 -------~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp------~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~ 497 (519) +++|..+.-+.+ ...+..+......|. ..-|=.+=+..||+..+ +|= ..++|... T Consensus 245 ~~gdfs~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~-------a~~~l~~a 313 (324) T protein:vir:93 245 ITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK-------AFAKLVPA 313 (324) T ss_pred EEEecceEEEEEecCcEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc-------ceEEEecc Confidence 333443322211 011111111110110 01123344456776543 221 11223221 Q ss_pred Cchhhhcccch Q lcl|Aclame:pro 498 MPDIVNSLGLN 508 (519) Q Consensus 498 ~d~~a~~~~~~ 508 (519) ...-.-..++- T Consensus 314 ~~~~~~~~~~~ 324 (324) T protein:vir:93 314 DKRTDSVPGEV 324 (324) T ss_pred cccCCCCCCCC Confidence 10000011111 No 58 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=84.20 E-value=0.062 Score=27.19 Aligned_cols=288 Identities=9% Similarity=-0.008 Sum_probs=119.9 Q ss_pred Hhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccc-cccccccCc--eehhhHHHHHhhhhhhhceeeccCCcc Q lcl|Aclame:pro 42 ILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQT-SGAVTQIGP--AVMGMVRRAIPHLIAFDICGVQPLNNP 118 (519) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~-tg~v~~~~P--~L~~l~Rra~p~LIa~DI~GVQPmTGP 118 (519) ++ -...+++...+..++ +.+-...=| ..-.+++.+.+..+..+++.+-||+++ T Consensus 1 ~~------------------------~~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 56 (318) T protein:vir:24 1 MA------------------------AGTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT 56 (318) T ss_pred CC------------------------CCCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 11 111222222221111 111111111 112234445566677888889898775 Q ss_pred chhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCC Q lcl|Aclame:pro 119 TGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDA 198 (519) Q Consensus 119 TGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~a 198 (519) +.- |+-.... .+| T Consensus 57 ~~~-------ip~~~~~---~~a--------------------------------------------------------- 69 (318) T protein:vir:24 57 GQK-------IPHWVGD---VSA--------------------------------------------------------- 69 (318) T ss_pred ceE-------EEEEeCC---cce--------------------------------------------------------- Confidence 422 1110000 000 Q ss_pred CCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHh Q lcl|Aclame:pro 199 GATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRA 278 (519) Q Consensus 199 g~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKA 278 (519) .-+++ +.++++...++++++.+.|..+-...+|-||.+|-. T Consensus 70 ---------------------~~v~E-----------------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~- 110 (318) T protein:vir:24 70 ---------------------QWIGE-----------------GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP- 110 (318) T ss_pred ---------------------EEecC-----------------CccccccccceeEEEEeeEEEEEeehhhHHHhhcCh- Confidence 00011 112233344456666666666667789999999843 Q ss_pred hcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeecccccccc----ccchHHHHHHHHHHH Q lcl|Aclame:pro 279 VHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIR----GARWAGESFKALLFQ 354 (519) Q Consensus 279 iHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~----~~~~a~e~~r~L~~~ 354 (519) .|.+++|.+.|+..|...|+..++.-.. .+.+.|++......... ..-+....... T Consensus 111 ---~~~~~~i~~~l~~~~~~~~d~a~l~G~g-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 170 (318) T protein:vir:24 111 ---ANYLGTMRTKVATAFAMAFDGAAMHGTD-------------SPFPTYIGQTTKAISIADTTGATTVYDQVAVN---- 170 (318) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHhhhcccC-------------CCCCcccccccccccccccccccchHHHHHHH---- Confidence 6789999999999999999999985111 01112222211111100 00111111112 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCce---E-EEEecCcEEEEecCCCccc Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAV---F-AGVLGGKYRVYIDQYARSD 430 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~---~-~G~l~~~~~vy~D~y~~~d 430 (519) +...+.. .......+||+|.....|...= +.. +...+..+.+... + -+.+.+ ++|++.+..+.. T Consensus 171 ---~~~~~~~--~~~~~~~~v~n~~~~~~L~~lk--d~~----G~~l~~~~~~~~~~~~~~~~~i~g-~pv~~~~~~~~~ 238 (318) T protein:vir:24 171 ---GLSLLVN--DGKKWTHTLLDDITEPILNGAK--DQN----GRPLFIESTYGEAASPFRSGRIVA-RPTILSDHVVEG 238 (318) T ss_pred ---HHHhhcc--ccCCCCEEEEcHHHHHHHHHhh--ccC----CceeecCccccCccccccCceEEE-EeeEEeCCCCCC Confidence 2222222 2234478899999999997431 110 0011111111111 1 123333 577777766531 Q ss_pred --eEEEEEecCCCccceeEeecccccc--------cccccCccc-----c---cceeeeeeeeceee-cCcccccccCCc Q lcl|Aclame:pro 431 --YFTIGYKGSNEMDAGIYYAPYVALT--------PLRGSDPKN-----F---QPVMGFKTRYGIGI-NPFADPAAQAPT 491 (519) Q Consensus 431 --y~~vG~KG~~~~~~~~fyaPYv~~~--------~~~~~dp~s-----~---qP~~g~~tRY~l~~-nP~~~~~~~~~~ 491 (519) .+++| +- +.++|+-.-.+. .....|+.. | |=.+=...||+..+ +|- .. T Consensus 239 ~~~~~~g---df---s~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~-------a~ 305 (318) T protein:vir:24 239 TTVGFMG---DF---SQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAE-------AF 305 (318) T ss_pred ccEEEEe---ec---ceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEeccc-------ce Confidence 11111 11 112232211110 011111111 2 23334567887664 221 12 Q ss_pred ceeecCCchhhhccc Q lcl|Aclame:pro 492 KRIQNGMPDIVNSLG 506 (519) Q Consensus 492 ~~i~~~~d~~a~~~~ 506 (519) .+|.... .++..+ T Consensus 306 ~~i~~~~--a~~~~~ 318 (318) T protein:vir:24 306 VALTNVV--SGGGEG 318 (318) T ss_pred EEEEeec--cCCCCC Confidence 3343211 112222 No 59 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=83.77 E-value=0.066 Score=27.06 Aligned_cols=341 Identities=11% Similarity=0.034 Sum_probs=131.0 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhh-------hhhhhHHHHHhhhhhccchhhh-------hhh--------- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIA-------KIFENQEQDILTAPEYRDEKIS-------EAF--------- 57 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~-------~~~enq~~~~~~~~~~~~~~~~-------~~~--------- 57 (519) |+.++|.++|+.+.+. +-++.+..++.... ...|..++ +..+.+-+.++.. +.. T Consensus 1 M~~~eL~~~~~~~~~~--~~~l~e~~~~~~~~~~~~~~~~~~ee~~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:38 1 MNINQLKDAFDMAGQK--VQDLEDKRAQFAIDLGNDASSHSVDDINK-LNASLKNAKMAQELAKSAYEDARANLNAEPVN 77 (395) T ss_pred CCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 9999999999988542 33333322222111 11111100 0000000000000 000 Q ss_pred -------hhhhhhhhh--ccccccchhhhccccccccccccCcee--hhhHHHHHhhhhhhhceeeccCCccchhheeee Q lcl|Aclame:pro 58 -------GSFLTEAEI--GGDHGYDATNIAAGQTSGAVTQIGPAV--MGMVRRAIPHLIAFDICGVQPLNNPTGQVFALR 126 (519) Q Consensus 58 -------~~~~~~~~~--~~~~g~~~~~~~est~tg~v~~~~P~L--~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMR 126 (519) ......+.. .--++.........+++++-...=|.- -.+++.+.+..+..++|.+.||++++|-+--.+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (395) T protein:vir:38 78 KKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK 157 (395) T ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe Confidence 000000000 000011111111111221111111211 124455556677888999999999987531110 Q ss_pred eeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccc Q lcl|Aclame:pro 127 AVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKL 206 (519) Q Consensus 127 srY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~ 206 (519) .....+ .+.| T Consensus 158 -~~~~~~---------------~a~~------------------------------------------------------ 167 (395) T protein:vir:38 158 -LADITP---------------LKDL------------------------------------------------------ 167 (395) T ss_pred -eccCCc---------------cccc------------------------------------------------------ Confidence 000000 0000 Q ss_pred ccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHH Q lcl|Aclame:pro 207 DAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADA 286 (519) Q Consensus 207 ~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEa 286 (519) +++|-.. ..+....|.+..|+..|..+ ...+|.||.+|- +.|-++ T Consensus 168 ---------------v~E~~~~---------~~~~~~~f~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~ 212 (395) T protein:vir:38 168 ---------------DDESALI---------GDNDDPELTVVKYLIHRYAG-------ITTVTNTLLKDT----VDNIIQ 212 (395) T ss_pred ---------------ccccccc---------ccccccceeeEEeeeeeeEe-------ehhhHHHHHhhh----HHHHHH Confidence 0010000 00112335555555555544 456999999993 356688 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 287 ELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQT 366 (519) Q Consensus 287 ELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T 366 (519) .|.+-|+..|..-||..|+.=.- +.....|+.+++ ....++... +... T Consensus 213 ~i~~~la~~~~~~~~~~il~g~g------------~~~~~~~~~~~~-------------~i~~~~~~~------l~~~- 260 (395) T protein:vir:38 213 WLVNWAAKKDVVTRNAKILEVMG------------KAPKKPTISQFD-------------NIKDLENNT------LDPA- 260 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhccc------------ccccccccccHH-------------HHHHHHHHh------hhhh- Confidence 99999999999888888885110 111112332222 112222211 1111 Q ss_pred cccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc-----c-eEEEE----- Q lcl|Aclame:pro 367 GRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS-----D-YFTIG----- 435 (519) Q Consensus 367 ~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----d-y~~vG----- 435 (519) +.....+||+|.....|...= .+. +...+..+.+. -..++|.| ++|++....+. + -+++| T Consensus 261 -~~~~a~~v~n~~~~~~L~~lk---d~~---G~~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~i~~gd~~~~ 331 (395) T protein:vir:38 261 -IESTSSFITNQSGYNILSKVK---DAD---GRYLMQPDVTS-PDKYLIDG-KPVIRIADKWLPDVSGSHPLYFGDLKQG 331 (395) T ss_pred -hcCCCEEEEcHHHHHHHHHhh---ccC---CceeeccCcCC-CCcceecc-ceeEEecccccCcCCCcceEEEEecccc Confidence 113356899999998886431 110 01111111111 11246776 58877543221 1 12222 Q ss_pred ----EecCCCccceeEeecccccccccccCcccccceeeeeeeeceeec-CcccccccCCcceeecC----Cchhhhccc Q lcl|Aclame:pro 436 ----YKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN-PFADPAAQAPTKRIQNG----MPDIVNSLG 506 (519) Q Consensus 436 ----~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~n-P~~~~~~~~~~~~i~~~----~d~~a~~~~ 506 (519) .+... .+=+.++. ..+-..-+=.+-+..||+..+- |-+- .++.-. .+..+-..| T Consensus 332 ~~i~~~~~~----~i~~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~-------~~~~~~~~~~~~~~~~~~~ 394 (395) T protein:vir:38 332 ITLFDRQQM----QIDTTNVG------AGSFEHDTTKLRFIDRFDVQLIDDGAF-------AAASFKTVANQAQGTAGTG 394 (395) T ss_pred EEEEEecce----EEEEeccc------cchhhcCceEEEEEEeeccEEecccce-------EEEEeecccCCCCCccCCC Confidence 11110 01111110 0011222344555566665431 2110 111000 111111233 Q ss_pred c Q lcl|Aclame:pro 507 L 507 (519) Q Consensus 507 ~ 507 (519) | T Consensus 395 ~ 395 (395) T protein:vir:38 395 K 395 (395) T ss_pred C Confidence 3 No 60 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=83.71 E-value=0.066 Score=27.05 Aligned_cols=331 Identities=15% Similarity=0.078 Sum_probs=124.1 Q ss_pred CCh------HHHHHhhhh---hhCCCccccccccchhhh---hhhh------hhhHHHHHhhhh----hcc--------c Q lcl|Aclame:pro 1 MKK------NALVQKWSA---LLENEALPEIVGASKQAI---IAKI------FENQEQDILTAP----EYR--------D 50 (519) Q Consensus 1 ~~~------~~l~~kw~p---~l~~~~~~~~~~~~~~~~---~~~~------~enq~~~~~~~~----~~~--------~ 50 (519) |.+ +++.++.+. +++.+..-+. ...-..+ +..| |+.|.+...... ... . T Consensus 10 ~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~-~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (400) T protein:vir:38 10 VKKQLDEKRSALPAMKTELRSLLEGEDSEEN-LKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEE 88 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhh Confidence 111 111111111 1111110000 0000000 0000 011100000000 000 0 Q ss_pred hhhhhhhhh-------hhhh-----hhhc-----cccccchhh-hcccccc--ccccccCceehhhHHHHHhhhhhhhce Q lcl|Aclame:pro 51 EKISEAFGS-------FLTE-----AEIG-----GDHGYDATN-IAAGQTS--GAVTQIGPAVMGMVRRAIPHLIAFDIC 110 (519) Q Consensus 51 ~~~~~~~~~-------~~~~-----~~~~-----~~~g~~~~~-~~est~t--g~v~~~~P~L~~l~Rra~p~LIa~DI~ 110 (519) ....+.+.. .+.. .... ......... +..++++ |.+.--.+..-.++++..+..+..+++ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~ 168 (400) T protein:vir:38 89 HSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFT 168 (400) T ss_pred hhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcc Confidence 000000000 0000 0000 000000000 1111111 111100111222444445667788899 Q ss_pred eeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 111 GVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQA 190 (519) Q Consensus 111 GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~ 190 (519) .+.||++.++-+--++.. .+. ..| T Consensus 169 ~~~~~~~~~~~~~~~~~~----~~~--------------~~~-------------------------------------- 192 (400) T protein:vir:38 169 NVFQASTQKGTYPTVANA----TTK--------------MVT-------------------------------------- 192 (400) T ss_pred eeEeccCcceEEEEEecC----CCc--------------ccc-------------------------------------- Confidence 999998886533222210 000 000 Q ss_pred cccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccH Q lcl|Aclame:pro 191 VEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSI 270 (519) Q Consensus 191 ~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTm 270 (519) +++|-.. ...++..|. .++...+.-+-...+|- T Consensus 193 -------------------------------~~E~~~~---------~~~~~~~f~-------~i~~~~~k~~~~~~is~ 225 (400) T protein:vir:38 193 -------------------------------VAELEKN---------PAMAKPEFK-------PVNWSVETYRQALPVSQ 225 (400) T ss_pred -------------------------------ccccccc---------cccccccce-------eeEeehhheeeehhhHH Confidence 0000000 000122233 45555555566778999 Q ss_pred HHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHH Q lcl|Aclame:pro 271 ELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKA 350 (519) Q Consensus 271 ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~ 350 (519) ||.+|- ..|.+++|.+.|...|...+|+-|+.-... +...|+..++ .... T Consensus 226 ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~-------------~~~~~~~~~~-------------~~~~ 275 (400) T protein:vir:38 226 ESIDDS----AIDLVGLIAQNGQQIKVNTTNGAVATLLKG-------------FTAKTISSVD-------------DLKH 275 (400) T ss_pred HHHhhh----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc-------------ccccccccHH-------------HHHH Confidence 999985 346788999999999999988888752211 1112332221 1122 Q ss_pred HHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhc----CcccccccccccccccccCCCceEEEEecCcEEEEecCC Q lcl|Aclame:pro 351 LLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQY 426 (519) Q Consensus 351 L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y 426 (519) ++..... ..+ . ...|++|.....|... |...+ ..+.+.. -.++|.| ++|++..+ T Consensus 276 ~~~~~~~--------~~~-~-a~~v~~~~~~~~l~~lkd~~G~~i~----------~~~~~~~-~~~~l~G-~pv~~~~~ 333 (400) T protein:vir:38 276 INNVDLD--------PAY-S-RVIIASQSFYNFLDTVKDGNGRYLL----------QDSILTP-SGKSVLG-MPIAVVSD 333 (400) T ss_pred HHHhhhh--------hhh-C-cEEEEcHHHHHHHHHhhccCCCeee----------ecCcCCC-Ccccccc-ceeEEecc Confidence 2221111 111 2 4578899998888753 32222 1121111 1246877 58887776 Q ss_pred Cccc-----eEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecCCch Q lcl|Aclame:pro 427 ARSD-----YFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPD 500 (519) Q Consensus 427 ~~~d-----y~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~ 500 (519) .+.. .+++|--- ..+..... ....++..|-..|+..+-...||+..+ +|-+ ...|. -.+. T Consensus 334 ~~~~~~g~~~~~~gd~s-----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a-------~~~l~-~~~~ 399 (400) T protein:vir:38 334 DTLGAAGEAHAFLGDIK-----RAILFANR-ADFMVRWVDDQIYGQFLQAGMRFGVSVADEKA-------GYFLT-YTPK 399 (400) T ss_pred cccCCCCceEEEEEecc-----ccEEEEee-cceEEEEecccccceeEEEEEEeccEEecccc-------eEEEE-eecC Confidence 5531 22322200 00001101 122223346667777888888998654 2211 11111 1111 Q ss_pred hh Q lcl|Aclame:pro 501 IV 502 (519) Q Consensus 501 ~a 502 (519) | T Consensus 400 -a 400 (400) T protein:vir:38 400 -A 400 (400) T ss_pred -C Confidence 1 No 61 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=83.61 E-value=0.067 Score=27.02 Aligned_cols=357 Identities=17% Similarity=0.112 Sum_probs=133.5 Q ss_pred CChHHHHHhhhhhhCC---Cccc-------cccccchhhhhhhhhhhHHH--HHhh-----hhhccchh-----hhhh-- Q lcl|Aclame:pro 1 MKKNALVQKWSALLEN---EALP-------EIVGASKQAIIAKIFENQEQ--DILT-----APEYRDEK-----ISEA-- 56 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~---~~~~-------~~~~~~~~~~~~~~~enq~~--~~~~-----~~~~~~~~-----~~~~-- 56 (519) --.+.+.+|++...+. +..- ||.+ +-++|-+.+++ .+.+ ....+... ..+. T Consensus 13 el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~-----l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (415) T protein:vir:94 13 DIKRQIDLKVKYATRALNNDELEKAEKLEQEITD-----LRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQA 87 (415) T ss_pred HHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHH Confidence 1122334444332211 1100 1111 11111111000 0000 00000000 0000 Q ss_pred ----hh-----hhhhhhhhcc--c---cccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhh Q lcl|Aclame:pro 57 ----FG-----SFLTEAEIGG--D---HGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQV 122 (519) Q Consensus 57 ----~~-----~~~~~~~~~~--~---~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLI 122 (519) .. ..+...+... + .+........++++|...--....-.+++.+-+..+-.+++.|+||++.++-+ T Consensus 88 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) T protein:vir:94 88 NINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) T ss_pred HHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeE Confidence 00 0000000000 0 00011111111222222211112223555556677889999999998766432 Q ss_pred eeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCC Q lcl|Aclame:pro 123 FALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATD 202 (519) Q Consensus 123 FAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~ 202 (519) --.+ ..... + ..| T Consensus 168 ~~~~--~~~~~------~---------~~~-------------------------------------------------- 180 (415) T protein:vir:94 168 PVVR--QSEVA------A---------LEK-------------------------------------------------- 180 (415) T ss_pred EEEe--ecCCc------c---------cee-------------------------------------------------- Confidence 2111 10000 0 000 Q ss_pred ccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCC Q lcl|Aclame:pro 203 AAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGM 282 (519) Q Consensus 203 ~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGL 282 (519) +++|- +. ...+...|.+..|++.|.. -.-.+|-||.+|-- + T Consensus 181 -------------------v~Eg~-----~~----~~~~~~~~~~i~~~~~k~~-------~~~~is~ell~ds~----~ 221 (415) T protein:vir:94 181 -------------------VEELE-----EN----PELAVKPFFQLAYDINTHR-------GYFRISREAIEDAK----V 221 (415) T ss_pred -------------------ccccc-----cc----cccccccceeeEeeheeee-------eechhhHHHHhhch----H Confidence 00000 00 0001123445555555554 44569999999864 4 Q ss_pred CHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccccccccccee-eeccccccccccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 283 DADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGV-FDFQDPIDIRGARWAGESFKALLFQIDKEAAE 361 (519) Q Consensus 283 DAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~-fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~ 361 (519) |.+++|.+-|...|..-+|+.|+.-.-...-.+. ...+. ..++ ...... -..+....++.. T Consensus 222 ~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~--~~~~~--~~~~~~~~~~~-------~~~~~i~~~~~~------- 283 (415) T protein:vir:94 222 NVLQELKLWMARTIAATRNKAIIDVITKGSTGST--SSGFE--KEGKKLEVKKA-------KSLDDIKDAINL------- 283 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccCccccc--ccccc--ccccccccccc-------cchHHHHHHHHh------- Confidence 6799999999999999999999874322111110 00000 0000 000000 011222333332 Q ss_pred HHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc----c-eEEEEE Q lcl|Aclame:pro 362 IARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS----D-YFTIGY 436 (519) Q Consensus 362 I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~----d-y~~vG~ 436 (519) +.. ..+ +++.+|++|.....|...- +.. ++..+..+.+.. ..++|.| ++|++.+..+. + -+++|- T Consensus 284 ~~~-~~~-~~~~~vmn~~~~~~l~~lk--d~~----G~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~~~~i~~gd 353 (415) T protein:vir:94 284 NVK-PNY-EHNVAIVSQTMFAKLDKMK--DKL----GNYLIQPDVKEK-TQQRLLG-AKIEILPDEVLGQKGNNTLIIGN 353 (415) T ss_pred hhh-hcc-CCCEEEEcHHHHHHHHHhh--ccC----CCeeeccCcCCC-CCceecc-eeeEEecccccCCCCccEEEEEe Confidence 222 222 4578999999999887531 100 011111222111 1246777 58888776552 1 123331 Q ss_pred ecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecC-Cchhhhcccchh Q lcl|Aclame:pro 437 KGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-MPDIVNSLGLNG 509 (519) Q Consensus 437 KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~d~~a~~~~~~~ 509 (519) -.. . +..... ....+...|-.+++-.+-...|+++.+ +|-+ ...+.-. ...-.|..+.-. T Consensus 354 ~~~----~-~~~~~~-~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a-------~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 354 LKD----A-IVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKS-------AIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred hhc----c-EEEEee-cceEEEEeccccCceEEEEEEEeccEEecccc-------EEEEEEeccCCCCCccccCC Confidence 000 0 000000 111222235566777777888988654 2221 1111100 000011111111 No 62 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=83.58 E-value=0.067 Score=27.01 Aligned_cols=278 Identities=9% Similarity=-0.006 Sum_probs=122.4 Q ss_pred ccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccccccccc Q lcl|Aclame:pro 67 GGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMY 146 (519) Q Consensus 67 ~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fn 146 (519) -.-.++++.+...++ +++..--....-.+++.+.+.-+-..++-+.||++++...+-... .. .+| T Consensus 1 m~~~~~~~~~~~~t~-~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--~~-------~~a----- 65 (297) T protein:vir:95 1 MTVQTFNPENVLVSQ-KKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQT--DG-------ISA----- 65 (297) T ss_pred CCccccccccccccC-CCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEc--CC-------cee----- Confidence 122334444443222 222111111112344555556677788999999887655432110 00 000 Q ss_pred ccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceeccccc Q lcl|Aclame:pro 147 APNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGM 226 (519) Q Consensus 147 Eadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~Gm 226 (519) .| +++| T Consensus 66 ----~~---------------------------------------------------------------------v~Eg- 71 (297) T protein:vir:95 66 ----YW---------------------------------------------------------------------VNET- 71 (297) T ss_pred ----EE---------------------------------------------------------------------eecC- Confidence 00 0111 Q ss_pred chhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 227 ATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVID 306 (519) Q Consensus 227 sTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~ 306 (519) ..+++-..++++++...|..+-.-.+|.||.+|-. .|.+..|.+.|+..|...+++.+|. T Consensus 72 ----------------~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~ 131 (297) T protein:vir:95 72 ----------------EKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLL 131 (297) T ss_pred ----------------ccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhc Confidence 11233333445666666666667779999999864 4679999999999999999999984 Q ss_pred HHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHh Q lcl|Aclame:pro 307 WINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAA 386 (519) Q Consensus 307 ~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~ 386 (519) |. ....+.|++........ ... ..-.+.-|.++...|...- ...+.+||+|+....|.. T Consensus 132 --------G~-----g~~~~~gi~~~~~~~~~----~~~--~~~t~~~i~~~~~~l~~~~--~~~~~~v~~~~~~~~L~~ 190 (297) T protein:vir:95 132 --------GH-----DTPFANSVAKAAKDANK----VIG--GPINYDNILKLQDALYDAD--VEPNAFVSKIQNRSALRE 190 (297) T ss_pred --------cc-----CCcccccccccccccce----ecc--cccCHHHHHHHHHHhhhcc--CCcCEEEEcHHHHHHHHH Confidence 10 00112233322111110 000 0111233445555555432 234678999999988874 Q ss_pred cCcccccccccccccccccCCCceEEEEecCcEEEEecCCCc--cc--------eEEEEEecCCCccceeEeeccccccc Q lcl|Aclame:pro 387 VDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYAR--SD--------YFTIGYKGSNEMDAGIYYAPYVALTP 456 (519) Q Consensus 387 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~d--------y~~vG~KG~~~~~~~~fyaPYv~~~~ 456 (519) .- +.. +...+. .. .++|.| ++|++-+... .. ++++|..+.-+.+- .. +... T Consensus 191 l~--d~~----G~~i~~--~~----~~~l~G-~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~----~~--~~~~ 251 (297) T protein:vir:95 191 AR--DGN----KVSIYD--KA----ANTIDG-ITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKI----SE--EGQI 251 (297) T ss_pred hh--ccC----Cceeec--CC----CCcccc-eeeEeecCCCCCCceEEEEecccEEEEEecCeEEEE----ee--cccc Confidence 21 100 011111 11 145665 5776544432 12 22233332211110 00 0000 Q ss_pred ccccCcc-----ccc-ceee--eeeeeceee-cCcccccccCCcceeecCCch Q lcl|Aclame:pro 457 LRGSDPK-----NFQ-PVMG--FKTRYGIGI-NPFADPAAQAPTKRIQNGMPD 500 (519) Q Consensus 457 ~~~~dp~-----s~q-P~~g--~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~ 500 (519) ....|+. -|| =.++ ...|++..+ ||-+ -+++....+- T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a-------~~~l~~at~~ 297 (297) T protein:vir:95 252 STITNADGTPINLFEQEMIAIRATMDIAVMITKTDA-------FAKLTPAERV 297 (297) T ss_pred ccccccCccchhhhhcCcEEEEEEEEeccEeecccc-------eEEEeecCCC Confidence 0111111 122 1122 335676554 2221 1223222221 No 63 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=82.43 E-value=0.077 Score=26.69 Aligned_cols=292 Identities=13% Similarity=0.080 Sum_probs=114.6 Q ss_pred hccchhhhhhhhhhhhhhhhccccccchhh--hcccccccccc-ccCceeh-hhHHHHHhhhhhhhceeeccCCccchhh Q lcl|Aclame:pro 47 EYRDEKISEAFGSFLTEAEIGGDHGYDATN--IAAGQTSGAVT-QIGPAVM-GMVRRAIPHLIAFDICGVQPLNNPTGQV 122 (519) Q Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~--~~est~tg~v~-~~~P~L~-~l~Rra~p~LIa~DI~GVQPmTGPTGLI 122 (519) ..+ ..+++... +++.+ ++... ..-|.+. .+++.+....+-.+++-+.||++.+.-| T Consensus 1 ~~~-------------------~~~~~~~~~~~~~t~-~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~ 60 (320) T protein:vir:10 1 MAA-------------------GTAFQVDHAQIAQTG-DTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKI 60 (320) T ss_pred CCC-------------------CccCCHHHHHhhccc-cccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEE Confidence 010 01111111 11111 11111 1122222 2334444456677888888887654221 Q ss_pred eeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCC Q lcl|Aclame:pro 123 FALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATD 202 (519) Q Consensus 123 FAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~ 202 (519) . +.... .+| .| T Consensus 61 p----~~~~~------~~a---------~~-------------------------------------------------- 71 (320) T protein:vir:10 61 P----HWIGD------VSA---------QW-------------------------------------------------- 71 (320) T ss_pred E----EEeCC------cce---------EE-------------------------------------------------- Confidence 1 11000 000 00 Q ss_pred ccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCC Q lcl|Aclame:pro 203 AAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGM 282 (519) Q Consensus 203 ~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGL 282 (519) +++| ..+++-..++++++...|..+-...+|.||.+|-. . T Consensus 72 -------------------v~E~-----------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~ 111 (320) T protein:vir:10 72 -------------------IGEG-----------------DMKPITKGNMTSQNIAPHKIATIFVASAETVRANP----A 111 (320) T ss_pred -------------------ecCC-----------------ccccccccceeEEEEeeEEEEEeehhhHHHHhcCh----H Confidence 0010 11222233345666666777777789999999854 5 Q ss_pred CHHHHHHHHHHHHHHHHhhHHHHH-HHHhhhhhhhhcccc-cccccceeeeccccccccccchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 283 DADAELSGILATEIMLEINREVID-WINYSAQVGKSGMTN-TVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAA 360 (519) Q Consensus 283 DAEaELsNILSTEImlEINReii~-~i~~~a~~~~~~~t~-~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~ 360 (519) |.|+.|.+.|...|...||+.|+. .=..... +-.+..+ .+....+..... .-++.+ .+ +..+.. T Consensus 112 ~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~-~~~~~~~~~~~~~~~~~~~~-------~~~~~~---~~---~~~~~~ 177 (320) T protein:vir:10 112 NYLGTMRTKVATAFAMAFDSAALNGTDSPFPT-YLAQTTKSVSLADPGGATAS-------DLTAYD---AV---AVNGLS 177 (320) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccCCCCCc-ccccccccccceeccccccc-------ccccHH---HH---HHHHHh Confidence 678999999999999999988874 1000000 0000000 000001110000 001111 11 112222 Q ss_pred HHHhhccccCCCEEEEchHHHHHHHhcC----cccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEE Q lcl|Aclame:pro 361 EIARQTGRGAGNFIIASRNVVNVLAAVD----TSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGY 436 (519) Q Consensus 361 ~I~~~T~rg~gn~~v~S~~va~~L~~~g----~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 436 (519) .+. ..+.....+||+|.....|...- ...+.+. ...+......-+++.| ++|+++++.+.+=.. ++ T Consensus 178 ~~~--~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~------~~~~~~~~~~~~~i~g-~pv~~~~~~~~~~~~-~~ 247 (320) T protein:vir:10 178 LLV--NAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIES------TYTDENSPFRAGRIVS-RPTILSDHVADGTTV-GY 247 (320) T ss_pred hhh--cccCCCcEEEEcHHHHHHHHHhhccCCceeeccc------cccCccccccCceeee-eeeEecCCCCCCceE-EE Confidence 222 22334578999999999987421 1111110 0111112222345555 799999887753111 11 Q ss_pred ecCCCccceeEeecccccccc--------cccCccc-----cc---ceeeeeeeeceee-cCcccccccCCcceeecC-C Q lcl|Aclame:pro 437 KGSNEMDAGIYYAPYVALTPL--------RGSDPKN-----FQ---PVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG-M 498 (519) Q Consensus 437 KG~~~~~~~~fyaPYv~~~~~--------~~~dp~s-----~q---P~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~-~ 498 (519) -|+-. .+++.-+-..... ...|+.. || =.+=...|+++.+ +| +...+|..- - T Consensus 248 ~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~-------~a~~~l~~~~a 317 (320) T protein:vir:10 248 MGDFR---NVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDK-------DAFVKLTNVVT 317 (320) T ss_pred Eeecc---eEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecc-------cceEEEEeccC Confidence 11111 1112211111110 0111111 11 1122335666543 11 112233211 1 Q ss_pred chhh Q lcl|Aclame:pro 499 PDIV 502 (519) Q Consensus 499 d~~a 502 (519) |. | T Consensus 318 p~-~ 320 (320) T protein:vir:10 318 PD-A 320 (320) T ss_pred CC-C Confidence 11 1 No 64 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=82.41 E-value=0.077 Score=26.69 Aligned_cols=336 Identities=12% Similarity=0.105 Sum_probs=121.8 Q ss_pred CChHHH---HHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhh-----------------------------h-- Q lcl|Aclame:pro 1 MKKNAL---VQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTA-----------------------------P-- 46 (519) Q Consensus 1 ~~~~~l---~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~-----------------------------~-- 46 (519) +..+++ .++|.-+-.. |.. .+.. +++| |.+++...+. + T Consensus 30 ~~~ee~~~~~~e~~~l~~~-----~~~-l~~~-i~~l-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 101 (434) T protein:vir:62 30 VRSEELAAVKAEVEQLTKE-----IQT-ISEE-LAKL-EEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSA 101 (434) T ss_pred ccHHHHHHHHHHHHHHHHH-----HHH-HHHH-HHHH-HHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHH Confidence 222221 1222222110 100 0000 0000 1111100000 0 Q ss_pred ------------h---ccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceeh--hhHHHHHhhhhhhhc Q lcl|Aclame:pro 47 ------------E---YRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVM--GMVRRAIPHLIAFDI 109 (519) Q Consensus 47 ------------~---~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~--~l~Rra~p~LIa~DI 109 (519) . ........++..+| .+..... ..-+-++++++-.-.=|.-+ .+++..-+..+...+ T Consensus 102 ~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l-----~~~~~~~-e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~ 175 (434) T protein:vir:62 102 ISASIAAALSTKGHRTNKETEIRSVFANYI-----VGNIDEK-EARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRL 175 (434) T ss_pred HHHHHHhhhhhccccchHHHHHHHHHHHHh-----ccccchh-hhhhhcccccccceecchhhHHHHHHhhhhhhhhhhh Confidence 0 00000001111111 0000000 00011122221000012221 244445556666777 Q ss_pred eeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 110 CGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQ 189 (519) Q Consensus 110 ~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~ 189 (519) +-|.|+++..- | .+....+. +.+ T Consensus 176 ~~~~~~~~~~~--~---p~~~~~~~---------------a~~------------------------------------- 198 (434) T protein:vir:62 176 GTGVKTKENIK--Y---PVLVKKAE---------------AQG------------------------------------- 198 (434) T ss_pred cceeccCCceE--E---EEEecCCc---------------ccc------------------------------------- Confidence 77777654210 0 00000000 000 Q ss_pred ccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeeccccccccc Q lcl|Aclame:pro 190 AVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYS 269 (519) Q Consensus 190 ~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT 269 (519) ..... | +...++-..++++++..+|.-+-...+| T Consensus 199 -----------------------------~~~~~--------e---------~~~~~~~~~~f~~v~~~~~k~~~~~~iS 232 (434) T protein:vir:62 199 -----------------------------HKNER--------T---------NNEMPETDIEFDEIELSPTEFDALATVT 232 (434) T ss_pred -----------------------------eeccc--------c---------cccccccccceeeEEeeheeeEeehhhH Confidence 00000 0 1111222234556677777777778899 Q ss_pred HHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHH Q lcl|Aclame:pro 270 IELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFK 349 (519) Q Consensus 270 mELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r 349 (519) -||.+|- .+|.+++|.+-|+..|..-+++.||. |. .++..+.|++.-... ... ... . T Consensus 233 ~ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~--------G~----G~~~~~~g~~~~~~~-~~~-~~~-----~ 289 (434) T protein:vir:62 233 KKLLART----GLPIEQIVMDELKKAYVRKETQYMVN--------GD----EANNINDGALAKKAV-EFK-TDE-----K 289 (434) T ss_pred HHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhc--------cC----CCCccccceeecccc-ccc-ccc-----c Confidence 9999995 46779999999999999999999885 10 011112233211000 000 000 0 Q ss_pred HHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccc-----cCCCceEEEEecCcEEEEec Q lcl|Aclame:pro 350 ALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNV-----DTTKAVFAGVLGGKYRVYID 424 (519) Q Consensus 350 ~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~-----d~~~~~~~G~l~~~~~vy~D 424 (519) ..+..|-+|-..+...-+ +.-..|++|.....|...= +.. ++-.+.. +.++ .+|.| ++|+++ T Consensus 290 ~~~d~l~~l~~~l~~~~~--~~a~~v~n~~~~~~L~~lk--d~~----G~~l~~~~~~~~~g~~----~tl~G-~pV~~~ 356 (434) T protein:vir:62 290 NLYDALVKMKNTPVKEVR--KKARWVLNTAALTKIETMK--TDD----GFPLLRPFNQAEGGIG----YTLLG-FPVEEE 356 (434) T ss_pred chhhHHHHHHhhcchhhh--cCCEEEEcHHHHHHHHHhh--ccC----CCEeeccCCCccCCCC----ceecc-eeeEEe Confidence 112222233333332222 3345688999988886421 110 0111111 1112 25777 699888 Q ss_pred CCCccceEEEEEecCCCccceeEe---ecccc------cccccccCc--ccccceeeeeeeecee-e-cCcccccccCCc Q lcl|Aclame:pro 425 QYARSDYFTIGYKGSNEMDAGIYY---APYVA------LTPLRGSDP--KNFQPVMGFKTRYGIG-I-NPFADPAAQAPT 491 (519) Q Consensus 425 ~y~~~dy~~vG~KG~~~~~~~~fy---aPYv~------~~~~~~~dp--~s~qP~~g~~tRY~l~-~-nP~~~~~~~~~~ 491 (519) .+.+.. -.|.. .-++| +-|.- ....+..+. .+-|=.+..+.|++-. + .|++-. T Consensus 357 ~~~~~~-----~~~~~---~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~------ 422 (434) T protein:vir:62 357 DAIDIP-----DSPDT---PVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVP------ 422 (434) T ss_pred cCccCc-----cCCCc---eEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccce------ Confidence 776521 01100 01111 11111 111111222 2223335566777533 4 388642 Q ss_pred ceeecCCchhhhccc Q lcl|Aclame:pro 492 KRIQNGMPDIVNSLG 506 (519) Q Consensus 492 ~~i~~~~d~~a~~~~ 506 (519) |+...-..| ..+ T Consensus 423 --~~~~~~~~~-~~~ 434 (434) T protein:vir:62 423 --VYKYVLKAP-TGA 434 (434) T ss_pred --EEEEEeccC-CCC Confidence 221211111 122 No 65 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=81.29 E-value=0.087 Score=26.40 Aligned_cols=333 Identities=10% Similarity=0.012 Sum_probs=124.2 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhc-----------cchhhh--------------- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEY-----------RDEKIS--------------- 54 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~-----------~~~~~~--------------- 54 (519) |...++.++=.-+++.-. +......+. +..+.|.-++++..+... +.+.+. T Consensus 1 m~~~e~~~~~~~~~~~l~--~~~~~~~~e-~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~ 77 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVD--SKSSAQALE-VKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDK 77 (379) T ss_pred CCHHHHHHHHHHHHHHHH--HHHHHHHHH-HHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 666666655555443210 000000000 011111111111100000 000000 Q ss_pred -----hhhhh---hhhhhhhccccccchhhhccccccccccc-----cCceehhhHHHHHhhhhhhhceeeccCCccchh Q lcl|Aclame:pro 55 -----EAFGS---FLTEAEIGGDHGYDATNIAAGQTSGAVTQ-----IGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQ 121 (519) Q Consensus 55 -----~~~~~---~~~~~~~~~~~g~~~~~~~est~tg~v~~-----~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGL 121 (519) ++... ...+.... +......-+..+++++... +.+-++-+. -....-.++|.|.||++++.- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~---~~~~~i~~~~~~~~~~~~~~~ 152 (379) T protein:vir:10 78 SDSLVKSITENFNDIKEVRNG--KSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNP---SQMLNVSDIVGAVSISGGTYT 152 (379) T ss_pred chhHHHHHHHHHHhHHHHHhh--hhhhhhhhcccccCCCCccccchhhhhHHHHhH---HhhhhHHhhceeeeccCCceE Confidence 00000 00000000 0000000011111222111 222222333 334456677777777766321 Q ss_pred heeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCC Q lcl|Aclame:pro 122 VFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGAT 201 (519) Q Consensus 122 IFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t 201 (519) |.-..+ +++ T Consensus 153 -------~~~~~~-----------------~~~----------------------------------------------- 161 (379) T protein:vir:10 153 -------FVRENG-----------------AGE----------------------------------------------- 161 (379) T ss_pred -------EEEeec-----------------CCC----------------------------------------------- Confidence 110000 000 Q ss_pred CccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC Q lcl|Aclame:pro 202 DAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHG 281 (519) Q Consensus 202 ~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHG 281 (519) .. ..- .+| +...+++..++++++..+|.=+-...+|-||.||--. T Consensus 162 -~~--------------~~~--------v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~--- 206 (379) T protein:vir:10 162 -GA--------------IGA--------QVE---------GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPF--- 206 (379) T ss_pred -cc--------------ccc--------ccC---------CccccccccceeeeEeeeeeEEeeehhhHHHHhhHHH--- Confidence 00 000 011 1223444445555555555555557899999999632 Q ss_pred CCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 282 MDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAE 361 (519) Q Consensus 282 LDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~ 361 (519) .++.|.+-|+..|..-+|..++.-+...+..+.. +..+ ...++..+.++.++. T Consensus 207 --l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~----------~~~~----------~~~~d~i~~~~~~~~----- 259 (379) T protein:vir:10 207 --LTSFIPNALRRDYAKAENAAFNAVLAANATASTE----------IITN----------KNKVEMLINEIAKQE----- 259 (379) T ss_pred --HHHHHHHHHHHHHHHHHHHHHhcccccccccccc----------cccC----------cccHHHHHHHHHhhh----- Confidence 5888999999999988888887644322211111 1100 012233333333332 Q ss_pred HHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCce-EEEEecCcEEEEecCCCccceEEEEEecCC Q lcl|Aclame:pro 362 IARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAV-FAGVLGGKYRVYIDQYARSDYFTIGYKGSN 440 (519) Q Consensus 362 I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~-~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~ 440 (519) . .+-..+.+|++|.....|...= +.. ++..+..+.+.+. -.-+|.| ++|+++++.+..-+++|=-.. T Consensus 260 --~--~~~~~~~~vmn~~~~~~l~~lk--d~~----G~~l~~~~~~~~~~~~~~l~G-~pvv~s~~~~ag~~~~gdf~~- 327 (379) T protein:vir:10 260 --N--LDFPVTAIVLRPTDYYDILVTQ--KSV----GAGYGLPGVVTQDNGVLRING-IPLFRATWLAANKYYVGDWTR- 327 (379) T ss_pred --h--ccCCCCEEEEcHHHHHHHHHhh--ccC----CceeccCCccCCCCCcceecc-eeeEecCCCCCCceEEeeccc- Confidence 1 2225577899999888776431 100 0111111110000 0014665 799999998865444432110 Q ss_pred CccceeEeecccccccccc-cC----cccccceeeeeeeeceee-cCcccccccCCcceeecCCchh Q lcl|Aclame:pro 441 EMDAGIYYAPYVALTPLRG-SD----PKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDI 501 (519) Q Consensus 441 ~~~~~~fyaPYv~~~~~~~-~d----p~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~ 501 (519) .-+++- ....+.. .+ -.+-+=.+=+..|+|+.+ +| .+-+.--...+ T Consensus 328 ---~~~~~~---~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p---------~a~v~~~~~~~ 379 (379) T protein:vir:10 328 ---VTKVTT---EGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQP---------AALIFGDFTAV 379 (379) T ss_pred ---EEEEEE---eceEEEEeecccccccCCcEEEEEEEEeccEEecC---------ccEEEEEecCC Confidence 111111 1111100 11 112222233345776544 23 22121101111 No 66 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=80.74 E-value=0.092 Score=26.26 Aligned_cols=333 Identities=10% Similarity=0.039 Sum_probs=122.9 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhh--hhhHHHHHhh-----hhhccc--hhhh--hhhhhhh---hhhhh Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKI--FENQEQDILT-----APEYRD--EKIS--EAFGSFL---TEAEI 66 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~--~enq~~~~~~-----~~~~~~--~~~~--~~~~~~~---~~~~~ 66 (519) ++. ...++-.-+... + +.+-++| +|.+...+.. ...-+. +... +.+..++ .+... T Consensus 32 ~~~-e~~~~~~~l~~e-----~-----~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (390) T protein:vir:81 32 LNA-SARSKVDELFAT-----V-----GNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSA 100 (390) T ss_pred cCH-HHHHHHHHHHHH-----H-----HHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhh Confidence 111 111111111100 1 0111111 1111000000 000000 0000 0000000 00000 Q ss_pred ccccccchh---hhccccccccccccCc-eehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccccc Q lcl|Aclame:pro 67 GGDHGYDAT---NIAAGQTSGAVTQIGP-AVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAF 142 (519) Q Consensus 67 ~~~~g~~~~---~~~est~tg~v~~~~P-~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~ 142 (519) ....-..+. .....++++.. -..| ..-.++++.-+..+-.++|.+.||++++.- |....... ..+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~-------~~~~~~~~--~~a- 169 (390) T protein:vir:81 101 RATMNIKAALNTASTDAAGSAGA-LTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIE-------YVQETGFV--NNA- 169 (390) T ss_pred hhhhHHHHHHHhhccccccCCcc-eechhhhHHHHHHHhhhhhhhhhcceeeccCCceE-------EEEEecCC--cce- Confidence 000000000 00001111111 1111 112244444556677889999999877631 11110000 000 Q ss_pred ccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceec Q lcl|Aclame:pro 143 HPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEI 222 (519) Q Consensus 143 ~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~ 222 (519) .-+ T Consensus 170 -----------------------------------------------------------------------------~~v 172 (390) T protein:vir:81 170 -----------------------------------------------------------------------------AIV 172 (390) T ss_pred -----------------------------------------------------------------------------eee Confidence 000 Q ss_pred ccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 223 AEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINR 302 (519) Q Consensus 223 ~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINR 302 (519) ++|-. . ..++..|.++.+++.|.. -...+|-||.+|- . +.++.|.+-|+..|...+|+ T Consensus 173 ~Eg~~------~----~~~~~~~~~i~~~~~k~~-------~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d~ 230 (390) T protein:vir:81 173 AEGAL------K----PESSLKFAKKTDTTHVIA-------HTMKATRQILSDA--P---QLASYMNNRLIRGLKVKEDA 230 (390) T ss_pred cCCcc------c----ccccceeeEEEEeeeEEE-------EeehhhHHHHHhH--H---HHHHHHHHHHHHHHHHHHHH Confidence 01000 0 001223445555555544 4556899999984 2 46889999999999999998 Q ss_pred HHHHHHHhhhhhhhhcccccccccceeeeccccccc---cccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 303 EVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI---RGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 303 eii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~---~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) .||. | - -++..+.|++........ .......+....++.++. ..+...+.+|++|. T Consensus 231 a~l~--------G---~-g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~v~~~~ 289 (390) T protein:vir:81 231 EILR--------G---T-GANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQAS---------LAEYNPSGIVINPI 289 (390) T ss_pred HHHh--------c---C-CCCCcccceeecccccccccccccchhHHHHHHHHHhhc---------cccCCCCEEEEcHH Confidence 8874 1 0 011224565543221110 111122333333333222 22335578899999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccc Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRG 459 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~ 459 (519) ....|...= +.. +...+ .+.... -.++|.| ++|++.+..|.+-+++|---. ..+.. ......+.. T Consensus 290 ~~~~l~~lk--d~~----G~~l~-~~~~~~-~~~~l~G-~pv~~~~~~p~~~~~~gd~~~-----~~~~~-~~~~~~v~~ 354 (390) T protein:vir:81 290 DWAAIELAK--DAN----NQYLI-GNARGT-LTPTLWG-LPVVATQAMAPGEFLVGAFDL-----AAQIF-DQWDARVEI 354 (390) T ss_pred HHHHHHHhh--cCC----Cceee-cCcccc-cCceecc-eeeEEcCCCCCCcEEEEehhc-----eEEEE-EecceEEEE Confidence 988886431 100 00011 111110 1146766 699999998876555543210 00100 000111111 Q ss_pred cC-c---ccccceeeeeeeecee-ecCcccccccCCcceeecC Q lcl|Aclame:pro 460 SD-P---KNFQPVMGFKTRYGIG-INPFADPAAQAPTKRIQNG 497 (519) Q Consensus 460 ~d-p---~s~qP~~g~~tRY~l~-~nP~~~~~~~~~~~~i~~~ 497 (519) .+ + .+-+=.+=...|++.. .+|-+ ..++.=+ T Consensus 355 ~~~~~~~~~~~v~~r~~~r~d~~v~~~~a-------~v~~t~a 390 (390) T protein:vir:81 355 GYVGEDFQRNMITVLAEERLALVVYRPEA-------LISGSFA 390 (390) T ss_pred ecccchhhcCcEEEEEEEeeccEEecccc-------eEEEEeC Confidence 11 1 1122233456677653 22222 2233222 No 67 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=80.62 E-value=0.093 Score=26.23 Aligned_cols=299 Identities=10% Similarity=0.009 Sum_probs=109.9 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccccccccccccc-ceecccccchh Q lcl|Aclame:pro 151 MFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQ-LAEIAEGMATS 229 (519) Q Consensus 151 ~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~-~~~~~~GmsTa 229 (519) +.+-.. ...... .+... +....... ... .....-..+. ............+. .+..-.+ T Consensus 1 m~~~~~-~a~~~~---~t~~~-g~~i~~~~--~~~-ii~~~~~~s~--------l~~~~~~~~~~~~~~~~p~~~~---- 60 (330) T protein:vir:77 1 MAGSTV-PSTQVA---LTGDF-SAFLTPEQ--SQD-YFAEIEKTSI--------VQRIARKVPMGPTGISIPHWTG---- 60 (330) T ss_pred Cccccc-chhhcc---ccCCC-cceechhH--HHH-HHHHHHhccc--------hhhhcceeeccCCceEEEEEcC---- Confidence 111000 000000 00000 00000000 000 0000000000 00000000000000 0010001 Q ss_pred hhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHH Q lcl|Aclame:pro 230 IAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWIN 309 (519) Q Consensus 230 ~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~ 309 (519) ..++. +- .-+..+++-..++++++...|..+-+..+|-||.+|- ..|.|++|.+-|+..|...||+.||. T Consensus 61 ~~~a~--~v-~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai~~~~~~~~l~--- 130 (330) T protein:vir:77 61 AVSAS--WT-GEAERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAIALKFDAAAIH--- 130 (330) T ss_pred Cccee--Ee-cCCCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhc--- Confidence 01110 00 1135667777888999999998888899999999983 57889999999999999999998884 Q ss_pred hhhhhhhhcccccccccceeeecc----ccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHH Q lcl|Aclame:pro 310 YSAQVGKSGMTNTVGAKAGVFDFQ----DPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLA 385 (519) Q Consensus 310 ~~a~~~~~~~t~~~~~~~G~fDl~----~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~ 385 (519) |. .....+.|++... ...+......+ .....++..+.++-..+.+. ....+.+||+|.....|. T Consensus 131 -----G~----g~~~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~--~~~~~~~vmn~~~~~~l~ 198 (330) T protein:vir:77 131 -----GI----DKPSAFKGYLAETTKVVSLADTNLTTAS-GPQGNAYLAVNNALSLLVNS--GKKWTGTLLDNVTEPILN 198 (330) T ss_pred -----cc----CCCCccccccccccccceeecccccccc-cccchhHHHHHHHHHhhhhc--CCCccEEEEcHHHHHHHH Confidence 00 0000111111100 00000000000 11122344444554555443 234567899999998887 Q ss_pred hc----CcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc--------------ceEEEEEecCCCc----c Q lcl|Aclame:pro 386 AV----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS--------------DYFTIGYKGSNEM----D 443 (519) Q Consensus 386 ~~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--------------dy~~vG~KG~~~~----~ 443 (519) .. |...+.+ ....+......-++|.| ++||++.+.+. .++++|-.+..+. + T Consensus 199 ~lkd~~G~~l~~~------~~~~~~~~~~~~~~l~G-~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e 271 (330) T protein:vir:77 199 TAVDGNGRPLFVE------STYTEQVGAIREGRILG-RPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQ 271 (330) T ss_pred HHhccCCceeecC------ccccccccccCCceecc-eeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeec Confidence 42 1111110 00011111112245666 79999988652 1223443332221 1 Q ss_pred ceeEeecccccccccccCcccc---cceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccchh Q lcl|Aclame:pro 444 AGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGLNG 509 (519) Q Consensus 444 ~~~fyaPYv~~~~~~~~dp~s~---qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~ 509 (519) +.+.+.- .........+-+-| +=.+=...|++..+ +|=+ ..+|...- ++.-..-- T Consensus 272 ~~~~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~i~~~~---~~~~~~~~ 330 (330) T protein:vir:77 272 ATLDFGE-EQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDA-------FVKLTDQV---AGTDPEEE 330 (330) T ss_pred ceeeecc-cccccccccccchhhcCcEEEEEEEEeccEEecccc-------eEEEEecc---CCcCCCCC Confidence 1111100 00000000000111 11222334555443 2211 12221110 00000000 No 68 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=80.03 E-value=0.099 Score=26.10 Aligned_cols=331 Identities=14% Similarity=0.097 Sum_probs=125.7 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhh--hhhhhHHHHH-------hhhhhccch--hhhhhhhhhhh------- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIA--KIFENQEQDI-------LTAPEYRDE--KISEAFGSFLT------- 62 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~--~~~enq~~~~-------~~~~~~~~~--~~~~~~~~~~~------- 62 (519) ++.+ -.++|.-+... |.+. ++.+-. .-++..++.. .....-..+ ...+++..++. T Consensus 31 ~~~~-~~~~~~~l~~e-----ie~l-~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (394) T protein:vir:97 31 LESD-DLEAARSIKAE-----VEQA-KANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVN 103 (394) T ss_pred hchh-hHHHHHHHHHH-----HHHH-HHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhh Confidence 3322 23444443311 1111 111100 0000000000 000000000 00000000000 Q ss_pred ---------hhhhccccccchhhhcccccc--ccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecC Q lcl|Aclame:pro 63 ---------EAEIGGDHGYDATNIAAGQTS--GAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGK 131 (519) Q Consensus 63 ---------~~~~~~~~g~~~~~~~est~t--g~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~ 131 (519) +..............+.+.++ |.+.--....-.+++.+-+......++.|.||+++++-+--++ T Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----- 178 (394) T protein:vir:97 104 DSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQ----- 178 (394) T ss_pred hhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEe----- Confidence 000000111111111111111 1111111122234555556667788899999988764331111 Q ss_pred CCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccc Q lcl|Aclame:pro 132 DPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVT 211 (519) Q Consensus 132 ~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~ 211 (519) .... . . T Consensus 179 ~~~~----~-~--------------------------------------------------------------------- 184 (394) T protein:vir:97 179 RATT----K-M--------------------------------------------------------------------- 184 (394) T ss_pred cCCC----c-c--------------------------------------------------------------------- Confidence 0000 0 0 Q ss_pred cccccccceecccccchhhhhhcccCCCCCccccccc-eeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHH Q lcl|Aclame:pro 212 ALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEM-GFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSG 290 (519) Q Consensus 212 ~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EM-sFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsN 290 (519) .-+++|- ..++. ...++++++.++.-+-...+|-||++|- +.|.+++|.+ T Consensus 185 --------~~v~E~~-----------------~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~ 235 (394) T protein:vir:97 185 --------VTVAELE-----------------KNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSE 235 (394) T ss_pred --------ceecccc-----------------cccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHH Confidence 0001110 01111 1345566677777777788999999986 3467888888 Q ss_pred HHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccC Q lcl|Aclame:pro 291 ILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGA 370 (519) Q Consensus 291 ILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~ 370 (519) -|+..|..-+|..||.-+.. ..+.+...++ ....++... ....+. T Consensus 236 ~la~~~~~~~~~~i~~g~~~-------------~~~~~~~~~~-------------~~~~~~~~~--------~~~~~~- 280 (394) T protein:vir:97 236 SISQIKVNTTNDAIAKVLKS-------------FTTKTVKNLD-------------EIKALLNGG--------FDPAYN- 280 (394) T ss_pred HHHHHHHHHHHHHHhhcccc-------------ccccccccHH-------------HHHHHHHhh--------hhhhhC- Confidence 88888888888877753211 1122322221 112222111 112222 Q ss_pred CCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEe--cCCCccceEEEEEecCCCccceeEe Q lcl|Aclame:pro 371 GNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYI--DQYARSDYFTIGYKGSNEMDAGIYY 448 (519) Q Consensus 371 gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~dy~~vG~KG~~~~~~~~fy 448 (519) .-+||+|.+...|...= +.. +...+..+.+.. .-++|.| ++|++ |...+..-+++|-- ..+.++ T Consensus 281 -a~~v~n~~~~~~l~~lk--d~~----G~~i~~~~~~~~-~~~~l~G-~pv~~~~~~~~~~~~~~~gd~-----~~~~~~ 346 (394) T protein:vir:97 281 -VSLIVSQSFYQTLDTLK--DGN----GRYLLQDDITAV-SGKVLLG-KPVFVLSDEVLGANKAFIGDF-----KRGVLF 346 (394) T ss_pred -CEEEEcHHHHHHHHHhh--ccC----CCeeeecCcCCC-CCceecc-ceeEEecccccCCccEEEeec-----cccEEE Confidence 34679999988887541 100 011111121111 1246777 57776 44444444444421 011111 Q ss_pred ecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccchhh Q lcl|Aclame:pro 449 APYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGLNGY 510 (519) Q Consensus 449 aPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y 510 (519) ..- ....+...|...++..+-...||+..+ +|-+ ...+ ...+-. -++ T Consensus 347 ~~~-~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a-------~~~~-~~~~~~------~p~ 394 (394) T protein:vir:97 347 ADR-KDLGLRWADNEIYGQYLQAVLRFGVSKVDDKA-------GYYV-TFTPEP------LPL 394 (394) T ss_pred EEe-cceEEEEecccccceeEEEEEEEccEEecccc-------eEEE-Eecccc------cCC Confidence 111 111222335555565666667887644 2211 1111 111100 011 No 69 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=78.78 E-value=0.11 Score=25.82 Aligned_cols=334 Identities=13% Similarity=0.131 Sum_probs=123.4 Q ss_pred CChHHHHHhhhhhhCC--------------Cc--cccccccchhhhhhhhhhhH---HHHHhhhh---------hccc-- Q lcl|Aclame:pro 1 MKKNALVQKWSALLEN--------------EA--LPEIVGASKQAIIAKIFENQ---EQDILTAP---------EYRD-- 50 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~--------------~~--~~~~~~~~~~~~~~~~~enq---~~~~~~~~---------~~~~-- 50 (519) |+.++|.++|.-+.+. +. +-+|.. .+..+ ..+.+.+ ++.+.+.. +-+. T Consensus 5 m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (408) T protein:vir:10 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKR-DNEKVRRDALREQLVEAQAEQVVNMREEEKGPL 82 (408) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 8888899888665442 00 001100 11111 0111110 00110000 0000 Q ss_pred --------hhhhhhhhhhhhhhhhcccccc----chhhhcccccc-ccc---cccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 51 --------EKISEAFGSFLTEAEIGGDHGY----DATNIAAGQTS-GAV---TQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 51 --------~~~~~~~~~~~~~~~~~~~~g~----~~~~~~est~t-g~v---~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) ++...++..++ -+.++. ....+..++.+ |.. ..+.+- +++.+.......++|.+.| T Consensus 83 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~---Ii~~~~~~~~l~~~~~~~~ 154 (408) T protein:vir:10 83 NKSENELKDKFVKDFVNMV-----RNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTM---INTLVRQYDSLQQYVRVES 154 (408) T ss_pred ccchhhhHHHHHHHHHHHh-----hcchhhhhhhhhhhhhcccccCCceeccHhHHHH---HHHHHHhhchhhhhcceee Confidence 00000011111 011111 01111112211 111 112222 4455555667788999999 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |+++.|-+--.|-. +.. +.+.| T Consensus 155 ~~~~~~~~~~~~~~--~~~--------------~~a~~------------------------------------------ 176 (408) T protein:vir:10 155 VSTSNGSRVYEKWT--DVT--------------PLTVM------------------------------------------ 176 (408) T ss_pred ccCCcceEEEeecc--ccc--------------cceee------------------------------------------ Confidence 99888765422210 000 00000 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) +++|- +. .......|.++.|++.|..+ ...+|-||.+ T Consensus 177 ---------------------------v~E~~-----~~----~~~~~~~~~~i~~~~~k~~~-------~~~iS~ell~ 213 (408) T protein:vir:10 177 ---------------------------DAEDG-----KI----PDLDNPQLTIIKYLIKRYAG-------IITATNTSLK 213 (408) T ss_pred ---------------------------ecCcc-----cc----ccccCcceeeEEeeeeeEEe-------eehhHHHHHh Confidence 01110 00 00112345556666555554 4569999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |- .+|.+++|.+.|+..|..-+|+.|+.-.- +.....|+.++++ ...++.. T Consensus 214 ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g------------~~~~~~~~~~~~~-------------l~~~~~~ 264 (408) T protein:vir:10 214 DT----AENILAWLSSWIAKKVVVTRNQAIIEVMK------------AAPKKPTIAKFDD-------------VITMINT 264 (408) T ss_pred hc----hHHHHHHHHHHHHHHHHHHHHHHHhhccc------------ccccccccccHHH-------------HHHHHHH Confidence 94 46779999999999999999988875211 1111223322221 1111111 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecC--CCcc--- Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQ--YARS--- 429 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~~--- 429 (519) .+ ...+-..-.+||+|.....|...= +.. +.-.+..+.+.. .-++|.| ++|++-. ..+. T Consensus 265 ------~~--~~~~~~~a~~v~n~~~~~~l~~lk--d~~----G~~i~~~~~~~~-~~~~l~G-~PV~~~~~~~~~~~~~ 328 (408) T protein:vir:10 265 ------AV--DPAIIATSSLLTNQSGLNKLALVK--TAE----GKYLLEPDPTKP-NSYLIKG-KQVIVVADRWLPNTGS 328 (408) T ss_pred ------hh--hhhhccCCEEEEcHHHHHHHHHhh--ccC----CceEeccCcCCC-CCceecc-eeeEEecccccCccCC Confidence 11 112212236789999988887541 100 011111111111 1136766 5776632 2221 Q ss_pred -----------ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-ecC-------cccccccCC Q lcl|Aclame:pro 430 -----------DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INP-------FADPAAQAP 490 (519) Q Consensus 430 -----------dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~-~nP-------~~~~~~~~~ 490 (519) ++++++.++.... =+.++.-. +-.+.+=.+-+..||++. .+| |+......| T Consensus 329 ~~~~i~~gd~~~~~~~~~~~~~~v----~~~~~~~~------~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~ 398 (408) T protein:vir:10 329 TVYPLYYGDMSQAITLFDRENMSL----LPTNIGAG------AFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVG 398 (408) T ss_pred CceEEEEEehhccEEEEEecceEE----EEcccccc------hhhcCceEEEEEEeeccEEeccccEEEEEeeccccCCC Confidence 1333333322111 11111000 001112222233333322 111 000001111 Q ss_pred cceeecCCchh Q lcl|Aclame:pro 491 TKRIQNGMPDI 501 (519) Q Consensus 491 ~~~i~~~~d~~ 501 (519) ..... ..... T Consensus 399 ~~~~~-~~~~~ 408 (408) T protein:vir:10 399 NFKTT-TSTAV 408 (408) T ss_pred CCCCC-CcccC Confidence 10000 00111 No 70 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=77.96 E-value=0.12 Score=25.65 Aligned_cols=348 Identities=16% Similarity=0.151 Sum_probs=117.8 Q ss_pred CChHH-------------HHHhhhhhhCCCccc-----cccc-cchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhh Q lcl|Aclame:pro 1 MKKNA-------------LVQKWSALLENEALP-----EIVG-ASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFL 61 (519) Q Consensus 1 ~~~~~-------------l~~kw~p~l~~~~~~-----~~~~-~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~ 61 (519) ++.|+ |.++...+-..|.+- ++.. ..++....+--+.|.+.....+..+ .+..+.+.+. T Consensus 31 lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 108 (428) T protein:vir:10 31 LTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVM--SIAAAQGNLQ 108 (428) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHH--HHHHhhhhHH Confidence 34332 222221110000000 0000 0000000000111111000000000 0000000000 Q ss_pred hhhhhccc-cccchh--hhcccccccccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCC Q lcl|Aclame:pro 62 TEAEIGGD-HGYDAT--NIAAGQTSGAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIA 135 (519) Q Consensus 62 ~~~~~~~~-~g~~~~--~~~est~tg~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~ 135 (519) ..+..... .+.... .+..++++|.+. .+.+-++.+. .+..+..++ |+..+++++|-+ +++-. T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l---~~~~~l~~~-~~~~~~~~~g~~-----~~p~~--- 176 (428) T protein:vir:10 109 DAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELL---RDRTIVRKL-GARSIPLPNGNM-----SLPRL--- 176 (428) T ss_pred HHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHH---hhhchhhhh-cceeeecCCcce-----EEEEE--- Confidence 00000000 000001 111111122211 1111122222 223333333 222222222221 01000 Q ss_pred CCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccc Q lcl|Aclame:pro 136 AGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVE 215 (519) Q Consensus 136 ~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~ 215 (519) ++. T Consensus 177 -----------------~~~------------------------------------------------------------ 179 (428) T protein:vir:10 177 -----------------AGG------------------------------------------------------------ 179 (428) T ss_pred -----------------eCC------------------------------------------------------------ Confidence 000 Q ss_pred cccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 216 AGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATE 295 (519) Q Consensus 216 ~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTE 295 (519) ....-++ | +...++...++++++...|.-+-...+|-||.+|- ..|.++.|.+.|... T Consensus 180 -~~a~~v~--------E---------g~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~a 237 (428) T protein:vir:10 180 -ATASYTG--------E---------NQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTA 237 (428) T ss_pred -cceeeec--------c---------CccccccccceeeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHH Confidence 0000011 1 12234444556666666666666789999999884 245688999999999 Q ss_pred HHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccc-----cccchHHHHHHHHHHHHHHHHHHHHhhccccC Q lcl|Aclame:pro 296 IMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI-----RGARWAGESFKALLFQIDKEAAEIARQTGRGA 370 (519) Q Consensus 296 ImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~-----~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~ 370 (519) |...+++.||. | +-++..+.|++.-...... ...--.......+ ..+..+...+...-. . T Consensus 238 i~~~~d~~~l~--------G----~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~ 302 (428) T protein:vir:10 238 ISVREDKAFMR--------D----DGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTY-LDSIILMSMDGNSNM--I 302 (428) T ss_pred HHHHHHHHHhc--------c----CCCCccccccccccccccccccccccccccHHHHHHH-HHHHHHhhhcccccc--c Confidence 99999988874 1 0011233455432110000 0000011222222 222233333333222 2 Q ss_pred CCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc----------------eEEE Q lcl|Aclame:pro 371 GNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD----------------YFTI 434 (519) Q Consensus 371 gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~v 434 (519) ....|++|.....|...- +.. ...+-.+... |+|.| ++||++.+.+.+ ++++ T Consensus 303 ~~~~v~n~~~~~~L~~lk--d~~-----G~~i~~~~~~----g~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i 370 (428) T protein:vir:10 303 SSGWGMSNRTYMKLFGLR--DGN-----GNKVYPEMAQ----GMLKG-YPIQRTSAIPANLGEGGKESEIYFADFNDVVI 370 (428) T ss_pred cCEEEEcHHHHHHHHHhh--ccC-----CceeccCCCC----Ceeec-eeeEEeccccccccCCCccceEEEEecceEEE Confidence 355678999888887532 100 0111112222 57777 699998876643 1223 Q ss_pred EEecCCCccceeEeecccccccccccCcccc---cceeeeeeeeceeec-CcccccccCCcceeecCCch Q lcl|Aclame:pro 435 GYKGSNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGIGIN-PFADPAAQAPTKRIQNGMPD 500 (519) Q Consensus 435 G~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~---qP~~g~~tRY~l~~n-P~~~~~~~~~~~~i~~~~d~ 500 (519) |..+.-+.+ ..+|..........-..| +=.+=...|+++.+. |= .-.+..+..| T Consensus 371 ~~~~~i~i~----~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~--------a~~~~t~~~~ 428 (428) T protein:vir:10 371 GEDGNMKVD----FSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPE--------GLVLGTGVLF 428 (428) T ss_pred EEecceEEE----eecccccccccccccchhhcchhheeeeeeeCceeeccc--------eEEEEeccCC Confidence 333222211 112211111000000011 122235566665542 21 1222333444 No 71 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=77.70 E-value=0.12 Score=25.59 Aligned_cols=277 Identities=14% Similarity=0.070 Sum_probs=122.5 Q ss_pred hccccccccccccCce-ehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcc Q lcl|Aclame:pro 77 IAAGQTSGAVTQIGPA-VMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQ 155 (519) Q Consensus 77 ~~est~tg~v~~~~P~-L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~ 155 (519) .+ +++|.+ .-|. .-.+++.+-++.+-.++|.+.||++... +|+-..+. .+| T Consensus 1 ma--~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~-------~ip~~~~~---~~a-------------- 52 (298) T protein:vir:16 1 MV--LNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGE-------KVFTFTMD---SEI-------------- 52 (298) T ss_pred Cc--ccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEecC---cce-------------- Confidence 12 222221 1121 1134444556778899999999875321 11100000 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcc Q lcl|Aclame:pro 156 GAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQE 235 (519) Q Consensus 156 ~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~ 235 (519) .-+++ T Consensus 53 ----------------------------------------------------------------~~v~E----------- 57 (298) T protein:vir:16 53 ----------------------------------------------------------------DVVAE----------- 57 (298) T ss_pred ----------------------------------------------------------------EEecC----------- Confidence 00111 Q ss_pred cCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhh Q lcl|Aclame:pro 236 GFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVG 315 (519) Q Consensus 236 ~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~ 315 (519) +.++++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|+..|...|+..++.-... T Consensus 58 ------~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~----- 125 (298) T protein:vir:16 58 ------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFHGVNP----- 125 (298) T ss_pred ------CccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccC----- Confidence 1123333444556666666666678899999875432 1255688888888888888888888752110 Q ss_pred hhcccccccccce---eeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccc Q lcl|Aclame:pro 316 KSGMTNTVGAKAG---VFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVS 392 (519) Q Consensus 316 ~~~~t~~~~~~~G---~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~ 392 (519) -+.++....| +........ -.......++..|.++...+... +.+...+|++|.....|...- +. T Consensus 126 ---~~g~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~lk--d~ 193 (298) T protein:vir:16 126 ---RLGTASAVIGTNHFDSKVTQKV-----EAPRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQK--DL 193 (298) T ss_pred ---CCCccccccccccccccccccc-----ccccccccHHHHHHHHHHHhhhc--CCCccEEEEcHHHHHHHHHhh--cc Confidence 0000000001 000000000 00111122344455555554442 234456899999998887432 11 Q ss_pred cccccccccccccCCCceEEEEecCcEEEEecCCCcc------ceEEEEEecCCCccceeEeeccc--ccccccccCccc Q lcl|Aclame:pro 393 YAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS------DYFTIGYKGSNEMDAGIYYAPYV--ALTPLRGSDPKN 464 (519) Q Consensus 393 ~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~~~fyaPYv--~~~~~~~~dp~s 464 (519) . +...+..+.+.. -.|+|.| ++|+++.+.+. +.+++|- - ..++.|..-- ++...+..|+++ T Consensus 194 ~----G~~i~~~~~~~~-~~~~l~G-~PV~~~~~v~~~~~~~~~~~~~GD---f--s~~~~~~~~~~~~~~~~~~~~~~~ 262 (298) T protein:vir:16 194 Q----DNALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIGD---F--ANGFKWGYAKEVPLEVIQYGDPDN 262 (298) T ss_pred C----CCeeecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEee---c--cceEEEEEecCceEEEeeccCCcC Confidence 0 111111111111 1257887 59999887652 3444441 0 0111222111 122222234432 Q ss_pred -----cc-ceeee--eeeecee-ecCcccccccCCcceeecCC Q lcl|Aclame:pro 465 -----FQ-PVMGF--KTRYGIG-INPFADPAAQAPTKRIQNGM 498 (519) Q Consensus 465 -----~q-P~~g~--~tRY~l~-~nP~~~~~~~~~~~~i~~~~ 498 (519) || =.++| ..|++.. .+|= ...++.+.. T Consensus 263 ~~~~~f~~~~v~~ra~~r~d~~v~~~~-------a~~~l~~at 298 (298) T protein:vir:16 263 SGLDLKGYNQVYIRAELFLGWGILDAT-------KFARVTEAN 298 (298) T ss_pred cchhhhhcCcEEEEEEEEEccEeeccc-------ceEEEeecC Confidence 22 11333 5577743 3332 234454433 No 72 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=77.41 E-value=0.13 Score=25.53 Aligned_cols=289 Identities=11% Similarity=0.072 Sum_probs=122.9 Q ss_pred ccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccc Q lcl|Aclame:pro 78 AAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGA 157 (519) Q Consensus 78 ~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~ 157 (519) |..+++|.+.--....=.+++++-+.-+..+++-|-||++.. .+|+-.... .+| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-------~~~p~~~~~---~~a---------------- 54 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-------QQYMTLTAP---PRG---------------- 54 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCc-------eEEEEEeCC---cee---------------- Confidence 444555554322222234556666778888999999886532 122111000 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccC Q lcl|Aclame:pro 158 AETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGF 237 (519) Q Consensus 158 ~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ 237 (519) .-+++| T Consensus 55 --------------------------------------------------------------~wv~Eg------------ 60 (311) T protein:vir:81 55 --------------------------------------------------------------EVVGEG------------ 60 (311) T ss_pred --------------------------------------------------------------EEeecC------------ Confidence 001111 Q ss_pred CCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 238 NGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKS 317 (519) Q Consensus 238 ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~ 317 (519) ..+++...++++++..+|.-+-....|-||.|+--. -.++-|++|.+-|+..|...|+.-++.=.....-.... T Consensus 61 -----~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~ 134 (311) T protein:vir:81 61 -----AQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALS 134 (311) T ss_pred -----cccccccceeeEEEEeeEEEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccc Confidence 112222333455555555445556789999875322 13455788888888888888887777521100000001 Q ss_pred cccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccc Q lcl|Aclame:pro 318 GMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQG 397 (519) Q Consensus 318 ~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~ 397 (519) ++.........+...... ....++.-|..+-..+.. .+...+-+|++|.....|...- +. . T Consensus 135 gi~~~~~~~~~~~~~~~~-----------~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk--d~----~ 195 (311) T protein:vir:81 135 GSPAKILDTTNIVELTTG-----------TSATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQR--DS----Q 195 (311) T ss_pred cccccccccceeeeeccc-----------ccchHHHHHHHHHHHhhh--cCCCceEEEEcHHHHHHHHhhh--cc----C Confidence 111000001111111111 001223334444444432 2235567899999998887421 11 0 Q ss_pred ccccccccCCCceEEEEecCcEEEEecCCCccceEE------EEEecCCCc-----c-ceeEeecccccccc--cccCcc Q lcl|Aclame:pro 398 LGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFT------IGYKGSNEM-----D-AGIYYAPYVALTPL--RGSDPK 463 (519) Q Consensus 398 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~------vG~KG~~~~-----~-~~~fyaPYv~~~~~--~~~dp~ 463 (519) ++-.+..+.+. -..|+|.| ++|+++.+.+..-.. +...+.... | +.+++...-+.... +-.|+. T Consensus 196 G~~l~~~~~~~-~~~~tl~G-~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~ 273 (311) T protein:vir:81 196 GRKLYPELGFG-TDVASFAG-LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPD 273 (311) T ss_pred CCeeecCcccc-CCCceecc-eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCC Confidence 11111112111 12367887 699988776532111 111111110 1 12233322222221 112332 Q ss_pred c----ccc-eeee--eeeecee-ecCcccccccCCcceeecCCch Q lcl|Aclame:pro 464 N----FQP-VMGF--KTRYGIG-INPFADPAAQAPTKRIQNGMPD 500 (519) Q Consensus 464 s----~qP-~~g~--~tRY~l~-~nP~~~~~~~~~~~~i~~~~d~ 500 (519) . ||- .++| ..|+|.. .+|=+ ..++....+. T Consensus 274 ~~~~~~~~~~v~~r~~~r~d~~v~~~~a-------~~~l~~a~~~ 311 (311) T protein:vir:81 274 GLGDLKRQNQIAIRAEVVYGIGIMSTDA-------FAVVRDADES 311 (311) T ss_pred cchhhhhcCcEEEEEEEEeccEeecccc-------eEEEEeeccC Confidence 1 222 1333 4677744 44421 2334332221 No 73 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=74.98 E-value=0.15 Score=25.07 Aligned_cols=352 Identities=13% Similarity=0.051 Sum_probs=134.2 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhh---------------------h Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFG---------------------S 59 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~---------------------~ 59 (519) ...+++.+++..++.... ++.+...+.-...+. +++.-........+.+.+... . T Consensus 54 ~~~~~~~~~~~~~~a~~~--~~~~~~~~~e~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (497) T protein:vir:10 54 ERAQEMLKSLGGADAAKD--GLDNDIPEVEVRNLK--QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT 129 (497) T ss_pred HHHHHHHHHHHHHHHHHH--HHHHHHHHHHhhhhh--hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Confidence 222333344444433210 000000000000000 000000000000000000000 0 Q ss_pred hhhhhhhccccccch-----hhhcccccccccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecC Q lcl|Aclame:pro 60 FLTEAEIGGDHGYDA-----TNIAAGQTSGAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGK 131 (519) Q Consensus 60 ~~~~~~~~~~~g~~~-----~~~~est~tg~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~ 131 (519) ...|..-....+-.. .+...+++++... .+.+-++.+. .+.....+++.+-||+++.. .|.. T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~---~~~~~i~~l~~~~~~~~~~~-------~~~~ 199 (497) T protein:vir:10 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQL---FYELSLADLISSRPVTSPNL-------SYLT 199 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHH---HhhhhHHhhccccccCCCce-------EEEE Confidence 000000000000000 0011111222211 2333333333 34566678888888887642 1111 Q ss_pred CCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccc Q lcl|Aclame:pro 132 DPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVT 211 (519) Q Consensus 132 ~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~ 211 (519) ..+.. . .+ T Consensus 200 ~~~~~--~---------~a------------------------------------------------------------- 207 (497) T protein:vir:10 200 ESAAH--N---------NA------------------------------------------------------------- 207 (497) T ss_pred EcCCC--C---------cc------------------------------------------------------------- Confidence 10000 0 00 Q ss_pred cccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHH Q lcl|Aclame:pro 212 ALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGI 291 (519) Q Consensus 212 ~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNI 291 (519) .-++ | +..++|...+++++++.+|.-+-...+|-||++|-- +.++.|.+- T Consensus 208 --------~wv~--------E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~ 257 (497) T protein:vir:10 208 --------AAVA--------E---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGR 257 (497) T ss_pred --------eeec--------c---------CcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHH Confidence 0001 1 112344445667778888877778899999999942 258999999 Q ss_pred HHHHHHHHhhHHHHH-HHHhhhhhhhhcccccccccceeeeccc-----------------------cccccccchHH-- Q lcl|Aclame:pro 292 LATEIMLEINREVID-WINYSAQVGKSGMTNTVGAKAGVFDFQD-----------------------PIDIRGARWAG-- 345 (519) Q Consensus 292 LSTEImlEINReii~-~i~~~a~~~~~~~t~~~~~~~G~fDl~~-----------------------~~d~~~~~~a~-- 345 (519) |...|..-+|..||. .= ++.+.|++.-.. ..+ ....|.+ T Consensus 258 l~~~i~~~~d~~~l~G~G--------------~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 322 (497) T protein:vir:10 258 LLEGIQRKEEVQLLAGGG--------------YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD-GTNGAFVGQ 322 (497) T ss_pred HHHHHHHHHHHHhhcCCC--------------cccccccccccccccccccccchhhhhhhhhhhhhhcc-cccchhhhh Confidence 999999999999886 10 000111111000 000 0001111 Q ss_pred ---HHH-----------------------HHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHh----cCccccccc Q lcl|Aclame:pro 346 ---ESF-----------------------KALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAA----VDTSVSYAA 395 (519) Q Consensus 346 ---e~~-----------------------r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~----~g~~~~~~~ 395 (519) ... -.+...+...-..+.+...+ .++.+|.+|.....|.. -|...+.+. T Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~ 401 (497) T protein:vir:10 323 DTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) T ss_pred hHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCc Confidence 000 11122233333444454444 45778889887777653 243333322 Q ss_pred ccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCC------CccceeEeecccccccccccCccccccee Q lcl|Aclame:pro 396 QGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSN------EMDAGIYYAPYVALTPLRGSDPKNFQPVM 469 (519) Q Consensus 396 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~------~~~~~~fyaPYv~~~~~~~~dp~s~qP~~ 469 (519) .+...+..... -++|.| ++|++.+..+.+=+++|--... ..+-.+-..||.. .+=.+-+=.+ T Consensus 402 ~~~~~~~~~~~-----~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~n~v~~ 469 (497) T protein:vir:10 402 FGNAYGNPVNG-----GKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVDGKVTV 469 (497) T ss_pred ccccccccccC-----Cceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhcCcEEE Confidence 22112111111 125665 7999888887654444321100 0001111222210 0112223344 Q ss_pred eeeeeece-eecCcccccccCCcceeecCCchhhhcccch Q lcl|Aclame:pro 470 GFKTRYGI-GINPFADPAAQAPTKRIQNGMPDIVNSLGLN 508 (519) Q Consensus 470 g~~tRY~l-~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~ 508 (519) =+..|+++ +.+|-+- .++. +-. .+.-| T Consensus 470 r~~~r~~~~v~~p~A~-------~~l~--~~~---~~~~~ 497 (497) T protein:vir:10 470 RAEERLGLLVYRPSAF-------QLIQ--LKK---GATGS 497 (497) T ss_pred EEEEeecceeeccccE-------EEEE--ecC---CccCC Confidence 45678876 4455432 1121 111 11111 No 74 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=74.98 E-value=0.15 Score=25.07 Aligned_cols=352 Identities=13% Similarity=0.051 Sum_probs=134.2 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhh---------------------h Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFG---------------------S 59 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~---------------------~ 59 (519) ...+++.+++..++.... ++.+...+.-...+. +++.-........+.+.+... . T Consensus 54 ~~~~~~~~~~~~~~a~~~--~~~~~~~~~e~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (497) T protein:vir:78 54 ERAQEMLKSLGGADAAKD--GLDNDIPEVEVRNLK--QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT 129 (497) T ss_pred HHHHHHHHHHHHHHHHHH--HHHHHHHHHHhhhhh--hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Confidence 222333344444433210 000000000000000 000000000000000000000 0 Q ss_pred hhhhhhhccccccch-----hhhcccccccccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecC Q lcl|Aclame:pro 60 FLTEAEIGGDHGYDA-----TNIAAGQTSGAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGK 131 (519) Q Consensus 60 ~~~~~~~~~~~g~~~-----~~~~est~tg~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~ 131 (519) ...|..-....+-.. .+...+++++... .+.+-++.+. .+.....+++.+-||+++.. .|.. T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~---~~~~~i~~l~~~~~~~~~~~-------~~~~ 199 (497) T protein:vir:78 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQL---FYELSLADLISSRPVTSPNL-------SYLT 199 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHH---HhhhhHHhhccccccCCCce-------EEEE Confidence 000000000000000 0011111222211 2333333333 34566678888888887642 1111 Q ss_pred CCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccc Q lcl|Aclame:pro 132 DPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVT 211 (519) Q Consensus 132 ~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~ 211 (519) ..+.. . .+ T Consensus 200 ~~~~~--~---------~a------------------------------------------------------------- 207 (497) T protein:vir:78 200 ESAAH--N---------NA------------------------------------------------------------- 207 (497) T ss_pred EcCCC--C---------cc------------------------------------------------------------- Confidence 10000 0 00 Q ss_pred cccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHH Q lcl|Aclame:pro 212 ALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGI 291 (519) Q Consensus 212 ~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNI 291 (519) .-++ | +..++|...+++++++.+|.-+-...+|-||++|-- +.++.|.+- T Consensus 208 --------~wv~--------E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~ 257 (497) T protein:vir:78 208 --------AAVA--------E---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGR 257 (497) T ss_pred --------eeec--------c---------CcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHH Confidence 0001 1 112344445667778888877778899999999942 258999999 Q ss_pred HHHHHHHHhhHHHHH-HHHhhhhhhhhcccccccccceeeeccc-----------------------cccccccchHH-- Q lcl|Aclame:pro 292 LATEIMLEINREVID-WINYSAQVGKSGMTNTVGAKAGVFDFQD-----------------------PIDIRGARWAG-- 345 (519) Q Consensus 292 LSTEImlEINReii~-~i~~~a~~~~~~~t~~~~~~~G~fDl~~-----------------------~~d~~~~~~a~-- 345 (519) |...|..-+|..||. .= ++.+.|++.-.. ..+ ....|.+ T Consensus 258 l~~~i~~~~d~~~l~G~G--------------~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 322 (497) T protein:vir:78 258 LLEGIQRKEEVQLLAGGG--------------YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD-GTNGAFVGQ 322 (497) T ss_pred HHHHHHHHHHHHhhcCCC--------------cccccccccccccccccccccchhhhhhhhhhhhhhcc-cccchhhhh Confidence 999999999999886 10 000111111000 000 0001111 Q ss_pred ---HHH-----------------------HHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHh----cCccccccc Q lcl|Aclame:pro 346 ---ESF-----------------------KALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAA----VDTSVSYAA 395 (519) Q Consensus 346 ---e~~-----------------------r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~----~g~~~~~~~ 395 (519) ... -.+...+...-..+.+...+ .++.+|.+|.....|.. -|...+.+. T Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~ 401 (497) T protein:vir:78 323 DTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) T ss_pred hHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCc Confidence 000 11122233333444454444 45778889887777653 243333322 Q ss_pred ccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCC------CccceeEeecccccccccccCccccccee Q lcl|Aclame:pro 396 QGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSN------EMDAGIYYAPYVALTPLRGSDPKNFQPVM 469 (519) Q Consensus 396 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~------~~~~~~fyaPYv~~~~~~~~dp~s~qP~~ 469 (519) .+...+..... -++|.| ++|++.+..+.+=+++|--... ..+-.+-..||.. .+=.+-+=.+ T Consensus 402 ~~~~~~~~~~~-----~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~n~v~~ 469 (497) T protein:vir:78 402 FGNAYGNPVNG-----GKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVDGKVTV 469 (497) T ss_pred ccccccccccC-----Cceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhcCcEEE Confidence 22112111111 125665 7999888887654444321100 0001111222210 0112223344 Q ss_pred eeeeeece-eecCcccccccCCcceeecCCchhhhcccch Q lcl|Aclame:pro 470 GFKTRYGI-GINPFADPAAQAPTKRIQNGMPDIVNSLGLN 508 (519) Q Consensus 470 g~~tRY~l-~~nP~~~~~~~~~~~~i~~~~d~~a~~~~~~ 508 (519) =+..|+++ +.+|-+- .++. +-. .+.-| T Consensus 470 r~~~r~~~~v~~p~A~-------~~l~--~~~---~~~~~ 497 (497) T protein:vir:78 470 RAEERLGLLVYRPSAF-------QLIQ--LKK---GATGS 497 (497) T ss_pred EEEEeecceeeccccE-------EEEE--ecC---CccCC Confidence 45678876 4455432 1121 111 11111 No 75 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=74.65 E-value=0.16 Score=25.01 Aligned_cols=283 Identities=12% Similarity=0.040 Sum_probs=121.9 Q ss_pred hccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccc Q lcl|Aclame:pro 77 IAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQG 156 (519) Q Consensus 77 ~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~ 156 (519) .+++++++...--....-.++.++.+..+..+++.+.||++-.. +|+-.... +.+.| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-------~~p~~~~~------------~~a~w---- 57 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ-------REFVFDFD------------SDIDI---- 57 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEecC------------cceEE---- Confidence 45556555443222222223333445556678999999876321 12111000 00000 Q ss_pred cccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhccc Q lcl|Aclame:pro 157 AAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEG 236 (519) Q Consensus 157 ~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~ 236 (519) +++| T Consensus 58 -----------------------------------------------------------------v~Eg----------- 61 (300) T protein:vir:95 58 -----------------------------------------------------------------VAEN----------- 61 (300) T ss_pred -----------------------------------------------------------------eeCC----------- Confidence 1111 Q ss_pred CCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhh Q lcl|Aclame:pro 237 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGK 316 (519) Q Consensus 237 ~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 316 (519) .+.++...+++.+++++|.-+-...+|-||.+.... ..+|-+++|.+-|...|...++..++.=... T Consensus 62 ------~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~------ 128 (300) T protein:vir:95 62 ------GKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINP------ 128 (300) T ss_pred ------cccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccC------ Confidence 123334444556666666666677889998753222 2366788888889999999998888852110 Q ss_pred hcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccc Q lcl|Aclame:pro 317 SGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQ 396 (519) Q Consensus 317 ~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~ 396 (519) .+.+.....|...+...... . ........+.-|.++...+.. .+.+.+.+|++|.....|...- +. T Consensus 129 --~~g~~~~~~~~~~~~~~~~~---~-~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk--d~---- 194 (300) T protein:vir:95 129 --RTKQASTIIGDNCFDKKVTQ---T-VPFKDTNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSKMK--NA---- 194 (300) T ss_pred --CCCCCcccccccccccccce---e-ecccccchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHHhh--cc---- Confidence 00000000111000000000 0 000011223334444444432 2345567899999988886432 10 Q ss_pred cccccccccCCCceEEEEecCcEEEEecCCCcc------ceEEEEEecCCCccceeEeecccc--cccccccCccc---- Q lcl|Aclame:pro 397 GLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS------DYFTIGYKGSNEMDAGIYYAPYVA--LTPLRGSDPKN---- 464 (519) Q Consensus 397 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~~~fyaPYv~--~~~~~~~dp~s---- 464 (519) .+...+..+.+.. ..++|.| ++|+++...+. +.+++|= +..+++|..... +...+-.|+++ T Consensus 195 ~G~~i~~~~~~~~-~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d~~~~~ 267 (300) T protein:vir:95 195 EGGKLYPELAWGG-VPDAING-LAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPDNSGRD 267 (300) T ss_pred CCCeeccCccccC-CCceecc-eeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEeeccCCCCcchh Confidence 0011111222111 1367877 69998887653 1233331 001112221111 11111113321 Q ss_pred -cc---ceeeeeeeeceee-cCcccccccCCcceeecCCchhhh Q lcl|Aclame:pro 465 -FQ---PVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVN 503 (519) Q Consensus 465 -~q---P~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~ 503 (519) || =.+=+..|+++.+ +|=+- .+|.+ .++ T Consensus 268 ~f~~~~v~~r~~~r~d~~v~~~~a~-------~~l~~----~~g 300 (300) T protein:vir:95 268 LKGYNQIYIRCEAYIGWGIMDAASF-------ARIVK----TGG 300 (300) T ss_pred hhhcCcEEEEEEEeecceeecccce-------EEEec----CCC Confidence 21 2233345777544 33321 22221 111 No 76 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=74.41 E-value=0.16 Score=24.97 Aligned_cols=285 Identities=12% Similarity=0.075 Sum_probs=120.0 Q ss_pred hccccccccccccCcee-hhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcc Q lcl|Aclame:pro 77 IAAGQTSGAVTQIGPAV-MGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQ 155 (519) Q Consensus 77 ~~est~tg~v~~~~P~L-~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~ 155 (519) ++..+++..=.-.-+.+ -.+++++.++.+..+++-+.||++++--|.- .... +.+.|- T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~----~~~~---------------~~a~wv-- 59 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPV----LATL---------------PEADWV-- 59 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEE----EeCC---------------cceEEe-- Confidence 23333222111111111 2355666677777888999999876521110 0000 000111 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcc Q lcl|Aclame:pro 156 GAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQE 235 (519) Q Consensus 156 ~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~ 235 (519) ++|-.. T Consensus 60 -------------------------------------------------------------------~E~~~~------- 65 (305) T protein:vir:25 60 -------------------------------------------------------------------GESATD------- 65 (305) T ss_pred -------------------------------------------------------------------eccccc------- Confidence 111000 Q ss_pred cCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhh Q lcl|Aclame:pro 236 GFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVG 315 (519) Q Consensus 236 ~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~ 315 (519) ....++.-..++++++..++..+-.-.+|-||.+|- ..|.|++|.+-|+..|...++..++.=.-. - T Consensus 66 -----~~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~--~-- 132 (305) T protein:vir:25 66 -----PKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDK--P-- 132 (305) T ss_pred -----ccccccccccceeeEEeeeEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHhhhheeccCC--C-- Confidence 000111112334455555555666678999999984 357899999999999999999999841000 0 Q ss_pred hhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccc Q lcl|Aclame:pro 316 KSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAA 395 (519) Q Consensus 316 ~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~ 395 (519) .+.-........+.--....... .....-.++.-+.++...+...-. ..+-+|++|.....|... .+ + T Consensus 133 -~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l--kd-~-- 200 (305) T protein:vir:25 133 -ASWVSPALIPAAVTAGQAVEVVG----GVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANI--RD-A-- 200 (305) T ss_pred -CCccccccccccccccccccccc----cchhhhHHHHHHHHHHHhhhhccc--ccceeEecHHHHHHHHHh--hc-c-- Confidence 00000000000000000000000 111223344444444444544322 234478899988888632 11 1 Q ss_pred ccccccccccCCCceEEEEecCcEEEEecCCCccc----eE--------EEEEecCCCccceeEeecccccccccccCcc Q lcl|Aclame:pro 396 QGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD----YF--------TIGYKGSNEMDAGIYYAPYVALTPLRGSDPK 463 (519) Q Consensus 396 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~--------~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~ 463 (519) .+...+. + ++|.| ++|+|..+.+.+ -+ .+|..+.-+.+- ..+.-+. ..-.+. T Consensus 201 -~G~~i~~----~----~~l~G-~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~~~~--~~~~~~ 264 (305) T protein:vir:25 201 -NGNPVFR----D----DSFAG-FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKF----LDQATLG--TGENQI 264 (305) T ss_pred -CCceeec----C----Ccccc-cceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEE----eeeeeee--cCCcee Confidence 0111111 1 35766 688888776532 12 222222211111 0110000 001111 Q ss_pred c-cc-ceee--eeeeece-eecCcccccccCCcceeecCCch--hhhcc Q lcl|Aclame:pro 464 N-FQ-PVMG--FKTRYGI-GINPFADPAAQAPTKRIQNGMPD--IVNSL 505 (519) Q Consensus 464 s-~q-P~~g--~~tRY~l-~~nP~~~~~~~~~~~~i~~~~d~--~a~~~ 505 (519) + || ..++ ...|||+ +.||-+- ....+.++ ....+ T Consensus 265 ~~~~~~~~~~R~~~r~~~~v~~p~a~--------v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 265 NLAERDMVALRLKARFAYVLGVSATA--------QGANKTPVAVVAPAA 305 (305) T ss_pred eeeecCcEEEEEEEeecceeeCcccE--------EEEccccccccCCCC Confidence 1 22 1222 4668996 4576542 11112222 11111 No 77 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=72.89 E-value=0.18 Score=24.71 Aligned_cols=305 Identities=9% Similarity=0.009 Sum_probs=121.3 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) ||-.|+.-.+.+.+ ..-..-. .-+++.....++..+.. --..+.-.+++.+..+.+..+++.+.| T Consensus 1 ~~k~~~~~~~~~~~------------~~~~~~~--~~~~a~~~~~~~~~~~l-ip~~~~~~ii~~~~~~s~l~~~~~~~~ 65 (324) T protein:vir:99 1 MEQTQKLKLNLQHF------------ASNNVKP--QVFNPDNVMMHEKKDGT-LLNDFTTPILQEVMENSKIMRLGKYEP 65 (324) T ss_pred CCCchHhhHHHHHH------------HHHhhhh--hhccccceeccCCCcce-echhHHHHHHHHHHhhchhhhhcceee Confidence 22111111111111 1000000 00111111111111111 011111223344445666777888888 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++.+.-|. +.... .++ . T Consensus 66 ~~~~~~~~p----~~~~~------~~a---------~------------------------------------------- 83 (324) T protein:vir:99 66 MEGTEKKFT----FWADK------PGA---------Y------------------------------------------- 83 (324) T ss_pred ccCCceEEE----EEecC------cce---------e------------------------------------------- Confidence 876542111 11000 000 0 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) -++ | +..+++...++++++++.|.-+---..|-||.+ T Consensus 84 --------------------------~v~--------E---------g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ 120 (324) T protein:vir:99 84 --------------------------WVG--------E---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred --------------------------Eec--------c---------CccccccccceeEEEEeeEEEEEeehhhHHHHh Confidence 001 1 112334444556666666666666789999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |-. .|.+++|.+.|+..|...+++.||.--. ++..+.|+++....... -.. ....+.. T Consensus 121 ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g------------~~~~~~~~~~~~~~~~~---~~~---~~~~~~~ 178 (324) T protein:vir:99 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------------NNPFGKSIAQSIEKTNK---VIK---GDFTQDN 178 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC------------CCccCccccccccccce---ecc---ccCCHHH Confidence 974 4679999999999999999999985110 01111222221110000 000 0111233 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc--ceE Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS--DYF 432 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~ 432 (519) |.++.+.|.. .+...+.+|++|.....|...-- + . .+....+..+ ++|.| ++|+|.+.... ..+ T Consensus 179 i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d---~--~--g~~~~~~~~~----~~l~G-~PVv~~~~~~~~~~~~ 244 (324) T protein:vir:99 179 IIDLEALLED--DELEANAFISKTQNRSLLRKIVD---P--E--TKERIYDRNS----DTLDG-LPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc---C--C--CceeecCCCC----ccccc-eeEEeecCCCCCcceE Confidence 4444444432 33455678999999999885421 0 0 0111112222 45777 58888776553 233 Q ss_pred EEEEecCCCccceeEeeccc--------ccccccccCccc-----c---cceeeeeeeeceee-cCcccccccCCcceee Q lcl|Aclame:pro 433 TIGYKGSNEMDAGIYYAPYV--------ALTPLRGSDPKN-----F---QPVMGFKTRYGIGI-NPFADPAAQAPTKRIQ 495 (519) Q Consensus 433 ~vG~KG~~~~~~~~fyaPYv--------~~~~~~~~dp~s-----~---qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~ 495 (519) ++|-.. .+++..-- +.......|+.. | +=.+=...||+..+ ||=+ ..++. T Consensus 245 i~gd~~------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~lt 311 (324) T protein:vir:99 245 ITGDFD------KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLV 311 (324) T ss_pred EEEecc------cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEE Confidence 333211 01111000 000000011110 1 12222345666432 2211 11221 Q ss_pred cCCchhhhcccch Q lcl|Aclame:pro 496 NGMPDIVNSLGLN 508 (519) Q Consensus 496 ~~~d~~a~~~~~~ 508 (519) .......-..+.= T Consensus 312 ~a~~~~~~~~~~~ 324 (324) T protein:vir:99 312 PADKKTDSVPGEV 324 (324) T ss_pred eccCCCCCCCCCC Confidence 1110000001100 No 78 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=71.91 E-value=0.19 Score=24.54 Aligned_cols=309 Identities=13% Similarity=0.053 Sum_probs=120.8 Q ss_pred cccchhhhccccccccccc--cCcee-hhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccccccccc Q lcl|Aclame:pro 70 HGYDATNIAAGQTSGAVTQ--IGPAV-MGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMY 146 (519) Q Consensus 70 ~g~~~~~~~est~tg~v~~--~~P~L-~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fn 146 (519) -||++.+.....++..... .-|.+ -.+++++....+..+++-+.||++++ ++...... . T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-----~~ip~~~~--~----------- 62 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATG-----IVIPHWTG--D----------- 62 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCc-----eEEEEEcC--C----------- Confidence 5565554443332221111 11211 12233344455567778888887654 11111000 0 Q ss_pred ccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceeccccc Q lcl|Aclame:pro 147 APNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGM 226 (519) Q Consensus 147 Eadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~Gm 226 (519) +.+. -+++ T Consensus 63 -~~a~---------------------------------------------------------------------wv~E-- 70 (397) T protein:vir:23 63 -VSAQ---------------------------------------------------------------------WIGE-- 70 (397) T ss_pred -cceE---------------------------------------------------------------------EecC-- Confidence 0000 0011 Q ss_pred chhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 227 ATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVID 306 (519) Q Consensus 227 sTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~ 306 (519) +..+++-..+++++++..|..+-.-.+|-||.+|-. .|.|++|.+.|...|...|++.+|. T Consensus 71 ---------------g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~ 131 (397) T protein:vir:23 71 ---------------GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAALH 131 (397) T ss_pred ---------------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 112233334456677777777777789999999863 6789999999999999999999985 Q ss_pred HHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHh Q lcl|Aclame:pro 307 WINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAA 386 (519) Q Consensus 307 ~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~ 386 (519) =-.. .....++.+.....- -++. ...+..+..+...+.. .+...+.+|++|+....|.. T Consensus 132 G~gt------------~~~~~~~~~~~~~~~----~~~~---~~~~~~~~~~~~~l~~--~~~~~a~~vmn~~~~~~L~~ 190 (397) T protein:vir:23 132 GTNA------------PSAFQGYLDQSNKTQ----SISP---NAYQGLGVSGLTKLVT--DGKKWTHTLLDDTVEPVLNG 190 (397) T ss_pred cccC------------Cccccccccccccee----eecc---cchhHHHHHHHHhhhh--cccCCCEEEEcHHHHHHHHH Confidence 1110 000111111100000 0000 0111112222333332 23456789999999988875 Q ss_pred c----CcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc----------eEEEEEecCCCcc----ceeE- Q lcl|Aclame:pro 387 V----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD----------YFTIGYKGSNEMD----AGIY- 447 (519) Q Consensus 387 ~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------y~~vG~KG~~~~~----~~~f- 447 (519) . |...+.+.. ..........|+|.| ++|+++++.+.+ .+++|..+.-... +++- T Consensus 191 lkd~~G~~i~~~~~------~~~~~~~~~~~tl~G-~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~ 263 (397) T protein:vir:23 191 SVDANGRPLFVEST------YESLTTPFREGRILG-RPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNL 263 (397) T ss_pred hhccCCceeecccc------cccccccccCceeee-eeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeee Confidence 3 222221110 001111112357766 699999887742 2233333322110 0000 Q ss_pred -----eeccc----cccccc--------ccCcccccceeeeee--eeceeecCcccccccCCcceeecCC--chhhhc-c Q lcl|Aclame:pro 448 -----YAPYV----ALTPLR--------GSDPKNFQPVMGFKT--RYGIGINPFADPAAQAPTKRIQNGM--PDIVNS-L 505 (519) Q Consensus 448 -----yaPYv----~~~~~~--------~~dp~s~qP~~g~~t--RY~l~~nP~~~~~~~~~~~~i~~~~--d~~a~~-~ 505 (519) ..||. -..-+| .+||+.|-....--. =|-+.+.|-+ ........+|. ..++-. . T Consensus 264 ~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~a~ 339 (397) T protein:vir:23 264 GSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLDGAS----AGNFTLSLDGKTSANIAYNAS 339 (397) T ss_pred ccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeeecccccC----cceEEEEecCccccCcccccc Confidence 00110 000011 122322211111000 0111111111 00011112221 000000 0 Q ss_pred cchhhhhhhhhcC-C Q lcl|Aclame:pro 506 GLNGYFRRVYVKG-I 519 (519) Q Consensus 506 ~~~~y~r~v~v~~-~ 519 (519) ....=-..+.+-| + T Consensus 340 ~~~~~~~~~~~~~~~ 354 (397) T protein:vir:23 340 TATVKSAIVAIDDGV 354 (397) T ss_pred hhhhHHHhhhccccc Confidence 0000001111111 0 No 79 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=71.30 E-value=0.2 Score=24.45 Aligned_cols=338 Identities=11% Similarity=-0.005 Sum_probs=125.3 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhH---HHHHhhhhh----ccch----hhhhhhhhhhhhhhhc-c Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQ---EQDILTAPE----YRDE----KISEAFGSFLTEAEIG-G 68 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq---~~~~~~~~~----~~~~----~~~~~~~~~~~~~~~~-~ 68 (519) +..+. .++-.-+.. ++ +.+.++|-+-+ ++....... -+.. .-.+.+..++...... + T Consensus 32 ~~~e~-~~~~~~~~~-----e~-----~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (390) T protein:vir:97 32 LNASA-RSKVDELFA-----TV-----GNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSA 100 (390) T ss_pred CCHHH-HHHHHHHHH-----HH-----HHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhhh Confidence 11111 111111110 01 01111111100 000000000 0000 0001111111111000 0 Q ss_pred ccccchhhhcc-----ccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccc Q lcl|Aclame:pro 69 DHGYDATNIAA-----GQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFH 143 (519) Q Consensus 69 ~~g~~~~~~~e-----st~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~ 143 (519) ........... +++++...-....+-.+++++.++.+..+++.+-||++++.-+.- ...... T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~----~~~~~~--------- 167 (390) T protein:vir:97 101 RATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQ----ETGFVN--------- 167 (390) T ss_pred hhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEE----EecCCc--------- Confidence 00111111111 111111111111223344445556677788899888776532110 000000 Q ss_pred cccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecc Q lcl|Aclame:pro 144 PMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIA 223 (519) Q Consensus 144 ~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~ 223 (519) .+. -++ T Consensus 168 -----~a~---------------------------------------------------------------------~v~ 173 (390) T protein:vir:97 168 -----NAA---------------------------------------------------------------------IVA 173 (390) T ss_pred -----cee---------------------------------------------------------------------eec Confidence 000 000 Q ss_pred cccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 224 EGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINRE 303 (519) Q Consensus 224 ~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINRe 303 (519) +| ..+++-..++++++...|.-+-...+|-||.+|-- +.++.|.+-|+..|...||+. T Consensus 174 Eg-----------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~la~a~~~~~d~a 231 (390) T protein:vir:97 174 EG-----------------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAE 231 (390) T ss_pred CC-----------------ccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHH Confidence 11 01112222234444444444446789999999852 468999999999999999888 Q ss_pred HHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHH Q lcl|Aclame:pro 304 VIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNV 383 (519) Q Consensus 304 ii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~ 383 (519) ||. | +-++..+.|++............-. ...+..|..+-..+. ..+...+.+|++|..... T Consensus 232 ~l~--------G----~g~~~~p~Gi~~~~~~~~~~~~~~~----~~~~d~~~~~~~~~~--~~~~~~~~~v~n~~~~~~ 293 (390) T protein:vir:97 232 ILR--------G----TGANDGLLGLIPQATTYAAPTTIAG----ATRVDQLRLAMLQAS--LAEYPASGIVINPIDWAA 293 (390) T ss_pred Hhh--------c----CCCCccccceeeccccccccccccc----cchHHHHHHHHHhhc--cccCCCCEEEEcHHHHHH Confidence 874 1 1112234565543211111000000 111112222222222 233356788999999888 Q ss_pred HHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCc- Q lcl|Aclame:pro 384 LAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDP- 462 (519) Q Consensus 384 L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp- 462 (519) |...= + + .+...+.....+. -++|.| ++|++++..+.+-+++|--. ..+++...-.++.....+. T Consensus 294 L~~lk--d-~---~G~~l~~~~~~~~--~~~l~G-~pV~~~~~~~~~~~~~gd~~-----~~~~~~~~~~~~i~~~~~~~ 359 (390) T protein:vir:97 294 IELAK--D-A---NNQYLIGNARGTL--TPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQWDARVEIGYVND 359 (390) T ss_pred HHHhh--c-C---CCceeecCccCCC--Cceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEecceEEEEeeccc Confidence 87422 1 1 0011111111111 246766 69999998887655555210 0111111111111111111 Q ss_pred --ccccceeeeeeeeceee-cCcccccccCCcceeecCCchhh Q lcl|Aclame:pro 463 --KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIV 502 (519) Q Consensus 463 --~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a 502 (519) .+-+=.+-+..||++.+ +|- ...++.- | T Consensus 360 ~f~~~~~~~r~~~r~d~~v~~~~-------a~v~~~~-----a 390 (390) T protein:vir:97 360 DFQRNMVTVLAEERLALVVYRPE-------ALITGSF-----A 390 (390) T ss_pred ccccCcEEEEEEEeeccEEeccc-------cEEEEEe-----C Confidence 12222344556777654 121 1222221 1 No 80 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=71.05 E-value=0.2 Score=24.41 Aligned_cols=337 Identities=14% Similarity=0.099 Sum_probs=132.5 Q ss_pred CChHHHHHhhhhhhCCCccccccccch----------------hhhhhhh------hhhHHHHHhhh------------h Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASK----------------QAIIAKI------FENQEQDILTA------------P 46 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~----------------~~~~~~~------~enq~~~~~~~------------~ 46 (519) |+.++|.++|..+.+. ++...+ +.+.+.+ ++.++.++.+. + T Consensus 5 m~l~el~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (404) T protein:vir:39 5 LTVNQLNEAWIASGDK-----VTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (404) T ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 8899999999887654 111110 0111111 00000000000 0 Q ss_pred hcc-------chhhhhhhhhhhhhhhhccccc---cchhhhcccccc-cccc---ccCceehhhHHHHHhhhhhhhceee Q lcl|Aclame:pro 47 EYR-------DEKISEAFGSFLTEAEIGGDHG---YDATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPHLIAFDICGV 112 (519) Q Consensus 47 ~~~-------~~~~~~~~~~~~~~~~~~~~~g---~~~~~~~est~t-g~v~---~~~P~L~~l~Rra~p~LIa~DI~GV 112 (519) ... .++...++..++ . .+... .....+..++++ |.+. .+.+. +++.+.+.....++|.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~---~-~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~---ii~~~~~~~~l~~~~~~ 152 (404) T protein:vir:39 80 GPLNKSEYELKDKFVKEFVNMV---R-NPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTM---INTLVRQYDSLQQYVRV 152 (404) T ss_pred cccccchhhhHHHHHHHHHHHH---h-cchhhhhhhhhhhhhcccccCCceeccHHHHHH---HHHHHHhhhhHHhhcce Confidence 000 000001111111 0 00000 011111112211 1111 22223 33334456678888999 Q ss_pred ccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 113 QPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVE 192 (519) Q Consensus 113 QPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~ 192 (519) .||+++++-+--.|- ..... .+.| T Consensus 153 ~~~~~~~~~~~~~~~--~~~~~--------------~a~~---------------------------------------- 176 (404) T protein:vir:39 153 ESVSTSNGSRVYEKW--TDVTP--------------LTVM---------------------------------------- 176 (404) T ss_pred eeccCCcceEEEEee--cCCcc--------------ceee---------------------------------------- Confidence 999988765432211 00000 0000 Q ss_pred cccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHH Q lcl|Aclame:pro 193 AVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIEL 272 (519) Q Consensus 193 ~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmEL 272 (519) +++|- +. ...+...|.++.|++.|..+-. .+|-|| T Consensus 177 -----------------------------v~Eg~-----~~----~~~~~~~f~~i~~~~~k~~~~~-------~iS~el 211 (404) T protein:vir:39 177 -----------------------------DAEDG-----KI----PDLDNPRLTIIKYLIKRYAGII-------TATNTL 211 (404) T ss_pred -----------------------------ecCcc-----cc----ccccccceeeEEeeeeeEEeee-------hhHHHH Confidence 01110 00 0012345667777777776554 499999 Q ss_pred HHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHH Q lcl|Aclame:pro 273 AQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALL 352 (519) Q Consensus 273 AQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~ 352 (519) .+|- ..|.+++|.+-|+..|..-+|..||.- +.++....++.++++. ..++ T Consensus 212 l~ds----~~~l~~~i~~~l~~~~~~~~d~~il~g------------~g~~~~~~~~~~~~~i-------------~~~~ 262 (404) T protein:vir:39 212 LKDT----AENILAWLSSWIAKKVVVTRNQAIIAA------------MGTVPKKPTIAKFDDV-------------ITMI 262 (404) T ss_pred Hhhc----hHHHHHHHHHHHHHHHHHHHHHHHHhc------------ccccccccccccHHHH-------------HHHH Confidence 9984 356799999999999999999988851 1111122344333211 1111 Q ss_pred HHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceE Q lcl|Aclame:pro 353 FQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYF 432 (519) Q Consensus 353 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 432 (519) .. .+ ...+.....+||+|.....|...= +.. +...+..+.+.. -.++|.| ++|++-.+.. T Consensus 263 ~~------~~--~~~~~~~a~~v~n~~~~~~L~~lk--d~~----G~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~---- 322 (404) T protein:vir:39 263 NT------SV--DPAIIATSSLLTNQSGLNKLALVK--TAE----GKYLLEPDPTKP-NSYLIKG-KKVIVVADRW---- 322 (404) T ss_pred HH------hh--hhhhccCCEEEEcHHHHHHHHHhh--ccC----CceeeccCcCCC-Ccceecc-eeEEEecccc---- Confidence 10 11 111223457899999999988531 100 001111111111 1146776 4776532211 Q ss_pred EEEEecCCCccceeEeeccccc------cccc-ccCc------ccccceeeeeeeeceee-cCcccccccCCcceeecCC Q lcl|Aclame:pro 433 TIGYKGSNEMDAGIYYAPYVAL------TPLR-GSDP------KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGM 498 (519) Q Consensus 433 ~vG~KG~~~~~~~~fyaPYv~~------~~~~-~~dp------~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~ 498 (519) ++-.+.. +..+||.-+-.+ ..+. .+++ ...+=.+-...||+..+ +|-+-.. ..+.... T Consensus 323 -~~~~~~~--~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~-----~~~~~~a 394 (404) T protein:vir:39 323 -LPNSGST--VYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVA-----GSFTAIA 394 (404) T ss_pred -cCccCCC--ccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEE-----EEeeccc Confidence 1111100 011222211100 0000 0111 12344555667777543 3321000 0000000 Q ss_pred c-hhhhcccc Q lcl|Aclame:pro 499 P-DIVNSLGL 507 (519) Q Consensus 499 d-~~a~~~~~ 507 (519) + .-....|| T Consensus 395 ~~~~~~~~~~ 404 (404) T protein:vir:39 395 DQVGNFTAGK 404 (404) T ss_pred cCCCCCCCCC Confidence 1 01122344 No 81 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=69.75 E-value=0.22 Score=24.21 Aligned_cols=284 Identities=13% Similarity=0.129 Sum_probs=123.0 Q ss_pred hccccccccccccCcee-hhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcc Q lcl|Aclame:pro 77 IAAGQTSGAVTQIGPAV-MGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQ 155 (519) Q Consensus 77 ~~est~tg~v~~~~P~L-~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~ 155 (519) ++ ..+++.+. ..|.+ -.+++++.+..+..++|.+-||++.+.-|. ++... .+|. T Consensus 1 m~-t~t~gg~l-iP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~~------~~a~------------- 55 (303) T protein:vir:97 1 MG-TETSKASL-FDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTLD------SDID------------- 55 (303) T ss_pred Cc-ccCCCCeE-cchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEecC------cceE------------- Confidence 33 22333332 23333 245666667888899999999876443221 11000 0000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcc Q lcl|Aclame:pro 156 GAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQE 235 (519) Q Consensus 156 ~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~ 235 (519) -+++| T Consensus 56 -----------------------------------------------------------------wv~E~---------- 60 (303) T protein:vir:97 56 -----------------------------------------------------------------VVAEN---------- 60 (303) T ss_pred -----------------------------------------------------------------EeecC---------- Confidence 01111 Q ss_pred cCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhh Q lcl|Aclame:pro 236 GFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVG 315 (519) Q Consensus 236 ~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~ 315 (519) ..+++-..+++.++..+|.-+-....|-||.|.... ..++-+++|.+-|+..|...|+..++.=...... T Consensus 61 -------~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g-- 130 (303) T protein:vir:97 61 -------GKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK-- 130 (303) T ss_pred -------ccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc-- Confidence 112222233345555555555566899999863322 2466788999999999999999888852211111 Q ss_pred hhcccccccccceeeeccc--cccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccc Q lcl|Aclame:pro 316 KSGMTNTVGAKAGVFDFQD--PIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSY 393 (519) Q Consensus 316 ~~~~t~~~~~~~G~fDl~~--~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~ 393 (519) +.....|...+.. ..-+. .-....++.-|.++-+.+.. .....+.+|++|.....|...- +.. T Consensus 131 ------~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk--d~~ 195 (303) T protein:vir:97 131 ------KASDVIGTNHFDSKVTQVVK-----FTESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTALAKVT--NGE 195 (303) T ss_pred ------cccccccccccccccccccc-----cccccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhh--ccC Confidence 0111111111100 00000 00011233444444444433 2235567999999998886321 110 Q ss_pred ccccccccccccCCCceEEEEecCcEEEEecCCCccce-----EEEEEecCCCccceeEeeccc--ccccccccCccc-- Q lcl|Aclame:pro 394 AAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDY-----FTIGYKGSNEMDAGIYYAPYV--ALTPLRGSDPKN-- 464 (519) Q Consensus 394 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-----~~vG~KG~~~~~~~~fyaPYv--~~~~~~~~dp~s-- 464 (519) +...+..+..-..-.|+|.| ++|+++.+.+... -.+.+-|+- ...+.+...- ++...+..|++. T Consensus 196 ----g~~~~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~Gdf--~~~~~~~~~~~~~~~~~~~~~~d~~~ 268 (303) T protein:vir:97 196 ----MGPKMYPELAWGANPDSING-LKSSVNTTVGAGADEAESKDLVIIGDF--ESMFKWGYAKQIPMEIIKYGDPDNSG 268 (303) T ss_pred ----CCeEEecCccCCCCCceecc-eeeEEecccCCccccCCCccEEEEeec--cccEEEEEecCcEEEEeeccCCCCcc Confidence 01111111111111257887 7999988765311 011122221 1111122111 122222223321 Q ss_pred ---ccc-eeee--eeeeceee-cCcccccccCCcceeecCCc Q lcl|Aclame:pro 465 ---FQP-VMGF--KTRYGIGI-NPFADPAAQAPTKRIQNGMP 499 (519) Q Consensus 465 ---~qP-~~g~--~tRY~l~~-nP~~~~~~~~~~~~i~~~~d 499 (519) |+- .++| ..||+..+ ||=+ ..+|.++.= T Consensus 269 ~~~~~~n~~~~r~~~r~~~~v~~p~a-------f~~l~~~~~ 303 (303) T protein:vir:97 269 KDLKGYNQIYLRAEAYIGWGILDAKS-------FARVTKGEV 303 (303) T ss_pred hhhhhcCcEEEEEEEEeccEeecccc-------eEEeeCCCC Confidence 221 2333 55776543 3321 233333211 No 82 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=69.60 E-value=0.22 Score=24.18 Aligned_cols=286 Identities=13% Similarity=0.043 Sum_probs=116.6 Q ss_pred hccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccc Q lcl|Aclame:pro 77 IAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQG 156 (519) Q Consensus 77 ~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~ 156 (519) .+..++++...--....-.+++++-+..+..+++-+-||+.... +|+-.... .+| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~-------~~p~~~~~---~~a--------------- 55 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNE-------DIITFNGR---PKA--------------- 55 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEeCC---cee--------------- Confidence 22222222221111112335555666677777777877765321 11110000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhccc Q lcl|Aclame:pro 157 AAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEG 236 (519) Q Consensus 157 ~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~ 236 (519) .-+++| T Consensus 56 ---------------------------------------------------------------~wv~Eg----------- 61 (311) T protein:vir:99 56 ---------------------------------------------------------------EFVGEG----------- 61 (311) T ss_pred ---------------------------------------------------------------EEeecC----------- Confidence 001111 Q ss_pred CCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhh Q lcl|Aclame:pro 237 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGK 316 (519) Q Consensus 237 ~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 316 (519) ..+++...++++++..+|.-+-....|-||.|+-.- -..|-+++|.+.|...|+..|++.+|.-.....--+. T Consensus 62 ------~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~ 134 (311) T protein:vir:99 62 ------QQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVI 134 (311) T ss_pred ------cccccccceeeEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccc Confidence 122333344456666666666678899999763321 1355688888888888888888888862110000000 Q ss_pred hcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccc Q lcl|Aclame:pro 317 SGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQ 396 (519) Q Consensus 317 ~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~ 396 (519) .+...--....+.+..... .+ -.+..-|+.+-..+...-.+...+-.|++|+....|...- +.. T Consensus 135 ~g~~~~~~~~~~~~~~~~~------~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk--d~~--- 198 (311) T protein:vir:99 135 PGWSNYLGAASKRVELTAD------TI-----ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTAR--YTD--- 198 (311) T ss_pred cccccccccccceeecccc------cc-----chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhh--ccC--- Confidence 0110000000111111110 00 1112223333333333222233466899999999887431 100 Q ss_pred cccccccccCCCceEEEEecCcEEEEecCCCc----------------cceEEEEEecCCCccceeEeeccccccc--cc Q lcl|Aclame:pro 397 GLGQGFNVDTTKAVFAGVLGGKYRVYIDQYAR----------------SDYFTIGYKGSNEMDAGIYYAPYVALTP--LR 458 (519) Q Consensus 397 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~----------------~dy~~vG~KG~~~~~~~~fyaPYv~~~~--~~ 458 (519) +...+..+.+.. -.++|.| ++|++..+-+ .+++++|= ...++.|.-.-...+ .+ T Consensus 199 -G~~l~~~~~~~~-~~~~l~G-~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gd-----f~~~~~~~~~~~~~~~~~~ 270 (311) T protein:vir:99 199 -GRKKFPELGLGI-GVSSFEG-IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGD-----FANGIHWGVQRDIPVELIK 270 (311) T ss_pred -CCeeecCcccCC-CCceecc-eeeEeecccccccccccccchhhccCcceEEEee-----ccccEEEEEecCceEEEee Confidence 011111111111 1256777 5888876533 12333321 011222221111111 11 Q ss_pred ccCccccc-----ceeee--eeeeceeecCcccccccCCcceeecCCchhh Q lcl|Aclame:pro 459 GSDPKNFQ-----PVMGF--KTRYGIGINPFADPAAQAPTKRIQNGMPDIV 502 (519) Q Consensus 459 ~~dp~s~q-----P~~g~--~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~a 502 (519) .-|++... --++| ..|||..+-+ +...++.++ .| T Consensus 271 ~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~-------~~~v~~~~~---~A 311 (311) T protein:vir:99 271 YGDPDGQGDLKRHNQIALRLEIVYGWYVFT-------DRFVVIENA---VA 311 (311) T ss_pred cCCCCcchhhhhcCcEEEEEEEeecceecC-------hhHeeeecc---cC Confidence 11233211 12333 5788865422 112333321 12 No 83 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=66.59 E-value=0.27 Score=23.74 Aligned_cols=279 Identities=12% Similarity=-0.005 Sum_probs=113.1 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc-e--ecccccc Q lcl|Aclame:pro 151 MFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL-A--EIAEGMA 227 (519) Q Consensus 151 ~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~-~--~~~~Gms 227 (519) +.-+...... .+....+....+.... ............ ...........++.. + ..+.+-. T Consensus 1 ma~~~~~~~~------~~~t~~gg~lip~~~~--~~ii~~~~~~~~--------l~~~~~~~~~~~~~~~ip~~~~~~~a 64 (304) T protein:vir:94 1 MATPTYTPGN------VILSDFKNGVIPAEQG--TLIMKDIMANSA--------IMKLAKNEPMTAQKKKFTYLAKGVGA 64 (304) T ss_pred Cccccccccc------ccccCCCceecchhHH--HHHHHHHHhccc--------hhhhcceeeccCCceEEEEEeCCcce Confidence 1111100000 0000000000000000 000000000000 000000000000000 0 0011000 Q ss_pred hhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 228 TSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDW 307 (519) Q Consensus 228 Ta~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~ 307 (519) .-.+| +..+++-.-++++++++.|..+-...+|-||.+|- .+|.|+.|.+-|...|...||+.++.= T Consensus 65 ~~v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G 131 (304) T protein:vir:94 65 YWVSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFG 131 (304) T ss_pred EEeec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheec Confidence 01112 34567777778888998888888999999999975 477899999999999999999988751 Q ss_pred HHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhc Q lcl|Aclame:pro 308 INYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV 387 (519) Q Consensus 308 i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~ 387 (519) --... ++.....+++.-...... ........+.-|+++...+...= .....+||+|.....|... T Consensus 132 ~g~~~--------~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~--~~~~~~v~~~~~~~~L~~l 196 (304) T protein:vir:94 132 TKSPY--------NTSTSGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDEE--LDPNGVLTTRSFRSKMRNA 196 (304) T ss_pred cCCCc--------cccccccccccccccccc-----ccccccchHHHHHHHHHHhhhcc--CCcCEEEEcHHHHHHHHHh Confidence 00000 000001111110000000 00111223444556656665432 2445689999999988742 Q ss_pred CcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc------------eEEEEEecCCCccceeEeecccccc Q lcl|Aclame:pro 388 DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD------------YFTIGYKGSNEMDAGIYYAPYVALT 455 (519) Q Consensus 388 g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~~~~~fyaPYv~~~ 455 (519) = + +. +...+ +++ .|+|.| ++||++++.+.+ ++++|..+....+ ...+.. T Consensus 197 k--d-~~---G~~l~----~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~ 257 (304) T protein:vir:94 197 L--D-AN---DRPLF----DAN--GNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDAT 257 (304) T ss_pred h--c-cC---CcEee----cCC--Cccccc-eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE------Eeecce Confidence 1 1 10 00011 111 156776 699988887642 2333333322110 000110 Q ss_pred --cccccCcc-----ccc---ceeeeeeeeceeecCcccccccCCcceeecCC Q lcl|Aclame:pro 456 --PLRGSDPK-----NFQ---PVMGFKTRYGIGINPFADPAAQAPTKRIQNGM 498 (519) Q Consensus 456 --~~~~~dp~-----s~q---P~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~~ 498 (519) +....|++ -|+ =.+=+..||++.+ ...+...++...+ T Consensus 258 ~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v------~~~~a~~~l~~a~ 304 (304) T protein:vir:94 258 LTTLQASDASGQPVSLFERDMFALRATMHIAYMN------VKPEAFATLKPTE 304 (304) T ss_pred eeeecccccCccchhhhhcCcEEEEEEEEeccEe------ecccceEEEEecC Confidence 11111222 122 2233456777554 1112234454443 No 84 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=66.59 E-value=0.27 Score=23.74 Aligned_cols=279 Identities=12% Similarity=-0.005 Sum_probs=113.1 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc-e--ecccccc Q lcl|Aclame:pro 151 MFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL-A--EIAEGMA 227 (519) Q Consensus 151 ~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~-~--~~~~Gms 227 (519) +.-+...... .+....+....+.... ............ ...........++.. + ..+.+-. T Consensus 1 ma~~~~~~~~------~~~t~~gg~lip~~~~--~~ii~~~~~~~~--------l~~~~~~~~~~~~~~~ip~~~~~~~a 64 (304) T protein:vir:10 1 MATPTYTPGN------VILSDFKNGVIPAEQG--TLIMKDIMANSA--------IMKLAKNEPMTAQKKKFTYLAKGVGA 64 (304) T ss_pred Cccccccccc------ccccCCCceecchhHH--HHHHHHHHhccc--------hhhhcceeeccCCceEEEEEeCCcce Confidence 1111100000 0000000000000000 000000000000 000000000000000 0 0011000 Q ss_pred hhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 228 TSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDW 307 (519) Q Consensus 228 Ta~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~ 307 (519) .-.+| +..+++-.-++++++++.|..+-...+|-||.+|- .+|.|+.|.+-|...|...||+.++.= T Consensus 65 ~~v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G 131 (304) T protein:vir:10 65 YWVSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFG 131 (304) T ss_pred EEeec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheec Confidence 01112 34567777778888998888888999999999975 477899999999999999999988751 Q ss_pred HHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhc Q lcl|Aclame:pro 308 INYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV 387 (519) Q Consensus 308 i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~ 387 (519) --... ++.....+++.-...... ........+.-|+++...+...= .....+||+|.....|... T Consensus 132 ~g~~~--------~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~--~~~~~~v~~~~~~~~L~~l 196 (304) T protein:vir:10 132 TKSPY--------NTSTSGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDEE--LDPNGVLTTRSFRSKMRNA 196 (304) T ss_pred cCCCc--------cccccccccccccccccc-----ccccccchHHHHHHHHHHhhhcc--CCcCEEEEcHHHHHHHHHh Confidence 00000 000001111110000000 00111223444556656665432 2445689999999988742 Q ss_pred CcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc------------eEEEEEecCCCccceeEeecccccc Q lcl|Aclame:pro 388 DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD------------YFTIGYKGSNEMDAGIYYAPYVALT 455 (519) Q Consensus 388 g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~~~~~fyaPYv~~~ 455 (519) = + +. +...+ +++ .|+|.| ++||++++.+.+ ++++|..+....+ ...+.. T Consensus 197 k--d-~~---G~~l~----~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~ 257 (304) T protein:vir:10 197 L--D-AN---DRPLF----DAN--GNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDAT 257 (304) T ss_pred h--c-cC---CcEee----cCC--Cccccc-eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE------Eeecce Confidence 1 1 10 00011 111 156776 699988887642 2333333322110 000110 Q ss_pred --cccccCcc-----ccc---ceeeeeeeeceeecCcccccccCCcceeecCC Q lcl|Aclame:pro 456 --PLRGSDPK-----NFQ---PVMGFKTRYGIGINPFADPAAQAPTKRIQNGM 498 (519) Q Consensus 456 --~~~~~dp~-----s~q---P~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~~ 498 (519) +....|++ -|+ =.+=+..||++.+ ...+...++...+ T Consensus 258 ~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v------~~~~a~~~l~~a~ 304 (304) T protein:vir:10 258 LTTLQASDASGQPVSLFERDMFALRATMHIAYMN------VKPEAFATLKPTE 304 (304) T ss_pred eeeecccccCccchhhhhcCcEEEEEEEEeccEe------ecccceEEEEecC Confidence 11111222 122 2233456777554 1112234454443 No 85 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=64.25 E-value=0.3 Score=23.42 Aligned_cols=305 Identities=8% Similarity=-0.009 Sum_probs=125.8 Q ss_pred hhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeecc Q lcl|Aclame:pro 35 FENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQP 114 (519) Q Consensus 35 ~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQP 114 (519) +|..+|.-.+.+.+ ..-..-+. -+++.....++..+.. --..+.-.+++.+..+-+..++|-+-| T Consensus 1 ~~~~~~~~~~~~~f------------~~~~~~~~--~~~a~~~~~~~~~~~l-iP~~~~~~ii~~~~~~s~l~~~~~~~~ 65 (324) T protein:vir:10 1 MEQTQKLKLNLQHF------------ASNNVKPQ--VFNPDNVMMHEKKDGT-LLNDFTTPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCCchHHHHHHHHH------------HHHhhccc--eecccceeccCCCcce-echhHHHHHHHHHHhhchhhhhcceee Confidence 33222211111111 10000000 1111111111111111 011111223444555666778888888 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |++.+.-|. +.... .+| . T Consensus 66 ~~~~~~~~p----~~~~~------~~a---------~------------------------------------------- 83 (324) T protein:vir:10 66 MEGTEKKFT----FWADK------PGA---------Y------------------------------------------- 83 (324) T ss_pred ccCCceEEE----EEeCC------cce---------e------------------------------------------- Confidence 886542111 00000 000 0 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) -++ | +..+++...+++++++..|..+..-..|-||.+ T Consensus 84 --------------------------~v~--------E---------g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ 120 (324) T protein:vir:10 84 --------------------------WVG--------E---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred --------------------------Eec--------c---------CccccccccceeEEEEeeEEEEEeehhhHHHHh Confidence 001 1 122344445566777777777777889999999 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQ 354 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~ 354 (519) |-. .|.+++|.+.|+..|...|++.+|.--.. +..+.|+++........ .. ..-.+.. T Consensus 121 ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~------------~~~~~~i~~~~~~~~~~---~~---~~~t~~~ 178 (324) T protein:vir:10 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN------------NPFGKSIAQSIEKTNKV---IK---GDFTQDN 178 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCC------------CccCcccccccccccee---cc---ccCCHHH Confidence 864 46799999999999999999999852111 11122332211110000 00 0011223 Q ss_pred HHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc--ceE Q lcl|Aclame:pro 355 IDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS--DYF 432 (519) Q Consensus 355 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~ 432 (519) |.++.+.|.. .+...+.+|++|.....|...-- + + .+..-.+..+ ++|.| ++|++.+.... ..+ T Consensus 179 i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d---~--~--g~~~~~~~~~----~~l~G-~PV~~~~~~~~~~~~~ 244 (324) T protein:vir:10 179 IIDLEALLED--DELEANAFISKTQNRSLLRKIVD---P--E--TKERIYDRNS----DTLDG-LPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc---c--C--CceeecCCCC----ccccc-eeEEeecCCCCCcceE Confidence 3444444432 33455778999999998875421 1 0 0111112222 35666 58888776553 233 Q ss_pred EEEEecCCCccceeEeecccc--------cccccccCcc--------cccceeeeeeeeceee-cCcccccccCCcceee Q lcl|Aclame:pro 433 TIGYKGSNEMDAGIYYAPYVA--------LTPLRGSDPK--------NFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQ 495 (519) Q Consensus 433 ~vG~KG~~~~~~~~fyaPYv~--------~~~~~~~dp~--------s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~ 495 (519) ++|-. +.+++...-. .......|+. +-+=.+=...||+..+ ||=+ .+++. T Consensus 245 ~~gd~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A-------~~~l~ 311 (324) T protein:vir:10 245 ITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLV 311 (324) T ss_pred EEEec------ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEE Confidence 33321 0111111000 0000001111 1122333446777533 3321 12232 Q ss_pred cCCchhhhcccch Q lcl|Aclame:pro 496 NGMPDIVNSLGLN 508 (519) Q Consensus 496 ~~~d~~a~~~~~~ 508 (519) .....-.-..++= T Consensus 312 ~a~~~~~~~~~~~ 324 (324) T protein:vir:10 312 PADKKTDSVPGEV 324 (324) T ss_pred eccCCCCCCCCCC Confidence 2111100011111 No 86 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=63.46 E-value=0.32 Score=23.32 Aligned_cols=274 Identities=10% Similarity=0.008 Sum_probs=116.3 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCcc Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDN 243 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~ 243 (519) .+... +...+...+-.. +... ...........+.... +....+ ..|.+.+++.=-.+..+|.+. ....- T Consensus 1 ma~~~-T~~~d~iiPev~-~~~v-~~~~~~~l~~~~~~~~---d~~l~g--~~G~tv~iP~~~~~g~a~~~~---~g~~i 69 (274) T protein:vir:97 1 MPQGL-TKTSDQIIPEVL-APMM-QAQLEKKLRFASFAEV---DSTLQG--QPGDTLTFPAFVYSGDAQVVA---EGEKI 69 (274) T ss_pred CCccc-eehhheechHHH-HHHH-HHhhhhhhhhccccee---cccccC--CCCCEEEEeeecCCCcccccc---CCCcc Confidence 11111 011111111000 0000 0000000000000000 000000 012222222100111222221 11122 Q ss_pred ccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccc Q lcl|Aclame:pro 244 PWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTV 323 (519) Q Consensus 244 ~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~ 323 (519) ...++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.|..+-++..|..+++.+++..+..++... T Consensus 70 ~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~-------- 135 (274) T protein:vir:97 70 PTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred cccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------- Confidence 334443 33444444555522222222 22333 46888899999999999999999998764322110 Q ss_pred cccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccccc Q lcl|Aclame:pro 324 GAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFN 403 (519) Q Consensus 324 ~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~ 403 (519) .+..++ ++-+-.+..++.++. ..+++++|+|.+++.|.......|..+- ... T Consensus 136 --~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s----~~g 187 (274) T protein:vir:97 136 --NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDASTNFTRAT----ELG 187 (274) T ss_pred --cccccC-------------HHHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhhhhhccccC----ccc Confidence 011122 233334444444321 2568999999999999875433332210 011 Q ss_pred ccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeecCcc Q lcl|Aclame:pro 404 VDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGINPFA 483 (519) Q Consensus 404 ~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~nP~~ 483 (519) .....+-.+|.+.| ++||+|+..|..-..+--+| .+-|.---+.......|+..+.=.+-..-+||+.+ . T Consensus 188 ~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~---~ 257 (274) T protein:vir:97 188 DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYL---Y 257 (274) T ss_pred ccceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCceeccccchhhcccEEEEEEEEEEEE---E Confidence 11122334678876 79999999885432222122 22221111222222359999999999999999754 1 Q ss_pred cccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 484 DPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 484 ~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) ...+-.++... .++-.| T Consensus 258 ---~~~~vv~~t~~----~~~~~~ 274 (274) T protein:vir:97 258 ---DESKAVKITKG----SGSLEM 274 (274) T ss_pred ---cCCceEEEecC----cccccC Confidence 11111222211 112233 No 87 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=63.46 E-value=0.32 Score=23.32 Aligned_cols=274 Identities=10% Similarity=0.008 Sum_probs=116.3 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCcc Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDN 243 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~ 243 (519) .+... +...+...+-.. +... ...........+.... +....+ ..|.+.+++.=-.+..+|.+. ....- T Consensus 1 ma~~~-T~~~d~iiPev~-~~~v-~~~~~~~l~~~~~~~~---d~~l~g--~~G~tv~iP~~~~~g~a~~~~---~g~~i 69 (274) T protein:vir:94 1 MPQGL-TKTSDQIIPEVL-APMM-QAQLEKKLRFASFAEV---DSTLQG--QPGDTLTFPAFVYSGDAQVVA---EGEKI 69 (274) T ss_pred CCccc-eehhheechHHH-HHHH-HHhhhhhhhhccccee---cccccC--CCCCEEEEeeecCCCcccccc---CCCcc Confidence 11111 011111111000 0000 0000000000000000 000000 012222222100111222221 11122 Q ss_pred ccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccc Q lcl|Aclame:pro 244 PWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTV 323 (519) Q Consensus 244 ~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~ 323 (519) ...++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.|..+-++..|..+++.+++..+..++... T Consensus 70 ~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~-------- 135 (274) T protein:vir:94 70 PTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred cccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------- Confidence 334443 33444444555522222222 22333 46888899999999999999999998764322110 Q ss_pred cccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccccc Q lcl|Aclame:pro 324 GAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFN 403 (519) Q Consensus 324 ~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~ 403 (519) .+..++ ++-+-.+..++.++. ..+++++|+|.+++.|.......|..+- ... T Consensus 136 --~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s----~~g 187 (274) T protein:vir:94 136 --NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDASTNFTRAT----ELG 187 (274) T ss_pred --cccccC-------------HHHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhhhhhccccC----ccc Confidence 011122 233334444444321 2568999999999999875433332210 011 Q ss_pred ccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeecCcc Q lcl|Aclame:pro 404 VDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGINPFA 483 (519) Q Consensus 404 ~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~nP~~ 483 (519) .....+-.+|.+.| ++||+|+..|..-..+--+| .+-|.---+.......|+..+.=.+-..-+||+.+ . T Consensus 188 ~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~---~ 257 (274) T protein:vir:94 188 DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDFFLEVARDASTKTTALYSDKHYVAYL---Y 257 (274) T ss_pred ccceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCceeccccchhhcccEEEEEEEEEEEE---E Confidence 11122334678876 79999999885432222122 22221111222222359999999999999999754 1 Q ss_pred cccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 484 DPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 484 ~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) ...+-.++... .++-.| T Consensus 258 ---~~~~vv~~t~~----~~~~~~ 274 (274) T protein:vir:94 258 ---DESKAVKITKG----SGSLEM 274 (274) T ss_pred ---cCCceEEEecC----cccccC Confidence 11111222211 112233 No 88 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=61.37 E-value=0.35 Score=23.05 Aligned_cols=300 Identities=9% Similarity=0.000 Sum_probs=123.9 Q ss_pred hhhhHHHHHhhhhhccchhhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeec Q lcl|Aclame:pro 34 IFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQ 113 (519) Q Consensus 34 ~~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQ 113 (519) |.=|-++ ..+.+...+. +.+..+++++.-.--.+.+=.+++.+.+..+-..++-+- T Consensus 1 ~~~~~~r--------~~~~~~~~e~----------------~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~ 56 (326) T protein:vir:42 1 MAVNPDR--------TTPFLGVNDP----------------KVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI 56 (326) T ss_pred CCCCccc--------hhhhcCcchh----------------hheeccccCCcceechhhHHHHHHHHHhcchhhhhccee Confidence 1111100 0011111111 111111211111111222233445555556667778888 Q ss_pred cCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 114 PLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEA 193 (519) Q Consensus 114 PmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~ 193 (519) ||++++. | |+-.... . .+ T Consensus 57 ~~~~~~~-----~--~p~~~~~---~---------~a------------------------------------------- 74 (326) T protein:vir:42 57 PMGTTGQ-----K--IPHWTGD---V---------SA------------------------------------------- 74 (326) T ss_pred eccCCce-----E--EEEEeCC---c---------ce------------------------------------------- Confidence 8876542 1 1100000 0 00 Q ss_pred ccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 194 VTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELA 273 (519) Q Consensus 194 ~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELA 273 (519) .-++ | +..++|-..+++++++.+|...-.-.+|-||. T Consensus 75 --------------------------~~v~--------E---------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell 111 (326) T protein:vir:42 75 --------------------------SWIG--------E---------GDMKPITKGNMTSQTIAPHKIATIFVASAETV 111 (326) T ss_pred --------------------------EEec--------C---------CccccccccceeEEEEeeEEEEEeehhhHHHH Confidence 0001 1 12233444556677777777777888999999 Q ss_pred HHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHH-HHHhhhhhhhhcccccccccceeeeccccccc----cccchHHHHH Q lcl|Aclame:pro 274 QDLRAVHGMDADAELSGILATEIMLEINREVID-WINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI----RGARWAGESF 348 (519) Q Consensus 274 QDLKAiHGLDAEaELsNILSTEImlEINReii~-~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~----~~~~~a~e~~ 348 (519) +|- ..|.++.|.+-|+..|...+++.++. .=. +.+.|+......... .-+-+..... T Consensus 112 ~~s----~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs--------------~~p~gi~~~~~~~~~~~~~~~~~~~~~~~ 173 (326) T protein:vir:42 112 RAN----PANYLGTMRTKVATAFAMAFDNAAINGTDS--------------PFPTFLAQTTKEVSLVDPDGTGSNADLTV 173 (326) T ss_pred hcC----HHHHHHHHHHHHHHHHHHHHHHHhhcccCC--------------Cccccccccccccceeecccccccccchh Confidence 984 36789999999999999999999985 100 001111111000000 0000000001 Q ss_pred HHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhc----CcccccccccccccccccCCCceEEEEecCcEEEEec Q lcl|Aclame:pro 349 KALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYID 424 (519) Q Consensus 349 r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D 424 (519) ..+. +..+..... ..+..++..|++|.....|... |...+.+. ...........++|.| ++|+++ T Consensus 174 ~~~~--~~~~~~~~~--~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~------~~~~~~~~~~~~~l~G-~pv~~~ 242 (326) T protein:vir:42 174 YDAV--AVNALSLLV--NAGKKWTHTLLDDITEPILNGAKDKSGRPLFIES------TYTEENSPFRLGRIVA-RPTILS 242 (326) T ss_pred HHHH--HHHHHhhhh--hhccCccEEEEeHHHHHHHHHhhccCCceeeccc------cccCccccccCceeee-eeEEEc Confidence 1111 111111111 2233557889999999998753 22111111 0011111222356666 799999 Q ss_pred CCCccceEEEEEecCCCccceeEeecccccccccc---------cCccc-----cc---ceeeeeeeeceeecCcccccc Q lcl|Aclame:pro 425 QYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRG---------SDPKN-----FQ---PVMGFKTRYGIGINPFADPAA 487 (519) Q Consensus 425 ~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~---------~dp~s-----~q---P~~g~~tRY~l~~nP~~~~~~ 487 (519) ++.+.+=. +++-|+-. -+||...-.. .++. .|+.. || =.+=...|++..+ .. T Consensus 243 ~~~~~~~~-~~~~Gd~s---~~~~~~~~~~-~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v------~~ 311 (326) T protein:vir:42 243 DHVASGTV-VGYQGDFR---QLVWGQVGGL-SFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHC------ND 311 (326) T ss_pred CCCCCCce-EEEEeecc---eEEEEEecce-EEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE------ec Confidence 98775321 12222211 1223222111 1111 11111 22 3334566777544 11 Q ss_pred cCCcceeecCCchhhhcccch Q lcl|Aclame:pro 488 QAPTKRIQNGMPDIVNSLGLN 508 (519) Q Consensus 488 ~~~~~~i~~~~d~~a~~~~~~ 508 (519) .+.-++|.+- .+ .++ T Consensus 312 ~~a~~~l~~~---~~---~~~ 326 (326) T protein:vir:42 312 KDAFVKLTNV---DA---TEA 326 (326) T ss_pred ccceEEEeec---cc---cCC Confidence 1112334321 11 111 No 89 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=61.15 E-value=0.36 Score=23.02 Aligned_cols=351 Identities=11% Similarity=0.015 Sum_probs=126.0 Q ss_pred CC--hH----------HHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhh----hhh----------ccchhhh Q lcl|Aclame:pro 1 MK--KN----------ALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILT----APE----------YRDEKIS 54 (519) Q Consensus 1 ~~--~~----------~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~----~~~----------~~~~~~~ 54 (519) .+ .+ .|.++...-++. -+.. ...++..++.... ... -..+... T Consensus 21 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~---------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (419) T protein:vir:94 21 TSLTTEQVQEIVAEARGLADALQAESDR---------AAAR--AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFA 89 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHH--HHHHHHHHHHHHHHhhhhccccccccccccchhhhhh Confidence 00 01 111111111111 0000 0011111110000 000 0000000 Q ss_pred hhhhhhhhhhhhcccccc-------------chhhhccccccccccccCceehhhHH--HHHhhhhhhhceeeccCCccc Q lcl|Aclame:pro 55 EAFGSFLTEAEIGGDHGY-------------DATNIAAGQTSGAVTQIGPAVMGMVR--RAIPHLIAFDICGVQPLNNPT 119 (519) Q Consensus 55 ~~~~~~~~~~~~~~~~g~-------------~~~~~~est~tg~v~~~~P~L~~l~R--ra~p~LIa~DI~GVQPmTGPT 119 (519) +. ..+.+.......+. ..+....+.++.+-...-|.+++=.. +.-..+...++|.+.||++++ T Consensus 90 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~ 167 (419) T protein:vir:94 90 DS--DGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNV 167 (419) T ss_pred hH--HHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCc Confidence 00 00000000000000 00000111111111112233222111 111234557889999998764 Q ss_pred hhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCC Q lcl|Aclame:pro 120 GQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAG 199 (519) Q Consensus 120 GLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag 199 (519) .-+ +|..-.+. ...++. T Consensus 168 ~~~--~~~~~~~~-----------------~~~~~~-------------------------------------------- 184 (419) T protein:vir:94 168 LEY--IRDTSGTA-----------------GAGSTW-------------------------------------------- 184 (419) T ss_pred eee--eeeccccc-----------------cccccC-------------------------------------------- Confidence 221 11110000 000000 Q ss_pred CCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhh Q lcl|Aclame:pro 200 ATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAV 279 (519) Q Consensus 200 ~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAi 279 (519) + ...-+++ +..+++...++++++..+|.=+-...+|-||.||.- T Consensus 185 --~---------------~a~~v~E-----------------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-- 228 (419) T protein:vir:94 185 --N---------------KAAVVPE-----------------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-- 228 (419) T ss_pred --c---------------ccceecC-----------------CccccccccceeeEEeeeeeEEEeehhhHHHHHhHH-- Confidence 0 0000111 122444445555666666666666789999999962 Q ss_pred cCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccc-cccchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 HGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDI-RGARWAGESFKALLFQIDKE 358 (519) Q Consensus 280 HGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~-~~~~~a~e~~r~L~~~i~~~ 358 (519) +.+++|.+-|+..|...+|+.||. | +..+.+.|++........ ...-+.....-..+..|.++ T Consensus 229 ---~l~~~i~~~la~a~~~~~d~aii~--------G-----~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~ 292 (419) T protein:vir:94 229 ---QLMGYIQGRLTYGLRFLRDRQLLN--------G-----NGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRA 292 (419) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHh--------c-----cCcccccceecccccccccccccccccccchhHHHHHHH Confidence 358999999999999999999985 1 011123344322110000 00001111112234444444 Q ss_pred HHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEec Q lcl|Aclame:pro 359 AAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKG 438 (519) Q Consensus 359 a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG 438 (519) -+.+.. .+...+.+||+|.....|...= +.. +....+..+... -..++|.| ++|+++...+..=+++|--. T Consensus 293 ~~~~~~--~~~~~~~~v~n~~~~~~l~~~k--~~~---~~~~~~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~gd~~ 363 (419) T protein:vir:94 293 KTVAEI--AGFPPDGVVVHPQDWESIELDQ--APG---SGVFRVIANVQG-EATPRIWG-LNVVSTVAIAQGTALVGGFR 363 (419) T ss_pred HHhhhh--ccCCCCEEEEcHHHHHHHHHHh--hcC---CCceeecCCccc-CCCccccc-eeeEEcCCCCCccEEEeecc Confidence 444443 2235678999999988876431 100 000011111111 11246776 69999998775434444110 Q ss_pred CC-----CccceeEeecccccccccccCcccccceeeeeeeeceeecCcccccccCCcceeecC-Cch Q lcl|Aclame:pro 439 SN-----EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGINPFADPAAQAPTKRIQNG-MPD 500 (519) Q Consensus 439 ~~-----~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~-~d~ 500 (519) .. ..+-.+-..++.... =..-+=.+=+..||++.+ . ..+...++.-. -+. T Consensus 364 ~~~~~~~~~~~~v~~~~~~~~~------~~~~~~~~r~~~r~d~~v---~---~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 364 QGATLWSRQGITVLMTDSHADF------FTANTLVILAEFRANLAV---Y---QPKAFVRVTFAAATT 419 (419) T ss_pred ceEEEEEecceEEEEeccccch------hhcCcEEEEEEEeeccEE---e---ccccEEEEEeccCCC Confidence 00 000011111111000 011223344556776544 0 11111222110 011 No 90 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=57.21 E-value=0.44 Score=22.54 Aligned_cols=334 Identities=15% Similarity=0.131 Sum_probs=114.2 Q ss_pred hCCCccccccccchhhhhhhhhhh-HHHHHhhhhhccchhhhhhhhhhhhhhhhccccccc---hhhhcccccccccccc Q lcl|Aclame:pro 14 LENEALPEIVGASKQAIIAKIFEN-QEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYD---ATNIAAGQTSGAVTQI 89 (519) Q Consensus 14 l~~~~~~~~~~~~~~~~~~~~~en-q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~est~tg~v~~~ 89 (519) |-..-.=++++...++...+-=|. |.|-..-.+..+ .+....+..+..+...-.+..+ ...+..++.+|.+. T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~--a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~l-- 76 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVM--SIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGAL-- 76 (366) T ss_pred CcccccccccccccccccccccccccccchhHHHHHH--HHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccc-- Confidence 211111112222222211111110 001000000000 0000000000000000000000 11111111122111 Q ss_pred Cceeh--hhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccc Q lcl|Aclame:pro 90 GPAVM--GMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAAS 167 (519) Q Consensus 90 ~P~L~--~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~ 167 (519) =|.-+ .+++++-+..+...+ |++.+++++|-+ +|+... + T Consensus 77 vP~~~~~~ii~~l~~~s~l~~l-g~~~v~~~~g~~-----~~p~~t--------------------~------------- 117 (366) T protein:vir:57 77 IPQNMQNEVIELLRDRTVVRIL-GARSIPLPNGNL-----SMPRLS--------------------G------------- 117 (366) T ss_pred cchhHHHHHHHHHhhhcchhhh-ceeeeecCCCce-----EEEEEe--------------------C------------- Confidence 02111 111111111111111 222222222110 000000 0 Q ss_pred ccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCcccccc Q lcl|Aclame:pro 168 KVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNE 247 (519) Q Consensus 168 t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~E 247 (519) . ...+-++ | +..+++ T Consensus 118 --------------------------------~----------------~~a~wv~--------E---------~~~~~~ 132 (366) T protein:vir:57 118 --------------------------------G----------------ATAGYVG--------E---------GKDVVA 132 (366) T ss_pred --------------------------------C----------------cceeeec--------c---------Cccccc Confidence 0 0000011 1 112334 Q ss_pred ceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccc Q lcl|Aclame:pro 248 MGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKA 327 (519) Q Consensus 248 MsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~ 327 (519) ...+++++++..|.-+-...+|-||.+|-- .|.|+.|.+-|...|...+++.||.= . | +...+. T Consensus 133 s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G--------~-G---~~~~p~ 196 (366) T protein:vir:57 133 TGATFDDVKLSAKTMIALVPVSNQLIGRAG----FNVEQLLLGDILSAIATREDKAFLRD--------D-G---TGDTPK 196 (366) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHhhcc--------C-C---CCcccc Confidence 444556666777766677789999998753 46799999999999999999888851 0 0 111234 Q ss_pred eeeeccccccccccchHH--HHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccccccc Q lcl|Aclame:pro 328 GVFDFQDPIDIRGARWAG--ESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVD 405 (519) Q Consensus 328 G~fDl~~~~d~~~~~~a~--e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d 405 (519) |++........ ...+.+ --+..+-..++.+.........+..+...|++|.....|...- +.. +...+ .+ T Consensus 197 Gi~~~~~~~~~-~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk--d~~----G~~l~-~~ 268 (366) T protein:vir:57 197 GMKAVATAANR-LVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR--DGN----GNKVY-PE 268 (366) T ss_pred ceeeccccccc-eeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhh--ccC----Cceec-cC Confidence 44432211100 000000 0001111111222222222233334466789999988887532 110 01111 22 Q ss_pred CCCceEEEEecCcEEEEecCCCccc----------------eEEEEEecCCCccceeEeecccccccccccCcc------ Q lcl|Aclame:pro 406 TTKAVFAGVLGGKYRVYIDQYARSD----------------YFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPK------ 463 (519) Q Consensus 406 ~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~------ 463 (519) .+. |+|.| |+|+++.+.|.+ ++++|-.+..+.+ .+++... .|+. T Consensus 269 ~~~----g~l~G-~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~----~~~ea~~-----~~~~g~~~~~ 334 (366) T protein:vir:57 269 MSQ----GILKG-YPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVD----FSTEATY-----KDADGQLVSA 334 (366) T ss_pred CCC----Ceecc-eeeEEccccccccccCCCccEEEEEecceEEEEEecceEEE----Eeecccc-----ccccccchhh Confidence 222 57877 799998886642 1122222222211 0111000 0111 Q ss_pred --cccceeeeeeeeceeecCcccccccCCcceeecCCch Q lcl|Aclame:pro 464 --NFQPVMGFKTRYGIGINPFADPAAQAPTKRIQNGMPD 500 (519) Q Consensus 464 --s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~~d~ 500 (519) +-+=.+=...||++.+ .....-.+..|..| T Consensus 335 f~~~~~~iR~~~~~d~~v-------~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 335 FARNQSLIRVVTEHDIGF-------RHPEGLVLGTGVIW 366 (366) T ss_pred hhcCceeEEeeeeeCcEe-------eccccEEEEecccC Confidence 1112333445566544 11112223334455 No 91 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=52.88 E-value=0.54 Score=22.04 Aligned_cols=278 Identities=14% Similarity=0.077 Sum_probs=96.4 Q ss_pred ccccccccccc---cccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCC Q lcl|Aclame:pro 162 EALAASKVLEV---GKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFN 238 (519) Q Consensus 162 ~~~~~~t~~~~---g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~g 238 (519) .+..++..... .......... ..+... ....+ .... ..... +..+.+-..-.+| T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~---s~i~~~--~~~~~--~~~~--~~~~p--------~~~~~~~a~~v~E------ 57 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGK---SSIARL--SAQKP--IPFN--GEKVF--------TFTMDSEIDVVAE------ 57 (298) T ss_pred CeeccccccChhHHHHHHHHHHhh---chhhhh--cceee--ccCC--ceEEE--------EEecCcceEEeeC------ Confidence 11111100000 0000000000 000000 00000 0000 00000 0001100001122 Q ss_pred CCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhc Q lcl|Aclame:pro 239 GSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSG 318 (519) Q Consensus 239 gs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~ 318 (519) +.++++-..+++.++...|.-+-....|-||.|+--. -..+-+++|.+-|+..|..+|+.-++.-..... | T Consensus 58 ---g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~--g--- 128 (298) T protein:vir:94 58 ---SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPRL--G--- 128 (298) T ss_pred ---CccccccccceeEEEEeeeEEEEeeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC--C--- Confidence 2334444555555555555555567889998764221 013345666666666666666666664211000 0 Q ss_pred ccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccc Q lcl|Aclame:pro 319 MTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGL 398 (519) Q Consensus 319 ~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~ 398 (519) ++....|.--+.... ...--.....-..+.-+.++-..+... +.+...+|++|.....|...- +.. + T Consensus 129 ---~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~lk--d~~----G 195 (298) T protein:vir:94 129 ---TASAVIGTNHFDSKV--TQKVEAPRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQK--DLQ----G 195 (298) T ss_pred ---ccccccccccccccc--ccccccccccccHHHHHHHHHHhhhhc--CCCccEEEEcHHHHHHHHHhh--ccC----C Confidence 000000100000000 000000011112233344444444331 234567999999998886422 110 0 Q ss_pred cccccccCCCceEEEEecCcEEEEecCCCcc------ceEEEEEecCCCccceeEeecccccc--cccccCccc-----c Q lcl|Aclame:pro 399 GQGFNVDTTKAVFAGVLGGKYRVYIDQYARS------DYFTIGYKGSNEMDAGIYYAPYVALT--PLRGSDPKN-----F 465 (519) Q Consensus 399 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~~~fyaPYv~~~--~~~~~dp~s-----~ 465 (519) +-.+..+.++. -.|+|.| ++|++++.-+. +.+++| +-. .++.|...-.+. ..+..||+. | T Consensus 196 ~~l~~~~~~~~-~~~tl~G-~PV~~~~~v~~~~~~~~~~~~~G---dfs--~~~~~~~~~~~~~~~~~~~~~d~~~~~~f 268 (298) T protein:vir:94 196 NALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIG---DFA--NGFKWGYAKEVPLEVIQYGDPDNSGLDLK 268 (298) T ss_pred CeeecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEe---ecc--ceEEEEEecCceEEEeecCCCcCcchhhh Confidence 11111222211 1257777 69998886542 223322 111 112233221111 112223321 2 Q ss_pred c-ceeee--eeeeceee-cCcccccccCCcceeecCC Q lcl|Aclame:pro 466 Q-PVMGF--KTRYGIGI-NPFADPAAQAPTKRIQNGM 498 (519) Q Consensus 466 q-P~~g~--~tRY~l~~-nP~~~~~~~~~~~~i~~~~ 498 (519) | =.++| ..|+++.+ +|= ...+|.+.. T Consensus 269 ~~~~v~~r~~~r~~~~~~~~~-------a~~~l~~~t 298 (298) T protein:vir:94 269 GYNQVYIRAELFLGWGILDAT-------KFARVTEAN 298 (298) T ss_pred hcCcEEEEEEEEeccEeeccc-------ceEEEEecC Confidence 2 12334 55777543 221 234444333 No 92 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=43.17 E-value=0.85 Score=20.96 Aligned_cols=204 Identities=16% Similarity=0.195 Sum_probs=104.4 Q ss_pred ccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 223 AEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINR 302 (519) Q Consensus 223 ~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINR 302 (519) -++.-+ |..=++-.-+-++ | .|--.|.+.-+..+++.++++ T Consensus 1 iD~lL~-------------------------------------a~~~VdDiD~aqa-~-~dvr~e~t~e~G~ALA~~~D~ 41 (221) T protein:vir:17 1 MDDLLV-------------------------------------ASQFVYDLDEILA-Q-WNTRSEISKQIGEALAIHYDE 41 (221) T ss_pred CCcchh-------------------------------------HHHHHHhHHHHHh-h-hHHHHHHHHHHHHHHHHHHHH Confidence 122222 2222233333444 4 889999999999999999999 Q ss_pred HHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH-HH Q lcl|Aclame:pro 303 EVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN-VV 381 (519) Q Consensus 303 eii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~-va 381 (519) -|++.+...|+-.. .++..+ |..+... ..+.- .....|+..|-+.+...-.+---..|-|+|++|+ .. T Consensus 42 ~i~~~~~~aA~~~~-p~~~~~----~g~~~~~---~a~~t---~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~ 110 (221) T protein:vir:17 42 RIARVLASASIAAA-PVTGQD----GGFSVNI---GAGNT---NNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYY 110 (221) T ss_pred HHHHHHHhhhhhcC-cccccc----cCcceec---ccccc---CCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHH Confidence 99998776665322 222211 1111110 01100 1123445555555555555544457789999996 55 Q ss_pred HHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc----ceEE------------EEEecCCCccce Q lcl|Aclame:pro 382 NVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS----DYFT------------IGYKGSNEMDAG 445 (519) Q Consensus 382 ~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~----dy~~------------vG~KG~~~~~~~ 445 (519) .+|+..+.+....-.+ ......... .-+|.+.| ++||.=++.|. +|.. =.|.|+-.-..| T Consensus 111 ~LL~~~d~~~~n~d~~-~s~g~~~~g--~~i~~v~G-~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~g 186 (221) T protein:vir:17 111 SLISSVDTNILNREIG-NTQGDMNTG--KGLYVNAG-IRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAG 186 (221) T ss_pred HHHHhcCcceeeeecc-ccccccccc--ceeeeecC-cEEEEeccCCcccccccccCCccccccccccccccccccceEE Confidence 5555433332111000 011111111 13677775 89999999886 3321 134455445578 Q ss_pred eEeecccccccccccCcccccceeeeeeeeceeecCcccccccCCcce Q lcl|Aclame:pro 446 IYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGINPFADPAAQAPTKR 493 (519) Q Consensus 446 ~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~ 493 (519) |||.|=. +--++.+.|-|--|.+.-| |.+- .|.+| T Consensus 187 lv~~~~A-vgtvkl~~~~~~~~~~~~~--~~~~----------~~~~~ 221 (221) T protein:vir:17 187 LVFHKEA-ADTVEVLLPPSRPPLVISM--FSIR----------RPDRR 221 (221) T ss_pred EEEcchh-eeeeeeecCCCCCceeeee--eecc----------CCCCC Confidence 8888863 3345566777776654322 1110 11122 No 93 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=40.07 E-value=0.99 Score=20.61 Aligned_cols=268 Identities=9% Similarity=0.008 Sum_probs=113.9 Q ss_pred ccccccccccccccccccccccccccccccccCCCC--CCCccccccccccccccccceecccccchhhhhhcccCCCCC Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAG--ATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGST 241 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag--~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~ 241 (519) .+.... ...+...+-... .-..+........++ ..+. .+.. ..|.+.+++.=-....+|.+. . T Consensus 1 Ma~~~T-~l~d~i~Pev~~--~~v~~~~~~~~~~~~~~~~~~-~l~g------~~G~ti~iP~~~~igda~~~~---e-- 65 (276) T protein:vir:10 1 MAQGTT-TKSTQIVPEVLA--PMMQAELDKKLRFAQFADIDS-TLVG------QPGDTLTFPAFVYSGDATVVP---E-- 65 (276) T ss_pred CCccee-ehhhhhchHHHH--HHHHHHHHhhhhhcccceecc-cccC------CCCCEEEeeeecCCCcccccc---C-- Confidence 111110 001111100000 000000000000000 0000 0000 012222221100011222221 1 Q ss_pred ccccccceeEEEEEEEEeecccccccccHHHHHHHHhhc-CCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 242 DNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVH-GMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 242 ~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiH-GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) +.++..=..+..+.+++.+-|.-.=++| |+-+.. +.|.-.|..+-++..|...++.+++..+.....- . T Consensus 66 g~~i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~----~- 135 (276) T protein:vir:10 66 GQKIPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT----V- 135 (276) T ss_pred CCccCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc----c- Confidence 1223222333445555555554333333 333332 6799999999999999999999999865432110 0 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcC--cccccccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVD--TSVSYAAQGL 398 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g--~~~~~~~~~~ 398 (519) .++.+.+ +.+-....++.++ -..++++||+|.+.+.|.... .|...+.. T Consensus 136 -----~~~~~t~-------------d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-- 186 (276) T protein:vir:10 136 -----SADIGTL-------------AGLEAAIDTFDDE---------DLEPMVLFINPKDAGKLRSSASDNFTRATEL-- 186 (276) T ss_pred -----cccccCH-------------HHHHHHHHHhccc---------cCcccEEEEcHHHHHHHHHhccccccccccc-- Confidence 1222222 2222222222221 125689999999999996432 33333211 Q ss_pred cccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee Q lcl|Aclame:pro 399 GQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG 478 (519) Q Consensus 399 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~ 478 (519) ..+...+-.+|.+.| ++|++|...+..-..+--+|.-. |+.. -+.......|++.++=.|--..+||+. T Consensus 187 ----g~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gAi~-----~~~~-~~~~vE~dRd~~~~~d~i~~~~~y~~~ 255 (276) T protein:vir:10 187 ----GDNIIVKGAFGEALG-AVIVRSKKLDEGEAILAKRGAVK-----LITK-RDFFLETDRDPSTKTTALYSDKHYVAY 255 (276) T ss_pred ----cccceeccccceecc-eeEEEcCCCCcceEEEEecccee-----eeec-CCceeecccchhhcccEEEEeeEEEEE Confidence 112223334678876 89999999875432222122221 1111 112222235999999999999999875 Q ss_pred ecCcccccccCCcceeecC---Cchhh Q lcl|Aclame:pro 479 INPFADPAAQAPTKRIQNG---MPDIV 502 (519) Q Consensus 479 ~nP~~~~~~~~~~~~i~~~---~d~~a 502 (519) . . ...+-.++..+ .|.-| T Consensus 256 ~---~---~~~~vv~~t~~~~~~~~~~ 276 (276) T protein:vir:10 256 L---Y---DESKAVKVTKGAGTTDSGA 276 (276) T ss_pred E---E---cCcceEEEecCCcCCcCCC Confidence 4 1 11112233322 12211 No 94 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=37.33 E-value=1.1 Score=20.31 Aligned_cols=344 Identities=16% Similarity=0.136 Sum_probs=125.6 Q ss_pred CCh-HHHHHhhhhh-------hC----CC-----ccccccccchhhhhhh--hhhhHHHHHhhhhhcc------------ Q lcl|Aclame:pro 1 MKK-NALVQKWSAL-------LE----NE-----ALPEIVGASKQAIIAK--IFENQEQDILTAPEYR------------ 49 (519) Q Consensus 1 ~~~-~~l~~kw~p~-------l~----~~-----~~~~~~~~~~~~~~~~--~~enq~~~~~~~~~~~------------ 49 (519) |++ ++|+++=+-. ++ .+ .+.++.+...+. .+. -|+.|.+++....... T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~-~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 79 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAA-KARRDAINDQIKDLEAENKANSDPDKPVDNAQP 79 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcc Confidence 544 2222221111 11 10 011111111100 000 1122222111110000 Q ss_pred ---ch------hhhhhhhhhhhhhhhccccccchhhhccccccccccccCceehhhHHHHHhhhhhhhceeeccCCccch Q lcl|Aclame:pro 50 ---DE------KISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTG 120 (519) Q Consensus 50 ---~~------~~~~~~~~~~~~~~~~~~~g~~~~~~~est~tg~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTG 120 (519) +. .-..++..+|-. +...-+......+++.|.+.--.+..-.++++..+..+-.++|.+.||+++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~l~~----~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) T protein:vir:10 80 NGTDLKKKPIDAKKKAINDFIHS----HGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKG 155 (394) T ss_pred cccchhhhHHHHHHHHHHHHHhc----cchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCce Confidence 00 000111121100 00000001101111112222111222235566666777789999999988864 Q ss_pred hheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCC Q lcl|Aclame:pro 121 QVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGA 200 (519) Q Consensus 121 LIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~ 200 (519) -+--.+ .... . ..| T Consensus 156 ~~~~~~-----~~~~----~---------~~~------------------------------------------------ 169 (394) T protein:vir:10 156 TYPILK-----RATD----R---------FSS------------------------------------------------ 169 (394) T ss_pred EEEEEe-----cCCC----c---------ccc------------------------------------------------ Confidence 433222 1000 0 000 Q ss_pred CCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhc Q lcl|Aclame:pro 201 TDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVH 280 (519) Q Consensus 201 t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiH 280 (519) ++++- +. ...+...|.+..|.+.|. +-...+|-||.+|- T Consensus 170 ---------------------~~E~~-----~~----~~~~~~~~~~v~l~~~k~-------~~~~~iS~ell~ds---- 208 (394) T protein:vir:10 170 ---------------------VAELA-----EN----PALAEPEFEQVDWSVSTY-------RGAIPLSEEAIADS---- 208 (394) T ss_pred ---------------------ccccc-----cc----cccccccceeEEeeeeee-------EeeehhHHHHHhhh---- Confidence 00000 00 001123355555555554 44567999999984 Q ss_pred CCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 GMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAA 360 (519) Q Consensus 281 GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~ 360 (519) ..|.+++|.+-|+..|..-+|+.|+.-... +.+.++.... ..+....++...... T Consensus 209 ~~~l~~~i~~~la~~~~~~~~~~il~g~g~-------------~~~~~~~~~~----------~~d~l~~~~~~~~~~-- 263 (394) T protein:vir:10 209 AVDLTSLVGQSINEKSVNTYNAMIAPVLQS-------------FTAKATTTDT----------LVDSLKHILNVDLDP-- 263 (394) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccccccccc----------cHHHHHHHHHhhhhh-- Confidence 256799999999999999999998863321 1111221111 112222222211111 Q ss_pred HHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccccccc---CCCceEEEEecCcEEEEecC--CCcc---ce- Q lcl|Aclame:pro 361 EIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVD---TTKAVFAGVLGGKYRVYIDQ--YARS---DY- 431 (519) Q Consensus 361 ~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d---~~~~~~~G~l~~~~~vy~D~--y~~~---dy- 431 (519) .+ . ..+|++|.....|...= +. . ++-.+..+ .+.....++|.| ++||+.. +.+. +. T Consensus 264 ------~~-~-a~~vmn~~~~~~l~~lk--d~-~---G~~i~~~~~~~~~~~~~~~~L~G-~PV~~~~~~~~~~~~~~~~ 328 (394) T protein:vir:10 264 ------AY-S-RALVVTQSLFNTLDTLK--DK-N---GRYLLHDASDSITDGTAKGTVLG-VPVYVVGDALLGSAAGDQK 328 (394) T ss_pred ------hc-c-CEEEecHHHHHHHHHhh--cc-C---CCeeeeccccccccCCccccccc-ceeEEecccccCCCCCceE Confidence 11 1 35789999888887431 10 0 00001111 122222357777 5777632 2221 11 Q ss_pred EEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccchhh Q lcl|Aclame:pro 432 FTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGLNGY 510 (519) Q Consensus 432 ~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~y 510 (519) +++|---. ++....- ....+...|...|.-.+-...|++..+ ||-+- ..+.. .+. .++.-+| T Consensus 329 i~~gd~s~-----~~~~~~~-~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai-------~~~~~-~~~---~~~~~~~ 391 (394) T protein:vir:10 329 AFVGDLKR-----GVLFADR-QQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAG-------YFVTN-TDA---ASGSTSG 391 (394) T ss_pred EEEeeccc-----cEEEEee-cceEEEEecccccceeEEEEEEeccEEeccccE-------EEEEe-ecc---cCCCCCC Confidence 22220000 0000000 111122234455555666677887543 22211 11110 000 0011111 Q ss_pred hhh Q lcl|Aclame:pro 511 FRR 513 (519) Q Consensus 511 ~r~ 513 (519) --| T Consensus 392 ~~~ 394 (394) T protein:vir:10 392 TGK 394 (394) T ss_pred CCC Confidence 111 No 95 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=37.11 E-value=1.1 Score=20.28 Aligned_cols=321 Identities=13% Similarity=0.131 Sum_probs=139.9 Q ss_pred hhhhccccccchhhhccccccccccccCcee-hhhHHHHH-hhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccc Q lcl|Aclame:pro 63 EAEIGGDHGYDATNIAAGQTSGAVTQIGPAV-MGMVRRAI-PHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKE 140 (519) Q Consensus 63 ~~~~~~~~g~~~~~~~est~tg~v~~~~P~L-~~l~Rra~-p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~e 140 (519) =|+++|..||.++.+. +.++..|-|-+ -+.+...+ ++|+...++ -+.. T Consensus 1 ~~~~~~~~~~~~~~~~----~t~~~~fiPev~s~~v~~~l~~~lv~~~l~--------------~~~~------------ 50 (381) T protein:vir:80 1 MATIQGTGGYKGSAVD----LSNVQVFIPEVWSSEVRMFRDQKFAALEAT--------------KKIP------------ 50 (381) T ss_pred CceecccccccCcccc----hhhHHhhhhHHHHHHHHHHHHHhhhhhhcc--------------cccc------------ Confidence 3788899999888876 44455665632 22332222 233322110 0111 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccce Q lcl|Aclame:pro 141 AFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLA 220 (519) Q Consensus 141 A~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~ 220 (519) |+|..+..- .++ ..|.. ... .+ T Consensus 51 -----------~~~~~GdTV--~ip----------------~~g~~---------------~a~--------------d~ 72 (381) T protein:vir:80 51 -----------FEGKKGDLI--HIP----------------NISRA---------------AVY--------------DK 72 (381) T ss_pred -----------ceeecCceE--Eee----------------ccCcc---------------eee--------------ee Confidence 111111000 000 00000 000 00 Q ss_pred ecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 221 EIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEI 300 (519) Q Consensus 221 ~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEI 300 (519) .-+..+. .+ ...-.+..++|+|...-+ .-+.-....+. | .|...|+..-+...|...+ T Consensus 73 ~~g~~i~---~~---------~~~~~~~~itID~~~~~~--------~~Idd~D~~~~-~-~D~~~~~~~~~~~aLA~~~ 130 (381) T protein:vir:80 73 QPQTPVN---LQ---------ARTDSEFTFTVTKYKESS--------FMIEDIVNTQA-S-YTLRQYYTKEAGYALARDM 130 (381) T ss_pred cCCCccc---cc---------ccCCceEEEEEeeeeecc--------eeechHHHHhh-c-cChHHHHHHHHHHHHHHHH Confidence 0000000 00 111234557776654322 11222222233 3 6999999999999999999 Q ss_pred hHHHHHHHHhhhhhhhh-cccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 301 NREVIDWINYSAQVGKS-GMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 301 NReii~~i~~~a~~~~~-~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) ++.|+..+......... ..+.+..-..+.+.... ......-..+.+..+..++.+. .+= ..|-|+|++|+ T Consensus 131 D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~--t~~~~~~t~~~i~~a~~~Lde~--~VP-----~egR~lvv~P~ 201 (381) T protein:vir:80 131 DNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHL--TGTPAPLTYAALLLAKQKLDEA--DVP-----QEGRIVMVSPA 201 (381) T ss_pred HHHHHHHHhhccccccccccccccccccccccccc--ccchhhHHHHHHHHHHHHHhhc--CCC-----cCCcEEEeCHH Confidence 99999865433321111 01111000001110000 0000111234444444444433 121 13469999999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEec-------CCCccceeEeec-- Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKG-------SNEMDAGIYYAP-- 450 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG-------~~~~~~~~fyaP-- 450 (519) +...|.....+...-. ...+...+-.+|.|.| ++||..++-|.+-. .+++- ..+...|-=|+| T Consensus 202 ~~~~Ll~~~~~~~ad~------~~~~~l~~G~Ig~i~G-~~Vv~Sn~lp~~~~-t~~~~~agap~~~~~~~~~~~~~g~~ 273 (381) T protein:vir:80 202 QYIDLLSINQFISVDF------SQVKPVTSGVVGTILG-MEVIVTTQIGINSL-TGYVNGQGAPTQPTPGVLGSPYLPDQ 273 (381) T ss_pred HHHHHhhchhhhhhhh------ccchhhhceeeeEEcc-eEEEeecccccccc-cceeeecccccccccccccccccccc Confidence 9999987765554321 1122233334677777 99999888876422 34431 111111111111 Q ss_pred ------------c---cc----cccc---cccCcccccceeeeeeee-----ceeecCcccccccCCcceeec-CCchhh Q lcl|Aclame:pro 451 ------------Y---VA----LTPL---RGSDPKNFQPVMGFKTRY-----GIGINPFADPAAQAPTKRIQN-GMPDIV 502 (519) Q Consensus 451 ------------Y---v~----~~~~---~~~dp~s~qP~~g~~tRY-----~l~~nP~~~~~~~~~~~~i~~-~~d~~a 502 (519) | ++ +..+ -..++....+..|...++ ||.++|....+ .++.-++. |.+. T Consensus 274 s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-- 349 (381) T protein:vir:80 274 AGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRWATAVVCHPDWLAV--GVQQNVKSESSRE-- 349 (381) T ss_pred ccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeeeehhhhhhhhhcccccccccc--cceeEeecccchh-- Confidence 1 00 0000 114788888888877665 67777776522 33444444 2111 Q ss_pred hcccchhhhhhhhhcCC Q lcl|Aclame:pro 503 NSLGLNGYFRRVYVKGI 519 (519) Q Consensus 503 ~~~~~~~y~r~v~v~~~ 519 (519) -+|.--|+|--+ T Consensus 350 -----~~~~~~~~~~~~ 361 (381) T protein:vir:80 350 -----TMYLADAFVTSC 361 (381) T ss_pred -----heeehhhhhhhh Confidence 112222222222 No 96 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=36.97 E-value=1.1 Score=20.26 Aligned_cols=344 Identities=13% Similarity=0.184 Sum_probs=128.3 Q ss_pred CChH-------HHHHhhhhhhCCCc--cccccccch------hhhhhhh--hhhH----HHHHhhhh-------hccchh Q lcl|Aclame:pro 1 MKKN-------ALVQKWSALLENEA--LPEIVGASK------QAIIAKI--FENQ----EQDILTAP-------EYRDEK 52 (519) Q Consensus 1 ~~~~-------~l~~kw~p~l~~~~--~~~~~~~~~------~~~~~~~--~enq----~~~~~~~~-------~~~~~~ 52 (519) |+-+ +|.++...+-+..+ +-++....+ +.+-++| +|++ ++...+.+ ...... T Consensus 3 ~~lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 82 (401) T protein:vir:44 3 VDIKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKVAAE 82 (401) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhHH Confidence 3322 34444433211100 111111000 0011111 1222 11111110 000111 Q ss_pred hhhhhhhhhhhhhhccccccchhhhccccc-cccc---cccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeee Q lcl|Aclame:pro 53 ISEAFGSFLTEAEIGGDHGYDATNIAAGQT-SGAV---TQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAV 128 (519) Q Consensus 53 ~~~~~~~~~~~~~~~~~~g~~~~~~~est~-tg~v---~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsr 128 (519) ...++..+|-......-.....+.+..++. .|.+ ..+.+-++.+.| ...+..++|-+.||++++..+.- T Consensus 83 ~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~---- 155 (401) T protein:vir:44 83 HKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLK---DEVVMRQEATVITVGGSDYKKLV---- 155 (401) T ss_pred HHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEE---- Confidence 123334443211111111112222232222 1111 244455555555 34456778999999887532211 Q ss_pred ecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccc Q lcl|Aclame:pro 129 YGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDA 208 (519) Q Consensus 129 Y~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~ 208 (519) .... +...|- T Consensus 156 ---~~~~------------~~a~wv------------------------------------------------------- 165 (401) T protein:vir:44 156 ---NLGG------------TASGWV------------------------------------------------------- 165 (401) T ss_pred ---ecCC------------ccceee------------------------------------------------------- Confidence 0000 000000 Q ss_pred ccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHH Q lcl|Aclame:pro 209 AVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAEL 288 (519) Q Consensus 209 ~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaEL 288 (519) ++|-. ........|.+..|.+.|. +--..+|-||.+|- .+|.+++| T Consensus 166 --------------~E~~~---------~~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i 211 (401) T protein:vir:44 166 --------------GETDT---------RSQTATSRLGLIEPFMGEI-------YGNPQATQKMLDDA----FFNVEAWI 211 (401) T ss_pred --------------ccccc---------cCccccccceeeeeehhhe-------eeehhhhHHHHhcc----hHHHHHHH Confidence 00000 0001112345555555444 44457899999983 46779999 Q ss_pred HHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeecccccccc---------------ccchHHHHHHHHHH Q lcl|Aclame:pro 289 SGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIR---------------GARWAGESFKALLF 353 (519) Q Consensus 289 sNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~---------------~~~~a~e~~r~L~~ 353 (519) .+-|+..|...+++.||. | +..+.+.|++......... ...-..+....|+. T Consensus 212 ~~~la~ai~~~~~~~~l~--------G-----~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~ 278 (401) T protein:vir:44 212 NSELATEFAEQEEIAFTT--------G-----DGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIY 278 (401) T ss_pred HHHHHHHHHHHHHhhhhc--------c-----CCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHH Confidence 999999999998888885 1 0112234544322211100 00001122222222 Q ss_pred HHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc---- Q lcl|Aclame:pro 354 QIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS---- 429 (519) Q Consensus 354 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~---- 429 (519) .+.. .+..+...|+++.....|...- +.. +...+..+.+.. --++|.| ++|+++...+. T Consensus 279 -------~l~~--~~~~~a~~v~n~~~~~~L~~lk--d~~----G~~l~~~~~~~g-~~~~l~G-~PVv~~~~~p~~~~~ 341 (401) T protein:vir:44 279 -------TLRK--AHRTGAKFMMNNNSLFAIRLLK--DTE----GNYLWRPGLELG-QPSSLAG-YGIAENEQMPDIAAD 341 (401) T ss_pred -------hcch--hhhcCCEEEEcHHHHHHHHHhh--ccC----CceeecCCcCCC-CCceecc-eeeEEecCcCCccCC Confidence 2322 2223456889999888887431 110 011122221111 1146776 68888876552 Q ss_pred -ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeee--eeceee-cCcccccccCCcceeecCCchhhhcc Q lcl|Aclame:pro 430 -DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKT--RYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) Q Consensus 430 -dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~t--RY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~ 505 (519) +.+++| +-.. +|-=+....+....|+-.=+-.++|.. |++..+ +|-+ .+.+. ++ T Consensus 342 ~~~i~~G---d~~~----~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a-------~~~l~-----~~--- 399 (401) T protein:vir:44 342 AKAIAFG---NFKR----GYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQA-------IKLLK-----IA--- 399 (401) T ss_pred ccEEEEe---ehhc----cEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccc-------eEEEE-----ee--- Confidence 122222 1100 010000000111123332233344333 665432 1111 11111 01 Q ss_pred cchhhhhhhhhcCC Q lcl|Aclame:pro 506 GLNGYFRRVYVKGI 519 (519) Q Consensus 506 ~~~~y~r~v~v~~~ 519 (519) -= T Consensus 400 ------------aa 401 (401) T protein:vir:44 400 ------------AA 401 (401) T ss_pred ------------cC Confidence 11 No 97 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=36.75 E-value=1.2 Score=20.24 Aligned_cols=337 Identities=16% Similarity=0.148 Sum_probs=126.8 Q ss_pred CChHHHHHhhhh-hhCCCccccccccchhhhhh--hhhhhHHHHHhhhhh--------ccc--------hhhh----hhh Q lcl|Aclame:pro 1 MKKNALVQKWSA-LLENEALPEIVGASKQAIIA--KIFENQEQDILTAPE--------YRD--------EKIS----EAF 57 (519) Q Consensus 1 ~~~~~l~~kw~p-~l~~~~~~~~~~~~~~~~~~--~~~enq~~~~~~~~~--------~~~--------~~~~----~~~ 57 (519) ++ ++|.++=.. -..-|.+-++.+...+. .+ .=|+.|.+.+..... ... .... .++ T Consensus 18 ~~-~~l~~~~~~~~~~~e~~~~l~~ei~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (389) T protein:vir:10 18 LN-AQLNAKLQDENASVDDFQKIKDDLTAA-KARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAKKKAI 95 (389) T ss_pred HH-HHHHHHHHhHhhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHHHHHH Confidence 10 000000000 00000011111111100 00 012222222211100 000 0000 111 Q ss_pred hhhhhhhhhccccccchhhhcccccc-ccccccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCC Q lcl|Aclame:pro 58 GSFLTEAEIGGDHGYDATNIAAGQTS-GAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAA 136 (519) Q Consensus 58 ~~~~~~~~~~~~~g~~~~~~~est~t-g~v~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~ 136 (519) ..+| - ..+-....+.+++++ |.+.--....-.++++..+..+..++|.|.||+++++-+--++. .... T Consensus 96 ~~~l----r--~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~--- 164 (389) T protein:vir:10 96 NDFI----H--SHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR--ATDR--- 164 (389) T ss_pred HHHh----h--cchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec--CCCc--- Confidence 1111 0 011111222333322 22211111122355556667778899999999987643222211 0000 Q ss_pred CcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCcccccccccccccc Q lcl|Aclame:pro 137 GAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEA 216 (519) Q Consensus 137 ~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~ 216 (519) . .+ T Consensus 165 -----~--------~~---------------------------------------------------------------- 167 (389) T protein:vir:10 165 -----F--------SS---------------------------------------------------------------- 167 (389) T ss_pred -----c--------cc---------------------------------------------------------------- Confidence 0 00 Q ss_pred ccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 217 GQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEI 296 (519) Q Consensus 217 g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEI 296 (519) ++++ ++. ...+...|.+..+++.|.. --..+|-||.+|- ..|.+++|.+-|...+ T Consensus 168 -----~~E~-----~~~----~~~~~~~~~~i~~~~~k~~-------~~~~iS~ell~ds----~~~l~~~i~~~la~~~ 222 (389) T protein:vir:10 168 -----VAEL-----AEN----PKLAEPEFNKVDWSVATYR-------GAIPLSEEAIADS----AVDLTALVGQSIKEKS 222 (389) T ss_pred -----cccc-----ccc----cccccccceeeeeeheeeE-------eeehhhHHHHhhh----hHHHHHHHHHHHHHHH Confidence 0000 000 0011234555555555554 4457899999984 3467888999999999 Q ss_pred HHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEE Q lcl|Aclame:pro 297 MLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIA 376 (519) Q Consensus 297 mlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~ 376 (519) ..-+|..|+.-+.... +.|+.... ..+.+..++..... ..+ ...+|| T Consensus 223 ~~~~~~~i~~g~~~~~-------------~~~~~~~~----------~~d~l~~~~~~~~~--------~~~--~a~~~~ 269 (389) T protein:vir:10 223 VNTYNAMIAPVLQSFT-------------AKKTTTDT----------LVDSLKHILNVDLD--------PAY--SRALVV 269 (389) T ss_pred HHHHHHHHhhhhcccc-------------cccccccc----------cHHHHHHHHHhhhh--------hhh--CcEEEe Confidence 8888888876432111 11211000 01222333221111 112 246789 Q ss_pred chHHHHHHHhc----CcccccccccccccccccCCCceEEEEecCcEEEEe-cC-CCcc---c-eEEEEEecCCCcccee Q lcl|Aclame:pro 377 SRNVVNVLAAV----DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYI-DQ-YARS---D-YFTIGYKGSNEMDAGI 446 (519) Q Consensus 377 S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~-D~-y~~~---d-y~~vG~KG~~~~~~~~ 446 (519) +|.....|... |...+.+. ..+.+...+-++|.| ++||+ |. ..+. | .+++|= +..+. T Consensus 270 n~~~~~~L~~lkd~~G~~i~~~~-------~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~gd-----~~~~~ 336 (389) T protein:vir:10 270 TQSLFNTLDTLKDKNGRYLLHDA-------SDSITDGTAKGTILG-VPVYVVGDTLLGSLAGDQKAFVGD-----LKRGV 336 (389) T ss_pred cHHHHHHHHHhhccCCCeeeecC-------ccccccccccccccc-ceeEEecccccCCCCCceEEEEee-----ccccE Confidence 99988888853 21111110 011112223356888 57765 32 2222 1 133330 00000 Q ss_pred EeecccccccccccCcccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 447 YYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 447 fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) .+... ....+...|-..|.-.+...-|++..+ ||=+ ...+. =.+--+..++| T Consensus 337 ~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a-------~~~~~-~~~~~~~~~~~ 389 (389) T protein:vir:10 337 LFTDR-QQVTLAWEDSKIYGKYLGAAFRFGVQKADSKA-------GYFVT-NTDVPGSALGK 389 (389) T ss_pred EEEee-cceEEEeeccccccceEEEEEEeccEEecccc-------eEEEE-eeccCCCCCCC Confidence 00000 111222234455666777778998654 2211 11111 01122233444 No 98 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=33.38 E-value=1.4 Score=19.85 Aligned_cols=348 Identities=16% Similarity=0.097 Sum_probs=123.7 Q ss_pred CChHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhh----------------------------------- Q lcl|Aclame:pro 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTA----------------------------------- 45 (519) Q Consensus 1 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~----------------------------------- 45 (519) |+-++|+|+=+-+++. +-++.+..+. ..+.|.+.+.+.+. T Consensus 1 M~l~eL~e~r~~l~~e--~~~l~~k~~~---~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~ 75 (409) T protein:vir:45 1 MKLHELKQKRNTIATD--MRALNEKIGD---NAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQN 75 (409) T ss_pred CCHHHHHHHHHHHHHH--HHHHHHHhhc---CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccc Confidence 9999999887665532 1111111100 00111111111000 Q ss_pred -----hhccchhhhhhhhhhhhhhhhccccccchhhh----ccccccccc------cccCceehhhHHHHHhhhhhhhce Q lcl|Aclame:pro 46 -----PEYRDEKISEAFGSFLTEAEIGGDHGYDATNI----AAGQTSGAV------TQIGPAVMGMVRRAIPHLIAFDIC 110 (519) Q Consensus 46 -----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~----~est~tg~v------~~~~P~L~~l~Rra~p~LIa~DI~ 110 (519) .....+.-.+++..+|...... -..-..+.+ +.+++++.- ..+.+-++.++| +..+..++| T Consensus 76 ~~~~~~~~~~~~~~~a~~~~l~~~~~~-~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~---~~~~l~~~~ 151 (409) T protein:vir:45 76 LDPENNSQQDEKRAQVFDKWMRHGASE-LTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMK---SYGGIASVA 151 (409) T ss_pred CCCCCcchhhHHHHHHHHHHHHhhhhh-ccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHH---hhhhhhhhc Confidence 0000111122222222110000 000000001 111111110 111122222222 233344566 Q ss_pred eeccCCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 111 GVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQA 190 (519) Q Consensus 111 GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~ 190 (519) -|-|+++.....+-.... ... T Consensus 152 ~~~~~~~~~~~~~~~~~~-~~~---------------------------------------------------------- 172 (409) T protein:vir:45 152 QILTTSDGRTMEWATADG-TSE---------------------------------------------------------- 172 (409) T ss_pred eeeecCCCceEEEEeecc-Ccc---------------------------------------------------------- Confidence 666665433222111000 000 Q ss_pred cccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccH Q lcl|Aclame:pro 191 VEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSI 270 (519) Q Consensus 191 ~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTm 270 (519) . ..-+++|-.. ..+...|.+..|.-.|.. +.=..+|- T Consensus 173 ------------~---------------~~~v~E~~~~----------~~~~~~f~~~~l~~~k~~------~~~i~is~ 209 (409) T protein:vir:45 173 ------------V---------------GVLLGENEEA----------GEEDTDFGMGSLGALKMT------SKIIRVSN 209 (409) T ss_pred ------------c---------------cccccccccc----------cccccccceeeeeeeeee------eeehhhhH Confidence 0 0000111000 001122333333222211 11235799 Q ss_pred HHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHH-HHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHH Q lcl|Aclame:pro 271 ELAQDLRAVHGMDADAELSGILATEIMLEINREVID-WINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFK 349 (519) Q Consensus 271 ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~-~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r 349 (519) ||.+|- .+|.+++|.+-|+..|..-+|+.||. .=+ ..+..+.|++.......... ... . T Consensus 210 ell~ds----~~~l~~~i~~~la~a~~~~~~~a~l~G~G~-----------~~~~~p~Gil~~~~~~~~~~---~~~--~ 269 (409) T protein:vir:45 210 ELLQDS----AIDMEAYLARRIAERIGRGEARYLIQGTGA-----------GTPKQPKGLAASVTGTTQTA---AAN--A 269 (409) T ss_pred HHHhcc----HHHHHHHHHHHHHHHHHHHHHHHhhccCCC-----------CCccccceeeeccccccccc---ccc--c Confidence 999994 25789999999999999999999885 100 01112234433211100000 000 0 Q ss_pred HHHHHHHHHHHHHHhhccccCCCE-EEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCc Q lcl|Aclame:pro 350 ALLFQIDKEAAEIARQTGRGAGNF-IIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYAR 428 (519) Q Consensus 350 ~L~~~i~~~a~~I~~~T~rg~gn~-~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 428 (519) --+..|+++.+.+...=+. .+.+ ++|++.....|...= +. .++..+..+.+.. --++|.| ++|+++.+.| T Consensus 270 ~~~d~i~~l~~~l~~~~~~-~a~~~~~~n~~~~~~l~~lk--d~----~G~~i~~~~~~~~-~~~~l~G-~PV~~~~~~p 340 (409) T protein:vir:45 270 VKWQEILALKHSIDPAYRR-GPKFRLAFNDNTLKLISEME--DG----QGRPLWLPDIVGV-APASVLN-VPYVIDQEID 340 (409) T ss_pred cchHHHHHHHHhhhhhhcc-CCeEEEEECHHHHHHHHHhh--cC----CCceeeccCcCCC-CCceecc-eeeEEecCcC Confidence 0012234444444433232 3455 578998877775421 10 0111122222211 1136777 6999988876 Q ss_pred c----ce-EEEEEecCCCccceeEeeccccccccc-ccCcccccceeeee--eeeceee-cCcccccccCCcceeecCCc Q lcl|Aclame:pro 429 S----DY-FTIGYKGSNEMDAGIYYAPYVALTPLR-GSDPKNFQPVMGFK--TRYGIGI-NPFADPAAQAPTKRIQNGMP 499 (519) Q Consensus 429 ~----dy-~~vG~KG~~~~~~~~fyaPYv~~~~~~-~~dp~s~qP~~g~~--tRY~l~~-nP~~~~~~~~~~~~i~~~~d 499 (519) . ++ +++| +-.. .+... .-...++ ..|+-.=...++|. .||+..+ ||=+ .+.+. . T Consensus 341 ~~~~~~~~i~~G---d~~~---~~i~~-~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A-------~~~l~-~-- 403 (409) T protein:vir:45 341 DIGAGKKFMFCG---DFDR---FIIRR-VRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSA-------IKALV-G-- 403 (409) T ss_pred CccCCccEEEEe---ehhh---hheee-ccceEEEEeecccccCCcEEEEEEEEeccEeechhh-------eEEEE-e-- Confidence 3 12 2222 1100 01100 0111111 23443223334443 4776442 2211 11111 1 Q ss_pred hhhhcccc Q lcl|Aclame:pro 500 DIVNSLGL 507 (519) Q Consensus 500 ~~a~~~~~ 507 (519) +.+++- T Consensus 404 --k~s~~~ 409 (409) T protein:vir:45 404 --KGSVGG 409 (409) T ss_pred --ccCCCC Confidence 111111 No 99 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=33.31 E-value=1.4 Score=19.84 Aligned_cols=322 Identities=13% Similarity=0.069 Sum_probs=121.3 Q ss_pred CChH--HHHHhhhhh---h--CCCccc----------cccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhh Q lcl|Aclame:pro 1 MKKN--ALVQKWSAL---L--ENEALP----------EIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTE 63 (519) Q Consensus 1 ~~~~--~l~~kw~p~---l--~~~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~ 63 (519) +..| .|.++.... . +.+..+ +-...+|++. .+.|.++ .+......++.. T Consensus 34 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~-------------~~~~~~~~~~~~ 99 (392) T protein:vir:10 34 MMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF-MKALRNK-------------PLNAEEREFLED 99 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH-HHHHhcc-------------cccHHHHHHHhh Confidence 1111 222222211 0 000000 1111122221 1111111 111111111100 Q ss_pred hhhccccccchhhhcccccc-cccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcc Q lcl|Aclame:pro 64 AEIGGDHGYDATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAK 139 (519) Q Consensus 64 ~~~~~~~g~~~~~~~est~t-g~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~ 139 (519) .. .......++++ |.+. .+.+-++.+. .....-.+++++.||++++|-+. ..+..+.. T Consensus 100 ~~-------~~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------ 161 (392) T protein:vir:10 100 DL-------EQRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------ 161 (392) T ss_pred hh-------hhhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecCCc------ Confidence 00 00111111211 2111 2333333444 44556678999999998876422 11111000 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc Q lcl|Aclame:pro 140 EAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL 219 (519) Q Consensus 140 eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~ 219 (519) + ..| T Consensus 162 ~---------a~~------------------------------------------------------------------- 165 (392) T protein:vir:10 162 P---------FAE------------------------------------------------------------------- 165 (392) T ss_pred c---------cee------------------------------------------------------------------- Confidence 0 000 Q ss_pred eecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) Q Consensus 220 ~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlE 299 (519) +++|-.. ..+....|.++.|...|. +-...+|-||.+|- ..|.+++|.+.|...|..- T Consensus 166 --v~E~~~~---------~~~~~~~~~~v~l~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 166 --ITEMGEI---------PETDNPKFSNVQYAVKDR-------AGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVT 223 (392) T ss_pred --ecccccc---------cccccccceeEEeeeeeE-------EEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHH Confidence 0010000 001122345555555444 44557999999994 2567899999999999999 Q ss_pred hhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 300 INReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) ++..|+.-... ..+.|+..++ ....++.. .. ...+-..-..|++|. T Consensus 224 ~d~~~~~g~g~-------------~~~~~~~~~d-------------~i~~~~~~--~l------~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 224 RNVLILGVIEK-------------LTKQAIKSLD-------------DIKDVLNV--KL------DPAISPNAILLTNQD 269 (392) T ss_pred HHHHHhhcccc-------------ccccCccCHH-------------HHHHHHHH--hh------hhhhccCCEEEEcHH Confidence 98888752211 1122333222 12222211 11 112223356789999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccc------ Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA------ 453 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~------ 453 (519) ....|...= + +. +.-.+..+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+-. T Consensus 270 ~~~~L~~lk--d-~~---G~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 270 GFNYLDKLK--D-KD---GKYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred HHHHHHHhh--c-cC---CCeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEe Confidence 999997531 1 00 01111122211 1235677765666543221 11111111122233332111 Q ss_pred -cccccccCc------ccccceeeeeeeeceee-cCcccccccCCcceeecC--Cchhhhccc Q lcl|Aclame:pro 454 -LTPLRGSDP------KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG--MPDIVNSLG 506 (519) Q Consensus 454 -~~~~~~~dp------~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~--~d~~a~~~~ 506 (519) ..+.-.+++ .+.+=.+-...|++..+ +|-+ ...+.-. -+-.. -.| T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~l~~~~~a~~~~-~~~ 392 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA-------AVYGEIDLSAPVEQ-PQG 392 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEEecccccccC-CCC Confidence 000001122 23444566677777543 2221 1111111 01000 001 No 100 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=33.31 E-value=1.4 Score=19.84 Aligned_cols=322 Identities=13% Similarity=0.069 Sum_probs=121.3 Q ss_pred CChH--HHHHhhhhh---h--CCCccc----------cccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhh Q lcl|Aclame:pro 1 MKKN--ALVQKWSAL---L--ENEALP----------EIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTE 63 (519) Q Consensus 1 ~~~~--~l~~kw~p~---l--~~~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~ 63 (519) +..| .|.++.... . +.+..+ +-...+|++. .+.|.++ .+......++.. T Consensus 34 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~-------------~~~~~~~~~~~~ 99 (392) T protein:vir:10 34 MMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF-MKALRNK-------------PLNAEEREFLED 99 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH-HHHHhcc-------------cccHHHHHHHhh Confidence 1111 222222211 0 000000 1111122221 1111111 111111111100 Q ss_pred hhhccccccchhhhcccccc-cccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcc Q lcl|Aclame:pro 64 AEIGGDHGYDATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAK 139 (519) Q Consensus 64 ~~~~~~~g~~~~~~~est~t-g~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~ 139 (519) .. .......++++ |.+. .+.+-++.+. .....-.+++++.||++++|-+. ..+..+.. T Consensus 100 ~~-------~~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------ 161 (392) T protein:vir:10 100 DL-------EQRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------ 161 (392) T ss_pred hh-------hhhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecCCc------ Confidence 00 00111111211 2111 2333333444 44556678999999998876422 11111000 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc Q lcl|Aclame:pro 140 EAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL 219 (519) Q Consensus 140 eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~ 219 (519) + ..| T Consensus 162 ~---------a~~------------------------------------------------------------------- 165 (392) T protein:vir:10 162 P---------FAE------------------------------------------------------------------- 165 (392) T ss_pred c---------cee------------------------------------------------------------------- Confidence 0 000 Q ss_pred eecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) Q Consensus 220 ~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlE 299 (519) +++|-.. ..+....|.++.|...|. +-...+|-||.+|- ..|.+++|.+.|...|..- T Consensus 166 --v~E~~~~---------~~~~~~~~~~v~l~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 166 --ITEMGEI---------PETDNPKFSNVQYAVKDR-------AGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVT 223 (392) T ss_pred --ecccccc---------cccccccceeEEeeeeeE-------EEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHH Confidence 0010000 001122345555555444 44557999999994 2567899999999999999 Q ss_pred hhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 300 INReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) ++..|+.-... ..+.|+..++ ....++.. .. ...+-..-..|++|. T Consensus 224 ~d~~~~~g~g~-------------~~~~~~~~~d-------------~i~~~~~~--~l------~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 224 RNVLILGVIEK-------------LTKQAIKSLD-------------DIKDVLNV--KL------DPAISPNAILLTNQD 269 (392) T ss_pred HHHHHhhcccc-------------ccccCccCHH-------------HHHHHHHH--hh------hhhhccCCEEEEcHH Confidence 98888752211 1122333222 12222211 11 112223356789999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccc------ Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA------ 453 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~------ 453 (519) ....|...= + +. +.-.+..+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+-. T Consensus 270 ~~~~L~~lk--d-~~---G~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 270 GFNYLDKLK--D-KD---GKYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred HHHHHHHhh--c-cC---CCeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEe Confidence 999997531 1 00 01111122211 1235677765666543221 11111111122233332111 Q ss_pred -cccccccCc------ccccceeeeeeeeceee-cCcccccccCCcceeecC--Cchhhhccc Q lcl|Aclame:pro 454 -LTPLRGSDP------KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG--MPDIVNSLG 506 (519) Q Consensus 454 -~~~~~~~dp------~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~--~d~~a~~~~ 506 (519) ..+.-.+++ .+.+=.+-...|++..+ +|-+ ...+.-. -+-.. -.| T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~l~~~~~a~~~~-~~~ 392 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA-------AVYGEIDLSAPVEQ-PQG 392 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEEecccccccC-CCC Confidence 000001122 23444566677777543 2221 1111111 01000 001 No 101 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=33.31 E-value=1.4 Score=19.84 Aligned_cols=322 Identities=13% Similarity=0.069 Sum_probs=121.3 Q ss_pred CChH--HHHHhhhhh---h--CCCccc----------cccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhh Q lcl|Aclame:pro 1 MKKN--ALVQKWSAL---L--ENEALP----------EIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTE 63 (519) Q Consensus 1 ~~~~--~l~~kw~p~---l--~~~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~ 63 (519) +..| .|.++.... . +.+..+ +-...+|++. .+.|.++ .+......++.. T Consensus 34 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~-------------~~~~~~~~~~~~ 99 (392) T protein:vir:10 34 MMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF-MKALRNK-------------PLNAEEREFLED 99 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH-HHHHhcc-------------cccHHHHHHHhh Confidence 1111 222222211 0 000000 1111122221 1111111 111111111100 Q ss_pred hhhccccccchhhhcccccc-cccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcc Q lcl|Aclame:pro 64 AEIGGDHGYDATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAK 139 (519) Q Consensus 64 ~~~~~~~g~~~~~~~est~t-g~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~ 139 (519) .. .......++++ |.+. .+.+-++.+. .....-.+++++.||++++|-+. ..+..+.. T Consensus 100 ~~-------~~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------ 161 (392) T protein:vir:10 100 DL-------EQRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------ 161 (392) T ss_pred hh-------hhhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecCCc------ Confidence 00 00111111211 2111 2333333444 44556678999999998876422 11111000 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc Q lcl|Aclame:pro 140 EAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL 219 (519) Q Consensus 140 eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~ 219 (519) + ..| T Consensus 162 ~---------a~~------------------------------------------------------------------- 165 (392) T protein:vir:10 162 P---------FAE------------------------------------------------------------------- 165 (392) T ss_pred c---------cee------------------------------------------------------------------- Confidence 0 000 Q ss_pred eecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) Q Consensus 220 ~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlE 299 (519) +++|-.. ..+....|.++.|...|. +-...+|-||.+|- ..|.+++|.+.|...|..- T Consensus 166 --v~E~~~~---------~~~~~~~~~~v~l~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 166 --ITEMGEI---------PETDNPKFSNVQYAVKDR-------AGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVT 223 (392) T ss_pred --ecccccc---------cccccccceeEEeeeeeE-------EEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHH Confidence 0010000 001122345555555444 44557999999994 2567899999999999999 Q ss_pred hhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 300 INReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) ++..|+.-... ..+.|+..++ ....++.. .. ...+-..-..|++|. T Consensus 224 ~d~~~~~g~g~-------------~~~~~~~~~d-------------~i~~~~~~--~l------~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 224 RNVLILGVIEK-------------LTKQAIKSLD-------------DIKDVLNV--KL------DPAISPNAILLTNQD 269 (392) T ss_pred HHHHHhhcccc-------------ccccCccCHH-------------HHHHHHHH--hh------hhhhccCCEEEEcHH Confidence 98888752211 1122333222 12222211 11 112223356789999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccc------ Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA------ 453 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~------ 453 (519) ....|...= + +. +.-.+..+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+-. T Consensus 270 ~~~~L~~lk--d-~~---G~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 270 GFNYLDKLK--D-KD---GKYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred HHHHHHHhh--c-cC---CCeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEe Confidence 999997531 1 00 01111122211 1235677765666543221 11111111122233332111 Q ss_pred -cccccccCc------ccccceeeeeeeeceee-cCcccccccCCcceeecC--Cchhhhccc Q lcl|Aclame:pro 454 -LTPLRGSDP------KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG--MPDIVNSLG 506 (519) Q Consensus 454 -~~~~~~~dp------~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~--~d~~a~~~~ 506 (519) ..+.-.+++ .+.+=.+-...|++..+ +|-+ ...+.-. -+-.. -.| T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~l~~~~~a~~~~-~~~ 392 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA-------AVYGEIDLSAPVEQ-PQG 392 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEEecccccccC-CCC Confidence 000001122 23444566677777543 2221 1111111 01000 001 No 102 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=33.31 E-value=1.4 Score=19.84 Aligned_cols=322 Identities=13% Similarity=0.069 Sum_probs=121.3 Q ss_pred CChH--HHHHhhhhh---h--CCCccc----------cccccchhhhhhhhhhhHHHHHhhhhhccchhhhhhhhhhhhh Q lcl|Aclame:pro 1 MKKN--ALVQKWSAL---L--ENEALP----------EIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTE 63 (519) Q Consensus 1 ~~~~--~l~~kw~p~---l--~~~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~ 63 (519) +..| .|.++.... . +.+..+ +-...+|++. .+.|.++ .+......++.. T Consensus 34 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~-------------~~~~~~~~~~~~ 99 (392) T protein:vir:10 34 MMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF-MKALRNK-------------PLNAEEREFLED 99 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH-HHHHhcc-------------cccHHHHHHHhh Confidence 1111 222222211 0 000000 1111122221 1111111 111111111100 Q ss_pred hhhccccccchhhhcccccc-cccc---ccCceehhhHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcc Q lcl|Aclame:pro 64 AEIGGDHGYDATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAK 139 (519) Q Consensus 64 ~~~~~~~g~~~~~~~est~t-g~v~---~~~P~L~~l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~ 139 (519) .. .......++++ |.+. .+.+-++.+. .....-.+++++.||++++|-+. ..+..+.. T Consensus 100 ~~-------~~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------ 161 (392) T protein:vir:10 100 DL-------EQRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------ 161 (392) T ss_pred hh-------hhhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecCCc------ Confidence 00 00111111211 2111 2333333444 44556678999999998876422 11111000 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccc Q lcl|Aclame:pro 140 EAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQL 219 (519) Q Consensus 140 eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~ 219 (519) + ..| T Consensus 162 ~---------a~~------------------------------------------------------------------- 165 (392) T protein:vir:10 162 P---------FAE------------------------------------------------------------------- 165 (392) T ss_pred c---------cee------------------------------------------------------------------- Confidence 0 000 Q ss_pred eecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 220 AEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLE 299 (519) Q Consensus 220 ~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlE 299 (519) +++|-.. ..+....|.++.|...|. +-...+|-||.+|- ..|.+++|.+.|...|..- T Consensus 166 --v~E~~~~---------~~~~~~~~~~v~l~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 166 --ITEMGEI---------PETDNPKFSNVQYAVKDR-------AGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVT 223 (392) T ss_pred --ecccccc---------cccccccceeEEeeeeeE-------EEeehhhHHHHhhh----HHHHHHHHHHHHHHHHHHH Confidence 0010000 001122345555555444 44557999999994 2567899999999999999 Q ss_pred hhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchH Q lcl|Aclame:pro 300 INREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRN 379 (519) Q Consensus 300 INReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 379 (519) ++..|+.-... ..+.|+..++ ....++.. .. ...+-..-..|++|. T Consensus 224 ~d~~~~~g~g~-------------~~~~~~~~~d-------------~i~~~~~~--~l------~~~~~~~a~~vm~~~ 269 (392) T protein:vir:10 224 RNVLILGVIEK-------------LTKQAIKSLD-------------DIKDVLNV--KL------DPAISPNAILLTNQD 269 (392) T ss_pred HHHHHhhcccc-------------ccccCccCHH-------------HHHHHHHH--hh------hhhhccCCEEEEcHH Confidence 98888752211 1122333222 12222211 11 112223356789999 Q ss_pred HHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccc------ Q lcl|Aclame:pro 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVA------ 453 (519) Q Consensus 380 va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~------ 453 (519) ....|...= + +. +.-.+..+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+-. T Consensus 270 ~~~~L~~lk--d-~~---G~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 270 GFNYLDKLK--D-KD---GKYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred HHHHHHHhh--c-cC---CCeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEe Confidence 999997531 1 00 01111122211 1235677765666543221 11111111122233332111 Q ss_pred -cccccccCc------ccccceeeeeeeeceee-cCcccccccCCcceeecC--Cchhhhccc Q lcl|Aclame:pro 454 -LTPLRGSDP------KNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNG--MPDIVNSLG 506 (519) Q Consensus 454 -~~~~~~~dp------~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~--~d~~a~~~~ 506 (519) ..+.-.+++ .+.+=.+-...|++..+ +|-+ ...+.-. -+-.. -.| T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~l~~~~~a~~~~-~~~ 392 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA-------AVYGEIDLSAPVEQ-PQG 392 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccc-------eEEEEecccccccC-CCC Confidence 000001122 23444566677777543 2221 1111111 01000 001 No 103 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=33.00 E-value=1.4 Score=19.81 Aligned_cols=219 Identities=10% Similarity=0.088 Sum_probs=101.4 Q ss_pred cccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHH Q lcl|Aclame:pro 193 AVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIEL 272 (519) Q Consensus 193 ~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmEL 272 (519) -.+.+. |.+.+++. -...+|.+. ....-+..+|+++ ..+++.|-+.=.=++|=| T Consensus 1 ~~~~~~------------------Gdtit~P~--~iGda~~v~---eG~~i~~~~l~~t--~~~atIk~~gk~~~itD~- 54 (231) T protein:vir:73 1 ENGINL------------------ANLCEYPN--DIGDAADVA---EGGEISLDKIGTT--TKSVTIKKAAKGTEITDE- 54 (231) T ss_pred CccccC------------------CceEEecc--cccchhhhc---CCCcCChhhcccc--ceeeeEeeeccceeeeHH- Confidence 000001 11111110 011222221 1112233445544 444444544333333332 Q ss_pred HHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHH Q lcl|Aclame:pro 273 AQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALL 352 (519) Q Consensus 273 AQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~ 352 (519) ..|.+ +| |.-.|..+-|+..|...++.|++..+..++. +. +..++++. +..+..+ T Consensus 55 -a~l~~-~g-Dp~~ea~~Q~~~~iA~kvD~di~~~~~~a~l------~~-----~~~~t~d~----------i~~A~~~- 109 (231) T protein:vir:73 55 -AALSG-YG-DPIGESNKQLGLSLANKVDDDLLKAAKTTSQ------TV-----STKANVDG----------VQAALDI- 109 (231) T ss_pred -HHhhc-cC-chHHHHHHHHHHHHHHhhhHHHHHhhccccc------cc-----cccccHHH----------HHHHHHH- Confidence 22555 33 8899999999999999999999975443221 10 11121111 1111111 Q ss_pred HHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceE Q lcl|Aclame:pro 353 FQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYF 432 (519) Q Consensus 353 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 432 (519) +.++ -....++||+|+++.-|.....+...-....+... .++ .+|.+.| ++|+++...+.+ T Consensus 110 --fgde---------~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~---~~G--~iG~i~G-~~Vi~S~~~~~~-- 170 (231) T protein:vir:73 110 --FNDE---------DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANAL---ING--TYADVLG-AQIVRSKKLAEG-- 170 (231) T ss_pred --hccc---------cccceEEEEcchHHHhhhhccchhhhhhhhcccee---eec--ccceEcc-eEEEEcCCCCCC-- Confidence 1111 13568999999999999764433322111111111 122 3577766 899998877642 Q ss_pred EEEEecCCCccceeEeecccc------------cccccccCcccccceeeeeeeeceeecCcccccccCCcceeecCCch Q lcl|Aclame:pro 433 TIGYKGSNEMDAGIYYAPYVA------------LTPLRGSDPKNFQPVMGFKTRYGIGINPFADPAAQAPTKRIQNGMPD 500 (519) Q Consensus 433 ~vG~KG~~~~~~~~fyaPYv~------------~~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~i~~~~d~ 500 (519) +.++++|+. ...-...|+..+.-.+--.-.|++.. .++. ..+ T Consensus 171 ------------~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l-------~~~~-~vv------ 224 (231) T protein:vir:73 171 ------------SALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYL-------YDLT-KVV------ 224 (231) T ss_pred ------------ceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEE-------EcCc-cEE------ Confidence 222333321 01111247777777777777777554 1111 111 Q ss_pred hhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 501 IVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 501 ~a~~~~~~~y~r~v~v~~~ 519 (519) ++.+||+ T Consensus 225 ------------~~t~~g~ 231 (231) T protein:vir:73 225 ------------NITFTGV 231 (231) T ss_pred ------------EEEeecC Confidence 1233444 No 104 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=31.22 E-value=1.5 Score=19.60 Aligned_cols=288 Identities=10% Similarity=-0.026 Sum_probs=96.5 Q ss_pred ccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceeccc Q lcl|Aclame:pro 145 MYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAE 224 (519) Q Consensus 145 fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~ 224 (519) |.+. ++..++. ..+..- ........ ... ...-.. ....+.... ..... +..+. T Consensus 1 Ma~~----~~~~gg~---~vP~~~---~~~ii~~l-~~~--s~i~~l--~~~i~~~~~----~~~ip--------~~~~~ 53 (315) T protein:vir:80 1 MADD----FLSAGKL---ELPGSM---IGAVRDRA-IDS--GVLAKL--SPEQPTIFG----PVKGA--------VFSGV 53 (315) T ss_pred CCCC----cCCcCce---EcchHH---HHHHHHHH-Hhh--chhhhh--cceeecCCC----ceEEE--------EEeCC Confidence 1100 0000000 000000 00000000 000 000000 000000000 00000 00111 Q ss_pred ccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 225 GMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREV 304 (519) Q Consensus 225 GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINRei 304 (519) +-..-.+| +..+++...+++++++.+|.-+-....|-||.+|. ..|+..+|.++|..++...|.|.+ T Consensus 54 ~~a~wv~E---------g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s----~~~~~~~l~~~i~~~la~ai~~~~ 120 (315) T protein:vir:80 54 PRAKIVGE---------GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWAD----ADYRLGVLQDLISPALGASIGRAV 120 (315) T ss_pred cceEEeeC---------CccccccccceeeeEeeeeeEEeeehhhHHHhhcC----chhHHHHHHHHHHHHHHHHHHHHH Confidence 11111223 23445555666666666666666678999998884 356666777777777766666666 Q ss_pred HHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHH Q lcl|Aclame:pro 305 IDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVL 384 (519) Q Consensus 305 i~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L 384 (519) =+.+..-... . +.....|+.+.-... .. .++..-..+.-+.++...+.....+ ..+-.|++|+....| T Consensus 121 d~a~~~G~~~--~----~~~~~~~~~~~~~~~-~~----~~~~~~~~~~d~~~~~~~~~~~~~~-~~~~~imn~~~~~~L 188 (315) T protein:vir:80 121 DLIAFHGIDP--A----TGKAASAVHTSLNKT-KN----IVDATDSATADLVKAVGLIAGAGLQ-VPNGVALDPAFSFAL 188 (315) T ss_pred hhheeeccCC--C----CCccccccccccccc-cc----eeeccccchHHHHHHHHHHhhccCc-cceEEEEcHHHHHHH Confidence 5532211000 0 000111211110000 00 0000011112233333333222222 335688999998888 Q ss_pred HhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc---------e--------EEEEEecCCCccceeE Q lcl|Aclame:pro 385 AAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD---------Y--------FTIGYKGSNEMDAGIY 447 (519) Q Consensus 385 ~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y--------~~vG~KG~~~~~~~~f 447 (519) ...=.....+. ..+..+.....+. .++|.| ++|+++.+.+.+ . +.+|+.+... +- T Consensus 189 ~~l~~~~g~~~-~g~~~~~~~~~g~--~~tl~G-~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~----i~ 260 (315) T protein:vir:80 189 STEVYPKGSPL-AGQPMYPAAGFAG--LDNWRG-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP----IE 260 (315) T ss_pred HHHhhccCCcc-cccccccccccCC--Cceecc-eeeEecCcCCcccccccccccEEEEeecccEEEEEecCee----EE Confidence 75411000000 0111111111111 157887 699998887531 1 2222222111 11 Q ss_pred eecccccccccccCcc----c-ccc-eeeee--eeecee-ecCcccccccCCcceeecCC-chhhhcccch Q lcl|Aclame:pro 448 YAPYVALTPLRGSDPK----N-FQP-VMGFK--TRYGIG-INPFADPAAQAPTKRIQNGM-PDIVNSLGLN 508 (519) Q Consensus 448 yaPYv~~~~~~~~dp~----s-~qP-~~g~~--tRY~l~-~nP~~~~~~~~~~~~i~~~~-d~~a~~~~~~ 508 (519) ..+| .|++ + ||. .++|. .|+|.. .+|=+ ..++.+.- +. +.-..-| T Consensus 261 i~~~--------~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a-------~~~l~~~~a~~-~~~~~~~ 315 (315) T protein:vir:80 261 LIEY--------GDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDS-------FAVVKEKAAPK-PNPPAEN 315 (315) T ss_pred Eecc--------ccccCcccchhhcCcEEEEEEEEecceeecccc-------eEEEeeccCCC-CCCCCCC Confidence 1122 1111 1 221 13332 455533 23311 12222111 10 0011112 No 105 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=29.92 E-value=1.6 Score=19.44 Aligned_cols=297 Identities=9% Similarity=-0.007 Sum_probs=118.3 Q ss_pred cccchhhhccccccccccccCceehh--hHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCcccccccccc Q lcl|Aclame:pro 70 HGYDATNIAAGQTSGAVTQIGPAVMG--MVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYA 147 (519) Q Consensus 70 ~g~~~~~~~est~tg~v~~~~P~L~~--l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnE 147 (519) -.+..++...+-++..+++|-|-+.+ +..+.-+++ +|+=..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~-----------------v~~~~~~------------------- 44 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKM-----------------LDTSVVK------------------- 44 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhc-----------------chhhccc------------------- Confidence 11222333444456667777666533 222222222 2221111 Q ss_pred cccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccc Q lcl|Aclame:pro 148 PNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMA 227 (519) Q Consensus 148 adt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~Gms 227 (519) .|++.........++ ..|.. . .. .+.-+..+. T Consensus 45 ---d~~~~~~~Gdtv~ip----------------~~g~~---------------~--~~------------d~~~~~~i~ 76 (341) T protein:vir:94 45 ---TWGAQVKKGDTFHVP----------------RISEL---------------G--VE------------DKATDVPVG 76 (341) T ss_pred ---cccccccCCceEEEe----------------ccCcc---------------e--ee------------eecCCCccc Confidence 011110000000000 00000 0 00 000011111 Q ss_pred hhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 228 TSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDW 307 (519) Q Consensus 228 Ta~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~ 307 (519) . +.+ .-.+..++|||...-+-.= +-+|. ++. | .|--.|+..-....++.+++++|+.+ T Consensus 77 ~---~~~---------~~~~~~itiD~~~~~~~~i-----~d~d~---~~~-~-~d~~~~~~~~~~~aLA~~~D~~i~~~ 134 (341) T protein:vir:94 77 V---QPV---------NDTDFVITVDTDRTTAVAL-----DDLLE---IQA-S-YDLRAPYLEAMGYALAKDMTGSILGL 134 (341) T ss_pred c---ccc---------cCceEEEEEeeeeecceee-----chHHH---Hhh-c-cchHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 101 1135667787764332110 12222 223 3 68888898888999999999999887 Q ss_pred HHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhc Q lcl|Aclame:pro 308 INYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV 387 (519) Q Consensus 308 i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~ 387 (519) +-..+.... +.+-. ..+.....+ +.....+.+..+...+++. .+- ..|-|+|++|++...|... T Consensus 135 ~a~~~~~~~-~~~~~------~~~~~~t~~--~~~~~~~~i~~a~~~Lde~--~VP-----~~gR~lvv~P~~~~~Ll~~ 198 (341) T protein:vir:94 135 RAAVQNTAS-QNVFS------SSNGAITGN--GQAFSFAVFLAARRLLLEA--DVP-----EEKIVLLISPGQESALFTI 198 (341) T ss_pred hhhcccccc-Ccccc------CccccccCc--hhhhhHHHHHHHHHHHhhc--CCC-----ccCCEEEeCHHHHHHHhhc Confidence 643332111 00000 000110111 1112234444444444432 111 2457999999999999887 Q ss_pred CcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEe-------------------------cCCCc Q lcl|Aclame:pro 388 DTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYK-------------------------GSNEM 442 (519) Q Consensus 388 g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K-------------------------G~~~~ 442 (519) ..|......+ +.+ ..+-.+|.|.| +.||..++-|.+-.. +++ |.... T Consensus 199 ~~~~~~~~~g-----~~~-l~~G~ig~i~G-~~V~~Sn~lp~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 270 (341) T protein:vir:94 199 PQFISKDFIN-----NAP-IAQGQIGSLMG-VRVIRTSLIGNNSAT-GWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSL 270 (341) T ss_pred hhhhhhhccc-----cch-hheeeeeeEec-eEEEEeccccccccc-cccccccceeccccccccccccccccccccccc Confidence 7766543211 111 22223577766 999999988753211 111 11222 Q ss_pred cceeEeecccccccccccCcccccceee-----------------eeeeeceeecCcccccccCCcceeecCCchh Q lcl|Aclame:pro 443 DAGIYYAPYVALTPLRGSDPKNFQPVMG-----------------FKTRYGIGINPFADPAAQAPTKRIQNGMPDI 501 (519) Q Consensus 443 ~~~~fyaPYv~~~~~~~~dp~s~qP~~g-----------------~~tRY~l~~nP~~~~~~~~~~~~i~~~~d~~ 501 (519) ..||++.+. ..-..+.+||+.++...- +..||.+.+=++-- +....+.-+-|-. T Consensus 271 ~~gl~~~~~-av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp----~~~v~~~~~~~~~ 341 (341) T protein:vir:94 271 PATFTGNSR-PVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRP----LHAVNIHTTGDTV 341 (341) T ss_pred EEEEEEecc-cccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCc----ceeEEEecCcCCC Confidence 334444433 222333345554443211 11222222211100 0001111111111 No 106 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=29.62 E-value=1.6 Score=19.40 Aligned_cols=273 Identities=11% Similarity=0.005 Sum_probs=115.5 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCcc Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDN 243 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~ 243 (519) .+.... ...+...+-.- +..+ ..........++.... +....+ ..|.+.+++.=-.+..+|.+ .....- T Consensus 1 ma~~~T-~l~d~iiPev~-~~~v-~~~~~~~l~~~~~~~~---d~~l~g--~~G~tv~iP~~~~ig~a~~~---~~g~~i 69 (274) T protein:vir:12 1 MAQGLT-KTSNQIIPEVL-APMM-QAQLEKKLRFASFAEV---DSTLQG--QPGDTLTFPAFVYSGDAQVV---AEGEKI 69 (274) T ss_pred CCccee-ehhhhhchHHH-HHHH-HHHHHhhhhhccccee---cccccC--CCCCEEEEeeecCCCccccc---cCCCcc Confidence 111110 01111111000 0000 0000000000000000 000000 01222222210011122222 111222 Q ss_pred ccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccc Q lcl|Aclame:pro 244 PWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTV 323 (519) Q Consensus 244 ~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~ 323 (519) ...++..+ +.+++-+-|+-.=+++=| ..+.+ +-|.-.|..+-++..|..+++.+++..+..+..-. T Consensus 70 ~~~~lt~~--~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~-------- 135 (274) T protein:vir:12 70 PTDILETK--KREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred chhhcccc--eeeEEeeeecceeeecHH--HHHhc--ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Confidence 33444444 334444444422222221 22333 56889999999999999999999998665322110 Q ss_pred cccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccccccccccccc Q lcl|Aclame:pro 324 GAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFN 403 (519) Q Consensus 324 ~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~ 403 (519) ....++ ++-+-....++.++. ..++++||+|.|++.|.......|..+. .. . T Consensus 136 --~~~a~~-------------~d~i~dA~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~fv~~s---~~-g 187 (274) T protein:vir:12 136 --NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDASTNFTRAT---EL-G 187 (274) T ss_pred --cccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhhhhhccccc---cc-c Confidence 011222 222233333333321 2568999999999999976544333211 11 1 Q ss_pred ccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-cCc Q lcl|Aclame:pro 404 VDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPF 482 (519) Q Consensus 404 ~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-nP~ 482 (519) .....+-.+|.+.| ++||+|...|..-..+--+|. -.||. --+...-...||..++-.+-..-+||+.+ || T Consensus 188 ~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gA-----~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~- 259 (274) T protein:vir:12 188 DDIIVKGAFGEALG-AIIVRSNKLEAGTAILAKKGA-----VKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDE- 259 (274) T ss_pred ccceecccceeecC-eeEEEeCCCCcceEEEEeccc-----eeeee-cCCceeccccchhhcccEEEeeeEEEEEEEcC- Confidence 12222335688876 899999988753221111121 11222 11222222359999999999999999655 22 Q ss_pred ccccccCCcceeecCCchhhhcccc Q lcl|Aclame:pro 483 ADPAAQAPTKRIQNGMPDIVNSLGL 507 (519) Q Consensus 483 ~~~~~~~~~~~i~~~~d~~a~~~~~ 507 (519) ++-.++.-+ .++-.| T Consensus 260 ------~~vv~~t~~----~~~~~~ 274 (274) T protein:vir:12 260 ------SKAVKITKG----SGSLEM 274 (274) T ss_pred ------CceEEEEcC----CccccC Confidence 112223221 112233 No 107 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=28.02 E-value=1.8 Score=19.20 Aligned_cols=341 Identities=12% Similarity=0.024 Sum_probs=111.7 Q ss_pred CCh---------HHHHHhhhhhhCCCccccccccchhhhhhhh---hhhHHHHHh-hhhhccchhhhhhhhhhhhhhhh- Q lcl|Aclame:pro 1 MKK---------NALVQKWSALLENEALPEIVGASKQAIIAKI---FENQEQDIL-TAPEYRDEKISEAFGSFLTEAEI- 66 (519) Q Consensus 1 ~~~---------~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~---~enq~~~~~-~~~~~~~~~~~~~~~~~~~~~~~- 66 (519) +++ ..+.++...-++ .+-.-.+.-+. ....+ ++.+++.+. .......+....++...+.+... T Consensus 159 ~k~~~e~~~~e~~e~~~~~~~~~e--~l~~~~e~~~~-~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 235 (543) T protein:vir:81 159 MRTFGRDAEEVKGELRARALSAIE--KMQGASDNVRA-AATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAA 235 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-HHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHH Confidence 000 001111111111 00000000000 00111 111111110 00000001111111111110000 Q ss_pred ----ccccccch-hhhccccccccccccCceehhhHHHHHhh-hhhhhceeeccCCccchhheeeeeeecCCCCCCCccc Q lcl|Aclame:pro 67 ----GGDHGYDA-TNIAAGQTSGAVTQIGPAVMGMVRRAIPH-LIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKE 140 (519) Q Consensus 67 ----~~~~g~~~-~~~~est~tg~v~~~~P~L~~l~Rra~p~-LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~e 140 (519) ...+.... ....-++++|.+.--..+.-.++.+.... -+...++-|.|++|.. .+.-..+. T Consensus 236 ~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~--------~~~~~~~~----- 302 (543) T protein:vir:81 236 ILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDV--------WHGVSSAA----- 302 (543) T ss_pred HhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcce--------EEEEecCC----- Confidence 00000000 00000111111110001111111111111 1222333333333221 00000000 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccce Q lcl|Aclame:pro 141 AFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLA 220 (519) Q Consensus 141 A~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~ 220 (519) +. .. T Consensus 303 -------~~---------------------------------------------------------------------a~ 306 (543) T protein:vir:81 303 -------VQ---------------------------------------------------------------------WS 306 (543) T ss_pred -------cc---------------------------------------------------------------------ee Confidence 00 00 Q ss_pred ecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 221 EIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEI 300 (519) Q Consensus 221 ~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEI 300 (519) -+++| ..+++-..+++.++++++.-+=...+|-||.+|- + |.++.|.+-|...|...+ T Consensus 307 ~v~Eg-----------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~ 364 (543) T protein:vir:81 307 WDAEF-----------------EEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE--A---NVTETVALLFAEGKDELE 364 (543) T ss_pred ecccC-----------------ccccccccccceeeeeeeeeEeeehhhHHHHhcc--H---HHHHHHHHHHHHHHHHHH Confidence 00111 1122233345566677777777788999999873 2 679999999999999999 Q ss_pred hHHHHHHHHhhhhhhhhcccccccccceeeecccc-----ccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEE Q lcl|Aclame:pro 301 NREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDP-----IDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFII 375 (519) Q Consensus 301 NReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~-----~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v 375 (519) |+-||. |. .+...+.|++..... ..........+-...|+..+. ..+.....+| T Consensus 365 d~ail~--------G~----Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~---------~~~~~~~~~v 423 (543) T protein:vir:81 365 AVTLTT--------GT----GQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLA---------ARHRRQGAWL 423 (543) T ss_pred HHHHhc--------cC----CCCcccccchhhcccccccccccccccccHHHHHHHHHhhh---------ccccCCcEEE Confidence 998874 10 011123343221110 000001111222333333332 2333334688 Q ss_pred EchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccc---------e-EEEEEecCCCccce Q lcl|Aclame:pro 376 ASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD---------Y-FTIGYKGSNEMDAG 445 (519) Q Consensus 376 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y-~~vG~KG~~~~~~~ 445 (519) ++|.+...|...- +.. +...+.....+. -++|.| ++||+..+.+.. + |++|-- .. T Consensus 424 ~n~~~~~~l~~lk--d~~----G~~l~~~~~~g~--~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~i~~gd~------~~ 488 (543) T protein:vir:81 424 ANNLIYNKIRQFD--TQG----GAGLWTTIGNGE--PSQLLG-RPVGEAEAMDANWNTSASADNFVLLYGNF------QN 488 (543) T ss_pred EcHHHHHHHHHhh--cCC----CceeccCcCCCC--Cccccc-eeeEEeccccccccccccCCcceEEEeec------cc Confidence 9999998887532 100 001111111111 146776 699888775531 1 111111 00 Q ss_pred eEeecccccccccccCcc--------cccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcc Q lcl|Aclame:pro 446 IYYAPYVALTPLRGSDPK--------NFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) Q Consensus 446 ~fyaPYv~~~~~~~~dp~--------s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~ 505 (519) .++... .. +.-.+||. ..+=.+=+..|+|..+ ||-+ ...+. ++..+ T Consensus 489 ~~i~~~-~~-~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A-------~~~l~-----~~~~a 543 (543) T protein:vir:81 489 YVIADR-IG-MTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNA-------FRLLN-----VETAS 543 (543) T ss_pred eeEEee-cc-cEEEEeccccccchhhcCceEEEEEEeeccEeecccc-------eEEEE-----ecccC Confidence 111100 00 11112332 2233445556777643 2222 11111 11122 No 108 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=27.84 E-value=1.8 Score=19.18 Aligned_cols=271 Identities=10% Similarity=0.026 Sum_probs=116.1 Q ss_pred ccccccccccccccccccccccccccccccccCCCC--CCCccccccccccccccccceecccccchhhhhhcccCCCCC Q lcl|Aclame:pro 164 LAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAG--ATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGST 241 (519) Q Consensus 164 ~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag--~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~ 241 (519) .+..+ +...+...+-.-. ... ............ ..+.. + .+ ..|.+..++.=-....+|.+. . T Consensus 1 Ma~~~-T~~~~~iiPev~s-~~v-~~~~~~~~v~~~~~~~~~~-l----~g--~~G~tv~ip~~~~~g~a~~~~---~-- 65 (278) T protein:vir:80 1 MADLT-TKLANLIDPEVMG-PMI-SAKLPKAIKFGKIAPIDNS-L----EG--QPGSEITVPKYKYIGDAQDVA---E-- 65 (278) T ss_pred CCCcc-eehhheecHHHHH-HHH-HHHHHHhhhhcccceeccc-c----cC--CCCCEEEEeeeccCCcceeec---C-- Confidence 11110 0011111110000 000 000000000000 00000 0 00 011112211100011222221 1 Q ss_pred ccccccceeEEEEEEEEeecccccccccHHHHHHHHhh-cCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 242 DNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAV-HGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) Q Consensus 242 ~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAi-HGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t 320 (519) +..+..-..+..+++++-|-|+- + | + .-|+-+. -+-|.-.+..+-++..+..+++++++..+... .... T Consensus 66 g~~i~~~~lt~~~~~~~i~~~~~-a-~--~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a-~~~~---- 135 (278) T protein:vir:80 66 GAAIDYSALETESVKHGIKKAGK-G-V--K-LTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTT-TLEV---- 135 (278) T ss_pred CCcCcccccccceeeEeeehhhc-c-c--c-ccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccc---- Confidence 12333334556666777676653 2 2 2 3344443 36789999999999999999999999866421 1000 Q ss_pred ccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCccccc--ccccc Q lcl|Aclame:pro 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSY--AAQGL 398 (519) Q Consensus 321 ~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~--~~~~~ 398 (519) ..+-+.|..+ -+.+.+-.+..+ +....-. ...+++++|.+...|.......|. +..+ T Consensus 136 ----~~~~t~~~~~--------~~~~~~~da~~~-------l~~~~~~-~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g- 194 (278) T protein:vir:80 136 ----KGAINIGLID--------KIENTFTDAPDA-------IEDESIT-TTGVLFLNYKDTAKLREEAAGSWTKASQLG- 194 (278) T ss_pred ----ccccccchhh--------hHHHHHHHHHHh-------hcccCCC-cccEEEECHHHHHHHHhhhhhhcccccccc- Confidence 0011122111 122222222222 2221111 124899999999999866533332 2111 Q ss_pred cccccccCCCceEEEEecCcEEEEecCCCccce-EEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeece Q lcl|Aclame:pro 399 GQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDY-FTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGI 477 (519) Q Consensus 399 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l 477 (519) .....+-.+|.+.| ++||++...|..= ++++ +| +-.|+..= +...-...|+..++-.|-...+||+ T Consensus 195 -----~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~-~g-----Ai~~~~~~-~~~vE~~Rd~~~~~d~i~~~~~yg~ 261 (278) T protein:vir:80 195 -----DDLLVKGAFGELLG-WEIVRTKKLADGNALAVK-AG-----ALKTFLKR-NLLAESGRDMDHKLTKFNADQHYAV 261 (278) T ss_pred -----ccceeeccceeecc-eeEEEcCCCCcceEEEEe-cc-----ceeeeecC-CcccccccchhhccceeeeeeEEEE Confidence 11122335788876 8999999988531 1111 12 11122211 1221122599999999999999998 Q ss_pred ee-cCcccccccCCcceeecCCchhhhc Q lcl|Aclame:pro 478 GI-NPFADPAAQAPTKRIQNGMPDIVNS 504 (519) Q Consensus 478 ~~-nP~~~~~~~~~~~~i~~~~d~~a~~ 504 (519) .+ ||-. -.+|.-+ |+. T Consensus 262 ~v~~~~~-------~v~it~~----a~~ 278 (278) T protein:vir:80 262 ALVDETK-------AVKVVPV----AGN 278 (278) T ss_pred EEEcCcc-------eEEEeec----cCC Confidence 64 3322 2333332 111 No 109 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=27.01 E-value=1.9 Score=19.07 Aligned_cols=345 Identities=12% Similarity=0.042 Sum_probs=123.0 Q ss_pred CChH----HHHHhhhhhhCCCc-----cccccccch----hhhhhhh--hhhHHHHHhhhhhccchhhhhhh-------- Q lcl|Aclame:pro 1 MKKN----ALVQKWSALLENEA-----LPEIVGASK----QAIIAKI--FENQEQDILTAPEYRDEKISEAF-------- 57 (519) Q Consensus 1 ~~~~----~l~~kw~p~l~~~~-----~~~~~~~~~----~~~~~~~--~enq~~~~~~~~~~~~~~~~~~~-------- 57 (519) ||-+ +|+++.+.+.+.-. +-+..+..+ ++..+.+ |+++-+.+.+..+-+.+.+.+.. T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 7743 35566666553310 000000000 0000100 00110111111000000000000 Q ss_pred -----------------hhhhhhhhhccccccchhhhccccccccc---cccCceehhhHHHHHhhhhhhhceeeccCCc Q lcl|Aclame:pro 58 -----------------GSFLTEAEIGGDHGYDATNIAAGQTSGAV---TQIGPAVMGMVRRAIPHLIAFDICGVQPLNN 117 (519) Q Consensus 58 -----------------~~~~~~~~~~~~~g~~~~~~~est~tg~v---~~~~P~L~~l~Rra~p~LIa~DI~GVQPmTG 117 (519) ..+. .. +-+..-.....-+-+++.|.+ ..+.+-++.+. .+...-.++|-+.||++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~---~~~~~l~~l~~~~~~~~ 155 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMS-KT-IRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLK---EGYPSLKEHCHVIPVNR 155 (421) T ss_pred ccccccchhHHHHHHHHHHHH-Hh-hhccchhHHHhhccccCCcceecchhhHHHHHHHH---HhhhhhhhhceeeeccC Confidence 0000 00 000000000000111111211 12223333333 34455678888888887 Q ss_pred cchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCC Q lcl|Aclame:pro 118 PTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVD 197 (519) Q Consensus 118 PTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ 197 (519) +++-+- +...... +.+ T Consensus 156 ~~~~~~-----~~~~~~~--------------~~~--------------------------------------------- 171 (421) T protein:vir:13 156 NAGKMP-----VRAGASV--------------DKL--------------------------------------------- 171 (421) T ss_pred CceEEE-----EeecCCc--------------cce--------------------------------------------- Confidence 664221 1110000 000 Q ss_pred CCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 198 AGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLR 277 (519) Q Consensus 198 ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLK 277 (519) ...++| ...++-..++++++...+.-+-...+|.||.+|-- T Consensus 172 ----------------------~~~~E~-----------------~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~ 212 (421) T protein:vir:13 172 ----------------------ANLAKD-----------------TELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE 212 (421) T ss_pred ----------------------eecccc-----------------ccccccccceeEEEeeeeeeEeehhhhHHHHhhhH Confidence 000010 11122223334445555555555679999999842 Q ss_pred hhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 278 AVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDK 357 (519) Q Consensus 278 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~ 357 (519) .|.++.|.+-|+..+..-+|..|+..+. |..+ .+++.++ +..+.++..+.. T Consensus 213 ----~~l~~~i~~~la~~~~~~~~~~i~~~~~--------g~~~----~~~~~~~-------------d~i~~~~~~l~~ 263 (421) T protein:vir:13 213 ----INFLEFVNEEFAEFAVNTENAEIVKQAK--------AVLA----EETINDY-------------AGLVKTINSLVP 263 (421) T ss_pred ----HHHHHHHHHHHHHHHHHHhhhhHhhhhh--------hccc----cccccch-------------HHHHHHHHHhhh Confidence 4568888888888888888888775321 2221 1232222 233444444432 Q ss_pred HHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCcc-------- Q lcl|Aclame:pro 358 EAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS-------- 429 (519) Q Consensus 358 ~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-------- 429 (519) .+..+..+|++|.....|...- + +. + +.+..+.... --++|.| ++|++..+.+. T Consensus 264 ---------~~~~~a~~v~n~~~~~~l~~lk--d-~~---G-~~i~~~~~~~-~~~tl~G-~pV~~~~~~~~~~~~~~~~ 325 (421) T protein:vir:13 264 ---------NARKRAIIVTNSDGRAYLDGLM--D-KQ---G-RPLLKELSDG-GDLVFKG-RPVIELEESIFDVGDETKF 325 (421) T ss_pred ---------hhcCCCEEEEcHHHHHHHHHhh--c-CC---C-ceeecCcCCC-CCceecc-eeeEEeccccccCCCceEE Confidence 2234567899999888887531 1 10 0 1111111110 0146777 58887766542 Q ss_pred ------ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-----------cCcccccccCCcc Q lcl|Aclame:pro 430 ------DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-----------NPFADPAAQAPTK 492 (519) Q Consensus 430 ------dy~~vG~KG~~~~~~~~fyaPYv~~~~~~~~dp~s~qP~~g~~tRY~l~~-----------nP~~~~~~~~~~~ 492 (519) +|+.++.++.-..+.+- + .+-..-+=.+-+..||+.++ .++..-.+. T Consensus 326 ~~gd~~~~~~~~~~~~~~v~~~~----~--------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~---- 389 (421) T protein:vir:13 326 IVSDFKTLIKFMDRKQYLIDQSK----E--------AGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKL---- 389 (421) T ss_pred EEEeccccEEEEEecceEEEeec----c--------cccccCeeEEEEEeeecceeecchhhheeeecccceeecc---- Confidence 12333332222111110 0 00111222344445554332 111100000 Q ss_pred eeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 493 RIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 493 ~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) +..+.-+..+++++=--+-.||-- T Consensus 390 ---~~~~~~~~~~~~~~~~~~~~~~~~ 413 (421) T protein:vir:13 390 ---QEVLKSSPRSGKNKNESKEEIKEE 413 (421) T ss_pred ---ccccCCCCcCCCCccccchheeec Confidence 000111111222211111112111 No 110 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=26.74 E-value=1.9 Score=19.04 Aligned_cols=310 Identities=15% Similarity=0.119 Sum_probs=120.8 Q ss_pred CCccchhheeeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAV 194 (519) Q Consensus 115 mTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~ 194 (519) |.--++.-.+.|.-.+.. ..++.++|- ..|+|.--.. +...+ . ..+......+.+ T Consensus 1 m~~~~~~~~~t~~g~~~~-----~~d~~al~i---k~f~~eV~~~----f~~~s------~------~~~~~~~r~i~~- 55 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKS-----SSDALALFL---KVFAGEVLTA----FTRRS------V------TADKHIVRTIQN- 55 (347) T ss_pred CCCCCccccccccccCCc-----cccHHHHHH---HHHhHHHHHH----HHHHH------h------hhcccccccccc- Confidence 555555333333222111 111112111 1233221110 00000 0 000000000000 Q ss_pred cCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 195 TVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQ 274 (519) Q Consensus 195 ~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQ 274 (519) | +..-.+ .. .......+- ..+.+.+ ......=.|+.++||++.+ +..-+.-.- T Consensus 56 ----G--~sv~i~--~i---G~~tv~~~t------~G~~l~~--~~~~~~~~e~~itID~~~~--------~~~~VddiD 108 (347) T protein:vir:94 56 ----G--KSAQFP--VM---GRTSGVYLA------PGERLSD--KRKGIKHTEKVITIDGLLT--------ADVMIFDIE 108 (347) T ss_pred ----c--ceEEEe--cc---cceeeeeec------CCCCcCC--CCCCCCcceEEEEecchhh--------hhHHhhhHH Confidence 0 000000 00 000000000 0111100 0011233567788887632 223344344 Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHH-hhhhhhhhcccccccccce-----eeeccccccccccchHHHHH Q lcl|Aclame:pro 275 DLRAVHGMDADAELSGILATEIMLEINREVIDWIN-YSAQVGKSGMTNTVGAKAG-----VFDFQDPIDIRGARWAGESF 348 (519) Q Consensus 275 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~-~~a~~~~~~~t~~~~~~~G-----~fDl~~~~d~~~~~~a~e~~ 348 (519) |.++ | .|-.+|++.-....+..++++-|++.+. ..+..+.. +..+.| +++.....+. --...-. T Consensus 109 ~~q~-~-~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~-----~~~~~g~~~~s~~~~~~~~~~---~~~~~~~ 178 (347) T protein:vir:94 109 DAMN-H-YDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAAS-----NENIAGLGTASVLEVGKKADL---DTPAKLG 178 (347) T ss_pred HHhc-C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----ccccCCCcccceeeccccccc---cchhhhH Confidence 4444 3 7889999999999999999999998653 23322221 111112 1221111110 0112223 Q ss_pred HHHHHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCc Q lcl|Aclame:pro 349 KALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYAR 428 (519) Q Consensus 349 r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 428 (519) ..++..|-+.....-..---..|-|+|++|+.-.+|...-.+.... ... ..+..+-.+|.+.| ++||.-++.| T Consensus 179 ~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~-----~~~-~~~~~~G~Vg~i~G-~~V~~Sn~lp 251 (347) T protein:vir:94 179 EAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAAN-----YAA-LIDPETGNIRNVMG-FVVVEVPHLV 251 (347) T ss_pred HHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhh-----ccc-cccccccceEEEec-eEEEecCccc Confidence 4444444333333333222224679999999999997543332221 111 11222225688876 8999988877 Q ss_pred c----------ceEEEE-------------EecCCCccceeEeecccc----ccc---ccccCcccccceeeeeeeecee Q lcl|Aclame:pro 429 S----------DYFTIG-------------YKGSNEMDAGIYYAPYVA----LTP---LRGSDPKNFQPVMGFKTRYGIG 478 (519) Q Consensus 429 ~----------dy~~vG-------------~KG~~~~~~~~fyaPYv~----~~~---~~~~dp~s~qP~~g~~tRY~l~ 478 (519) . .|-++. |+++-.-..++||-|=.- +.+ -...|+..|.=.|==+..||.. T Consensus 252 ~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~ 331 (347) T protein:vir:94 252 QGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHG 331 (347) T ss_pred ccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCc Confidence 4 122221 333333335777777522 222 1123455444333223333321 Q ss_pred -ecCcccccccCCcceeecCCchhhhccc Q lcl|Aclame:pro 479 -INPFADPAAQAPTKRIQNGMPDIVNSLG 506 (519) Q Consensus 479 -~nP~~~~~~~~~~~~i~~~~d~~a~~~~ 506 (519) .+|-+- ..|.- ..++ T Consensus 332 ~~rP~~a-------~~~~~------~~A~ 347 (347) T protein:vir:94 332 GLRPEAA-------GALVF------SPAE 347 (347) T ss_pred cccccee-------EEEEe------cCCC Confidence 223221 11110 0111 No 111 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=24.65 E-value=2.1 Score=18.76 Aligned_cols=350 Identities=11% Similarity=0.054 Sum_probs=111.4 Q ss_pred CC-hHHHHHhhhhhhCCCccccccccchhhhhhhhhhhHHHHHhhhh--hccchhhhhhhhhhhhhhhhccccccch--- Q lcl|Aclame:pro 1 MK-KNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAP--EYRDEKISEAFGSFLTEAEIGGDHGYDA--- 74 (519) Q Consensus 1 ~~-~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~--- 74 (519) |+ -+++.+|...+-.. -+-++.+.-+..-..+.+++..+.+.++. +-+.+.-.++........-......-+. T Consensus 1 ik~L~e~~~e~~e~~~~-~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 79 (390) T protein:vir:40 1 MNNLDKKDSETLNISTA-FLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKY 79 (390) T ss_pred CchHHHHHHHHHHHHHH-HHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHH Confidence 11 01111111111000 00001111111111111121111111100 0000000000000000000000000000 Q ss_pred --hhhccccccccccccCceehh------hHHHHHhhhhhhhceeeccCCccchhheeeeeeecCCCCCCCccccccccc Q lcl|Aclame:pro 75 --TNIAAGQTSGAVTQIGPAVMG------MVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMY 146 (519) Q Consensus 75 --~~~~est~tg~v~~~~P~L~~------l~Rra~p~LIa~DI~GVQPmTGPTGLIFAMRsrY~~~~~~~~~~eA~~~fn 146 (519) ..+.++++++ ...||| +++++-..-+-.++|-|.||++....|. +....+ ++ T Consensus 80 ~~~~~~~~~~~~-----gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~----~~~~~~------~a----- 139 (390) T protein:vir:40 80 YNEVIAGNGFAG-----VTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWII----SVGDVA------TA----- 139 (390) T ss_pred HHHHHhccCccc-----CcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEE----EEcCCc------ce----- Confidence 0111111111 112222 2333333445567889999887444332 110000 00 Q ss_pred ccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCccccccccccccccccceeccccc Q lcl|Aclame:pro 147 APNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGM 226 (519) Q Consensus 147 Eadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~Gm 226 (519) .|- .++ T Consensus 140 ----~~~---------------------------------------------------------------------~E~- 145 (390) T protein:vir:40 140 ----WWG---------------------------------------------------------------------PLC- 145 (390) T ss_pred ----eee---------------------------------------------------------------------ccc- Confidence 000 000 Q ss_pred chhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 227 ATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVID 306 (519) Q Consensus 227 sTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLDAEaELsNILSTEImlEINReii~ 306 (519) +|. ....+..|.+..|++.|..+- ...|-||.+|-- .|.|++|.+.|+..|..-+|+.||. T Consensus 146 ----~~~----~~~~~~~f~~i~l~~~k~~~~-------i~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~~a~l~ 206 (390) T protein:vir:40 146 ----AEI----KEVLDNGFDKIQTGMYKLSAY-------IPVCNAMLDLGP----SWLDQYVRTILGEAMALGLEAGIVN 206 (390) T ss_pred ----ccc----CccccccceeeEeeeeeEEEe-------ehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHhhhhc Confidence 000 011234577777777777653 358889988853 4679999999999999999999986 Q ss_pred HHHhhhhhhhhcccccccccceeeecccccc------ccccchHHHHHHHHHHHHHHHHHHHHhhccccCCCEEEEchHH Q lcl|Aclame:pro 307 WINYSAQVGKSGMTNTVGAKAGVFDFQDPID------IRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNV 380 (519) Q Consensus 307 ~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d------~~~~~~a~e~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~v 380 (519) |. | .+.+.|++.-..... ....-...+-.-.++..+......-.... ++. -..||+|.- T Consensus 207 --------G~-G----~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~-~~~-a~~i~n~~t 271 (390) T protein:vir:40 207 --------GS-G----KDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS-VSD-AILVINPAD 271 (390) T ss_pred --------cc-C----CCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhh-hcC-ceEEEcchh Confidence 10 0 112233332100000 00000000111222222222111111111 123 334666654 Q ss_pred -HHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEE--------EecCCCcccee--Eee Q lcl|Aclame:pro 381 -VNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIG--------YKGSNEMDAGI--YYA 449 (519) Q Consensus 381 -a~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG--------~KG~~~~~~~~--fya 449 (519) +..|...-.+ .|.++....+.+.-+++|+++++.+.+=++.| -.+....+.+- +|. T Consensus 272 ~~~~l~~~~~~-------------~d~~G~~v~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~ 338 (390) T protein:vir:40 272 YWSKIYAATSY-------------MTPQGVWVTGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLL 338 (390) T ss_pred HHHHHHHHhhc-------------cCCCCccccccCCCceeEEEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhh Confidence 4444421111 11112111222333579999988886434443 22222111100 000 Q ss_pred cc--c-----ccccccccCcccccceeeeeeee-ceeecCccc-ccccCCcceeecCCchhhhcccchh Q lcl|Aclame:pro 450 PY--V-----ALTPLRGSDPKNFQPVMGFKTRY-GIGINPFAD-PAAQAPTKRIQNGMPDIVNSLGLNG 509 (519) Q Consensus 450 PY--v-----~~~~~~~~dp~s~qP~~g~~tRY-~l~~nP~~~-~~~~~~~~~i~~~~d~~a~~~~~~~ 509 (519) + + .-.....+||+.|.- +=++.== .-.+.||.. ...+ ++|. +- T Consensus 339 -~~~~~~r~~~r~dg~v~~~~A~~~-l~~~~~~~~~~~~~~~~~~~~~--------~~~~-------~~ 390 (390) T protein:vir:40 339 -DDETLYYAKQYANGRPKDNSSFLV-FDITGLEGSPAIDVNVVNNATP--------SETP-------AE 390 (390) T ss_pred -cCcEEEEEEEEeCCEEecccceEE-EEeeccCCCCCCCcceeeCCCC--------CCCC-------CC Confidence 0 0 000011134443330 0000000 001112211 0010 1111 00 No 112 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=24.03 E-value=2.2 Score=18.68 Aligned_cols=312 Identities=15% Similarity=0.130 Sum_probs=120.5 Q ss_pred CC-ccchhheeeeeeecCCCCCCCcccccccccccccccCcccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 115 LN-NPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEA 193 (519) Q Consensus 115 mT-GPTGLIFAMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~ 193 (519) |- .|+|.--.-|..++.-. .++.++|- .-|||.--.. +...+-........ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~-----~~~~al~i---e~~~g~V~~~-------------------f~~~s~~~~~v~~r- 52 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSA-----ADKLALFL---KVFGGEVLTA-------------------FARTSVTMPRHMLR- 52 (347) T ss_pred CCCCccCcccccccccCCcc-----cchHHHHH---HHHHHHHHHH-------------------HHHHHhhhhhhccc- Confidence 32 33333222232222110 01111111 1122211000 00000000000000 Q ss_pred ccCCCCCCCccccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 194 VTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELA 273 (519) Q Consensus 194 ~~~~ag~t~~~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELA 273 (519) +...| +.... ......+...+-. .+.+. +...+....|+-++||++- -+...++-. T Consensus 53 -~~~~G--~sv~i-----~~iG~~t~~~~~~------g~~l~--~~~~~~~~~e~~ltiD~~~--------y~~~~Vddi 108 (347) T protein:vir:33 53 -SIASG--KSAQF-----PVIGRTKAAYLKP------GENLD--DKRKDIKHTEKVIHIDGLL--------TADVLIYDI 108 (347) T ss_pred -ccccc--ceeEe-----eeccceeeeeecC------CCCCC--CCCCCCccceEEEEechhh--------hhhHHHhhH Confidence 00000 00000 0000000000000 11110 0111234567788888753 233456666 Q ss_pred HHHHhhcCCCHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhccccccc-ccceeeeccccccccccchHH-HHHHHH Q lcl|Aclame:pro 274 QDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVG-AKAGVFDFQDPIDIRGARWAG-ESFKAL 351 (519) Q Consensus 274 QDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~-~~~G~fDl~~~~d~~~~~~a~-e~~r~L 351 (519) -+.++ | .|-..|++.-....++..+++-|+..|..-......-...... ...+.+... ....+--|.. +-...+ T Consensus 109 D~~q~-~-~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~tg~~~d~~~~a~~i 184 (347) T protein:vir:33 109 EDAMN-H-YDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLV--KPTTGSLTDPVELGKAI 184 (347) T ss_pred HHHhc-C-CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc--ccccccccchhhhHHHH Confidence 66666 4 7889999999999999999999987554221111000000000 001111111 1111112222 223445 Q ss_pred HHHHHHHHHHHHhhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccce Q lcl|Aclame:pro 352 LFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDY 431 (519) Q Consensus 352 ~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy 431 (519) |..|.+.....-.+=--..|-|+|++|+.-.+|.....+..... . ..+....-.+|.+.| ++||.-+..|.-. T Consensus 185 ~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~-----~-~~~~~~~G~V~~i~G-~~V~~Sn~lp~~~ 257 (347) T protein:vir:33 185 IAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY-----Q-ALLDPERGTIRNVMG-FEVVEVPHLTAGG 257 (347) T ss_pred HHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccc-----c-cccccccceeEEEec-eeEEEecccccCc Confidence 55554444444443333357899999999999998776553311 1 123344446788877 9999999887643 Q ss_pred EEEEEecCCCccc----------------------eeEeecccc----c---ccccccCcccccceeeeeeeeceee-cC Q lcl|Aclame:pro 432 FTIGYKGSNEMDA----------------------GIYYAPYVA----L---TPLRGSDPKNFQPVMGFKTRYGIGI-NP 481 (519) Q Consensus 432 ~~vG~KG~~~~~~----------------------~~fyaPYv~----~---~~~~~~dp~s~qP~~g~~tRY~l~~-nP 481 (519) ++ + ......+ ||||-|=.. + ..-+..|++.|-=.|=-+..||..+ +| T Consensus 258 ~~-~--~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP 334 (347) T protein:vir:33 258 AG-D--TREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRP 334 (347) T ss_pred cc-c--ccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhhhHhhhhhhhcCCceecc Confidence 22 0 1111112 333332211 1 1111124444433333333333221 11 Q ss_pred cccccccCCcceeecCCchhhhcccchhhhhhhhhcCC Q lcl|Aclame:pro 482 FADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) Q Consensus 482 ~~~~~~~~~~~~i~~~~d~~a~~~~~~~y~r~v~v~~~ 519 (519) -.- ..| ..|.| T Consensus 335 ~~a-------v~i--------------------~~~~~ 345 (347) T protein:vir:33 335 EAA-------GAI--------------------VLPKV 345 (347) T ss_pred cce-------EEE--------------------ecCCC Confidence 110 000 01111 No 113 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=20.70 E-value=2.7 Score=18.20 Aligned_cols=343 Identities=11% Similarity=-0.010 Sum_probs=110.0 Q ss_pred CCh------HHHHHhhh---hhhCCC------cccc---------ccccc-hhhhhhhhhhhHHHHHhhhhhccchhhhh Q lcl|Aclame:pro 1 MKK------NALVQKWS---ALLENE------ALPE---------IVGAS-KQAIIAKIFENQEQDILTAPEYRDEKISE 55 (519) Q Consensus 1 ~~~------~~l~~kw~---p~l~~~------~~~~---------~~~~~-~~~~~~~~~enq~~~~~~~~~~~~~~~~~ 55 (519) +.+ +++.+.+. -.++.. ...+ +...- ...-.....+.+.+...+..........+ T Consensus 48 ~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 127 (437) T protein:vir:10 48 KEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQD 127 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhH Confidence 000 01111111 111100 0000 00000 00000111111111111110000000000 Q ss_pred hhhhhhhhhhhccccccc-------hhhhccccccccccccCceehh-----hHHHHHhhhhhhhceeeccCCccchhhe Q lcl|Aclame:pro 56 AFGSFLTEAEIGGDHGYD-------ATNIAAGQTSGAVTQIGPAVMG-----MVRRAIPHLIAFDICGVQPLNNPTGQVF 123 (519) Q Consensus 56 ~~~~~~~~~~~~~~~g~~-------~~~~~est~tg~v~~~~P~L~~-----l~Rra~p~LIa~DI~GVQPmTGPTGLIF 123 (519) .......+........+. ......+++ .....|++ .++.........+++.|.||+.+.+-+- T Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~-----~~~g~lvp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~ 202 (437) T protein:vir:10 128 MKLKVGGEIADKKVTAFADYLKTGEVRDVTGIAL-----KDGKVIIPETILTPEKEVHQFPRLGSLVRTESVTTTTGKLP 202 (437) T ss_pred HHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhccc-----ccccccchHHHHHHHHHhhhhhhhhhcceeEeeccCceeeE Confidence 000000000000000000 000111111 01111211 1111111122345566666665543222 Q ss_pred eeeeeecCCCCCCCcccccccccccccccCccccccccccccccccccccccccccccccccccccccccccCCCCCCCc Q lcl|Aclame:pro 124 ALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDA 203 (519) Q Consensus 124 AMRsrY~~~~~~~~~~eA~~~fnEadt~fSG~~~~~~~~~~~~~t~~~~g~~~~~~~~~~G~~~~~~~~~~~~~ag~t~~ 203 (519) -++..-+ . . T Consensus 203 ~~~~~~~---------~---------~----------------------------------------------------- 211 (437) T protein:vir:10 203 IFNNSTD---------L---------L----------------------------------------------------- 211 (437) T ss_pred Eeecccc---------c---------c----------------------------------------------------- Confidence 1111000 0 0 Q ss_pred cccccccccccccccceecccccchhhhhhcccCCCCCccccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCC Q lcl|Aclame:pro 204 AKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMD 283 (519) Q Consensus 204 ~~~~~~~~~~~~~g~~~~~~~GmsTa~AEal~~~ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYTmELAQDLKAiHGLD 283 (519) ..++++- .....+...|.++.|.+.|..+ -..+|-||.+|- ..| T Consensus 212 ----------------~~~~e~~---------~~~e~~~~~~~~v~~~~~k~~~-------~~~is~ell~ds----~~~ 255 (437) T protein:vir:10 212 ----------------TAHTEYG---------QTTKNATPVITPILWDLKTYTG-------GYVFSQELISDS----SYD 255 (437) T ss_pred ----------------ccccccc---------cccccccccceeeeeehhheee-------ehhhhHHHHhhh----HHH Confidence 0000000 0001122346666666555543 467899999984 356 Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhhhhcccccccccceeeeccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 284 ADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIA 363 (519) Q Consensus 284 AEaELsNILSTEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~fDl~~~~d~~~~~~a~e~~r~L~~~i~~~a~~I~ 363 (519) .+++|.+.|+..|..-+|..||.-... +.+.++-. .+. -.|...+.. . T Consensus 256 ~~~~i~~~l~~~~~~~~~~~i~~g~g~-------------~~~~~~~~----------~~~----~~~~~~~~~-----~ 303 (437) T protein:vir:10 256 WQAELQSRLIELRDNTDDSLIITALTD-------------GIKKTTST----------YLL----GDLKKVLNV-----T 303 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhcc-------------cccccccc----------cch----hhHHHHHHh-----h Confidence 788999999999999999988863211 01111100 000 111111110 1 Q ss_pred hhccccCCCEEEEchHHHHHHHhcCcccccccccccccccccCCCceEEEEecCcEEEEecCCCccceEEEEEecCCCcc Q lcl|Aclame:pro 364 RQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMD 443 (519) Q Consensus 364 ~~T~rg~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 443 (519) ....+.++-..||+|.....|...= + + .+...+..+.+... -++|.| ++||+.+..... ....-+ T Consensus 304 l~~~~~~~~~~~~~~~~~~~l~~lk--d-~---~g~~~~~~~~~~~~-~~~l~G-~pv~~~~~~~~~-------~~~~~~ 368 (437) T protein:vir:10 304 LKPQDSAAASIVMSQSAYNLFDMAT--D-A---MGRPLLQPNVTAAT-GYTLLG-KTVVIVDDKLFP-------SASAGD 368 (437) T ss_pred hhhhhhcCCEEEEcHHHHHHHHHhh--c-c---CCCeeeccCccCCC-Cccccc-ceeEEecccccC-------CcCCCc Confidence 1122323446799999988887641 0 0 00111122222111 146887 577764332110 000000 Q ss_pred ceeEeeccc--------ccccccc-cCcccccceeeeeeeeceee-cCcccccccCCcceeecCCchhhhcccchh Q lcl|Aclame:pro 444 AGIYYAPYV--------ALTPLRG-SDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSLGLNG 509 (519) Q Consensus 444 ~~~fyaPYv--------~~~~~~~-~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~i~~~~d~~a~~~~~~~ 509 (519) ..+||+.+- +...+.. -+-+.+...+.+..||+..+ +|-+- ..|.---+...-....++ T Consensus 369 ~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~-------~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 369 VNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLI-------VNLTGKLKAVTVVQSTAV 437 (437) T ss_pred eEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccce-------EEEEeeccccccCCCCCC Confidence 112222221 1111111 13345555666777887543 33221 111100011110111111 Done!