Query lcl|NC_014661.1_cdsid_YP_004009801.1 [gene=23] [protein=gp23 major head protein] [protein_id=YP_004009801.1] [location=113701..115275] Match_columns 524 No_of_seqs 171 out of 421 Neff 5.1 Searched_HMMs 1612 Date Thu Nov 7 15:04:17 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_184 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_184_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:6901 Length: 522 # 100.0 8E-259 5E-262 1435.4 38.7 521 2-524 1-522 (522) 2 protein:vir:103463 Length: 521 100.0 4E-254 2E-257 1410.1 37.3 520 3-524 1-521 (521) 3 protein:vir:7214 Length: 521 # 100.0 3E-253 2E-256 1404.9 37.8 520 3-524 1-521 (521) 4 protein:vir:107947 Length: 519 100.0 8E-253 5E-256 1402.7 39.1 517 6-524 1-519 (519) 5 protein:vir:80986 Length: 528 100.0 1E-252 7E-256 1401.8 37.6 517 5-524 1-528 (528) 6 protein:vir:106286 Length: 534 100.0 2E-249 1E-252 1384.1 39.0 517 6-524 1-534 (534) 7 protein:vir:101039 Length: 529 100.0 4E-249 2E-252 1382.6 37.8 518 3-524 1-529 (529) 8 protein:vir:101811 Length: 529 100.0 1E-248 9E-252 1379.3 38.4 518 3-524 1-529 (529) 9 protein:vir:100603 Length: 529 100.0 2E-248 1E-251 1378.5 37.2 518 3-524 1-529 (529) 10 protein:vir:6601 Length: 528 # 100.0 2E-247 1E-250 1372.8 37.3 517 5-524 1-528 (528) 11 protein:vir:98143 Length: 524 100.0 4E-247 2E-250 1371.4 37.9 517 5-524 1-524 (524) 12 protein:vir:5670 Length: 514 # 100.0 2E-243 1E-246 1351.3 36.1 508 9-524 1-514 (514) 13 protein:vir:104915 Length: 470 100.0 1E-228 9E-232 1269.7 36.3 461 3-524 1-469 (470) 14 protein:vir:106998 Length: 468 100.0 2E-226 1E-229 1257.5 35.9 458 5-524 1-467 (468) 15 protein:vir:104549 Length: 462 100.0 6E-223 4E-226 1238.8 36.3 453 1-524 1-461 (462) 16 protein:vir:103181 Length: 457 100.0 1E-218 9E-222 1214.9 35.8 448 1-524 1-456 (457) 17 protein:vir:5942 Length: 523 # 100.0 6E-197 4E-200 1096.3 33.0 450 1-504 1-523 (523) 18 protein:vir:191 Length: 385 # 96.9 0.00026 1.6E-07 40.3 18.5 350 1-510 1-385 (385) 19 protein:vir:1886 Length: 385 # 96.9 0.00026 1.6E-07 40.3 18.5 350 1-510 1-385 (385) 20 protein:vir:41 Length: 299 # N 96.2 0.00091 5.7E-07 37.2 17.3 277 74-511 1-299 (299) 21 protein:vir:81160 Length: 371 96.1 0.001 6.4E-07 37.0 17.1 336 1-509 1-371 (371) 22 protein:vir:1433 Length: 435 # 96.1 0.0011 6.6E-07 36.9 18.4 352 5-507 1-435 (435) 23 protein:vir:4339 Length: 395 # 95.7 0.0016 9.8E-07 35.9 19.8 353 1-509 1-395 (395) 24 protein:vir:10364 Length: 390 95.5 0.002 1.2E-06 35.4 20.4 345 1-507 15-390 (390) 25 protein:vir:4997 Length: 397 # 95.4 0.0021 1.3E-06 35.3 19.3 332 5-512 1-397 (397) 26 protein:vir:81227 Length: 413 95.2 0.0025 1.5E-06 34.9 20.0 358 1-514 1-413 (413) 27 protein:vir:4830 Length: 397 # 95.0 0.003 1.8E-06 34.4 18.4 334 5-510 1-397 (397) 28 protein:vir:78523 Length: 338 94.8 0.0036 2.2E-06 34.0 20.2 311 35-507 1-338 (338) 29 protein:vir:78223 Length: 333 94.6 0.0041 2.5E-06 33.7 13.1 309 146-504 1-333 (333) 30 protein:vir:1025 Length: 408 # 94.1 0.0032 2E-06 34.2 10.7 339 1-503 1-408 (408) 31 protein:vir:100135 Length: 418 93.5 0.0073 4.5E-06 32.3 19.1 356 1-508 27-418 (418) 32 protein:vir:3870 Length: 400 # 93.5 0.0074 4.6E-06 32.3 16.0 327 1-507 24-400 (400) 33 protein:vir:1268 Length: 397 # 93.2 0.0083 5.1E-06 32.0 17.3 340 1-511 5-397 (397) 34 protein:vir:8420 Length: 477 # 93.2 0.0084 5.2E-06 32.0 22.1 367 1-507 66-477 (477) 35 protein:vir:7855 Length: 497 # 92.5 0.011 6.9E-06 31.3 18.8 365 1-513 39-497 (497) 36 protein:vir:101650 Length: 497 92.5 0.011 6.9E-06 31.3 18.8 365 1-513 39-497 (497) 37 protein:vir:4953 Length: 397 # 92.5 0.011 7E-06 31.3 19.3 333 5-508 1-397 (397) 38 protein:vir:96762 Length: 632 92.4 0.011 7.1E-06 31.2 16.1 335 1-492 248-632 (632) 39 protein:vir:97148 Length: 324 92.0 0.013 8.3E-06 30.8 16.6 297 47-499 1-324 (324) 40 protein:vir:9820 Length: 272 # 91.8 0.014 8.8E-06 30.7 16.9 269 165-512 1-272 (272) 41 protein:vir:3033 Length: 272 # 91.8 0.014 8.8E-06 30.7 16.9 269 165-512 1-272 (272) 42 protein:vir:93742 Length: 274 91.6 0.015 9.3E-06 30.6 16.6 273 165-512 1-274 (274) 43 protein:vir:81070 Length: 390 91.6 0.015 9.3E-06 30.6 20.8 344 1-502 15-390 (390) 44 protein:vir:7771 Length: 330 # 90.7 0.019 1.2E-05 30.0 16.6 298 61-514 1-330 (330) 45 protein:vir:97053 Length: 390 90.6 0.02 1.2E-05 29.9 17.8 351 1-507 12-390 (390) 46 protein:vir:4856 Length: 293 # 90.2 0.022 1.4E-05 29.6 18.2 268 65-506 1-293 (293) 47 protein:vir:2344 Length: 397 # 89.9 0.024 1.5E-05 29.5 19.3 310 74-524 1-364 (397) 48 protein:vir:96123 Length: 274 89.8 0.024 1.5E-05 29.4 16.0 273 165-520 1-274 (274) 49 protein:vir:9574 Length: 300 # 89.1 0.028 1.7E-05 29.1 17.5 280 81-523 1-300 (300) 50 protein:vir:7409 Length: 408 # 87.0 0.041 2.6E-05 28.2 20.6 346 1-514 1-408 (408) 51 protein:vir:1638 Length: 298 # 86.4 0.046 2.9E-05 27.9 15.9 280 81-503 1-298 (298) 52 protein:vir:98339 Length: 415 85.9 0.049 3.1E-05 27.7 17.1 357 5-514 1-415 (415) 53 protein:vir:81100 Length: 415 85.9 0.049 3.1E-05 27.7 17.1 357 5-514 1-415 (415) 54 protein:vir:79987 Length: 415 85.9 0.049 3.1E-05 27.7 17.1 357 5-514 1-415 (415) 55 protein:vir:9704 Length: 394 # 85.8 0.05 3.1E-05 27.7 17.2 330 1-515 49-394 (394) 56 protein:vir:104085 Length: 320 85.7 0.051 3.2E-05 27.7 16.8 290 46-510 1-320 (320) 57 protein:vir:2504 Length: 305 # 85.6 0.052 3.2E-05 27.6 17.2 284 82-500 1-305 (305) 58 protein:vir:96392 Length: 324 85.0 0.056 3.5E-05 27.4 16.8 300 38-504 1-324 (324) 59 protein:vir:78830 Length: 324 85.0 0.056 3.5E-05 27.4 16.8 300 38-504 1-324 (324) 60 protein:vir:105004 Length: 392 84.4 0.06 3.8E-05 27.3 18.3 345 1-508 1-392 (392) 61 protein:vir:107593 Length: 392 84.4 0.06 3.8E-05 27.3 18.3 345 1-508 1-392 (392) 62 protein:vir:102082 Length: 392 84.4 0.06 3.8E-05 27.3 18.3 345 1-508 1-392 (392) 63 protein:vir:102873 Length: 392 84.4 0.06 3.8E-05 27.3 18.3 345 1-508 1-392 (392) 64 protein:vir:4700 Length: 415 # 83.4 0.069 4.3E-05 27.0 19.9 358 5-514 1-415 (415) 65 protein:vir:4600 Length: 415 # 83.4 0.069 4.3E-05 27.0 19.9 358 5-514 1-415 (415) 66 protein:vir:8187 Length: 311 # 82.7 0.075 4.6E-05 26.8 17.8 284 82-510 1-311 (311) 67 protein:vir:96262 Length: 274 81.9 0.082 5.1E-05 26.6 16.4 270 165-512 1-274 (274) 68 protein:vir:95898 Length: 274 81.9 0.082 5.1E-05 26.6 16.4 270 165-512 1-274 (274) 69 protein:vir:104256 Length: 458 81.5 0.085 5.3E-05 26.4 17.0 349 1-509 73-458 (458) 70 protein:vir:9410 Length: 415 # 81.0 0.09 5.6E-05 26.3 20.1 361 5-514 1-415 (415) 71 protein:vir:105905 Length: 304 79.4 0.1 6.5E-05 26.0 15.4 280 61-503 1-304 (304) 72 protein:vir:94142 Length: 304 79.4 0.1 6.5E-05 26.0 15.4 280 61-503 1-304 (304) 73 protein:vir:98635 Length: 377 79.2 0.11 6.6E-05 25.9 15.7 358 1-502 1-377 (377) 74 protein:vir:9759 Length: 303 # 79.1 0.11 6.7E-05 25.9 16.5 284 81-504 1-303 (303) 75 protein:vir:4226 Length: 326 # 78.4 0.11 7.1E-05 25.7 16.3 302 38-499 1-326 (326) 76 protein:vir:100247 Length: 425 77.9 0.12 7.4E-05 25.6 17.6 338 1-501 61-425 (425) 77 protein:vir:95763 Length: 297 76.7 0.13 8.2E-05 25.4 16.2 275 71-496 1-297 (297) 78 protein:vir:94424 Length: 387 74.7 0.16 9.6E-05 25.0 14.0 333 5-524 1-381 (387) 79 protein:vir:96978 Length: 387 74.7 0.16 9.6E-05 25.0 14.0 333 5-524 1-381 (387) 80 protein:vir:2685 Length: 387 # 74.7 0.16 9.6E-05 25.0 14.0 333 5-524 1-381 (387) 81 protein:vir:99920 Length: 311 73.7 0.17 0.0001 24.8 18.2 283 82-507 1-311 (311) 82 protein:vir:97433 Length: 274 72.6 0.18 0.00011 24.7 16.7 272 165-512 1-274 (274) 83 protein:vir:94494 Length: 274 72.6 0.18 0.00011 24.7 16.7 272 165-512 1-274 (274) 84 protein:vir:96223 Length: 324 71.3 0.2 0.00012 24.4 17.2 302 38-513 1-324 (324) 85 protein:vir:102119 Length: 404 69.2 0.23 0.00014 24.1 21.9 350 1-494 1-404 (404) 86 protein:vir:80684 Length: 315 68.1 0.24 0.00015 24.0 17.1 282 81-513 1-315 (315) 87 protein:vir:94673 Length: 419 65.4 0.28 0.00018 23.6 18.4 368 1-513 10-419 (419) 88 protein:vir:9643 Length: 377 # 64.7 0.3 0.00018 23.5 15.6 357 1-502 1-377 (377) 89 protein:vir:9309 Length: 324 # 63.6 0.31 0.00019 23.3 19.4 304 38-513 1-324 (324) 90 protein:vir:1383 Length: 421 # 62.3 0.34 0.00021 23.2 16.9 358 1-524 3-413 (421) 91 protein:vir:101607 Length: 379 61.3 0.36 0.00022 23.0 20.3 344 1-506 13-379 (379) 92 protein:vir:8102 Length: 543 # 59.4 0.39 0.00024 22.8 15.8 335 1-510 173-543 (543) 93 protein:vir:5739 Length: 366 # 57.7 0.43 0.00027 22.6 15.9 328 1-505 1-366 (366) 94 protein:vir:4456 Length: 401 # 57.5 0.43 0.00027 22.6 13.9 347 1-500 1-401 (401) 95 protein:vir:2430 Length: 318 # 57.1 0.44 0.00027 22.5 17.1 287 46-511 1-318 (318) 96 protein:vir:103955 Length: 324 52.3 0.56 0.00035 22.0 18.6 300 5-510 1-324 (324) 97 protein:vir:108211 Length: 318 51.4 0.58 0.00036 21.9 14.1 305 119-524 1-315 (318) 98 protein:vir:3991 Length: 404 # 50.0 0.62 0.00039 21.7 23.0 350 1-512 1-404 (404) 99 protein:vir:80376 Length: 435 44.7 0.8 0.00049 21.1 20.9 354 1-507 41-435 (435) 100 protein:vir:1239 Length: 274 # 44.1 0.82 0.00051 21.1 16.9 271 165-512 1-274 (274) 101 protein:vir:3364 Length: 347 # 42.8 0.87 0.00054 20.9 13.2 315 119-524 1-345 (347) 102 protein:vir:9361 Length: 402 # 42.0 0.9 0.00056 20.8 15.2 340 1-524 12-396 (402) 103 protein:vir:3845 Length: 395 # 41.6 0.92 0.00057 20.8 19.2 337 5-512 1-395 (395) 104 protein:vir:4092 Length: 390 # 40.0 0.99 0.00061 20.6 18.5 343 5-514 1-390 (390) 105 protein:vir:485 Length: 407 # 38.1 1.1 0.00067 20.4 17.1 345 1-524 14-400 (407) 106 protein:vir:99749 Length: 324 34.8 1.3 0.00079 20.0 18.7 298 5-499 1-324 (324) 107 protein:vir:105038 Length: 428 34.2 1.3 0.00081 19.9 19.4 338 1-505 49-428 (428) 108 protein:vir:105334 Length: 276 33.6 1.3 0.00084 19.9 18.5 270 82-514 1-276 (276) 109 protein:vir:94771 Length: 298 32.0 1.5 0.0009 19.7 17.8 280 81-503 1-298 (298) 110 protein:vir:739 Length: 231 # 31.1 1.5 0.00095 19.6 14.2 219 187-524 1-231 (231) 111 protein:vir:6242 Length: 390 # 30.3 1.6 0.00099 19.5 17.7 346 1-524 27-389 (390) 112 protein:vir:79078 Length: 307 29.7 1.6 0.001 19.4 9.3 296 149-524 1-305 (307) 113 protein:vir:1781 Length: 221 # 27.7 1.8 0.0011 19.2 14.8 203 255-498 1-221 (221) 114 protein:vir:93881 Length: 387 27.4 1.8 0.0011 19.1 15.7 333 5-524 1-381 (387) 115 protein:vir:1084 Length: 437 # 24.3 2.2 0.0014 18.7 16.1 340 1-514 48-437 (437) 116 protein:vir:2201 Length: 345 # 23.8 2.3 0.0014 18.6 15.1 308 119-520 1-345 (345) 117 protein:vir:96833 Length: 275 22.8 2.4 0.0015 18.5 14.9 268 163-498 1-275 (275) 118 protein:vir:3613 Length: 272 # 20.5 2.8 0.0017 18.2 15.9 266 165-524 1-272 (272) No 1 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=8.4e-259 Score=1435.43 Aligned_cols=521 Identities=81% Similarity=1.237 Sum_probs=502.1 Q ss_pred CcccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhh Q lcl|NC_014661. 2 STQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNI 81 (524) Q Consensus 2 ~~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i 81 (524) -+|+|++|+|+|||+|||||||+|||+++||+|+|+|||||||+|+|+|+|+|++++++||.||.||+++|+|||++++| T Consensus 1 ~~~~~~~e~l~~kw~p~l~~~~~~~~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i 80 (522) T protein:vir:69 1 MTTIKTKAQLVDKWKELLEGEGLPEIANSKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNI 80 (522) T ss_pred CCccchHHHHHHhhHHHhcCCCCCccccchhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCcccc Confidence 35899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGS 161 (524) Q Consensus 82 ~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~ 161 (524) +||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+||+|||+|++|||++. T Consensus 81 ~es~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~ 160 (522) T protein:vir:69 81 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGA 160 (522) T ss_pred cccccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccccccCCCCC-cccccccccccccccccccccccccchhhhccccc Q lcl|NC_014661. 162 HEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTAD-AAELDAEVIKQMDAGILVEIAEGMATSIAELQEGF 240 (524) Q Consensus 162 ~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~-~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~l 240 (524) ..........+..+.++.+++.+...++++....+..+..++.+ ....+.++......+..|+++.||+|+.+|+++.| T Consensus 161 ~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~l 240 (522) T protein:vir:69 161 AKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGF 240 (522) T ss_pred cccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccC Confidence 99999999889899999999999999999988887776665544 33456677788889999999999999999999999 Q ss_pred CCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhh Q lcl|NC_014661. 241 NGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKT 320 (524) Q Consensus 241 Ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~ 320 (524) |++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+.+|+++++ T Consensus 241 ggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~ 320 (522) T protein:vir:69 241 NGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKS 320 (522) T ss_pred CCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccc Q lcl|NC_014661. 321 GQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQG 400 (524) Q Consensus 321 ~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~ 400 (524) +++.+.++++|+|||+++.|+.++||++||||+|++|||+|||+|+|+|+||+|||||||++||++|+|+|++++++++| T Consensus 321 g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~ 400 (522) T protein:vir:69 321 GMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQG 400 (522) T ss_pred ccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhccccccccccc Confidence 99988999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeece Q lcl|NC_014661. 401 LARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGI 480 (524) Q Consensus 401 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l 480 (524) ...|+++|+++++|+|+|+|||+||||||+++|||+|||||++|+|+||||||||||+|+|++||+||||+||||||||| T Consensus 401 ~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l 480 (522) T protein:vir:69 401 LASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGI 480 (522) T ss_pred ccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeece Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 481 GINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 481 ~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) ++|||++..++.+. .||+||||..++..|+|.|||||+|||| T Consensus 481 ~vNP~~~~~~~~~~--~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 481 GVNPFAESSLQAPG--ARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eecCcccccCCccc--ceeecccchhhcccCCcceeeEEEeecC Confidence 99999998877654 5899999999999999999999999999 No 2 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=3.6e-254 Score=1410.06 Aligned_cols=520 Identities=81% Similarity=1.238 Sum_probs=498.6 Q ss_pred cccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhc Q lcl|NC_014661. 3 TQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIA 82 (524) Q Consensus 3 ~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~ 82 (524) -|||++|+|+|||+|||||||+|||+++||+|+|+|||||||+++|+|+||+++++++||.+|.||+++++|||++.+|+ T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~ 80 (521) T protein:vir:10 1 MTIKTKAELLNKWKPLLEGEGLPEIANSKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIA 80 (521) T ss_pred CCcchhHHHHHhhhhhhccCCCCccccchhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCcccccccccc Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccc Q lcl|NC_014661. 83 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSH 162 (524) Q Consensus 83 est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~ 162 (524) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++|+++++++|+||.++++|+.|||+++. T Consensus 81 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~a 160 (521) T protein:vir:10 81 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAA 160 (521) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999989999999999999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccCCCCC-cccccccccccccccccccccccccchhhhcccccC Q lcl|NC_014661. 163 EVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTAD-AAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFN 241 (524) Q Consensus 163 ~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~-~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lG 241 (524) ...+....++....++.+++.+..+++++......++..++.+ ....+.........+..|++++||+|+++|+|+.|| T Consensus 161 t~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g 240 (521) T protein:vir:10 161 KKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFN 240 (521) T ss_pred cccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccCC Confidence 9999999999999999999999999988887777666655543 344667788889999999999999999999999999 Q ss_pred CCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhc Q lcl|NC_014661. 242 GSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTG 321 (524) Q Consensus 242 gs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~ 321 (524) ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|++|+++ T Consensus 241 ~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g 320 (521) T protein:vir:10 241 GSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSG 320 (521) T ss_pred CCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccc Q lcl|NC_014661. 322 QTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGL 401 (524) Q Consensus 322 ~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~ 401 (524) ++.++++++|+|||++++|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|++++++++|. T Consensus 321 ~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~ 400 (521) T protein:vir:10 321 MTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGL 400 (521) T ss_pred eeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee Q lcl|NC_014661. 402 ARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG 481 (524) Q Consensus 402 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~ 481 (524) ..|+++|+|+++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|+|++||+||||+|||||||||+ T Consensus 401 ~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 480 (521) T protein:vir:10 401 ATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG 480 (521) T ss_pred cccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeecee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 482 INPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 482 ~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) +|||++..+|.+++. |.+|++......++|.|||||+|||| T Consensus 481 ~NP~~~~~~~~~~~~--i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 481 INPFAESAAQAPASR--IQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred ecCcccccCCcccee--ecccchhhhccccccceeeeeeecCC Confidence 999999999999865 44554444447789999999999999 No 3 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=3.1e-253 Score=1404.93 Aligned_cols=520 Identities=80% Similarity=1.227 Sum_probs=497.8 Q ss_pred cccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhc Q lcl|NC_014661. 3 TQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIA 82 (524) Q Consensus 3 ~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~ 82 (524) -|||++|+|+|||+|||||||+|||+++||+|+|+|||||||+++|+|+||+++++++|+.+|.|++++++|||++.+|+ T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia 80 (521) T protein:vir:72 1 MTIKTKAELLNKWKPLLEGEGLPEIANSKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIA 80 (521) T ss_pred CCcchhHHHHHhhhhhhccCCCCccccchhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCccccc Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccc Q lcl|NC_014661. 83 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSH 162 (524) Q Consensus 83 est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~ 162 (524) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+||.++++|++|||+++. T Consensus 81 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~ 160 (521) T protein:vir:72 81 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAA 160 (521) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccc-ccCCCCCcccccccccccccccccccccccccchhhhcccccC Q lcl|NC_014661. 163 EVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAV-TLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFN 241 (524) Q Consensus 163 ~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~-~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lG 241 (524) ........+...+.|+.+++.+...++++....... ..+++++....++.+......+..|+++.||+|+.+|+++.+| T Consensus 161 ~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g 240 (521) T protein:vir:72 161 KKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFN 240 (521) T ss_pred ccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccC Confidence 888888899999999999999998888877655444 3445566667788888899999999999999999999999999 Q ss_pred CCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhc Q lcl|NC_014661. 242 GSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTG 321 (524) Q Consensus 242 gs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~ 321 (524) ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+.+|++|+++ T Consensus 241 ~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g 320 (521) T protein:vir:72 241 GSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSG 320 (521) T ss_pred CcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccc Q lcl|NC_014661. 322 QTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGL 401 (524) Q Consensus 322 ~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~ 401 (524) ++.++++++|+|||++++|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|.+++++++|. T Consensus 321 ~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~ 400 (521) T protein:vir:72 321 MTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGL 400 (521) T ss_pred eeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee Q lcl|NC_014661. 402 ARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG 481 (524) Q Consensus 402 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~ 481 (524) ..|+++|+|+++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|+|++||+||||+|||||||||+ T Consensus 401 ~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 480 (521) T protein:vir:72 401 ATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG 480 (521) T ss_pred cccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeecee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 482 INPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 482 ~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) +|||++..+|.+++. |.+|++..++..++|.|||||+|||| T Consensus 481 ~NP~~~~~~~~~a~~--i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 481 INPFAESAAQAPASR--IQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred ecCcccccCccccee--ecCcChhhhcCccccceeeeeeecCC Confidence 999999999999865 44554444447788999999999999 No 4 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=7.9e-253 Score=1402.70 Aligned_cols=517 Identities=80% Similarity=1.217 Sum_probs=497.4 Q ss_pred chHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccc Q lcl|NC_014661. 6 KTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAG 84 (524) Q Consensus 6 ~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~es 84 (524) .++|+|+|||+|||||||+|+|++ +||+|+++||||||+||.|+++|+++.++++|+.||.|++++++|||+++.|++| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~ 80 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccc Confidence 456889999999999999999987 8999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccc Q lcl|NC_014661. 85 QTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEV 164 (524) Q Consensus 85 t~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~ 164 (524) ++|++|++|+|+||+|+||++|||||+||||||||||||||||||||||+++++.++++|+||+|||+|++|||+++... T Consensus 81 ~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~ 160 (519) T protein:vir:10 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccccc Confidence 99999999999999999999999999999999999999999999999999999888999999999999999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccc-cCCCCCcccccccccccccccccccccccccchhhhcccccCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVT-LATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGS 243 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~-~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs 243 (524) ..+++.+.+...++++++.+..+++++.......+ .+++++....+.........+..|++++||+|+.+|+++.||++ T Consensus 161 ~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggs 240 (519) T protein:vir:10 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) T ss_pred cccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccCCCc Confidence 99999999999999999999999998877765443 34445555677778888999999999999999999999999999 Q ss_pred CCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccc Q lcl|NC_014661. 244 NNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQT 323 (524) Q Consensus 244 ~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~ 323 (524) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|++++...+ T Consensus 241 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 320 (519) T protein:vir:10 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) T ss_pred cccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccccc Q lcl|NC_014661. 324 LTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLAR 403 (524) Q Consensus 324 ~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~ 403 (524) .+.++.+|+|||++++|++++||++||||+||+|||||||+|+|+|+||+||||||||+||++|+|+|+++++++++... T Consensus 321 ~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~ 400 (519) T protein:vir:10 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) T ss_pred cCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceeeC Q lcl|NC_014661. 404 GLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGIN 483 (524) Q Consensus 404 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~n 483 (524) ++++|+++++|+|+|+|||+||||||+++|||+|||||++|+|+||||||||||++++++||+||||+|||||||||++| T Consensus 401 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~N 480 (519) T protein:vir:10 401 GFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGIN 480 (519) T ss_pred cccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeeeeeeeceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 484 PLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 484 P~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) ||++..++.+. .||+||||++|++++||+|||||+|||| T Consensus 481 P~~~~~~~~~~--~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 481 PFADPAAQAPT--KRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred CcccccccCcc--ceeccCchhhhccccCceeeeeeeeecC Confidence 99998887764 5789999999999999999999999999 No 5 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=1.2e-252 Score=1401.77 Aligned_cols=517 Identities=70% Similarity=1.104 Sum_probs=479.8 Q ss_pred cchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhcc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAA 83 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~e 83 (524) ||++|+|+|||+|||||||+|+|++ +||+|+|+|||||||||+|+|.|+|++++++||.||.||+++|+|||++++|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCccccccc Confidence 9999999999999999999999987 899999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccc- Q lcl|NC_014661. 84 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSH- 162 (524) Q Consensus 84 st~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~- 162 (524) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+||+|+++|+.||+..+. T Consensus 81 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~ 160 (528) T protein:vir:80 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKG 160 (528) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999987553 Q ss_pred --------ccccccccccccccccccccccccccccccccccccccC-CCCCcccccccccccccccccccccccccchh Q lcl|NC_014661. 163 --------EVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLA-TTADAAELDAEVIKQMDAGILVEIAEGMATSI 233 (524) Q Consensus 163 --------~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a-~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~ 233 (524) ..++....+.....++++++.+..++............. ..+.+...+.........+..|+++.||+|+. T Consensus 161 ~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:80 161 AAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSI 240 (528) T ss_pred cccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchhh Confidence 223344455666677777777666665555444322222 22223333444556677888999999999999 Q ss_pred hhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhh Q lcl|NC_014661. 234 AELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINY 313 (524) Q Consensus 234 aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~ 313 (524) +|.+++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||||||||||++|+. T Consensus 241 AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~ 320 (528) T protein:vir:80 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcc Q lcl|NC_014661. 314 SAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTS 393 (524) Q Consensus 314 ~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~ 393 (524) +|++||++++.++++++|+|||++++|++|+||++|+||+|++|||+|||+|+|+|+||+|||||||++||++|+|+|++ T Consensus 321 ~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:80 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceee Q lcl|NC_014661. 394 VTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLG 473 (524) Q Consensus 394 ~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 473 (524) +++++.|...++++|+|+++|+|+|+|||+||||||+++|||+|||||++|+|+||||||||||+|++++||+||||+|| T Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g 480 (528) T protein:vir:80 401 ISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) T ss_pred ccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 474 FKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 474 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||||||++|||+++.++++. .|++||+|| .+.+|+|.|||||+|||| T Consensus 481 ~~tRY~l~~NP~~~~~~~~~~--~r~~~g~~~-~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 481 FKTRYGIGINPFADSKSQAPS--ARITSGMLS-KDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeceeecCcccccCCccc--ccccccchh-hhhcCccceeEEeeeccC Confidence 999999999999999988754 589999998 578999999999999999 No 6 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=2e-249 Score=1384.07 Aligned_cols=517 Identities=60% Similarity=0.977 Sum_probs=476.7 Q ss_pred chHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhh--hccccchhhhccccc--------cccccccccc Q lcl|NC_014661. 6 KTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSD--AAYRDEKLAEAFGGF--------LTEAEIGGDH 74 (524) Q Consensus 6 ~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~--~~~~~~~~~~~~~~~--------l~ea~~~~~~ 74 (524) .++|+|+|||+|||||||+|+|.+ +||+|+|+|||||||+|+|+ +.|+|+.++++||.| |.||+++++| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~ 80 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDH 80 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccccc Confidence 455889999999999999999987 89999999999999999886 899999999999988 9999999999 Q ss_pred ccchhhhccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccc Q lcl|NC_014661. 75 GYDPQNIAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDA 154 (524) Q Consensus 75 g~~~~~i~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt 154 (524) |||+++|+||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.+++...++.||||..+.+|+ T Consensus 81 g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt 160 (534) T protein:vir:10 81 GYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDA 160 (534) T ss_pred ccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999888889999965555999 Q ss_pred cccccccccccccccccccccccccccccccc-----ccccccccccccccC-CCCCccccccccccccccccccccccc Q lcl|NC_014661. 155 MFSGRGSHEVFAPLASGTVVAQGTIYKHEFVA-----TGTAFLQATGAVTLA-TTADAAELDAEVIKQMDAGILVEIAEG 228 (524) Q Consensus 155 ~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~-----tg~~~~~~~g~~~~a-~~~~~~~~d~~~~~~~~~g~~~~~g~G 228 (524) +|||+++......+..+.+...++.+++.... +++...+......+. ..++....+.........+..|+++.| T Consensus 161 ~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~g 240 (534) T protein:vir:10 161 DFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSA 240 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccc Confidence 99999998888888888888888777765432 222222222221111 122233334445556677888999999 Q ss_pred ccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_014661. 229 MATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVI 308 (524) Q Consensus 229 m~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii 308 (524) |+|+.+|+++.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||| T Consensus 241 m~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii 320 (534) T protein:vir:10 241 MATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMV 320 (534) T ss_pred cchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHh Q lcl|NC_014661. 309 DWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLA 388 (524) Q Consensus 309 ~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~ 388 (524) |+||+||+|||+.++...++++|+|||.++.|+.++||++|+||+|+++||||||+|+|+|+||+||||||||+||++|+ T Consensus 321 ~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~ 400 (534) T protein:vir:10 321 LWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALG 400 (534) T ss_pred HHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHh Confidence 99999999999999988889999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccc Q lcl|NC_014661. 389 SVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNF 468 (524) Q Consensus 389 ~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~ 468 (524) |+|+++++|+++...++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+|| T Consensus 401 ~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sf 480 (534) T protein:vir:10 401 HTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNF 480 (534) T ss_pred hccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeeeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 469 QPVLGFKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 469 qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) ||+|||||||||++|||++..++++. .||.||||++++.+|+|.|||||+|||| T Consensus 481 qP~~g~~tRY~l~~NP~~~~~~~~~~--~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 481 QPVLGFKTRYGVKLHPMADATQNKGF--AKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cceeeeeeeeceeecCcccccCCccc--cccccCCcchhhhcccccceeeeeeecC Confidence 99999999999999999999988875 5899999999999999999999999999 No 7 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=3.7e-249 Score=1382.60 Aligned_cols=518 Identities=71% Similarity=1.133 Sum_probs=476.8 Q ss_pred cccchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhh Q lcl|NC_014661. 3 TQIKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNI 81 (524) Q Consensus 3 ~~~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i 81 (524) -||++ |+|+|||+|||||||+|+|++ +||+|+|+|||||||+++|++.||++.++++++.+|+|++++|+|||++.+| T Consensus 1 ~~~~~-~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKN-KEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred CcccH-HHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhcccccccccccc Confidence 46666 569999999999999999986 8999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGS 161 (524) Q Consensus 82 ~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~ 161 (524) +|||+|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+||+++.|++.|||.+. T Consensus 80 ~est~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLAT 159 (529) T ss_pred ccccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999865 Q ss_pred cccc--------ccccccccccccccccccccccccccccccccc--ccCCCCCcccccccccccccccccccccccccc Q lcl|NC_014661. 162 HEVF--------APLASGTVVAQGTIYKHEFVATGTAFLQATGAV--TLATTADAAELDAEVIKQMDAGILVEIAEGMAT 231 (524) Q Consensus 162 ~~~~--------~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~--~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~T 231 (524) .... ..+.....++.++...+.|...++++....... ..+........+.........+..|++++||+| T Consensus 160 ~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~T 239 (529) T protein:vir:10 160 KGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMAT 239 (529) T ss_pred cccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccch Confidence 4332 223334455556666666667777766543222 112222222334445667778899999999999 Q ss_pred hhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhH Q lcl|NC_014661. 232 SIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWI 311 (524) Q Consensus 232 s~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l 311 (524) +++|+|+.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+||||||||||||+| T Consensus 240 a~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l 319 (529) T protein:vir:10 240 SIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCC Q lcl|NC_014661. 312 NYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVD 391 (524) Q Consensus 312 ~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~ 391 (524) |+||+|||+.|+...++.+|+|||++++|++++||++|+||+|++|||||||+|+|+|+||+|||||||++||++|+|+| T Consensus 320 ~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALID 399 (529) T ss_pred hhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccce Q lcl|NC_014661. 392 TSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPV 471 (524) Q Consensus 392 ~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~ 471 (524) +++++++++...|+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+ T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 TNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPV 479 (529) T ss_pred hhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 472 LGFKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 472 ~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||||||||++|||+++.++++. .|++||+|| .+.+|+|.|||||+|||| T Consensus 480 ~g~~tRY~l~~NP~~~~~~~~~~--~r~~~g~~~-~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 480 MGFKTRYAIGVNPFAESRTQAPQ--GRITSGMPG-VNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeeceeecCcccccccccc--ccccCCcch-hhhcCccceeEEeeeccC Confidence 99999999999999999988754 488999997 678999999999999999 No 8 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=1.4e-248 Score=1379.33 Aligned_cols=518 Identities=71% Similarity=1.133 Sum_probs=477.4 Q ss_pred cccchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhh Q lcl|NC_014661. 3 TQIKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNI 81 (524) Q Consensus 3 ~~~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i 81 (524) -+|++ |+|+|||+|||||||+|+|++ +||+|+|+|||||||+++|+|.||++.++++++.+|.|++++|+|||++.+| T Consensus 1 ~~~~~-~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKN-KEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred Cccch-HHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccc Confidence 34566 579999999999999999986 8999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGS 161 (524) Q Consensus 82 ~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~ 161 (524) +|||+|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+||+.+.|++.|||++. T Consensus 80 ~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLAT 159 (529) T ss_pred ccccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999865 Q ss_pred ccc--------cccccccccccccccccccccccccccccccccc-cc-CCCCCcccccccccccccccccccccccccc Q lcl|NC_014661. 162 HEV--------FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAV-TL-ATTADAAELDAEVIKQMDAGILVEIAEGMAT 231 (524) Q Consensus 162 ~~~--------~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~-~~-a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~T 231 (524) ... ...+..+..++.++...+.+...++++....... .. +.+......+.........+..|++++||+| T Consensus 160 ~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsT 239 (529) T protein:vir:10 160 KGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhh Confidence 433 2233344555666666677777777766543222 22 2222222344556667788899999999999 Q ss_pred hhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhH Q lcl|NC_014661. 232 SIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWI 311 (524) Q Consensus 232 s~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l 311 (524) +.+|+|+.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+||||||||||||+| T Consensus 240 a~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l 319 (529) T protein:vir:10 240 SIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCC Q lcl|NC_014661. 312 NYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVD 391 (524) Q Consensus 312 ~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~ 391 (524) |++|+|||+.|+.+.++.+|+|||++++|++++||++|+||+|++|||||||+|+|+|+||+|||||||++||++|+|+| T Consensus 320 ~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALID 399 (529) T ss_pred hhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccce Q lcl|NC_014661. 392 TSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPV 471 (524) Q Consensus 392 ~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~ 471 (524) ++++++..+...|+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+ T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 TNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPV 479 (529) T ss_pred ccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 472 LGFKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 472 ~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||||||||++|||+++.++++. .|++||+|| .+.+|+|.|||||+|||| T Consensus 480 ~g~~tRY~l~~NP~~~~~~~~~~--~r~~~g~~~-~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 480 MGFKTRYAIGVNPFAESRTQAPQ--GRITSGMPG-VNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeeceeecCcccccccccc--ccccCCcch-hhhcCccceeEEeeeccC Confidence 99999999999999999888754 488999997 678999999999999999 No 9 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=2.1e-248 Score=1378.48 Aligned_cols=518 Identities=72% Similarity=1.147 Sum_probs=476.4 Q ss_pred cccchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhh Q lcl|NC_014661. 3 TQIKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNI 81 (524) Q Consensus 3 ~~~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i 81 (524) -+|++ |+|+|||+|||||||+|+|++ +||+|+|+|||||||+|+|+|.||+..++++++.+|.|++++|+|||++.+| T Consensus 1 ~~~~~-~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKT-KEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred Cccch-HHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccc Confidence 35666 579999999999999999986 8999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGS 161 (524) Q Consensus 82 ~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~ 161 (524) +||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++.+.+.+.|+||+|+|||++|||.+. T Consensus 80 a~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAA 159 (529) T ss_pred cccccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999865 Q ss_pred ccc--------cccccccccccccccccccccccccccccccc-ccc-cCCCCCcccccccccccccccccccccccccc Q lcl|NC_014661. 162 HEV--------FAPLASGTVVAQGTIYKHEFVATGTAFLQATG-AVT-LATTADAAELDAEVIKQMDAGILVEIAEGMAT 231 (524) Q Consensus 162 ~~~--------~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g-~~~-~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~T 231 (524) ... ......+.....++...+.+...++.+..... ..+ .+.+..+...+.........+..|++++||+| T Consensus 160 ~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsT 239 (529) T protein:vir:10 160 KGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccch Confidence 432 33333444455555555566555555543322 222 22222333455566677888889999999999 Q ss_pred hhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhH Q lcl|NC_014661. 232 SIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWI 311 (524) Q Consensus 232 s~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l 311 (524) +.+|+|+.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+| T Consensus 240 a~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i 319 (529) T protein:vir:10 240 SIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCC Q lcl|NC_014661. 312 NYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVD 391 (524) Q Consensus 312 ~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~ 391 (524) +.+|++++.+++.+.++.+|+|||+++.|++++||++|+||+|++|||||||+|+|+|+||+|||||||++||++|+|++ T Consensus 320 ~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVD 399 (529) T ss_pred hhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccce Q lcl|NC_014661. 392 TSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPV 471 (524) Q Consensus 392 ~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~ 471 (524) .++++++++...|+++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+ T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 AGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPV 479 (529) T ss_pred cccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 472 LGFKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 472 ~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||||||||++|||++..++++. .|++||+|| ++.+++|.|||||+|||| T Consensus 480 ~g~~tRY~l~~NP~~~~~~~~~~--~r~~~g~~~-~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 480 MGFKTRYAIGVNPFAESRTQAPT--SRISNGMPG-AHSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeeceeecCcccccccccc--ccccCCcch-hhhcCccceeeEeeeccC Confidence 99999999999999999999754 588999996 789999999999999999 No 10 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=2.3e-247 Score=1372.77 Aligned_cols=517 Identities=68% Similarity=1.088 Sum_probs=468.3 Q ss_pred cchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhcc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAA 83 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~e 83 (524) ||++|+|+|||+|||||||+|+|++ +||+|+|+|||||||+|+|+|.|+++++++|||.+|.||+++|+|||++.+|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhccc Confidence 9999999999999999999999987 899999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccc Q lcl|NC_014661. 84 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHE 163 (524) Q Consensus 84 st~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~ 163 (524) |++|++|++|||+||+|||||+|||||+|||||||||||||||||||++|+++++.++++++||+.+.+++.||+..+.. T Consensus 81 s~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~ 160 (528) T protein:vir:66 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKE 160 (528) T ss_pred cccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999875543 Q ss_pred cccccc---------cccccccccccccccccccccccccccccccCC-CCCcccccccccccccccccccccccccchh Q lcl|NC_014661. 164 VFAPLA---------SGTVVAQGTIYKHEFVATGTAFLQATGAVTLAT-TADAAELDAEVIKQMDAGILVEIAEGMATSI 233 (524) Q Consensus 164 ~~~~~~---------~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~-~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~ 233 (524) .....+ .......++.+.++...++.............. .......+.........+..|+++.||+|+. T Consensus 161 a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:66 161 ATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSI 240 (528) T ss_pred ccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchhh Confidence 322211 222333444444444444433322222111111 1111112233344556677899999999999 Q ss_pred hhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhh Q lcl|NC_014661. 234 AELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINY 313 (524) Q Consensus 234 aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~ 313 (524) +|+++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||||||||||++|+. T Consensus 241 aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~ 320 (528) T protein:vir:66 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcc Q lcl|NC_014661. 314 SAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTS 393 (524) Q Consensus 314 ~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~ 393 (524) +|++||++++.++++++|+|||++++|++|+||++|+||+|++|||+|||+|+|+|+||+|||||||++||++|+|+|++ T Consensus 321 ~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:66 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceee Q lcl|NC_014661. 394 VTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLG 473 (524) Q Consensus 394 ~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 473 (524) +++++.+...++++|+++++|+|+|+|||+||||||+++|||+|||||++|+|+||||||||||+|++++||+||||+|| T Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g 480 (528) T protein:vir:66 401 ISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) T ss_pred ccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 474 FKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 474 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||||||++|||+++.+|++. .|++||+|| .+.+|+|.|||||+|||| T Consensus 481 ~~tRY~l~vNP~~~~~~~~~~--~ri~~g~~~-~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 481 FKTRYGIGINPFADSKSQEPS--ARITSGMLS-KDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeceeecCcccccCcccc--ccccccchh-hhhcCccceeEEeeeccC Confidence 999999999999998877654 589999998 578899999999999999 No 11 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=4e-247 Score=1371.41 Aligned_cols=517 Identities=71% Similarity=1.143 Sum_probs=480.9 Q ss_pred cchHHHHHHhhhhhhcc-CCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA-EGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIA 82 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~-~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~ 82 (524) |++||+|+|||+||||+ ||+|+|++ +||+|+|+||||||||++++|.|+|+++++|||.||.||+++|+|||++.+|+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 80 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccccccc Confidence 88889999999999996 89999988 89999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcc----cccccccccccccccc Q lcl|NC_014661. 83 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGA----KEAFHPMYAPDAMFSG 158 (524) Q Consensus 83 est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g----~eAf~~fnEadt~FSG 158 (524) ||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++..++. .|||++|+++|+.||| T Consensus 81 ~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG 160 (524) T protein:vir:98 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) T ss_pred ccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCC Confidence 999999999999999999999999999999999999999999999999999998765432 7899999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCC-CCCcccccccccccccccccccccccccchhhhcc Q lcl|NC_014661. 159 RGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLAT-TADAAELDAEVIKQMDAGILVEIAEGMATSIAELQ 237 (524) Q Consensus 159 ~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~-~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l 237 (524) +++..+.++.+.+..+..++...+.+...+..+........... .++....+.........+..|+++.||+|+.+|+| T Consensus 161 ~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL 240 (524) T protein:vir:98 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) T ss_pred ccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhhhh Confidence 99999999999999999988888888888887776655443322 23334445555666778889999999999999999 Q ss_pred cccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhh Q lcl|NC_014661. 238 EGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQV 317 (524) Q Consensus 238 ~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~ 317 (524) +.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|++ T Consensus 241 ~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~ 320 (524) T protein:vir:98 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) T ss_pred ccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccc Q lcl|NC_014661. 318 GKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPA 397 (524) Q Consensus 318 ~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~ 397 (524) ++.+++..+++++|+|||+++.|..++||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+|++....++ T Consensus 321 ~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~ 400 (524) T protein:vir:98 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) T ss_pred ceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999976665655 Q ss_pred ccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeee Q lcl|NC_014661. 398 AQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTR 477 (524) Q Consensus 398 a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tR 477 (524) +++....++.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|+|++||+||||+|||||| T Consensus 401 s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tR 480 (524) T protein:vir:98 401 SQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 480 (524) T ss_pred cchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeeeee Confidence 55555668889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 478 YGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 478 Y~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||++|||+++.+++++ .|++||+|| .+.+|+|.|||||+|||| T Consensus 481 Y~l~~NP~~~~~~~~~~--~ri~~g~~~-~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 481 YGIGINPFANSRSQAPA--DRITSGMIS-KEMCGKNAYFRKVWVKGL 524 (524) T ss_pred eceeecCcccccCCccc--cccccCcch-HhhcCccceeeEeeeccC Confidence 99999999999998775 488999998 467899999999999999 No 12 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=1.9e-243 Score=1351.32 Aligned_cols=508 Identities=61% Similarity=0.992 Sum_probs=458.8 Q ss_pred HHHHHhhhhhhccCC--Ccchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhcccc Q lcl|NC_014661. 9 AQLVADWKPLLEAEG--APEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQ 85 (524) Q Consensus 9 ~~l~~kw~p~l~~~~--~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est 85 (524) -+|+|||+||||||| +|+|++ +||+|+|+|||||||+++|+++|+|+.++++|+.+|+||+++|+|||++.+|+||+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 469999999999998 799998 79999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccc Q lcl|NC_014661. 86 TSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVF 165 (524) Q Consensus 86 ~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~ 165 (524) +|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++ +.||||++||+|++|||++..... T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~t--g~EAf~~~nEadt~fSG~~~~~~~ 158 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcc--cccccccccccCcCcccccccccc Confidence 999999999999999999999999999999999999999999999999998754 679999999999999999887777 Q ss_pred cccccccccccccccccccccccccccccc-c-ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCC Q lcl|NC_014661. 166 APLASGTVVAQGTIYKHEFVATGTAFLQAT-G-AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGS 243 (524) Q Consensus 166 ~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~-g-~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs 243 (524) ...+.......++.........+....... . .........+...+.........+..|+++.||+|+.+|+++.||++ T Consensus 159 ~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs 238 (514) T protein:vir:56 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCC Confidence 776666666666655544332222111111 1 01111112222333445667778889999999999999999999999 Q ss_pred CCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccc Q lcl|NC_014661. 244 NNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQT 323 (524) Q Consensus 244 ~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~ 323 (524) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+++|..++ T Consensus 239 ~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~ 318 (514) T protein:vir:56 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) T ss_pred cccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888877 Q ss_pred cccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccccc Q lcl|NC_014661. 324 LTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLAR 403 (524) Q Consensus 324 ~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~ 403 (524) .+. .++|+|||++++|+.++||++|+||+|++|||||+|+|+|+|+||+|||||||++||++|+|+|++++++++|+.. T Consensus 319 ~~~-~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~ 397 (514) T protein:vir:56 319 QGA-GAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD 397 (514) T ss_pred ccc-ccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccc Confidence 665 5699999999999999999999999999999999999999999999999999999999999999999999999887 Q ss_pred c-cccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee Q lcl|NC_014661. 404 G-LNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI 482 (524) Q Consensus 404 ~-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~ 482 (524) + ++.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+|||||||||++ T Consensus 398 ~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~ 477 (514) T protein:vir:56 398 GSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV 477 (514) T ss_pred cccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccCCccccceeeeeeeeceee Confidence 5 899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 483 NPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 483 nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) |||++.. ++.++++||||.+|++ ++|.|||||+|||| T Consensus 478 NPy~~~~----~~~~~~~~~~~~~a~~-~~n~y~r~v~v~~l 514 (514) T protein:vir:56 478 NPFADPT----ASATKVGNGAPVAASM-GKNAYFRRVFVKGL 514 (514) T ss_pred CCCCCcc----ccccccCCcchhhhcc-cccceeeeEEEecC Confidence 9998443 4457899999998855 78999999999999 No 13 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1.4e-228 Score=1269.72 Aligned_cols=461 Identities=37% Similarity=0.649 Sum_probs=409.4 Q ss_pred cccchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhccccccccc-ccccccccchhh Q lcl|NC_014661. 3 TQIKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEA-EIGGDHGYDPQN 80 (524) Q Consensus 3 ~~~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea-~~~~~~g~~~~~ 80 (524) -|||++|+|+|||+|||||||+|+|++ +||+|+|+|||||||+|+|++.+ |.|+ +++++||+++.+ T Consensus 1 ~~~~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~~------------l~e~~~~~~~~~~~~~~ 68 (470) T protein:vir:10 1 MQMFNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERNF------------LSEAPNVNTNSGATAGF 68 (470) T ss_pred CCcchhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccch------------hhhhhhccccccccccc Confidence 689999999999999999999999987 89999999999999999999974 7777 799999999999 Q ss_pred hccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccc Q lcl|NC_014661. 81 IAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRG 160 (524) Q Consensus 81 i~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~ 160 (524) |+|||+|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++. |+|+| |+|+|+.|||++ T Consensus 69 i~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~s----G~Eaf--fnEA~T~fSG~~ 142 (470) T protein:vir:10 69 SADATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQS----GTEAL--FNEADTAFSGQP 142 (470) T ss_pred cccccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCC----cccee--eecCCcccCccc Confidence 999999999999999999999999999999999999999999999999999999874 67888 799999999987 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhccccc Q lcl|NC_014661. 161 SHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGF 240 (524) Q Consensus 161 ~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~l 240 (524) ...........+.............++.++... .......+..|+++.||+|+.+| .| T Consensus 143 ~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~-------------------~~~~~a~~~~y~~~~GMsTa~aE---~l 200 (470) T protein:vir:10 143 DGLDDTSGFTATGANNVGLGTTAQQGSNPGLLN-------------------STAAQTNATDYNVGQGMRTDSAE---DL 200 (470) T ss_pred ccccccccccccccccccccccccccccccccc-------------------cccccccccccccccccchHHhh---hc Confidence 665433322111100000000000111111100 01122334568899999999999 57 Q ss_pred CCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhh Q lcl|NC_014661. 241 NGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKT 320 (524) Q Consensus 241 Ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~ 320 (524) |++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+||+||+|||+ T Consensus 201 g~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~ 280 (470) T protein:vir:10 201 GDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQ 280 (470) T ss_pred CCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhcee Confidence 78899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccc Q lcl|NC_014661. 321 GQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQG 400 (524) Q Consensus 321 ~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~ 400 (524) +++ +++|+|||+++.| +||++|+||+|++||++++|+|+++|+||+|||||||++||++|+|+|++++.|++. T Consensus 281 ~~~----~~~Gv~Dl~~~~~---gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~ 353 (470) T protein:vir:10 281 ANV----AAAGTFDLDTDSN---GRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN 353 (470) T ss_pred ccc----cccceEEeecccc---hhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccccc Confidence 998 6699999997776 699999999999999999999999999999999999999999999999999998876 Q ss_pred cccccccccCcceEEEEecCceEEEeeCC------CCcceEEEEEecCCCccceeEeecccccccccccCcccccceeee Q lcl|NC_014661. 401 LARGLNTDTTKAVFAGILGGRYKVYIDQY------ARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGF 474 (524) Q Consensus 401 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y------~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~ 474 (524) . .++.|+|+++|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++.+||+||||+||| T Consensus 354 ~--~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 431 (470) T protein:vir:10 354 A--NLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGF 431 (470) T ss_pred c--ccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCccccceeee Confidence 3 57999999999999999999999997 789999999999999999999999999999999999999999999 Q ss_pred eeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 475 KTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 475 ~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) ||||||++|||++..++.++++++ ++|.|||||+|||| T Consensus 432 ~tRY~l~~NP~~~~~~~~~~~i~~------------~~n~y~r~~~v~~l 469 (470) T protein:vir:10 432 KTRYGLVENPFSQGTTQGLGTLTR------------NSNRYYRRVKVANL 469 (470) T ss_pred eeeeceeecCcccCCCcccccccC------------CCCceeeEEEeecc Confidence 999999999999999998876533 77999999999999 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=2.4e-226 Score=1257.53 Aligned_cols=458 Identities=37% Similarity=0.627 Sum_probs=402.9 Q ss_pred cchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhcc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAA 83 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~e 83 (524) |++.|+|+|||+|||||||+|+|++ +||+|+|+|||||||||+|++.|++|.+.++||+. +.....++++ T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~~---------~~~~~n~~~~ 71 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAG---------TIAPAGSALG 71 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCCc---------ccchhhhhhh Confidence 9999999999999999999999988 89999999999999999999999999999999742 2344457788 Q ss_pred ccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccc Q lcl|NC_014661. 84 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHE 163 (524) Q Consensus 84 st~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~ 163 (524) +++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++. |+||| |||||++|||+++.. T Consensus 72 ~~~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~----g~EAf--~nEadt~fSg~~~~~ 145 (468) T protein:vir:10 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDTGFTGGYDAS 145 (468) T ss_pred hcccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCC----Cccce--ecccccccccccccc Confidence 999999999999999999999999999999999999999999999999999884 78999 699999999976543 Q ss_pred cccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCC Q lcl|NC_014661. 164 VFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGS 243 (524) Q Consensus 164 ~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs 243 (524) ............. ++....++... ....+..|+++.||+|+.+|.++ + T Consensus 146 ~~~~~~~~~~~~~---------------------------~~~~g~~~~~~-~~a~~~~~~~g~gMsTa~aE~lG----~ 193 (468) T protein:vir:10 146 QGDYAVRTGAGVG---------------------------GDSEGNNPALL-NDAAPGTYEVGSKMPREDLERMG----E 193 (468) T ss_pred ccccccccccccc---------------------------cCCCCCccccc-ccccccccccccccchHHHhhcC----C Confidence 2221111000000 00001111111 11234567899999999999763 4 Q ss_pred CCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccc Q lcl|NC_014661. 244 NNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQT 323 (524) Q Consensus 244 ~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~ 323 (524) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+||+||+|||+.++ T Consensus 194 ~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~ 273 (468) T protein:vir:10 194 ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) T ss_pred CCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecccc Confidence 56789999999999999999999999999999999999999999999999999999999999999999999999998764 Q ss_pred cccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccccc Q lcl|NC_014661. 324 LTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLAR 403 (524) Q Consensus 324 ~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~ 403 (524) +++|+|||+++.| +||++|+||+|++|||+++|+|+++|+||+|||||||++||++|+|+|+++++|++.... T Consensus 274 ----~~~Gv~d~~~~~~---~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~ 346 (468) T protein:vir:10 274 ----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) T ss_pred ----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccc Confidence 7799999997776 699999999999999999999999999999999999999999999999999999988876 Q ss_pred cc---ccccCcceEEEEecCceEEEeeCCC----CcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeee Q lcl|NC_014661. 404 GL---NTDTTKAVFAGILGGRYKVYIDQYA----RQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKT 476 (524) Q Consensus 404 ~~---~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t 476 (524) ++ +.|+++++|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+||||+||||| T Consensus 347 ~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~t 426 (468) T protein:vir:10 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKT 426 (468) T ss_pred cccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeeee Confidence 64 7899999999999999999999996 5899999999999999999999999999999999999999999999 Q ss_pred eeceeeCCcccccCCccccceeeccccchh-hhhccccceeeeeeeecC Q lcl|NC_014661. 477 RYGIGINPLADTAAQQPAGNARIANGMPSI-ANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 477 RY~l~~nP~~~~~~~~~~~~~~~~~g~~~~-a~~~~~~~~~r~~~v~~~ 524 (524) ||||++|||+... +++||++.. +...++|.|||||+|||| T Consensus 427 RY~l~~NP~~~~~--------~~~~g~~~~~~~~~~~N~y~r~~~v~~l 467 (468) T protein:vir:10 427 RYGMVSNPFVTTN--------GLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) T ss_pred eeceeecccceec--------cccCCCcccccccccccceeeeEEEecc Confidence 9999999999653 456666654 335699999999999999 No 15 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=6.3e-223 Score=1238.78 Aligned_cols=453 Identities=40% Similarity=0.677 Sum_probs=395.7 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchh Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQ 79 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~ 79 (524) ||+ |+|+|||+|||||||+|+|++ +||+|+++|||||||+|+|++. +|+||. ++|||+ T Consensus 1 ms~-----~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~------------~l~ea~--~~~g~~-- 59 (462) T protein:vir:10 1 MSI-----QQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEGQ------------VLNETL--QTTGYT-- 59 (462) T ss_pred Cch-----HHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhccc------------chhccc--cccCCC-- Confidence 655 899999999999999999998 6999999999999999999776 688884 899999 Q ss_pred hhccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCC--ccccccccccccccccc Q lcl|NC_014661. 80 NIAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAA--GAKEAFHPMYAPDAMFS 157 (524) Q Consensus 80 ~i~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~--~g~eAf~~fnEadt~FS 157 (524) .|+++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++.+. ++.||| |||+|+.|| T Consensus 60 --~~~~~t~~~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAl--fnEadt~fS 135 (462) T protein:vir:10 60 --TGDTATGPVAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREAL--FNEPNAGFS 135 (462) T ss_pred --cCcccccccccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhh--hccCCcCcc Confidence 67888999999999999999999999999999999999999999999999999876543 578888 699999999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcc Q lcl|NC_014661. 158 GRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQ 237 (524) Q Consensus 158 G~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l 237 (524) |..+............ .......+..+... + .........++++.||+|+.+|++ T Consensus 136 g~~~~~~~~~~~~~~~---------------~~~~~~~g~~~~~~--~--------~~~~g~~~~~~~~~GM~Ta~aE~l 190 (462) T protein:vir:10 136 GGAGTGLSNYDPTASS---------------SAVNDAEGANPGLL--N--------DSPAGTYEVTGDATGMATATAEAL 190 (462) T ss_pred cccccccccccccccc---------------ccccccccccceee--c--------CCCccceecccccccccchhcccc Confidence 9765432221111000 00000111111000 0 000111234567889999999976 Q ss_pred cccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhh Q lcl|NC_014661. 238 EGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQV 317 (524) Q Consensus 238 ~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~ 317 (524) +. ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+||+||+| T Consensus 191 g~--~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~ 268 (462) T protein:vir:10 191 DD--SSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVK 268 (462) T ss_pred CC--ccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhhee Confidence 52 4667899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccc Q lcl|NC_014661. 318 GKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPA 397 (524) Q Consensus 318 ~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~ 397 (524) ||++++ +++|+|||+++.| +||++|+||+|++||++++|+|+|+|+||+|||||||++||++|+|+|+|++.| T Consensus 269 ~k~~~~----~~~Gv~dl~~~~~---gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p 341 (462) T protein:vir:10 269 GAIANT----ATDGIFDLDVDSN---GRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAP 341 (462) T ss_pred eecccc----cccceeeeccccc---hHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccc Confidence 999998 6699999987766 699999999999999999999999999999999999999999999999999999 Q ss_pred cccccccc-ccccCcceEEEEecCceEEEeeCC----CCcceEEEEEecCCCccceeEeecccccccccccCccccccee Q lcl|NC_014661. 398 AQGLARGL-NTDTTKAVFAGILGGRYKVYIDQY----ARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVL 472 (524) Q Consensus 398 a~~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~ 472 (524) +.....++ ++|+++.+|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||+++|++||+||||+| T Consensus 342 ~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 421 (462) T protein:vir:10 342 GLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKI 421 (462) T ss_pred cccccccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCcccccee Confidence 87766665 789999999999999999999998 6899999999999999999999999999999999999999999 Q ss_pred eeeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 473 GFKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 473 ~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) ||||||||++|||+++.++.++++++ ++|.|||||+|||| T Consensus 422 g~~tRY~l~~NP~t~~~~~~~~~~~~------------~~n~y~r~~~v~~l 461 (462) T protein:vir:10 422 GFKTRYGMVSNPFSGGLTQGSGALTA------------NANKYYRRVQVANL 461 (462) T ss_pred eeeeeeeeeecCCCCCcCCccccccc------------cCcceeeeEEeecc Confidence 99999999999999999999876533 78999999999999 No 16 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=1.4e-218 Score=1214.92 Aligned_cols=448 Identities=41% Similarity=0.685 Sum_probs=394.2 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhh-hhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchh Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQ-GKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQ 79 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~-~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~ 79 (524) ||+ |+|+|||+|||||||+|+|++ +||+|+++|||||||+|.|++. +|+||. ++|||++. T Consensus 1 m~~-----~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~~------------~l~ea~--~~~g~~~~ 61 (457) T protein:vir:10 1 MSF-----QNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEGK------------ILTETL--QTTGYTGG 61 (457) T ss_pred Cch-----HHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhccc------------cccccc--cccCCCcc Confidence 654 889999999999999999988 7999999999999999998776 688885 89999965 Q ss_pred hhccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCC--Cccccccccccccccccc Q lcl|NC_014661. 80 NIAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIA--AGAKEAFHPMYAPDAMFS 157 (524) Q Consensus 80 ~i~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~--~~g~eAf~~fnEadt~FS 157 (524) |++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++... .+.+||| |||||+.|| T Consensus 62 ----s~~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl--~nEadt~fS 135 (457) T protein:vir:10 62 ----DTVTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAF--FNEPNAGFS 135 (457) T ss_pred ----cccccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCcccccccccccee--eeccCcccC Confidence 67899999999999999999999999999999999999999999999999988654 3457888 799999999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcc Q lcl|NC_014661. 158 GRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQ 237 (524) Q Consensus 158 G~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l 237 (524) |..+......... .+ .. .+..+.. .+ .........++++.||+|+++|.+ T Consensus 136 g~~~~~~~~~~~~-----~~-----~~----------~gt~~~~-------~~---~~~~~~~~~~~~~~gmsTA~aE~l 185 (457) T protein:vir:10 136 GGPGAYDPGATGV-----TN-----DA----------EGTNPAL-------LN---DSPAGTYEQADDATGMSTATVEAL 185 (457) T ss_pred ccccccccccccc-----cc-----cc----------ccccccc-------cC---ccccccccccccccchhhhhhhcc Confidence 9755432111000 00 00 0000000 00 011122345788999999999976 Q ss_pred cccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhh Q lcl|NC_014661. 238 EGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQV 317 (524) Q Consensus 238 ~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~ 317 (524) +. +++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+||+||+| T Consensus 186 gd--~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~ 263 (457) T protein:vir:10 186 DD--STANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVA 263 (457) T ss_pred CC--CCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhhee Confidence 42 5677899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccc Q lcl|NC_014661. 318 GKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPA 397 (524) Q Consensus 318 ~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~ 397 (524) ||++|+ +++|+|||+++.| +||++|+||+|++||++++|+|+++|+||+|||||||++||++|+|+|++++.| T Consensus 264 ~~~~~~----~~~gv~dl~~~~~---g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p 336 (457) T protein:vir:10 264 GAQNNT----ATAGVFDLDVDSN---GRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTP 336 (457) T ss_pred eecccc----ccceeeeeecccc---chhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccc Confidence 999998 5699999986666 699999999999999999999999999999999999999999999999999999 Q ss_pred ccccccc-cccccCcceEEEEecCceEEEeeCCC----CcceEEEEEecCCCccceeEeecccccccccccCccccccee Q lcl|NC_014661. 398 AQGLARG-LNTDTTKAVFAGILGGRYKVYIDQYA----RQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVL 472 (524) Q Consensus 398 a~~~~~~-~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~ 472 (524) ++....+ .+.|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+++|++||+||||+| T Consensus 337 ~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 416 (457) T protein:vir:10 337 ALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKI 416 (457) T ss_pred hhhccccccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCcccccee Confidence 9887766 46899999999999999999999986 589999999999999999999999999999999999999999 Q ss_pred eeeeeeceeeCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 473 GFKTRYGIGINPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 473 ~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) ||||||||++|||+.+.++.++++++ |+|.||||++|+|| T Consensus 417 g~~tRY~l~~NP~~~~~~~~~~~~~~------------~~n~~~~rs~vs~l 456 (457) T protein:vir:10 417 GFKTRYGMVSNPFAGGLTQGSGALTV------------NANKYYRRVQVANL 456 (457) T ss_pred eeeeeeeeeecccccccccccccccc------------cchhhcceeeeeec Confidence 99999999999999999999876532 67899999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=6.2e-197 Score=1096.27 Aligned_cols=450 Identities=24% Similarity=0.328 Sum_probs=345.7 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhh Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQN 80 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~ 80 (524) ||++. .+|+|+|||+||||++|.| +||+|+|+|||||||+ ++ .+ T Consensus 1 ~~~~~-~~e~l~~kw~p~l~~~~~~----~~~~~~a~llenq~~~---~~----------------------------~~ 44 (523) T protein:vir:59 1 MSQPK-INEQLIEKWQPLLEGCRND----WERHTLATLLENQYRE---AK----------------------------KH 44 (523) T ss_pred CCcch-hhHHHHHhhhhhhcccCCh----hHHHHHHHHhhhhhHH---HH----------------------------Hh Confidence 98877 6688999999999986655 7999999999999984 22 34 Q ss_pred hccccccccccccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCC--------cccccccccccc Q lcl|NC_014661. 81 IAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAA--------GAKEAFHPMYAP 152 (524) Q Consensus 81 i~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~--------~g~eAf~~fnEa 152 (524) |.|++.|++|++|.| ||+||||++|||||+||||||||||||||||||||||.++++.. .+.++..+++++ T Consensus 45 l~e~~~~~~~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ea 123 (523) T protein:vir:59 45 LMETTQTTEVDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDE 123 (523) T ss_pred hhhhhhccccccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCccccccccccc Confidence 566677999999997 99999999999999999999999999999999999999986321 122233445677 Q ss_pred ccccccccccccccccc--c------------cccccccccc--cccc-----ccccc---------------------- Q lcl|NC_014661. 153 DAMFSGRGSHEVFAPLA--S------------GTVVAQGTIY--KHEF-----VATGT---------------------- 189 (524) Q Consensus 153 dt~FSG~~~~~~~~~~~--~------------g~~~a~g~~~--~~~~-----~~tg~---------------------- 189 (524) ++.|++........... . +.+.+.+... +... ...+. T Consensus 124 n~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~ 203 (523) T protein:vir:59 124 NARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAY 203 (523) T ss_pred ccccccccccCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccc Confidence 77777654332211100 0 0000000000 0000 00000 Q ss_pred ---ccccccccccc--------CCCCCcccccccccccccccccccccccccchhhhcccccC--CCCCcchhhcceEEE Q lcl|NC_014661. 190 ---AFLQATGAVTL--------ATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFN--GSNNNPWNEMGFRID 256 (524) Q Consensus 190 ---~~~~~~g~~~~--------a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lG--gs~~~~f~EMsFsIE 256 (524) .+......... ...++..............+..++.+.||+|+.+|.++..+ ++.++.|+||+|+|| T Consensus 204 s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIe 283 (523) T protein:vir:59 204 PLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELR 283 (523) T ss_pred hhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEE Confidence 00000000000 00000001111111122234568889999999999887544 578899999999999 Q ss_pred EEEEEEecccccchhhHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecc Q lcl|NC_014661. 257 KQVIEAKSRQLKAQYSIELAQDLRAVH-GMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDF 335 (524) Q Consensus 257 K~TVtAKSRALKAEYT~ELAQDLKAvH-GLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl 335 (524) ||+|||||||||||||||||||||||| |||||+||+||||+||||||||||||+||+||+|||+.++ .++|+||| T Consensus 284 K~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~----~~~g~~~~ 359 (523) T protein:vir:59 284 SRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGF----WSEVVGEY 359 (523) T ss_pred eEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccc----cccceeee Confidence 999999999999999999999999999 9999999999999999999999999999999999999988 55999999 Q ss_pred ccccc---ccccchH--HHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccC Q lcl|NC_014661. 336 QDPID---VRGARWA--GESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTT 410 (524) Q Consensus 336 ~~~~d---~~~~~~a--~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~ 410 (524) .++.| +.+.+|. +|++|.||++||||+|+|+|+|+||+|||||||++||++|+++|+|...+ ....|++ T Consensus 360 ~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~------~~~~~~~ 433 (523) T protein:vir:59 360 YDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGN------DNRDGGT 433 (523) T ss_pred cccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCC------ccccccc Confidence 98776 2222332 89999999999999999999999999999999999999999999885522 2355778 Q ss_pred cceEEEEecCceEEEeeCCCCcceEEEEEecC-CCccceeEeeccccccccccc-Ccccccceeeeeeeeceee-CCccc Q lcl|NC_014661. 411 KAVFAGILGGRYKVYIDQYARQDYFTIGYKGD-NEMDAGIYYAPYVALTPLRGA-DPKNFQPVLGFKTRYGIGI-NPLAD 487 (524) Q Consensus 411 ~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~-~~~d~g~fyaPYv~~~~~~~~-Dp~s~qP~~~~~tRY~l~~-nP~~~ 487 (524) +.+|+|+|+|||+||||||+++|||+|||||. .++|+|||||||||+.++|.+ ||+||||+|||||||||++ |||+. T Consensus 434 ~~~~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~ 513 (523) T protein:vir:59 434 GIFYVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFY 513 (523) T ss_pred cceeEEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHh Confidence 89999999999999999999999999999995 599999999999999999997 9999999999999999986 99997 Q ss_pred ccCCccccceeeccccc Q lcl|NC_014661. 488 TAAQQPAGNARIANGMP 504 (524) Q Consensus 488 ~~~~~~~~~~~~~~g~~ 504 (524) +.---+ +-. | T Consensus 514 ~~~~~~-----~~~--~ 523 (523) T protein:vir:59 514 GLLYVK-----LLQ--P 523 (523) T ss_pred hhhhhh-----hcC--C Confidence 743211 000 0 No 18 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=350 Identities=14% Similarity=0.095 Sum_probs=136.5 Q ss_pred CCcccchHHHHHHhhhhh---hccCCCcchhhhhhhhhhhhhhhHHHHHhh------------------hhccccc---- Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPL---LEAEGAPEIAQGKHAIIAKMFENQEADIKS------------------DAAYRDE---- 55 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~---l~~~~~~~~~~~~~~~~~~~~enq~~~~~~------------------~~~~~~~---- 55 (524) ||+--.-++++.++++.+ ++.+. -++.... .. .+=|++|-+.+.+ .+..... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~-~e~~~~~-~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQK-AEIESTG-QV-SKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 876322233333333332 22111 0111000 00 0111111111100 0000000 Q ss_pred --hhhhcccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeee Q lcl|NC_014661. 56 --KLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAV 132 (524) Q Consensus 56 --~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSr 132 (524) ...+.+-.++..........-....+..+++++. .-..|.++ .+++++..+..-.++|-++||++++.-+ .+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~----~~ 152 (385) T protein:vir:19 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAG-SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY----VR 152 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCC-ceecchhhhHHHHHhhhccchhhhcceecccCcceEE----EE Confidence 0000000000000000000000001111111111 11223333 5555566677888899999998765321 11 Q ss_pred ecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccc Q lcl|NC_014661. 133 YGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAE 212 (524) Q Consensus 133 Y~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~ 212 (524) +.... +.+.|- T Consensus 153 ~~~~~--------------~~a~~v------------------------------------------------------- 163 (385) T protein:vir:19 153 EEVFT--------------NNADVV------------------------------------------------------- 163 (385) T ss_pred EecCC--------------cceeee------------------------------------------------------- Confidence 10000 000000 Q ss_pred ccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHH Q lcl|NC_014661. 213 VIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELA 292 (524) Q Consensus 213 ~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELa 292 (524) .| +...++-..++++++.+.|.-+-...+|-||.||-- +.++.|. T Consensus 164 ---------------------~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~ 208 (385) T protein:vir:19 164 ---------------------AE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYIN 208 (385) T ss_pred ---------------------cc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHH Confidence 01 112334455566777777777778889999999852 2467777 Q ss_pred HHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_014661. 293 NILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRG 372 (524) Q Consensus 293 NILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg 372 (524) +-|+..|..-+|+.||.. .- +...+.|++.......... .... -..+..|..+...|. ..++ T Consensus 209 ~~la~a~~~~~d~~~l~G----~g--------~~~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~~~~ 270 (385) T protein:vir:19 209 NRLMYGLALKEEGQLLNG----DG--------TGDNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--ESEF 270 (385) T ss_pred HHHHHHHHHHHHHHHHhc----cC--------CCCcccccccccccccccc-cccc---cchHHHHHHHHHhhc--cccC Confidence 777777777777777642 10 1112234432211110000 0000 112233333333332 2344 Q ss_pred CccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEee Q lcl|NC_014661. 373 AGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYA 452 (524) Q Consensus 373 ~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fya 452 (524) ..+.+||||+....|..... +.|.. +..+.+. -.-++|.| ++|+++++.|..-+++|--- .+++. T Consensus 271 ~~~~~~~~~~~~~~l~~lkd-----~~G~~--l~~~~~~-~~~~~l~G-~pV~~~~~~p~~~~~~gd~~-----~~~~~- 335 (385) T protein:vir:19 271 SASGIVLNPRDWHNIALLKD-----NEGRY--IFGGPQA-FTSNIMWG-LPVVPTKAQAAGTFTVGGFD-----MASQV- 335 (385) T ss_pred CCCEEEEcHHHHHHHHHhhc-----CCCce--eccCccc-CCCceecc-eeeEEcCcCCCCcEEEeecc-----cEEEE- Confidence 67889999999999876431 11110 1111110 01256777 79999999987766665210 01111 Q ss_pred cccc-ccccccc--Ccccc-cceee--eeeeece-eeCCcccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 453 PYVA-LTPLRGA--DPKNF-QPVLG--FKTRYGI-GINPLADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 453 PYv~-~~~~~~~--Dp~s~-qP~~~--~~tRY~l-~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) +.. ...+... +.+-| +..++ ...||+. +.+|= .+.++.- +-.+ T Consensus 336 -~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~---------a~~~~~~-----~aa~ 385 (385) T protein:vir:19 336 -WDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPT---------AIIKGTF-----SSGS 385 (385) T ss_pred -EEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc---------ceEEEEe-----ccCC Confidence 111 0001100 00111 22333 3447766 33441 1111111 1011 No 19 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=350 Identities=14% Similarity=0.095 Sum_probs=136.5 Q ss_pred CCcccchHHHHHHhhhhh---hccCCCcchhhhhhhhhhhhhhhHHHHHhh------------------hhccccc---- Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPL---LEAEGAPEIAQGKHAIIAKMFENQEADIKS------------------DAAYRDE---- 55 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~---l~~~~~~~~~~~~~~~~~~~~enq~~~~~~------------------~~~~~~~---- 55 (524) ||+--.-++++.++++.+ ++.+. -++.... .. .+=|++|-+.+.+ .+..... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~-~e~~~~~-~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQK-AEIESTG-QV-SKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 876322233333333332 22111 0111000 00 0111111111100 0000000 Q ss_pred --hhhhcccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeee Q lcl|NC_014661. 56 --KLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAV 132 (524) Q Consensus 56 --~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSr 132 (524) ...+.+-.++..........-....+..+++++. .-..|.++ .+++++..+..-.++|-++||++++.-+ .+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~----~~ 152 (385) T protein:vir:18 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAG-SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY----VR 152 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCC-ceecchhhhHHHHHhhhccchhhhcceecccCcceEE----EE Confidence 0000000000000000000000001111111111 11223333 5555566677888899999998765321 11 Q ss_pred ecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccc Q lcl|NC_014661. 133 YGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAE 212 (524) Q Consensus 133 Y~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~ 212 (524) +.... +.+.|- T Consensus 153 ~~~~~--------------~~a~~v------------------------------------------------------- 163 (385) T protein:vir:18 153 EEVFT--------------NNADVV------------------------------------------------------- 163 (385) T ss_pred EecCC--------------cceeee------------------------------------------------------- Confidence 10000 000000 Q ss_pred ccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHH Q lcl|NC_014661. 213 VIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELA 292 (524) Q Consensus 213 ~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELa 292 (524) .| +...++-..++++++.+.|.-+-...+|-||.||-- +.++.|. T Consensus 164 ---------------------~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~ 208 (385) T protein:vir:18 164 ---------------------AE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYIN 208 (385) T ss_pred ---------------------cc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHH Confidence 01 112334455566777777777778889999999852 2467777 Q ss_pred HHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_014661. 293 NILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRG 372 (524) Q Consensus 293 NILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg 372 (524) +-|+..|..-+|+.||.. .- +...+.|++.......... .... -..+..|..+...|. ..++ T Consensus 209 ~~la~a~~~~~d~~~l~G----~g--------~~~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~~~~ 270 (385) T protein:vir:18 209 NRLMYGLALKEEGQLLNG----DG--------TGDNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--ESEF 270 (385) T ss_pred HHHHHHHHHHHHHHHHhc----cC--------CCCcccccccccccccccc-cccc---cchHHHHHHHHHhhc--cccC Confidence 777777777777777642 10 1112234432211110000 0000 112233333333332 2344 Q ss_pred CccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEee Q lcl|NC_014661. 373 AGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYA 452 (524) Q Consensus 373 ~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fya 452 (524) ..+.+||||+....|..... +.|.. +..+.+. -.-++|.| ++|+++++.|..-+++|--- .+++. T Consensus 271 ~~~~~~~~~~~~~~l~~lkd-----~~G~~--l~~~~~~-~~~~~l~G-~pV~~~~~~p~~~~~~gd~~-----~~~~~- 335 (385) T protein:vir:18 271 SASGIVLNPRDWHNIALLKD-----NEGRY--IFGGPQA-FTSNIMWG-LPVVPTKAQAAGTFTVGGFD-----MASQV- 335 (385) T ss_pred CCCEEEEcHHHHHHHHHhhc-----CCCce--eccCccc-CCCceecc-eeeEEcCcCCCCcEEEeecc-----cEEEE- Confidence 67889999999999876431 11110 1111110 01256777 79999999987766665210 01111 Q ss_pred cccc-ccccccc--Ccccc-cceee--eeeeece-eeCCcccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 453 PYVA-LTPLRGA--DPKNF-QPVLG--FKTRYGI-GINPLADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 453 PYv~-~~~~~~~--Dp~s~-qP~~~--~~tRY~l-~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) +.. ...+... +.+-| +..++ ...||+. +.+|= .+.++.- +-.+ T Consensus 336 -~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~---------a~~~~~~-----~aa~ 385 (385) T protein:vir:18 336 -WDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPT---------AIIKGTF-----SSGS 385 (385) T ss_pred -EEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc---------ceEEEEe-----ccCC Confidence 111 0001100 00111 22333 3447766 33441 1111111 1011 No 20 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=96.17 E-value=0.00091 Score=37.25 Aligned_cols=277 Identities=11% Similarity=0.070 Sum_probs=133.5 Q ss_pred cccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccc Q lcl|NC_014661. 74 HGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAP 152 (524) Q Consensus 74 ~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEa 152 (524) -||+++....+...+. ..-|.+. .+++++..+.+-.+++-+-||++.+--+ ...+. + T Consensus 1 ~g~~a~~~~~~~~~~~--~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-----~~~~~---------------~ 58 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTG--SIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEF-----TFMSG---------------V 58 (299) T ss_pred CCcCCCcccccCCCce--ecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEE-----EEEcC---------------C Confidence 5666554332221111 1222222 6777778888999999999998776321 11000 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccch Q lcl|NC_014661. 153 DAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATS 232 (524) Q Consensus 153 dt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts 232 (524) .+.| + T Consensus 59 ~a~~--------------------------------------------------------------------v------- 63 (299) T protein:vir:41 59 GAFW--------------------------------------------------------------------V------- 63 (299) T ss_pred ceee--------------------------------------------------------------------e------- Confidence 0000 0 Q ss_pred hhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHh Q lcl|NC_014661. 233 IAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWIN 312 (524) Q Consensus 233 ~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~ 312 (524) +| +.+++|...++++++...|..+-...+|-||.+|-. .|.+++|.+.|+..|...+++.||..-- T Consensus 64 -~E---------~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g 129 (299) T protein:vir:41 64 -DE---------AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVE 129 (299) T ss_pred -ec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 11 122344555668888888888888999999999854 4568999999999999999988885321 Q ss_pred hhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCc Q lcl|NC_014661. 313 YSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDT 392 (524) Q Consensus 313 ~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~ 392 (524) +- .+.|++...... ... .+.....+.-|+++-+.+... ..+.+.+||+|+....|..... T Consensus 130 ~~-------------~~~gil~~~~~~-~~~----~~~~~~~~~~l~~~~~~l~~~--~~~~~~~v~n~~~~~~L~~lkd 189 (299) T protein:vir:41 130 SP-------------YNWNILKSATDA-SNL----VEETANKYDDLNEAIGLIEAE--DLEPNGIATIRKQRVKYRSTKD 189 (299) T ss_pred Cc-------------cccccccccccc-cee----eccccccHHHHHHHHHhhhcc--cCCcCEEEEcHHHHHHHHHhhc Confidence 10 112222110000 000 000001122234444444432 3357789999999999986331 Q ss_pred cccccccccccccccccCcceEEEEecCceEEEeeCCCCcce----EEEEEecCCCccceeEeeccccccc--------c Q lcl|NC_014661. 393 SVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDY----FTIGYKGDNEMDAGIYYAPYVALTP--------L 460 (524) Q Consensus 393 ~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----~~vG~KG~~~~d~g~fyaPYv~~~~--------~ 460 (524) +.|.- =+..+.++. .++|.| ++|++.++.+.+= +++|-.- ..++..+-.... . T Consensus 190 -----~~G~~-l~~~~~~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gdfs------~~~i~~~~~~~i~~~~~~~~~ 254 (299) T protein:vir:41 190 -----GNGMP-IFNTATSNG--VDDVLG-LPIAYTPKYTFGDKDISELVGDWN------QAYYGILRGVEYEILTEATLT 254 (299) T ss_pred -----cCCce-eecCCcCCC--Cceecc-eeeEEecccCCCCCceEEEEEecc------cEEEEEecCcEEEEeeccccc Confidence 11110 011122211 246776 7999888876541 2222110 011111111110 0 Q ss_pred cccCccc-----ccc-eeee--eeeeceee-CCcccccCCccccceeeccccchhhhhcc Q lcl|NC_014661. 461 RGADPKN-----FQP-VLGF--KTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVG 511 (524) Q Consensus 461 ~~~Dp~s-----~qP-~~~~--~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~ 511 (524) ...|++. ||- .+.| ..|++..+ ||=+ +.++.. . .+| T Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A---------~~~l~~---~---aa~ 299 (299) T protein:vir:41 255 TVADETGKPLNLAERDMAAIKATFEVGFMVVKDEA---------FSAVQP---K---AGN 299 (299) T ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccc---------eEEEEe---c---cCC Confidence 1112221 222 2333 35777654 4411 112111 0 111 No 21 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=96.09 E-value=0.001 Score=36.97 Aligned_cols=336 Identities=15% Similarity=0.107 Sum_probs=138.3 Q ss_pred CCcccchHHH-HHHhhhh---hhccCCCcchhhhhhh---hhhhhhhhHHHHHhhhhc-cccchh-----------hhcc Q lcl|NC_014661. 1 MSTQIKTKAQ-LVADWKP---LLEAEGAPEIAQGKHA---IIAKMFENQEADIKSDAA-YRDEKL-----------AEAF 61 (524) Q Consensus 1 ~~~~~~~~~~-l~~kw~p---~l~~~~~~~~~~~~~~---~~~~~~enq~~~~~~~~~-~~~~~~-----------~~~~ 61 (524) |++++....+ +..++.. +++.+.+-++...+.. +-.+ ++.+++.+.+... ..++.. ..+| T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAENKIEEAKKLKEEIVALQEK-FDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAF 79 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccchhhHHHHHHHH Confidence 9987754322 2223332 2332222223221111 1111 1111111111000 000000 0011 Q ss_pred cccccccccccccccchhhhcccccc-ccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCC Q lcl|NC_014661. 62 GGFLTEAEIGGDHGYDPQNIAAGQTS-GAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIA 139 (524) Q Consensus 62 ~~~l~ea~~~~~~g~~~~~i~est~t-g~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~ 139 (524) ..++. +-+...+..++++ |.+. .-+.+. .+++.+.++.+..+++.+.||++.++-+.-.+ .... T Consensus 80 ~~~l~--------~~~~~a~~~~t~~~gg~~-vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~~--- 145 (371) T protein:vir:81 80 VNHIR--------TRFRNAMSEGSNQDGGYT-VPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQQ--- 145 (371) T ss_pred HHHHH--------HHHHHhhccCCCccCcee-ecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC--- Confidence 11110 1112223333322 2211 112222 46666778888999999999988776542111 1100 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccc Q lcl|NC_014661. 140 AGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDA 219 (524) Q Consensus 140 ~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~ 219 (524) .+ ..| T Consensus 146 ---~~---------a~~--------------------------------------------------------------- 150 (371) T protein:vir:81 146 ---TG---------FVE--------------------------------------------------------------- 150 (371) T ss_pred ---cc---------eee--------------------------------------------------------------- Confidence 00 000 Q ss_pred cccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_014661. 220 GILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEI 299 (524) Q Consensus 220 g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEI 299 (524) +++|- ...| .+...|.+..++..|.. -...+|-||.+|-. .|.++.|.+.|...| T Consensus 151 -----v~Eg~--~~~~-------~~~~~f~~i~~~~~k~~-------~~~~iS~ell~ds~----~~l~~~i~~~l~~a~ 205 (371) T protein:vir:81 151 -----VAEGA--AIGE-------KATPQFTLLQYQVKKYA-------GFFRVTNELLNDST----EAIVNTLVRWIGDES 205 (371) T ss_pred -----ecccc--cccc-------ccccceeeEEeeeeEEE-------EeehhhHHHHhhhh----HHHHHHHHHHHHHHH Confidence 00110 0000 01123444444444444 45579999999853 466888999999999 Q ss_pred HHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEe Q lcl|NC_014661. 300 MLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIA 379 (524) Q Consensus 300 mlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~ 379 (524) ..-+|+.||...-+ +.+.|+.+++ ....++... ....+.....+|+ T Consensus 206 ~~~~~~~i~~g~g~-------------~~~~~~~~~~-------------~i~~~~~~~--------l~~~~~~~a~~vm 251 (371) T protein:vir:81 206 RVTRNGLIINVLNT-------------KAKTAIADLD-------------GLKQIINVQ--------LDPVFRSTSSVIV 251 (371) T ss_pred HHHHHHHHHhhccc-------------ccccccccHH-------------HHHHHHHhh--------cchhhhcCCEEEE Confidence 88888888774332 1223433322 112211110 0112223457899 Q ss_pred CHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeeccccc-- Q lcl|NC_014661. 380 SRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVAL-- 457 (524) Q Consensus 380 S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~-- 457 (524) +|.....|..... +.|.- -+..+.+. -..|+|.| ++||+..+.+...-.++--+.+ ..-++|+.+..+ T Consensus 252 n~~~~~~L~~lkd-----~~g~~-l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~--~~~i~~Gd~~~~~~ 321 (371) T protein:vir:81 252 NQDAFNWLDTLKD-----QNGQY-LLQPSISS-PTGRQLLG-LPVVIVSNKVLANRVDGGTGAQ--FAPIIVGDLKEAVV 321 (371) T ss_pred cHHHHHHHHHhhc-----cCCCe-eeecccCC-CCCceecc-eeEEEecccccCccccccccCC--cceEEEEehhceEE Confidence 9999999975421 11100 01111121 12367887 6999887766443222111111 122344432110 Q ss_pred -----ccccccCcc------cccceeeeeeeecee-eCCcccccCCccccceeeccccchhhhh Q lcl|NC_014661. 458 -----TPLRGADPK------NFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIANS 509 (524) Q Consensus 458 -----~~~~~~Dp~------s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~ 509 (524) .+.-.+++. +-|=.+-...||+.. .||=+ +.++.- +.+ T Consensus 322 ~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a---------~~~~~~-----~~A 371 (371) T protein:vir:81 322 MFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEA---------FVFGEV-----QLA 371 (371) T ss_pred EEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccc---------eEEEEE-----ecC Confidence 011112322 223455555666653 34411 111111 101 No 22 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=352 Identities=13% Similarity=0.095 Sum_probs=126.8 Q ss_pred cchHHHHHHhhhhhhccC-CC-------cch--------hhhhhhhhhhhhhhHHHHHhhhhccccchhh--------hc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAE-GA-------PEI--------AQGKHAIIAKMFENQEADIKSDAAYRDEKLA--------EA 60 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~-~~-------~~~--------~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~--------~~ 60 (524) |..+| |+|+++.+++.- .+ ..+ ...+..| . =|++|.+.+.+.-+ +..... .. T Consensus 1 M~i~e-L~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei-~-~l~~~I~~~e~~~~-~~~~~~~~~~~~~~~~ 76 (435) T protein:vir:14 1 MNVNE-LRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKF-S-ELTAQIERAEAAER-MAAAAAVPVDPNPTAV 76 (435) T ss_pred CCHHH-HHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHH-H-HHHHHHHHHHHHHH-HHHhhcccccchhhhh Confidence 66644 888888776521 10 011 1111111 0 11222222111000 000000 00 Q ss_pred cccc--cccc--ccccccccc----hhhhccc-----------------------cccccccccCcchh------hHHHH Q lcl|NC_014661. 61 FGGF--LTEA--EIGGDHGYD----PQNIAAG-----------------------QTSGAVTQIGPAVM------GMVRR 103 (524) Q Consensus 61 ~~~~--l~ea--~~~~~~g~~----~~~i~es-----------------------t~tg~v~~~~P~Li------~l~Rr 103 (524) .... -... ..-...+.. ...+... .+++.- ..+..|| .++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~-~~gg~~vP~~~~~~ii~~ 155 (435) T protein:vir:14 77 AAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSP-GAGGVLVPENLSSEVIEL 155 (435) T ss_pred hhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCc-CCCccccchhHHHHHHHH Confidence 0000 0000 000000000 0000000 000000 0001111 11111 Q ss_pred HHHhhhhhhc-eeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 104 AIPNLIAFDI-CGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKH 182 (524) Q Consensus 104 a~~nLIa~DI-~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~ 182 (524) +.++.+..++ +-+.||+... + +|... + T Consensus 156 l~~~~~i~~~~~~~~~~~~~~-~------~~p~~--------------------~------------------------- 183 (435) T protein:vir:14 156 LRPKSVVRKLGARTLPLSNGN-I------TIPRL--------------------K------------------------- 183 (435) T ss_pred HhhhchhhhhcceeeecCCCc-e------EEEEE--------------------e------------------------- Confidence 1122222221 1111111000 0 00000 0 Q ss_pred ccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEE Q lcl|NC_014661. 183 EFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEA 262 (524) Q Consensus 183 ~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtA 262 (524) +.+ ..+-+ +| +...++-.-++++++..+ T Consensus 184 -------------------~~~----------------~a~~v--------~E---------~~~~~~~~~~f~~i~~~~ 211 (435) T protein:vir:14 184 -------------------GGA----------------IVGYI--------GA---------DTDIPTTQQQFDDLKLTA 211 (435) T ss_pred -------------------CCc----------------ceeee--------cc---------CccccccccceeEEEeee Confidence 000 00000 11 122344455667777777 Q ss_pred ecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceeccccccccc Q lcl|NC_014661. 263 KSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVR 342 (524) Q Consensus 263 KSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~ 342 (524) +..+-....|-||.+| +....+.|+.|.+-|+..|...+|+-||.. .-+...+.|++....+..+. T Consensus 212 ~k~~~~~~iS~ell~d--s~~~~~l~~~i~~~l~~ai~~~~d~a~l~G------------~G~~~~p~Gi~~~~~~~~~~ 277 (435) T protein:vir:14 212 KKMAALVPIANDLIKY--AGVNPNVDQIVVGDLTAAIGAREDKAFIRD------------DGTANTPKGLRFWALPSNVI 277 (435) T ss_pred EEEEEeehhhHHHHHh--hccCHHHHHHHHHHHHHHHHHHHHHHhhcc------------CCCCccccceeeccccccee Confidence 7777788899999999 321334678888888888888887777632 11112345554322111110 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHh-hccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCc Q lcl|NC_014661. 343 GARWAGESFKALLFQIDKESAEIAR-QTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGR 421 (524) Q Consensus 343 ~~~~a~E~~r~L~~~i~~~a~~I~~-~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~ 421 (524) ..- ...-+..++..+.++-..+.. ...+ ....+|+++.....|..... +.|. -+..+.+ .|+|.| T Consensus 278 ~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd-----~~G~--~l~~~~~----~g~l~G- 343 (435) T protein:vir:14 278 TAS-DASTLQKIETDLGKVILALENADANL-TQPGWIMAPRTFRFLEGLRD-----GNGN--KVYPELA----NGMLKG- 343 (435) T ss_pred ccc-cccchhhHHHHHHHHHHHhhhccccc-cCCEEEEcHHHHHHHHHhhc-----cCCc--eeccCCC----CCeeec- Confidence 000 001112222223333222222 1234 24567999999999975431 1111 1111222 257877 Q ss_pred eEEEeeCCCCcc--------eEEEE--------EecCCCccceeEeecccccccccccCcccc---cceeeeeeeeceee Q lcl|NC_014661. 422 YKVYIDQYARQD--------YFTIG--------YKGDNEMDAGIYYAPYVALTPLRGADPKNF---QPVLGFKTRYGIGI 482 (524) Q Consensus 422 ~~vy~D~y~~~d--------y~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~ 482 (524) ++||++++.|.+ -+++| ..+.-+ +-..||..........-..| |=.+=...|++..+ T Consensus 344 ~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~ 419 (435) T protein:vir:14 344 YPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGP 419 (435) T ss_pred ceeEeeccccccccCCCccceEEEeecccEEEEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCcee Confidence 699998775432 23333 222222 22333322111000000001 12333556666543 Q ss_pred -CCcccccCCccccceeeccccchhh Q lcl|NC_014661. 483 -NPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 483 -nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) +| .. ...-.|.+|.| T Consensus 420 ~~~---------~a-~~~l~~~~~~~ 435 (435) T protein:vir:14 420 RHV---------ES-IAVLAGVAWGA 435 (435) T ss_pred ecc---------cc-eEEEecCCCCC Confidence 22 11 34445577765 No 23 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=95.71 E-value=0.0016 Score=35.94 Aligned_cols=353 Identities=13% Similarity=0.048 Sum_probs=144.6 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhhh-hhh--hhhhhhhhHHHHHh---hhhccccchh----------------- Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQG-KHA--IIAKMFENQEADIK---SDAAYRDEKL----------------- 57 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~~-~~~--~~~~~~enq~~~~~---~~~~~~~~~~----------------- 57 (524) |+.-.+.-++|.+++.-+-+.- .++.+. +.. =+..+.+.+++.+. .+..-.+..+ T Consensus 1 m~~~~k~l~el~~~~~~~~~~~--~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQI--KSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGE 78 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 8887777777877776553320 001000 000 00011111111110 0000000000 Q ss_pred --hhcccccccccc----------cccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcch Q lcl|NC_014661. 58 --AEAFGGFLTEAE----------IGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTG 124 (524) Q Consensus 58 --~~~~~~~l~ea~----------~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTG 124 (524) ....+....+.. .+.......+.+...+.++. .-.-|.+. .++++.-+..+..++|.++||.+++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~ 157 (395) T protein:vir:43 79 EAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGG-ALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSV 157 (395) T ss_pred chhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCc-cccchhhHHHHHHHHHhhhhHHhhccceecCCCce Confidence 000000000000 00000000011111111111 11223222 45555667788889999999887643 Q ss_pred heeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCC Q lcl|NC_014661. 125 QVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTA 204 (524) Q Consensus 125 LIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~ 204 (524) -+ .+..... +.+.| T Consensus 158 ~~----~~~~~~~--------------~~a~~------------------------------------------------ 171 (395) T protein:vir:43 158 EY----VRETGFV--------------NNAAP------------------------------------------------ 171 (395) T ss_pred EE----EEEecCC--------------Cceee------------------------------------------------ Confidence 11 1110000 00000 Q ss_pred CcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcC Q lcl|NC_014661. 205 DAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHG 284 (524) Q Consensus 205 ~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHG 284 (524) +++| ...++-..+++++++..+.-+-...+|-||.||.- T Consensus 172 --------------------v~E~-----------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~---- 210 (395) T protein:vir:43 172 --------------------VSEG-----------------TQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS---- 210 (395) T ss_pred --------------------ecCC-----------------ccccccccceeEEEEeeeeEEEeehhhHHHHHhHH---- Confidence 0010 11223344556666666666667789999999853 Q ss_pred CChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 285 MDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAE 364 (524) Q Consensus 285 LDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~ 364 (524) +.++.|.+-|+..+...+|+.||.. + | +...+.|++......-+.. .... ....++..|..+.+. T Consensus 211 -~l~~~v~~~la~a~~~~~d~~~l~G----~--g------~~~~~~Gi~~~~~~~~~~~-~~~~-~~~~~~~~i~~~~~~ 275 (395) T protein:vir:43 211 -ALQSYIDARARYGLMLVEECQLLYG----N--G------TGANLHGIIPQAQAYAPPS-GVVV-TAEQRIDRIRLAILQ 275 (395) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhc----c--C------CCCcccccccccccccccc-cccc-ccchhHHHHHHHHHh Confidence 3588899999999999999888742 1 0 0112234432211100000 0000 011233344444444 Q ss_pred HHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCC Q lcl|NC_014661. 365 IARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNE 444 (524) Q Consensus 365 I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~ 444 (524) +.. .+.....+|+||.....|.... .+.|.- +..|... .-.++|.| ++|+++++.|.+-+++|--.. T Consensus 276 ~~~--~~~~~~~~vmn~~~~~~l~~lk-----d~~G~~--i~~~~~~-~~~~~l~G-~pVv~~~~~~~~~~~~gd~~~-- 342 (395) T protein:vir:43 276 AQL--AEFPASGIVLNPIDWALIELNK-----DAENRY--IIGSPQN-GTTPTLWR-LPVVETQAITQDEFLTGAFSL-- 342 (395) T ss_pred hcc--ccCCCcEEEEcHHHHHHHHHhh-----ccCCce--ecccccc-CCCceecc-eeeEEcCCCCCCcEEEEeccc-- Confidence 432 3446778999999998886432 111110 1112111 11246776 799999998776666553211 Q ss_pred ccceeEeecccccccccccC-c-ccccc---eeeeeeeeceee-CCcccccCCccccceeeccccchhhhh Q lcl|NC_014661. 445 MDAGIYYAPYVALTPLRGAD-P-KNFQP---VLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANS 509 (524) Q Consensus 445 ~d~g~fyaPYv~~~~~~~~D-p-~s~qP---~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~ 509 (524) ..+.. .-....+...+ . ..|+- .+-+..|++..+ +|= .+.++ + ++.+ T Consensus 343 ---~~~~~-~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---------a~~~~-~----~taa 395 (395) T protein:vir:43 343 ---GAQIF-DRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPE---------AFVTG-S----LTAS 395 (395) T ss_pred ---eEEEE-EecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc---------ceEEE-E----eccC Confidence 00000 00111111111 1 12322 333445777654 231 11111 1 1111 No 24 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=95.49 E-value=0.002 Score=35.41 Aligned_cols=345 Identities=10% Similarity=0.022 Sum_probs=134.1 Q ss_pred CCcccc-----------hHHHHHHhhhhhhccCCCcchhhhhhhhh--hhhhhhHHHHHh-hhhccccc-------hhhh Q lcl|NC_014661. 1 MSTQIK-----------TKAQLVADWKPLLEAEGAPEIAQGKHAII--AKMFENQEADIK-SDAAYRDE-------KLAE 59 (524) Q Consensus 1 ~~~~~~-----------~~~~l~~kw~p~l~~~~~~~~~~~~~~~~--~~~~enq~~~~~-~~~~~~~~-------~~~~ 59 (524) +..++. ..++..+++.-+.. ++...++.+- ...++.-++... .+..-+.. .-.. T Consensus 15 ~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~-----e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (390) T protein:vir:10 15 VTDSLRAFGERAVRDGELNASARSKVDELFA-----TVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQ 89 (390) T ss_pred HHHHHHHHHHHHHhhcccCHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHhhcccccccccchhhhhhhhHHHH Confidence 111110 01223344433221 1111111110 000111000000 00000000 0000 Q ss_pred cccccccccccccccccchhhhc----cccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeec Q lcl|NC_014661. 60 AFGGFLTEAEIGGDHGYDPQNIA----AGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYG 134 (524) Q Consensus 60 ~~~~~l~ea~~~~~~g~~~~~i~----est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~ 134 (524) .+-....+.... ...+-+... .++++.+-.-.-|.++ .++.++-++..-.++|.+.||++++.-+. +.. T Consensus 90 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~----~~~ 163 (390) T protein:vir:10 90 ASAGRWNDRSAR--ATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV----QET 163 (390) T ss_pred HHHHhhhhhhhh--hhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEe Confidence 000000000000 000000000 0011111111223333 44555555666778899999987653221 110 Q ss_pred CccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccc Q lcl|NC_014661. 135 KDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVI 214 (524) Q Consensus 135 ~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~ 214 (524) ..+ +.+.|- T Consensus 164 ~~~--------------~~a~~v--------------------------------------------------------- 172 (390) T protein:vir:10 164 GFV--------------NNAAIV--------------------------------------------------------- 172 (390) T ss_pred cCC--------------cceeee--------------------------------------------------------- Confidence 000 000000 Q ss_pred ccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_014661. 215 KQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANI 294 (524) Q Consensus 215 ~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNI 294 (524) +| +...++-..+++++++.+|..+....+|-||.||-- |.++.|.+- T Consensus 173 -------------------~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~ 219 (390) T protein:vir:10 173 -------------------AE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNR 219 (390) T ss_pred -------------------cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-----HHHHHHHHH Confidence 01 011233345667777777777888999999999852 468899999 Q ss_pred HHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCc Q lcl|NC_014661. 295 LATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAG 374 (524) Q Consensus 295 LStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~g 374 (524) |+..|...||+.||..= | +...+.|++........... .+ ...++..+..+-..+. ..+... T Consensus 220 l~~~~~~~~~~~il~G~------G------~~~~p~Gi~~~~~~~~~~~~-~~---~~~~~~~~~~~~~~l~--~~~~~~ 281 (390) T protein:vir:10 220 LIRGLKVKEDAEILRGT------G------ANDGLLGLIPQATTYAAPTT-IA---GATRVDQLRLAMLQAS--LAEYPA 281 (390) T ss_pred HHHHHHHHHHHHHhhcC------C------CCcccccccccccccccccc-cc---ccchHHHHHHHHHhhc--cccCCC Confidence 99999999998888431 0 11134455433211110000 00 0111222222222332 223367 Q ss_pred cEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecc Q lcl|NC_014661. 375 NFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPY 454 (524) Q Consensus 375 n~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPY 454 (524) +.+|++|.....|..... +.|.-- +..+.... .++|.| ++|++++..|..-+++|--- .+++.+.. T Consensus 282 ~~~v~n~~~~~~L~~lkd-----~~g~~l-~~~~~~~~--~~~l~G-~pv~~~~~~p~~~~~~gdf~-----~~~~~~~~ 347 (390) T protein:vir:10 282 SGIVINPIDWAAIELAKD-----ANNQYL-IGNARGTL--TPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQ 347 (390) T ss_pred CEEEEcHHHHHHHHHhhc-----CCCcee-ecCCcCcC--Cceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEe Confidence 789999999998874321 111100 01111111 245766 69999999887766665210 11222211 Q ss_pred cccccccccC----cccccceeeeeeeeceee-CCcccccCCccccceeeccccchhh Q lcl|NC_014661. 455 VALTPLRGAD----PKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 455 v~~~~~~~~D----p~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) ....+...+ -.+-+=.+-...||+..+ +| ..+.++.= | T Consensus 348 -~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~---------~a~~~~~~-----a 390 (390) T protein:vir:10 348 -WDARVEIGYVNDDFQRNMVTVLAEERLALVVYRP---------EALISGSF-----A 390 (390) T ss_pred -cceEEEEeecccccccCcEEEEEEEeeccEEecc---------ccEEEEEe-----C Confidence 111111111 112222333445666532 34 11112111 1 No 25 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=95.42 E-value=0.0021 Score=35.27 Aligned_cols=332 Identities=13% Similarity=0.107 Sum_probs=134.7 Q ss_pred cchHHHHHHhhhhhhccCC-C-cchhh----------hhhhhhhhhh------hhHHHHHhhhhccccchh--------- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEG-A-PEIAQ----------GKHAIIAKMF------ENQEADIKSDAAYRDEKL--------- 57 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~-~-~~~~~----------~~~~~~~~~~------enq~~~~~~~~~~~~~~~--------- 57 (524) |++.++|.+.|.-+.+.-. + -++.. --+++.+.+- |-+++.+.+.+....... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 8888999888887765310 0 00000 0011111111 111111111110000000 Q ss_pred ----------hhcccccccccccccccccchhhhccccc-cccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcch Q lcl|NC_014661. 58 ----------AEAFGGFLTEAEIGGDHGYDPQNIAAGQT-SGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTG 124 (524) Q Consensus 58 ----------~~~~~~~l~ea~~~~~~g~~~~~i~est~-tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTG 124 (524) ..++..++.. +..... .....+++ .|.+ .. |.-+ .+++.+-++..-.+++.|+||++.+| T Consensus 81 ~~~~~~~~~~~~~~~~~l~~----~~~~~~-~~~~~~t~~~gg~-~i-P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNLVRG----RYQNLL-DSKTDGSGSDAGL-TI-PQDIRTAINTLVRQFDSLQEYVNVENVTTLTG 153 (397) T ss_pred chhhHHHHHHHHHHHHHhhc----chhhHH-HhhhccCCccCcc-ee-cHHHHHHHHHHHHhhhhHhhhcceeeccCCcc Confidence 0011111110 000000 00111111 1111 11 3222 35555667777889999999998876 Q ss_pred heeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCC Q lcl|NC_014661. 125 QVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTA 204 (524) Q Consensus 125 LIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~ 204 (524) -+- |...... .+.+.|-+ T Consensus 154 ~~~-----~~~~~~~-----------~~~a~~v~---------------------------------------------- 171 (397) T protein:vir:49 154 SRV-----YEKWADI-----------TGLAKLDD---------------------------------------------- 171 (397) T ss_pred eEE-----EEeeccC-----------Ccceeeec---------------------------------------------- Confidence 431 1111000 00000000 Q ss_pred CcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHHHHHHhhc Q lcl|NC_014661. 205 DAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELAQDLRAVH 283 (524) Q Consensus 205 ~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAvH 283 (524) +| ..+++-. -+++.++..++.-+-...+|-||.+|-. T Consensus 172 ----------------------E~-----------------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--- 209 (397) T protein:vir:49 172 ----------------------EG-----------------GQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA--- 209 (397) T ss_pred ----------------------cc-----------------cccccccccceeeeEeeeeeeEeehhhHHHHHhhhh--- Confidence 00 0111111 1344455555555556789999999853 Q ss_pred CCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 284 GMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESA 363 (524) Q Consensus 284 GLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~ 363 (524) +|.+++|.+-|+..|..-+|+.||...-+ .+ ...+.+++ +-...|+..+. T Consensus 210 -~~l~~~i~~~l~~~~~~~~d~ail~G~g~----------~~--~~~~~~~~-------------d~i~~~~~~l~---- 259 (397) T protein:vir:49 210 -ENILAWLSGWIAKKVVVTRNKAILEAIGT----------LP--NKPTLAKW-------------DDIIDLQAKVD---- 259 (397) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhcccc----------cc--ccccccCH-------------HHHHHHHHhhh---- Confidence 56799999999999999999998854221 11 12233322 11233333332 Q ss_pred HHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeC--CCCc-----ceEE Q lcl|NC_014661. 364 EIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQ--YARQ-----DYFT 436 (524) Q Consensus 364 ~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~~-----dy~~ 436 (524) +.+.....+|++|.....|..... +.|.- -+..+.+.. ..++|.| ++|++.. ..+. .-++ T Consensus 260 -----~~~~~~a~~v~n~~~~~~l~~lkd-----~~g~~-l~~~~~~~g-~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~ 326 (397) T protein:vir:49 260 -----PAIKQTSLFLTNTSGFTALKKVKN-----AMGDY-LMERDVKSP-TGYSIDG-FVVKEISDRFLPNGTGGAMPLY 326 (397) T ss_pred -----hhhcCCCEEEEcHHHHHHHHHhhc-----cCCce-eecccccCC-CCceecc-eeeEEecccccccccCCceeEE Confidence 223356789999999999976421 11100 011111111 1246877 4776522 2111 1122 Q ss_pred EE---------EecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-CC--c-----ccccCCccccceee Q lcl|NC_014661. 437 IG---------YKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-NP--L-----ADTAAQQPAGNARI 499 (524) Q Consensus 437 vG---------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP--~-----~~~~~~~~~~~~~~ 499 (524) +| ..+.-+ +-..||.. .+-...+-.+-...|++..+ +| | +...++++ T Consensus 327 ~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~------ 390 (397) T protein:vir:49 327 FGDLKQAVTLFDRQHLS----LLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKA------ 390 (397) T ss_pred EeeccceEEEEeecccE----EEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEEecccccccC------ Confidence 22 221111 11222211 11123334444556665543 33 1 11111111 Q ss_pred ccccchhhhhccc Q lcl|NC_014661. 500 ANGMPSIANSVGK 512 (524) Q Consensus 500 ~~g~~~~a~~~~~ 512 (524) ..-.++. T Consensus 391 ------~~~~~~~ 397 (397) T protein:vir:49 391 ------KLSTAGA 397 (397) T ss_pred ------cccccCC Confidence 1111122 No 26 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=95.24 E-value=0.0025 Score=34.88 Aligned_cols=358 Identities=11% Similarity=0.003 Sum_probs=126.4 Q ss_pred CCcccchHHHHHHhhhhhhc-cC-------CCcchhhhhhhh------hhhhhhhHHHH------Hhhhhccc-cchhhh Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLE-AE-------GAPEIAQGKHAI------IAKMFENQEAD------IKSDAAYR-DEKLAE 59 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~-~~-------~~~~~~~~~~~~------~~~~~enq~~~------~~~~~~~~-~~~~~~ 59 (524) |.++-.. .+|..... .+ .+..+....+.. ...-++-.++. ..+..... .....+ T Consensus 1 ~~ke~~~-----~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (413) T protein:vir:81 1 MVKEAGD-----APTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYK 75 (413) T ss_pred ChhhHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhh Confidence 3332221 11111000 00 000000000000 00000000000 00000000 000000 Q ss_pred cccccccc---------------------cccccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceee Q lcl|NC_014661. 60 AFGGFLTE---------------------AEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGV 116 (524) Q Consensus 60 ~~~~~l~e---------------------a~~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GV 116 (524) +++....+ .............-.. +++.+....=|..+ .+++.+-+..+..+++.| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~ 154 (413) T protein:vir:81 76 SIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTA-TLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDN 154 (413) T ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhc-ccccccccccchhhHHHHHHHHhhhhhHHhhcce Confidence 11110000 0000000000000011 11111111113222 345555567788899999 Q ss_pred ecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 117 QPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATG 196 (524) Q Consensus 117 QPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g 196 (524) +||++++.-+. .+.-... ...++. T Consensus 155 ~~~~~~~~~~~-~~~~~~~--------------~~~~a~----------------------------------------- 178 (413) T protein:vir:81 155 LTMTNTTIKYL-MEKANRV--------------VEGGFK----------------------------------------- 178 (413) T ss_pred eeccCCceeEE-Eeccccc--------------cccccc----------------------------------------- Confidence 99999865321 1000000 000000 Q ss_pred ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 197 AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 197 ~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 276 (524) .+++|-.. .| +....|.+..|.+.| .+-...+|-||. T Consensus 179 ---------------------------~v~Eg~~~--~~-------~~~~~f~~i~~~~~k-------~~~~~~iS~ell 215 (413) T protein:vir:81 179 ---------------------------TVAEGGKK--PY-------MRFADFDIVTESLSK-------IAGLTKITDEMI 215 (413) T ss_pred ---------------------------eecCcccc--cc-------cCcccceeeEeeeee-------EEEeehhhHHHH Confidence 00111000 00 001234444444444 444567899999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) +|--+ .++.|.+-|+..|..-+|+.||..- -+...+.|+++......+.. .....++. T Consensus 216 ~ds~~-----l~~~i~~~la~~~~~~~d~~~l~G~------------G~~~~~~Gi~~~~~~~~~~~-----~~~~~~~~ 273 (413) T protein:vir:81 216 EDYDF-----LVSYINARLLEELAIEEERQLLLGD------------GTGNNLTGLLKRDGIQTLAV-----SNKDELAD 273 (413) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHHHHHHhccC------------CCCCcccccccccccccccc-----cccchhHH Confidence 98632 4788888888888888888776421 11112345544322211111 01112233 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCc----cccccccccccccccccCcceEEEEecCceEEEeeCCCCc Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDT----SVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ 432 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~----~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 432 (524) -|.+.-..+.....+ ..+.+|++|.....|..... +.+.+..... +.+. .....++|.| ++|+++...+. T Consensus 274 ~i~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~---~~~~-~~~~~~~l~G-~pv~~s~~~~~ 347 (413) T protein:vir:81 274 SIYKAMTNISLATPF-QADALVINPLDYQELRLAKDANGQYYGGGVFQGQ---YGSG-GIMLDPAPWG-LRTVQSQVVPV 347 (413) T ss_pred HHHHHHHHhhhhccC-CCcEEEEcHHHHHHHHHhhccCCceecccccccc---cccc-ccccCceecc-eeeEEcCCCCc Confidence 333333334333444 45668899999888864321 1111110000 0000 0011245676 69999998776 Q ss_pred ceEEEEEecCCCccceeEeecccc-cccccc---c--Ccccccceeeeeeeeceee-CCcccccCCccccceeeccccch Q lcl|NC_014661. 433 DYFTIGYKGDNEMDAGIYYAPYVA-LTPLRG---A--DPKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPS 505 (524) Q Consensus 433 dy~~vG~KG~~~~d~g~fyaPYv~-~~~~~~---~--Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~ 505 (524) .-+++|---. +|--+.. ...+.. . +-.+-|=.+-+..||++.+ +|=+ +.++.- -+. T Consensus 348 ~~~~~gd~~~-------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a---------~~~l~~-~~~ 410 (413) T protein:vir:81 348 GKPVVGAFRS-------AASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEA---------IVQLDV-AEV 410 (413) T ss_pred ccEEEEeccc-------EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc---------eEEEEe-cCC Confidence 6666553211 0100110 111111 1 1123344555566776543 3310 111100 000 Q ss_pred hhhhccccc Q lcl|NC_014661. 506 IANSVGKNG 514 (524) Q Consensus 506 ~a~~~~~~~ 514 (524) .+ + T Consensus 411 ~~------p 413 (413) T protein:vir:81 411 VT------P 413 (413) T ss_pred CC------C Confidence 00 0 No 27 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=95.01 E-value=0.003 Score=34.45 Aligned_cols=334 Identities=13% Similarity=0.122 Sum_probs=125.8 Q ss_pred cchHHHHHHhhhhhhcc----C------CC-cch-hhhhhhhhhhhhhhHHH--HH-------hhhhccccc----h--- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA----E------GA-PEI-AQGKHAIIAKMFENQEA--DI-------KSDAAYRDE----K--- 56 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~----~------~~-~~~-~~~~~~~~~~~~enq~~--~~-------~~~~~~~~~----~--- 56 (524) |++.++|.+.|.-+=+. + .. .+. .+-.+++-+.+-+.+++ .+ +........ . T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 77777776666544110 0 00 000 00001111111111111 11 000000000 0 Q ss_pred ---------hhhcccccccccccccccccchhhhcccccc-ccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchh Q lcl|NC_014661. 57 ---------LAEAFGGFLTEAEIGGDHGYDPQNIAAGQTS-GAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQ 125 (524) Q Consensus 57 ---------~~~~~~~~l~ea~~~~~~g~~~~~i~est~t-g~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGL 125 (524) ...++..++.... .--. .....++++ |.+. .-+.+. .+++.+-++..-.+++.++||++++|- T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 154 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGRY----QNLL-DSKTDASGSDAGLT-IPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGS 154 (397) T ss_pred chhhHHHHHHHHHHHHHHhhhh----hHHH-HHhhccCCcccccc-ccHHHHHHHHHHHHHHHHHHhhhceeeccCCcce Confidence 0001111111110 0000 001111111 1110 111221 344444556777889999999998885 Q ss_pred eeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCC Q lcl|NC_014661. 126 VFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTAD 205 (524) Q Consensus 126 IFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~ 205 (524) +--++ ..... +.+.|- T Consensus 155 ~~~~~--~~~~~--------------~~a~~v------------------------------------------------ 170 (397) T protein:vir:48 155 RVYEK--WADIT--------------GLAKLD------------------------------------------------ 170 (397) T ss_pred EEEEe--ecCCC--------------cceeee------------------------------------------------ Confidence 43221 10000 000000 Q ss_pred cccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCC Q lcl|NC_014661. 206 AAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGM 285 (524) Q Consensus 206 ~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGL 285 (524) ++|- ...| +....|.++.|++.|. +-...+|-||.+|-. . T Consensus 171 --------------------~E~~--~~~~-------~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds~----~ 210 (397) T protein:vir:48 171 --------------------DEAG--SIGT-------NDDPKLYPIRYAIKRY-------AGISTVTNSLLADSA----E 210 (397) T ss_pred --------------------cccc--cccc-------ccccceeeEEeeheee-------eeehhhHHHHHhhch----H Confidence 0000 0000 1112344444554444 445689999999843 5 Q ss_pred ChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 286 DADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEI 365 (524) Q Consensus 286 DAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I 365 (524) |.+++|.+-|+..|..-+|+.||...-+. ....+..++ +-...++..+ T Consensus 211 ~l~~~v~~~l~~~~~~~~d~~il~G~g~~------------~~~~~~~~~-------------d~i~~~~~~l------- 258 (397) T protein:vir:48 211 NILAWLSGWIAKKVVVTRNKAILEAIATL------------PTKPTLTKW-------------DDIIDLQAKV------- 258 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccc------------ccccccccH-------------HHHHHHHHHh------- Confidence 77999999999999999999988643211 111222221 1123333333 Q ss_pred HhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeC--CC-----Cc------ Q lcl|NC_014661. 366 ARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQ--YA-----RQ------ 432 (524) Q Consensus 366 ~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~-----~~------ 432 (524) ... +..+..+||+|.....|.....- .| ..-+..|.+.. --++|.| ++|++-. .. +. T Consensus 259 ~~~--~~~~a~~v~n~~~~~~L~~lkd~-----~G-~~i~~~~~~~~-~~~~l~G-~PV~~~~~~~~~~~~~~~~~~~~g 328 (397) T protein:vir:48 259 DPA--IKQTSFFLTNTSGFTALKKVKNA-----FG-DYLMERDVKSP-TGYSIDG-FAVKEVADRWLANASSGAMPLYFG 328 (397) T ss_pred hhh--hcCCCEEEECHHHHHHHHHhhcC-----CC-ceeeccCcCCC-CCceecc-ceeEEecccccCCcCCCceEEEEE Confidence 222 22467889999999999754211 01 00011122111 1246777 5776522 11 11 Q ss_pred ---ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-eCC--c-----ccccCCccccceeecc Q lcl|NC_014661. 433 ---DYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INP--L-----ADTAAQQPAGNARIAN 501 (524) Q Consensus 433 ---dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~-----~~~~~~~~~~~~~~~~ 501 (524) +|++++..+.-+.. ..++. ..+-.+.+-.+-...||+.. .|| | +......+ . T Consensus 329 d~~~~~~~~~~~~~~i~----~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~--~----- 391 (397) T protein:vir:48 329 DLKQAVTLFDRQQMSLL----STNIG------GGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKG--N----- 391 (397) T ss_pred eccceEEEEeecceEEE----Eeccc------hhhhhcCceeEEEEeeeccEEecccceEEEEecccccCCC--C----- Confidence 12222222211111 00100 00112222334444444432 233 1 11111111 0 Q ss_pred ccchhhhhc Q lcl|NC_014661. 502 GMPSIANSV 510 (524) Q Consensus 502 g~~~~a~~~ 510 (524) .+..+ . T Consensus 392 -~~~~~--~ 397 (397) T protein:vir:48 392 -LGSTA--V 397 (397) T ss_pred -ccccC--C Confidence 01111 0 No 28 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=94.76 E-value=0.0036 Score=34.00 Aligned_cols=311 Identities=14% Similarity=0.045 Sum_probs=131.5 Q ss_pred hhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhc Q lcl|NC_014661. 35 IAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDI 113 (524) Q Consensus 35 ~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI 113 (524) +|.| +| .-..+.|... .|+ ..++.++ -.-+.+. .+++.+.+..+-..+ T Consensus 1 ~~~~--------~e-------~~~~~~~~~~----~~~---------~~~~~~~---liP~~~~~~ii~~~~~~s~l~~l 49 (338) T protein:vir:78 1 MATL--------NE-------LAPNTAGSNH----QGR---------LAHVPSD---LLPKEIVGPIFDKAQESSLVLRL 49 (338) T ss_pred Ccch--------HH-------hhhhhccccc----ccc---------eeccccc---ccchHHHHHHHHHHHhhchhhhh Confidence 1111 11 1111111111 000 0111111 1112222 456666677888999 Q ss_pred eeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 114 CGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQ 193 (524) Q Consensus 114 ~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~ 193 (524) |.+.||+++..-|.- +.. .+.+.+-|... T Consensus 50 ~~~~~~~~~~~~ip~----~~~---------------~~~a~~v~~~~-------------------------------- 78 (338) T protein:vir:78 50 GENIPISYGETIIPT----TVK---------------RPEVGQVGVGT-------------------------------- 78 (338) T ss_pred cceeeccCCceEEEE----Eec---------------Cccceeecccc-------------------------------- Confidence 999999986543321 111 11111111000 Q ss_pred cccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhH Q lcl|NC_014661. 194 ATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSI 273 (524) Q Consensus 194 ~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ 273 (524) ..-.+| +...++-.-+++.++...+..+-...+|- T Consensus 79 ------------------------------------~~~~~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ 113 (338) T protein:vir:78 79 ------------------------------------SNEQRE---------GGTKPLSGTAWDTRSVAPIKLATIVTVSE 113 (338) T ss_pred ------------------------------------cccccc---------cccccccccceeEEEEEEEEEEEeehhhH Confidence 000011 11223333444555666666666677899 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHH Q lcl|NC_014661. 274 ELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKA 353 (524) Q Consensus 274 ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~ 353 (524) ||.+|- ..|.|++|.+-|+..|...||..||..--...--+ ..+..+.....+.... +. -++ .... T Consensus 114 ell~ds----~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~-~~gi~~~~~~~~~~~~----~~---~~~--~~~~ 179 (338) T protein:vir:78 114 EFARMN----PSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSA-LQGIDTNNVIVNTTNV----DY---LQT--GTTP 179 (338) T ss_pred HHHhcC----HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc-cccccccccccccccc----cc---ccc--cchh Confidence 999983 36779999999999999999998886432110000 0000000000111110 10 001 0123 Q ss_pred HHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc Q lcl|NC_014661. 354 LLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD 433 (524) Q Consensus 354 L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 433 (524) ++..+..+...|...=.+ ..+.++++|+....|..+..++-. .|. .-+..+.... -.++|.| ++||++.+-|.+ T Consensus 180 ~~~~~~~~~~~~~~~~~~-~~~~~~m~~~~~~~L~~~~~l~d~--~g~-~l~~~~~~~~-~~~~l~G-~PV~~~~~ip~~ 253 (338) T protein:vir:78 180 LLDRFLDGYDLVSANTDV-DFNGWAADPRYRARLLRSQAYRDA--NGN-VDPTRINLAA-SAGDLLG-LPVQFGKAVGGD 253 (338) T ss_pred hHHHHHHHHHHhhhhccc-cceEEEEchHHHHHHHHHhhhccC--CCc-eeecccccCC-CCceeee-eeEEEccccCcc Confidence 344445554444433333 577899999998888543322111 000 0011111111 1257787 599998765421 Q ss_pred ---------eEEEE--------EecCCCccceeEeecccccccccccCccc-----cc-ceee--eeeeec-eeeCCccc Q lcl|NC_014661. 434 ---------YFTIG--------YKGDNEMDAGIYYAPYVALTPLRGADPKN-----FQ-PVLG--FKTRYG-IGINPLAD 487 (524) Q Consensus 434 ---------y~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s-----~q-P~~~--~~tRY~-l~~nP~~~ 487 (524) -+++| ..+.-.+ =..+| .......||.. || --++ ...|++ ...|| T Consensus 254 ~~~~~~~~~~~~~gdfs~~~~~~~~~~~i----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~--- 324 (338) T protein:vir:78 254 LGAATDSKVRVVGGDFSQLKYGFADEIRV----KMSDT--ATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDK--- 324 (338) T ss_pred ccccCCcccEEEEEecceEEEEeecccEE----EEeec--ccccccccccccchhhhhcCcEEEEEEEEeccEeecc--- Confidence 23333 1111110 00011 11111223322 11 1123 356777 34555 Q ss_pred ccCCccccceeeccccchhh Q lcl|NC_014661. 488 TAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 488 ~~~~~~~~~~~~~~g~~~~a 507 (524) ..+.++.++....| T Consensus 325 ------~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 325 ------QAFVKFVDDEDPDA 338 (338) T ss_pred ------cceEEEecccCCCC Confidence 12345555433333 No 29 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=94.56 E-value=0.0041 Score=33.67 Aligned_cols=309 Identities=12% Similarity=0.050 Sum_probs=109.0 Q ss_pred ccccccccccccccccccccccccccccc--ccccccccccccccccccccccccccCCCCCcccccccccccccccccc Q lcl|NC_014661. 146 FHPMYAPDAMFSGRGSHEVFAPLASGTVV--AQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILV 223 (524) Q Consensus 146 f~~fnEadt~FSG~~~~~~~~~~~~g~~~--a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~ 223 (524) ++-++|--+.-.|...........++... -...+..... ... ...+.....+..+ . ............+. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~-~~s-~l~~~~~~~~~~~--~----~~~~p~~~~~~~a~ 72 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQ-ESS-LVLRMGEQIPISY--G----ETIIPTTVKRPEVG 72 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHH-hhc-hhhhhcceeeccC--C----ceEEEEEeCCceeE Confidence 11123321111111111000000000000 0000000000 000 0000000000000 0 00000000111111 Q ss_pred cccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHh Q lcl|NC_014661. 224 EIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEI 303 (524) Q Consensus 224 ~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEI 303 (524) -+++|-....+| +...++-..+++++++..|--+--...|-||.+|-. .|.|++|.+.|...|...| T Consensus 73 ~v~eg~~~~~~e---------~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~----~~~~~~i~~~la~ai~~~~ 139 (333) T protein:vir:78 73 QVGVGTSNEQRE---------GGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNP----SGLYTKLQGDLAYAIGRGI 139 (333) T ss_pred eecCcccccccc---------cccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHH Confidence 223333232333 123344555566666666555556678889988754 4679999999999999999 Q ss_pred hHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHH Q lcl|NC_014661. 304 NREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNV 383 (524) Q Consensus 304 NREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~v 383 (524) +..+|..--..... ...|++.-...................+..|..+-..+...-.+ ..+.+|++|.. T Consensus 140 d~~~l~G~g~~~~~----------~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~vmn~~~ 208 (333) T protein:vir:78 140 DLAVFHGKSPLTGS----------ALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDV-EFNGWAVDPRF 208 (333) T ss_pred HHHHhcccCCCCCc----------ccccccccccccccccccccccccchhHHHHHHHHHhhcccccc-CceEEEEcchH Confidence 99998532211110 11111110000000000000011111222233333333333344 57788889988 Q ss_pred HHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc---------eEEEE--------EecCCCcc Q lcl|NC_014661. 384 VNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD---------YFTIG--------YKGDNEMD 446 (524) Q Consensus 384 a~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG--------~KG~~~~d 446 (524) ...|..+..++- ++|. .-+..+... .-.|+|.| ++|+++.+.+.+ .+++| ..+..+. T Consensus 209 ~~~L~~~~~~~d--~~G~-~i~~~~~~~-~~~~~l~G-~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i- 282 (333) T protein:vir:78 209 RAHLLRAQAYRD--ANGN-VDPSRINLA-AQTGDVLG-LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRI- 282 (333) T ss_pred HHHHHHHhhhcC--CCCc-eeecCcccc-CCCceeec-eeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEE- Confidence 877754332211 0000 001111111 01257787 699998876544 23333 2222111 Q ss_pred ceeEeecccccccccccCccccc-ceee--eeeeecee-eCCcccccCCccccceeeccc-cc Q lcl|NC_014661. 447 AGIYYAPYVALTPLRGADPKNFQ-PVLG--FKTRYGIG-INPLADTAAQQPAGNARIANG-MP 504 (524) Q Consensus 447 ~g~fyaPYv~~~~~~~~Dp~s~q-P~~~--~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g-~~ 504 (524) -..+|.-.......--.-|| -.++ ...|++.. .+|= .+.++.+. -| T Consensus 283 ---~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~---------a~~~l~~~~a~ 333 (333) T protein:vir:78 283 ---KMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQ---------AFVKFVDDEQP 333 (333) T ss_pred ---EEeccccccccccceeehhhcCcEEEEEEEEEccEEeccc---------ceEEEeccCCC Confidence 11122110000000000111 1122 24577744 5661 11222221 11 No 30 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=94.10 E-value=0.0032 Score=34.23 Aligned_cols=339 Identities=14% Similarity=0.148 Sum_probs=127.2 Q ss_pred CCcccchHHHHHHhhhhhhc--------------cCCC--cchhhhhhhh---hhh--hhhhHHHHHhhhhccccch--- Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLE--------------AEGA--PEIAQGKHAI---IAK--MFENQEADIKSDAAYRDEK--- 56 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~--------------~~~~--~~~~~~~~~~---~~~--~~enq~~~~~~~~~~~~~~--- 56 (524) |-..|. .++|.++|.-+.+ .+.. -++...+..+ .++ -++.|.++..+........ T Consensus 1 m~~~m~-l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) T protein:vir:10 1 MGVKLT-VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) T ss_pred CCcccc-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 777774 5668877754433 1111 1121111111 111 1222222221110000000 Q ss_pred -------------hhhccccccccccccccccc----chhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceee Q lcl|NC_014661. 57 -------------LAEAFGGFLTEAEIGGDHGY----DPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGV 116 (524) Q Consensus 57 -------------~~~~~~~~l~ea~~~~~~g~----~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GV 116 (524) ...+|..++. ..++. +...+..++.+ |... . |.-+ .+++.+.......+++.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~t~~~gg~~-v-P~~~~~~Ii~~~~~~~~l~~~~~~ 152 (408) T protein:vir:10 80 GPLNKSENELKDKFVKDFVNMVR-----NPMAFMNTVSSKTETSGSDSAAGLT-I-PQDIRTMINTLVRQYDSLQQYVRV 152 (408) T ss_pred cccccchhhhHHHHHHHHHHHhh-----cchhhhhhhhhhhhhcccccCCcee-c-cHhHHHHHHHHHHhhchhhhhcce Confidence 0001111110 00110 01111112211 1110 1 3222 355666667778899999 Q ss_pred ecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 117 QPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATG 196 (524) Q Consensus 117 QPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g 196 (524) .||+++.|-+--.+ .... .+.+.|-+ T Consensus 153 ~~~~~~~~~~~~~~-----~~~~-----------~~~a~~v~-------------------------------------- 178 (408) T protein:vir:10 153 ESVSTSNGSRVYEK-----WTDV-----------TPLTVMDA-------------------------------------- 178 (408) T ss_pred eeccCCcceEEEee-----cccc-----------ccceeeec-------------------------------------- Confidence 99999888653221 1000 00000000 Q ss_pred ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 197 AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 197 ~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 276 (524) +|- ...| .+...|.++.|++.|.. -...+|-||. T Consensus 179 ------------------------------E~~--~~~~-------~~~~~~~~i~~~~~k~~-------~~~~iS~ell 212 (408) T protein:vir:10 179 ------------------------------EDG--KIPD-------LDNPQLTIIKYLIKRYA-------GIITATNTSL 212 (408) T ss_pred ------------------------------Ccc--cccc-------ccCcceeeEEeeeeeEE-------eeehhHHHHH Confidence 000 0000 01123555555555554 4456999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHH-HH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKA-LL 355 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~-L~ 355 (524) +|- .+|.+++|.+-|+..|..-+|+.||...-+.. ...|..+++ .... |+ T Consensus 213 ~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~------------~~~~~~~~~-------------~l~~~~~ 263 (408) T protein:vir:10 213 KDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------KKPTIAKFD-------------DVITMIN 263 (408) T ss_pred hhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------cccccccHH-------------HHHHHHH Confidence 994 45678899999999999999888875433211 112222211 1112 11 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCC--CC-- Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQY--AR-- 431 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~-- 431 (524) ..+. ..+-..-.+||++.....|.....- .|. .-+..+.+.. .-++|.| ++|++-.+ .+ T Consensus 264 ~~~~---------~~~~~~a~~v~n~~~~~~l~~lkd~-----~G~-~i~~~~~~~~-~~~~l~G-~PV~~~~~~~~~~~ 326 (408) T protein:vir:10 264 TAVD---------PAIIATSSLLTNQSGLNKLALVKTA-----EGK-YLLEPDPTKP-NSYLIKG-KQVIVVADRWLPNT 326 (408) T ss_pred Hhhh---------hhhccCCEEEEcHHHHHHHHHhhcc-----CCc-eEeccCcCCC-CCceecc-eeeEEecccccCcc Confidence 1111 1221223578999999999764311 110 0011111111 1136777 57776332 11 Q ss_pred ------------cceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-eCC-------cccccCC Q lcl|NC_014661. 432 ------------QDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INP-------LADTAAQ 491 (524) Q Consensus 432 ------------~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP-------~~~~~~~ 491 (524) .++++++.++.... =+.++.- .+-.+.+=.+-+..||++. .+| |+..... T Consensus 327 ~~~~~~i~~gd~~~~~~~~~~~~~~v----~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:10 327 GSTVYPLYYGDMSQAITLFDRENMSL----LPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) T ss_pred CCCceEEEEEehhccEEEEEecceEE----EEccccc------chhhcCceEEEEEEeeccEEeccccEEEEEeeccccC Confidence 11233332221111 1111100 0001112222233333332 111 0000000 Q ss_pred ccccceeecccc Q lcl|NC_014661. 492 QPAGNARIANGM 503 (524) Q Consensus 492 ~~~~~~~~~~g~ 503 (524) .+.......... T Consensus 397 ~~~~~~~~~~~~ 408 (408) T protein:vir:10 397 VGNFKTTTSTAV 408 (408) T ss_pred CCCCCCCCcccC Confidence 000000111111 No 31 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=93.51 E-value=0.0073 Score=32.30 Aligned_cols=356 Identities=9% Similarity=0.014 Sum_probs=133.4 Q ss_pred CCccc----chHHHHHHhhhhhhcc-CCC-cchhhhhhhhhhh--hhhhHHHHHhhhhc-------cccc---------- Q lcl|NC_014661. 1 MSTQI----KTKAQLVADWKPLLEA-EGA-PEIAQGKHAIIAK--MFENQEADIKSDAA-------YRDE---------- 55 (524) Q Consensus 1 ~~~~~----~~~~~l~~kw~p~l~~-~~~-~~~~~~~~~~~~~--~~enq~~~~~~~~~-------~~~~---------- 55 (524) |..++ ..-+++.++..--++. +.+ .++........+. -++.+..++.+... ...+ T Consensus 27 ~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (418) T protein:vir:10 27 VTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTES 106 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhH Confidence 11111 1111233333221110 000 1111000000000 01111111110000 0000 Q ss_pred hhhhcccccccccc---cccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeee Q lcl|NC_014661. 56 KLAEAFGGFLTEAE---IGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRA 131 (524) Q Consensus 56 ~~~~~~~~~l~ea~---~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRS 131 (524) .-..++..++.+.. .....-.+......+++++.-...-|.+. .+++.+.+..+-.+++.+-||++++.- T Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~------ 180 (418) T protein:vir:10 107 EEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIE------ 180 (418) T ss_pred HHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCcee------ Confidence 00000000000000 00000000000111111111111222222 455566677888899999999876531 Q ss_pred eecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccc Q lcl|NC_014661. 132 VYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDA 211 (524) Q Consensus 132 rY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~ 211 (524) |.-.... .+.+.|- T Consensus 181 -~~~~~~~-----------~~~a~~v------------------------------------------------------ 194 (418) T protein:vir:10 181 -YTVETGF-----------TNNAAAV------------------------------------------------------ 194 (418) T ss_pred -EEEEecC-----------CCceeee------------------------------------------------------ Confidence 1110000 0000000 Q ss_pred cccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHH Q lcl|NC_014661. 212 EVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAEL 291 (524) Q Consensus 212 ~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaEL 291 (524) + | +...++-..++++++..+|.-+-...+|-||.||.- |.++.| T Consensus 195 --------------~--------E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~l~~~i 238 (418) T protein:vir:10 195 --------------A--------E---------GAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP-----ALQSYI 238 (418) T ss_pred --------------c--------c---------CccccccccceeeEEEeeeeEEEeehhhHHHHHhHH-----HHHHHH Confidence 0 1 011222234566777777777777889999999852 457888 Q ss_pred HHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_014661. 292 ANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGR 371 (524) Q Consensus 292 aNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~r 371 (524) .+-|+..|..-+|+-||..- -+...+.|++............ . ....+..|..+-..+. ..+ T Consensus 239 ~~~l~~a~~~~~d~a~l~G~------------g~~~~p~Gi~~~~~~~~~~~~~--~--~~~~~~~i~~~~~~~~--~~~ 300 (418) T protein:vir:10 239 DGRARYGLQLTEEGQILKGD------------GTGANILGILPQASAFMPSITL--A--NATPIDKIRLALLQAV--LAE 300 (418) T ss_pred HHHHHHHHHHHHHHHHhccC------------CCCccccccccccccccccccc--c--ccccHHHHHHHHHhhc--ccc Confidence 88888888888887776321 1111234444322111100000 0 0011222233323332 234 Q ss_pred cCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEe Q lcl|NC_014661. 372 GAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYY 451 (524) Q Consensus 372 g~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fy 451 (524) ...+.+||+|.+...|....- ..|.- +..+.+.. -.|+|.| ++|+++++.|.+-+++|---. . ++. T Consensus 301 ~~~~~~v~n~~~~~~L~~lkd-----~~G~~--i~~~~~~~-~~~~l~G-~pV~~~~~~p~~~~~~gd~s~----~-~~~ 366 (418) T protein:vir:10 301 FPATGIVLNPIDWASIELTKD-----SQGRY--IVGNPVNG-TTPRLWN-LPVVETQAMTANEFLVGAFSM----A-AQI 366 (418) T ss_pred CCCCEEEEcHHHHHHHHHhhc-----CCCce--eccccccC-CCceecc-eeeEEcCCCCCCcEEEeeccc----e-EEE Confidence 467789999999999875321 11100 11111111 1357777 799999998876666653210 0 000 Q ss_pred ecccccccccccCccc---cc---ceeeeeeeeceee-CCcccccCCccccceeeccccchhhh Q lcl|NC_014661. 452 APYVALTPLRGADPKN---FQ---PVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 452 aPYv~~~~~~~~Dp~s---~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) |.-..+.-.+|+.. |+ =.+=+..|++..+ +|= .+.+ .......+| T Consensus 367 --~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~---------a~~~-~~~~~~~~g 418 (418) T protein:vir:10 367 --FDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPE---------SFVT-GALVEQAGG 418 (418) T ss_pred --EEecceEEEEecccchhhhcCceEEEEEEeeccEEeccc---------ceEE-EEeccCCCC Confidence 10011101112211 22 2333455776542 341 1111 111222222 No 32 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=93.47 E-value=0.0074 Score=32.25 Aligned_cols=327 Identities=14% Similarity=0.077 Sum_probs=130.3 Q ss_pred CCcc-------------cchHHHHHHhhhhhhccCCCcchhhh--hhhhhhhhhhhHHHHHhhhhcccc-chhhhccccc Q lcl|NC_014661. 1 MSTQ-------------IKTKAQLVADWKPLLEAEGAPEIAQG--KHAIIAKMFENQEADIKSDAAYRD-EKLAEAFGGF 64 (524) Q Consensus 1 ~~~~-------------~~~~~~l~~kw~p~l~~~~~~~~~~~--~~~~~~~~~enq~~~~~~~~~~~~-~~~~~~~~~~ 64 (524) +.++ ....+++++++.-+.+ ++.+. +......+.+..++.....+.-.. ....+.+... T Consensus 24 ~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~-----ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (400) T protein:vir:38 24 MKTELRSLLEGEDSEENLKKAEGVRAKYDKAGK-----EIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAY 98 (400) T ss_pred HHHHHHHHHHhhccchHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHH Confidence 1111 0111234444443322 22221 111111111111111100000000 0000000000 Q ss_pred c-----------------cccccccccccch-hhhccccccccccccCcc--hhhHHHHHHHhhhhhhceeeecCCCcch Q lcl|NC_014661. 65 L-----------------TEAEIGGDHGYDP-QNIAAGQTSGAVTQIGPA--VMGMVRRAIPNLIAFDICGVQPMQGPTG 124 (524) Q Consensus 65 l-----------------~ea~~~~~~g~~~-~~i~est~tg~v~~~~P~--Li~l~Rra~~nLIa~DI~GVQPmTGPTG 124 (524) . ............. ....+++++.+-...-|. .-.++++.-++.+..+++.+.||++.++ T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 178 (400) T protein:vir:38 99 LHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKG 178 (400) T ss_pred HhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcce Confidence 0 0000000000000 001111111111011122 1134444556778889999999998876 Q ss_pred heeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCC Q lcl|NC_014661. 125 QVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTA 204 (524) Q Consensus 125 LIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~ 204 (524) -+--++. .. +. ..+- T Consensus 179 ~~~~~~~----~~----~~----------~~~~----------------------------------------------- 193 (400) T protein:vir:38 179 TYPTVAN----AT----TK----------MVTV----------------------------------------------- 193 (400) T ss_pred EEEEEec----CC----Cc----------cccc----------------------------------------------- Confidence 3322110 00 00 0000 Q ss_pred CcccccccccccccccccccccccccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhhc Q lcl|NC_014661. 205 DAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAVH 283 (524) Q Consensus 205 ~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAvH 283 (524) +++ ...++. ..+++.++...+.-+-...+|-||.+|- T Consensus 194 ---------------------~E~-----------------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds---- 231 (400) T protein:vir:38 194 ---------------------AEL-----------------EKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDS---- 231 (400) T ss_pred ---------------------ccc-----------------ccccccccccceeeEeehhheeeehhhHHHHHhhh---- Confidence 000 000111 1234455555666666778999999985 Q ss_pred CCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 284 GMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESA 363 (524) Q Consensus 284 GLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~ 363 (524) ..|.+++|.+-|...|...+|+-|+...-. +.+.|+..++ ....++..... T Consensus 232 ~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~-------------~~~~~~~~~~-------------~~~~~~~~~~~--- 282 (400) T protein:vir:38 232 AIDLVGLIAQNGQQIKVNTTNGAVATLLKG-------------FTAKTISSVD-------------DLKHINNVDLD--- 282 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcccc-------------ccccccccHH-------------HHHHHHHhhhh--- Confidence 346788999999999999888888754321 1122332221 11222111111 Q ss_pred HHHhhccccCccEEEeCHHHHHHHhcC----CccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEE Q lcl|NC_014661. 364 EIARQTGRGAGNFIIASRNVVNVLASV----DTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGY 439 (524) Q Consensus 364 ~I~~~T~rg~gn~~v~S~~va~~L~~~----~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 439 (524) ..+ ...+|++|.....|... |...+.| +.++. ..++|.| ++|++..+.+.. - T Consensus 283 -----~~~--~a~~v~~~~~~~~l~~lkd~~G~~i~~~----------~~~~~-~~~~l~G-~pv~~~~~~~~~-----~ 338 (400) T protein:vir:38 283 -----PAY--SRVIIASQSFYNFLDTVKDGNGRYLLQD----------SILTP-SGKSVLG-MPIAVVSDDTLG-----A 338 (400) T ss_pred -----hhh--CcEEEEcHHHHHHHHHhhccCCCeeeec----------CcCCC-Ccccccc-ceeEEecccccC-----C Confidence 112 34577899999988753 2222211 11111 1246887 588877764421 1 Q ss_pred ecCCCccceeEeeccc--------ccccccccCcccccceeeeeeeeceee-CCcccccCCccccceeeccccchhh Q lcl|NC_014661. 440 KGDNEMDAGIYYAPYV--------ALTPLRGADPKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 440 KG~~~~d~g~fyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) .| +.-++|+.+- ....++..|-..|+..+-...||+..+ +|-+-. .+++.. .| T Consensus 339 ~g----~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~-------~l~~~~----~a 400 (400) T protein:vir:38 339 AG----EAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGY-------FLTYTP----KA 400 (400) T ss_pred CC----ceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceE-------EEEeec----CC Confidence 11 1122332221 122233446667777788888887653 441100 011111 01 No 33 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=93.24 E-value=0.0083 Score=32.00 Aligned_cols=340 Identities=15% Similarity=0.081 Sum_probs=127.9 Q ss_pred CCcccchH----HHHHHhhhhhhccCCCcch-------hhhhh--hhhhhhhhhHHHHHhhhhccccchh---------- Q lcl|NC_014661. 1 MSTQIKTK----AQLVADWKPLLEAEGAPEI-------AQGKH--AIIAKMFENQEADIKSDAAYRDEKL---------- 57 (524) Q Consensus 1 ~~~~~~~~----~~l~~kw~p~l~~~~~~~~-------~~~~~--~~~~~~~enq~~~~~~~~~~~~~~~---------- 57 (524) |++++... +++.++=..+++.+..-++ ...++ +-+.+..|.+.+.+.....-..... T Consensus 5 m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (397) T protein:vir:12 5 MSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQG 84 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccccc Confidence 56555431 1233333334443322211 11111 0111112222221111000000000 Q ss_pred ----------hhcc-----cccccccccccccccchhhhcccc-ccccccccCcchh--hHHHHHHHhhhhhhceeeecC Q lcl|NC_014661. 58 ----------AEAF-----GGFLTEAEIGGDHGYDPQNIAAGQ-TSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPM 119 (524) Q Consensus 58 ----------~~~~-----~~~l~ea~~~~~~g~~~~~i~est-~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPm 119 (524) ..++ +..+.+.+-.....-+...+..++ ++|.+. . |.-+ .+++.+.++.+-.+++.+.|| T Consensus 85 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~l-v-P~~~~~~ii~~~~~~~~l~~~~~~~~~ 162 (397) T protein:vir:12 85 QGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGIL-I-PEDIGRQIHEFKRQFEPLEQYVTVEPV 162 (397) T ss_pred chhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCccc-C-chhHHHHHHHhhhhhhhHHhhcceeec Confidence 0000 111111100000000001111111 122221 1 2222 355555667788899999999 Q ss_pred CCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 120 QGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVT 199 (524) Q Consensus 120 TGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~ 199 (524) +++.|-+- +..+... +.+.|-+ T Consensus 163 ~~~~~~~~-----~~~~~~~------------~~a~~v~----------------------------------------- 184 (397) T protein:vir:12 163 TTRSGTRL-----LEKNADM------------VPFSPVE----------------------------------------- 184 (397) T ss_pred cCCceeEE-----EEEecCC------------cceeeec----------------------------------------- Confidence 99887532 1111000 0000000 Q ss_pred cCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHH Q lcl|NC_014661. 200 LATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDL 279 (524) Q Consensus 200 ~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDL 279 (524) +|-.. -| ++...|.++.|+..|..+ ...+|-||.+|- T Consensus 185 ---------------------------Eg~~~--~~-------~~~~~~~~v~~~~~k~~~-------~~~is~e~l~ds 221 (397) T protein:vir:12 185 ---------------------------ELGNL--PE-------IDQPRFTKVSYSIIDYGG-------IMTLSNSMLNDS 221 (397) T ss_pred ---------------------------ccccc--cc-------cccccceeEEeeheeeEe-------eehhhHHHHhhc Confidence 00000 00 011235555555555554 455999999885 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHH Q lcl|NC_014661. 280 RAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQID 359 (524) Q Consensus 280 KAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~ 359 (524) - +|.++.|.+.|...|...+|+-||...-+ +.+.|+..+++ ..+.++..++ T Consensus 222 ~----~~l~~~i~~~l~~~~~~~~d~~il~G~g~-------------~~~~g~~~~~~------------i~~~~~~~l~ 272 (397) T protein:vir:12 222 D----QAIMTYVAKWFAKKSVVTRNNLILAAIAS-------------LKKVDIDGLDG------------IKKALNVTLD 272 (397) T ss_pred h----HHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------ccccccccHHH------------HHHHHhhccc Confidence 3 56788999999999999999888754321 23345543321 1111222222 Q ss_pred HHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCc-----ce Q lcl|NC_014661. 360 KESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ-----DY 434 (524) Q Consensus 360 ~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy 434 (524) ..+..+..++|+|.....|..... +.|.- -+..+.+.. .-++|.| ++|++.+.... +. T Consensus 273 ---------~~~~~~a~~~~n~~~~~~L~~lkd-----~~G~~-l~~~~~~~g-~~~~l~G-~pv~~~~~~~~~~~~~~~ 335 (397) T protein:vir:12 273 ---------PMVAPGSIVLTNQDGYDWLDTLKD-----GTGRY-LLQPDPTNP-TKKLLDG-RPVVPFTNRVLKTQKGKA 335 (397) T ss_pred ---------hhhhCCCEEEEcHHHHHHHHHhhc-----cCCce-eecccccCC-CCccccc-eeeEEecccccccCCCcc Confidence 122345678999999998865311 11110 011122111 1246777 58886543211 10 Q ss_pred -EEEEEecCCCccceeEeeccccccccccc-Cc----ccccceeeeeeeecee-eCCcccccCCccccceeeccccchhh Q lcl|NC_014661. 435 -FTIGYKGDNEMDAGIYYAPYVALTPLRGA-DP----KNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 435 -~~vG~KG~~~~d~g~fyaPYv~~~~~~~~-Dp----~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) +++|-- ......+. -....+... .+ .+.+-.+-...|++.. .||=+-.. .+++- T Consensus 336 ~~~~gd~-----~~~~~~~~-~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~-------~~~t~------ 396 (397) T protein:vir:12 336 PLIIGNL-----KEAIVLFD-REQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVF-------GQITV------ 396 (397) T ss_pred EEEEEeh-----hceEEEEe-ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEE-------EEEee------ Confidence 222210 00000000 000001000 01 1223445555666553 23311100 00000 Q ss_pred hhcc Q lcl|NC_014661. 508 NSVG 511 (524) Q Consensus 508 ~~~~ 511 (524) . T Consensus 397 ---~ 397 (397) T protein:vir:12 397 ---E 397 (397) T ss_pred ---C Confidence 0 No 34 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=93.20 E-value=0.0084 Score=31.96 Aligned_cols=367 Identities=14% Similarity=0.114 Sum_probs=141.9 Q ss_pred CCcccchHHHHHHhhhhhh---------ccCCCcchhhhhhhhhh--hhhhhHHHHHhh----hhccccchhhhcccccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLL---------EAEGAPEIAQGKHAIIA--KMFENQEADIKS----DAAYRDEKLAEAFGGFL 65 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l---------~~~~~~~~~~~~~~~~~--~~~enq~~~~~~----~~~~~~~~~~~~~~~~l 65 (524) +-.++...+...++=..+- ..+..+.....+....+ +-+-++.+.... ++.-+........+... T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (477) T protein:vir:84 66 LDEQIRELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIR 145 (477) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHH Confidence 2222211111111100000 00000000000000000 000011110000 00000000000000000 Q ss_pred cccccccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccc Q lcl|NC_014661. 66 TEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAK 143 (524) Q Consensus 66 ~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~ 143 (524) .....+.....+..++++|.. ..-|..+ .++...-++.+..++|++.||++.+|-+-=.|.. .. .. T Consensus 146 ----~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~--~~-----~~ 213 (477) T protein:vir:84 146 ----KIAKVGEEYRDLDRNGGTGGY-AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL--TG-----TS 213 (477) T ss_pred ----HHHHhhhhhccccccCCCcce-eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe--cC-----cc Confidence 000001111111111221111 1113222 2555555677788999999999988754211111 00 00 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccc Q lcl|NC_014661. 144 EAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILV 223 (524) Q Consensus 144 eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~ 223 (524) .+ .+. T Consensus 214 ~a---------~~~------------------------------------------------------------------ 218 (477) T protein:vir:84 214 TA---------IQA------------------------------------------------------------------ 218 (477) T ss_pred ee---------eee------------------------------------------------------------------ Confidence 00 000 Q ss_pred cccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHh Q lcl|NC_014661. 224 EIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEI 303 (524) Q Consensus 224 ~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEI 303 (524) ++|-.. .....++...+++.+++.+|.-+-...+|-||.+|-. .|.++.|.+-|+..|..-| T Consensus 219 --~Eg~~~------------~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~ 280 (477) T protein:vir:84 219 --ADNAAL------------TAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA----VSVDEFVFRDLAADYANKL 280 (477) T ss_pred --ccCccc------------ccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc----hhHHHHHHHHHHHHHHHHH Confidence 011000 0112344556777888888888888899999999943 5679999999999999999 Q ss_pred hHHHHhhHhhhhhhhhhccccccccccceecccccccc----cccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEe Q lcl|NC_014661. 304 NREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDV----RGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIA 379 (524) Q Consensus 304 NREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~----~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~ 379 (524) ++.||.. .-+.+.|.|++.......+ .+..| +....++..|-...+.+....+. .+..+|+ T Consensus 281 d~~~l~G------------~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~--~~~~~~~~~i~~~~~~~~~~~~~-~~~~~v~ 345 (477) T protein:vir:84 281 NVQVISG------------TGSNNQVVGVRATAGITQVTATSAGSAL--EKHQIIYQKIADAIQRVHTSRFL-EPEVIVM 345 (477) T ss_pred HHHHhcc------------CCCCCccceeeeccccccccccccccch--hhHHHHHHHHHHHHhhccccccC-CccEEEE Confidence 9888743 1111245666654321111 01111 11223333344444444333333 3567888 Q ss_pred CHHHHHHHhcCCc----cccccccc--cccccccccCcceEEEEecCceEEEeeCCCCcc--------eEEEEEecCCCc Q lcl|NC_014661. 380 SRNVVNVLASVDT----SVTPAAQG--LARGLNTDTTKAVFAGILGGRYKVYIDQYARQD--------YFTIGYKGDNEM 445 (524) Q Consensus 380 S~~va~~L~~~~~----~~~~~a~~--~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG~KG~~~~ 445 (524) +|.....|....- ..+.|... ...++..+.-.....|+|.| ++|+++++.|.+ -|++|--.+. T Consensus 346 ~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G-~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~-- 422 (477) T protein:vir:84 346 HPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG-LPVVTDPTLPTTLGTGTDQDVIHVLRASDL-- 422 (477) T ss_pred cHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc-cceEecCcccccccccCCcceEEEEEeceE-- Confidence 8888777754321 11111100 00001111111223467876 699999987753 3444433211 Q ss_pred cceeEeecccccccccccCccccc--ceeeeeeeece-----eeCCcccccCCccccceeecc-c--cchhh Q lcl|NC_014661. 446 DAGIYYAPYVALTPLRGADPKNFQ--PVLGFKTRYGI-----GINPLADTAAQQPAGNARIAN-G--MPSIA 507 (524) Q Consensus 446 d~g~fyaPYv~~~~~~~~Dp~s~q--P~~~~~tRY~l-----~~nP~~~~~~~~~~~~~~~~~-g--~~~~a 507 (524) +.- ..+..+ .++|.++. ..+.|.. |++ +-+| .-+.+++- + -|.+| T Consensus 423 ----~i~--~~~~~~-~~~~~~~~~~~~~~~~v-~~~~~~~~~r~~---------~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 423 ----ALF--ESSVRM-RALQETRAENLSVLLQV-YGYLAFTAARFP---------QSVVEIGGTALTAPTFA 477 (477) T ss_pred ----EEE--eeceeE-Eeccccccccceeeeee-hhhhhhhhhccc---------cceEEeecccccccccC Confidence 000 001111 12333322 2222321 221 1234 11112111 1 23333 No 35 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=365 Identities=12% Similarity=0.042 Sum_probs=138.3 Q ss_pred CCcccchH----------HHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHH-----------HHhhhhccccchhhh Q lcl|NC_014661. 1 MSTQIKTK----------AQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEA-----------DIKSDAAYRDEKLAE 59 (524) Q Consensus 1 ~~~~~~~~----------~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~-----------~~~~~~~~~~~~~~~ 59 (524) +.+++..- +++..++..++... +.+-..+-|...+ ...++.+.+...... T Consensus 39 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~---------~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (497) T protein:vir:78 39 IEPDFKAHQAEVEAHERAQEMLKSLGGADAAK---------DGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFE 109 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhh Confidence 11111111 11122222222210 0011111110000 000000000000000 Q ss_pred c---ccccccccccccccc-------------cc----hhhh-ccccccccccccCcchh-hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 60 A---FGGFLTEAEIGGDHG-------------YD----PQNI-AAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 60 ~---~~~~l~ea~~~~~~g-------------~~----~~~i-~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQ 117 (524) . +.....-+......+ -. .... ..+++++... .-|.+. .++...-+..+..+++.+. T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~-vp~~~~~~ii~~~~~~~~i~~l~~~~ 188 (497) T protein:vir:78 110 KGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPG-ILPTFLPGIVEQLFYELSLADLISSR 188 (497) T ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccc-cchhhhHHHHHHHHhhhhHHhhcccc Confidence 0 000000000000000 00 0000 0111122211 112111 3333344566677888888 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++++.- |..+... . +.+. T Consensus 189 ~~~~~~~~-------~~~~~~~---~--------~~a~------------------------------------------ 208 (497) T protein:vir:78 189 PVTSPNLS-------YLTESAA---H--------NNAA------------------------------------------ 208 (497) T ss_pred ccCCCceE-------EEEEcCC---C--------Ccce------------------------------------------ Confidence 88876421 1111000 0 0000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) -.+| +...+|...+++++++.+|.-+-...+|-||++ T Consensus 209 ----------------------------------wv~E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~ 245 (497) T protein:vir:78 209 ----------------------------------AVAE---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLR 245 (497) T ss_pred ----------------------------------eecc---------CcccccccccceeeEeeeeeeEeecHhHHHHHH Confidence 0011 122344556677888888888888899999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh--------Hhhhhhhhhhccccccccccce----ecccccccccccc Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDW--------INYSAQVGKTGQTLTVGSKAGV----FDFQDPIDVRGAR 345 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~--------l~~~A~~~k~~~~~~~~~~aG~----fdl~~~~d~~~~~ 345 (524) |-- +.++.|.+-|...|..-+|+.||.. |.+.+..+....... ....+ +.+....+-. .. T Consensus 246 d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-~~ 317 (497) T protein:vir:78 246 DAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS--LFGATSATVSNVKFPADGT-NG 317 (497) T ss_pred hHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccccccccccccccc--chhhhhhhhhhhhhhcccc-cc Confidence 942 2589999999999999999998863 222221111000000 00000 0000000000 00 Q ss_pred hHH-----HHH-----------------------HHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcC----Ccc Q lcl|NC_014661. 346 WAG-----ESF-----------------------KALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASV----DTS 393 (524) Q Consensus 346 ~a~-----E~~-----------------------r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~----~~~ 393 (524) |.+ ... -.+...+...-..+.+...+ .++.+|.+|.....|... |.+ T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~ 396 (497) T protein:vir:78 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQY 396 (497) T ss_pred hhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCce Confidence 000 000 11222233333444454555 577888999988887643 333 Q ss_pred ccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCC------ccceeEeecccccccccccCccc Q lcl|NC_014661. 394 VTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNE------MDAGIYYAPYVALTPLRGADPKN 467 (524) Q Consensus 394 ~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~------~d~g~fyaPYv~~~~~~~~Dp~s 467 (524) .+.+..+...+..... -++|.| ++|++.+..+.+-+++|--.... .+-.+-..||.. .+=.+ T Consensus 397 i~~~~~~~~~~~~~~~-----~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~ 464 (497) T protein:vir:78 397 MGGNFFGNAYGNPVNG-----GKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVD 464 (497) T ss_pred eccCcccccccccccC-----Cceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhc Confidence 3333333332211111 125666 79999888776655554221100 001111122210 01122 Q ss_pred ccceeeeeeeece-eeCCcccccCCccccceeeccccchhhhhcccc Q lcl|NC_014661. 468 FQPVLGFKTRYGI-GINPLADTAAQQPAGNARIANGMPSIANSVGKN 513 (524) Q Consensus 468 ~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~ 513 (524) .+=.+=+..|+++ +.+|=+ +.++.- -..+-. | T Consensus 465 n~v~~r~~~r~~~~v~~p~A---------~~~l~~--~~~~~~---~ 497 (497) T protein:vir:78 465 GKVTVRAEERLGLLVYRPSA---------FQLIQL--KKGATG---S 497 (497) T ss_pred CcEEEEEEEeecceeecccc---------EEEEEe--cCCccC---C Confidence 3334555678866 667722 112111 111111 1 No 36 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=365 Identities=12% Similarity=0.042 Sum_probs=138.3 Q ss_pred CCcccchH----------HHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHH-----------HHhhhhccccchhhh Q lcl|NC_014661. 1 MSTQIKTK----------AQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEA-----------DIKSDAAYRDEKLAE 59 (524) Q Consensus 1 ~~~~~~~~----------~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~-----------~~~~~~~~~~~~~~~ 59 (524) +.+++..- +++..++..++... +.+-..+-|...+ ...++.+.+...... T Consensus 39 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~---------~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (497) T protein:vir:10 39 IEPDFKAHQAEVEAHERAQEMLKSLGGADAAK---------DGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFE 109 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhh Confidence 11111111 11122222222210 0011111110000 000000000000000 Q ss_pred c---ccccccccccccccc-------------cc----hhhh-ccccccccccccCcchh-hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 60 A---FGGFLTEAEIGGDHG-------------YD----PQNI-AAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 60 ~---~~~~l~ea~~~~~~g-------------~~----~~~i-~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQ 117 (524) . +.....-+......+ -. .... ..+++++... .-|.+. .++...-+..+..+++.+. T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~-vp~~~~~~ii~~~~~~~~i~~l~~~~ 188 (497) T protein:vir:10 110 KGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPG-ILPTFLPGIVEQLFYELSLADLISSR 188 (497) T ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccc-cchhhhHHHHHHHHhhhhHHhhcccc Confidence 0 000000000000000 00 0000 0111122211 112111 3333344566677888888 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++++.- |..+... . +.+. T Consensus 189 ~~~~~~~~-------~~~~~~~---~--------~~a~------------------------------------------ 208 (497) T protein:vir:10 189 PVTSPNLS-------YLTESAA---H--------NNAA------------------------------------------ 208 (497) T ss_pred ccCCCceE-------EEEEcCC---C--------Ccce------------------------------------------ Confidence 88876421 1111000 0 0000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) -.+| +...+|...+++++++.+|.-+-...+|-||++ T Consensus 209 ----------------------------------wv~E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~ 245 (497) T protein:vir:10 209 ----------------------------------AVAE---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLR 245 (497) T ss_pred ----------------------------------eecc---------CcccccccccceeeEeeeeeeEeecHhHHHHHH Confidence 0011 122344556677888888888888899999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh--------Hhhhhhhhhhccccccccccce----ecccccccccccc Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDW--------INYSAQVGKTGQTLTVGSKAGV----FDFQDPIDVRGAR 345 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~--------l~~~A~~~k~~~~~~~~~~aG~----fdl~~~~d~~~~~ 345 (524) |-- +.++.|.+-|...|..-+|+.||.. |.+.+..+....... ....+ +.+....+-. .. T Consensus 246 d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-~~ 317 (497) T protein:vir:10 246 DAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS--LFGATSATVSNVKFPADGT-NG 317 (497) T ss_pred hHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccccccccccccccc--chhhhhhhhhhhhhhcccc-cc Confidence 942 2589999999999999999998863 222221111000000 00000 0000000000 00 Q ss_pred hHH-----HHH-----------------------HHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcC----Ccc Q lcl|NC_014661. 346 WAG-----ESF-----------------------KALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASV----DTS 393 (524) Q Consensus 346 ~a~-----E~~-----------------------r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~----~~~ 393 (524) |.+ ... -.+...+...-..+.+...+ .++.+|.+|.....|... |.+ T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~ 396 (497) T protein:vir:10 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQY 396 (497) T ss_pred hhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCce Confidence 000 000 11222233333444454555 577888999988887643 333 Q ss_pred ccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCC------ccceeEeecccccccccccCccc Q lcl|NC_014661. 394 VTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNE------MDAGIYYAPYVALTPLRGADPKN 467 (524) Q Consensus 394 ~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~------~d~g~fyaPYv~~~~~~~~Dp~s 467 (524) .+.+..+...+..... -++|.| ++|++.+..+.+-+++|--.... .+-.+-..||.. .+=.+ T Consensus 397 i~~~~~~~~~~~~~~~-----~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~ 464 (497) T protein:vir:10 397 MGGNFFGNAYGNPVNG-----GKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVD 464 (497) T ss_pred eccCcccccccccccC-----Cceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhc Confidence 3333333332211111 125666 79999888776655554221100 001111122210 01122 Q ss_pred ccceeeeeeeece-eeCCcccccCCccccceeeccccchhhhhcccc Q lcl|NC_014661. 468 FQPVLGFKTRYGI-GINPLADTAAQQPAGNARIANGMPSIANSVGKN 513 (524) Q Consensus 468 ~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~ 513 (524) .+=.+=+..|+++ +.+|=+ +.++.- -..+-. | T Consensus 465 n~v~~r~~~r~~~~v~~p~A---------~~~l~~--~~~~~~---~ 497 (497) T protein:vir:10 465 GKVTVRAEERLGLLVYRPSA---------FQLIQL--KKGATG---S 497 (497) T ss_pred CcEEEEEEEeecceeecccc---------EEEEEe--cCCccC---C Confidence 3334555678866 667722 112111 111111 1 No 37 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=92.48 E-value=0.011 Score=31.27 Aligned_cols=333 Identities=14% Similarity=0.093 Sum_probs=132.8 Q ss_pred cchHHHHHHhhhhhhcc--------------CCC--cchhhhhhhhhhhhh---hhHHHHHhhhhccccchhhh------ Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGA--PEIAQGKHAIIAKMF---ENQEADIKSDAAYRDEKLAE------ 59 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~--~~~~~~~~~~~~~~~---enq~~~~~~~~~~~~~~~~~------ 59 (524) |++.++|.++|.-+-+. +.. -++...+..+ ..+- |.+++.+.+........... T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i-~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNER-DTAKMKRDMFKEQYTEARANEVANMSEEEKKPL 79 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 77777777766644322 000 0111111111 1111 11111111111000000000 Q ss_pred -------------cccccccccccccccccchhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcc Q lcl|NC_014661. 60 -------------AFGGFLTEAEIGGDHGYDPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPT 123 (524) Q Consensus 60 -------------~~~~~l~ea~~~~~~g~~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPT 123 (524) +|..++.. +. -........++++ |.+. -|.-+ .+++.+-++.+-.++|.++||++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~l~~----~~-~~~~~~~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 152 (397) T protein:vir:49 80 TKSEEEVKAGFVKDFKNLVRG----RY-QNLLDSKTDASGSDAGLT--IPQDIQTAIHTLVSQYDSLQEYVNVENVTTLT 152 (397) T ss_pred ccchhHHHHHHHHHHHHHHhc----ch-hHHHHHhhccccccCccc--ccHhHHHHHHHHHHhhhhHHhhhceeecccCc Confidence 01111100 00 0000011112211 2111 13221 4555556677888999999999998 Q ss_pred hheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCC Q lcl|NC_014661. 124 GQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATT 203 (524) Q Consensus 124 GLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~ 203 (524) |-+. |...... .+.+.|- T Consensus 153 ~~~~-----~~~~~~~-----------~~~a~~v---------------------------------------------- 170 (397) T protein:vir:49 153 GSRV-----YEKWTDI-----------TGLANID---------------------------------------------- 170 (397) T ss_pred cceE-----EEeeccC-----------Ccceeee---------------------------------------------- Confidence 8432 2111000 0000000 Q ss_pred CCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhh Q lcl|NC_014661. 204 ADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAV 282 (524) Q Consensus 204 ~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAv 282 (524) ++| ...++. ..+++++++.++.-+-...+|-||.+|-. T Consensus 171 ----------------------~E~-----------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~-- 209 (397) T protein:vir:49 171 ----------------------DEA-----------------GKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSA-- 209 (397) T ss_pred ----------------------cCc-----------------cccccccccceeeEEeeeeeEEeeehhHHHHHhhhH-- Confidence 011 011111 23344444555555555679999999852 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_014661. 283 HGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKES 362 (524) Q Consensus 283 HGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a 362 (524) .|.+++|.+-|+..|..-+|+.||...-+.. .+.|.+++ +-...|+..|... T Consensus 210 --~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~------------~~~~~~~~-------------d~i~~~~~~l~~~- 261 (397) T protein:vir:49 210 --ENILAWLSGWIAKKVVVTRNKAILEAIAALP------------TKPTLTKW-------------DDIIDLEAKVDPA- 261 (397) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------cccccccH-------------HHHHHHHHhhhhh- Confidence 5679999999999999999999887533221 12233322 2234444444321 Q ss_pred HHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeC--CCC-----c--- Q lcl|NC_014661. 363 AEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQ--YAR-----Q--- 432 (524) Q Consensus 363 ~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~-----~--- 432 (524) +.....+|++|.....|..... +.|. .-+..|.+. -..++|.| ++|++.. ..+ . T Consensus 262 --------~~~~a~~vmn~~~~~~l~~lkd-----~~G~-~l~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~~~~~~~~i 325 (397) T protein:vir:49 262 --------IKQTSFFLTNTSGFTALKKVKN-----ALGD-YLMERDVKS-PTGYSIDG-FAVKEVADRWLANGTGGAMPL 325 (397) T ss_pred --------hcCCCEEEEcHHHHHHHHHhhc-----CCCc-eeeccCcCC-CCCceecc-eeeEEecccccccccCCceeE Confidence 2245688999999999976421 1111 011122221 11257877 5887622 211 1 Q ss_pred ------ceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-eCC--cc---cccCCccccceeec Q lcl|NC_014661. 433 ------DYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INP--LA---DTAAQQPAGNARIA 500 (524) Q Consensus 433 ------dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~~---~~~~~~~~~~~~~~ 500 (524) +|++++.++..+. =+.+|.. .+-...+-.+-...|++.. .|| |. ......+++. T Consensus 326 ~~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~---- 391 (397) T protein:vir:49 326 YFGDLKQAVTLFDRQHMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGN---- 391 (397) T ss_pred EEeeccceEEEEeecceEE----EEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCC---- Confidence 1222322222111 1122211 0111223333344444432 233 11 0000000000 Q ss_pred cccchhhh Q lcl|NC_014661. 501 NGMPSIAN 508 (524) Q Consensus 501 ~g~~~~a~ 508 (524) .+..|- T Consensus 392 --~~~~~~ 397 (397) T protein:vir:49 392 --LGSTAV 397 (397) T ss_pred --cccccC Confidence 111111 No 38 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=92.43 E-value=0.011 Score=31.23 Aligned_cols=335 Identities=14% Similarity=0.143 Sum_probs=127.2 Q ss_pred CCccc---chH----HHHHHhhhhhhc----------cCCCcchh------hhhhhhhhhhhhhHHHHHhhhh----c-- Q lcl|NC_014661. 1 MSTQI---KTK----AQLVADWKPLLE----------AEGAPEIA------QGKHAIIAKMFENQEADIKSDA----A-- 51 (524) Q Consensus 1 ~~~~~---~~~----~~l~~kw~p~l~----------~~~~~~~~------~~~~~~~~~~~enq~~~~~~~~----~-- 51 (524) +.+.+ .+- +.+.++.++--. ....+.+. ..+|....--|.+..+.+.... . T Consensus 248 ~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~ 327 (632) T protein:vir:96 248 AQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFE 327 (632) T ss_pred HHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhh Confidence 11100 110 112222211000 00001111 1111110000111111110000 0 Q ss_pred -cccchhhhccccccccccccccccc----c---hhhhcc-ccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCC Q lcl|NC_014661. 52 -YRDEKLAEAFGGFLTEAEIGGDHGY----D---PQNIAA-GQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQG 121 (524) Q Consensus 52 -~~~~~~~~~~~~~l~ea~~~~~~g~----~---~~~i~e-st~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTG 121 (524) .-...+.+..|. ++ .|. + ...+.. ++++|...-....+- .++...-|..|...+ |++.+++ T Consensus 328 ~e~a~~~a~~~G~---~a-----rg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~~~~~~ 398 (632) T protein:vir:96 328 REVSLAIADASGK---EA-----RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPG 398 (632) T ss_pred hHHHHHHHHhhhh---hh-----hhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh-cceEeec Confidence 000001111110 00 000 0 000000 111111100001111 123333345555554 5555555 Q ss_pred cchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccC Q lcl|NC_014661. 122 PTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLA 201 (524) Q Consensus 122 PTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a 201 (524) .+|-+ ++..+.. +.+ T Consensus 399 ~~g~~-----~ip~~~~---~~~--------------------------------------------------------- 413 (632) T protein:vir:96 399 LVGDV-----DIPKKTS---GAN--------------------------------------------------------- 413 (632) T ss_pred CCcce-----EEEEEeC---Cce--------------------------------------------------------- Confidence 44421 1111100 000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014661. 202 TTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRA 281 (524) Q Consensus 202 ~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 281 (524) .+-+ +| +...++-..+++++++.+|+=+-...+|-||..| T Consensus 414 --------------------a~wv--------~E---------~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d--- 453 (632) T protein:vir:96 414 --------------------FYWI--------GE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ--- 453 (632) T ss_pred --------------------eEee--------cC---------CccccccccceeeEEeeeeEEEEehhhHHHHHhc--- Confidence 0000 11 1223444566778888888888888899998876 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccc----cccchHHHHHHHHHHH Q lcl|NC_014661. 282 VHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDV----RGARWAGESFKALLFQ 357 (524) Q Consensus 282 vHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~----~~~~~a~E~~r~L~~~ 357 (524) -.+|.|++|.+-|...|...+++.+|..-= +.+.+.|++.......+ .+..| +.+..|+.. T Consensus 454 -s~~~~~~~i~~~l~~a~~~~~d~a~l~G~G------------~~~~p~Gi~~~~~~~~~~~~~~~~~~--~~i~~~~~~ 518 (632) T protein:vir:96 454 -SSIHVENLIREDLIEGIGVALDLAMLTGTG------------LANDPVGLLNMTGVPALTYPAGGVDW--ASVVDMETK 518 (632) T ss_pred -cchHHHHHHHHHHHHHHHHHHHHHhhcccC------------CCCccceeeecccccceecccccCCH--HHHHHHHHH Confidence 257899999999999999999999875310 11234455533221111 11112 223334333 Q ss_pred HHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEE Q lcl|NC_014661. 358 IDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTI 437 (524) Q Consensus 358 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~v 437 (524) |... -........|+++.....|......+. +|.- +- . -|+|.| |+|++.++.+.+-+++ T Consensus 519 i~~~-------~~~~~~~~~~~~~~~~~~l~~~~l~d~---~G~~--i~---~----~~~l~G-~pv~~s~~ip~~~~~~ 578 (632) T protein:vir:96 519 ISTF-------NADAGRLAYLTSVTQRGAAKKAQVFDN---TGER--IW---Q----NNEVNG-YRAEASNQIPADTWIF 578 (632) T ss_pred Hhhc-------ccccCccEEEEchhHHHHHHHHhccCC---CCce--ee---c----CCeecc-cceEeccccccCcEEE Confidence 3222 111224457889888777764322111 1100 00 0 146776 7999998887665555 Q ss_pred EEecCCCccceeEeecccccccccccCc----ccccceeeeeeeecee-eCC--cccccCCc Q lcl|NC_014661. 438 GYKGDNEMDAGIYYAPYVALTPLRGADP----KNFQPVLGFKTRYGIG-INP--LADTAAQQ 492 (524) Q Consensus 438 G~KG~~~~d~g~fyaPYv~~~~~~~~Dp----~s~qP~~~~~tRY~l~-~nP--~~~~~~~~ 492 (524) |--. -+|+.-+-.+. -.+|| .+.+=.+=...|+++. .+| |+.....+ T Consensus 579 gd~s------~~~i~~~~~~~--i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 579 GDWS------QIVIAMWGVLD--LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred eecc------eEEEEEecceE--EEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 4211 11111110000 11233 2333334445666553 244 32222211 No 39 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=91.97 E-value=0.013 Score=30.85 Aligned_cols=297 Identities=10% Similarity=0.062 Sum_probs=120.8 Q ss_pred hhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcc-hh-hHHHHHHHhhhhhhceeeecCCCcch Q lcl|NC_014661. 47 KSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPA-VM-GMVRRAIPNLIAFDICGVQPMQGPTG 124 (524) Q Consensus 47 ~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~-Li-~l~Rra~~nLIa~DI~GVQPmTGPTG 124 (524) ||.+++.+..+.. |-..+.+.. -+.+.... .++++... =|. +. .+++.+..+.+..+++-+.||++.+- T Consensus 1 ~~~~~~~~~~~~~-f~~~~~~~~-----~~~a~~~~-~~~~~~~~--iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQKLKLNLQH-FASNNVKPQ-----VFNPDNVM-MHEKKDGT--LMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CccchhHHHHHHH-HHHhhhhhh-----hhcccccc-ccCCCcce--echhHHHHHHHHHHhhcchhhhcceeeccCCce Confidence 2222211111110 000000000 00111111 11111111 122 22 35566677888899999999987653 Q ss_pred heeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCC Q lcl|NC_014661. 125 QVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTA 204 (524) Q Consensus 125 LIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~ 204 (524) -|- ++... . .+.|- T Consensus 72 ~ip----~~~~~------~---------~a~~v----------------------------------------------- 85 (324) T protein:vir:97 72 KFT----FWADK------P---------GAYWV----------------------------------------------- 85 (324) T ss_pred EEE----EEecC------c---------ceeEe----------------------------------------------- Confidence 220 11000 0 00000 Q ss_pred CcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcC Q lcl|NC_014661. 205 DAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHG 284 (524) Q Consensus 205 ~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHG 284 (524) +| +..+++...++++++.+.|.-+.-..+|-||.+|-. T Consensus 86 -----------------------------~E---------g~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~---- 123 (324) T protein:vir:97 86 -----------------------------GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY---- 123 (324) T ss_pred -----------------------------cc---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch---- Confidence 01 011233344455555555555555669999999863 Q ss_pred CChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 285 MDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAE 364 (524) Q Consensus 285 LDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~ 364 (524) .|.+++|.+-|+..|...+++.||..--... .+.|++......... ......+..|+++.+. T Consensus 124 ~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~------------~~~gi~~~~~~~~~~------~~~~~~~~~i~~~~~~ 185 (324) T protein:vir:97 124 SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------------FGKSIAQSIEKTNKV------IKGDFTQDNIIDLEAL 185 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCc------------cCcccccccccccee------ccccCCHHHHHHHHHh Confidence 5679999999999999999999986422111 112222111000000 0001112234444444 Q ss_pred HHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCC--cceEEEEEecC Q lcl|NC_014661. 365 IARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYAR--QDYFTIGYKGD 442 (524) Q Consensus 365 I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy~~vG~KG~ 442 (524) |.. .+.....+|++|.....|....- +.|. .-+. +.. .++|.| ++|++.+..+ ...+++|-. T Consensus 186 l~~--~~~~~~~~v~n~~~~~~L~~lkd-----~~g~-~~~~-~~~----~~tl~G-~PV~~~~~~~~~~~~~~~gd~-- 249 (324) T protein:vir:97 186 LED--DELEANAFISKTQNRSLLRKIVD-----PETK-ERIY-DRN----SDTLDG-LPVVNLKSSNLKRGELITGDF-- 249 (324) T ss_pred hhh--ccCCCCEEEEcHHHHHHHHHhhc-----CCCc-eeec-CCC----Cccccc-eeeEeecCCCCCcceEEEEec-- Confidence 433 22345678999999999975331 1111 1111 111 246777 5888866543 223333311 Q ss_pred CCccceeEeecccccccccccCcc--------c------cc---ceeeeeeeece-eeCC--cccccC---Cccccceee Q lcl|NC_014661. 443 NEMDAGIYYAPYVALTPLRGADPK--------N------FQ---PVLGFKTRYGI-GINP--LADTAA---QQPAGNARI 499 (524) Q Consensus 443 ~~~d~g~fyaPYv~~~~~~~~Dp~--------s------~q---P~~~~~tRY~l-~~nP--~~~~~~---~~~~~~~~~ 499 (524) +.+++... ....++..|.. . || =.+=+..||+. ..|| |+.-.. ..++-...+ T Consensus 250 ----~~~~i~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 250 ----DKLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred ----ccEEEEEe-cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 00111110 11111111100 0 11 12222356654 3344 111100 000000011 No 40 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=91.79 E-value=0.014 Score=30.71 Aligned_cols=269 Identities=14% Similarity=0.082 Sum_probs=121.7 Q ss_pred cccccccccccccccccccccccc-ccccccccc-cccCCCCCcccccccccccccccccccccccccchhhhcccccCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATG-TAFLQATGA-VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNG 242 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg-~~~~~~~g~-~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGg 242 (524) ++.... ...++........- ......... .+... .+. ... . ..|....+..=-....++.. +. T Consensus 1 MA~~~T----~~~~~~iPev~s~~v~~~~~~~~~~~~~~~-~~~-~~~-----g-~~G~tv~iP~~~~~~~a~~v---~e 65 (272) T protein:vir:98 1 MAVGTT----KMAQMLDPEVLADMIDAEVGKAIRFAPLAE-VDT-TLE-----G-QPGTTLTVPKWDYIGDAEDV---AE 65 (272) T ss_pred CCCccc----cchheechHHHHHHHHHHHHHHhhhhcccc-ccc-ccc-----C-CCCCEEEEEEecCCCCcccc---cC Confidence 111000 00111111000000 000000000 00000 000 000 0 00111111100001111110 00 Q ss_pred CCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcc Q lcl|NC_014661. 243 SNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQ 322 (524) Q Consensus 243 s~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~ 322 (524) +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.+.++.+-|+..|..+|+.+|+..+...... T Consensus 66 --g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~----- 134 (272) T protein:vir:98 66 --GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT----- 134 (272) T ss_pred --CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----- Confidence 1233344456778888888887767777666543 25799999999999999999999999876432210 Q ss_pred ccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccc Q lcl|NC_014661. 323 TLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLA 402 (524) Q Consensus 323 ~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~ 402 (524) + .+..++ +-+-.+..++.++ ....+++||+|.++..|.......+..+.. T Consensus 135 ~------~~~~t~-------------d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~-- 184 (272) T protein:vir:98 135 V------EATATV-------------DGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGATE-- 184 (272) T ss_pred c------ccccCH-------------HHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccccc-- Confidence 0 111111 1222333333322 234679999999999997654333322211 Q ss_pred cccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee Q lcl|NC_014661. 403 RGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI 482 (524) Q Consensus 403 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~ 482 (524) .+. +.-.+-.+|.+.| ++|+++++.|.+=+++.-+|.- +++-..-+.. ...-|+.+++-.+-..-|||+.+ T Consensus 185 ~~~--~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~~~v--e~~r~~~~~~~~i~~~~~~~~~v 255 (272) T protein:vir:98 185 VGA--NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKRNTMV--ETDRDITKAINQIVANKHYGVYL 255 (272) T ss_pred ccc--cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecCCcee--eeccccccceeEEEEEEEEEEEE Confidence 111 1111123578877 7999999998655444333311 1111211111 11237888888888888998752 Q ss_pred -CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 483 -NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 483 -nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) || .++.+++-. -+..| T Consensus 256 ~~~---------~~vv~~t~~-----~a~~~ 272 (272) T protein:vir:98 256 YKA---------EKAVKITLK-----DAAKK 272 (272) T ss_pred EcC---------CceEEEEec-----ccccC Confidence 33 233444331 12222 No 41 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=91.79 E-value=0.014 Score=30.71 Aligned_cols=269 Identities=14% Similarity=0.082 Sum_probs=121.7 Q ss_pred cccccccccccccccccccccccc-ccccccccc-cccCCCCCcccccccccccccccccccccccccchhhhcccccCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATG-TAFLQATGA-VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNG 242 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg-~~~~~~~g~-~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGg 242 (524) ++.... ...++........- ......... .+... .+. ... . ..|....+..=-....++.. +. T Consensus 1 MA~~~T----~~~~~~iPev~s~~v~~~~~~~~~~~~~~~-~~~-~~~-----g-~~G~tv~iP~~~~~~~a~~v---~e 65 (272) T protein:vir:30 1 MAVGTT----KMAQMLDPEVLADMIDAEVGKAIRFAPLAE-VDT-TLE-----G-QPGTTLTVPKWDYIGDAEDV---AE 65 (272) T ss_pred CCCccc----cchheechHHHHHHHHHHHHHHhhhhcccc-ccc-ccc-----C-CCCCEEEEEEecCCCCcccc---cC Confidence 111000 00111111000000 000000000 00000 000 000 0 00111111100001111110 00 Q ss_pred CCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcc Q lcl|NC_014661. 243 SNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQ 322 (524) Q Consensus 243 s~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~ 322 (524) +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.+.++.+-|+..|..+|+.+|+..+...... T Consensus 66 --g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~----- 134 (272) T protein:vir:30 66 --GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT----- 134 (272) T ss_pred --CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----- Confidence 1233344456778888888887767777666543 25799999999999999999999999876432210 Q ss_pred ccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccc Q lcl|NC_014661. 323 TLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLA 402 (524) Q Consensus 323 ~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~ 402 (524) + .+..++ +-+-.+..++.++ ....+++||+|.++..|.......+..+.. T Consensus 135 ~------~~~~t~-------------d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~-- 184 (272) T protein:vir:30 135 V------EATATV-------------DGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGATE-- 184 (272) T ss_pred c------ccccCH-------------HHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccccc-- Confidence 0 111111 1222333333322 234679999999999997654333322211 Q ss_pred cccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee Q lcl|NC_014661. 403 RGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI 482 (524) Q Consensus 403 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~ 482 (524) .+. +.-.+-.+|.+.| ++|+++++.|.+=+++.-+|.- +++-..-+.. ...-|+.+++-.+-..-|||+.+ T Consensus 185 ~~~--~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~~~v--e~~r~~~~~~~~i~~~~~~~~~v 255 (272) T protein:vir:30 185 VGA--NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKRNTMV--ETDRDITKAINQIVANKHYGVYL 255 (272) T ss_pred ccc--cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecCCcee--eeccccccceeEEEEEEEEEEEE Confidence 111 1111123578877 7999999998655444333311 1111211111 11237888888888888998752 Q ss_pred -CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 483 -NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 483 -nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) || .++.+++-. -+..| T Consensus 256 ~~~---------~~vv~~t~~-----~a~~~ 272 (272) T protein:vir:30 256 YKA---------EKAVKITLK-----DAAKK 272 (272) T ss_pred EcC---------CceEEEEec-----ccccC Confidence 33 233444331 12222 No 42 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=91.62 E-value=0.015 Score=30.58 Aligned_cols=273 Identities=12% Similarity=0.044 Sum_probs=121.2 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +.+... --.++....... ..+..+.....-+...... +.... . ..|....++.=-.+..++. +.... T Consensus 1 ma~~~T----~~~~~iiPev~~-~~v~~~~~~~~~~~~~~~~---~~~l~-g-~~G~tv~ip~~~~~g~~~~---~~eg~ 67 (274) T protein:vir:93 1 MPQGIT----KTSNQIIPEVLA-PMMQAQLEKKLRFASFAEV---DSTLQ-G-QPGDTLTFPAFVYSGDAQV---VAEGE 67 (274) T ss_pred CCccce----ehhheechHHHH-HHHHHHHHhhhhhcccccc---ccccc-C-CCCCEEEEEeeccCCCccc---ccCCC Confidence 111000 000100000000 0000000000000000000 00000 0 0111111111000111221 11112 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTL 324 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~ 324 (524) .-.+.++. ..+.+++-|-|+-.=+++=|. .+.+ +-|.-.+..+-++..+...++++++..+..... T Consensus 68 ~i~~~~it--~~~~~~~i~~~~~~~~i~D~~--~~~~--~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~-------- 133 (274) T protein:vir:93 68 KIPTDILE--TKKREAKIRKIAKGTSITDEA--LLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------- 133 (274) T ss_pred cccccccc--cceeEEEeeeecccccccHHH--HHhh--ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------- Confidence 23344444 445555556665332333332 2223 578999999999999999999999987754321 Q ss_pred ccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccc Q lcl|NC_014661. 325 TVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARG 404 (524) Q Consensus 325 ~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~ 404 (524) +. ....+ ..+.+-.+..++.++. ..+++++|+|.+++.|.....+.+..++... T Consensus 134 ~~--~~~~~-------------~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-- 187 (274) T protein:vir:93 134 TV--NADIT-------------KLNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDASTNFTRATELG-- 187 (274) T ss_pred cc--ccccc-------------CHHHHHHHHHHhhhcc---------CCccEEEeCHHHHHHHHhhhhhccccccccc-- Confidence 10 01111 1233344444444321 2578999999999999865433333332211 Q ss_pred cccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-C Q lcl|NC_014661. 405 LNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-N 483 (524) Q Consensus 405 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-n 483 (524) .+...+-.+|.+.| ++||+|+..|..-..+.-+|. +-|.---+......-|++++.=.+-...|||+.+ | T Consensus 188 --~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~ga------i~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~ 258 (274) T protein:vir:93 188 --DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYD 258 (274) T ss_pred --ccceeecccceecC-eeEEEcCCCCcceEEEEeCCe------EEEEecCCcccccccchhhcccEEEEEEEEEEEEEc Confidence 11222335788876 899999998865443333332 1121011112112249999999999999999853 4 Q ss_pred CcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 484 PLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 484 P~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) | .++.++... +++-.| T Consensus 259 ~---------~~~v~~t~~----~~s~~~ 274 (274) T protein:vir:93 259 E---------SKAVKITKG----SGSLEM 274 (274) T ss_pred C---------CceEEEeeC----ccccCC Confidence 4 223343331 233344 No 43 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=91.61 E-value=0.015 Score=30.58 Aligned_cols=344 Identities=10% Similarity=0.041 Sum_probs=127.5 Q ss_pred CCcccchHHHHHHh------hhhhhccCCCcchhhhhhhhhhhh--hhhHHHHHhhhhccccchhhhcccccccccc--- Q lcl|NC_014661. 1 MSTQIKTKAQLVAD------WKPLLEAEGAPEIAQGKHAIIAKM--FENQEADIKSDAAYRDEKLAEAFGGFLTEAE--- 69 (524) Q Consensus 1 ~~~~~~~~~~l~~k------w~p~l~~~~~~~~~~~~~~~~~~~--~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~--- 69 (524) +-.++. .+.|+ ...-... ...++..-.+.+-++| +|.+...+.... -..+.-..+.+....+.+ T Consensus 15 ~~~~~~---~~~e~~~~~~~~~~e~~~-~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 89 (390) T protein:vir:81 15 VTDSLR---AFGERAVRDGELNASARS-KVDELFATVGNLSAEVQAARQRVAELEGNG-AGGDVQHVSVGDMFVASEQFQ 89 (390) T ss_pred HHHHHH---HHHHHHHhhcCcCHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccccccchhhhhhhHHHH Confidence 111110 00000 0000000 0000000001111111 111111110000 000000000000000000 Q ss_pred --------cccccccchhhh----ccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCc Q lcl|NC_014661. 70 --------IGGDHGYDPQNI----AAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKD 136 (524) Q Consensus 70 --------~~~~~g~~~~~i----~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q 136 (524) -.+....+.... ..++++..-....|..+ .++++.-+..+-.+++.+.||++++.-+ ... T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~-------~~~ 162 (390) T protein:vir:81 90 ASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY-------VQE 162 (390) T ss_pred HHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE-------EEE Confidence 000000000000 00111111112233333 4555566677788999999998776322 111 Q ss_pred cCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccc Q lcl|NC_014661. 137 PIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQ 216 (524) Q Consensus 137 ~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~ 216 (524) .... +...| T Consensus 163 ~~~~-----------~~a~~------------------------------------------------------------ 171 (390) T protein:vir:81 163 TGFV-----------NNAAI------------------------------------------------------------ 171 (390) T ss_pred ecCC-----------cceee------------------------------------------------------------ Confidence 0000 00000 Q ss_pred ccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_014661. 217 MDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILA 296 (524) Q Consensus 217 ~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILS 296 (524) +++| ..+++-..++++++.+.|.-+-...+|-||.+|- . +.++.|.+-|+ T Consensus 172 --------v~Eg-----------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~--~---~~~~~i~~~l~ 221 (390) T protein:vir:81 172 --------VAEG-----------------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA--P---QLASYMNNRLI 221 (390) T ss_pred --------ecCC-----------------cccccccceeeEEEEeeeEEEEeehhhHHHHHhH--H---HHHHHHHHHHH Confidence 0000 1112223334455555555555667899999984 2 46888999999 Q ss_pred HHHHHHhhHHHHhhHhhhhhhhhhccccccccccceeccccccccc---ccchHHHHHHHHHHHHHHHHHHHHhhccccC Q lcl|NC_014661. 297 TEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVR---GARWAGESFKALLFQIDKESAEIARQTGRGA 373 (524) Q Consensus 297 tEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~---~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~ 373 (524) ..|...+|+-||..- -+...+.|++......... ......+.+..++.++. ..+.. T Consensus 222 ~~~~~~~d~a~l~G~------------g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~ 280 (390) T protein:vir:81 222 RGLKVKEDAEILRGT------------GANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQAS---------LAEYN 280 (390) T ss_pred HHHHHHHHHHHHhcC------------CCCCcccceeecccccccccccccchhHHHHHHHHHhhc---------cccCC Confidence 999999998887431 0111244554322111100 01112222333332222 22335 Q ss_pred ccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeec Q lcl|NC_014661. 374 GNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAP 453 (524) Q Consensus 374 gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaP 453 (524) .+.+|++|.+...|.... .+.|. -+..+... .-.++|.| ++|++.+..|.+-+++|---. .++.. T Consensus 281 ~~~~v~~~~~~~~l~~lk-----d~~G~--~l~~~~~~-~~~~~l~G-~pv~~~~~~p~~~~~~gd~~~-----~~~~~- 345 (390) T protein:vir:81 281 PSGIVINPIDWAAIELAK-----DANNQ--YLIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVGAFDL-----AAQIF- 345 (390) T ss_pred CCEEEEcHHHHHHHHHhh-----cCCCc--eeecCccc-ccCceecc-eeeEEcCCCCCCcEEEEehhc-----eEEEE- Confidence 678899999998886432 11110 00111111 01246776 699999998877666653210 01110 Q ss_pred ccccccccccC-c---ccccceeeeeeeece-eeCCcccccCCccccceeeccc Q lcl|NC_014661. 454 YVALTPLRGAD-P---KNFQPVLGFKTRYGI-GINPLADTAAQQPAGNARIANG 502 (524) Q Consensus 454 Yv~~~~~~~~D-p---~s~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~~~~~~g 502 (524) ......+...+ + .+-+=.+=...|++. +.+|= .+.+++=+ T Consensus 346 ~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~---------a~v~~t~a 390 (390) T protein:vir:81 346 DQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPE---------ALISGSFA 390 (390) T ss_pred EecceEEEEecccchhhcCcEEEEEEEeeccEEeccc---------ceEEEEeC Confidence 00111111111 1 112223335566666 34441 11222221 No 44 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=90.74 E-value=0.019 Score=29.99 Aligned_cols=298 Identities=10% Similarity=0.006 Sum_probs=125.6 Q ss_pred ccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCC Q lcl|NC_014661. 61 FGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIA 139 (524) Q Consensus 61 ~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~ 139 (524) |...-..+. .. .++.++.. ..-|.++ .+++++.++.+-.+++-+.||+++.-- |..... T Consensus 1 m~~~~~~a~----------~~-~~t~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-------~p~~~~- 60 (330) T protein:vir:77 1 MAGSTVPST----------QV-ALTGDFSA-FLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGIS-------IPHWTG- 60 (330) T ss_pred Ccccccchh----------hc-cccCCCcc-eechhHHHHHHHHHHhccchhhhcceeeccCCceE-------EEEEcC- Confidence 211111111 10 01111111 1224444 566777788888999999999875521 211000 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccc Q lcl|NC_014661. 140 AGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDA 219 (524) Q Consensus 140 ~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~ 219 (524) +.+ +.|- T Consensus 61 --~~~---------a~~v-------------------------------------------------------------- 67 (330) T protein:vir:77 61 --AVS---------ASWT-------------------------------------------------------------- 67 (330) T ss_pred --Ccc---------eeEe-------------------------------------------------------------- Confidence 000 0000 Q ss_pred cccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_014661. 220 GILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEI 299 (524) Q Consensus 220 g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEI 299 (524) +| +..+++-..+++++++..|..+-+..+|-||.+|- ..|.|++|.+-|+..| T Consensus 68 --------------~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai 120 (330) T protein:vir:77 68 --------------GE---------AERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAI 120 (330) T ss_pred --------------cC---------CCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHH Confidence 01 12233444566777888888888888999999983 5678999999999999 Q ss_pred HHHhhHHHHhh---------HhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_014661. 300 MLEINREVIDW---------INYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTG 370 (524) Q Consensus 300 mlEINREii~~---------l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~ 370 (524) ...||+-||.. +...+...... ......+. .- ....++..+.++-..+.+. T Consensus 121 ~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~------~~~~~~~~--------~~----~~~~~~~~l~~~~~~~~~~-- 180 (330) T protein:vir:77 121 ALKFDAAAIHGIDKPSAFKGYLAETTKVVSL------ADTNLTTA--------SG----PQGNAYLAVNNALSLLVNS-- 180 (330) T ss_pred HHHHHHHhhcccCCCCcccccccccccccee------eccccccc--------cc----ccchhHHHHHHHHHhhhhc-- Confidence 99999988842 11111000000 00000010 01 1122333444444444443 Q ss_pred ccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCc--------------ceEE Q lcl|NC_014661. 371 RGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ--------------DYFT 436 (524) Q Consensus 371 rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--------------dy~~ 436 (524) ....+.+|+++.....|.....-. ..--|......+......-++|.| ++||++.+.+. .+++ T Consensus 181 ~~~~~~~vmn~~~~~~l~~lkd~~--G~~l~~~~~~~~~~~~~~~~~l~G-~PV~~~~~~p~~~~~~~~~~~~gd~s~~~ 257 (330) T protein:vir:77 181 GKKWTGTLLDNVTEPILNTAVDGN--GRPLFVESTYTEQVGAIREGRILG-RPTYVADNVVNGTVGNRVVGVMGDFSQVI 257 (330) T ss_pred CCCccEEEEcHHHHHHHHHHhccC--CceeecCccccccccccCCceecc-eeeEEeccccCCCCCCccEEEEEecceEE Confidence 235667899999999997532100 000011110001111112246666 79999988642 2233 Q ss_pred EEEecCCCc----cceeEeecccccccccccCcccc---cceeeeeeeeceee-CCcccccCCccccceeeccccchhhh Q lcl|NC_014661. 437 IGYKGDNEM----DAGIYYAPYVALTPLRGADPKNF---QPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 437 vG~KG~~~~----d~g~fyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) +|-.+..+. ++.+.+.- .........+-+-| +=.+=...|++..+ +|= .++++...- || T Consensus 258 i~~~~~~~i~~~~e~~~~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---------a~~~i~~~~---~~ 324 (330) T protein:vir:77 258 WGQIGGLSFDVTDQATLDFGE-EQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKD---------AFVKLTDQV---AG 324 (330) T ss_pred EEEecCcEEEEeecceeeecc-cccccccccccchhhcCcEEEEEEEEeccEEeccc---------ceEEEEecc---CC Confidence 343332221 11111100 00000000000001 11112233444332 330 011111100 00 Q ss_pred hccccc Q lcl|NC_014661. 509 SVGKNG 514 (524) Q Consensus 509 ~~~~~~ 514 (524) .-..-- T Consensus 325 ~~~~~~ 330 (330) T protein:vir:77 325 TDPEEE 330 (330) T ss_pred cCCCCC Confidence 000000 No 45 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=90.64 E-value=0.02 Score=29.92 Aligned_cols=351 Identities=10% Similarity=0.019 Sum_probs=128.7 Q ss_pred CCcccchHHHHHHhhhhh--hccCCCcchhhh---hhhhhhhhhhhHHHHHhh--hhccccchhhhcccccccccc---- Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPL--LEAEGAPEIAQG---KHAIIAKMFENQEADIKS--DAAYRDEKLAEAFGGFLTEAE---- 69 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~--l~~~~~~~~~~~---~~~~~~~~~enq~~~~~~--~~~~~~~~~~~~~~~~l~ea~---- 69 (524) ..+.....+.+.++=... |..|....+... .+.+.++|=+- ++.+.+ ...-.......+.+....+.+ T Consensus 12 ~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (390) T protein:vir:97 12 LANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAA-RQRVAELEGNGAGGDVQHVSVGDMFVASEQFQA 90 (390) T ss_pred HHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcccccccccccchhhhhhhHHHHH Confidence 111111111122221000 000000001000 01111111110 000000 000000000000000000000 Q ss_pred ------cc-cccccchhh-----hccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCc Q lcl|NC_014661. 70 ------IG-GDHGYDPQN-----IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKD 136 (524) Q Consensus 70 ------~~-~~~g~~~~~-----i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q 136 (524) .+ +....+... ...+++++.. -.-|.++ .+++++-++.+-.+++.+-||++++.-+ .+.... T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~----~~~~~~ 165 (390) T protein:vir:97 91 STGRWNDRSARATMNIKAALNTASTDAAGSAGA-LTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY----VQETGF 165 (390) T ss_pred HHHHhhhhhhhhhhHHHHHHHhhhccccccccc-ccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEE----EEEecC Confidence 00 000000000 0011111111 1112222 4555556677778889999988766322 111000 Q ss_pred cCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccc Q lcl|NC_014661. 137 PIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQ 216 (524) Q Consensus 137 ~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~ 216 (524) . +.+.| T Consensus 166 ~--------------~~a~~------------------------------------------------------------ 171 (390) T protein:vir:97 166 V--------------NNAAI------------------------------------------------------------ 171 (390) T ss_pred C--------------cceee------------------------------------------------------------ Confidence 0 00000 Q ss_pred ccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_014661. 217 MDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILA 296 (524) Q Consensus 217 ~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILS 296 (524) +++| ..+++-..++++++...|.-+-...+|-||.+|-- +.++.|.+-|+ T Consensus 172 --------v~Eg-----------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~la 221 (390) T protein:vir:97 172 --------VAEG-----------------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLI 221 (390) T ss_pred --------ecCC-----------------ccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHH Confidence 0000 11222223345555555555556789999999852 46888988888 Q ss_pred HHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccE Q lcl|NC_014661. 297 TEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNF 376 (524) Q Consensus 297 tEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~ 376 (524) ..|...||+.||.. . -+...+.|++............-. ...+..|..+-..+. ..+...+. T Consensus 222 ~a~~~~~d~a~l~G----~--------g~~~~p~Gi~~~~~~~~~~~~~~~----~~~~d~~~~~~~~~~--~~~~~~~~ 283 (390) T protein:vir:97 222 RGLKVKEDAEILRG----T--------GANDGLLGLIPQATTYAAPTTIAG----ATRVDQLRLAMLQAS--LAEYPASG 283 (390) T ss_pred HHHHHHHHHHHhhc----C--------CCCccccceeeccccccccccccc----cchHHHHHHHHHhhc--cccCCCCE Confidence 88888888887742 0 111234455432111111000000 111111222222221 23335778 Q ss_pred EEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccc Q lcl|NC_014661. 377 IIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVA 456 (524) Q Consensus 377 ~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~ 456 (524) +|++|.....|..... +.|. -+..|... .--++|.| ++|++++..|.+-+++|--- ..+++...-. T Consensus 284 ~v~n~~~~~~L~~lkd-----~~G~--~l~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~gd~~-----~~~~~~~~~~ 349 (390) T protein:vir:97 284 IVINPIDWAAIELAKD-----ANNQ--YLIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQWD 349 (390) T ss_pred EEEcHHHHHHHHHhhc-----CCCc--eeecCccC-CCCceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEecc Confidence 9999999999975331 1111 01111111 01246776 69999999887766666311 0111111111 Q ss_pred cccccccCc---ccccceeeeeeeeceee-CCcccccCCccccceeeccccchhh Q lcl|NC_014661. 457 LTPLRGADP---KNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 457 ~~~~~~~Dp---~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) ++.....+. .+-+=.+-+..||++.+ +|= .+.++.= | T Consensus 350 ~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---------a~v~~~~-----a 390 (390) T protein:vir:97 350 ARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE---------ALITGSF-----A 390 (390) T ss_pred eEEEEeecccccccCcEEEEEEEeeccEEeccc---------cEEEEEe-----C Confidence 111111111 12222344456777654 341 1222211 1 No 46 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=90.17 E-value=0.022 Score=29.64 Aligned_cols=268 Identities=14% Similarity=0.102 Sum_probs=116.4 Q ss_pred ccccccccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcc Q lcl|NC_014661. 65 LTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGA 142 (524) Q Consensus 65 l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g 142 (524) ..|+- +.++++..-.-. |.-+ .+++.+-++.+-.+++.+-||++.+|-+ .+...... T Consensus 1 ~l~~~------------~~~t~~~gg~li-P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~-----~~~~~~~~--- 59 (293) T protein:vir:48 1 MLDSK------------TDHSGSDAGLTI-PQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSR-----VYEKWTDI--- 59 (293) T ss_pred Cceee------------cccccCcCceEe-chhHHHHHHHHHHhhhhhhhhceeeeccCCcceE-----EEEeecCC--- Confidence 11221 011111110011 2222 3555555677778888888888776511 11110000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccccc Q lcl|NC_014661. 143 KEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGIL 222 (524) Q Consensus 143 ~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~ 222 (524) .+.+.| T Consensus 60 --------~~~a~~------------------------------------------------------------------ 65 (293) T protein:vir:48 60 --------TGLANI------------------------------------------------------------------ 65 (293) T ss_pred --------Ccceee------------------------------------------------------------------ Confidence 000000 Q ss_pred ccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|NC_014661. 223 VEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIML 301 (524) Q Consensus 223 ~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEIml 301 (524) + +| +..++|.+ .++++++..+|.-+-...+|-||.+|. .+|.|++|.+-|+..|.. T Consensus 66 --v--------~E---------g~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~ 122 (293) T protein:vir:48 66 --D--------DE---------AGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKVVV 122 (293) T ss_pred --e--------cC---------CcccccccccceeEEEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHH Confidence 0 11 11233332 456677777777777788999999986 367899999999999999 Q ss_pred HhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCH Q lcl|NC_014661. 302 EINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASR 381 (524) Q Consensus 302 EINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~ 381 (524) -+|+.|+..+-..+. ..+.+++ +....|+.++. .. +.....++|++ T Consensus 123 ~~~~~i~~g~~~~~~------------~~~~~~~-------------d~i~~~~~~l~-------~~--~~~~a~~vmn~ 168 (293) T protein:vir:48 123 TRNKAILGVVDKLPT------------KPTLTKW-------------DDIIDLEAKVD-------PA--IKQTSFFLTNT 168 (293) T ss_pred HHHhHHhhccccccc------------cccccCH-------------HHHHHHHHhhh-------hh--hcCCCEEEEcH Confidence 999988865432221 1122221 22333444332 21 22345678999 Q ss_pred HHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEe--eCCCCc--------------ceEEEEEecCCCc Q lcl|NC_014661. 382 NVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYI--DQYARQ--------------DYFTIGYKGDNEM 445 (524) Q Consensus 382 ~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------------dy~~vG~KG~~~~ 445 (524) .....|....- ..|. .=+..+.+.. ..++|.| ++|++ |.+.+. +++.++.++.-.. T Consensus 169 ~~~~~L~~lkd-----~~g~-~l~~~~~~~~-~~~~l~G-~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 240 (293) T protein:vir:48 169 SGFTALKKVKN-----ALGD-YLMERDVKSP-TGYSIAG-FAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSL 240 (293) T ss_pred HHHHHHHHhhc-----cCCc-eEeecCcCCC-CCceecc-eeeEEecccccCCccCCceEEEEEeccceEEEEEecceEE Confidence 99999875331 1111 0011122211 1246777 57775 322221 1222222221111 Q ss_pred cceeEeecccccccccccCcccccceeeeeeeecee-eCCccc-----ccCCccccceeeccccchh Q lcl|NC_014661. 446 DAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INPLAD-----TAAQQPAGNARIANGMPSI 506 (524) Q Consensus 446 d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~-----~~~~~~~~~~~~~~g~~~~ 506 (524) + ..++.. .+-.+-|=.+-...||+.. .+|-+- .....++. ..|-..+ T Consensus 241 ~----~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~----~~~~~~~ 293 (293) T protein:vir:48 241 L----STNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKG----NIGSTAV 293 (293) T ss_pred E----Eecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCc----cccccCC Confidence 1 111100 0112223334444555443 222110 00000000 0000000 No 47 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=89.86 E-value=0.024 Score=29.47 Aligned_cols=310 Identities=15% Similarity=0.092 Sum_probs=126.7 Q ss_pred cccchhhhcccccccc-cc-ccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccc Q lcl|NC_014661. 74 HGYDPQNIAAGQTSGA-VT-QIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMY 150 (524) Q Consensus 74 ~g~~~~~i~est~tg~-v~-~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fn 150 (524) -||++.......++.. .. -.-|.++ .+++++..+.+-.+++-+.||++++.- |..... + T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~-------ip~~~~---~-------- 62 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIV-------IPHWTG---D-------- 62 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceE-------EEEEcC---C-------- Confidence 5666555444322211 11 1234444 445555566777888888888876421 111100 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccccccccccccc Q lcl|NC_014661. 151 APDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMA 230 (524) Q Consensus 151 Eadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~ 230 (524) +.+.| ++ T Consensus 63 -~~a~w--------------------------------------------------------------------v~---- 69 (397) T protein:vir:23 63 -VSAQW--------------------------------------------------------------------IG---- 69 (397) T ss_pred -cceEE--------------------------------------------------------------------ec---- Confidence 00000 00 Q ss_pred chhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh Q lcl|NC_014661. 231 TSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDW 310 (524) Q Consensus 231 Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~ 310 (524) | +..+++-..+++++++..|..+-.-.+|-||.+|-. .|.|++|.+-|...|...||+.+|.. T Consensus 70 ----E---------g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G 132 (397) T protein:vir:23 70 ----E---------GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAALHG 132 (397) T ss_pred ----C---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1 112333345567777777777778889999999863 67799999999999999999999864 Q ss_pred HhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcC Q lcl|NC_014661. 311 INYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASV 390 (524) Q Consensus 311 l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~ 390 (524) .-+-... .+..+.. .... .++. ...+..+..+...+.. .+...+.+|++++....|... T Consensus 133 ~gt~~~~------------~~~~~~~---~~~~-~~~~---~~~~~~~~~~~~~l~~--~~~~~a~~vmn~~~~~~L~~l 191 (397) T protein:vir:23 133 TNAPSAF------------QGYLDQS---NKTQ-SISP---NAYQGLGVSGLTKLVT--DGKKWTHTLLDDTVEPVLNGS 191 (397) T ss_pred ccCCccc------------ccccccc---ccee-eecc---cchhHHHHHHHHhhhh--cccCCCEEEEcHHHHHHHHHh Confidence 3211000 0111100 0000 0000 0011112222222222 234577899999999999754 Q ss_pred C----ccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccc---- Q lcl|NC_014661. 391 D----TSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRG---- 462 (524) Q Consensus 391 ~----~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~---- 462 (524) . ...+.+... .........|+|.| ++|+++++.+.+-++ ++.|+-. .+||.-. ....++. T Consensus 192 kd~~G~~i~~~~~~------~~~~~~~~~~tl~G-~Pv~~s~~~~~g~~~-~~~gDfs---~~~i~~~-~~i~i~~~~e~ 259 (397) T protein:vir:23 192 VDANGRPLFVESTY------ESLTTPFREGRILG-RPTILSDHVAEGDVV-GYAGDFS---QIIWGQV-GGLSFDVTDQA 259 (397) T ss_pred hccCCceeeccccc------ccccccccCceeee-eeEEEeCCCCCCceE-EEEeecc---eEEEEEE-eceEEEEeeee Confidence 2 111211110 00111112357766 699999887543211 1222210 0111100 0000110 Q ss_pred -----cCccc-----c---cceeeeeeeecee-eCC--------------cccccCCcccccee-eccc-----cchhhh Q lcl|NC_014661. 463 -----ADPKN-----F---QPVLGFKTRYGIG-INP--------------LADTAAQQPAGNAR-IANG-----MPSIAN 508 (524) Q Consensus 463 -----~Dp~s-----~---qP~~~~~tRY~l~-~nP--------------~~~~~~~~~~~~~~-~~~g-----~~~~a~ 508 (524) .|+.. | |=.+=+..|++.. .+| ++.......++..+ ..+| ++.-|. T Consensus 260 ~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 339 (397) T protein:vir:23 260 TLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLDGASAGNFTLSLDGKTSANIAYNAS 339 (397) T ss_pred eeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeeecccccCcceEEEEecCccccCcccccc Confidence 01110 0 1112222334331 112 11110000010000 0011 110000 Q ss_pred hcc-ccce--------eeeeeeecC Q lcl|NC_014661. 509 SVG-KNGY--------FRRVLVKGI 524 (524) Q Consensus 509 ~~~-~~~~--------~r~~~v~~~ 524 (524) .+. +.++ ---+-|.+- T Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (397) T protein:vir:23 340 TATVKSAIVAIDDGVSADDVTVTGS 364 (397) T ss_pred hhhhHHHhhhcccccccceeeeecC Confidence 000 0000 000000000 No 48 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=89.81 E-value=0.024 Score=29.44 Aligned_cols=273 Identities=12% Similarity=0.070 Sum_probs=118.8 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +++.. +. -.++....... ..+..+.....-+...... +..... ..|....++.=-.+..+|. +.... T Consensus 1 ma~~~---T~-~~d~i~Pev~s-~~v~~~~~~~~~~~~~~~~---~~~l~g--~~G~tv~ip~~~~~g~~~~---~~~g~ 67 (274) T protein:vir:96 1 MAQGT---TK-VSNLIVPEVLA-PMMQAELDKKLRFAQFADI---DSTLVG--QPGDTLTFPAFTYSGDAQV---IAEGE 67 (274) T ss_pred CCccc---cc-hhhhhhhHHHH-HHHHHHHHhhhhhcccccc---cccccC--CCCCEEEEEeeccCCCccc---cCCCC Confidence 11111 00 00111110000 0000000000000000000 000000 0111111111001112221 11122 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTL 324 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~ 324 (524) .-.+.++.++= .+++-+-|+-.-+++=|. ++..+-|.-.+..+-++..+..+++++|+..+..... T Consensus 68 ~i~~~~it~~~--~~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~-------- 133 (274) T protein:vir:96 68 KIPVDQIGTSK--REAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------- 133 (274) T ss_pred cCchhhcccce--eEEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-------- Confidence 33445554443 344445554322333222 1234678999999999999999999999987753221 Q ss_pred ccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccc Q lcl|NC_014661. 325 TVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARG 404 (524) Q Consensus 325 ~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~ 404 (524) +.. +..+ ..+.+-.+..++.++. ...++++|+|.+++.|.......+..++.... T Consensus 134 ~~~--~~~~-------------~~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~- 188 (274) T protein:vir:96 134 TVE--ADIT-------------KLDGLQTAIDKFNDED---------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGD- 188 (274) T ss_pred CcC--cccc-------------cHHHHHHHHHHhcccC---------CCceEEEeCHHHHHHHHhcccccccccccccc- Confidence 110 1111 1233334444444321 25789999999999997765433333222111 Q ss_pred cccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-C Q lcl|NC_014661. 405 LNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-N 483 (524) Q Consensus 405 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-n 483 (524) ....+-.+|.+.| ++||+|...|..=..+-=+|.-. |+.. -+......-||..++-.|-...+||+.. | T Consensus 189 ---~~~~~g~ig~~~G-~~Vi~s~~~p~~t~~l~~~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~ 258 (274) T protein:vir:96 189 ---NIIVKGAFGEALG-AVIVRSNKLNKGEALLAKKGAVK-----LITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYD 258 (274) T ss_pred ---cceeecccceecC-eeEEEcCCCCcceEEEEeCccee-----eeec-CCcccccccchhhcccEEEEeeEEEEEEEc Confidence 1122234788876 89999999876432221122211 1111 1111112249999999999999999865 5 Q ss_pred CcccccCCccccceeeccccchhhhhccccceeeeee Q lcl|NC_014661. 484 PLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVL 520 (524) Q Consensus 484 P~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~ 520 (524) | .++.++..+-. ++ |+ T Consensus 259 ~---------~~vv~~t~~~~------~~------~~ 274 (274) T protein:vir:96 259 E---------SKVVKITKGAG------DE------VM 274 (274) T ss_pred C---------ccEEEEEcCcc------cc------cC Confidence 5 23444444321 11 11 No 49 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=89.14 E-value=0.028 Score=29.09 Aligned_cols=280 Identities=14% Similarity=0.078 Sum_probs=126.4 Q ss_pred hccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccc Q lcl|NC_014661. 81 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGR 159 (524) Q Consensus 81 i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~ 159 (524) .+++|+++... .-|.+. .++.++.+..+..+++.+.||++-.. +|..... + +++.|- T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-------~~p~~~~---~---------~~a~wv-- 58 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ-------REFVFDF---D---------SDIDIV-- 58 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEec---C---------cceEEe-- Confidence 45566555442 334333 44444555667778999999876322 1211100 0 000010 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccc Q lcl|NC_014661. 160 GSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEG 239 (524) Q Consensus 160 ~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~ 239 (524) + | T Consensus 59 ------------------------------------------------------------------~--------E---- 60 (300) T protein:vir:95 59 ------------------------------------------------------------------A--------E---- 60 (300) T ss_pred ------------------------------------------------------------------e--------C---- Confidence 0 1 Q ss_pred cCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhh Q lcl|NC_014661. 240 FNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGK 319 (524) Q Consensus 240 lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k 319 (524) +.+.++...+++.+++++|.-+-...+|-||.+-... ..+|-+++|.+-|...|...+++.+|.....- .|+ T Consensus 61 -----g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~--~g~ 132 (300) T protein:vir:95 61 -----NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINPR--TKQ 132 (300) T ss_pred -----CcccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC--CCC Confidence 1123344455566667777666777899998753222 23567888888899999999988888653210 010 Q ss_pred hccccccccccceecccc--cccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccc Q lcl|NC_014661. 320 TGQTLTVGSKAGVFDFQD--PIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPA 397 (524) Q Consensus 320 ~~~~~~~~~~aG~fdl~~--~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~ 397 (524) +....|...+.. ...+.+ .....+.-|.++...+.. .+++.+.+|++|.....|..... T Consensus 133 ------~~~~~~~~~~~~~~~~~~~~------~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lkd----- 193 (300) T protein:vir:95 133 ------ASTIIGDNCFDKKVTQTVPF------KDTNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSKMKN----- 193 (300) T ss_pred ------Ccccccccccccccceeecc------cccchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHHhhc----- Confidence 000111110000 000000 011223333444333322 23466789999999999865321 Q ss_pred ccccccccccccCcceEEEEecCceEEEeeCCCCc------ceEEEEEecCCCccceeEeecccc--cccccccCccc-- Q lcl|NC_014661. 398 AQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ------DYFTIGYKGDNEMDAGIYYAPYVA--LTPLRGADPKN-- 467 (524) Q Consensus 398 a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~d~g~fyaPYv~--~~~~~~~Dp~s-- 467 (524) +.|.- =+..+.++. ..++|.| ++|+++.+.+. +.+++|- +.-+++|..... +...+-.|++. T Consensus 194 ~~G~~-i~~~~~~~~-~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d~~~ 265 (300) T protein:vir:95 194 AEGGK-LYPELAWGG-VPDAING-LAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPDNSG 265 (300) T ss_pred cCCCe-eccCccccC-CCceecc-eeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEeeccCCCCcc Confidence 11100 011122211 2367888 69999888543 2233331 111122222111 11111113321 Q ss_pred ---cc---ceeeeeeeeceee-CCcccccCCccccceeeccccchhhhhccccceeeeeeeec Q lcl|NC_014661. 468 ---FQ---PVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKG 523 (524) Q Consensus 468 ---~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~ 523 (524) || =.+=+..|+++.+ || +.+.+-..+.| T Consensus 266 ~~~f~~~~v~~r~~~r~d~~v~~~----------------------------~a~~~l~~~~g 300 (300) T protein:vir:95 266 RDLKGYNQIYIRCEAYIGWGIMDA----------------------------ASFARIVKTGG 300 (300) T ss_pred hhhhhcCcEEEEEEEeecceeecc----------------------------cceEEEecCCC Confidence 11 2223344666433 55 22211111222 No 50 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=87.03 E-value=0.041 Score=28.16 Aligned_cols=346 Identities=15% Similarity=0.129 Sum_probs=132.9 Q ss_pred CCcccchHHHHHHhhhhhhccCC-C---------------cchhhhhhhhhhhhh------hhHHHHHhhhhc-cccchh Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEG-A---------------PEIAQGKHAIIAKMF------ENQEADIKSDAA-YRDEKL 57 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~-~---------------~~~~~~~~~~~~~~~------enq~~~~~~~~~-~~~~~~ 57 (524) |...| +.++|.++|..+.+.-- + -++...+.++ ..+. ++|.++..+... -..+.. T Consensus 1 m~~~m-~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (408) T protein:vir:74 1 MGVKL-TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKR-DNEKVRRDALREQLVEAQAEQVVNMREEE 78 (408) T ss_pred CChhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 88877 55778888887754200 0 0011111111 0111 111111110000 000000 Q ss_pred ---------------hhcccccccccccccccccchhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceeeecC Q lcl|NC_014661. 58 ---------------AEAFGGFLTEAEIGGDHGYDPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPM 119 (524) Q Consensus 58 ---------------~~~~~~~l~ea~~~~~~g~~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPm 119 (524) ..+|...+.-. ....+.-....+..++.+ |.+. . |.-+ .+++.+-++....++++++|| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~~~~gg~~-v-P~~~~~~Ii~~~~~~~~l~~~~~~~~~ 155 (408) T protein:vir:74 79 KGPLNKSENELKDKFVKDFVNMVRNP-MAFLNTVSSKTETSGSDSAAGLT-I-PQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) T ss_pred cccccchhhhhHHHHHHHHHHHHhcc-hhhhhhhhhhhhcccccCCCcee-e-chhHhhHHHHHHhhhcchhhhcceeec Confidence 00000000000 000000111111111111 1111 1 2111 344445566678899999999 Q ss_pred CCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 120 QGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVT 199 (524) Q Consensus 120 TGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~ 199 (524) ++.+|-+--.+ .... + +...+ T Consensus 156 ~~~~~~~~~~~--~~~~-----~---------~~~~~------------------------------------------- 176 (408) T protein:vir:74 156 STSSGSRVYEK--WTDV-----T---------PLKAM------------------------------------------- 176 (408) T ss_pred cCCcceEEEEe--ecCC-----c---------ccccc------------------------------------------- Confidence 99887441111 0000 0 00000 Q ss_pred cCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHHHH Q lcl|NC_014661. 200 LATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELAQD 278 (524) Q Consensus 200 ~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQD 278 (524) ++++ ...++.+ .+++++++..+..+-...+|-||.+| T Consensus 177 -------------------------v~E~-----------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d 214 (408) T protein:vir:74 177 -------------------------DEED-----------------GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD 214 (408) T ss_pred -------------------------cccc-----------------cccccccccceeeEEeeeeeEEeeehhHHHHHhh Confidence 0000 1112221 34455555566666667799999998 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHH Q lcl|NC_014661. 279 LRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQI 358 (524) Q Consensus 279 LKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i 358 (524) - .+|.+++|.+-|+..|..-+|+.||...= +.....++.+++ .|...+ T Consensus 215 s----~~~l~~~i~~~l~~~~~~~~d~~il~G~G------------~~~~~~~~~~~~----------------~i~~~~ 262 (408) T protein:vir:74 215 T----AENILAWLSSWIAKKVVVTRNQAIIAAMG------------TVPKKPTIANFD----------------DVITMI 262 (408) T ss_pred c----hHHHHHHHHHHHHHHHHHHHHHHHhhccc------------ccccccccccHH----------------HHHHHH Confidence 3 45679999999999999999988875311 111122333221 111111 Q ss_pred HHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCC--CCc---- Q lcl|NC_014661. 359 DKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQY--ARQ---- 432 (524) Q Consensus 359 ~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~---- 432 (524) ...+. ..+...-.+||+|.....|..... +.| ..-+..+.+.. ..++|.| ++||+-.+ .+. T Consensus 263 ---~~~l~--~~~~~~a~~v~n~~~~~~l~~lkd-----~~G-~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~~~ 329 (408) T protein:vir:74 263 ---NTSVD--PAIIATSSLLTNQSGLNKLALVKT-----AEG-KYLLEPDPTKP-NSYLIKG-KQVIVVADRWLPNSGST 329 (408) T ss_pred ---HHhhh--hhhcCCCEEEEcHHHHHHHHHhhc-----CCC-ceEeccCcCCC-CCceecc-eeeEEecCcccccccCC Confidence 11111 122234467899999999976321 111 11112222221 1246777 58876332 111 Q ss_pred ce-EEEEE-e----cCCCccceeEeecccccccccccCcccccceeeeeeeecee-eCCcc-------cccCCcccccee Q lcl|NC_014661. 433 DY-FTIGY-K----GDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INPLA-------DTAAQQPAGNAR 498 (524) Q Consensus 433 dy-~~vG~-K----G~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~-------~~~~~~~~~~~~ 498 (524) ++ +++|- + .-....-.+=..||.- .+-...+-.+-+..||+.. .+|=+ ...+..++ T Consensus 330 ~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~---- 399 (408) T protein:vir:74 330 VYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGN---- 399 (408) T ss_pred cceEEEEehhccEEEEEecceEEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCC---- Confidence 11 22220 1 0000001111122210 0112344555555666543 23310 01111110 Q ss_pred eccccchhhhhccccc Q lcl|NC_014661. 499 IANGMPSIANSVGKNG 514 (524) Q Consensus 499 ~~~g~~~~a~~~~~~~ 514 (524) -+.-+-+ ++ T Consensus 400 ----~~~~~~~---~~ 408 (408) T protein:vir:74 400 ----FKTTTST---AV 408 (408) T ss_pred ----CCCCccc---cC Confidence 1110001 11 No 51 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=86.36 E-value=0.046 Score=27.91 Aligned_cols=280 Identities=13% Similarity=0.073 Sum_probs=126.5 Q ss_pred hccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccc Q lcl|NC_014661. 81 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGR 159 (524) Q Consensus 81 i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~ 159 (524) .+ +++|.. .-|.+. .+++.+-++.+-.++|.+.||++...- |.-.+. +.+ +.| T Consensus 1 ma--~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~-------ip~~~~---~~~---------a~~--- 54 (298) T protein:vir:16 1 MV--LNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEK-------VFTFTM---DSE---------IDV--- 54 (298) T ss_pred Cc--ccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCceE-------EEEEec---Ccc---------eEE--- Confidence 22 122221 223333 455556678888999999998753221 111100 000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccc Q lcl|NC_014661. 160 GSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEG 239 (524) Q Consensus 160 ~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~ 239 (524) ++ | T Consensus 55 -----------------------------------------------------------------v~--------E---- 57 (298) T protein:vir:16 55 -----------------------------------------------------------------VA--------E---- 57 (298) T ss_pred -----------------------------------------------------------------ec--------C---- Confidence 01 1 Q ss_pred cCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhh Q lcl|NC_014661. 240 FNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGK 319 (524) Q Consensus 240 lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k 319 (524) +.++++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|...|...|+..++.....-. |+ T Consensus 58 -----~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~--g~ 129 (298) T protein:vir:16 58 -----SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFHGVNPRL--GT 129 (298) T ss_pred -----CccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC--Cc Confidence 1223344445566666666666678899999875432 124567888888888888888888876532100 11 Q ss_pred hccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccc Q lcl|NC_014661. 320 TGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQ 399 (524) Q Consensus 320 ~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~ 399 (524) ...... ..++......... ..+....++..|..+...+... +.+...+|++|.....|....- +. T Consensus 130 ~~~~~~---~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~lkd-----~~ 194 (298) T protein:vir:16 130 ASAVIG---TNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQKD-----LQ 194 (298) T ss_pred cccccc---ccccccccccccc-----cccccccHHHHHHHHHHHhhhc--CCCccEEEEcHHHHHHHHHhhc-----cC Confidence 100000 0000000000000 0111122334444554444431 2356679999999998875321 11 Q ss_pred ccccccccccCcceEEEEecCceEEEeeCCCC------cceEEEEEecCCCccceeEeeccc--ccccccccCccc---- Q lcl|NC_014661. 400 GLARGLNTDTTKAVFAGILGGRYKVYIDQYAR------QDYFTIGYKGDNEMDAGIYYAPYV--ALTPLRGADPKN---- 467 (524) Q Consensus 400 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~------~dy~~vG~KG~~~~d~g~fyaPYv--~~~~~~~~Dp~s---- 467 (524) |. .-+..+.... -.|+|.| ++|+++.+.+ .+.+++|-- ..++.|..-- ++...+..||+. T Consensus 195 G~-~i~~~~~~~~-~~~~l~G-~PV~~~~~v~~~~~~~~~~~~~GDf-----s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (298) T protein:vir:16 195 DN-ALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLD 266 (298) T ss_pred CC-eeecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEeec-----cceEEEEEecCceEEEeeccCCcCcchh Confidence 10 0011111111 1257888 5999998754 234554410 0112222111 111222234432 Q ss_pred -cc-ceeee--eeeec-eeeCCcccccCCccccceeecccc Q lcl|NC_014661. 468 -FQ-PVLGF--KTRYG-IGINPLADTAAQQPAGNARIANGM 503 (524) Q Consensus 468 -~q-P~~~~--~tRY~-l~~nP~~~~~~~~~~~~~~~~~g~ 503 (524) || =.++| ..|++ ...+| ..+.++.+.- T Consensus 267 ~f~~~~v~~ra~~r~d~~v~~~---------~a~~~l~~at 298 (298) T protein:vir:16 267 LKGYNQVYIRAELFLGWGILDA---------TKFARVTEAN 298 (298) T ss_pred hhhcCcEEEEEEEEEccEeecc---------cceEEEeecC Confidence 32 12344 45776 34455 1233433311 No 52 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=85.91 E-value=0.049 Score=27.75 Aligned_cols=357 Identities=15% Similarity=0.116 Sum_probs=141.9 Q ss_pred cchHHHHHHhhhhhhcc--------------CCCcchhhhh---hhhhhhhhhhHH--HHHhhhh--------------- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGAPEIAQGK---HAIIAKMFENQE--ADIKSDA--------------- 50 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~~~~~~~~---~~~~~~~~enq~--~~~~~~~--------------- 50 (524) |++.++|.++=+-+.+. +..-+....+ +.+-++|-+.|+ .++.+.. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 55555555555554332 1110000000 011112211111 1110000 Q ss_pred -ccccchhhhccccccccc-----cc-----ccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 51 -AYRDEKLAEAFGGFLTEA-----EI-----GGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 51 -~~~~~~~~~~~~~~l~ea-----~~-----~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQ 117 (524) ..+...-...+...+.+- +. .-..+.+... -++++.+-...-|.-+ .+++++..+.+-.+++.|. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:98 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQG--GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhh--ccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 000000000000000000 00 0000000000 0111111111124433 4556667778889999999 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++..+-+--. .+.. +.. ..|- T Consensus 159 ~~~~~~~~~~~~-----~~~~---~~~---------~~~v---------------------------------------- 181 (415) T protein:vir:98 159 RVTNGSGKYPVV-----RQSE---VAA---------LEKV---------------------------------------- 181 (415) T ss_pred eccCCceeEEEE-----eecC---Ccc---------ceee---------------------------------------- Confidence 999887743211 1100 000 0000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELA 276 (524) +++ ...++.+ -++++++...+..+-...+|-||. T Consensus 182 ----------------------------~E~-----------------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 216 (415) T protein:vir:98 182 ----------------------------EEL-----------------EENPELAVKPFFQLAYDINTHRGYFRISREAI 216 (415) T ss_pred ----------------------------ccc-----------------cccCcccccceeeEEeeeeeeEeeehhhHHHH Confidence 000 0011111 134444555555555567999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccce-ecccccccccccchHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGV-FDFQDPIDVRGARWAGESFKALL 355 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~-fdl~~~~d~~~~~~a~E~~r~L~ 355 (524) +|- ..|.+++|.+-|+..|..-+|+.||...-.-...+-.... ...++ ..-.... ..+.+..++ T Consensus 217 ~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~-------~~~~i~~~~ 281 (415) T protein:vir:98 217 EDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAK-------SLDDIKDAI 281 (415) T ss_pred hhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc----ccccccccccccc-------chhHHHHHH Confidence 984 3577999999999999999999998765432111100000 00000 0000000 012222333 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceE Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYF 435 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 435 (524) ..+. . -.+ +.+.+||++.....|..... +.|.- =+..+.+. -..++|.| ++|++.++.+.. T Consensus 282 ~~~~-------~-~~~-~~~~~v~n~~~~~~l~~lkd-----~~G~~-l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~-- 342 (415) T protein:vir:98 282 NLNV-------K-PNY-EHNVAIVSQTMFAKLDKMKD-----KLGNY-LIQPDVKE-KTQQRLLG-AKIEILPDEVLG-- 342 (415) T ss_pred Hhhh-------h-hcc-CCCEEEEcHHHHHHHHHhhc-----cCCce-eeccCcCC-CCCceecc-eeeEEecccccC-- Confidence 2222 1 123 56788999999999975321 11110 01112221 12246777 688887765421 Q ss_pred EEEEecCCCccceeEeec----ccc----cccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchh Q lcl|NC_014661. 436 TIGYKGDNEMDAGIYYAP----YVA----LTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSI 506 (524) Q Consensus 436 ~vG~KG~~~~d~g~fyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~ 506 (524) -.|+ ..++|+- |+- ...+...|-.+++..+....|++.. .+|=+-..-. +.....|-+ T Consensus 343 ---~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~----~~~~~~~~~-- 409 (415) T protein:vir:98 343 ---QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE----YDDSERGEG-- 409 (415) T ss_pred ---CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE----EeccCCCCC-- Confidence 1111 1122221 111 1112223556778888888899864 3552111100 001111111 Q ss_pred hhhccccc Q lcl|NC_014661. 507 ANSVGKNG 514 (524) Q Consensus 507 a~~~~~~~ 514 (524) ..+.-. T Consensus 410 --~~~~~~ 415 (415) T protein:vir:98 410 --DLGLEA 415 (415) T ss_pred --ccccCC Confidence 111111 No 53 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=85.91 E-value=0.049 Score=27.75 Aligned_cols=357 Identities=15% Similarity=0.116 Sum_probs=141.9 Q ss_pred cchHHHHHHhhhhhhcc--------------CCCcchhhhh---hhhhhhhhhhHH--HHHhhhh--------------- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGAPEIAQGK---HAIIAKMFENQE--ADIKSDA--------------- 50 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~~~~~~~~---~~~~~~~~enq~--~~~~~~~--------------- 50 (524) |++.++|.++=+-+.+. +..-+....+ +.+-++|-+.|+ .++.+.. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 55555555555554332 1110000000 011112211111 1110000 Q ss_pred -ccccchhhhccccccccc-----cc-----ccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 51 -AYRDEKLAEAFGGFLTEA-----EI-----GGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 51 -~~~~~~~~~~~~~~l~ea-----~~-----~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQ 117 (524) ..+...-...+...+.+- +. .-..+.+... -++++.+-...-|.-+ .+++++..+.+-.+++.|. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:81 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQG--GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhh--ccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 000000000000000000 00 0000000000 0111111111124433 4556667778889999999 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++..+-+--. .+.. +.. ..|- T Consensus 159 ~~~~~~~~~~~~-----~~~~---~~~---------~~~v---------------------------------------- 181 (415) T protein:vir:81 159 RVTNGSGKYPVV-----RQSE---VAA---------LEKV---------------------------------------- 181 (415) T ss_pred eccCCceeEEEE-----eecC---Ccc---------ceee---------------------------------------- Confidence 999887743211 1100 000 0000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELA 276 (524) +++ ...++.+ -++++++...+..+-...+|-||. T Consensus 182 ----------------------------~E~-----------------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 216 (415) T protein:vir:81 182 ----------------------------EEL-----------------EENPELAVKPFFQLAYDINTHRGYFRISREAI 216 (415) T ss_pred ----------------------------ccc-----------------cccCcccccceeeEEeeeeeeEeeehhhHHHH Confidence 000 0011111 134444555555555567999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccce-ecccccccccccchHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGV-FDFQDPIDVRGARWAGESFKALL 355 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~-fdl~~~~d~~~~~~a~E~~r~L~ 355 (524) +|- ..|.+++|.+-|+..|..-+|+.||...-.-...+-.... ...++ ..-.... ..+.+..++ T Consensus 217 ~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~-------~~~~i~~~~ 281 (415) T protein:vir:81 217 EDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAK-------SLDDIKDAI 281 (415) T ss_pred hhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc----ccccccccccccc-------chhHHHHHH Confidence 984 3577999999999999999999998765432111100000 00000 0000000 012222333 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceE Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYF 435 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 435 (524) ..+. . -.+ +.+.+||++.....|..... +.|.- =+..+.+. -..++|.| ++|++.++.+.. T Consensus 282 ~~~~-------~-~~~-~~~~~v~n~~~~~~l~~lkd-----~~G~~-l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~-- 342 (415) T protein:vir:81 282 NLNV-------K-PNY-EHNVAIVSQTMFAKLDKMKD-----KLGNY-LIQPDVKE-KTQQRLLG-AKIEILPDEVLG-- 342 (415) T ss_pred Hhhh-------h-hcc-CCCEEEEcHHHHHHHHHhhc-----cCCce-eeccCcCC-CCCceecc-eeeEEecccccC-- Confidence 2222 1 123 56788999999999975321 11110 01112221 12246777 688887765421 Q ss_pred EEEEecCCCccceeEeec----ccc----cccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchh Q lcl|NC_014661. 436 TIGYKGDNEMDAGIYYAP----YVA----LTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSI 506 (524) Q Consensus 436 ~vG~KG~~~~d~g~fyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~ 506 (524) -.|+ ..++|+- |+- ...+...|-.+++..+....|++.. .+|=+-..-. +.....|-+ T Consensus 343 ---~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~----~~~~~~~~~-- 409 (415) T protein:vir:81 343 ---QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE----YDDSERGEG-- 409 (415) T ss_pred ---CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE----EeccCCCCC-- Confidence 1111 1122221 111 1112223556778888888899864 3552111100 001111111 Q ss_pred hhhccccc Q lcl|NC_014661. 507 ANSVGKNG 514 (524) Q Consensus 507 a~~~~~~~ 514 (524) ..+.-. T Consensus 410 --~~~~~~ 415 (415) T protein:vir:81 410 --DLGLEA 415 (415) T ss_pred --ccccCC Confidence 111111 No 54 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=85.91 E-value=0.049 Score=27.75 Aligned_cols=357 Identities=15% Similarity=0.116 Sum_probs=141.9 Q ss_pred cchHHHHHHhhhhhhcc--------------CCCcchhhhh---hhhhhhhhhhHH--HHHhhhh--------------- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGAPEIAQGK---HAIIAKMFENQE--ADIKSDA--------------- 50 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~~~~~~~~---~~~~~~~~enq~--~~~~~~~--------------- 50 (524) |++.++|.++=+-+.+. +..-+....+ +.+-++|-+.|+ .++.+.. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 55555555555554332 1110000000 011112211111 1110000 Q ss_pred -ccccchhhhccccccccc-----cc-----ccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 51 -AYRDEKLAEAFGGFLTEA-----EI-----GGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 51 -~~~~~~~~~~~~~~l~ea-----~~-----~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQ 117 (524) ..+...-...+...+.+- +. .-..+.+... -++++.+-...-|.-+ .+++++..+.+-.+++.|. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:79 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQG--GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhh--ccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 000000000000000000 00 0000000000 0111111111124433 4556667778889999999 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++..+-+--. .+.. +.. ..|- T Consensus 159 ~~~~~~~~~~~~-----~~~~---~~~---------~~~v---------------------------------------- 181 (415) T protein:vir:79 159 RVTNGSGKYPVV-----RQSE---VAA---------LEKV---------------------------------------- 181 (415) T ss_pred eccCCceeEEEE-----eecC---Ccc---------ceee---------------------------------------- Confidence 999887743211 1100 000 0000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELA 276 (524) +++ ...++.+ -++++++...+..+-...+|-||. T Consensus 182 ----------------------------~E~-----------------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 216 (415) T protein:vir:79 182 ----------------------------EEL-----------------EENPELAVKPFFQLAYDINTHRGYFRISREAI 216 (415) T ss_pred ----------------------------ccc-----------------cccCcccccceeeEEeeeeeeEeeehhhHHHH Confidence 000 0011111 134444555555555567999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccce-ecccccccccccchHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGV-FDFQDPIDVRGARWAGESFKALL 355 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~-fdl~~~~d~~~~~~a~E~~r~L~ 355 (524) +|- ..|.+++|.+-|+..|..-+|+.||...-.-...+-.... ...++ ..-.... ..+.+..++ T Consensus 217 ~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~-------~~~~i~~~~ 281 (415) T protein:vir:79 217 EDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAK-------SLDDIKDAI 281 (415) T ss_pred hhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc----ccccccccccccc-------chhHHHHHH Confidence 984 3577999999999999999999998765432111100000 00000 0000000 012222333 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceE Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYF 435 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 435 (524) ..+. . -.+ +.+.+||++.....|..... +.|.- =+..+.+. -..++|.| ++|++.++.+.. T Consensus 282 ~~~~-------~-~~~-~~~~~v~n~~~~~~l~~lkd-----~~G~~-l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~-- 342 (415) T protein:vir:79 282 NLNV-------K-PNY-EHNVAIVSQTMFAKLDKMKD-----KLGNY-LIQPDVKE-KTQQRLLG-AKIEILPDEVLG-- 342 (415) T ss_pred Hhhh-------h-hcc-CCCEEEEcHHHHHHHHHhhc-----cCCce-eeccCcCC-CCCceecc-eeeEEecccccC-- Confidence 2222 1 123 56788999999999975321 11110 01112221 12246777 688887765421 Q ss_pred EEEEecCCCccceeEeec----ccc----cccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchh Q lcl|NC_014661. 436 TIGYKGDNEMDAGIYYAP----YVA----LTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSI 506 (524) Q Consensus 436 ~vG~KG~~~~d~g~fyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~ 506 (524) -.|+ ..++|+- |+- ...+...|-.+++..+....|++.. .+|=+-..-. +.....|-+ T Consensus 343 ---~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~----~~~~~~~~~-- 409 (415) T protein:vir:79 343 ---QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE----YDDSERGEG-- 409 (415) T ss_pred ---CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE----EeccCCCCC-- Confidence 1111 1122221 111 1112223556778888888899864 3552111100 001111111 Q ss_pred hhhccccc Q lcl|NC_014661. 507 ANSVGKNG 514 (524) Q Consensus 507 a~~~~~~~ 514 (524) ..+.-. T Consensus 410 --~~~~~~ 415 (415) T protein:vir:79 410 --DLGLEA 415 (415) T ss_pred --ccccCC Confidence 111111 No 55 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=85.79 E-value=0.05 Score=27.71 Aligned_cols=330 Identities=15% Similarity=0.108 Sum_probs=131.6 Q ss_pred CCcccchHHHHHHhhhhhhccCCC------cchhh--hhhhhhhhhhhhHHHHHhh--hhccccchhhhccccccccccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGA------PEIAQ--GKHAIIAKMFENQEADIKS--DAAYRDEKLAEAFGGFLTEAEI 70 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~------~~~~~--~~~~~~~~~~enq~~~~~~--~~~~~~~~~~~~~~~~l~ea~~ 70 (524) +.+++...+.-++.=....+.+.. +.... ..+......+..+....+. ......+.....-.....+.. T Consensus 49 l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 127 (394) T protein:vir:97 49 AKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQ- 127 (394) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhh- Confidence 434433322211111122222221 11111 1122222222222211111 111111111100000000100 Q ss_pred ccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccc Q lcl|NC_014661. 71 GGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHP 148 (524) Q Consensus 71 ~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~ 148 (524) +.+.++.+....-|.-+ .+++.+-+..+...++.+.||+++++-+--++ ... +. T Consensus 128 -----------~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-----~~~---~~----- 183 (394) T protein:vir:97 128 -----------KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQ-----RAT---TK----- 183 (394) T ss_pred -----------ccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEe-----cCC---Cc----- Confidence 00111111000113222 35555556777788999999988765331110 000 00 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccccccccccc Q lcl|NC_014661. 149 MYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEG 228 (524) Q Consensus 149 fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~G 228 (524) ..+ +++| T Consensus 184 -----~~~--------------------------------------------------------------------v~E~ 190 (394) T protein:vir:97 184 -----MVT--------------------------------------------------------------------VAEL 190 (394) T ss_pred -----cce--------------------------------------------------------------------eccc Confidence 000 0011 Q ss_pred ccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|NC_014661. 229 MATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREV 307 (524) Q Consensus 229 m~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREi 307 (524) ...++. ...++++++.++.-+-...+|-||++|- ..|.+++|.+-|+..|..-+|..| T Consensus 191 -----------------~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la~~~~~~~~~~i 249 (394) T protein:vir:97 191 -----------------EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTNDAI 249 (394) T ss_pred -----------------ccccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHH Confidence 011111 2456677777777777889999999986 346688888888888888888877 Q ss_pred HhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHH Q lcl|NC_014661. 308 IDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVL 387 (524) Q Consensus 308 i~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L 387 (524) |..+.+. .+.+...+ +....++... ....+ .+ -+|++|.+...| T Consensus 250 ~~g~~~~-------------~~~~~~~~-------------~~~~~~~~~~--------~~~~~-~a-~~v~n~~~~~~l 293 (394) T protein:vir:97 250 AKVLKSF-------------TTKTVKNL-------------DEIKALLNGG--------FDPAY-NV-SLIVSQSFYQTL 293 (394) T ss_pred hhccccc-------------cccccccH-------------HHHHHHHHhh--------hhhhh-CC-EEEEcHHHHHHH Confidence 7543211 12222221 1111222111 11222 23 467999999998 Q ss_pred hcCCccccccccccccccccccCcceEEEEecCceEEEe--eCCCCcceEEEEEecCCCccceeEeecccccccccccCc Q lcl|NC_014661. 388 ASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYI--DQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADP 465 (524) Q Consensus 388 ~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp 465 (524) ....- +.|.- -+..+.+.. .-++|.| ++|++ |...+..-+++|-- ..+.++..- ....++..|. T Consensus 294 ~~lkd-----~~G~~-i~~~~~~~~-~~~~l~G-~pv~~~~~~~~~~~~~~~gd~-----~~~~~~~~~-~~~~~~~~~~ 359 (394) T protein:vir:97 294 DTLKD-----GNGRY-LLQDDITAV-SGKVLLG-KPVFVLSDEVLGANKAFIGDF-----KRGVLFADR-KDLGLRWADN 359 (394) T ss_pred HHhhc-----cCCCe-eeecCcCCC-CCceecc-ceeEEecccccCCccEEEeec-----cccEEEEEe-cceEEEEecc Confidence 76421 11100 011121111 1246877 57777 44445444544421 011111111 1112233455 Q ss_pred ccccceeeeeeeeceee-CCcccccCCccccceeeccccchhhhhccccce Q lcl|NC_014661. 466 KNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGY 515 (524) Q Consensus 466 ~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~ 515 (524) ..++..+-...||+..+ +|-+-.. .+++. - .-++ T Consensus 360 ~~~~~~~~~~~r~d~~v~~~~a~~~-------~~~~~---~------~~p~ 394 (394) T protein:vir:97 360 EIYGQYLQAVLRFGVSKVDDKAGYY-------VTFTP---E------PLPL 394 (394) T ss_pred cccceeEEEEEEEccEEecccceEE-------EEecc---c------ccCC Confidence 55565666667877643 4411000 01110 0 0111 No 56 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=85.70 E-value=0.051 Score=27.67 Aligned_cols=290 Identities=11% Similarity=0.053 Sum_probs=118.6 Q ss_pred Hhhhhccccchhhhcccccccccccccccccchhhhcc-cccccccc-ccCcchh-hHHHHHHHhhhhhhceeeecCCCc Q lcl|NC_014661. 46 IKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAA-GQTSGAVT-QIGPAVM-GMVRRAIPNLIAFDICGVQPMQGP 122 (524) Q Consensus 46 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~e-st~tg~v~-~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGP 122 (524) +++. .+++...-+. .+++++.. ..-|.+. .+++.+....+-.+++-+.||++. T Consensus 1 ~~~~------------------------~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~ 56 (320) T protein:vir:10 1 MAAG------------------------TAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT 56 (320) T ss_pred CCCC------------------------ccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCC Confidence 1110 1111111111 11111111 1223333 355555566777888999998775 Q ss_pred chheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|NC_014661. 123 TGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLAT 202 (524) Q Consensus 123 TGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~ 202 (524) +.-|. +... +. .+.|= T Consensus 57 ~~~~p----~~~~------~~---------~a~~v--------------------------------------------- 72 (320) T protein:vir:10 57 GQKIP----HWIG------DV---------SAQWI--------------------------------------------- 72 (320) T ss_pred ceEEE----EEeC------Cc---------ceEEe--------------------------------------------- Confidence 43221 1100 00 00000 Q ss_pred CCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhh Q lcl|NC_014661. 203 TADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAV 282 (524) Q Consensus 203 ~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAv 282 (524) +| +..+++-..+++++++..|..+-...+|.||.+|-. T Consensus 73 -------------------------------~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-- 110 (320) T protein:vir:10 73 -------------------------------GE---------GDMKPITKGNMTSQNIAPHKIATIFVASAETVRANP-- 110 (320) T ss_pred -------------------------------cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCh-- Confidence 01 011223334456677777777778889999999855 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccc-----cccccceecccccccccccchHHHHHHHHHHH Q lcl|NC_014661. 283 HGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLT-----VGSKAGVFDFQDPIDVRGARWAGESFKALLFQ 357 (524) Q Consensus 283 HGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~-----~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~ 357 (524) .|.|+.|.+.|...|...||+.+|..-- .+...+... .....+..... + -++.+ .+ T Consensus 111 --~~l~~~i~~~l~~a~a~~~d~a~l~G~g----~~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~~~---~~--- 171 (320) T protein:vir:10 111 --ANYLGTMRTKVATAFAMAFDSAALNGTD----SPFPTYLAQTTKSVSLADPGGATAS---D----LTAYD---AV--- 171 (320) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHhhcccC----CCCCcccccccccccceeccccccc---c----cccHH---HH--- Confidence 5678899999999999999888874211 000000000 00001111000 0 01111 11 Q ss_pred HHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCc----cccccccccccccccccCcceEEEEecCceEEEeeCCCCcc Q lcl|NC_014661. 358 IDKESAEIARQTGRGAGNFIIASRNVVNVLASVDT----SVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD 433 (524) Q Consensus 358 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~----~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 433 (524) +..+...+. ..+....++|++|.....|..... ..+.+. ...+......-+++.| ++||++++.+.+ T Consensus 172 ~~~~~~~~~--~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~------~~~~~~~~~~~~~i~g-~pv~~~~~~~~~ 242 (320) T protein:vir:10 172 AVNGLSLLV--NAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIES------TYTDENSPFRAGRIVS-RPTILSDHVADG 242 (320) T ss_pred HHHHHhhhh--cccCCCcEEEEcHHHHHHHHHhhccCCceeeccc------cccCccccccCceeee-eeeEecCCCCCC Confidence 111222221 233356789999999999975321 111110 0011111222345655 699999887654 Q ss_pred eEEEEEecCCCccceeEeecccccccc--------cccCccc-----cc---ceeeeeeeeceee-CCcccccCCccccc Q lcl|NC_014661. 434 YFTIGYKGDNEMDAGIYYAPYVALTPL--------RGADPKN-----FQ---PVLGFKTRYGIGI-NPLADTAAQQPAGN 496 (524) Q Consensus 434 y~~vG~KG~~~~d~g~fyaPYv~~~~~--------~~~Dp~s-----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~ 496 (524) -..+ +-|+-. .+++.-+-..... ...|+.. || =.+=...|+++.+ +|= -+ T Consensus 243 ~~~~-~~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~---------a~ 309 (320) T protein:vir:10 243 TTVG-YMGDFR---NVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKD---------AF 309 (320) T ss_pred ceEE-EEeecc---eEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEeccc---------ce Confidence 2211 112111 1122211111110 0011111 11 1122335565432 331 11 Q ss_pred eeecc-ccchhhhhc Q lcl|NC_014661. 497 ARIAN-GMPSIANSV 510 (524) Q Consensus 497 ~~~~~-g~~~~a~~~ 510 (524) .++.. +-|. + T Consensus 310 ~~l~~~~ap~----~ 320 (320) T protein:vir:10 310 VKLTNVVTPD----A 320 (320) T ss_pred EEEEeccCCC----C Confidence 22211 0011 0 No 57 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=85.57 E-value=0.052 Score=27.63 Aligned_cols=284 Identities=13% Similarity=0.088 Sum_probs=124.6 Q ss_pred ccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGR 159 (524) Q Consensus 82 ~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~ 159 (524) +..+++++....=|.-+ .+++++.++.+..+++-+.||++++--| ..... .+.+.|-|. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~-------p~~~~------------~~~a~wv~E 61 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHL-------PVLAT------------LPEADWVGE 61 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEE-------EEEeC------------CcceEEeec Confidence 22333322222113222 5666777788888999999998775222 11100 001111111 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccc Q lcl|NC_014661. 160 GSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEG 239 (524) Q Consensus 160 ~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~ 239 (524) |-... T Consensus 62 --------------------------------------------------------------------~~~~~------- 66 (305) T protein:vir:25 62 --------------------------------------------------------------------SATDP------- 66 (305) T ss_pred --------------------------------------------------------------------ccccc------- Confidence 00000 Q ss_pred cCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhh Q lcl|NC_014661. 240 FNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGK 319 (524) Q Consensus 240 lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k 319 (524) ...++.-..++++++..++..+-.-.+|-||.+|-. .|.|++|.+-|+..|...+++.+|..--.- .+. T Consensus 67 -----~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~--~~~ 135 (305) T protein:vir:25 67 -----KGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIFGTDKP--ASW 135 (305) T ss_pred -----cccccccccceeeEEeeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHhhhheeccCCC--CCc Confidence 001111233445556666666667789999999843 578999999999999999999998532110 000 Q ss_pred hccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccc Q lcl|NC_014661. 320 TGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQ 399 (524) Q Consensus 320 ~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~ 399 (524) . ...+.+..+.--..... . -.....-.++.-+.++...+...-. ..+-+++++.....|....- +. T Consensus 136 ~---~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~lkd-----~~ 201 (305) T protein:vir:25 136 V---SPALIPAAVTAGQAVEV-V---GGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIRD-----AN 201 (305) T ss_pred c---ccccccccccccccccc-c---ccchhhhHHHHHHHHHHHhhhhccc--ccceeEecHHHHHHHHHhhc-----cC Confidence 0 00000000000000000 0 0111223344444444444444332 34557889999888864211 11 Q ss_pred ccccccccccCcceEEEEecCceEEEeeCCCCcc----eEEE--------EEecCCCccceeEeecccccccccccCccc Q lcl|NC_014661. 400 GLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD----YFTI--------GYKGDNEMDAGIYYAPYVALTPLRGADPKN 467 (524) Q Consensus 400 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~v--------G~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s 467 (524) |.- =+. -++|.| ++|+|..+.+.+ -+++ |..+.-+.+- ..+.-+. ..-.|.+ T Consensus 202 G~~-i~~--------~~~l~G-~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~~~~--~~~~~~~ 265 (305) T protein:vir:25 202 GNP-VFR--------DDSFAG-FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKF----LDQATLG--TGENQIN 265 (305) T ss_pred Cce-eec--------CCcccc-cceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEE----eeeeeee--cCCceee Confidence 100 010 135766 688888775432 1222 2222111110 0000000 0011111 Q ss_pred -cc-ceee--eeeeece-eeCCcccc-cCCccc-cceeec Q lcl|NC_014661. 468 -FQ-PVLG--FKTRYGI-GINPLADT-AAQQPA-GNARIA 500 (524) Q Consensus 468 -~q-P~~~--~~tRY~l-~~nP~~~~-~~~~~~-~~~~~~ 500 (524) || ..++ ...|||+ +.||=+-. .+..+. -+.... T Consensus 266 ~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 266 LAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred eeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 22 1223 4668996 55884311 111111 011222 No 58 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=84.97 E-value=0.056 Score=27.43 Aligned_cols=300 Identities=11% Similarity=0.042 Sum_probs=122.1 Q ss_pred hhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhcee Q lcl|NC_014661. 38 MFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICG 115 (524) Q Consensus 38 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~G 115 (524) +=++|+..... ..|... .......++. .... ++++.. .=|.-+ .+++.+..+....+++- T Consensus 1 ~~~~~~~~~~~-~~~~~~----~~~~~~~~a~----------~~~~-~~~~~~--~iP~~~~~~ii~~~~~~s~l~~l~~ 62 (324) T protein:vir:96 1 MEQTQKLKLNL-QHFASN----NVKPQVFNPD----------NVMM-HEKKDG--TLMNEFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCcchhhhHHH-HHHHHH----hhhhhhhccc----------cccc-cCcCcc--ccchhHHHHHHHHHHhhchhhhhcc Confidence 11111111100 000000 0000001111 1110 111111 112222 35556667777888888 Q ss_pred eecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 116 VQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQAT 195 (524) Q Consensus 116 VQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~ 195 (524) +-||++++--| .-... + +.+.| T Consensus 63 ~~~~~~~~~~~-------p~~~~---~---------~~a~~--------------------------------------- 84 (324) T protein:vir:96 63 YEPMEGTEKKF-------TFWAD---K---------PGAYW--------------------------------------- 84 (324) T ss_pred eeeccCCceEE-------EEEec---C---------cceeE--------------------------------------- Confidence 88988765222 11100 0 00000 Q ss_pred cccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHH Q lcl|NC_014661. 196 GAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIEL 275 (524) Q Consensus 196 g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~EL 275 (524) + +| +..+++...++++++++.+.-+.-..+|-|| T Consensus 85 -----------------------------v--------~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~is~el 118 (324) T protein:vir:96 85 -----------------------------V--------GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEF 118 (324) T ss_pred -----------------------------e--------cC---------CccccccccceeEEEEeeEEEEEeehhhHHH Confidence 0 11 1223344445566666666666667799999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHH Q lcl|NC_014661. 276 AQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALL 355 (524) Q Consensus 276 AQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~ 355 (524) .+|-. .|.+++|.+-|+..|...|++.+|..--.. ..+.|+........... .....+ T Consensus 119 l~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~------------~~~~gi~~~~~~~~~~~------~~~~t~ 176 (324) T protein:vir:96 119 LNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFGKSIAQSIEKTNKVI------KGDFTQ 176 (324) T ss_pred Hhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC------------CcCccccccccccceec------cccccH Confidence 99864 567999999999999999999888642211 11223332211110000 001123 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCC--Ccc Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYA--RQD 433 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~d 433 (524) ..|+++.+.|.. .+...+.+|+||.....|.....- .|.- -+. +..+ ++|.| ++|++++.. +.. T Consensus 177 ~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~-----~G~~-~~~-~~~~----~~l~G-~PV~~~~~~~~~~~ 242 (324) T protein:vir:96 177 DNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDP-----ETKE-RIY-DRNS----DSLDG-LPVVNLKSSNLKRG 242 (324) T ss_pred HHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcc-----CCCe-eec-CCCC----Ccccc-eeeEeeCCCCCCcc Confidence 334444444433 334567899999999999754321 1110 111 1122 35666 588887764 333 Q ss_pred eEEEEEecCCCccceeEeeccccccccccc---------Ccc-----cc---cceeeeeeeeceee-CC--cccccCCcc Q lcl|NC_014661. 434 YFTIGYKGDNEMDAGIYYAPYVALTPLRGA---------DPK-----NF---QPVLGFKTRYGIGI-NP--LADTAAQQP 493 (524) Q Consensus 434 y~~vG~KG~~~~d~g~fyaPYv~~~~~~~~---------Dp~-----s~---qP~~~~~tRY~l~~-nP--~~~~~~~~~ 493 (524) .+++|-. +.+++... ....++.. |+. -| |=.+=...||+..+ +| |+. ..... T Consensus 243 ~~~~gd~------~~~~~g~~-~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~-l~~a~ 314 (324) T protein:vir:96 243 ELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK-LVPAD 314 (324) T ss_pred eEEEEec------ceEEEEEe-cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEE-Eeccc Confidence 3443321 01111111 11000000 110 01 11122234554432 23 111 11111 Q ss_pred ccceeeccccc Q lcl|NC_014661. 494 AGNARIANGMP 504 (524) Q Consensus 494 ~~~~~~~~g~~ 504 (524) ++... +-|.- T Consensus 315 ~~~~~-~~~~~ 324 (324) T protein:vir:96 315 KRTDS-VPGEV 324 (324) T ss_pred ccCCC-CCCCC Confidence 11000 00110 No 59 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=84.97 E-value=0.056 Score=27.43 Aligned_cols=300 Identities=11% Similarity=0.042 Sum_probs=122.1 Q ss_pred hhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhcee Q lcl|NC_014661. 38 MFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICG 115 (524) Q Consensus 38 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~G 115 (524) +=++|+..... ..|... .......++. .... ++++.. .=|.-+ .+++.+..+....+++- T Consensus 1 ~~~~~~~~~~~-~~~~~~----~~~~~~~~a~----------~~~~-~~~~~~--~iP~~~~~~ii~~~~~~s~l~~l~~ 62 (324) T protein:vir:78 1 MEQTQKLKLNL-QHFASN----NVKPQVFNPD----------NVMM-HEKKDG--TLMNEFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCcchhhhHHH-HHHHHH----hhhhhhhccc----------cccc-cCcCcc--ccchhHHHHHHHHHHhhchhhhhcc Confidence 11111111100 000000 0000001111 1110 111111 112222 35556667777888888 Q ss_pred eecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 116 VQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQAT 195 (524) Q Consensus 116 VQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~ 195 (524) +-||++++--| .-... + +.+.| T Consensus 63 ~~~~~~~~~~~-------p~~~~---~---------~~a~~--------------------------------------- 84 (324) T protein:vir:78 63 YEPMEGTEKKF-------TFWAD---K---------PGAYW--------------------------------------- 84 (324) T ss_pred eeeccCCceEE-------EEEec---C---------cceeE--------------------------------------- Confidence 88988765222 11100 0 00000 Q ss_pred cccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHH Q lcl|NC_014661. 196 GAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIEL 275 (524) Q Consensus 196 g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~EL 275 (524) + +| +..+++...++++++++.+.-+.-..+|-|| T Consensus 85 -----------------------------v--------~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~is~el 118 (324) T protein:vir:78 85 -----------------------------V--------GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEF 118 (324) T ss_pred -----------------------------e--------cC---------CccccccccceeEEEEeeEEEEEeehhhHHH Confidence 0 11 1223344445566666666666667799999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHH Q lcl|NC_014661. 276 AQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALL 355 (524) Q Consensus 276 AQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~ 355 (524) .+|-. .|.+++|.+-|+..|...|++.+|..--.. ..+.|+........... .....+ T Consensus 119 l~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~------------~~~~gi~~~~~~~~~~~------~~~~t~ 176 (324) T protein:vir:78 119 LNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFGKSIAQSIEKTNKVI------KGDFTQ 176 (324) T ss_pred Hhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC------------CcCccccccccccceec------cccccH Confidence 99864 567999999999999999999888642211 11223332211110000 001123 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCC--Ccc Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYA--RQD 433 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~d 433 (524) ..|+++.+.|.. .+...+.+|+||.....|.....- .|.- -+. +..+ ++|.| ++|++++.. +.. T Consensus 177 ~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~-----~G~~-~~~-~~~~----~~l~G-~PV~~~~~~~~~~~ 242 (324) T protein:vir:78 177 DNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDP-----ETKE-RIY-DRNS----DSLDG-LPVVNLKSSNLKRG 242 (324) T ss_pred HHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcc-----CCCe-eec-CCCC----Ccccc-eeeEeeCCCCCCcc Confidence 334444444433 334567899999999999754321 1110 111 1122 35666 588887764 333 Q ss_pred eEEEEEecCCCccceeEeeccccccccccc---------Ccc-----cc---cceeeeeeeeceee-CC--cccccCCcc Q lcl|NC_014661. 434 YFTIGYKGDNEMDAGIYYAPYVALTPLRGA---------DPK-----NF---QPVLGFKTRYGIGI-NP--LADTAAQQP 493 (524) Q Consensus 434 y~~vG~KG~~~~d~g~fyaPYv~~~~~~~~---------Dp~-----s~---qP~~~~~tRY~l~~-nP--~~~~~~~~~ 493 (524) .+++|-. +.+++... ....++.. |+. -| |=.+=...||+..+ +| |+. ..... T Consensus 243 ~~~~gd~------~~~~~g~~-~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~-l~~a~ 314 (324) T protein:vir:78 243 ELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK-LVPAD 314 (324) T ss_pred eEEEEec------ceEEEEEe-cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEE-Eeccc Confidence 3443321 01111111 11000000 110 01 11122234554432 23 111 11111 Q ss_pred ccceeeccccc Q lcl|NC_014661. 494 AGNARIANGMP 504 (524) Q Consensus 494 ~~~~~~~~g~~ 504 (524) ++... +-|.- T Consensus 315 ~~~~~-~~~~~ 324 (324) T protein:vir:78 315 KRTDS-VPGEV 324 (324) T ss_pred ccCCC-CCCCC Confidence 11000 00110 No 60 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=84.43 E-value=0.06 Score=27.26 Aligned_cols=345 Identities=14% Similarity=0.106 Sum_probs=133.6 Q ss_pred CCcccchHH-HHHHhhh---hhhccCCCcchhhhhh---hhhhhhhhhHHHHHhhhhccccc------h-------hhhc Q lcl|NC_014661. 1 MSTQIKTKA-QLVADWK---PLLEAEGAPEIAQGKH---AIIAKMFENQEADIKSDAAYRDE------K-------LAEA 60 (524) Q Consensus 1 ~~~~~~~~~-~l~~kw~---p~l~~~~~~~~~~~~~---~~~~~~~enq~~~~~~~~~~~~~------~-------~~~~ 60 (524) |++++.... +|.++++ -+++.+..-++...+. ++-++| |.+++...++...+.. . ...+ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 999876432 2344443 3344333322222111 122211 2222111111110000 0 0001 Q ss_pred c-----cccccccc--cccccccchhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeee Q lcl|NC_014661. 61 F-----GGFLTEAE--IGGDHGYDPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALR 130 (524) Q Consensus 61 ~-----~~~l~ea~--~~~~~g~~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMR 130 (524) + +..+.... ....+ ........++++ |.+ .. |.-+ .+++.+..+..-.+++.+.||++++|-+. . T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~gg~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~ 154 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDD-LEQRAMSGLTGEDGGL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV--L 154 (392) T ss_pred HHHHHhcccccHHHHHHHhhh-hhhhhccccccCCCce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE--E Confidence 1 11111100 00000 000111112221 211 11 2222 34444455666778999999999877421 1 Q ss_pred eeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccc Q lcl|NC_014661. 131 AVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELD 210 (524) Q Consensus 131 SrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d 210 (524) .+..+.. ...| T Consensus 155 ~~~~~~~---------------~a~~------------------------------------------------------ 165 (392) T protein:vir:10 155 EKNSDMI---------------PFAE------------------------------------------------------ 165 (392) T ss_pred EeecCCc---------------ccee------------------------------------------------------ Confidence 1111000 0000 Q ss_pred ccccccccccccccccccccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014661. 211 AEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADA 289 (524) Q Consensus 211 ~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEa 289 (524) +++|- ..++- .-++++++..++.-+-...+|-||.+|- ..|.++ T Consensus 166 --------------v~E~~-----------------~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~ 210 (392) T protein:vir:10 166 --------------ITEMG-----------------EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILK 210 (392) T ss_pred --------------ecccc-----------------cccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHH Confidence 00110 01111 1234444555555555667999999994 256788 Q ss_pred HHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_014661. 290 ELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQT 369 (524) Q Consensus 290 ELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T 369 (524) +|.+-|...|...++..|+...-+. .+.|..+++ ....++... . .. T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~-------------~~~~~~~~d-------------~i~~~~~~~--l------~~ 256 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKL-------------TKQAIKSLD-------------DIKDVLNVK--L------DP 256 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhccccc-------------cccCccCHH-------------HHHHHHHHh--h------hh Confidence 9999999999999988887433211 122333221 122222111 1 12 Q ss_pred cccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCcccee Q lcl|NC_014661. 370 GRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGI 449 (524) Q Consensus 370 ~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~ 449 (524) .+-..-.+|++|.....|....- +.|. .-+..+.+. -..++|.|...|+++.... ++.+|...-+..+ T Consensus 257 ~~~~~a~~vm~~~~~~~L~~lkd-----~~G~-~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~ 324 (392) T protein:vir:10 257 AISPNAILLTNQDGFNYLDKLKD-----KDGK-YILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPL 324 (392) T ss_pred hhccCCEEEEcHHHHHHHHHhhc-----cCCC-eEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEE Confidence 22234457899999999976421 1110 001112221 1235677765666543321 1111222222223 Q ss_pred Eeeccccc-------ccccccCc------ccccceeeeeeeeceee-CC--cccccCCccccceeeccccchhhh Q lcl|NC_014661. 450 YYAPYVAL-------TPLRGADP------KNFQPVLGFKTRYGIGI-NP--LADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 450 fyaPYv~~-------~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) +|+.+-.+ .+.-.++| .+.+=.+-...|++..+ +| |.... -..+-++.. |+ | T Consensus 325 ~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-~~~~a~~~~----~~--~ 392 (392) T protein:vir:10 325 IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQ----PQ--G 392 (392) T ss_pred EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-ecccccccC----CC--C Confidence 33332110 00001122 23445566677777543 34 21110 000000110 11 1 No 61 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=84.43 E-value=0.06 Score=27.26 Aligned_cols=345 Identities=14% Similarity=0.106 Sum_probs=133.6 Q ss_pred CCcccchHH-HHHHhhh---hhhccCCCcchhhhhh---hhhhhhhhhHHHHHhhhhccccc------h-------hhhc Q lcl|NC_014661. 1 MSTQIKTKA-QLVADWK---PLLEAEGAPEIAQGKH---AIIAKMFENQEADIKSDAAYRDE------K-------LAEA 60 (524) Q Consensus 1 ~~~~~~~~~-~l~~kw~---p~l~~~~~~~~~~~~~---~~~~~~~enq~~~~~~~~~~~~~------~-------~~~~ 60 (524) |++++.... +|.++++ -+++.+..-++...+. ++-++| |.+++...++...+.. . ...+ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 999876432 2344443 3344333322222111 122211 2222111111110000 0 0001 Q ss_pred c-----cccccccc--cccccccchhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeee Q lcl|NC_014661. 61 F-----GGFLTEAE--IGGDHGYDPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALR 130 (524) Q Consensus 61 ~-----~~~l~ea~--~~~~~g~~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMR 130 (524) + +..+.... ....+ ........++++ |.+ .. |.-+ .+++.+..+..-.+++.+.||++++|-+. . T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~gg~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~ 154 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDD-LEQRAMSGLTGEDGGL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV--L 154 (392) T ss_pred HHHHHhcccccHHHHHHHhhh-hhhhhccccccCCCce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE--E Confidence 1 11111100 00000 000111112221 211 11 2222 34444455666778999999999877421 1 Q ss_pred eeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccc Q lcl|NC_014661. 131 AVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELD 210 (524) Q Consensus 131 SrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d 210 (524) .+..+.. ...| T Consensus 155 ~~~~~~~---------------~a~~------------------------------------------------------ 165 (392) T protein:vir:10 155 EKNSDMI---------------PFAE------------------------------------------------------ 165 (392) T ss_pred EeecCCc---------------ccee------------------------------------------------------ Confidence 1111000 0000 Q ss_pred ccccccccccccccccccccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014661. 211 AEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADA 289 (524) Q Consensus 211 ~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEa 289 (524) +++|- ..++- .-++++++..++.-+-...+|-||.+|- ..|.++ T Consensus 166 --------------v~E~~-----------------~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~ 210 (392) T protein:vir:10 166 --------------ITEMG-----------------EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILK 210 (392) T ss_pred --------------ecccc-----------------cccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHH Confidence 00110 01111 1234444555555555667999999994 256788 Q ss_pred HHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_014661. 290 ELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQT 369 (524) Q Consensus 290 ELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T 369 (524) +|.+-|...|...++..|+...-+. .+.|..+++ ....++... . .. T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~-------------~~~~~~~~d-------------~i~~~~~~~--l------~~ 256 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKL-------------TKQAIKSLD-------------DIKDVLNVK--L------DP 256 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhccccc-------------cccCccCHH-------------HHHHHHHHh--h------hh Confidence 9999999999999988887433211 122333221 122222111 1 12 Q ss_pred cccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCcccee Q lcl|NC_014661. 370 GRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGI 449 (524) Q Consensus 370 ~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~ 449 (524) .+-..-.+|++|.....|....- +.|. .-+..+.+. -..++|.|...|+++.... ++.+|...-+..+ T Consensus 257 ~~~~~a~~vm~~~~~~~L~~lkd-----~~G~-~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~ 324 (392) T protein:vir:10 257 AISPNAILLTNQDGFNYLDKLKD-----KDGK-YILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPL 324 (392) T ss_pred hhccCCEEEEcHHHHHHHHHhhc-----cCCC-eEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEE Confidence 22234457899999999976421 1110 001112221 1235677765666543321 1111222222223 Q ss_pred Eeeccccc-------ccccccCc------ccccceeeeeeeeceee-CC--cccccCCccccceeeccccchhhh Q lcl|NC_014661. 450 YYAPYVAL-------TPLRGADP------KNFQPVLGFKTRYGIGI-NP--LADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 450 fyaPYv~~-------~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) +|+.+-.+ .+.-.++| .+.+=.+-...|++..+ +| |.... -..+-++.. |+ | T Consensus 325 ~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-~~~~a~~~~----~~--~ 392 (392) T protein:vir:10 325 IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQ----PQ--G 392 (392) T ss_pred EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-ecccccccC----CC--C Confidence 33332110 00001122 23445566677777543 34 21110 000000110 11 1 No 62 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=84.43 E-value=0.06 Score=27.26 Aligned_cols=345 Identities=14% Similarity=0.106 Sum_probs=133.6 Q ss_pred CCcccchHH-HHHHhhh---hhhccCCCcchhhhhh---hhhhhhhhhHHHHHhhhhccccc------h-------hhhc Q lcl|NC_014661. 1 MSTQIKTKA-QLVADWK---PLLEAEGAPEIAQGKH---AIIAKMFENQEADIKSDAAYRDE------K-------LAEA 60 (524) Q Consensus 1 ~~~~~~~~~-~l~~kw~---p~l~~~~~~~~~~~~~---~~~~~~~enq~~~~~~~~~~~~~------~-------~~~~ 60 (524) |++++.... +|.++++ -+++.+..-++...+. ++-++| |.+++...++...+.. . ...+ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 999876432 2344443 3344333322222111 122211 2222111111110000 0 0001 Q ss_pred c-----cccccccc--cccccccchhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeee Q lcl|NC_014661. 61 F-----GGFLTEAE--IGGDHGYDPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALR 130 (524) Q Consensus 61 ~-----~~~l~ea~--~~~~~g~~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMR 130 (524) + +..+.... ....+ ........++++ |.+ .. |.-+ .+++.+..+..-.+++.+.||++++|-+. . T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~gg~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~ 154 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDD-LEQRAMSGLTGEDGGL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV--L 154 (392) T ss_pred HHHHHhcccccHHHHHHHhhh-hhhhhccccccCCCce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE--E Confidence 1 11111100 00000 000111112221 211 11 2222 34444455666778999999999877421 1 Q ss_pred eeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccc Q lcl|NC_014661. 131 AVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELD 210 (524) Q Consensus 131 SrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d 210 (524) .+..+.. ...| T Consensus 155 ~~~~~~~---------------~a~~------------------------------------------------------ 165 (392) T protein:vir:10 155 EKNSDMI---------------PFAE------------------------------------------------------ 165 (392) T ss_pred EeecCCc---------------ccee------------------------------------------------------ Confidence 1111000 0000 Q ss_pred ccccccccccccccccccccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014661. 211 AEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADA 289 (524) Q Consensus 211 ~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEa 289 (524) +++|- ..++- .-++++++..++.-+-...+|-||.+|- ..|.++ T Consensus 166 --------------v~E~~-----------------~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~ 210 (392) T protein:vir:10 166 --------------ITEMG-----------------EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILK 210 (392) T ss_pred --------------ecccc-----------------cccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHH Confidence 00110 01111 1234444555555555667999999994 256788 Q ss_pred HHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_014661. 290 ELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQT 369 (524) Q Consensus 290 ELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T 369 (524) +|.+-|...|...++..|+...-+. .+.|..+++ ....++... . .. T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~-------------~~~~~~~~d-------------~i~~~~~~~--l------~~ 256 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKL-------------TKQAIKSLD-------------DIKDVLNVK--L------DP 256 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhccccc-------------cccCccCHH-------------HHHHHHHHh--h------hh Confidence 9999999999999988887433211 122333221 122222111 1 12 Q ss_pred cccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCcccee Q lcl|NC_014661. 370 GRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGI 449 (524) Q Consensus 370 ~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~ 449 (524) .+-..-.+|++|.....|....- +.|. .-+..+.+. -..++|.|...|+++.... ++.+|...-+..+ T Consensus 257 ~~~~~a~~vm~~~~~~~L~~lkd-----~~G~-~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~ 324 (392) T protein:vir:10 257 AISPNAILLTNQDGFNYLDKLKD-----KDGK-YILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPL 324 (392) T ss_pred hhccCCEEEEcHHHHHHHHHhhc-----cCCC-eEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEE Confidence 22234457899999999976421 1110 001112221 1235677765666543321 1111222222223 Q ss_pred Eeeccccc-------ccccccCc------ccccceeeeeeeeceee-CC--cccccCCccccceeeccccchhhh Q lcl|NC_014661. 450 YYAPYVAL-------TPLRGADP------KNFQPVLGFKTRYGIGI-NP--LADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 450 fyaPYv~~-------~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) +|+.+-.+ .+.-.++| .+.+=.+-...|++..+ +| |.... -..+-++.. |+ | T Consensus 325 ~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-~~~~a~~~~----~~--~ 392 (392) T protein:vir:10 325 IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQ----PQ--G 392 (392) T ss_pred EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-ecccccccC----CC--C Confidence 33332110 00001122 23445566677777543 34 21110 000000110 11 1 No 63 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=84.43 E-value=0.06 Score=27.26 Aligned_cols=345 Identities=14% Similarity=0.106 Sum_probs=133.6 Q ss_pred CCcccchHH-HHHHhhh---hhhccCCCcchhhhhh---hhhhhhhhhHHHHHhhhhccccc------h-------hhhc Q lcl|NC_014661. 1 MSTQIKTKA-QLVADWK---PLLEAEGAPEIAQGKH---AIIAKMFENQEADIKSDAAYRDE------K-------LAEA 60 (524) Q Consensus 1 ~~~~~~~~~-~l~~kw~---p~l~~~~~~~~~~~~~---~~~~~~~enq~~~~~~~~~~~~~------~-------~~~~ 60 (524) |++++.... +|.++++ -+++.+..-++...+. ++-++| |.+++...++...+.. . ...+ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 999876432 2344443 3344333322222111 122211 2222111111110000 0 0001 Q ss_pred c-----cccccccc--cccccccchhhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeee Q lcl|NC_014661. 61 F-----GGFLTEAE--IGGDHGYDPQNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALR 130 (524) Q Consensus 61 ~-----~~~l~ea~--~~~~~g~~~~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMR 130 (524) + +..+.... ....+ ........++++ |.+ .. |.-+ .+++.+..+..-.+++.+.||++++|-+. . T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~gg~-~v-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~ 154 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDD-LEQRAMSGLTGEDGGL-VI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV--L 154 (392) T ss_pred HHHHHhcccccHHHHHHHhhh-hhhhhccccccCCCce-ec-chhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE--E Confidence 1 11111100 00000 000111112221 211 11 2222 34444455666778999999999877421 1 Q ss_pred eeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccc Q lcl|NC_014661. 131 AVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELD 210 (524) Q Consensus 131 SrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d 210 (524) .+..+.. ...| T Consensus 155 ~~~~~~~---------------~a~~------------------------------------------------------ 165 (392) T protein:vir:10 155 EKNSDMI---------------PFAE------------------------------------------------------ 165 (392) T ss_pred EeecCCc---------------ccee------------------------------------------------------ Confidence 1111000 0000 Q ss_pred ccccccccccccccccccccchhhhcccccCCCCCcchhhc-ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014661. 211 AEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEM-GFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADA 289 (524) Q Consensus 211 ~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEa 289 (524) +++|- ..++- .-++++++..++.-+-...+|-||.+|- ..|.++ T Consensus 166 --------------v~E~~-----------------~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~ 210 (392) T protein:vir:10 166 --------------ITEMG-----------------EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILK 210 (392) T ss_pred --------------ecccc-----------------cccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHH Confidence 00110 01111 1234444555555555667999999994 256788 Q ss_pred HHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_014661. 290 ELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQT 369 (524) Q Consensus 290 ELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T 369 (524) +|.+-|...|...++..|+...-+. .+.|..+++ ....++... . .. T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~-------------~~~~~~~~d-------------~i~~~~~~~--l------~~ 256 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKL-------------TKQAIKSLD-------------DIKDVLNVK--L------DP 256 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhccccc-------------cccCccCHH-------------HHHHHHHHh--h------hh Confidence 9999999999999988887433211 122333221 122222111 1 12 Q ss_pred cccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCcccee Q lcl|NC_014661. 370 GRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGI 449 (524) Q Consensus 370 ~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~ 449 (524) .+-..-.+|++|.....|....- +.|. .-+..+.+. -..++|.|...|+++.... ++.+|...-+..+ T Consensus 257 ~~~~~a~~vm~~~~~~~L~~lkd-----~~G~-~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~ 324 (392) T protein:vir:10 257 AISPNAILLTNQDGFNYLDKLKD-----KDGK-YILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPL 324 (392) T ss_pred hhccCCEEEEcHHHHHHHHHhhc-----cCCC-eEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEE Confidence 22234457899999999976421 1110 001112221 1235677765666543321 1111222222223 Q ss_pred Eeeccccc-------ccccccCc------ccccceeeeeeeeceee-CC--cccccCCccccceeeccccchhhh Q lcl|NC_014661. 450 YYAPYVAL-------TPLRGADP------KNFQPVLGFKTRYGIGI-NP--LADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 450 fyaPYv~~-------~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) +|+.+-.+ .+.-.++| .+.+=.+-...|++..+ +| |.... -..+-++.. |+ | T Consensus 325 ~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-~~~~a~~~~----~~--~ 392 (392) T protein:vir:10 325 IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQ----PQ--G 392 (392) T ss_pred EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-ecccccccC----CC--C Confidence 33332110 00001122 23445566677777543 34 21110 000000110 11 1 No 64 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=83.43 E-value=0.069 Score=26.97 Aligned_cols=358 Identities=14% Similarity=0.083 Sum_probs=144.3 Q ss_pred cchHHHHHH-----------hhhhhhccCCCcchhh---hh---hhhhhhhh--hhHHHHHhhhhcc------------- Q lcl|NC_014661. 5 IKTKAQLVA-----------DWKPLLEAEGAPEIAQ---GK---HAIIAKMF--ENQEADIKSDAAY------------- 52 (524) Q Consensus 5 ~~~~~~l~~-----------kw~p~l~~~~~~~~~~---~~---~~~~~~~~--enq~~~~~~~~~~------------- 52 (524) |++.++|.+ +++.+-+.....++.+ .+ +++-++|= +.+...+.+...- T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 555444443 4433222211111110 11 01111110 0111111110000 Q ss_pred ---ccchhhhcccccccccc----------cccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 53 ---RDEKLAEAFGGFLTEAE----------IGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 53 ---~~~~~~~~~~~~l~ea~----------~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQ 117 (524) .+..-....+..+.+.. ..-..+.+. .+.++++.+-..--|..+ .+++.+.+...-.+++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI--QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh--hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000000000000 000000000 001111111111224332 4666677788889999999 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||+++++-+--.+ +.. +.+ ..| T Consensus 159 ~~~~~~~~~~~~~-----~~~---~~~---------~~~----------------------------------------- 180 (415) T protein:vir:47 159 RVTNGSGKYPVVR-----QSE---VAA---------LEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEE-----ecC---Ccc---------eee----------------------------------------- Confidence 9999886431111 100 000 000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELA 276 (524) ++ | +...++.+ -++++++..++..+-...+|-||. T Consensus 181 ---------------------------v~--------E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 216 (415) T protein:vir:47 181 ---------------------------VE--------E---------LEENPELAVKPFFQLAYDINTHRGYFRISREAI 216 (415) T ss_pred ---------------------------cc--------c---------ccccccccccceeeEEeeeeeeEeeehhhHHHH Confidence 00 0 01122222 245666666666666678999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) +|-. .|.+++|.+-|+..|..-+|+.||...-+-...+-... +. .....+.-. +.. ..+-...|+. T Consensus 217 ~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~--~~-~~~~~~~~~------~~~-~~~~i~~~~~ 282 (415) T protein:vir:47 217 EDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG--FE-KEGKKLEVK------KAK-SLDDIKDAIN 282 (415) T ss_pred hhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccc--cc-cccceeccc------ccc-chHHHHHHHH Confidence 9843 56799999999999999999999876543221111000 00 000111100 000 1122333433 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEE Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFT 436 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 436 (524) .+.. .+.+.+.+|+++.....|....- +.|.- =+..+.+.. ..++|.| ++|++.++.+.. T Consensus 283 ~~~~---------~~~~~~~~v~n~~~~~~L~~lkd-----~~G~~-i~~~~~~~~-~~~~l~G-~pV~~~~~~~~~--- 342 (415) T protein:vir:47 283 LNVK---------PNYEHNVAIVSQTMFAKLDKMKD-----KLGNY-LIQPDVKEK-TQQRLLG-AKIEILPDEVLG--- 342 (415) T ss_pred hhhh---------hccCCCEEEEcHHHHHHHHHhhc-----cCCCe-eeccCcCCC-CCccccc-eeeEEecccccc--- Confidence 3332 22357789999999999975321 11110 011122211 1246777 588877665421 Q ss_pred EEEecCCCccceeEeecccc--------cccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchhh Q lcl|NC_014661. 437 IGYKGDNEMDAGIYYAPYVA--------LTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 437 vG~KG~~~~d~g~fyaPYv~--------~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) -.| +..++|+.|-. ...+...|-.+++-.+-...|++.. .+|=+-..- .+.....|-+.. T Consensus 343 --~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~----~~~~~~~~~~~~- 411 (415) T protein:vir:47 343 --QKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVI----EYDDSERGEGDL- 411 (415) T ss_pred --CCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEE----EeeccCCCCCCc- Confidence 111 11122222111 1122223556677777788898764 355111000 001111221111 Q ss_pred hhccccc Q lcl|NC_014661. 508 NSVGKNG 514 (524) Q Consensus 508 ~~~~~~~ 514 (524) +.-. T Consensus 412 ---~~~~ 415 (415) T protein:vir:47 412 ---GLEA 415 (415) T ss_pred ---cCCC Confidence 1111 No 65 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=83.43 E-value=0.069 Score=26.97 Aligned_cols=358 Identities=14% Similarity=0.083 Sum_probs=144.3 Q ss_pred cchHHHHHH-----------hhhhhhccCCCcchhh---hh---hhhhhhhh--hhHHHHHhhhhcc------------- Q lcl|NC_014661. 5 IKTKAQLVA-----------DWKPLLEAEGAPEIAQ---GK---HAIIAKMF--ENQEADIKSDAAY------------- 52 (524) Q Consensus 5 ~~~~~~l~~-----------kw~p~l~~~~~~~~~~---~~---~~~~~~~~--enq~~~~~~~~~~------------- 52 (524) |++.++|.+ +++.+-+.....++.+ .+ +++-++|= +.+...+.+...- T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 555444443 4433222211111110 11 01111110 0111111110000 Q ss_pred ---ccchhhhcccccccccc----------cccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 53 ---RDEKLAEAFGGFLTEAE----------IGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 53 ---~~~~~~~~~~~~l~ea~----------~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQ 117 (524) .+..-....+..+.+.. ..-..+.+. .+.++++.+-..--|..+ .+++.+.+...-.+++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI--QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh--hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000000000000 000000000 001111111111224332 4666677788889999999 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||+++++-+--.+ +.. +.+ ..| T Consensus 159 ~~~~~~~~~~~~~-----~~~---~~~---------~~~----------------------------------------- 180 (415) T protein:vir:46 159 RVTNGSGKYPVVR-----QSE---VAA---------LEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEE-----ecC---Ccc---------eee----------------------------------------- Confidence 9999886431111 100 000 000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELA 276 (524) ++ | +...++.+ -++++++..++..+-...+|-||. T Consensus 181 ---------------------------v~--------E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 216 (415) T protein:vir:46 181 ---------------------------VE--------E---------LEENPELAVKPFFQLAYDINTHRGYFRISREAI 216 (415) T ss_pred ---------------------------cc--------c---------ccccccccccceeeEEeeeeeeEeeehhhHHHH Confidence 00 0 01122222 245666666666666678999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) +|-. .|.+++|.+-|+..|..-+|+.||...-+-...+-... +. .....+.-. +.. ..+-...|+. T Consensus 217 ~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~--~~-~~~~~~~~~------~~~-~~~~i~~~~~ 282 (415) T protein:vir:46 217 EDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG--FE-KEGKKLEVK------KAK-SLDDIKDAIN 282 (415) T ss_pred hhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccc--cc-cccceeccc------ccc-chHHHHHHHH Confidence 9843 56799999999999999999999876543221111000 00 000111100 000 1122333433 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEE Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFT 436 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 436 (524) .+.. .+.+.+.+|+++.....|....- +.|.- =+..+.+.. ..++|.| ++|++.++.+.. T Consensus 283 ~~~~---------~~~~~~~~v~n~~~~~~L~~lkd-----~~G~~-i~~~~~~~~-~~~~l~G-~pV~~~~~~~~~--- 342 (415) T protein:vir:46 283 LNVK---------PNYEHNVAIVSQTMFAKLDKMKD-----KLGNY-LIQPDVKEK-TQQRLLG-AKIEILPDEVLG--- 342 (415) T ss_pred hhhh---------hccCCCEEEEcHHHHHHHHHhhc-----cCCCe-eeccCcCCC-CCccccc-eeeEEecccccc--- Confidence 3332 22357789999999999975321 11110 011122211 1246777 588877665421 Q ss_pred EEEecCCCccceeEeecccc--------cccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchhh Q lcl|NC_014661. 437 IGYKGDNEMDAGIYYAPYVA--------LTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 437 vG~KG~~~~d~g~fyaPYv~--------~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) -.| +..++|+.|-. ...+...|-.+++-.+-...|++.. .+|=+-..- .+.....|-+.. T Consensus 343 --~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~----~~~~~~~~~~~~- 411 (415) T protein:vir:46 343 --QKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVI----EYDDSERGEGDL- 411 (415) T ss_pred --CCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEE----EeeccCCCCCCc- Confidence 111 11122222111 1122223556677777788898764 355111000 001111221111 Q ss_pred hhccccc Q lcl|NC_014661. 508 NSVGKNG 514 (524) Q Consensus 508 ~~~~~~~ 514 (524) +.-. T Consensus 412 ---~~~~ 415 (415) T protein:vir:46 412 ---GLEA 415 (415) T ss_pred ---cCCC Confidence 1111 No 66 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=82.68 E-value=0.075 Score=26.76 Aligned_cols=284 Identities=11% Similarity=0.091 Sum_probs=124.4 Q ss_pred ccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRG 160 (524) Q Consensus 82 ~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~ 160 (524) |..+++|.+.- -+.+. .+++++.++-+..+++-|-||++..- +|...+. +.+ +.| T Consensus 1 mat~~~gg~lv-P~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~-------~~p~~~~---~~~---------a~w---- 56 (311) T protein:vir:81 1 MVALATGTFQL-PKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ-------QYMTLTA---PPR---------GEV---- 56 (311) T ss_pred CceecCCceEc-chhHHHHHHHHHHhcchhhhhcceeecCCCce-------EEEEEeC---Cce---------eEE---- Confidence 44555554421 12222 56667777888899999999865421 2211100 000 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhccccc Q lcl|NC_014661. 161 SHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGF 240 (524) Q Consensus 161 ~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~l 240 (524) ++ | T Consensus 57 ----------------------------------------------------------------v~--------E----- 59 (311) T protein:vir:81 57 ----------------------------------------------------------------VG--------E----- 59 (311) T ss_pred ----------------------------------------------------------------ee--------c----- Confidence 00 1 Q ss_pred CCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhh Q lcl|NC_014661. 241 NGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKT 320 (524) Q Consensus 241 Ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~ 320 (524) +..+++...++++++..+|.-+-....|-||.|+--. -.++-|++|.+-|+..|...|+.-++.... +.-++ T Consensus 60 ----g~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~--~~~~~- 131 (311) T protein:vir:81 60 ----GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTGA- 131 (311) T ss_pred ----CcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhcccc--CCCCc- Confidence 1122333334455555555555566899999875322 134557888888888888888887775532 00011 Q ss_pred ccccccccccceecc----cccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccc Q lcl|NC_014661. 321 GQTLTVGSKAGVFDF----QDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTP 396 (524) Q Consensus 321 ~~~~~~~~~aG~fdl----~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~ 396 (524) ...|++.. ....... ......++.-|+.+-..+.. .++..+.+|++|+....|.... T Consensus 132 -------~~~gi~~~~~~~~~~~~~~-----~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk----- 192 (311) T protein:vir:81 132 -------ALSGSPAKILDTTNIVELT-----TGTSATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQR----- 192 (311) T ss_pred -------ccccccccccccceeeeec-----ccccchHHHHHHHHHHHhhh--cCCCceEEEEcHHHHHHHHhhh----- Confidence 11121111 0000000 00011222334444444432 2346777899999999996532 Q ss_pred cccccccccccccCcceEEEEecCceEEEeeCCCCcceEE------EEEecCCCc-----c-ceeEeecccccccc--cc Q lcl|NC_014661. 397 AAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFT------IGYKGDNEM-----D-AGIYYAPYVALTPL--RG 462 (524) Q Consensus 397 ~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~------vG~KG~~~~-----d-~g~fyaPYv~~~~~--~~ 462 (524) .+.|.- -+..+.+. -..|+|.| ++|+++.+-+..-.. +...+.... | +.+++...-..... +- T Consensus 193 d~~G~~-l~~~~~~~-~~~~tl~G-~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~ 269 (311) T protein:vir:81 193 DSQGRK-LYPELGFG-TDVASFAG-LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEF 269 (311) T ss_pred ccCCCe-eecCcccc-CCCceecc-eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEecc Confidence 111100 01111111 12367887 699988765432211 111111110 1 12233322222111 11 Q ss_pred cCccc----ccc-eeee--eeeecee-eCCcccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 463 ADPKN----FQP-VLGF--KTRYGIG-INPLADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 463 ~Dp~s----~qP-~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) .|+.. ||- .++| ..|++.. .+| ..+.++.+--. + T Consensus 270 ~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~---------~a~~~l~~a~~-----~ 311 (311) T protein:vir:81 270 GDPDGLGDLKRQNQIAIRAEVVYGIGIMST---------DAFAVVRDADE-----S 311 (311) T ss_pred CCCCcchhhhhcCcEEEEEEEEeccEeecc---------cceEEEEeecc-----C Confidence 23321 222 1333 4677743 565 11223322111 1 No 67 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=81.90 E-value=0.082 Score=26.55 Aligned_cols=270 Identities=14% Similarity=0.091 Sum_probs=118.5 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +++ ... --.++....... ..+..+.....-+..... .+.... . ..|...+.+.=-.+..+|. +.... T Consensus 1 m~~---~~T-~l~d~i~Pev~~-~~v~~~~~~~l~~~~~~~---~~~~l~-g-~~G~tv~iP~~~~ig~a~~---~~~g~ 67 (274) T protein:vir:96 1 MAQ---GMT-KLTNQIVPEVLA-PMMQAELEKKLRFASFAE---IDNTLV-G-QPGDTLTFPAFIYSGDAKV---VAEGE 67 (274) T ss_pred CCc---cee-ehhheechHHHH-HHHHHHHHhhhhccccce---eccccc-C-CCCCEEEeeeecCCCcccc---ccCCC Confidence 111 000 000111000000 000000000000000000 000000 0 0011111111000112221 11112 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcC-CChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHG-MDADAELANILATEIMLEINREVIDWINYSAQVGKTGQT 323 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHG-LDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~ 323 (524) .-...++..+=. +++-+-|+ |+ |.+ -|+-+..+ -|.-.|..+-++..+..+++++++..+...... + T Consensus 68 ~i~~~~lt~~~~--~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-----~ 135 (274) T protein:vir:96 68 KIPTDILETKKR--EAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT-----V 135 (274) T ss_pred ccchhhccccee--EEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----c Confidence 223344444333 33334443 22 222 25555553 588999999999999999999999777543211 0 Q ss_pred cccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccccc Q lcl|NC_014661. 324 LTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLAR 403 (524) Q Consensus 324 ~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~ 403 (524) ....++ .+.+-....++.++. -.+++++++|.+++.|...+...+..++.... T Consensus 136 -----~~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~ 188 (274) T protein:vir:96 136 -----EADITK-------------LTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNFTRATELGD 188 (274) T ss_pred -----cccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccccccccccc Confidence 011111 222333334443322 14789999999999998765443333221110 Q ss_pred ccccccCcceEEEEecCceEEEeeCCCCcce-EEEEEecCCCccceeEeecccccccccc-cCcccccceeeeeeeecee Q lcl|NC_014661. 404 GLNTDTTKAVFAGILGGRYKVYIDQYARQDY-FTIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTRYGIG 481 (524) Q Consensus 404 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~ 481 (524) ..-.+-.+|.+.| ++||+|...+..- +++| +|. -.||.. ....++. -||++++=.+-..-+||+. T Consensus 189 ----~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA-----~~~~~~--~~~~vE~~Rd~~~~~d~i~~~~~y~~~ 255 (274) T protein:vir:96 189 ----DVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGA-----VKLITK--RDFFLETDRDPSTKTTALYSDKHYVAY 255 (274) T ss_pred ----cceeccccceecC-eEEEEeCCCCCceEEEEe-ccc-----eeeeec--CCcccccccccccccCEEEEeEEEEEE Confidence 1112234788887 8999999887433 2222 221 112221 1112232 3999999999999999986 Q ss_pred e-CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 482 I-NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 482 ~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) + || .++.++.-|- |+..| T Consensus 256 ~~~~---------~~~v~~tk~~----~~~~~ 274 (274) T protein:vir:96 256 LYDE---------SKAVKITKGS----GSLEM 274 (274) T ss_pred EEcC---------CcEEEEEcCC----ccccC Confidence 4 44 2345555542 23344 No 68 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=81.90 E-value=0.082 Score=26.55 Aligned_cols=270 Identities=14% Similarity=0.091 Sum_probs=118.5 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +++ ... --.++....... ..+..+.....-+..... .+.... . ..|...+.+.=-.+..+|. +.... T Consensus 1 m~~---~~T-~l~d~i~Pev~~-~~v~~~~~~~l~~~~~~~---~~~~l~-g-~~G~tv~iP~~~~ig~a~~---~~~g~ 67 (274) T protein:vir:95 1 MAQ---GMT-KLTNQIVPEVLA-PMMQAELEKKLRFASFAE---IDNTLV-G-QPGDTLTFPAFIYSGDAKV---VAEGE 67 (274) T ss_pred CCc---cee-ehhheechHHHH-HHHHHHHHhhhhccccce---eccccc-C-CCCCEEEeeeecCCCcccc---ccCCC Confidence 111 000 000111000000 000000000000000000 000000 0 0011111111000112221 11112 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcC-CChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHG-MDADAELANILATEIMLEINREVIDWINYSAQVGKTGQT 323 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHG-LDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~ 323 (524) .-...++..+=. +++-+-|+ |+ |.+ -|+-+..+ -|.-.|..+-++..+..+++++++..+...... + T Consensus 68 ~i~~~~lt~~~~--~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-----~ 135 (274) T protein:vir:95 68 KIPTDILETKKR--EAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT-----V 135 (274) T ss_pred ccchhhccccee--EEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----c Confidence 223344444333 33334443 22 222 25555553 588999999999999999999999777543211 0 Q ss_pred cccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccccc Q lcl|NC_014661. 324 LTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLAR 403 (524) Q Consensus 324 ~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~ 403 (524) ....++ .+.+-....++.++. -.+++++++|.+++.|...+...+..++.... T Consensus 136 -----~~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~ 188 (274) T protein:vir:95 136 -----EADITK-------------LTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNFTRATELGD 188 (274) T ss_pred -----cccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccccccccccc Confidence 011111 222333334443322 14789999999999998765443333221110 Q ss_pred ccccccCcceEEEEecCceEEEeeCCCCcce-EEEEEecCCCccceeEeecccccccccc-cCcccccceeeeeeeecee Q lcl|NC_014661. 404 GLNTDTTKAVFAGILGGRYKVYIDQYARQDY-FTIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTRYGIG 481 (524) Q Consensus 404 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~ 481 (524) ..-.+-.+|.+.| ++||+|...+..- +++| +|. -.||.. ....++. -||++++=.+-..-+||+. T Consensus 189 ----~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA-----~~~~~~--~~~~vE~~Rd~~~~~d~i~~~~~y~~~ 255 (274) T protein:vir:95 189 ----DVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGA-----VKLITK--RDFFLETDRDPSTKTTALYSDKHYVAY 255 (274) T ss_pred ----cceeccccceecC-eEEEEeCCCCCceEEEEe-ccc-----eeeeec--CCcccccccccccccCEEEEeEEEEEE Confidence 1112234788887 8999999887433 2222 221 112221 1112232 3999999999999999986 Q ss_pred e-CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 482 I-NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 482 ~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) + || .++.++.-|- |+..| T Consensus 256 ~~~~---------~~~v~~tk~~----~~~~~ 274 (274) T protein:vir:95 256 LYDE---------SKAVKITKGS----GSLEM 274 (274) T ss_pred EEcC---------CcEEEEEcCC----ccccC Confidence 4 44 2345555542 23344 No 69 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=81.48 E-value=0.085 Score=26.44 Aligned_cols=349 Identities=15% Similarity=0.148 Sum_probs=125.5 Q ss_pred CCcccchHH----HHHHhhhh-----------hhcc-CCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhccccc Q lcl|NC_014661. 1 MSTQIKTKA----QLVADWKP-----------LLEA-EGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGF 64 (524) Q Consensus 1 ~~~~~~~~~----~l~~kw~p-----------~l~~-~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~ 64 (524) +..+.+... ...++... .++. +..+.+.....+......+|-++++.. +.+.. .... .+.. T Consensus 73 l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~-~~~~~-~~~~-~~~~ 149 (458) T protein:vir:10 73 LDEKSKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEK-LVLLS-YVME-KGVF 149 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHH-HHHHH-HHHh-hccc Confidence 110000000 01111110 0000 000000000000000000011111100 00000 0000 0000 Q ss_pred ccccccccccccc-hhhhcccccccccc-ccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCc Q lcl|NC_014661. 65 LTEAEIGGDHGYD-PQNIAAGQTSGAVT-QIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAG 141 (524) Q Consensus 65 l~ea~~~~~~g~~-~~~i~est~tg~v~-~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~ 141 (524) +.+ .+.. -.....+++..... ..-|.+. .++.++.++.+..++|-++||+++..-++ -... T Consensus 150 --~~~----~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-------~~~~--- 213 (458) T protein:vir:10 150 --ETE----HGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTML-------VEPD--- 213 (458) T ss_pred --hhh----hhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEE-------EecC--- Confidence 000 0000 00000111111111 1112222 34555667788899999999988642111 1100 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccc Q lcl|NC_014661. 142 AKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGI 221 (524) Q Consensus 142 g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~ 221 (524) .+.+.|-+.+.. T Consensus 214 ---------~~~a~~v~e~~~----------------------------------------------------------- 225 (458) T protein:vir:10 214 ---------AGKATWVAASTY----------------------------------------------------------- 225 (458) T ss_pred ---------Ccceeecccccc----------------------------------------------------------- Confidence 001111111000 Q ss_pred cccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|NC_014661. 222 LVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIML 301 (524) Q Consensus 222 ~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEIml 301 (524) .-.+... ..-..+++++++.++.-+....+|-||.+|-- .|.+++|.+-|..-|.. T Consensus 226 ------~~~~~~~--------------~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i~~ 281 (458) T protein:vir:10 226 ------GTDTTTG--------------EEVKGALKEIHFSTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAV 281 (458) T ss_pred ------ccccccc--------------ccccccceeeEeeeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHHHH Confidence 0000000 00112235556666666666789999998832 45688899999999999 Q ss_pred HhhHHHHhhHhhhhhhhhhccccccccccceecccccc------cccccchHHHHHHHHHHHHHHHHHHHHhhccccCcc Q lcl|NC_014661. 302 EINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPI------DVRGARWAGESFKALLFQIDKESAEIARQTGRGAGN 375 (524) Q Consensus 302 EINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~------d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn 375 (524) -||+.||..- | .+.|.|++...... +..+..-..-.+..|. ++-+.+. ..+.... T Consensus 282 ~~d~~~l~G~------G-------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~----~~~~~l~--~~~~~~~ 342 (458) T protein:vir:10 282 SIEEAFMTGD------G-------SGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTIS----KLRRKLG--RHGLKLS 342 (458) T ss_pred HHHHHhhcCC------C-------CCccceeeecccccccceeecccccccccccHHHHH----HHHHhhh--hhhcCCC Confidence 9988887520 0 01233443321110 0000000001122222 2222222 1222456 Q ss_pred EEEeCHHHHHHHhcCCc----cccccccccccccccccCcceEEEEecCceEEEeeCCCCc-----ceEEEEEecCCCcc Q lcl|NC_014661. 376 FIIASRNVVNVLASVDT----SVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ-----DYFTIGYKGDNEMD 446 (524) Q Consensus 376 ~~v~S~~va~~L~~~~~----~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~d 446 (524) .+|+++.....|....- ..+.+.. . ....+.+ -++|.| ++|+++.+.|. +.++..++ + T Consensus 343 ~~v~~~~~~~~l~~lkd~~G~~i~~~~~--~-~~~~~~~----~~~l~G-~pv~~~~~~p~~~~~~~~~~~~f~-~---- 409 (458) T protein:vir:10 343 KLVLIVSMDAYYDLLEDEEWQDVAQVGN--D-SVKLQGQ----VGRIYG-LPVVVSEYFPAKANSAEFAVIVYK-D---- 409 (458) T ss_pred EEEEcHHHHHHHHhhcccCCceeecccc--c-cccccCc----Cceecc-eeeEEccccccccCCcceEEEEec-c---- Confidence 78999999998865321 1111100 0 0000111 135776 79999988654 22222221 1 Q ss_pred ceeEeecccccccccccCcccccceeeee--eeece-eeCCcccccCCccccceeeccccchhhhh Q lcl|NC_014661. 447 AGIYYAPYVALTPLRGADPKNFQPVLGFK--TRYGI-GINPLADTAAQQPAGNARIANGMPSIANS 509 (524) Q Consensus 447 ~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~--tRY~l-~~nP~~~~~~~~~~~~~~~~~g~~~~a~~ 509 (524) +.++... ..+....||-+-...++|. .|+|+ +.+| ..+.+ ..+|.+ T Consensus 410 -~~~~~~~--~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~---------~a~v~-----~~~aa~ 458 (458) T protein:vir:10 410 -NFVMPRQ--RAVTVERERQAGKQRDAYYVTQRVNLQRYFA---------NGVVS-----GTYAAS 458 (458) T ss_pred -cEEEEEe--eceEEEeecccCCCceEEEEEEEecceEecc---------cceEE-----EeeccC Confidence 0111101 1111123655445556666 46653 3445 12222 112211 No 70 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=80.96 E-value=0.09 Score=26.32 Aligned_cols=361 Identities=17% Similarity=0.144 Sum_probs=137.6 Q ss_pred cchHHHHHHhhhhhhc-----------c---CCCcchhhhh---hhhhhhhhhhHH--HHHhhh-------h-------- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLE-----------A---EGAPEIAQGK---HAIIAKMFENQE--ADIKSD-------A-------- 50 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~-----------~---~~~~~~~~~~---~~~~~~~~enq~--~~~~~~-------~-------- 50 (524) |++.++|.++=+-+++ . +..-+....+ +.+-+++=+.|+ +.+.+. . T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 4444444333333221 1 1100000000 011111111000 011110 0 Q ss_pred -ccccchhhhcccccccc-----ccccc-----ccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeee Q lcl|NC_014661. 51 -AYRDEKLAEAFGGFLTE-----AEIGG-----DHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 51 -~~~~~~~~~~~~~~l~e-----a~~~~-----~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQ 117 (524) ......-...++..+.+ .+... ..+.+......++++|.. .-|.-+ .+++.+-+..+-.+++.|+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~--~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccc--cCcHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000001111111 11000 000010000011122222 124222 4566666788889999999 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++.++-+--. +.... . ...|- T Consensus 159 ~~~~~~~~~~~~--~~~~~------~---------~~~~v---------------------------------------- 181 (415) T protein:vir:94 159 RVTNGSGKYPVV--RQSEV------A---------ALEKV---------------------------------------- 181 (415) T ss_pred eccCCceeEEEE--eecCC------c---------cceec---------------------------------------- Confidence 998876533111 11000 0 00000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) ++| ++. ...+...|.+..|++.|. +-.-.+|-||.+ T Consensus 182 ----------------------------~Eg-----~~~----~~~~~~~~~~i~~~~~k~-------~~~~~is~ell~ 217 (415) T protein:vir:94 182 ----------------------------EEL-----EEN----PELAVKPFFQLAYDINTH-------RGYFRISREAIE 217 (415) T ss_pred ----------------------------ccc-----ccc----cccccccceeeEeeheee-------eeechhhHHHHh Confidence 000 000 000112244444444444 444569999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccce-ecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGV-FDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~-fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) |-- .|.+++|.+-|...|..-+|+.||...-.-.-.+- .+.....++ ...++.. ..+....++. T Consensus 218 ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~----~~~~~~~~~~~~~~~~~-------~~~~i~~~~~ 282 (415) T protein:vir:94 218 DAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGST----SSGFEKEGKKLEVKKAK-------SLDDIKDAIN 282 (415) T ss_pred hch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc----ccccccccccccccccc-------chHHHHHHHH Confidence 864 46799999999999999999999886543221110 000000100 0000000 1122233333 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc--- Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD--- 433 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--- 433 (524) .+. . ..+ +.+.+|++|.....|..... +.|.- =+..+.+.. ..++|.| ++|++.+..+.. T Consensus 283 ~~~-------~-~~~-~~~~~vmn~~~~~~l~~lkd-----~~G~~-l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~ 345 (415) T protein:vir:94 283 LNV-------K-PNY-EHNVAIVSQTMFAKLDKMKD-----KLGNY-LIQPDVKEK-TQQRLLG-AKIEILPDEVLGQKG 345 (415) T ss_pred hhh-------h-hcc-CCCEEEEcHHHHHHHHHhhc-----cCCCe-eeccCcCCC-CCceecc-eeeEEecccccCCCC Confidence 222 1 223 57788999999999975321 11110 011122211 1246777 588887765321 Q ss_pred -e-EEEEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 434 -Y-FTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 434 -y-~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) . +++|--.. . +..... ....+...|-.+++-.+-...|+++. .+|=+-..-. +.....|-+. . T Consensus 346 ~~~i~~gd~~~----~-~~~~~~-~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~----~~~~~~~~~~----~ 411 (415) T protein:vir:94 346 NNTLIIGNLKD----A-IVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE----YDDSERGEGD----L 411 (415) T ss_pred ccEEEEEehhc----c-EEEEee-cceEEEEeccccCceEEEEEEEeccEEeccccEEEEE----EeccCCCCCc----c Confidence 1 23331000 0 000000 11122233556677777778888864 3442111100 0111121111 1 Q ss_pred cccc Q lcl|NC_014661. 511 GKNG 514 (524) Q Consensus 511 ~~~~ 514 (524) +.-. T Consensus 412 ~~~~ 415 (415) T protein:vir:94 412 GLEA 415 (415) T ss_pred ccCC Confidence 1111 No 71 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=79.45 E-value=0.1 Score=25.96 Aligned_cols=280 Identities=14% Similarity=0.062 Sum_probs=128.0 Q ss_pred ccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCC Q lcl|NC_014661. 61 FGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIA 139 (524) Q Consensus 61 ~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~ 139 (524) |-..-. .+.++. .|+.+.. ..-+.+. .+++++.++.+..+++-+-||++.+- +|...+. T Consensus 1 ma~~~~----------~~~~~~-~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~ip~~~~- 60 (304) T protein:vir:10 1 MATPTY----------TPGNVI-LSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-------KFTYLAK- 60 (304) T ss_pred Cccccc----------cccccc-ccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEeC- Confidence 111111 111111 1111111 1222232 46666677778888888888876542 1211100 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccc Q lcl|NC_014661. 140 AGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDA 219 (524) Q Consensus 140 ~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~ 219 (524) +.. +.| T Consensus 61 --~~~---------a~~--------------------------------------------------------------- 66 (304) T protein:vir:10 61 --GVG---------AYW--------------------------------------------------------------- 66 (304) T ss_pred --Ccc---------eEE--------------------------------------------------------------- Confidence 000 000 Q ss_pred cccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_014661. 220 GILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEI 299 (524) Q Consensus 220 g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEI 299 (524) + +| +.++++-.-++++++++.|..+-...+|-||.+|- .+|.|++|.+-|...| T Consensus 67 -----v--------~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i 120 (304) T protein:vir:10 67 -----V--------SE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAF 120 (304) T ss_pred -----e--------ec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHH Confidence 0 01 11223334456677777777777888999999985 4678999999999999 Q ss_pred HHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEe Q lcl|NC_014661. 300 MLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIA 379 (524) Q Consensus 300 mlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~ 379 (524) ...||+.+|..--...- +.+.+.+++.-...... ........+.-|+++-..|... +.....++| T Consensus 121 a~~~d~~~l~G~g~~~~--------~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~ 185 (304) T protein:vir:10 121 YKAFDQAVIFGTKSPYN--------TSTSGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDE--ELDPNGVLT 185 (304) T ss_pred HHHHHhhheeccCCCcc--------ccccccccccccccccc-----ccccccchHHHHHHHHHHhhhc--cCCcCEEEE Confidence 99999988764221110 00011111110000000 0011122344455555555442 224567899 Q ss_pred CHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc------------eEEEEEecCCCccc Q lcl|NC_014661. 380 SRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD------------YFTIGYKGDNEMDA 447 (524) Q Consensus 380 S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~d~ 447 (524) ++.+...|..... +.|.- -+..+ .|+|.| ++||++++.+.+ ++++|..+..+.+ T Consensus 186 ~~~~~~~L~~lkd-----~~G~~-l~~~~------~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~- 251 (304) T protein:vir:10 186 TRSFRSKMRNALD-----ANDRP-LFDAN------GNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA- 251 (304) T ss_pred cHHHHHHHHHhhc-----cCCcE-eecCC------Cccccc-eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE- Confidence 9999999975321 11110 01111 156777 699988886432 2333333222110 Q ss_pred eeEeecccccc--cccccCcc-----ccc---ceeeeeeeeceee-CCcccccCCccccceeecccc Q lcl|NC_014661. 448 GIYYAPYVALT--PLRGADPK-----NFQ---PVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGM 503 (524) Q Consensus 448 g~fyaPYv~~~--~~~~~Dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~ 503 (524) ...+.. +....|++ -|+ =.+=+..||++.+ || ..+.++..-. T Consensus 252 -----~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~---------~a~~~l~~a~ 304 (304) T protein:vir:10 252 -----ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP---------EAFATLKPTE 304 (304) T ss_pred -----EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc---------cceEEEEecC Confidence 001110 11111222 132 2333356777653 34 1233444422 No 72 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=79.45 E-value=0.1 Score=25.96 Aligned_cols=280 Identities=14% Similarity=0.062 Sum_probs=128.0 Q ss_pred ccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCC Q lcl|NC_014661. 61 FGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIA 139 (524) Q Consensus 61 ~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~ 139 (524) |-..-. .+.++. .|+.+.. ..-+.+. .+++++.++.+..+++-+-||++.+- +|...+. T Consensus 1 ma~~~~----------~~~~~~-~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~ip~~~~- 60 (304) T protein:vir:94 1 MATPTY----------TPGNVI-LSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-------KFTYLAK- 60 (304) T ss_pred Cccccc----------cccccc-ccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEeC- Confidence 111111 111111 1111111 1222232 46666677778888888888876542 1211100 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccc Q lcl|NC_014661. 140 AGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDA 219 (524) Q Consensus 140 ~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~ 219 (524) +.. +.| T Consensus 61 --~~~---------a~~--------------------------------------------------------------- 66 (304) T protein:vir:94 61 --GVG---------AYW--------------------------------------------------------------- 66 (304) T ss_pred --Ccc---------eEE--------------------------------------------------------------- Confidence 000 000 Q ss_pred cccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_014661. 220 GILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEI 299 (524) Q Consensus 220 g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEI 299 (524) + +| +.++++-.-++++++++.|..+-...+|-||.+|- .+|.|++|.+-|...| T Consensus 67 -----v--------~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i 120 (304) T protein:vir:94 67 -----V--------SE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAF 120 (304) T ss_pred -----e--------ec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHH Confidence 0 01 11223334456677777777777888999999985 4678999999999999 Q ss_pred HHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEe Q lcl|NC_014661. 300 MLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIA 379 (524) Q Consensus 300 mlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~ 379 (524) ...||+.+|..--...- +.+.+.+++.-...... ........+.-|+++-..|... +.....++| T Consensus 121 a~~~d~~~l~G~g~~~~--------~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~ 185 (304) T protein:vir:94 121 YKAFDQAVIFGTKSPYN--------TSTSGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDE--ELDPNGVLT 185 (304) T ss_pred HHHHHhhheeccCCCcc--------ccccccccccccccccc-----ccccccchHHHHHHHHHHhhhc--cCCcCEEEE Confidence 99999988764221110 00011111110000000 0011122344455555555442 224567899 Q ss_pred CHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc------------eEEEEEecCCCccc Q lcl|NC_014661. 380 SRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD------------YFTIGYKGDNEMDA 447 (524) Q Consensus 380 S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~d~ 447 (524) ++.+...|..... +.|.- -+..+ .|+|.| ++||++++.+.+ ++++|..+..+.+ T Consensus 186 ~~~~~~~L~~lkd-----~~G~~-l~~~~------~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~- 251 (304) T protein:vir:94 186 TRSFRSKMRNALD-----ANDRP-LFDAN------GNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYA- 251 (304) T ss_pred cHHHHHHHHHhhc-----cCCcE-eecCC------Cccccc-eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEE- Confidence 9999999975321 11110 01111 156777 699988886432 2333333222110 Q ss_pred eeEeecccccc--cccccCcc-----ccc---ceeeeeeeeceee-CCcccccCCccccceeecccc Q lcl|NC_014661. 448 GIYYAPYVALT--PLRGADPK-----NFQ---PVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGM 503 (524) Q Consensus 448 g~fyaPYv~~~--~~~~~Dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~ 503 (524) ...+.. +....|++ -|+ =.+=+..||++.+ || ..+.++..-. T Consensus 252 -----~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~---------~a~~~l~~a~ 304 (304) T protein:vir:94 252 -----ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP---------EAFATLKPTE 304 (304) T ss_pred -----EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc---------cceEEEEecC Confidence 001110 11111222 132 2333356777653 34 1233444422 No 73 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=79.18 E-value=0.11 Score=25.91 Aligned_cols=358 Identities=10% Similarity=0.030 Sum_probs=130.3 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhh--hhccccchhhhcccccccccccccccccch Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKS--DAAYRDEKLAEAFGGFLTEAEIGGDHGYDP 78 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~--~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~ 78 (524) |.-+++.-+++.+|=+-+.+....-....-+.+....+++.-..++.+ ..++++.....-....|..-+- -.|+ T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~---~~~~- 76 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGATSEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEI---KFFN- 76 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHH---HHHH- Confidence 888888877777776655543221111111112222222222222211 0111111111111111111100 0010 Q ss_pred hhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccc Q lcl|NC_014661. 79 QNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFS 157 (524) Q Consensus 79 ~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FS 157 (524) ..+..++.++.-...-+.++ .+.++....-.-..+|-|+|++|.+-++ +.... +.+.|. T Consensus 77 ~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~~~------~~~~~--------------~~a~w~ 136 (377) T protein:vir:98 77 DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKAL------TAETS--------------GTAVWG 136 (377) T ss_pred HHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcceEEE------EecCC--------------cceeEe Confidence 11111221111111112222 2222222222445678888887654322 11000 001111 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcc Q lcl|NC_014661. 158 GRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQ 237 (524) Q Consensus 158 G~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l 237 (524) +.. +.+. T Consensus 137 ~e~------------------------------------------------------------------~~~~------- 143 (377) T protein:vir:98 137 DIF------------------------------------------------------------------GEIK------- 143 (377) T ss_pred ecc------------------------------------------------------------------cccC------- Confidence 000 0000 Q ss_pred cccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh------- Q lcl|NC_014661. 238 EGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDW------- 310 (524) Q Consensus 238 ~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~------- 310 (524) ......|.++.|..-|... ....|-||.+|- ..|.|+.|.+-|+..|..-++..||.. T Consensus 144 ----~~~~~~f~~i~l~~~kl~a-------~~~is~elL~ds----~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~ 208 (377) T protein:vir:98 144 ----GQLKQAFKEQDFSQFKLTA-------FVVIPKDALKFG----PKWIKQFITEQLKEAIAVALELAIVKGDGLLQPV 208 (377) T ss_pred ----cccCccceeEeecceeEEe-------eecccHHhhhcc----HhHHHHHHHHHHHHHHHHHHhhceEeccCCCcce Confidence 1123456677777776654 245677777663 567899999999999999999998862 Q ss_pred -Hhhhhhhhhhccccccccccceecc-cccccccccchHHHHH-HHHHHHHHHHHHHHHhhccccCccEEE-eCHHHHHH Q lcl|NC_014661. 311 -INYSAQVGKTGQTLTVGSKAGVFDF-QDPIDVRGARWAGESF-KALLFQIDKESAEIARQTGRGAGNFII-ASRNVVNV 386 (524) Q Consensus 311 -l~~~A~~~k~~~~~~~~~~aG~fdl-~~~~d~~~~~~a~E~~-r~L~~~i~~~a~~I~~~T~rg~gn~~v-~S~~va~~ 386 (524) |.+....+.+... +.....++.+. +...|... --.+.+ +.+...+.+....--|.-+.+.|+++. +.|..+-. T Consensus 209 Gil~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~ 285 (377) T protein:vir:98 209 GLLKDLSQPTVDQS-TGRDITTYKTDKEAIADLSD--LTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWA 285 (377) T ss_pred eeeecccccccccc-cccccccccchhhhHhhhhh--hchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhh Confidence 2221111111000 00011111100 00001000 000111 112222222322222444567788765 45543322 Q ss_pred HhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCC--ccceeEeecccccccccccC Q lcl|NC_014661. 387 LASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNE--MDAGIYYAPYVALTPLRGAD 464 (524) Q Consensus 387 L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~--~d~g~fyaPYv~~~~~~~~D 464 (524) ++ +...... ..| .+...|.=.++|..+.+.|..-++.|.....- ...++-+..|.+..+. T Consensus 286 ~~--p~~~~~~----~~G--------~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~---- 347 (377) T protein:vir:98 286 LE--AQFTSRN----QFG--------EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAM---- 347 (377) T ss_pred cc--ccccccC----CCC--------ccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhh---- Confidence 21 1110000 001 11223332344555666666556666542211 0011222222111111 Q ss_pred cccccceeeeeee--e-ceeeCCcccccCCccccceeeccc Q lcl|NC_014661. 465 PKNFQPVLGFKTR--Y-GIGINPLADTAAQQPAGNARIANG 502 (524) Q Consensus 465 p~s~qP~~~~~tR--Y-~l~~nP~~~~~~~~~~~~~~~~~g 502 (524) .-.++|+.+ + |-.+||=+ -.+..++-| T Consensus 348 ----~d~~~f~~~~r~dg~~~~~~a-------~~vl~i~~~ 377 (377) T protein:vir:98 348 ----EDLQLYLTKNYFYGKAKDNHT-------AALLTLAGG 377 (377) T ss_pred ----cCceEEEEEEEEcCEEeccCc-------EEEEEEecC Confidence 112344443 2 33444421 223344444 No 74 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=79.11 E-value=0.11 Score=25.89 Aligned_cols=284 Identities=13% Similarity=0.130 Sum_probs=126.5 Q ss_pred hccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccc Q lcl|NC_014661. 81 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGR 159 (524) Q Consensus 81 i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~ 159 (524) ++ .++++.+ -..|.+. .+++++.+..+..+++.+.||++.+.-|. ++.. +.+| .| T Consensus 1 m~-t~t~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~------~~~a---------~w--- 56 (303) T protein:vir:97 1 MG-TETSKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTL------DSDI---------DV--- 56 (303) T ss_pred Cc-ccCCCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEec------Ccce---------EE--- Confidence 22 2233332 2334444 56777778888999999999976543331 1110 0000 00 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccc Q lcl|NC_014661. 160 GSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEG 239 (524) Q Consensus 160 ~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~ 239 (524) +++| T Consensus 57 -----------------------------------------------------------------v~E~----------- 60 (303) T protein:vir:97 57 -----------------------------------------------------------------VAEN----------- 60 (303) T ss_pred -----------------------------------------------------------------eecC----------- Confidence 0010 Q ss_pred cCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhh Q lcl|NC_014661. 240 FNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGK 319 (524) Q Consensus 240 lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k 319 (524) ...++-..+++.++..+|.-+-...+|-||.|.... ..++-+++|.+-|+..|...|+..+|........ T Consensus 61 ------~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g--- 130 (303) T protein:vir:97 61 ------GKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK--- 130 (303) T ss_pred ------ccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc--- Confidence 112222334455556666666667899999863322 2466788999999999999999988876531111 Q ss_pred hccccccccccceecccc--cccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccc Q lcl|NC_014661. 320 TGQTLTVGSKAGVFDFQD--PIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPA 397 (524) Q Consensus 320 ~~~~~~~~~~aG~fdl~~--~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~ 397 (524) +++...|...+.. ..-+..+ ....++.-|.++-+.+.. ..+..+.+|++|.....|.... . T Consensus 131 -----~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk-----d 193 (303) T protein:vir:97 131 -----KASDVIGTNHFDSKVTQVVKFT-----ESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTALAKVT-----N 193 (303) T ss_pred -----cccccccccccccccccccccc-----cccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhh-----c Confidence 0111111111110 0000000 011233344444444432 2235677999999999886422 1 Q ss_pred ccccccccccccCcceEEEEecCceEEEeeCCCCcce-----EEEEEecCCCccceeEeeccc--ccccccccCccc--- Q lcl|NC_014661. 398 AQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDY-----FTIGYKGDNEMDAGIYYAPYV--ALTPLRGADPKN--- 467 (524) Q Consensus 398 a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-----~~vG~KG~~~~d~g~fyaPYv--~~~~~~~~Dp~s--- 467 (524) +.|.-- +..+.....-.|+|.| ++|+++.+-+... -.+.+-|+- ...+.+...- ++...+..|++. T Consensus 194 ~~g~~~-~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~Gdf--~~~~~~~~~~~~~~~~~~~~~~d~~~~ 269 (303) T protein:vir:97 194 GEMGPK-MYPELAWGANPDSING-LKSSVNTTVGAGADEAESKDLVIIGDF--ESMFKWGYAKQIPMEIIKYGDPDNSGK 269 (303) T ss_pred cCCCeE-EecCccCCCCCceecc-eeeEEecccCCccccCCCccEEEEeec--cccEEEEEecCcEEEEeeccCCCCcch Confidence 111100 1111111111257887 7999988754311 001111211 0111122111 111122223321 Q ss_pred --ccc-eeee--eeeecee-eCCcccccCCccccceeeccccc Q lcl|NC_014661. 468 --FQP-VLGF--KTRYGIG-INPLADTAAQQPAGNARIANGMP 504 (524) Q Consensus 468 --~qP-~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~ 504 (524) |+- .++| ..||+.. .|| ..+.++.++-= T Consensus 270 ~~~~~n~~~~r~~~r~~~~v~~p---------~af~~l~~~~~ 303 (303) T protein:vir:97 270 DLKGYNQIYLRAEAYIGWGILDA---------KSFARVTKGEV 303 (303) T ss_pred hhhhcCcEEEEEEEEeccEeecc---------cceEEeeCCCC Confidence 111 1333 4466543 344 12233333210 No 75 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=78.40 E-value=0.11 Score=25.74 Aligned_cols=302 Identities=11% Similarity=0.031 Sum_probs=125.9 Q ss_pred hhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcchhhHHHHHHHhhhhhhceeee Q lcl|NC_014661. 38 MFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 38 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li~l~Rra~~nLIa~DI~GVQ 117 (524) |.=|-+|.. +++. .++.+.+..+++++...--.+.+=.+++.+.+..+-..++-+. T Consensus 1 ~~~~~~r~~--------~~~~----------------~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~ 56 (326) T protein:vir:42 1 MAVNPDRTT--------PFLG----------------VNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI 56 (326) T ss_pred CCCCccchh--------hhcC----------------cchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhccee Confidence 222222210 0000 0011111111111111001111114555555666777788888 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) ||++++.- |..... +. .+.| T Consensus 57 ~~~~~~~~-------~p~~~~---~~---------~a~~----------------------------------------- 76 (326) T protein:vir:42 57 PMGTTGQK-------IPHWTG---DV---------SASW----------------------------------------- 76 (326) T ss_pred eccCCceE-------EEEEeC---Cc---------ceEE----------------------------------------- Confidence 98876521 111000 00 0000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) + +| +..++|-..+++++++.+|...-.-.+|-||.+ T Consensus 77 ---------------------------v--------~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~ 112 (326) T protein:vir:42 77 ---------------------------I--------GE---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVR 112 (326) T ss_pred ---------------------------e--------cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHh Confidence 0 01 122344456677888888888888899999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHH Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQ 357 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~ 357 (524) |- ..|.+++|.+-|+..|...+++.+|..--+ +...+..+.....++.... .... +.......+. T Consensus 113 ~s----~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs----~~p~gi~~~~~~~~~~~~~-~~~~----~~~~~~~~~~-- 177 (326) T protein:vir:42 113 AN----PANYLGTMRTKVATAFAMAFDNAAINGTDS----PFPTFLAQTTKEVSLVDPD-GTGS----NADLTVYDAV-- 177 (326) T ss_pred cC----HHHHHHHHHHHHHHHHHHHHHHHhhcccCC----Cccccccccccccceeecc-cccc----cccchhHHHH-- Confidence 84 367899999999999999999999853110 0000000000000000000 0000 0000111111 Q ss_pred HHHHHHHHHhhccccCccEEEeCHHHHHHHhcCC----ccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc Q lcl|NC_014661. 358 IDKESAEIARQTGRGAGNFIIASRNVVNVLASVD----TSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD 433 (524) Q Consensus 358 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~----~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 433 (524) +..+..... ..+...+.+|+++.....|.... ...+.+. ...........++|.| ++|+++++.+.+ T Consensus 178 ~~~~~~~~~--~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~------~~~~~~~~~~~~~l~G-~pv~~~~~~~~~ 248 (326) T protein:vir:42 178 AVNALSLLV--NAGKKWTHTLLDDITEPILNGAKDKSGRPLFIES------TYTEENSPFRLGRIVA-RPTILSDHVASG 248 (326) T ss_pred HHHHHhhhh--hhccCccEEEEeHHHHHHHHHhhccCCceeeccc------cccCccccccCceeee-eeEEEcCCCCCC Confidence 111111111 22335778899999999997532 1111111 0001111223456777 699999987654 Q ss_pred eEEEEEecCCCccceeEeecccccccccc---------cCccc-----cc---ceeeeeeeecee-eCC--cccccCCcc Q lcl|NC_014661. 434 YFTIGYKGDNEMDAGIYYAPYVALTPLRG---------ADPKN-----FQ---PVLGFKTRYGIG-INP--LADTAAQQP 493 (524) Q Consensus 434 y~~vG~KG~~~~d~g~fyaPYv~~~~~~~---------~Dp~s-----~q---P~~~~~tRY~l~-~nP--~~~~~~~~~ 493 (524) =. +++-|+-. -+||...-.. .++. .|+.. || =.+=...|++.. .+| |+. ..... T Consensus 249 ~~-~~~~Gd~s---~~~~~~~~~~-~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~-l~~~~ 322 (326) T protein:vir:42 249 TV-VGYQGDFR---QLVWGQVGGL-SFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVK-LTNVD 322 (326) T ss_pred ce-EEEEeecc---eEEEEEecce-EEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEE-Eeecc Confidence 22 12222211 1222222111 1111 11111 22 233345666653 344 221 11111 Q ss_pred ccceee Q lcl|NC_014661. 494 AGNARI 499 (524) Q Consensus 494 ~~~~~~ 499 (524) +. +. T Consensus 323 ~~--~~ 326 (326) T protein:vir:42 323 AT--EA 326 (326) T ss_pred cc--CC Confidence 10 11 No 76 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=77.93 E-value=0.12 Score=25.64 Aligned_cols=338 Identities=14% Similarity=0.139 Sum_probs=126.7 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhh--HHHHH-hhhhccccchhhhcccccccccccccccccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFEN--QEADI-KSDAAYRDEKLAEAFGGFLTEAEIGGDHGYD 77 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~en--q~~~~-~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~ 77 (524) ......+.+ ++++=.- ++. +|...++.+ ....+. .++.- .+.....+.....+|..++...+ . T Consensus 61 ~~~~~~~~e-~~~~~~~-~~~----ei~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e-------~ 126 (425) T protein:vir:10 61 VKAGLPTSD-ALAKVDK-VSA----DLEALQAAV-DEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGD-------V 126 (425) T ss_pred HHhhhccHH-HHHHHHH-HHH----HHHHHHHHH-HHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhh-------h Confidence 000011111 1111000 000 111111111 000000 00000 00011111111222332222111 1 Q ss_pred hhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccc Q lcl|NC_014661. 78 PQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMF 156 (524) Q Consensus 78 ~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~F 156 (524) ...+.+++++..-.-.-+.+. .+++.+-...+..++|.|.||+++..-+. -... + +.+.| T Consensus 127 ~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~-------~~~~---~---------~~a~w 187 (425) T protein:vir:10 127 QAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKL-------FNMG---G---------TTSGW 187 (425) T ss_pred HHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEE-------EEcC---C---------cceee Confidence 111222222111001112222 25555556778888999999987654221 1100 0 00011 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhc Q lcl|NC_014661. 157 SGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAEL 236 (524) Q Consensus 157 SG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~ 236 (524) -+ +|-. ..| T Consensus 188 v~--------------------------------------------------------------------E~~~--~~~- 196 (425) T protein:vir:10 188 VG--------------------------------------------------------------------EASQ--RPQ- 196 (425) T ss_pred ec--------------------------------------------------------------------cccc--ccc- Confidence 00 0000 000 Q ss_pred ccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhh Q lcl|NC_014661. 237 QEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQ 316 (524) Q Consensus 237 l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~ 316 (524) +....|.++.|++.|..+ ...+|-||.+|- .+|.+++|.+-|+..|..-+|+-||..= T Consensus 197 ------~~~~~f~~v~~~~~k~~~-------~i~iS~ell~ds----~~~l~~~i~~~la~ai~~~~d~~~l~G~----- 254 (425) T protein:vir:10 197 ------TNAATFQPLSFASGEIYA-------NPAATQQILDDA----EIDLESWLATEVQTEFAKQEGKAFLAGD----- 254 (425) T ss_pred ------ccccccceeeeeheeeEe-------ehHhHHHHHhcc----hhHHHHHHHHHHHHHHHHHHHhhhhccc----- Confidence 011235666666666544 566999999985 3567899999999999999999888530 Q ss_pred hhhhccccccccccceecccc---------------cccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCH Q lcl|NC_014661. 317 VGKTGQTLTVGSKAGVFDFQD---------------PIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASR 381 (524) Q Consensus 317 ~~k~~~~~~~~~~aG~fdl~~---------------~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~ 381 (524) | .+.|.|++.... ........-..+....|+..+.. .+-+...+|+++ T Consensus 255 -G-------~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~---------~~~~~a~~vmn~ 317 (425) T protein:vir:10 255 -G-------TNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPS---------AFTGNARFAMNR 317 (425) T ss_pred -C-------CCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhh---------hhccCCEEEEch Confidence 0 011222221100 00000000011223334333221 222344678999 Q ss_pred HHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCc-----ceEEEEEecCCCccceeEeecccc Q lcl|NC_014661. 382 NVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ-----DYFTIGYKGDNEMDAGIYYAPYVA 456 (524) Q Consensus 382 ~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~d~g~fyaPYv~ 456 (524) .....|.... .++|.- =+..+.+.. ..++|.| ++|+++.+.|. +-|++| +-.. ..+.+. . T Consensus 318 ~~~~~L~~lk-----D~~G~~-l~~~~~~~g-~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G---d~~~--~~~i~~--~ 382 (425) T protein:vir:10 318 NTQRQVRKLK-----DGQGNY-LWQPSYVAG-QPATLAG-YPVTEVPDMPDVAANSTPILFG---DFQQ--TYLIID--R 382 (425) T ss_pred HHHHHHHHhh-----cCCCce-eeccCccCC-CCceecc-eeeEEecCcCCccCCccEEEEE---ehhc--cEEEEE--e Confidence 9999987532 122210 011111111 1257877 69999887652 334443 1110 011111 1 Q ss_pred cccccccCcccccceee--eeeeecee-eCCcccccCCccccceeecc Q lcl|NC_014661. 457 LTPLRGADPKNFQPVLG--FKTRYGIG-INPLADTAAQQPAGNARIAN 501 (524) Q Consensus 457 ~~~~~~~Dp~s~qP~~~--~~tRY~l~-~nP~~~~~~~~~~~~~~~~~ 501 (524) ..+....||-.-+-.++ ...||+.. .+|-+...-+- ++++ T Consensus 383 ~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~-----~as~ 425 (425) T protein:vir:10 383 IGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKV-----AASE 425 (425) T ss_pred cceEEEecccccCCcEEEEEEEEeccEeecccceEEEEe-----eccC Confidence 11111223333223233 44567653 45533211110 1111 No 77 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=76.70 E-value=0.13 Score=25.39 Aligned_cols=275 Identities=10% Similarity=0.044 Sum_probs=123.8 Q ss_pred ccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccc Q lcl|NC_014661. 71 GGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPM 149 (524) Q Consensus 71 ~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~f 149 (524) =...++++.+...++ ++.. ..-+.+. .+++.+.+.-+-..++.+.||++++...+-.. .. +.+ T Consensus 1 m~~~~~~~~~~~~t~-~~~~-lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~-------~~~----- 64 (297) T protein:vir:95 1 MTVQTFNPENVLVSQ-KKDG-TLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQ--TD-------GIS----- 64 (297) T ss_pred CCccccccccccccC-CCcc-eechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEE--cC-------Cce----- Confidence 122233333333222 2211 1222222 45565666777888899999988876543111 00 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccc Q lcl|NC_014661. 150 YAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGM 229 (524) Q Consensus 150 nEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm 229 (524) +.|- T Consensus 65 ----a~~v------------------------------------------------------------------------ 68 (297) T protein:vir:95 65 ----AYWV------------------------------------------------------------------------ 68 (297) T ss_pred ----eEEe------------------------------------------------------------------------ Confidence 0000 Q ss_pred cchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_014661. 230 ATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVID 309 (524) Q Consensus 230 ~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~ 309 (524) +| +..+++-..++++++...|..+-...+|.||.+|-. .|.+.+|.+-|+..|...+++.||. T Consensus 69 ----~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~ 131 (297) T protein:vir:95 69 ----NE---------TEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLL 131 (297) T ss_pred ----ec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhc Confidence 01 112333344556666777777777789999999875 4679999999999999999999985 Q ss_pred hHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhc Q lcl|NC_014661. 310 WINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLAS 389 (524) Q Consensus 310 ~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~ 389 (524) ..... .+.|++....... . ... ..-.+.-|.++...|... +...+.++++|+....|.. T Consensus 132 G~g~~-------------~~~gi~~~~~~~~--~--~~~--~~~t~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~ 190 (297) T protein:vir:95 132 GHDTP-------------FANSVAKAAKDAN--K--VIG--GPINYDNILKLQDALYDA--DVEPNAFVSKIQNRSALRE 190 (297) T ss_pred ccCCc-------------ccccccccccccc--e--ecc--cccCHHHHHHHHHHhhhc--cCCcCEEEEcHHHHHHHHH Confidence 32211 1122222111000 0 000 011122344444555443 2245678999999999975 Q ss_pred CCccccccccccccccccccCcceEEEEecCceEEEeeCCC--CcceEE--------EEEecCCCccceeEeeccccccc Q lcl|NC_014661. 390 VDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYA--RQDYFT--------IGYKGDNEMDAGIYYAPYVALTP 459 (524) Q Consensus 390 ~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy~~--------vG~KG~~~~d~g~fyaPYv~~~~ 459 (524) ... +.|. .+- ... .++|.| ++|++-+.. +..-++ +|..+.-+.+- .. +... T Consensus 191 l~d-----~~G~--~i~-~~~----~~~l~G-~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~----~~--~~~~ 251 (297) T protein:vir:95 191 ARD-----GNKV--SIY-DKA----ANTIDG-ITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKI----SE--EGQI 251 (297) T ss_pred hhc-----cCCc--eee-cCC----CCcccc-eeeEeecCCCCCCceEEEEecccEEEEEecCeEEEE----ee--cccc Confidence 221 1110 000 111 135665 577754432 222232 33332211100 00 0000 Q ss_pred ccccCcc----c-cc-ceeee--eeeeceee-CC--cccccCCccccc Q lcl|NC_014661. 460 LRGADPK----N-FQ-PVLGF--KTRYGIGI-NP--LADTAAQQPAGN 496 (524) Q Consensus 460 ~~~~Dp~----s-~q-P~~~~--~tRY~l~~-nP--~~~~~~~~~~~~ 496 (524) ....|+. + || =.++| ..|++..+ || |+.-. ...++ T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~--~at~~ 297 (297) T protein:vir:95 252 STITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLT--PAERV 297 (297) T ss_pred ccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEe--ecCCC Confidence 0111221 1 22 11222 35666553 44 22211 11111 No 78 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=74.69 E-value=0.16 Score=25.02 Aligned_cols=333 Identities=15% Similarity=0.116 Sum_probs=115.0 Q ss_pred cchHHHHHHhhhhhhcc--------------CCC--cchhhhhhh---hhhh--hhhhHHHHHhhhhccccchhhhcccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGA--PEIAQGKHA---IIAK--MFENQEADIKSDAAYRDEKLAEAFGG 63 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~--~~~~~~~~~---~~~~--~~enq~~~~~~~~~~~~~~~~~~~~~ 63 (524) |++-++|+++|.-+.+. +.. .+|...+.. +.++ -|+.|.+.+..+-.-........... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 55555565555554331 000 122221111 1111 12233222211110000000000000 Q ss_pred cccccccc---------ccccc-ch----------hhhccccccccccccCcchhh------HHHHHHHhhhhhhceeee Q lcl|NC_014661. 64 FLTEAEIG---------GDHGY-DP----------QNIAAGQTSGAVTQIGPAVMG------MVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 64 ~l~ea~~~---------~~~g~-~~----------~~i~est~tg~v~~~~P~Li~------l~Rra~~nLIa~DI~GVQ 117 (524) .-.+.+.. ..++. .. ..+.+++.+ .+ ..||+ ++++....-.-.+++.|. T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~ 155 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchhhhhceee Confidence 00000000 00000 00 001111111 11 12222 233333334446777777 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) |+++.+.- |-.+.. +.+.| T Consensus 156 ~~~~~~~p----~~~~~~----------------~~a~~----------------------------------------- 174 (387) T protein:vir:94 156 NIKGLEIP----RVSYTL----------------DDDDF----------------------------------------- 174 (387) T ss_pred ecCCceee----eeeccC----------------Ccccc----------------------------------------- Confidence 76543210 000000 00000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) +++| ...++...++++++..+|.-+-...+|-||.+ T Consensus 175 ---------------------------v~Eg-----------------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ 210 (387) T protein:vir:94 175 ---------------------------ITDV-----------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH 210 (387) T ss_pred ---------------------------cccc-----------------ccccccccccceeeechheeeeechhhHHHHh Confidence 0011 11122222334445555555556789999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHH Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQ 357 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~ 357 (524) |- ..|.|++|.+-|+..|..-.|..++-.-. .. +-+.|++.=....-+.+ -.++-. T Consensus 211 ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~---g~---------g~~~g~~~~~~~~~~~~--------~~~~d~ 266 (387) T protein:vir:94 211 GS----DVDLVNWVENALQSGLAAKERKDALAVSP---KS---------GLEHMSFYNGSVKEVEG--------ADMYDA 266 (387) T ss_pred hh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCC---Cc---------cccceeeeccccccccc--------cchHHH Confidence 85 35568889988888887765655542211 11 12233331111111111 112223 Q ss_pred HHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEE Q lcl|NC_014661. 358 IDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTI 437 (524) Q Consensus 358 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~v 437 (524) |..+-+.+...= |..+.|++-+...+.+|.... - ..+ .+- ...+ ++|.| ++||+..+++. +++ T Consensus 267 i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~---~--~~~---~~~-~~~~----~~llG-~PV~~~~~~~~--~~~ 329 (387) T protein:vir:94 267 IINALADLHEDY-RDNATIYMRYADYVKIISVLS---N--GTT---NFF-DTPA----EKVFG-KPVVFTDAAVK--PIV 329 (387) T ss_pred HHHHHhccChhh-hcCCEEEEechHHHHHHHHHh---c--CCC---ccc-ccCC----ccccc-cceEEecCCCc--eee Confidence 333333333321 235666554444444443211 1 000 110 1111 35776 59998877653 344 Q ss_pred EEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-CCcccccCCccccceeeccccchhhhhcccccee Q lcl|NC_014661. 438 GYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGYF 516 (524) Q Consensus 438 G~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~ 516 (524) | +- +-||.=|......+..|..+.+-.+-...||+..+ +|= -| T Consensus 330 G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~-----------------------------A~ 373 (387) T protein:vir:94 330 G---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDS-----------------------------AF 373 (387) T ss_pred e---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeechh-----------------------------he Confidence 4 11 11222221111111123333332333333655432 331 12 Q ss_pred eeeeeecC Q lcl|NC_014661. 517 RRVLVKGI 524 (524) Q Consensus 517 r~~~v~~~ 524 (524) |.+.||-= T Consensus 374 ~~l~~ka~ 381 (387) T protein:vir:94 374 RIAKAKEN 381 (387) T ss_pred EEEEeecC Confidence 22222221 No 79 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=74.69 E-value=0.16 Score=25.02 Aligned_cols=333 Identities=15% Similarity=0.116 Sum_probs=115.0 Q ss_pred cchHHHHHHhhhhhhcc--------------CCC--cchhhhhhh---hhhh--hhhhHHHHHhhhhccccchhhhcccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGA--PEIAQGKHA---IIAK--MFENQEADIKSDAAYRDEKLAEAFGG 63 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~--~~~~~~~~~---~~~~--~~enq~~~~~~~~~~~~~~~~~~~~~ 63 (524) |++-++|+++|.-+.+. +.. .+|...+.. +.++ -|+.|.+.+..+-.-........... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 55555565555554331 000 122221111 1111 12233222211110000000000000 Q ss_pred cccccccc---------ccccc-ch----------hhhccccccccccccCcchhh------HHHHHHHhhhhhhceeee Q lcl|NC_014661. 64 FLTEAEIG---------GDHGY-DP----------QNIAAGQTSGAVTQIGPAVMG------MVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 64 ~l~ea~~~---------~~~g~-~~----------~~i~est~tg~v~~~~P~Li~------l~Rra~~nLIa~DI~GVQ 117 (524) .-.+.+.. ..++. .. ..+.+++.+ .+ ..||+ ++++....-.-.+++.|. T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~ 155 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchhhhhceee Confidence 00000000 00000 00 001111111 11 12222 233333334446777777 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) |+++.+.- |-.+.. +.+.| T Consensus 156 ~~~~~~~p----~~~~~~----------------~~a~~----------------------------------------- 174 (387) T protein:vir:96 156 NIKGLEIP----RVSYTL----------------DDDDF----------------------------------------- 174 (387) T ss_pred ecCCceee----eeeccC----------------Ccccc----------------------------------------- Confidence 76543210 000000 00000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) +++| ...++...++++++..+|.-+-...+|-||.+ T Consensus 175 ---------------------------v~Eg-----------------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ 210 (387) T protein:vir:96 175 ---------------------------ITDV-----------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH 210 (387) T ss_pred ---------------------------cccc-----------------ccccccccccceeeechheeeeechhhHHHHh Confidence 0011 11122222334445555555556789999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHH Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQ 357 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~ 357 (524) |- ..|.|++|.+-|+..|..-.|..++-.-. .. +-+.|++.=....-+.+ -.++-. T Consensus 211 ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~---g~---------g~~~g~~~~~~~~~~~~--------~~~~d~ 266 (387) T protein:vir:96 211 GS----DVDLVNWVENALQSGLAAKERKDALAVSP---KS---------GLEHMSFYNGSVKEVEG--------ADMYDA 266 (387) T ss_pred hh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCC---Cc---------cccceeeeccccccccc--------cchHHH Confidence 85 35568889988888887765655542211 11 12233331111111111 112223 Q ss_pred HHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEE Q lcl|NC_014661. 358 IDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTI 437 (524) Q Consensus 358 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~v 437 (524) |..+-+.+...= |..+.|++-+...+.+|.... - ..+ .+- ...+ ++|.| ++||+..+++. +++ T Consensus 267 i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~---~--~~~---~~~-~~~~----~~llG-~PV~~~~~~~~--~~~ 329 (387) T protein:vir:96 267 IINALADLHEDY-RDNATIYMRYADYVKIISVLS---N--GTT---NFF-DTPA----EKVFG-KPVVFTDAAVK--PIV 329 (387) T ss_pred HHHHHhccChhh-hcCCEEEEechHHHHHHHHHh---c--CCC---ccc-ccCC----ccccc-cceEEecCCCc--eee Confidence 333333333321 235666554444444443211 1 000 110 1111 35776 59998877653 344 Q ss_pred EEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-CCcccccCCccccceeeccccchhhhhcccccee Q lcl|NC_014661. 438 GYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGYF 516 (524) Q Consensus 438 G~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~ 516 (524) | +- +-||.=|......+..|..+.+-.+-...||+..+ +|= -| T Consensus 330 G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~-----------------------------A~ 373 (387) T protein:vir:96 330 G---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDS-----------------------------AF 373 (387) T ss_pred e---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeechh-----------------------------he Confidence 4 11 11222221111111123333332333333655432 331 12 Q ss_pred eeeeeecC Q lcl|NC_014661. 517 RRVLVKGI 524 (524) Q Consensus 517 r~~~v~~~ 524 (524) |.+.||-= T Consensus 374 ~~l~~ka~ 381 (387) T protein:vir:96 374 RIAKAKEN 381 (387) T ss_pred EEEEeecC Confidence 22222221 No 80 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=74.69 E-value=0.16 Score=25.02 Aligned_cols=333 Identities=15% Similarity=0.116 Sum_probs=115.0 Q ss_pred cchHHHHHHhhhhhhcc--------------CCC--cchhhhhhh---hhhh--hhhhHHHHHhhhhccccchhhhcccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA--------------EGA--PEIAQGKHA---IIAK--MFENQEADIKSDAAYRDEKLAEAFGG 63 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~--------------~~~--~~~~~~~~~---~~~~--~~enq~~~~~~~~~~~~~~~~~~~~~ 63 (524) |++-++|+++|.-+.+. +.. .+|...+.. +.++ -|+.|.+.+..+-.-........... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 55555565555554331 000 122221111 1111 12233222211110000000000000 Q ss_pred cccccccc---------ccccc-ch----------hhhccccccccccccCcchhh------HHHHHHHhhhhhhceeee Q lcl|NC_014661. 64 FLTEAEIG---------GDHGY-DP----------QNIAAGQTSGAVTQIGPAVMG------MVRRAIPNLIAFDICGVQ 117 (524) Q Consensus 64 ~l~ea~~~---------~~~g~-~~----------~~i~est~tg~v~~~~P~Li~------l~Rra~~nLIa~DI~GVQ 117 (524) .-.+.+.. ..++. .. ..+.+++.+ .+ ..||+ ++++....-.-.+++.|. T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~~ 155 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchhhhhceee Confidence 00000000 00000 00 001111111 11 12222 233333334446777777 Q ss_pred cCCCcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 118 PMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 118 PmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) |+++.+.- |-.+.. +.+.| T Consensus 156 ~~~~~~~p----~~~~~~----------------~~a~~----------------------------------------- 174 (387) T protein:vir:26 156 NIKGLEIP----RVSYTL----------------DDDDF----------------------------------------- 174 (387) T ss_pred ecCCceee----eeeccC----------------Ccccc----------------------------------------- Confidence 76543210 000000 00000 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) +++| ...++...++++++..+|.-+-...+|-||.+ T Consensus 175 ---------------------------v~Eg-----------------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ 210 (387) T protein:vir:26 175 ---------------------------ITDV-----------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH 210 (387) T ss_pred ---------------------------cccc-----------------ccccccccccceeeechheeeeechhhHHHHh Confidence 0011 11122222334445555555556789999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHH Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQ 357 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~ 357 (524) |- ..|.|++|.+-|+..|..-.|..++-.-. .. +-+.|++.=....-+.+ -.++-. T Consensus 211 ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~---g~---------g~~~g~~~~~~~~~~~~--------~~~~d~ 266 (387) T protein:vir:26 211 GS----DVDLVNWVENALQSGLAAKERKDALAVSP---KS---------GLEHMSFYNGSVKEVEG--------ADMYDA 266 (387) T ss_pred hh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCC---Cc---------cccceeeeccccccccc--------cchHHH Confidence 85 35568889988888887765655542211 11 12233331111111111 112223 Q ss_pred HHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEE Q lcl|NC_014661. 358 IDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTI 437 (524) Q Consensus 358 i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~v 437 (524) |..+-+.+...= |..+.|++-+...+.+|.... - ..+ .+- ...+ ++|.| ++||+..+++. +++ T Consensus 267 i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~---~--~~~---~~~-~~~~----~~llG-~PV~~~~~~~~--~~~ 329 (387) T protein:vir:26 267 IINALADLHEDY-RDNATIYMRYADYVKIISVLS---N--GTT---NFF-DTPA----EKVFG-KPVVFTDAAVK--PIV 329 (387) T ss_pred HHHHHhccChhh-hcCCEEEEechHHHHHHHHHh---c--CCC---ccc-ccCC----ccccc-cceEEecCCCc--eee Confidence 333333333321 235666554444444443211 1 000 110 1111 35776 59998877653 344 Q ss_pred EEecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-CCcccccCCccccceeeccccchhhhhcccccee Q lcl|NC_014661. 438 GYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGYF 516 (524) Q Consensus 438 G~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~ 516 (524) | +- +-||.=|......+..|..+.+-.+-...||+..+ +|= -| T Consensus 330 G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~-----------------------------A~ 373 (387) T protein:vir:26 330 G---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDS-----------------------------AF 373 (387) T ss_pred e---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeechh-----------------------------he Confidence 4 11 11222221111111123333332333333655432 331 12 Q ss_pred eeeeeecC Q lcl|NC_014661. 517 RRVLVKGI 524 (524) Q Consensus 517 r~~~v~~~ 524 (524) |.+.||-= T Consensus 374 ~~l~~ka~ 381 (387) T protein:vir:26 374 RIAKAKEN 381 (387) T ss_pred EEEEeecC Confidence 22222221 No 81 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=73.67 E-value=0.17 Score=24.84 Aligned_cols=283 Identities=13% Similarity=0.086 Sum_probs=119.8 Q ss_pred ccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRG 160 (524) Q Consensus 82 ~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~ 160 (524) |.+++|+.-...-+.+. .+++++-+..+..+++-+.||+.... +|..... + +.+.| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~-------~~p~~~~---~---------~~a~w---- 57 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNE-------DIITFNG---R---------PKAEF---- 57 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEeC---C---------ceeEE---- Confidence 22222221111212222 56666777788888888888875321 1211100 0 00000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhccccc Q lcl|NC_014661. 161 SHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGF 240 (524) Q Consensus 161 ~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~l 240 (524) + +| T Consensus 58 ----------------------------------------------------------------v--------~E----- 60 (311) T protein:vir:99 58 ----------------------------------------------------------------V--------GE----- 60 (311) T ss_pred ----------------------------------------------------------------e--------ec----- Confidence 0 11 Q ss_pred CCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhh Q lcl|NC_014661. 241 NGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKT 320 (524) Q Consensus 241 Ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~ 320 (524) +..+++...++++++..+|.-+-....|-||.|+-.- -..|-+++|.+-|...|...|++.+|...-.. .++. T Consensus 61 ----g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~--~g~~ 133 (311) T protein:vir:99 61 ----GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQTLSEAGAEALARALDLGLYHRINPL--TGTV 133 (311) T ss_pred ----CcccccccceeeEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc--cCcc Confidence 1123333445566666666666678899999763321 13556888888888888888888888653210 0110 Q ss_pred c-ccccc-ccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccc Q lcl|NC_014661. 321 G-QTLTV-GSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAA 398 (524) Q Consensus 321 ~-~~~~~-~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a 398 (524) . ++... ....+...+... .+ -.+..-|+.+-..+...-.+...+..|++++....|....- + T Consensus 134 ~~g~~~~~~~~~~~~~~~~~------~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd-----~ 197 (311) T protein:vir:99 134 IPGWSNYLGAASKRVELTAD------TI-----ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARY-----T 197 (311) T ss_pred ccccccccccccceeecccc------cc-----chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhc-----c Confidence 0 00000 000111111100 00 11122233333332222223345668999999999965321 1 Q ss_pred cccccccccccCcceEEEEecCceEEEeeCCC----------------CcceEEEEEecCCCccceeEeecccccc--cc Q lcl|NC_014661. 399 QGLARGLNTDTTKAVFAGILGGRYKVYIDQYA----------------RQDYFTIGYKGDNEMDAGIYYAPYVALT--PL 460 (524) Q Consensus 399 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~----------------~~dy~~vG~KG~~~~d~g~fyaPYv~~~--~~ 460 (524) .|. .-+..+.+.. -.++|.| ++|++..+- +++++++|= ...++.|.-..... .. T Consensus 198 ~G~-~l~~~~~~~~-~~~~l~G-~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gd-----f~~~~~~~~~~~~~~~~~ 269 (311) T protein:vir:99 198 DGR-KKFPELGLGI-GVSSFEG-IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGD-----FANGIHWGVQRDIPVELI 269 (311) T ss_pred CCC-eeecCcccCC-CCceecc-eeeEeecccccccccccccchhhccCcceEEEee-----ccccEEEEEecCceEEEe Confidence 110 0011111110 1246777 588887653 233333331 11122222111111 11 Q ss_pred cccCcccc-----cceeee--eeeeceeeCCcccccCCccccceeeccccchhh Q lcl|NC_014661. 461 RGADPKNF-----QPVLGF--KTRYGIGINPLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 461 ~~~Dp~s~-----qP~~~~--~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a 507 (524) +.-|++.. .--++| ..|||..+-+ + ...++.+.- | T Consensus 270 ~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~--------~-~~v~~~~~~---A 311 (311) T protein:vir:99 270 KYGDPDGQGDLKRHNQIALRLEIVYGWYVFT--------D-RFVVIENAV---A 311 (311) T ss_pred ecCCCCcchhhhhcCcEEEEEEEeecceecC--------h-hHeeeeccc---C Confidence 11133321 112333 5788865432 0 012222211 1 No 82 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=72.61 E-value=0.18 Score=24.66 Aligned_cols=272 Identities=11% Similarity=0.041 Sum_probs=117.1 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +.+. .+. -.++...... ...+..+.....-+...... +.... . ..|...+.+.=-.+..+|. +.... T Consensus 1 ma~~---~T~-~~d~iiPev~-~~~v~~~~~~~l~~~~~~~~---d~~l~-g-~~G~tv~iP~~~~~g~a~~---~~~g~ 67 (274) T protein:vir:97 1 MPQG---LTK-TSDQIIPEVL-APMMQAQLEKKLRFASFAEV---DSTLQ-G-QPGDTLTFPAFVYSGDAQV---VAEGE 67 (274) T ss_pred CCcc---cee-hhheechHHH-HHHHHHhhhhhhhhccccee---ccccc-C-CCCCEEEEeeecCCCcccc---ccCCC Confidence 1110 000 0000000000 00000000000000000000 00000 0 0111111111000112221 11112 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTL 324 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~ 324 (524) .-...++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.|..+-++.-|..+++.+++..+...+.. + T Consensus 68 ~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~-----~- 135 (274) T protein:vir:97 68 KIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-----V- 135 (274) T ss_pred cccccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-----c- Confidence 23344444 33444444555522233322 22233 4688899999999999999999999877543321 0 Q ss_pred ccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccc Q lcl|NC_014661. 325 TVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARG 404 (524) Q Consensus 325 ~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~ 404 (524) .+..++ .+-+-.+..++.++. ....+++|+|.+++.|.......+..++... T Consensus 136 ----~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-- 187 (274) T protein:vir:97 136 ----NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDASTNFTRATELG-- 187 (274) T ss_pred ----cccccC-------------HHHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhhhhhccccCccc-- Confidence 011121 233334444444321 2578999999999999865433332222211 Q ss_pred cccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccc-cCcccccceeeeeeeeceee- Q lcl|NC_014661. 405 LNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTRYGIGI- 482 (524) Q Consensus 405 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~~- 482 (524) .....+-.+|.+.| ++||+|+..|..-..+--+| .+-|.---+.. ++. -||..+.=.+-..-+||+.+ T Consensus 188 --~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~-vE~~Rd~~~~~d~i~~~~~y~~~~~ 257 (274) T protein:vir:97 188 --DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDFF-LEVARDASTKTTALYSDKHYVAYLY 257 (274) T ss_pred --ccceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCce-eccccchhhcccEEEEEEEEEEEEE Confidence 11122334688876 79999999885432222122 22221111111 222 38999999999999999853 Q ss_pred CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 483 NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 483 nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) || .++.++.-.. ++-.| T Consensus 258 ~~---------~~vv~~t~~~----~~~~~ 274 (274) T protein:vir:97 258 DE---------SKAVKITKGS----GSLEM 274 (274) T ss_pred cC---------CceEEEecCc----ccccC Confidence 44 2344444321 12333 No 83 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=72.61 E-value=0.18 Score=24.66 Aligned_cols=272 Identities=11% Similarity=0.041 Sum_probs=117.1 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +.+. .+. -.++...... ...+..+.....-+...... +.... . ..|...+.+.=-.+..+|. +.... T Consensus 1 ma~~---~T~-~~d~iiPev~-~~~v~~~~~~~l~~~~~~~~---d~~l~-g-~~G~tv~iP~~~~~g~a~~---~~~g~ 67 (274) T protein:vir:94 1 MPQG---LTK-TSDQIIPEVL-APMMQAQLEKKLRFASFAEV---DSTLQ-G-QPGDTLTFPAFVYSGDAQV---VAEGE 67 (274) T ss_pred CCcc---cee-hhheechHHH-HHHHHHhhhhhhhhccccee---ccccc-C-CCCCEEEEeeecCCCcccc---ccCCC Confidence 1110 000 0000000000 00000000000000000000 00000 0 0111111111000112221 11112 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTL 324 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~ 324 (524) .-...++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.|..+-++.-|..+++.+++..+...+.. + T Consensus 68 ~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~-----~- 135 (274) T protein:vir:94 68 KIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-----V- 135 (274) T ss_pred cccccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-----c- Confidence 23344444 33444444555522233322 22233 4688899999999999999999999877543321 0 Q ss_pred ccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccc Q lcl|NC_014661. 325 TVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARG 404 (524) Q Consensus 325 ~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~ 404 (524) .+..++ .+-+-.+..++.++. ....+++|+|.+++.|.......+..++... T Consensus 136 ----~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-- 187 (274) T protein:vir:94 136 ----NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDASTNFTRATELG-- 187 (274) T ss_pred ----cccccC-------------HHHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhhhhhccccCccc-- Confidence 011121 233334444444321 2578999999999999865433332222211 Q ss_pred cccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccc-cCcccccceeeeeeeeceee- Q lcl|NC_014661. 405 LNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTRYGIGI- 482 (524) Q Consensus 405 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~~- 482 (524) .....+-.+|.+.| ++||+|+..|..-..+--+| .+-|.---+.. ++. -||..+.=.+-..-+||+.+ T Consensus 188 --~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~-vE~~Rd~~~~~d~i~~~~~y~~~~~ 257 (274) T protein:vir:94 188 --DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDFF-LEVARDASTKTTALYSDKHYVAYLY 257 (274) T ss_pred --ccceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCce-eccccchhhcccEEEEEEEEEEEEE Confidence 11122334688876 79999999885432222122 22221111111 222 38999999999999999853 Q ss_pred CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 483 NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 483 nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) || .++.++.-.. ++-.| T Consensus 258 ~~---------~~vv~~t~~~----~~~~~ 274 (274) T protein:vir:94 258 DE---------SKAVKITKGS----GSLEM 274 (274) T ss_pred cC---------CceEEEecCc----ccccC Confidence 44 2344444321 12333 No 84 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=71.29 E-value=0.2 Score=24.44 Aligned_cols=302 Identities=11% Similarity=0.058 Sum_probs=118.0 Q ss_pred hhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceee Q lcl|NC_014661. 38 MFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGV 116 (524) Q Consensus 38 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GV 116 (524) +=++|+..+.. ++|.+.... + ....+ .....++..+.. .-|.+. .+++.+..+.+..+++.+ T Consensus 1 ~~~~~~~~~~~-~~f~~~~~~---~-~~~~a----------~~~~~~~~~~~l--ip~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:96 1 MEQTQKLKLNL-QHFASNNVK---P-QVFNP----------DNVMMHEKKDGT--LLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcchhhhHHH-HHHHHhhhh---h-hhccc----------ccccccCCCcce--echhHHHHHHHHHHhhchhhhhcce Confidence 11111111100 001000000 0 00011 000111111111 122333 455566677888999999 Q ss_pred ecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 117 QPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATG 196 (524) Q Consensus 117 QPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g 196 (524) -||++++.-|. ++... +.+.| T Consensus 64 ~~~~~~~~~~p----~~~~~---------------~~a~~---------------------------------------- 84 (324) T protein:vir:96 64 EPMEGTEKKFT----FWADK---------------PGAYW---------------------------------------- 84 (324) T ss_pred eeccCCceEEE----EEecC---------------cceee---------------------------------------- Confidence 99988753221 11000 00000 Q ss_pred ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 197 AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 197 ~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 276 (524) +++| ..+++..-+++++++..|.-+-....|-||. T Consensus 85 ----------------------------v~Eg-----------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell 119 (324) T protein:vir:96 85 ----------------------------VGEG-----------------QKIETSKATWVNATMRAFKLGVILPVTKEFL 119 (324) T ss_pred ----------------------------ecCC-----------------ccccccccceeEEEEEeEEEEEeehhhHHHH Confidence 0000 1112222233444444444444555999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) +|-. .|.+++|.+.|...|...+++.||..--. ...+.|++....... . +. .....+. T Consensus 120 ~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g~------------~~~~~~~~~~~~~~~--~--~~--~~~~~~~ 177 (324) T protein:vir:96 120 NYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN------------NPFGKSIAQSIKKTN--K--VI--KGDFTQD 177 (324) T ss_pred hcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCC------------CCcCccccccccccc--e--ec--ccccchH Confidence 9853 56789999999999999999988853211 011222222110000 0 00 0011122 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCC--Ccce Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYA--RQDY 434 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy 434 (524) .|..+-+.|.. .+...+.++||+.....|..+.. +.|.- -+. +..+ ++|.| ++|++++.. +... T Consensus 178 ~i~~~~~~i~~--~~~~~~~~i~n~~~~~~L~~lkd-----~~G~~-~~~-~~~~----~~l~G-~PV~~~~~~~~~~~~ 243 (324) T protein:vir:96 178 NIIDLEALLED--DELEANAFISKTQNRSLLRKIVD-----PETKE-RIY-DRNS----DSLDG-LPVVNLKSSNLKRGE 243 (324) T ss_pred HHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhC-----CCCCe-eec-CCCC----Ccccc-eeeEeecCCCCCcce Confidence 23334444432 33467789999999999975432 11111 011 1111 35666 688886653 2222 Q ss_pred EEEE--------EecCCCccceeEeecccccccccccCccc-----c---cceeeeeeeece-eeCC--cccccCCcccc Q lcl|NC_014661. 435 FTIG--------YKGDNEMDAGIYYAPYVALTPLRGADPKN-----F---QPVLGFKTRYGI-GINP--LADTAAQQPAG 495 (524) Q Consensus 435 ~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s-----~---qP~~~~~tRY~l-~~nP--~~~~~~~~~~~ 495 (524) +++| ..+.-+.+ ...+. ......|+.. | |=.+=..-||++ ..+| |+.- ..+.++ T Consensus 244 ~~~gd~s~~~~~~~~~~~i~----~~~~~--~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l-~~a~~~ 316 (324) T protein:vir:96 244 LITGDFDKLIYGIPQLIEYK----IDETA--QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL-VPADKR 316 (324) T ss_pred EEEEecceEEEEEecCcEEE----Eeecc--cccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE-eccccc Confidence 3333 32221110 00000 0000011110 1 122333455655 3344 1110 000000 Q ss_pred ceeeccccchhhhhcccc Q lcl|NC_014661. 496 NARIANGMPSIANSVGKN 513 (524) Q Consensus 496 ~~~~~~g~~~~a~~~~~~ 513 (524) -.. ..++- T Consensus 317 -------~~~---~~~~~ 324 (324) T protein:vir:96 317 -------TDS---VPGEV 324 (324) T ss_pred -------CCC---CCCCC Confidence 000 11111 No 85 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=69.23 E-value=0.23 Score=24.13 Aligned_cols=350 Identities=13% Similarity=0.091 Sum_probs=130.2 Q ss_pred CCcccchHHH-HHHhhhh---hhccCCCc--chhhhhhhhhhhhhhhHHHHHhh----hhcccc---------------- Q lcl|NC_014661. 1 MSTQIKTKAQ-LVADWKP---LLEAEGAP--EIAQGKHAIIAKMFENQEADIKS----DAAYRD---------------- 54 (524) Q Consensus 1 ~~~~~~~~~~-l~~kw~p---~l~~~~~~--~~~~~~~~~~~~~~enq~~~~~~----~~~~~~---------------- 54 (524) |++++..-.+ +.++.+- +++..... ++...+..+ .. |+.|.+.+.+ +..... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~-~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEI-DI-LQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIY 78 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH-HH-HHHHHHHHHHHHHHHHHHhhhhccccccccchhhHH Confidence 9998744322 3333333 44433332 221111111 11 1111111110 000000 Q ss_pred --chhhhc-cccccccccccc-cccc-chhhhcccc-ccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchhe Q lcl|NC_014661. 55 --EKLAEA-FGGFLTEAEIGG-DHGY-DPQNIAAGQ-TSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQV 126 (524) Q Consensus 55 --~~~~~~-~~~~l~ea~~~~-~~g~-~~~~i~est-~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLI 126 (524) .....+ ...++-+-+..+ .... +...+.+++ ++|.+ .. |.-+ .+++.+-.+....+++++.||+++.|-+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~-~v-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~ 156 (404) T protein:vir:10 79 NGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGY-AV-PEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSR 156 (404) T ss_pred HHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCce-ee-chhHHHHHHHHHhhhhhHhhhhceeeccCCccce Confidence 000000 000111000000 0010 111122222 12221 11 2222 3455555677788999999999999854 Q ss_pred eeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCc Q lcl|NC_014661. 127 FALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADA 206 (524) Q Consensus 127 FAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~ 206 (524) - |...... +...|-+ T Consensus 157 ~-----~~~~~~~------------~~~~~v~------------------------------------------------ 171 (404) T protein:vir:10 157 T-----YEKRSKQ------------KPMKPLS------------------------------------------------ 171 (404) T ss_pred E-----EEEecCC------------cceeecc------------------------------------------------ Confidence 2 2111000 0000000 Q ss_pred ccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCC Q lcl|NC_014661. 207 AELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMD 286 (524) Q Consensus 207 ~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLD 286 (524) +|- ...+ .....++++++.+.|.-+-...+|-||.+|-. .+ T Consensus 172 --------------------e~~--~~~~-------------~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~ 212 (404) T protein:vir:10 172 --------------------ENQ--QIPT-------------NGDNGKLERFNFKLKDLADFMSIPNDLLKFAD----KS 212 (404) T ss_pred --------------------ccc--cccc-------------cccccceeeeEeeheeeEeeehhhHHHHhhcH----HH Confidence 000 0000 00122334555555555556689999999843 35 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 287 ADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIA 366 (524) Q Consensus 287 AEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~ 366 (524) .+++|.+.|+..|...+|+.||...- +...+.|++......-... .. ...+..++..-+.+ T Consensus 213 l~~~i~~~la~~~~~~~~~~il~G~g------------~~~~~~gi~~~~~~~~~~~---~~---~~~~~~~~~~~~~~- 273 (404) T protein:vir:10 213 LEDWIINWFVDKVRITRNAEILYGAG------------GDEHATGIMTANKFKKITL---PK---SPALKDFKKCKNVE- 273 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCC------------CCCcccceeeccccceeec---cc---cccHHHHHHHHHhh- Confidence 58888888888888888888874321 1112233332221110000 00 00111222211111 Q ss_pred hhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCC-CCcceEEEEEecCCCc Q lcl|NC_014661. 367 RQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQY-ARQDYFTIGYKGDNEM 445 (524) Q Consensus 367 ~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y-~~~dy~~vG~KG~~~~ 445 (524) ....+...-.+||||+..+.|...... .|.- -+..+.++. .-++|.| ++|++.+. .+.. ..- T Consensus 274 l~~~~~~~~~~v~n~~~~~~L~~lkd~-----~G~~-l~~~~~~~~-~~~~l~G-~PV~~~~~~~~~~---------~~~ 336 (404) T protein:vir:10 274 LLNVFKATSSWIVNQDGFNYLDSLEDK-----TGRP-YLQPDPKDP-TQYRFLG-LPVIELPNDLLLS---------TES 336 (404) T ss_pred hhccccCCCEEEEcHHHHHHHHHhhcc-----CCce-eeccCcCCC-CCccccc-eeeEEecccccCC---------CCC Confidence 223342233579999999999764311 1100 011121111 1246777 58875332 1100 000 Q ss_pred cceeEeecccc---------cccccccCc----ccccceeeeeeeeceee-CC--cc---cccCCccc Q lcl|NC_014661. 446 DAGIYYAPYVA---------LTPLRGADP----KNFQPVLGFKTRYGIGI-NP--LA---DTAAQQPA 494 (524) Q Consensus 446 d~g~fyaPYv~---------~~~~~~~Dp----~s~qP~~~~~tRY~l~~-nP--~~---~~~~~~~~ 494 (524) +..++|+.+-. +......++ ...+=.+-...|+++.+ +| |. ....-.|+ T Consensus 337 ~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 337 AIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 11112221110 111111122 23334556666776542 33 21 11111111 No 86 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=68.14 E-value=0.24 Score=23.97 Aligned_cols=282 Identities=12% Similarity=0.030 Sum_probs=111.8 Q ss_pred hcccc-ccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccc Q lcl|NC_014661. 81 IAAGQ-TSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSG 158 (524) Q Consensus 81 i~est-~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG 158 (524) .++.+ ++|.. ..-+.+. .+++++-.+.+...++-|.||.+.. + +|..... +.+ +.| T Consensus 1 Ma~~~~~~gg~-~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~-~------~ip~~~~---~~~---------a~w-- 58 (315) T protein:vir:80 1 MADDFLSAGKL-ELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-V------KGAVFSG---VPR---------AKI-- 58 (315) T ss_pred CCCCcCCcCce-EcchHHHHHHHHHHHhhchhhhhcceeecCCCc-e------EEEEEeC---Ccc---------eEE-- Confidence 22222 22222 1222232 3566666677778888888886532 1 1211100 000 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhccc Q lcl|NC_014661. 159 RGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQE 238 (524) Q Consensus 159 ~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~ 238 (524) ++ | T Consensus 59 ------------------------------------------------------------------v~--------E--- 61 (315) T protein:vir:80 59 ------------------------------------------------------------------VG--------E--- 61 (315) T ss_pred ------------------------------------------------------------------ee--------C--- Confidence 01 1 Q ss_pred ccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhh Q lcl|NC_014661. 239 GFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVG 318 (524) Q Consensus 239 ~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~ 318 (524) +..+++...+++++++.+|.-+-....|-||.+|- ..|+..+|.++|..++...|.|.+=..+++-...+ T Consensus 62 ------g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s----~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~ 131 (315) T protein:vir:80 62 ------GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWAD----ADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA 131 (315) T ss_pred ------CccccccccceeeeEeeeeeEEeeehhhHHHhhcC----chhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCC Confidence 01122333344555555555445567899999884 35666677777777777666666555444211110 Q ss_pred hhccccccccccceecc----cccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccc Q lcl|NC_014661. 319 KTGQTLTVGSKAGVFDF----QDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSV 394 (524) Q Consensus 319 k~~~~~~~~~~aG~fdl----~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~ 394 (524) .. ....|+.+. ....+..+ ..+.-+.++-..+.....+ ..+..+++|+....|....... T Consensus 132 ~~------~~~~~~~~~~~~~~~~~~~~~---------~~~~d~~~~~~~~~~~~~~-~~~~~imn~~~~~~L~~l~~~~ 195 (315) T protein:vir:80 132 TG------KAASAVHTSLNKTKNIVDATD---------SATADLVKAVGLIAGAGLQ-VPNGVALDPAFSFALSTEVYPK 195 (315) T ss_pred CC------ccccccccccccccceeeccc---------cchHHHHHHHHHHhhccCc-cceEEEEcHHHHHHHHHHhhcc Confidence 00 011111111 00111111 1112222332233222223 3456889999999997643221 Q ss_pred cccccccccccccccCcceEEEEecCceEEEeeCCCCcc---------eE--------EEEEecCCCccceeEeeccccc Q lcl|NC_014661. 395 TPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD---------YF--------TIGYKGDNEMDAGIYYAPYVAL 457 (524) Q Consensus 395 ~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~--------~vG~KG~~~~d~g~fyaPYv~~ 457 (524) ..+..+.- -++....+. .++|.| ++|+++.+.+.+ .+ .+|+.+... +-..+| T Consensus 196 g~~~~g~~-~~~~~~~g~--~~tl~G-~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~----i~i~~~--- 264 (315) T protein:vir:80 196 GSPLAGQP-MYPAAGFAG--LDNWRG-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP----IELIEY--- 264 (315) T ss_pred CCcccccc-cccccccCC--Cceecc-eeeEecCcCCcccccccccccEEEEeecccEEEEEecCee----EEEecc--- Confidence 11111100 011111111 257887 699998886432 12 222222111 111122 Q ss_pred ccccccCcc----c-ccc-eeeee--eeecee-eCCcccccCCccccceeecccc-chhhhhcccc Q lcl|NC_014661. 458 TPLRGADPK----N-FQP-VLGFK--TRYGIG-INPLADTAAQQPAGNARIANGM-PSIANSVGKN 513 (524) Q Consensus 458 ~~~~~~Dp~----s-~qP-~~~~~--tRY~l~-~nP~~~~~~~~~~~~~~~~~g~-~~~a~~~~~~ 513 (524) .|++ + ||. .++|. .|+|.. .+|= .+.++.+.- |. +--..-| T Consensus 265 -----~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~---------a~~~l~~~~a~~-~~~~~~~ 315 (315) T protein:vir:80 265 -----GDPDQTGRDLKGHNEVMVRAEAVLYVAIESLD---------SFAVVKEKAAPK-PNPPAEN 315 (315) T ss_pred -----ccccCcccchhhcCcEEEEEEEEecceeeccc---------ceEEEeeccCCC-CCCCCCC Confidence 1111 1 221 13332 345432 3440 011111100 00 0000111 No 87 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=65.41 E-value=0.28 Score=23.58 Aligned_cols=368 Identities=10% Similarity=0.014 Sum_probs=128.9 Q ss_pred CCcccchHHH----HHHhhhhhhcc-CCC-cchhhh------hhhhhhhhhhhHHHHHhhhhcc--ccchhhhccccccc Q lcl|NC_014661. 1 MSTQIKTKAQ----LVADWKPLLEA-EGA-PEIAQG------KHAIIAKMFENQEADIKSDAAY--RDEKLAEAFGGFLT 66 (524) Q Consensus 1 ~~~~~~~~~~----l~~kw~p~l~~-~~~-~~~~~~------~~~~~~~~~enq~~~~~~~~~~--~~~~~~~~~~~~l~ 66 (524) |..++....+ ..++=..+++. +.+ -+|..- +......+.+..++.......- ..+......+.... T Consensus 10 ~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (419) T protein:vir:94 10 QRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFA 89 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccchhhhhh Confidence 1111111000 00000001000 000 011100 0000011111111100000000 00000000110000 Q ss_pred ccc----------cccccccc---------hhhhccccccccccccCcchhh-H-HHHHHHhhhhhhceeeecCCCcchh Q lcl|NC_014661. 67 EAE----------IGGDHGYD---------PQNIAAGQTSGAVTQIGPAVMG-M-VRRAIPNLIAFDICGVQPMQGPTGQ 125 (524) Q Consensus 67 ea~----------~~~~~g~~---------~~~i~est~tg~v~~~~P~Li~-l-~Rra~~nLIa~DI~GVQPmTGPTGL 125 (524) +.+ -+...++. ......++.+.+....-|.+++ + ..+.-..++..++|.+.||++++.- T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~ 169 (419) T protein:vir:94 90 DSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLE 169 (419) T ss_pred hHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCcee Confidence 000 00000000 0000011111111122233331 1 1111223456789999999876532 Q ss_pred eeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCC Q lcl|NC_014661. 126 VFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTAD 205 (524) Q Consensus 126 IFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~ 205 (524) + ......+ . ....+. T Consensus 170 ~-------~~~~~~~----------~--~~~~~~---------------------------------------------- 184 (419) T protein:vir:94 170 Y-------IRDTSGT----------A--GAGSTW---------------------------------------------- 184 (419) T ss_pred e-------eeecccc----------c--cccccC---------------------------------------------- Confidence 1 1111000 0 000000 Q ss_pred cccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCC Q lcl|NC_014661. 206 AAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGM 285 (524) Q Consensus 206 ~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGL 285 (524) +-..-.+| +..+++...++++++..+|.=+-...+|-||.||.- T Consensus 185 ----------------------~~a~~v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~----- 228 (419) T protein:vir:94 185 ----------------------NKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----- 228 (419) T ss_pred ----------------------cccceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHhHH----- Confidence 00000111 123455556666777777766667789999999962 Q ss_pred ChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccc-cchHHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 286 DADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRG-ARWAGESFKALLFQIDKESAE 364 (524) Q Consensus 286 DAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~-~~~a~E~~r~L~~~i~~~a~~ 364 (524) +.+++|.+-|+..|...+|+.||..= ..+.+.|++.......... .-+.....-..+..|.++-+. T Consensus 229 ~l~~~i~~~la~a~~~~~d~aii~G~-------------G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~ 295 (419) T protein:vir:94 229 QLMGYIQGRLTYGLRFLRDRQLLNGN-------------GSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTV 295 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc-------------CcccccceecccccccccccccccccccchhHHHHHHHHHh Confidence 35899999999999999999998520 0012334332111000000 000111112233344444444 Q ss_pred HHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCC- Q lcl|NC_014661. 365 IARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDN- 443 (524) Q Consensus 365 I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~- 443 (524) +.. .+...+.+||+|.....|..... ..+...-+..+... -..++|.| ++|+++...+..-+++|--... T Consensus 296 ~~~--~~~~~~~~v~n~~~~~~l~~~k~-----~~~~~~~~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~gd~~~~~ 366 (419) T protein:vir:94 296 AEI--AGFPPDGVVVHPQDWESIELDQA-----PGSGVFRVIANVQG-EATPRIWG-LNVVSTVAIAQGTALVGGFRQGA 366 (419) T ss_pred hhh--ccCCCCEEEEcHHHHHHHHHHhh-----cCCCceeecCCccc-CCCccccc-eeeEEcCCCCCccEEEeeccceE Confidence 432 22357789999999998864321 00100001111111 01246776 6999999877655555521100 Q ss_pred ----CccceeEeecccccccccccCcccccceeeeeeeeceee-CCcccccCCccccceeeccccchhhhhcccc Q lcl|NC_014661. 444 ----EMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKN 513 (524) Q Consensus 444 ----~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~ 513 (524) ..+-.+-..++.... =..-+=.+=+..||++.+ +|=+ +.++.- + .+- + T Consensus 367 ~~~~~~~~~v~~~~~~~~~------~~~~~~~~r~~~r~d~~v~~~~a---------~~~~~~-----~-aa~-~ 419 (419) T protein:vir:94 367 TLWSRQGITVLMTDSHADF------FTANTLVILAEFRANLAVYQPKA---------FVRVTF-----A-AAT-T 419 (419) T ss_pred EEEEecceEEEEeccccch------hhcCcEEEEEEEeeccEEecccc---------EEEEEe-----c-cCC-C Confidence 000011111111000 011222334455666543 2311 111100 0 000 0 No 88 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=64.73 E-value=0.3 Score=23.49 Aligned_cols=357 Identities=10% Similarity=0.017 Sum_probs=124.3 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhh--hhccccchhhhcccccccccccccccccch Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKS--DAAYRDEKLAEAFGGFLTEAEIGGDHGYDP 78 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~--~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~ 78 (524) |.-.++..+++.||=+-+++.-..-.......+-...++++.++++.. ..++++..........|..-+ --|=+ T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee----~~~~~ 76 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEE----IKFFN 76 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHH----HHHHH Confidence 888887777777776666654222111111111112222222222211 111111111111111111100 00000 Q ss_pred hhhccccccccccccCcc-hh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccc Q lcl|NC_014661. 79 QNIAAGQTSGAVTQIGPA-VM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMF 156 (524) Q Consensus 79 ~~i~est~tg~v~~~~P~-Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~F 156 (524) ..+..++.++.-. .=|. ++ .+++.....=.-..+|-|+|++|++ |--+.... +.+.| T Consensus 77 ~~~~~~~~~~gg~-lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~------~i~~~~~~--------------~~a~w 135 (377) T protein:vir:96 77 DIDKNVGGKDKFK-LLPEETMVQVFDDLVAEHPLLKVINFKNTSLRL------KALTAETS--------------GTAVW 135 (377) T ss_pred HHHhcCCCCCCce-ecCHHHHHHHHHHHHhhhhhhhhceeEecCCce------EEEEecCC--------------cceeE Confidence 1111111111100 1132 22 2222222222445678899987753 21121100 00111 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhc Q lcl|NC_014661. 157 SGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAEL 236 (524) Q Consensus 157 SG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~ 236 (524) -+ ++ +|. T Consensus 136 v~--------------------------------------------------------------------e~-----~~~ 142 (377) T protein:vir:96 136 GD--------------------------------------------------------------------IF-----GEI 142 (377) T ss_pred ee--------------------------------------------------------------------cc-----ccc Confidence 00 00 000 Q ss_pred ccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh------ Q lcl|NC_014661. 237 QEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDW------ 310 (524) Q Consensus 237 l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~------ 310 (524) -......|.++.|..-|... ....|-||.+| -.+|.|++|.+-|+..|..-+|+.||.. T Consensus 143 ----~~~~~~~f~~i~l~~~kl~~-------~~~is~~ll~d----s~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P 207 (377) T protein:vir:96 143 ----KGQLKQAFKEQDFSQFKLTA-------FVVIPKDALKF----GPKWLKQFITEQLKEAIAVALELAIVKGNGLLQP 207 (377) T ss_pred ----ccccCccceeEeeeeeeEEe-------echhhHHHhhc----chhhHHHHHHHHHHHHHHHHHhhceEeccCCCcc Confidence 00113456666776666654 34577777766 4678899999999999999999999862 Q ss_pred --Hhhhhhhhhhccccccccccceecc-cc---cccccccchHHHHHHHHHHHHHHHHHHHHhhccccCcc-EEEeCHHH Q lcl|NC_014661. 311 --INYSAQVGKTGQTLTVGSKAGVFDF-QD---PIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGN-FIIASRNV 383 (524) Q Consensus 311 --l~~~A~~~k~~~~~~~~~~aG~fdl-~~---~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn-~~v~S~~v 383 (524) |.+-.....+...... ...++.+- .. ..+.. .....+.+..|...+-.... ....+..|+ +.++.|.. T Consensus 208 ~Gil~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~---~~~~~~~~~a~~~mn~~t 282 (377) T protein:vir:96 208 VGLLKDLSQPTVDQSTGR-DITTYKTDKEAIADLSDLD-PDTAVELLVPVMKHLSVNDK---KHPLKIAGQVKLLLNPED 282 (377) T ss_pred eeeeeccccccccccccc-cccceeeccccccccccCC-hhHHHHHHHHHHHhhccccc---cccccccCceEEEEchhh Confidence 2221111111111110 00111110 00 00000 11112222222222211111 011111122 34566665 Q ss_pred HHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCC--ccceeEeeccccccccc Q lcl|NC_014661. 384 VNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNE--MDAGIYYAPYVALTPLR 461 (524) Q Consensus 384 a~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~--~d~g~fyaPYv~~~~~~ 461 (524) +.-+. +.+...++.| .+.-.|.=.++|..++..|..-++.|..+..- ...++=...|.+..+.+ T Consensus 283 ~~~~~--~~~~~~~~~G------------~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~ 348 (377) T protein:vir:96 283 RWTLE--AKFTSRNQFG------------EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAME 348 (377) T ss_pred HHhcc--ccccccCCCC------------CceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhc Confidence 44332 2221211111 11222332345666666665555555432210 00111111121111111 Q ss_pred ccCcccccceeeeeeeec-eeeCCcccccCCccccceeeccc Q lcl|NC_014661. 462 GADPKNFQPVLGFKTRYG-IGINPLADTAAQQPAGNARIANG 502 (524) Q Consensus 462 ~~Dp~s~qP~~~~~tRY~-l~~nP~~~~~~~~~~~~~~~~~g 502 (524) -|=.+=.+.|++ ..++|=+ .-+..++-| T Consensus 349 ------d~~~f~~~~r~dG~~~d~~a-------~~vl~l~~~ 377 (377) T protein:vir:96 349 ------DLQLYLTKNYFYGKAKDNHT-------AALLTLAGG 377 (377) T ss_pred ------CCeEEEEEEEEcCEEecCCc-------EEEEEEecC Confidence 111122222322 2222211 111122222 No 89 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=63.61 E-value=0.31 Score=23.34 Aligned_cols=304 Identities=10% Similarity=0.040 Sum_probs=125.4 Q ss_pred hhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccccccccccccCcchh-hHHHHHHHhhhhhhceee Q lcl|NC_014661. 38 MFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGV 116 (524) Q Consensus 38 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GV 116 (524) +.|+|+....- +.|...... +..+ .|. .... ++++.. ..-|.+. .+++.+..+.+..+++.+ T Consensus 1 ~~~~~~~~~~~-~~f~~~~~~---~~~~-~a~----------~~~~-~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:93 1 MEQTQKLKLNL-QHFASNNVK---PQVF-NPD----------NVMM-HEKKDG-TLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CchhHHHHHHH-HHHHHhhhh---hhhc-ccc----------cccc-cCCCcc-eechhHHHHHHHHHHhhchhhhhcce Confidence 33333332211 111110000 0000 010 0000 111111 1112233 456666778888999999 Q ss_pred ecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 117 QPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATG 196 (524) Q Consensus 117 QPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g 196 (524) -||++++--| .-... +. .+.| T Consensus 64 ~~~~~~~~~i-------p~~~~---~~---------~a~~---------------------------------------- 84 (324) T protein:vir:93 64 EPMEGTEKKF-------TFWAD---KP---------GAYW---------------------------------------- 84 (324) T ss_pred eeccCCceEE-------EEEec---Cc---------ceee---------------------------------------- Confidence 9998875322 11100 00 0000 Q ss_pred ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 197 AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 197 ~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 276 (524) + +| +..+++..-++++++++.+..+-....|-||. T Consensus 85 ----------------------------v--------~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell 119 (324) T protein:vir:93 85 ----------------------------V--------GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEFL 119 (324) T ss_pred ----------------------------e--------cC---------CccccccccceeEEEEEeEEEEEeehhhHHHH Confidence 0 01 11223334455677777777777788999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) +|-. .|.+++|.+-|+..|...+++.+|..--... .+.|+++........ ......+. T Consensus 120 ~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~------------~~~~~~~~~~~~~~~------~~~~~~~~ 177 (324) T protein:vir:93 120 NYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------------FGKSIAQSIEKTNKV------IKGDFTQD 177 (324) T ss_pred hcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------cCcccccccccccee------ccccccHH Confidence 9953 4678999999999999999998876422111 112222211100000 00011122 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCC--Ccce Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYA--RQDY 434 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy 434 (524) .|.++-+.|.. .+.....++|++.....|..... +.|.- ...+.. -++|.| ++|++.+.. +... T Consensus 178 ~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d-----~~G~~--~~~~~~----~~~l~G-~PVv~~~~~~~~~~~ 243 (324) T protein:vir:93 178 NIIDLEALLED--DELEANAFISKTQNRSLLRKIVD-----PETKE--RIYDRN----SDSLDG-LPVVNLKSSNLKRGE 243 (324) T ss_pred HHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhC-----CCCCe--eecCCC----CCcccc-eeeEeecCCCCCcce Confidence 23333333322 23356689999999999975421 11211 111111 245766 688886653 3223 Q ss_pred EEEE--------EecCCCccceeEeecccccccccccCcc------cccceeeeeeeeceee-CC--cccccCCccccce Q lcl|NC_014661. 435 FTIG--------YKGDNEMDAGIYYAPYVALTPLRGADPK------NFQPVLGFKTRYGIGI-NP--LADTAAQQPAGNA 497 (524) Q Consensus 435 ~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~------s~qP~~~~~tRY~l~~-nP--~~~~~~~~~~~~~ 497 (524) +++| ..+.-+.+ ...+..+......|.. .-|=.+=+..||+..+ +| |+. .+.+.++. T Consensus 244 i~~gdfs~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~-l~~a~~~~- 317 (324) T protein:vir:93 244 LITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK-LVPADKRT- 317 (324) T ss_pred EEEEecceEEEEEecCcEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEE-EecccccC- Confidence 3333 33222111 0001001110000100 1122333445555542 23 110 01111000 Q ss_pred eeccccchhhhhcccc Q lcl|NC_014661. 498 RIANGMPSIANSVGKN 513 (524) Q Consensus 498 ~~~~g~~~~a~~~~~~ 513 (524) ..+ .++- T Consensus 318 ~~~---------~~~~ 324 (324) T protein:vir:93 318 DSV---------PGEV 324 (324) T ss_pred CCC---------CCCC Confidence 000 0110 No 90 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=62.30 E-value=0.34 Score=23.17 Aligned_cols=358 Identities=12% Similarity=0.062 Sum_probs=131.7 Q ss_pred CCcccchHHHHHHhhhhhhcc--------------CCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhh---hcccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEA--------------EGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLA---EAFGG 63 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~--------------~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~---~~~~~ 63 (524) |.++|+. |+++.+-+.+. +..-+..+.+.. +.. |+++-+.+.+..+-....+. .+... T Consensus 3 ~~e~lke---l~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e-~~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 77 (421) T protein:vir:13 3 LFERLKE---LRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALE-REK-IEARMEIIEEEIESVMTAIDEERKNTNF 77 (421) T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 5555542 44444443322 111111110001 011 11111111111110000000 00000 Q ss_pred ccccccccc-----------------ccccc---hhhhccccccccc---cccCcchhhHHHHHHHhhhhhhceeeecCC Q lcl|NC_014661. 64 FLTEAEIGG-----------------DHGYD---PQNIAAGQTSGAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMQ 120 (524) Q Consensus 64 ~l~ea~~~~-----------------~~g~~---~~~i~est~tg~v---~~~~P~Li~l~Rra~~nLIa~DI~GVQPmT 120 (524) .-......+ .+|.. ...-.-+++.|.+ ..+.+. ++..+.+..+-.+++.+.||+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~---Ii~~~~~~~~l~~l~~~~~~~ 154 (421) T protein:vir:13 78 TGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNE---FEKLKEGYPSLKEHCHVIPVN 154 (421) T ss_pred cccccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHH---HHHHHHhhhhhhhhceeeecc Confidence 000000000 00000 0000001111211 112223 333344556678889999998 Q ss_pred CcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 121 GPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTL 200 (524) Q Consensus 121 GPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~ 200 (524) ++++-+- +...... +.+. T Consensus 155 ~~~~~~~-----~~~~~~~--------------~~~~------------------------------------------- 172 (421) T protein:vir:13 155 RNAGKMP-----VRAGASV--------------DKLA------------------------------------------- 172 (421) T ss_pred CCceEEE-----EeecCCc--------------ccee------------------------------------------- Confidence 8765321 1110000 0000 Q ss_pred CCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014661. 201 ATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLR 280 (524) Q Consensus 201 a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLK 280 (524) ..++| ...++-..++++++...+.-+-...+|-||.+|-- T Consensus 173 -----------------------~~~E~-----------------~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~ 212 (421) T protein:vir:13 173 -----------------------NLAKD-----------------TELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE 212 (421) T ss_pred -----------------------ecccc-----------------ccccccccceeEEEeeeeeeEeehhhhHHHHhhhH Confidence 00000 11222233445555555555556779999999842 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHH Q lcl|NC_014661. 281 AVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDK 360 (524) Q Consensus 281 AvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~ 360 (524) .|.++.|.+-|+..+..-+|..|+..+..+. +..++.++ +..+.++..+.. T Consensus 213 ----~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~------------~~~~~~~~-------------d~i~~~~~~l~~ 263 (421) T protein:vir:13 213 ----INFLEFVNEEFAEFAVNTENAEIVKQAKAVL------------AEETINDY-------------AGLVKTINSLVP 263 (421) T ss_pred ----HHHHHHHHHHHHHHHHHHhhhhHhhhhhhcc------------ccccccch-------------HHHHHHHHHhhh Confidence 4568888888888888888888775332110 11222221 234445444432 Q ss_pred HHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEe Q lcl|NC_014661. 361 ESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYK 440 (524) Q Consensus 361 ~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K 440 (524) .+.....+|+++.....|....- +.|. -+..+.... --++|.| ++|++..+.+.. -. T Consensus 264 ---------~~~~~a~~v~n~~~~~~l~~lkd-----~~G~--~i~~~~~~~-~~~tl~G-~pV~~~~~~~~~-----~~ 320 (421) T protein:vir:13 264 ---------NARKRAIIVTNSDGRAYLDGLMD-----KQGR--PLLKELSDG-GDLVFKG-RPVIELEESIFD-----VG 320 (421) T ss_pred ---------hhcCCCEEEEcHHHHHHHHHhhc-----CCCc--eeecCcCCC-CCceecc-eeeEEecccccc-----CC Confidence 22346778999999998875321 1111 011111110 0246777 588877664421 00 Q ss_pred cCCCccceeEeecccc--------cccccccCc---ccccceeeeeeeeceee-CCcc-cccCCccccceeeccccchhh Q lcl|NC_014661. 441 GDNEMDAGIYYAPYVA--------LTPLRGADP---KNFQPVLGFKTRYGIGI-NPLA-DTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 441 G~~~~d~g~fyaPYv~--------~~~~~~~Dp---~s~qP~~~~~tRY~l~~-nP~~-~~~~~~~~~~~~~~~g~~~~a 507 (524) + +..+||+-+-. ...+...+- ..-+=.+-+..||+.++ +|=+ ....-.........++.+.-+ T Consensus 321 ~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~ 396 (421) T protein:vir:13 321 D----ETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSS 396 (421) T ss_pred C----ceEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCCC Confidence 0 11122221110 000111111 12223455566665442 1100 000000000111112233333 Q ss_pred hhccccceeeeeeeecC Q lcl|NC_014661. 508 NSVGKNGYFRRVLVKGI 524 (524) Q Consensus 508 ~~~~~~~~~r~~~v~~~ 524 (524) ..+++|+=-=+-.||-- T Consensus 397 ~~~~~~~~~~~~~~~~~ 413 (421) T protein:vir:13 397 PRSGKNKNESKEEIKEE 413 (421) T ss_pred CcCCCCccccchheeec Confidence 34444433333333322 No 91 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=61.33 E-value=0.36 Score=23.05 Aligned_cols=344 Identities=10% Similarity=0.041 Sum_probs=127.7 Q ss_pred CCcccch-----HHHHHHhhhhhhcc---CCCcchhhhhhhhhhhhhhhH---HHHHhhhhc--cccchhhhcccccccc Q lcl|NC_014661. 1 MSTQIKT-----KAQLVADWKPLLEA---EGAPEIAQGKHAIIAKMFENQ---EADIKSDAA--YRDEKLAEAFGGFLTE 67 (524) Q Consensus 1 ~~~~~~~-----~~~l~~kw~p~l~~---~~~~~~~~~~~~~~~~~~enq---~~~~~~~~~--~~~~~~~~~~~~~l~e 67 (524) .-.++.. .++..++...+.+. +...+....+.+ ...|-+.. |+.+.+... .......+++...... T Consensus 13 ~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~-~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (379) T protein:vir:10 13 IKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSD-MAALQAHADKLDVKLKEKAKSEDKSDSLVKSITENFND 91 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHh Confidence 1111111 11122222222111 000011111111 11221111 111111110 0000001111000000 Q ss_pred cc-cccccccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccc Q lcl|NC_014661. 68 AE-IGGDHGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKE 144 (524) Q Consensus 68 a~-~~~~~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~e 144 (524) .. .-.........-+..+++++....=|.-+ .+++..-....-.+++.|.||++++.-| .-.. T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~-------~~~~------- 157 (379) T protein:vir:10 92 IKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTF-------VREN------- 157 (379) T ss_pred HHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEE-------EEee------- Confidence 00 00000000000000111111111112211 2334344456677889999998775322 1110 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccccccc Q lcl|NC_014661. 145 AFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVE 224 (524) Q Consensus 145 Af~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~ 224 (524) ++.+.. T Consensus 158 ----------~~~~~~---------------------------------------------------------------- 163 (379) T protein:vir:10 158 ----------GAGEGA---------------------------------------------------------------- 163 (379) T ss_pred ----------cCCCcc---------------------------------------------------------------- Confidence 000000 Q ss_pred ccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|NC_014661. 225 IAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEIN 304 (524) Q Consensus 225 ~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEIN 304 (524) ..-.+| +...+++..++++++..+|.=+--..+|-||.||--. .++.|.+-|+..|..-+| T Consensus 164 -----~~~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~-----l~~~i~~~la~~~~~~~~ 224 (379) T protein:vir:10 164 -----IGAQVE---------GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPF-----LTSFIPNALRRDYAKAEN 224 (379) T ss_pred -----cccccC---------CccccccccceeeeEeeeeeEEeeehhhHHHHhhHHH-----HHHHHHHHHHHHHHHHHH Confidence 000011 1223444555666666666656667899999999632 588899999999998898 Q ss_pred HHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHH Q lcl|NC_014661. 305 REVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVV 384 (524) Q Consensus 305 REii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va 384 (524) ..++..+...+. .+++-. .+ ...++..+.++.++.. ..+ ..+.+|++|... T Consensus 225 ~~~~~g~~~~~~-------------~~~~~~---~~----~~~~d~i~~~~~~~~~--------~~~-~~~~~vmn~~~~ 275 (379) T protein:vir:10 225 AAFNAVLAANAT-------------ASTEII---TN----KNKVEMLINEIAKQEN--------LDF-PVTAIVLRPTDY 275 (379) T ss_pred HHHhcccccccc-------------cccccc---cC----cccHHHHHHHHHhhhh--------ccC-CCCEEEEcHHHH Confidence 888765543221 111100 11 0112333333333321 233 567789999998 Q ss_pred HHHhcCCccccccccccccccccccCcce-EEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccc- Q lcl|NC_014661. 385 NVLASVDTSVTPAAQGLARGLNTDTTKAV-FAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRG- 462 (524) Q Consensus 385 ~~L~~~~~~~~~~a~~~~~~~~~d~~~~~-~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~- 462 (524) ..|..... +.|.-- +..+.+.+. -.-+|.| ++|+++++.+...+++|=-.. .-+++- ....+.. T Consensus 276 ~~l~~lkd-----~~G~~l-~~~~~~~~~~~~~~l~G-~pvv~s~~~~ag~~~~gdf~~----~~~~~~---~~~~i~~~ 341 (379) T protein:vir:10 276 YDILVTQK-----SVGAGY-GLPGVVTQDNGVLRING-IPLFRATWLAANKYYVGDWTR----VTKVTT---EGLSLEFS 341 (379) T ss_pred HHHHHhhc-----cCCcee-ccCCccCCCCCcceecc-eeeEecCCCCCCceEEeeccc----EEEEEE---eceEEEEe Confidence 88865321 111000 000110000 0014666 799999998776655542211 112221 1111111 Q ss_pred cC----cccccceeeeeeeecee-eCCcccccCCccccceeeccccchh Q lcl|NC_014661. 463 AD----PKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSI 506 (524) Q Consensus 463 ~D----p~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~ 506 (524) .+ -.+-+=.+=+..|+|+. .+|=+ +.++. ...+ T Consensus 342 ~~~~~~f~~~~~~~r~~~R~~~~v~~p~a---------~v~~~--~~~~ 379 (379) T protein:vir:10 342 EVEGTNFVKNNITARIEAQVALAVEQPAA---------LIFGD--FTAV 379 (379) T ss_pred ecccccccCCcEEEEEEEEeccEEecCcc---------EEEEE--ecCC Confidence 11 12222233334577553 45511 11110 0000 No 92 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=59.40 E-value=0.39 Score=22.81 Aligned_cols=335 Identities=15% Similarity=0.039 Sum_probs=113.1 Q ss_pred CC-cccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhh------hhcccc---chhhhccccccccccc Q lcl|NC_014661. 1 MS-TQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKS------DAAYRD---EKLAEAFGGFLTEAEI 70 (524) Q Consensus 1 ~~-~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~------~~~~~~---~~~~~~~~~~l~ea~~ 70 (524) |. ++...-+.|..+..-+-+. +. .. ..-++.+++.+.. ....+. +.........+...+- T Consensus 173 ~~~~~~~~~e~l~~~~e~~~~~-----~~----~~-~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~ 242 (543) T protein:vir:81 173 LRARALSAIEKMQGASDNVRAA-----AT----KI-IERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEK 242 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-----HH----HH-HHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhh Confidence 11 1111112222222211110 00 00 0001111111100 000000 0000000000000000 Q ss_pred ccccccchh-hhccccccccccccCcchhhHHHHHH-HhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccc Q lcl|NC_014661. 71 GGDHGYDPQ-NIAAGQTSGAVTQIGPAVMGMVRRAI-PNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHP 148 (524) Q Consensus 71 ~~~~g~~~~-~i~est~tg~v~~~~P~Li~l~Rra~-~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~ 148 (524) .-.... ...-++++|.+.--....-.++.+.. +.-+...++-|.|++|..- +.-... T Consensus 243 ---~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~--------~~~~~~---------- 301 (543) T protein:vir:81 243 ---RAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVW--------HGVSSA---------- 301 (543) T ss_pred ---hhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceE--------EEEecC---------- Confidence 000000 00000111111000000111121121 1122333444444432210 100000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccccccccccc Q lcl|NC_014661. 149 MYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEG 228 (524) Q Consensus 149 fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~G 228 (524) .+.+.| + T Consensus 302 --~~~a~~--------------------------------------------------------------------v--- 308 (543) T protein:vir:81 302 --AVQWSW--------------------------------------------------------------------D--- 308 (543) T ss_pred --Ccceee--------------------------------------------------------------------c--- Confidence 000000 0 Q ss_pred ccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_014661. 229 MATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVI 308 (524) Q Consensus 229 m~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii 308 (524) +| +..+++-..+++.++++++.-+=...+|-||.+|- + |.++.|.+-|...|...+|+-|| T Consensus 309 -----~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d~ail 369 (543) T protein:vir:81 309 -----AE---------FEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE--A---NVTETVALLFAEGKDELEAVTLT 369 (543) T ss_pred -----cc---------CccccccccccceeeeeeeeeEeeehhhHHHHhcc--H---HHHHHHHHHHHHHHHHHHHHHHh Confidence 01 01122333455667777777777788999999873 2 67999999999999999999887 Q ss_pred hhHhhhhhhhhhccccccccccceeccccc-----ccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHH Q lcl|NC_014661. 309 DWINYSAQVGKTGQTLTVGSKAGVFDFQDP-----IDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNV 383 (524) Q Consensus 309 ~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~-----~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~v 383 (524) .. .-+.+.+.|++..... ..........+-+..|+..+. ..+.....+|++|.+ T Consensus 370 ~G------------~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~---------~~~~~~~~~v~n~~~ 428 (543) T protein:vir:81 370 TG------------TGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLA---------ARHRRQGAWLANNLI 428 (543) T ss_pred cc------------CCCCcccccchhhcccccccccccccccccHHHHHHHHHhhh---------ccccCCcEEEEcHHH Confidence 42 0111123333221100 000000111122333333332 233334467899999 Q ss_pred HHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcce----------EEEEEecCCCccceeEeec Q lcl|NC_014661. 384 VNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDY----------FTIGYKGDNEMDAGIYYAP 453 (524) Q Consensus 384 a~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----------~~vG~KG~~~~d~g~fyaP 453 (524) ...|..... +.|.-- +.....+. -++|.| ++||+..+.+..- |++|-- ..+++.. T Consensus 429 ~~~l~~lkd-----~~G~~l-~~~~~~g~--~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~i~~gd~------~~~~i~~ 493 (543) T protein:vir:81 429 YNKIRQFDT-----QGGAGL-WTTIGNGE--PSQLLG-RPVGEAEAMDANWNTSASADNFVLLYGNF------QNYVIAD 493 (543) T ss_pred HHHHHHhhc-----CCCcee-ccCcCCCC--Cccccc-eeeEEeccccccccccccCCcceEEEeec------cceeEEe Confidence 999975321 111000 01111111 146776 6999888754321 111111 0011110 Q ss_pred ccccccccccCcc--------cccceeeeeeeecee-eCCcccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 454 YVALTPLRGADPK--------NFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 454 Yv~~~~~~~~Dp~--------s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) . .. +.-.+||. ..+=.+-+..|+|.. .||=+ +..+.- +-.+ T Consensus 494 ~-~~-~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A---------~~~l~~-----~~~a 543 (543) T protein:vir:81 494 R-IG-MTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNA---------FRLLNV-----ETAS 543 (543) T ss_pred e-cc-cEEEEeccccccchhhcCceEEEEEEeeccEeecccc---------eEEEEe-----cccC Confidence 0 01 00112332 223344555677764 34421 111111 1111 No 93 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=57.70 E-value=0.43 Score=22.60 Aligned_cols=328 Identities=15% Similarity=0.137 Sum_probs=118.4 Q ss_pred CCcccchHHHHHHhhhh--hhccCCCcchhh--hhhhhhhhhh--hhHHHHHhh-hhccccchhhhcccccccccccccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKP--LLEAEGAPEIAQ--GKHAIIAKMF--ENQEADIKS-DAAYRDEKLAEAFGGFLTEAEIGGD 73 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p--~l~~~~~~~~~~--~~~~~~~~~~--enq~~~~~~-~~~~~~~~~~~~~~~~l~ea~~~~~ 73 (524) |+-.-.. .-++++.+ .+...+.+.-+. -.|.+.+... -|..+.++. ...+.++... T Consensus 1 ~a~~~a~--~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~--------------- 63 (366) T protein:vir:57 1 MAAAVAV--PVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLS--------------- 63 (366) T ss_pred Ccccccc--cccccccccccccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhh--------------- Confidence 2221111 01111111 111111111000 0111211110 011111100 0011111110 Q ss_pred cccchhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccc Q lcl|NC_014661. 74 HGYDPQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYA 151 (524) Q Consensus 74 ~g~~~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnE 151 (524) ..+..++++|.+. =|.-+ .++.+.-+..+...+ |++.+.+++|-+ +|...+. +. T Consensus 64 -----~a~~~~~~~Gg~l--vP~~~~~~ii~~l~~~s~l~~l-g~~~v~~~~g~~-----~~p~~t~---~~-------- 119 (366) T protein:vir:57 64 -----MAISTAAGSGGAL--IPQNMQNEVIELLRDRTVVRIL-GARSIPLPNGNL-----SMPRLSG---GA-------- 119 (366) T ss_pred -----hhccccccCCccc--cchhHHHHHHHHHhhhcchhhh-ceeeeecCCCce-----EEEEEeC---Cc-------- Confidence 0111111122110 02211 122222222222222 333222233211 0110000 00 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccc Q lcl|NC_014661. 152 PDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMAT 231 (524) Q Consensus 152 adt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~T 231 (524) .+ +-+ T Consensus 120 -~a--------------------------------------------------------------------~wv------ 124 (366) T protein:vir:57 120 -TA--------------------------------------------------------------------GYV------ 124 (366) T ss_pred -ce--------------------------------------------------------------------eee------ Confidence 00 000 Q ss_pred hhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhH Q lcl|NC_014661. 232 SIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWI 311 (524) Q Consensus 232 s~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l 311 (524) +| +...++...+++++++..|.-+-...+|-||.+|-- .|.|+.|.+-|...|...+++.||..= T Consensus 125 --~E---------~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~ 189 (366) T protein:vir:57 125 --GE---------GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG----FNVEQLLLGDILSAIATREDKAFLRDD 189 (366) T ss_pred --cc---------CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 11 122334445567777777777777889999998853 457899999999999999998888541 Q ss_pred hhhhhhhhhccccccccccceeccccccc----ccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHH Q lcl|NC_014661. 312 NYSAQVGKTGQTLTVGSKAGVFDFQDPID----VRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVL 387 (524) Q Consensus 312 ~~~A~~~k~~~~~~~~~~aG~fdl~~~~d----~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L 387 (524) -+ ...+.|++....... ..+. +.. +..+-..++.+.........+......|+++.....| T Consensus 190 G~------------~~~p~Gi~~~~~~~~~~~~~~~t--~~~-~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L 254 (366) T protein:vir:57 190 GT------------GDTPKGMKAVATAANRLVAWTGT--AIN-LTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTL 254 (366) T ss_pred CC------------Cccccceeeccccccceeecccc--ccc-hhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHH Confidence 10 112233332111100 0000 000 0111111111212222222333456678999999998 Q ss_pred hcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc----------------eEEEEEecCCCccceeEe Q lcl|NC_014661. 388 ASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD----------------YFTIGYKGDNEMDAGIYY 451 (524) Q Consensus 388 ~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~vG~KG~~~~d~g~fy 451 (524) ..... +.|.- +-.+.++ |+|.| |+|+++.+.|.+ ++++|-.+..+.+ . T Consensus 255 ~~lkd-----~~G~~--l~~~~~~----g~l~G-~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~----~ 318 (366) T protein:vir:57 255 FGLRD-----GNGNK--VYPEMSQ----GILKG-YPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVD----F 318 (366) T ss_pred Hhhhc-----cCCce--eccCCCC----Ceecc-eeeEEccccccccccCCCccEEEEEecceEEEEEecceEEE----E Confidence 75321 11111 1112222 57877 799998876542 1222222222211 1 Q ss_pred ecccccccccccCcc--------cccceeeeeeeeceee-CCcccccCCccccceeeccccch Q lcl|NC_014661. 452 APYVALTPLRGADPK--------NFQPVLGFKTRYGIGI-NPLADTAAQQPAGNARIANGMPS 505 (524) Q Consensus 452 aPYv~~~~~~~~Dp~--------s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~g~~~ 505 (524) .++... .|+. +-+=.+=...||++.+ +| ..+ .+..|..| T Consensus 319 ~~ea~~-----~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~---------~a~-~~lt~~~~ 366 (366) T protein:vir:57 319 STEATY-----KDADGQLVSAFARNQSLIRVVTEHDIGFRHP---------EGL-VLGTGVIW 366 (366) T ss_pred eecccc-----ccccccchhhhhcCceeEEeeeeeCcEeecc---------ccE-EEEecccC Confidence 111000 0111 1112333455666654 23 122 33344666 No 94 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=57.46 E-value=0.43 Score=22.57 Aligned_cols=347 Identities=14% Similarity=0.183 Sum_probs=135.1 Q ss_pred CCcccchHHH----HHHhhhhhhccCCCcchhhhh----------hhhhhhh--hhhHHHHH----hhhhcc---ccc-- Q lcl|NC_014661. 1 MSTQIKTKAQ----LVADWKPLLEAEGAPEIAQGK----------HAIIAKM--FENQEADI----KSDAAY---RDE-- 55 (524) Q Consensus 1 ~~~~~~~~~~----l~~kw~p~l~~~~~~~~~~~~----------~~~~~~~--~enq~~~~----~~~~~~---~~~-- 55 (524) |+-.|+.-++ |.++..-+-+.. ...+...+ +.+-++| +|++.+++ .+...- ... T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~-~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKN-DKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKV 79 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 7776666554 444444332111 01111110 1111111 22322222 111000 000 Q ss_pred --hhhhcccccccccccccccccchhhhcccccc-ccc---cccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeee Q lcl|NC_014661. 56 --KLAEAFGGFLTEAEIGGDHGYDPQNIAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFAL 129 (524) Q Consensus 56 --~~~~~~~~~l~ea~~~~~~g~~~~~i~est~t-g~v---~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAM 129 (524) ....+|+.++-..........+.+.+..++.+ |.+ ..+.+.++.+.| ...+-.+++-+.||++++..+.- T Consensus 80 ~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~- 155 (401) T protein:vir:44 80 AAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLK---DEVVMRQEATVITVGGSDYKKLV- 155 (401) T ss_pred hHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEE- Confidence 00223333332221111111222222322221 111 233444555554 35566788999999887532210 Q ss_pred eeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccc Q lcl|NC_014661. 130 RAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAEL 209 (524) Q Consensus 130 RSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~ 209 (524) ... + +...|-+ T Consensus 156 ------~~~---~---------~~a~wv~--------------------------------------------------- 166 (401) T protein:vir:44 156 ------NLG---G---------TASGWVG--------------------------------------------------- 166 (401) T ss_pred ------ecC---C---------ccceeec--------------------------------------------------- Confidence 000 0 0000100 Q ss_pred cccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014661. 210 DAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADA 289 (524) Q Consensus 210 d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEa 289 (524) +|-. ..+ +....|.+..|.+.| -+--..+|-||.+|- .+|.++ T Consensus 167 -----------------E~~~--~~~-------~~~~~~~~v~~~~~k-------~~~~~~iS~ell~ds----~~~l~~ 209 (401) T protein:vir:44 167 -----------------ETDT--RSQ-------TATSRLGLIEPFMGE-------IYGNPQATQKMLDDA----FFNVEA 209 (401) T ss_pred -----------------cccc--cCc-------cccccceeeeeehhh-------eeeehhhhHHHHhcc----hHHHHH Confidence 0000 000 011234444444444 444567899999984 457799 Q ss_pred HHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceeccccccccc---------------ccchHHHHHHHH Q lcl|NC_014661. 290 ELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVR---------------GARWAGESFKAL 354 (524) Q Consensus 290 ELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~---------------~~~~a~E~~r~L 354 (524) +|.+-|+..|...+++.+|.. . ..+.|.|++......... ...-..+....| T Consensus 210 ~i~~~la~ai~~~~~~~~l~G------------~-G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~ 276 (401) T protein:vir:44 210 WINSELATEFAEQEEIAFTTG------------D-GTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKL 276 (401) T ss_pred HHHHHHHHHHHHHHHhhhhcc------------C-CCCccceeeccccccccccccccccccccccccccccCHHHHHHH Confidence 999999999999888888842 0 001233333221110000 000001222223 Q ss_pred HHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCC--- Q lcl|NC_014661. 355 LFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYAR--- 431 (524) Q Consensus 355 ~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--- 431 (524) +..+ . ..+..+..+|+++.....|....- +.|. .=+..+.+.. --++|.| ++|+++...| T Consensus 277 ~~~l-------~--~~~~~~a~~v~n~~~~~~L~~lkd-----~~G~-~l~~~~~~~g-~~~~l~G-~PVv~~~~~p~~~ 339 (401) T protein:vir:44 277 IYTL-------R--KAHRTGAKFMMNNNSLFAIRLLKD-----TEGN-YLWRPGLELG-QPSSLAG-YGIAENEQMPDIA 339 (401) T ss_pred HHhc-------c--hhhhcCCEEEEcHHHHHHHHHhhc-----cCCc-eeecCCcCCC-CCceecc-eeeEEecCcCCcc Confidence 3222 1 122234567899999999875321 1110 0011221110 1246876 6888887643 Q ss_pred --cceEEEEEecCCCccceeEeecccccccccccCcccccceeeeee--eecee-eCCcccccCCccccceeec Q lcl|NC_014661. 432 --QDYFTIGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKT--RYGIG-INPLADTAAQQPAGNARIA 500 (524) Q Consensus 432 --~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t--RY~l~-~nP~~~~~~~~~~~~~~~~ 500 (524) .+.+++| +-.. +|-=+....+....||-.=+-.++|.. |++.. .+|-+...-.- +.. T Consensus 340 ~~~~~i~~G---d~~~----~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~-----~aa 401 (401) T protein:vir:44 340 ADAKAIAFG---NFKR----GYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKI-----AAA 401 (401) T ss_pred CCccEEEEe---ehhc----cEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccceEEEEe-----ecC Confidence 2223333 1100 111010011111234433334444443 66543 45522221111 111 No 95 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=57.15 E-value=0.44 Score=22.54 Aligned_cols=287 Identities=9% Similarity=0.018 Sum_probs=122.2 Q ss_pred Hhhhhccccchhhhcccccccccccccccccchhhhcccc-ccccccccCc-chh-hHHHHHHHhhhhhhceeeecCCCc Q lcl|NC_014661. 46 IKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAGQ-TSGAVTQIGP-AVM-GMVRRAIPNLIAFDICGVQPMQGP 122 (524) Q Consensus 46 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~est-~tg~v~~~~P-~Li-~l~Rra~~nLIa~DI~GVQPmTGP 122 (524) ++- ...+++..-+..+ ++.+....=| .+. .+++.+-+..+..+++.+.||+++ T Consensus 1 ~~~------------------------~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 56 (318) T protein:vir:24 1 MAA------------------------GTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT 56 (318) T ss_pred CCC------------------------CCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 111 1122222222221 1111111112 222 345556667788889999999876 Q ss_pred chheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|NC_014661. 123 TGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLAT 202 (524) Q Consensus 123 TGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~ 202 (524) +.-| .-... +.+ +.| T Consensus 57 ~~~i-------p~~~~---~~~---------a~~---------------------------------------------- 71 (318) T protein:vir:24 57 GQKI-------PHWVG---DVS---------AQW---------------------------------------------- 71 (318) T ss_pred ceEE-------EEEeC---Ccc---------eEE---------------------------------------------- Confidence 4322 11100 000 000 Q ss_pred CCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhh Q lcl|NC_014661. 203 TADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAV 282 (524) Q Consensus 203 ~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAv 282 (524) ++ | +.++++...++++++.+.|..+-...+|-||.+|-. T Consensus 72 ----------------------v~--------E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~-- 110 (318) T protein:vir:24 72 ----------------------IG--------E---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP-- 110 (318) T ss_pred ----------------------ec--------C---------CccccccccceeEEEEeeEEEEEeehhhHHHhhcCh-- Confidence 00 1 112333444567777777777777899999999843 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccc----cchHHHHHHHHHHHH Q lcl|NC_014661. 283 HGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRG----ARWAGESFKALLFQI 358 (524) Q Consensus 283 HGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~----~~~a~E~~r~L~~~i 358 (524) .|.+++|.+.|+..|...|++.+|..--+- .+.|++.......... .-+..+....++ T Consensus 111 --~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 172 (318) T protein:vir:24 111 --ANYLGTMRTKVATAFAMAFDGAAMHGTDSP-------------FPTYIGQTTKAISIADTTGATTVYDQVAVNGL--- 172 (318) T ss_pred --HHHHHHHHHHHHHHHHHHHHHhhhcccCCC-------------CCcccccccccccccccccccchHHHHHHHHH--- Confidence 578999999999999999999998543211 1122221111110000 011111112222 Q ss_pred HHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcc---eE-EEEecCceEEEeeCCCCcce Q lcl|NC_014661. 359 DKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKA---VF-AGILGGRYKVYIDQYARQDY 434 (524) Q Consensus 359 ~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~---~~-~G~l~~~~~vy~D~y~~~dy 434 (524) ..+. -.......+|++|.....|..... +.|.- =+..+.... .+ -+.+.+ ++|++.+..+..- T Consensus 173 ----~~~~--~~~~~~~~~v~n~~~~~~L~~lkd-----~~G~~-l~~~~~~~~~~~~~~~~~i~g-~pv~~~~~~~~~~ 239 (318) T protein:vir:24 173 ----SLLV--NDGKKWTHTLLDDITEPILNGAKD-----QNGRP-LFIESTYGEAASPFRSGRIVA-RPTILSDHVVEGT 239 (318) T ss_pred ----Hhhc--cccCCCCEEEEcHHHHHHHHHhhc-----cCCce-eecCccccCccccccCceEEE-EeeEEeCCCCCCc Confidence 1221 223356788999999999975321 00100 000111111 11 123333 5777777654321 Q ss_pred --EEEEEecCCCccceeEeecccccccccc---------cCccc-----c---cceeeeeeeecee-eCCcccccCCccc Q lcl|NC_014661. 435 --FTIGYKGDNEMDAGIYYAPYVALTPLRG---------ADPKN-----F---QPVLGFKTRYGIG-INPLADTAAQQPA 494 (524) Q Consensus 435 --~~vG~KG~~~~d~g~fyaPYv~~~~~~~---------~Dp~s-----~---qP~~~~~tRY~l~-~nP~~~~~~~~~~ 494 (524) +++| +- +.++|+-.-.+ .++. .|+.. | |=.+=...||+.. .+|= T Consensus 240 ~~~~~g---df---s~~~~~~~~~l-~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~--------- 303 (318) T protein:vir:24 240 TVGFMG---DF---SQLIWGQIGGL-SFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAE--------- 303 (318) T ss_pred cEEEEe---ec---ceEEEEEecCe-EEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEeccc--------- Confidence 1211 11 11233322111 1111 11111 2 2233345677665 3331 Q ss_pred cceeeccccchhhhhcc Q lcl|NC_014661. 495 GNARIANGMPSIANSVG 511 (524) Q Consensus 495 ~~~~~~~g~~~~a~~~~ 511 (524) .+.+|..- - -++..+ T Consensus 304 a~~~i~~~-~-a~~~~~ 318 (318) T protein:vir:24 304 AFVALTNV-V-SGGGEG 318 (318) T ss_pred ceEEEEee-c-cCCCCC Confidence 12222221 0 012222 No 96 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=52.30 E-value=0.56 Score=21.97 Aligned_cols=300 Identities=10% Similarity=0.061 Sum_probs=124.6 Q ss_pred cchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAG 84 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~es 84 (524) |+..+++. ...|+...-+..-| .+ ++. ....+ T Consensus 1 ~~~~~~~~----------------~~~~~f~~~~~~~~---------------------~~-~a~----------~~~~~ 32 (324) T protein:vir:10 1 MEQTQKLK----------------LNLQHFASNNVKPQ---------------------VF-NPD----------NVMMH 32 (324) T ss_pred CCCchHHH----------------HHHHHHHHHhhccc---------------------ee-ccc----------ceecc Confidence 11100000 00000000011111 00 111 00111 Q ss_pred cccccccccCcc-hh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccc Q lcl|NC_014661. 85 QTSGAVTQIGPA-VM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSH 162 (524) Q Consensus 85 t~tg~v~~~~P~-Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~ 162 (524) +..+. .. |. +. .+++.+..+.+..+++-+.||++.+.-|. +... +. .+.| T Consensus 33 ~~~~~--li-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p----~~~~------~~---------~a~~------ 84 (324) T protein:vir:10 33 EKKDG--TL-LNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT----FWAD------KP---------GAYW------ 84 (324) T ss_pred CCCcc--ee-chhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEeC------Cc---------ceeE------ Confidence 11111 01 22 22 34555566777888999999987653221 1100 00 0000 Q ss_pred ccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCC Q lcl|NC_014661. 163 EVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNG 242 (524) Q Consensus 163 ~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGg 242 (524) + +| T Consensus 85 --------------------------------------------------------------v--------~E------- 87 (324) T protein:vir:10 85 --------------------------------------------------------------V--------GE------- 87 (324) T ss_pred --------------------------------------------------------------e--------cc------- Confidence 0 01 Q ss_pred CCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcc Q lcl|NC_014661. 243 SNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQ 322 (524) Q Consensus 243 s~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~ 322 (524) +..+++...+++++++..|..+..-..|-||.+|-. .|.+++|.+.|+..|...+++.+|..--... T Consensus 88 --g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~------- 154 (324) T protein:vir:10 88 --GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------- 154 (324) T ss_pred --CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc------- Confidence 122344455677778888888888889999999864 4679999999999999999999986422111 Q ss_pred ccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccc Q lcl|NC_014661. 323 TLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLA 402 (524) Q Consensus 323 ~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~ 402 (524) .+.|+++........ .. ..-.+..|.++.+.|. ..+...+.+|++|.....|..... ++|.- T Consensus 155 -----~~~~i~~~~~~~~~~---~~---~~~t~~~i~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~l~d-----~~g~~ 216 (324) T protein:vir:10 155 -----FGKSIAQSIEKTNKV---IK---GDFTQDNIIDLEALLE--DDELEANAFISKTQNRSLLRKIVD-----PETKE 216 (324) T ss_pred -----cCcccccccccccee---cc---ccCCHHHHHHHHHhhh--hccCCCCEEEEcHHHHHHHHHhhc-----cCCce Confidence 112222111000000 00 0001222333333332 233457788999999999975321 11211 Q ss_pred cccccccCcceEEEEecCceEEEeeCCCC--cceEEEEEecCCCccceeEeecccccccccc---------cCcc----- Q lcl|NC_014661. 403 RGLNTDTTKAVFAGILGGRYKVYIDQYAR--QDYFTIGYKGDNEMDAGIYYAPYVALTPLRG---------ADPK----- 466 (524) Q Consensus 403 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~---------~Dp~----- 466 (524) -+. +..+ ++|.| ++|++.+.++ ...+++|-. +.+++... ....++. .|+. T Consensus 217 -~~~-~~~~----~~l~G-~PV~~~~~~~~~~~~~~~gd~------~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:10 217 -RIY-DRNS----DTLDG-LPVVNLKSSNLKRGELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred -eec-CCCC----ccccc-eeEEeecCCCCCcceEEEEec------ccEEEEEe-cCcEEEEeecccccccccccccchh Confidence 011 1122 35777 5888877643 223444421 01111111 1111111 1111 Q ss_pred ---cccceeeeeeeece-eeCC--cccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 467 ---NFQPVLGFKTRYGI-GINP--LADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 467 ---s~qP~~~~~tRY~l-~~nP--~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) +-+=.+=...||+. ..|| |+.-. ...+ +-..-++.. T Consensus 283 ~~~~~~~~~r~~~r~d~~v~~~~A~~~l~-~a~~-------~~~~~~~~~ 324 (324) T protein:vir:10 283 LFEQDMVALRATMHVALHIADDKAFAKLV-PADK-------KTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEEccEEecccceEEEE-eccC-------CCCCCCCCC Confidence 11222333456664 3344 11110 0000 000000000 No 97 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=51.38 E-value=0.58 Score=21.87 Aligned_cols=305 Identities=15% Similarity=0.105 Sum_probs=130.9 Q ss_pred CCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 119 MQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAV 198 (524) Q Consensus 119 mTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~ 198 (524) ||.|||+| |.|.....+ -++. .++|. |=.+- ........-.+ ..++.... .+........... T Consensus 1 ~~~~~~i~----s~~~~~~it--v~~l---l~~P~--~I~~~----i~e~~~~~~ia-d~lf~~~~-a~~~~~v~f~~~~ 63 (318) T protein:vir:10 1 MTAPTGIV----SVSDGPAIT--VREL---VGNPL--WIPTA----LKKMMVNQFIS-ESLFRNGG-ANPNGVVAYNEGN 63 (318) T ss_pred CCCCCcce----eeecCCcee--hHHh---hCCch--hHHHH----HHHHHhccchh-hhhhhccc-ccccceeEEEecc Confidence 99999998 556543221 1111 12221 10000 00000000000 00000000 0000000000000 Q ss_pred ccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEE-EEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 199 TLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRI-DKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 199 ~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsI-EK~TVtAKSRALKAEYT~ELAQ 277 (524) ++-. .+....+ +| +.+|+.-.-.. ++....+|-+.||-++|=|.. T Consensus 64 p~~~----------------~~d~e~V--------aE---------ggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~- 109 (318) T protein:vir:10 64 PSFL----------------EDDVADV--------AE---------FGEIPVSAGARGLPRTAFAVKKALGVRVSKEMI- 109 (318) T ss_pred cccc----------------cCcHhhc--------cC---------cccccccCCCCCchhhhhhehhccceeccHHHH- Confidence 0000 0000111 11 12233333333 122223457889999998864 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhh---ccccccccccceecccccccccccchHHHHHHH- Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKT---GQTLTVGSKAGVFDFQDPIDVRGARWAGESFKA- 353 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~---~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~- 353 (524) .-+.+|+-.....-|++-|...+|+.+++.|.......-. .|.....-..+++| +.|..+. T Consensus 110 ---~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~------------A~e~v~~a 174 (318) T protein:vir:10 110 ---DENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAI------------AIEQISTA 174 (318) T ss_pred ---hhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccchh------------hhhhhhhh Confidence 3468899999999999999999999999987543221100 11100000012222 2222221 Q ss_pred HHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcc-ccccccccccccccccCcceEEEEecCceEEEeeCCCCc Q lcl|NC_014661. 354 LLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTS-VTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ 432 (524) Q Consensus 354 L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~-~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 432 (524) +...+.+....-.++=.| ..|.||.+|...+.|.....+ ++...-+.....-...+ ..|.|.+-| ++|..+++.|. T Consensus 175 ~~~~~~a~~~~~~~~~GY-~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~t-g~~~g~~lG-l~vi~s~~~p~ 251 (318) T protein:vir:10 175 APTAYPAGVGSSDEYFGF-IPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWT-GNFPGSVMG-LNVIRSRTFPI 251 (318) T ss_pred hhhhhhhhhhhhhhccCc-cceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccc-ccccceeec-eEEeecCccCC Confidence 111211222222345677 699999999999999765442 11111111000001123 234566656 89999999988 Q ss_pred ceEEEEEecCCCccceeEeeccccccccc----ccCcccccceeeeeeeeceeeCCcccccCCccccceeeccccchhhh Q lcl|NC_014661. 433 DYFTIGYKGDNEMDAGIYYAPYVALTPLR----GADPKNFQPVLGFKTRYGIGINPLADTAAQQPAGNARIANGMPSIAN 508 (524) Q Consensus 433 dy~~vG~KG~~~~d~g~fyaPYv~~~~~~----~~Dp~s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~g~~~~a~ 508 (524) |=.+|==+|. -| ||+-=.|++... .-|| +.+|-..-..|+=-...++.+ .|..+ T Consensus 252 ~~alvlq~g~----vG-~~~d~~pl~~t~~~~egg~~-~g~~~~s~~~~~~~~~~~~V~---------------~PkA~- 309 (318) T protein:vir:10 252 DRVLIMERGT----VG-FYSDTRPLQFTALYPEGNGP-NGGPTESYRADASHKRALAVD---------------QPKAA- 309 (318) T ss_pred CeeEEEecCC----cc-eeeccccceeeecccCCCCC-CCCcchhhheehheeeeeeee---------------Cccee- Confidence 7755543321 11 444333333221 1244 345554444444332222211 11111 Q ss_pred hccccceeeeeeeecC Q lcl|NC_014661. 509 SVGKNGYFRRVLVKGI 524 (524) Q Consensus 509 ~~~~~~~~r~~~v~~~ 524 (524) |+++|| T Consensus 310 ----------~~itgi 315 (318) T protein:vir:10 310 ----------LWLTGI 315 (318) T ss_pred ----------EEEeec Confidence 677777 No 98 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=50.03 E-value=0.62 Score=21.72 Aligned_cols=350 Identities=15% Similarity=0.150 Sum_probs=133.6 Q ss_pred CCcccchHHHHHHhhhhhhccCC-C---------------cchhhhhhhhhh--hhhhhHHHHHhhhhccccchh----- Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEG-A---------------PEIAQGKHAIIA--KMFENQEADIKSDAAYRDEKL----- 57 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~-~---------------~~~~~~~~~~~~--~~~enq~~~~~~~~~~~~~~~----- 57 (524) |+=.|.- ++|.++|.-+.+.-- + -++...++++-. .-++.+++.+.+...-..... T Consensus 1 ~~~~m~l-~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (404) T protein:vir:39 1 MGVKLTV-NQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (404) T ss_pred CChHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 8877744 668888877654300 0 011111111100 000111111111000000000 Q ss_pred --------------hhcccccccccccccccccchhhhcccccc-ccccccCcchh-hHHHHHHHhhhhhhceeeecCCC Q lcl|NC_014661. 58 --------------AEAFGGFLTEAEIGGDHGYDPQNIAAGQTS-GAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQG 121 (524) Q Consensus 58 --------------~~~~~~~l~ea~~~~~~g~~~~~i~est~t-g~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTG 121 (524) ..+|..++... .......+...+..++++ |.+ ..-+.+. .+++.+-++....+++.++||++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~e~~a~~~~t~~~gg~-~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 157 (404) T protein:vir:39 80 GPLNKSEYELKDKFVKEFVNMVRNP-MAFLNTVSSKTETSGSDSAAGL-TIPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (404) T ss_pred cccccchhhhHHHHHHHHHHHHhcc-hhhhhhhhhhhhhcccccCCce-eccHHHHHHHHHHHHhhhhHHhhcceeeccC Confidence 00111110000 000000111111222211 111 1111221 34444556778889999999999 Q ss_pred cchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccC Q lcl|NC_014661. 122 PTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLA 201 (524) Q Consensus 122 PTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a 201 (524) +++-+--.| .... .+.+.|- T Consensus 158 ~~~~~~~~~--~~~~--------------~~~a~~v-------------------------------------------- 177 (404) T protein:vir:39 158 SNGSRVYEK--WTDV--------------TPLTVMD-------------------------------------------- 177 (404) T ss_pred CcceEEEEe--ecCC--------------ccceeee-------------------------------------------- Confidence 877542111 0000 0000010 Q ss_pred CCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014661. 202 TTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRA 281 (524) Q Consensus 202 ~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 281 (524) ++|-. ..| .+...|.++.|++.|..+- ..+|-||.+|- T Consensus 178 ------------------------~Eg~~--~~~-------~~~~~f~~i~~~~~k~~~~-------~~iS~ell~ds-- 215 (404) T protein:vir:39 178 ------------------------AEDGK--IPD-------LDNPRLTIIKYLIKRYAGI-------ITATNTLLKDT-- 215 (404) T ss_pred ------------------------cCccc--ccc-------ccccceeeEEeeeeeEEee-------ehhHHHHHhhc-- Confidence 01100 000 1123466666666666654 44999999984 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHH-HHHHH Q lcl|NC_014661. 282 VHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALL-FQIDK 360 (524) Q Consensus 282 vHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~-~~i~~ 360 (524) ..|.+++|.+-|+..|..-+|..||...-+ .+ ...+..++++ ...++ ..+. T Consensus 216 --~~~l~~~i~~~l~~~~~~~~d~~il~g~g~----------~~--~~~~~~~~~~-------------i~~~~~~~~~- 267 (404) T protein:vir:39 216 --AENILAWLSSWIAKKVVVTRNQAIIAAMGT----------VP--KKPTIAKFDD-------------VITMINTSVD- 267 (404) T ss_pred --hHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------cc--cccccccHHH-------------HHHHHHHhhh- Confidence 256799999999999999999988854211 11 1122232221 11111 1111 Q ss_pred HHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEe Q lcl|NC_014661. 361 ESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYK 440 (524) Q Consensus 361 ~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K 440 (524) ..+.....+||+|.....|..... +.|.- -+..+.+.. -.++|.| ++|++-.+.. ++-. T Consensus 268 --------~~~~~~a~~v~n~~~~~~L~~lkd-----~~G~~-l~~~~~~~~-~~~~l~G-~pV~~~~~~~-----~~~~ 326 (404) T protein:vir:39 268 --------PAIIATSSLLTNQSGLNKLALVKT-----AEGKY-LLEPDPTKP-NSYLIKG-KKVIVVADRW-----LPNS 326 (404) T ss_pred --------hhhccCCEEEEcHHHHHHHHHhhc-----cCCce-eeccCcCCC-Ccceecc-eeEEEecccc-----cCcc Confidence 112234568999999999986421 11100 001111111 1246777 5777633211 1111 Q ss_pred cCCCccceeEeeccccc------cccc-ccCc------ccccceeeeeeeecee-eCCcccccCCccccceeeccccchh Q lcl|NC_014661. 441 GDNEMDAGIYYAPYVAL------TPLR-GADP------KNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSI 506 (524) Q Consensus 441 G~~~~d~g~fyaPYv~~------~~~~-~~Dp------~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~ 506 (524) +... ..+||.-+-.+ ..+. .+++ ...+=.+-...||+.. .+|-+-..-.- ..... ... T Consensus 327 ~~~~--~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~----~~~a~--~~~ 398 (404) T protein:vir:39 327 GSTV--YPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSF----TAIAD--QVG 398 (404) T ss_pred CCCc--cEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEe----ecccc--CCC Confidence 1110 11222211100 0000 0122 1334455566777754 34521100000 00000 000 Q ss_pred hhhccc Q lcl|NC_014661. 507 ANSVGK 512 (524) Q Consensus 507 a~~~~~ 512 (524) ....|| T Consensus 399 ~~~~~~ 404 (404) T protein:vir:39 399 NFTAGK 404 (404) T ss_pred CCCCCC Confidence 012233 No 99 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=44.73 E-value=0.8 Score=21.13 Aligned_cols=354 Identities=13% Similarity=0.124 Sum_probs=117.5 Q ss_pred CCcccchHHHHHHhhhhhhc---------cCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhccccc---cccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLE---------AEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGF---LTEA 68 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~---------~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~---l~ea 68 (524) |-.++ +.|.++..-+-+ .+-.+.....+.. ...-..+|.+.... +...-.....++... +.++ T Consensus 41 l~~ei---~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 115 (435) T protein:vir:80 41 LSSKF---NELTAQIERAEAAERMAAAAAVPVDPNPAAVTAS-AAAPVYAQPKAPEV-KGAKMARMVRALAAARGDAQLA 115 (435) T ss_pred HHHHH---HHHHHHHHHHHHHHHHHHhhcccccchhhhhccc-cccccccccchhhh-hHHHHHHHHHHHHhccchhHHH Confidence 21111 112222221110 0000000000000 00000001000000 000000000000000 0000 Q ss_pred -ccccccccchhhhccccccccccccCcchh------hHHHHHHHhhhhhhc-eeeecCCCcchheeeeeeeecCccCCC Q lcl|NC_014661. 69 -EIGGDHGYDPQNIAAGQTSGAVTQIGPAVM------GMVRRAIPNLIAFDI-CGVQPMQGPTGQVFALRAVYGKDPIAA 140 (524) Q Consensus 69 -~~~~~~g~~~~~i~est~tg~v~~~~P~Li------~l~Rra~~nLIa~DI-~GVQPmTGPTGLIFAMRSrY~~q~~~~ 140 (524) ...-..++... .+...+++. ...+..|| .+++++-++.+...+ +=+.||+.+.- +|...+. T Consensus 116 ~~~~~~~~~~~~-~~~~~~~~~-~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~-------~~p~~~~-- 184 (435) T protein:vir:80 116 SKLAIERGFGEE-VAMSLNTLS-PGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNI-------TIPRLKG-- 184 (435) T ss_pred HHHHHhhhhhhh-hhhhhcccC-CCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCce-------EEEEEeC-- Confidence 00000000000 000000000 01111122 133333334444444 22333332211 1110000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccc Q lcl|NC_014661. 141 GAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAG 220 (524) Q Consensus 141 ~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g 220 (524) +. .+.| T Consensus 185 -~~---------~a~~---------------------------------------------------------------- 190 (435) T protein:vir:80 185 -GA---------IVGY---------------------------------------------------------------- 190 (435) T ss_pred -Cc---------ceee---------------------------------------------------------------- Confidence 00 0000 Q ss_pred ccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_014661. 221 ILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIM 300 (524) Q Consensus 221 ~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEIm 300 (524) + +| +...++...++++++...+.-+-....|-||.+|-.- +.|.|+.|.+-|+.-|. T Consensus 191 ----v--------~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~a~~ 247 (435) T protein:vir:80 191 ----I--------GA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIG 247 (435) T ss_pred ----e--------cc---------CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHHHHH Confidence 0 01 1223444556677777777777778899999999432 45678888888888888 Q ss_pred HHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhh-ccccCccEEEe Q lcl|NC_014661. 301 LEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQ-TGRGAGNFIIA 379 (524) Q Consensus 301 lEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~-T~rg~gn~~v~ 379 (524) ..+++-||.. . | +...+.|++.......+... -.......++..+.+.-..+... ..+ ....+|+ T Consensus 248 ~~~d~a~l~G----~--G------~~~~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~~~~~~~~~-~~~~~vm 313 (435) T protein:vir:80 248 AREDKAFIRD----D--G------TANTPKGLRFWALPGNVITA-SDGSTLQKIETDLGKAILALENADANL-TQPGWIM 313 (435) T ss_pred HHHHHHhhcc----C--C------CCCcccceeecccccceeec-ccccchhhHHHHHHHHHHHhhcccccc-ccCEEEE Confidence 8887777643 1 0 11123454432211110000 00111122222222222222211 123 4567799 Q ss_pred CHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc--------eEEEE--------EecCC Q lcl|NC_014661. 380 SRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD--------YFTIG--------YKGDN 443 (524) Q Consensus 380 S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG--------~KG~~ 443 (524) ++.....|..... +.|. -+..+.++ |+|.| ++||++.+.|.+ .+++| -.+.- T Consensus 314 n~~~~~~L~~lkd-----~~G~--~l~~~~~~----~~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~ 381 (435) T protein:vir:80 314 APRTFRFLEGLRD-----GNGN--KVYPELAN----GMLKG-YPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETL 381 (435) T ss_pred cHHHHHHHHhhhc-----cCCc--eeccCCCC----CeEee-eeeEEeccccccccCCCCcceEEEEEcccEEEEeecce Confidence 9999999976431 1111 11112222 46776 699998886542 12222 22211 Q ss_pred CccceeEeecccccccccccCcccc---cceeeeeeeeceeeC-CcccccCCccccceeeccccchhh Q lcl|NC_014661. 444 EMDAGIYYAPYVALTPLRGADPKNF---QPVLGFKTRYGIGIN-PLADTAAQQPAGNARIANGMPSIA 507 (524) Q Consensus 444 ~~d~g~fyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~n-P~~~~~~~~~~~~~~~~~g~~~~a 507 (524) .. -..+|.-+..-...--..| +=.+=..-|+++.+. |= . +..-+|..|.| T Consensus 382 ~i----~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~---------a-~~~l~~~~~~~ 435 (435) T protein:vir:80 382 EI----DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVE---------S-IAVLSGVAWGA 435 (435) T ss_pred EE----EEeccccccccccchhhhhhcCcceeeeeeeeCcEeeccc---------c-eEEEeccCCCC Confidence 11 1111110000000000001 112223445554432 31 1 23344455544 No 100 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=44.08 E-value=0.82 Score=21.06 Aligned_cols=271 Identities=12% Similarity=0.033 Sum_probs=116.2 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +++. .+. -.++...... ...+..+.....-+...+.. +.... . ..|...+.+.=-.+..+|. +.... T Consensus 1 ma~~---~T~-l~d~iiPev~-~~~v~~~~~~~l~~~~~~~~---d~~l~-g-~~G~tv~iP~~~~ig~a~~---~~~g~ 67 (274) T protein:vir:12 1 MAQG---LTK-TSNQIIPEVL-APMMQAQLEKKLRFASFAEV---DSTLQ-G-QPGDTLTFPAFVYSGDAQV---VAEGE 67 (274) T ss_pred CCcc---eee-hhhhhchHHH-HHHHHHHHHhhhhhccccee---ccccc-C-CCCCEEEEeeecCCCcccc---ccCCC Confidence 1111 000 0000000000 00000000000000000000 00000 0 0111122111000112221 11122 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTL 324 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~ 324 (524) .-...++..+=.+ ++-+-|+-.=+++=| ..+.+ +-|.-.|..+-++.-|..+++.+++..+.+.... T Consensus 68 ~i~~~~lt~~~~~--~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~------- 134 (274) T protein:vir:12 68 KIPTDILETKKRE--AKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------- 134 (274) T ss_pred ccchhhcccceee--EEeeeecceeeecHH--HHHhc--ccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------- Confidence 3334444444443 333444422222221 12333 5688999999999999999999999877643211 Q ss_pred ccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccc Q lcl|NC_014661. 325 TVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARG 404 (524) Q Consensus 325 ~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~ 404 (524) . ....++ .+-+-....++..+. ..+++++++|.|++.|...+...+..++... T Consensus 135 -~--~~~a~~-------------~d~i~dA~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g-- 187 (274) T protein:vir:12 135 -V--NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDASTNFTRATELG-- 187 (274) T ss_pred -c--cccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhhhhhcccccccc-- Confidence 0 011111 222333333333321 1478999999999999876543333322111 Q ss_pred cccccCcceEEEEecCceEEEeeCCCCcceE-EEEEecCCCccceeEeecccccccccc-cCcccccceeeeeeeeceee Q lcl|NC_014661. 405 LNTDTTKAVFAGILGGRYKVYIDQYARQDYF-TIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTRYGIGI 482 (524) Q Consensus 405 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~-~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~~ 482 (524) .....+-.+|.+.| ++||+|...|..-. ++| +|.- .||. --+.. ++. -||..++-.+-..-+||+.. T Consensus 188 --~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~-~gA~-----~~~~-~~~~~-vE~~Rd~~~~~d~i~~~~~y~~~~ 256 (274) T protein:vir:12 188 --DDIIVKGAFGEALG-AIIVRSNKLEAGTAILAK-KGAV-----KLIL-KRDFF-LEVARDASTKTTALYSDKHYVAYL 256 (274) T ss_pred --ccceecccceeecC-eeEEEeCCCCcceEEEEe-ccce-----eeee-cCCce-eccccchhhcccEEEeeeEEEEEE Confidence 01122235788877 79999998875321 221 1211 1221 11112 232 39999999999999999653 Q ss_pred -CCcccccCCccccceeeccccchhhhhccc Q lcl|NC_014661. 483 -NPLADTAAQQPAGNARIANGMPSIANSVGK 512 (524) Q Consensus 483 -nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~ 512 (524) || .++.++.-+- ++-.| T Consensus 257 ~~~---------~~vv~~t~~~----~~~~~ 274 (274) T protein:vir:12 257 YDE---------SKAVKITKGS----GSLEM 274 (274) T ss_pred EcC---------CceEEEEcCC----ccccC Confidence 55 2334444321 22333 No 101 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=42.84 E-value=0.87 Score=20.92 Aligned_cols=315 Identities=16% Similarity=0.113 Sum_probs=123.8 Q ss_pred CC-CcchheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 119 MQ-GPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGA 197 (524) Q Consensus 119 mT-GPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~ 197 (524) |- .++|.--.-|..++.- ++..-|+ |- .-|||.- ...+....... .-....+.....++.....+. T Consensus 1 ~~~~~~~~~~~t~~g~~~~---~~~~~al--~i---e~~~g~V----~~~f~~~s~~~-~~v~~r~~~~G~sv~i~~iG~ 67 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQS---AADKLAL--FL---KVFGGEV----LTAFARTSVTM-PRHMLRSIASGKSAQFPVIGR 67 (347) T ss_pred CCCCccCcccccccccCCc---ccchHHH--HH---HHHHHHH----HHHHHHHHhhh-hhhccccccccceeEeeeccc Confidence 32 3333322223223211 0011111 11 1222211 00000000000 000000011111111111111 Q ss_pred cccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014661. 198 VTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQ 277 (524) Q Consensus 198 ~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQ 277 (524) .+.. .|.-++.+... ..+....|+-++||++ +-+...++-.- T Consensus 68 ~t~~--------------------~~~~g~~l~~~----------~~~~~~~e~~ltiD~~--------~y~~~~VddiD 109 (347) T protein:vir:33 68 TKAA--------------------YLKPGENLDDK----------RKDIKHTEKVIHIDGL--------LTADVLIYDIE 109 (347) T ss_pred eeee--------------------eecCCCCCCCC----------CCCCccceEEEEechh--------hhhhHHHhhHH Confidence 1110 01111111100 1123456777888865 33455677666 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccccc-ccccceecccccccccccchHHH-HHHHHH Q lcl|NC_014661. 278 DLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTV-GSKAGVFDFQDPIDVRGARWAGE-SFKALL 355 (524) Q Consensus 278 DLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~-~~~aG~fdl~~~~d~~~~~~a~E-~~r~L~ 355 (524) +.++ | .|-..|++.-....++..+++-|+..|......-+....... ....+.+... ....+.-|..+ -...+| T Consensus 110 ~~q~-~-~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~tg~~~d~~~~a~~i~ 185 (347) T protein:vir:33 110 DAMN-H-YDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLV--KPTTGSLTDPVELGKAII 185 (347) T ss_pred HHhc-C-CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc--ccccccccchhhhHHHHH Confidence 7776 4 788999999899999999999898776532211100000000 0001111111 11111122222 234444 Q ss_pred HHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceE Q lcl|NC_014661. 356 FQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYF 435 (524) Q Consensus 356 ~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 435 (524) ..|.+.....-.+==--.+-|+|++|+.-++|-.++.|...... + . +....-.+|.+.| ++||.-++-|.-.+ T Consensus 186 ~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~----~-~-~~~~~G~V~~i~G-~~V~~Sn~lp~~~~ 258 (347) T protein:vir:33 186 AQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQ----A-L-LDPERGTIRNVMG-FEVVEVPHLTAGGA 258 (347) T ss_pred HHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccc----c-c-cccccceeEEEec-eeEEEecccccCcc Confidence 44443333333322222478999999999999888877543221 1 1 1233346788877 99999998766433 Q ss_pred E-------EEEe------------cCCCccceeEeecccc----ccc---ccccCcccccceeeeeeeecee-eCCcccc Q lcl|NC_014661. 436 T-------IGYK------------GDNEMDAGIYYAPYVA----LTP---LRGADPKNFQPVLGFKTRYGIG-INPLADT 488 (524) Q Consensus 436 ~-------vG~K------------G~~~~d~g~fyaPYv~----~~~---~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~ 488 (524) + .|-+ +.-.-..||||.|=.. +.. -+.-|++.|-=.|=-+..||.. .+|=. T Consensus 259 ~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~-- 336 (347) T protein:vir:33 259 GDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEA-- 336 (347) T ss_pred ccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhhhHhhhhhhhcCCceecccc-- Confidence 2 1100 0000012344433222 111 1112444444444444444432 22310 Q ss_pred cCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 489 AAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 489 ~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) +.. +..|.+ T Consensus 337 -----av~----------------------i~~~~~ 345 (347) T protein:vir:33 337 -----AGA----------------------IVLPKV 345 (347) T ss_pred -----eEE----------------------EecCCC Confidence 000 011111 No 102 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=42.03 E-value=0.9 Score=20.83 Aligned_cols=340 Identities=16% Similarity=0.148 Sum_probs=113.3 Q ss_pred CCcccchHHHHHHhhhhhhcc----C---------C---Ccchhhhhh---hhhhh--hhhhHHHHHhhhhccc------ Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEA----E---------G---APEIAQGKH---AIIAK--MFENQEADIKSDAAYR------ 53 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~----~---------~---~~~~~~~~~---~~~~~--~~enq~~~~~~~~~~~------ 53 (524) =-..|++-++|+++|.-+.+. . + ..+|.+.+. .+-++ -|+.|.+.+..+-.-. T Consensus 12 ~g~~mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~ 91 (402) T protein:vir:93 12 GGNEMPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGE 91 (402) T ss_pred CCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 113334433444444333211 0 0 011222111 11111 1222222221100000 Q ss_pred -------cchhhhccccccccccccccc------ccch-hhhcccccc-ccccccCcchh--hHHHHHHHhhhhhhceee Q lcl|NC_014661. 54 -------DEKLAEAFGGFLTEAEIGGDH------GYDP-QNIAAGQTS-GAVTQIGPAVM--GMVRRAIPNLIAFDICGV 116 (524) Q Consensus 54 -------~~~~~~~~~~~l~ea~~~~~~------g~~~-~~i~est~t-g~v~~~~P~Li--~l~Rra~~nLIa~DI~GV 116 (524) .+....++..++-...-+..+ +.+. ..+.+++.+ |.. . =|.=+ .+++.....-+-.++|.| T Consensus 92 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~-l-IP~~~~~~Ii~~~~~~~~l~~~~~v 169 (402) T protein:vir:93 92 AYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDK-L-LPKTLSKEIVSEPFAKNQLREKARL 169 (402) T ss_pred cCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCcc-c-cchhHHHHHHHhHHhhhhhhhhcee Confidence 000000010000000000000 0000 001111111 100 0 02111 133333333445677777 Q ss_pred ecCCCcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 117 QPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATG 196 (524) Q Consensus 117 QPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g 196 (524) -|+++.+.- |-.+... .+.| T Consensus 170 ~~~~~~~~p----~~~~~~~----------------~a~~---------------------------------------- 189 (402) T protein:vir:93 170 TNIKGLEIP----RVSYTLD----------------DDDF---------------------------------------- 189 (402) T ss_pred eecCCceee----eeeccCC----------------cccc---------------------------------------- Confidence 776543210 0000000 0000 Q ss_pred ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014661. 197 AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELA 276 (524) Q Consensus 197 ~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 276 (524) +++|-... .+...|.+..|.+ +.-+-...+|-||. T Consensus 190 ----------------------------v~Eg~~~~----------~~~~~f~~i~~~~-------~k~~~~i~iS~ell 224 (402) T protein:vir:93 190 ----------------------------ITDVETAK----------ELKAKGDTVKFTT-------NKFKVFAAISDTVI 224 (402) T ss_pred ----------------------------cccccccc----------ccccccceeeecc-------eeeeeechhhHHHH Confidence 00110000 0112344444444 44444578999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHH Q lcl|NC_014661. 277 QDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLF 356 (524) Q Consensus 277 QDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~ 356 (524) +|- ..|.+++|.+-|+..|..-.|..++-.-.-+ +-+.|++.=....-+.+.. ..+....|+. T Consensus 225 ~Ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~------------g~p~g~~~~~~~~~~~~~~-~~d~l~~~~~ 287 (402) T protein:vir:93 225 HGS----DVDLVNWVENALQSGLAAKERKDALAVSPKS------------GLEHMSFYNGSVKEVEGAD-MYDAIINALA 287 (402) T ss_pred hhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc------------cccceeeeccccccccccc-hHHHHHHHHh Confidence 985 3556889999999888876666555322211 1223333211111111111 0122333333 Q ss_pred HHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEE Q lcl|NC_014661. 357 QIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFT 436 (524) Q Consensus 357 ~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 436 (524) .+... -+..+.|++-+...+.++.-.. - +.+ .+- ...+ ++|.| ++||+..+++. ++ T Consensus 288 -------~l~~~-y~~na~~imn~~t~~~~~~~~~---d--~~~---~~~-~~~~----~~llG-~PV~~t~~~~~--i~ 343 (402) T protein:vir:93 288 -------DLHED-YRDNATIYMRYADYVKIISVLS---N--GTT---NFF-DTPA----EKVFG-KPVVFTDAAVK--PI 343 (402) T ss_pred -------ccChh-hhcCCEEEEechHHHHHHHHHh---c--CCC---ccc-ccCC----ccccc-cceEEecCCCc--ee Confidence 33222 1235666554444445443211 1 001 110 1111 25776 69999887654 34 Q ss_pred EEEecCCCccceeEeecccccccccccCcccccceeeeeeeecee-eCCcccccCCccccceeeccccchhhhhccccce Q lcl|NC_014661. 437 IGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIG-INPLADTAAQQPAGNARIANGMPSIANSVGKNGY 515 (524) Q Consensus 437 vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~ 515 (524) +|-- +-||.=|.....-+..|+.+.+-.+-...|++.. +||=+ T Consensus 344 ~GDf-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A----------------------------- 387 (402) T protein:vir:93 344 VGDF-------NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSA----------------------------- 387 (402) T ss_pred eech-------hhhhhhhhhhhhhhhhcccCCceEEEEEEEeCcEEechhh----------------------------- Confidence 4421 1122222211111122444433333333355432 23311 Q ss_pred eeeeeeecC Q lcl|NC_014661. 516 FRRVLVKGI 524 (524) Q Consensus 516 ~r~~~v~~~ 524 (524) ||.+.||.- T Consensus 388 ~~~l~ik~~ 396 (402) T protein:vir:93 388 FRIAKAKEN 396 (402) T ss_pred eEEEEeecC Confidence 222333322 No 103 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=41.61 E-value=0.92 Score=20.78 Aligned_cols=337 Identities=14% Similarity=0.114 Sum_probs=125.2 Q ss_pred cchHHHHHHhhhhhhccCCCcchhhhhh--------hhhhhhhh------hHHHHHhhhhccccchh--------hhccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEGAPEIAQGKH--------AIIAKMFE------NQEADIKSDAAYRDEKL--------AEAFG 62 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~--------~~~~~~~e------nq~~~~~~~~~~~~~~~--------~~~~~ 62 (524) |.- ++|+++|.-+.+. +.++.+-++ ..-....| ++.+.+++......+.. ..... T Consensus 1 M~~-~eL~~~~~~~~~~--~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:38 1 MNI-NQLKDAFDMAGQK--VQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVN 77 (395) T ss_pred CCH-HHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 444 3366666555321 111111000 00000011 11111111000000000 00001 Q ss_pred cccccccccc----------ccccchhhhcccc-ccccccccCcchh--hHHHHHHHhhhhhhceeeecCCCcchheeee Q lcl|NC_014661. 63 GFLTEAEIGG----------DHGYDPQNIAAGQ-TSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQGPTGQVFAL 129 (524) Q Consensus 63 ~~l~ea~~~~----------~~g~~~~~i~est-~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAM 129 (524) +...+..... .++.... .++++ ++++-...=|.-+ .+++.+.+..+..+++.++||++++|-+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~--- 153 (395) T protein:vir:38 78 KKPLPVKDGKPDAQAMKNQFVKDFKNL-VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSR--- 153 (395) T ss_pred ccccchhhhhHHHHHHHHHHHHHHHHH-HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceE--- Confidence 1111100000 0111111 11122 2221111113332 3555555677888999999999998853 Q ss_pred eeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccc Q lcl|NC_014661. 130 RAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAEL 209 (524) Q Consensus 130 RSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~ 209 (524) .|...... + +.+.| T Consensus 154 --~~~~~~~~--~---------~~a~~----------------------------------------------------- 167 (395) T protein:vir:38 154 --VYEKLADI--T---------PLKDL----------------------------------------------------- 167 (395) T ss_pred --EEEeeccC--C---------ccccc----------------------------------------------------- Confidence 11100000 0 00000 Q ss_pred cccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014661. 210 DAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADA 289 (524) Q Consensus 210 d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEa 289 (524) +++|-. ..| +....|.+..| .++.-+-...+|-||.+|- +.|-++ T Consensus 168 ---------------v~E~~~--~~~-------~~~~~f~~v~~-------~~~k~~~~~~iS~ell~ds----~~~l~~ 212 (395) T protein:vir:38 168 ---------------DDESAL--IGD-------NDDPELTVVKY-------LIHRYAGITTVTNTLLKDT----VDNIIQ 212 (395) T ss_pred ---------------cccccc--ccc-------ccccceeeEEe-------eeeeeEeehhhHHHHHhhh----HHHHHH Confidence 000100 000 01123444444 4444444556999999993 356688 Q ss_pred HHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_014661. 290 ELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQT 369 (524) Q Consensus 290 ELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T 369 (524) .|.+-|+..|..-||+.||...-+. ....|..++ +....++...... T Consensus 213 ~i~~~la~~~~~~~~~~il~g~g~~------------~~~~~~~~~-------------~~i~~~~~~~l~~-------- 259 (395) T protein:vir:38 213 WLVNWAAKKDVVTRNAKILEVMGKA------------PKKPTISQF-------------DNIKDLENNTLDP-------- 259 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhccccc------------ccccccccH-------------HHHHHHHHHhhhh-------- Confidence 8888888888888888877532211 111222221 1122222221111 Q ss_pred cccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCc-----ce-EEEE----- Q lcl|NC_014661. 370 GRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ-----DY-FTIG----- 438 (524) Q Consensus 370 ~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy-~~vG----- 438 (524) .+.....+||+|.....|....- +.|. .-+..+.+. -..++|.| ++|++....+. +. +++| T Consensus 260 ~~~~~a~~v~n~~~~~~L~~lkd-----~~G~-~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~i~~gd~~~~ 331 (395) T protein:vir:38 260 AIESTSSFITNQSGYNILSKVKD-----ADGR-YLMQPDVTS-PDKYLIDG-KPVIRIADKWLPDVSGSHPLYFGDLKQG 331 (395) T ss_pred hhcCCCEEEEcHHHHHHHHHhhc-----cCCc-eeeccCcCC-CCcceecc-ceeEEecccccCcCCCcceEEEEecccc Confidence 11134568999999999965321 1110 001111111 11246776 58887553211 11 2222 Q ss_pred ----EecCCCccceeEeecccccccccccCcccccceeeeeeeeceee-CC--cc-----cccCCccccceeeccccchh Q lcl|NC_014661. 439 ----YKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKTRYGIGI-NP--LA-----DTAAQQPAGNARIANGMPSI 506 (524) Q Consensus 439 ----~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP--~~-----~~~~~~~~~~~~~~~g~~~~ 506 (524) .+.. -.+=+.++. ..+-...+=.+-+..||+..+ +| |. ....+.++ T Consensus 332 ~~i~~~~~----~~i~~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~------------ 389 (395) T protein:vir:38 332 ITLFDRQQ----MQIDTTNVG------AGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQG------------ 389 (395) T ss_pred EEEEEecc----eEEEEeccc------cchhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCC------------ Confidence 1110 001111110 001122334555566666543 23 11 11111111 Q ss_pred hhhccc Q lcl|NC_014661. 507 ANSVGK 512 (524) Q Consensus 507 a~~~~~ 512 (524) +-..|| T Consensus 390 ~~~~~~ 395 (395) T protein:vir:38 390 TAGTGK 395 (395) T ss_pred ccCCCC Confidence 112233 No 104 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=40.01 E-value=0.99 Score=20.61 Aligned_cols=343 Identities=10% Similarity=0.026 Sum_probs=117.1 Q ss_pred cchHHHHHHhhhhhhccC--CCcchhh---hhhhhhhhhhhhHHHHHhhh------hccccchhh-hccccccccccccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAE--GAPEIAQ---GKHAIIAKMFENQEADIKSD------AAYRDEKLA-EAFGGFLTEAEIGG 72 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~--~~~~~~~---~~~~~~~~~~enq~~~~~~~------~~~~~~~~~-~~~~~~l~ea~~~~ 72 (524) |++-+++.+|..-+-..+ .+.+... ..+++...+ +-.+.+...+ +..++.... ......|.+.+ T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--- 76 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKEGATEAEQVTAFTNMA-EQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDE--- 76 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHH--- Confidence 444444444433321110 0001110 111111110 0000001000 000000000 00001111110 Q ss_pred ccccchhhhccccccccccccCcch-h-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccc Q lcl|NC_014661. 73 DHGYDPQNIAAGQTSGAVTQIGPAV-M-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMY 150 (524) Q Consensus 73 ~~g~~~~~i~est~tg~v~~~~P~L-i-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fn 150 (524) --+-...++++++++.- -.=|.- . .+++.+-..-+-.+++-+.||++....|. +.... .+ T Consensus 77 -r~~~~~~~~~~~~~~gg-~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~----~~~~~------~~------ 138 (390) T protein:vir:40 77 -SKYYNEVIAGNGFAGVT-ALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWII----SVGDV------AT------ 138 (390) T ss_pred -HHHHHHHHhccCcccCc-ccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEE----EEcCC------cc------ Confidence 00001112222211110 111221 1 23333334445677899999887554332 11000 00 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCccccccccccccccccccccccccc Q lcl|NC_014661. 151 APDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMA 230 (524) Q Consensus 151 Eadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~ 230 (524) +.|-+ ++ T Consensus 139 ---a~~~~--------------------------------------------------------------------E~-- 145 (390) T protein:vir:40 139 ---AWWGP--------------------------------------------------------------------LC-- 145 (390) T ss_pred ---eeeec--------------------------------------------------------------------cc-- Confidence 00000 00 Q ss_pred chhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh Q lcl|NC_014661. 231 TSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDW 310 (524) Q Consensus 231 Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~ 310 (524) ++. -..+...|.+..|++.|..+ ....|-||.+|-- .|.|++|.+.|+..|..-+|+.||.. T Consensus 146 ---~~~----~~~~~~~f~~i~l~~~k~~~-------~i~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~~a~l~G 207 (390) T protein:vir:40 146 ---AEI----KEVLDNGFDKIQTGMYKLSA-------YIPVCNAMLDLGP----SWLDQYVRTILGEAMALGLEAGIVNG 207 (390) T ss_pred ---ccc----CccccccceeeEeeeeeEEE-------eehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 000 00123446677777776654 3458899999863 46799999999999999999999863 Q ss_pred Hhhhhhhhhhccccccccccceeccccc--------ccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHH Q lcl|NC_014661. 311 INYSAQVGKTGQTLTVGSKAGVFDFQDP--------IDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRN 382 (524) Q Consensus 311 l~~~A~~~k~~~~~~~~~~aG~fdl~~~--------~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~ 382 (524) = | .+.|.|++.-... .......| +-.-.++..+......-.... ++.+.| ||++. T Consensus 208 ~------G-------~~~P~Gil~~~~~~~~~~~~~~~~~~~t~--~~~~~~~~~l~~~~~~~~~~~-~~~a~~-i~n~~ 270 (390) T protein:vir:40 208 S------G-------KDQPIGMMRDLNNVTAGEHPVKTATPLTD--LTPATLATKVMLPLTDNGKKS-VSDAIL-VINPA 270 (390) T ss_pred c------C-------CCccceeeeccccccccccccccccccch--hhHHHHHHHHHHHhhcchhhh-hcCceE-EEcch Confidence 1 0 0112222210000 00000000 111222222222111111111 223444 45554 Q ss_pred -HHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeeccccccccc Q lcl|NC_014661. 383 -VVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLR 461 (524) Q Consensus 383 -va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~ 461 (524) .+..|...-.+ .|.++....+.+.-+++|+++++.+.+-++.|--. .+ ++.-. ....+ T Consensus 271 t~~~~l~~~~~~-------------~d~~G~~v~~~~~~g~pvv~~~~~p~~~i~~Gd~s--~~----~i~~~-~~~~v- 329 (390) T protein:vir:40 271 DYWSKIYAATSY-------------MTPQGVWVTGILPVPLEIVQSVAVPVGKAVAGRAK--DY----FMGIG-SEQVI- 329 (390) T ss_pred hHHHHHHHHhhc-------------cCCCCccccccCCCceeEEEcCCCCCCcEEEEeec--eE----EEEee-cceEE- Confidence 45555421111 11222222223334579999998876655554321 00 00000 00001 Q ss_pred ccCccc----ccceeeeeeeece--------------------eeCCcccccCCccccceeeccccchhhhhccccc Q lcl|NC_014661. 462 GADPKN----FQPVLGFKTRYGI--------------------GINPLADTAAQQPAGNARIANGMPSIANSVGKNG 514 (524) Q Consensus 462 ~~Dp~s----~qP~~~~~tRY~l--------------------~~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~ 514 (524) .++++. -+=.+-...|++. .+.||....+.... +. +- T Consensus 330 ~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~~~~~---------~~-------~~ 390 (390) T protein:vir:40 330 RTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNNATPSE---------TP-------AE 390 (390) T ss_pred EecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCcceeeCCCCCC---------CC-------CC Confidence 111211 1111222223321 12222221111110 00 00 No 105 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=38.09 E-value=1.1 Score=20.39 Aligned_cols=345 Identities=12% Similarity=0.148 Sum_probs=122.7 Q ss_pred CCc---ccchH-----HHHHHhhhhhhccCCCcchhhhhhh--hhhhhhhhHHHHHhhhhcc----ccc---hhhhcccc Q lcl|NC_014661. 1 MST---QIKTK-----AQLVADWKPLLEAEGAPEIAQGKHA--IIAKMFENQEADIKSDAAY----RDE---KLAEAFGG 63 (524) Q Consensus 1 ~~~---~~~~~-----~~l~~kw~p~l~~~~~~~~~~~~~~--~~~~~~enq~~~~~~~~~~----~~~---~~~~~~~~ 63 (524) |.+ .++.+ +++.+++.-+-+ ++....+. -+....+.+++.+.+...- .++ ...+++.. T Consensus 14 ~~~~~~~~k~~~~~~~~~~e~~~~~l~~-----~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~ 88 (407) T protein:vir:48 14 LQRKFDDFKEKNDKRIDAIEQEKGKLAG-----EVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNKVASEHKEAFIG 88 (407) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHH Confidence 111 11110 112233332211 11111000 0011111111111110000 000 00112222 Q ss_pred cccccccccccccchhhhccccc-cccc---cccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCC Q lcl|NC_014661. 64 FLTEAEIGGDHGYDPQNIAAGQT-SGAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIA 139 (524) Q Consensus 64 ~l~ea~~~~~~g~~~~~i~est~-tg~v---~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~ 139 (524) ++-..........+.+.+..++. +|.+ ..+.+-++.+.| .+.+-.+++.+-||++++.-+. + ... T Consensus 89 ~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~---~~~~l~~~~~~~~~~~~~~~~~--~----~~~-- 157 (407) T protein:vir:48 89 FMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLK---DEVVMRQEATVITLGGSDYKKL--V----NLG-- 157 (407) T ss_pred HHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEE--E----ecC-- Confidence 22111100001111222222221 1111 122334444444 4566677888888887653321 0 000 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccc Q lcl|NC_014661. 140 AGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDA 219 (524) Q Consensus 140 ~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~ 219 (524) +. .+.|- T Consensus 158 --~~---------~a~~v-------------------------------------------------------------- 164 (407) T protein:vir:48 158 --GT---------TSGWV-------------------------------------------------------------- 164 (407) T ss_pred --Cc---------ceeee-------------------------------------------------------------- Confidence 00 00000 Q ss_pred cccccccccccchhhhcccccCCCCCcchhhcc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_014661. 220 GILVEIAEGMATSIAELQEGFNGSNNNPWNEMG-FRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATE 298 (524) Q Consensus 220 g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStE 298 (524) +++ ...++.+ -.+++++...|.-+-...+|-||.+|- ..|.+++|.+-|+.. T Consensus 165 ------~E~-----------------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ 217 (407) T protein:vir:48 165 ------GET-----------------DARPETATSKLGLIEPFMGEIYGNPQATQKMLDDA----FFNVEDWINSELALE 217 (407) T ss_pred ------ccc-----------------ccccccccccceeEEeeeeeeEeehhhHHHHHhcc----hHHHHHHHHHHHHHH Confidence 000 0011111 123444444444455568999999983 356799999999999 Q ss_pred HHHHhhHHHHhhHhhhhhhhhhccccccccccceeccccccc------------ccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_014661. 299 IMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPID------------VRGARWAGESFKALLFQIDKESAEIA 366 (524) Q Consensus 299 ImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d------------~~~~~~a~E~~r~L~~~i~~~a~~I~ 366 (524) |...+++-||.. . ..+.+.|++-...... +....-....+..| ..+-+.+. T Consensus 218 i~~~~~~a~l~G------------~-G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i----~~l~~~l~ 280 (407) T protein:vir:48 218 FAEQEEIAFTSG------------D-GSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAI----IKLIYTLR 280 (407) T ss_pred HHHHHHhhhhcc------------C-CCCccceeeecccccccccccccccccccccccccccChHHH----HHHHHhhc Confidence 999998887752 0 0012233331110000 00000000011222 22222222 Q ss_pred hhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCc-----ceEEEEEec Q lcl|NC_014661. 367 RQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ-----DYFTIGYKG 441 (524) Q Consensus 367 ~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG 441 (524) . .+-....+|+++.....|..+.- ..|.- -+..+.+.. ..++|.| ++|+++.+.|. +.|++| T Consensus 281 ~--~~~~~a~~v~n~~~~~~L~~lkD-----~~Gr~-l~~~~~~~g-~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G--- 347 (407) T protein:vir:48 281 K--AHRSGAKFMMNNSSLFAIRLLKD-----NDGNY-LWRPGIELG-QPSSLAG-YGIVENEQMPDIAADAKAIAFG--- 347 (407) T ss_pred h--hhhcCCEEEEcHHHHHHHHHhhc-----cCCce-eeccCcCCC-CCceecc-eeeEEecCcCCccCCccEEEEE--- Confidence 1 22123356899999999875431 11100 011121211 1246776 69999988653 223322 Q ss_pred CCCccceeEeecccccccccccCcccccceeeeee--eeceee-CCcccccCCccccceeeccccchhhhhccccceeee Q lcl|NC_014661. 442 DNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKT--RYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRR 518 (524) Q Consensus 442 ~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t--RY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~ 518 (524) +-. ..++. +.-..+.-..||-.-+..++|.. ||+..+ +|= -|+. T Consensus 348 d~~--~~~~i--~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~-----------------------------a~~~ 394 (407) T protein:vir:48 348 NFK--RGYTI--VDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQ-----------------------------AIKL 394 (407) T ss_pred ecc--ccEEE--EEeeceEEEeeccccCCcEEEEEEEEeccEEeccc-----------------------------ceEE Confidence 110 00000 00000111124433233344433 565432 221 1122 Q ss_pred eeeecC Q lcl|NC_014661. 519 VLVKGI 524 (524) Q Consensus 519 ~~v~~~ 524 (524) +.|+-- T Consensus 395 l~~~aa 400 (407) T protein:vir:48 395 MKIGAA 400 (407) T ss_pred EEeecc Confidence 222222 No 106 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=34.80 E-value=1.3 Score=20.02 Aligned_cols=298 Identities=10% Similarity=0.054 Sum_probs=122.2 Q ss_pred cchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccccccccccccccchhhhccc Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFLTEAEIGGDHGYDPQNIAAG 84 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~i~es 84 (524) |+..|++. ...|+...-+.+-|+ + .+. ... + T Consensus 1 ~~k~~~~~----------------~~~~~~~~~~~~~~~-----------------~-----~a~----------~~~-~ 31 (324) T protein:vir:99 1 MEQTQKLK----------------LNLQHFASNNVKPQV-----------------F-----NPD----------NVM-M 31 (324) T ss_pred CCCchHhh----------------HHHHHHHHHhhhhhh-----------------c-----ccc----------cee-c Confidence 11111000 000011111111110 0 110 000 0 Q ss_pred cccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccccccc Q lcl|NC_014661. 85 QTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHE 163 (524) Q Consensus 85 t~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~ 163 (524) +.++.. ..-+.+. .+++.+..+.+-.+++.+.||++.+.-|. ++... . .+.| T Consensus 32 ~~~~~~-lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p----~~~~~------~---------~a~~------- 84 (324) T protein:vir:99 32 HEKKDG-TLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFT----FWADK------P---------GAYW------- 84 (324) T ss_pred cCCCcc-eechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEecC------c---------ceeE------- Confidence 111110 1111121 34455556777888899999887653220 11000 0 0000 Q ss_pred cccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCC Q lcl|NC_014661. 164 VFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGS 243 (524) Q Consensus 164 ~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs 243 (524) + +| T Consensus 85 -------------------------------------------------------------v--------~E-------- 87 (324) T protein:vir:99 85 -------------------------------------------------------------V--------GE-------- 87 (324) T ss_pred -------------------------------------------------------------e--------cc-------- Confidence 0 11 Q ss_pred CCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccc Q lcl|NC_014661. 244 NNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQT 323 (524) Q Consensus 244 ~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~ 323 (524) +..+++...++++++++.|.-+---..|-||.+|-. .|.+++|.+.|+..|...+++.||..--.. T Consensus 88 -g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~--------- 153 (324) T protein:vir:99 88 -GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN--------- 153 (324) T ss_pred -CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC--------- Confidence 122344455667777777777777789999999974 467999999999999999999998532111 Q ss_pred cccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccccccc Q lcl|NC_014661. 324 LTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLAR 403 (524) Q Consensus 324 ~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~ 403 (524) ..+.|+++-....... .. ....+..|.++-+.|. ..+.....+|++|.....|....- +.|.- T Consensus 154 ---~~~~~~~~~~~~~~~~---~~---~~~~~~~i~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~l~d-----~~g~~- 216 (324) T protein:vir:99 154 ---PFGKSIAQSIEKTNKV---IK---GDFTQDNIIDLEALLE--DDELEANAFISKTQNRSLLRKIVD-----PETKE- 216 (324) T ss_pred ---ccCcccccccccccee---cc---ccCCHHHHHHHHHhhh--hccCCCCEEEEcHHHHHHHHHhhc-----CCCce- Confidence 0111222110000000 00 0011222333434433 233456788999999999975421 11111 Q ss_pred ccccccCcceEEEEecCceEEEeeCCCC--cceEEEEEecCCCccceeEeeccccccccccc---------Cccc----- Q lcl|NC_014661. 404 GLNTDTTKAVFAGILGGRYKVYIDQYAR--QDYFTIGYKGDNEMDAGIYYAPYVALTPLRGA---------DPKN----- 467 (524) Q Consensus 404 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~---------Dp~s----- 467 (524) ...+..+ ++|.| ++|++.+..+ ...+++|-.. .+++..- ....++.. |+.. T Consensus 217 -~~~~~~~----~~l~G-~PVv~~~~~~~~~~~~i~gd~~------~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:99 217 -RIYDRNS----DTLDG-LPVVNLKSSNLKRGELITGDFD------KLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred -eecCCCC----ccccc-eeEEeecCCCCCcceEEEEecc------cEEEEEe-cCcEEEEeecccccccccccccchhh Confidence 1111122 45777 5888877643 2234443221 0111110 00001111 1110 Q ss_pred c---cceeeeeeeecee-eCC--cccc---cCCccccceee Q lcl|NC_014661. 468 F---QPVLGFKTRYGIG-INP--LADT---AAQQPAGNARI 499 (524) Q Consensus 468 ~---qP~~~~~tRY~l~-~nP--~~~~---~~~~~~~~~~~ 499 (524) | +=.+=...||+.. .|| |+.- ....++....+ T Consensus 284 f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 1 1222223555532 333 1100 00000000000 No 107 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=34.21 E-value=1.3 Score=19.95 Aligned_cols=338 Identities=17% Similarity=0.165 Sum_probs=120.6 Q ss_pred CCcccchHHH---HHHh-hhhhhccCC--Ccch----hhhhhhh----hhhhhhhHHHHHhhhhccccchhhhccccccc Q lcl|NC_014661. 1 MSTQIKTKAQ---LVAD-WKPLLEAEG--APEI----AQGKHAI----IAKMFENQEADIKSDAAYRDEKLAEAFGGFLT 66 (524) Q Consensus 1 ~~~~~~~~~~---l~~k-w~p~l~~~~--~~~~----~~~~~~~----~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ 66 (524) .-.++...|. +... =.|+-..+. .+.. ...+..- ...+.+.. ..++....+... . .. . T Consensus 49 l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~----~-~~--~ 120 (428) T protein:vir:10 49 ISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQ-GNLQDAAKFASD----E-LN--D 120 (428) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhh-hhHHHHHHHhhh----h-hh--h Confidence 2222222121 1110 011111100 0000 0011100 01111110 001000000000 0 00 0 Q ss_pred ccccccccccchhhhccccccccc---cccCcchhhHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccc Q lcl|NC_014661. 67 EAEIGGDHGYDPQNIAAGQTSGAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAK 143 (524) Q Consensus 67 ea~~~~~~g~~~~~i~est~tg~v---~~~~P~Li~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~ 143 (524) ++. ...+..++++|.+ ..+.+-+|.+.| ++.+..++ |+..+++++|-+ +|.-.+. + T Consensus 121 ~~~--------~~~~~~~~~~gg~liP~~~~~~ii~~l~---~~~~l~~~-~~~~~~~~~g~~-----~~p~~~~---~- 179 (428) T protein:vir:10 121 QSV--------SMAISTAAGSGGVLIPQNIHSEVIELLR---DRTIVRKL-GARSIPLPNGNM-----SLPRLAG---G- 179 (428) T ss_pred hhH--------hhhhcccccCCccccchhHHHHHHHHHh---hhchhhhh-cceeeecCCcce-----EEEEEeC---C- Confidence 000 0001111112211 111222233332 34444554 222222333321 1110000 0 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccc Q lcl|NC_014661. 144 EAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILV 223 (524) Q Consensus 144 eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~ 223 (524) +.+ + T Consensus 180 --------~~a--------------------------------------------------------------------~ 183 (428) T protein:vir:10 180 --------ATA--------------------------------------------------------------------S 183 (428) T ss_pred --------cce--------------------------------------------------------------------e Confidence 000 0 Q ss_pred cccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHh Q lcl|NC_014661. 224 EIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEI 303 (524) Q Consensus 224 ~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEI 303 (524) -+ +| +...++...++++++...|.-+-...+|-||.+|- ..|.++.|.+.|...|...+ T Consensus 184 ~v--------~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~~~ 242 (428) T protein:vir:10 184 YT--------GE---------NQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAISVRE 242 (428) T ss_pred ee--------cc---------CccccccccceeeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHH Confidence 00 11 12234445566667777777777889999999884 24568888888888888888 Q ss_pred hHHHHhhHhhhhhhhhhccccccccccceecccccccc-----cccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEE Q lcl|NC_014661. 304 NREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDV-----RGARWAGESFKALLFQIDKESAEIARQTGRGAGNFII 378 (524) Q Consensus 304 NREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~-----~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v 378 (524) |+.||..= -+...|.|++........ ...--..+....+. .+..+...+... .+ .....| T Consensus 243 d~~~l~G~------------G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~-~~~~~v 307 (428) T protein:vir:10 243 DKAFMRDD------------GTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYL-DSIILMSMDGNS-NM-ISSGWG 307 (428) T ss_pred HHHHhccC------------CCCccccccccccccccccccccccccccHHHHHHHH-HHHHHhhhcccc-cc-ccCEEE Confidence 88887420 011123344321100000 00000112222222 222223333332 22 345667 Q ss_pred eCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcc----------------eEEEEEecC Q lcl|NC_014661. 379 ASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQD----------------YFTIGYKGD 442 (524) Q Consensus 379 ~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~vG~KG~ 442 (524) +++.....|....- +.|.- +-.+.. -|+|.| ++||++.+.|.+ ++++|..+. T Consensus 308 ~n~~~~~~L~~lkd-----~~G~~--i~~~~~----~g~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~ 375 (428) T protein:vir:10 308 MSNRTYMKLFGLRD-----GNGNK--VYPEMA----QGMLKG-YPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGN 375 (428) T ss_pred EcHHHHHHHHHhhc-----cCCce--eccCCC----CCeeec-eeeEEeccccccccCCCccceEEEEecceEEEEEecc Confidence 89999988875331 11110 111122 256777 699998876543 122333322 Q ss_pred CCccceeEeecccccccccccCcccc---cceeeeeeeeceeeC-CcccccCCccccceeeccccch Q lcl|NC_014661. 443 NEMDAGIYYAPYVALTPLRGADPKNF---QPVLGFKTRYGIGIN-PLADTAAQQPAGNARIANGMPS 505 (524) Q Consensus 443 ~~~d~g~fyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~n-P~~~~~~~~~~~~~~~~~g~~~ 505 (524) -+.+ ..+|..........-..| +=.+=...|+++.+. |= - ..+-.|..| T Consensus 376 i~i~----~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~---------a-~~~~t~~~~ 428 (428) T protein:vir:10 376 MKVD----FSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPE---------G-LVLGTGVLF 428 (428) T ss_pred eEEE----eecccccccccccccchhhcchhheeeeeeeCceeeccc---------e-EEEEeccCC Confidence 2211 122211110000000011 122335567776554 41 1 223333555 No 108 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=33.59 E-value=1.3 Score=19.88 Aligned_cols=270 Identities=14% Similarity=0.150 Sum_probs=122.9 Q ss_pred ccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCcccccccccccccccccccc Q lcl|NC_014661. 82 AAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRG 160 (524) Q Consensus 82 ~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~ 160 (524) |..++|.-..-+-|.++ .+++--+++.+.+ .|.+ ..+..++|++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~---------~~~~--------------------------~~~~~l~g~~ 45 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRF---------AQFA--------------------------DIDSTLVGQP 45 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhh---------cccc--------------------------eecccccCCC Confidence 33223333344567666 5555444444433 0101 0111222211 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhccccc Q lcl|NC_014661. 161 SHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGF 240 (524) Q Consensus 161 ~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~l 240 (524) +..- + . |.- +. .+....+.+|- T Consensus 46 G~ti--~------------------------i------P~~-------------~~--igda~~~~eg~----------- 67 (276) T protein:vir:10 46 GDTL--T------------------------F------PAF-------------VY--SGDATVVPEGQ----------- 67 (276) T ss_pred CCEE--E------------------------e------eee-------------cC--CCccccccCCC----------- Confidence 1000 0 0 000 00 00000011110 Q ss_pred CCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhh Q lcl|NC_014661. 241 NGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVH-GMDADAELANILATEIMLEINREVIDWINYSAQVGK 319 (524) Q Consensus 241 Ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvH-GLDAEaELaNILStEImlEINREii~~l~~~A~~~k 319 (524) .-...++. ..+.+++.+-|.-.=++| |+-+.. +.|.-.|..+-++.-|...++.+++..+.....- T Consensus 68 ----~i~~~~lt--~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~-- 134 (276) T protein:vir:10 68 ----KIPVDKIE--TNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT-- 134 (276) T ss_pred ----ccCccccc--cceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 01112222 234444444444322333 332222 6799999999999999999999999877643311 Q ss_pred hccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcC--Ccccccc Q lcl|NC_014661. 320 TGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASV--DTSVTPA 397 (524) Q Consensus 320 ~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~--~~~~~~~ 397 (524) . .++.+++ +.+-....++.++ -...++++|+|.+++.|... ..|...+ T Consensus 135 ------~--~~~~~t~-------------d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s 184 (276) T protein:vir:10 135 ------V--SADIGTL-------------AGLEAAIDTFDDE---------DLEPMVLFINPKDAGKLRSSASDNFTRAT 184 (276) T ss_pred ------c--cccccCH-------------HHHHHHHHHhccc---------cCcccEEEEcHHHHHHHHHhccccccccc Confidence 0 1122221 2222222222222 12578999999999999542 3443333 Q ss_pred ccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccccccccc-cCcccccceeeeee Q lcl|NC_014661. 398 AQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKT 476 (524) Q Consensus 398 a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~t 476 (524) ..+ .+...+-.+|.+.| ++|++|...|..-..+--+|.-. |+.. -+.. ++. -|++.++-.|--.. T Consensus 185 ~~g------~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gAi~-----~~~~-~~~~-vE~dRd~~~~~d~i~~~~ 250 (276) T protein:vir:10 185 ELG------DNIIVKGAFGEALG-AVIVRSKKLDEGEAILAKRGAVK-----LITK-RDFF-LETDRDPSTKTTALYSDK 250 (276) T ss_pred ccc------ccceeccccceecc-eeEEEcCCCCcceEEEEecccee-----eeec-CCce-eecccchhhcccEEEEee Confidence 221 11122335788877 79999999875432222122221 1110 0111 222 38899999998888 Q ss_pred eeceee-CCcccccCCccccceeeccccchhhhhccccc Q lcl|NC_014661. 477 RYGIGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNG 514 (524) Q Consensus 477 RY~l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~ 514 (524) +||+.. || .++.++..+-. ....+. T Consensus 251 ~y~~~~~~~---------~~vv~~t~~~~----~~~~~~ 276 (276) T protein:vir:10 251 HYVAYLYDE---------SKAVKVTKGAG----TTDSGA 276 (276) T ss_pred EEEEEEEcC---------cceEEEecCCc----CCcCCC Confidence 998753 44 23444444311 111111 No 109 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=32.01 E-value=1.5 Score=19.69 Aligned_cols=280 Identities=13% Similarity=0.063 Sum_probs=112.5 Q ss_pred hccccccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccccccccccccccc Q lcl|NC_014661. 81 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGR 159 (524) Q Consensus 81 i~est~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~ 159 (524) .+ +++|. ..-|.+. .+++.+.++.+-.+++.+.||++..- +|.-... + +.+.| T Consensus 1 ma--~~gG~--lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~-------~~p~~~~---~---------~~a~~--- 54 (298) T protein:vir:94 1 MV--LNKGT--LFDPELVTDLISKVAGKSSIARLSAQKPIPFNGE-------KVFTFTM---D---------SEIDV--- 54 (298) T ss_pred Ce--ecccc--ccChhHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEec---C---------cceEE--- Confidence 11 12222 1224443 46666677888899999999876321 1211100 0 00000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccc Q lcl|NC_014661. 160 GSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEG 239 (524) Q Consensus 160 ~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~ 239 (524) +++| .|. T Consensus 55 -----------------------------------------------------------------v~Eg-----~~~--- 61 (298) T protein:vir:94 55 -----------------------------------------------------------------VAES-----GKK--- 61 (298) T ss_pred -----------------------------------------------------------------eeCC-----ccc--- Confidence 0011 000 Q ss_pred cCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhh Q lcl|NC_014661. 240 FNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGK 319 (524) Q Consensus 240 lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k 319 (524) ..+...|.++.|...|.. -....|-||.|+--. -..+-+++|.+-|...|..+|+.-+|.....-. |. T Consensus 62 --~~~~~~f~~v~l~~~k~~-------~~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~--g~ 129 (298) T protein:vir:94 62 --THGGVTLAPQTMVPIKVE-------YGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPRL--GT 129 (298) T ss_pred --cccccceeEEEEeeeEEE-------EeeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC--Cc Confidence 001223444455544444 356788998764221 012335666666666666666666654321100 00 Q ss_pred hccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCcccccccc Q lcl|NC_014661. 320 TGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQ 399 (524) Q Consensus 320 ~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~ 399 (524) .... ....+.......... .......++.-+.++-..+... +.+...+|++|.....|..... +. T Consensus 130 ~~~~---~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~lkd-----~~ 194 (298) T protein:vir:94 130 ASAV---IGTNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQKD-----LQ 194 (298) T ss_pred cccc---ccccccccccccccc-----cccccccHHHHHHHHHHhhhhc--CCCccEEEEcHHHHHHHHHhhc-----cC Confidence 0000 000000000000000 0011112233344444443331 2356679999999999965321 11 Q ss_pred ccccccccccCcceEEEEecCceEEEeeCCCC------cceEEEEEecCCCccceeEeecccccc--cccccCccc---- Q lcl|NC_014661. 400 GLARGLNTDTTKAVFAGILGGRYKVYIDQYAR------QDYFTIGYKGDNEMDAGIYYAPYVALT--PLRGADPKN---- 467 (524) Q Consensus 400 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~------~dy~~vG~KG~~~~d~g~fyaPYv~~~--~~~~~Dp~s---- 467 (524) |. .-+..+.++. -.|+|.| ++|++++.-+ .+.+++| +-. .++.|...-.+. ..+..||+. T Consensus 195 G~-~l~~~~~~~~-~~~tl~G-~PV~~~~~v~~~~~~~~~~~~~G---dfs--~~~~~~~~~~~~~~~~~~~~~d~~~~~ 266 (298) T protein:vir:94 195 GN-ALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIG---DFA--NGFKWGYAKEVPLEVIQYGDPDNSGLD 266 (298) T ss_pred CC-eeecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEe---ecc--ceEEEEEecCceEEEeecCCCcCcchh Confidence 11 0011122211 1257877 6999888643 2233333 111 112233221211 112223321 Q ss_pred -cc-ceeee--eeeecee-eCCcccccCCccccceeecccc Q lcl|NC_014661. 468 -FQ-PVLGF--KTRYGIG-INPLADTAAQQPAGNARIANGM 503 (524) Q Consensus 468 -~q-P~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~ 503 (524) || =.++| ..|+++. .+| ..+.++.+.- T Consensus 267 ~f~~~~v~~r~~~r~~~~~~~~---------~a~~~l~~~t 298 (298) T protein:vir:94 267 LKGYNQVYIRAELFLGWGILDA---------TKFARVTEAN 298 (298) T ss_pred hhhcCcEEEEEEEEeccEeecc---------cceEEEEecC Confidence 22 12344 4567654 444 1233443311 No 110 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=31.09 E-value=1.5 Score=19.58 Aligned_cols=219 Identities=12% Similarity=0.101 Sum_probs=98.5 Q ss_pred ccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEeccc Q lcl|NC_014661. 187 TGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQ 266 (524) Q Consensus 187 tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRA 266 (524) -+ +...+..+.-+.+.|.+...++| ..-+..+|+++ ..+++.|-+. T Consensus 1 ~~-----------------~~~~Gdtit~P~~iGda~~v~eG---------------~~i~~~~l~~t--~~~atIk~~g 46 (231) T protein:vir:73 1 EN-----------------GINLANLCEYPNDIGDAADVAEG---------------GEISLDKIGTT--TKSVTIKKAA 46 (231) T ss_pred Cc-----------------cccCCceEEecccccchhhhcCC---------------CcCChhhcccc--ceeeeEeeec Confidence 00 00000111111112222111121 12234455544 3444445443 Q ss_pred ccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccch Q lcl|NC_014661. 267 LKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARW 346 (524) Q Consensus 267 LKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~ 346 (524) =.=++|=|- .|.+ +| |.-.|..+-|+..|...++.||+..+-+.+... +..++++ T Consensus 47 k~~~itD~a--~l~~-~g-Dp~~ea~~Q~~~~iA~kvD~di~~~~~~a~l~~-----------~~~~t~d---------- 101 (231) T protein:vir:73 47 KGTEITDEA--ALSG-YG-DPIGESNKQLGLSLANKVDDDLLKAAKTTSQTV-----------STKANVD---------- 101 (231) T ss_pred cceeeeHHH--Hhhc-cC-chHHHHHHHHHHHHHHhhhHHHHHhhccccccc-----------cccccHH---------- Confidence 333444332 2555 33 889999999999999999999997766544210 1112111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEe Q lcl|NC_014661. 347 AGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYI 426 (524) Q Consensus 347 a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 426 (524) .+..+..+ +.++ -....+++|+|+++..|.....+......... +.=.++ .+|.+.| ++|++ T Consensus 102 ~i~~A~~~---fgde---------~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~---~i~~~G--~iG~i~G-~~Vi~ 163 (231) T protein:vir:73 102 GVQAALDI---FNDE---------DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA---NALING--TYADVLG-AQIVR 163 (231) T ss_pred HHHHHHHH---hccc---------cccceEEEEcchHHHhhhhccchhhhhhhhcc---ceeeec--ccceEcc-eEEEE Confidence 11111111 1111 13567999999999999764433222111111 111122 4677766 89998 Q ss_pred eCCCCcceEEEEEecCCCccceeEeeccccc-----c-cccc------cCcccccceeeeeeeeceeeCCcccccCCccc Q lcl|NC_014661. 427 DQYARQDYFTIGYKGDNEMDAGIYYAPYVAL-----T-PLRG------ADPKNFQPVLGFKTRYGIGINPLADTAAQQPA 494 (524) Q Consensus 427 D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~-----~-~~~~------~Dp~s~qP~~~~~tRY~l~~nP~~~~~~~~~~ 494 (524) +...+. +..++++|+.. . ..+. -|+..+.-.+----.|++.. T Consensus 164 S~~~~~--------------~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l------------ 217 (231) T protein:vir:73 164 SKKLAE--------------GSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYL------------ 217 (231) T ss_pred cCCCCC--------------CceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEE------------ Confidence 877653 22344555320 0 0000 14444444444444444321 Q ss_pred cceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 495 GNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 495 ~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) .+.+.. =++-+||+ T Consensus 218 ---------------~~~~~v-v~~t~~g~ 231 (231) T protein:vir:73 218 ---------------YDLTKV-VNITFTGV 231 (231) T ss_pred ---------------EcCccE-EEEEeecC Confidence 111111 01334455 No 111 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=30.26 E-value=1.6 Score=19.48 Aligned_cols=346 Identities=14% Similarity=0.071 Sum_probs=121.4 Q ss_pred CCcccchHHHHHHhhhhhhccCCCcchhhhhhhhhhhhhhhHHHHHhhhhccccchhhhcccccc----c-ccc-ccccc Q lcl|NC_014661. 1 MSTQIKTKAQLVADWKPLLEAEGAPEIAQGKHAIIAKMFENQEADIKSDAAYRDEKLAEAFGGFL----T-EAE-IGGDH 74 (524) Q Consensus 1 ~~~~~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l----~-ea~-~~~~~ 74 (524) ......+. +.+++|.-+... +.+....| .+..|. ++...+...-.+.......+.-. . ++. -.+.. T Consensus 27 ~~~~~lt~-e~~~~~~~l~~e-----~~~l~~~i-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 98 (390) T protein:vir:62 27 FAGKEMTD-EAREKEERLITA-----VSDYDARI-KRGIEA-IKAIDPVTSLLSGLQGSGSGAQRSADVDDDATLRAGNL 98 (390) T ss_pred hhcccccH-HHHHHHHHHHHH-----HHHHHHHH-HHHHHH-HHHHHHHHHHHhhcccccccchhhcchHHHHHHhhhhh Confidence 11111121 133333332211 11111111 011110 00000000000000000000000 0 000 00000 Q ss_pred ccc-----hhhhccccccccccccCcchh-hHHHHHH-HhhhhhhceeeecCCCcchheeeeeeeecCccCCCccccccc Q lcl|NC_014661. 75 GYD-----PQNIAAGQTSGAVTQIGPAVM-GMVRRAI-PNLIAFDICGVQPMQGPTGQVFALRAVYGKDPIAAGAKEAFH 147 (524) Q Consensus 75 g~~-----~~~i~est~tg~v~~~~P~Li-~l~Rra~-~nLIa~DI~GVQPmTGPTGLIFAMRSrY~~q~~~~~g~eAf~ 147 (524) +.. ......+++++.-...-|.+. .++..+. ...+...++-|-||++...+-+... ... .. T Consensus 99 ~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~---~~~------~~--- 166 (390) T protein:vir:62 99 GEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVI---TGR------SS--- 166 (390) T ss_pred hhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE---cCC------cc--- Confidence 000 000011111111101111111 1111111 1223455666666554333221100 000 00 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccc Q lcl|NC_014661. 148 PMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAE 227 (524) Q Consensus 148 ~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~ 227 (524) +.| T Consensus 167 ------a~w----------------------------------------------------------------------- 169 (390) T protein:vir:62 167 ------ASI----------------------------------------------------------------------- 169 (390) T ss_pred ------eee----------------------------------------------------------------------- Confidence 000 Q ss_pred cccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|NC_014661. 228 GMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREV 307 (524) Q Consensus 228 Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREi 307 (524) .+| +..+++-.-++++++..+|..+-...+|-||.+|- .+|.+++|.+-|+..|..-+|..| T Consensus 170 -----v~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~ 231 (390) T protein:vir:62 170 -----VGE---------TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHF 231 (390) T ss_pred -----ecc---------cccccccccceeeeEeeeeeEEeehHHHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhhh Confidence 011 11233334445677777777777889999999992 467899999999999999999988 Q ss_pred HhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHH Q lcl|NC_014661. 308 IDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVL 387 (524) Q Consensus 308 i~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L 387 (524) |.. .|.|.|+++......... -.... -.--+..|+.|-+.+...-+ ..-..|+++.....| T Consensus 232 l~G---------------~G~p~Gi~~~~~~~~~~~-~~~~~-~~~~~~~l~~~~~~l~~~~~--~~a~~vmn~~~~~~L 292 (390) T protein:vir:62 232 ITG---------------TGQPRGILTDASPATATF-LATDT-DSKVSDALIDLFHEVPSAYR--ANAKYVVNDLRAAQM 292 (390) T ss_pred hcc---------------CCccccccccccccccce-ecccc-cccchHHHHHHHHhhhhhhh--cCCEEEEchHHHHHH Confidence 842 112344443221110000 00000 00011122333333322222 222568899988888 Q ss_pred hcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeecccc-cccccccCcc Q lcl|NC_014661. 388 ASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGDNEMDAGIYYAPYVA-LTPLRGADPK 466 (524) Q Consensus 388 ~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~-~~~~~~~Dp~ 466 (524) .... .+.|.- =+..+.+.. .-++|.| ++|+++.+.|.+-|++|-- +-|+-.... ....+..|+- T Consensus 293 ~~lk-----d~~g~~-l~~~~~~~g-~~~~l~G-~Pv~~~~~~p~~~i~~gd~-------s~~~i~~~~~~~v~~~~~~~ 357 (390) T protein:vir:62 293 RKLK-----DANGQY-LWQSGLTVG-APSLFNG-KVVETDDGMPADKILFADL-------SKYRVRFAGSLRVDRSVDAK 357 (390) T ss_pred HHhh-----ccCCCe-eecCCcCCC-ccceecc-cceEEecCCCCccEEEeec-------cceeEEeecceEEEeecccc Confidence 6532 111110 011111111 1136787 6999999987765554411 001110000 1111112322 Q ss_pred cccceee--eeeeecee-eCCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 467 NFQPVLG--FKTRYGIG-INPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 467 s~qP~~~--~~tRY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) .-.-.++ +..|++.. .||= . ||.+.||.= T Consensus 358 ~~~~~~~~~~~~r~d~~~~~~~----------------------------A-~~~l~~~~~ 389 (390) T protein:vir:62 358 FSTDQIVYRFLQRADGLLVDAR----------------------------G-AKVLTVTPG 389 (390) T ss_pred ccCCcEEEEEEEEeCcEeechh----------------------------h-eEEEEeecC Confidence 1122222 33455432 2221 1 222222222 No 112 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=29.69 E-value=1.6 Score=19.41 Aligned_cols=296 Identities=18% Similarity=0.227 Sum_probs=116.2 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccc-cc Q lcl|NC_014661. 149 MYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEI-AE 227 (524) Q Consensus 149 fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~-g~ 227 (524) |-..+ ..+...+.-+..+.+.... .+++ ...+ ...++.. ..+....+ .+ T Consensus 1 m~~~~---------~~~~~dp~LT~~A~gy~n~-~~Ia--d~lf---P~vpV~~---------------~~~k~~~f~~e 50 (307) T protein:vir:79 1 MGRLS---------KLRIVDPVLTNLAIGYTNA-EFIG--QTLM---PVVEVEK---------------EGGKIPKFGKE 50 (307) T ss_pred CCCCC---------CCcccCHHHHHHHhhccch-hhhh--hhcC---Ccccccc---------------cccceeeeccc Confidence 10000 0111111122222221110 1110 0000 0001000 00111111 12 Q ss_pred cccchhhhcccccCCCCCcchhhcce-EEEEEEEEEecccccchhhHHHHHHH--HhhcCCChHHHHHHHHHHHHHHHhh Q lcl|NC_014661. 228 GMATSIAELQEGFNGSNNNPWNEMGF-RIDKQVIEAKSRQLKAQYSIELAQDL--RAVHGMDADAELANILATEIMLEIN 304 (524) Q Consensus 228 Gm~Ts~aE~l~~lGgs~~~~f~EMsF-sIEK~TVtAKSRALKAEYT~ELAQDL--KAvHGLDAEaELaNILStEImlEIN 304 (524) ++....-+ -+.+ ...+++-| .++..++..+-.+| |..-|- .+..++|-|+--..-|...|++..= T Consensus 51 ~f~~~~t~--ra~~----~~~~~v~~~~~~~~~~~~~~~~l------~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E 118 (307) T protein:vir:79 51 SFRLYQTE--RALR----AKSNRMNPEDIDSVDVNLDEHDL------EYPIDYREDQESAFPLEQAAVQTATDAIQLRRE 118 (307) T ss_pred cccccccc--cccC----CCcceeeeeccccccccccccch------hhcccchhcCCCCCCHHHHHHHHHHHHHHhHHH Confidence 22222211 1122 22344444 23433333333333 322222 2334667777666667666666543 Q ss_pred HHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHH Q lcl|NC_014661. 305 REVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVV 384 (524) Q Consensus 305 REii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va 384 (524) .++-+.+.. +..+...+. . .|+. . ..|+-. -..-+-.|++.-..|.+.+++ ..|.+|.+++|. T Consensus 119 ~~~A~l~~~-~~~y~~~~k-------~--tLsg-t----~~Wsd~-~sDPi~di~~~~~ai~~~~g~-~Pn~~vlg~~a~ 181 (307) T protein:vir:79 119 KMIADLSQN-PSSYAAGNK-------K--QLSA-T----EKFTAA-NSDPVGVIEDGKEAIRTKIGR-RPNTMVIGASAY 181 (307) T ss_pred HHHHHHhcc-ccccCCCce-------E--EEcc-C----cccCCC-CCCcHHHHHHHHHHHHHhhCC-ccceEEeCHHHH Confidence 333333322 222221111 1 1110 1 134331 245566788888889998998 899999999999 Q ss_pred HHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeC--CCCcceEEEEEecCC--CccceeEeecccccccc Q lcl|NC_014661. 385 NVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQ--YARQDYFTIGYKGDN--EMDAGIYYAPYVALTPL 460 (524) Q Consensus 385 ~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~~dy~~vG~KG~~--~~d~g~fyaPYv~~~~~ 460 (524) .+|..++-+.-.=-... .|. .|...++-.|+-. +|+|.. |... |+.. -+...+..+ |++-..- T Consensus 182 ~~l~~h~~i~~~lk~~~-~g~---it~~~la~l~~v~-~V~vg~a~y~~~-------~~~~~~iw~~~~~l~-y~~~~~~ 248 (307) T protein:vir:79 182 KTLKAHPQLIEKIKYSM-KGI---VTVDLLKEIFEVE-NIAVGEAIYADD-------KDRFTDIWGANIVLA-YVPLQRG 248 (307) T ss_pred HHHhcCHHHHHHhcCcc-ccc---cCHHHHHHHhCce-eEEEeeeeeecc-------cccchhcCCCceEEE-ecccccC Confidence 99988776543211111 111 1222222222222 333322 2111 1211 133455666 6654331 Q ss_pred cccCcccccceeeeeeeeceeeCCcccccCC-ccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 461 RGADPKNFQPVLGFKTRYGIGINPLADTAAQ-QPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 461 ~~~Dp~s~qP~~~~~tRY~l~~nP~~~~~~~-~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) .-+|+.+.|..|+..|+. -+|+.+.... ....+.|.++-..-. -....-. .+++|. T Consensus 249 -~~~~~~~~ps~Gyt~~~~--g~~~~d~~~~~~~~~~vrv~~~~~~~-i~~~~~G----~li~~~ 305 (307) T protein:vir:79 249 -GQQRTPYEPSYGYTLRKK--GNPVVDTRIEDGKLELVRATDIFRPY-LLGADAG----YLISGI 305 (307) T ss_pred -CCCCcccccccceeEEec--CceEEecccCCCceeEEeecccccce-eeccccc----hhhccC Confidence 347888999999999975 4665543321 111112222210000 0000000 123333 No 113 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=27.71 E-value=1.8 Score=19.16 Aligned_cols=203 Identities=14% Similarity=0.167 Sum_probs=102.7 Q ss_pred EEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceec Q lcl|NC_014661. 255 IDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFD 334 (524) Q Consensus 255 IEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fd 334 (524) ||=. |-|..=++-.-+-++ | +|-..|.+.=...+++.++.+-|++.+...|..-.. .+ ...|..| T Consensus 1 iD~l--------L~a~~~VdDiD~aqa-~-~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p---~~--~~~~g~~ 65 (221) T protein:vir:17 1 MDDL--------LVASQFVYDLDEILA-Q-WNTRSEISKQIGEALAIHYDERIARVLASASIAAAP---VT--GQDGGFS 65 (221) T ss_pred CCcc--------hhHHHHHHhHHHHHh-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCc---cc--ccccCcc Confidence 2211 223333333444444 4 788889999999999999999998888766642111 00 0011111 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHH-HHHHHhcCCcccccc-ccccccccccccCcc Q lcl|NC_014661. 335 FQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRN-VVNVLASVDTSVTPA-AQGLARGLNTDTTKA 412 (524) Q Consensus 335 l~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~-va~~L~~~~~~~~~~-a~~~~~~~~~d~~~~ 412 (524) . ....+.. .....||..|-+.+...-.+-=--.|-|+|++|+ ...+|+..+.+.... ..+ ...+..+. T Consensus 66 ~---~~~a~~t---~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~----s~g~~~~g 135 (221) T protein:vir:17 66 V---NIGAGNT---NNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGN----TQGDMNTG 135 (221) T ss_pred e---ecccccc---CCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeeccc----cccccccc Confidence 1 1001100 0112333333333333333333336789999995 677776433321111 111 11111111 Q ss_pred eEEEEecCceEEEeeCCCCc----ceEE------------EEEecCCCccceeEeecccccccccccCcccccceeeeee Q lcl|NC_014661. 413 VFAGILGGRYKVYIDQYARQ----DYFT------------IGYKGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKT 476 (524) Q Consensus 413 ~~~G~l~~~~~vy~D~y~~~----dy~~------------vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t 476 (524) ..+|.+.| ++||.=++.|. +|.. =.|.|+-.-..||||.|=.-+ -++.+.|-|--|.+.-| T Consensus 136 ~~i~~v~G-~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avg-tvkl~~~~~~~~~~~~~- 212 (221) T protein:vir:17 136 KGLYVNAG-IRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAAD-TVEVLLPPSRPPLVISM- 212 (221) T ss_pred ceeeeecC-cEEEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchhee-eeeeecCCCCCceeeee- Confidence 24677886 99999999876 3321 134455555679999987443 34677888877754322 Q ss_pred eeceeeCCcccccCCcccccee Q lcl|NC_014661. 477 RYGIGINPLADTAAQQPAGNAR 498 (524) Q Consensus 477 RY~l~~nP~~~~~~~~~~~~~~ 498 (524) |.-...+. | T Consensus 213 --------~~~~~~~~-----~ 221 (221) T protein:vir:17 213 --------FSIRRPDR-----R 221 (221) T ss_pred --------eeccCCCC-----C Confidence 21111111 0 No 114 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=27.44 E-value=1.8 Score=19.13 Aligned_cols=333 Identities=16% Similarity=0.118 Sum_probs=114.0 Q ss_pred cchHHHHHHhhhhhhcc---------------CCC-cchhhhhh---hhhhhh--hhhHHHHHhhhh-ccc--------- Q lcl|NC_014661. 5 IKTKAQLVADWKPLLEA---------------EGA-PEIAQGKH---AIIAKM--FENQEADIKSDA-AYR--------- 53 (524) Q Consensus 5 ~~~~~~l~~kw~p~l~~---------------~~~-~~~~~~~~---~~~~~~--~enq~~~~~~~~-~~~--------- 53 (524) |++-++|+++|.-+.+. +.. .++...+. .+-+++ |+.|...+..+. ... T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 55544444444433221 000 11221111 111111 233322221111 000 Q ss_pred ---cchhhhcccccccccccccccccc--------hhhhccccccccccccCcchh--hHHHHHHHhhhhhhceeeecCC Q lcl|NC_014661. 54 ---DEKLAEAFGGFLTEAEIGGDHGYD--------PQNIAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMQ 120 (524) Q Consensus 54 ---~~~~~~~~~~~l~ea~~~~~~g~~--------~~~i~est~tg~v~~~~P~Li--~l~Rra~~nLIa~DI~GVQPmT 120 (524) .+....++..++-.. ..+..+.. ...+.+++.+..-. .=|.=+ .++++...+-+-.+++.|.|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~~~~~~al~~~t~s~gG~-~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~ 158 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHA-ILPNEFEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFAKNQLREKARLTNIK 158 (387) T ss_pred cchhhHHHHHHHHHHHHH-hhhhhhhhhhhhhHHHHHhhccCcCCCCce-eechhHHHHHHHHHHhhchhhhheeeeecC Confidence 000011111111100 11111110 00111112111100 012211 2333333344556778887775 Q ss_pred CcchheeeeeeeecCccCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 121 GPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTL 200 (524) Q Consensus 121 GPTGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~ 200 (524) +.+. . +-.+.. . ...| T Consensus 159 ~~~~--p--~~~~~~-------~---------~a~~-------------------------------------------- 174 (387) T protein:vir:93 159 GLEI--P--RVSYTL-------D---------DDDF-------------------------------------------- 174 (387) T ss_pred CceE--E--EEeecC-------C---------cccc-------------------------------------------- Confidence 4321 0 000000 0 0000 Q ss_pred CCCCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014661. 201 ATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLR 280 (524) Q Consensus 201 a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLK 280 (524) +++| ...++...+++.++..++.-+-...+|-||.||- T Consensus 175 ------------------------v~E~-----------------~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds- 212 (387) T protein:vir:93 175 ------------------------ITDV-----------------ETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGS- 212 (387) T ss_pred ------------------------ccCc-----------------ccccccccccceeeeeheeeeeechhhHHHHhhh- Confidence 0000 0011112233444555555555788999999984 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHH Q lcl|NC_014661. 281 AVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDK 360 (524) Q Consensus 281 AvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~ 360 (524) ..|.|++|.+-|+..|..-.|..++-.-.-+ +-+.|++.-.....+.+ -.++-.|+. T Consensus 213 ---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~------------g~p~g~l~~~~~~~v~~--------~~~~d~i~~ 269 (387) T protein:vir:93 213 ---DVDLVNWVENALQSGLAAKERKDALAVSPKS------------GLDHMSFYNGSVKEVEG--------ADMYDAIIN 269 (387) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc------------cccceeeeccccccccc--------cchHHHHHH Confidence 3456888888888888766566555222111 12233332111111111 112223333 Q ss_pred HHHHHHhhccccCccEEEeCHHH-HHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEE Q lcl|NC_014661. 361 ESAEIARQTGRGAGNFIIASRNV-VNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGY 439 (524) Q Consensus 361 ~a~~I~~~T~rg~gn~~v~S~~v-a~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 439 (524) +-+.+...=+ ..+.|+ +++.. ..+|.-.. - +.+.. + ...+ .+|.| ++||+..+++. +++|- T Consensus 270 ~~~~l~~~~~-~~a~~~-mn~~t~~~~~~~~~---d--~~~~~--~--~~~~----~~llG-~PV~~~~~~~~--~~~GD 331 (387) T protein:vir:93 270 ALADLHEDYR-DNATIY-MRYADYVKIISVLS---N--GTTNF--F--DTPA----EKVFG-KPVVFTDAAVK--PIVGD 331 (387) T ss_pred HHhccChhhh-cCCEEE-EechHHHHHHHHHh---c--CCCcc--c--ccCC----ccccc-cceEEecCCCc--eeeee Confidence 3333333222 356565 45444 44443211 0 00000 1 0111 25776 59998776643 33442 Q ss_pred ecCCCccceeEeecccccccccccCcccccceeeeee--eecee-eCCcccccCCccccceeeccccchhhhhcccccee Q lcl|NC_014661. 440 KGDNEMDAGIYYAPYVALTPLRGADPKNFQPVLGFKT--RYGIG-INPLADTAAQQPAGNARIANGMPSIANSVGKNGYF 516 (524) Q Consensus 440 KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t--RY~l~-~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~ 516 (524) - +-||-=|.. +....+.......++|.. ||+.. .+|= -| T Consensus 332 f-------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~r~d~~v~~~e-----------------------------A~ 373 (387) T protein:vir:93 332 F-------NYFGINYDG--TTYDTDKDVKKGEYLFVLTAWYDQQRTLDS-----------------------------AF 373 (387) T ss_pred h-------hhhheehhh--heeeecccccCCceeEEEEeeeCceeechh-----------------------------he Confidence 1 111211111 111112223344556655 44332 2231 12 Q ss_pred eeeeeecC Q lcl|NC_014661. 517 RRVLVKGI 524 (524) Q Consensus 517 r~~~v~~~ 524 (524) |.+.||-= T Consensus 374 ~~l~~k~~ 381 (387) T protein:vir:93 374 RIAKAKEN 381 (387) T ss_pred EEEEeecC Confidence 22222221 No 115 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=24.30 E-value=2.2 Score=18.71 Aligned_cols=340 Identities=16% Similarity=0.082 Sum_probs=113.7 Q ss_pred CCcccch-HHHHHHhh---hhhhccCC------C---------cchhhh--hhhhhhhhhhhHHHHHhhhhccccchhh- Q lcl|NC_014661. 1 MSTQIKT-KAQLVADW---KPLLEAEG------A---------PEIAQG--KHAIIAKMFENQEADIKSDAAYRDEKLA- 58 (524) Q Consensus 1 ~~~~~~~-~~~l~~kw---~p~l~~~~------~---------~~~~~~--~~~~~~~~~enq~~~~~~~~~~~~~~~~- 58 (524) +.+++.. ++++.+.+ .-.++..- . -++... ...-...+.+.+.+...+.......... T Consensus 48 ~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 127 (437) T protein:vir:10 48 KEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQD 127 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhH Confidence 1111110 00111111 11111000 0 000000 0011112222222211111110000000 Q ss_pred --------------hcccccccccccccccccchhhhcccc-ccccccccCcchh-hHHHHHHHhhhhhhceeeecCCCc Q lcl|NC_014661. 59 --------------EAFGGFLTEAEIGGDHGYDPQNIAAGQ-TSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMQGP 122 (524) Q Consensus 59 --------------~~~~~~l~ea~~~~~~g~~~~~i~est-~tg~v~~~~P~Li-~l~Rra~~nLIa~DI~GVQPmTGP 122 (524) ..+...+.+.+ ......++ ..+.+ .-|.-+ ..++.........+++.|.||+.+ T Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~~e--------~~~~~~~~~~~~g~--lvp~~~~~~i~~~~~~~~l~~~~~~~~~~~~ 197 (437) T protein:vir:10 128 MKLKVGGEIADKKVTAFADYLKTGE--------VRDVTGIALKDGKV--IIPETILTPEKEVHQFPRLGSLVRTESVTTT 197 (437) T ss_pred HHHHHHHHHHHhhhhhhHHHHHhhh--------hhhhhhcccccccc--cchHHHHHHHHHhhhhhhhhhcceeEeeccC Confidence 00111111100 00001111 11111 012111 112211122234556777777766 Q ss_pred chheeeeeeeecCccCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCC Q lcl|NC_014661. 123 TGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLAT 202 (524) Q Consensus 123 TGLIFAMRSrY~~q~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~ 202 (524) .+-+--++..- + ...+ T Consensus 198 ~~~~~~~~~~~--------~----------~~~~---------------------------------------------- 213 (437) T protein:vir:10 198 TGKLPIFNNST--------D----------LLTA---------------------------------------------- 213 (437) T ss_pred ceeeEEeeccc--------c----------cccc---------------------------------------------- Confidence 54321111000 0 0000 Q ss_pred CCCcccccccccccccccccccccccccchhhhcccccCCCCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhh Q lcl|NC_014661. 203 TADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAV 282 (524) Q Consensus 203 ~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAv 282 (524) ++++- ...| .+...|.++.|.+.|+. --..+|-||.+|- T Consensus 214 ----------------------~~e~~--~~~e-------~~~~~~~~v~~~~~k~~-------~~~~is~ell~ds--- 252 (437) T protein:vir:10 214 ----------------------HTEYG--QTTK-------NATPVITPILWDLKTYT-------GGYVFSQELISDS--- 252 (437) T ss_pred ----------------------ccccc--cccc-------cccccceeeeeehhhee-------eehhhhHHHHhhh--- Confidence 00000 0000 11234555555555554 4567899999984 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccceecccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_014661. 283 HGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKES 362 (524) Q Consensus 283 HGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a 362 (524) ..|.+++|.+.|+.-|..-+|..||..+-+. .++++-. .+..+ +..++.. . T Consensus 253 -~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~-------------~~~~~~~----------~~~~~-~~~~~~~--~-- 303 (437) T protein:vir:10 253 -SYDWQAELQSRLIELRDNTDDSLIITALTDG-------------IKKTTST----------YLLGD-LKKVLNV--T-- 303 (437) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-------------ccccccc----------cchhh-HHHHHHh--h-- Confidence 3567889999999999999999988764321 1111110 00001 1111110 0 Q ss_pred HHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCcceEEEEEecC Q lcl|NC_014661. 363 AEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDYFTIGYKGD 442 (524) Q Consensus 363 ~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~ 442 (524) ....+..+-.+|+++.+...|..... +.|.- -+..+.+.. .-++|.| ++||+....... .. T Consensus 304 ----l~~~~~~~~~~~~~~~~~~~l~~lkd-----~~g~~-~~~~~~~~~-~~~~l~G-~pv~~~~~~~~~-------~~ 364 (437) T protein:vir:10 304 ----LKPQDSAAASIVMSQSAYNLFDMATD-----AMGRP-LLQPNVTAA-TGYTLLG-KTVVIVDDKLFP-------SA 364 (437) T ss_pred ----hhhhhhcCCEEEEcHHHHHHHHHhhc-----cCCCe-eeccCccCC-CCccccc-ceeEEecccccC-------Cc Confidence 11222233457999999999876421 11100 011122211 1246887 577764332100 00 Q ss_pred CCccceeEeecccc--------ccccccc-Ccccccceeeeeeeecee-eCC--cccccCCccccceeeccccchhhhhc Q lcl|NC_014661. 443 NEMDAGIYYAPYVA--------LTPLRGA-DPKNFQPVLGFKTRYGIG-INP--LADTAAQQPAGNARIANGMPSIANSV 510 (524) Q Consensus 443 ~~~d~g~fyaPYv~--------~~~~~~~-Dp~s~qP~~~~~tRY~l~-~nP--~~~~~~~~~~~~~~~~~g~~~~a~~~ 510 (524) ..-+..+||+.+-. ...++.. +-..+...+.+..||+.. ++| |..-....+ ..+-.. T Consensus 365 ~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~-----------~~~~~~ 433 (437) T protein:vir:10 365 SAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK-----------AVTVVQ 433 (437) T ss_pred CCCceEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeecc-----------ccccCC Confidence 00001122222211 1111111 334455566667788653 344 211110000 000000 Q ss_pred cccc Q lcl|NC_014661. 511 GKNG 514 (524) Q Consensus 511 ~~~~ 514 (524) -.++ T Consensus 434 ~~~~ 437 (437) T protein:vir:10 434 STAV 437 (437) T ss_pred CCCC Confidence 1111 No 116 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=23.77 E-value=2.3 Score=18.64 Aligned_cols=308 Identities=15% Similarity=0.149 Sum_probs=123.5 Q ss_pred CCCcchheeeeeeeecCc--cCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_014661. 119 MQGPTGQVFALRAVYGKD--PIAAGAKEAFHPMYAPDAMFSGRGSHEVFAPLASGTVVAQGTIYKHEFVATGTAFLQATG 196 (524) Q Consensus 119 mTGPTGLIFAMRSrY~~q--~~~~~g~eAf~~fnEadt~FSG~~~~~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g 196 (524) |+.-+|. ++++.. +...+.++..+.|- .-|+|. ....+....... +-+...+..+++.+.....| T Consensus 1 ~~~~~~~-----~~~~~~~~~~~~~~~~~~al~l---e~f~ge----V~~~f~~~s~~~-~~~~~r~i~~gks~~~~~iG 67 (345) T protein:vir:22 1 MASMTGG-----QQMGTNQGKGVVAAGDKLALFL---KVFGGE----VLTAFARTSVTT-SRHMVRSISSGKSAQFPVLG 67 (345) T ss_pred Ccccccc-----hhcccccccccccCCchhHHHH---HHHhHH----HHHHHHHHhhhc-ccceeeeccccceEEEeeec Confidence 3332221 111111 11000111111111 122321 111111111100 00000111111111111111 Q ss_pred ccccCCCCCcccccccccccccccccccccccccchhhhcccccCCC-CCcchhhcceEEEEEEEEEecccccchhhHHH Q lcl|NC_014661. 197 AVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGS-NNNPWNEMGFRIDKQVIEAKSRQLKAQYSIEL 275 (524) Q Consensus 197 ~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs-~~~~f~EMsFsIEK~TVtAKSRALKAEYT~EL 275 (524) .+. ......| +. +.++ .+....|.-++||+. |-+..-+.- T Consensus 68 ~~~----------------------~~~~~~G------~~---l~~~~~~~~~~e~~ltID~~--------~y~~~~Vdd 108 (345) T protein:vir:22 68 RTQ----------------------AAYLAPG------EN---LDDKRKDIKHTEKVITIDGL--------LTADVLIYD 108 (345) T ss_pred ceE----------------------EEeeecC------CC---CCCCCCCcccceEEEEecch--------hhhhhhHhh Confidence 111 1111111 11 1121 134567777888864 334455555 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhccccccccccce-ecccccccccccch--HHHHHH Q lcl|NC_014661. 276 AQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTLTVGSKAGV-FDFQDPIDVRGARW--AGESFK 352 (524) Q Consensus 276 AQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~~~~~~aG~-fdl~~~~d~~~~~~--a~E~~r 352 (524) .-|.++ | .|-..|++.=...++..++.+-|++.|..-|..-......-.+...|+ .+... .+..+ ...... T Consensus 109 iD~~q~-~-~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~~~~~~~~~~~----~g~~~t~~~~~~~ 182 (345) T protein:vir:22 109 IEDAMN-H-YDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGLGTATVIETTQ----NKAALTDQVALGK 182 (345) T ss_pred HHHHhc-C-chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccc----ccccccccccCHH Confidence 555555 4 799999999999999999999999988765542110000000000110 01000 00000 011223 Q ss_pred HHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccccccccCcceEEEEecCceEEEeeCCCCc Q lcl|NC_014661. 353 ALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARGLNTDTTKAVFAGILGGRYKVYIDQYARQ 432 (524) Q Consensus 353 ~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 432 (524) .+|..|-..+...-.+-=--.+-|+|++|++-.+|-.++-|....-. + ..+ ...-.+|.++| ++||.-++.|. T Consensus 183 ~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~----~-~~~-~~~G~V~~i~G-~~V~~sn~lp~ 255 (345) T protein:vir:22 183 EIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYA----A-LID-PEKGSIRNVMG-FEVVEVPHLTA 255 (345) T ss_pred HHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccc----c-ccc-cccceEEEEec-eEEEecccccc Confidence 44444444443333332223578999999999999888776543221 1 112 23346788886 89998877552 Q ss_pred c-----------------------eEEEEEecCCCccceeEeeccccccccccc--------Ccccccceeeeeeeecee Q lcl|NC_014661. 433 D-----------------------YFTIGYKGDNEMDAGIYYAPYVALTPLRGA--------DPKNFQPVLGFKTRYGIG 481 (524) Q Consensus 433 d-----------------------y~~vG~KG~~~~d~g~fyaPYv~~~~~~~~--------Dp~s~qP~~~~~tRY~l~ 481 (524) - |+.+ +.+. ..++||.|=.- ...+.+ |+..|.= .+..+|... T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~---~~~~-~~~l~~h~~A~-~~v~~~~~~~e~~r~~~~~~d--~I~~~~a~G 328 (345) T protein:vir:22 256 GGAGTAREGTTGQKHVFPANKGEGNVKV---AKDN-VIGLFMHRSAV-GTVKLRDLALERARRANFQAD--QIIAKYAMG 328 (345) T ss_pred cccCccccCcccccccccccccceeeee---ccCc-eEEEEEehhhe-eeeeeecceeeeeechhHHHH--HHHHHHhcC Confidence 1 1111 1112 25788877522 222222 3333321 223333332 Q ss_pred eCCcccccCCccccceeeccccchhhhhccccceeeeee Q lcl|NC_014661. 482 INPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVL 520 (524) Q Consensus 482 ~nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~ 520 (524) +=|+- -+-+..+++ ++. T Consensus 329 ~~vlR----Peaa~~i~~------------------~~~ 345 (345) T protein:vir:22 329 HGGLR----PEAAGAVVF------------------KVE 345 (345) T ss_pred Ccccc----cceeEEEEE------------------eeC Confidence 22210 000000110 000 No 117 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=22.77 E-value=2.4 Score=18.50 Aligned_cols=268 Identities=11% Similarity=0.052 Sum_probs=112.2 Q ss_pred ccccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCC Q lcl|NC_014661. 163 EVFAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNG 242 (524) Q Consensus 163 ~~~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGg 242 (524) -+..+- +..+ ++...... +..+........-+..... .+.... . ..|.....+.=-....+|. +.. T Consensus 1 ~~~~~~---T~l~--d~i~PEv~-~~~v~~~~~~~~~~~~~~~---~~~~l~-g-~~G~tv~iP~~~~ig~a~~---~~~ 66 (275) T protein:vir:96 1 MALENM---TKLA--NMVNPEVL-APMMQAELDKKLKFAQFAD---IDNTLV-G-QPGNTITFPAFVYSGDAKV---VPE 66 (275) T ss_pred CCCccc---chhh--hhhchHHH-HHHHHHHHHHhhhhcccce---eccccc-C-CCCCEEEeeeeccCCcccc---ccC Confidence 111110 0000 01000000 0000000000000000000 000000 0 0011111110000112221 111 Q ss_pred CCCcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhc Q lcl|NC_014661. 243 SNNNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVH-GMDADAELANILATEIMLEINREVIDWINYSAQVGKTG 321 (524) Q Consensus 243 s~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvH-GLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~ 321 (524) ...-...++.++ +.+++-|-|.-.=+++ |+-+.. +-|.-.|..+-++..|...++.+++..+...... T Consensus 67 g~~i~~~~lt~~--~~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~---- 135 (275) T protein:vir:96 67 GEEIPIDLIETK--KRQATIRKIGKGTVLT-----DEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK---- 135 (275) T ss_pred CCCcchhhcccc--eeeEEeehhccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---- Confidence 122334444444 4444445554433333 333322 4688899999999999999999998777643211 Q ss_pred cccccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCC--cccccccc Q lcl|NC_014661. 322 QTLTVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVD--TSVTPAAQ 399 (524) Q Consensus 322 ~~~~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~--~~~~~~a~ 399 (524) . ....++ .+.+-....++.++. ..+++++++|++++.|.... .|+..+.. T Consensus 136 ----~--~~~~~~-------------~d~i~dA~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~ 187 (275) T protein:vir:96 136 ----V--EADITK-------------LAGLQTAIDKFNDED---------LEPMVLFVNPLDAGKLRASATDNFTRATLL 187 (275) T ss_pred ----c--cccccC-------------HHHHHHHHHHhcccc---------CCccEEEeCHHHHHHHHhcccccccccccc Confidence 0 011122 222223333333221 25789999999999996543 34332222 Q ss_pred ccccccccccCcceEEEEecCceEEEeeCCCCcce-EEEEEecCCCccceeEeecccccccccc-cCcccccceeeeeee Q lcl|NC_014661. 400 GLARGLNTDTTKAVFAGILGGRYKVYIDQYARQDY-FTIGYKGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTR 477 (524) Q Consensus 400 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tR 477 (524) +. ..-.+-.+|.+.| ++||+|...|..= +++| +| +-.|+.. ....++. -|++.++=.|--..+ T Consensus 188 g~------~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~i~~-~g-----A~~~~~~--~~~~vE~~Rd~~~~~d~i~~~~~ 252 (275) T protein:vir:96 188 GD------NVIVKGAFGEALG-AIIVRSNKIKEGEAILAK-RG-----AVKLITK--RDFFLETERHASHKSTALFSDKH 252 (275) T ss_pred cc------cceeccccceecC-eeEEEeCCCCcceEEEEe-cc-----ceeeeec--CCcccccccchhhcCcEEEEeEE Confidence 21 1112234688877 7999999876422 2222 12 1112211 1112232 399999999999999 Q ss_pred ecee-eCCc-ccccCCcccccee Q lcl|NC_014661. 478 YGIG-INPL-ADTAAQQPAGNAR 498 (524) Q Consensus 478 Y~l~-~nP~-~~~~~~~~~~~~~ 498 (524) ||+. .||= ....+-.|+++-. T Consensus 253 y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 253 YVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred EEEEEEcCccEEEEEecccccCC Confidence 9953 4661 1111222222211 No 118 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=20.53 E-value=2.8 Score=18.17 Aligned_cols=266 Identities=11% Similarity=0.051 Sum_probs=113.1 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCcccccccccccccccccccccccccchhhhcccccCCCC Q lcl|NC_014661. 165 FAPLASGTVVAQGTIYKHEFVATGTAFLQATGAVTLATTADAAELDAEVIKQMDAGILVEIAEGMATSIAELQEGFNGSN 244 (524) Q Consensus 165 ~~~~~~g~~~a~g~~~~~~~~~tg~~~~~~~g~~~~a~~~~~~~~d~~~~~~~~~g~~~~~g~Gm~Ts~aE~l~~lGgs~ 244 (524) +++.. +.. .++....... ..+..+.....-+...+. .+..... ..|.....+.==....+|. +.... T Consensus 1 ma~~~--T~~--~d~iiPev~~-~~v~~~~~~~~~~~~~~~---~~~~l~g--~~G~ti~iP~~~~~gda~~---~~eg~ 67 (272) T protein:vir:36 1 MSKQK--TTL--ADLVNPEVLA-PIVSYELNKALRFAPLAQ---VDTTLQG--QPGNTLKFPAFTYIGDAAD---VAEGG 67 (272) T ss_pred CCCcc--eeh--hhhhchHHHH-HHHHHHHHhhhhhccccc---ccccccc--CCCCEEEEeeeccCccccc---cCCCC Confidence 11100 000 0000000000 000000000000000000 0000000 0011111111000112221 11111 Q ss_pred CcchhhcceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhHhhhhhhhhhcccc Q lcl|NC_014661. 245 NNPWNEMGFRIDKQVIEAKSRQLKAQYSIELAQDLRAVHGMDADAELANILATEIMLEINREVIDWINYSAQVGKTGQTL 324 (524) Q Consensus 245 ~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAvHGLDAEaELaNILStEImlEINREii~~l~~~A~~~k~~~~~ 324 (524) .-...++ +..+.+++-|-|+-.-++|=|.+ +.-+-|.-.|..+-++..+..+++++|+..+..... .+ T Consensus 68 ~i~~~~l--t~~~~~~~i~~~~k~~~vtD~~~----~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~-----~~- 135 (272) T protein:vir:36 68 EISLDKI--GTTTKSVTIKKAAKGTEITDEAA----LSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ-----TV- 135 (272) T ss_pred ccChhhc--CCcceeEeeehhhccccccHHHH----hhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----cc- Confidence 2223334 34555566666653223333221 223678999999999999999999999877653221 11 Q ss_pred ccccccceecccccccccccchHHHHHHHHHHHHHHHHHHHHhhccccCccEEEeCHHHHHHHhcCCccccccccccccc Q lcl|NC_014661. 325 TVGSKAGVFDFQDPIDVRGARWAGESFKALLFQIDKESAEIARQTGRGAGNFIIASRNVVNVLASVDTSVTPAAQGLARG 404 (524) Q Consensus 325 ~~~~~aG~fdl~~~~d~~~~~~a~E~~r~L~~~i~~~a~~I~~~T~rg~gn~~v~S~~va~~L~~~~~~~~~~a~~~~~~ 404 (524) .+.+++ +.+-.+..++.++. ...++++|+|.++..|.....|.....++.... T Consensus 136 -----~~~~~~-------------d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~ 188 (272) T protein:vir:36 136 -----STKANV-------------DGVQAALDIFNDED---------AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANA 188 (272) T ss_pred -----cccccH-------------HHHHHHHHHhhhcC---------CCceEEEEcHHHHHHHhcccccccccccccccc Confidence 111111 11222222332221 246799999999999987666655433222111 Q ss_pred cccccCcceEEEEecCceEEEeeCCCCcc---eEEEEE-ecCCCccceeEeecccccccccc-cCcccccceeeeeeeec Q lcl|NC_014661. 405 LNTDTTKAVFAGILGGRYKVYIDQYARQD---YFTIGY-KGDNEMDAGIYYAPYVALTPLRG-ADPKNFQPVLGFKTRYG 479 (524) Q Consensus 405 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---y~~vG~-KG~~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~ 479 (524) ..+-.+|.+.| ++|++|...|.+ |..+.+ +|. ..+|.. ....++. -|+..++=.+--.-+|| T Consensus 189 -----~~~G~ig~~~G-~~Vv~s~~~p~~~~~~~~~~~~~gA-----~~~~~~--~~~~vE~~R~~~~~~d~i~~~~~y~ 255 (272) T protein:vir:36 189 -----LINGTYADVLG-AQIVRSKKLAEGSALMFKIVSNSPA-----LKLVLK--RGVQVETDRDIVTKTTVITADEHYA 255 (272) T ss_pred -----eeeeccceecC-eeEEEeCCCCCCceeEEEEEecccc-----eeeeec--CCcccccccchhhcCcEEEEEEEEE Confidence 11123578877 899999997654 211211 121 112211 1111222 38889998888888888 Q ss_pred eee-CCcccccCCccccceeeccccchhhhhccccceeeeeeeecC Q lcl|NC_014661. 480 IGI-NPLADTAAQQPAGNARIANGMPSIANSVGKNGYFRRVLVKGI 524 (524) Q Consensus 480 l~~-nP~~~~~~~~~~~~~~~~~g~~~~a~~~~~~~~~r~~~v~~~ 524 (524) +.+ || ..+.++ -.||+ T Consensus 256 ~~v~~~---------~~vv~~--------------------t~~g~ 272 (272) T protein:vir:36 256 AYLYDL---------TKVVNI--------------------TFTGV 272 (272) T ss_pred EEEEcC---------ccEEEE--------------------eecCC Confidence 754 55 112222 22222 Done!