Query lcl|NC_014036.1_cdsid_YP_003580059.1 [gene=183] [protein=gp23 major capsid protein] [protein_id=YP_003580059.1] [location=132689..134257] Match_columns 522 No_of_seqs 172 out of 420 Neff 5.2 Searched_HMMs 1612 Date Thu Nov 7 15:25:56 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_202 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_202_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:98143 Length: 524 100.0 3E-261 2E-264 1448.6 39.3 520 1-522 1-524 (524) 2 protein:vir:80986 Length: 528 100.0 8E-259 5E-262 1435.6 38.9 521 1-522 1-528 (528) 3 protein:vir:6901 Length: 522 # 100.0 3E-254 2E-257 1410.5 38.5 518 1-522 4-522 (522) 4 protein:vir:100603 Length: 529 100.0 3E-254 2E-257 1410.2 38.2 521 1-522 1-529 (529) 5 protein:vir:6601 Length: 528 # 100.0 9E-254 5E-257 1407.9 38.9 521 1-522 1-528 (528) 6 protein:vir:107947 Length: 519 100.0 9E-251 6E-254 1391.4 39.1 518 1-522 1-519 (519) 7 protein:vir:101811 Length: 529 100.0 1E-250 7E-254 1391.0 39.2 521 1-522 1-529 (529) 8 protein:vir:101039 Length: 529 100.0 1E-250 8E-254 1390.7 38.4 520 1-522 3-529 (529) 9 protein:vir:103463 Length: 521 100.0 1E-250 8E-254 1390.6 38.1 517 1-522 3-521 (521) 10 protein:vir:7214 Length: 521 # 100.0 1E-249 7E-253 1385.4 38.2 517 1-522 3-521 (521) 11 protein:vir:106286 Length: 534 100.0 2E-247 1E-250 1373.3 38.9 518 1-522 1-534 (534) 12 protein:vir:5670 Length: 514 # 100.0 3E-238 2E-241 1323.1 37.0 510 5-522 1-514 (514) 13 protein:vir:104915 Length: 470 100.0 2E-221 1E-224 1230.7 35.5 460 1-522 3-469 (470) 14 protein:vir:106998 Length: 468 100.0 7E-220 4E-223 1222.1 35.5 460 1-522 1-467 (468) 15 protein:vir:104549 Length: 462 100.0 2E-216 1E-219 1203.3 36.0 454 1-522 1-461 (462) 16 protein:vir:103181 Length: 457 100.0 3E-213 2E-216 1185.7 35.2 449 1-522 1-456 (457) 17 protein:vir:5942 Length: 523 # 100.0 4E-194 3E-197 1080.8 32.0 444 1-503 1-523 (523) 18 protein:vir:191 Length: 385 # 96.9 0.0003 1.9E-07 39.9 20.7 348 1-502 1-385 (385) 19 protein:vir:1886 Length: 385 # 96.9 0.0003 1.9E-07 39.9 20.7 348 1-502 1-385 (385) 20 protein:vir:4953 Length: 397 # 96.3 0.00079 4.9E-07 37.6 20.0 331 1-508 1-397 (397) 21 protein:vir:81227 Length: 413 95.8 0.0014 8.8E-07 36.2 18.6 352 1-522 1-410 (413) 22 protein:vir:81100 Length: 415 95.6 0.0018 1.1E-06 35.6 15.2 357 1-508 1-415 (415) 23 protein:vir:98339 Length: 415 95.6 0.0018 1.1E-06 35.6 15.2 357 1-508 1-415 (415) 24 protein:vir:79987 Length: 415 95.6 0.0018 1.1E-06 35.6 15.2 357 1-508 1-415 (415) 25 protein:vir:9410 Length: 415 # 95.2 0.0026 1.6E-06 34.7 16.0 363 1-508 1-415 (415) 26 protein:vir:4997 Length: 397 # 94.5 0.0044 2.7E-06 33.5 20.7 334 1-502 1-397 (397) 27 protein:vir:41 Length: 299 # N 94.4 0.0044 2.7E-06 33.5 18.7 275 72-497 1-299 (299) 28 protein:vir:4830 Length: 397 # 92.2 0.013 7.8E-06 31.0 19.6 333 1-510 1-397 (397) 29 protein:vir:100135 Length: 418 92.2 0.013 7.8E-06 31.0 17.5 352 1-509 21-418 (418) 30 protein:vir:7409 Length: 408 # 92.1 0.013 8.1E-06 30.9 17.1 342 1-508 4-408 (408) 31 protein:vir:9820 Length: 272 # 91.8 0.014 8.8E-06 30.7 15.0 267 147-511 1-272 (272) 32 protein:vir:3033 Length: 272 # 91.8 0.014 8.8E-06 30.7 15.0 267 147-511 1-272 (272) 33 protein:vir:9759 Length: 303 # 90.8 0.019 1.2E-05 30.0 15.6 283 79-498 1-303 (303) 34 protein:vir:10364 Length: 390 90.7 0.02 1.2E-05 30.0 19.3 343 1-501 19-390 (390) 35 protein:vir:78223 Length: 333 89.5 0.026 1.6E-05 29.3 13.9 302 144-494 1-333 (333) 36 protein:vir:2504 Length: 305 # 89.1 0.028 1.7E-05 29.1 18.2 284 80-502 1-305 (305) 37 protein:vir:9574 Length: 300 # 89.0 0.029 1.8E-05 29.0 18.2 282 79-521 1-300 (300) 38 protein:vir:4600 Length: 415 # 88.8 0.03 1.9E-05 28.9 17.6 359 1-508 1-415 (415) 39 protein:vir:4700 Length: 415 # 88.8 0.03 1.9E-05 28.9 17.6 359 1-508 1-415 (415) 40 protein:vir:4856 Length: 293 # 88.8 0.03 1.9E-05 28.9 17.0 258 55-505 1-293 (293) 41 protein:vir:1268 Length: 397 # 88.7 0.031 1.9E-05 28.9 15.9 338 1-520 5-397 (397) 42 protein:vir:81070 Length: 390 88.5 0.032 2E-05 28.8 19.8 344 1-501 1-390 (390) 43 protein:vir:104256 Length: 458 88.4 0.033 2E-05 28.7 17.3 349 1-496 81-458 (458) 44 protein:vir:8420 Length: 477 # 87.0 0.042 2.6E-05 28.1 19.6 370 1-508 16-477 (477) 45 protein:vir:3870 Length: 400 # 86.5 0.045 2.8E-05 28.0 14.7 325 1-496 10-400 (400) 46 protein:vir:101650 Length: 497 86.3 0.047 2.9E-05 27.9 21.1 371 1-502 53-497 (497) 47 protein:vir:7855 Length: 497 # 86.3 0.047 2.9E-05 27.9 21.1 371 1-502 53-497 (497) 48 protein:vir:4339 Length: 395 # 86.2 0.047 2.9E-05 27.9 19.1 351 1-496 1-395 (395) 49 protein:vir:1638 Length: 298 # 86.2 0.047 2.9E-05 27.9 15.8 280 79-502 1-298 (298) 50 protein:vir:81160 Length: 371 86.0 0.049 3E-05 27.8 19.9 327 1-496 22-371 (371) 51 protein:vir:8187 Length: 311 # 85.7 0.051 3.2E-05 27.7 18.5 286 80-496 1-311 (311) 52 protein:vir:104085 Length: 320 84.0 0.064 4E-05 27.1 17.1 294 44-496 1-320 (320) 53 protein:vir:7771 Length: 330 # 84.0 0.064 4E-05 27.1 20.1 312 59-522 1-323 (330) 54 protein:vir:96123 Length: 274 82.4 0.077 4.8E-05 26.7 15.7 270 165-499 1-274 (274) 55 protein:vir:2344 Length: 397 # 81.7 0.084 5.2E-05 26.5 19.9 301 72-522 1-329 (397) 56 protein:vir:95763 Length: 297 80.5 0.095 5.9E-05 26.2 16.1 277 69-504 1-297 (297) 57 protein:vir:78523 Length: 338 79.4 0.11 6.5E-05 25.9 19.9 310 60-506 1-338 (338) 58 protein:vir:6242 Length: 390 # 76.0 0.14 8.7E-05 25.3 14.3 349 1-522 4-389 (390) 59 protein:vir:105905 Length: 304 75.7 0.14 8.9E-05 25.2 12.4 274 134-502 1-304 (304) 60 protein:vir:94142 Length: 304 75.7 0.14 8.9E-05 25.2 12.4 274 134-502 1-304 (304) 61 protein:vir:99920 Length: 311 74.9 0.15 9.5E-05 25.0 18.4 285 79-503 1-311 (311) 62 protein:vir:101607 Length: 379 73.2 0.17 0.00011 24.8 19.8 336 1-522 1-379 (379) 63 protein:vir:97148 Length: 324 71.8 0.19 0.00012 24.5 17.9 295 45-498 1-324 (324) 64 protein:vir:1025 Length: 408 # 71.5 0.19 0.00012 24.5 15.8 332 1-504 5-408 (408) 65 protein:vir:97053 Length: 390 71.3 0.2 0.00012 24.4 19.8 348 1-501 1-390 (390) 66 protein:vir:93742 Length: 274 69.8 0.22 0.00013 24.2 16.0 270 155-513 1-274 (274) 67 protein:vir:95898 Length: 274 69.8 0.22 0.00013 24.2 13.2 266 174-507 1-274 (274) 68 protein:vir:96262 Length: 274 69.8 0.22 0.00013 24.2 13.2 266 174-507 1-274 (274) 69 protein:vir:96392 Length: 324 69.0 0.23 0.00014 24.1 16.2 300 15-503 1-324 (324) 70 protein:vir:78830 Length: 324 69.0 0.23 0.00014 24.1 16.2 300 15-503 1-324 (324) 71 protein:vir:3991 Length: 404 # 68.8 0.23 0.00014 24.1 17.7 349 1-510 1-404 (404) 72 protein:vir:6212 Length: 434 # 66.3 0.27 0.00017 23.7 18.5 347 1-502 39-434 (434) 73 protein:vir:1433 Length: 435 # 65.7 0.28 0.00017 23.6 16.0 343 1-506 30-435 (435) 74 protein:vir:739 Length: 231 # 64.9 0.29 0.00018 23.5 13.9 218 215-522 1-231 (231) 75 protein:vir:100247 Length: 425 63.3 0.32 0.0002 23.3 18.9 344 1-497 64-425 (425) 76 protein:vir:96762 Length: 632 58.9 0.4 0.00025 22.7 19.7 333 1-493 260-632 (632) 77 protein:vir:80684 Length: 315 58.4 0.41 0.00026 22.7 14.3 278 147-502 1-315 (315) 78 protein:vir:4092 Length: 390 # 57.3 0.44 0.00027 22.6 18.5 348 1-502 1-390 (390) 79 protein:vir:4226 Length: 326 # 55.8 0.47 0.00029 22.4 17.1 304 44-507 1-326 (326) 80 protein:vir:1383 Length: 421 # 52.1 0.56 0.00035 21.9 17.4 346 1-522 4-415 (421) 81 protein:vir:1781 Length: 221 # 51.4 0.58 0.00036 21.9 15.9 204 256-497 1-221 (221) 82 protein:vir:108211 Length: 318 47.4 0.7 0.00044 21.4 11.7 287 117-522 1-315 (318) 83 protein:vir:105038 Length: 428 45.8 0.76 0.00047 21.2 21.2 346 1-504 1-428 (428) 84 protein:vir:9704 Length: 394 # 43.7 0.83 0.00052 21.0 18.6 334 1-522 31-390 (394) 85 protein:vir:3613 Length: 272 # 43.6 0.84 0.00052 21.0 13.1 264 155-522 1-272 (272) 86 protein:vir:96223 Length: 324 42.6 0.88 0.00055 20.9 18.3 302 24-511 1-324 (324) 87 protein:vir:2430 Length: 318 # 40.3 0.98 0.00061 20.6 17.7 290 44-509 1-318 (318) 88 protein:vir:2685 Length: 387 # 39.4 1 0.00063 20.5 15.5 328 1-522 1-381 (387) 89 protein:vir:94424 Length: 387 39.4 1 0.00063 20.5 15.5 328 1-522 1-381 (387) 90 protein:vir:96978 Length: 387 39.4 1 0.00063 20.5 15.5 328 1-522 1-381 (387) 91 protein:vir:102119 Length: 404 37.7 1.1 0.00068 20.3 15.8 347 1-495 1-404 (404) 92 protein:vir:9309 Length: 324 # 36.8 1.2 0.00072 20.2 19.6 302 36-513 1-324 (324) 93 protein:vir:100172 Length: 394 35.2 1.2 0.00077 20.1 18.6 346 1-510 1-394 (394) 94 protein:vir:93881 Length: 387 34.8 1.3 0.00079 20.0 14.5 334 1-522 1-381 (387) 95 protein:vir:80376 Length: 435 33.8 1.3 0.00082 19.9 18.0 352 1-506 42-435 (435) 96 protein:vir:94673 Length: 419 32.8 1.4 0.00087 19.8 23.0 360 1-522 1-417 (419) 97 protein:vir:100884 Length: 389 32.0 1.5 0.0009 19.7 18.7 338 1-510 15-389 (389) 98 protein:vir:4456 Length: 401 # 31.8 1.5 0.00091 19.7 15.5 344 1-522 1-401 (401) 99 protein:vir:103955 Length: 324 31.2 1.5 0.00094 19.6 20.8 295 1-498 1-324 (324) 100 protein:vir:99749 Length: 324 29.5 1.7 0.001 19.4 19.7 296 1-498 1-324 (324) 101 protein:vir:3845 Length: 395 # 29.1 1.7 0.001 19.3 17.5 339 1-510 1-395 (395) 102 protein:vir:94771 Length: 298 26.8 1.9 0.0012 19.0 15.9 275 81-502 1-298 (298) 103 protein:vir:97433 Length: 274 25.8 2 0.0012 18.9 14.8 269 155-507 1-274 (274) 104 protein:vir:94494 Length: 274 25.8 2 0.0012 18.9 14.8 269 155-507 1-274 (274) 105 protein:vir:1328 Length: 392 # 22.0 2.5 0.0016 18.4 18.8 349 1-522 1-391 (392) 106 protein:vir:107593 Length: 392 21.8 2.5 0.0016 18.4 19.0 325 1-509 35-392 (392) 107 protein:vir:102873 Length: 392 21.8 2.5 0.0016 18.4 19.0 325 1-509 35-392 (392) 108 protein:vir:105004 Length: 392 21.8 2.5 0.0016 18.4 19.0 325 1-509 35-392 (392) 109 protein:vir:102082 Length: 392 21.8 2.5 0.0016 18.4 19.0 325 1-509 35-392 (392) No 1 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=3.3e-261 Score=1448.63 Aligned_cols=520 Identities=89% Similarity=1.309 Sum_probs=502.1 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) ||++|+|+|||+||||++||||||++.+||+|+|+|||||||+++++|.|||++++++|+.+|.||+++|+|||++.+|+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 80 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcc----cchhcccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGA----KEAFHPMFSPDSMYSG 156 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~----~eA~~~~~Eadt~fSG 156 (522) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+. +|||++++++|++||| T Consensus 81 ~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG 160 (524) T protein:vir:98 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) T ss_pred ccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCC Confidence 999999999999999999999999999999999999999999999999999998765443 7899999999999999 Q ss_pred cccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhh Q lcl|NC_014036. 157 QGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAE 236 (522) Q Consensus 157 ~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aE 236 (522) .++.. .++....+.....|....+.+...+.+..+.......+.+++++..++.........+..++++.||+|+.+| T Consensus 161 ~g~~t--~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aE 238 (524) T protein:vir:98 161 EGAHT--AFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAE 238 (524) T ss_pred ccccc--cccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhh Confidence 86543 3455666667777778888888889888888888888889999999999988889999999999999999999 Q ss_pred hccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcc Q lcl|NC_014036. 237 LQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTA 316 (522) Q Consensus 237 al~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a 316 (522) +++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++| T Consensus 239 aL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a 318 (524) T protein:vir:98 239 LQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTA 318 (524) T ss_pred hhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccc Q lcl|NC_014036. 317 QVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGIT 396 (522) Q Consensus 317 ~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~ 396 (522) |++++||++.+++++|+|||+++.|+.++||++|++|+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+++. T Consensus 319 ~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~ 398 (524) T protein:vir:98 319 QVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGIT 398 (524) T ss_pred eeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeee Q lcl|NC_014036. 397 PAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 476 (522) Q Consensus 397 ~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~ 476 (522) ++++|+++++++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||| T Consensus 399 ~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 478 (524) T protein:vir:98 399 PASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 478 (524) T ss_pred cccchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 477 TRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 477 tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) |||||++|||+++.++++++||++|+||++|||||+|||||+|||| T Consensus 479 tRY~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 479 TRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred eeeceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 9999999999999999999999999999999999999999999999 No 2 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=7.9e-259 Score=1435.58 Aligned_cols=521 Identities=70% Similarity=1.113 Sum_probs=493.2 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |+++|+|+|||+||||+ ||||+|++.|||+|+|+|||||||+|+|+|.|||++++++|+.+|.||+++|+|||++++|+ T Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 79 (528) T protein:vir:80 1 MKTTKELMEKWSPLLEN-EKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIA 79 (528) T ss_pred CcchHHHHHhhhHhhcC-CccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCcccccc Confidence 99999999999999995 78999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+||+++++|+.||+..+. T Consensus 80 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~ 159 (528) T protein:vir:80 80 AGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAK 159 (528) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCcccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999997553 Q ss_pred c-------ccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccch Q lcl|NC_014036. 161 P-------SNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATS 233 (522) Q Consensus 161 ~-------~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts 233 (522) . ...++......+...|+++++.+...+....+.........+.......+.........+..++++.||+|+ T Consensus 160 ~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta 239 (528) T protein:vir:80 160 GAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATS 239 (528) T ss_pred ccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchh Confidence 2 233455556667778888888888777777766665555544444444455556667788899999999999 Q ss_pred hhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_014036. 234 VAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMIN 313 (522) Q Consensus 234 ~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~ 313 (522) .+|.++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||||||||||++|+ T Consensus 240 ~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~ 319 (528) T protein:vir:80 240 IAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) T ss_pred hhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcc Q lcl|NC_014036. 314 YTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS 393 (522) Q Consensus 314 ~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~ 393 (522) +++++|+++|++++++++|+|||++++|++|+||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+ T Consensus 320 ~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~ 399 (528) T protein:vir:80 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) T ss_pred heeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 394 GITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 394 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) +++++..+.+..+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 400 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 479 (528) T protein:vir:80 400 GISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVL 479 (528) T ss_pred cccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCcccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+++.+|++++||+||+||+++||||+|||||+|||| T Consensus 480 g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 480 GFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999 No 3 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=3e-254 Score=1410.46 Aligned_cols=518 Identities=72% Similarity=1.139 Sum_probs=498.1 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |.++|+|+|||+||||+ ||||+|.+ +||+|+|+|||||||+|+|+|+|||++++++|+.||+||+++|+|||++++|+ T Consensus 4 ~~~~e~l~~kw~p~l~~-~~~~~~~~-~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 81 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEG-EGLPEIAN-SKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIA 81 (522) T ss_pred cchHHHHHHhhHHHhcC-CCCCcccc-chhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCccccc Confidence 66779999999999995 78999987 59999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++.++++|+|++|||+|++|||.+.. T Consensus 82 es~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~ 161 (522) T protein:vir:69 82 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAA 161 (522) T ss_pred ccccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCcccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999998554 Q ss_pred cccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccc Q lcl|NC_014036. 161 PSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQ 240 (522) Q Consensus 161 ~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~ 240 (522) . .+.........+.|+.+.+.+...+++..+.....+...++.+...++..+.+..+.+..|+++.||+|+.+|+++. T Consensus 162 t--~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~ 239 (522) T protein:vir:69 162 K--KFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEG 239 (522) T ss_pred c--cccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhccc Confidence 3 35556666667778888998999999999988888888888888778888888999999999999999999999999 Q ss_pred cCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccc Q lcl|NC_014036. 241 FNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGK 320 (522) Q Consensus 241 lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 320 (522) ||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+++||+|+ T Consensus 240 lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~ 319 (522) T protein:vir:69 240 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGK 319 (522) T ss_pred CCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc Q lcl|NC_014036. 321 TGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ 400 (522) Q Consensus 321 ~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~ 400 (522) +||++.+.+++|+|||+++.|+.++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|.+++++++ T Consensus 320 ~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~ 399 (522) T protein:vir:69 320 SGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQ 399 (522) T ss_pred cccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeec Q lcl|NC_014036. 401 GLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 480 (522) Q Consensus 401 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~ 480 (522) +.+.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||| T Consensus 400 ~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 479 (522) T protein:vir:69 400 GLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 479 (522) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCccccccCCccccccCcch-HHhhccccceeeeeeeccC Q lcl|NC_014036. 481 VGINPFANSRSQAPSDRITSGMI-TKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 481 l~~nP~~~~~~~~~~~~i~~g~~-~~~~~~~~~~~r~~~Vk~~ 522 (522) |++|||++..+|++++||+||+| |.+++|+|.|||||+|||| T Consensus 480 l~vNP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 480 IGVNPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eeecCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 99999999999999999999996 6699999999999999999 No 4 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=3.5e-254 Score=1410.15 Aligned_cols=521 Identities=74% Similarity=1.148 Sum_probs=479.0 Q ss_pred Cc-chHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhcccccccccccccccccccc Q lcl|NC_014036. 1 MS-KKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKI 79 (522) Q Consensus 1 ~~-~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~ 79 (522) |+ +.|+|+|||+||||+ ||||+|++.|||+|+|+|||||||+|+|+|.|||..++++++.+|+|++++|+|||++.+| T Consensus 1 ~~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKTKEILNKWTPLLEG-EGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred CccchHHHHHHhhHhhcC-CccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccc Confidence 54 346899999999995 8899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccccc Q lcl|NC_014036. 80 ASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGA 159 (522) Q Consensus 80 ~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~ 159 (522) +||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+++|+|++++|+|+.|||.+. T Consensus 80 a~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAA 159 (529) T ss_pred cccccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999998999999999999999999765 Q ss_pred cccc------ccccccccccccccceeecccccccceeeeeccc-cccccCCCCcccccccccccccccccccccccccc Q lcl|NC_014036. 160 APSN------GFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSG-APVTVTGSTDDALDAAVIAEQEKGTLAEISYGMAT 232 (522) Q Consensus 160 ~~~~------~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~-~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~T 232 (522) .... .........+...++.....+-..++.+.....+ .+...++.+....+.......+.+..+++++||+| T Consensus 160 ~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsT 239 (529) T protein:vir:10 160 KGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccch Confidence 4432 2333333444444444444444444444433322 22222333334445556667788889999999999 Q ss_pred hhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_014036. 233 SVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMI 312 (522) Q Consensus 233 s~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i 312 (522) +.+|+++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++| T Consensus 240 a~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i 319 (529) T protein:vir:10 240 SIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhc Q lcl|NC_014036. 313 NYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID 392 (522) Q Consensus 313 ~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~ 392 (522) +++||+|++||++++++.+|+|||+++.|++++||++|++|+|++|||+|||+|+|+|+||+||||||||+||++|+|+| T Consensus 320 ~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVD 399 (529) T ss_pred hhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccce Q lcl|NC_014036. 393 SGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPV 472 (522) Q Consensus 393 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~ 472 (522) +++++++...+.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+ T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 AGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPV 479 (529) T ss_pred cccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 473 MGFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 473 ~~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) |||||||||++|||+++.+|++++||+||+||++++|||+|||||+|||| T Consensus 480 ~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 480 MGFKTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 99999999999999999999999999999999999999999999999999 No 5 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=8.8e-254 Score=1407.92 Aligned_cols=521 Identities=71% Similarity=1.127 Sum_probs=486.0 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |+++|+|+|||+||||+ ||||+|++.|||+|+|+|||||||+|+|+|.|||++++++|+.+|.|++++|+|||++++|+ T Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 79 (528) T protein:vir:66 1 MKTTKELMEKWSPLLEN-EKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIA 79 (528) T ss_pred CcchHHHHHHhHHhhcC-CCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhcc Confidence 99999999999999995 78999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..++.++|++.+.+|+.||+.... T Consensus 80 es~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~ 159 (528) T protein:vir:66 80 AGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAK 159 (528) T ss_pred ccccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999986443 Q ss_pred cc-------cccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccch Q lcl|NC_014036. 161 PS-------NGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATS 233 (522) Q Consensus 161 ~~-------~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts 233 (522) .. ..+...+.......|+.+.+++.+++....+.................+.......+.+..++++.||+|+ T Consensus 160 ~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta 239 (528) T protein:vir:66 160 EATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATS 239 (528) T ss_pred cccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchh Confidence 22 11222333445556788888888887776665554444333333333344455566777889999999999 Q ss_pred hhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_014036. 234 VAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMIN 313 (522) Q Consensus 234 ~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~ 313 (522) .+|+++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+ T Consensus 240 ~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~ 319 (528) T protein:vir:66 240 IAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) T ss_pred hhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcc Q lcl|NC_014036. 314 YTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS 393 (522) Q Consensus 314 ~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~ 393 (522) +++++|+++|++++++++|+|||++++|++|+||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|+ T Consensus 320 ~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~ 399 (528) T protein:vir:66 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) T ss_pred heeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 394 GITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 394 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) +++++.++.+...++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 400 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~ 479 (528) T protein:vir:66 400 GISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVL 479 (528) T ss_pred cccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCcccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+++.+|++++||+||+||+++||||+|||||+|||| T Consensus 480 g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 480 GFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999 No 6 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=9.2e-251 Score=1391.36 Aligned_cols=518 Identities=71% Similarity=1.122 Sum_probs=498.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |+|. +|+|||+||||| ||+|+|++.|||+|+++|||||||+|.+++.||+++++++|+.||+|++++++|||++++|+ T Consensus 1 ~~~~-~l~~kw~p~l~~-~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~ 78 (519) T protein:vir:10 1 MKKN-ALVQKWSALLEN-EALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIA 78 (519) T ss_pred Cchh-HHHHHhHHhhcc-cccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccc Confidence 8776 799999999995 89999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ++++|++|++|+|+||+|+||++|||||+||||||||||||||||||||||+++++++++.|+|++|||+|++|||+++. T Consensus 79 ~~~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~ 158 (519) T protein:vir:10 79 AGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAA 158 (519) T ss_pred cccccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccc Confidence 99999999999999999999999999999999999999999999999999999998889999999999999999998665 Q ss_pred cccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccc Q lcl|NC_014036. 161 PSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQ 240 (522) Q Consensus 161 ~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~ 240 (522) . ....+........++.+++.+...+++..+..........++....++.+.....+.+..|++++||+|+.+|+++. T Consensus 159 ~--~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~ 236 (519) T protein:vir:10 159 E--TFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEG 236 (519) T ss_pred c--ccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhcccc Confidence 3 34555666677788888999999999988888877777777777778888889999999999999999999999999 Q ss_pred cCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccc Q lcl|NC_014036. 241 FNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGK 320 (522) Q Consensus 241 lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 320 (522) ||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++||+|+ T Consensus 237 lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~ 316 (519) T protein:vir:10 237 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGK 316 (519) T ss_pred CCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc Q lcl|NC_014036. 321 TGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ 400 (522) Q Consensus 321 ~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~ 400 (522) +|||++++.++|+|||++++|+.++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|.+++++++ T Consensus 317 ~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~ 396 (519) T protein:vir:10 317 SGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQ 396 (519) T ss_pred eecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeec Q lcl|NC_014036. 401 GLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 480 (522) Q Consensus 401 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~ 480 (522) +.+...++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||| T Consensus 397 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 476 (519) T protein:vir:10 397 GLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 476 (519) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeeeeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCccccccCCccccccCcch-HHhhccccceeeeeeeccC Q lcl|NC_014036. 481 VGINPFANSRSQAPSDRITSGMI-TKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 481 l~~nP~~~~~~~~~~~~i~~g~~-~~~~~~~~~~~r~~~Vk~~ 522 (522) |++|||++..+|++..||+||+| |.+..++|.|||||+|||| T Consensus 477 l~~NP~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 477 IGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred eeecCcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 99999999999999999999987 7899999999999999999 No 7 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=1.1e-250 Score=1390.97 Aligned_cols=521 Identities=73% Similarity=1.128 Sum_probs=476.4 Q ss_pred Ccch-HHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhcccccccccccccccccccc Q lcl|NC_014036. 1 MSKK-NELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKI 79 (522) Q Consensus 1 ~~~~-~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~ 79 (522) |+.. |+|+|||+||||+ ||||+|++.|||+|+|+|||||||+++|+|.|||++++++++.+|+|++++|+|||++++| T Consensus 1 ~~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i 79 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEG-EGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNI 79 (529) T ss_pred CccchHHHHHHhhHhhcC-CccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccc Confidence 5533 6899999999995 8899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccccc Q lcl|NC_014036. 80 ASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGA 159 (522) Q Consensus 80 ~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~ 159 (522) +|||+|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+++|+||+.+.+|+.|||.+. T Consensus 80 ~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLAT 159 (529) T ss_pred ccccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999998999999999999999999865 Q ss_pred ccccc------cccccccccccccceeecccccccceeeeecccc-ccccCCCCcccccccccccccccccccccccccc Q lcl|NC_014036. 160 APSNG------FTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGA-PVTVTGSTDDALDAAVIAEQEKGTLAEISYGMAT 232 (522) Q Consensus 160 ~~~~~------~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~-p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~T 232 (522) ....+ +.........+.++...+.+...++++.....+. +...++......+.........+..+++++||+| T Consensus 160 ~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsT 239 (529) T protein:vir:10 160 KGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhh Confidence 43322 2333333444444444444444444444333222 2222222333445556667788899999999999 Q ss_pred hhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_014036. 233 SVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMI 312 (522) Q Consensus 233 s~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i 312 (522) +.+|+++.||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++| T Consensus 240 a~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l 319 (529) T protein:vir:10 240 SIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhc Q lcl|NC_014036. 313 NYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID 392 (522) Q Consensus 313 ~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~ 392 (522) +.+|++++.+|++++++++|+|||++++|+.++||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+|+| T Consensus 320 ~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALID 399 (529) T ss_pred hhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccce Q lcl|NC_014036. 393 SGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPV 472 (522) Q Consensus 393 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~ 472 (522) .+.+++.++...+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+ T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 TNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPV 479 (529) T ss_pred ccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 99998888888888899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 473 MGFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 473 ~~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) |||||||||++|||+++.+|++++||+||+||++++|||+|||||+|||| T Consensus 480 ~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 480 MGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 99999999999999999999999999999999999999999999999999 No 8 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=1.2e-250 Score=1390.66 Aligned_cols=520 Identities=73% Similarity=1.131 Sum_probs=474.1 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |++ |+|+|||+||||+ ||||+|++.|||+|+|+|||||||+++|++.|||++++++++.+|+|++++|+|||++++|+ T Consensus 3 ~~~-~~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~ 80 (529) T protein:vir:10 3 LKN-KEILNKWTPLLEG-EGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIA 80 (529) T ss_pred ccH-HHHHHHhHHHhcC-CccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhccccccccccccc Confidence 544 4799999999995 88999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) |||+|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+++|+||+++.+|+.|||.... T Consensus 81 est~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ 160 (529) T protein:vir:10 81 AGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATK 160 (529) T ss_pred cccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCcccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999997654 Q ss_pred cccc------cccccccccccccceeecccccccceeeeecccc-ccccCCCCcccccccccccccccccccccccccch Q lcl|NC_014036. 161 PSNG------FTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGA-PVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATS 233 (522) Q Consensus 161 ~~~~------~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~-p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts 233 (522) ...+ +..+......+.++.....+-..++.+.....+. +...++......+.........+..++++.||+|+ T Consensus 161 ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta 240 (529) T protein:vir:10 161 GATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATS 240 (529) T ss_pred ccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccchh Confidence 3322 2222233333333333333333344433332222 22222222333444556667788899999999999 Q ss_pred hhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_014036. 234 VAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMIN 313 (522) Q Consensus 234 ~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~ 313 (522) .+|+++.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+ T Consensus 241 ~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~ 320 (529) T protein:vir:10 241 IAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWIN 320 (529) T ss_pred hhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcc Q lcl|NC_014036. 314 YTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS 393 (522) Q Consensus 314 ~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~ 393 (522) ++|++++.+|++++++++|+|||++++|+.++||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|. T Consensus 321 ~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~ 400 (529) T protein:vir:10 321 YTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDT 400 (529) T ss_pred hhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 394 GITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 394 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) +++++.++...+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 401 ~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 480 (529) T protein:vir:10 401 NISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVM 480 (529) T ss_pred hccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+++.+|++++||+||+||++++|||+|||||+|||| T Consensus 481 g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 481 GFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999 No 9 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=1.2e-250 Score=1390.64 Aligned_cols=517 Identities=71% Similarity=1.110 Sum_probs=496.3 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |+++|+|+|||+||||+ ||||+|++ +||+|+|+|||||||+++++|+|||++|+++|+.+|.|++++++||+++++|+ T Consensus 3 ~~~~~~l~~kw~p~l~~-~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~ 80 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEG-EGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIA 80 (521) T ss_pred cchhHHHHHhhhhhhcc-CCCCcccc-chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCcccccccccc Confidence 99999999999999996 89999987 59999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++.+++|+|++++++|+.|||+++. T Consensus 81 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~a 160 (521) T protein:vir:10 81 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAA 160 (521) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999888999999999999999998765 Q ss_pred cccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccc Q lcl|NC_014036. 161 PSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQ 240 (522) Q Consensus 161 ~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~ 240 (522) .. ++.....++...|+.+.+.+...++++.+.....+...+++++..++.........+..|+++.||+|+.+|+++. T Consensus 161 t~--~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~ 238 (521) T protein:vir:10 161 KK--FAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQES 238 (521) T ss_pred cc--cccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhcc Confidence 33 4555666777788889999999999988888888888888888888888899999999999999999999999999 Q ss_pred cCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccc Q lcl|NC_014036. 241 FNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGK 320 (522) Q Consensus 241 lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 320 (522) ||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|++++|+|+ T Consensus 239 ~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~ 318 (521) T protein:vir:10 239 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGK 318 (521) T ss_pred CCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc Q lcl|NC_014036. 321 TGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ 400 (522) Q Consensus 321 ~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~ 400 (522) +||+.++++++|+|||++++|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|.+++++++ T Consensus 319 ~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~ 398 (521) T protein:vir:10 319 SGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQ 398 (521) T ss_pred eeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeec Q lcl|NC_014036. 401 GLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 480 (522) Q Consensus 401 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~ 480 (522) +++.+.++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||| T Consensus 399 ~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 478 (521) T protein:vir:10 399 GLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 478 (521) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCccccccCCccccccCcchHHhhcccc--ceeeeeeeccC Q lcl|NC_014036. 481 VGINPFANSRSQAPSDRITSGMITKEMFGKN--AYFRKVYVKGL 522 (522) Q Consensus 481 l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~--~~~r~~~Vk~~ 522 (522) |++|||+++.+|++. |+|++++|++++|+| .|||||+|||| T Consensus 479 l~~NP~~~~~~~~~~-~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 479 IGINPFAESAAQAPA-SRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeecCcccccCCccc-eeecccchhhhccccccceeeeeeecCC Confidence 999999999999765 888999999877655 59999999999 No 10 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=1.1e-249 Score=1385.40 Aligned_cols=517 Identities=70% Similarity=1.104 Sum_probs=493.7 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |+++|+|+|||+||||+ ||||+|++ +||+|+|+|||||||+++++|+|||++++++|+.+|.|++++++||+++++|+ T Consensus 3 ~~~~~~l~~kw~p~l~~-~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia 80 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEG-EGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIA 80 (521) T ss_pred cchhHHHHHhhhhhhcc-CCCCcccc-chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCccccc Confidence 99999999999999996 89999987 59999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+++|+||+++++|+.|||+++. T Consensus 81 es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~ 160 (521) T protein:vir:72 81 AGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAA 160 (521) T ss_pred ccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999998888999999999999999999765 Q ss_pred cccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccc Q lcl|NC_014036. 161 PSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQ 240 (522) Q Consensus 161 ~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~ 240 (522) .. +.........+.|+.+++.+...++++.+.....+.....++....+..+....+.+..|+++.||+|+.+|+++. T Consensus 161 ~~--~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~ 238 (521) T protein:vir:72 161 KK--FPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEG 238 (521) T ss_pred cc--ccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcc Confidence 43 4555666677888889988888888888777777777667777777888888899999999999999999999999 Q ss_pred cCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccc Q lcl|NC_014036. 241 FNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGK 320 (522) Q Consensus 241 lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 320 (522) +|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|++++|+|+ T Consensus 239 ~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~ 318 (521) T protein:vir:72 239 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGK 318 (521) T ss_pred cCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc Q lcl|NC_014036. 321 TGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ 400 (522) Q Consensus 321 ~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~ 400 (522) +||+.++++++|+|||++++|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|++++++++ T Consensus 319 ~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~ 398 (521) T protein:vir:72 319 SGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQ 398 (521) T ss_pred eeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeec Q lcl|NC_014036. 401 GLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 480 (522) Q Consensus 401 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~ 480 (522) +++.+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||| T Consensus 399 ~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 478 (521) T protein:vir:72 399 GLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 478 (521) T ss_pred cccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCccccccCCccccccCcchHHhhcccc--ceeeeeeeccC Q lcl|NC_014036. 481 VGINPFANSRSQAPSDRITSGMITKEMFGKN--AYFRKVYVKGL 522 (522) Q Consensus 481 l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~--~~~r~~~Vk~~ 522 (522) |++|||+++.+|+++ |++++++|++++|+| .|||||+|||| T Consensus 479 l~~NP~~~~~~~~~a-~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 479 IGINPFAESAAQAPA-SRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeecCcccccCcccc-eeecCcChhhhcCccccceeeeeeecCC Confidence 999999999999765 889999999887665 49999999999 No 11 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=1.8e-247 Score=1373.31 Aligned_cols=518 Identities=60% Similarity=0.948 Sum_probs=469.3 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhh--hhhcchhhhhhhccc--------cccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVD--PVYRDEKIVESFGGF--------LAEAEIAG 70 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~--~~~~~~~~~~~~~~~--------l~ea~~~~ 70 (522) |++. +|+|||+||||+ ||||+|++.|||+|+|+|||||||+|+|| +.|||++++++|+.| |.||++++ T Consensus 1 ~~~~-~l~~kw~p~l~~-~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~ 78 (534) T protein:vir:10 1 MSKK-SLLKKWQPLVES-EGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGG 78 (534) T ss_pred Cchh-HHHHHhHHhhcC-CccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccc Confidence 7765 799999999995 88999999999999999999999999998 699999999999988 99999999 Q ss_pred cccccccccccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccc Q lcl|NC_014036. 71 DHGYDATKIASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSP 150 (522) Q Consensus 71 ~~g~~~~~~~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Ea 150 (522) +|||++.+|+||++|++|++|||+||+|||||+|||||+||||||||||||||||||||||.++++..+++||||+.+.+ T Consensus 79 ~~g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~a 158 (534) T protein:vir:10 79 DHGYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGP 158 (534) T ss_pred ccccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999998887889999998889 Q ss_pred cccccccccccccccccccccccccccceeeccc-----ccccceeeeeccccccccCCCCccccccccccccccccccc Q lcl|NC_014036. 151 DSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDF-----VETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAE 225 (522) Q Consensus 151 dt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~-----~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~ 225 (522) |++|||+++... ...+....+...++.+++.. ...++...+......+....++....+.........+..++ T Consensus 159 dt~fSG~~~a~~--~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~ 236 (534) T protein:vir:10 159 DADFSGRGAAQD--IAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVE 236 (534) T ss_pred cccccccccccc--cccccccccccccccccccccccccccccccccccccccccccccCCcccccccccccccccccee Confidence 999999865432 23334444444555444322 22223222222222222222222222233344455667899 Q ss_pred ccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|NC_014036. 226 ISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEIN 305 (522) Q Consensus 226 ~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEIN 305 (522) ++.||+|+.+|+++.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||| T Consensus 237 ~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEIN 316 (534) T protein:vir:10 237 TSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEIN 316 (534) T ss_pred cccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHH Q lcl|NC_014036. 306 REIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVV 385 (522) Q Consensus 306 Reii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va 385 (522) ||||++|+++|++++.+|+.++++++|+|||+++.|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+|| T Consensus 317 Reii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va 396 (534) T protein:vir:10 317 REMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVA 396 (534) T ss_pred HHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccC Q lcl|NC_014036. 386 SALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSD 465 (522) Q Consensus 386 ~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~D 465 (522) ++|+|+|.+.++++.+.+.+.++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++| T Consensus 397 ~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~d 476 (534) T protein:vir:10 397 AALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTD 476 (534) T ss_pred HHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccceeeeeeeecceecCccccccCCccccccCcch-HHhhccccceeeeeeeccC Q lcl|NC_014036. 466 PKNFQPVMGFKTRYGVGINPFANSRSQAPSDRITSGMI-TKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 466 p~s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~-~~~~~~~~~~~r~~~Vk~~ 522 (522) |+||||+|||||||||++|||++..++++..||+||++ |++++|+|+|||||+|||| T Consensus 477 p~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 477 PKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred CccccceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 99999999999999999999999999999899999976 9999999999999999999 No 12 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=2.6e-238 Score=1323.10 Aligned_cols=510 Identities=60% Similarity=0.975 Sum_probs=445.9 Q ss_pred HHHHHhhhhhhccccc--hhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccccc Q lcl|NC_014036. 5 NELMEKWNDLLESQEG--LPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASG 82 (522) Q Consensus 5 ~~l~~kw~p~l~~~~~--~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~ 82 (522) -+|+|||+||||+ || +|+|++.|||+|+|+|||||||+++|+++|||++++++|+.+|+|++++|+|||++.+|+|| T Consensus 1 ~~l~~kw~p~l~~-~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s 79 (514) T protein:vir:56 1 MNLTEKWKDLLEA-EGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQG 79 (514) T ss_pred CchhhhhhHHhcc-cccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhcccccccccccccccccccccc Confidence 4799999999996 67 89999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccc Q lcl|NC_014036. 83 NSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPS 162 (522) Q Consensus 83 t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~ 162 (522) ++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++.+ +.||||+++|+|++|||+++... T Consensus 80 ~~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~t--g~EAf~~~nEadt~fSG~~~~~~ 157 (514) T protein:vir:56 80 VTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAAST 157 (514) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcc--cccccccccccCcCccccccccc Confidence 9999999999999999999999999999999999999999999999999998754 56999999999999999866543 Q ss_pred cccccccccccccccceeecccccccceeeeecccc-ccccCCCCcccccccccccccccccccccccccchhhhhcccc Q lcl|NC_014036. 163 NGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGA-PVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQF 241 (522) Q Consensus 163 ~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~-p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~l 241 (522) +...........|+....+....+.......... ......+.............+.+..|+++.||+|+.+|+++.| T Consensus 158 --~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~l 235 (514) T protein:vir:56 158 --IADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENF 235 (514) T ss_pred --cccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccC Confidence 2233333333334443332222221111111100 0000111111122234455677889999999999999999999 Q ss_pred CCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccc Q lcl|NC_014036. 242 NGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKT 321 (522) Q Consensus 242 ggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~ 321 (522) |++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..+++++. T Consensus 236 ggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~ 315 (514) T protein:vir:56 236 NGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKS 315 (514) T ss_pred CCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccc Q lcl|NC_014036. 322 GFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQG 401 (522) Q Consensus 322 ~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~ 401 (522) +|++++.. +|+|||++++|++|+||++|++|+|++|||||+|+|+|+|+||+||||||||+||++|+|+|.++++++.+ T Consensus 316 ~~~~~~~~-~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g 394 (514) T protein:vir:56 316 GWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQG 394 (514) T ss_pred cccccccc-ccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccC Confidence 99998754 89999999999999999999999999999999999999999999999999999999999999999988765 Q ss_pred c-ccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeec Q lcl|NC_014036. 402 L-QKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYG 480 (522) Q Consensus 402 ~-~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~ 480 (522) . ...+++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+|||||||| T Consensus 395 ~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 474 (514) T protein:vir:56 395 MQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYG 474 (514) T ss_pred ccccccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccCCccccceeeeeeeec Confidence 4 457899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 481 VGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 481 l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) |++|||++..++.. ++.|+++-.+..++|.|||||+|||| T Consensus 475 l~~NPy~~~~~~~~--~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 475 VQVNPFADPTASAT--KVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred eeeCCCCCcccccc--ccCCcchhhhcccccceeeeEEEecC Confidence 99999996665532 45555554455559999999999999 No 13 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1.8e-221 Score=1230.73 Aligned_cols=460 Identities=40% Similarity=0.679 Sum_probs=404.9 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccc-ccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEA-EIAGDHGYDATKI 79 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea-~~~~~~g~~~~~~ 79 (522) |+++|+|+|||+||||+ ||+|+|++.|||+|+++|||||||+|+|++ .+|.|+ +++++||+++..| T Consensus 3 ~~~~e~l~~kw~p~l~~-~~~~~i~~~~~~~v~a~l~enq~~~~~~~~------------~~l~e~~~~~~~~~~~~~~i 69 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDY-DGLDPIKDSHRRSVTAVLLENQEKELREER------------NFLSEAPNVNTNSGATAGFS 69 (470) T ss_pred cchhHHHHHhhhhhhcC-CccchhcchhhhhhhhhhhhhhHHHHhhcc------------chhhhhhhcccccccccccc Confidence 99999999999999995 889999999999999999999999999999 458888 8999999999999 Q ss_pred ccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccccc Q lcl|NC_014036. 80 ASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGA 159 (522) Q Consensus 80 ~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~ 159 (522) +|||+|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++. ++|+||+ |+|++|||.++ T Consensus 70 ~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~s----G~Eaffn--EA~T~fSG~~~ 143 (470) T protein:vir:10 70 ADATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQS----GTEALFN--EADTAFSGQPD 143 (470) T ss_pred ccccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCC----ccceeee--cCCcccCcccc Confidence 99999999999999999999999999999999999999999999999999999874 4588875 99999999866 Q ss_pred ccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhcc Q lcl|NC_014036. 160 APSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQE 239 (522) Q Consensus 160 ~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~ 239 (522) ....... .....+... +.......++.+..++. ......+..|+++.||+|+.+|. T Consensus 144 ~~~~~~~-~~~~~a~~~-------------------g~~~~~~~gt~~~~~~~--~~~~a~~~~y~~~~GMsTa~aE~-- 199 (470) T protein:vir:10 144 GLDDTSG-FTATGANNV-------------------GLGTTAQQGSNPGLLNS--TAAQTNATDYNVGQGMRTDSAED-- 199 (470) T ss_pred ccccccc-ccccccccc-------------------ccccccccccccccccc--ccccccccccccccccchHHhhh-- Confidence 5432111 000000000 00011111222211111 11223455688999999999996 Q ss_pred ccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccc Q lcl|NC_014036. 240 QFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVG 319 (522) Q Consensus 240 ~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~ 319 (522) ||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+++ T Consensus 200 -lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~ 278 (470) T protein:vir:10 200 -LGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPG 278 (470) T ss_pred -cCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhc Confidence 6788899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccc Q lcl|NC_014036. 320 KTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAG 399 (522) Q Consensus 320 ~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~ 399 (522) +..|+ ..+|+|||+++.| +||++|+||+|++||++++|+|+|+|+||+||||||||+||++|+|+|.+.+.+. T Consensus 279 k~~~~----~~~Gv~Dl~~~~~---gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~ 351 (470) T protein:vir:10 279 AQANV----AAAGTFDLDTDSN---GRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPA 351 (470) T ss_pred eeccc----cccceEEeecccc---hhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccc Confidence 98875 5689999997666 6999999999999999999999999999999999999999999999987766655 Q ss_pred cccccccccccccceeEEEecCceEEEecCC------CccceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 400 QGLQKTLNVDTTKAVFAGVLGGVYKVYIDQY------ARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 400 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y------~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) ++..+++|+++++|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+||||+| T Consensus 352 --~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 429 (470) T protein:vir:10 352 --LNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKI 429 (470) T ss_pred --cccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCcccccee Confidence 56779999999999999999999999997 8889999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+.+.+|+ +.||++ |+|.|||||+|||| T Consensus 430 g~~tRY~l~~NP~~~~~~~~-~~~i~~--------~~n~y~r~~~v~~l 469 (470) T protein:vir:10 430 GFKTRYGLVENPFSQGTTQG-LGTLTR--------NSNRYYRRVKVANL 469 (470) T ss_pred eeeeeeceeecCcccCCCcc-cccccC--------CCCceeeEEEeecc Confidence 99999999999999999995 457775 78899999999999 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=7e-220 Score=1222.08 Aligned_cols=460 Identities=36% Similarity=0.620 Sum_probs=399.0 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |+++|+|+|||+||||+ ||+|+|++.|||+|+|+|||||||+|+|+|.|+++.++++|+. .+......++ T Consensus 1 ~~~~e~l~~kW~plLe~-~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~---------~~~~~~n~~~ 70 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNH-GEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGA---------GTIAPAGSAL 70 (468) T ss_pred CcchHHHHHhhhHhhcC-CccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCC---------cccchhhhhh Confidence 99999999999999995 8899999999999999999999999999999999999999862 2234445677 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAA 160 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~ 160 (522) ++++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++. ++|+||+ |||++|||.+.. T Consensus 71 ~~~~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~----g~EAf~n--Eadt~fSg~~~~ 144 (468) T protein:vir:10 71 GSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFN--EPDTGFTGGYDA 144 (468) T ss_pred hhcccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCC----Cccceec--cccccccccccc Confidence 8899999999999999999999999999999999999999999999999999885 4688875 999999997544 Q ss_pred cccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccc Q lcl|NC_014036. 161 PSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQ 240 (522) Q Consensus 161 ~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~ 240 (522) ........ .+ ........++++. . ...+.+..++++.||+|+.+|.++ T Consensus 145 ~~~~~~~~-------~~------------------~~~~~~~~g~~~~-----~-~~~a~~~~~~~g~gMsTa~aE~lG- 192 (468) T protein:vir:10 145 SQGDYAVR-------TG------------------AGVGGDSEGNNPA-----L-LNDAAPGTYEVGSKMPREDLERMG- 192 (468) T ss_pred cccccccc-------cc------------------cccccCCCCCccc-----c-cccccccccccccccchHHHhhcC- Confidence 32111000 00 0001111122221 1 112344568899999999999853 Q ss_pred cCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccc Q lcl|NC_014036. 241 FNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGK 320 (522) Q Consensus 241 lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~ 320 (522) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|++++ T Consensus 193 ---~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k 269 (468) T protein:vir:10 193 ---EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGA 269 (468) T ss_pred ---CCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhhee Confidence 35678999999999999999999999999999999999999999999999999999999999999999999999987 Q ss_pred ccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc Q lcl|NC_014036. 321 TGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ 400 (522) Q Consensus 321 ~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~ 400 (522) + ..++.+|+|||+++.| +||++|++|+|++|||+|+|+|+|+|+||+||||||||+||++|+|+|.+.++++. T Consensus 270 ~----~g~~~~Gv~d~~~~~~---~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~ 342 (468) T protein:vir:10 270 Q----NNVANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGL 342 (468) T ss_pred c----cccccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccc Confidence 4 3577899999997666 69999999999999999999999999999999999999999999999999999987 Q ss_pred ccccc---cccccccceeEEEecCceEEEecCCCc----cceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 401 GLQKT---LNVDTTKAVFAGVLGGVYKVYIDQYAR----GDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 401 ~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) ..+.. +++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 343 ~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~ 422 (468) T protein:vir:10 343 NGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKI 422 (468) T ss_pred cccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCccccee Confidence 77765 479999999999999999999999865 79999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+...+.. ...++|.+|.+ |+|.|||||+|||| T Consensus 423 g~~tRY~l~~NP~~~~~~~~--~g~~~~~~~~~--~~N~y~r~~~v~~l 467 (468) T protein:vir:10 423 GFKTRYGMVSNPFVTTNGLY--NGTPDGEALTP--NANMYYRRVQVTNL 467 (468) T ss_pred eeeeeeceeecccceecccc--CCCcccccccc--cccceeeeEEEecc Confidence 99999999999999644321 13356666644 79999999999999 No 15 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=1.9e-216 Score=1203.28 Aligned_cols=454 Identities=39% Similarity=0.675 Sum_probs=393.0 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) ||+ |+|+|||+||||+ ||+|+|++.+||+|+++|||||||+|+|++ .+|+|+ .++||+++. T Consensus 1 ms~-~~l~~~w~~~l~~-~~~~~i~~~~~~~~~~~~~enq~~~~~~~~------------~~l~ea--~~~~g~~~~--- 61 (462) T protein:vir:10 1 MSI-QQLQEKWAPVLNH-ESVPEIKDSYKKGVVAQLLENQENAIREEG------------QVLNET--LQTTGYTTG--- 61 (462) T ss_pred Cch-HHHHHHhhhhhcc-cccchhhhhhHHHHHHHHhhhHHHHHHhcc------------cchhcc--ccccCCCcC--- Confidence 998 5899999999995 789999999999999999999999999977 679999 488998865 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCC--Ccccchhcccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLA--SGAKEAFHPMFSPDSMYSGQG 158 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~--t~~~eA~~~~~Eadt~fSG~g 158 (522) +++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++.+ .++.||||+ |+|+.|||.+ T Consensus 62 -~~~t~~~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfn--Eadt~fSg~~ 138 (462) T protein:vir:10 62 -DTATGPVAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFN--EPNAGFSGGA 138 (462) T ss_pred -cccccccccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhc--cCCcCccccc Confidence 67799999999999999999999999999999999999999999999999987653 357888875 9999999976 Q ss_pred cccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhc Q lcl|NC_014036. 159 AAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQ 238 (522) Q Consensus 159 ~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal 238 (522) +............ .... ...++++...+.. .......+..+.||+|+.+|++ T Consensus 139 ~~~~~~~~~~~~~-----------------~~~~--------~~~g~~~~~~~~~---~~g~~~~~~~~~GM~Ta~aE~l 190 (462) T protein:vir:10 139 GTGLSNYDPTASS-----------------SAVN--------DAEGANPGLLNDS---PAGTYEVTGDATGMATATAEAL 190 (462) T ss_pred ccccccccccccc-----------------cccc--------ccccccceeecCC---Cccceecccccccccchhcccc Confidence 5432211100000 0000 0011111111100 0112234567889999999986 Q ss_pred cccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccc Q lcl|NC_014036. 239 EQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQV 318 (522) Q Consensus 239 ~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~ 318 (522) +. ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|++ T Consensus 191 g~--~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~ 268 (462) T protein:vir:10 191 DD--SSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVK 268 (462) T ss_pred CC--ccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhhee Confidence 52 4667799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccc Q lcl|NC_014036. 319 GKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 398 (522) Q Consensus 319 ~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~ 398 (522) ++.+|+ ..+|+|||+++.+ +||++|++|+|++||+++||+|+|+|+||+||||||||+||++|+|+|.+.+.| T Consensus 269 ~k~~~~----~~~Gv~dl~~~~~---gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p 341 (462) T protein:vir:10 269 GAIANT----ATDGIFDLDVDSN---GRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAP 341 (462) T ss_pred eecccc----cccceeeeccccc---hHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccc Confidence 998875 5689999987655 699999999999999999999999999999999999999999999999999988 Q ss_pred cccccccc-ccccccceeEEEecCceEEEecCC----CccceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 399 GQGLQKTL-NVDTTKAVFAGVLGGVYKVYIDQY----ARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 399 ~~~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) +...+..+ ++|+++.+|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||+++|++||+||||+| T Consensus 342 ~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 421 (462) T protein:vir:10 342 GLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKI 421 (462) T ss_pred cccccccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCcccccee Confidence 76666665 799999999999999999999998 6789999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+.+.++++ +|++ +|+|.|||||+|||| T Consensus 422 g~~tRY~l~~NP~t~~~~~~~-~~~~--------~~~n~y~r~~~v~~l 461 (462) T protein:vir:10 422 GFKTRYGMVSNPFSGGLTQGS-GALT--------ANANKYYRRVQVANL 461 (462) T ss_pred eeeeeeeeeecCCCCCcCCcc-cccc--------ccCcceeeeEEeecc Confidence 999999999999999999965 4555 578999999999999 No 16 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=3e-213 Score=1185.72 Aligned_cols=449 Identities=40% Similarity=0.683 Sum_probs=392.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) ||+ |+|+|||+||||| ||||||++.|||+|+++|||||||+|+|++ .+|+||. ++||+.+. T Consensus 1 m~~-~~l~~~w~~~l~~-~~~~~i~~~~~~~~~~~~lenq~~~~~~~~------------~~l~ea~--~~~g~~~~--- 61 (457) T protein:vir:10 1 MSF-QNLQEKWAPVLEH-DSLPEIGDSYKKGVVAQLLENQEKAIAEEG------------KILTETL--QTTGYTGG--- 61 (457) T ss_pred Cch-HHHHHHhhHhhcc-CccchhhhhHHHHHHHHHhhhHHHHHHhcc------------ccccccc--cccCCCcc--- Confidence 998 5799999999995 889999999999999999999999999977 7799994 99999877 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCC--Ccccchhcccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLA--SGAKEAFHPMFSPDSMYSGQG 158 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~--t~~~eA~~~~~Eadt~fSG~g 158 (522) |++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++.+. .+.+|+||+ |+|+.|||.. T Consensus 62 -s~~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~n--Eadt~fSg~~ 138 (457) T protein:vir:10 62 -DTVTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFN--EPNAGFSGGP 138 (457) T ss_pred -cccccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeee--ccCcccCccc Confidence 56799999999999999999999999999999999999999999999999988664 345788865 9999999975 Q ss_pred cccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhc Q lcl|NC_014036. 159 AAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQ 238 (522) Q Consensus 159 ~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal 238 (522) +........ ......++++...+... ....+.++++.||+|+.+|.+ T Consensus 139 ~~~~~~~~~------------------------------~~~~~~gt~~~~~~~~~---~~~~~~~~~~~gmsTA~aE~l 185 (457) T protein:vir:10 139 GAYDPGATG------------------------------VTNDAEGTNPALLNDSP---AGTYEQADDATGMSTATVEAL 185 (457) T ss_pred ccccccccc------------------------------cccccccccccccCccc---cccccccccccchhhhhhhcc Confidence 443211000 00111122222222111 123346788999999999986 Q ss_pred cccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccc Q lcl|NC_014036. 239 EQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQV 318 (522) Q Consensus 239 ~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~ 318 (522) ++ +++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|++ T Consensus 186 gd--~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~ 263 (457) T protein:vir:10 186 DD--STANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVA 263 (457) T ss_pred CC--CCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhhee Confidence 42 5667789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccc Q lcl|NC_014036. 319 GKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 398 (522) Q Consensus 319 ~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~ 398 (522) ++.+|+. .+|+|||+++.| +||++|+||+|++||++++|+|+++|+||+||||||||+||++|+|+|.+.++| T Consensus 264 ~~~~~~~----~~gv~dl~~~~~---g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p 336 (457) T protein:vir:10 264 GAQNNTA----TAGVFDLDVDSN---GRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTP 336 (457) T ss_pred eeccccc----cceeeeeecccc---chhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccc Confidence 9988864 589999986655 699999999999999999999999999999999999999999999999998888 Q ss_pred ccccccc-cccccccceeEEEecCceEEEecCCC----ccceEEEEEecCCCccceeEeecccccccccccCCcccccee Q lcl|NC_014036. 399 GQGLQKT-LNVDTTKAVFAGVLGGVYKVYIDQYA----RGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVM 473 (522) Q Consensus 399 ~~~~~~~-~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~ 473 (522) +...+.. .++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+++|++||+||||+| T Consensus 337 ~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 416 (457) T protein:vir:10 337 ALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKI 416 (457) T ss_pred hhhccccccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCcccccee Confidence 7655555 46899999999999999999999887 479999999999999999999999999999999999999999 Q ss_pred eeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 474 GFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 474 ~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ||||||||++|||+.+.+|++. |++. |.|.|+||+.|+|| T Consensus 417 g~~tRY~l~~NP~~~~~~~~~~-~~~~--------~~n~~~~rs~vs~l 456 (457) T protein:vir:10 417 GFKTRYGMVSNPFAGGLTQGSG-ALTV--------NANKYYRRVQVANL 456 (457) T ss_pred eeeeeeeeeecccccccccccc-cccc--------cchhhcceeeeeec Confidence 9999999999999999999764 5553 46789999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=4.1e-194 Score=1080.80 Aligned_cols=444 Identities=23% Similarity=0.318 Sum_probs=339.7 Q ss_pred Ccch---HHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhcccccccccccccccccc Q lcl|NC_014036. 1 MSKK---NELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDAT 77 (522) Q Consensus 1 ~~~~---~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~ 77 (522) ||++ |+|+|||+||||+ |++.|||+|+|+|||||||+ ++ + T Consensus 1 ~~~~~~~e~l~~kw~p~l~~------~~~~~~~~~~a~llenq~~~---~~----------------------------~ 43 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEG------CRNDWERHTLATLLENQYRE---AK----------------------------K 43 (523) T ss_pred CCcchhhHHHHHhhhhhhcc------cCChhHHHHHHHHhhhhhHH---HH----------------------------H Confidence 9987 8999999999995 66889999999999999974 22 2 Q ss_pred ccccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcc----------- Q lcl|NC_014036. 78 KIASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHP----------- 146 (522) Q Consensus 78 ~~~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~----------- 146 (522) +|+|++.|++|++|+| ||+||||++|||||+||||||||||||||||||||||.++.+. |++|+ T Consensus 44 ~l~e~~~~~~~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gt----eA~yg~~~~~~~~a~~ 118 (523) T protein:vir:59 44 HLMETTQTTEVDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGN----GSVYGGTGLTTDTATG 118 (523) T ss_pred hhhhhhhccccccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCc----ccccCccccCcccccc Confidence 3455566899999997 9999999999999999999999999999999999999998643 34433 Q ss_pred -ccccccccccccccccccccccccccc------------cccccee--eccc-----ccccce---------------- Q lcl|NC_014036. 147 -MFSPDSMYSGQGAAPSNGFTKLTSAQA------------IADGAIV--FHDF-----VETGRV---------------- 190 (522) Q Consensus 147 -~~Eadt~fSG~g~~~~~~~~~~~~~~~------------~a~g~~a--~~~~-----~~~g~~---------------- 190 (522) ++++++.|++.+...........++.. .+.+... +... ...+.+ T Consensus 119 ~~~ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tg 198 (523) T protein:vir:59 119 GLYDENARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPE 198 (523) T ss_pred cccccccccccccccCccCCCcccccccccccccccccchhhccccceeeeecccccccccccccccccccccccccccc Confidence 456777777754433222111111100 0000000 0000 000000 Q ss_pred ---------e-------eeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccC--CCCCcccccc Q lcl|NC_014036. 191 ---------F-------LQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFN--GSTGNPWNEM 252 (522) Q Consensus 191 ---------~-------~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lg--gs~~~~f~EM 252 (522) . ..+.........+++.................++.+.||+|+.+|.++..+ ++.++.|+|| T Consensus 199 a~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM 278 (523) T protein:vir:59 199 NTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEI 278 (523) T ss_pred ccccchhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccce Confidence 0 000000000000111111111111122234467889999999999987654 5778899999 Q ss_pred ceEEEEEEEEEecccccchhhHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccc Q lcl|NC_014036. 253 GFRIDKQVIEARSRQLKAQYSVELAQDLRAVH-GMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKA 331 (522) Q Consensus 253 sFsIEK~TVtAKSRALKAEYT~ELAQDLKAiH-GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~ 331 (522) +|+||||+|||||||||||||||||||||||| |||||+||+||||||||||||||||++|+++|++++.+|+ ..+ T Consensus 279 ~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~----~~~ 354 (523) T protein:vir:59 279 NLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGF----WSE 354 (523) T ss_pred eeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccc----ccc Confidence 99999999999999999999999999999999 9999999999999999999999999999999999998875 457 Q ss_pred eeeccccccc---cccchhH--HHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccc--c Q lcl|NC_014036. 332 GAFDFQDPID---VRGARWA--GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQ--K 404 (522) Q Consensus 332 g~fd~~~~~d---~~~~r~~--~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~--~ 404 (522) |+|||.++.| +.|.+|. +||+|+|++|||||+|+|+|+|+||+|||||||||||++|+++ +|++ . T Consensus 355 g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~--------~~~~~~~ 426 (523) T protein:vir:59 355 VVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESM--------PGFTPGN 426 (523) T ss_pred ceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhc--------cccccCC Confidence 9999997766 4444554 9999999999999999999999999999999999999999844 3343 2 Q ss_pred ccccccccceeEEEecCceEEEecCCCccceEEEEEecC-CCccceeEeeccccccccccc-CCccccceeeeeeeecce Q lcl|NC_014036. 405 TLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGD-NEMDAGIYYAPYVALTPLRGS-DPKNFQPVMGFKTRYGVG 482 (522) Q Consensus 405 ~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~-~~~d~glfyaPYv~~~~~~~~-Dp~s~qP~~~~~tRY~l~ 482 (522) ....|+++.+|+|+|+|||+||||||+++|||+|||||. .++|+|||||||||+.+++++ ||+||||+|||||||||+ T Consensus 427 ~~~~~~~~~~~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~ 506 (523) T protein:vir:59 427 DNRDGGTGIFYVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALE 506 (523) T ss_pred ccccccccceeEEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhhe Confidence 245677888999999999999999999999999999995 599999999999999999985 999999999999999998 Q ss_pred e-cCccccccCCccccccCcch Q lcl|NC_014036. 483 I-NPFANSRSQAPSDRITSGMI 503 (522) Q Consensus 483 ~-nP~~~~~~~~~~~~i~~g~~ 503 (522) + |||+.+.---+ +.. + T Consensus 507 v~nP~~~~~~~~~---~~~--~ 523 (523) T protein:vir:59 507 VVRPEFYGLLYVK---LLQ--P 523 (523) T ss_pred ecchhHhhhhhhh---hcC--C Confidence 6 99997654211 000 0 No 18 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=96.85 E-value=0.0003 Score=39.91 Aligned_cols=348 Identities=12% Similarity=0.070 Sum_probs=143.6 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHH-----HHHhhhHHHHhhhhhhhcch---------------------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLI-----AAIMEAQEKDAEVDPVYRDE---------------------- 53 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~-----~~~~enq~~~~~~~~~~~~~---------------------- 53 (522) |++.++|.++..-+.+. +-++.+..+..+- ..=|++|-+.+.++-.=.++ T Consensus 1 M~~l~el~~~~~~~~~e---~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSELALIQKAIEESQQK---MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 99999999988777642 2223222222110 01111211111000000000 Q ss_pred --hhhhhhccccccccccccccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeee Q lcl|NC_014036. 54 --KIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAV 130 (522) Q Consensus 54 --~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSr 130 (522) ...+.+...+..........-....+..++.++. .-..|.++ .+++++..+..-.++|-++||++++.-+. + T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~----~ 152 (385) T protein:vir:19 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAG-SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYV----R 152 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCC-ceecchhhhHHHHHhhhccchhhhcceecccCcceEEE----E Confidence 0000000000000000000000001111111111 01223333 45555566777788888888877652110 1 Q ss_pred ecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccc Q lcl|NC_014036. 131 YGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDAL 210 (522) Q Consensus 131 Y~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~ 210 (522) +.... +... T Consensus 153 ~~~~~--------------~~a~--------------------------------------------------------- 161 (385) T protein:vir:19 153 EEVFT--------------NNAD--------------------------------------------------------- 161 (385) T ss_pred EecCC--------------ccee--------------------------------------------------------- Confidence 10000 0000 Q ss_pred cccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014036. 211 DAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADA 290 (522) Q Consensus 211 ~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEa 290 (522) ..+| +..+++-..++++++.+.|.-+-...+|.||.||-- +.++ T Consensus 162 ----------------------~v~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~ 205 (385) T protein:vir:19 162 ----------------------VVAE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQS 205 (385) T ss_pred ----------------------eecc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHH Confidence 0011 012344455667777777777778889999999852 3478 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_014036. 291 ELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQT 370 (522) Q Consensus 291 ELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T 370 (522) .|.+-|+..|..-+|+.||.- . |.. ....|++.......... -... -..+-.|..+...|. . T Consensus 206 ~i~~~la~a~~~~~d~~~l~G---~---g~~------~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~ 267 (385) T protein:vir:19 206 YINNRLMYGLALKEEGQLLNG---D---GTG------DNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--E 267 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhc---c---CCC------Ccccccccccccccccc-cccc---cchHHHHHHHHHhhc--c Confidence 888888888888888888821 0 000 01122221111000000 0000 112223333333332 3 Q ss_pred cccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCcccee Q lcl|NC_014036. 371 GRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGI 450 (522) Q Consensus 371 ~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~gl 450 (522) .+...+.+||||.....|..+- +.. +. ....+... ..-++|.| ++|+++++.|..-+++|--- .++ T Consensus 268 ~~~~~~~~~~~~~~~~~l~~lk--d~~-G~----~l~~~~~~-~~~~~l~G-~pV~~~~~~p~~~~~~gd~~-----~~~ 333 (385) T protein:vir:19 268 SEFSASGIVLNPRDWHNIALLK--DNE-GR----YIFGGPQA-FTSNIMWG-LPVVPTKAQAAGTFTVGGFD-----MAS 333 (385) T ss_pred ccCCCCEEEEcHHHHHHHHHhh--cCC-Cc----eeccCccc-CCCceecc-eeeEEcCcCCCCcEEEeecc-----cEE Confidence 3446678999999999887541 111 10 01111111 11356777 79999999997655555210 011 Q ss_pred Eeecccc-ccccc----ccCC-ccccceeeeeeeecc-eecCccccccCCccccccCcc Q lcl|NC_014036. 451 YYAPYVA-LTPLR----GSDP-KNFQPVMGFKTRYGV-GINPFANSRSQAPSDRITSGM 502 (522) Q Consensus 451 fyaPYv~-~~~~~----~~Dp-~s~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~i~~g~ 502 (522) +. +.. ...+. ..|+ ..-+=.+-...||+. +.+|=+...- ++..+. T Consensus 334 ~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~-----~~~aa~ 385 (385) T protein:vir:19 334 QV--WDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKG-----TFSSGS 385 (385) T ss_pred EE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEE-----EeccCC Confidence 11 110 00011 1111 111223334457776 4455221110 011111 No 19 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=96.85 E-value=0.0003 Score=39.91 Aligned_cols=348 Identities=12% Similarity=0.070 Sum_probs=143.6 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHH-----HHHhhhHHHHhhhhhhhcch---------------------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLI-----AAIMEAQEKDAEVDPVYRDE---------------------- 53 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~-----~~~~enq~~~~~~~~~~~~~---------------------- 53 (522) |++.++|.++..-+.+. +-++.+..+..+- ..=|++|-+.+.++-.=.++ T Consensus 1 M~~l~el~~~~~~~~~e---~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSELALIQKAIEESQQK---MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 99999999988777642 2223222222110 01111211111000000000 Q ss_pred --hhhhhhccccccccccccccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeee Q lcl|NC_014036. 54 --KIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAV 130 (522) Q Consensus 54 --~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSr 130 (522) ...+.+...+..........-....+..++.++. .-..|.++ .+++++..+..-.++|-++||++++.-+. + T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~----~ 152 (385) T protein:vir:18 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAG-SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYV----R 152 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCC-ceecchhhhHHHHHhhhccchhhhcceecccCcceEEE----E Confidence 0000000000000000000000001111111111 01223333 45555566777788888888877652110 1 Q ss_pred ecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccc Q lcl|NC_014036. 131 YGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDAL 210 (522) Q Consensus 131 Y~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~ 210 (522) +.... +... T Consensus 153 ~~~~~--------------~~a~--------------------------------------------------------- 161 (385) T protein:vir:18 153 EEVFT--------------NNAD--------------------------------------------------------- 161 (385) T ss_pred EecCC--------------ccee--------------------------------------------------------- Confidence 10000 0000 Q ss_pred cccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014036. 211 DAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADA 290 (522) Q Consensus 211 ~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEa 290 (522) ..+| +..+++-..++++++.+.|.-+-...+|.||.||-- +.++ T Consensus 162 ----------------------~v~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~ 205 (385) T protein:vir:18 162 ----------------------VVAE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQS 205 (385) T ss_pred ----------------------eecc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHH Confidence 0011 012344455667777777777778889999999852 3478 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_014036. 291 ELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQT 370 (522) Q Consensus 291 ELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T 370 (522) .|.+-|+..|..-+|+.||.- . |.. ....|++.......... -... -..+-.|..+...|. . T Consensus 206 ~i~~~la~a~~~~~d~~~l~G---~---g~~------~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~ 267 (385) T protein:vir:18 206 YINNRLMYGLALKEEGQLLNG---D---GTG------DNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--E 267 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhc---c---CCC------Ccccccccccccccccc-cccc---cchHHHHHHHHHhhc--c Confidence 888888888888888888821 0 000 01122221111000000 0000 112223333333332 3 Q ss_pred cccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCcccee Q lcl|NC_014036. 371 GRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGI 450 (522) Q Consensus 371 ~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~gl 450 (522) .+...+.+||||.....|..+- +.. +. ....+... ..-++|.| ++|+++++.|..-+++|--- .++ T Consensus 268 ~~~~~~~~~~~~~~~~~l~~lk--d~~-G~----~l~~~~~~-~~~~~l~G-~pV~~~~~~p~~~~~~gd~~-----~~~ 333 (385) T protein:vir:18 268 SEFSASGIVLNPRDWHNIALLK--DNE-GR----YIFGGPQA-FTSNIMWG-LPVVPTKAQAAGTFTVGGFD-----MAS 333 (385) T ss_pred ccCCCCEEEEcHHHHHHHHHhh--cCC-Cc----eeccCccc-CCCceecc-eeeEEcCcCCCCcEEEeecc-----cEE Confidence 3446678999999999887541 111 10 01111111 11356777 79999999997655555210 011 Q ss_pred Eeecccc-ccccc----ccCC-ccccceeeeeeeecc-eecCccccccCCccccccCcc Q lcl|NC_014036. 451 YYAPYVA-LTPLR----GSDP-KNFQPVMGFKTRYGV-GINPFANSRSQAPSDRITSGM 502 (522) Q Consensus 451 fyaPYv~-~~~~~----~~Dp-~s~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~i~~g~ 502 (522) +. +.. ...+. ..|+ ..-+=.+-...||+. +.+|=+...- ++..+. T Consensus 334 ~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~-----~~~aa~ 385 (385) T protein:vir:18 334 QV--WDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKG-----TFSSGS 385 (385) T ss_pred EE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEE-----EeccCC Confidence 11 110 00011 1111 111223334457776 4455221110 011111 No 20 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=96.28 E-value=0.00079 Score=37.59 Aligned_cols=331 Identities=13% Similarity=0.108 Sum_probs=133.1 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhh-------------HH---HHHH---hhhHHHHhhhhhhhcchhhh----- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKK-------------QL---IAAI---MEAQEKDAEVDPVYRDEKIV----- 56 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~-------------~~---~~~~---~enq~~~~~~~~~~~~~~~~----- 56 (522) |.+.++|.++|.-+-+. +-++.+.-++ ++ +..+ +|.+++.+.+...-...... T Consensus 1 Mk~~~el~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDK---VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 99999998888765542 1111111000 00 0111 11111111111100000000 Q ss_pred --------------hhhcccccccccccccccccccccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeccCCc Q lcl|NC_014036. 57 --------------ESFGGFLAEAEIAGDHGYDATKIASGN-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTG 119 (522) Q Consensus 57 --------------~~~~~~l~ea~~~~~~g~~~~~~~~~t-~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTG 119 (522) ..|..+|.. ...........++ +.|.+. -|.-+ .+++.+-++.+..++|.++||++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~l~~-----~~~~~~~~~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 150 (397) T protein:vir:49 78 PLTKSEEEVKAGFVKDFKNLVRG-----RYQNLLDSKTDASGSDAGLT--IPQDIQTAIHTLVSQYDSLQEYVNVENVTT 150 (397) T ss_pred ccccchhHHHHHHHHHHHHHHhc-----chhHHHHHhhccccccCccc--ccHhHHHHHHHHHHhhhhHHhhhceeeccc Confidence 001111100 0000000111112 112221 13211 45555566778888999999999 Q ss_pred hhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccc Q lcl|NC_014036. 120 PTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAP 199 (522) Q Consensus 120 PTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p 199 (522) ++|-+. |...... .+.+.| T Consensus 151 ~~~~~~-----~~~~~~~-----------~~~a~~--------------------------------------------- 169 (397) T protein:vir:49 151 LTGSRV-----YEKWTDI-----------TGLANI--------------------------------------------- 169 (397) T ss_pred CccceE-----EEeeccC-----------Ccceee--------------------------------------------- Confidence 887432 2111000 000000 Q ss_pred cccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCcccccc-ceEEEEEEEEEecccccchhhHHHHH Q lcl|NC_014036. 200 VTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEM-GFRIDKQVIEARSRQLKAQYSVELAQ 278 (522) Q Consensus 200 ~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQ 278 (522) .+|. ..+++. ..+++++++.+|.-+-...+|-||.+ T Consensus 170 ----------------------------------v~E~---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ 206 (397) T protein:vir:49 170 ----------------------------------DDEA---------GKIADVDDPKLSLIKYTIKRYAGISTVTNSLLA 206 (397) T ss_pred ----------------------------------ecCc---------cccccccccceeeEEeeeeeEEeeehhHHHHHh Confidence 0010 011221 13344444445555555679999999 Q ss_pred HHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHH Q lcl|NC_014036. 279 DLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQ 358 (522) Q Consensus 279 DLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~ 358 (522) |-. .|.+++|.+-|+..|..-+|+.||.-.-... ...++++++ -...|+.. T Consensus 207 ds~----~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~------------~~~~~~~~d-------------~i~~~~~~ 257 (397) T protein:vir:49 207 DSA----ENILAWLSGWIAKKVVVTRNKAILEAIAALP------------TKPTLTKWD-------------DIIDLEAK 257 (397) T ss_pred hhH----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------cccccccHH-------------HHHHHHHh Confidence 852 5679999999999999999999984221111 112333222 23444444 Q ss_pred HHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEe--cCCCcc--- Q lcl|NC_014036. 359 IDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYI--DQYARG--- 433 (522) Q Consensus 359 i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--- 433 (522) |... +.....+|++|.....|..+- +.. + +-....+.+. ...++|.| ++|++ |...+. T Consensus 258 l~~~---------~~~~a~~vmn~~~~~~l~~lk--d~~-G---~~l~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~~~~ 320 (397) T protein:vir:49 258 VDPA---------IKQTSFFLTNTSGFTALKKVK--NAL-G---DYLMERDVKS-PTGYSIDG-FAVKEVADRWLANGTG 320 (397) T ss_pred hhhh---------hcCCCEEEEcHHHHHHHHHhh--cCC-C---ceeeccCcCC-CCCceecc-eeeEEecccccccccC Confidence 4321 224578899999999987551 111 0 0111111111 12357877 58876 222221 Q ss_pred -----------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecce-ecC--cc-----ccccCCc Q lcl|NC_014036. 434 -----------DYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INP--FA-----NSRSQAP 494 (522) Q Consensus 434 -----------dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~~-----~~~~~~~ 494 (522) +|++++.++..+. =+.+|.. .+-...+-.+-...|++.. .|| |. ...+..+ T Consensus 321 ~~~~i~~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~ 390 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDRQHMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKG 390 (397) T ss_pred CceeEEEeeccceEEEEeecceEE----EEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCC Confidence 1222222221111 1112211 0111223334444455432 233 11 0000000 Q ss_pred cccccCcchHHhhc Q lcl|NC_014036. 495 SDRITSGMITKEMF 508 (522) Q Consensus 495 ~~~i~~g~~~~~~~ 508 (522) ..+.-. + T Consensus 391 ----~~~~~~---~ 397 (397) T protein:vir:49 391 ----NLGSTA---V 397 (397) T ss_pred ----Cccccc---C Confidence 000000 0 No 21 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=95.81 E-value=0.0014 Score=36.19 Aligned_cols=352 Identities=13% Similarity=0.040 Sum_probs=124.6 Q ss_pred CcchHHHHHhhhhhh----------ccccc--------hhhhcchhhh-----HHHHHHhhhHHHHhhhhhhhcchhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLL----------ESQEG--------LPDIATKSKK-----QLIAAIMEAQEKDAEVDPVYRDEKIVE 57 (522) Q Consensus 1 ~~~~~~l~~kw~p~l----------~~~~~--------~~~i~~~~~~-----~~~~~~~enq~~~~~~~~~~~~~~~~~ 57 (522) |-|. ....+|.... +..++ ..+++..-++ ...+...+-+++...+.+ ..... T Consensus 1 ~~ke-~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 75 (413) T protein:vir:81 1 MVKE-AGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRK----GEGYK 75 (413) T ss_pred Chhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhh----hhhhh Confidence 2221 1112221110 00000 0000000000 000000000000000000 00000 Q ss_pred hhcccccccc---------------------ccccccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceee Q lcl|NC_014036. 58 SFGGFLAEAE---------------------IAGDHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGV 114 (522) Q Consensus 58 ~~~~~l~ea~---------------------~~~~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GV 114 (522) .++..+.+.. ..............++ +.+....=|..+ .+++.+-+..+..+++.| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~ 154 (413) T protein:vir:81 76 SIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATL-TDEFQGGYGTTWNRNIIYRRREKLVVADLMDN 154 (413) T ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhccc-ccccccccchhhHHHHHHHHhhhhhHHhhcce Confidence 0111000000 0000000000001111 111111113222 355555567788899999 Q ss_pred ccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeee Q lcl|NC_014036. 115 QPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQN 194 (522) Q Consensus 115 QPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~ 194 (522) +||++++.-+.-. . ..... ..++. T Consensus 155 ~~~~~~~~~~~~~-~---~~~~~-----------~~~a~----------------------------------------- 178 (413) T protein:vir:81 155 LTMTNTTIKYLME-K---ANRVV-----------EGGFK----------------------------------------- 178 (413) T ss_pred eeccCCceeEEEe-c---ccccc-----------ccccc----------------------------------------- Confidence 9999876422111 0 00000 00000 Q ss_pred ccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccce-EEEEEEEEEecccccchhh Q lcl|NC_014036. 195 VSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGF-RIDKQVIEARSRQLKAQYS 273 (522) Q Consensus 195 ~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsF-sIEK~TVtAKSRALKAEYT 273 (522) .++ |. ...++... .++.++...|..+-...+| T Consensus 179 ------------------------------~v~--------Eg---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS 211 (413) T protein:vir:81 179 ------------------------------TVA--------EG---------GKKPYMRFADFDIVTESLSKIAGLTKIT 211 (413) T ss_pred ------------------------------eec--------Cc---------ccccccCcccceeeEeeeeeEEEeehhh Confidence 001 10 01122222 1233444444444556789 Q ss_pred HHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHH Q lcl|NC_014036. 274 VELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYK 353 (522) Q Consensus 274 ~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r 353 (522) -||.+|--+ .++.|.+-|+..|..-+|+.||. -... + ..-.|+++......... ...- T Consensus 212 ~ell~ds~~-----l~~~i~~~la~~~~~~~d~~~l~---G~G~-~--------~~~~Gi~~~~~~~~~~~-----~~~~ 269 (413) T protein:vir:81 212 DEMIEDYDF-----LVSYINARLLEELAIEEERQLLL---GDGT-G--------NNLTGLLKRDGIQTLAV-----SNKD 269 (413) T ss_pred HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhc---cCCC-C--------Ccccccccccccccccc-----cccc Confidence 999998632 48889999998899999988882 1110 0 01124433322111110 0011 Q ss_pred HHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcc-----cccccccccccccccccccceeEEEecCceEEEec Q lcl|NC_014036. 354 ALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS-----GITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYID 428 (522) Q Consensus 354 ~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~-----~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D 428 (522) .++.-|......+.....+ ..+.+|++|.....|..+-- ++..+... .+.+ ......++|.| ++|+++ T Consensus 270 ~~~~~i~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~----~~~~-~~~~~~~~l~G-~pv~~s 342 (413) T protein:vir:81 270 ELADSIYKAMTNISLATPF-QADALVINPLDYQELRLAKDANGQYYGGGVFQG----QYGS-GGIMLDPAPWG-LRTVQS 342 (413) T ss_pred hhHHHHHHHHHHhhhhccC-CCcEEEEcHHHHHHHHHhhccCCceeccccccc----cccc-cccccCceecc-eeeEEc Confidence 2222333333334333444 34668899998888764310 01111000 0000 00112246776 699999 Q ss_pred CCCccceEEEEEecCC--Ccc---ceeEeecccccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcc Q lcl|NC_014036. 429 QYARGDYFTVGYKGDN--EMD---AGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGM 502 (522) Q Consensus 429 ~y~~~dy~~vG~KG~~--~~d---~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~ 502 (522) ...+..-+++|---.. -.+ -.+=..+|.. .+-.+-|=.+-+..||++.+ +|= T Consensus 343 ~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~---------------- 400 (413) T protein:vir:81 343 QVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNV------DDFENNLITVRAEERVGLMVTFPE---------------- 400 (413) T ss_pred CCCCcccEEEEecccEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEeccc---------------- Confidence 9887655555421100 000 0011111110 01123344555556666543 221 Q ss_pred hHHhhccccceeeeeeeccC Q lcl|NC_014036. 503 ITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 503 ~~~~~~~~~~~~r~~~Vk~~ 522 (522) -|+++-++.. T Consensus 401 ----------a~~~l~~~~~ 410 (413) T protein:vir:81 401 ----------AIVQLDVAEV 410 (413) T ss_pred ----------ceEEEEecCC Confidence 1111111111 No 22 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.59 E-value=0.0018 Score=35.64 Aligned_cols=357 Identities=15% Similarity=0.111 Sum_probs=143.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhH------------------HHHHHhhhHHH--Hhhhhh------------ Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQ------------------LIAAIMEAQEK--DAEVDP------------ 48 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~------------------~~~~~~enq~~--~~~~~~------------ 48 (522) |...++|.++=.-+.+. +.+++..-++. +-++|-+.|++ .+.+.. T Consensus 1 mk~~~el~~~l~el~~~---~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQ---IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 88777777776666542 11111111111 11111111110 000000 Q ss_pred ----hhcchhhhhhhccccccccccc----------cccccccccccccccccccccCcchh--hHHHHHHhhhhhhhce Q lcl|NC_014036. 49 ----VYRDEKIVESFGGFLAEAEIAG----------DHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDIC 112 (522) Q Consensus 49 ----~~~~~~~~~~~~~~l~ea~~~~----------~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~ 112 (522) ..++..-...+...+.+.+... ..+.+.....-.+..|. ..-|.-+ .+++++..+.+-.+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:81 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred chhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhhe Confidence 0000000000000000000000 00000000000111111 1124433 4556666778889999 Q ss_pred eeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceee Q lcl|NC_014036. 113 GVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFL 192 (522) Q Consensus 113 GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~ 192 (522) .|+||++..+-+--. .+.... ...| T Consensus 156 ~~~~~~~~~~~~~~~-----~~~~~~------------~~~~-------------------------------------- 180 (415) T protein:vir:81 156 TVKRVTNGSGKYPVV-----RQSEVA------------ALEK-------------------------------------- 180 (415) T ss_pred eeeeccCCceeEEEE-----eecCCc------------ccee-------------------------------------- Confidence 999998877643221 110000 0000 Q ss_pred eeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccch Q lcl|NC_014036. 193 QNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQ 271 (522) Q Consensus 193 ~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAE 271 (522) + +|. ...++.+ -++++++...+..+-... T Consensus 181 ---------------------------------v--------~E~---------~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:81 181 ---------------------------------V--------EEL---------EENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ---------------------------------e--------ccc---------cccCcccccceeeEEeeeeeeEeeeh Confidence 0 000 0011111 134444444455555567 Q ss_pred hhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHH Q lcl|NC_014036. 272 YSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGES 351 (522) Q Consensus 272 YT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~ 351 (522) +|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+......... . ...... ...| +. T Consensus 211 iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~-~--~~~~~~-----~~~~--~~ 276 (415) T protein:vir:81 211 ISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG-K--KLEVKK-----AKSL--DD 276 (415) T ss_pred hhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc-c--cccccc-----ccch--hH Confidence 999999984 35779999999999999999999995432221111111000000 0 000000 0011 11 Q ss_pred HHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCC Q lcl|NC_014036. 352 YKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYA 431 (522) Q Consensus 352 ~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~ 431 (522) ...++.. +.. .+-+.+.+||+|.....|..+- +.... -....+.+ ....++|.| ++|++.++. T Consensus 277 i~~~~~~-------~~~--~~~~~~~~v~n~~~~~~l~~lk--d~~G~----~l~~~~~~-~~~~~~l~G-~pV~~~~~~ 339 (415) T protein:vir:81 277 IKDAINL-------NVK--PNYEHNVAIVSQTMFAKLDKMK--DKLGN----YLIQPDVK-EKTQQRLLG-AKIEILPDE 339 (415) T ss_pred HHHHHHh-------hhh--hccCCCEEEEcHHHHHHHHHhh--ccCCc----eeeccCcC-CCCCceecc-eeeEEeccc Confidence 2233322 221 1225578899999988887541 11100 00011111 112357777 688887765 Q ss_pred ccceEEEEEecCCCccceeEeec----ccc----cccccccCCccccceeeeeeeecce-ecCccccccCCccccccCcc Q lcl|NC_014036. 432 RGDYFTVGYKGDNEMDAGIYYAP----YVA----LTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGM 502 (522) Q Consensus 432 ~~dy~~vG~KG~~~~d~glfyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~ 502 (522) +.. -.|+ ..++|+- |+- ...+...|-.+++..+....|++.. .+|=+...-..... .--.+ T Consensus 340 ~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~-~~~~~ 409 (415) T protein:vir:81 340 VLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS-ERGEG 409 (415) T ss_pred ccC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEecc-CCCCC Confidence 531 1111 1122221 111 1112234556777788888898764 45522211110000 01122 Q ss_pred hHHhhc Q lcl|NC_014036. 503 ITKEMF 508 (522) Q Consensus 503 ~~~~~~ 508 (522) +...-+ T Consensus 410 ~~~~~~ 415 (415) T protein:vir:81 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 222212 No 23 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.59 E-value=0.0018 Score=35.64 Aligned_cols=357 Identities=15% Similarity=0.111 Sum_probs=143.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhH------------------HHHHHhhhHHH--Hhhhhh------------ Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQ------------------LIAAIMEAQEK--DAEVDP------------ 48 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~------------------~~~~~~enq~~--~~~~~~------------ 48 (522) |...++|.++=.-+.+. +.+++..-++. +-++|-+.|++ .+.+.. T Consensus 1 mk~~~el~~~l~el~~~---~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQ---IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 88777777776666542 11111111111 11111111110 000000 Q ss_pred ----hhcchhhhhhhccccccccccc----------cccccccccccccccccccccCcchh--hHHHHHHhhhhhhhce Q lcl|NC_014036. 49 ----VYRDEKIVESFGGFLAEAEIAG----------DHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDIC 112 (522) Q Consensus 49 ----~~~~~~~~~~~~~~l~ea~~~~----------~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~ 112 (522) ..++..-...+...+.+.+... ..+.+.....-.+..|. ..-|.-+ .+++++..+.+-.+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:98 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred chhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhhe Confidence 0000000000000000000000 00000000000111111 1124433 4556666778889999 Q ss_pred eeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceee Q lcl|NC_014036. 113 GVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFL 192 (522) Q Consensus 113 GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~ 192 (522) .|+||++..+-+--. .+.... ...| T Consensus 156 ~~~~~~~~~~~~~~~-----~~~~~~------------~~~~-------------------------------------- 180 (415) T protein:vir:98 156 TVKRVTNGSGKYPVV-----RQSEVA------------ALEK-------------------------------------- 180 (415) T ss_pred eeeeccCCceeEEEE-----eecCCc------------ccee-------------------------------------- Confidence 999998877643221 110000 0000 Q ss_pred eeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccch Q lcl|NC_014036. 193 QNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQ 271 (522) Q Consensus 193 ~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAE 271 (522) + +|. ...++.+ -++++++...+..+-... T Consensus 181 ---------------------------------v--------~E~---------~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:98 181 ---------------------------------V--------EEL---------EENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ---------------------------------e--------ccc---------cccCcccccceeeEEeeeeeeEeeeh Confidence 0 000 0011111 134444444455555567 Q ss_pred hhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHH Q lcl|NC_014036. 272 YSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGES 351 (522) Q Consensus 272 YT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~ 351 (522) +|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+......... . ...... ...| +. T Consensus 211 iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~-~--~~~~~~-----~~~~--~~ 276 (415) T protein:vir:98 211 ISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG-K--KLEVKK-----AKSL--DD 276 (415) T ss_pred hhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc-c--cccccc-----ccch--hH Confidence 999999984 35779999999999999999999995432221111111000000 0 000000 0011 11 Q ss_pred HHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCC Q lcl|NC_014036. 352 YKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYA 431 (522) Q Consensus 352 ~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~ 431 (522) ...++.. +.. .+-+.+.+||+|.....|..+- +.... -....+.+ ....++|.| ++|++.++. T Consensus 277 i~~~~~~-------~~~--~~~~~~~~v~n~~~~~~l~~lk--d~~G~----~l~~~~~~-~~~~~~l~G-~pV~~~~~~ 339 (415) T protein:vir:98 277 IKDAINL-------NVK--PNYEHNVAIVSQTMFAKLDKMK--DKLGN----YLIQPDVK-EKTQQRLLG-AKIEILPDE 339 (415) T ss_pred HHHHHHh-------hhh--hccCCCEEEEcHHHHHHHHHhh--ccCCc----eeeccCcC-CCCCceecc-eeeEEeccc Confidence 2233322 221 1225578899999988887541 11100 00011111 112357777 688887765 Q ss_pred ccceEEEEEecCCCccceeEeec----ccc----cccccccCCccccceeeeeeeecce-ecCccccccCCccccccCcc Q lcl|NC_014036. 432 RGDYFTVGYKGDNEMDAGIYYAP----YVA----LTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGM 502 (522) Q Consensus 432 ~~dy~~vG~KG~~~~d~glfyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~ 502 (522) +.. -.|+ ..++|+- |+- ...+...|-.+++..+....|++.. .+|=+...-..... .--.+ T Consensus 340 ~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~-~~~~~ 409 (415) T protein:vir:98 340 VLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS-ERGEG 409 (415) T ss_pred ccC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEecc-CCCCC Confidence 531 1111 1122221 111 1112234556777788888898764 45522211110000 01122 Q ss_pred hHHhhc Q lcl|NC_014036. 503 ITKEMF 508 (522) Q Consensus 503 ~~~~~~ 508 (522) +...-+ T Consensus 410 ~~~~~~ 415 (415) T protein:vir:98 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 222212 No 24 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.59 E-value=0.0018 Score=35.64 Aligned_cols=357 Identities=15% Similarity=0.111 Sum_probs=143.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhH------------------HHHHHhhhHHH--Hhhhhh------------ Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQ------------------LIAAIMEAQEK--DAEVDP------------ 48 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~------------------~~~~~~enq~~--~~~~~~------------ 48 (522) |...++|.++=.-+.+. +.+++..-++. +-++|-+.|++ .+.+.. T Consensus 1 mk~~~el~~~l~el~~~---~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQ---IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 88777777776666542 11111111111 11111111110 000000 Q ss_pred ----hhcchhhhhhhccccccccccc----------cccccccccccccccccccccCcchh--hHHHHHHhhhhhhhce Q lcl|NC_014036. 49 ----VYRDEKIVESFGGFLAEAEIAG----------DHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDIC 112 (522) Q Consensus 49 ----~~~~~~~~~~~~~~l~ea~~~~----------~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~ 112 (522) ..++..-...+...+.+.+... ..+.+.....-.+..|. ..-|.-+ .+++++..+.+-.+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:79 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred chhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhhe Confidence 0000000000000000000000 00000000000111111 1124433 4556666778889999 Q ss_pred eeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceee Q lcl|NC_014036. 113 GVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFL 192 (522) Q Consensus 113 GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~ 192 (522) .|+||++..+-+--. .+.... ...| T Consensus 156 ~~~~~~~~~~~~~~~-----~~~~~~------------~~~~-------------------------------------- 180 (415) T protein:vir:79 156 TVKRVTNGSGKYPVV-----RQSEVA------------ALEK-------------------------------------- 180 (415) T ss_pred eeeeccCCceeEEEE-----eecCCc------------ccee-------------------------------------- Confidence 999998877643221 110000 0000 Q ss_pred eeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccch Q lcl|NC_014036. 193 QNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQ 271 (522) Q Consensus 193 ~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAE 271 (522) + +|. ...++.+ -++++++...+..+-... T Consensus 181 ---------------------------------v--------~E~---------~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:79 181 ---------------------------------V--------EEL---------EENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ---------------------------------e--------ccc---------cccCcccccceeeEEeeeeeeEeeeh Confidence 0 000 0011111 134444444455555567 Q ss_pred hhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHH Q lcl|NC_014036. 272 YSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGES 351 (522) Q Consensus 272 YT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~ 351 (522) +|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+......... . ...... ...| +. T Consensus 211 iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~-~--~~~~~~-----~~~~--~~ 276 (415) T protein:vir:79 211 ISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG-K--KLEVKK-----AKSL--DD 276 (415) T ss_pred hhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc-c--cccccc-----ccch--hH Confidence 999999984 35779999999999999999999995432221111111000000 0 000000 0011 11 Q ss_pred HHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCC Q lcl|NC_014036. 352 YKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYA 431 (522) Q Consensus 352 ~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~ 431 (522) ...++.. +.. .+-+.+.+||+|.....|..+- +.... -....+.+ ....++|.| ++|++.++. T Consensus 277 i~~~~~~-------~~~--~~~~~~~~v~n~~~~~~l~~lk--d~~G~----~l~~~~~~-~~~~~~l~G-~pV~~~~~~ 339 (415) T protein:vir:79 277 IKDAINL-------NVK--PNYEHNVAIVSQTMFAKLDKMK--DKLGN----YLIQPDVK-EKTQQRLLG-AKIEILPDE 339 (415) T ss_pred HHHHHHh-------hhh--hccCCCEEEEcHHHHHHHHHhh--ccCCc----eeeccCcC-CCCCceecc-eeeEEeccc Confidence 2233322 221 1225578899999988887541 11100 00011111 112357777 688887765 Q ss_pred ccceEEEEEecCCCccceeEeec----ccc----cccccccCCccccceeeeeeeecce-ecCccccccCCccccccCcc Q lcl|NC_014036. 432 RGDYFTVGYKGDNEMDAGIYYAP----YVA----LTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGM 502 (522) Q Consensus 432 ~~dy~~vG~KG~~~~d~glfyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~ 502 (522) +.. -.|+ ..++|+- |+- ...+...|-.+++..+....|++.. .+|=+...-..... .--.+ T Consensus 340 ~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~-~~~~~ 409 (415) T protein:vir:79 340 VLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS-ERGEG 409 (415) T ss_pred ccC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEecc-CCCCC Confidence 531 1111 1122221 111 1112234556777788888898764 45522211110000 01122 Q ss_pred hHHhhc Q lcl|NC_014036. 503 ITKEMF 508 (522) Q Consensus 503 ~~~~~~ 508 (522) +...-+ T Consensus 410 ~~~~~~ 415 (415) T protein:vir:79 410 DLGLEA 415 (415) T ss_pred ccccCC Confidence 222212 No 25 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=95.16 E-value=0.0026 Score=34.73 Aligned_cols=363 Identities=15% Similarity=0.096 Sum_probs=136.0 Q ss_pred CcchHHHHHhhhhhhccccc-hhhhcch-------hhhH-------HHHHHhhhHH--HHhhhhh--------------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG-LPDIATK-------SKKQ-------LIAAIMEAQE--KDAEVDP--------------- 48 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-~~~i~~~-------~~~~-------~~~~~~enq~--~~~~~~~--------------- 48 (522) |-..++|.++=.-+++..+. .-+..+. -.+. +-++|=+.|+ +.+.+.. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 54444444333332221000 0000000 0011 1111111110 0011000 Q ss_pred -hhcchhhhhhhcccccc-----ccccc-----cccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeec Q lcl|NC_014036. 49 -VYRDEKIVESFGGFLAE-----AEIAG-----DHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQ 115 (522) Q Consensus 49 -~~~~~~~~~~~~~~l~e-----a~~~~-----~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQ 115 (522) ......-...+...+.+ .+... -.+.+.......+.+|... -|.-+ .+++.+-+..+-.+++.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~--iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVV--IPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhcccccccccc--CcHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000001111111 11000 0000111001111222221 24222 4566666788889999999 Q ss_pred cCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeec Q lcl|NC_014036. 116 PMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNV 195 (522) Q Consensus 116 PmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~ 195 (522) ||++.++-+--. +..... ...|- T Consensus 159 ~~~~~~~~~~~~--~~~~~~---------------~~~~v---------------------------------------- 181 (415) T protein:vir:94 159 RVTNGSGKYPVV--RQSEVA---------------ALEKV---------------------------------------- 181 (415) T ss_pred eccCCceeEEEE--eecCCc---------------cceec---------------------------------------- Confidence 998765432211 110000 00000 Q ss_pred cccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhH Q lcl|NC_014036. 196 SGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSV 274 (522) Q Consensus 196 ~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ 274 (522) +|. ...++.+ -++++++...|..+-.-.+|- T Consensus 182 ---------------------------------------~Eg---------~~~~~~~~~~~~~i~~~~~k~~~~~~is~ 213 (415) T protein:vir:94 182 ---------------------------------------EEL---------EENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred ---------------------------------------ccc---------ccccccccccceeeEeeheeeeeechhhH Confidence 000 0111221 123444444444444556999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHH Q lcl|NC_014036. 275 ELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKA 354 (522) Q Consensus 275 ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~ 354 (522) ||.+|-- .|.+++|.+-|...|..-+|+.||.-.-...-.+....... . .......... ..+.... T Consensus 214 ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~-~--~~~~~~~~~~-------~~~~i~~ 279 (415) T protein:vir:94 214 EAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEK-E--GKKLEVKKAK-------SLDDIKD 279 (415) T ss_pred HHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccc-c--cccccccccc-------chHHHHH Confidence 9999864 46799999999999999999999954332221111110000 0 0000000000 0112223 Q ss_pred HHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc- Q lcl|NC_014036. 355 LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG- 433 (522) Q Consensus 355 L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~- 433 (522) ++..+ .. ..+ +.+.+|++|.....|..+- +.... -....+.+. ...++|.| ++|++.+..+. T Consensus 280 ~~~~~-------~~-~~~-~~~~~vmn~~~~~~l~~lk--d~~G~----~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~ 342 (415) T protein:vir:94 280 AINLN-------VK-PNY-EHNVAIVSQTMFAKLDKMK--DKLGN----YLIQPDVKE-KTQQRLLG-AKIEILPDEVLG 342 (415) T ss_pred HHHhh-------hh-hcc-CCCEEEEcHHHHHHHHHhh--ccCCC----eeeccCcCC-CCCceecc-eeeEEecccccC Confidence 33222 21 222 5678899999999887541 11100 000111111 12356777 58888776553 Q ss_pred ---ce-EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecce-ecCccccccCCccccccCcchHHhhc Q lcl|NC_014036. 434 ---DY-FTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEMF 508 (522) Q Consensus 434 ---dy-~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~ 508 (522) +. +++|--.. . +..... ....+...|-.+++-.+-...|+++. .+|=+...-.-.. -..-.++...-+ T Consensus 343 ~~~~~~i~~gd~~~----~-~~~~~~-~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~-~~~~~~~~~~~~ 415 (415) T protein:vir:94 343 QKGNNTLIIGNLKD----A-IVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDD-SERGEGDLGLEA 415 (415) T ss_pred CCCccEEEEEehhc----c-EEEEee-cceEEEEeccccCceEEEEEEEeccEEeccccEEEEEEec-cCCCCCccccCC Confidence 11 23331000 0 000000 11122234555667677777888764 3552221111000 001112222211 No 26 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=94.46 E-value=0.0044 Score=33.53 Aligned_cols=334 Identities=14% Similarity=0.112 Sum_probs=135.8 Q ss_pred CcchHHHHHhhhhhhccccchhhhcch----------hhhHHHHHHhh------hHHHHhhhhhhhc------------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATK----------SKKQLIAAIME------AQEKDAEVDPVYR------------- 51 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~----------~~~~~~~~~~e------nq~~~~~~~~~~~------------- 51 (522) |.+.++|.+.|.-+.+.-+.+-+..+. --+++.+.|-+ -+++.+.+.+.-. T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 999999999998887642211110000 00112222211 1111111111000 Q ss_pred -ch-----hhhhhhcccccccccccccccccccccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 52 -DE-----KIVESFGGFLAEAEIAGDHGYDATKIASGN-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 52 -~~-----~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t-~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) .. .....+..+|... ... .......++ +.|.+. . |.-+ .+++.+-++..-.+++.|+||++.+| T Consensus 81 ~~~~~~~~~~~~~~~~~l~~~----~~~-~~~~~~~~t~~~gg~~-i-P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNLVRGR----YQN-LLDSKTDGSGSDAGLT-I-PQDIRTAINTLVRQFDSLQEYVNVENVTTLTG 153 (397) T ss_pred chhhHHHHHHHHHHHHHhhcc----hhh-HHHhhhccCCccCcce-e-cHHHHHHHHHHHHhhhhHhhhcceeeccCCcc Confidence 00 0000111111100 000 000011111 111111 1 3222 35555566777888999999998775 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) -+- |...... .+.+.|- T Consensus 154 ~~~-----~~~~~~~-----------~~~a~~v----------------------------------------------- 170 (397) T protein:vir:49 154 SRV-----YEKWADI-----------TGLAKLD----------------------------------------------- 170 (397) T ss_pred eEE-----EEeeccC-----------Ccceeee----------------------------------------------- Confidence 421 1111000 0000000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDLR 281 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLK 281 (522) +|. ..+++-. -+++.++..++.-+-...+|-||.+|-. T Consensus 171 --------------------------------~E~---------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~ 209 (397) T protein:vir:49 171 --------------------------------DEG---------GQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA 209 (397) T ss_pred --------------------------------ccc---------cccccccccceeeeEeeeeeeEeehhhHHHHHhhhh Confidence 010 0112222 1344445555555556789999999853 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHH Q lcl|NC_014036. 282 AVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDK 361 (522) Q Consensus 282 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~ 361 (522) +|.+++|.+-|+..|..-+|+.||.= ... .....++++++ -...|+..+. T Consensus 210 ----~~l~~~i~~~l~~~~~~~~d~ail~G---~g~---------~~~~~~~~~~d-------------~i~~~~~~l~- 259 (397) T protein:vir:49 210 ----ENILAWLSGWIAKKVVVTRNKAILEA---IGT---------LPNKPTLAKWD-------------DIIDLQAKVD- 259 (397) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHhc---ccc---------ccccccccCHH-------------HHHHHHHhhh- Confidence 56799999999999999999999821 111 11122333222 1233333332 Q ss_pred HHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEe--cCCCcc------ Q lcl|NC_014036. 362 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYI--DQYARG------ 433 (522) Q Consensus 362 ~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~------ 433 (522) +.+.....+|++|.....|..+- +.... -....+.+. ...++|.| ++|++ |...+. T Consensus 260 --------~~~~~~a~~v~n~~~~~~l~~lk--d~~g~----~l~~~~~~~-g~~~~l~G-~pV~~~~~~~~~~~~~~~~ 323 (397) T protein:vir:49 260 --------PAIKQTSLFLTNTSGFTALKKVK--NAMGD----YLMERDVKS-PTGYSIDG-FVVKEISDRFLPNGTGGAM 323 (397) T ss_pred --------hhhcCCCEEEEcHHHHHHHHHhh--ccCCc----eeecccccC-CCCceecc-eeeEEecccccccccCCce Confidence 22335578899999999887551 11100 000011111 11246877 47765 222221 Q ss_pred --------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccee-cC--c-----cccccCCcccc Q lcl|NC_014036. 434 --------DYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NP--F-----ANSRSQAPSDR 497 (522) Q Consensus 434 --------dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP--~-----~~~~~~~~~~~ 497 (522) +|++++..+.-. +-..||.. .+-...+-.+-...|++..+ +| | +...++.+... T Consensus 324 ~~~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~ 393 (397) T protein:vir:49 324 PLYFGDLKQAVTLFDRQHLS----LLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAKLS 393 (397) T ss_pred eEEEeeccceEEEEeecccE----EEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEEecccccccCccc Confidence 122222222111 11222211 01123333444445555432 33 1 11222222111 Q ss_pred ccCcc Q lcl|NC_014036. 498 ITSGM 502 (522) Q Consensus 498 i~~g~ 502 (522) ..|. T Consensus 394 -~~~~ 397 (397) T protein:vir:49 394 -TAGA 397 (397) T ss_pred -ccCC Confidence 1111 No 27 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=94.44 E-value=0.0044 Score=33.50 Aligned_cols=275 Identities=12% Similarity=0.054 Sum_probs=130.6 Q ss_pred ccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccc Q lcl|NC_014036. 72 HGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFS 149 (522) Q Consensus 72 ~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~E 149 (522) .|+++-....+. ++.. .. |.-+ .+++++..+.+-.+++-+-||++.+--+ ...+. . T Consensus 1 ~g~~a~~~~~~~-~~~~-~i-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-----~~~~~--~------------ 58 (299) T protein:vir:41 1 MGFNPDTTTMQS-AKTG-SI-PINISEQIITGVKNGSAAMKLAKAVPMTKPEEEF-----TFMSG--V------------ 58 (299) T ss_pred CCcCCCcccccC-CCce-ec-chhHHHHHHHHHHhcchhhhhceeeecCCCcEEE-----EEEcC--C------------ Confidence 455544322211 1111 11 3333 6677778888899999999998765211 11000 0 Q ss_pred ccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccccccccccccccccc Q lcl|NC_014036. 150 PDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYG 229 (522) Q Consensus 150 adt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~G 229 (522) .+. T Consensus 59 -~a~---------------------------------------------------------------------------- 61 (299) T protein:vir:41 59 -GAF---------------------------------------------------------------------------- 61 (299) T ss_pred -cee---------------------------------------------------------------------------- Confidence 000 Q ss_pred ccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_014036. 230 MATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIV 309 (522) Q Consensus 230 m~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii 309 (522) ..+| +..+++...++++++...|..+-...+|-||.+|-. .|.+++|.+.|...|...+|+.|| T Consensus 62 ---~v~E---------~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l 125 (299) T protein:vir:41 62 ---WVDE---------AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVF 125 (299) T ss_pred ---eeec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHh Confidence 0011 112345555667888888888888999999999854 456999999999999999999998 Q ss_pred hhhhhccccccccccccccccceeecccc-ccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHH Q lcl|NC_014036. 310 DMINYTAQVGKTGFTQTVGSKAGAFDFQD-PIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 388 (522) Q Consensus 310 ~~i~~~a~~~~~~~~~~~~~~~g~fd~~~-~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L 388 (522) .= .....- .|++.-.. .... .......+.-|+++.+.+.. .+.+++.+||+|+....| T Consensus 126 ~G---~g~~~~----------~gil~~~~~~~~~------~~~~~~~~~~l~~~~~~l~~--~~~~~~~~v~n~~~~~~L 184 (299) T protein:vir:41 126 TG---VESPYN----------WNILKSATDASNL------VEETANKYDDLNEAIGLIEA--EDLEPNGIATIRKQRVKY 184 (299) T ss_pred hc---ccCccc----------cccccccccccee------eccccccHHHHHHHHHhhhc--ccCCcCEEEEcHHHHHHH Confidence 31 111000 01110000 0000 00000112223444444443 233567899999999998 Q ss_pred hhhcccccccccccccccccccccceeEEEecCceEEEecCCCccce----EEEEEecCCCccceeEeeccccccc---- Q lcl|NC_014036. 389 ARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDY----FTVGYKGDNEMDAGIYYAPYVALTP---- 460 (522) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----~~vG~KG~~~~d~glfyaPYv~~~~---- 460 (522) ...- +.. + .-....+.+.. .++|.| ++|++.++.+.+= +++|--- ..++..+-.... T Consensus 185 ~~lk--d~~-G---~~l~~~~~~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gdfs------~~~i~~~~~~~i~~~~ 249 (299) T protein:vir:41 185 RSTK--DGN-G---MPIFNTATSNG--VDDVLG-LPIAYTPKYTFGDKDISELVGDWN------QAYYGILRGVEYEILT 249 (299) T ss_pred HHhh--ccC-C---ceeecCCcCCC--Cceecc-eeeEEecccCCCCCceEEEEEecc------cEEEEEecCcEEEEee Confidence 8541 111 0 01111111111 246776 7999888877541 2222110 011111111000 Q ss_pred ----ccccCCcc-----ccc-eeee--eeeeccee-cCccccccCCcccc Q lcl|NC_014036. 461 ----LRGSDPKN-----FQP-VMGF--KTRYGVGI-NPFANSRSQAPSDR 497 (522) Q Consensus 461 ----~~~~Dp~s-----~qP-~~~~--~tRY~l~~-nP~~~~~~~~~~~~ 497 (522) ....|++. ||- .+.| ..|++..+ ||=+...-....+. T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 250 EATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred cccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 01122221 222 2333 35777654 45222221111111 No 28 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=92.17 E-value=0.013 Score=31.01 Aligned_cols=333 Identities=13% Similarity=0.126 Sum_probs=126.8 Q ss_pred CcchHHHHHhhhhhhccccchhhhcc-------------hhhhHHHHHHhhhHHH---------Hhhhhhhhcch----- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIAT-------------KSKKQLIAAIMEAQEK---------DAEVDPVYRDE----- 53 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~-------------~~~~~~~~~~~enq~~---------~~~~~~~~~~~----- 53 (522) |.+.++|.+.|.-+=+. +-++.. .-.+++-+.|-+.+++ ..+..+..... T Consensus 1 Mk~~~el~~~~~~~~~~---i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDK---VENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKK 77 (397) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccc Confidence 99999987777655221 111110 0001111122111111 00000000000 Q ss_pred -----------hhhhhhcccccccccccccccccccccccc-ccccccccCcchh-hHHHHHHhhhhhhhceeeccCCch Q lcl|NC_014036. 54 -----------KIVESFGGFLAEAEIAGDHGYDATKIASGN-SSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGP 120 (522) Q Consensus 54 -----------~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t-~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGP 120 (522) .....+..++.+.. .-....+ ..++ +.|.+. .-+.+. .+++.+-++..-.+++.++||+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~-~~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 151 (397) T protein:vir:48 78 PLTKSEEEVKAGFVKDFKNLVRGRY----QNLLDSK-TDASGSDAGLT-IPQDIQTAIHTLVRQYDSLQEYVNVENVTTL 151 (397) T ss_pred cccchhhHHHHHHHHHHHHHHhhhh----hHHHHHh-hccCCcccccc-ccHHHHHHHHHHHHHHHHHHhhhceeeccCC Confidence 00001111111110 0000001 1111 112111 111121 344444556677889999999998 Q ss_pred hhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccc Q lcl|NC_014036. 121 TGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPV 200 (522) Q Consensus 121 TGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~ 200 (522) +|-+--.+ .... .+.+.|- T Consensus 152 ~~~~~~~~--~~~~--------------~~~a~~v--------------------------------------------- 170 (397) T protein:vir:48 152 TGSRVYEK--WADI--------------TGLAKLD--------------------------------------------- 170 (397) T ss_pred cceEEEEe--ecCC--------------Ccceeee--------------------------------------------- Confidence 87543221 1000 0000000 Q ss_pred ccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHH Q lcl|NC_014036. 201 TVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDL 280 (522) Q Consensus 201 ~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDL 280 (522) +.| +. ...+....|.+..|++.|. +-...+|-||.+|- T Consensus 171 --------------------------~E~------~~---~~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds 208 (397) T protein:vir:48 171 --------------------------DEA------GS---IGTNDDPKLYPIRYAIKRY-------AGISTVTNSLLADS 208 (397) T ss_pred --------------------------ccc------cc---cccccccceeeEEeeheee-------eeehhhHHHHHhhc Confidence 000 00 0001112344555555544 44568999999984 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHH Q lcl|NC_014036. 281 RAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQID 360 (522) Q Consensus 281 KAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~ 360 (522) . .|.+++|.+-|+..|..-+|+.||.-. ..- ....++.++ +-...++..+ T Consensus 209 ~----~~l~~~v~~~l~~~~~~~~d~~il~G~---g~~---------~~~~~~~~~-------------d~i~~~~~~l- 258 (397) T protein:vir:48 209 A----ENILAWLSGWIAKKVVVTRNKAILEAI---ATL---------PTKPTLTKW-------------DDIIDLQAKV- 258 (397) T ss_pred h----HHHHHHHHHHHHHHHHHHHHHHHhhcc---ccc---------ccccccccH-------------HHHHHHHHHh- Confidence 3 577999999999999999999998321 110 011122221 1123333333 Q ss_pred HHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecC--CCcc----- Q lcl|NC_014036. 361 KEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQ--YARG----- 433 (522) Q Consensus 361 ~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~~----- 433 (522) ... +..+..+||+|.....|..+- +.... -....+.+. ..-++|.| ++|++-. ..+. T Consensus 259 ------~~~--~~~~a~~v~n~~~~~~L~~lk--d~~G~----~i~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~~~~~~ 322 (397) T protein:vir:48 259 ------DPA--IKQTSFFLTNTSGFTALKKVK--NAFGD----YLMERDVKS-PTGYSIDG-FAVKEVADRWLANASSGA 322 (397) T ss_pred ------hhh--hcCCCEEEECHHHHHHHHHhh--cCCCc----eeeccCcCC-CCCceecc-ceeEEecccccCCcCCCc Confidence 222 224578899999999997541 11100 000111111 11246777 5776521 2111 Q ss_pred ---------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecce-ecC--c-----cccccCCccc Q lcl|NC_014036. 434 ---------DYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INP--F-----ANSRSQAPSD 496 (522) Q Consensus 434 ---------dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~-----~~~~~~~~~~ 496 (522) +|++++..+.-...- .++.. .+-.+.+=.+-...||+.. .|| | +...++.+. T Consensus 323 ~~~~~gd~~~~~~~~~~~~~~i~~----~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~- 391 (397) T protein:vir:48 323 MPLYFGDLKQAVTLFDRQQMSLLS----TNIGG------GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGN- 391 (397) T ss_pred eEEEEEeccceEEEEeecceEEEE----eccch------hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCC- Confidence 233333332221111 11100 0111222233333333322 122 1 111111000 Q ss_pred cccCcchHHhhccc Q lcl|NC_014036. 497 RITSGMITKEMFGK 510 (522) Q Consensus 497 ~i~~g~~~~~~~~~ 510 (522) .+.- +- T Consensus 392 ---~~~~-----~~ 397 (397) T protein:vir:48 392 ---LGST-----AV 397 (397) T ss_pred ---cccc-----CC Confidence 0000 00 No 29 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=92.15 E-value=0.013 Score=31.00 Aligned_cols=352 Identities=10% Similarity=0.032 Sum_probs=129.7 Q ss_pred CcchHHHHHhhhhhhcc-------c-------cc-hhhhcchhhhHHHHH--HhhhHHHHhhhhhhh-------cc---- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLES-------Q-------EG-LPDIATKSKKQLIAA--IMEAQEKDAEVDPVY-------RD---- 52 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~-------~-------~~-~~~i~~~~~~~~~~~--~~enq~~~~~~~~~~-------~~---- 52 (522) +-+.++|.+...-+.+. . +. ..++.... ..+.+. -++.+..++.+...- .. T Consensus 21 ~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~-~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~ 99 (418) T protein:vir:10 21 EQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATV-DELLIKQGELQARLLEAEQKLARGGGSAELETPKTL 99 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Confidence 22222222222211110 0 00 00110000 000000 001111111110000 00 Q ss_pred ------hhhhhhhcccccccccc---ccccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 53 ------EKIVESFGGFLAEAEIA---GDHGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 53 ------~~~~~~~~~~l~ea~~~---~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) ..-...+..++.+.... ...-.+......+++++.-...-|.+. .+++.+.+..+..+++.+-||++++. T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 179 (418) T protein:vir:10 100 GQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSI 179 (418) T ss_pred hHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCce Confidence 00000011111100000 000000001111111111111222222 45555666777888888888877642 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) .|.-.... .+.+. T Consensus 180 -------~~~~~~~~-----------~~~a~------------------------------------------------- 192 (418) T protein:vir:10 180 -------EYTVETGF-----------TNNAA------------------------------------------------- 192 (418) T ss_pred -------eEEEEecC-----------CCcee------------------------------------------------- Confidence 11100000 00000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRA 282 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 282 (522) ..+|. ...++-..++++++..+|.-+-...+|-||.||.- T Consensus 193 ------------------------------~v~E~---------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~- 232 (418) T protein:vir:10 193 ------------------------------AVAEG---------AQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP- 232 (418) T ss_pred ------------------------------eeccC---------ccccccccceeeEEEeeeeEEEeehhhHHHHHhHH- Confidence 00110 01223334566777777777777889999999862 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccc--ccchhHHHHHHHHHHHHH Q lcl|NC_014036. 283 VHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDV--RGARWAGESYKALLIQID 360 (522) Q Consensus 283 iHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~--~~~r~~~E~~r~L~~~i~ 360 (522) |.++.|.+-|+..|..-+|+-||.=--.... . .|++........ .... ...+..|. T Consensus 233 ----~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~--p----------~Gi~~~~~~~~~~~~~~~------~~~~~~i~ 290 (418) T protein:vir:10 233 ----ALQSYIDGRARYGLQLTEEGQILKGDGTGAN--I----------LGILPQASAFMPSITLAN------ATPIDKIR 290 (418) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhccCCCCcc--c----------cccccccccccccccccc------cccHHHHH Confidence 5688999999999999999988821000000 0 122111100000 0000 01122222 Q ss_pred HHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEE Q lcl|NC_014036. 361 KEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGY 440 (522) Q Consensus 361 ~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 440 (522) .+...+. ..+...+.+||+|.....|..+- +.. +. ....+.+. ...|+|.| ++|+++++.|.+-+++|- T Consensus 291 ~~~~~~~--~~~~~~~~~v~n~~~~~~L~~lk--d~~-G~----~i~~~~~~-~~~~~l~G-~pV~~~~~~p~~~~~~gd 359 (418) T protein:vir:10 291 LALLQAV--LAEFPATGIVLNPIDWASIELTK--DSQ-GR----YIVGNPVN-GTTPRLWN-LPVVETQAMTANEFLVGA 359 (418) T ss_pred HHHHhhc--cccCCCCEEEEcHHHHHHHHHhh--cCC-Cc----eecccccc-CCCceecc-eeeEEcCCCCCCcEEEee Confidence 3323332 23446678999999998887441 111 10 01111111 11357777 799999998865555552 Q ss_pred ecCC-----CccceeEeecccccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhcc Q lcl|NC_014036. 441 KGDN-----EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFG 509 (522) Q Consensus 441 KG~~-----~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~ 509 (522) --.. ..+-.+=..||... +-...+=.+=+..|++..+ +|=+. .-+......+| T Consensus 360 ~s~~~~~~~~~~~~i~~~~~~~~------~f~~~~~~~r~~~~~d~~~~~~~a~----------~~~~~~~~~~g 418 (418) T protein:vir:10 360 FSMAAQIFDRMEIEVLLSTENVD------DFEKNMVSIRAEERLALAVYRPESF----------VTGALVEQAGG 418 (418) T ss_pred ccceEEEEEecceEEEEecccch------hhhcCceEEEEEEeeccEEecccce----------EEEEeccCCCC Confidence 1100 00000111111110 0112222333455666543 34111 11111112222 No 30 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=92.05 E-value=0.013 Score=30.91 Aligned_cols=342 Identities=16% Similarity=0.125 Sum_probs=129.9 Q ss_pred CcchHHHHHhhhhhhccccchh-hhcch------hh---hHHHHHH---h------hhHHHHhhhhhh------------ Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLP-DIATK------SK---KQLIAAI---M------EAQEKDAEVDPV------------ 49 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~-~i~~~------~~---~~~~~~~---~------enq~~~~~~~~~------------ 49 (522) |-+.++|.++|..+.+.-+.+- ++... .. +.+.+.+ . ++|.++..+.+. T Consensus 4 ~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 2255778888887765322221 11110 00 1111111 1 111111110000 Q ss_pred -----hcchhhhhhhcccccccccccccccccccccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchh Q lcl|NC_014036. 50 -----YRDEKIVESFGGFLAEAEIAGDHGYDATKIASGN-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPT 121 (522) Q Consensus 50 -----~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t-~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPT 121 (522) .++.. ...|...+.-. ...-...+...+..++ ..|.+.- |.-+ .+++.+-++....+++.++||++.+ T Consensus 84 ~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~a~~~~~~~~gg~~v--P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) T protein:vir:74 84 KSENELKDKF-VKDFVNMVRNP-MAFLNTVSSKTETSGSDSAAGLTI--PQDIRTMINTLVRQYDSLQQYVRVESVSTSS 159 (408) T ss_pred chhhhhHHHH-HHHHHHHHhcc-hhhhhhhhhhhhcccccCCCceee--chhHhhHHHHHHhhhcchhhhcceeeccCCc Confidence 00000 00000000000 0000011111111111 1111111 2111 3444445666778999999999887 Q ss_pred hhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccc Q lcl|NC_014036. 122 GQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVT 201 (522) Q Consensus 122 GLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~ 201 (522) |-+--.+ ..... +... T Consensus 160 ~~~~~~~--~~~~~--------------~~~~------------------------------------------------ 175 (408) T protein:vir:74 160 GSRVYEK--WTDVT--------------PLKA------------------------------------------------ 175 (408) T ss_pred ceEEEEe--ecCCc--------------cccc------------------------------------------------ Confidence 6542211 00000 0000 Q ss_pred cCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHH Q lcl|NC_014036. 202 VTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDL 280 (522) Q Consensus 202 ~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDL 280 (522) ..+|. ...++.+ .+++++++..+..+-...+|-||.+|- T Consensus 176 -------------------------------~v~E~---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds 215 (408) T protein:vir:74 176 -------------------------------MDEED---------GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT 215 (408) T ss_pred -------------------------------ccccc---------cccccccccceeeEEeeeeeEEeeehhHHHHHhhc Confidence 00010 0122222 344555555555556667999999983 Q ss_pred HhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHH Q lcl|NC_014036. 281 RAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQID 360 (522) Q Consensus 281 KAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~ 360 (522) .+|.+++|.+-|+..|..-+|+.||. -.. +.....++.+++ .|...+ T Consensus 216 ----~~~l~~~i~~~l~~~~~~~~d~~il~---G~G---------~~~~~~~~~~~~----------------~i~~~~- 262 (408) T protein:vir:74 216 ----AENILAWLSSWIAKKVVVTRNQAIIA---AMG---------TVPKKPTIANFD----------------DVITMI- 262 (408) T ss_pred ----hHHHHHHHHHHHHHHHHHHHHHHHhh---ccc---------ccccccccccHH----------------HHHHHH- Confidence 45779999999999999999999883 111 111122333222 111111 Q ss_pred HHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCC--Ccc----c Q lcl|NC_014036. 361 KEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQY--ARG----D 434 (522) Q Consensus 361 ~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~----d 434 (522) ...+. ..+...-.+||+|.....|..+- + +.+ +-....+.+. ...++|.| ++||+-.+ .+. + T Consensus 263 --~~~l~--~~~~~~a~~v~n~~~~~~l~~lk--d-~~G---~~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~ 330 (408) T protein:vir:74 263 --NTSVD--PAIIATSSLLTNQSGLNKLALVK--T-AEG---KYLLEPDPTK-PNSYLIKG-KQVIVVADRWLPNSGSTV 330 (408) T ss_pred --HHhhh--hhhcCCCEEEEcHHHHHHHHHhh--c-CCC---ceEeccCcCC-CCCceecc-eeeEEecCcccccccCCc Confidence 11111 12223346889999999987541 1 111 0111111111 12357777 68876332 221 1 Q ss_pred e-EEEE---------EecCCCccceeEeecccccccccccCCccccceeeeeeeecce-ecCcc--c----cccCCcccc Q lcl|NC_014036. 435 Y-FTVG---------YKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INPFA--N----SRSQAPSDR 497 (522) Q Consensus 435 y-~~vG---------~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~--~----~~~~~~~~~ 497 (522) + +++| -++.. .+=..||.- .+-...+-.+-+..||+.. .+|=+ . .....+ T Consensus 331 ~~i~~gd~~~~~~~~~~~~~----~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~--- 397 (408) T protein:vir:74 331 YPLYYGDMSQAITLFDRENM----SLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQV--- 397 (408) T ss_pred ceEEEEehhccEEEEEecce----EEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEEeecccCCC--- Confidence 1 2222 11111 111122210 0112344444455555543 22210 0 000000 Q ss_pred ccCcchHHhhc Q lcl|NC_014036. 498 ITSGMITKEMF 508 (522) Q Consensus 498 i~~g~~~~~~~ 508 (522) --.+.....+. T Consensus 398 ~~~~~~~~~~~ 408 (408) T protein:vir:74 398 GNFKTTTSTAV 408 (408) T ss_pred CCCCCCccccC Confidence 00111111111 No 31 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=91.79 E-value=0.014 Score=30.71 Aligned_cols=267 Identities=13% Similarity=0.081 Sum_probs=117.8 Q ss_pred cccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccc Q lcl|NC_014036. 147 MFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEI 226 (522) Q Consensus 147 ~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~ 226 (522) |-...|.-+ ...++...+ ++..... .....+..+........+. .|....+ T Consensus 1 MA~~~T~~~------~~~iPev~s-------~~v~~~~--~~~~~~~~~~~~~~~~~g~--------------~G~tv~i 51 (272) T protein:vir:98 1 MAVGTTKMA------QMLDPEVLA-------DMIDAEV--GKAIRFAPLAEVDTTLEGQ--------------PGTTLTV 51 (272) T ss_pred CCCccccch------heechHHHH-------HHHHHHH--HHHhhhhccccccccccCC--------------CCCEEEE Confidence 100000000 000000000 0000000 0000000000000000000 0111111 Q ss_pred c----ccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|NC_014036. 227 S----YGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIML 302 (522) Q Consensus 227 g----~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEIml 302 (522) . .+-....+| +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|.. T Consensus 52 P~~~~~~~a~~v~e---------g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~ 118 (272) T protein:vir:98 52 PKWDYIGDAEDVAE---------GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDH 118 (272) T ss_pred EEecCCCCcccccC---------CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHH Confidence 1 111112222 1223344455777788888887666777666533 2578999999999999999 Q ss_pred HhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEch Q lcl|NC_014036. 303 EINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASR 382 (522) Q Consensus 303 EINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~ 382 (522) +|+.+|+..+...... . .+...+ +-+-.+..++.++ ....+++||+| T Consensus 119 ~~d~~i~~~~~~a~~~-~----------~~~~t~-------------d~i~da~~~l~~~---------~~~~~~~vv~p 165 (272) T protein:vir:98 119 KVDADVLDALSKSTQT-V----------EATATV-------------DGVSKALDIFNDE---------DDAETVIVMNP 165 (272) T ss_pred HHHHHHHHHhcccccc-c----------ccccCH-------------HHHHHHHHHHhcc---------CCCccEEEEcH Confidence 9999999765432211 1 011111 1122233333322 23568999999 Q ss_pred hHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccccccc Q lcl|NC_014036. 383 NVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLR 462 (522) Q Consensus 383 ~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~ 462 (522) .++..|.......+.... ..++ +......+|.+.| ++|+++++.+.+=+++.-+|.- +++-..-+... . T Consensus 166 ~~~~~L~k~~~~~~~~~~---~~~~-~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~~~ve--~ 234 (272) T protein:vir:98 166 ADASTLRLDAAKEWLGAT---EVGA-NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKRNTMVE--T 234 (272) T ss_pred HHHHHHHHhccccccccc---cccc-cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecCCceee--e Confidence 999998754322222110 0111 1111223678877 7999999998644444333311 11111111111 1 Q ss_pred ccCCccccceeeeeeeecce-ecCccccccCCccccccCcchHHhhcccc Q lcl|NC_014036. 463 GSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEMFGKN 511 (522) Q Consensus 463 ~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~~~~ 511 (522) .-|+.+++=.+-..-|||+. .||-.. .++. -..++|- T Consensus 235 ~r~~~~~~~~i~~~~~~~~~v~~~~~v-------v~~t-----~~~a~~~ 272 (272) T protein:vir:98 235 DRDITKAINQIVANKHYGVYLYKAEKA-------VKIT-----LKDAAKK 272 (272) T ss_pred ccccccceeEEEEEEEEEEEEEcCCce-------EEEE-----ecccccC Confidence 23788888888888899875 344110 0111 1122333 No 32 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=91.79 E-value=0.014 Score=30.71 Aligned_cols=267 Identities=13% Similarity=0.081 Sum_probs=117.8 Q ss_pred cccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccc Q lcl|NC_014036. 147 MFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEI 226 (522) Q Consensus 147 ~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~ 226 (522) |-...|.-+ ...++...+ ++..... .....+..+........+. .|....+ T Consensus 1 MA~~~T~~~------~~~iPev~s-------~~v~~~~--~~~~~~~~~~~~~~~~~g~--------------~G~tv~i 51 (272) T protein:vir:30 1 MAVGTTKMA------QMLDPEVLA-------DMIDAEV--GKAIRFAPLAEVDTTLEGQ--------------PGTTLTV 51 (272) T ss_pred CCCccccch------heechHHHH-------HHHHHHH--HHHhhhhccccccccccCC--------------CCCEEEE Confidence 100000000 000000000 0000000 0000000000000000000 0111111 Q ss_pred c----ccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|NC_014036. 227 S----YGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIML 302 (522) Q Consensus 227 g----~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEIml 302 (522) . .+-....+| +..++.=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|.. T Consensus 52 P~~~~~~~a~~v~e---------g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~ 118 (272) T protein:vir:30 52 PKWDYIGDAEDVAE---------GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDH 118 (272) T ss_pred EEecCCCCcccccC---------CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHH Confidence 1 111112222 1223344455777788888887666777666533 2578999999999999999 Q ss_pred HhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEch Q lcl|NC_014036. 303 EINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASR 382 (522) Q Consensus 303 EINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~ 382 (522) +|+.+|+..+...... . .+...+ +-+-.+..++.++ ....+++||+| T Consensus 119 ~~d~~i~~~~~~a~~~-~----------~~~~t~-------------d~i~da~~~l~~~---------~~~~~~~vv~p 165 (272) T protein:vir:30 119 KVDADVLDALSKSTQT-V----------EATATV-------------DGVSKALDIFNDE---------DDAETVIVMNP 165 (272) T ss_pred HHHHHHHHHhcccccc-c----------ccccCH-------------HHHHHHHHHHhcc---------CCCccEEEEcH Confidence 9999999765432211 1 011111 1122233333322 23568999999 Q ss_pred hHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccccccc Q lcl|NC_014036. 383 NVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLR 462 (522) Q Consensus 383 ~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~ 462 (522) .++..|.......+.... ..++ +......+|.+.| ++|+++++.+.+=+++.-+|.- +++-..-+... . T Consensus 166 ~~~~~L~k~~~~~~~~~~---~~~~-~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~~~ve--~ 234 (272) T protein:vir:30 166 ADASTLRLDAAKEWLGAT---EVGA-NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKRNTMVE--T 234 (272) T ss_pred HHHHHHHHhccccccccc---cccc-cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecCCceee--e Confidence 999998754322222110 0111 1111223678877 7999999998644444333311 11111111111 1 Q ss_pred ccCCccccceeeeeeeecce-ecCccccccCCccccccCcchHHhhcccc Q lcl|NC_014036. 463 GSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEMFGKN 511 (522) Q Consensus 463 ~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~~~~ 511 (522) .-|+.+++=.+-..-|||+. .||-.. .++. -..++|- T Consensus 235 ~r~~~~~~~~i~~~~~~~~~v~~~~~v-------v~~t-----~~~a~~~ 272 (272) T protein:vir:30 235 DRDITKAINQIVANKHYGVYLYKAEKA-------VKIT-----LKDAAKK 272 (272) T ss_pred ccccccceeEEEEEEEEEEEEEcCCce-------EEEE-----ecccccC Confidence 23788888888888899875 344110 0111 1122333 No 33 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=90.75 E-value=0.019 Score=29.99 Aligned_cols=283 Identities=13% Similarity=0.119 Sum_probs=127.3 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccc Q lcl|NC_014036. 79 IASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQ 157 (522) Q Consensus 79 ~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~ 157 (522) ++-. +++.+ -..|.+. .+++++.+..+..+++.+.||++.+.-|. ++... +.+. T Consensus 1 m~t~-t~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~~---------------~~a~---- 55 (303) T protein:vir:97 1 MGTE-TSKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTLD---------------SDID---- 55 (303) T ss_pred Cccc-CCCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEecC---------------cceE---- Confidence 2222 23332 2334444 66677777888999999999875433221 11100 0000 Q ss_pred ccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhh Q lcl|NC_014036. 158 GAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAEL 237 (522) Q Consensus 158 g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEa 237 (522) ..+|. T Consensus 56 ---------------------------------------------------------------------------wv~E~ 60 (303) T protein:vir:97 56 ---------------------------------------------------------------------------VVAEN 60 (303) T ss_pred ---------------------------------------------------------------------------EeecC Confidence 01110 Q ss_pred ccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccc Q lcl|NC_014036. 238 QEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQ 317 (522) Q Consensus 238 l~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~ 317 (522) ..+++-..+++.++..+|.-+-...+|-||.|.... ..++-+++|.+-|+..|...|+..+|.=...... T Consensus 61 ---------~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g 130 (303) T protein:vir:97 61 ---------GKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK 130 (303) T ss_pred ---------ccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc Confidence 112333334456666666666677899999863322 2466789999999999999999998832211111 Q ss_pred cccccccccccccceeecccc--ccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccc Q lcl|NC_014036. 318 VGKTGFTQTVGSKAGAFDFQD--PIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGI 395 (522) Q Consensus 318 ~~~~~~~~~~~~~~g~fd~~~--~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~ 395 (522) -+ ....|...+.. ..-+..+ ....++.-|.++-+.+.. ..+..+-+|++|.....|..+ .+ T Consensus 131 ~~--------~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~l--kd 193 (303) T protein:vir:97 131 KA--------SDVIGTNHFDSKVTQVVKFT-----ESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTALAKV--TN 193 (303) T ss_pred cc--------cccccccccccccccccccc-----cccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHh--hc Confidence 00 00011111100 0000000 011233344444444432 233556799999999888633 11 Q ss_pred cccccccccccccccccceeEEEecCceEEEecCCCccce-----EEEEEecCCCccceeEeeccc--ccccccccCCcc Q lcl|NC_014036. 396 TPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDY-----FTVGYKGDNEMDAGIYYAPYV--ALTPLRGSDPKN 468 (522) Q Consensus 396 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-----~~vG~KG~~~~d~glfyaPYv--~~~~~~~~Dp~s 468 (522) .... -....+.....-.|+|.| ++|+++.+-+... -.+.+-|+- ...+.+...- ++...+..|++. T Consensus 194 ~~g~----~~~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~Gdf--~~~~~~~~~~~~~~~~~~~~~~d~ 266 (303) T protein:vir:97 194 GEMG----PKMYPELAWGANPDSING-LKSSVNTTVGAGADEAESKDLVIIGDF--ESMFKWGYAKQIPMEIIKYGDPDN 266 (303) T ss_pred cCCC----eEEecCccCCCCCceecc-eeeEEecccCCccccCCCccEEEEeec--cccEEEEEecCcEEEEeeccCCCC Confidence 1110 001111111112357887 7999988765311 011122221 1111122111 111222223321 Q ss_pred -----ccc-eeee--eeeecc-eecCccc-cccCCccccc Q lcl|NC_014036. 469 -----FQP-VMGF--KTRYGV-GINPFAN-SRSQAPSDRI 498 (522) Q Consensus 469 -----~qP-~~~~--~tRY~l-~~nP~~~-~~~~~~~~~i 498 (522) |+- .++| ..||+. +.||=+. ..+++ +| T Consensus 267 ~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~---~~ 303 (303) T protein:vir:97 267 SGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKG---EV 303 (303) T ss_pred cchhhhhcCcEEEEEEEEeccEeecccceEEeeCC---CC Confidence 221 2444 567775 4566222 22222 23 No 34 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=90.69 E-value=0.02 Score=29.95 Aligned_cols=343 Identities=11% Similarity=0.018 Sum_probs=129.0 Q ss_pred Cc-----------chHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhh--------hhcchhhhhhhcc Q lcl|NC_014036. 1 MS-----------KKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDP--------VYRDEKIVESFGG 61 (522) Q Consensus 1 ~~-----------~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~--------~~~~~~~~~~~~~ 61 (522) |. ..++..+++.-+...- .++.... +.+- ..++.-++...... ......-...+-. T Consensus 19 ~~~~~e~~~~~~~~~~e~~~~~~~~~~e~---~~l~~~i-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (390) T protein:vir:10 19 LRAFGERAVRDGELNASARSKVDELFATV---GNLSAEV-QAAR-QRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAG 93 (390) T ss_pred HHHHHHHHHhhcccCHHHHHHHHHHHHHH---HHHHHHH-HHHH-HHHHHHHhhcccccccccchhhhhhhhHHHHHHHH Confidence 11 1122333444333211 1110000 0000 00111000000000 0000000000000 Q ss_pred cccccccccccccccccc----ccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCC Q lcl|NC_014036. 62 FLAEAEIAGDHGYDATKI----ASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPL 136 (522) Q Consensus 62 ~l~ea~~~~~~g~~~~~~----~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~ 136 (522) +..+..... ..+.... ..++++.+-.-.-|.++ .++.++-++..-.++|.+.||++++.-+. +..... T Consensus 94 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~----~~~~~~- 166 (390) T protein:vir:10 94 RWNDRSARA--TMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV----QETGFV- 166 (390) T ss_pred hhhhhhhhh--hhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEecCC- Confidence 000000000 0000000 00111111111223333 44555555666778889998876542111 000000 Q ss_pred CCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccc Q lcl|NC_014036. 137 ASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIA 216 (522) Q Consensus 137 ~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~ 216 (522) ..+. T Consensus 167 -------------~~a~--------------------------------------------------------------- 170 (390) T protein:vir:10 167 -------------NNAA--------------------------------------------------------------- 170 (390) T ss_pred -------------ccee--------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHH Q lcl|NC_014036. 217 EQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAIL 296 (522) Q Consensus 217 ~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNIL 296 (522) ..+| +...++-..+++++++.+|..+....+|-||.||-- |.++.|.+-| T Consensus 171 ----------------~v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~l 220 (390) T protein:vir:10 171 ----------------IVAE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRL 220 (390) T ss_pred ----------------eecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-----HHHHHHHHHH Confidence 0011 011334445667777777777888999999999852 4689999999 Q ss_pred HHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCc Q lcl|NC_014036. 297 ATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 376 (522) Q Consensus 297 STEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn 376 (522) +..|...||+.||.= +-. + ..-.|++...........- + ...++..+..+...+. ..+...+ T Consensus 221 ~~~~~~~~~~~il~G---~G~-~--------~~p~Gi~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~l~--~~~~~~~ 282 (390) T protein:vir:10 221 IRGLKVKEDAEILRG---TGA-N--------DGLLGLIPQATTYAAPTTI-A---GATRVDQLRLAMLQAS--LAEYPAS 282 (390) T ss_pred HHHHHHHHHHHHhhc---CCC-C--------ccccccccccccccccccc-c---ccchHHHHHHHHHhhc--cccCCCC Confidence 999999999999821 100 0 0012332221100000000 0 0111122222222222 2333667 Q ss_pred EEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccc Q lcl|NC_014036. 377 FIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYV 456 (522) Q Consensus 377 ~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv 456 (522) .+|++|.....|..+- +.... ..-.+... .-.++|.| ++|++++..|.+-+++|--- .+++.+.. T Consensus 283 ~~v~n~~~~~~L~~lk--d~~g~-----~l~~~~~~-~~~~~l~G-~pv~~~~~~p~~~~~~gdf~-----~~~~~~~~- 347 (390) T protein:vir:10 283 GIVINPIDWAAIELAK--DANNQ-----YLIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQ- 347 (390) T ss_pred EEEEcHHHHHHHHHhh--cCCCc-----eeecCCcC-cCCceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEe- Confidence 8999999988887432 11100 00001100 01245766 69999999887655555210 11111111 Q ss_pred ccccccccC----Cccccceeeeeeeecce-ecCccccccCCccccccCc Q lcl|NC_014036. 457 ALTPLRGSD----PKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSG 501 (522) Q Consensus 457 ~~~~~~~~D----p~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g 501 (522) ....+...+ -.+-+=.+-...||+.. .+|=+.- .+.=+ T Consensus 348 ~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~-------~~~~a 390 (390) T protein:vir:10 348 WDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALI-------SGSFA 390 (390) T ss_pred cceEEEEeecccccccCcEEEEEEEeeccEEeccccEE-------EEEeC Confidence 111111111 11222233334566553 2331110 00000 No 35 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=89.45 E-value=0.026 Score=29.25 Aligned_cols=302 Identities=12% Similarity=0.059 Sum_probs=109.3 Q ss_pred hcccccccccccccccccccccccccccccccccceeecc----ccc---ccceeeeeccccccccCCCCcccccccccc Q lcl|NC_014036. 144 FHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHD----FVE---TGRVFLQNVSGAPVTVTGSTDDALDAAVIA 216 (522) Q Consensus 144 ~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~----~~~---~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~ 216 (522) +.-++|--+.-.|....+. .. ...+.+.... ... ...... ...+....++.. . .... T Consensus 1 ~a~l~el~~~~~~~~~~g~--~~-------~~~~~liP~~~~~~ii~~l~~~s~l~---~~~~~~~~~~~~--~--~~p~ 64 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGR--LA-------HVPSDLLPKEIVGPIFDKAQESSLVL---RMGEQIPISYGE--T--IIPT 64 (333) T ss_pred CchhHHhhhhcccccccCc--ee-------cCCccccchhHHHHHHHHHHhhchhh---hhcceeeccCCc--e--EEEE Confidence 3332222111111110000 00 0000000000 000 000000 000000000000 0 0000 Q ss_pred cccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHH Q lcl|NC_014036. 217 EQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAIL 296 (522) Q Consensus 217 ~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNIL 296 (522) .........+++|-....+|. ...++-.-+++++++..|--+--...|-||.+|-. .|.|++|.+.| T Consensus 65 ~~~~~~a~~v~eg~~~~~~e~---------~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~----~~~~~~i~~~l 131 (333) T protein:vir:78 65 TVKRPEVGQVGVGTSNEQREG---------GLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNP----SGLYTKLQGDL 131 (333) T ss_pred EeCCceeEeecCccccccccc---------ccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHH Confidence 000111112222222222231 22445555666666666655566678889888754 46799999999 Q ss_pred HHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCc Q lcl|NC_014036. 297 ATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 376 (522) Q Consensus 297 STEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn 376 (522) ...|...|+..+|.=-......+..++.. ..++..-. ............+..|..+-..+...-.+ ..+ T Consensus 132 a~ai~~~~d~~~l~G~g~~~~~~~~g~~~----~~~~~~~~------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~~~ 200 (333) T protein:vir:78 132 AYAIGRGIDLAVFHGKSPLTGSALQGIDT----DNVIANTT------NVDYLQETGDPLLDRLLDGYDLVSANTDV-EFN 200 (333) T ss_pred HHHHHHHHHHHHhcccCCCCCcccccccc----cccccccc------cccccccccchhHHHHHHHHHhhcccccc-Cce Confidence 99999999999983111111111111100 00100000 00000011111222233333333333333 557 Q ss_pred EEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccce---------EEEE-------- Q lcl|NC_014036. 377 FIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDY---------FTVG-------- 439 (522) Q Consensus 377 ~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy---------~~vG-------- 439 (522) .+|+.|.-...|..+..+.-..+.... ..+.. ..-.|+|.| ++|+++.+.+.+. +++| T Consensus 201 ~~vmn~~~~~~L~~~~~~~d~~G~~i~---~~~~~-~~~~~~l~G-~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g 275 (333) T protein:vir:78 201 GWAVDPRFRAHLLRAQAYRDANGNVDP---SRINL-AAQTGDVLG-LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFG 275 (333) T ss_pred EEEEcchHHHHHHHHhhhcCCCCceee---cCccc-cCCCceeec-eeeEEccccCCCccccCCCccEEEEEecccEEEE Confidence 888999887777644221111110000 00000 011257787 6999988876442 3333 Q ss_pred EecCCCccceeEeecccccccccccCCcccc-ceee--eeeeecce-ecC--ccc-cccCCc Q lcl|NC_014036. 440 YKGDNEMDAGIYYAPYVALTPLRGSDPKNFQ-PVMG--FKTRYGVG-INP--FAN-SRSQAP 494 (522) Q Consensus 440 ~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~q-P~~~--~~tRY~l~-~nP--~~~-~~~~~~ 494 (522) ..+..++ -..+|.-.......--.-|| -.++ ...|++.. .+| |.. ....+| T Consensus 276 ~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 276 FADEIRI----KMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EeeccEE----EEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 2222111 11122110000000000111 1122 34577744 666 333 222223 No 36 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=89.14 E-value=0.028 Score=29.09 Aligned_cols=284 Identities=12% Similarity=0.133 Sum_probs=125.1 Q ss_pred ccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccc Q lcl|NC_014036. 80 ASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQ 157 (522) Q Consensus 80 ~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~ 157 (522) +..+++++....=|.-+ .+++++.++.+..+++-+.||++++--|.- ... .+.+.|-|. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~----~~~---------------~~~a~wv~E 61 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPV----LAT---------------LPEADWVGE 61 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEE----EeC---------------CcceEEeec Confidence 22333333222223222 566677777888888999998876521111 000 001111111 Q ss_pred ccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhh Q lcl|NC_014036. 158 GAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAEL 237 (522) Q Consensus 158 g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEa 237 (522) +.. T Consensus 62 ~~~----------------------------------------------------------------------------- 64 (305) T protein:vir:25 62 SAT----------------------------------------------------------------------------- 64 (305) T ss_pred ccc----------------------------------------------------------------------------- Confidence 000 Q ss_pred ccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccc Q lcl|NC_014036. 238 QEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQ 317 (522) Q Consensus 238 l~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~ 317 (522) .....++.-..++++++..++..+-.-.+|-||.+|- ..|.|++|.+-|+..|...+++.+|.=.-.... T Consensus 65 ------~~~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~ 134 (305) T protein:vir:25 65 ------DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPAS 134 (305) T ss_pred ------cccccccccccceeeEEeeeEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHhhhheeccCCCCC Confidence 0000112223344555566666666778999999984 357899999999999999999999931100000 Q ss_pred cccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccc Q lcl|NC_014036. 318 VGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 397 (522) Q Consensus 318 ~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~ 397 (522) .+..+.... ....+. ...... .......++.-+.+....+...-. ..+-+|++|.....|..+ .+.. T Consensus 135 ~~~~~~~~~-~~~~~~----~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l--kd~~ 201 (305) T protein:vir:25 135 WVSPALIPA-AVTAGQ----AVEVVG----GVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANI--RDAN 201 (305) T ss_pred ccccccccc-cccccc----cccccc----cchhhhHHHHHHHHHHHhhhhccc--ccceeEecHHHHHHHHHh--hccC Confidence 000000000 000000 000000 111223344444444444444322 334578899988888633 1110 Q ss_pred cccccccccccccccceeE-EEecCceEEEecCCCccc----eEEE--------EEecCCCccceeEeeccccccccccc Q lcl|NC_014036. 398 AGQGLQKTLNVDTTKAVFA-GVLGGVYKVYIDQYARGD----YFTV--------GYKGDNEMDAGIYYAPYVALTPLRGS 464 (522) Q Consensus 398 ~~~~~~~~~~~d~~~~~~~-G~l~~~~~vy~D~y~~~d----y~~v--------G~KG~~~~d~glfyaPYv~~~~~~~~ 464 (522) + ...|. ++|.| ++|+|..+.+.+ -+++ |..+.-+.+- ..+.-+.+ .- T Consensus 202 -G------------~~i~~~~~l~G-~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~~~~~--~~ 261 (305) T protein:vir:25 202 -G------------NPVFRDDSFAG-FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKF----LDQATLGT--GE 261 (305) T ss_pred -C------------ceeecCCcccc-cceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEE----eeeeeeec--CC Confidence 0 01111 46766 699988876542 1222 2222111110 00100000 00 Q ss_pred CCcc-cc-ceee--eeeeecc-eecCccc-cccCCccccccCcc Q lcl|NC_014036. 465 DPKN-FQ-PVMG--FKTRYGV-GINPFAN-SRSQAPSDRITSGM 502 (522) Q Consensus 465 Dp~s-~q-P~~~--~~tRY~l-~~nP~~~-~~~~~~~~~i~~g~ 502 (522) .|.+ || ..++ ...|||+ +.||=+- ..+..+.+-+.-.. T Consensus 262 ~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 262 NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred ceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 1111 22 1223 4668996 5588442 33332222222221 No 37 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=89.04 E-value=0.029 Score=29.04 Aligned_cols=282 Identities=13% Similarity=0.068 Sum_probs=125.4 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccc Q lcl|NC_014036. 79 IASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQ 157 (522) Q Consensus 79 ~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~ 157 (522) .+++++++... .-|.+. .++.++.+..+..+++.+.||++-.. +|.-... .+++. T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-------~~p~~~~------------~~~a~---- 56 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ-------REFVFDF------------DSDID---- 56 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEec------------CcceE---- Confidence 34555554442 334333 44444555666778999999876321 1211000 00000 Q ss_pred ccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhh Q lcl|NC_014036. 158 GAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAEL 237 (522) Q Consensus 158 g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEa 237 (522) ..+| T Consensus 57 ---------------------------------------------------------------------------wv~E- 60 (300) T protein:vir:95 57 ---------------------------------------------------------------------------IVAE- 60 (300) T ss_pred ---------------------------------------------------------------------------EeeC- Confidence 1112 Q ss_pred ccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccc Q lcl|NC_014036. 238 QEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQ 317 (522) Q Consensus 238 l~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~ 317 (522) +.+.++...+++.+++.+|.-+-...+|-||.+-... ..+|-+++|.+-|...|...+++.++.=.....- T Consensus 61 --------g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g 131 (300) T protein:vir:95 61 --------NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTK 131 (300) T ss_pred --------CcccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCC Confidence 1123455555667777777777778899998753322 2466788999999999999999999822110000 Q ss_pred cccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccc Q lcl|NC_014036. 318 VGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 397 (522) Q Consensus 318 ~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~ 397 (522) ......... ...+... ..+.+ .....+.-|.++...+.. .+++.+.+|++|.....|..+- + T Consensus 132 -~~~~~~~~~-~~~~~~~----~~~~~------~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk--d-- 193 (300) T protein:vir:95 132 -QASTIIGDN-CFDKKVT----QTVPF------KDTNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSKMK--N-- 193 (300) T ss_pred -CCccccccc-ccccccc----eeecc------cccchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHHhh--c-- Confidence 000000000 0000000 00000 001223334444443332 2346667899999998886431 1 Q ss_pred cccccccccccccccceeEEEecCceEEEecCCCcc------ceEEEEEecCCCccceeEeecccc--cccccccCCcc- Q lcl|NC_014036. 398 AGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG------DYFTVGYKGDNEMDAGIYYAPYVA--LTPLRGSDPKN- 468 (522) Q Consensus 398 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~d~glfyaPYv~--~~~~~~~Dp~s- 468 (522) +.+ +-....+.+. ...++|.| ++|+++.+.+. +.+++|- +.-+++|..... +...+-.|++. T Consensus 194 -~~G-~~i~~~~~~~-~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d~~ 264 (300) T protein:vir:95 194 -AEG-GKLYPELAWG-GVPDAING-LAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPDNS 264 (300) T ss_pred -cCC-CeeccCcccc-CCCceecc-eeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEeeccCCCCc Confidence 100 1111111111 12467888 69999888653 1222221 001112221111 11111112221 Q ss_pred ----cc---ceeeeeeeeccee-cCccccccCCccccccCcchHHhhccccceeeeeeecc Q lcl|NC_014036. 469 ----FQ---PVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKG 521 (522) Q Consensus 469 ----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~ 521 (522) || =.+=+..|+++.+ +| +++.+..++.| T Consensus 265 ~~~~f~~~~v~~r~~~r~d~~v~~~-------------------------~a~~~l~~~~g 300 (300) T protein:vir:95 265 GRDLKGYNQIYIRCEAYIGWGIMDA-------------------------ASFARIVKTGG 300 (300) T ss_pred chhhhhcCcEEEEEEEeecceeecc-------------------------cceEEEecCCC Confidence 11 2222334666433 44 33333333334 No 38 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=88.82 E-value=0.03 Score=28.94 Aligned_cols=359 Identities=14% Similarity=0.099 Sum_probs=140.5 Q ss_pred CcchHHHHHhhhhhhccccc-hhhhcch---h----hhHHHHHH--hhhHHHHh-------hhhh--------------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG-LPDIATK---S----KKQLIAAI--MEAQEKDA-------EVDP--------------- 48 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-~~~i~~~---~----~~~~~~~~--~enq~~~~-------~~~~--------------- 48 (522) |-..++|.++=.-+.+..+. +-++++. - .+.+...+ |+.|-+.+ .+.. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 44443333332222211000 0000000 0 00111000 11111111 1000 Q ss_pred -hhcchhhhhhhcccccccccc----------ccccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeec Q lcl|NC_014036. 49 -VYRDEKIVESFGGFLAEAEIA----------GDHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQ 115 (522) Q Consensus 49 -~~~~~~~~~~~~~~l~ea~~~----------~~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQ 115 (522) ..++..-....+..+.+.... -..+.+......++..|.+ --|..+ .+++.+.+...-.+++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~--~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc--cccHHHHHHHHHHHHhhhhhhhhccee Confidence 000000000000000000000 0000000000111112211 124332 4666677788889999999 Q ss_pred cCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeec Q lcl|NC_014036. 116 PMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNV 195 (522) Q Consensus 116 PmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~ 195 (522) ||+++++-+--.+ +... +...| T Consensus 159 ~~~~~~~~~~~~~-----~~~~------------~~~~~----------------------------------------- 180 (415) T protein:vir:46 159 RVTNGSGKYPVVR-----QSEV------------AALEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEE-----ecCC------------cceee----------------------------------------- Confidence 9999876432111 0000 00000 Q ss_pred cccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhH Q lcl|NC_014036. 196 SGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSV 274 (522) Q Consensus 196 ~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ 274 (522) + +| +...++.+ -++++++..++..+-...+|- T Consensus 181 ------------------------------v--------~E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 213 (415) T protein:vir:46 181 ------------------------------V--------EE---------LEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred ------------------------------c--------cc---------ccccccccccceeeEEeeeeeeEeeehhhH Confidence 0 01 01123332 245566666666666678999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHH Q lcl|NC_014036. 275 ELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKA 354 (522) Q Consensus 275 ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~ 354 (522) ||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-...-.+....... ....+.-. +... .+-... T Consensus 214 ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~---~~~~~~~~------~~~~-~~~i~~ 279 (415) T protein:vir:46 214 EAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEK---EGKKLEVK------KAKS-LDDIKD 279 (415) T ss_pred HHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccc---ccceeccc------cccc-hHHHHH Confidence 9999843 57799999999999999999999954322221111110000 00111000 0000 122233 Q ss_pred HHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc Q lcl|NC_014036. 355 LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD 434 (522) Q Consensus 355 L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 434 (522) |+..+.. .+.+.+.+|++|.....|..+- +.. +. -....+.+. ...++|.| ++|++.++.+.. T Consensus 280 ~~~~~~~---------~~~~~~~~v~n~~~~~~L~~lk--d~~-G~---~i~~~~~~~-~~~~~l~G-~pV~~~~~~~~~ 342 (415) T protein:vir:46 280 AINLNVK---------PNYEHNVAIVSQTMFAKLDKMK--DKL-GN---YLIQPDVKE-KTQQRLLG-AKIEILPDEVLG 342 (415) T ss_pred HHHhhhh---------hccCCCEEEEcHHHHHHHHHhh--ccC-CC---eeeccCcCC-CCCccccc-eeeEEecccccc Confidence 3333322 2235678999999998887541 111 00 000111111 12356777 688877665531 Q ss_pred eEEEEEecCCCccceeEeeccc--------ccccccccCCccccceeeeeeeecce-ecCccccc-cCCccccccCcchH Q lcl|NC_014036. 435 YFTVGYKGDNEMDAGIYYAPYV--------ALTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSR-SQAPSDRITSGMIT 504 (522) Q Consensus 435 y~~vG~KG~~~~d~glfyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~-~~~~~~~i~~g~~~ 504 (522) -.| +..++|+.|- ....+...|-.+++-.+-...|++.. .+|=+... +-.+. .--.++. T Consensus 343 -----~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~--~~~~~~~ 411 (415) T protein:vir:46 343 -----QKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS--ERGEGDL 411 (415) T ss_pred -----CCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEeecc--CCCCCCc Confidence 001 1112222211 11112223455667777778888764 35522100 00000 0011222 Q ss_pred Hhhc Q lcl|NC_014036. 505 KEMF 508 (522) Q Consensus 505 ~~~~ 508 (522) ..-+ T Consensus 412 ~~~~ 415 (415) T protein:vir:46 412 GLEA 415 (415) T ss_pred cCCC Confidence 2211 No 39 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=88.82 E-value=0.03 Score=28.94 Aligned_cols=359 Identities=14% Similarity=0.099 Sum_probs=140.5 Q ss_pred CcchHHHHHhhhhhhccccc-hhhhcch---h----hhHHHHHH--hhhHHHHh-------hhhh--------------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG-LPDIATK---S----KKQLIAAI--MEAQEKDA-------EVDP--------------- 48 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-~~~i~~~---~----~~~~~~~~--~enq~~~~-------~~~~--------------- 48 (522) |-..++|.++=.-+.+..+. +-++++. - .+.+...+ |+.|-+.+ .+.. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 44443333332222211000 0000000 0 00111000 11111111 1000 Q ss_pred -hhcchhhhhhhcccccccccc----------ccccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeec Q lcl|NC_014036. 49 -VYRDEKIVESFGGFLAEAEIA----------GDHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQ 115 (522) Q Consensus 49 -~~~~~~~~~~~~~~l~ea~~~----------~~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQ 115 (522) ..++..-....+..+.+.... -..+.+......++..|.+ --|..+ .+++.+.+...-.+++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~--~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc--cccHHHHHHHHHHHHhhhhhhhhccee Confidence 000000000000000000000 0000000000111112211 124332 4666677788889999999 Q ss_pred cCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeec Q lcl|NC_014036. 116 PMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNV 195 (522) Q Consensus 116 PmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~ 195 (522) ||+++++-+--.+ +... +...| T Consensus 159 ~~~~~~~~~~~~~-----~~~~------------~~~~~----------------------------------------- 180 (415) T protein:vir:47 159 RVTNGSGKYPVVR-----QSEV------------AALEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEE-----ecCC------------cceee----------------------------------------- Confidence 9999876432111 0000 00000 Q ss_pred cccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhH Q lcl|NC_014036. 196 SGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSV 274 (522) Q Consensus 196 ~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ 274 (522) + +| +...++.+ -++++++..++..+-...+|- T Consensus 181 ------------------------------v--------~E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 213 (415) T protein:vir:47 181 ------------------------------V--------EE---------LEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred ------------------------------c--------cc---------ccccccccccceeeEEeeeeeeEeeehhhH Confidence 0 01 01123332 245566666666666678999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHH Q lcl|NC_014036. 275 ELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKA 354 (522) Q Consensus 275 ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~ 354 (522) ||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-...-.+....... ....+.-. +... .+-... T Consensus 214 ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~---~~~~~~~~------~~~~-~~~i~~ 279 (415) T protein:vir:47 214 EAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEK---EGKKLEVK------KAKS-LDDIKD 279 (415) T ss_pred HHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccc---ccceeccc------cccc-hHHHHH Confidence 9999843 57799999999999999999999954322221111110000 00111000 0000 122233 Q ss_pred HHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc Q lcl|NC_014036. 355 LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD 434 (522) Q Consensus 355 L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 434 (522) |+..+.. .+.+.+.+|++|.....|..+- +.. +. -....+.+. ...++|.| ++|++.++.+.. T Consensus 280 ~~~~~~~---------~~~~~~~~v~n~~~~~~L~~lk--d~~-G~---~i~~~~~~~-~~~~~l~G-~pV~~~~~~~~~ 342 (415) T protein:vir:47 280 AINLNVK---------PNYEHNVAIVSQTMFAKLDKMK--DKL-GN---YLIQPDVKE-KTQQRLLG-AKIEILPDEVLG 342 (415) T ss_pred HHHhhhh---------hccCCCEEEEcHHHHHHHHHhh--ccC-CC---eeeccCcCC-CCCccccc-eeeEEecccccc Confidence 3333322 2235678999999998887541 111 00 000111111 12356777 688877665531 Q ss_pred eEEEEEecCCCccceeEeeccc--------ccccccccCCccccceeeeeeeecce-ecCccccc-cCCccccccCcchH Q lcl|NC_014036. 435 YFTVGYKGDNEMDAGIYYAPYV--------ALTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSR-SQAPSDRITSGMIT 504 (522) Q Consensus 435 y~~vG~KG~~~~d~glfyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~-~~~~~~~i~~g~~~ 504 (522) -.| +..++|+.|- ....+...|-.+++-.+-...|++.. .+|=+... +-.+. .--.++. T Consensus 343 -----~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~--~~~~~~~ 411 (415) T protein:vir:47 343 -----QKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS--ERGEGDL 411 (415) T ss_pred -----CCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEeecc--CCCCCCc Confidence 001 1112222211 11112223455667777778888764 35522100 00000 0011222 Q ss_pred Hhhc Q lcl|NC_014036. 505 KEMF 508 (522) Q Consensus 505 ~~~~ 508 (522) ..-+ T Consensus 412 ~~~~ 415 (415) T protein:vir:47 412 GLEA 415 (415) T ss_pred cCCC Confidence 2211 No 40 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=88.77 E-value=0.03 Score=28.91 Aligned_cols=258 Identities=15% Similarity=0.119 Sum_probs=115.0 Q ss_pred hhhhhccccccccccccccccccccccccc-cccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeee Q lcl|NC_014036. 55 IVESFGGFLAEAEIAGDHGYDATKIASGNS-SGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVY 131 (522) Q Consensus 55 ~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~-tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY 131 (522) +.+++. .+++ .|.+ -. |.-+ .+++.+-++.+-.+++.+-||++.+|- ..+ T Consensus 1 ~l~~~~--------------------~~t~~~gg~-li-P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~-----~~~ 53 (293) T protein:vir:48 1 MLDSKT--------------------DHSGSDAGL-TI-PQDIRTAINTLVRQYDSLQEYVNVENVTTLTGS-----RVY 53 (293) T ss_pred Cceeec--------------------ccccCcCce-Ee-chhHHHHHHHHHHhhhhhhhhceeeeccCCcce-----EEE Confidence 111111 1111 1111 11 2222 345555566777788888888776541 111 Q ss_pred cCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccc Q lcl|NC_014036. 132 GKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALD 211 (522) Q Consensus 132 ~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~ 211 (522) ...... ++ T Consensus 54 ~~~~~~-----------------~~------------------------------------------------------- 61 (293) T protein:vir:48 54 EKWTDI-----------------TG------------------------------------------------------- 61 (293) T ss_pred EeecCC-----------------Cc------------------------------------------------------- Confidence 110000 00 Q ss_pred ccccccccccccccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHH Q lcl|NC_014036. 212 AAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADA 290 (522) Q Consensus 212 ~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEa 290 (522) .....+| +..++|.+ .++++++..+|.-+-...+|-||.+|. .+|.|+ T Consensus 62 ------------------~a~~v~E---------g~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds----~~~l~~ 110 (293) T protein:vir:48 62 ------------------LANIDDE---------AGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILA 110 (293) T ss_pred ------------------ceeeecC---------CcccccccccceeEEEEeeeEEEEeehhhHHHHhhh----hHHHHH Confidence 0001112 11234443 456677777777777788999999986 367899 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_014036. 291 ELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQT 370 (522) Q Consensus 291 ELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T 370 (522) +|.+-|+..|..-+|+.|+.-+...+. ..+.+++ +....|+.++ ... T Consensus 111 ~i~~~la~~~~~~~~~~i~~g~~~~~~------------~~~~~~~-------------d~i~~~~~~l-------~~~- 157 (293) T protein:vir:48 111 WLSGWIAKKVVVTRNKAILGVVDKLPT------------KPTLTKW-------------DDIIDLEAKV-------DPA- 157 (293) T ss_pred HHHHHHHHHHHHHHHhHHhhccccccc------------cccccCH-------------HHHHHHHHhh-------hhh- Confidence 999999999999999999843322211 1122221 1123343333 222 Q ss_pred cccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEe--cCCCcc--------------c Q lcl|NC_014036. 371 GRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYI--DQYARG--------------D 434 (522) Q Consensus 371 ~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------------d 434 (522) +......+|+|.....|..+- +.... ....++......++|.| ++|++ |.+.+. + T Consensus 158 -~~~~a~~vmn~~~~~~L~~lk--d~~g~-----~l~~~~~~~~~~~~l~G-~Pv~~~~~~~~~~~~~~~~~~~~gd~~~ 228 (293) T protein:vir:48 158 -IKQTSFFLTNTSGFTALKKVK--NALGD-----YLMERDVKSPTGYSIAG-FAVKEISDRWLPNASSGVMPLYFGDLKQ 228 (293) T ss_pred -hcCCCEEEEcHHHHHHHHHhh--ccCCc-----eEeecCcCCCCCceecc-eeeEEecccccCCccCCceEEEEEeccc Confidence 223456789999988886431 11100 01111111112346777 58775 333221 1 Q ss_pred eEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecc---------------eecCccccccCCcccccc Q lcl|NC_014036. 435 YFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGV---------------GINPFANSRSQAPSDRIT 499 (522) Q Consensus 435 y~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l---------------~~nP~~~~~~~~~~~~i~ 499 (522) ++.++.++.-..+ ..++.. .+-.+.|=.+-...||+. .+-|+....+-+ T Consensus 229 ~~~~~~~~~~~i~----~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~------ 292 (293) T protein:vir:48 229 AVTLFDRQQMSLL----STNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTA------ 292 (293) T ss_pred eEEEEEecceEEE----Eecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccccC------ Confidence 2222222211111 111100 011122333334444443 333333222111 Q ss_pred CcchHH Q lcl|NC_014036. 500 SGMITK 505 (522) Q Consensus 500 ~g~~~~ 505 (522) . T Consensus 293 -----~ 293 (293) T protein:vir:48 293 -----V 293 (293) T ss_pred -----C Confidence 0 No 41 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=88.66 E-value=0.031 Score=28.86 Aligned_cols=338 Identities=15% Similarity=0.078 Sum_probs=121.5 Q ss_pred Ccch-HHHHHh----h---hhhhcccc-----c-hhhhcchhhh-HHHHHHhhhHHHHhhhhhhhcchh----------- Q lcl|NC_014036. 1 MSKK-NELMEK----W---NDLLESQE-----G-LPDIATKSKK-QLIAAIMEAQEKDAEVDPVYRDEK----------- 54 (522) Q Consensus 1 ~~~~-~~l~~k----w---~p~l~~~~-----~-~~~i~~~~~~-~~~~~~~enq~~~~~~~~~~~~~~----------- 54 (522) |.++ ++|.++ + ..+++..+ . ..+++...++ +-+....|.+.+.+.....-.+.. T Consensus 5 m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (397) T protein:vir:12 5 MSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQG 84 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccccc Confidence 4433 223333 2 22222110 0 0111111000 000111111111111100000000 Q ss_pred ---------hhhhh-----ccccccccccccccccccccccc-cccccccccCcchh--hHHHHHHhhhhhhhceeeccC Q lcl|NC_014036. 55 ---------IVESF-----GGFLAEAEIAGDHGYDATKIASG-NSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPM 117 (522) Q Consensus 55 ---------~~~~~-----~~~l~ea~~~~~~g~~~~~~~~~-t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPm 117 (522) ....| +..+.+.+-.....-+...+..+ +++|.+.- |.-+ .+++.+.++.+-.+++.+.|| T Consensus 85 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lv--P~~~~~~ii~~~~~~~~l~~~~~~~~~ 162 (397) T protein:vir:12 85 QGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILI--PEDIGRQIHEFKRQFEPLEQYVTVEPV 162 (397) T ss_pred chhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccC--chhHHHHHHHhhhhhhhHHhhcceeec Confidence 00001 11111111000000000111111 12222211 2222 355555667778899999999 Q ss_pred CchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccc Q lcl|NC_014036. 118 TGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSG 197 (522) Q Consensus 118 TGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~ 197 (522) +++.|-+- +..+... +.+.|-+ T Consensus 163 ~~~~~~~~-----~~~~~~~------------~~a~~v~----------------------------------------- 184 (397) T protein:vir:12 163 TTRSGTRL-----LEKNADM------------VPFSPVE----------------------------------------- 184 (397) T ss_pred cCCceeEE-----EEEecCC------------cceeeec----------------------------------------- Confidence 98887432 1111000 0000000 Q ss_pred cccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014036. 198 APVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELA 277 (522) Q Consensus 198 ~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 277 (522) +| ++. ...+...|.+..|+..|..+- ..+|-||. T Consensus 185 ------------------------------Eg-----~~~----~~~~~~~~~~v~~~~~k~~~~-------~~is~e~l 218 (397) T protein:vir:12 185 ------------------------------EL-----GNL----PEIDQPRFTKVSYSIIDYGGI-------MTLSNSML 218 (397) T ss_pred ------------------------------cc-----ccc----cccccccceeEEeeheeeEee-------ehhhHHHH Confidence 00 000 000112355555555555544 55999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHH Q lcl|NC_014036. 278 QDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLI 357 (522) Q Consensus 278 QDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~ 357 (522) +|-- +|.++.|.+.|...|...+|+.|+.-.-. +...|+..+++ ..+.++. T Consensus 219 ~ds~----~~l~~~i~~~l~~~~~~~~d~~il~G~g~-------------~~~~g~~~~~~------------i~~~~~~ 269 (397) T protein:vir:12 219 NDSD----QAIMTYVAKWFAKKSVVTRNNLILAAIAS-------------LKKVDIDGLDG------------IKKALNV 269 (397) T ss_pred hhch----HHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------ccccccccHHH------------HHHHHhh Confidence 8853 56799999999999999999998832111 11124432221 1111222 Q ss_pred HHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc---- Q lcl|NC_014036. 358 QIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG---- 433 (522) Q Consensus 358 ~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~---- 433 (522) .++ ..+..+..+||+|.....|..+- +.. + .-....+.+. ..-++|.| ++|++.+.... T Consensus 270 ~l~---------~~~~~~a~~~~n~~~~~~L~~lk--d~~-G---~~l~~~~~~~-g~~~~l~G-~pv~~~~~~~~~~~~ 332 (397) T protein:vir:12 270 TLD---------PMVAPGSIVLTNQDGYDWLDTLK--DGT-G---RYLLQPDPTN-PTKKLLDG-RPVVPFTNRVLKTQK 332 (397) T ss_pred ccc---------hhhhCCCEEEEcHHHHHHHHHhh--ccC-C---ceeecccccC-CCCccccc-eeeEEecccccccCC Confidence 222 12234567899999998887541 000 0 0000011111 12246777 58886543211 Q ss_pred -ce-EEEEEecCCCccceeEeeccccccccccc-----CCccccceeeeeeeecce-ecCccccccCCccccccCcchHH Q lcl|NC_014036. 434 -DY-FTVGYKGDNEMDAGIYYAPYVALTPLRGS-----DPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITK 505 (522) Q Consensus 434 -dy-~~vG~KG~~~~d~glfyaPYv~~~~~~~~-----Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~ 505 (522) +. +++|-- .....++. -....+... +-.+.+-.+-...|++.. .||=+...-. T Consensus 333 ~~~~~~~gd~-----~~~~~~~~-~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~------------- 393 (397) T protein:vir:12 333 GKAPLIIGNL-----KEAIVLFD-REQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQ------------- 393 (397) T ss_pred CccEEEEEeh-----hceEEEEe-ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE------------- Confidence 11 222210 00000000 000000000 011233445555666543 2441111110 Q ss_pred hhccccceeeeeeec Q lcl|NC_014036. 506 EMFGKNAYFRKVYVK 520 (522) Q Consensus 506 ~~~~~~~~~r~~~Vk 520 (522) +=+| T Consensus 394 -----------~t~~ 397 (397) T protein:vir:12 394 -----------ITVE 397 (397) T ss_pred -----------EeeC Confidence 0000 No 42 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=88.47 E-value=0.032 Score=28.77 Aligned_cols=344 Identities=12% Similarity=0.042 Sum_probs=132.7 Q ss_pred Ccch-HHHHHhhhhhhccccchhhhcchh----------hhHH---HHHH--hhhH----HHHhhhhh--hhcchhhhhh Q lcl|NC_014036. 1 MSKK-NELMEKWNDLLESQEGLPDIATKS----------KKQL---IAAI--MEAQ----EKDAEVDP--VYRDEKIVES 58 (522) Q Consensus 1 ~~~~-~~l~~kw~p~l~~~~~~~~i~~~~----------~~~~---~~~~--~enq----~~~~~~~~--~~~~~~~~~~ 58 (522) |++. ++|.+.+.-+.+. +.++.+.- ++.+ .+.+ |+.+ |+.+.+.. .-....-..+ T Consensus 1 m~~l~~~l~~~~~~~~~~---~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:81 1 MTDITSKLEATLANVTDS---LRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHHHHHHHHHHHHHHH---HHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 9988 4477777777653 22221111 1111 1111 1111 01011100 0000000000 Q ss_pred hcccccccc----c------c-cccccccccc----ccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 59 FGGFLAEAE----I------A-GDHGYDATKI----ASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 59 ~~~~l~ea~----~------~-~~~g~~~~~~----~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) .+....+.+ . . .....+.... ..++++.+-.-..|..+ .++++.-+..+-.+++.+.||++++. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 157 (390) T protein:vir:81 78 VGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALI 157 (390) T ss_pred chhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCce Confidence 011110100 0 0 0000000000 00111111111223333 44555555666777788888776652 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) -+ ......+ ... T Consensus 158 ~~-------~~~~~~~-----------~~a-------------------------------------------------- 169 (390) T protein:vir:81 158 EY-------VQETGFV-----------NNA-------------------------------------------------- 169 (390) T ss_pred EE-------EEEecCC-----------cce-------------------------------------------------- Confidence 11 1100000 000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRA 282 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 282 (522) . ..+|. ..+++-..++++++.+.|.-+-...+|-||.+|- T Consensus 170 ---------------------~--------~v~Eg---------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-- 209 (390) T protein:vir:81 170 ---------------------A--------IVAEG---------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-- 209 (390) T ss_pred ---------------------e--------eecCC---------cccccccceeeEEEEeeeEEEEeehhhHHHHHhH-- Confidence 0 00110 1122333344555555555555667899999984 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccc---ccchhHHHHHHHHHHHH Q lcl|NC_014036. 283 VHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDV---RGARWAGESYKALLIQI 359 (522) Q Consensus 283 iHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~---~~~r~~~E~~r~L~~~i 359 (522) . +.++.|.+-|+..|...+|+.||.- .-. +. .-.|++........ .......+....++.++ T Consensus 210 ~---~~~~~i~~~l~~~~~~~~d~a~l~G---~g~-~~--------~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (390) T protein:vir:81 210 P---QLASYMNNRLIRGLKVKEDAEILRG---TGA-ND--------GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQA 274 (390) T ss_pred H---HHHHHHHHHHHHHHHHHHHHHHHhc---CCC-CC--------cccceeecccccccccccccchhHHHHHHHHHhh Confidence 2 4699999999999999999998821 100 00 01233221110000 00111223333333332 Q ss_pred HHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEE Q lcl|NC_014036. 360 DKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVG 439 (522) Q Consensus 360 ~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG 439 (522) . ..+...+.+|++|.....|..+- +.. +. ....+... .-.++|.| ++|++.+..|.+-+++| T Consensus 275 ~---------~~~~~~~~~v~~~~~~~~l~~lk--d~~-G~----~l~~~~~~-~~~~~l~G-~pv~~~~~~p~~~~~~g 336 (390) T protein:vir:81 275 S---------LAEYNPSGIVINPIDWAAIELAK--DAN-NQ----YLIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVG 336 (390) T ss_pred c---------cccCCCCEEEEcHHHHHHHHHhh--cCC-Cc----eeecCccc-ccCceecc-eeeEEcCCCCCCcEEEE Confidence 2 22335678899999988887441 111 10 00011111 11246776 69999999887655555 Q ss_pred EecCCCccceeEeecccccccccccCC----ccccceeeeeeeecc-eecCccccccCCccccccCc Q lcl|NC_014036. 440 YKGDNEMDAGIYYAPYVALTPLRGSDP----KNFQPVMGFKTRYGV-GINPFANSRSQAPSDRITSG 501 (522) Q Consensus 440 ~KG~~~~d~glfyaPYv~~~~~~~~Dp----~s~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~i~~g 501 (522) ---. .++.. ......+...+. .+-+=.+=...|++. +.+|=+.- ++.=+ T Consensus 337 d~~~-----~~~~~-~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v-------~~t~a 390 (390) T protein:vir:81 337 AFDL-----AAQIF-DQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALI-------SGSFA 390 (390) T ss_pred ehhc-----eEEEE-EecceEEEEecccchhhcCcEEEEEEEeeccEEecccceE-------EEEeC Confidence 3210 00000 000111111110 112223334556655 33441111 11101 No 43 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=88.37 E-value=0.033 Score=28.73 Aligned_cols=349 Identities=12% Similarity=0.101 Sum_probs=123.9 Q ss_pred CcchHHHHHhhhhh-----------hccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhcccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDL-----------LESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIA 69 (522) Q Consensus 1 ~~~~~~l~~kw~p~-----------l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~ 69 (522) .-......+|...- ++..+..+.+.+..++.... -.++-++++.. +.+.+ .... ....+.+ T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~e~-~~~~~-~~~~---~~~~~~~-- 152 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYG-TQENFEDEVEK-LVLLS-YVME---KGVFETE-- 152 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchh-hhhhHHHHHHH-HHHHH-HHHh---hccchhh-- Confidence 00001111222111 11111111111111110000 00000011100 00000 0000 0000000 Q ss_pred ccccc-ccccccccccccccc-ccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcc Q lcl|NC_014036. 70 GDHGY-DATKIASGNSSGAIT-NIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHP 146 (522) Q Consensus 70 ~~~g~-~~~~~~~~t~tg~v~-~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~ 146 (522) .+. .-.....+++..... ..-|.+. .++.++.++.+..++|-++||+++..-++ . ... T Consensus 153 --~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-~------~~~---------- 213 (458) T protein:vir:10 153 --HGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTML-V------EPD---------- 213 (458) T ss_pred --hhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEE-E------ecC---------- Confidence 000 000001111111111 1112222 45555667778899999999987642111 0 000 Q ss_pred cccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccc Q lcl|NC_014036. 147 MFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEI 226 (522) Q Consensus 147 ~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~ 226 (522) .+.+.|-+.+... T Consensus 214 --~~~a~~v~e~~~~----------------------------------------------------------------- 226 (458) T protein:vir:10 214 --AGKATWVAASTYG----------------------------------------------------------------- 226 (458) T ss_pred --Ccceeeccccccc----------------------------------------------------------------- Confidence 0011111110000 Q ss_pred cccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhH Q lcl|NC_014036. 227 SYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINR 306 (522) Q Consensus 227 g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINR 306 (522) .| +.. ...-..+++++++.++.-+....+|-||.+|-- .|.+++|.+-|..-|..-||+ T Consensus 227 --------~~-------~~~--~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~ 285 (458) T protein:vir:10 227 --------TD-------TTT--GEEVKGALKEIHFSTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAVSIEE 285 (458) T ss_pred --------cc-------ccc--cccccccceeeEeeeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHH Confidence 00 000 001112235556666666666789999998832 467899999999999999999 Q ss_pred HHHhhhhhccccccccccccccccceeeccccc------cccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEE Q lcl|NC_014036. 307 EIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDP------IDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIA 380 (522) Q Consensus 307 eii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~------~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~ 380 (522) .||. -... + .-.|++..... ....+..-..-.+..| +++-+.+. ..+......|+ T Consensus 286 ~~l~---G~G~-~---------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i----~~~~~~l~--~~~~~~~~~v~ 346 (458) T protein:vir:10 286 AFMT---GDGS-G---------KPKGLLTLASEDSAKVVTEAKADGSVLVTAKTI----SKLRRKLG--RHGLKLSKLVL 346 (458) T ss_pred Hhhc---CCCC-C---------ccceeeecccccccceeecccccccccccHHHH----HHHHHhhh--hhhcCCCEEEE Confidence 9982 1100 0 01122221110 0000000000011222 22222222 12224567899 Q ss_pred chhHHHHHhhhcccccccccc-cccccccccccceeEEEecCceEEEecCCCcc-----ceEEEEEecCCCccceeEeec Q lcl|NC_014036. 381 SRNVVSALARIDSGITPAGQG-LQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG-----DYFTVGYKGDNEMDAGIYYAP 454 (522) Q Consensus 381 S~~va~~L~~~~~~~~~~~~~-~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~d~glfyaP 454 (522) +|.....|..+--....+-.. .......+.+ -++|.| ++|+++.+.|. +.++..++ + +.++.. T Consensus 347 ~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~----~~~l~G-~pv~~~~~~p~~~~~~~~~~~~f~-~-----~~~~~~ 415 (458) T protein:vir:10 347 IVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQ----VGRIYG-LPVVVSEYFPAKANSAEFAVIVYK-D-----NFVMPR 415 (458) T ss_pred cHHHHHHHHhhcccCCceeeccccccccccCc----Cceecc-eeeEEccccccccCCcceEEEEec-c-----cEEEEE Confidence 999988886431100000000 0000001111 236776 79999988764 22222221 1 011110 Q ss_pred ccccccccccCCccccceeeee--eeecc-eecCccccccCCccc Q lcl|NC_014036. 455 YVALTPLRGSDPKNFQPVMGFK--TRYGV-GINPFANSRSQAPSD 496 (522) Q Consensus 455 Yv~~~~~~~~Dp~s~qP~~~~~--tRY~l-~~nP~~~~~~~~~~~ 496 (522) . ..+....||-+-...++|. .|.|+ +.+|=+.-...-.++ T Consensus 416 ~--~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 416 Q--RAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred e--eceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 1 1111123555445556665 46653 345622211111111 No 44 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=86.99 E-value=0.042 Score=28.15 Aligned_cols=370 Identities=13% Similarity=0.148 Sum_probs=139.3 Q ss_pred CcchHHHHHhhhhhhccccc----------hhhhcc---hhhh------HHHHHH------------hhhHHHHhhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG----------LPDIAT---KSKK------QLIAAI------------MEAQEKDAEVDPV 49 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~----------~~~i~~---~~~~------~~~~~~------------~enq~~~~~~~~~ 49 (522) .-+.++|.++=.-+++..+. ..++.. ..+. .+.+++ .++|++...+.+. T Consensus 16 ~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (477) T protein:vir:84 16 VEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERSGKLEAETKTVRKATV 95 (477) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhccccc Confidence 22222222222222221100 000000 0000 001111 1112111111100 Q ss_pred hcchh------hhhhhccc-------cccc----cc---------------cccccccccccccccccccccccCcchh- Q lcl|NC_014036. 50 YRDEK------IVESFGGF-------LAEA----EI---------------AGDHGYDATKIASGNSSGAITNIGPAVI- 96 (522) Q Consensus 50 ~~~~~------~~~~~~~~-------l~ea----~~---------------~~~~g~~~~~~~~~t~tg~v~~~~P~Li- 96 (522) -.++. ...++... .... +. ....+.....+..++++|.. ..-|-.+ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~ 174 (477) T protein:vir:84 96 EVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGY-AVPPLWMM 174 (477) T ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcce-eeccchhH Confidence 00000 00000000 0000 00 00000011111111111111 1123322 Q ss_pred -hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccccccccccccccccccccc Q lcl|NC_014036. 97 -GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIA 175 (522) Q Consensus 97 -~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a 175 (522) .++...-++.+..++|++.||++.+|-+-=.|.. .. . .. ..+.+ T Consensus 175 ~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~--~~--~---~~---------a~~~~------------------- 219 (477) T protein:vir:84 175 NRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL--TG--T---ST---------AIQAA------------------- 219 (477) T ss_pred HHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe--cC--c---ce---------eeeec------------------- Confidence 2555555677788999999999988754222111 00 0 00 00000 Q ss_pred ccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceE Q lcl|NC_014036. 176 DGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFR 255 (522) Q Consensus 176 ~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFs 255 (522) +|-. ......++...+ T Consensus 220 ----------------------------------------------------Eg~~------------~~~~~~~~s~~~ 235 (477) T protein:vir:84 220 ----------------------------------------------------DNAA------------LTAPSAHEVDLT 235 (477) T ss_pred ----------------------------------------------------cCcc------------cccccccccccc Confidence 0000 001123455567 Q ss_pred EEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeec Q lcl|NC_014036. 256 IDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFD 335 (522) Q Consensus 256 IEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd 335 (522) ++.+++.+|.-+-...+|-||.+|-. .|.++.|.+-|+..|..-|++.||. -+ |..+ .-.|++. T Consensus 236 f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~~l~---G~---Gt~~------~p~Gi~~ 299 (477) T protein:vir:84 236 DGFVQANVKTIAGQQGIAIQLLDQAA----VSVDEFVFRDLAADYANKLNVQVIS---GT---GSNN------QVVGVRA 299 (477) T ss_pred eeeEEEeeeeEEeeeHHHHHHHhccc----hhHHHHHHHHHHHHHHHHHHHHHhc---cC---CCCC------ccceeee Confidence 78888888888888899999999943 5679999999999999999999882 11 1000 0123432 Q ss_pred cccccc----cccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhh-cc----ccccccccc-ccc Q lcl|NC_014036. 336 FQDPID----VRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARI-DS----GITPAGQGL-QKT 405 (522) Q Consensus 336 ~~~~~d----~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~-~~----~~~~~~~~~-~~~ 405 (522) ...... ..+..|. ....++..|-...+.+....+. .+..+|++|....+|..+ +. +|.++.... ... T Consensus 300 ~~~~~~~~~~~~~~t~~--~~~~~~~~i~~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 376 (477) T protein:vir:84 300 TAGITQVTATSAGSALE--KHQIIYQKIADAIQRVHTSRFL-EPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLG 376 (477) T ss_pred ccccccccccccccchh--hHHHHHHHHHHHHhhccccccC-CccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccc Confidence 221110 0111222 1223333333333434333333 346778888776666533 11 111111000 001 Q ss_pred cccccccceeEEEecCceEEEecCCCccc--------eEEEEEecCCCccceeEeecccccccccccCCccc--cceeee Q lcl|NC_014036. 406 LNVDTTKAVFAGVLGGVYKVYIDQYARGD--------YFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNF--QPVMGF 475 (522) Q Consensus 406 ~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~--qP~~~~ 475 (522) ...+.-.....|+|.| ++|+++++.|.+ -|++|--.+. +.- ..+..+ .++|.++ ...+.| T Consensus 377 ~~~~~~~~~~~~~l~G-~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~------~i~--~~~~~~-~~~~~~~~~~~~~~~ 446 (477) T protein:vir:84 377 VLTEVASQRVVGQMHG-LPVVTDPTLPTTLGTGTDQDVIHVLRASDL------ALF--ESSVRM-RALQETRAENLSVLL 446 (477) T ss_pred cccccccccccchhcc-cceEecCcccccccccCCcceEEEEEeceE------EEE--eeceeE-Eeccccccccceeee Confidence 1111122233567876 699999998743 3444433211 000 001111 1222222 122222 Q ss_pred eeeec-----ceecC--ccccccCCccccccCcchHHhhc Q lcl|NC_014036. 476 KTRYG-----VGINP--FANSRSQAPSDRITSGMITKEMF 508 (522) Q Consensus 476 ~tRY~-----l~~nP--~~~~~~~~~~~~i~~g~~~~~~~ 508 (522) .. |+ .+-+| |......+-.+ --++ T Consensus 447 ~v-~~~~~~~~~r~~~afv~~t~~~~~~--------~~~~ 477 (477) T protein:vir:84 447 QV-YGYLAFTAARFPQSVVEIGGTALTA--------PTFA 477 (477) T ss_pred ee-hhhhhhhhhccccceEEeecccccc--------cccC Confidence 21 22 22245 22111100000 0111 No 45 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=86.49 E-value=0.045 Score=27.96 Aligned_cols=325 Identities=14% Similarity=0.096 Sum_probs=125.6 Q ss_pred Ccc-------------------------------hHHHHHhhhhhhccccchhhhcchhhh-HHHHHHhhhHHHHhhhhh Q lcl|NC_014036. 1 MSK-------------------------------KNELMEKWNDLLESQEGLPDIATKSKK-QLIAAIMEAQEKDAEVDP 48 (522) Q Consensus 1 ~~~-------------------------------~~~l~~kw~p~l~~~~~~~~i~~~~~~-~~~~~~~enq~~~~~~~~ 48 (522) |.+ .+++.+++.-+.+. |.+.-++ .....+.+..++.....+ T Consensus 10 ~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~e------i~~l~e~~~~~~~~~~~~~~~~~~~~ 83 (400) T protein:vir:38 10 VKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKE------IKDLEEKRDLYEAALKGNEQSSGKKP 83 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHhhcccccc Confidence 211 12222222222110 1100000 000011111000000000 Q ss_pred hh-cchhhhhhhccccccc-----------------ccccccccccc-cccccc--ccccccccCcc--hhhHHHHHHhh Q lcl|NC_014036. 49 VY-RDEKIVESFGGFLAEA-----------------EIAGDHGYDAT-KIASGN--SSGAITNIGPA--VIGMVRRAIPN 105 (522) Q Consensus 49 ~~-~~~~~~~~~~~~l~ea-----------------~~~~~~g~~~~-~~~~~t--~tg~v~~~~P~--Li~l~Rra~~~ 105 (522) .- ......+.+....... ........+.. ....++ .+|.+ .-|. .-.++++.-++ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~--~vP~~~~~~ii~~~~~~ 161 (400) T protein:vir:38 84 DHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAAS--TIPETISNTPQRELQTV 161 (400) T ss_pred cchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcc--cccHHHHHHHHHHHHhh Confidence 00 0000000000000000 00000000000 001111 11211 1122 11344455567 Q ss_pred hhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccc Q lcl|NC_014036. 106 LIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFV 185 (522) Q Consensus 106 lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~ 185 (522) .+..+++.+.||++.++-+--++.. . . ...+ T Consensus 162 ~~l~~~~~~~~~~~~~~~~~~~~~~----~-~-------------~~~~------------------------------- 192 (400) T protein:vir:38 162 VDLKPFTNVFQASTQKGTYPTVANA----T-T-------------KMVT------------------------------- 192 (400) T ss_pred hhhhhcceeEeccCcceEEEEEecC----C-C-------------cccc------------------------------- Confidence 7888999999998886533221100 0 0 0000 Q ss_pred cccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCcccccc-ceEEEEEEEEEe Q lcl|NC_014036. 186 ETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEM-GFRIDKQVIEAR 264 (522) Q Consensus 186 ~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EM-sFsIEK~TVtAK 264 (522) + +|. ...++. ..+++.++...+ T Consensus 193 ----------------------------------------~--------~E~---------~~~~~~~~~~f~~i~~~~~ 215 (400) T protein:vir:38 193 ----------------------------------------V--------AEL---------EKNPAMAKPEFKPVNWSVE 215 (400) T ss_pred ----------------------------------------c--------ccc---------ccccccccccceeeEeehh Confidence 0 000 001111 123445555566 Q ss_pred cccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccccc Q lcl|NC_014036. 265 SRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRG 344 (522) Q Consensus 265 SRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~ 344 (522) .-+-...+|-||.+|- ..|.+++|.+-|...|...+|+-|+.-...... .|+..++ T Consensus 216 k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~-------------~~~~~~~------- 271 (400) T protein:vir:38 216 TYRQALPVSQESIDDS----AIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTA-------------KTISSVD------- 271 (400) T ss_pred heeeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhhhhccccccc-------------cccccHH------- Confidence 6666778999999985 356799999999999999999998833221111 1221111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceE Q lcl|NC_014036. 345 ARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYK 424 (522) Q Consensus 345 ~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~ 424 (522) ....++.... ...+ . ...|++|.....|..+ .+.. + +-....+.+. ...++|.| ++ T Consensus 272 ------~~~~~~~~~~--------~~~~-~-a~~v~~~~~~~~l~~l--kd~~-G---~~i~~~~~~~-~~~~~l~G-~p 327 (400) T protein:vir:38 272 ------DLKHINNVDL--------DPAY-S-RVIIASQSFYNFLDTV--KDGN-G---RYLLQDSILT-PSGKSVLG-MP 327 (400) T ss_pred ------HHHHHHHhhh--------hhhh-C-cEEEEcHHHHHHHHHh--hccC-C---CeeeecCcCC-CCcccccc-ce Confidence 1112211111 1112 2 4577899998888754 1100 0 0000111111 11247887 58 Q ss_pred EEecCCCccceEEEEEecCCCccceeEeeccc--------ccccccccCCccccceeeeeeeeccee-cCccc-cccCCc Q lcl|NC_014036. 425 VYIDQYARGDYFTVGYKGDNEMDAGIYYAPYV--------ALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFAN-SRSQAP 494 (522) Q Consensus 425 vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~-~~~~~~ 494 (522) |++..+.+.. -.| +.-++|+.+- ....++..|-..|+..+-...||+..+ +|-+. ..+=++ T Consensus 328 v~~~~~~~~~-----~~g----~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 398 (400) T protein:vir:38 328 IAVVSDDTLG-----AAG----EAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLTYTP 398 (400) T ss_pred eEEecccccC-----CCC----ceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceEEEEeec Confidence 8887765531 111 1112222211 122233456667777788888987653 44221 111111 Q ss_pred cc Q lcl|NC_014036. 495 SD 496 (522) Q Consensus 495 ~~ 496 (522) .+ T Consensus 399 ~a 400 (400) T protein:vir:38 399 KA 400 (400) T ss_pred CC Confidence 11 No 46 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=86.27 E-value=0.047 Score=27.88 Aligned_cols=371 Identities=12% Similarity=0.041 Sum_probs=133.2 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhh---HHHHhhhhhhhcchhhhhhhcccccc---ccccccc-- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEA---QEKDAEVDPVYRDEKIVESFGGFLAE---AEIAGDH-- 72 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~en---q~~~~~~~~~~~~~~~~~~~~~~l~e---a~~~~~~-- 72 (522) -.+.+++..++..++... .++.+...+.-...+.+. .++...+++..+..........+..+ +...... T Consensus 53 ~~~~~~~~~~~~~~~a~~---~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (497) T protein:vir:10 53 HERAQEMLKSLGGADAAK---DGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT 129 (497) T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Confidence 001111222222222210 001000000000000000 00000000100000000000000000 0000000 Q ss_pred -----------ccc----ccc-cccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCC Q lcl|NC_014036. 73 -----------GYD----ATK-IASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDP 135 (522) Q Consensus 73 -----------g~~----~~~-~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~ 135 (522) +-. ... ...+++++... .-|.+. .+++..-+..+..+++.+.||++++. .|.-.. T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~-vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~-------~~~~~~ 201 (497) T protein:vir:10 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPG-ILPTFLPGIVEQLFYELSLADLISSRPVTSPNL-------SYLTES 201 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccc-cchhhhHHHHHHHHhhhhHHhhccccccCCCce-------EEEEEc Confidence 000 000 00111112211 111111 23333334556677888888776531 111000 Q ss_pred CCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccc Q lcl|NC_014036. 136 LASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVI 215 (522) Q Consensus 136 ~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~ 215 (522) .. + T Consensus 202 ~~-----------------~------------------------------------------------------------ 204 (497) T protein:vir:10 202 AA-----------------H------------------------------------------------------------ 204 (497) T ss_pred CC-----------------C------------------------------------------------------------ Confidence 00 0 Q ss_pred ccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_014036. 216 AEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAI 295 (522) Q Consensus 216 ~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNI 295 (522) +-....+|. ...++...+++++++.+|.-+-...+|-||++|-- +.|+.|.+- T Consensus 205 -------------~~a~wv~E~---------~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~ 257 (497) T protein:vir:10 205 -------------NNAAAVAEA---------GTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGR 257 (497) T ss_pred -------------CcceeeccC---------cccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHH Confidence 000011221 12455556678888888888888899999999942 258999999 Q ss_pred HHHHHHHHhhHHHHh---------hhhhccccccc-cccccccccceeeccccccccccchhHH-----HHH-------- Q lcl|NC_014036. 296 LATEIMLEINREIVD---------MINYTAQVGKT-GFTQTVGSKAGAFDFQDPIDVRGARWAG-----ESY-------- 352 (522) Q Consensus 296 LSTEImlEINReii~---------~i~~~a~~~~~-~~~~~~~~~~g~fd~~~~~d~~~~r~~~-----E~~-------- 352 (522) |...|..-+|..||. .++...-.... ++.......+-.+.+....+-. ..|.+ ... T Consensus 258 l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 336 (497) T protein:vir:10 258 LLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT-NGAFVGQDTVASLKYGRVVTG 336 (497) T ss_pred HHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccc-cchhhhhhHHHHHHHHHhhhh Confidence 999999999999983 01111000000 0000000000000000000000 00100 000 Q ss_pred ---------------HHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhh----cccccccccccccccccccccc Q lcl|NC_014036. 353 ---------------KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARI----DSGITPAGQGLQKTLNVDTTKA 413 (522) Q Consensus 353 ---------------r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~----~~~~~~~~~~~~~~~~~d~~~~ 413 (522) ..+...+-..-..+.+...+ .++.+|.+|....+|..+ |...+.+....+.... . T Consensus 337 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~-----~ 410 (497) T protein:vir:10 337 AAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP-----V 410 (497) T ss_pred hhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCccccccccc-----c Confidence 11222233333444454444 557788999888777643 1111111111111100 0 Q ss_pred eeEEEecCceEEEecCCCccceEEEEEecCC------CccceeEeecccccccccccCCccccceeeeeeeecc-eecCc Q lcl|NC_014036. 414 VFAGVLGGVYKVYIDQYARGDYFTVGYKGDN------EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGV-GINPF 486 (522) Q Consensus 414 ~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~------~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l-~~nP~ 486 (522) ..-++|.| ++|++.+..+.+=+++|--... ..+-.+-..||.. .+=.+.+=.+=+..|+++ +.+|= T Consensus 411 ~~~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~n~v~~r~~~r~~~~v~~p~ 483 (497) T protein:vir:10 411 NGGKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVDGKVTVRAEERLGLLVYRPS 483 (497) T ss_pred cCCceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhcCcEEEEEEEeecceeeccc Confidence 01125666 7999988888654554421110 0001111222210 111223444555678866 66773 Q ss_pred cccccCCccccccCcc Q lcl|NC_014036. 487 ANSRSQAPSDRITSGM 502 (522) Q Consensus 487 ~~~~~~~~~~~i~~g~ 502 (522) +...-+-.. ...|+ T Consensus 484 A~~~l~~~~--~~~~~ 497 (497) T protein:vir:10 484 AFQLIQLKK--GATGS 497 (497) T ss_pred cEEEEEecC--CccCC Confidence 332111100 01111 No 47 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=86.27 E-value=0.047 Score=27.88 Aligned_cols=371 Identities=12% Similarity=0.041 Sum_probs=133.2 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhh---HHHHhhhhhhhcchhhhhhhcccccc---ccccccc-- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEA---QEKDAEVDPVYRDEKIVESFGGFLAE---AEIAGDH-- 72 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~en---q~~~~~~~~~~~~~~~~~~~~~~l~e---a~~~~~~-- 72 (522) -.+.+++..++..++... .++.+...+.-...+.+. .++...+++..+..........+..+ +...... T Consensus 53 ~~~~~~~~~~~~~~~a~~---~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (497) T protein:vir:78 53 HERAQEMLKSLGGADAAK---DGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT 129 (497) T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Confidence 001111222222222210 001000000000000000 00000000100000000000000000 0000000 Q ss_pred -----------ccc----ccc-cccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCC Q lcl|NC_014036. 73 -----------GYD----ATK-IASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDP 135 (522) Q Consensus 73 -----------g~~----~~~-~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~ 135 (522) +-. ... ...+++++... .-|.+. .+++..-+..+..+++.+.||++++. .|.-.. T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~-vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~-------~~~~~~ 201 (497) T protein:vir:78 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPG-ILPTFLPGIVEQLFYELSLADLISSRPVTSPNL-------SYLTES 201 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccc-cchhhhHHHHHHHHhhhhHHhhccccccCCCce-------EEEEEc Confidence 000 000 00111112211 111111 23333334556677888888776531 111000 Q ss_pred CCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccc Q lcl|NC_014036. 136 LASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVI 215 (522) Q Consensus 136 ~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~ 215 (522) .. + T Consensus 202 ~~-----------------~------------------------------------------------------------ 204 (497) T protein:vir:78 202 AA-----------------H------------------------------------------------------------ 204 (497) T ss_pred CC-----------------C------------------------------------------------------------ Confidence 00 0 Q ss_pred ccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_014036. 216 AEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAI 295 (522) Q Consensus 216 ~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNI 295 (522) +-....+|. ...++...+++++++.+|.-+-...+|-||++|-- +.|+.|.+- T Consensus 205 -------------~~a~wv~E~---------~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~ 257 (497) T protein:vir:78 205 -------------NNAAAVAEA---------GTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGR 257 (497) T ss_pred -------------CcceeeccC---------cccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHH Confidence 000011221 12455556678888888888888899999999942 258999999 Q ss_pred HHHHHHHHhhHHHHh---------hhhhccccccc-cccccccccceeeccccccccccchhHH-----HHH-------- Q lcl|NC_014036. 296 LATEIMLEINREIVD---------MINYTAQVGKT-GFTQTVGSKAGAFDFQDPIDVRGARWAG-----ESY-------- 352 (522) Q Consensus 296 LSTEImlEINReii~---------~i~~~a~~~~~-~~~~~~~~~~g~fd~~~~~d~~~~r~~~-----E~~-------- 352 (522) |...|..-+|..||. .++...-.... ++.......+-.+.+....+-. ..|.+ ... T Consensus 258 l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 336 (497) T protein:vir:78 258 LLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT-NGAFVGQDTVASLKYGRVVTG 336 (497) T ss_pred HHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccc-cchhhhhhHHHHHHHHHhhhh Confidence 999999999999983 01111000000 0000000000000000000000 00100 000 Q ss_pred ---------------HHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhh----cccccccccccccccccccccc Q lcl|NC_014036. 353 ---------------KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARI----DSGITPAGQGLQKTLNVDTTKA 413 (522) Q Consensus 353 ---------------r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~----~~~~~~~~~~~~~~~~~d~~~~ 413 (522) ..+...+-..-..+.+...+ .++.+|.+|....+|..+ |...+.+....+.... . T Consensus 337 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~-----~ 410 (497) T protein:vir:78 337 AAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP-----V 410 (497) T ss_pred hhhhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCccccccccc-----c Confidence 11222233333444454444 557788999888777643 1111111111111100 0 Q ss_pred eeEEEecCceEEEecCCCccceEEEEEecCC------CccceeEeecccccccccccCCccccceeeeeeeecc-eecCc Q lcl|NC_014036. 414 VFAGVLGGVYKVYIDQYARGDYFTVGYKGDN------EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGV-GINPF 486 (522) Q Consensus 414 ~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~------~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l-~~nP~ 486 (522) ..-++|.| ++|++.+..+.+=+++|--... ..+-.+-..||.. .+=.+.+=.+=+..|+++ +.+|= T Consensus 411 ~~~~~l~G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------~~f~~n~v~~r~~~r~~~~v~~p~ 483 (497) T protein:vir:78 411 NGGKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG------TDFVDGKVTVRAEERLGLLVYRPS 483 (497) T ss_pred cCCceeec-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccc------hhhhcCcEEEEEEEeecceeeccc Confidence 01125666 7999988888654554421110 0001111222210 111223444555678866 66773 Q ss_pred cccccCCccccccCcc Q lcl|NC_014036. 487 ANSRSQAPSDRITSGM 502 (522) Q Consensus 487 ~~~~~~~~~~~i~~g~ 502 (522) +...-+-.. ...|+ T Consensus 484 A~~~l~~~~--~~~~~ 497 (497) T protein:vir:78 484 AFQLIQLKK--GATGS 497 (497) T ss_pred cEEEEEecC--CccCC Confidence 332111100 01111 No 48 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=86.24 E-value=0.047 Score=27.87 Aligned_cols=351 Identities=13% Similarity=0.041 Sum_probs=136.5 Q ss_pred Ccch----HHHHHhhhhhhccccchhhhcchhhhHH--HHHHhhhHHHHh--------------hhhhh--hc------c Q lcl|NC_014036. 1 MSKK----NELMEKWNDLLESQEGLPDIATKSKKQL--IAAIMEAQEKDA--------------EVDPV--YR------D 52 (522) Q Consensus 1 ~~~~----~~l~~kw~p~l~~~~~~~~i~~~~~~~~--~~~~~enq~~~~--------------~~~~~--~~------~ 52 (522) |++. ++|.+++.-+-+. +-++.+.-+..+ +..+.+.+++.+ .+... .. . T Consensus 1 m~~~~k~l~el~~~~~~~~~~---~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQ---IKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGG 77 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 8875 4555555444321 111111111000 001111111111 10000 00 0 Q ss_pred hhhhhhhcccccccc--------cccccc-ccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 53 EKIVESFGGFLAEAE--------IAGDHG-YDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 53 ~~~~~~~~~~l~ea~--------~~~~~g-~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) +...........+.. ..+.+. ....+...++++.+-.-.-|.++ .++++.-+..+..++|.++||.+++. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~ 157 (395) T protein:vir:43 78 EEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSV 157 (395) T ss_pred cchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCce Confidence 000000000000000 000000 00001111111111111222222 45555566777788888888876532 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) -+. +..... +.. T Consensus 158 ~~~----~~~~~~--------------~~a-------------------------------------------------- 169 (395) T protein:vir:43 158 EYV----RETGFV--------------NNA-------------------------------------------------- 169 (395) T ss_pred EEE----EEecCC--------------Cce-------------------------------------------------- Confidence 110 000000 000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRA 282 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 282 (522) . ..+|. ...++-..+++++++..+.-+-...+|-||.||.- T Consensus 170 ---------------------~--------~v~E~---------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~- 210 (395) T protein:vir:43 170 ---------------------A--------PVSEG---------TQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS- 210 (395) T ss_pred ---------------------e--------eecCC---------ccccccccceeEEEEeeeeEEEeehhhHHHHHhHH- Confidence 0 00110 11233444566666666666677789999999863 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHH Q lcl|NC_014036. 283 VHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKE 362 (522) Q Consensus 283 iHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~ 362 (522) +.++.|.+-|+..+...+|+.||. -. |.. ..-.|++......-... -... ....++..|..+ T Consensus 211 ----~l~~~v~~~la~a~~~~~d~~~l~---G~---g~~------~~~~Gi~~~~~~~~~~~-~~~~-~~~~~~~~i~~~ 272 (395) T protein:vir:43 211 ----ALQSYIDARARYGLMLVEECQLLY---GN---GTG------ANLHGIIPQAQAYAPPS-GVVV-TAEQRIDRIRLA 272 (395) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHh---cc---CCC------Ccccccccccccccccc-cccc-ccchhHHHHHHH Confidence 358999999999999999999882 11 000 00112221110000000 0000 011233344444 Q ss_pred HHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEec Q lcl|NC_014036. 363 ANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKG 442 (522) Q Consensus 363 an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG 442 (522) .+.+. ..+.....+|+||.....|..+- + +.+ .....+... .-.++|.| ++|+++++.+.+=+++|--. T Consensus 273 ~~~~~--~~~~~~~~~vmn~~~~~~l~~lk--d--~~G---~~i~~~~~~-~~~~~l~G-~pVv~~~~~~~~~~~~gd~~ 341 (395) T protein:vir:43 273 ILQAQ--LAEFPASGIVLNPIDWALIELNK--D--AEN---RYIIGSPQN-GTTPTLWR-LPVVETQAITQDEFLTGAFS 341 (395) T ss_pred HHhhc--cccCCCcEEEEcHHHHHHHHHhh--c--cCC---ceecccccc-CCCceecc-eeeEEcCCCCCCcEEEEecc Confidence 44443 23445678999999988886441 1 110 011111111 12356776 79999999886555554311 Q ss_pred CCCccceeEeecccccccccccC-C-cccc---ceeeeeeeeccee-cCccccccCCccc Q lcl|NC_014036. 443 DNEMDAGIYYAPYVALTPLRGSD-P-KNFQ---PVMGFKTRYGVGI-NPFANSRSQAPSD 496 (522) Q Consensus 443 ~~~~d~glfyaPYv~~~~~~~~D-p-~s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~ 496 (522) . ..+.. .-....+...+ . ..|+ =.+-+..|++..+ +|=+.-.-.-..+ T Consensus 342 ~-----~~~~~-~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 342 L-----GAQIF-DRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred c-----eEEEE-EecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 1 00000 00111111111 1 1232 2333445776654 2411111110110 No 49 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=86.20 E-value=0.047 Score=27.85 Aligned_cols=280 Identities=13% Similarity=0.055 Sum_probs=125.0 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccc Q lcl|NC_014036. 79 IASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQ 157 (522) Q Consensus 79 ~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~ 157 (522) .+ +++|.+ .-|.+. .+++.+-++.+..++|.+.||++... +|.-.... +.+ T Consensus 1 ma--~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~-------~ip~~~~~------------~~a----- 52 (298) T protein:vir:16 1 MV--LNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGE-------KVFTFTMD------------SEI----- 52 (298) T ss_pred Cc--ccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEecC------------cce----- Confidence 12 222222 223333 45555666788899999999875321 11100000 000 Q ss_pred ccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhh Q lcl|NC_014036. 158 GAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAEL 237 (522) Q Consensus 158 g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEa 237 (522) + ..+|. T Consensus 53 ------------------------------------------------------------------~--------~v~E~ 58 (298) T protein:vir:16 53 ------------------------------------------------------------------D--------VVAES 58 (298) T ss_pred ------------------------------------------------------------------E--------EecCC Confidence 0 11121 Q ss_pred ccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccc Q lcl|NC_014036. 238 QEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQ 317 (522) Q Consensus 238 l~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~ 317 (522) .++++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|...|...|+..++.-..... T Consensus 59 ---------~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~- 127 (298) T protein:vir:16 59 ---------GKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFHGVNPRL- 127 (298) T ss_pred ---------ccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC- Confidence 123444455667777777777778899999875432 135568889999999999999988883211000 Q ss_pred cccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccc Q lcl|NC_014036. 318 VGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 397 (522) Q Consensus 318 ~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~ 397 (522) |.. .......++......... .......++..|..+...+.+ .+.+...+|++|.....|..+ .+.. T Consensus 128 -g~~---~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~l--kd~~ 194 (298) T protein:vir:16 128 -GTA---SAVIGTNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQ--KDLQ 194 (298) T ss_pred -Ccc---cccccccccccccccccc-----cccccccHHHHHHHHHHHhhh--cCCCccEEEEcHHHHHHHHHh--hccC Confidence 000 000000000000000000 001112233444455444443 123555689999999888754 1111 Q ss_pred cccccccccccccccceeEEEecCceEEEecCCCcc------ceEEEEEecCCCccceeEeeccc--ccccccccCCcc- Q lcl|NC_014036. 398 AGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG------DYFTVGYKGDNEMDAGIYYAPYV--ALTPLRGSDPKN- 468 (522) Q Consensus 398 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~d~glfyaPYv--~~~~~~~~Dp~s- 468 (522) ... ....+.+. .-.|+|.| ++|+++.+.+. +.+++|- - ..++.|..-- ++...+..||+. T Consensus 195 G~~----i~~~~~~~-~~~~~l~G-~PV~~~~~v~~~~~~~~~~~~~GD---f--s~~~~~~~~~~~~~~~~~~~~~~~~ 263 (298) T protein:vir:16 195 DNA----LFPELKWG-ATPDTING-LPVDVNKTVSDMSLTQRDRAIIGD---F--ANGFKWGYAKEVPLEVIQYGDPDNS 263 (298) T ss_pred CCe----eecCcccC-CCCceecc-eeeEEecccccccCCCccEEEEee---c--cceEEEEEecCceEEEeeccCCcCc Confidence 110 00011111 11257888 59999987653 3344441 0 0111222110 111222224432 Q ss_pred ----cc-ceeee--eeeec-ceecCccccccCCccccccCcc Q lcl|NC_014036. 469 ----FQ-PVMGF--KTRYG-VGINPFANSRSQAPSDRITSGM 502 (522) Q Consensus 469 ----~q-P~~~~--~tRY~-l~~nP~~~~~~~~~~~~i~~g~ 502 (522) || =.++| ..|++ ...+|=+ ..++.++. T Consensus 264 ~~~~f~~~~v~~ra~~r~d~~v~~~~a-------~~~l~~at 298 (298) T protein:vir:16 264 GLDLKGYNQVYIRAELFLGWGILDATK-------FARVTEAN 298 (298) T ss_pred chhhhhcCcEEEEEEEEEccEeecccc-------eEEEeecC Confidence 22 11333 45776 4455522 12222222 No 50 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=86.01 E-value=0.049 Score=27.78 Aligned_cols=327 Identities=15% Similarity=0.092 Sum_probs=133.8 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhh-HHHHHHhhhHHHHhhhhhhhcch-----hhhhhhccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKK-QLIAAIMEAQEKDAEVDPVYRDE-----KIVESFGGFLAEAEIAGDHGY 74 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~-~~~~~~~enq~~~~~~~~~~~~~-----~~~~~~~~~l~ea~~~~~~g~ 74 (522) .... +=.|.|.-+.. +|....++ +....+.|-+++.......-+.. .....|..+|..- T Consensus 22 ~~~~-~~~e~~~~~~~------ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-------- 86 (371) T protein:vir:81 22 LLAE-NKIEEAKKLKE------EIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRTR-------- 86 (371) T ss_pred HhhH-HHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHH-------- Confidence 1111 12234555443 23322221 11222333332222222111000 0111222222110 Q ss_pred ccccccccc-ccccccccCcc-hh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccc Q lcl|NC_014036. 75 DATKIASGN-SSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPD 151 (522) Q Consensus 75 ~~~~~~~~t-~tg~v~~~~P~-Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Ead 151 (522) ....+..++ .+|.+. =|. +. .+++.+.++.+..+++.+.||++.++-+.-.+ ..... . T Consensus 87 ~~~a~~~~t~~~gg~~--vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~~~---------------~ 147 (371) T protein:vir:81 87 FRNAMSEGSNQDGGYT--VPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQQT---------------G 147 (371) T ss_pred HHHhhccCCCccCcee--ecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCCc---------------c Confidence 111122222 112211 132 22 46666677888999999999988765543211 11000 0 Q ss_pred ccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccccccccccccccccccc Q lcl|NC_014036. 152 SMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMA 231 (522) Q Consensus 152 t~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~ 231 (522) ..| T Consensus 148 a~~----------------------------------------------------------------------------- 150 (371) T protein:vir:81 148 FVE----------------------------------------------------------------------------- 150 (371) T ss_pred eee----------------------------------------------------------------------------- Confidence 000 Q ss_pred chhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh Q lcl|NC_014036. 232 TSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDM 311 (522) Q Consensus 232 Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~ 311 (522) .+|.. .....+...|.+..++..|.. -...+|-||.+|-. .|.++.|.+.|...|..-+|+.|+.- T Consensus 151 --v~Eg~-~~~~~~~~~f~~i~~~~~k~~-------~~~~iS~ell~ds~----~~l~~~i~~~l~~a~~~~~~~~i~~g 216 (371) T protein:vir:81 151 --VAEGA-AIGEKATPQFTLLQYQVKKYA-------GFFRVTNELLNDST----EAIVNTLVRWIGDESRVTRNGLIINV 216 (371) T ss_pred --ecccc-ccccccccceeeEEeeeeEEE-------EeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 01100 000001123445555555544 45579999999853 46789999999999999999998843 Q ss_pred hhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhh Q lcl|NC_014036. 312 INYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARI 391 (522) Q Consensus 312 i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~ 391 (522) ....+. .|+.+++ ....++... ....+.....+|++|.....|..+ T Consensus 217 ~g~~~~-------------~~~~~~~-------------~i~~~~~~~--------l~~~~~~~a~~vmn~~~~~~L~~l 262 (371) T protein:vir:81 217 LNTKAK-------------TAIADLD-------------GLKQIINVQ--------LDPVFRSTSSVIVNQDAFNWLDTL 262 (371) T ss_pred cccccc-------------cccccHH-------------HHHHHHHhh--------cchhhhcCCEEEEcHHHHHHHHHh Confidence 221111 1222211 111111110 111222345789999999888754 Q ss_pred cccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccc-------cccccc Q lcl|NC_014036. 392 DSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVAL-------TPLRGS 464 (522) Q Consensus 392 ~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~-------~~~~~~ 464 (522) - +.... -....+.+ ....|+|.| ++||+..+.+...-.++--+.+ ..-++|+.+..+ .+.-.+ T Consensus 263 k--d~~g~----~l~~~~~~-~~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~--~~~i~~Gd~~~~~~~~~~~~~~i~~ 332 (371) T protein:vir:81 263 K--DQNGQ----YLLQPSIS-SPTGRQLLG-LPVVIVSNKVLANRVDGGTGAQ--FAPIIVGDLKEAVVMFDRQRTEIMS 332 (371) T ss_pred h--ccCCC----eeeecccC-CCCCceecc-eeEEEecccccCccccccccCC--cceEEEEehhceEEEEeecceEEEE Confidence 1 11100 00001111 123468887 6999887776433221111111 122334432110 000012 Q ss_pred CCc------cccceeeeeeeecce-ecCccccccCCccc Q lcl|NC_014036. 465 DPK------NFQPVMGFKTRYGVG-INPFANSRSQAPSD 496 (522) Q Consensus 465 Dp~------s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~ 496 (522) ++. +-|=.+-...||+.. .||=+...-.-..+ T Consensus 333 ~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 333 SNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred eccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 222 223445555566553 34422111111111 No 51 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=85.67 E-value=0.051 Score=27.67 Aligned_cols=286 Identities=12% Similarity=0.107 Sum_probs=124.0 Q ss_pred ccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccc Q lcl|NC_014036. 80 ASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQG 158 (522) Q Consensus 80 ~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g 158 (522) +-.+++|.+.-- +.+. .+++++-++-+..+++-|.||++.. .+|+-... .+.+. T Consensus 1 mat~~~gg~lvP-~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-------~~~p~~~~------------~~~a~----- 55 (311) T protein:vir:81 1 MVALATGTFQLP-KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-------QQYMTLTA------------PPRGE----- 55 (311) T ss_pred CceecCCceEcc-hhHHHHHHHHHHhcchhhhhcceeecCCCc-------eEEEEEeC------------CceeE----- Confidence 444445544211 2222 5666677788888999999986532 12211100 00000 Q ss_pred cccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhc Q lcl|NC_014036. 159 AAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQ 238 (522) Q Consensus 159 ~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal 238 (522) ..+| T Consensus 56 --------------------------------------------------------------------------wv~E-- 59 (311) T protein:vir:81 56 --------------------------------------------------------------------------VVGE-- 59 (311) T ss_pred --------------------------------------------------------------------------Eeec-- Confidence 0112 Q ss_pred cccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccc Q lcl|NC_014036. 239 EQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQV 318 (522) Q Consensus 239 ~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~ 318 (522) +..+++...++++++..+|.-+-....|-||.|+--. -.++-|++|.+-|+..|...|+.-++.=.....-. T Consensus 60 -------g~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~ 131 (311) T protein:vir:81 60 -------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTGA 131 (311) T ss_pred -------CcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCc Confidence 1123333344456666555555566899999875332 23556888888888888888888887321101110 Q ss_pred ccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccc Q lcl|NC_014036. 319 GKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 398 (522) Q Consensus 319 ~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~ 398 (522) +..++-........+...... . ...++.-|+.+-..+.. .+...+-+|++|.....|..+- +... T Consensus 132 ~~~gi~~~~~~~~~~~~~~~~------~-----~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk--d~~G 196 (311) T protein:vir:81 132 ALSGSPAKILDTTNIVELTTG------T-----SATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQR--DSQG 196 (311) T ss_pred ccccccccccccceeeeeccc------c-----cchHHHHHHHHHHHhhh--cCCCceEEEEcHHHHHHHHhhh--ccCC Confidence 111110000000111111110 0 01122234444444422 2336677899999998887431 1110 Q ss_pred ccccccccccccccceeEEEecCceEEEecCCCccceE------EEEEecCCCc-----c-ceeEeecccccccc--ccc Q lcl|NC_014036. 399 GQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYF------TVGYKGDNEM-----D-AGIYYAPYVALTPL--RGS 464 (522) Q Consensus 399 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~------~vG~KG~~~~-----d-~glfyaPYv~~~~~--~~~ 464 (522) . -... +.......|+|.| ++|+++.+-+..-. .+...+.... | +.+++...-..... +-. T Consensus 197 ~----~l~~-~~~~~~~~~tl~G-~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~ 270 (311) T protein:vir:81 197 R----KLYP-ELGFGTDVASFAG-LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFG 270 (311) T ss_pred C----eeec-CccccCCCceecc-eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccC Confidence 0 0000 1111112467887 69998877653221 1111111110 1 11233322221111 111 Q ss_pred CCcc----ccc-eeee--eeeecce-ecC--ccccccCCccc Q lcl|NC_014036. 465 DPKN----FQP-VMGF--KTRYGVG-INP--FANSRSQAPSD 496 (522) Q Consensus 465 Dp~s----~qP-~~~~--~tRY~l~-~nP--~~~~~~~~~~~ 496 (522) |+.. ||- .++| ..|++.. .+| |+. .+++..+ T Consensus 271 ~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~-l~~a~~~ 311 (311) T protein:vir:81 271 DPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV-VRDADES 311 (311) T ss_pred CCCcchhhhhcCcEEEEEEEEeccEeecccceEE-EEeeccC Confidence 2221 222 1333 4677744 677 332 2222211 No 52 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=83.97 E-value=0.064 Score=27.12 Aligned_cols=294 Identities=13% Similarity=0.082 Sum_probs=118.4 Q ss_pred hhhhhhhcchhhhhhhcccccccccccccccccccccccccccccc-ccCcchh-hHHHHHHhhhhhhhceeeccCCchh Q lcl|NC_014036. 44 AEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAIT-NIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPT 121 (522) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~-~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPT 121 (522) +++...+ ++ +...+++ ++++... ..-|.+. .+++.+....+-.+++-+.||++.+ T Consensus 1 ~~~~~~~--------------~~--------~~~~~~~-t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (320) T protein:vir:10 1 MAAGTAF--------------QV--------DHAQIAQ-TGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTG 57 (320) T ss_pred CCCCccC--------------CH--------HHHHhhc-cccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCc Confidence 1110000 00 0111111 1111111 1223333 3555555566778888888887654 Q ss_pred hhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccc Q lcl|NC_014036. 122 GQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVT 201 (522) Q Consensus 122 GLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~ 201 (522) .-|. +..+. +.+. T Consensus 58 ~~~p----~~~~~---------------~~a~------------------------------------------------ 70 (320) T protein:vir:10 58 QKIP----HWIGD---------------VSAQ------------------------------------------------ 70 (320) T ss_pred eEEE----EEeCC---------------cceE------------------------------------------------ Confidence 2111 11000 0000 Q ss_pred cCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014036. 202 VTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLR 281 (522) Q Consensus 202 ~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLK 281 (522) ..+|. ..+++-..+++++++..|..+..-.+|.||.+|-. T Consensus 71 -------------------------------~v~E~---------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~ 110 (320) T protein:vir:10 71 -------------------------------WIGEG---------DMKPITKGNMTSQNIAPHKIATIFVASAETVRANP 110 (320) T ss_pred -------------------------------EecCC---------ccccccccceeEEEEeeEEEEEeehhhHHHHhcCh Confidence 01110 11233334456677777777778889999999855 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccc----cccccccceeeccccccccccchhHHHHHHHHHH Q lcl|NC_014036. 282 AVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGF----TQTVGSKAGAFDFQDPIDVRGARWAGESYKALLI 357 (522) Q Consensus 282 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~----~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~ 357 (522) .|.|+.|.+.|...|...||+.+|. -...-...+. +.......+.....+ -+..+ .+ T Consensus 111 ----~~l~~~i~~~l~~a~a~~~d~a~l~---G~g~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~---~~-- 171 (320) T protein:vir:10 111 ----ANYLGTMRTKVATAFAMAFDSAALN---GTDSPFPTYLAQTTKSVSLADPGGATASD-------LTAYD---AV-- 171 (320) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhhc---ccCCCCCcccccccccccceecccccccc-------cccHH---HH-- Confidence 5679999999999999999999882 1110000000 000000111110000 01111 11 Q ss_pred HHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEE Q lcl|NC_014036. 358 QIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFT 437 (522) Q Consensus 358 ~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 437 (522) +..+...+ ...+.....+||+|.....|..+-- .....-.+............-+++.| ++|+++++.+.+=.. T Consensus 172 -~~~~~~~~--~~~~~~~~~~v~n~~~~~~L~~lkd--~~G~~l~~~~~~~~~~~~~~~~~i~g-~pv~~~~~~~~~~~~ 245 (320) T protein:vir:10 172 -AVNGLSLL--VNAKKKWTHTLLDDIVEPILNGAKD--KNGRPLFIESTYTDENSPFRAGRIVS-RPTILSDHVADGTTV 245 (320) T ss_pred -HHHHHhhh--hcccCCCcEEEEcHHHHHHHHHhhc--cCCceeeccccccCccccccCceeee-eeeEecCCCCCCceE Confidence 11122222 2233355789999999999974311 11000000000001111222345655 799999887754211 Q ss_pred EEEecCCCccceeEeecccccccc--------cccCCcc-----cc---ceeeeeeeecce-ecC--cccc-ccCCccc Q lcl|NC_014036. 438 VGYKGDNEMDAGIYYAPYVALTPL--------RGSDPKN-----FQ---PVMGFKTRYGVG-INP--FANS-RSQAPSD 496 (522) Q Consensus 438 vG~KG~~~~d~glfyaPYv~~~~~--------~~~Dp~s-----~q---P~~~~~tRY~l~-~nP--~~~~-~~~~~~~ 496 (522) ++-|+-. .+++.-+-..... ...|+.. || =.+=...|+++. .+| |+.- ..-+|.+ T Consensus 246 -~~~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 246 -GYMGDFR---NVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred -EEEeecc---eEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 1111111 1112111111100 0011111 11 112233556543 244 2210 0112222 No 53 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=83.95 E-value=0.064 Score=27.12 Aligned_cols=312 Identities=10% Similarity=-0.002 Sum_probs=121.3 Q ss_pred hccccccccccccccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCC Q lcl|NC_014036. 59 FGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLA 137 (522) Q Consensus 59 ~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~ 137 (522) |...-.-+. ....+...|.+ .-|.++ .+++++.++.+-.+++-+.||+++.- +|.-... T Consensus 1 m~~~~~~a~----------~~~~t~~~g~~--i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-------~~p~~~~- 60 (330) T protein:vir:77 1 MAGSTVPST----------QVALTGDFSAF--LTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGI-------SIPHWTG- 60 (330) T ss_pred Ccccccchh----------hccccCCCcce--echhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEcC- Confidence 221111111 10101111111 123333 46677778888888999999887542 1110000 Q ss_pred CcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccccc Q lcl|NC_014036. 138 SGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAE 217 (522) Q Consensus 138 t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~ 217 (522) .+.+.| T Consensus 61 -----------~~~a~~--------------------------------------------------------------- 66 (330) T protein:vir:77 61 -----------AVSASW--------------------------------------------------------------- 66 (330) T ss_pred -----------CcceeE--------------------------------------------------------------- Confidence 000000 Q ss_pred ccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_014036. 218 QEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILA 297 (522) Q Consensus 218 ~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILS 297 (522) .+| +..+++-..+++++++..|..+-+..+|-||.+|- ..|.|++|.+-|+ T Consensus 67 ----------------v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~ 117 (330) T protein:vir:77 67 ----------------TGE---------AERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIA 117 (330) T ss_pred ----------------ecC---------CCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHH Confidence 011 11234444566777888888888888999999983 5788999999999 Q ss_pred HHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcE Q lcl|NC_014036. 298 TEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNF 377 (522) Q Consensus 298 TEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~ 377 (522) ..|...||+.+|. -.-. +. +..+-..++.+.....+......+ .....++..+.++-..+.+. ....+. T Consensus 118 ~ai~~~~~~~~l~---G~g~-~~----~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~--~~~~~~ 186 (330) T protein:vir:77 118 EAIALKFDAAAIH---GIDK-PS----AFKGYLAETTKVVSLADTNLTTAS-GPQGNAYLAVNNALSLLVNS--GKKWTG 186 (330) T ss_pred HHHHHHHHHHhhc---ccCC-CC----ccccccccccccceeecccccccc-cccchhHHHHHHHHHhhhhc--CCCccE Confidence 9999999999982 1000 00 000000000000000000000000 01123344444554444443 234567 Q ss_pred EEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecc-- Q lcl|NC_014036. 378 IIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPY-- 455 (522) Q Consensus 378 ~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPY-- 455 (522) +||+|.....|..+- +.....-.+............-++|.| ++||++.+.+.+ ...-..-+||.-+ T Consensus 187 ~vmn~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~~~~~~l~G-~PV~~~~~~p~~--------~~~~~~~~~~gd~s~ 255 (330) T protein:vir:77 187 TLLDNVTEPILNTAV--DGNGRPLFVESTYTEQVGAIREGRILG-RPTYVADNVVNG--------TVGNRVVGVMGDFSQ 255 (330) T ss_pred EEEcHHHHHHHHHHh--ccCCceeecCccccccccccCCceecc-eeeEEeccccCC--------CCCCccEEEEEecce Confidence 899999998887431 111000000000000111112246666 799998886631 0000000111110 Q ss_pred -c----ccccccccCCccc---cceeeeeeeecceecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 456 -V----ALTPLRGSDPKNF---QPVMGFKTRYGVGINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 456 -v----~~~~~~~~Dp~s~---qP~~~~~tRY~l~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) + ....+...|--++ ++..+. +.....|-|..... .-|+.---++.- -+-.-+++|-+|.= T Consensus 256 ~~i~~~~~~~i~~~~e~~~~~~~~~~~~--~~~~~~~~f~~~~~---~~r~~~r~d~~v--~~~~a~~~i~~~~~ 323 (330) T protein:vir:77 256 VIWGQIGGLSFDVTDQATLDFGEEQGGV--WVPKLISLWQHNMV---AVRCEAEFAFMV--NDKDAFVKLTDQVA 323 (330) T ss_pred EEEEEecCcEEEEeecceeeeccccccc--ccccccchhhcCcE---EEEEEEEeccEE--ecccceEEEEeccC Confidence 0 0000111111000 000000 00000111111100 001111111110 00111222222222 No 54 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=82.39 E-value=0.077 Score=26.68 Aligned_cols=270 Identities=9% Similarity=0.012 Sum_probs=116.5 Q ss_pred cccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCC Q lcl|NC_014036. 165 FTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGS 244 (522) Q Consensus 165 ~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs 244 (522) +. ...+- -.++..-.... ..+..+.....-.....+....+ ... .|....+..-=.+..++. .... T Consensus 1 ma---~~~T~-~~d~i~Pev~s-~~v~~~~~~~~~~~~~~~~~~~l-----~g~-~G~tv~ip~~~~~g~~~~---~~~g 66 (274) T protein:vir:96 1 MA---QGTTK-VSNLIVPEVLA-PMMQAELDKKLRFAQFADIDSTL-----VGQ-PGDTLTFPAFTYSGDAQV---IAEG 66 (274) T ss_pred CC---ccccc-hhhhhhhHHHH-HHHHHHHHhhhhhcccccccccc-----cCC-CCCEEEEEeeccCCCccc---cCCC Confidence 11 11110 00110000000 00000000000000000000000 000 122222211001112221 1112 Q ss_pred CCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccccccc Q lcl|NC_014036. 245 TGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFT 324 (522) Q Consensus 245 ~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~ 324 (522) ..-++.++.++=. +++-|-|+-.=+++=|. ++..+-|.-.+..+-++..++.+++++|+..+...+.. + T Consensus 67 ~~i~~~~it~~~~--~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~-~---- 135 (274) T protein:vir:96 67 EKIPVDQIGTSKR--EAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-V---- 135 (274) T ss_pred CcCchhhccccee--EEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-c---- Confidence 2334555554443 44445554322333222 23346788999999999999999999999776543211 0 Q ss_pred ccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccc-cccccc Q lcl|NC_014036. 325 QTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP-AGQGLQ 403 (522) Q Consensus 325 ~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~-~~~~~~ 403 (522) .+..++ .+.+-.+..++.++. ...+++||+|.+++.|.......+. ++.. T Consensus 136 -----~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~-- 186 (274) T protein:vir:96 136 -----EADITK-------------LDGLQTAIDKFNDED---------LEPMVLFVNPLDAGGLRTSASDNFTRPTQL-- 186 (274) T ss_pred -----Cccccc-------------HHHHHHHHHHhcccC---------CCceEEEeCHHHHHHHHhcccccccccccc-- Confidence 011111 233333444444322 2568999999999999765433222 2211 Q ss_pred cccccccccceeEEEecCceEEEecCCCccce-EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecce Q lcl|NC_014036. 404 KTLNVDTTKAVFAGVLGGVYKVYIDQYARGDY-FTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG 482 (522) Q Consensus 404 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~ 482 (522) ++ .......+|.+.| ++||+|...|..= +++| +|.-. |+.. -+...-..-||.+++-.|-...+||+. T Consensus 187 --g~-~~~~~g~ig~~~G-~~Vi~s~~~p~~t~~l~~-~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~ 255 (274) T protein:vir:96 187 --GD-NIIVKGAFGEALG-AVIVRSNKLNKGEALLAK-KGAVK-----LITK-RDFFLEKDRDASRKSTALYSDKHYVAY 255 (274) T ss_pred --cc-cceeecccceecC-eeEEEcCCCCcceEEEEe-Cccee-----eeec-CCcccccccchhhcccEEEEeeEEEEE Confidence 11 1112234788876 8999999988633 2332 22211 1111 111111134899999999999999986 Q ss_pred e-cC-ccccccCCcccccc Q lcl|NC_014036. 483 I-NP-FANSRSQAPSDRIT 499 (522) Q Consensus 483 ~-nP-~~~~~~~~~~~~i~ 499 (522) . || =....+-+...++. T Consensus 256 ~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 256 LYDESKVVKITKGAGDEVM 274 (274) T ss_pred EEcCccEEEEEcCcccccC Confidence 5 67 11112222222222 No 55 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=81.66 E-value=0.084 Score=26.49 Aligned_cols=301 Identities=12% Similarity=0.062 Sum_probs=127.8 Q ss_pred cccccccccccc-cccccc-ccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccc Q lcl|NC_014036. 72 HGYDATKIASGN-SSGAIT-NIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMF 148 (522) Q Consensus 72 ~g~~~~~~~~~t-~tg~v~-~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~ 148 (522) .|+++.+..... ++.+.. -.-|.++ .+++++..+.+-.+++-+.||++++. + |.-... T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-----~--ip~~~~------------ 61 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGI-----V--IPHWTG------------ 61 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCce-----E--EEEEcC------------ Confidence 344333322221 111111 1234443 44555555666777888888876541 1 110000 Q ss_pred cccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccc Q lcl|NC_014036. 149 SPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISY 228 (522) Q Consensus 149 Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~ 228 (522) .+.+. T Consensus 62 ~~~a~--------------------------------------------------------------------------- 66 (397) T protein:vir:23 62 DVSAQ--------------------------------------------------------------------------- 66 (397) T ss_pred CcceE--------------------------------------------------------------------------- Confidence 00000 Q ss_pred cccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHH Q lcl|NC_014036. 229 GMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREI 308 (522) Q Consensus 229 Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINRei 308 (522) ..+| +..+++-..+++++++..|..+-.-.+|-||.+|-. .|.|++|.+-|...|...||+.+ T Consensus 67 ----wv~E---------g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~ 129 (397) T protein:vir:23 67 ----WIGE---------GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAA 129 (397) T ss_pred ----EecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 012334445567777777777778889999999863 67799999999999999999999 Q ss_pred HhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHH Q lcl|NC_014036. 309 VDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSAL 388 (522) Q Consensus 309 i~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L 388 (522) |.=-..- .+..++......... +.. ...+..+..+...+.. .+...+-+|+++.....| T Consensus 130 l~G~gt~--~~~~~~~~~~~~~~~--------------~~~---~~~~~~~~~~~~~l~~--~~~~~a~~vmn~~~~~~L 188 (397) T protein:vir:23 130 LHGTNAP--SAFQGYLDQSNKTQS--------------ISP---NAYQGLGVSGLTKLVT--DGKKWTHTLLDDTVEPVL 188 (397) T ss_pred hhcccCC--cccccccccccceee--------------ecc---cchhHHHHHHHHhhhh--cccCCCEEEEcHHHHHHH Confidence 8311110 011111100000000 000 0011112222222222 234567899999999998 Q ss_pred hhhcc----cccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccccccccc Q lcl|NC_014036. 389 ARIDS----GITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGS 464 (522) Q Consensus 389 ~~~~~----~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~ 464 (522) .++-- ..+.+.. +. ........|+|.| ++|+++++.+.+-+ +++.|+-. .+||.-. ....++.. T Consensus 189 ~~lkd~~G~~i~~~~~--~~----~~~~~~~~~tl~G-~Pv~~s~~~~~g~~-~~~~gDfs---~~~i~~~-~~i~i~~~ 256 (397) T protein:vir:23 189 NGSVDANGRPLFVEST--YE----SLTTPFREGRILG-RPTILSDHVAEGDV-VGYAGDFS---QIIWGQV-GGLSFDVT 256 (397) T ss_pred HHhhccCCceeecccc--cc----cccccccCceeee-eeEEEeCCCCCCce-EEEEeecc---eEEEEEE-eceEEEEe Confidence 75410 0111110 00 0011112357766 69999998875321 11222211 1111111 11111111 Q ss_pred ---------CCcc-----c---cceeeeeeeecc-eecC--ccccccCC-ccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 465 ---------DPKN-----F---QPVMGFKTRYGV-GINP--FANSRSQA-PSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 465 ---------Dp~s-----~---qP~~~~~tRY~l-~~nP--~~~~~~~~-~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) |+.. | |=.+=+..|++. ..+| |..-.... +...+. ..-+-......|-++|= T Consensus 257 ~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~ 329 (397) T protein:vir:23 257 DQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYAL------DLDGASAGNFTLSLDGK 329 (397) T ss_pred eeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeee------cccccCcceEEEEecCc Confidence 1110 1 122223345554 2233 11111100 000000 00112233334444333 No 56 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=80.46 E-value=0.095 Score=26.20 Aligned_cols=277 Identities=10% Similarity=-0.002 Sum_probs=121.5 Q ss_pred cccccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccc Q lcl|NC_014036. 69 AGDHGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPM 147 (522) Q Consensus 69 ~~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~ 147 (522) =....+++.+...++ +++. -.-+.+. .+++.+.+.-+-..++.+.||++++...+-.. .. T Consensus 1 m~~~~~~~~~~~~t~-~~~~-lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~--------------- 61 (297) T protein:vir:95 1 MTVQTFNPENVLVSQ-KKDG-TLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQ--TD--------------- 61 (297) T ss_pred CCccccccccccccC-CCcc-eechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEE--cC--------------- Confidence 111122222222121 2221 1222222 45555666777888899999988765543210 00 Q ss_pred ccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccccccccccccccc Q lcl|NC_014036. 148 FSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEIS 227 (522) Q Consensus 148 ~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g 227 (522) .+.+.| T Consensus 62 -~~~a~~------------------------------------------------------------------------- 67 (297) T protein:vir:95 62 -GISAYW------------------------------------------------------------------------- 67 (297) T ss_pred -CceeEE------------------------------------------------------------------------- Confidence 000000 Q ss_pred ccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHH Q lcl|NC_014036. 228 YGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINRE 307 (522) Q Consensus 228 ~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINRe 307 (522) .+| +..+++-..++++++...|..+-.-.+|.||.+|-. .|.+.+|.+-|+..|...+++. T Consensus 68 ------v~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a 128 (297) T protein:vir:95 68 ------VNE---------TEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEA 128 (297) T ss_pred ------eec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHH Confidence 011 011333344556666777777777789999999875 4679999999999999999999 Q ss_pred HHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHH Q lcl|NC_014036. 308 IVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSA 387 (522) Q Consensus 308 ii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~ 387 (522) +|.=.. ..+. .|++.-...... ... ..-.+..|.++...|... +...+.+|++|..... T Consensus 129 ~l~G~g---~~~~----------~gi~~~~~~~~~----~~~--~~~t~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~ 187 (297) T protein:vir:95 129 GLLGHD---TPFA----------NSVAKAAKDANK----VIG--GPINYDNILKLQDALYDA--DVEPNAFVSKIQNRSA 187 (297) T ss_pred HhcccC---Cccc----------ccccccccccce----ecc--cccCHHHHHHHHHHhhhc--cCCcCEEEEcHHHHHH Confidence 982111 0011 112111100000 000 011122344455555443 2244678999999998 Q ss_pred HhhhcccccccccccccccccccccceeEEEecCceEEEecCCCc--cceE--------EEEEecCCCccceeEeecccc Q lcl|NC_014036. 388 LARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYAR--GDYF--------TVGYKGDNEMDAGIYYAPYVA 457 (522) Q Consensus 388 L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy~--------~vG~KG~~~~d~glfyaPYv~ 457 (522) |..+- +.... -.. .. ..++|.| ++|++-+..+ ..-+ ++|..+.-+.+- .. + T Consensus 188 L~~l~--d~~G~----~i~--~~----~~~~l~G-~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~----~~--~ 248 (297) T protein:vir:95 188 LREAR--DGNKV----SIY--DK----AANTIDG-ITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKI----SE--E 248 (297) T ss_pred HHHhh--ccCCc----eee--cC----CCCcccc-eeeEeecCCCCCCceEEEEecccEEEEEecCeEEEE----ee--c Confidence 87431 11100 000 00 1245665 5777544433 1222 233332211100 00 0 Q ss_pred cccccccCCc-----ccc-ceee--eeeeeccee-cCccccccCCccccccCcchH Q lcl|NC_014036. 458 LTPLRGSDPK-----NFQ-PVMG--FKTRYGVGI-NPFANSRSQAPSDRITSGMIT 504 (522) Q Consensus 458 ~~~~~~~Dp~-----s~q-P~~~--~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~ 504 (522) .......|+. -|| =.++ ...|++..+ ||=+. .++....+. T Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~-------~~l~~at~~ 297 (297) T protein:vir:95 249 GQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAF-------AKLTPAERV 297 (297) T ss_pred cccccccccCccchhhhhcCcEEEEEEEEeccEeecccce-------EEEeecCCC Confidence 0000111221 122 1222 335776654 44211 112222222 No 57 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=79.38 E-value=0.11 Score=25.95 Aligned_cols=310 Identities=15% Similarity=0.065 Sum_probs=129.0 Q ss_pred cccccccccccc-ccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCC Q lcl|NC_014036. 60 GGFLAEAEIAGD-HGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLA 137 (522) Q Consensus 60 ~~~l~ea~~~~~-~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~ 137 (522) --.|+|-..... ...++. ..++.++ -.-+.+. .+++.+.+..+-..+|.+.||+++..-|.- +.. T Consensus 1 ~~~~~e~~~~~~~~~~~~~--~~~~~~~---liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~----~~~---- 67 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGR--LAHVPSD---LLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPT----TVK---- 67 (338) T ss_pred CcchHHhhhhhcccccccc--eeccccc---ccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEE----Eec---- Confidence 112223221100 000111 1111111 1222222 456666677888999999999886433322 111 Q ss_pred CcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccccccc Q lcl|NC_014036. 138 SGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAE 217 (522) Q Consensus 138 t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~ 217 (522) .+.+.+-|.+. T Consensus 68 -----------~~~a~~v~~~~---------------------------------------------------------- 78 (338) T protein:vir:78 68 -----------RPEVGQVGVGT---------------------------------------------------------- 78 (338) T ss_pred -----------Cccceeecccc---------------------------------------------------------- Confidence 11111111000 Q ss_pred ccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_014036. 218 QEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILA 297 (522) Q Consensus 218 ~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILS 297 (522) ....+| +...++-.-+++.++...+..+-...+|-||.+|- ..|.|++|.+-|+ T Consensus 79 -------------~~~~~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds----~~~~~~~i~~~la 132 (338) T protein:vir:78 79 -------------SNEQRE---------GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMN----PSGLYTKLQADLA 132 (338) T ss_pred -------------cccccc---------cccccccccceeEEEEEEEEEEEeehhhHHHHhcC----HHHHHHHHHHHHH Confidence 000111 11233333445556666666666777999999983 3678999999999 Q ss_pred HHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcE Q lcl|NC_014036. 298 TEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNF 377 (522) Q Consensus 298 TEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~ 377 (522) ..|...||..||.=.-...--+..++... ....+.... +. -+. ....++..+..+...|...=.+ ..+. T Consensus 133 ~a~~~~~d~~~l~G~g~~~~~~~~gi~~~-~~~~~~~~~----~~---~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~ 201 (338) T protein:vir:78 133 YAIGRGIDLAVFHGKSPLTGSALQGIDTN-NVIVNTTNV----DY---LQT--GTTPLLDRFLDGYDLVSANTDV-DFNG 201 (338) T ss_pred HHHHHHHHHHhhcccCCCccccccccccc-ccccccccc----cc---ccc--cchhhHHHHHHHHHHhhhhccc-cceE Confidence 99999999999832111100001111000 000011000 00 000 0123344444444444332222 5578 Q ss_pred EEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc---------eEEEE--------E Q lcl|NC_014036. 378 IIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD---------YFTVG--------Y 440 (522) Q Consensus 378 ~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG--------~ 440 (522) +|++|+....|..+-.+.-..+. ....+.....-.++|.| ++||++.+-|.+ -+++| . T Consensus 202 ~~m~~~~~~~L~~~~~l~d~~g~----~l~~~~~~~~~~~~l~G-~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~ 276 (338) T protein:vir:78 202 WAADPRYRARLLRSQAYRDANGN----VDPTRINLAASAGDLLG-LPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGF 276 (338) T ss_pred EEEchHHHHHHHHHhhhccCCCc----eeecccccCCCCceeee-eeEEEccccCccccccCCcccEEEEEecceEEEEe Confidence 99999998887643211111100 00001111112357787 599998775521 12333 2 Q ss_pred ecCCCccceeEeecccccccccccCCcc-----cc---ceeeeeeeec-ceecCccccccCCccccccCcchHHh Q lcl|NC_014036. 441 KGDNEMDAGIYYAPYVALTPLRGSDPKN-----FQ---PVMGFKTRYG-VGINPFANSRSQAPSDRITSGMITKE 506 (522) Q Consensus 441 KG~~~~d~glfyaPYv~~~~~~~~Dp~s-----~q---P~~~~~tRY~-l~~nP~~~~~~~~~~~~i~~g~~~~~ 506 (522) .+.-.+ =..+| .......||.. || =.+=...|++ ...||=+. .++.++..-+. T Consensus 277 ~~~~~i----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~-------~~l~~~~~~~~ 338 (338) T protein:vir:78 277 ADEIRV----KMSDT--ATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAF-------VKFVDDEDPDA 338 (338) T ss_pred ecccEE----EEeec--ccccccccccccchhhhhcCcEEEEEEEEeccEeecccce-------EEEecccCCCC Confidence 211110 00011 11111223321 11 1222356777 45566221 11111111111 No 58 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=75.98 E-value=0.14 Score=25.25 Aligned_cols=349 Identities=15% Similarity=0.072 Sum_probs=114.9 Q ss_pred Ccch---HHHHHhhh---hhhccccc---hhhhcchhhhHHHH--HHhhhHHHHhhhhhhhcchhhhhhhcccccccc-- Q lcl|NC_014036. 1 MSKK---NELMEKWN---DLLESQEG---LPDIATKSKKQLIA--AIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAE-- 67 (522) Q Consensus 1 ~~~~---~~l~~kw~---p~l~~~~~---~~~i~~~~~~~~~~--~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~-- 67 (522) |.-+ |+..++|. -|++...+ -++.+..+.+ +.+ .-|+.|.+...+..+-.++.- ........+.. T Consensus 4 ~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~-l~~e~~~l~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 81 (390) T protein:vir:62 4 TTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEER-LITAVSDYDARIKRGIEAIKAIDPVT-SLLSGLQGSGSGA 81 (390) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhcccccccc Confidence 2211 12222232 22321110 0111111110 000 011111111111110000000 00000000000 Q ss_pred ------------ccccccccc-----cccccccccccccccCcchh-hHHHHHH-hhhhhhhceeeccCCchhhhheeee Q lcl|NC_014036. 68 ------------IAGDHGYDA-----TKIASGNSSGAITNIGPAVI-GMVRRAI-PNLIAFDICGVQPMTGPTGQVFALR 128 (522) Q Consensus 68 ------------~~~~~g~~~-----~~~~~~t~tg~v~~~~P~Li-~l~Rra~-~~lI~~DI~GVQPmTGPTGLIFAMR 128 (522) ..+..+..- ......+++++-...-|.+. .++..+. ...+...++-|-||++...+-+.. T Consensus 82 ~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~- 160 (390) T protein:vir:62 82 QRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTV- 160 (390) T ss_pred hhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE- Confidence 000000000 00000111110000001111 1111000 111223333333332221111110 Q ss_pred eeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcc Q lcl|NC_014036. 129 AVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDD 208 (522) Q Consensus 129 SrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~ 208 (522) . ++ T Consensus 161 --~-----------------------------------------------------------------------~~---- 163 (390) T protein:vir:62 161 --I-----------------------------------------------------------------------TG---- 163 (390) T ss_pred --E-----------------------------------------------------------------------cC---- Confidence 0 00 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCCh Q lcl|NC_014036. 209 ALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 288 (522) Q Consensus 209 ~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDA 288 (522) .+-....+|. ..+++-.-++++++..+|..+-...+|-||.+|- .+|. T Consensus 164 -------------------~~~a~wv~E~---------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l 211 (390) T protein:vir:62 164 -------------------RSSASIVGET---------AEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDL 211 (390) T ss_pred -------------------Ccceeeeccc---------ccccccccceeeeEeeeeeEEeehHHHHHHHhhh----hHHH Confidence 0000012221 1234444556777777788888889999999992 4678 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccccc-chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_014036. 289 DAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRG-ARWAGESYKALLIQIDKEANEIA 367 (522) Q Consensus 289 EaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~-~r~~~E~~r~L~~~i~~~an~I~ 367 (522) +++|.+-|+..|..-+|..||. -+-+ -.|++.........- .-.+. .--+-.|+.+-+.+. T Consensus 212 ~~~i~~~l~~~i~~~~d~~~l~---G~G~------------p~Gi~~~~~~~~~~~~~~~~~---~~~~~~l~~~~~~l~ 273 (390) T protein:vir:62 212 VGFLVSDAGPAIGDAMGRHFIT---GTGQ------------PRGILTDASPATATFLATDTD---SKVSDALIDLFHEVP 273 (390) T ss_pred HHHHHHHHHHHHHHHHHhhhhc---cCCc------------cccccccccccccceeccccc---ccchHHHHHHHHhhh Confidence 9999999999999999999983 1100 012221110000000 00000 000112233333333 Q ss_pred HhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCcc Q lcl|NC_014036. 368 RQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMD 447 (522) Q Consensus 368 r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d 447 (522) ..-+. . -..|+++.....|..+- +. .+ +-....+.+. ..-++|.| ++|+++.+.|.+-|++|-- . T Consensus 274 ~~~~~-~-a~~vmn~~~~~~L~~lk--d~--~g--~~l~~~~~~~-g~~~~l~G-~Pv~~~~~~p~~~i~~gd~---s-- 338 (390) T protein:vir:62 274 SAYRA-N-AKYVVNDLRAAQMRKLK--DA--NG--QYLWQSGLTV-GAPSLFNG-KVVETDDGMPADKILFADL---S-- 338 (390) T ss_pred hhhhc-C-CEEEEchHHHHHHHHhh--cc--CC--CeeecCCcCC-Cccceecc-cceEEecCCCCccEEEeec---c-- Confidence 22222 2 24688999888886441 10 00 0000001010 11236787 6999999988655544411 0 Q ss_pred ceeEeeccccc-ccccccCCcc--ccceeeeeeeecce-ecCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 448 AGIYYAPYVAL-TPLRGSDPKN--FQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 448 ~glfyaPYv~~-~~~~~~Dp~s--~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) -|+-..... ...+..|+-. -|=.+=+..|++.. .|| ++ ++.+.||.= T Consensus 339 --~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~--------~A------------------~~~l~~~~~ 389 (390) T protein:vir:62 339 --KYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA--------RG------------------AKVLTVTPG 389 (390) T ss_pred --ceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeech--------hh------------------eEEEEeecC Confidence 011000000 0011112211 11222234455432 233 11 222222222 No 59 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=75.67 E-value=0.14 Score=25.20 Aligned_cols=274 Identities=11% Similarity=-0.026 Sum_probs=112.4 Q ss_pred CCCCCcccchhcccccccccccccccccccccccccccccccccceeeccc----c---cccceeeeeccccccccCCCC Q lcl|NC_014036. 134 DPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDF----V---ETGRVFLQNVSGAPVTVTGST 206 (522) Q Consensus 134 ~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~----~---~~g~~~~~~~~~~p~~~tgt~ 206 (522) ... +.-. ......+...|....... . .......+ ..+....++. T Consensus 1 ma~-------------------~~~~-------~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~---~~~~~~~~~~ 51 (304) T protein:vir:10 1 MAT-------------------PTYT-------PGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMK---LAKNEPMTAQ 51 (304) T ss_pred Ccc-------------------cccc-------cccccccCCCceecchhHHHHHHHHHHhccchhh---hcceeeccCC Confidence 100 0000 000000000011100000 0 00000000 0011111100 Q ss_pred cccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCC Q lcl|NC_014036. 207 DDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGM 286 (522) Q Consensus 207 ~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGL 286 (522) . .... +..+.+-....+| +..+++-.-++++++++.|..+-...+|-||.+|- .+ T Consensus 52 ~--~~ip----------~~~~~~~a~~v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~ 106 (304) T protein:vir:10 52 K--KKFT----------YLAKGVGAYWVSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AK 106 (304) T ss_pred c--eEEE----------EEeCCcceEEeec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hH Confidence 0 0000 0011111112233 23466777778899999999999999999999985 47 Q ss_pred ChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_014036. 287 DADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEI 366 (522) Q Consensus 287 DAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I 366 (522) |.|++|.+-|...|...||+.+|.=--...-.+. + ..+++.-...... ........+.-|+++...+ T Consensus 107 ~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~--~------~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l 173 (304) T protein:vir:10 107 DFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTST--S------GKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATI 173 (304) T ss_pred HHHHHHHHHHHHHHHHHHHhhheeccCCCccccc--c------ccccccccccccc-----ccccccchHHHHHHHHHHh Confidence 7899999999999999999999831111000000 0 0011000000000 0001122344455555555 Q ss_pred HHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc------------ Q lcl|NC_014036. 367 ARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD------------ 434 (522) Q Consensus 367 ~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------ 434 (522) ... +.....+||+|.....|..+- +.... .. .+. ..|+|.| ++||++++.+.+ T Consensus 174 ~~~--~~~~~~~v~~~~~~~~L~~lk--d~~G~-----~l---~~~--~~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~ 238 (304) T protein:vir:10 174 EDE--ELDPNGVLTTRSFRSKMRNAL--DANDR-----PL---FDA--NGNEIMG-LPLSYTGADVYDKKKSLALMGDWD 238 (304) T ss_pred hhc--cCCcCEEEEcHHHHHHHHHhh--ccCCc-----Ee---ecC--CCccccc-eeeEEecccccCCCCcEEEEEehh Confidence 442 224456899999999987431 11100 00 001 1256777 699998887642 Q ss_pred eEEEEEecCCCccceeEeecccccc--cccccCCc-----ccc---ceeeeeeeeccee-cCccccccCCccccccCcc Q lcl|NC_014036. 435 YFTVGYKGDNEMDAGIYYAPYVALT--PLRGSDPK-----NFQ---PVMGFKTRYGVGI-NPFANSRSQAPSDRITSGM 502 (522) Q Consensus 435 y~~vG~KG~~~~d~glfyaPYv~~~--~~~~~Dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~ 502 (522) ++++|..+..+.+ ...+.. +....|++ -|+ =.+=+..||++.+ ||=+. .++...+ T Consensus 239 ~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~-------~~l~~a~ 304 (304) T protein:vir:10 239 YARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAF-------ATLKPTE 304 (304) T ss_pred hEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccce-------EEEEecC Confidence 1333333322110 001110 11111222 122 2333456777653 44111 1222222 No 60 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=75.67 E-value=0.14 Score=25.20 Aligned_cols=274 Identities=11% Similarity=-0.026 Sum_probs=112.4 Q ss_pred CCCCCcccchhcccccccccccccccccccccccccccccccccceeeccc----c---cccceeeeeccccccccCCCC Q lcl|NC_014036. 134 DPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDF----V---ETGRVFLQNVSGAPVTVTGST 206 (522) Q Consensus 134 ~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~----~---~~g~~~~~~~~~~p~~~tgt~ 206 (522) ... +.-. ......+...|....... . .......+ ..+....++. T Consensus 1 ma~-------------------~~~~-------~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~---~~~~~~~~~~ 51 (304) T protein:vir:94 1 MAT-------------------PTYT-------PGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMK---LAKNEPMTAQ 51 (304) T ss_pred Ccc-------------------cccc-------cccccccCCCceecchhHHHHHHHHHHhccchhh---hcceeeccCC Confidence 100 0000 000000000011100000 0 00000000 0011111100 Q ss_pred cccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCC Q lcl|NC_014036. 207 DDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGM 286 (522) Q Consensus 207 ~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGL 286 (522) . .... +..+.+-....+| +..+++-.-++++++++.|..+-...+|-||.+|- .+ T Consensus 52 ~--~~ip----------~~~~~~~a~~v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~ 106 (304) T protein:vir:94 52 K--KKFT----------YLAKGVGAYWVSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AK 106 (304) T ss_pred c--eEEE----------EEeCCcceEEeec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hH Confidence 0 0000 0011111112233 23466777778899999999999999999999985 47 Q ss_pred ChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_014036. 287 DADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEI 366 (522) Q Consensus 287 DAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I 366 (522) |.|++|.+-|...|...||+.+|.=--...-.+. + ..+++.-...... ........+.-|+++...+ T Consensus 107 ~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~--~------~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l 173 (304) T protein:vir:94 107 DFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTST--S------GKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATI 173 (304) T ss_pred HHHHHHHHHHHHHHHHHHHhhheeccCCCccccc--c------ccccccccccccc-----ccccccchHHHHHHHHHHh Confidence 7899999999999999999999831111000000 0 0011000000000 0001122344455555555 Q ss_pred HHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc------------ Q lcl|NC_014036. 367 ARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD------------ 434 (522) Q Consensus 367 ~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------ 434 (522) ... +.....+||+|.....|..+- +.... .. .+. ..|+|.| ++||++++.+.+ T Consensus 174 ~~~--~~~~~~~v~~~~~~~~L~~lk--d~~G~-----~l---~~~--~~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~ 238 (304) T protein:vir:94 174 EDE--ELDPNGVLTTRSFRSKMRNAL--DANDR-----PL---FDA--NGNEIMG-LPLSYTGADVYDKKKSLALMGDWD 238 (304) T ss_pred hhc--cCCcCEEEEcHHHHHHHHHhh--ccCCc-----Ee---ecC--CCccccc-eeeEEecccccCCCCcEEEEEehh Confidence 442 224456899999999987431 11100 00 001 1256777 699998887642 Q ss_pred eEEEEEecCCCccceeEeecccccc--cccccCCc-----ccc---ceeeeeeeeccee-cCccccccCCccccccCcc Q lcl|NC_014036. 435 YFTVGYKGDNEMDAGIYYAPYVALT--PLRGSDPK-----NFQ---PVMGFKTRYGVGI-NPFANSRSQAPSDRITSGM 502 (522) Q Consensus 435 y~~vG~KG~~~~d~glfyaPYv~~~--~~~~~Dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~ 502 (522) ++++|..+..+.+ ...+.. +....|++ -|+ =.+=+..||++.+ ||=+. .++...+ T Consensus 239 ~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~-------~~l~~a~ 304 (304) T protein:vir:94 239 YARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAF-------ATLKPTE 304 (304) T ss_pred hEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccce-------EEEEecC Confidence 1333333322110 001110 11111222 122 2333456777653 44111 1222222 No 61 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=74.86 E-value=0.15 Score=25.05 Aligned_cols=285 Identities=13% Similarity=0.084 Sum_probs=118.2 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccc Q lcl|NC_014036. 79 IASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQ 157 (522) Q Consensus 79 ~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~ 157 (522) .+..++++.. ..-+.+. .+++++-+..+..+++-+.||+.... +|+-.... +.+ T Consensus 1 Mat~tt~~g~-~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~-------~~p~~~~~------------~~a----- 55 (311) T protein:vir:99 1 MATFGTGNLK-NLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNE-------DIITFNGR------------PKA----- 55 (311) T ss_pred CceecCCCce-eccHHHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEeCC------------cee----- Confidence 1212222222 1112222 56666667777788888888765321 12110000 000 Q ss_pred ccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhh Q lcl|NC_014036. 158 GAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAEL 237 (522) Q Consensus 158 g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEa 237 (522) ...+|. T Consensus 56 --------------------------------------------------------------------------~wv~Eg 61 (311) T protein:vir:99 56 --------------------------------------------------------------------------EFVGEG 61 (311) T ss_pred --------------------------------------------------------------------------EEeecC Confidence 001121 Q ss_pred ccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccc Q lcl|NC_014036. 238 QEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQ 317 (522) Q Consensus 238 l~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~ 317 (522) ..+++...++++++..+|.-+-....|-||.|+-.- -..|-+++|.+-|...|+..|++.+|.-.....- T Consensus 62 ---------~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g 131 (311) T protein:vir:99 62 ---------QQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQTLSEAGAEALARALDLGLYHRINPLTG 131 (311) T ss_pred ---------cccccccceeeEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccC Confidence 123444445566666666666688899999763321 1355688999999999999999999843221110 Q ss_pred cccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccc Q lcl|NC_014036. 318 VGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITP 397 (522) Q Consensus 318 ~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~ 397 (522) -+..+...-.........+.. ..+ -.+..-|+.+...+...-.+...+-.|++|+....|..+- +.. T Consensus 132 ~~~~g~~~~~~~~~~~~~~~~------~~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk--d~~ 198 (311) T protein:vir:99 132 TVIPGWSNYLGAASKRVELTA------DTI-----ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTAR--YTD 198 (311) T ss_pred ccccccccccccccceeeccc------ccc-----chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhh--ccC Confidence 111110000000001111110 000 1111222333332222222234466899999999886431 110 Q ss_pred cccccccccccccccceeEEEecCceEEEecCCCcc----------------ceEEEEEecCCCccceeEeecccccc-- Q lcl|NC_014036. 398 AGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG----------------DYFTVGYKGDNEMDAGIYYAPYVALT-- 459 (522) Q Consensus 398 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~----------------dy~~vG~KG~~~~d~glfyaPYv~~~-- 459 (522) + +-....+... ...++|.| ++|++..+-+. +++++|= ...++.|.-..... T Consensus 199 -G---~~l~~~~~~~-~~~~~l~G-~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gd-----f~~~~~~~~~~~~~~~ 267 (311) T protein:vir:99 199 -G---RKKFPELGLG-IGVSSFEG-IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGD-----FANGIHWGVQRDIPVE 267 (311) T ss_pred -C---CeeecCcccC-CCCceecc-eeeEeecccccccccccccchhhccCcceEEEee-----ccccEEEEEecCceEE Confidence 0 0000001111 11356777 58888765431 2223221 01122222111111 Q ss_pred cccccCCccc-----cceeee--eeeecceecCccccccCCccccccCcch Q lcl|NC_014036. 460 PLRGSDPKNF-----QPVMGF--KTRYGVGINPFANSRSQAPSDRITSGMI 503 (522) Q Consensus 460 ~~~~~Dp~s~-----qP~~~~--~tRY~l~~nP~~~~~~~~~~~~i~~g~~ 503 (522) ..+.-|++.. .--++| ..|||..+-+= ...++.++.- T Consensus 268 ~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-------~~v~~~~~~A 311 (311) T protein:vir:99 268 LIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-------RFVVIENAVA 311 (311) T ss_pred EeecCCCCcchhhhhcCcEEEEEEEeecceecCh-------hHeeeecccC Confidence 1111123321 112333 57888654320 1112222221 No 62 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=73.23 E-value=0.17 Score=24.76 Aligned_cols=336 Identities=11% Similarity=0.024 Sum_probs=126.6 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhh-----------------------------hhh--h Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAE-----------------------------VDP--V 49 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~-----------------------------~~~--~ 49 (522) |..+| +.++=.-+++.. -+......+.+ ..+.|.-++.+. +.. . T Consensus 1 m~~~e-~~~~~~~~~~~l---~~~~~~~~~e~-~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~ 75 (379) T protein:vir:10 1 MEALE-IKVALEAIKGQV---DSKSSAQALEV-KGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSE 75 (379) T ss_pred CCHHH-HHHHHHHHHHHH---HHHHHHHHHHH-HHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 65442 444433333210 00000000000 001111111110 000 0 Q ss_pred hcchhhhhhhcccccccc---ccccccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhh Q lcl|NC_014036. 50 YRDEKIVESFGGFLAEAE---IAGDHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQV 124 (522) Q Consensus 50 ~~~~~~~~~~~~~l~ea~---~~~~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLI 124 (522) .......+++........ .....+-... +..+++++....=|.-+ .+++..-....-.+++.|.||++++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~-- 151 (379) T protein:vir:10 76 DKSDSLVKSITENFNDIKEVRNGKSIQVKAV--GDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTY-- 151 (379) T ss_pred ccchhHHHHHHHHHHhHHHHHhhhhhhhhhh--cccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCce-- Confidence 000000001000000000 0000000000 00111111111112211 23333333455667777777766532 Q ss_pred eeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCC Q lcl|NC_014036. 125 FALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTG 204 (522) Q Consensus 125 FAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tg 204 (522) .|.-.. ++.+ T Consensus 152 -----~~~~~~-----------------~~~~------------------------------------------------ 161 (379) T protein:vir:10 152 -----TFVREN-----------------GAGE------------------------------------------------ 161 (379) T ss_pred -----EEEEee-----------------cCCC------------------------------------------------ Confidence 111000 0000 Q ss_pred CCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhc Q lcl|NC_014036. 205 STDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVH 284 (522) Q Consensus 205 t~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiH 284 (522) +-....+| +...+++..++++++..+|.=+--..+|-||.||--. T Consensus 162 ------------------------~~~~~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~-- 206 (379) T protein:vir:10 162 ------------------------GAIGAQVE---------GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPF-- 206 (379) T ss_pred ------------------------cccccccC---------CccccccccceeeeEeeeeeEEeeehhhHHHHhhHHH-- Confidence 00001112 1224555566666666666666667899999999632 Q ss_pred CCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_014036. 285 GMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEAN 364 (522) Q Consensus 285 GLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an 364 (522) .++.|.+-|+..|+.-+|..++.-+...+..+..+ ..+- ...+..+.++.++.. T Consensus 207 ---l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~----------~~~~----------~~~d~i~~~~~~~~~--- 260 (379) T protein:vir:10 207 ---LTSFIPNALRRDYAKAENAAFNAVLAANATASTEI----------ITNK----------NKVEMLINEIAKQEN--- 260 (379) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHhccccccccccccc----------ccCc----------ccHHHHHHHHHhhhh--- Confidence 58999999999999999998886544433222111 1000 012223333333321 Q ss_pred HHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccc-cccccceeEEEecCceEEEecCCCccceEEEEEecC Q lcl|NC_014036. 365 EIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLN-VDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGD 443 (522) Q Consensus 365 ~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~-~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~ 443 (522) .+-..+.+|++|.....|..+- +.....-.+-... .+.+. -+|.| ++|+++++.+...+++|=-.. T Consensus 261 ------~~~~~~~~vmn~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~----~~l~G-~pvv~s~~~~ag~~~~gdf~~ 327 (379) T protein:vir:10 261 ------LDFPVTAIVLRPTDYYDILVTQ--KSVGAGYGLPGVVTQDNGV----LRING-IPLFRATWLAANKYYVGDWTR 327 (379) T ss_pred ------ccCCCCEEEEcHHHHHHHHHhh--ccCCceeccCCccCCCCCc----ceecc-eeeEecCCCCCCceEEeeccc Confidence 2225677999999988886441 1110000000000 01111 14665 799999998865555432111 Q ss_pred CCccceeEeecccccccccc-c----CCccccceeeeeeeecce-ecCccccccCCccccccCcchHHhhccccceeeee Q lcl|NC_014036. 444 NEMDAGIYYAPYVALTPLRG-S----DPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEMFGKNAYFRKV 517 (522) Q Consensus 444 ~~~d~glfyaPYv~~~~~~~-~----Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~ 517 (522) .-+++- ....+.. . +-.+-+=.+=+..|+|+. .+|=+. .++ T Consensus 328 ----~~~~~~---~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~--------------------------v~~ 374 (379) T protein:vir:10 328 ----VTKVTT---EGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAAL--------------------------IFG 374 (379) T ss_pred ----EEEEEE---eceEEEEeecccccccCCcEEEEEEEEeccEEecCccE--------------------------EEE Confidence 111111 1111110 0 112222223334577543 355111 111 Q ss_pred eeccC Q lcl|NC_014036. 518 YVKGL 522 (522) Q Consensus 518 ~Vk~~ 522 (522) -+..| T Consensus 375 ~~~~~ 379 (379) T protein:vir:10 375 DFTAV 379 (379) T ss_pred EecCC Confidence 12222 No 63 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=71.80 E-value=0.19 Score=24.53 Aligned_cols=295 Identities=10% Similarity=0.042 Sum_probs=119.0 Q ss_pred hhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcc-hh-hHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 45 EVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 45 ~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~-Li-~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) ||.+++.+..+.. |...+.+.+.. .+.... .++++... =|. +. .+++.+..+.+..+++-+.||++.+- T Consensus 1 ~~~~~~~~~~~~~-f~~~~~~~~~~-----~a~~~~-~~~~~~~~--iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQKLKLNLQH-FASNNVKPQVF-----NPDNVM-MHEKKDGT--LMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CccchhHHHHHHH-HHHhhhhhhhh-----cccccc-ccCCCcce--echhHHHHHHHHHHhhcchhhhcceeeccCCce Confidence 2222221111100 00011111000 001111 11112211 122 22 45566667788888899999887552 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) -|- +.... +.+. T Consensus 72 ~ip----~~~~~---------------~~a~------------------------------------------------- 83 (324) T protein:vir:97 72 KFT----FWADK---------------PGAY------------------------------------------------- 83 (324) T ss_pred EEE----EEecC---------------ccee------------------------------------------------- Confidence 111 01000 0000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRA 282 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 282 (522) ..+|. ..+++...++++++.+.|.-+.-..+|-||.+|-. T Consensus 84 ------------------------------~v~Eg---------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~- 123 (324) T protein:vir:97 84 ------------------------------WVGEG---------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY- 123 (324) T ss_pred ------------------------------EeccC---------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch- Confidence 00110 11233344455555555555555669999999863 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHH Q lcl|NC_014036. 283 VHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKE 362 (522) Q Consensus 283 iHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~ 362 (522) .|.+++|.+-|+..|...+++.||.---.... ..|++........ . ....-.+..|+++ T Consensus 124 ---~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~------------~~gi~~~~~~~~~-----~-~~~~~~~~~i~~~ 182 (324) T protein:vir:97 124 ---SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF------------GKSIAQSIEKTNK-----V-IKGDFTQDNIIDL 182 (324) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc------------Cccccccccccce-----e-ccccCCHHHHHHH Confidence 56799999999999999999999932111100 0111111000000 0 0000112234444 Q ss_pred HHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc--ceEEEEE Q lcl|NC_014036. 363 ANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG--DYFTVGY 440 (522) Q Consensus 363 an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~~vG~ 440 (522) .+.|.. .+.....+|++|.....|..+-- .. + + ....+.. .++|.| ++|++.+..+. ..+++|- T Consensus 183 ~~~l~~--~~~~~~~~v~n~~~~~~L~~lkd--~~--g--~-~~~~~~~----~~tl~G-~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:97 183 EALLED--DELEANAFISKTQNRSLLRKIVD--PE--T--K-ERIYDRN----SDTLDG-LPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHhhhh--ccCCCCEEEEcHHHHHHHHHhhc--CC--C--c-eeecCCC----Cccccc-eeeEeecCCCCCcceEEEEe Confidence 444443 23345578999999988874411 11 0 0 0111111 246777 58888665442 1233331 Q ss_pred ecCCCccceeEeecccccccccccCCc--------c------cc---ceeeeeeeecc-eecC--ccc-----cccCCcc Q lcl|NC_014036. 441 KGDNEMDAGIYYAPYVALTPLRGSDPK--------N------FQ---PVMGFKTRYGV-GINP--FAN-----SRSQAPS 495 (522) Q Consensus 441 KG~~~~d~glfyaPYv~~~~~~~~Dp~--------s------~q---P~~~~~tRY~l-~~nP--~~~-----~~~~~~~ 495 (522) . +.+++.. .....++..|.. . || =.+=+..||+. ..|| |+. ..+..+. T Consensus 249 ~------~~~~i~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 321 (324) T protein:vir:97 249 F------DKLIYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVP 321 (324) T ss_pred c------ccEEEEE-ecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCC Confidence 1 0011110 011111111100 0 11 12222355654 3344 111 1111122 Q ss_pred ccc Q lcl|NC_014036. 496 DRI 498 (522) Q Consensus 496 ~~i 498 (522) +++ T Consensus 322 ~~~ 324 (324) T protein:vir:97 322 GEV 324 (324) T ss_pred CCC Confidence 333 No 64 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=71.54 E-value=0.19 Score=24.48 Aligned_cols=332 Identities=14% Similarity=0.132 Sum_probs=122.7 Q ss_pred CcchHHHHHhhhhhhccccc---------------hhhhcchhhhHHHHHH------hhhHHHHhhhhhhhcc------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG---------------LPDIATKSKKQLIAAI------MEAQEKDAEVDPVYRD------- 52 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~---------------~~~i~~~~~~~~~~~~------~enq~~~~~~~~~~~~------- 52 (522) | +.++|.++|.-+.+.-+. +.+|.. .+..+ ..+ ++.|.+++.+...... T Consensus 5 m-~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (408) T protein:vir:10 5 L-TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKR-DNEKVRRDALREQLVEAQAEQVVNMREEEKGP 81 (408) T ss_pred c-cHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 4 456788777555432110 111111 11111 111 1122122211110000 Q ss_pred ---------hhhhhhhccccccccccccccc----ccccccccccc-ccccccCcchh--hHHHHHHhhhhhhhceeecc Q lcl|NC_014036. 53 ---------EKIVESFGGFLAEAEIAGDHGY----DATKIASGNSS-GAITNIGPAVI--GMVRRAIPNLIAFDICGVQP 116 (522) Q Consensus 53 ---------~~~~~~~~~~l~ea~~~~~~g~----~~~~~~~~t~t-g~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQP 116 (522) ......|..++. ..++. +...+..++.+ |... =|.-+ .+++.+.......+++.+.| T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~t~~~gg~~--vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 154 (408) T protein:vir:10 82 LNKSENELKDKFVKDFVNMVR-----NPMAFMNTVSSKTETSGSDSAAGLT--IPQDIRTMINTLVRQYDSLQQYVRVES 154 (408) T ss_pred cccchhhhHHHHHHHHHHHhh-----cchhhhhhhhhhhhhcccccCCcee--ccHhHHHHHHHHHHhhchhhhhcceee Confidence 000011111110 00000 11111112211 1110 13222 35556666777889999999 Q ss_pred CCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecc Q lcl|NC_014036. 117 MTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVS 196 (522) Q Consensus 117 mTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~ 196 (522) |+++.|-+--.+ .... .+.+.|- T Consensus 155 ~~~~~~~~~~~~-----~~~~-----------~~~a~~v----------------------------------------- 177 (408) T protein:vir:10 155 VSTSNGSRVYEK-----WTDV-----------TPLTVMD----------------------------------------- 177 (408) T ss_pred ccCCcceEEEee-----cccc-----------ccceeee----------------------------------------- Confidence 998887654221 0000 0000000 Q ss_pred ccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHH Q lcl|NC_014036. 197 GAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVEL 276 (522) Q Consensus 197 ~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~EL 276 (522) +|... ....+...|.+..|++.|.. -...+|-|| T Consensus 178 --------------------------------------~E~~~-~~~~~~~~~~~i~~~~~k~~-------~~~~iS~el 211 (408) T protein:vir:10 178 --------------------------------------AEDGK-IPDLDNPQLTIIKYLIKRYA-------GIITATNTS 211 (408) T ss_pred --------------------------------------cCccc-cccccCcceeeEEeeeeeEE-------eeehhHHHH Confidence 01000 00001123555555555554 445699999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHH- Q lcl|NC_014036. 277 AQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKAL- 355 (522) Q Consensus 277 AQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L- 355 (522) .+|- .+|.+++|.+-|+..|..-+|+.||.-.-... ...++.+++ ....+ T Consensus 212 l~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~------------~~~~~~~~~-------------~l~~~~ 262 (408) T protein:vir:10 212 LKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------KKPTIAKFD-------------DVITMI 262 (408) T ss_pred Hhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------cccccccHH-------------HHHHHH Confidence 9994 46779999999999999999999883221110 112222221 11111 Q ss_pred HHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCC--Ccc Q lcl|NC_014036. 356 LIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQY--ARG 433 (522) Q Consensus 356 ~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~ 433 (522) +..+. ..+-..-.+||+|.....|..+- +.... -....+.+. ...++|.| ++|++-.+ .+. T Consensus 263 ~~~~~---------~~~~~~a~~v~n~~~~~~l~~lk--d~~G~----~i~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~ 325 (408) T protein:vir:10 263 NTAVD---------PAIIATSSLLTNQSGLNKLALVK--TAEGK----YLLEPDPTK-PNSYLIKG-KQVIVVADRWLPN 325 (408) T ss_pred HHhhh---------hhhccCCEEEEcHHHHHHHHHhh--ccCCc----eEeccCcCC-CCCceecc-eeeEEecccccCc Confidence 11111 12112235789999998887551 11100 000001111 11236766 57776322 121 Q ss_pred --------------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecce-----------ecCccc Q lcl|NC_014036. 434 --------------DYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-----------INPFAN 488 (522) Q Consensus 434 --------------dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-----------~nP~~~ 488 (522) ++++++.++.... =+.++.- .+-.+.+=.+-+..||++. .-|.+. T Consensus 326 ~~~~~~~i~~gd~~~~~~~~~~~~~~v----~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~ 395 (408) T protein:vir:10 326 TGSTVYPLYYGDMSQAITLFDRENMSL----LPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) T ss_pred cCCCceEEEEEehhccEEEEEecceEE----EEccccc------chhhcCceEEEEEEeeccEEeccccEEEEEeecccc Confidence 1233333322111 1111100 0001112222233333332 111111 Q ss_pred cccCCccccccCcchH Q lcl|NC_014036. 489 SRSQAPSDRITSGMIT 504 (522) Q Consensus 489 ~~~~~~~~~i~~g~~~ 504 (522) .....+. +..+.. T Consensus 396 ~~~~~~~---~~~~~~ 408 (408) T protein:vir:10 396 QVGNFKT---TTSTAV 408 (408) T ss_pred CCCCCCC---CCcccC Confidence 1111110 001111 No 65 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=71.25 E-value=0.2 Score=24.44 Aligned_cols=348 Identities=13% Similarity=0.068 Sum_probs=134.6 Q ss_pred Ccch-HHHHHhhhhhhccccchhhhcch----------hhhHHH---HHH--hhhHHHHhhhhhhhcchh------hhhh Q lcl|NC_014036. 1 MSKK-NELMEKWNDLLESQEGLPDIATK----------SKKQLI---AAI--MEAQEKDAEVDPVYRDEK------IVES 58 (522) Q Consensus 1 ~~~~-~~l~~kw~p~l~~~~~~~~i~~~----------~~~~~~---~~~--~enq~~~~~~~~~~~~~~------~~~~ 58 (522) |++. ++|.+++.-+++.-+ ++.+. -++.+- +.+ |+-|.+.+.+...-.+.. -..+ T Consensus 1 m~~~~~~l~~~~~~~~~~~~---~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLK---AFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHHHHHHHHHHHHHHHHH---HHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 8887 468888888887432 22111 111110 000 000111111100000000 0000 Q ss_pred hcccccccc----------cc-ccccccccccc---ccccccccc-ccCcchh-hHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 59 FGGFLAEAE----------IA-GDHGYDATKIA---SGNSSGAIT-NIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 59 ~~~~l~ea~----------~~-~~~g~~~~~~~---~~t~tg~v~-~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) .+....+.+ .. +....+..... .++++++-. -.-|.++ .+++++-++.+-.+++.+-||++++. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~ 157 (390) T protein:vir:97 78 VGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALI 157 (390) T ss_pred chhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCce Confidence 000000000 00 00000000000 001111100 0111122 44444555666667777777766552 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) -+.- ..... +... T Consensus 158 ~~~~----~~~~~--------------~~a~------------------------------------------------- 170 (390) T protein:vir:97 158 EYVQ----ETGFV--------------NNAA------------------------------------------------- 170 (390) T ss_pred EEEE----EecCC--------------ccee------------------------------------------------- Confidence 1110 00000 0000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHh Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRA 282 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKA 282 (522) ..+| +..+++-..++++++...|.-+-...+|-||.+|-- T Consensus 171 ------------------------------~v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~- 210 (390) T protein:vir:97 171 ------------------------------IVAE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP- 210 (390) T ss_pred ------------------------------eecC---------CccccccccceeEEEEeeeeEEEeehhhHHHHHhHH- Confidence 0011 011222233345555555555556789999999852 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHH Q lcl|NC_014036. 283 VHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKE 362 (522) Q Consensus 283 iHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~ 362 (522) +.++.|.+-|+..|...||+.||.- . |.. ..-.|++............ .. ...+..|..+ T Consensus 211 ----~l~~~i~~~la~a~~~~~d~a~l~G---~---g~~------~~p~Gi~~~~~~~~~~~~~-~~---~~~~d~~~~~ 270 (390) T protein:vir:97 211 ----QLASYMNNRLIRGLKVKEDAEILRG---T---GAN------DGLLGLIPQATTYAAPTTI-AG---ATRVDQLRLA 270 (390) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhhc---C---CCC------ccccceeeccccccccccc-cc---cchHHHHHHH Confidence 5699999999999999999998831 1 000 0012332211100000000 00 0111112222 Q ss_pred HHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEec Q lcl|NC_014036. 363 ANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKG 442 (522) Q Consensus 363 an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG 442 (522) -..+ ...+...+.+|++|.....|..+- +. . + .....+... .--++|.| ++|++++..+.+-+++|--- T Consensus 271 ~~~~--~~~~~~~~~~v~n~~~~~~L~~lk--d~--~-G--~~l~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~gd~~ 339 (390) T protein:vir:97 271 MLQA--SLAEYPASGIVINPIDWAAIELAK--DA--N-N--QYLIGNARG-TLTPTLWG-LPVVATQAMAPGEFLVGAFD 339 (390) T ss_pred HHhh--ccccCCCCEEEEcHHHHHHHHHhh--cC--C-C--ceeecCccC-CCCceecc-eeeEEcCCCCCCcEEEEecc Confidence 2222 233345678899999998887441 11 1 0 011111111 11246776 69999999887666665210 Q ss_pred CCCccceeEeecccccccccccCC---ccccceeeeeeeeccee-cCccccccCCccccccCc Q lcl|NC_014036. 443 DNEMDAGIYYAPYVALTPLRGSDP---KNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSG 501 (522) Q Consensus 443 ~~~~d~glfyaPYv~~~~~~~~Dp---~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g 501 (522) ..+++...-.++.....+. .+-+=.+-+..||++.+ +|=+.- +|.=+ T Consensus 340 -----~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v-------~~~~a 390 (390) T protein:vir:97 340 -----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALI-------TGSFA 390 (390) T ss_pred -----ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEE-------EEEeC Confidence 0111111101111111111 12222344455777654 342111 11101 No 66 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=69.85 E-value=0.22 Score=24.22 Aligned_cols=270 Identities=11% Similarity=0.006 Sum_probs=113.9 Q ss_pred ccccccccc--ccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccc Q lcl|NC_014036. 155 SGQGAAPSN--GFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMAT 232 (522) Q Consensus 155 SG~g~~~~~--~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~T 232 (522) ..+..+... -++...+... ... .....+ +..+. +....+. .. .|....+..-=.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v-------~~~-~~~~~~-~~~~~--------~~~~~l~-----g~-~G~tv~ip~~~~~ 57 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMM-------QAQ-LEKKLR-FASFA--------EVDSTLQ-----GQ-PGDTLTFPAFVYS 57 (274) T ss_pred CCccceehhheechHHHHHHH-------HHH-HHhhhh-hcccc--------ccccccc-----CC-CCCEEEEEeeccC Confidence 111000000 0000000000 000 000000 00000 0000000 00 1111111110001 Q ss_pred hhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_014036. 233 SVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMI 312 (522) Q Consensus 233 s~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i 312 (522) ..++. ......-++.++. ..+.+++-|-|+-.=+++=| +.+.+ +-|.-.+..+-++..+...++++++..+ T Consensus 58 g~~~~---~~eg~~i~~~~it--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~~~~~~~a~~~d~~~~~~~ 128 (274) T protein:vir:93 58 GDAQV---VAEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEAL 128 (274) T ss_pred CCccc---ccCCCcccccccc--cceeEEEeeeecccccccHH--HHHhh--ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 1111223344444 44445555666532233332 22333 5788999999999999999999999766 Q ss_pred hhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhc Q lcl|NC_014036. 313 NYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID 392 (522) Q Consensus 313 ~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~ 392 (522) ...+.. + ....++ .+.+-.+..++.++. ..+++++|+|.+++.|..-. T Consensus 129 ~~a~~~-~---------~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~ 176 (274) T protein:vir:93 129 MGAKLT-V---------NADITK-------------LNGLQSAIDKFNDED---------LEPMVLFINPLDAGKLRGDA 176 (274) T ss_pred hccccc-c---------cccccC-------------HHHHHHHHHHhhhcc---------CCccEEEeCHHHHHHHHhhh Confidence 443211 1 011121 233333444444321 25689999999999997542 Q ss_pred cccc-ccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeecccccccccccCCccccc Q lcl|NC_014036. 393 SGIT-PAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQP 471 (522) Q Consensus 393 ~~~~-~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP 471 (522) ...+ .++.. ++ +......+|.+.| ++||+|+..|..-..+.-+|. +-|.---+......-|+++++= T Consensus 177 ~~~f~~~s~~----g~-~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~ga------i~~~~~~~~~vE~~Rd~~~~~d 244 (274) T protein:vir:93 177 STNFTRATEL----GD-DIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKGA------VKLILKRDFFLEVARDASTKTT 244 (274) T ss_pred hhcccccccc----cc-cceeecccceecC-eeEEEcCCCCcceEEEEeCCe------EEEEecCCcccccccchhhccc Confidence 2222 22211 11 1222335788876 899999998854433332332 1121011112112348999999 Q ss_pred eeeeeeeecce-ecCccccccCCccccccCcchHHhhccccce Q lcl|NC_014036. 472 VMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEMFGKNAY 513 (522) Q Consensus 472 ~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~ 513 (522) .+-...|||+. .||=.. ..+..+.-.-. | T Consensus 245 ~i~~~~~y~~~~~~~~~~-------v~~t~~~~s~~------~ 274 (274) T protein:vir:93 245 ALYSDKHYVAYLYDESKA-------VKITKGSGSLE------M 274 (274) T ss_pred EEEEEEEEEEEEEcCCce-------EEEeeCccccC------C Confidence 99999999985 355110 01110000001 1 No 67 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=69.84 E-value=0.22 Score=24.22 Aligned_cols=266 Identities=12% Similarity=0.060 Sum_probs=113.3 Q ss_pred ccccceeeccccccc---ceeeeec-cccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccc Q lcl|NC_014036. 174 IADGAIVFHDFVETG---RVFLQNV-SGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPW 249 (522) Q Consensus 174 ~a~g~~a~~~~~~~g---~~~~~~~-~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f 249 (522) .+.+..-..+..... ..+.... ...-+.+.......+ .+. .|...+++.-=....+|. ......-+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l-----~g~-~G~tv~iP~~~~ig~a~~---~~~g~~i~~ 71 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTL-----VGQ-PGDTLTFPAFIYSGDAKV---VAEGEKIPT 71 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccc-----cCC-CCCEEEeeeecCCCcccc---ccCCCccch Confidence 111111111111000 0000000 000000000000000 000 122222211001112221 111122234 Q ss_pred cccceEEEEEEEEEecccccchhhHHHHHHHHhhcC-CChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccccccccccc Q lcl|NC_014036. 250 NEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHG-MDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVG 328 (522) Q Consensus 250 ~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHG-LDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~ 328 (522) .++..+=.+ ++-+-|+ |+ |.+. |+-+..+ -|--.|..+-++..+..+++++++..+..... .+ T Consensus 72 ~~lt~~~~~--~~i~~~~-~a-~~i~---D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-~~-------- 135 (274) T protein:vir:95 72 DILETKKRE--AKIRKIA-KG-TSIS---DEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL-TV-------- 135 (274) T ss_pred hhcccceeE--EEeeeee-cc-eeeh---HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cc-------- Confidence 444444333 3334443 22 2222 5555543 47889999999999999999999976654221 11 Q ss_pred ccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccccccccccc Q lcl|NC_014036. 329 SKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNV 408 (522) Q Consensus 329 ~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~ 408 (522) ....+++ +.+-....++.++.+ .++++|++|.|++.|..-....+.... ..+ . T Consensus 136 -~~~~~~~-------------d~i~~A~~~lgd~~~---------~~~~ivv~p~~~~~L~k~~~~~f~~~s---~~g-~ 188 (274) T protein:vir:95 136 -EADITKL-------------TGLQTAIDKFNDEDL---------EPMVLFISPLDAGKLRGDATTNFTRAT---ELG-D 188 (274) T ss_pred -cccccCH-------------HHHHHHHHHhccccc---------cccEEEeCHHHHHHHHhhccccccccc---ccc-c Confidence 0112221 223333344443321 568999999999999754322222110 000 0 Q ss_pred ccccceeEEEecCceEEEecCCCccce-EEEEEecCCCccceeEeecccccccccc-cCCccccceeeeeeeecce-ecC Q lcl|NC_014036. 409 DTTKAVFAGVLGGVYKVYIDQYARGDY-FTVGYKGDNEMDAGIYYAPYVALTPLRG-SDPKNFQPVMGFKTRYGVG-INP 485 (522) Q Consensus 409 d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~glfyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~-~nP 485 (522) .-..+..+|.+.| ++||+|...+..- +++| +|. -.||.. ....++. =||.+++=.+-..-+||+. .|| T Consensus 189 ~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA-----~~~~~~--~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~ 259 (274) T protein:vir:95 189 DVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGA-----VKLITK--RDFFLETDRDPSTKTTALYSDKHYVAYLYDE 259 (274) T ss_pred cceeccccceecC-eEEEEeCCCCCceEEEEe-ccc-----eeeeec--CCcccccccccccccCEEEEeEEEEEEEEcC Confidence 1112335788887 8999999877432 2222 221 112221 1111222 3899999999999999875 455 Q ss_pred ccccccCCccccccCcchHHhh Q lcl|NC_014036. 486 FANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 486 ~~~~~~~~~~~~i~~g~~~~~~ 507 (522) = .-.++..|+-.-.| T Consensus 260 ~-------~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 260 S-------KAVKITKGSGSLEM 274 (274) T ss_pred C-------cEEEEEcCCccccC Confidence 0 11222222222222 No 68 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=69.84 E-value=0.22 Score=24.22 Aligned_cols=266 Identities=12% Similarity=0.060 Sum_probs=113.3 Q ss_pred ccccceeeccccccc---ceeeeec-cccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccc Q lcl|NC_014036. 174 IADGAIVFHDFVETG---RVFLQNV-SGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPW 249 (522) Q Consensus 174 ~a~g~~a~~~~~~~g---~~~~~~~-~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f 249 (522) .+.+..-..+..... ..+.... ...-+.+.......+ .+. .|...+++.-=....+|. ......-+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l-----~g~-~G~tv~iP~~~~ig~a~~---~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTL-----VGQ-PGDTLTFPAFIYSGDAKV---VAEGEKIPT 71 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccc-----cCC-CCCEEEeeeecCCCcccc---ccCCCccch Confidence 111111111111000 0000000 000000000000000 000 122222211001112221 111122234 Q ss_pred cccceEEEEEEEEEecccccchhhHHHHHHHHhhcC-CChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccccccccccc Q lcl|NC_014036. 250 NEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHG-MDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVG 328 (522) Q Consensus 250 ~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHG-LDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~ 328 (522) .++..+=.+ ++-+-|+ |+ |.+. |+-+..+ -|--.|..+-++..+..+++++++..+..... .+ T Consensus 72 ~~lt~~~~~--~~i~~~~-~a-~~i~---D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-~~-------- 135 (274) T protein:vir:96 72 DILETKKRE--AKIRKIA-KG-TSIS---DEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL-TV-------- 135 (274) T ss_pred hhcccceeE--EEeeeee-cc-eeeh---HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cc-------- Confidence 444444333 3334443 22 2222 5555543 47889999999999999999999976654221 11 Q ss_pred ccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccccccccccc Q lcl|NC_014036. 329 SKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNV 408 (522) Q Consensus 329 ~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~ 408 (522) ....+++ +.+-....++.++.+ .++++|++|.|++.|..-....+.... ..+ . T Consensus 136 -~~~~~~~-------------d~i~~A~~~lgd~~~---------~~~~ivv~p~~~~~L~k~~~~~f~~~s---~~g-~ 188 (274) T protein:vir:96 136 -EADITKL-------------TGLQTAIDKFNDEDL---------EPMVLFISPLDAGKLRGDATTNFTRAT---ELG-D 188 (274) T ss_pred -cccccCH-------------HHHHHHHHHhccccc---------cccEEEeCHHHHHHHHhhccccccccc---ccc-c Confidence 0112221 223333344443321 568999999999999754322222110 000 0 Q ss_pred ccccceeEEEecCceEEEecCCCccce-EEEEEecCCCccceeEeecccccccccc-cCCccccceeeeeeeecce-ecC Q lcl|NC_014036. 409 DTTKAVFAGVLGGVYKVYIDQYARGDY-FTVGYKGDNEMDAGIYYAPYVALTPLRG-SDPKNFQPVMGFKTRYGVG-INP 485 (522) Q Consensus 409 d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~glfyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~-~nP 485 (522) .-..+..+|.+.| ++||+|...+..- +++| +|. -.||.. ....++. =||.+++=.+-..-+||+. .|| T Consensus 189 ~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA-----~~~~~~--~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~ 259 (274) T protein:vir:96 189 DVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGA-----VKLITK--RDFFLETDRDPSTKTTALYSDKHYVAYLYDE 259 (274) T ss_pred cceeccccceecC-eEEEEeCCCCCceEEEEe-ccc-----eeeeec--CCcccccccccccccCEEEEeEEEEEEEEcC Confidence 1112335788887 8999999877432 2222 221 112221 1111222 3899999999999999875 455 Q ss_pred ccccccCCccccccCcchHHhh Q lcl|NC_014036. 486 FANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 486 ~~~~~~~~~~~~i~~g~~~~~~ 507 (522) = .-.++..|+-.-.| T Consensus 260 ~-------~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 260 S-------KAVKITKGSGSLEM 274 (274) T ss_pred C-------cEEEEEcCCccccC Confidence 0 11222222222222 No 69 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=69.01 E-value=0.23 Score=24.09 Aligned_cols=300 Identities=11% Similarity=0.035 Sum_probs=120.3 Q ss_pred hccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcc Q lcl|NC_014036. 15 LESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPA 94 (522) Q Consensus 15 l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~ 94 (522) .+. ++..+..+++....+.+-+ .++ + .+... +.++.. .=|. T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~---------------------~~~-a----------~~~~~-~~~~~~--~iP~ 41 (324) T protein:vir:96 1 MEQ----TQKLKLNLQHFASNNVKPQ---------------------VFN-P----------DNVMM-HEKKDG--TLMN 41 (324) T ss_pred CCc----chhhhHHHHHHHHHhhhhh---------------------hhc-c----------ccccc-cCcCcc--ccch Confidence 110 1111112222111111111 000 1 00110 111111 1132 Q ss_pred hh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccc Q lcl|NC_014036. 95 VI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQ 172 (522) Q Consensus 95 Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~ 172 (522) -+ .+++.+..+....+++-+-||++++- +|.-... .+++. T Consensus 42 ~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-------~~p~~~~------------~~~a~------------------- 83 (324) T protein:vir:96 42 EFTTPILQEVMENSKIMQLGKYEPMEGTEK-------KFTFWAD------------KPGAY------------------- 83 (324) T ss_pred hHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEec------------Cccee------------------- Confidence 22 35555666777788888888876542 1111000 00000 Q ss_pred cccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCcccccc Q lcl|NC_014036. 173 AIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEM 252 (522) Q Consensus 173 ~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EM 252 (522) ..+| +..+++. T Consensus 84 ------------------------------------------------------------~v~E---------g~~~~~~ 94 (324) T protein:vir:96 84 ------------------------------------------------------------WVGE---------GQKIETS 94 (324) T ss_pred ------------------------------------------------------------EecC---------Ccccccc Confidence 0111 1123444 Q ss_pred ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccce Q lcl|NC_014036. 253 GFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAG 332 (522) Q Consensus 253 sFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g 332 (522) ..++++++++.+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|.=--.... ..| T Consensus 95 ~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~------------~~g 158 (324) T protein:vir:96 95 KATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF------------GKS 158 (324) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCc------------Ccc Confidence 45566666666666667779999999864 56799999999999999999999832111110 012 Q ss_pred eeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccccccccccccccc Q lcl|NC_014036. 333 AFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTK 412 (522) Q Consensus 333 ~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~ 412 (522) +.......... ......+..|.++.+.+.. .+...+.+|+||.....|..+-- . .+ ... ..+.. T Consensus 159 i~~~~~~~~~~------~~~~~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d--~--~G--~~~-~~~~~- 222 (324) T protein:vir:96 159 IAQSIEKTNKV------IKGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVD--P--ET--KER-IYDRN- 222 (324) T ss_pred cccccccccee------ccccccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc--c--CC--Cee-ecCCC- Confidence 22111100000 0001112334444444433 33455678999999999875411 1 10 001 11111 Q ss_pred ceeEEEecCceEEEecCCCcc--ceEEEE--------EecCCCccceeEeecccccccccccCCc-----cc---cceee Q lcl|NC_014036. 413 AVFAGVLGGVYKVYIDQYARG--DYFTVG--------YKGDNEMDAGIYYAPYVALTPLRGSDPK-----NF---QPVMG 474 (522) Q Consensus 413 ~~~~G~l~~~~~vy~D~y~~~--dy~~vG--------~KG~~~~d~glfyaPYv~~~~~~~~Dp~-----s~---qP~~~ 474 (522) .++|.| ++|++++.... ..+++| ..+.-..+- ..+..... ..|+. -| |=.+= T Consensus 223 ---~~~l~G-~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:96 223 ---SDSLDG-LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ---CCcccc-eeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcEEEE Confidence 235666 68888776442 223333 332211100 00000000 00110 01 11222 Q ss_pred eeeeeccee-cC--ccccccCCc-cccccCcch Q lcl|NC_014036. 475 FKTRYGVGI-NP--FANSRSQAP-SDRITSGMI 503 (522) Q Consensus 475 ~~tRY~l~~-nP--~~~~~~~~~-~~~i~~g~~ 503 (522) ...||+..+ +| |+. .+.+. ....+-|+- T Consensus 293 ~~~r~d~~v~~~~A~~~-l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 293 ATMHVALHIADDKAFAK-LVPADKRTDSVPGEV 324 (324) T ss_pred EEEEEccEEecccceEE-EecccccCCCCCCCC Confidence 234555432 23 111 00000 000111111 No 70 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=69.01 E-value=0.23 Score=24.09 Aligned_cols=300 Identities=11% Similarity=0.035 Sum_probs=120.3 Q ss_pred hccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcc Q lcl|NC_014036. 15 LESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPA 94 (522) Q Consensus 15 l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~ 94 (522) .+. ++..+..+++....+.+-+ .++ + .+... +.++.. .=|. T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~---------------------~~~-a----------~~~~~-~~~~~~--~iP~ 41 (324) T protein:vir:78 1 MEQ----TQKLKLNLQHFASNNVKPQ---------------------VFN-P----------DNVMM-HEKKDG--TLMN 41 (324) T ss_pred CCc----chhhhHHHHHHHHHhhhhh---------------------hhc-c----------ccccc-cCcCcc--ccch Confidence 110 1111112222111111111 000 1 00110 111111 1132 Q ss_pred hh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccc Q lcl|NC_014036. 95 VI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQ 172 (522) Q Consensus 95 Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~ 172 (522) -+ .+++.+..+....+++-+-||++++- +|.-... .+++. T Consensus 42 ~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-------~~p~~~~------------~~~a~------------------- 83 (324) T protein:vir:78 42 EFTTPILQEVMENSKIMQLGKYEPMEGTEK-------KFTFWAD------------KPGAY------------------- 83 (324) T ss_pred hHHHHHHHHHHhhchhhhhcceeeccCCce-------EEEEEec------------Cccee------------------- Confidence 22 35555666777788888888876542 1111000 00000 Q ss_pred cccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCcccccc Q lcl|NC_014036. 173 AIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEM 252 (522) Q Consensus 173 ~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EM 252 (522) ..+| +..+++. T Consensus 84 ------------------------------------------------------------~v~E---------g~~~~~~ 94 (324) T protein:vir:78 84 ------------------------------------------------------------WVGE---------GQKIETS 94 (324) T ss_pred ------------------------------------------------------------EecC---------Ccccccc Confidence 0111 1123444 Q ss_pred ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccce Q lcl|NC_014036. 253 GFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAG 332 (522) Q Consensus 253 sFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g 332 (522) ..++++++++.+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|.=--.... ..| T Consensus 95 ~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~------------~~g 158 (324) T protein:vir:78 95 KATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF------------GKS 158 (324) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCc------------Ccc Confidence 45566666666666667779999999864 56799999999999999999999832111110 012 Q ss_pred eeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccccccccccccccc Q lcl|NC_014036. 333 AFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTK 412 (522) Q Consensus 333 ~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~ 412 (522) +.......... ......+..|.++.+.+.. .+...+.+|+||.....|..+-- . .+ ... ..+.. T Consensus 159 i~~~~~~~~~~------~~~~~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d--~--~G--~~~-~~~~~- 222 (324) T protein:vir:78 159 IAQSIEKTNKV------IKGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVD--P--ET--KER-IYDRN- 222 (324) T ss_pred cccccccccee------ccccccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc--c--CC--Cee-ecCCC- Confidence 22111100000 0001112334444444433 33455678999999999875411 1 10 001 11111 Q ss_pred ceeEEEecCceEEEecCCCcc--ceEEEE--------EecCCCccceeEeecccccccccccCCc-----cc---cceee Q lcl|NC_014036. 413 AVFAGVLGGVYKVYIDQYARG--DYFTVG--------YKGDNEMDAGIYYAPYVALTPLRGSDPK-----NF---QPVMG 474 (522) Q Consensus 413 ~~~~G~l~~~~~vy~D~y~~~--dy~~vG--------~KG~~~~d~glfyaPYv~~~~~~~~Dp~-----s~---qP~~~ 474 (522) .++|.| ++|++++.... ..+++| ..+.-..+- ..+..... ..|+. -| |=.+= T Consensus 223 ---~~~l~G-~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:78 223 ---SDSLDG-LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ---CCcccc-eeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcEEEE Confidence 235666 68888776442 223333 332211100 00000000 00110 01 11222 Q ss_pred eeeeeccee-cC--ccccccCCc-cccccCcch Q lcl|NC_014036. 475 FKTRYGVGI-NP--FANSRSQAP-SDRITSGMI 503 (522) Q Consensus 475 ~~tRY~l~~-nP--~~~~~~~~~-~~~i~~g~~ 503 (522) ...||+..+ +| |+. .+.+. ....+-|+- T Consensus 293 ~~~r~d~~v~~~~A~~~-l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 293 ATMHVALHIADDKAFAK-LVPADKRTDSVPGEV 324 (324) T ss_pred EEEEEccEEecccceEE-EecccccCCCCCCCC Confidence 234555432 23 111 00000 000111111 No 71 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=68.81 E-value=0.23 Score=24.06 Aligned_cols=349 Identities=15% Similarity=0.103 Sum_probs=130.9 Q ss_pred Cc---chHHHHHhhhhhhccccch-hhhcc------h---hhhHHHHHH------hhhHHHHhhhhhhhcc--------- Q lcl|NC_014036. 1 MS---KKNELMEKWNDLLESQEGL-PDIAT------K---SKKQLIAAI------MEAQEKDAEVDPVYRD--------- 52 (522) Q Consensus 1 ~~---~~~~l~~kw~p~l~~~~~~-~~i~~------~---~~~~~~~~~------~enq~~~~~~~~~~~~--------- 52 (522) |+ +.++|.++|.-+.+.-..+ .++.. . -.+.+.+.+ ++.+++.+.+.+.-.. T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 22 3366888877775421111 11100 0 011111111 1111111111110000 Q ss_pred ----------hhhhhhhccccccccccccc-ccccccccccc-ccccccccCcchh-hHHHHHHhhhhhhhceeeccCCc Q lcl|NC_014036. 53 ----------EKIVESFGGFLAEAEIAGDH-GYDATKIASGN-SSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTG 119 (522) Q Consensus 53 ----------~~~~~~~~~~l~ea~~~~~~-g~~~~~~~~~t-~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTG 119 (522) .....+|..++.-. .... ..+...+..++ ++|.+. .-+.+. .+++.+-++....+++.++||++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~a~~~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 157 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNMVRNP--MAFLNTVSSKTETSGSDSAAGLT-IPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (404) T ss_pred ccccchhhhHHHHHHHHHHHHhcc--hhhhhhhhhhhhhcccccCCcee-ccHHHHHHHHHHHHhhhhHHhhcceeeccC Confidence 00001111111000 0000 00111111122 111111 111121 34444556777888999999998 Q ss_pred hhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccc Q lcl|NC_014036. 120 PTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAP 199 (522) Q Consensus 120 PTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p 199 (522) +++-+--.| .... .+.+.|-+ T Consensus 158 ~~~~~~~~~--~~~~--------------~~~a~~v~------------------------------------------- 178 (404) T protein:vir:39 158 SNGSRVYEK--WTDV--------------TPLTVMDA------------------------------------------- 178 (404) T ss_pred CcceEEEEe--ecCC--------------ccceeeec------------------------------------------- Confidence 876542211 0000 00000000 Q ss_pred cccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHH Q lcl|NC_014036. 200 VTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQD 279 (522) Q Consensus 200 ~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQD 279 (522) +|- ...| .+...|.++.|++.|..+- ..+|-||.+| T Consensus 179 ----------------------------Eg~--~~~~-------~~~~~f~~i~~~~~k~~~~-------~~iS~ell~d 214 (404) T protein:vir:39 179 ----------------------------EDG--KIPD-------LDNPRLTIIKYLIKRYAGI-------ITATNTLLKD 214 (404) T ss_pred ----------------------------Ccc--cccc-------ccccceeeEEeeeeeEEee-------ehhHHHHHhh Confidence 000 0000 1123466667777666654 4499999998 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHH-HH Q lcl|NC_014036. 280 LRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALL-IQ 358 (522) Q Consensus 280 LKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~-~~ 358 (522) - ..|.+++|.+-|+..|..-+|..||.- ... .....+..++++ ...++ .. T Consensus 215 s----~~~l~~~i~~~l~~~~~~~~d~~il~g---~g~---------~~~~~~~~~~~~-------------i~~~~~~~ 265 (404) T protein:vir:39 215 T----AENILAWLSSWIAKKVVVTRNQAIIAA---MGT---------VPKKPTIAKFDD-------------VITMINTS 265 (404) T ss_pred c----hHHHHHHHHHHHHHHHHHHHHHHHHhc---ccc---------cccccccccHHH-------------HHHHHHHh Confidence 4 356799999999999999999999831 111 111223333221 11111 11 Q ss_pred HHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc----- Q lcl|NC_014036. 359 IDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG----- 433 (522) Q Consensus 359 i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~----- 433 (522) + ...+.....+||+|.....|..+= +.... -....+.+. ...++|.| ++|++-.+... T Consensus 266 ~---------~~~~~~~a~~v~n~~~~~~L~~lk--d~~G~----~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~ 328 (404) T protein:vir:39 266 V---------DPAIIATSSLLTNQSGLNKLALVK--TAEGK----YLLEPDPTK-PNSYLIKG-KKVIVVADRWLPNSGS 328 (404) T ss_pred h---------hhhhccCCEEEEcHHHHHHHHHhh--ccCCc----eeeccCcCC-CCcceecc-eeEEEecccccCccCC Confidence 1 111223457899999999888541 11100 000001111 11246777 57776322111 Q ss_pred -ce-EEEE-Eec----CCCccceeEeecccccccccccCCccccceeeeeeeecce-ecCccccccCCccccccCcchHH Q lcl|NC_014036. 434 -DY-FTVG-YKG----DNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITK 505 (522) Q Consensus 434 -dy-~~vG-~KG----~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~ 505 (522) ++ +++| ++. ....+-.+=..+|+.. +=...+=.+-...||+.. .+|-+...-.-...--..|. T Consensus 329 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~--- 399 (404) T protein:vir:39 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGAG------AFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGN--- 399 (404) T ss_pred CccEEEEEeccccEEEEeecceEEEEeccchh------hhhhceeeEEEEeeeccEEecccceEEEEeeccccCCCC--- Confidence 11 2222 010 0000001111111110 011334455566777654 34521110000000000111 Q ss_pred hhccc Q lcl|NC_014036. 506 EMFGK 510 (522) Q Consensus 506 ~~~~~ 510 (522) .-+|| T Consensus 400 ~~~~~ 404 (404) T protein:vir:39 400 FTAGK 404 (404) T ss_pred CCCCC Confidence 12455 No 72 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=66.25 E-value=0.27 Score=23.70 Aligned_cols=347 Identities=12% Similarity=0.092 Sum_probs=124.5 Q ss_pred CcchHHHHHhhhhhhccccchhh-------------------------------hcchhhhHHHHHHhhhHHHHhhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPD-------------------------------IATKSKKQLIAAIMEAQEKDAEVDPV 49 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~-------------------------------i~~~~~~~~~~~~~enq~~~~~~~~~ 49 (522) -...++|..+..-|-+..+.+.+ +....+|+...+.+.+....-. +.. T Consensus 39 ~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~ 117 (434) T protein:vir:62 39 KAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKG-HRT 117 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhcc-ccc Confidence 01112222232222110000000 0000000000000000000000 000 Q ss_pred hcchhhhhhhccccccccccccccc-cccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhhee Q lcl|NC_014036. 50 YRDEKIVESFGGFLAEAEIAGDHGY-DATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFA 126 (522) Q Consensus 50 ~~~~~~~~~~~~~l~ea~~~~~~g~-~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFA 126 (522) ..+.....+|..+|..- ... +...+..++..|.+. =|.-+ .+++..-++.+...++-|.|+++..- |- T Consensus 118 ~~~~e~r~a~~~~l~~~-----~~~~e~~a~~~~t~~GG~l--vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~--~p 188 (434) T protein:vir:62 118 NKETEIRSVFANYIVGN-----IDEKEARALGLVTGNGSVT--IPDFLSKEIITYAQEENFLRRLGTGVKTKENIK--YP 188 (434) T ss_pred hHHHHHHHHHHHHhccc-----cchhhhhhhccccccccee--cchhhHHHHHHhhhhhhhhhhhcceeccCCceE--EE Confidence 00001111122111110 000 011111111111110 13332 25555556667777787777654210 00 Q ss_pred eeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCC Q lcl|NC_014036. 127 LRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGST 206 (522) Q Consensus 127 MRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~ 206 (522) ++...+. +.+- T Consensus 189 ---~~~~~~~---------------a~~~--------------------------------------------------- 199 (434) T protein:vir:62 189 ---VLVKKAE---------------AQGH--------------------------------------------------- 199 (434) T ss_pred ---EEecCCc---------------ccce--------------------------------------------------- Confidence 0100000 0000 Q ss_pred cccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCC Q lcl|NC_014036. 207 DDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGM 286 (522) Q Consensus 207 ~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGL 286 (522) . ..+| +...++-..++++++..+|.-+-...+|-||.+|- .+ T Consensus 200 -----------------~--------~~~e---------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~ 241 (434) T protein:vir:62 200 -----------------K--------NERT---------NNEMPETDIEFDEIELSPTEFDALATVTKKLLART----GL 241 (434) T ss_pred -----------------e--------cccc---------cccccccccceeeEEeeheeeEeehhhHHHHHhcc----hH Confidence 0 0000 11122223356677777777777888999999995 46 Q ss_pred ChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccc-ccccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_014036. 287 DADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQD-PIDVRGARWAGESYKALLIQIDKEANE 365 (522) Q Consensus 287 DAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~-~~d~~~~r~~~E~~r~L~~~i~~~an~ 365 (522) |.+++|.+-|+..|..-+++.||. -. |..+ ...|++.-.. .....+.. .+..| -++-.. T Consensus 242 ~l~~~i~~~la~~~~~~~d~~~l~---G~---G~~~------~~~g~~~~~~~~~~~~~~~----~~d~l----~~l~~~ 301 (434) T protein:vir:62 242 PIEQIVMDELKKAYVRKETQYMVN---GD---EANN------INDGALAKKAVEFKTDEKN----LYDAL----VKMKNT 301 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc---cC---CCCc------cccceeecccccccccccc----hhhHH----HHHHhh Confidence 779999999999999999999992 11 0000 0011110000 00000000 11222 222233 Q ss_pred HHHhccccCCcEEEEchhHHHHHhhhccccccccc-ccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCC Q lcl|NC_014036. 366 IARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ-GLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDN 444 (522) Q Consensus 366 I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~-~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~ 444 (522) +... +.+.-..|++|.....|..+ .+..... |.......+.+ -.+|.| ++|+++.+.+..- .|.. T Consensus 302 l~~~--~~~~a~~v~n~~~~~~L~~l--kd~~G~~l~~~~~~~~~g~----~~tl~G-~pV~~~~~~~~~~-----~~~~ 367 (434) T protein:vir:62 302 PVKE--VRKKARWVLNTAALTKIETM--KTDDGFPLLRPFNQAEGGI----GYTLLG-FPVEEEDAIDIPD-----SPDT 367 (434) T ss_pred cchh--hhcCCEEEEcHHHHHHHHHh--hccCCCEeeccCCCccCCC----Cceecc-eeeEEecCccCcc-----CCCc Confidence 3222 22333568899998888643 1111100 00000000111 125777 6999887765211 0100 Q ss_pred CccceeEe---ecccc------cccccccCC--ccccceeeeeeeecce-e-cCccccccCCccccccCcc Q lcl|NC_014036. 445 EMDAGIYY---APYVA------LTPLRGSDP--KNFQPVMGFKTRYGVG-I-NPFANSRSQAPSDRITSGM 502 (522) Q Consensus 445 ~~d~glfy---aPYv~------~~~~~~~Dp--~s~qP~~~~~tRY~l~-~-nP~~~~~~~~~~~~i~~g~ 502 (522) .-++| +-|.- ....+..++ .+-|=.+..+.|++-. + .|++...-.-+ -++..|. T Consensus 368 ---~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~-~~~~~~~ 434 (434) T protein:vir:62 368 ---PVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYV-LKAPTGA 434 (434) T ss_pred ---eEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEE-eccCCCC Confidence 01111 11111 111111222 2223335556777533 4 38775432211 0112222 No 73 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=65.71 E-value=0.28 Score=23.62 Aligned_cols=343 Identities=14% Similarity=0.143 Sum_probs=119.7 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhh----------------hc-----chhhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPV----------------YR-----DEKIVESF 59 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~----------------~~-----~~~~~~~~ 59 (522) ++.. -.+++..+.. | |++. +.+| .+ +|.+|+..+.... .+ ++.....+ T Consensus 30 lt~e--e~~~~~~l~~--e----i~~l-~~~I-~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (435) T protein:vir:14 30 LSVE--QQAEFDQLSS--K----FSEL-TAQI-ER-AEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKM 98 (435) T ss_pred CCHH--HHHHHHHHHH--H----HHHH-HHHH-HH-HHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHH Confidence 2222 2345555432 1 2221 1111 11 1222221111000 00 00000011 Q ss_pred cccc---cccc--------cccc--ccccccccccccccccccccCcchh------hHHHHHHhhhhhhhc-eeeccCCc Q lcl|NC_014036. 60 GGFL---AEAE--------IAGD--HGYDATKIASGNSSGAITNIGPAVI------GMVRRAIPNLIAFDI-CGVQPMTG 119 (522) Q Consensus 60 ~~~l---~ea~--------~~~~--~g~~~~~~~~~t~tg~v~~~~P~Li------~l~Rra~~~lI~~DI-~GVQPmTG 119 (522) +.++ ..+. .... .+....+.. .+.|.. ....|| .+++++.++.+..++ +-+.||+. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~t~~---~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~ 174 (435) T protein:vir:14 99 ARMVRALAAARGDAQLASKLAIERGFGEEVAMSL-NTLSPG---AGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSN 174 (435) T ss_pred HHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhc-ccCCcC---CCccccchhHHHHHHHHHhhhchhhhhcceeeecCC Confidence 1110 0000 0000 000000000 000000 011121 122222233333332 11222211 Q ss_pred hhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccc Q lcl|NC_014036. 120 PTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAP 199 (522) Q Consensus 120 PTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p 199 (522) .. + +|+. +++ T Consensus 175 ~~-~------~~p~--------------------~~~------------------------------------------- 184 (435) T protein:vir:14 175 GN-I------TIPR--------------------LKG------------------------------------------- 184 (435) T ss_pred Cc-e------EEEE--------------------EeC------------------------------------------- Confidence 00 0 0000 000 Q ss_pred cccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHH Q lcl|NC_014036. 200 VTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQD 279 (522) Q Consensus 200 ~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQD 279 (522) +. ... ..+|. ...++-.-++++++..++..+-....|-||.+| T Consensus 185 ----~~----------------~a~--------~v~E~---------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d 227 (435) T protein:vir:14 185 ----GA----------------IVG--------YIGAD---------TDIPTTQQQFDDLKLTAKKMAALVPIANDLIKY 227 (435) T ss_pred ----Cc----------------cee--------eeccC---------ccccccccceeEEEeeeEEEEEeehhhHHHHHh Confidence 00 000 01121 123444455677777777777788899999999 Q ss_pred HHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccccc-chhHHHHHHHHHHH Q lcl|NC_014036. 280 LRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRG-ARWAGESYKALLIQ 358 (522) Q Consensus 280 LKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~-~r~~~E~~r~L~~~ 358 (522) +.-..+.|+.|.+-|+..|...+|+-|| +-.-.-+ .-.|++....+..+.. ..+ ..+..+... T Consensus 228 --s~~~~~l~~~i~~~l~~ai~~~~d~a~l---~G~G~~~---------~p~Gi~~~~~~~~~~~~~~~--~~~~~~~~~ 291 (435) T protein:vir:14 228 --AGVNPNVDQIVVGDLTAAIGAREDKAFI---RDDGTAN---------TPKGLRFWALPSNVITASDA--STLQKIETD 291 (435) T ss_pred --hccCHHHHHHHHHHHHHHHHHHHHHHhh---ccCCCCc---------cccceeecccccceeccccc--cchhhHHHH Confidence 3223447888999999999988888887 2110000 0112221110000000 000 001112222 Q ss_pred HHHHHHHHHH-hccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc--- Q lcl|NC_014036. 359 IDKEANEIAR-QTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD--- 434 (522) Q Consensus 359 i~~~an~I~r-~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--- 434 (522) +.++-..+.. ...+ .....|++|.....|..+-- ..+ + ....+.+ -|+|.| ++|+++++.|.+ T Consensus 292 ~~~l~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd----~~G--~-~l~~~~~----~g~l~G-~Pv~~~~~~p~~~~~ 358 (435) T protein:vir:14 292 LGKVILALENADANL-TQPGWIMAPRTFRFLEGLRD----GNG--N-KVYPELA----NGMLKG-YPVGKTTQVPINLGE 358 (435) T ss_pred HHHHHHHhhhccccc-cCCEEEEcHHHHHHHHHhhc----cCC--c-eeccCCC----CCeeec-ceeEeeccccccccC Confidence 2222222222 2233 23467999999999875421 110 0 1111111 257777 699998876532 Q ss_pred -----eEE--------EEEecCCCccceeEeecccccccccccCCccc---cceeeeeeeeccee-cCccccccCCcccc Q lcl|NC_014036. 435 -----YFT--------VGYKGDNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGVGI-NPFANSRSQAPSDR 497 (522) Q Consensus 435 -----y~~--------vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~-nP~~~~~~~~~~~~ 497 (522) -++ ||..+.-+ +-..||..........-..| |=.+=...|++..+ +| +... T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~--------~a~~ 426 (435) T protein:vir:14 359 TGKESEIYFTDFGDVFIGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHV--------ESIA 426 (435) T ss_pred CCccceEEEeecccEEEEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCceeecc--------cceE Confidence 122 33332222 22333322111000000001 12233445555532 22 1223 Q ss_pred ccCcchHHh Q lcl|NC_014036. 498 ITSGMITKE 506 (522) Q Consensus 498 i~~g~~~~~ 506 (522) +.+|-+|.. T Consensus 427 ~l~~~~~~~ 435 (435) T protein:vir:14 427 VLAGVAWGA 435 (435) T ss_pred EEecCCCCC Confidence 344555544 No 74 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=64.88 E-value=0.29 Score=23.51 Aligned_cols=218 Identities=10% Similarity=0.048 Sum_probs=98.2 Q ss_pred cccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHH Q lcl|NC_014036. 215 IAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSA 294 (522) Q Consensus 215 ~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsN 294 (522) -++...|...+++.- ...+|.+. ....-+..+|+++=.+.+ .|-+.=.=++|=| ..|.+ +| |--.|..+ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~---eG~~i~~~~l~~t~~~at--Ik~~gk~~~itD~--a~l~~-~g-Dp~~ea~~ 69 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVA---EGGEISLDKIGTTTKSVT--IKKAAKGTEITDE--AALSG-YG-DPIGESNK 69 (231) T ss_pred CccccCCceEEeccc--ccchhhhc---CCCcCChhhccccceeee--EeeeccceeeeHH--HHhhc-cC-chHHHHHH Confidence 000011111111100 11222221 112233555665544444 4444333333322 22555 33 88899999 Q ss_pred HHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccC Q lcl|NC_014036. 295 ILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGA 374 (522) Q Consensus 295 ILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~ 374 (522) -|+..|...+|.||+..+..++. .+ +..+++ +.+..+..++ ..+ -.. T Consensus 70 Q~~~~iA~kvD~di~~~~~~a~l-~~----------~~~~t~----------d~i~~A~~~f---gde---------~~~ 116 (231) T protein:vir:73 70 QLGLSLANKVDDDLLKAAKTTSQ-TV----------STKANV----------DGVQAALDIF---NDE---------DAQ 116 (231) T ss_pred HHHHHHHHhhhHHHHHhhccccc-cc----------cccccH----------HHHHHHHHHh---ccc---------ccc Confidence 99999999999999965543332 11 111111 1121222221 111 135 Q ss_pred CcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeec Q lcl|NC_014036. 375 GNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAP 454 (522) Q Consensus 375 gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaP 454 (522) ..++||+|+++..|...-..+...+.. +.+.-. +-.+|.+.| ++|+++...+. +..++++ T Consensus 117 ~~vivv~p~~~~~Lrk~~~~~~~~~~~---g~~i~~--~G~iG~i~G-~~Vi~S~~~~~--------------~~~~~~~ 176 (231) T protein:vir:73 117 AYVLIVNPKDAAKIRKDANAKNIGSEV---GANALI--NGTYADVLG-AQIVRSKKLAE--------------GSALMFK 176 (231) T ss_pred ceEEEEcchHHHhhhhccchhhhhhhh---ccceee--ecccceEcc-eEEEEcCCCCC--------------Cceeeee Confidence 679999999999987542222221111 111111 224677766 79998877663 2234444 Q ss_pred cccc------cccc------ccCCccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhccccceeeeeeecc Q lcl|NC_014036. 455 YVAL------TPLR------GSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKG 521 (522) Q Consensus 455 Yv~~------~~~~------~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~ 521 (522) |+.. ...+ .-|+..+.-.+--.-.|++.. || .=..++.+|| T Consensus 177 ~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~--------------------------~~vv~~t~~g 230 (231) T protein:vir:73 177 IVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDL--------------------------TKVVNITFTG 230 (231) T ss_pred EEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcC--------------------------ccEEEEEeec Confidence 5320 0000 124444444444444443321 11 0123445566 Q ss_pred C Q lcl|NC_014036. 522 L 522 (522) Q Consensus 522 ~ 522 (522) + T Consensus 231 ~ 231 (231) T protein:vir:73 231 V 231 (231) T ss_pred C Confidence 6 No 75 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=63.31 E-value=0.32 Score=23.30 Aligned_cols=344 Identities=14% Similarity=0.113 Sum_probs=126.7 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhh--hHHHH-hhhhhhhcchhhhhhhcccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIME--AQEKD-AEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDAT 77 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~e--nq~~~-~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~ 77 (522) +.+..+.+++=.-+- -+|... ++. +..... +.++. -.+.....+.....+|..+|...+. .. T Consensus 64 ~~~~~e~~~~~~~~~------~ei~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e~-------~~ 128 (425) T protein:vir:10 64 GLPTSDALAKVDKVS------ADLEAL-QAA-VDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGDV-------QA 128 (425) T ss_pred hhccHHHHHHHHHHH------HHHHHH-HHH-HHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhhh-------HH Confidence 111111222211110 011110 000 000000 00000 0011111122223334333322110 11 Q ss_pred cccccccc-ccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccc Q lcl|NC_014036. 78 KIASGNSS-GAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYS 155 (522) Q Consensus 78 ~~~~~t~t-g~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fS 155 (522) .+..++.+ |.+ -.-+.+. .+++.+-...+..++|.|.||+++..-+.- ... .+.+.|- T Consensus 129 al~~~t~~~gG~-lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~-------~~~------------~~~a~wv 188 (425) T protein:vir:10 129 ALNKGEDSEGGY-LTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLF-------NMG------------GTTSGWV 188 (425) T ss_pred HhhcCcCCCCce-eccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEE-------EcC------------Ccceeee Confidence 12222211 111 1112222 255555567778889999998876542210 000 0000000 Q ss_pred ccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhh Q lcl|NC_014036. 156 GQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVA 235 (522) Q Consensus 156 G~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~a 235 (522) + T Consensus 189 ~------------------------------------------------------------------------------- 189 (425) T protein:vir:10 189 G------------------------------------------------------------------------------- 189 (425) T ss_pred c------------------------------------------------------------------------------- Confidence 0 Q ss_pred hhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhc Q lcl|NC_014036. 236 ELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYT 315 (522) Q Consensus 236 Eal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~ 315 (522) |.. ....+....|.++.|++.|..+ ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||. =. T Consensus 190 E~~-~~~~~~~~~f~~v~~~~~k~~~-------~i~iS~ell~ds----~~~l~~~i~~~la~ai~~~~d~~~l~---G~ 254 (425) T protein:vir:10 190 EAS-QRPQTNAATFQPLSFASGEIYA-------NPAATQQILDDA----EIDLESWLATEVQTEFAKQEGKAFLA---GD 254 (425) T ss_pred ccc-ccccccccccceeeeeheeeEe-------ehHhHHHHHhcc----hhHHHHHHHHHHHHHHHHHHHhhhhc---cc Confidence 100 0000011235666666665544 566999999985 35679999999999999999999883 10 Q ss_pred cccccccc-ccccc-ccce--eeccc-cccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhh Q lcl|NC_014036. 316 AQVGKTGF-TQTVG-SKAG--AFDFQ-DPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAR 390 (522) Q Consensus 316 a~~~~~~~-~~~~~-~~~g--~fd~~-~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~ 390 (522) -.-.-.|+ +...+ ..+. .++.. .........-..+....|+..+.. .+-+....|++|.....|.. T Consensus 255 G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~---------~~~~~a~~vmn~~~~~~L~~ 325 (425) T protein:vir:10 255 GTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPS---------AFTGNARFAMNRNTQRQVRK 325 (425) T ss_pred CCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhh---------hhccCCEEEEchHHHHHHHH Confidence 00000010 00000 0000 00000 000000000011222334333221 22233467899999888874 Q ss_pred hcccccccccccccccccccccceeEEEecCceEEEecCCCcc-----ceEEEEEecCCCccceeEeecccccccccccC Q lcl|NC_014036. 391 IDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG-----DYFTVGYKGDNEMDAGIYYAPYVALTPLRGSD 465 (522) Q Consensus 391 ~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~D 465 (522) +- +.... -....+.+ ....++|.| ++|+++.+.|. +.|++| +-.. ..+...= .......| T Consensus 326 lk--D~~G~----~l~~~~~~-~g~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G---d~~~--~~~i~~~--~~~~v~~d 390 (425) T protein:vir:10 326 LK--DGQGN----YLWQPSYV-AGQPATLAG-YPVTEVPDMPDVAANSTPILFG---DFQQ--TYLIIDR--IGVRVLRD 390 (425) T ss_pred hh--cCCCc----eeeccCcc-CCCCceecc-eeeEEecCcCCccCCccEEEEE---ehhc--cEEEEEe--cceEEEec Confidence 41 11100 00001111 111257877 69999888763 334443 1110 0111100 11111123 Q ss_pred Ccc--ccceeeeeeeecc-eecCccccccCCcccc Q lcl|NC_014036. 466 PKN--FQPVMGFKTRYGV-GINPFANSRSQAPSDR 497 (522) Q Consensus 466 p~s--~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~ 497 (522) |-. .+=.+-...||+. +.+|-+...-.-..++ T Consensus 391 ~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 391 PYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 322 2222333456654 3456444333322222 No 76 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=58.92 E-value=0.4 Score=22.75 Aligned_cols=333 Identities=15% Similarity=0.127 Sum_probs=126.0 Q ss_pred CcchH-----------HHHHhhhhhhccccchhhhcchh-----hhHHHHHHhhhHHHHhhhhhhh-------cchhhhh Q lcl|NC_014036. 1 MSKKN-----------ELMEKWNDLLESQEGLPDIATKS-----KKQLIAAIMEAQEKDAEVDPVY-------RDEKIVE 57 (522) Q Consensus 1 ~~~~~-----------~l~~kw~p~l~~~~~~~~i~~~~-----~~~~~~~~~enq~~~~~~~~~~-------~~~~~~~ 57 (522) ..+++ +.......-+.. .+++.+.. +|....--|.+..+.+...+.. -...+.+ T Consensus 260 ~~ra~~ld~l~~~~~a~~~~~~a~~~~~---~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~ 336 (632) T protein:vir:96 260 QFRALVLERMNPGQPGNFEKPGAGDLPG---KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIAD 336 (632) T ss_pred HHHHHHHHHHhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHH Confidence 22221 011112222211 22222211 1111111111111111111000 0000111 Q ss_pred hhcccccccc-ccccccccccc-ccccc-ccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecC Q lcl|NC_014036. 58 SFGGFLAEAE-IAGDHGYDATK-IASGN-SSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGK 133 (522) Q Consensus 58 ~~~~~l~ea~-~~~~~g~~~~~-~~~~t-~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~ 133 (522) ..|. ++. +.-....-.++ +...| ++|...--...+- .++...-|..|...+ |++.+++.+|-+ +++. T Consensus 337 ~~G~---~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~~~~~~~~g~~-----~ip~ 407 (632) T protein:vir:96 337 ASGK---EARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDV-----DIPK 407 (632) T ss_pred hhhh---hhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh-cceEeecCCcce-----EEEE Confidence 1110 000 00000000000 00011 1111100011110 122222345555554 555444443311 1111 Q ss_pred CCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccc Q lcl|NC_014036. 134 DPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAA 213 (522) Q Consensus 134 ~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~ 213 (522) +.+ + T Consensus 408 ~~~-------------------------------------------------------------------~--------- 411 (632) T protein:vir:96 408 KTS-------------------------------------------------------------------G--------- 411 (632) T ss_pred EeC-------------------------------------------------------------------C--------- Confidence 000 0 Q ss_pred ccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHH Q lcl|NC_014036. 214 VIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELS 293 (522) Q Consensus 214 ~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELs 293 (522) +.....+| +...++-..+++++++.+|+=+-...+|-||..| -.+|.|++|. T Consensus 412 ---------------~~a~wv~E---------~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d----s~~~~~~~i~ 463 (632) T protein:vir:96 412 ---------------ANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ----SSIHVENLIR 463 (632) T ss_pred ---------------ceeEeecC---------CccccccccceeeEEeeeeEEEEehhhHHHHHhc----cchHHHHHHH Confidence 00001112 1124455567778888888888888899998876 3678999999 Q ss_pred HHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccc----ccccchhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_014036. 294 AILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPI----DVRGARWAGESYKALLIQIDKEANEIARQ 369 (522) Q Consensus 294 NILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~----d~~~~r~~~E~~r~L~~~i~~~an~I~r~ 369 (522) +-|...|...+++.+|.= . |..+ + -.|++...... ...+..|. ....|...|... T Consensus 464 ~~l~~a~~~~~d~a~l~G---~---G~~~--~----p~Gi~~~~~~~~~~~~~~~~~~~--~i~~~~~~i~~~------- 522 (632) T protein:vir:96 464 EDLIEGIGVALDLAMLTG---T---GLAN--D----PVGLLNMTGVPALTYPAGGVDWA--SVVDMETKISTF------- 522 (632) T ss_pred HHHHHHHHHHHHHHhhcc---c---CCCC--c----cceeeecccccceecccccCCHH--HHHHHHHHHhhc------- Confidence 999999999999999821 1 1000 0 01222211110 11111232 233333333222 Q ss_pred ccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeE--EEecCceEEEecCCCccceEEEEEecCCCcc Q lcl|NC_014036. 370 TGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFA--GVLGGVYKVYIDQYARGDYFTVGYKGDNEMD 447 (522) Q Consensus 370 T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~--G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d 447 (522) -........|++|.....|...... |.++..-+ |+|.| |+|++.++.+.+-+++|--. T Consensus 523 ~~~~~~~~~~~~~~~~~~l~~~~l~--------------d~~G~~i~~~~~l~G-~pv~~s~~ip~~~~~~gd~s----- 582 (632) T protein:vir:96 523 NADAGRLAYLTSVTQRGAAKKAQVF--------------DNTGERIWQNNEVNG-YRAEASNQIPADTWIFGDWS----- 582 (632) T ss_pred ccccCccEEEEchhHHHHHHHHhcc--------------CCCCceeecCCeecc-cceEeccccccCcEEEeecc----- Confidence 1111224568898877776532111 11111111 56776 79999999886555544210 Q ss_pred ceeEeecccccccccccCC----ccccceeeeeeeecce-ecC--ccccccCC Q lcl|NC_014036. 448 AGIYYAPYVALTPLRGSDP----KNFQPVMGFKTRYGVG-INP--FANSRSQA 493 (522) Q Consensus 448 ~glfyaPYv~~~~~~~~Dp----~s~qP~~~~~tRY~l~-~nP--~~~~~~~~ 493 (522) -+|+.-+-.+. -.+|| .+.+=.+=...|+++. .+| |...+..+ T Consensus 583 -~~~i~~~~~~~--i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 583 -QIVIAMWGVLD--LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred -eEEEEEecceE--EEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 01111110000 01233 3333344456666653 345 33333332 No 77 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=58.39 E-value=0.41 Score=22.68 Aligned_cols=278 Identities=12% Similarity=-0.012 Sum_probs=99.6 Q ss_pred cccccccccccccccccccccccccccccccceeeccccc-------ccceeeeeccccccccCCCCccccccccccccc Q lcl|NC_014036. 147 MFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVE-------TGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQE 219 (522) Q Consensus 147 ~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~-------~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~ 219 (522) |.+ .....+|.+....+.. -.....+ .......++.. ... T Consensus 1 Ma~---------------------~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~---l~~~i~~~~~~--~~i------- 47 (315) T protein:vir:80 1 MAD---------------------DFLSAGKLELPGSMIGAVRDRAIDSGVLAK---LSPEQPTIFGP--VKG------- 47 (315) T ss_pred CCC---------------------CcCCcCceEcchHHHHHHHHHHHhhchhhh---hcceeecCCCc--eEE------- Confidence 111 0001111111111000 0000000 00011110000 000 Q ss_pred ccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_014036. 220 KGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATE 299 (522) Q Consensus 220 ~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTE 299 (522) ....+.+-+...+|. ..+++...+++++++.+|.-+-....|-||.+|. ..|+..+|.++|..+ T Consensus 48 ---p~~~~~~~a~wv~Eg---------~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s----~~~~~~~l~~~i~~~ 111 (315) T protein:vir:80 48 ---AVFSGVPRAKIVGEG---------EVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWAD----ADYRLGVLQDLISPA 111 (315) T ss_pred ---EEEeCCcceEEeeCC---------ccccccccceeeeEeeeeeEEeeehhhHHHhhcC----chhHHHHHHHHHHHH Confidence 011222223344452 3456666677777777777767778999999884 355566666666666 Q ss_pred HHHHhhHHHHhh-hhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEE Q lcl|NC_014036. 300 IMLEINREIVDM-INYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFI 378 (522) Q Consensus 300 ImlEINReii~~-i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~ 378 (522) +...|.|.+=.. ++-+--.+. ....+-...+.-.....+..+.-| .-+.++...+.....+ ..+-. T Consensus 112 la~ai~~~~d~a~~~G~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~---------~d~~~~~~~~~~~~~~-~~~~~ 178 (315) T protein:vir:80 112 LGASIGRAVDLIAFHGIDPATG---KAASAVHTSLNKTKNIVDATDSAT---------ADLVKAVGLIAGAGLQ-VPNGV 178 (315) T ss_pred HHHHHHHHHhhheeeccCCCCC---ccccccccccccccceeeccccch---------HHHHHHHHHHhhccCc-cceEE Confidence 666555555432 221110000 000000000000000011111111 1222222223222222 33568 Q ss_pred EEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc---------eE--------EEEEe Q lcl|NC_014036. 379 IASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD---------YF--------TVGYK 441 (522) Q Consensus 379 v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~--------~vG~K 441 (522) |++|+....|..+=....... +.+......... -.++|.| ++|+++.+.+.+ .+ .+|+. T Consensus 179 imn~~~~~~L~~l~~~~g~~~-~g~~~~~~~~~g--~~~tl~G-~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~ 254 (315) T protein:vir:80 179 ALDPAFSFALSTEVYPKGSPL-AGQPMYPAAGFA--GLDNWRG-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQ 254 (315) T ss_pred EEcHHHHHHHHHHhhccCCcc-cccccccccccC--CCceecc-eeeEecCcCCcccccccccccEEEEeecccEEEEEe Confidence 899999998875511111110 111111000111 1257887 699998887631 12 22222 Q ss_pred cCCCccceeEeecccccccccccCCc----c-ccc-eeeee--eeecce-ecC--ccc-cccCCccccccCcc Q lcl|NC_014036. 442 GDNEMDAGIYYAPYVALTPLRGSDPK----N-FQP-VMGFK--TRYGVG-INP--FAN-SRSQAPSDRITSGM 502 (522) Q Consensus 442 G~~~~d~glfyaPYv~~~~~~~~Dp~----s-~qP-~~~~~--tRY~l~-~nP--~~~-~~~~~~~~~i~~g~ 502 (522) +... +-..+| .|++ + ||. .++|. .|+|.. .+| |.. ...-+|......+. T Consensus 255 ~~~~----i~i~~~--------~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 255 RNFP----IELIEY--------GDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred cCee----EEEecc--------ccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCCC Confidence 2111 111122 1111 1 221 13332 455433 455 211 00111222222222 No 78 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=57.35 E-value=0.44 Score=22.56 Aligned_cols=348 Identities=11% Similarity=0.060 Sum_probs=117.6 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhh-------hhcchhhhhhh----cccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDP-------VYRDEKIVESF----GGFLAEAEIA 69 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~-------~~~~~~~~~~~----~~~l~ea~~~ 69 (522) |-+.+++.+|...+-.. . +-.+...-+..-..+.+++..+.+.++. ..++......+ ...|.+.+-- T Consensus 1 ik~L~e~~~e~~e~~~~-~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~ 78 (390) T protein:vir:40 1 MNNLDKKDSETLNISTA-F-LNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESK 78 (390) T ss_pred CchHHHHHHHHHHHHHH-H-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHH Confidence 66665555555443321 1 1222222211111122222111111110 00000000000 1111111100 Q ss_pred cccccccccccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcc Q lcl|NC_014036. 70 GDHGYDATKIASGN-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHP 146 (522) Q Consensus 70 ~~~g~~~~~~~~~t-~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~ 146 (522) ..+ ..+...+ +.|.. .=|.-+ .+++.+-..-+-.++|-+.||++....|.. ....+ T Consensus 79 ---~~~-~~~~~~~~~~gg~--lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~----~~~~~----------- 137 (390) T protein:vir:40 79 ---YYN-EVIAGNGFAGVTA--LLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIIS----VGDVA----------- 137 (390) T ss_pred ---HHH-HHHhccCcccCcc--cccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEE----EcCCc----------- Confidence 000 0001111 11111 112211 233333334455677888888774433321 00000 Q ss_pred cccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccc Q lcl|NC_014036. 147 MFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEI 226 (522) Q Consensus 147 ~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~ 226 (522) .+.|-+. T Consensus 138 ----~a~~~~E--------------------------------------------------------------------- 144 (390) T protein:vir:40 138 ----TAWWGPL--------------------------------------------------------------------- 144 (390) T ss_pred ----ceeeecc--------------------------------------------------------------------- Confidence 0000000 Q ss_pred cccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhH Q lcl|NC_014036. 227 SYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINR 306 (522) Q Consensus 227 g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINR 306 (522) + ++ .-..+...|.+..|++.|..+- ...|-||.+|-- .|.|++|.+.|+..|..-+|+ T Consensus 145 --~-----~~----~~~~~~~~f~~i~l~~~k~~~~-------i~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~~ 202 (390) T protein:vir:40 145 --C-----AE----IKEVLDNGFDKIQTGMYKLSAY-------IPVCNAMLDLGP----SWLDQYVRTILGEAMALGLEA 202 (390) T ss_pred --c-----cc----cCccccccceeeEeeeeeEEEe-------ehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHh Confidence 0 00 0001123477777777776543 458899999863 467999999999999999999 Q ss_pred HHHhh---------hhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcE Q lcl|NC_014036. 307 EIVDM---------INYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNF 377 (522) Q Consensus 307 eii~~---------i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~ 377 (522) .||.= ++..+.... + ......++.+...+ ....+..|...+..... .. ++.+. T Consensus 203 a~l~G~G~~~P~Gil~~~~~~~~-~--~~~~~~~~~~t~~~---------~~~~~~~l~~~~~~~~~----~~-~~~a~- 264 (390) T protein:vir:40 203 GIVNGSGKDQPIGMMRDLNNVTA-G--EHPVKTATPLTDLT---------PATLATKVMLPLTDNGK----KS-VSDAI- 264 (390) T ss_pred hhhcccCCCccceeeeccccccc-c--ccccccccccchhh---------HHHHHHHHHHHhhcchh----hh-hcCce- Confidence 99930 111100000 0 00000011111000 11122222222211111 11 12333 Q ss_pred EEEchhH-HHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEe--------cCCCccc Q lcl|NC_014036. 378 IIASRNV-VSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYK--------GDNEMDA 448 (522) Q Consensus 378 ~v~S~~v-a~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K--------G~~~~d~ 448 (522) .||+|.. +..|..+-.+ .|..+....+.+.-+++|+++++.+.+-++.|-- +....+- T Consensus 265 ~i~n~~t~~~~l~~~~~~-------------~d~~G~~v~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~ 331 (390) T protein:vir:40 265 LVINPADYWSKIYAATSY-------------MTPQGVWVTGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRT 331 (390) T ss_pred EEEcchhHHHHHHHHhhc-------------cCCCCccccccCCCceeEEEcCCCCCCcEEEEeeceEEEEeecceEEEe Confidence 4566553 3334311000 1111222222333457999999988654444422 1111110 Q ss_pred ee--Eeecc-------cccccccccCCccccceeeeeeee-cceecCccccccCCccccccCcc Q lcl|NC_014036. 449 GI--YYAPY-------VALTPLRGSDPKNFQPVMGFKTRY-GVGINPFANSRSQAPSDRITSGM 502 (522) Q Consensus 449 gl--fyaPY-------v~~~~~~~~Dp~s~qP~~~~~tRY-~l~~nP~~~~~~~~~~~~i~~g~ 502 (522) +- +|. + ..-......||+.|. ++=++.== .-.+.||....+-.++ ..+. T Consensus 332 ~~~~~f~-~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 390 (390) T protein:vir:40 332 STEYRLL-DDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAIDVNVVNNATPSE---TPAE 390 (390) T ss_pred cchhhhh-cCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCCcceeeCCCCCC---CCCC Confidence 00 000 0 000000113444333 00000000 0122233331111111 0111 No 79 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=55.80 E-value=0.47 Score=22.38 Aligned_cols=304 Identities=12% Similarity=0.018 Sum_probs=126.2 Q ss_pred hhhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhhh Q lcl|NC_014036. 44 AEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQ 123 (522) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGL 123 (522) +.-||-=.. .+|. ..+.+.+..+++++.-.--.+.+=.+++.+.+..+-..++-+.||++++. T Consensus 1 ~~~~~~r~~--------~~~~--------~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~- 63 (326) T protein:vir:42 1 MAVNPDRTT--------PFLG--------VNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQ- 63 (326) T ss_pred CCCCccchh--------hhcC--------cchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCce- Confidence 211110000 0011 11112222222211111111111145555556666777888888876542 Q ss_pred heeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccC Q lcl|NC_014036. 124 VFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVT 203 (522) Q Consensus 124 IFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~t 203 (522) +|+-... .+... T Consensus 64 ------~~p~~~~------------~~~a~-------------------------------------------------- 75 (326) T protein:vir:42 64 ------KIPHWTG------------DVSAS-------------------------------------------------- 75 (326) T ss_pred ------EEEEEeC------------CcceE-------------------------------------------------- Confidence 1110000 00000 Q ss_pred CCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhh Q lcl|NC_014036. 204 GSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAV 283 (522) Q Consensus 204 gt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAi 283 (522) . .+| +..++|-..+++++++.+|..+-.-.+|-||.+|- T Consensus 76 ---------------------~--------v~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s--- 114 (326) T protein:vir:42 76 ---------------------W--------IGE---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN--- 114 (326) T ss_pred ---------------------E--------ecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcC--- Confidence 0 011 11244555667788888888888889999999984 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHH Q lcl|NC_014036. 284 HGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEA 363 (522) Q Consensus 284 HGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~a 363 (522) ..|.+++|.+-|+..|+..+++.+|. -.-.-...+..... ...+...... .... .......+. +..+. T Consensus 115 -~~~~~~~i~~~l~~a~~~~~d~a~l~---G~gs~~p~gi~~~~-~~~~~~~~~~-~~~~----~~~~~~~~~--~~~~~ 182 (326) T protein:vir:42 115 -PANYLGTMRTKVATAFAMAFDNAAIN---GTDSPFPTFLAQTT-KEVSLVDPDG-TGSN----ADLTVYDAV--AVNAL 182 (326) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhhc---ccCCCccccccccc-cccceeeccc-cccc----ccchhHHHH--HHHHH Confidence 36789999999999999999999982 11100000000000 0000000000 0000 000011111 11111 Q ss_pred HHHHHhccccCCcEEEEchhHHHHHhhhc----ccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEE Q lcl|NC_014036. 364 NEIARQTGRGAGNFIIASRNVVSALARID----SGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVG 439 (522) Q Consensus 364 n~I~r~T~~g~gn~~v~S~~va~~L~~~~----~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG 439 (522) ... ...+...+..|++|.....|..+- ...+.+.. +. ........|+|.| ++|+++++.+.+=. ++ T Consensus 183 ~~~--~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~--~~----~~~~~~~~~~l~G-~pv~~~~~~~~~~~-~~ 252 (326) T protein:vir:42 183 SLL--VNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIEST--YT----EENSPFRLGRIVA-RPTILSDHVASGTV-VG 252 (326) T ss_pred hhh--hhhccCccEEEEeHHHHHHHHHhhccCCceeecccc--cc----CccccccCceeee-eeEEEcCCCCCCce-EE Confidence 111 222335678899999999987431 11111110 00 0111123456777 79999998775322 12 Q ss_pred EecCCCccceeEeecccccccccc---------cCCcc-----cc---ceeeeeeeecce-ecCccccccCCccccccCc Q lcl|NC_014036. 440 YKGDNEMDAGIYYAPYVALTPLRG---------SDPKN-----FQ---PVMGFKTRYGVG-INPFANSRSQAPSDRITSG 501 (522) Q Consensus 440 ~KG~~~~d~glfyaPYv~~~~~~~---------~Dp~s-----~q---P~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g 501 (522) +-|+-. -+||...-. ..++. .|+.. || =.+=...|++.. .+|=+. ..+.+ T Consensus 253 ~~Gd~s---~~~~~~~~~-~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~--------~~l~~ 320 (326) T protein:vir:42 253 YQGDFR---QLVWGQVGG-LSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAF--------VKLTN 320 (326) T ss_pred EEeecc---eEEEEEecc-eEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccce--------EEEee Confidence 222211 122222211 11111 11111 22 333456677664 344111 01112 Q ss_pred chHHhh Q lcl|NC_014036. 502 MITKEM 507 (522) Q Consensus 502 ~~~~~~ 507 (522) -.+..+ T Consensus 321 ~~~~~~ 326 (326) T protein:vir:42 321 VDATEA 326 (326) T ss_pred ccccCC Confidence 222222 No 80 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=52.10 E-value=0.56 Score=21.95 Aligned_cols=346 Identities=12% Similarity=0.038 Sum_probs=121.2 Q ss_pred CcchHHHHHhhh--------------hhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhh---hhhhcccc Q lcl|NC_014036. 1 MSKKNELMEKWN--------------DLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKI---VESFGGFL 63 (522) Q Consensus 1 ~~~~~~l~~kw~--------------p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~---~~~~~~~l 63 (522) .-+-++|+++.+ .+++..+ .-++.... .. +.. |+++-+.+.+...-....+ .......- T Consensus 4 ~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~-~~e~~~~~-~e-~~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (421) T protein:vir:13 4 FERLKELRAKKKELEEKRCGIVEEIRSLAKEKK-EEEARSKA-LE-REK-IEARMEIIEEEIESVMTAIDEERKNTNFTG 79 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-hHHHHHHH-HH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 111111222222 2221111 01110000 00 011 1111111111100000000 00000000 Q ss_pred cccccc--------------------ccccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchh Q lcl|NC_014036. 64 AEAEIA--------------------GDHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPT 121 (522) Q Consensus 64 ~ea~~~--------------------~~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPT 121 (522) ...... +..-....+-+-+++.|.+. =|.-+ .+++.+.+..+-.+++-+.||++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~l--iP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~ 157 (421) T protein:vir:13 80 GRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAV--IPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNA 157 (421) T ss_pred cccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCccee--cchhhHHHHHHHHHhhhhhhhhceeeeccCCc Confidence 000000 00000001111111222211 12221 2333344455667888888888766 Q ss_pred hhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccc Q lcl|NC_014036. 122 GQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVT 201 (522) Q Consensus 122 GLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~ 201 (522) +-+- +...... +.+ T Consensus 158 ~~~~-----~~~~~~~--------------~~~----------------------------------------------- 171 (421) T protein:vir:13 158 GKMP-----VRAGASV--------------DKL----------------------------------------------- 171 (421) T ss_pred eEEE-----EeecCCc--------------cce----------------------------------------------- Confidence 4221 1110000 000 Q ss_pred cCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014036. 202 VTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLR 281 (522) Q Consensus 202 ~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLK 281 (522) .. .+| +...++-..++++++...+.-+-...+|-||.+|-- T Consensus 172 ----------------------~~--------~~E---------~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~ 212 (421) T protein:vir:13 172 ----------------------AN--------LAK---------DTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE 212 (421) T ss_pred ----------------------ee--------ccc---------cccccccccceeEEEeeeeeeEeehhhhHHHHhhhH Confidence 00 001 011223233445555555555556779999999842 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHH Q lcl|NC_014036. 282 AVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDK 361 (522) Q Consensus 282 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~ 361 (522) .|.++.|.+-|+..+..-+|..|+..+. |.. ...++.++ +..+.++..+.. T Consensus 213 ----~~l~~~i~~~la~~~~~~~~~~i~~~~~--------g~~----~~~~~~~~-------------d~i~~~~~~l~~ 263 (421) T protein:vir:13 213 ----INFLEFVNEEFAEFAVNTENAEIVKQAK--------AVL----AEETINDY-------------AGLVKTINSLVP 263 (421) T ss_pred ----HHHHHHHHHHHHHHHHHHhhhhHhhhhh--------hcc----ccccccch-------------HHHHHHHHHhhh Confidence 4568889999998888899998884322 111 11222222 223444444432 Q ss_pred HHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc-------- Q lcl|NC_014036. 362 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG-------- 433 (522) Q Consensus 362 ~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-------- 433 (522) .+.....+|++|.....|..+- +.... ....+... .--++|.| ++|++..+.+. T Consensus 264 ---------~~~~~a~~v~n~~~~~~l~~lk--d~~G~-----~i~~~~~~-~~~~tl~G-~pV~~~~~~~~~~~~~~~~ 325 (421) T protein:vir:13 264 ---------NARKRAIIVTNSDGRAYLDGLM--DKQGR-----PLLKELSD-GGDLVFKG-RPVIELEESIFDVGDETKF 325 (421) T ss_pred ---------hhcCCCEEEEcHHHHHHHHHhh--cCCCc-----eeecCcCC-CCCceecc-eeeEEeccccccCCCceEE Confidence 2235567899999988887441 11100 01111110 01246777 58887776552 Q ss_pred ------ceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccee-----------cCcc--ccccCCc Q lcl|NC_014036. 434 ------DYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-----------NPFA--NSRSQAP 494 (522) Q Consensus 434 ------dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-----------nP~~--~~~~~~~ 494 (522) +|+.++.++.-..+.+- + .+-...+=.+-+..||++++ .++. ...++.+ T Consensus 326 ~~gd~~~~~~~~~~~~~~v~~~~----~--------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~ 393 (421) T protein:vir:13 326 IVSDFKTLIKFMDRKQYLIDQSK----E--------AGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVL 393 (421) T ss_pred EEEeccccEEEEEecceEEEeec----c--------cccccCeeEEEEEeeecceeecchhhheeeecccceeecccccc Confidence 12222222222111100 0 01112222344455554332 1111 1111212 Q ss_pred cccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 495 SDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 495 ~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ..-...|..- .+||-+ +.|.+= T Consensus 394 ~~~~~~~~~~--~~~~~~----~~~~~~ 415 (421) T protein:vir:13 394 KSSPRSGKNK--NESKEE----IKEEGE 415 (421) T ss_pred CCCCcCCCCc--cccchh----eeeccc Confidence 1111111110 012211 111111 No 81 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=51.36 E-value=0.58 Score=21.87 Aligned_cols=204 Identities=16% Similarity=0.162 Sum_probs=104.8 Q ss_pred EEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeec Q lcl|NC_014036. 256 IDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFD 335 (522) Q Consensus 256 IEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd 335 (522) ||=. |=|..=++-.-+-++ | .|--.|.+.=...+++.++++-|++.+...|+... .++..++....... T Consensus 1 iD~l--------L~a~~~VdDiD~aqa-~-~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~-p~~~~~~g~~~~~~ 69 (221) T protein:vir:17 1 MDDL--------LVASQFVYDLDEILA-Q-WNTRSEISKQIGEALAIHYDERIARVLASASIAAA-PVTGQDGGFSVNIG 69 (221) T ss_pred CCcc--------hhHHHHHHhHHHHHh-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcC-cccccccCcceecc Confidence 2221 233333444444455 4 78888888889999999999999988776665322 22211211111110 Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchh-HHHHHhhhcccccccccccccccccccccce Q lcl|NC_014036. 336 FQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRN-VVSALARIDSGITPAGQGLQKTLNVDTTKAV 414 (522) Q Consensus 336 ~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~-va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~ 414 (522) -.... ....|+..|-+.+...-.+----.|-|+|++|+ ...+|+..+..+... .+.......+. .. T Consensus 70 a~~t~----------~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~-d~~~s~g~~~~--g~ 136 (221) T protein:vir:17 70 AGNTN----------NAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNR-EIGNTQGDMNT--GK 136 (221) T ss_pred ccccC----------CHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeee-ecccccccccc--cc Confidence 00000 112233333333333333333346789999996 556665434433221 11111111111 12 Q ss_pred eEEEecCceEEEecCCCcc----ceEE------------EEEecCCCccceeEeecccccccccccCCccccceeeeeee Q lcl|NC_014036. 415 FAGVLGGVYKVYIDQYARG----DYFT------------VGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTR 478 (522) Q Consensus 415 ~~G~l~~~~~vy~D~y~~~----dy~~------------vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tR 478 (522) .+|.+.| ++||.=++.|. +|.. =.|.|+-.-..||||.|=.-++ ++.+.|-|--|-+.-| T Consensus 137 ~i~~v~G-~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgt-vkl~~~~~~~~~~~~~-- 212 (221) T protein:vir:17 137 GLYVNAG-IRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADT-VEVLLPPSRPPLVISM-- 212 (221) T ss_pred eeeeecC-cEEEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchheee-eeeecCCCCCceeeee-- Confidence 4777886 89999999886 3321 1344555555789998874333 4667787777654322 Q ss_pred ecceecCccccccCCcccc Q lcl|NC_014036. 479 YGVGINPFANSRSQAPSDR 497 (522) Q Consensus 479 Y~l~~nP~~~~~~~~~~~~ 497 (522) |.-.+ |.+| T Consensus 213 -------~~~~~---~~~~ 221 (221) T protein:vir:17 213 -------FSIRR---PDRR 221 (221) T ss_pred -------eeccC---CCCC Confidence 22222 2223 No 82 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=47.36 E-value=0.7 Score=21.42 Aligned_cols=287 Identities=15% Similarity=0.111 Sum_probs=129.3 Q ss_pred CCchhhhheeeeeeecCCCCCC------------cccchhcccccccccccccccccccccccccccccccccceeeccc Q lcl|NC_014036. 117 MTGPTGQVFALRAVYGKDPLAS------------GAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDF 184 (522) Q Consensus 117 mTGPTGLIFAMRSrY~~~~~~t------------~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~ 184 (522) ||.|||++=+.. +...+- --.+..-+-+-+|.-|...++.... T Consensus 1 ~~~~~~i~s~~~----~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~--------------------- 55 (318) T protein:vir:10 1 MTAPTGIVSVSD----GPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNG--------------------- 55 (318) T ss_pred CCCCCcceeeec----CCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccc--------------------- Confidence 999999886543 211110 0011111112233333322111000 Q ss_pred ccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEE-EEEEEEE Q lcl|NC_014036. 185 VETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRI-DKQVIEA 263 (522) Q Consensus 185 ~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsI-EK~TVtA 263 (522) .++.....|+-.. ...|...+ +.+|+.-+-.. ++....+ T Consensus 56 ------~v~f~~~~p~~~~-----------------------------~d~e~VaE-----ggEiP~~~~~~G~~~ia~~ 95 (318) T protein:vir:10 56 ------VVAYNEGNPSFLE-----------------------------DDVADVAE-----FGEIPVSAGARGLPRTAFA 95 (318) T ss_pred ------eeEEEeccccccc-----------------------------CcHhhccC-----cccccccCCCCCchhhhhh Confidence 0000011111000 01111000 11122222222 1112233 Q ss_pred ecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccccc---ccccccccccceeecccccc Q lcl|NC_014036. 264 RSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGK---TGFTQTVGSKAGAFDFQDPI 340 (522) Q Consensus 264 KSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~---~~~~~~~~~~~g~fd~~~~~ 340 (522) |-+.||-++|=|.. .-+.+|.-.....-|++-|...+|+.+++.|........ ..|++......+++|- T Consensus 96 ~K~G~~~~vS~Em~----~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~A---- 167 (318) T protein:vir:10 96 VKKALGVRVSKEMI----DENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAIA---- 167 (318) T ss_pred ehhccceeccHHHH----hhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccchhh---- Confidence 47889999998864 336888999999999999999999999988755432222 2355422222233322 Q ss_pred ccccchhHHHHHHH-HHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccc--cccccceeEE Q lcl|NC_014036. 341 DVRGARWAGESYKA-LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLN--VDTTKAVFAG 417 (522) Q Consensus 341 d~~~~r~~~E~~r~-L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~--~d~~~~~~~G 417 (522) .|..+. +...+.++...-.++=+| -.|.||.+|...+.|..- ..+...-....+..+ ...+ ..|.| T Consensus 168 --------~e~v~~a~~~~~~a~~~~~~~~~GY-~pdtIVlhP~~~~~l~~n-~~~~~~y~~~a~~~~~~~~~t-g~~~g 236 (318) T protein:vir:10 168 --------IEQISTAAPTAYPAGVGSSDEYFGF-IPDTIVMHYALLPILMDN-ENFMKVYERNANYVSTAPDWT-GNFPG 236 (318) T ss_pred --------hhhhhhhhhhhhhhhhhhhhhccCc-cceeeEECHHHHHHHhcc-hhhhhhhhccchhhhhccccc-ccccc Confidence 222221 112222222222346677 669999999999998421 111111000011011 1122 34566 Q ss_pred EecCceEEEecCCCccceEEEEEecCCCccceeEeeccccccccccc----CCccccceeeeeeeecc-----eecCccc Q lcl|NC_014036. 418 VLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGS----DPKNFQPVMGFKTRYGV-----GINPFAN 488 (522) Q Consensus 418 ~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~----Dp~s~qP~~~~~tRY~l-----~~nP~~~ 488 (522) .+-| ++|..+++-|.|=.+|==+|. -| ||+-=.|++...-+ || +.+|-..-..|+=- +..|++ T Consensus 237 ~~lG-l~vi~s~~~p~~~alvlq~g~----vG-~~~d~~pl~~t~~~~egg~~-~g~~~~s~~~~~~~~~~~~V~~PkA- 308 (318) T protein:vir:10 237 SVMG-LNVIRSRTFPIDRVLIMERGT----VG-FYSDTRPLQFTALYPEGNGP-NGGPTESYRADASHKRALAVDQPKA- 308 (318) T ss_pred eeec-eEEeecCccCCCeeEEEecCC----cc-eeeccccceeeecccCCCCC-CCCcchhhheehheeeeeeeeCcce- Confidence 6666 899999999987654433321 11 44433333332222 33 24444433333222 223322 Q ss_pred cccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 489 SRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 489 ~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) -|+++|| T Consensus 309 ---------------------------~~~itgi 315 (318) T protein:vir:10 309 ---------------------------ALWLTGI 315 (318) T ss_pred ---------------------------eEEEeec Confidence 2444555 No 83 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=45.76 E-value=0.76 Score=21.24 Aligned_cols=346 Identities=15% Similarity=0.165 Sum_probs=123.6 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhh----------h---HHHHHH--hhhHHHHhhhhhhhcchhhhhhhcccc-- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSK----------K---QLIAAI--MEAQEKDAEVDPVYRDEKIVESFGGFL-- 63 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~----------~---~~~~~~--~enq~~~~~~~~~~~~~~~~~~~~~~l-- 63 (522) |.+.++|.|+=+-+++. +-+|.+... + .+.+.+ |+.|.+.+.+.+.. +.......-... T Consensus 1 M~kl~~L~e~r~~l~~~---~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~-~~~~~~~~~~~~~~ 76 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQ---IQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERA-AALVAKPVKATQHG 76 (428) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhhchhhc Confidence 99988888887776653 233332110 0 111110 11111110000000 000000000000 Q ss_pred ----cccccccccccccccc-------------------------------ccccccccccccCcchh--hHHHHHHhhh Q lcl|NC_014036. 64 ----AEAEIAGDHGYDATKI-------------------------------ASGNSSGAITNIGPAVI--GMVRRAIPNL 106 (522) Q Consensus 64 ----~ea~~~~~~g~~~~~~-------------------------------~~~t~tg~v~~~~P~Li--~l~Rra~~~l 106 (522) ...+.....+..-.+. ..++++|.+.- |.-+ .++.++-+.. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~li--P~~~~~~ii~~l~~~~ 154 (428) T protein:vir:10 77 PAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLI--PQNIHSEVIELLRDRT 154 (428) T ss_pred cccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCcccc--chhHHHHHHHHHhhhc Confidence 0000000000000000 00000111000 0000 0001011111 Q ss_pred hhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeeccccc Q lcl|NC_014036. 107 IAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVE 186 (522) Q Consensus 107 I~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~ 186 (522) +..++ |+..+++++|-+ +| T Consensus 155 ~l~~~-~~~~~~~~~g~~-----~~------------------------------------------------------- 173 (428) T protein:vir:10 155 IVRKL-GARSIPLPNGNM-----SL------------------------------------------------------- 173 (428) T ss_pred hhhhh-cceeeecCCcce-----EE------------------------------------------------------- Confidence 11111 000000000000 00 Q ss_pred ccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecc Q lcl|NC_014036. 187 TGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSR 266 (522) Q Consensus 187 ~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSR 266 (522) |.- .+.+-....+| +...++...++++++...|.- T Consensus 174 ------------p~~------------------------~~~~~a~~v~E---------g~~~~~~~~~f~~i~~~~~k~ 208 (428) T protein:vir:10 174 ------------PRL------------------------AGGATASYTGE---------NQDAKVSEARFDDVKLTAKTM 208 (428) T ss_pred ------------EEE------------------------eCCcceeeecc---------CccccccccceeeEEeeeEEE Confidence 000 00000011222 123456666777777777777 Q ss_pred cccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccc------- Q lcl|NC_014036. 267 QLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDP------- 339 (522) Q Consensus 267 ALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~------- 339 (522) +-...+|-||.+|- ..|.++.|.+.|...|...+|+.||. -.-. +. .-.|++.-... T Consensus 209 ~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~~~d~~~l~---G~G~-~~--------~p~Gi~~~~~~~~~~~~~ 272 (428) T protein:vir:10 209 IAMVPISNALIGRA----GFNVEQLVLQDILTAISVREDKAFMR---DDGT-GD--------TPIGMKARATQWNRLLPW 272 (428) T ss_pred EEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhc---cCCC-Cc--------cccccccccccccccccc Confidence 78899999999984 24568999999999999999999882 1100 00 00122111000 Q ss_pred -cccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEE Q lcl|NC_014036. 340 -IDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGV 418 (522) Q Consensus 340 -~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~ 418 (522) .+. +. .......+ ..+..+...+.+.-. .....|++|.....|..+- + ..+ . ....+. .-|+ T Consensus 273 ~~~~-~~--~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lk--d--~~G--~-~i~~~~----~~g~ 335 (428) T protein:vir:10 273 AADA-AV--NLDTIDTY-LDSIILMSMDGNSNM--ISSGWGMSNRTYMKLFGLR--D--GNG--N-KVYPEM----AQGM 335 (428) T ss_pred cccc-cc--cHHHHHHH-HHHHHHhhhcccccc--ccCEEEEcHHHHHHHHHhh--c--cCC--c-eeccCC----CCCe Confidence 000 00 01112222 222222333333222 2345678999888887441 1 110 0 011111 1256 Q ss_pred ecCceEEEecCCCccc----------------eEEEEEecCCCccceeEeecccccccccccCCccc---cceeeeeeee Q lcl|NC_014036. 419 LGGVYKVYIDQYARGD----------------YFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRY 479 (522) Q Consensus 419 l~~~~~vy~D~y~~~d----------------y~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY 479 (522) |.| ++||++.+.|.+ ++++|..+.-+.+ ..+|..........-..| +=.+=...|+ T Consensus 336 l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~----~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~ 410 (428) T protein:vir:10 336 LKG-YPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVD----FSKEASYIDTDGKLVSAFSRNQSLIRVVTEH 410 (428) T ss_pred eec-eeeEEeccccccccCCCccceEEEEecceEEEEEecceEEE----eecccccccccccccchhhcchhheeeeeee Confidence 777 699998876643 1223333222211 112211110000000001 1222345566 Q ss_pred cceec-CccccccCCccccccCcchH Q lcl|NC_014036. 480 GVGIN-PFANSRSQAPSDRITSGMIT 504 (522) Q Consensus 480 ~l~~n-P~~~~~~~~~~~~i~~g~~~ 504 (522) ++.+. | ++-.+..|-.| T Consensus 411 d~~v~~p--------~a~~~~t~~~~ 428 (428) T protein:vir:10 411 DIGFRHP--------EGLVLGTGVLF 428 (428) T ss_pred Cceeecc--------ceEEEEeccCC Confidence 65543 4 11122333344 No 84 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=43.69 E-value=0.83 Score=21.01 Aligned_cols=334 Identities=16% Similarity=0.121 Sum_probs=126.0 Q ss_pred CcchHHHHHhhhhhhccccch-hhhcchhhh-HHHHHHhhhHHHHhh-------hhhhhcchh--hhhhhcccccccc-- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGL-PDIATKSKK-QLIAAIMEAQEKDAE-------VDPVYRDEK--IVESFGGFLAEAE-- 67 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~-~~i~~~~~~-~~~~~~~enq~~~~~-------~~~~~~~~~--~~~~~~~~l~ea~-- 67 (522) ++. +-.++|.-+...-+.+ -+|....++ +-.-...|.|.+... ++..+++.. ....++..+.... T Consensus 31 ~~~--~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (394) T protein:vir:97 31 LES--DDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRF 108 (394) T ss_pred hch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhh Confidence 221 1223344333110000 001100000 000000111100000 000000000 0000000000000 Q ss_pred -------ccccccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCC Q lcl|NC_014036. 68 -------IAGDHGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAS 138 (522) Q Consensus 68 -------~~~~~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t 138 (522) ..............+.++.+-...-|.-+ .+++.+-+..+...++.+.||+++++-+--++ .... T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-----~~~~- 182 (394) T protein:vir:97 109 EGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQ-----RATT- 182 (394) T ss_pred hhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEe-----cCCC- Confidence 00000000011111111111111123322 35555556777788899999887754321110 0000 Q ss_pred cccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccc Q lcl|NC_014036. 139 GAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQ 218 (522) Q Consensus 139 ~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~ 218 (522) .. T Consensus 183 ------------~~------------------------------------------------------------------ 184 (394) T protein:vir:97 183 ------------KM------------------------------------------------------------------ 184 (394) T ss_pred ------------cc------------------------------------------------------------------ Confidence 00 Q ss_pred cccccccccccccchhhhhccccCCCCCcccccc-ceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_014036. 219 EKGTLAEISYGMATSVAELQEQFNGSTGNPWNEM-GFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILA 297 (522) Q Consensus 219 ~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILS 297 (522) . ..+|. ...++. ...++++++.++.-+-...+|-||++|- ..|.+++|.+-|+ T Consensus 185 -----~--------~v~E~---------~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la 238 (394) T protein:vir:97 185 -----V--------TVAEL---------EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESIS 238 (394) T ss_pred -----c--------eeccc---------ccccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHH Confidence 0 00110 011222 2346667777777777889999999986 3467888999999 Q ss_pred HHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcE Q lcl|NC_014036. 298 TEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNF 377 (522) Q Consensus 298 TEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~ 377 (522) ..|..-+|..||.-+...+. .+...++ ....++... ....+ . .- T Consensus 239 ~~~~~~~~~~i~~g~~~~~~-------------~~~~~~~-------------~~~~~~~~~--------~~~~~-~-a~ 282 (394) T protein:vir:97 239 QIKVNTTNDAIAKVLKSFTT-------------KTVKNLD-------------EIKALLNGG--------FDPAY-N-VS 282 (394) T ss_pred HHHHHHHHHHHhhccccccc-------------cccccHH-------------HHHHHHHhh--------hhhhh-C-CE Confidence 99999899888843322111 1211111 111121111 11222 2 23 Q ss_pred EEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEe--cCCCccceEEEEEecCCCccceeEeecc Q lcl|NC_014036. 378 IIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYI--DQYARGDYFTVGYKGDNEMDAGIYYAPY 455 (522) Q Consensus 378 ~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~dy~~vG~KG~~~~d~glfyaPY 455 (522) +|++|.+...|..+- +.... -....+.+. ..-++|.| ++|++ |...+..-+++|-- ..+.++..- T Consensus 283 ~v~n~~~~~~l~~lk--d~~G~----~i~~~~~~~-~~~~~l~G-~pv~~~~~~~~~~~~~~~gd~-----~~~~~~~~~ 349 (394) T protein:vir:97 283 LIVSQSFYQTLDTLK--DGNGR----YLLQDDITA-VSGKVLLG-KPVFVLSDEVLGANKAFIGDF-----KRGVLFADR 349 (394) T ss_pred EEEcHHHHHHHHHhh--ccCCC----eeeecCcCC-CCCceecc-ceeEEecccccCCccEEEeec-----cccEEEEEe Confidence 679999988887541 11000 000011111 11247777 58777 44444444444421 011111111 Q ss_pred cccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 456 VALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 456 v~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) ....+...|...++..+-...||+..+ +|=+ ++.+-++.. T Consensus 350 -~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a--------------------------~~~~~~~~~ 390 (394) T protein:vir:97 350 -KDLGLRWADNEIYGQYLQAVLRFGVSKVDDKA--------------------------GYYVTFTPE 390 (394) T ss_pred -cceEEEEecccccceeEEEEEEEccEEecccc--------------------------eEEEEeccc Confidence 111222344455555555666776532 3311 111222222 No 85 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=43.62 E-value=0.84 Score=21.01 Aligned_cols=264 Identities=9% Similarity=0.041 Sum_probs=109.1 Q ss_pred cccccccccc--cccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccc Q lcl|NC_014036. 155 SGQGAAPSNG--FTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMAT 232 (522) Q Consensus 155 SG~g~~~~~~--~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~T 232 (522) .....+.... .+...+. +.... +... ..+..+. .....+. .. .|...++..-=.. T Consensus 1 ma~~~T~~~d~iiPev~~~-------~v~~~-~~~~-~~~~~~~--------~~~~~l~-----g~-~G~ti~iP~~~~~ 57 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAP-------IVSYE-LNKA-LRFAPLA--------QVDTTLQ-----GQ-PGNTLKFPAFTYI 57 (272) T ss_pred CCCcceehhhhhchHHHHH-------HHHHH-HHhh-hhhcccc--------ccccccc-----cC-CCCEEEEeeeccC Confidence 1100000000 0110000 00000 0000 0000000 0000000 00 1111211110011 Q ss_pred hhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_014036. 233 SVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMI 312 (522) Q Consensus 233 s~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i 312 (522) ..+|.. .....-+..++. ..+.+++-|-|+-.-++|=|. ++.-+-|.-.|..+-++..+..+++++|+..+ T Consensus 58 gda~~~---~eg~~i~~~~lt--~~~~~~~i~~~~k~~~vtD~~----~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l 128 (272) T protein:vir:36 58 GDAADV---AEGGEISLDKIG--TTTKSVTIKKAAKGTEITDEA----ALSGYGDPIGESNKQLGLSLANKVDDDLLSAA 128 (272) T ss_pred cccccc---CCCCccChhhcC--CcceeEeeehhhccccccHHH----HhhccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111211 111112233443 445555556665322233222 12235789999999999999999999999665 Q ss_pred hhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhc Q lcl|NC_014036. 313 NYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID 392 (522) Q Consensus 313 ~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~ 392 (522) ...+. .+ .+.+.+ +.+-.+..++.++. ...+++||+|.++..|.... T Consensus 129 ~~~~~-~~----------~~~~~~-------------d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~ 175 (272) T protein:vir:36 129 KTTSQ-TV----------STKANV-------------DGVQAALDIFNDED---------AQAYVLIVNPKDAAKIRKDA 175 (272) T ss_pred ccccc-cc----------cccccH-------------HHHHHHHHHhhhcC---------CCceEEEEcHHHHHHHhccc Confidence 32211 10 111111 11222223333222 14579999999999997543 Q ss_pred ccccccccccccccccccccceeEEEecCceEEEecCCCccc---eEEEEE-ecCCCccceeEeeccccccccc-ccCCc Q lcl|NC_014036. 393 SGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD---YFTVGY-KGDNEMDAGIYYAPYVALTPLR-GSDPK 467 (522) Q Consensus 393 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---y~~vG~-KG~~~~d~glfyaPYv~~~~~~-~~Dp~ 467 (522) ........... .......+|.+.| ++|++|...|.+ |..+.. +|. ..+|.. ....++ .-|+. T Consensus 176 ~~~~~~~~~~~-----~~~~~G~ig~~~G-~~Vv~s~~~p~~~~~~~~~~~~~gA-----~~~~~~--~~~~vE~~R~~~ 242 (272) T protein:vir:36 176 NAKNIGSEVGA-----NALINGTYADVLG-AQIVRSKKLAEGSALMFKIVSNSPA-----LKLVLK--RGVQVETDRDIV 242 (272) T ss_pred ccccccccccc-----cceeeeccceecC-eeEEEeCCCCCCceeEEEEEecccc-----eeeeec--CCcccccccchh Confidence 33332211100 0111224678877 899999998853 222221 121 112211 111122 23888 Q ss_pred cccceeeeeeeeccee-cCccccccCCccccccCcchHHhhccccceeeeeeeccC Q lcl|NC_014036. 468 NFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 468 s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~Vk~~ 522 (522) .++=.+--.-+||+.+ ||=. ...+-.||+ T Consensus 243 ~~~d~i~~~~~y~~~v~~~~~--------------------------vv~~t~~g~ 272 (272) T protein:vir:36 243 TKTTVITADEHYAAYLYDLTK--------------------------VVNITFTGV 272 (272) T ss_pred hcCcEEEEEEEEEEEEEcCcc--------------------------EEEEeecCC Confidence 8888888888888754 5511 122333333 No 86 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=42.58 E-value=0.88 Score=20.89 Aligned_cols=302 Identities=10% Similarity=0.044 Sum_probs=116.8 Q ss_pred hcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcchh-hHHHHH Q lcl|NC_014036. 24 IATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVI-GMVRRA 102 (522) Q Consensus 24 i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra 102 (522) |+...++..-.+-+.| .+-+.+... +.....++..+.+ .-|.+. .+++.+ T Consensus 1 ~~~~~~~~~~~~~f~~----------------------~~~~~~~~~-----a~~~~~~~~~~~l--ip~~~~~~ii~~~ 51 (324) T protein:vir:96 1 MEQTQKLKLNLQHFAS----------------------NNVKPQVFN-----PDNVMMHEKKDGT--LLNDFTTPILQEV 51 (324) T ss_pred CCcchhhhHHHHHHHH----------------------hhhhhhhcc-----cccccccCCCcce--echhHHHHHHHHH Confidence 1111111110000000 000000000 0000111111111 122233 455566 Q ss_pred HhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeec Q lcl|NC_014036. 103 IPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFH 182 (522) Q Consensus 103 ~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~ 182 (522) ..+.+..+++.+-||++++.-|. ++... +.+.| T Consensus 52 ~~~s~l~~l~~~~~~~~~~~~~p----~~~~~---------------~~a~~---------------------------- 84 (324) T protein:vir:96 52 MENSKIMQLGKYEPMEGTEKKFT----FWADK---------------PGAYW---------------------------- 84 (324) T ss_pred HhhchhhhhcceeeccCCceEEE----EEecC---------------cceee---------------------------- Confidence 67778889999999987653221 01000 00000 Q ss_pred ccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEE Q lcl|NC_014036. 183 DFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIE 262 (522) Q Consensus 183 ~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVt 262 (522) + +|. ..+++..-+++++++. T Consensus 85 -------------------------------------------v--------~Eg---------~~~~~~~~~f~~v~~~ 104 (324) T protein:vir:96 85 -------------------------------------------V--------GEG---------QKIETSKATWVNATMR 104 (324) T ss_pred -------------------------------------------e--------cCC---------ccccccccceeEEEEE Confidence 0 010 0122222233444444 Q ss_pred EecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccc Q lcl|NC_014036. 263 ARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDV 342 (522) Q Consensus 263 AKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~ 342 (522) .|.-+-....|-||.+|-. .|.+++|.+.|...|...+++.||.--..... ..|++....... T Consensus 105 ~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~------------~~~~~~~~~~~~- 167 (324) T protein:vir:96 105 AFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF------------GKSIAQSIKKTN- 167 (324) T ss_pred eEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCc------------Cccccccccccc- Confidence 4444445559999999853 56799999999999999999999832110000 011111100000 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCc Q lcl|NC_014036. 343 RGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGV 422 (522) Q Consensus 343 ~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~ 422 (522) .+. .....+..|..+.+.|.. .+...+.+||+|.....|..+-- +.+ .... .+.. .++|.| T Consensus 168 ---~~~--~~~~~~~~i~~~~~~i~~--~~~~~~~~i~n~~~~~~L~~lkd----~~G--~~~~-~~~~----~~~l~G- 228 (324) T protein:vir:96 168 ---KVI--KGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVD----PET--KERI-YDRN----SDSLDG- 228 (324) T ss_pred ---eec--ccccchHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhC----CCC--Ceee-cCCC----CCcccc- Confidence 000 001112223344444432 33456789999999998875411 111 0001 1111 235666 Q ss_pred eEEEecCCCcc--ceEE--------EEEecCCCccceeEeecccccccccccCCcc-----c---cceeeeeeeecc-ee Q lcl|NC_014036. 423 YKVYIDQYARG--DYFT--------VGYKGDNEMDAGIYYAPYVALTPLRGSDPKN-----F---QPVMGFKTRYGV-GI 483 (522) Q Consensus 423 ~~vy~D~y~~~--dy~~--------vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s-----~---qP~~~~~tRY~l-~~ 483 (522) ++|++++.... ..++ +|..+.-+.+. ..+... ....|+.. | |=.+=..-||++ .. T Consensus 229 ~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~----~~~~~~--~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~ 302 (324) T protein:vir:96 229 LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQL--STVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred eeeEeecCCCCCcceEEEEecceEEEEEecCcEEEE----eecccc--cccccccccchhhhhcCcEEEEEEEEeccEEe Confidence 68888665442 2233 33332211100 000000 00011110 1 122334456665 33 Q ss_pred cC--ccccccCCccccccCcchHHhhcccc Q lcl|NC_014036. 484 NP--FANSRSQAPSDRITSGMITKEMFGKN 511 (522) Q Consensus 484 nP--~~~~~~~~~~~~i~~g~~~~~~~~~~ 511 (522) +| |+.-..-.+.....- |+- T Consensus 303 ~~~a~~~l~~a~~~~~~~~--------~~~ 324 (324) T protein:vir:96 303 DDKAFAKLVPADKRTDSVP--------GEV 324 (324) T ss_pred cccceEEEecccccCCCCC--------CCC Confidence 45 111000000000011 111 No 87 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=40.29 E-value=0.98 Score=20.64 Aligned_cols=290 Identities=9% Similarity=-0.009 Sum_probs=119.8 Q ss_pred hhhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcc-hh-hHHHHHHhhhhhhhceeeccCCchh Q lcl|NC_014036. 44 AEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPT 121 (522) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~-Li-~l~Rra~~~lI~~DI~GVQPmTGPT 121 (522) ++....+ ..+...+.. +++.+-...=|. +. .+++.+-+..+..+++.+.||++++ T Consensus 1 ~~~~~~~----------------------~~e~~~~~~-~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 57 (318) T protein:vir:24 1 MAAGTAF----------------------AVDHAQIAQ-TGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTG 57 (318) T ss_pred CCCCCCC----------------------CHHHHHhhc-ccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 1111100 001111111 111111111122 21 3455556677888888899987754 Q ss_pred hhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccc Q lcl|NC_014036. 122 GQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVT 201 (522) Q Consensus 122 GLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~ 201 (522) .- |.-.... +.+ T Consensus 58 ~~-------ip~~~~~------------~~a------------------------------------------------- 69 (318) T protein:vir:24 58 QK-------IPHWVGD------------VSA------------------------------------------------- 69 (318) T ss_pred eE-------EEEEeCC------------cce------------------------------------------------- Confidence 21 2110000 000 Q ss_pred cCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014036. 202 VTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLR 281 (522) Q Consensus 202 ~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLK 281 (522) . ..+| +.++++...++++++.+.|..+-...+|-||.+|-. T Consensus 70 ----------------------~--------~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~ 110 (318) T protein:vir:24 70 ----------------------Q--------WIGE---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP 110 (318) T ss_pred ----------------------E--------EecC---------CccccccccceeEEEEeeEEEEEeehhhHHHhhcCh Confidence 0 0011 112344445567777777777777899999999843 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccc----cccchhHHHHHHHHHH Q lcl|NC_014036. 282 AVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPID----VRGARWAGESYKALLI 357 (522) Q Consensus 282 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d----~~~~r~~~E~~r~L~~ 357 (522) .|.+++|.+.|+..|...|+..+|.-.....- .|++....... ....-+.-+....++. T Consensus 111 ----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 173 (318) T protein:vir:24 111 ----ANYLGTMRTKVATAFAMAFDGAAMHGTDSPFP-------------TYIGQTTKAISIADTTGATTVYDQVAVNGLS 173 (318) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHhhhcccCCCCC-------------cccccccccccccccccccchHHHHHHHHHH Confidence 67899999999999999999999832111100 11111110000 0000111111122222 Q ss_pred HHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccc-ccccccccccccccee-EEEecCceEEEecCCCccc- Q lcl|NC_014036. 358 QIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAG-QGLQKTLNVDTTKAVF-AGVLGGVYKVYIDQYARGD- 434 (522) Q Consensus 358 ~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~-~~~~~~~~~d~~~~~~-~G~l~~~~~vy~D~y~~~d- 434 (522) . + .-.......+||+|.....|..+ .+.... -|..... ......+ -+.+.+ ++|++.+..+.. T Consensus 174 ~-------~--~~~~~~~~~~v~n~~~~~~L~~l--kd~~G~~l~~~~~~--~~~~~~~~~~~i~g-~pv~~~~~~~~~~ 239 (318) T protein:vir:24 174 L-------L--VNDGKKWTHTLLDDITEPILNGA--KDQNGRPLFIESTY--GEAASPFRSGRIVA-RPTILSDHVVEGT 239 (318) T ss_pred h-------h--ccccCCCCEEEEcHHHHHHHHHh--hccCCceeecCccc--cCccccccCceEEE-EeeEEeCCCCCCc Confidence 1 2 12233557889999999999743 111100 0000000 0111111 123443 577777776532 Q ss_pred e-EEEEEecCCCccceeEeecccccccccc---------cCCc----c-c---cceeeeeeeecce-ecCccccccCCcc Q lcl|NC_014036. 435 Y-FTVGYKGDNEMDAGIYYAPYVALTPLRG---------SDPK----N-F---QPVMGFKTRYGVG-INPFANSRSQAPS 495 (522) Q Consensus 435 y-~~vG~KG~~~~d~glfyaPYv~~~~~~~---------~Dp~----s-~---qP~~~~~tRY~l~-~nP~~~~~~~~~~ 495 (522) . +++| +- +.++|+-.-. ..++. .|+. + | |=.+=...||+.. .+|=+.. T Consensus 240 ~~~~~g---df---s~~~~~~~~~-l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~------ 306 (318) T protein:vir:24 240 TVGFMG---DF---SQLIWGQIGG-LSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFV------ 306 (318) T ss_pred cEEEEe---ec---ceEEEEEecC-eEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceE------ Confidence 1 1111 11 1123332211 11111 1111 1 2 2333445677765 3442111 Q ss_pred ccccCcchHHhhcc Q lcl|NC_014036. 496 DRITSGMITKEMFG 509 (522) Q Consensus 496 ~~i~~g~~~~~~~~ 509 (522) .+.+-.+.-..| T Consensus 307 --~i~~~~a~~~~~ 318 (318) T protein:vir:24 307 --ALTNVVSGGGEG 318 (318) T ss_pred --EEEeeccCCCCC Confidence 111111111112 No 88 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=39.40 E-value=1 Score=20.54 Aligned_cols=328 Identities=14% Similarity=0.114 Sum_probs=116.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhh-----------------H---HHHHH--hhhHHHHhhhhhhhcchhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKK-----------------Q---LIAAI--MEAQEKDAEVDPVYRDEKIVES 58 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~-----------------~---~~~~~--~enq~~~~~~~~~~~~~~~~~~ 58 (522) |.+.++|.++|.-+.+. +-++.+..+. . +.+++ |+.|.+.+..+..-+ ... T Consensus 1 Mk~l~el~~~~~~~~~~---~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~----~~~ 73 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQ---LKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAK----VKD 73 (387) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----hhh Confidence 99988888888776652 1222211110 0 11110 122211111100000 000 Q ss_pred hccccccc----ccc---------cccccc----------c-cccccccccccccccCcchhh------HHHHHHhhhhh Q lcl|NC_014036. 59 FGGFLAEA----EIA---------GDHGYD----------A-TKIASGNSSGAITNIGPAVIG------MVRRAIPNLIA 108 (522) Q Consensus 59 ~~~~l~ea----~~~---------~~~g~~----------~-~~~~~~t~tg~v~~~~P~Li~------l~Rra~~~lI~ 108 (522) ..+..++. ... .-.+.. . ..+.+++.+ .+ ..||+ ++++.-..-.- T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l 148 (387) T protein:vir:26 74 KGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQL 148 (387) T ss_pred ccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchh Confidence 00000000 000 000000 0 001111111 01 11222 22222223334 Q ss_pred hhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeeccccccc Q lcl|NC_014036. 109 FDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETG 188 (522) Q Consensus 109 ~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g 188 (522) .+++.|.|+++.+. .+.. +++ T Consensus 149 ~~~~~~~~~~~~~~------p~~~---------------------~~~-------------------------------- 169 (387) T protein:vir:26 149 REKARLTNIKGLEI------PRVS---------------------YTL-------------------------------- 169 (387) T ss_pred hhhceeeecCCcee------eeee---------------------ccC-------------------------------- Confidence 56666666543211 0000 000 Q ss_pred ceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccc Q lcl|NC_014036. 189 RVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQL 268 (522) Q Consensus 189 ~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRAL 268 (522) . +.. ..+|. ...++...++++++..+|.-+- T Consensus 170 ----------------~----------------~a~--------~v~Eg---------~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:26 170 ----------------D----------------DDD--------FITDV---------ETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred ----------------C----------------ccc--------ccccc---------ccccccccccceeeechheeee Confidence 0 000 01110 0122223333455555555555 Q ss_pred cchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhH Q lcl|NC_014036. 269 KAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWA 348 (522) Q Consensus 269 KAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~ 348 (522) ...+|-||.+|- ..|.|++|.+-|+..|..-.|..++- .....|.. .|++--.....+.+ T Consensus 201 ~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~---~g~g~g~~---------~g~~~~~~~~~~~~---- 260 (387) T protein:vir:26 201 FAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALA---VSPKSGLE---------HMSFYNGSVKEVEG---- 260 (387) T ss_pred echhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhh---cCCCcccc---------ceeeeccccccccc---- Confidence 688999999985 45668999999999888766666652 11111111 11111000011111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEec Q lcl|NC_014036. 349 GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYID 428 (522) Q Consensus 349 ~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D 428 (522) -.++-.|..+-+.+...= +..+.|++-+...+.++...+.. .+ ..- +..+ ++|.| ++||+. T Consensus 261 ----~~~~d~i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~~~----~~----~~~-~~~~----~~llG-~PV~~~ 321 (387) T protein:vir:26 261 ----ADMYDAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNG----TT----NFF-DTPA----EKVFG-KPVVFT 321 (387) T ss_pred ----cchHHHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcC----CC----ccc-ccCC----ccccc-cceEEe Confidence 112223333333333321 23555654444444544433211 10 000 1111 35776 599988 Q ss_pred CCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcchHHhh Q lcl|NC_014036. 429 QYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 429 ~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~ 507 (522) .+++. +++| +- +-||.=|......+.-|..+.+-.+-...||+..+ +| T Consensus 322 ~~~~~--~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~---------------------- 370 (387) T protein:vir:26 322 DAAVK--PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD---------------------- 370 (387) T ss_pred cCCCc--eeee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech---------------------- Confidence 77653 3333 11 11222121111111112222232333333554432 23 Q ss_pred ccccceeeeeeeccC Q lcl|NC_014036. 508 FGKNAYFRKVYVKGL 522 (522) Q Consensus 508 ~~~~~~~r~~~Vk~~ 522 (522) .-++.+.||-= T Consensus 371 ----~A~~~l~~ka~ 381 (387) T protein:vir:26 371 ----SAFRIAKAKEN 381 (387) T ss_pred ----hheEEEEeecC Confidence 12233333222 No 89 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=39.40 E-value=1 Score=20.54 Aligned_cols=328 Identities=14% Similarity=0.114 Sum_probs=116.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhh-----------------H---HHHHH--hhhHHHHhhhhhhhcchhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKK-----------------Q---LIAAI--MEAQEKDAEVDPVYRDEKIVES 58 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~-----------------~---~~~~~--~enq~~~~~~~~~~~~~~~~~~ 58 (522) |.+.++|.++|.-+.+. +-++.+..+. . +.+++ |+.|.+.+..+..-+ ... T Consensus 1 Mk~l~el~~~~~~~~~~---~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~----~~~ 73 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQ---LKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAK----VKD 73 (387) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----hhh Confidence 99988888888776652 1222211110 0 11110 122211111100000 000 Q ss_pred hccccccc----ccc---------cccccc----------c-cccccccccccccccCcchhh------HHHHHHhhhhh Q lcl|NC_014036. 59 FGGFLAEA----EIA---------GDHGYD----------A-TKIASGNSSGAITNIGPAVIG------MVRRAIPNLIA 108 (522) Q Consensus 59 ~~~~l~ea----~~~---------~~~g~~----------~-~~~~~~t~tg~v~~~~P~Li~------l~Rra~~~lI~ 108 (522) ..+..++. ... .-.+.. . ..+.+++.+ .+ ..||+ ++++.-..-.- T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l 148 (387) T protein:vir:94 74 KGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQL 148 (387) T ss_pred ccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchh Confidence 00000000 000 000000 0 001111111 01 11222 22222223334 Q ss_pred hhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeeccccccc Q lcl|NC_014036. 109 FDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETG 188 (522) Q Consensus 109 ~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g 188 (522) .+++.|.|+++.+. .+.. +++ T Consensus 149 ~~~~~~~~~~~~~~------p~~~---------------------~~~-------------------------------- 169 (387) T protein:vir:94 149 REKARLTNIKGLEI------PRVS---------------------YTL-------------------------------- 169 (387) T ss_pred hhhceeeecCCcee------eeee---------------------ccC-------------------------------- Confidence 56666666543211 0000 000 Q ss_pred ceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccc Q lcl|NC_014036. 189 RVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQL 268 (522) Q Consensus 189 ~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRAL 268 (522) . +.. ..+|. ...++...++++++..+|.-+- T Consensus 170 ----------------~----------------~a~--------~v~Eg---------~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:94 170 ----------------D----------------DDD--------FITDV---------ETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred ----------------C----------------ccc--------ccccc---------ccccccccccceeeechheeee Confidence 0 000 01110 0122223333455555555555 Q ss_pred cchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhH Q lcl|NC_014036. 269 KAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWA 348 (522) Q Consensus 269 KAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~ 348 (522) ...+|-||.+|- ..|.|++|.+-|+..|..-.|..++- .....|.. .|++--.....+.+ T Consensus 201 ~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~---~g~g~g~~---------~g~~~~~~~~~~~~---- 260 (387) T protein:vir:94 201 FAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALA---VSPKSGLE---------HMSFYNGSVKEVEG---- 260 (387) T ss_pred echhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhh---cCCCcccc---------ceeeeccccccccc---- Confidence 688999999985 45668999999999888766666652 11111111 11111000011111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEec Q lcl|NC_014036. 349 GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYID 428 (522) Q Consensus 349 ~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D 428 (522) -.++-.|..+-+.+...= +..+.|++-+...+.++...+.. .+ ..- +..+ ++|.| ++||+. T Consensus 261 ----~~~~d~i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~~~----~~----~~~-~~~~----~~llG-~PV~~~ 321 (387) T protein:vir:94 261 ----ADMYDAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNG----TT----NFF-DTPA----EKVFG-KPVVFT 321 (387) T ss_pred ----cchHHHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcC----CC----ccc-ccCC----ccccc-cceEEe Confidence 112223333333333321 23555654444444544433211 10 000 1111 35776 599988 Q ss_pred CCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcchHHhh Q lcl|NC_014036. 429 QYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 429 ~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~ 507 (522) .+++. +++| +- +-||.=|......+.-|..+.+-.+-...||+..+ +| T Consensus 322 ~~~~~--~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~---------------------- 370 (387) T protein:vir:94 322 DAAVK--PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD---------------------- 370 (387) T ss_pred cCCCc--eeee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech---------------------- Confidence 77653 3333 11 11222121111111112222232333333554432 23 Q ss_pred ccccceeeeeeeccC Q lcl|NC_014036. 508 FGKNAYFRKVYVKGL 522 (522) Q Consensus 508 ~~~~~~~r~~~Vk~~ 522 (522) .-++.+.||-= T Consensus 371 ----~A~~~l~~ka~ 381 (387) T protein:vir:94 371 ----SAFRIAKAKEN 381 (387) T ss_pred ----hheEEEEeecC Confidence 12233333222 No 90 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=39.40 E-value=1 Score=20.54 Aligned_cols=328 Identities=14% Similarity=0.114 Sum_probs=116.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhh-----------------H---HHHHH--hhhHHHHhhhhhhhcchhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKK-----------------Q---LIAAI--MEAQEKDAEVDPVYRDEKIVES 58 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~-----------------~---~~~~~--~enq~~~~~~~~~~~~~~~~~~ 58 (522) |.+.++|.++|.-+.+. +-++.+..+. . +.+++ |+.|.+.+..+..-+ ... T Consensus 1 Mk~l~el~~~~~~~~~~---~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~----~~~ 73 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQ---LKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAK----VKD 73 (387) T ss_pred CchHHHHHHHHHHHHHH---HHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----hhh Confidence 99988888888776652 1222211110 0 11110 122211111100000 000 Q ss_pred hccccccc----ccc---------cccccc----------c-cccccccccccccccCcchhh------HHHHHHhhhhh Q lcl|NC_014036. 59 FGGFLAEA----EIA---------GDHGYD----------A-TKIASGNSSGAITNIGPAVIG------MVRRAIPNLIA 108 (522) Q Consensus 59 ~~~~l~ea----~~~---------~~~g~~----------~-~~~~~~t~tg~v~~~~P~Li~------l~Rra~~~lI~ 108 (522) ..+..++. ... .-.+.. . ..+.+++.+ .+ ..||+ ++++.-..-.- T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~g-G~lIP~~~~~~Ii~~~~~~~~l 148 (387) T protein:vir:96 74 KGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDS----GG-DKLLPKTLSKEIVSEPFAKNQL 148 (387) T ss_pred ccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCC----CC-ceeechhHHHHHHHHHHhhchh Confidence 00000000 000 000000 0 001111111 01 11222 22222223334 Q ss_pred hhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeeccccccc Q lcl|NC_014036. 109 FDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETG 188 (522) Q Consensus 109 ~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g 188 (522) .+++.|.|+++.+. .+.. +++ T Consensus 149 ~~~~~~~~~~~~~~------p~~~---------------------~~~-------------------------------- 169 (387) T protein:vir:96 149 REKARLTNIKGLEI------PRVS---------------------YTL-------------------------------- 169 (387) T ss_pred hhhceeeecCCcee------eeee---------------------ccC-------------------------------- Confidence 56666666543211 0000 000 Q ss_pred ceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccc Q lcl|NC_014036. 189 RVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQL 268 (522) Q Consensus 189 ~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRAL 268 (522) . +.. ..+|. ...++...++++++..+|.-+- T Consensus 170 ----------------~----------------~a~--------~v~Eg---------~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:96 170 ----------------D----------------DDD--------FITDV---------ETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred ----------------C----------------ccc--------ccccc---------ccccccccccceeeechheeee Confidence 0 000 01110 0122223333455555555555 Q ss_pred cchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhH Q lcl|NC_014036. 269 KAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWA 348 (522) Q Consensus 269 KAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~ 348 (522) ...+|-||.+|- ..|.|++|.+-|+..|..-.|..++- .....|.. .|++--.....+.+ T Consensus 201 ~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~---~g~g~g~~---------~g~~~~~~~~~~~~---- 260 (387) T protein:vir:96 201 FAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALA---VSPKSGLE---------HMSFYNGSVKEVEG---- 260 (387) T ss_pred echhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhh---cCCCcccc---------ceeeeccccccccc---- Confidence 688999999985 45668999999999888766666652 11111111 11111000011111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEec Q lcl|NC_014036. 349 GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYID 428 (522) Q Consensus 349 ~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D 428 (522) -.++-.|..+-+.+...= +..+.|++-+...+.++...+.. .+ ..- +..+ ++|.| ++||+. T Consensus 261 ----~~~~d~i~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~~~----~~----~~~-~~~~----~~llG-~PV~~~ 321 (387) T protein:vir:96 261 ----ADMYDAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNG----TT----NFF-DTPA----EKVFG-KPVVFT 321 (387) T ss_pred ----cchHHHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcC----CC----ccc-ccCC----ccccc-cceEEe Confidence 112223333333333321 23555654444444544433211 10 000 1111 35776 599988 Q ss_pred CCCccceEEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcchHHhh Q lcl|NC_014036. 429 QYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 429 ~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~ 507 (522) .+++. +++| +- +-||.=|......+.-|..+.+-.+-...||+..+ +| T Consensus 322 ~~~~~--~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~---------------------- 370 (387) T protein:vir:96 322 DAAVK--PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD---------------------- 370 (387) T ss_pred cCCCc--eeee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech---------------------- Confidence 77653 3333 11 11222121111111112222232333333554432 23 Q ss_pred ccccceeeeeeeccC Q lcl|NC_014036. 508 FGKNAYFRKVYVKGL 522 (522) Q Consensus 508 ~~~~~~~r~~~Vk~~ 522 (522) .-++.+.||-= T Consensus 371 ----~A~~~l~~ka~ 381 (387) T protein:vir:96 371 ----SAFRIAKAKEN 381 (387) T ss_pred ----hheEEEEeecC Confidence 12233333222 No 91 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=37.72 E-value=1.1 Score=20.35 Aligned_cols=347 Identities=10% Similarity=0.049 Sum_probs=125.7 Q ss_pred Ccch-HHHHHhhhhhh-------ccccch-hhhcchhhhHHHHHHhhhHHHHhhhhhh----h----------------- Q lcl|NC_014036. 1 MSKK-NELMEKWNDLL-------ESQEGL-PDIATKSKKQLIAAIMEAQEKDAEVDPV----Y----------------- 50 (522) Q Consensus 1 ~~~~-~~l~~kw~p~l-------~~~~~~-~~i~~~~~~~~~~~~~enq~~~~~~~~~----~----------------- 50 (522) |++. ++|+++=+-++ +..+.. .++....+ .+ .. |+.|-+.+.+... . T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~-e~-~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSN-EI-DI-LQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVI 77 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHH-HH-HH-HHHHHHHHHHHHHHHHHHhhhhccccccccchhhH Confidence 9974 45666544333 321111 12211111 00 11 1111111110000 0 Q ss_pred -cchhhhhh-hcccccccc-ccccccccccc-ccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhh Q lcl|NC_014036. 51 -RDEKIVES-FGGFLAEAE-IAGDHGYDATK-IASGN-SSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQ 123 (522) Q Consensus 51 -~~~~~~~~-~~~~l~ea~-~~~~~g~~~~~-~~~~t-~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGL 123 (522) .......+ ...++.+-+ .+........+ +..++ ++|.+. . |.-+ .+++.+-.+....+++++.||+++.|- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~-v-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~ 155 (404) T protein:vir:10 78 YNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYA-V-PEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGS 155 (404) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCcee-e-chhHHHHHHHHHhhhhhHhhhhceeeccCCccc Confidence 00000000 011111110 00011111111 11122 122211 1 2222 445555567778899999999999875 Q ss_pred heeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccC Q lcl|NC_014036. 124 VFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVT 203 (522) Q Consensus 124 IFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~t 203 (522) +- |..... .+...|-+.+.. T Consensus 156 ~~-----~~~~~~------------~~~~~~v~e~~~------------------------------------------- 175 (404) T protein:vir:10 156 RT-----YEKRSK------------QKPMKPLSENQQ------------------------------------------- 175 (404) T ss_pred eE-----EEEecC------------Ccceeecccccc------------------------------------------- Confidence 32 211100 000000000000 Q ss_pred CCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhh Q lcl|NC_014036. 204 GSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAV 283 (522) Q Consensus 204 gt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAi 283 (522) ..+ .....++++++.+.|.-+-...+|-||.+|-. T Consensus 176 ------------------------------~~~-------------~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-- 210 (404) T protein:vir:10 176 ------------------------------IPT-------------NGDNGKLERFNFKLKDLADFMSIPNDLLKFAD-- 210 (404) T ss_pred ------------------------------ccc-------------cccccceeeeEeeheeeEeeehhhHHHHhhcH-- Confidence 000 00112234445555544555689999999843 Q ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHH Q lcl|NC_014036. 284 HGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEA 363 (522) Q Consensus 284 HGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~a 363 (522) .+.+++|.+.|+..|...+|+.||.---... ...|+.......-.. +.. ...+..++..- T Consensus 211 --~~l~~~i~~~la~~~~~~~~~~il~G~g~~~------------~~~gi~~~~~~~~~~---~~~---~~~~~~~~~~~ 270 (404) T protein:vir:10 211 --KSLEDWIINWFVDKVRITRNAEILYGAGGDE------------HATGIMTANKFKKIT---LPK---SPALKDFKKCK 270 (404) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC------------cccceeeccccceee---ccc---cccHHHHHHHH Confidence 3568889999999999999998883211110 011222111100000 000 00111122111 Q ss_pred HHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCC-CccceEEEEEec Q lcl|NC_014036. 364 NEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQY-ARGDYFTVGYKG 442 (522) Q Consensus 364 n~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y-~~~dy~~vG~KG 442 (522) +. .....+...-.+||||+....|..+- +.... -....+.+. ..-++|.| ++|++.+. .+.. T Consensus 271 ~~-~l~~~~~~~~~~v~n~~~~~~L~~lk--d~~G~----~l~~~~~~~-~~~~~l~G-~PV~~~~~~~~~~-------- 333 (404) T protein:vir:10 271 NV-ELLNVFKATSSWIVNQDGFNYLDSLE--DKTGR----PYLQPDPKD-PTQYRFLG-LPVIELPNDLLLS-------- 333 (404) T ss_pred Hh-hhhccccCCCEEEEcHHHHHHHHHhh--ccCCc----eeeccCcCC-CCCccccc-eeeEEecccccCC-------- Confidence 11 11233323335799999999887541 11100 001111111 11246777 58875322 1110 Q ss_pred CCCccceeEeecccc---------cccccccCC----ccccceeeeeeeecce-ecC--ccc---cccCCcc Q lcl|NC_014036. 443 DNEMDAGIYYAPYVA---------LTPLRGSDP----KNFQPVMGFKTRYGVG-INP--FAN---SRSQAPS 495 (522) Q Consensus 443 ~~~~d~glfyaPYv~---------~~~~~~~Dp----~s~qP~~~~~tRY~l~-~nP--~~~---~~~~~~~ 495 (522) ..-+..++|+.+-. +......++ ...+=.+-...|+++. .+| |.. ...-.|. T Consensus 334 -~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 334 -TESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred -CCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 00011111211100 000001111 2333445566666653 233 211 1111111 No 92 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=36.80 E-value=1.2 Score=20.24 Aligned_cols=302 Identities=10% Similarity=0.034 Sum_probs=122.6 Q ss_pred HhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccccccccccccccCcchh-hHHHHHHhhhhhhhceee Q lcl|NC_014036. 36 IMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGV 114 (522) Q Consensus 36 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GV 114 (522) ..|+|+....- +.+ ...+.+-+...+ -+.. ++.+++. ..-|.+. .+++.+..+.+..+++-+ T Consensus 1 ~~~~~~~~~~~-~~f---------~~~~~~~~~~~a-----~~~~-~~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:93 1 MEQTQKLKLNL-QHF---------ASNNVKPQVFNP-----DNVM-MHEKKDG-TLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CchhHHHHHHH-HHH---------HHhhhhhhhccc-----cccc-ccCCCcc-eechhHHHHHHHHHHhhchhhhhcce Confidence 22222111110 001 000000000000 0001 1111111 1122233 456666678888899999 Q ss_pred ccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeee Q lcl|NC_014036. 115 QPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQN 194 (522) Q Consensus 115 QPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~ 194 (522) -||++++-- |.-.... +.+. T Consensus 64 ~~~~~~~~~-------ip~~~~~------------~~a~----------------------------------------- 83 (324) T protein:vir:93 64 EPMEGTEKK-------FTFWADK------------PGAY----------------------------------------- 83 (324) T ss_pred eeccCCceE-------EEEEecC------------ccee----------------------------------------- Confidence 999876522 2110000 0000 Q ss_pred ccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhH Q lcl|NC_014036. 195 VSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSV 274 (522) Q Consensus 195 ~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ 274 (522) ..+| +..+++..-++++++++.+..+-....|- T Consensus 84 --------------------------------------~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ 116 (324) T protein:vir:93 84 --------------------------------------WVGE---------GQKIETSKATWVNATMRAFKLGVILPVTK 116 (324) T ss_pred --------------------------------------eecC---------CccccccccceeEEEEEeEEEEEeehhhH Confidence 0011 01133334455677777777777788999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccc--ccccchhHHHHH Q lcl|NC_014036. 275 ELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPI--DVRGARWAGESY 352 (522) Q Consensus 275 ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~--d~~~~r~~~E~~ 352 (522) ||.+|-. .|.+++|.+-|+..|...+++.+|.=-..... ..|+++..... -+.+ T Consensus 117 ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~------------~~~~~~~~~~~~~~~~~-------- 172 (324) T protein:vir:93 117 EFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF------------GKSIAQSIEKTNKVIKG-------- 172 (324) T ss_pred HHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc------------Cccccccccccceeccc-------- Confidence 9999953 46799999999999999999999832111000 01121111000 0000 Q ss_pred HHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCc Q lcl|NC_014036. 353 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYAR 432 (522) Q Consensus 353 r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 432 (522) ...+-.|.++-+.|.. .+...+.+||+|.....|..+-- +.+. ....+.. .+.|.| ++|++.+... T Consensus 173 ~~~~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d----~~G~---~~~~~~~----~~~l~G-~PVv~~~~~~ 238 (324) T protein:vir:93 173 DFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVD----PETK---ERIYDRN----SDSLDG-LPVVNLKSSN 238 (324) T ss_pred cccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhC----CCCC---eeecCCC----CCcccc-eeeEeecCCC Confidence 0112223333333332 23455689999999999875411 1110 0011111 245766 6888866533 Q ss_pred --cceE--------EEEEecCCCccceeEeecccccccccccCC------ccccceeeeeeeeccee-cC--ccccccCC Q lcl|NC_014036. 433 --GDYF--------TVGYKGDNEMDAGIYYAPYVALTPLRGSDP------KNFQPVMGFKTRYGVGI-NP--FANSRSQA 493 (522) Q Consensus 433 --~dy~--------~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP--~~~~~~~~ 493 (522) ...+ ++|..+.-+.+ ...+..+......|. ..-|=.+=+..||+..+ +| |+. .+ T Consensus 239 ~~~~~i~~gdfs~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~-l~-- 311 (324) T protein:vir:93 239 LKRGELITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK-LV-- 311 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEE-Ee-- Confidence 2223 33333322211 001100111000110 01122333445665542 33 111 01 Q ss_pred ccccccCcchHHhhccccce Q lcl|NC_014036. 494 PSDRITSGMITKEMFGKNAY 513 (522) Q Consensus 494 ~~~~i~~g~~~~~~~~~~~~ 513 (522) +.++..-+-.... T Consensus 312 -------~a~~~~~~~~~~~ 324 (324) T protein:vir:93 312 -------PADKRTDSVPGEV 324 (324) T ss_pred -------cccccCCCCCCCC Confidence 0001100000111 No 93 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=35.25 E-value=1.2 Score=20.07 Aligned_cols=346 Identities=15% Similarity=0.126 Sum_probs=131.0 Q ss_pred CcchHHHHHhhhhhhccccc-h-----------hhhc---chhhhHHHH--HHhhhHHHHhhhhhhhcc---hhhh---- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG-L-----------PDIA---TKSKKQLIA--AIMEAQEKDAEVDPVYRD---EKIV---- 56 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-~-----------~~i~---~~~~~~~~~--~~~enq~~~~~~~~~~~~---~~~~---- 56 (522) |++.++|+++=.-.++.-.. + .++. .... +..+ .-|+.|.+++.+...... ..+. T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~-~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 79 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLT-AAKARRDAINDQIKDLEAENKANSDPDKPVDNAQP 79 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcc Confidence 88866665553333221000 0 0111 0000 0011 112333222222111000 0000 Q ss_pred --------------hhhccccccccccccccccccccccccccccccccCcchhhHHHHHHhhhhhhhceeeccCCchhh Q lcl|NC_014036. 57 --------------ESFGGFLAEAEIAGDHGYDATKIASGNSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTG 122 (522) Q Consensus 57 --------------~~~~~~l~ea~~~~~~g~~~~~~~~~t~tg~v~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTG 122 (522) .++..+|..-. ...+.......++.|.+.--.+..-.++++..+..+-.++|.+.||+++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~l~~~~----~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) T protein:vir:10 80 NGTDLKKKPIDAKKKAINDFIHSHG----KVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKG 155 (394) T ss_pred cccchhhhHHHHHHHHHHHHHhccc----hhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCce Confidence 11111111100 000000000111122221111111246666667777889999999988754 Q ss_pred hheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccc Q lcl|NC_014036. 123 QVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTV 202 (522) Q Consensus 123 LIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~ 202 (522) -+--.+ .... ...| T Consensus 156 ~~~~~~-----~~~~-------------~~~~------------------------------------------------ 169 (394) T protein:vir:10 156 TYPILK-----RATD-------------RFSS------------------------------------------------ 169 (394) T ss_pred EEEEEe-----cCCC-------------cccc------------------------------------------------ Confidence 333221 0000 0000 Q ss_pred CCCCcccccccccccccccccccccccccchhhhhccccCCCCCcccccc-ceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014036. 203 TGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEM-GFRIDKQVIEARSRQLKAQYSVELAQDLR 281 (522) Q Consensus 203 tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EM-sFsIEK~TVtAKSRALKAEYT~ELAQDLK 281 (522) .+|. ...++. ..++++++...|.-+-...+|-||.+|- T Consensus 170 -------------------------------~~E~---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds- 208 (394) T protein:vir:10 170 -------------------------------VAEL---------AENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADS- 208 (394) T ss_pred -------------------------------cccc---------ccccccccccceeEEeeeeeeEeeehhHHHHHhhh- Confidence 0010 001111 1233444444444455577999999984 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHH Q lcl|NC_014036. 282 AVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDK 361 (522) Q Consensus 282 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~ 361 (522) ..|.+++|.+-|+..|..-+|+.|+.-...... .++.... ..+....++..... T Consensus 209 ---~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~-------------~~~~~~~----------~~d~l~~~~~~~~~ 262 (394) T protein:vir:10 209 ---AVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTA-------------KATTTDT----------LVDSLKHILNVDLD 262 (394) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------ccccccc----------cHHHHHHHHHhhhh Confidence 256799999999999999999999843321111 0111110 01112222211111 Q ss_pred HHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc-ccccccccccccceeEEEecCceEEEecC--CCcc---ce Q lcl|NC_014036. 362 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ-GLQKTLNVDTTKAVFAGVLGGVYKVYIDQ--YARG---DY 435 (522) Q Consensus 362 ~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~-~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~~~---dy 435 (522) . .+ . ..+|++|.....|..+-. ..... |... ..+.+.....++|.| ++|++.. +.+. +. T Consensus 263 ~--------~~-~-a~~vmn~~~~~~l~~lkd--~~G~~i~~~~--~~~~~~~~~~~~L~G-~PV~~~~~~~~~~~~~~~ 327 (394) T protein:vir:10 263 P--------AY-S-RALVVTQSLFNTLDTLKD--KNGRYLLHDA--SDSITDGTAKGTVLG-VPVYVVGDALLGSAAGDQ 327 (394) T ss_pred h--------hc-c-CEEEecHHHHHHHHHhhc--cCCCeeeecc--ccccccCCccccccc-ceeEEecccccCCCCCce Confidence 1 11 2 357899999888875411 10000 0000 011122223457887 5887632 2221 11 Q ss_pred -EEEEEecCCCccceeEeecccccccccccCCccccceeeeeeeecce-ecCccccc-cCCccccccCcchHHhhccc Q lcl|NC_014036. 436 -FTVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INPFANSR-SQAPSDRITSGMITKEMFGK 510 (522) Q Consensus 436 -~~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~-~~~~~~~i~~g~~~~~~~~~ 510 (522) +++|---. ++....- ....+...|...|.-.+-...|++.. .||-+... +-.+. ..|..- --|| T Consensus 328 ~i~~gd~s~-----~~~~~~~-~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~---~~~~~~--~~~~ 394 (394) T protein:vir:10 328 KAFVGDLKR-----GVLFADR-QQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAGYFVTNTDA---ASGSTS--GTGK 394 (394) T ss_pred EEEEeeccc-----cEEEEee-cceEEEEecccccceeEEEEEEeccEEeccccEEEEEeecc---cCCCCC--CCCC Confidence 22220000 0000000 11112223445555556666777754 34422100 00000 000000 0122 No 94 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=34.81 E-value=1.3 Score=20.02 Aligned_cols=334 Identities=17% Similarity=0.123 Sum_probs=116.6 Q ss_pred CcchHHHHHhhhhhhccccc----h-----------hhhcchhh--hHHHHHH--hhhHHHHhhhhh-hh---------- Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEG----L-----------PDIATKSK--KQLIAAI--MEAQEKDAEVDP-VY---------- 50 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~----~-----------~~i~~~~~--~~~~~~~--~enq~~~~~~~~-~~---------- 50 (522) |-+.++|.++|.-+.+.-.. + .+|....+ ..+-+++ |+.|.+.+..+. .. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 88887777776665542111 0 11110000 0011111 222222211110 00 Q ss_pred --cchhhhhhhcccccccccccccccc--------cccccccccc-ccccccCcchh--hHHHHHHhhhhhhhceeeccC Q lcl|NC_014036. 51 --RDEKIVESFGGFLAEAEIAGDHGYD--------ATKIASGNSS-GAITNIGPAVI--GMVRRAIPNLIAFDICGVQPM 117 (522) Q Consensus 51 --~~~~~~~~~~~~l~ea~~~~~~g~~--------~~~~~~~t~t-g~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPm 117 (522) .++.....+..++... ..+..+.. -..+.+++.+ |.+ .=|.=+ .++++....-+-.+++.|.|+ T Consensus 81 ~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~~~~~~al~~~t~s~gG~--~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~ 157 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHA-ILPNEFEKPSMEAQRLLHALPTGNDSGGDK--LLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred cchhhHHHHHHHHHHHHH-hhhhhhhhhhhhhHHHHHhhccCcCCCCce--eechhHHHHHHHHHHhhchhhhheeeeec Confidence 0001111111111110 00111100 0001111111 110 012111 133333333344567777666 Q ss_pred CchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccc Q lcl|NC_014036. 118 TGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSG 197 (522) Q Consensus 118 TGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~ 197 (522) ++.+. . +-.+.. +.. T Consensus 158 ~~~~~--p--~~~~~~----------------~~a--------------------------------------------- 172 (387) T protein:vir:93 158 KGLEI--P--RVSYTL----------------DDD--------------------------------------------- 172 (387) T ss_pred CCceE--E--EEeecC----------------Ccc--------------------------------------------- Confidence 43210 0 000000 000 Q ss_pred cccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHH Q lcl|NC_014036. 198 APVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELA 277 (522) Q Consensus 198 ~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELA 277 (522) . ..+|. ...++...+++.++..++.-+-...+|-||. T Consensus 173 --------------------------~--------~v~E~---------~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell 209 (387) T protein:vir:93 173 --------------------------D--------FITDV---------ETAKELKLKGDTVKFTTNKFKVFAAISDTVI 209 (387) T ss_pred --------------------------c--------cccCc---------ccccccccccceeeeeheeeeeechhhHHHH Confidence 0 01110 0011222233445555555566788999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHH Q lcl|NC_014036. 278 QDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLI 357 (522) Q Consensus 278 QDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~ 357 (522) ||- ..|.|++|.+-|+..|..-.|..++-.-+-+... .|++.-.....+.+. .++- T Consensus 210 ~Ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p------------~g~l~~~~~~~v~~~--------~~~d 265 (387) T protein:vir:93 210 HGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSGLD------------HMSFYNGSVKEVEGA--------DMYD 265 (387) T ss_pred hhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcccc------------ceeeecccccccccc--------chHH Confidence 984 3456899999998888876666666221111110 122111111111111 1222 Q ss_pred HHHHHHHHHHHhccccCCcEEEEchhH-HHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceE Q lcl|NC_014036. 358 QIDKEANEIARQTGRGAGNFIIASRNV-VSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYF 436 (522) Q Consensus 358 ~i~~~an~I~r~T~~g~gn~~v~S~~v-a~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 436 (522) .|..+-+.+...=+ ..+.|+ +++.. ..+|....-+. ...| ...+ .+|.| ++||+..+++. + T Consensus 266 ~i~~~~~~l~~~~~-~~a~~~-mn~~t~~~~~~~~~d~~--~~~~-------~~~~----~~llG-~PV~~~~~~~~--~ 327 (387) T protein:vir:93 266 AIINALADLHEDYR-DNATIY-MRYADYVKIISVLSNGT--TNFF-------DTPA----EKVFG-KPVVFTDAAVK--P 327 (387) T ss_pred HHHHHHhccChhhh-cCCEEE-EechHHHHHHHHHhcCC--Cccc-------ccCC----ccccc-cceEEecCCCc--e Confidence 33333333333222 244554 55544 44444332110 0000 0111 25776 59988776553 3 Q ss_pred EEEEecCCCccceeEeecccccccccccCCccccceeeeee--eecce-ecCccccccCCccccccCcchHHhhccccce Q lcl|NC_014036. 437 TVGYKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKT--RYGVG-INPFANSRSQAPSDRITSGMITKEMFGKNAY 513 (522) Q Consensus 437 ~vG~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~t--RY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~ 513 (522) ++|-- +-||-=|.... ...+.......++|.. ||+.. .+|= - T Consensus 328 ~~GDf-------~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~r~d~~v~~~e--------------------------A 372 (387) T protein:vir:93 328 IVGDF-------NYFGINYDGTT--YDTDKDVKKGEYLFVLTAWYDQQRTLDS--------------------------A 372 (387) T ss_pred eeeeh-------hhhheehhhhe--eeecccccCCceeEEEEeeeCceeechh--------------------------h Confidence 33411 11111111100 1112222334555554 44332 2331 1 Q ss_pred eeeeeeccC Q lcl|NC_014036. 514 FRKVYVKGL 522 (522) Q Consensus 514 ~r~~~Vk~~ 522 (522) ||.+-||-= T Consensus 373 ~~~l~~k~~ 381 (387) T protein:vir:93 373 FRIAKAKEN 381 (387) T ss_pred eEEEEeecC Confidence 222222222 No 95 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=33.85 E-value=1.3 Score=19.91 Aligned_cols=352 Identities=15% Similarity=0.142 Sum_probs=117.9 Q ss_pred CcchHHHHHhhhhhhcccc---chhhhcchhhhH---HHHHHhhhHHHHhhhhhhhcchhh---hhhhccc---cccc-c Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQE---GLPDIATKSKKQ---LIAAIMEAQEKDAEVDPVYRDEKI---VESFGGF---LAEA-E 67 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~---~~~~i~~~~~~~---~~~~~~enq~~~~~~~~~~~~~~~---~~~~~~~---l~ea-~ 67 (522) +.+-+.|.++..-+-+.++ .+..-....+.. -...-..+|++. +..+.... ..++... +.++ + T Consensus 42 ~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 117 (435) T protein:vir:80 42 SSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKA----PEVKGAKMARMVRALAAARGDAQLASK 117 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccch----hhhhHHHHHHHHHHHHhccchhHHHHH Confidence 2222333333332211000 000000000000 000000001000 00000000 0000000 0000 0 Q ss_pred --ccccccccccccccccccccccccCcchhh------HHHHHHhhhhhhhc-eeeccCCchhhhheeeeeeecCCCCCC Q lcl|NC_014036. 68 --IAGDHGYDATKIASGNSSGAITNIGPAVIG------MVRRAIPNLIAFDI-CGVQPMTGPTGQVFALRAVYGKDPLAS 138 (522) Q Consensus 68 --~~~~~g~~~~~~~~~t~tg~v~~~~P~Li~------l~Rra~~~lI~~DI-~GVQPmTGPTGLIFAMRSrY~~~~~~t 138 (522) .....+.+..+.. .+.+ ......||+ +++++-++.+...+ +=+.||+.+. + +|+-. T Consensus 118 ~~~~~~~~~~~~~~~-~~~~---~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~------~~p~~---- 182 (435) T protein:vir:80 118 LAIERGFGEEVAMSL-NTLS---PGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-I------TIPRL---- 182 (435) T ss_pred HHHhhhhhhhhhhhh-cccC---CCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCc-e------EEEEE---- Confidence 0000000000000 0111 011112221 22222233333333 1122322211 0 11000 Q ss_pred cccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccc Q lcl|NC_014036. 139 GAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQ 218 (522) Q Consensus 139 ~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~ 218 (522) ++ + +. T Consensus 183 ----------------~~-----------------------------------------------~--~~---------- 187 (435) T protein:vir:80 183 ----------------KG-----------------------------------------------G--AI---------- 187 (435) T ss_pred ----------------eC-----------------------------------------------C--cc---------- Confidence 00 0 00 Q ss_pred cccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHH Q lcl|NC_014036. 219 EKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILAT 298 (522) Q Consensus 219 ~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILST 298 (522) ....+| +...++...++++++...+.-+-....|.||.+|-.- +.|.|+.|.+-|+. T Consensus 188 ------------a~~v~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~ 244 (435) T protein:vir:80 188 ------------VGYIGA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTA 244 (435) T ss_pred ------------eeeecc---------CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHH Confidence 000112 1124455566777777777777788899999999432 45678889999999 Q ss_pred HHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEE Q lcl|NC_014036. 299 EIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFI 378 (522) Q Consensus 299 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~ 378 (522) -|...+++-||. -.-. +. + -.|++.......+... -.+.....+...+.+.-..+.....+-..... T Consensus 245 a~~~~~d~a~l~---G~G~-~~----~----p~Gi~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 311 (435) T protein:vir:80 245 AIGAREDKAFIR---DDGT-AN----T----PKGLRFWALPGNVITA-SDGSTLQKIETDLGKAILALENADANLTQPGW 311 (435) T ss_pred HHHHHHHHHhhc---cCCC-CC----c----ccceeecccccceeec-ccccchhhHHHHHHHHHHHhhccccccccCEE Confidence 999999988872 1100 00 0 0122111100000000 00011112222222222222221112234567 Q ss_pred EEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccc--------eEE--------EEEec Q lcl|NC_014036. 379 IASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGD--------YFT--------VGYKG 442 (522) Q Consensus 379 v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~--------vG~KG 442 (522) |++|.....|..+-- .. + . ....+.+. |+|.| ++||++.+.|.+ .|+ ||-.+ T Consensus 312 vmn~~~~~~L~~lkd--~~-G---~-~l~~~~~~----~~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~ 379 (435) T protein:vir:80 312 IMAPRTFRFLEGLRD--GN-G---N-KVYPELAN----GMLKG-YPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEE 379 (435) T ss_pred EEcHHHHHHHHhhhc--cC-C---c-eeccCCCC----CeEee-eeeEEeccccccccCCCCcceEEEEEcccEEEEeec Confidence 999999999875421 11 1 0 11112222 46766 699998886532 122 33332 Q ss_pred CCCccceeEeecccccccccccCCccc---cceeeeeeeecceec-CccccccCCccccccCcchHHh Q lcl|NC_014036. 443 DNEMDAGIYYAPYVALTPLRGSDPKNF---QPVMGFKTRYGVGIN-PFANSRSQAPSDRITSGMITKE 506 (522) Q Consensus 443 ~~~~d~glfyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~n-P~~~~~~~~~~~~i~~g~~~~~ 506 (522) .-..+ ..+|.-+..-...--..| +=.+=..-|+++.+. | ++..+.+|-.|.. T Consensus 380 ~~~i~----~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~--------~a~~~l~~~~~~~ 435 (435) T protein:vir:80 380 TLEID----YSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHV--------ESIAVLSGVAWGA 435 (435) T ss_pred ceEEE----EeccccccccccchhhhhhcCcceeeeeeeeCcEeecc--------cceEEEeccCCCC Confidence 22211 111111000000000001 122234455555442 3 2223344555544 No 96 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=32.82 E-value=1.4 Score=19.79 Aligned_cols=360 Identities=12% Similarity=0.065 Sum_probs=129.1 Q ss_pred CcchHHHHH---hhhhhhccccchhhhcchhh------hHHHHHH------hhhHHHHhhhhhhhcchhhhhhh------ Q lcl|NC_014036. 1 MSKKNELME---KWNDLLESQEGLPDIATKSK------KQLIAAI------MEAQEKDAEVDPVYRDEKIVESF------ 59 (522) Q Consensus 1 ~~~~~~l~~---kw~p~l~~~~~~~~i~~~~~------~~~~~~~------~enq~~~~~~~~~~~~~~~~~~~------ 59 (522) |..-+.|.| .|.-..+. +-+....-+ ++....| ++.+.+.+++...- .+...... T Consensus 1 m~~~~~lee~~a~l~~~~~~---~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 76 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDD---TSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPA-PKGPADGGTPLTPA 76 (419) T ss_pred CCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhcccccc Confidence 554433333 23332221 111111000 1111111 11111111111100 00000000 Q ss_pred --------cccccccc----ccccc--cc-------------cccccccccccccccccCcchhh-HHH-HHHhhhhhhh Q lcl|NC_014036. 60 --------GGFLAEAE----IAGDH--GY-------------DATKIASGNSSGAITNIGPAVIG-MVR-RAIPNLIAFD 110 (522) Q Consensus 60 --------~~~l~ea~----~~~~~--g~-------------~~~~~~~~t~tg~v~~~~P~Li~-l~R-ra~~~lI~~D 110 (522) +....+.+ ..+.+ +. .......++.+.+-...-|.+++ ++. +.-..++..+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~ 156 (419) T protein:vir:94 77 EAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVAD 156 (419) T ss_pred ccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhh Confidence 00000000 00000 00 00000000011111111233331 111 1112334566 Q ss_pred ceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccce Q lcl|NC_014036. 111 ICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRV 190 (522) Q Consensus 111 I~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~ 190 (522) +|.+.||++++.-+ +| ..+.+ . T Consensus 157 ~~~~~~~~~~~~~~--~~-----~~~~~-----------------~---------------------------------- 178 (419) T protein:vir:94 157 LLDQQNADYNVLEY--IR-----DTSGT-----------------A---------------------------------- 178 (419) T ss_pred cceeeeccCCceee--ee-----ecccc-----------------c---------------------------------- Confidence 77777776653211 11 00000 0 Q ss_pred eeeeccccccccCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccc Q lcl|NC_014036. 191 FLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKA 270 (522) Q Consensus 191 ~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKA 270 (522) +... +.+-+...+| +..+++...++++++..+|.=+-.. T Consensus 179 --------~~~~------------------------~~~~a~~v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~ 217 (419) T protein:vir:94 179 --------GAGS------------------------TWNKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWL 217 (419) T ss_pred --------cccc------------------------cCcccceecC---------CccccccccceeeEEeeeeeEEEee Confidence 0000 0000011222 1235566666777777777777778 Q ss_pred hhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHH Q lcl|NC_014036. 271 QYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGE 350 (522) Q Consensus 271 EYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E 350 (522) .+|-||.||.- +.+++|.+-|+..|...+|+.||. -...-...|+-. ..|+.-... .. -+... T Consensus 218 ~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~aii~---G~G~~~p~Gi~~----~~~~~~~~~----~~-~~~~~ 280 (419) T protein:vir:94 218 PITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLN---GNGSTEMQGILT----TPGIGTYQQ----PK-PTAPA 280 (419) T ss_pred hhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHh---ccCcccccceec----ccccccccc----cc-ccccc Confidence 89999999962 358999999999999999999982 111101111100 011100000 00 00001 Q ss_pred HHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccccc-ccccccccccccceeEEEecCceEEEecC Q lcl|NC_014036. 351 SYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQ-GLQKTLNVDTTKAVFAGVLGGVYKVYIDQ 429 (522) Q Consensus 351 ~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~-~~~~~~~~d~~~~~~~G~l~~~~~vy~D~ 429 (522) ..-..+..|.++-+.+.. .+...+.+||+|.....|..+- +..... ..+... .+. ..++|.| ++|+++. T Consensus 281 t~~~~~~~l~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~~k--~~~~~~~~~~~~~-~~~----~~~~l~G-~pV~~~~ 350 (419) T protein:vir:94 281 TDEPPLVDIRRAKTVAEI--AGFPPDGVVVHPQDWESIELDQ--APGSGVFRVIANV-QGE----ATPRIWG-LNVVSTV 350 (419) T ss_pred ccchhHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHHh--hcCCCceeecCCc-ccC----CCccccc-eeeEEcC Confidence 112233344444444432 2335678999999988876441 111110 011110 111 2346776 6999999 Q ss_pred CCccceEEEEEecCC-----CccceeEeecccccccccccCCccccceeeeeeeeccee-cCccccccCCccccccCcch Q lcl|NC_014036. 430 YARGDYFTVGYKGDN-----EMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMI 503 (522) Q Consensus 430 y~~~dy~~vG~KG~~-----~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~ 503 (522) ..+..-+++|--... ..+-.+-..++.... =..-+=.+=+..||++.+ +|= T Consensus 351 ~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~------~~~~~~~~r~~~r~d~~v~~~~----------------- 407 (419) T protein:vir:94 351 AIAQGTALVGGFRQGATLWSRQGITVLMTDSHADF------FTANTLVILAEFRANLAVYQPK----------------- 407 (419) T ss_pred CCCCccEEEeeccceEEEEEecceEEEEeccccch------hhcCcEEEEEEEeeccEEeccc----------------- Confidence 877544444421000 000011111111000 011222333445555432 221 Q ss_pred HHhhccccceeeeeeeccC Q lcl|NC_014036. 504 TKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 504 ~~~~~~~~~~~r~~~Vk~~ 522 (522) -|.++-++-. T Consensus 408 ---------a~~~~~~~aa 417 (419) T protein:vir:94 408 ---------AFVRVTFAAA 417 (419) T ss_pred ---------cEEEEEeccC Confidence 1111222222 No 97 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=32.01 E-value=1.5 Score=19.69 Aligned_cols=338 Identities=16% Similarity=0.169 Sum_probs=125.5 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhh--hHHHH--HHhhhHHHHhhhhhh--------hcch--------hh----h Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSK--KQLIA--AIMEAQEKDAEVDPV--------YRDE--------KI----V 56 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~--~~~~~--~~~enq~~~~~~~~~--------~~~~--------~~----~ 56 (522) +.+....+++ -..+....+.++..... .+..+ .=|+.|.+.+..... .... .. . T Consensus 15 ~~e~~~~l~~--~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 92 (389) T protein:vir:10 15 CADLNAQLNA--KLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAKK 92 (389) T ss_pred HHHHHHHHHH--HHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHHH Confidence 1111111110 00000001111111100 00000 012233333221110 0000 00 0 Q ss_pred hhhcccccccccccccccccccccccccc-ccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecC Q lcl|NC_014036. 57 ESFGGFLAEAEIAGDHGYDATKIASGNSS-GAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGK 133 (522) Q Consensus 57 ~~~~~~l~ea~~~~~~g~~~~~~~~~t~t-g~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~ 133 (522) .++..+|- ..+.....+..++++ |.+. =|--+ .++++..+..+..++|.|.||+++++-+--++ T Consensus 93 ~~~~~~lr------~~~~~~~~~~~~t~~~gg~~--vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----- 159 (389) T protein:vir:10 93 KAINDFIH------SHGKVIDATSKVTSTEAGVL--IPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILK----- 159 (389) T ss_pred HHHHHHhh------cchhhhhhhcccccCCccee--ehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEe----- Confidence 01111110 011111122222221 2211 13222 45666667778889999999998764322211 Q ss_pred CCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCccccccc Q lcl|NC_014036. 134 DPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAA 213 (522) Q Consensus 134 ~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~ 213 (522) +... ...+- T Consensus 160 ~~~~-------------~~~~~---------------------------------------------------------- 168 (389) T protein:vir:10 160 RATD-------------RFSSV---------------------------------------------------------- 168 (389) T ss_pred cCCC-------------ccccc---------------------------------------------------------- Confidence 0000 00000 Q ss_pred ccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHH Q lcl|NC_014036. 214 VIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELS 293 (522) Q Consensus 214 ~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELs 293 (522) +++ ++. ...+...|.+..+++.|. +--..+|-||.+|- ..|.+++|. T Consensus 169 -------------~E~-----~~~----~~~~~~~~~~i~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~ 215 (389) T protein:vir:10 169 -------------AEL-----AEN----PKLAEPEFNKVDWSVATY-------RGAIPLSEEAIADS----AVDLTALVG 215 (389) T ss_pred -------------ccc-----ccc----cccccccceeeeeeheee-------EeeehhhHHHHhhh----hHHHHHHHH Confidence 000 000 000112355555555555 44557899999984 346788999 Q ss_pred HHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_014036. 294 AILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRG 373 (522) Q Consensus 294 NILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g 373 (522) +-|...+..-+|..|+.-+......+. .+.... +.+..++... . ...+ T Consensus 216 ~~la~~~~~~~~~~i~~g~~~~~~~~~----------~~~~~~-------------d~l~~~~~~~-~-------~~~~- 263 (389) T protein:vir:10 216 QSIKEKSVNTYNAMIAPVLQSFTAKKT----------TTDTLV-------------DSLKHILNVD-L-------DPAY- 263 (389) T ss_pred HHHHHHHHHHHHHHHhhhhcccccccc----------cccccH-------------HHHHHHHHhh-h-------hhhh- Confidence 999999999999999854432211111 111111 1122222111 1 1122 Q ss_pred CCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEe-cC-CCcc---ce-EEEEEecCCCcc Q lcl|NC_014036. 374 AGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYI-DQ-YARG---DY-FTVGYKGDNEMD 447 (522) Q Consensus 374 ~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~-D~-y~~~---dy-~~vG~KG~~~~d 447 (522) ...+|++|.....|..+- +.....-.+.. ..+.+...+-++|.| ++||+ |. ..+. |. +++|= +. T Consensus 264 -~a~~~~n~~~~~~L~~lk--d~~G~~i~~~~-~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~gd-----~~ 333 (389) T protein:vir:10 264 -SRALVVTQSLFNTLDTLK--DKNGRYLLHDA-SDSITDGTAKGTILG-VPVYVVGDTLLGSLAGDQKAFVGD-----LK 333 (389) T ss_pred -CcEEEecHHHHHHHHHhh--ccCCCeeeecC-ccccccccccccccc-ceeEEecccccCCCCCceEEEEee-----cc Confidence 245789999988887541 10000000000 011112223357888 58875 32 2221 21 33330 00 Q ss_pred ceeEeecccccccccccCCccccceeeeeeeecce-ecCcc--c-cccCCccccccCcchHHhhccc Q lcl|NC_014036. 448 AGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVG-INPFA--N-SRSQAPSDRITSGMITKEMFGK 510 (522) Q Consensus 448 ~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~--~-~~~~~~~~~i~~g~~~~~~~~~ 510 (522) .+.++... ....+...|-..|.-.+-..-|++.. .||=+ . ..+..+ ...+|| T Consensus 334 ~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~----------~~~~~~ 389 (389) T protein:vir:10 334 RGVLFTDR-QQVTLAWEDSKIYGKYLGAAFRFGVQKADSKAGYFVTNTDVP----------GSALGK 389 (389) T ss_pred ccEEEEee-cceEEEeeccccccceEEEEEEeccEEecccceEEEEeeccC----------CCCCCC Confidence 00000000 11122233445555667777788865 33311 0 011111 011223 No 98 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=31.83 E-value=1.5 Score=19.67 Aligned_cols=344 Identities=14% Similarity=0.181 Sum_probs=124.3 Q ss_pred Ccch--------HHHHHhhhhhhccccc-hhhhcchhh------hHHHHHH--hhhHHHHh----hhhh-h------hcc Q lcl|NC_014036. 1 MSKK--------NELMEKWNDLLESQEG-LPDIATKSK------KQLIAAI--MEAQEKDA----EVDP-V------YRD 52 (522) Q Consensus 1 ~~~~--------~~l~~kw~p~l~~~~~-~~~i~~~~~------~~~~~~~--~enq~~~~----~~~~-~------~~~ 52 (522) |+-. ++|.++..-+-+..|- +.++....+ +.+-++| +|++.+++ .+.. . -.. T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKVA 80 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Confidence 5544 4455555444321110 111111100 0111111 12222211 1110 0 000 Q ss_pred hhhhhhhccccccccccccccccccccccccc-cccc---cccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeee Q lcl|NC_014036. 53 EKIVESFGGFLAEAEIAGDHGYDATKIASGNS-SGAI---TNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALR 128 (522) Q Consensus 53 ~~~~~~~~~~l~ea~~~~~~g~~~~~~~~~t~-tg~v---~~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMR 128 (522) ....++|..+|-......-...+.+.+..++. .|.+ ..+.+-++.+.| ...+-.+++-+.||++++..+.- T Consensus 81 ~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~-- 155 (401) T protein:vir:44 81 AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLK---DEVVMRQEATVITVGGSDYKKLV-- 155 (401) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEE-- Confidence 00112233333211100000011111222221 1111 233344444444 34556778888898877532111 Q ss_pred eeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcc Q lcl|NC_014036. 129 AVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDD 208 (522) Q Consensus 129 SrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~ 208 (522) ... .+...| T Consensus 156 -----~~~------------~~~a~w------------------------------------------------------ 164 (401) T protein:vir:44 156 -----NLG------------GTASGW------------------------------------------------------ 164 (401) T ss_pred -----ecC------------Ccccee------------------------------------------------------ Confidence 000 000000 Q ss_pred cccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCCh Q lcl|NC_014036. 209 ALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDA 288 (522) Q Consensus 209 ~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDA 288 (522) .+|... ...+....|.+..|.+.| -+--..+|-||.+|- .+|. T Consensus 165 -------------------------v~E~~~-~~~~~~~~~~~v~~~~~k-------~~~~~~iS~ell~ds----~~~l 207 (401) T protein:vir:44 165 -------------------------VGETDT-RSQTATSRLGLIEPFMGE-------IYGNPQATQKMLDDA----FFNV 207 (401) T ss_pred -------------------------eccccc-cCccccccceeeeeehhh-------eeeehhhhHHHHhcc----hHHH Confidence 011000 000011124444444444 444567899999984 4677 Q ss_pred HHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeecccccccc---------------ccchhHHHHHH Q lcl|NC_014036. 289 DAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDV---------------RGARWAGESYK 353 (522) Q Consensus 289 EaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~---------------~~~r~~~E~~r 353 (522) +++|.+-|+..|...+++.+|. -.-. + .-.|++........ ....-..+... T Consensus 208 ~~~i~~~la~ai~~~~~~~~l~---G~G~-~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~ 274 (401) T protein:vir:44 208 EAWINSELATEFAEQEEIAFTT---GDGT-K---------KPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAII 274 (401) T ss_pred HHHHHHHHHHHHHHHHHhhhhc---cCCC-C---------ccceeeccccccccccccccccccccccccccccCHHHHH Confidence 9999999999999999999882 1100 0 00122211100000 00000011122 Q ss_pred HHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc Q lcl|NC_014036. 354 ALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG 433 (522) Q Consensus 354 ~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 433 (522) .|+.. +.. .+-.+...|+++.....|..+ .+..... ....+.+. ..-++|.| ++|+++...|. T Consensus 275 ~~~~~-------l~~--~~~~~a~~v~n~~~~~~L~~l--kd~~G~~----l~~~~~~~-g~~~~l~G-~PVv~~~~~p~ 337 (401) T protein:vir:44 275 KLIYT-------LRK--AHRTGAKFMMNNNSLFAIRLL--KDTEGNY----LWRPGLEL-GQPSSLAG-YGIAENEQMPD 337 (401) T ss_pred HHHHh-------cch--hhhcCCEEEEcHHHHHHHHHh--hccCCce----eecCCcCC-CCCceecc-eeeEEecCcCC Confidence 23322 221 222345678999988888643 1111000 00111111 11246776 68888877552 Q ss_pred ceEEEEEecCCCccceeEeeccccc------c-cccccCCccccceeeeee--eecce-ecCccccccCCccccccCcch Q lcl|NC_014036. 434 DYFTVGYKGDNEMDAGIYYAPYVAL------T-PLRGSDPKNFQPVMGFKT--RYGVG-INPFANSRSQAPSDRITSGMI 503 (522) Q Consensus 434 dy~~vG~KG~~~~d~glfyaPYv~~------~-~~~~~Dp~s~qP~~~~~t--RY~l~-~nP~~~~~~~~~~~~i~~g~~ 503 (522) + |... ..++|..+-.+ . +....||-.=+-.++|.. |++.. .+|-+ T Consensus 338 ----~---~~~~--~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a---------------- 392 (401) T protein:vir:44 338 ----I---AADA--KAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQA---------------- 392 (401) T ss_pred ----c---cCCc--cEEEEeehhccEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccc---------------- Confidence 0 1110 11222221110 0 000122222222233322 44432 23322 Q ss_pred HHhhccccceeeeeeeccC Q lcl|NC_014036. 504 TKEMFGKNAYFRKVYVKGL 522 (522) Q Consensus 504 ~~~~~~~~~~~r~~~Vk~~ 522 (522) |+++.||-= T Consensus 393 ----------~~~l~~~aa 401 (401) T protein:vir:44 393 ----------IKLLKIAAA 401 (401) T ss_pred ----------eEEEEeecC Confidence 222222222 No 99 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=31.24 E-value=1.5 Score=19.60 Aligned_cols=295 Identities=10% Similarity=0.054 Sum_probs=123.8 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |-|.+++. ...|+....+..-| .+ .++ +.. T Consensus 1 ~~~~~~~~------------------~~~~~f~~~~~~~~---------------------~~-~a~----------~~~ 30 (324) T protein:vir:10 1 MEQTQKLK------------------LNLQHFASNNVKPQ---------------------VF-NPD----------NVM 30 (324) T ss_pred CCCchHHH------------------HHHHHHHHHhhccc---------------------ee-ccc----------cee Confidence 11111000 00111111111111 01 010 001 Q ss_pred cccccccccccCcc-hh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhcccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQG 158 (522) Q Consensus 81 ~~t~tg~v~~~~P~-Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g 158 (522) ++.+++. .=|. +. .+++.+..+.+..+++-+.||++.+.-|. +.... +.+. T Consensus 31 -~~~~~~~--liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p----~~~~~---------------~~a~----- 83 (324) T protein:vir:10 31 -MHEKKDG--TLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT----FWADK---------------PGAY----- 83 (324) T ss_pred -ccCCCcc--eechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEeCC---------------ccee----- Confidence 1111111 0122 21 34555556777788888888887542111 00000 0000 Q ss_pred cccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhc Q lcl|NC_014036. 159 AAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQ 238 (522) Q Consensus 159 ~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal 238 (522) ..+| T Consensus 84 --------------------------------------------------------------------------~v~E-- 87 (324) T protein:vir:10 84 --------------------------------------------------------------------------WVGE-- 87 (324) T ss_pred --------------------------------------------------------------------------Eecc-- Confidence 0111 Q ss_pred cccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcccc Q lcl|NC_014036. 239 EQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQV 318 (522) Q Consensus 239 ~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~ 318 (522) +..+++...+++++++..|..+..-.+|-||.+|-. .|.+++|.+.|+..|...|++.+|.--...... T Consensus 88 -------g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~ 156 (324) T protein:vir:10 88 -------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) T ss_pred -------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccC Confidence 112445555677778888888888889999999864 467999999999999999999998321111100 Q ss_pred ccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccccccc Q lcl|NC_014036. 319 GKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 398 (522) Q Consensus 319 ~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~ 398 (522) .|++........ -.. ..-.+..|.++.+.|.. .+...+.+|++|.....|..+-- . T Consensus 157 ------------~~i~~~~~~~~~---~~~---~~~t~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d--~-- 212 (324) T protein:vir:10 157 ------------KSIAQSIEKTNK---VIK---GDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVD--P-- 212 (324) T ss_pred ------------ccccccccccce---ecc---ccCCHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc--c-- Confidence 111110000000 000 00012223334444432 34456778999999998875411 1 Q ss_pred ccccccccccccccceeEEEecCceEEEecCCCcc--ceEEEEEecCCCccceeEeecccccccccc---------cCCc Q lcl|NC_014036. 399 GQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG--DYFTVGYKGDNEMDAGIYYAPYVALTPLRG---------SDPK 467 (522) Q Consensus 399 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~~vG~KG~~~~d~glfyaPYv~~~~~~~---------~Dp~ 467 (522) .+ +.. ..+.. .++|.| ++|++.+.+.. ..+++|-. +.+++... ....++. .|+. T Consensus 213 ~g--~~~-~~~~~----~~~l~G-~PV~~~~~~~~~~~~~~~gd~------~~~~~~~~-~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:10 213 ET--KER-IYDRN----SDTLDG-LPVVNLKSSNLKRGELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVKNED 277 (324) T ss_pred CC--cee-ecCCC----Cccccc-eeEEeecCCCCCcceEEEEec------ccEEEEEe-cCcEEEEeeccccccccccc Confidence 10 001 11111 235777 58888776553 22333321 01111110 0000111 1111 Q ss_pred --------cccceeeeeeeecc-eecC--ccc-----cccCCccccc Q lcl|NC_014036. 468 --------NFQPVMGFKTRYGV-GINP--FAN-----SRSQAPSDRI 498 (522) Q Consensus 468 --------s~qP~~~~~tRY~l-~~nP--~~~-----~~~~~~~~~i 498 (522) +-+=.+=...||+. ..|| |.. ..+..+.+.| T Consensus 278 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 278 GTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 11223333456765 3445 221 1111111222 No 100 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=29.47 E-value=1.7 Score=19.38 Aligned_cols=296 Identities=10% Similarity=0.072 Sum_probs=123.2 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccccccccccccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEAEIAGDHGYDATKIA 80 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 80 (522) |-|.|++. ...|+....+.+-|+ + .++ +.. T Consensus 1 ~~k~~~~~------------------~~~~~~~~~~~~~~~---------------------~-~a~----------~~~ 30 (324) T protein:vir:99 1 MEQTQKLK------------------LNLQHFASNNVKPQV---------------------F-NPD----------NVM 30 (324) T ss_pred CCCchHhh------------------HHHHHHHHHhhhhhh---------------------c-ccc----------cee Confidence 22222111 111111111111110 0 000 001 Q ss_pred cccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGA 159 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~ 159 (522) ++.+++. ..-+.+. .+++.+..+.+-.+++.+.||++.+.-|. +.... +.+. T Consensus 31 -~~~~~~~-lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p----~~~~~---------------~~a~------ 83 (324) T protein:vir:99 31 -MHEKKDG-TLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFT----FWADK---------------PGAY------ 83 (324) T ss_pred -ccCCCcc-eechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEecC---------------ccee------ Confidence 1111111 1111121 34455556677788888888887542111 11000 0000 Q ss_pred ccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhcc Q lcl|NC_014036. 160 APSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQE 239 (522) Q Consensus 160 ~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~ 239 (522) ..+| T Consensus 84 -------------------------------------------------------------------------~v~E--- 87 (324) T protein:vir:99 84 -------------------------------------------------------------------------WVGE--- 87 (324) T ss_pred -------------------------------------------------------------------------Eecc--- Confidence 0111 Q ss_pred ccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccc Q lcl|NC_014036. 240 QFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTAQVG 319 (522) Q Consensus 240 ~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~ 319 (522) +..+++...++++++++.|.-+---..|-||.+|-. .|.+++|.+.|+..|...+++.||.--..... + T Consensus 88 ------g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~-~ 156 (324) T protein:vir:99 88 ------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF-G 156 (324) T ss_pred ------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCcc-C Confidence 112445555667777777777777789999999974 46799999999999999999999832111000 0 Q ss_pred cccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccc Q lcl|NC_014036. 320 KTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAG 399 (522) Q Consensus 320 ~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~ 399 (522) .|++.-...... -.. ..-.+..|.++.+.|. ..+...+.+|++|.....|..+-- +. T Consensus 157 -----------~~~~~~~~~~~~---~~~---~~~~~~~i~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~l~d----~~ 213 (324) T protein:vir:99 157 -----------KSIAQSIEKTNK---VIK---GDFTQDNIIDLEALLE--DDELEANAFISKTQNRSLLRKIVD----PE 213 (324) T ss_pred -----------ccccccccccce---ecc---ccCCHHHHHHHHHhhh--hccCCCCEEEEcHHHHHHHHHhhc----CC Confidence 111110000000 000 0011223334444443 234456778999999999875411 11 Q ss_pred cccccccccccccceeEEEecCceEEEecCCCcc--ceEEEEEecCCCccceeEeecccccccccc---------cCCc- Q lcl|NC_014036. 400 QGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG--DYFTVGYKGDNEMDAGIYYAPYVALTPLRG---------SDPK- 467 (522) Q Consensus 400 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~~vG~KG~~~~d~glfyaPYv~~~~~~~---------~Dp~- 467 (522) + +.. ..+.. .++|.| ++|+|.+.... ..+++|-.. .+++..- ....++. .|+. T Consensus 214 g--~~~-~~~~~----~~~l~G-~PVv~~~~~~~~~~~~i~gd~~------~~~~~~~-~~~~i~~~~~~~~~~~~~~~~ 278 (324) T protein:vir:99 214 T--KER-IYDRN----SDTLDG-LPVVNLKSSNLKRGELITGDFD------KLIYGIP-QLIEYKIDETAQLSTVKNEDG 278 (324) T ss_pred C--cee-ecCCC----Cccccc-eeEEeecCCCCCcceEEEEecc------cEEEEEe-cCcEEEEeecccccccccccc Confidence 0 001 11111 246777 58888776553 223333211 0111100 0000010 0111 Q ss_pred -------cccceeeeeeeecce-ecC--ccc-----cccCCccccc Q lcl|NC_014036. 468 -------NFQPVMGFKTRYGVG-INP--FAN-----SRSQAPSDRI 498 (522) Q Consensus 468 -------s~qP~~~~~tRY~l~-~nP--~~~-----~~~~~~~~~i 498 (522) +-+=.+=...||+.. .|| |.. ..+..+.+.| T Consensus 279 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 279 TPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred cchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 111222233566633 344 111 1111111222 No 101 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=29.06 E-value=1.7 Score=19.33 Aligned_cols=339 Identities=13% Similarity=0.079 Sum_probs=129.1 Q ss_pred CcchHHHHHhhhhhhccccchhhhcchhhhHHHH-------HHhhh------HHHHhhhhh----hhcch----hhhhhh Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLIA-------AIMEA------QEKDAEVDP----VYRDE----KIVESF 59 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~~-------~~~en------q~~~~~~~~----~~~~~----~~~~~~ 59 (522) |.-. +|+++|.-+.+. +.++.+..++.... ..+|. |-+.+.+.. ...++ .-.... T Consensus 1 M~~~-eL~~~~~~~~~~---~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (395) T protein:vir:38 1 MNIN-QLKDAFDMAGQK---VQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPV 76 (395) T ss_pred CCHH-HHHHHHHHHHHH---HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 7765 488888777543 33333222221111 11110 100111100 00000 000000 Q ss_pred cccccccccccc----------ccccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeccCCchhhhheee Q lcl|NC_014036. 60 GGFLAEAEIAGD----------HGYDATKIASGNSSGAITNIGPAVI--GMVRRAIPNLIAFDICGVQPMTGPTGQVFAL 127 (522) Q Consensus 60 ~~~l~ea~~~~~----------~g~~~~~~~~~t~tg~v~~~~P~Li--~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAM 127 (522) .+...+...... .+.........+++++-...=|.-+ .+++.+....+..+++.++||++++|-+- T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-- 154 (395) T protein:vir:38 77 NKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRV-- 154 (395) T ss_pred cccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEE-- Confidence 000000000000 0000000011111111111113332 35555556777888999999999887531 Q ss_pred eeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCc Q lcl|NC_014036. 128 RAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTD 207 (522) Q Consensus 128 RSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~ 207 (522) |...... .+...| T Consensus 155 ---~~~~~~~-----------~~~a~~----------------------------------------------------- 167 (395) T protein:vir:38 155 ---YEKLADI-----------TPLKDL----------------------------------------------------- 167 (395) T ss_pred ---EEeeccC-----------Cccccc----------------------------------------------------- Confidence 1100000 000000 Q ss_pred ccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCC Q lcl|NC_014036. 208 DALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMD 287 (522) Q Consensus 208 ~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLD 287 (522) ++.| +. ...+....|.+..|+..| -+-...+|-||.+|- ..| T Consensus 168 ------------------v~E~------~~---~~~~~~~~f~~v~~~~~k-------~~~~~~iS~ell~ds----~~~ 209 (395) T protein:vir:38 168 ------------------DDES------AL---IGDNDDPELTVVKYLIHR-------YAGITTVTNTLLKDT----VDN 209 (395) T ss_pred ------------------cccc------cc---cccccccceeeEEeeeee-------eEeehhhHHHHHhhh----HHH Confidence 0000 00 000011224444444444 444556999999993 356 Q ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_014036. 288 ADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIA 367 (522) Q Consensus 288 AEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~ 367 (522) -++.|.+-|+..|..-||+.|+.= ...- ....+..+++ ....++..... T Consensus 210 l~~~i~~~la~~~~~~~~~~il~g---~g~~---------~~~~~~~~~~-------------~i~~~~~~~l~------ 258 (395) T protein:vir:38 210 IIQWLVNWAAKKDVVTRNAKILEV---MGKA---------PKKPTISQFD-------------NIKDLENNTLD------ 258 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhc---cccc---------ccccccccHH-------------HHHHHHHHhhh------ Confidence 689999999999999999998831 1110 0111222111 12222221111 Q ss_pred HhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCcc-----ce-EEEE-- Q lcl|NC_014036. 368 RQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG-----DY-FTVG-- 439 (522) Q Consensus 368 r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy-~~vG-- 439 (522) . .+.....+||+|.....|..+- + +.+ ......+......++|.| ++|++....+. +. +++| T Consensus 259 ~--~~~~~a~~v~n~~~~~~L~~lk--d-~~G----~~l~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~i~~gd~ 328 (395) T protein:vir:38 259 P--AIESTSSFITNQSGYNILSKVK--D-ADG----RYLMQPDVTSPDKYLIDG-KPVIRIADKWLPDVSGSHPLYFGDL 328 (395) T ss_pred h--hhcCCCEEEEcHHHHHHHHHhh--c-cCC----ceeeccCcCCCCcceecc-ceeEEecccccCcCCCcceEEEEec Confidence 1 1113456899999998887441 1 111 011011111112246776 68887554221 11 2222 Q ss_pred -------EecCCCccceeEeecccccccccccCCccccceeeeeeeeccee-cC--c-----cccccCCccccccCcchH Q lcl|NC_014036. 440 -------YKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGVGI-NP--F-----ANSRSQAPSDRITSGMIT 504 (522) Q Consensus 440 -------~KG~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP--~-----~~~~~~~~~~~i~~g~~~ 504 (522) .+.. -.+=+.++. ..+-...+=.+-+..||++.+ +| | +...++.+. T Consensus 329 ~~~~~i~~~~~----~~i~~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~--------- 389 (395) T protein:vir:38 329 KQGITLFDRQQ----MQIDTTNVG------AGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQG--------- 389 (395) T ss_pred cccEEEEEecc----eEEEEeccc------cchhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCC--------- Confidence 1111 001111110 001122334455556666543 23 1 111222111 Q ss_pred Hhhccc Q lcl|NC_014036. 505 KEMFGK 510 (522) Q Consensus 505 ~~~~~~ 510 (522) .--.|| T Consensus 390 ~~~~~~ 395 (395) T protein:vir:38 390 TAGTGK 395 (395) T ss_pred ccCCCC Confidence 111234 No 102 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=26.79 E-value=1.9 Score=19.04 Aligned_cols=275 Identities=12% Similarity=0.079 Sum_probs=111.4 Q ss_pred cccccccccccCcchh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccchhccccccccccccccc Q lcl|NC_014036. 81 SGNSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGA 159 (522) Q Consensus 81 ~~t~tg~v~~~~P~Li-~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~ 159 (522) -.+.+|.+ .-|.+. .+++.+.++.+..+++.+.||++.. + +|.-.... +.+.| T Consensus 1 ma~~gG~l--ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-----~--~~p~~~~~------------~~a~~----- 54 (298) T protein:vir:94 1 MVLNKGTL--FDPELVTDLISKVAGKSSIARLSAQKPIPFNG-----E--KVFTFTMD------------SEIDV----- 54 (298) T ss_pred Ceeccccc--cChhHHHHHHHHHHhhchhhhhcceeeccCCc-----e--EEEEEecC------------cceEE----- Confidence 11222222 224443 4666666778888889888886532 1 11110000 00000 Q ss_pred ccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccchhhhhcc Q lcl|NC_014036. 160 APSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQE 239 (522) Q Consensus 160 ~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~ 239 (522) +++| |.. T Consensus 55 ------------------------------------------------------------------v~Eg------~~~- 61 (298) T protein:vir:94 55 ------------------------------------------------------------------VAES------GKK- 61 (298) T ss_pred ------------------------------------------------------------------eeCC------ccc- Confidence 0000 000 Q ss_pred ccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhcc--- Q lcl|NC_014036. 240 QFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMINYTA--- 316 (522) Q Consensus 240 ~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i~~~a--- 316 (522) ..+...|.++.|...|.. -....|-||.|+--. -..+-+++|.+-|...|...|+..++.-..... T Consensus 62 ---~~~~~~f~~v~l~~~k~~-------~~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~ 130 (298) T protein:vir:94 62 ---THGGVTLAPQTMVPIKVE-------YGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTA 130 (298) T ss_pred ---cccccceeEEEEeeeEEE-------EeeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcc Confidence 001122444445444444 456789998764321 113346677777777777777777773211000 Q ss_pred --ccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhccc Q lcl|NC_014036. 317 --QVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSG 394 (522) Q Consensus 317 --~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~ 394 (522) -.+..++..... ...... .....++.-+.++...+.. .+.+...+|++|.....|...- T Consensus 131 ~~~~~~~~~~~~~~---~~~~~~------------~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk-- 191 (298) T protein:vir:94 131 SAVIGTNHFDSKVT---QKVEAP------------RGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQK-- 191 (298) T ss_pred cccccccccccccc---cccccc------------cccccHHHHHHHHHHhhhh--cCCCccEEEEcHHHHHHHHHhh-- Confidence 000000000000 000000 0011222334444443333 1234567999999999887431 Q ss_pred ccccccccccccccccccceeEEEecCceEEEecCCCcc------ceEEEEEecCCCccceeEeecccccc--cccccCC Q lcl|NC_014036. 395 ITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARG------DYFTVGYKGDNEMDAGIYYAPYVALT--PLRGSDP 466 (522) Q Consensus 395 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~d~glfyaPYv~~~--~~~~~Dp 466 (522) +.. + .-....+.+. -..|+|.| ++|++++.-+. +.+++| +-. .++.|...-.+. ..+..|| T Consensus 192 d~~-G---~~l~~~~~~~-~~~~tl~G-~PV~~~~~v~~~~~~~~~~~~~G---dfs--~~~~~~~~~~~~~~~~~~~~~ 260 (298) T protein:vir:94 192 DLQ-G---NALFPELKWG-ATPDTING-LPVDVNKTVSDMSLTQRDRAIIG---DFA--NGFKWGYAKEVPLEVIQYGDP 260 (298) T ss_pred ccC-C---CeeecCcccC-CCCceecc-eeeEEecccccccCCCccEEEEe---ecc--ceEEEEEecCceEEEeecCCC Confidence 111 0 0000111111 12357877 69998886542 222222 111 112233221111 1111233 Q ss_pred cc-----cc-ceeee--eeeecce-ecCccccccCCccccccCcc Q lcl|NC_014036. 467 KN-----FQ-PVMGF--KTRYGVG-INPFANSRSQAPSDRITSGM 502 (522) Q Consensus 467 ~s-----~q-P~~~~--~tRY~l~-~nP~~~~~~~~~~~~i~~g~ 502 (522) +. || =.++| ..|+++. .+|=+ ..++.+.. T Consensus 261 d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a-------~~~l~~~t 298 (298) T protein:vir:94 261 DNSGLDLKGYNQVYIRAELFLGWGILDATK-------FARVTEAN 298 (298) T ss_pred cCcchhhhhcCcEEEEEEEEeccEeecccc-------eEEEEecC Confidence 21 22 12334 4577654 44411 12333232 No 103 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=25.84 E-value=2 Score=18.92 Aligned_cols=269 Identities=13% Similarity=0.046 Sum_probs=110.3 Q ss_pred ccccccccc--ccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccc Q lcl|NC_014036. 155 SGQGAAPSN--GFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMAT 232 (522) Q Consensus 155 SG~g~~~~~--~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~T 232 (522) ..+..+... -++...+..... . .....+ +..+... ...+ ... .|...++..-=.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~-------~-~~~~l~-~~~~~~~--------d~~l-----~g~-~G~tv~iP~~~~~ 57 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQA-------Q-LEKKLR-FASFAEV--------DSTL-----QGQ-PGDTLTFPAFVYS 57 (274) T ss_pred CCccceehhheechHHHHHHHHH-------h-hhhhhh-hccccee--------cccc-----cCC-CCCEEEEeeecCC Confidence 110000000 001000000000 0 000000 0000000 0000 000 1111111110001 Q ss_pred hhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_014036. 233 SVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMI 312 (522) Q Consensus 233 s~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i 312 (522) ..+|. ......-+..++.+ .+.+++.+-|+-.=+++=| ..+.+ +-|.-.|..+-++.-|..+++.+++..+ T Consensus 58 g~a~~---~~~g~~i~~~~lt~--~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l 128 (274) T protein:vir:97 58 GDAQV---VAEGEKIPTDILET--KKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEAL 128 (274) T ss_pred Ccccc---ccCCCccccccccc--ceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11221 11112223444443 3344444555522222222 23333 4578888999999999999999999777 Q ss_pred hhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhc Q lcl|NC_014036. 313 NYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID 392 (522) Q Consensus 313 ~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~ 392 (522) ...+.. +. +..+++ +-+-.+..++.++. ..+++++|+|.|++.|..-. T Consensus 129 ~~a~~~-~~---------~~~~~~-------------d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~ 176 (274) T protein:vir:97 129 MGAKLT-VN---------ADITKL-------------NGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDA 176 (274) T ss_pred hccCcc-cc---------ccccCH-------------HHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhh Confidence 554321 11 112222 22333344444332 25689999999999997532 Q ss_pred cc-ccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccccccc-ccCCcccc Q lcl|NC_014036. 393 SG-ITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLR-GSDPKNFQ 470 (522) Q Consensus 393 ~~-~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~-~~Dp~s~q 470 (522) .. +..++.. ++ .......+|.+.| ++||+|+..|..-..+--+| .+-|.---+. .++ .-||..+. T Consensus 177 ~~~f~~~s~~----g~-~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~-~vE~~Rd~~~~~ 243 (274) T protein:vir:97 177 STNFTRATEL----GD-DIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDF-FLEVARDASTKT 243 (274) T ss_pred hhhccccCcc----cc-cceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCc-eeccccchhhcc Confidence 11 1222211 11 1122334788876 79999999885332211122 2222111111 122 24888999 Q ss_pred ceeeeeeeecce-ecCccccccCCccccccCcchHHhh Q lcl|NC_014036. 471 PVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 471 P~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~ 507 (522) =.+-..-+||+. .||=- -..+..+.-.-.| T Consensus 244 d~i~~~~~y~~~~~~~~~-------vv~~t~~~~~~~~ 274 (274) T protein:vir:97 244 TALYSDKHYVAYLYDESK-------AVKITKGSGSLEM 274 (274) T ss_pred cEEEEEEEEEEEEEcCCc-------eEEEecCcccccC Confidence 999888999875 35500 0011111100111 No 104 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=25.84 E-value=2 Score=18.92 Aligned_cols=269 Identities=13% Similarity=0.046 Sum_probs=110.3 Q ss_pred ccccccccc--ccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccccccccccccc Q lcl|NC_014036. 155 SGQGAAPSN--GFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGTLAEISYGMAT 232 (522) Q Consensus 155 SG~g~~~~~--~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~T 232 (522) ..+..+... -++...+..... . .....+ +..+... ...+ ... .|...++..-=.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~-------~-~~~~l~-~~~~~~~--------d~~l-----~g~-~G~tv~iP~~~~~ 57 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQA-------Q-LEKKLR-FASFAEV--------DSTL-----QGQ-PGDTLTFPAFVYS 57 (274) T ss_pred CCccceehhheechHHHHHHHHH-------h-hhhhhh-hccccee--------cccc-----cCC-CCCEEEEeeecCC Confidence 110000000 001000000000 0 000000 0000000 0000 000 1111111110001 Q ss_pred hhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_014036. 233 SVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDMI 312 (522) Q Consensus 233 s~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEImlEINReii~~i 312 (522) ..+|. ......-+..++.+ .+.+++.+-|+-.=+++=| ..+.+ +-|.-.|..+-++.-|..+++.+++..+ T Consensus 58 g~a~~---~~~g~~i~~~~lt~--~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l 128 (274) T protein:vir:94 58 GDAQV---VAEGEKIPTDILET--KKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEAL 128 (274) T ss_pred Ccccc---ccCCCccccccccc--ceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11221 11112223444443 3344444555522222222 23333 4578888999999999999999999777 Q ss_pred hhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEchhHHHHHhhhc Q lcl|NC_014036. 313 NYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARID 392 (522) Q Consensus 313 ~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S~~va~~L~~~~ 392 (522) ...+.. +. +..+++ +-+-.+..++.++. ..+++++|+|.|++.|..-. T Consensus 129 ~~a~~~-~~---------~~~~~~-------------d~i~dA~~~l~d~~---------~~~~~ivv~p~~~~~L~k~~ 176 (274) T protein:vir:94 129 MGAKLT-VN---------ADITKL-------------NGLQSAIDKFNDED---------LEPMVLFVNPLDAGKLRGDA 176 (274) T ss_pred hccCcc-cc---------ccccCH-------------HHHHHHHHHhhccC---------CCceEEEeCHHHHHHHHhhh Confidence 554321 11 112222 22333344444332 25689999999999997532 Q ss_pred cc-ccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccccccc-ccCCcccc Q lcl|NC_014036. 393 SG-ITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVALTPLR-GSDPKNFQ 470 (522) Q Consensus 393 ~~-~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~~~~~-~~Dp~s~q 470 (522) .. +..++.. ++ .......+|.+.| ++||+|+..|..-..+--+| .+-|.---+. .++ .-||..+. T Consensus 177 ~~~f~~~s~~----g~-~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~-~vE~~Rd~~~~~ 243 (274) T protein:vir:94 177 STNFTRATEL----GD-DIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDF-FLEVARDASTKT 243 (274) T ss_pred hhhccccCcc----cc-cceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCc-eeccccchhhcc Confidence 11 1222211 11 1122334788876 79999999885332211122 2222111111 122 24888999 Q ss_pred ceeeeeeeecce-ecCccccccCCccccccCcchHHhh Q lcl|NC_014036. 471 PVMGFKTRYGVG-INPFANSRSQAPSDRITSGMITKEM 507 (522) Q Consensus 471 P~~~~~tRY~l~-~nP~~~~~~~~~~~~i~~g~~~~~~ 507 (522) =.+-..-+||+. .||=- -..+..+.-.-.| T Consensus 244 d~i~~~~~y~~~~~~~~~-------vv~~t~~~~~~~~ 274 (274) T protein:vir:94 244 TALYSDKHYVAYLYDESK-------AVKITKGSGSLEM 274 (274) T ss_pred cEEEEEEEEEEEEEcCCc-------eEEEecCcccccC Confidence 999888999875 35500 0011111100111 No 105 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=22.02 E-value=2.5 Score=18.39 Aligned_cols=349 Identities=14% Similarity=0.048 Sum_probs=113.1 Q ss_pred Ccch--HHHHHhhhhh-------hcc---ccchhhhcchhh------hHHHH---HHhhhHHHHhhhhhhhcchhhhhhh Q lcl|NC_014036. 1 MSKK--NELMEKWNDL-------LES---QEGLPDIATKSK------KQLIA---AIMEAQEKDAEVDPVYRDEKIVESF 59 (522) Q Consensus 1 ~~~~--~~l~~kw~p~-------l~~---~~~~~~i~~~~~------~~~~~---~~~enq~~~~~~~~~~~~~~~~~~~ 59 (522) |... ++|.||=+-+ ++. ++=..++..... +.+-+ +.+|.+++...... .+.... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~-----~~~~~~ 75 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTS-----LLSGLQ 75 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcccC Confidence 4322 2333332222 111 000011111110 01111 11111111000000 000000 Q ss_pred cccccccc---cc--------cccccc-----ccccccccccccccccCcchh-hHHHHHHh-hhhhhhceeeccCCchh Q lcl|NC_014036. 60 GGFLAEAE---IA--------GDHGYD-----ATKIASGNSSGAITNIGPAVI-GMVRRAIP-NLIAFDICGVQPMTGPT 121 (522) Q Consensus 60 ~~~l~ea~---~~--------~~~g~~-----~~~~~~~t~tg~v~~~~P~Li-~l~Rra~~-~lI~~DI~GVQPmTGPT 121 (522) +... +.+ .. +..+.. ......+|++++-...-|.+. .++.+... ..+...++-|-|+++.. T Consensus 76 ~~~~-~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~ 154 (392) T protein:vir:13 76 GSGS-GAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDAN 154 (392) T ss_pred Cccc-chhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCc Confidence 0000 000 00 000000 000000111111000001110 11111111 11111222222211110 Q ss_pred hhheeeeeeecCCCCCCcccchhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccc Q lcl|NC_014036. 122 GQVFALRAVYGKDPLASGAKEAFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVT 201 (522) Q Consensus 122 GLIFAMRSrY~~~~~~t~~~eA~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~ 201 (522) .+-+ |.. T Consensus 155 ~~~~-------------------------------------------------------------------------~~~ 161 (392) T protein:vir:13 155 PMDF-------------------------------------------------------------------------TVI 161 (392) T ss_pred eeEE-------------------------------------------------------------------------EEE Confidence 0000 000 Q ss_pred cCCCCcccccccccccccccccccccccccchhhhhccccCCCCCccccccceEEEEEEEEEecccccchhhHHHHHHHH Q lcl|NC_014036. 202 VTGSTDDALDAAVIAEQEKGTLAEISYGMATSVAELQEQFNGSTGNPWNEMGFRIDKQVIEARSRQLKAQYSVELAQDLR 281 (522) Q Consensus 202 ~tgt~~~~~~~~~~~~~~~g~~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMsFsIEK~TVtAKSRALKAEYT~ELAQDLK 281 (522) ++ .+-....+| +..+++-...+++++...+..+-...+|-||.+|= T Consensus 162 -~~-----------------------~~~a~~v~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds- 207 (392) T protein:vir:13 162 -TG-----------------------RATAGIVGE---------TAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQ- 207 (392) T ss_pred -cC-----------------------Ccceeeecc---------cccccccccceeeEEeeeeeEEeeehhHHHHHhcc- Confidence 00 000001122 12244555556666666666666778999999982 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHH Q lcl|NC_014036. 282 AVHGMDADAELSAILATEIMLEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDK 361 (522) Q Consensus 282 AiHGLDAEaELsNILSTEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~ 361 (522) ..|.++.|.+-|...|..-+|..||. -...-. -.|++......+.. .-|+ ....-.+..|.+ T Consensus 208 ---~~~l~~~i~~~l~~~i~~~~d~~~l~---G~Gt~~----------p~Gil~~~~~~~~~-~~~~-~~~~~~~d~l~~ 269 (392) T protein:vir:13 208 ---VLDLVGFLVSDAGPAIGDAMGRHFLT---GTGTGQ----------PRGILTDATGANAA-FGEA-DADSKVSDALID 269 (392) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHHhc---ccCCcc----------cccccccccccccc-cccc-ccccccHHHHHH Confidence 46789999999999999999999882 111000 01222111100000 0000 000001112222 Q ss_pred HHHHHHHhccccCCcEEEEchhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEe Q lcl|NC_014036. 362 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYK 441 (522) Q Consensus 362 ~an~I~r~T~~g~gn~~v~S~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K 441 (522) +-+.+... +.++...|++|.....|..+ .+ +.+...-.....+.+. ++|.| ++||++.+.|.+-|++|-- T Consensus 270 ~~~~l~~~--~~~~a~~v~n~~~~~~l~~l--kd-~~G~~l~~~~~~~g~~----~~l~G-~Pv~~~~~~~~~~i~~Gdf 339 (392) T protein:vir:13 270 LFHEVPSA--YRKNAKFVVNDLRAAQMRKL--KD-ANGQYLWQSALTVGAP----DTFNG-KVVETDDGMPADKVLFADL 339 (392) T ss_pred HHHhhhhh--hhcCCEEEEcHHHHHHHHHh--hc-cCCceeecCCcCCCCC----ceecc-eeeEEcCCCCCCcEEEeec Confidence 22233222 22344578899988888643 11 1110000010111111 36776 7999999998766655421 Q ss_pred cCCCccceeEeecccccccccccCCccccceee--eeeeecc-eecCccccccCCccccccCcchHHhhccccceeeeee Q lcl|NC_014036. 442 GDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMG--FKTRYGV-GINPFANSRSQAPSDRITSGMITKEMFGKNAYFRKVY 518 (522) Q Consensus 442 G~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~~--~~tRY~l-~~nP~~~~~~~~~~~~i~~g~~~~~~~~~~~~~r~~~ 518 (522) +. .++.---.....+..|+..-...++ ...|++. ..||-+.. .+. T Consensus 340 --~~----~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~--------------------------~~~ 387 (392) T protein:vir:13 340 --SK----YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAK--------------------------VLT 387 (392) T ss_pred --cc----eeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceE--------------------------EEE Confidence 00 1111111111111123222122223 3344433 23442211 111 Q ss_pred eccC Q lcl|NC_014036. 519 VKGL 522 (522) Q Consensus 519 Vk~~ 522 (522) ||-= T Consensus 388 ~~~a 391 (392) T protein:vir:13 388 VTPA 391 (392) T ss_pred eecc Confidence 1111 No 106 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=325 Identities=13% Similarity=0.078 Sum_probs=124.4 Q ss_pred CcchHHHHHhhhhhhccc--c---------c---hhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQ--E---------G---LPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEA 66 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~--~---------~---~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (522) +.+.+.|.++....-+-. + . ..+-...+|+. ....|.+++..-.+ +.+.... .|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~-~~~~~~~---------~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE-REFLEDD---------LEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH-HHHHhhh---------hhh Confidence 333334444433211100 0 0 00111122222 22222222110000 0000000 000 Q ss_pred cccccccccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccc Q lcl|NC_014036. 67 EIAGDHGYDATKIASGNS-SGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKE 142 (522) Q Consensus 67 ~~~~~~g~~~~~~~~~t~-tg~v~---~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~e 142 (522) .....+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+. ..+..+.. T Consensus 104 ----------~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------- 161 (392) T protein:vir:10 104 ----------RAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------- 161 (392) T ss_pred ----------hhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCCc------- Confidence 00111111 12111 12233 3444445666778999999998876421 11111000 Q ss_pred hhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccc Q lcl|NC_014036. 143 AFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGT 222 (522) Q Consensus 143 A~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~ 222 (522) ...| T Consensus 162 --------~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 162 --------PFAE-------------------------------------------------------------------- 165 (392) T ss_pred --------ccee-------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_014036. 223 LAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIM 301 (522) Q Consensus 223 ~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEIm 301 (522) .+|. ...++-. -++++++..++.-+-...+|-||.+|- ..|.+++|.+-|...|. T Consensus 166 -----------v~E~---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~ 221 (392) T protein:vir:10 166 -----------ITEM---------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSK 221 (392) T ss_pred -----------eccc---------ccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHH Confidence 0010 0011111 134444444555555567999999994 35679999999999999 Q ss_pred HHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEc Q lcl|NC_014036. 302 LEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIAS 381 (522) Q Consensus 302 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S 381 (522) .-+|..|+.-..... ..+...++ -...++... . ...+-..-..|++ T Consensus 222 ~~~d~~~~~g~g~~~-------------~~~~~~~d-------------~i~~~~~~~--l------~~~~~~~a~~vm~ 267 (392) T protein:vir:10 222 VTRNVLILGVIEKLT-------------KQAIKSLD-------------DIKDVLNVK--L------DPAISPNAILLTN 267 (392) T ss_pred HHHHHHHhhcccccc-------------ccCccCHH-------------HHHHHHHHh--h------hhhhccCCEEEEc Confidence 999999883222111 12222221 122222111 1 1122233557899 Q ss_pred hhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccc--- Q lcl|NC_014036. 382 RNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVAL--- 458 (522) Q Consensus 382 ~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~--- 458 (522) |.....|..+- +... +-....+.+ ....++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 268 ~~~~~~L~~lk--d~~G----~~l~~~~~~-~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i 335 (392) T protein:vir:10 268 QDGFNYLDKLK--DKDG----KYILQSDPT-QKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVL 335 (392) T ss_pred HHHHHHHHHhh--ccCC----CeEeecCcc-CCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEE Confidence 99999987541 1100 000011111 12235677765666543321 111122222222333322110 Q ss_pred ---cccc-ccCC------ccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhcc Q lcl|NC_014036. 459 ---TPLR-GSDP------KNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFG 509 (522) Q Consensus 459 ---~~~~-~~Dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~ 509 (522) ..+. .+++ .+.+=.+-...|++..+ +|-+...-. +....+...-+| T Consensus 336 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 336 FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 0000 1122 23445566677777543 342211100 001111111222 No 107 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=325 Identities=13% Similarity=0.078 Sum_probs=124.4 Q ss_pred CcchHHHHHhhhhhhccc--c---------c---hhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQ--E---------G---LPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEA 66 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~--~---------~---~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (522) +.+.+.|.++....-+-. + . ..+-...+|+. ....|.+++..-.+ +.+.... .|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~-~~~~~~~---------~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE-REFLEDD---------LEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH-HHHHhhh---------hhh Confidence 333334444433211100 0 0 00111122222 22222222110000 0000000 000 Q ss_pred cccccccccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccc Q lcl|NC_014036. 67 EIAGDHGYDATKIASGNS-SGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKE 142 (522) Q Consensus 67 ~~~~~~g~~~~~~~~~t~-tg~v~---~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~e 142 (522) .....+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+. ..+..+.. T Consensus 104 ----------~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------- 161 (392) T protein:vir:10 104 ----------RAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------- 161 (392) T ss_pred ----------hhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCCc------- Confidence 00111111 12111 12233 3444445666778999999998876421 11111000 Q ss_pred hhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccc Q lcl|NC_014036. 143 AFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGT 222 (522) Q Consensus 143 A~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~ 222 (522) ...| T Consensus 162 --------~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 162 --------PFAE-------------------------------------------------------------------- 165 (392) T ss_pred --------ccee-------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_014036. 223 LAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIM 301 (522) Q Consensus 223 ~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEIm 301 (522) .+|. ...++-. -++++++..++.-+-...+|-||.+|- ..|.+++|.+-|...|. T Consensus 166 -----------v~E~---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~ 221 (392) T protein:vir:10 166 -----------ITEM---------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSK 221 (392) T ss_pred -----------eccc---------ccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHH Confidence 0010 0011111 134444444555555567999999994 35679999999999999 Q ss_pred HHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEc Q lcl|NC_014036. 302 LEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIAS 381 (522) Q Consensus 302 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S 381 (522) .-+|..|+.-..... ..+...++ -...++... . ...+-..-..|++ T Consensus 222 ~~~d~~~~~g~g~~~-------------~~~~~~~d-------------~i~~~~~~~--l------~~~~~~~a~~vm~ 267 (392) T protein:vir:10 222 VTRNVLILGVIEKLT-------------KQAIKSLD-------------DIKDVLNVK--L------DPAISPNAILLTN 267 (392) T ss_pred HHHHHHHhhcccccc-------------ccCccCHH-------------HHHHHHHHh--h------hhhhccCCEEEEc Confidence 999999883222111 12222221 122222111 1 1122233557899 Q ss_pred hhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccc--- Q lcl|NC_014036. 382 RNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVAL--- 458 (522) Q Consensus 382 ~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~--- 458 (522) |.....|..+- +... +-....+.+ ....++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 268 ~~~~~~L~~lk--d~~G----~~l~~~~~~-~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i 335 (392) T protein:vir:10 268 QDGFNYLDKLK--DKDG----KYILQSDPT-QKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVL 335 (392) T ss_pred HHHHHHHHHhh--ccCC----CeEeecCcc-CCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEE Confidence 99999987541 1100 000011111 12235677765666543321 111122222222333322110 Q ss_pred ---cccc-ccCC------ccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhcc Q lcl|NC_014036. 459 ---TPLR-GSDP------KNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFG 509 (522) Q Consensus 459 ---~~~~-~~Dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~ 509 (522) ..+. .+++ .+.+=.+-...|++..+ +|-+...-. +....+...-+| T Consensus 336 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 336 FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 0000 1122 23445566677777543 342211100 001111111222 No 108 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=325 Identities=13% Similarity=0.078 Sum_probs=124.4 Q ss_pred CcchHHHHHhhhhhhccc--c---------c---hhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQ--E---------G---LPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEA 66 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~--~---------~---~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (522) +.+.+.|.++....-+-. + . ..+-...+|+. ....|.+++..-.+ +.+.... .|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~-~~~~~~~---------~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE-REFLEDD---------LEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH-HHHHhhh---------hhh Confidence 333334444433211100 0 0 00111122222 22222222110000 0000000 000 Q ss_pred cccccccccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccc Q lcl|NC_014036. 67 EIAGDHGYDATKIASGNS-SGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKE 142 (522) Q Consensus 67 ~~~~~~g~~~~~~~~~t~-tg~v~---~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~e 142 (522) .....+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+. ..+..+.. T Consensus 104 ----------~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------- 161 (392) T protein:vir:10 104 ----------RAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------- 161 (392) T ss_pred ----------hhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCCc------- Confidence 00111111 12111 12233 3444445666778999999998876421 11111000 Q ss_pred hhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccc Q lcl|NC_014036. 143 AFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGT 222 (522) Q Consensus 143 A~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~ 222 (522) ...| T Consensus 162 --------~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 162 --------PFAE-------------------------------------------------------------------- 165 (392) T ss_pred --------ccee-------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_014036. 223 LAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIM 301 (522) Q Consensus 223 ~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEIm 301 (522) .+|. ...++-. -++++++..++.-+-...+|-||.+|- ..|.+++|.+-|...|. T Consensus 166 -----------v~E~---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~ 221 (392) T protein:vir:10 166 -----------ITEM---------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSK 221 (392) T ss_pred -----------eccc---------ccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHH Confidence 0010 0011111 134444444555555567999999994 35679999999999999 Q ss_pred HHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEc Q lcl|NC_014036. 302 LEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIAS 381 (522) Q Consensus 302 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S 381 (522) .-+|..|+.-..... ..+...++ -...++... . ...+-..-..|++ T Consensus 222 ~~~d~~~~~g~g~~~-------------~~~~~~~d-------------~i~~~~~~~--l------~~~~~~~a~~vm~ 267 (392) T protein:vir:10 222 VTRNVLILGVIEKLT-------------KQAIKSLD-------------DIKDVLNVK--L------DPAISPNAILLTN 267 (392) T ss_pred HHHHHHHhhcccccc-------------ccCccCHH-------------HHHHHHHHh--h------hhhhccCCEEEEc Confidence 999999883222111 12222221 122222111 1 1122233557899 Q ss_pred hhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccc--- Q lcl|NC_014036. 382 RNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVAL--- 458 (522) Q Consensus 382 ~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~--- 458 (522) |.....|..+- +... +-....+.+ ....++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 268 ~~~~~~L~~lk--d~~G----~~l~~~~~~-~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i 335 (392) T protein:vir:10 268 QDGFNYLDKLK--DKDG----KYILQSDPT-QKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVL 335 (392) T ss_pred HHHHHHHHHhh--ccCC----CeEeecCcc-CCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEE Confidence 99999987541 1100 000011111 12235677765666543321 111122222222333322110 Q ss_pred ---cccc-ccCC------ccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhcc Q lcl|NC_014036. 459 ---TPLR-GSDP------KNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFG 509 (522) Q Consensus 459 ---~~~~-~~Dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~ 509 (522) ..+. .+++ .+.+=.+-...|++..+ +|-+...-. +....+...-+| T Consensus 336 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 336 FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 0000 1122 23445566677777543 342211100 001111111222 No 109 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=325 Identities=13% Similarity=0.078 Sum_probs=124.4 Q ss_pred CcchHHHHHhhhhhhccc--c---------c---hhhhcchhhhHHHHHHhhhHHHHhhhhhhhcchhhhhhhccccccc Q lcl|NC_014036. 1 MSKKNELMEKWNDLLESQ--E---------G---LPDIATKSKKQLIAAIMEAQEKDAEVDPVYRDEKIVESFGGFLAEA 66 (522) Q Consensus 1 ~~~~~~l~~kw~p~l~~~--~---------~---~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 66 (522) +.+.+.|.++....-+-. + . ..+-...+|+. ....|.+++..-.+ +.+.... .|. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~-~~~~~~~---------~~~ 103 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE-REFLEDD---------LEQ 103 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH-HHHHhhh---------hhh Confidence 333334444433211100 0 0 00111122222 22222222110000 0000000 000 Q ss_pred cccccccccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCCcccc Q lcl|NC_014036. 67 EIAGDHGYDATKIASGNS-SGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLASGAKE 142 (522) Q Consensus 67 ~~~~~~g~~~~~~~~~t~-tg~v~---~~~P~Li~l~Rra~~~lI~~DI~GVQPmTGPTGLIFAMRSrY~~~~~~t~~~e 142 (522) .....+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+. ..+..+.. T Consensus 104 ----------~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~~~------- 161 (392) T protein:vir:10 104 ----------RAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRV--LEKNSDMI------- 161 (392) T ss_pred ----------hhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEE--EEeecCCc------- Confidence 00111111 12111 12233 3444445666778999999998876421 11111000 Q ss_pred hhcccccccccccccccccccccccccccccccccceeecccccccceeeeeccccccccCCCCcccccccccccccccc Q lcl|NC_014036. 143 AFHPMFSPDSMYSGQGAAPSNGFTKLTSAQAIADGAIVFHDFVETGRVFLQNVSGAPVTVTGSTDDALDAAVIAEQEKGT 222 (522) Q Consensus 143 A~~~~~Eadt~fSG~g~~~~~~~~~~~~~~~~a~g~~a~~~~~~~g~~~~~~~~~~p~~~tgt~~~~~~~~~~~~~~~g~ 222 (522) ...| T Consensus 162 --------~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 162 --------PFAE-------------------------------------------------------------------- 165 (392) T ss_pred --------ccee-------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccchhhhhccccCCCCCccccccc-eEEEEEEEEEecccccchhhHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_014036. 223 LAEISYGMATSVAELQEQFNGSTGNPWNEMG-FRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIM 301 (522) Q Consensus 223 ~~~~g~Gm~Ts~aEal~~lggs~~~~f~EMs-FsIEK~TVtAKSRALKAEYT~ELAQDLKAiHGLDAEaELsNILSTEIm 301 (522) .+|. ...++-. -++++++..++.-+-...+|-||.+|- ..|.+++|.+-|...|. T Consensus 166 -----------v~E~---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~ 221 (392) T protein:vir:10 166 -----------ITEM---------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSK 221 (392) T ss_pred -----------eccc---------ccccccccccceeEEeeeeeEEEeehhhHHHHhhh----HHHHHHHHHHHHHHHHH Confidence 0010 0011111 134444444555555567999999994 35679999999999999 Q ss_pred HHhhHHHHhhhhhccccccccccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhccccCCcEEEEc Q lcl|NC_014036. 302 LEINREIVDMINYTAQVGKTGFTQTVGSKAGAFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIAS 381 (522) Q Consensus 302 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~fd~~~~~d~~~~r~~~E~~r~L~~~i~~~an~I~r~T~~g~gn~~v~S 381 (522) .-+|..|+.-..... ..+...++ -...++... . ...+-..-..|++ T Consensus 222 ~~~d~~~~~g~g~~~-------------~~~~~~~d-------------~i~~~~~~~--l------~~~~~~~a~~vm~ 267 (392) T protein:vir:10 222 VTRNVLILGVIEKLT-------------KQAIKSLD-------------DIKDVLNVK--L------DPAISPNAILLTN 267 (392) T ss_pred HHHHHHHhhcccccc-------------ccCccCHH-------------HHHHHHHHh--h------hhhhccCCEEEEc Confidence 999999883222111 12222221 122222111 1 1122233557899 Q ss_pred hhHHHHHhhhcccccccccccccccccccccceeEEEecCceEEEecCCCccceEEEEEecCCCccceeEeeccccc--- Q lcl|NC_014036. 382 RNVVSALARIDSGITPAGQGLQKTLNVDTTKAVFAGVLGGVYKVYIDQYARGDYFTVGYKGDNEMDAGIYYAPYVAL--- 458 (522) Q Consensus 382 ~~va~~L~~~~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~glfyaPYv~~--- 458 (522) |.....|..+- +... +-....+.+ ....++|.|...|+++.... ++.+|...-+..++|+.+-.+ T Consensus 268 ~~~~~~L~~lk--d~~G----~~l~~~~~~-~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i 335 (392) T protein:vir:10 268 QDGFNYLDKLK--DKDG----KYILQSDPT-QKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVL 335 (392) T ss_pred HHHHHHHHHhh--ccCC----CeEeecCcc-CCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEE Confidence 99999987541 1100 000011111 12235677765666543321 111122222222333322110 Q ss_pred ---cccc-ccCC------ccccceeeeeeeeccee-cCccccccCCccccccCcchHHhhcc Q lcl|NC_014036. 459 ---TPLR-GSDP------KNFQPVMGFKTRYGVGI-NPFANSRSQAPSDRITSGMITKEMFG 509 (522) Q Consensus 459 ---~~~~-~~Dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~i~~g~~~~~~~~ 509 (522) ..+. .+++ .+.+=.+-...|++..+ +|-+...-. +....+...-+| T Consensus 336 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 336 FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 0000 1122 23445566677777543 342211100 001111111222 Done!