Query lcl|NC_012740.1_cdsid_YP_002922237.1 [gene=23] [protein=major capsid protein] [protein_id=YP_002922237.1] [location=97956..99542]
Match_columns 528
No_of_seqs 173 out of 432
Neff 5.3
Searched_HMMs 1612
Date Thu Nov 7 15:42:24 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_165 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_165_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:80986 Length: 528 100.0 2E-265 1E-268 1471.6 39.1 528 1-528 1-528 (528)
2 protein:vir:6601 Length: 528 # 100.0 9E-262 6E-265 1451.6 38.9 528 1-528 1-528 (528)
3 protein:vir:6901 Length: 522 # 100.0 9E-256 6E-259 1418.9 36.6 518 1-528 4-522 (522)
4 protein:vir:100603 Length: 529 100.0 2E-255 1E-258 1417.1 36.8 527 1-528 2-529 (529)
5 protein:vir:98143 Length: 524 100.0 3E-253 2E-256 1405.2 36.7 517 1-528 1-524 (524)
6 protein:vir:101039 Length: 529 100.0 1E-252 6E-256 1402.1 36.7 527 1-528 2-529 (529)
7 protein:vir:101811 Length: 529 100.0 3E-252 2E-255 1399.9 37.3 527 1-528 2-529 (529)
8 protein:vir:103463 Length: 521 100.0 6E-251 4E-254 1392.5 37.3 517 1-528 3-521 (521)
9 protein:vir:7214 Length: 521 # 100.0 9E-251 5E-254 1391.5 37.2 517 1-528 3-521 (521)
10 protein:vir:106286 Length: 534 100.0 1E-250 9E-254 1390.2 37.5 518 2-528 1-534 (534)
11 protein:vir:107947 Length: 519 100.0 7E-249 4E-252 1381.0 36.4 518 2-528 1-519 (519)
12 protein:vir:5670 Length: 514 # 100.0 5E-239 3E-242 1327.1 34.8 508 5-528 1-514 (514)
13 protein:vir:104915 Length: 470 100.0 1E-224 6E-228 1248.6 34.1 460 1-528 3-469 (470)
14 protein:vir:106998 Length: 468 100.0 2E-222 1E-225 1236.6 34.5 460 1-528 1-467 (468)
15 protein:vir:104549 Length: 462 100.0 9E-220 6E-223 1221.4 33.7 454 2-528 1-461 (462)
16 protein:vir:103181 Length: 457 100.0 6E-215 4E-218 1195.1 34.3 449 2-528 1-456 (457)
17 protein:vir:5942 Length: 523 # 100.0 6E-196 4E-199 1090.9 32.0 451 1-528 1-521 (523)
18 protein:vir:1886 Length: 385 # 95.6 0.0019 1.2E-06 35.6 19.4 348 1-508 18-385 (385)
19 protein:vir:191 Length: 385 # 95.6 0.0019 1.2E-06 35.6 19.4 348 1-508 18-385 (385)
20 protein:vir:81227 Length: 413 95.4 0.0021 1.3E-06 35.3 20.4 353 1-528 2-410 (413)
21 protein:vir:79987 Length: 415 95.4 0.0022 1.4E-06 35.1 16.4 358 1-514 1-415 (415)
22 protein:vir:81100 Length: 415 95.4 0.0022 1.4E-06 35.1 16.4 358 1-514 1-415 (415)
23 protein:vir:98339 Length: 415 95.4 0.0022 1.4E-06 35.1 16.4 358 1-514 1-415 (415)
24 protein:vir:4953 Length: 397 # 94.8 0.0035 2.2E-06 34.1 16.5 329 1-514 1-397 (397)
25 protein:vir:81160 Length: 371 94.1 0.0054 3.4E-06 33.0 19.2 327 1-502 22-371 (371)
26 protein:vir:78523 Length: 338 93.4 0.0076 4.7E-06 32.2 16.7 311 59-512 1-338 (338)
27 protein:vir:41 Length: 299 # N 93.3 0.0079 4.9E-06 32.1 18.4 276 71-515 1-299 (299)
28 protein:vir:9410 Length: 415 # 93.3 0.0081 5E-06 32.1 18.2 362 1-514 1-415 (415)
29 protein:vir:104256 Length: 458 92.6 0.011 6.8E-06 31.3 17.6 349 1-517 81-458 (458)
30 protein:vir:4830 Length: 397 # 92.3 0.012 7.3E-06 31.2 18.0 333 1-516 1-397 (397)
31 protein:vir:100135 Length: 418 91.8 0.014 8.7E-06 30.8 17.8 352 1-515 21-418 (418)
32 protein:vir:105905 Length: 304 91.2 0.017 1E-05 30.3 15.8 280 58-508 1-304 (304)
33 protein:vir:94142 Length: 304 91.2 0.017 1E-05 30.3 15.8 280 58-508 1-304 (304)
34 protein:vir:3033 Length: 272 # 91.0 0.018 1.1E-05 30.2 16.5 270 146-517 1-272 (272)
35 protein:vir:9820 Length: 272 # 91.0 0.018 1.1E-05 30.2 16.5 270 146-517 1-272 (272)
36 protein:vir:4997 Length: 397 # 91.0 0.018 1.1E-05 30.2 19.9 328 1-516 1-397 (397)
37 protein:vir:4700 Length: 415 # 90.7 0.02 1.2E-05 29.9 18.2 357 1-514 1-415 (415)
38 protein:vir:4600 Length: 415 # 90.7 0.02 1.2E-05 29.9 18.2 357 1-514 1-415 (415)
39 protein:vir:6212 Length: 434 # 90.5 0.02 1.2E-05 29.9 10.3 344 1-515 30-434 (434)
40 protein:vir:96123 Length: 274 89.7 0.025 1.5E-05 29.4 15.4 273 146-519 1-274 (274)
41 protein:vir:10364 Length: 390 89.3 0.027 1.7E-05 29.2 20.1 342 1-514 30-390 (390)
42 protein:vir:8420 Length: 477 # 89.2 0.028 1.7E-05 29.1 20.4 368 1-514 67-477 (477)
43 protein:vir:2344 Length: 397 # 88.9 0.029 1.8E-05 29.0 17.7 306 71-528 1-329 (397)
44 protein:vir:78223 Length: 333 88.4 0.032 2E-05 28.8 16.6 302 32-500 1-333 (333)
45 protein:vir:94673 Length: 419 88.0 0.035 2.2E-05 28.6 19.7 355 1-505 32-419 (419)
46 protein:vir:9574 Length: 300 # 87.6 0.038 2.3E-05 28.4 18.3 282 78-527 1-300 (300)
47 protein:vir:4856 Length: 293 # 87.6 0.038 2.4E-05 28.4 17.6 259 54-511 1-293 (293)
48 protein:vir:95763 Length: 297 87.1 0.041 2.5E-05 28.2 15.7 277 68-510 1-297 (297)
49 protein:vir:1268 Length: 397 # 86.8 0.043 2.7E-05 28.1 16.6 328 1-503 39-397 (397)
50 protein:vir:93742 Length: 274 86.3 0.047 2.9E-05 27.9 15.6 273 146-519 1-274 (274)
51 protein:vir:4339 Length: 395 # 84.5 0.06 3.7E-05 27.3 19.5 350 1-514 1-395 (395)
52 protein:vir:9759 Length: 303 # 83.8 0.066 4.1E-05 27.1 16.4 281 78-509 1-303 (303)
53 protein:vir:99920 Length: 311 83.7 0.066 4.1E-05 27.0 19.3 285 78-509 1-311 (311)
54 protein:vir:104085 Length: 320 83.0 0.072 4.5E-05 26.9 16.5 289 43-502 1-320 (320)
55 protein:vir:4092 Length: 390 # 82.5 0.077 4.8E-05 26.7 18.6 351 1-508 1-390 (390)
56 protein:vir:100247 Length: 425 80.6 0.093 5.8E-05 26.2 19.7 333 1-503 65-425 (425)
57 protein:vir:9704 Length: 394 # 79.0 0.11 6.8E-05 25.9 18.9 338 1-514 30-394 (394)
58 protein:vir:739 Length: 231 # 77.4 0.13 7.8E-05 25.5 11.8 218 199-528 1-231 (231)
59 protein:vir:4226 Length: 326 # 77.0 0.13 8E-05 25.5 18.9 306 43-517 1-326 (326)
60 protein:vir:8187 Length: 311 # 75.4 0.15 9.1E-05 25.1 19.1 288 79-510 1-311 (311)
61 protein:vir:6242 Length: 390 # 75.3 0.15 9.2E-05 25.1 15.5 351 1-511 4-390 (390)
62 protein:vir:80376 Length: 435 74.2 0.16 0.0001 24.9 18.1 348 1-512 42-435 (435)
63 protein:vir:3870 Length: 400 # 73.2 0.17 0.00011 24.8 18.6 323 1-511 41-400 (400)
64 protein:vir:97148 Length: 324 72.2 0.19 0.00012 24.6 15.7 295 44-504 1-324 (324)
65 protein:vir:96262 Length: 274 68.7 0.23 0.00014 24.0 15.4 271 146-513 1-274 (274)
66 protein:vir:95898 Length: 274 68.7 0.23 0.00014 24.0 15.4 271 146-513 1-274 (274)
67 protein:vir:2430 Length: 318 # 68.7 0.23 0.00015 24.0 18.6 289 43-515 1-318 (318)
68 protein:vir:96762 Length: 632 68.1 0.24 0.00015 24.0 19.6 334 1-499 238-632 (632)
69 protein:vir:2504 Length: 305 # 68.1 0.24 0.00015 24.0 18.4 285 79-517 1-305 (305)
70 protein:vir:4456 Length: 401 # 67.2 0.26 0.00016 23.8 15.2 352 1-528 1-401 (401)
71 protein:vir:97433 Length: 274 66.0 0.27 0.00017 23.7 16.1 273 146-513 1-274 (274)
72 protein:vir:94494 Length: 274 66.0 0.27 0.00017 23.7 16.1 273 146-513 1-274 (274)
73 protein:vir:7771 Length: 330 # 65.0 0.29 0.00018 23.5 18.2 289 58-501 1-330 (330)
74 protein:vir:1433 Length: 435 # 64.8 0.29 0.00018 23.5 17.2 347 1-512 29-435 (435)
75 protein:vir:1328 Length: 392 # 64.6 0.3 0.00018 23.5 16.7 349 1-511 4-392 (392)
76 protein:vir:81070 Length: 390 61.2 0.36 0.00022 23.0 21.4 334 1-507 30-390 (390)
77 protein:vir:97053 Length: 390 60.7 0.37 0.00023 23.0 18.7 341 1-514 30-390 (390)
78 protein:vir:94424 Length: 387 59.3 0.4 0.00025 22.8 13.8 333 1-528 1-381 (387)
79 protein:vir:2685 Length: 387 # 59.3 0.4 0.00025 22.8 13.8 333 1-528 1-381 (387)
80 protein:vir:96978 Length: 387 59.3 0.4 0.00025 22.8 13.8 333 1-528 1-381 (387)
81 protein:vir:9309 Length: 324 # 58.0 0.42 0.00026 22.6 16.3 303 35-517 1-324 (324)
82 protein:vir:80930 Length: 278 57.0 0.44 0.00028 22.5 15.7 273 171-516 1-278 (278)
83 protein:vir:7409 Length: 408 # 56.9 0.45 0.00028 22.5 20.7 335 1-514 39-408 (408)
84 protein:vir:96223 Length: 324 55.7 0.47 0.00029 22.4 18.1 303 23-517 1-324 (324)
85 protein:vir:101607 Length: 379 50.5 0.61 0.00038 21.8 19.9 338 1-528 22-379 (379)
86 protein:vir:96392 Length: 324 45.0 0.78 0.00049 21.2 18.2 301 1-517 1-324 (324)
87 protein:vir:78830 Length: 324 45.0 0.78 0.00049 21.2 18.2 301 1-517 1-324 (324)
88 protein:vir:3845 Length: 395 # 41.9 0.91 0.00056 20.8 17.3 343 2-516 1-395 (395)
89 protein:vir:1638 Length: 298 # 40.9 0.95 0.00059 20.7 19.7 280 78-508 1-298 (298)
90 protein:vir:101650 Length: 497 39.7 1 0.00062 20.6 21.6 362 1-508 70-497 (497)
91 protein:vir:7855 Length: 497 # 39.7 1 0.00062 20.6 21.6 362 1-508 70-497 (497)
92 protein:vir:100884 Length: 389 38.3 1.1 0.00067 20.4 21.8 338 1-516 15-389 (389)
93 protein:vir:1239 Length: 274 # 38.0 1.1 0.00068 20.4 15.5 273 146-513 1-274 (274)
94 protein:vir:3613 Length: 272 # 37.1 1.1 0.00071 20.3 15.2 267 146-528 1-272 (272)
95 protein:vir:108211 Length: 318 36.8 1.2 0.00072 20.2 10.7 286 116-528 1-315 (318)
96 protein:vir:1025 Length: 408 # 35.9 1.2 0.00075 20.1 14.9 334 1-510 4-408 (408)
97 protein:vir:94771 Length: 298 32.9 1.4 0.00086 19.8 16.0 280 78-508 1-298 (298)
98 protein:vir:3991 Length: 404 # 31.2 1.5 0.00094 19.6 20.4 349 1-516 4-404 (404)
99 protein:vir:102873 Length: 392 31.2 1.5 0.00094 19.6 20.3 325 1-515 35-392 (392)
100 protein:vir:107593 Length: 392 31.2 1.5 0.00094 19.6 20.3 325 1-515 35-392 (392)
101 protein:vir:105004 Length: 392 31.2 1.5 0.00094 19.6 20.3 325 1-515 35-392 (392)
102 protein:vir:102082 Length: 392 31.2 1.5 0.00094 19.6 20.3 325 1-515 35-392 (392)
103 protein:vir:8102 Length: 543 # 30.9 1.5 0.00095 19.6 16.2 345 1-514 159-543 (543)
104 protein:vir:105334 Length: 276 27.7 1.8 0.0011 19.2 14.4 271 146-516 1-276 (276)
105 protein:vir:80684 Length: 315 26.8 1.9 0.0012 19.0 19.2 291 78-517 1-315 (315)
106 protein:vir:5739 Length: 366 # 25.6 2 0.0013 18.9 16.2 329 1-510 3-366 (366)
107 protein:vir:1781 Length: 221 # 24.4 2.2 0.0014 18.7 16.0 203 262-503 1-221 (221)
108 protein:vir:95107 Length: 270 24.2 2.2 0.0014 18.7 12.1 267 166-519 1-270 (270)
109 protein:vir:1383 Length: 421 # 22.5 2.4 0.0015 18.5 17.1 347 1-528 4-415 (421)
110 protein:vir:94711 Length: 347 21.7 2.6 0.0016 18.4 12.9 312 116-503 1-347 (347)

No 1
>protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID:
mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=2.1e-265 Score=1471.64 Aligned_cols=528 Identities=95% Similarity=1.346 Sum_probs=518.3 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |+++|+|+|||+|||||||+|||++.|||+|+|+|||||||+|+|+|.|||+++++|||.||.||+++|+|||++++|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+++|+||+++++|++||+..+.. T Consensus 81 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~ 160 (528) T protein:vir:80 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKG 160 (528) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhh Q lcl|NC_012740. 
161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSI 240 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~ 240 (528) .....+.++.|...+.+.+.+.|+++.+.+..+++....+++....++.+......+.........+..|+++.||+|+. T Consensus 161 ~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:80 161 AAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSI 240 (528) T ss_pred cccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchhh Confidence 98899999999999999999999999999999999999999888888887777777777777788889999999999999 Q ss_pred hhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_012740. 241 AELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) Q Consensus 241 AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~ 320 (528) +|.++.||+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|++ T Consensus 241 AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~ 320 (528) T protein:vir:80 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccc Q lcl|NC_012740. 
321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) Q Consensus 321 ~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~ 400 (528) +|++|++|||.++++++|+|||++++|++|+||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+|+|++ T Consensus 321 ~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:80 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceee Q lcl|NC_012740. 401 ISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) Q Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 480 (528) +++++++.+.+++.|+++++|+|+|+|||+||||||+++|||+|||||++|+|+||||||||||+|+|++||+||||+|| T Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g 480 (528) T protein:vir:80 401 ISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) T ss_pred ccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
481 FKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 481 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) |||||||++|||+++.+|++++||+||+||+++||||+|||||+|||| T Consensus 481 ~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 481 FKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 999999999999999999999999999999999999999999999999 No 2 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=9.4e-262 Score=1451.63 Aligned_cols=528 Identities=94% Similarity=1.342 Sum_probs=514.2 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |+++|+|+|||+|||||||+|||++.|||+|+|+|||||||+|+|+|.|||+++++||+.+|.||+++|+|||++++|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++..+++|+||+.+.++++||+..+.. 
T Consensus 81 s~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~ 160 (528) T protein:vir:66 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKE 160 (528) T ss_pred cccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSI 240 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~ 240 (528) +...++++.+|+..+.+.....|+++.+.+++++++....+......+........+.........+..++++.||+|+. T Consensus 161 a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:66 161 ATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSI 240 (528) T ss_pred ccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchhh Confidence 99999999999999999999999999999999999988888777776666555555666666677788899999999999 Q ss_pred hhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_012740. 
241 AELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) Q Consensus 241 AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~ 320 (528) +|+++.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||||||||||++|++ T Consensus 241 aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~ 320 (528) T protein:vir:66 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccc Q lcl|NC_012740. 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) Q Consensus 321 ~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~ 400 (528) +|++|++|||.++++++|+|||++++|++|+||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+|+|++ T Consensus 321 ~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:66 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceee Q lcl|NC_012740. 
401 ISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) Q Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 480 (528) +++++++.+.+++.|+++++|+|+|+|||+||||||+++|||+|||||++|+|+||||||||||+|+|++||+||||+|| T Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g 480 (528) T protein:vir:66 401 ISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) T ss_pred ccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 481 FKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 481 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) |||||||++|||+++.+|++++||+||+||+++||||+|||||+|||| T Consensus 481 ~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 481 FKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 999999999999999999999999999999999999999999999999 No 3 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=8.9e-256 Score=1418.87 Aligned_cols=518 Identities=70% Similarity=1.111 Sum_probs=493.9 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |+++|+|+|||+|||||||+|+|.+ +||+|+|+|||||||+|+|+|+|||+++++|||.||+||+++|+|||++++|+| T Consensus 4 ~~~~e~l~~kw~p~l~~~~~~~~~~-~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 82 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEGEGLPEIAN-SKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAA 82 (522) T ss_pred cchHHHHHHhhHHHhcCCCCCcccc-chhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCcccccc Confidence 9999999999999999999999987 599999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++.+++|+|+.++|+|+.|||.+++. T Consensus 83 s~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t 162 (522) T protein:vir:69 83 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAK 162 (522) T ss_pred cccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999976543 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSI 240 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~ 240 (528) . +........++.|+.+.+.|...++.............+..+....+.........+..|+++.||+|+. 
T Consensus 163 ~---------~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~ 233 (522) T protein:vir:69 163 K---------FPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSI 233 (522) T ss_pred c---------ccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhh Confidence 3 3444455567778888888888888887777666666666666666667777788889999999999999 Q ss_pred hhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_012740. 241 AELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) Q Consensus 241 AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~ 320 (528) +|+++.||+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|++ T Consensus 234 aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~ 313 (522) T protein:vir:69 234 AELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINY 313 (522) T ss_pred hhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccc Q lcl|NC_012740. 
321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) Q Consensus 321 ~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~ 400 (528) |||+|++|||..+++++|+|||+++.|++++||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+|+|++ T Consensus 314 sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~ 393 (522) T protein:vir:69 314 SAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTG 393 (522) T ss_pred hheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceee Q lcl|NC_012740. 401 ISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) Q Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 480 (528) ++.++++.+.+.+.|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|| T Consensus 394 ~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g 473 (522) T protein:vir:69 394 ISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMG 473 (522) T ss_pred cccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeecCcccccCCCccceecccch-HHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
481 FKTRYGIGINPFADSKSQAPSARITSGML-SKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 481 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~~-~~~~a~~~~~~r~~~Vk~~ 528 (528) |||||||++|||++..+|++++|||||+| |.+++|+|.|||||+|||| T Consensus 474 ~~tRY~l~vNP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 474 FKTRYGIGVNPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eeeeeceeecCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 99999999999999999999999999995 6799999999999999999 No 4 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=1.9e-255 Score=1417.10 Aligned_cols=527 Identities=70% Similarity=1.099 Sum_probs=496.5 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) -.++|+|+|||+|||||||+|||++.|||+|+|+|||||||+|+|||.|||..++++++.+|+|++++|+||+++.+|+| T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~ia~ 81 (529) T protein:vir:10 2 SLKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccccc Confidence 35678999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++.++.+++|+|++++|+|++|||..... 
T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~ 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKG 161 (529) T ss_pred cccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999977655 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccC-cccccCccccccccccccccccccccchh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTK-ADSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) .. .....+.+...+.+.....++...+.|.+.++.+...++........ ..+...+.........+..++++.||+|+ T Consensus 162 ~~-~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa 240 (529) T protein:vir:10 162 AT-TSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATS 240 (529) T ss_pred cc-ccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccchh Confidence 44 44556677777888888888888888888888877665543332221 12233444555666778889999999999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 
240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|+|+.||+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 241 ~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~ 320 (529) T protein:vir:10 241 IAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWIN 320 (529) T ss_pred hhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) ++||+|++||++++++++|+|||+++.|++++||++||||+|++|||+|||+|+|+|+||+||||||||+||++|+|.|. T Consensus 321 ~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~ 400 (529) T protein:vir:10 321 YTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDA 400 (529) T ss_pred hhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCcccccee Q lcl|NC_012740. 
400 GISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVL 479 (528) Q Consensus 400 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~ 479 (528) ++++++++.+.+.+.|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 401 ~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 480 (529) T protein:vir:10 401 GITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVM 480 (529) T ss_pred ccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 480 GFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 480 ~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||||||++|||+++.+|++++||+||+||++++|||+|||||+|||| T Consensus 481 g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 481 GFKTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 9999999999999999999999999999999999999999999999999 No 5 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=2.7e-253 Score=1405.25 Aligned_cols=517 Identities=71% Similarity=1.109 Sum_probs=485.0 Q ss_pred CcchHHHHHhhhhhhcC-CccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccc Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLEN-EKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIA 79 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~ 79 (528) |+++|+|+|||+||||+ |++|||++.+||+|+|+||||||||++++|.|||+++++|||.+|.||+++|+|||++.+|+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 80 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccccccc Confidence 99999999999999996 89999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCc----ccccccccccccccccc Q lcl|NC_012740. 80 SGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEH----AKEAFHPMYSPNAFHSS 155 (528) Q Consensus 80 e~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~----G~EA~~n~~Eadt~fSG 155 (528) ||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++++.+ .+|||++++++|+.||| T Consensus 81 ~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG 160 (524) T protein:vir:98 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) T ss_pred ccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCC Confidence 99999999999999999999999999999999999999999999999999999875432 25777777899999999 Q ss_pred cccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccc Q lcl|NC_012740. 156 LAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFG 235 (528) Q Consensus 156 ~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~G 235 (528) .++... 
+...+.+.....|+...+.+...+....++........+..++...+.........+..++++.| T Consensus 161 ~g~~t~---------~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~G 231 (524) T protein:vir:98 161 EGAHTA---------FAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVG 231 (524) T ss_pred cccccc---------ccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccc Confidence 765433 44555566677788888888888888888877776666666666666666677777889999999 Q ss_pred cchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_012740. 236 MATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIV 315 (528) Q Consensus 236 msTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii 315 (528) |+|+.+|+|+.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||| T Consensus 232 msTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii 311 (524) T protein:vir:98 232 MATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIV 311 (524) T ss_pred cchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhh Q lcl|NC_012740. 
316 DVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILA 395 (528) Q Consensus 316 ~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~ 395 (528) ++|+++||+|++||+.++.+++|+|||+++.|..++||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+ T Consensus 312 ~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~ 391 (524) T protein:vir:98 312 DLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALA 391 (524) T ss_pred HHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred c--cccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCc Q lcl|NC_012740. 396 S--ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) Q Consensus 396 ~--~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~ 473 (528) | +||+.+++. .+.+++.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+ T Consensus 392 ~~~~g~~~~s~~--~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 469 (524) T protein:vir:98 392 RIDSGITPASQG--LQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPK 469 (524) T ss_pred hhhcccccccch--hhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCc Confidence 9 777766644 466788999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
474 SFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 474 s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||+|||||||||++|||+++.++++++|||||+||+++||+|+|||||+|||| T Consensus 470 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 470 NFQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred cccceeeeeeeeceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 9999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=1e-252 Score=1402.07 Aligned_cols=527 Identities=70% Similarity=1.097 Sum_probs=489.1 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) -.++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++|+|.|||++++++++.+|+|++++|+|||++++|+| T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~e 81 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred cccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhcccccccccccccc Confidence 34567899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 
81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++.+++|+||+++.|+++|||+.... T Consensus 82 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~g 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKG 161 (529) T ss_pred ccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988766 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCc-ccccCccccccccccccccccccccchh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKA-DSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) ..... .++.+...+.....+.++...+.|.+.++.+...+.......... .+...+.........+..|+++.||+|+ T Consensus 162 a~~~~-~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta 240 (529) T protein:vir:10 162 ATTTT-DGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATS 240 (529) T ss_pred ccccc-CccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccchh Confidence 65433 334455555666677777777777777777765543332221111 1112233344556678889999999999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 
240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|+|+.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 241 ~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~ 320 (529) T protein:vir:10 241 IAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWIN 320 (529) T ss_pred hhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) .+|++++.+|+++.++++|+|||++++|+.++||++||||+|+++||||||+|+|+|+||+||||||||+||++|+|+|+ T Consensus 321 ~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~ 400 (529) T protein:vir:10 321 YTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDT 400 (529) T ss_pred hhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCcccccee Q lcl|NC_012740. 
400 GISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVL 479 (528) Q Consensus 400 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~ 479 (528) ++++++++...+++.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 401 ~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 480 (529) T protein:vir:10 401 NISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVM 480 (529) T ss_pred hccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 480 GFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 480 ~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||||||++|||+++.+|++++||+||+||+++||||+|||||+|||| T Consensus 481 g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 481 GFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999 No 7 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=2.5e-252 Score=1399.91 Aligned_cols=527 Identities=70% Similarity=1.095 Sum_probs=492.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) -.++|+|+|||+|||||||+|||++.|||+|+|+|||||||+++|+|.|||++++++++.+|+|++++|+|||++++|+| T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~~ 81 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccccc Confidence 34667899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++.+++|+||+++.++++|||+.... T Consensus 82 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~g 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKG 161 (529) T ss_pred ccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988766 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCc-ccccCccccccccccccccccccccchh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKA-DSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) ..+ ...++.+...+.....+.|+...+.|.+.++.+...+.......... 
.+...+.........+..|+++.||+|+ T Consensus 162 a~t-~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa 240 (529) T protein:vir:10 162 ATT-TTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATS 240 (529) T ss_pred ccc-cccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhhh Confidence 554 34455566667777788888888888888888776544332222111 1112233344556678889999999999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|+|+.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 241 ~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~ 320 (529) T protein:vir:10 241 IAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWIN 320 (529) T ss_pred hhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 
320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) .+|++++.+|+++.++.+|+|||++++|+.++||++||||+|+++||||||+|+|+|+||+||||||||+||++|+|+|+ T Consensus 321 ~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~ 400 (529) T protein:vir:10 321 YTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDT 400 (529) T ss_pred hhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCcccccee Q lcl|NC_012740. 400 GISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVL 479 (528) Q Consensus 400 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~ 479 (528) +..+++++...+++.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+| T Consensus 401 ~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~ 480 (529) T protein:vir:10 401 NISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPVM 480 (529) T ss_pred cccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
480 GFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 480 ~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||||||++|||+++.+|++++||+||+||+++||+|+|||||+|||| T Consensus 481 g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 481 GFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999999999999999999999999 No 8 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=5.7e-251 Score=1392.51 Aligned_cols=517 Identities=69% Similarity=1.084 Sum_probs=492.1 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |+++|+|+|||+|||||||+|+|++ +||+|+|+|||||||++.++|+|||++++++|+.+|.|++++++|||++++|+| T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~e 81 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAA 81 (521) T ss_pred cchhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCccccccccccc Confidence 9999999999999999999999987 599999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++.+++|+||+++++|+.|||.+++. 
T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at 161 (521) T protein:vir:10 82 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAK 161 (521) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999987654 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSI 240 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~ 240 (528) . +.....++....|+.+.+.|...+..+...........+..+....+.........+..|+++.||+|+. T Consensus 162 ~---------~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~ 232 (521) T protein:vir:10 162 K---------FAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSI 232 (521) T ss_pred c---------cccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhh Confidence 3 3444455667778888888888888877777666666666666666777777888899999999999999 Q ss_pred hhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_012740. 
241 AELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) Q Consensus 241 AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~ 320 (528) +|+|+.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|++ T Consensus 233 aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~ 312 (521) T protein:vir:10 233 AELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINY 312 (521) T ss_pred HhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccc Q lcl|NC_012740. 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) Q Consensus 321 ~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~ 400 (528) |+++|++|||.+.++++|+|||+++.|++++||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+|+|.+ T Consensus 313 sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~ 392 (521) T protein:vir:10 313 SAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTG 392 (521) T ss_pred eeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceee Q lcl|NC_012740. 
401 ISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) Q Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 480 (528) ++.++++.+.+.+.|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|| T Consensus 393 ~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g 472 (521) T protein:vir:10 393 ISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMG 472 (521) T ss_pred cccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeecCcccccCCCccceecccchHHhhc--chhhhhhhhhhccC Q lcl|NC_012740. 481 FKTRYGIGINPFADSKSQAPSARITSGMLSKDSV--GKNAYFRRVWVKGC 528 (528) Q Consensus 481 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a--~~~~~~r~~~Vk~~ 528 (528) |||||||++|||+++.+|+ ++|+|++++|++++ ++|.|||||+|||| T Consensus 473 ~~tRY~l~~NP~~~~~~~~-~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 473 FKTRYGIGINPFAESAAQA-PASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeeeceeecCcccccCCc-cceeecccchhhhccccccceeeeeeecCC Confidence 9999999999999999985 67999999998876 67789999999999 No 9 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=8.6e-251 Score=1391.54 Aligned_cols=517 Identities=69% Similarity=1.083 Sum_probs=491.5 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |+++|+|+|||+|||||||+|+|++ +||+|+|+|||||||++.++|+|||++++++|+.+|.|++++++|||++++|+| T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iae 81 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAA 81 (521) T ss_pred cchhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCcccccc Confidence 9999999999999999999999987 599999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) |++|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++.+++|+||+++++|++|||+++.. T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~ 161 (521) T protein:vir:72 82 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAK 161 (521) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999987654 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSI 240 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~ 240 (528) . +.....+.+.+.||.+.+.+...++.+..........++..+....+.........+..|+++.||+|+. 
T Consensus 162 ~---------~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~ 232 (521) T protein:vir:72 162 K---------FPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSI 232 (521) T ss_pred c---------ccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhh Confidence 3 3444556677889999999988888877766666666666666666667777788888999999999999 Q ss_pred hhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_012740. 241 AELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) Q Consensus 241 AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~ 320 (528) +|+++.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|++ T Consensus 233 aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~ 312 (521) T protein:vir:72 233 AELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINY 312 (521) T ss_pred hhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccc Q lcl|NC_012740. 
321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) Q Consensus 321 ~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~ 400 (528) |+|+|++|||.+.++++|+|||+++.|++++||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+|+|.+ T Consensus 313 sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~ 392 (521) T protein:vir:72 313 SAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTG 392 (521) T ss_pred eeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceee Q lcl|NC_012740. 401 ISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG 480 (528) Q Consensus 401 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~ 480 (528) ++.++++...+.+.|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|| T Consensus 393 ~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g 472 (521) T protein:vir:72 393 ISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMG 472 (521) T ss_pred cccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeceeecCcccccCCCccceecccchHHhhc--chhhhhhhhhhccC Q lcl|NC_012740. 
481 FKTRYGIGINPFADSKSQAPSARITSGMLSKDSV--GKNAYFRRVWVKGC 528 (528) Q Consensus 481 ~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a--~~~~~~r~~~Vk~~ 528 (528) |||||||++|||+++.+|+ ++|+|++++|++++ ++|.|||||+|||| T Consensus 473 ~~tRY~l~~NP~~~~~~~~-~a~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 473 FKTRYGIGINPFAESAAQA-PASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeeeceeecCcccccCcc-cceeecCcChhhhcCccccceeeeeeecCC Confidence 9999999999999999985 78999999999877 66789999999999 No 10 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=1.5e-250 Score=1390.25 Aligned_cols=518 Identities=57% Similarity=0.909 Sum_probs=476.7 Q ss_pred cchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhc--cccchhhhhhhhccc--------cccccccccC Q lcl|NC_012740. 2 KTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVD--PIYKDEKVVEAFGGF--------IAEAEVAGDH 71 (528) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~--~~~~~~~~~~~~~~~--------l~ea~~~~~~ 71 (528) .++|+|+|||+|||||||+|||++.|||+|+|+|||||||+|+|| +.|||++++++|+.| |.||+++++| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~ 80 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDH 80 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccccc Confidence 789999999999999999999999999999999999999999998 699999999999998 9999999999 Q ss_pred CccccccccccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccc Q lcl|NC_012740. 
72 GYNASNIASGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNA 151 (528) Q Consensus 72 g~~~~~~~e~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt 151 (528) ||++.||+||++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++++.++++|||||++.+|+ T Consensus 81 g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt 160 (534) T protein:vir:10 81 GYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDA 160 (534) T ss_pred ccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999888899999999999999 Q ss_pred cccccccCccccccccccccccccccccccccccc-----cccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 152 FHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIV-----HHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 152 ~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~-----~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) +|||++++........++ +...|+.+ .+.+..++....+...........++....+......... T Consensus 161 ~fSG~~~a~~~~~~~~~~---------a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~ 231 (534) T protein:vir:10 161 DFSGRGAAQDIAVFVRGT---------AVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLAN 231 (534) T ss_pred cccccccccccccccccc---------cccccccccccccccccccccccccccccccccccccCCcccccccccccccc Confidence 999987765443322222 22223222 2233344444444444444444455554444555556667 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 
227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) +..|+++.||+|+.||+++.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|| T Consensus 232 ~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEI 311 (534) T protein:vir:10 232 GYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEI 311 (534) T ss_pred ccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHH Confidence 78899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEE Q lcl|NC_012740. 307 LLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIA 386 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 386 (528) |+||||||||+|+.+|++++.+|+....+++|+|||.++.|+.++||++||||+|+++||||||+|+|+|+||+|||||| T Consensus 312 mlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~ 391 (534) T protein:vir:10 312 MHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIIC 391 (534) T ss_pred HHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEE Confidence 99999999999999999999999888889999999999999999999999999999999999999999999999999999 Q ss_pred chhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccce Q lcl|NC_012740. 
387 SRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTP 466 (528) Q Consensus 387 S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~ 466 (528) |||||++|+|+|||++.|+.+.+.+.+.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+| T Consensus 392 S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~ 471 (534) T protein:vir:10 392 SRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTP 471 (534) T ss_pred chhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEecCccccceeeeeeeeceeecCcccccCCCccceecccch-HHhhcchhhhhhhhhhccC Q lcl|NC_012740. 467 LRATDPQSFHPVLGFKTRYGIGINPFADSKSQAPSARITSGML-SKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 467 ~~~~Dp~s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~-~~~~a~~~~~~r~~~Vk~~ 528 (528) ++++||+||||+|||||||||++|||++..++++..||+||++ |++++|+|+|||||+|||| T Consensus 472 ~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 472 LRGTDPKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred ccccCCccccceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 9999999999999999999999999999999988899999975 9999999999999999999 No 11 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=7.2e-249 Score=1380.98 Aligned_cols=518 Identities=68% Similarity=1.081 Sum_probs=486.5 Q ss_pred cchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccccc Q lcl|NC_012740. 
2 KTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASG 81 (528) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~ 81 (528) .++|+|+|||+||||||++|+|++.|||+|+++||||||+||.+++.||+++++++|+.||+|++++++|||++++|+++ T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~ 80 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccc Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcc Q lcl|NC_012740. 82 QTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDA 161 (528) Q Consensus 82 t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~ 161 (528) ++|++|++|+|+||+|+||++|||||+||||||||||||||||||||||++++++.+++|+|++++|+|+.|||.++... T Consensus 81 ~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~ 160 (519) T protein:vir:10 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccccc Confidence 99999999999999999999999999999999999999999999999999999888999999999999999999876543 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhhh Q lcl|NC_012740. 162 TTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIA 241 (528) Q Consensus 162 ~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~A 241 (528) .. 
....+.....|+.+.+.|..++.............+........+.........+..|+++.||+|+.+ T Consensus 161 ~~---------~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~a 231 (519) T protein:vir:10 161 FE---------ALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIA 231 (519) T ss_pred cc---------cccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchh Confidence 32 223344566777777777777666665555444444444444445556667778899999999999999 Q ss_pred hhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Q lcl|NC_012740. 242 ELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFT 321 (528) Q Consensus 242 E~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~ 321 (528) |+++.+|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|++| T Consensus 232 Eal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~s 311 (519) T protein:vir:10 232 ELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYS 311 (519) T ss_pred hccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccccc Q lcl|NC_012740. 
322 AQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGI 401 (528) Q Consensus 322 a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~ 401 (528) ||+|++|+|.+.+..+|||||+++.|+.++||++||||+|++|||+|||+|+|+|+||+|||||||||||++|+++|.++ T Consensus 312 a~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~ 391 (519) T protein:vir:10 312 AQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSV 391 (519) T ss_pred hhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceeee Q lcl|NC_012740. 402 SLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGF 481 (528) Q Consensus 402 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~ 481 (528) +.++++.+.+.+.|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 392 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~ 471 (519) T protein:vir:10 392 SYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGF 471 (519) T ss_pred ccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeceeecCcccccCCCccceecccch-HHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
482 KTRYGIGINPFADSKSQAPSARITSGML-SKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 482 ~tRY~l~~nP~~~~~~~~~~~~~~~~~~-~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||||++|||++..+|++++||+||+| |.+..++|.|||||+|||| T Consensus 472 ~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 472 KTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred eeeeceeecCcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 9999999999999999999999999977 7899999999999999999 No 12 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=4.8e-239 Score=1327.12 Aligned_cols=508 Identities=58% Similarity=0.916 Sum_probs=448.2 Q ss_pred HHHHHhhhhhhcCCc--cchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccccc Q lcl|NC_012740. 5 KELMEKWSPLLENEK--LPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQ 82 (528) Q Consensus 5 ~~l~~kw~p~l~~~~--~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t 82 (528) -+|+|||+||||||| +|||++.|||+|+|+|||||||+++|+++|||++++++|+.+|+||+++|+|||++.+|+||+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 689999999999998 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccc Q lcl|NC_012740. 83 TTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDAT 162 (528) Q Consensus 83 ~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~ 162 (528) +|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++ ++.||||+++|+|+.|||+.++... 
T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~--tg~EAf~~~nEadt~fSG~~~~~~~ 158 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPL--TGAEAFHPTRQADASFSGQAAASTI 158 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCc--ccccccccccccCcCcccccccccc Confidence 99999999999999999999999999999999999999999999999998764 4789999999999999998876554 Q ss_pred ccccccccccccc---ccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchh Q lcl|NC_012740. 163 TVSPTGTAFQKLT---LSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 163 ~~~~tgt~f~~~t---~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) ...+........+ ...+...|+.+...+...+.. .....+...........+.+..|+++.||+|+ T Consensus 159 ~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~-----------~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta 227 (514) T protein:vir:56 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAV-----------TLAVAGQMTATEYTDGVAGGLLVEIDAGMATS 227 (514) T ss_pred ccccccccccccccccccccccccccccccccccccc-----------cccccccccccccccccccchhhhhhhhhhhh Confidence 3332221111111 111222233322211111111 11111112222344456677889999999999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 
240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|+++.||+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+ T Consensus 228 ~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~ 307 (514) T protein:vir:56 228 QAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVN 307 (514) T ss_pred hhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) +.+++++.+|+.++ .++|+|||++++|++|+||++||||+|+++||||+|+|+|+|+||+||||||||+||++|+|+|| T Consensus 308 ~~atv~~~~~~~~~-~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~ 386 (514) T protein:vir:56 308 SQAQIGKSGWTQGA-GAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDT 386 (514) T ss_pred hheeehhccccccc-ccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhh Confidence 99999999999888 45899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccc-cccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccce Q lcl|NC_012740. 400 GISLAMQGAAQ-GLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPV 478 (528) Q Consensus 400 ~~~~~~~~~~~-~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~ 478 (528) |++++++|... 
+.+.|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+ T Consensus 387 l~~~~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~ 466 (514) T protein:vir:56 387 LVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPV 466 (514) T ss_pred hccccccCccccccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccCCccccce Confidence 99999998776 48899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 479 LGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 479 ~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) |||||||||++|||++...+. .++.|+++-.+..++|.|||||+|||| T Consensus 467 ~g~~tRY~l~~NPy~~~~~~~--~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 467 IGFKTRYGVQVNPFADPTASA--TKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred eeeeeeeceeeCCCCCccccc--cccCCcchhhhcccccceeeeEEEecC Confidence 999999999999999766543 455666665555579999999999999 No 13 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1e-224 Score=1248.55 Aligned_cols=460 Identities=37% Similarity=0.631 Sum_probs=408.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccc-cccccCCccccccc Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEA-EVAGDHGYNASNIA 79 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea-~~~~~~g~~~~~~~ 79 (528) |+++|+|+|||+|||||||+|||++.|||+|+|+|||||||+|+|++ .+|.|+ +++++||+++.+|+ T Consensus 3 ~~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~------------~~l~e~~~~~~~~~~~~~~i~ 70 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREER------------NFLSEAPNVNTNSGATAGFSA 70 (470) T ss_pred cchhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhcc------------chhhhhhhccccccccccccc Confidence 99999999999999999999999999999999999999999999999 468888 79999999999999 Q ss_pred cccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccC Q lcl|NC_012740. 80 SGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAK 159 (528) Q Consensus 80 e~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a 159 (528) |||+|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++ +|+|+||+ |+++.|||..++ T Consensus 71 ~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~----sG~Eaffn--EA~T~fSG~~~~ 144 (470) T protein:vir:10 71 DATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQ----SGTEALFN--EADTAFSGQPDG 144 (470) T ss_pred cccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCC----Cccceeee--cCCcccCccccc Confidence 999999999999999999999999999999999999999999999999999874 67899997 899999998766 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchh Q lcl|NC_012740. 160 DATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 160 ~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) .........+... ....+ .....+..+.... 
.......+..|+++.||+|+ T Consensus 145 ~~~~~~~~~~~a~------~~g~~----------------~~~~~gt~~~~~~-------~~~~~a~~~~y~~~~GMsTa 195 (470) T protein:vir:10 145 LDDTSGFTATGAN------NVGLG----------------TTAQQGSNPGLLN-------STAAQTNATDYNVGQGMRTD 195 (470) T ss_pred ccccccccccccc------ccccc----------------ccccccccccccc-------cccccccccccccccccchH Confidence 5432221111000 00000 0000001110000 01112234568899999999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|. +|+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 196 ~aE~---lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~ 272 (470) T protein:vir:10 196 SAED---LGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIY 272 (470) T ss_pred Hhhh---cCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHh Confidence 9994 5678899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 
320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) .+|++++.+++ +++|+|||+++.| +||++|+||+|++||++++|+|+++|+||+||||||||+||++|+++|| T Consensus 273 ~~a~~~k~~~~----~~~Gv~Dl~~~~~---gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~ 345 (470) T protein:vir:10 273 NVAEPGAQANV----AAAGTFDLDTDSN---GRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGV 345 (470) T ss_pred hhhhhceeccc----cccceEEeecccc---hhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccc Confidence 99999999887 7899999997776 6999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCceEEEEecCceEEEeeCC------CCcceEEEEEecCCCccceeEeccccccceeEEecCc Q lcl|NC_012740. 400 GISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQY------ARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) Q Consensus 400 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y------~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~ 473 (528) |++.|.. ..+++.|+++++|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||.+++.+||+ T Consensus 346 l~~~~~~--~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 423 (470) T protein:vir:10 346 LDYTPAL--NANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQD 423 (470) T ss_pred ccccccc--ccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCc Confidence 9998765 4468899999999999999999999997 8899999999999999999999999999999999999 Q ss_pred cccceeeeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
474 SFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 474 s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||+|||||||||++|||+.+.+| +++||++ |+|.|||||+|||| T Consensus 424 sfqP~~g~~tRY~l~~NP~~~~~~~-~~~~i~~--------~~n~y~r~~~v~~l 469 (470) T protein:vir:10 424 TFQPKIGFKTRYGLVENPFSQGTTQ-GLGTLTR--------NSNRYYRRVKVANL 469 (470) T ss_pred cccceeeeeeeeceeecCcccCCCc-ccccccC--------CCCceeeEEEeecc Confidence 9999999999999999999999998 5788875 88999999999999 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=1.6e-222 Score=1236.61 Aligned_cols=460 Identities=37% Similarity=0.599 Sum_probs=396.4 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) ||++|+|+|||+|||||||+|||++.|||+|+|+|||||||||+|+|.|+++.++++|+. .+......+++ T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~---------~~~~~~n~~~~ 71 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGA---------GTIAPAGSALG 71 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCC---------cccchhhhhhh Confidence 999999999999999999999999999999999999999999999999999999999962 22444556788 Q ss_pred ccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCc Q lcl|NC_012740. 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKD 160 (528) Q Consensus 81 ~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~ 160 (528) +++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++ .|+||||+ |+++.|||..+.. 
T Consensus 72 ~~~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~----~g~EAf~n--Eadt~fSg~~~~~ 145 (468) T protein:vir:10 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ----AGEEALFN--EPDTGFTGGYDAS 145 (468) T ss_pred hcccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCC----CCccceec--ccccccccccccc Confidence 89999999999999999999999999999999999999999999999999874 57899997 9999999875432 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhh Q lcl|NC_012740. 161 ATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSI 240 (528) Q Consensus 161 ~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~ 240 (528) ..... ...+.. ..++....++.. .....+..++++.||+|+. T Consensus 146 ~~~~~--------------~~~~~~-----------------------~~~~~~g~~~~~-~~~a~~~~~~~g~gMsTa~ 187 (468) T protein:vir:10 146 QGDYA--------------VRTGAG-----------------------VGGDSEGNNPAL-LNDAAPGTYEVGSKMPRED 187 (468) T ss_pred ccccc--------------cccccc-----------------------cccCCCCCcccc-cccccccccccccccchHH Confidence 11100 000000 000000011111 1122345688999999999 Q ss_pred hhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Q lcl|NC_012740. 241 AELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) Q Consensus 241 AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~ 320 (528) +|.++ ++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+. 
T Consensus 188 aE~lG----~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~ 263 (468) T protein:vir:10 188 LERMG----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYT 263 (468) T ss_pred HhhcC----CCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhh Confidence 99773 44678999999999999999999999999999999999999999999999999999999999999998777 Q ss_pred heeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccc Q lcl|NC_012740. 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) Q Consensus 321 ~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~ 400 (528) +|+++++... +++|+|||+++.| +||++|+||+|++|||+++|+|+++|+||+||||||||+||++|+++||+ T Consensus 264 va~~~k~~g~----~~~Gv~d~~~~~~---~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l 336 (468) T protein:vir:10 264 VAKKGAQNNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVL 336 (468) T ss_pred hhhheecccc----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcc Confidence 7777665322 7899999997766 69999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccc---ccccccCceEEEEecCceEEEeeCCC----CcceEEEEEecCCCccceeEeccccccceeEEecCc Q lcl|NC_012740. 
401 ISLAMQGAAQG---LNTDTTKAVFAGVLAGKYKVFIDQYA----RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) Q Consensus 401 ~~~~~~~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~ 473 (528) ++.|......+ .+.|+++++|+|+|+|||+||||||+ ++|||+|||||++++|+|||||||||+.|++++||+ T Consensus 337 ~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) T protein:vir:10 337 DYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) T ss_pred eecccccccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCC Confidence 99988776654 47999999999999999999999996 589999999999999999999999999999999999 Q ss_pred cccceeeeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 474 SFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 474 s~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||||+|||||||||++|||+...+..+ ...+|.+| .+|+|.|||||+|||| T Consensus 417 sfqP~~g~~tRY~l~~NP~~~~~~~~~--g~~~~~~~--~~~~N~y~r~~~v~~l 467 (468) T protein:vir:10 417 TFQPKIGFKTRYGMVSNPFVTTNGLYN--GTPDGEAL--TPNANMYYRRVQVTNL 467 (468) T ss_pred cccceeeeeeeeceeecccceeccccC--CCcccccc--cccccceeeeEEEecc Confidence 999999999999999999997543221 12444444 3689999999999999 No 15 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=9.1e-220 Score=1221.44 Aligned_cols=454 Identities=38% Similarity=0.641 Sum_probs=396.3 Q ss_pred cchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccccc Q lcl|NC_012740. 2 KTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASG 81 (528) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~ 81 (528) .++|+|+|||+||||||++|+|++.+||+|+++|||||||+|+|++ .+|+|+. ++||+++. 
+ T Consensus 1 ms~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~------------~~l~ea~--~~~g~~~~----~ 62 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEG------------QVLNETL--QTTGYTTG----D 62 (462) T ss_pred CchHHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcc------------cchhccc--cccCCCcC----c Confidence 6789999999999999999999999999999999999999999977 6899994 89998865 5 Q ss_pred cccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCC--CCcccccccccccccccccccccC Q lcl|NC_012740. 82 QTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPL--AEHAKEAFHPMYSPNAFHSSLAAK 159 (528) Q Consensus 82 t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~--s~~G~EA~~n~~Eadt~fSG~~~a 159 (528) ++|++|++|||+||+|||||+|||||+|||||||||||||||||||+||++++. .++++||||+ |+|+.|||..+. T Consensus 63 ~~t~~~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfn--Eadt~fSg~~~~ 140 (462) T protein:vir:10 63 TATGPVAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFN--EPNAGFSGGAGT 140 (462) T ss_pred ccccccccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhc--cCCcCccccccc Confidence 669999999999999999999999999999999999999999999999998754 4688999997 999999987654 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchh Q lcl|NC_012740. 160 DATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 160 ~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) ........... +... ...+....... 
.........+..+.||+|+ T Consensus 141 ~~~~~~~~~~~------------~~~~-----------~~~g~~~~~~~------------~~~~g~~~~~~~~~GM~Ta 185 (462) T protein:vir:10 141 GLSNYDPTASS------------SAVN-----------DAEGANPGLLN------------DSPAGTYEVTGDATGMATA 185 (462) T ss_pred ccccccccccc------------cccc-----------ccccccceeec------------CCCccceecccccccccch Confidence 32211110000 0000 00000000000 0001112335567899999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|+++. +++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 186 ~aE~lg~--~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~ 263 (462) T protein:vir:10 186 TAEALDD--SSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIY 263 (462) T ss_pred hccccCC--ccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhh Confidence 9998863 45678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 
320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) ++|++++.+|+ +++|||||+++.+ +||++|+||+|++||++++|+|+|+|+||+|||||||||||++|+|+|| T Consensus 264 ~~a~~~k~~~~----~~~Gv~dl~~~~~---gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~ 336 (462) T protein:vir:10 264 VNAVKGAIANT----ATDGIFDLDVDSN---GRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGV 336 (462) T ss_pred hhheeeecccc----cccceeeeccccc---hHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccc Confidence 99999998887 7899999987765 6999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccc-cccccCceEEEEecCceEEEeeCC----CCcceEEEEEecCCCccceeEeccccccceeEEecCcc Q lcl|NC_012740. 400 GISLAMQGAAQGL-NTDTTKAVFAGVLAGKYKVFIDQY----ARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQS 474 (528) Q Consensus 400 ~~~~~~~~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s 474 (528) |+++|+......+ +.|+++.+|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+| T Consensus 337 l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s 416 (462) T protein:vir:10 337 LDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNT 416 (462) T ss_pred hhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCcc Confidence 9999986666665 689999999999999999999998 68999999999999999999999999999999999999 Q ss_pred ccceeeeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
475 FHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 475 ~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) |||+|||||||||++|||+++.+++ ++|++ +|+|.|||||+|||| T Consensus 417 fqP~~g~~tRY~l~~NP~t~~~~~~-~~~~~--------~~~n~y~r~~~v~~l 461 (462) T protein:vir:10 417 FQPKIGFKTRYGMVSNPFSGGLTQG-SGALT--------ANANKYYRRVQVANL 461 (462) T ss_pred ccceeeeeeeeeeeecCCCCCcCCc-ccccc--------ccCcceeeeEEeecc Confidence 9999999999999999999999985 56665 588999999999999 No 16 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=5.8e-215 Score=1195.12 Aligned_cols=449 Identities=40% Similarity=0.665 Sum_probs=393.2 Q ss_pred cchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccccc Q lcl|NC_012740. 2 KTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASG 81 (528) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~ 81 (528) .++|+|+|||+||||||++|||++.|||+|+++|||||||+|+|++ .+|+||. ++||+++.. T Consensus 1 m~~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~------------~~l~ea~--~~~g~~~~s---- 62 (457) T protein:vir:10 1 MSFQNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEG------------KILTETL--QTTGYTGGD---- 62 (457) T ss_pred CchHHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhcc------------ccccccc--cccCCCccc---- Confidence 6789999999999999999999999999999999999999999977 6899995 999998765 Q ss_pred cccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCC--CcccccccccccccccccccccC Q lcl|NC_012740. 82 QTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLA--EHAKEAFHPMYSPNAFHSSLAAK 159 (528) Q Consensus 82 t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s--~~G~EA~~n~~Eadt~fSG~~~a 159 (528) ++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++.+. 
.+.+||||+ |+++.|||..++ T Consensus 63 ~~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~n--Eadt~fSg~~~~ 140 (457) T protein:vir:10 63 TVTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFN--EPNAGFSGGPGA 140 (457) T ss_pred ccccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeee--ccCcccCccccc Confidence 5689999999999999999999999999999999999999999999999886543 456899987 899999987654 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchh Q lcl|NC_012740. 160 DATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATS 239 (528) Q Consensus 160 ~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa 239 (528) ....... ..++. . +..+... + .........++++.||+|+ T Consensus 141 ~~~~~~~--------------~~~~~--------------~----gt~~~~~-----~---~~~~~~~~~~~~~~gmsTA 180 (457) T protein:vir:10 141 YDPGATG--------------VTNDA--------------E----GTNPALL-----N---DSPAGTYEQADDATGMSTA 180 (457) T ss_pred ccccccc--------------ccccc--------------c----ccccccc-----C---ccccccccccccccchhhh Confidence 3221100 00000 0 0000000 0 0011122357789999999 Q ss_pred hhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Q lcl|NC_012740. 240 IAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) Q Consensus 240 ~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~ 319 (528) .+|.++. 
+++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+ T Consensus 181 ~aE~lgd--~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~ 258 (457) T protein:vir:10 181 TVEALDD--STANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIY 258 (457) T ss_pred hhhccCC--CCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHh Confidence 9998752 55677899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccc Q lcl|NC_012740. 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) Q Consensus 320 ~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~ 399 (528) .+|++++.+|+ +++|+|||+++.| +||++|+||+|++||++++|+|+++|+||+||||||||+||++|+++|| T Consensus 259 ~~a~~~~~~~~----~~~gv~dl~~~~~---g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~ 331 (457) T protein:vir:10 259 TNAVAGAQNNT----ATAGVFDLDVDSN---GRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGV 331 (457) T ss_pred hhheeeecccc----ccceeeeeecccc---chhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhccc Confidence 99999999887 7899999986666 6999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccc-cccccCceEEEEecCceEEEeeCCC----CcceEEEEEecCCCccceeEeccccccceeEEecCcc Q lcl|NC_012740. 
400 GISLAMQGAAQGL-NTDTTKAVFAGVLAGKYKVFIDQYA----RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQS 474 (528) Q Consensus 400 ~~~~~~~~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s 474 (528) |+++|+.....++ +.|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||.+++++||+| T Consensus 332 l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s 411 (457) T protein:vir:10 332 LDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDT 411 (457) T ss_pred ccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCcc Confidence 9999987766664 6899999999999999999999886 5899999999999999999999999999999999999 Q ss_pred ccceeeeeeeeceeecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 475 FHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 475 ~qP~~~~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) |||+|||||||||++|||+.+.+|+ +++++. |.|.||||+.|+|| T Consensus 412 fqP~~g~~tRY~l~~NP~~~~~~~~-~~~~~~--------~~n~~~~rs~vs~l 456 (457) T protein:vir:10 412 FQPKIGFKTRYGMVSNPFAGGLTQG-SGALTV--------NANKYYRRVQVANL 456 (457) T ss_pred ccceeeeeeeeeeeecccccccccc-cccccc--------cchhhcceeeeeec Confidence 9999999999999999999999985 566664 56789999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=5.8e-196 Score=1090.93 Aligned_cols=451 Identities=22% Similarity=0.265 Sum_probs=342.2 Q ss_pred Cc---chHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccc Q lcl|NC_012740. 1 MK---TTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASN 77 (528) Q Consensus 1 ~~---~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~ 77 (528) |. 
.+|+|+|||+||||+ |++.|||+|+|+|||||||+ ++ ++ T Consensus 1 ~~~~~~~e~l~~kw~p~l~~-----~~~~~~~~~~a~llenq~~~---~~----------------------------~~ 44 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEG-----CRNDWERHTLATLLENQYRE---AK----------------------------KH 44 (523) T ss_pred CCcchhhHHHHHhhhhhhcc-----cCChhHHHHHHHHhhhhhHH---HH----------------------------Hh Confidence 54 468999999999997 66779999999999999973 22 45 Q ss_pred cccccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccc---------- Q lcl|NC_012740. 78 IASGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMY---------- 147 (528) Q Consensus 78 ~~e~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~---------- 147 (528) |+|++.|++|++|+| ||+||||++|||||+||||||||||||||||||||||.++ .|+|+||++. T Consensus 45 l~e~~~~~~~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q----~gteA~yg~~~~~~~~a~~~ 119 (523) T protein:vir:59 45 LMETTQTTEVDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPEL----PGNGSVYGGTGLTTDTATGG 119 (523) T ss_pred hhhhhhccccccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCC----CCcccccCccccCccccccc Confidence 666677999999997 9999999999999999999999999999999999999874 5678887643 Q ss_pred --cccccccccccCcccccccccccccccc-----ccccccccccc--ccc--------ccc---------------cc- Q lcl|NC_012740. 148 --SPNAFHSSLAAKDATTVSPTGTAFQKLT-----LSTPIAAGDIV--HHT--------FAE---------------TG- 194 (528) Q Consensus 148 --Eadt~fSG~~~a~~~~~~~tgt~f~~~t-----~~t~~a~Gdi~--~~~--------f~~---------------tg- 194 (528) ++++.|++..+..........+...... .+.....++.. +.. 
.++ ++ T Consensus 120 ~~ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga 199 (523) T protein:vir:59 120 LYDENARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPEN 199 (523) T ss_pred ccccccccccccccCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccc Confidence 3444555443322221111100000000 00000000000 000 000 00 Q ss_pred ------cccccccc--ccc-----ccccCcccccCccccccccccccccccccccchhhhhhhcccC--CCCCcccccce Q lcl|NC_012740. 195 ------IAYLQNVT--AEQ-----VTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFN--GSQNNPWNEMS 259 (528) Q Consensus 195 ------~~~~~~~~--~~~-----~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~g--gs~~~~f~EMa 259 (528) ..+...+. ... ....+.+.................++.+.||+|+.+|.++..+ ++.++.|+||+ T Consensus 200 ~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~ 279 (523) T protein:vir:59 200 TVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEIN 279 (523) T ss_pred cccchhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhcccccccccccccccccccee Confidence 00000000 000 0000000000001111122234568889999999999887654 46788999999 Q ss_pred eEEeeEEEEEecccccccchHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccc Q lcl|NC_012740. 
260 MRIDKQVVEAKSRQLKARYSIEVAQDLRAVH-GMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAG 338 (528) Q Consensus 260 FSIEK~TVTAKSRALKAEYT~ELAQDLkAiH-GLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g 338 (528) |+||||+|||||||||||||||||||||||| |||||+||+||||+||||||||||||+|+.+|++++++++ .++| T Consensus 280 FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~----~~~g 355 (523) T protein:vir:59 280 LELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGF----WSEV 355 (523) T ss_pred eEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccc----cccc Confidence 9999999999999999999999999999999 9999999999999999999999999999999999988876 7899 Q ss_pred eeccccccc---cccchhH--HHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccccccccccccccccc Q lcl|NC_012740. 339 VFDLQDPID---TRGARWA--GESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLN 413 (528) Q Consensus 339 ~~dl~~~~d---~~~~r~a--~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~ 413 (528) ||||.++.| ..|.+|+ +||+|.||++||||+|+|+|+|+||+|||||||||||++|+++|||+..+ ... T Consensus 356 ~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~------~~~ 429 (523) T protein:vir:59 356 VGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGN------DNR 429 (523) T ss_pred eeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCC------ccc Confidence 999998766 4444553 99999999999999999999999999999999999999999999985442 246 Q ss_pred ccccCceEEEEecCceEEEeeCCCCcceEEEEEecC-CCccceeEeccccccceeEE-ecCccccceeeeeeeeceee-c Q lcl|NC_012740. 414 TDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGD-NEMDAGIYYAPYVALTPLRA-TDPQSFHPVLGFKTRYGIGI-N 490 (528) Q Consensus 414 ~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~-~~~d~g~fyaPYv~~~~~~~-~Dp~s~qP~~~~~tRY~l~~-n 490 (528) .|+++.+|+|+|+|||+||||+|+++|||+|||||. 
.++|+|||||||||+.++++ .||+||||+|||||||||++ | T Consensus 430 ~~~~~~~~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~n 509 (523) T protein:vir:59 430 DGGTGIFYVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVR 509 (523) T ss_pred cccccceeEEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecc Confidence 677889999999999999999999999999999995 59999999999999998865 59999999999999999986 9 Q ss_pred CcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 491 PFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 491 P~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ||+.+.-- ||=| T Consensus 510 P~~~~~~~--------------------------~~~~ 521 (523) T protein:vir:59 510 PEFYGLLY--------------------------VKLL 521 (523) T ss_pred hhHhhhhh--------------------------hhhc Confidence 99976532 1111 No 18 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=95.55 E-value=0.0019 Score=35.56 Aligned_cols=348 Identities=15% Similarity=0.104 Sum_probs=131.0 Q ss_pred Ccch-HHHHHhhhhh---hcC--CccchhccchhhhhhhhhhhhHHHHhhhccccch------hhhhhhhcccccccccc Q lcl|NC_012740. 1 MKTT-KELMEKWSPL---LEN--EKLPEIATASKQKLVAKILESQEADFAVDPIYKD------EKVVEAFGGFIAEAEVA 68 (528) Q Consensus 1 ~~~~-~~l~~kw~p~---l~~--~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~------~~~~~~~~~~l~ea~~~ 68 (528) |... ++..++..-+ .+. +...++...-+ +. ...++..++.....+.... ....+.+-..+...... 
T Consensus 18 ~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (385) T protein:vir:18 18 MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELT-KS-GTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGT 95 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH-HHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhcc Confidence 1000 0000110000 000 00001100000 00 1112222222221111100 00111111111100000 Q ss_pred ccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccc Q lcl|NC_012740. 69 GDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMY 147 (528) Q Consensus 69 ~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~ 147 (528) ....-....+..+++++.. -..|.++ .+++++..+..-.++|-++||++++.-+. + +.... T Consensus 96 ~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~------------- 157 (385) T protein:vir:18 96 FGAKTFNKSLGSDADSAGS-LIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYV--R--EEVFT------------- 157 (385) T ss_pred chhhHHHhhhccccccCCc-eecchhhhHHHHHhhhccchhhhcceecccCcceEEE--E--EecCC------------- Confidence 0000000111111111111 1123333 55555666788889999999988752111 0 00000 Q ss_pred cccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccc Q lcl|NC_012740. 148 SPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEG 227 (528) Q Consensus 148 Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 227 (528) +.+.+ T Consensus 158 -~~a~~-------------------------------------------------------------------------- 162 (385) T protein:vir:18 158 -NNADV-------------------------------------------------------------------------- 162 (385) T ss_pred -cceee-------------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_012740. 
228 KLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVL 307 (528) Q Consensus 228 ~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEIm 307 (528) + +| +...++-..++++++.+.|.-+-...+|.||.||-- +.++.|.+-|+..|. T Consensus 163 ----v--------~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~ 216 (385) T protein:vir:18 163 ----V--------AE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLA 216 (385) T ss_pred ----e--------cc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHH Confidence 0 01 011233344455566666666666789999999852 347788888888888 Q ss_pred HHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEc Q lcl|NC_012740. 308 LEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIAS 387 (528) Q Consensus 308 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S 387 (528) .-+|+.||.- . |.. ....|++.......... -... -..+..|..+...|. ..+...+.+||| T Consensus 217 ~~~d~~~l~G---~---g~~------~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~~~~~~~~~~~~ 278 (385) T protein:vir:18 217 LKEEGQLLNG---D---GTG------DNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--ESEFSASGIVLN 278 (385) T ss_pred HHHHHHHHhc---c---CCC------Ccccccccccccccccc-cccc---cchHHHHHHHHHhhc--cccCCCCEEEEc Confidence 8888887731 0 000 01233332221110000 0000 112233344434442 234467889999 Q ss_pred hhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccccccee Q lcl|NC_012740. 388 RNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPL 467 (528) Q Consensus 388 ~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~ 467 (528) |+....|.... ...|.. +..+.+.. .-++|.| ++|+++++.|..-+++|-- +-+|--+.-..+. 
T Consensus 279 ~~~~~~l~~lk-----d~~G~~--l~~~~~~~-~~~~l~G-~pV~~~~~~p~~~~~~gd~-------~~~~~~~~~~~~~ 342 (385) T protein:vir:18 279 PRDWHNIALLK-----DNEGRY--IFGGPQAF-TSNIMWG-LPVVPTKAQAAGTFTVGGF-------DMASQVWDRMDAT 342 (385) T ss_pred HHHHHHHHHhh-----cCCCce--eccCcccC-CCceecc-eeeEEcCcCCCCcEEEeec-------ccEEEEEEecceE Confidence 99999887532 111100 11111111 1356766 9999999998776666521 0011111111111 Q ss_pred EEecCc---cc-cceee--eeeeece-eecCcccccCCCccceecccc Q lcl|NC_012740. 468 RATDPQ---SF-HPVLG--FKTRYGI-GINPFADSKSQAPSARITSGM 508 (528) Q Consensus 468 ~~~Dp~---s~-qP~~~--~~tRY~l-~~nP~~~~~~~~~~~~~~~~~ 508 (528) .-++.. -| +..++ ...||+. +.+|=+..+ .++.-+. T Consensus 343 v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~-----~~~~aa~ 385 (385) T protein:vir:18 343 VEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIK-----GTFSSGS 385 (385) T ss_pred EEEeccccchhhcCcEEEEEEEeeccEEecccceEE-----EEeccCC Confidence 111111 12 23344 3457776 344411100 0111111 No 19 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=95.55 E-value=0.0019 Score=35.56 Aligned_cols=348 Identities=15% Similarity=0.104 Sum_probs=131.0 Q ss_pred Ccch-HHHHHhhhhh---hcC--CccchhccchhhhhhhhhhhhHHHHhhhccccch------hhhhhhhcccccccccc Q lcl|NC_012740. 1 MKTT-KELMEKWSPL---LEN--EKLPEIATASKQKLVAKILESQEADFAVDPIYKD------EKVVEAFGGFIAEAEVA 68 (528) Q Consensus 1 ~~~~-~~l~~kw~p~---l~~--~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~------~~~~~~~~~~l~ea~~~ 68 (528) |... ++..++..-+ .+. +...++...-+ +. ...++..++.....+.... ....+.+-..+...... 
T Consensus 18 ~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (385) T protein:vir:19 18 MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELT-KS-GTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGT 95 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH-HHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhcc Confidence 1000 0000110000 000 00001100000 00 1112222222221111100 00111111111100000 Q ss_pred ccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccc Q lcl|NC_012740. 69 GDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMY 147 (528) Q Consensus 69 ~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~ 147 (528) ....-....+..+++++.. -..|.++ .+++++..+..-.++|-++||++++.-+. + +.... T Consensus 96 ~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~------------- 157 (385) T protein:vir:19 96 FGAKTFNKSLGSDADSAGS-LIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYV--R--EEVFT------------- 157 (385) T ss_pred chhhHHHhhhccccccCCc-eecchhhhHHHHHhhhccchhhhcceecccCcceEEE--E--EecCC------------- Confidence 0000000111111111111 1123333 55555666788889999999988752111 0 00000 Q ss_pred cccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccc Q lcl|NC_012740. 148 SPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEG 227 (528) Q Consensus 148 Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 227 (528) +.+.+ T Consensus 158 -~~a~~-------------------------------------------------------------------------- 162 (385) T protein:vir:19 158 -NNADV-------------------------------------------------------------------------- 162 (385) T ss_pred -cceee-------------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_012740. 
228 KLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVL 307 (528) Q Consensus 228 ~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEIm 307 (528) + +| +...++-..++++++.+.|.-+-...+|.||.||-- +.++.|.+-|+..|. T Consensus 163 ----v--------~E---------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~ 216 (385) T protein:vir:19 163 ----V--------AE---------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLA 216 (385) T ss_pred ----e--------cc---------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHH Confidence 0 01 011233344455566666666666789999999852 347788888888888 Q ss_pred HHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEc Q lcl|NC_012740. 308 LEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIAS 387 (528) Q Consensus 308 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S 387 (528) .-+|+.||.- . |.. ....|++.......... -... -..+..|..+...|. ..+...+.+||| T Consensus 217 ~~~d~~~l~G---~---g~~------~~~~Gi~~~~~~~~~~~-~~~~---~~~~d~i~~~~~~l~--~~~~~~~~~~~~ 278 (385) T protein:vir:19 217 LKEEGQLLNG---D---GTG------DNLEGLNKVATAYDTSL-NATG---DTRADIIAHAIYQVT--ESEFSASGIVLN 278 (385) T ss_pred HHHHHHHHhc---c---CCC------Ccccccccccccccccc-cccc---cchHHHHHHHHHhhc--cccCCCCEEEEc Confidence 8888887731 0 000 01233332221110000 0000 112233344434442 234467889999 Q ss_pred hhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccccccee Q lcl|NC_012740. 388 RNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPL 467 (528) Q Consensus 388 ~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~ 467 (528) |+....|.... ...|.. +..+.+.. .-++|.| ++|+++++.|..-+++|-- +-+|--+.-..+. 
T Consensus 279 ~~~~~~l~~lk-----d~~G~~--l~~~~~~~-~~~~l~G-~pV~~~~~~p~~~~~~gd~-------~~~~~~~~~~~~~ 342 (385) T protein:vir:19 279 PRDWHNIALLK-----DNEGRY--IFGGPQAF-TSNIMWG-LPVVPTKAQAAGTFTVGGF-------DMASQVWDRMDAT 342 (385) T ss_pred HHHHHHHHHhh-----cCCCce--eccCcccC-CCceecc-eeeEEcCcCCCCcEEEeec-------ccEEEEEEecceE Confidence 99999887532 111100 11111111 1356766 9999999998776666521 0011111111111 Q ss_pred EEecCc---cc-cceee--eeeeece-eecCcccccCCCccceecccc Q lcl|NC_012740. 468 RATDPQ---SF-HPVLG--FKTRYGI-GINPFADSKSQAPSARITSGM 508 (528) Q Consensus 468 ~~~Dp~---s~-qP~~~--~~tRY~l-~~nP~~~~~~~~~~~~~~~~~ 508 (528) .-++.. -| +..++ ...||+. +.+|=+..+ .++.-+. T Consensus 343 v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~-----~~~~aa~ 385 (385) T protein:vir:19 343 VEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIK-----GTFSSGS 385 (385) T ss_pred EEEeccccchhhcCcEEEEEEEeeccEEecccceEE-----EEeccCC Confidence 111111 12 23344 3457776 344411100 0111111 No 20 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=95.43 E-value=0.0021 Score=35.29 Aligned_cols=353 Identities=13% Similarity=0.055 Sum_probs=125.6 Q ss_pred Cc---------------chHHHHHhhhhhhcCCc--cchhccchhh-----hhhhhhhhhHHHHhhhccccchhhhhhhh Q lcl|NC_012740. 1 MK---------------TTKELMEKWSPLLENEK--LPEIATASKQ-----KLVAKILESQEADFAVDPIYKDEKVVEAF 58 (528) Q Consensus 1 ~~---------------~~~~l~~kw~p~l~~~~--~~~~~~~~~~-----~~~~~~~enq~~~~~~~~~~~~~~~~~~~ 58 (528) |. ..+++.++=.-.++... 
.-+++..-++ ...+...+-+++...+.+ .....++ T Consensus 2 ~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 77 (413) T protein:vir:81 2 VKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRK----GEGYKSI 77 (413) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhh----hhhhhhh Confidence 11 11111111111000000 0000000000 000000000000000000 0000011 Q ss_pred ccccccc---------------------cccccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeec Q lcl|NC_012740. 59 GGFIAEA---------------------EVAGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQP 115 (528) Q Consensus 59 ~~~l~ea---------------------~~~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQP 115 (528) +..+.+. ...............++ +.+....=|..+ .+++.+-+..+..+++.|+| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~ 156 (413) T protein:vir:81 78 GEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATL-TDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLT 156 (413) T ss_pred hhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhccc-ccccccccchhhHHHHHHHHhhhhhHHhhcceee Confidence 1000000 00000000000001111 111111113222 34555556778889999999 Q ss_pred CCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 116 MSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGI 195 (528) Q Consensus 116 mTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~ 195 (528) |++++.-+.-.+. ... ....+ T Consensus 157 ~~~~~~~~~~~~~-~~~--------------~~~~a-------------------------------------------- 177 (413) T protein:vir:81 157 MTNTTIKYLMEKA-NRV--------------VEGGF-------------------------------------------- 177 (413) T ss_pred ccCCceeEEEecc-ccc--------------ccccc-------------------------------------------- Confidence 9998532211110 000 00000 Q ss_pred ccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccccc Q lcl|NC_012740. 
196 AYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLK 275 (528) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALK 275 (528) ..++.|-.. .| +....|.+..|.+.|..+ . T Consensus 178 ----------------------------------~~v~Eg~~~--~~-------~~~~~f~~i~~~~~k~~~-------~ 207 (413) T protein:vir:81 178 ----------------------------------KTVAEGGKK--PY-------MRFADFDIVTESLSKIAG-------L 207 (413) T ss_pred ----------------------------------ceecCcccc--cc-------cCcccceeeEeeeeeEEE-------e Confidence 000000000 00 111246666666666554 3 Q ss_pred ccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHH Q lcl|NC_012740. 276 ARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAG 355 (528) Q Consensus 276 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~ 355 (528) ...|-||.+|--+ .++.|.+-|+..|..-+|+.||. -+ |. .....|+++......... T Consensus 208 ~~iS~ell~ds~~-----l~~~i~~~la~~~~~~~d~~~l~---G~---G~------~~~~~Gi~~~~~~~~~~~----- 265 (413) T protein:vir:81 208 TKITDEMIEDYDF-----LVSYINARLLEELAIEEERQLLL---GD---GT------GNNLTGLLKRDGIQTLAV----- 265 (413) T ss_pred ehhhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhc---cC---CC------CCcccccccccccccccc----- Confidence 5589999998632 47888888888888888888772 11 10 012345544432211111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccc----cccccccccccccccccccCceEEEEecCceEE Q lcl|NC_012740. 356 ESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASAD----QGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKV 431 (528) Q Consensus 356 e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g----~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~v 431 (528) .....++.-|...-..+.....+ ..+.+|++|.....|..-- -....+. ..+...+. 
.....++|.| ++| T Consensus 266 ~~~~~~~~~i~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~---~~~~~~~~-~~~~~~~l~G-~pv 339 (413) T protein:vir:81 266 SNKDELADSIYKAMTNISLATPF-QADALVINPLDYQELRLAKDANGQYYGGGV---FQGQYGSG-GIMLDPAPWG-LRT 339 (413) T ss_pred cccchhHHHHHHHHHHhhhhccC-CCcEEEEcHHHHHHHHHhhccCCceecccc---cccccccc-ccccCceecc-eee Confidence 01112233333333444444444 4566889999888776421 1000000 00000000 0011245654 899 Q ss_pred EeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecC------ccccceeeeeeeeceee-cCcccccCCCcccee Q lcl|NC_012740. 432 FIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDP------QSFHPVLGFKTRYGIGI-NPFADSKSQAPSARI 504 (528) Q Consensus 432 y~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~ 504 (528) +++...+..-+++|---. +|--+...-+..-+++ .+-|=.+-+..||++.+ +|= T Consensus 340 ~~s~~~~~~~~~~gd~~~-------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~------------ 400 (413) T protein:vir:81 340 VQSQVVPVGKPVVGAFRS-------AASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPE------------ 400 (413) T ss_pred EEcCCCCcccEEEEeccc-------EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc------------ Confidence 999987766666553210 0111111111111111 22344455556666542 220 Q ss_pred cccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 505 TSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 505 ~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) -|+++-++.. T Consensus 401 --------------a~~~l~~~~~ 410 (413) T protein:vir:81 401 --------------AIVQLDVAEV 410 (413) T ss_pred --------------ceEEEEecCC Confidence 0011111111 No 21 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.37 E-value=0.0022 Score=35.15 Aligned_cols=358 Identities=17% Similarity=0.112 Sum_probs=145.8 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhh------------------hhhhhhhhhHHH--Hhhhc-------------- Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENEKLPEIATASKQ------------------KLVAKILESQEA--DFAVD-------------- 46 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~------------------~~~~~~~enq~~--~~~~~-------------- 46 (528) |...++|.++=.-+.+.-. +++..-++ .+-++|-+.|++ .+.+. T Consensus 1 mk~~~el~~~l~el~~~~~--~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQID--LKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHH--HHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 8888888888777755210 01111111 111111111110 00000 Q ss_pred --cccchhhhhhhhccccccc-----cc-----cccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhcee Q lcl|NC_012740. 47 --PIYKDEKVVEAFGGFIAEA-----EV-----AGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICG 112 (528) Q Consensus 47 --~~~~~~~~~~~~~~~l~ea-----~~-----~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~G 112 (528) +..++..-...+...+.+- +. .-..+.......-++..|. ..-|.-+ .+++++..+.+-.+++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~~ 156 (415) T protein:vir:79 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVT 156 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhhee Confidence 0000000000011100000 00 0000001111011111121 1124333 45566667788899999 Q ss_pred eecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 113 VQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAE 192 (528) Q Consensus 113 VQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~ 192 (528) |.||++..+-+--.|.. . +. 
...+ T Consensus 157 ~~~~~~~~~~~~~~~~~--~------~~---------~~~~--------------------------------------- 180 (415) T protein:vir:79 157 VKRVTNGSGKYPVVRQS--E------VA---------ALEK--------------------------------------- 180 (415) T ss_pred eeeccCCceeEEEEeec--C------Cc---------ccee--------------------------------------- Confidence 99999886543322210 0 00 0000 Q ss_pred cccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecc Q lcl|NC_012740. 193 TGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSR 272 (528) Q Consensus 193 tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSR 272 (528) ++.+ ++ ........|.+..|++.|..+ T Consensus 181 ---------------------------------------v~E~-----~~----~~~~~~~~~~~v~~~~~k~~~----- 207 (415) T protein:vir:79 181 ---------------------------------------VEEL-----EE----NPELAVKPFFQLAYDINTHRG----- 207 (415) T ss_pred ---------------------------------------eccc-----cc----cCcccccceeeEEeeeeeeEe----- Confidence 0000 00 000111235555555555544 Q ss_pred cccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccch Q lcl|NC_012740. 273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR 352 (528) Q Consensus 273 ALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r 352 (528) ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+ +.... ...++ . ....+. T Consensus 208 --~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~~~~--~~~~~--~---~~~~~~- 271 (415) T protein:vir:79 208 --YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TSSGF--EKEGK--K---LEVKKA- 271 (415) T ss_pred --eehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--ccccc--ccccc--c---cccccc- Confidence 456999999984 35789999999999999999999996332111111 00000 00000 0 000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEE Q lcl|NC_012740. 
353 WAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVF 432 (528) Q Consensus 353 ~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) -..+....++ +.+.. --+ +.+.+||++.....|.... ...| ..-...+.+.. ..++|.| ++|+ T Consensus 272 ~~~~~i~~~~-------~~~~~-~~~-~~~~~v~n~~~~~~l~~lk-----d~~G-~~l~~~~~~~~-~~~~l~G-~pV~ 334 (415) T protein:vir:79 272 KSLDDIKDAI-------NLNVK-PNY-EHNVAIVSQTMFAKLDKMK-----DKLG-NYLIQPDVKEK-TQQRLLG-AKIE 334 (415) T ss_pred cchhHHHHHH-------Hhhhh-hcc-CCCEEEEcHHHHHHHHHhh-----ccCC-ceeeccCcCCC-CCceecc-eeeE Confidence 0011222232 22322 123 5677899999998887631 1111 00011122211 2346765 8888 Q ss_pred eeCCCCcceEEEEEecCCCccceeEecc----ccc----cceeEEecCccccceeeeeeeecee-ecCcccccCCCccce Q lcl|NC_012740. 433 IDQYARQDYFTVGYKGDNEMDAGIYYAP----YVA----LTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSAR 503 (528) Q Consensus 433 ~D~y~~~dy~~vG~KG~~~~d~g~fyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~ 503 (528) +.++.+.. -.|+ ..++|+- |+- ..-+...|-.+++..+....|++.. .+|=+.-.-.- ..- T Consensus 335 ~~~~~~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~-~~~ 404 (415) T protein:vir:79 335 ILPDEVLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY-DDS 404 (415) T ss_pred EecccccC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEE-ecc Confidence 87765321 1111 1122221 211 1112344556778888888899865 34411110000 000 Q ss_pred ecccchHHhhc Q lcl|NC_012740. 
504 ITSGMLSKDSV 514 (528) Q Consensus 504 ~~~~~~~~~~a 514 (528) ..-.++..+-+ T Consensus 405 ~~~~~~~~~~~ 415 (415) T protein:vir:79 405 ERGEGDLGLEA 415 (415) T ss_pred CCCCCccccCC Confidence 01112222222 No 22 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.37 E-value=0.0022 Score=35.15 Aligned_cols=358 Identities=17% Similarity=0.112 Sum_probs=145.8 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhh------------------hhhhhhhhhHHH--Hhhhc-------------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQ------------------KLVAKILESQEA--DFAVD-------------- 46 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~------------------~~~~~~~enq~~--~~~~~-------------- 46 (528) |...++|.++=.-+.+.-. +++..-++ .+-++|-+.|++ .+.+. T Consensus 1 mk~~~el~~~l~el~~~~~--~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQID--LKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHH--HHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 8888888888777755210 01111111 111111111110 00000 Q ss_pred --cccchhhhhhhhccccccc-----cc-----cccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhcee Q lcl|NC_012740. 47 --PIYKDEKVVEAFGGFIAEA-----EV-----AGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICG 112 (528) Q Consensus 47 --~~~~~~~~~~~~~~~l~ea-----~~-----~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~G 112 (528) +..++..-...+...+.+- +. .-..+.......-++..|. ..-|.-+ .+++++..+.+-.+++. 
T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~~ 156 (415) T protein:vir:81 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVT 156 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhhee Confidence 0000000000011100000 00 0000001111011111121 1124333 45566667788899999 Q ss_pred eecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 113 VQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAE 192 (528) Q Consensus 113 VQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~ 192 (528) |.||++..+-+--.|.. . +. ...+ T Consensus 157 ~~~~~~~~~~~~~~~~~--~------~~---------~~~~--------------------------------------- 180 (415) T protein:vir:81 157 VKRVTNGSGKYPVVRQS--E------VA---------ALEK--------------------------------------- 180 (415) T ss_pred eeeccCCceeEEEEeec--C------Cc---------ccee--------------------------------------- Confidence 99999886543322210 0 00 0000 Q ss_pred cccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecc Q lcl|NC_012740. 193 TGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSR 272 (528) Q Consensus 193 tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSR 272 (528) ++.+ ++ ........|.+..|++.|..+ T Consensus 181 ---------------------------------------v~E~-----~~----~~~~~~~~~~~v~~~~~k~~~----- 207 (415) T protein:vir:81 181 ---------------------------------------VEEL-----EE----NPELAVKPFFQLAYDINTHRG----- 207 (415) T ss_pred ---------------------------------------eccc-----cc----cCcccccceeeEEeeeeeeEe----- Confidence 0000 00 000111235555555555544 Q ss_pred cccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccch Q lcl|NC_012740. 
273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR 352 (528) Q Consensus 273 ALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r 352 (528) ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+ +.... ...++ . ....+. T Consensus 208 --~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~~~~--~~~~~--~---~~~~~~- 271 (415) T protein:vir:81 208 --YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TSSGF--EKEGK--K---LEVKKA- 271 (415) T ss_pred --eehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--ccccc--ccccc--c---cccccc- Confidence 456999999984 35789999999999999999999996332111111 00000 00000 0 000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEE Q lcl|NC_012740. 353 WAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVF 432 (528) Q Consensus 353 ~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) -..+....++ +.+.. --+ +.+.+||++.....|.... ...| ..-...+.+.. ..++|.| ++|+ T Consensus 272 ~~~~~i~~~~-------~~~~~-~~~-~~~~~v~n~~~~~~l~~lk-----d~~G-~~l~~~~~~~~-~~~~l~G-~pV~ 334 (415) T protein:vir:81 272 KSLDDIKDAI-------NLNVK-PNY-EHNVAIVSQTMFAKLDKMK-----DKLG-NYLIQPDVKEK-TQQRLLG-AKIE 334 (415) T ss_pred cchhHHHHHH-------Hhhhh-hcc-CCCEEEEcHHHHHHHHHhh-----ccCC-ceeeccCcCCC-CCceecc-eeeE Confidence 0011222232 22322 123 5677899999998887631 1111 00011122211 2346765 8888 Q ss_pred eeCCCCcceEEEEEecCCCccceeEecc----ccc----cceeEEecCccccceeeeeeeecee-ecCcccccCCCccce Q lcl|NC_012740. 433 IDQYARQDYFTVGYKGDNEMDAGIYYAP----YVA----LTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSAR 503 (528) Q Consensus 433 ~D~y~~~dy~~vG~KG~~~~d~g~fyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~ 503 (528) +.++.+.. -.|+ ..++|+- |+- ..-+...|-.+++..+....|++.. 
.+|=+.-.-.- ..- T Consensus 335 ~~~~~~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~-~~~ 404 (415) T protein:vir:81 335 ILPDEVLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY-DDS 404 (415) T ss_pred EecccccC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEE-ecc Confidence 87765321 1111 1122221 211 1112344556778888888899865 34411110000 000 Q ss_pred ecccchHHhhc Q lcl|NC_012740. 504 ITSGMLSKDSV 514 (528) Q Consensus 504 ~~~~~~~~~~a 514 (528) ..-.++..+-+ T Consensus 405 ~~~~~~~~~~~ 415 (415) T protein:vir:81 405 ERGEGDLGLEA 415 (415) T ss_pred CCCCCccccCC Confidence 01112222222 No 23 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.37 E-value=0.0022 Score=35.15 Aligned_cols=358 Identities=17% Similarity=0.112 Sum_probs=145.8 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhh------------------hhhhhhhhhHHH--Hhhhc-------------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQ------------------KLVAKILESQEA--DFAVD-------------- 46 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~------------------~~~~~~~enq~~--~~~~~-------------- 46 (528) |...++|.++=.-+.+.-. +++..-++ .+-++|-+.|++ .+.+. T Consensus 1 mk~~~el~~~l~el~~~~~--~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQID--LKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHH--HHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 8888888888777755210 01111111 111111111110 00000 Q ss_pred --cccchhhhhhhhccccccc-----cc-----cccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhcee Q lcl|NC_012740. 
47 --PIYKDEKVVEAFGGFIAEA-----EV-----AGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICG 112 (528) Q Consensus 47 --~~~~~~~~~~~~~~~l~ea-----~~-----~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~G 112 (528) +..++..-...+...+.+- +. .-..+.......-++..|. ..-|.-+ .+++++..+.+-.+++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~~ 156 (415) T protein:vir:98 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVT 156 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhhee Confidence 0000000000011100000 00 0000001111011111121 1124333 45566667788899999 Q ss_pred eecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 113 VQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAE 192 (528) Q Consensus 113 VQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~ 192 (528) |.||++..+-+--.|.. . +. ...+ T Consensus 157 ~~~~~~~~~~~~~~~~~--~------~~---------~~~~--------------------------------------- 180 (415) T protein:vir:98 157 VKRVTNGSGKYPVVRQS--E------VA---------ALEK--------------------------------------- 180 (415) T ss_pred eeeccCCceeEEEEeec--C------Cc---------ccee--------------------------------------- Confidence 99999886543322210 0 00 0000 Q ss_pred cccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecc Q lcl|NC_012740. 
193 TGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSR 272 (528) Q Consensus 193 tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSR 272 (528) ++.+ ++ ........|.+..|++.|..+ T Consensus 181 ---------------------------------------v~E~-----~~----~~~~~~~~~~~v~~~~~k~~~----- 207 (415) T protein:vir:98 181 ---------------------------------------VEEL-----EE----NPELAVKPFFQLAYDINTHRG----- 207 (415) T ss_pred ---------------------------------------eccc-----cc----cCcccccceeeEEeeeeeeEe----- Confidence 0000 00 000111235555555555544 Q ss_pred cccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccch Q lcl|NC_012740. 273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR 352 (528) Q Consensus 273 ALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r 352 (528) ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+ +.... ...++ . ....+. T Consensus 208 --~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~~~~--~~~~~--~---~~~~~~- 271 (415) T protein:vir:98 208 --YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TSSGF--EKEGK--K---LEVKKA- 271 (415) T ss_pred --eehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--ccccc--ccccc--c---cccccc- Confidence 456999999984 35789999999999999999999996332111111 00000 00000 0 000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEE Q lcl|NC_012740. 353 WAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVF 432 (528) Q Consensus 353 ~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) -..+....++ +.+.. --+ +.+.+||++.....|.... ...| ..-...+.+.. 
..++|.| ++|+ T Consensus 272 ~~~~~i~~~~-------~~~~~-~~~-~~~~~v~n~~~~~~l~~lk-----d~~G-~~l~~~~~~~~-~~~~l~G-~pV~ 334 (415) T protein:vir:98 272 KSLDDIKDAI-------NLNVK-PNY-EHNVAIVSQTMFAKLDKMK-----DKLG-NYLIQPDVKEK-TQQRLLG-AKIE 334 (415) T ss_pred cchhHHHHHH-------Hhhhh-hcc-CCCEEEEcHHHHHHHHHhh-----ccCC-ceeeccCcCCC-CCceecc-eeeE Confidence 0011222232 22322 123 5677899999998887631 1111 00011122211 2346765 8888 Q ss_pred eeCCCCcceEEEEEecCCCccceeEecc----ccc----cceeEEecCccccceeeeeeeecee-ecCcccccCCCccce Q lcl|NC_012740. 433 IDQYARQDYFTVGYKGDNEMDAGIYYAP----YVA----LTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSAR 503 (528) Q Consensus 433 ~D~y~~~dy~~vG~KG~~~~d~g~fyaP----Yv~----~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~ 503 (528) +.++.+.. -.|+ ..++|+- |+- ..-+...|-.+++..+....|++.. .+|=+.-.-.- ..- T Consensus 335 ~~~~~~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~-~~~ 404 (415) T protein:vir:98 335 ILPDEVLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY-DDS 404 (415) T ss_pred EecccccC-----CCCc----cEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEEE-ecc Confidence 87765321 1111 1122221 211 1112344556778888888899865 34411110000 000 Q ss_pred ecccchHHhhc Q lcl|NC_012740. 504 ITSGMLSKDSV 514 (528) Q Consensus 504 ~~~~~~~~~~a 514 (528) ..-.++..+-+ T Consensus 405 ~~~~~~~~~~~ 415 (415) T protein:vir:98 405 ERGEGDLGLEA 415 (415) T ss_pred CCCCCccccCC Confidence 01112222222 No 24 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=94.79 E-value=0.0035 Score=34.06 Aligned_cols=329 Identities=12% Similarity=0.093 Sum_probs=132.6 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchh----------------hhh---hhhhh---hhHHHHhhhccccchhh----- Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENEKLPEIATASK----------------QKL---VAKIL---ESQEADFAVDPIYKDEK----- 53 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~----------------~~~---~~~~~---enq~~~~~~~~~~~~~~----- 53 (528) |...++|+++|.-+-+. |++..+ +++ +..+. |.+++.+.+...-.... T Consensus 1 Mk~~~el~~~~~~~~~~-----~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 75 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDK-----VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEE 75 (397) T ss_pred CchHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 99999999888865432 111100 000 01110 11111111110000000 Q ss_pred --------------hhhhhccccccccccccCCccccccccccc-cccccccCcc-hh-hHHHHHHhhhhhhhceeeecC Q lcl|NC_012740. 54 --------------VVEAFGGFIAEAEVAGDHGYNASNIASGQT-TGAITNVGPA-VI-GMVRRAIPNLIAFDICGVQPM 116 (528) Q Consensus 54 --------------~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~-tg~v~~~~P~-li-~l~Rra~~~lI~~DI~GVQPm 116 (528) ...+|..+|.. ...........+++ .|.+. -|. +. .+++.+-++.+..++|.++|| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~l~~-----~~~~~~~~~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 148 (397) T protein:vir:49 76 KKPLTKSEEEVKAGFVKDFKNLVRG-----RYQNLLDSKTDASGSDAGLT--IPQDIQTAIHTLVSQYDSLQEYVNVENV 148 (397) T ss_pred ccccccchhHHHHHHHHHHHHHHhc-----chhHHHHHhhccccccCccc--ccHhHHHHHHHHHHhhhhHHhhhceeec Confidence 00001111110 00000001111221 12221 122 11 355555567788899999999 Q ss_pred CcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 117 STPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIA 196 (528) Q Consensus 117 TgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~ 196 (528) ++++|-+.-++- .+. 
.+.+.+ T Consensus 149 ~~~~~~~~~~~~--~~~--------------~~~a~~------------------------------------------- 169 (397) T protein:vir:49 149 TTLTGSRVYEKW--TDI--------------TGLANI------------------------------------------- 169 (397) T ss_pred ccCccceEEEee--ccC--------------Ccceee------------------------------------------- Confidence 998874332221 000 000000 Q ss_pred cccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccc Q lcl|NC_012740. 197 YLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKA 276 (528) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKA 276 (528) ++.|- + ........|.++.|++.|..+- . T Consensus 170 -----------------------------------v~E~~-----~----~~~~~~~~~~~i~~~~~k~~~~-------~ 198 (397) T protein:vir:49 170 -----------------------------------DDEAG-----K----IADVDDPKLSLIKYTIKRYAGI-------S 198 (397) T ss_pred -----------------------------------ecCcc-----c----cccccccceeeEEeeeeeEEee-------e Confidence 00000 0 0001123466666666666554 4 Q ss_pred cchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHH Q lcl|NC_012740. 277 RYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGE 356 (528) Q Consensus 277 EYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e 356 (528) .+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-... ...|+.++ + T Consensus 199 ~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~------------~~~~~~~~-------------d 249 (397) T protein:vir:49 199 TVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAILEAIAALP------------TKPTLTKW-------------D 249 (397) T ss_pred hhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------cccccccH-------------H Confidence 5999999985 25779999999999999999999985321110 12223222 2 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeC- Q lcl|NC_012740. 
357 SFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQ- 435 (528) Q Consensus 357 ~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~- 435 (528) ....|+..|... +.....+|++|.....|...- ...| ..-...+.+. -..++|.| ++|++.. T Consensus 250 ~i~~~~~~l~~~---------~~~~a~~vmn~~~~~~l~~lk-----d~~G-~~l~~~~~~~-~~~~~l~G-~PV~~~~~ 312 (397) T protein:vir:49 250 DIIDLEAKVDPA---------IKQTSFFLTNTSGFTALKKVK-----NALG-DYLMERDVKS-PTGYSIDG-FAVKEVAD 312 (397) T ss_pred HHHHHHHhhhhh---------hcCCCEEEEcHHHHHHHHHhh-----cCCC-ceeeccCcCC-CCCceecc-eeeEEecc Confidence 234444444321 224567899999999987631 1111 1111112211 11346765 7777522 Q ss_pred -CCC----cc-eEEEE---------EecCCCccceeEeccccccceeEEecCccccceeeeeeeecee-ecC--cc---- Q lcl|NC_012740. 436 -YAR----QD-YFTVG---------YKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INP--FA---- 493 (528) Q Consensus 436 -y~~----~d-y~~vG---------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~~---- 493 (528) ..+ .+ -+++| .++..+. =+.+|.. .+-...+-.+-...|++.. .|| |. T Consensus 313 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 382 (397) T protein:vir:49 313 RWLANGTGGAMPLYFGDLKQAVTLFDRQHMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASF 382 (397) T ss_pred cccccccCCceeEEEeeccceEEEEeecceEE----EEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEe Confidence 211 11 12222 2211111 1111110 0011222333444455443 222 11 Q ss_pred -cccCCCccceecccchHHhhc Q lcl|NC_012740. 494 -DSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 494 -~~~~~~~~~~~~~~~~~~~~a 514 (528) ...+..+ ....-.. 
T Consensus 383 ~~~~~~~~-------~~~~~~~ 397 (397) T protein:vir:49 383 KAIADQKG-------NLGSTAV 397 (397) T ss_pred ecccCCCC-------CcccccC Confidence 0011000 0000000 No 25 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=94.08 E-value=0.0054 Score=33.00 Aligned_cols=327 Identities=17% Similarity=0.137 Sum_probs=138.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhh-hhhhhhhhhHHHHhhhccccc-----hhhhhhhhccccccccccccCCcc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQ-KLVAKILESQEADFAVDPIYK-----DEKVVEAFGGFIAEAEVAGDHGYN 74 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~-~~~~~~~enq~~~~~~~~~~~-----~~~~~~~~~~~l~ea~~~~~~g~~ 74 (528) +.+.++ .|.|.-+.. ||....++ +....++|-+++.......-+ .......|..+|.. .. T Consensus 22 ~~~~~~-~e~~~~~~~-----ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--------~~ 87 (371) T protein:vir:81 22 LLAENK-IEEAKKLKE-----EIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRT--------RF 87 (371) T ss_pred HhhHHH-HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHH--------HH Confidence 222222 334544333 24332221 112222232222222211110 00111222222210 01 Q ss_pred ccccccccc-cccccccCcc-hh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccc Q lcl|NC_012740. 75 ASNIASGQT-TGAITNVGPA-VI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNA 151 (528) Q Consensus 75 ~~~~~e~t~-tg~v~~~~P~-li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt 151 (528) ...+..+++ +|.+. . |. +. .+++.+.++.+..+++.+.||++.++-+.-.+. ... 
.+ + T Consensus 88 ~~a~~~~t~~~gg~~-v-P~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~--~~~------~~---------a 148 (371) T protein:vir:81 88 RNAMSEGSNQDGGYT-V-PQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKR--SQQ------TG---------F 148 (371) T ss_pred HHhhccCCCccCcee-e-cHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee--cCC------cc---------e Confidence 122222221 12211 1 32 22 466666678889999999999887655432221 000 00 0 Q ss_pred cccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccc Q lcl|NC_012740. 152 FHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAE 231 (528) Q Consensus 152 ~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 231 (528) .+ T Consensus 149 ~~------------------------------------------------------------------------------ 150 (371) T protein:vir:81 149 VE------------------------------------------------------------------------------ 150 (371) T ss_pred ee------------------------------------------------------------------------------ Confidence 00 Q ss_pred cccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 232 IAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEIN 311 (528) Q Consensus 232 ~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEIN 311 (528) + +|.. .........|.+..+...|..+. ..+|-||.+|-. .|.++.|.+.|...|..-+| T Consensus 151 v--------~Eg~-~~~~~~~~~f~~i~~~~~k~~~~-------~~iS~ell~ds~----~~l~~~i~~~l~~a~~~~~~ 210 (371) T protein:vir:81 151 V--------AEGA-AIGEKATPQFTLLQYQVKKYAGF-------FRVTNELLNDST----EAIVNTLVRWIGDESRVTRN 210 (371) T ss_pred e--------cccc-ccccccccceeeEEeeeeEEEEe-------ehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHH Confidence 0 0100 00001123466666666666654 469999999853 46689999999999999999 Q ss_pred HHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHH Q lcl|NC_012740. 
312 REIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVV 391 (528) Q Consensus 312 Reii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va 391 (528) +.||.-... +.+.|+.+.+ ....++... ....+.....+|++|... T Consensus 211 ~~i~~g~g~-------------~~~~~~~~~~-------------~i~~~~~~~--------l~~~~~~~a~~vmn~~~~ 256 (371) T protein:vir:81 211 GLIINVLNT-------------KAKTAIADLD-------------GLKQIINVQ--------LDPVFRSTSSVIVNQDAF 256 (371) T ss_pred HHHHhhccc-------------ccccccccHH-------------HHHHHHHhh--------cchhhhcCCEEEEcHHHH Confidence 988852211 1233332222 112211110 111222345788999999 Q ss_pred HHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccc-------c Q lcl|NC_012740. 392 NILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVA-------L 464 (528) Q Consensus 392 ~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~-------~ 464 (528) ..|...- ...|. .-...+.+. -..|+|.| ++||+..+.+...-.++--+.+ ..-++|+.+-. . T Consensus 257 ~~L~~lk-----d~~g~-~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~--~~~i~~Gd~~~~~~~~~~~ 326 (371) T protein:vir:81 257 NWLDTLK-----DQNGQ-YLLQPSISS-PTGRQLLG-LPVVIVSNKVLANRVDGGTGAQ--FAPIIVGDLKEAVVMFDRQ 326 (371) T ss_pred HHHHHhh-----ccCCC-eeeecccCC-CCCceecc-eeEEEecccccCccccccccCC--cceEEEEehhceEEEEeec Confidence 8887531 11110 001111111 12467866 8899887766443222111111 12234443211 1 Q ss_pred ceeEEecCcc------ccceeeeeeeecee-ecCcccccCCCccc Q lcl|NC_012740. 465 TPLRATDPQS------FHPVLGFKTRYGIG-INPFADSKSQAPSA 502 (528) Q Consensus 465 ~~~~~~Dp~s------~qP~~~~~tRY~l~-~nP~~~~~~~~~~~ 502 (528) .+...+++.. -|=.+-...||+.. 
.||=+...-.-..+ T Consensus 327 ~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 327 RTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 2222233332 23455555666654 34411111100001 No 26 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=93.41 E-value=0.0076 Score=32.19 Aligned_cols=311 Identities=14% Similarity=0.041 Sum_probs=128.4 Q ss_pred ccccccccccccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCC Q lcl|NC_012740. 59 GGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAE 137 (528) Q Consensus 59 ~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~ 137 (528) -..|+|-..... |.+...-..++.++-| -+.+. .+++.+.+..+-..+|.+.||+++..-|.-.. . T Consensus 1 ~~~~~e~~~~~~-~~~~~~~~~~~~~~li---P~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~----~----- 67 (338) T protein:vir:78 1 MATLNELAPNTA-GSNHQGRLAHVPSDLL---PKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTV----K----- 67 (338) T ss_pred CcchHHhhhhhc-ccccccceeccccccc---chHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----c----- Confidence 112333221100 0111111111112212 22222 45666667888899999999998743332211 1 Q ss_pred cccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCc Q lcl|NC_012740. 138 HAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDD 217 (528) Q Consensus 138 ~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~ 217 (528) .+.+.+-+. 
T Consensus 68 ----------~~~a~~v~~------------------------------------------------------------- 76 (338) T protein:vir:78 68 ----------RPEVGQVGV------------------------------------------------------------- 76 (338) T ss_pred ----------Cccceeecc------------------------------------------------------------- Confidence 011100000 Q ss_pred cccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHH Q lcl|NC_012740. 218 EVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAE 297 (528) Q Consensus 218 ~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~E 297 (528) +...-.+|.... ......|.+..+...|. +-...+|-||.+|- ..|.|++ T Consensus 77 -----------------~~~~~~~Eg~~~--~~~~~~f~~v~l~~~k~-------~~~~~is~ell~ds----~~~~~~~ 126 (338) T protein:vir:78 77 -----------------GTSNEQREGGTK--PLSGTAWDTRSVAPIKL-------ATIVTVSEEFARMN----PSGLYTK 126 (338) T ss_pred -----------------cccccccccccc--cccccceeEEEEEEEEE-------EEeehhhHHHHhcC----HHHHHHH Confidence 000000110000 01112355555555444 44466899999983 3678999 Q ss_pred HHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012740. 298 LNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTG 377 (528) Q Consensus 298 LanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~ 377 (528) |.+-|...|...||..||.=.-...--+..|+.... ...+....+ . -+. ....++..+..+...|...=. T Consensus 127 i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~-~~~~~~~~~----~---~~~--~~~~~~~~~~~~~~~~~~~~~ 196 (338) T protein:vir:78 127 LQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNN-VIVNTTNVD----Y---LQT--GTTPLLDRFLDGYDLVSANTD 196 (338) T ss_pred HHHHHHHHHHHHHHHHhhcccCCCcccccccccccc-ccccccccc----c---ccc--cchhhHHHHHHHHHHhhhhcc Confidence 999999999999999998522110000111111000 000000000 0 000 012334455555555543333 Q ss_pred cCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc---------eEEEEEec Q lcl|NC_012740. 
378 RGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD---------YFTVGYKG 448 (528) Q Consensus 378 ~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG~KG 448 (528) + ..+.+|++|+....|...--+ +...|. .-...+.+.. -.++|.| ++||++.+-|.+ -+++|-. T Consensus 197 ~-~~~~~~m~~~~~~~L~~~~~l--~d~~g~-~l~~~~~~~~-~~~~l~G-~PV~~~~~ip~~~~~~~~~~~~~~~gdf- 269 (338) T protein:vir:78 197 V-DFNGWAADPRYRARLLRSQAY--RDANGN-VDPTRINLAA-SAGDLLG-LPVQFGKAVGGDLGAATDSKVRVVGGDF- 269 (338) T ss_pred c-cceEEEEchHHHHHHHHHhhh--ccCCCc-eeecccccCC-CCceeee-eeEEEccccCccccccCCcccEEEEEec- Confidence 3 577899999998877543211 111110 0011111111 1356766 799988765421 2333311 Q ss_pred CCCccceeEeccccc--------cceeEEecCcc-----cc-ceee--eeeeece-eecCcccccCCCccceecccchHH Q lcl|NC_012740. 449 DNEMDAGIYYAPYVA--------LTPLRATDPQS-----FH-PVLG--FKTRYGI-GINPFADSKSQAPSARITSGMLSK 511 (528) Q Consensus 449 ~~~~d~g~fyaPYv~--------~~~~~~~Dp~s-----~q-P~~~--~~tRY~l-~~nP~~~~~~~~~~~~~~~~~~~~ 511 (528) +-.+++.--. ..+....||.. || --++ ...|++. +.||=+ .+++.++...+ T Consensus 270 -----s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a-------~~~l~~~~~~~ 337 (338) T protein:vir:78 270 -----SQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQA-------FVKFVDDEDPD 337 (338) T ss_pred -----ceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccc-------eEEEecccCCC Confidence 0011110001 11112223321 11 1123 3567874 445511 23444444443 Q ss_pred h Q lcl|NC_012740. 512 D 512 (528) Q Consensus 512 ~ 512 (528) . 
T Consensus 338 ~ 338 (338) T protein:vir:78 338 A 338 (338) T ss_pred C Confidence 3 No 27 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=93.34 E-value=0.0079 Score=32.11 Aligned_cols=276 Identities=12% Similarity=0.078 Sum_probs=133.2 Q ss_pred CCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccc Q lcl|NC_012740. 71 HGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYS 148 (528) Q Consensus 71 ~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~E 148 (528) .|+++.....+...+.. . |.-+ .+++++..+.+-.+++-+-||++.+--+ ...+. T Consensus 1 ~g~~a~~~~~~~~~~~~--i-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-----~~~~~--------------- 57 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGS--I-PINISEQIITGVKNGSAAMKLAKAVPMTKPEEEF-----TFMSG--------------- 57 (299) T ss_pred CCcCCCcccccCCCcee--c-chhHHHHHHHHHHhcchhhhhceeeecCCCcEEE-----EEEcC--------------- Confidence 56666553322211111 2 3322 6777778888999999999998875211 11000 Q ss_pred ccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccc Q lcl|NC_012740. 149 PNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGK 228 (528) Q Consensus 149 adt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 228 (528) +.+.| T Consensus 58 ~~a~~--------------------------------------------------------------------------- 62 (299) T protein:vir:41 58 VGAFW--------------------------------------------------------------------------- 62 (299) T ss_pred Cceee--------------------------------------------------------------------------- Confidence 00000 Q ss_pred ccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHH Q lcl|NC_012740. 
229 LAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLL 308 (528) Q Consensus 229 ~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEIml 308 (528) .+| +..+++...++++++...|..+-...+|-||.+|-. .|.++.|.+.|...|.. T Consensus 63 -----------v~E---------~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~ 118 (299) T protein:vir:41 63 -----------VDE---------AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYK 118 (299) T ss_pred -----------eec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHH Confidence 011 112344455567777778877778889999999753 45689999999999999 Q ss_pred HhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEch Q lcl|NC_012740. 309 EINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASR 388 (528) Q Consensus 309 EINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~ 388 (528) .+|+.||.= +.. +.+.|++...... ... .......+.-|+++-+.+... ..+++.+||+| T Consensus 119 ~~d~a~l~G---~g~----------~~~~gil~~~~~~-~~~----~~~~~~~~~~l~~~~~~l~~~--~~~~~~~v~n~ 178 (299) T protein:vir:41 119 KFDQAVFTG---VES----------PYNWNILKSATDA-SNL----VEETANKYDDLNEAIGLIEAE--DLEPNGIATIR 178 (299) T ss_pred HHHHHHhhc---ccC----------ccccccccccccc-cee----eccccccHHHHHHHHHhhhcc--cCCcCEEEEcH Confidence 999988831 111 0122222111000 000 000001122344444555443 33577899999 Q ss_pred hHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcce----EEEEEecCCCccceeEecccccc Q lcl|NC_012740. 389 NVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDY----FTVGYKGDNEMDAGIYYAPYVAL 464 (528) Q Consensus 389 ~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----~~vG~KG~~~~d~g~fyaPYv~~ 464 (528) +....|..-. ...| ..-...+.+.. .++|.| ++|++.++.+.+= +++|-- +.+++..+-.. 
T Consensus 179 ~~~~~L~~lk-----d~~G-~~l~~~~~~~~--~~~l~G-~PV~~~~~~~~~~~~~~~~~gdf------s~~~i~~~~~~ 243 (299) T protein:vir:41 179 KQRVKYRSTK-----DGNG-MPIFNTATSNG--VDDVLG-LPIAYTPKYTFGDKDISELVGDW------NQAYYGILRGV 243 (299) T ss_pred HHHHHHHHhh-----ccCC-ceeecCCcCCC--Cceecc-eeeEEecccCCCCCceEEEEEec------ccEEEEEecCc Confidence 9999998631 1111 11111222221 246765 8999888876542 222211 01111112122 Q ss_pred ceeE--------EecCcc-----ccc-eeee--eeeeceee-cCcccccCCCccceecccchHHhhcc Q lcl|NC_012740. 465 TPLR--------ATDPQS-----FHP-VLGF--KTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVG 515 (528) Q Consensus 465 ~~~~--------~~Dp~s-----~qP-~~~~--~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~ 515 (528) .+.+ ..|++. ||- .+.| ..|++..+ ||=+...-.. +.+| T Consensus 244 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~------------~aa~ 299 (299) T protein:vir:41 244 EYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQP------------KAGN 299 (299) T ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEe------------ccCC Confidence 2221 222221 222 2333 35777653 3411111111 1122 No 28 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=93.30 E-value=0.0081 Score=32.06 Aligned_cols=362 Identities=17% Similarity=0.119 Sum_probs=142.2 Q ss_pred CcchHHHHHhhhhhhcCCc--cchhccch-------hhhhhhhh--hhhHHHHh-------hhcc--------------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEK--LPEIATAS-------KQKLVAKI--LESQEADF-------AVDP--------------- 47 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~--~~~~~~~~-------~~~~~~~~--~enq~~~~-------~~~~--------------- 47 (528) |...++|.++=.-+++... .-+..+.- .+++...+ |++|-+.+ .+.. 
T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 7777777666655543200 00000000 00111000 11111100 0000 Q ss_pred -ccchhhhhhhhcccccc-----cccc-----ccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeee Q lcl|NC_012740. 48 -IYKDEKVVEAFGGFIAE-----AEVA-----GDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQ 114 (528) Q Consensus 48 -~~~~~~~~~~~~~~l~e-----a~~~-----~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQ 114 (528) .+....-...+...+.+ .+.. --.+.+......++.+|... -|.-+ .+++.+-+..+-.+++.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~--iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVV--IPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhcccccccccc--CcHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000011111111 1100 00000111111112222221 23222 4556666788889999999 Q ss_pred cCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 115 PMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETG 194 (528) Q Consensus 115 PmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg 194 (528) ||++..+-+--.+. ... +...+ T Consensus 159 ~~~~~~~~~~~~~~--~~~---------------~~~~~----------------------------------------- 180 (415) T protein:vir:94 159 RVTNGSGKYPVVRQ--SEV---------------AALEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEee--cCC---------------cccee----------------------------------------- Confidence 99887543322220 000 00000 Q ss_pred cccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccc Q lcl|NC_012740. 
195 IAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQL 274 (528) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRAL 274 (528) ++.| ++. .......|.+..|++.|+.+ T Consensus 181 -------------------------------------v~Eg-----~~~----~~~~~~~~~~i~~~~~k~~~------- 207 (415) T protein:vir:94 181 -------------------------------------VEEL-----EEN----PELAVKPFFQLAYDINTHRG------- 207 (415) T ss_pred -------------------------------------cccc-----ccc----cccccccceeeEeeheeeee------- Confidence 0000 000 00111235555666665554 Q ss_pred cccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccce-eccccccccccchh Q lcl|NC_012740. 275 KARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGV-FDLQDPIDTRGARW 353 (528) Q Consensus 275 KAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~-~dl~~~~d~~~~r~ 353 (528) .-.+|-||.+|.- +|.+++|.+-|...|..-+|+.||.-.-...-.+..... ...++ ....... T Consensus 208 ~~~is~ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~------- 272 (415) T protein:vir:94 208 YFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAK------- 272 (415) T ss_pred echhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc----ccccccccccccc------- Confidence 3559999999864 577999999999999999999999643211111100000 00000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe Q lcl|NC_012740. 354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI 433 (528) Q Consensus 354 a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) ..+....++.. +.. ..+ +.+.+|++|.....|.... ...|. .-...+.+.. 
..++|.| ++|++ T Consensus 273 ~~~~i~~~~~~-------~~~-~~~-~~~~~vmn~~~~~~l~~lk-----d~~G~-~l~~~~~~~~-~~~~l~G-~pV~~ 335 (415) T protein:vir:94 273 SLDDIKDAINL-------NVK-PNY-EHNVAIVSQTMFAKLDKMK-----DKLGN-YLIQPDVKEK-TQQRLLG-AKIEI 335 (415) T ss_pred chHHHHHHHHh-------hhh-hcc-CCCEEEEcHHHHHHHHHhh-----ccCCC-eeeccCcCCC-CCceecc-eeeEE Confidence 11222333322 221 223 5778899999999987631 11111 0011122211 2346766 78888 Q ss_pred eCCCCcc----e-EEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeecee-ecCcccccCCCccceeccc Q lcl|NC_012740. 434 DQYARQD----Y-FTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSG 507 (528) Q Consensus 434 D~y~~~d----y-~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~ 507 (528) .+..+.. . +++|--.. . +..... ........|-.+++-.+-...|+++. .+|=+...-.- ..-..-. T Consensus 336 ~~~~~~~~~~~~~i~~gd~~~----~-~~~~~~-~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~-~~~~~~~ 408 (415) T protein:vir:94 336 LPDEVLGQKGNNTLIIGNLKD----A-IVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY-DDSERGE 408 (415) T ss_pred ecccccCCCCccEEEEEehhc----c-EEEEee-cceEEEEeccccCceEEEEEEEeccEEeccccEEEEEE-eccCCCC Confidence 7764321 1 23331000 0 000001 11122344556677777778888865 24411111000 0001111 Q ss_pred chHHhhc Q lcl|NC_012740. 508 MLSKDSV 514 (528) Q Consensus 508 ~~~~~~a 514 (528) ++..+-+ T Consensus 409 ~~~~~~~ 415 (415) T protein:vir:94 409 GDLGLEA 415 (415) T ss_pred CccccCC Confidence 2222222 No 29 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=92.56 E-value=0.011 Score=31.34 Aligned_cols=349 Identities=13% Similarity=0.115 Sum_probs=125.3 Q ss_pred CcchHHHHHhhhhhhc------------CCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhcccccccccc Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLE------------NEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVA 68 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~------------~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~ 68 (528) .-......++...-+. .+..+.+.+..++.... -.++-++++.. +.+. ..+.. .+ ..+.+.. T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~e~-~~~~-~~~~~-~~--~~~~~~~ 154 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYG-TQENFEDEVEK-LVLL-SYVME-KG--VFETEHG 154 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchh-hhhhHHHHHHH-HHHH-HHHHh-hc--cchhhhh Confidence 0001111122111110 00011111111100000 00000011100 0000 00000 00 0000000 Q ss_pred ccCCcccccccccccccccc-ccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccc Q lcl|NC_012740. 69 GDHGYNASNIASGQTTGAIT-NVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPM 146 (528) Q Consensus 69 ~~~g~~~~~~~e~t~tg~v~-~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~ 146 (528) -..-+....+++..... ..-|.+. .++.++.++.+..++|-++||+++..-++ .. .. T Consensus 155 ---~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-~~----~~------------- 213 (458) T protein:vir:10 155 ---QRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTML-VE----PD------------- 213 (458) T ss_pred ---hhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEE-Ee----cC------------- Confidence 00000111111111111 1112222 45555667778899999999988642111 10 00 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 
147 YSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 147 ~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) .+.+.|-+-++ T Consensus 214 -~~~a~~v~e~~-------------------------------------------------------------------- 224 (458) T protein:vir:10 214 -AGKATWVAAST-------------------------------------------------------------------- 224 (458) T ss_pred -Ccceeeccccc-------------------------------------------------------------------- Confidence 00000000000 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) ..-.+.. .......|.+ +++.++.-+....+|-||.+|-- .|.+++|.+-|..-| T Consensus 225 -------~~~~~~~-------~~~~~~~~~~-------i~~~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i 279 (458) T protein:vir:10 225 -------YGTDTTT-------GEEVKGALKE-------IHFSTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAH 279 (458) T ss_pred -------ccccccc-------ccccccccee-------eEeeeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHH Confidence 0000000 0011223444 44444444445679999988832 567889999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeeccccccceeccccccc------cccchhHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_012740. 307 LLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPID------TRGARWAGESFKSLIYQIDKEAAEIARQTGRGA 380 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d------~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~ 380 (528) ..-||+.||. -+ | .+.+.|++......+ ..+..-..-.+..| +++-+.+.. 
.+.+ T Consensus 280 ~~~~d~~~l~---G~------G----~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i----~~~~~~l~~--~~~~ 340 (458) T protein:vir:10 280 AVSIEEAFMT---GD------G----SGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTI----SKLRRKLGR--HGLK 340 (458) T ss_pred HHHHHHHhhc---CC------C----CCccceeeecccccccceeecccccccccccHHHH----HHHHHhhhh--hhcC Confidence 9999998873 11 1 113444443321111 00000000112222 222222322 2224 Q ss_pred CcEEEEchhHHHHhhcccccccccccc-ccccccccccCceEEEEecCceEEEeeCCCCc-----ceEEEEEecCCCccc Q lcl|NC_012740. 381 GNFVIASRNVVNILASADQGISLAMQG-AAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ-----DYFTVGYKGDNEMDA 454 (528) Q Consensus 381 gn~~v~S~~va~~L~~~g~~~~~~~~~-~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~d~ 454 (528) ...+|++|.....|...--....+-.. .......+.+ -++|.| ++|+++.+.|. +.++..++ + T Consensus 341 ~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~----~~~l~G-~pv~~~~~~p~~~~~~~~~~~~f~-~----- 409 (458) T protein:vir:10 341 LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQ----VGRIYG-LPVVVSEYFPAKANSAEFAVIVYK-D----- 409 (458) T ss_pred CCEEEEcHHHHHHHHhhcccCCceeeccccccccccCc----Cceecc-eeeEEccccccccCCcceEEEEec-c----- Confidence 667899999988886531000000000 0000011111 235765 99999988654 22222221 1 Q ss_pred eeEeccccccceeEEecCccccceeeee--eeece-eecCcccccCCCccceecccchHHhhcchh Q lcl|NC_012740. 455 GIYYAPYVALTPLRATDPQSFHPVLGFK--TRYGI-GINPFADSKSQAPSARITSGMLSKDSVGKN 517 (528) Q Consensus 455 g~fyaPYv~~~~~~~~Dp~s~qP~~~~~--tRY~l-~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~ 517 (528) +.++ ..-..+.+..||-+-...++|. .|.|+ +.+|=+. |. 
+.-.+ | T Consensus 410 ~~~~--~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~---------v~-~~~aa-----~ 458 (458) T protein:vir:10 410 NFVM--PRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGV---------VS-GTYAA-----S 458 (458) T ss_pred cEEE--EEeeceEEEeecccCCCceEEEEEEEecceEecccce---------EE-Eeecc-----C Confidence 0111 1112233445666555666666 46654 3445111 11 11111 0 No 30 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=92.34 E-value=0.012 Score=31.15 Aligned_cols=333 Identities=11% Similarity=0.086 Sum_probs=127.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhcc-------------chhhhhhhhhhhhHHH---------Hhhhccccc-------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIAT-------------ASKQKLVAKILESQEA---------DFAVDPIYK-------- 50 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~-------------~~~~~~~~~~~enq~~---------~~~~~~~~~-------- 50 (528) |.+.++|.+.|.-+=+ .+-++.. .-.+++-+.|-+.+++ ..+..+... T Consensus 1 Mk~~~el~~~~~~~~~--~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (397) T protein:vir:48 1 MKTSNELHDLWVAQGD--KVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKP 78 (397) T ss_pred CchHHHHHHHHHHHHH--HHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccc Confidence 9999999888866521 1111100 0001111111111111 000000000 Q ss_pred --------hhhhhhhhccccccccccccCCcccccccccc-ccccccccCcchh-hHHHHHHhhhhhhhceeeecCCccc Q lcl|NC_012740. 51 --------DEKVVEAFGGFIAEAEVAGDHGYNASNIASGQ-TTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPT 120 (528) Q Consensus 51 --------~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t-~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPT 120 (528) .......|..++.+... -... ....++ +.|.+. .-+.+. 
.+++.+-++..-.+++.++||++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 152 (397) T protein:vir:48 79 LTKSEEEVKAGFVKDFKNLVRGRYQ----NLLD-SKTDASGSDAGLT-IPQDIQTAIHTLVRQYDSLQEYVNVENVTTLT 152 (397) T ss_pred ccchhhHHHHHHHHHHHHHHhhhhh----HHHH-HhhccCCcccccc-ccHHHHHHHHHHHHHHHHHHhhhceeeccCCc Confidence 00001111111111100 0000 001111 112111 111111 3444445566778899999999987 Q ss_pred ceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 121 SQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQN 200 (528) Q Consensus 121 GLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~ 200 (528) |-+--.+. ... .+.+.+ T Consensus 153 ~~~~~~~~--~~~--------------~~~a~~----------------------------------------------- 169 (397) T protein:vir:48 153 GSRVYEKW--ADI--------------TGLAKL----------------------------------------------- 169 (397) T ss_pred ceEEEEee--cCC--------------Ccceee----------------------------------------------- Confidence 65443321 000 000000 Q ss_pred cccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchH Q lcl|NC_012740. 201 VTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSI 280 (528) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ 280 (528) ++.+ ++ ...+....|.+..|++.|..+- ..+|- T Consensus 170 -------------------------------v~E~-----~~----~~~~~~~~~~~v~~~~~k~~~~-------~~iS~ 202 (397) T protein:vir:48 170 -------------------------------DDEA-----GS----IGTNDDPKLYPIRYAIKRYAGI-------STVTN 202 (397) T ss_pred -------------------------------eccc-----cc----cccccccceeeEEeeheeeeee-------hhhHH Confidence 0000 00 0011123466666666666544 56999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHH Q lcl|NC_012740. 
281 EVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKS 360 (528) Q Consensus 281 ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~ 360 (528) ||.+|-. .|.+++|.+-|+..|..-+|+.|+.-.- . .. ...++.++ +-... T Consensus 203 ell~ds~----~~l~~~v~~~l~~~~~~~~d~~il~G~g---~---~~------~~~~~~~~-------------d~i~~ 253 (397) T protein:vir:48 203 SLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIA---T---LP------TKPTLTKW-------------DDIID 253 (397) T ss_pred HHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccc---c---cc------cccccccH-------------HHHHH Confidence 9999843 5779999999999999999999984221 1 00 11122111 11223 Q ss_pred HHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeC--CC- Q lcl|NC_012740. 361 LIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQ--YA- 437 (528) Q Consensus 361 L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~--y~- 437 (528) ++ +.+... +..+..+||+|.....|...- ...| ..-...+.+.. --++|.| ++|++-. .. T Consensus 254 ~~-------~~l~~~--~~~~a~~v~n~~~~~~L~~lk-----d~~G-~~i~~~~~~~~-~~~~l~G-~PV~~~~~~~~~ 316 (397) T protein:vir:48 254 LQ-------AKVDPA--IKQTSFFLTNTSGFTALKKVK-----NAFG-DYLMERDVKSP-TGYSIDG-FAVKEVADRWLA 316 (397) T ss_pred HH-------HHhhhh--hcCCCEEEECHHHHHHHHHhh-----cCCC-ceeeccCcCCC-CCceecc-ceeEEecccccC Confidence 33 333332 224678899999999997631 1111 00011121111 1246765 6766421 11 Q ss_pred ----C---------cceEEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeecee-ecC-------ccccc Q lcl|NC_012740. 438 ----R---------QDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INP-------FADSK 496 (528) Q Consensus 438 ----~---------~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP-------~~~~~ 496 (528) + .+|++++..+..+..-+-+...| -...+=.+-...||+.. .|| ++... 
T Consensus 317 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~----------~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 317 NASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGA----------FETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred CcCCCceEEEEEeccceEEEEeecceEEEEeccchhh----------hhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 1 11233333222111111000000 11111222223333221 111 01111 Q ss_pred CCCccceecccchHHhhcch Q lcl|NC_012740. 497 SQAPSARITSGMLSKDSVGK 516 (528) Q Consensus 497 ~~~~~~~~~~~~~~~~~a~~ 516 (528) .+. +. .+.- +- T Consensus 387 ~~~-~~---~~~~-----~~ 397 (397) T protein:vir:48 387 DQK-GN---LGST-----AV 397 (397) T ss_pred cCC-CC---cccc-----CC Confidence 110 00 0000 00 No 31 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=91.85 E-value=0.014 Score=30.76 Aligned_cols=352 Identities=11% Similarity=0.057 Sum_probs=129.5 Q ss_pred CcchHHHHHhhhhhhcC----------------Cccchhccchhhhhhhhh--hhhHHHHhhhc----c---c------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN----------------EKLPEIATASKQKLVAKI--LESQEADFAVD----P---I------- 48 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~----------------~~~~~~~~~~~~~~~~~~--~enq~~~~~~~----~---~------- 48 (528) +...++|.+...-+.+. +...++.... ..+.+.+ |+.+..++.+. . . T Consensus 21 ~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~-~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~ 99 (418) T protein:vir:10 21 EQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATV-DELLIKQGELQARLLEAEQKLARGGGSAELETPKTL 99 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Confidence 11112222211111110 0011110000 0000000 01110000000 0 0 Q ss_pred ---cchhhhhhhhcccccccccccc---CCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccc Q lcl|NC_012740. 
49 ---YKDEKVVEAFGGFIAEAEVAGD---HGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTS 121 (528) Q Consensus 49 ---~~~~~~~~~~~~~l~ea~~~~~---~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTG 121 (528) +.+..-...+..++.+...... .-.+......+++++.-...-|.+. .+++.+.+..+..++|.+-||++++. T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 179 (418) T protein:vir:10 100 GQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSI 179 (418) T ss_pred hHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCce Confidence 0000000111111111000000 0000011111111111111222222 45566667788889999999987742 Q ss_pred eeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 122 QIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNV 201 (528) Q Consensus 122 LIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~ 201 (528) -+ .|. .+. .+.+.| T Consensus 180 ~~--~~~--~~~--------------~~~a~~------------------------------------------------ 193 (418) T protein:vir:10 180 EY--TVE--TGF--------------TNNAAA------------------------------------------------ 193 (418) T ss_pred eE--EEE--ecC--------------CCceee------------------------------------------------ Confidence 11 110 000 000000 Q ss_pred ccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHH Q lcl|NC_012740. 
202 TAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIE 281 (528) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~E 281 (528) + +| +...++-..++++++..+|.-+-...+|-| T Consensus 194 ------------------------------v--------~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~e 226 (418) T protein:vir:10 194 ------------------------------V--------AE---------GAQKPTSDLKFNLKNQPVRTIAHLFKASRQ 226 (418) T ss_pred ------------------------------e--------cc---------CccccccccceeeEEEeeeeEEEeehhhHH Confidence 0 01 001122223445555555555556779999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHH Q lcl|NC_012740. 282 VAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSL 361 (528) Q Consensus 282 LAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L 361 (528) |.||.- |.++.|.+-|+..|..-+|+-||.= +.. -..+.|++..........+ .. .... T Consensus 227 ll~ds~-----~l~~~i~~~l~~a~~~~~d~a~l~G---~g~---------~~~p~Gi~~~~~~~~~~~~--~~--~~~~ 285 (418) T protein:vir:10 227 ILDDAP-----ALQSYIDGRARYGLQLTEEGQILKG---DGT---------GANILGILPQASAFMPSIT--LA--NATP 285 (418) T ss_pred HHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhcc---CCC---------Ccccccccccccccccccc--cc--cccc Confidence 999852 5688888888888888888888731 100 0013343322211100000 00 0011 Q ss_pred HHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcce Q lcl|NC_012740. 362 IYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDY 441 (528) Q Consensus 362 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy 441 (528) +..|..+-..+. ..+...+.+||||.....|...- + ..|. -+..+.+.. 
-.|+|.| ++|+++++.|.+- T Consensus 286 ~~~i~~~~~~~~--~~~~~~~~~v~n~~~~~~L~~lk--d---~~G~--~i~~~~~~~-~~~~l~G-~pV~~~~~~p~~~ 354 (418) T protein:vir:10 286 IDKIRLALLQAV--LAEFPATGIVLNPIDWASIELTK--D---SQGR--YIVGNPVNG-TTPRLWN-LPVVETQAMTANE 354 (418) T ss_pred HHHHHHHHHhhc--cccCCCCEEEEcHHHHHHHHHhh--c---CCCc--eeccccccC-CCceecc-eeeEEcCCCCCCc Confidence 222333333332 23446778999999998886531 1 1110 011111111 1356765 8999999988766 Q ss_pred EEEEEecCCCccceeEeccccccceeEEecCcc---ccc---eeeeeeeeceee-cCcccccCCCccceecccchHHhhc Q lcl|NC_012740. 442 FTVGYKGDNEMDAGIYYAPYVALTPLRATDPQS---FHP---VLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 442 ~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s---~qP---~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a 514 (528) +++|---. +|-=+.-..+...+|+.. |+- .+=+..|++..+ +|=+ +.-+.-....+ T Consensus 355 ~~~gd~s~-------~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a----------~~~~~~~~~~~ 417 (418) T protein:vir:10 355 FLVGAFSM-------AAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPES----------FVTGALVEQAG 417 (418) T ss_pred EEEeeccc-------eEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccc----------eEEEEeccCCC Confidence 66653210 010011122222233322 222 333445666542 2301 11111111122 Q ss_pred c Q lcl|NC_012740. 515 G 515 (528) Q Consensus 515 ~ 515 (528) | T Consensus 418 g 418 (418) T protein:vir:10 418 G 418 (418) T ss_pred C Confidence 3 No 32 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=91.23 E-value=0.017 Score=30.31 Aligned_cols=280 Identities=13% Similarity=0.081 Sum_probs=126.1 Q ss_pred hccccccccccccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCC Q lcl|NC_012740. 58 FGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLA 136 (528) Q Consensus 58 ~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s 136 (528) |. -..+++.+...++ .+.. ..-+.+. 
.+++++.++.+..+++-+-||++.+- +|+-..+ T Consensus 1 ma----------~~~~~~~~~~~t~-~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~ip~~~~- 60 (304) T protein:vir:10 1 MA----------TPTYTPGNVILSD-FKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-------KFTYLAK- 60 (304) T ss_pred Cc----------ccccccccccccC-CCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEeC- Confidence 11 1112223322111 1111 1222232 56666777788888898988877531 1111000 Q ss_pred CcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccC Q lcl|NC_012740. 137 EHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESD 216 (528) Q Consensus 137 ~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~ 216 (528) + +.+. T Consensus 61 --~---------~~a~---------------------------------------------------------------- 65 (304) T protein:vir:10 61 --G---------VGAY---------------------------------------------------------------- 65 (304) T ss_pred --C---------cceE---------------------------------------------------------------- Confidence 0 0000 Q ss_pred ccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHH Q lcl|NC_012740. 217 DEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADA 296 (528) Q Consensus 217 ~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ 296 (528) - .+| +..+++-.-+++++++..|..+-...+|-||.+|- .+|.|+ T Consensus 66 --------------~--------v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~ 110 (304) T protein:vir:10 66 --------------W--------VSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFN 110 (304) T ss_pred --------------E--------eec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHH Confidence 0 001 01123334445666666666666788999999875 477899 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 
297 ELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQT 376 (528) Q Consensus 297 ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T 376 (528) .|.+-|...|...||+.+|.=.-... ..+. ...+++.-...... ........+.-|+++-+.+...= T Consensus 111 ~i~~~l~~~ia~~~d~~~l~G~g~~~---~~~~-----~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~ 177 (304) T protein:vir:10 111 EVKPLIAEAFYKAFDQAVIFGTKSPY---NTST-----SGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDEE 177 (304) T ss_pred HHHHHHHHHHHHHHHhhheeccCCCc---cccc-----cccccccccccccc-----ccccccchHHHHHHHHHHhhhcc Confidence 99999999999999999884211000 0000 00111100000000 00011223444555555555432 Q ss_pred ccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc------------eEEE Q lcl|NC_012740. 377 GRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD------------YFTV 444 (528) Q Consensus 377 ~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~v 444 (528) .+..-+||++.....|...- ...|. .-...+ .|+|.| ++||++++.+.+ ++++ T Consensus 178 --~~~~~~v~~~~~~~~L~~lk-----d~~G~-~l~~~~------~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~ 242 (304) T protein:vir:10 178 --LDPNGVLTTRSFRSKMRNAL-----DANDR-PLFDAN------GNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARY 242 (304) T ss_pred --CCcCEEEEcHHHHHHHHHhh-----ccCCc-EeecCC------Cccccc-eeeEEecccccCCCCcEEEEEehhhEEE Confidence 24556899999999987531 11110 001111 256755 899988886433 1222 Q ss_pred EEecCCCccceeEecccccccee--EEecCc-----ccc---ceeeeeeeeceee-cCcccccCCCccceecccc Q lcl|NC_012740. 445 GYKGDNEMDAGIYYAPYVALTPL--RATDPQ-----SFH---PVLGFKTRYGIGI-NPFADSKSQAPSARITSGM 508 (528) Q Consensus 445 G~KG~~~~d~g~fyaPYv~~~~~--~~~Dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~ 508 (528) |..+..+. ....+.... 
.-.|++ -|+ =.+=+..||++.+ || + -.+++...+ T Consensus 243 ~~~~~~~i------~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~------~-a~~~l~~a~ 304 (304) T protein:vir:10 243 GILQGIEY------AISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP------E-AFATLKPTE 304 (304) T ss_pred EEecceEE------EEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc------c-ceEEEEecC Confidence 32222111 000111111 111221 122 2233456787662 33 1 134444444 No 33 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=91.23 E-value=0.017 Score=30.31 Aligned_cols=280 Identities=13% Similarity=0.081 Sum_probs=126.1 Q ss_pred hccccccccccccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCC Q lcl|NC_012740. 58 FGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLA 136 (528) Q Consensus 58 ~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s 136 (528) |. -..+++.+...++ .+.. ..-+.+. .+++++.++.+..+++-+-||++.+- +|+-..+ T Consensus 1 ma----------~~~~~~~~~~~t~-~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~ip~~~~- 60 (304) T protein:vir:94 1 MA----------TPTYTPGNVILSD-FKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-------KFTYLAK- 60 (304) T ss_pred Cc----------ccccccccccccC-CCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEeC- Confidence 11 1112223322111 1111 1222232 56666777788888898988877531 1111000 Q ss_pred CcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccC Q lcl|NC_012740. 137 EHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESD 216 (528) Q Consensus 137 ~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~ 216 (528) + +.+. 
T Consensus 61 --~---------~~a~---------------------------------------------------------------- 65 (304) T protein:vir:94 61 --G---------VGAY---------------------------------------------------------------- 65 (304) T ss_pred --C---------cceE---------------------------------------------------------------- Confidence 0 0000 Q ss_pred ccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHH Q lcl|NC_012740. 217 DEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADA 296 (528) Q Consensus 217 ~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ 296 (528) - .+| +..+++-.-+++++++..|..+-...+|-||.+|- .+|.|+ T Consensus 66 --------------~--------v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~ 110 (304) T protein:vir:94 66 --------------W--------VSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFN 110 (304) T ss_pred --------------E--------eec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHH Confidence 0 001 01123334445666666666666788999999875 477899 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 297 ELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQT 376 (528) Q Consensus 297 ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T 376 (528) .|.+-|...|...||+.+|.=.-... ..+. ...+++.-...... ........+.-|+++-+.+...= T Consensus 111 ~i~~~l~~~ia~~~d~~~l~G~g~~~---~~~~-----~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~ 177 (304) T protein:vir:94 111 EVKPLIAEAFYKAFDQAVIFGTKSPY---NTST-----SGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDEE 177 (304) T ss_pred HHHHHHHHHHHHHHHhhheeccCCCc---cccc-----cccccccccccccc-----ccccccchHHHHHHHHHHhhhcc Confidence 99999999999999999884211000 0000 00111100000000 00011223444555555555432 Q ss_pred ccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc------------eEEE Q lcl|NC_012740. 
377 GRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD------------YFTV 444 (528) Q Consensus 377 ~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~v 444 (528) .+..-+||++.....|...- ...|. .-...+ .|+|.| ++||++++.+.+ ++++ T Consensus 178 --~~~~~~v~~~~~~~~L~~lk-----d~~G~-~l~~~~------~~~l~G-~PV~~~~~~~~~~~~~~~~~gd~~~~~~ 242 (304) T protein:vir:94 178 --LDPNGVLTTRSFRSKMRNAL-----DANDR-PLFDAN------GNEIMG-LPLSYTGADVYDKKKSLALMGDWDYARY 242 (304) T ss_pred --CCcCEEEEcHHHHHHHHHhh-----ccCCc-EeecCC------Cccccc-eeeEEecccccCCCCcEEEEEehhhEEE Confidence 24556899999999987531 11110 001111 256755 899988886433 1222 Q ss_pred EEecCCCccceeEecccccccee--EEecCc-----ccc---ceeeeeeeeceee-cCcccccCCCccceecccc Q lcl|NC_012740. 445 GYKGDNEMDAGIYYAPYVALTPL--RATDPQ-----SFH---PVLGFKTRYGIGI-NPFADSKSQAPSARITSGM 508 (528) Q Consensus 445 G~KG~~~~d~g~fyaPYv~~~~~--~~~Dp~-----s~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~ 508 (528) |..+..+. ....+.... .-.|++ -|+ =.+=+..||++.+ || + -.+++...+ T Consensus 243 ~~~~~~~i------~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~------~-a~~~l~~a~ 304 (304) T protein:vir:94 243 GILQGIEY------AISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP------E-AFATLKPTE 304 (304) T ss_pred EEecceEE------EEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc------c-ceEEEEecC Confidence 32222111 000111111 111221 122 2233456787662 33 1 134444444 No 34 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=91.03 E-value=0.018 Score=30.18 Aligned_cols=270 Identities=12% Similarity=0.068 Sum_probs=118.9 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccc-cccccccccccccccCcccccCcccccccc Q lcl|NC_012740. 
146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETG-IAYLQNVTAEQVTPTKADSESDDEVVMKLM 224 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (528) |...++.- .. .-.|..-.++....+.... ....... .... .+. T Consensus 1 MA~~~T~~------------------~~--~~iPev~s~~v~~~~~~~~~~~~~~~~-~~~~-----~g~---------- 44 (272) T protein:vir:30 1 MAVGTTKM------------------AQ--MLDPEVLADMIDAEVGKAIRFAPLAEV-DTTL-----EGQ---------- 44 (272) T ss_pred CCCccccc------------------hh--eechHHHHHHHHHHHHHHhhhhccccc-cccc-----cCC---------- Confidence 00000000 00 0000000111100000000 0000000 0000 000 Q ss_pred ccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHH Q lcl|NC_012740. 225 EEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILAN 304 (528) Q Consensus 225 ~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILSt 304 (528) .|.......--....++.. +. +..+..=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+. T Consensus 45 -~G~tv~iP~~~~~~~a~~v---~e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~ 114 (272) T protein:vir:30 45 -PGTTLTVPKWDYIGDAEDV---AE--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVE 114 (272) T ss_pred -CCCEEEEEEecCCCCcccc---cC--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHH Confidence 0000111000001111111 11 1123333445677778888777666777666533 257999999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) .|..+|+++|+..+.-... .+ .+-.. 
.+-+-.+..++.++ ....+++ T Consensus 115 ~~a~~~d~~i~~~~~~a~~--------~~---~~~~t-------------~d~i~da~~~l~~~---------~~~~~~~ 161 (272) T protein:vir:30 115 AIDHKVDADVLDALSKSTQ--------TV---EATAT-------------VDGVSKALDIFNDE---------DDAETVI 161 (272) T ss_pred HHHHHHHHHHHHHhccccc--------cc---ccccC-------------HHHHHHHHHHHhcc---------CCCccEE Confidence 9999999999976532111 10 00000 12222333333322 2357899 Q ss_pred EEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccccc Q lcl|NC_012740. 385 IASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVAL 464 (528) Q Consensus 385 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~ 464 (528) ||+|.++..|............ ...+ +....-.+|++.| ++|+++++.|.+=+++.-+|.- +++-.. +. T Consensus 162 vv~p~~~~~L~k~~~~~~~~~~---~~~~-~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~--~~ 230 (272) T protein:vir:30 162 VMNPADASTLRLDAAKEWLGAT---EVGA-NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKR--NT 230 (272) T ss_pred EEcHHHHHHHHHhccccccccc---cccc-cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecC--Cc Confidence 9999999999765433222111 1111 1111123678866 8999999998655444333321 111121 12 Q ss_pred ceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhhcchh Q lcl|NC_012740. 465 TPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSVGKN 517 (528) Q Consensus 465 ~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a~~~ 517 (528) ....--|+.+++=.+-..-|||+. .||=. -.+++-. 
.|+|- T Consensus 231 ~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~-------vv~~t~~-----~a~~~ 272 (272) T protein:vir:30 231 MVETDRDITKAINQIVANKHYGVYLYKAEK-------AVKITLK-----DAAKK 272 (272) T ss_pred eeeeccccccceeEEEEEEEEEEEEEcCCc-------eEEEEec-----ccccC Confidence 223345788888888888888875 34410 1122211 23333 No 35 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=91.03 E-value=0.018 Score=30.18 Aligned_cols=270 Identities=12% Similarity=0.068 Sum_probs=118.9 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccc-cccccccccccccccCcccccCcccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETG-IAYLQNVTAEQVTPTKADSESDDEVVMKLM 224 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (528) |...++.- .. .-.|..-.++....+.... ....... .... .+. T Consensus 1 MA~~~T~~------------------~~--~~iPev~s~~v~~~~~~~~~~~~~~~~-~~~~-----~g~---------- 44 (272) T protein:vir:98 1 MAVGTTKM------------------AQ--MLDPEVLADMIDAEVGKAIRFAPLAEV-DTTL-----EGQ---------- 44 (272) T ss_pred CCCccccc------------------hh--eechHHHHHHHHHHHHHHhhhhccccc-cccc-----cCC---------- Confidence 00000000 00 0000000111100000000 0000000 0000 000 Q ss_pred ccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHH Q lcl|NC_012740. 225 EEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILAN 304 (528) Q Consensus 225 ~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILSt 304 (528) .|.......--....++.. +. +..+..=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+. 
T Consensus 45 -~G~tv~iP~~~~~~~a~~v---~e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~ 114 (272) T protein:vir:98 45 -PGTTLTVPKWDYIGDAEDV---AE--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVE 114 (272) T ss_pred -CCCEEEEEEecCCCCcccc---cC--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHH Confidence 0000111000001111111 11 1123333445677778888777666777666533 257999999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) .|..+|+++|+..+.-... .+ .+-.. .+-+-.+..++.++ ....+++ T Consensus 115 ~~a~~~d~~i~~~~~~a~~--------~~---~~~~t-------------~d~i~da~~~l~~~---------~~~~~~~ 161 (272) T protein:vir:98 115 AIDHKVDADVLDALSKSTQ--------TV---EATAT-------------VDGVSKALDIFNDE---------DDAETVI 161 (272) T ss_pred HHHHHHHHHHHHHhccccc--------cc---ccccC-------------HHHHHHHHHHHhcc---------CCCccEE Confidence 9999999999976532111 10 00000 12222333333322 2357899 Q ss_pred EEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccccc Q lcl|NC_012740. 385 IASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVAL 464 (528) Q Consensus 385 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~ 464 (528) ||+|.++..|............ ...+ +....-.+|++.| ++|+++++.|.+=+++.-+|.- +++-.. +. T Consensus 162 vv~p~~~~~L~k~~~~~~~~~~---~~~~-~~~~~g~ig~i~G-~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~--~~ 230 (272) T protein:vir:98 162 VMNPADASTLRLDAAKEWLGAT---EVGA-NRVVSGVYGEVLG-VQIVRSRKCPKGTAYMVRKGAL----RIMLKR--NT 230 (272) T ss_pred EEcHHHHHHHHHhccccccccc---cccc-cccccccchhhcC-eeEEEcCCCCcceEEEEcCCeE----EEEecC--Cc Confidence 9999999999765433222111 1111 1111123678866 8999999998655444333321 111121 12 Q ss_pred ceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhhcchh Q lcl|NC_012740. 
465 TPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSVGKN 517 (528) Q Consensus 465 ~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a~~~ 517 (528) ....--|+.+++=.+-..-|||+. .||=. -.+++-. .|+|- T Consensus 231 ~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~-------vv~~t~~-----~a~~~ 272 (272) T protein:vir:98 231 MVETDRDITKAINQIVANKHYGVYLYKAEK-------AVKITLK-----DAAKK 272 (272) T ss_pred eeeeccccccceeEEEEEEEEEEEEEcCCc-------eEEEEec-----ccccC Confidence 223345788888888888888875 34410 1122211 23333 No 36 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=91.02 E-value=0.018 Score=30.17 Aligned_cols=328 Identities=13% Similarity=0.109 Sum_probs=132.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhccc----------------hhhhhhhhhhh------hHHHHhhhccc---------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATA----------------SKQKLVAKILE------SQEADFAVDPI---------- 48 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~----------------~~~~~~~~~~e------nq~~~~~~~~~---------- 48 (528) |.+.++|.+.|.-+.+. +.+. --+++.+.|-+ -+++.+.+.+. T Consensus 1 Mk~~~eL~~~~~~~~~~-----~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDK-----VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEE 75 (397) T ss_pred CchHHHHHHHHHHHHHH-----HHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 99999999999887763 1111 00111122111 11111111100 Q ss_pred ----cch-----hhhhhhhccccccccccccCCccc-cccccccc-cccccccCcchh--hHHHHHHhhhhhhhceeeec Q lcl|NC_012740. 49 ----YKD-----EKVVEAFGGFIAEAEVAGDHGYNA-SNIASGQT-TGAITNVGPAVI--GMVRRAIPNLIAFDICGVQP 115 (528) Q Consensus 49 ----~~~-----~~~~~~~~~~l~ea~~~~~~g~~~-~~~~e~t~-tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQP 115 (528) .+. .....+|..+|.. + ..+. .....+++ .|.+. . 
|.-+ .+++.+-++..-.+++.|+| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~l~~----~--~~~~~~~~~~~t~~~gg~~-i-P~~~~~~ii~~~~~~~~l~~~~~~~~ 147 (397) T protein:vir:49 76 KKPLTKNEEEVKANFVKDFKNLVRG----R--YQNLLDSKTDGSGSDAGLT-I-PQDIRTAINTLVRQFDSLQEYVNVEN 147 (397) T ss_pred cccccchhhHHHHHHHHHHHHHhhc----c--hhhHHHhhhccCCccCcce-e-cHHHHHHHHHHHHhhhhHhhhcceee Confidence 000 0001111111110 0 0011 01111111 12111 1 2222 35555666777889999999 Q ss_pred CCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 116 MSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGI 195 (528) Q Consensus 116 mTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~ 195 (528) |++.+|-+--.+ .... .+.+.| T Consensus 148 ~~~~~~~~~~~~--~~~~--------------~~~a~~------------------------------------------ 169 (397) T protein:vir:49 148 VTTLTGSRVYEK--WADI--------------TGLAKL------------------------------------------ 169 (397) T ss_pred ccCCcceEEEEe--eccC--------------Ccceee------------------------------------------ Confidence 998865422111 0000 000000 Q ss_pred ccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccccc Q lcl|NC_012740. 196 AYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLK 275 (528) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALK 275 (528) ++.+- + ...+....|.+..|++.|..+ . T Consensus 170 ------------------------------------v~E~~-----~----~~~~~~~~~~~v~~~~~k~~~-------~ 197 (397) T protein:vir:49 170 ------------------------------------DDEGG-----Q----IGQNDDPKLSLIRYAIKRYAG-------I 197 (397) T ss_pred ------------------------------------ecccc-----c----cccccccceeeeEeeeeeeEe-------e Confidence 00000 0 000111235555555555544 4 Q ss_pred ccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHH Q lcl|NC_012740. 
276 ARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAG 355 (528) Q Consensus 276 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~ 355 (528) ..+|-||.+|. .+|.+++|.+-|+..|..-+|+.||.=. .. | ....+++++ T Consensus 198 ~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~ail~G~---g~----~-----~~~~~~~~~------------- 248 (397) T protein:vir:49 198 STVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAILEAI---GT----L-----PNKPTLAKW------------- 248 (397) T ss_pred hhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHHhcc---cc----c-----cccccccCH------------- Confidence 66999999985 3577999999999999999999998321 11 0 011222222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeC Q lcl|NC_012740. 356 ESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQ 435 (528) Q Consensus 356 e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~ 435 (528) +-...|+..+ . +.+.....+|++|.....|.... ...|. .-...+.+.. ..++|.| ++|++.. T Consensus 249 d~i~~~~~~l-------~--~~~~~~a~~v~n~~~~~~l~~lk-----d~~g~-~l~~~~~~~g-~~~~l~G-~pV~~~~ 311 (397) T protein:vir:49 249 DDIIDLQAKV-------D--PAIKQTSLFLTNTSGFTALKKVK-----NAMGD-YLMERDVKSP-TGYSIDG-FVVKEIS 311 (397) T ss_pred HHHHHHHHhh-------h--hhhcCCCEEEEcHHHHHHHHHhh-----ccCCc-eeecccccCC-CCceecc-eeeEEec Confidence 1122333332 2 22335678899999999887641 11110 0011111111 1246766 6666422 Q ss_pred --CCC-----cceEEEE---------EecCCCccceeEeccccccceeEEecCccccceeeeeeeeceee-cC------- Q lcl|NC_012740. 
436 --YAR-----QDYFTVG---------YKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI-NP------- 491 (528) Q Consensus 436 --y~~-----~dy~~vG---------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP------- 491 (528) ..+ ..-+++| ..+.-+ +-..||.- .+-...+-.+-...|++..+ +| T Consensus 312 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~ 381 (397) T protein:vir:49 312 DRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPAS 381 (397) T ss_pred ccccccccCCceeEEEeeccceEEEEeecccE----EEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEE Confidence 211 1112222 221111 11122210 00112233334445554432 22 Q ss_pred cccccCCCccceecccchHHhhcch Q lcl|NC_012740. 492 FADSKSQAPSARITSGMLSKDSVGK 516 (528) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~a~~ 516 (528) |+...++++.... .|. T Consensus 382 ~~~~~~~~~~~~~---------~~~ 397 (397) T protein:vir:49 382 FKAIADQKAKLST---------AGA 397 (397) T ss_pred ecccccccCcccc---------cCC Confidence 1112222211111 111 No 37 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=90.68 E-value=0.02 Score=29.95 Aligned_cols=357 Identities=17% Similarity=0.111 Sum_probs=143.4 Q ss_pred CcchHHHHHhhhhhhcC--Cccchhccch-------hhhhhhhh--hhhHHHHh-------hhcc--------------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN--EKLPEIATAS-------KQKLVAKI--LESQEADF-------AVDP--------------- 47 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~--~~~~~~~~~~-------~~~~~~~~--~enq~~~~-------~~~~--------------- 47 (528) |...++|.++=.-+.+. +.+-++++.- .+++...+ |+.|-+.+ .+.. 
T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 77777666665555432 0000010000 00111000 11111111 0000 Q ss_pred -ccchhhhhhhhccccccccc----------cccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeee Q lcl|NC_012740. 48 -IYKDEKVVEAFGGFIAEAEV----------AGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQ 114 (528) Q Consensus 48 -~~~~~~~~~~~~~~l~ea~~----------~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQ 114 (528) ...+..-....+..+.+... .-..+.+......++..|.+ --|..+ .+++.+.+...-.+++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~--~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc--cccHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000000000000000 00000000000111112211 123332 4666677788889999999 Q ss_pred cCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 115 PMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETG 194 (528) Q Consensus 115 PmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg 194 (528) ||+++++-+.-.+.. . + +...+ T Consensus 159 ~~~~~~~~~~~~~~~--~------~---------~~~~~----------------------------------------- 180 (415) T protein:vir:47 159 RVTNGSGKYPVVRQS--E------V---------AALEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEEec--C------C---------cceee----------------------------------------- Confidence 999987544222210 0 0 00000 Q ss_pred cccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccce-eEEeeEEEEEeccc Q lcl|NC_012740. 
195 IAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMS-MRIDKQVVEAKSRQ 273 (528) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMa-FSIEK~TVTAKSRA 273 (528) + +| +...++.+ -++++++..++..+ T Consensus 181 -------------------------------------v--------~E---------g~~~~~~~~~~~~~v~~~~~k~~ 206 (415) T protein:vir:47 181 -------------------------------------V--------EE---------LEENPELAVKPFFQLAYDINTHR 206 (415) T ss_pred -------------------------------------c--------cc---------ccccccccccceeeEEeeeeeeE Confidence 0 01 01122222 23445555555555 Q ss_pred ccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchh Q lcl|NC_012740. 274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARW 353 (528) Q Consensus 274 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~ 353 (528) -...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+.-... . ......... +.- T Consensus 207 ~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~--~-~~~~~~~~~------~~~- 272 (415) T protein:vir:47 207 GYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF--E-KEGKKLEVK------KAK- 272 (415) T ss_pred eeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccc--c-cccceeccc------ccc- Confidence 5567999999984 3577899999999999999999998533211111110000 0 000010000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe Q lcl|NC_012740. 354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI 433 (528) Q Consensus 354 a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) ..+-...|+..+.. .+.+.+.+|++|.....|.... ...|- .-...+.+.. 
..++|.| ++|++ T Consensus 273 ~~~~i~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~lk-----d~~G~-~i~~~~~~~~-~~~~l~G-~pV~~ 335 (415) T protein:vir:47 273 SLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDKMK-----DKLGN-YLIQPDVKEK-TQQRLLG-AKIEI 335 (415) T ss_pred chHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHHhh-----ccCCC-eeeccCcCCC-CCccccc-eeeEE Confidence 11222333333332 2235778999999998887531 11110 0011122211 1346766 78887 Q ss_pred eCCCCcceEEEEEecCCCccceeEecccc--------ccceeEEecCccccceeeeeeeecee-ecC--cccccCCCccc Q lcl|NC_012740. 434 DQYARQDYFTVGYKGDNEMDAGIYYAPYV--------ALTPLRATDPQSFHPVLGFKTRYGIG-INP--FADSKSQAPSA 502 (528) Q Consensus 434 D~y~~~dy~~vG~KG~~~~d~g~fyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~~~~~~~~~~~ 502 (528) .++.+. |-.| +..++|+.|- ........|-.+++-.+-...|++.. .+| |....-. + T Consensus 336 ~~~~~~-----~~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~---~ 403 (415) T protein:vir:47 336 LPDEVL-----GQKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYD---D 403 (415) T ss_pred eccccc-----cCCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEee---c Confidence 765442 1111 1112222211 11122334556677777788898865 344 1110000 0 Q ss_pred eecccchHHhhc Q lcl|NC_012740. 503 RITSGMLSKDSV 514 (528) Q Consensus 503 ~~~~~~~~~~~a 514 (528) -..-.++..+-+ T Consensus 404 ~~~~~~~~~~~~ 415 (415) T protein:vir:47 404 SERGEGDLGLEA 415 (415) T ss_pred cCCCCCCccCCC Confidence 001112222222 No 38 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=90.68 E-value=0.02 Score=29.95 Aligned_cols=357 Identities=17% Similarity=0.111 Sum_probs=143.4 Q ss_pred CcchHHHHHhhhhhhcC--Cccchhccch-------hhhhhhhh--hhhHHHHh-------hhcc--------------- Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLEN--EKLPEIATAS-------KQKLVAKI--LESQEADF-------AVDP--------------- 47 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~--~~~~~~~~~~-------~~~~~~~~--~enq~~~~-------~~~~--------------- 47 (528) |...++|.++=.-+.+. +.+-++++.- .+++...+ |+.|-+.+ .+.. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 77777666665555432 0000010000 00111000 11111111 0000 Q ss_pred -ccchhhhhhhhccccccccc----------cccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeee Q lcl|NC_012740. 48 -IYKDEKVVEAFGGFIAEAEV----------AGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQ 114 (528) Q Consensus 48 -~~~~~~~~~~~~~~l~ea~~----------~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQ 114 (528) ...+..-....+..+.+... .-..+.+......++..|.+ --|..+ .+++.+.+...-.+++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~--~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc--cccHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000000000000000 00000000000111112211 123332 4666677788889999999 Q ss_pred cCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 115 PMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETG 194 (528) Q Consensus 115 PmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg 194 (528) ||+++++-+.-.+.. . 
+ +...+ T Consensus 159 ~~~~~~~~~~~~~~~--~------~---------~~~~~----------------------------------------- 180 (415) T protein:vir:46 159 RVTNGSGKYPVVRQS--E------V---------AALEK----------------------------------------- 180 (415) T ss_pred eccCCceeEEEEEec--C------C---------cceee----------------------------------------- Confidence 999987544222210 0 0 00000 Q ss_pred cccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccce-eEEeeEEEEEeccc Q lcl|NC_012740. 195 IAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMS-MRIDKQVVEAKSRQ 273 (528) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMa-FSIEK~TVTAKSRA 273 (528) + +| +...++.+ -++++++..++..+ T Consensus 181 -------------------------------------v--------~E---------g~~~~~~~~~~~~~v~~~~~k~~ 206 (415) T protein:vir:46 181 -------------------------------------V--------EE---------LEENPELAVKPFFQLAYDINTHR 206 (415) T ss_pred -------------------------------------c--------cc---------ccccccccccceeeEEeeeeeeE Confidence 0 01 01122222 23445555555555 Q ss_pred ccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchh Q lcl|NC_012740. 274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARW 353 (528) Q Consensus 274 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~ 353 (528) -...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-...-.+.-... . ......... +.- T Consensus 207 ~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~--~-~~~~~~~~~------~~~- 272 (415) T protein:vir:46 207 GYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF--E-KEGKKLEVK------KAK- 272 (415) T ss_pred eeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccc--c-cccceeccc------ccc- Confidence 5567999999984 3577899999999999999999998533211111110000 0 000010000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe Q lcl|NC_012740. 
354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI 433 (528) Q Consensus 354 a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) ..+-...|+..+.. .+.+.+.+|++|.....|.... ...|- .-...+.+.. ..++|.| ++|++ T Consensus 273 ~~~~i~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~lk-----d~~G~-~i~~~~~~~~-~~~~l~G-~pV~~ 335 (415) T protein:vir:46 273 SLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDKMK-----DKLGN-YLIQPDVKEK-TQQRLLG-AKIEI 335 (415) T ss_pred chHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHHhh-----ccCCC-eeeccCcCCC-CCccccc-eeeEE Confidence 11222333333332 2235778999999998887531 11110 0011122211 1346766 78887 Q ss_pred eCCCCcceEEEEEecCCCccceeEecccc--------ccceeEEecCccccceeeeeeeecee-ecC--cccccCCCccc Q lcl|NC_012740. 434 DQYARQDYFTVGYKGDNEMDAGIYYAPYV--------ALTPLRATDPQSFHPVLGFKTRYGIG-INP--FADSKSQAPSA 502 (528) Q Consensus 434 D~y~~~dy~~vG~KG~~~~d~g~fyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~~~~~~~~~~~ 502 (528) .++.+. |-.| +..++|+.|- ........|-.+++-.+-...|++.. .+| |....-. + T Consensus 336 ~~~~~~-----~~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~---~ 403 (415) T protein:vir:46 336 LPDEVL-----GQKG----NNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYD---D 403 (415) T ss_pred eccccc-----cCCC----ccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEee---c Confidence 765442 1111 1112222211 11122334556677777788898865 344 1110000 0 Q ss_pred eecccchHHhhc Q lcl|NC_012740. 
503 RITSGMLSKDSV 514 (528) Q Consensus 503 ~~~~~~~~~~~a 514 (528) -..-.++..+-+ T Consensus 404 ~~~~~~~~~~~~ 415 (415) T protein:vir:46 404 SERGEGDLGLEA 415 (415) T ss_pred cCCCCCCccCCC Confidence 001112222222 No 39 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=90.52 E-value=0.02 Score=29.94 Aligned_cols=344 Identities=11% Similarity=0.094 Sum_probs=123.5 Q ss_pred Cc---------chHHHHHhhhhhhcCCccchhcc----------------------------------chhhhhhhhhhh Q lcl|NC_012740. 1 MK---------TTKELMEKWSPLLENEKLPEIAT----------------------------------ASKQKLVAKILE 37 (528) Q Consensus 1 ~~---------~~~~l~~kw~p~l~~~~~~~~~~----------------------------------~~~~~~~~~~~e 37 (528) +. ..++|.++..-|-+. +-++.+ ..+|+...+.+. T Consensus 30 ~~~ee~~~~~~e~~~l~~~~~~l~~~--i~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 107 (434) T protein:vir:62 30 VRSEELAAVKAEVEQLTKEIQTISEE--LAKLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIA 107 (434) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHH Confidence 11 111222222222110 000000 000000000000 Q ss_pred hHHHHhhhccccchhhhhhhhccccccccccccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeec Q lcl|NC_012740. 38 SQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQP 115 (528) Q Consensus 38 nq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQP 115 (528) +..... .+....+.....+|..+|..-. .-.....+..+++.|.+. 
=|.-+ .+++..-+..+...++-|.| T Consensus 108 ~~~~~~-~~~~~~~~e~r~a~~~~l~~~~----~~~e~~a~~~~t~~GG~l--vP~~~~~~Ii~~l~~~~~i~~~~~~~~ 180 (434) T protein:vir:62 108 AALSTK-GHRTNKETEIRSVFANYIVGNI----DEKEARALGLVTGNGSVT--IPDFLSKEIITYAQEENFLRRLGTGVK 180 (434) T ss_pred hhhhhc-cccchHHHHHHHHHHHHhcccc----chhhhhhhccccccccee--cchhhHHHHHHhhhhhhhhhhhcceec Confidence 000000 0000011111122222221100 000111111112112211 13333 35555566777788888888 Q ss_pred CCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 116 MSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGI 195 (528) Q Consensus 116 mTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~ 195 (528) +++..- |- ++.... .+.+ T Consensus 181 ~~~~~~--~p---~~~~~~---------------~a~~------------------------------------------ 198 (434) T protein:vir:62 181 TKENIK--YP---VLVKKA---------------EAQG------------------------------------------ 198 (434) T ss_pred cCCceE--EE---EEecCC---------------cccc------------------------------------------ Confidence 765310 00 000000 0000 Q ss_pred ccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccccc Q lcl|NC_012740. 196 AYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLK 275 (528) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALK 275 (528) . ...+| +...++-..++++++..+|.-+-. T Consensus 199 ---------------------------------~--------~~~~e---------~~~~~~~~~~f~~v~~~~~k~~~~ 228 (434) T protein:vir:62 199 ---------------------------------H--------KNERT---------NNEMPETDIEFDEIELSPTEFDAL 228 (434) T ss_pred ---------------------------------e--------ecccc---------cccccccccceeeEEeeheeeEee Confidence 0 00000 001111122445555555555556 Q ss_pred ccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccc-cccccchhH Q lcl|NC_012740. 
276 ARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDP-IDTRGARWA 354 (528) Q Consensus 276 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~-~d~~~~r~a 354 (528) ..+|-||.+|- .+|.+++|.+-|+..|..-+++.||.= + |..+ .+.|++.-... ....+. T Consensus 229 ~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~G---~---G~~~------~~~g~~~~~~~~~~~~~~--- 289 (434) T protein:vir:62 229 ATVTKKLLART----GLPIEQIVMDELKKAYVRKETQYMVNG---D---EANN------INDGALAKKAVEFKTDEK--- 289 (434) T ss_pred hhhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhcc---C---CCCc------cccceeeccccccccccc--- Confidence 77999999995 467799999999999999999999941 0 0000 01111110000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccc----cccccccCceEEEEecCceE Q lcl|NC_012740. 355 GESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQ----GLNTDTTKAVFAGVLAGKYK 430 (528) Q Consensus 355 ~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~----~~~~d~~~~~~~G~l~~~~~ 430 (528) -.+..| -++-..+...-+ +.-..|+++.....|... +...|-.- ....+.++ .+|.| ++ T Consensus 290 -~~~d~l----~~l~~~l~~~~~--~~a~~v~n~~~~~~L~~l-----kd~~G~~l~~~~~~~~~g~~----~tl~G-~p 352 (434) T protein:vir:62 290 -NLYDAL----VKMKNTPVKEVR--KKARWVLNTAALTKIETM-----KTDDGFPLLRPFNQAEGGIG----YTLLG-FP 352 (434) T ss_pred -chhhHH----HHHHhhcchhhh--cCCEEEEcHHHHHHHHHh-----hccCCCEeeccCCCccCCCC----ceecc-ee Confidence 011222 223333332222 233558899998888753 11111100 00011111 24655 88 Q ss_pred EEeeCCCCcce------EEEEEecCCCccceeEecccc-ccceeEEecCc--cccceeeeeeeec-eeec-CcccccCCC Q lcl|NC_012740. 431 VFIDQYARQDY------FTVGYKGDNEMDAGIYYAPYV-ALTPLRATDPQ--SFHPVLGFKTRYG-IGIN-PFADSKSQA 499 (528) Q Consensus 431 vy~D~y~~~dy------~~vG~KG~~~~d~g~fyaPYv-~~~~~~~~Dp~--s~qP~~~~~tRY~-l~~n-P~~~~~~~~ 499 (528) |+++.+.+..- |.+| +-.. . +..... 
...+.+..+.- .-|=.+..+.|++ ..++ |++...- T Consensus 353 V~~~~~~~~~~~~~~~~i~~G---dfs~--~-~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~-- 424 (434) T protein:vir:62 353 VEEEDAIDIPDSPDTPVFYFG---DFSK--F-YIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVY-- 424 (434) T ss_pred eEEecCccCccCCCceEEEEe---eccc--e-EEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEE-- Confidence 98887754221 2222 1110 0 011111 12233333332 2222344556774 3343 7664322 Q ss_pred ccceecccchHHhhcc Q lcl|NC_012740. 500 PSARITSGMLSKDSVG 515 (528) Q Consensus 500 ~~~~~~~~~~~~~~a~ 515 (528) .+.-.....+ T Consensus 425 ------~~~~~~~~~~ 434 (434) T protein:vir:62 425 ------KYVLKAPTGA 434 (434) T ss_pred ------EEEeccCCCC Confidence 1111111222 No 40 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=89.71 E-value=0.025 Score=29.39 Aligned_cols=273 Identities=9% Similarity=0.028 Sum_probs=117.1 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....|.- .. .-.|-....+....+... ....+......... .. T Consensus 1 ma~~~T~~------------------~d--~i~Pev~s~~v~~~~~~~-------~~~~~~~~~~~~l~---------g~ 44 (274) T protein:vir:96 1 MAQGTTKV------------------SN--LIVPEVLAPMMQAELDKK-------LRFAQFADIDSTLV---------GQ 44 (274) T ss_pred CCccccch------------------hh--hhhhHHHHHHHHHHHHhh-------hhhccccccccccc---------CC Confidence 00000000 00 000111111110001000 00000000000000 00 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 
226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) .|...+...=-.+..+|. +......++.++.++= .+++-|-|+-.=+++=|. ++..+-|.-.+..+-++.. T Consensus 45 ~G~tv~ip~~~~~g~~~~---~~~g~~i~~~~it~~~--~~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~~ 115 (274) T protein:vir:96 45 PGDTLTFPAFTYSGDAQV---IAEGEKIPVDQIGTSK--REAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGLA 115 (274) T ss_pred CCCEEEEEeeccCCCccc---cCCCCcCchhhcccce--eEEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHHH Confidence 111111111001112221 1112233455554443 344445554222333222 2334678999999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) |+.+++++|+..+..... ++.+ .. - ..+.+-.+..++.++. ...+++| T Consensus 116 ~a~~~d~~i~~~l~~a~~--------~~~~--~~------------~-~~d~i~dA~~~l~d~~---------~~~~~iv 163 (274) T protein:vir:96 116 IANKVDNDVLEALKGATL--------TVEA--DI------------T-KLDGLQTAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHhcCCC--------CcCc--cc------------c-cHHHHHHHHHHhcccC---------CCceEEE Confidence 999999999987743221 1100 00 1 1333344444444321 2578999 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) |+|.+++.|..-............. .....-.+|.+.| ++||+|...|..=..+-=+|.-. |+.. -+.. 
T Consensus 164 v~p~~~~~L~k~~~~~f~~~~~~g~----~~~~~g~ig~~~G-~~Vi~s~~~p~~t~~l~~~gA~~-----~~~~-~~~~ 232 (274) T protein:vir:96 164 VNPLDAGGLRTSASDNFTRPTQLGD----NIIVKGAFGEALG-AVIVRSNKLNKGEALLAKKGAVK-----LITK-RDFF 232 (274) T ss_pred eCHHHHHHHHhcccccccccccccc----cceeecccceecC-eeEEEcCCCCcceEEEEeCccee-----eeec-CCcc Confidence 9999999997754333322211111 1112234888865 99999999875432222122211 1111 1222 Q ss_pred eeEEecCccccceeeeeeeeceee-cCcccccCCCccceecccchHHhhcchhhh Q lcl|NC_012740. 466 PLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVGKNAY 519 (528) Q Consensus 466 ~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 519 (528) .-.--||.+++-.|-...+||+.. ||= .-..++.+.-- -.| T Consensus 233 vE~~Rd~~~~~d~i~~~~~yg~~~~~~~-------~vv~~t~~~~~------~~~ 274 (274) T protein:vir:96 233 LEKDRDASRKSTALYSDKHYVAYLYDES-------KVVKITKGAGD------EVM 274 (274) T ss_pred cccccchhhcccEEEEeeEEEEEEEcCc-------cEEEEEcCccc------ccC Confidence 334568999999999999999864 550 01122211111 111 No 41 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=89.33 E-value=0.027 Score=29.19 Aligned_cols=342 Identities=14% Similarity=0.055 Sum_probs=133.3 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchh--hhhhhhhhhhHHHHhhhcc--------ccchhhhhhhhcccccccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASK--QKLVAKILESQEADFAVDP--------IYKDEKVVEAFGGFIAEAEVAGD 70 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~--~~~~~~~~enq~~~~~~~~--------~~~~~~~~~~~~~~l~ea~~~~~ 70 (528) --.+++..+++.-+... +....+ +.+-. .++.-++...... ......-...+-.+..+..... 
T Consensus 30 ~~~~~e~~~~~~~~~~e-----~~~l~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 102 (390) T protein:vir:10 30 GELNASARSKVDELFAT-----VGNLSAEVQAARQ-RVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARA- 102 (390) T ss_pred cccCHHHHHHHHHHHHH-----HHHHHHHHHHHHH-HHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhh- Confidence 12334555666554321 211100 00000 1111111000000 0000000001100000000000 Q ss_pred CCcccccc---c-cccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccc Q lcl|NC_012740. 71 HGYNASNI---A-SGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHP 145 (528) Q Consensus 71 ~g~~~~~~---~-e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n 145 (528) ..+.+.. + .++++.+-.-.-|.++ .++.++-.+..-.++|.+.||++++.-+. + ..+.. T Consensus 103 -~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~----------- 166 (390) T protein:vir:10 103 -TMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV--Q--ETGFV----------- 166 (390) T ss_pred -hhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE--E--EecCC----------- Confidence 0000000 0 0011111111223333 44555555666778899999987642111 0 00000 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) +.+.+ T Consensus 167 ---~~a~~------------------------------------------------------------------------ 171 (390) T protein:vir:10 167 ---NNAAI------------------------------------------------------------------------ 171 (390) T ss_pred ---cceee------------------------------------------------------------------------ Confidence 00000 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 
226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) + +| +...++-..+++++++.+|..+....+|-||.||-- |.++.|.+-|+.. T Consensus 172 ------v--------~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~l~~~ 223 (390) T protein:vir:10 172 ------V--------AE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRG 223 (390) T ss_pred ------e--------cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-----HHHHHHHHHHHHH Confidence 0 01 011233334455666666666667889999999852 4689999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) |...||+.||.= + |. ...+.|++......-...+ .+. ..++..+..+-..+.. .+...+.+| T Consensus 224 ~~~~~~~~il~G---~---G~------~~~p~Gi~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~l~~--~~~~~~~~v 285 (390) T protein:vir:10 224 LKVKEDAEILRG---T---GA------NDGLLGLIPQATTYAAPTT-IAG---ATRVDQLRLAMLQASL--AEYPASGIV 285 (390) T ss_pred HHHHHHHHHhhc---C---CC------Ccccccccccccccccccc-ccc---cchHHHHHHHHHhhcc--ccCCCCEEE Confidence 999999998831 1 00 0123444433211100000 000 1112222222233322 233577889 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) ++|.....|.... ...|.. -...+.... .++|.| ++|++++..|..-+++|--- .+++.+...-+. 
T Consensus 286 ~n~~~~~~L~~lk-----d~~g~~-l~~~~~~~~--~~~l~G-~pv~~~~~~p~~~~~~gdf~-----~~~~~~~~~~~~ 351 (390) T protein:vir:10 286 INPIDWAAIELAK-----DANNQY-LIGNARGTL--TPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQWDAR 351 (390) T ss_pred EcHHHHHHHHHhh-----cCCCce-eecCCcCcC--Cceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEecceE Confidence 9999988887521 111100 000111111 235654 89999999887766665310 112222111111 Q ss_pred eeEEecC---ccccceeeeeeeecee-ecCcccccCCCccceecccchHHhhc Q lcl|NC_012740. 466 PLRATDP---QSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 466 ~~~~~Dp---~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a 514 (528) .....+. .+-+=.+-...||+.. .+|=+ .++++ +| T Consensus 352 i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a-------~~~~~-------~a 390 (390) T protein:vir:10 352 VEIGYVNDDFQRNMVTVLAEERLALVVYRPEA-------LISGS-------FA 390 (390) T ss_pred EEEeecccccccCcEEEEEEEeeccEEecccc-------EEEEE-------eC Confidence 1111111 1222233344567654 23311 11111 11 No 42 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=89.23 E-value=0.028 Score=29.13 Aligned_cols=368 Identities=14% Similarity=0.109 Sum_probs=139.5 Q ss_pred CcchHHHHHhhhhhh---cCC----ccc---hhccchhhhhhhhhh---hhHHHHhhhccccchhhhhhhhc-cccccc- Q lcl|NC_012740. 1 MKTTKELMEKWSPLL---ENE----KLP---EIATASKQKLVAKIL---ESQEADFAVDPIYKDEKVVEAFG-GFIAEA- 65 (528) Q Consensus 1 ~~~~~~l~~kw~p~l---~~~----~~~---~~~~~~~~~~~~~~~---enq~~~~~~~~~~~~~~~~~~~~-~~l~ea- 65 (528) -...++|.++=.... +.+ ..+ +.....++.-....+ -++++...... -.++. ...+. ...... 
T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~ 144 (477) T protein:vir:84 67 DEQIRELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEP-AKERL-RRHMVDVESDKEI 144 (477) T ss_pred HHHHHHHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhH-HHHHH-HHHHhhhhhhhhH Confidence 111111111100000 000 000 000000000000000 01111000000 00000 00000 000000 Q ss_pred cccccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccc Q lcl|NC_012740. 66 EVAGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAF 143 (528) Q Consensus 66 ~~~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~ 143 (528) ......+.....+..++++|.. ..-|..+ .++...-++.+..++|++.||++.+|-+-=.|..-+ ...+ T Consensus 145 ~~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~-------~~~a- 215 (477) T protein:vir:84 145 RKIAKVGEEYRDLDRNGGTGGY-AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTG-------TSTA- 215 (477) T ss_pred HHHHHhhhhhccccccCCCcce-eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecC-------ccee- Confidence 0000111122222222222211 1113222 245545567778899999999998765422221100 0000 Q ss_pred cccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccc Q lcl|NC_012740. 144 HPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKL 223 (528) Q Consensus 144 ~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (528) .+ T Consensus 216 --------~~---------------------------------------------------------------------- 217 (477) T protein:vir:84 216 --------IQ---------------------------------------------------------------------- 217 (477) T ss_pred --------ee---------------------------------------------------------------------- Confidence 00 Q ss_pred cccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_012740. 
224 MEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILA 303 (528) Q Consensus 224 ~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILS 303 (528) ++.|-.. .....++...+++.++..+|.-+-...+|-||.+|- ..|.++.|.+-|. T Consensus 218 --------~~Eg~~~------------~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~ 273 (477) T protein:vir:84 218 --------AADNAAL------------TAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQA----AVSVDEFVFRDLA 273 (477) T ss_pred --------eccCccc------------ccccccccccceeeEEEeeeeEEeeeHHHHHHHhcc----chhHHHHHHHHHH Confidence 0000000 001234445566777777777777788999999994 3567999999999 Q ss_pred HHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccc----cccchhHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_012740. 304 NEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPID----TRGARWAGESFKSLIYQIDKEAAEIARQTGRG 379 (528) Q Consensus 304 tEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d----~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g 379 (528) ..|..-|++.||. =+ |.. +.+.|++....... ..+.-| .....++..|-...+.+....+. T Consensus 274 ~~~~~~~d~~~l~---G~---Gt~------~~p~Gi~~~~~~~~~~~~~~~~t~--~~~~~~~~~i~~~~~~~~~~~~~- 338 (477) T protein:vir:84 274 ADYANKLNVQVIS---GT---GSN------NQVVGVRATAGITQVTATSAGSAL--EKHQIIYQKIADAIQRVHTSRFL- 338 (477) T ss_pred HHHHHHHHHHHhc---cC---CCC------Cccceeeeccccccccccccccch--hhHHHHHHHHHHHHhhccccccC- Confidence 9999999998883 11 111 13456654432111 011112 11223333444444444443333 Q ss_pred CCcEEEEchhHHHHhhcc----cccccccc-cc-ccccccccccCceEEEEecCceEEEeeCCCCcc--------eEEEE Q lcl|NC_012740. 380 AGNFVIASRNVVNILASA----DQGISLAM-QG-AAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD--------YFTVG 445 (528) Q Consensus 380 ~gn~~v~S~~va~~L~~~----g~~~~~~~-~~-~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG 445 (528) .+..+|++|+....|... |.-...|. 
.+ .......+.-.....|+|.| ++|+++++.|.+ -|++| T Consensus 339 ~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G-~pVv~s~~~p~~~~~~~d~~~i~~g 417 (477) T protein:vir:84 339 EPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG-LPVVTDPTLPTTLGTGTDQDVIHVL 417 (477) T ss_pred CccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc-cceEecCcccccccccCCcceEEEE Confidence 466788888776666542 21111110 00 00111112222223567865 899999987653 35555 Q ss_pred EecCCCccceeEeccccccceeEEecCcccc--ceeeeeeeec-----eeecCcccccCCCccceecccchHH-hhc Q lcl|NC_012740. 446 YKGDNEMDAGIYYAPYVALTPLRATDPQSFH--PVLGFKTRYG-----IGINPFADSKSQAPSARITSGMLSK-DSV 514 (528) Q Consensus 446 ~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~q--P~~~~~tRY~-----l~~nP~~~~~~~~~~~~~~~~~~~~-~~a 514 (528) --.+. +.- ...+...++|.++. ..+.|.+ |+ .+-+|=+ .+.|+....-. -++ T Consensus 418 d~~~~------~i~---~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~~~r~~~a-------fv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 418 RASDL------ALF---ESSVRMRALQETRAENLSVLLQV-YGYLAFTAARFPQS-------VVEIGGTALTAPTFA 477 (477) T ss_pred EeceE------EEE---eeceeEEeccccccccceeeeee-hhhhhhhhhccccc-------eEEeecccccccccC Confidence 44211 000 11122223333322 2222221 22 1124511 11111111111 122 No 43 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=88.92 E-value=0.029 Score=28.98 Aligned_cols=306 Identities=13% Similarity=0.075 Sum_probs=125.0 Q ss_pred CCccccccccccc-ccccc-ccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccc Q lcl|NC_012740. 71 HGYNASNIASGQT-TGAIT-NVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMY 147 (528) Q Consensus 71 ~g~~~~~~~e~t~-tg~v~-~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~ 147 (528) .|+++.+.....+ +.+.. -.-|.++ .+++++..+.+-.+++-+.||++++. +...... 
T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-----~ip~~~~-------------- 61 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGI-----VIPHWTG-------------- 61 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCce-----EEEEEcC-------------- Confidence 4555554333322 11111 1234443 44555556677788888999887641 1111000 Q ss_pred cccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccc Q lcl|NC_012740. 148 SPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEG 227 (528) Q Consensus 148 Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 227 (528) .+.+.| T Consensus 62 ~~~a~w-------------------------------------------------------------------------- 67 (397) T protein:vir:23 62 DVSAQW-------------------------------------------------------------------------- 67 (397) T ss_pred CcceEE-------------------------------------------------------------------------- Confidence 000000 Q ss_pred cccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHH Q lcl|NC_012740. 228 KLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVL 307 (528) Q Consensus 228 ~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEIm 307 (528) + +| +..+++-..+++++++..|..+-.-.+|-||.+|-. .|.|++|.+-|...|. T Consensus 68 ----v--------~E---------g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia 122 (397) T protein:vir:23 68 ----I--------GE---------GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIA 122 (397) T ss_pred ----e--------cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHH Confidence 0 01 011233344456666666666666779999999863 6779999999999999 Q ss_pred HHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEc Q lcl|NC_012740. 
308 LEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIAS 387 (528) Q Consensus 308 lEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S 387 (528) ..||+.+|.=.. ... ...++.+.... .. -++. ...+..+..+...+.. .+...+-+|++ T Consensus 123 ~~~d~a~l~G~g---t~~---------~~~~~~~~~~~--~~--~~~~---~~~~~~~~~~~~~l~~--~~~~~a~~vmn 181 (397) T protein:vir:23 123 MAFDNAALHGTN---APS---------AFQGYLDQSNK--TQ--SISP---NAYQGLGVSGLTKLVT--DGKKWTHTLLD 181 (397) T ss_pred HHHHHHHhhccc---CCc---------ccccccccccc--ee--eecc---cchhHHHHHHHHhhhh--cccCCCEEEEc Confidence 999999984111 100 00011110000 00 0000 0111112222233332 23456789999 Q ss_pred hhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccc---- Q lcl|NC_012740. 388 RNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVA---- 463 (528) Q Consensus 388 ~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~---- 463 (528) ++....|...- +.....-...............|+|.| ++|+++++.+.+-+ +++.|+-. .+||.-.-. T Consensus 182 ~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~~~~~tl~G-~Pv~~s~~~~~g~~-~~~~gDfs---~~~i~~~~~i~i~ 254 (397) T protein:vir:23 182 DTVEPVLNGSV--DANGRPLFVESTYESLTTPFREGRILG-RPTILSDHVAEGDV-VGYAGDFS---QIIWGQVGGLSFD 254 (397) T ss_pred HHHHHHHHHhh--ccCCceeecccccccccccccCceeee-eeEEEeCCCCCCce-EEEEeecc---eEEEEEEeceEEE Confidence 99999988641 000000000111111111122457754 99999988764322 11222211 111111100 Q ss_pred ----cceeEEecCcc-----c---cceeeeeeeecee-ecC--cccccCCCc-cceecccchHHhhcchhhhhhhhhhcc Q lcl|NC_012740. 464 ----LTPLRATDPQS-----F---HPVLGFKTRYGIG-INP--FADSKSQAP-SARITSGMLSKDSVGKNAYFRRVWVKG 527 (528) Q Consensus 464 ----~~~~~~~Dp~s-----~---qP~~~~~tRY~l~-~nP--~~~~~~~~~-~~~~~~~~~~~~~a~~~~~~r~~~Vk~ 527 (528) ..++...|+.. | |=.+=+..|++.. .+| |..-..... ...+. 
..-+-.....+|-+++ T Consensus 255 ~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 328 (397) T protein:vir:23 255 VTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYAL------DLDGASAGNFTLSLDG 328 (397) T ss_pred EeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeee------cccccCcceEEEEecC Confidence 11111111110 1 1122233344442 122 111000000 00000 0001111111222221 Q ss_pred C Q lcl|NC_012740. 528 C 528 (528) Q Consensus 528 ~ 528 (528) = T Consensus 329 ~ 329 (397) T protein:vir:23 329 K 329 (397) T ss_pred c Confidence 1 No 44 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=88.43 E-value=0.032 Score=28.75 Aligned_cols=302 Identities=14% Similarity=0.104 Sum_probs=121.9 Q ss_pred hhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhh Q lcl|NC_012740. 32 VAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFD 109 (528) Q Consensus 32 ~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~D 109 (528) +| .|+|-..... |.+...-..++.++-+ |--+ .+++.+.++.+..+ T Consensus 1 ~a---------------------------~l~el~~~~~-~~~~~g~~~~~~~~li----P~~~~~~ii~~l~~~s~l~~ 48 (333) T protein:vir:78 1 MA---------------------------TLNELLPNSA-GSNHQGRLAHVPSDLL----PKEIVGPIFDKAQESSLVLR 48 (333) T ss_pred Cc---------------------------hhHHhhhhcc-cccccCceecCCcccc----chhHHHHHHHHHHhhchhhh Confidence 12 2222211100 1111111111111112 3322 45666667788889 Q ss_pred ceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccc Q lcl|NC_012740. 110 ICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHT 189 (528) Q Consensus 110 I~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~ 189 (528) ++-+.||++..--|.-.. . 
.+.+.|-+ T Consensus 49 ~~~~~~~~~~~~~~p~~~----~---------------~~~a~~v~---------------------------------- 75 (333) T protein:vir:78 49 MGEQIPISYGETIIPTTV----K---------------RPEVGQVG---------------------------------- 75 (333) T ss_pred hcceeeccCCceEEEEEe----C---------------CceeEeec---------------------------------- Confidence 999999876421111100 0 00111100 Q ss_pred ccccccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEE Q lcl|NC_012740. 190 FAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEA 269 (528) Q Consensus 190 f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTA 269 (528) .|-....+|.. .-......|.+..++..|..+-. T Consensus 76 --------------------------------------------eg~~~~~~e~~--~~~~~~~~f~~i~l~~~kl~~~~ 109 (333) T protein:vir:78 76 --------------------------------------------VGTSNEQREGG--LKPLSGTAWDTRSVSPIKLATIV 109 (333) T ss_pred --------------------------------------------Ccccccccccc--cccccccceeEEEEeeEEEEEee Confidence 00000011100 00112345666666666666544 Q ss_pred ecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccc Q lcl|NC_012740. 270 KSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTR 349 (528) Q Consensus 270 KSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~ 349 (528) ..|-||.+|-. .|.|++|.+.|...|...|+..||.=-......+..|.. ...++..... .+. T Consensus 110 -------~is~ell~~s~----~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~----~~~~~~~~~~-~~~- 172 (333) T protein:vir:78 110 -------TVSEEFARMNP----SGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGID----TDNVIANTTN-VDY- 172 (333) T ss_pred -------hhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccc----cccccccccc-ccc- Confidence 47888887744 567999999999999999999998411111111111111 0111111110 000 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCce Q lcl|NC_012740. 
350 GARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKY 429 (528) Q Consensus 350 ~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~ 429 (528) ........+..|..+-..+...-.+ ..+.+|++|+....|.....+.- ..|. .-...+.... -.|+|.| + T Consensus 173 ----~~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~vmn~~~~~~L~~~~~~~d--~~G~-~i~~~~~~~~-~~~~l~G-~ 242 (333) T protein:vir:78 173 ----LQETGDPLLDRLLDGYDLVSANTDV-EFNGWAVDPRFRAHLLRAQAYRD--ANGN-VDPSRINLAA-QTGDVLG-L 242 (333) T ss_pred ----cccccchhHHHHHHHHHhhcccccc-CceEEEEcchHHHHHHHHhhhcC--CCCc-eeecCccccC-CCceeec-e Confidence 0011111222333333333333344 57888889988777754322110 0000 0011111110 1256765 7 Q ss_pred EEEeeCCCCcce---------EEEE--------EecCCCccceeEeccccccceeEEecCc-----ccc-ceee--eeee Q lcl|NC_012740. 430 KVFIDQYARQDY---------FTVG--------YKGDNEMDAGIYYAPYVALTPLRATDPQ-----SFH-PVLG--FKTR 484 (528) Q Consensus 430 ~vy~D~y~~~dy---------~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~-----s~q-P~~~--~~tR 484 (528) +|+++.+.+.+. +++| ..+..+++ ..+|.- ..|.. -|| -.++ ...| T Consensus 243 Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~----~~~~~~-----~~~~~~~~~~~~~~~~v~~r~~~r 313 (333) T protein:vir:78 243 PAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIK----MSDTAT-----LTDSGSATVSMWQTNQIAILIEVT 313 (333) T ss_pred eeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEE----Eecccc-----ccccccceeehhhcCcEEEEEEEE Confidence 999988765442 3333 22211111 111100 00000 011 1122 2346 Q ss_pred ecee-ecC--cccc-cCCCc Q lcl|NC_012740. 485 YGIG-INP--FADS-KSQAP 500 (528) Q Consensus 485 Y~l~-~nP--~~~~-~~~~~ 500 (528) ++.. 
.+| |+.- ...+| T Consensus 314 ~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 314 FGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EccEEecccceEEEeccCCC Confidence 6644 555 3321 12222 No 45 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=88.02 E-value=0.035 Score=28.57 Aligned_cols=355 Identities=11% Similarity=0.038 Sum_probs=125.2 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhcc--ccchhhhhhhhcccccccc---------ccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDP--IYKDEKVVEAFGGFIAEAE---------VAG 69 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~--~~~~~~~~~~~~~~l~ea~---------~~~ 69 (528) .-..+.|.++...-++. +.. +.+....+.+..++...... .-.......+.+....+.+ ..+ T Consensus 32 ~~e~~~~~~~~~~~~~~-----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (419) T protein:vir:94 32 VAEARGLADALQAESDR-----AAA--RAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRG 104 (419) T ss_pred HHHHHHHHHHHHHHHHH-----HHH--HHHHHHHHHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHhhhhh Confidence 11111122222211111 000 00000000010000000000 0000000000000000000 000 Q ss_pred ----------cCCccccccccccccccccccCcchhh-H-HHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCC Q lcl|NC_012740. 70 ----------DHGYNASNIASGQTTGAITNVGPAVIG-M-VRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAE 137 (528) Q Consensus 70 ----------~~g~~~~~~~e~t~tg~v~~~~P~li~-l-~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~ 137 (528) ..-........++.+.+-...-|.+++ + ..+.-..++..++|.+.||++++. .-+|.. . .. 
T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~--~---~~ 177 (419) T protein:vir:94 105 QFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVL--EYIRDT--S---GT 177 (419) T ss_pred hhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCce--eeeeec--c---cc Confidence 000000000111111111122233331 1 111122345688999999988742 111110 0 00 Q ss_pred cccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCc Q lcl|NC_012740. 138 HAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDD 217 (528) Q Consensus 138 ~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~ 217 (528) . ....+ T Consensus 178 ~------------~~~~~-------------------------------------------------------------- 183 (419) T protein:vir:94 178 A------------GAGST-------------------------------------------------------------- 183 (419) T ss_pred c------------ccccc-------------------------------------------------------------- Confidence 0 00000 Q ss_pred cccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHH Q lcl|NC_012740. 218 EVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAE 297 (528) Q Consensus 218 ~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~E 297 (528) .+-..-.+| +..+++...++++++..+|.=+-...+|-||.||.- +.+++ T Consensus 184 ----------------~~~a~~v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~ 233 (419) T protein:vir:94 184 ----------------WNKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLMGY 233 (419) T ss_pred ----------------CcccceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHhHH-----HHHHH Confidence 000000111 112344444455555555555555679999999963 35899 Q ss_pred HHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceecccccccccc-chhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 
298 LNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRG-ARWAGESFKSLIYQIDKEAAEIARQT 376 (528) Q Consensus 298 LanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~-~r~a~e~~r~L~~~i~~~a~~I~~~T 376 (528) |.+-|+..|...+|+.||. -+.. +.+.|++.......... .-+.....-..+..|.++-+.+.. T Consensus 234 i~~~la~a~~~~~d~aii~---G~G~----------~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~-- 298 (419) T protein:vir:94 234 IQGRLTYGLRFLRDRQLLN---GNGS----------TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEI-- 298 (419) T ss_pred HHHHHHHHHHHHHHHHHHh---ccCc----------ccccceecccccccccccccccccccchhHHHHHHHHHhhhh-- Confidence 9999999999999999983 1111 12333332211000000 000111112234444444444432 Q ss_pred ccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCcccee Q lcl|NC_012740. 377 GRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGI 456 (528) Q Consensus 377 ~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~ 456 (528) .+...+.+||+|.....|...- ...+...-...+... -..++|.| ++|+++...|..-+++|--. - T Consensus 299 ~~~~~~~~v~n~~~~~~l~~~k-----~~~~~~~~~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~gd~~-------~ 364 (419) T protein:vir:94 299 AGFPPDGVVVHPQDWESIELDQ-----APGSGVFRVIANVQG-EATPRIWG-LNVVSTVAIAQGTALVGGFR-------Q 364 (419) T ss_pred ccCCCCEEEEcHHHHHHHHHHh-----hcCCCceeecCCccc-CCCccccc-eeeEEcCCCCCccEEEeecc-------c Confidence 2335778999999988886431 000110001111111 01236655 89999998776555555211 0 Q ss_pred EeccccccceeEEecCc------cccceeeeeeeeceee-cC--cccccCCCccceec Q lcl|NC_012740. 457 YYAPYVALTPLRATDPQ------SFHPVLGFKTRYGIGI-NP--FADSKSQAPSARIT 505 (528) Q Consensus 457 fyaPYv~~~~~~~~Dp~------s~qP~~~~~tRY~l~~-nP--~~~~~~~~~~~~~~ 505 (528) +|--+.-..+..-+++. 
.-+=.+=+..||++.+ +| |....--+ + .+ T Consensus 365 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~a--a-~~ 419 (419) T protein:vir:94 365 GATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA--A-TT 419 (419) T ss_pred eEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEecc--C-CC Confidence 01001111111112221 1222334455666542 22 11100000 0 00 No 46 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=87.60 E-value=0.038 Score=28.39 Aligned_cols=282 Identities=11% Similarity=0.045 Sum_probs=121.9 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccc Q lcl|NC_012740. 78 IASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSL 156 (528) Q Consensus 78 ~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~ 156 (528) .+++++++... .-|.+. .++.++.+..+..+++.+.||.+-.. +|+-.. + .+++.| T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-------~~p~~~---~---------~~~a~w--- 57 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ-------REFVFD---F---------DSDIDI--- 57 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCce-------EEEEEe---c---------CcceEE--- Confidence 45555555443 334333 44444555667778999999876421 111000 0 000000 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccccccccccc Q lcl|NC_012740. 
157 AAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGM 236 (528) Q Consensus 157 ~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gm 236 (528) + T Consensus 58 ---------------------------------------------------------------------------v---- 58 (300) T protein:vir:95 58 ---------------------------------------------------------------------------V---- 58 (300) T ss_pred ---------------------------------------------------------------------------e---- Confidence 0 Q ss_pred chhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_012740. 237 ATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) Q Consensus 237 sTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~ 316 (528) +| +...++...+++.++..+|.=+-...+|-||.+.... ..+|-+++|.+-|...|...+++.++. T Consensus 59 ----~E---------g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~ 124 (300) T protein:vir:95 59 ----AE---------NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIH 124 (300) T ss_pred ----eC---------CcccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 01 0112233333444444444444456689998753222 246678888888999999999888883 Q ss_pred hhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhc Q lcl|NC_012740. 317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) Q Consensus 317 ~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~ 396 (528) =... .. | ++. ...|.......... .... .....+.-|.++...+.. .+.+.+-+|++|.....|.. 
T Consensus 125 G~~~-~~-g-~~~-----~~~~~~~~~~~~~~-~~~~---~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~ 190 (300) T protein:vir:95 125 GINP-RT-K-QAS-----TIIGDNCFDKKVTQ-TVPF---KDTNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSK 190 (300) T ss_pred cccC-CC-C-CCc-----ccccccccccccce-eecc---cccchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHH Confidence 2110 00 0 000 00111000000000 0000 001223334444443332 23466778999999988865 Q ss_pred cccccccccccccccccccccCceEEEEecCceEEEeeCCCCc------ceEEEEEecCCCccceeEeccccccceeEE- Q lcl|NC_012740. 397 ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ------DYFTVGYKGDNEMDAGIYYAPYVALTPLRA- 469 (528) Q Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~- 469 (528) .. ...|.. -...+.+. -..++|.| ++|+++.+.+. +.+++|- +.-+++|.......+++. T Consensus 191 lk-----d~~G~~-i~~~~~~~-~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~ 257 (300) T protein:vir:95 191 MK-----NAEGGK-LYPELAWG-GVPDAING-LAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIK 257 (300) T ss_pred hh-----ccCCCe-eccCcccc-CCCceecc-eeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEee Confidence 31 111110 01111111 12467866 89999888543 2233331 111223332222222221 Q ss_pred -ecCcc-----cc---ceeeeeeeeceee-cCcccccCCCccceecccchHHhhcchhhhhhhhhhcc Q lcl|NC_012740. 470 -TDPQS-----FH---PVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKG 527 (528) Q Consensus 470 -~Dp~s-----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~ 527 (528) .|++. 
|| =.+=+..|++..+ +|=+ +.+..++.| T Consensus 258 ~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a-------------------------~~~l~~~~g 300 (300) T protein:vir:95 258 YGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAAS-------------------------FARIVKTGG 300 (300) T ss_pred ccCCCCcchhhhhcCcEEEEEEEeecceeecccc-------------------------eEEEecCCC Confidence 23321 21 2233345777543 5511 111112222 No 47 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=87.56 E-value=0.038 Score=28.37 Aligned_cols=259 Identities=13% Similarity=0.081 Sum_probs=111.4 Q ss_pred hhhhhccccccccccccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeec Q lcl|NC_012740. 54 VVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYG 131 (528) Q Consensus 54 ~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~ 131 (528) +.+++. .++++..-.-. |.-+ .+++.+-++.+-.+++.+-||++.+|-+ .+. T Consensus 1 ~l~~~~--------------------~~t~~~gg~li-P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~-----~~~ 54 (293) T protein:vir:48 1 MLDSKT--------------------DHSGSDAGLTI-PQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSR-----VYE 54 (293) T ss_pred Cceeec--------------------ccccCcCceEe-chhHHHHHHHHHHhhhhhhhhceeeeccCCcceE-----EEE Confidence 222221 11111110011 2222 3555555677778888888888765311 111 Q ss_pred CCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCc Q lcl|NC_012740. 132 GDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKA 211 (528) Q Consensus 132 ~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~ 211 (528) .... 
.++ T Consensus 55 ~~~~-----------------~~~-------------------------------------------------------- 61 (293) T protein:vir:48 55 KWTD-----------------ITG-------------------------------------------------------- 61 (293) T ss_pred eecC-----------------CCc-------------------------------------------------------- Confidence 0000 000 Q ss_pred ccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccce-eEEeeEEEEEecccccccchHHHHHHHHhhc Q lcl|NC_012740. 212 DSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMS-MRIDKQVVEAKSRQLKARYSIEVAQDLRAVH 290 (528) Q Consensus 212 ~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMa-FSIEK~TVTAKSRALKAEYT~ELAQDLkAiH 290 (528) ...-.+| +...+|.+ .++++++..+|.-+-...+|-||.+|. T Consensus 62 ------------------------~a~~v~E---------g~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds---- 104 (293) T protein:vir:48 62 ------------------------LANIDDE---------AGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS---- 104 (293) T ss_pred ------------------------ceeeecC---------CcccccccccceeEEEEeeeEEEEeehhhHHHHhhh---- Confidence 0000011 11123332 345555556666666677999999986 Q ss_pred CCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_012740. 291 GMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAA 370 (528) Q Consensus 291 GLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~ 370 (528) .+|.|++|.+-|...|..-+|+.|+.-+...+ ...+..++ +....|+.+ T Consensus 105 ~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~------------~~~~~~~~-------------d~i~~~~~~------ 153 (293) T protein:vir:48 105 AENILAWLSGWIAKKVVVTRNKAILGVVDKLP------------TKPTLTKW-------------DDIIDLEAK------ 153 (293) T ss_pred hHHHHHHHHHHHHHHHHHHHHhHHhhcccccc------------ccccccCH-------------HHHHHHHHh------ Confidence 36789999999999999999999985432111 11111111 222333333 Q ss_pred HHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe--eCCCCc--------- Q lcl|NC_012740. 
371 EIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI--DQYARQ--------- 439 (528) Q Consensus 371 ~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------- 439 (528) +...-+ .....+|++.....|.... ...| ..-...+.+.. ..++|.| ++|++ |.+.+. T Consensus 154 -l~~~~~--~~a~~vmn~~~~~~L~~lk-----d~~g-~~l~~~~~~~~-~~~~l~G-~Pv~~~~~~~~~~~~~~~~~~~ 222 (293) T protein:vir:48 154 -VDPAIK--QTSFFLTNTSGFTALKKVK-----NALG-DYLMERDVKSP-TGYSIAG-FAVKEISDRWLPNASSGVMPLY 222 (293) T ss_pred -hhhhhc--CCCEEEEcHHHHHHHHHhh-----ccCC-ceEeecCcCCC-CCceecc-eeeEEecccccCCccCCceEEE Confidence 332222 3456789999988887531 1111 01111122211 1346765 77775 322221 Q ss_pred -----ceEEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeece---------------eecCcccccCCC Q lcl|NC_012740. 440 -----DYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGI---------------GINPFADSKSQA 499 (528) Q Consensus 440 -----dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l---------------~~nP~~~~~~~~ 499 (528) +++.++.++.-..+ ..++.. .+-.+-|=.+-...||+. .+-|+....+-+ T Consensus 223 ~gd~~~~~~~~~~~~~~i~----~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~ 292 (293) T protein:vir:48 223 FGDLKQAVTLFDRQQMSLL----STNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTA 292 (293) T ss_pred EEeccceEEEEEecceEEE----Eecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccccC Confidence 12222222211111 111000 001122233333444443 333333222211 Q ss_pred ccceecccchHH Q lcl|NC_012740. 500 PSARITSGMLSK 511 (528) Q Consensus 500 ~~~~~~~~~~~~ 511 (528) . 
T Consensus 293 -----------~ 293 (293) T protein:vir:48 293 -----------V 293 (293) T ss_pred -----------C Confidence 0 No 48 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=87.14 E-value=0.041 Score=28.21 Aligned_cols=277 Identities=12% Similarity=0.051 Sum_probs=124.4 Q ss_pred cccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccc Q lcl|NC_012740. 68 AGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPM 146 (528) Q Consensus 68 ~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~ 146 (528) -....+++.+...++..+. -.-+.+. .+++.+.+.-+-..++.+.||++++...+-... . + T Consensus 1 m~~~~~~~~~~~~t~~~~~--lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--~-------~------- 62 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDG--TLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQT--D-------G------- 62 (297) T ss_pred CCccccccccccccCCCcc--eechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEc--C-------C------- Confidence 1222344444432222222 1222222 555666677788888999999888655432210 0 0 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 147 YSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 147 ~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) +.+.| T Consensus 63 --~~a~~------------------------------------------------------------------------- 67 (297) T protein:vir:95 63 --ISAYW------------------------------------------------------------------------- 67 (297) T ss_pred --ceeEE------------------------------------------------------------------------- Confidence 00000 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 
227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) .+| +..+++-..++++++...|..+-.-.+|.||.+|-. .|.+.+|.+-|+..| T Consensus 68 -------------v~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai 121 (297) T protein:vir:95 68 -------------VNE---------TEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAF 121 (297) T ss_pred -------------eec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHH Confidence 001 011233333445555555555556679999999864 467899999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEE Q lcl|NC_012740. 307 LLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIA 386 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 386 (528) ...+++.||.=. -.. .+.|++........ ... ..-.+.-|.++...|...- ...+.+|+ T Consensus 122 ~~~~d~a~l~G~---g~~----------~~~gi~~~~~~~~~----~~~--~~~t~~~i~~~~~~l~~~~--~~~~~~v~ 180 (297) T protein:vir:95 122 YKKIDEAGLLGH---DTP----------FANSVAKAAKDANK----VIG--GPINYDNILKLQDALYDAD--VEPNAFVS 180 (297) T ss_pred HHHHHHHHhccc---CCc----------ccccccccccccce----ecc--cccCHHHHHHHHHHhhhcc--CCcCEEEE Confidence 999999998311 000 12333322111000 000 0111233445555554432 24567899 Q ss_pred chhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCC--CcceEEEEEecCCCccceeEecccccc Q lcl|NC_012740. 387 SRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYA--RQDYFTVGYKGDNEMDAGIYYAPYVAL 464 (528) Q Consensus 387 S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy~~vG~KG~~~~d~g~fyaPYv~~ 464 (528) +|+....|...- ...|. . ..... .++|.| ++|++-+.. +..-+++|=. ..++|...-+. 
T Consensus 181 ~~~~~~~L~~l~-----d~~G~-~--i~~~~----~~~l~G-~Pv~~~~~~~~~~~~~~~gd~------s~~~~~~~~~~ 241 (297) T protein:vir:95 181 KIQNRSALREAR-----DGNKV-S--IYDKA----ANTIDG-ITTVDLKSARFEKGDLLAGDF------DNLIYGVPYNI 241 (297) T ss_pred cHHHHHHHHHhh-----ccCCc-e--eecCC----CCcccc-eeeEeecCCCCCCceEEEEec------ccEEEEEecCe Confidence 999999887521 11110 0 01111 234544 677654432 2222333211 01122221111 Q ss_pred cee--------EEecCc-----ccc-ceeee--eeeeceee-cCcccccCCCccceecccchH Q lcl|NC_012740. 465 TPL--------RATDPQ-----SFH-PVLGF--KTRYGIGI-NPFADSKSQAPSARITSGMLS 510 (528) Q Consensus 465 ~~~--------~~~Dp~-----s~q-P~~~~--~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~ 510 (528) .+. ...|+. -|| =.++| ..|++..+ ||=+ .+++....+. T Consensus 242 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a-------~~~l~~at~~ 297 (297) T protein:vir:95 242 TYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDA-------FAKLTPAERV 297 (297) T ss_pred EEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccc-------eEEEeecCCC Confidence 111 111221 022 12222 35666653 3311 2233333333 No 49 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=86.78 E-value=0.043 Score=28.07 Aligned_cols=328 Identities=13% Similarity=0.030 Sum_probs=125.7 Q ss_pred CcchHHHHHhhhhhhcCCc--------------cchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEK--------------LPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAE 66 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~--------------~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~ 66 (528) +-.-+.|.++...+-+... .++......+......-+.+. 
.|+......-.+..+.+.+ T Consensus 39 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~a~~~~~~~~~~~~~~ 111 (397) T protein:vir:12 39 LDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQ-------QYSKAFLKGLRGKRLTDEE 111 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHH-------HHHHHHHHHHhccCCcHHH Confidence 3333444444333221100 000000000000000000000 1111111000112221111 Q ss_pred ccccCCcccccccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccc Q lcl|NC_012740. 67 VAGDHGYNASNIASGQ-TTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAF 143 (528) Q Consensus 67 ~~~~~g~~~~~~~e~t-~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~ 143 (528) -.....-+...+..++ ++|.+.- |.-+ .+++.+.++.+-.+++.+.||+++.|-+--.|.. + + T Consensus 112 ~~~~~~~~~~a~~~~~~~~gg~lv--P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~------~---- 177 (397) T protein:vir:12 112 RDLLDSPEFRAMSGINDEDGGILI--PEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNA--D------M---- 177 (397) T ss_pred HHHHhhhhhhhccccccccCcccC--chhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEec--C------C---- Confidence 1000000111111221 2222211 2221 3555555677788999999999887643222110 0 0 Q ss_pred cccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccc Q lcl|NC_012740. 144 HPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKL 223 (528) Q Consensus 144 ~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (528) +.+.| T Consensus 178 -----~~a~~---------------------------------------------------------------------- 182 (397) T protein:vir:12 178 -----VPFSP---------------------------------------------------------------------- 182 (397) T ss_pred -----cceee---------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHH Q lcl|NC_012740. 
224 MEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILA 303 (528) Q Consensus 224 ~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILS 303 (528) ++.| ++ ........|.+..|+..|..+- ..+|-||.+|-- +|.++.|.+.|. T Consensus 183 --------v~Eg-----~~----~~~~~~~~~~~v~~~~~k~~~~-------~~is~e~l~ds~----~~l~~~i~~~l~ 234 (397) T protein:vir:12 183 --------VEEL-----GN----LPEIDQPRFTKVSYSIIDYGGI-------MTLSNSMLNDSD----QAIMTYVAKWFA 234 (397) T ss_pred --------eccc-----cc----ccccccccceeEEeeheeeEee-------ehhhHHHHhhch----HHHHHHHHHHHH Confidence 0000 00 0001123466666666666665 449999998853 567899999999 Q ss_pred HHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcE Q lcl|NC_012740. 304 NEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNF 383 (528) Q Consensus 304 tEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~ 383 (528) ..|...+|+.|+.-. . .+.+.|+..+++- .+.++..++ ..+..+.. T Consensus 235 ~~~~~~~d~~il~G~---g----------~~~~~g~~~~~~i------------~~~~~~~l~---------~~~~~~a~ 280 (397) T protein:vir:12 235 KKSVVTRNNLILAAI---A----------SLKKVDIDGLDGI------------KKALNVTLD---------PMVAPGSI 280 (397) T ss_pred HHHHHHHHHHHHhcc---c----------cccccccccHHHH------------HHHHhhccc---------hhhhCCCE Confidence 999999999888421 1 1134455433211 111222222 12234566 Q ss_pred EEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccc- Q lcl|NC_012740. 384 VIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYV- 462 (528) Q Consensus 384 ~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv- 462 (528) +||+|.....|... +...|. .-...+.+.. .-++|.| ++|++.+...... 
..-+.-++|+.|- T Consensus 281 ~~~n~~~~~~L~~l-----kd~~G~-~l~~~~~~~g-~~~~l~G-~pv~~~~~~~~~~--------~~~~~~~~~gd~~~ 344 (397) T protein:vir:12 281 VLTNQDGYDWLDTL-----KDGTGR-YLLQPDPTNP-TKKLLDG-RPVVPFTNRVLKT--------QKGKAPLIIGNLKE 344 (397) T ss_pred EEEcHHHHHHHHHh-----hccCCc-eeecccccCC-CCccccc-eeeEEeccccccc--------CCCccEEEEEehhc Confidence 88999998888653 111110 0011121111 1246755 7887654321100 0000112222211 Q ss_pred --------ccceeEEecC----ccccceeeeeeeecee-ecCcccccCCCccce Q lcl|NC_012740. 463 --------ALTPLRATDP----QSFHPVLGFKTRYGIG-INPFADSKSQAPSAR 503 (528) Q Consensus 463 --------~~~~~~~~Dp----~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~ 503 (528) .+.....-.+ .+.+-.+-...|++.. .||=+...-. -.++ T Consensus 345 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~-~t~~ 397 (397) T protein:vir:12 345 AIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQ-ITVE 397 (397) T ss_pred eEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-EeeC Confidence 0111111111 1223445556666654 2331111000 0011 No 50 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=86.27 E-value=0.047 Score=27.88 Aligned_cols=273 Identities=11% Similarity=0.014 Sum_probs=118.4 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....| ..... -.|-...++....... ............... .. 
T Consensus 1 ma~~~T------------------~~~~~--iiPev~~~~v~~~~~~-------~~~~~~~~~~~~~l~---------g~ 44 (274) T protein:vir:93 1 MPQGIT------------------KTSNQ--IIPEVLAPMMQAQLEK-------KLRFASFAEVDSTLQ---------GQ 44 (274) T ss_pred CCccce------------------ehhhe--echHHHHHHHHHHHHh-------hhhhccccccccccc---------CC Confidence 100000 00000 0011111111000000 000000000000000 00 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) .|...+...=-.+..++. +......++.++. ..+.+++-|-|+-. |.+.=-+.+.+ +-|.-.+..+-++.. T Consensus 45 ~G~tv~ip~~~~~g~~~~---~~eg~~i~~~~it--~~~~~~~i~~~~~~--~~i~D~~~~~~--~~d~~~~~~~~~~~~ 115 (274) T protein:vir:93 45 PGDTLTFPAFVYSGDAQV---VAEGEKIPTDILE--TKKREAKIRKIAKG--TSITDEALLSG--YGDPQGEQVRQHGLA 115 (274) T ss_pred CCCEEEEEeeccCCCccc---ccCCCcccccccc--cceeEEEeeeeccc--ccccHHHHHhh--ccchHHHHHHHHHHH Confidence 111111111000112221 1112233445444 44455555666522 33322222223 578999999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) +...++++++..+...+. ++ .... ...+-+-.+..++.++. ..+++++ T Consensus 116 ~a~~~d~~~~~~~~~a~~--------~~--~~~~-------------~~~d~i~dA~~~l~d~~---------~~~~~iv 163 (274) T protein:vir:93 116 HANKVDNDVLEALMGAKL--------TV--NADI-------------TKLNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHhcccc--------cc--cccc-------------cCHHHHHHHHHHhhhcc---------CCccEEE Confidence 999999999987643221 00 0001 11333344444444321 2578999 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 
386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) |+|.+++.|............... .+....-.+|.+.| ++||+|+..|..-..+.-+|.- -|.---+.. T Consensus 164 v~p~~~~~L~k~~~~~f~~~s~~g----~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gai------~~~~~~~~~ 232 (274) T protein:vir:93 164 INPLDAGKLRGDASTNFTRATELG----DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKGAV------KLILKRDFF 232 (274) T ss_pred eCHHHHHHHHhhhhhccccccccc----ccceeecccceecC-eeEEEcCCCCcceEEEEeCCeE------EEEecCCcc Confidence 999999999865333222221111 11222335888875 9999999988654433333321 121111223 Q ss_pred eeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhhcchhhh Q lcl|NC_012740. 466 PLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSVGKNAY 519 (528) Q Consensus 466 ~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 519 (528) ...--|+.++.=.|-...|||+. .||= .-..+..+. +.=.| T Consensus 233 vE~~Rd~~~~~d~i~~~~~y~~~~~~~~-------~~v~~t~~~------~s~~~ 274 (274) T protein:vir:93 233 LEVARDASTKTTALYSDKHYVAYLYDES-------KAVKITKGS------GSLEM 274 (274) T ss_pred cccccchhhcccEEEEEEEEEEEEEcCC-------ceEEEeeCc------cccCC Confidence 33456899999999999999986 3440 011111111 11112 No 51 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=84.46 E-value=0.06 Score=27.27 Aligned_cols=350 Identities=12% Similarity=0.088 Sum_probs=133.7 Q ss_pred Ccc----hHHHHHhhhhhhcCCccchhccchhhhh--hhhhhhhHHHHh--------------hh--------ccccchh Q lcl|NC_012740. 1 MKT----TKELMEKWSPLLENEKLPEIATASKQKL--VAKILESQEADF--------------AV--------DPIYKDE 52 (528) Q Consensus 1 ~~~----~~~l~~kw~p~l~~~~~~~~~~~~~~~~--~~~~~enq~~~~--------------~~--------~~~~~~~ 52 (528) |.+ .++|.+++.-+-+. 
+-++.+.-+..+ ...+.+.+++.+ .+ +..-..+ T Consensus 1 m~~~~k~l~el~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQ--IKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGE 78 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 665 44555555444221 000100000000 001111111110 00 0000000 Q ss_pred hhhhhhcccccccc--------ccccCC-ccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccce Q lcl|NC_012740. 53 KVVEAFGGFIAEAE--------VAGDHG-YNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQ 122 (528) Q Consensus 53 ~~~~~~~~~l~ea~--------~~~~~g-~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGL 122 (528) ......+....+.. ..+.+. ........++++.+-.-.-|.++ .++++.-+..+..++|.++||.+++.- T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 158 (395) T protein:vir:43 79 EAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVE 158 (395) T ss_pred chhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceE Confidence 00000000010110 000000 00001011111111111223222 455556677888899999998876421 Q ss_pred eeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 123 IFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVT 202 (528) Q Consensus 123 IFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~ 202 (528) + .+ ..... +.+ T Consensus 159 ~--~~--~~~~~--------------~~a--------------------------------------------------- 169 (395) T protein:vir:43 159 Y--VR--ETGFV--------------NNA--------------------------------------------------- 169 (395) T ss_pred E--EE--EecCC--------------Cce--------------------------------------------------- Confidence 1 11 00000 000 Q ss_pred cccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHH Q lcl|NC_012740. 
203 AEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEV 282 (528) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~EL 282 (528) .-+ +| +...++-..+++++++..|.-+-...+|-|| T Consensus 170 ---------------------------~~v--------~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (395) T protein:vir:43 170 ---------------------------APV--------SE---------GTQKPYSDLTFELENAPVRTIAHLFKASRQI 205 (395) T ss_pred ---------------------------eee--------cC---------CccccccccceeEEEEeeeeEEEeehhhHHH Confidence 000 01 0011222333444444444444556799999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHH Q lcl|NC_012740. 283 AQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLI 362 (528) Q Consensus 283 AQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~ 362 (528) .||.- +.++.|.+-|+..|...+|+.||. -+ |.. ....|++......-... -... ....++ T Consensus 206 l~d~~-----~l~~~v~~~la~a~~~~~d~~~l~---G~---g~~------~~~~Gi~~~~~~~~~~~-~~~~-~~~~~~ 266 (395) T protein:vir:43 206 LDDAS-----ALQSYIDARARYGLMLVEECQLLY---GN---GTG------ANLHGIIPQAQAYAPPS-GVVV-TAEQRI 266 (395) T ss_pred HHhHH-----HHHHHHHHHHHHHHHHHHHHHHHh---cc---CCC------Ccccccccccccccccc-cccc-ccchhH Confidence 99863 358899999999999999998883 11 100 01234432221100000 0000 011233 Q ss_pred HHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceE Q lcl|NC_012740. 363 YQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYF 442 (528) Q Consensus 363 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 442 (528) ..|..+.+.+.. .+.+...+|+||.....|..-- ...|. -+..+.... 
-.++|.| ++|+++++.|.+=+ T Consensus 267 ~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk-----d~~G~--~i~~~~~~~-~~~~l~G-~pVv~~~~~~~~~~ 335 (395) T protein:vir:43 267 DRIRLAILQAQL--AEFPASGIVLNPIDWALIELNK-----DAENR--YIIGSPQNG-TTPTLWR-LPVVETQAITQDEF 335 (395) T ss_pred HHHHHHHHhhcc--ccCCCcEEEEcHHHHHHHHHhh-----ccCCc--eeccccccC-CCceecc-eeeEEcCCCCCCcE Confidence 344444444433 3445778999999988876421 11111 011111111 1346765 89999999877666 Q ss_pred EEEEecCCCccceeEeccccccceeEEecCc---cccc---eeeeeeeeceee-cCcccccCCCccceecccchHHhhc Q lcl|NC_012740. 443 TVGYKGDNEMDAGIYYAPYVALTPLRATDPQ---SFHP---VLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 443 ~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~---s~qP---~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a 514 (528) ++|--.. ..+. +.-..+..-+++. .|+- .+-+..|++..+ +|=+ .+++. -. .+ T Consensus 336 ~~gd~~~-----~~~~--~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~~~~~-~t----aa 395 (395) T protein:vir:43 336 LTGAFSL-----GAQI--FDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEA-------FVTGS-LT----AS 395 (395) T ss_pred EEEeccc-----eEEE--EEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc-------eEEEE-ec----cC Confidence 6553211 0000 1111111112221 2322 333445777653 2311 11110 00 00 No 52 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=83.78 E-value=0.066 Score=27.07 Aligned_cols=281 Identities=13% Similarity=0.115 Sum_probs=124.9 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccc Q lcl|NC_012740. 78 IASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSL 156 (528) Q Consensus 78 ~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~ 156 (528) ++-. +++.+ -..|.+. .+++++-+..+..+++.+.||++.+.-|. ++.. 
+.+| .| T Consensus 1 m~t~-t~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~------~~~a---------~w--- 56 (303) T protein:vir:97 1 MGTE-TSKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTL------DSDI---------DV--- 56 (303) T ss_pred Cccc-CCCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEec------Ccce---------EE--- Confidence 3322 33332 2334444 66777778888999999999976543221 1100 0000 00 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccccccccccc Q lcl|NC_012740. 157 AAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGM 236 (528) Q Consensus 157 ~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gm 236 (528) + T Consensus 57 ---------------------------------------------------------------------------v---- 57 (303) T protein:vir:97 57 ---------------------------------------------------------------------------V---- 57 (303) T ss_pred ---------------------------------------------------------------------------e---- Confidence 0 Q ss_pred chhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_012740. 237 ATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) Q Consensus 237 sTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~ 316 (528) +| +...++-..+++.++..+|.-+-...+|-||.|.... ..++-+++|.+-|+..|...|+..+|. T Consensus 58 ----~E---------~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~ 123 (303) T protein:vir:97 58 ----AE---------NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMH 123 (303) T ss_pred ----ec---------CccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 01 0011122222334444444444445799999863322 246678899999999999999888884 Q ss_pred hhhhheeecccceeeccccccceeccc--cccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHh Q lcl|NC_012740. 
317 VINFTAQVGKTGMTQTVGSKAGVFDLQ--DPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNIL 394 (528) Q Consensus 317 ~i~~~a~~~~~~~~~~~~~~~g~~dl~--~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L 394 (528) =.....- +.+...|...+. ...-+..+ ....++.-|.++-+.+.. .....+-+|++|.....| T Consensus 124 G~~~~~g--------~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L 188 (303) T protein:vir:97 124 GINPRTK--------KASDVIGTNHFDSKVTQVVKFT-----ESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTAL 188 (303) T ss_pred ccccCCc--------cccccccccccccccccccccc-----cccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHH Confidence 3211000 001111111111 00000000 001233444444444433 233566799999999888 Q ss_pred hccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc--------eEEEEEecCCCccceeEeccccccce Q lcl|NC_012740. 395 ASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD--------YFTVGYKGDNEMDAGIYYAPYVALTP 466 (528) Q Consensus 395 ~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG~KG~~~~d~g~fyaPYv~~~~ 466 (528) ... +...|.. -...+.....-.|+|.| ++|+++.+-+.. .+++| + +...+.+...-...+ T Consensus 189 ~~l-----kd~~g~~-~~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~G---d--f~~~~~~~~~~~~~~ 256 (303) T protein:vir:97 189 AKV-----TNGEMGP-KMYPELAWGANPDSING-LKSSVNTTVGAGADEAESKDLVIIG---D--FESMFKWGYAKQIPM 256 (303) T ss_pred HHh-----hccCCCe-EEecCccCCCCCceecc-eeeEEecccCCccccCCCccEEEEe---e--ccccEEEEEecCcEE Confidence 642 1111100 01111111111356876 999998875432 22222 2 111122222222222 Q ss_pred e--EEecCcc-----ccc-eeee--eeeecee-ecCcccccCCCccceecccch Q lcl|NC_012740. 467 L--RATDPQS-----FHP-VLGF--KTRYGIG-INPFADSKSQAPSARITSGML 509 (528) Q Consensus 467 ~--~~~Dp~s-----~qP-~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~~ 509 (528) . .-.|++. |+- .++| ..||+.. 
.||=+ .+++.++.- T Consensus 257 ~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~a-------f~~l~~~~~ 303 (303) T protein:vir:97 257 EIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKS-------FARVTKGEV 303 (303) T ss_pred EEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccc-------eEEeeCCCC Confidence 2 2223321 221 2444 5678764 45511 233333333 No 53 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=83.72 E-value=0.066 Score=27.05 Aligned_cols=285 Identities=14% Similarity=0.083 Sum_probs=120.3 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccc Q lcl|NC_012740. 78 IASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSL 156 (528) Q Consensus 78 ~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~ 156 (528) .+..++++.. ..-+.+. .+++++-+..+..+++-+.||+... +| |+-..+ + +.+.| T Consensus 1 Mat~tt~~g~-~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~-----~~--~p~~~~---~---------~~a~w--- 57 (311) T protein:vir:99 1 MATFGTGNLK-NLPRNIADGMVKDVVQGSTVAVLSARKPQRFGN-----ED--IITFNG---R---------PKAEF--- 57 (311) T ss_pred CceecCCCce-eccHHHHHHHHHHHHhhchhhhhcceeeccCCc-----eE--EEEEeC---C---------ceeEE--- Confidence 2222222222 1222222 5677777788888888888887542 11 111000 0 00000 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccccccccccc Q lcl|NC_012740. 
157 AAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGM 236 (528) Q Consensus 157 ~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gm 236 (528) T Consensus 58 -------------------------------------------------------------------------------- 57 (311) T protein:vir:99 58 -------------------------------------------------------------------------------- 57 (311) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred chhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_012740. 237 ATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) Q Consensus 237 sTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~ 316 (528) .+| +..+++...++++++..+|.-+-....|-||.|+-.- -..|-+++|.+-|...|+..|++.+|. T Consensus 58 ---v~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~ 124 (311) T protein:vir:99 58 ---VGE---------GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQTLSEAGAEALARALDLGLYH 124 (311) T ss_pred ---eec---------CcccccccceeeEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 011 0112333334445555555555567799999763321 135568888888999999999888885 Q ss_pred hhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhc Q lcl|NC_012740. 317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) Q Consensus 317 ~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~ 396 (528) -.....--+..|...-.+.......+... .+ -.+..-|+.+-..+...-.+...+-.|++|+....|.. 
T Consensus 125 G~g~~~g~~~~g~~~~~~~~~~~~~~~~~------~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~ 193 (311) T protein:vir:99 125 RINPLTGTVIPGWSNYLGAASKRVELTAD------TI-----ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLST 193 (311) T ss_pred ccCcccCccccccccccccccceeecccc------cc-----chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHh Confidence 32110000111111000001111111110 00 11222233333333333333345668999999999865 Q ss_pred cccccccccccccccccccccCceEEEEecCceEEEeeCCC----------------CcceEEEEEecCCCccceeEecc Q lcl|NC_012740. 397 ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYA----------------RQDYFTVGYKGDNEMDAGIYYAP 460 (528) Q Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~----------------~~dy~~vG~KG~~~~d~g~fyaP 460 (528) .. ...|. .-...+.... -.++|.| ++|++..+- +.+++++|= ...++.|.- T Consensus 194 lk-----d~~G~-~l~~~~~~~~-~~~~l~G-~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gd-----f~~~~~~~~ 260 (311) T protein:vir:99 194 AR-----YTDGR-KKFPELGLGI-GVSSFEG-IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGD-----FANGIHWGV 260 (311) T ss_pred hh-----ccCCC-eeecCcccCC-CCceecc-eeeEeecccccccccccccchhhccCcceEEEee-----ccccEEEEE Confidence 31 11110 0011111110 1356755 888887653 233333331 111222322 Q ss_pred ccccceeEEe--cCccc-----cceeee--eeeeceeecCcccccCCCccceecccch Q lcl|NC_012740. 461 YVALTPLRAT--DPQSF-----HPVLGF--KTRYGIGINPFADSKSQAPSARITSGML 509 (528) Q Consensus 461 Yv~~~~~~~~--Dp~s~-----qP~~~~--~tRY~l~~nP~~~~~~~~~~~~~~~~~~ 509 (528) .-...+++.- |++.. .--++| ..|||..+-+ . 
..+++.++.- T Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~------~-~~v~~~~~~A 311 (311) T protein:vir:99 261 QRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFT------D-RFVVIENAVA 311 (311) T ss_pred ecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecC------h-hHeeeecccC Confidence 2222222221 23321 112344 5788865432 0 1233332222 No 54 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=83.03 E-value=0.072 Score=26.85 Aligned_cols=289 Identities=14% Similarity=0.119 Sum_probs=117.1 Q ss_pred hhhccccchhhhhhhhccccccccccccCCcc--cccccccccccccc-ccCcchh-hHHHHHHhhhhhhhceeeecCCc Q lcl|NC_012740. 43 FAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYN--ASNIASGQTTGAIT-NVGPAVI-GMVRRAIPNLIAFDICGVQPMST 118 (528) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~--~~~~~e~t~tg~v~-~~~P~li-~l~Rra~~~lI~~DI~GVQPmTg 118 (528) +++ . .+++ ...+++. +++... ..-|.+. .+++.+....+-.+++-+.||++ T Consensus 1 ~~~-------------------~-----~~~~~~~~~~~~t-~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 55 (320) T protein:vir:10 1 MAA-------------------G-----TAFQVDHAQIAQT-GDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGT 55 (320) T ss_pred CCC-------------------C-----ccCCHHHHHhhcc-ccccccccccHHHHHHHHHHHHhccchhhhcceeeccC Confidence 111 0 0111 1111111 111111 1223333 35555556677888899999887 Q ss_pred ccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 119 PTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYL 198 (528) Q Consensus 119 PTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~ 198 (528) .+.-|. 
+..+ + +++.| T Consensus 56 ~~~~~p----~~~~------~---------~~a~~--------------------------------------------- 71 (320) T protein:vir:10 56 TGQKIP----HWIG------D---------VSAQW--------------------------------------------- 71 (320) T ss_pred CceEEE----EEeC------C---------cceEE--------------------------------------------- Confidence 642211 0000 0 00000 Q ss_pred cccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccc Q lcl|NC_012740. 199 QNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARY 278 (528) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEY 278 (528) .+| +..+++-..++++++...|..+-.-.+ T Consensus 72 -----------------------------------------v~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~i 101 (320) T protein:vir:10 72 -----------------------------------------IGE---------GDMKPITKGNMTSQNIAPHKIATIFVA 101 (320) T ss_pred -----------------------------------------ecC---------CccccccccceeEEEEeeEEEEEeehh Confidence 001 011222233345555555666666779 Q ss_pred hHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeccccee---ec-cccccceeccccccccccchhH Q lcl|NC_012740. 279 SIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMT---QT-VGSKAGVFDLQDPIDTRGARWA 354 (528) Q Consensus 279 T~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~---~~-~~~~~g~~dl~~~~d~~~~r~a 354 (528) |.||.+|-. .|.|+.|.+.|...|...||+.||. -+......+.. .. .....+..... + -++ T Consensus 102 s~ell~ds~----~~l~~~i~~~l~~a~a~~~d~a~l~---G~g~~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~ 167 (320) T protein:vir:10 102 SAETVRANP----ANYLGTMRTKVATAFAMAFDSAALN---GTDSPFPTYLAQTTKSVSLADPGGATAS---D----LTA 167 (320) T ss_pred hHHHHhcCh----HHHHHHHHHHHHHHHHHHHHHHhhc---ccCCCCCcccccccccccceeccccccc---c----ccc Confidence 999999854 5778999999999999999999873 11100000000 00 00000000000 0 011 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccc---cccccccccCceEEEEecCceEE Q lcl|NC_012740. 
355 GESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGA---AQGLNTDTTKAVFAGVLAGKYKV 431 (528) Q Consensus 355 ~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~---~~~~~~d~~~~~~~G~l~~~~~v 431 (528) .+ .+ +..+...+. ..+-....+||+|.....|..-. ...|. ......+......-++|. +++| T Consensus 168 ~~---~~---~~~~~~~~~--~~~~~~~~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~~~~~~~i~-g~pv 233 (320) T protein:vir:10 168 YD---AV---AVNGLSLLV--NAKKKWTHTLLDDIVEPILNGAK-----DKNGRPLFIESTYTDENSPFRAGRIV-SRPT 233 (320) T ss_pred HH---HH---HHHHHhhhh--cccCCCcEEEEcHHHHHHHHHhh-----ccCCceeeccccccCccccccCceee-eeee Confidence 11 11 112222222 23335678999999999997521 11110 000111122222234554 4899 Q ss_pred EeeCCCCcceEEEEEecCCCccceeEeccccccceeE--------EecCcc-----cc---ceeeeeeeecee-ecC--c Q lcl|NC_012740. 432 FIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLR--------ATDPQS-----FH---PVLGFKTRYGIG-INP--F 492 (528) Q Consensus 432 y~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~--------~~Dp~s-----~q---P~~~~~tRY~l~-~nP--~ 492 (528) +++++.+.+=..+ +-|+-. .+++.-+-...+.+ ..|+.. || =.+=...|+++. .+| | T Consensus 234 ~~~~~~~~~~~~~-~~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~ 309 (320) T protein:vir:10 234 ILSDHVADGTTVG-YMGDFR---NVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAF 309 (320) T ss_pred EecCCCCCCceEE-EEeecc---eEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccce Confidence 9998876543211 112111 11222221111111 111111 11 112233566554 233 2 Q ss_pred cc-ccCCCccc Q lcl|NC_012740. 493 AD-SKSQAPSA 502 (528) Q Consensus 493 ~~-~~~~~~~~ 502 (528) +. 
...-+|.| T Consensus 310 ~~l~~~~ap~~ 320 (320) T protein:vir:10 310 VKLTNVVTPDA 320 (320) T ss_pred EEEEeccCCCC Confidence 11 00112333 No 55 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=82.46 E-value=0.077 Score=26.70 Aligned_cols=351 Identities=11% Similarity=0.082 Sum_probs=127.6 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhcc----------ccchhhhhhhh-ccccccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDP----------IYKDEKVVEAF-GGFIAEAEVAG 69 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~----------~~~~~~~~~~~-~~~l~ea~~~~ 69 (528) |.+-+++++|..-+-.. -+-.+...-+..-..+.+++..+.+.++. ..++....... ...|.+.+-- T Consensus 1 ik~L~e~~~e~~e~~~~-~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~- 78 (390) T protein:vir:40 1 MNNLDKKDSETLNISTA-FLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESK- 78 (390) T ss_pred CchHHHHHHHHHHHHHH-HHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHH- Confidence 88877777776554331 11122222211111122222111111110 01111100000 0111111100 Q ss_pred cCCccccccccccc-cccccccCcch-h-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccc Q lcl|NC_012740. 70 DHGYNASNIASGQT-TGAITNVGPAV-I-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPM 146 (528) Q Consensus 70 ~~g~~~~~~~e~t~-tg~v~~~~P~l-i-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~ 146 (528) ..+ ..+.++++ .+.. .=|.- . .+.+.+-..-+-.++|-+.||++....|... ... 
T Consensus 79 --~~~-~~~~~~~~~~gg~--lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~----~~~------------- 136 (390) T protein:vir:40 79 --YYN-EVIAGNGFAGVTA--LLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISV----GDV------------- 136 (390) T ss_pred --HHH-HHHhccCcccCcc--cccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEE----cCC------------- Confidence 000 00111111 1111 11221 1 2333333444567889999998754433210 000 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 147 YSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 147 ~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) +.+.|- T Consensus 137 --~~a~~~------------------------------------------------------------------------ 142 (390) T protein:vir:40 137 --ATAWWG------------------------------------------------------------------------ 142 (390) T ss_pred --cceeee------------------------------------------------------------------------ Confidence 000000 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) +. .++. -......|.+..|++.|..+-. ..|-||.+|-- .|.|++|.+.|+..| T Consensus 143 ------~E-----~~~~----~~~~~~~f~~i~l~~~k~~~~i-------~iS~ell~ds~----~~l~~~i~~~la~~i 196 (390) T protein:vir:40 143 ------PL-----CAEI----KEVLDNGFDKIQTGMYKLSAYI-------PVCNAMLDLGP----SWLDQYVRTILGEAM 196 (390) T ss_pred ------cc-----cccc----CccccccceeeEeeeeeEEEee-------hhhHHHHhcch----HHHHHHHHHHHHHHH Confidence 00 0000 0112345888888888887644 48899999863 467999999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeeccccccceeccccccc------cccchhHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_012740. 
307 LLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPID------TRGARWAGESFKSLIYQIDKEAAEIARQTGRGA 380 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d------~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~ 380 (528) ..-+|+.||. -+ | .+.+.|++.-....- ....-...+-.-.++..+......-.... +++ T Consensus 197 ~~~~~~a~l~---G~------G----~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~-~~~ 262 (390) T protein:vir:40 197 ALGLEAGIVN---GS------G----KDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS-VSD 262 (390) T ss_pred HHHHHhhhhc---cc------C----CCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhh-hcC Confidence 9999999984 11 1 012333322110000 00000000111222222222111111111 223 Q ss_pred CcEEEEch-hHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEe--------cCCC Q lcl|NC_012740. 381 GNFVIASR-NVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYK--------GDNE 451 (528) Q Consensus 381 gn~~v~S~-~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K--------G~~~ 451 (528) +.| ||++ ..+..|...-++ + |..+....+.+.-+++|+++++.|.+-++.|-- +... T Consensus 263 a~~-i~n~~t~~~~l~~~~~~--~-----------d~~G~~v~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~ 328 (390) T protein:vir:40 263 AIL-VINPADYWSKIYAATSY--M-----------TPQGVWVTGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQV 328 (390) T ss_pred ceE-EEcchhHHHHHHHHhhc--c-----------CCCCccccccCCCceeEEEcCCCCCCcEEEEeeceEEEEeecceE Confidence 445 4554 445555532221 1 111111122223468999998877665555432 1111 Q ss_pred cccee--Eeccccc-----ccee--EEecCccccceeeeeeeec-eeecCcccccCCCccceecccc Q lcl|NC_012740. 452 MDAGI--YYAPYVA-----LTPL--RATDPQSFHPVLGFKTRYG-IGINPFADSKSQAPSARITSGM 508 (528) Q Consensus 452 ~d~g~--fyaPYv~-----~~~~--~~~Dp~s~qP~~~~~tRY~-l~~nP~~~~~~~~~~~~~~~~~ 508 (528) .+-+- +|. +.. .... .++||+.|. ++=++.==+ -.+.||....+-..+.+ +. 
T Consensus 329 v~~~~~~~f~-~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 390 (390) T protein:vir:40 329 IRTSTEYRLL-DDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAIDVNVVNNATPSETP---AE 390 (390) T ss_pred EEecchhhhh-cCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCCcceeeCCCCCCCC---CC Confidence 11000 000 000 0000 134444333 000110000 02223333222111111 11 No 56 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=80.64 E-value=0.093 Score=26.24 Aligned_cols=333 Identities=14% Similarity=0.132 Sum_probs=130.4 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhh--hHHHH-hhhccccchhhhhhhhccccccccccccCCccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILE--SQEAD-FAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASN 77 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~e--nq~~~-~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~ 77 (528) ..+.| ++++=.-+- -+|... ++. +..... +.++. -.+.....+.....+|..+|...+ .... T Consensus 65 ~~~~e-~~~~~~~~~-----~ei~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e-------~~~a 129 (425) T protein:vir:10 65 LPTSD-ALAKVDKVS-----ADLEAL-QAA-VDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGD-------VQAA 129 (425) T ss_pred hccHH-HHHHHHHHH-----HHHHHH-HHH-HHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhh-------hHHH Confidence 11111 111111000 011100 000 000000 00000 000011112222333444443211 1112 Q ss_pred ccccccc-ccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccc Q lcl|NC_012740. 78 IASGQTT-GAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSS 155 (528) Q Consensus 78 ~~e~t~t-g~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG 155 (528) +..++++ |.+ -.-+.+. .+++.+-...+..++|.|.||+++..-+. + .. 
++ +.+.| T Consensus 130 l~~~t~~~gG~-lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~--~---~~-----~~---------~~a~w-- 187 (425) T protein:vir:10 130 LNKGEDSEGGY-LTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKL--F---NM-----GG---------TTSGW-- 187 (425) T ss_pred hhcCcCCCCce-eccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEE--E---Ec-----CC---------cceee-- Confidence 2222211 111 1112222 25555556777888999999987743222 0 00 00 00000 Q ss_pred cccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccc Q lcl|NC_012740. 156 LAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFG 235 (528) Q Consensus 156 ~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~G 235 (528) ++ T Consensus 188 ----------------------------------------------------------------------------v~-- 189 (425) T protein:vir:10 188 ----------------------------------------------------------------------------VG-- 189 (425) T ss_pred ----------------------------------------------------------------------------ec-- Confidence 00 Q ss_pred cchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_012740. 236 MATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIV 315 (528) Q Consensus 236 msTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii 315 (528) |.. ....+....|.++.|++-|..+- ..+|-||.+|- .+|.+++|.+-|+..|..-+|+-|| T Consensus 190 ------E~~-~~~~~~~~~f~~v~~~~~k~~~~-------i~iS~ell~ds----~~~l~~~i~~~la~ai~~~~d~~~l 251 (425) T protein:vir:10 190 ------EAS-QRPQTNAATFQPLSFASGEIYAN-------PAATQQILDDA----EIDLESWLATEVQTEFAKQEGKAFL 251 (425) T ss_pred ------ccc-ccccccccccceeeeeheeeEee-------hHhHHHHHhcc----hhHHHHHHHHHHHHHHHHHHHhhhh Confidence 000 00001112477777777776654 55999999985 3567899999999999999999888 Q ss_pred hhhhhheeecccceeeccccccceeccccc---------------cccccchhHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_012740. 
316 DVINFTAQVGKTGMTQTVGSKAGVFDLQDP---------------IDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGA 380 (528) Q Consensus 316 ~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~---------------~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~ 380 (528) . =+ |. +.+.|++..... .....+--..+....|+..+ .. .+-+ T Consensus 252 ~---G~------G~----~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l-------~~--~~~~ 309 (425) T protein:vir:10 252 A---GD------GT----NKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDL-------PS--AFTG 309 (425) T ss_pred c---cc------CC----CCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhh-------hh--hhcc Confidence 3 11 00 122333221110 00000000112223333322 21 2223 Q ss_pred CcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCc-----ceEEEEEecCCCccce Q lcl|NC_012740. 381 GNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ-----DYFTVGYKGDNEMDAG 455 (528) Q Consensus 381 gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~d~g 455 (528) ....|++|.....|...- ...|- .-...+.+.. ..++|.| ++|+++.+.|. +-|++| +-.. T Consensus 310 ~a~~vmn~~~~~~L~~lk-----D~~G~-~l~~~~~~~g-~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G---d~~~--- 375 (425) T protein:vir:10 310 NARFAMNRNTQRQVRKLK-----DGQGN-YLWQPSYVAG-QPATLAG-YPVTEVPDMPDVAANSTPILFG---DFQQ--- 375 (425) T ss_pred CCEEEEchHHHHHHHHhh-----cCCCc-eeeccCccCC-CCceecc-eeeEEecCcCCccCCccEEEEE---ehhc--- Confidence 446789999998887531 11110 0011121111 1356765 89999887652 334443 1110 Q ss_pred eEeccccccceeEEecCccccceee--eeeeecee-ecCcccccCCCccce Q lcl|NC_012740. 456 IYYAPYVALTPLRATDPQSFHPVLG--FKTRYGIG-INPFADSKSQAPSAR 503 (528) Q Consensus 456 ~fyaPYv~~~~~~~~Dp~s~qP~~~--~~tRY~l~-~nP~~~~~~~~~~~~ 503 (528) +|-=+.-..+.+..||-.-+-.++ ...||+.. 
.+|-+...-.-.-++ T Consensus 376 -~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 376 -TYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred -cEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 111111122333445443333333 34466653 455333222111111 No 57 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=78.96 E-value=0.11 Score=25.86 Aligned_cols=338 Identities=15% Similarity=0.146 Sum_probs=123.4 Q ss_pred CcchHHHHHhhhhhhcC-Ccc-chhccchhh-hhhhhhhhhHHHHhh-------hccccchh--hhhhhhcccccccc-- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN-EKL-PEIATASKQ-KLVAKILESQEADFA-------VDPIYKDE--KVVEAFGGFIAEAE-- 66 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~-~~~-~~~~~~~~~-~~~~~~~enq~~~~~-------~~~~~~~~--~~~~~~~~~l~ea~-- 66 (528) ..+. +-.++|.-+... +.+ -+|....++ +-.-...|.|.+... ++..+++. .....++..+.... T Consensus 30 ~~~~-~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (394) T protein:vir:97 30 ALES-DDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRF 108 (394) T ss_pred hhch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhh Confidence 1111 112333333210 000 001000000 000000000000000 00000000 00000111111000 Q ss_pred -------ccccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCC Q lcl|NC_012740. 67 -------VAGDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAE 137 (528) Q Consensus 67 -------~~~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~ 137 (528) ............+.+.++.+-...-|.-+ .+++.+-+..+...++.+.||+++++-+--++. . 
T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-----~--- 180 (394) T protein:vir:97 109 EGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQR-----A--- 180 (394) T ss_pred hhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEec-----C--- Confidence 00000001111111111111111123322 355555567777889999999887543311110 0 Q ss_pred cccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCc Q lcl|NC_012740. 138 HAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDD 217 (528) Q Consensus 138 ~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~ 217 (528) + +.. T Consensus 181 -~---------~~~------------------------------------------------------------------ 184 (394) T protein:vir:97 181 -T---------TKM------------------------------------------------------------------ 184 (394) T ss_pred -C---------Ccc------------------------------------------------------------------ Confidence 0 000 Q ss_pred cccccccccccccccccccchhhhhhhcccCCCCCcccccc-eeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHH Q lcl|NC_012740. 218 EVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEM-SMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADA 296 (528) Q Consensus 218 ~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EM-aFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ 296 (528) .-+ +| +...++. ...+++++..+|.-+-...+|-||++|- ..|.++ T Consensus 185 ------------~~v--------~E---------~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~ 231 (394) T protein:vir:97 185 ------------VTV--------AE---------LEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVG 231 (394) T ss_pred ------------cee--------cc---------cccccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHH Confidence 000 01 0001111 1334555555555555677999999986 346788 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 
297 ELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQT 376 (528) Q Consensus 297 ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T 376 (528) +|.+-|+..|..-+|..||.-+... .+.+...+ +....++... ... T Consensus 232 ~i~~~la~~~~~~~~~~i~~g~~~~-------------~~~~~~~~-------------~~~~~~~~~~--------~~~ 277 (394) T protein:vir:97 232 IVSESISQIKVNTTNDAIAKVLKSF-------------TTKTVKNL-------------DEIKALLNGG--------FDP 277 (394) T ss_pred HHHHHHHHHHHHHHHHHHhhccccc-------------cccccccH-------------HHHHHHHHhh--------hhh Confidence 8999999888888888888532211 11222111 1111222111 112 Q ss_pred ccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe--eCCCCcceEEEEEecCCCccc Q lcl|NC_012740. 377 GRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI--DQYARQDYFTVGYKGDNEMDA 454 (528) Q Consensus 377 ~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~dy~~vG~KG~~~~d~ 454 (528) .+ .+. +|++|.+...|...- ...|- .-...+.+.. .-++|.| ++|++ |...+..-+++|-- .. T Consensus 278 ~~-~a~-~v~n~~~~~~l~~lk-----d~~G~-~i~~~~~~~~-~~~~l~G-~pv~~~~~~~~~~~~~~~gd~-----~~ 342 (394) T protein:vir:97 278 AY-NVS-LIVSQSFYQTLDTLK-----DGNGR-YLLQDDITAV-SGKVLLG-KPVFVLSDEVLGANKAFIGDF-----KR 342 (394) T ss_pred hh-CCE-EEEcHHHHHHHHHhh-----ccCCC-eeeecCcCCC-CCceecc-ceeEEecccccCCccEEEeec-----cc Confidence 22 233 679999988887631 11110 0011121111 1246766 77776 44445444544421 01 Q ss_pred eeEeccccccceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhhc Q lcl|NC_012740. 455 GIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 455 g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a 514 (528) +.++..-..+. ....|...++..+-...||+.. .+|=+. ..++..+-..-. 
T Consensus 343 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~r~d~~v~~~~a~--------~~~~~~~~~~p~ 394 (394) T protein:vir:97 343 GVLFADRKDLG-LRWADNEIYGQYLQAVLRFGVSKVDDKAG--------YYVTFTPEPLPL 394 (394) T ss_pred cEEEEEecceE-EEEecccccceeEEEEEEEccEEecccce--------EEEEecccccCC Confidence 11122221122 2334555555556666777664 233111 111111111000 No 58 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=77.38 E-value=0.13 Score=25.53 Aligned_cols=218 Identities=12% Similarity=0.088 Sum_probs=100.5 Q ss_pred cccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccc Q lcl|NC_012740. 199 QNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARY 278 (528) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEY 278 (528) .+ ..+.|.+.+... . ...+|.+. ....-+..+|+++ ..+++.|-+.=.=++ T Consensus 1 ~~----------------------~~~~Gdtit~P~-~-iGda~~v~---eG~~i~~~~l~~t--~~~atIk~~gk~~~i 51 (231) T protein:vir:73 1 EN----------------------GINLANLCEYPN-D-IGDAADVA---EGGEISLDKIGTT--TKSVTIKKAAKGTEI 51 (231) T ss_pred Cc----------------------cccCCceEEecc-c-ccchhhhc---CCCcCChhhcccc--ceeeeEeeeccceee Confidence 00 011111111110 0 22333221 1222345556654 444444544333333 Q ss_pred hHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHH Q lcl|NC_012740. 
279 SIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESF 358 (528) Q Consensus 279 T~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~ 358 (528) |=| ..|.+ +| |.-.|..+-|+..|...++.||+..+..++.- .+..++ .+.+..+ T Consensus 52 tD~--a~l~~-~g-Dp~~ea~~Q~~~~iA~kvD~di~~~~~~a~l~-----------~~~~~t----------~d~i~~A 106 (231) T protein:vir:73 52 TDE--AALSG-YG-DPIGESNKQLGLSLANKVDDDLLKAAKTTSQT-----------VSTKAN----------VDGVQAA 106 (231) T ss_pred eHH--HHhhc-cC-chHHHHHHHHHHHHHHhhhHHHHHhhcccccc-----------cccccc----------HHHHHHH Confidence 322 22455 33 88999999999999999999999765533321 011111 1112122 Q ss_pred HHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCC Q lcl|NC_012740. 359 KSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYAR 438 (528) Q Consensus 359 r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 438 (528) ..+ +.++ -....++||+|+++.-|...--+... +..-+.+.-.++ .+|.+.| ++|+++...+ T Consensus 107 ~~~---fgde---------~~~~~vivv~p~~~~~Lrk~~~~~~~---~~~~g~~i~~~G--~iG~i~G-~~Vi~S~~~~ 168 (231) T protein:vir:73 107 LDI---FNDE---------DAQAYVLIVNPKDAAKIRKDANAKNI---GSEVGANALING--TYADVLG-AQIVRSKKLA 168 (231) T ss_pred HHH---hccc---------cccceEEEEcchHHHhhhhccchhhh---hhhhccceeeec--ccceEcc-eEEEEcCCCC Confidence 222 1111 13567999999999998763211111 101111111111 3778866 8999987765 Q ss_pred cceEEEEEecCCCccceeEeccccc--cc----------eeEEecCccccceeeeeeeeceee-cCcccccCCCccceec Q lcl|NC_012740. 439 QDYFTVGYKGDNEMDAGIYYAPYVA--LT----------PLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQAPSARIT 505 (528) Q Consensus 439 ~dy~~vG~KG~~~~d~g~fyaPYv~--~~----------~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~ 505 (528) . ++.++++|+. +. .-.--|+..+.-.+----.|++.. 
|| .+ T Consensus 169 ~--------------~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~----------~~-- 222 (231) T protein:vir:73 169 E--------------GSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDL----------TK-- 222 (231) T ss_pred C--------------CceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcC----------cc-- Confidence 3 2234445532 11 111235555555555555555431 11 00 Q ss_pred ccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 506 SGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 506 ~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ..++.+||+ T Consensus 223 --------------vv~~t~~g~ 231 (231) T protein:vir:73 223 --------------VVNITFTGV 231 (231) T ss_pred --------------EEEEEeecC Confidence 123444555 No 59 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=77.04 E-value=0.13 Score=25.46 Aligned_cols=306 Identities=13% Similarity=0.081 Sum_probs=126.6 Q ss_pred hhhccccchhhhhhhhccccccccccccCCccccccccccccccccccCcchhhHHHHHHhhhhhhhceeeecCCcccce Q lcl|NC_012740. 43 FAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQ 122 (528) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGL 122 (528) +.-||.=.. .++. ..+.+.+..+++++.-.--.+.+=.+++.+.+..+-..++-+.||++++. T Consensus 1 ~~~~~~r~~--------~~~~--------~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~- 63 (326) T protein:vir:42 1 MAVNPDRTT--------PFLG--------VNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQ- 63 (326) T ss_pred CCCCccchh--------hhcC--------cchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCce- Confidence 222220000 0111 11122222222221111111122245565666667778888999887642 Q ss_pred eeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 
123 IFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVT 202 (528) Q Consensus 123 IFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~ 202 (528) | |+-.. ++ +.+.| T Consensus 64 ----~--~p~~~---~~---------~~a~~------------------------------------------------- 76 (326) T protein:vir:42 64 ----K--IPHWT---GD---------VSASW------------------------------------------------- 76 (326) T ss_pred ----E--EEEEe---CC---------cceEE------------------------------------------------- Confidence 1 11000 00 00000 Q ss_pred cccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHH Q lcl|NC_012740. 203 AEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEV 282 (528) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~EL 282 (528) + +| +..++|-..+++++++.+|..+-.-.+|-|| T Consensus 77 -----------------------------v--------~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~el 110 (326) T protein:vir:42 77 -----------------------------I--------GE---------GDMKPITKGNMTSQTIAPHKIATIFVASAET 110 (326) T ss_pred -----------------------------e--------cC---------CccccccccceeEEEEeeEEEEEeehhhHHH Confidence 0 01 1123344455566666677666677899999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHH Q lcl|NC_012740. 283 AQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLI 362 (528) Q Consensus 283 AQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~ 362 (528) .+|- ..|.++.|.+-|+..|...+++.+|.= +-.-...|....... .++..... ... +.......+. 
T Consensus 111 l~~s----~~~~~~~i~~~l~~a~~~~~d~a~l~G---~gs~~p~gi~~~~~~-~~~~~~~~-~~~----~~~~~~~~~~ 177 (326) T protein:vir:42 111 VRAN----PANYLGTMRTKVATAFAMAFDNAAING---TDSPFPTFLAQTTKE-VSLVDPDG-TGS----NADLTVYDAV 177 (326) T ss_pred HhcC----HHHHHHHHHHHHHHHHHHHHHHHhhcc---cCCCccccccccccc-cceeeccc-ccc----cccchhHHHH Confidence 9984 367899999999999999999999831 110000111100000 00000000 000 0000011111 Q ss_pred HHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccc---cccccccccCceEEEEecCceEEEeeCCCCc Q lcl|NC_012740. 363 YQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGA---AQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ 439 (528) Q Consensus 363 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~---~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 439 (528) +..+.... ...+...+.+|++|.....|..-. ...|. ....-.........++|.| ++|+++++.+. T Consensus 178 --~~~~~~~~--~~~~~~~a~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~ 247 (326) T protein:vir:42 178 --AVNALSLL--VNAGKKWTHTLLDDITEPILNGAK-----DKSGRPLFIESTYTEENSPFRLGRIVA-RPTILSDHVAS 247 (326) T ss_pred --HHHHHhhh--hhhccCccEEEEeHHHHHHHHHhh-----ccCCceeeccccccCccccccCceeee-eeEEEcCCCCC Confidence 11111122 222335778899999999987531 11110 0001111112223456655 99999998765 Q ss_pred ceEEEEEecCCCccceeEeccccccceeEE--------ecCcc-----cc---ceeeeeeeeceee-cCcccccCCCccc Q lcl|NC_012740. 440 DYFTVGYKGDNEMDAGIYYAPYVALTPLRA--------TDPQS-----FH---PVLGFKTRYGIGI-NPFADSKSQAPSA 502 (528) Q Consensus 440 dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~--------~Dp~s-----~q---P~~~~~tRY~l~~-nP~~~~~~~~~~~ 502 (528) +=. +++-|+-. -+||...-...+.+. .|+.. || =.+=...|++..+ +| ++ .+ T Consensus 248 ~~~-~~~~Gd~s---~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~------~a-~~ 316 (326) T protein:vir:42 248 GTV-VGYQGDFR---QLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDK------DA-FV 316 (326) T ss_pred Cce-EEEEeecc---eEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc------cc-eE Confidence 432 22333211 123332222222211 11111 22 2333466777652 22 11 12 Q ss_pred eecccchHHhhcchh Q lcl|NC_012740. 
503 RITSGMLSKDSVGKN 517 (528) Q Consensus 503 ~~~~~~~~~~~a~~~ 517 (528) ++. +- .++++ T Consensus 317 ~l~-~~----~~~~~ 326 (326) T protein:vir:42 317 KLT-NV----DATEA 326 (326) T ss_pred EEe-ec----cccCC Confidence 222 11 12222 No 60 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=75.38 E-value=0.15 Score=25.14 Aligned_cols=288 Identities=12% Similarity=0.066 Sum_probs=122.9 Q ss_pred ccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccc Q lcl|NC_012740. 79 ASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLA 157 (528) Q Consensus 79 ~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~ 157 (528) |-.+++|.+.- -+.+. .+++++-++-+..+++-|-||++.. .+|+-.. ++ +.+.| T Consensus 1 mat~~~gg~lv-P~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-------~~~p~~~---~~---------~~a~w---- 56 (311) T protein:vir:81 1 MVALATGTFQL-PKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-------QQYMTLT---AP---------PRGEV---- 56 (311) T ss_pred CceecCCceEc-chhHHHHHHHHHHhcchhhhhcceeecCCCc-------eEEEEEe---CC---------ceeEE---- Confidence 44455555421 12222 5666677788889999999986542 1121100 00 00000 Q ss_pred cCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccc Q lcl|NC_012740. 158 AKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMA 237 (528) Q Consensus 158 ~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gms 237 (528) ++. 
T Consensus 57 --------------------------------------------------------------------------v~E--- 59 (311) T protein:vir:81 57 --------------------------------------------------------------------------VGE--- 59 (311) T ss_pred --------------------------------------------------------------------------eec--- Confidence 000 Q ss_pred hhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhh Q lcl|NC_012740. 238 TSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDV 317 (528) Q Consensus 238 Ta~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~ 317 (528) +|.. ......|.++.+...|.. -....|-||.|+--. -.++-|++|.+-|+..|...|+.-++.= T Consensus 60 ---g~~~----~~~~~~f~~v~l~~~kl~-------~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G 124 (311) T protein:vir:81 60 ---GAQK----SESTATFAPVTAIPRKVQ-------VTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHG 124 (311) T ss_pred ---Cccc----ccccceeeEEEEeeEEEE-------EeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 0000 001123444444444443 345689999875332 1355677888888888888888777732 Q ss_pred hhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcc Q lcl|NC_012740. 318 INFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASA 397 (528) Q Consensus 318 i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~ 397 (528) .....--+..|..........+..+... ....++.-|+.+-..+.. .+.+.+-+|++|+....|... T Consensus 125 ~~~~~~~~~~gi~~~~~~~~~~~~~~~~-----------~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~l 191 (311) T protein:vir:81 125 INPLTGAALSGSPAKILDTTNIVELTTG-----------TSATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQ 191 (311) T ss_pred ccCCCCcccccccccccccceeeeeccc-----------ccchHHHHHHHHHHHhhh--cCCCceEEEEcHHHHHHHHhh Confidence 1100000111111000011111111110 001223334444444432 234677789999999888653 Q ss_pred ccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEE------EEEecCCCc-----c-ceeEeccccccc Q lcl|NC_012740. 
398 DQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFT------VGYKGDNEM-----D-AGIYYAPYVALT 465 (528) Q Consensus 398 g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~------vG~KG~~~~-----d-~g~fyaPYv~~~ 465 (528) . ...|.. -...+.+. -..|+|.| ++|+++.+-+..-.. +...+.... | +.+++...-... T Consensus 192 k-----d~~G~~-l~~~~~~~-~~~~tl~G-~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~ 263 (311) T protein:vir:81 192 R-----DSQGRK-LYPELGFG-TDVASFAG-LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIP 263 (311) T ss_pred h-----ccCCCe-eecCcccc-CCCceecc-eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccce Confidence 1 111110 01111111 12467866 899988765432211 111111110 1 223343333344 Q ss_pred eeEEecCccccc-------eeee--eeeecee-ecCcccccCCCccceecccchH Q lcl|NC_012740. 466 PLRATDPQSFHP-------VLGF--KTRYGIG-INPFADSKSQAPSARITSGMLS 510 (528) Q Consensus 466 ~~~~~Dp~s~qP-------~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~ 510 (528) +.+.-|.+.-++ .++| ..|++.. .+|=+ .+++.+.... T Consensus 264 ~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a-------~~~l~~a~~~ 311 (311) T protein:vir:81 264 LELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA-------FAVVRDADES 311 (311) T ss_pred EEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccc-------eEEEEeeccC Confidence 444333322222 1344 4678754 56611 1222222222 No 61 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=75.32 E-value=0.15 Score=25.13 Aligned_cols=351 Identities=12% Similarity=0.047 Sum_probs=120.4 Q ss_pred Ccc---hHHHHHhhhh---hhcCC----ccchhccchhhhhhh--hhhhhHHHHhhhccccchhh--hhhhh-ccccccc Q lcl|NC_012740. 1 MKT---TKELMEKWSP---LLENE----KLPEIATASKQKLVA--KILESQEADFAVDPIYKDEK--VVEAF-GGFIAEA 65 (528) Q Consensus 1 ~~~---~~~l~~kw~p---~l~~~----~~~~~~~~~~~~~~~--~~~enq~~~~~~~~~~~~~~--~~~~~-~~~l~ea 65 (528) |.= .|+.-++|.- |++.. --++.+..+.+ +.+ .-|+.|.+...+..+-.+.. ..... .....+. 
T Consensus 4 ~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~-l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (390) T protein:vir:62 4 TTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEER-LITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQ 82 (390) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch Confidence 110 1111111221 12110 01111111110 000 00111111110000000000 00000 0000000 Q ss_pred c----------ccccCCcc-----ccccccccccccccccCcchh-hHHHHHH-hhhhhhhceeeecCCcccceeeeeee Q lcl|NC_012740. 66 E----------VAGDHGYN-----ASNIASGQTTGAITNVGPAVI-GMVRRAI-PNLIAFDICGVQPMSTPTSQIFAIRS 128 (528) Q Consensus 66 ~----------~~~~~g~~-----~~~~~e~t~tg~v~~~~P~li-~l~Rra~-~~lI~~DI~GVQPmTgPTGLIFAMRs 128 (528) . ..+..+.. ......++++++-...-|.+. .++..+. ...+...++-|-||++...+-+-... T Consensus 83 ~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~ 162 (390) T protein:vir:62 83 RSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVIT 162 (390) T ss_pred hhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEc Confidence 0 00000000 000001111111000111111 1111111 12234455555555443222111110 Q ss_pred eecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 129 VYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTP 208 (528) Q Consensus 129 rY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~ 208 (528) + + +.+ T Consensus 163 ---~------~---------~~a--------------------------------------------------------- 167 (390) T protein:vir:62 163 ---G------R---------SSA--------------------------------------------------------- 167 (390) T ss_pred ---C------C---------cce--------------------------------------------------------- Confidence 0 0 000 Q ss_pred cCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHh Q lcl|NC_012740. 
209 TKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRA 288 (528) Q Consensus 209 ~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkA 288 (528) +. .+| +..+++-.-++++++..+|..+-....|-||.+|- T Consensus 168 --------------------------~w---v~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds-- 207 (390) T protein:vir:62 168 --------------------------SI---VGE---------TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ-- 207 (390) T ss_pred --------------------------ee---ecc---------cccccccccceeeeEeeeeeEEeehHHHHHHHhhh-- Confidence 00 011 11123333344556666666666678999999992 Q ss_pred hcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccc-hhHHHHHHHHHHHHHH Q lcl|NC_012740. 289 VHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGA-RWAGESFKSLIYQIDK 367 (528) Q Consensus 289 iHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~-r~a~e~~r~L~~~i~~ 367 (528) .+|.+++|.+-|+..|..-+|..||. | .|.+.|++........... -.+. .--+..|+. T Consensus 208 --~~~l~~~i~~~l~~~i~~~~d~~~l~--------G-------~G~p~Gi~~~~~~~~~~~~~~~~~---~~~~~~l~~ 267 (390) T protein:vir:62 208 --VLDLVGFLVSDAGPAIGDAMGRHFIT--------G-------TGQPRGILTDASPATATFLATDTD---SKVSDALID 267 (390) T ss_pred --hHHHHHHHHHHHHHHHHHHHHhhhhc--------c-------CCccccccccccccccceeccccc---ccchHHHHH Confidence 46789999999999999999999883 1 1123444443211100000 0000 001122333 Q ss_pred HHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEe Q lcl|NC_012740. 368 EAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYK 447 (528) Q Consensus 368 ~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K 447 (528) |-+.+...-+. .+ ..|+++.....|..- +...| ..-...+.+.. 
.-++|.| ++|+++.+.|.+=|++|-- T Consensus 268 ~~~~l~~~~~~-~a-~~vmn~~~~~~L~~l-----kd~~g-~~l~~~~~~~g-~~~~l~G-~Pv~~~~~~p~~~i~~gd~ 337 (390) T protein:vir:62 268 LFHEVPSAYRA-NA-KYVVNDLRAAQMRKL-----KDANG-QYLWQSGLTVG-APSLFNG-KVVETDDGMPADKILFADL 337 (390) T ss_pred HHHhhhhhhhc-CC-EEEEchHHHHHHHHh-----hccCC-CeeecCCcCCC-ccceecc-cceEEecCCCCccEEEeec Confidence 43444333222 33 467788888777652 11111 00011121111 1236766 7999999988766555411 Q ss_pred cCCCccceeEeccccccceeEEecCccccceee--eeeeecee-ecCcccccCCCccceecccchHH Q lcl|NC_012740. 448 GDNEMDAGIYYAPYVALTPLRATDPQSFHPVLG--FKTRYGIG-INPFADSKSQAPSARITSGMLSK 511 (528) Q Consensus 448 G~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~--~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~ 511 (528) .. .+...--.....+..|+-.-.-.++ +..|++.. .|| +-.+++.....+ T Consensus 338 ---s~---~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~--------~A~~~l~~~~~a 390 (390) T protein:vir:62 338 ---SK---YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA--------RGAKVLTVTPGA 390 (390) T ss_pred ---cc---eeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeech--------hheEEEEeecCC Confidence 00 0111000112223333322222333 44556543 222 112333222222 No 62 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=74.16 E-value=0.16 Score=24.92 Aligned_cols=348 Identities=14% Similarity=0.130 Sum_probs=117.2 Q ss_pred CcchHHHHHhhhhhhcCCcc-----chhc--cchhhhhhhhhhhhHHHHhhhccccchhh---hhhhhccc---cccc-c Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKL-----PEIA--TASKQKLVAKILESQEADFAVDPIYKDEK---VVEAFGGF---IAEA-E 66 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-----~~~~--~~~~~~~~~~~~enq~~~~~~~~~~~~~~---~~~~~~~~---l~ea-~ 66 (528) +..-++|.++..-+-+-+.+ -++. ....+.-...-..+|++. +..+... ...++... +.++ . 
T Consensus 42 ~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 117 (435) T protein:vir:80 42 SSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKA----PEVKGAKMARMVRALAAARGDAQLASK 117 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccch----hhhhHHHHHHHHHHHHhccchhHHHHH Confidence 22223333333322110000 0000 000000000000011000 0000000 00000000 0000 0 Q ss_pred c--cccCCccccccc-ccc-ccccccccCcchh--hHHHHHHhhhhhhhc-eeeecCCcccceeeeeeeeecCCCCCCcc Q lcl|NC_012740. 67 V--AGDHGYNASNIA-SGQ-TTGAITNVGPAVI--GMVRRAIPNLIAFDI-CGVQPMSTPTSQIFAIRSVYGGDPLAEHA 139 (528) Q Consensus 67 ~--~~~~g~~~~~~~-e~t-~tg~v~~~~P~li--~l~Rra~~~lI~~DI-~GVQPmTgPTGLIFAMRsrY~~~~~s~~G 139 (528) . ....+.+..+.. .++ ..|.+. =|.-+ .+++++-+..+...+ +=+.||+.+. + +|+-.+ ++ T Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~gg~l--vP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~------~~p~~~---~~ 185 (435) T protein:vir:80 118 LAIERGFGEEVAMSLNTLSPGAGGVL--VPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-I------TIPRLK---GG 185 (435) T ss_pred HHHhhhhhhhhhhhhcccCCCCCccc--cchhHHHHHHHHHhhhchhhhccceeeecCCCc-e------EEEEEe---CC Confidence 0 000000000000 000 111110 02211 133333344444444 2244443321 0 110000 00 Q ss_pred cccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccc Q lcl|NC_012740. 140 KEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEV 219 (528) Q Consensus 140 ~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (528) +.+ T Consensus 186 ---------~~a-------------------------------------------------------------------- 188 (435) T protein:vir:80 186 ---------AIV-------------------------------------------------------------------- 188 (435) T ss_pred ---------cce-------------------------------------------------------------------- Confidence 000 Q ss_pred cccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHH Q lcl|NC_012740. 
220 VMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELN 299 (528) Q Consensus 220 ~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELa 299 (528) .-+ +| +...++...++++++...+.-+-....|.||.+|-.- +.|.|+.|. T Consensus 189 ----------~~v--------~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~ 239 (435) T protein:vir:80 189 ----------GYI--------GA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVV 239 (435) T ss_pred ----------eee--------cc---------CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHH Confidence 000 01 0112333444555555555555556799999999432 456788888 Q ss_pred HHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_012740. 300 AILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRG 379 (528) Q Consensus 300 nILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g 379 (528) +-|..-|...+++-||.- . |. ...+.|++.......+... -.+.....++..+.+.-..+.....+- T Consensus 240 ~~l~~a~~~~~d~a~l~G---~------G~---~~~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~~~~~~~~~ 306 (435) T protein:vir:80 240 GDLTAAIGAREDKAFIRD---D------GT---ANTPKGLRFWALPGNVITA-SDGSTLQKIETDLGKAILALENADANL 306 (435) T ss_pred HHHHHHHHHHHHHHhhcc---C------CC---CCcccceeecccccceeec-ccccchhhHHHHHHHHHHHhhcccccc Confidence 888888888888877731 1 10 0023444332211110000 001111222222222222222222222 Q ss_pred CCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc--------eEEEE------ Q lcl|NC_012740. 380 AGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD--------YFTVG------ 445 (528) Q Consensus 380 ~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG------ 445 (528) ....+|++|.....|.... ...|. . 
+..+.++ |+|.| ++||++.+.|.+ -|++| T Consensus 307 ~~~~~vmn~~~~~~L~~lk-----d~~G~-~-l~~~~~~----~~l~G-~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~ 374 (435) T protein:vir:80 307 TQPGWIMAPRTFRFLEGLR-----DGNGN-K-VYPELAN----GMLKG-YPVGKTTQVPINLGEAGKESEIYFTDFGDVF 374 (435) T ss_pred ccCEEEEcHHHHHHHHhhh-----ccCCc-e-eccCCCC----CeEee-eeeEEeccccccccCCCCcceEEEEEcccEE Confidence 3556789999999887642 11111 0 1122222 45655 899998886432 12222 Q ss_pred --EecCCCccceeEeccccccceeEEecCc-----cc---cceeeeeeeeceeec-CcccccCCCccceecccchHHh Q lcl|NC_012740. 446 --YKGDNEMDAGIYYAPYVALTPLRATDPQ-----SF---HPVLGFKTRYGIGIN-PFADSKSQAPSARITSGMLSKD 512 (528) Q Consensus 446 --~KG~~~~d~g~fyaPYv~~~~~~~~Dp~-----s~---qP~~~~~tRY~l~~n-P~~~~~~~~~~~~~~~~~~~~~ 512 (528) -.+.-..+ ..+|.-+ .|+. .| +=.+=..-|+++.+. | +...+++|-.|.. T Consensus 375 i~~~~~~~i~----~~~~~~~-----~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~--------~a~~~l~~~~~~~ 435 (435) T protein:vir:80 375 IGEEETLEID----YSKEATY-----KDADGHMVSAFQRDQTLIRVIAKNDFGPRHV--------ESIAVLSGVAWGA 435 (435) T ss_pred EEeecceEEE----Eeccccc-----cccccchhhhhhcCcceeeeeeeeCcEeecc--------cceEEEeccCCCC Confidence 22221111 1111000 0000 01 122234456655532 2 2234455555553 No 63 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=73.17 E-value=0.17 Score=24.75 Aligned_cols=323 Identities=12% Similarity=0.083 Sum_probs=125.7 Q ss_pred CcchH-------HHHHhhh----------hhhcCCccchh--ccchhhh----hhhhhhhhHHHHhhhccccchhhhhhh Q lcl|NC_012740. 1 MKTTK-------ELMEKWS----------PLLENEKLPEI--ATASKQK----LVAKILESQEADFAVDPIYKDEKVVEA 57 (528) Q Consensus 1 ~~~~~-------~l~~kw~----------p~l~~~~~~~~--~~~~~~~----~~~~~~enq~~~~~~~~~~~~~~~~~~ 57 (528) +...+ +|.++.. ...+.+..... .....+. ....-++...+....-.. .... ... 
T Consensus 41 ~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~ 118 (400) T protein:vir:38 41 LKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNF-EKTD-VGT 118 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHH-HHHH-HHH Confidence 11112 2222222 11111100000 0000000 111111111110000000 0000 000 Q ss_pred hccccccccccccCCccccc-cccc--cccccccccCcc--hhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecC Q lcl|NC_012740. 58 FGGFIAEAEVAGDHGYNASN-IASG--QTTGAITNVGPA--VIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGG 132 (528) Q Consensus 58 ~~~~l~ea~~~~~~g~~~~~-~~e~--t~tg~v~~~~P~--li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~ 132 (528) ..... ........ ...+ +..|.+ .-|. .-.++++.-++.+..+++.+.||++.++-+--++..- T Consensus 119 ~~~~~-------~~~~~~~~~~~~~~~~~~gg~--~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-- 187 (400) T protein:vir:38 119 FAVLR-------AVPTDASDAVNAGVKAADAAS--TIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANAT-- 187 (400) T ss_pred Hhhhh-------hhhHHHHHHHhhcccccCCcc--cccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCC-- Confidence 00000 00000000 0111 111111 1122 1134444556778889999999998865332222100 Q ss_pred CCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcc Q lcl|NC_012740. 133 DPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKAD 212 (528) Q Consensus 133 ~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~ 212 (528) + .+.+ T Consensus 188 ------~----------~~~~----------------------------------------------------------- 192 (400) T protein:vir:38 188 ------T----------KMVT----------------------------------------------------------- 192 (400) T ss_pred ------C----------cccc----------------------------------------------------------- Confidence 0 0000 Q ss_pred cccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCC Q lcl|NC_012740. 
213 SESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGM 292 (528) Q Consensus 213 ~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGL 292 (528) ++.+-.. .......|.+..+.+.|.. -...+|-||.+|- .. T Consensus 193 -------------------~~E~~~~---------~~~~~~~f~~i~~~~~k~~-------~~~~is~ell~ds----~~ 233 (400) T protein:vir:38 193 -------------------VAELEKN---------PAMAKPEFKPVNWSVETYR-------QALPVSQESIDDS----AI 233 (400) T ss_pred -------------------ccccccc---------cccccccceeeEeehhhee-------eehhhHHHHHhhh----HH Confidence 0000000 0001223555555555544 4567999999985 35 Q ss_pred ChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_012740. 293 DADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEI 372 (528) Q Consensus 293 DAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I 372 (528) |.+++|.+-|...|...+|+-|+.-.. . . ...|+..++ ....++.... T Consensus 234 ~~~~~i~~~l~~~~~~~~~~~i~~~~~---~-----~-----~~~~~~~~~-------------~~~~~~~~~~------ 281 (400) T protein:vir:38 234 DLVGLIAQNGQQIKVNTTNGAVATLLK---G-----F-----TAKTISSVD-------------DLKHINNVDL------ 281 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccc---c-----c-----cccccccHH-------------HHHHHHHhhh------ Confidence 678999999999999999998884321 1 0 122222211 1122211111 Q ss_pred HHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCc Q lcl|NC_012740. 373 ARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEM 452 (528) Q Consensus 373 ~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 452 (528) ...+ ...+|+||.....|...- ...|. .-...+.+.. ..++|.| ++|++..+.+.. 
-.| T Consensus 282 --~~~~--~a~~v~~~~~~~~l~~lk-----d~~G~-~i~~~~~~~~-~~~~l~G-~pv~~~~~~~~~-----~~g---- 340 (400) T protein:vir:38 282 --DPAY--SRVIIASQSFYNFLDTVK-----DGNGR-YLLQDSILTP-SGKSVLG-MPIAVVSDDTLG-----AAG---- 340 (400) T ss_pred --hhhh--CcEEEEcHHHHHHHHHhh-----ccCCC-eeeecCcCCC-Ccccccc-ceeEEecccccC-----CCC---- Confidence 1112 235678998888887531 11110 0011122211 1346766 788877664421 111 Q ss_pred cceeEecccc--------ccceeEEecCccccceeeeeeeeceee-cCcccccCCCccceecccchHH Q lcl|NC_012740. 453 DAGIYYAPYV--------ALTPLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSK 511 (528) Q Consensus 453 d~g~fyaPYv--------~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~ 511 (528) +.-++|+.+- ........|-..|+..+-...||+..+ +|=+ ...+.-.+.+ T Consensus 341 ~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a--------~~~l~~~~~a 400 (400) T protein:vir:38 341 EAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKA--------GYFLTYTPKA 400 (400) T ss_pred ceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccc--------eEEEEeecCC Confidence 1122332211 122334566677788888889998762 3311 1111111111 No 64 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=72.16 E-value=0.19 Score=24.58 Aligned_cols=295 Identities=13% Similarity=0.101 Sum_probs=120.1 Q ss_pred hhccccchhhhhhhhccccccccccccCCccccccccccccccccccCcc-hh-hHHHHHHhhhhhhhceeeecCCcccc Q lcl|NC_012740. 44 AVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPA-VI-GMVRRAIPNLIAFDICGVQPMSTPTS 121 (528) Q Consensus 44 ~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~-li-~l~Rra~~~lI~~DI~GVQPmTgPTG 121 (528) ||.+++.+..+. -|...+.+.+. ..+.+... +++++.. =|. +. 
.+++.+..+.+..+++-+.||++.+- T Consensus 1 ~~~~~~~~~~~~-~f~~~~~~~~~-----~~a~~~~~-~~~~~~~--iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQKLKLNLQ-HFASNNVKPQV-----FNPDNVMM-HEKKDGT--LMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CccchhHHHHHH-HHHHhhhhhhh-----hccccccc-cCCCcce--echhHHHHHHHHHHhhcchhhhcceeeccCCce Confidence 221111111110 00001111110 11111111 1112211 122 22 45666677888899999999987652 Q ss_pred eeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 122 QIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNV 201 (528) Q Consensus 122 LIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~ 201 (528) -|- +... + +.+.| T Consensus 72 ~ip----~~~~------~---------~~a~~------------------------------------------------ 84 (324) T protein:vir:97 72 KFT----FWAD------K---------PGAYW------------------------------------------------ 84 (324) T ss_pred EEE----EEec------C---------cceeE------------------------------------------------ Confidence 111 0000 0 00000 Q ss_pred ccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHH Q lcl|NC_012740. 202 TAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIE 281 (528) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~E 281 (528) + +|... -......|.++.|+..|..+- ..+|-| T Consensus 85 ------------------------------v--------~Eg~~--~~~~~~~f~~v~~~~~k~~~~-------~~is~e 117 (324) T protein:vir:97 85 ------------------------------V--------GEGQK--IETSKATWVNATMRAFKLGVI-------LPVTKE 117 (324) T ss_pred ------------------------------e--------ccCcc--ccccccceeEEEEeeEEEEEe-------ehhhHH Confidence 0 01000 000112355555665555554 459999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHH Q lcl|NC_012740. 
282 VAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSL 361 (528) Q Consensus 282 LAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L 361 (528) |.+|- ..|.+++|.+-|+..|...+++.||.---... .+.|++......... ...... T Consensus 118 ll~ds----~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~------------~~~gi~~~~~~~~~~------~~~~~~ 175 (324) T protein:vir:97 118 FLNYT----YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------------FGKSIAQSIEKTNKV------IKGDFT 175 (324) T ss_pred HHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc------------cCcccccccccccee------ccccCC Confidence 99986 36779999999999999999999995211000 122222211100000 000011 Q ss_pred HHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCC--c Q lcl|NC_012740. 362 IYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYAR--Q 439 (528) Q Consensus 362 ~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~ 439 (528) +..|+++.+.|.. .+.....+|++|.....|.... ...| ......+. .++|.| ++|++.+..+ . T Consensus 176 ~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lk-----d~~g-~~~~~~~~-----~~tl~G-~PV~~~~~~~~~~ 241 (324) T protein:vir:97 176 QDNIIDLEALLED--DELEANAFISKTQNRSLLRKIV-----DPET-KERIYDRN-----SDTLDG-LPVVNLKSSNLKR 241 (324) T ss_pred HHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh-----cCCC-ceeecCCC-----Cccccc-eeeEeecCCCCCc Confidence 2234455555543 2234557899999999887531 1111 11111121 245655 7888766532 2 Q ss_pred ceEEEEEecCCCccceeEeccccccceeEEecCcc--------------cc---ceeeeeeeece-eecC--cc-----c Q lcl|NC_012740. 440 DYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQS--------------FH---PVLGFKTRYGI-GINP--FA-----D 494 (528) Q Consensus 440 dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s--------------~q---P~~~~~tRY~l-~~nP--~~-----~ 494 (528) ..+++|-. +.+++... ...-..+.|... || =.+=+..||+. ..|| |+ . 
T Consensus 242 ~~~~~gd~------~~~~i~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) T protein:vir:97 242 GELITGDF------DKLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred ceEEEEec------ccEEEEEe-cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 23333311 01111111 111111111110 11 11122345553 2233 11 1 Q ss_pred ccCCCcccee Q lcl|NC_012740. 495 SKSQAPSARI 504 (528) Q Consensus 495 ~~~~~~~~~~ 504 (528) ..+.+.++++ T Consensus 315 ~~~~~~~~~~ 324 (324) T protein:vir:97 315 KKTDSVPGEV 324 (324) T ss_pred CCCCCCCCCC Confidence 1122344444 No 65 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=68.68 E-value=0.23 Score=24.04 Aligned_cols=271 Identities=12% Similarity=0.052 Sum_probs=115.9 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....| ...+.- .|-....+....+.. ...+...-.....-. . . T Consensus 1 m~~~~T------------------~l~d~i--~Pev~~~~v~~~~~~-------~l~~~~~~~~~~~l~--------g-~ 44 (274) T protein:vir:96 1 MAQGMT------------------KLTNQI--VPEVLAPMMQAELEK-------KLRFASFAEIDNTLV--------G-Q 44 (274) T ss_pred CCccee------------------ehhhee--chHHHHHHHHHHHHh-------hhhccccceeccccc--------C-C Confidence 000000 000000 011111111000000 000000000000000 0 0 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhc-CCChHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH-GMDADAELNAILAN 304 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiH-GLDAE~ELanILSt 304 (528) .|...+...=-.+..+|.. .....-...++..+=. 
+++-+-|+ |+ |.+ -|+-+.. +-|.-.|..+-++. T Consensus 45 ~G~tv~iP~~~~ig~a~~~---~~g~~i~~~~lt~~~~--~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~ 114 (274) T protein:vir:96 45 PGDTLTFPAFIYSGDAKVV---AEGEKIPTDILETKKR--EAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGL 114 (274) T ss_pred CCCEEEeeeecCCCccccc---cCCCccchhhccccee--EEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHH Confidence 0111111110011222211 1112233444443333 33334443 22 222 2666655 35899999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) .|..+++++|+..+..... ++ ....+ ..+.+-....++.++. -.++++ T Consensus 115 ~~a~~vd~~i~~~l~~a~~--------~~--~~~~~-------------~~d~i~~A~~~lgd~~---------~~~~~i 162 (274) T protein:vir:96 115 AHANKVDDDVLEALKSAKL--------TV--EADIT-------------KLTGLQTAIDKFNDED---------LEPMVL 162 (274) T ss_pred HHHHHHHHHHHHHHhcccc--------cc--ccccc-------------CHHHHHHHHHHhcccc---------ccccEE Confidence 9999999999987642211 11 00111 1233333444443331 157899 Q ss_pred EEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcce-EEEEEecCCCccceeEeccccc Q lcl|NC_012740. 385 IASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDY-FTVGYKGDNEMDAGIYYAPYVA 463 (528) Q Consensus 385 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~g~fyaPYv~ 463 (528) ||+|.|++.|......+....... + .....+-.+|.+.| ++||+|...|..- +++| +|.- .||.. 
-+ T Consensus 163 vv~p~~~~~L~k~~~~~f~~~s~~--g--~~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA~-----~~~~~-~~ 230 (274) T protein:vir:96 163 FISPLDAGKLRGDATTNFTRATEL--G--DDVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGAV-----KLITK-RD 230 (274) T ss_pred EeCHHHHHHHHhhccccccccccc--c--ccceeccccceecC-eEEEEeCCCCCceEEEEe-ccce-----eeeec-CC Confidence 999999999987643332221100 0 11112234888865 9999999887433 2222 2211 12221 12 Q ss_pred cceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhh Q lcl|NC_012740. 464 LTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDS 513 (528) Q Consensus 464 ~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~ 513 (528) ...-.--||.+++=.+-..-+||+. .|| + .-.+++.|+-.-.| T Consensus 231 ~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-----~--~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 231 FFLETDRDPSTKTTALYSDKHYVAYLYDE-----S--KAVKITKGSGSLEM 274 (274) T ss_pred cccccccccccccCEEEEeEEEEEEEEcC-----C--cEEEEEcCCccccC Confidence 2223456899999999999999886 344 1 11233322211122 No 66 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=68.68 E-value=0.23 Score=24.04 Aligned_cols=271 Identities=12% Similarity=0.052 Sum_probs=115.9 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....| ...+.- .|-....+....+.. ...+...-.....-. . . 
T Consensus 1 m~~~~T------------------~l~d~i--~Pev~~~~v~~~~~~-------~l~~~~~~~~~~~l~--------g-~ 44 (274) T protein:vir:95 1 MAQGMT------------------KLTNQI--VPEVLAPMMQAELEK-------KLRFASFAEIDNTLV--------G-Q 44 (274) T ss_pred CCccee------------------ehhhee--chHHHHHHHHHHHHh-------hhhccccceeccccc--------C-C Confidence 000000 000000 011111111000000 000000000000000 0 0 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhc-CCChHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH-GMDADAELNAILAN 304 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiH-GLDAE~ELanILSt 304 (528) .|...+...=-.+..+|.. .....-...++..+=. +++-+-|+ |+ |.+ -|+-+.. +-|.-.|..+-++. T Consensus 45 ~G~tv~iP~~~~ig~a~~~---~~g~~i~~~~lt~~~~--~~~i~~~~-~a-~~i---~D~~~~~~~~d~~~~~~~~~~~ 114 (274) T protein:vir:95 45 PGDTLTFPAFIYSGDAKVV---AEGEKIPTDILETKKR--EAKIRKIA-KG-TSI---SDEALLSGYGDPQGEQVRQHGL 114 (274) T ss_pred CCCEEEeeeecCCCccccc---cCCCccchhhccccee--EEEeeeee-cc-eee---hHHHHhhccchHHHHHHHHHHH Confidence 0111111110011222211 1112233444443333 33334443 22 222 2666655 35899999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) .|..+++++|+..+..... ++ ....+ ..+.+-....++.++. -.++++ T Consensus 115 ~~a~~vd~~i~~~l~~a~~--------~~--~~~~~-------------~~d~i~~A~~~lgd~~---------~~~~~i 162 (274) T protein:vir:95 115 AHANKVDDDVLEALKSAKL--------TV--EADIT-------------KLTGLQTAIDKFNDED---------LEPMVL 162 (274) T ss_pred HHHHHHHHHHHHHHhcccc--------cc--ccccc-------------CHHHHHHHHHHhcccc---------ccccEE Confidence 9999999999987642211 11 00111 1233333444443331 157899 Q ss_pred EEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcce-EEEEEecCCCccceeEeccccc Q lcl|NC_012740. 
385 IASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDY-FTVGYKGDNEMDAGIYYAPYVA 463 (528) Q Consensus 385 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~g~fyaPYv~ 463 (528) ||+|.|++.|......+....... + .....+-.+|.+.| ++||+|...|..- +++| +|.- .||.. -+ T Consensus 163 vv~p~~~~~L~k~~~~~f~~~s~~--g--~~~~~~G~ig~~~G-~~Vi~s~~~~~~t~~l~~-~gA~-----~~~~~-~~ 230 (274) T protein:vir:95 163 FISPLDAGKLRGDATTNFTRATEL--G--DDVIVKGAFGEALG-AVIVRSNKLEAGTAILAK-KGAV-----KLITK-RD 230 (274) T ss_pred EeCHHHHHHHHhhccccccccccc--c--ccceeccccceecC-eEEEEeCCCCCceEEEEe-ccce-----eeeec-CC Confidence 999999999987643332221100 0 11112234888865 9999999887433 2222 2211 12221 12 Q ss_pred cceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhh Q lcl|NC_012740. 464 LTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDS 513 (528) Q Consensus 464 ~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~ 513 (528) ...-.--||.+++=.+-..-+||+. .|| + .-.+++.|+-.-.| T Consensus 231 ~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~-----~--~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 231 FFLETDRDPSTKTTALYSDKHYVAYLYDE-----S--KAVKITKGSGSLEM 274 (274) T ss_pred cccccccccccccCEEEEeEEEEEEEEcC-----C--cEEEEEcCCccccC Confidence 2223456899999999999999886 344 1 11233322211122 No 67 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=68.66 E-value=0.23 Score=24.04 Aligned_cols=289 Identities=10% Similarity=0.043 Sum_probs=120.0 Q ss_pred hhhccccchhhhhhhhccccccccccccCCccccccccc-cccccccccCc-chh-hHHHHHHhhhhhhhceeeecCCcc Q lcl|NC_012740. 43 FAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASG-QTTGAITNVGP-AVI-GMVRRAIPNLIAFDICGVQPMSTP 119 (528) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~-t~tg~v~~~~P-~li-~l~Rra~~~lI~~DI~GVQPmTgP 119 (528) ++.. ..+++.+-+.. +++.+-...=| .+. 
.+++.+-+..+..+++.+.||+++ T Consensus 1 ~~~~------------------------~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 56 (318) T protein:vir:24 1 MAAG------------------------TAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT 56 (318) T ss_pred CCCC------------------------CCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 1111 11111111111 11111111112 222 355556667788889999999876 Q ss_pred cceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 120 TSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQ 199 (528) Q Consensus 120 TGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~ 199 (528) +.-|- +... + +.+. T Consensus 57 ~~~ip----~~~~------~---------~~a~----------------------------------------------- 70 (318) T protein:vir:24 57 GQKIP----HWVG------D---------VSAQ----------------------------------------------- 70 (318) T ss_pred ceEEE----EEeC------C---------cceE----------------------------------------------- Confidence 42211 0000 0 0000 Q ss_pred ccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccch Q lcl|NC_012740. 200 NVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYS 279 (528) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT 279 (528) -++ | +..+++...++++++.+.|..+-...+| T Consensus 71 -------------------------------~v~--------E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS 102 (318) T protein:vir:24 71 -------------------------------WIG--------E---------GDMKPITKGNMTSQTIAPHKIATIFVAS 102 (318) T ss_pred -------------------------------Eec--------C---------CccccccccceeEEEEeeEEEEEeehhh Confidence 000 1 0112333344455555555555567899 Q ss_pred HHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccc----cchhHH Q lcl|NC_012740. 
280 IEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTR----GARWAG 355 (528) Q Consensus 280 ~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~----~~r~a~ 355 (528) -||.+|-. .|.+++|.+.|+..|...|++.++.-.. . +.+.|++......... ..-+.. T Consensus 103 ~e~l~ds~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g---~----------~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (318) T protein:vir:24 103 AETVRANP----ANYLGTMRTKVATAFAMAFDGAAMHGTD---S----------PFPTYIGQTTKAISIADTTGATTVYD 165 (318) T ss_pred HHHhhcCh----HHHHHHHHHHHHHHHHHHHHHhhhcccC---C----------CCCcccccccccccccccccccchHH Confidence 99999843 6789999999999999999999983211 0 1122232222111100 000111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccc---cccccccccCceEEEEecCceEEE Q lcl|NC_012740. 356 ESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGA---AQGLNTDTTKAVFAGVLAGKYKVF 432 (528) Q Consensus 356 e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~---~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) +... ++...+. -.......+||||.....|.... ...|. .............-+.+. +++|+ T Consensus 166 ~~~~-------~~~~~~~--~~~~~~~~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~~~~~~~i~-g~pv~ 230 (318) T protein:vir:24 166 QVAV-------NGLSLLV--NDGKKWTHTLLDDITEPILNGAK-----DQNGRPLFIESTYGEAASPFRSGRIV-ARPTI 230 (318) T ss_pred HHHH-------HHHHhhc--cccCCCCEEEEcHHHHHHHHHhh-----ccCCceeecCccccCccccccCceEE-EEeeE Confidence 1111 1222222 22335678899999999998531 11110 000001111111112343 36777 Q ss_pred eeCCCCcce--EEEEEecCCCccceeEeccccccceeE--------EecCcc-----c---cceeeeeeeecee-ecCcc Q lcl|NC_012740. 433 IDQYARQDY--FTVGYKGDNEMDAGIYYAPYVALTPLR--------ATDPQS-----F---HPVLGFKTRYGIG-INPFA 493 (528) Q Consensus 433 ~D~y~~~dy--~~vG~KG~~~~d~g~fyaPYv~~~~~~--------~~Dp~s-----~---qP~~~~~tRY~l~-~nP~~ 493 (528) +.+..+..- +++| +- +.++|+-.-.+...+ ..|+.. | |=.+=...||+.. 
.+|=+ T Consensus 231 ~~~~~~~~~~~~~~g---df---s~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 304 (318) T protein:vir:24 231 LSDHVVEGTTVGFMG---DF---SQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEA 304 (318) T ss_pred EeCCCCCCccEEEEe---ec---ceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 777754321 2222 11 113333222221111 111111 2 2333345677766 33311 Q ss_pred cccCCCccceecccchHHhhcc Q lcl|NC_012740. 494 DSKSQAPSARITSGMLSKDSVG 515 (528) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~a~ 515 (528) .++|. +-.+.-.-| T Consensus 305 -------~~~i~-~~~a~~~~~ 318 (318) T protein:vir:24 305 -------FVALT-NVVSGGGEG 318 (318) T ss_pred -------eEEEE-eeccCCCCC Confidence 11221 111111112 No 68 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=68.10 E-value=0.24 Score=23.96 Aligned_cols=334 Identities=13% Similarity=0.116 Sum_probs=123.2 Q ss_pred Cc---chHHHHHhhh-----------hhhcCC-----------------ccchhccch-----hhhhhhhhhhhHHHHhh Q lcl|NC_012740. 1 MK---TTKELMEKWS-----------PLLENE-----------------KLPEIATAS-----KQKLVAKILESQEADFA 44 (528) Q Consensus 1 ~~---~~~~l~~kw~-----------p~l~~~-----------------~~~~~~~~~-----~~~~~~~~~enq~~~~~ 44 (528) |. ..+.+..+|. -+||+- ..+++.+.. +|....--|.+..+.+. T Consensus 238 l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a 317 (632) T protein:vir:96 238 IGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAA 317 (632) T ss_pred HHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhh Confidence 11 1111122211 011110 001111000 00000000000000000 Q ss_pred hccccchhhhhhhhccccccccccccCCccc------------ccccccc-ccccccccCcchh-hHHHHHHhhhhhhhc Q lcl|NC_012740. 
45 VDPIYKDEKVVEAFGGFIAEAEVAGDHGYNA------------SNIASGQ-TTGAITNVGPAVI-GMVRRAIPNLIAFDI 110 (528) Q Consensus 45 ~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~------------~~~~e~t-~tg~v~~~~P~li-~l~Rra~~~lI~~DI 110 (528) ..+ ........-+... .....|.+. ..+...+ ++|...-....+- .++...-+..|...+ T Consensus 318 ~~~-~~~a~~~~e~a~~-----~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l 391 (632) T protein:vir:96 318 TGD-WSKAGFEREVSLA-----IADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM 391 (632) T ss_pred ccc-hhhhhhhhHHHHH-----HHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh Confidence 000 0000000000000 000111110 0011111 1111100001110 123323345555554 Q ss_pred eeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccc Q lcl|NC_012740. 111 CGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTF 190 (528) Q Consensus 111 ~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f 190 (528) |++.+++.+|-+ + ++... ++ + T Consensus 392 -~~~~~~~~~g~~---~--ip~~~---~~---------~----------------------------------------- 412 (632) T protein:vir:96 392 -GARMLPGLVGDV---D--IPKKT---SG---------A----------------------------------------- 412 (632) T ss_pred -cceEeecCCcce---E--EEEEe---CC---------c----------------------------------------- Confidence 555444443311 1 11000 00 0 Q ss_pred cccccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEe Q lcl|NC_012740. 
191 AETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAK 270 (528) Q Consensus 191 ~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAK 270 (528) ..+-+ +| +...++-..+++++++.+| T Consensus 413 -------------------------------------~a~wv--------~E---------~~~~~~s~~~f~~i~l~~~ 438 (632) T protein:vir:96 413 -------------------------------------NFYWI--------GE---------DEDVQDSDFDFTTLSFSPK 438 (632) T ss_pred -------------------------------------eeEee--------cC---------CccccccccceeeEEeeee Confidence 00000 11 1123344455666777777 Q ss_pred cccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceecccccc---- Q lcl|NC_012740. 271 SRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPI---- 346 (528) Q Consensus 271 SRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~---- 346 (528) +=+-...+|-||..| -.+|.|++|.+-|...|...+++.+|.= + |.. +.+.|++...... T Consensus 439 k~~~~v~iS~ell~d----s~~~~~~~i~~~l~~a~~~~~d~a~l~G---~---G~~------~~p~Gi~~~~~~~~~~~ 502 (632) T protein:vir:96 439 TIAGAVPVTRKLRKQ----SSIHVENLIREDLIEGIGVALDLAMLTG---T---GLA------NDPVGLLNMTGVPALTY 502 (632) T ss_pred EEEEehhhHHHHHhc----cchHHHHHHHHHHHHHHHHHHHHHhhcc---c---CCC------Cccceeeecccccceec Confidence 666667788888776 3688999999999999999999999841 1 111 1234443322111 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEec Q lcl|NC_012740. 347 DTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLA 426 (528) Q Consensus 347 d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~ 426 (528) ...+..| +.+..|+.. |...-........|+++.....|...... ...|.. +.. 
-|+|+ T Consensus 503 ~~~~~~~--~~i~~~~~~-------i~~~~~~~~~~~~~~~~~~~~~l~~~~l~---d~~G~~--i~~-------~~~l~ 561 (632) T protein:vir:96 503 PAGGVDW--ASVVDMETK-------ISTFNADAGRLAYLTSVTQRGAAKKAQVF---DNTGER--IWQ-------NNEVN 561 (632) T ss_pred ccccCCH--HHHHHHHHH-------HhhcccccCccEEEEchhHHHHHHHHhcc---CCCCce--eec-------CCeec Confidence 1111112 122333333 32222222234568898887777653211 111100 100 14675 Q ss_pred CceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecCcc----ccceeeeeeeecee-ecC--cccccCCC Q lcl|NC_012740. 427 GKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQS----FHPVLGFKTRYGIG-INP--FADSKSQA 499 (528) Q Consensus 427 ~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s----~qP~~~~~tRY~l~-~nP--~~~~~~~~ 499 (528) | |+|++.++.+.+-+++|-- +-+|+.-+ +.+...+||.+ .+=.+=...|+++. .+| |+..+..+ T Consensus 562 G-~pv~~s~~ip~~~~~~gd~------s~~~i~~~--~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 562 G-YRAEASNQIPADTWIFGDW------SQIVIAMW--GVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred c-cceEeccccccCcEEEeec------ceEEEEEe--cceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 4 9999999887766555421 11122211 12233445533 33334445666654 344 44333332 No 69 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=68.06 E-value=0.24 Score=23.95 Aligned_cols=285 Identities=10% Similarity=0.047 Sum_probs=119.4 Q ss_pred ccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccc Q lcl|NC_012740. 79 ASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSL 156 (528) Q Consensus 79 ~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~ 156 (528) +..+++++....=|.-+ .+++++.++.+..+++-+.||++++--|--.. . 
.+.+.|-+ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~----~---------------~~~a~wv~- 60 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA----T---------------LPEADWVG- 60 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEe----C---------------CcceEEee- Confidence 23333333222123222 56667777888889999999987752111100 0 00000000 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccccccccccc Q lcl|NC_012740. 157 AAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGM 236 (528) Q Consensus 157 ~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gm 236 (528) .|- T Consensus 61 -----------------------------------------------------------------------------E~~ 63 (305) T protein:vir:25 61 -----------------------------------------------------------------------------ESA 63 (305) T ss_pred -----------------------------------------------------------------------------ccc Confidence 000 Q ss_pred chhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_012740. 237 ATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) Q Consensus 237 sTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~ 316 (528) ... ...++.-..++++++..++..+-.-.+|-||.+|- ..|.|++|.+-|+..|...+++.+|. T Consensus 64 ~~~------------~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~a~~~d~a~~~ 127 (305) T protein:vir:25 64 TDP------------KGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIF 127 (305) T ss_pred ccc------------cccccccccceeeEEeeeEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHhhhhee Confidence 000 00011112233344444444444567999999984 35789999999999999999999983 Q ss_pred hhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhc Q lcl|NC_012740. 
317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) Q Consensus 317 ~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~ 396 (528) =.....-.+..+.. +....--...... . .....-.++..+.++...+...-. ..+-+|+++.....|.. T Consensus 128 G~g~~~~~~~~~~~-----~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~ 196 (305) T protein:vir:25 128 GTDKPASWVSPALI-----PAAVTAGQAVEVV-G---GVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVAN 196 (305) T ss_pred ccCCCCCccccccc-----ccccccccccccc-c---cchhhhHHHHHHHHHHHhhhhccc--ccceeEecHHHHHHHHH Confidence 11000000000000 0000000000000 0 111223344444444444444332 34457889998888864 Q ss_pred cccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc----eEEEEEecCCCccceeEeccccccceeEEecC Q lcl|NC_012740. 397 ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD----YFTVGYKGDNEMDAGIYYAPYVALTPLRATDP 472 (528) Q Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp 472 (528) . +...|. .-.. -++|.| ++|+|..+.+.+ -+++|-. ..+++. .....-....|- T Consensus 197 l-----kd~~G~-~i~~--------~~~l~G-~Pv~~~~~~~~~~~~~~~~~gd~------s~~~i~-~~~~~~i~~~~~ 254 (305) T protein:vir:25 197 I-----RDANGN-PVFR--------DDSFAG-FRTFFNRNGAWDADAAIEVIADS------SRVKIG-VRQDITVKFLDQ 254 (305) T ss_pred h-----hccCCc-eeec--------CCcccc-cceEEcCccCCCCCccEEEEEec------ceEEEE-EecCeEEEEeee Confidence 2 111110 0000 135655 888888775432 1222211 011111 111111111111 Q ss_pred ---------cc-cc-ceee--eeeeece-eecCcccccCCCccceecccchHHhhcchh Q lcl|NC_012740. 473 ---------QS-FH-PVLG--FKTRYGI-GINPFADSKSQAPSARITSGMLSKDSVGKN 517 (528) Q Consensus 473 ---------~s-~q-P~~~--~~tRY~l-~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~ 517 (528) .+ || ..++ ...|||+ +.||=+.-+ ..+.++..-.-.. 
T Consensus 255 ~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~--------~~~~~~~~~~pa~ 305 (305) T protein:vir:25 255 ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQG--------ANKTPVAVVAPAA 305 (305) T ss_pred eeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEE--------EccccccccCCCC Confidence 11 21 1222 4668996 558843211 1122221111111 No 70 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=67.17 E-value=0.26 Score=23.83 Aligned_cols=352 Identities=15% Similarity=0.179 Sum_probs=133.2 Q ss_pred Ccch--------HHHHHhhhhhhcC--Cccchhccchh------hhhhhhh--hhhH----HHHhhhccc-------cch Q lcl|NC_012740. 1 MKTT--------KELMEKWSPLLEN--EKLPEIATASK------QKLVAKI--LESQ----EADFAVDPI-------YKD 51 (528) Q Consensus 1 ~~~~--------~~l~~kw~p~l~~--~~~~~~~~~~~------~~~~~~~--~enq----~~~~~~~~~-------~~~ 51 (528) |+-. ++|+++..-+-+. +.+.++....+ +.+-++| ||++ ++.+.+... -.. T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKVA 80 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Confidence 3322 3444444433211 00111111100 1111111 2222 222111110 000 Q ss_pred hhhhhhhccccccccccccCCccccccccccc-cccc---cccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeee Q lcl|NC_012740. 52 EKVVEAFGGFIAEAEVAGDHGYNASNIASGQT-TGAI---TNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIR 127 (528) Q Consensus 52 ~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~-tg~v---~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMR 127 (528) ....++|..+|-......-...+.+.+..++. .|.+ ..+.+-++.+.| ...+-.+++-+.||++++..+.-.. 
T Consensus 81 ~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (401) T protein:vir:44 81 AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLK---DEVVMRQEATVITVGGSDYKKLVNL 157 (401) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEEec Confidence 01123334343211111101112222232222 1211 234444555555 3556678899999988753221100 Q ss_pred eeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 128 SVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVT 207 (528) Q Consensus 128 srY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~ 207 (528) ++ +...+ T Consensus 158 ----------~~---------~~a~w------------------------------------------------------ 164 (401) T protein:vir:44 158 ----------GG---------TASGW------------------------------------------------------ 164 (401) T ss_pred ----------CC---------cccee------------------------------------------------------ Confidence 00 00000 Q ss_pred ccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHH Q lcl|NC_012740. 208 PTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLR 287 (528) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLk 287 (528) .+|.. .........|.+..+.+.|..+- ..+|-||.+|- T Consensus 165 --------------------------------v~E~~-~~~~~~~~~~~~v~~~~~k~~~~-------~~iS~ell~ds- 203 (401) T protein:vir:44 165 --------------------------------VGETD-TRSQTATSRLGLIEPFMGEIYGN-------PQATQKMLDDA- 203 (401) T ss_pred --------------------------------ecccc-ccCccccccceeeeeehhheeee-------hhhhHHHHhcc- Confidence 00000 00001112466666666665554 56899999984 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccch-h-H------HHHHH Q lcl|NC_012740. 
288 AVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR-W-A------GESFK 359 (528) Q Consensus 288 AiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r-~-a------~e~~r 359 (528) .+|.|++|.+-|+..|...+++.||. | .| .+.+.|++..........+. | . ..... T Consensus 204 ---~~~l~~~i~~~la~ai~~~~~~~~l~--------G-~G----~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~ 267 (401) T protein:vir:44 204 ---FFNVEAWINSELATEFAEQEEIAFTT--------G-DG----TKKPKGFLAYESTEESDKARAFGKLQHIVSGEATA 267 (401) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHhhhhc--------c-CC----CCccceeeccccccccccccccccccccccccccc Confidence 46779999999999999999888883 1 01 11334444322211100000 0 0 00000 Q ss_pred HHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCc Q lcl|NC_012740. 360 SLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ 439 (528) Q Consensus 360 ~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~ 439 (528) --|..|..+-+.+.. .+-.+...|+++.....|...- ...| ..-...+.+.. --++|.| ++|+++...|. T Consensus 268 ~~~d~i~~~~~~l~~--~~~~~a~~v~n~~~~~~L~~lk-----d~~G-~~l~~~~~~~g-~~~~l~G-~PVv~~~~~p~ 337 (401) T protein:vir:44 268 VTADAIIKLIYTLRK--AHRTGAKFMMNNNSLFAIRLLK-----DTEG-NYLWRPGLELG-QPSSLAG-YGIAENEQMPD 337 (401) T ss_pred cCHHHHHHHHHhcch--hhhcCCEEEEcHHHHHHHHHhh-----ccCC-ceeecCCcCCC-CCceecc-eeeEEecCcCC Confidence 001122222222222 2223456789999888887531 1111 01111221111 1246755 88888776542 Q ss_pred -----ceEEEEEecCCCccceeEeccccccceeEEecCccccceeeeee--eecee-ecCcccccCCCccceecccchHH Q lcl|NC_012740. 440 -----DYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKT--RYGIG-INPFADSKSQAPSARITSGMLSK 511 (528) Q Consensus 440 -----dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t--RY~l~-~nP~~~~~~~~~~~~~~~~~~~~ 511 (528) +.+++| +-.. +|-=+.-..+....|+-.=+-.++|.. |++.. 
.+|-+...- T Consensus 338 ~~~~~~~i~~G---d~~~----~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l-------------- 396 (401) T protein:vir:44 338 IAADAKAIAFG---NFKR----GYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLL-------------- 396 (401) T ss_pred ccCCccEEEEe---ehhc----cEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccceEEE-------------- Confidence 112222 1100 011011112233345543344444443 66543 233222111 Q ss_pred hhcchhhhhhhhhhccC Q lcl|NC_012740. 512 DSVGKNAYFRRVWVKGC 528 (528) Q Consensus 512 ~~a~~~~~~r~~~Vk~~ 528 (528) .||-= T Consensus 397 ------------~~~aa 401 (401) T protein:vir:44 397 ------------KIAAA 401 (401) T ss_pred ------------EeecC Confidence 11111 No 71 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=66.00 E-value=0.27 Score=23.66 Aligned_cols=273 Identities=11% Similarity=0.013 Sum_probs=114.6 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....|- .... -.|-...++...... ....+.......... . .. T Consensus 1 ma~~~T~------------------~~d~--iiPev~~~~v~~~~~-------~~l~~~~~~~~d~~l-------~--g~ 44 (274) T protein:vir:97 1 MPQGLTK------------------TSDQ--IIPEVLAPMMQAQLE-------KKLRFASFAEVDSTL-------Q--GQ 44 (274) T ss_pred CCcccee------------------hhhe--echHHHHHHHHHhhh-------hhhhhcccceecccc-------c--CC Confidence 1000000 0000 001011111100000 000000000000000 0 00 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 
226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) .|...+...=-.+..+|.. ......+..++.+ .+.+++.+-|+ |+ |.+.=-..+.+ +-|.-.|..+-++.- T Consensus 45 ~G~tv~iP~~~~~g~a~~~---~~g~~i~~~~lt~--~~~~~~i~~~~-~~-~~i~D~~~~~~--~~dp~~~~~~~~a~a 115 (274) T protein:vir:97 45 PGDTLTFPAFVYSGDAQVV---AEGEKIPTDILET--KKREAKIRKIA-KG-TSITDEALLSG--YGDPQGEQVRQHGLA 115 (274) T ss_pred CCCEEEEeeecCCCccccc---cCCCccccccccc--ceeEEEeeeec-ce-ecccHHHHHhc--cchHHHHHHHHHHHH Confidence 1111111110011222211 1122233444443 34444445555 32 33222222333 468889999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) |..+++.+++..+..++.. +. +..+ ..+-+-.+..++.++. ....+++ T Consensus 116 ~a~~vd~~~~~~l~~a~~~-~~---------~~~~-------------~~d~i~dA~~~l~d~~---------~~~~~iv 163 (274) T protein:vir:97 116 HANKVDNDVLEALMGAKLT-VN---------ADIT-------------KLNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHhccCcc-cc---------cccc-------------CHHHHHHHHHHhhccC---------CCceEEE Confidence 9999999999876433221 10 0011 1333344444444331 2578999 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) |+|.|++.|............... .....+-.+|.+.| ++||+|+..|..-..+--+| .+-|.---+.. 
T Consensus 164 v~p~~~~~L~k~~~~~f~~~s~~g----~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~ 232 (274) T protein:vir:97 164 VNPLDAGKLRGDASTNFTRATELG----DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDFF 232 (274) T ss_pred eCHHHHHHHHhhhhhhccccCccc----ccceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCce Confidence 999999999865322222211111 11112234788865 89999999885432222122 22221111222 Q ss_pred eeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhh Q lcl|NC_012740. 466 PLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDS 513 (528) Q Consensus 466 ~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~ 513 (528) .-.--||..+.=.+-..-+||+. .|| . .-..+..+.-.-.| T Consensus 233 vE~~Rd~~~~~d~i~~~~~y~~~~~~~-----~--~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 233 LEVARDASTKTTALYSDKHYVAYLYDE-----S--KAVKITKGSGSLEM 274 (274) T ss_pred eccccchhhcccEEEEEEEEEEEEEcC-----C--ceEEEecCcccccC Confidence 23445888898888888899885 344 0 01122211111111 No 72 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=66.00 E-value=0.27 Score=23.66 Aligned_cols=273 Identities=11% Similarity=0.013 Sum_probs=114.6 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....|- .... -.|-...++...... ....+.......... . .. 
T Consensus 1 ma~~~T~------------------~~d~--iiPev~~~~v~~~~~-------~~l~~~~~~~~d~~l-------~--g~ 44 (274) T protein:vir:94 1 MPQGLTK------------------TSDQ--IIPEVLAPMMQAQLE-------KKLRFASFAEVDSTL-------Q--GQ 44 (274) T ss_pred CCcccee------------------hhhe--echHHHHHHHHHhhh-------hhhhhcccceecccc-------c--CC Confidence 1000000 0000 001011111100000 000000000000000 0 00 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) .|...+...=-.+..+|.. ......+..++.+ .+.+++.+-|+ |+ |.+.=-..+.+ +-|.-.|..+-++.- T Consensus 45 ~G~tv~iP~~~~~g~a~~~---~~g~~i~~~~lt~--~~~~~~i~~~~-~~-~~i~D~~~~~~--~~dp~~~~~~~~a~a 115 (274) T protein:vir:94 45 PGDTLTFPAFVYSGDAQVV---AEGEKIPTDILET--KKREAKIRKIA-KG-TSITDEALLSG--YGDPQGEQVRQHGLA 115 (274) T ss_pred CCCEEEEeeecCCCccccc---cCCCccccccccc--ceeEEEeeeec-ce-ecccHHHHHhc--cchHHHHHHHHHHHH Confidence 1111111110011222211 1122233444443 34444445555 32 33222222333 468889999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) |..+++.+++..+..++.. +. +..+ ..+-+-.+..++.++. ....+++ T Consensus 116 ~a~~vd~~~~~~l~~a~~~-~~---------~~~~-------------~~d~i~dA~~~l~d~~---------~~~~~iv 163 (274) T protein:vir:94 116 HANKVDNDVLEALMGAKLT-VN---------ADIT-------------KLNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHhccCcc-cc---------cccc-------------CHHHHHHHHHHhhccC---------CCceEEE Confidence 9999999999876433221 10 0011 1333344444444331 2578999 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 
386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) |+|.|++.|............... .....+-.+|.+.| ++||+|+..|..-..+--+| .+-|.---+.. T Consensus 164 v~p~~~~~L~k~~~~~f~~~s~~g----~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~~ 232 (274) T protein:vir:94 164 VNPLDAGKLRGDASTNFTRATELG----DDIIVKGAFGEALG-AIIVRTNKLEAGTAILAKKG------AVKLILKRDFF 232 (274) T ss_pred eCHHHHHHHHhhhhhhccccCccc----ccceeccccceecC-eeEEEcCCCCcceEEEEeCc------ceEeeecCCce Confidence 999999999865322222211111 11112234788865 89999999885432222122 22221111222 Q ss_pred eeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhh Q lcl|NC_012740. 466 PLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDS 513 (528) Q Consensus 466 ~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~ 513 (528) .-.--||..+.=.+-..-+||+. .|| . .-..+..+.-.-.| T Consensus 233 vE~~Rd~~~~~d~i~~~~~y~~~~~~~-----~--~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 233 LEVARDASTKTTALYSDKHYVAYLYDE-----S--KAVKITKGSGSLEM 274 (274) T ss_pred eccccchhhcccEEEEEEEEEEEEEcC-----C--ceEEEecCcccccC Confidence 23445888898888888899885 344 0 01122211111111 No 73 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=64.96 E-value=0.29 Score=23.52 Aligned_cols=289 Identities=12% Similarity=0.057 Sum_probs=121.9 Q ss_pred hccccccccccccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCC Q lcl|NC_012740. 58 FGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLA 136 (528) Q Consensus 58 ~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s 136 (528) |...- ..+.....+...|.+ .-|.++ .+++++.++.+-.+++-+.||+++.- +|.-.. 
T Consensus 1 m~~~~----------~~a~~~~~t~~~g~~--i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-------~~p~~~-- 59 (330) T protein:vir:77 1 MAGST----------VPSTQVALTGDFSAF--LTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGI-------SIPHWT-- 59 (330) T ss_pred Ccccc----------cchhhccccCCCcce--echhHHHHHHHHHHhccchhhhcceeeccCCce-------EEEEEc-- Confidence 21111 111111111111111 223444 56677778888999999999987641 111000 Q ss_pred CcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccC Q lcl|NC_012740. 137 EHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESD 216 (528) Q Consensus 137 ~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~ 216 (528) ++ +.+.| T Consensus 60 -~~---------~~a~~--------------------------------------------------------------- 66 (330) T protein:vir:77 60 -GA---------VSASW--------------------------------------------------------------- 66 (330) T ss_pred -CC---------cceeE--------------------------------------------------------------- Confidence 00 00000 Q ss_pred ccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHH Q lcl|NC_012740. 217 DEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADA 296 (528) Q Consensus 217 ~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ 296 (528) + +| +..+++-..++++++...|..+-+..+|-||.+|- ..|.|+ T Consensus 67 ---------------v--------~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~ 110 (330) T protein:vir:77 67 ---------------T--------GE---------AERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLN 110 (330) T ss_pred ---------------e--------cC---------CCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHH Confidence 0 01 11233334456677777777777778999999983 578899 Q ss_pred HHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 
297 ELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQT 376 (528) Q Consensus 297 ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T 376 (528) +|.+-|+..|...||+-||.=--...- -.|+.... .+.....+...... ......++..+.++-..+.+. T Consensus 111 ~i~~~l~~ai~~~~~~~~l~G~g~~~~--~~g~~~~~------~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~- 180 (330) T protein:vir:77 111 TMRTKIAEAIALKFDAAAIHGIDKPSA--FKGYLAET------TKVVSLADTNLTTA-SGPQGNAYLAVNNALSLLVNS- 180 (330) T ss_pred HHHHHHHHHHHHHHHHHhhcccCCCCc--cccccccc------cccceeeccccccc-ccccchhHHHHHHHHHhhhhc- Confidence 999999999999999999830000000 00000000 00000000000000 001122334444444444443 Q ss_pred ccCCCcEEEEchhHHHHhhccccccccccccc---cccccccccCceEEEEecCceEEEeeCCCCc-------------- Q lcl|NC_012740. 377 GRGAGNFVIASRNVVNILASADQGISLAMQGA---AQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ-------------- 439 (528) Q Consensus 377 ~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~---~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-------------- 439 (528) ....+.+||+|+....|.... ...|. ......+......-++|.| ++||++.+.+. T Consensus 181 -~~~~~~~vmn~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~~~~~~~l~G-~PV~~~~~~p~~~~~~~~~~~~gd~ 253 (330) T protein:vir:77 181 -GKKWTGTLLDNVTEPILNTAV-----DGNGRPLFVESTYTEQVGAIREGRILG-RPTYVADNVVNGTVGNRVVGVMGDF 253 (330) T ss_pred -CCCccEEEEcHHHHHHHHHHh-----ccCCceeecCccccccccccCCceecc-eeeEEeccccCCCCCCccEEEEEec Confidence 235667899999999887521 11110 0111111111112346655 89999988642 Q ss_pred ceEEEEEecCCCc----cceeEe----------ccccccceeEEecCccccceeeeeeeecee-ecC--c------cccc Q lcl|NC_012740. 440 DYFTVGYKGDNEM----DAGIYY----------APYVALTPLRATDPQSFHPVLGFKTRYGIG-INP--F------ADSK 496 (528) Q Consensus 440 dy~~vG~KG~~~~----d~g~fy----------aPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP--~------~~~~ 496 (528) .++++|-.+..+. ++.+.+ .++.. | ..-+=.+=...|++.. .+| | +-+. 
T Consensus 254 s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~--f------~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~ 325 (330) T protein:vir:77 254 SQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISL--W------QHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGT 325 (330) T ss_pred ceEEEEEecCcEEEEeecceeeecccccccccccccch--h------hcCcEEEEEEEEeccEEecccceEEEEeccCCc Confidence 1223333322211 111111 11100 0 0001111122244333 122 1 1111 Q ss_pred CCCcc Q lcl|NC_012740. 497 SQAPS 501 (528) Q Consensus 497 ~~~~~ 501 (528) +..++ T Consensus 326 ~~~~~ 330 (330) T protein:vir:77 326 DPEEE 330 (330) T ss_pred CCCCC Confidence 11111 No 74 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=64.78 E-value=0.29 Score=23.50 Aligned_cols=347 Identities=12% Similarity=0.116 Sum_probs=120.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccc---------------------cchhhhhhhhc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPI---------------------YKDEKVVEAFG 59 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~---------------------~~~~~~~~~~~ 59 (528) ..+.|+ .+++.-+.. +|.+.- ..| .+ +|.+++..+.... -..+.....|+ T Consensus 29 ~lt~ee-~~~~~~l~~-----ei~~l~-~~I-~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (435) T protein:vir:14 29 ALSVEQ-QAEFDQLSS-----KFSELT-AQI-ER-AEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMA 99 (435) T ss_pred CCCHHH-HHHHHHHHH-----HHHHHH-HHH-HH-HHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHH Confidence 233332 344544432 122211 111 00 1222211110000 00000000111 Q ss_pred ccc---cccc--------cccc--CCccccc-ccccc-ccccccccCcchh--hHHHHHHhhhhhhhc-eeeecCCcccc Q lcl|NC_012740. 60 GFI---AEAE--------VAGD--HGYNASN-IASGQ-TTGAITNVGPAVI--GMVRRAIPNLIAFDI-CGVQPMSTPTS 121 (528) Q Consensus 60 ~~l---~ea~--------~~~~--~g~~~~~-~~e~t-~tg~v~~~~P~li--~l~Rra~~~lI~~DI-~GVQPmTgPTG 121 (528) .++ ..+. .... .+....+ +..++ ..|.+. 
=|.-+ .+++++.++.+..++ +-+.||+... T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~--vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~- 176 (435) T protein:vir:14 100 RMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVL--VPENLSSEVIELLRPKSVVRKLGARTLPLSNGN- 176 (435) T ss_pred HHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccc--cchhHHHHHHHHHhhhchhhhhcceeeecCCCc- Confidence 110 0000 0000 0000000 00000 111110 02111 234434444554444 2233332210 Q ss_pred eeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 122 QIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNV 201 (528) Q Consensus 122 LIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~ 201 (528) + +|+.. .+.. T Consensus 177 ~------~~p~~--------------------~~~~-------------------------------------------- 186 (435) T protein:vir:14 177 I------TIPRL--------------------KGGA-------------------------------------------- 186 (435) T ss_pred e------EEEEE--------------------eCCc-------------------------------------------- Confidence 0 01000 0000 Q ss_pred ccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHH Q lcl|NC_012740. 202 TAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIE 281 (528) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~E 281 (528) ..+-+ +| +...++-.-++++++..++.-+-....|-| T Consensus 187 --------------------------~a~~v--------~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~e 223 (435) T protein:vir:14 187 --------------------------IVGYI--------GA---------DTDIPTTQQQFDDLKLTAKKMAALVPIAND 223 (435) T ss_pred --------------------------ceeee--------cc---------CccccccccceeEEEeeeEEEEEeehhhHH Confidence 00000 01 011223333445555555555555679999 Q ss_pred HHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHH Q lcl|NC_012740. 
282 VAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSL 361 (528) Q Consensus 282 LAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L 361 (528) |.+| +....+.|+.|.+-|+..|...+|+-|+. -+ |. ...+.|++....+..+...- ....+..+ T Consensus 224 ll~d--s~~~~~l~~~i~~~l~~ai~~~~d~a~l~---G~------G~---~~~p~Gi~~~~~~~~~~~~~-~~~~~~~~ 288 (435) T protein:vir:14 224 LIKY--AGVNPNVDQIVVGDLTAAIGAREDKAFIR---DD------GT---ANTPKGLRFWALPSNVITAS-DASTLQKI 288 (435) T ss_pred HHHh--hccCHHHHHHHHHHHHHHHHHHHHHHhhc---cC------CC---Cccccceeecccccceeccc-cccchhhH Confidence 9999 32233477888888888888888888772 11 00 01244443322111000000 00111222 Q ss_pred HHHHHHHHHHHHHh-hccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc Q lcl|NC_012740. 362 IYQIDKEAAEIARQ-TGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD 440 (528) Q Consensus 362 ~~~i~~~a~~I~~~-T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 440 (528) +..+.++-..+... ..+ ....+|++|.....|...- ...|. . +..+.+ -|+|.| ++|+++++.|.+ T Consensus 289 ~~~~~~l~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lk-----d~~G~-~-l~~~~~----~g~l~G-~Pv~~~~~~p~~ 355 (435) T protein:vir:14 289 ETDLGKVILALENADANL-TQPGWIMAPRTFRFLEGLR-----DGNGN-K-VYPELA----NGMLKG-YPVGKTTQVPIN 355 (435) T ss_pred HHHHHHHHHHhhhccccc-cCCEEEEcHHHHHHHHHhh-----ccCCc-e-eccCCC----CCeeec-ceeEeecccccc Confidence 22233332333221 233 2445789999999887642 11110 0 111222 256766 899998775442 Q ss_pred --------eEEEE--------EecCCCccceeEeccccccceeEEecCccc---cceeeeeeeeceee-cCcccccCCCc Q lcl|NC_012740. 
441 --------YFTVG--------YKGDNEMDAGIYYAPYVALTPLRATDPQSF---HPVLGFKTRYGIGI-NPFADSKSQAP 500 (528) Q Consensus 441 --------y~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~---qP~~~~~tRY~l~~-nP~~~~~~~~~ 500 (528) -+++| ..+.-+ +-..||.-........-..| |=.+=...|++..+ +| + T Consensus 356 ~~~~~~~~~i~~gd~s~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~--------~ 423 (435) T protein:vir:14 356 LGETGKESEIYFTDFGDVFIGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHV--------E 423 (435) T ss_pred ccCCCccceEEEeecccEEEEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCceeecc--------c Confidence 13333 222211 12222211000000000001 12233455666541 12 1 Q ss_pred cceecccchHHh Q lcl|NC_012740. 501 SARITSGMLSKD 512 (528) Q Consensus 501 ~~~~~~~~~~~~ 512 (528) ...+..|-+|.. T Consensus 424 a~~~l~~~~~~~ 435 (435) T protein:vir:14 424 SIAVLAGVAWGA 435 (435) T ss_pred ceEEEecCCCCC Confidence 234445555544 No 75 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=64.64 E-value=0.3 Score=23.48 Aligned_cols=349 Identities=13% Similarity=0.052 Sum_probs=118.8 Q ss_pred Cc-----c-----hHHHHHhhhhhhcCCccchhccchh------hhhh---hhhhhhHHHHhhhccccchhhhhhhhccc Q lcl|NC_012740. 1 MK-----T-----TKELMEKWSPLLENEKLPEIATASK------QKLV---AKILESQEADFAVDPIYKDEKVVEAFGGF 61 (528) Q Consensus 1 ~~-----~-----~~~l~~kw~p~l~~~~~~~~~~~~~------~~~~---~~~~enq~~~~~~~~~~~~~~~~~~~~~~ 61 (528) |. . .++|++...-+-+.+-..++..... +.+- .+.+|.+++....... +....+.. 
T Consensus 4 ~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~-----~~~~~~~~ 78 (392) T protein:vir:13 4 TTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSL-----LSGLQGSG 78 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hcccCCcc Confidence 11 0 1122222211111111122211111 1111 1122222111111100 00000000 Q ss_pred ccccc---c--------cccCCc-----cccccccccccccccccCcchh-hHHHHHHh-hhhhhhceeeecCCccccee Q lcl|NC_012740. 62 IAEAE---V--------AGDHGY-----NASNIASGQTTGAITNVGPAVI-GMVRRAIP-NLIAFDICGVQPMSTPTSQI 123 (528) Q Consensus 62 l~ea~---~--------~~~~g~-----~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~-~lI~~DI~GVQPmTgPTGLI 123 (528) . +.+ . .+..+. .......++++++-...-|.+. .++.+... ..+...++-|-|+++...+- T Consensus 79 ~-~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (392) T protein:vir:13 79 S-GAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMD 157 (392) T ss_pred c-chhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeE Confidence 0 000 0 000000 0000111122221111112111 12221221 22334444444433221110 Q ss_pred eeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 124 FAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTA 203 (528) Q Consensus 124 FAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~ 203 (528) +- + ... .+ T Consensus 158 ~~-~--~~~---------------~~------------------------------------------------------ 165 (392) T protein:vir:13 158 FT-V--ITG---------------RA------------------------------------------------------ 165 (392) T ss_pred EE-E--EcC---------------Cc------------------------------------------------------ Confidence 00 0 000 00 Q ss_pred ccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHH Q lcl|NC_012740. 
204 EQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVA 283 (528) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELA 283 (528) ..+-+ +| +..+++-...+++++...|.-+-...+|-||. T Consensus 166 ------------------------~a~~v--------~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell 204 (392) T protein:vir:13 166 ------------------------TAGIV--------GE---------TAEIPESYPATTQRSMGGFKYGFASVVSYEFA 204 (392) T ss_pred ------------------------ceeee--------cc---------cccccccccceeeEEeeeeeEEeeehhHHHHH Confidence 00000 01 01123333333444444444444567999999 Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHH Q lcl|NC_012740. 284 QDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIY 363 (528) Q Consensus 284 QDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~ 363 (528) +|= ..|.++.|.+-|...|..-+|..||. | .| .+.+.|++......+... -|+ ....-.+. T Consensus 205 ~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~--------G-~G----t~~p~Gil~~~~~~~~~~-~~~-~~~~~~~d 265 (392) T protein:vir:13 205 TDQ----VLDLVGFLVSDAGPAIGDAMGRHFLT--------G-TG----TGQPRGILTDATGANAAF-GEA-DADSKVSD 265 (392) T ss_pred hcc----hHHHHHHHHHHHHHHHHHHHHHHHhc--------c-cC----Cccccccccccccccccc-ccc-ccccccHH Confidence 983 46788999999999999999998883 1 01 123445543322111110 010 00011122 Q ss_pred HHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEE Q lcl|NC_012740. 364 QIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFT 443 (528) Q Consensus 364 ~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 443 (528) .|.++-+.+... +.++...|+++.....|..- +...| ..-...+.+... 
-++|.| ++||++.+.|.+-|+ T Consensus 266 ~l~~~~~~l~~~--~~~~a~~v~n~~~~~~l~~l-----kd~~G-~~l~~~~~~~g~-~~~l~G-~Pv~~~~~~~~~~i~ 335 (392) T protein:vir:13 266 ALIDLFHEVPSA--YRKNAKFVVNDLRAAQMRKL-----KDANG-QYLWQSALTVGA-PDTFNG-KVVETDDGMPADKVL 335 (392) T ss_pred HHHHHHHhhhhh--hhcCCEEEEcHHHHHHHHHh-----hccCC-ceeecCCcCCCC-Cceecc-eeeEEcCCCCCCcEE Confidence 223333333332 22344568899988887752 11111 000111111110 136765 999999999877666 Q ss_pred EEEecCCCccceeEeccccccceeEEecCccccceeee--eeeecee-ecCcccccCCCccceecccchHH Q lcl|NC_012740. 444 VGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGF--KTRYGIG-INPFADSKSQAPSARITSGMLSK 511 (528) Q Consensus 444 vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~ 511 (528) +|-- . -.++.---.....+..|+..-...++| ..|++.. .||=+. ++......+ T Consensus 336 ~Gdf---~---~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~--------~~~~~~~aa 392 (392) T protein:vir:13 336 FADL---S---KYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGA--------KVLTVTPAA 392 (392) T ss_pred Eeec---c---ceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecccce--------EEEEeeccC Confidence 6521 0 011111111222233344322233333 3455433 334211 111111111 No 76 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=61.24 E-value=0.36 Score=23.04 Aligned_cols=334 Identities=12% Similarity=0.047 Sum_probs=125.7 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhh--hhhHHHHhhhccccchhhhhhhhcccccccc-----------c Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKI--LESQEADFAVDPIYKDEKVVEAFGGFIAEAE-----------V 67 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~--~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~-----------~ 67 (528) -..+++..++-.-+.. |+ +.+-++| +|.+...+.... 
-....-..+.+....+.+ - T Consensus 30 ~~~~~e~~~~~~~l~~-----e~-----~~l~~~i~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (390) T protein:vir:81 30 GELNASARSKVDELFA-----TV-----GNLSAEVQAARQRVAELEGNG-AGGDVQHVSVGDMFVASEQFQASAGRWNDR 98 (390) T ss_pred cCcCHHHHHHHHHHHH-----HH-----HHHHHHHHHHHHHHHHHHhcc-cccccccccchhhhhhhHHHHHHHHHHhhh Confidence 0111112222111111 01 1111111 111111110000 000000000000000000 0 Q ss_pred cccCCcccccc----ccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccc Q lcl|NC_012740. 68 AGDHGYNASNI----ASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEA 142 (528) Q Consensus 68 ~~~~g~~~~~~----~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA 142 (528) .+....+.... ..++++.+-.-..|..+ .++++.-+..+-.++|.+.||++++.-+.-.. ... T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~~~-------- 166 (390) T protein:vir:81 99 SARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFV-------- 166 (390) T ss_pred hhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEe----cCC-------- Confidence 00000000000 00111111112233333 45555566777889999999988762221100 000 Q ss_pred ccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccc Q lcl|NC_012740. 143 FHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMK 222 (528) Q Consensus 143 ~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (528) +...+ T Consensus 167 ------~~a~~--------------------------------------------------------------------- 171 (390) T protein:vir:81 167 ------NNAAI--------------------------------------------------------------------- 171 (390) T ss_pred ------cceee--------------------------------------------------------------------- Confidence 00000 Q ss_pred ccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHH Q lcl|NC_012740. 
223 LMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAIL 302 (528) Q Consensus 223 ~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanIL 302 (528) ++.|- + . ......|.++.+.+.|..+- ..+|-||.+|-- +.++.|.+-| T Consensus 172 ---------v~Eg~-----~-~----~~~~~~~~~i~~~~~k~~~~-------~~is~ell~d~~-----~~~~~i~~~l 220 (390) T protein:vir:81 172 ---------VAEGA-----L-K----PESSLKFAKKTDTTHVIAHT-------MKATRQILSDAP-----QLASYMNNRL 220 (390) T ss_pred ---------ecCCc-----c-c----ccccceeeEEEEeeeEEEEe-------ehhhHHHHHhHH-----HHHHHHHHHH Confidence 00000 0 0 00122366666666665554 558999999842 4689999999 Q ss_pred HHHHHHHhhHHHHhhhhhheeecccceeeccccccceecccccccc---ccchhHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_012740. 303 ANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDT---RGARWAGESFKSLIYQIDKEAAEIARQTGRG 379 (528) Q Consensus 303 StEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~---~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g 379 (528) +..|...+|+-||.- .-. -..+.|++........ .......+....++.++ . ..+. T Consensus 221 ~~~~~~~~d~a~l~G---~g~---------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~--~~~~ 279 (390) T protein:vir:81 221 IRGLKVKEDAEILRG---TGA---------NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQA-------S--LAEY 279 (390) T ss_pred HHHHHHHHHHHHHhc---CCC---------CCcccceeecccccccccccccchhHHHHHHHHHhh-------c--cccC Confidence 999999999988831 000 0013344332111000 00011222233333222 2 2233 Q ss_pred CCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEec Q lcl|NC_012740. 380 AGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYA 459 (528) Q Consensus 380 ~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fya 459 (528) ..+.+|++|.....|...- ...|. -+..+.... -.++|.| ++|++.+..|.+-+++|---. .++. 
T Consensus 280 ~~~~~v~~~~~~~~l~~lk-----d~~G~--~l~~~~~~~-~~~~l~G-~pv~~~~~~p~~~~~~gd~~~-----~~~~- 344 (390) T protein:vir:81 280 NPSGIVINPIDWAAIELAK-----DANNQ--YLIGNARGT-LTPTLWG-LPVVATQAMAPGEFLVGAFDL-----AAQI- 344 (390) T ss_pred CCCEEEEcHHHHHHHHHhh-----cCCCc--eeecCcccc-cCceecc-eeeEEcCCCCCCcEEEEehhc-----eEEE- Confidence 5677899999988887531 11110 011111111 1236655 899999998877666653210 0111 Q ss_pred ccccccee-EEec-Cc---cccceeeeeeeece-eecCcccccCCCccceeccc Q lcl|NC_012740. 460 PYVALTPL-RATD-PQ---SFHPVLGFKTRYGI-GINPFADSKSQAPSARITSG 507 (528) Q Consensus 460 PYv~~~~~-~~~D-p~---s~qP~~~~~tRY~l-~~nP~~~~~~~~~~~~~~~~ 507 (528) +.-..+. ...+ +. +-+=.+=...|++. +.+|=+. .+++=+ T Consensus 345 -~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~-------v~~t~a 390 (390) T protein:vir:81 345 -FDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEAL-------ISGSFA 390 (390) T ss_pred -EEecceEEEEecccchhhcCcEEEEEEEeeccEEecccce-------EEEEeC Confidence 1111111 1111 11 12223334556666 3344111 111101 No 77 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=60.69 E-value=0.37 Score=22.97 Aligned_cols=341 Identities=12% Similarity=0.051 Sum_probs=125.6 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhh--hhhhhHHHHhhhcc--------ccchhhhhhhhcccccccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVA--KILESQEADFAVDP--------IYKDEKVVEAFGGFIAEAEVAGD 70 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~--~~~enq~~~~~~~~--------~~~~~~~~~~~~~~l~ea~~~~~ 70 (528) --.+++..++-.-+.. +|... +..+-+ ..++..++...... ......-...+.....+.. +. 
T Consensus 30 ~~~~~e~~~~~~~~~~-----e~~~l-~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 101 (390) T protein:vir:97 30 GELNASARSKVDELFA-----TVGNL-SAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRS--AR 101 (390) T ss_pred cCCCHHHHHHHHHHHH-----HHHHH-HHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhh--hh Confidence 0011122222211111 01000 000000 00111100000000 0000000000000000000 00 Q ss_pred CCcccc-----ccccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccc Q lcl|NC_012740. 71 HGYNAS-----NIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFH 144 (528) Q Consensus 71 ~g~~~~-----~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~ 144 (528) ...+.. ....+++++.. -.-|.++ .+++++-++.+-.+++.+-||++++.-+--.. .. T Consensus 102 ~~~~~~~~~~~~~~~~~~~~g~-lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~----~~----------- 165 (390) T protein:vir:97 102 ATMNIKAALNTASTDAAGSAGA-LTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GF----------- 165 (390) T ss_pred hhhHHHHHHHhhhccccccccc-ccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEe----cC----------- Confidence 000000 00111111111 1112222 45555556777788899999987753211100 00 Q ss_pred ccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccc Q lcl|NC_012740. 145 PMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLM 224 (528) Q Consensus 145 n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (528) .+.+.| T Consensus 166 ---~~~a~~----------------------------------------------------------------------- 171 (390) T protein:vir:97 166 ---VNNAAI----------------------------------------------------------------------- 171 (390) T ss_pred ---Ccceee----------------------------------------------------------------------- Confidence 000000 Q ss_pred ccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHH Q lcl|NC_012740. 
225 EEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILAN 304 (528) Q Consensus 225 ~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILSt 304 (528) ++.|- + . ......|.+..|.+.|..+ ...+|-||.+|-- +.++.|.+-|+. T Consensus 172 -------v~Eg~-----~-~----~~~~~~~~~i~~~~~k~~~-------~~~is~ell~ds~-----~l~~~i~~~la~ 222 (390) T protein:vir:97 172 -------VAEGA-----L-K----PESSLKFAKKTDTTHVIAH-------TMKATRQILSDAP-----QLASYMNNRLIR 222 (390) T ss_pred -------ecCCc-----c-c----cccccceeEEEEeeeeEEE-------eehhhHHHHHhHH-----HHHHHHHHHHHH Confidence 00000 0 0 0011235555555555444 4679999999852 568999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) .|...||+.||.- + |. ...+.|++..........+ ... ...+..|..+-..+ ...+...+.+ T Consensus 223 a~~~~~d~a~l~G---~---g~------~~~p~Gi~~~~~~~~~~~~-~~~---~~~~d~~~~~~~~~--~~~~~~~~~~ 284 (390) T protein:vir:97 223 GLKVKEDAEILRG---T---GA------NDGLLGLIPQATTYAAPTT-IAG---ATRVDQLRLAMLQA--SLAEYPASGI 284 (390) T ss_pred HHHHHHHHHHhhc---C---CC------Cccccceeecccccccccc-ccc---cchHHHHHHHHHhh--ccccCCCCEE Confidence 9999999888731 0 00 0123444432111100000 000 11111122222222 2333457788 Q ss_pred EEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccccc Q lcl|NC_012740. 385 IASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVAL 464 (528) Q Consensus 385 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~ 464 (528) |++|+....|..-- ...|. . +..+.... 
--++|.| ++|++++..|.+-+++|--- ..+++...-.+ T Consensus 285 v~n~~~~~~L~~lk-----d~~G~-~-l~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~~gd~~-----~~~~~~~~~~~ 350 (390) T protein:vir:97 285 VINPIDWAAIELAK-----DANNQ-Y-LIGNARGT-LTPTLWG-LPVVATQAMAPGEFLVGAFD-----LAAQIFDQWDA 350 (390) T ss_pred EEcHHHHHHHHHhh-----cCCCc-e-eecCccCC-CCceecc-eeeEEcCCCCCCcEEEEecc-----ceEEEEEecce Confidence 99999998887421 11111 0 11111111 1246754 89999999887767666311 11112111112 Q ss_pred ceeEEecCc---cccceeeeeeeeceee-cCcccccCCCccceecccchHHhhc Q lcl|NC_012740. 465 TPLRATDPQ---SFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 465 ~~~~~~Dp~---s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a 514 (528) +.....+.. +-+=.+-...||++.+ +|=+ .+++. +| T Consensus 351 ~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a-------~v~~~-------~a 390 (390) T protein:vir:97 351 RVEIGYVNDDFQRNMVTVLAEERLALVVYRPEA-------LITGS-------FA 390 (390) T ss_pred EEEEeecccccccCcEEEEEEEeeccEEecccc-------EEEEE-------eC Confidence 222222222 2232344556887763 2311 11111 11 No 78 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=59.27 E-value=0.4 Score=22.79 Aligned_cols=333 Identities=15% Similarity=0.101 Sum_probs=121.6 Q ss_pred CcchHHHHHhhhhhhcCC-----cc-----------chhccchh--hhhhhhh--hhhHHHHhhhcccc----------- Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLENE-----KL-----------PEIATASK--QKLVAKI--LESQEADFAVDPIY----------- 49 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~-----~~-----------~~~~~~~~--~~~~~~~--~enq~~~~~~~~~~----------- 49 (528) |...++|+++|.-+.+.- .+ .+|....+ ..+.+++ |+.|.+.+..+..- T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999998888765520 00 11211111 0011111 12221111110000 Q ss_pred --chhhhhhhhccccccccccccCCc-------c-ccccccccccccccccCcchhh------HHHHHHhhhhhhhceee Q lcl|NC_012740. 50 --KDEKVVEAFGGFIAEAEVAGDHGY-------N-ASNIASGQTTGAITNVGPAVIG------MVRRAIPNLIAFDICGV 113 (528) Q Consensus 50 --~~~~~~~~~~~~l~ea~~~~~~g~-------~-~~~~~e~t~tg~v~~~~P~li~------l~Rra~~~lI~~DI~GV 113 (528) .++....++..++.... .+.... . ...+.+++.+. + ..||+ ++++.-..-.-.+++.| T Consensus 81 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~----g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSG----G-DKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHHHHHHHhhhccCCCCC----C-ceeechhHHHHHHHHHHhhchhhhhcee Confidence 00010111110000000 000000 0 00111111111 1 12222 33333334445677888 Q ss_pred ecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 114 QPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAET 193 (528) Q Consensus 114 QPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~t 193 (528) .|+++.+.- |-.+.. 
+.+.| T Consensus 155 ~~~~~~~~p----~~~~~~----------------~~a~~---------------------------------------- 174 (387) T protein:vir:94 155 TNIKGLEIP----RVSYTL----------------DDDDF---------------------------------------- 174 (387) T ss_pred eecCCceee----eeeccC----------------Ccccc---------------------------------------- Confidence 777653210 000000 00000 Q ss_pred ccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccc Q lcl|NC_012740. 194 GIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQ 273 (528) Q Consensus 194 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRA 273 (528) ++ .++.. ......|.+..|.+.|. + T Consensus 175 --------------------------------------v~------Eg~~~----~~~~~~f~~v~l~~~k~-------~ 199 (387) T protein:vir:94 175 --------------------------------------IT------DVETA----KELKAKGDTVKFTTNKF-------K 199 (387) T ss_pred --------------------------------------cc------ccccc----cccccccceeeechhee-------e Confidence 00 00000 01112355555555444 4 Q ss_pred ccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchh Q lcl|NC_012740. 274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARW 353 (528) Q Consensus 274 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~ 353 (528) -...+|-||.+|- ..|.|++|.+-|+..|..-.|..++-.-+-+. .+.|++.-.....+.+ T Consensus 200 ~~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g------------~~~g~~~~~~~~~~~~--- 260 (387) T protein:vir:94 200 VFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSG------------LEHMSFYNGSVKEVEG--- 260 (387) T ss_pred eechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc------------ccceeeeccccccccc--- Confidence 4577999999984 45668899999998887766666652211111 1222221111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe Q lcl|NC_012740. 
354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI 433 (528) Q Consensus 354 a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) -.++-.|..+-+.+...=+ ..+.|++-+...+.+|..- + ... ..+. ...+ ++|.| ++||+ T Consensus 261 -----~~~~d~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~---~--~~~---~~~~-~~~~----~~llG-~PV~~ 320 (387) T protein:vir:94 261 -----ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVL---S--NGT---TNFF-DTPA----EKVFG-KPVVF 320 (387) T ss_pred -----cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHH---h--cCC---Cccc-ccCC----ccccc-cceEE Confidence 1122334444444443323 3566665444444444331 1 000 0111 1111 25665 79998 Q ss_pred eCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHh Q lcl|NC_012740. 434 DQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKD 512 (528) Q Consensus 434 D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~ 512 (528) ..+++. +++| +- +-||.=|....+.+..|..+.+-.+-...||+.. ++| T Consensus 321 ~~~~~~--~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~--------------------- 370 (387) T protein:vir:94 321 TDAAVK--PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD--------------------- 370 (387) T ss_pred ecCCCc--eeee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech--------------------- Confidence 877653 3444 11 1122222222222333333333333334466544 223 Q ss_pred hcchhhhhhhhhhccC Q lcl|NC_012740. 
513 SVGKNAYFRRVWVKGC 528 (528) Q Consensus 513 ~a~~~~~~r~~~Vk~~ 528 (528) .-|+.+.||-= T Consensus 371 -----~A~~~l~~ka~ 381 (387) T protein:vir:94 371 -----SAFRIAKAKEN 381 (387) T ss_pred -----hheEEEEeecC Confidence 11112222111 No 79 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=59.27 E-value=0.4 Score=22.79 Aligned_cols=333 Identities=15% Similarity=0.101 Sum_probs=121.6 Q ss_pred CcchHHHHHhhhhhhcCC-----cc-----------chhccchh--hhhhhhh--hhhHHHHhhhcccc----------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENE-----KL-----------PEIATASK--QKLVAKI--LESQEADFAVDPIY----------- 49 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~-----~~-----------~~~~~~~~--~~~~~~~--~enq~~~~~~~~~~----------- 49 (528) |...++|+++|.-+.+.- .+ .+|....+ ..+.+++ |+.|.+.+..+..- T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999998888765520 00 11211111 0011111 12221111110000 Q ss_pred --chhhhhhhhccccccccccccCCc-------c-ccccccccccccccccCcchhh------HHHHHHhhhhhhhceee Q lcl|NC_012740. 50 --KDEKVVEAFGGFIAEAEVAGDHGY-------N-ASNIASGQTTGAITNVGPAVIG------MVRRAIPNLIAFDICGV 113 (528) Q Consensus 50 --~~~~~~~~~~~~l~ea~~~~~~g~-------~-~~~~~e~t~tg~v~~~~P~li~------l~Rra~~~lI~~DI~GV 113 (528) .++....++..++.... .+.... . ...+.+++.+. 
+ ..||+ ++++.-..-.-.+++.| T Consensus 81 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~----g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSG----G-DKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHHHHHHHhhhccCCCCC----C-ceeechhHHHHHHHHHHhhchhhhhcee Confidence 00010111110000000 000000 0 00111111111 1 12222 33333334445677888 Q ss_pred ecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 114 QPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAET 193 (528) Q Consensus 114 QPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~t 193 (528) .|+++.+.- |-.+.. +.+.| T Consensus 155 ~~~~~~~~p----~~~~~~----------------~~a~~---------------------------------------- 174 (387) T protein:vir:26 155 TNIKGLEIP----RVSYTL----------------DDDDF---------------------------------------- 174 (387) T ss_pred eecCCceee----eeeccC----------------Ccccc---------------------------------------- Confidence 777653210 000000 00000 Q ss_pred ccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccc Q lcl|NC_012740. 194 GIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQ 273 (528) Q Consensus 194 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRA 273 (528) ++ .++.. ......|.+..|.+.|. + T Consensus 175 --------------------------------------v~------Eg~~~----~~~~~~f~~v~l~~~k~-------~ 199 (387) T protein:vir:26 175 --------------------------------------IT------DVETA----KELKAKGDTVKFTTNKF-------K 199 (387) T ss_pred --------------------------------------cc------ccccc----cccccccceeeechhee-------e Confidence 00 00000 01112355555555444 4 Q ss_pred ccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchh Q lcl|NC_012740. 
274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARW 353 (528) Q Consensus 274 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~ 353 (528) -...+|-||.+|- ..|.|++|.+-|+..|..-.|..++-.-+-+. .+.|++.-.....+.+ T Consensus 200 ~~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g------------~~~g~~~~~~~~~~~~--- 260 (387) T protein:vir:26 200 VFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSG------------LEHMSFYNGSVKEVEG--- 260 (387) T ss_pred eechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc------------ccceeeeccccccccc--- Confidence 4577999999984 45668899999998887766666652211111 1222221111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe Q lcl|NC_012740. 354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI 433 (528) Q Consensus 354 a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) -.++-.|..+-+.+...=+ ..+.|++-+...+.+|..- + ... ..+. ...+ ++|.| ++||+ T Consensus 261 -----~~~~d~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~---~--~~~---~~~~-~~~~----~~llG-~PV~~ 320 (387) T protein:vir:26 261 -----ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVL---S--NGT---TNFF-DTPA----EKVFG-KPVVF 320 (387) T ss_pred -----cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHH---h--cCC---Cccc-ccCC----ccccc-cceEE Confidence 1122334444444443323 3566665444444444331 1 000 0111 1111 25665 79998 Q ss_pred eCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHh Q lcl|NC_012740. 434 DQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKD 512 (528) Q Consensus 434 D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~ 512 (528) ..+++. +++| +- +-||.=|....+.+..|..+.+-.+-...||+.. 
++| T Consensus 321 ~~~~~~--~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~--------------------- 370 (387) T protein:vir:26 321 TDAAVK--PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD--------------------- 370 (387) T ss_pred ecCCCc--eeee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech--------------------- Confidence 877653 3444 11 1122222222222333333333333334466544 223 Q ss_pred hcchhhhhhhhhhccC Q lcl|NC_012740. 513 SVGKNAYFRRVWVKGC 528 (528) Q Consensus 513 ~a~~~~~~r~~~Vk~~ 528 (528) .-|+.+.||-= T Consensus 371 -----~A~~~l~~ka~ 381 (387) T protein:vir:26 371 -----SAFRIAKAKEN 381 (387) T ss_pred -----hheEEEEeecC Confidence 11112222111 No 80 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=59.27 E-value=0.4 Score=22.79 Aligned_cols=333 Identities=15% Similarity=0.101 Sum_probs=121.6 Q ss_pred CcchHHHHHhhhhhhcCC-----cc-----------chhccchh--hhhhhhh--hhhHHHHhhhcccc----------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENE-----KL-----------PEIATASK--QKLVAKI--LESQEADFAVDPIY----------- 49 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~-----~~-----------~~~~~~~~--~~~~~~~--~enq~~~~~~~~~~----------- 49 (528) |...++|+++|.-+.+.- .+ .+|....+ ..+.+++ |+.|.+.+..+..- T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999998888765520 00 11211111 0011111 12221111110000 Q ss_pred --chhhhhhhhccccccccccccCCc-------c-ccccccccccccccccCcchhh------HHHHHHhhhhhhhceee Q lcl|NC_012740. 
50 --KDEKVVEAFGGFIAEAEVAGDHGY-------N-ASNIASGQTTGAITNVGPAVIG------MVRRAIPNLIAFDICGV 113 (528) Q Consensus 50 --~~~~~~~~~~~~l~ea~~~~~~g~-------~-~~~~~e~t~tg~v~~~~P~li~------l~Rra~~~lI~~DI~GV 113 (528) .++....++..++.... .+.... . ...+.+++.+. + ..||+ ++++.-..-.-.+++.| T Consensus 81 ~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~----g-G~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAI-LPNEFEKPSMEAQRLLHALPTGNDSG----G-DKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CchhHHHHHHHHHHHHHHH-hhhhHHHHHHHHHHHHhhhccCCCCC----C-ceeechhHHHHHHHHHHhhchhhhhcee Confidence 00010111110000000 000000 0 00111111111 1 12222 33333334445677888 Q ss_pred ecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 114 QPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAET 193 (528) Q Consensus 114 QPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~t 193 (528) .|+++.+.- |-.+.. +.+.| T Consensus 155 ~~~~~~~~p----~~~~~~----------------~~a~~---------------------------------------- 174 (387) T protein:vir:96 155 TNIKGLEIP----RVSYTL----------------DDDDF---------------------------------------- 174 (387) T ss_pred eecCCceee----eeeccC----------------Ccccc---------------------------------------- Confidence 777653210 000000 00000 Q ss_pred ccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccc Q lcl|NC_012740. 194 GIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQ 273 (528) Q Consensus 194 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRA 273 (528) ++ .++.. ......|.+..|.+.|. 
+ T Consensus 175 --------------------------------------v~------Eg~~~----~~~~~~f~~v~l~~~k~-------~ 199 (387) T protein:vir:96 175 --------------------------------------IT------DVETA----KELKAKGDTVKFTTNKF-------K 199 (387) T ss_pred --------------------------------------cc------ccccc----cccccccceeeechhee-------e Confidence 00 00000 01112355555555444 4 Q ss_pred ccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchh Q lcl|NC_012740. 274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARW 353 (528) Q Consensus 274 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~ 353 (528) -...+|-||.+|- ..|.|++|.+-|+..|..-.|..++-.-+-+. .+.|++.-.....+.+ T Consensus 200 ~~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g------------~~~g~~~~~~~~~~~~--- 260 (387) T protein:vir:96 200 VFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAVSPKSG------------LEHMSFYNGSVKEVEG--- 260 (387) T ss_pred eechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc------------ccceeeeccccccccc--- Confidence 4577999999984 45668899999998887766666652211111 1222221111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe Q lcl|NC_012740. 354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI 433 (528) Q Consensus 354 a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) -.++-.|..+-+.+...=+ ..+.|++-+...+.+|..- + ... ..+. ...+ ++|.| ++||+ T Consensus 261 -----~~~~d~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~---~--~~~---~~~~-~~~~----~~llG-~PV~~ 320 (387) T protein:vir:96 261 -----ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVL---S--NGT---TNFF-DTPA----EKVFG-KPVVF 320 (387) T ss_pred -----cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHH---h--cCC---Cccc-ccCC----ccccc-cceEE Confidence 1122334444444443323 3566665444444444331 1 000 0111 1111 25665 79998 Q ss_pred eCCCCcceEEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHh Q lcl|NC_012740. 
434 DQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKD 512 (528) Q Consensus 434 D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~ 512 (528) ..+++. +++| +- +-||.=|....+.+..|..+.+-.+-...||+.. ++| T Consensus 321 ~~~~~~--~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~--------------------- 370 (387) T protein:vir:96 321 TDAAVK--PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLD--------------------- 370 (387) T ss_pred ecCCCc--eeee---ch----hhhhhhhhhhhheecccccCCceEEEEEEEeCcEeech--------------------- Confidence 877653 3444 11 1122222222222333333333333334466544 223 Q ss_pred hcchhhhhhhhhhccC Q lcl|NC_012740. 513 SVGKNAYFRRVWVKGC 528 (528) Q Consensus 513 ~a~~~~~~r~~~Vk~~ 528 (528) .-|+.+.||-= T Consensus 371 -----~A~~~l~~ka~ 381 (387) T protein:vir:96 371 -----SAFRIAKAKEN 381 (387) T ss_pred -----hheEEEEeecC Confidence 11112222111 No 81 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=58.00 E-value=0.42 Score=22.64 Aligned_cols=303 Identities=11% Similarity=0.081 Sum_probs=120.7 Q ss_pred hhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccccccccccccccCcchh-hHHHHHHhhhhhhhceee Q lcl|NC_012740. 35 ILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGV 113 (528) Q Consensus 35 ~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GV 113 (528) ..|+|+....- +. |...+.+-+.. .+-+...++..+.+ .-+.+. 
.+++.+..+.+..+++-+ T Consensus 1 ~~~~~~~~~~~----~~------f~~~~~~~~~~-----~a~~~~~~~~~~~l--iP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:93 1 MEQTQKLKLNL----QH------FASNNVKPQVF-----NPDNVMMHEKKDGT--LLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CchhHHHHHHH----HH------HHHhhhhhhhc-----ccccccccCCCcce--echhHHHHHHHHHHhhchhhhhcce Confidence 22233221110 10 10011111100 01111111111111 122233 466666778888999999 Q ss_pred ecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 114 QPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAET 193 (528) Q Consensus 114 QPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~t 193 (528) -||++++--|.- ... + +.+.| T Consensus 64 ~~~~~~~~~ip~----~~~------~---------~~a~~---------------------------------------- 84 (324) T protein:vir:93 64 EPMEGTEKKFTF----WAD------K---------PGAYW---------------------------------------- 84 (324) T ss_pred eeccCCceEEEE----Eec------C---------cceee---------------------------------------- Confidence 999887522210 000 0 00000 Q ss_pred ccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEeccc Q lcl|NC_012740. 194 GIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQ 273 (528) Q Consensus 194 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRA 273 (528) + +| +..+++..-++++++++.|..+ T Consensus 85 --------------------------------------v--------~E---------g~~~~~~~~~f~~i~~~~~k~~ 109 (324) T protein:vir:93 85 --------------------------------------V--------GE---------GQKIETSKATWVNATMRAFKLG 109 (324) T ss_pred --------------------------------------e--------cC---------CccccccccceeEEEEEeEEEE Confidence 0 01 0112222333445555555555 Q ss_pred ccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceecccccc--ccccc Q lcl|NC_012740. 
274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPI--DTRGA 351 (528) Q Consensus 274 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~--d~~~~ 351 (528) -....|-||.+|-. .|.+++|.+-|+..|...+++.+|.--..+. ...|+++..... ...+ T Consensus 110 ~~~~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~------------~~~~~~~~~~~~~~~~~~- 172 (324) T protein:vir:93 110 VILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------------FGKSIAQSIEKTNKVIKG- 172 (324) T ss_pred EeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------cCccccccccccceeccc- Confidence 55679999999953 5678999999999999999998884211000 112222221110 0000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEE Q lcl|NC_012740. 352 RWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKV 431 (528) Q Consensus 352 r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~v 431 (528) ...+..|.++-+.|.. .+.....+||+|.....|.... ...|. . +..+.. .+.|.| ++| T Consensus 173 -------~~~~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~-----d~~G~-~-~~~~~~----~~~l~G-~PV 231 (324) T protein:vir:93 173 -------DFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIV-----DPETK-E-RIYDRN----SDSLDG-LPV 231 (324) T ss_pred -------cccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh-----CCCCC-e-eecCCC----CCcccc-eee Confidence 0112223333333333 2335668999999999998631 11111 1 111111 235655 788 Q ss_pred EeeCCC--CcceEEEE--------EecCCCccceeEeccccccceeEEecC------ccccceeeeeeeeceee-cCccc Q lcl|NC_012740. 432 FIDQYA--RQDYFTVG--------YKGDNEMDAGIYYAPYVALTPLRATDP------QSFHPVLGFKTRYGIGI-NPFAD 494 (528) Q Consensus 432 y~D~y~--~~dy~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp------~s~qP~~~~~tRY~l~~-nP~~~ 494 (528) ++.+.. +...+++| ..+.-+.+-. .+..+....-.|. 
..-|=.+=+..||+..+ +|=+ T Consensus 232 v~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~----~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a- 306 (324) T protein:vir:93 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKID----ETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA- 306 (324) T ss_pred EeecCCCCCcceEEEEecceEEEEEecCcEEEEe----ecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc- Confidence 886653 33233333 3322211100 0000000000000 01122333445665542 2200 Q ss_pred ccCCCccceecccchHH-hhcchh Q lcl|NC_012740. 495 SKSQAPSARITSGMLSK-DSVGKN 517 (528) Q Consensus 495 ~~~~~~~~~~~~~~~~~-~~a~~~ 517 (528) .+++.+..... ...|+- T Consensus 307 ------~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 307 ------FAKLVPADKRTDSVPGEV 324 (324) T ss_pred ------eEEEecccccCCCCCCCC Confidence 11111111000 111221 No 82 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=57.00 E-value=0.44 Score=22.52 Aligned_cols=273 Identities=10% Similarity=0.070 Sum_probs=113.4 Q ss_pred ccccccccccccccccccc-ccccc-cccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccC Q lcl|NC_012740. 171 FQKLTLSTPIAAGDIVHHT-FAETG-IAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFN 248 (528) Q Consensus 171 f~~~t~~t~~a~Gdi~~~~-f~~tg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~g 248 (528) ++. . ++...+++... +..-- ..................-. ...|...+...=-....+|.. . T Consensus 1 Ma~---~-~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~---------g~~G~tv~ip~~~~~g~a~~~---~ 64 (278) T protein:vir:80 1 MAD---L-TTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLE---------GQPGSEITVPKYKYIGDAQDV---A 64 (278) T ss_pred CCC---c-ceehhheecHHHHHHHHHHHHHHhhhhcccceeccccc---------CCCCCEEEEeeeccCCcceee---c Confidence 000 0 00011111100 00000 00000000000000000000 000111111110001122211 1 Q ss_pred CCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeccc Q lcl|NC_012740. 
249 GSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH-GMDADAELNAILANEVLLEINREIVDVINFTAQVGKT 327 (528) Q Consensus 249 gs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiH-GLDAE~ELanILStEImlEINReii~~i~~~a~~~~~ 327 (528) .....+.. ..+..+++++-|-|+ |+ | + .-|+-+.. +-|.-.+..+-++.-+..+++++++..+.-... .+. T Consensus 65 ~g~~i~~~--~lt~~~~~~~i~~~~-~a-~--~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-~~~ 136 (278) T protein:vir:80 65 EGAAIDYS--ALETESVKHGIKKAG-KG-V--K-LTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-EVK 136 (278) T ss_pred CCCcCccc--ccccceeeEeeehhh-cc-c--c-ccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-ccc Confidence 11222333 445566666666665 22 2 2 33444443 679999999999999999999999987642111 110 Q ss_pred ceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcccccccccccc Q lcl|NC_012740. 328 GMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQG 407 (528) Q Consensus 328 ~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~ 407 (528) .+-+.|.. | -+.+.+-.+..++.+ .--. ...+++++|.+.+.|.......+..... T Consensus 137 --------~~~t~~~~---~-----~~~~~~~da~~~l~~-------~~~~-~~~~ivv~p~~~~~L~k~~~~~~~~~~~ 192 (278) T protein:vir:80 137 --------GAINIGLI---D-----KIENTFTDAPDAIED-------ESIT-TTGVLFLNYKDTAKLREEAAGSWTKASQ 192 (278) T ss_pred --------cccccchh---h-----hHHHHHHHHHHhhcc-------cCCC-cccEEEECHHHHHHHHhhhhhhcccccc Confidence 01111211 1 012222222222222 1111 2348999999999997654333322111 Q ss_pred ccccccccccCceEEEEecCceEEEeeCCCCcce-EEEEEecCCCccceeEeccccccceeEEecCccccceeeeeeeec Q lcl|NC_012740. 408 AAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDY-FTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYG 486 (528) Q Consensus 408 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~ 486 (528) ... ....+-.+|.+.| ++||+++..|..= ++++ +|. 
-.|+..= +...-.--|+..++-.|-...+|| T Consensus 193 ~g~----~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~-~gA-----i~~~~~~-~~~vE~~Rd~~~~~d~i~~~~~yg 260 (278) T protein:vir:80 193 LGD----DLLVKGAFGELLG-WEIVRTKKLADGNALAVK-AGA-----LKTFLKR-NLLAESGRDMDHKLTKFNADQHYA 260 (278) T ss_pred ccc----cceeeccceeecc-eeEEEcCCCCcceEEEEe-ccc-----eeeeecC-CcccccccchhhccceeeeeeEEE Confidence 011 1112235888965 9999999987431 1221 121 1122211 222224468889999988889998 Q ss_pred eee-cCcccccCCCccceecccchHHhhcch Q lcl|NC_012740. 487 IGI-NPFADSKSQAPSARITSGMLSKDSVGK 516 (528) Q Consensus 487 l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~~ 516 (528) +.+ ||-.. ..++. .||. T Consensus 261 ~~v~~~~~~-------v~it~------~a~~ 278 (278) T protein:vir:80 261 VALVDETKA-------VKVVP------VAGN 278 (278) T ss_pred EEEEcCcce-------EEEee------ccCC Confidence 864 56111 11111 1111 No 83 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=56.87 E-value=0.45 Score=22.50 Aligned_cols=335 Identities=14% Similarity=0.111 Sum_probs=122.2 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhh---h---------ccccchhhhhhhhcccccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFA---V---------DPIYKDEKVVEAFGGFIAEAEVA 68 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~---~---------~~~~~~~~~~~~~~~~l~ea~~~ 68 (528) +-..+++..++..+.+.. -++ +. ++-+.|++... + ....++.. ...|...+.-.. . T Consensus 39 ~e~i~e~~~~~~~~~~~~--~~~----~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~ 106 (408) T protein:vir:74 39 AEAMSELKNKRDNEKVRR--DAL----RE----QLVEAQAEQVVNMREEEKGPLNKSENELKDKF-VKDFVNMVRNPM-A 106 (408) T ss_pred HHHHHHHHHHHHHHHHHH--HHH----HH----HHHHHHHHHHhhccccccccccchhhhhHHHH-HHHHHHHHhcch-h Confidence 323345555665444321 111 11 11111111000 0 00111111 111211110000 0 Q ss_pred ccCCccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccc Q lcl|NC_012740. 
69 GDHGYNASNIASGQT-TGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFH 144 (528) Q Consensus 69 ~~~g~~~~~~~e~t~-tg~v~---~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~ 144 (528) .-..-..+.+..++. .|.+. .+.+ .+++.+-++....+++.++||++.+|-+--.+- ... + T Consensus 107 ~~~~~~~~a~~~~~~~~gg~~vP~~~~~---~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~-----~----- 171 (408) T protein:vir:74 107 FLNTVSSKTETSGSDSAAGLTIPQDIRT---MINTLVRQYDSLQQYVRVESVSTSSGSRVYEKW--TDV-----T----- 171 (408) T ss_pred hhhhhhhhhhcccccCCCceeechhHhh---HHHHHHhhhcchhhhcceeeccCCcceEEEEee--cCC-----c----- Confidence 000111122222221 12111 1222 344444456677899999999988654422220 000 0 Q ss_pred ccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccc Q lcl|NC_012740. 145 PMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLM 224 (528) Q Consensus 145 n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (528) +... T Consensus 172 ----~~~~------------------------------------------------------------------------ 175 (408) T protein:vir:74 172 ----PLKA------------------------------------------------------------------------ 175 (408) T ss_pred ----cccc------------------------------------------------------------------------ Confidence 0000 Q ss_pred ccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHH Q lcl|NC_012740. 225 EEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILAN 304 (528) Q Consensus 225 ~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILSt 304 (528) -++.+ ++. .......|.++.|...|.. -...+|-||.+|- .+|.+++|.+-|+. 
T Consensus 176 ------~v~E~-----~~~----~~~~~~~~~~i~~~~~k~~-------~~~~iS~ell~ds----~~~l~~~i~~~l~~ 229 (408) T protein:vir:74 176 ------MDEED-----GKI----PDLDNPRLTIIKYLIKRYA-------GIITATNTLLKDT----AENILAWLSSWIAK 229 (408) T ss_pred ------ccccc-----ccc----ccccccceeeEEeeeeeEE-------eeehhHHHHHhhc----hHHHHHHHHHHHHH Confidence 00000 000 0011223555555555544 4456999999983 46779999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) .|..-+|+.||.= . |. .....++.+++ .|...+ ...+.. .+...-.+ T Consensus 230 ~~~~~~d~~il~G---~------G~---~~~~~~~~~~~----------------~i~~~~---~~~l~~--~~~~~a~~ 276 (408) T protein:vir:74 230 KVVVTRNQAIIAA---M------GT---VPKKPTIANFD----------------DVITMI---NTSVDP--AIIATSSL 276 (408) T ss_pred HHHHHHHHHHhhc---c------cc---cccccccccHH----------------HHHHHH---HHhhhh--hhcCCCEE Confidence 9999999988831 1 11 00112222221 111111 112221 22234467 Q ss_pred EEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCC--CCc----ce-EEEE-----EecCCCc Q lcl|NC_012740. 385 IASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQY--ARQ----DY-FTVG-----YKGDNEM 452 (528) Q Consensus 385 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~~----dy-~~vG-----~KG~~~~ 452 (528) ||+|.....|...- ...| ..-...+.+.. .-++|.| ++||+-.+ .+. ++ +++| |..-.-. T Consensus 277 v~n~~~~~~l~~lk-----d~~G-~~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 348 (408) T protein:vir:74 277 LTNQSGLNKLALVK-----TAEG-KYLLEPDPTKP-NSYLIKG-KQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRE 348 (408) T ss_pred EEcHHHHHHHHHhh-----cCCC-ceEeccCcCCC-CCceecc-eeeEEecCcccccccCCcceEEEEehhccEEEEEec Confidence 89999999998631 1111 11111222221 1246755 77776332 111 11 2222 1000000 Q ss_pred cceeEeccccccceeEEecCccccceeeeeeeecee-ecCcc--c----ccCCCccceecccchHHhhc Q lcl|NC_012740. 
453 DAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPFA--D----SKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 453 d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~--~----~~~~~~~~~~~~~~~~~~~a 514 (528) .-.+=..||.- .+-...+-.+-+..||+.. .+|=+ . .....+.+ .+.....+. T Consensus 349 ~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~---~~~~~~~~~ 408 (408) T protein:vir:74 349 NMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGN---FKTTTSTAV 408 (408) T ss_pred ceEEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCC---CCCCccccC Confidence 00111112110 0012333444445555543 12200 0 00000000 000000011 No 84 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=55.72 E-value=0.47 Score=22.37 Aligned_cols=303 Identities=12% Similarity=0.083 Sum_probs=119.0 Q ss_pred hccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCccccccccccccccccccCcchh-hHHHHH Q lcl|NC_012740. 23 IATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIASGQTTGAITNVGPAVI-GMVRRA 101 (528) Q Consensus 23 ~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li-~l~Rra 101 (528) |+...+...- ++. +... +-+.+.. ++.+...++..+.+ .-|.+. .+++.+ T Consensus 1 ~~~~~~~~~~----------~~~---f~~~---------~~~~~~~-----~a~~~~~~~~~~~l--ip~~~~~~ii~~~ 51 (324) T protein:vir:96 1 MEQTQKLKLN----------LQH---FASN---------NVKPQVF-----NPDNVMMHEKKDGT--LLNDFTTPILQEV 51 (324) T ss_pred CCcchhhhHH----------HHH---HHHh---------hhhhhhc-----ccccccccCCCcce--echhHHHHHHHHH Confidence 1111111100 000 0000 0000000 01111111111111 223333 455666 Q ss_pred HhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccc Q lcl|NC_012740. 102 IPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIA 181 (528) Q Consensus 102 ~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a 181 (528) ..+.+..+++.+-||++++.-|.- +.. 
+ +.+.| T Consensus 52 ~~~s~l~~l~~~~~~~~~~~~~p~----~~~------~---------~~a~~---------------------------- 84 (324) T protein:vir:96 52 MENSKIMQLGKYEPMEGTEKKFTF----WAD------K---------PGAYW---------------------------- 84 (324) T ss_pred HhhchhhhhcceeeccCCceEEEE----Eec------C---------cceee---------------------------- Confidence 778888999999999887532211 000 0 00000 Q ss_pred ccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeE Q lcl|NC_012740. 182 AGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMR 261 (528) Q Consensus 182 ~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFS 261 (528) ++.| +.. ......|.+..+. T Consensus 85 --------------------------------------------------v~Eg------~~~----~~~~~~f~~v~~~ 104 (324) T protein:vir:96 85 --------------------------------------------------VGEG------QKI----ETSKATWVNATMR 104 (324) T ss_pred --------------------------------------------------ecCC------ccc----cccccceeEEEEE Confidence 0000 000 0112346666666 Q ss_pred EeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceec Q lcl|NC_012740. 262 IDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFD 341 (528) Q Consensus 262 IEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~d 341 (528) +.|..+-. ..|-||.+|-. .|.+++|.+.|...|...+++.||.--..+ ..+.|++. T Consensus 105 ~~k~~~~~-------~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g~~------------~~~~~~~~ 161 (324) T protein:vir:96 105 AFKLGVIL-------PVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------------PFGKSIAQ 161 (324) T ss_pred eEEEEEee-------hhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC------------CcCccccc Confidence 66666554 49999999853 567899999999999999999888421100 01222222 Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceE Q lcl|NC_012740. 
342 LQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVF 421 (528) Q Consensus 342 l~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~ 421 (528) ...... . +.. ....+..|..+-+.|.. .+...+.+|||+.....|.... ...| ..-+. +.. T Consensus 162 ~~~~~~--~--~~~--~~~~~~~i~~~~~~i~~--~~~~~~~~i~n~~~~~~L~~lk-----d~~G-~~~~~-~~~---- 222 (324) T protein:vir:96 162 SIKKTN--K--VIK--GDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIV-----DPET-KERIY-DRN---- 222 (324) T ss_pred cccccc--e--ecc--cccchHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh-----CCCC-Ceeec-CCC---- Confidence 111000 0 000 01112223344444433 2345778999999999887642 1111 11111 111 Q ss_pred EEEecCceEEEeeCCC--CcceEEEE--------EecCCCccceeEeccccccceeEEecCcc-----c---cceeeeee Q lcl|NC_012740. 422 AGVLAGKYKVFIDQYA--RQDYFTVG--------YKGDNEMDAGIYYAPYVALTPLRATDPQS-----F---HPVLGFKT 483 (528) Q Consensus 422 ~G~l~~~~~vy~D~y~--~~dy~~vG--------~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s-----~---qP~~~~~t 483 (528) .++|.| ++|++++.. +...+++| ..+.-+.+.+ .+ ..+....|+.. | |=.+=..- T Consensus 223 ~~~l~G-~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~----~~--~~~~~~~~~~~~~~~~~~~n~v~~r~~~ 295 (324) T protein:vir:96 223 SDSLDG-LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKID----ET--AQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) T ss_pred CCcccc-eeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEe----ec--ccccccccccccchhhhhcCcEEEEEEE Confidence 234654 888886653 22223333 2222111000 00 00000011110 1 12223345 Q ss_pred eecee-ecCcccccCCCccceecccchH-Hhhcchh Q lcl|NC_012740. 484 RYGIG-INPFADSKSQAPSARITSGMLS-KDSVGKN 517 (528) Q Consensus 484 RY~l~-~nP~~~~~~~~~~~~~~~~~~~-~~~a~~~ 517 (528) ||++. .+|=+ .+++...... 
....|+- T Consensus 296 r~d~~v~~~~a-------~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 296 HVALHIADDKA-------FAKLVPADKRTDSVPGEV 324 (324) T ss_pred EeccEEecccc-------eEEEecccccCCCCCCCC Confidence 66653 33310 0111110000 0112222 No 85 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=50.55 E-value=0.61 Score=21.77 Aligned_cols=338 Identities=12% Similarity=0.024 Sum_probs=124.0 Q ss_pred CcchHHHHHhhhhhhcCCccchhc----cchh---hhhhhhhhhhHHHHhhhcc--ccchhhhhhhhcccccccc---cc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIA----TASK---QKLVAKILESQEADFAVDP--IYKDEKVVEAFGGFIAEAE---VA 68 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~----~~~~---~~~~~~~~enq~~~~~~~~--~~~~~~~~~~~~~~l~ea~---~~ 68 (528) -...++..++...+.+. +.+.. +..+ +++.+++ +..|+...+.. ........+++........ .. T Consensus 22 ~~~~~e~~~~~e~~~~~--~~~~~~~~~~e~~~~~~~l~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (379) T protein:vir:10 22 SAQALEVKGLIEALEAK--MTSEKDLAVNELKSDMAALQAHA-DKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNG 98 (379) T ss_pred HHHHHHHHHHHHHHHhH--hhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhh Confidence 01112222222222110 11111 1111 1111111 11111111111 0111111111111100000 00 Q ss_pred ccCCccccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccc Q lcl|NC_012740. 69 GDHGYNASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPM 146 (528) Q Consensus 69 ~~~g~~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~ 146 (528) ...+-... 
+..+++++....=|.-+ .+++..-....-.++|.|.||++++.- |.-..+ T Consensus 99 ~~~~~~~~--~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~-------~~~~~~----------- 158 (379) T protein:vir:10 99 KSIQVKAV--GDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYT-------FVRENG----------- 158 (379) T ss_pred hhhhhhhh--cccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceE-------EEEeec----------- Confidence 00000000 11111222211112211 233434445667789999999887421 111000 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 147 YSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 147 ~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) +.+.. T Consensus 159 ------~~~~~--------------------------------------------------------------------- 163 (379) T protein:vir:10 159 ------AGEGA--------------------------------------------------------------------- 163 (379) T ss_pred ------CCCcc--------------------------------------------------------------------- Confidence 00000 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) ..-.+| +...+++..++++++..+|.=+--..+|-||.||.-. .++.|.+-|+..| T Consensus 164 ----------~~~v~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~-----l~~~i~~~la~~~ 219 (379) T protein:vir:10 164 ----------IGAQVE---------GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPF-----LTSFIPNALRRDY 219 (379) T ss_pred ----------cccccC---------CccccccccceeeeEeeeeeEEeeehhhHHHHhhHHH-----HHHHHHHHHHHHH Confidence 000011 1122333344444444444444446799999999642 5788999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEE Q lcl|NC_012740. 
307 LLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIA 386 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~ 386 (528) +.-+|..++.-+...+..+.. +. .+. -..+..+.++.++. . ..+ ..+.+|+ T Consensus 220 ~~~~~~~~~~g~~~~~~~~~~----------~~------~~~----~~~d~i~~~~~~~~-------~-~~~-~~~~~vm 270 (379) T protein:vir:10 220 AKAENAAFNAVLAANATASTE----------II------TNK----NKVEMLINEIAKQE-------N-LDF-PVTAIVL 270 (379) T ss_pred HHHHHHHHhcccccccccccc----------cc------cCc----ccHHHHHHHHHhhh-------h-ccC-CCCEEEE Confidence 999998887644332211111 11 010 01233333333332 1 122 5677899 Q ss_pred chhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEecccccc-c Q lcl|NC_012740. 387 SRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVAL-T 465 (528) Q Consensus 387 S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~-~ 465 (528) +|.....|....--...+...++.... +... -+|.| ++|+++++.+...+++|=-.. .-+++- .+ . T Consensus 271 n~~~~~~l~~lkd~~G~~l~~~~~~~~-~~~~----~~l~G-~pvv~s~~~~ag~~~~gdf~~----~~~~~~---~~~~ 337 (379) T protein:vir:10 271 RPTDYYDILVTQKSVGAGYGLPGVVTQ-DNGV----LRING-IPLFRATWLAANKYYVGDWTR----VTKVTT---EGLS 337 (379) T ss_pred cHHHHHHHHHhhccCCceeccCCccCC-CCCc----ceecc-eeeEecCCCCCCceEEeeccc----EEEEEE---eceE Confidence 999888876431000000000000000 1111 14544 899999998776655542211 112211 11 1 Q ss_pred eeEEecCcc-c-cceee--eeeeecee-ecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 466 PLRATDPQS-F-HPVLG--FKTRYGIG-INPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 466 ~~~~~Dp~s-~-qP~~~--~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) +....++.. | +-.++ +..|+|+. 
.+|=+.- ++-+..| T Consensus 338 i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v--------------------------~~~~~~~ 379 (379) T protein:vir:10 338 LEFSEVEGTNFVKNNITARIEAQVALAVEQPAALI--------------------------FGDFTAV 379 (379) T ss_pred EEEeecccccccCCcEEEEEEEEeccEEecCccEE--------------------------EEEecCC Confidence 112222211 1 22233 34577554 3451111 1111111 No 86 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=45.05 E-value=0.78 Score=21.16 Aligned_cols=301 Identities=11% Similarity=0.047 Sum_probs=118.7 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |.. . +..+..+++....+.+-+. .++.+... T Consensus 1 ~~~----------------~-~~~~~~~~~~~~~~~~~~~--------------------------------~~a~~~~~ 31 (324) T protein:vir:96 1 MEQ----------------T-QKLKLNLQHFASNNVKPQV--------------------------------FNPDNVMM 31 (324) T ss_pred CCc----------------c-hhhhHHHHHHHHHhhhhhh--------------------------------hccccccc Confidence 000 0 0111111111111111110 11111111 Q ss_pred ccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccccc Q lcl|NC_012740. 81 GQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAA 158 (528) Q Consensus 81 ~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~ 158 (528) +. +++. .=|.-+ .+++.+..+....+++-+-||++++--|.-.. . 
+ +++.+ T Consensus 32 ~~-~~~~--~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~----~------~---------~~a~~----- 84 (324) T protein:vir:96 32 HE-KKDG--TLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA----D------K---------PGAYW----- 84 (324) T ss_pred cC-cCcc--ccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----c------C---------cceeE----- Confidence 11 1111 113222 45666667777888898989887642111000 0 0 00000 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccch Q lcl|NC_012740. 159 KDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMAT 238 (528) Q Consensus 159 a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsT 238 (528) + T Consensus 85 -------------------------------------------------------------------------v------ 85 (324) T protein:vir:96 85 -------------------------------------------------------------------------V------ 85 (324) T ss_pred -------------------------------------------------------------------------e------ Confidence 0 Q ss_pred hhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_012740. 239 SIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVI 318 (528) Q Consensus 239 a~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i 318 (528) +| +..+++...++++++++.|.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|.=- T Consensus 86 --~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~ 150 (324) T protein:vir:96 86 --GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred --cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 01 011233333444444444444445569999999863 677999999999999999999988421 Q ss_pred hhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccc Q lcl|NC_012740. 
319 NFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASAD 398 (528) Q Consensus 319 ~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g 398 (528) -.+. .+.|+.......... ......+..|+++.+.+.. .+...+.+|+||.....|.... T Consensus 151 g~~~------------~~~gi~~~~~~~~~~------~~~~~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~ 210 (324) T protein:vir:96 151 GNNP------------FGKSIAQSIEKTNKV------IKGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred CCCC------------cCcccccccccccee------ccccccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh Confidence 1110 122333222111000 0001123334444444433 3345667899999999887542 Q ss_pred cccccccccccccccccccCceEEEEecCceEEEeeCCC--CcceEEEEEecCCCccceeEeccccccceeEEec----- Q lcl|NC_012740. 399 QGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYA--RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATD----- 471 (528) Q Consensus 399 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~D----- 471 (528) ...| ...+ .+.. .++|.| ++|++++.. +...+++|-. +.+++...-...+...-+ T Consensus 211 -----d~~G-~~~~-~~~~----~~~l~G-~PV~~~~~~~~~~~~~~~gd~------~~~~~g~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:96 211 -----DPET-KERI-YDRN----SDSLDG-LPVVNLKSSNLKRGELITGDF------DKLIYGIPQLIEYKIDETAQLST 272 (324) T ss_pred -----ccCC-Ceee-cCCC----CCcccc-eeeEeeCCCCCCcceEEEEec------ceEEEEEecCcEEEEeecccccc Confidence 1111 1111 1111 234655 788887763 3333444421 111222111111111111 Q ss_pred ---Cc-----cc---cceeeeeeeeceee-cCcccccCCCccceecccchHH--hhcchh Q lcl|NC_012740. 472 ---PQ-----SF---HPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSK--DSVGKN 517 (528) Q Consensus 472 ---p~-----s~---qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~--~~a~~~ 517 (528) +. -| |=.+=...||+..+ +|=+ .+++.. -+|. 
...|.- T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A-------~~~l~~-a~~~~~~~~~~~ 324 (324) T protein:vir:96 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLVP-ADKRTDSVPGEV 324 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEec-ccccCCCCCCCC Confidence 00 01 11112234555432 2200 011110 0000 000110 No 87 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=45.05 E-value=0.78 Score=21.16 Aligned_cols=301 Identities=11% Similarity=0.047 Sum_probs=118.7 Q ss_pred CcchHHHHHhhhhhhcCCccchhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccccccccCCcccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYNASNIAS 80 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e 80 (528) |.. . +..+..+++....+.+-+. .++.+... T Consensus 1 ~~~----------------~-~~~~~~~~~~~~~~~~~~~--------------------------------~~a~~~~~ 31 (324) T protein:vir:78 1 MEQ----------------T-QKLKLNLQHFASNNVKPQV--------------------------------FNPDNVMM 31 (324) T ss_pred CCc----------------c-hhhhHHHHHHHHHhhhhhh--------------------------------hccccccc Confidence 000 0 0111111111111111110 11111111 Q ss_pred ccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccccc Q lcl|NC_012740. 81 GQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAA 158 (528) Q Consensus 81 ~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~ 158 (528) +. +++. .=|.-+ .+++.+..+....+++-+-||++++--|.-.. . 
+ +++.+ T Consensus 32 ~~-~~~~--~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~----~------~---------~~a~~----- 84 (324) T protein:vir:78 32 HE-KKDG--TLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA----D------K---------PGAYW----- 84 (324) T ss_pred cC-cCcc--ccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----c------C---------cceeE----- Confidence 11 1111 113222 45666667777888898989887642111000 0 0 00000 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccch Q lcl|NC_012740. 159 KDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMAT 238 (528) Q Consensus 159 a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsT 238 (528) + T Consensus 85 -------------------------------------------------------------------------v------ 85 (324) T protein:vir:78 85 -------------------------------------------------------------------------V------ 85 (324) T ss_pred -------------------------------------------------------------------------e------ Confidence 0 Q ss_pred hhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Q lcl|NC_012740. 239 SIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVI 318 (528) Q Consensus 239 a~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i 318 (528) +| +..+++...++++++++.|.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|.=- T Consensus 86 --~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~ 150 (324) T protein:vir:78 86 --GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred --cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 01 011233333444444444444445569999999863 677999999999999999999988421 Q ss_pred hhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccc Q lcl|NC_012740. 
319 NFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASAD 398 (528) Q Consensus 319 ~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g 398 (528) -.+. .+.|+.......... ......+..|+++.+.+.. .+...+.+|+||.....|.... T Consensus 151 g~~~------------~~~gi~~~~~~~~~~------~~~~~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~ 210 (324) T protein:vir:78 151 GNNP------------FGKSIAQSIEKTNKV------IKGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred CCCC------------cCcccccccccccee------ccccccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhh Confidence 1110 122333222111000 0001123334444444433 3345667899999999887542 Q ss_pred cccccccccccccccccccCceEEEEecCceEEEeeCCC--CcceEEEEEecCCCccceeEeccccccceeEEec----- Q lcl|NC_012740. 399 QGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYA--RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATD----- 471 (528) Q Consensus 399 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~--~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~D----- 471 (528) ...| ...+ .+.. .++|.| ++|++++.. +...+++|-. +.+++...-...+...-+ T Consensus 211 -----d~~G-~~~~-~~~~----~~~l~G-~PV~~~~~~~~~~~~~~~gd~------~~~~~g~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:78 211 -----DPET-KERI-YDRN----SDSLDG-LPVVNLKSSNLKRGELITGDF------DKLIYGIPQLIEYKIDETAQLST 272 (324) T ss_pred -----ccCC-Ceee-cCCC----CCcccc-eeeEeeCCCCCCcceEEEEec------ceEEEEEecCcEEEEeecccccc Confidence 1111 1111 1111 234655 788887763 3333444421 111222111111111111 Q ss_pred ---Cc-----cc---cceeeeeeeeceee-cCcccccCCCccceecccchHH--hhcchh Q lcl|NC_012740. 472 ---PQ-----SF---HPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSK--DSVGKN 517 (528) Q Consensus 472 ---p~-----s~---qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~--~~a~~~ 517 (528) +. -| |=.+=...||+..+ +|=+ .+++.. -+|. 
...|.- T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A-------~~~l~~-a~~~~~~~~~~~ 324 (324) T protein:vir:78 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-------FAKLVP-ADKRTDSVPGEV 324 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEec-ccccCCCCCCCC Confidence 00 01 11112234555432 2200 011110 0000 000110 No 88 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=41.93 E-value=0.91 Score=20.82 Aligned_cols=343 Identities=13% Similarity=0.098 Sum_probs=128.0 Q ss_pred cchHHHHHhhhhhhcCCccchhccchhhhhhh-------hhhhh------HHHHhhhcc-----ccchhh---hhhhhcc Q lcl|NC_012740. 2 KTTKELMEKWSPLLENEKLPEIATASKQKLVA-------KILES------QEADFAVDP-----IYKDEK---VVEAFGG 60 (528) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~~~~~~~~~~~~-------~~~en------q~~~~~~~~-----~~~~~~---~~~~~~~ 60 (528) -+.++|+++|.-+.+ .+.++.+..++.... ..+|. |-+.+.+.. ...+.. -.....+ T Consensus 1 M~~~eL~~~~~~~~~--~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (395) T protein:vir:38 1 MNINQLKDAFDMAGQ--KVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNK 78 (395) T ss_pred CCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 455778888877743 232232222211111 01110 100011100 000000 0000000 Q ss_pred cccccccccc----------CCcccccccccc-ccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeee Q lcl|NC_012740. 61 FIAEAEVAGD----------HGYNASNIASGQ-TTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIR 127 (528) Q Consensus 61 ~l~ea~~~~~----------~g~~~~~~~e~t-~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMR 127 (528) ...+...... .+.. 
...++++ ++++-...=|.-+ .+++.+....+..++|.++||++++|-+--.+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (395) T protein:vir:38 79 KPLPVKDGKPDAQAMKNQFVKDFK-NLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK 157 (395) T ss_pred cccchhhhhHHHHHHHHHHHHHHH-HHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe Confidence 0000000000 0000 0011111 1111111113222 35555556777888999999999876431111 Q ss_pred eeecCCCCCCcccccccccccccccccccccCcccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 128 SVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVT 207 (528) Q Consensus 128 srY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~ 207 (528) - .+. .+.+. T Consensus 158 ~--~~~--------------~~~a~------------------------------------------------------- 166 (395) T protein:vir:38 158 L--ADI--------------TPLKD------------------------------------------------------- 166 (395) T ss_pred e--ccC--------------Ccccc------------------------------------------------------- Confidence 0 000 00000 Q ss_pred ccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHH Q lcl|NC_012740. 208 PTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLR 287 (528) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLk 287 (528) .++.+ ++ ...+....|.+..|+..|..+- ..+|.||.+|- T Consensus 167 -----------------------~v~E~-----~~----~~~~~~~~f~~v~~~~~k~~~~-------~~iS~ell~ds- 206 (395) T protein:vir:38 167 -----------------------LDDES-----AL----IGDNDDPELTVVKYLIHRYAGI-------TTVTNTLLKDT- 206 (395) T ss_pred -----------------------ccccc-----cc----cccccccceeeEEeeeeeeEee-------hhhHHHHHhhh- Confidence 00000 00 0011123466666666666554 45999999993 Q ss_pred hhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHH Q lcl|NC_012740. 
288 AVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDK 367 (528) Q Consensus 288 AiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~ 367 (528) ..|-++.|.+-|+..|..-||+.|+.-. .. +. ...|..++ +....++... T Consensus 207 ---~~~l~~~i~~~la~~~~~~~~~~il~g~---g~----~~-----~~~~~~~~-------------~~i~~~~~~~-- 256 (395) T protein:vir:38 207 ---VDNIIQWLVNWAAKKDVVTRNAKILEVM---GK----AP-----KKPTISQF-------------DNIKDLENNT-- 256 (395) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHhhcc---cc----cc-----cccccccH-------------HHHHHHHHHh-- Confidence 4566899999999999999999888421 11 00 11122111 1112222211 Q ss_pred HHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCc-----ce- Q lcl|NC_012740. 368 EAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQ-----DY- 441 (528) Q Consensus 368 ~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy- 441 (528) +...-+ ....+||+|.....|... +...|. .-...+.+. -..++|.| ++|++....+. +. T Consensus 257 ----l~~~~~--~~a~~v~n~~~~~~L~~l-----kd~~G~-~l~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~~ 322 (395) T protein:vir:38 257 ----LDPAIE--STSSFITNQSGYNILSKV-----KDADGR-YLMQPDVTS-PDKYLIDG-KPVIRIADKWLPDVSGSHP 322 (395) T ss_pred ----hhhhhc--CCCEEEEcHHHHHHHHHh-----hccCCc-eeeccCcCC-CCcceecc-ceeEEecccccCcCCCcce Confidence 111111 345689999999888653 111110 001111111 11235655 78877543211 11 Q ss_pred EEEEEecCCCccceeEeccccccceeEEecC----ccccceeeeeeeeceee-cC-------cccccCCCccceecccch Q lcl|NC_012740. 442 FTVGYKGDNEMDAGIYYAPYVALTPLRATDP----QSFHPVLGFKTRYGIGI-NP-------FADSKSQAPSARITSGML 509 (528) Q Consensus 442 ~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp----~s~qP~~~~~tRY~l~~-nP-------~~~~~~~~~~~~~~~~~~ 509 (528) +++|--.. .+.....-.+.....-++ ...+=.+-+..||+..+ +| ++...++++. 
T Consensus 323 i~~gd~~~-----~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~-------- 389 (395) T protein:vir:38 323 LYFGDLKQ-----GITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQG-------- 389 (395) T ss_pred EEEEeccc-----cEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCC-------- Confidence 22221000 000000000000011111 12233455555666542 23 1111222111 Q ss_pred HHhhcch Q lcl|NC_012740. 510 SKDSVGK 516 (528) Q Consensus 510 ~~~~a~~ 516 (528) .--.|| T Consensus 390 -~~~~~~ 395 (395) T protein:vir:38 390 -TAGTGK 395 (395) T ss_pred -ccCCCC Confidence 111244 No 89 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=40.92 E-value=0.95 Score=20.71 Aligned_cols=280 Identities=13% Similarity=0.089 Sum_probs=123.5 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccc Q lcl|NC_012740. 78 IASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSL 156 (528) Q Consensus 78 ~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~ 156 (528) .+ +++|.+ .-|.+. .+++.+-++.+-.++|.+.||++...-|. .. .. + +.+ T Consensus 1 ma--~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip-~~---~~------~---------~~a----- 52 (298) T protein:vir:16 1 MV--LNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVF-TF---TM------D---------SEI----- 52 (298) T ss_pred Cc--ccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCceEEE-EE---ec------C---------cce----- Confidence 22 222222 223333 45555667888999999999875421111 00 00 0 000 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccccccccccc Q lcl|NC_012740. 
157 AAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGM 236 (528) Q Consensus 157 ~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gm 236 (528) +-+ T Consensus 53 -------------------------------------------------------------------------~~v---- 55 (298) T protein:vir:16 53 -------------------------------------------------------------------------DVV---- 55 (298) T ss_pred -------------------------------------------------------------------------EEe---- Confidence 000 Q ss_pred chhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_012740. 237 ATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) Q Consensus 237 sTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~ 316 (528) +| +..+++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|...|...|++.++. T Consensus 56 ----~E---------~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~ 121 (298) T protein:vir:16 56 ----AE---------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFH 121 (298) T ss_pred ----cC---------CccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 01 0123333344455555555555567799999875432 135567888888888888888888874 Q ss_pred hhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhc Q lcl|NC_012740. 317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) Q Consensus 317 ~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~ 396 (528) -.... .|.........++......... ..+....++..|..+...+... +.+..-+|++|.....|.. 
T Consensus 122 G~~~~-----~g~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~ 189 (298) T protein:vir:16 122 GVNPR-----LGTASAVIGTNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAK 189 (298) T ss_pred cccCC-----CCcccccccccccccccccccc-----cccccccHHHHHHHHHHHhhhc--CCCccEEEEcHHHHHHHHH Confidence 21100 0000000000000000000000 0111122344455555544442 2355668999999998875 Q ss_pred cccccccccccccccccccccCceEEEEecCceEEEeeCCCC------cceEEEEEecCCCccceeEecccccccee--E Q lcl|NC_012740. 397 ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYAR------QDYFTVGYKGDNEMDAGIYYAPYVALTPL--R 468 (528) Q Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~------~dy~~vG~KG~~~~d~g~fyaPYv~~~~~--~ 468 (528) .. ...|. .-...+.... -.|+|.| ++|+++.+.+ .+.+++|- +..++.|..--.+.+. + T Consensus 190 lk-----d~~G~-~i~~~~~~~~-~~~~l~G-~PV~~~~~v~~~~~~~~~~~~~GD-----fs~~~~~~~~~~~~~~~~~ 256 (298) T protein:vir:16 190 QK-----DLQDN-ALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIGD-----FANGFKWGYAKEVPLEVIQ 256 (298) T ss_pred hh-----ccCCC-eeecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEee-----ccceEEEEEecCceEEEee Confidence 31 11111 0011111111 1357866 8999988754 23444441 1111223222122222 2 Q ss_pred EecCcc-----cc-ceeee--eeeece-eecCcccccCCCccceecccc Q lcl|NC_012740. 469 ATDPQS-----FH-PVLGF--KTRYGI-GINPFADSKSQAPSARITSGM 508 (528) Q Consensus 469 ~~Dp~s-----~q-P~~~~--~tRY~l-~~nP~~~~~~~~~~~~~~~~~ 508 (528) -.|++. || =.++| ..|++. ..+|= -.+++.++. 
T Consensus 257 ~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~-------a~~~l~~at 298 (298) T protein:vir:16 257 YGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT-------KFARVTEAN 298 (298) T ss_pred ccCCcCcchhhhhcCcEEEEEEEEEccEeeccc-------ceEEEeecC Confidence 223332 22 11333 457764 34441 134444443 No 90 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=39.74 E-value=1 Score=20.58 Aligned_cols=362 Identities=14% Similarity=0.024 Sum_probs=129.8 Q ss_pred CcchHHHHH------hhhhhhcCCccchhccchhh----hhhhhhhhhHHHHhhhccccchhhhhhhhcccccccccccc Q lcl|NC_012740. 1 MKTTKELME------KWSPLLENEKLPEIATASKQ----KLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGD 70 (528) Q Consensus 1 ~~~~~~l~~------kw~p~l~~~~~~~~~~~~~~----~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~ 70 (528) +...+.-++ .+...-..+...+.....++ .-..+.-..+........ ......+....+...... T Consensus 70 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~--- 144 (497) T protein:vir:10 70 KDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAAD--PGTAAAELMGAFADGETA--- 144 (497) T ss_pred HHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhh--hHHHHHHHHHHHhhhhhh--- Confidence 110000000 00000000000000000000 000000000000000000 000000000000000000 Q ss_pred CCccccc-cccccccccc---cccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccc Q lcl|NC_012740. 71 HGYNASN-IASGQTTGAI---TNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPM 146 (528) Q Consensus 71 ~g~~~~~-~~e~t~tg~v---~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~ 146 (528) ...... ...+++++.. 
..+.+-+|.+.| +..+..+++.+.||+++..- |.-..+ + T Consensus 145 -~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-------~~~~~~---~------- 203 (497) T protein:vir:10 145 -PAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-------YLTESA---A------- 203 (497) T ss_pred -HHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-------EEEEcC---C------- Confidence 000000 0111222222 133344444444 46677899999999887421 111000 0 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 147 YSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 147 ~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) .+.+. T Consensus 204 -~~~a~-------------------------------------------------------------------------- 208 (497) T protein:vir:10 204 -HNNAA-------------------------------------------------------------------------- 208 (497) T ss_pred -CCcce-------------------------------------------------------------------------- Confidence 00000 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) -.+| +...++...+++++++.+|.-+-...+|-||++|-- +.|+.|.+-|...| T Consensus 209 ------------wv~E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i 262 (497) T protein:vir:10 209 ------------AVAE---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGI 262 (497) T ss_pred ------------eecc---------CcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHH Confidence 0011 112344445566777777776667889999999942 35899999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeecc-------cc------ccceeccccccccccchhHH-----HHH---------- Q lcl|NC_012740. 
307 LLEINREIVDVINFTAQVGKTGMTQTV-------GS------KAGVFDLQDPIDTRGARWAG-----ESF---------- 358 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~-------~~------~~g~~dl~~~~d~~~~r~a~-----e~~---------- 358 (528) ..-+|+.||. =+..-...|+-... .. .+-.+.+....+-. ..|.+ ... T Consensus 263 ~~~~d~~~l~---G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 338 (497) T protein:vir:10 263 QRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT-NGAFVGQDTVASLKYGRVVTGAA 338 (497) T ss_pred HHHHHHHhhc---CCCcccccccccccccccccccccchhhhhhhhhhhhhhcccc-cchhhhhhHHHHHHHHHhhhhhh Confidence 9999999884 11000011110000 00 00000000000000 00100 000 Q ss_pred -------------HHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcc----ccccccccccccccccccccCceE Q lcl|NC_012740. 359 -------------KSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASA----DQGISLAMQGAAQGLNTDTTKAVF 421 (528) Q Consensus 359 -------------r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~ 421 (528) -.+...+...-..+.+...+ .++.+|.+|.....|... |-....+..+...+. .... T Consensus 339 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-----~~~~ 412 (497) T protein:vir:10 339 GSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN-----PVNG 412 (497) T ss_pred hhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCcccccccc-----cccC Confidence 01122223333444454444 577888999888777643 211111111000110 0011 Q ss_pred EEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecC------ccccceeeeeeeece-eecCccc Q lcl|NC_012740. 422 AGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDP------QSFHPVLGFKTRYGI-GINPFAD 494 (528) Q Consensus 422 ~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp------~s~qP~~~~~tRY~l-~~nP~~~ 494 (528) -.+|.| ++|++.+..+.+=+++|--.. .+|-=..-..+.+.+++ .+.+=.+=+..|+++ +.+|=+. 
T Consensus 413 ~~~l~G-~pV~~t~~~~~~~~~~Gd~~~------~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~ 485 (497) T protein:vir:10 413 GKNIWG-VPVVTTPLIPLGTILVGHFAP------SVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) T ss_pred Cceeec-eeeEecCCCCCCceEEeeccc------ceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccE Confidence 225654 899988887766555542211 00100001111222222 122334445678866 6677332 Q ss_pred ccCCCccceecccc Q lcl|NC_012740. 495 SKSQAPSARITSGM 508 (528) Q Consensus 495 ~~~~~~~~~~~~~~ 508 (528) ..-+-.. ..-|+ T Consensus 486 ~~l~~~~--~~~~~ 497 (497) T protein:vir:10 486 QLIQLKK--GATGS 497 (497) T ss_pred EEEEecC--CccCC Confidence 1111000 00111 No 91 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=39.74 E-value=1 Score=20.58 Aligned_cols=362 Identities=14% Similarity=0.024 Sum_probs=129.8 Q ss_pred CcchHHHHH------hhhhhhcCCccchhccchhh----hhhhhhhhhHHHHhhhccccchhhhhhhhcccccccccccc Q lcl|NC_012740. 1 MKTTKELME------KWSPLLENEKLPEIATASKQ----KLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGD 70 (528) Q Consensus 1 ~~~~~~l~~------kw~p~l~~~~~~~~~~~~~~----~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~ 70 (528) +...+.-++ .+...-..+...+.....++ .-..+.-..+........ ......+....+...... T Consensus 70 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~--- 144 (497) T protein:vir:78 70 KDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAAD--PGTAAAELMGAFADGETA--- 144 (497) T ss_pred HHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhh--hHHHHHHHHHHHhhhhhh--- Confidence 110000000 00000000000000000000 000000000000000000 000000000000000000 Q ss_pred CCccccc-cccccccccc---cccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccc Q lcl|NC_012740. 
71 HGYNASN-IASGQTTGAI---TNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPM 146 (528) Q Consensus 71 ~g~~~~~-~~e~t~tg~v---~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~ 146 (528) ...... ...+++++.. ..+.+-+|.+.| +..+..+++.+.||+++..- |.-..+ + T Consensus 145 -~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-------~~~~~~---~------- 203 (497) T protein:vir:78 145 -PAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-------YLTESA---A------- 203 (497) T ss_pred -HHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-------EEEEcC---C------- Confidence 000000 0111222222 133344444444 46677899999999887421 111000 0 Q ss_pred ccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccc Q lcl|NC_012740. 147 YSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEE 226 (528) Q Consensus 147 ~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (528) .+.+. T Consensus 204 -~~~a~-------------------------------------------------------------------------- 208 (497) T protein:vir:78 204 -HNNAA-------------------------------------------------------------------------- 208 (497) T ss_pred -CCcce-------------------------------------------------------------------------- Confidence 00000 Q ss_pred ccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHH Q lcl|NC_012740. 
227 GKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEV 306 (528) Q Consensus 227 g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEI 306 (528) -.+| +...++...+++++++.+|.-+-...+|-||++|-- +.|+.|.+-|...| T Consensus 209 ------------wv~E---------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i 262 (497) T protein:vir:78 209 ------------AVAE---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGI 262 (497) T ss_pred ------------eecc---------CcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHH Confidence 0011 112344445566777777776667889999999942 35899999999999 Q ss_pred HHHhhHHHHhhhhhheeecccceeecc-------cc------ccceeccccccccccchhHH-----HHH---------- Q lcl|NC_012740. 307 LLEINREIVDVINFTAQVGKTGMTQTV-------GS------KAGVFDLQDPIDTRGARWAG-----ESF---------- 358 (528) Q Consensus 307 mlEINReii~~i~~~a~~~~~~~~~~~-------~~------~~g~~dl~~~~d~~~~r~a~-----e~~---------- 358 (528) ..-+|+.||. =+..-...|+-... .. .+-.+.+....+-. ..|.+ ... T Consensus 263 ~~~~d~~~l~---G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 338 (497) T protein:vir:78 263 QRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT-NGAFVGQDTVASLKYGRVVTGAA 338 (497) T ss_pred HHHHHHHhhc---CCCcccccccccccccccccccccchhhhhhhhhhhhhhcccc-cchhhhhhHHHHHHHHHhhhhhh Confidence 9999999884 11000011110000 00 00000000000000 00100 000 Q ss_pred -------------HHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcc----ccccccccccccccccccccCceE Q lcl|NC_012740. 359 -------------KSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASA----DQGISLAMQGAAQGLNTDTTKAVF 421 (528) Q Consensus 359 -------------r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~ 421 (528) -.+...+...-..+.+...+ .++.+|.+|.....|... |-....+..+...+. .... 
T Consensus 339 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-----~~~~ 412 (497) T protein:vir:78 339 GSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN-----PVNG 412 (497) T ss_pred hhccchhccccchhhhhhHHHHHHhhhhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCcccccccc-----cccC Confidence 01122223333444454444 577888999888777643 211111111000110 0011 Q ss_pred EEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEecC------ccccceeeeeeeece-eecCccc Q lcl|NC_012740. 422 AGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDP------QSFHPVLGFKTRYGI-GINPFAD 494 (528) Q Consensus 422 ~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp------~s~qP~~~~~tRY~l-~~nP~~~ 494 (528) -.+|.| ++|++.+..+.+=+++|--.. .+|-=..-..+.+.+++ .+.+=.+=+..|+++ +.+|=+. T Consensus 413 ~~~l~G-~pV~~t~~~~~~~~~~Gd~~~------~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~ 485 (497) T protein:vir:78 413 GKNIWG-VPVVTTPLIPLGTILVGHFAP------SVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) T ss_pred Cceeec-eeeEecCCCCCCceEEeeccc------ceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccE Confidence 225654 899988887766555542211 00100001111222222 122334445678866 6677332 Q ss_pred ccCCCccceecccc Q lcl|NC_012740. 495 SKSQAPSARITSGM 508 (528) Q Consensus 495 ~~~~~~~~~~~~~~ 508 (528) ..-+-.. ..-|+ T Consensus 486 ~~l~~~~--~~~~~ 497 (497) T protein:vir:78 486 QLIQLKK--GATGS 497 (497) T ss_pred EEEEecC--CccCC Confidence 1111000 00111 No 92 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=38.28 E-value=1.1 Score=20.41 Aligned_cols=338 Identities=15% Similarity=0.160 Sum_probs=128.9 Q ss_pred CcchHHHHHhhhhhhc-C---Cccchhccchhhhhhh--hhhhhHHHHhhhcc--------ccchhh--------h---- Q lcl|NC_012740. 
1 MKTTKELMEKWSPLLE-N---EKLPEIATASKQKLVA--KILESQEADFAVDP--------IYKDEK--------V---- 54 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~-~---~~~~~~~~~~~~~~~~--~~~enq~~~~~~~~--------~~~~~~--------~---- 54 (528) +......+++= ..+ . |.+.++.+...+ ..+ .=|+.|.+.+..+. ...... . T Consensus 15 ~~e~~~~l~~~--~~~~~~~~e~~~~l~~ei~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (389) T protein:vir:10 15 CADLNAQLNAK--LQDENASVDDFQKIKDDLTA-AKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAK 91 (389) T ss_pred HHHHHHHHHHH--HHhHhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHH Confidence 11111111100 000 0 011111111000 000 01222222221110 000000 0 Q ss_pred hhhhccccccccccccCCcccccccccccc-ccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeec Q lcl|NC_012740. 55 VEAFGGFIAEAEVAGDHGYNASNIASGQTT-GAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYG 131 (528) Q Consensus 55 ~~~~~~~l~ea~~~~~~g~~~~~~~e~t~t-g~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~ 131 (528) ..++..+|- ..+.....+.+++++ |.+. =|--+ .++++..+..+..++|.|.||+++++-+--++. . T Consensus 92 ~~~~~~~lr------~~~~~~~~~~~~t~~~gg~~--vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~ 161 (389) T protein:vir:10 92 KKAINDFIH------SHGKVIDATSKVTSTEAGVL--IPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR--A 161 (389) T ss_pred HHHHHHHhh------cchhhhhhhcccccCCccee--ehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec--C Confidence 011111110 011111222233222 2221 12222 356666677788899999999988643322221 0 Q ss_pred CCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCc Q lcl|NC_012740. 132 GDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKA 211 (528) Q Consensus 132 ~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~ 211 (528) .. 
...+ T Consensus 162 ~~----------------~~~~---------------------------------------------------------- 167 (389) T protein:vir:10 162 TD----------------RFSS---------------------------------------------------------- 167 (389) T ss_pred CC----------------cccc---------------------------------------------------------- Confidence 00 0000 Q ss_pred ccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcC Q lcl|NC_012740. 212 DSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHG 291 (528) Q Consensus 212 ~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHG 291 (528) .+. .++. .......|.+..+++.|..+- ..+|-||.+|- . T Consensus 168 --------------------~~E-----~~~~----~~~~~~~~~~i~~~~~k~~~~-------~~iS~ell~ds----~ 207 (389) T protein:vir:10 168 --------------------VAE-----LAEN----PKLAEPEFNKVDWSVATYRGA-------IPLSEEAIADS----A 207 (389) T ss_pred --------------------ccc-----cccc----cccccccceeeeeeheeeEee-------ehhhHHHHhhh----h Confidence 000 0000 001123467777777776654 45999999984 3 Q ss_pred CChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_012740. 292 MDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAE 371 (528) Q Consensus 292 LDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~ 371 (528) .|.+++|.+-|...+..-+|+.|+.-+.... +.|+.... ..+.+..++.... T Consensus 208 ~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-------------~~~~~~~~----------~~d~l~~~~~~~~----- 259 (389) T protein:vir:10 208 VDLTALVGQSIKEKSVNTYNAMIAPVLQSFT-------------AKKTTTDT----------LVDSLKHILNVDL----- 259 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccc-------------cccccccc----------cHHHHHHHHHhhh----- Confidence 4678899999999999999999885432111 11111000 0112223322111 Q ss_pred HHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEe-eCC-CCc---c-eEEEE Q lcl|NC_012740. 
372 IARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFI-DQY-ARQ---D-YFTVG 445 (528) Q Consensus 372 I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~-D~y-~~~---d-y~~vG 445 (528) ...+ ...+|+++.....|...- +.-...-...+. .+.+...+-++|.| ++||+ |.. .+. | .+++| T Consensus 260 ---~~~~--~a~~~~n~~~~~~L~~lk--d~~G~~i~~~~~-~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~g 330 (389) T protein:vir:10 260 ---DPAY--SRALVVTQSLFNTLDTLK--DKNGRYLLHDAS-DSITDGTAKGTILG-VPVYVVGDTLLGSLAGDQKAFVG 330 (389) T ss_pred ---hhhh--CcEEEecHHHHHHHHHhh--ccCCCeeeecCc-cccccccccccccc-ceeEEecccccCCCCCceEEEEe Confidence 1122 245789999988888631 000000000001 11112223456766 77775 322 221 1 13333 Q ss_pred EecCCCccceeEeccccccceeEEecCccccceeeeeeeeceee-cCcccccCCCccceecccc-hHHhhcch Q lcl|NC_012740. 446 YKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGM-LSKDSVGK 516 (528) Q Consensus 446 ~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~-~~~~~a~~ 516 (528) = +..+.++... ........|-..|.-.+-..-|++..+ ||=+ .+.+.-. --...++| T Consensus 331 d-----~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a--------~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 331 D-----LKRGVLFTDR-QQVTLAWEDSKIYGKYLGAAFRFGVQKADSKA--------GYFVTNTDVPGSALGK 389 (389) T ss_pred e-----ccccEEEEee-cceEEEeeccccccceEEEEEEeccEEecccc--------eEEEEeeccCCCCCCC Confidence 0 0000001100 111223445556666777778888752 3300 0000000 00122333 No 93 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=37.97 E-value=1.1 Score=20.38 Aligned_cols=273 Identities=11% Similarity=0.002 Sum_probs=114.4 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 
146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....|-. .+. -.|-....+....+. ..............-. .. T Consensus 1 ma~~~T~l------------------~d~--iiPev~~~~v~~~~~-------~~l~~~~~~~~d~~l~---------g~ 44 (274) T protein:vir:12 1 MAQGLTKT------------------SNQ--IIPEVLAPMMQAQLE-------KKLRFASFAEVDSTLQ---------GQ 44 (274) T ss_pred CCcceeeh------------------hhh--hchHHHHHHHHHHHH-------hhhhhcccceeccccc---------CC Confidence 10000000 000 001111111110000 0000000000000000 00 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) .|...+...=-.+..+|.. .........++..+=. +++-+-|+-. |.+.=-..+.+ +-|.-.|..+-++.- T Consensus 45 ~G~tv~iP~~~~ig~a~~~---~~g~~i~~~~lt~~~~--~~~i~~~~~~--~~i~D~~~~~~--~~d~~~~~~~q~~~~ 115 (274) T protein:vir:12 45 PGDTLTFPAFVYSGDAQVV---AEGEKIPTDILETKKR--EAKIRKIAKG--TSITDEALLSG--YGDPQGEQVRQHGLA 115 (274) T ss_pred CCCEEEEeeecCCCccccc---cCCCccchhhccccee--eEEeeeecce--eeecHHHHHhc--ccchHHHHHHHHHHH Confidence 1111111110011122211 1222334445444433 3333444322 32211122333 578899999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) |..+++.+++..+..+.. . +.+ ..+ ..+-+-....++..+. 
..++++| T Consensus 116 ~a~~vd~~~l~~~~~a~~-~-------~~~--~a~-------------~~d~i~dA~~~lgd~~---------~~~~~iv 163 (274) T protein:vir:12 116 HANKVDNDVLEALMGAKL-T-------VNA--DIT-------------KLNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHhcccc-c-------ccc--ccc-------------CHHHHHHHHHHhcccc---------ccccEEE Confidence 999999999987653222 1 101 111 1232333333333321 1578999 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) |+|.|++.|......+....... ......+-.+|.+.| ++||+|...|..-..+--+|.- .||. --+.. T Consensus 164 v~p~~~~~L~k~~~~~fv~~s~~----g~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gA~-----~~~~-~~~~~ 232 (274) T protein:vir:12 164 INPLDAGKLRGDASTNFTRATEL----GDDIIVKGAFGEALG-AIIVRSNKLEAGTAILAKKGAV-----KLIL-KRDFF 232 (274) T ss_pred eCHHHHHHHHhhhhhhccccccc----cccceecccceeecC-eeEEEeCCCCcceEEEEeccce-----eeee-cCCce Confidence 99999999987643222221110 011122235888865 9999999887532211111211 1221 11222 Q ss_pred eeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhh Q lcl|NC_012740. 466 PLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDS 513 (528) Q Consensus 466 ~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~ 513 (528) .-.--||..++=.+-..-+||+. 
.|| + .-.+++.++-.-.| T Consensus 233 vE~~Rd~~~~~d~i~~~~~y~~~~~~~-----~--~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 233 LEVARDASTKTTALYSDKHYVAYLYDE-----S--KAVKITKGSGSLEM 274 (274) T ss_pred eccccchhhcccEEEeeeEEEEEEEcC-----C--ceEEEEcCCccccC Confidence 33456889999999999999964 355 0 11222211111112 No 94 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=37.07 E-value=1.1 Score=20.28 Aligned_cols=267 Identities=10% Similarity=0.058 Sum_probs=109.5 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....|.. .+.- .|-...++....+.. .....+....... ... . T Consensus 1 ma~~~T~~------------------~d~i--iPev~~~~v~~~~~~-------~~~~~~~~~~~~~-------l~g--~ 44 (272) T protein:vir:36 1 MSKQKTTL------------------ADLV--NPEVLAPIVSYELNK-------ALRFAPLAQVDTT-------LQG--Q 44 (272) T ss_pred CCCcceeh------------------hhhh--chHHHHHHHHHHHHh-------hhhhccccccccc-------ccc--C Confidence 10000000 0000 011111111000000 0000000000000 000 0 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANE 305 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStE 305 (528) .|...+...=-....+|.+ .....-+..++ +..+.+++-|-|+-.-++|=|. ++.-+-|.-.|+.+-++.. 
T Consensus 45 ~G~ti~iP~~~~~gda~~~---~eg~~i~~~~l--t~~~~~~~i~~~~k~~~vtD~~----~~~~~~d~~~~~~~~~a~~ 115 (272) T protein:vir:36 45 PGNTLKFPAFTYIGDAADV---AEGGEISLDKI--GTTTKSVTIKKAAKGTEITDEA----ALSGYGDPIGESNKQLGLS 115 (272) T ss_pred CCCEEEEeeeccCcccccc---CCCCccChhhc--CCcceeEeeehhhccccccHHH----HhhccchHHHHHHHHHHHH Confidence 1111111110011222211 11122233444 3455566666665322222222 1223679999999999999 Q ss_pred HHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEE Q lcl|NC_012740. 306 VLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVI 385 (528) Q Consensus 306 ImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v 385 (528) +..+++++|+..+.-. +.++ .+.++ .+.+-.+..++.++. ...+++| T Consensus 116 ~a~~~d~~i~~~l~~~--------~~~~---~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~iv 162 (272) T protein:vir:36 116 LANKVDDDLLSAAKTT--------SQTV---STKAN-------------VDGVQAALDIFNDED---------AQAYVLI 162 (272) T ss_pred HHHHHHHHHHHHhccc--------cccc---ccccc-------------HHHHHHHHHHhhhcC---------CCceEEE Confidence 9999999999765311 1111 11111 122222223333221 2467999 Q ss_pred EchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc---eEEEEE-ecCCCccceeEeccc Q lcl|NC_012740. 386 ASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD---YFTVGY-KGDNEMDAGIYYAPY 461 (528) Q Consensus 386 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---y~~vG~-KG~~~~d~g~fyaPY 461 (528) |+|.++..|..-.-+........ . ....+-.+|.+.| ++|++|...|.+ |..+.. +|. -.+|..= T Consensus 163 v~p~~~~~L~k~~~~~~~~~~~~---~--~~~~~G~ig~~~G-~~Vv~s~~~p~~~~~~~~~~~~~gA-----~~~~~~~ 231 (272) T protein:vir:36 163 VNPKDAAKIRKDANAKNIGSEVG---A--NALINGTYADVLG-AQIVRSKKLAEGSALMFKIVSNSPA-----LKLVLKR 231 (272) T ss_pred EcHHHHHHHhccccccccccccc---c--cceeeeccceecC-eeEEEeCCCCCCceeEEEEEecccc-----eeeeecC Confidence 99999999976443333221111 1 1111123678866 999999997654 111111 121 1112110 Q ss_pred cccceeEEecCccccceeeeeeeeceee-cCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 
462 VALTPLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 462 v~~~~~~~~Dp~s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) ....-.--|+..++=.+--.-+||+.+ || . .-++++ .||+ T Consensus 232 -~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~-----~--~vv~~t-------------------~~g~ 272 (272) T protein:vir:36 232 -GVQVETDRDIVTKTTVITADEHYAAYLYDL-----T--KVVNIT-------------------FTGV 272 (272) T ss_pred -CcccccccchhhcCcEEEEEEEEEEEEEcC-----c--cEEEEe-------------------ecCC Confidence 111223458888888888888888753 55 0 112222 2222 No 95 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=36.78 E-value=1.2 Score=20.24 Aligned_cols=286 Identities=14% Similarity=0.095 Sum_probs=129.0 Q ss_pred CCcccceeeeeeeeecCCCCCCccccccc--------------ccccccccccccccCcccccccccccccccccccccc Q lcl|NC_012740. 116 MSTPTSQIFAIRSVYGGDPLAEHAKEAFH--------------PMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIA 181 (528) Q Consensus 116 mTgPTGLIFAMRsrY~~~~~s~~G~EA~~--------------n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a 181 (528) ||.|||++=+.. +...+ -.+.+. +.+-++..|... T Consensus 1 ~~~~~~i~s~~~----~~~it--v~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~------------------------- 49 (318) T protein:vir:10 1 MTAPTGIVSVSD----GPAIT--VRELVGNPLWIPTALKKMMVNQFISESLFRNG------------------------- 49 (318) T ss_pred CCCCCcceeeec----CCcee--hHHhhCCchhHHHHHHHHHhccchhhhhhhcc------------------------- Confidence 999999885544 22221 111111 001111112111 Q ss_pred ccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeE Q lcl|NC_012740. 182 AGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMR 261 (528) Q Consensus 182 ~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFS 261 (528) +......+.+....+. | .....|.-. .+..|+.-.-. 
T Consensus 50 ------------~a~~~~~v~f~~~~p~--------------------~------~~~d~e~Va-----EggEiP~~~~~ 86 (318) T protein:vir:10 50 ------------GANPNGVVAYNEGNPS--------------------F------LEDDVADVA-----EFGEIPVSAGA 86 (318) T ss_pred ------------cccccceeEEEecccc--------------------c------ccCcHhhcc-----CcccccccCCC Confidence 1000000000000000 0 001111110 01123322223 Q ss_pred E-eeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeeccc---ceeecccccc Q lcl|NC_012740. 262 I-DKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKT---GMTQTVGSKA 337 (528) Q Consensus 262 I-EK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~---~~~~~~~~~~ 337 (528) . ++....+|.+.||-++|=|.. .-+.+|+-.....-|++-|...+|+.+++.|......... .|.....-.. T Consensus 87 ~G~~~ia~~~K~G~~~~vS~Em~----~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~ 162 (318) T protein:vir:10 87 RGLPRTAFAVKKALGVRVSKEMI----DENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRT 162 (318) T ss_pred CCchhhhhhehhccceeccHHHH----hhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccc Confidence 3 222223457889999998864 3467899999999999999999999999977543332222 2321000011 Q ss_pred ceeccccccccccchhHHHHHHH-HHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhcc-cccccccccccccccccc Q lcl|NC_012740. 338 GVFDLQDPIDTRGARWAGESFKS-LIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASA-DQGISLAMQGAAQGLNTD 415 (528) Q Consensus 338 g~~dl~~~~d~~~~r~a~e~~r~-L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~-g~~~~~~~~~~~~~~~~d 415 (528) +++| |+|..+. +...+.+....-..+=.| -.|.||.+|...+.|... .|..+.+.-+........ 
T Consensus 163 d~~~------------A~e~v~~a~~~~~~a~~~~~~~~~GY-~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~ 229 (318) T protein:vir:10 163 DIAI------------AIEQISTAAPTAYPAGVGSSDEYFGF-IPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPD 229 (318) T ss_pred cchh------------hhhhhhhhhhhhhhhhhhhhhhccCc-cceeeEECHHHHHHHhcchhhhhhhhccchhhhhccc Confidence 2222 2222221 112222222233346678 599999999999999543 222221110000000111 Q ss_pred ccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccccceeEEe----cCccccceeeeeeeece---- Q lcl|NC_012740. 416 TTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRAT----DPQSFHPVLGFKTRYGI---- 487 (528) Q Consensus 416 ~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~~~----Dp~s~qP~~~~~tRY~l---- 487 (528) .+ ..|.|.+-| ++|..+++.|.|=.+|==+|. -| ||+-=.|++.+-.. || ..+|-..-..|+=- T Consensus 230 ~t-g~~~g~~lG-l~vi~s~~~p~~~alvlq~g~----vG-~~~d~~pl~~t~~~~egg~~-~g~~~~s~~~~~~~~~~~ 301 (318) T protein:vir:10 230 WT-GNFPGSVMG-LNVIRSRTFPIDRVLIMERGT----VG-FYSDTRPLQFTALYPEGNGP-NGGPTESYRADASHKRAL 301 (318) T ss_pred cc-ccccceeec-eEEeecCccCCCeeEEEecCC----cc-eeeccccceeeecccCCCCC-CCCcchhhheehheeeee Confidence 22 445676655 999999999888765544431 11 55433444433222 44 24444444433322 Q ss_pred -eecCcccccCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 488 -GINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 488 -~~nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) +..|++. 
|+++|| T Consensus 302 ~V~~PkA~----------------------------~~itgi 315 (318) T protein:vir:10 302 AVDQPKAA----------------------------LWLTGI 315 (318) T ss_pred eeeCccee----------------------------EEEeec Confidence 2233332 333444 No 96 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=35.85 E-value=1.2 Score=20.14 Aligned_cols=334 Identities=15% Similarity=0.145 Sum_probs=126.1 Q ss_pred CcchHHHHHhhhhhhcC--------------Cc--cchhccchhhhh---hhh--hhhhHHHHhhhccccc--------- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN--------------EK--LPEIATASKQKL---VAK--ILESQEADFAVDPIYK--------- 50 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~--~~~~~~~~~~~~---~~~--~~enq~~~~~~~~~~~--------- 50 (528) |.+.++|.++|.-+.+. +. ..+|... +..+ .++ -|+.|.++..+..... T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSEL-KNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPL 82 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 55788888888554431 00 1112111 1111 110 0122222221110000 Q ss_pred -------hhhhhhhhccccccccccccCC----cccccccccccc-ccccccCcchh--hHHHHHHhhhhhhhceeeecC Q lcl|NC_012740. 51 -------DEKVVEAFGGFIAEAEVAGDHG----YNASNIASGQTT-GAITNVGPAVI--GMVRRAIPNLIAFDICGVQPM 116 (528) Q Consensus 51 -------~~~~~~~~~~~l~ea~~~~~~g----~~~~~~~e~t~t-g~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPm 116 (528) ......+|..++. ..++ -+...+..++.+ |... . 
|.-+ .+++.+.......+++.+.|| T Consensus 83 ~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~t~~~gg~~-v-P~~~~~~Ii~~~~~~~~l~~~~~~~~~ 155 (408) T protein:vir:10 83 NKSENELKDKFVKDFVNMVR-----NPMAFMNTVSSKTETSGSDSAAGLT-I-PQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) T ss_pred ccchhhhHHHHHHHHHHHhh-----cchhhhhhhhhhhhhcccccCCcee-c-cHhHHHHHHHHHHhhchhhhhcceeec Confidence 0001111111110 0010 011122222211 1111 1 3222 355556667778899999999 Q ss_pred CcccceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 117 STPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIA 196 (528) Q Consensus 117 TgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~ 196 (528) +++.|-+--.|-. +. .+.+.+ T Consensus 156 ~~~~~~~~~~~~~--~~--------------~~~a~~------------------------------------------- 176 (408) T protein:vir:10 156 STSNGSRVYEKWT--DV--------------TPLTVM------------------------------------------- 176 (408) T ss_pred cCCcceEEEeecc--cc--------------ccceee------------------------------------------- Confidence 9987765433210 00 000000 Q ss_pred cccccccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccc Q lcl|NC_012740. 197 YLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKA 276 (528) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKA 276 (528) ++.+ ++ ........|.++.|++.|..+- . T Consensus 177 -----------------------------------v~E~-----~~----~~~~~~~~~~~i~~~~~k~~~~-------~ 205 (408) T protein:vir:10 177 -----------------------------------DAED-----GK----IPDLDNPQLTIIKYLIKRYAGI-------I 205 (408) T ss_pred -----------------------------------ecCc-----cc----cccccCcceeeEEeeeeeEEee-------e Confidence 0000 00 0001123466666666666654 4 Q ss_pred cchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHH Q lcl|NC_012740. 
277 RYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGE 356 (528) Q Consensus 277 EYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e 356 (528) .+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-... ...|+.++ + T Consensus 206 ~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~------------~~~~~~~~-------------~ 256 (408) T protein:vir:10 206 TATNTSLKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------KKPTIAKF-------------D 256 (408) T ss_pred hhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------cccccccH-------------H Confidence 5999999994 46778999999999999999998884221100 11122111 1 Q ss_pred HHHHH-HHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeC Q lcl|NC_012740. 357 SFKSL-IYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQ 435 (528) Q Consensus 357 ~~r~L-~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~ 435 (528) ....+ +..+. ..+-..-.+|||+.....|...- ...|. .-...+.+.. ..++|.| ++|++-. T Consensus 257 ~l~~~~~~~~~---------~~~~~~a~~v~n~~~~~~l~~lk-----d~~G~-~i~~~~~~~~-~~~~l~G-~PV~~~~ 319 (408) T protein:vir:10 257 DVITMINTAVD---------PAIIATSSLLTNQSGLNKLALVK-----TAEGK-YLLEPDPTKP-NSYLIKG-KQVIVVA 319 (408) T ss_pred HHHHHHHHhhh---------hhhccCCEEEEcHHHHHHHHHhh-----ccCCc-eEeccCcCCC-CCceecc-eeeEEec Confidence 11111 11111 12212235789999998887641 11110 0011111111 1235655 7777632 Q ss_pred C--CC--------------cceEEEEEecCCCccceeE-eccccccc---------eeEEecCccccceeeeeeeeceee Q lcl|NC_012740. 436 Y--AR--------------QDYFTVGYKGDNEMDAGIY-YAPYVALT---------PLRATDPQSFHPVLGFKTRYGIGI 489 (528) Q Consensus 436 y--~~--------------~dy~~vG~KG~~~~d~g~f-yaPYv~~~---------~~~~~Dp~s~qP~~~~~tRY~l~~ 489 (528) + .+ .++++++.++.....-+-+ |..|.-.+ -..+.||+.|. 
+..|.=.+ T Consensus 320 ~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~-----~~~~~~~~ 394 (408) T protein:vir:10 320 DRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV-----AGSFSAIA 394 (408) T ss_pred ccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEE-----EEEeeccc Confidence 2 11 1122233222111100000 00000000 00234444432 22221111 Q ss_pred cCcccccCCCccceecccchH Q lcl|NC_012740. 490 NPFADSKSQAPSARITSGMLS 510 (528) Q Consensus 490 nP~~~~~~~~~~~~~~~~~~~ 510 (528) -+ .+....+.+ +.. T Consensus 395 ~~--~~~~~~~~~-----~~~ 408 (408) T protein:vir:10 395 DQ--VGNFKTTTS-----TAV 408 (408) T ss_pred cC--CCCCCCCCc-----ccC Confidence 00 011111110 000 No 97 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=32.93 E-value=1.4 Score=19.80 Aligned_cols=280 Identities=13% Similarity=0.069 Sum_probs=116.1 Q ss_pred cccccccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccccccccccccccccccc Q lcl|NC_012740. 78 IASGQTTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSL 156 (528) Q Consensus 78 ~~e~t~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~ 156 (528) .+ +.+|.+ .-|.+. .+++.+.++.+..+++.+.||++.. ++ |+-..+ + +.+.| T Consensus 1 ma--~~gG~l--ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-----~~--~p~~~~---~---------~~a~~--- 54 (298) T protein:vir:94 1 MV--LNKGTL--FDPELVTDLISKVAGKSSIARLSAQKPIPFNG-----EK--VFTFTM---D---------SEIDV--- 54 (298) T ss_pred Ce--eccccc--cChhHHHHHHHHHHhhchhhhhcceeeccCCc-----eE--EEEEec---C---------cceEE--- Confidence 11 222222 234443 4666667788889999999987632 11 110000 0 00000 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccccccccccccccccccc Q lcl|NC_012740. 
157 AAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGM 236 (528) Q Consensus 157 ~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~Gm 236 (528) ++.| T Consensus 55 ---------------------------------------------------------------------------v~Eg- 58 (298) T protein:vir:94 55 ---------------------------------------------------------------------------VAES- 58 (298) T ss_pred ---------------------------------------------------------------------------eeCC- Confidence 0000 Q ss_pred chhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHh Q lcl|NC_012740. 237 ATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVD 316 (528) Q Consensus 237 sTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~ 316 (528) |.. ......|.++.|...|..+ ....|-||.|+--. -..+-+++|.+-|...|...|+.-++. T Consensus 59 -----~~~----~~~~~~f~~v~l~~~k~~~-------~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~ 121 (298) T protein:vir:94 59 -----GKK----THGGVTLAPQTMVPIKVEY-------GARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFH 121 (298) T ss_pred -----ccc----cccccceeEEEEeeeEEEE-------eeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 000 0112346666666666654 35688998764221 013345666666666666666666663 Q ss_pred hhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhc Q lcl|NC_012740. 317 VINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) Q Consensus 317 ~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~ 396 (528) -.... -|.... .....++......... .......++.-+.++-..+... +.+...+|++|+....|.. 
T Consensus 122 G~~~~--~g~~~~---~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~ 189 (298) T protein:vir:94 122 GVNPR--LGTASA---VIGTNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAK 189 (298) T ss_pred ccccC--CCcccc---cccccccccccccccc-----cccccccHHHHHHHHHHhhhhc--CCCccEEEEcHHHHHHHHH Confidence 21100 000000 0000000000000000 0011112233344444444332 2356679999999998865 Q ss_pred cccccccccccccccccccccCceEEEEecCceEEEeeCCCC------cceEEEEEecCCCccceeEeccccccceeE-- Q lcl|NC_012740. 397 ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYAR------QDYFTVGYKGDNEMDAGIYYAPYVALTPLR-- 468 (528) Q Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~------~dy~~vG~KG~~~~d~g~fyaPYv~~~~~~-- 468 (528) .. ...|. .-...+.++. -.|+|.| ++|++++.-+ .+.+++| +-. .++.|...-.+.+.+ T Consensus 190 lk-----d~~G~-~l~~~~~~~~-~~~tl~G-~PV~~~~~v~~~~~~~~~~~~~G---dfs--~~~~~~~~~~~~~~~~~ 256 (298) T protein:vir:94 190 QK-----DLQGN-ALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIG---DFA--NGFKWGYAKEVPLEVIQ 256 (298) T ss_pred hh-----ccCCC-eeecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEe---ecc--ceEEEEEecCceEEEee Confidence 31 11110 0011111111 1357766 8999888643 2223333 111 122344333333322 Q ss_pred EecCcc-----cc-ceeee--eeeecee-ecCcccccCCCccceecccc Q lcl|NC_012740. 469 ATDPQS-----FH-PVLGF--KTRYGIG-INPFADSKSQAPSARITSGM 508 (528) Q Consensus 469 ~~Dp~s-----~q-P~~~~--~tRY~l~-~nP~~~~~~~~~~~~~~~~~ 508 (528) -.||+. || =.++| ..|++.. .+| + -.+++.+.. 
T Consensus 257 ~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~------~-a~~~l~~~t 298 (298) T protein:vir:94 257 YGDPDNSGLDLKGYNQVYIRAELFLGWGILDA------T-KFARVTEAN 298 (298) T ss_pred cCCCcCcchhhhhcCcEEEEEEEEeccEeecc------c-ceEEEEecC Confidence 122221 22 12344 5577755 344 1 134555444 No 98 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=31.23 E-value=1.5 Score=19.60 Aligned_cols=349 Identities=15% Similarity=0.113 Sum_probs=134.4 Q ss_pred CcchHHHHHhhhhhhcCCc--cchhcc------c---hhhhhhhhh------hhhHHHHhhhccc------------cc- Q lcl|NC_012740. 1 MKTTKELMEKWSPLLENEK--LPEIAT------A---SKQKLVAKI------LESQEADFAVDPI------------YK- 50 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~--~~~~~~------~---~~~~~~~~~------~enq~~~~~~~~~------------~~- 50 (528) .-+.++|+++|.-+.+.-. .-++.. . ..+.+.+.| ++.+++.+.+.+. .. T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (404) T protein:vir:39 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 2267888888887755300 000000 0 001111111 0011111111000 00 Q ss_pred ------hhhhhhhhccccccccccccCCccccccccccc-cccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccce Q lcl|NC_012740. 51 ------DEKVVEAFGGFIAEAEVAGDHGYNASNIASGQT-TGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQ 122 (528) Q Consensus 51 ------~~~~~~~~~~~l~ea~~~~~~g~~~~~~~e~t~-tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGL 122 (528) ......+|..++.-.. ......+...+..+++ +|.+. .-+.+. 
.+++.+-++....++|.++||+++++- T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~a~~~~t~~~gg~~-iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (404) T protein:vir:39 84 KSEYELKDKFVKEFVNMVRNPM-AFLNTVSSKTETSGSDSAAGLT-IPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (404) T ss_pred cchhhhHHHHHHHHHHHHhcch-hhhhhhhhhhhhcccccCCcee-ccHHHHHHHHHHHHhhhhHHhhcceeeccCCcce Confidence 0000111111110000 0000011111222221 11111 111121 344444567778899999999988655 Q ss_pred eeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 123 IFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVT 202 (528) Q Consensus 123 IFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~ 202 (528) +--.|- .+. .+.+.+ T Consensus 162 ~~~~~~--~~~--------------~~~a~~------------------------------------------------- 176 (404) T protein:vir:39 162 RVYEKW--TDV--------------TPLTVM------------------------------------------------- 176 (404) T ss_pred EEEEee--cCC--------------ccceee------------------------------------------------- Confidence 432220 000 000000 Q ss_pred cccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHH Q lcl|NC_012740. 203 AEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEV 282 (528) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~EL 282 (528) ++.| ++ ........|.++.|++.|..+-.+ +|-|| T Consensus 177 -----------------------------v~Eg-----~~----~~~~~~~~f~~i~~~~~k~~~~~~-------iS~el 211 (404) T protein:vir:39 177 -----------------------------DAED-----GK----IPDLDNPRLTIIKYLIKRYAGIIT-------ATNTL 211 (404) T ss_pred -----------------------------ecCc-----cc----cccccccceeeEEeeeeeEEeeeh-------hHHHH Confidence 0000 00 000112357777788777776644 99999 Q ss_pred HHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHH Q lcl|NC_012740. 
283 AQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLI 362 (528) Q Consensus 283 AQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~ 362 (528) .+|- ..|.+++|.+-|+..|..-+|+.||.-. |.. ....+..++++ ...++ T Consensus 212 l~ds----~~~l~~~i~~~l~~~~~~~~d~~il~g~---------g~~---~~~~~~~~~~~-------------i~~~~ 262 (404) T protein:vir:39 212 LKDT----AENILAWLSSWIAKKVVVTRNQAIIAAM---------GTV---PKKPTIAKFDD-------------VITMI 262 (404) T ss_pred Hhhc----hHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc---ccccccccHHH-------------HHHHH Confidence 9984 3577999999999999999999998421 110 01122222211 11111 Q ss_pred HHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceE Q lcl|NC_012740. 363 YQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYF 442 (528) Q Consensus 363 ~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 442 (528) .. .+ ...+.....+||+|.....|...- ...|. .-...+.+.. -.++|.| ++|++-.+. T Consensus 263 ~~------~~--~~~~~~~a~~v~n~~~~~~L~~lk-----d~~G~-~l~~~~~~~~-~~~~l~G-~pV~~~~~~----- 321 (404) T protein:vir:39 263 NT------SV--DPAIIATSSLLTNQSGLNKLALVK-----TAEGK-YLLEPDPTKP-NSYLIKG-KKVIVVADR----- 321 (404) T ss_pred HH------hh--hhhhccCCEEEEcHHHHHHHHHhh-----ccCCc-eeeccCcCCC-Ccceecc-eeEEEeccc----- Confidence 10 01 111223456899999999998631 11110 0011111111 1246655 677763221 Q ss_pred EEEEecCCCccceeEeccccc-------cceeEEecCc------cccceeeeeeeecee-ecCcccccCCCccceecccc Q lcl|NC_012740. 443 TVGYKGDNEMDAGIYYAPYVA-------LTPLRATDPQ------SFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGM 508 (528) Q Consensus 443 ~vG~KG~~~~d~g~fyaPYv~-------~~~~~~~Dp~------s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~ 508 (528) .++-.+... ..+||.-+-. .-....+++. ..+=.+-...||+.. .+|-+...-.-..+ -+. 
T Consensus 322 ~~~~~~~~~--~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~---a~~ 396 (404) T protein:vir:39 322 WLPNSGSTV--YPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI---ADQ 396 (404) T ss_pred ccCccCCCc--cEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc---ccC Confidence 111111110 1122221110 0011122222 233445566777754 24421110000000 001 Q ss_pred hHHhhcch Q lcl|NC_012740. 509 LSKDSVGK 516 (528) Q Consensus 509 ~~~~~a~~ 516 (528) -...-+|| T Consensus 397 ~~~~~~~~ 404 (404) T protein:vir:39 397 VGNFTAGK 404 (404) T ss_pred CCCCCCCC Confidence 11234566 No 99 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=31.23 E-value=1.5 Score=19.60 Aligned_cols=325 Identities=14% Similarity=0.090 Sum_probs=126.9 Q ss_pred CcchHHHHHhhhhhhcC-----Cccc----------hhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN-----EKLP----------EIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEA 65 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~-----~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 65 (528) +-..+.|.++....-+- +..+ +-...+|+. ..+.|.+++..-.+ ...+. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~---------~~~~~------ 98 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE---------REFLE------ 98 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH---------HHHHh------ Confidence 33333444443321100 0000 011111111 11111111100000 00000 Q ss_pred cccccCCccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccc Q lcl|NC_012740. 66 EVAGDHGYNASNIASGQT-TGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKE 141 (528) Q Consensus 66 ~~~~~~g~~~~~~~e~t~-tg~v~---~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~E 141 (528) .+ ........+++ .|.+. 
.+.+. +++.+..+..-.+++.+.||++++|-+.-.+ ..+. T Consensus 99 ----~~-~~~~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~~-------- 160 (392) T protein:vir:10 99 ----DD-LEQRAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRVLEK--NSDM-------- 160 (392) T ss_pred ----hh-hhhhhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC-------- Confidence 00 00011111121 12111 12233 3444445666778999999998865422111 1000 Q ss_pred cccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccc Q lcl|NC_012740. 142 AFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVM 221 (528) Q Consensus 142 A~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (528) +.+.| T Consensus 161 -------~~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 161 -------IPFAE-------------------------------------------------------------------- 165 (392) T ss_pred -------cccee-------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_012740. 222 KLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAI 301 (528) Q Consensus 222 ~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanI 301 (528) ++.+ ++. ..+....|.++.+...|+.+- ..+|-||.+|- ..|.+++|.+- T Consensus 166 ----------v~E~-----~~~----~~~~~~~~~~v~l~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~ 215 (392) T protein:vir:10 166 ----------ITEM-----GEI----PETDNPKFSNVQYAVKDRAGI-------LPLSRSLLQDS----DQNILKYVTKW 215 (392) T ss_pred ----------eccc-----ccc----cccccccceeEEeeeeeEEEe-------ehhhHHHHhhh----HHHHHHHHHHH Confidence 0000 000 001123466666666666554 45999999994 35678999999 Q ss_pred HHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_012740. 
302 LANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAG 381 (528) Q Consensus 302 LStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~g 381 (528) |...|..-+|..|+.-.. . +.+.|+.+. +....++... ....+-.. T Consensus 216 l~~~i~~~~d~~~~~g~g---~----------~~~~~~~~~-------------d~i~~~~~~~--------l~~~~~~~ 261 (392) T protein:vir:10 216 LGKKSKVTRNVLILGVIE---K----------LTKQAIKSL-------------DDIKDVLNVK--------LDPAISPN 261 (392) T ss_pred HHHHHHHHHHHHHhhccc---c----------ccccCccCH-------------HHHHHHHHHh--------hhhhhccC Confidence 999999999988873221 1 112222222 1122222111 11222234 Q ss_pred cEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccc Q lcl|NC_012740. 382 NFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPY 461 (528) Q Consensus 382 n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPY 461 (528) -..|+||.....|.... ...| ..-...+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+ T Consensus 262 a~~vm~~~~~~~L~~lk-----d~~G-~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 262 AILLTNQDGFNYLDKLK-----DKDG-KYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDL 329 (392) T ss_pred CEEEEcHHHHHHHHHhh-----ccCC-CeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEeh Confidence 55789999999997631 1111 0011122221 1235677766676544321 111222222233344332 Q ss_pred cc-------cceeEEecCc------cccceeeeeeeeceee-cCcccccCCCccceecccchHHhhcc Q lcl|NC_012740. 462 VA-------LTPLRATDPQ------SFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVG 515 (528) Q Consensus 462 v~-------~~~~~~~Dp~------s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~ 515 (528) -. ..+...+++. +.+=.+-...|++..+ +|=+...-. 
+.-..+..--+| T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 11 1111223332 3344566667777542 331111100 000111111223 No 100 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=31.23 E-value=1.5 Score=19.60 Aligned_cols=325 Identities=14% Similarity=0.090 Sum_probs=126.9 Q ss_pred CcchHHHHHhhhhhhcC-----Cccc----------hhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN-----EKLP----------EIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEA 65 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~-----~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 65 (528) +-..+.|.++....-+- +..+ +-...+|+. ..+.|.+++..-.+ ...+. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~---------~~~~~------ 98 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE---------REFLE------ 98 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH---------HHHHh------ Confidence 33333444443321100 0000 011111111 11111111100000 00000 Q ss_pred cccccCCccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccc Q lcl|NC_012740. 66 EVAGDHGYNASNIASGQT-TGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKE 141 (528) Q Consensus 66 ~~~~~~g~~~~~~~e~t~-tg~v~---~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~E 141 (528) .+ ........+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+.-.+ ..+. 
T Consensus 99 ----~~-~~~~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~~-------- 160 (392) T protein:vir:10 99 ----DD-LEQRAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRVLEK--NSDM-------- 160 (392) T ss_pred ----hh-hhhhhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC-------- Confidence 00 00011111121 12111 12233 3444445666778999999998865422111 1000 Q ss_pred cccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccc Q lcl|NC_012740. 142 AFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVM 221 (528) Q Consensus 142 A~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (528) +.+.| T Consensus 161 -------~~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 161 -------IPFAE-------------------------------------------------------------------- 165 (392) T ss_pred -------cccee-------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_012740. 222 KLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAI 301 (528) Q Consensus 222 ~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanI 301 (528) ++.+ ++. ..+....|.++.+...|+.+- ..+|-||.+|- ..|.+++|.+- T Consensus 166 ----------v~E~-----~~~----~~~~~~~~~~v~l~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~ 215 (392) T protein:vir:10 166 ----------ITEM-----GEI----PETDNPKFSNVQYAVKDRAGI-------LPLSRSLLQDS----DQNILKYVTKW 215 (392) T ss_pred ----------eccc-----ccc----cccccccceeEEeeeeeEEEe-------ehhhHHHHhhh----HHHHHHHHHHH Confidence 0000 000 001123466666666666554 45999999994 35678999999 Q ss_pred HHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_012740. 
302 LANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAG 381 (528) Q Consensus 302 LStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~g 381 (528) |...|..-+|..|+.-.. . +.+.|+.+. +....++... ....+-.. T Consensus 216 l~~~i~~~~d~~~~~g~g---~----------~~~~~~~~~-------------d~i~~~~~~~--------l~~~~~~~ 261 (392) T protein:vir:10 216 LGKKSKVTRNVLILGVIE---K----------LTKQAIKSL-------------DDIKDVLNVK--------LDPAISPN 261 (392) T ss_pred HHHHHHHHHHHHHhhccc---c----------ccccCccCH-------------HHHHHHHHHh--------hhhhhccC Confidence 999999999988873221 1 112222222 1122222111 11222234 Q ss_pred cEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccc Q lcl|NC_012740. 382 NFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPY 461 (528) Q Consensus 382 n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPY 461 (528) -..|+||.....|.... ...| ..-...+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+ T Consensus 262 a~~vm~~~~~~~L~~lk-----d~~G-~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 262 AILLTNQDGFNYLDKLK-----DKDG-KYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDL 329 (392) T ss_pred CEEEEcHHHHHHHHHhh-----ccCC-CeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEeh Confidence 55789999999997631 1111 0011122221 1235677766676544321 111222222233344332 Q ss_pred cc-------cceeEEecCc------cccceeeeeeeeceee-cCcccccCCCccceecccchHHhhcc Q lcl|NC_012740. 462 VA-------LTPLRATDPQ------SFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVG 515 (528) Q Consensus 462 v~-------~~~~~~~Dp~------s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~ 515 (528) -. ..+...+++. +.+=.+-...|++..+ +|=+...-. 
+.-..+..--+| T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 11 1111223332 3344566667777542 331111100 000111111223 No 101 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=31.23 E-value=1.5 Score=19.60 Aligned_cols=325 Identities=14% Similarity=0.090 Sum_probs=126.9 Q ss_pred CcchHHHHHhhhhhhcC-----Cccc----------hhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN-----EKLP----------EIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEA 65 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~-----~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 65 (528) +-..+.|.++....-+- +..+ +-...+|+. ..+.|.+++..-.+ ...+. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~---------~~~~~------ 98 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE---------REFLE------ 98 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH---------HHHHh------ Confidence 33333444443321100 0000 011111111 11111111100000 00000 Q ss_pred cccccCCccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccc Q lcl|NC_012740. 66 EVAGDHGYNASNIASGQT-TGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKE 141 (528) Q Consensus 66 ~~~~~~g~~~~~~~e~t~-tg~v~---~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~E 141 (528) .+ ........+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+.-.+ ..+. 
T Consensus 99 ----~~-~~~~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~~-------- 160 (392) T protein:vir:10 99 ----DD-LEQRAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRVLEK--NSDM-------- 160 (392) T ss_pred ----hh-hhhhhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC-------- Confidence 00 00011111121 12111 12233 3444445666778999999998865422111 1000 Q ss_pred cccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccc Q lcl|NC_012740. 142 AFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVM 221 (528) Q Consensus 142 A~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (528) +.+.| T Consensus 161 -------~~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 161 -------IPFAE-------------------------------------------------------------------- 165 (392) T ss_pred -------cccee-------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_012740. 222 KLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAI 301 (528) Q Consensus 222 ~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanI 301 (528) ++.+ ++. ..+....|.++.+...|+.+- ..+|-||.+|- ..|.+++|.+- T Consensus 166 ----------v~E~-----~~~----~~~~~~~~~~v~l~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~ 215 (392) T protein:vir:10 166 ----------ITEM-----GEI----PETDNPKFSNVQYAVKDRAGI-------LPLSRSLLQDS----DQNILKYVTKW 215 (392) T ss_pred ----------eccc-----ccc----cccccccceeEEeeeeeEEEe-------ehhhHHHHhhh----HHHHHHHHHHH Confidence 0000 000 001123466666666666554 45999999994 35678999999 Q ss_pred HHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_012740. 
302 LANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAG 381 (528) Q Consensus 302 LStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~g 381 (528) |...|..-+|..|+.-.. . +.+.|+.+. +....++... ....+-.. T Consensus 216 l~~~i~~~~d~~~~~g~g---~----------~~~~~~~~~-------------d~i~~~~~~~--------l~~~~~~~ 261 (392) T protein:vir:10 216 LGKKSKVTRNVLILGVIE---K----------LTKQAIKSL-------------DDIKDVLNVK--------LDPAISPN 261 (392) T ss_pred HHHHHHHHHHHHHhhccc---c----------ccccCccCH-------------HHHHHHHHHh--------hhhhhccC Confidence 999999999988873221 1 112222222 1122222111 11222234 Q ss_pred cEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccc Q lcl|NC_012740. 382 NFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPY 461 (528) Q Consensus 382 n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPY 461 (528) -..|+||.....|.... ...| ..-...+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+ T Consensus 262 a~~vm~~~~~~~L~~lk-----d~~G-~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 262 AILLTNQDGFNYLDKLK-----DKDG-KYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDL 329 (392) T ss_pred CEEEEcHHHHHHHHHhh-----ccCC-CeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEeh Confidence 55789999999997631 1111 0011122221 1235677766676544321 111222222233344332 Q ss_pred cc-------cceeEEecCc------cccceeeeeeeeceee-cCcccccCCCccceecccchHHhhcc Q lcl|NC_012740. 462 VA-------LTPLRATDPQ------SFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVG 515 (528) Q Consensus 462 v~-------~~~~~~~Dp~------s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~ 515 (528) -. ..+...+++. +.+=.+-...|++..+ +|=+...-. 
+.-..+..--+| T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 11 1111223332 3344566667777542 331111100 000111111223 No 102 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=31.23 E-value=1.5 Score=19.60 Aligned_cols=325 Identities=14% Similarity=0.090 Sum_probs=126.9 Q ss_pred CcchHHHHHhhhhhhcC-----Cccc----------hhccchhhhhhhhhhhhHHHHhhhccccchhhhhhhhccccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN-----EKLP----------EIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEA 65 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~-----~~~~----------~~~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~l~ea 65 (528) +-..+.|.++....-+- +..+ +-...+|+. ..+.|.+++..-.+ ...+. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~---------~~~~~------ 98 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAEE---------REFLE------ 98 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHHH---------HHHHh------ Confidence 33333444443321100 0000 011111111 11111111100000 00000 Q ss_pred cccccCCccccccccccc-ccccc---ccCcchhhHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccc Q lcl|NC_012740. 66 EVAGDHGYNASNIASGQT-TGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKE 141 (528) Q Consensus 66 ~~~~~~g~~~~~~~e~t~-tg~v~---~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~E 141 (528) .+ ........+++ .|.+. .+.+. +++.+..+..-.+++.+.||++++|-+.-.+ ..+. 
T Consensus 99 ----~~-~~~~~~~~~t~~~gg~~vP~~~~~~---ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~~-------- 160 (392) T protein:vir:10 99 ----DD-LEQRAMSGLTGEDGGLVIPQDIQTQ---INELARSFDALEQYVTVEPVRTRSGSRVLEK--NSDM-------- 160 (392) T ss_pred ----hh-hhhhhccccccCCCceecchhHHHH---HHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC-------- Confidence 00 00011111121 12111 12233 3444445666778999999998865422111 1000 Q ss_pred cccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccc Q lcl|NC_012740. 142 AFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVM 221 (528) Q Consensus 142 A~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (528) +.+.| T Consensus 161 -------~~a~~-------------------------------------------------------------------- 165 (392) T protein:vir:10 161 -------IPFAE-------------------------------------------------------------------- 165 (392) T ss_pred -------cccee-------------------------------------------------------------------- Confidence 00000 Q ss_pred cccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHH Q lcl|NC_012740. 222 KLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAI 301 (528) Q Consensus 222 ~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanI 301 (528) ++.+ ++. ..+....|.++.+...|+.+- ..+|-||.+|- ..|.+++|.+- T Consensus 166 ----------v~E~-----~~~----~~~~~~~~~~v~l~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~ 215 (392) T protein:vir:10 166 ----------ITEM-----GEI----PETDNPKFSNVQYAVKDRAGI-------LPLSRSLLQDS----DQNILKYVTKW 215 (392) T ss_pred ----------eccc-----ccc----cccccccceeEEeeeeeEEEe-------ehhhHHHHhhh----HHHHHHHHHHH Confidence 0000 000 001123466666666666554 45999999994 35678999999 Q ss_pred HHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_012740. 
302 LANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAG 381 (528) Q Consensus 302 LStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~g 381 (528) |...|..-+|..|+.-.. . +.+.|+.+. +....++... ....+-.. T Consensus 216 l~~~i~~~~d~~~~~g~g---~----------~~~~~~~~~-------------d~i~~~~~~~--------l~~~~~~~ 261 (392) T protein:vir:10 216 LGKKSKVTRNVLILGVIE---K----------LTKQAIKSL-------------DDIKDVLNVK--------LDPAISPN 261 (392) T ss_pred HHHHHHHHHHHHHhhccc---c----------ccccCccCH-------------HHHHHHHHHh--------hhhhhccC Confidence 999999999988873221 1 112222222 1122222111 11222234 Q ss_pred cEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccc Q lcl|NC_012740. 382 NFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPY 461 (528) Q Consensus 382 n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPY 461 (528) -..|+||.....|.... ...| ..-...+.+. -..++|.|...|+++.... ++.+|...-+..++|+.+ T Consensus 262 a~~vm~~~~~~~L~~lk-----d~~G-~~l~~~~~~~-~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 262 AILLTNQDGFNYLDKLK-----DKDG-KYILQSDPTQ-KNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDL 329 (392) T ss_pred CEEEEcHHHHHHHHHhh-----ccCC-CeEeecCccC-CccccccCcccEEEecccc-----cCCCcccCCceEEEEEeh Confidence 55789999999997631 1111 0011122221 1235677766676544321 111222222233344332 Q ss_pred cc-------cceeEEecCc------cccceeeeeeeeceee-cCcccccCCCccceecccchHHhhcc Q lcl|NC_012740. 462 VA-------LTPLRATDPQ------SFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVG 515 (528) Q Consensus 462 v~-------~~~~~~~Dp~------s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~ 515 (528) -. ..+...+++. +.+=.+-...|++..+ +|=+...-. 
+.-..+..--+| T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~a~~~~~~~ 392 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-----IDLSAPVEQPQG 392 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE-----ecccccccCCCC Confidence 11 1111223332 3344566667777542 331111100 000111111223 No 103 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=30.95 E-value=1.5 Score=19.56 Aligned_cols=345 Identities=14% Similarity=0.058 Sum_probs=110.1 Q ss_pred Ccc--------hHHHHHhhhhhhcCCccchhccchhhhh--hhhhhhhHHHHhhh------ccccchhh---hhhhhccc Q lcl|NC_012740. 1 MKT--------TKELMEKWSPLLENEKLPEIATASKQKL--VAKILESQEADFAV------DPIYKDEK---VVEAFGGF 61 (528) Q Consensus 1 ~~~--------~~~l~~kw~p~l~~~~~~~~~~~~~~~~--~~~~~enq~~~~~~------~~~~~~~~---~~~~~~~~ 61 (528) |.. ..++.++...-++ .+..-.+..+..+ ...-++.+++.+.. .+.++... +....... T Consensus 159 ~k~~~e~~~~e~~e~~~~~~~~~e--~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 236 (543) T protein:vir:81 159 MRTFGRDAEEVKGELRARALSAIE--KMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAI 236 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHH Confidence 000 0011111111110 0000000000000 00011111111110 00011000 00001111 Q ss_pred cccccccccCCccccccccccccccccccCcchhhHHHHHH-hhhhhhhceeeecCCcccceeeeeeeeecCCCCCCccc Q lcl|NC_012740. 62 IAEAEVAGDHGYNASNIASGQTTGAITNVGPAVIGMVRRAI-PNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAK 140 (528) Q Consensus 62 l~ea~~~~~~g~~~~~~~e~t~tg~v~~~~P~li~l~Rra~-~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~ 140 (528) +...+..-- .+.....-++++|.+.--....-.++.+.. +.-+...++-|.|++|.. +++-.. 
++ T Consensus 237 l~~~e~~~~--~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~--------~~~~~~---~~- 302 (543) T protein:vir:81 237 LTEEEKRAI--NEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDV--------WHGVSS---AA- 302 (543) T ss_pred hhhhhhhhh--hhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcce--------EEEEec---CC- Confidence 111100000 000000011112221100011111222121 112233444444443321 000000 00 Q ss_pred ccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCcccc Q lcl|NC_012740. 141 EAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVV 220 (528) Q Consensus 141 EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (528) +.+ T Consensus 303 --------~~a--------------------------------------------------------------------- 305 (543) T protein:vir:81 303 --------VQW--------------------------------------------------------------------- 305 (543) T ss_pred --------cce--------------------------------------------------------------------- Confidence 000 Q ss_pred ccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHH Q lcl|NC_012740. 221 MKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNA 300 (528) Q Consensus 221 ~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELan 300 (528) +- .+| +...++-..+++.+++++|.-+=...+|-||.+|- .|.++.|.+ T Consensus 306 ---------~~--------v~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-----~~~~~~i~~ 354 (543) T protein:vir:81 306 ---------SW--------DAE---------FEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE-----ANVTETVAL 354 (543) T ss_pred ---------ee--------ccc---------CccccccccccceeeeeeeeeEeeehhhHHHHhcc-----HHHHHHHHH Confidence 00 001 01112223344555555555555677999999874 277999999 Q ss_pred HHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccc-----cccccchhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012740. 
301 ILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDP-----IDTRGARWAGESFKSLIYQIDKEAAEIARQ 375 (528) Q Consensus 301 ILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~-----~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~ 375 (528) -|...|...+|+-||. -+.. ...+.|++..... ......-...+-+..|+..+. T Consensus 355 ~l~~~~~~~~d~ail~---G~Gt---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--------- 413 (543) T protein:vir:81 355 LFAEGKDELEAVTLTT---GTGQ---------GNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLA--------- 413 (543) T ss_pred HHHHHHHHHHHHHHhc---cCCC---------CcccccchhhcccccccccccccccccHHHHHHHHHhhh--------- Confidence 9999999999998882 1100 0022333221100 000000011222233333322 Q ss_pred hccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccce Q lcl|NC_012740. 376 TGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAG 455 (528) Q Consensus 376 T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g 455 (528) ..+.....+|++|.+...|.... ...|. .-......+. -++|.| ++||+..+.+..-...+=.|. .- T Consensus 414 ~~~~~~~~~v~n~~~~~~l~~lk-----d~~G~-~l~~~~~~g~--~~~l~G-~pv~~~~~~~~~~~~~~~~~~----~~ 480 (543) T protein:vir:81 414 ARHRRQGAWLANNLIYNKIRQFD-----TQGGA-GLWTTIGNGE--PSQLLG-RPVGEAEAMDANWNTSASADN----FV 480 (543) T ss_pred ccccCCcEEEEcHHHHHHHHHhh-----cCCCc-eeccCcCCCC--Cccccc-eeeEEeccccccccccccCCc----ce Confidence 23333346789999998887531 11110 0001111111 246765 899988875432211000000 01 Q ss_pred eEec---cccc---cceeEEecCccc--------cceeeeeeeecee-ecCcccccCCCccceecccchHHhhc Q lcl|NC_012740. 456 IYYA---PYVA---LTPLRATDPQSF--------HPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSV 514 (528) Q Consensus 456 ~fya---PYv~---~~~~~~~Dp~s~--------qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a 514 (528) ++|. -|+. ..+...+||..+ +=.+-+..|+|.. .||=+...-. +. 
..| T Consensus 481 i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~-----~~------~~a 543 (543) T protein:vir:81 481 LLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLN-----VE------TAS 543 (543) T ss_pred EEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEE-----ec------ccC Confidence 1111 1111 112233444332 3344445567664 3441111100 00 001 No 104 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=27.69 E-value=1.8 Score=19.16 Aligned_cols=271 Identities=11% Similarity=0.067 Sum_probs=113.0 Q ss_pred cccccccccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccc Q lcl|NC_012740. 146 MYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLME 225 (528) Q Consensus 146 ~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (528) |....|. -... -.|-.-..+....+... ..+.+....... .. .. T Consensus 1 Ma~~~T~------------------l~d~--i~Pev~~~~v~~~~~~~-------~~~~~~~~~~~~-------l~--g~ 44 (276) T protein:vir:10 1 MAQGTTT------------------KSTQ--IVPEVLAPMMQAELDKK-------LRFAQFADIDST-------LV--GQ 44 (276) T ss_pred CCcceee------------------hhhh--hchHHHHHHHHHHHHhh-------hhhcccceeccc-------cc--CC Confidence 0000000 0000 00111111111000000 000000000000 00 00 Q ss_pred cccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhc-CCChHHHHHHHHHH Q lcl|NC_012740. 226 EGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH-GMDADAELNAILAN 304 (528) Q Consensus 226 ~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiH-GLDAE~ELanILSt 304 (528) .|...+...=-....+|.. .....-+..++ +..+.+++.|-|.-.=++| |+-+.. +.|.-.|..+-++. 
T Consensus 45 ~G~ti~iP~~~~igda~~~---~eg~~i~~~~l--t~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~ 114 (276) T protein:vir:10 45 PGDTLTFPAFVYSGDATVV---PEGQKIPVDKI--ETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGL 114 (276) T ss_pred CCCEEEeeeecCCCccccc---cCCCccCcccc--ccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHH Confidence 1111111110011222321 11122233333 3445555555554333333 333333 68999999999999 Q ss_pred HHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEE Q lcl|NC_012740. 305 EVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFV 384 (528) Q Consensus 305 EImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~ 384 (528) -|...++.+++..+..... ++ .++.++ .+.+-....++.++ -...+++ T Consensus 115 ~~a~~~d~~~~~~l~~~~~--------~~--~~~~~t-------------~d~i~~A~~~lgd~---------~~~~~~i 162 (276) T protein:vir:10 115 AIANKVDNDVLEALRGTKL--------TV--SADIGT-------------LAGLEAAIDTFDDE---------DLEPMVL 162 (276) T ss_pred HHHHHHHHHHHHHHhcccc--------cc--cccccC-------------HHHHHHHHHHhccc---------cCcccEE Confidence 9999999999986543221 11 011111 12222222222222 1257899 Q ss_pred EEchhHHHHhhc---cccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccc Q lcl|NC_012740. 385 IASRNVVNILAS---ADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPY 461 (528) Q Consensus 385 v~S~~va~~L~~---~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPY 461 (528) ||+|.+++.|.. ..+...+ .. ..+...+-.+|++.| ++|++|...|..-..+--+|.-.+ +.. T Consensus 163 vv~p~~~~~L~k~~~~~f~~~s-~~------g~~~~~~G~ig~~~G-~~Vi~s~~~p~~t~~l~~~gAi~~----~~~-- 228 (276) T protein:vir:10 163 FINPKDAGKLRSSASDNFTRAT-EL------GDNIIVKGAFGEALG-AVIVRSKKLDEGEAILAKRGAVKL----ITK-- 228 (276) T ss_pred EEcHHHHHHHHHhccccccccc-cc------cccceeccccceecc-eeEEEcCCCCcceEEEEeccceee----eec-- Confidence 999999999854 3332211 11 111122234788865 999999998754332222232221 111 Q ss_pred cccceeEEecCccccceeeeeeeecee-ecCcccccCCCccceecccchHHhhcch Q lcl|NC_012740. 
462 VALTPLRATDPQSFHPVLGFKTRYGIG-INPFADSKSQAPSARITSGMLSKDSVGK 516 (528) Q Consensus 462 v~~~~~~~~Dp~s~qP~~~~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~~~a~~ 516 (528) -+...-.--|++.++=.|--..+||+. .|| ..-.++..++ |..-+|. T Consensus 229 ~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~-------~~vv~~t~~~-~~~~~~~ 276 (276) T protein:vir:10 229 RDFFLETDRDPSTKTTALYSDKHYVAYLYDE-------SKAVKVTKGA-GTTDSGA 276 (276) T ss_pred CCceeecccchhhcccEEEEeeEEEEEEEcC-------cceEEEecCC-cCCcCCC Confidence 112222345888888888888888875 234 0112222222 2222222 No 105 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=26.77 E-value=1.9 Score=19.04 Aligned_cols=291 Identities=15% Similarity=0.056 Sum_probs=108.1 Q ss_pred ccccc-ccccccccCcchh-hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccccccc Q lcl|NC_012740. 78 IASGQ-TTGAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSS 155 (528) Q Consensus 78 ~~e~t-~tg~v~~~~P~li-~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG 155 (528) .++.+ ++|... .-+.+. .+++++-.+.+...++-|.||.+.. +-|-.. .. + +.+.| T Consensus 1 Ma~~~~~~gg~~-vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~-~~ip~~---~~------~---------~~a~w-- 58 (315) T protein:vir:80 1 MADDFLSAGKLE-LPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVF---SG------V---------PRAKI-- 58 (315) T ss_pred CCCCcCCcCceE-cchHHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEE---eC------C---------cceEE-- Confidence 22222 233332 222222 4566666777788888888887542 111110 00 0 00000 Q ss_pred cccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccc Q lcl|NC_012740. 
156 LAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFG 235 (528) Q Consensus 156 ~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~G 235 (528) ++.| T Consensus 59 ----------------------------------------------------------------------------v~Eg 62 (315) T protein:vir:80 59 ----------------------------------------------------------------------------VGEG 62 (315) T ss_pred ----------------------------------------------------------------------------eeCC Confidence 0000 Q ss_pred cchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_012740. 236 MATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIV 315 (528) Q Consensus 236 msTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii 315 (528) |.. ......|.++.+...|..+- ...|-||.+|- ..|+..+|.++|..++...|.|.+= T Consensus 63 ------~~~----~~s~~~f~~v~l~~~kl~~~-------~~iS~ell~~s----~~~~~~~l~~~i~~~la~ai~~~~d 121 (315) T protein:vir:80 63 ------EVK----PSASVDVSAFTAQPIKVVTQ-------QRVSDEFMWAD----ADYRLGVLQDLISPALGASIGRAVD 121 (315) T ss_pred ------ccc----cccccceeeeEeeeeeEEee-------ehhhHHHhhcC----chhHHHHHHHHHHHHHHHHHHHHHh Confidence 000 01122355555555555443 46899998874 3566667777777777776666654 Q ss_pred hh-hhhheeecccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHh Q lcl|NC_012740. 316 DV-INFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNIL 394 (528) Q Consensus 316 ~~-i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L 394 (528) +. ++-+--. ++. ...|+.+..... 
.+ .++..-..+.-+.++-..+.....+ ..+-.|++|+....| T Consensus 122 ~a~~~G~~~~--~~~-----~~~~~~~~~~~~-~~----~~~~~~~~~~d~~~~~~~~~~~~~~-~~~~~imn~~~~~~L 188 (315) T protein:vir:80 122 LIAFHGIDPA--TGK-----AASAVHTSLNKT-KN----IVDATDSATADLVKAVGLIAGAGLQ-VPNGVALDPAFSFAL 188 (315) T ss_pred hheeeccCCC--CCc-----cccccccccccc-cc----eeeccccchHHHHHHHHHHhhccCc-cceEEEEcHHHHHHH Confidence 43 3211000 000 111111110000 00 0000011112233333333322222 345688999999988 Q ss_pred hccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc---------eEEEEEecCCCccceeEeccccccc Q lcl|NC_012740. 395 ASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD---------YFTVGYKGDNEMDAGIYYAPYVALT 465 (528) Q Consensus 395 ~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG~KG~~~~d~g~fyaPYv~~~ 465 (528) ...--...++..+..- ......+. .++|.| ++|+++.+.+.+ .++.| +-. .++|...-... T Consensus 189 ~~l~~~~g~~~~g~~~-~~~~~~g~--~~tl~G-~PV~~~~~~~~~~~~~~~~~~~~~~G---Dfs---~~~~g~~~~~~ 258 (315) T protein:vir:80 189 STEVYPKGSPLAGQPM-YPAAGFAG--LDNWRG-LNVGASSTVSGAPEMSPASGVKAIVG---DFS---RVHWGFQRNFP 258 (315) T ss_pred HHHhhccCCccccccc-ccccccCC--Cceecc-eeeEecCcCCcccccccccccEEEEe---ecc---cEEEEEecCee Confidence 7542111111111000 00111111 257866 999998886432 12222 100 01122111111 Q ss_pred eeEEe--cCc----c-ccc-eeeee--eeecee-ecCcccccCCCccceecccc-hHHhhcchh Q lcl|NC_012740. 466 PLRAT--DPQ----S-FHP-VLGFK--TRYGIG-INPFADSKSQAPSARITSGM-LSKDSVGKN 517 (528) Q Consensus 466 ~~~~~--Dp~----s-~qP-~~~~~--tRY~l~-~nP~~~~~~~~~~~~~~~~~-~~~~~a~~~ 517 (528) +.+.- |++ + ||. .++|. .|+|.. .+|=+ .+++.+.. 
+-..-.+.| T Consensus 259 i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a-------~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 259 IELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDS-------FAVVKEKAAPKPNPPAEN 315 (315) T ss_pred EEEeccccccCcccchhhcCcEEEEEEEEecceeecccc-------eEEEeeccCCCCCCCCCC Confidence 11110 111 0 111 12222 344432 33300 00000000 001111112 No 106 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=25.56 E-value=2 Score=18.88 Aligned_cols=329 Identities=11% Similarity=0.122 Sum_probs=111.3 Q ss_pred CcchHHHHHhhhh--hhcCCccchhccch-hhhhhhhhhh---hHHHHhhh-ccccchhhhhhhhccccccccccccCCc Q lcl|NC_012740. 1 MKTTKELMEKWSP--LLENEKLPEIATAS-KQKLVAKILE---SQEADFAV-DPIYKDEKVVEAFGGFIAEAEVAGDHGY 73 (528) Q Consensus 1 ~~~~~~l~~kw~p--~l~~~~~~~~~~~~-~~~~~~~~~e---nq~~~~~~-~~~~~~~~~~~~~~~~l~ea~~~~~~g~ 73 (528) -...+.-++++.+ .+...+.++-+..- .|.+.+ |.. |..+.+.. .+.|.++. T Consensus 3 ~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a-~a~~~g~~~~a~~~a~~~~~~~~-------------------- 61 (366) T protein:vir:57 3 AAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMS-IAAGKGNLADAAKFAATELGDTG-------------------- 61 (366) T ss_pred ccccccccccccccccccccccccccchhHHHHHHH-HHhcccchhHHHHHHHHhhcchh-------------------- Confidence 1111111112111 00001111100000 011111 110 00000000 00000100 Q ss_pred cccccccccccccccccCcchh--hHHHHHHhhhhhhhceeeecCCcccceeeeeeeeecCCCCCCcccccccccccccc Q lcl|NC_012740. 74 NASNIASGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLAEHAKEAFHPMYSPNA 151 (528) Q Consensus 74 ~~~~~~e~t~tg~v~~~~P~li--~l~Rra~~~lI~~DI~GVQPmTgPTGLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt 151 (528) ....+..++++|.+. =|.-+ .+++++-+..+...+ |++.+.+++|-+- |+-.. ++ +. 
T Consensus 62 ~~~a~~~~~~~Gg~l--vP~~~~~~ii~~l~~~s~l~~l-g~~~v~~~~g~~~-----~p~~t---~~---------~~- 120 (366) T protein:vir:57 62 LSMAISTAAGSGGAL--IPQNMQNEVIELLRDRTVVRIL-GARSIPLPNGNLS-----MPRLS---GG---------AT- 120 (366) T ss_pred hhhhccccccCCccc--cchhHHHHHHHHHhhhcchhhh-ceeeeecCCCceE-----EEEEe---CC---------cc- Confidence 011112222222221 02211 122222223333222 2222222222110 10000 00 00 Q ss_pred cccccccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccc Q lcl|NC_012740. 152 FHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAE 231 (528) Q Consensus 152 ~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 231 (528) .+- T Consensus 121 -----------------------------------------------------------------------------a~w 123 (366) T protein:vir:57 121 -----------------------------------------------------------------------------AGY 123 (366) T ss_pred -----------------------------------------------------------------------------eee Confidence 000 Q ss_pred cccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhh Q lcl|NC_012740. 232 IAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEIN 311 (528) Q Consensus 232 ~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEIN 311 (528) + +| +...++...+++++++..|.-+-...+|-||.+|-. .|.|+.|.+-|...|...++ T Consensus 124 v--------~E---------~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~~~~~i~~~l~~a~~~~~d 182 (366) T protein:vir:57 124 V--------GE---------GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG----FNVEQLLLGDILSAIATRED 182 (366) T ss_pred e--------cc---------CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHH Confidence 0 11 011233333445555555555555679999998753 56789999999999999999 Q ss_pred HHHHhhhhhheeecccceeeccccccceeccccccc----cccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEc Q lcl|NC_012740. 
312 REIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPID----TRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIAS 387 (528) Q Consensus 312 Reii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d----~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S 387 (528) +.||.=--... .+.|++....... ..+. +. .+..+-..++.+.........+......|++ T Consensus 183 ~a~l~G~G~~~------------~p~Gi~~~~~~~~~~~~~~~t--~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn 247 (366) T protein:vir:57 183 KAFLRDDGTGD------------TPKGMKAVATAANRLVAWTGT--AI-NLTTIDEYLDSLILKHMDSNSNMIRCGWGLS 247 (366) T ss_pred HHhhccCCCCc------------cccceeeccccccceeecccc--cc-chhhHHHHHHHHHHhhhccccccccCEEEec Confidence 98884100000 1223322111000 0000 00 0011111112222222223333345667899 Q ss_pred hhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEecCCCccceeEeccccc---- Q lcl|NC_012740. 388 RNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVA---- 463 (528) Q Consensus 388 ~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~d~g~fyaPYv~---- 463 (528) +.....|...- ...|. . +..+.+ -|+|.| ++|+++.+.|.+- |...-..-++|+-+-. T Consensus 248 ~~~~~~L~~lk-----d~~G~-~-l~~~~~----~g~l~G-~Pvv~s~~ip~~~------~~~~~~~~i~~gdfs~~~i~ 309 (366) T protein:vir:57 248 NRTYMTLFGLR-----DGNGN-K-VYPEMS----QGILKG-YPIQRTSAIPANL------GDDGNESEIYFCDFNDVVIG 309 (366) T ss_pred HHHHHHHHhhh-----ccCCc-e-eccCCC----CCeecc-eeeEEcccccccc------ccCCCccEEEEEecceEEEE Confidence 99988887531 11110 0 111222 256765 9999988765431 1110011122222211 Q ss_pred ----cceeEEec-----Cc--------cccceeeeeeeeceee-cCcccccCCCccceecccchH Q lcl|NC_012740. 464 ----LTPLRATD-----PQ--------SFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLS 510 (528) Q Consensus 464 ----~~~~~~~D-----p~--------s~qP~~~~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~ 510 (528) ......-| +. 
.-+=.+=...||++.+ +| +..-+..|-.| T Consensus 310 ~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~--------~a~~~lt~~~~ 366 (366) T protein:vir:57 310 EDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHP--------EGLVLGTGVIW 366 (366) T ss_pred EecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeecc--------ccEEEEecccC Confidence 00111111 10 1112333445566552 12 12334445555 No 107 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=24.39 E-value=2.2 Score=18.72 Aligned_cols=203 Identities=16% Similarity=0.199 Sum_probs=104.3 Q ss_pred EeeEEEEEecccccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceec Q lcl|NC_012740. 262 IDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFD 341 (528) Q Consensus 262 IEK~TVTAKSRALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~d 341 (528) ||=. |=|..=++-.-+.++ | +|-..|.+.=...+++.++++-|++.+...|+-... ++...+.....+. T Consensus 1 iD~l--------L~a~~~VdDiD~aqa-~-~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p-~~~~~~g~~~~~~ 69 (221) T protein:vir:17 1 MDDL--------LVASQFVYDLDEILA-Q-WNTRSEISKQIGEALAIHYDERIARVLASASIAAAP-VTGQDGGFSVNIG 69 (221) T ss_pred CCcc--------hhHHHHHHhHHHHHh-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCc-ccccccCcceecc Confidence 3322 223333444444445 4 889999999999999999999999877655553221 1111111111111 Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchh-HHHHhhcccccc-ccccccccccccccccCc Q lcl|NC_012740. 342 LQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRN-VVNILASADQGI-SLAMQGAAQGLNTDTTKA 419 (528) Q Consensus 342 l~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~-va~~L~~~g~~~-~~~~~~~~~~~~~d~~~~ 419 (528) -...++ + ..||..|-+.+...-.+-=--.|-|+|++|+ ...+|+.-.-.. .....+. ..+..+. 
T Consensus 70 a~~t~~-------~---~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s----~g~~~~g 135 (221) T protein:vir:17 70 AGNTNN-------A---QAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNT----QGDMNTG 135 (221) T ss_pred ccccCC-------H---HHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccc----ccccccc Confidence 011111 1 2333333333333333333336889999995 777776422111 1111111 1111112 Q ss_pred eEEEEecCceEEEeeCCCCc----ceEE------------EEEecCCCccceeEeccccccceeEEecCccccceeeeee Q lcl|NC_012740. 420 VFAGVLAGKYKVFIDQYARQ----DYFT------------VGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKT 483 (528) Q Consensus 420 ~~~G~l~~~~~vy~D~y~~~----dy~~------------vG~KG~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t 483 (528) ..+|.++| ++||.=++.|. +|.. =.|.|+-.-..||||.|=.-++ ++.+.|-|--|.+.- T Consensus 136 ~~i~~v~G-~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgt-vkl~~~~~~~~~~~~-- 211 (221) T protein:vir:17 136 KGLYVNAG-IRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADT-VEVLLPPSRPPLVIS-- 211 (221) T ss_pred ceeeeecC-cEEEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchheee-eeeecCCCCCceeee-- Confidence 24788875 99999999875 3321 1344555556899999875444 456777776665432 Q ss_pred eeceeecCcccccCCCccce Q lcl|NC_012740. 484 RYGIGINPFADSKSQAPSAR 503 (528) Q Consensus 484 RY~l~~nP~~~~~~~~~~~~ 503 (528) -|.-+. |..| T Consensus 212 -------~~~~~~---~~~~ 221 (221) T protein:vir:17 212 -------MFSIRR---PDRR 221 (221) T ss_pred -------eeeccC---CCCC Confidence 233222 2222 No 108 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=24.20 E-value=2.2 Score=18.70 Aligned_cols=267 Identities=11% Similarity=0.029 Sum_probs=116.5 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccCcccccCccccccccccccccccccccchhhhhhhc Q lcl|NC_012740. 
166 PTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQNVTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQ 245 (528) Q Consensus 166 ~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~ 245 (528) ..-|...+.- .|-.-.++........ ..+.+...... ... ...|...+...=-.+..+|.+ T Consensus 1 Ma~T~~~d~I--~Pev~~~~V~e~~~~~-------~~~~~~~~~d~-------~L~--g~~G~ti~~P~~~~igdae~~- 61 (270) T protein:vir:95 1 MTQTKKANLI--NPEVLANVVSAQMQNA-------IRFTPYAVTDD-------TLV--GQPGDTITRPKYAYIGAAEDL- 61 (270) T ss_pred CCceehhhhc--chHHHHHHHHHHHHhH-------Hhhcccccccc-------ccC--CCCCCEEEeeeecCCCccccc- Confidence 0001111000 0111111111100000 00000000000 000 001111111110012233422 Q ss_pred ccCCCCCcccccceeEEeeEEEEEecccccccchHHHHHHHHhhc-CCChHHHHHHHHHHHHHHHhhHHHHhhhhhheee Q lcl|NC_012740. 246 GFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH-GMDADAELNAILANEVLLEINREIVDVINFTAQV 324 (528) Q Consensus 246 ~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ELAQDLkAiH-GLDAE~ELanILStEImlEINReii~~i~~~a~~ 324 (528) .....-+..+|+ ..+.+++.|-|+-.=++| ||.+.- |-|.-.|..+-++.-|+.+++.++|..+..... T Consensus 62 --~eg~~i~~~~lt--~~~~~a~i~~~gk~~~it-----D~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~- 131 (270) T protein:vir:95 62 --QEGVAMDTTQMS--MTTTKVTVKETGKAVEVT-----QTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQ- 131 (270) T ss_pred --cCCCccchhhcc--cchheeeeehhhCcceec-----HHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccc- Confidence 122233445554 456666667777555555 444433 459999999999999999999999986652211 Q ss_pred cccceeeccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccc Q lcl|NC_012740. 325 GKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLA 404 (528) Q Consensus 325 ~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~ 404 (528) +....++. +-+-.-..++.. ....-++|+|.|.+++.|....+++... 
T Consensus 132 ----------~~~~~~t~-------------~~~~dA~~~lgd---------~~~~~~~i~vhs~~~~~Lrk~~~~~~~~ 179 (270) T protein:vir:95 132 ----------TATVSADA-------------TGILDAIEVFNS---------ENDEDYVLYVNPKDYNKLVKSLFKVGGN 179 (270) T ss_pred ----------ccccccCH-------------HHHHHHHHHhcc---------ccCCCcEEEEcHHHHHHHHhhhcccccc Confidence 11111111 111111111111 2345689999999999999876654321 Q ss_pred cccccccccccccCceEEEEecCceEEEeeCCCCcceEEEEEe-cCCCccceeEeccccccceeEEecCccccceeeeee Q lcl|NC_012740. 405 MQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYK-GDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKT 483 (528) Q Consensus 405 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K-G~~~~d~g~fyaPYv~~~~~~~~Dp~s~qP~~~~~t 483 (528) . +.+.- .+-.+|.+.| ++|++|.+.+.+|-..-++ |.-. |+-.= +...-.--|+..++-.+--.. T Consensus 180 ~-----~~~~~--~~G~ig~~~G-~~Viv~s~~~~~~~~~l~~~gAi~-----~~~~~-~~~vEtdRd~~~~~d~i~~~~ 245 (270) T protein:vir:95 180 V-----QDRAI--SKGDLVEIVG-VSDIVKSKRVSENTAFLQRYGAME-----IVNKK-KPEAYTDFDILKRTHLLSTNY 245 (270) T ss_pred c-----ccchh--cccccceecc-eeEEEeCCCCCceeEEEEecccee-----eeecC-CceeeeccchhhcccEEEeee Confidence 1 11111 1113778866 9999999888887443333 2111 11100 111123347777777777777 Q ss_pred eeceee-cCcccccCCCccceecccchHHhhcchhhh Q lcl|NC_012740. 484 RYGIGI-NPFADSKSQAPSARITSGMLSKDSVGKNAY 519 (528) Q Consensus 484 RY~l~~-nP~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 519 (528) +|++.. 
|| .+ -.+++ -+-+|.-.| T Consensus 246 ~y~v~~~~~---sk----vv~~t-----~~~a~~~~~ 270 (270) T protein:vir:95 246 HYSVNLKDE---TG----VVKVT-----FKPSGSLEM 270 (270) T ss_pred EEEEEEEcc---ce----EEEEE-----ecCCCCcCC Confidence 777641 22 10 01111 011222333 No 109 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=22.49 E-value=2.4 Score=18.46 Aligned_cols=347 Identities=13% Similarity=0.042 Sum_probs=122.0 Q ss_pred CcchHHHHHhhhhhhcC--------------CccchhccchhhhhhhhhhhhHHHHhhhcc-ccchhh--hhhhhccccc Q lcl|NC_012740. 1 MKTTKELMEKWSPLLEN--------------EKLPEIATASKQKLVAKILESQEADFAVDP-IYKDEK--VVEAFGGFIA 63 (528) Q Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~~~~~~~~~~~~~~~~~~enq~~~~~~~~-~~~~~~--~~~~~~~~l~ 63 (528) ...-++|+++.+-+.+. +..-++.... .. +.. |+++-+.+.+.. ...+.. ...+....-. T Consensus 4 ~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~-~e-~~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 4 FERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKA-LE-REK-IEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHH-HH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 22223333333333321 1111111100 00 000 111111111100 000000 0000000000 Q ss_pred ccccc--------------------ccCCccccccccccccccc---cccCcchhhHHHHHHhhhhhhhceeeecCCccc Q lcl|NC_012740. 64 EAEVA--------------------GDHGYNASNIASGQTTGAI---TNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPT 120 (528) Q Consensus 64 ea~~~--------------------~~~g~~~~~~~e~t~tg~v---~~~~P~li~l~Rra~~~lI~~DI~GVQPmTgPT 120 (528) ..... 
+..-.....-+-+++.|.+ ..+.+.++.+ +.+..+-.+++-+.||++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~---~~~~~~l~~l~~~~~~~~~~ 157 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKL---KEGYPSLKEHCHVIPVNRNA 157 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHH---HHhhhhhhhhceeeeccCCc Confidence 00000 0000000000111112221 1222333334 44455678889999998876 Q ss_pred ceeeeeeeeecCCCCCCcccccccccccccccccccccCccccccccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 121 SQIFAIRSVYGGDPLAEHAKEAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETGIAYLQN 200 (528) Q Consensus 121 GLIFAMRsrY~~~~~s~~G~EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg~~~~~~ 200 (528) +-+--.+ .. +.+.+ T Consensus 158 ~~~~~~~-----~~--------------~~~~~----------------------------------------------- 171 (421) T protein:vir:13 158 GKMPVRA-----GA--------------SVDKL----------------------------------------------- 171 (421) T ss_pred eEEEEee-----cC--------------Cccce----------------------------------------------- Confidence 4221110 00 00000 Q ss_pred cccccccccCcccccCccccccccccccccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecccccccchH Q lcl|NC_012740. 201 VTAEQVTPTKADSESDDEVVMKLMEEGKLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSRQLKARYSI 280 (528) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSRALKAEYT~ 280 (528) ...+.+- +. ..+...|.+..|++.|..+ ...+|- T Consensus 172 -----------------------------~~~~E~~-----~~-----~~s~~~f~~i~~~~~k~~~-------~v~iS~ 205 (421) T protein:vir:13 172 -----------------------------ANLAKDT-----EL-----VKAMLKTQPMAYDIDDYGL-------LAPIDN 205 (421) T ss_pred -----------------------------eeccccc-----cc-----cccccceeEEEeeeeeeEe-------ehhhhH Confidence 0000000 00 0012235555555555544 456999 Q ss_pred HHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhheeecccceeeccccccceeccccccccccchhHHHHHHH Q lcl|NC_012740. 
281 EVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKS 360 (528) Q Consensus 281 ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~r~a~e~~r~ 360 (528) ||.+|-- .|.++.|.+-|+..+..-+|..|+..+ +|+. ..+++.+ .+..+. T Consensus 206 ell~ds~----~~l~~~i~~~la~~~~~~~~~~i~~~~--------~g~~----~~~~~~~-------------~d~i~~ 256 (421) T protein:vir:13 206 SLLEDSE----INFLEFVNEEFAEFAVNTENAEIVKQA--------KAVL----AEETIND-------------YAGLVK 256 (421) T ss_pred HHHhhhH----HHHHHHHHHHHHHHHHHHhhhhHhhhh--------hhcc----ccccccc-------------hHHHHH Confidence 9999842 456888888888888888888887532 2222 1122211 123344 Q ss_pred HHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEEEeeCCCCcc Q lcl|NC_012740. 361 LIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKVFIDQYARQD 440 (528) Q Consensus 361 L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 440 (528) ++..+.. .+.....+|+++.....|...- ...|. -+..+.... --++|.| ++|++..+.+.. T Consensus 257 ~~~~l~~---------~~~~~a~~v~n~~~~~~l~~lk-----d~~G~--~i~~~~~~~-~~~tl~G-~pV~~~~~~~~~ 318 (421) T protein:vir:13 257 TINSLVP---------NARKRAIIVTNSDGRAYLDGLM-----DKQGR--PLLKELSDG-GDLVFKG-RPVIELEESIFD 318 (421) T ss_pred HHHHhhh---------hhcCCCEEEEcHHHHHHHHHhh-----cCCCc--eeecCcCCC-CCceecc-eeeEEecccccc Confidence 4444432 2234667889999888887531 11110 011111111 0245755 788877664321 Q ss_pred eEEEEEecCCCccceeEecccc---------ccceeEEecCc---cccceeeeeeeeceee-----------cCcc--cc Q lcl|NC_012740. 441 YFTVGYKGDNEMDAGIYYAPYV---------ALTPLRATDPQ---SFHPVLGFKTRYGIGI-----------NPFA--DS 495 (528) Q Consensus 441 y~~vG~KG~~~~d~g~fyaPYv---------~~~~~~~~Dp~---s~qP~~~~~tRY~l~~-----------nP~~--~~ 495 (528) -.+ +..+||+-+- .+... ..+-. ..+=.+-+..||+.++ .++. .. 
T Consensus 319 -----~~~----~~~~~~gd~~~~~~~~~~~~~~v~-~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~ 388 (421) T protein:vir:13 319 -----VGD----ETKFIVSDFKTLIKFMDRKQYLID-QSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVK 388 (421) T ss_pred -----CCC----ceEEEEEeccccEEEEEecceEEE-eecccccccCeeEEEEEeeecceeecchhhheeeecccceeec Confidence 000 1112222110 01111 11111 2223455556665442 1111 01 Q ss_pred cCCCccceecccchHHhhcchhhhhhhhhhccC Q lcl|NC_012740. 496 KSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) Q Consensus 496 ~~~~~~~~~~~~~~~~~~a~~~~~~r~~~Vk~~ 528 (528) .++.+.+-...+..- .+||-+ +.|.+= T Consensus 389 ~~~~~~~~~~~~~~~--~~~~~~----~~~~~~ 415 (421) T protein:vir:13 389 LQEVLKSSPRSGKNK--NESKEE----IKEEGE 415 (421) T ss_pred cccccCCCCcCCCCc--cccchh----eeeccc Confidence 111111111111110 122221 111111 No 110 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=21.74 E-value=2.6 Score=18.35 Aligned_cols=312 Identities=13% Similarity=0.100 Sum_probs=111.9 Q ss_pred CCcccceeeeeeeeecCCCCCCccc-ccccccccccccccccccCccccccccccccccccccccccccccccccccccc Q lcl|NC_012740. 116 MSTPTSQIFAIRSVYGGDPLAEHAK-EAFHPMYSPNAFHSSLAAKDATTVSPTGTAFQKLTLSTPIAAGDIVHHTFAETG 194 (528) Q Consensus 116 mTgPTGLIFAMRsrY~~~~~s~~G~-EA~~n~~Eadt~fSG~~~a~~~~~~~tgt~f~~~t~~t~~a~Gdi~~~~f~~tg 194 (528) |.--++..-+.|.-.++ .+++ -++|- .-| .|.... .|.... T Consensus 1 m~~~~~~~~~t~~g~~~----~~~d~~al~i-----k~f----------------------------~~eV~~-~f~~~s 42 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGK----SSSDALALFL-----KVF----------------------------AGEVLT-AFTRRS 42 (347) T ss_pred CCCCCccccccccccCC----ccccHHHHHH-----HHH----------------------------hHHHHH-HHHHHH Confidence 33333211111111100 0000 01110 000 011100 000000 Q ss_pred cccccccccccccccCcccccCccccccccccc--cccccccccchhhhhhhcccCCCCCcccccceeEEeeEEEEEecc Q lcl|NC_012740. 
195 IAYLQNVTAEQVTPTKADSESDDEVVMKLMEEG--KLAEIAFGMATSIAELQQGFNGSQNNPWNEMSMRIDKQVVEAKSR 272 (528) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~g~GmsTa~AE~l~~~ggs~~~~f~EMaFSIEK~TVTAKSR 272 (528) .....+...... .++.......... ..+.-|+.+. +......-.|+-++||++.+ T Consensus 43 -~~~~~~~~r~i~-------~G~sv~i~~iG~~tv~~~t~G~~l~----------~~~~~~~~~e~~itID~~~~----- 99 (347) T protein:vir:94 43 -VTADKHIVRTIQ-------NGKSAQFPVMGRTSGVYLAPGERLS----------DKRKGIKHTEKVITIDGLLT----- 99 (347) T ss_pred -hhhccccccccc-------ccceEEEecccceeeeeecCCCCcC----------CCCCCCCcceEEEEecchhh----- Confidence 000000000000 0111111111110 1111111110 00011234677788887532 Q ss_pred cccccchHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh-hheeecccceeeccccccceeccccccccccc Q lcl|NC_012740. 273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN-FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGA 351 (528) Q Consensus 273 ALKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINReii~~i~-~~a~~~~~~~~~~~~~~~g~~dl~~~~d~~~~ 351 (528) +..-+.-.-|.++ | .|-.+|++.-....+..++++-|++.+. ..+..+.............+++.....+... T Consensus 100 ---~~~~VddiD~~q~-~-~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~- 173 (347) T protein:vir:94 100 ---ADVMIFDIEDAMN-H-YDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLGTASVLEVGKKADLDT- 173 (347) T ss_pred ---hhHHhhhHHHHhc-C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcccceeeccccccccc- Confidence 3334444444555 3 7889999999999999999999998653 2333222110000000011122111111100 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhccCCCcEEEEchhHHHHhhccccccccccccccccccccccCceEEEEecCceEE Q lcl|NC_012740. 352 RWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAQGLNTDTTKAVFAGVLAGKYKV 431 (528) Q Consensus 352 r~a~e~~r~L~~~i~~~a~~I~~~T~~g~gn~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~v 431 (528) ..+-...++..|-+.....-..-=--.|-|+|++|+.-.+|-..-.+.... ...+.+. 
..-.+|.+.| ++| T Consensus 174 --~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~-----~~~~~~~-~~G~Vg~i~G-~~V 244 (347) T protein:vir:94 174 --PAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAAN-----YAALIDP-ETGNIRNVMG-FVV 244 (347) T ss_pred --hhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhh-----ccccccc-cccceEEEec-eEE Confidence 111223333333222222221111113789999999999886542221111 1111122 2225899966 999 Q ss_pred EeeCCCCcc----------eEEE-E------------EecCCCccceeEeccccccce-------eEEecCccccceeee Q lcl|NC_012740. 432 FIDQYARQD----------YFTV-G------------YKGDNEMDAGIYYAPYVALTP-------LRATDPQSFHPVLGF 481 (528) Q Consensus 432 y~D~y~~~d----------y~~v-G------------~KG~~~~d~g~fyaPYv~~~~-------~~~~Dp~s~qP~~~~ 481 (528) |.-++.|.- |-++ | |+++-.-..++||-|=.-+.- ..-.|+..|-=.|== T Consensus 245 ~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~ 324 (347) T protein:vir:94 245 VEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVG 324 (347) T ss_pred EecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhh Confidence 998887652 2111 1 223222335677766533211 112233333322211 Q ss_pred eeeecee-ecCcccccCCCccce Q lcl|NC_012740. 482 KTRYGIG-INPFADSKSQAPSAR 503 (528) Q Consensus 482 ~tRY~l~-~nP~~~~~~~~~~~~ 503 (528) +..||.. .+|=+-..-....|. T Consensus 325 ~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 325 KYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhhhcCcccccceeEEEEecCCC Confidence 2222221 223110000000111 Done!