Query lcl|NC_013650.1_cdsid_YP_003347758.1 [gene=79] [protein=gp79] [protein_id=YP_003347758.1] [location=33946..37638] Match_columns 1230 No_of_seqs 475 out of 18196 Neff 8.4 Searched_HMMs 1612 Date Thu Nov 7 14:36:11 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_76 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_76_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105154 Length: 525 98.0 4.3E-08 2.7E-11 61.0 7.3 458 32-551 1-525 (525) 2 protein:vir:3420 Length: 533 # 94.5 0.0042 2.6E-06 33.6 13.9 466 24-563 1-533 (533) 3 protein:vir:79538 Length: 502 86.5 0.045 2.8E-05 28.0 17.1 437 16-539 1-502 (502) 4 protein:vir:389 Length: 530 # 84.6 0.059 3.7E-05 27.3 15.4 434 24-539 1-530 (530) 5 protein:vir:81152 Length: 411 82.3 0.015 9.4E-06 30.6 5.0 379 110-564 1-411 (411) 6 protein:vir:78641 Length: 278 82.1 0.01 6.5E-06 31.4 4.0 269 173-496 1-278 (278) 7 protein:vir:105002 Length: 432 80.9 0.031 1.9E-05 28.9 6.1 393 110-551 1-432 (432) 8 protein:vir:102855 Length: 432 80.9 0.031 1.9E-05 28.9 6.1 393 110-551 1-432 (432) 9 protein:vir:107605 Length: 432 80.9 0.031 1.9E-05 28.9 6.1 393 110-551 1-432 (432) 10 protein:vir:95542 Length: 548 80.2 0.097 6E-05 26.1 11.2 481 16-599 1-548 (548) 11 protein:vir:102118 Length: 409 74.7 0.049 3E-05 27.8 5.4 375 118-564 1-409 (409) 12 protein:vir:102080 Length: 429 67.7 0.25 0.00015 23.9 7.7 401 92-571 1-429 (429) 13 protein:vir:101647 Length: 460 65.8 0.28 0.00017 23.6 8.5 423 59-552 1-460 (460) 14 protein:vir:6382 Length: 553 # 62.2 0.34 0.00021 23.2 16.4 460 32-539 1-553 (553) 15 protein:vir:1266 Length: 416 # 61.2 0.29 0.00018 23.5 6.7 380 128-562 1-416 (416) 16 protein:vir:3843 Length: 397 # 58.2 0.42 0.00026 22.7 7.7 375 98-570 1-397 (397) 17 protein:vir:79772 Length: 648 55.6 0.097 6E-05 26.1 3.1 544 99-726 1-648 (648) 18 protein:vir:3989 Length: 392 # 49.2 0.18 0.00011 24.6 3.5 373 111-541 1-392 (392) 19 protein:vir:1023 Length: 392 # 49.2 0.18 0.00011 24.6 3.5 373 111-541 1-392 (392) 20 protein:vir:2683 Length: 412 # 47.8 0.43 0.00027 22.6 5.3 363 141-562 1-412 (412) 21 protein:vir:4952 Length: 386 # 46.4 0.73 0.00046 21.3 7.6 373 112-537 1-386 (386) 22 protein:vir:8418 Length: 409 # 46.4 0.53 0.00033 22.1 5.6 379 109-552 1-409 (409) 23 protein:vir:96738 Length: 505 41.4 0.93 0.00058 20.8 17.1 440 32-529 1-505 (505) 24 protein:vir:4828 Length: 382 # 41.2 0.39 0.00024 22.8 4.0 359 130-537 1-382 (382) 25 protein:vir:1082 Length: 359 # 37.2 0.5 0.00031 22.2 3.9 333 113-531 1-359 (359) 26 protein:vir:1380 Length: 422 # 30.6 1.6 0.00097 19.5 6.6 385 112-563 1-422 (422) 27 protein:vir:7407 Length: 392 # 30.1 1.6 0.001 19.5 6.2 369 114-541 1-392 (392) 28 protein:vir:81218 Length: 423 29.1 0.96 0.00059 20.7 4.0 375 127-567 1-423 (423) 29 protein:vir:93610 Length: 454 28.2 1.7 0.0011 19.3 5.3 425 98-594 1-454 (454) 30 protein:vir:10321 Length: 495 25.2 2.1 0.0013 18.8 14.2 435 16-529 1-495 (495) 31 protein:vir:9702 Length: 406 # 22.3 2.5 0.0015 18.4 7.7 371 130-572 1-406 (406) 32 protein:vir:105064 Length: 421 21.7 2.6 0.0016 18.3 5.9 378 132-561 1-421 (421) No 1 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=98.02 E-value=4.3e-08 Score=60.97 Aligned_cols=458 Identities=17% Similarity=0.229 Sum_probs=190.2 Q ss_pred hHHHHHh-hhhhcCCcchhHHHHHHhhhhhhhhHH-----HHHHHHhccCcccceeecCchhhh-hhHhhhhhcCCCCcc Q lcl|NC_013650. 32 MARAQAA-ALQNTVNNKPLIDYFQGRRRAAEANRQ-----RLASYRKQGNFGSNMQIAMPKIRQ-PLGTLADKGIPFNVE 104 (1230) Q Consensus 32 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 104 (1230) |.|..-. .++.|++.- -++.-+-..-..+-.|| -|-+---+| |-+||- .|-+|.. -+-||- +=||-- T Consensus 1 ~~~~~~~~~~~~t~~k~-~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~g-fv~~~~-~ng~i~~v~~~~l~---~~f~np 74 (525) T protein:vir:10 1 MTRTKGSKNKSTTIEKQ-SLQIEQLQEHINELERQYNTYDDVVDAFIDG-FVMDLC-NNGKIKTVNLDTLQ---LWFNNP 74 (525) T ss_pred CCCCcCCcccccchhhh-hhhHHHHHHHHhhhhhhcchhhhHHHHHHHH-HHHHhh-cCCceeeeeHHHHH---hhhcCh Confidence 1111111 112222211 11111111111111111 111112222 223331 0111110 011111 112222 Q ss_pred hhhHHHHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccchhHHhhHHH-hhhhhhhhcce Q lcl|NC_013650. 105 DEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQ-FAREYFTVGEV 183 (1230) Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 183 (1230) |+ -...|..-...||-..--|--|.|+.-..|-.+...+ -|-..--.-++|.++|+-+.. +-..-.|--=. T Consensus 75 d~-~~~~i~~l~~y~yi~~~~v~ql~~li~~lp~l~y~i~-------~~~~~k~~~~~~s~~n~~l~k~i~hk~ltrdll 146 (525) T protein:vir:10 75 DK-YINNIVNLLTYYYIIDGNVFQLYDLIFSLPPLDYQIK-------VLKRDKDYKEDLSTINLYLEKKIQHKQLTRDLL 146 (525) T ss_pred HH-HHHHHHHHHHHhhhhcchHHHHHHHHHhcCCcceeeh-------hhhhccchhhHHHHHHHHHHHhHHHHHHHHHHH Confidence 21 1233555667788877777788888877887665443 333333344666666665442 22111221112 Q ss_pred eeccccchhcccchhh------hhcC------chhhhcchhhhhccchh----eee--ehhhhhccccccccccccccce Q lcl|NC_013650. 184 TSLAHFNESLGVWSSE------EILN------PDMLRVSRSMFVQRERV----QLM--VKDLVDHLRQGPTTAGGNMSTV 245 (1230) Q Consensus 184 ~~~~~~~~~~~~~~~~------~~~~------~~~~~~~~~~~~~~~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~ 245 (1230) .-++|----.|.|--. -|.| |-| |-+- ++|. ||. ++.+.-++--+. + T Consensus 147 ~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~-r~~g-----~~v~vid~~~f~~~~~~~r~~~~~~------l--- 211 (525) T protein:vir:10 147 VQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYG-RAKG-----KMVAVIDLQWFDEMSELERKLTFEN------L--- 211 (525) T ss_pred HHhhccCceeEeeecCCCCcchhhhhhhhhhcccc-ccCC-----ceEEEEehHHhhhhhHHHHHHHHHh------h--- Confidence 2344444445666321 0111 111 1111 1111 111 222222111110 0 Q ss_pred eecCcchhhhccchhhhhhhhhHHH----hhhccCCCccccchhhhhhccCCcccccc-CCccccchHHHHHHHHHHHHH Q lcl|NC_013650. 246 EETPSEREQRMREFQDLQRRYPEII----QAAMQNDGLDISEALISRVVNRPTAWATR-GAPHLLRSFRTLMAEESLNAA 320 (1230) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 320 (1230) ++..+. +.|..-- +--.-.|+|.||++...|+.+++..+..| |+|.++..|.++..+++|+.+ T Consensus 212 -----sp~i~~-------~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~ 279 (525) T protein:vir:10 212 -----SPLITE-------NKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDL 279 (525) T ss_pred -----chhhhh-------hhhhHHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHH Confidence 011111 2221111 22234688999999999999999999999 999999999999999999999 Q ss_pred hhcccceeeceeEEEeecccccCCcccc-cCc---hhhHHHHHHHhhhhhhh---------hhhhhhhhhhhhhcccccc Q lcl|NC_013650. 321 QDAVADRLYSPLVLATLGIEDMGDGEPW-IPD---QGELDEVRDDMQSLLAA---------DFRLMVHNFGLKVENVFGR 387 (1230) Q Consensus 321 ~~~~~~~~~~~~~~~~l~~~d~~~~~~~-~~~---~~~l~~~~d~~~~~~~~---------~~~~~~~~~~~~~~~~~~~ 387 (1230) +.+||.++.+++.+-+++..+ +.-. ||+ |..|+-|+++.+..+.. ++++++.=--+..++.++. T Consensus 280 EqsIA~kii~a~avLk~gg~~---gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glD 356 (525) T protein:vir:10 280 EQSIADKIIKAMAVLKFRGKD---DNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLD 356 (525) T ss_pred HHHHHHHhhhhheeeeecccc---CccccCchHHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCC Confidence 999999999999999998733 3443 776 77788888776553322 2222111000001111111 Q ss_pred ccccCCCcchhhHHHHHHhhhhhhhhhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccchhhh-HHH---hhhhhh Q lcl|NC_013650. 388 ESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEV-VAE---AQGHYD 463 (1230) Q Consensus 388 ~~~~~~d~~~~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~~~~-l~~---~~~~~~ 463 (1230) + ..|+.|...+..++|++..+..++ |+.|+++.++++-+-|.+..+....+....+-.+. +.+ +..-+. T Consensus 357 g------~K~d~I~~DI~~A~GlS~sL~nGd-ggNyAtaslnld~fykkigVm~e~Iee~y~kL~d~Vl~~~k~~nyifn 429 (525) T protein:vir:10 357 P------KKYDSIDNDITNATGISQVLTNGT-KGNYASAKLNLDVFYKKIGVMLEIIEEIYNQLIDIILGEEKGCNYIFQ 429 (525) T ss_pred c------hhhhhhhhhhhhhhccceeeecCC-CCceeeeeeeHHHHHHHHHHHHHHHHHHHHHHHhhhcCcccCcceEEe Confidence 1 146778888899999999988855 88999999888766554332222222111000000 000 000011 Q ss_pred hhhcccchhHHHHHHHHHHHhhh--------------hhhh----hhccceeeecccceecCCCeeeecc-cccEehhee Q lcl|NC_013650. 464 YDLKGGVRVPIYREIVEYDEETG--------------QEYI----RKVPKLLIPEVKFSCVVPGTQVLTP-EGQRNVEDI 524 (1230) Q Consensus 464 ~~~~~g~~~~v~~~i~~~~~~~g--------------~~~i----~~~~~l~~~~~~~~Clt~DT~VlT~-dG~k~IedL 524 (1230) |+--+.+.....-.++...+..+ .+++ ....++..-. -..=+-.|.|+|. +| .-|+-= T Consensus 430 ydkd~pi~~kkk~d~LIkL~d~g~s~k~vldl~gis~e~y~E~s~yEtE~lkl~E--Ki~pp~~~~v~SGk~~-n~iG~P 506 (525) T protein:vir:10 430 YNKDTPIEREKKLDTLIKLEAQGYSAKYVLDILGISSEEYFEESIYEIEKLKLRE--KIMPPLNTNVLSGKDG-NDIGSP 506 (525) T ss_pred cCCCchhhhhhhhhhhhhhhccchhhhhhhhhhccCcchHHHHHHHHHHHHHHhh--hccccccceeeecccc-ccccCC Confidence 11000000000000000000000 0000 0000000000 0000112333332 11 000000 Q ss_pred ccCcEEEEecCCCeEEEeeEEEeecCc Q lcl|NC_013650. 525 RPGDEVIAWDGTGYVVDTALHTGIEHR 551 (1230) Q Consensus 525 ~vGD~V~t~~g~~~~v~~v~~~~~~~~ 551 (1230) . .+++... ...+....++. T Consensus 507 ~-------~dd~~~~-dati~s~~~~~ 525 (525) T protein:vir:10 507 K-------LDDSDSS-DATIESKERGV 525 (525) T ss_pred c-------cCCCcch-hhhhhhhhcCC Confidence 0 0000000 00000001111 No 2 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=94.50 E-value=0.0042 Score=33.59 Aligned_cols=466 Identities=11% Similarity=0.054 Sum_probs=181.9 Q ss_pred CCCCCCchhHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhcc-CcccceeecCchhhhhhHhhhhhcCCCC Q lcl|NC_013650. 24 VNMPNSPTMARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQG-NFGSNMQIAMPKIRQPLGTLADKGIPFN 102 (1230) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (1230) +.|| ......+.+|+ . .|...+.|...+ ..+-=|+--+|..+. + | T Consensus 1 ~~~p---------------------~~~~~~~~~~~-~-~~~~~~~y~~~a~~~~~~~~~w~p~~~s-----~------~ 46 (533) T protein:vir:34 1 MKTP---------------------TIPTLLGPDGM-T-SLREYAGYHGGGSGFGGQLRSWNPPSES-----V------D 46 (533) T ss_pred CCCc---------------------hhhhhhccccc-c-hHHHHHhhhhccCCCCCcccccccCCCC-----H------H Confidence 3333 22222222222 1 122233333221 111112211232222 2 2 Q ss_pred cchhhHHHHHHHHHHHHhcccchhHHHHHhhhcCcc-cccccccc------------c----hhHHHHHHHhh------- Q lcl|NC_013650. 103 VEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPV-VGMEFDSK------------D----PLIKTFYEDLF------- 158 (1230) Q Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~------------~----~~~~~~~~~~~------- 158 (1230) .+-.-.++.||.=+|.+|+.++++.-.||.+...=| .|+...++ | ..|+..|+... T Consensus 47 ~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~ 126 (533) T protein:vir:34 47 AALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCI 126 (533) T ss_pred HHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCcccee Confidence 223346777888999999999999999998877633 23333332 1 35556665543 Q ss_pred -ccccchhHHhhHHHhhhhhhhhcceeeccccchhccc-ch-hhhhcCchhhhcchhhhhccchheeeehhhhhcccccc Q lcl|NC_013650. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGV-WS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP 235 (1230) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (1230) +.-.+|+..+ -...-|.++.-||+|-+-+....-|. |. +-+.|+||+|.-.... .... T Consensus 127 D~~g~~~f~~~-q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~-~~~~----------------- 187 (533) T protein:vir:34 127 DVERKRTFTMM-IREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNT-GDSR----------------- 187 (533) T ss_pred ccccccCHHHH-HHHHHHHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCC-CCCC----------------- Confidence 3334454444 34588999999999998877765432 32 6689999997432111 0011 Q ss_pred ccccccccceeec----CcchhhhccchhhhhhhhhHH--HhhhccCCCccccchhhhhhccCCccccccCCccccchHH Q lcl|NC_013650. 236 TTAGGNMSTVEET----PSEREQRMREFQDLQRRYPEI--IQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFR 309 (1230) Q Consensus 236 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (1230) .+-.=++.+ |+-| -+..+..+.. +......+.++++-..|-|+...--.-..||-|.+.+... T Consensus 188 ----~i~~GIe~d~~Gr~~aY-------~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~ 256 (533) T protein:vir:34 188 ----NCRAGVQINDSGAALGY-------YVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVME 256 (533) T ss_pred ----ceEeeeEECCCCCeEEE-------EEeecCCCCccccccceeeeeeccChhHeeeeccccCCCcccCCchHHHHHH Confidence 111111222 2222 2222221110 0111122345677788999998877778999999988888 Q ss_pred HHHHHHHHHHHhhc---ccceeeceeEEEeecccccCCcccccCch-----hhHHHHHHHhhhhhh-hhhhh-----hhh Q lcl|NC_013650. 310 TLMAEESLNAAQDA---VADRLYSPLVLATLGIEDMGDGEPWIPDQ-----GELDEVRDDMQSLLA-ADFRL-----MVH 375 (1230) Q Consensus 310 ~~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~~d~~~~~~~~~~~-----~~l~~~~d~~~~~~~-~~~~~-----~~~ 375 (1230) .|...+.+.+++.. ++..+ +-++=+..+.... .+.....+ +++....+...+... ..+.+ ... T Consensus 257 ~l~~l~~y~dael~~a~i~A~~-a~fi~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L 333 (533) T protein:vir:34 257 QMKMLDTLQNTQLQSAIVKAMY-AATIESELDTQSA--MDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHL 333 (533) T ss_pred HHHHHHHHHHHHHHHHHHhhhh-eeeeecCCCcccc--cccccCCCcccccccccccchhhhhccCcceeeccCceeeec Confidence 88888888766543 22111 1111122222111 11111111 111111110000000 00000 111 Q ss_pred hhhhhhccccccccccCCCcchhhHHHHHHhhhhhhhhhhhhcc-chhhHHHHHHHHHHHHHHHHHHHHHhhhccch-h- Q lcl|NC_013650. 376 NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRHIRRR-C- 452 (1230) Q Consensus 376 ~~~~~~~~~~~~~~~~~~d~~~~~i~~~~~~~l~i~~ali~~~~-g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~-~- 452 (1230) .-|..++...-..-..++......+.+.+..++|+...++.++- +..|++....+....+.+...+..+....... + T Consensus 334 ~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~ 413 (533) T protein:vir:34 334 MPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFL 413 (533) T ss_pred CCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222111111223333346688888899999998888885 67898877766666665555554443211110 1 Q ss_pred hhHHHhhhhhhhhhcccchhHHH--HHHHHHHHhhh--hhhhhhc--cceeeecccceecCCCeeeeccccc--Ee-hhe Q lcl|NC_013650. 453 EVVAEAQGHYDYDLKGGVRVPIY--REIVEYDEETG--QEYIRKV--PKLLIPEVKFSCVVPGTQVLTPEGQ--RN-VED 523 (1230) Q Consensus 453 ~~l~~~~~~~~~~~~~g~~~~v~--~~i~~~~~~~g--~~~i~~~--~~l~~~~~~~~Clt~DT~VlT~dG~--k~-Ied 523 (1230) .+|.+..-.....+..+...+.+ +.-.......+ ..++... ...........+.|- +.|.-..|. .. +++ T Consensus 414 ~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~-~~~~a~~G~D~~ev~~q 492 (533) T protein:vir:34 414 CWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTY-EKECAKRGDDYQEIFAQ 492 (533) T ss_pred HHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCH-HHHHHHcCCCHHHHHHH Confidence 11221111111111111000000 00000000000 0000000 000000000000000 011111120 00 000 Q ss_pred e-------ccCcEEEEecCCCeEEEeeEEEeecCcceEEEEEEcCce Q lcl|NC_013650. 524 I-------RPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVITKTGR 563 (1230) Q Consensus 524 L-------~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~G~ 563 (1230) + +.-..+...++.......... -.....+ ...+. T Consensus 493 ~a~e~~~~~~~gl~~~~~~~~~~~s~~~~-~~~~~~~-----~~~~~ 533 (533) T protein:vir:34 493 QVRETMERRAAGLKPPAWAAAAFESGLRQ-STEEEKS-----DSRAA 533 (533) T ss_pred HHHHHHHHHhcCCCCCCCCCcCccCCCCC-CCCCCcc-----cCCCC Confidence 0 000000000000000000000 0000000 00000 No 3 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=86.49 E-value=0.045 Score=27.96 Aligned_cols=437 Identities=13% Similarity=0.047 Sum_probs=181.3 Q ss_pred HHHHHHhcCCCCCCchhHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhh Q lcl|NC_013650. 16 VNRLRKAGVNMPNSPTMARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLA 95 (1230) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (1230) +|-|-|+ -.-++ |..-++|.+ .|+.+..|.-.+ ++-..-..|.. ..+ T Consensus 1 mn~~dr~------------------i~~~s--P~~~~~R~~------ar~~~~~y~aa~--~~r~~~~~~~~-----~s~ 47 (502) T protein:vir:79 1 MAILDDV------------------IGVFS--PGWKAARLR------SRAVIQAYEAVK--TTRTHKARREN-----RTA 47 (502) T ss_pred CchHhhH------------------HhhcC--hHHHHHHHh------hHHHHhhccccC--cccccCCCCCC-----CCh Confidence 1111110 01122 222333333 334444454332 12222222322 223 Q ss_pred hhcCCCCcchhhHHHHHHHHHHHHhcccchhHHHHHhhhcCcc-c-ccc----ccccc--------hhHHHHHHHhh--- Q lcl|NC_013650. 96 DKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPV-V-GME----FDSKD--------PLIKTFYEDLF--- 158 (1230) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~----~~~~~--------~~~~~~~~~~~--- 158 (1230) | .+....++.||.=||..|+..+++.-.|+.+...=| . |+. .+..| ..|++.|+... T Consensus 48 ~------~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~ 121 (502) T protein:vir:79 48 D------QLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSP 121 (502) T ss_pred H------HHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCc Confidence 3 333446788899999999999999999998887755 2 222 22233 34455555443 Q ss_pred -ccccchhHHhhHHHhhhhhhhhcceeeccccchh------cccchhhhhcCchhhhcchhhhhccchheeeehhhhhcc Q lcl|NC_013650. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNES------LGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHL 231 (1230) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (1230) +..++|+.. |....-|.++.-||+|-.-+.++. +++=-+-++|+||+|..... + T Consensus 122 D~~g~~~f~~-~q~l~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~~----~-------------- 182 (502) T protein:vir:79 122 EVTGQFTRPM-LERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTSD----E-------------- 182 (502) T ss_pred CccccCCHHH-HHHHHHHHHHhCCceEEEEeecccCccCCCcccceEEEEecchhcCCCCC----C-------------- Confidence 222334333 344588999999999988776552 11111569999999742211 0 Q ss_pred ccccccccccccceeecCcchhhhccchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHH Q lcl|NC_013650. 232 RQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTL 311 (1230) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (1230) +..+..=+|.+-..+-+ -|-+ .+..|.. ....+.+.+|-.-|.|+...--.-..||-|.+.++...| T Consensus 183 ------~~~i~~GVe~d~~Gr~~---aY~i-~~~hPgd---~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l 249 (502) T protein:vir:79 183 ------SNRLNQGVFVDDWGRPE---KYLV-YKSRPVS---GRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRL 249 (502) T ss_pred ------CCeeEeeeEECCCCceE---EEEE-eecCCCC---CcccceeEechhheEEeecccCCccccCCchHHHHHHHH Confidence 11111112222111111 1111 1222331 122344567778899999876777899999998887777 Q ss_pred HHHHHHHHHhhcccceeeceeEEEeecccccCCcccccCchhh--HHHHHHHhhhhhhhhhhhhhhhhhhhhcccccccc Q lcl|NC_013650. 312 MAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGE--LDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRES 389 (1230) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~--l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 389 (1230) ...+.+.+++..-+-.-+.-..+.+-...+...... ..+.+. ...+.. +.+ +....-|..++...-..- T Consensus 250 ~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~-~~~~~~~~~~~l~p---G~i-----~~~L~pGe~i~~~~p~~p 320 (502) T protein:vir:79 250 SALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG-NGSKENERELTIQP---GII-----YDDLKPGEEIGMVKSDRP 320 (502) T ss_pred HHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc-CCCCCccccccccC---Ccc-----ccccCCCceeeeeCCCCC Confidence 777777766543222222222223322111100000 011110 001100 000 011112333333222221 Q ss_pred ccCCCcchhhHHHHHHhhhhhhhhhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccch-h-hhHHHh--hhhhh-- Q lcl|NC_013650. 390 VPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRR-C-EVVAEA--QGHYD-- 463 (1230) Q Consensus 390 ~~~~d~~~~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~-~-~~l~~~--~~~~~-- 463 (1230) ..+....+..+.+.+..++|+...++.++-...|++....+....+.+...++.+.....+. + .+|.+. .+... T Consensus 321 ~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p 400 (502) T protein:vir:79 321 NPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLP 400 (502) T ss_pred CCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC Confidence 22333334668888888999998888877666888877766655555555554443221111 1 112111 11100 Q ss_pred -------h----hhcccc-hhHHHHHHHHH--------------HHhhhhhhhhhc----cceeeecccceecCCCeeee Q lcl|NC_013650. 464 -------Y----DLKGGV-RVPIYREIVEY--------------DEETGQEYIRKV----PKLLIPEVKFSCVVPGTQVL 513 (1230) Q Consensus 464 -------~----~~~~g~-~~~v~~~i~~~--------------~~~~g~~~i~~~----~~l~~~~~~~~Clt~DT~Vl 513 (1230) + +...+. .++-.++.... ..+.|..+.... .........+.-++.+ .-. T Consensus 401 ~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~-~~~ 479 (502) T protein:vir:79 401 RDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTD-PAS 479 (502) T ss_pred CCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCC-CCC Confidence 0 000000 00001111000 001111110000 0000000001001100 000 Q ss_pred ccc-ccEeh--heeccCcEEEEecCCCeE Q lcl|NC_013650. 514 TPE-GQRNV--EDIRPGDEVIAWDGTGYV 539 (1230) Q Consensus 514 T~d-G~k~I--edL~vGD~V~t~~g~~~~ 539 (1230) +.. +-.+. +|=..+ +++... T Consensus 480 ~~~~~~~~~~~~e~~~~------~~~~e~ 502 (502) T protein:vir:79 480 DKGGSSAATKRQEPQHT------DDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCC------CCCCCC Confidence 000 00000 000000 000000 No 4 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=84.56 E-value=0.059 Score=27.30 Aligned_cols=434 Identities=11% Similarity=0.053 Sum_probs=174.9 Q ss_pred CCCCCCchhHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhhhhcCCCCc Q lcl|NC_013650. 24 VNMPNSPTMARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNV 103 (1230) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (1230) .++|. ...+...+.....++..+.+.+...++++ -.|.. ..+| - T Consensus 1 ~~~~~------------~~~~~~~~~~~~~~~~~~~a~~~~~~~~~-------------w~~~~-----~s~~------~ 44 (530) T protein:vir:38 1 MKIPS------------LVGPDGKTSLREYAGYHGGGGGFGGQLRG-------------WNPPS-----ESAD------A 44 (530) T ss_pred Cccce------------eecCccccchHHHhhhhcccCCCCCcccc-------------cccCC-----CCHH------H Confidence 11111 11122233322222222221111111111 11211 1111 2 Q ss_pred chhhHHHHHHHHHHHHhcccchhHHHHHhhhcCccccccc--ccc------------c----hhHHHHHHHhh------- Q lcl|NC_013650. 104 EDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEF--DSK------------D----PLIKTFYEDLF------- 158 (1230) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~------------~----~~~~~~~~~~~------- 158 (1230) +..-.++.+|.=||..|+.++++.-.|+.+... |||--+ .++ | ..|+..|.... T Consensus 45 ~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n-vVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~ 123 (530) T protein:vir:38 45 ALLPNYSRGNARADDLVRNNGYAANAVQLHQDH-IVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGI 123 (530) T ss_pred HHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH-hhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEE Confidence 223456778888999999999999999988766 344332 222 2 34555565432 Q ss_pred -ccccchhHHhhHHHhhhhhhhhcceeeccccchhccc-ch-hhhhcCchhhhcchhhhhccchheeeehhhhhcccccc Q lcl|NC_013650. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGV-WS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP 235 (1230) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (1230) +--.+|+.. |-...-|.++.-||+|-+-+.+...|. |. +-++|+||+|.-... ..+ T Consensus 124 D~~g~~~f~~-~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~--~~~------------------ 182 (530) T protein:vir:38 124 DAERKRTFTM-MIREGVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNN--IGD------------------ 182 (530) T ss_pred eeeccCCHHH-HHHHHHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCC--CCC------------------ Confidence 222344444 345588999999999998887765432 32 679999999742211 000 Q ss_pred ccccccccceeec----CcchhhhccchhhhhhhhhH--HHhhhccCCCccccchhhhhhccCCccccccCCccccchHH Q lcl|NC_013650. 236 TTAGGNMSTVEET----PSEREQRMREFQDLQRRYPE--IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFR 309 (1230) Q Consensus 236 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (1230) ++.+-.=+|.+ |+-|- +..+..|. .+........++.+-..+-|+...--.-.+||-|.+..... T Consensus 183 --~~~i~~GIe~d~~Gr~~aY~-------i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~ 253 (530) T protein:vir:38 183 --TRNCRAGVKINDSGAALGYY-------VSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVME 253 (530) T ss_pred --CCeeEeeeEECCCCceEEEE-------EeeccCCCccccccceeeeeeccChhHeEeeccccCCCcccCCchHHHHHH Confidence 11111111222 22221 11111110 00111112345677778999998877788999999988888 Q ss_pred HHHHHHHHHHHhhcccceeeceeEEEe--ecccccCCcccccC-----chhhHHHHHHHhhhhhhh------hhhhhhhh Q lcl|NC_013650. 310 TLMAEESLNAAQDAVADRLYSPLVLAT--LGIEDMGDGEPWIP-----DQGELDEVRDDMQSLLAA------DFRLMVHN 376 (1230) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~d~~~~~~~~~-----~~~~l~~~~d~~~~~~~~------~~~~~~~~ 376 (1230) .|...+.+.+++..-+-.-+.-..+.+ .+.... ++.... +...+........+.... .-.+.... T Consensus 254 ~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~ 331 (530) T protein:vir:38 254 QMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSA--MDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLL 331 (530) T ss_pred HHHHHhHHHHHHHHHHHHhhhheeeeeccCCcccc--ccccccCCcccccccccccchhhhhcccccceeccCceeeecC Confidence 888888877665332211111111222 221111 111111 111111110000000000 00001111 Q ss_pred hhhhhccccccccccCCCcchhhHHHHHHhhhhhhhhhhhhcc-chhhHHHHHHHHHHHHHHHHHHHHHhhhccc-hh-h Q lcl|NC_013650. 377 FGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRHIRR-RC-E 453 (1230) Q Consensus 377 ~~~~~~~~~~~~~~~~~d~~~~~i~~~~~~~l~i~~ali~~~~-g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~-~~-~ 453 (1230) -|..++...-..-..+++..+..+.+.+..++|+...++.++- +..|++....+....+.+...+..+...... -+ . T Consensus 332 pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~ 411 (530) T protein:vir:38 332 PGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLC 411 (530) T ss_pred CCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHH Confidence 2222222211111223333346688888889999998888875 5679887766655555554444433221100 00 0 Q ss_pred hHHHhhhhhhhhhcc---------------------c-chhHHHHHHHHH--------------HHhhhhhhh------- Q lcl|NC_013650. 454 VVAEAQGHYDYDLKG---------------------G-VRVPIYREIVEY--------------DEETGQEYI------- 490 (1230) Q Consensus 454 ~l~~~~~~~~~~~~~---------------------g-~~~~v~~~i~~~--------------~~~~g~~~i------- 490 (1230) .|.+..-.....+.. + ..++-.+++... ..+.|..+. T Consensus 412 wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a 491 (530) T protein:vir:38 412 WLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQV 491 (530) T ss_pred HHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHH Confidence 111110000000000 0 000001111000 001111100 Q ss_pred ---hhccceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeE Q lcl|NC_013650. 491 ---RKVPKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYV 539 (1230) Q Consensus 491 ---~~~~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~ 539 (1230) .....+.+ -++.++.-.+..|-..=++-.. ++...- T Consensus 492 ~e~~~~~~~Gl------~~~~~~~~~~~~~~~~~~~~~~-------d~~~~a 530 (530) T protein:vir:38 492 RESMERRAAGL------NPPAWAAAAFEAGVKKSNEEEQ-------DGARAA 530 (530) T ss_pred HHHHHHHHcCC------CCCCCcccccCCCCCCCCCCCC-------CCCCCC Confidence 00001111 0111111111111000000000 000000 No 5 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=82.32 E-value=0.015 Score=30.56 Aligned_cols=379 Identities=11% Similarity=0.074 Sum_probs=152.7 Q ss_pred HHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHH------Hhhcc--ccchhHHhhHHHhhhhhhhhc Q lcl|NC_013650. 110 RVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYE------DLFFG--EDLNYLEFLPDQFAREYFTVG 181 (1230) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~--~~~~~~~~~~~~~~~~~~~~~ 181 (1230) =-|..|.+-|+.... --.+..|+.+-.++- +.++. .-.--++++-+.++.-=|+|= T Consensus 1 MG~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~ 64 (411) T protein:vir:81 1 MGWWSRLTRFFRPRN----------------ETVDMTNPLLLQWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMY 64 (411) T ss_pred CchHHHHHhhccCcc----------------cccccchHHHHHHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEE Confidence 122223333343321 011112232222210 00000 001234555554543222220 Q ss_pred ceeeccccchhcccch-h----hhhc--CchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhh Q lcl|NC_013650. 182 EVTSLAHFNESLGVWS-S----EEIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQ 254 (1230) Q Consensus 182 ~~~~~~~~~~~~~~~~-~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (1230) . -++ .|... . ...| .|.= ..++..||+..+.+|++.|...-+... .+|...+....+|....+ T Consensus 65 ~------~~~-~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~l~~~lll~Gna~~~i~r--~~g~~~~l~~l~~~~v~~ 134 (411) T protein:vir:81 65 Q------KTE-RGIVKSDREELYNLLKLRPNP-YMTSSVFWSTVEMNRNHYGNAYVWCQY--SGPQLQALWILPSQYVTI 134 (411) T ss_pred E------ecC-CceeeecccHHHHHHhhccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEe--cCCceEEEEEECCceEEE Confidence 0 000 11111 0 1122 2322 258899999999999999876555443 357777777777777765 Q ss_pred hccchhhhhhhhh--HHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeecee Q lcl|NC_013650. 255 RMREFQDLQRRYP--EIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPL 332 (1230) Q Consensus 255 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (1230) ...+...+..+.. +.+.-..+|+...++..-+-|++...+-=-..|.+.+..+.+++.+.....+...+.+..-+.+. T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 214 (411) T protein:vir:81 135 VVDDRGLLGEKNAIWYRYNDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGK 214 (411) T ss_pred EEcCcccccccceEEEEEEecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 5444433332211 12233345666778888899998543322336888888888888888888887777666655555 Q ss_pred EEEeecccccCCcccccCchhhHHHHHHHhhhhhhh-hh--hhhhhhhhhhhccccccccccCCCcch----hhHHHHHH Q lcl|NC_013650. 333 VLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-DF--RLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLL 405 (1230) Q Consensus 333 ~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~ 405 (1230) -+.+++. + -+.+..+.+++.+.....- .+ ...+...++..+.+ ...+.|.++ +...+++. T Consensus 215 gil~~~~-~--------l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia 281 (411) T protein:vir:81 215 AVLEYTG-D--------LNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPL----DIKLTDSQFFELKKYTALQIA 281 (411) T ss_pred eEEEeCC-C--------CCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEc----cCCHHHHHHHHHHHHHHHHHH Confidence 5555542 1 1335566666665554421 10 01111222222111 112223332 34556677 Q ss_pred hhhhhhhhhhhhccchhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHH--hhh--hhhhhhcccchhHHHH--HH Q lcl|NC_013650. 406 QAWGIGEALISGGTGGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAE--AQG--HYDYDLKGGVRVPIYR--EI 478 (1230) Q Consensus 406 ~~l~i~~ali~~~~g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~--~~~--~~~~~~~~g~~~~v~~--~i 478 (1230) .+++|-..++...+...++..+... .+....+.-+...+......+. +.. +.. ...++...-...+... +. T Consensus 282 ~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~l--l~~~~~~~~~~~~fd~~~ll~~d~~~~~~~ 359 (411) T protein:vir:81 282 AAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKI--LSNDLISQGHYFKFNVNVILRADIKTQMDS 359 (411) T ss_pred HHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CChhhcCCCcEEEeechhhhccCHHHHHHH Confidence 7888888877555555555443322 2222222222222221111110 000 000 0011110000000000 00 Q ss_pred HHHHHhhhhhhhhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcceEEEE Q lcl|NC_013650. 479 VEYDEETGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRDELVEV 557 (1230) Q Consensus 479 ~~~~~~~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI 557 (1230) ...+...|.--.... ..+.... +.++...+....+.+|+.+... . T Consensus 360 ~~~~~~~g~~t~NE~R~~~gl~p-----~~ggD~~~~~~n~~pl~~~~~~---~-------------------------- 405 (411) T protein:vir:81 360 LSTAVQNGIMTPNEARDYLDMPA-----DDYGNNLMANGNYIPLSMLGAN---Y-------------------------- 405 (411) T ss_pred HHHHHhCCCcCHHHHHHHhCCCC-----CCCCCeeeeccCccchhhhhhh---h-------------------------- Confidence 011111110000000 0000000 1122222223334444333110 0 Q ss_pred EEcCceE Q lcl|NC_013650. 558 ITKTGRT 564 (1230) Q Consensus 558 ~t~~G~~ 564 (1230) .+.|.. T Consensus 406 -~kgGd~ 411 (411) T protein:vir:81 406 -GKGGDS 411 (411) T ss_pred -ccCCCC Confidence 011111 No 6 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=82.11 E-value=0.01 Score=31.45 Aligned_cols=269 Identities=10% Similarity=0.089 Sum_probs=113.1 Q ss_pred hhhhhhhhcceeeccccchhcccch-hhhhcC--chhhhcchhhhhccchheeeehhhhhccccccccccccccceeecC Q lcl|NC_013650. 173 FAREYFTVGEVTSLAHFNESLGVWS-SEEILN--PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETP 249 (1230) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (1230) ++.-=|.+=+. ++.. .+ -...|| |.= ..++..||+..+.+|++.|...-+.... ..|...+.+..+| T Consensus 1 ia~l~~~~~~~------~~~~--~~~l~~lL~~~PN~-~~t~~~f~~~~~~~ll~~Gna~~~i~r~-~~G~~~~l~~l~~ 70 (278) T protein:vir:78 1 MASLPLKMYED------YKVV--NTEVSDLLTVSPNN-SLSSFDFINQIETIRNEKGNAYVLIERD-IYHQPSKLFLLNP 70 (278) T ss_pred CccceeEEEec------Cccc--ccHHHHHHHhcCCC-CCCHHHHHHHHHHHHhhcCCEEEEEEEC-CCCcEEEEEEECC Confidence 22111111000 1100 00 012222 332 2578899999999999999875554432 3445556666666 Q ss_pred cchhhhcc-chhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhccccee Q lcl|NC_013650. 250 SEREQRMR-EFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRL 328 (1230) Q Consensus 250 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (1230) ....+... ++..+..++ ....|..+.++..-+-|+++....=...|.+.+..+..++..-....+..-..++.. T Consensus 71 ~~v~v~~~~~~~~~~y~~-----~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~ 145 (278) T protein:vir:78 71 DVVEMLIENQSRELYYSI-----HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKP 145 (278) T ss_pred ceeEEEEcCCCceEEEEE-----EcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 55544332 333232222 233444566777778899865332233588888777766665444433321112111 Q ss_pred eceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHH Q lcl|NC_013650. 329 YSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKL 404 (1230) Q Consensus 329 ~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~ 404 (1230) .++++. .+ ++ -+.+..+.+++.++..+...-...+-..++..+.+. ..+.|.++ +...+++ T Consensus 146 -~~~i~~-~~-~~--------l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~----~~~~d~~~~e~~~~~~~~I 210 (278) T protein:vir:78 146 -DSFMLK-YG-SN--------VGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLP----KKYVSEDIVASENLTRERV 210 (278) T ss_pred -CcEEEE-eC-CC--------CCHHHHHHHHHHHHHHhccCCCceecCCCceEEEcc----CChhHHHHHHHHHHHHHHH Confidence 222222 22 11 134667777777766553221122222222221111 12223332 2355666 Q ss_pred HhhhhhhhhhhhhccchhhHHHHHHHHHH-HHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHHHHHHHHHHH Q lcl|NC_013650. 405 LQAWGIGEALISGGTGGAYASSALNREFV-TQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDE 483 (1230) Q Consensus 405 ~~~l~i~~ali~~~~g~~~~~~~~~~~~~-~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v~~~i~~~~~ 483 (1230) ..++||...++....+..++..+.....+ ...+.-+...+.. .+...+... ... T Consensus 211 a~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~--------------~ln~~L~~~-----------~e~ 265 (278) T protein:vir:78 211 ANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEE--------------EFNRKLLTK-----------TDR 265 (278) T ss_pred HHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH--------------HHHhhcCCh-----------hHh Confidence 77888887777555555555443322211 1111111111110 000000000 000 Q ss_pred hhhhhhhhhccce Q lcl|NC_013650. 484 ETGQEYIRKVPKL 496 (1230) Q Consensus 484 ~~g~~~i~~~~~l 496 (1230) ..|.........+ T Consensus 266 ~~g~~~~f~~~~l 278 (278) T protein:vir:78 266 EKIGILNLTLNLI 278 (278) T ss_pred cCCceEEEecccC Confidence 0011101111111 No 7 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=80.85 E-value=0.031 Score=28.87 Aligned_cols=393 Identities=13% Similarity=0.118 Sum_probs=144.7 Q ss_pred HHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccc-----------------hhHHhhHHH Q lcl|NC_013650. 110 RVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL-----------------NYLEFLPDQ 172 (1230) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~ 172 (1230) =-|..|.+.|+.-+ .+-..-.......++ .|++.+ |-.- --++++-+. T Consensus 1 M~~~~r~~~~~~~~----------~r~~~~~~~~~~~~~---~~~~~~--g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ 65 (432) T protein:vir:10 1 MKIVDSVKKFFNFE----------KRQTSQVIELNKDDE---KLLEWL--GISPSTISVKGKNALKVATVFACIKILSES 65 (432) T ss_pred CChHHHHHHhcCcc----------ccCcccccccCCchH---HHHHHh--CCCcCccccchhhhhccHHHHHHHHHHHHh Confidence 11112222233211 000000000001111 122211 1111 134444444 Q ss_pred hhhhhhhhcceeeccccchhcccchh-----hhhc--Cchhhhcchhhhhccchheeeehhhhhccccccccccccccce Q lcl|NC_013650. 173 FAREYFTVGEVTSLAHFNESLGVWSS-----EEIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTV 245 (1230) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (1230) ++.-=|+| -.-+ ..|.++. ...| .|.= ..+++.||+..+.+|.+.|..--+.... ..|...+.+ T Consensus 66 ia~lp~~~------~~~~-~~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r~-~~G~~~~L~ 136 (432) T protein:vir:10 66 VSKLPLKI------YQED-EYGIQRGTKHYLNNLLRLRPNP-YMSSMNFFGSLEAQKNLYGNSYANIEFD-RKGKVQALW 136 (432) T ss_pred hccCceEE------EEec-CCceeeccccHHHHHHHhhccC-CCCHHHHHHHHHHHHhhcCCeEEEEEEC-CCCcEEEEE Confidence 44322221 0001 1121110 1112 2433 2578999999999999988776665433 344566777 Q ss_pred eecCcchhhhccchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhccc Q lcl|NC_013650. 246 EETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA 325 (1230) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (1230) ..+|....+..++..++..+.........++....++..-+-|+++..+-=-..|.+++..+.+++..-....+...+.+ T Consensus 137 ~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 216 (432) T protein:vir:10 137 PIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFY 216 (432) T ss_pred EEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777665544433333221110111123444566888888999865322223588888777777777777776655655 Q ss_pred ceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhh-h--hhhhhhhhhhhhccccccccccCCCcch----h Q lcl|NC_013650. 326 DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADY----D 398 (1230) Q Consensus 326 ~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~ 398 (1230) ..-+.+.-+.+++. + -+.+..+.+++.++....- . -...+...++..+-+ ...+.|.++ . T Consensus 217 ~ng~~p~gil~~~~-~--------l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l----~~~~~d~q~~e~~~ 283 (432) T protein:vir:10 217 KQGLQVKGLVQYVG-D--------LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPI----SLNMSDAQFLENTE 283 (432) T ss_pred hccCCccEEEEcCC-C--------CCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEc----cCChhHHHHHHHHH Confidence 44444444444432 1 1234556666665544321 1 011122222222111 112223332 2 Q ss_pred hHHHHHHhhhhhhhhhhhhccchhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHHhh---h-hhhhhhcccchhH Q lcl|NC_013650. 399 RIERKLLQAWGIGEALISGGTGGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAEAQ---G-HYDYDLKGGVRVP 473 (1230) Q Consensus 399 ~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~~~---~-~~~~~~~~g~~~~ 473 (1230) ...+++..+++|-..++.......++..+... .+....+.-+...+......+ .+.+.. + .+.++...=...+ T Consensus 284 ~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~k--Ll~~~~~~~g~~~~fd~~~l~~~d 361 (432) T protein:vir:10 284 LTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYK--LFLDSELDKGFYSKFNVDAILRAD 361 (432) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcChhhcCCCcEEEeechhhhcCC Confidence 34456677888888777544444454433222 122222222222221111110 000000 0 0011100000000 Q ss_pred HHH--HHHHHHHhhhhhhhhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecC Q lcl|NC_013650. 474 IYR--EIVEYDEETGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEH 550 (1230) Q Consensus 474 v~~--~i~~~~~~~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~ 550 (1230) ... +....+...|.--.... ..+.... +.++..++....+.+++++... ....++. -.......-.+ T Consensus 362 ~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p-----i~ggD~~~~~~n~~~~~~~~~~---~~k~~~~--~~~~~~~~~~~ 431 (432) T protein:vir:10 362 IKTRYEAYRTGIQGGFLKPNEARSKEDLPP-----EAGGDRLLVNGNMLPIDMAGQA---YLKGGDT--NGEVSKEGNEG 431 (432) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-----CCCCCeEeecccccchhhcccc---ccCCCCC--CCCCCCCCCCC Confidence 000 01111111110000000 0001111 2223333333444455443110 0000000 00000000000 Q ss_pred c Q lcl|NC_013650. 551 R 551 (1230) Q Consensus 551 ~ 551 (1230) . T Consensus 432 ~ 432 (432) T protein:vir:10 432 N 432 (432) T ss_pred C Confidence 0 No 8 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=80.85 E-value=0.031 Score=28.87 Aligned_cols=393 Identities=13% Similarity=0.118 Sum_probs=144.7 Q ss_pred HHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccc-----------------hhHHhhHHH Q lcl|NC_013650. 110 RVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL-----------------NYLEFLPDQ 172 (1230) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~ 172 (1230) =-|..|.+.|+.-+ .+-..-.......++ .|++.+ |-.- --++++-+. T Consensus 1 M~~~~r~~~~~~~~----------~r~~~~~~~~~~~~~---~~~~~~--g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ 65 (432) T protein:vir:10 1 MKIVDSVKKFFNFE----------KRQTSQVIELNKDDE---KLLEWL--GISPSTISVKGKNALKVATVFACIKILSES 65 (432) T ss_pred CChHHHHHHhcCcc----------ccCcccccccCCchH---HHHHHh--CCCcCccccchhhhhccHHHHHHHHHHHHh Confidence 11112222233211 000000000001111 122211 1111 134444444 Q ss_pred hhhhhhhhcceeeccccchhcccchh-----hhhc--Cchhhhcchhhhhccchheeeehhhhhccccccccccccccce Q lcl|NC_013650. 173 FAREYFTVGEVTSLAHFNESLGVWSS-----EEIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTV 245 (1230) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (1230) ++.-=|+| -.-+ ..|.++. ...| .|.= ..+++.||+..+.+|.+.|..--+.... ..|...+.+ T Consensus 66 ia~lp~~~------~~~~-~~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r~-~~G~~~~L~ 136 (432) T protein:vir:10 66 VSKLPLKI------YQED-EYGIQRGTKHYLNNLLRLRPNP-YMSSMNFFGSLEAQKNLYGNSYANIEFD-RKGKVQALW 136 (432) T ss_pred hccCceEE------EEec-CCceeeccccHHHHHHHhhccC-CCCHHHHHHHHHHHHhhcCCeEEEEEEC-CCCcEEEEE Confidence 44322221 0001 1121110 1112 2433 2578999999999999988776665433 344566777 Q ss_pred eecCcchhhhccchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhccc Q lcl|NC_013650. 246 EETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA 325 (1230) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (1230) ..+|....+..++..++..+.........++....++..-+-|+++..+-=-..|.+++..+.+++..-....+...+.+ T Consensus 137 ~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 216 (432) T protein:vir:10 137 PIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFY 216 (432) T ss_pred EEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777665544433333221110111123444566888888999865322223588888777777777777776655655 Q ss_pred ceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhh-h--hhhhhhhhhhhhccccccccccCCCcch----h Q lcl|NC_013650. 326 DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADY----D 398 (1230) Q Consensus 326 ~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~ 398 (1230) ..-+.+.-+.+++. + -+.+..+.+++.++....- . -...+...++..+-+ ...+.|.++ . T Consensus 217 ~ng~~p~gil~~~~-~--------l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l----~~~~~d~q~~e~~~ 283 (432) T protein:vir:10 217 KQGLQVKGLVQYVG-D--------LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPI----SLNMSDAQFLENTE 283 (432) T ss_pred hccCCccEEEEcCC-C--------CCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEc----cCChhHHHHHHHHH Confidence 44444444444432 1 1234556666665544321 1 011122222222111 112223332 2 Q ss_pred hHHHHHHhhhhhhhhhhhhccchhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHHhh---h-hhhhhhcccchhH Q lcl|NC_013650. 399 RIERKLLQAWGIGEALISGGTGGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAEAQ---G-HYDYDLKGGVRVP 473 (1230) Q Consensus 399 ~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~~~---~-~~~~~~~~g~~~~ 473 (1230) ...+++..+++|-..++.......++..+... .+....+.-+...+......+ .+.+.. + .+.++...=...+ T Consensus 284 ~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~k--Ll~~~~~~~g~~~~fd~~~l~~~d 361 (432) T protein:vir:10 284 LTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYK--LFLDSELDKGFYSKFNVDAILRAD 361 (432) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcChhhcCCCcEEEeechhhhcCC Confidence 34456677888888777544444454433222 122222222222221111110 000000 0 0011100000000 Q ss_pred HHH--HHHHHHHhhhhhhhhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecC Q lcl|NC_013650. 474 IYR--EIVEYDEETGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEH 550 (1230) Q Consensus 474 v~~--~i~~~~~~~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~ 550 (1230) ... +....+...|.--.... ..+.... +.++..++....+.+++++... ....++. -.......-.+ T Consensus 362 ~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p-----i~ggD~~~~~~n~~~~~~~~~~---~~k~~~~--~~~~~~~~~~~ 431 (432) T protein:vir:10 362 IKTRYEAYRTGIQGGFLKPNEARSKEDLPP-----EAGGDRLLVNGNMLPIDMAGQA---YLKGGDT--NGEVSKEGNEG 431 (432) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-----CCCCCeEeecccccchhhcccc---ccCCCCC--CCCCCCCCCCC Confidence 000 01111111110000000 0001111 2223333333444455443110 0000000 00000000000 Q ss_pred c Q lcl|NC_013650. 551 R 551 (1230) Q Consensus 551 ~ 551 (1230) . T Consensus 432 ~ 432 (432) T protein:vir:10 432 N 432 (432) T ss_pred C Confidence 0 No 9 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=80.85 E-value=0.031 Score=28.87 Aligned_cols=393 Identities=13% Similarity=0.118 Sum_probs=144.7 Q ss_pred HHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccc-----------------hhHHhhHHH Q lcl|NC_013650. 110 RVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL-----------------NYLEFLPDQ 172 (1230) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~ 172 (1230) =-|..|.+.|+.-+ .+-..-.......++ .|++.+ |-.- --++++-+. T Consensus 1 M~~~~r~~~~~~~~----------~r~~~~~~~~~~~~~---~~~~~~--g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ 65 (432) T protein:vir:10 1 MKIVDSVKKFFNFE----------KRQTSQVIELNKDDE---KLLEWL--GISPSTISVKGKNALKVATVFACIKILSES 65 (432) T ss_pred CChHHHHHHhcCcc----------ccCcccccccCCchH---HHHHHh--CCCcCccccchhhhhccHHHHHHHHHHHHh Confidence 11112222233211 000000000001111 122211 1111 134444444 Q ss_pred hhhhhhhhcceeeccccchhcccchh-----hhhc--Cchhhhcchhhhhccchheeeehhhhhccccccccccccccce Q lcl|NC_013650. 173 FAREYFTVGEVTSLAHFNESLGVWSS-----EEIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTV 245 (1230) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (1230) ++.-=|+| -.-+ ..|.++. ...| .|.= ..+++.||+..+.+|.+.|..--+.... ..|...+.+ T Consensus 66 ia~lp~~~------~~~~-~~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r~-~~G~~~~L~ 136 (432) T protein:vir:10 66 VSKLPLKI------YQED-EYGIQRGTKHYLNNLLRLRPNP-YMSSMNFFGSLEAQKNLYGNSYANIEFD-RKGKVQALW 136 (432) T ss_pred hccCceEE------EEec-CCceeeccccHHHHHHHhhccC-CCCHHHHHHHHHHHHhhcCCeEEEEEEC-CCCcEEEEE Confidence 44322221 0001 1121110 1112 2433 2578999999999999988776665433 344566777 Q ss_pred eecCcchhhhccchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhccc Q lcl|NC_013650. 246 EETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA 325 (1230) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (1230) ..+|....+..++..++..+.........++....++..-+-|+++..+-=-..|.+++..+.+++..-....+...+.+ T Consensus 137 ~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 216 (432) T protein:vir:10 137 PIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFY 216 (432) T ss_pred EEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777665544433333221110111123444566888888999865322223588888777777777777776655655 Q ss_pred ceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhh-h--hhhhhhhhhhhhccccccccccCCCcch----h Q lcl|NC_013650. 326 DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADY----D 398 (1230) Q Consensus 326 ~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~ 398 (1230) ..-+.+.-+.+++. + -+.+..+.+++.++....- . -...+...++..+-+ ...+.|.++ . T Consensus 217 ~ng~~p~gil~~~~-~--------l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l----~~~~~d~q~~e~~~ 283 (432) T protein:vir:10 217 KQGLQVKGLVQYVG-D--------LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPI----SLNMSDAQFLENTE 283 (432) T ss_pred hccCCccEEEEcCC-C--------CCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEc----cCChhHHHHHHHHH Confidence 44444444444432 1 1234556666665544321 1 011122222222111 112223332 2 Q ss_pred hHHHHHHhhhhhhhhhhhhccchhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHHhh---h-hhhhhhcccchhH Q lcl|NC_013650. 399 RIERKLLQAWGIGEALISGGTGGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAEAQ---G-HYDYDLKGGVRVP 473 (1230) Q Consensus 399 ~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~~~---~-~~~~~~~~g~~~~ 473 (1230) ...+++..+++|-..++.......++..+... .+....+.-+...+......+ .+.+.. + .+.++...=...+ T Consensus 284 ~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~k--Ll~~~~~~~g~~~~fd~~~l~~~d 361 (432) T protein:vir:10 284 LTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYK--LFLDSELDKGFYSKFNVDAILRAD 361 (432) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcChhhcCCCcEEEeechhhhcCC Confidence 34456677888888777544444454433222 122222222222221111110 000000 0 0011100000000 Q ss_pred HHH--HHHHHHHhhhhhhhhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecC Q lcl|NC_013650. 474 IYR--EIVEYDEETGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEH 550 (1230) Q Consensus 474 v~~--~i~~~~~~~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~ 550 (1230) ... +....+...|.--.... ..+.... +.++..++....+.+++++... ....++. -.......-.+ T Consensus 362 ~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p-----i~ggD~~~~~~n~~~~~~~~~~---~~k~~~~--~~~~~~~~~~~ 431 (432) T protein:vir:10 362 IKTRYEAYRTGIQGGFLKPNEARSKEDLPP-----EAGGDRLLVNGNMLPIDMAGQA---YLKGGDT--NGEVSKEGNEG 431 (432) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-----CCCCCeEeecccccchhhcccc---ccCCCCC--CCCCCCCCCCC Confidence 000 01111111110000000 0001111 2223333333444455443110 0000000 00000000000 Q ss_pred c Q lcl|NC_013650. 551 R 551 (1230) Q Consensus 551 ~ 551 (1230) . T Consensus 432 ~ 432 (432) T protein:vir:10 432 N 432 (432) T ss_pred C Confidence 0 No 10 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=80.17 E-value=0.097 Score=26.13 Aligned_cols=481 Identities=15% Similarity=0.120 Sum_probs=186.4 Q ss_pred HHHHHHhcCCCCCCchhHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhh Q lcl|NC_013650. 16 VNRLRKAGVNMPNSPTMARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLA 95 (1230) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (1230) +|-|-|+.-- ++ |..-+.|.+.| ..+..|.-.+. +. +..+.+ .+.. T Consensus 1 Mn~iDr~i~~------------------~s--P~~a~~R~~ar------~~~~~y~aa~~-~r-~~~~~~---~~~s--- 46 (548) T protein:vir:95 1 MNLIDRLLEP------------------LA--PELVARRLAAR------EAIQAYEAARP-GR-THKAKR---QPLG--- 46 (548) T ss_pred CchHHhHhhh------------------cc--hHHHHHHHHhH------HHhccccccCc-cc-cccccC---CCCC--- Confidence 2222222111 11 22222222222 22223322211 11 111111 1111 Q ss_pred hhcCCCCcchhhHHHHHHHHHHHHhcccchhHHHHHhhhcCccc--ccccccc----c--------hhHHHHHHHhh--- Q lcl|NC_013650. 96 DKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVV--GMEFDSK----D--------PLIKTFYEDLF--- 158 (1230) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~----~--------~~~~~~~~~~~--- 158 (1230) .|.+....++.+|.=||..|+.++++.-.||.+...=|= |..+..+ | ..|+..|++.. T Consensus 47 -----~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~ 121 (548) T protein:vir:95 47 -----ADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSP 121 (548) T ss_pred -----hHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCc Confidence 123344567889999999999999999999988766441 2333221 2 34555666554 Q ss_pred -ccccchhHHhhHHHhhhhhhhhcceeeccccchh-c---c-cch-hhhhcCchhhhcchhhhhccchheeeehhhhhcc Q lcl|NC_013650. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNES-L---G-VWS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHL 231 (1230) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (1230) +...+|+-. |....-|.++.-||+|-.-..... . | -|- +-+.|+||+|...... + T Consensus 122 D~~g~~~f~~-lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~-------------~---- 183 (548) T protein:vir:95 122 ETSGELTRPQ-VERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNN-------------L---- 183 (548) T ss_pred cccccCCHHH-HHHHHHHHHHhCCceEEEeeecccccccCCcccceEEEEechhhcCCCCCC-------------C---- Confidence 222344433 445588999999999976665432 1 1 111 5689999997432111 0 Q ss_pred ccccccccccccceeecCcchhhhccchhhhhhhhhH-HHhhhccCCCccccchhhhhhccCCccccccCCccccchHHH Q lcl|NC_013650. 232 RQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRT 310 (1230) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (1230) ++.+-.=+|.+-..+-+ -|-+.. ..|- .+...+.....++|-..|.|+...-..-..||-|.+.+.... T Consensus 184 ------~~~i~~GIE~D~~Grp~---aY~i~~-~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~ 253 (548) T protein:vir:95 184 ------SKGIVQGIERDTWRRKR---AYHLLK-DHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIR 253 (548) T ss_pred ------CCceeeeeEECCCCceE---EEEEee-cCCCcccccccccceeeechhHheecccccCCccccCcchHHHHHHH Confidence 11111112222111111 111111 1122 111223445567888899999988777789999999888888 Q ss_pred HHHHHHHHHHhhcccceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_013650. 311 LMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESV 390 (1230) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (1230) |...+.+.+++..-+-.-+.-..+.+-+..+...++..-.+.+....+.. +.+ .....-|..++...-..-. T Consensus 254 l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~p---G~i-----v~~L~pGe~i~~~~p~~p~ 325 (548) T protein:vir:95 254 LADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAP---GMV-----FDDLEPGEDVGMIESNRPN 325 (548) T ss_pred HHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCcccccccccccC---Ccc-----ccccCCCceeeecCCCCCC Confidence 88888877665432222222222222221111111111001010000000 000 0001122222221111111 Q ss_pred cCCCcchhhHHHHHHhhhhhhhhhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccch-h-hhHHHhhhhhhhhhcc Q lcl|NC_013650. 391 PNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRR-C-EVVAEAQGHYDYDLKG 468 (1230) Q Consensus 391 ~~~d~~~~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~-~-~~l~~~~~~~~~~~~~ 468 (1230) .+.......+.+.+..++|+...++.++-...|++....+..+.+.+...+..+.....+. + .+|.+..-.-...+.. T Consensus 326 ~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~ 405 (548) T protein:vir:95 326 PFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPA 405 (548) T ss_pred CCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC Confidence 2222234667888888999998888877666888877766655555554444333211111 1 1122111000000000 Q ss_pred cc----------------hhHHHHHHHHH--------------HHhhhhhhhhh----ccceeeecccceecCCCeee-e Q lcl|NC_013650. 469 GV----------------RVPIYREIVEY--------------DEETGQEYIRK----VPKLLIPEVKFSCVVPGTQV-L 513 (1230) Q Consensus 469 g~----------------~~~v~~~i~~~--------------~~~~g~~~i~~----~~~l~~~~~~~~Clt~DT~V-l 513 (1230) +. .++-.++.... ..+.|..+... ..........+.-++.+..- . T Consensus 406 ~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~ 485 (548) T protein:vir:95 406 DVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQL 485 (548) T ss_pred CCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccc Confidence 00 00001111000 00111110000 00000000011111222111 1 Q ss_pred cccccEehhee----ccCcEEEEecCCCeEEEeeEEEeecCcceEEEEEEcCceEEEEc-CCcceEeecCcchhhhhhcc Q lcl|NC_013650. 514 TPEGQRNVEDI----RPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVITKTGRTIRCT-ADHPFWTDQGWVKAQDLTDE 588 (1230) Q Consensus 514 T~dG~k~IedL----~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~G~~L~~T-p~H~f~v~~g~~~~~~~~~~ 588 (1230) +..|-.+.++. .-+++.++-++...+|- ..|--|-+. |+. +... +.+ T Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~---~~~~-------~~~ 537 (548) T protein:vir:95 486 VKSGMDPVEAVQKVYLGVGKMLTADEARELVN------------------RYGAGLPVPGPDF---PNES-------NNG 537 (548) T ss_pred cccccCCCCchhhhccccccccccchhHHhhc------------------cCCCCCcCCCCCC---Cccc-------ccC Confidence 12222222211 11222222222111111 111111100 000 0000 000 Q ss_pred hhccccccccc Q lcl|NC_013650. 589 IAIRTSAGQLA 599 (1230) Q Consensus 589 ~~~~~~~~~~~ 599 (1230) -.......... T Consensus 538 ~~~~~~~~~~~ 548 (548) T protein:vir:95 538 GADGQPSNPDP 548 (548) T ss_pred CCCCCCCCCCC Confidence 00000000000 No 11 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=74.73 E-value=0.049 Score=27.78 Aligned_cols=375 Identities=12% Similarity=0.071 Sum_probs=136.2 Q ss_pred HHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccc------------hhHHhhHHHhhhhhhhhcceee Q lcl|NC_013650. 118 LFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL------------NYLEFLPDQFAREYFTVGEVTS 185 (1230) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~ 185 (1230) .||. ..|---.-.....|+.+-.++-.-.-|... --++++-+.++.= + T Consensus 1 m~f~------------~~~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l--------p 60 (409) T protein:vir:10 1 MLFR------------KGFKNQSQEISIDDKKILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNISKL--------P 60 (409) T ss_pred Cccc------------ccccCcCCCCCCChHHHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhhhC--------c Confidence 1110 000000000001122122222110001111 1123333333321 1 Q ss_pred ccccchhcccchhh-----hhc--CchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhccc Q lcl|NC_013650. 186 LAHFNESLGVWSSE-----EIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMRE 258 (1230) Q Consensus 186 ~~~~~~~~~~~~~~-----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (1230) |-.+....|.+... .+| .|.= ..++..||+..+.+|++.|..--+.. +...|...+..-.+|....+..++ T Consensus 61 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~G~~~~L~~i~~~~V~v~~~~ 138 (409) T protein:vir:10 61 IKIYQKKDGIKRVPDHYLEYLLKLRPNP-YMSSSDFWKCIEVQRNIYGNAYVALD-FKKNGEIKGLYPLKSDGMKIFVDD 138 (409) T ss_pred eEEEEecCCeeeccCchHHHHHhhccCC-CCCHHHHHHHHHHHHhhcCCeEEEEE-EcCCCcEEEEEEEcCCceEEEEcC Confidence 22222233333221 112 2443 25889999999999999887655532 344555666766777665444333 Q ss_pred hhhhhhhhhHHH--hhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEe Q lcl|NC_013650. 259 FQDLQRRYPEII--QAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLAT 336 (1230) Q Consensus 259 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (1230) .-++... .+++ -....+....++..-+-|+++-.. =...|.+.+..+.+.+..-....+...+.+...+.+.-+.+ T Consensus 139 ~~~~~~~-~~~~y~~~~~~g~~~~~~~~evih~r~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~ 216 (409) T protein:vir:10 139 TGLLNSE-NNVWYLYTDDLGQRHKFMSDEILHFKGLTA-DGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQ 216 (409) T ss_pred Ccccccc-ceEEEEEEeCCceeEEeccccEEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEE Confidence 2222211 1111 112233345577777778875421 12358888877888887777777666665555554444444 Q ss_pred ecccccCCcccccCchhhHHHHHHHhhhhhhhh---hhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhh Q lcl|NC_013650. 337 LGIEDMGDGEPWIPDQGELDEVRDDMQSLLAAD---FRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWG 409 (1230) Q Consensus 337 l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~ 409 (1230) ++. + -+.+..+.+++.++...... -...+...++..+-+ .+.+.|.++ +...+++..+++ T Consensus 217 ~~~-~--------l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia~~fg 283 (409) T protein:vir:10 217 YAG-D--------LNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPI----SQKLVDAQFLENSQLTIRQIASVFG 283 (409) T ss_pred cCC-C--------CCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEc----cCChhhHHHHHHHHHHHHHHHHHhC Confidence 432 1 13355666666655444210 011111222222111 122233332 335556677888 Q ss_pred hhhhhhhhccchhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHHhhh--hhhhhhcccchhHHHH--HHHHHHHh Q lcl|NC_013650. 410 IGEALISGGTGGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAEAQG--HYDYDLKGGVRVPIYR--EIVEYDEE 484 (1230) Q Consensus 410 i~~ali~~~~g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~--~~~~~~~~g~~~~v~~--~i~~~~~~ 484 (1230) |-..++.......++..+... .+....+.-+...+......+.-.-.+... ...++...=...+... +....+.. T Consensus 284 VPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~ 363 (409) T protein:vir:10 284 VKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQ 363 (409) T ss_pred CCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHh Confidence 877776544444444433222 222222222222221111110000000000 0011100000000000 00011111 Q ss_pred hhhhhhhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcceEEEEEEcCce Q lcl|NC_013650. 485 TGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVITKTGR 563 (1230) Q Consensus 485 ~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~G~ 563 (1230) .|.--.... ..+.... +.++..++....+.+++++... . .+.|. T Consensus 364 ~G~~T~NE~R~~lgl~p-----~~ggD~~~~~~n~~~~~~~~~~-------------------~-----------~kgGe 408 (409) T protein:vir:10 364 NGFKTPNEIRELEEDEP-----LEGGDVLLINGNMIPVKMAGEQ-------------------Y-----------SKGGE 408 (409) T ss_pred CCCcCHHHHHHHhCCCC-----CCCcCeeeeccCccchhhcccc-------------------c-----------cccCC Confidence 110000000 0000000 1111222222233333332100 0 01111 Q ss_pred E Q lcl|NC_013650. 564 T 564 (1230) Q Consensus 564 ~ 564 (1230) + T Consensus 409 ~ 409 (409) T protein:vir:10 409 K 409 (409) T ss_pred C Confidence 1 No 12 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=67.68 E-value=0.25 Score=23.90 Aligned_cols=401 Identities=12% Similarity=0.111 Sum_probs=146.9 Q ss_pred HhhhhhcCCCCcchhhHHHHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccchhHHhhHH Q lcl|NC_013650. 92 GTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPD 171 (1230) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 171 (1230) --++++++.|..+...+- ...-+.-+.+++....-+ .++. ++.+ +.+..-.+ .-.++++-+ T Consensus 1 M~~~~~~f~~~~r~~~~~----------~~~~~~~~~~~~~~g~~~-~~~~-v~~~----~al~~~~v---~~~i~~ia~ 61 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQV----------IELNKDDEKLLEWLGISP-STIS-VKGK----NALKVATV---FACIKILSE 61 (429) T ss_pred CchhhhhhcccccCcccc----------cccCCChHHHHHHhcCCC-Ccce-echh----hhhccHHH---HHHHHHHHH Confidence 112222222222211110 000111111222211000 0100 0000 11110000 123444544 Q ss_pred Hhhhhhhhhcceeeccccch-hcccchh-----hhhc--Cchhhhcchhhhhccchheeeehhhhhcccccccccccccc Q lcl|NC_013650. 172 QFAREYFTVGEVTSLAHFNE-SLGVWSS-----EEIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMS 243 (1230) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~-----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (1230) .++.==|+ .+-. ..|.... ..+| .|.= ..++..||+..+.+|++.|...-+... -..|...+ T Consensus 62 ~ia~l~~~--------~~~~~~~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r-~~~G~~~~ 131 (429) T protein:vir:10 62 SVSKLPLK--------IYQEDEYGIQRGTKHYLNNLLRLRPNP-YMSSMNFFGSLEAQKNLYGNSYANIEF-DRKGKVQA 131 (429) T ss_pred hhccCceE--------EEEecCCceeeccccHHHHHHHhhccC-CCCHHHHHHHHHHHHhhcCCeEEEEEE-CCCCcEEE Confidence 44432122 1110 1111110 0112 1322 247889999999999998876655543 34556778 Q ss_pred ceeecCcchhhhccchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhc Q lcl|NC_013650. 244 TVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDA 323 (1230) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (1230) ....+|....+...+..++...+........+|....++..-+-|++...+-=-..|.+++..+.+++..-....++..+ T Consensus 132 L~~i~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 211 (429) T protein:vir:10 132 LWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINN 211 (429) T ss_pred EEEEcCceeEEEEcCcccccccceEEEEEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 88888877766555544443221111122234545667877788998653222234877777777777776666666555 Q ss_pred ccceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhh-hh--hhhhhhhhhhhccccccccccCCCcch--- Q lcl|NC_013650. 324 VADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-DF--RLMVHNFGLKVENVFGRESVPNLDADY--- 397 (1230) Q Consensus 324 ~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~--- 397 (1230) .+..-+.+.-+.+++. .-+.+..+.+++.++....- .+ ...+-..++..+-+ ...+.|.++ T Consensus 212 ~~~ng~~~~~il~~~~---------~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l----~~~~~d~q~~e~ 278 (429) T protein:vir:10 212 FYKQGLQVKGLVQYVG---------DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPI----SLNMSDAQFLEN 278 (429) T ss_pred HHhccCCccEEEEcCC---------CCCHHHHHHHHHHHHHHhccccccCceeecCCCceEEEc----cCChhHHHHHHH Confidence 5544444444444431 11334556666665544421 10 11111222222221 122333332 Q ss_pred -hhHHHHHHhhhhhhhhhhhhccchhhHHHHHHHH-HHHHHHHHHHHHHhhhccchhhhHHHhhh----hhhhhhcccch Q lcl|NC_013650. 398 -DRIERKLLQAWGIGEALISGGTGGAYASSALNRE-FVTQIMTGFQNALKRHIRRRCEVVAEAQG----HYDYDLKGGVR 471 (1230) Q Consensus 398 -~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~~-~~~k~~~~~~~~l~~~~~~~~~~l~~~~~----~~~~~~~~g~~ 471 (1230) +...+++..+++|-..++.......++..+.... ++...+.-+...+......+ .+.+... ...++...-.. T Consensus 279 ~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~k--l~~~~~~~~g~~~~fd~~~ll~ 356 (429) T protein:vir:10 279 TELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYK--LFLDSELDKGFYSKFNVDAILR 356 (429) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcChhhcCCCcEEEeechhhhc Confidence 2345567778888888776555555544332222 22222222222111111100 0100000 00111000000 Q ss_pred hHHHH--HHHHHHHhhhhhhhhhc-cceeeecccceecCCCeeeecccccEehhee-----ccCcEEEEecCCCeEEEee Q lcl|NC_013650. 472 VPIYR--EIVEYDEETGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDI-----RPGDEVIAWDGTGYVVDTA 543 (1230) Q Consensus 472 ~~v~~--~i~~~~~~~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL-----~vGD~V~t~~g~~~~v~~v 543 (1230) .+... +....+...|.--+... ..+.... +.++..++..-.+.+|+++ +.|+. +++ T Consensus 357 ~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p-----~~ggD~~~~~~n~~~~d~~~~~~~k~g~~----~~~------- 420 (429) T protein:vir:10 357 ADIKTRYEAYRTGIQGGFLKPNEARSKEDLPP-----EAGGDRLLVNGNMLPIDMAGQAYLKGGDT----NGE------- 420 (429) T ss_pred CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-----CCCcCeeeecccccchhhccccccCCCCC----CCC------- Confidence 00100 01111111110000000 0001111 1222233333344444332 11110 000 Q ss_pred EEEeecCcceEEEEEEcCceEEEEcCCc Q lcl|NC_013650. 544 LHTGIEHRDELVEVITKTGRTIRCTADH 571 (1230) Q Consensus 544 ~~~~~~~~~~l~rI~t~~G~~L~~Tp~H 571 (1230) .+.+|.+ +. T Consensus 421 --------------~~~~~~e-----~~ 429 (429) T protein:vir:10 421 --------------VSKEGNE-----GN 429 (429) T ss_pred --------------CCCCCCC-----CC Confidence 0000000 00 No 13 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=65.78 E-value=0.28 Score=23.63 Aligned_cols=423 Identities=11% Similarity=0.024 Sum_probs=145.8 Q ss_pred hhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhhhhcCCCCcchhhHHHHHHHHHHHHhcccchhHHHHHhh----h Q lcl|NC_013650. 59 AAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIY----S 134 (1230) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 134 (1230) |+.--..++...+..++ +.++. +..-+|... -+-|++-+.-.+ .=|...+-|=.+|++- + T Consensus 1 ~~~~~~~~~~~~~~~~~--~~~~~----~~~~~g~~~-~~~~~~~~~~~~---------~~a~~~~~v~~~v~~ia~~iA 64 (460) T protein:vir:10 1 MANRIIRALRELTGLDN--KFNDA----FIKYIGQTF-TKYDNNGKTYLE---------QGYNINPDVYSCISQMAAKTV 64 (460) T ss_pred CchhHHHHHhhhhccCC--CchHH----HHHhhcccc-CCCccchhhhhH---------HHHhcchHHHHHHHHHHHhhh Confidence 33222222222222221 11111 000011000 011222111110 0112223333334433 3 Q ss_pred cCccccccccccchhHHHHHHHhhccccch---hHHhhHHHhhhhhhhhcceeeccccchhcccchhhhhcCchhhhcch Q lcl|NC_013650. 135 KFPVVGMEFDSKDPLIKTFYEDLFFGEDLN---YLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSR 211 (1230) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (1230) ..|+.=++- ..|....+....... ++. ....+.+..- +.++=+..-+..+ -+.-||. .++ T Consensus 65 ~lp~~v~~~-~~~g~~~~~~~~~~~--~~~~~~~~~~~~~~~~--~~~~~~~~~~~~L---------~~~PN~~---~t~ 127 (460) T protein:vir:10 65 AVPYTIKVV-KDTKAYQQLNNLNIS--TKGLYSFTQSLQKNRL--DTKAFSETEKAFP---------LESPNPT---QTW 127 (460) T ss_pred hCceEEEec-cCCccchhhhhhhhh--hhhhHHHHHHhhcchh--hhcccchhHHHHH---------HhCCCCC---CCH Confidence 445443321 122222111111111 000 0111111100 0000000000111 1123554 489 Q ss_pred hhhhccchheeeehhhhhcccccc---ccccccccceeecCcchhhhccchhhhhh-hhhH-HHhhhccCCCccccchhh Q lcl|NC_013650. 212 SMFVQRERVQLMVKDLVDHLRQGP---TTAGGNMSTVEETPSEREQRMREFQDLQR-RYPE-IIQAAMQNDGLDISEALI 286 (1230) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~ 286 (1230) ..||+..+.+|++.|..-.+.... ...|.+.+....+|....+...+...... .+.. .+.-..++....++..-+ T Consensus 128 ~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ev 207 (460) T protein:vir:10 128 ADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEV 207 (460) T ss_pred HHHHHHHHHHHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccce Confidence 999999999999988864443322 34466666666666666554433222221 1111 112223555677888888 Q ss_pred hhhccCCcccc-----ccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccCCcccccCchhhHHHHHHH Q lcl|NC_013650. 287 SRVVNRPTAWA-----TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDD 361 (1230) Q Consensus 287 ~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~ 361 (1230) -|++....... .+|.+++..+.+.+..-....+.....+.....+..+.+.+. -.+.+..+.+++. T Consensus 208 ih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~---------~l~~e~~~~~~~~ 278 (460) T protein:vir:10 208 IHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGST---------GLTQPQADSLKQR 278 (460) T ss_pred EEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCC---------CCCHHHHHHHHHH Confidence 89886543322 357777766666676666666665554444444444444331 1234667777776 Q ss_pred hhhhhhh-hh--hhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhccchh--hHHHHHH-HH Q lcl|NC_013650. 362 MQSLLAA-DF--RLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTGGA--YASSALN-RE 431 (1230) Q Consensus 362 ~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~g~~--~~~~~~~-~~ 431 (1230) ++..+.- .+ ...+...++....+ ...+.|.++ ....+++..+++|-..++....+.+ ++..+.. .. T Consensus 279 ~~~~~~g~~n~g~~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~ 354 (460) T protein:vir:10 279 LTEMDKSPDRLSQIAGASGEIAFTKI----SLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKR 354 (460) T ss_pred HHHHhcCccccCCceecCCCceEEEc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHH Confidence 6555421 11 01111112211111 111222222 2344666778888777765443332 2322221 22 Q ss_pred HHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhc-ccchhHHHHHHHHHHHhhhhhhhhhccceeeecccceecCCCe Q lcl|NC_013650. 432 FVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLK-GGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEVKFSCVVPGT 510 (1230) Q Consensus 432 ~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~-~g~~~~v~~~i~~~~~~~g~~~i~~~~~l~~~~~~~~Clt~DT 510 (1230) ++...+.-+...+......+ .+.+......+.+. +-...+..+.-..... .++ +. T Consensus 355 f~~~~l~P~~~~ie~~ln~k--l~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~----~~~------------------~~ 410 (460) T protein:vir:10 355 VVTDNIQPDLVILKQAFDKK--FIKRFKGYENAVIEWDISELPEMQTDMVAMA----SWL------------------NT 410 (460) T ss_pred HHHHHHHHHHHHHHHHHHHh--hcCcccccCCceEEeecchhhhHHHHHHHHH----HHH------------------hC Confidence 22222222222222111111 01111000000000 0000111111001100 000 00 Q ss_pred eeecccccEehheec-----cCcEEEEecCCCeEEEe----eEEEeecCcc Q lcl|NC_013650. 511 QVLTPEGQRNVEDIR-----PGDEVIAWDGTGYVVDT----ALHTGIEHRD 552 (1230) Q Consensus 511 ~VlT~dG~k~IedL~-----vGD~V~t~~g~~~~v~~----v~~~~~~~~~ 552 (1230) -|+|.+-.+.+..+. -||.++... +...+.. ......++.+ T Consensus 411 g~~T~NE~R~~~g~~pi~~~~gD~~~~~~-n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 411 IPVTPNEIRIAMKYETLNQDGMDIVFMPS-NKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred CCCCHHHHHHHhCCCCCCCCCCCeeeecc-cccchhhcccccCCCcccCCC Confidence 122222211111111 133322110 0000000 0000111111 No 14 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=62.21 E-value=0.34 Score=23.16 Aligned_cols=460 Identities=10% Similarity=0.030 Sum_probs=175.3 Q ss_pred hHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhhhhcCCCCcchhhHHHH Q lcl|NC_013650. 32 MARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRV 111 (1230) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (1230) |-++.-.+....-...| .+|+ +..-.+|..-+..+.-++--+|.... + |.+-.-.++. T Consensus 1 m~~~~~r~~~~~a~~~~-------~~~~----~~~~~~y~gA~~~~r~~~~w~~~~~s-----~------~~~~~~~~~~ 58 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRP-------EQSA----SLGGGGLEGASRLSRETVSWNPSLRS-----P------DALINPLKRI 58 (553) T ss_pred Ccchhhhhhcccccccc-------hhhh----hhhcccccccccCCCcccccccCCCC-----h------HHHHHHHHHH Confidence 11111111111000011 1111 00011232222222223333333222 2 2233456888 Q ss_pred HHHHHHHHhcccchhHHHHHhhhcCccc-ccccccc-----------------chhHHHHHHHhh--------ccccchh Q lcl|NC_013650. 112 IRHWCRLFYATHDLVPLLIDIYSKFPVV-GMEFDSK-----------------DPLIKTFYEDLF--------FGEDLNY 165 (1230) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-----------------~~~~~~~~~~~~--------~~~~~~~ 165 (1230) ||.=||..++.++++.-.|+.+...=|= |+...++ ...|++.|+... +--.||+ T Consensus 59 lr~RaRdL~rNn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f 138 (553) T protein:vir:63 59 ADARGRDMADNDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTF 138 (553) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCH Confidence 9999999999999999999987765331 2222222 134555565543 2234444 Q ss_pred HHhhHHHhhhhhhhhcceeeccccchhcc-cc-hhhhhcCchhhhcchhhhhccchheeeehhhhhcccccccccccccc Q lcl|NC_013650. 166 LEFLPDQFAREYFTVGEVTSLAHFNESLG-VW-SSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMS 243 (1230) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (1230) ..+ -...-|.++.-||+|-.-+....-| .| -+-++|+||+|...... + .+..+-. T Consensus 139 ~~~-q~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~----~------------------~~~~i~~ 195 (553) T protein:vir:63 139 TGL-IRLGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQ----L------------------DTPTLRR 195 (553) T ss_pred HHH-HHHHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCC----C------------------CCCeeEe Confidence 443 4558899999999998766654322 23 36689999997443211 0 1111222 Q ss_pred ceeec----Ccchhhhccc-hhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHH Q lcl|NC_013650. 244 TVEET----PSEREQRMRE-FQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLN 318 (1230) Q Consensus 244 ~~~~~----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (1230) =+|.+ |+-|-+..+. ++.. ..-..-++.....+..+++-..|-|+...--.-.+||-|.+.+....|...+.+. T Consensus 196 GVE~d~~Gr~vaY~i~~~hPgd~~-~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~ 274 (553) T protein:vir:63 196 GVQYDKRGRPQGYWIQVAHPGDLY-QMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFK 274 (553) T ss_pred eeEECCCCceEEEEeeccCCCccc-cccccccceeeeccccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHH Confidence 22332 2222222111 1100 0000001111112335677778889888766668899998877777777777776 Q ss_pred HHhhcccceeeceeEE--Eeeccccc--------CCcccccCchhhHHHHHHHhhhhhhh---hhhhhhhhhhhhhcccc Q lcl|NC_013650. 319 AAQDAVADRLYSPLVL--ATLGIEDM--------GDGEPWIPDQGELDEVRDDMQSLLAA---DFRLMVHNFGLKVENVF 385 (1230) Q Consensus 319 ~~~~~~~~~~~~~~~~--~~l~~~d~--------~~~~~~~~~~~~l~~~~d~~~~~~~~---~~~~~~~~~~~~~~~~~ 385 (1230) +++..-+-.-+.-..+ +..+.++. +++.+--...+..+......++-... .-.+....-|..++... T Consensus 275 daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 354 (553) T protein:vir:63 275 EMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKP 354 (553) T ss_pred HHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecC Confidence 5544322111111111 11211111 00000000000111111100000000 00001111122222211 Q ss_pred ccccccCCCcchhhHHHHHHhhhhhhhhhhhhcc-chhhHHHHHHHHHHHHHHHHHHHHHhhhccch--hhhHHHhhhhh Q lcl|NC_013650. 386 GRESVPNLDADYDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRHIRRR--CEVVAEAQGHY 462 (1230) Q Consensus 386 ~~~~~~~~d~~~~~i~~~~~~~l~i~~ali~~~~-g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~--~~~l~~~~~~~ 462 (1230) -..-..+.......+.+.+..++|+...++.++- +..|++....+....+.+...+..+.....+. ..+|.+..-.. T Consensus 355 p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G 434 (553) T protein:vir:63 355 MGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAG 434 (553) T ss_pred CCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 1111122222245678888889999988888875 56798887766666665555554443221111 01121111000 Q ss_pred hhhhccc------------------------c-hhHHHHHHHHH--------------HHhhhhhhhhhc----cceeee Q lcl|NC_013650. 463 DYDLKGG------------------------V-RVPIYREIVEY--------------DEETGQEYIRKV----PKLLIP 499 (1230) Q Consensus 463 ~~~~~~g------------------------~-~~~v~~~i~~~--------------~~~~g~~~i~~~----~~l~~~ 499 (1230) ...+..+ . -++=.+++... ..+.|..+.... ...... T Consensus 435 ~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~ 514 (553) T protein:vir:63 435 EVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALL 514 (553) T ss_pred CccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHH Confidence 0000000 0 00000111000 001111100000 000000 Q ss_pred cccceecCCCeeeecccccEehheeccCcEEEE-ecCCCeE Q lcl|NC_013650. 500 EVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIA-WDGTGYV 539 (1230) Q Consensus 500 ~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t-~~g~~~~ 539 (1230) ...+.-+..+..-.+..|- ..+-...+.--+ ..++..+ T Consensus 515 ~~~Gl~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 515 KKYGLTFNLSAKRSLGDGR--DAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHcCCCCCCCCccccCCCc--ccCCCCCCCCCCCCcccccC Confidence 0001001111111111110 000000000000 0000000 No 15 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=61.20 E-value=0.29 Score=23.50 Aligned_cols=380 Identities=11% Similarity=0.145 Sum_probs=141.4 Q ss_pred HHHH-hhhcCccccccccccchhHHHHHHHh--hccccch------------hHHhhHHHhhhhhhhhcceeeccccc-h Q lcl|NC_013650. 128 LLID-IYSKFPVVGMEFDSKDPLIKTFYEDL--FFGEDLN------------YLEFLPDQFAREYFTVGEVTSLAHFN-E 191 (1230) Q Consensus 128 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 191 (1230) .+|| |+.+.--..-...+.++.+..+|--. .-|..++ -++++-+.++.-=|++ +- - T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~--------~~~~ 72 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHT--------YKRT 72 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEE--------EEec Confidence 1221 11111110000001111111111000 0011111 2233333333221111 10 0 Q ss_pred hcccchhh-----hhc--CchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhcc-chhhhh Q lcl|NC_013650. 192 SLGVWSSE-----EIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMR-EFQDLQ 263 (1230) Q Consensus 192 ~~~~~~~~-----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 263 (1230) ..|.|... ..| .|.= ..++..||+..+.+|++.|...-+.... ..|...+....+|....+... +...+. T Consensus 73 ~~~~~~~~~~~l~~~l~~~PN~-~~t~~~f~~~~v~~lll~Gna~~~i~r~-~~G~~~~L~~l~~~~v~v~~~~~~~~~~ 150 (416) T protein:vir:12 73 DGGIERKPEHKSAHAVYARPNP-YMTAFTWKKLMMTHVLTWGNAYSYIQFG-SHGYPEALFPLRPDYTNAYVHPTTGMLW 150 (416) T ss_pred CCccccccccHHHHHHHhhccc-CCCHHHHHHHHHHHHhhcCCeEEEEEEC-CCCcEEEEEEECCcceEEEEeCCCcEEE Confidence 12232211 112 2333 2578899999999999999887766543 345577777777766653322 222222 Q ss_pred hhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccC Q lcl|NC_013650. 264 RRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMG 343 (1230) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~ 343 (1230) .+ ....|..++++..-+-|+++.... -..|.+++..+.+++..-...++...+.+...+.+.-+.+++. T Consensus 151 ~~------~~~~g~~~~~~~~eiih~~~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---- 219 (416) T protein:vir:12 151 YQ------TVLNGKAIELYDYEVLHFKGLSTD-GIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA---- 219 (416) T ss_pred EE------EecCCeEEEecCccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC---- Confidence 11 122455678888888899865322 2358888877788887777777666665555555544445432 Q ss_pred CcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhcc Q lcl|NC_013650. 344 DGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGT 419 (1230) Q Consensus 344 ~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~ 419 (1230) .-+.+..+.+++.+++...... ..+...+++.+.+ ...+.|.++ ....+++..+++|-..++.... T Consensus 220 -----~~~~e~~~~~~~~~~~~~~~~~-~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 289 (416) T protein:vir:12 220 -----FLDEKPKENVRKEWKRVNKVEN-IAIIDYGLEYQSI----SMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELD 289 (416) T ss_pred -----CCCHHHHHHHHHHHHHHhcCCC-eeecCCCceEEEc----cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcc Confidence 2245778888877754332111 1111112222111 112233332 2344566667888777776555 Q ss_pred chhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHHhh---h-hhhhhhcccchhHHHH--HHHHHHHhhhhhhhhh Q lcl|NC_013650. 420 GGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAEAQ---G-HYDYDLKGGVRVPIYR--EIVEYDEETGQEYIRK 492 (1230) Q Consensus 420 g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~~~---~-~~~~~~~~g~~~~v~~--~i~~~~~~~g~~~i~~ 492 (1230) ..+++..+... .+....+.-+...+......+ .+.... + .+.++...=...+... +....+...|.--+.. T Consensus 290 ~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~--l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE 367 (416) T protein:vir:12 290 KATFSNIEHQSIEYVRNTLQPWIVNFEQELNVK--LFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDE 367 (416) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 55555433322 222222222222222111111 000000 0 0011100000000000 0011111111000000 Q ss_pred cc-ceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcceEEEEEEcCc Q lcl|NC_013650. 493 VP-KLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVITKTG 562 (1230) Q Consensus 493 ~~-~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~G 562 (1230) .. .+.... +.++..++..-.+.+++.+..-.. +..+. ...-+. .-.+| T Consensus 368 ~R~~~gl~P-----i~ggd~~~~~~n~~~~~~~~~~~~--~~~~~------~~~gge---------~~~~g 416 (416) T protein:vir:12 368 IRELLERNP-----IENGDKYISSLNYVFLDFLEEYQR--LKAGG------AMKGGD---------NKNEG 416 (416) T ss_pred HHHHhCCCC-----CCCcceeeeccccccccccchhhc--ccccc------ccCCCC---------CcCCC Confidence 00 000000 111122222222333322210000 00000 000000 00122 No 16 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=58.25 E-value=0.42 Score=22.67 Aligned_cols=375 Identities=10% Similarity=0.016 Sum_probs=131.8 Q ss_pred cCCCCcchhhHHHHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccch------------h Q lcl|NC_013650. 98 GIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLN------------Y 165 (1230) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~ 165 (1230) |--|++.-. ..-.....|+.+..|+.--.-|..+. - T Consensus 1 M~~f~~~~~--------------------------------~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~ 48 (397) T protein:vir:38 1 MPLLKLNKS--------------------------------HSQGFSLNDPDWVNFLTGGEAQKYVSADTALKNSDIFSL 48 (397) T ss_pred Ccchhhhhc--------------------------------ccCcccCCchhhhhhhcCCcCCceechHHhhccHHHHHH Confidence 111111000 00011112232222221111112222 1 Q ss_pred HHhhHHHhhhhhhhhcceeeccccchhcccchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccce Q lcl|NC_013650. 166 LEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTV 245 (1230) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (1230) ++++-..++.-=|++ .+. ..+-=..--||+ .+.+.||+..+.+|++.|...-... +...|...+.. T Consensus 49 v~~ia~~ia~~p~~~--------~~~--~~~~l~~~PN~~---~s~~~f~~~~~~~lll~Gna~~~i~-r~~~g~~~~l~ 114 (397) T protein:vir:38 49 IMQLSGDLAMVRYTS--------ESD--RSQSIISNPSVT---ANGYSFWQGMFAQLLLDGNCYAYRH-KNTNGVDLSWE 114 (397) T ss_pred HHHHHHHHhhCcccc--------ccc--HHHHHHhcCCCC---CCHHHHHHHHHHHhhhcCCEEEEEE-ECCCCcEEEEE Confidence 344444343221211 111 111111112443 4899999999999999987644432 23445566666 Q ss_pred eecCcchhhhc-cchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcc Q lcl|NC_013650. 246 EETPSEREQRM-REFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV 324 (1230) Q Consensus 246 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (1230) ...|....+.. ..+..+.-++-. .....+....++..-+-|++....---..|.+++..+.+.+.......+...+. T Consensus 115 ~l~~~~v~i~~~~~~~~~~y~~~~--~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 192 (397) T protein:vir:38 115 YLRPSQVQPMLLQDGSGLIYNINF--DEPAIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKA 192 (397) T ss_pred EEcCceeEEEEcCCCceEEEEEEe--ccccccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 66665543332 233222212111 112223334577777888887643222469988888888888777777776766 Q ss_pred cceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhhhh--hhhhhhhhhhhccccccccccCCCcch----h Q lcl|NC_013650. 325 ADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKVENVFGRESVPNLDADY----D 398 (1230) Q Consensus 325 ~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~----~ 398 (1230) +..-+++.-+.++... + +.++.+.+++.++......+ ...+...++.... -.+.+.|.++ + T Consensus 193 f~ng~~~~~il~~~~~-~--------~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~----l~~~~~d~~~~e~~~ 259 (397) T protein:vir:38 193 LKQSVTASAVLTIQKG-G--------LLDAETRIARSKEISKQIHNSDGPVVIDALEDYKP----LEVKGNIASLLNQVD 259 (397) T ss_pred HhccCCccEEEEeCCC-C--------CHHHHHHHHHHHHHHhcccccCCceecCCCceEEe----cCCChhHHHHHHHHH Confidence 6666665555555421 1 11333444444332221110 0011111111111 1122233332 4 Q ss_pred hHHHHHHhhhhhhhhhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHHHHHH Q lcl|NC_013650. 399 RIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREI 478 (1230) Q Consensus 399 ~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v~~~i 478 (1230) ...+++..+++|...++.+... .++..+..+.+....+.-+...+......+ .+.+........+ .+...... +. T Consensus 260 ~~~~~Ia~afgVp~~~lg~~~~-~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~--l~~~~~~~~~~~~-~~d~~~~~-~~ 334 (397) T protein:vir:38 260 WTRDQIAKVYGVPDSYLNGQGD-QQSSITQISGQYAKSLNRYVQAIVGELNDK--LHANISANIRFAI-DAMGDQYA-ST 334 (397) T ss_pred HHHHHHHHHhCCCHHHhCCCCC-cccHHHHHHHHHHHHHHHHHHHHHHHHHHh--ccChhcccccccc-cCCHHHHH-HH Confidence 4566677788998887765433 222222222222222221111111111100 0111111111111 01000000 00 Q ss_pred HHHHHhhhhhhhhhccceeeecccceecCCCeeeecccccEehhee---ccCcEEEEecCCCeEEEeeEEEeecCcceEE Q lcl|NC_013650. 479 VEYDEETGQEYIRKVPKLLIPEVKFSCVVPGTQVLTPEGQRNVEDI---RPGDEVIAWDGTGYVVDTALHTGIEHRDELV 555 (1230) Q Consensus 479 ~~~~~~~g~~~i~~~~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL---~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~ 555 (1230) ...+...| |+|.+-++.+..+ ..||....... .......... ..+..+-- T Consensus 335 ~~~~~~~G-------------------------~~t~nE~R~~lg~~p~~~~d~~~~~~~-~~~~~~~~~~-~~g~~~~~ 387 (397) T protein:vir:38 335 ISSSVKGG-------------------------TIAGNQARFILQNSGYLAKDLPDPEKE-PQQAIQLIQQ-EGGENDGN 387 (397) T ss_pred HHHHHhCC-------------------------CcCHHHHHHHhCCCCCCCCcccccccc-cccccccccc-ccCCCCCC Confidence 00011110 1222211111100 11221100000 0000000000 00000000 Q ss_pred EEEEcCceEEEEcCC Q lcl|NC_013650. 556 EVITKTGRTIRCTAD 570 (1230) Q Consensus 556 rI~t~~G~~L~~Tp~ 570 (1230) . ....+. -|+ T Consensus 388 ~-~~e~~~----~~~ 397 (397) T protein:vir:38 388 N-SDERGS----DPE 397 (397) T ss_pred C-CCCCCC----CCC Confidence 0 000000 011 No 17 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=55.64 E-value=0.097 Score=26.14 Aligned_cols=544 Identities=13% Similarity=0.085 Sum_probs=174.3 Q ss_pred CCCCcchhhHHHHHHHHH--HHHhc-----ccchhHHHHHh--------hhcCccccccccccchhHHHHHHHh------ Q lcl|NC_013650. 99 IPFNVEDEEELRVIRHWC--RLFYA-----THDLVPLLIDI--------YSKFPVVGMEFDSKDPLIKTFYEDL------ 157 (1230) Q Consensus 99 ~~~~~~~~~~~~~~~~~~--~~~~~-----~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~------ 157 (1230) +-+.|--+-= |. ++|+| ++|++ |.. -...|..|+.+..+.+--+..+-.+ T Consensus 1 ~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~ 71 (648) T protein:vir:79 1 MARKVWGRGF------WSRISLMWRDEDDDKEPLV---LEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMD 71 (648) T ss_pred Cccchhcchh------hhhhhhhccCccccccccc---cccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHh Confidence 1111111111 32 25566 55443 221 2335677777777766555444332 Q ss_pred hccccchhHHhh--HHHhhhhhhhhcce-----------------eeccccchhcccchhhhhcCchhhhcchhhhhccc Q lcl|NC_013650. 158 FFGEDLNYLEFL--PDQFAREYFTVGEV-----------------TSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRE 218 (1230) Q Consensus 158 ~~~~~~~~~~~~--~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (1230) .+|...|+++=. ..++++-|-+-..| +.-...++-..+|.++.++.|.. ..+..+|++.. T Consensus 72 ~~~g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~-~~t~~~f~~~l 150 (648) T protein:vir:79 72 GGGGGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEAT-QIPTNQLFIEI 150 (648) T ss_pred hcCCccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCC-CCCHHHHHHHH Confidence 111122222111 11223333332221 22223344555676777777776 47999999999 Q ss_pred hheeeehhhhhcccccccc--------------ccccccceeecCcchhhhccchhhhhhhhhHHHhhhccCCCccccch Q lcl|NC_013650. 219 RVQLMVKDLVDHLRQGPTT--------------AGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEA 284 (1230) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (1230) +.+|++-|..--....-.. .+.+......+|....+...++..+.. +.++..+++..+.++.. T Consensus 151 ~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~---Y~y~~~g~~~~~~~~~~ 227 (648) T protein:vir:79 151 AEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKG---WQQEQEGQDKPQKFKPE 227 (648) T ss_pred HHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceee---eEEEecCCceeEEecCc Confidence 9999877665433322222 222334444566666666555554321 22355677777788777 Q ss_pred hhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhh Q lcl|NC_013650. 285 LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQS 364 (1230) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~ 364 (1230) -+-|++...+.....|.|++..+..+|.+-....+...+.+..-+.+..+.+++..+. ..-+.+..++.+++.+.. T Consensus 228 dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~----~~e~~k~~~e~~~~~~~~ 303 (648) T protein:vir:79 228 DIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQE----GFGAEEGEVDLVRGEVEN 303 (648) T ss_pred cEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCcc----chHHHHHHHHHHHHhccc Confidence 7889997666667789999989998888877777776666666667766666642211 111223444555544433 Q ss_pred hhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_013650. 365 LLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGF 440 (1230) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~~~~~~k~~~~~ 440 (1230) .. +...+...+.+...-...+.|.+| ....+++..+++|-..++.......++........+....... T Consensus 304 ~~-------i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l 376 (648) T protein:vir:79 304 MD-------VEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKDRIKAL 376 (648) T ss_pred cc-------ccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHHHHHHH Confidence 21 111111011000000001111122 2234566777888776664332223333222211111111111 Q ss_pred HHHHhhhccchhhhHHHh--hhhhhhhhc--ccchh---HHHHHHHHHHHhhhhhhhhhccceeeecccceecCCCeeee Q lcl|NC_013650. 441 QNALKRHIRRRCEVVAEA--QGHYDYDLK--GGVRV---PIYREIVEYDEETGQEYIRKVPKLLIPEVKFSCVVPGTQVL 513 (1230) Q Consensus 441 ~~~l~~~~~~~~~~l~~~--~~~~~~~~~--~g~~~---~v~~~i~~~~~~~g~~~i~~~~~l~~~~~~~~Clt~DT~Vl 513 (1230) ...+....... .+.++ ...+...+. ...+. .+.+.-.....+....++. + -|+ T Consensus 377 ~~~i~~~le~~--~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~-----------~-------Gil 436 (648) T protein:vir:79 377 QKVMATFINEF--MVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYE-----------H-------NAI 436 (648) T ss_pred HHHHHHHHHHH--HHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHh-----------C-------CCc Confidence 11111110000 00000 000000000 00000 0000000000000000000 0 022 Q ss_pred ccc------ccEehheeccCcEEEE-----ecCCCeEEEeeEEEeecC---cceEE--------EEEEcCceEEEEcCCc Q lcl|NC_013650. 514 TPE------GQRNVEDIRPGDEVIA-----WDGTGYVVDTALHTGIEH---RDELV--------EVITKTGRTIRCTADH 571 (1230) Q Consensus 514 T~d------G~k~IedL~vGD~V~t-----~~g~~~~v~~v~~~~~~~---~~~l~--------rI~t~~G~~L~~Tp~H 571 (1230) |.+ |+-||.+-...+++.. .+........-....... ..+.- +-...+|... ++. T Consensus 437 T~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~--~~~- 513 (648) T protein:vir:79 437 SEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKT--SPK- 513 (648) T ss_pred CHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCC--CCc- Confidence 222 2222211000000000 000000000000000000 00000 0000000000 000 Q ss_pred ceEeecCcchhhhhhcchhccccccccccccccccchhhHHhhhhhhcCCeeecCcceEeeccHHHHHHHHHH-HHhhcc Q lcl|NC_013650. 572 PFWTDQGWVKAQDLTDEIAIRTSAGQLAAQVETEDDPDLLRFLGLLVGDGSYSTTHVSLSVSDLEVDRFVCEQ-AERMGL 650 (1230) Q Consensus 572 ~f~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llG~~vgDG~~~~~~i~~~~~d~e~~~~l~~~-~~~lg~ 650 (1230) .....+............. ..........+..+-+|...--+++ ...+. +.+..+.+.+..+ .+.... T Consensus 514 ---~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~~ 582 (648) T protein:vir:79 514 ---KQTNGRHVRYMQEMLLEYT-----TLNEAIKALIERYYQYGSKEHLKSI-NGSLM--YTEGRLLELTTQYWGEEVTE 582 (648) T ss_pred ---cccchhhhhhhhhhhhcch-----hhhHHHhhHHHHHHHHhHHHHHHhh-hhhhe--eccchhHHHHHHHhhhhhhc Confidence 0000000000000000000 0000000000000000000000000 00000 0011111111110 000000 Q ss_pred ccccccccccceeEeeeeecccceeeeeechhhH-HHHHHHhhhccccc---------------cccccHHHhcCCHHHH Q lcl|NC_013650. 651 TARRKADARTDKVWYRTFVRPAPWKGNALHNPLH-KLLREQGMWGKNGH---------------QKRVPPIVWAAGEKGR 714 (1230) Q Consensus 651 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~a~---------------~K~IP~~i~~~~~e~~ 714 (1230) . +.+ .++.....+. .....++.....+. .|+|-...|.-+.. T Consensus 583 ~-----------~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 640 (648) T protein:vir:79 583 K-----------VRI---------PFHRMTENLREEVMSTIDKVEGVAEASDIAQAVFDVFTDRLGHISNEAFAISES-- 640 (648) T ss_pred e-----------eee---------eHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHhhhhhhhhhHHHhhh-- Confidence 0 000 0000000000 00000000000000 11221111111110 Q ss_pred HHHHHHHHhcCC Q lcl|NC_013650. 715 AAFLSGYFDADG 726 (1230) Q Consensus 715 ~afL~GLfdgDG 726 (1230) | .-..+|| T Consensus 641 ---~-~~~~~~~ 648 (648) T protein:vir:79 641 ---L-AEVNGDG 648 (648) T ss_pred ---H-hhhcCCC Confidence 0 1123344 No 18 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=49.16 E-value=0.18 Score=24.64 Aligned_cols=373 Identities=12% Similarity=0.033 Sum_probs=128.9 Q ss_pred HHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHH---------Hhhccc--cchhHHhhHHHhhhhhhh Q lcl|NC_013650. 111 VIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYE---------DLFFGE--DLNYLEFLPDQFAREYFT 179 (1230) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~--~~~~~~~~~~~~~~~~~~ 179 (1230) -+--+-..|.+..+-.. .--+..+.....|..+...+- +.++.. -.--+++|-+.++.= T Consensus 1 m~m~~f~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l--- 70 (392) T protein:vir:39 1 MILPILNFINQTNDPPE-------VGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIV--- 70 (392) T ss_pred Ccchhhhhhhccccccc-------ccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccC--- Confidence 11111111111000000 000011111122222222111 000000 001223333333321 Q ss_pred hcceeeccccchhcccchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhccc- Q lcl|NC_013650. 180 VGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMRE- 258 (1230) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 258 (1230) +|..+++... .-+-.|.- ..+++.||+..+.+|++.|...-+.. +...|.+.+..-.+|.+..+.... T Consensus 71 -----p~~~~~~~~~----~l~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~g~~~~L~~l~~~~v~~~~~~~ 139 (392) T protein:vir:39 71 -----KINAEKKKNQ----GIIDNPST-NANKHGFWQSMFAQLLLGGEAFAYRW-RNANGADMKWEYLRPSQVNTYYFEY 139 (392) T ss_pred -----ceeeccchhh----hHhhcCCC-CCCHHHHHHHHHHHhhhcCcEEEEEE-ECCCCcEEEEEEEcCceeEEEEcCC Confidence 2222222221 11112433 36899999999999999998765543 334566778887788776544432 Q ss_pred hhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeec Q lcl|NC_013650. 259 FQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLG 338 (1230) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 338 (1230) +..+.-++- +.....+....++..-+-|++....-=...|-+++..+.+++.+-....+...+.+..-+.+.-+.++. T Consensus 140 ~~~~~y~~~--~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 217 (392) T protein:vir:39 140 ENGMYYNIT--FDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVK 217 (392) T ss_pred CceEEEEEE--ecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Confidence 222111110 011122223446666678888653222245888877777777777766666666555555555455554 Q ss_pred ccccCCcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhh Q lcl|NC_013650. 339 IEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEAL 414 (1230) Q Consensus 339 ~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~al 414 (1230) .. . ..+.+..+..++.+.+.-. .....+...++..+-+ ...+.|.++ +...+++..++||-..+ T Consensus 218 ~~-~------~~~~~~~~~~~~~~~~~~~-~g~~~vl~~g~~~~~l----~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~ 285 (392) T protein:vir:39 218 GG-G------LLSDKDKASRSRSFMKRSR-SGGPVVLDDLEEFTAL----EIKSNVAQLLSQTDWTSKQYAKVYGLPDSY 285 (392) T ss_pred CC-C------CchHHHHHHHHHHHhcccc-CCCeeecCCCceEEEc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 21 1 1222233333332222110 0000111111111111 112223332 33456777788888777 Q ss_pred hhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHHHHHHHHHHHhhhhhhhhhcc Q lcl|NC_013650. 415 ISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVP 494 (1230) Q Consensus 415 i~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v~~~i~~~~~~~g~~~i~~~~ 494 (1230) +.. .+...+.....+.++...+.-+...+......+ .+..+.......+ .... ......+..+...+..-..... T Consensus 286 lg~-~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~--L~~~~~~d~~~~~-~~d~-~~~~~~~~~l~~~g~~t~nE~r 360 (392) T protein:vir:39 286 IGG-QGDQQSSIQQISGMYASALNRYLRPAISELEYK--LSDHISVNMRPAI-DPLG-DNYLSTISTATRWGALAENQAT 360 (392) T ss_pred hCC-CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--ccccccccchhhh-ccCH-HHHHHHHHHHHhCCCcCHHHHH Confidence 642 333332222222222222222222211111100 0000000000000 0000 0000111111111100000000 Q ss_pred ceeeecccceecCCCeeeecccccEehhee---ccCcEEEEecCCCeEEE Q lcl|NC_013650. 495 KLLIPEVKFSCVVPGTQVLTPEGQRNVEDI---RPGDEVIAWDGTGYVVD 541 (1230) Q Consensus 495 ~l~~~~~~~~Clt~DT~VlT~dG~k~IedL---~vGD~V~t~~g~~~~v~ 541 (1230) .+. ... + +++++++..+++ ..||. ...+. T Consensus 361 ~~l-~~~-g---------~~p~e~r~~e~l~~~~~Gd~-------~~p~p 392 (392) T protein:vir:39 361 FVL-QEA-G---------YIPKDLPAPENTNKKTTGQS-------NEPVP 392 (392) T ss_pred HHH-Hhc-C---------CCccccchhcCCCCCCCCCC-------CCCCC Confidence 000 000 0 112222222222 11221 01111 No 19 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=49.16 E-value=0.18 Score=24.64 Aligned_cols=373 Identities=12% Similarity=0.033 Sum_probs=128.9 Q ss_pred HHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHH---------Hhhccc--cchhHHhhHHHhhhhhhh Q lcl|NC_013650. 111 VIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYE---------DLFFGE--DLNYLEFLPDQFAREYFT 179 (1230) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~--~~~~~~~~~~~~~~~~~~ 179 (1230) -+--+-..|.+..+-.. .--+..+.....|..+...+- +.++.. -.--+++|-+.++.= T Consensus 1 m~m~~f~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l--- 70 (392) T protein:vir:10 1 MILPILNFINQTNDPPE-------VGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIV--- 70 (392) T ss_pred Ccchhhhhhhccccccc-------ccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccC--- Confidence 11111111111000000 000011111122222222111 000000 001223333333321 Q ss_pred hcceeeccccchhcccchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhccc- Q lcl|NC_013650. 180 VGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMRE- 258 (1230) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 258 (1230) +|..+++... .-+-.|.- ..+++.||+..+.+|++.|...-+.. +...|.+.+..-.+|.+..+.... T Consensus 71 -----p~~~~~~~~~----~l~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~g~~~~L~~l~~~~v~~~~~~~ 139 (392) T protein:vir:10 71 -----KINAEKKKNQ----GIIDNPST-NANKHGFWQSMFAQLLLGGEAFAYRW-RNANGADMKWEYLRPSQVNTYYFEY 139 (392) T ss_pred -----ceeeccchhh----hHhhcCCC-CCCHHHHHHHHHHHhhhcCcEEEEEE-ECCCCcEEEEEEEcCceeEEEEcCC Confidence 2222222221 11112433 36899999999999999998765543 334566778887788776544432 Q ss_pred hhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeec Q lcl|NC_013650. 259 FQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLG 338 (1230) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 338 (1230) +..+.-++- +.....+....++..-+-|++....-=...|-+++..+.+++.+-....+...+.+..-+.+.-+.++. T Consensus 140 ~~~~~y~~~--~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 217 (392) T protein:vir:10 140 ENGMYYNIT--FDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVK 217 (392) T ss_pred CceEEEEEE--ecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Confidence 222111110 011122223446666678888653222245888877777777777766666666555555555455554 Q ss_pred ccccCCcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhh Q lcl|NC_013650. 339 IEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEAL 414 (1230) Q Consensus 339 ~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~al 414 (1230) .. . ..+.+..+..++.+.+.-. .....+...++..+-+ ...+.|.++ +...+++..++||-..+ T Consensus 218 ~~-~------~~~~~~~~~~~~~~~~~~~-~g~~~vl~~g~~~~~l----~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~ 285 (392) T protein:vir:10 218 GG-G------LLSDKDKASRSRSFMKRSR-SGGPVVLDDLEEFTAL----EIKSNVAQLLSQTDWTSKQYAKVYGLPDSY 285 (392) T ss_pred CC-C------CchHHHHHHHHHHHhcccc-CCCeeecCCCceEEEc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 21 1 1222233333332222110 0000111111111111 112223332 33456777788888777 Q ss_pred hhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHHHHHHHHHHHhhhhhhhhhcc Q lcl|NC_013650. 415 ISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVP 494 (1230) Q Consensus 415 i~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v~~~i~~~~~~~g~~~i~~~~ 494 (1230) +.. .+...+.....+.++...+.-+...+......+ .+..+.......+ .... ......+..+...+..-..... T Consensus 286 lg~-~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~--L~~~~~~d~~~~~-~~d~-~~~~~~~~~l~~~g~~t~nE~r 360 (392) T protein:vir:10 286 IGG-QGDQQSSIQQISGMYASALNRYLRPAISELEYK--LSDHISVNMRPAI-DPLG-DNYLSTISTATRWGALAENQAT 360 (392) T ss_pred hCC-CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--ccccccccchhhh-ccCH-HHHHHHHHHHHhCCCcCHHHHH Confidence 642 333332222222222222222222211111100 0000000000000 0000 0000111111111100000000 Q ss_pred ceeeecccceecCCCeeeecccccEehhee---ccCcEEEEecCCCeEEE Q lcl|NC_013650. 495 KLLIPEVKFSCVVPGTQVLTPEGQRNVEDI---RPGDEVIAWDGTGYVVD 541 (1230) Q Consensus 495 ~l~~~~~~~~Clt~DT~VlT~dG~k~IedL---~vGD~V~t~~g~~~~v~ 541 (1230) .+. ... + +++++++..+++ ..||. ...+. T Consensus 361 ~~l-~~~-g---------~~p~e~r~~e~l~~~~~Gd~-------~~p~p 392 (392) T protein:vir:10 361 FVL-QEA-G---------YIPKDLPAPENTNKKTTGQS-------NEPVP 392 (392) T ss_pred HHH-Hhc-C---------CCccccchhcCCCCCCCCCC-------CCCCC Confidence 000 000 0 112222222222 11221 01111 No 20 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=47.82 E-value=0.43 Score=22.60 Aligned_cols=363 Identities=10% Similarity=0.103 Sum_probs=126.2 Q ss_pred ccccccc--------hhHHHHHHHhhccccch-------------------------hHHhhHHHhhhhhhhhcceeecc Q lcl|NC_013650. 141 MEFDSKD--------PLIKTFYEDLFFGEDLN-------------------------YLEFLPDQFAREYFTVGEVTSLA 187 (1230) Q Consensus 141 ~~~~~~~--------~~~~~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~ 187 (1230) |+|-.|- +.+.+++..-.- ...+ -+++|-+.++.-=|++-+.... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~- 78 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTS-KLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV- 78 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhccccc-ccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccc- Confidence 6665541 112222111110 0001 1222222222211111110000 Q ss_pred ccchhcccchhhhhcC--chhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhcc--chhhhh Q lcl|NC_013650. 188 HFNESLGVWSSEEILN--PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMR--EFQDLQ 263 (1230) Q Consensus 188 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 263 (1230) .+..+ ..+|| |.= ..+++.||+..+.+|++.|-.--+... ...|...+.+-.+|....+... .+.+.+ T Consensus 79 -~~~~~-----~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r-~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y 150 (412) T protein:vir:26 79 -VNTEV-----SDLLTVSPNN-SLSSFDFINQIETIRNEKGNAYVLIER-DIYHQPSKLFLLNPDVVEMLIENQSRELYY 150 (412) T ss_pred -ccchH-----HHHHHhhccc-CCCHHHHHHHHHHHHhhcCceEEEEEE-CCCCcEEEEEEEcCceeEEEEeCCCcEEEE Confidence 11111 11232 443 258999999999999988876444332 2334455666556655443322 222222 Q ss_pred hhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccC Q lcl|NC_013650. 264 RRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMG 343 (1230) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~ 343 (1230) +.. ...++.+.++..-+-|+++-..-=...|.+.+..+.+++.+....++..-. .+....+++++. + . T Consensus 151 ~~~------~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~~~-~~~~~~~~i~~~-~-~--- 218 (412) T protein:vir:26 151 SIH------AATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLT-EMQKPDSFMLKY-G-S--- 218 (412) T ss_pred EEE------cCCceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH-hcCCCCceEEec-C-C--- Confidence 221 112333457777778887642211234776665555666655544443211 222222233222 1 1 Q ss_pred CcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhcc Q lcl|NC_013650. 344 DGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGT 419 (1230) Q Consensus 344 ~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~ 419 (1230) .-+.+..+.+++.++......-...+...++....+ ...+.|.++ +...+++..+++|-..++.+.. T Consensus 219 -----~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~ 289 (412) T protein:vir:26 219 -----NVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPL----PKKYVSEDIVASENLTRERVANVFQLPSVFLNARS 289 (412) T ss_pred -----CCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEc----CCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 124466777777776554321111122222222211 112223332 2244667778899888776544 Q ss_pred chhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHH--h-hh-hhhhhhcccchhHHHH--HHHHHHHhhhhhhhhh Q lcl|NC_013650. 420 GGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAE--A-QG-HYDYDLKGGVRVPIYR--EIVEYDEETGQEYIRK 492 (1230) Q Consensus 420 g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~--~-~~-~~~~~~~~g~~~~v~~--~i~~~~~~~g~~~i~~ 492 (1230) ...++..+... .+....+.-+...+......+. +.+ . .+ .+.++...=...+... +.+......|.--+.. T Consensus 290 ~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kL--l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE 367 (412) T protein:vir:26 290 NTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKL--LTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTIND 367 (412) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 44454443322 2222222222222221111110 000 0 00 0111110000001100 0111111111000000 Q ss_pred c-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcceEEEEEEcCc Q lcl|NC_013650. 493 V-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVITKTG 562 (1230) Q Consensus 493 ~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~G 562 (1230) . ..+......+ +|..+ +...+.+|+....+.... .| +.... ..| T Consensus 368 ~R~~~gl~p~~g----gD~~~-~~~n~~~~~~~~~~~~~~--~g-----------G~~n~--------~e~ 412 (412) T protein:vir:26 368 IREWEDLPPVEG----GDKPL-ISGDLYPIDTPLELRKSL--KG-----------GDKNV--------NES 412 (412) T ss_pred HHHHhCCCCCCC----cCeee-ecccccccccchhhcccc--cC-----------CCCCc--------CCC Confidence 0 0000111000 12222 111222332221111000 00 00000 000 No 21 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=46.44 E-value=0.73 Score=21.32 Aligned_cols=373 Identities=12% Similarity=0.098 Sum_probs=130.2 Q ss_pred HHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhcccc--chhHHhhHHHhhhhhhhhcceeecccc Q lcl|NC_013650. 112 IRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGED--LNYLEFLPDQFAREYFTVGEVTSLAHF 189 (1230) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (1230) ++-|-+++=++..- ...-+.+ +...+.-+-+....-+...+..++... .--++++-+.++.- +|... T Consensus 1 M~~f~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~--------p~~~~ 69 (386) T protein:vir:49 1 MPIFNITNLATESP-PINQESF--FDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATA--------KITTS 69 (386) T ss_pred CchhhhhccCCCCc-ccchhhh--hhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhC--------ceeec Confidence 11122222111110 0000000 000000000000000011111111000 01234444444331 22222 Q ss_pred chhcccchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhc-cchhhhhhhhhH Q lcl|NC_013650. 190 NESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRM-REFQDLQRRYPE 268 (1230) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 268 (1230) .... ..-...|.- ..++..||+..+.+|++.|...-+... ...|...+....+|....+.. .++..+..++- T Consensus 70 ~~~~----~~l~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~r-~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~- 142 (386) T protein:vir:49 70 RKQL----QGIVDNPSN-NANRFNFYQSIFAQMLLGGEAFAYRWR-NDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNIT- 142 (386) T ss_pred cchh----hhhhhccCC-CCCHHHHHHHHHHHhhhcCCEEEEEEE-CCCCcEEEEEEecCceeEEEEcCCCceEEEEEE- Confidence 2111 011112333 258999999999999999887555322 234555566666665553332 22222222211 Q ss_pred HHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccCCcccc Q lcl|NC_013650. 269 IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPW 348 (1230) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~~~ 348 (1230) +.-...+....++..-+-|++....-=...|.+.+..+.+.+..-....+.....+...+++.-+.+++.... T Consensus 143 -~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~------ 215 (386) T protein:vir:49 143 -FDDPHIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGL------ 215 (386) T ss_pred -EcCccccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCC------ Confidence 1122333345577777889887643223469999888888888877777777776666666666666653211 Q ss_pred cCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhccchhhH Q lcl|NC_013650. 349 IPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTGGAYA 424 (1230) Q Consensus 349 ~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~g~~~~ 424 (1230) + +....++..++.+-.-.-...+...++..+.+ ...+.|.++ +...+++..+++|-..++.+ .+..++ T Consensus 216 -~--~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l----~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~ 287 (386) T protein:vir:49 216 -L--DFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPL----EIKSNVAQLLSQADWTTGQFAKVYGIPESIVGG-DGDQQS 287 (386) T ss_pred -h--HHHHHHHHHHHHhccCCCCceecCCCceEEEc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCccc Confidence 1 22222222222211110111111122211111 112223232 33455667788887777653 223332 Q ss_pred HHHHHHHHHHHHH----HHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHH--HHHHHHHHHhhhhhhhhhccceee Q lcl|NC_013650. 425 SSALNREFVTQIM----TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPI--YREIVEYDEETGQEYIRKVPKLLI 498 (1230) Q Consensus 425 ~~~~~~~~~~k~~----~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v--~~~i~~~~~~~g~~~i~~~~~l~~ 498 (1230) ..+..+......+ ......+...... ...+++..-...+. ....+..+...|.--+.....+.. T Consensus 288 ~~~~~~~~~~~~i~~~l~~i~~~~~~~l~~----------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~ 357 (386) T protein:vir:49 288 SLEMIYNIYFKSVSRYLRPFVSEMSKKLSC----------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQ 357 (386) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHh Confidence 2222221111111 1111111111100 11111000000000 011111111111000000000000 Q ss_pred ecccceecCCCeeeecccccEehheeccCcEEEEecCCC Q lcl|NC_013650. 499 PEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTG 537 (1230) Q Consensus 499 ~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~ 537 (1230) . .. +.++ .+...++.. ...++-||. |++. T Consensus 358 -~--~~-~~~~-~~~~~~~~~-~~~~~gGd~----~~~~ 386 (386) T protein:vir:49 358 -Q--AE-ILPK-ELPDGKNPN-RTSLKGGEI----NEQD 386 (386) T ss_pred -h--CC-CCCC-cCcchhccC-CCCCCCCCC----CCCC Confidence 0 00 0000 000000000 011222331 1111 No 22 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=46.37 E-value=0.53 Score=22.08 Aligned_cols=379 Identities=12% Similarity=0.077 Sum_probs=132.1 Q ss_pred HHHHHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHH-Hhhccccch------------hHHhhHHHhhh Q lcl|NC_013650. 109 LRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYE-DLFFGEDLN------------YLEFLPDQFAR 175 (1230) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~------------~~~~~~~~~~~ 175 (1230) ...+.+| | .+..- -....+...+-...+ ..+.|.... -++++-+.++. T Consensus 1 Mgl~~~~---f--------------~~~~~--~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~ 61 (409) T protein:vir:84 1 MSLFTRI---F--------------SGPSE--ERTLTKISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVAS 61 (409) T ss_pred Cchhhhh---h--------------cCCCc--ccccccccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhh Confidence 2222211 1 11000 000000000000001 111122221 23334444432 Q ss_pred hhhhhcceeeccccchhcccchh----hhhc--CchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecC Q lcl|NC_013650. 176 EYFTVGEVTSLAHFNESLGVWSS----EEIL--NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETP 249 (1230) Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (1230) -=|+| +-...+.... ...| .|.= ..++..|++..+.+|++.|....+-..+..+|...+....+| T Consensus 62 lp~~~--------~~~~~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p 132 (409) T protein:vir:84 62 LSIDA--------YRKKDNVRIPVSPAPKLLESTPYP-GLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHP 132 (409) T ss_pred CceEE--------EEecCCcccccchHHHHhhccCCC-CCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcC Confidence 22211 1111111111 1122 2433 357899999999999999987654444456677777777777 Q ss_pred cchhhhccchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceee Q lcl|NC_013650. 250 SEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLY 329 (1230) Q Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (1230) ....+......... .+...-+..+-.++..-+-|++....---..|-+++..+.+++.+-....+.....+...+ T Consensus 133 ~~v~v~~~~~~~~~-----~~~~~~~~~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~ 207 (409) T protein:vir:84 133 DCIHVTDAKDEDGD-----WIEPVYRIDGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSA 207 (409) T ss_pred ceeEEEEcCCCcce-----EEEEEecCCceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 65544322111100 0111112223446677777888653211135777776667677766666666565555555 Q ss_pred ceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHH Q lcl|NC_013650. 330 SPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLL 405 (1230) Q Consensus 330 ~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~ 405 (1230) .+.-+.+++. + -+.+.++.+++.+.....-.-...+...++..+. -...+.|.++ ....+++. T Consensus 208 ~p~gil~~~~-~--------l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~----~~~~~~d~q~~e~~~~~~~~Ia 274 (409) T protein:vir:84 208 NPSGILSSDA-D--------LTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQS----VSITPNESQFLETRSFQRSEIA 274 (409) T ss_pred CccEEEecCC-C--------CCHHHHHHHHHHHHHHhccCCCeeecCCCceEEE----ccCChhHHHHHHHHHHHHHHHH Confidence 5544445542 1 1335667776655443311100111111111111 1122233332 23446677 Q ss_pred hhhhhhhhhhhhccchhhHHHH---HHHHHHHHHHHHHHHHHhhhccchhhhHHHhhhh-hhhhhcccchhHHHH--HHH Q lcl|NC_013650. 406 QAWGIGEALISGGTGGAYASSA---LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGH-YDYDLKGGVRVPIYR--EIV 479 (1230) Q Consensus 406 ~~l~i~~ali~~~~g~~~~~~~---~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~-~~~~~~~g~~~~v~~--~i~ 479 (1230) .+++|-..++....+..+..++ ....+....+.-+...+......+ |. .+. ..+++..=...+... +.+ T Consensus 275 ~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~---L~--~g~~i~fd~~~l~~~d~~~~~~~~ 349 (409) T protein:vir:84 275 MWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTF---LP--RGQFVKFNVDGLMRGDVTARFTAY 349 (409) T ss_pred HHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHh---cc--CCCeEEEechhhhccCHHHHHHHH Confidence 7888877766433333332221 111222222222211111111100 00 000 011100000000000 001 Q ss_pred HHHHhhhhhhhhhcc-ceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcc Q lcl|NC_013650. 480 EYDEETGQEYIRKVP-KLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRD 552 (1230) Q Consensus 480 ~~~~~~g~~~i~~~~-~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~ 552 (1230) ..+...|.--+.... .+.... +.++...+.+-.+.+++++...+..-.... .....+.+ T Consensus 350 ~~~~~~G~~t~NE~R~~~g~~p-----~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~---------~~~~~gn~ 409 (409) T protein:vir:84 350 QMGLQNGIWSVNEVRAWEDAPP-----IPEGDIHLQPMNFVPLGYVPPEEPAQEPQP---------NSATEGNK 409 (409) T ss_pred HHHHhCCCcCHHHHHHHhCCCC-----CCCcceeeecccccccccCCccccCcCCCC---------CCccCCCC Confidence 111111100000000 000000 111222222333333333322111100000 00000111 No 23 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=41.39 E-value=0.93 Score=20.76 Aligned_cols=440 Identities=14% Similarity=0.082 Sum_probs=177.6 Q ss_pred hHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhhhhcCCCCcchhhHHHH Q lcl|NC_013650. 32 MARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRV 111 (1230) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (1230) |.+|..-+. +-...+- +. ..++. ..-|.....|.-.+ -+.-++.- .++|-...+ |.+-...++. T Consensus 1 ~~r~~~~~~---~~dr~i~-~~-~~~~~-~~~~~~~~~y~aa~-~~r~~~~w---~~~~~~~s~------~~~i~~~~~~ 64 (505) T protein:vir:96 1 MKRAEKKPS---LAQRMVN-WA-WYRYV-EPQKNAARAFEAAR-RDRLGKAW---LRRASRLSA------DEEIYADLAS 64 (505) T ss_pred CCCCccccc---hhhcccc-hh-hhhhH-HHHHHhhhhccccc-CCCccccc---cCCCCCCCh------HHHHHHHHHH Confidence 433322111 1111111 11 11111 11122233343222 11111111 012211111 2333456888 Q ss_pred HHHHHHHHhcccchhHHHHHhhhcCcc-c-ccccccc--------ch----hHHHHHHHhhc------cccchhHHhhHH Q lcl|NC_013650. 112 IRHWCRLFYATHDLVPLLIDIYSKFPV-V-GMEFDSK--------DP----LIKTFYEDLFF------GEDLNYLEFLPD 171 (1230) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~--------~~----~~~~~~~~~~~------~~~~~~~~~~~~ 171 (1230) ||.=||..|+.++++.-.|+.+...=| . |+-+.+. |+ .|...|+...- .-.+|+. -|-. T Consensus 65 lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~-~lq~ 143 (505) T protein:vir:96 65 LVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFV-TLLH 143 (505) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHH-HHHH Confidence 999999999999999999999988866 2 6655543 32 45555554331 1122332 2445 Q ss_pred Hhhhhhhhhcceeeccccchhcccch-hhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeec-- Q lcl|NC_013650. 172 QFAREYFTVGEVTSLAHFNESLGVWS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEET-- 248 (1230) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 248 (1230) ..-|.++.-||+|-.-+.... +-|. +-+.|+||+|........++ +..+-.=+|.+ T Consensus 144 l~~r~~~~dGE~f~~~~~~~~-~~~~~~lqliepd~l~~~~n~~~~~--------------------~~~i~~GIe~d~~ 202 (505) T protein:vir:96 144 LWMETLARDGEVLVREHRGYP-NKWGYALQILECDRLDLNYNADLQN--------------------GNRIRMSIELDAW 202 (505) T ss_pred HHHHHHhhCCceEEEEeecCC-CCcceEEEEechhhcCCCCCcccCC--------------------cCeEEeceEECCC Confidence 588999999998755443322 1222 67899999974332111111 01111112222 Q ss_pred --Ccchhhhcc-chhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhccc Q lcl|NC_013650. 249 --PSEREQRMR-EFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA 325 (1230) Q Consensus 249 --~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (1230) |+-|-+..+ -++-..+ ....+....++|-..|-|+...--.--.||-|.+......|...+.+.+++..-+ T Consensus 203 Gr~~aY~i~~~hPgd~~~~------~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a 276 (505) T protein:vir:96 203 ERPVAYHLLVNHPGDNSYC------YHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAA 276 (505) T ss_pred CceEEEEEeecCCCccccc------cccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHH Confidence 222211111 0110000 0112233455777788899887667788999998888777777777776544322 Q ss_pred ceeeceeEEEeecccccCCcc-cccCchh-hHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcchhhHHHH Q lcl|NC_013650. 326 DRLYSPLVLATLGIEDMGDGE-PWIPDQG-ELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERK 403 (1230) Q Consensus 326 ~~~~~~~~~~~l~~~d~~~~~-~~~~~~~-~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~i~~~ 403 (1230) -.-+.-..+.+= |.+... +.....+ .+..+. ... +....-|..++...-..-..++...+..+.+. T Consensus 277 ~i~A~~a~fi~~---~~~~~~~~~~~~~~~~~~~l~----pG~-----i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~ 344 (505) T protein:vir:96 277 ELGAKKVGFYEQ---DPEAYDQPPEDDQGEIVEEVE----AGT-----YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRG 344 (505) T ss_pred HHhhhheeeeec---CCccCCCccccccCccccccC----Cce-----eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHH Confidence 222211112221 111111 1011111 111111 100 11112233333322222122333334678888 Q ss_pred HHhhhhhhhhhhhhcc-chhhHHHHHHHHHHHHHHHHHHHHHhhhccch-h-hhHHHh--hhhhhh------------hh Q lcl|NC_013650. 404 LLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRHIRRR-C-EVVAEA--QGHYDY------------DL 466 (1230) Q Consensus 404 ~~~~l~i~~ali~~~~-g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~-~-~~l~~~--~~~~~~------------~~ 466 (1230) +..++|+...++.++- +..|++....+....+.+...+..+.....+. + .+|.+. .+.... +. T Consensus 345 iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~ 424 (505) T protein:vir:96 345 VAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQ 424 (505) T ss_pred HHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeec Confidence 8999999988888775 56788877666555555444444332211110 1 112211 111000 00 Q ss_pred cccc-hhHHHHHHHHH--------------HHhhhhhhhhhc----cceeeecccceecCCCe-eeecccccEehheecc Q lcl|NC_013650. 467 KGGV-RVPIYREIVEY--------------DEETGQEYIRKV----PKLLIPEVKFSCVVPGT-QVLTPEGQRNVEDIRP 526 (1230) Q Consensus 467 ~~g~-~~~v~~~i~~~--------------~~~~g~~~i~~~----~~l~~~~~~~~Clt~DT-~VlT~dG~k~IedL~v 526 (1230) ..+. -++-.+++... ..+.|..+.... .........+ +..++ ...+..+-..=++=.. T Consensus 425 ~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~G--l~~~~~~~~~~~~~~~~~~~~~ 502 (505) T protein:vir:96 425 PRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKG--VNPTPPEQESKDATTDEEDDSA 502 (505) T ss_pred cCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcC--CCCCCCCCCCCCCCCCCCCCCC Confidence 0000 00001111000 011111110000 0000000001 11111 0111111111111112 Q ss_pred CcE Q lcl|NC_013650. 527 GDE 529 (1230) Q Consensus 527 GD~ 529 (1230) +|. T Consensus 503 ~d~ 505 (505) T protein:vir:96 503 SDD 505 (505) T ss_pred CCC Confidence 232 No 24 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=41.15 E-value=0.39 Score=22.80 Aligned_cols=359 Identities=11% Similarity=0.085 Sum_probs=130.6 Q ss_pred HHhhhcCc---cccccccccchhHHHHHHHhhccccch------------hHHhhHHHhhhhhhhhcceeeccccchhcc Q lcl|NC_013650. 130 IDIYSKFP---VVGMEFDSKDPLIKTFYEDLFFGEDLN------------YLEFLPDQFAREYFTVGEVTSLAHFNESLG 194 (1230) Q Consensus 130 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 194 (1230) .=++++.. ..... -..+-.-..|......|+... -++++-+.++.-=|++-+.. . .. T Consensus 1 Mg~f~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~-----~--~~ 72 (382) T protein:vir:48 1 MPIFNLATESPPDNQG-GFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKK-----L--QG 72 (382) T ss_pred CccccccccCCccccc-ccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecch-----h--hh Confidence 11111110 00000 000000011111222222221 22333333332222222110 0 11 Q ss_pred cchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhc-cchhhhhhhhhHHHhhh Q lcl|NC_013650. 195 VWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRM-REFQDLQRRYPEIIQAA 273 (1230) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 273 (1230) ++ -.|.- ..++..|++..+.+|++.|-..-+... -..|...+....+|....+.. .++..+.-++-. .-. T Consensus 73 L~-----~~PN~-~~t~~~f~~~l~~~l~l~Gna~~~i~r-d~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~--~~~ 143 (382) T protein:vir:48 73 IV-----DNPSN-NANRFNFYQSIFAQMLLGGEAFAYRWR-NENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITF--DDP 143 (382) T ss_pred hh-----hhcCC-CCCHHHHHHHHHHHhhhcCCEEEEEEE-CCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEe--cCc Confidence 11 12333 258999999999999999877665433 344556777777777665433 233333222211 111 Q ss_pred ccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccCCcccccCchh Q lcl|NC_013650. 274 MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQG 353 (1230) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~ 353 (1230) ..+....++..-+-|++....-=...|.+.+..+..++..-....+...+.+...+.+.-+.++... . +.+ T Consensus 144 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~----~-----~~e 214 (382) T protein:vir:48 144 RIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGG----G-----LLD 214 (382) T ss_pred cccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC----C-----ChH Confidence 2223345666667788765432235688888788888888777777777766666666666666421 1 113 Q ss_pred hHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhccchhhHHHHHH Q lcl|NC_013650. 354 ELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTGGAYASSALN 429 (1230) Q Consensus 354 ~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~ 429 (1230) ..+.+++.+.......-...+...++....+ ...+.|.++ +...+++..+++|...++... +......... T Consensus 215 ~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~-~~~~~~~~~~ 289 (382) T protein:vir:48 215 FKTKLSRSRQAMKQMQGGPLVLDDLEDFTPL----EIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQ-GDQQSSLEMS 289 (382) T ss_pred HHHHHHHHHHhhccCCCCeeEcCCCceEEEc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCcccHHHHH Confidence 3333333332222111111111122211111 112223333 334466677888877776432 2222222222 Q ss_pred HHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHHHHHHHHHHHhhhhhhhhhccceeeecccceecCCC Q lcl|NC_013650. 430 REFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEVKFSCVVPG 509 (1230) Q Consensus 430 ~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v~~~i~~~~~~~g~~~i~~~~~l~~~~~~~~Clt~D 509 (1230) +.++...+.-+...+......+ .+.+........+ +.....+ ...+..+...+..-+.....+.. + T Consensus 290 ~~~~~~~l~p~~~~i~~~l~~~--l~~~~~~~~~~~~-~~~~~~~-~~~~~~l~~~g~~t~~e~r~~l~----------~ 355 (382) T protein:vir:48 290 SDLYSKAVSRYLRPFLSELSQK--LSCDVDADIFPAV-DPTGSNY-ISRINSLVKTGTLAQNQGLYILQ----------Q 355 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHHH--hcChhhhhhhhhh-ccchhHH-HHHHHHHhhcCccCHHHHHHHHh----------h Confidence 2222222211111111111100 0000000000000 0000000 00011111111000000000000 0 Q ss_pred eeeec---ccccEehheeccCcEEEEecCCC Q lcl|NC_013650. 510 TQVLT---PEGQRNVEDIRPGDEVIAWDGTG 537 (1230) Q Consensus 510 T~VlT---~dG~k~IedL~vGD~V~t~~g~~ 537 (1230) .-+.+ .+|......++-||. +++. T Consensus 356 ~g~~~~~~~~~~~~~~~~~GGd~----~~~~ 382 (382) T protein:vir:48 356 AEILPKELPNGENPNSTLKGGEE----DGQD 382 (382) T ss_pred CCCCCcchhhhhcCCCCCCCCCC----CCCC Confidence 00111 001000111233332 1111 No 25 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=37.18 E-value=0.5 Score=22.22 Aligned_cols=333 Identities=10% Similarity=0.088 Sum_probs=123.6 Q ss_pred HHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHH--Hhhccccch------------hHHhhHHHhhhhhh Q lcl|NC_013650. 113 RHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYE--DLFFGEDLN------------YLEFLPDQFAREYF 178 (1230) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~------------~~~~~~~~~~~~~~ 178 (1230) -.|-..|-+..-+ +.|+....+.. ....|...+ .+++|-+.++.. T Consensus 1 M~~~~~f~~r~~~-------------------~~~~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~-- 59 (359) T protein:vir:10 1 MSILNPFERRSSI-------------------TPNNYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGT-- 59 (359) T ss_pred CcccchhhccccC-------------------CCCcchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcC-- Confidence 1122222211110 11111000000 000111111 233333333321 Q ss_pred hhcceeeccccchhcccchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhccc Q lcl|NC_013650. 179 TVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMRE 258 (1230) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (1230) +|.. +.-. ..-...|.- ..+++.||+..+.+|+..|-.--+.. +-..|...+..-.+|....+...+ T Consensus 60 ------p~~~-~~~~----~~L~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~g~~~~l~~l~~~~v~i~~~~ 126 (359) T protein:vir:10 60 ------RFIG-NQVF----TSVLNNPSH-LTNAFSFWQTAILNLLLNGNVFLAIL-KGDNSLMKELRLIPSNAITIDLTD 126 (359) T ss_pred ------cccc-chHH----HHHhhcccc-cCCHHHHHHHHHHhccccCceEEEEE-ECCCCeEEEEEEeCCceEEEEEcC Confidence 1110 0000 111122333 25899999999999998887543321 223454555555555544443334 Q ss_pred hhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccc----cccCCccccchHHHHHHHHHHHHHhhcccceeeceeEE Q lcl|NC_013650. 259 FQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAW----ATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVL 334 (1230) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (1230) ..+.+. +.....+....++..-+-|++...... -..|.+.+-.+.+++.+....++...+.+..-..+.-+ T Consensus 127 ~~~~y~-----~~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi 201 (359) T protein:vir:10 127 DTLTYE-----VNQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSV 201 (359) T ss_pred CeEEEE-----EEecCCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE Confidence 333221 111223444556666667887542111 12477766666777777777776666555444444444 Q ss_pred EeecccccCCcccccCchhhHHHHHHHhhhhhhhhhh--hhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhh Q lcl|NC_013650. 335 ATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR--LMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAW 408 (1230) Q Consensus 335 ~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l 408 (1230) .+++.. ..+++..+.+++.++...+..+. ..+...++..+.+ ...+.|.++ ....+++..++ T Consensus 202 l~~~~~--------~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l----~~~~~d~q~le~~~~~~~~Ia~~f 269 (359) T protein:vir:10 202 VKVPQG--------TLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTV----SINADVANYLNSMNWGRTQIAKAF 269 (359) T ss_pred EEeCCC--------CCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeee----cCCHHHHHHHHHHHHHHHHHHHHh Confidence 444321 23556777777776655432211 1111222222111 112223232 22345566788 Q ss_pred hhhhhhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchh--HHHHHHHHHHHhhh Q lcl|NC_013650. 409 GIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRV--PIYREIVEYDEETG 486 (1230) Q Consensus 409 ~i~~ali~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~--~v~~~i~~~~~~~g 486 (1230) +|-..++.. ++...+..+.........+... +..+..+-...+. .....+...-... ...+..+..+.+ T Consensus 270 gVPp~~lg~-~~~~~~~~~~~e~~~~~~l~~~---l~p~~~~l~~~l~---~~~~~~~~~~~~~d~~~~~~~~~~~~~-- 340 (359) T protein:vir:10 270 GVSDSYLNG-TGDQQSSLDQIKDLYVNALNRF---IEPLISELRIKCD---SSIGVDMSPITDYSNSVFKADILNWVK-- 340 (359) T ss_pred CCCHHHhCC-CCcccccHHHHHHHHHHHHHHH---HHHHHHHHHHHhh---hhhcccchhhhhcCHHHHHHHHHHHHh-- Confidence 887777642 2221111111111111111111 1111111000000 0011110000000 000000000000 Q ss_pred hhhhhhccceeeecccceecCCCeeeecccccEehheeccCcEEE Q lcl|NC_013650. 487 QEYIRKVPKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVI 531 (1230) Q Consensus 487 ~~~i~~~~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~ 531 (1230) .-|+|.+-.+.+.++.+ |+ T Consensus 341 -----------------------~G~~t~NE~R~~l~~~p---v~ 359 (359) T protein:vir:10 341 -----------------------EGIIEPTEAKTLLESKG---II 359 (359) T ss_pred -----------------------CCCcCHHHHHHHhCCCC---CC Confidence 01233333333333322 12 No 26 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=30.60 E-value=1.6 Score=19.52 Aligned_cols=385 Identities=13% Similarity=0.099 Sum_probs=139.1 Q ss_pred HHHHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccch----------------hHHhhHHHhhh Q lcl|NC_013650. 112 IRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLN----------------YLEFLPDQFAR 175 (1230) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~ 175 (1230) +--|-|+|=+...-...-. -+.++...+..++ .|++. ||-+.. -++++-+.++. T Consensus 1 MG~f~~lf~~~~~~~~~~~-----~~~~~~~~~~~~~---~~~~~--~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~ 70 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRS-----NYDEDIGIDISDS---NFWEK--FGIKLNFSVRGKRALKENTVYVCTKIRAESIGK 70 (422) T ss_pred CchhhhhhhccCCccchhh-----hhhhccccccCcc---hhhhh--ccccCCcccchhhhhccHHHHHHHHHHHHhhhh Confidence 2222233321110000000 0000000111111 11111 111111 13334343332 Q ss_pred hhhhh---cceeeccccchhcccchhhhhcC--chhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCc Q lcl|NC_013650. 176 EYFTV---GEVTSLAHFNESLGVWSSEEILN--PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPS 250 (1230) Q Consensus 176 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (1230) -=|.+ ++.+. +.- -...|+ |.= ..++..||+..+.+|.+.|...-+... ...|...+.+..+|. T Consensus 71 lp~~~~~~~~~~~-----~~~----~~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~r-~~~G~~~~L~~i~~~ 139 (422) T protein:vir:13 71 LSLKIYKDKEEYK-----EHE----LYYLLRYKPNP-LMSSINFWKCLETQRTLKGNAYAYIER-DRKGKIIGLYPINSD 139 (422) T ss_pred CceEEEecCcccc-----cch----HHHHHhhhccc-CCCHHHHHHHHHHHHhhcCCeEEEEEE-CCCCcEEEEEEECCc Confidence 21111 11110 000 011222 322 147889999999999999876444332 234566677777776 Q ss_pred chhhhccchhhhh-hhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceee Q lcl|NC_013650. 251 EREQRMREFQDLQ-RRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLY 329 (1230) Q Consensus 251 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (1230) ...+...+--++. ............++...++..-+-|++...+.=-..|.+++..+.+++..-...++...+.+..-+ T Consensus 140 ~v~~~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~ 219 (422) T protein:vir:13 140 NVTKIIDDDNFLSSLSKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGL 219 (422) T ss_pred ceEEEEcCCcceeccceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 6644333221111 111111112223444557777778888653322235888887888888877777777666665555 Q ss_pred ceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhh-hh--hhhhhhhhhhhccccccccccCCCcch----hhHHH Q lcl|NC_013650. 330 SPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-DF--RLMVHNFGLKVENVFGRESVPNLDADY----DRIER 402 (1230) Q Consensus 330 ~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~ 402 (1230) .+.-+.+++. + -+.+..+.+++.+.....- .+ ...+...++..+- -.+.+.|.++ ....+ T Consensus 220 ~p~gil~~~~-~--------l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~----l~~~~~d~q~le~~~~~~~ 286 (422) T protein:vir:13 220 SIKGIVQYVG-D--------LDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQP----ISLSMADAQFLENSKLTKR 286 (422) T ss_pred CccEEEEeCC-C--------CCHHHHHHHHHHHHHHhcCccccCCceecCCCceeee----ccCChhHHHHHHHHHHHHH Confidence 4544445442 1 1345566666665544321 00 0111112221111 1222333333 23455 Q ss_pred HHHhhhhhhhhhhhhccchhhHHHHHH-HHHHHHHHHHHHHHHhhhccchhhhHHHhh---h-hhhhhhcccchhHHH-- Q lcl|NC_013650. 403 KLLQAWGIGEALISGGTGGAYASSALN-REFVTQIMTGFQNALKRHIRRRCEVVAEAQ---G-HYDYDLKGGVRVPIY-- 475 (1230) Q Consensus 403 ~~~~~l~i~~ali~~~~g~~~~~~~~~-~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~---~-~~~~~~~~g~~~~v~-- 475 (1230) ++..+++|...++.......++..+.. ..+....+.-+...+......+ .+.+.. + ...++...-...+.. T Consensus 287 ~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~--Ll~~~~~~~g~~i~fd~~~l~r~d~~~~ 364 (422) T protein:vir:13 287 ELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQDK--LFSQYETLQDVKAEFNVDTILRSDIKTR 364 (422) T ss_pred HHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--hCChhhhcCCceEEeechhhhcCCHHHH Confidence 667788887777765444444443322 2222222222222221111110 000000 0 001110000000000 Q ss_pred HHHHHHHHhhhhhhhhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcceE Q lcl|NC_013650. 476 REIVEYDEETGQEYIRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRDEL 554 (1230) Q Consensus 476 ~~i~~~~~~~g~~~i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l 554 (1230) .+.+..+...|.--.... ..+.... +.++..++..-++.+|+++-.... + T Consensus 365 ~~~~~~~~~~G~~T~NE~R~~~gl~p-----~~ggD~~~~~~n~~~l~~~~~~~~-------------------~----- 415 (422) T protein:vir:13 365 YEAYRIGIQGGFIEANEARRRENLPP-----VEGGDRLLVNGNMIPIEMAGEQYK-------------------K----- 415 (422) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCC-----CCCcCeeeeccCccchhhcccccc-------------------c----- Confidence 001111111110000000 0000000 111222223333444433311000 0 Q ss_pred EEEEEcCce Q lcl|NC_013650. 555 VEVITKTGR 563 (1230) Q Consensus 555 ~rI~t~~G~ 563 (1230) --..+|+ T Consensus 416 --~g~~~g~ 422 (422) T protein:vir:13 416 --GGEKGGK 422 (422) T ss_pred --CCCcCCC Confidence 0001111 No 27 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=30.07 E-value=1.6 Score=19.46 Aligned_cols=369 Identities=11% Similarity=0.016 Sum_probs=136.4 Q ss_pred HHHHH--HhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccc------------hhHHhhHHHhhhhhhh Q lcl|NC_013650. 114 HWCRL--FYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL------------NYLEFLPDQFAREYFT 179 (1230) Q Consensus 114 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~ 179 (1230) -|-.+ |++...-- ...-.+..+.....|+.+..+|-... |+.. --+++|-+.++. T Consensus 1 m~m~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~v~~~~al~~~~v~~~v~~ia~~ia~---- 69 (392) T protein:vir:74 1 MILPILNFINQTNDP------PEAGSVQSYFPDGNDAQIMESLLGDN-NEWVSARAALRNSDLFSIILQLSSDLAI---- 69 (392) T ss_pred CcchhhhhhhcccCc------ccccccccccccCchhhhhhhccCCC-CcccchhhhhcchHHHHHHHHHHHhhcc---- Confidence 11111 11110000 00011222233333443333322111 1111 134445454442 Q ss_pred hcceeeccccchhcccchhhhhcCchhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhccc- Q lcl|NC_013650. 180 VGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMRE- 258 (1230) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 258 (1230) ++|..+...... -.-.|.- ..+...||+..+.+|++.|...-+.. +-..|...+..-.+|....+..+. T Consensus 70 ----lp~~~~~~~~~~----l~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~G~~~~L~~i~~~~v~v~~~~~ 139 (392) T protein:vir:74 70 ----VKINAEKKKNQG----IIDNPST-NANKHGFWQSMFAQLLLGGEAFAYRW-RNANGADMKWEYLRPSQVNTYYFEY 139 (392) T ss_pred ----Cceeeccchhhh----hhhhcCC-CCCHHHHHHHHHHHhhhcCCEEEEEE-ECCCCcEEEEEEEcCceeEEEEcCC Confidence 122222222111 1112433 26899999999999999998755543 334566777777777766554432 Q ss_pred -hhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEee Q lcl|NC_013650. 259 -FQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATL 337 (1230) Q Consensus 259 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 337 (1230) +.+.+.. -. .....+.-..++..-+-|++.-..--...|.+.+..+.+++.+-....+...+.+..-+.+.-+.++ T Consensus 140 ~~~~~y~~-~~--~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~ 216 (392) T protein:vir:74 140 ENGMYYNI-TF--DDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTV 216 (392) T ss_pred CceEEEEE-Ee--cCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 2222211 11 0111111234666667788764322124588888778888877777777766666655666656666 Q ss_pred cccccCCcccccCchhhHHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhh Q lcl|NC_013650. 338 GIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEA 413 (1230) Q Consensus 338 ~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~a 413 (1230) +... ..+++..+...+.+.+.-.. ....+...++..+-+ .+.+.|.++ ....+++..+++|-.. T Consensus 217 ~~~~-------~~~~~~~~~~~~~~~~~~n~-g~~~vl~~g~~~~~l----~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 284 (392) T protein:vir:74 217 KGGG-------LLSDKDKASRSRSFMKRSRS-GGPVVLDDLEEFTAL----EIKSNVAQLLSQTDWTSKQYAKVYGLPDS 284 (392) T ss_pred CCCC-------CchHHHHHHHHHHHhccccC-CCeeecCCCceEEEc----cCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 4311 12223333333333221100 000111112211111 112233333 3344567778888777 Q ss_pred hhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHHHHHHHHHHHhhhhhhhhhc Q lcl|NC_013650. 414 LISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV 493 (1230) Q Consensus 414 li~~~~g~~~~~~~~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v~~~i~~~~~~~g~~~i~~~ 493 (1230) ++. +.+...+.....+.++...+.-+...+......+ .+..+.......+ ..... .....+..+...+.--.... T Consensus 285 ~lg-~~~~~~~~~e~~~~~~~~~l~p~~~~ie~~l~~~--l~~~~~~~~~~~~-~~d~~-~~~~~~~~l~~~g~~t~nea 359 (392) T protein:vir:74 285 YIG-GQGDQQSSIQQISGMYASALNRYLRPAISELEYK--LSDHISVNMRPAI-DPLGD-NYLSTISTATRWGALAENQA 359 (392) T ss_pred HhC-CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh--ccchhcccchhhh-cCCHH-HHHHHHHHHHhCCCcCHHHH Confidence 764 3333333222222222222211111111111000 0000000000000 00000 00011111111110000000 Q ss_pred cceeeecccceecCCCeeeecccccEehhee---ccCcEEEEecCCCeEEE Q lcl|NC_013650. 494 PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDI---RPGDEVIAWDGTGYVVD 541 (1230) Q Consensus 494 ~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL---~vGD~V~t~~g~~~~v~ 541 (1230) ..+. ...- +++++++..+++ ..||. . ..+. T Consensus 360 r~~~----------~~~g-~~pne~r~~enl~~~~~Gd~-----~--~p~p 392 (392) T protein:vir:74 360 TFVL----------QEAG-YIPKDLPAPENTNKKTTGQS-----N--EPVP 392 (392) T ss_pred HHHH----------HhCC-CCccccchhcCCCCCCCCCC-----C--CCCC Confidence 0000 0000 123333333332 22331 0 1111 No 28 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=29.08 E-value=0.96 Score=20.68 Aligned_cols=375 Identities=11% Similarity=0.076 Sum_probs=128.0 Q ss_pred HHHHHhhhcCccccccccccchhHHHHHHHhhccccc------------------hhHHhhHHHhhhhhhhhcceeeccc Q lcl|NC_013650. 127 PLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL------------------NYLEFLPDQFAREYFTVGEVTSLAH 188 (1230) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (1230) =-++|-....|.+...-++. .++...+.+... --+++|-+.++.-=|+| +. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~---~~--- 69 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPI-----ELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQA---FE--- 69 (423) T ss_pred CchhHhhccccccccCcccc-----ccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEE---EE--- Confidence 11222222223332211111 111111111100 12344444343321111 00 Q ss_pred cchhcccchhhhhcCchhhh--------cchhhhhccchheeeehhhhhccccccccccc--cccceeecCcchhhhcc- Q lcl|NC_013650. 189 FNESLGVWSSEEILNPDMLR--------VSRSMFVQRERVQLMVKDLVDHLRQGPTTAGG--NMSTVEETPSEREQRMR- 257 (1230) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~- 257 (1230) -+ .+|- ++++.|....+ .+...||+..+.+|++.|..--+-.. -.++. ....+-.++....++.. T Consensus 70 ~~-~dg~--~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r-d~~~~~~~~~l~p~~~~~v~~~~~~ 145 (423) T protein:vir:81 70 RV-EDGG--RERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPG-DLGVDTPTLDIRPIPVSWVQRRAYK 145 (423) T ss_pred Ee-cCCc--eeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-cCCcCcceEEEeecccceeeeeecc Confidence 00 0110 23333333222 36899999999999998875433211 12222 12222222222222111 Q ss_pred --chhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEE Q lcl|NC_013650. 258 --EFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLA 335 (1230) Q Consensus 258 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (1230) ...+.+ ++ .+.....|.-+.++..-+-|+++-..----.|.+++..+..++..-....+...+.+...+.+.-+. T Consensus 146 ~~~~~~~Y-~~--~~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi 222 (423) T protein:vir:81 146 DGWGSLDY-II--IESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVI 222 (423) T ss_pred CCCcceEE-EE--EEecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 111111 11 0011223444667777778887442111125887777777777777777777666655555554444 Q ss_pred eecccccCCcccccCchhhHHHHHHHhhhhhhh--hh--hhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhh Q lcl|NC_013650. 336 TLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA--DF--RLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQA 407 (1230) Q Consensus 336 ~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~ 407 (1230) +.... +.++ --+.+..+.+++.++..+.. .. ...+...++...- -+..+.|.++ ....+++..+ T Consensus 223 ~~~~~-~~~~---~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~----l~~s~~d~q~~e~~~~~~~eIa~~ 294 (423) T protein:vir:81 223 MRDPE-SKAG---KWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAEN----FHTTSKDEQTVETTKLSLQTVAQV 294 (423) T ss_pred EecCc-ccCc---cCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEe----ccCChhhHHHHHHHHhhHHHHHHH Confidence 43321 1000 11345566666655444311 00 1111122221111 1223334333 2344556778 Q ss_pred hhhhhhhhhhccchhhHHHHH-HHHHHHHHHHHHHHHHhhhccchhhhHHHhh----h-hhhhhhcccchhHHHHHHHHH Q lcl|NC_013650. 408 WGIGEALISGGTGGAYASSAL-NREFVTQIMTGFQNALKRHIRRRCEVVAEAQ----G-HYDYDLKGGVRVPIYREIVEY 481 (1230) Q Consensus 408 l~i~~ali~~~~g~~~~~~~~-~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~----~-~~~~~~~~g~~~~v~~~i~~~ 481 (1230) ++|-..++....+..++..+. .+.+....+.-+...++.....+. +.+.. + .+.++...=...+. .. T Consensus 295 fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L--~~~~~~~~~~~~~~fd~~~llr~d~-----~~ 367 (423) T protein:vir:81 295 YGINPTMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNLFL--LPRVGIDNEKFYFEFNLEEKLRASF-----EE 367 (423) T ss_pred hCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhh--cCccccccCccEEEecchhhhccCH-----HH Confidence 888766664333444443222 122222222222222221111110 00000 0 00011000000000 00 Q ss_pred HHhhhhhhhhhccceeeecccceecCCCeeeecccccEehhee---ccCcEEEEecCCCeEEEeeEEEeecCcceEEEEE Q lcl|NC_013650. 482 DEETGQEYIRKVPKLLIPEVKFSCVVPGTQVLTPEGQRNVEDI---RPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVI 558 (1230) Q Consensus 482 ~~~~g~~~i~~~~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL---~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~ 558 (1230) ..+.....+ .+.-|+|.+-++.+..+ .-||.++.... ... ..... T Consensus 368 r~~~~~~~l-----------------~~~G~~T~NE~R~~~gl~p~~gGD~~~~p~n-~~~-------~~~~~------- 415 (423) T protein:vir:81 368 AAEIKRAAV-----------------GNVAWMTINEVRAMDNLPSIDGGDDLARPLN-TEF-------GDSED------- 415 (423) T ss_pred HHHHHHHHH-----------------hCCCCcCHHHHHHHhCCCCCCCcceeecccc-ccc-------CccCC------- Confidence 000000000 01113333332222222 22554432110 000 00000 Q ss_pred EcCceEEEE Q lcl|NC_013650. 559 TKTGRTIRC 567 (1230) Q Consensus 559 t~~G~~L~~ 567 (1230) ..|.+..+ T Consensus 416 -~~~~~~~t 423 (423) T protein:vir:81 416 -APGEEVET 423 (423) T ss_pred -CCCCCCCC Confidence 00000000 No 29 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=28.24 E-value=1.7 Score=19.26 Aligned_cols=425 Identities=9% Similarity=0.077 Sum_probs=147.2 Q ss_pred cCCCCcc---hhhHHHHHH--HHHHHHhcccchhHHHHHhhhcCccccccccccchhHHHHHHHhhccccchhHHhhHHH Q lcl|NC_013650. 98 GIPFNVE---DEEELRVIR--HWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQ 172 (1230) Q Consensus 98 ~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (1230) |.-+=++ .+.+.+.++ .|-..| ..+-+++.---..|.. ++.+-..+. ..-.--++++-+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~g~~-v~~~~al~~-------~~V~~~v~~Ia~~ 65 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLF-------QAVAEPFAGAWQQGVK-ADPEAVLSF-------HAVFACISLISQD 65 (454) T ss_pred CCCccccCcccccccccccchhhhhhh-------hhhhhhhcchhhcCcc-cChHHhhcc-------HHHHHHHHHHHHh Confidence 2211111 111111110 122221 1111111100000100 111111000 0000123444443 Q ss_pred hhhhhhhhcceeeccccc-hhcccchhhhhcC---------chhhhcchhhhhccchheeeehhhhhccccccccccccc Q lcl|NC_013650. 173 FAREYFTVGEVTSLAHFN-ESLGVWSSEEILN---------PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNM 242 (1230) Q Consensus 173 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (1230) ++.==|. .+. ...|.+ +++-+ |-- ..++..||+..+.+|++.|...-+... ...|.+. T Consensus 66 iA~lp~~--------~~~~~~~g~~--~~~~~~~~~~L~~~PN~-~~t~~~f~~~l~~~lll~Gna~~~i~r-~~~G~~~ 133 (454) T protein:vir:93 66 IAKMRLR--------LMQTDAQGIR--RETRRGDIARLCRRPNA-QQNRIQFFELWLNAKLRHGNTVVLKIR-NARGQIK 133 (454) T ss_pred hccCceE--------EEEeccCCcc--chhhhHHHHHHHhcCCC-CCCHHHHHHHHHHHHhhcCceEEEEEE-CCCCcEE Confidence 4322111 111 011211 11212 222 247889999999999999987766543 2346677 Q ss_pred cceeecCcchhhhc-cchhhhhhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHh Q lcl|NC_013650. 243 STVEETPSEREQRM-REFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQ 321 (1230) Q Consensus 243 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (1230) +....+|....+.. .++.+.+.-... ...+.+..+.++..-+-|++.....--..|.+++..+.+++.+-....+.. T Consensus 134 ~L~~i~~~~v~v~~~~~g~~~y~~~~~--~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~ 211 (454) T protein:vir:93 134 ELRILDWNRVEPLVADDGEVFYRITPD--RNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENS 211 (454) T ss_pred EEEEEcCcceEEEEcCCCcEEEEEEec--cccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHH Confidence 88888887775443 344443321111 222333445677777888886533223368888878888888777777776 Q ss_pred hcccceeeceeEEEeecccccCCcccccCchhhHHHHHHHhhhhhhhhhh--hhhhhhhhhhccccccccccCCCcch-- Q lcl|NC_013650. 322 DAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR--LMVHNFGLKVENVFGRESVPNLDADY-- 397 (1230) Q Consensus 322 ~~~~~~~~~~~~~~~l~~~d~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~~~-- 397 (1230) .+.+..-+.+.-+.+++. + -+.+..+.+++.++......+. ..+...++..+ .-.+.+.|.++ T Consensus 212 ~~~f~ng~~p~gil~~~~-~--------l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~----~l~~~~~d~q~le 278 (454) T protein:vir:93 212 TSFFRNGGRPSGVIEIPG-S--------ITEENAKKLKSNWDSGYTGENAGKTAILSNGAKYN----PTTFSPVDSQTVE 278 (454) T ss_pred HHHHhccCCccEEEecCC-C--------CCHHHHHHHHHHHHHHhcccccCCceeccCCceEE----EcccChhHHHHHH Confidence 665555454544455542 1 1346677777766655532110 11111121111 11222333333 Q ss_pred --hhHHHHHHhhhhhhhhhhhhccchhhHHHHHH-HHHHHHHHHHHHHHHhhhccchhhhHHHhhhhhhhhhcccchhHH Q lcl|NC_013650. 398 --DRIERKLLQAWGIGEALISGGTGGAYASSALN-REFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPI 474 (1230) Q Consensus 398 --~~i~~~~~~~l~i~~ali~~~~g~~~~~~~~~-~~~~~k~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~g~~~~v 474 (1230) ....+++..+++|-..++....+..++..+.. +.+....+.-+...+......+. +........++...=+..+. T Consensus 279 ~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L--~~~~~~~~~f~~~~ll~~D~ 356 (454) T protein:vir:93 279 QLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEAL--ETGENESTEFDVTTLLRMDS 356 (454) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh--cCCCCcEEEeechhhhccCH Confidence 22345666678887777654444445433322 22222222222222221111110 00000000111000000000 Q ss_pred HH--HHHHHHHhhhhhhhhhcc-ceeeecccceecCCCeeeecccccEehheeccCcEEE---EecCCCeEEEeeEEEee Q lcl|NC_013650. 475 YR--EIVEYDEETGQEYIRKVP-KLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVI---AWDGTGYVVDTALHTGI 548 (1230) Q Consensus 475 ~~--~i~~~~~~~g~~~i~~~~-~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~---t~~g~~~~v~~v~~~~~ 548 (1230) .. +....+...|.--..... .+. ...+.++..++......+++++...+..- ..+++..... T Consensus 357 ~~r~~~~~~~~~~G~~T~NE~R~~~g-----l~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 424 (454) T protein:vir:93 357 ERRMKTLGDAVKNTLLTPNEARKREN-----LPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVP------- 424 (454) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhC-----CCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCC------- Confidence 00 001111111100000000 000 01112222222222222333332211100 0000000000 Q ss_pred cCcceEEEEEEcCceEEEEcCCcceEeecCcchhhhhhcchhcccc Q lcl|NC_013650. 549 EHRDELVEVITKTGRTIRCTADHPFWTDQGWVKAQDLTDEIAIRTS 594 (1230) Q Consensus 549 ~~~~~l~rI~t~~G~~L~~Tp~H~f~v~~g~~~~~~~~~~~~~~~~ 594 (1230) . .....+|. .-............-.+ +... T Consensus 425 ~------~~~~~d~~--------~~~~e~~~d~~~~~~~~--~~~~ 454 (454) T protein:vir:93 425 Q------AVAASDGN--------KAITETEHDAVKAMFRG--ILKK 454 (454) T ss_pred C------CCCCCCCC--------CCccCCccchhhhhhhh--hhcC Confidence 0 00000000 00000000000000000 0000 No 30 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=25.16 E-value=2.1 Score=18.83 Aligned_cols=435 Identities=13% Similarity=0.062 Sum_probs=165.8 Q ss_pred HHHHHHhcCCCCCCchhHHHHHhhhhhcCCcchhHHHHHHhhhhhhhhHHHHHHHHhccCcccceeecCchhhhhhHhhh Q lcl|NC_013650. 16 VNRLRKAGVNMPNSPTMARAQAAALQNTVNNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLA 95 (1230) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (1230) +|-+.+ |.. ++.+ +.++|. ....|+-.+ -|..++- .+ .+.+ T Consensus 1 m~~~~~-~~~-------------------a~~~-----~~~~~~------~~~~y~aa~-~~~~~~~-~~------~~s~ 41 (495) T protein:vir:10 1 MNMTPS-GYQ-------------------SLAS-----GLLVPV------GASAYEGAS-GGHRWQD-IG------DYGP 41 (495) T ss_pred CCcccc-ccc-------------------ccch-----hhhhHH------Hhhhhhccc-cCcccCC-CC------CCCh Confidence 111111 111 1111 111111 111232211 1111110 00 1111 Q ss_pred hhcCCCCcchhhHHHHHHHHHHHHhcccchhHHHHHhhhcCcc-cccccccc------chhHHHHHHHhh----ccccch Q lcl|NC_013650. 96 DKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPV-VGMEFDSK------DPLIKTFYEDLF----FGEDLN 164 (1230) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~------~~~~~~~~~~~~----~~~~~~ 164 (1230) |-+-.-.++.+|.=||..|+.++++.-.|+.+...=| .|+...++ ...|+..|+... +...+| T Consensus 42 ------d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~ 115 (495) T protein:vir:10 42 ------DTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQS 115 (495) T ss_pred ------hHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccC Confidence 2233456778898999999999999999998776633 34444443 445666666654 222344 Q ss_pred hHHhhHHHhhhhhhhhcceeeccccch-hcc-cc-hhhhhcCchhhhcchhhhhccchheeeehhhhhcccccccccccc Q lcl|NC_013650. 165 YLEFLPDQFAREYFTVGEVTSLAHFNE-SLG-VW-SSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGN 241 (1230) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (1230) +.. |....-|.++.-||+|-.-+... ..| -| -+-+.|+||+|......-.... =.-++.| T Consensus 116 f~~-lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~-g~~i~~G--------------- 178 (495) T protein:vir:10 116 FYG-LQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPS-GGYVKGG--------------- 178 (495) T ss_pred HHH-HHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCC-CCEEEec--------------- Confidence 443 44558899999999986655442 222 23 2679999999743322110000 0001111 Q ss_pred ccceeecCcchhhhccchhhhhhhhhH-HHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHH Q lcl|NC_013650. 242 MSTVEETPSEREQRMREFQDLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAA 320 (1230) Q Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (1230) +|.+-.-+-+ -|-+ .+..|- -+......+.+.+|-..|-|+..+ ..-..||-|. |.+-..|...+.+.++ T Consensus 179 ---Ie~d~~Gr~v---aY~i-~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~-r~gQ~RGis~-la~i~~l~~l~~y~da 249 (495) T protein:vir:10 179 ---IRFSNGGKRK---AYCF-YRNHPAESSLIGDPVDTVWIKAEHVLHVTVL-TVRSDAGAPW-FQLLLRLNELDQYEDA 249 (495) T ss_pred ---eEECCCCceE---EEEE-eecCCCcccccccccceeeechhheEecccc-CCCcccCcch-hHHHHHHHHhhHHHHH Confidence 1111000000 1111 111232 111122334466777778898765 5668889874 4554344444445444 Q ss_pred hhcccceeeceeEEEe--ecccccCCcccccCc-hhh-HHHHHHHhhhhhhhhhhhhhhhhhhhhccccccccccCCCcc Q lcl|NC_013650. 321 QDAVADRLYSPLVLAT--LGIEDMGDGEPWIPD-QGE-LDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDAD 396 (1230) Q Consensus 321 ~~~~~~~~~~~~~~~~--l~~~d~~~~~~~~~~-~~~-l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 396 (1230) +..-+..-+.-..+.+ .+.... +.. ..+ .++ -......++... +....-|..++...-..-..+.... T Consensus 250 el~~a~i~A~~~~fi~~~~~~~~~--~~~-~~~~~~~~~~~~~~~l~pG~-----i~~L~pGe~i~~~~p~~p~~~~~~f 321 (495) T protein:vir:10 250 ELVRKKTAALFAAFIQEATADSTG--GPT-IGQPKRSKGGKRITGLNPGT-----LQYLQPGQEVKFSNPADVGTTYEPW 321 (495) T ss_pred HHHHHHHhhhheeeeecCCCcccc--ccc-cCccccccCcccceecCCce-----eeecCCCCeeeeeCCCCCCCCHHHH Confidence 3221111111111222 222111 110 110 000 000000010100 1111223333332222112233333 Q ss_pred hhhHHHHHHhhhhhhhhhhhhcc-chhhHHHHHHHHHHHHHHHHHHH-HHhhhccc--hhhhHHHh--hhhhh------- Q lcl|NC_013650. 397 YDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQN-ALKRHIRR--RCEVVAEA--QGHYD------- 463 (1230) Q Consensus 397 ~~~i~~~~~~~l~i~~ali~~~~-g~~~~~~~~~~~~~~k~~~~~~~-~l~~~~~~--~~~~l~~~--~~~~~------- 463 (1230) ...+.+.+..++|+...+++++- +..|++....+..+.+.+...+. .+...... ...+|.+. .+... T Consensus 322 ~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~ 401 (495) T protein:vir:10 322 LRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQR 401 (495) T ss_pred HHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhh Confidence 46688888889999998888776 45688877665544444444332 12111111 01112111 11110 Q ss_pred ---h----hhcccc-hhHHHHHHHHH--------------HHhhhhhhhhh----ccceeeecccceecCCCeeeecccc Q lcl|NC_013650. 464 ---Y----DLKGGV-RVPIYREIVEY--------------DEETGQEYIRK----VPKLLIPEVKFSCVVPGTQVLTPEG 517 (1230) Q Consensus 464 ---~----~~~~g~-~~~v~~~i~~~--------------~~~~g~~~i~~----~~~l~~~~~~~~Clt~DT~VlT~dG 517 (1230) + +...+. .++-.++.... ..+.|..+... +.........+.-|..+....+..| T Consensus 402 ~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~ 481 (495) T protein:vir:10 402 RRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSG 481 (495) T ss_pred hHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCcc Confidence 0 000000 00001111000 00111100000 0000000000111111212222222 Q ss_pred cEe--hheeccCcE Q lcl|NC_013650. 518 QRN--VEDIRPGDE 529 (1230) Q Consensus 518 ~k~--IedL~vGD~ 529 (1230) -.+ -++=..+|+ T Consensus 482 ~~~~~~~~~~~~~e 495 (495) T protein:vir:10 482 AEQKSVMEAALNNE 495 (495) T ss_pred CCCCCCCCCCCCCC Confidence 111 111111111 No 31 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=22.31 E-value=2.5 Score=18.43 Aligned_cols=371 Identities=11% Similarity=0.059 Sum_probs=130.4 Q ss_pred HHhhhcCccccccccccchhHHHHHHHhhccccc---------------hhHHhhHHHhhhhhhhh----cceeeccccc Q lcl|NC_013650. 130 IDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDL---------------NYLEFLPDQFAREYFTV----GEVTSLAHFN 190 (1230) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 190 (1230) +-++.+ ..-.+.+.|+-+..|+ .|... --+++|-+.++.=-|++ |+++. + T Consensus 1 m~~f~~---~~~~~~~~~~~~~~~~----~~~~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~----~ 69 (406) T protein:vir:97 1 MSFFQP---LGTSKVSYDDYISSVL----AGDVSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIH----D 69 (406) T ss_pred Cccccc---cCCCCCCcchHHHHHh----cCCCCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCccccc----c Confidence 111111 1111223333333332 11111 13445555454322211 22111 0 Q ss_pred hhcccchhhhhcC--chhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhcc-chhhhhhhhh Q lcl|NC_013650. 191 ESLGVWSSEEILN--PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMR-EFQDLQRRYP 267 (1230) Q Consensus 191 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 267 (1230) +.+ ...|| |-= ..++..||+..+.+|++.|..--+......+|...+..-.+|....+... .+.+.+ +| T Consensus 70 ~~~-----~~lL~~~PN~-~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y-~~- 141 (406) T protein:vir:97 70 EDI-----NYLLNVKSTS-NASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEIVY-TF- 141 (406) T ss_pred chH-----HHHhhccCCC-CCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEE-EE- Confidence 111 12232 322 25899999999999999887765543322345566766666666544332 232222 11 Q ss_pred HHHhhhccCCCccccchhhhhhccCC-ccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeecccccCCcc Q lcl|NC_013650. 268 EIIQAAMQNDGLDISEALISRVVNRP-TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGE 346 (1230) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~~ 346 (1230) .....+..+.++..-+-|++.-. .+ ..|.+.+.....++.+-...++...+.+..-..+..|...+. T Consensus 142 ---~~~~~~~~~~~~~~evih~r~~~~dg--~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~------- 209 (406) T protein:vir:97 142 ---TDMLTAKQVKCFAHDVIHWKFFSHDT--ILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGA------- 209 (406) T ss_pred ---EecCCceEEEEccccEEEecCCCCCC--cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCC------- Confidence 22234444567666677887431 12 237777766666666655555554444433344444433221 Q ss_pred cccCchhhHHHHHHHhhhhhhhhhh--hhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhhhhccc Q lcl|NC_013650. 347 PWIPDQGELDEVRDDMQSLLAADFR--LMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTG 420 (1230) Q Consensus 347 ~~~~~~~~l~~~~d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali~~~~g 420 (1230) .-+.+..+.+++.++....-.+. ..+...++...-+ .+.+.|.++ ....+++..+++|-..++... T Consensus 210 --~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l----~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~-- 281 (406) T protein:vir:97 210 --QLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEYTPL----EIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVN-- 281 (406) T ss_pred --CCCHHHHHHHHHHHHHHhcccccCceeecCCCceEEEc----cCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCC-- Confidence 12446677777776655532110 0111112111111 112222222 223455666888877776421 Q ss_pred hhhHH-HHHHHHHHHHHHHHHHHHHhhhccchhhhHHHh--hh-hhhhhhcccchhHHHHHHHHHHHhhhhhhhhhc-cc Q lcl|NC_013650. 421 GAYAS-SALNREFVTQIMTGFQNALKRHIRRRCEVVAEA--QG-HYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV-PK 495 (1230) Q Consensus 421 ~~~~~-~~~~~~~~~k~~~~~~~~l~~~~~~~~~~l~~~--~~-~~~~~~~~g~~~~v~~~i~~~~~~~g~~~i~~~-~~ 495 (1230) ..++. ....+.++...+.-+...+......+ .+... .. ...+++..-.... ...+ ......|---.... .. T Consensus 282 ~~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~k--ll~~~~~~~~~i~fd~~~~~~~~-~~~~-~~~~~~g~~T~NE~R~~ 357 (406) T protein:vir:97 282 SPNQSVAQLMEDYVTNDLPFYFDAITSELGLK--TLNDKDRRLYHIEFDTRSVTGRN-VDEI-VKLVNNQILTPNQGLVE 357 (406) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHhhh--hcChhhccceeEEEecCccchhh-HHHH-HHHHhCCCcCHHHHHHH Confidence 12222 12222222222222222221111110 00000 00 0111111000000 0000 00000000000000 00 Q ss_pred eeeecccceecCCCeeeecccccEehheeccC-cEEEEecCCCeEEEeeEEEeecCcceEEEEEEcCceEEEEcCCcc Q lcl|NC_013650. 496 LLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPG-DEVIAWDGTGYVVDTALHTGIEHRDELVEVITKTGRTIRCTADHP 572 (1230) Q Consensus 496 l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vG-D~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~G~~L~~Tp~H~ 572 (1230) +......+ ..+...++..++.+++.+..+ |..-+ ..+ |-+.....+-- T Consensus 358 ~g~~p~~~---~~gD~~~~~~n~~~~~~~~~~~~~~~~-------------------------~~~-gg~~~~~~~~~ 406 (406) T protein:vir:97 358 LGKQKSTD---PNMDRYQSSLNYVFLDKKEEYQDKVGI-------------------------KGK-GGEVNAEEDKS 406 (406) T ss_pred hCCCCCCC---CCCCeEeeccCccchhccccccccccc-------------------------ccC-CCCCCCCCCCC Confidence 00000000 001122222223333322110 00000 000 00000000100 No 32 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=21.70 E-value=2.6 Score=18.35 Aligned_cols=378 Identities=12% Similarity=0.122 Sum_probs=134.5 Q ss_pred hhhcCcccccc-ccccchhHHHHHHHhhccccc---------------------hhHHhhHHHhhhhhhhhcceeecccc Q lcl|NC_013650. 132 IYSKFPVVGME-FDSKDPLIKTFYEDLFFGEDL---------------------NYLEFLPDQFAREYFTVGEVTSLAHF 189 (1230) Q Consensus 132 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (1230) .+ -|.+..-. --++++ .|++.++.+... --+++|-+.++.-=|.| -.- T Consensus 1 m~-~~~~~~~~~~~~s~~---~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~------~~~ 70 (421) T protein:vir:10 1 MF-IPQMFEGKKRSVSGG---GFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVEL------YRR 70 (421) T ss_pred CC-CcchhcccccccCcc---hhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEE------EEE Confidence 00 11111000 001112 233333322111 13444444444322211 001 Q ss_pred chhcccchh-----hhhcC--chhhhcchhhhhccchheeeehhhhhccccccccccccccceeecCcchhhhccchhhh Q lcl|NC_013650. 190 NESLGVWSS-----EEILN--PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDL 262 (1230) Q Consensus 190 ~~~~~~~~~-----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (1230) +...+.... -..|| |.- ..++..||+..+.+|++.|..--+.. +...|...+.+-.+|....+.....-.+ T Consensus 71 ~~~g~~~~~~~~~l~~lL~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~G~~~~L~~l~~~~v~v~~~~~g~~ 148 (421) T protein:vir:10 71 DKNGGRQRATDHPIYDLIHSQPNK-KDTSFEYFEQQQGLLGLEGNCYSIID-RDGKGYPKELIPINPKKVIVLKGPDGMP 148 (421) T ss_pred cCCCceeecccchHHHHHhhcccC-CCCHHHHHHHHHHHHhhcCCeEEEEE-EcCCCcEEEEEEecCceEEEEECCCceE Confidence 111111110 11221 333 25799999999999999987755543 3344556666666666554422221112 Q ss_pred hhhhhHHHhhhccCCCccccchhhhhhccCCccccccCCccccchHHHHHHHHHHHHHhhcccceeeceeEEEeeccccc Q lcl|NC_013650. 263 QRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDM 342 (1230) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~ 342 (1230) +-++.+ .+ -.++...+-|++.-..- -..|.+++-.+.+++.+-....+...+.+..-+.+.-+.+... ++ T Consensus 149 ~y~~~~------~g--~~~~~~eiih~~~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~ 218 (421) T protein:vir:10 149 YYEIPE------IG--ETLPMRMMHHVKVFSLD-GYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPK-EA 218 (421) T ss_pred EEEEcC------CC--cEEchhhEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecC-cc Confidence 222211 12 23556666777653211 1237766656666676666666665555544444433333331 11 Q ss_pred CCcccccCchhhHHHHHHHhhhhhhh-h--hhhhhhhhhhhhccccccccccCCCcch----hhHHHHHHhhhhhhhhhh Q lcl|NC_013650. 343 GDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALI 415 (1230) Q Consensus 343 ~~~~~~~~~~~~l~~~~d~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~----~~i~~~~~~~l~i~~ali 415 (1230) .-.-+++.++.++..++....- . ....+...++..+.+. ..+.|.++ ....+++..+++|-..++ T Consensus 219 ----~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~----~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 290 (421) T protein:vir:10 219 ----PAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMS----QDNEKAQLLQSRQWGVEEVCRLYKIPPHMV 290 (421) T ss_pred ----CccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecC----CChhHHHHHHHHHHhHHHHHHHhCCCHHHc Confidence 0023567777777665555421 1 1112222233222221 22333333 235566677888887777 Q ss_pred hhccchhhHHHHHHH-HHHHHHHHHHHHHHhhhccchhhhHHH--hhh-hhhhhhcccchhHHHH--HHHHHHHhhhhhh Q lcl|NC_013650. 416 SGGTGGAYASSALNR-EFVTQIMTGFQNALKRHIRRRCEVVAE--AQG-HYDYDLKGGVRVPIYR--EIVEYDEETGQEY 489 (1230) Q Consensus 416 ~~~~g~~~~~~~~~~-~~~~k~~~~~~~~l~~~~~~~~~~l~~--~~~-~~~~~~~~g~~~~v~~--~i~~~~~~~g~~~ 489 (1230) ......+++..+... .+....+.-+...++.....+. +.. ..+ ...++...-...+... +........|.-- T Consensus 291 g~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL--~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T 368 (421) T protein:vir:10 291 QMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDL--LLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLS 368 (421) T ss_pred CCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhc--cCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcC Confidence 555545554433221 1111121111111111111000 000 000 0011100000000000 0111111111000 Q ss_pred hhhc-cceeeecccceecCCCeeeecccccEehheeccCcEEEEecCCCeEEEeeEEEeecCcceEEEEEEcC Q lcl|NC_013650. 490 IRKV-PKLLIPEVKFSCVVPGTQVLTPEGQRNVEDIRPGDEVIAWDGTGYVVDTALHTGIEHRDELVEVITKT 561 (1230) Q Consensus 490 i~~~-~~l~~~~~~~~Clt~DT~VlT~dG~k~IedL~vGD~V~t~~g~~~~v~~v~~~~~~~~~~l~rI~t~~ 561 (1230) +... ..+.... +.++..++.+-....++++..++.--+.. ...+.-.+.+.. T Consensus 369 ~NE~R~~~gl~p-----~~ggD~~~~~~n~~~~~~~~~~~~~~~~~---------------~~~e~d~~~~~~ 421 (421) T protein:vir:10 369 VNDIRRMENLPP-----IAGGDKYLTPLNMVDSAQIIPGDKKPTAQ---------------QMAEIDTILSRT 421 (421) T ss_pred HHHHHHHhCCCC-----CCCcceeeeccccccccccccCCCCcccc---------------cCcccccccccC Confidence 0000 0001111 11122222333333344443333211100 000011111110 Done!