Query lcl|NC_018835.1_cdsid_YP_006906096.1 [gene=NJ01_029] [protein=bacterial surface protein] [protein_id=YP_006906096.1] [location=22732..25749] Match_columns 1005 No_of_seqs 551 out of 2167 Neff 10.2 Searched_HMMs 1612 Date Thu Nov 7 14:01:37 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_25 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_25_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8837 Length: 513 # 100.0 5.8E-81 3.6E-84 460.5 39.1 497 415-1005 1-513 (513) 2 protein:vir:118 Length: 449 # 98.7 3.7E-09 2.3E-12 66.8 16.7 207 1-227 230-449 (449) 3 protein:vir:9268 Length: 472 # 98.1 4.8E-06 3E-09 49.7 27.9 429 458-1005 1-462 (472) 4 protein:vir:5202 Length: 448 # 98.0 1.4E-06 8.6E-10 52.7 15.2 206 1-227 236-448 (448) 5 protein:vir:100960 Length: 472 97.8 1.8E-05 1.1E-08 46.7 28.6 433 458-1005 1-462 (472) 6 protein:vir:99075 Length: 392 97.7 1.1E-05 7.1E-09 47.7 16.2 175 1-206 211-392 (392) 7 protein:vir:2109 Length: 472 # 97.6 3.4E-05 2.1E-08 45.1 29.6 431 458-1005 1-462 (472) 8 protein:vir:177 Length: 472 # 97.3 9.6E-05 6E-08 42.6 27.6 438 458-1005 1-462 (472) 9 protein:vir:108312 Length: 458 96.9 0.00025 1.6E-07 40.3 29.3 415 470-1000 1-458 (458) 10 protein:vir:105428 Length: 472 96.8 0.00035 2.2E-07 39.5 27.9 433 458-1000 1-472 (472) 11 protein:vir:118 Length: 449 # 96.1 0.00072 4.4E-07 37.8 13.2 128 1-137 316-449 (449) 12 protein:vir:105525 Length: 472 93.7 0.0066 4.1E-06 32.5 27.0 426 492-998 1-472 (472) 13 protein:vir:3529 Length: 477 # 93.6 0.007 4.3E-06 32.4 29.8 436 452-1005 1-475 (477) 14 protein:vir:5202 Length: 448 # 93.3 0.0046 2.9E-06 33.4 10.1 127 1-137 316-448 (448) 15 protein:vir:99075 Length: 392 85.4 0.053 3.3E-05 27.6 12.5 112 1-116 262-392 (392) 16 protein:vir:78703 Length: 905 22.0 1.7 0.001 19.3 4.0 113 825-1005 1-117 (905) No 1 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=100.00 E-value=5.8e-81 Score=460.45 Aligned_cols=497 Identities=23% Similarity=0.354 Sum_probs=315.8 Q ss_pred ccccccceeeeeeeeeEEEeecccceeeeeeeeeeecccceeeccceeeeeeccccceeEEEEeecCCceEEeeecCCcc Q lcl|NC_018835. 415 EPTSGIDTSGMYEGNSFYDYSNVNDIEGFARASLFATPLSSVTLDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGY 494 (1005) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~ 494 (1005) ..-..... .. ....... ..+..... +... .-..+-+..+.-.....-.+-. T Consensus 1 ~~~~~~~~--~~---------------~~g~~~d---------~~p~~lp~--~a~s-~~~N~~~~~~~~~~~~g~~pv~ 51 (513) T protein:vir:88 1 MALERQEV--KN---------------PTGIVTD---------IAPADLPL--DKWS-FGNNVRFKNGKAQKALGHSPIF 51 (513) T ss_pred CCcCChhh--cc---------------cccceec---------cChhhcCC--Ccce-eeeeeeEecceeeecCccceee Confidence 00000000 00 0000000 00000000 0000 0000000000000000000000 Q ss_pred eecccccccceeEEEecCCceEEEeeeccc------------cceeeEEEeecccccceeeceeeeccccceeecccccc Q lcl|NC_018835. 495 VSTTSVTGKSIKLVALRKGEINVTCTVSQM------------TQKDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFES 562 (1005) Q Consensus 495 ~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~------------~~~~~~~t~t~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 562 (1005) ++.. ....+.....+.|..++....... ..............-.......+.+.....+.. T Consensus 52 a~~~--~~~~g~~~~~~~g~~~~~~~~~~~~~~~~~~t~~dvs~~~~~~~~~~~w~~~~f~~~i~a~ng~~~~q~----- 124 (513) T protein:vir:88 52 DTAQ--APILDMFPFIRNNIPYWLLCSEKRLYLADGTTIIDVSPGPYSASVTNRWSVGSFNGVIFANDGVNPPHH----- 124 (513) T ss_pred ecCC--CCceeeeeeecCCCeEEEEeeceEEEEecCceeeeccccceeecccCceeeeeecCEEEEEcCCCcceE----- Confidence 0000 000011111111111111100000 000000000000000000001111111111111 Q ss_pred eeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeecccceEEEEecccCceEEEEEeeccccccccccccc Q lcl|NC_018835. 563 EYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTNYPLRLRWSNFANENKAPTLWDDFA 642 (1005) Q Consensus 563 ~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~~t~t~t~~~~~~~~~~~~~~t~~~~~~t~~~~~ 642 (1005) ..... ......++... .+....+....+.+...+..++. ...+..+.++...+.+..+..++... T Consensus 125 ---~~~~s---~~f~dl~g~p~----~~~a~~i~v~~~flv~~~~t~~~-----~~~PnrV~wS~~~D~~~~P~~W~~t~ 189 (513) T protein:vir:88 125 ---LPPTE---SVFRVLPNFPA----NTTFRRLKSFKNFLIGLNVTSNS-----IEMPQMVWWSTSADAGGVPASWDPTD 189 (513) T ss_pred ---EcCCC---ceeeeccCCCc----ccceEEEEEEeeEEEEeecccCc-----CCCCceEEEecccCCccccccccccc Confidence 00000 00001111100 11222233444455544443331 14566677776665555555554221 Q ss_pred eeeeeccccccceeeeeeeeccCcccceeeeeccceeeeccccCceEEEEecCCEEEEEEeCCCCceEEEEEeccccccc Q lcl|NC_018835. 643 YDRVVSSDLASNIVGQTQALENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGIL 722 (1005) Q Consensus 643 t~~~~~~~~~~~t~~~t~~~t~~~~~~~~~t~t~~~~i~~~~~g~~~v~~~~~~~~~~t~t~~t~~~~~~~~~~~~~g~~ 722 (1005) ....++...+.+..++++.+.+++++.+++.++.++.|+++++. .+|+|++++.++||+ T Consensus 190 --------------------~t~~a~~~~l~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~~-~if~~~~i~~~~G~~ 248 (513) T protein:vir:88 190 --------------------PTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGL-YIFQFQQLFNDVGIL 248 (513) T ss_pred --------------------ccCcccccccCCCccceeeeeecccceEEEecccEEEEEecCCC-ceEEEEeeccccccc Confidence 12235677778888999999999999999999999999999765 599999999999999 Q ss_pred cCceeEEeCCeEEEEeCCcEEEeCCceecccccchHHHHHHhhcCcchhccEEEEEcCCCCEEEEEEeccCCCcCCcccc Q lcl|NC_018835. 723 APECVVEVEGSHFVVTQNDVILHNGATKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVLYVGPGEPKESFACT 802 (1005) Q Consensus 723 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 802 (1005) +|+||+++++.|||++++|||||+|.++++|++|||+||||+++|..|++|+++++||+++||||+||+.+++ ...+|| T Consensus 249 ~p~SI~~~~~~~ffls~~Gf~~~~G~~~~~Ig~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~-~~~~~~ 327 (513) T protein:vir:88 249 GPNCAIEFDGNHFVVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSE-PGKHCD 327 (513) T ss_pred CCceeEEECCeEEEEeCCceEEecCceeeecccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCC-CCcccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999998764 244799 Q ss_pred eEEEEecccCceeeeeecce--eeeeeeecccccCCccccCCccccCCCccceeeecCccccCCcccEEEEeecC-CceE Q lcl|NC_018835. 803 KAAVWNYEFDTWSFRTIPYA--QCIGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFL-RGFY 879 (1005) Q Consensus 803 ~~~~~~~~~~~w~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 879 (1005) |++||||++++||++++|+. .++|+.++++. ..| +.++.+||++.. .|.++..++.+..+++++.+ +..| T Consensus 328 ~~lVYd~~~~~Ws~~~~p~~~~g~~g~~~~~~~---~~~-~~~~~~~d~~~~---~~~~~~~~~~~~sl~~~~~~~~~~~ 400 (513) T protein:vir:88 328 RAIIWNWKENTWSIRDLPNVLSGAYGIIDPKTS---NLW-DDDSNPWDTDTS---VWGEGSYNPAKSSMIFTSFQDAKLF 400 (513) T ss_pred eEEEEEccCCeEEEEeccchhhccccccccccc---cee-cccccccccchh---hhhccccccccceeEeeeccCCcee Confidence 99999999999999999975 66888888664 478 458899999775 67888888888777777665 4555 Q ss_pred EEecccccccccccceeeeeccceeeeccccccccccCcccceeeeeeeeeEeCCCcEEEEEeCeeeCccCCceeCCccc Q lcl|NC_018835. 880 QVDVGALDYFYDRLNDVVIEKPLEMRLERTGLDFDNVTNEWNQKHINRFRPQTTGSGTYTFEAGGSQFSNEYGHPHTSKT 959 (1005) Q Consensus 880 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 959 (1005) +++.+ ......||+++|||.+++|++ .+++|+|++++|+++++|.|+|.+|.+++++++++|+++++ T Consensus 401 ~fd~~----------~~f~G~~lea~~~t~~~~~~~---~~~~~~i~~v~~~~t~~g~~t~~vg~~~~~~~~~~~s~~~~ 467 (513) T protein:vir:88 401 LFGET----------STFSGQSFTSTLERSDIYLGD---DRMMKTVSAVIPHITGNGVCNIWVGNAQVQGSGIRWKGPYP 467 (513) T ss_pred eeccc----------ccccCCceEEEEEecCccccC---chhheeeeeeeeeeecceEEEEEEeeeccCcccccccccee Confidence 55433 345788999999999999974 47799999999999999999999999999999999999999 Q ss_pred ccCCcceEEEEEeCCceEEEEEEEcCCCceEEEEeeeEEEee-cCCC Q lcl|NC_018835. 960 YTIGVDRHVSVRLNHPYLFYNVIDNDVNSNAAINGLTIEFAV-GGRR 1005 (1005) Q Consensus 960 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 1005 (1005) |++++++++++|+.|||+++|| ..+++++|+|+|||+|+.+ +|+| T Consensus 468 ~~~~~~~~~~~r~~gRy~~~ri-~i~~~~~w~~~G~~ve~~~~~g~R 513 (513) T protein:vir:88 468 YRIGQDYKIDTKHVGRYIALKF-DFASAGDWYFNGYTLEMAPKAGMR 513 (513) T ss_pred eecccCceEEeccCCceEEEEE-EccCCCceEEeeEEEEEecCCCCC Confidence 9999999999999999999997 5556999999999999999 6999 No 2 >protein:vir:118 Length: 449 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690641;swissprot:sw:q37888;genbank:gi:22855155;interpro:IPR003343;uniprot:Q37888;genbank:GeneID:955370 Probab=98.71 E-value=3.7e-09 Score=66.82 Aligned_cols=207 Identities=15% Similarity=0.100 Sum_probs=84.2 Q ss_pred CeEEEeee------eceEEE-EcCCcceeeeccceecceeceeccceeEecCCceeEEEEEecCCc-ccccc---cccce Q lcl|NC_018835. 1 MALYPIKS------LGAVGV-IADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDL-TPLSF---VSMPF 69 (1005) Q Consensus 1 ~~~y~i~s------~g~~tv-~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~-t~~t~---t~~~~ 69 (1005) -..|...+ ....-. ........+..+.+...-+...+. .....+..+... ..... ..... T Consensus 230 t~~~N~~~v~~~ad~~dl~~i~~~d~~~~ld~t~ls~afN~tavD---------a~~~~tvVddfAst~~~a~~~sk~~~ 300 (449) T protein:vir:11 230 TRDYNAMAVRTRSDIRDVHLFIDADLNAELDVDVLAKAFNMDRTT---------FLGNVTVIDGFASTGLKAVMVDKDWF 300 (449) T ss_pred CCCCCceeeccccCccceEEEEccCcceecccccchhhhccceee---------eeeeeeecCccCCccceeeeecccee Confidence 02222211 111111 111111111111111111111110 000000000000 00000 00000 Q ss_pred EEEEcCCcEEEEecCCceEEEEecceEEEEEEecCcce-EEEEEeeeccccceeeecCccceEecCCceEEEEEEEeCCC Q lcl|NC_018835. 70 DYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVATVTK-KASASIKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADA 148 (1005) Q Consensus 70 t~~ss~~s~~~t~~~~g~~t~t~~~t~~~t~~~~~~t~-t~t~tvt~~~~v~~it~~~~~~t~~~g~t~tltatv~~~~~ 148 (1005) ................|.+... .+............. ...+.....+++++++++|...++..|++.+|+|++.|.++ T Consensus 301 ~~~d~~~~~~~~~~~~G~y~n~-~~tvt~t~~~~~~~~~~a~~~~~~~~~VTsVsVtPss~tL~~G~T~qLTATV~psna 379 (449) T protein:vir:11 301 MVYDTLQKMETIRNPRGLYWNY-YYHVWQVLSASRFANAVAFVTGDDVPAVTQVIVSPAIASVKQGKSQAFTAYVRATDD 379 (449) T ss_pred EEeeeeeEEEEEEcCcceeecc-ceEEEEEEecccccceeeeeeeeccceeeEEEeeccceeeecCceEEEEEEEecCCC Confidence 0000000111111122221110 011111111111111 11223334456789999999999999999999999999999 Q ss_pred ccceEEEEECCCc-eeeEeccccccceeeeeecccCcceeeeeeccccceeeeEEEeeeecccceeeccccceeeeccee Q lcl|NC_018835. 149 NNTDLVWEVSNSS-YGSITVDPSDSKLATLTSFEREGNLVVTISTADDSVVAQIAVNIIDGDSGIFLSQDTVTIRKGGTT 227 (1005) Q Consensus 149 t~~tvt~tss~~~-~atv~~~~~~~~~~~~~t~~~~Gt~tiTat~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (1005) +++.|+|++++.. .++++.+| .+++...|+++|+++..++................... ... ...+... T Consensus 380 tnk~VTWSsSd~s~~ATVda~G-------~VTAva~GTAtITAta~~~s~TaT~tvtV~~~a~VtVt-P~s--a~ggaqA 449 (449) T protein:vir:11 380 KEHEVVWSVDGGSTGTSISSDG-------VLTVAANETNQLTVKATVDIGTADEPKPVVGEAVVNVR-PDS--STGGAQA 449 (449) T ss_pred CCceEEEEEeCCceEEEEcCCc-------eEEEecCccEEEEEEEecCcEEEEEEeeecceEEEEEe-ecC--CCCcccC Confidence 9999999988775 47787766 47888899999999887765544433322211111000 000 0000000 No 3 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=98.09 E-value=4.8e-06 Score=49.73 Aligned_cols=429 Identities=10% Similarity=0.022 Sum_probs=150.8 Q ss_pred ccceeeeeeccccceeEEEEeecCCceEEeeecCCcceecccccccceeEEEecCCceEEEeeeccccceeeEEEeeccc Q lcl|NC_018835. 458 LDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~~~~~~~~t~t~~~ 537 (1005) .......+..+-.. .+.......+- .-...+.............. ..|.... +...+..-.- T Consensus 1 m~~~~ipl~~g~~~------~~~~a~~~~~~-pvn~y~~~~~~~~ss~~Lr~-~pG~~~~-a~~~G~~RG~--------- 62 (472) T protein:vir:92 1 MPIQQLPMMKGMGK------DFKNADYIDYL-PINMLATPKEVLDSSGYLRS-FPGIAKR-NDVNGVSRGV--------- 62 (472) T ss_pred Cceeeccccccccc------cCccCcceeee-ecccccccccccccccceee-cccceee-cCCCCcccce--------- Confidence 00000000000000 00000000000 00000000000000000000 0000000 0000000000 Q ss_pred ccceeeceeeeccccceeecccccceeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeecccce-EEEEe Q lcl|NC_018835. 538 YHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREAN-ASGVT 616 (1005) Q Consensus 538 ~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~~t-~t~t~ 616 (1005) .-....+. .. .+.+.....+... ...+..+.-..-..+.....+..+... -.... T Consensus 63 -----------------~~~~~~~~-ly-~V~G~~Ly~v~~~-----iG~i~gsgrVsMa~n~~~~av~~~~~~~~Y~~~ 118 (472) T protein:vir:92 63 -----------------EYNTAQNA-VY-RVCGGKLYKGEAV-----VGDVAGSGRVSMAHGRTSQAVGVNGQLIEYRYD 118 (472) T ss_pred -----------------eeeeeCCe-EE-EEeCcceEEEEee-----EeeccCcccEEEecCCeEEEEEECCceeEEEEe Confidence 00000000 00 0000001111000 000000000000000000111000000 00000 Q ss_pred cccCceEEEEEeeccccccc------cccccceeee-eccccccceeeeeeeeccCcc--cceeeeeccceeeeccccCc Q lcl|NC_018835. 617 TNYPLRLRWSNFANENKAPT------LWDDFAYDRV-VSSDLASNIVGQTQALENGYA--GYIDLADSNGSLIDILPLKD 687 (1005) Q Consensus 617 ~~~~~~~~~~~~~t~~~~~~------t~~~~~t~~~-~~~~~~~~t~~~t~~~t~~~~--~~~~~t~t~~~~i~~~~~g~ 687 (1005) ....... ..+......-. ..+..+..-+ ........-++.......... ....-...+..++......+ T Consensus 119 ~~~~t~~--~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~ 196 (472) T protein:vir:92 119 GAVKTVS--NWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGSWRD 196 (472) T ss_pred cchhhhh--cccCccccccccccceeEEEEecceEEEccCCCceEEEeccCCccccccccccccccCCCCceEEEEeecc Confidence 0000000 00000000000 0000000000 000000000010011111111 11222334556677777888 Q ss_pred eEEEEecCCEEEEEEeCCCC-ceEEEEEec---cccccccCceeEEeCCeEEEEeCCc-----EEEeCCceecccccchH Q lcl|NC_018835. 688 YLFVYTEFETYIGSPTNNTY-QPLMFKKLF---NDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIASNRV 758 (1005) Q Consensus 688 ~~v~~~~~~~~~~t~t~~t~-~~~~~~~~~---~~~g~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 758 (1005) ..+.+-...+-+..-+|++. .-|+|+..+ -..||.+|.|+..+++.+||++|++ +|+.+|.+++.|-...| T Consensus 197 ~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aI 276 (472) T protein:vir:92 197 FIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI 276 (472) T ss_pred EEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHH Confidence 89999999998898888773 456777654 6899999999999999999999998 99999999999999999 Q ss_pred HHHHHhhcCcchhccEEEEEcCCCCE-EE-EEEeccCCCcCCcccceEEEEecccC----ceeeeeecceeeeeeeeccc Q lcl|NC_018835. 759 KNMLINEVCLVNPLATRVHLHQDKKE-VW-VLYVGPGEPKESFACTKAAVWNYEFD----TWSFRTIPYAQCIGLVDPPV 832 (1005) Q Consensus 759 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~w~~~~~~~~~~~~~~~~~~ 832 (1005) .+- ++.++...+...+++.-++.+. .| .+| ++|.++||-.++ +|+.+.-=+ .+.++ T Consensus 277 E~~-i~~y~~~e~~~a~~~s~~~eGH~fy~Ltf-----------P~~Tw~yD~at~~~~e~W~~~~sg~------~~~~~ 338 (472) T protein:vir:92 277 EKI-IRSYTADELATGVMEALRFDSHELLIIHL-----------PRHVLVYDASSSQNGPQWCVLKTGL------YDDVY 338 (472) T ss_pred HHH-HHhcCcchhceeeEEEEEecCeeEEEEEc-----------CCceEEEEcccCcCCceeeeecCCC------cccce Confidence 998 6677665555555555443333 22 223 367899998888 688776210 11111 Q ss_pred ccCCccccCCccccCCCccceeeecCccccCCcccEEEEeecCCc-eEEEecccccccccccceeeeeccceeeeccccc Q lcl|NC_018835. 833 LERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLRG-FYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGL 911 (1005) Q Consensus 833 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 911 (1005) .+....|. --.+++|+.+.+ .|+++-....+.+ +|.+ T Consensus 339 R~~~~~~~-------------------------~g~~ivGD~~nG~ly~l~~~~~t~~~---------~~~~-------- 376 (472) T protein:vir:92 339 RAIDFMYE-------------------------GNQITCGDKSEAVTGQLQFDISSQYD---------KQQE-------- 376 (472) T ss_pred eEEEEEee-------------------------CCeEEEEEcCCCeEEEEeccccccCC---------Ccce-------- Confidence 11111111 112445555333 3333222111111 1100 Q ss_pred cccccCcccceeeeeeeeeEeCCCc----EEEEEeCe---eeCccCCceeCCcccccCCcceEEEEEeCCceEEEEEEEc Q lcl|NC_018835. 912 DFDNVTNEWNQKHINRFRPQTTGSG----TYTFEAGG---SQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDN 984 (1005) Q Consensus 912 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 984 (1005) |+ |++|++-.++ ..+++++. ++.+---..|+.- --+-+.++.+.....||| .-|++=- T Consensus 377 ------------~~-~~~P~~~~dn~R~~d~eve~~~Gv~q~~d~v~L~wSdd-G~~~~~~~~~~~g~~g~~-~tr~~~~ 441 (472) T protein:vir:92 377 ------------HL-LFTPIFKADNARCFDLEVESSTGVAQYADRLFLSATTD-GINYGREQMIEQNEPFVY-DKRVLWK 441 (472) T ss_pred ------------EE-EEeceEecCCCEEEEEeeeccCCCCCcCceEEEEeecc-ccccccceeeccCCccch-hcceeee Confidence 00 1222222111 11111111 1111112223321 112235677888888888 1121100 Q ss_pred CCCceEEEEeeeEEEeecCCC Q lcl|NC_018835. 985 DVNSNAAINGLTIEFAVGGRR 1005 (1005) Q Consensus 985 ~~~~~~~~~~~~~~~~~~~~~ 1005 (1005) +-+.--+--||++......++ T Consensus 442 RlG~~r~~v~f~~r~~~~~~~ 462 (472) T protein:vir:92 442 RVGRIRRLIGFKLRVITKSPV 462 (472) T ss_pred eeeecccceeEEEEEEecCcc Confidence 101111123555555554444 No 4 >protein:vir:5202 Length: 448 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040725;genbank:gi:9626396;genbank:GeneID:1260967 Probab=97.99 E-value=1.4e-06 Score=52.71 Aligned_cols=206 Identities=17% Similarity=0.193 Sum_probs=82.3 Q ss_pred CeEEEeeeeceEE-EEcCCcceeeeccceecceeceeccceeEecCCceeEEEEEecCCccccccccc---ceEEEEcCC Q lcl|NC_018835. 1 MALYPIKSLGAVG-VIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFVSM---PFDYYSAGN 76 (1005) Q Consensus 1 ~~~y~i~s~g~~t-v~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~t~~t~t~~---~~t~~ss~~ 76 (1005) |++-.-...-..- +........+-.+.+..+-+-..+.. .++ +...... .......... -........ T Consensus 236 ~~v~~~~~~~dl~li~~~~~~~~ldv~~la~afn~~~~~~-----~~~--~~~vd~F-~~~g~~~i~vskk~~~~~d~~~ 307 (448) T protein:vir:52 236 MAVRTRSYMEDLHLIIDADLEAELDVDVLAKAFNMNRTDF-----LGN--VTVIDGF-ASTGLEAVLVDKDWFMVYDNLH 307 (448) T ss_pred ccccccccceeeEEEECCCceEeecHHHHHHHhccccccc-----Ccc--eEEecCc-cccCceeeeeeeeeeeeeeccc Confidence 2221111111111 11111222222222211111111100 000 0001100 0000001111 111122223 Q ss_pred cEEEEecCCceEEEEecceEEEEEEecC-cceEEEEEeee-ccccceeeecCccceEecCCceEEEEEEEeCCCccceEE Q lcl|NC_018835. 77 SFLVVGTDKKLYKLTDESLTDISRKVAT-VTKKASASIKI-YPVVSQIVPKESTISMNFNQTKNLEVSLLPADANNTDLV 154 (1005) Q Consensus 77 s~~~t~~~~g~~t~t~~~t~~~t~~~~~-~t~t~t~tvt~-~~~v~~it~~~~~~t~~~g~t~tltatv~~~~~t~~tvt 154 (1005) .........|.+..- ...+.+.... ....+...+.. .+.+.+++++|...++..|++.+|+|++.+.++.+..|+ T Consensus 308 kg~t~~na~GL~~N~---~~TItatss~~~~t~atA~V~~t~paVtsVsVsPttasL~~G~TqqlTATVsg~na~~~~VT 384 (448) T protein:vir:52 308 KMETVRNPRGLYWNY---YYHVWQTLSVSRSANAVAFVSGDVPAVTQVIVSPNIAAVKQGGKQQFTAYVRATDGKDHKVV 384 (448) T ss_pred eeeeeeccccceeee---eeEEEEEEccCccccceEEEEecccccceEEEcccceeecCCCeEEEEEEEecCCCCCCceE Confidence 333333444444222 1122222211 11222222222 256789999999999999999999999999999999999 Q ss_pred EEECCCce-eeEeccccccceeeeeecccCcceeeeeeccccceeeeEEEeeeecccceeeccccceeeeccee Q lcl|NC_018835. 155 WEVSNSSY-GSITVDPSDSKLATLTSFEREGNLVVTISTADDSVVAQIAVNIIDGDSGIFLSQDTVTIRKGGTT 227 (1005) Q Consensus 155 ~tss~~~~-atv~~~~~~~~~~~~~t~~~~Gt~tiTat~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (1005) |++++.+. ++++.+|. .+....|+.+++++.................. .+...+... ..+... T Consensus 385 WSvS~ns~~aTVsssG~-------vTv~a~gTatITVtATvdts~a~~~~~vv~ea-~VsvtP~~a--s~G~q~ 448 (448) T protein:vir:52 385 WSVEGGSTGTAITGDGL-------LSVSGNEENQLTVKATVDIGTEDKPNLVVGEA-VVSIRPNNA--SGGAQA 448 (448) T ss_pred EEEcCCceeeEEeCCcc-------EEeccCCcceEEEEEEecCcccCCceeeeeeE-EEEecCCCC--CCcCCC Confidence 99887766 67777763 44455566666666543222111111111000 000000000 000000 No 5 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=433 Identities=10% Similarity=0.027 Sum_probs=151.2 Q ss_pred ccceeeeeeccccceeEEEEeecCCceEEeeecCCcceecccccccceeEEEecCCceEEEeeeccccceeeEEEeeccc Q lcl|NC_018835. 458 LDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~~~~~~~~t~t~~~ 537 (1005) .......+..|-.. .......+.+- .-...+.............. ..|.... +...+..-. T Consensus 1 m~~~~ipl~~g~~~------~~~~a~~~~~~-pvn~y~~~~~~~~ss~~Lr~-~pG~~~~-a~~~G~~RG---------- 61 (472) T protein:vir:10 1 MPIQQLPMMKGMGK------DFKNADYIDYL-PINMLATPKEVLNSSGYLRS-FPGIAKR-NDVNGVSRG---------- 61 (472) T ss_pred Cceeeccccccccc------CCCcCcceeee-eeccccccccccccccceee-cccceee-cCCCCcccc---------- Confidence 00000000000000 00000000000 00000000000000000000 0000000 000000000 Q ss_pred ccceeeceeeeccccceeecccccceeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeecccceE-EEEe Q lcl|NC_018835. 538 YHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANA-SGVT 616 (1005) Q Consensus 538 ~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~~t~-t~t~ 616 (1005) ..-....+. .. .+.+.....+... ...+..+.-..-..+.....+..+.... .... T Consensus 62 ----------------~~~~~~~~~-ly-~V~G~~Ly~v~~~-----iG~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~ 118 (472) T protein:vir:10 62 ----------------VEYNTAQNA-VY-RVCGGKLYKGEAV-----VGDVAGSGRVSMAHGRTSQAVGVNGQLIEYRYD 118 (472) T ss_pred ----------------eeeeeeCCe-EE-EEeCcceEEEEee-----EeeccCcccEEEeeCCeEEEEEECCceeEEEEe Confidence 000000000 00 0000101111000 0000000000000000001110000000 0000 Q ss_pred cccCceEEEEEeeccccccc------cccccceeee-eccccccceeeeeeeeccCcc--cceeeeeccceeeeccccCc Q lcl|NC_018835. 617 TNYPLRLRWSNFANENKAPT------LWDDFAYDRV-VSSDLASNIVGQTQALENGYA--GYIDLADSNGSLIDILPLKD 687 (1005) Q Consensus 617 ~~~~~~~~~~~~~t~~~~~~------t~~~~~t~~~-~~~~~~~~t~~~t~~~t~~~~--~~~~~t~t~~~~i~~~~~g~ 687 (1005) ...... ...+......-. ..+..+..-+ ........-++.......... ....-...+..++......+ T Consensus 119 ~~~~t~--~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~ 196 (472) T protein:vir:10 119 GAVKTV--SNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGSWRD 196 (472) T ss_pred cchhhh--hcccCccccccccccceeEEEEecceEEEccCCCceEEEeccCCccccccccccccccCCCCceEEEEeecc Confidence 000000 000000000000 0000000000 000000000010011111111 11222334556677777888 Q ss_pred eEEEEecCCEEEEEEeCCCC-ceEEEEEec---cccccccCceeEEeCCeEEEEeCCc-----EEEeCCceecccccchH Q lcl|NC_018835. 688 YLFVYTEFETYIGSPTNNTY-QPLMFKKLF---NDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIASNRV 758 (1005) Q Consensus 688 ~~v~~~~~~~~~~t~t~~t~-~~~~~~~~~---~~~g~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 758 (1005) ..+.+-...+-+..-+|++. .-|+|+..+ -..||.+|.|+..+++.+||++|++ +|+.+|.+++.|-...| T Consensus 197 ~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aI 276 (472) T protein:vir:10 197 FIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI 276 (472) T ss_pred EEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHH Confidence 89999999998898888773 456777654 6899999999999999999999998 99999999999999999 Q ss_pred HHHHHhhcCcchhccEEEEEcC-CCCEEE-EEEeccCCCcCCcccceEEEEecccCc----eeeeeecceeeeeeeeccc Q lcl|NC_018835. 759 KNMLINEVCLVNPLATRVHLHQ-DKKEVW-VLYVGPGEPKESFACTKAAVWNYEFDT----WSFRTIPYAQCIGLVDPPV 832 (1005) Q Consensus 759 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~----w~~~~~~~~~~~~~~~~~~ 832 (1005) .+-+ +.....-+...+++.-+ +.++.| .+|| ++.++||-.++. |++++- |+.+.++ T Consensus 277 E~~i-~~y~~~e~~~A~~~t~~~~GH~fy~LtfP-----------~~Tw~yD~at~~w~erw~~~~~------g~~~~~~ 338 (472) T protein:vir:10 277 EKII-RSYTAEELATGVMETLRFDSHELLIIHLP-----------RHVLVYDASSSQNGPQWCVLKT------GLYDDVY 338 (472) T ss_pred HHHH-HhcCCccccceEEEEEEeCCeEEEEEEcC-----------CeeEEEEcccCcccceeeeecC------CCcccce Confidence 9995 66654443444443333 333332 2333 678999988884 554430 0111111 Q ss_pred ccCCccccCCccccCCCccceeeecCccccCCcccEEEEeec-CCceEEEecccccccccccceeeeeccceeeeccccc Q lcl|NC_018835. 833 LERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSF-LRGFYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGL 911 (1005) Q Consensus 833 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 911 (1005) .+....|. --.+++|+. +.-.|+++-+....+++...-...+-+|. T Consensus 339 R~~~~~~~-------------------------~g~~ivGD~~nG~ly~ld~~~~t~~g~~~~~~~~~p~l~-------- 385 (472) T protein:vir:10 339 RAVDFMYE-------------------------GNQITCGDKSEALTGQLQFDISSQYGLQQEHLLFTPLFK-------- 385 (472) T ss_pred eEEEEEee-------------------------CCeEEEEEcCCCeEEEEecccCCCCCCcccceEEccccc-------- Confidence 11111111 112455665 34445666554444444433222222211 Q ss_pred cccccCcccceeeeeeeeeEeCCCcEEEEEeCe---eeCccCCceeCCcccccCCcceEEEEEeCCceEEEEEEEcCCCc Q lcl|NC_018835. 912 DFDNVTNEWNQKHINRFRPQTTGSGTYTFEAGG---SQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDNDVNS 988 (1005) Q Consensus 912 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 988 (1005) -|. + |++ ..+++++. ++.+---..|+.- --+-+.++.+.....||| .-|++=-+-+. T Consensus 386 -~dn-------~---R~~-------d~eve~~~Gv~~~~d~v~L~wSdd-G~~~~~~~~~~~g~~g~~-~tr~~~~RlG~ 445 (472) T protein:vir:10 386 -ADN-------A---RCF-------DLEVESSTGVAQYADRLFLSATTD-GINYGREQMIEQNEPFVY-DKRVIWKRVGR 445 (472) T ss_pred -CCC-------C---EEE-------EEeeeccCCCCCcCcEEEEEeecc-ccccccceeeccCCccch-hcceeeeeeee Confidence 010 0 111 01111110 1111111222221 111225677888888888 11210001011 Q ss_pred eEEEEeeeEEEeecCCC Q lcl|NC_018835. 989 NAAINGLTIEFAVGGRR 1005 (1005) Q Consensus 989 ~~~~~~~~~~~~~~~~~ 1005 (1005) --+--||++......++ T Consensus 446 ~r~~v~f~~r~~~~~~~ 462 (472) T protein:vir:10 446 IRRLIGFKLRVITKSPV 462 (472) T ss_pred cccceeEEEEEEecCcc Confidence 11123455554444444 No 6 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.72 E-value=1.1e-05 Score=47.67 Aligned_cols=175 Identities=11% Similarity=0.089 Sum_probs=70.8 Q ss_pred CeEEEeeeeceEEEEcCCcceeeeccceecceeceeccceeEecCCceeEEEEEecCCcccccccccceEEEEcCCcEEE Q lcl|NC_018835. 1 MALYPIKSLGAVGVIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFVSMPFDYYSAGNSFLV 80 (1005) Q Consensus 1 ~~~y~i~s~g~~tv~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~t~~t~t~~~~t~~ss~~s~~~ 80 (1005) +-+|..+........ -..+.+..... .......+........ .... ....+..... ... T Consensus 211 ~~v~~s~~~~~~t~~------a~~~~a~~~at-----~a~v~~~~~~~~~s~s-~~~~--------v~~~~~~~~~-~t~ 269 (392) T protein:vir:99 211 YEIVESTLIPHGDAY------LYHPTAFIMAT-----RAPAPPMGAVRSTAIS-GDQR--------IAMRWLVDYD-STI 269 (392) T ss_pred eEEEeecccccccce------eeecccccccc-----ccccccccccceeEEe-cccc--------eecceeeccc-cee Confidence 111211111100000 00000000000 0000000000000000 0000 0000000000 000 Q ss_pred EecCCceEEEEecceEEEEEEecCcceEEEEE---eeeccccceeeecCccceEecCCceEEEEEEEeCCCcc--ceEEE Q lcl|NC_018835. 81 VGTDKKLYKLTDESLTDISRKVATVTKKASAS---IKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADANN--TDLVW 155 (1005) Q Consensus 81 t~~~~g~~t~t~~~t~~~t~~~~~~t~t~t~t---vt~~~~v~~it~~~~~~t~~~g~t~tltatv~~~~~t~--~tvt~ 155 (1005) .......... .+...+..... ........ ......+..+.+.+....+..|+..++.+++.+.++.. ..++| T Consensus 270 ~s~~~~v~~~--~g~~~v~~~~~-~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~vtw 346 (392) T protein:vir:99 270 TSNRSLIDTY--FGLKVVEDPNG-VGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDF 346 (392) T ss_pred ecccccccee--EEEEEEeeccc-cceeeeeeeeeecceeeeeeeecccceeEeeeccceeEEEEEEecCCccccceEEE Confidence 0000000000 01111111100 00111111 11122344556677777888899888888888887765 67999 Q ss_pred EECCCceeeEeccccccceeeeeecccCcceeeeeecccc--ceeeeEEEeee Q lcl|NC_018835. 156 EVSNSSYGSITVDPSDSKLATLTSFEREGNLVVTISTADD--SVVAQIAVNII 206 (1005) Q Consensus 156 tss~~~~atv~~~~~~~~~~~~~t~~~~Gt~tiTat~~~~--~~~~s~~~~~~ 206 (1005) +++|+.+++++.+| .+++...|+++|+++..+. ....++..... T Consensus 347 ~Ssn~~vAtV~~~G-------~Vt~v~~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 347 ESSATDKATVAAGG-------LVTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred EEcCCeeEEEcCCc-------eEEEEecceEEEEEEEEcCCCcEEEEEEEEeC Confidence 99999999999877 4788899999999997543 33334433333 No 7 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=97.64 E-value=3.4e-05 Score=45.06 Aligned_cols=431 Identities=11% Similarity=0.068 Sum_probs=149.1 Q ss_pred ccceeeeeeccccceeEEEEeecCCceEEeeecCCcceecccccccceeEEEecCCceEEEeeeccccceeeEEEeeccc Q lcl|NC_018835. 458 LDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~~~~~~~~t~t~~~ 537 (1005) .......+..|-.. .......+.|-- -.-.+.........+..... .| .+..+...+..- T Consensus 1 m~~~q~Pl~~g~~~------~~~~~d~~~~~p-VN~~a~~~~~~~s~~~lr~t-PG-~~~~~~~~g~~R----------- 60 (472) T protein:vir:21 1 MPIQQLPMMKGMGK------DFKNADYIDYLP-VNMLATPKEILNSSGYLRSF-PG-ITKRYDMNGVSR----------- 60 (472) T ss_pred CceEEeeccccccc------cccccceeeeee-eeeeeeccCCcccceeeeec-CC-cceeccCCCcee----------- Confidence 00000000000000 000000000000 00000000000000000000 00 000000000000 Q ss_pred ccceeeceeeeccccceeecccccceeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeecccceE-EEEe Q lcl|NC_018835. 538 YHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANA-SGVT 616 (1005) Q Consensus 538 ~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~~t~-t~t~ 616 (1005) .....+..+.-+ .+.+..-..+... ...+..+.-..-..+.....+..+.... .... T Consensus 61 ---------------G~~~~t~~~~ly--~V~G~~LY~v~~~-----~G~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~ 118 (472) T protein:vir:21 61 ---------------GVEYNTAQNAVY--RVCGGKLYKGESE-----VGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYD 118 (472) T ss_pred ---------------eeeecccCCeEE--EEeCCceEEEeee-----eeeecccccEEEeeCCeEEEEEECCceeEEEEe Confidence 000000000000 0001110110000 0000000000000001001110000000 0000 Q ss_pred cccCceEEEEEeeccccccc------cccccceeee-eccccccceeeeeeeeccCcc--cceeeeeccceeeeccccCc Q lcl|NC_018835. 617 TNYPLRLRWSNFANENKAPT------LWDDFAYDRV-VSSDLASNIVGQTQALENGYA--GYIDLADSNGSLIDILPLKD 687 (1005) Q Consensus 617 ~~~~~~~~~~~~~t~~~~~~------t~~~~~t~~~-~~~~~~~~t~~~t~~~t~~~~--~~~~~t~t~~~~i~~~~~g~ 687 (1005) ...... ...+......-. ..+..+..-+ ........-++........+. ....-...+..++......+ T Consensus 119 ~~~~t~--~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~f~is~l~d~~~~~~y~~FatAE~~pD~Iv~i~~~~~ 196 (472) T protein:vir:21 119 GTVKTV--SNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRD 196 (472) T ss_pred cchhhh--hcccCccccccccccceeEEEEecceEEEccCCcceeEEecCCCCccccCCccceeeccCCCceEEEEeecc Confidence 000000 000000000000 0000000000 000000000111111111111 12333344566777777888 Q ss_pred eEEEEecCCEEEEEEeCCC-CceEEEEEec---cccccccCceeEEeCCeEEEEeCCc-----EEEeCCceecccccchH Q lcl|NC_018835. 688 YLFVYTEFETYIGSPTNNT-YQPLMFKKLF---NDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIASNRV 758 (1005) Q Consensus 688 ~~v~~~~~~~~~~t~t~~t-~~~~~~~~~~---~~~g~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 758 (1005) ..+.+-...+-+..-+|++ +.-|+|+..+ -..||.+|.|+..+++.+||++|++ +|+.+|.|++.|-...| T Consensus 197 ~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aI 276 (472) T protein:vir:21 197 FIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI 276 (472) T ss_pred EEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHH Confidence 8999999999889888877 3456777654 6899999999999999999999998 99999999999999999 Q ss_pred HHHHHhhcCcchhccEEEEEc-CCCCEEE-EEEeccCCCcCCcccceEEEEecccCc----eeeeeecceeeeeeeeccc Q lcl|NC_018835. 759 KNMLINEVCLVNPLATRVHLH-QDKKEVW-VLYVGPGEPKESFACTKAAVWNYEFDT----WSFRTIPYAQCIGLVDPPV 832 (1005) Q Consensus 759 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~----w~~~~~~~~~~~~~~~~~~ 832 (1005) .+-+ +.....-+...+++.- .+.++.| .+|| ++.++|+-.++. |+++.-= .++.++ T Consensus 277 E~~i-~~y~~~e~~~A~~~t~~~eGH~fy~LtfP-----------~~Tw~yD~at~~~~e~W~~~~sg------~~~~~~ 338 (472) T protein:vir:21 277 EKII-RSYTAEEMATGVMETLRFDSHELLIIHLP-----------RHVLVYDASSSQNGPQWCVLKTG------LYDDVY 338 (472) T ss_pred HHHH-HhcCCccccceEEEEEEeCCeEEEEEEcC-----------CeeEEEEcccCccCceeeeeccC------CCcCce Confidence 9995 6665443334444333 3333332 2333 678999988884 7777611 011111 Q ss_pred ccCCccccCCccccCCCccceeeecCccccCCcccEEEEeecCCceE-EE--ecccccccccccceeeeeccceeeeccc Q lcl|NC_018835. 833 LERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLRGFY-QV--DVGALDYFYDRLNDVVIEKPLEMRLERT 909 (1005) Q Consensus 833 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 909 (1005) ......|. --.+++|+.+.+.+ ++ +... |+|.-...+-..|+ T Consensus 339 R~~~~~~~-------------------------~g~~ivGD~~nG~ly~L~fd~~~---~~d~~~~~~r~~p~------- 383 (472) T protein:vir:21 339 RGVDFMYE-------------------------GNQITCGDKSEAVVGQLQFDISS---QYDKQQEHLLFTPL------- 383 (472) T ss_pred eEEEEEee-------------------------CCeEEEEEcCCCeEEEEEecccc---cCCCcCcEEEEccc------- Confidence 11111111 11245555544332 22 2221 11110011111111 Q ss_pred cccccccCcccceeeeeeeeeEeCCCcEEEEEeCe---eeCccCCceeCCcccccCCcceEEEEEeCCceEEEEEEEcCC Q lcl|NC_018835. 910 GLDFDNVTNEWNQKHINRFRPQTTGSGTYTFEAGG---SQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDNDV 986 (1005) Q Consensus 910 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 986 (1005) +..|.. |++ ..+++++. ++.+---..|+.- --+-+.++.+.....||| .-|++=-+- T Consensus 384 -~~~dn~----------R~f-------d~eve~~~Gv~q~~d~v~L~wSdd-G~~~~~~~~~~~g~~g~~-~tr~~~~Rl 443 (472) T protein:vir:21 384 -FKADNA----------RCF-------DLEVESSTGVAQYADRLFLSATTD-GINYGREQMIEQNEPFVY-DKRVLWKRV 443 (472) T ss_pred -eeCCCC----------EEE-------EEeeeccCCCCCcCcEEEEEeecc-ccccccceeeccCCccch-hcceeeeee Confidence 111110 000 01111111 1111111223321 112235677888888888 112110010 Q ss_pred CceEEEEeeeEEEeecCCC Q lcl|NC_018835. 987 NSNAAINGLTIEFAVGGRR 1005 (1005) Q Consensus 987 ~~~~~~~~~~~~~~~~~~~ 1005 (1005) +.--+--||++......++ T Consensus 444 G~~r~~v~f~~r~~~~~~~ 462 (472) T protein:vir:21 444 GRIRRLIGFKLRVITKSPV 462 (472) T ss_pred eecccceeEEEEEEecCcc Confidence 1111123555555554444 No 8 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=97.32 E-value=9.6e-05 Score=42.61 Aligned_cols=438 Identities=11% Similarity=0.025 Sum_probs=162.8 Q ss_pred ccceeeeeeccccceeEEEEeecCCceEEeeecCCcceecccccccceeEEEecCCceEEEeeeccccceeeEEEeeccc Q lcl|NC_018835. 458 LDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~~~~~~~~t~t~~~ 537 (1005) .......+..|.... +......+...+... +.-.......+.... ..|- +..+...+..-. T Consensus 1 m~~~~~Pl~~G~~~~-~~~~d~~~~~pVN~~------a~~~~~~~s~~~l~~-tPGl-~~~a~v~G~~RG---------- 61 (472) T protein:vir:17 1 MPIQQLPLMKGVGKD-FRNADYIDYLPVNML------ATPKEILNSSGYLRS-FPGI-AKRSDVNGVSRG---------- 61 (472) T ss_pred CCeeeeeeccCceee-ccccchhheeeeeee------eeccCCCcccceeec-CCCc-eeeccCCccccc---------- Confidence 000000000000000 000000000000000 000000000000000 0000 000000000000 Q ss_pred ccceeeceeeeccccceeecccccceeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeeccc-ceEEEEe Q lcl|NC_018835. 538 YHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMRE-ANASGVT 616 (1005) Q Consensus 538 ~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~-~t~t~t~ 616 (1005) .........-+-+ .+.....+.. ....+..+.-..-..+.....+..+. ..-..-. T Consensus 62 ----------------~~~~~~~g~lY~V--~G~~LY~v~~-----~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~y~ 118 (472) T protein:vir:17 62 ----------------VEYNMAQNAVYRV--CGGKLYKGES-----EVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYD 118 (472) T ss_pred ----------------eEEEeeCCeEEEE--ecceEeeeec-----ceecccCcccEEEecCCcEEEEEECCceeEEEee Confidence 0000000000000 0000000000 00000000000000010001111100 0000000 Q ss_pred cccCceEEEEEeeccccc------cccccccceeee-eccccccceeeeeeeecc--CcccceeeeeccceeeeccccCc Q lcl|NC_018835. 617 TNYPLRLRWSNFANENKA------PTLWDDFAYDRV-VSSDLASNIVGQTQALEN--GYAGYIDLADSNGSLIDILPLKD 687 (1005) Q Consensus 617 ~~~~~~~~~~~~~t~~~~------~~t~~~~~t~~~-~~~~~~~~t~~~t~~~t~--~~~~~~~~t~t~~~~i~~~~~g~ 687 (1005) +........ +...... ....+..+..-+ ........-++....... .......-...+..++......+ T Consensus 119 ~~v~t~~~~--~~d~~~~~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~ 196 (472) T protein:vir:17 119 GTVKTVSNW--PTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRD 196 (472) T ss_pred ccchhhhcc--ccccccccccccceeeeeeecceEEEeccCcceEEEeccCCccccccccccccccCCCCceEEEEeecc Confidence 000000000 0000000 000000000000 000000000111011000 01111122334456677777888 Q ss_pred eEEEEecCCEEEEEEeCCCCc-eEEEEEec---cccccccCceeEEeCCeEEEEeCC-----cEEEeCCceecccccchH Q lcl|NC_018835. 688 YLFVYTEFETYIGSPTNNTYQ-PLMFKKLF---NDSGILAPECVVEVEGSHFVVTQN-----DVILHNGATKKSIASNRV 758 (1005) Q Consensus 688 ~~v~~~~~~~~~~t~t~~t~~-~~~~~~~~---~~~g~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ 758 (1005) ..+.+-...+-+..-+|++.. -|+|+..+ -..||.+|.|+..+++.+||++|+ -+|+.+|.+++.|-...| T Consensus 197 ~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aI 276 (472) T protein:vir:17 197 FIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPISSASI 276 (472) T ss_pred EEEEEeccceEEEEeeCCCCCCcCceeecCcceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHH Confidence 899999999999999998763 37777654 689999999999999999999996 478999999999999999 Q ss_pred HHHHHhhcCcchhccEEEEEcCCCCE-E-EEEEeccCCCcCCcccceEEEEecccCceeeeeecceeeeeeeecccccCC Q lcl|NC_018835. 759 KNMLINEVCLVNPLATRVHLHQDKKE-V-WVLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERG 836 (1005) Q Consensus 759 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~ 836 (1005) .+- ++.++...+...+++.-++.+. . +.+|| +|-++|+-.++.|--| +-+.. -|.....+-+.. T Consensus 277 E~~-i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP-----------~~Tw~yD~~t~~Wher-w~~~~-~g~~~~~~Ra~~ 342 (472) T protein:vir:17 277 EKI-LRSYTADELADGVMESLRFDAHELLIIHLP-----------RHVLVYDASSSANGPQ-WCVLK-TGLYDDVYRAID 342 (472) T ss_pred HHH-HHhcCCccccceeEEEEEeCCeEEEEEEcC-----------CceeEeecccccCcee-eeeec-CCCccCceEEEE Confidence 998 6777765555555555443333 2 23333 6789999999888876 11100 000111111111 Q ss_pred ccccCCccccCCCccceeeecCccccCCcccEEEEeecCCc-eEEEecccccccccccceeeeeccceeeeccccccccc Q lcl|NC_018835. 837 PIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLRG-FYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGLDFDN 915 (1005) Q Consensus 837 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 915 (1005) ..|. --.+++|+.+.+ .|.++-+.....++ |.+.++.=+-++.|. T Consensus 343 ~~~~-------------------------~g~~~vGD~~ng~ly~ld~~~~td~g~---------pi~~~~~~p~~~~~~ 388 (472) T protein:vir:17 343 FIYE-------------------------GNQITCGDKLESVTGKLQFDISSQYDK---------QQEHLLFTPLFKADN 388 (472) T ss_pred EEEe-------------------------CCeEEEEEcCCCeEEEEcccCcCCCCc---------eeEEEEecceeeCCC Confidence 1111 123567777544 56666654433222 222222223334432 Q ss_pred cCcccceeeeeeeeeEeCCCcEEEEEeCeeeCccC--CceeCCcccccCCcceEEEEEeCCceEEEEEEEcCCCceEEEE Q lcl|NC_018835. 916 VTNEWNQKHINRFRPQTTGSGTYTFEAGGSQFSNE--YGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDNDVNSNAAIN 993 (1005) Q Consensus 916 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 993 (1005) . | ++ +-++-+..|.-+.++- -.+|+..+ +-+.+..+...+.|||. -|++=-+-+.--+-- T Consensus 389 ~-------R---V~-----d~el~~~tG~~~~adp~~l~~~sDg~--~~g~~~~~~~~~~g~~~-~R~~~~RlG~~r~~v 450 (472) T protein:vir:17 389 A-------R---VF-----DLEVESSTGVAQYADRLFLSATTDGI--NYGREQMIEQNEPFVYD-KRVLWKRVGRIRKNV 450 (472) T ss_pred c-------e---EE-----EEEEeeeCCcccCCCceEEEcccCCc--ccchhhhhhhccCcccc-cceeeeeeeeccccc Confidence 1 2 22 1123334444433321 23577643 33466668888888881 111000000000012 Q ss_pred eeeEEEeecCCC Q lcl|NC_018835. 994 GLTIEFAVGGRR 1005 (1005) Q Consensus 994 ~~~~~~~~~~~~ 1005 (1005) ||.+......++ T Consensus 451 ~f~~~~~~~~~~ 462 (472) T protein:vir:17 451 GFKLRVITKSPV 462 (472) T ss_pred eEEEEEeecccc Confidence 444444444444 No 9 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=96.93 E-value=0.00025 Score=40.32 Aligned_cols=415 Identities=13% Similarity=0.095 Sum_probs=149.7 Q ss_pred cceeEEEEeecCCceEEeeecCCcceecccccccceeEEEe--cCCceEEEeeeccccceeeEEEeecccccceeeceee Q lcl|NC_018835. 470 EIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVAL--RKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAV 547 (1005) Q Consensus 470 ~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~--~~gt~Tit~~~~~~~~~~~~~t~t~~~~~~~~~~~~v 547 (1005) -..... |. ..+..........- ..+.-.. ..+...... ...+ +...-... T Consensus 1 m~~~~i---p~--gsy~a~~~~~daq~-------~VN~yp~~~e~g~ss~~l-------------~~tP---Gl~~f~~~ 52 (458) T protein:vir:10 1 MVQRQI---PL--VATTAEGDVSGQEI-------LVNVYPRKSDGGKYPFTL-------------RHTP---GLAFFCEL 52 (458) T ss_pred Cceeee---ce--eeeeccccccccee-------eeeeeeecccccccccce-------------EecC---CceeeecC Confidence 000000 00 00000000000000 0000000 000000000 0000 00000000 Q ss_pred eccccceeec---------ccccceeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeecccceEEEEecc Q lcl|NC_018835. 548 ATTHYETPQV---------KEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTN 618 (1005) Q Consensus 548 ~~~~~~~~~~---------~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~~t~t~t~~~ 618 (1005) .+.+.-+. .....-+-+..++.......+... .-+. -.-++.......+...... + T Consensus 53 --~~~~~~g~~~~~g~ly~v~g~~LY~V~~~~~~~~iG~i~gs----g~Vs------Ma~ng~q~vi~~G~~gY~y---d 117 (458) T protein:vir:10 53 --PTFPVMAMHQNGSRAFAVTPRDMYEISKDGTYKRLGSVDFK----GRVV------MEDNGKQIVMVDGEKGYYY---D 117 (458) T ss_pred --CCCceeeEEecCCEEEEeeCceEEEEeCCceEEEEecccCc----eeEE------EeeCCcEEEEEECCeEEEE---e Confidence 00000000 000000001111110000000000 0000 0000000111111100000 0 Q ss_pred cCceEEEEEeeccccccccccccceeeee-ccccccceeeeeeeeccCcccceeeeeccceeeeccccCceEEEEecCCE Q lcl|NC_018835. 619 YPLRLRWSNFANENKAPTLWDDFAYDRVV-SSDLASNIVGQTQALENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFET 697 (1005) Q Consensus 619 ~~~~~~~~~~~t~~~~~~t~~~~~t~~~~-~~~~~~~t~~~t~~~t~~~~~~~~~t~t~~~~i~~~~~g~~~v~~~~~~~ 697 (1005) ..+..........-......+..+..-+- .......-++.......+---...-...+..++......+..+.+-...+ T Consensus 118 ~at~~~~~i~d~~~~~~~~v~~~dGy~V~~~~g~~~~~is~L~d~s~d~l~fa~Ae~~pD~iv~i~~~~~~i~~fG~~Ti 197 (458) T protein:vir:10 118 SETEIVQEIKAEGFYPASTVTYQDGYFIFDRKGTGQFFISELLDVAFDPLDFATAEGQPDPLLAVLSDHREVFMFGQETI 197 (458) T ss_pred ecccEEEeccCccccCcceEEEeCcEEEEEeeCCCEEEEEecCcceeCcceeeeecCCCCceEEEEeeccEEEEEeccce Confidence 00000000000000000000000000000 00000000011000000000122223344566777777888899999999 Q ss_pred EEEEEeCCCCceEEEEEe---ccccccccCceeEEeCCeEEEEeCCc-EEEeCCceecccccchHHHHHHhhcCcchhcc Q lcl|NC_018835. 698 YIGSPTNNTYQPLMFKKL---FNDSGILAPECVVEVEGSHFVVTQND-VILHNGATKKSIASNRVKNMLINEVCLVNPLA 773 (1005) Q Consensus 698 ~~~t~t~~t~~~~~~~~~---~~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 773 (1005) -+..-+|++. |+|+.. .-..||.+|.|+..+++.+||++|++ +|+.+|-+++.|....|.+-+ +.+..+ . T Consensus 198 Evw~ntG~a~--fpy~r~~ga~i~~Gcaa~~sv~~~~~t~~~l~~d~~Vy~l~g~~~~rIST~aIE~~i-~sy~~~---d 271 (458) T protein:vir:10 198 EVWYNSGAAD--FPFERNQGAFIEKGIGAPYSVAKTNNTVYFIGSDLMIYQITGYTPVRISTHAVEQTL-KGVNLS---D 271 (458) T ss_pred EEEEecCCCC--cceeecccceeeecccCcchhhhhCceEEEEcCCeEEEEecCceeEEeeCHHHHHHH-hcCChh---h Confidence 9999988875 667663 34889999999999999999999776 569999999999999999996 555443 3 Q ss_pred EEEE-EcCCCCEEE-EEEeccCCCcCCcccceEEEEecccCceeeeeecceeeeeeeecccccCCccccCCccccCCCcc Q lcl|NC_018835. 774 TRVH-LHQDKKEVW-VLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPIWSDFQEITWDDPS 851 (1005) Q Consensus 774 ~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 851 (1005) .+++ +..+.++.| ..||+ .++-.+||-..+.|..|. + +.++ ++-+++..|.+ T Consensus 272 a~a~t~~~eGH~fy~LtfP~---------a~~Tw~yD~~t~~Wher~---S---g~~~-~~Ra~~~v~~~---------- 325 (458) T protein:vir:10 272 AFAYTYQSEGHLFYVLTIPG---------KNLTWCYDISSGSWHVRQ---S---YQFD-RHVSNNSIYFD---------- 325 (458) T ss_pred eEEEEEEecCeEEEEEECCC---------CCceeEEecccccceeec---c---CCCC-ceEEEEEEEeC---------- Confidence 4444 444445443 45554 566678999999999974 0 1111 11111111111 Q ss_pred ceeeecCccccCCcccEEEEeecCCc-eEEEecccccccccccceeeeeccc---eeeeccccccccccCcccceeeeee Q lcl|NC_018835. 852 IKELVWRKDATNFRQRVTIVGSFLRG-FYQVDVGALDYFYDRLNDVVIEKPL---EMRLERTGLDFDNVTNEWNQKHINR 927 (1005) Q Consensus 852 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 927 (1005) -..++|+.+.+ .|.++-+.....++-..-...+-++ +.+|....+ T Consensus 326 ---------------g~~~vGD~~ng~ly~ld~~~~td~g~~i~~~~~~p~~~~~~~rl~~~~~---------------- 374 (458) T protein:vir:10 326 ---------------QKTLVGDFQNGRIYIMADNYYTDDGDPVVREFILPVVNNGREFLTVDSL---------------- 374 (458) T ss_pred ---------------CeEEEEEcCCCeEEEEcccCcCCCCceeeeeeeccceeCCCCeEEEEEE---------------- Confidence 12455665443 4555444333323222211111111 011111111 Q ss_pred eeeEeCCCcEEEEEeCeeeC------ccCCceeCCcccccCCcceEE-EEEeCCce--------------EEEEEEEcCC Q lcl|NC_018835. 928 FRPQTTGSGTYTFEAGGSQF------SNEYGHPHTSKTYTIGVDRHV-SVRLNHPY--------------LFYNVIDNDV 986 (1005) Q Consensus 928 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--------------~~~~~~~~~~ 986 (1005) ++.++.|.-.- |.--+.||.-.-.+-+.+.++ .+-.-||| +-|||+-.+ T Consensus 375 ---------el~~~tGvg~~~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~rvf~v~~s~- 444 (458) T protein:vir:10 375 ---------ELDLSSGVGLTVGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQFTFKVEISD- 444 (458) T ss_pred ---------EEEEecceeeeeCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhhhhhhhhccCcceEEEEEEec- Confidence 12222221100 111233443333332333333 23333444 123331111 Q ss_pred CceEEEEeeeEEEe Q lcl|NC_018835. 987 NSNAAINGLTIEFA 1000 (1005) Q Consensus 987 ~~~~~~~~~~~~~~ 1000 (1005) --+-.|.|.-+++- T Consensus 445 p~~~~l~ga~~~~r 458 (458) T protein:vir:10 445 PIPVDIGGAWVEVR 458 (458) T ss_pred chhhcceeeeEEeC Confidence 22334677777665 No 10 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=96.77 E-value=0.00035 Score=39.53 Aligned_cols=433 Identities=11% Similarity=0.029 Sum_probs=159.4 Q ss_pred ccceeeeeeccccceeEEEEeecCCceEEeeecCCcceecccccccceeEEEecCCceEEEeeeccccceeeEEEeeccc Q lcl|NC_018835. 458 LDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~~~~~~~~t~t~~~ 537 (1005) .......+..|.... +......+...+... +.-.......+.... ..|- ...+...+..-. T Consensus 1 m~~~~~pl~~G~~~~-~~~~d~~~~~pVN~~------a~~~~~~~s~~~l~~-tPGl-~~~a~v~G~~RG---------- 61 (472) T protein:vir:10 1 MPIQQLPLMKGVGKD-FRNADYIDYLPVNML------ATPKEILNSSGYLRS-FPGI-AKRSDVNGVSRG---------- 61 (472) T ss_pred CCeeeeeeccCceee-ccccchhheeeeeee------eeccCCCcccceeec-CCCc-eeeccCCccccc---------- Confidence 000000010000000 000000000000000 000000000000000 0000 000000000000 Q ss_pred ccceeeceeeeccccceeecccccceeEEecCCceeEEEEEccCCCceeEEeeccceeeeccceEEEeeccc-ceEEEEe Q lcl|NC_018835. 538 YHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMRE-ANASGVT 616 (1005) Q Consensus 538 ~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~~~~~~~~~~~~~-~t~t~t~ 616 (1005) .........-+-+ .+.....+.. ....+..+.-..-..+.....+..+. ..-..-. T Consensus 62 ----------------~~~~~~~g~lY~V--~G~~LY~v~~-----~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~yd 118 (472) T protein:vir:10 62 ----------------VEYNMAQNAVYRV--CGGKLYKGES-----EVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYD 118 (472) T ss_pred ----------------eEEEeeCCeEEEE--ecceEeeeec-----ceecccCcccEEEecCCcEEEEEECCceeEEEee Confidence 0000000000000 0000000000 00000000000000000000000000 0000000 Q ss_pred cccCceEEEEEeeccccc------cccccccceeee-eccccccceeeeeeeecc--CcccceeeeeccceeeeccccCc Q lcl|NC_018835. 617 TNYPLRLRWSNFANENKA------PTLWDDFAYDRV-VSSDLASNIVGQTQALEN--GYAGYIDLADSNGSLIDILPLKD 687 (1005) Q Consensus 617 ~~~~~~~~~~~~~t~~~~------~~t~~~~~t~~~-~~~~~~~~t~~~t~~~t~--~~~~~~~~t~t~~~~i~~~~~g~ 687 (1005) +........ +...... ....+..+..-+ ........-++....... .......-...+..++......+ T Consensus 119 ~~v~t~~~~--~~d~~~p~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~ 196 (472) T protein:vir:10 119 GTVKTVSNW--PTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRD 196 (472) T ss_pred ccchhhhcc--ccccccccccccceeeeeeecceEEEeccCcceEEEeccCCccccccccccccccCCCCceEEEEeecc Confidence 000000000 0000000 000000000000 000000000111011000 01111122334456677777788 Q ss_pred eEEEEecCCEEEEEEeCCCCc-eEEEEE---eccccccccCceeEEeCCeEEEEeCC-----cEEEeCCceecccccchH Q lcl|NC_018835. 688 YLFVYTEFETYIGSPTNNTYQ-PLMFKK---LFNDSGILAPECVVEVEGSHFVVTQN-----DVILHNGATKKSIASNRV 758 (1005) Q Consensus 688 ~~v~~~~~~~~~~t~t~~t~~-~~~~~~---~~~~~g~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ 758 (1005) ..+.+-...+-+..-+|++.. -|.|+. +.-..||.+|.|+..+++.+||++|+ -+|+.+|.+++.|-...| T Consensus 197 ~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aI 276 (472) T protein:vir:10 197 FIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPIASASI 276 (472) T ss_pred EEEEEeccceEEEEecCCCCcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHH Confidence 889999999988988887753 377776 56689999999999999999999996 478999999999999999 Q ss_pred HHHHHhhcCcchhccEEEEEcCCCCE-E-EEEEeccCCCcCCcccceEEEEecccCceeeeeecceeeeeeeecccccCC Q lcl|NC_018835. 759 KNMLINEVCLVNPLATRVHLHQDKKE-V-WVLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERG 836 (1005) Q Consensus 759 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~ 836 (1005) .+- ++.++...+...+++.-++.+. . +.+|| +|-++||-.++.|--| +-+.. -|....++-+.. T Consensus 277 E~~-i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP-----------~~Tw~yD~~t~~Wher-w~~~~-~g~~~~~~Ra~~ 342 (472) T protein:vir:10 277 EKI-LRSYTADELADGVMESLRFDAHELLIIHLP-----------RHVLVYDASSSANGPQ-WCVLK-TGLYDDVYRAID 342 (472) T ss_pred HHH-HHhcCCccccceeEEEEEeCCeEEEEEEcC-----------CceeEeecccccCcee-eeeec-CCCccCceEEEE Confidence 998 6777765555555555443333 2 23333 5789999999888876 11100 000111111111 Q ss_pred ccccCCccccCCCccceeeecCccccCCcccEEEEeecCCc-eEEEecccccccccccceeeeeccceeeeccccccccc Q lcl|NC_018835. 837 PIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLRG-FYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGLDFDN 915 (1005) Q Consensus 837 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 915 (1005) +.|. --.+++|+.+.+ .|.++-+....+++ |.+.++.=+-++.|. T Consensus 343 ~~~~-------------------------~g~~~vGD~~ng~ly~l~~~~~td~G~---------~i~~~~~~p~~~~d~ 388 (472) T protein:vir:10 343 FIYE-------------------------GNQITCGDKLESVTGKLQFDISSQYGL---------QQEHLLFTPLFKADN 388 (472) T ss_pred EEEe-------------------------CCeEEEEEcCCCeEEEEcccCcCcCCC---------cceEEEeccceeCCC Confidence 1111 123566676444 55666554333232 222333333334432 Q ss_pred cCcccceeeeeeeeeEeCCCcEEEEEeCeeeC--ccCCceeCCcccccCCcceEEEEEeCCceE---------------E Q lcl|NC_018835. 916 VTNEWNQKHINRFRPQTTGSGTYTFEAGGSQF--SNEYGHPHTSKTYTIGVDRHVSVRLNHPYL---------------F 978 (1005) Q Consensus 916 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~ 978 (1005) . | +| +-++-+.+|.-+. |---.+|+..+ +-+.+..+...+.|||. . T Consensus 389 ~-------R---v~-----d~~ve~~~G~~~~adp~~~~~~sDg~--~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~vg 451 (472) T protein:vir:10 389 A-------R---CF-----DLEVESSTGVAQYADRLFLSATTDGI--NYGREQMIEQNEPFVYDKRVLWKRVGRIRKNVG 451 (472) T ss_pred C-------e---EE-----EEEEEeecCCCcccCceEEEeccCCc--ccchhhhhhhccCcccccceeeeeeeeccccce Confidence 1 2 22 1123333333221 11123555532 33456667777788881 1 Q ss_pred EEEEEcCCCceEEEEeeeEEEe Q lcl|NC_018835. 979 YNVIDNDVNSNAAINGLTIEFA 1000 (1005) Q Consensus 979 ~~~~~~~~~~~~~~~~~~~~~~ 1000 (1005) +||+- ..-.+-.|+|.-+++- T Consensus 452 f~~r~-~~~~~v~l~ga~~~~e 472 (472) T protein:vir:10 452 FKLRV-ITKSPVTLSGAQIRIE 472 (472) T ss_pred EEEEE-EeccccceeeeeEEeC Confidence 22210 1112223444433333 No 11 >protein:vir:118 Length: 449 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690641;swissprot:sw:q37888;genbank:gi:22855155;interpro:IPR003343;uniprot:Q37888;genbank:GeneID:955370 Probab=96.14 E-value=0.00072 Score=37.83 Aligned_cols=128 Identities=7% Similarity=-0.009 Sum_probs=60.4 Q ss_pred CeE-----EEeeeeceEEEEcCCcc-eeeeccceecceeceeccceeEecCCceeEEEEEecCCcccccccccceEEEEc Q lcl|NC_018835. 1 MAL-----YPIKSLGAVGVIADQAP-TDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFVSMPFDYYSA 74 (1005) Q Consensus 1 ~~~-----y~i~s~g~~tv~~~~~~-~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~t~~t~t~~~~t~~ss 74 (1005) -.+ |.+.............. ...........+...+ .......|.+..++++.... ++.+..++|+++ T Consensus 316 ~G~y~n~~~tvt~t~~~~~~~~~~a~~~~~~~~~VTsVsVtP-ss~tL~~G~T~qLTATV~ps-----natnk~VTWSsS 389 (449) T protein:vir:11 316 RGLYWNYYYHVWQVLSASRFANAVAFVTGDDVPAVTQVIVSP-AIASVKQGKSQAFTAYVRAT-----DDKEHEVVWSVD 389 (449) T ss_pred cceeeccceEEEEEEecccccceeeeeeeeccceeeEEEeec-cceeeecCceEEEEEEEecC-----CCCCceEEEEEe Confidence 001 11111111111100000 0000000111111111 12334566677777665433 445566778777 Q ss_pred CCcEEEEecCCceEEEEecceEEEEEEecCcceEEEEEeeeccccceeeecCccceEecCCce Q lcl|NC_018835. 75 GNSFLVVGTDKKLYKLTDESLTDISRKVATVTKKASASIKIYPVVSQIVPKESTISMNFNQTK 137 (1005) Q Consensus 75 ~~s~~~t~~~~g~~t~t~~~t~~~t~~~~~~t~t~t~tvt~~~~v~~it~~~~~~t~~~g~t~ 137 (1005) +.+..+++...|.+++.+.|++.+++.....+...++.+++... ..+++.+.... .|... T Consensus 390 d~s~~ATVda~G~VTAva~GTAtITAta~~~s~TaT~tvtV~~~-a~VtVtP~sa~--ggaqA 449 (449) T protein:vir:11 390 GGSTGTSISSDGVLTVAANETNQLTVKATVDIGTADEPKPVVGE-AVVNVRPDSST--GGAQA 449 (449) T ss_pred CCceEEEEcCCceEEEecCccEEEEEEEecCcEEEEEEeeecce-EEEEEeecCCC--CcccC Confidence 77777788899999999999999999877766666665544321 22333332211 11111 No 12 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=93.71 E-value=0.0066 Score=32.53 Aligned_cols=426 Identities=12% Similarity=0.075 Sum_probs=143.6 Q ss_pred CcceecccccccceeEEEecCCce------EEEeeeccccceeeEEEeecccccceeeceeeeccccceeecc--cccce Q lcl|NC_018835. 492 TGYVSTTSVTGKSIKLVALRKGEI------NVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVATTHYETPQVK--EFESE 563 (1005) Q Consensus 492 ~~~~tv~~~~~~t~~vt~~~~gt~------Tit~~~~~~~~~~~~~t~t~~~~~~~~~~~~v~~~~~~~~~~~--~~~~~ 563 (1005) -.+..+- ...+...+.....+ ...+......+.... ... ..+...-..+. ...-+.- ..... T Consensus 1 m~~~q~p---l~~g~~~~~~~~~~~~~lpvN~y~~p~~~~~ss~~--lr~--~PG~~~~~~~~---g~~RG~~~~~~~~~ 70 (472) T protein:vir:10 1 MAIMQLP---LLRGLGKARDDADYIDALPVNMLATPKPVLNASGY--LRS--FPGITHKAEVA---GVSRGVQYNTHEKT 70 (472) T ss_pred CCceeee---cccccccCccccCceeeeeeeeeecccccccccee--ecc--cCCceeecCCC---cccceeEeeeeCCe Confidence 0000000 00000000000000 000000000000000 000 00000000000 0000000 00000 Q ss_pred eEEecCCceeEEEEEccCCCceeEEeeccceeee--ccc-eEEEeecccceEEEEecccCceEEE---EEeecc-ccccc Q lcl|NC_018835. 564 YFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRS--FNN-RLFALNMREANASGVTTNYPLRLRW---SNFANE-NKAPT 636 (1005) Q Consensus 564 ~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~~--~~~-~~~~~~~~~~t~t~t~~~~~~~~~~---~~~~t~-~~~~~ 636 (1005) .....+.. -..+... ...+. .+.-+.- ..+ ..+.+ .++.....-.....+.... ....+. -.... T Consensus 71 lY~V~G~~-Ly~v~~~-----vG~ia-gsg~VsMa~~~~~q~v~v-~g~~~~y~y~g~~~t~~~~~~~~~it~~dl~~~~ 142 (472) T protein:vir:10 71 VYRGLGNQ-LYKGHKP-----IADLA-GKGRISMAFSRNSQAVVA-AGKMTLYRYDGTVKTLENWPKEKKYTQYDIGNVR 142 (472) T ss_pred EEEEecce-EEEEEee-----eeeec-ccccEEEEecCCceEEEE-ecceeEEEeccchhhhhhccccccCCccccCCce Confidence 01111111 0100000 00000 0000000 000 00000 0000000000000000000 000000 00000 Q ss_pred cccccceeee-eccccccceeeeeeeeccCcc--cceeeeeccceeeeccccCceEEEEecCCEEEEEEeCCCCceEEEE Q lcl|NC_018835. 637 LWDDFAYDRV-VSSDLASNIVGQTQALENGYA--GYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFK 713 (1005) Q Consensus 637 t~~~~~t~~~-~~~~~~~~t~~~t~~~t~~~~--~~~~~t~t~~~~i~~~~~g~~~v~~~~~~~~~~t~t~~t~~~~~~~ 713 (1005) ..+..+..-+ .....-..-++........+. ....-...+..++......+..+.+-...+-+..-+|++. |+|+ T Consensus 143 ~v~~~dGyfV~~~~gt~~~~iS~L~d~s~~~~~~~FatAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~--fpf~ 220 (472) T protein:vir:10 143 DMCHLRGRYVWCKDGSDIFGVTDLEDESHPDRYRALYRAESQPDGIIGIDSWRDFIVCFGASTIEYFSLTGAAD--GQSA 220 (472) T ss_pred eEEEeCceEEEeecCCceEEEeecCCcccCCcccceeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCC--ccee Confidence 0000000000 000000000111010011111 1223333456667777778888999999998999888875 7777 Q ss_pred Ee------ccccccccCceeEEeCCeEEEEeCCc-----EEEeCCceecccccchHHHHHHhhcCcchhccEEEEEcCCC Q lcl|NC_018835. 714 KL------FNDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIASNRVKNMLINEVCLVNPLATRVHLHQDK 782 (1005) Q Consensus 714 ~~------~~~~g~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 782 (1005) .. +-..||.+|.|+..+++.+||++|+. +|+.+|.|++.|-...|.+- ++.++...++..+++.-++. T Consensus 221 r~~~~pg~~iq~Gcaa~~sv~~~~~s~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~-i~~y~~~e~~dA~~~s~~~e 299 (472) T protein:vir:10 221 IYAAQPALMVEKGIAGTHCKTRLGDAHVIISHQATGAPSVFLINQAQATSIATATIEKI-LRSYTHDELASAVMETVRFD 299 (472) T ss_pred eeccCccceeeecccCchhhhhhCceEEEEecCCCcceEEEEccCceEEEecCHHHHHH-HHhCCcccccceeEEEEEeC Confidence 53 35699999999999999999999996 89999999999999999998 67777666666666655444 Q ss_pred CE--EEEEEeccCCCcCCcccceEEEEecccCc----eeeeeecceeeeeeeecccccCCccccCCccccCCCccc--ee Q lcl|NC_018835. 783 KE--VWVLYVGPGEPKESFACTKAAVWNYEFDT----WSFRTIPYAQCIGLVDPPVLERGPIWSDFQEITWDDPSI--KE 854 (1005) Q Consensus 783 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 854 (1005) +. .+.+|| ++-++||-.++. |+.+.- |.++..+-+....|.+-|-+-.|.... =. T Consensus 300 GH~fy~LtfP-----------~~Tw~yD~at~~~~~~w~~~~~------g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ 362 (472) T protein:vir:10 300 SHELVLIHLS-----------RQVLCYDAAANQNGLQWSLLKT------GFYHAPYRGIDFMFADHHLTCGDKNDSLLGQ 362 (472) T ss_pred CeEEEEEEcC-----------CeeEEEeccCCccceeeeeeec------CCccCceEEEEEEEeCCeEEEEEcCCCeEEE Confidence 44 333444 457888855554 444431 111111111122222222222111111 11 Q ss_pred eecCccccCCcc-cEEEEeecCCceEEEecccccccccccceeeeeccceeeeccccccccccCcccceeeeeeeeeEeC Q lcl|NC_018835. 855 LVWRKDATNFRQ-RVTIVGSFLRGFYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGLDFDNVTNEWNQKHINRFRPQTT 933 (1005) Q Consensus 855 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 933 (1005) +.|+.++++..| ++++..+. .-+-+.+ .|+ |+ | ---+|..+- --+++++.+ T Consensus 363 ld~~~~td~g~pi~~~~~tp~-----~~~~n~R-vfd--~e-------l---~~~tGvg~~----------~~~v~L~wS 414 (472) T protein:vir:10 363 LDFASSAQYEKPQEHVLYTPL-----FKADNAR-VFD--FE-------L---EASTGVAHI----------ADRLFLSAT 414 (472) T ss_pred EcCcCcCCCCceeEEEeeccc-----eecCCCe-EEE--EE-------E---EeeCCcCcc----------CceEEEEEe Confidence 123444444333 11111111 1111111 000 00 0 011122211 113556654 Q ss_pred CCcEE-----EEEeCeeeCccCCceeCCcccccCCcceEEEEEeC-CceEEEEEEEcCCCceE--EEEeeeEE Q lcl|NC_018835. 934 GSGTY-----TFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSVRLN-HPYLFYNVIDNDVNSNA--AINGLTIE 998 (1005) Q Consensus 934 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~--~~~~~~~~ 998 (1005) ++|.. .++-+++---..-++|+- +|.- |.. | +.|||+.. .|. ++.|.-+| T Consensus 415 ddg~~~~~~~~~~~~g~~~~~~r~~w~R-----lG~a-----r~~vg--f~~rv~~s---~pv~~~~~~a~~e 472 (472) T protein:vir:10 415 ADGLHFGREQMINQNAPFAYDRRILWRR-----MGRV-----RKNLG--FKVRVITS---SPVTLSGCQIRME 472 (472) T ss_pred ccccccchhHHHhhcCccchhheeeehe-----eecc-----ccccc--eEEEEEEe---cccccccceeeeC Confidence 44311 111111111122234431 1111 111 1 24555222 233 23444444 No 13 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=436 Identities=12% Similarity=0.048 Sum_probs=151.2 Q ss_pred ccceeeccceeeeeeccccceeEEEEeecCCceEEeeecCCcceecccccccceeEEEecCCceEEEeeeccccceeeEE Q lcl|NC_018835. 452 PLSSVTLDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDA 531 (1005) Q Consensus 452 ~~~~~t~~~~~~t~~~g~~~t~tat~~~~~~~~vt~sss~~~~~tv~~~~~~t~~vt~~~~gt~Tit~~~~~~~~~~~~~ 531 (1005) .....-.......+..|... +. .......+- .-.-.+..-......+-.. ...|- ...+...+. T Consensus 1 ~~~~~~m~~~~ipl~~g~~~---~~---~~~d~~~~~-PVN~~a~p~~~~~s~~~L~-~~pG~-~~~~~~~G~------- 64 (477) T protein:vir:35 1 MLSEVFMPKIQIPLAKGLVK---DI---KTADYIDAL-PVNMLATPKEVLNASGYLR-SFPGI-EKKQDAKGV------- 64 (477) T ss_pred Ccccceeeeecccccccccc---cc---ccccceeee-eeccceeeccccccccccc-cCCcc-eeeccCCcc------- Confidence 00000000000000000000 00 000000000 0000000000000000000 00000 000000000 Q ss_pred Eeecccccceeeceeeeccccceeec---ccccceeEEecCCceeEEEEEccCCCceeEEeeccceee-eccceEEEe-e Q lcl|NC_018835. 532 FDDYPWYHAVISNCAVATTHYETPQV---KEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVR-SFNNRLFAL-N 606 (1005) Q Consensus 532 t~t~~~~~~~~~~~~v~~~~~~~~~~---~~~~~~~t~~~~~t~~~t~t~~~~~~t~~tvt~tss~~~-~~~~~~~~~-~ 606 (1005) .-+. .....-+ ...+ .....+.. ....+ +.+..+. ..+.....+ . T Consensus 65 ----------------------~RG~~~~~~~g~lY-~V~G-~~LY~v~~-----~vG~I-~gsg~VsMa~n~~~~aIv~ 114 (477) T protein:vir:35 65 ----------------------SRGVHFNTKNNALY-RVCG-NTLYRNDK-----EVADI-AGMSRVSMSHSSHSQAICF 114 (477) T ss_pred ----------------------ccceeEeecCCeEE-EEec-CeeEeeee-----eeeee-cccccEEEeeCCcEEEEEE Confidence 0000 0000000 0000 00000000 00000 0000000 001100000 0 Q ss_pred cccceEEEEecccCceEEEEEeeccccc------ccccccccee-eeeccccccceeeeeeee--ccCcccceeeeeccc Q lcl|NC_018835. 607 MREANASGVTTNYPLRLRWSNFANENKA------PTLWDDFAYD-RVVSSDLASNIVGQTQAL--ENGYAGYIDLADSNG 677 (1005) Q Consensus 607 ~~~~t~t~t~~~~~~~~~~~~~~t~~~~------~~t~~~~~t~-~~~~~~~~~~t~~~t~~~--t~~~~~~~~~t~t~~ 677 (1005) .+......- +.. ............+ ....+..+.. .-........-++..... .........-...+. T Consensus 115 ~g~~~gy~y--~~t-~~~~~~~~~~~~p~~~l~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s~~d~~~~FasAE~~pD 191 (477) T protein:vir:35 115 EGKVKLYRY--DGT-EKALSNWPKDKYPQYDLGEVIDVCRNRGRYIWLQKGGERFGVTDLEDESKPDRYQPFYRAESQPD 191 (477) T ss_pred CCcceeEEE--ecc-cceeeecCccccCCccccceeEEEeeCceEEEeecCCCeEEEeecCCccccccccccccccCCCC Confidence 000000000 000 0000000000000 0000000000 000000000000100100 111111222333445 Q ss_pred eeeeccccCceEEEEecCCEEEEEEeCCCCceEEEEEec----cccccccCceeEEeCCeEEEEeCCc-----EEEeCCc Q lcl|NC_018835. 678 SLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLF----NDSGILAPECVVEVEGSHFVVTQND-----VILHNGA 748 (1005) Q Consensus 678 ~~i~~~~~g~~~v~~~~~~~~~~t~t~~t~~~~~~~~~~----~~~g~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 748 (1005) .++......+..+.+-...+-+..-+|++.-.|+|+... -..||.+|.|+..+++.+||++|+. +|+.+|. T Consensus 192 ~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~f~~p~~r~~~~~mIq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~ 271 (477) T protein:vir:35 192 GIVSVDAWRDLIVCFGSSSIEYFTLTGSADTSQPLYIHQAAYMIQAGIAGRDCKCRYQDKYAILSHQSTGQPAVYLIGAG 271 (477) T ss_pred ceEEEEeeccEEEEEeccceEEEEecCCCCCCcceeecCCceeeeecccCchhhhhhCceEEEEecCCCcccEEEEccCc Confidence 667777778889999999999999999887666666654 6899999999999999999999963 7899999 Q ss_pred eecccccchHHHHHHhhcCcchhccEEEEEcCCCCE--EEEEEeccCCCcCCcccceEEEEecccC----ceeeeeecce Q lcl|NC_018835. 749 TKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKE--VWVLYVGPGEPKESFACTKAAVWNYEFD----TWSFRTIPYA 822 (1005) Q Consensus 749 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~w~~~~~~~~ 822 (1005) |++.|-...|.+- ++.....+..+.|+..-++.+. .+.+|| ||-++||-.++ +|+.+.- T Consensus 272 q~~rIST~aIE~~-i~ay~~~e~a~af~~t~~~eGH~fy~LtfP-----------~~Tw~yD~at~~w~e~W~~~~~--- 336 (477) T protein:vir:35 272 EKNKISTATIDKI-IRYYSADELAASFMESIRFDNHELLLLHLP-----------KHTLCFDGSASHQYSQWSLLKS--- 336 (477) T ss_pred eeEEecCHHHHHH-HHhcCCcchhceeEEEEEeCCeeEEEEEcC-----------CceEEEecccccccceeeeecc--- Confidence 9999999999998 5666666555555444333333 233333 56788986665 5666530 Q ss_pred eeeeeeecccccCCccccCCccccCCCccceeeecCccccCCcccEEEEeec-CCceEEEecccccccccccceeeeecc Q lcl|NC_018835. 823 QCIGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSF-LRGFYQVDVGALDYFYDRLNDVVIEKP 901 (1005) Q Consensus 823 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 901 (1005) |.+..++-+....|.+ -..++|+. +...|.++-.....+++... T Consensus 337 ---g~~~~~~Ra~~~~~~~-------------------------g~~~vGD~~ng~l~~ld~~~~~d~g~~i~------- 381 (477) T protein:vir:35 337 ---GFYDEPYRAIDFMFFD-------------------------NQITVGDKKEGVLGHLIFNASNQYEQQTE------- 381 (477) T ss_pred ---CCccCceEEEEEEEeC-------------------------CeEEEEEcCCCeEEEECCCCcccCCCccc------- Confidence 0001111111111111 12445555 44445555444333333322 Q ss_pred ceeeeccccccccccCcccceeeeeeeeeEeCCCcEEEEEeCeeeCcc-CCceeCCcccccCCcceEEEEEeCCceEEEE Q lcl|NC_018835. 902 LEMRLERTGLDFDNVTNEWNQKHINRFRPQTTGSGTYTFEAGGSQFSN-EYGHPHTSKTYTIGVDRHVSVRLNHPYLFYN 980 (1005) Q Consensus 902 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 980 (1005) .++.=+-++.|.. |+..+ ++-+..|.-+..+ --..|+.- -.+-+.++.+.....|||. -| T Consensus 382 --~~~~~p~~~~d~~-------Rv~~~--------el~~~tGvgq~~d~v~L~~sdd-G~~~~~~~~~~~g~~g~~~-~r 442 (477) T protein:vir:35 382 --HLLYTPMIKADNA-------RLFDF--------ELEASTGVAQIADKLFLSVTTD-GINYSREQLIEQNSPFQYD-KR 442 (477) T ss_pred --eEEecceeeCCCC-------eEEEE--------EEEEecCcCccCceEEEEEecc-ccccccceeecCCCccccc-cc Confidence 2222223333321 22111 1111211111100 12233332 2233367788888888881 11 Q ss_pred EEEcCCCceEEEEeeeEEEee--------cCCC Q lcl|NC_018835. 981 VIDNDVNSNAAINGLTIEFAV--------GGRR 1005 (1005) Q Consensus 981 ~~~~~~~~~~~~~~~~~~~~~--------~~~~ 1005 (1005) ++=-+-+.-=..-||.+.+.- .+.| T Consensus 443 ~~~~RlG~~r~~vgf~~r~~~~~pv~l~~~~~~ 475 (477) T protein:vir:35 443 ILWRRIGRVRKNIGFKIRIITKSPVTLSDLSIR 475 (477) T ss_pred eeeeeeeeceeccceEEEEEecCCceeccceeE Confidence 100000000001233333222 1222 No 14 >protein:vir:5202 Length: 448 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040725;genbank:gi:9626396;genbank:GeneID:1260967 Probab=93.29 E-value=0.0046 Score=33.38 Aligned_cols=127 Identities=9% Similarity=-0.001 Sum_probs=54.1 Q ss_pred Ce-----EEEeeeeceEEEEcCCcc-eeeeccceecceeceeccceeEecCCceeEEEEEecCCcccccccccceEEEEc Q lcl|NC_018835. 1 MA-----LYPIKSLGAVGVIADQAP-TDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFVSMPFDYYSA 74 (1005) Q Consensus 1 ~~-----~y~i~s~g~~tv~~~~~~-~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~t~~t~t~~~~t~~ss 74 (1005) =+ .|+|.............. +.... .....+... -.......|.+..+++.... .+..+..++|..+ T Consensus 316 ~GL~~N~~~TItatss~~~~t~atA~V~~t~-paVtsVsVs-PttasL~~G~TqqlTATVsg-----~na~~~~VTWSvS 388 (448) T protein:vir:52 316 RGLYWNYYYHVWQTLSVSRSANAVAFVSGDV-PAVTQVIVS-PNIAAVKQGGKQQFTAYVRA-----TDGKDHKVVWSVE 388 (448) T ss_pred ccceeeeeeEEEEEEccCccccceEEEEecc-cccceEEEc-ccceeecCCCeEEEEEEEec-----CCCCCCceEEEEc Confidence 22 334444333222221111 11111 001111111 12344566777777776653 3344556788877 Q ss_pred CCcEEEEecCCceEEEEecceEEEEEEecCcceEEEEEeeeccccceeeecCccceEecCCce Q lcl|NC_018835. 75 GNSFLVVGTDKKLYKLTDESLTDISRKVATVTKKASASIKIYPVVSQIVPKESTISMNFNQTK 137 (1005) Q Consensus 75 ~~s~~~t~~~~g~~t~t~~~t~~~t~~~~~~t~t~t~tvt~~~~v~~it~~~~~~t~~~g~t~ 137 (1005) +++...+++..|.+++.+.++..+++.+...+........+. ....+++.|...+ .|... T Consensus 389 ~ns~~aTVsssG~vTv~a~gTatITVtATvdts~a~~~~~vv-~ea~VsvtP~~as--~G~q~ 448 (448) T protein:vir:52 389 GGSTGTAITGDGLLSVSGNEENQLTVKATVDIGTEDKPNLVV-GEAVVSIRPNNAS--GGAQA 448 (448) T ss_pred CCceeeEEeCCccEEeccCCcceEEEEEEecCcccCCceeee-eeEEEEecCCCCC--CcCCC Confidence 777668889999999887777666665432111111111000 0011222222211 11000 No 15 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=85.41 E-value=0.053 Score=27.58 Aligned_cols=112 Identities=9% Similarity=0.065 Sum_probs=46.2 Q ss_pred CeEEEeeeec---------eEEEEcCC-cc--e-eeeccceecceece----eccceeEecCCceeEEEEEecCCccccc Q lcl|NC_018835. 1 MALYPIKSLG---------AVGVIADQ-AP--T-DLAPNAFTNAINAR----FVEQRVFKTGGNAPLSYVDEDKDLTPLS 63 (1005) Q Consensus 1 ~~~y~i~s~g---------~~tv~~~~-~~--~-~~~~~~~t~~~~~~----~~~~~~~~~~~t~~~t~~~~~~~~t~~t 63 (1005) +..|...... ....+... .. . ..........+... .........+....+..+........ T Consensus 262 ~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~-- 339 (392) T protein:vir:99 262 LVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDD-- 339 (392) T ss_pred eecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeecccceeEeeeccceeEEEEEEecCCcc-- Confidence 0001100000 00000000 00 0 00000000000000 00011222233333333333322221 Q ss_pred ccccceEEEEcCCcEEEEecCCceEEEEecceEEEEEEecC--cceEEEEEeeec Q lcl|NC_018835. 64 FVSMPFDYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVAT--VTKKASASIKIY 116 (1005) Q Consensus 64 ~t~~~~t~~ss~~s~~~t~~~~g~~t~t~~~t~~~t~~~~~--~t~t~t~tvt~~ 116 (1005) ... ...|++++..+++++..|.+++...|.+++++.... ...+.++.+++. T Consensus 340 -~~~-~vtw~Ssn~~vAtV~~~G~Vt~v~~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 340 -VTA-LCDFESSATDKATVAAGGLVTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred -ccc-eEEEEEcCCeeEEEcCCceEEEEecceEEEEEEEEcCCCcEEEEEEEEeC Confidence 112 234667778888888999999999999999998753 445666666665 No 16 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=22.04 E-value=1.7 Score=19.34 Aligned_cols=113 Identities=10% Similarity=-0.055 Sum_probs=43.2 Q ss_pred eeeeecccccCCccccCCccccCCCccceeeecCccccCCcccEEEEeecCCceEEEecccccccccccceeeeecccee Q lcl|NC_018835. 825 IGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLRGFYQVDVGALDYFYDRLNDVVIEKPLEM 904 (1005) Q Consensus 825 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 904 (1005) |+++.+-+ +.++.|=|+|...+-+.+-.. +..| +.++|.+. T Consensus 1 M~~v~~si---------------------------------~nl~~GvSqQp~~~r~pgQ~~----~q~N--~~~d~v~G 41 (905) T protein:vir:78 1 MGAVLQKI---------------------------------PNLLGGVSQQPDPVKLPGQVR----EAEN--VYLDPTFG 41 (905) T ss_pred Cccceecc---------------------------------hhhhCceeecchhhcCCcchh----hhhc--cccccccc Confidence 22222211 112234456655555544432 3333 56777888 Q ss_pred eeccccccccccCcccceeeeeeeeeEeCCCcEEEEEeCeeeCccCCceeCCcccccCCcc-eEEEEEeCCc-eEEEEEE Q lcl|NC_018835. 905 RLERTGLDFDNVTNEWNQKHINRFRPQTTGSGTYTFEAGGSQFSNEYGHPHTSKTYTIGVD-RHVSVRLNHP-YLFYNVI 982 (1005) Q Consensus 905 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~ 982 (1005) +..|+|..|= |+|... +.....|+.-.. .+.+ |++.++.+|. +.-+||. T Consensus 42 l~kRp~~~~i--------~~l~~~-------------------~~~~~~~~~~~r--~~~e~y~~~~~~~g~~~~~i~v~ 92 (905) T protein:vir:78 42 CRKRPATKFV--------GELATN-------------------LPSDTRWFPIFR--DAGERYAVALYKDGSGNTQVRVW 92 (905) T ss_pred cccCchhhhh--------hhhcCC-------------------CCCCceEEEEEe--CCCceEEEEEeeCCCCCcceEEE Confidence 8889887662 211110 011222321000 0111 2334444442 2234554 Q ss_pred EcCCCceEEE--EeeeEEEeecCCC Q lcl|NC_018835. 983 DNDVNSNAAI--NGLTIEFAVGGRR 1005 (1005) Q Consensus 983 ~~~~~~~~~~--~~~~~~~~~~~~~ 1005 (1005) +...+..-.+ .+....|...+.| T Consensus 93 d~~~G~~~~V~~~~~~~~yl~~~~~ 117 (905) T protein:vir:78 93 DMQTGAERTVTPDATATAYLATTNL 117 (905) T ss_pred EccCCcEEEEecCCCccceeecCCC Confidence 4433322222 1222233333333 Done!