Query lcl|NC_010324.1_cdsid_YP_001671764.1 [gene=phi32_19] [protein=bacterial surface protein] [protein_id=YP_001671764.1] [location=16414..19431] Match_columns 1005 No_of_seqs 725 out of 2224 Neff 10.1 Searched_HMMs 1612 Date Thu Nov 7 14:06:58 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_19 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_19_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8837 Length: 513 # 100.0 2.6E-88 1.6E-91 500.8 38.7 509 406-1005 1-513 (513) 2 protein:vir:108312 Length: 458 98.4 6.4E-07 4E-10 54.5 27.9 433 470-1000 1-458 (458) 3 protein:vir:100960 Length: 472 98.3 1.8E-06 1.1E-09 52.1 28.1 426 458-1000 1-472 (472) 4 protein:vir:9268 Length: 472 # 98.2 2.2E-06 1.4E-09 51.6 29.2 410 458-1000 1-472 (472) 5 protein:vir:177 Length: 472 # 98.2 3E-06 1.8E-09 50.9 30.8 432 470-1000 1-472 (472) 6 protein:vir:118 Length: 449 # 98.1 1.2E-06 7.7E-10 53.0 16.0 202 1-227 236-449 (449) 7 protein:vir:2109 Length: 472 # 98.0 6.8E-06 4.2E-09 48.9 29.0 425 458-1000 1-472 (472) 8 protein:vir:105428 Length: 472 97.9 1.1E-05 6.9E-09 47.7 32.1 431 470-1000 1-472 (472) 9 protein:vir:3529 Length: 477 # 97.2 0.00012 7.4E-08 42.1 31.2 426 457-1005 1-475 (477) 10 protein:vir:5202 Length: 448 # 96.6 0.00047 2.9E-07 38.8 14.6 206 1-227 236-448 (448) 11 protein:vir:118 Length: 449 # 96.4 0.00031 1.9E-07 39.8 12.6 127 1-138 318-449 (449) 12 protein:vir:99075 Length: 392 95.4 0.0022 1.4E-06 35.2 14.6 175 1-206 211-392 (392) 13 protein:vir:105525 Length: 472 95.0 0.0029 1.8E-06 34.5 26.8 430 492-998 1-472 (472) 14 protein:vir:5202 Length: 448 # 94.1 0.0025 1.5E-06 34.9 10.2 127 1-154 318-448 (448) 15 protein:vir:99075 Length: 392 91.8 0.014 8.7E-06 30.7 12.1 112 1-116 257-392 (392) 16 protein:vir:78703 Length: 905 28.3 0.8 0.0005 21.1 3.5 112 825-1005 1-117 (905) 17 protein:vir:95475 Length: 771 27.2 1.9 0.0012 19.1 35.6 665 291-1005 1-771 (771) 18 protein:vir:100022 Length: 976 26.8 1.4 0.00085 19.8 4.5 115 825-1005 1-124 (976) No 1 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=100.00 E-value=2.6e-88 Score=500.80 Aligned_cols=509 Identities=24% Similarity=0.370 Sum_probs=322.7 Q ss_pred ccccceeeeccccceeccccceeeeEEeeccccceeeeeeEeeeeeeccccccccccceeecCCccceEEEEecCCCceE Q lcl|NC_010324. 406 ESEETVYFAEPTSGIDTSGMYEGNNFYDYSNVNDIEGFARASLLATPLSSVTLDIVSASLDVGEEIVITATASPEGEYSY 485 (1005) Q Consensus 406 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~t~~~~~~t~~~g~t~t~tat~t~~~~~tv 485 (1005) -...-.......+.........- .... ................ .+.-..+.++....--... T Consensus 1 ~~~~~~~~~~~~g~~~d~~p~~l-------p~~a---~s~~~N~~~~~~~~~~--------~~g~~pv~a~~~~~~~g~~ 62 (513) T protein:vir:88 1 MALERQEVKNPTGIVTDIAPADL-------PLDK---WSFGNNVRFKNGKAQK--------ALGHSPIFDTAQAPILDMF 62 (513) T ss_pred CCcCChhhcccccceeccChhhc-------CCCc---ceeeeeeeEecceeee--------cCccceeeecCCCCceeee Confidence 00000000000000000000000 0000 0000000000000000 0000000000000000000 Q ss_pred EEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeeccccccceeccceeccceeeccccccccceeE Q lcl|NC_010324. 486 QWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYF 565 (1005) Q Consensus 486 t~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~~~~~~~~~~vt~t~~~~~~~t~~~~~~t 565 (1005) .+..++......... ... ......+ .+ .+.. ....................+.+.+...+.+-....... T Consensus 63 ~~~~~g~~~~~~~~~--~~~--~~~~~~t--~~-dvs~---~~~~~~~~~~w~~~~f~~~i~a~ng~~~~q~~~~~s~~f 132 (513) T protein:vir:88 63 PFIRNNIPYWLLCSE--KRL--YLADGTT--II-DVSP---GPYSASVTNRWSVGSFNGVIFANDGVNPPHHLPPTESVF 132 (513) T ss_pred eeecCCCeEEEEeec--eEE--EEecCce--ee-eccc---cceeecccCceeeeeecCEEEEEcCCCcceEEcCCCcee Confidence 111100000000000 000 0000000 00 0000 000000000000001111111111111111110000000 Q ss_pred EecCCceeEEEEecCCCCcceeEEeeeccccccCceEEEEecCCceeeEEecCceeEEEEeecccCCcceeeeeeeeece Q lcl|NC_010324. 566 VDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTNYPLRLRWSNFANENKAPTLWDDFAYDR 645 (1005) Q Consensus 566 ~~~~~t~t~t~t~~~~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t~t~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 645 (1005) . ..++... .+....+....+.++..+..+.. ...+..+.++...+.+..+..|+. T Consensus 133 ~-----------dl~g~p~----~~~a~~i~v~~~flv~~~~t~~~-----~~~PnrV~wS~~~D~~~~P~~W~~----- 187 (513) T protein:vir:88 133 R-----------VLPNFPA----NTTFRRLKSFKNFLIGLNVTSNS-----IEMPQMVWWSTSADAGGVPASWDP----- 187 (513) T ss_pred e-----------eccCCCc----ccceEEEEEEeeEEEEeecccCc-----CCCCceEEEecccCCccccccccc----- Confidence 1 1111111 11122222344445544443331 145677778777766666665542 Q ss_pred eecccccceeeecccccccccccceeeEecccceeeccccceeEEEEcCCceEEEEEECCCCceeeeeecCCcceeecCc Q lcl|NC_010324. 646 VVSSDLASNIVGQTQALENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPE 725 (1005) Q Consensus 646 ~~~~~~t~~~~~~~~~~~t~~~~~~~~t~t~~~iv~g~~~g~~tii~t~~~~~~~t~tggt~~~~~~~~~~~~~~~~~~~ 725 (1005) ...+..+++.++.+.++.++.+.+++++.++|.++.++.|+++|+. .+|+|+|++.++||++|+ T Consensus 188 ---------------t~~t~~a~~~~l~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~~-~if~~~~i~~~~G~~~p~ 251 (513) T protein:vir:88 188 ---------------TDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGL-YIFQFQQLFNDVGILGPN 251 (513) T ss_pred ---------------ccccCcccccccCCCccceeeeeecccceEEEecccEEEEEecCCC-ceEEEEeecccccccCCc Confidence 2233445678888889999999999999999999999999999765 599999999999999999 Q ss_pred eEEEECCEEEEEeCCCEEEECCCccccccchhHHHHHHhhcCccccceEEEEEcCCCCEEEEEEecCCCCcCCCCCCeEE Q lcl|NC_010324. 726 CVVEVEGSHFVVTQNDVILHNGATKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVLYVGPGEPKESFACTKAA 805 (1005) Q Consensus 726 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 805 (1005) ||++++++||||+++|||+|||.++++|++||||||||+++|+.|++|+++++||++|||||+||+.+++ ...+|+|+| T Consensus 252 SI~~~~~~~ffls~~Gf~~~~G~~~~~Ig~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~-~~~~~~~~l 330 (513) T protein:vir:88 252 CAIEFDGNHFVVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSE-PGKHCDRAI 330 (513) T ss_pred eeEEECCeEEEEeCCceEEecCceeeecccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCC-CCcccceEE Confidence 9999999999999999999999999999999999999999999999999999999999999999998765 256899999 Q ss_pred EEEeccCeeeEecccce--eeeeecccccccCCceecccccccCCCcchhhccccccccCccceeEEEecCC-CeeEEEe Q lcl|NC_010324. 806 VWNYEFDTWSFRTIPYA--QCIGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFL-KGFYQVD 882 (1005) Q Consensus 806 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 882 (1005) ||||++++||++++|+. .++|+.+++. +.+| +.++++||+++. .|.+++.++.+.+++..+++ ..+|++| T Consensus 331 VYd~~~~~Ws~~~~p~~~~g~~g~~~~~~---~~~~-~~~~~~~d~~~~---~~~~~~~~~~~~sl~~~~~~~~~~~~fd 403 (513) T protein:vir:88 331 IWNWKENTWSIRDLPNVLSGAYGIIDPKT---SNLW-DDDSNPWDTDTS---VWGEGSYNPAKSSMIFTSFQDAKLFLFG 403 (513) T ss_pred EEEccCCeEEEEeccchhhcccccccccc---ccee-cccccccccchh---hhhccccccccceeEeeeccCCceeeec Confidence 99999999999999976 6678877655 4589 559999999886 66788889999999888755 5788887 Q ss_pred ccceeeecccceeEEEecceeeeecCccccccccCCccceeeeeeeeeEEecCceeEEEEeeecCCCCCceECCCeEEec Q lcl|NC_010324. 883 VGALDYFYDRLNDVVIEKPLEMRLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHPHTSKTYTI 962 (1005) Q Consensus 883 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 962 (1005) .+. +|.|..|+++ |||++++||+. +++|+|++++|++..+|+|+|.+|..+.+.++++|+++++|+| T Consensus 404 ~~~-~f~G~~lea~---------~~t~~~~~~~~---~~~~~i~~v~~~~t~~g~~t~~vg~~~~~~~~~~~s~~~~~~~ 470 (513) T protein:vir:88 404 ETS-TFSGQSFTST---------LERSDIYLGDD---RMMKTVSAVIPHITGNGVCNIWVGNAQVQGSGIRWKGPYPYRI 470 (513) T ss_pred ccc-cccCCceEEE---------EEecCccccCc---hhheeeeeeeeeeecceEEEEEEeeeccCccccccccceeeec Confidence 665 5666655555 79999999755 6999999999999999999999999999999999999999999 Q ss_pred CCCeeEeeecCCceEEEEEEEccCCCcEEEEeEEEEeec-cCCC Q lcl|NC_010324. 963 GVDRHVSVRLNHPYLFYNVIDNDVNSNAAINGLTIEFAV-GGRR 1005 (1005) Q Consensus 963 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 1005 (1005) ++++|+|+|++|||+++|| ++|++++|+|+|||+|+.+ +|+| T Consensus 471 ~~~~~~~~r~~gRy~~~ri-~i~~~~~w~~~G~~ve~~~~~g~R 513 (513) T protein:vir:88 471 GQDYKIDTKHVGRYIALKF-DFASAGDWYFNGYTLEMAPKAGMR 513 (513) T ss_pred ccCceEEeccCCceEEEEE-EccCCCceEEeeEEEEEecCCCCC Confidence 9999999999999999995 9999999999999999999 6999 No 2 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=98.44 E-value=6.4e-07 Score=54.52 Aligned_cols=433 Identities=12% Similarity=0.032 Sum_probs=168.9 Q ss_pred ccceEEEEecCCCceEEEeec---CceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeeccccccceec--- Q lcl|NC_010324. 470 EIVITATASPEGEYSYQWSVD---KTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVIS--- 543 (1005) Q Consensus 470 t~t~tat~t~~~~~tvt~t~s---~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~~~~~~~--- 543 (1005) -.... .|.. .+..... ..-.+...... ...+......... .+... .........-. T Consensus 1 m~~~~---ip~g--sy~a~~~~~daq~~VN~yp~~--------~e~g~ss~~l~~t-PGl~~----f~~~~~~~~~g~~~ 62 (458) T protein:vir:10 1 MVQRQ---IPLV--ATTAEGDVSGQEILVNVYPRK--------SDGGKYPFTLRHT-PGLAF----FCELPTFPVMAMHQ 62 (458) T ss_pred Cceee---ecee--eeecccccccceeeeeeeeec--------ccccccccceEec-CCcee----eecCCCCceeeEEe Confidence 00000 0000 0000000 00000000000 0000000000000 00000 00000000000 Q ss_pred --cceeccceeeccccccccceeEEecCCceeEEEEecCCCCcceeEEeeeccccccCceEEEEecCCceeeEEecCcee Q lcl|NC_010324. 544 --NCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTNYPL 621 (1005) Q Consensus 544 --~~~vt~t~~~~~~~t~~~~~~t~~~~~t~t~t~t~~~~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t~t~~~~~~~~ 621 (1005) .....+.+..... +..+...+....+.. ..-+.... ++..+.+-.+.. .-.. +..+ T Consensus 63 ~~g~ly~v~g~~LY~---------V~~~~~~~~iG~i~g----sg~VsMa~------ng~q~vi~~G~~-gY~y--d~at 120 (458) T protein:vir:10 63 NGSRAFAVTPRDMYE---------ISKDGTYKRLGSVDF----KGRVVMED------NGKQIVMVDGEK-GYYY--DSET 120 (458) T ss_pred cCCEEEEeeCceEEE---------EeCCceEEEEecccC----ceeEEEee------CCcEEEEEECCe-EEEE--eecc Confidence 0000000000000 000000000000000 00000000 000000100100 0000 0000 Q ss_pred EEEEeecccCCcce-eeeeeeeeceeecccccceeeecccccccc-cccceeeEecccceeeccccceeEEEEcCCceEE Q lcl|NC_010324. 622 RLRWSNFANENKAP-TLWDDFAYDRVVSSDLASNIVGQTQALENG-YAGYIDLADSNGSLIDILPLKDYLFVYTEFETYI 699 (1005) Q Consensus 622 ~~~~~~~~~~~~~~-~t~~~~~~~~~~~~~~t~~~~~~~~~~~t~-~~~~~~~t~t~~~iv~g~~~g~~tii~t~~~~~~ 699 (1005) . ......+.+..+ ....-..+--+-........-......... .-.+......+..|+.-....+..+.|.++.+.+ T Consensus 121 ~-~~~~i~d~~~~~~~~v~~~dGy~V~~~~g~~~~~is~L~d~s~d~l~fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEv 199 (458) T protein:vir:10 121 E-IVQEIKAEGFYPASTVTYQDGYFIFDRKGTGQFFISELLDVAFDPLDFATAEGQPDPLLAVLSDHREVFMFGQETIEV 199 (458) T ss_pred c-EEEeccCccccCcceEEEeCcEEEEEeeCCCEEEEEecCcceeCcceeeeecCCCCceEEEEeeccEEEEEeccceEE Confidence 0 000000000000 000000000000000000000000000111 1123334445667788888899999999999999 Q ss_pred EEEECCCCceeeeee---cCCcceeecCceEEEECCEEEEEeCC-CEEEECCCccccccchhHHHHHHhhcCccccceEE Q lcl|NC_010324. 700 GSPTNNTYQPLMFKK---LFNDSGILAPECVVEVEGSHFVVTQN-DVILHNGATKKSIASNRVKNMLINEVCLVNPLATR 775 (1005) Q Consensus 700 ~t~tggt~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 775 (1005) ..-+|++. |-|+. .+-..||+++.|+..+++.+|||+|+ -||+-+|-+++.|....|++-+ ++++..- ..+| T Consensus 200 w~ntG~a~--fpy~r~~ga~i~~Gcaa~~sv~~~~~t~~~l~~d~~Vy~l~g~~~~rIST~aIE~~i-~sy~~~d-a~a~ 275 (458) T protein:vir:10 200 WYNSGAAD--FPFERNQGAFIEKGIGAPYSVAKTNNTVYFIGSDLMIYQITGYTPVRISTHAVEQTL-KGVNLSD-AFAY 275 (458) T ss_pred EEecCCCC--cceeecccceeeecccCcchhhhhCceEEEEcCCeEEEEecCceeEEeeCHHHHHHH-hcCChhh-eEEE Confidence 99999876 55655 34488999999999999999999988 5689999999999999999988 4444333 2333 Q ss_pred EEEcCCCCEEEEEEecCCCCcCCCCCCeEEEEEeccCeeeEecccceeeeeecccccccCCceecccccccCCCcchhhc Q lcl|NC_010324. 776 VHLHQDKKEVWVLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPIWSDFQEITWDDPSIKEL 855 (1005) Q Consensus 776 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 855 (1005) .+..+-.+.++..||+ .++-.|||-..+.|..|. -+.++ +| T Consensus 276 t~~~eGH~fy~LtfP~---------a~~Tw~yD~~t~~Wher~------Sg~~~----------------~~-------- 316 (458) T protein:vir:10 276 TYQSEGHLFYVLTIPG---------KNLTWCYDISSGSWHVRQ------SYQFD----------------RH-------- 316 (458) T ss_pred EEEecCeEEEEEECCC---------CCceeEEecccccceeec------cCCCC----------------ce-------- Confidence 3333333345566775 267789999999999984 11111 11 Q ss_pred cccccccCccceeEEEecCCC-eeEEEeccceeeecccceeEEEecceee---eecCccccccccCCcc-----ceeeee Q lcl|NC_010324. 856 VWRKDATNFRQRVTIVGSFLK-GFYQVDVGALDYFYDRLNDVVIEKPLEM---RLERTGIDFDNVTNEW-----NQKHIN 926 (1005) Q Consensus 856 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~-----~~~~~~ 926 (1005) +-..-.+--...|.||++. .+|++|-+..+.+|+.+.-.+.+.++.. +|....++||-++--. .-+-.- T Consensus 317 --Ra~~~v~~~g~~~vGD~~ng~ly~ld~~~~td~g~~i~~~~~~p~~~~~~~rl~~~~~el~~~tGvg~~~~~~~~p~~ 394 (458) T protein:vir:10 317 --VSNNSIYFDQKTLVGDFQNGRIYIMADNYYTDDGDPVVREFILPVVNNGREFLTVDSLELDLSSGVGLTVGQGSDPEL 394 (458) T ss_pred --EEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeeeeeeccceeCCCCeEEEEEEEEEEecceeeeeCCCCCceE Confidence 1111122223457777444 4788888877778877766655544322 2222355553322100 000000 Q ss_pred eeeeEEecCceeEEEEee-ecCCCCCceECC-CeEEecCCCeeEeeecCCceEEEEEEEccCCCcEEEEeEEEEee Q lcl|NC_010324. 927 RFRPQTTGSGTYIFEAGG-SQFSNEYGHPHT-SKTYTIGVDRHVSVRLNHPYLFYNVIDNDVNSNAAINGLTIEFA 1000 (1005) Q Consensus 927 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1000 (1005) ++|.-.|+ =+++.-.- .++.+.+.++.. +.-+|+|+-++ |= +|+ ..-+--+-.|.|.-+++- T Consensus 395 ~l~~S~d~--g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~-------rv--f~v-~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 395 RVYFSKDN--GNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQ-------FT--FKV-EISDPIPVDIGGAWVEVR 458 (458) T ss_pred EEEEeeCC--CcccchhHHHhhcCCcchhhhhhhhhhhccCcc-------eE--EEE-EEecchhhcceeeeEEeC Confidence 11222221 11111000 001111111111 22335555221 22 332 333334456888888754 No 3 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=98.28 E-value=1.8e-06 Score=52.12 Aligned_cols=426 Identities=12% Similarity=0.052 Sum_probs=159.0 Q ss_pred cccccceeecCCccceEEEEecCCCceEEEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeecccc Q lcl|NC_010324. 458 LDIVSASLDVGEEIVITATASPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~t~t~tat~t~~~~~tvt~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~ 537 (1005) .......+..|.. -....+..+.+---+ -.+.............. ..|-.-. +...+ T Consensus 1 m~~~~ipl~~g~~------~~~~~a~~~~~~pvn-~y~~~~~~~~ss~~Lr~-~pG~~~~-a~~~G-------------- 57 (472) T protein:vir:10 1 MPIQQLPMMKGMG------KDFKNADYIDYLPIN-MLATPKEVLNSSGYLRS-FPGIAKR-NDVNG-------------- 57 (472) T ss_pred Cceeecccccccc------cCCCcCcceeeeeec-cccccccccccccceee-cccceee-cCCCC-------------- Confidence 0000000000000 000000000000000 00000000000000000 0000000 00000 Q ss_pred ccceeccceeccceeecccc--ccccceeEEecCCceeEEEEecC-CCCcceeEEeeeccccccCceEEEEecCCcee-e Q lcl|NC_010324. 538 YHAVISNCAVATTHYETPQV--KEFESEYFVDLPGWGEQTVVDND-GNPSVKKFNWKCERVRSFNNRLFALNMREANA-S 613 (1005) Q Consensus 538 ~~~~~~~~~vt~t~~~~~~~--t~~~~~~t~~~~~t~t~t~t~~~-~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t~-t 613 (1005) . .-+. ..-.... ..+.+.....+...- ..+...-+... .+.....+-...... - T Consensus 58 ---~------------~RG~~~~~~~~~l-y~V~G~~Ly~v~~~iG~i~gsgrVsMa------~n~~~~~v~~~~~~~~Y 115 (472) T protein:vir:10 58 ---V------------SRGVEYNTAQNAV-YRVCGGKLYKGEAVVGDVAGSGRVSMA------HGRTSQAVGVNGQLIEY 115 (472) T ss_pred ---c------------ccceeeeeeCCeE-EEEeCcceEEEEeeEeeccCcccEEEe------eCCeEEEEEECCceeEE Confidence 0 0000 0000000 011111111100000 00000000000 000000110000000 0 Q ss_pred EEecCceeEEEEeecccCCccee------eeeeeeeceeecccccceeeecccccccccccc---eeeEecccceeeccc Q lcl|NC_010324. 614 GVTTNYPLRLRWSNFANENKAPT------LWDDFAYDRVVSSDLASNIVGQTQALENGYAGY---IDLADSNGSLIDILP 684 (1005) Q Consensus 614 ~~~~~~~~~~~~~~~~~~~~~~~------t~~~~~~~~~~~~~~t~~~~~~~~~~~t~~~~~---~~~t~t~~~iv~g~~ 684 (1005) ........... -+........ ...-..+--+-....+...-............+ -.....+..|+.-.. T Consensus 116 ~~~~~~~t~~~--~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~ 193 (472) T protein:vir:10 116 RYDGAVKTVSN--WPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGS 193 (472) T ss_pred EEecchhhhhc--ccCccccccccccceeEEEEecceEEEccCCCceEEEeccCCccccccccccccccCCCCceEEEEe Confidence 00000000000 0000000000 000000000000111101110000111111111 223344566777788 Q ss_pred cceeEEEEcCCceEEEEEECCCC-ceeeeeecC---CcceeecCceEEEECCEEEEEeCCC-----EEEECCCccccccc Q lcl|NC_010324. 685 LKDYLFVYTEFETYIGSPTNNTY-QPLMFKKLF---NDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIAS 755 (1005) Q Consensus 685 ~g~~tii~t~~~~~~~t~tggt~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 755 (1005) ..+..++|.++.+.+..-+|++. .-|=|+..+ -..||+++.||..+++.+|||+|+. ||+.+|-+++.|.. T Consensus 194 ~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST 273 (472) T protein:vir:10 194 WRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIAT 273 (472) T ss_pred eccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecC Confidence 89999999999999999999873 124455444 6899999999999999999999997 89999999999999 Q ss_pred hhHHHHHHhhcCccccceEEE-EEcCCCCE-EEEEEecCCCCcCCCCCCeEEEEEeccC----eeeEecccceeeeeecc Q lcl|NC_010324. 756 NRVKNMLINEVCLVNPLATRV-HLHQDKKE-VWVLYVGPGEPKESFACTKAAVWNYEFD----TWSFRTIPYAQCIGLVD 829 (1005) Q Consensus 756 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 829 (1005) ..|++-+ +...+.-+...++ .+.-+.|+ ++.+|| +|.+|||-.++ ||+.+. -+++. T Consensus 274 ~aIE~~i-~~y~~~e~~~A~~~t~~~~GH~fy~LtfP-----------~~Tw~yD~at~~w~erw~~~~------~g~~~ 335 (472) T protein:vir:10 274 ASIEKII-RSYTAEELATGVMETLRFDSHELLIIHLP-----------RHVLVYDASSSQNGPQWCVLK------TGLYD 335 (472) T ss_pred HHHHHHH-HhcCCccccceEEEEEEeCCeEEEEEEcC-----------CeeEEEEcccCcccceeeeec------CCCcc Confidence 9999988 6665444333332 23334444 444555 68999998888 455543 11100 Q ss_pred cccccCCceecccccccCCCcchhhccccccccCccceeEEEec-CCCeeEEEeccceeeecccceeEEEecceeeeecC Q lcl|NC_010324. 830 PPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGS-FLKGFYQVDVGALDYFYDRLNDVVIEKPLEMRLER 908 (1005) Q Consensus 830 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 908 (1005) ..|+--.--|.-.+.|.|+ ....+|++|-+.-+.+|+...-....-++..- .+ T Consensus 336 -------------------------~~~R~~~~~~~~g~~ivGD~~nG~ly~ld~~~~t~~g~~~~~~~~~p~l~~d-n~ 389 (472) T protein:vir:10 336 -------------------------DVYRAVDFMYEGNQITCGDKSEALTGQLQFDISSQYGLQQEHLLFTPLFKAD-NA 389 (472) T ss_pred -------------------------cceeEEEEEeeCCeEEEEEcCCCeEEEEecccCCCCCCcccceEEcccccCC-CC Confidence 0122222222223346666 34456788887767777776655544443110 11 Q ss_pred cccccccc--CCccceeeeeeeeeEEecCceeEEEEeeecCCCCCceECCCeEEecCCCeeEeeecCCce----E----- Q lcl|NC_010324. 909 TGIDFDNV--TNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPY----L----- 977 (1005) Q Consensus 909 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~----- 977 (1005) +=+||+.- +--.-.. -++|++...+|. +|. ...++.-.+.||| + T Consensus 390 R~~d~eve~~~Gv~~~~--d~v~L~wSddG~---------------~~~--------~~~~~~~g~~g~~~tr~~~~RlG 444 (472) T protein:vir:10 390 RCFDLEVESSTGVAQYA--DRLFLSATTDGI---------------NYG--------REQMIEQNEPFVYDKRVIWKRVG 444 (472) T ss_pred EEEEEeeeccCCCCCcC--cEEEEEeecccc---------------ccc--------cceeeccCCccchhcceeeeeee Confidence 11111100 0000000 134444333322 222 1223333444444 1 Q ss_pred ------EEEEEEccCCCcEEEEeEEEEee Q lcl|NC_010324. 978 ------FYNVIDNDVNSNAAINGLTIEFA 1000 (1005) Q Consensus 978 ------~~~~~~~~~~~~~~~~~~~~~~~ 1000 (1005) .+|| ++-.-.+-.++|..+.+- T Consensus 445 ~~r~~v~f~~-r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:10 445 RIRRLIGFKL-RVITKSPVTLSGCQIRLE 472 (472) T ss_pred ecccceeEEE-EEEecCcceeeeeEEeeC Confidence 1222 333334444555544433 No 4 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=98.24 E-value=2.2e-06 Score=51.60 Aligned_cols=410 Identities=11% Similarity=0.045 Sum_probs=142.7 Q ss_pred cccccceeecCCccceEEEEecCCCceEEEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeecccc Q lcl|NC_010324. 458 LDIVSASLDVGEEIVITATASPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~t~t~tat~t~~~~~tvt~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~ 537 (1005) .......+..|.. -.+..+....+---+ -.+.............. ..|-.-. +...+ T Consensus 1 m~~~~ipl~~g~~------~~~~~a~~~~~~pvn-~y~~~~~~~~ss~~Lr~-~pG~~~~-a~~~G-------------- 57 (472) T protein:vir:92 1 MPIQQLPMMKGMG------KDFKNADYIDYLPIN-MLATPKEVLDSSGYLRS-FPGIAKR-NDVNG-------------- 57 (472) T ss_pred Cceeecccccccc------ccCccCcceeeeecc-cccccccccccccceee-cccceee-cCCCC-------------- Confidence 0000000000000 000000000000000 00000000000000000 0000000 00000 Q ss_pred ccceeccceeccceeecccc--ccccceeEEecCCceeEEEEecC-CCCcceeEEeeeccccccCceEEEEecCCcee-e Q lcl|NC_010324. 538 YHAVISNCAVATTHYETPQV--KEFESEYFVDLPGWGEQTVVDND-GNPSVKKFNWKCERVRSFNNRLFALNMREANA-S 613 (1005) Q Consensus 538 ~~~~~~~~~vt~t~~~~~~~--t~~~~~~t~~~~~t~t~t~t~~~-~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t~-t 613 (1005) . .-+. ..-.... ..+.+.....+...- ..+...-+... .+.....+-...... - T Consensus 58 ---~------------~RG~~~~~~~~~l-y~V~G~~Ly~v~~~iG~i~gsgrVsMa------~n~~~~av~~~~~~~~Y 115 (472) T protein:vir:92 58 ---V------------SRGVEYNTAQNAV-YRVCGGKLYKGEAVVGDVAGSGRVSMA------HGRTSQAVGVNGQLIEY 115 (472) T ss_pred ---c------------ccceeeeeeCCeE-EEEeCcceEEEEeeEeeccCcccEEEe------cCCeEEEEEECCceeEE Confidence 0 0000 0000000 011111111100000 00000000000 000001110000000 0 Q ss_pred EEecCceeEEEEeecccCCccee------eeeeeeeceeecccccceeeecccccccccccc---eeeEecccceeeccc Q lcl|NC_010324. 614 GVTTNYPLRLRWSNFANENKAPT------LWDDFAYDRVVSSDLASNIVGQTQALENGYAGY---IDLADSNGSLIDILP 684 (1005) Q Consensus 614 ~~~~~~~~~~~~~~~~~~~~~~~------t~~~~~~~~~~~~~~t~~~~~~~~~~~t~~~~~---~~~t~t~~~iv~g~~ 684 (1005) ........... -+........ ...-..+--+-....+...-............+ -.....+..|+.-.. T Consensus 116 ~~~~~~~t~~~--~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~ 193 (472) T protein:vir:92 116 RYDGAVKTVSN--WPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGS 193 (472) T ss_pred EEecchhhhhc--ccCccccccccccceeEEEEecceEEEccCCCceEEEeccCCccccccccccccccCCCCceEEEEe Confidence 00000000000 0000000000 000000000000111101110000111111111 223344566777788 Q ss_pred cceeEEEEcCCceEEEEEECCCC-ceeeeeecC---CcceeecCceEEEECCEEEEEeCCC-----EEEECCCccccccc Q lcl|NC_010324. 685 LKDYLFVYTEFETYIGSPTNNTY-QPLMFKKLF---NDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIAS 755 (1005) Q Consensus 685 ~g~~tii~t~~~~~~~t~tggt~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 755 (1005) ..+..++|.++.+.+..-+|++. .-|=|+..+ -..||+++.||..+++.+|||+|+. ||+.+|-+++.|.. T Consensus 194 ~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST 273 (472) T protein:vir:92 194 WRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIAT 273 (472) T ss_pred eccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecC Confidence 89999999999999999999873 124455443 6899999999999999999999997 89999999999999 Q ss_pred hhHHHHHHhhcCccccceEEE-EEcCCCCEEE-EEEecCCCCcCCCCCCeEEEEEeccC----eeeEeccc-----ce-e Q lcl|NC_010324. 756 NRVKNMLINEVCLVNPLATRV-HLHQDKKEVW-VLYVGPGEPKESFACTKAAVWNYEFD----TWSFRTIP-----YA-Q 823 (1005) Q Consensus 756 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-----~~-~ 823 (1005) ..|++-+ +.++..-+...+. .+--+.|+.| .+|| +|.+|||-.++ +|+.+.-- .+ + T Consensus 274 ~aIE~~i-~~y~~~e~~~a~~~s~~~eGH~fy~LtfP-----------~~Tw~yD~at~~~~e~W~~~~sg~~~~~~R~~ 341 (472) T protein:vir:92 274 ASIEKII-RSYTADELATGVMEALRFDSHELLIIHLP-----------RHVLVYDASSSQNGPQWCVLKTGLYDDVYRAI 341 (472) T ss_pred HHHHHHH-HhcCcchhceeeEEEEEecCeeEEEEEcC-----------CceEEEEcccCcCCceeeeecCCCcccceeEE Confidence 8888865 4444333333222 2333444444 3444 78999998888 78877631 11 1 Q ss_pred eeeecccccccCCceecccccccCCCcchhhccccccccCccceeEEEecC-CCeeEEEeccceeeecccceeEEE---- Q lcl|NC_010324. 824 CIGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSF-LKGFYQVDVGALDYFYDRLNDVVI---- 898 (1005) Q Consensus 824 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---- 898 (1005) ++-..+.+. |.|++ ...+|++|-+..+++|....-... T Consensus 342 ~~~~~~g~~-------------------------------------ivGD~~nG~ly~l~~~~~t~~~~~~~~~~~~P~~ 384 (472) T protein:vir:92 342 DFMYEGNQI-------------------------------------TCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIF 384 (472) T ss_pred EEEeeCCeE-------------------------------------EEEEcCCCeEEEEeccccccCCCcceEEEEeceE Confidence 222223333 33331 122333332222222211100000 Q ss_pred --------ecceeeeecCccccccccCCccceeeeeeeeeEEecCceeEEEEeeecCCCCCceECCCeEEecCCCeeEee Q lcl|NC_010324. 899 --------EKPLEMRLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSV 970 (1005) Q Consensus 899 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 970 (1005) .-.+|.. ...| |++| ++|++...+|. +|. ...++.- T Consensus 385 ~~dn~R~~d~eve~~-~Gv~-----q~~d-------~v~L~wSddG~---------------~~~--------~~~~~~~ 428 (472) T protein:vir:92 385 KADNARCFDLEVESS-TGVA-----QYAD-------RLFLSATTDGI---------------NYG--------REQMIEQ 428 (472) T ss_pred ecCCCEEEEEeeecc-CCCC-----CcCc-------eEEEEeecccc---------------ccc--------cceeecc Confidence 0000000 1111 1111 34444443332 111 1223333 Q ss_pred ecCCce----E-----------EEEEEEccCCCcEEEEeEEEEee Q lcl|NC_010324. 971 RLNHPY----L-----------FYNVIDNDVNSNAAINGLTIEFA 1000 (1005) Q Consensus 971 ~~~~~~----~-----------~~~~~~~~~~~~~~~~~~~~~~~ 1000 (1005) .+.||| + .+|| ++-.-.+-.++|..+.+- T Consensus 429 g~~g~~~tr~~~~RlG~~r~~v~f~~-r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:92 429 NEPFVYDKRVLWKRVGRIRRLIGFKL-RVITKSPVTLSGCQIRLE 472 (472) T ss_pred CCccchhcceeeeeeeecccceeEEE-EEEecCcceeeeeEEeeC Confidence 444444 1 1232 333334445555544433 No 5 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=98.18 E-value=3e-06 Score=50.90 Aligned_cols=432 Identities=12% Similarity=0.082 Sum_probs=164.9 Q ss_pred ccceEEEEecCCCceEEEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeeccccccceeccceecc Q lcl|NC_010324. 470 EIVITATASPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVAT 549 (1005) Q Consensus 470 t~t~tat~t~~~~~tvt~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~~~~~~~~~~vt~ 549 (1005) -....+-. ..+....+-.. ..+.... .+.-......+...... ...+.- .... .. T Consensus 1 m~~~~~Pl-~~G~~~~~~~~---d~~~~~p---VN~~a~~~~~~~s~~~l-------------~~tPGl-~~~a----~v 55 (472) T protein:vir:17 1 MPIQQLPL-MKGVGKDFRNA---DYIDYLP---VNMLATPKEILNSSGYL-------------RSFPGI-AKRS----DV 55 (472) T ss_pred CCeeeeee-ccCceeecccc---chhheee---eeeeeeccCCCccccee-------------ecCCCc-eeec----cC Confidence 00000000 00000000000 0000000 00000000000000000 000000 0000 00 Q ss_pred ceeeccc--cccccceeEEecCCceeEEEEecCCCCcceeEEeeeccccccCceEEEEecCCce-eeEEecCceeEEEEe Q lcl|NC_010324. 550 THYETPQ--VKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREAN-ASGVTTNYPLRLRWS 626 (1005) Q Consensus 550 t~~~~~~--~t~~~~~~t~~~~~t~t~t~t~~~~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t-~t~~~~~~~~~~~~~ 626 (1005) .+.. -+ ....+...- .+.+....-+... ..++....-.....+.....+-..... .-............ T Consensus 56 ~G~~-RG~~~~~~~g~lY-~V~G~~LY~v~~~-----iGsiag~grVsMa~n~~~~av~~~g~~~~Y~y~~~v~t~~~~- 127 (472) T protein:vir:17 56 NGVS-RGVEYNMAQNAVY-RVCGGKLYKGESE-----VGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNW- 127 (472) T ss_pred Cccc-cceEEEeeCCeEE-EEecceEeeeecc-----eecccCcccEEEecCCcEEEEEECCceeEEEeeccchhhhcc- Confidence 0000 00 000000000 0111111100000 000000000000000000111110000 00000000000000 Q ss_pred ecccCCcc------eeeeeeeeeceeecccccceeeeccccccccccccee---eEecccceeeccccceeEEEEcCCce Q lcl|NC_010324. 627 NFANENKA------PTLWDDFAYDRVVSSDLASNIVGQTQALENGYAGYID---LADSNGSLIDILPLKDYLFVYTEFET 697 (1005) Q Consensus 627 ~~~~~~~~------~~t~~~~~~~~~~~~~~t~~~~~~~~~~~t~~~~~~~---~t~t~~~iv~g~~~g~~tii~t~~~~ 697 (1005) +...... .....-..+--+-....+................+.. ....+..|+.-....+..++|.++.+ T Consensus 128 -~~d~~~~~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~Ti 206 (472) T protein:vir:17 128 -PTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTI 206 (472) T ss_pred -ccccccccccccceeeeeeecceEEEeccCcceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccce Confidence 0000000 0000000000000111111110110011111112222 34456677888888999999999999 Q ss_pred EEEEEECCCCce-eeeeecC---CcceeecCceEEEECCEEEEEeCC-----CEEEECCCccccccchhHHHHHHhhcCc Q lcl|NC_010324. 698 YIGSPTNNTYQP-LMFKKLF---NDSGILAPECVVEVEGSHFVVTQN-----DVILHNGATKKSIASNRVKNMLINEVCL 768 (1005) Q Consensus 698 ~~~t~tggt~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 768 (1005) .+..-+|++... |=|+..+ -..||+++.||..+++.+|||+++ -||+.+|.+++.|-...|++-+ +.++. T Consensus 207 Evw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i-~~y~~ 285 (472) T protein:vir:17 207 EYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPISSASIEKIL-RSYTA 285 (472) T ss_pred EEEEeeCCCCCCcCceeecCcceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHH-HhcCC Confidence 999999987522 4555544 689999999999999999999996 3689999999999888888865 44554 Q ss_pred cccceEEE-EEcCCCCE-EEEEEecCCCCcCCCCCCeEEEEEeccCeeeEecccceeeeeecccccccCCceeccccccc Q lcl|NC_010324. 769 VNPLATRV-HLHQDKKE-VWVLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPIWSDFQEIT 846 (1005) Q Consensus 769 ~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 846 (1005) .-+...++ .+.-+.|+ ++.+|| +|-+|||-.++.|..|- ++ +. + T Consensus 286 ~e~~dA~~~t~~~~GH~fy~LtfP-----------~~Tw~yD~~t~~Wherw-----~~-------~~---------~-- 331 (472) T protein:vir:17 286 DELADGVMESLRFDAHELLIIHLP-----------RHVLVYDASSSANGPQW-----CV-------LK---------T-- 331 (472) T ss_pred ccccceeEEEEEeCCeEEEEEEcC-----------CceeEeecccccCceee-----ee-------ec---------C-- Confidence 33333222 23334444 444555 78999999999998762 00 00 0 Q ss_pred CCCcchhhccccccccCccceeEEEecCCC-eeEEEeccceeeecccceeEEEecceeeeecCccccccccCCccceeee Q lcl|NC_010324. 847 WDDPSIKELVWRKDATNFRQRVTIVGSFLK-GFYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGIDFDNVTNEWNQKHI 925 (1005) Q Consensus 847 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 925 (1005) ...-+-|+-..-.+--...|+||++. .+|++|-+..+.+|+.+.-...+ |+ |+.|.. |+ T Consensus 332 ----g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ld~~~~td~g~pi~~~~~~-p~--------~~~~~~-------RV 391 (472) T protein:vir:17 332 ----GLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFT-PL--------FKADNA-------RV 391 (472) T ss_pred ----CCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeEEEEec-ce--------eeCCCc-------eE Confidence 00001122233333334457777554 47888888888888777666555 32 444333 33 Q ss_pred eeeeeEEecCceeEEEEeeecCCCC--CceECCCeEEecCCCeeEeeecCCce----EEEEEE----------EccCCCc Q lcl|NC_010324. 926 NRFRPQTTGSGTYIFEAGGSQFSNE--YGHPHTSKTYTIGVDRHVSVRLNHPY----LFYNVI----------DNDVNSN 989 (1005) Q Consensus 926 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~----------~~~~~~~ 989 (1005) ..+ ++. +..|-.+...- ..+|+-+.+| +-...++-.+.||| +..|+= ++-+-.+ T Consensus 392 ~d~--el~------~~tG~~~~adp~~l~~~sDg~~~--g~~~~~~~~~~g~~~~R~~~~RlG~~r~~v~f~~~~~~~~~ 461 (472) T protein:vir:17 392 FDL--EVE------SSTGVAQYADRLFLSATTDGINY--GREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKSP 461 (472) T ss_pred EEE--EEe------eeCCcccCCCceEEEcccCCccc--chhhhhhhccCcccccceeeeeeeeccccceEEEEEeeccc Confidence 211 111 11122222211 2234421111 11123455666766 222321 2222233 Q ss_pred EEEEeEEEEee Q lcl|NC_010324. 990 AAINGLTIEFA 1000 (1005) Q Consensus 990 ~~~~~~~~~~~ 1000 (1005) -.|+|.-++.- T Consensus 462 ~~l~~a~~~~e 472 (472) T protein:vir:17 462 VTLSGCQIRIE 472 (472) T ss_pred ceeeeeEEEeC Confidence 34555444322 No 6 >protein:vir:118 Length: 449 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690641;swissprot:sw:q37888;genbank:gi:22855155;interpro:IPR003343;uniprot:Q37888;genbank:GeneID:955370 Probab=98.06 E-value=1.2e-06 Score=52.98 Aligned_cols=202 Identities=14% Similarity=0.103 Sum_probs=72.9 Q ss_pred CeEEEecccceeee-ecCCcceEeeccceecceeeeecccceEecCCceEEEEEEecCCccccccccccEEEEecCCeE- Q lcl|NC_010324. 1 MALYPIKSLGAVGV-IADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFISMPFDYYSAGNSF- 78 (1005) Q Consensus 1 ~a~y~v~s~~t~~~-~~~~~~tt~~~~~~t~~~~~~~~~~~~~~~g~t~t~t~t~~~~~~t~~~~t~~~~t~tss~~~~- 78 (1005) |++......--.-. ...........+.....-+.... .........+.... .....+..+.... T Consensus 236 ~~v~~~ad~~dl~~i~~~d~~~~ld~t~ls~afN~tav---------Da~~~~tvVddfAs-----t~~~a~~~sk~~~~ 301 (449) T protein:vir:11 236 MAVRTRSDIRDVHLFIDADLNAELDVDVLAKAFNMDRT---------TFLGNVTVIDGFAS-----TGLKAVMVDKDWFM 301 (449) T ss_pred eeeccccCccceEEEEccCcceecccccchhhhcccee---------eeeeeeeecCccCC-----ccceeeeeccceeE Confidence 22211111110000 00000111111101100000000 00000000000000 0000000000000 Q ss_pred --------EEEccCCcEEEeecCceEEEEEeecCcc-eeEEEEEEeeecccceeeccceeEEecCCceEEEEEEecCCCC Q lcl|NC_010324. 79 --------LVVGTDKKLYKLTDESLTDISRKVATVT-KKASASIKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADAN 149 (1005) Q Consensus 79 --------~tv~~~~~~~t~t~~~~~~~t~~~~~~t-~~~t~tvtv~~~vt~it~~~~t~t~~~g~t~tltatvt~~~~t 149 (1005) .......+.+..... ............ ..........+.++++++++...++..|++.+|+|++.|.++. T Consensus 302 ~~d~~~~~~~~~~~~G~y~n~~~-tvt~t~~~~~~~~~~a~~~~~~~~~VTsVsVtPss~tL~~G~T~qLTATV~psnat 380 (449) T protein:vir:11 302 VYDTLQKMETIRNPRGLYWNYYY-HVWQVLSASRFANAVAFVTGDDVPAVTQVIVSPAIASVKQGKSQAFTAYVRATDDK 380 (449) T ss_pred EeeeeeEEEEEEcCcceeeccce-EEEEEEecccccceeeeeeeeccceeeEEEeeccceeeecCceEEEEEEEecCCCC Confidence 011111111110000 001111111111 1112223334467889999999999999999999999999999 Q ss_pred CcceEEEEcCCc-ceEEecCCceeeccceeeeccccceeeeeecccceeeeeeeEeeeccccceeecccceeeeeecce Q lcl|NC_010324. 150 NTDLVWEVSNSS-YGSITVDPSDSKLATLTSFEKEGNLVVTISTANESVVAQIAVNIIDGDSGIFLSQDTVTIRKGGTT 227 (1005) Q Consensus 150 ~~tvt~tss~~~-~~tv~~~~~~~~~~~~tt~~~~gt~t~t~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~~~~~ 227 (1005) ++.|+|++++.. .+++..+|.. ++...|+++++................... ..+....... ..+... T Consensus 381 nk~VTWSsSd~s~~ATVda~G~V-------TAva~GTAtITAta~~~s~TaT~tvtV~~~-a~VtVtP~sa--~ggaqA 449 (449) T protein:vir:11 381 EHEVVWSVDGGSTGTSISSDGVL-------TVAANETNQLTVKATVDIGTADEPKPVVGE-AVVNVRPDSS--TGGAQA 449 (449) T ss_pred CceEEEEEeCCceEEEEcCCceE-------EEecCccEEEEEEEecCcEEEEEEeeecce-EEEEEeecCC--CCcccC Confidence 999999988876 4777776653 445556666666544433322222111100 0000000000 000000 No 7 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=98.02 E-value=6.8e-06 Score=48.92 Aligned_cols=425 Identities=9% Similarity=0.038 Sum_probs=145.0 Q ss_pred cccccceeecCCccceEEEEecCCCceEEEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeecccc Q lcl|NC_010324. 458 LDIVSASLDVGEEIVITATASPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPW 537 (1005) Q Consensus 458 ~~~~~~t~~~g~t~t~tat~t~~~~~tvt~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~ 537 (1005) .......+..|-. .....+..+.|---+ -.+.................-..-.++ .+.. T Consensus 1 m~~~q~Pl~~g~~------~~~~~~d~~~~~pVN-~~a~~~~~~~s~~~lr~tPG~~~~~~~--~g~~------------ 59 (472) T protein:vir:21 1 MPIQQLPMMKGMG------KDFKNADYIDYLPVN-MLATPKEILNSSGYLRSFPGITKRYDM--NGVS------------ 59 (472) T ss_pred CceEEeecccccc------ccccccceeeeeeee-eeeeccCCcccceeeeecCCcceeccC--CCce------------ Confidence 0000000000000 000000000000000 000000000000000000000000000 0000 Q ss_pred ccceeccceeccceeeccccccccceeEEecCCceeEEEEec-CCCCcceeEEeeeccccccCceEEEEecCCcee-eEE Q lcl|NC_010324. 538 YHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDN-DGNPSVKKFNWKCERVRSFNNRLFALNMREANA-SGV 615 (1005) Q Consensus 538 ~~~~~~~~~vt~t~~~~~~~t~~~~~~t~~~~~t~t~t~t~~-~~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t~-t~~ 615 (1005) -+ ....+..+. . ..+.+..-.-+... ...+...-+.. ..+.....+-...... -.. T Consensus 60 ------------RG--~~~~t~~~~-l-y~V~G~~LY~v~~~~G~i~gsgrVsM------a~n~~~~~v~~~~~~~~Y~~ 117 (472) T protein:vir:21 60 ------------RG--VEYNTAQNA-V-YRVCGGKLYKGESEVGDVAGSGRVSM------AHGRTSQAVGVNGQLVEYRY 117 (472) T ss_pred ------------ee--eeecccCCe-E-EEEeCCceEEEeeeeeeecccccEEE------eeCCeEEEEEECCceeEEEE Confidence 00 000000000 0 00001100000000 00000000000 0000000000000000 000 Q ss_pred ecCceeEEEEeecccCCccee------eeeeeeeceeecccccceeeeccccccccccc---ceeeEecccceeeccccc Q lcl|NC_010324. 616 TTNYPLRLRWSNFANENKAPT------LWDDFAYDRVVSSDLASNIVGQTQALENGYAG---YIDLADSNGSLIDILPLK 686 (1005) Q Consensus 616 ~~~~~~~~~~~~~~~~~~~~~------t~~~~~~~~~~~~~~t~~~~~~~~~~~t~~~~---~~~~t~t~~~iv~g~~~g 686 (1005) ......... -+........ ...-..+--+-....+...-............ +......+..|+.-.... T Consensus 118 ~~~~~t~~~--~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~f~is~l~d~~~~~~y~~FatAE~~pD~Iv~i~~~~ 195 (472) T protein:vir:21 118 DGTVKTVSN--WPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWR 195 (472) T ss_pred ecchhhhhc--ccCccccccccccceeEEEEecceEEEccCCcceeEEecCCCCccccCCccceeeccCCCceEEEEeec Confidence 000000000 0000000000 00000000000011110111110011111111 223444566788888889 Q ss_pred eeEEEEcCCceEEEEEECCCC-ceeeeeec---CCcceeecCceEEEECCEEEEEeCCC-----EEEECCCccccccchh Q lcl|NC_010324. 687 DYLFVYTEFETYIGSPTNNTY-QPLMFKKL---FNDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIASNR 757 (1005) Q Consensus 687 ~~tii~t~~~~~~~t~tggt~-~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 757 (1005) +..++|.++.+.+..-+|++. .-|=|+.. +-..||+++.||..+++.+|||+|+. ||+.+|-+++.|.... T Consensus 196 ~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~a 275 (472) T protein:vir:21 196 DFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATAS 275 (472) T ss_pred cEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHH Confidence 999999999999999999873 22445544 36899999999999999999999997 8999999999999999 Q ss_pred HHHHHHhhcCccccceEEE-EEcCCCCE-EEEEEecCCCCcCCCCCCeEEEEEeccC----eeeEeccc-----ce-eee Q lcl|NC_010324. 758 VKNMLINEVCLVNPLATRV-HLHQDKKE-VWVLYVGPGEPKESFACTKAAVWNYEFD----TWSFRTIP-----YA-QCI 825 (1005) Q Consensus 758 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-----~~-~~~ 825 (1005) |++-+ +...+.-+...++ .+.-+.|+ ++.+|| +|.+|||-.++ +|+++.-- .+ +++ T Consensus 276 IE~~i-~~y~~~e~~~A~~~t~~~eGH~fy~LtfP-----------~~Tw~yD~at~~~~e~W~~~~sg~~~~~~R~~~~ 343 (472) T protein:vir:21 276 IEKII-RSYTAEEMATGVMETLRFDSHELLIIHLP-----------RHVLVYDASSSQNGPQWCVLKTGLYDDVYRGVDF 343 (472) T ss_pred HHHHH-HhcCCccccceEEEEEEeCCeEEEEEEcC-----------CeeEEEEcccCccCceeeeeccCCCcCceeEEEE Confidence 99988 6665444333222 23334444 444555 67999998888 58877632 11 222 Q ss_pred eecccccccCCceecccccccCCCcchhhccccccccCccceeEEEecCCCeeEEEeccceeeecccceeEEEecceeee Q lcl|NC_010324. 826 GLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLKGFYQVDVGALDYFYDRLNDVVIEKPLEMR 905 (1005) Q Consensus 826 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 905 (1005) -..+.+.+-.+..=-.+=.+.||+ .|.-+... +.++.. +....|..+. | .-.+|.. T Consensus 344 ~~~~g~~ivGD~~nG~ly~L~fd~------~~~~d~~~---~~~r~~----p~~~~dn~R~-f----------d~eve~~ 399 (472) T protein:vir:21 344 MYEGNQITCGDKSEAVVGQLQFDI------SSQYDKQQ---EHLLFT----PLFKADNARC-F----------DLEVESS 399 (472) T ss_pred EeeCCeEEEEEcCCCeEEEEEecc------cccCCCcC---cEEEEc----cceeCCCCEE-E----------EEeeecc Confidence 223333332200000000012221 11122111 111110 0000011110 0 0000000 Q ss_pred ecCccccccccCCccceeeeeeeeeEEecCceeEEEEeeecCCCCCceECCCeEEecCCCeeEeeecCCce--------- Q lcl|NC_010324. 906 LERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPY--------- 976 (1005) Q Consensus 906 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 976 (1005) ...|. ++| ++|++...+|. +|. ...++.-.+.||| T Consensus 400 -~Gv~q-----~~d-------~v~L~wSddG~---------------~~~--------~~~~~~~g~~g~~~tr~~~~Rl 443 (472) T protein:vir:21 400 -TGVAQ-----YAD-------RLFLSATTDGI---------------NYG--------REQMIEQNEPFVYDKRVLWKRV 443 (472) T ss_pred -CCCCC-----cCc-------EEEEEeecccc---------------ccc--------cceeeccCCccchhcceeeeee Confidence 11111 111 35555443332 111 1122333333444 Q ss_pred ------EEEEEEEccCCCcEEEEeEEEEee Q lcl|NC_010324. 977 ------LFYNVIDNDVNSNAAINGLTIEFA 1000 (1005) Q Consensus 977 ------~~~~~~~~~~~~~~~~~~~~~~~~ 1000 (1005) +.+|| ++-.-.+-.++|..+.+- T Consensus 444 G~~r~~v~f~~-r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:21 444 GRIRRLIGFKL-RVITKSPVTLSGCQIRLE 472 (472) T ss_pred eecccceeEEE-EEEecCcceeeeeEEeeC Confidence 12232 333334445555554433 No 8 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=97.91 E-value=1.1e-05 Score=47.73 Aligned_cols=431 Identities=12% Similarity=0.078 Sum_probs=162.4 Q ss_pred ccceEEEEecCCCceEEEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeeccccccceeccceecc Q lcl|NC_010324. 470 EIVITATASPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVAT 549 (1005) Q Consensus 470 t~t~tat~t~~~~~tvt~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~~~~~~~~~~vt~ 549 (1005) -....+-. ..+....+-.. ..+.... .+.-......+...... ...+.- .... .. T Consensus 1 m~~~~~pl-~~G~~~~~~~~---d~~~~~p---VN~~a~~~~~~~s~~~l-------------~~tPGl-~~~a----~v 55 (472) T protein:vir:10 1 MPIQQLPL-MKGVGKDFRNA---DYIDYLP---VNMLATPKEILNSSGYL-------------RSFPGI-AKRS----DV 55 (472) T ss_pred CCeeeeee-ccCceeecccc---chhheee---eeeeeeccCCCccccee-------------ecCCCc-eeec----cC Confidence 00000000 00000000000 0000000 00000000000000000 000000 0000 00 Q ss_pred ceeeccc--cccccceeEEecCCceeEEEEec-CCCCcceeEEeeeccccccCceEEEEecCCce-eeEEecCceeEEEE Q lcl|NC_010324. 550 THYETPQ--VKEFESEYFVDLPGWGEQTVVDN-DGNPSVKKFNWKCERVRSFNNRLFALNMREAN-ASGVTTNYPLRLRW 625 (1005) Q Consensus 550 t~~~~~~--~t~~~~~~t~~~~~t~t~t~t~~-~~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t-~t~~~~~~~~~~~~ 625 (1005) .+.. -+ ....+...- .+.+....-+... ...+...-+.. ..+.....+-.+... .-............ T Consensus 56 ~G~~-RG~~~~~~~g~lY-~V~G~~LY~v~~~iGsiag~grVsM------a~n~~~~av~~~g~~~~Y~yd~~v~t~~~~ 127 (472) T protein:vir:10 56 NGVS-RGVEYNMAQNAVY-RVCGGKLYKGESEVGDVAGSGRVSM------AHGRTSQAVGVNGQLVEYRYDGTVKTVSNW 127 (472) T ss_pred Cccc-cceEEEeeCCeEE-EEecceEeeeecceecccCcccEEE------ecCCcEEEEEECCceeEEEeeccchhhhcc Confidence 0000 00 000000000 0111111100000 00000000000 000000111111000 00000000000000 Q ss_pred eecccCCcc------eeeeeeeeeceeecccccceeeeccccccccccccee---eEecccceeeccccceeEEEEcCCc Q lcl|NC_010324. 626 SNFANENKA------PTLWDDFAYDRVVSSDLASNIVGQTQALENGYAGYID---LADSNGSLIDILPLKDYLFVYTEFE 696 (1005) Q Consensus 626 ~~~~~~~~~------~~t~~~~~~~~~~~~~~t~~~~~~~~~~~t~~~~~~~---~t~t~~~iv~g~~~g~~tii~t~~~ 696 (1005) +...... .....-..+--+-....+................+.. ....+..|+.-....+..++|.++. T Consensus 128 --~~d~~~p~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~T 205 (472) T protein:vir:10 128 --PTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSST 205 (472) T ss_pred --ccccccccccccceeeeeeecceEEEeccCcceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccc Confidence 0000000 0000000000000111111111110011111112222 3445667788888899999999999 Q ss_pred eEEEEEECCCCc-eeeeee---cCCcceeecCceEEEECCEEEEEeCC-----CEEEECCCccccccchhHHHHHHhhcC Q lcl|NC_010324. 697 TYIGSPTNNTYQ-PLMFKK---LFNDSGILAPECVVEVEGSHFVVTQN-----DVILHNGATKKSIASNRVKNMLINEVC 767 (1005) Q Consensus 697 ~~~~t~tggt~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 767 (1005) +.+..-+|++.. -|=|+. ++-..||+++.||..+++.+|||+++ -||+.+|.+++.|....|++-+ +.++ T Consensus 206 iEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i-~~y~ 284 (472) T protein:vir:10 206 IEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPIASASIEKIL-RSYT 284 (472) T ss_pred eEEEEecCCCCcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHH-HhcC Confidence 999999997742 144554 66699999999999999999999996 3689999999999998888865 4455 Q ss_pred ccccceEEE-EEcCCCCE-EEEEEecCCCCcCCCCCCeEEEEEeccCeeeEecccceeeeeecccccccCCceecccccc Q lcl|NC_010324. 768 LVNPLATRV-HLHQDKKE-VWVLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPIWSDFQEI 845 (1005) Q Consensus 768 ~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 845 (1005) ..-+...++ .+.-+.|+ ++.+|| +|.+|||-.++.|..|- ++ +. + T Consensus 285 ~~e~~dA~~~t~~~~GH~fy~LtfP-----------~~Tw~yD~~t~~Wherw-----~~-------~~---------~- 331 (472) T protein:vir:10 285 ADELADGVMESLRFDAHELLIIHLP-----------RHVLVYDASSSANGPQW-----CV-------LK---------T- 331 (472) T ss_pred CccccceeEEEEEeCCeEEEEEEcC-----------CceeEeecccccCceee-----ee-------ec---------C- Confidence 433333222 23333444 444555 78999999999998762 00 00 0 Q ss_pred cCCCcchhhccccccccCccceeEEEecCCC-eeEEEeccceeeecccceeEEEecceeeeecCccccccccCCccceee Q lcl|NC_010324. 846 TWDDPSIKELVWRKDATNFRQRVTIVGSFLK-GFYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGIDFDNVTNEWNQKH 924 (1005) Q Consensus 846 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 924 (1005) ...-+-|+-..-.+--...|+||++. .+|++|-+..+.+|+...-...... |+.|.. | T Consensus 332 -----g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~l~~~~~td~G~~i~~~~~~p~---------~~~d~~-------R 390 (472) T protein:vir:10 332 -----GLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYGLQQEHLLFTPL---------FKADNA-------R 390 (472) T ss_pred -----CCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCcCCCcceEEEeccc---------eeCCCC-------e Confidence 00001122222333333457777544 4788888877777776655544322 333332 2 Q ss_pred eeeeeeEEecCceeEEEEeeecCCCC--CceECCCeEEecCCCeeEeeecCCce----EEEEE----------EEccCCC Q lcl|NC_010324. 925 INRFRPQTTGSGTYIFEAGGSQFSNE--YGHPHTSKTYTIGVDRHVSVRLNHPY----LFYNV----------IDNDVNS 988 (1005) Q Consensus 925 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~----------~~~~~~~ 988 (1005) + | +=.+.+.+|-.+..+- ..+|+-+.+| +-...++-.+.||| +..|+ +++-+-. T Consensus 391 v---~-----d~~ve~~~G~~~~adp~~~~~~sDg~~~--g~~~~~~~~~~g~~~~R~~~~RlG~~r~~vgf~~r~~~~~ 460 (472) T protein:vir:10 391 C---F-----DLEVESSTGVAQYADRLFLSATTDGINY--GREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKS 460 (472) T ss_pred E---E-----EEEEEeecCCCcccCceEEEeccCCccc--chhhhhhhccCcccccceeeeeeeeccccceEEEEEEecc Confidence 2 1 0011111222111111 2233321111 11122445556666 22221 1333334 Q ss_pred cEEEEeEEEEee Q lcl|NC_010324. 989 NAAINGLTIEFA 1000 (1005) Q Consensus 989 ~~~~~~~~~~~~ 1000 (1005) +-.|+|.-+++- T Consensus 461 ~v~l~ga~~~~e 472 (472) T protein:vir:10 461 PVTLSGAQIRIE 472 (472) T ss_pred ccceeeeeEEeC Confidence 444666555533 No 9 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=97.24 E-value=0.00012 Score=42.08 Aligned_cols=426 Identities=10% Similarity=0.068 Sum_probs=147.9 Q ss_pred ccccccceeecCCccceEEEE-ecCCCceEEEeecCceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeecc Q lcl|NC_010324. 457 TLDIVSASLDVGEEIVITATA-SPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDY 535 (1005) Q Consensus 457 t~~~~~~t~~~g~t~t~tat~-t~~~~~tvt~t~s~~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~ 535 (1005) -++ + .+-+.. .|-. .+.......+.......++-... ....... T Consensus 1 ~~~---------~--~~m~~~~ipl~----------------------~g~~~~~~~~d~~~~~PVN~~a~--p~~~~~s 45 (477) T protein:vir:35 1 MLS---------E--VFMPKIQIPLA----------------------KGLVKDIKTADYIDALPVNMLAT--PKEVLNA 45 (477) T ss_pred Ccc---------c--ceeeeeccccc----------------------cccccccccccceeeeeecccee--ecccccc Confidence 000 0 000000 0000 00000000000000000000000 0000000 Q ss_pred ccccceeccceecccee-ecccc--ccccceeEEecCCceeEEEEec-CCCCcceeEEeeeccccccCceEEEE-ecCCc Q lcl|NC_010324. 536 PWYHAVISNCAVATTHY-ETPQV--KEFESEYFVDLPGWGEQTVVDN-DGNPSVKKFNWKCERVRSFNNRLFAL-NMREA 610 (1005) Q Consensus 536 ~~~~~~~~~~~vt~t~~-~~~~~--t~~~~~~t~~~~~t~t~t~t~~-~~~~t~~tv~~~~~~~~~~~~~~~~~-~~~~~ 610 (1005) ............-.... ..-+. ...+.... .+.+.....+... ...+...-+.. ..++....+ ..+.. T Consensus 46 ~~~L~~~pG~~~~~~~~G~~RG~~~~~~~g~lY-~V~G~~LY~v~~~vG~I~gsg~VsM------a~n~~~~aIv~~g~~ 118 (477) T protein:vir:35 46 SGYLRSFPGIEKKQDAKGVSRGVHFNTKNNALY-RVCGNTLYRNDKEVADIAGMSRVSM------SHSSHSQAICFEGKV 118 (477) T ss_pred ccccccCCcceeeccCCccccceeEeecCCeEE-EEecCeeEeeeeeeeeecccccEEE------eeCCcEEEEEECCcc Confidence 00000000000000000 00000 00000000 0001110000000 00000000000 000000000 00000 Q ss_pred eeeEEecCceeEEEEeecccCC---cceeeeeeeeeceeecccccceeeecccccccccc---cceeeEecccceeeccc Q lcl|NC_010324. 611 NASGVTTNYPLRLRWSNFANEN---KAPTLWDDFAYDRVVSSDLASNIVGQTQALENGYA---GYIDLADSNGSLIDILP 684 (1005) Q Consensus 611 t~t~~~~~~~~~~~~~~~~~~~---~~~~t~~~~~~~~~~~~~~t~~~~~~~~~~~t~~~---~~~~~t~t~~~iv~g~~ 684 (1005) ..-..................+ .......-..+--+-....+.......-....... -+......+..|+.-.. T Consensus 119 ~gy~y~~t~~~~~~~~~~~~p~~~l~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s~~d~~~~FasAE~~pD~Ivgi~~ 198 (477) T protein:vir:35 119 KLYRYDGTEKALSNWPKDKYPQYDLGEVIDVCRNRGRYIWLQKGGERFGVTDLEDESKPDRYQPFYRAESQPDGIVSVDA 198 (477) T ss_pred eeEEEecccceeeecCccccCCccccceeEEEeeCceEEEeecCCCeEEEeecCCccccccccccccccCCCCceEEEEe Confidence 0000000000000000000000 00000000000000000001100000000000111 12234445667777888 Q ss_pred cceeEEEEcCCceEEEEEECCCCceeeeeec---C-CcceeecCceEEEECCEEEEEeCC-C----EEEECCCccccccc Q lcl|NC_010324. 685 LKDYLFVYTEFETYIGSPTNNTYQPLMFKKL---F-NDSGILAPECVVEVEGSHFVVTQN-D----VILHNGATKKSIAS 755 (1005) Q Consensus 685 ~g~~tii~t~~~~~~~t~tggt~~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~~~~~~~~ 755 (1005) ..+..++|.++.+.+..-+|++.--|-|++. + -..||+++.||..+++.+|||+++ . ||+.+|.+++.|.. T Consensus 199 ~~~~i~lfG~~TiEvw~ntG~a~f~~p~~r~~~~~mIq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~q~~rIST 278 (477) T protein:vir:35 199 WRDLIVCFGSSSIEYFTLTGSADTSQPLYIHQAAYMIQAGIAGRDCKCRYQDKYAILSHQSTGQPAVYLIGAGEKNKIST 278 (477) T ss_pred eccEEEEEeccceEEEEecCCCCCCcceeecCCceeeeecccCchhhhhhCceEEEEecCCCcccEEEEccCceeEEecC Confidence 8999999999999999999988654344444 4 589999999999999999999997 3 79999999999999 Q ss_pred hhHHHHHHhhcCccccceEEE-EEcCCCCE-EEEEEecCCCCcCCCCCCeEEEEEeccC----eeeEec-----ccce-e Q lcl|NC_010324. 756 NRVKNMLINEVCLVNPLATRV-HLHQDKKE-VWVLYVGPGEPKESFACTKAAVWNYEFD----TWSFRT-----IPYA-Q 823 (1005) Q Consensus 756 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-----~~~~-~ 823 (1005) ..|++-+ +...+....+.|+ .+.-+.|+ ++.+|| +|-+|||-..+ +|+.+. -+.+ . T Consensus 279 ~aIE~~i-~ay~~~e~a~af~~t~~~eGH~fy~LtfP-----------~~Tw~yD~at~~w~e~W~~~~~g~~~~~~Ra~ 346 (477) T protein:vir:35 279 ATIDKII-RYYSADELAASFMESIRFDNHELLLLHLP-----------KHTLCFDGSASHQYSQWSLLKSGFYDEPYRAI 346 (477) T ss_pred HHHHHHH-HhcCCcchhceeEEEEEeCCeeEEEEEcC-----------CceEEEecccccccceeeeeccCCccCceEEE Confidence 9888875 4455555555442 23334444 445555 67899996665 677763 2221 1 Q ss_pred eeeecccccccCCceecccccccCCCcchhhccccccccCccceeEEEec-CCCeeEEEeccceeeecccceeEEEecce Q lcl|NC_010324. 824 CIGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGS-FLKGFYQVDVGALDYFYDRLNDVVIEKPL 902 (1005) Q Consensus 824 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 902 (1005) ++-.++.+ .|.|| ....+|++|-+..+.+|+...-.+...+ T Consensus 347 ~~~~~~g~-------------------------------------~~vGD~~ng~l~~ld~~~~~d~g~~i~~~~~~p~- 388 (477) T protein:vir:35 347 DFMFFDNQ-------------------------------------ITVGDKKEGVLGHLIFNASNQYEQQTEHLLYTPM- 388 (477) T ss_pred EEEEeCCe-------------------------------------EEEEEcCCCeEEEECCCCcccCCCccceEEecce- Confidence 22222222 34444 3334555555555555555443333222 Q ss_pred eeeecCccccccccCCccceeeeeeeeeEEecCceeEEEEeeecC-CCCCceECCCeEEecCCCeeEeeecCCce----E Q lcl|NC_010324. 903 EMRLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQF-SNEYGHPHTSKTYTIGVDRHVSVRLNHPY----L 977 (1005) Q Consensus 903 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 977 (1005) |+.|.. |+..+..- +..|-.+. -.-+..|+.- =..-+...++.-.+.||| + T Consensus 389 --------~~~d~~-------Rv~~~el~--------~~tGvgq~~d~v~L~~sdd-G~~~~~~~~~~~g~~g~~~~r~~ 444 (477) T protein:vir:35 389 --------IKADNA-------RLFDFELE--------ASTGVAQIADKLFLSVTTD-GINYSREQLIEQNSPFQYDKRIL 444 (477) T ss_pred --------eeCCCC-------eEEEEEEE--------EecCcCccCceEEEEEecc-ccccccceeecCCCcccccccee Confidence 222222 22111100 00000000 0012222211 111112344555556666 2 Q ss_pred EEEEE----------EccCCCcEEEEeEEEEeeccCCC Q lcl|NC_010324. 978 FYNVI----------DNDVNSNAAINGLTIEFAVGGRR 1005 (1005) Q Consensus 978 ~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~ 1005 (1005) ..|+= ++-.-.|--++| .++| T Consensus 445 ~~RlG~~r~~vgf~~r~~~~~pv~l~~-------~~~~ 475 (477) T protein:vir:35 445 WRRIGRVRKNIGFKIRIITKSPVTLSD-------LSIR 475 (477) T ss_pred eeeeeeceeccceEEEEEecCCceecc-------ceeE Confidence 22221 111112222222 2333 No 10 >protein:vir:5202 Length: 448 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040725;genbank:gi:9626396;genbank:GeneID:1260967 Probab=96.60 E-value=0.00047 Score=38.81 Aligned_cols=206 Identities=13% Similarity=0.141 Sum_probs=71.1 Q ss_pred CeEEEecccceeeee-cCCcceEeeccceecceeeeecccceEecCCceEEEEEEecCCccccccccccEEEE---ecCC Q lcl|NC_010324. 1 MALYPIKSLGAVGVI-ADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFISMPFDYY---SAGN 76 (1005) Q Consensus 1 ~a~y~v~s~~t~~~~-~~~~~tt~~~~~~t~~~~~~~~~~~~~~~g~t~t~t~t~~~~~~t~~~~t~~~~t~t---ss~~ 76 (1005) |++......--.-.+ .......+.......+-+-... .... ..+.+..-+..+. .+....-.|. .... T Consensus 236 ~~v~~~~~~~dl~li~~~~~~~~ldv~~la~afn~~~~-----~~~~-~~~~vd~F~~~g~--~~i~vskk~~~~~d~~~ 307 (448) T protein:vir:52 236 MAVRTRSYMEDLHLIIDADLEAELDVDVLAKAFNMNRT-----DFLG-NVTVIDGFASTGL--EAVLVDKDWFMVYDNLH 307 (448) T ss_pred ccccccccceeeEEEECCCceEeecHHHHHHHhccccc-----ccCc-ceEEecCccccCc--eeeeeeeeeeeeeeccc Confidence 443333222221111 1111111111111100000000 0000 0000000000000 0000011111 1111 Q ss_pred eEEEEccCCcEEEeecCceEEEEEe-ecCcceeEEEEEEe-eecccceeeccceeEEecCCceEEEEEEecCCCCCcceE Q lcl|NC_010324. 77 SFLVVGTDKKLYKLTDESLTDISRK-VATVTKKASASIKI-YPVVSQIVPKESTISMNFNQTKNLEVSLLPADANNTDLV 154 (1005) Q Consensus 77 ~~~tv~~~~~~~t~t~~~~~~~t~~-~~~~t~~~t~tvtv-~~~vt~it~~~~t~t~~~g~t~tltatvt~~~~t~~tvt 154 (1005) .........+.+..- ...+... .......+...+.. .+.+.++++++.+.++..|++.+|+|++.+.++.++.|+ T Consensus 308 kg~t~~na~GL~~N~---~~TItatss~~~~t~atA~V~~t~paVtsVsVsPttasL~~G~TqqlTATVsg~na~~~~VT 384 (448) T protein:vir:52 308 KMETVRNPRGLYWNY---YYHVWQTLSVSRSANAVAFVSGDVPAVTQVIVSPNIAAVKQGGKQQFTAYVRATDGKDHKVV 384 (448) T ss_pred eeeeeeccccceeee---eeEEEEEEccCccccceEEEEecccccceEEEcccceeecCCCeEEEEEEEecCCCCCCceE Confidence 111111122222111 0111111 11222222222222 255778999999999999999999999999999999999 Q ss_pred EEEcCCcc-eEEecCCceeeccceeeeccccceeeeeecccceeeeeeeEeeeccccceeecccceeeeeecce Q lcl|NC_010324. 155 WEVSNSSY-GSITVDPSDSKLATLTSFEKEGNLVVTISTANESVVAQIAVNIIDGDSGIFLSQDTVTIRKGGTT 227 (1005) Q Consensus 155 ~tss~~~~-~tv~~~~~~~~~~~~tt~~~~gt~t~t~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~~~~~ 227 (1005) |++++... +++..+|..+.. ..++.+++.................. ...+...+.... .|... T Consensus 385 WSvS~ns~~aTVsssG~vTv~-------a~gTatITVtATvdts~a~~~~~vv~-ea~VsvtP~~as--~G~q~ 448 (448) T protein:vir:52 385 WSVEGGSTGTAITGDGLLSVS-------GNEENQLTVKATVDIGTEDKPNLVVG-EAVVSIRPNNAS--GGAQA 448 (448) T ss_pred EEEcCCceeeEEeCCccEEec-------cCCcceEEEEEEecCcccCCceeeee-eEEEEecCCCCC--CcCCC Confidence 99987766 577777654321 22222333222111111100000000 000000000000 00000 No 11 >protein:vir:118 Length: 449 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690641;swissprot:sw:q37888;genbank:gi:22855155;interpro:IPR003343;uniprot:Q37888;genbank:GeneID:955370 Probab=96.42 E-value=0.00031 Score=39.79 Aligned_cols=127 Identities=8% Similarity=0.062 Sum_probs=53.7 Q ss_pred Ce---EEEecccceeeeecCCcceEeecccee-cceeeeecccceEecCCceEEEEEEecCCccccccccccEEEEecCC Q lcl|NC_010324. 1 MA---LYPIKSLGAVGVIADQAPTDLAPNAFT-NAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFISMPFDYYSAGN 76 (1005) Q Consensus 1 ~a---~y~v~s~~t~~~~~~~~~tt~~~~~~t-~~~~~~~~~~~~~~~g~t~t~t~t~~~~~~t~~~~t~~~~t~tss~~ 76 (1005) +. -+.+......+...........+.... ..+.. ......+..|.+.+|++++.+.+.. ...++|++++. T Consensus 318 ~y~n~~~tvt~t~~~~~~~~~~a~~~~~~~~~VTsVsV-tPss~tL~~G~T~qLTATV~psnat-----nk~VTWSsSd~ 391 (449) T protein:vir:11 318 LYWNYYYHVWQVLSASRFANAVAFVTGDDVPAVTQVIV-SPAIASVKQGKSQAFTAYVRATDDK-----EHEVVWSVDGG 391 (449) T ss_pred eeeccceEEEEEEecccccceeeeeeeeccceeeEEEe-eccceeeecCceEEEEEEEecCCCC-----CceEEEEEeCC Confidence 00 011111111111111111111111111 11111 1234567789999999988776543 45689998877 Q ss_pred eE-EEEccCCcEEEeecCceEEEEEeecCcceeEEEEEEeeecccceeeccceeEEecCCceE Q lcl|NC_010324. 77 SF-LVVGTDKKLYKLTDESLTDISRKVATVTKKASASIKIYPVVSQIVPKESTISMNFNQTKN 138 (1005) Q Consensus 77 ~~-~tv~~~~~~~t~t~~~~~~~t~~~~~~t~~~t~tvtv~~~vt~it~~~~t~t~~~g~t~t 138 (1005) .. ++++.+ |..+..+.+...+++.....+...++.+++... ..+++.+.+.. |...- T Consensus 392 s~~ATVda~-G~VTAva~GTAtITAta~~~s~TaT~tvtV~~~-a~VtVtP~sa~---ggaqA 449 (449) T protein:vir:11 392 STGTSISSD-GVLTVAANETNQLTVKATVDIGTADEPKPVVGE-AVVNVRPDSST---GGAQA 449 (449) T ss_pred ceEEEEcCC-ceEEEecCccEEEEEEEecCcEEEEEEeeecce-EEEEEeecCCC---CcccC Confidence 64 667655 444545555556666555544444444333211 11122211110 10000 No 12 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=95.38 E-value=0.0022 Score=35.18 Aligned_cols=175 Identities=11% Similarity=0.085 Sum_probs=64.3 Q ss_pred CeEEEecccceeeeecCCcceEeeccceecceeeeecccceEecCCceEEEEEEecCCccccccccccEEEEecCCeEEE Q lcl|NC_010324. 1 MALYPIKSLGAVGVIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFISMPFDYYSAGNSFLV 80 (1005) Q Consensus 1 ~a~y~v~s~~t~~~~~~~~~tt~~~~~~t~~~~~~~~~~~~~~~g~t~t~t~t~~~~~~t~~~~t~~~~t~tss~~~~~t 80 (1005) +-+|..........+ ...+........ ......+......... .... ...|......... T Consensus 211 ~~v~~s~~~~~~t~~------a~~~~a~~~at~-----a~v~~~~~~~~~s~s~-~~~v--------~~~~~~~~~~t~~ 270 (392) T protein:vir:99 211 YEIVESTLIPHGDAY------LYHPTAFIMATR-----APAPPMGAVRSTAISG-DQRI--------AMRWLVDYDSTIT 270 (392) T ss_pred eEEEeecccccccce------eeeccccccccc-----cccccccccceeEEec-ccce--------ecceeecccceee Confidence 111111111110000 000000000000 0000000000000000 0000 0001100000000 Q ss_pred EccCCcEEEeecCceEEEEEeecCcceeEEEE---EEeeecccceeeccceeEEecCCceEEEEEEecCCCCC--cceEE Q lcl|NC_010324. 81 VGTDKKLYKLTDESLTDISRKVATVTKKASAS---IKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADANN--TDLVW 155 (1005) Q Consensus 81 v~~~~~~~t~t~~~~~~~t~~~~~~t~~~t~t---vtv~~~vt~it~~~~t~t~~~g~t~tltatvt~~~~t~--~tvt~ 155 (1005) .+.. ....... ........ ......... ......+..+.+.+....+..++..++.+++.+.+... ..++| T Consensus 271 s~~~-~v~~~~g--~~~v~~~~-~~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~vtw 346 (392) T protein:vir:99 271 SNRS-LIDTYFG--LKVVEDPN-GVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDF 346 (392) T ss_pred cccc-ccceeEE--EEEEeecc-ccceeeeeeeeeecceeeeeeeecccceeEeeeccceeEEEEEEecCCccccceEEE Confidence 0000 0000000 00000000 000000111 11122234455667777888888888888888887765 66999 Q ss_pred EEcCCcceEEecCCceeeccceeeeccccceeeeeeccc--ceeeeeeeEeee Q lcl|NC_010324. 156 EVSNSSYGSITVDPSDSKLATLTSFEKEGNLVVTISTAN--ESVVAQIAVNII 206 (1005) Q Consensus 156 tss~~~~~tv~~~~~~~~~~~~tt~~~~gt~t~t~~~~~--~~~~~~~t~~~~ 206 (1005) +++++.+++++.+|. .++...|+++++..... +.....+..... T Consensus 347 ~Ssn~~vAtV~~~G~-------Vt~v~~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 347 ESSATDKATVAAGGL-------VTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred EEcCCeeEEEcCCce-------EEEEecceEEEEEEEEcCCCcEEEEEEEEeC Confidence 999999999998775 45566677777776543 222222222222 No 13 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=95.04 E-value=0.0029 Score=34.50 Aligned_cols=430 Identities=13% Similarity=0.063 Sum_probs=145.8 Q ss_pred ceeEEEeecCCceeEEeeccCCcEEEEEeeCcceecceeEeeccccccceeccceecccee-eccccc--cccceeEEec Q lcl|NC_010324. 492 TGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVATTHY-ETPQVK--EFESEYFVDL 568 (1005) Q Consensus 492 ~~~~t~~~~~~~t~~~~~~~~gt~tiT~~~t~~~~~~~~~~~~~~~~~~~~~~~~vt~t~~-~~~~~t--~~~~~~t~~~ 568 (1005) -..... .-..+....-.+...-....++-.... ..........-............ ..-+.- ..+..... + T Consensus 1 m~~~q~---pl~~g~~~~~~~~~~~~~lpvN~y~~p--~~~~~ss~~lr~~PG~~~~~~~~g~~RG~~~~~~~~~lY~-V 74 (472) T protein:vir:10 1 MAIMQL---PLLRGLGKARDDADYIDALPVNMLATP--KPVLNASGYLRSFPGITHKAEVAGVSRGVQYNTHEKTVYR-G 74 (472) T ss_pred CCceee---ecccccccCccccCceeeeeeeeeecc--ccccccceeecccCCceeecCCCcccceeEeeeeCCeEEE-E Confidence 000000 000000011111111100001100000 00000000000000000000000 000000 00011111 1 Q ss_pred CCceeEEEEec-CCCCcceeEEeeeccccccCceEEEEecCCceeeEEecCceeEEEEe---ecccCCc-ceeeeeeeee Q lcl|NC_010324. 569 PGWGEQTVVDN-DGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTNYPLRLRWS---NFANENK-APTLWDDFAY 643 (1005) Q Consensus 569 ~~t~t~t~t~~-~~~~t~~tv~~~~~~~~~~~~~~~~~~~~~~t~t~~~~~~~~~~~~~---~~~~~~~-~~~t~~~~~~ 643 (1005) .+..-.-+... ...+...-+... .......+.+... ...-...........+. -...... ......-..+ T Consensus 75 ~G~~Ly~v~~~vG~iagsg~VsMa----~~~~~q~v~v~g~-~~~y~y~g~~~t~~~~~~~~~it~~dl~~~~~v~~~dG 149 (472) T protein:vir:10 75 LGNQLYKGHKPIADLAGKGRISMA----FSRNSQAVVAAGK-MTLYRYDGTVKTLENWPKEKKYTQYDIGNVRDMCHLRG 149 (472) T ss_pred ecceEEEEEeeeeeecccccEEEE----ecCCceEEEEecc-eeEEEeccchhhhhhccccccCCccccCCceeEEEeCc Confidence 11111110000 000000000000 0000011111000 00000000000000000 0000000 0000111111 Q ss_pred ceeeccccccee-eecccc--cccccccceeeEecccceeeccccceeEEEEcCCceEEEEEECCCCceeeeee------ Q lcl|NC_010324. 644 DRVVSSDLASNI-VGQTQA--LENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFKK------ 714 (1005) Q Consensus 644 ~~~~~~~~t~~~-~~~~~~--~~t~~~~~~~~t~t~~~iv~g~~~g~~tii~t~~~~~~~t~tggt~~~~~~~~------ 714 (1005) --+-....+... ...... ......-+......+..|+.-....+..+.|.++.+.+..-+|++. |.|+. T Consensus 150 yfV~~~~gt~~~~iS~L~d~s~~~~~~~FatAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~--fpf~r~~~~pg 227 (472) T protein:vir:10 150 RYVWCKDGSDIFGVTDLEDESHPDRYRALYRAESQPDGIIGIDSWRDFIVCFGASTIEYFSLTGAAD--GQSAIYAAQPA 227 (472) T ss_pred eEEEeecCCceEEEeecCCcccCCcccceeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCC--cceeeeccCcc Confidence 111111111111 000000 0011111233444566778888889999999999999999999875 66664 Q ss_pred cCCcceeecCceEEEECCEEEEEeCCC-----EEEECCCccccccchhHHHHHHhhcCccccceEEEEE-cCCCCE-EEE Q lcl|NC_010324. 715 LFNDSGILAPECVVEVEGSHFVVTQND-----VILHNGATKKSIASNRVKNMLINEVCLVNPLATRVHL-HQDKKE-VWV 787 (1005) Q Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~ 787 (1005) ++-+.||+++.||..+++.+|||+++. ||+.+|.+++.|....|++-+ +.++..-+...+++. .-+.|+ ++. T Consensus 228 ~~iq~Gcaa~~sv~~~~~s~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i-~~y~~~e~~dA~~~s~~~eGH~fy~L 306 (472) T protein:vir:10 228 LMVEKGIAGTHCKTRLGDAHVIISHQATGAPSVFLINQAQATSIATATIEKIL-RSYTHDELASAVMETVRFDSHELVLI 306 (472) T ss_pred ceeeecccCchhhhhhCceEEEEecCCCcceEEEEccCceEEEecCHHHHHHH-HhCCcccccceeEEEEEeCCeEEEEE Confidence 346799999999999999999999994 799999999999998888865 455544444433322 233333 445 Q ss_pred EEecCCCCcCCCCCCeEEEEEeccCe----eeEeccc-----ce-eeeeecccccccCCceecccccccCCCcchhhccc Q lcl|NC_010324. 788 LYVGPGEPKESFACTKAAVWNYEFDT----WSFRTIP-----YA-QCIGLVDPPVLERGPIWSDFQEITWDDPSIKELVW 857 (1005) Q Consensus 788 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 857 (1005) +|| ++.+|||-.++. |+.+.-- .+ .++-.++.+. T Consensus 307 tfP-----------~~Tw~yD~at~~~~~~w~~~~~g~~~~~~Ra~~~~~~~g~~------------------------- 350 (472) T protein:vir:10 307 HLS-----------RQVLCYDAAANQNGLQWSLLKTGFYHAPYRGIDFMFADHHL------------------------- 350 (472) T ss_pred EcC-----------CeeEEEeccCCccceeeeeeecCCccCceEEEEEEEeCCeE------------------------- Confidence 555 678999955554 4444311 11 1222223233 Q ss_pred cccccCccceeEEEec-CCCeeEEEeccceeeecccceeEEEeccee---eeecCccccccccCCccceeeeeeeeeEEe Q lcl|NC_010324. 858 RKDATNFRQRVTIVGS-FLKGFYQVDVGALDYFYDRLNDVVIEKPLE---MRLERTGIDFDNVTNEWNQKHINRFRPQTT 933 (1005) Q Consensus 858 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 933 (1005) |.|| ....+|++|-+..+.+|+...=...+..+. +|+- .++|+-++.-.. .--++|++.. T Consensus 351 ------------~vGD~~ng~l~~ld~~~~td~g~pi~~~~~tp~~~~~n~Rvf--d~el~~~tGvg~--~~~~v~L~wS 414 (472) T protein:vir:10 351 ------------TCGDKNDSLLGQLDFASSAQYEKPQEHVLYTPLFKADNARVF--DFELEASTGVAH--IADRLFLSAT 414 (472) T ss_pred ------------EEEEcCCCeEEEEcCcCcCCCCceeEEEeeccceecCCCeEE--EEEEEeeCCcCc--cCceEEEEEe Confidence 3333 223344444444433333322222110000 0000 011111111110 1113566655 Q ss_pred cCceeEEEEeeecCCCCCceECC-CeEEecCCCeeEeeecCCce-EEEEEEEccCCCcEE--EEeEEEE Q lcl|NC_010324. 934 GSGTYIFEAGGSQFSNEYGHPHT-SKTYTIGVDRHVSVRLNHPY-LFYNVIDNDVNSNAA--INGLTIE 998 (1005) Q Consensus 934 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~ 998 (1005) .+|-....-. .....+++..+. +--.|+|. +|- |= +.+|+ -.-.|.. +.|.-+| T Consensus 415 ddg~~~~~~~-~~~~~g~~~~~~r~~w~RlG~-----ar~--~vgf~~rv---~~s~pv~~~~~~a~~e 472 (472) T protein:vir:10 415 ADGLHFGREQ-MINQNAPFAYDRRILWRRMGR-----VRK--NLGFKVRV---ITSSPVTLSGCQIRME 472 (472) T ss_pred ccccccchhH-HHhhcCccchhheeeeheeec-----ccc--ccceEEEE---EEecccccccceeeeC Confidence 4443221100 111111111110 01113333 111 11 22343 2122332 3444444 No 14 >protein:vir:5202 Length: 448 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040725;genbank:gi:9626396;genbank:GeneID:1260967 Probab=94.15 E-value=0.0025 Score=34.85 Aligned_cols=127 Identities=12% Similarity=0.056 Sum_probs=50.7 Q ss_pred Ce---EEEecccceeeeecCCcceEeeccceecceeeeecccceEecCCceEEEEEEecCCccccccccccEEEEecCCe Q lcl|NC_010324. 1 MA---LYPIKSLGAVGVIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFISMPFDYYSAGNS 77 (1005) Q Consensus 1 ~a---~y~v~s~~t~~~~~~~~~tt~~~~~~t~~~~~~~~~~~~~~~g~t~t~t~t~~~~~~t~~~~t~~~~t~tss~~~ 77 (1005) |- -|++.+............... .................+..|++.++++++.+.+.. ...++|++++.. T Consensus 318 L~~N~~~TItatss~~~~t~atA~V~-~t~paVtsVsVsPttasL~~G~TqqlTATVsg~na~-----~~~VTWSvS~ns 391 (448) T protein:vir:52 318 LYWNYYYHVWQTLSVSRSANAVAFVS-GDVPAVTQVIVSPNIAAVKQGGKQQFTAYVRATDGK-----DHKVVWSVEGGS 391 (448) T ss_pred ceeeeeeEEEEEEccCccccceEEEE-ecccccceEEEcccceeecCCCeEEEEEEEecCCCC-----CCceEEEEcCCc Confidence 11 335555544433222211111 111111111112234567789999999988755443 445899988777 Q ss_pred E-EEEccCCcEEEeecCceEEEEEeecCcceeEEEEEEeeecccceeeccceeEEecCCceEEEEEEecCCCCCcceE Q lcl|NC_010324. 78 F-LVVGTDKKLYKLTDESLTDISRKVATVTKKASASIKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADANNTDLV 154 (1005) Q Consensus 78 ~-~tv~~~~~~~t~t~~~~~~~t~~~~~~t~~~t~tvtv~~~vt~it~~~~t~t~~~g~t~tltatvt~~~~t~~tvt 154 (1005) . ++++. .|.+++.+.+...+++.....+..+.....+. ....+++.|.......-. T Consensus 392 ~~aTVss-sG~vTv~a~gTatITVtATvdts~a~~~~~vv--------------------~ea~VsvtP~~as~G~q~ 448 (448) T protein:vir:52 392 TGTAITG-DGLLSVSGNEENQLTVKATVDIGTEDKPNLVV--------------------GEAVVSIRPNNASGGAQA 448 (448) T ss_pred eeeEEeC-CccEEeccCCcceEEEEEEecCcccCCceeee--------------------eeEEEEecCCCCCCcCCC Confidence 6 44544 44545444333333332211111100000000 011223333322211100 No 15 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=91.83 E-value=0.014 Score=30.74 Aligned_cols=112 Identities=9% Similarity=0.029 Sum_probs=49.2 Q ss_pred Ce-----------------EEEecccceeeeecCCcce---Eeeccceecceee--eecccceEecCCceEEEEEEecCC Q lcl|NC_010324. 1 MA-----------------LYPIKSLGAVGVIADQAPT---DLAPNAFTNAINA--RFVEQRVFKTGGNAPLSYVDEDKD 58 (1005) Q Consensus 1 ~a-----------------~y~v~s~~t~~~~~~~~~t---t~~~~~~t~~~~~--~~~~~~~~~~g~t~t~t~t~~~~~ 58 (1005) ++ +-++.....+.......-. ..........+.. .......+..+...++..+..+.+ T Consensus 257 v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~ 336 (392) T protein:vir:99 257 IAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDAN 336 (392) T ss_pred eecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeecccceeEeeeccceeEEEEEEecC Confidence 00 0000000110000000000 0000000000100 111122344556666666665554 Q ss_pred ccccccccccEEEEecCCeEEEEccCCcEEEeecCceEEEEEeecC--cceeEEEEEEee Q lcl|NC_010324. 59 LTPLSFISMPFDYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVAT--VTKKASASIKIY 116 (1005) Q Consensus 59 ~t~~~~t~~~~t~tss~~~~~tv~~~~~~~t~t~~~~~~~t~~~~~--~t~~~t~tvtv~ 116 (1005) .. .....++|.|+++.+++++..+.+..+. .+...+++.... ....+++.+++. T Consensus 337 ~~---~~~~~vtw~Ssn~~vAtV~~~G~Vt~v~-~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 337 GD---DVTALCDFESSATDKATVAAGGLVTGVA-AGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred Cc---cccceEEEEEcCCeeEEEcCCceEEEEe-cceEEEEEEEEcCCCcEEEEEEEEeC Confidence 43 2345689999999999999865555554 455566666543 345566666554 No 16 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=28.32 E-value=0.8 Score=21.11 Aligned_cols=112 Identities=9% Similarity=-0.051 Sum_probs=45.6 Q ss_pred eeecccccccCCceecccccccCCCcchhhccccccccCccceeEEEecCCCeeEEEeccceeeecccceeEEEecceee Q lcl|NC_010324. 825 IGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLKGFYQVDVGALDYFYDRLNDVVIEKPLEM 904 (1005) Q Consensus 825 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 904 (1005) |+++. ++..++...- |-|+..+.+.+-.. ++ .=+.++|.++ T Consensus 1 M~~v~-------------------------~si~nl~~Gv--------SqQp~~~r~pgQ~~----~q--~N~~~d~v~G 41 (905) T protein:vir:78 1 MGAVL-------------------------QKIPNLLGGV--------SQQPDPVKLPGQVR----EA--ENVYLDPTFG 41 (905) T ss_pred Cccce-------------------------ecchhhhCce--------eecchhhcCCcchh----hh--hccccccccc Confidence 11111 1111111111 23554444444332 12 3355788889 Q ss_pred eecCccccccccCCccceeeeeeeeeEEecCceeEEEEeeecCCCCCceECCCeEEecCCC--eeEeeecCCc-eEEEEE Q lcl|NC_010324. 905 RLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHPHTSKTYTIGVD--RHVSVRLNHP-YLFYNV 981 (1005) Q Consensus 905 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~ 981 (1005) +.+|+|..| +|+|... +....+|+ .|+=+.+ |++.....|. +.-+|+ T Consensus 42 l~kRp~~~~--------i~~l~~~-------------------~~~~~~~~---~~~r~~~e~y~~~~~~~g~~~~~i~v 91 (905) T protein:vir:78 42 CRKRPATKF--------VGELATN-------------------LPSDTRWF---PIFRDAGERYAVALYKDGSGNTQVRV 91 (905) T ss_pred cccCchhhh--------hhhhcCC-------------------CCCCceEE---EEEeCCCceEEEEEeeCCCCCcceEE Confidence 999999554 1222110 11223332 1111111 4444445553 234666 Q ss_pred EEccCCCcEEEE--eEEEEeeccCCC Q lcl|NC_010324. 982 IDNDVNSNAAIN--GLTIEFAVGGRR 1005 (1005) Q Consensus 982 ~~~~~~~~~~~~--~~~~~~~~~~~~ 1005 (1005) +++.++..-.++ +...+|--.+.| T Consensus 92 ~d~~~G~~~~V~~~~~~~~yl~~~~~ 117 (905) T protein:vir:78 92 WDMQTGAERTVTPDATATAYLATTNL 117 (905) T ss_pred EEccCCcEEEEecCCCccceeecCCC Confidence 666555444332 333444444444 No 17 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=27.17 E-value=1.9 Score=19.09 Aligned_cols=665 Identities=12% Similarity=0.050 Sum_probs=187.5 Q ss_pred ceeeeeccccc-eeecccceeeeeecccccccceeeeeecCccc------ccccc--ccccceeccCCcceEEEEEeecc Q lcl|NC_010324. 291 DSISLSQSDVT-VSRGSQYILTATLSPANAPNQNITWTSSNPNI------ATVSG--TSTQGTINALLAGFTEITATTEE 361 (1005) Q Consensus 291 ~~~~~~~~~~~-~~~~~~~~~t~t~~~~~~t~~~~~~tss~~~v------atv~~--t~~~~~vt~~~~Gt~tiTvt~~~ 361 (1005) .....+..... ...+-.........+.++.-........-.+. ..... ......+...+.|..-++..... T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~v~~~~W~ 80 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIAVTSHNWE 80 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccceEEeeeechh Confidence 00000000000 00000000000000000000000000000000 00000 00000000011111111110000 Q ss_pred ---cceee-------eeeEEEccccc-cccccceeeeeccccceeeeeeeeecccccc-cceeeeccccceeccccceee Q lcl|NC_010324. 362 ---GNRIA-------VCTVRVDLAGR-TMRTSAMAFAAPVSESVETQEEEVVTPPESE-ETVYFAEPTSGIDTSGMYEGN 429 (1005) Q Consensus 362 ---~~~t~-------t~tvTvt~~~~-~~~~~~~~~t~~~~~~~~~~~~~~~t~~~~~-~~~~~~~~t~~~~~~~~~~~~ 429 (1005) |.... ...+.+-.... .......... ....-+... ........-............ T Consensus 81 na~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~~------------a~~nlSPsh~isv~v~~G~livanp~i~~~~ 148 (771) T protein:vir:95 81 NAGGEVGRWISLVQVGTELKFFQTTGETLSEGNFYNY------------QFVNMSPSHKLSYAVVDGLLVVANGSRDIYV 148 (771) T ss_pred hcccccCcEEEEEEeccEEEEEecCCCcccccceeee------------ecceeccceeEEEEEeeeEEEEecCCccEEE Confidence 00000 00000000000 0000000000 000000000 000000000000000000000 Q ss_pred eEEeeccccceeeeeeEeeeeeeccccccccccceeecCCccceE-EEEecC---CCceEEEeecCceeEEEeecCCcee Q lcl|NC_010324. 430 NFYDYSNVNDIEGFARASLLATPLSSVTLDIVSASLDVGEEIVIT-ATASPE---GEYSYQWSVDKTGYVSTTSVTGKSI 505 (1005) Q Consensus 430 ~~~~~~~~~~~~~~~~~t~~~~~~~~~t~~~~~~t~~~g~t~t~t-at~t~~---~~~tvt~t~s~~~~~t~~~~~~~t~ 505 (1005) .......-.........-. ......-.....+..+....-. -+.++. +-.+..|........... .....+ T Consensus 149 --~~~d~~t~s~t~~~ll~r~--rf~~q~~~~G~d~~~~~~~~~~gt~~tn~~iynlyN~gw~~pk~~~~snt-~~~~iV 223 (771) T protein:vir:95 149 --FEYDSGSVSVTTKRLLVRD--LFGVQDIVNGVDLRQGNDIATRPTVQTNAHIYNLRNQTFGVPRVTWHSNE-PSDPIV 223 (771) T ss_pred --EEecCCcceeEeeeeeeee--hhhccccccccceecccccccCCcccCchhheeccccceeccccccccCC-ccccce Confidence 0000000000000000000 0000000000000000000000 000000 000111110000000000 000000 Q ss_pred EEeeccCCcEEE-EEeeC----cceecceeEeeccccccceeccceeccceeeccccccccceeEEec--CCceeEEEEe Q lcl|NC_010324. 506 KLVALRKGEINV-TCTVS----QMTQKDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYFVDL--PGWGEQTVVD 578 (1005) Q Consensus 506 ~~~~~~~gt~ti-T~~~t----~~~~~~~~~~~~~~~~~~~~~~~~vt~t~~~~~~~t~~~~~~t~~~--~~t~t~t~t~ 578 (1005) .......+..-. ....+ ........+....... .+..... ........-....+ -+-..++..+ T Consensus 224 ~~y~a~~g~~pS~sd~~N~a~~k~~~~Ei~t~~~f~~~-------~~~~~~~--Gt~~~~~G~yi~da~~~g~~~Lt~~v 294 (771) T protein:vir:95 224 TFRSAASGKFPSNSDSVNLALSKRADVEPSTTDRFRAE-------DIVLNPI--GTYETARGFFIIDAMARGKSRLEEIV 294 (771) T ss_pred EeeeccCCCCcCCceeeccccchhhccceeeecccchh-------hhhhccc--CcccccCcceeeehhhhcccccceee Confidence 000000000000 00000 0000000000000000 0000000 00000000000000 0000011111 Q ss_pred cCCCCcce-eEEee-ecccccc-CceEEEEecCC-------ceeeEEecCce---eEEEEeecccCCcceeeeeeeeece Q lcl|NC_010324. 579 NDGNPSVK-KFNWK-CERVRSF-NNRLFALNMRE-------ANASGVTTNYP---LRLRWSNFANENKAPTLWDDFAYDR 645 (1005) Q Consensus 579 ~~~~~t~~-tv~~~-~~~~~~~-~~~~~~~~~~~-------~t~t~~~~~~~---~~~~~~~~~~~~~~~~t~~~~~~~~ 645 (1005) ......-. ..... -...... .-.+++...+- .-+.....+.+ ..+-.+-.+..- ...... T Consensus 295 e~~gr~~s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~~~~iD~dkng~~~~~~ilfSqLv~s~-------~di~nC 367 (771) T protein:vir:95 295 KLKQRYPSLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFSGQIIDGDDQSPRLVSYILFSQLVDSP-------ADIVNC 367 (771) T ss_pred eccccchhhhccccccccccCCCCceeEEeeeeeEEEecceeEEeeccccCCceeeeEeeehhhcch-------hhcccc Confidence 00000000 00000 0000000 00011100000 00000000000 000000000000 000000 Q ss_pred eecccccceeeecccccccccccceeeEecccceeeccccceeEEEEcCCceEEEEEE---CCCCceeeeeecCCcceee Q lcl|NC_010324. 646 VVSSDLASNIVGQTQALENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPT---NNTYQPLMFKKLFNDSGIL 722 (1005) Q Consensus 646 ~~~~~~t~~~~~~~~~~~t~~~~~~~~t~t~~~iv~g~~~g~~tii~t~~~~~~~t~t---ggt~~~~~~~~~~~~~~~~ 722 (1005) -...+++.. .. ...-.+.|...--.-...++.-...+...++|-++++....+. |-+..-|...|+.. +||+ T Consensus 368 yQd~DPTse---e~-~dLidTDGg~iri~gah~ii~Lv~f~~sLlvfc~NGVWAi~ggsd~g~tAtdY~ltKIs~-vg~s 442 (771) T protein:vir:95 368 YQDGDPTST---EE-PELVDTDGGFIRIEGAHDIINLVNVGSAVMVVAANGIWMIQGGSDYGFTATNYLVTKISE-HGCS 442 (771) T ss_pred cccCCCchh---hh-hhhhhcCCCEEEecCCCCceeEEEecceEEEEEecceEEEEeccCCceeeeeeEEEEeee-eccC Confidence 001111111 11 1111222222222334556677778888999999998877443 22334567788887 9999 Q ss_pred cCceEEEECCEEEEEeCCCEEEE-----CCCccccccchhHHHHHHhhcCccccceEEEEEcCCCCEEEEEEecCCCCcC Q lcl|NC_010324. 723 APECVVEVEGSHFVVTQNDVILH-----NGATKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVLYVGPGEPKE 797 (1005) Q Consensus 723 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 797 (1005) +|++|+-+++..||.++++||.. +-.+-| +..|+=-..+++.|+++......+++|...++|||.|| +..+. T Consensus 443 spnSvVvvg~~i~ywsdtgIyal~~Ndfn~~tAq-nLTekTIq~~~~~I~~dk~knVtg~fd~~e~rvyw~yP--n~~D~ 519 (771) T protein:vir:95 443 SPNSVVVVDNSFMYWGDDGIYHLTRNQYGDYVAN-NLTEKTIQKYYEKIPSDAILNATGFYDSYDKKVKWLYN--TVLDG 519 (771) T ss_pred CCccEEEecceEEEeeCCceEEEeecccCcchhh-ccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEec--ceecC Confidence 99999999999999999999932 333344 66666667889999999999999999999999999999 44443 Q ss_pred CCCCCeEEEEEeccCe---eeEec-----ccce------eeeeecc---------cccccCC-c-eecccccccCCCcch Q lcl|NC_010324. 798 SFACTKAAVWNYEFDT---WSFRT-----IPYA------QCIGLVD---------PPVLERG-P-IWSDFQEITWDDPSI 852 (1005) Q Consensus 798 ~~~~~~~~~~~~~~~~---~~~~~-----~~~~------~~~~~~~---------~~~~~~~-~-~~~~~~~~~~~~~~~ 852 (1005) ++..-.-||+|.+++. |-+-+ ||+. ...++.+ .+++.-| + +=-.+-.-+.-++.. T Consensus 520 ~~e~~t~LV~dLalgaFYp~~i~~~~ag~l~~~vg~~~~p~~~lv~T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~ 599 (771) T protein:vir:95 520 RTEPVTELVFDLALGAFYPSKIGSLTAGRLPIPVGSVKIPPYKLVETGEEVTVASEQVTATGELVTVKVSTRSPVIRETK 599 (771) T ss_pred CCcceeeeeeeecccccccccccccccCccceeeeeeecCccccccccceEEecceeeEecCCceEEEEEEeeccccceE Confidence 4544455999998885 53333 2211 0011110 1111111 0 000000000111110 Q ss_pred hhccccccccCccceeEEEecCCCeeEEEe-------ccc---eeeecccceeEEEecce-----eeeecCccccccccC Q lcl|NC_010324. 853 KELVWRKDATNFRQRVTIVGSFLKGFYQVD-------VGA---LDYFYDRLNDVVIEKPL-----EMRLERTGIDFDNVT 917 (1005) Q Consensus 853 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~---~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~ 917 (1005) -.. . .+...+. +|-|+.+++--|+== +.+ +++.||.|.+-...+.+ +++.-.+|+=.|-.+ T Consensus 600 y~~-~-~~dg~~g--~~~Fa~~~~~~f~DW~sv~~~~vdy~sy~~~gY~~~gd~~~~k~~PYit~y~~~tedg~v~~~~g 675 (771) T protein:vir:95 600 YII-V-EKLSSPM--RISFGGYTDEEFVDWKSVDGIGVDAPAYLLTGYLAGGDYQREKFVPYITFHFKKTEDGFVEDAEG 675 (771) T ss_pred EEE-E-EecCCCe--eEEeccccCcceeecccCCCcccchHHHHHhhhhccchheeeeccceEEEEEEeecccceecccc Confidence 000 0 0111111 233344333322211 111 12333333322111110 122222232222111 Q ss_pred --------Cccceeee-eeeeeEEecCce----eEEEEeeecCCCCCceECCCeEEecCCCeeEeeecCCceEEEEEEEc Q lcl|NC_010324. 918 --------NEWNQKHI-NRFRPQTTGSGT----YIFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDN 984 (1005) Q Consensus 918 --------~~~~~~~~-~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 984 (1005) .-.|...| -.+-|+.-+=|+ |.|. -..+|. .+- +..++++.-+|-|.-.|-.||=+++||-.. T Consensus 676 ~~~p~n~sSclm~~sw~ws~s~~t~k~~~~~eaYk~~--~~~~p~-~~~-~~~yp~~~VV~TKsriRG~Gr~~~~rf~s~ 751 (771) T protein:vir:95 676 DWTPTNQSSCMVQSQWSWTNSPASNKWGRTWQAYRFR--RHFFPD-NID-NQFDDGNSVVETKSRLRGSGKVLSLYITTE 751 (771) T ss_pred cccccCCcceEEEEEeeeecCCCCCccccchheeeec--ceeccC-Ccc-hhcCCccceeeeeheeeecceEEEEEEEec Confidence 11111111 011111111000 1111 111111 111 123333333455656678888899998555 Q ss_pred cCCCcEEEEeEEEEeeccCCC Q lcl|NC_010324. 985 DVNSNAAINGLTIEFAVGGRR 1005 (1005) Q Consensus 985 ~~~~~~~~~~~~~~~~~~~~~ 1005 (1005) + +|+.+|-||.+-.++.|-- T Consensus 752 ~-gKdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 752 P-KKNLHIYGWSMLVDVNGTV 771 (771) T ss_pred C-CcceEEEeEEEEEeecCcC Confidence 5 9999999999999999988 No 18 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=26.79 E-value=1.4 Score=19.85 Aligned_cols=115 Identities=11% Similarity=-0.023 Sum_probs=39.7 Q ss_pred eeecccccccCCceecccccccCCCcchhhccccccccCccceeEEEecCCCeeEEEeccceeeecccceeEEEecceee Q lcl|NC_010324. 825 IGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLKGFYQVDVGALDYFYDRLNDVVIEKPLEM 904 (1005) Q Consensus 825 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 904 (1005) |+++. ++..++...-+ -|+..+.+.+-.. ++ .=+.++|.++ T Consensus 1 M~~v~-------------------------~si~nl~~GvS--------qQp~~~r~pgQ~~----~q--~N~~~d~v~G 41 (976) T protein:vir:10 1 MASVT-------------------------QTIPTLTGGLS--------QQPDELKIPGQVS----VA--NNVIPDVTHG 41 (976) T ss_pred Cccee-------------------------ecchhhhCcce--------ecchhhcCCchhh----hh--hccccccccc Confidence 11111 11112222211 3444444443332 12 3356788889 Q ss_pred eecCccccccccCCccceeeeeeeeeEEecCceeEEEEeeecCCCCCceECCCeEEecCCC--eeEeeecCCceEEEEEE Q lcl|NC_010324. 905 RLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHPHTSKTYTIGVD--RHVSVRLNHPYLFYNVI 982 (1005) Q Consensus 905 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 982 (1005) +.+|+|..| |.++-..... ...+....+|+ .|.-+.+ |++....+| -+|++ T Consensus 42 l~kRp~~~~-----------v~~l~~~~~~----------~~~~~~~~~~~---~~~r~~~e~y~~~~~~~g---~~~v~ 94 (976) T protein:vir:10 42 LLKRPGGKL-----------VASISDNGTA----------ALNSQTNGKWF---SYYRDETESYIGQVSRSG---DINMW 94 (976) T ss_pred cccCCccee-----------eeeecCCCcc----------cccccccceEE---EEEcCCCcEEEEEEecCC---ceEEE Confidence 999999443 2222111100 00111122331 1111111 222222222 25555 Q ss_pred EccCCCcEEEEeE------EEEee-ccCCC Q lcl|NC_010324. 983 DNDVNSNAAINGL------TIEFA-VGGRR 1005 (1005) Q Consensus 983 ~~~~~~~~~~~~~------~~~~~-~~~~~ 1005 (1005) +..+|..+-++.- ...|. ..++| T Consensus 95 ~~~~G~~~~v~~~~~~~~~~~~yl~~~~~~ 124 (976) T protein:vir:10 95 RCSDGQAMTVNYDSGTATALTTYLTHTNDE 124 (976) T ss_pred EccCCeEEEEEcCCCcccccchhhccCCcc Confidence 5554554444322 11222 22322 Done!