Query lcl|NC_021073.1_cdsid_YP_007878104.1 [gene=VPDG_00132] [protein=hypothetical protein] [protein_id=YP_007878104.1] [location=complement(85203..85574)] Match_columns 123 No_of_seqs 15 out of 17 Neff 2.8 Searched_HMMs 1612 Date Thu Nov 7 16:21:12 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_131 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_131_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105055 Length: 129 100.0 1.3E-62 7.9E-66 359.9 9.5 118 1-123 1-124 (129) 2 protein:vir:5744 Length: 140 # 100.0 1.4E-62 8.9E-66 359.6 9.0 118 1-123 15-138 (140) 3 protein:vir:81066 Length: 118 98.5 1.6E-09 9.9E-13 68.8 9.2 110 1-123 2-117 (118) 4 protein:vir:10368 Length: 118 98.5 1.1E-09 6.9E-13 69.7 8.3 110 1-123 2-117 (118) 5 protein:vir:93602 Length: 114 98.5 1.8E-09 1.1E-12 68.5 9.4 107 1-123 1-114 (114) 6 protein:vir:97070 Length: 118 98.5 1.5E-09 9.3E-13 69.0 8.8 110 1-123 2-115 (118) 7 protein:vir:195 Length: 115 # 98.4 6E-09 3.7E-12 65.7 9.6 107 1-123 1-115 (115) 8 protein:vir:100242 Length: 114 98.2 1.2E-08 7.5E-12 64.0 7.1 109 1-123 1-114 (114) 9 protein:vir:1438 Length: 115 # 98.0 5.1E-08 3.2E-11 60.5 6.9 111 1-123 1-115 (115) 10 protein:vir:4348 Length: 121 # 97.9 1.8E-07 1.1E-10 57.6 8.5 110 1-123 1-119 (121) 11 protein:vir:100116 Length: 115 97.8 9.4E-08 5.8E-11 59.1 6.4 113 1-123 1-115 (115) 12 protein:vir:1892 Length: 121 # 97.7 5.9E-07 3.7E-10 54.7 8.9 110 1-123 1-121 (121) 13 protein:vir:80371 Length: 115 96.4 2.1E-05 1.3E-08 46.3 6.2 111 1-123 1-115 (115) 14 protein:vir:1643 Length: 111 # 81.6 0.056 3.5E-05 27.4 7.9 108 1-122 1-111 (111) 15 protein:vir:94768 Length: 111 80.7 0.063 3.9E-05 27.1 7.8 105 1-122 1-111 (111) 16 protein:vir:9579 Length: 111 # 76.2 0.11 6.9E-05 25.8 7.7 107 1-122 1-111 (111) 17 protein:vir:80105 Length: 162 63.4 0.32 0.0002 23.3 8.5 119 1-123 13-147 (162) 18 protein:vir:1387 Length: 116 # 51.0 0.59 0.00037 21.8 7.4 110 1-123 1-115 (116) 19 protein:vir:9764 Length: 111 # 46.2 0.74 0.00046 21.3 8.2 109 1-122 1-111 (111) 20 protein:vir:96764 Length: 177 42.0 0.65 0.0004 21.6 5.3 119 1-123 13-139 (177) 21 protein:vir:10327 Length: 182 34.0 1.3 0.00082 19.9 5.8 121 1-123 12-138 (182) 22 protein:vir:103278 Length: 169 28.4 1.8 0.0011 19.2 6.6 109 1-123 41-168 (169) 23 protein:vir:1274 Length: 162 # 23.3 2.3 0.0014 18.6 7.1 110 1-123 44-161 (162) 24 protein:vir:6215 Length: 109 # 21.4 2.6 0.0016 18.3 7.8 105 1-123 2-108 (109) No 1 >protein:vir:105055 Length: 129 # NCBI annotation: Gp10 # Family: family:all:11393 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006590;genbank:gi:46402096;genbank:GeneID:2777921 Probab=100.00 E-value=1.3e-62 Score=359.91 Aligned_cols=118 Identities=17% Similarity=0.327 Sum_probs=112.2 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEE-ecCchHHHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIV-TPDSELVITKKNELIEK 79 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~-~~Dy~~~l~~d~~i~~~ 79 (123) |||.+||+||||||||+||||+|| |+++||||||||||||+ ||||+||+||+|||||+|+ ++||++++++|++||++ T Consensus 1 MIE~~ik~~LerlT~l~vYPLlLP-dt~~eGvtyQRISDpk~-~sGl~~T~Lv~~RfQI~~~~~dDY~~ll~ld~~i~~~ 78 (129) T protein:vir:10 1 MIELAIKNELERITGMDAYPLLLP-DTVQEGVTFQRISDPEM-YSGTLRTGIVSARIQVNLYRVDDYTSLLQLDKKIWSE 78 (129) T ss_pred CccHHHHHHHHHhhcCcccceecC-CchhcCeeeeeccCccc-cchhhhheeeeeEEEEEEEEecCchHHHHHHHHHHHH Confidence 999999999999999999999999 79999999999999999 7999999999999999999 68999999999999999 Q ss_pred hccce-eeecccccccceeeeecCcccchhhccc----eeeEEeecccC Q lcl|NC_021073. 80 YEGFS-GTMGETDIFISRIESSVPTFDNAQQNFE----YNITIKFTENI 123 (123) Q Consensus 80 we~i~-G~ig~~pV~~~Q~v~rg~~~q~~~~lt~----yri~~~fte~~ 123 (123) ||+|+ |+||+||| |||+||+++|||+++|+ |||+|||-=+- T Consensus 79 We~i~HG~Ig~yPV---Q~V~RG~~~Q~~~tltnn~~~yr~~RDfII~y 124 (129) T protein:vir:10 79 WKSIVHGQLDGVPV---QYVERGGIQQDKTTLTNRSIQYRLIRDFIIHY 124 (129) T ss_pred hhhhcccccCCeee---eeeeeccccccceeccCCcEEEEEEeeEEEEe Confidence 99995 99999999 99999999999999986 99997764444 No 2 >protein:vir:5744 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:11393 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892055;genbank:gi:33770518;uniprot:Q7Y405;genbank:GeneID:2637455 Probab=100.00 E-value=1.4e-62 Score=359.63 Aligned_cols=118 Identities=19% Similarity=0.327 Sum_probs=111.3 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEE-ecCchHHHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIV-TPDSELVITKKNELIEK 79 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~-~~Dy~~~l~~d~~i~~~ 79 (123) |||.+||+||||+|||+||||+|| |+++||||||||||||+ ||||+||+||+|||||+|+ ++||++++++|++||++ T Consensus 15 MIE~~ik~~LerlT~l~vYPLlLP-dt~~EGVtyQRISDPk~-~sGl~~T~LV~~RfQI~~~~~dDY~~ll~ld~~i~~~ 92 (140) T protein:vir:57 15 MIEQSLKSALERITGMNVYPLLLP-DTELEGVTFQRISDPEI-ETGLVRTNLIDCRFQITIHLIDDYTRLVVLDAAIWAE 92 (140) T ss_pred HhhHHHHHHHHHhhcCcccceecC-ChhhcCeeeeeccCccc-hhhhhhhheeeeEEEEEEEEecCchHHHHHHHHHHHH Confidence 999999999999999999999999 79999999999999999 7999999999999999998 58999999999999999 Q ss_pred hccce-eeecccccccceeeeecCcccchhhccc----eeeEEeecccC Q lcl|NC_021073. 80 YEGFS-GTMGETDIFISRIESSVPTFDNAQQNFE----YNITIKFTENI 123 (123) Q Consensus 80 we~i~-G~ig~~pV~~~Q~v~rg~~~q~~~~lt~----yri~~~fte~~ 123 (123) ||+|+ |+||+||| |||+||+++|||+++|+ |||+|||-=+- T Consensus 93 We~i~HG~Ig~yPV---Q~V~RG~~~Q~~~tltnn~~~Yrl~RDFII~y 138 (140) T protein:vir:57 93 WKKVVHGYIDGYPV---QYVRRGGVQQGVTTLTNNSKHFWFSRDFILSF 138 (140) T ss_pred HhhhcccccCCeee---eeeeeccccccceeccCCcEEEEEEeeEEEEe Confidence 99995 99999999 99999999999999986 99986653333 No 3 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=98.50 E-value=1.6e-09 Score=68.81 Aligned_cols=110 Identities=11% Similarity=0.076 Sum_probs=79.2 Q ss_pred CcchHHHHHHHHHhcCccceeecCcccc-ccceeeEeecCCccccccccc-c-ccccceEEEEEEecCchHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAA-YPAIVYKEISNGRNDDSNLDS-S-NLRNKYYEIVIVTPDSELVITKKNELI 77 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~-~egvTyQrISDP~~~dsGl~r-T-~Lv~aRfQI~i~~~Dy~~~l~~d~~i~ 77 (123) ++|.+|++.|..+.+.++||...|++.. .|-|+||+||+... + .|.- + +..+.|+||.++-.+|..+.++-+++. T Consensus 2 s~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~-~-~l~G~~~~~~~~rvQIdvyA~t~~~A~~l~~av~ 79 (118) T protein:vir:81 2 SYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPV-Y-WQEGGMPEKVNARVQIQIWSRSKQEAYLATVQVL 79 (118) T ss_pred chHHHHHHHHHhhcCCccccccCCCCCccCceEEEEecCCccc-c-cccCCCCCccceeEEEEEeeCCHHHHHHHHHHHH Confidence 5699999999999999999999999876 49999999999644 2 4533 2 467899999999999888888877776 Q ss_pred HHhccceeeecccccccceeeeecCcccchhhccc-eeeEEeec--ccC Q lcl|NC_021073. 78 EKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFE-YNITIKFT--ENI 123 (123) Q Consensus 78 ~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~-yri~~~ft--e~~ 123 (123) .+=++ .+. . ...|+-.++-+.-|+ ||.+.||. .+. T Consensus 80 ~al~~-------~~~---~-~~~~~~~d~ye~dt~l~r~~~Df~iw~~~ 117 (118) T protein:vir:81 80 RLVSE-------APD---M-QVLSQPIDDYVREIKLYGSRVDVSMWYPI 117 (118) T ss_pred HHhhh-------ccc---e-eeccCCccccccccCceeEEEEEEEEecC Confidence 55432 222 1 122333333333343 99998885 223 No 4 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=98.50 E-value=1.1e-09 Score=69.68 Aligned_cols=110 Identities=11% Similarity=0.087 Sum_probs=77.3 Q ss_pred CcchHHHHHHHHHhcCccceeecCcccc-ccceeeEeecCCccccccccc-c-ccccceEEEEEEecCchHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAA-YPAIVYKEISNGRNDDSNLDS-S-NLRNKYYEIVIVTPDSELVITKKNELI 77 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~-~egvTyQrISDP~~~dsGl~r-T-~Lv~aRfQI~i~~~Dy~~~l~~d~~i~ 77 (123) ++|.+|++.|..+.+.++||...|++.. .|-|+||+||+... + -|.- + ...+.|+||.++-.+|..+.++-+++. T Consensus 2 s~e~~l~a~L~~~~~~RVyp~~aP~~~~~~Pyiv~q~vsg~p~-~-~l~G~~~~~~~~rvQIdvyA~t~~~A~~l~~av~ 79 (118) T protein:vir:10 2 SYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPV-Y-WQEGGMPEKVNARVQIQIWSRSKQEAYLATVQVL 79 (118) T ss_pred chHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCccc-c-cccCCCCccceeEEEEEEeeCCHHHHHHHHHHHH Confidence 5599999999999999999999999876 49999999999654 2 3543 3 377899999999999999888877775 Q ss_pred HHhccceeeecccccccceeeeecCcccchhhcc-ceeeEEeec--ccC Q lcl|NC_021073. 78 EKYEGFSGTMGETDIFISRIESSVPTFDNAQQNF-EYNITIKFT--ENI 123 (123) Q Consensus 78 ~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt-~yri~~~ft--e~~ 123 (123) ..=+ +.....|+ |.-.++-+.-| -||.+.||. .+. T Consensus 80 ~al~---~~~~~~~~--------~~~~d~ye~dt~l~r~~~Df~vw~~~ 117 (118) T protein:vir:10 80 RLVS---EANDMQVL--------SQPIDDYVREIKLYGSRVDISMWYNL 117 (118) T ss_pred HHhh---hcccceec--------cCCCccccccCCceEEEEEEEEeeec Confidence 4432 22222222 11123333333 388887775 122 No 5 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=98.49 E-value=1.8e-09 Score=68.52 Aligned_cols=107 Identities=19% Similarity=0.181 Sum_probs=73.7 Q ss_pred CcchHHHHHHHHHhcCccceeecCcc-----ccccceeeEeecC-CccccccccccccccceEEEEEEecCchHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQN-----AAYPAIVYKEISN-GRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKN 74 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d-----~~~egvTyQrISD-P~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~ 74 (123) |+|.+|++.|-.+.+.++||-+.|+. +..|-||||++|+ |.+.-.| -.-.+.|+||.++-..+..+-++-+ T Consensus 1 M~e~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~~~l~g---p~~~~~~vQIDvyA~t~~~A~~l~~ 77 (114) T protein:vir:93 1 MTEADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVMGG---QAESSVSVQIDVYAGTVTQARQIRQ 77 (114) T ss_pred CchHHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCcccccccC---ccccceEEEEEeeeCCHHHHHHHHH Confidence 99999999999999999999999963 4568899999996 6665455 3347899999999877777777666 Q ss_pred HHHHHhccceeeecccccccceeeeecCcccchhhcc-ceeeEEeecccC Q lcl|NC_021073. 75 ELIEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNF-EYNITIKFTENI 123 (123) Q Consensus 75 ~i~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt-~yri~~~fte~~ 123 (123) ++..+=+. .-|+ .+.++.-+ +.-| -||.+.||-=-+ T Consensus 78 ~v~~Al~~------~~~~----~~~~~~~y---e~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 78 DAREAIML------LAPG----SVSEMQDY---IPENRCYRATLEFQVTV 114 (114) T ss_pred HHHHHHhh------cCcE----eecCCCcc---cccccceeeEEEEEEeC Confidence 66554431 1122 11222222 2223 377776664444 No 6 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=98.49 E-value=1.5e-09 Score=68.96 Aligned_cols=110 Identities=9% Similarity=0.056 Sum_probs=78.3 Q ss_pred CcchHHHHHHHHHhcCccceeecCcccc-ccceeeEeecCCccccccccc-c-ccccceEEEEEEecCchHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAA-YPAIVYKEISNGRNDDSNLDS-S-NLRNKYYEIVIVTPDSELVITKKNELI 77 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~-~egvTyQrISDP~~~dsGl~r-T-~Lv~aRfQI~i~~~Dy~~~l~~d~~i~ 77 (123) ++|.+|++.|-.+.+.++||-..|++.. .|-|+||+||+... + -|.- + ...+.|+||.++-..|..+.++-+++. T Consensus 2 ~~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~-~-~ldG~~~~~~~~rvQIdvyA~t~~~A~~l~~av~ 79 (118) T protein:vir:97 2 SYGRMLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPV-Y-WKEGGMPDKVNARVQVQIWSRSKQEAYLATVQVL 79 (118) T ss_pred chHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCccc-c-cccCCCCCccceeEEEEEeeCCHHHHHHHHHHHH Confidence 8999999999999999999999999876 49999999999655 2 4544 3 488899999999999998888877775 Q ss_pred HHhccceeeecccccccceeeeecCcccchhhccc-eeeEEeecccC Q lcl|NC_021073. 78 EKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFE-YNITIKFTENI 123 (123) Q Consensus 78 ~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~-yri~~~fte~~ 123 (123) ..=++ ++. +.+ +|.-.++-+.-|+ ||.+.||.=-- T Consensus 80 ~al~~-------~~~---~~~-~~~~~~~ye~dt~lyr~~~Df~iw~ 115 (118) T protein:vir:97 80 RIVSE-------AND---MQV-LSQPIDDYVRELKLYGSRVDISMWY 115 (118) T ss_pred HHhhc-------ccc---ccc-ccCCcccccccCCceEEEEEEEEEe Confidence 44222 222 111 1222232333333 88887774222 No 7 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=98.38 E-value=6e-09 Score=65.67 Aligned_cols=107 Identities=13% Similarity=0.169 Sum_probs=74.2 Q ss_pred CcchHHHHHHHHHhcCccceeecCccc------cccceeeEeecC-CccccccccccccccceEEEEEEecCchHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNA------AYPAIVYKEISN-GRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKK 73 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~------~~egvTyQrISD-P~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d 73 (123) |+|.+|++.|--+.+..+||.+.|++. ..+-|+||+||+ |.+.-.| ...-+.|+||.++-..+..+-++- T Consensus 1 M~e~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~~~L~G---~~~~~~~vQIDvyA~t~~~A~~l~ 77 (115) T protein:vir:19 1 MNEDNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSADVLCG---QAESRVSVQVDVYSTSIAESRSLR 77 (115) T ss_pred CchhHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCcccccccC---CCccceEEEEEEeeCChHHHHHHH Confidence 999999999999999999999999854 568899999997 6555455 345789999999977777777766 Q ss_pred HHHHHHhccceeeecccccccceeeeecCcccchhhcc-ceeeEEeecccC Q lcl|NC_021073. 74 NELIEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNF-EYNITIKFTENI 123 (123) Q Consensus 74 ~~i~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt-~yri~~~fte~~ 123 (123) +++..+=+.. -|+ ...+. ++-+.-| -||.+.||-=.- T Consensus 78 ~~i~~Al~~~------~p~------~~~~~-~~ye~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 78 DLVLASLEPL------TPT------EVVKI-PGYEPDYRLYRATLDFKVTP 115 (115) T ss_pred HHHHHHhhhc------CCE------EecCC-CCcccchhceeeEEEEEecC Confidence 6666655422 133 22222 2222223 377776652111 No 8 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=98.17 E-value=1.2e-08 Score=64.01 Aligned_cols=109 Identities=11% Similarity=0.154 Sum_probs=84.0 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHH-HHHHHH- Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITK-KNELIE- 78 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~-d~~i~~- 78 (123) |=+..|.++|..+.|-++||=.=|+.+..|-+||||||.-+. ++=-..++..++||||-++-.-|+-+.++ ++.+-. T Consensus 1 ~~~~~i~~~l~~~~g~~~~~~~aP~~~~~Py~vy~rvsg~p~-~tL~G~~g~~~~r~QiD~yA~T~~eA~~La~~~~~~l 79 (114) T protein:vir:10 1 MSALTIRDAIGIVGGAKGYVSVASSAAQSPYYVVSRVSGTRD-MALGGATGGKSGMFQIDVYAKTYTEADSLADQIIDRV 79 (114) T ss_pred CceeeeehhhcccccccccCCCCCCCCCCceEEEEeccCccc-ccccCCCCcceEEEEEEeeeCCHHHHHHHHHHHHhhc Confidence 999999999999999999999999999999999999999886 57778899999999999999889888888 344322 Q ss_pred -Hhccce-eeecccccccceeeeecCcccchhhcc-ceeeEEeecccC Q lcl|NC_021073. 79 -KYEGFS-GTMGETDIFISRIESSVPTFDNAQQNF-EYNITIKFTENI 123 (123) Q Consensus 79 -~we~i~-G~ig~~pV~~~Q~v~rg~~~q~~~~lt-~yri~~~fte~~ 123 (123) .|+.++ |.+.+-|= .-++-| -||++.||.=+. T Consensus 80 ~~~~~f~~~~l~~~~d-------------~ye~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 80 ESTGMFSVGGVSDLPD-------------DYSSDTGVFRVSLEISVQF 114 (114) T ss_pred ccccCeeeeccccCCC-------------CCCcccCceEEEEEEEEeC Confidence 233344 33332221 112223 389987777666 No 9 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=97.96 E-value=5.1e-08 Score=60.54 Aligned_cols=111 Identities=14% Similarity=0.125 Sum_probs=80.3 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHHHHHHHHHh Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKNELIEKY 80 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~~i~~~w 80 (123) |-=+-||++|..+.+-.+||-..|+++..|-||||+||+... ++=-.+++..+.|+||.++-..|..+-++-+++...= T Consensus 1 ~~~~~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~-~~L~G~~~~~~~~vQIDvyA~t~~~A~~l~~~v~~~~ 79 (115) T protein:vir:14 1 MSVIVIRDALQGIGGAKGYLGVAPAKAPAPYFVVTRVHGALD-MALAGLTGGRSGSYQIDCYAPTFTDADRLADLAVDRA 79 (115) T ss_pred CeeEeeehhhccccccccccccCCCCCCCCEEEEEeecCccc-ccccCCCCCcceEEEEEEeeCCHHHHHHHHHHHHHHH Confidence 888889999999999999999999999999999999999866 3222369999999999999888888888777775543 Q ss_pred ccceeeecccccccceeeeecCccc---chhhcc-ceeeEEeecccC Q lcl|NC_021073. 81 EGFSGTMGETDIFISRIESSVPTFD---NAQQNF-EYNITIKFTENI 123 (123) Q Consensus 81 e~i~G~ig~~pV~~~Q~v~rg~~~q---~~~~lt-~yri~~~fte~~ 123 (123) ++.. .+..+ |++.+ +-+.-| -||++.||.=-. T Consensus 80 ~~~~-------~~~~~----~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 80 MSVQ-------DRFSV----GGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred hcCc-------cceee----eeecCCCCCCcccccceeeEEEEEEeC Confidence 3322 11111 11221 122223 399988776222 No 10 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=97.88 E-value=1.8e-07 Score=57.62 Aligned_cols=110 Identities=12% Similarity=0.124 Sum_probs=74.2 Q ss_pred CcchHHH-----HHHHHHhcC---cccee-ecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHH Q lcl|NC_021073. 1 MIEIDLK-----NDLESSMGM---NAYPI-KIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVIT 71 (123) Q Consensus 1 MIE~~iK-----~~LErltgl---~~YPL-llP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~ 71 (123) |.|.=-+ .+|.+|+|- ++||+ .-|+++..|-||||+||+... +.=-.+.+..++|+||.++-.+|..+-+ T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~-~~l~g~~~~~~~~vQIDvyA~t~~~A~~ 79 (121) T protein:vir:43 1 MYPPIFKVCSSSPAVTAILGASPLRMYQFGLAPQLVVKPYATWQTISGSPE-NYLWGRPDADGFTIQVDIFSATAAEARD 79 (121) T ss_pred CChHHHHHHhhChhhhhhhcCCCceeeccCCCCCCCcCCeEEEEEecCccc-ceecCCCCcceeEEEEEeeeCCHHHHHH Confidence 7664322 456668875 59997 669999999999999997654 2322367889999999999999999988 Q ss_pred HHHHHHHHhccceeeecccccccceeeeecCcccchhhccceeeEEeecccC Q lcl|NC_021073. 72 KKNELIEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTENI 123 (123) Q Consensus 72 ~d~~i~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~~ 123 (123) +-+++..+=++-.+ .+ ...+..++..+. -||++.|+.--+ T Consensus 80 l~~av~~Al~~~~~-----~~-----~~~~~~ye~dT~--lyR~s~Dv~w~~ 119 (121) T protein:vir:43 80 AAKAIRDAIELSAY-----VV-----RWGGESVDPDTK--TYRVSFDVDWIV 119 (121) T ss_pred HHHHHHHHhhhcCC-----cc-----cCCCCCCccccc--ceeeeeEEEEee Confidence 88888876654222 11 112233332222 288886655333 No 11 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=97.84 E-value=9.4e-08 Score=59.11 Aligned_cols=113 Identities=15% Similarity=0.167 Sum_probs=80.0 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCcccccccc-ccccccceEEEEEEecCchHHHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLD-SSNLRNKYYEIVIVTPDSELVITKKNELIEK 79 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~-rT~Lv~aRfQI~i~~~Dy~~~l~~d~~i~~~ 79 (123) |-=+=||++|..+.|-.+||-..|+++..|=+|||+||+... + -|. +++..+.|+||.++-..|..+-++-+++... T Consensus 1 ~~~~~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~-~-~L~G~~~~~~~~vQIDvyA~t~~~A~~l~~~v~~~ 78 (115) T protein:vir:10 1 MSVIVIRDALQGIGGAKGYLGVAPEKAPAPYFVVTRVHGALD-M-ALAGLTGGRSGSYQIDCYAPTFTDADRLADLAVDR 78 (115) T ss_pred CeeEEeehhhcccCCceeecccCCCCCCCCEEEEEeecCccc-c-ccCCCCCCcceEEEEEEeeCCHHHHHHHHHHHHHH Confidence 888889999999999999999999999999999999999866 3 443 6999999999999988888888877777554 Q ss_pred hccceeeecccccccceeeeecCcccchhhcc-ceeeEEeecccC Q lcl|NC_021073. 80 YEGFSGTMGETDIFISRIESSVPTFDNAQQNF-EYNITIKFTENI 123 (123) Q Consensus 80 we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt-~yri~~~fte~~ 123 (123) =++.... ..++.-+ ...++-+.-| -||++.||.=-. T Consensus 79 ~~~~~~~-------~~~~~~~-~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 79 AMSVQDR-------FSVGGVD-ELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HhcCccc-------eeEeeec-CCCCCCcccccceeeEEEEEEeC Confidence 3333211 1111100 0011122223 399988776222 No 12 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=97.70 E-value=5.9e-07 Score=54.73 Aligned_cols=110 Identities=15% Similarity=0.212 Sum_probs=74.3 Q ss_pred CcchHHH-----HHHHHHhcC---cccee-ecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHH Q lcl|NC_021073. 1 MIEIDLK-----NDLESSMGM---NAYPI-KIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVIT 71 (123) Q Consensus 1 MIE~~iK-----~~LErltgl---~~YPL-llP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~ 71 (123) |+..=.+ .++.+|+|- ++||+ .-|+++..|-||||+||+-.. +.=-.+.+..++|+||.++-..|..+.+ T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~-~~l~G~~~~~~~~vQIDvyA~t~~~A~~ 79 (121) T protein:vir:18 1 MIAPIFSVCASSPEVTDLLGSNPVRIYPFGIQDDNVVYPYVVWQNITGSPE-NYIAQRPDADFFTLQVDAYADTVDEVIA 79 (121) T ss_pred CchHHHHHHhcChhhhhhhcCCCceeeeccCCCCcCcCCeEEEEEecCccc-ceecCCCCcceeEEEEEeecCCHHHHHH Confidence 6653222 567788875 69998 789999999999999997654 2223368899999999999999999888 Q ss_pred HHHHHHHHhccceeeecccccccceeeeecCcccchhhccceeeEEe--ecccC Q lcl|NC_021073. 72 KKNELIEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFEYNITIK--FTENI 123 (123) Q Consensus 72 ~d~~i~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~--fte~~ 123 (123) +-+++..+=|+ +++...++ +..++..+.+ ||++.| |--+= T Consensus 80 l~~avr~Ale~-~~~~~~~~---------~~~ye~dT~l--yR~s~Dv~~~~~r 121 (121) T protein:vir:18 80 VATALRDAIEP-HAHITRWG---------GQERDPETKR--YRYSFDVDWIVTR 121 (121) T ss_pred HHHHHHHHhhh-cCcccCCC---------CCCCcccccc--eeeeeEEEEeecC Confidence 88888876664 33332222 2333333333 666533 33333 No 13 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=96.44 E-value=2.1e-05 Score=46.27 Aligned_cols=111 Identities=11% Similarity=0.108 Sum_probs=79.8 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHHHHHHHHHh Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKNELIEKY 80 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~~i~~~w 80 (123) |--+-|.+||..+-|-++||-+-|+.+..+-+||||||..+- ++=..-++--.+||||-.+-+-|+.+-++-++....= T Consensus 1 ~~~~vir~al~~i~~~~~~~~vAp~~~~~pyivy~rvsga~e-~~L~G~ag~~~~~~QID~yA~T~~ea~~La~~v~d~~ 79 (115) T protein:vir:80 1 MSVIVVRDALQGIGGAKGYLGVAPEKAPARYFVVTRVHGALD-MALAGPTGGRSGSYQIDCYAPTFTDADRLADLAVDRA 79 (115) T ss_pred CeeeeeechhhhccccccceeeccccCcCCeEEEeecCCCcc-ccccCCCCCceeEEEEeeecCCHHHHHHHHHHHHHhh Confidence 888889999999999999999999999999999999998865 4555667788999999999888888877777666511 Q ss_pred ccceeeecccccccceeeeecCcccchhhc---c-ceeeEEeecccC Q lcl|NC_021073. 81 EGFSGTMGETDIFISRIESSVPTFDNAQQN---F-EYNITIKFTENI 123 (123) Q Consensus 81 e~i~G~ig~~pV~~~Q~v~rg~~~q~~~~l---t-~yri~~~fte~~ 123 (123) .|-=..++| |++.++.+.- | -||.+.+|.=.. T Consensus 80 ---~~~~~~~~v--------g~l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 80 ---MSVQDRFSV--------GGVDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred ---hCCccccce--------ecccCCCcccccccceEEEEEEEEEeC Confidence 122222334 4444432211 2 277765544333 No 14 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=81.62 E-value=0.056 Score=27.43 Aligned_cols=108 Identities=16% Similarity=0.297 Sum_probs=63.6 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEec-CchHHHHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTP-DSELVITKKNELIEK 79 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~-Dy~~~l~~d~~i~~~ 79 (123) |||.-|++=|.+-+|.++|= =.|++.--+.|+.+|+..++. | .+-.++|-+--+-+ .+. +.+|-.++-.. T Consensus 1 miE~~i~~~L~~~l~Vpv~~-e~p~~~P~~FV~vErtGG~~~-~------~~~~~~lAVq~w~~S~~e-Aa~La~~v~~~ 71 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFL-EKKGEMPLSYILFEKTGSSKS-N------HLLSSTFAFQSYAPSMYE-AAKLNEQLKEV 71 (111) T ss_pred ChHHhHHHHHhhcCCceeEe-ecCCCCCCceEEEEecCCccc-c------ccccceEEEEecchhHHH-HHHHHHHHHHH Confidence 99999999999999998873 448887778999999999976 3 23456777777744 332 22222333222 Q ss_pred hccce--eeecccccccceeeeecCcccchhhccceeeEEeeccc Q lcl|NC_021073. 80 YEGFS--GTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTEN 122 (123) Q Consensus 80 we~i~--G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~ 122 (123) =+.+. =.|+...+ .+.=++-+..+-...|.++.+.+.- T Consensus 72 l~~l~~~~~I~av~~-----~s~ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:16 72 VERLIELNEISNVSL-----NSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred Hhhccccccceeeec-----CCCCcCCCCCCCCceEEEEEEEeeC Confidence 22221 12222222 1111222233334567777665555 No 15 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=80.75 E-value=0.063 Score=27.15 Aligned_cols=105 Identities=14% Similarity=0.280 Sum_probs=64.4 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEec-CchH---HHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTP-DSEL---VITKKNEL 76 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~-Dy~~---~l~~d~~i 76 (123) |||.-+++=|.+-+|.++|= =.|++.--+.|+.+|....+. | . +-.++|-|--+-+ .++. +.+...++ T Consensus 1 miE~~v~~~L~~~l~vpv~~-e~p~~~p~~FV~vErtGG~~~-~-~-----~~~~~lAVQ~~~~S~~eAa~La~~v~~~~ 72 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFL-EKKGEMPLSYVLFEKTGSSKS-N-H-----LLSSTFAFQSYAPSMYEAAKLNEQLKEVV 72 (111) T ss_pred ChHHhHHHHHhhcCCcceEe-ecCCCCCCceEEEEecCCccc-c-c-----cccceEEEEecchhHHHHHHHHHHHHHHH Confidence 99999999999999998873 448877778999999999876 3 2 3456777777744 4432 23333333 Q ss_pred --HHHhccceeeecccccccceeeeecCcccchhhccceeeEEeeccc Q lcl|NC_021073. 77 --IEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTEN 122 (123) Q Consensus 77 --~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~ 122 (123) +.+|++|.+ | -..+.=++-+..+-...|.++.+.+.- T Consensus 73 ~~l~~~~~i~~------v---~~~s~Ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:94 73 ERLIELNEISN------V---SLNSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred hhcccccccce------e---ecCCCcccCCCcCCCceEEEEEEEeeC Confidence 133444442 1 111222333344445567777655555 No 16 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=76.23 E-value=0.11 Score=25.82 Aligned_cols=107 Identities=12% Similarity=0.231 Sum_probs=62.6 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEec-C---chHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTP-D---SELVITKKNEL 76 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~-D---y~~~l~~d~~i 76 (123) |||.-+++=|.+-+|.+++ +=.|.+.--+.||.+|+..++. | . +-.++|-+--+-+ . ...+.+...++ T Consensus 1 miE~~v~~~L~~~l~vpv~-~~vp~~~P~~FV~vErtGG~~~-~-~-----~~~p~laVq~wg~S~~~Aa~La~~v~~a~ 72 (111) T protein:vir:95 1 MIEIIINKYLDGHLDVPSF-FEHEAEAPDSFVIIQKTGGKER-N-H-----SGSATFAFQSYAPTMQKAAELNVKVKSAV 72 (111) T ss_pred ChHHhHHHHhhhhcCeeEE-eecCCCCCCceEEEEeeCCccc-c-c-----cccceEEEEeccccHHHHHHHHHHHHHHH Confidence 9999999999998887665 3456566668999999999977 3 2 3456777777743 3 33333333333 Q ss_pred HHHhccceeeecccccccceeeeecCcccchhhccceeeEEeeccc Q lcl|NC_021073. 77 IEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTEN 122 (123) Q Consensus 77 ~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~ 122 (123) ..|.+... |+..-+ .+.=.+-+..+-...|.++.+.+.- T Consensus 73 -~~l~~~~~-i~~v~~-----~s~ynf~d~~tk~~RYQ~~~~i~~~ 111 (111) T protein:vir:95 73 -KGLIELDS-ICGVHL-----NSDYNFTDTETKQYRYQAVFDINYF 111 (111) T ss_pred -hhhhcccc-cccccc-----CCccccCCCCCCCceEEEEEEEEeC Confidence 56633321 222111 1111222333344567776655554 No 17 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=63.41 E-value=0.32 Score=23.31 Aligned_cols=119 Identities=9% Similarity=0.046 Sum_probs=62.3 Q ss_pred CcchHHHHHHHHHhcCccce--eecCccccccceeeEeecC--CccccccccccccccceEEEEEEecCchHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYP--IKIPQNAAYPAIVYKEISN--GRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKNEL 76 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YP--LllP~d~~~egvTyQrISD--P~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~~i 76 (123) |+-.-|-.--+-.+|+.+.| -..|+ ..+|.+||+-+|. |... -+....-.....||..+..+-..++.+-+++ T Consensus 13 lv~~ii~~i~~~~~gl~vI~~~~~g~~-p~yPF~TY~v~~pyi~~~~--~~~~~e~~~~~isi~~~S~~~~eAl~la~~l 89 (162) T protein:vir:80 13 LVKTLINAVNELSGGLQLIESSSGGEQ-PEYPFCQYTITSPYIAISP--DIVEGEQFEIVISLTWRALSGHQALNLANIT 89 (162) T ss_pred HHHHHHHHHHhhhcceeEEEccCCCCC-CCCCeEEEEEecCccccCC--cccCCcceEEEEEEEEEeCCHHHHHHHHHHH Confidence 32221211122234777655 23454 5679999999875 3221 1223334557788888888999999999999 Q ss_pred HHHhccce----eeec-c-cccccceeeeecCcccchhhc----c--ceeeEEeecccC Q lcl|NC_021073. 77 IEKYEGFS----GTMG-E-TDIFISRIESSVPTFDNAQQN----F--EYNITIKFTENI 123 (123) Q Consensus 77 ~~~we~i~----G~ig-~-~pV~~~Q~v~rg~~~q~~~~l----t--~yri~~~fte~~ 123 (123) +...+..+ +... + .||=.+..-.|-. +++.+-- + ..|+.+.|-..+ T Consensus 90 ~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~-~~~~~yerR~GFD~~~Rv~r~~e~~~ 147 (162) T protein:vir:80 90 NKYFRSQKGRFFMQENGGIVVVSVQNSGLRDT-FISIEYERSAGIDLRLRVVDSYSSEI 147 (162) T ss_pred HHHhhcCCceeeeeecCcEEEEecCCCcccee-EeeeeeeeeecceEEEEEeecccccc Confidence 88776543 3332 2 2442222222211 1111110 1 144556665555 No 18 >protein:vir:1387 Length: 116 # NCBI annotation: Gp10 protein # Family: family:all:517 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612839;genbank:gi:20065973;genbank:GeneID:935788 Probab=51.02 E-value=0.59 Score=21.83 Aligned_cols=110 Identities=12% Similarity=0.091 Sum_probs=64.7 Q ss_pred CcchHHH----HHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHHHHHH Q lcl|NC_021073. 1 MIEIDLK----NDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKNEL 76 (123) Q Consensus 1 MIE~~iK----~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~~i 76 (123) |=|..|+ .+|+-+ |.|||...-..+..-+-|||..+.+-.- +-+-.+=....-++||.|...|...+.++.+++ T Consensus 1 ~~~m~I~~~i~~~Lk~i-~ipV~~~~y~~~~~~~~Itf~~y~e~~~-~yaDd~e~~t~~~iQVDI~sk~~~~~~~l~~~V 78 (116) T protein:vir:13 1 MEDFDIIALVYECLECL-NVPVIEGWYDEELNKTHITVHEYLEQDE-SFEDDEAREEEHNIQIDVWSKDSLEAFKLKKAI 78 (116) T ss_pred CCccchhHHHHHHHhhc-CCeeeecccCCCCccceEEEEeeecCCC-cccCCeeeeEEEEEEEEEeecCCccHHHHHHHH Confidence 5555554 555443 8889988877765557889888765433 235666677788999999997665565677777 Q ss_pred HHHhccce-eeecccccccceeeeecCcccchhhccceeeEEeecccC Q lcl|NC_021073. 77 IEKYEGFS-GTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTENI 123 (123) Q Consensus 77 ~~~we~i~-G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~~ 123 (123) -+.-+..- ....+.+ ++...+.+..+-+.-.|-+-| T Consensus 79 ~~lMk~~GF~r~~~~d-----------~ye~dt~iyhk~~RF~y~~el 115 (116) T protein:vir:13 79 KKLLKKNNFYFDSSED-----------FYETKTRIYHKGLRFSYISEI 115 (116) T ss_pred HHHHHHcCCEeeecCC-----------Cccchhhhhhhhhhheeeeec Confidence 76655432 2222222 233333333222222333333 No 19 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=46.17 E-value=0.74 Score=21.29 Aligned_cols=109 Identities=11% Similarity=0.223 Sum_probs=58.7 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHHHHHHHHHh Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKNELIEKY 80 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~~i~~~w 80 (123) |||+-|++-|..-+|.+++ +=.|++..-+.|+..|..+.+- | ++ -.+.|-|--+-+.--.+-+|-.++-..= T Consensus 1 mIE~~i~~yL~~~l~vpv~-~e~p~~~P~~FV~vEkTGG~~~-~-~~-----~~a~lAvQsyg~S~~~AA~La~~V~~a~ 72 (111) T protein:vir:97 1 MIEVIIKKYLDEHLDVPSF-FEHQKDEPARFIILEKTSGAKQ-N-HL-----LSSTFAFQSYAESLYEAALLNDKVKQVI 72 (111) T ss_pred ChhhhhhHHHhhhcCceEE-EeecCCCCCceEEEEeeCCccc-c-cc-----ccceEEEEecchhHHHHHHHHHHHHHHh Confidence 9999999999999999887 4456566668899999998755 3 32 2334444444332222223333333333 Q ss_pred ccce--eeecccccccceeeeecCcccchhhccceeeEEeeccc Q lcl|NC_021073. 81 EGFS--GTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTEN 122 (123) Q Consensus 81 e~i~--G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~ 122 (123) +.+. -.|++.-+ .+.=++-+..+-...|...-+.++- T Consensus 73 ~~l~~l~~i~~v~l-----ns~Ynf~d~~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 73 EQLDVLPQVSGVHL-----NADYNFTDTATKRYRYQAVFDINHY 111 (111) T ss_pred hhhccCccceeeee-----cccccCCCCCCCCccEEEEEEEeeC Confidence 3322 13333222 2222333333334456665444444 No 20 >protein:vir:96764 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1090 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039825;genbank:gi:126010857;genbank:GeneID:5076274 Probab=42.03 E-value=0.65 Score=21.62 Aligned_cols=119 Identities=9% Similarity=0.045 Sum_probs=58.7 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHHHH-HHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKKN-ELIEK 79 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d~-~i~~~ 79 (123) -|+..||+++-.+-=-..||-....-...++|-.-.-+.++-.+.|..|+. +.+||+..++++......+++- .+... T Consensus 13 AI~~~l~~~~P~l~tV~~y~~~~~~~~~tPAv~iel~~~~~~~d~g~G~~~-~~~r~~a~vvv~~~~~~~~l~a~~lAa~ 91 (177) T protein:vir:96 13 AIQAELESRLADEVTVASYADFGDVQVVDAMVLIEFEQTSPATRGHDGRYC-HQYDITLHAVVGRQRQRAELEAINLAAA 91 (177) T ss_pred HHHHHHHHhCccceeeccccccccccccCceeEEeeccCCcccCCCCCceE-EEEEEEEEEEeCCCCCChHHHHHHHHHH Confidence 344444444443444568986432112336665554444444468888988 6799999999864443333332 23333 Q ss_pred hcc-ceeeecccccccceeeeecCcccchhh-----ccceee-EEeecccC Q lcl|NC_021073. 80 YEG-FSGTMGETDIFISRIESSVPTFDNAQQ-----NFEYNI-TIKFTENI 123 (123) Q Consensus 80 we~-i~G~ig~~pV~~~Q~v~rg~~~q~~~~-----lt~yri-~~~fte~~ 123 (123) +.. ++|..=|.|. +.|+.+-..++..+ +..|.. +..|+-.| T Consensus 92 l~~~v~~~~wGLp~---~~v~~~~~i~a~pd~f~p~ldgy~vW~Vew~Q~i 139 (177) T protein:vir:96 92 IERVTDENLWGLPY---QQVDRPENIRSAPSMFKVGSDGYDAWGVSFRQRI 139 (177) T ss_pred HHHHHhcccccCCc---cccccceeeeccccccccccCceeEEEEEEEEEE Confidence 433 3444444443 32332222222211 223544 35555544 No 21 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=33.98 E-value=1.3 Score=19.92 Aligned_cols=121 Identities=9% Similarity=-0.029 Sum_probs=67.6 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCchHHHHHH-HHHHHH Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSELVITKK-NELIEK 79 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~~~l~~d-~~i~~~ 79 (123) -|+..||+++-.++=-..||..- +....++|-.-..+-++-.+.|..|+.++ .||+..+++...+...+++ ..+-+. T Consensus 12 AI~~~Lk~~~p~l~~~~~y~~~~-~~i~~PAv~vel~~~~~~~d~~tGq~~~~-~~~~a~~vv~~~~~~~~~~~~~lAa~ 89 (182) T protein:vir:10 12 AIKAKLRETFPKVTVDDYNPEPE-LSVLAPALLLELEEFPMGADVGDDRYPAA-CRFSVHCVLGWEVKSLALELWEFSAA 89 (182) T ss_pred HHHHHHHHhcCCceeeecCcccc-CccccceeeeeeecCCcCCCCCCCcEEEE-EEEEEEEEecccCCCchHHHHHHHHH Confidence 56667777777777778899753 44555766665555333225688888854 9999999986444433322 234444 Q ss_pred hcc-ceeeeccccc---ccceeeeecCcccchhhccceee-EEeecccC Q lcl|NC_021073. 80 YEG-FSGTMGETDI---FISRIESSVPTFDNAQQNFEYNI-TIKFTENI 123 (123) Q Consensus 80 we~-i~G~ig~~pV---~~~Q~v~rg~~~q~~~~lt~yri-~~~fte~~ 123 (123) +.. ++|+.=|.|. ---+.+...+..=++..+-.|.+ +..|+-.+ T Consensus 90 l~~~v~~~~wGL~~~~v~~a~~i~a~p~~f~~~~~dgy~vW~VeW~Q~i 138 (182) T protein:vir:10 90 VAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTL 138 (182) T ss_pred HHHHHhcCcccCCccccCccceeeeccCccChhhcCceEEEEEEEEEEE Confidence 433 4555545442 11134444444333433334555 45555544 No 22 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=28.41 E-value=1.8 Score=19.25 Aligned_cols=109 Identities=13% Similarity=0.149 Sum_probs=59.2 Q ss_pred CcchHHHHHHHHHhcCccceeecCccccccceeeEeecCCccc-----------cccccccc-cccceEEEEEEecC--- Q lcl|NC_021073. 1 MIEIDLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRND-----------DSNLDSSN-LRNKYYEIVIVTPD--- 65 (123) Q Consensus 1 MIE~~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~-----------dsGl~rT~-Lv~aRfQI~i~~~D--- 65 (123) -|-.++.++|++...- ++.-|| ...|+|.|.-..|+..+ ..||.+.. .-.+.|||+++.|- T Consensus 41 ei~~a~rk~l~~~a~a--~~~~Lp--VA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~y~GVfQIsVV~PaGtG 116 (169) T protein:vir:10 41 EMMVAARKLVSDAAVD--IAGSLP--VAYENCGFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRYYVGMVQVSIFFSPGEG 116 (169) T ss_pred HHHHHHHHHHHHHHhh--cccCCc--EeeCCCCcCCCCCCccEEEEEEecCCceeeeccCCCceEEEEEEEEEEecCCCC Confidence 2334444555444332 333334 44455555433333210 12444432 33589999999862 Q ss_pred chHHHHHHHHHHHHhcc---ce-eeecccccccceeeeecCcccchhhccceeeEEeecccC Q lcl|NC_021073. 66 SELVITKKNELIEKYEG---FS-GTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTENI 123 (123) Q Consensus 66 y~~~l~~d~~i~~~we~---i~-G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~~ 123 (123) -.++.++-.+|....+. .. |+|-+-|+ | +|--+....|-|-+.|.=.. T Consensus 117 ~~ka~qiAdeiadlF~~gt~L~~Gyi~~~~~---~-------~p~i~~~s~~~iPvr~~~R~ 168 (169) T protein:vir:10 117 TDRPRQLAGRLSEAFADGTMLDSGYIYEGGS---V-------FPPVKSQSGWFIPVRFYVRM 168 (169) T ss_pred cchhHHHHHHHHHhhhCCceeeceeecCCCe---E-------CCeeecCCceEEeEEEEEEe Confidence 56778888889988866 33 98888776 3 23333334555555544444 No 23 >protein:vir:1274 Length: 162 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690766;genbank:gi:22855006;genbank:GeneID:955217 Probab=23.33 E-value=2.3 Score=18.58 Aligned_cols=110 Identities=15% Similarity=0.168 Sum_probs=61.1 Q ss_pred CcchHHH--HHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecCch--HHHHHHHHH Q lcl|NC_021073. 1 MIEIDLK--NDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPDSE--LVITKKNEL 76 (123) Q Consensus 1 MIE~~iK--~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~Dy~--~~l~~d~~i 76 (123) ++...|. ..|..++|-.+|=+.-|....-+-|||..+.+-.- +---.+=--..-+|||.|...|.+ ...++++++ T Consensus 44 ~v~q~L~n~~~L~~l~~~~i~~l~~~~~~~~p~Itf~e~~~~p~-~yADD~e~ss~~~iQIDIwsk~st~~d~~~l~~~I 122 (162) T protein:vir:12 44 ELVSTLNSSAFLKGLTSGGIHNLVANDVSAFPRVVFSEIQDADA-DFADNEVYSFEVRYQISIFTQASTRGKETAIASEI 122 (162) T ss_pred HHHHHhcChhHHHhhCCCceEEEeecCCCCceEEEEEeecCCCC-cccccceeeEEEEEEEEEeecCCcchhHHHHHHHH Confidence 3333332 56777888899999987777889999999987432 223334445678999999986643 345677777 Q ss_pred HHHhccceeeecccccccceeeeecCcccchhhccceeeEEee----cccC Q lcl|NC_021073. 77 IEKYEGFSGTMGETDIFISRIESSVPTFDNAQQNFEYNITIKF----TENI 123 (123) Q Consensus 77 ~~~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~f----te~~ 123 (123) -..=+.+ |+ +-.+...++...+.+ |+=.+.| -+-+ T Consensus 123 ~~lMk~~-GF---------~R~s~~d~YE~DTkl--yHK~~RF~~~y~~E~ 161 (162) T protein:vir:12 123 DRLMREI-GY---------SRYDSQDLYETDTKV--FHKARRYKKTYYQEV 161 (162) T ss_pred HHHHHHc-CC---------EeecCCCCCCChhhh--hhhhheeccceeeec Confidence 6653221 11 001111222223333 2222222 2222 No 24 >protein:vir:6215 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:10885 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852595;genbank:gi:31415855;genbank:GeneID:1489213 Probab=21.41 E-value=2.6 Score=18.30 Aligned_cols=105 Identities=19% Similarity=0.240 Sum_probs=58.5 Q ss_pred Ccch-HHHHHHHHHhcCccceeecCccccccceeeEeecCCccccccccccccccceEEEEEEecC-chHHHHHHHHHHH Q lcl|NC_021073. 1 MIEI-DLKNDLESSMGMNAYPIKIPQNAAYPAIVYKEISNGRNDDSNLDSSNLRNKYYEIVIVTPD-SELVITKKNELIE 78 (123) Q Consensus 1 MIE~-~iK~~LErltgl~~YPLllP~d~~~egvTyQrISDP~~~dsGl~rT~Lv~aRfQI~i~~~D-y~~~l~~d~~i~~ 78 (123) .|-- .+|++| ..+|++||-=.-|.++.+|-+.|.-||..+---|| .+-+.-.-||||+++.- =..+..+++++-. T Consensus 2 ~i~Fe~lr~~L-k~~g~~V~RD~ap~~t~YPyivYs~v~e~~k~AS~--kv~~~~~~YQvSl~T~GtE~dl~~l~k~f~~ 78 (109) T protein:vir:62 2 QINFEQLRSLM-KKSGIPVSRDNAPTGIDYPYIVYEFVNEQHKRASN--KVLKDMPLYQIAVITNGTEKDYEPLKAVFNE 78 (109) T ss_pred cccHHHHHHHH-HhcCCceeeccCCCCCCCceEEEEeecCceeeecc--ceEeecceeEEEEeeccchhHHHHHHHHHhh Confidence 2332 355655 45899999999999999999999999987653333 33456678999999742 3344555555543 Q ss_pred HhccceeeecccccccceeeeecCcccchhhccceeeEEeecccC Q lcl|NC_021073. 79 KYEGFSGTMGETDIFISRIESSVPTFDNAQQNFEYNITIKFTENI 123 (123) Q Consensus 79 ~we~i~G~ig~~pV~~~Q~v~rg~~~q~~~~lt~yri~~~fte~~ 123 (123) .==.++++.| + |. -.|-++-|+|- +|-+-| T Consensus 79 ~~vpfs~f~g---I---qg------DENDdTiTnfy---TyVrci 108 (109) T protein:vir:62 79 VGVSYSQFDG---M---DY------DENDDTITQFI---TYVRCI 108 (109) T ss_pred cCCccccccc---c---CC------CCCcchheeee---eeeEEe Confidence 2111112211 1 10 11333334321 111111 Done!