Query lcl|NC_019711.1_cdsid_YP_007111786.1 [gene=F849_gp12] [protein=minor tail protein] [protein_id=YP_007111786.1] [location=8362..8757] Match_columns 131 No_of_seqs 47 out of 78 Neff 4.5 Searched_HMMs 1612 Date Thu Nov 7 16:31:33 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_12 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_12_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:3428 Length: 131 # 100.0 5.2E-71 3.2E-74 405.9 14.6 131 1-131 1-131 (131) 2 protein:vir:397 Length: 132 # 100.0 1.1E-68 7.1E-72 393.1 14.7 131 1-131 1-132 (132) 3 protein:vir:79571 Length: 137 100.0 2.7E-67 1.7E-70 385.6 13.5 131 1-131 6-137 (137) 4 protein:vir:96125 Length: 140 94.6 0.0027 1.7E-06 34.7 11.2 123 1-131 3-133 (140) 5 protein:vir:102955 Length: 138 91.2 0.017 1E-05 30.3 11.3 111 1-131 1-115 (138) 6 protein:vir:79047 Length: 145 90.9 0.019 1.2E-05 30.1 12.0 114 1-131 1-118 (145) 7 protein:vir:96894 Length: 140 90.5 0.021 1.3E-05 29.8 11.5 124 1-131 1-133 (140) 8 protein:vir:5979 Length: 134 # 89.9 0.024 1.5E-05 29.5 11.1 123 1-131 1-133 (134) 9 protein:vir:1244 Length: 145 # 89.3 0.027 1.7E-05 29.2 11.7 123 1-131 1-133 (145) 10 protein:vir:96260 Length: 141 88.8 0.03 1.9E-05 28.9 11.3 124 1-131 1-135 (141) 11 protein:vir:105892 Length: 141 88.8 0.03 1.9E-05 28.9 11.3 124 1-131 1-135 (141) 12 protein:vir:94096 Length: 141 88.8 0.03 1.9E-05 28.9 11.3 124 1-131 1-135 (141) 13 protein:vir:95111 Length: 145 84.9 0.057 3.5E-05 27.4 11.7 124 1-131 1-133 (145) 14 protein:vir:97421 Length: 145 84.8 0.058 3.6E-05 27.4 11.6 124 1-131 1-133 (145) 15 protein:vir:94488 Length: 145 84.8 0.058 3.6E-05 27.4 11.6 124 1-131 1-133 (145) 16 protein:vir:93736 Length: 145 84.8 0.058 3.6E-05 27.4 11.6 124 1-131 1-133 (145) 17 protein:vir:97325 Length: 145 84.2 0.063 3.9E-05 27.2 11.7 124 1-131 1-133 (145) 18 protein:vir:9764 Length: 111 # 81.8 0.043 2.7E-05 28.1 7.3 104 1-131 1-108 (111) 19 protein:vir:94768 Length: 111 81.2 0.067 4.2E-05 27.0 8.1 104 1-131 1-108 (111) 20 protein:vir:94921 Length: 125 80.9 0.091 5.6E-05 26.3 12.3 120 1-131 1-123 (125) 21 protein:vir:95961 Length: 145 80.2 0.097 6E-05 26.1 11.4 124 1-131 1-133 (145) 22 protein:vir:94794 Length: 145 80.2 0.097 6E-05 26.1 11.4 124 1-131 1-133 (145) 23 protein:vir:107096 Length: 145 75.5 0.15 9E-05 25.2 11.3 123 1-131 1-133 (145) 24 protein:vir:105337 Length: 145 75.5 0.15 9E-05 25.2 11.3 123 1-131 1-133 (145) 25 protein:vir:1643 Length: 111 # 71.6 0.19 0.00012 24.5 8.6 103 1-131 1-108 (111) 26 protein:vir:4907 Length: 128 # 68.4 0.24 0.00015 24.0 12.0 121 1-128 1-128 (128) 27 protein:vir:107704 Length: 132 68.3 0.24 0.00015 24.0 9.1 117 2-131 1-127 (132) 28 protein:vir:3972 Length: 129 # 61.3 0.36 0.00022 23.0 12.1 121 1-128 2-129 (129) 29 protein:vir:2741 Length: 128 # 60.9 0.36 0.00023 23.0 11.8 121 1-128 1-128 (128) 30 protein:vir:99226 Length: 157 58.5 0.41 0.00026 22.7 8.8 128 1-131 5-149 (157) 31 protein:vir:3618 Length: 129 # 57.3 0.44 0.00027 22.6 12.1 121 1-128 2-129 (129) 32 protein:vir:103278 Length: 169 56.1 0.46 0.00029 22.4 9.9 119 1-130 41-169 (169) 33 protein:vir:96485 Length: 128 46.7 0.73 0.00045 21.3 12.1 120 1-128 1-128 (128) 34 protein:vir:10327 Length: 182 44.7 0.8 0.00049 21.1 11.0 124 1-131 1-140 (182) 35 protein:vir:79247 Length: 157 32.2 1.4 0.00089 19.7 9.1 127 1-131 5-149 (157) 36 protein:vir:744 Length: 129 # 25.2 2.1 0.0013 18.8 12.2 121 1-128 2-129 (129) 37 protein:vir:104224 Length: 161 20.0 2.8 0.0018 18.1 8.8 126 1-131 1-154 (161) No 1 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=100.00 E-value=5.2e-71 Score=405.89 Aligned_cols=131 Identities=100% Similarity=1.489 Sum_probs=130.6 Q ss_pred CccHHHHHHHHHHHHhcCCceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHHHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDAW 80 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD~~ 80 (131) |+||+|||+|+|+||+++++|+||||||+|||++|+|||||||||++|+|+++|+++|+|+|||+||||+++||++||+| T Consensus 1 ~~ht~IR~~Vid~L~~~l~~v~~fdG~P~fide~ElPAVAV~l~d~~~~~~~ld~~~w~A~LhI~iyLka~~~ds~LD~~ 80 (131) T protein:vir:34 1 MKHTELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDAW 80 (131) T ss_pred CchHHHHHHHHHHHhccCCceEEecCCceeeccccCcEEEEEeecCCCCcceecCCeeEEEEEEEEEeecCCCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 81 MESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 81 ~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) ||++|+|||+++++|++|+++|+++||+|+||++|+||++|||+|+|||+| T Consensus 81 ~E~~i~~v~~~~~~l~~l~~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) T protein:vir:34 81 MESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) T ss_pred HHHHhHHHhhcchhhhhHhhhhhhccCCcccccccceEEEEEEEEEEEEeC Confidence 999999999999999999999999999999999999999999999999999 No 2 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=100.00 E-value=1.1e-68 Score=393.05 Aligned_cols=131 Identities=69% Similarity=1.184 Sum_probs=129.3 Q ss_pred CccHHHHHHHHHHHHhcCCc-eEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTG-ATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDA 79 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~-v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD~ 79 (131) |+||+|||+|+|+||+++++ ++||||||+|||++|+|||||||||++|+|+++|+++|+|+|||+||||+++||++||+ T Consensus 1 ~~ht~IR~~Vid~L~~~l~~~~~ffdGrP~fiDe~elPAVAV~l~d~~~~~~~ld~~~w~A~LhI~iyLka~~~ds~LD~ 80 (132) T protein:vir:39 1 MKHRDIRKVIIDALESAIGTDAIYFDGRPAVLEEGDFPAVAVYLTDAEYTGEELDADTWQAILHIEVFLEAQVPDSELDD 80 (132) T ss_pred CchHHHHHHHHHHHHhhCCCceEEecCcceeeccccCcEEEEEeecCCCCcceecCCeeEEEEEEEEEeecCCCHHHHHH Confidence 99999999999999999987 57999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 80 WMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 80 ~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) |||++|+|++.++++|++++++|.++||+|+||++|+||++|||+|+|||+| T Consensus 81 ~aE~~i~p~i~~~~~l~~l~~~~~~~gy~Y~rD~~~atW~sadL~y~ItY~~ 132 (132) T protein:vir:39 81 WMETRVYPVLAEVPGLESLITTMVQQGYDYQRDDDMALWSSADLKYSITYDM 132 (132) T ss_pred HHHHHhHhhhcccchhhhHhhhhhhcCCCcccccccceEEEEEEEEEEEEeC Confidence 9999999999999999999999999999999999999999999999999999 No 3 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=100.00 E-value=2.7e-67 Score=385.57 Aligned_cols=131 Identities=50% Similarity=0.932 Sum_probs=128.6 Q ss_pred CccHHHHHHHHHHHHhcCCc-eEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTG-ATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDA 79 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~-v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD~ 79 (131) +|||+|||+|+|+||+++++ ++||||||+|+|++|+|||||||||++|+|+++|+++|+|+|||+||||+++||++||+ T Consensus 6 ~iht~IR~~Vid~L~~~l~~~~~ffdGrP~fiDe~ElPAVAV~l~da~~~~~~ld~~~W~A~LhI~iyLka~~~ds~LD~ 85 (137) T protein:vir:79 6 NRHTQIRQVVLARLREQCGDSATFFDGLPAFVDAQELPAVSVWLSDAQYTGKMTDEDDWQAVLHIAVFIRAQAPDSELDM 85 (137) T ss_pred HHHHHHHHHHHHHHHhhcCCcEEEeCCccceechhhCcEEEEEeecCCCCcceecCCeeEEEEEEEEEeecCCCHHHHHH Confidence 56999999999999999987 57999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 80 WMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 80 ~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) |||++|+|+|+++++|++|+++|+++||+|+||++|+||+||||+|+|||+= T Consensus 86 ~~E~~I~~v~~~~~~l~~l~~~~~~~gY~Y~rD~e~~tW~sadL~y~ItYe~ 137 (137) T protein:vir:79 86 WMESTIFPALNDVPALSGLIDTLIPLGFNYQRDNEMATWAMAEITYQITYTN 137 (137) T ss_pred HHHHHHHHhhcchhhhhhHhhhhhcccCCcccccccceeEEEEEEEEEEEcC Confidence 9999999999999999999999999999999999999999999999999999 No 4 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=94.57 E-value=0.0027 Score=34.70 Aligned_cols=123 Identities=11% Similarity=0.084 Sum_probs=82.7 Q ss_pred Cc-cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCce-eecCCceEEEEEEEEEeecCCC Q lcl|NC_019711. 1 MK-HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGE-ELDSDTWQAELHIEVFLPAQVP 73 (131) Q Consensus 1 ~~-H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~-~ld~~~w~A~LhI~iyLk~~~~ 73 (131) |. -.+++++|.++|++.-. +-.+||..| ++...|-|. |-+.+..+. .-|+..++-.|.|.|+=+.. + T Consensus 3 msa~~aLq~Ai~~~L~ad~~l~alvggrVyD~~P---~~~~~PYV~--lG~~~~~~~~~~~~~g~~~~~tl~Vws~~~-g 76 (140) T protein:vir:96 3 VTAEPLLYNKIMNNLIENPITDKLVGGRVFDCVQ---KDVVYPYIV--VGESNVTESERSPGMREIIAITFHVYSQYE-N 76 (140) T ss_pred cchhHHHHHHHHHHhccChhHHhhcCcccccCCc---cCCCCCEEE--eCCceeeecCCCcccceEEEEEEEEEEcCC-C Confidence 55 44999999999987432 225788877 345667444 444444333 33577789999999986654 4 Q ss_pred hhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 74 DSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 74 d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -.+--+++. .|..++.....|.+. +..+...+=++.||.+..+|+ +.|+|.++++= T Consensus 77 ~~ea~~ia~-ai~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~h-gvl~~ra~ve~ 133 (140) T protein:vir:96 77 GAEARELLK-YLNYACRLNINFKDYELEWIKKDNSQVFTDIDQYTKH-GVLRLLYKVRH 133 (140) T ss_pred HHHHHHHHH-HHHHHhcCCccCCCceEEEEEEeeeEEeecCCCceEE-EEEEEEEEEee Confidence 444555675 688777543343332 446777788888999998876 66888888887 No 5 >protein:vir:102955 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945290;genbank:gi:39653725;uniprot:Q708M2;genbank:GeneID:2672869 Probab=91.25 E-value=0.017 Score=30.32 Aligned_cols=111 Identities=12% Similarity=0.173 Sum_probs=70.9 Q ss_pred Cc--cHHHHHHHHHHHHhcCCceEEEeCcceEEc-cccCcEEEEEeeCCccCceeecCCceEEEEEEEE-EeecCCChhH Q lcl|NC_019711. 1 MK--HTELRAAVLDALEKHDTGATFFDGRPAVFD-EADFPAVAVYLTGAEYTGEELDSDTWQAELHIEV-FLPAQVPDSE 76 (131) Q Consensus 1 ~~--H~~IR~~V~d~L~~~~~~v~~f~GrP~fid-e~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~i-yLk~~~~d~~ 76 (131) |. -=+||.+|...|++.+|++++|+-. +. -=+-|..-|.+-+.+..... ...++=.+-+.| |.|....-++ T Consensus 1 ~~~~~~~I~~aI~~~Lk~~fpd~~Iy~e~---i~Qgf~~PcFFI~ll~~~~~~~~--~~r~~r~~~~dI~Yfp~~~~~~e 75 (138) T protein:vir:10 1 MANKGFRLVEELVSHIKGLYPDIRIYLDE---VEQGFKEPCFFIHVVDTKYTPEA--NKYVKVRSKVDLSYFPPKKKRSE 75 (138) T ss_pred CCcchhhhHHHHHHHHHHhcCCceeeecc---cccCCcCCeEEEEEecccCcccc--CceEEEEEEEEEEEecCcchhHH Confidence 54 4589999999999999999888632 11 22569999999999988887 444444444433 6665433344 Q ss_pred HHHHHHHhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 77 LDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 77 LD~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) +=+-+ ++++.++..++. +..++-.|+= .-..|+|.++|.+ T Consensus 76 ~~~v~-e~L~~~f~~~~~-------i~~~~~~~~I-------~DgVLhf~f~~~~ 115 (138) T protein:vir:10 76 CLAMQ-EELSYKLLHLPT-------IHLFDRQYEV-------VDNVLHCIFNAST 115 (138) T ss_pred HHHHH-HHHHHHHhhcCe-------eeeecceeeE-------EcCeEEEEEEEEE Confidence 44344 467766665542 2223333321 2246999999988 No 6 >protein:vir:79047 Length: 145 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110730;genbank:gi:134287347;genbank:GeneID:4955221 Probab=90.86 E-value=0.019 Score=30.06 Aligned_cols=114 Identities=11% Similarity=0.135 Sum_probs=68.9 Q ss_pred CccHHHHHHHHHHHHhcCCc-eEEEeCcceEEc-cccCcEEEEEeeCCccCceeecCCce-EEEEEEEEEeec-CCChhH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTG-ATFFDGRPAVFD-EADFPAVAVYLTGAEYTGEELDSDTW-QAELHIEVFLPA-QVPDSE 76 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~-v~~f~GrP~fid-e~elPAVAVyl~da~~~~~~ld~~~w-~A~LhI~iyLk~-~~~d~~ 76 (131) |+ ++||.+|...|++.+|+ +++|+.. +. .-+.|.--|-+-+.+..... ...+ +...--=.|.|. .....+ T Consensus 1 mi-~dI~~aI~~~Lk~~Fp~~~~IY~e~---i~Qgf~~PcFFI~ll~~~~~~~~--~~r~~r~~~~dI~Yfp~~~~~~~e 74 (145) T protein:vir:79 1 ML-NNIIDGISVKLDKSFGEKYTIYSED---VEQGINEPCFFIVPLNPSKTPYP--SGRELKKNSFDVHYFPRSEAKNFE 74 (145) T ss_pred Ch-HHHHHHHHHHHHHhcCCceEEEecc---cccCccCCeeEEEEecccccccc--CceEEEEEEEEEEEeecCCCCchh Confidence 77 89999999999999995 7888764 22 22669999999988888776 4444 433333345554 333344 Q ss_pred HHHHHHHhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 77 LDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 77 LD~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) +=+.++ +++.++..+.-.. ..+..++-+++= .-..|+|.++|.+ T Consensus 75 ~~ev~e-~L~~~le~i~v~~---~~~~~~~~~~ei-------vDgvLhf~~~~~~ 118 (145) T protein:vir:79 75 INEIAE-MLLEELEYIEING---DLVRGTNMNFEI-------IDNVLHFFVDYNY 118 (145) T ss_pred HHHHHH-HHHhhhcceeecC---cEEeeecceeEE-------eeceEEEEEEEEE Confidence 444554 4555553322111 123333333332 2236889888888 No 7 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=90.46 E-value=0.021 Score=29.81 Aligned_cols=124 Identities=15% Similarity=0.107 Sum_probs=79.5 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.+++++|.++|++.-. +-.+||..|. ....|-|.+-=++....+. -|...++-.+.|.|+=+... T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~~VyD~~P~---~~~~Pyv~lG~~~~~~~~~-~~~~g~~~~~~i~Vws~~~g 76 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGDRVFDVVQE---DAVYPYIVVGESNVTNNES-STMMRETVGIVIHVYSQFAT 76 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCCccccCCcc---CCCCCEEEecCceeeecCC-CcccceEEEEEEEEEEcCCC Confidence 44 67999999999987421 2257887773 4566755554444444332 24667899999999865543 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -.+--+++. .|..++..-..+.+- +.++...+=+..||.+..+|+ +.|+|.+++.= T Consensus 77 -~~ea~~ia~-av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~~r~~v~~ 133 (140) T protein:vir:96 77 -QYEAKQIIS-AIGYVLNRPIDIENYEFQFSRIDSQSVFPDIDRFTKH-GTIRLLFKYRH 133 (140) T ss_pred -HHHHHHHHH-HHHHHhCCCccCCCCeEEEEEEeeeEEEecCCCceEE-EEEEEEEEEEe Confidence 344456775 577777432222221 445666667788999988886 56777776665 No 8 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=89.86 E-value=0.024 Score=29.46 Aligned_cols=123 Identities=12% Similarity=0.046 Sum_probs=81.4 Q ss_pred Cc----cHHHHHHHHHHHHhcCCce----EEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK----HTELRAAVLDALEKHDTGA----TFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~----H~~IR~~V~d~L~~~~~~v----~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|++.-.-. .+||+-|. ....|-|.+-=++....+ .-|...++-.+.|.|+=++ T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg~I~D~~P~---~~~~PYV~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~-- 74 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVNQVTESPGK---DDPYPYVVIGDQSSTPFE-TKSSFGENITMDFHVWGGT-- 74 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhhhhhcCCCC---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEECC-- Confidence 44 4589999999999843311 47777662 446675444334433332 4567789999999999543 Q ss_pred ChhHHHHHHHHhhhhhhhchh-hhh-hheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIP-ALS-DLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~-~l~-~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) +-.+--+++. .|..++.+.+ .|. .-+..+....-++.||.+..+|+ +.|+|....+= T Consensus 75 g~~ea~~ia~-av~~aL~~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~h-g~l~fra~ve~ 133 (134) T protein:vir:59 75 TRAEAQDISS-RVLEALTYKPLMFEGFTFVAKKLVLAQVITDTDGVTKH-GIIKVRFTINN 133 (134) T ss_pred ChHHHHHHHH-HHHHHhcCCCcccCCceEEEeEEeeeeEEecCCCceEE-EEEEEEEEEec Confidence 5566666775 5778874432 222 22556777788888999999885 67777777777 No 9 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=89.29 E-value=0.027 Score=29.17 Aligned_cols=123 Identities=16% Similarity=0.156 Sum_probs=80.8 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceee-cCCceEEEEEEEEEeecC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEEL-DSDTWQAELHIEVFLPAQ 71 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~l-d~~~w~A~LhI~iyLk~~ 71 (131) |. -.+++++|.++|++.-. +-.+||..| ++...|-| .|-+.+..+... |...++-.|.|.|+=+.. T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~vyD~~P---~~~~~PyV--~lG~~~~~~~~t~~~~~~~~~lti~Vws~~~ 75 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGRVFDCVQ---KDAVYPYI--VVGETNVTNKETTTSMVEDVGITLHVYSQAR 75 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCcccccCCc---cCCCCCEE--EeccceeeecCCCcccceEEEEEEEEEEcCc Confidence 44 56899999999985321 236899888 55677864 455555555543 456789999999986554 Q ss_pred CChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 72 VPDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 72 ~~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) +-.+.-+++. .|..++.....+.+. +-.+...+-+.-||.+..+|+. .|+|..+++= T Consensus 76 -gr~ea~~ia~-ai~~aL~~~l~l~~~~lv~l~~~~~~~~rd~d~~~~hg-vl~~ra~i~~ 133 (145) T protein:vir:12 76 -NRDEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHG-IIRLVFKYRH 133 (145) T ss_pred -cHHHHHHHHH-HHHHHhccccCCCCceEEEEEEeeEEEEecCCCceEEE-EEEEEEEEEe Confidence 4456677886 577777544333322 3456677777889988877653 4555555554 No 10 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=88.79 E-value=0.03 Score=28.92 Aligned_cols=124 Identities=12% Similarity=0.037 Sum_probs=77.4 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|++.-. +-.+||.-|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~---~~~~PYv~lG~~~~~~~~-~~~~~g~~~~~ti~Vws~~~g 76 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQD---DAVYPYIVVGESNVTNNE-SSATMRETVGIVIHVYSQFAT 76 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCcc---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 43 67899999999987422 2257777663 446675554444444333 235677999999999976654 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEE--EeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVIT--YEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~It--Y~~ 131 (131) ..... +++. .|..++..-..+.+ .+..+.+..=+..||.+..+|+ +.|+|.++ +.- T Consensus 77 ~~eak-~ia~-av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~h-gvl~~ra~v~~~~ 135 (141) T protein:vir:96 77 QYEAK-LILS-AIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKH-GTIRLLFKYRHKK 135 (141) T ss_pred HHHHH-HHHH-HHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEE-EEEEEEEEEEecc Confidence 44444 4664 68888854322332 2445666666777998888886 33555544 433 No 11 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=88.79 E-value=0.03 Score=28.92 Aligned_cols=124 Identities=12% Similarity=0.037 Sum_probs=77.4 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|++.-. +-.+||.-|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~---~~~~PYv~lG~~~~~~~~-~~~~~g~~~~~ti~Vws~~~g 76 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQD---DAVYPYIVVGESNVTNNE-SSATMRETVGIVIHVYSQFAT 76 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCcc---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 43 67899999999987422 2257777663 446675554444444333 235677999999999976654 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEE--EeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVIT--YEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~It--Y~~ 131 (131) ..... +++. .|..++..-..+.+ .+..+.+..=+..||.+..+|+ +.|+|.++ +.- T Consensus 77 ~~eak-~ia~-av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~h-gvl~~ra~v~~~~ 135 (141) T protein:vir:10 77 QYEAK-LILS-AIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKH-GTIRLLFKYRHKK 135 (141) T ss_pred HHHHH-HHHH-HHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEE-EEEEEEEEEEecc Confidence 44444 4664 68888854322332 2445666666777998888886 33555544 433 No 12 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=88.79 E-value=0.03 Score=28.92 Aligned_cols=124 Identities=12% Similarity=0.037 Sum_probs=77.4 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|++.-. +-.+||.-|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~---~~~~PYv~lG~~~~~~~~-~~~~~g~~~~~ti~Vws~~~g 76 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQD---DAVYPYIVVGESNVTNNE-SSATMRETVGIVIHVYSQFAT 76 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCcc---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 43 67899999999987422 2257777663 446675554444444333 235677999999999976654 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEE--EeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVIT--YEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~It--Y~~ 131 (131) ..... +++. .|..++..-..+.+ .+..+.+..=+..||.+..+|+ +.|+|.++ +.- T Consensus 77 ~~eak-~ia~-av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~h-gvl~~ra~v~~~~ 135 (141) T protein:vir:94 77 QYEAK-LILS-AIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKH-GTIRLLFKYRHKK 135 (141) T ss_pred HHHHH-HHHH-HHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEE-EEEEEEEEEEecc Confidence 44444 4664 68888854322332 2445666666777998888886 33555544 433 No 13 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=84.89 E-value=0.057 Score=27.41 Aligned_cols=124 Identities=15% Similarity=0.101 Sum_probs=79.4 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||..|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~---~a~~PYV~lG~~~~~~~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcC---CCCCCEEEecCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 65 67899999999987432 2257887773 446776554444444333 235677899999999866544 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) . .+--.++. .|..++..-..|.+ .+..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 ~-~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~~ra~ve~ 133 (145) T protein:vir:95 77 R-DEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDRYTKH-GIIRLVFKYRH 133 (145) T ss_pred H-HHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEE-EEEEEEEEEEe Confidence 3 44445664 68888854333322 1445666666777888888665 45677766666 No 14 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=84.79 E-value=0.058 Score=27.38 Aligned_cols=124 Identities=15% Similarity=0.097 Sum_probs=80.2 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||..|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~---~a~~PYV~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcC---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 65 67899999999987432 2257887773 446676554444444333 235677899999999866543 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -.+--.++. .|..++..-..|.+- +..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 -~~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~fra~ve~ 133 (145) T protein:vir:97 77 -RDEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKH-GIIRLVFKYRH 133 (145) T ss_pred -HHHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEE-EEEEEEEEEEe Confidence 344455774 688787543333332 345556666777998888775 45777777776 No 15 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=84.79 E-value=0.058 Score=27.38 Aligned_cols=124 Identities=15% Similarity=0.097 Sum_probs=80.2 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||..|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~---~a~~PYV~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcC---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 65 67899999999987432 2257887773 446676554444444333 235677899999999866543 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -.+--.++. .|..++..-..|.+- +..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 -~~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~fra~ve~ 133 (145) T protein:vir:94 77 -RDEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKH-GIIRLVFKYRH 133 (145) T ss_pred -HHHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEE-EEEEEEEEEEe Confidence 344455774 688787543333332 345556666777998888775 45777777776 No 16 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=84.79 E-value=0.058 Score=27.38 Aligned_cols=124 Identities=15% Similarity=0.097 Sum_probs=80.2 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||..|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~---~a~~PYV~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcC---CCCCCEEEeCCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 65 67899999999987432 2257887773 446676554444444333 235677899999999866543 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -.+--.++. .|..++..-..|.+- +..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 -~~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~fra~ve~ 133 (145) T protein:vir:93 77 -RDEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKH-GIIRLVFKYRH 133 (145) T ss_pred -HHHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEE-EEEEEEEEEEe Confidence 344455774 688787543333332 345556666777998888775 45777777776 No 17 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=84.17 E-value=0.063 Score=27.18 Aligned_cols=124 Identities=15% Similarity=0.090 Sum_probs=80.8 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||.-|. ....|-|.+-=++....+ .-|...++-.+.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV~D~~P~---~a~~PYv~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCceecCCcc---CCCCCEEEeCcceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 65 67899999999997432 2257887773 446775554434443333 234667899999999976544 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) . .+--+++. .|..++.+-..|.+ .+..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 ~-~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~fra~ve~ 133 (145) T protein:vir:97 77 R-DEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKH-GIIRLVFKYRH 133 (145) T ss_pred H-HHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEE-EEEEEEEEEec Confidence 3 44445664 68888854333332 1445666677778998888775 45888877776 No 18 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=81.81 E-value=0.043 Score=28.09 Aligned_cols=104 Identities=14% Similarity=0.303 Sum_probs=64.8 Q ss_pred CccHHHHHHHHHHHHhcCCceEEEeCcceEEc-cccCcE--EEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGATFFDGRPAVFD-EADFPA--VAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSEL 77 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~~f~GrP~fid-e~elPA--VAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~L 77 (131) ||-. -|+..|..++ |-|+|++ +.+.|. |.| +.||-..+.-.-+|+|-|..|=++ +.+- T Consensus 1 mIE~----~i~~yL~~~l-------~vpv~~e~p~~~P~~FV~v-----EkTGG~~~~~~~~a~lAvQsyg~S---~~~A 61 (111) T protein:vir:97 1 MIEV----IIKKYLDEHL-------DVPSFFEHQKDEPARFIIL-----EKTSGAKQNHLLSSTFAFQSYAES---LYEA 61 (111) T ss_pred Chhh----hhhHHHhhhc-------CceEEEeecCCCCCceEEE-----EeeCCccccccccceEEEEecchh---HHHH Confidence 4322 2344444444 6699998 777774 444 445555555557888888876544 4444 Q ss_pred HHHHHHhhhhhhhchhhhhhheeeeee-cccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 78 DAWMESRIYPVMSDIPALSDLITSMVA-SGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 78 D~~~E~~i~~v~~~~~~l~~li~~~~~-~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -++++ +|.++|...+.+.++.. +.+ +.|+|...+-.. =.||+.|++ T Consensus 62 A~La~-~V~~a~~~l~~l~~i~~-v~lns~Ynf~d~~tk~------yRYQa~~di 108 (111) T protein:vir:97 62 ALLND-KVKQVIEQLDVLPQVSG-VHLNADYNFTDTATKR------YRYQAVFDI 108 (111) T ss_pred HHHHH-HHHHHhhhhccCcccee-eeecccccCCCCCCCC------ccEEEEEEE Confidence 45666 58899988888775544 444 478888766422 357777777 No 19 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=81.24 E-value=0.067 Score=27.01 Aligned_cols=104 Identities=13% Similarity=0.305 Sum_probs=65.0 Q ss_pred CccHHHHHHHHHHHHhcCCceEEEeCcceEEc-cccCc--EEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGATFFDGRPAVFD-EADFP--AVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSEL 77 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~~f~GrP~fid-e~elP--AVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~L 77 (131) || -.-|++.|.+++ |-|.+++ +.+.| -|.|=-++..+ ++-.-++.|-|.+|=++ +.+= T Consensus 1 mi----E~~v~~~L~~~l-------~vpv~~e~p~~~p~~FV~vErtGG~~-----~~~~~~~~lAVQ~~~~S---~~eA 61 (111) T protein:vir:94 1 MI----EIIIKNFLDTHL-------SVSSFLEKKGEMPLSYVLFEKTGSSK-----SNHLLSSTFAFQSYAPS---MYEA 61 (111) T ss_pred Ch----HHhHHHHHhhcC-------CcceEeecCCCCCCceEEEEecCCcc-----ccccccceEEEEecchh---HHHH Confidence 44 344556666665 4589998 77776 35554444443 34446788888887554 3344 Q ss_pred HHHHHHhhhhhhhchhhhhhheeeeeec-ccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 78 DAWMESRIYPVMSDIPALSDLITSMVAS-GYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 78 D~~~E~~i~~v~~~~~~l~~li~~~~~~-gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) -++++ +|.++|...+.+.+... +.+. .|+|...+... =.||+.|++ T Consensus 62 a~La~-~v~~~~~~l~~~~~i~~-v~~~s~Ynf~d~~tk~------~RYQav~~i 108 (111) T protein:vir:94 62 AKLNE-QLKEVVERLIELNEISN-VSLNSDYNFTDTETKE------YRYQAVFDI 108 (111) T ss_pred HHHHH-HHHHHHhhcccccccce-eecCCCcccCCCcCCC------ceEEEEEEE Confidence 45665 79999999888776644 5544 78886555322 356666666 No 20 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=80.87 E-value=0.091 Score=26.29 Aligned_cols=120 Identities=8% Similarity=0.137 Sum_probs=67.1 Q ss_pred CccHHHHHHHHHHHHhcCCce-EEEeCcceEEccccCcEEEEEeeCCccCceeecCCc--eEEEEEEEEEeecCCChhHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGA-TFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDT--WQAELHIEVFLPAQVPDSEL 77 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v-~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~--w~A~LhI~iyLk~~~~d~~L 77 (131) |-+.+||++|+.+++++.+.. -.|+|+|. +.+-+=+-+.+...-...-.+.+.- =.+.+-|.||-|...+.... T Consensus 1 Mt~~q~r~~I~~r~~a~~~~~~I~~~N~pp---~~~~~W~Rlti~~g~~~~a~iG~~~~~rtGli~iqiF~p~~~G~~~~ 77 (125) T protein:vir:94 1 MSYFQEKLDIENYFKANWPDTPIFYENRTA---NSTGTWVRLTIQNGDAFQASNGEVSYRHPGVVFVQIFTKKEVGSGEA 77 (125) T ss_pred CCHHHHHHHHHHHHHhCCCccceeeCCCCC---CCCCceEEEEeccCcccccccCCceeeeeeEEEEEeeecCCcChHHH Confidence 999999999999999998864 37777761 1223333444433333333333332 23889999999999998888 Q ss_pred HHHHHHhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 78 DAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 78 D~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) .+.+. .+-........ +++ +-..+++=.= .+..-|.-.+++ |+|.= T Consensus 78 ~~~ad-~~~~~f~~~~~-g~i-~f~~~~~~~~---g~~~gwyQ~Nv~--I~f~~ 123 (125) T protein:vir:94 78 LKLAD-KVDALFRSKTL-GNI-QFKVPQVQKV---PSTTEWYQVNVS--TEFYR 123 (125) T ss_pred HHHHH-HHHHHHccCCC-Cce-EEeeceecCC---CCCCCEEEEEEE--Eeeec Confidence 88875 45544433321 221 2222221111 123446433321 23333 No 21 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=80.21 E-value=0.097 Score=26.14 Aligned_cols=124 Identities=15% Similarity=0.101 Sum_probs=79.3 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||..|. ....|-|.+-=++....+ .-|...++-.|.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~---~~~~PYv~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcC---CCCCCEEEecCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 55 67999999999987432 1258888773 446675554434443332 235677899999999866544 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) . .+--+++. .|..++..-..|.+ .+..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 ~-~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~fra~ve~ 133 (145) T protein:vir:95 77 R-DEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKH-GVIRLVFKYRH 133 (145) T ss_pred H-HHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEE-EEEEEEEEEEe Confidence 3 44445664 68888854333322 1345666666777898888774 56777777766 No 22 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=80.20 E-value=0.097 Score=26.14 Aligned_cols=124 Identities=15% Similarity=0.099 Sum_probs=79.3 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQV 72 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~ 72 (131) |. -.++|++|.++|+..-. +-.+||..|. ....|-|.+-=++....+ .-|...++-.|.|.|+=+... T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~---~~~~PYv~lG~~~~~d~~-~~~~~g~~~~~ti~Vws~~~g 76 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQK---DAVYPYIVVGETNVTNKE-TTTSMVEDVGITLHVYSQARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcC---CCCCCEEEecCceeeecC-CCcccceEEEEEEEEEEcCCC Confidence 55 67999999999987432 1258888773 446675554434443332 235677899999999866544 Q ss_pred ChhHHHHHHHHhhhhhhhchhhhhh-heeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 73 PDSELDAWMESRIYPVMSDIPALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 73 ~d~~LD~~~E~~i~~v~~~~~~l~~-li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) . .+--+++. .|..++..-..|.+ .+..+....=++.||.+..+|+ +.|+|.+.++= T Consensus 77 ~-~eak~ia~-av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~fra~ve~ 133 (145) T protein:vir:94 77 R-DEASQIIQ-FLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKH-GIIRLVFKYRH 133 (145) T ss_pred H-HHHHHHHH-HHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEE-EEEEEEEEEEe Confidence 3 44445664 68888854333322 1345666666777898888774 56777777766 No 23 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=75.53 E-value=0.15 Score=25.17 Aligned_cols=123 Identities=15% Similarity=0.163 Sum_probs=78.0 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCcee-ecCCceEEEEEEEEEeecC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEE-LDSDTWQAELHIEVFLPAQ 71 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~-ld~~~w~A~LhI~iyLk~~ 71 (131) |. -.++|++|.++|++.-. +-.+||..| .....|-|. |-+.+..+.. -|...++-.+.|.|+=+.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P---~~a~~PyV~--lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~ 75 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGGRVFDCVQ---KDAVYPYIV--VGETNVTNKETTTSMFEDVGVTLHVYSQAR 75 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCc---cCCCCCEEE--eCcceeeecCCCcccceEEEEEEEEEEcCC Confidence 55 67899999999987422 225888877 345667544 4344333333 3466789999999997665 Q ss_pred CChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 72 VPDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 72 ~~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) ..... -+++. .|..++..-..+.+- +..+....=++.||.+..+|+ +.|+|.+..+= T Consensus 76 g~~ea-~~ia~-av~~aL~a~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~~ra~ve~ 133 (145) T protein:vir:10 76 NRDEA-SQIIQ-YLGFVLNSEIEINNYSFIKSRIDTQEVITDIDQYTKH-GIIRLIFKYRH 133 (145) T ss_pred CHHHH-HHHHH-HHHHHhCCCcCCCCCeEEEEEEeeeeEeecCCCceEE-EEEEEEEEEee Confidence 44443 45664 677777422222221 345556666777998888775 45777777776 No 24 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=75.52 E-value=0.15 Score=25.17 Aligned_cols=123 Identities=15% Similarity=0.163 Sum_probs=78.1 Q ss_pred Cc---cHHHHHHHHHHHHhcCC-----ceEEEeCcceEEccccCcEEEEEeeCCccCcee-ecCCceEEEEEEEEEeecC Q lcl|NC_019711. 1 MK---HTELRAAVLDALEKHDT-----GATFFDGRPAVFDEADFPAVAVYLTGAEYTGEE-LDSDTWQAELHIEVFLPAQ 71 (131) Q Consensus 1 ~~---H~~IR~~V~d~L~~~~~-----~v~~f~GrP~fide~elPAVAVyl~da~~~~~~-ld~~~w~A~LhI~iyLk~~ 71 (131) |. -.++|++|.++|++.-. +-.+||..| .....|-|. |-+.+..+.. -|...++-.+.|.|+=+.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P---~~a~~PyV~--lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~ 75 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGGRVFDCVQ---KDAVYPYIV--VGETNVTNKETTTSMFEDVGVTLHVYSQAR 75 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCc---cCCCCCEEE--eCcceeeecCCCcccceEEEEEEEEEEcCC Confidence 55 67899999999987422 225888877 345667544 4344333333 3466789999999997665 Q ss_pred CChhHHHHHHHHhhhhhhhchhhhhhh-eeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 72 VPDSELDAWMESRIYPVMSDIPALSDL-ITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 72 ~~d~~LD~~~E~~i~~v~~~~~~l~~l-i~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) ..... -+++. .|..++..-..+.+- +..+....=++.||.+..+|+ +.|+|.+..+= T Consensus 76 g~~ea-~~ia~-av~~aL~a~l~l~~~~lv~l~~~~~~~~rd~dg~~~h-gvl~~ra~ve~ 133 (145) T protein:vir:10 76 NRDEA-SQIIQ-YLGFVLNSEIEINNYSFIKSRIDTQEVITDIDQYTKH-GIIRLIFKYRH 133 (145) T ss_pred CHHHH-HHHHH-HHHHHhCCCcCCCCCeEEEEEEeeeeEeecCCCceEE-EEEEEEEEEee Confidence 44443 45664 677777422222221 345556666777998888775 45777777776 No 25 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=71.65 E-value=0.19 Score=24.50 Aligned_cols=103 Identities=14% Similarity=0.307 Sum_probs=64.5 Q ss_pred CccHHHHHHHHHHHHhcCCceEEEeCcceEEc-cccCcE--EEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGATFFDGRPAVFD-EADFPA--VAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSEL 77 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~~f~GrP~fid-e~elPA--VAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~L 77 (131) || =.-|++.|.+++ |-|.+++ +.+.|. |.|=-++..++. -.-++.|-|.+|=++ +.+= T Consensus 1 mi----E~~i~~~L~~~l-------~Vpv~~e~p~~~P~~FV~vErtGG~~~~-----~~~~~~lAVq~w~~S---~~eA 61 (111) T protein:vir:16 1 MI----EIIIKNFLDTHL-------SVSSFLEKKGEMPLSYILFEKTGSSKSN-----HLLSSTFAFQSYAPS---MYEA 61 (111) T ss_pred Ch----HHhHHHHHhhcC-------CceeEeecCCCCCCceEEEEecCCcccc-----ccccceEEEEecchh---HHHH Confidence 44 344556666665 5589998 777774 555444444443 336788888886544 3344 Q ss_pred HHHHHHhhhhhhhchhhhhhheeeeee-cccccccccc-cceEEEEEEEEEEEEeC Q lcl|NC_019711. 78 DAWMESRIYPVMSDIPALSDLITSMVA-SGYDYRRDDD-AGLWSSADLTYVITYEM 131 (131) Q Consensus 78 D~~~E~~i~~v~~~~~~l~~li~~~~~-~gy~Y~rD~e-~~tW~sadL~y~ItY~~ 131 (131) -++++ ++.++|...+.+.+... +.+ ++|+|...+. +. .||++|++ T Consensus 62 a~La~-~v~~~l~~l~~~~~I~a-v~~~s~ynf~d~~tk~~-------RYQav~~i 108 (111) T protein:vir:16 62 AKLNE-QLKEVVERLIELNEISN-VSLNSDYNFTDTETKEY-------RYQAVFDI 108 (111) T ss_pred HHHHH-HHHHHHhhcccccccee-eecCCCCcCCCCCCCCc-------eEEEEEEE Confidence 45665 79999988888776544 554 4788865553 33 45555555 No 26 >protein:vir:4907 Length: 128 # NCBI annotation: gp128 # Family: family:all:504 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056685;genbank:gi:9635020;genbank:GeneID:1262660 Probab=68.41 E-value=0.24 Score=24.00 Aligned_cols=121 Identities=13% Similarity=0.055 Sum_probs=73.6 Q ss_pred Ccc--HHHHHHHHHHHHhcCCceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHH Q lcl|NC_019711. 1 MKH--TELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELD 78 (131) Q Consensus 1 ~~H--~~IR~~V~d~L~~~~~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD 78 (131) |+. .+|=+++...|++ -|..+||-.|. ++...|=|.+-=+...+ ..+-++-.|+..|.|-|+=+ ...-.+.+ T Consensus 1 m~sp~q~L~~~~f~~l~~--~g~~vyD~lP~--~~v~YPfV~ig~~~~~~-~~tKt~~~g~v~ltihVW~~-~~~R~ev~ 74 (128) T protein:vir:49 1 MKQPDQLLHDEMYRISCE--LGYNTYTYLPP--DDAAYPFVVMGETMVLP-QSTKSHLIGRLSSTVHVWGR-VDDRKTLS 74 (128) T ss_pred CCchHHHHHHHHHHHHHh--cCCceecccCC--CCCCCCEEEeeeeeecC-CccccccccEEEEEEEEEeC-CCCchhHH Confidence 885 4555556665554 36679999996 55567887665554442 23456777999999999944 45688999 Q ss_pred HHHHHhhhhhhhchhhhhhheeeeeeccccc----ccccccceEEEE-EEEEEEE Q lcl|NC_019711. 79 AWMESRIYPVMSDIPALSDLITSMVASGYDY----RRDDDAGLWSSA-DLTYVIT 128 (131) Q Consensus 79 ~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y----~rD~e~~tW~sa-dL~y~It 128 (131) ++++ +|..++.......+..=......-+. +.+.++.+++.. +|.|+|= T Consensus 75 ~i~~-~i~~~l~~~~~t~~y~f~~~i~~s~~~~~~D~st~~~L~Hgvl~l~f~~~ 128 (128) T protein:vir:49 75 DMAG-QLMSSFFAIKNIGGKQFSAEINQSSIDSNRDNSTDEVLYHFVIYTYFKFV 128 (128) T ss_pred HHHH-HHHHHhhcccccCCeEEEEEeccceEEEEeecCCCcceeeEEEEEEEEeC Confidence 9997 46665544433333211122222222 245556667554 6666665 No 27 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=68.27 E-value=0.24 Score=23.98 Aligned_cols=117 Identities=18% Similarity=0.166 Sum_probs=64.7 Q ss_pred ccHHHHHHHHHHHHhcCCceE-EEeCcceEEcccc-CcEEEEEeeCCccCceeec--CCceEEEEEEEEEeecCCChhHH Q lcl|NC_019711. 2 KHTELRAAVLDALEKHDTGAT-FFDGRPAVFDEAD-FPAVAVYLTGAEYTGEELD--SDTWQAELHIEVFLPAQVPDSEL 77 (131) Q Consensus 2 ~H~~IR~~V~d~L~~~~~~v~-~f~GrP~fide~e-lPAVAVyl~da~~~~~~ld--~~~w~A~LhI~iyLk~~~~d~~L 77 (131) .|-++|.|+=.+|-+.-.+.. -|-+. .|-.+.+ .+=+-+|.-.+......|. .-.|.++..|.|..|+..+..+. T Consensus 1 ~hyE~~~a~r~~la~~~~~lpVA~eNv-~F~Pp~~G~~yLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~paG~G~~~a 79 (132) T protein:vir:10 1 MHYELSAAARAAFLSKYRDFPHYMENR-NFTPPKDGGMWLRFNYIEGDTLYLSIDRKCKSYIAIVQIGVVFPPGSGVDEA 79 (132) T ss_pred CchHHHHHHHHHHHhhhcCCcEeecCC-CcCCCCCCceEEEEEEccCCceeeeccCcCcEEEEEEEEEEEecCCCCcchh Confidence 888888887776644322211 12222 2222233 3566677777776666665 44599999999999999999998 Q ss_pred HHHHHHhhhhhhhchhhhhhheeeeeecccccccc------cccceEEEEEEEEEEEEeC Q lcl|NC_019711. 78 DAWMESRIYPVMSDIPALSDLITSMVASGYDYRRD------DDAGLWSSADLTYVITYEM 131 (131) Q Consensus 78 D~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD------~e~~tW~sadL~y~ItY~~ 131 (131) -.+|.+ |.....+-..| -+||=|+-- ...++|... =+..|.+ T Consensus 80 ~~iAd~-i~~~F~~g~~l--------~~Gyi~~~~~~~p~i~~~s~~~iP---vrf~yR~ 127 (132) T protein:vir:10 80 RLKAKE-IADFFKDGKML--------NVGYIFEGAIVHQIVKHESGWMIP---VRFTVRV 127 (132) T ss_pred HHHHHH-HHHhccCccee--------ecceecCCCccCCceeCCcceEEE---EEEEEEe Confidence 888875 44444333222 233333322 222333211 1112222 No 28 >protein:vir:3972 Length: 129 # NCBI annotation: structural protein # Family: family:all:504 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663680;genbank:gi:21716117;genbank:GeneID:951217 Probab=61.32 E-value=0.36 Score=23.05 Aligned_cols=121 Identities=16% Similarity=0.160 Sum_probs=69.6 Q ss_pred CccHHHHHHHHHHHHhcC--CceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHD--TGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELD 78 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~--~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD 78 (131) |+. -.+++-+.+=+.+ -|..+||-.|. ++...|-|.+-=++..+. .+-|.-.|+..|.|-|+=.. ..-.+.+ T Consensus 2 mks--p~qeL~d~~f~~l~~lG~~vyD~lP~--~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~~-~~R~~v~ 75 (129) T protein:vir:39 2 IKT--RDQSIFDELFKRIQALGYTVYDYKQM--NEVGYPFVEMENTQTIHE-PNKTDIKGTVSLSLSVWGLQ-KKRKEVS 75 (129) T ss_pred CcC--hhHHHHHHHHHHHHhcCCeeeeccCC--CCCCcCEEEeeeeeecCC-ccccccccEEEEEEEEEeCC-cCchhHH Confidence 553 3445555443322 35678998885 555668877665555533 34567889999999999864 4477889 Q ss_pred HHHHHhhhhhhhchhhhhhheeeeeecccccc----cccccceEEEE-EEEEEEE Q lcl|NC_019711. 79 AWMESRIYPVMSDIPALSDLITSMVASGYDYR----RDDDAGLWSSA-DLTYVIT 128 (131) Q Consensus 79 ~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~----rD~e~~tW~sa-dL~y~It 128 (131) +++++ |.-+........+.-=.+..+.-+.| .+.+..+++.. +|.|++. T Consensus 76 ~i~~~-i~~~~~~~~~t~~y~~~~~~~~~~~q~~~Dts~~~~L~Hgvi~l~f~~r 129 (129) T protein:vir:39 76 DMASN-IFNQALNISATDGYSWALNLQASTIQMMDDTTTGTPLKRAFINLEFRLR 129 (129) T ss_pred HHHHH-HHHHhcccccCCCeeEEEeecceeEEEecccCCCceeeeEEEEEEEEeC Confidence 99974 55444333322322111122222222 23455556554 7777777 No 29 >protein:vir:2741 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695114;genbank:gi:23455883;genbank:GeneID:955650 Probab=60.89 E-value=0.36 Score=22.99 Aligned_cols=121 Identities=13% Similarity=0.060 Sum_probs=71.3 Q ss_pred CccH--HHHHHHHHHHHhcCCceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHH Q lcl|NC_019711. 1 MKHT--ELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELD 78 (131) Q Consensus 1 ~~H~--~IR~~V~d~L~~~~~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD 78 (131) |+.. +|=+++...|++ -|..+||-.|. ++...|=|.+-=+...+. .+-|...|+..|.|-|+=.+ ..-.+++ T Consensus 1 M~sp~qeL~~~lf~~l~~--~g~~vyD~lP~--~~~~YPfV~ig~~~~~~~-~tkt~~~g~~~l~i~vW~~~-~~R~~v~ 74 (128) T protein:vir:27 1 MKQPDQLLHDEMYRISCE--LGYNTYTYLPP--DDAAYPFVVMGETMVLPQ-STKSHLIGRLSSTVHVWGHV-DDRKTLS 74 (128) T ss_pred CCCHHHHHHHHHHHHHHh--cCCceeccCCC--CCCCcCEEEeccceecCC-ccccccccEEEEEEEEEECC-cchhHHH Confidence 8854 444555555544 36779999888 656678776655544432 24567779999999999864 4578899 Q ss_pred HHHHHhhhhhhhchhhhhhheeeeeecccccc----cccccceEEEE-EEEEEEE Q lcl|NC_019711. 79 AWMESRIYPVMSDIPALSDLITSMVASGYDYR----RDDDAGLWSSA-DLTYVIT 128 (131) Q Consensus 79 ~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~----rD~e~~tW~sa-dL~y~It 128 (131) ++++ +|..++.......+..=.+..+.-+.+ ++.+..+++.. +|.|+|= T Consensus 75 ~i~~-~i~~~~~~~~~t~~y~~~~~~~~~~~qil~Dtst~~~l~Hgii~l~f~~~ 128 (128) T protein:vir:27 75 DMAG-QLMSSFFAIKKIGGKQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHH-HHHHHhccccccCCeeEEEEeecceEEEeeecCCCceeeEEEEEEEEEeC Confidence 9997 466555444333322111222222222 44555556544 5555555 No 30 >protein:vir:99226 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950461;genbank:gi:119953662;genbank:GeneID:4643086 Probab=58.45 E-value=0.41 Score=22.69 Aligned_cols=128 Identities=14% Similarity=0.148 Sum_probs=64.6 Q ss_pred CccHHHHHHHHHHHHhcCCceE-EEeC-cceEEccccCcEEEEEee--CCcc----Cc-----eeec-CCceEEEEEEEE Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGAT-FFDG-RPAVFDEADFPAVAVYLT--GAEY----TG-----EELD-SDTWQAELHIEV 66 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~-~f~G-rP~fide~elPAVAVyl~--da~~----~~-----~~ld-~~~w~A~LhI~i 66 (131) |=+.+.|.+|+++||+..|+.. ++-. -.+-+.+...++-|||+- +-+. +. ..-. ..+|.-+|-+.= T Consensus 5 ~d~~a~~~~IierLka~vp~l~~V~~aadla~i~~~~q~tPaayVi~~gd~~~~~~~~~~~~~~~Q~i~q~~~Vvlavr~ 84 (157) T protein:vir:99 5 FDYLFLEPLLIERIRSEVPGLAIVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADHQGGRRAIQAIGQQWAVVLVVHY 84 (157) T ss_pred hhhhhhhHHHHHHHHhhhhHHHhhhcccchHHHhhccCCCcEEEEEecccccCCCcccccccceeeeeeeeEEEEEEEec Confidence 6678899999999998888754 3222 122334445555666662 2111 11 1111 234555444433 Q ss_pred EeecCCChhHHHHHHH---HhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 67 FLPAQVPDSELDAWME---SRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 67 yLk~~~~d~~LD~~~E---~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) +=....+...+|+..+ +.+..+++-.|.-. ..-+.+..= =.+..-...|.==-|.|+|..-| T Consensus 85 ~~~~~~g~~a~d~ag~ll~~v~~AL~GW~P~~~--~~pl~~~~~-~~~~~y~~gf~yypl~F~~~~~~ 149 (157) T protein:vir:99 85 ADSSNSGEGARREAGPLLGRLVKALTGWAPAID--VAPLARSAR-QSPVTYASGYFYFPLVFTARFVY 149 (157) T ss_pred cccccccchhHHHHHHHHHHHHHHhcCCcCccc--CCceeeeec-CCcccccCceEEEEEEEEEeeec Confidence 3222233445555443 33334444444311 122222110 02334456688888999998888 No 31 >protein:vir:3618 Length: 129 # NCBI annotation: ORF41 # Family: family:all:504 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112704;genbank:gi:13786572;genbank:GeneID:921070 Probab=57.28 E-value=0.44 Score=22.55 Aligned_cols=121 Identities=20% Similarity=0.212 Sum_probs=68.4 Q ss_pred CccHHHHHHHHHHHHhcC--CceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHD--TGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELD 78 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~--~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD 78 (131) |+. -.+++-+.|=+.+ -|..+||-.|. ++...|-|.+-=++..+. .+-+.-.|+..|.|-|+=. ...-.+.+ T Consensus 2 mks--p~qeL~d~~f~~l~~lG~~vyD~lP~--~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~-~~~R~~v~ 75 (129) T protein:vir:36 2 IKT--RDQSIFDELFKRIQALGYTVYDYKPM--NEVGYPFVELENTQTIHE-ANKTDIKGTVSLSLSVWGL-QKKRKEVS 75 (129) T ss_pred CcC--hhHHHHHHHHHHHHhcCCeeeeccCC--CCCCcCEEEeeeeeecCC-ccccccccEEEEEEEEEeC-CcCchhHH Confidence 553 4455555543332 35668998887 555668776665555533 3456778999999999944 45678899 Q ss_pred HHHHHhhhhhhhchhhhhhhee--eeeeccccccccc--ccceEEE-EEEEEEEE Q lcl|NC_019711. 79 AWMESRIYPVMSDIPALSDLIT--SMVASGYDYRRDD--DAGLWSS-ADLTYVIT 128 (131) Q Consensus 79 ~~~E~~i~~v~~~~~~l~~li~--~~~~~gy~Y~rD~--e~~tW~s-adL~y~It 128 (131) +++++ |.-++.......+..= ...-+..+-..|. +...++. .+|.|++. T Consensus 76 ~i~~~-i~~~~~~~~~t~~y~~~~~~~~~~~q~~~D~st~~~L~Hgii~l~f~~r 129 (129) T protein:vir:36 76 DMASN-IFNQALNISATDGYSWALNSQASTIQMLDDTTTNTPLKRALINLEFRLR 129 (129) T ss_pred HHHHH-HHHHhcccccCCCeEEEEEeeeeeEEEeccCCCCceeeEEEEEEEEEeC Confidence 99974 6555444332222211 1122222222332 2223444 47777777 No 32 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=56.11 E-value=0.46 Score=22.41 Aligned_cols=119 Identities=11% Similarity=0.137 Sum_probs=74.2 Q ss_pred CccHHHHHHHHHHHHhcCCceE-EEeCcceEEcccc-CcEEEEEeeCCccCceeecCCc--eEEEEEEEEEeecCCChhH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGAT-FFDGRPAVFDEAD-FPAVAVYLTGAEYTGEELDSDT--WQAELHIEVFLPAQVPDSE 76 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~-~f~GrP~fide~e-lPAVAVyl~da~~~~~~ld~~~--w~A~LhI~iyLk~~~~d~~ 76 (131) .+...+|++|.+.+++..++.. -|-|. +|-.+++ -+=+.+|.-.+.....+|+.+. +.+...|.|..|+..+..+ T Consensus 41 ei~~a~rk~l~~~a~a~~~~LpVA~ENV-aFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~y~GVfQIsVV~PaGtG~~k 119 (169) T protein:vir:10 41 EMMVAARKLVSDAAVDIAGSLPVAYENC-GFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRYYVGMVQVSIFFSPGEGTDR 119 (169) T ss_pred HHHHHHHHHHHHHHhhcccCCcEeeCCC-CcCCCCCCccEEEEEEecCCceeeeccCCCceEEEEEEEEEEecCCCCcch Confidence 5567789999999988544422 22222 4444344 4788888888888888887655 9999999999999999988 Q ss_pred HHHHHHHhhhhhhhchhhhhhheeeeeeccccccccc------ccceEEEEEEEEEEEEe Q lcl|NC_019711. 77 LDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDD------DAGLWSSADLTYVITYE 130 (131) Q Consensus 77 LD~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~------e~~tW~sadL~y~ItY~ 130 (131) .-.+|.+ |.....+-.. --+||=|+--. ..+-|..- +.|+..+. T Consensus 120 a~qiAde-iadlF~~gt~--------L~~Gyi~~~~~~~p~i~~~s~~~iP-vr~~~R~D 169 (169) T protein:vir:10 120 PRQLAGR-LSEAFADGTM--------LDSGYIYEGGSVFPPVKSQSGWFIP-VRFYVRMD 169 (169) T ss_pred hHHHHHH-HHHhhhCCce--------eeceeecCCCeECCeeecCCceEEe-EEEEEEeC Confidence 8888875 4444444322 22455554322 22334322 22222222 No 33 >protein:vir:96485 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238497;genbank:gi:66391773;genbank:GeneID:5176907 Probab=46.66 E-value=0.73 Score=21.34 Aligned_cols=120 Identities=13% Similarity=0.095 Sum_probs=67.9 Q ss_pred CccH--HHHHHHHHHHHhcCCceEEEeCcceEEccccCcEEEEEeeCCccCce-eecCCceEEEEEEEEEeecCCChhHH Q lcl|NC_019711. 1 MKHT--ELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGE-ELDSDTWQAELHIEVFLPAQVPDSEL 77 (131) Q Consensus 1 ~~H~--~IR~~V~d~L~~~~~~v~~f~GrP~fide~elPAVAVyl~da~~~~~-~ld~~~w~A~LhI~iyLk~~~~d~~L 77 (131) |+.. ++=+++...|++ -|..+||-.|- ++...|=|-+- +.+.++. +-++..++..|.|-|+=. ...-.+. T Consensus 1 m~sp~qeL~d~~f~~l~~--~g~~vyd~lP~--~~v~YPfV~ig--~~~~~~~~tKt~~~g~v~ltihVW~~-~~~R~~v 73 (128) T protein:vir:96 1 MKQPDQLLHDEMYRISSG--LGYDTYTYLPP--EGAAYPFVVMG--ETMVLPQSTKSHLIGRLSSTVHVWGR-VDDRKTL 73 (128) T ss_pred CCCHHHHHHHHHHHHHHh--cCCeeecccCC--CCCCCCEEEEe--eeeecCCccccccccEEEEEEEEEEC-CCCchhH Confidence 8854 444555555554 36679999984 45566887665 4444433 345777999999999955 4457888 Q ss_pred HHHHHHhhhhhhhchhhhhhheeeeeecccccc--cc--cccceEEEE-EEEEEEE Q lcl|NC_019711. 78 DAWMESRIYPVMSDIPALSDLITSMVASGYDYR--RD--DDAGLWSSA-DLTYVIT 128 (131) Q Consensus 78 D~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~--rD--~e~~tW~sa-dL~y~It 128 (131) +++++ +|..++.......+.-=.+....-+.+ .| .++.+++.. +|.|+|= T Consensus 74 ~~i~~-~i~~~l~~~~~t~~y~~~~~~~~~~~qii~D~st~~~l~Hgil~l~f~~~ 128 (128) T protein:vir:96 74 SDMAG-QLMSSFFTIKNIDGMQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHHH-HHHHHhhhhhccCCeEEEEEEeeeeEEEeeecCCCceeeEEEEEEEEEeC Confidence 99997 465555433222211101111222222 23 344455443 6666555 No 34 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=44.68 E-value=0.8 Score=21.12 Aligned_cols=124 Identities=16% Similarity=0.128 Sum_probs=65.4 Q ss_pred Ccc---HHHHHHHHHHHHhcCCceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeec--CCChh Q lcl|NC_019711. 1 MKH---TELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPA--QVPDS 75 (131) Q Consensus 1 ~~H---~~IR~~V~d~L~~~~~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~--~~~d~ 75 (131) |.+ ++.+.||.+.||+.+|+++..+=.|.--..=..|||.|=|.+.+... ....+++...|.++.++-. ..+.. T Consensus 1 mt~~~l~~lh~AI~~~Lk~~~p~l~~~~~y~~~~~~i~~PAv~vel~~~~~~~-d~~tGq~~~~~~~~a~~vv~~~~~~~ 79 (182) T protein:vir:10 1 MSQTTITEVHEAIKAKLRETFPKVTVDDYNPEPELSVLAPALLLELEEFPMGA-DVGDDRYPAACRFSVHCVLGWEVKSL 79 (182) T ss_pred CCcCCHHHHHHHHHHHHHHhcCCceeeecCccccCccccceeeeeeecCCcCC-CCCCCcEEEEEEEEEEEEecccCCCc Confidence 554 56699999999999999886655553222224599999998888654 3456778888888887765 35555 Q ss_pred HHHHHH--HHhhhhhhhchhhhhhh-eeeeeec-----cccc-ccc--cccceEEEEEEEEEEEEeC Q lcl|NC_019711. 76 ELDAWM--ESRIYPVMSDIPALSDL-ITSMVAS-----GYDY-RRD--DDAGLWSSADLTYVITYEM 131 (131) Q Consensus 76 ~LD~~~--E~~i~~v~~~~~~l~~l-i~~~~~~-----gy~Y-~rD--~e~~tW~sadL~y~ItY~~ 131 (131) +|..++ .+ +--.+..-. =+| .+.+++- +-++ ... +.-..|... |+=+--+ T Consensus 80 ~~~~~~lAa~-l~~~v~~~~--wGL~~~~v~~a~~i~a~p~~f~~~~~dgy~vW~Ve---W~Q~i~L 140 (182) T protein:vir:10 80 ALELWEFSAA-VAQLIRKSG--VWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVT---WNQTLYL 140 (182) T ss_pred hHHHHHHHHH-HHHHHhcCc--ccCCccccCccceeeeccCccChhhcCceEEEEEE---EEEEEee Confidence 665443 21 111111111 122 1112211 1122 111 222455432 1111111 No 35 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=32.19 E-value=1.4 Score=19.71 Aligned_cols=127 Identities=18% Similarity=0.212 Sum_probs=63.3 Q ss_pred CccHHHHHHHHHHHHhcCCceE-EEeCcc--eEEccccCcEEEEEee--CCcc-Ccee--ecCCceE-EEEEEEEEeec- Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHDTGAT-FFDGRP--AVFDEADFPAVAVYLT--GAEY-TGEE--LDSDTWQ-AELHIEVFLPA- 70 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~~~v~-~f~GrP--~fide~elPAVAVyl~--da~~-~~~~--ld~~~w~-A~LhI~iyLk~- 70 (131) |=+...|.+|+++||+..|+.+ ++ |-+ +-|.+..+|+-|+|+- +-+. ++.. -..+.|| .+.|.+|-|-- T Consensus 5 ~d~~a~~~~IierLka~v~~l~~V~-~aadla~i~e~~q~tPaayVv~~gd~~~~~~~~~~~~~~~Q~vtq~f~Vvlavr 83 (157) T protein:vir:79 5 FDYLFLEPLLIERIRSEVPGLAIVS-GVPDLAALSEQDQPAPSVYVVYLGDEIGTGADYQGGRRAIQAIGQQWAVVLVVH 83 (157) T ss_pred hhhhhhhHHHHHHHHhhhhhhhhhc-cccchhhhhhhcCCCcEEEEEecccccCCCcccccCcceeeeeeeeEEEEEEEe Confidence 6678899999999998888754 32 322 2234444544556652 1111 1111 0112344 44555554442 Q ss_pred -----CCChhHHHHHHH---HhhhhhhhchhhhhhheeeeeecccccccccccceEEEEEEEEEEEEeC Q lcl|NC_019711. 71 -----QVPDSELDAWME---SRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) Q Consensus 71 -----~~~d~~LD~~~E---~~i~~v~~~~~~l~~li~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) ..+....|+-.+ +.+..+++-.|.-. ..-+.+..=. .+..-...|.==-|.|+|..-| T Consensus 84 n~~~~~~~~a~~d~ag~ll~~v~~AL~GW~P~~~--~~pl~~~~~~-~~~~y~~gf~yypl~F~~~~~~ 149 (157) T protein:vir:79 84 YADSSNSGEGARREAGPLLGRLVKALTGWAPAID--VAPLARSARQ-SPVTYASGYFYFPLVFTARFVY 149 (157) T ss_pred ccccccccchhHHHHHHHHHHHHHHhcCcccccc--CCceeeeecC-CcccccCCeEEEEEEEEEeeec Confidence 223334444433 33334444444311 1222222100 3344456677778999998888 No 36 >protein:vir:744 Length: 129 # NCBI annotation: major structural protein 2 # Family: family:all:504 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108721;genbank:gi:13487843;genbank:GeneID:920879 Probab=25.25 E-value=2.1 Score=18.84 Aligned_cols=121 Identities=18% Similarity=0.205 Sum_probs=67.7 Q ss_pred CccHHHHHHHHHHHHhcC--CceEEEeCcceEEccccCcEEEEEeeCCccCceeecCCceEEEEEEEEEeecCCChhHHH Q lcl|NC_019711. 1 MKHTELRAAVLDALEKHD--TGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELD 78 (131) Q Consensus 1 ~~H~~IR~~V~d~L~~~~--~~v~~f~GrP~fide~elPAVAVyl~da~~~~~~ld~~~w~A~LhI~iyLk~~~~d~~LD 78 (131) |+. -.+++-+.|=+.+ -|..+||-.|. ++..-|-|-+-=++.... .+-|.-.++..|.|-|+=. ...-.+.+ T Consensus 2 mks--p~qeL~d~~~~~l~~lG~~vyD~lP~--~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~-~~~R~~v~ 75 (129) T protein:vir:74 2 IKT--RDQSIFDELFKRIQALGYTVYDYKPM--NEVGYPFVELENTQTIHE-ANKTDIKGTVSLSLSVWGL-QKKRKEVS 75 (129) T ss_pred CcC--hhHHHHHHHHHHHHhcCCeeeeccCC--CCCCcCEEEeeeeeecCC-ccccccccEEEEEEEEeeC-CccchhHH Confidence 553 4455555543332 25668998887 555568776665555533 3456778999999999944 44578899 Q ss_pred HHHHHhhhhhhhchhhhhhheeeeee--ccccc--ccccccceEEEE-EEEEEEE Q lcl|NC_019711. 79 AWMESRIYPVMSDIPALSDLITSMVA--SGYDY--RRDDDAGLWSSA-DLTYVIT 128 (131) Q Consensus 79 ~~~E~~i~~v~~~~~~l~~li~~~~~--~gy~Y--~rD~e~~tW~sa-dL~y~It 128 (131) +++++ |.-++.......+..=.+.. +..+= +.+.+..+++.. +|.|++. T Consensus 76 ~i~~~-i~~~~~~~~~t~~y~~~~~~~~~~~q~~~Dtst~~~L~Hgvi~l~f~~r 129 (129) T protein:vir:74 76 DMASN-IFNQALNISATDGYSWALNSQASTIQMLDDTTTHTPLKRALINLEFRLR 129 (129) T ss_pred HHHHH-HHHHhccccccCCcEEEEeecceeEEEcccCCCCceeeeEEEEEEEEeC Confidence 99974 55444433322222111111 11111 112445556554 7777766 No 37 >protein:vir:104224 Length: 161 # NCBI annotation: Hypothetical protein # Family: family:all:27047 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006974;genbank:gi:46401875;genbank:GeneID:2777635 Probab=20.01 E-value=2.8 Score=18.09 Aligned_cols=126 Identities=24% Similarity=0.391 Sum_probs=77.2 Q ss_pred Ccc-HHHHHHHHHHHHhcCCce---EEEeCcc------eE-Ec-cccCcEEEEEeeCCccCceeec-CCceE-EEEEEEE Q lcl|NC_019711. 1 MKH-TELRAAVLDALEKHDTGA---TFFDGRP------AV-FD-EADFPAVAVYLTGAEYTGEELD-SDTWQ-AELHIEV 66 (131) Q Consensus 1 ~~H-~~IR~~V~d~L~~~~~~v---~~f~GrP------~f-id-e~elPAVAVyl~da~~~~~~ld-~~~w~-A~LhI~i 66 (131) |-| +-|-++.+|++.+...|- .+||+.- ++ ++ -.|+|-|||+|- .-+|.+|- +.+|- -.|-|-| T Consensus 1 mdhrtsiaqamvdriskqmdgsqpdeyfnnlygnvsrqtykfeeirefpyvavhig--tetgqylpsgqqwmflelpilv 78 (161) T protein:vir:10 1 MDHRTSIAQAMVDRISKQMDGSQPDEYFNNLYGNVSRQTYKFEEIREFPYVAVHIG--TETGQYLPSGQQWMFLELPILV 78 (161) T ss_pred CcchhHHHHHHHHHHHhhcCCCCchhhhhhhhcccccceechhhhhhCceeEEEec--ccccccccCCceeEEeecceeE Confidence 886 679999999997765542 3666432 22 12 359999999994 45677776 45575 4788889 Q ss_pred Eeec-CCChhHHHHHHHHhhhhhhhchhhhhhheeeeeeccccccc------------cc-ccceEEEEEEEEEEEEeC Q lcl|NC_019711. 67 FLPA-QVPDSELDAWMESRIYPVMSDIPALSDLITSMVASGYDYRR------------DD-DAGLWSSADLTYVITYEM 131 (131) Q Consensus 67 yLk~-~~~d~~LD~~~E~~i~~v~~~~~~l~~li~~~~~~gy~Y~r------------D~-e~~tW~sadL~y~ItY~~ 131 (131) |=|- +.-...|.+++. -|..++..-..|.-.++ .+.|-.+.- |+ --+-++-|.+.-.+.|.- T Consensus 79 ydkektdiqeqleklva-diktvidtggnleytvs--kpngstfpceatdmiitsvstdegllapyglaeinvtvryqp 154 (161) T protein:vir:10 79 YDKEKTDIQEQLEKLVA-DIKTVIDTGGNLEYTVS--KPNGSTFPCEATDMIITSVSTDEGLLAPYGLAEINVTVRYQP 154 (161) T ss_pred eccccccHHHHHHHHHH-HHHHHhhCCCceeEEEe--CCCCccccccccceeeeeecccccccccccceeeeEEEEecC Confidence 9887 556666777764 45555544433331111 222333321 11 124467777878888877 Done!