Query lcl|NC_020854.1_cdsid_YP_007675019.1 [gene=CPKG_00046] [protein=hypothetical protein] [protein_id=YP_007675019.1] [location=26506..27066] Match_columns 186 No_of_seqs 80 out of 83 Neff 5.9 Searched_HMMs 1612 Date Thu Nov 7 16:02:32 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_46 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_46_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95004 Length: 169 100.0 8.1E-60 5.1E-63 344.5 15.8 164 1-186 1-169 (169) 2 protein:vir:78383 Length: 169 100.0 1.5E-59 9.3E-63 343.1 15.8 164 1-186 1-169 (169) 3 protein:vir:95176 Length: 172 100.0 8.1E-59 5.1E-62 339.0 15.7 164 1-186 3-170 (172) 4 protein:vir:80389 Length: 172 100.0 4.4E-58 2.7E-61 335.0 16.5 162 1-186 1-172 (172) 5 protein:vir:94955 Length: 170 100.0 3.5E-56 2.2E-59 324.6 15.8 163 1-186 1-170 (170) 6 protein:vir:97267 Length: 172 100.0 9.5E-54 5.9E-57 311.3 15.4 163 1-186 1-172 (172) 7 protein:vir:98900 Length: 132 98.3 2.3E-08 1.4E-11 62.5 10.2 125 15-181 1-132 (132) 8 protein:vir:80967 Length: 131 98.0 6.7E-08 4.1E-11 59.9 8.9 125 15-184 1-131 (131) 9 protein:vir:43 Length: 131 # N 98.0 6E-08 3.7E-11 60.2 8.7 124 15-184 1-131 (131) 10 protein:vir:2505 Length: 128 # 95.4 0.00022 1.4E-07 40.6 7.7 123 11-186 1-123 (128) 11 protein:vir:80320 Length: 188 94.7 0.0015 9.3E-07 36.1 10.1 144 1-181 1-188 (188) 12 protein:vir:1435 Length: 188 # 94.1 0.0025 1.6E-06 34.9 10.2 143 1-181 1-188 (188) 13 protein:vir:7857 Length: 188 # 93.7 0.0016 1E-06 35.8 8.4 137 20-179 1-188 (188) 14 protein:vir:101652 Length: 188 93.7 0.0016 1E-06 35.8 8.4 137 20-179 1-188 (188) 15 protein:vir:9576 Length: 131 # 93.5 0.0061 3.8E-06 32.7 11.0 123 14-186 1-128 (131) 16 protein:vir:99002 Length: 158 92.3 0.0096 5.9E-06 31.7 10.5 121 14-186 1-122 (158) 17 protein:vir:94761 Length: 132 92.3 0.011 6.8E-06 31.3 10.8 123 14-186 1-129 (132) 18 protein:vir:4788 Length: 130 # 92.1 0.0028 1.7E-06 34.6 7.3 122 15-185 1-130 (130) 19 protein:vir:79701 Length: 144 90.9 0.0083 5.2E-06 32.0 8.6 132 14-181 1-144 (144) 20 protein:vir:9821 Length: 138 # 89.0 0.011 7.1E-06 31.2 7.8 123 1-185 3-138 (138) 21 protein:vir:103283 Length: 125 88.7 0.007 4.3E-06 32.4 6.4 87 77-186 1-118 (125) 22 protein:vir:9761 Length: 140 # 86.3 0.047 2.9E-05 27.9 10.0 121 14-186 1-134 (140) 23 protein:vir:1640 Length: 132 # 84.9 0.057 3.5E-05 27.4 10.8 123 14-186 1-129 (132) 24 protein:vir:107756 Length: 147 84.7 0.058 3.6E-05 27.4 9.8 126 1-186 1-140 (147) 25 protein:vir:5256 Length: 119 # 84.1 0.064 3.9E-05 27.1 10.2 109 17-182 1-119 (119) 26 protein:vir:100245 Length: 113 83.7 0.045 2.8E-05 28.0 8.1 113 15-179 1-113 (113) 27 protein:vir:1993 Length: 141 # 76.5 0.09 5.6E-05 26.3 7.3 123 15-174 1-141 (141) 28 protein:vir:103846 Length: 138 76.5 0.11 6.8E-05 25.9 7.8 118 15-186 1-129 (138) 29 protein:vir:100103 Length: 120 75.0 0.15 9.4E-05 25.1 9.2 120 11-180 1-120 (120) 30 protein:vir:486 Length: 107 # 74.7 0.16 9.6E-05 25.0 8.2 104 16-178 1-107 (107) 31 protein:vir:4512 Length: 107 # 74.4 0.14 8.6E-05 25.3 7.8 107 16-178 1-107 (107) 32 protein:vir:1887 Length: 108 # 73.1 0.17 0.00011 24.7 8.4 102 1-180 1-108 (108) 33 protein:vir:192 Length: 108 # 73.1 0.17 0.00011 24.7 8.4 102 1-180 1-108 (108) 34 protein:vir:99222 Length: 138 71.4 0.028 1.7E-05 29.1 3.2 115 15-186 1-129 (138) 35 protein:vir:79253 Length: 138 71.4 0.028 1.7E-05 29.1 3.2 115 15-186 1-129 (138) 36 protein:vir:107702 Length: 136 70.9 0.18 0.00011 24.7 7.5 120 12-186 1-130 (136) 37 protein:vir:104344 Length: 132 70.2 0.16 9.8E-05 25.0 7.0 116 14-186 1-124 (132) 38 protein:vir:78254 Length: 149 69.1 0.23 0.00014 24.1 8.2 116 15-186 1-125 (149) 39 protein:vir:78478 Length: 149 69.1 0.23 0.00014 24.1 8.2 116 15-186 1-125 (149) 40 protein:vir:10365 Length: 115 68.8 0.23 0.00014 24.1 8.5 114 17-182 1-115 (115) 41 protein:vir:97069 Length: 115 66.7 0.26 0.00016 23.8 8.7 113 17-182 1-115 (115) 42 protein:vir:2432 Length: 124 # 65.9 0.28 0.00017 23.6 7.6 119 15-186 1-119 (124) 43 protein:vir:107864 Length: 150 65.3 0.053 3.3E-05 27.6 3.4 123 15-173 1-150 (150) 44 protein:vir:5742 Length: 110 # 63.0 0.32 0.0002 23.3 8.9 108 1-178 1-110 (110) 45 protein:vir:80036 Length: 111 61.5 0.35 0.00022 23.1 7.3 109 14-186 1-111 (111) 46 protein:vir:81069 Length: 115 60.7 0.37 0.00023 23.0 8.6 113 17-182 1-115 (115) 47 protein:vir:79640 Length: 134 55.3 0.48 0.0003 22.3 7.7 117 14-186 1-127 (134) 48 protein:vir:78849 Length: 110 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 49 protein:vir:103957 Length: 110 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 50 protein:vir:97145 Length: 110 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 51 protein:vir:96390 Length: 110 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 52 protein:vir:99796 Length: 110 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 53 protein:vir:9311 Length: 110 # 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 54 protein:vir:96221 Length: 110 55.1 0.49 0.0003 22.3 9.4 110 17-184 1-110 (110) 55 protein:vir:79074 Length: 150 49.9 0.63 0.00039 21.7 8.1 118 15-186 1-138 (150) 56 protein:vir:4458 Length: 107 # 47.6 0.7 0.00043 21.4 8.3 107 16-178 1-107 (107) 57 protein:vir:7773 Length: 123 # 46.1 0.75 0.00046 21.3 8.4 118 15-186 1-118 (123) 58 protein:vir:3639 Length: 158 # 45.6 0.33 0.0002 23.2 4.3 101 74-186 1-150 (158) 59 protein:vir:101559 Length: 158 45.6 0.33 0.0002 23.2 4.3 101 74-186 1-150 (158) 60 protein:vir:94507 Length: 113 43.2 0.86 0.00053 21.0 9.9 113 17-184 1-113 (113) 61 protein:vir:93592 Length: 108 41.7 0.92 0.00057 20.8 9.3 108 14-179 1-108 (108) 62 protein:vir:106583 Length: 105 41.0 0.94 0.00059 20.7 9.7 105 17-181 1-105 (105) 63 protein:vir:2738 Length: 112 # 39.2 1 0.00064 20.5 10.0 111 1-184 1-112 (112) 64 protein:vir:104088 Length: 125 37.1 1.1 0.0007 20.3 6.3 120 15-186 1-120 (125) 65 protein:vir:96108 Length: 155 35.9 1.2 0.00075 20.1 6.5 98 79-186 1-149 (155) 66 protein:vir:106739 Length: 158 29.6 1.2 0.00072 20.2 4.6 101 74-186 1-150 (158) 67 protein:vir:78595 Length: 158 29.6 1.2 0.00072 20.2 4.6 101 74-186 1-150 (158) 68 protein:vir:98481 Length: 136 29.6 1.6 0.001 19.4 7.8 111 14-186 1-119 (136) 69 protein:vir:78068 Length: 178 29.2 1.7 0.001 19.4 6.8 145 16-184 1-178 (178) 70 protein:vir:3034 Length: 111 # 28.5 1.1 0.00067 20.4 4.2 99 50-185 1-111 (111) 71 protein:vir:96488 Length: 113 28.1 1.8 0.0011 19.2 10.0 112 17-184 1-113 (113) 72 protein:vir:99848 Length: 172 27.3 1.6 0.001 19.4 5.0 121 1-173 1-172 (172) 73 protein:vir:107119 Length: 104 25.5 0.7 0.00044 21.4 2.6 104 18-183 1-104 (104) 74 protein:vir:105327 Length: 104 25.5 0.7 0.00044 21.4 2.6 104 18-183 1-104 (104) 75 protein:vir:94064 Length: 167 20.9 2.7 0.0017 18.2 5.7 97 79-186 1-145 (167) 76 protein:vir:95891 Length: 104 20.2 1.1 0.00067 20.4 2.5 104 18-183 1-104 (104) 77 protein:vir:96281 Length: 104 20.2 1.1 0.00067 20.4 2.5 104 18-183 1-104 (104) 78 protein:vir:97329 Length: 104 20.2 1.1 0.00067 20.4 2.5 104 18-183 1-104 (104) No 1 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=100.00 E-value=8.1e-60 Score=344.53 Aligned_cols=164 Identities=20% Similarity=0.246 Sum_probs=152.0 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh--ccCcccCCcccccccCc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE--RYLGARATDTQALQWPR 78 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~--~~~G~r~~~~Q~laWPR 78 (186) |+||||+|+|+|+||||+|++||++||+.|+ .|.++++++||++|++|++|||++ +|+|+|++++|+|+||| T Consensus 1 M~liv~~~~g~~~anSYvt~~ea~aY~~~rg------~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPR 74 (169) T protein:vir:95 1 MPLIVETGQGLPNADSYVSLEDGRALAAKYG------LELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPR 74 (169) T ss_pred CeeEEeCCCCCCcccccccHHHHHHHHHHcC------CcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhcccc Confidence 9999999999999999999999999999986 489999999999999999999986 79999999999999999 Q ss_pred cCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEec-CeeEEeecCCCCc Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKI-GSIDVTPNQYGAT 157 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kv-G~isveY~~~~~~ 157 (186) +|+ ++++++++++.||++||+||||||++++++++.+++.+++.|+++++ |+|+|||+.+++. T Consensus 75 tg~----------------~~~g~~~~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~ 138 (169) T protein:vir:95 75 TGI----------------DLHGFPQPSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYS 138 (169) T ss_pred CCc----------------eecccccccccchHHHHHHHHHHHHHHHcCccccCCCCccceeeeeeccceeEeecCCCCc Confidence 997 46889999999999999999999999999998888888889998876 9999999998888 Q ss_pred CcccchHHHHHHHhhhhccCCce--eeeeeC Q lcl|NC_020854. 158 GADRIPPMVERYLTGLRISGPGN--IAVKRS 186 (186) Q Consensus 158 ~~~~~~~~v~~lL~~l~~~~~g~--~~~~r~ 186 (186) +..+.|+++++||+|||++++|+ ++++|- T Consensus 139 ~~~~~~~a~~~LL~p~l~g~~g~~~i~~~rg 169 (169) T protein:vir:95 139 GGTVSITAADDALRPLLCGSNNAYSFNVFRG 169 (169) T ss_pred CccccHHHHHHhhhhhcccCCCcceeeeecC Confidence 88899999999999999998874 555566 No 2 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=100.00 E-value=1.5e-59 Score=343.07 Aligned_cols=164 Identities=21% Similarity=0.257 Sum_probs=151.7 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh--ccCcccCCcccccccCc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE--RYLGARATDTQALQWPR 78 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~--~~~G~r~~~~Q~laWPR 78 (186) ||||||+|+|+|+||||+|++||++||+.|+ .|+++++++|+++|++|++|||++ +|+|+|++++|+|+||| T Consensus 1 MaliV~~~~g~~~anSYvtv~~a~aY~~~rg------~~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPR 74 (169) T protein:vir:78 1 MPLIVETGQGIPNADSYVSLEDGRALAAKYG------LELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPR 74 (169) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcC------CcCCCChHHHHHHHHHHHHHhhhccccceeeeCCccccccccc Confidence 9999999999999999999999999999886 488999999999999999999986 89999999999999999 Q ss_pred cCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEec-CeeEEeecCCCCc Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKI-GSIDVTPNQYGAT 157 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kv-G~isveY~~~~~~ 157 (186) +|+ .++|+++|++.||.+||+||||||++++++++.+++.+.+.|++|+| |+|+|||+.+++. T Consensus 75 tg~----------------~~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~ 138 (169) T protein:vir:78 75 TGV----------------TLHGFPQPSNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYS 138 (169) T ss_pred CCc----------------eecccccccccchHHHHHHHHHHHHHHhcCcccCCCCCcceeEEEEecCceeEeecCCCCC Confidence 997 36788999999999999999999999999988888888889999988 9999999998888 Q ss_pred CcccchHHHHHHHhhhhccCCceee--eeeC Q lcl|NC_020854. 158 GADRIPPMVERYLTGLRISGPGNIA--VKRS 186 (186) Q Consensus 158 ~~~~~~~~v~~lL~~l~~~~~g~~~--~~r~ 186 (186) +..+.|+++++||+||+++++|+|. ++|- T Consensus 139 ~~~~~~~~~~~LL~p~l~~~~g~~~i~~~rg 169 (169) T protein:vir:78 139 GGTVSITTADDALRPLLCGSNNAYSFNVFRG 169 (169) T ss_pred CCcccHHHHHHHhhhhcccCCCcceeeeecC Confidence 8889999999999999999887554 5555 No 3 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=100.00 E-value=8.1e-59 Score=339.05 Aligned_cols=164 Identities=25% Similarity=0.303 Sum_probs=148.3 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhh--hccCcccCCcccccccCc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDR--ERYLGARATDTQALQWPR 78 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~--~~~~G~r~~~~Q~laWPR 78 (186) |+||||||+|+|+||||+|++||++||+.|+ .|+++++++||++|++|++|||+ ++|+|+|++++|+|+||| T Consensus 3 Malive~~~g~~~anSYvtv~ea~aY~~~rg------~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR 76 (172) T protein:vir:95 3 ITIVVEDGSGVTNANSYVSVADARIYASNRG------VELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPR 76 (172) T ss_pred eeEEEeCCCCCCcccccccHHHHHHHHHhcC------CcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCc Confidence 9999999999999999999999999999884 59999999999999999999995 799999999999999999 Q ss_pred cCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCC-CCcccceeEEecCeeEEeecCCCCc Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIG-LSGLEDYKNVKIGSIDVTPNQYGAT 157 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~-~~~~~~v~~~kvG~isveY~~~~~~ 157 (186) +|+. +++++++++.||++||+||||||++++++++..+ ..+.+.||+||||+|+|||+.+++. T Consensus 77 ~g~~----------------~~~~~v~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~ 140 (172) T protein:vir:95 77 TGVF----------------LNEDEVPSNVIPKSLIAAQVQLTMAINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSV 140 (172) T ss_pred CCcc----------------cCcccccccchhHHHHHHHHHHHHHHHcCccccccCCcccceeEEeccceEEeeccCCCC Confidence 9984 5678999999999999999999999999877544 3567889999999999999988888 Q ss_pred CcccchHHHHHHHhhhhccCCc-eeeeeeC Q lcl|NC_020854. 158 GADRIPPMVERYLTGLRISGPG-NIAVKRS 186 (186) Q Consensus 158 ~~~~~~~~v~~lL~~l~~~~~g-~~~~~r~ 186 (186) ++.+.|+++++||+||+++++| .|.|+.+ T Consensus 141 ~~~~~~~~v~~LL~p~l~~~~~~~~~~r~~ 170 (172) T protein:vir:95 141 GIMPTFTAANALLAPLFGECASNKFALRTI 170 (172) T ss_pred CCcccHHHHHHHHhhhhcccCCcceeeEEE Confidence 8889999999999999877664 4555555 No 4 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=100.00 E-value=4.4e-58 Score=335.03 Aligned_cols=162 Identities=22% Similarity=0.264 Sum_probs=146.4 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh--ccCcccCCcccccccCc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE--RYLGARATDTQALQWPR 78 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~--~~~G~r~~~~Q~laWPR 78 (186) |+|||||++|+|+||||+|++||++||+.| |...++++||++|++|+||||++ +|+|+|++++|+|+||| T Consensus 1 Malived~~g~~~anSYvt~~~a~aY~~~r--------g~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR 72 (172) T protein:vir:80 1 MALIVEDGTGKPDANTYAGADFVIAYAQAR--------GVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPR 72 (172) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHc--------CCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccc Confidence 999999999999999999999999999987 66788889999999999999995 69999999999999999 Q ss_pred cCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcC Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATG 158 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~ 158 (186) +|++ +++++++++.||.+||+||||||++++++++..+..++..|++||||+|++||+.+.+.+ T Consensus 73 ~g~~----------------~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~ 136 (172) T protein:vir:80 73 HDAV----------------VDGFVIPSDVIPKELQSAVAAAVIEQVNGFELQQSQDQWAVRIEKVDVIEVQYAAGGGGQ 136 (172) T ss_pred cCcc----------------cCcccccccchhHHHHHHHHHHHHHHhcCCccCcCCCCceeeEEeccceEEeeecccCcc Confidence 9974 678899999999999999999999999887888888888999999999999998765544 Q ss_pred -------cccchHHHHHHHhhhhccCCc-eeeeeeC Q lcl|NC_020854. 159 -------ADRIPPMVERYLTGLRISGPG-NIAVKRS 186 (186) Q Consensus 159 -------~~~~~~~v~~lL~~l~~~~~g-~~~~~r~ 186 (186) ..+.||+|++||+||+++++| .+.|+|- T Consensus 137 ~~~~~~~~~~~~~~v~~LL~p~l~~~gg~~~~~vrg 172 (172) T protein:vir:80 137 SASANAPMKPTFPKIDALLNPLLVGDGGLFLVAVRG 172 (172) T ss_pred ccccccCCccchHHHHHHHhhhhcCCCCeeeeeecC Confidence 356899999999999988665 6666667 No 5 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=100.00 E-value=3.5e-56 Score=324.58 Aligned_cols=163 Identities=26% Similarity=0.448 Sum_probs=146.3 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhh-hccCcccCCcccccccCcc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDR-ERYLGARATDTQALQWPRT 79 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~-~~~~G~r~~~~Q~laWPR~ 79 (186) |.+| |+|+|+|+||||+||+||++||+.|++. .+|.++++++||++|++|+||||+ ++|+|+|++++|+|+|||+ T Consensus 1 m~~i-~~~~g~~~AnSYvtv~ea~aY~~~r~~~---~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~ 76 (170) T protein:vir:94 1 MPTV-DATPGSITANSYVTVAEANSYFDGSYGR---PLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCK 76 (170) T ss_pred Ccee-ecCCCCCcccceecHHHHHHHHHhhccc---cccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhccccc Confidence 7665 9999999999999999999999999865 479999999999999999999997 7999999999999999999 Q ss_pred CccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCc Q lcl|NC_020854. 80 GVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGA 159 (186) Q Consensus 80 g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~ 159 (186) |+. +++++++++.||++||+||||||++++++++..+. ..+.||+||||+|+|||+.+++ . T Consensus 77 g~~----------------~dg~~~~~~~IP~~V~~Aq~elA~~~~~~~~~~~~-~~~~v~~~kVG~i~veY~~~~~--~ 137 (170) T protein:vir:94 77 NAV----------------IGGMTLSQVSIPVKVKIAVFELAYFMLESGAALSF-ADQTIDSVKVGTIRVEFTKNST--D 137 (170) T ss_pred Ccc----------------cCccccccchhhHHHHHHHHHHHHHHHhCcccCcc-cccceeeEecceeEEEecCCCC--C Confidence 973 67889999999999999999999999998876554 4478999999999999975433 3 Q ss_pred ccchHHHHHHHhhhhcc------CCceeeeeeC Q lcl|NC_020854. 160 DRIPPMVERYLTGLRIS------GPGNIAVKRS 186 (186) Q Consensus 160 ~~~~~~v~~lL~~l~~~------~~g~~~~~r~ 186 (186) .+.++.|+.||+||+.+ ++.+++|+|- T Consensus 138 ~~~~~~v~~LL~p~l~~~~~g~~~~~~~~~~r~ 170 (170) T protein:vir:94 138 AGLPTFVEAMLSGFGSPVLYGSNAARSIDLVRA 170 (170) T ss_pred CccHHHHHHHhhhhhccccccccccceeeeecC Confidence 46789999999999977 5579999999 No 6 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=100.00 E-value=9.5e-54 Score=311.26 Aligned_cols=163 Identities=23% Similarity=0.290 Sum_probs=138.3 Q ss_pred CeeEeecCCC-CCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhh-hccCccc-CCcccccccC Q lcl|NC_020854. 1 MAITIVATAG-AADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDR-ERYLGAR-ATDTQALQWP 77 (186) Q Consensus 1 Mal~v~~~~g-~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~-~~~~G~r-~~~~Q~laWP 77 (186) |+|||||+|| +|+||||+|++||++||+.|+++ |.+.++++|+++|++|++|||+ ++|+|+| ++++|+|+|| T Consensus 1 m~liveD~t~~~~~AnSYvtv~~a~aY~~~rg~~-----~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WP 75 (172) T protein:vir:97 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNS-----FAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWP 75 (172) T ss_pred CceEeeCCCCCCCCccccccHHHHHHHHHhcCcc-----cCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcc Confidence 9999999998 79999999999999999999754 8889999999999999999998 7999987 6899999999 Q ss_pred ccCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCC-----CcccceeEEecCeeEEeec Q lcl|NC_020854. 78 RTGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGL-----SGLEDYKNVKIGSIDVTPN 152 (186) Q Consensus 78 R~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~-----~~~~~v~~~kvG~isveY~ 152 (186) |+|+. +++.+++|.||++||+||||||+++++++..+.. .+...+||+|||+|+++|+ T Consensus 76 Rtg~~-----------------d~~~~~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~ 138 (172) T protein:vir:97 76 RTDAW-----------------DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVT 138 (172) T ss_pred cCCCC-----------------CCcccccccccHHHHHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEee Confidence 99984 3457899999999999999999999988654322 2334689999999999997 Q ss_pred CCCCc-CcccchHHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 153 QYGAT-GADRIPPMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 153 ~~~~~-~~~~~~~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) ..++. +..+.||+|++||+|+...++|. .|.|- T Consensus 139 ~~~~~~~~~p~~~~v~aLL~p~gl~~~~~-~~~r~ 172 (172) T protein:vir:97 139 FVGGAVFQMPKYPAADQKLVRAGLVRSGG-TLLRG 172 (172) T ss_pred ccCCCCCccccHHHHHHHHhhhccccCcc-eeccC Confidence 64443 35788999999999864444433 67777 No 7 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=98.25 E-value=2.3e-08 Score=62.46 Aligned_cols=125 Identities=17% Similarity=0.095 Sum_probs=84.4 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVG 94 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~ 94 (186) =.|+|.++-+.|.. ...++++-+++|.+|+++||.+.|- + . +..+. T Consensus 1 M~Y~t~~~Y~~~~G-----------~~i~e~~F~~l~~rAs~~ID~iT~~-r-i--------~~~~~------------- 46 (132) T protein:vir:98 1 MPYLTYEEFMDLNG-----------RDIDDKKFEKLLPKASAIIDGVTGH-F-Y--------QKVDM------------- 46 (132) T ss_pred CCCCCHHHHHhhcC-----------CCCCHHHHHHHHHHHHHHHHHHhcc-c-c--------cCCCc------------- Confidence 67999998887653 2357778899999999999987651 1 0 00111 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCC-cCc----ccchHHHHHH Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGA-TGA----DRIPPMVERY 169 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~-~~~----~~~~~~v~~l 169 (186) +.. ...++.+||.|+|..+-.+.+.+........+.++++++|..+|+|..+.+ .+. ....+-+..+ T Consensus 47 -------~~d-~~~~~~~vk~A~c~qiey~~~~G~~sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~ 118 (132) T protein:vir:98 47 -------EKD-NAWRVNQFKLALCAQIEYFDALGATTFEEINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIY 118 (132) T ss_pred -------ccc-ChHHHHHHHHHHHHHHHHHHhccchhhhhccCccceeeeCcEEEEeeccCCcccccccccchHHHHHHH Confidence 011 245778999999999998876644333334567999999999999964332 221 1233556677 Q ss_pred Hh--hhhccCCcee Q lcl|NC_020854. 170 LT--GLRISGPGNI 181 (186) Q Consensus 170 L~--~l~~~~~g~~ 181 (186) |. |||..|-+++ T Consensus 119 L~~tGLLyrGV~~~ 132 (132) T protein:vir:98 119 LQGTGLLFQGVKTW 132 (132) T ss_pred HhhcCCccccCCCC Confidence 75 4888888888 No 8 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=98.05 E-value=6.7e-08 Score=59.93 Aligned_cols=125 Identities=20% Similarity=0.253 Sum_probs=79.9 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVG 94 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~ 94 (186) =.|+|.++-..... + ...++++=+++|.+|+++||.+.| -|.... T Consensus 1 M~Y~d~~~Y~~~y~--G--------~~i~e~~F~~l~~rAs~~ID~~T~-------------~ri~~~------------ 45 (131) T protein:vir:80 1 MPYTTLEFYTNEYA--G--------EHLEQDEFAKLLKHAERKIDSVTF-------------YRIRKS------------ 45 (131) T ss_pred CCCCCHHHHHHhhC--C--------CCCchhHHHHHHHHHHHHHHHHhc-------------cccccc------------ Confidence 67888888754321 1 134667788999999999998765 121000 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcc----cchHHHHHHH Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGAD----RIPPMVERYL 170 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~----~~~~~v~~lL 170 (186) +-+.. .+.+|.+||.|+|+.|-.+...+.. .....+.+++++||..||+|...+..+.. ...+.+..+| T Consensus 46 -----~~d~~-~~~~~~~vk~A~c~q~e~~~~~g~~-~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L 118 (131) T protein:vir:80 46 -----GIEAF-SEFIQHQIQLATCNQIEYFKEAGGT-SELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYL 118 (131) T ss_pred -----ccccC-chhHHHHHHHHHHHHHHHHHHhhhh-hhhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHH Confidence 00001 1579999999999999988876543 22335678999999999999765443322 2455566677 Q ss_pred hh--hhccCCceeeee Q lcl|NC_020854. 171 TG--LRISGPGNIAVK 184 (186) Q Consensus 171 ~~--l~~~~~g~~~~~ 184 (186) .+ ||..|- ..+ T Consensus 119 ~~TGLlyrGV---~~~ 131 (131) T protein:vir:80 119 AHTGLLYNGV---GVR 131 (131) T ss_pred hccCCeecCC---CCC Confidence 54 554442 223 No 9 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=98.05 E-value=6e-08 Score=60.16 Aligned_cols=124 Identities=22% Similarity=0.270 Sum_probs=80.4 Q ss_pred cceecHHHHHH-HHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcc Q lcl|NC_020854. 15 NSYLTLSDAQD-IIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAV 93 (186) Q Consensus 15 nSYvsla~Ada-Y~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~ 93 (186) =.|+|.++-.. |.. ...++++=+++|.+|+++||.+.| -|... T Consensus 1 M~Y~d~~~Y~~~y~g-----------~~i~e~~F~~l~~rAs~~ID~~T~-------------~ri~~------------ 44 (131) T protein:vir:43 1 MPYTTLEFYNDEYAG-----------EHLEQDEFDKLLKHAERKIDSVTF-------------YRIRK------------ 44 (131) T ss_pred CCCCCHHHHHHhhCC-----------CCCCHhHHHHHHHHHHHHHHHHhc-------------ccccc------------ Confidence 67889888754 421 235677788999999999998765 11100 Q ss_pred ccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCc----ccchHHHHHH Q lcl|NC_020854. 94 GFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGA----DRIPPMVERY 169 (186) Q Consensus 94 ~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~----~~~~~~v~~l 169 (186) .+-+.. .+.+|.+||.|+|+.|-.+...+... ....+.+++++||..||+|...+..+. ....+.+..+ T Consensus 45 -----~~~~~~-~~~~~~~vk~A~c~q~e~~~~~g~~s-~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~ 117 (131) T protein:vir:43 45 -----GGIESF-SEFIQHQIQLATCNQIEYFKEAGGTS-ELAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSY 117 (131) T ss_pred -----cCcccc-chhhHHHHHHHHHHHHHHHHHhHHHh-hhhccccCeeecCceEEeecccccchhhhchhhhHHHHHHH Confidence 000111 15789999999999999988764332 223446899999999999976444332 2245667777 Q ss_pred Hhh--hhccCCceeeee Q lcl|NC_020854. 170 LTG--LRISGPGNIAVK 184 (186) Q Consensus 170 L~~--l~~~~~g~~~~~ 184 (186) |.+ ||..|- ..+ T Consensus 118 L~~TGLlyrGV---~~~ 131 (131) T protein:vir:43 118 LAHTGLLYNGV---GVR 131 (131) T ss_pred HhccCCeecCC---CCC Confidence 754 554442 223 No 10 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=95.45 E-value=0.00022 Score=40.65 Aligned_cols=123 Identities=16% Similarity=0.119 Sum_probs=81.5 Q ss_pred CCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 11 AADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 11 ~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) --.-.+.+|+.|..+-+-.- -.+.++.+.+.+|-.|+|-|.++-|.|. T Consensus 1 ~~~~~alAtvdDv~~~lrr~--------Lt~dE~~~a~~Ll~eAsdlI~g~l~~~~------------------------ 48 (128) T protein:vir:25 1 MTECKALATSQDVKRALRRD--------LTEAEQTDLSELLAEATDLVVGYLHPYP------------------------ 48 (128) T ss_pred CccchhccCHHHHHHHhcCC--------CCHHHHHHHHHHHhcchheeeeecCCCC------------------------ Confidence 12467888888887644210 0112222334456689999987665431 Q ss_pred hccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHH Q lcl|NC_020854. 91 YAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYL 170 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL 170 (186) ..|.+|.-|+.-+|..+..++.-++...+.. ++...|+.+++|..+++.+....-...+.+| T Consensus 49 --------------vp~~~p~~v~rVvA~ivarAltr~~~~~pe~----~S~TAgpfs~~ft~~~~~~g~yLTaa~k~~L 110 (128) T protein:vir:25 49 --------------VPTPTPGPIKRVVASMVAAVLTRPTQILPET----QSLTADGFGVTFTPGGNSPGPYLSAALKQRL 110 (128) T ss_pred --------------CCCCCCchHHHHHHHHHHHHhhCCCccCCCc----eeeecccccccccCCCCCCCceEcHHHHhhc Confidence 1367888999999999999987766544422 3567799999987766666655557788999 Q ss_pred hhhhccCCceeeeeeC Q lcl|NC_020854. 171 TGLRISGPGNIAVKRS 186 (186) Q Consensus 171 ~~l~~~~~g~~~~~r~ 186 (186) +++.. |.|.|.=+ T Consensus 111 rp~R~---~~~sV~l~ 123 (128) T protein:vir:25 111 RPYRT---GMVAVEMG 123 (128) T ss_pred ccccc---eeeEeecc Confidence 99844 56666555 No 11 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=94.65 E-value=0.0015 Score=36.06 Aligned_cols=144 Identities=12% Similarity=0.083 Sum_probs=69.8 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHH-HHHHHHhhhccCcccC----Ccccccc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALY-TATQRLDRERYLGARA----TDTQALQ 75 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~-~Atd~id~~~~~G~r~----~~~Q~la 75 (186) |..++...+. +-.=+||+|++++.. . ..+.+|+..+..|+ .|.++++++ .|+.- =...--. T Consensus 1 M~~~~~~~pp---a~ePVtL~e~K~hLR--i-------d~~~eD~~l~~~lI~aA~~~~E~~--~gr~l~~qt~~~~~~~ 66 (188) T protein:vir:80 1 MAAVLVEYLD---DAEPLTFEEVAFQCR--I-------DDDDERDFVERIVIPGARQAAESK--SGAAIRKARYVERLSG 66 (188) T ss_pred CCceeeccCC---CCcccCHHHHHHHcC--C-------CCchhhHHHHHHHHHHHHHHHHHH--hCCeeeeeeEEEEecC Confidence 8877766553 333489999999873 2 12334445556555 577788863 22211 0111112 Q ss_pred cCccCcccc---------------ccch-----hhhcccc-----ccccCcc-------cc-------cCCcchHHHHHH Q lcl|NC_020854. 76 WPRTGVRKP---------------DTYI-----NTYAVGF-----PFRITTD-------YF-------TDTEIPQQIKEA 116 (186) Q Consensus 76 WPR~g~~~~---------------~~~~-----~~~~~~~-----~~~~~~~-------~~-------~~d~IP~~Vk~A 116 (186) ||+.+.+.+ +|.. ..|.++. -.++.+. .+ -.+.+|..+|+| T Consensus 67 ~~~~~i~Lp~~PV~sV~sV~~~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~~~vP~~ik~a 146 (188) T protein:vir:80 67 FPLAEISLSVGQVIRVDSIEIRDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGVDLARYPSVRTW 146 (188) T ss_pred CCCCceEecccccceeeEEEEEcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecccccChHHHHHH Confidence 333222222 2211 1111110 0110000 00 024688899999 Q ss_pred HHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccCCcee Q lcl|NC_020854. 117 QATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISGPGNI 181 (186) Q Consensus 117 ~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~~g~~ 181 (186) ...++....++-+... +|. ..+. ..+.+++.||++|..-.+ | T Consensus 147 ill~va~~Ye~Re~~~-----------~g~---------~~~~-~P~~~v~~Ll~pyRvp~~--~ 188 (188) T protein:vir:80 147 MLLAAAWAYDHRELFS-----------EGQ---------PIGE-MPGGYADVLLNPITVPPR--F 188 (188) T ss_pred HHHHHHHHHhcccccc-----------ccc---------cccc-ccHHHHHHHhhccCCCCC--C Confidence 9999988876532110 111 1111 113458899999876553 3 No 12 >protein:vir:1435 Length: 188 # NCBI annotation: hypothetical protein # Family: family:all:501 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536364;genbank:gi:17975169;genbank:GeneID:929149 Probab=94.14 E-value=0.0025 Score=34.85 Aligned_cols=143 Identities=13% Similarity=0.082 Sum_probs=68.5 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHH-HHHHHHhhhccCcccC----Ccccccc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALY-TATQRLDRERYLGARA----TDTQALQ 75 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~-~Atd~id~~~~~G~r~----~~~Q~la 75 (186) |+.++...+. +-.=+||+|++++.. . ....+|+..+..|+ .|.++++++ .|+.- -...--. T Consensus 1 m~~~~~~~pp---a~epVtLae~K~~lr--i-------d~~~eD~~l~~~li~aA~~~~E~~--tgr~l~~qt~~~~~~~ 66 (188) T protein:vir:14 1 MAAVLVEYLD---DAEPLTFEEVAFQCR--I-------DDDDERDFVERVVIPGARQAAESK--AGAAIRKARYVEHLSG 66 (188) T ss_pred CCceeeecCC---CCCccCHHHHHHHcC--C-------CCchhHHHHHHHHHHHHHHHHHHH--hCCeeeeeeEEEEecC Confidence 8877765543 445689999999873 2 12334445555555 667789863 22211 1111122 Q ss_pred cCccCcccc---------------ccchh-----hhcccc----c-ccc-C------ccc--------ccCCcchHHHHH Q lcl|NC_020854. 76 WPRTGVRKP---------------DTYIN-----TYAVGF----P-FRI-T------TDY--------FTDTEIPQQIKE 115 (186) Q Consensus 76 WPR~g~~~~---------------~~~~~-----~~~~~~----~-~~~-~------~~~--------~~~d~IP~~Vk~ 115 (186) ||+.+.+.+ +|... .|.++. . .++ + +.. + .+.+|..+|+ T Consensus 67 ~~~~~~~Lp~~Pv~sV~sV~~~d~~g~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~-~~~vP~~ik~ 145 (188) T protein:vir:14 67 FPPAEVPLSVGQVISVDSIEIRDASGATTTLDAGAFELVQLGRETLLVPAGQARWPYARAVTIKYQAGI-DLARYPSVRS 145 (188) T ss_pred cCCCceEecccCcceeeEEEEEcCCCceEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecC-ccCchHHHHH Confidence 333222211 11100 011100 0 000 0 000 1 2468899999 Q ss_pred HHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccCCcee Q lcl|NC_020854. 116 AQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISGPGNI 181 (186) Q Consensus 116 A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~~g~~ 181 (186) |...++....++-+.. ..|. ..+. -.+.+++.||.+|..-.+ | T Consensus 146 Aill~va~~Y~~Re~~-----------~~g~---------~~~~-lP~~~v~~Ll~pyRvP~~--~ 188 (188) T protein:vir:14 146 WMLLAAAWAYDHRELY-----------SDGQ---------PMGE-MPGGYSDVLLNPITVPPR--F 188 (188) T ss_pred HHHHHHHHHHhccccc-----------cccc---------cccc-ccHHHHHHHhhccCCCCC--C Confidence 9999998887653211 0110 1111 112347899999876543 2 No 13 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=93.71 E-value=0.0016 Score=35.84 Aligned_cols=137 Identities=15% Similarity=0.160 Sum_probs=71.3 Q ss_pred HHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh---ccCcc--cC---C---cccccccCc------cCc- Q lcl|NC_020854. 20 LSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE---RYLGA--RA---T---DTQALQWPR------TGV- 81 (186) Q Consensus 20 la~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~---~~~G~--r~---~---~~Q~laWPR------~g~- 81 (186) ..+|+.-+. -++.+.++.+-||..|+..+.++ .|.=. -+ + ..+. -|+ ..| T Consensus 1 ~~~~~~la~----------~~~~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~--LP~~Pvv~i~~Ve 68 (188) T protein:vir:78 1 MTFAQQLAD----------AFPEDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSL--LPSIPVVEISKVE 68 (188) T ss_pred CchhhhHHH----------hcCCCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCcee--eccCcceeeeEEE Confidence 222222221 24566667777999999998864 22200 00 0 0011 122 111 Q ss_pred -ccccc----------------------------chhhhccccccccCc--cccc--CCcchHHHHHHHHHHHHHHhccc Q lcl|NC_020854. 82 -RKPDT----------------------------YINTYAVGFPFRITT--DYFT--DTEIPQQIKEAQATLAVYLNNNK 128 (186) Q Consensus 82 -~~~~~----------------------------~~~~~~~~~~~~~~~--~~~~--~d~IP~~Vk~A~~eLA~~~~~~~ 128 (186) ...+| ++...+-+.|.-++- ..+. -++||.+|+...|++|-.++.++ T Consensus 69 ~~~~~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~np 148 (188) T protein:vir:78 69 GYLPTGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSNP 148 (188) T ss_pred EEeeCCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcCc Confidence 00011 000011111100000 0011 26899999999999999999885 Q ss_pred CCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccCCc Q lcl|NC_020854. 129 DGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISGPG 179 (186) Q Consensus 129 ~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~~g 179 (186) .. ...++||++|++|+.. +....-+.=+.+|+.|....-. T Consensus 149 ~~--------L~q~~vG~~S~tfa~~---~~~sl~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 149 EL--------LVSKQVGEIERRFGSV---AGTSLSKADQAILDRYVIATLA 188 (188) T ss_pred cc--------ceeeecCceeeecccc---cCCcccchhHHhhccccccccC Confidence 33 3679999999999843 2222345556788887775554 No 14 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=93.71 E-value=0.0016 Score=35.84 Aligned_cols=137 Identities=15% Similarity=0.160 Sum_probs=71.3 Q ss_pred HHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh---ccCcc--cC---C---cccccccCc------cCc- Q lcl|NC_020854. 20 LSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE---RYLGA--RA---T---DTQALQWPR------TGV- 81 (186) Q Consensus 20 la~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~---~~~G~--r~---~---~~Q~laWPR------~g~- 81 (186) ..+|+.-+. -++.+.++.+-||..|+..+.++ .|.=. -+ + ..+. -|+ ..| T Consensus 1 ~~~~~~la~----------~~~~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~--LP~~Pvv~i~~Ve 68 (188) T protein:vir:10 1 MTFAQQLAD----------AFPEDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSL--LPSIPVVEISKVE 68 (188) T ss_pred CchhhhHHH----------hcCCCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCcee--eccCcceeeeEEE Confidence 222222221 24566667777999999998864 22200 00 0 0011 122 111 Q ss_pred -ccccc----------------------------chhhhccccccccCc--cccc--CCcchHHHHHHHHHHHHHHhccc Q lcl|NC_020854. 82 -RKPDT----------------------------YINTYAVGFPFRITT--DYFT--DTEIPQQIKEAQATLAVYLNNNK 128 (186) Q Consensus 82 -~~~~~----------------------------~~~~~~~~~~~~~~~--~~~~--~d~IP~~Vk~A~~eLA~~~~~~~ 128 (186) ...+| ++...+-+.|.-++- ..+. -++||.+|+...|++|-.++.++ T Consensus 69 ~~~~~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~np 148 (188) T protein:vir:10 69 GYLPTGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSNP 148 (188) T ss_pred EEeeCCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcCc Confidence 00011 000011111100000 0011 26899999999999999999885 Q ss_pred CCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccCCc Q lcl|NC_020854. 129 DGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISGPG 179 (186) Q Consensus 129 ~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~~g 179 (186) .. ...++||++|++|+.. +....-+.=+.+|+.|....-. T Consensus 149 ~~--------L~q~~vG~~S~tfa~~---~~~sl~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 149 EL--------LVSKQVGEIERRFGSV---AGTSLSKADQAILDRYVIATLA 188 (188) T ss_pred cc--------ceeeecCceeeecccc---cCCcccchhHHhhccccccccC Confidence 33 3679999999999843 2222345556788887775554 No 15 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=93.45 E-value=0.0061 Score=32.74 Aligned_cols=123 Identities=10% Similarity=-0.015 Sum_probs=68.5 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHH---HHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQ---KNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~---ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) -+.|+|++|..+-| -++++++ .+.+|-.|+++|. ..+|+.|... T Consensus 1 m~~fAtv~D~~~rw------------r~Lt~~E~~ra~~LL~~As~~ir--------------~~~p~~~~~l------- 47 (131) T protein:vir:95 1 MENFATVEDLKKLW------------RALKFDEEKRAEALLEVVSHSLR--------------VEAKKVGKDL------- 47 (131) T ss_pred CCccCCHHHHHHHh------------cCCCHHHHHHHHHHHHHHHHHHH--------------HhhhhccCCc------- Confidence 78899999997654 3344443 4556779999996 3355544211 Q ss_pred hccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCee--EEeecCCCCcCcccchHHHHH Q lcl|NC_020854. 91 YAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSI--DVTPNQYGATGADRIPPMVER 168 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~i--sveY~~~~~~~~~~~~~~v~~ 168 (186) ....-+.+..+.-+|.-+|+...+.+..+.... +..=.++..|+. +.+|. .+.|....-..-.. T Consensus 48 ---------~~~~~~~~~~~~~~~~V~~~~V~Ral~~~~~~~---G~tq~S~TaG~ys~S~t~~--~p~g~lylt~~e~~ 113 (131) T protein:vir:95 48 ---------DGLVATDPSFTMVVKSVTVDVVARTLMTSTDQE---PMTQVAESALGYSFSGSYL--VPGGGLFIKDSELK 113 (131) T ss_pred ---------cccccCCccchHHHHHHHHHHHHHHhcCCCCCC---Cceeeeeecccceeeeeee--cCCCCceeChHHHH Confidence 111223356677899999999999886542110 111245888988 45554 34444333333334 Q ss_pred HHhhhhccCCceeeeeeC Q lcl|NC_020854. 169 YLTGLRISGPGNIAVKRS 186 (186) Q Consensus 169 lL~~l~~~~~g~~~~~r~ 186 (186) +| ..++...+.|-=- T Consensus 114 ~L---Gl~~~r~~~i~~~ 128 (131) T protein:vir:95 114 RL---GLKKQRYGVIDIY 128 (131) T ss_pred Hh---CCCCCceeEEeec Confidence 44 2344433333222 No 16 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=92.32 E-value=0.0096 Score=31.65 Aligned_cols=121 Identities=11% Similarity=0.023 Sum_probs=75.5 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh-ccCcccCCcccccccCccCccccccchhhhc Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE-RYLGARATDTQALQWPRTGVRKPDTYINTYA 92 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~-~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~ 92 (186) --+|+||+|.+...+ |.-..+..+ ...+|=.+||.. .| +-..|...||- T Consensus 1 ~~alasvee~~trl~----------~~lp~~~~r--~~a~a~~vLd~~S~~----ar~~~gr~W~~-------------- 50 (158) T protein:vir:99 1 MAALVSVEEFTTFLR----------VPLPEEGSE--KYTQMEFLLTLASDW----ARELSCKPWLL-------------- 50 (158) T ss_pred CcceeeHhhhhhhhc----------ccCChhhhH--HHHHHHHHHHHHHHH----HHHhcCccCCC-------------- Confidence 568999999998773 322223322 222332344421 11 01234556882 Q ss_pred cccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhh Q lcl|NC_020854. 93 VGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTG 172 (186) Q Consensus 93 ~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~ 172 (186) .+.+|.-|+.=|...|-+.++|++ +++.+.+|+-++.|...+.... ...+.=...|+- T Consensus 51 -------------~~daP~~vr~ivL~aa~R~~~NP~--------g~~~~~~G~~~~~~~~~g~~~~-ffT~~E~~~L~r 108 (158) T protein:vir:99 51 -------------PADAPVTARGIILAASRREWNNPK--------RVSYVVKGPQSATFMQSAYPPG-FFTDAEEAKLRS 108 (158) T ss_pred -------------CCcchhHHHHHHHHHHHHHHhcCC--------ceEEeeecchhhhcccccCCCc-ccCHHHHHHHHH Confidence 356889999999999999998864 5678899999999976543321 233455678888 Q ss_pred hhccCCceeeeeeC Q lcl|NC_020854. 173 LRISGPGNIAVKRS 186 (186) Q Consensus 173 l~~~~~g~~~~~r~ 186 (186) |.++.+|-..+.-+ T Consensus 109 ~~~s~GG~~~~~tt 122 (158) T protein:vir:99 109 YGRSTGNWGVIETY 122 (158) T ss_pred hhcccCceeEEEee Confidence 87776665555544 No 17 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=92.27 E-value=0.011 Score=31.32 Aligned_cols=123 Identities=9% Similarity=-0.043 Sum_probs=68.0 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHH---HHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQ---KNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~---ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) -+.|+|++|..+-| -++++++ .+.+|-.|+++|. ..||+.+.-.+... T Consensus 1 m~~fAtv~Dl~~r~------------r~L~~dE~~ra~~LL~dAs~~iR--------------~~~~~~~~~~~~~~--- 51 (132) T protein:vir:94 1 MNPFATVDDLTMLW------------RPLKGDEKERAEKLLEIVSDTLR--------------EEADKVGRDLDVMI--- 51 (132) T ss_pred CCCcCCHHHHHHHh------------ccCChhHHHHHHHHHHHHHHHHH--------------HHHhhhcccccccc--- Confidence 78999999998633 3444444 3445779999996 34555543221110 Q ss_pred hccccccccCcccccCCcchHHHHHHHHHHHHHHhcccC-CCCCCcccceeEEecCee--EEeecCCCCcCcccchHHHH Q lcl|NC_020854. 91 YAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKD-GIGLSGLEDYKNVKIGSI--DVTPNQYGATGADRIPPMVE 167 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~-~~~~~~~~~v~~~kvG~i--sveY~~~~~~~~~~~~~~v~ 167 (186) .-..+..|.-+|.-+|....+.+..+. ..+. .=.++..|+. +.+|. .+.|....-..-. T Consensus 52 ------------~~~~d~~~~~~k~V~~~~V~Ral~~~~~~~g~----tq~S~TaG~ys~S~T~~--np~G~lylt~~e~ 113 (132) T protein:vir:94 52 ------------SEKPSYFSSVVKSVTVDIVARTLMTSTDQEPM----TQTTESALGYSVSGSYL--VPGGGLFIKNSEL 113 (132) T ss_pred ------------CCCCccchhHHHHHHHHHHHHHhcCCCCCCCc----eeeeeecccceeeeeee--cCCCCceeChHHH Confidence 001123345566778888888886542 2211 1135788987 55664 3444433333333 Q ss_pred HHHhhhhccCCceeeeeeC Q lcl|NC_020854. 168 RYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 168 ~lL~~l~~~~~g~~~~~r~ 186 (186) .+| ..++...+.|-=. T Consensus 114 ~~L---Gl~~~r~~~i~~~ 129 (132) T protein:vir:94 114 SRL---GLKKQRFGVIDFY 129 (132) T ss_pred Hhh---CCCCCceEEEeec Confidence 333 4455556666555 No 18 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=92.12 E-value=0.0028 Score=34.58 Aligned_cols=122 Identities=16% Similarity=0.092 Sum_probs=73.4 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhc--cCcccCCcccccccCccCccccccchhhhc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRER--YLGARATDTQALQWPRTGVRKPDTYINTYA 92 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~--~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~ 92 (186) =.|.|.+|-+.|- ..+.++=+++|-+|++-||.+. |.=+..+ T Consensus 1 M~YlT~eey~el~-------------~~~~~~F~kl~k~A~~~ID~~t~~~y~~~~~----------------------- 44 (130) T protein:vir:47 1 MTYLTQEEFDELD-------------FDEVTDFEKLAKRAKIAIDLYTNGIYQKDID----------------------- 44 (130) T ss_pred CCCCchhhHhhcC-------------CCChhhHHHHHHHHHHHHHHHhcccccccCC----------------------- Confidence 5688888887542 1245568999999999999752 3211110 Q ss_pred cccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHH---HHHH Q lcl|NC_020854. 93 VGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPM---VERY 169 (186) Q Consensus 93 ~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~---v~~l 169 (186) +.=+...+=.+||.|.|.-..++-..+ ..+....+.+.+.+||-.+++|...+.......+.. +-.+ T Consensus 45 ---------~~~~~~~r~~~vK~A~a~QieY~~~~G-~~s~~~~~~~~S~svGrtSis~~~~~~~~~~~~~~vs~da~~~ 114 (130) T protein:vir:47 45 ---------FEKEIAYRKSAVKLAMAFQIAYLDASG-IMSADDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAENA 114 (130) T ss_pred ---------ccCcchHHHHHHHHHHHHHHHHHHHhc-cccchhccCcceeeecceeeecCcCccccccCCccccHHHHHH Confidence 111233455688899888777766543 333444778999999999999976544333322222 3335 Q ss_pred Hhh--h-hccCCceeeeee Q lcl|NC_020854. 170 LTG--L-RISGPGNIAVKR 185 (186) Q Consensus 170 L~~--l-~~~~~g~~~~~r 185 (186) |.+ | |-.| |.--| T Consensus 115 L~~tGL~Ly~G---V~yd~ 130 (130) T protein:vir:47 115 LRQAGFSLVVG---VAYDR 130 (130) T ss_pred HHhcccccccC---CCccC Confidence 544 4 3333 23334 No 19 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=90.91 E-value=0.0083 Score=31.99 Aligned_cols=132 Identities=17% Similarity=0.133 Sum_probs=77.6 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhc--cCcccCCcccccccCccCccccccchhhh Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRER--YLGARATDTQALQWPRTGVRKPDTYINTY 91 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~--~~G~r~~~~Q~laWPR~g~~~~~~~~~~~ 91 (186) --.|.|-+|-+.+-- -..++++=+++|-+|.+-||.+. |.+. +-..-...|+ T Consensus 1 ~~pYLTy~ef~~lg~-----------~~~~~d~F~kllk~A~~~ID~~T~y~~~~----------y~~~~i~~d~----- 54 (144) T protein:vir:79 1 MKPYLTTSDFEKLGY-----------ELKKPDNFGKLLKSATVLINQICSYYDPA----------FAYHDLEADS----- 54 (144) T ss_pred CCcccchhhhhhhCC-----------CCcchhhhhhHHHHHHHHhhhhhhhhccc----------cccccccccc----- Confidence 668888777643221 12455678999999999999752 2110 0000000000 Q ss_pred ccccccccCcccccCCcch---HHHHHHHHHHHHHHhcccCCCC-CCcccceeEEecCeeEEeecCCCCcCc----ccch Q lcl|NC_020854. 92 AVGFPFRITTDYFTDTEIP---QQIKEAQATLAVYLNNNKDGIG-LSGLEDYKNVKIGSIDVTPNQYGATGA----DRIP 163 (186) Q Consensus 92 ~~~~~~~~~~~~~~~d~IP---~~Vk~A~~eLA~~~~~~~~~~~-~~~~~~v~~~kvG~isveY~~~~~~~~----~~~~ 163 (186) +.=....|| .+||.|.|.-..++...+...+ ....+.+++++||-.+++|...++.+. .... T Consensus 55 ----------~~d~~~~~~~r~~~vKkA~a~QIeY~~~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~ 124 (144) T protein:vir:79 55 ----------QADPDSYLFRQAMAFKKAVALEMLFLEDSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVV 124 (144) T ss_pred ----------cccchhhhhHHHHHHHHHHHHHHHHHHHcCCcchhhhhcCccceeEecceEEeecCCCcccccccccccc Confidence 000123355 4568888887777654433222 123678999999999999976554432 2355 Q ss_pred HHHHHHHhh--hhccCCcee Q lcl|NC_020854. 164 PMVERYLTG--LRISGPGNI 181 (186) Q Consensus 164 ~~v~~lL~~--l~~~~~g~~ 181 (186) +-+-.+|.+ ||-.|-+++ T Consensus 125 ~~a~~yL~~tGLLYrGV~s~ 144 (144) T protein:vir:79 125 KSAYDLLGRYGLLFSGVASL 144 (144) T ss_pred HHHHHHHhhcCccccccccC Confidence 667777765 666666665 No 20 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=88.97 E-value=0.011 Score=31.23 Aligned_cols=123 Identities=16% Similarity=0.222 Sum_probs=69.5 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhh---ccCcccCCcccccccC Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRE---RYLGARATDTQALQWP 77 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~---~~~G~r~~~~Q~laWP 77 (186) |++. +|.|.+|-+.+ + ..+.++-+++|-+|++-||.+ .|.+..-+.+ -.| T Consensus 3 ~~~M-----------~YlT~eey~~l----~---------~~~~~dF~kllk~As~~ID~~t~~~y~~~d~e~d--~~~- 55 (138) T protein:vir:98 3 VVII-----------AFLTQKEFEDL----G---------FDDVEDFEKMEKRASHAVNLYCRNRYDYKDLKKE--IAL- 55 (138) T ss_pred cccc-----------cccchHHHhcc----C---------CCChhhHHHHHHHHHHHhhhhhccccccccccch--hHH- Confidence 4433 68888876542 1 124446899999999999964 2322221111 222 Q ss_pred ccCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCc Q lcl|NC_020854. 78 RTGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGAT 157 (186) Q Consensus 78 R~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~ 157 (186) +=.+||.|.|.-..++...+ ..+....+.+.+.+||-.++.|+-...+ T Consensus 56 -------------------------------r~~~vKkA~a~QIeY~~~~G-~ts~~d~~~~~s~svGrTSiS~~~~~~~ 103 (138) T protein:vir:98 56 -------------------------------VQKAVKRAIAYQIAYLNDSG-VMTAEDKQSFAGISLGRTSISYTVGHGQ 103 (138) T ss_pred -------------------------------HHHHHHHHHHHHHHHHHHcC-CcchhhccCcCceEeeeeEeeccccccc Confidence 33578888887666665443 3444447788999999999998432222 Q ss_pred Ccc--------cchHHHHHHHhh--hhccCCceeeeee Q lcl|NC_020854. 158 GAD--------RIPPMVERYLTG--LRISGPGNIAVKR 185 (186) Q Consensus 158 ~~~--------~~~~~v~~lL~~--l~~~~~g~~~~~r 185 (186) ++. ....-+..+|.+ ||-.|- .--| T Consensus 104 ~s~~~~~~~~~~~s~~A~~~L~~tGLLY~GV---~yd~ 138 (138) T protein:vir:98 104 GSQQKTLADRFNLCLDAENELLVVGLGYTGI---SYDR 138 (138) T ss_pred ccccccccccccccHHHHHHHhhcCcccccC---cccC Confidence 211 122223346654 444332 3344 No 21 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=88.70 E-value=0.007 Score=32.40 Aligned_cols=87 Identities=13% Similarity=0.112 Sum_probs=50.3 Q ss_pred CccCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHHHHH---------------------hcccC----- Q lcl|NC_020854. 77 PRTGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLAVYL---------------------NNNKD----- 129 (186) Q Consensus 77 PR~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~~~---------------------~~~~~----- 129 (186) =| . .+|. -.+|++|++|=+|+|-.. +..+. T Consensus 1 mR--------------~---------l~P~f~~vpdevi~~wid~A~lFVC~~~fg~~~~~Al~lytlHLm~~dga~k~e 57 (125) T protein:vir:10 1 MR--------------T---------LYPPLKSQPDDVLNAWIEVAKLFICLDKFGDKQVQALAFYTLHLLSQDIALKTE 57 (125) T ss_pred Cc--------------c---------ccchhhccCHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccc Confidence 11 1 1222 467999998888776432 22221 Q ss_pred -CCCCCcccceeEEec-CeeEEeecCCCCcCccc--chHHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 130 -GIGLSGLEDYKNVKI-GSIDVTPNQYGATGADR--IPPMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 130 -~~~~~~~~~v~~~kv-G~isveY~~~~~~~~~~--~~~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) .......++|++.+. |+++++|+..+..++.. .-+-.-.|+..|+.-.+|.|.+--+ T Consensus 58 ~~~~~~~s~r~~s~slsGE~Sit~~~~s~d~s~~~L~~T~wGk~~~~L~k~~~GgFaL~T~ 118 (125) T protein:vir:10 58 NDSSQTSSERVKSYSLSGEYTISYDTSTAAASSSNLEESSWGKLYIDLMRLKVGRWGLITS 118 (125) T ss_pred cccccccccceeeeeeccceEeecccccccccccccccCchHHHHHHHHHhcCCceeeecc Confidence 112234568999985 99999998765544311 1112335666666666677777666 No 22 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=86.25 E-value=0.047 Score=27.87 Aligned_cols=121 Identities=14% Similarity=0.061 Sum_probs=63.3 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHH---HHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQ---KNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~---ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) -+.|+|++|..+-| -++++++ .+.+|-.|+++|. ..+|+.|.-.+. T Consensus 1 m~~fATv~Dv~~rw------------r~Lt~dE~~ra~~LL~dAS~~iR--------------~~~p~~g~~~~~----- 49 (140) T protein:vir:97 1 MGNFATTDDVILLW------------RPLSVDELKRANALLKVVSDTLR--------------MEADKVGKDLDK----- 49 (140) T ss_pred CCcCCCHHHHHHHh------------cCCCHhHHHHHHHHHHHHHHHHH--------------HhhhhccCCcch----- Confidence 78999999998755 3344443 4556779999996 346666542221 Q ss_pred hccccccccCcccccC-CcchHHHHHHHHHHHHHHhccc-CCCCCCcccceeEEecCee--EEeecCCCCcCcccchHHH Q lcl|NC_020854. 91 YAVGFPFRITTDYFTD-TEIPQQIKEAQATLAVYLNNNK-DGIGLSGLEDYKNVKIGSI--DVTPNQYGATGADRIPPMV 166 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~~~~~~~-~~~~~~~~~~v~~~kvG~i--sveY~~~~~~~~~~~~~~v 166 (186) .++. +.-+.-++.-+|....+.+..+ +..+ ..-.++..|+. +.+|. .+.|....-+.- T Consensus 50 ------------~~~~~~~~~~~~k~V~~~mV~Ral~~~~d~~G----~tq~S~TaG~ys~S~T~~--np~G~lylt~~e 111 (140) T protein:vir:97 50 ------------TMVDKPYFVNVIKSVTVDIVARTLMTSTQGEP----MSQESQSALGYTWSGTYL--VPGGGLFIKDNE 111 (140) T ss_pred ------------hcccCccchhHHHHHHHHHHHHHhcCCCCCCc----ceeeeeeccchhheeeee--cCCCCceeChHH Confidence 1111 2234455667777777755322 2221 12245788988 55664 344433332333 Q ss_pred HHHHhhhhccCCceeee------eeC Q lcl|NC_020854. 167 ERYLTGLRISGPGNIAV------KRS 186 (186) Q Consensus 167 ~~lL~~l~~~~~g~~~~------~r~ 186 (186) ..+| ..++...+.| +|- T Consensus 112 ~~~L---Gl~~~r~~~i~~~g~~~~~ 134 (140) T protein:vir:97 112 LKRL---GLKKQRYGGIELYGEIKRD 134 (140) T ss_pred HHHh---CCCCCceeeecccCccccC Confidence 3344 3344433332 222 No 23 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=84.89 E-value=0.057 Score=27.41 Aligned_cols=123 Identities=10% Similarity=-0.017 Sum_probs=66.8 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHH---HHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATAD---QKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~---~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) -+.|+|++|..+-| -+++++ ..+.+|-.|+++|. ..+|+.+... T Consensus 1 m~~fAtv~Dv~~r~------------r~L~~~E~~ra~~lL~dAs~~ir--------------~~~p~~~~~l------- 47 (132) T protein:vir:16 1 MNPFATVDDLTMLW------------RPLKGDEKERAEKLLEIVSDSLR--------------EEADKVGRDL------- 47 (132) T ss_pred CCccCCHHHHHHHh------------cCCCHhHHHHHHHHHHHHHHHHH--------------Hhhhhhcccc------- Confidence 78999999997655 234444 34556779999996 2344443211 Q ss_pred hccccccccCcccccC-CcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCee--EEeecCCCCcCcccchHHHH Q lcl|NC_020854. 91 YAVGFPFRITTDYFTD-TEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSI--DVTPNQYGATGADRIPPMVE 167 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~i--sveY~~~~~~~~~~~~~~v~ 167 (186) ....-++ +..+.-++.-+|+.....+..+.... +..=.++..|+. +.+|. .+.|....-+ T Consensus 48 ---------~a~~~e~~~~~~~~~~~V~~~~V~Ral~~~~~~~---G~tq~S~TaG~ys~S~t~~--~p~G~lylt~--- 110 (132) T protein:vir:16 48 ---------YAMIAEKPSYFASVVKSVTVDIVARTLMTSTDQE---PMTQTTESALGYSVSGSYL--VPGGGLFIKN--- 110 (132) T ss_pred ---------ccccccccccchhHHHHHHHHHHHHHhcCCCCCC---Cceeeeeeccchheeeeee--cCCCcceeCh--- Confidence 1111112 22344567788888888886652211 112246788988 55564 3444333323 Q ss_pred HHHhhhhccCCceeeeeeC Q lcl|NC_020854. 168 RYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 168 ~lL~~l~~~~~g~~~~~r~ 186 (186) ..++-|..++.+.+.|-=. T Consensus 111 ~e~~~LG~~~~r~~~i~~~ 129 (132) T protein:vir:16 111 SELSRLGLKKQRFGVIDFY 129 (132) T ss_pred HHHHhhCCCCCceEEEeec Confidence 3333333455555555555 No 24 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=84.73 E-value=0.058 Score=27.36 Aligned_cols=126 Identities=10% Similarity=0.020 Sum_probs=65.4 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccC Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTG 80 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g 80 (186) |++++ ++++..+-|= ..+.-...+++..+..|..|..+|+..+|. |... T Consensus 1 m~v~f-------------d~~~Fr~~fP------eFad~~~~pd~~i~~~l~~A~~~l~~~~~~-----------~~~~- 49 (147) T protein:vir:10 1 MDHTL-------------DITKFRALFP------EFNNDVKYPDALLEQWYAVAGEYLGLTDYA-----------CGLN- 49 (147) T ss_pred Cceec-------------CHHHHHHhcc------cccCCccCCHHHHHHHHHHHHHhhccccCC-----------cccC- Confidence 88875 4555554331 111112468899999999999999865541 1110 Q ss_pred ccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccC--CCCCCcccceeEEecCeeEEeecCCCCcC Q lcl|NC_020854. 81 VRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKD--GIGLSGLEDYKNVKIGSIDVTPNQYGATG 158 (186) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~--~~~~~~~~~v~~~kvG~isveY~~~~~~~ 158 (186) ...-.++.+.|+.+++.-.. .......+.|++.++|.|||.|+...... T Consensus 50 -----------------------------g~~~~~~l~Ll~AHll~l~~~~~~g~g~~G~v~Sas~G~VSVSy~~~~~~~ 100 (147) T protein:vir:10 50 -----------------------------GNTLDLALMQLTAHLMKSATILSSNKGAPMVMTSATIDKVSISTLAPPIKN 100 (147) T ss_pred -----------------------------hhhHHHHHHHHHHHHHHHHHhhccCCCcccceeeeeecceeeeeecCCCCC Confidence 13445566666666553321 11223356789999999999998542211 Q ss_pred c------ccchH-HHHHHHhhh-----hccCCceeeeeeC Q lcl|NC_020854. 159 A------DRIPP-MVERYLTGL-----RISGPGNIAVKRS 186 (186) Q Consensus 159 ~------~~~~~-~v~~lL~~l-----~~~~~g~~~~~r~ 186 (186) . .+.|- -.-+|++.+ +.+|.+.=.=.|. T Consensus 101 ~~~~w~~~T~YGq~y~~l~~~~~~Gg~vvgG~p~r~a~r~ 140 (147) T protein:vir:10 101 GWQYWLSTTPYGQMLWALLSMRSSGGFVYGGSPELSGYRR 140 (147) T ss_pred cchhhhhcCHHHHHHHHHHHhhCccceecCCCCccccccc Confidence 1 11111 112344333 3333332222222 No 25 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=84.05 E-value=0.064 Score=27.15 Aligned_cols=109 Identities=11% Similarity=0.005 Sum_probs=63.7 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) -.|+++..+-|= +....+++..+..|-.|..+|+.-+| | T Consensus 1 m~t~~~Fr~~~P---------eF~~~pd~~i~~~l~~A~~~l~~~~~----------------g---------------- 39 (119) T protein:vir:52 1 MPLTEDFLLRYT---------EFGKTDAKRIGLFLSDAQAEVSKVQW----------------G---------------- 39 (119) T ss_pred CCcHHHHHHhhh---------hccCCCHHHHHHHHHHHHHhhCCcCC----------------c---------------- Confidence 677887776552 35568999999999999999975444 1 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCC---CCCCcccceeEEecCeeEEeecCCCCcCc------ccch-HHH Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDG---IGLSGLEDYKNVKIGSIDVTPNQYGATGA------DRIP-PMV 166 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~---~~~~~~~~v~~~kvG~isveY~~~~~~~~------~~~~-~~v 166 (186) ..-.++.+.++.+++.-... -.....+.+++.++|.|+|+|+....... .+.| .-. T Consensus 40 --------------~~~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y 105 (119) T protein:vir:52 40 --------------KLYDRGVMALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEY 105 (119) T ss_pred --------------hHHHHHHHHHHHHHHHhhhhhhccccccccceeeeeecceeeeeeccccCCcchhhhhcCHHHHHH Confidence 12234555666665532111 11123467999999999999975432211 1111 112 Q ss_pred HHHHhhhhccCCceee Q lcl|NC_020854. 167 ERYLTGLRISGPGNIA 182 (186) Q Consensus 167 ~~lL~~l~~~~~g~~~ 182 (186) -.|++.+ +.||.++ T Consensus 106 ~~L~r~~--g~Gg~Va 119 (119) T protein:vir:52 106 LRLRRLI--GVGVMVA 119 (119) T ss_pred HHHHHHh--cCCCcCC Confidence 2444443 3345555 No 26 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=83.69 E-value=0.045 Score=27.97 Aligned_cols=113 Identities=18% Similarity=0.122 Sum_probs=64.6 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVG 94 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~ 94 (186) =+++||++++.+.. . ....+++-.+..+.-|.+++. +|.|++...+|.-...-..... T Consensus 1 M~~vtLee~K~hLR--v-------d~d~dD~lI~~li~AA~~~ve--~~l~r~l~~~~~~~~~~~~~~~----------- 58 (113) T protein:vir:10 1 MALVELKLALGFVR--A-------NAGVEDDVVQMLLDAATQSAV--DYLNRQVFETEDAMTTAIEAGT----------- 58 (113) T ss_pred CCCCCHHHHHHHcC--C-------CCCcchHHHHHHHHHHHHHHH--HHhCcccccccccccccccccc----------- Confidence 45679999999883 2 233456666666666667775 3555555444443222211100 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhh Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLR 174 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~ 174 (186) .......||..++.|+..|.-....+-+. +..|. ....+..++.||.+|. T Consensus 59 -------~~~~~~~~p~~i~~AvLllv~~~Y~nRe~-----------~~~~~------------~~~lP~~v~~Ll~~yR 108 (113) T protein:vir:10 59 -------AGQNPMVVNAAIRAAILKITAELYANRED-----------TAFGP------------ITELPLNARALLRPHR 108 (113) T ss_pred -------ccccccccChHHHHHHHHHHHHHHhhhhh-----------hchhh------------hhccCHHHHHHHHHhh Confidence 01123569999999999999888766211 01111 1123455788999876 Q ss_pred ccCCc Q lcl|NC_020854. 175 ISGPG 179 (186) Q Consensus 175 ~~~~g 179 (186) .-.|- T Consensus 109 ~~~g~ 113 (113) T protein:vir:10 109 IIPGV 113 (113) T ss_pred hhcCC Confidence 54332 No 27 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=76.53 E-value=0.09 Score=26.30 Aligned_cols=123 Identities=20% Similarity=0.174 Sum_probs=62.9 Q ss_pred cceecHHHHHHHHHhhcc---ccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhh Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVE---NDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTY 91 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~---~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~ 91 (186) =+|+|++|..+.|....- ++........+++-.+++|..|+..||++ .+.| - T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgy--L~~R-----------Y------------ 55 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGY--LAAR-----------F------------ 55 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHH--Hhhc-----------c------------ Confidence 679999999877653221 11112234567778899999999999974 1111 0 Q ss_pred ccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccc-------eeEEecCeeEEeecCCCCc---Cc-- Q lcl|NC_020854. 92 AVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLED-------YKNVKIGSIDVTPNQYGAT---GA-- 159 (186) Q Consensus 92 ~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~-------v~~~kvG~isveY~~~~~~---~~-- 159 (186) .+|-..+|.-|+..||-+|.+.|.+... +..-.++ .+.+.-|.++.--...+.. +. T Consensus 56 -----------~lPl~~~P~~L~~~a~dIA~Y~L~~~~~-~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~ 123 (141) T protein:vir:19 56 -----------VLPLTVVPSLLKRQCCVVAWFYLNESQP-TEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDL 123 (141) T ss_pred -----------cCCccccchHHHHHHHHHHHHHHhcCCC-ChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCce Confidence 1345679999999999999998865332 1110000 1122224444422111000 00 Q ss_pred ---ccchHHHHHHHhhhh Q lcl|NC_020854. 160 ---DRIPPMVERYLTGLR 174 (186) Q Consensus 160 ---~~~~~~v~~lL~~l~ 174 (186) .........=++||+ T Consensus 124 ~~~~~~~r~f~r~~~G~~ 141 (141) T protein:vir:19 124 VQVQSDPPVFSRKQKGFI 141 (141) T ss_pred eEeecCCcccCcccccCC Confidence 000111111122222 No 28 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=76.48 E-value=0.11 Score=25.85 Aligned_cols=118 Identities=16% Similarity=0.123 Sum_probs=61.3 Q ss_pred cceecHHHHHHHHHhhcc----ccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVE----NDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~----~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) =+|+|++|..+.|...-- ++........+++-.+++|..|+..||++ .+.| . T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgy--L~~R-----------Y----------- 56 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLH--LHAR-----------Y----------- 56 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHH--Hhhc-----------c----------- Confidence 679999999876543321 11111233567778999999999999974 1111 0 Q ss_pred hccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccc-------eeEEecCeeEEeecCCCCcCcccch Q lcl|NC_020854. 91 YAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLED-------YKNVKIGSIDVTPNQYGATGADRIP 163 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~-------v~~~kvG~isveY~~~~~~~~~~~~ 163 (186) .+|-..+|.-|+..||-+|.+.+.+.......-.++ .+.+.-|.++..-...+.. T Consensus 57 ------------~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~------ 118 (138) T protein:vir:10 57 ------------QLPLAQVPVVLKRVACVLAFANLHTQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTP------ 118 (138) T ss_pred ------------cCCccccchHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCccc------ Confidence 134568999999999999999986432211100000 0111113333322111100 Q ss_pred HHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 164 PMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 164 ~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) ..+++++. ..| T Consensus 119 -----------~~~~~~~~-~~s 129 (138) T protein:vir:10 119 -----------APIANTVQ-ISS 129 (138) T ss_pred -----------CCCCCcee-eec Confidence 01112222 222 No 29 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=75.04 E-value=0.15 Score=25.08 Aligned_cols=120 Identities=15% Similarity=0.066 Sum_probs=66.8 Q ss_pred CCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhh Q lcl|NC_020854. 11 AADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINT 90 (186) Q Consensus 11 ~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~ 90 (186) .++=-+++||++++.+..- ....+++..+..+-.|.+|+.+ |.|++.-.++... +-..+..+ T Consensus 1 ~~~~m~~vtL~e~K~hLRv---------d~d~DD~lI~~~i~AA~~~v~~--~~~r~l~~~~~~~-~~~~~~~~------ 62 (120) T protein:vir:10 1 MADQTPIVSLEVALAHLRE---------DAGVADDLIKIYIGAATQSASD--YVDRKLYANDAEM-QAAVADAT------ 62 (120) T ss_pred CCCCCCccCHHHHHHHcCC---------CCCcchHHHHHHHHHHHHHHHH--HhCCccccccccc-chhhhccc------ Confidence 4556688899999998842 2345666677767777777764 4455433322111 10000000 Q ss_pred hccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHH Q lcl|NC_020854. 91 YAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYL 170 (186) Q Consensus 91 ~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL 170 (186) .......||..++.|++.|.-....+-+... +|. .......+..++.|| T Consensus 63 -----------~~~~~~~~~~~i~~AvLllvg~~YenRe~~~-----------~~~---------~~~~~~lP~~v~~Ll 111 (120) T protein:vir:10 63 -----------AGADPIVANDAIRAAILLTIGKLYAFREDVV-----------SGA---------SASVTELPSGAKSLL 111 (120) T ss_pred -----------cccccccCCHHHHHHHHHHHHHHHhchhhhh-----------hcc---------cccccccCHHHHHHH Confidence 0112346999999999999988876632211 011 011122344588899 Q ss_pred hhhhccCCce Q lcl|NC_020854. 171 TGLRISGPGN 180 (186) Q Consensus 171 ~~l~~~~~g~ 180 (186) .+|...=| - T Consensus 112 ~~yR~~~g-v 120 (120) T protein:vir:10 112 FPYRVGLG-V 120 (120) T ss_pred HHhhhccC-C Confidence 88753222 1 No 30 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=74.68 E-value=0.16 Score=25.02 Aligned_cols=104 Identities=13% Similarity=0.154 Sum_probs=59.4 Q ss_pred ceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccc---cccCccCccccccchhhhc Q lcl|NC_020854. 16 SYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQA---LQWPRTGVRKPDTYINTYA 92 (186) Q Consensus 16 SYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~---laWPR~g~~~~~~~~~~~~ 92 (186) =++|+++++.|.. ...| ...+++-.+..+-.|.+++. +|.|++....+. -.||. T Consensus 1 M~vtL~e~K~hLR--id~D-----~~ddD~li~~~i~aA~~~i~--~~~~r~l~~~~~~~~~~~~~-------------- 57 (107) T protein:vir:48 1 MLLKEEEIKSHLR--LDDG-----LYSDGDFLKLLAQAVQKRTE--TYLNRKLYAPEETIPEDDPD-------------- 57 (107) T ss_pred CCCCHHHHHHHcC--CCCC-----CchhHHHHHHHHHHHHHHHH--HHhccccccccccccccCcc-------------- Confidence 7889999999873 2211 11244445666666777776 455554433221 22332 Q ss_pred cccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhh Q lcl|NC_020854. 93 VGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTG 172 (186) Q Consensus 93 ~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~ 172 (186) .-.||..+|.|+..|+-....+- +.+..-+ ....+..++.||.+ T Consensus 58 -------------~~~~~~~ik~Avlllv~~~Y~NR-------------e~v~~~~----------~~~iP~~v~~LL~~ 101 (107) T protein:vir:48 58 -------------GMHLTDDVRLAMLMLVSHFYENR-------------STITDVE----------KLETPMSFRWLAGP 101 (107) T ss_pred -------------ccccchhHHHHHHHHHHHHHhhh-------------hhhcccc----------ccccCHHHHHHHHH Confidence 12589999999999999887662 2111101 11233457788888 Q ss_pred hhccCC Q lcl|NC_020854. 173 LRISGP 178 (186) Q Consensus 173 l~~~~~ 178 (186) |..-+- T Consensus 102 yR~~~l 107 (107) T protein:vir:48 102 YRIVPL 107 (107) T ss_pred hhccCC Confidence 754332 No 31 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=74.40 E-value=0.14 Score=25.28 Aligned_cols=107 Identities=15% Similarity=0.189 Sum_probs=61.3 Q ss_pred ceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccc Q lcl|NC_020854. 16 SYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGF 95 (186) Q Consensus 16 SYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~ 95 (186) =++||++++.|.. ...| ...+++-.+..+-.|.+|+. +|.|++....+.. ||-. .++ T Consensus 1 M~vtL~e~K~hLR--Id~D-----~~ddD~lI~~~i~AA~~~i~--~~~~r~~~~~~~~-~~~~---~~~---------- 57 (107) T protein:vir:45 1 MLLKMEEIKLQLR--LDDD-----FSDEDELLELLGKAAQSRTE--NFLNRKLYATADD-RPAD---DPD---------- 57 (107) T ss_pred CCCCHHHHHHHcC--CCCC-----CchhHHHHHHHHHHHHHHHH--HHhcccccccccc-cccc---ccc---------- Confidence 7889999999873 2111 11234446666667778886 4667666544332 3321 111 Q ss_pred ccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhc Q lcl|NC_020854. 96 PFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRI 175 (186) Q Consensus 96 ~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~ 175 (186) .-.||..++.|++.|.-....+-.. +.. .+....+..++.||.+|.. T Consensus 58 ----------~~~~~~~~~~AvLllv~~~Y~NRe~-------------~~~----------~~~~~lp~~v~~Ll~~~R~ 104 (107) T protein:vir:45 58 ----------GLVISDDVKLALLLLVSHFYENRST-------------VTD----------VEKMELPMSFNWLVAPYRL 104 (107) T ss_pred ----------cccCChhHHHHHHHHHHHHHhhhhh-------------ccc----------cchhccchHHHHHHHHHhh Confidence 1257999999999998887766211 110 0011234457888888754 Q ss_pred cCC Q lcl|NC_020854. 176 SGP 178 (186) Q Consensus 176 ~~~ 178 (186) =+- T Consensus 105 ~~~ 107 (107) T protein:vir:45 105 IPL 107 (107) T ss_pred cCC Confidence 332 No 32 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=73.13 E-value=0.17 Score=24.75 Aligned_cols=102 Identities=18% Similarity=0.106 Sum_probs=61.9 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccC Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTG 80 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g 80 (186) |++. -=+++||++++.|.. .. ...+++..+..+-.|.+++.. |.|++... T Consensus 1 ~~~~---------~M~~vtLee~K~hLR--id-------~dddD~lI~~~i~AA~~~v~~--~~~~~~~~---------- 50 (108) T protein:vir:18 1 MAID---------VLDVISLSLFKQQIE--FE-------EDDRDELITLYAQAAFDYCMR--WCDEPAWK---------- 50 (108) T ss_pred CCCC---------cccccCHHHHHHHcC--CC-------CCcchHHHHHHHHHHHHHHHH--HhCCcccc---------- Confidence 6665 357899999999873 22 234666676666677777753 33332110 Q ss_pred ccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcc Q lcl|NC_020854. 81 VRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGAD 160 (186) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~ 160 (186) ....+|..|+.|+..|+-...++- +.++.. .. T Consensus 51 ------------------------~~~~~p~~ik~AiLllv~~~YenR-------------E~~~~~-----------~~ 82 (108) T protein:vir:18 51 ------------------------VAADIPAAVKGAVLLVFADMFEHR-------------TAQSEV-----------QL 82 (108) T ss_pred ------------------------cccccchHHHHHHHHHHHHHHhcc-------------cccccc-----------hh Confidence 134689999999999998888662 222110 11 Q ss_pred cchHHHHHHHhhhhccCC------ce Q lcl|NC_020854. 161 RIPPMVERYLTGLRISGP------GN 180 (186) Q Consensus 161 ~~~~~v~~lL~~l~~~~~------g~ 180 (186) ..++.++.||.++..=.| |+ T Consensus 83 ~~~~~~~~LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:18 83 YENAAAERMMFIHRNWRGKAESEEGS 108 (108) T ss_pred hhhHHHHHHHHHHHhcCCCCCcccCC Confidence 123568888887652211 33 No 33 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=73.13 E-value=0.17 Score=24.75 Aligned_cols=102 Identities=18% Similarity=0.106 Sum_probs=61.9 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccC Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTG 80 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g 80 (186) |++. -=+++||++++.|.. .. ...+++..+..+-.|.+++.. |.|++... T Consensus 1 ~~~~---------~M~~vtLee~K~hLR--id-------~dddD~lI~~~i~AA~~~v~~--~~~~~~~~---------- 50 (108) T protein:vir:19 1 MAID---------VLDVISLSLFKQQIE--FE-------EDDRDELITLYAQAAFDYCMR--WCDEPAWK---------- 50 (108) T ss_pred CCCC---------cccccCHHHHHHHcC--CC-------CCcchHHHHHHHHHHHHHHHH--HhCCcccc---------- Confidence 6665 357899999999873 22 234666676666677777753 33332110 Q ss_pred ccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcc Q lcl|NC_020854. 81 VRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGAD 160 (186) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~ 160 (186) ....+|..|+.|+..|+-...++- +.++.. .. T Consensus 51 ------------------------~~~~~p~~ik~AiLllv~~~YenR-------------E~~~~~-----------~~ 82 (108) T protein:vir:19 51 ------------------------VAADIPAAVKGAVLLVFADMFEHR-------------TAQSEV-----------QL 82 (108) T ss_pred ------------------------cccccchHHHHHHHHHHHHHHhcc-------------cccccc-----------hh Confidence 134689999999999998888662 222110 11 Q ss_pred cchHHHHHHHhhhhccCC------ce Q lcl|NC_020854. 161 RIPPMVERYLTGLRISGP------GN 180 (186) Q Consensus 161 ~~~~~v~~lL~~l~~~~~------g~ 180 (186) ..++.++.||.++..=.| |+ T Consensus 83 ~~~~~~~~LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:19 83 YENAAAERMMFIHRNWRGKAESEEGS 108 (108) T ss_pred hhhHHHHHHHHHHHhcCCCCCcccCC Confidence 123568888887652211 33 No 34 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=71.42 E-value=0.028 Score=29.14 Aligned_cols=115 Identities=16% Similarity=0.130 Sum_probs=59.8 Q ss_pred cceecHHHHHHHHHhhc----cccccccccCCCHHHHHHHHHHHHHHHhhh---ccCcccCCcccccccCccCccccccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLV----ENDDVVAWATATADQKNRALYTATQRLDRE---RYLGARATDTQALQWPRTGVRKPDTY 87 (186) Q Consensus 15 nSYvsla~AdaY~~~r~----~~~~~~~w~~~~~~~ke~aL~~Atd~id~~---~~~G~r~~~~Q~laWPR~g~~~~~~~ 87 (186) =+|+|++|..+-|...- ..+........+++-.+++|..|+..||++ +| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY------------------------ 56 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRY------------------------ 56 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcc------------------------ Confidence 67999999987654321 111111233467777899999999999975 22 Q ss_pred hhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccc-------eeEEecCeeEEeecCCCCcCcc Q lcl|NC_020854. 88 INTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLED-------YKNVKIGSIDVTPNQYGATGAD 160 (186) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~-------v~~~kvG~isveY~~~~~~~~~ 160 (186) .+|-..+|.-|+..||-+|.+.+.+.......-.++ .+.+.-|.++.--...++. T Consensus 57 ---------------~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~--- 118 (138) T protein:vir:99 57 ---------------QLPLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKP--- 118 (138) T ss_pred ---------------cCCccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcC--- Confidence 134567999999999999999886532211100101 0111113333321111000 Q ss_pred cchHHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 161 RIPPMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 161 ~~~~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) ..+++.+.| .+ T Consensus 119 --------------~~~~~~~~~-~~ 129 (138) T protein:vir:99 119 --------------APVANTVQI-SE 129 (138) T ss_pred --------------CCCCCceee-ec Confidence 001111111 11 No 35 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=71.42 E-value=0.028 Score=29.14 Aligned_cols=115 Identities=16% Similarity=0.130 Sum_probs=59.8 Q ss_pred cceecHHHHHHHHHhhc----cccccccccCCCHHHHHHHHHHHHHHHhhh---ccCcccCCcccccccCccCccccccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLV----ENDDVVAWATATADQKNRALYTATQRLDRE---RYLGARATDTQALQWPRTGVRKPDTY 87 (186) Q Consensus 15 nSYvsla~AdaY~~~r~----~~~~~~~w~~~~~~~ke~aL~~Atd~id~~---~~~G~r~~~~Q~laWPR~g~~~~~~~ 87 (186) =+|+|++|..+-|...- ..+........+++-.+++|..|+..||++ +| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY------------------------ 56 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRY------------------------ 56 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcc------------------------ Confidence 67999999987654321 111111233467777899999999999975 22 Q ss_pred hhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccc-------eeEEecCeeEEeecCCCCcCcc Q lcl|NC_020854. 88 INTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLED-------YKNVKIGSIDVTPNQYGATGAD 160 (186) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~-------v~~~kvG~isveY~~~~~~~~~ 160 (186) .+|-..+|.-|+..||-+|.+.+.+.......-.++ .+.+.-|.++.--...++. T Consensus 57 ---------------~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~--- 118 (138) T protein:vir:79 57 ---------------QLPLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKP--- 118 (138) T ss_pred ---------------cCCccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcC--- Confidence 134567999999999999999886532211100101 0111113333321111000 Q ss_pred cchHHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 161 RIPPMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 161 ~~~~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) ..+++.+.| .+ T Consensus 119 --------------~~~~~~~~~-~~ 129 (138) T protein:vir:79 119 --------------APVANTVQI-SE 129 (138) T ss_pred --------------CCCCCceee-ec Confidence 001111111 11 No 36 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=70.87 E-value=0.18 Score=24.68 Aligned_cols=120 Identities=13% Similarity=0.042 Sum_probs=59.9 Q ss_pred CcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhh Q lcl|NC_020854. 12 ADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTY 91 (186) Q Consensus 12 ~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~ 91 (186) -+-...++ .-+-|-.+ +-+....+++..+..+--|..||..- T Consensus 1 ~~~~~~~~---~ve~fR~l-----~PeF~dvPde~i~~~~d~A~~~v~~~------------------------------ 42 (136) T protein:vir:10 1 MNQETLIA---VVEQMRKL-----VPALRKVPDETLYAWVEMAELFVCQK------------------------------ 42 (136) T ss_pred CCchHHHH---HHHHHHHh-----ccccccCCHHHHHHHHHHHHHhhcCC------------------------------ Confidence 01111112 22222222 12345567777777777777777411 Q ss_pred ccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCC------CCCCcccceeE-EecCeeEEeecCCCCcCccc--- Q lcl|NC_020854. 92 AVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDG------IGLSGLEDYKN-VKIGSIDVTPNQYGATGADR--- 161 (186) Q Consensus 92 ~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~------~~~~~~~~v~~-~kvG~isveY~~~~~~~~~~--- 161 (186) .-.+...+|...++++++..+.. ......++|++ ...|+++|+|+...+++... T Consensus 43 ----------------~~Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~rv~ssat~GevSVS~a~~s~~~s~~WL~ 106 (136) T protein:vir:10 43 ----------------TFKDAYVKALALYALHLAFLDGALKGEDEDLESYSRRVTSFSLSGEFSQTFGEVTKNQSGDMML 106 (136) T ss_pred ----------------CChhHHHHHHHHHHHHHHhcccccccccccccccccceehheeccceeEeeccccCchhhHhhh Confidence 22246667777888888733221 11223455666 55899999998654443311 Q ss_pred chHHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 162 IPPMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 162 ~~~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) .=||= +++..|++-.+|.|.+.-+ T Consensus 107 ~TpyG-q~y~aL~k~~~gGf~l~t~ 130 (136) T protein:vir:10 107 STPWG-KMFEQLKARRRGRFALMTG 130 (136) T ss_pred cCHHH-HHHHHHHhhcccchhhhhc Confidence 01222 2444444445555555545 No 37 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=70.16 E-value=0.16 Score=24.98 Aligned_cols=116 Identities=14% Similarity=0.077 Sum_probs=58.1 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcc Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAV 93 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~ 93 (186) -| ++--+++-. .+-+....+++..+.-+--|-.|| T Consensus 1 ~~-----~~~~e~~R~-----l~P~f~kvpdevI~~wielA~lfV----------------------------------- 35 (132) T protein:vir:10 1 MN-----DAILAFMRS-----LVPALKAVDDESINVWIDLARLYV----------------------------------- 35 (132) T ss_pred Cc-----hHHHHHHHH-----hcchhhcCChHHHHHHHHHHHHHH----------------------------------- Confidence 00 011111111 111233444444444443333333 Q ss_pred ccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCC-CCcccc-----eeEEec-CeeEEeecCCCCcCcc-cchHH Q lcl|NC_020854. 94 GFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIG-LSGLED-----YKNVKI-GSIDVTPNQYGATGAD-RIPPM 165 (186) Q Consensus 94 ~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~-~~~~~~-----v~~~kv-G~isveY~~~~~~~~~-~~~~~ 165 (186) ..+..+++...|...+|++++..+.... .+.+.. |++-++ |+++++|.+.+..++. ..-|| T Consensus 36 -----------c~~~~g~~~~~AlaL~taHLm~~dga~k~en~~~~t~S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~ 104 (132) T protein:vir:10 36 -----------CADKFGNDADRAVGLYALHLMLSDGAFKGENEGLETYSRRMASYSLSGEFSITYDNQSAIQGDLSSSSW 104 (132) T ss_pred -----------HhhcCchhHHHHHHHHHHHHhhccccccccccchhhhhhhhhhhcccCceeeecccccccccccccCcH Confidence 2345566777777777888876644332 223333 444443 9999999865443321 11234 Q ss_pred HHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 166 VERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 166 v~~lL~~l~~~~~g~~~~~r~ 186 (186) - .|+..|+.-.+|.|.+--+ T Consensus 105 G-kl~~~L~k~~~GgfgL~t~ 124 (132) T protein:vir:10 105 G-RMYKALLRKKGGGFGLITS 124 (132) T ss_pred H-HHHHHHHHhccCccccccc Confidence 4 6667776666667766555 No 38 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=69.15 E-value=0.23 Score=24.11 Aligned_cols=116 Identities=22% Similarity=0.168 Sum_probs=65.5 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHH---HHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhh Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQ---KNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTY 91 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~---ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~ 91 (186) -+|+|++|..+.|. -+.++++ .+++|-.|++.|-+ .+|.-..+ T Consensus 1 ~afAtv~Dve~rw~-----------r~LT~eE~~~ae~lL~dAs~~IR~--------------~iP~La~~--------- 46 (149) T protein:vir:78 1 MAYAEPSDVVARLG-----------RPLTDDEETQVETFLEDAEIEIRS--------------RIPDLDDK--------- 46 (149) T ss_pred CCcCCHHHHHHHhC-----------CCCCHHHHHHHHHHHHHHHHHHHH--------------hccccccc--------- Confidence 67899999887552 2333333 56778889998863 23332111 Q ss_pred ccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHh Q lcl|NC_020854. 92 AVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLT 171 (186) Q Consensus 92 ~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~ 171 (186) +++..+=..+++=+|..-.+.+.+++ ++++++.|..+-......++|.-..-+-=-.+|+ T Consensus 47 ------------~~dp~~~a~v~~V~~~mV~R~~rnpe--------G~~S~T~G~YS~slt~~np~G~LylT~~E~a~LG 106 (149) T protein:vir:78 47 ------------AEDEDYLKRVIKVEASAVTRLIRNPD--------GYIGETDGNYSYQLNWRLNTGAIEITDKEWAQLG 106 (149) T ss_pred ------------cCCcchhhHHHHHHHHHHHHHhcCCC--------CeeeeecchhhhhhhccCCCCceeeCHHHHHhhC Confidence 12223335677778888888876654 3567888877655444444454433333334554 Q ss_pred hhhccCCceeeee------eC Q lcl|NC_020854. 172 GLRISGPGNIAVK------RS 186 (186) Q Consensus 172 ~l~~~~~g~~~~~------r~ 186 (186) + +...|+|.|+ || T Consensus 107 ~--~r~~G~~~i~p~~~~~~~ 125 (149) T protein:vir:78 107 L--SKNVGVLNVRPKTPLERS 125 (149) T ss_pred C--cccccceeecccCccccC Confidence 4 2233666654 44 No 39 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=69.15 E-value=0.23 Score=24.11 Aligned_cols=116 Identities=22% Similarity=0.168 Sum_probs=65.5 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHH---HHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhh Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQ---KNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTY 91 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~---ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~ 91 (186) -+|+|++|..+.|. -+.++++ .+++|-.|++.|-+ .+|.-..+ T Consensus 1 ~afAtv~Dve~rw~-----------r~LT~eE~~~ae~lL~dAs~~IR~--------------~iP~La~~--------- 46 (149) T protein:vir:78 1 MAYAEPSDVVARLG-----------RPLTDDEETQVETFLEDAEIEIRS--------------RIPDLDDK--------- 46 (149) T ss_pred CCcCCHHHHHHHhC-----------CCCCHHHHHHHHHHHHHHHHHHHH--------------hccccccc--------- Confidence 67899999887552 2333333 56778889998863 23332111 Q ss_pred ccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHh Q lcl|NC_020854. 92 AVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLT 171 (186) Q Consensus 92 ~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~ 171 (186) +++..+=..+++=+|..-.+.+.+++ ++++++.|..+-......++|.-..-+-=-.+|+ T Consensus 47 ------------~~dp~~~a~v~~V~~~mV~R~~rnpe--------G~~S~T~G~YS~slt~~np~G~LylT~~E~a~LG 106 (149) T protein:vir:78 47 ------------AEDEDYLKRVIKVEASAVTRLIRNPD--------GYIGETDGNYSYQLNWRLNTGAIEITDKEWAQLG 106 (149) T ss_pred ------------cCCcchhhHHHHHHHHHHHHHhcCCC--------CeeeeecchhhhhhhccCCCCceeeCHHHHHhhC Confidence 12223335677778888888876654 3567888877655444444454433333334554 Q ss_pred hhhccCCceeeee------eC Q lcl|NC_020854. 172 GLRISGPGNIAVK------RS 186 (186) Q Consensus 172 ~l~~~~~g~~~~~------r~ 186 (186) + +...|+|.|+ || T Consensus 107 ~--~r~~G~~~i~p~~~~~~~ 125 (149) T protein:vir:78 107 L--SKNVGVLNVRPKTPLERS 125 (149) T ss_pred C--cccccceeecccCccccC Confidence 4 2233666654 44 No 40 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=68.80 E-value=0.23 Score=24.06 Aligned_cols=114 Identities=14% Similarity=0.071 Sum_probs=64.0 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCccc-ccccCccCccccccchhhhcccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQ-ALQWPRTGVRKPDTYINTYAVGF 95 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q-~laWPR~g~~~~~~~~~~~~~~~ 95 (186) -+||++++.+. |...+ -...+++-.+..|-.|.+++.+ |.|++.-..| .+..+..+...+ T Consensus 1 mvtLe~~K~hL--Rid~~----d~d~dD~li~~~i~AA~~~v~~--~~~r~l~~~~~~~~~~~~~~~~~----------- 61 (115) T protein:vir:10 1 MITLAMVQRHL--QAELY----EDDERDYVMQQLLPAARESAEL--FINRKLYDTQADMLADQAAGVDP----------- 61 (115) T ss_pred CCCHHHHHHHc--CCCCC----CCchhhHHHHHHHHHHHHHHHH--HhCCcccccccccccccccccCC----------- Confidence 89999999988 33211 0123455566666677777764 5555543321 222222221110 Q ss_pred ccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhc Q lcl|NC_020854. 96 PFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRI 175 (186) Q Consensus 96 ~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~ 175 (186) -....||..+++|...|.-....+-+. +.+|. . ...+..++.||.+|.. T Consensus 62 --------~~~~~~p~~i~~AiLLlvg~~Y~nRe~-----------~~~~~----------~--~elP~~v~~LL~pyR~ 110 (115) T protein:vir:10 62 --------AGQLLITRTVEQAILLTVGEWYANREQ-----------VWVKG----------V--GLVTSSAQNLLHPYRK 110 (115) T ss_pred --------cccccCChHHHHHHHHHHHHHHhcchh-----------cccch----------h--hhcCHHHHHHHHHHHh Confidence 112359999999999999988866211 11121 1 1234458999999876 Q ss_pred cCCceee Q lcl|NC_020854. 176 SGPGNIA 182 (186) Q Consensus 176 ~~~g~~~ 182 (186) -+|= + T Consensus 111 ~~gv--~ 115 (115) T protein:vir:10 111 FAGV--R 115 (115) T ss_pred cCCC--C Confidence 6542 2 No 41 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=66.70 E-value=0.26 Score=23.76 Aligned_cols=113 Identities=16% Similarity=0.081 Sum_probs=58.9 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccc-cC-ccCccccccchhhhccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQ-WP-RTGVRKPDTYINTYAVG 94 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~la-WP-R~g~~~~~~~~~~~~~~ 94 (186) -+||++++.+. |...|. ...++...+..+-.|.+++. +|.|++.-.+|... ++ -.+...++ T Consensus 1 mvtLee~K~hL--Rid~d~----~d~DDali~~~i~AA~~~v~--~~l~r~l~~~~~~~~~~~~~~~~~~~--------- 63 (115) T protein:vir:97 1 MITLAMMQRHL--QAELYE----DDERDYVMQQLLPAARESAE--LFLNRKLYDVQADMLADQVLGVDPSD--------- 63 (115) T ss_pred CCCHHHHHHHc--CCCCCC----CchhhHHHHHHHHHHHHHHH--HHhCCcccchhhcccccccccCCCcc--------- Confidence 89999999987 332210 01122234544545555554 45555543333211 11 11111010 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhh Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLR 174 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~ 174 (186) .-.||..||+|...|.-....+- |.+.. ++ ....+..++.||.+|. T Consensus 64 -----------~~~~p~~i~~AiLllvg~~Y~NR-------------E~v~~--------~~--~~elP~~~~~LL~pyR 109 (115) T protein:vir:97 64 -----------QLLITRTVEQAILLTVGEWYSSR-------------EQVWI--------KG--AGLVTSSAQNLLHPYR 109 (115) T ss_pred -----------cccCCHHHHHHHHHHHHHHHhcc-------------ccccc--------cc--ccccCHHHHHHHHHHH Confidence 12489999999999998887662 22111 01 1123456789999986 Q ss_pred ccCCceee Q lcl|NC_020854. 175 ISGPGNIA 182 (186) Q Consensus 175 ~~~~g~~~ 182 (186) .-.|= + T Consensus 110 ~~~Gv--~ 115 (115) T protein:vir:97 110 KFAGV--R 115 (115) T ss_pred hhcCC--C Confidence 64432 2 No 42 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=65.86 E-value=0.28 Score=23.64 Aligned_cols=119 Identities=13% Similarity=0.050 Sum_probs=68.6 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVG 94 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~ 94 (186) =+|++++|..+.|.--. .+......+..|-.|++.|-+ .||.-..+ T Consensus 1 ~~~At~~Dv~~rw~r~L--------t~~E~~~ve~lL~dAs~~ir~--------------r~P~l~~~------------ 46 (124) T protein:vir:24 1 MAYATADDVVTLWAKEP--------EPEVMALIERRLEQVERMIRR--------------RIPDLDAR------------ 46 (124) T ss_pred CCCCCHHHHHHHhCCCC--------CHHHHHHHHHHHHHHHHHHHh--------------cCCCcchh------------ Confidence 67899999887663110 112223357778889888862 34543221 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhh Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLR 174 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~ 174 (186) +..+.-|..|+.=+|+.-.+++.+++ +++++..|.-+-.-....++|.-..-+-=-.+|++ T Consensus 47 ---------~~~~~~~~~v~~V~a~~V~R~~rnP~--------G~~s~T~G~Ys~sl~~~~~~g~Lylt~~E~~~Lg~-- 107 (124) T protein:vir:24 47 ---------VSSDIFRADLIDIEADAVLRLVRNPE--------GYLSETDGAYTYQLQADLSQGKLVILDEEWTTLGV-- 107 (124) T ss_pred ---------cCCCCChhhHHHHHHHHHHHHhhCCC--------CceecccchhHHhhhhcccCCceeeCHHHHHhhCc-- Confidence 12234556788888888888887654 34567778765554433444544333333345554 Q ss_pred ccCCceeeeeeC Q lcl|NC_020854. 175 ISGPGNIAVKRS 186 (186) Q Consensus 175 ~~~~g~~~~~r~ 186 (186) ..+.|.|.|+=. T Consensus 108 ~r~~~~~~i~p~ 119 (124) T protein:vir:24 108 NRLSRMSTLVPN 119 (124) T ss_pred ccccceeEeecc Confidence 224577777666 No 43 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=65.30 E-value=0.053 Score=27.57 Aligned_cols=123 Identities=17% Similarity=0.131 Sum_probs=59.0 Q ss_pred cceecHHHHHHHHHhhccc----ccc-----ccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVEN----DDV-----VAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPD 85 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~----~~~-----~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~ 85 (186) =+|+|++|..+.|...--. +.. .+....+++-.+++|-.|+..||++ .+.|- T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgY--L~~RY----------------- 61 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAH--LRGRY----------------- 61 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHH--Hhhhc----------------- Confidence 7899999998776432111 000 0112356677889999999999974 11110 Q ss_pred cchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccC---C-CCCCcccc-------eeEEecCeeEEeecCC Q lcl|NC_020854. 86 TYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKD---G-IGLSGLED-------YKNVKIGSIDVTPNQY 154 (186) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~---~-~~~~~~~~-------v~~~kvG~isveY~~~ 154 (186) .+|-..+|..|+..||-+|.+.|-.-- . .+..-.++ .+.+.-|.++..-... T Consensus 62 -----------------~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~ 124 (150) T protein:vir:10 62 -----------------NLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSG 124 (150) T ss_pred -----------------cCCcccccHHHHHHHHHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCC Confidence 134467899999999999988774311 1 11110011 1111124443322111 Q ss_pred CCcCcc-------cchHHHHHHHhhh Q lcl|NC_020854. 155 GATGAD-------RIPPMVERYLTGL 173 (186) Q Consensus 155 ~~~~~~-------~~~~~v~~lL~~l 173 (186) ...... +.-.+=..-|+|| T Consensus 125 ~~~~~~~~~~v~~~~r~f~r~~l~gf 150 (150) T protein:vir:10 125 PATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred CCCCCCceeeeecCCCccChhhccCC Confidence 100000 0001111223333 No 44 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=63.01 E-value=0.32 Score=23.26 Aligned_cols=108 Identities=20% Similarity=0.162 Sum_probs=60.8 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccc-c-ccCc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQA-L-QWPR 78 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~-l-aWPR 78 (186) |+++ |+++.++.. |...| ...+++-.+..+..|-++++ +|.|+|--.++. + +-| T Consensus 1 m~mi--------------tLeeiK~hl--Rid~D-----~~~eD~lL~~y~~AA~~~~e--~~~~rkLy~~~~~~~~~p- 56 (110) T protein:vir:57 1 MGMT--------------SLSNVKTQL--RLEED-----FTEHDDFIESLIDAAQRSIE--RTYYCVLVDSQEALEKLP- 56 (110) T ss_pred CCCC--------------CHHHHHHHc--CCCCC-----CChhHHHHHHHHHHHHHHHH--HHhCCcccCCccccccCC- Confidence 6554 899999877 33222 12244444444545556665 677777544322 1 122 Q ss_pred cCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcC Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATG 158 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~ 158 (186) . .++|. .++..|+.|+..|.-....| ||.|++. + T Consensus 57 ~---~~~gl--------------------~~~~di~~A~Lllv~hwYeN-------------REav~~~----------~ 90 (110) T protein:vir:57 57 E---GVRGF--------------------LIEPDTQLAARMMVAQWYLN-------------PKGTSPD----------G 90 (110) T ss_pred C---CCCcc--------------------ccCHHHHHHHHHHHHHHHhc-------------ccccccc----------c Confidence 1 11221 48889999999999998877 3444331 1 Q ss_pred cccchHHHHHHHhhhhccCC Q lcl|NC_020854. 159 ADRIPPMVERYLTGLRISGP 178 (186) Q Consensus 159 ~~~~~~~v~~lL~~l~~~~~ 178 (186) ....+-.++.||.|+..-.- T Consensus 91 ~~~~P~~v~~Ll~P~~~~~~ 110 (110) T protein:vir:57 91 DTPAQLGVEYLLFPLMEHTV 110 (110) T ss_pred ccchhHHHHHHHHHHHhhcC Confidence 11224557778888754322 No 45 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=61.54 E-value=0.35 Score=23.07 Aligned_cols=109 Identities=15% Similarity=0.126 Sum_probs=54.8 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcc Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAV 93 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~ 93 (186) -|+= ++-...-.. + -.+.+++..+.++-.|..-. |.+ T Consensus 1 m~tt--v~~vkl~a~-----~----L~~~sDDsl~~~I~dA~~e~-------------~a~------------------- 37 (111) T protein:vir:80 1 MKTD--VSKLKLTAS-----S----LASVSDDSLQVHIDDSYLEV-------------QEK------------------- 37 (111) T ss_pred Cchh--HHHHHHhhH-----h----hcCCChHHHHHHHHHHHHHh-------------hcC------------------- Confidence 1111 222222111 1 13466666666655543332 233 Q ss_pred ccccccCcccccCCcchHHH-HHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhh Q lcl|NC_020854. 94 GFPFRITTDYFTDTEIPQQI-KEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTG 172 (186) Q Consensus 94 ~~~~~~~~~~~~~d~IP~~V-k~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~ 172 (186) +-|+++ .+||--||++++.- .++.|++||||.++-+|+......--.+-+|=+.+++= T Consensus 38 --------------gFp~~~~e~a~rYLa~HLat~-------~~~~v~sE~V~~Lk~~Y~~~~~~~~l~~s~wGq~Y~rL 96 (111) T protein:vir:80 38 --------------GFPEKFEERANRYLAAHLATL-------ANKNVKSEAVGSLKREYYEVKGDSGLLSTEYGQEYARL 96 (111) T ss_pred --------------CCChhHHHHHHHHHHHHHHHh-------cCCCCchhhhhhHHHHhhhcccccccccchhHHHHHHH Confidence 333344 45777788887744 25689999999999999742221111223555444443 Q ss_pred hhc-cCCceeeeeeC Q lcl|NC_020854. 173 LRI-SGPGNIAVKRS 186 (186) Q Consensus 173 l~~-~~~g~~~~~r~ 186 (186) |=. +++++.+|.+- T Consensus 97 ~k~~~~gs~~~~vVv 111 (111) T protein:vir:80 97 LKEANGGSGISMVVV 111 (111) T ss_pred HHHhcCCccceeeeC Confidence 221 33345555555 No 46 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=60.70 E-value=0.37 Score=22.97 Aligned_cols=113 Identities=13% Similarity=0.071 Sum_probs=59.0 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccc-cccCccCcc-ccccchhhhccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQA-LQWPRTGVR-KPDTYINTYAVG 94 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~-laWPR~g~~-~~~~~~~~~~~~ 94 (186) -+|+++++++. |...|. ...+++-.+..+-.|.+++. +|.|++.-.+|. +.++..+.. .++ T Consensus 1 ivtLee~K~Hl--Rid~dd----~deDD~li~~~i~AA~~~v~--~~l~r~l~~~~~~~~~~~~~~~~~~~--------- 63 (115) T protein:vir:81 1 MITLAMVQRHL--QAELYE----DDERDYVMQQLLPAARESAE--LFINRKLYDTQADMLADQAAGVDPAG--------- 63 (115) T ss_pred CCCHHHHHHHc--CCCCCC----CccchHHHHHHHHHHHHHHH--HHhCCccccccccccccccccCCCCc--------- Confidence 89999999987 332110 11233344444444555543 455555433332 222222111 111 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhh Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLR 174 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~ 174 (186) .-.||..||+|+..|.-....+- |.+.. ++ ....+..++.||.+|. T Consensus 64 -----------~~~~p~~i~~AiLllvg~~Y~NR-------------E~v~~--------~~--~~elP~~~~~LL~pyR 109 (115) T protein:vir:81 64 -----------QLLITRTVEQAILLTLGEWYSSR-------------EQVWT--------KG--AGLVTSSAQNLLHPYR 109 (115) T ss_pred -----------ccccCHHHHHHHHHHHHHHHhcc-------------chhcc--------hh--hhhcCHHHHHHHHHHH Confidence 12489999999999999888762 22211 01 1123455789999985 Q ss_pred ccCCceee Q lcl|NC_020854. 175 ISGPGNIA 182 (186) Q Consensus 175 ~~~~g~~~ 182 (186) .--|- + T Consensus 110 ~~~g~--~ 115 (115) T protein:vir:81 110 KFAGV--R 115 (115) T ss_pred hhcCC--C Confidence 53321 1 No 47 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=55.32 E-value=0.48 Score=22.32 Aligned_cols=117 Identities=12% Similarity=0.065 Sum_probs=59.2 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcc Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAV 93 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~ 93 (186) -|==..|+. |..+ +-+....+++..+..+-.|..||..- T Consensus 1 m~d~~~ve~----Fr~l-----~PeF~~vpde~l~~~~~~A~~~i~~~-------------------------------- 39 (134) T protein:vir:79 1 MNDIEILEQ----IYKI-----APAFKKVDPELIQAWIELAKDFVCEK-------------------------------- 39 (134) T ss_pred CchHHHHHH----HHHh-----ccccccCCHHHHHHHHHHhhhhhcCC-------------------------------- Confidence 111111222 2222 12355678888888888888888521 Q ss_pred ccccccCcccccCCcchHHHHHHHHHHHHHHhccc-----CCCCC-CcccceeE-EecCeeEEeecCCCCcCccc---ch Q lcl|NC_020854. 94 GFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNK-----DGIGL-SGLEDYKN-VKIGSIDVTPNQYGATGADR---IP 163 (186) Q Consensus 94 ~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~-----~~~~~-~~~~~v~~-~kvG~isveY~~~~~~~~~~---~~ 163 (186) ...+....|...++++++.-. +.... ...++|.+ ...|+++|+|+.....+... .= T Consensus 40 --------------~~g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~grv~ssst~G~vSvS~a~ps~~~~~~Wl~~T 105 (134) T protein:vir:79 40 --------------HFKDKYFRAVALYTLHLMTLDGAMKQESESVESYSHRIASFSLTGEFSQTFSKVSDDTSGNTLRQT 105 (134) T ss_pred --------------CCChHHHHHHHHHHHHHHhhcccccccccccccccchhhhhhhhcceeeeccCcccchhHHHHhcC Confidence 122466777788888887432 11111 12335555 44899999997654433211 00 Q ss_pred HHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 164 PMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 164 ~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) |+= +++.-|.+-.+|.|.++-+ T Consensus 106 pYG-q~y~~L~k~~~GGf~~~t~ 127 (134) T protein:vir:79 106 PWG-KMYEVLNKKKGGGFGLTTA 127 (134) T ss_pred HHH-HHHHHHHHhhccchHhhhh Confidence 222 2444444444445554433 No 48 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:78 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:78 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:78 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 49 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:10 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:10 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:10 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 50 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:97 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:97 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:97 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 51 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:96 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:96 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:96 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 52 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:99 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:99 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:99 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 53 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:93 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:93 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:93 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 54 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=55.14 E-value=0.49 Score=22.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=72.2 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...++....-.. -. ....++..+..+-+|.+.|-.+ .| T Consensus 1 M~~L~~vK~~lg---I~------d~~~D~lL~~ii~~a~~~i~~~--l~------------------------------- 38 (110) T protein:vir:96 1 MTTLADVKKRIG---LK------DEKQDEQLEEIIKSCESQLLSM--LP------------------------------- 38 (110) T ss_pred CchHHHHHHHhC---CC------CCchhHHHHHHHHHHHHHHHHH--hc------------------------------- Confidence 444555554331 11 1234555666677777776411 00 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +-.+.||.++..-++++|+...+-- +..++++++++-.|++|.. .-..+...-++.++.+-... T Consensus 39 -------~~~~~iP~~l~~iv~ev~vkryNR~------g~EG~~S~S~eG~S~sf~d---~d~~~y~~~l~~y~~~~~~~ 102 (110) T protein:vir:96 39 -------IEVEQIPERFSYMIKEVAVKRYNRI------GAEGMTSEAVDGRSNAYEL---NDFKEYEAIIDNYFNARTRT 102 (110) T ss_pred -------cchhhhhhHHHHHHHHHHHHHhccc------CccccceeecCceeeeecc---cccchHHHHHHHHHhhcCCC Confidence 1136799999999999999988663 3457899999999999963 22344556778888776666 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|.+++- T Consensus 103 ~kG~v~Fl 110 (110) T protein:vir:96 103 KKGRAVFF 110 (110) T ss_pred CCceeeeC Confidence 77888887 No 55 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=49.90 E-value=0.63 Score=21.70 Aligned_cols=118 Identities=19% Similarity=0.190 Sum_probs=59.6 Q ss_pred cceecHHHHHHHHHhhccc----ccc-----ccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVEN----DDV-----VAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPD 85 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~----~~~-----~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~ 85 (186) =+|+|++|..+.|....-. +.. .+....+++-.+++|-.|+..||++ .+.| T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgy--L~~R------------------ 60 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAH--LRGR------------------ 60 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHH--Hhhh------------------ Confidence 7899999998876432111 000 0112356677899999999999974 1111 Q ss_pred cchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCC----CCCCccc----c---eeEEecCeeEEeecCC Q lcl|NC_020854. 86 TYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDG----IGLSGLE----D---YKNVKIGSIDVTPNQY 154 (186) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~----~~~~~~~----~---v~~~kvG~isveY~~~ 154 (186) -.+|-..+|..|+..||-+|.+.|-..-. .+..-.+ . .+.+.-|.++.--... T Consensus 61 ----------------Y~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~ 124 (150) T protein:vir:79 61 ----------------YNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSG 124 (150) T ss_pred ----------------ccCCcccccHHHHHHHHHHHHHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCc Confidence 01345679999999999999888743211 1111000 1 1111224444322110 Q ss_pred CCcCcccchHHHHHHHhhhhccCCceeeeeeC Q lcl|NC_020854. 155 GATGADRIPPMVERYLTGLRISGPGNIAVKRS 186 (186) Q Consensus 155 ~~~~~~~~~~~v~~lL~~l~~~~~g~~~~~r~ 186 (186) ... .+.+.+.|.-. T Consensus 125 ~~~------------------~~~~~~~v~~~ 138 (150) T protein:vir:79 125 PAT------------------PEPGEMKVRAR 138 (150) T ss_pred cCC------------------CCCCceeeecC Confidence 000 11122222211 No 56 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=47.57 E-value=0.7 Score=21.44 Aligned_cols=107 Identities=13% Similarity=0.207 Sum_probs=59.2 Q ss_pred ceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccc Q lcl|NC_020854. 16 SYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGF 95 (186) Q Consensus 16 SYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~ 95 (186) =++||++++.|.. ...| ...++.-.+..+-.|.+||. +|.|++...++.. +|... ++ T Consensus 1 M~vtLee~K~hLR--Id~D-----~~dDD~lI~~~i~AA~~~i~--~~~~r~l~~~~~~-~~~~~---~~---------- 57 (107) T protein:vir:44 1 MLLSVEEIKAQLR--LDED-----FEADERYLQLLARAVQKRTE--TYLNRKLYAPDET-IPDSD---PD---------- 57 (107) T ss_pred CCCCHHHHHHHcC--CCCC-----CchhHHHHHHHHHHHHHHHH--HhhcCcccccccc-ccccc---cc---------- Confidence 7899999999873 2111 11234445656667778886 4666665443321 22211 11 Q ss_pred ccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhc Q lcl|NC_020854. 96 PFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRI 175 (186) Q Consensus 96 ~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~ 175 (186) .-.+|..++.|++.|+-....+-.. +...++ ...+--++.||.+|.. T Consensus 58 ----------~~~~~~~~~~AiLllv~~~Y~NRe~-------------~~~~~~----------~~lP~~v~~Ll~~yR~ 104 (107) T protein:vir:44 58 ----------GLLLQDDIRLGMLMLISHFYENRSS-------------VTEVEK----------LDMPQSFGWLVGPYRY 104 (107) T ss_pred ----------cccchhhHHHHHHHHHHHHHhhhhh-------------hccccc----------cccCHHHHHHHHHhhh Confidence 1257999999999999988876321 100000 1123346777777643 Q ss_pred cCC Q lcl|NC_020854. 176 SGP 178 (186) Q Consensus 176 ~~~ 178 (186) =-. T Consensus 105 ~p~ 107 (107) T protein:vir:44 105 FPQ 107 (107) T ss_pred cCC Confidence 222 No 57 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=46.06 E-value=0.75 Score=21.28 Aligned_cols=118 Identities=18% Similarity=0.152 Sum_probs=64.5 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVG 94 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~ 94 (186) =+|+|++|..+.|.--. .+......+..|-.|++.|-+ .||.-..+ T Consensus 1 ~~~At~~Dv~ar~~r~L--------T~~E~~~ve~lL~dAs~~ir~--------------r~P~l~~~------------ 46 (123) T protein:vir:77 1 MPYATASDVTSRWARQP--------TDEETALINVRLADVERMIKR--------------RIPDLATK------------ 46 (123) T ss_pred CCcCCHHHHHHHhCCCC--------CHHHHHHHHHHHHHHHHHHHH--------------hccCcccc------------ Confidence 67899999887653110 112233356778899999863 34432211 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhh Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLR 174 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~ 174 (186) .++..+=..|+.=+|+.-.+++.+++ +++++..|.-+-......+.|.-..-+ .=++-|. T Consensus 47 ---------a~d~~~~~~~~~V~~~~V~R~~rnpe--------G~~s~T~G~ys~sl~~a~~~g~Lylt~---~E~~~Lg 106 (123) T protein:vir:77 47 ---------VTDPDYLEDLKQVEADAVLRLVRNPE--------GYLSETDGNYTYMLRSDLASGKLEIFP---EEWEILG 106 (123) T ss_pred ---------cCCcchhHHHHHHHHHHHHHHhhCCC--------CceecccchhhhhhcccCCCCcceeCH---HHHHhhc Confidence 12223336778888888888887764 345566676544422222334333222 2233333 Q ss_pred ccCCceeeeeeC Q lcl|NC_020854. 175 ISGPGNIAVKRS 186 (186) Q Consensus 175 ~~~~g~~~~~r~ 186 (186) .+..+.+.|+-. T Consensus 107 ~~~~~~~~i~p~ 118 (123) T protein:vir:77 107 YRRSRMTVIVPN 118 (123) T ss_pred CCCCceeEEeec Confidence 455667766666 No 58 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=45.59 E-value=0.33 Score=23.24 Aligned_cols=101 Identities=16% Similarity=0.136 Sum_probs=41.1 Q ss_pred cccCccCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHHH-HHhcccC---------------------- Q lcl|NC_020854. 74 LQWPRTGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLAV-YLNNNKD---------------------- 129 (186) Q Consensus 74 laWPR~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~-~~~~~~~---------------------- 129 (186) ++= |.|-+-+.-..|-.. +|+ ..+|+..++.-+..|- .++++.+ T Consensus 1 ~~~-------~~~~v~Fd~a~FR~~-----fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:36 1 MST-------PPYRITFDPAGFIAE-----YPEFATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCC-------CCceEEcChHHHHHh-----CcccccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHH Confidence 222 222222222222222 222 3367766666665552 2211100 Q ss_pred ------CCCC--CcccceeEEecCeeEEeecCCCCcCc-------ccch-HHHHHHHhhhh-----ccCC-ceeeee--- Q lcl|NC_020854. 130 ------GIGL--SGLEDYKNVKIGSIDVTPNQYGATGA-------DRIP-PMVERYLTGLR-----ISGP-GNIAVK--- 184 (186) Q Consensus 130 ------~~~~--~~~~~v~~~kvG~isveY~~~~~~~~-------~~~~-~~v~~lL~~l~-----~~~~-g~~~~~--- 184 (186) .... ..-+.|++.++|.|||.|+.+...+. .+.| .-.-+|++.+. .+|. ....++ T Consensus 69 ~L~~~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~Gg~pe~~~~r~~g 148 (158) T protein:vir:36 69 TLFSAAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhhhhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccccccccCCcccceeccC Confidence 0111 11268999999999999975433221 1111 11223444432 2222 122221 Q ss_pred eC Q lcl|NC_020854. 185 RS 186 (186) Q Consensus 185 r~ 186 (186) |. T Consensus 149 ~~ 150 (158) T protein:vir:36 149 QP 150 (158) T ss_pred ce Confidence 11 No 59 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=45.59 E-value=0.33 Score=23.24 Aligned_cols=101 Identities=16% Similarity=0.136 Sum_probs=41.1 Q ss_pred cccCccCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHHH-HHhcccC---------------------- Q lcl|NC_020854. 74 LQWPRTGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLAV-YLNNNKD---------------------- 129 (186) Q Consensus 74 laWPR~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~-~~~~~~~---------------------- 129 (186) ++= |.|-+-+.-..|-.. +|+ ..+|+..++.-+..|- .++++.+ T Consensus 1 ~~~-------~~~~v~Fd~a~FR~~-----fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:10 1 MST-------PPYRITFDPAGFIAE-----YPEFATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCC-------CCceEEcChHHHHHh-----CcccccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHH Confidence 222 222222222222222 222 3367766666665552 2211100 Q ss_pred ------CCCC--CcccceeEEecCeeEEeecCCCCcCc-------ccch-HHHHHHHhhhh-----ccCC-ceeeee--- Q lcl|NC_020854. 130 ------GIGL--SGLEDYKNVKIGSIDVTPNQYGATGA-------DRIP-PMVERYLTGLR-----ISGP-GNIAVK--- 184 (186) Q Consensus 130 ------~~~~--~~~~~v~~~kvG~isveY~~~~~~~~-------~~~~-~~v~~lL~~l~-----~~~~-g~~~~~--- 184 (186) .... ..-+.|++.++|.|||.|+.+...+. .+.| .-.-+|++.+. .+|. ....++ T Consensus 69 ~L~~~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~Gg~pe~~~~r~~g 148 (158) T protein:vir:10 69 TLFSAAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhhhhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccccccccCCcccceeccC Confidence 0111 11268999999999999975433221 1111 11223444432 2222 122221 Q ss_pred eC Q lcl|NC_020854. 185 RS 186 (186) Q Consensus 185 r~ 186 (186) |. T Consensus 149 ~~ 150 (158) T protein:vir:10 149 QP 150 (158) T ss_pred ce Confidence 11 No 60 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=43.16 E-value=0.86 Score=20.95 Aligned_cols=113 Identities=13% Similarity=0.135 Sum_probs=71.9 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) ...+++.+.-.. -+....++..+..|-+|.+.|-.+ T Consensus 1 M~~L~~vK~~lg---------i~d~~~D~lL~~iI~~a~~~i~~~----------------------------------- 36 (113) T protein:vir:94 1 MALLDSIKLRIG---------IEDTKQDDLLTDIISDVQARVLAY----------------------------------- 36 (113) T ss_pred CchHHHHHHHhC---------CCCCchhhHHHHHHHHHHHHHHHH----------------------------------- Confidence 344455544331 122344566777777888877411 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) ++...+-.+.||.++..-++|.|+...+-- +..++++.+++-.|++|... .-..+..+.++.++..-... T Consensus 37 --l~~~~~~~~~iP~~l~~Iv~evavkryNR~------g~EG~~S~SeeG~S~sf~~~--~df~~y~~~l~~~~~~~~~~ 106 (113) T protein:vir:94 37 --VNQDGLVQSELPNGLDFVIKDVTIRIYNKI------GDEGKESSSEGNVSNTWDTP--ADLSEYSDVLDVYRKSYKRR 106 (113) T ss_pred --hCCccchhhhhhhHHHHHHHHHHHHHhccc------CCccceeeecCceeeeecCc--cchhhHHHHHHHHHhhccCC Confidence 111112246899999999999999988663 45678999998899999631 22445566777777775555 Q ss_pred CCceeeee Q lcl|NC_020854. 177 GPGNIAVK 184 (186) Q Consensus 177 ~~g~~~~~ 184 (186) +.|. ++. T Consensus 107 ~~g~-rF~ 113 (113) T protein:vir:94 107 SAGM-RFI 113 (113) T ss_pred CCCc-eeC Confidence 5554 555 No 61 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=41.66 E-value=0.92 Score=20.79 Aligned_cols=108 Identities=20% Similarity=0.146 Sum_probs=59.1 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcc Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAV 93 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~ 93 (186) --.++|++|++++.. .. ...+++..+..+-.|.+++..+ .+.+.+. +...+ T Consensus 1 mm~~vtLeevK~hLR--Id-------~d~dD~li~~~i~aA~~~v~~~--l~~~~~~----------~~~~~-------- 51 (108) T protein:vir:93 1 MTALLTLEEIKAHLR--VD-------HDADDDMLMDKVRQATAVLLAY--IQGSRDK----------VIRED-------- 51 (108) T ss_pred CCcCCCHHHHHHHcC--CC-------CCcChHHHHHHHHHHHHHHHHH--hcccccc----------ccccc-------- Confidence 334579999999883 21 2346666777777777777532 2221111 11111 Q ss_pred ccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhh Q lcl|NC_020854. 94 GFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGL 173 (186) Q Consensus 94 ~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l 173 (186) .......+|..|+.|++.|.-....+-+ .++..+ ......|..+..||.+| T Consensus 52 --------~~~~~~~~~~~i~~AvLlLv~~~YenRe-------------~~~~~~--------~~~~elP~~v~~Ll~~~ 102 (108) T protein:vir:93 52 --------GELIPGEALTRMKGAAMRLTGMLYRNPD-------------LAEREE--------LLQGELPFSVSVLIYDL 102 (108) T ss_pred --------cccccccCChHHHHHHHHHHHHHHhccc-------------cccccc--------cccccCCHHHHHHHHHc Confidence 1123456789999999999999887732 221111 01112344567777776 Q ss_pred hccCCc Q lcl|NC_020854. 174 RISGPG 179 (186) Q Consensus 174 ~~~~~g 179 (186) .+.--- T Consensus 103 R~p~~~ 108 (108) T protein:vir:93 103 RCPTVL 108 (108) T ss_pred cccccC Confidence 443321 No 62 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=41.04 E-value=0.94 Score=20.72 Aligned_cols=105 Identities=16% Similarity=0.170 Sum_probs=62.6 Q ss_pred eecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFP 96 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~ 96 (186) +-..++.-.-...+.-.. ...+++..+-.|-.|...+-. T Consensus 1 ~~~~~~~~e~ik~L~~~~-----d~~~DelL~~lieda~~~vl~------------------------------------ 39 (105) T protein:vir:10 1 MLNVDQLTEIVSALSTRL-----ENVNNALLTELVKESIAQVLD------------------------------------ 39 (105) T ss_pred CCchHHHHHHHHHHhccC-----CCchhHHHHHHHHHHHHHHHH------------------------------------ Confidence 444444433333332110 122333333333333333310 Q ss_pred cccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhcc Q lcl|NC_020854. 97 FRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRIS 176 (186) Q Consensus 97 ~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~ 176 (186) +...++||..+...++++|+...+-. +..+.++.+.|-+|.+|..+ .|.-+...|+.|..+ T Consensus 40 ------y~nr~~ip~~l~~~v~evav~~fNR~------G~EG~tS~SegGvS~sy~~~-------~~~~~~~~l~~yR~~ 100 (105) T protein:vir:10 40 ------YTGQKKLVGSMDIYVKKLAVINYNRL------GIEGETQRSEGGITNYLETG-------IPKDIRQGLNSYRIA 100 (105) T ss_pred ------HcCCcccchhHHHHHHHHHHHHhccc------CCcccceeecCCeeeeeecc-------CcHHHHHHHHHHhhh Confidence 12246899999999999999988663 23567899999999999642 234466777888887 Q ss_pred CCcee Q lcl|NC_020854. 177 GPGNI 181 (186) Q Consensus 177 ~~g~~ 181 (186) .-+.| T Consensus 101 ~v~~~ 105 (105) T protein:vir:10 101 KVKKL 105 (105) T ss_pred cccCC Confidence 77777 No 63 >protein:vir:2738 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695111;genbank:gi:23455880;genbank:GeneID:955641 Probab=39.25 E-value=1 Score=20.52 Aligned_cols=111 Identities=10% Similarity=0.042 Sum_probs=68.9 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHHhhccccccccccC-CCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCcc Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIIDGLVENDDVVAWAT-ATADQKNRALYTATQRLDRERYLGARATDTQALQWPRT 79 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~~r~~~~~~~~w~~-~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~ 79 (186) |+|.- .-.++.... -.+ .+++..+-.+-+|.++|-. T Consensus 1 ~~l~~-----------~~~L~~iK~-------------~lg~~dD~lL~~ii~~a~~~i~~------------------- 37 (112) T protein:vir:27 1 MTLDK-----------DKVIKNVSV-------------DLNTNDDALLKILLERVVNHFKS------------------- 37 (112) T ss_pred Ccchh-----------HHHHHHHHh-------------hcCCChhHHHHHHHHHHHHHHHH------------------- Confidence 54431 111222221 112 2344555556677777741 Q ss_pred CccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCc Q lcl|NC_020854. 80 GVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGA 159 (186) Q Consensus 80 g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~ 159 (186) ++..+.||.++..-++|.|+...+-- +..++++.+++-+|++|..+.+ -. T Consensus 38 -----------------------~l~~~~iP~~l~~Iv~evavkryNR~------g~EG~~S~SeeG~S~sf~d~~~-df 87 (112) T protein:vir:27 38 -----------------------EYGVEEIDDKLAFIFEDCVIKRFNRR------GAEGAKSESVDGHSMSYYDNEN-EF 87 (112) T ss_pred -----------------------hcCccccchhHHHHHHHHHHHHhccc------CccccceeecCceeeeeccccc-ch Confidence 12246899999999999999988663 2347889999889999964322 23 Q ss_pred ccchHHHHHHHhhhhccCCceeeee Q lcl|NC_020854. 160 DRIPPMVERYLTGLRISGPGNIAVK 184 (186) Q Consensus 160 ~~~~~~v~~lL~~l~~~~~g~~~~~ 184 (186) .+..+.++.++......+.|.+++. T Consensus 88 ~~Y~~~l~~~~~~~~~~~~G~v~Fl 112 (112) T protein:vir:27 88 KPYDDMLQRLYGTSGQAKEGEVLFL 112 (112) T ss_pred hhhHHHHHHHHhhcCCCCCceeeeC Confidence 3444567777766666666777777 No 64 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=37.10 E-value=1.1 Score=20.28 Aligned_cols=120 Identities=13% Similarity=0.093 Sum_probs=64.7 Q ss_pred cceecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhccc Q lcl|NC_020854. 15 NSYLTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVG 94 (186) Q Consensus 15 nSYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~ 94 (186) -+|++.+|..+.|.--. .+......+..|-+|...|- +.||--..+ T Consensus 1 ma~A~~~Dv~~~w~r~l--------T~~E~~~v~~~L~~Ae~~Ir--------------~riP~L~~r------------ 46 (125) T protein:vir:10 1 MAYANAQDVVTLWAKEP--------EPEVMELIERRLAQVERMIK--------------RRIPNLDLK------------ 46 (125) T ss_pred CCcCCHHHHHHHhCCCC--------CHHHHHHHHHHHHHHHHHHH--------------HhCCChhhh------------ Confidence 78999999988774211 22344456677888888885 334422111 Q ss_pred cccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhh Q lcl|NC_020854. 95 FPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLR 174 (186) Q Consensus 95 ~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~ 174 (186) +.. ....+..|++=+.+.-.++..+++. .+++.-|+-+-......++|.-...+-=-.+|++ T Consensus 47 ----~~a----~~~~~~~v~~Vea~aV~Rv~rNPeG--------y~s~T~G~Ys~~l~~~~~~g~L~it~~Ew~~Lg~-- 108 (125) T protein:vir:10 47 ----VAA----DATFQADLIDIEADAVLRLVRNPEG--------YISETDGAYTYQLQTDLSQGRLTILDDEWTTLGV-- 108 (125) T ss_pred ----hhc----CCCccccHHHHHHHHHHHHhcCCCc--------ccccccchhHHhhhcccccCceeeCHHHHHhhcc-- Confidence 111 1223344555555555556666543 3445557765544433344544433333356666 Q ss_pred ccCCceeeeeeC Q lcl|NC_020854. 175 ISGPGNIAVKRS 186 (186) Q Consensus 175 ~~~~g~~~~~r~ 186 (186) ..+.|.|.|+=. T Consensus 109 ~r~s~~~~i~p~ 120 (125) T protein:vir:10 109 NRLSRMSVIAPN 120 (125) T ss_pred ccccceeeeecc Confidence 334677777666 No 65 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=35.90 E-value=1.2 Score=20.14 Aligned_cols=98 Identities=14% Similarity=0.092 Sum_probs=40.5 Q ss_pred cCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHHHHHhcccCC--------------------------- Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLAVYLNNNKDG--------------------------- 130 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~~~~~~~~~--------------------------- 130 (186) .=+.++.. ....+| ++.+ +.+|+..++...++|-..+++... T Consensus 1 ~v~fd~~~----FR~~fP------eFad~~~~Pd~~i~~~l~~A~~~l~~~~~~s~~~~g~~~~~~l~Ll~AH~l~L~~~ 70 (155) T protein:vir:96 1 MVIFDEQK----FRTLFP------EFADPASYPAVRLQLYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTM 70 (155) T ss_pred CcccCHHH----HHHhCc------cccCcccCCHHHHHHHHHHHHHhhcCCCccccccChHHHHHHHHHHHHHHHHHHHH Confidence 11111110 011111 1112 356777766666666555532110 Q ss_pred -----------CCCCcccceeEEecCeeEEeecCCCCcCc------ccchH-HHHHHHhhhhcc-----CCceeeeeeC Q lcl|NC_020854. 131 -----------IGLSGLEDYKNVKIGSIDVTPNQYGATGA------DRIPP-MVERYLTGLRIS-----GPGNIAVKRS 186 (186) Q Consensus 131 -----------~~~~~~~~v~~~kvG~isveY~~~~~~~~------~~~~~-~v~~lL~~l~~~-----~~g~~~~~r~ 186 (186) ......+.|+++++|.|||.|+.+..... .+.|- -.-+|++.+..+ |...=+=.|. T Consensus 71 ~~~gaa~~g~~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~~~~~~Gg~~vgG~per~~~r~ 149 (155) T protein:vir:96 71 QVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRK 149 (155) T ss_pred hhhhccccccccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHHHHHHhcccccccCCCCccccccc Confidence 01112356899999999999986433211 11111 122344443222 2222122222 No 66 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=29.60 E-value=1.2 Score=20.22 Aligned_cols=101 Identities=14% Similarity=0.080 Sum_probs=41.1 Q ss_pred cccCccCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHHHH-Hhcc------------------------ Q lcl|NC_020854. 74 LQWPRTGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLAVY-LNNN------------------------ 127 (186) Q Consensus 74 laWPR~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~~-~~~~------------------------ 127 (186) ++=| .|-+-+.-..|-.. +|+ ..+|+..++.-+++|-. ++++ T Consensus 1 ~~~~-------~~~v~Fd~a~FR~~-----fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:10 1 MSTP-------PYRITFDPAGFIAE-----YPEFATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCCC-------CceEEcChHHHHHh-----chhhccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHH Confidence 2222 22222222222111 222 23677766665555522 1111 Q ss_pred ----cCC--CCCCcccceeEEecCeeEEeecCCCCcCcc-------cch-HHHHHHHhhh-----hccCC-ceeeee--- Q lcl|NC_020854. 128 ----KDG--IGLSGLEDYKNVKIGSIDVTPNQYGATGAD-------RIP-PMVERYLTGL-----RISGP-GNIAVK--- 184 (186) Q Consensus 128 ----~~~--~~~~~~~~v~~~kvG~isveY~~~~~~~~~-------~~~-~~v~~lL~~l-----~~~~~-g~~~~~--- 184 (186) ... ......+.|++.++|.|||.|+.+...+.. +.| .-.-+|++.+ +.+|. ....+| T Consensus 69 ~L~~~~~~~a~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~gg~pe~~~~r~~g 148 (158) T protein:vir:10 69 TLFGATPTSANSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhHhhhccccCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccccccccCCcccceeecC Confidence 110 111224579999999999999764433211 111 1122344443 22222 222221 Q ss_pred eC Q lcl|NC_020854. 185 RS 186 (186) Q Consensus 185 r~ 186 (186) |. T Consensus 149 ~~ 150 (158) T protein:vir:10 149 QP 150 (158) T ss_pred cc Confidence 11 No 67 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=29.60 E-value=1.2 Score=20.22 Aligned_cols=101 Identities=14% Similarity=0.080 Sum_probs=41.1 Q ss_pred cccCccCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHHHH-Hhcc------------------------ Q lcl|NC_020854. 74 LQWPRTGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLAVY-LNNN------------------------ 127 (186) Q Consensus 74 laWPR~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA~~-~~~~------------------------ 127 (186) ++=| .|-+-+.-..|-.. +|+ ..+|+..++.-+++|-. ++++ T Consensus 1 ~~~~-------~~~v~Fd~a~FR~~-----fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:78 1 MSTP-------PYRITFDPAGFIAE-----YPEFATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCCC-------CceEEcChHHHHHh-----chhhccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHH Confidence 2222 22222222222111 222 23677766665555522 1111 Q ss_pred ----cCC--CCCCcccceeEEecCeeEEeecCCCCcCcc-------cch-HHHHHHHhhh-----hccCC-ceeeee--- Q lcl|NC_020854. 128 ----KDG--IGLSGLEDYKNVKIGSIDVTPNQYGATGAD-------RIP-PMVERYLTGL-----RISGP-GNIAVK--- 184 (186) Q Consensus 128 ----~~~--~~~~~~~~v~~~kvG~isveY~~~~~~~~~-------~~~-~~v~~lL~~l-----~~~~~-g~~~~~--- 184 (186) ... ......+.|++.++|.|||.|+.+...+.. +.| .-.-+|++.+ +.+|. ....+| T Consensus 69 ~L~~~~~~~a~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~gg~pe~~~~r~~g 148 (158) T protein:vir:78 69 TLFGATPTSANSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhHhhhccccCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccccccccCCcccceeecC Confidence 110 111224579999999999999764433211 111 1122344443 22222 222221 Q ss_pred eC Q lcl|NC_020854. 185 RS 186 (186) Q Consensus 185 r~ 186 (186) |. T Consensus 149 ~~ 150 (158) T protein:vir:78 149 QP 150 (158) T ss_pred cc Confidence 11 No 68 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=29.58 E-value=1.6 Score=19.40 Aligned_cols=111 Identities=16% Similarity=0.149 Sum_probs=57.7 Q ss_pred ccceecHHHHHHHHHhhccccccccccCCCHH----HHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchh Q lcl|NC_020854. 14 ANSYLTLSDAQDIIDGLVENDDVVAWATATAD----QKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYIN 89 (186) Q Consensus 14 AnSYvsla~AdaY~~~r~~~~~~~~w~~~~~~----~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~ 89 (186) -..|+|++|..+-|. +...+++ ..+++|-.|++.|... ||.-+. T Consensus 1 M~~fAtv~Dl~~rw~----------~~~~dee~~ra~~~~lL~dAS~~ir~~--------------~p~~~~-------- 48 (136) T protein:vir:98 1 MAAYATVEDYQARAA----------VTLPDGSPRRAQVEAYLDDASALMARH--------------IPTGHT-------- 48 (136) T ss_pred CCccCCHHHHHHHhc----------cCCCCchhHHHHHHHHHHHHHHHHHHh--------------CCCCCC-------- Confidence 788999999987653 1222332 2355688999999743 443211 Q ss_pred hhccccccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHH Q lcl|NC_020854. 90 TYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERY 169 (186) Q Consensus 90 ~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~l 169 (186) .-|.-|+.=+|....+.+.+++. .++++.|.-+-.+... |...+-+-=..+ T Consensus 49 ------------------~~~~~~~~V~~~~V~R~~~np~G--------~~s~TaG~ys~s~t~~---G~Lylt~~E~~~ 99 (136) T protein:vir:98 49 ------------------PDPGTLRAICVAVVRRVMANPGG--------YRQRTIGQYAETLGED---GGLYLTEDEKGQ 99 (136) T ss_pred ------------------CChhHHHHHHHHHHHHHhhCCCC--------cccccchhHHHhhhcC---CCcccChHHHHH Confidence 12455666677777777766542 3456677644443321 221111111123 Q ss_pred Hhhhhc----cCCceeeeeeC Q lcl|NC_020854. 170 LTGLRI----SGPGNIAVKRS 186 (186) Q Consensus 170 L~~l~~----~~~g~~~~~r~ 186 (186) |+. -+ +..+.|.|-++ T Consensus 100 Lg~-~rqr~~~~d~a~si~~~ 119 (136) T protein:vir:98 100 LQP-PDQTAPDADAAYSLDLD 119 (136) T ss_pred hCC-CCCcccccccceecccC Confidence 322 11 11245666665 No 69 >protein:vir:78068 Length: 178 # NCBI annotation: gp7 # Family: family:all:6563 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468791;genbank:gi:157325372;genbank:GeneID:5601830 Probab=29.22 E-value=1.7 Score=19.35 Aligned_cols=145 Identities=13% Similarity=0.137 Sum_probs=70.1 Q ss_pred ceecHHHHHHHHHhhccccccccccCCCHHHHHHHH---HHH---HHHHh------hhccCcccCCcccccccCccCccc Q lcl|NC_020854. 16 SYLTLSDAQDIIDGLVENDDVVAWATATADQKNRAL---YTA---TQRLD------RERYLGARATDTQALQWPRTGVRK 83 (186) Q Consensus 16 SYvsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL---~~A---td~id------~~~~~G~r~~~~Q~laWPR~g~~~ 83 (186) -|.++++++.-. +..+.++--+| ||+ ..|++ ++.|++-.+-..--..+=|.|-+. T Consensus 1 m~l~l~~~~~~~-------------p~~tQ~dLdaLE~~IR~~TnN~F~~~~~~~k~l~~~d~~ti~~d~~~~~rVGDTI 67 (178) T protein:vir:78 1 MFLDLTKVQQKL-------------PNVTQDDLDVLEKQIRAYTQNHFLVPQSYLKGLQWSDGSTLALDSVRFLQVGDTI 67 (178) T ss_pred CccchHHHHhhC-------------CCCCHhHHHHHHHHHHHhhcccccchhhhheeeeecccceEEecCcccccccceE Confidence 555666665422 11111122221 121 12233 334555432111111122333221 Q ss_pred -------cccch---------hhhcccccc----ccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEe Q lcl|NC_020854. 84 -------PDTYI---------NTYAVGFPF----RITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVK 143 (186) Q Consensus 84 -------~~~~~---------~~~~~~~~~----~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~k 143 (186) .||.. ..+.++.++ .+++..+.--.-|..||-.+++|--.=.+ -.+.-+||+|. T Consensus 68 eI~nS~~NDgl~~ie~ie~~~~~~~v~~~vrd~~n~~~~~~~kV~YP~Di~~Gv~~Lle~D~~------~~~k~GVKqET 141 (178) T protein:vir:78 68 ELWDTGINDGIYLIESIEASNNTVQLNKSVRDTDENPDGYFGLVMYPEDLLGGAYSLIEYDQQ------GRQEYGVKQET 141 (178) T ss_pred EecCcccccceeEEEEeecccceeeeccccccccccccceeeeeechhhHHHHHHHHHHHHhc------cCcccceecee Confidence 23322 222223333 23344555566799999999987543221 13345799999 Q ss_pred cCeeEEeecCCCCcCcc-cchHHHHHHHhhhhccCCceeeee Q lcl|NC_020854. 144 IGSIDVTPNQYGATGAD-RIPPMVERYLTGLRISGPGNIAVK 184 (186) Q Consensus 144 vG~isveY~~~~~~~~~-~~~~~v~~lL~~l~~~~~g~~~~~ 184 (186) |--+|++|..-...-+. ..|.++-..|.++-...- + T Consensus 142 iaR~S~TYfD~~~~es~~GYPa~l~~FL~~Ykk~~w-----~ 178 (178) T protein:vir:78 142 VSRVSKTYYDTTEAESRFGYPAFMTEFLEPYMQGSW-----E 178 (178) T ss_pred eeeeeeEEeecCcccccccchHHHHHHHHHhhcCCC-----C Confidence 99999999654333232 334557788888865322 2 No 70 >protein:vir:3034 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438147;genbank:gi:16271810;genbank:GeneID:929268 Probab=28.45 E-value=1.1 Score=20.41 Aligned_cols=99 Identities=15% Similarity=0.186 Sum_probs=49.7 Q ss_pred HHHHHHHHHhhh--ccCcccCCcccccccCccCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcc Q lcl|NC_020854. 50 ALYTATQRLDRE--RYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNN 127 (186) Q Consensus 50 aL~~Atd~id~~--~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~ 127 (186) ++-+|..-||.+ .|.- +-+-+--..| |. ..+|.|.|.--.++-.. T Consensus 1 L~k~A~~~Id~~t~~fY~-~~dle~D~~~-R~-------------------------------~~fK~Aia~QI~Yld~~ 47 (111) T protein:vir:30 1 MEKRASHAVNLYCRNRYD-YKDLKKEIAL-VQ-------------------------------KAVKRAIAYQIAYLNDS 47 (111) T ss_pred CchhhHHHHhHhhchhhh-hhhHHHHHHH-HH-------------------------------HHHHHHHHHHHHHHHhc Confidence 777888889864 2321 1111111112 21 46777766554444333 Q ss_pred cCCCCCCcccceeEEecCeeEEeecCCCCcCcc-----cchHHHHH---HHh--hhhccCCceeeeee Q lcl|NC_020854. 128 KDGIGLSGLEDYKNVKIGSIDVTPNQYGATGAD-----RIPPMVER---YLT--GLRISGPGNIAVKR 185 (186) Q Consensus 128 ~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~-----~~~~~v~~---lL~--~l~~~~~g~~~~~r 185 (186) +..+..+.+.+++.+||-.++.|+.....++. ..|-.... +|. ||+-.| +.--| T Consensus 48 -G~~t~~d~~s~~SisvGrTsiS~~~~~~~~~~~~~t~~~~~l~~da~n~L~~~Glly~G---V~yd~ 111 (111) T protein:vir:30 48 -GVMTAEDKQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLDAENELLVVGLGYTG---ISYDR 111 (111) T ss_pred -CCCChhhccCcceeeecceeeeccCccCCCCccccccccccchHHHHHHHHhhcccccc---ccccC Confidence 34444447889999999999998532221111 12223333 443 344333 34444 No 71 >protein:vir:96488 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238494;genbank:gi:66391770;genbank:GeneID:5176910 Probab=28.12 E-value=1.8 Score=19.21 Aligned_cols=112 Identities=12% Similarity=0.049 Sum_probs=70.1 Q ss_pred eecHHHHHHHHHhhccccccccccCC-CHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccc Q lcl|NC_020854. 17 YLTLSDAQDIIDGLVENDDVVAWATA-TADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGF 95 (186) Q Consensus 17 Yvsla~AdaY~~~r~~~~~~~~w~~~-~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~ 95 (186) ..++++..--... ..+... +++..+..+-+|.++|-. T Consensus 1 M~~L~~~K~l~~i-------k~~~~~~~D~lL~~ii~~a~~~i~~----------------------------------- 38 (113) T protein:vir:96 1 MMALDKDKVIKNV-------SVDLNTDDDVLLKILLERVVNHFKS----------------------------------- 38 (113) T ss_pred CchhHHHHHHhcC-------CCCCCCchhHHHHHHHHHHHHHHHH----------------------------------- Confidence 3344444433321 123332 333455666677777741 Q ss_pred ccccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhc Q lcl|NC_020854. 96 PFRITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRI 175 (186) Q Consensus 96 ~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~ 175 (186) ++-.+.||.++..-..|.|+...+-- +..++++++++-.|++|..+.+ -..+....++.++..-.. T Consensus 39 -------~l~~~~iP~~L~~Iv~evavkryNR~------g~EG~~S~S~eG~S~sf~d~~~-df~eY~~~l~~~~~~~~~ 104 (113) T protein:vir:96 39 -------EYGVEEIDDKLAFIFEDCVIKRFNRR------GAEGAKSESVDGHSMSYYDNEN-EFKPYDDMLQRLYGTSGQ 104 (113) T ss_pred -------HhcccccchhHHHHHHHHHHHHhcCC------CccccceeccCceeeeeccccc-ccchhHHHHHHHHhhcCC Confidence 11246899999999999999988663 4567899999889999964322 233334566667666555 Q ss_pred cCCceeeee Q lcl|NC_020854. 176 SGPGNIAVK 184 (186) Q Consensus 176 ~~~g~~~~~ 184 (186) .+.|.+++- T Consensus 105 ~~~G~v~Fl 113 (113) T protein:vir:96 105 SKEGEVLFL 113 (113) T ss_pred CCCceeeeC Confidence 566777777 No 72 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=27.35 E-value=1.6 Score=19.45 Aligned_cols=121 Identities=17% Similarity=0.163 Sum_probs=55.6 Q ss_pred CeeEeecCCCCCcccceecHHHHHHHHH-------------------------hhcccccccccc-------CCCHHHHH Q lcl|NC_020854. 1 MAITIVATAGAADANSYLTLSDAQDIID-------------------------GLVENDDVVAWA-------TATADQKN 48 (186) Q Consensus 1 Mal~v~~~~g~~~AnSYvsla~AdaY~~-------------------------~r~~~~~~~~w~-------~~~~~~ke 48 (186) |+ -|+|++|.-+-|. .....+ .|. ..+++-.+ T Consensus 1 ~~-------------mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~---~~~~~~~~~g~~d~~~i~ 64 (172) T protein:vir:99 1 MA-------------VYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVD---RWTPEEIEVGHATVEVIN 64 (172) T ss_pred Cc-------------ccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhh---hcccccccccccCHHHHH Confidence 33 3555555433321 111111 122 35677899 Q ss_pred HHHHHHHHHHhhh-ccCcccCCcccccccCccCccccccchhhhccccccccCcccccCCcchHHHHHHHHHHHHHHhcc Q lcl|NC_020854. 49 RALYTATQRLDRE-RYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPFRITTDYFTDTEIPQQIKEAQATLAVYLNNN 127 (186) Q Consensus 49 ~aL~~Atd~id~~-~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~ 127 (186) .+|..|+..||++ .-++ . .+|-..+|.-|+..||-+|.+.+-. T Consensus 65 ~Al~dA~aeIDgYL~~R~---------------Y---------------------~lPL~~vP~~L~~~a~dIArY~L~~ 108 (172) T protein:vir:99 65 SAVSDAQGYIDGFLQRRG---------------Y---------------------SLPLAKRYGVVTGWTRAIARYLLHQ 108 (172) T ss_pred HHHHHHHHHHHHHHhccc---------------c---------------------cCCCcccchHHHHHHHHHHHHHHHh Confidence 9999999999974 2210 0 2455789999999999999988854 Q ss_pred cC----CCCCCcccc-------eeEEecCeeEEeecCCCCcCc-------ccchHHHHHHHhhh Q lcl|NC_020854. 128 KD----GIGLSGLED-------YKNVKIGSIDVTPNQYGATGA-------DRIPPMVERYLTGL 173 (186) Q Consensus 128 ~~----~~~~~~~~~-------v~~~kvG~isveY~~~~~~~~-------~~~~~~v~~lL~~l 173 (186) .- ..+..-.++ .+.+.-|.++.--....+..+ .+.-.+=..-|+|| T Consensus 109 ~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~~r~F~rd~L~gf 172 (172) T protein:vir:99 109 DRLGPGAEKDPIVRDYRDALKFLQLIAEGKFSLGPDDPLTPPGGGVPQVLAPARTFSHDTLKDY 172 (172) T ss_pred ccCCcccCCHHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCceeeecCCCccChhhccCC Confidence 21 111110011 112222444432111101000 00001112233444 No 73 >protein:vir:107119 Length: 104 # NCBI annotation: conserved phage protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950608;genbank:gi:119953688;genbank:GeneID:4643128 Probab=25.50 E-value=0.7 Score=21.42 Aligned_cols=104 Identities=13% Similarity=0.162 Sum_probs=57.5 Q ss_pred ecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccccc Q lcl|NC_020854. 18 LTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPF 97 (186) Q Consensus 18 vsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~ 97 (186) .+..+-..-...... ....+++.+ .||.. |++ |-+ T Consensus 1 Md~~dVK~l~~~~~~-------d~~~D~~~~-~li~~--y~e----------------~ae------------------- 35 (104) T protein:vir:10 1 MNAQDVKLLNNLSLD-------DTSNDETIE-LLIEK--YLN----------------VAE------------------- 35 (104) T ss_pred CCHHHHHHHhCCCCC-------CcccHHHHH-HHHHH--HHH----------------HHH------------------- Confidence 333333322211110 012222211 12211 221 111 Q ss_pred ccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccC Q lcl|NC_020854. 98 RITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISG 177 (186) Q Consensus 98 ~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~ 177 (186) .+.-..+-+..+|..|+.+..+++=. . ..+.+++.+.|.+|.+|.+ ..|..+...|.||.+-. T Consensus 36 dyCn~~F~~~~lP~gV~~fvA~~iky---~-------~~~NissRSMGtVSyTy~t-------~iP~~i~~~L~PYRklr 98 (104) T protein:vir:10 36 EYCNQTFNRKSLPSNVEKFIANCIKQ---G-------TTSNISSRTMGTVSYTFVT-------DLPKETYGYLKPFRRLR 98 (104) T ss_pred HhcCCCCCCCCCCccHHHHHHHHHhh---c-------CCCCcccccccceeecccc-------hhHHHHHHhhhhhhhhc Confidence 11222334558999999998887653 1 1247788999999999964 35778999999998877 Q ss_pred Cceeee Q lcl|NC_020854. 178 PGNIAV 183 (186) Q Consensus 178 ~g~~~~ 183 (186) .+.+-| T Consensus 99 ~~~~~~ 104 (104) T protein:vir:10 99 WTGYHV 104 (104) T ss_pred cccccC Confidence 777777 No 74 >protein:vir:105327 Length: 104 # NCBI annotation: putative head morphogenesis protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950671;genbank:gi:119967841;genbank:GeneID:4643206 Probab=25.50 E-value=0.7 Score=21.42 Aligned_cols=104 Identities=13% Similarity=0.162 Sum_probs=57.5 Q ss_pred ecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccccc Q lcl|NC_020854. 18 LTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPF 97 (186) Q Consensus 18 vsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~ 97 (186) .+..+-..-...... ....+++.+ .||.. |++ |-+ T Consensus 1 Md~~dVK~l~~~~~~-------d~~~D~~~~-~li~~--y~e----------------~ae------------------- 35 (104) T protein:vir:10 1 MNAQDVKLLNNLSLD-------DTSNDETIE-LLIEK--YLN----------------VAE------------------- 35 (104) T ss_pred CCHHHHHHHhCCCCC-------CcccHHHHH-HHHHH--HHH----------------HHH------------------- Confidence 333333322211110 012222211 12211 221 111 Q ss_pred ccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccC Q lcl|NC_020854. 98 RITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISG 177 (186) Q Consensus 98 ~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~ 177 (186) .+.-..+-+..+|..|+.+..+++=. . ..+.+++.+.|.+|.+|.+ ..|..+...|.||.+-. T Consensus 36 dyCn~~F~~~~lP~gV~~fvA~~iky---~-------~~~NissRSMGtVSyTy~t-------~iP~~i~~~L~PYRklr 98 (104) T protein:vir:10 36 EYCNQTFNRKSLPSNVEKFIANCIKQ---G-------TTSNISSRTMGTVSYTFVT-------DLPKETYGYLKPFRRLR 98 (104) T ss_pred HhcCCCCCCCCCCccHHHHHHHHHhh---c-------CCCCcccccccceeecccc-------hhHHHHHHhhhhhhhhc Confidence 11222334558999999998887653 1 1247788999999999964 35778999999998877 Q ss_pred Cceeee Q lcl|NC_020854. 178 PGNIAV 183 (186) Q Consensus 178 ~g~~~~ 183 (186) .+.+-| T Consensus 99 ~~~~~~ 104 (104) T protein:vir:10 99 WTGYHV 104 (104) T ss_pred cccccC Confidence 777777 No 75 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=20.89 E-value=2.7 Score=18.23 Aligned_cols=97 Identities=15% Similarity=0.143 Sum_probs=36.3 Q ss_pred cCccccccchhhhccccccccCcccccC-CcchHHHHHHHHHHH---------------------------HHHhc--c- Q lcl|NC_020854. 79 TGVRKPDTYINTYAVGFPFRITTDYFTD-TEIPQQIKEAQATLA---------------------------VYLNN--N- 127 (186) Q Consensus 79 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~-d~IP~~Vk~A~~eLA---------------------------~~~~~--~- 127 (186) .++..=| -..|- ..+|+ ..+|+..++.-++.| .+++. . T Consensus 1 M~~~~Fd------~~~FR-----~~fPeFa~~Pd~~i~~~l~~A~~~~l~~~~~s~~~~~~~~~~~l~LltAHll~L~~~ 69 (167) T protein:vir:94 1 MAVVVFD------PTAFK-----LVYPEFVAVPDARLTALFNTVGYTILDNTDASVIVDPLRRAPLLDLLVAHMLALFGY 69 (167) T ss_pred CCcccCC------hHHHH-----HhchhcccCCHHHHHHHHHHHHHhhcCCCCcccccchhhHHHHHHHHHHHHHHHhhh Confidence 2211100 00000 11222 235666554444333 12210 0 Q ss_pred ---cCC--CCCCcccceeEEecCeeEEeecCCCCcCcc------cch-HHHHHHHhhhh-----ccCCceeeeeeC Q lcl|NC_020854. 128 ---KDG--IGLSGLEDYKNVKIGSIDVTPNQYGATGAD------RIP-PMVERYLTGLR-----ISGPGNIAVKRS 186 (186) Q Consensus 128 ---~~~--~~~~~~~~v~~~kvG~isveY~~~~~~~~~------~~~-~~v~~lL~~l~-----~~~~g~~~~~r~ 186 (186) ... ......+.|+++++|.|||.|+........ +.| .-.-+|++.+. .+|..-..-+|+ T Consensus 70 ~~a~~~~~~~~g~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fwaL~~~~g~Gg~v~gG~~~~~~~~~ 145 (167) T protein:vir:94 70 VNADGSITPGTGTVGRVANASEGSVSTSLAYSTPTGAGEAWFTQTPYGAMYWAMSAPFRSFHYVAAGLSGVGYSQD 145 (167) T ss_pred hhhhcccccccccchheeeccccceeeeeecCCCCCchhhhhhcCHHHHHHHHHHHHhcccccccCCCCCCCCCcc Confidence 000 111123569999999999999865443221 111 11223444332 222111111333 No 76 >protein:vir:95891 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240387;genbank:gi:66396087;genbank:GeneID:5133402 Probab=20.20 E-value=1.1 Score=20.39 Aligned_cols=104 Identities=17% Similarity=0.187 Sum_probs=56.2 Q ss_pred ecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccccc Q lcl|NC_020854. 18 LTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPF 97 (186) Q Consensus 18 vsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~ 97 (186) .+..+-+.-...... ....+++. ..||.. |++ |-+ T Consensus 1 Md~~dVK~l~~~~~~-------d~~~D~~~-~~li~~--y~e----------------~ae------------------- 35 (104) T protein:vir:95 1 MDAKDVKMINGLSLN-------DSSNDEQI-KYLIEE--YKS----------------VAE------------------- 35 (104) T ss_pred CCHHHHHHHhCCCCC-------CcccHHHH-HHHHHH--HHH----------------HHH------------------- Confidence 333333322211110 01222221 112211 221 111 Q ss_pred ccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccC Q lcl|NC_020854. 98 RITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISG 177 (186) Q Consensus 98 ~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~ 177 (186) .+.-..+-+..+|..|+.+..+++=. + ..+.+++.+.|.+|.+|.+ ..|..+...|.||.+=. T Consensus 36 dyCn~~F~~~~lP~gV~~fvA~~iky-----~-----~~~NissRSMGtVSYTy~t-------~iP~~i~~~LkPYRklr 98 (104) T protein:vir:95 36 DYCNQKFDDKAVPSGVKKFIAECIKF-----G-----TTGNISARTMGTVSYTYVT-------DIPSSAYAYLLPYRKLS 98 (104) T ss_pred HhcCCCCCCCCCCccHHHHHHHHHhh-----C-----CCCCcccccccceeecccc-------hhHHHHHHhhhhhhhhc Confidence 11222334558999999998887653 1 1347789999999999964 35678899999988766 Q ss_pred Cceeee Q lcl|NC_020854. 178 PGNIAV 183 (186) Q Consensus 178 ~g~~~~ 183 (186) .+.+-| T Consensus 99 ~~~~~~ 104 (104) T protein:vir:95 99 WGKRYV 104 (104) T ss_pred ccccCC Confidence 666666 No 77 >protein:vir:96281 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240313;genbank:gi:66396008;genbank:GeneID:5133358 Probab=20.20 E-value=1.1 Score=20.39 Aligned_cols=104 Identities=17% Similarity=0.187 Sum_probs=56.2 Q ss_pred ecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccccc Q lcl|NC_020854. 18 LTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPF 97 (186) Q Consensus 18 vsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~ 97 (186) .+..+-+.-...... ....+++. ..||.. |++ |-+ T Consensus 1 Md~~dVK~l~~~~~~-------d~~~D~~~-~~li~~--y~e----------------~ae------------------- 35 (104) T protein:vir:96 1 MDAKDVKMINGLSLN-------DSSNDEQI-KYLIEE--YKS----------------VAE------------------- 35 (104) T ss_pred CCHHHHHHHhCCCCC-------CcccHHHH-HHHHHH--HHH----------------HHH------------------- Confidence 333333322211110 01222221 112211 221 111 Q ss_pred ccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccC Q lcl|NC_020854. 98 RITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISG 177 (186) Q Consensus 98 ~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~ 177 (186) .+.-..+-+..+|..|+.+..+++=. + ..+.+++.+.|.+|.+|.+ ..|..+...|.||.+=. T Consensus 36 dyCn~~F~~~~lP~gV~~fvA~~iky-----~-----~~~NissRSMGtVSYTy~t-------~iP~~i~~~LkPYRklr 98 (104) T protein:vir:96 36 DYCNQKFDDKAVPSGVKKFIAECIKF-----G-----TTGNISARTMGTVSYTYVT-------DIPSSAYAYLLPYRKLS 98 (104) T ss_pred HhcCCCCCCCCCCccHHHHHHHHHhh-----C-----CCCCcccccccceeecccc-------hhHHHHHHhhhhhhhhc Confidence 11222334558999999998887653 1 1347789999999999964 35678899999988766 Q ss_pred Cceeee Q lcl|NC_020854. 178 PGNIAV 183 (186) Q Consensus 178 ~g~~~~ 183 (186) .+.+-| T Consensus 99 ~~~~~~ 104 (104) T protein:vir:96 99 WGKRYV 104 (104) T ss_pred ccccCC Confidence 666666 No 78 >protein:vir:97329 Length: 104 # NCBI annotation: ORF048 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240613;genbank:gi:66396311;genbank:GeneID:5133685 Probab=20.16 E-value=1.1 Score=20.38 Aligned_cols=104 Identities=17% Similarity=0.183 Sum_probs=56.2 Q ss_pred ecHHHHHHHHHhhccccccccccCCCHHHHHHHHHHHHHHHhhhccCcccCCcccccccCccCccccccchhhhcccccc Q lcl|NC_020854. 18 LTLSDAQDIIDGLVENDDVVAWATATADQKNRALYTATQRLDRERYLGARATDTQALQWPRTGVRKPDTYINTYAVGFPF 97 (186) Q Consensus 18 vsla~AdaY~~~r~~~~~~~~w~~~~~~~ke~aL~~Atd~id~~~~~G~r~~~~Q~laWPR~g~~~~~~~~~~~~~~~~~ 97 (186) .+..+-+.-...... ....+++.+ .||.. |++ |-+ T Consensus 1 Md~~dVK~l~~~~~~-------d~~~D~~~~-~li~~--y~e----------------~ae------------------- 35 (104) T protein:vir:97 1 MDTKDVKMINGLSLN-------DSSNDEQIE-YLIEE--YKS----------------VAE------------------- 35 (104) T ss_pred CCHHHHHHHhCCCCC-------CcccHHHHH-HHHHH--HHH----------------HHH------------------- Confidence 333333322211110 012222211 12211 221 111 Q ss_pred ccCcccccCCcchHHHHHHHHHHHHHHhcccCCCCCCcccceeEEecCeeEEeecCCCCcCcccchHHHHHHHhhhhccC Q lcl|NC_020854. 98 RITTDYFTDTEIPQQIKEAQATLAVYLNNNKDGIGLSGLEDYKNVKIGSIDVTPNQYGATGADRIPPMVERYLTGLRISG 177 (186) Q Consensus 98 ~~~~~~~~~d~IP~~Vk~A~~eLA~~~~~~~~~~~~~~~~~v~~~kvG~isveY~~~~~~~~~~~~~~v~~lL~~l~~~~ 177 (186) .+.-..+-+..+|..|+.+..+++=. + ..+.+++.+.|.+|.+|.+ ..|..+...|.||.+=. T Consensus 36 dyCn~~F~~~~lP~gV~~fvA~~iky-----~-----~~~NissRSMGtVSYty~t-------~iP~~i~~~LkPYRklr 98 (104) T protein:vir:97 36 DYCNQKFDDKAVPSGVKKFIAECIKF-----G-----TTGNISARTMGTVSYTYVT-------DIPSSAYAYLMPYRKLS 98 (104) T ss_pred HhcCCCCCCCCCCccHHHHHHHHHhh-----C-----CCCCcccccccceeecccc-------hhHHHHHHhhhhhhhhc Confidence 11222334558999999998887653 1 1347788999999999964 35678899999988766 Q ss_pred Cceeee Q lcl|NC_020854. 178 PGNIAV 183 (186) Q Consensus 178 ~g~~~~ 183 (186) .+.+-| T Consensus 99 ~~~~~~ 104 (104) T protein:vir:97 99 WGKRYV 104 (104) T ss_pred ccccCC Confidence 666666 Done!