Query         lcl|NC_021560.1_cdsid_YP_008130161.1 [gene=RHXG_00014] [protein=hypothetical protein] [protein_id=YP_008130161.1] [location=9912..10466]
Match_columns 184
No_of_seqs    105 out of 142
Neff          7.6
Searched_HMMs 1612
Date          Thu Nov 7 18:40:45 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_14 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_14_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:96763 Length: 177  100.0   1E-43 6.5E-47  256.2  18.4  168     1-184     5-175 (177)
  2 protein:vir:396 Length: 184 #  100.0 4.6E-42 2.8E-45  247.2  18.2  172     5-184     1-184 (184)
  3 protein:vir:3427 Length: 192 # 100.0 7.4E-40 4.6E-43  235.1  16.3  165     5-184     1-184 (192)
  4 protein:vir:6375 Length: 205 # 100.0 1.4E-33 8.6E-37  200.7  16.7  178     1-183     1-205 (205)
  5 protein:vir:79555 Length: 192  100.0 7.5E-33 4.6E-36  196.7  16.8  163     7-184     1-184 (192)
  6 protein:vir:10326 Length: 62 #  97.7 8.3E-08 5.2E-11   59.4   4.4   59   113-184      1-59 (62)
  7 protein:vir:194 Length: 149 #   97.4 4.5E-06 2.8E-09   49.9  10.0  144     1-184     2-149 (149)
  8 protein:vir:93617 Length: 148   97.2 2.1E-05 1.3E-08   46.2  11.4  144     1-184     2-148 (148)
  9 protein:vir:106570 Length: 182  97.1 5.7E-05 3.5E-08   43.8  12.9  166     1-184     2-182 (182)
 10 protein:vir:5745 Length: 135 #  97.0 2.8E-05 1.7E-08   45.6  10.7  135     1-183     1-135 (135)
 11 protein:vir:1891 Length: 179 #  96.3 0.00021 1.3E-07   40.8  10.9  156     1-184     3-171 (179)
 12 protein:vir:105467 Length: 144  96.0 0.00041 2.5E-07   39.2  11.0  137     1-184     1-142 (144)
 13 protein:vir:101594 Length: 173  95.9 0.00018 1.1E-07   41.1   8.9  163     5-181     1-173 (173)
 14 protein:vir:102963 Length: 163  95.3 0.00063 3.9E-07   38.1   9.9  145     1-184     1-156 (163)
 15 protein:vir:5978 Length: 144 #  95.0  0.0017 1.1E-06   35.7  11.3  140     1-183     4-144 (144)
 16 protein:vir:4347 Length: 164 #  94.8  0.0009 5.6E-07   37.3   9.3  146     1-184     3-156 (164)
 17 protein:vir:396 Length: 184 #   94.6   0.001 6.5E-07   36.9   9.1  166     8-184     1-176 (184)
 18 protein:vir:100243 Length: 140  94.5   0.001 6.2E-07   37.0   8.9  137     1-184     1-138 (140)
 19 protein:vir:79034 Length: 141   94.4  0.0019 1.2E-06   35.4  10.2  126     1-184     1-137 (141)
 20 protein:vir:80362 Length: 140   94.2  0.0025 1.6E-06   34.8  10.3  135     1-184     1-138 (140)
 21 protein:vir:97088 Length: 157   93.9   0.003 1.9E-06   34.4  10.2  147     1-184     1-155 (157)
 22 protein:vir:105089 Length: 133  93.8  0.0038 2.3E-06   33.9  10.4  129     1-184     2-132 (133)
 23 protein:vir:1437 Length: 140 #  93.7  0.0042 2.6E-06   33.6  10.6  134     2-184     1-138 (140)
 24 protein:vir:94538 Length: 125   93.5   0.002 1.3E-06   35.4   8.5  117     1-184     5-124 (125)
 25 protein:vir:100075 Length: 140  93.3  0.0046 2.9E-06   33.4  10.1  134     1-184     1-138 (140)
 26 protein:vir:1386 Length: 149 #  92.7  0.0072 4.4E-06   32.3  10.4  142     1-184     1-149 (149)
 27 protein:vir:106623 Length: 115  92.1  0.0033   2E-06   34.2   7.7  112     5-183     1-115 (115)
 28 protein:vir:99744 Length: 115   92.1  0.0035 2.2E-06   34.1   7.8  115     5-183     1-115 (115)
 29 protein:vir:78335 Length: 133   92.0  0.0076 4.7E-06   32.2   9.6  129     3-182     1-133 (133)
 30 protein:vir:95789 Length: 114   91.9  0.0027 1.7E-06   34.7   7.0  113     1-179     1-114 (114)
 31 protein:vir:3617 Length: 112 #  91.8  0.0073 4.5E-06   32.3   9.2  111     1-183     1-112 (112)
 32 protein:vir:103917 Length: 115  91.6  0.0036 2.2E-06   34.0   7.4  115     5-183     1-115 (115)
 33 protein:vir:96225 Length: 115   91.6  0.0036 2.2E-06   34.0   7.4  115     5-183     1-115 (115)
 34 protein:vir:78858 Length: 115   91.6  0.0036 2.2E-06   34.0   7.4  115     5-183     1-115 (115)
 35 protein:vir:96358 Length: 115   91.6  0.0036 2.2E-06   34.0   7.4  115     5-183     1-115 (115)
 36 protein:vir:97144 Length: 115   91.6  0.0036 2.2E-06   34.0   7.4  115     5-183     1-115 (115)
 37 protein:vir:9312 Length: 115 #  91.6  0.0036 2.2E-06   34.0   7.4  115     5-183     1-115 (115)
 38 protein:vir:107568 Length: 146  91.0   0.014 8.5E-06   30.8   9.9  139     1-182     1-146 (146)
 39 protein:vir:102085 Length: 146  91.0   0.014 8.5E-06   30.8   9.9  139     1-182     1-146 (146)
 40 protein:vir:102875 Length: 146  91.0   0.014 8.5E-06   30.8   9.9  139     1-182     1-146 (146)
 41 protein:vir:105007 Length: 146  91.0   0.014 8.5E-06   30.8   9.9  139     1-182     1-146 (146)
 42 protein:vir:94419 Length: 133   88.4    0.02 1.3E-05   29.8   8.7  127     3-180     1-133 (133)
 43 protein:vir:96973 Length: 133   88.4    0.02 1.3E-05   29.8   8.7  127     3-180     1-133 (133)
 44 protein:vir:9363 Length: 133 #  88.4    0.02 1.3E-05   29.8   8.7  127     3-180     1-133 (133)
 45 protein:vir:78644 Length: 133   88.4    0.02 1.3E-05   29.8   8.7  127     3-180     1-133 (133)
 46 protein:vir:94108 Length: 149   87.8    0.01 6.5E-06   31.4   6.8  136     1-179    13-149 (149)
 47 protein:vir:1273 Length: 127 #  86.6   0.038 2.4E-05   28.4   9.2  125     2-179     1-127 (127)
 48 protein:vir:3873 Length: 128 #  86.6   0.039 2.4E-05   28.3   9.2  123     3-179     1-128 (128)
 49 protein:vir:101302 Length: 134  86.6   0.035 2.2E-05   28.5   9.0  130     3-181     1-134 (134)
 50 protein:vir:9513 Length: 134 #  86.6   0.035 2.2E-05   28.5   9.0  130     3-181     1-134 (134)
 51 protein:vir:100652 Length: 134  86.0   0.032   2E-05   28.8   8.4  130     3-181     1-134 (134)
 52 protein:vir:93898 Length: 133   83.2    0.06 3.7E-05   27.3   8.6  127     3-180     1-133 (133)
 53 protein:vir:79555 Length: 192   83.0   0.052 3.2E-05   27.6   8.2  165     1-184     1-192 (192)
 54 protein:vir:105916 Length: 149  82.8   0.059 3.7E-05   27.3   8.4  136     1-179    13-149 (149)
 55 protein:vir:81147 Length: 126   82.5   0.055 3.4E-05   27.5   8.1  122     1-184     1-124 (126)
 56 protein:vir:1988 Length: 156 #  77.5   0.088 5.5E-05   26.4   7.5  150     1-172     1-156 (156)
 57 protein:vir:105330 Length: 137  76.9   0.039 2.4E-05   28.3   5.4  135     1-175     2-137 (137)
 58 protein:vir:9930 Length: 108 #  73.6    0.17  0.0001   24.8   8.8  107     7-184     1-108 (108)
 59 protein:vir:966 Length: 123 #   71.9    0.19 0.00012   24.5   9.3  120     1-184     1-123 (123)
 60 protein:vir:99101 Length: 142   68.0    0.11 7.1E-05   25.8   5.8  136     1-175     2-142 (142)
 61 protein:vir:8669 Length: 142 #  68.0    0.11 7.1E-05   25.8   5.8  136     1-175     2-142 (142)
 62 protein:vir:743 Length: 108 #   66.8    0.26 0.00016   23.8   9.3  107     5-183     1-108 (108)
 63 protein:vir:94654 Length: 142   64.7    0.16 9.8E-05   25.0   5.9  137     1-179     4-142 (142)
 64 protein:vir:99528 Length: 92 #  62.9    0.23 0.00014   24.1   6.4   87      1-95      2-92 (92)
 65 protein:vir:2740 Length: 114 #  62.8    0.33  0.0002   23.2   8.0  113     2-172     1-114 (114)
 66 protein:vir:4906 Length: 114 #  62.8    0.33  0.0002   23.2   8.0  113     2-172     1-114 (114)
 67 protein:vir:3427 Length: 192 #  62.8    0.33  0.0002   23.2   9.2  162     1-184     1-192 (192)
 68 protein:vir:95894 Length: 137   60.3    0.37 0.00023   22.9   9.0  136     1-175     1-137 (137)
 69 protein:vir:107099 Length: 137  60.1    0.38 0.00024   22.9   9.3  136     1-175     1-137 (137)
 70 protein:vir:102338 Length: 116  56.6    0.45 0.00028   22.5   7.5  110    25-183     1-116 (116)
 71 protein:vir:96486 Length: 112   50.1    0.32  0.0002   23.3   5.0  110     2-143     1-112 (112)
 72 protein:vir:97427 Length: 137   49.3    0.64  0.0004   21.6   9.8  136     1-175     1-137 (137)
 73 protein:vir:93738 Length: 137   49.3    0.64  0.0004   21.6   9.8  136     1-175     1-137 (137)
 74 protein:vir:94490 Length: 137   49.3    0.64  0.0004   21.6   9.8  136     1-175     1-137 (137)
 75 protein:vir:98409 Length: 108   48.4    0.67 0.00042   21.5   9.2  107     5-183     1-108 (108)
 76 protein:vir:3787 Length: 231 #  45.1    0.78 0.00049   21.2  12.0  170     1-184     3-228 (231)
 77 protein:vir:95372 Length: 124   39.9       1 0.00062   20.6   9.8  120     1-184     1-124 (124)
 78 protein:vir:99833 Length: 190   39.8       1 0.00062   20.6   8.6  157     1-174     4-190 (190)
 79 protein:vir:98636 Length: 138   39.2       1 0.00064   20.5   9.7  129     1-184     1-138 (138)
 80 protein:vir:4096 Length: 140 #  30.3     1.6 0.00098   19.5   6.6  136     1-184     3-139 (140)
 81 protein:vir:81106 Length: 125   27.8     1.8  0.0011   19.2   9.1  122     1-183     1-125 (125)
 82 protein:vir:9414 Length: 125 #  27.8     1.8  0.0011   19.2   9.1  122     1-183     1-125 (125)
 83 protein:vir:4704 Length: 125 #  27.8     1.8  0.0011   19.2   9.1  122     1-183     1-125 (125)
 84 protein:vir:98342 Length: 125   27.8     1.8  0.0011   19.2   9.1  122     1-183     1-125 (125)
 85 protein:vir:79988 Length: 125   27.8     1.8  0.0011   19.2   9.1  122     1-183     1-125 (125)
 86 protein:vir:94796 Length: 137   26.4     1.9  0.0012   19.0   9.9  135     1-175     2-137 (137)
 87 protein:vir:96829 Length: 135   25.7       2  0.0013   18.9   9.2  134     1-175     1-135 (135)
 88 protein:vir:98860 Length: 230   24.5     2.2  0.0013   18.7   8.8  169     1-184     5-228 (230)
 89 protein:vir:96121 Length: 137   24.4     2.2  0.0013   18.7   9.5  136     1-175     1-137 (137)
 90 protein:vir:96012 Length: 133   24.0     2.2
0.0014 18.7 7.8 130 1-182 1-133 (133) 91 protein:vir:9647 Length: 132 # 23.6 2.3 0.0014 18.6 9.6 127 1-184 1-132 (132) No 1 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=100.00 E-value=1e-43 Score=256.22 Aligned_cols=168 Identities=20% Similarity=0.211 Sum_probs=152.8 Q ss_pred CeEEeeHH-HHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSK-DFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~-~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~ 78 (184) |+|+||++ +++.+.+.|..+++++ |+|+++|||+|+.|++|+++++|+++|+||++.|+ ++++++++ ++++++||+ T Consensus 5 ~~l~idv~~~l~~i~~~l~~~~~~~-~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~-~~~~~~i~~ 82 (177) T protein:vir:96 5 FEMKIDVSREAEDIAAMVAATTKQL-ELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQR-QKGEVRFWV 82 (177) T ss_pred ceeEEehhHHHHHHHHHHhhcHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccC-CCcEEEEEE Confidence 88888876 6999999999998777 79999999999999999999999999999999997 58888888 578999999 Q ss_pred eecceeeeecC-CCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 79 ESGWIPLQRLG-AVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 79 ~g~~i~l~~f~-~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) ++++|||++|+ +++++.||++ |++.|+|||++++++||++||+|.|+ +|+||+++++|+.++|. . 
T Consensus 83 ~~~~i~l~~~~~~r~t~~Gv~~-g~~~~~gaFia~~~~g~~~Vf~R~gk--------~R~PI~~~~~pi~~~~~-----~ 148 (177) T protein:vir:96 83 GLDPIGVYRLGTPKVTQKGVKV-NRNEYDGAFISPMKSNYPLVFKRRGK--------ERLPIDLVDEDIDEPAM-----E 148 (177) T ss_pred eccceehhhcccCCCCccceEE-eeEEcCCceeccCCCCCceEEEEecC--------CccceEEEEcCchHHHH-----H Confidence 99999999996 6788888877 67789999999999999999999865 79999999999998874 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) +++.+.+++++.|+++|+|||+|+|.- T Consensus 149 ~~e~~~~~~~~~~~~~l~~Ei~~~L~g 175 (177) T protein:vir:96 149 VVERWERRVFQRFKELFEQEARAIING 175 (177) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 567788899999999999999999999 No 2 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=100.00 E-value=4.6e-42 Score=247.21 Aligned_cols=172 Identities=20% Similarity=0.255 Sum_probs=147.9 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEeecce Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVESGWI 83 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~g~~i 83 (184) ||+++|++++++|.+++++++|+|+++|||+|+.|++|+++++++++|+||+++|+ ++++++|+++++++.||+++++| T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~kas~~~l~a~I~~~~~~i 80 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVKRATVNKPRALIRVNRGNL 80 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheecccCCCCeEEEEEEeccce Confidence 99999999999999999898899999999999999999999999999999999997 58899999999999999999999 Q ss_pred eeeecCC--------CCCCCcce---EecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhc Q lcl|NC_021560. 
84 PLQRLGA--------VQNATGVY---AKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAIT 152 (184) Q Consensus 84 ~l~~f~~--------r~~~~gv~---~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~ 152 (184) ||++|++ ++.+.|+. ..|++.|+|||+++|++||++||+|.|. +|+||++++.|..+.+.. T Consensus 81 ~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk--------~R~PI~~~~~~i~~~~~e 152 (184) T protein:vir:39 81 PAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSK--------PRYPIEVVSIPLAAPLTT 152 (184) T ss_pred eeeeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEEecC--------cccceeEEEcCchHHHHH Confidence 9999975 23333432 3466779999999999999999999865 799999999874333222 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 153 NNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ..++++.+.+++.++..|+..|+|||.++|.| T Consensus 153 ~~~~~~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 153 AFKEELPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 23455677788888888999999999999999 No 3 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=100.00 E-value=7.4e-40 Score=235.13 Aligned_cols=165 Identities=21% Similarity=0.248 Sum_probs=141.0 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEeecce Q lcl|NC_021560. 
5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVESGWI 83 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~g~~i 83 (184) +.+++|++++++|.+|+++++|+|+++|||+|+.|++|+++++|+++|+||+++|+ ++++++|+.+++++.|+++++++ T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~~~l~a~I~~~~~~l 80 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATVKNPQARIKVNRGDL 80 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccCCCceEEEEEeccce Confidence 33369999999999999999999999999999999999999999999999999998 58999999999999999999999 Q ss_pred eeeecCCCCC---CC---------------cceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecC Q lcl|NC_021560. 84 PLQRLGAVQN---AT---------------GVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAA 145 (184) Q Consensus 84 ~l~~f~~r~~---~~---------------gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gp 145 (184) |+++|+..+. ++ .+...|++.|+|||+++|+|||++||+|.+| +.||||+++..| T Consensus 81 ~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~g-------k~R~PIe~vkIp 153 (192) T protein:vir:34 81 PVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAG-------KNRYPIDVVKIP 153 (192) T ss_pred eeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccC-------CCccceeEEEec Confidence 9999976321 11 1234577789999999999999999999643 369999998876 Q ss_pred chhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 146 NPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 146 si~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) +.++ .-+.+++++++.|+++|+|||.|.|.. 
T Consensus 154 -is~~-------l~~af~~~~~~~~~~~~~~El~~~L~~ 184 (192) T protein:vir:34 154 -MAVP-------LTTAFKQNIERIRRERLPKELGYALQH 184 (192) T ss_pred -hhHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333 235667788888999999999999888 No 4 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=100.00 E-value=1.4e-33 Score=200.73 Aligned_cols=178 Identities=15% Similarity=0.234 Sum_probs=145.9 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcccHHHHh---hhhhee-cccCCcEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVAR-LGPHTQMPRELVA---ALTTAH-FNAGGNTSK 75 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~-i~~~~~ik~k~ik---~~~~~k-a~~~~~~a~ 75 (184) |.|+++.++++++.+.|.+||..+ ++|+.+|||+|+.++++.++++ ++++|++|..+|+ +|++.| ||+++|+|. T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~-~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~ 79 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDIS-QQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAV 79 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhh-hHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEE Confidence 999999999999999999999777 6899999999999999999997 9999999999998 566655 999999999 Q ss_pred EEEeecceeeeecCCCCC-C-----CcceEeccc----ccCcceeeecC--------CCCeeeEEecCCeeecccCcc-- Q lcl|NC_021560. 76 VVVESGWIPLQRLGAVQN-A-----TGVYAKLRG----SYRHAFIAAMK--------SGHVGAFRRVPGTQMSSATGK-- 135 (184) Q Consensus 76 i~~~g~~i~l~~f~~r~~-~-----~gv~~~~~~----~~~gaFia~~~--------~g~~~vf~R~~~~~~~~~~~~-- 135 (184) |..+.+|+.|.+|.++.. . .||+|+++. 
.|+|||+.+++ |||.|||.|.++...++...+ T Consensus 80 I~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~ 159 (205) T protein:vir:63 80 IGARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGAT 159 (205) T ss_pred EecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCce Confidence 999999999999975432 2 488876432 49999999997 899999999998776643221 Q ss_pred --ccceeeeecCchhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 136 --REQIRELFAANPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 136 --R~PI~~l~gpsi~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) --+|+.||||||+|++++.. +.+.+.+.+.+.+.+++|..+++. T Consensus 160 k~~~~~k~LYGPSV~Qvf~~~~----e~I~~~i~~~l~~~f~r~~~~~~~ 205 (205) T protein:vir:63 160 KLSNNVYLLYGPSVDQVFRTVA----DDITTEVLDALADEFLRQFTRLSE 205 (205) T ss_pred ecCCceEEEEcCcHHHHHhhhh----hhhhHHHHHHHHHHHHHhhhhhcC Confidence 13689999999999999754 445555555555555556666666 No 5 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=99.96 E-value=7.5e-33 Score=196.72 Aligned_cols=163 Identities=20% Similarity=0.278 Sum_probs=126.9 Q ss_pred HHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH--------hcccHHHHh-hhhheeccc-CCcEEEE Q lcl|NC_021560. 
7 SKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPH--------TQMPRELVA-ALTTAHFNA-GGNTSKV 76 (184) Q Consensus 7 ~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~--------~~ik~k~ik-~~~~~ka~~-~~~~a~i 76 (184) +++|++++++|..|+..++|+|..+|||+++.|+.+.+.+.|+++ ++||.++|+ +++++++++ +.+++.| T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I 80 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARI 80 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEE Confidence 889999999999999999999999999997777777777777765 699999998 688999887 4789999 Q ss_pred EEeecceeeeecCCCC---C--------CCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecC Q lcl|NC_021560. 77 VVESGWIPLQRLGAVQ---N--------ATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAA 145 (184) Q Consensus 77 ~~~g~~i~l~~f~~r~---~--------~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gp 145 (184) |++.++||+++++..+ . ..++...|++.|+|||+++|+||+++||+|..| +.||||++...| T Consensus 81 ~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~R~~g-------k~R~PIevvkIp 153 (192) T protein:vir:79 81 RVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDG-------KNRYPIDVVKIP 153 (192) T ss_pred EEecCceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceEecCC-------CccCCeeeEeec Confidence 9999999999998643 1 223345588999999999999999999999532 379999988877 Q ss_pred chhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 146 NPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 146 si~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) - .+.+ .+.+++++++.|++.|++|+.+.|.. 
T Consensus 154 i-s~~l-------~~af~~e~~r~~~~~~~~el~~~L~~ 184 (192) T protein:vir:79 154 L-SGPL-------TQAFEDARDRIIAAEMPKQLGYALKQ 184 (192) T ss_pred h-HHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 3333 23344444445555555555555544 No 6 >protein:vir:10326 Length: 62 # NCBI annotation: ORF28 # Family: family:all:1091 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758921;genbank:gi:27311195;genbank:GeneID:956157 Probab=97.72 E-value=8.3e-08 Score=59.39 Aligned_cols=59 Identities=22% Similarity=0.258 Sum_probs=50.0 Q ss_pred cCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 113 MKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 113 ~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) |++.+..||+|.|+ .|+||+.+-- .+....+++.+.+...++.+|.+.|++|.+++|+- T Consensus 1 M~S~~llVfRR~gk--------eRlpIe~V~~-----dI~e~~~~ivery~~r~~~rF~elf~qE~~yvLs~ 59 (62) T protein:vir:10 1 MKSEHLNVFRRKGR--------ERLPIEVVRL-----PIEEQSNPIFERYYQRAQGRFTELLRQELNFALNH 59 (62) T ss_pred CCCCccchhhccCc--------cccchhhhcc-----ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999765 7999986432 23334567999999999999999999999999999 No 7 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.40 E-value=4.5e-06 Score=49.91 Aligned_cols=144 Identities=11% Similarity=0.149 Sum_probs=72.5 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe--ec-ccCCcEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA--HF-NAGGNTSKV 76 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~--ka-~~~~~~a~i 76 (184) |+++|+.++|+++.+.|..|+..+..+++..||..++.-+...+.+.+.. ....++ ++.+. +. 
..+.....+ T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~----~~g~l~~si~~~~~~~~~~~~~~~~v 77 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPV----RTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC----Cchhhhhhccccccccccccceeecc Confidence 99999999999999999999876644667777777777776666554322 223333 22221 11 111112222 Q ss_pred EEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 77 VVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 77 ~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) .+. ............+.... -..+|.+.+ .-.|. ...|=...+.| T Consensus 78 ~~~-------~~~~~~~~~~~~~~~~~-~~~~~y~~f--------~E~GT--------~~~~a~PF~~p----------- 122 (149) T protein:vir:19 78 HIR-------GVNPRTGNSDNTMKANN-PRNAFYWRF--------VELGT--------ANMPAHPFVRP----------- 122 (149) T ss_pred ccc-------ccccccccccceeecCC-CCccceeee--------eccCC--------CCCCCCcchhH----------- Confidence 211 11111111111111111 112233221 11111 11121111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 
157 VYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.|...|..||++.|.| T Consensus 123 -A~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 123 -AYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 245556677778888888899999999 No 8 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=97.16 E-value=2.1e-05 Score=46.23 Aligned_cols=144 Identities=16% Similarity=0.209 Sum_probs=73.7 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe--ecccCCcEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA--HFNAGGNTSKVV 77 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~--ka~~~~~~a~i~ 77 (184) |+++++.++|+++.+.|..|+..+..++...||..++.-+...+...+-..+ ..++ .+.+. +...+.+...|. T Consensus 2 m~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~----g~l~~~i~~~~~~~~~g~~~~~v~ 77 (148) T protein:vir:93 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRR----GKLRRNVVVLSRRSRDGGMESGVH 77 (148) T ss_pred cceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCc----chhhhhceeccccccCCceeeeee Confidence 9999999999999999999986654467788888888777766665543222 2232 22211 112222222222 Q ss_pred EeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 78 VESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 78 ~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) ..+ ........+..+..+. -..+|... 
|.-.| .+..|=...+.| T Consensus 78 ~~~-------~~~~~~~~~~~~~~~~-~~~~~y~~--------f~E~G--------T~~~pa~PFl~p------------ 121 (148) T protein:vir:93 78 IRG-------VNPDTGNSDNTMKADN-PRNAFYWR--------FVEMG--------TVNMPPHPFVRP------------ 121 (148) T ss_pred ecc-------cccccccccceeecCC-CCCcceee--------eeccC--------CCCCCCCcchhH------------ Confidence 221 1111112222222211 11122221 11111 112222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.|.+.|..||++.|.| T Consensus 122 A~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 122 AFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 234455566677777888888888888 No 9 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=97.06 E-value=5.7e-05 Score=43.85 Aligned_cols=166 Identities=14% Similarity=0.134 Sum_probs=90.4 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe-ecccCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA-HFNAGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~-ka~~~~~~a~i~~ 78 (184) |+++| .+++++.+.|..++..+ .+++..|+.+++..+-..+.+++....-+....++ ++... ..+.+.+++.|+. T Consensus 2 ~~v~i--~Gld~L~~kl~~~~~~~-~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~ 78 (182) T protein:vir:10 2 IEVEL--KGVNELRAKLKKLPDIM-AKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWN 78 (182) T ss_pred eEEEE--ecHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeec Confidence 55555 58999999999999777 57999999999999888888888777677777776 44321 2234556777777 Q ss_pred eecceeeeecCCCCC----CCcc----eEe----cccccCcceeeecCCCCeeeEEe-cCCeeecccCccccceeeeecC Q lcl|NC_021560. 
79 ESGWIPLQRLGAVQN----ATGV----YAK----LRGSYRHAFIAAMKSGHVGAFRR-VPGTQMSSATGKREQIRELFAA 145 (184) Q Consensus 79 ~g~~i~l~~f~~r~~----~~gv----~~~----~~~~~~gaFia~~~~g~~~vf~R-~~~~~~~~~~~~R~PI~~l~gp 145 (184) +-..=+...||+... ..|+ ... +|. +...++.-...++.++|.- ..+..++...+ .|=+..+=| T Consensus 79 ~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~-~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G--~~aqPFl~p 155 (182) T protein:vir:10 79 SSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWF-FPVDSVDLDLTKIYGIPKIKINGKYFYRTTG--QPARQFMTP 155 (182) T ss_pred CCCccceeecCcccccccCccccCccceeeeecCCce-eeccccccccccccccceeeecCceEeecCC--CCCCcchHH Confidence 766666667876431 1111 110 111 1111111111122222211 11111111000 122221212 Q ss_pred chhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 146 NPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 146 si~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++...+.+.+.+.+.+..+|...||= T Consensus 156 ------------A~~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 156 ------------AANKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred ------------HHHHhHHHHHHHHHHHHHHHHHHhhcC Confidence 234444556666666777777777777 No 10 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.00 E-value=2.8e-05 Score=45.58 Aligned_cols=135 Identities=13% Similarity=0.107 Sum_probs=64.2 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |.++++.++|+++.+.|..|+..+..++...||.+++.-+...+...+.....-..+.++ ..|.++ T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~-------------~~I~i~- 66 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMR-------------DSIKIR- 66 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHH-------------hhcccc- Confidence 999999999999999999998766456777888888776665554332211111112222 222111 Q ss_pred cceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHH Q lcl|NC_021560. 81 GWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLD 160 (184) Q Consensus 81 ~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~ 160 (184) ..........+.+.... ..+|. .+..|.-.|- ++.|=...+.| .++ T Consensus 67 ------~~k~~~~~~~v~v~vg~--~~~~~------~~~~f~E~GT--------~~~~a~PF~~p------------a~~ 112 (135) T protein:vir:57 67 ------SSRGKAGSTVVVLRVGP--TRSHY------MKALAQEFGT--------IKQVAKPFIRP------------ALD 112 (135) T ss_pred ------cccccccceeEEEEecC--CCCcc------eeEeecccCC--------CCCCCCcchhH------------hHH Confidence 11111111122222110 01110 1122221121 22232222222 234 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 
161 VLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 161 ~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) .-.+.+.+.|.+.|..||+++.- T Consensus 113 ~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 113 YNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred HhHHHHHHHHHHHHHHHHHHhcC Confidence 44455555666666666666555 No 11 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=96.29 E-value=0.00021 Score=40.77 Aligned_cols=156 Identities=15% Similarity=0.133 Sum_probs=71.3 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcccHHHHh-hhhhe---eccc--CCcE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPH-TQMPRELVA-ALTTA---HFNA--GGNT 73 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~-~~ik~k~ik-~~~~~---ka~~--~~~~ 73 (184) =.|+++.++|+++.+.|..|+..+..++++.||.+++.-++.++...+... .......++ ++.+. +.+. +... T Consensus 3 ~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~~~ 82 (179) T protein:vir:18 3 DSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGDLA 82 (179) T ss_pred ceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeeccccccccccccee Confidence 137888889999999999998776556888888888877776666554221 112222332 22221 1111 2223 Q ss_pred EEEEEeecceeeeecCCCCCCC--cceEe--ccccc--CcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCch Q lcl|NC_021560. 74 SKVVVESGWIPLQRLGAVQNAT--GVYAK--LRGSY--RHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANP 147 (184) Q Consensus 74 a~i~~~g~~i~l~~f~~r~~~~--gv~~~--~~~~~--~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi 147 (184) ..+.+.++..+.........+. +.... +.... 
..+|...+ .-| | ....|=...+.|+ T Consensus 83 ~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~f-----vEf---G--------T~kmpa~PFlrPA- 145 (179) T protein:vir:18 83 FRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRF-----LEF---G--------TEHTSARPILRPA- 145 (179) T ss_pred EeeecccccccccccccccccCcccccccccccccCCCCccceeEE-----ecc---C--------CCCCCCCccchhh- Confidence 3333333333222221111111 11100 00000 01222211 011 1 1123333333332 Q ss_pred hHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 148 AHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 148 ~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ++.-.+.+.+.|.+.|.+||++.|.| T Consensus 146 -----------~~~~~~~a~~~i~~~l~~~i~k~lk~ 171 (179) T protein:vir:18 146 -----------MNGVDNDVINVFSTEMGKAIDRAIRL 171 (179) T ss_pred -----------HHhhHHHHHHHHHHHHHHHHHHHHHh Confidence 23333455555666666666666666 No 12 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=95.95 E-value=0.00041 Score=39.16 Aligned_cols=137 Identities=14% Similarity=0.185 Sum_probs=74.8 Q ss_pred CeE-EeeHHHHHHHHHHHhhccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheec--ccCCcEEE Q lcl|NC_021560. 1 MRL-EMNSKDFEELERAFRRLPGE-IRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHF--NAGGNTSK 75 (184) Q Consensus 1 m~i-~id~~~l~~~~~~L~~l~~~-~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka--~~~~~~a~ 75 (184) |.. .||.++|+++.+.|...... .+++.+..+++.++.. +.+++.+.+-+....++ +..+... +.+.++.. 
T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~----~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~ 76 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQ----SLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIK 76 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHH----HHHHHHHhCCCCcchhccceeecceeeecCeeEEE Confidence 775 88999999999999876432 2345555656555554 45666667777776665 3443333 33444455 Q ss_pred EEEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCch Q lcl|NC_021560. 76 VVVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNP 155 (184) Q Consensus 76 i~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~ 155 (184) |..+..--++..||+|+.+.+-...-++....+|+ .| -.| T Consensus 77 V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V--------------~G---------------------~~~----- 116 (144) T protein:vir:10 77 LINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWV--------------PG---------------------QFY----- 116 (144) T ss_pred EecCCCcccccccceeecCCcccccCCCcccccee--------------cC---------------------ccc----- Confidence 54444445666677765432111000000011111 11 012 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 156 DVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 156 ~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ++.-.+.++..|++.|+.+|+-++.. T Consensus 117 ---~~~a~~~~~~~~~~~l~k~l~~l~d~ 142 (144) T protein:vir:10 117 ---MKKSIPQIQRQLPQLVTEGLWGLKDL 142 (144) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 22233455666777777788888877 No 13 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=95.92 E-value=0.00018 Score=41.15 Aligned_cols=163 Identities=11% Similarity=0.076 Sum_probs=76.3 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheec-ccCCcEEEEEEeecc Q lcl|NC_021560. 
5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHF-NAGGNTSKVVVESGW 82 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka-~~~~~~a~i~~~g~~ 82 (184) |+.++++++.+.|..++..+ .+++..|+..++..+...+...+... ...++ ++.+... ..+.+.+.++....- T Consensus 1 i~i~Gld~L~~~L~~l~~~~-~~~~~~a~~~~a~~i~~~ak~~aPv~----TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Y 75 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-DKNINATTEEAANFIEDRAKTLAPKN----FGKLAQSISTSDLKAKDLISKKITVNELY 75 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcC----chhhhhcceeeeeccCceeEEeeCCCccc Confidence 88889999999999998766 57889999988888877776554433 34454 3433222 223445555544444 Q ss_pred eeeeecCCCCCCCcceE------ecccccCcceeeecCCCCeeeEEecCC-eeecccCccccceeeeecCc-hhHhhcCc Q lcl|NC_021560. 83 IPLQRLGAVQNATGVYA------KLRGSYRHAFIAAMKSGHVGAFRRVPG-TQMSSATGKREQIRELFAAN-PAHAITNN 154 (184) Q Consensus 83 i~l~~f~~r~~~~gv~~------~~~~~~~gaFia~~~~g~~~vf~R~~~-~~~~~~~~~R~PI~~l~gps-i~~m~~~~ 154 (184) -....||+++....-.. -+.+.....|.....+-|.. .+.-+ ...+ ....+....+|. .||-+-.. T Consensus 76 a~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~~~~----~~~~~~~~~~~G~~aqPFl~P 149 (173) T protein:vir:10 76 GAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAW--CRAKGIDEKA----AYPIFAKILGAGINPQPFLYP 149 (173) T ss_pred chhhhcccccccCCCchhhhhhccccccccccccccccccccc--ccccccchhc----ccceeeEeecCCCCCCccchh Confidence 44556777654321110 01111011111110000000 00000 0000 000011111111 12222111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021560. 
155 PDVYLDVLAGVIEDYFFPRIVHEIERL 181 (184) Q Consensus 155 ~~~~~~~~~~~~~~~~~~rl~~Ei~r~ 181 (184) .++.-.+.+.+.+...|..||..+ T Consensus 150 ---A~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 150 ---AWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred ---HHHHhHHHHHHHHHHHHHHHhhcC Confidence 234445566666667777777777 No 14 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=95.34 E-value=0.00063 Score=38.15 Aligned_cols=145 Identities=10% Similarity=0.053 Sum_probs=65.3 Q ss_pred CeEEeeHHHHHHHHHHHhhccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccH-----H--HHhh---hhheeccc Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPG-EIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPR-----E--LVAA---LTTAHFNA 69 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~-~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~-----k--~ik~---~~~~ka~~ 69 (184) |+..||-++|+++.+.|..+.. +..+..+...+|..+..+ .+.+-+.|=+-. . ..+. .+..+... T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~l----l~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~ 76 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTEL----KSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAH 76 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHH----HHHHHHhCCcccchhhhhhhhhcccchhhhhcccc Confidence 9999999999999999987643 233445666666666555 555555443311 1 0000 00001111 Q ss_pred CCcEEEEEEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhH Q lcl|NC_021560. 70 GGNTSKVVVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAH 149 (184) Q Consensus 70 ~~~~a~i~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~ 149 (184) ...+..+.- +-...+..++..+..+.....-+.|=. +-.||+.+ .|| ..| . -. 
T Consensus 77 ~k~tG~lr~-----swk~~~~~k~~~~~~v~v~N~~~YA~~--VE~GHR~~---~gG---------fV~-------G-~f 129 (163) T protein:vir:10 77 GKQGGTLQK-----GWSKSRIEVSGRTYKQKVYNKVYYAPH--VEYGHKTV---NGG---------FVP-------G-QF 129 (163) T ss_pred ccccchhhc-----cceecceeecCCceEEEEEecCCccch--hhcceeec---CCc---------eec-------c-ch Confidence 111111100 000011122222222222222222100 13455443 111 111 0 11 Q ss_pred hhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 150 AITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 150 m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) | ++.-.+++++.|++.|+.+|..+|.| T Consensus 130 m--------l~~s~~~~~~~~~~~~e~~l~~~l~k 156 (163) T protein:vir:10 130 F--------LHKTVEDTKSDMEKRVRDKYDGFMRK 156 (163) T ss_pred h--------hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 33444566677888888888888877 No 15 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=95.02 E-value=0.0017 Score=35.75 Aligned_cols=140 Identities=13% Similarity=0.160 Sum_probs=74.6 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |.+.||.+.++++.+.|..++..+ .+++.+||.+++..+...+. ...-+.-..++ ++.. 
..+.+++++.|..+ T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~-~~~v~~~l~~~a~~i~~~ak----~~apv~TG~Lr~SI~~-~~~~~g~~~~V~~~ 77 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHV-LTQVEQVIIKTAEKIAGLAA----SLAPVDEGNLKNSIQI-DYKNNGLTAEITVG 77 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH----HhCCccchhhhcCeeE-EeecCcEEEEEecC Confidence 999999999999999999998776 56888888887766555443 33344455555 3433 23455677877776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||+..... ++.+.-.+.|.. .+..+-|.+.. +.| |+-+ +. T Consensus 78 ~~YA~~vE~GT~~~~~----~~~~~~~~~~~~---~~~~g~~~~t~-----------------g~~--a~Pf------l~ 125 (144) T protein:vir:59 78 AEYAIYVEYGTGIYAV----DGNGRKTPWTYY---SPKLGRYVRTQ-----------------GAP--AQPF------FW 125 (144) T ss_pred CCccchhhcCcccccc----CCCccccccccc---cccccceecCC-----------------CCC--CCcc------hh Confidence 5555556677644211 011000011110 00111111110 111 1111 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) ..+ +.-.+.+..+|+++.| T Consensus 126 pA~-----~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 126 PAV-----EEGGEYFEREMRRLRG 144 (144) T ss_pred HHH-----HHHHHHHHHHHHHhcC Confidence 111 1234456668888888 No 16 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=94.83 E-value=0.0009 Score=37.28 Aligned_cols=146 Identities=10% Similarity=0.064 Sum_probs=66.5 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcccHHHHh-hhhhee-----cccCCcE Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPH-TQMPRELVA-ALTTAH-----FNAGGNT 73 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~-~~ik~k~ik-~~~~~k-----a~~~~~~ 73 (184) =.|++++++|+++.+.|..|+.++..++++.||..++.-++..+...+... ..-....++ ++.+.. ...+.+. T Consensus 3 ~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~ 82 (164) T protein:vir:43 3 DTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLG 82 (164) T ss_pred cceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCcccccccee Confidence 137788889999999999998776456788888888877766666554321 111112232 222211 1111111 Q ss_pred EEEEEeecceeeeecCCCCCCCcceEecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhc Q lcl|NC_021560. 74 SKVVVESGWIPLQRLGAVQNATGVYAKLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAIT 152 (184) Q Consensus 74 a~i~~~g~~i~l~~f~~r~~~~gv~~~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~ 152 (184) ..+.+..+ .... ..+...... -..+|.+.+ .-.|- ...|=...+.| T Consensus 83 ~~vg~~~~--------~~~~--~~~~~~~~~~~~~~~y~~f--------~EfGT--------~km~a~PFlrP------- 129 (164) T protein:vir:43 83 FRIGVLHG--------AVLP--KKGERSDKTANAPTPHWRL--------LEFGT--------EDMRAQPFMRS------- 129 (164) T ss_pred EEeccccc--------cccc--ccccccccCCCCCcceEEE--------eecCC--------CCCCCCcchhh------- Confidence 11111110 0000 000000000 011232221 11111 12232222222 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 
153 NNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.|.+.|.+||++.|.| T Consensus 130 -----A~~~~k~~~~~~~~~~l~~~i~ka~~k 156 (164) T protein:vir:43 130 -----ALADNIAEVTSTFVSEYEKGIDRAIKR 156 (164) T ss_pred -----hHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 234444555566666667777777766 No 17 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=94.60 E-value=0.001 Score=36.93 Aligned_cols=166 Identities=14% Similarity=0.076 Sum_probs=76.3 Q ss_pred HHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec-ceeee Q lcl|NC_021560. 8 KDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG-WIPLQ 86 (184) Q Consensus 8 ~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~-~i~l~ 86 (184) =+++.+++....|. ...++++.+|+.++...+=..+..++.++..=. --|+.-.+.++-. . ...+.+ ...-+ T Consensus 1 ~~v~~l~~~~~~L~-~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~-~~i~~~~ir~r~~----~-~kas~~~l~a~I 73 (184) T protein:vir:39 1 MSLKGLEQAIENLN-SISKTAVPRASAQAVNRVANRAVSRSVAVVSKD-TRVPRKLVKQRAR----V-KRATVNKPRALI 73 (184) T ss_pred CchHHHHHHHHHHh-ccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCHHHHHhhhe----e-cccCCCCeEEEE Confidence 99999999999996 444788999999999888776666665554421 2222222222211 0 011111 11111 Q ss_pred ecC--CCCCCCcceEecc----cccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCc---hhHhhcCchHH Q lcl|NC_021560. 87 RLG--AVQNATGVYAKLR----GSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAAN---PAHAITNNPDV 157 (184) Q Consensus 87 ~f~--~r~~~~gv~~~~~----~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gps---i~~m~~~~~~~ 157 (184) ... .-.--+-.....+ .....+.-+....| ...| .+.......+++.-|-+.-|-+ |......-+.. 
T Consensus 74 ~~~~~~i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g-~~~~---~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~ 149 (184) T protein:vir:39 74 RVNRGNLPAIKLGTASVRLSRRKRDKKGANSVLRIG-PFRF---PGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAP 149 (184) T ss_pred EEeccceeeeeccccccccCccccccccccceeeec-ceec---CcceeeecCCCceEEEEEecCcccceeEEEcCchHH Confidence 111 0000000000000 00000111111111 1122 1112222233455665444433 22211121233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ..+.+++++++.|+++|+|||++.|.+ T Consensus 150 ~~e~~~~~~~~~~~~~~~~el~~~l~~ 176 (184) T protein:vir:39 150 LTTAFKEELPKLMESDMPKELRASLTN 176 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 456777788888888888888777777 No 18 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=94.54 E-value=0.001 Score=37.02 Aligned_cols=137 Identities=18% Similarity=0.178 Sum_probs=65.9 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |- +++.++++++.+.|..|+.....+++..||..++.-+...+.+.+...+ ..++ ++.+......+....+.+. 
T Consensus 1 Ma-~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~t----G~l~~sI~~~~~~~~~~~~~~~~~ 75 (140) T protein:vir:10 1 MS-SVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKT----GKLKRNIVTAALKQKDSPGIATAG 75 (140) T ss_pred Cc-eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCh----hhHHHhceecccccccccceeEEe Confidence 43 5667799999999999986664468888888888888777766654332 3333 2222111111111111111 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) -+ -...+. .+....+|.+.+ --.|. +..|=.....| .+ T Consensus 76 ~~----------~~~~~~----~~~~~~~~y~~f--------~E~GT--------~~~~a~PFl~p------------A~ 113 (140) T protein:vir:10 76 VR----------VRTKGK----ADSPNNAFYWRF--------VELGT--------QFMKAEPFMRP------------AF 113 (140) T ss_pred ec----------cccccc----cCCCCcccccce--------eccCc--------CCCCCCcchhh------------hH Confidence 00 000000 001112222211 11111 11122211112 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) +.-.+.+.+.+.+.|..||++++.+ T Consensus 114 ~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 114 DASIAQAEGAIRTEIARAIDQVVGG 138 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc Confidence 4445566666777777777777777 No 19 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=94.44 E-value=0.0019 Score=35.45 Aligned_cols=126 Identities=12% Similarity=0.125 Sum_probs=59.1 Q ss_pred CeE--EeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhh-hhhe------ecccCC Q lcl|NC_021560. 
1 MRL--EMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAA-LTTA------HFNAGG 71 (184) Q Consensus 1 m~i--~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~-~~~~------ka~~~~ 71 (184) |.= .||.++|+++.+.|..+.....++.+..+++.++..+. +.+.+.+-|....+++ ..+. .....+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~----~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g 76 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLL----GKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQG 76 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH----HHHHHhCCCcchhhcccccccccccccceeecC Confidence 543 78888999999999877554446777777777776664 4445556666555543 2211 111111 Q ss_pred cEEEEEE--eecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhH Q lcl|NC_021560. 72 NTSKVVV--ESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAH 149 (184) Q Consensus 72 ~~a~i~~--~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~ 149 (184) ...+|.+ +..--++..+|++.... ++.++|+| T Consensus 77 ~~~~v~v~n~~~YA~~VE~Ghr~~~~------~gfV~G~f---------------------------------------- 110 (141) T protein:vir:79 77 NNYIIEVVNPTEYASYVNFGHRTKDG------KGWVKGQH---------------------------------------- 110 (141) T ss_pred CeeEEEEecCCcchhhhhcceeecCC------cceeCCch---------------------------------------- Confidence 1222221 11112222233322111 00011111 Q ss_pred hhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 
150 AITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 150 m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) |+ +.-.+.++..|++.|+..|+.+|.+ T Consensus 111 ml--------~~s~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 111 FL--------TISEMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred hH--------HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 1122333444555555555555555 No 20 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=94.20 E-value=0.0025 Score=34.81 Aligned_cols=135 Identities=19% Similarity=0.179 Sum_probs=67.4 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecc--cCCcEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFN--AGGNTSKVV 77 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~--~~~~~a~i~ 77 (184) |- +|+.++|+++.+.|..|+..+..+++..|+..++.-+...+.+.+...+| .++ ++.+..-. .....+.+. T Consensus 1 Ma-~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG----~l~~~i~~~~~~~~~~~~~~~~~ 75 (140) T protein:vir:80 1 MS-SIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTG----KLRRNIVSAALRQKDAPGLATAG 75 (140) T ss_pred Cc-eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc----hhhhceeeeccccccccceeeee Confidence 43 56667999999999999866655688888888888887776665433322 222 12111100 011111111 Q ss_pred EeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 78 VESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 78 ~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +..++.. +.+.-..+|.+. |.=.|. 
+..|=...+.| T Consensus 76 ~~~~~~~----------------~~~~~~~~~y~~--------f~E~GT--------~~~~a~PFl~p------------ 111 (140) T protein:vir:80 76 VRVRTKG----------------KADSPSNAFYWR--------FDEFGT--------QHMKAQPFMRP------------ 111 (140) T ss_pred eeccccc----------------ccCCCCCcceee--------eeccCC--------CCCCCCcchhh------------ Confidence 1111000 000001122211 111111 11222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.+.+.|.++|.+.|++ T Consensus 112 A~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:80 112 AFDASIGEAEGAIRTELARAIDQALGG 138 (140) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 245556677777888888899999998 No 21 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=93.95 E-value=0.003 Score=34.41 Aligned_cols=147 Identities=15% Similarity=0.072 Sum_probs=74.7 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe--eccc--CCcEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA--HFNA--GGNTSK 75 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~--ka~~--~~~~a~ 75 (184) |+++|..-+|..+.+.|..|+... .++++.|+.+++.-++.++...+. .+...++ ++.+. +... +.-++. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~-~~v~R~A~~~ga~vv~dear~~aP----~~tG~LkksI~~~~~~~~s~~g~~~~~ 75 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHS-SDVVRTMTYESAVAVRESAKAFVN----DETGKLRNNLYVAYSPEESVEGIQTYA 75 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCC----CCcchhhhheeeeeccccCCCceEEEE Confidence 999998888889999999997544 577888888887777766654443 3344443 34332 1111 112233 Q ss_pred EEEeeccee---eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhc Q lcl|NC_021560. 
76 VVVESGWIP---LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAIT 152 (184) Q Consensus 76 i~~~g~~i~---l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~ 152 (184) |..+.+..| +..||.......+. .=+|+|+.... +.+. ....| |+-|- T Consensus 76 Vg~~~~~a~~g~~vEfG~~~~~~~~~-----~~~~~~~~~~~--------~~~t-------~~~~P---------a~PFl 126 (157) T protein:vir:97 76 VSWRKKAAPHGHLLEFGHWQTHAAYR-----DKDGQWYSSKV--------KLVN-------PKWIP---------AKPFL 126 (157) T ss_pred EeecCCccceeeeeecCccccccccc-----CCccccccccc--------ccCC-------CCcCC---------CCccc Confidence 444433322 22233211100000 00222222110 0000 00112 22222 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 153 NNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) . ..++.-.+.+.+.+..+|.++|..+|.= T Consensus 127 R---PA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 127 R---PGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred c---hHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 1 1356666777777888888899888877 No 22 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=93.77 E-value=0.0038 Score=33.87 Aligned_cols=129 Identities=16% Similarity=0.176 Sum_probs=53.3 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecc-cCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFN-AGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~-~~~~~a~i~~ 78 (184) |+++ .++|+++.+.|..|+..+..++...||.+++.-+...+...+...++-..+.++ .+.+.... .+...+.+. 
T Consensus 2 ~~~~--i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~- 78 (133) T protein:vir:10 2 IRME--VKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVT- 78 (133) T ss_pred eeEe--eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEE- Confidence 5555 558999999999998665445677888888877766655543322221112222 12111000 001111111 Q ss_pred eecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 79 ESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) +..+..-...| +..|.-.|- +..|=...+.|+ T Consensus 79 --------------------v~vg~~~~~~~--------y~~f~E~GT--------~k~~a~PF~~pA------------ 110 (133) T protein:vir:10 79 --------------------LRVGPSKQHHM--------KVLAQEFGT--------VKQVADPFIRPA------------ 110 (133) T ss_pred --------------------EEecCCCCccc--------eEeeeccCC--------CCCCCCccchHH------------ Confidence 11110001111 112221121 122222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 159 LDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ++.-.+.+.+.|.+ |+.+.|.| T Consensus 111 ~~~~~~~~~~~~~~----~~~~~l~K 132 (133) T protein:vir:10 111 LDYNVQTVLRVLTV----EIRNGIQN 132 (133) T ss_pred HHHhHHHHHHHHHH----HHHHHhhc Confidence 23333333333433 44444444 No 23 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=93.71 E-value=0.0042 Score=33.63 Aligned_cols=134 Identities=16% Similarity=0.172 Sum_probs=68.0 Q ss_pred eEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheec--ccCCcEEEEEE Q lcl|NC_021560. 
2 RLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHF--NAGGNTSKVVV 78 (184) Q Consensus 2 ~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka--~~~~~~a~i~~ 78 (184) =++|+.++++++.+.|..|+..+..+++..||..++.-+...+.+.+... ...++ ++.+... ..+.....+.+ T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~----tG~l~~sI~~~~~~~~~~~~~~~vg~ 76 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKK----TGKLRRNIVSAALRQKDAPGLATAGV 76 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----hhhHHhhcccccccccccceeEEeee Confidence 12566669999999999998776556778888888888877766554322 23333 2222111 11111111111 Q ss_pred eecceeeeecCCCCCCCcceEecccccCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 79 ESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) .++. +..+... ..+|.+.+ -.|. +..|=...+.| T Consensus 77 ~~~~-------------~~~~~~~---~~~~y~~f~E~GT-----------------~~~~a~pFl~p------------ 111 (140) T protein:vir:14 77 RVRT-------------KGKADSP---NNAFYWRFDEFGT-----------------QHMKAQPFMRP------------ 111 (140) T ss_pred eecc-------------ccccCCC---Cccceeeeecccc-----------------CCCCCCcchhH------------ Confidence 1000 0000000 11122111 1111 12222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 
158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.+.+.+..+|++.|++ T Consensus 112 a~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:14 112 AFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 245556677788888889999999999 No 24 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=93.48 E-value=0.002 Score=35.36 Aligned_cols=117 Identities=12% Similarity=0.164 Sum_probs=55.2 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe--ecccCCcEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA--HFNAGGNTSKVV 77 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~--ka~~~~~~a~i~ 77 (184) |+|+| ++++++.+.|..++..+. +++.+|+.+++..+...+.... .+.-..++ ++.+. +.+.+.+.+.|. T Consensus 5 ~~i~~--~Gld~l~~~L~~~~~~~~-~~v~~al~~~a~~i~~~ak~~a----p~~tG~L~~sI~~~~~~~~~~~~~~~v~ 77 (125) T protein:vir:94 5 FNIKF--KGVDKLLDEFDISRKELV-PYSVEAMKTSLSRAVEKSKGLA----RVDTGYMRNNIQQDEVKEEHGVVTGRYV 77 (125) T ss_pred eeeee--hhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHhhC----CCCChhhhhhceecceeccCCcEEEEee Confidence 66666 489999999999987774 6778888888777665543332 23223333 22221 112222222221 Q ss_pred EeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 78 VESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 78 ~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) .+.. ...|.-.|- +..|-+..+.|+.. 
T Consensus 78 ~~~~------------------------------------Ya~~vEfGT--------~~~~a~Pfl~pa~~--------- 104 (125) T protein:vir:94 78 ARAD------------------------------------YSSYNEYGT--------YRMSAQPFMAPSVA--------- 104 (125) T ss_pred CCCC------------------------------------ccceeeccc--------ccCCCCcccchhHH--------- Confidence 1111 111111111 12233333333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .. ...|.+.|+.+|.+.+-+ T Consensus 105 ---~~----~~~~~~~l~~~l~~a~k~ 124 (125) T protein:vir:94 105 ---AM----TPFFYKAVRDALNKAAKF 124 (125) T ss_pred ---HH----HHHHHHHHHHHHHHHhcc Confidence 12 233444555555555555 No 25 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=93.28 E-value=0.0046 Score=33.37 Aligned_cols=134 Identities=19% Similarity=0.206 Sum_probs=68.3 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheeccc--CCcEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNA--GGNTSKVV 77 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~--~~~~a~i~ 77 (184) |- +|+.++|+++.+.|..|+..+..+++..||...+.-+...+...+...+ ..++ ++.+..... ......+. T Consensus 1 Ma-~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~t----G~l~~sI~~~~~~~~~~~~~~~~g 75 (140) T protein:vir:10 1 MS-SIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKT----GKLRRNIVSAALRQKDAPGLATAG 75 (140) T ss_pred Cc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCh----hhHHHhccccccccccccceEEee Confidence 32 5666799999999999986654467888888888877777666554333 3333 232221111 11111111 Q ss_pred EeecceeeeecCCCCCCCcceEecccccCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 
78 VESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 78 ~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +..+. +..+... ..+|.+.+ -+|+ +..|=...+.| T Consensus 76 ~~~~~-------------~~~~~~~---~~~~y~~f~E~GT-----------------~~~~a~PFl~p----------- 111 (140) T protein:vir:10 76 VRVRT-------------KGKADSP---NNAFYWRFDEFGT-----------------QHMKAQPFMRP----------- 111 (140) T ss_pred eeecc-------------ccccCCC---CccceeeeeccCC-----------------CCCCCCcchhh----------- Confidence 11100 0000000 11222211 1121 11222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 157 VYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.+.+.+..+|.+.|++ T Consensus 112 -A~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 112 -AFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 234555677777888888899999999 No 26 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=92.75 E-value=0.0072 Score=32.34 Aligned_cols=142 Identities=11% Similarity=0.115 Sum_probs=69.0 Q ss_pred Ce--EEeeHHHHHHHHHHHhhccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEE Q lcl|NC_021560. 1 MR--LEMNSKDFEELERAFRRLPG-EIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVV 77 (184) Q Consensus 1 m~--i~id~~~l~~~~~~L~~l~~-~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~ 77 (184) |. |+|+.++|+++.+.|..|.. ....++.+.||.+++.-++..+...+...-. 
+.+..+ ....+.+.+.- T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~-~~~~~~---~~~~~~~~~~d--- 73 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDD-NSKSGR---KGSRPPGHAAN--- 73 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCC-cccccc---ccccccchhhh--- Confidence 53 77888899999999999963 3335788888888888777666555432110 000000 00000001100 Q ss_pred EeecceeeeecCCCCCCCcce--EecccccC--cceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcC Q lcl|NC_021560. 78 VESGWIPLQRLGAVQNATGVY--AKLRGSYR--HAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITN 153 (184) Q Consensus 78 ~~g~~i~l~~f~~r~~~~gv~--~~~~~~~~--gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~ 153 (184) .|...+. +.+.|.. ..|+..-+ .+|.+.+ .-.| ....|=...+.| T Consensus 74 ----~i~~~~~---~~~~g~~~~~VG~~~~~~~~~~y~~f--------~E~G--------T~k~~a~pF~~p-------- 122 (149) T protein:vir:13 74 ----NIPEPKI---RKKKGNLQCVVGWEKSDNTPFYYMKM--------EEWG--------TSERPPHHAFGK-------- 122 (149) T ss_pred ----cceeccc---ccccceeEEEeeccCCCCCccceeee--------eccC--------ccCCCCCccchH-------- Confidence 1222222 1122221 11222111 1233221 1111 112222222222 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 154 NPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .++.-.+.+.+.|.+.|..+|++.||- T Consensus 123 ----a~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 123 ----TNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 245556677777888888899999999 No 27 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=92.15 E-value=0.0033 Score=34.21 Aligned_cols=112 Identities=13% Similarity=0.167 Sum_probs=56.0 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccH--HHHhh-hhheecccCCcEEEEEEeec Q lcl|NC_021560. 
5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPR--ELVAA-LTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~--k~ik~-~~~~ka~~~~~~a~i~~~g~ 81 (184) |..++|+++.+.|..++..+ .+++.+||.+++..+...+.+.....++.|- ..++. +.+. ..+.+++.|. T Consensus 1 i~i~Gld~L~~~l~~~~~~~-~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~--~~g~~~~~v~---- 73 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDI-EDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVK--KIGDLHYRVI---- 73 (115) T ss_pred CeehhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeee--ecCcEEEEee---- Confidence 77789999999999998666 5688899998888887777666555444442 22331 1111 1111222221 Q ss_pred ceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHH Q lcl|NC_021560. 82 WIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDV 161 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~ 161 (184) .++| ...|--.|. +..|-+..+.|+..+ T Consensus 74 ------------------------~~~~--------Ya~~vEfGT--------~km~a~PFl~PA~~~------------ 101 (115) T protein:vir:10 74 ------------------------STAH--------YSGFLEFGT--------RYMEPAPFMFPTYQT------------ 101 (115) T ss_pred ------------------------CCCc--------cchheeccc--------ccCCCCCchhhhHHH------------ Confidence 1222 122222221 223333333333321 Q ss_pred HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 162 LAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 162 ~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) -...+..+|.+++. 
T Consensus 102 --------~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 102 --------LKKSTINDLKRLLS 115 (115) T ss_pred --------HHHHHHHHHHHHhC Confidence 12223334444444 No 28 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=92.10 E-value=0.0035 Score=34.06 Aligned_cols=115 Identities=10% Similarity=0.088 Sum_probs=55.7 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |..++|+++.+.|..++..+ .+++..||.+++..+...+.......++.|. .++.|.-+|...- T Consensus 1 i~i~Gld~L~~~l~~~~~~~-~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~SI~~~~---- 64 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYKK---- 64 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccccCCCC-----------cchhhhhceeeee---- Confidence 77789999999999998666 5788999999888888887666554444442 1222322232210 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ..|..+.. ..+++. 
..|--.|- +..|-+..+.|+..+ ....+.+ T Consensus 65 ---------~g~~~~~V---~~~~~Y--------a~~vE~GT--------~~m~a~PFl~PA~~~--------~k~~~~~ 108 (115) T protein:vir:99 65 ---------TVDLQYTI---TSHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRKSTVE 108 (115) T ss_pred ---------cCcEEEEe---cCCccc--------cccccccc--------cccCCCCcchhhHHH--------HHHHHHH Confidence 01111111 112221 12221121 123333333333321 1122222 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) +|.+++- T Consensus 109 ------------~l~~~~k 115 (115) T protein:vir:99 109 ------------ELKTLFE 115 (115) T ss_pred ------------HHHHHhC Confidence 3333333 No 29 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=92.00 E-value=0.0076 Score=32.20 Aligned_cols=129 Identities=12% Similarity=0.091 Sum_probs=52.6 Q ss_pred EEeeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 3 LEMNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |++++.+++++++.|.. +......+...+||+.++..+ ...+.+. +.+++-+. ...-++ T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v----~~~lK~~----------~~~fkDTG-ati~ev----- 60 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLV----AKTLKSE----------FVQFKDTG-ASIDEI----- 60 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHH----HHHHHHh----------hcchhccc-ceeeeE----- Confidence 66666789999998876 544444455555555555444 3333322 22222111 111111 Q ss_pred ceeeeecCCCCCCCcceE--ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 
82 WIPLQRLGAVQNATGVYA--KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~~--~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) .+..+.+..|+.. .+|.. -...=|--+ .-+| |.|-|.... |.=+| + T Consensus 61 -----~~s~p~~~~G~r~V~i~W~gp~~R~~iVHL--NE~G-Ytr~Gk~i~------------------PrG~G-----~ 109 (133) T protein:vir:78 61 -----NIEKPSYDKGVRSIKIDWKGPKDRYKIIHL--NEYG-YTRNGKKIT------------------PAGTG-----S 109 (133) T ss_pred -----EecCeeeeCCceEEEEEEecCCCceeEEEe--eccc-eecCCCeEc------------------cchhh-----H Confidence 1111111222111 12211 000000000 0111 233222111 22222 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021560. 159 LDVLAGVIEDYFFPRIVHEIERLL 182 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~r~L 182 (184) .+...+..+..|.+-+..||.++| T Consensus 110 i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 110 VARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred HHHHHHhhhHHHHHHHHHHHHhhC Confidence 333444556677778888888888 No 30 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=91.91 E-value=0.0027 Score=34.68 Aligned_cols=113 Identities=7% Similarity=0.035 Sum_probs=49.2 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |.|+|+ +++++.+.|..++..+ .+.+.+||..++..+..++.... -+.-+.++ ++. 
.+.+++++.|.++ T Consensus 1 msi~i~--Gld~l~~~l~~~~~~~-~~~v~~al~~~a~~i~~~ak~~a----Pv~TG~Lr~sI~---~~~~g~~~~V~~~ 70 (114) T protein:vir:95 1 MAIKWQ--GIEKLVATISNAQPKA-VEQSLQVLKNNGEKGKRIAKQLA----PKDTEFLKDHIT---TSYPGMEAHIHGE 70 (114) T ss_pred Ceeeee--hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC----CcCchhhhhcee---eecCceEEEeecC Confidence 766665 8999999999998766 45778888887777655443332 22222232 111 1111222222111 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) . +.+.|--.|- +..|-+..+.|+..+ .. T Consensus 71 ~------------------------------------~Ya~yvE~GT--------~~~~aqPfl~pa~~~--------~~ 98 (114) T protein:vir:95 71 A------------------------------------GYDGYQEYGT--------RFQPGTPHFRPMMEQ--------IQ 98 (114) T ss_pred C------------------------------------CccceeecCc--------cccCCCccchhhHHH--------HH Confidence 1 1112211111 112333223333211 12 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIVHEIE 179 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~ 179 (184) ..+.+. |.+.|..++. T Consensus 99 ~~~~~~----l~~~l~~~~k 114 (114) T protein:vir:95 99 PQFQKD----MTDVMKGAFK 114 (114) T ss_pred HHHHHH----HHHHHHhhcC Confidence 222222 2222222333 No 31 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=91.82 E-value=0.0073 Score=32.31 Aligned_cols=111 Identities=12% Similarity=0.268 Sum_probs=47.9 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |.++|+.++++++.+.|..+.. .+++.+||..++..+..++.. ...+.-+.++ ++.+ ..+.+..++.|..+ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~---~~~~~~al~~~~~~i~~~ak~----~aPvdTG~Lr~si~~-~~~~~~~~~~V~~~ 72 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS---LKGVQQVVKSNTSNMTANMQK----LVPVDTGYMKRSIKM-ELTEGGFSGQAGPH 72 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh---HHHHHHHHHHHHHHHHHHHHH----hCCCCchhhhhceee-eecCCceEEEeecC Confidence 9999999999999999987653 356788888887766555543 2333333333 2211 11111111111111 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) .. | +.|+ -.|+ ++.|-.-.+-|+. T Consensus 73 ~~-----------------------Y-a~~v---E~GT-----------------~k~~a~Pfl~pa~------------ 96 (112) T protein:vir:36 73 TD-----------------------Y-SAYV---EYGT-----------------RFQSAQPFVKPAY------------ 96 (112) T ss_pred CC-----------------------c-ccee---eccc-----------------cccCCCcchhhhH------------ Confidence 00 0 1111 1121 1122221111211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) +.. 
...|.++|+ ++|- T Consensus 97 ~~~----~~~~~~~i~----~~lr 112 (112) T protein:vir:36 97 NEQ----KGVFIKDLE----RLLK 112 (112) T ss_pred HHH----HHHHHHHHH----HHcC Confidence 111 112222222 2222 No 32 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=91.64 E-value=0.0036 Score=33.96 Aligned_cols=115 Identities=10% Similarity=0.086 Sum_probs=53.8 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |+.++|+++.+.|..++..+ .+++..||.+++..+..++.+.....++.|. .++.|..+|.+. T Consensus 1 i~~~Gld~l~~~l~~~~~~~-~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~sI~~~----- 63 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYK----- 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccCCCCC-----------Cchhhhhcceee----- Confidence 77789999999999998776 4688899999888887776665544443331 122233333221 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ...|..+... .+++. ..|--.|- +..|=+..+.|+..+ ... 
T Consensus 64 --------~~g~~~~~v~---~~~~Y--------a~~vE~GT--------~km~a~Pfl~PA~~~--------~~~---- 104 (115) T protein:vir:10 64 --------KTGDLQYTIT---SHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRK---- 104 (115) T ss_pred --------ecCceEEEee---cCccc--------hhhhcccc--------cccCCCCchhhhHHH--------HHH---- Confidence 0111111111 11111 11111111 223333333343321 112 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) .|.+ +|.+++- T Consensus 105 ----~~~~----~i~~~~k 115 (115) T protein:vir:10 105 ----STVE----ELKALFE 115 (115) T ss_pred ----HHHH----HHHHHhC Confidence 2222 2222222 No 33 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=91.64 E-value=0.0036 Score=33.96 Aligned_cols=115 Identities=10% Similarity=0.086 Sum_probs=53.8 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |+.++|+++.+.|..++..+ .+++..||.+++..+..++.+.....++.|. .++.|..+|.+. T Consensus 1 i~~~Gld~l~~~l~~~~~~~-~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~sI~~~----- 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYK----- 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccCCCCC-----------Cchhhhhcceee----- Confidence 77789999999999998776 4688899999888887776665544443331 122233333221 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 
85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ...|..+... .+++. ..|--.|- +..|=+..+.|+..+ ... T Consensus 64 --------~~g~~~~~v~---~~~~Y--------a~~vE~GT--------~km~a~Pfl~PA~~~--------~~~---- 104 (115) T protein:vir:96 64 --------KTGDLQYTIT---SHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRK---- 104 (115) T ss_pred --------ecCceEEEee---cCccc--------hhhhcccc--------cccCCCCchhhhHHH--------HHH---- Confidence 0111111111 11111 11111111 223333333343321 112 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) .|.+ +|.+++- T Consensus 105 ----~~~~----~i~~~~k 115 (115) T protein:vir:96 105 ----STVE----ELKALFE 115 (115) T ss_pred ----HHHH----HHHHHhC Confidence 2222 2222222 No 34 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=91.64 E-value=0.0036 Score=33.96 Aligned_cols=115 Identities=10% Similarity=0.086 Sum_probs=53.8 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |+.++|+++.+.|..++..+ .+++..||.+++..+..++.+.....++.|. .++.|..+|.+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~-~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~sI~~~----- 63 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYK----- 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccCCCCC-----------Cchhhhhcceee----- Confidence 77789999999999998776 4688899999888887776665544443331 122233333221 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ...|..+... .+++. ..|--.|- +..|=+..+.|+..+ ... T Consensus 64 --------~~g~~~~~v~---~~~~Y--------a~~vE~GT--------~km~a~Pfl~PA~~~--------~~~---- 104 (115) T protein:vir:78 64 --------KTGDLQYTIT---SHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRK---- 104 (115) T ss_pred --------ecCceEEEee---cCccc--------hhhhcccc--------cccCCCCchhhhHHH--------HHH---- Confidence 0111111111 11111 11111111 223333333343321 112 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) .|.+ +|.+++- T Consensus 105 ----~~~~----~i~~~~k 115 (115) T protein:vir:78 105 ----STVE----ELKALFE 115 (115) T ss_pred ----HHHH----HHHHHhC Confidence 2222 2222222 No 35 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=91.64 E-value=0.0036 Score=33.96 Aligned_cols=115 Identities=10% Similarity=0.086 Sum_probs=53.8 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 
5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |+.++|+++.+.|..++..+ .+++..||.+++..+..++.+.....++.|. .++.|..+|.+. T Consensus 1 i~~~Gld~l~~~l~~~~~~~-~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~sI~~~----- 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYK----- 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccCCCCC-----------Cchhhhhcceee----- Confidence 77789999999999998776 4688899999888887776665544443331 122233333221 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ...|..+... .+++. ..|--.|- +..|=+..+.|+..+ ... T Consensus 64 --------~~g~~~~~v~---~~~~Y--------a~~vE~GT--------~km~a~Pfl~PA~~~--------~~~---- 104 (115) T protein:vir:96 64 --------KTGDLQYTIT---SHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRK---- 104 (115) T ss_pred --------ecCceEEEee---cCccc--------hhhhcccc--------cccCCCCchhhhHHH--------HHH---- Confidence 0111111111 11111 11111111 223333333343321 112 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 
165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) .|.+ +|.+++- T Consensus 105 ----~~~~----~i~~~~k 115 (115) T protein:vir:96 105 ----STVE----ELKALFE 115 (115) T ss_pred ----HHHH----HHHHHhC Confidence 2222 2222222 No 36 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=91.64 E-value=0.0036 Score=33.96 Aligned_cols=115 Identities=10% Similarity=0.086 Sum_probs=53.8 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |+.++|+++.+.|..++..+ .+++..||.+++..+..++.+.....++.|. .++.|..+|.+. T Consensus 1 i~~~Gld~l~~~l~~~~~~~-~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~sI~~~----- 63 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYK----- 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccCCCCC-----------Cchhhhhcceee----- Confidence 77789999999999998776 4688899999888887776665544443331 122233333221 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ...|..+... .+++. ..|--.|- +..|=+..+.|+..+ ... 
T Consensus 64 --------~~g~~~~~v~---~~~~Y--------a~~vE~GT--------~km~a~Pfl~PA~~~--------~~~---- 104 (115) T protein:vir:97 64 --------KTGDLQYTIT---SHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRK---- 104 (115) T ss_pred --------ecCceEEEee---cCccc--------hhhhcccc--------cccCCCCchhhhHHH--------HHH---- Confidence 0111111111 11111 11111111 223333333343321 112 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) .|.+ +|.+++- T Consensus 105 ----~~~~----~i~~~~k 115 (115) T protein:vir:97 105 ----STVE----ELKALFE 115 (115) T ss_pred ----HHHH----HHHHHhC Confidence 2222 2222222 No 37 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=91.64 E-value=0.0036 Score=33.96 Aligned_cols=115 Identities=10% Similarity=0.086 Sum_probs=53.8 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeeccee Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGWIP 84 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~i~ 84 (184) |+.++|+++.+.|..++..+ .+++..||.+++..+..++.+.....++.|. .++.|..+|.+. T Consensus 1 i~~~Gld~l~~~l~~~~~~~-~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~-----------~TG~Lr~sI~~~----- 63 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNI-DDDVDDILQENAKEYVVRAKLKAREVMNKGY-----------WTGNLSRNIRYK----- 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccCCCCC-----------Cchhhhhcceee----- Confidence 77789999999999998776 4688899999888887776665544443331 122233333221 Q ss_pred eeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHH Q lcl|NC_021560. 
85 LQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAG 164 (184) Q Consensus 85 l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~ 164 (184) ...|..+... .+++. ..|--.|- +..|=+..+.|+..+ ... T Consensus 64 --------~~g~~~~~v~---~~~~Y--------a~~vE~GT--------~km~a~Pfl~PA~~~--------~~~---- 104 (115) T protein:vir:93 64 --------KTGDLQYTIT---SHAAY--------SGFLEFGT--------RYMEAEPFMWPVYEV--------IRK---- 104 (115) T ss_pred --------ecCceEEEee---cCccc--------hhhhcccc--------cccCCCCchhhhHHH--------HHH---- Confidence 0111111111 11111 11111111 223333333343321 112 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 165 VIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 165 ~~~~~~~~rl~~Ei~r~L~ 183 (184) .|.+ +|.+++- T Consensus 105 ----~~~~----~i~~~~k 115 (115) T protein:vir:93 105 ----STVE----ELKALFE 115 (115) T ss_pred ----HHHH----HHHHHhC Confidence 2222 2222222 No 38 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=90.96 E-value=0.014 Score=30.81 Aligned_cols=139 Identities=11% Similarity=0.048 Sum_probs=65.8 Q ss_pred Ce--EEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MR--LEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~--i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~ 78 (184) |- ++|+.++++++.+.|..|+... .+++..||...+.-+...+...+....+-....+. 
......+.+...|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~---~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKS---EPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccc---cccccccccccccee Confidence 32 6677779999999999998775 57888888888777776666555433222111110 000001111111111 Q ss_pred eecceeeeecCCCCCCCc-ceEe-cccc--cCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcC Q lcl|NC_021560. 79 ESGWIPLQRLGAVQNATG-VYAK-LRGS--YRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITN 153 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~g-v~~~-~~~~--~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~ 153 (184) ...+...| ..+. ++.. -.++|.+.+ -+|. +..|=. |.+-. T Consensus 77 ----------~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------~~~~a~----PFl~p---- 121 (146) T protein:vir:10 77 ----------TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------SKMPAH----PFIEP---- 121 (146) T ss_pred ----------ccccccccceeEEeeeccCCCCCcceeeeeccCC-----------------CCCCCC----cchhH---- Confidence 11111122 1121 1111 123444332 1111 112211 11111 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021560. 154 NPDVYLDVLAGVIEDYFFPRIVHEIERLL 182 (184) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L 182 (184) .++.-.+.+.+.+.+.|.+||.+.| T Consensus 122 ----a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 ----GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ----HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 2344555666667777777777777 No 39 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=90.96 E-value=0.014 Score=30.81 Aligned_cols=139 Identities=11% Similarity=0.048 Sum_probs=65.8 Q ss_pred Ce--EEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEE Q lcl|NC_021560. 
1 MR--LEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~--i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~ 78 (184) |- ++|+.++++++.+.|..|+... .+++..||...+.-+...+...+....+-....+. ......+.+...|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~---~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKS---EPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccc---cccccccccccccee Confidence 32 6677779999999999998775 57888888888777776666555433222111110 000001111111111 Q ss_pred eecceeeeecCCCCCCCc-ceEe-cccc--cCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcC Q lcl|NC_021560. 79 ESGWIPLQRLGAVQNATG-VYAK-LRGS--YRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITN 153 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~g-v~~~-~~~~--~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~ 153 (184) ...+...| ..+. ++.. -.++|.+.+ -+|. +..|=. |.+-. T Consensus 77 ----------~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------~~~~a~----PFl~p---- 121 (146) T protein:vir:10 77 ----------TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------SKMPAH----PFIEP---- 121 (146) T ss_pred ----------ccccccccceeEEeeeccCCCCCcceeeeeccCC-----------------CCCCCC----cchhH---- Confidence 11111122 1121 1111 123444332 1111 112211 11111 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021560. 
154 NPDVYLDVLAGVIEDYFFPRIVHEIERLL 182 (184) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L 182 (184) .++.-.+.+.+.+.+.|.+||.+.| T Consensus 122 ----a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 ----GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ----HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 2344555666667777777777777 No 40 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=90.96 E-value=0.014 Score=30.81 Aligned_cols=139 Identities=11% Similarity=0.048 Sum_probs=65.8 Q ss_pred Ce--EEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MR--LEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~--i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~ 78 (184) |- ++|+.++++++.+.|..|+... .+++..||...+.-+...+...+....+-....+. ......+.+...|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~---~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKS---EPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccc---cccccccccccccee Confidence 32 6677779999999999998775 57888888888777776666555433222111110 000001111111111 Q ss_pred eecceeeeecCCCCCCCc-ceEe-cccc--cCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcC Q lcl|NC_021560. 79 ESGWIPLQRLGAVQNATG-VYAK-LRGS--YRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITN 153 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~g-v~~~-~~~~--~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~ 153 (184) ...+...| ..+. ++.. -.++|.+.+ -+|. +..|=. |.+-. 
T Consensus 77 ----------~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------~~~~a~----PFl~p---- 121 (146) T protein:vir:10 77 ----------TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------SKMPAH----PFIEP---- 121 (146) T ss_pred ----------ccccccccceeEEeeeccCCCCCcceeeeeccCC-----------------CCCCCC----cchhH---- Confidence 11111122 1121 1111 123444332 1111 112211 11111 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021560. 154 NPDVYLDVLAGVIEDYFFPRIVHEIERLL 182 (184) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L 182 (184) .++.-.+.+.+.+.+.|.+||.+.| T Consensus 122 ----a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 ----GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ----HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 2344555666667777777777777 No 41 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=90.96 E-value=0.014 Score=30.81 Aligned_cols=139 Identities=11% Similarity=0.048 Sum_probs=65.8 Q ss_pred Ce--EEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MR--LEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~--i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~ 78 (184) |- ++|+.++++++.+.|..|+... .+++..||...+.-+...+...+....+-....+. 
......+.+...|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~---~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKS---EPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccc---cccccccccccccee Confidence 32 6677779999999999998775 57888888888777776666555433222111110 000001111111111 Q ss_pred eecceeeeecCCCCCCCc-ceEe-cccc--cCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcC Q lcl|NC_021560. 79 ESGWIPLQRLGAVQNATG-VYAK-LRGS--YRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITN 153 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~g-v~~~-~~~~--~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~ 153 (184) ...+...| ..+. ++.. -.++|.+.+ -+|. +..|=. |.+-. T Consensus 77 ----------~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------~~~~a~----PFl~p---- 121 (146) T protein:vir:10 77 ----------TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------SKMPAH----PFIEP---- 121 (146) T ss_pred ----------ccccccccceeEEeeeccCCCCCcceeeeeccCC-----------------CCCCCC----cchhH---- Confidence 11111122 1121 1111 123444332 1111 112211 11111 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021560. 154 NPDVYLDVLAGVIEDYFFPRIVHEIERLL 182 (184) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L 182 (184) .++.-.+.+.+.+.+.|.+||.+.| T Consensus 122 ----a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 ----GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ----HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 2344555666667777777777777 No 42 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=88.36 E-value=0.02 Score=29.84 Aligned_cols=127 Identities=10% Similarity=0.068 Sum_probs=48.0 Q ss_pred EEeeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 
3 LEMNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |++++.+++++++.|.. +......+...+||+.++..+ ...+.+. +.+++-+.. . T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v----~~~lK~~----------~~~fkDTGa-t--------- 56 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFF----IKALKKE----------FESFKDTGA-S--------- 56 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHH----HHHHHhh----------hhhhhcccc-e--------- Confidence 66666789999999876 554444455555555555544 3332222 222221111 0 Q ss_pred ceeeeecCCCCCCCcce---E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 82 WIPLQRLGAVQNATGVY---A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~---~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +.-..+..+.+..|+. + .+|.. -...=|--+ .-+| |.|-|.... |.=+| T Consensus 57 -i~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHL--NE~G-ytr~Gk~i~------------------PrG~G---- 110 (133) T protein:vir:94 57 -IEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHL--NEHG-YTRDGKKYT------------------PRGFG---- 110 (133) T ss_pred -eeeEEecCeeeccCCcceeEEEEeecCCCceeEEEe--eccc-eecCCCeEc------------------cchhh---- Confidence 1111111122222210 1 12211 000000000 0011 222221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 
157 VYLDVLAGVIEDYFFPRIVHEIER 180 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r 180 (184) +.+...+..+..|.+-+..||.+ T Consensus 111 -~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 111 -VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred -HHHHHHHhhhHHHHHHHHHHhcC Confidence 12333344455566666667776 No 43 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=88.36 E-value=0.02 Score=29.84 Aligned_cols=127 Identities=10% Similarity=0.068 Sum_probs=48.0 Q ss_pred EEeeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 3 LEMNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |++++.+++++++.|.. +......+...+||+.++..+ ...+.+. +.+++-+.. . T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v----~~~lK~~----------~~~fkDTGa-t--------- 56 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFF----IKALKKE----------FESFKDTGA-S--------- 56 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHH----HHHHHhh----------hhhhhcccc-e--------- Confidence 66666789999999876 554444455555555555544 3332222 222221111 0 Q ss_pred ceeeeecCCCCCCCcce---E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 82 WIPLQRLGAVQNATGVY---A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~---~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +.-..+..+.+..|+. + .+|.. -...=|--+ .-+| |.|-|.... 
|.=+| T Consensus 57 -i~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHL--NE~G-ytr~Gk~i~------------------PrG~G---- 110 (133) T protein:vir:96 57 -IEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHL--NEHG-YTRDGKKYT------------------PRGFG---- 110 (133) T ss_pred -eeeEEecCeeeccCCcceeEEEEeecCCCceeEEEe--eccc-eecCCCeEc------------------cchhh---- Confidence 1111111122222210 1 12211 000000000 0011 222221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 157 VYLDVLAGVIEDYFFPRIVHEIER 180 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r 180 (184) +.+...+..+..|.+-+..||.+ T Consensus 111 -~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 111 -VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred -HHHHHHHhhhHHHHHHHHHHhcC Confidence 12333344455566666667776 No 44 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=88.36 E-value=0.02 Score=29.84 Aligned_cols=127 Identities=10% Similarity=0.068 Sum_probs=48.0 Q ss_pred EEeeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 3 LEMNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |++++.+++++++.|.. +......+...+||+.++..+ ...+.+. +.+++-+.. . T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v----~~~lK~~----------~~~fkDTGa-t--------- 56 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFF----IKALKKE----------FESFKDTGA-S--------- 56 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHH----HHHHHhh----------hhhhhcccc-e--------- Confidence 66666789999999876 554444455555555555544 3332222 222221111 0 Q ss_pred ceeeeecCCCCCCCcce---E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 
82 WIPLQRLGAVQNATGVY---A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~---~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +.-..+..+.+..|+. + .+|.. -...=|--+ .-+| |.|-|.... |.=+| T Consensus 57 -i~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHL--NE~G-ytr~Gk~i~------------------PrG~G---- 110 (133) T protein:vir:93 57 -IEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHL--NEHG-YTRDGKKYT------------------PRGFG---- 110 (133) T ss_pred -eeeEEecCeeeccCCcceeEEEEeecCCCceeEEEe--eccc-eecCCCeEc------------------cchhh---- Confidence 1111111122222210 1 12211 000000000 0011 222221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 157 VYLDVLAGVIEDYFFPRIVHEIER 180 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r 180 (184) +.+...+..+..|.+-+..||.+ T Consensus 111 -~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 111 -VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred -HHHHHHHhhhHHHHHHHHHHhcC Confidence 12333344455566666667776 No 45 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=88.36 E-value=0.02 Score=29.84 Aligned_cols=127 Identities=10% Similarity=0.068 Sum_probs=48.0 Q ss_pred EEeeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 3 LEMNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |++++.+++++++.|.. +......+...+||+.++..+ ...+.+. +.+++-+.. . 
T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v----~~~lK~~----------~~~fkDTGa-t--------- 56 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFF----IKALKKE----------FESFKDTGA-S--------- 56 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHH----HHHHHhh----------hhhhhcccc-e--------- Confidence 66666789999999876 554444455555555555544 3332222 222221111 0 Q ss_pred ceeeeecCCCCCCCcce---E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 82 WIPLQRLGAVQNATGVY---A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~---~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +.-..+..+.+..|+. + .+|.. -...=|--+ .-+| |.|-|.... |.=+| T Consensus 57 -i~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHL--NE~G-ytr~Gk~i~------------------PrG~G---- 110 (133) T protein:vir:78 57 -IEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHL--NEHG-YTRDGKKYT------------------PRGFG---- 110 (133) T ss_pred -eeeEEecCeeeccCCcceeEEEEeecCCCceeEEEe--eccc-eecCCCeEc------------------cchhh---- Confidence 1111111122222210 1 12211 000000000 0011 222221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 157 VYLDVLAGVIEDYFFPRIVHEIER 180 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r 180 (184) +.+...+..+..|.+-+..||.+ T Consensus 111 -~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 111 -VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred -HHHHHHHhhhHHHHHHHHHHhcC Confidence 12333344455566666667776 No 46 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=87.84 E-value=0.01 Score=31.44 Aligned_cols=136 Identities=11% Similarity=0.148 Sum_probs=63.5 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) | .++. .+++++.+.|..++.++ .+++.+|+.+++..+...+... ..+.-..++ ++.+ +...+++++.|..+ T Consensus 13 M-a~~~-~Gld~l~~~L~~~~~~~-~~~~~~al~~~a~~v~~~ak~~----aPvdTG~Lr~SI~~-~~~~~g~~~~V~~~ 84 (149) T protein:vir:94 13 M-AKVK-YGADSMVVELDKFDKKI-EEWVKKGIAKTTTKIYNTAVAL----APVDLGFLEESIDF-KYFDGGLSSVISVG 84 (149) T ss_pred H-HHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh----CCcccchhhcCeeE-EeeCCcEEEEEecC Confidence 4 2332 38999999999998766 5688888888887776655433 334445554 3433 23445677777776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||++.............++ |..... .+.+.+..| .|=...+-|+ T Consensus 85 ~~YA~~VE~GT~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~g----------~~a~PFl~pA------------- 136 (149) T protein:vir:94 85 ADYAIYVEYGTGIYATGPGGSRATKIP--WSFKGD---DGEWYTTYG----------QAPQPFWNPA------------- 136 (149) T ss_pred CCcccccccCccccccCCCcccccccc--ceeecC---ccceecCCC----------CCCCcchHHH------------- Confidence 665666678875432111111101111 111111 112222111 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 
160 DVLAGVIEDYFFPRIVHEIE 179 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~ 179 (184) + +...+.|.++ |+ T Consensus 137 --~-~~~~~~i~~~----i~ 149 (149) T protein:vir:94 137 --I-DAGRKTFEQY----FS 149 (149) T ss_pred --H-HHHHHHHHHh----hC Confidence 1 1112222222 22 No 47 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=86.63 E-value=0.038 Score=28.37 Aligned_cols=125 Identities=11% Similarity=0.117 Sum_probs=55.5 Q ss_pred eEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheeccc-CCcEEEEEEe Q lcl|NC_021560. 2 RLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNA-GGNTSKVVVE 79 (184) Q Consensus 2 ~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~-~~~~a~i~~~ 79 (184) =.+|+.++|+++.+.|..|+... .++...||..++.-+...+...+..... ..+.++ .+.+.+... .+....+.+ T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~~k~~ap~~~~-~tg~l~~~I~~~~~k~~~~g~~~v~V- 77 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDI-EKVEPVALKAGGEIIAERQRSHVNRSDK-KQPHMQDNITVSNVRESKDGVRFVAV- 77 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCC-ChhHHHHhhhccccccccCceeEEEE- Confidence 24566779999999999998766 5788888888888877766544331100 012222 222211110 011111111 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) |++. 
..+|.+.+ .-.|- ...|=.....|+ + T Consensus 78 ---------------------g~~~-~~~~y~~f--------~E~GT--------~~~~a~Pf~~pa------------~ 107 (127) T protein:vir:12 78 ---------------------GPNK-KVAYRGRF--------LEWGT--------SKMPPQPFIEKG------------G 107 (127) T ss_pred ---------------------eeCC-CCcceeee--------eccCc--------cCCCCCccchHh------------H Confidence 1111 12333221 11111 111211112221 2 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIVHEIE 179 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~ 179 (184) +.-.+.+.+.+.+.|..+|. T Consensus 108 ~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 108 KEGEGPAVELMERILTAPIK 127 (127) T ss_pred HHHHHHHHHHHHHHHHHhcC Confidence 33333444444444444444 No 48 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=86.63 E-value=0.039 Score=28.34 Aligned_cols=123 Identities=14% Similarity=0.110 Sum_probs=53.7 Q ss_pred EEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 3 LEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQ--MPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 3 i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~--ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |+|+.++++++.+.|..|+... .++...||..++.-+...+...+....+ -+.+.++. T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~-~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d------------------- 60 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGV-AKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRD------------------- 60 (128) T ss_pred CccchhhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhh------------------- Confidence 5556679999999999998766 4688888888777776555443211100 00011110 Q ss_pred cceeeeecCCCCCCCcc-eE-ecccccCcceeeec-CCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 
81 GWIPLQRLGAVQNATGV-YA-KLRGSYRHAFIAAM-KSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~gv-~~-~~~~~~~gaFia~~-~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) .|.+ +..+...|+ .+ .|++. ..+|.+.+ -+|. +..|=. |.+-. T Consensus 61 -~I~~---~~~k~~~g~~~~~VG~~k-~~~~y~~f~E~GT-----------------~k~~a~----pF~~p-------- 106 (128) T protein:vir:38 61 -DIKL---SSVRETSGLTEVDVGYGK-DTGWRAHFPNSGT-----------------SMQDPQ----HFIEE-------- 106 (128) T ss_pred -hhcc---ccccccCceeEEEeeecC-CCceEEeeeccCc-----------------cCCCCC----cchhH-------- Confidence 0111 111222221 11 23322 33455433 1121 111111 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIE 179 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~ 179 (184) .++.-.+.+.+.+.+.|.++|- T Consensus 107 a~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 107 TQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred HHHHhHHHHHHHHHHHHHhhcC Confidence 2233334444444444444443 No 49 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=86.57 E-value=0.035 Score=28.54 Aligned_cols=130 Identities=14% Similarity=0.145 Sum_probs=50.1 Q ss_pred EEeeHHHHHHHHHHHhhccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 3 LEMNSKDFEELERAFRRLPG-EIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~l~~-~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |+|+..+++++++.|...=+ ....+...+||+.++..+ ...+.+... 
+++-+ +...-.+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v----~~~~K~~~~----------~fkDT-G~t~~ev----- 60 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVI----VEEVKKQLK----------PSKDT-GALINEV----- 60 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHH----HHHHHhhhh----------hhhhc-cceeccE----- Confidence 55666689999998877511 222445555555555444 333332221 21111 1111111 Q ss_pred ceeeeecCCCCCCCcce-E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 82 WIPLQRLGAVQNATGVY-A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~-~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) .+..+.+..|+. + .+|+. -...|+--+ --+|-+++-.|.. | . |.=+| + T Consensus 61 -----~~s~p~~~~G~r~V~vgW~G~~~R~~iiHL--NE~Gytr~~~Gk~----------i----~---PrG~G-----~ 111 (134) T protein:vir:10 61 -----SFSKPEWINGKRTITVHWRGSKDRYKIVHL--IEYGHVQKGTGKF----------I----K---PKAMG-----G 111 (134) T ss_pred -----EecCeeecCCceEEEEEEEcCCceeEEEEe--ecccceecccCCc----------c----C---cchhh-----H Confidence 111111112211 0 12211 111222111 0111111110100 0 0 22222 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021560. 159 LDVLAGVIEDYFFPRIVHEIERL 181 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~r~ 181 (184) .+...+..++.+.+-+..||+++ T Consensus 112 i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 112 VNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred HHHHHHhhhHHHHHHHHHHHhcC Confidence 23334455667777788888888 No 50 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=86.57 E-value=0.035 Score=28.54 Aligned_cols=130 Identities=14% Similarity=0.145 Sum_probs=50.1 Q ss_pred EEeeHHHHHHHHHHHhhccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 
3 LEMNSKDFEELERAFRRLPG-EIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~l~~-~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |+|+..+++++++.|...=+ ....+...+||+.++..+ ...+.+... +++-+ +...-.+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v----~~~~K~~~~----------~fkDT-G~t~~ev----- 60 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVI----VEEVKKQLK----------PSKDT-GALINEV----- 60 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHH----HHHHHhhhh----------hhhhc-cceeccE----- Confidence 55666689999998877511 222445555555555444 333332221 21111 1111111 Q ss_pred ceeeeecCCCCCCCcce-E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 82 WIPLQRLGAVQNATGVY-A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~-~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) .+..+.+..|+. + .+|+. -...|+--+ --+|-+++-.|.. | . |.=+| + T Consensus 61 -----~~s~p~~~~G~r~V~vgW~G~~~R~~iiHL--NE~Gytr~~~Gk~----------i----~---PrG~G-----~ 111 (134) T protein:vir:95 61 -----SFSKPEWINGKRTITVHWRGSKDRYKIVHL--IEYGHVQKGTGKF----------I----K---PKAMG-----G 111 (134) T ss_pred -----EecCeeecCCceEEEEEEEcCCceeEEEEe--ecccceecccCCc----------c----C---cchhh-----H Confidence 111111112211 0 12211 111222111 0111111110100 0 0 22222 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021560. 
159 LDVLAGVIEDYFFPRIVHEIERL 181 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~r~ 181 (184) .+...+..++.+.+-+..||+++ T Consensus 112 i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 112 VNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred HHHHHHhhhHHHHHHHHHHHhcC Confidence 23334455667777788888888 No 51 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=86.02 E-value=0.032 Score=28.78 Aligned_cols=130 Identities=13% Similarity=0.149 Sum_probs=50.8 Q ss_pred EEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeecc Q lcl|NC_021560. 3 LEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESGW 82 (184) Q Consensus 3 i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~~ 82 (184) |++++.+++++++.|...=+ ++++.|..|+++..+=..+...+.+...+ ++-+ +.+.-.+. T Consensus 1 MsvevkGv~eil~~LE~k~g---~~~~~ri~dkAL~~age~v~~~~K~~~~~----------fkDT-Gati~ev~----- 61 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFG---IKEMVKVQDKALIAGAKVIVEEIKKQLKP----------SEDS-GALISEIG----- 61 (134) T ss_pred CeEEeecHHHHHHHHHHhhc---hhhhhhhhhHHHHHHhHHHHHHHHhhcCc----------cccc-cceeccEe----- Confidence 55566689999998876521 12344444444444434444443333222 2211 11111111 Q ss_pred eeeeecCCCCCCCcceE--ecccc-cCcceeeecCCCCeeeEE-ecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 83 IPLQRLGAVQNATGVYA--KLRGS-YRHAFIAAMKSGHVGAFR-RVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 83 i~l~~f~~r~~~~gv~~--~~~~~-~~gaFia~~~~g~~~vf~-R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) +..+....|+.. .+|+. ....++--+ --+|-+. |.|.. |. 
|.=+| + T Consensus 62 -----~s~p~~~~G~r~V~vgW~G~~~R~~ivHL--nE~Gyt~~r~Gk~-----------i~-------PrG~G-----~ 111 (134) T protein:vir:10 62 -----RTEPEWIKGKRTVTIRWRGPFERFRIVHL--IENGHVEKKSGKF-----------VK-------PKAMG-----G 111 (134) T ss_pred -----ecCeeecCCceEEEEEEEcCCceeeEEEe--eecceeecCCCCe-----------ec-------cchhh-----H Confidence 111111122211 12311 122222111 0011111 11110 00 22222 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021560. 159 LDVLAGVIEDYFFPRIVHEIERL 181 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~r~ 181 (184) .+...+..++.+.+-+..||.++ T Consensus 112 i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 112 INRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred HHHHHHhhhHHHHHHHHHHHhcC Confidence 33344455667777788888888 No 52 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=83.16 E-value=0.06 Score=27.30 Aligned_cols=127 Identities=11% Similarity=0.077 Sum_probs=45.7 Q ss_pred EEeeHHHHHHHHHHHhhc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEeec Q lcl|NC_021560. 3 LEMNSKDFEELERAFRRL-PGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVESG 81 (184) Q Consensus 3 i~id~~~l~~~~~~L~~l-~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g~ 81 (184) |++++.+++++++.|..- ......+...+||+.++..+ ...+.+. +.+++-+... T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v----~~~lK~~----------~~~fkDTGat---------- 56 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFF----IKALKKE----------FESFKDTGAS---------- 56 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHH----HHHHHhh----------hhhhhcccce---------- Confidence 566666888888888654 22233445555555555444 3333222 2222211110 Q ss_pred ceeeeecCCCCCCCcce---E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 
82 WIPLQRLGAVQNATGVY---A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 82 ~i~l~~f~~r~~~~gv~---~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +.-..+..+.+..|+. + .+|+. -...=|--+ .-+| |.|-|.... |.=+| T Consensus 57 -i~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHL--NE~G-ytr~Gk~i~------------------PrG~G---- 110 (133) T protein:vir:93 57 -IEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHL--NEHG-YTRDGKKYT------------------PRGFG---- 110 (133) T ss_pred -eeeEEecCeeeccCCcceEEEEEeecCCCceeEEEe--eccc-eecCCCeEc------------------cchhh---- Confidence 1111111111112210 0 12211 000000000 0011 222221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 157 VYLDVLAGVIEDYFFPRIVHEIER 180 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r 180 (184) +.+...+..+..|.+-+..||.+ T Consensus 111 -~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 111 -VIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred -HHHHHHHhhhHHHHHHHHHHhcC Confidence 12333344455566666667766 No 53 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=83.01 E-value=0.052 Score=27.63 Aligned_cols=165 Identities=12% Similarity=0.139 Sum_probs=84.6 Q ss_pred Ce-EEeeHHHHHHHHHH-HhhccchhHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHhcccHHHHh-hh-hhe Q lcl|NC_021560. 1 MR-LEMNSKDFEELERA-FRRLPGEIRTKAMRRAMTRLRQTARSRIV-----------ARLGPHTQMPRELVA-AL-TTA 65 (184) Q Consensus 1 m~-i~id~~~l~~~~~~-L~~l~~~~~~kA~~rAlnrt~~~~rt~~~-----------r~i~~~~~ik~k~ik-~~-~~~ 65 (184) |+ |+--.++|+.+.+. .-.-...++.++..+|+|+|+.|+++++. +.|++...+-...-. .+ -.. 
T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I 80 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARI 80 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEE Confidence 21 22223456665544 33333445567899999999999999873 335555554321111 11 112 Q ss_pred ecccCCcEEEEEEe--------ec----ceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccC Q lcl|NC_021560. 66 HFNAGGNTSKVVVE--------SG----WIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSAT 133 (184) Q Consensus 66 ka~~~~~~a~i~~~--------g~----~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~ 133 (184) ..+.+++.+.=... ++ .-+-...|...-..+=.+.......|-|... +|. .|.+-. T Consensus 81 ~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~R~--~gk----~R~PIe------ 148 (192) T protein:vir:79 81 RVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRI--DGK----NRYPID------ 148 (192) T ss_pred EEecCceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceEec--CCC----ccCCee------ Confidence 44444443321110 11 0111122222333333333233346667652 221 222221 Q ss_pred ccccceeeeecCchhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 134 GKREQIRELFAANPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 134 ~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) .-+.||.+. 
+.+.+.+ ++...+++++++.|...|.+||...|-| T Consensus 149 vvkIpis~~----l~~af~~---e~~r~~~~~~~~el~~~L~~qlr~~~~r 192 (192) T protein:vir:79 149 VVKIPLSGP----LTQAFED---ARDRIIAAEMPKQLGYALKQQLRLWLTR 192 (192) T ss_pred eEeechHHH----HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhhCC Confidence 236788543 3455533 3556778899999999999999999999 No 54 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=82.83 E-value=0.059 Score=27.33 Aligned_cols=136 Identities=13% Similarity=0.143 Sum_probs=62.6 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) | .++. .+++++.+.|..++.++ .+++.+|+.+++..+...+... .-+.-..++ ++.+ ....+.+++.|..+ T Consensus 13 M-a~v~-~Gld~l~~~l~~~~~~~-~~~~~~~l~~~a~~v~~~ak~~----aPvdTG~L~~SI~~-~~~~~g~~~~V~~~ 84 (149) T protein:vir:10 13 M-AKVK-YGADSMVVELDKFDKKI-EEWVKKGIAKTTTKIYNTAVAL----APVDLGFLEESIDF-KYFDGGLSSVISVG 84 (149) T ss_pred h-HHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh----CCcccchhhccceE-EecCCcEEEEEecC Confidence 4 2332 38999999999998766 4688888887777666655433 344445554 3433 23445677777776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||++........+.... ...| .. 
+..+.+.+..| .|=....-|++ T Consensus 85 ~~YA~~vE~GT~~~~~~~~~~~~~~-~~~~-~~---~~~~~~~~t~g----------~~a~PFl~pA~------------ 137 (149) T protein:vir:10 85 ADYAIYVEYGTGIYATGPGGSRATK-IPWS-FK---GDDGEWYTTYG----------QAPQPFWNPAI------------ 137 (149) T ss_pred CCcccccccCccccccCCccccccc-ccce-ee---ccccceecCCC----------CCCCcchhHHH------------ Confidence 6655666777754322111111000 1111 11 11122222211 11111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIVHEIE 179 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~ 179 (184) + .....|.+ .|+ T Consensus 138 ~----~~k~~i~~----~i~ 149 (149) T protein:vir:10 138 D----AGRKTFEQ----YFS 149 (149) T ss_pred H----HHHHHHHH----hhC Confidence 1 11222222 222 No 55 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=82.48 E-value=0.055 Score=27.47 Aligned_cols=122 Identities=16% Similarity=0.196 Sum_probs=57.2 Q ss_pred CeEEeeHHHH-HHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHH-hhhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSKDF-EELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELV-AALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~~l-~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~i-k~~~~~ka~~~~~~a~i~~ 78 (184) |-- |+.++| +++.+.|..++..+ ..++..|+..++..++..+.....+.+ ... +..++.+....+....+.. T Consensus 1 Ma~-i~id~la~~I~~~L~~y~~~v-~~~v~~~v~~~a~~~~~~ik~~aP~rT----G~y~ksw~vk~~~~~g~~~~vv~ 74 (126) T protein:vir:81 1 MAN-ITIDRLADELLQAVKEYTDDV-AEGVRKKVDETARKVLKEAQALAPKRT----GEYARTFTITKEDGYGTTKRIIW 74 (126) T ss_pred Ccc-cchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhCCccc----chhhccccccccccCCcceEEEe Confidence 543 666676 55888899998666 467777777777766655544433322 222 2333333222221111111 Q ss_pred eecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 
79 ESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) +...-+|.|| +-+||. .|-| +|.|= -|.+ T Consensus 75 ~~~~~~l~HL------------------------LEfGha---~r~g---------GrV~a-------~Phi-------- 103 (126) T protein:vir:81 75 NKKHYRRVHL------------------------LEFGHA---KVNG---------GRVKE-------YPHL-------- 103 (126) T ss_pred ccCCCCceee------------------------eeccee---cCCC---------CccCC-------Ccch-------- Confidence 1111111111 223332 1222 23221 1211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 159 LDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) . .+.+...++|+.+|.++|.- T Consensus 104 -~----Pa~e~~~~~~~~~i~~~l~~ 124 (126) T protein:vir:81 104 -R----PAYDKHGARLPDELKRVIEN 124 (126) T ss_pred -H----HHHHHHHHHHHHHHHHHhhc Confidence 1 12234555677788888878 No 56 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=77.53 E-value=0.088 Score=26.37 Aligned_cols=150 Identities=11% Similarity=0.046 Sum_probs=55.4 Q ss_pred CeEEeeHH-HHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----cHHHHhhhhh-eecccCCcEE Q lcl|NC_021560. 1 MRLEMNSK-DFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQM----PRELVAALTT-AHFNAGGNTS 74 (184) Q Consensus 1 m~i~id~~-~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~i----k~k~ik~~~~-~ka~~~~~~a 74 (184) |.+.|+.+ +++.+...|..+.+...+. ..+...+..+++.+.+....+..- +-..++..++ .+...+.... 
T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~~---~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~ 77 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRDR---AIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPG 77 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhccH---HHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCC Confidence 66666665 6667777776664433222 345566778888888888887433 2122211000 0111110001 Q ss_pred EEEEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCc Q lcl|NC_021560. 75 KVVVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNN 154 (184) Q Consensus 75 ~i~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~ 154 (184) .+-...+.+- ..+...-...+|.+ |- +.....+.-.|+..-.......+|-..+.|.+.. + T Consensus 78 ~~L~~tg~L~-~Si~~~~~~~~v~v-Gt------------~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s~~-----d 138 (156) T protein:vir:19 78 SILTLHGDLA-RSITTDYGQDYALI-GS------------PKIYAAIHQWGGTPDMAPRPAGVPARPYMGLDKT-----G 138 (156) T ss_pred cchhhhHHHH-HHhhheecCCEEEE-ec------------chhhhHHhhcCcccccCCCccccCCccccCCCHH-----H Confidence 1111111110 01111122223322 10 0000000001111000001124666666664421 1 Q ss_pred hHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 155 PDVYLDVLAGVIEDYFFP 172 (184) Q Consensus 155 ~~~~~~~~~~~~~~~~~~ 172 (184) +.+|.+.+.+.+.+.|.+ T Consensus 139 ~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 139 EQEIFDAIRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHHHHHHHHhhC Confidence 223333333333333333 No 57 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=76.91 E-value=0.039 Score=28.34 Aligned_cols=135 Identities=10% Similarity=0.169 Sum_probs=61.3 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) -+++ .+++++.+.|..++..+ .+++.+||.+++..+.. ++....-+.-..++ ++.+ ..+.+++++.|..+ T Consensus 2 a~~~---~G~~~l~~~l~~~~~~~-~~~~~~al~~~a~~i~~----~ak~~aPv~TG~Lr~SI~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:10 2 AKVK---YGNWDLVKELEEFEKET-IRWAKKGIAKTTTIIHN----SIVSNMPVDTGYLRESVSM-DFKKGGLTGVINIG 72 (137) T ss_pred ccch---hCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH----HHHHhCCcCcchhhcCeee-EecCCcEEEEEecC Confidence 2222 38889999999987655 45677777776655544 44444445556665 3433 23555677888777 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ...=+...||+............ ....+......| .|.+..| .|=...+-|+ T Consensus 73 ~~YA~~vE~GT~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~t~g----------~~a~Pfl~pA------------- 124 (137) T protein:vir:10 73 SEYAVYVNYGTGIYAVGPGGSRA--KNIPWRYKDADG---HWHTTKG----------QHAQPFWEPA------------- 124 (137) T ss_pred CccccccccCccccccCCCcccc--cccceeeecccc---ccccCCC----------CCCCcchhHH------------- Confidence 66666667876443211111110 011111111111 1111111 1111111111 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) + +.-.+.|.++|. 
T Consensus 125 --~-~~~~~~i~k~i~ 137 (137) T protein:vir:10 125 --I-DEGRAFFNKYFS 137 (137) T ss_pred --H-HHHHHHHHHhhC Confidence 1 112223333333 No 58 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=73.65 E-value=0.17 Score=24.83 Aligned_cols=107 Identities=7% Similarity=0.112 Sum_probs=45.2 Q ss_pred HHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEeecceee Q lcl|NC_021560. 7 SKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVESGWIPL 85 (184) Q Consensus 7 ~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~g~~i~l 85 (184) .++|+++.+.|..++..+ ..++.+||.+++..+...+.. ..-+.-+.++ ++.+.+ .+++.+.|..+..--+. T Consensus 1 i~Gld~l~~~l~~~~~~~-~~~v~~al~~~a~~i~~~ak~----~aPv~TG~Lr~sI~~~~--~~~~~~~v~~~~~Ya~~ 73 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSV-RIAVDKELSKSAARIERQAKI----LAPVDTGWLRAQIYSEQ--QRLLHYRVVSPALYSIY 73 (108) T ss_pred CchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh----cCCcCchhhhcceeeee--cCcEEEEeecCcccchh Confidence 789999999999998665 467788888777666544322 2222223333 222111 11222222211111111 Q ss_pred eecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHHHH Q lcl|NC_021560. 86 QRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLAGV 165 (184) Q Consensus 86 ~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~~ 165 (184) . -.|+. ..|-+..+.|+.. . T Consensus 74 v---------------------------E~GT~-----------------~m~a~Pf~~pa~~----------------~ 93 (108) T protein:vir:99 74 L---------------------------ELGTR-----------------KMEAQSFLDPALR----------------K 93 (108) T ss_pred c---------------------------ccCcc-----------------ccCCCcchhhhHH----------------H Confidence 1 11211 1122211222221 1 Q ss_pred HHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 
166 IEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 166 ~~~~~~~rl~~Ei~r~L~k 184 (184) ....|.+ +|..+|-| T Consensus 94 ~~~~~~~----~i~~~lrk 108 (108) T protein:vir:99 94 EWPVLMA----NIKKMFKR 108 (108) T ss_pred HHHHHHH----HHHHHhcC Confidence 1222333 33333333 No 59 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=71.88 E-value=0.19 Score=24.54 Aligned_cols=120 Identities=14% Similarity=0.244 Sum_probs=53.4 Q ss_pred CeEEeeHHHHH-HHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFE-ELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~~l~-~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~ 78 (184) |.-.|+.++|. .+++.|..+...+. ..+..++++++..+...+. +..-...+..+ .-++++ +.. ...+++ T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~-~~v~~~v~~~a~~~~~~lk----~~sP~~TG~yaksW~~k~-~~~--~~~~v~ 72 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVV-DDIDDIKKDITKNGVKQLR----ESSPKRTGDYAKNWTSQK-LKN--GDQVIY 72 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH----hhCCccccccccceeeee-cCC--eeEEEE Confidence 98888888874 55889999987764 5788888887776654444 32222222222 222222 111 112222 Q ss_pred e-ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 79 E-SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 79 ~-g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) . ...-+|.++ +-+||. .|.|| |.|= .|. 
T Consensus 73 ~~~~~y~l~HL------------------------LE~GHa---~r~GG---------rV~a-------~ph-------- 101 (123) T protein:vir:96 73 QKAPTYRLTHL------------------------LENGHA---KRNGG---------RVSP-------KVH-------- 101 (123) T ss_pred EecCCcceEEe------------------------eeccee---ecCCc---------eeCc-------chh-------- Confidence 1 111111111 122221 12221 2211 121 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) +...++. +.+.|+.+|.+.|.| T Consensus 102 -I~paee~----~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 102 -IAPVEEE----LVSNYISRVEKRLSQ 123 (123) T ss_pred -hhHHHHH----HHHHHHHHHHHHhcC Confidence 1112223 333444455555555 No 60 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=68.04 E-value=0.11 Score=25.75 Aligned_cols=136 Identities=11% Similarity=0.109 Sum_probs=53.7 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe---ecccCCcEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA---HFNAGGNTSKV 76 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~---ka~~~~~~a~i 76 (184) |.++++.+++++ .|..++.++ ..++.++|++++..+..++...+ -+....++ ++... .......++.+ T Consensus 2 ~~~~~~~~gl~~---~l~~~~~~~-~~~~~~~i~~~a~~v~~~Ak~~a----Pv~tG~Lr~SI~~~~~~~~~~~~~~~~v 73 (142) T protein:vir:99 2 VQVSVRYEGFDY---NPVGAAAQV-GPILRRTHSSLTRQIANETRARV----PVLTGHLGRSVREDPQVMVTPFHVSGGV 73 (142) T ss_pred ceeEEEeeecch---hHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC----CccchhhhcceeeeeccccccceEEEEe Confidence 888888877765 444445455 46888999988887766654433 33334444 23211 11111223333 Q ss_pred EEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCc-hhHhhcCch Q lcl|NC_021560. 
77 VVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAAN-PAHAITNNP 155 (184) Q Consensus 77 ~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gps-i~~m~~~~~ 155 (184) ..+..-=+..+||++...- .++ ...++.-. ..|+ .+|.+ .+..|. .|+-+ T Consensus 74 ~~~a~YA~~ve~GT~ph~i----~pk--~~~al~f~-~~g~-~~~~k-----------------~v~hpG~~a~Pf---- 124 (142) T protein:vir:99 74 TAHAKYAAAVHEGTRPHVI----RAK--HAQALHFW-WRGR-EVFVR-----------------QVNHPGTRARPY---- 124 (142) T ss_pred ccCccccceeccCCcccee----ccc--cCceeeEe-cCCc-eeeee-----------------eeecCCCCCCch---- Confidence 3333334445667643210 111 11111100 1121 12221 112221 12211 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 156 DVYLDVLAGVIEDYFFPRIV 175 (184) Q Consensus 156 ~~~~~~~~~~~~~~~~~rl~ 175 (184) +...+++...+....... T Consensus 125 --l~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 125 --LRNAGEAVVRRDRRIRVR 142 (142) T ss_pred --hHHHHHHHHhhhhhhccC Confidence 112222222221111111 No 61 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=68.04 E-value=0.11 Score=25.75 Aligned_cols=136 Identities=11% Similarity=0.109 Sum_probs=53.7 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhhe---ecccCCcEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTA---HFNAGGNTSKV 76 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~---ka~~~~~~a~i 76 (184) |.++++.+++++ .|..++.++ ..++.++|++++..+..++...+ -+....++ ++... 
.......++.+ T Consensus 2 ~~~~~~~~gl~~---~l~~~~~~~-~~~~~~~i~~~a~~v~~~Ak~~a----Pv~tG~Lr~SI~~~~~~~~~~~~~~~~v 73 (142) T protein:vir:86 2 VQVSVRYEGFDY---NPVGAAAQV-GPILRRTHSSLTRQIANETRARV----PVLTGHLGRSVREDPQVMVTPFHVSGGV 73 (142) T ss_pred ceeEEEeeecch---hHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC----CccchhhhcceeeeeccccccceEEEEe Confidence 888888877765 444445455 46888999988887766654433 33334444 23211 11111223333 Q ss_pred EEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCc-hhHhhcCch Q lcl|NC_021560. 77 VVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAAN-PAHAITNNP 155 (184) Q Consensus 77 ~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gps-i~~m~~~~~ 155 (184) ..+..-=+..+||++...- .++ ...++.-. ..|+ .+|.+ .+..|. .|+-+ T Consensus 74 ~~~a~YA~~ve~GT~ph~i----~pk--~~~al~f~-~~g~-~~~~k-----------------~v~hpG~~a~Pf---- 124 (142) T protein:vir:86 74 TAHAKYAAAVHEGTRPHVI----RAK--HAQALHFW-WRGR-EVFVR-----------------QVNHPGTRARPY---- 124 (142) T ss_pred ccCccccceeccCCcccee----ccc--cCceeeEe-cCCc-eeeee-----------------eeecCCCCCCch---- Confidence 3333334445667643210 111 11111100 1121 12221 112221 12211 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 156 DVYLDVLAGVIEDYFFPRIV 175 (184) Q Consensus 156 ~~~~~~~~~~~~~~~~~rl~ 175 (184) +...+++...+....... T Consensus 125 --l~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 125 --LRNAGEAVVRRDRRIRVR 142 (142) T ss_pred --hHHHHHHHHhhhhhhccC Confidence 112222222221111111 No 62 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=66.83 E-value=0.26 Score=23.78 Aligned_cols=107 Identities=11% Similarity=0.262 Sum_probs=45.0 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEeecce Q lcl|NC_021560. 
5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVESGWI 83 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~g~~i 83 (184) |+.++|+++.+.|..+.. ..++.+||..++..+..++.. ...+....++ ++.+ ..+.+++.+.|..+.. T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~----~aPv~TG~Lr~si~~-~~~~~~~~~~V~~~~~-- 70 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT---LDDVKHVVKSNTASMNKNMQN----LAPVDTGNMKRSITS-EFTDGGLSGTTGPHTD-- 70 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHH----hCCCCchhhhcccee-eeecCceEEEeecCCC-- Confidence 888899999999987542 356778888887776665533 3333333443 2221 1122222222211110 Q ss_pred eeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHH Q lcl|NC_021560. 84 PLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLA 163 (184) Q Consensus 84 ~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~ 163 (184) | +.|+ -.|+. ..|-+..+.|+ ++.. T Consensus 71 ---------------------Y-a~~v---E~GT~-----------------km~aqpf~~pa------------~~~~- 95 (108) T protein:vir:74 71 ---------------------Y-AGYV---EYGTR-----------------FQSAQPFVKPA------------FNIQ- 95 (108) T ss_pred ---------------------c-ccce---ecccc-----------------ccCCCcchhhH------------HHHH- Confidence 1 1122 12221 11111111111 1111 Q ss_pred HHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 
164 GVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 164 ~~~~~~~~~rl~~Ei~r~L~ 183 (184) ...|.++|+ ++|- T Consensus 96 ---~~~~~~~i~----~~~k 108 (108) T protein:vir:74 96 ---KKVFTNDLE----RLTK 108 (108) T ss_pred ---HHHHHHHHH----HHcC Confidence 222333332 2222 No 63 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=64.72 E-value=0.16 Score=24.97 Aligned_cols=137 Identities=13% Similarity=0.096 Sum_probs=53.8 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheec-ccCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHF-NAGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka-~~~~~~a~i~~ 78 (184) |+++||.+ ++.+.|..++..+ ..++.++|++++..+..++... .-+....++ ++..... +.....+.|.. T Consensus 4 ~~~~~~~~---~l~~~l~~~~~~~-~~~~~~~l~~~a~~i~~~ak~~----aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~ 75 (142) T protein:vir:94 4 LNYRVNST---EFQGALRAALDRL-TGAAREATEAAANDMVNMAKGL----CPVDTGRLRSSIQAVPSGGRFSFSVTIGT 75 (142) T ss_pred eEEEecHH---HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh----CCccchhhhccceeeeccCCceEEEEEec Confidence 88888765 4566666666665 4688888888888776665444 334445565 3443222 22222344433 Q ss_pred eecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHH Q lcl|NC_021560. 
79 ESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVY 158 (184) Q Consensus 79 ~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~ 158 (184) +-.--+...||+++..- .-+++..+ .|- ..++..-..+.+| .|=...+.|++.+ - T Consensus 76 ~~~YA~~vE~Gt~~~~i--~pk~~k~l--~~~---~~~~~~~~v~~pG----------~~~~pfl~~A~~~--------~ 130 (142) T protein:vir:94 76 NVTYAADVEYGTAPHVI--VPKDKKAL--YWP---GAAHPVAKVNHPG----------TRAQPFMRPAIAA--------A 130 (142) T ss_pred CcccchhhhccCCCcee--ccCCCccc--eec---ccceeeeeeeecC----------CCCCcchhHHHHH--------H Confidence 33334445577653210 00011000 110 0010000000000 0111111111111 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 159 LDVLAGVIEDYFFPRIVHEIE 179 (184) Q Consensus 159 ~~~~~~~~~~~~~~rl~~Ei~ 179 (184) .+.++ ++-.+|. T Consensus 131 ~~~i~---------~~~~~~~ 142 (142) T protein:vir:94 131 STFLR---------NHAKGIR 142 (142) T ss_pred HHHHH---------HHHHhcC Confidence 11111 1111222 No 64 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=62.91 E-value=0.23 Score=24.07 Aligned_cols=87 Identities=8% Similarity=0.147 Sum_probs=55.6 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhh-hhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAA-LTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~-~~~~ka~~~~~~a~i~~~ 79 (184) =+++|+.++++++.+.|..... ..++.+.+.+++..+..++.... .++-..+++ +.+ ....+++++.+.+. 
T Consensus 2 a~~~i~~~Gld~L~~~L~~~~~---~~~v~~vv~~~~~~l~~~ak~~a----p~dTG~lrrSI~~-~~~~~g~~~~v~~~ 73 (92) T protein:vir:99 2 ADYSISWDGLDALDEALANQQN---MNTVKKVVKKHTANLMTATQQAV----PVDTGHLKQSAQI-QISRDGFTGSVTYG 73 (92) T ss_pred CceeeEeehHHHHHHHHHhhcc---HHHHHHHHHHHHHHHHHHHHHhC----CCCccccceeeeE-EeecCCeeEEEEec Confidence 2334455589999999987543 24688888888888877766654 334444543 332 34556778888766 Q ss_pred ecc---eeeeecCCCCCCC Q lcl|NC_021560. 80 SGW---IPLQRLGAVQNAT 95 (184) Q Consensus 80 g~~---i~l~~f~~r~~~~ 95 (184) |.+ =|-..||+|.... T Consensus 74 gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 74 GGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred cCccccccccccceeecCC Confidence 443 3334567777776 No 65 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=62.84 E-value=0.33 Score=23.24 Aligned_cols=113 Identities=5% Similarity=0.075 Sum_probs=46.3 Q ss_pred eEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhh-hhheecccCCcEEEEEEee Q lcl|NC_021560. 2 RLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAA-LTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 2 ~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~-~~~~ka~~~~~~a~i~~~g 80 (184) =++|+.++|+++.+.|..+... ..+.+++.+++..+...+.........+.-..+++ +.+ +.++..++|-++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~---~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~---~~~~~~~~V~~~- 73 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASP---EKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITL---QVESDKATVEAL- 73 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCH---HHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceee---eecCCeeEecCC- Confidence 2356667999999999887432 34455555555555444444432222333333332 211 111111221110 Q ss_pred cceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHH Q lcl|NC_021560. 
81 GWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLD 160 (184) Q Consensus 81 ~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~ 160 (184) ++ ...|.-.|- +..|=+....|+.. .... T Consensus 74 ---------------------------~~--------Ya~~vEfGT--------~km~a~Pfl~PA~~--------~~~~ 102 (114) T protein:vir:27 74 ---------------------------TS--------YSGYLEVGT--------RKMEAQPFMKPALD--------EVAP 102 (114) T ss_pred ---------------------------CC--------ccceecccc--------cccCCCCchhhhHH--------HHHH Confidence 11 111111111 12222323333332 1233 Q ss_pred HHHHHHHHHHHH Q lcl|NC_021560. 161 VLAGVIEDYFFP 172 (184) Q Consensus 161 ~~~~~~~~~~~~ 172 (184) .+.+.+.+.|+. T Consensus 103 ~~~~~l~~l~k~ 114 (114) T protein:vir:27 103 KMVEELAKWDET 114 (114) T ss_pred HHHHHHHHHhcC Confidence 344444444444 No 66 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=62.84 E-value=0.33 Score=23.24 Aligned_cols=113 Identities=5% Similarity=0.075 Sum_probs=46.3 Q ss_pred eEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhh-hhheecccCCcEEEEEEee Q lcl|NC_021560. 2 RLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAA-LTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 2 ~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~-~~~~ka~~~~~~a~i~~~g 80 (184) =++|+.++|+++.+.|..+... 
..+.+++.+++..+...+.........+.-..+++ +.+ +.++..++|-++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~---~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~---~~~~~~~~V~~~- 73 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASP---EKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITL---QVESDKATVEAL- 73 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCH---HHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceee---eecCCeeEecCC- Confidence 2356667999999999887432 34455555555555444444432222333333332 211 111111221110 Q ss_pred cceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHH Q lcl|NC_021560. 81 GWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLD 160 (184) Q Consensus 81 ~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~ 160 (184) ++ ...|.-.|- +..|=+....|+.. .... T Consensus 74 ---------------------------~~--------Ya~~vEfGT--------~km~a~Pfl~PA~~--------~~~~ 102 (114) T protein:vir:49 74 ---------------------------TS--------YSGYLEVGT--------RKMEAQPFMKPALD--------EVAP 102 (114) T ss_pred ---------------------------CC--------ccceecccc--------cccCCCCchhhhHH--------HHHH Confidence 11 111111111 12222323333332 1233 Q ss_pred HHHHHHHHHHHH Q lcl|NC_021560. 161 VLAGVIEDYFFP 172 (184) Q Consensus 161 ~~~~~~~~~~~~ 172 (184) .+.+.+.+.|+. T Consensus 103 ~~~~~l~~l~k~ 114 (114) T protein:vir:49 103 KMVEELAKWDET 114 (114) T ss_pred HHHHHHHHHhcC Confidence 344444444444 No 67 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=62.83 E-value=0.33 Score=23.24 Aligned_cols=162 Identities=10% Similarity=0.117 Sum_probs=76.1 Q ss_pred CeEEeeHHHHHHHHHHHhh--ccchhHHHHHHHHHHHHHHHHHHHHHHH-------HHHHhcccHHHHhhhh-heecccC Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRR--LPGEIRTKAMRRAMTRLRQTARSRIVAR-------LGPHTQMPRELVAALT-TAHFNAG 70 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~--l~~~~~~kA~~rAlnrt~~~~rt~~~r~-------i~~~~~ik~k~ik~~~-~~ka~~~ 70 (184) |+|+ +.+.+.+..+.|.. +| .+..+|+.++...+-..+..++.++ |++...+.......+. ..+++.+ T Consensus 1 ~~ik-~l~~~~~~L~~i~~~~vp-~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~~~l~a~I~~~~~ 78 (192) T protein:vir:34 1 MAIK-GLEQAVENLSRISKTAVP-GAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATVKNPQARIKVNRG 78 (192) T ss_pred Ccch-hHHHHHHHHhhcCchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccCCCceEEEEEecc Confidence 9997 77777777777644 55 4445666666666665555555544 4555555332222211 1244444 Q ss_pred CcEEEEEEeecc--eeee------------------ecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeec Q lcl|NC_021560. 71 GNTSKVVVESGW--IPLQ------------------RLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMS 130 (184) Q Consensus 71 ~~~a~i~~~g~~--i~l~------------------~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~ 130 (184) ++. .+.....+ ++.. ..|.+.-..+-.+.-...-.|-|-.....++..| +. T Consensus 79 ~l~-~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PI-e~------- 149 (192) T protein:vir:34 79 DLP-VIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPI-DV------- 149 (192) T ss_pred cee-eeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccce-eE------- Confidence 431 11111111 1100 1111111111111111112344443111122212 21 Q ss_pred ccCccccceeeeecCchhHhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 131 SATGKREQIRELFAANPAHAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 131 ~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) -..||. 
+|+ .+.+.+ ++...++++++..|...|++||..+|-| T Consensus 150 ----vkIpis---~~l-~~af~~---~~~~~~~~~~~~El~~~L~~~lr~~~k~ 192 (192) T protein:vir:34 150 ----VKIPMA---VPL-TTAFKQ---NIERIRRERLPKELGYALQHQLRMVIKR 192 (192) T ss_pred ----EEechh---HHH-HHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 125653 343 455532 3456677888888888888898888888 No 68 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=60.31 E-value=0.37 Score=22.92 Aligned_cols=136 Identities=13% Similarity=0.151 Sum_probs=63.0 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |--. ..+++++.+.|..++.++ .+++.+|+.+++..+...+. ....+.-..++ ++.+ +...+++++.|..+ T Consensus 1 Ma~~--~~G~~~l~~~l~~~~~~~-~~~~~~~~~~~a~~v~~~ak----~~aPv~TG~L~~Si~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:95 1 MAKV--KYGNWDLVKELENYERDM-ERWVKRGIAKTTAKIHNTII----SLMPVDTGYLRESVTM-DFKDGGFTGVINIG 72 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH----HhCCccchhhhcCeee-EeeCCceEEEEecC Confidence 4433 369999999999998766 56888888877776655554 33444445555 3432 33455677777766 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||+........ +.....+.+......|. |.+. 
+ +.| ||-+ + T Consensus 73 ~~YA~~vE~GT~~~~~~~~--~~~~~~~~~~~~~~~~~---~~~t-------~----------g~~--a~PF-------l 121 (137) T protein:vir:95 73 SEYAIYVNYGTGIYATGAG--GSRAKKIPWSYKDANGK---WHTT-------K----------GQH--AQPF-------W 121 (137) T ss_pred CCcccccccCccccccCCC--cccccccccceeccCcc---eeec-------C----------CCC--CCcc-------h Confidence 6555556677643321111 00001111111111111 1111 0 111 1111 0 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) ..-.+.....|.++|. T Consensus 122 ~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 122 EPAIDAGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHHHHhhC Confidence 1011112222333333 No 69 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=60.06 E-value=0.38 Score=22.89 Aligned_cols=136 Identities=13% Similarity=0.167 Sum_probs=63.4 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |.-- ..+++++.+.|..++.++ .+++.+||++++..+...+...+ -+.-..++ ++.+ +...+++++.|.++ T Consensus 1 Ma~~--~~Gl~~l~~~l~~~~~~~-~~~~~~al~~~a~~i~~~ak~~a----PvdTG~Lr~SI~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:10 1 MAKV--KYGNWELVKELEDFEKET-IRWAKKGIAKTTTIIHNSIVSNM----PVDTGYLRESVSM-DFKKGGLTGVINIG 72 (137) T ss_pred Cchh--HhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC----CcCcchhhcCeeE-EeeCCcEEEEEecC Confidence 4433 248999999999998766 46778888888777766655443 34445555 3433 33455677777766 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 
80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||++........+.. ....+......| .+....| .| ||-+- . T Consensus 73 ~~Ya~~vE~GT~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~t~g----------~~---------a~PFl------~ 122 (137) T protein:vir:10 73 SEYAVYVNYGTGIYAVGPGGSRA--KNIPWCYKDADG---HWHTTKG----------QH---------AQPFW------E 122 (137) T ss_pred CCcccccccCccccccCCCcccc--ccccceeecccc---ceeccCC----------CC---------CCcch------h Confidence 55555566776543221111110 011111111111 1111111 11 11110 0 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) ..+ ++..+.|.++|. T Consensus 123 pA~-~~~~~~i~k~i~ 137 (137) T protein:vir:10 123 PAI-DEGRAFFNKYFS 137 (137) T ss_pred HHH-HHHHHHHHHhcC Confidence 111 112223333333 No 70 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=56.59 E-value=0.45 Score=22.47 Aligned_cols=110 Identities=12% Similarity=0.074 Sum_probs=42.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccH---HHHhh-hhheecccCCcEEEEEEeecceeeeecCCCCCCCcceEe Q lcl|NC_021560. 25 RTKAMRRAMTRLRQTARSRIVARLGPHTQMPR---ELVAA-LTTAHFNAGGNTSKVVVESGWIPLQRLGAVQNATGVYAK 100 (184) Q Consensus 25 ~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~---k~ik~-~~~~ka~~~~~~a~i~~~g~~i~l~~f~~r~~~~gv~~~ 100 (184) +++++.+++|..|..+ .+.+-+.+=++. +.+++ ..+.+....+ .+|.-+..--+...||.|+.+.++. 
T Consensus 1 l~~~~~~~~~~~a~~l----~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~--~~v~N~~eYA~~VE~GHRq~~g~g~-- 72 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKL----LRKVKPKTPVAKIDGGTARKSWKYKELNLFD--GVVSNNVEYIHHLEYGHRTRQGTGT-- 72 (116) T ss_pred CchHHHHHHHHHHHHH----HHHHHhhCCCCcCCCcccccCceeeeeeccC--ceeecCCcccccccCCceeeCCcce-- Confidence 3455666666555444 444444454432 33432 2222222211 1122222222233334443322110 Q ss_pred cccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchh--HhhcCchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 101 LRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPA--HAITNNPDVYLDVLAGVIEDYFFPRIVHEI 178 (184) Q Consensus 101 ~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~--~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei 178 (184) +-...++|+.+ +-++ -|+ +.-+++++..|++.|+..| T Consensus 73 ----------------------------~~~~~gkrlk~-----~~V~G~fml--------~~s~~e~~~~~~~~~~~~~ 111 (116) T protein:vir:10 73 ----------------------------SENYRPKPNGI-----SFVPGVFML--------ARSVDEMSSIIDDELNQII 111 (116) T ss_pred ----------------------------ecccccccccC-----CccCceehH--------HHHHHHHHHHHHHHHHHHH Confidence 00001122211 1111 133 2223344556666677777 Q ss_pred HHhcC Q lcl|NC_021560. 179 ERLLP 183 (184) Q Consensus 179 ~r~L~ 183 (184) +-+|. T Consensus 112 ~~~l~ 116 (116) T protein:vir:10 112 IDFWN 116 (116) T ss_pred HHhcC Confidence 77777 No 71 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=50.09 E-value=0.32 Score=23.28 Aligned_cols=110 Identities=6% Similarity=0.117 Sum_probs=53.3 Q ss_pred eEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhh-hhheecccCCcEEEEEEee Q lcl|NC_021560. 2 RLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAA-LTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 2 ~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~-~~~~ka~~~~~~a~i~~~g 80 (184) =.+|+.++|+++.+.|..+... 
..+.+++.+.+...-..+.......-.+....+++ + ..+.+++++.|..+- T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~---~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI---~~~~~~~~~~v~~~~ 74 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASS---ERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSI---TLEAGSDRAVVEALT 74 (112) T ss_pred CceeeehHHHHHHHHHHhhcCH---HHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhce---eeecCceEEEecCCC Confidence 1246667999999999887532 34556666666655555555555444555555653 3 235556666665544 Q ss_pred cceeeeecCCCCCCCcceEeccc-ccCcceeeecCCCCeeeEEecCCeeecccCccccceeeee Q lcl|NC_021560. 81 GWIPLQRLGAVQNATGVYAKLRG-SYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELF 143 (184) Q Consensus 81 ~~i~l~~f~~r~~~~gv~~~~~~-~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~ 143 (184) .-=+...||+++....=...+.. ...--|+..+ +.|- T Consensus 75 ~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l--------------------------~~L~ 112 (112) T protein:vir:96 75 NYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEM--------------------------AKWE 112 (112) T ss_pred CccceeccCccccCCCCchhhhHHHHHHHHHHHH--------------------------HhcC Confidence 43444456655433211110000 0000112111 1111 No 72 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=49.26 E-value=0.64 Score=21.63 Aligned_cols=136 Identities=12% Similarity=0.179 Sum_probs=62.9 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |--. ..+++++.+.|..++.++ .+++.+|+.+++..+...+.. 
..-+.-..++ ++.+ ....+++++.|..+ T Consensus 1 Ma~~--~~g~~~l~~~l~~~~~~~-~~~~~~~~~~~a~~i~~~ak~----~aPvdTG~Lr~SI~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:97 1 MAKV--KYGNWDLVKELENYERDM-ERWVKRGIAKTTAKIHNTIIS----LMPVDTGYLRESVTM-DFKDSGFTGVINIG 72 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH----hCCccccchhcccee-EeecCceEEEEecC Confidence 4444 359999999999998766 567888887777766555543 3444455555 3433 23455677887776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||+........ +...-...+......| .|.+. ++ .|=...+-|+ + T Consensus 73 ~~YA~~vE~GT~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~t-------~g---~~a~PFl~pA------------~ 125 (137) T protein:vir:97 73 SEYAIYVNYGTGIYATGAG--GSRAKKIPWSYKDANG---KWHTT-------KG---QHAQPFWEPA------------I 125 (137) T ss_pred CCcccccccCccccccCCC--cccccccccceeccCc---ceeec-------CC---CCCCcchHHH------------H Confidence 6655666777643211110 0000011111111111 11111 00 1111111111 1 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) + .....|.++|. T Consensus 126 ~----~~~~~~~~~l~ 137 (137) T protein:vir:97 126 D----AGRAFFNKYFS 137 (137) T ss_pred H----HHHHHHHHhhC Confidence 2 22222333333 No 73 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=49.26 E-value=0.64 Score=21.63 Aligned_cols=136 Identities=12% Similarity=0.179 Sum_probs=62.9 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |--. ..+++++.+.|..++.++ .+++.+|+.+++..+...+.. ..-+.-..++ ++.+ ....+++++.|..+ T Consensus 1 Ma~~--~~g~~~l~~~l~~~~~~~-~~~~~~~~~~~a~~i~~~ak~----~aPvdTG~Lr~SI~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:93 1 MAKV--KYGNWDLVKELENYERDM-ERWVKRGIAKTTAKIHNTIIS----LMPVDTGYLRESVTM-DFKDSGFTGVINIG 72 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH----hCCccccchhcccee-EeecCceEEEEecC Confidence 4444 359999999999998766 567888887777766555543 3444455555 3433 23455677887776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||+........ +...-...+......| .|.+. ++ .|=...+-|+ + T Consensus 73 ~~YA~~vE~GT~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~t-------~g---~~a~PFl~pA------------~ 125 (137) T protein:vir:93 73 SEYAIYVNYGTGIYATGAG--GSRAKKIPWSYKDANG---KWHTT-------KG---QHAQPFWEPA------------I 125 (137) T ss_pred CCcccccccCccccccCCC--cccccccccceeccCc---ceeec-------CC---CCCCcchHHH------------H Confidence 6655666777643211110 0000011111111111 11111 00 1111111111 1 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) + .....|.++|. 
T Consensus 126 ~----~~~~~~~~~l~ 137 (137) T protein:vir:93 126 D----AGRAFFNKYFS 137 (137) T ss_pred H----HHHHHHHHhhC Confidence 2 22222333333 No 74 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=49.26 E-value=0.64 Score=21.63 Aligned_cols=136 Identities=12% Similarity=0.179 Sum_probs=62.9 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |--. ..+++++.+.|..++.++ .+++.+|+.+++..+...+.. ..-+.-..++ ++.+ ....+++++.|..+ T Consensus 1 Ma~~--~~g~~~l~~~l~~~~~~~-~~~~~~~~~~~a~~i~~~ak~----~aPvdTG~Lr~SI~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:94 1 MAKV--KYGNWDLVKELENYERDM-ERWVKRGIAKTTAKIHNTIIS----LMPVDTGYLRESVTM-DFKDSGFTGVINIG 72 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH----hCCccccchhcccee-EeecCceEEEEecC Confidence 4444 359999999999998766 567888887777766555543 3444455555 3433 23455677887776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||+........ +...-...+......| .|.+. 
++ .|=...+-|+ + T Consensus 73 ~~YA~~vE~GT~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~t-------~g---~~a~PFl~pA------------~ 125 (137) T protein:vir:94 73 SEYAIYVNYGTGIYATGAG--GSRAKKIPWSYKDANG---KWHTT-------KG---QHAQPFWEPA------------I 125 (137) T ss_pred CCcccccccCccccccCCC--cccccccccceeccCc---ceeec-------CC---CCCCcchHHH------------H Confidence 6655666777643211110 0000011111111111 11111 00 1111111111 1 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) + .....|.++|. T Consensus 126 ~----~~~~~~~~~l~ 137 (137) T protein:vir:94 126 D----AGRAFFNKYFS 137 (137) T ss_pred H----HHHHHHHHhhC Confidence 2 22222333333 No 75 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=48.35 E-value=0.67 Score=21.53 Aligned_cols=107 Identities=14% Similarity=0.262 Sum_probs=44.5 Q ss_pred eeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEeecce Q lcl|NC_021560. 5 MNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVESGWI 83 (184) Q Consensus 5 id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~g~~i 83 (184) |+.++|+++.+.|..+.. ..++..||.+++..+..++.. ...+.-+.++ ++.+ ..+.+...+.|..+. T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~----~apvdTG~Lr~si~~-~~~~~~~~~~V~~~~--- 69 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT---LNDVKHVVKRNTVSMNKNMQN----LAPVDTGNMKRSITS-EFTDGGLTGTTIPHT--- 69 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHH----hCCCCchhhHhhcee-eeecCceEEEeecCC--- Confidence 777899999999987643 346777777777766555433 2334334443 2211 111122222221111 Q ss_pred eeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHHHHHH Q lcl|NC_021560. 
84 PLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYLDVLA 163 (184) Q Consensus 84 ~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~ 163 (184) +...|--.|- ++.|=+..+.|+. + T Consensus 70 ---------------------------------~Ya~~vE~GT--------~~m~aqPFl~pa~------------~--- 93 (108) T protein:vir:98 70 ---------------------------------DYAGYVEYGT--------RFQAAQPFVKPAF------------D--- 93 (108) T ss_pred ---------------------------------Cccceeeccc--------cccCCCcchhhHH------------H--- Confidence 1112211111 1122111122221 1 Q ss_pred HHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 164 GVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 164 ~~~~~~~~~rl~~Ei~r~L~ 183 (184) .....|.+ +|+++|- T Consensus 94 -~~~~~~~~----~i~~~lr 108 (108) T protein:vir:98 94 -VQKKIFTN----DLERLTK 108 (108) T ss_pred -HHHHHHHH----HHHHHcC Confidence 11122222 3333333 No 76 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=45.05 E-value=0.78 Score=21.16 Aligned_cols=170 Identities=15% Similarity=0.130 Sum_probs=79.4 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc------HH----HHh--hhh----- Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMP------RE----LVA--ALT----- 63 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik------~k----~ik--~~~----- 63 (184) |.++++.++++.+...|..+ +..|+.=.+-+.+.+..++....+.|+.|-+.. .+ -.+ +|. 
T Consensus 3 ~~~~~n~~dl~~l~~~L~ll--~L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL~~ 80 (231) T protein:vir:37 3 IRLGLKQEDLDAFVRDLRTL--NLTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKVLR 80 (231) T ss_pred ccCCcCHHHHHHHHHHHHHh--cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHhHH Confidence 77788999999999999876 566778888899999999999999999998762 22 111 111 Q ss_pred --heecccCCcEEEEEEeecc--ee-eeecCCCCC--CCcceEecccc-------------cCcceeeecCCCC--eeeE Q lcl|NC_021560. 64 --TAHFNAGGNTSKVVVESGW--IP-LQRLGAVQN--ATGVYAKLRGS-------------YRHAFIAAMKSGH--VGAF 121 (184) Q Consensus 64 --~~ka~~~~~~a~i~~~g~~--i~-l~~f~~r~~--~~gv~~~~~~~-------------~~gaFia~~~~g~--~~vf 121 (184) -.+++.++....++.++.. |. .++||-+.. .....+.+... .+=+|......|. .+-+ T Consensus 81 ~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~ 160 (231) T protein:vir:37 81 YASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQGKTKY 160 (231) T ss_pred hhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCCCCCc Confidence 1134444445555555543 32 345664211 11111101000 1112332221111 1111 Q ss_pred EecCCe---eecccCccccceeeeec-------------Cchh-HhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 122 RRVPGT---QMSSATGKREQIRELFA-------------ANPA-HAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 122 ~R~~~~---~~~~~~~~R~PI~~l~g-------------psi~-~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ++..-. .....+++-+=|..|-+ +-++ +.+|.. ++.|.+.|..+|+.||.. 
T Consensus 161 rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~------------~~e~~~~l~~~l~~i~~~ 228 (231) T protein:vir:37 161 RLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTR------------EKENVDILREITLKFLSG 228 (231) T ss_pred CcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCC------------HHHHHHHHHHHHHHHhcc Confidence 111000 00000000011111111 0011 122322 234555666677777766 No 77 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=39.90 E-value=1 Score=20.59 Aligned_cols=120 Identities=16% Similarity=0.254 Sum_probs=52.0 Q ss_pred CeEEeeHHHH-HHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHH-HHhhhhheecccCCcEEEEEE Q lcl|NC_021560. 1 MRLEMNSKDF-EELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRE-LVAALTTAHFNAGGNTSKVVV 78 (184) Q Consensus 1 m~i~id~~~l-~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k-~ik~~~~~ka~~~~~~a~i~~ 78 (184) |.. |+.++| +++.+.|..+...+ ...+..+++.++..+=..+.+++.+..-..-+ .-+..++++.. .+ .+|+. T Consensus 1 M~~-i~id~La~~I~~~L~~Ys~~v-~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~-e~--~~V~n 75 (124) T protein:vir:95 1 MAK-IKIGRLADEITSQLRKYSQVI-ADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVP-NG--WVIHN 75 (124) T ss_pred Ccc-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeec-Cc--eeEEE Confidence 543 444454 67788898887655 45777777766666554444444433222221 22222222221 11 22332 Q ss_pred eecc--eeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchH Q lcl|NC_021560. 79 ESGW--IPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPD 156 (184) Q Consensus 79 ~g~~--i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~ 156 (184) +... ..|..| ||-. 
|-|| |.+- .|.+ T Consensus 76 k~~yqLtHLLE~---------------------------GHAk---r~GG---------RV~a-------~pHI------ 103 (124) T protein:vir:95 76 KTEYRLAHLLEY---------------------------GHAT---VDGG---------RVPG-------TPHI------ 103 (124) T ss_pred cCCCceeeeeec---------------------------ceec---cCCc---------ccCC-------ccch------ Confidence 2111 122222 2211 1111 2111 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 157 VYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 157 ~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) . .+.+.+.+.|+++|+++|.. T Consensus 104 ---~----paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 104 ---R----PIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred ---h----HHHHHHHHHHHHHHHHHhcC Confidence 1 12233444555677777777 No 78 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=39.81 E-value=1 Score=20.58 Aligned_cols=157 Identities=9% Similarity=0.133 Sum_probs=58.5 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cH--HHHhh----------- Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQM------PR--ELVAA----------- 61 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~i------k~--k~ik~----------- 61 (184) ++|+||.+. +.+.|..+...+. .....+...+..++..+.+.+..+..- |. ..+++ T Consensus 4 i~i~~d~~~---~~~~L~~l~~~~~--~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~ 78 (190) T protein:vir:99 4 ITLEWDGRR---ALDVLNAGSAALG--DPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTL 78 (190) T ss_pred eEEEecHHH---HHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCcccee Confidence 666776554 4444444433331 234567777888888888888777543 11 11110 Q ss_pred ---h-hheecccCCcEEEEEEeecceeeeecCCCCCCCcceEecc-------cccCcceeeecCCCCeeeEEecCCeeec Q lcl|NC_021560. 
62 ---L-TTAHFNAGGNTSKVVVESGWIPLQRLGAVQNATGVYAKLR-------GSYRHAFIAAMKSGHVGAFRRVPGTQMS 130 (184) Q Consensus 62 ---~-~~~ka~~~~~~a~i~~~g~~i~l~~f~~r~~~~gv~~~~~-------~~~~gaFia~~~~g~~~vf~R~~~~~~~ 130 (184) | ...+...+...+.|-++..--+++.||..-.....+...+ +...+-|.. ...+.|.+... .. T Consensus 79 tg~L~~Si~~~~~~~~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~----~~~~~~~~~~~--~~ 152 (190) T protein:vir:99 79 DGHLRNLLRYQLDGSELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVP----RRRSNFAQDVQ--IG 152 (190) T ss_pred cHHHHHHHhheecCcEEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhccccc----ccccccchhcc--cc Confidence 0 0011122233344433322233556664322211111100 001111111 11111111000 00 Q ss_pred ccCccccceeeeecCchhHhhcCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021560. 131 SATGKREQIRELFAANPAHAITNNPDVYLDVLAGVIEDYFFPRI 174 (184) Q Consensus 131 ~~~~~R~PI~~l~gpsi~~m~~~~~~~~~~~~~~~~~~~~~~rl 174 (184) . ..-.+|=..+.|.|... ...+.+.+++.+.+.|.++- T Consensus 153 ~-~~v~IPaRpfLG~s~~d-----~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 153 P-YTIQMPARPWLGTSSQD-----DDTILQRVERYLQRALRERA 190 (190) T ss_pred c-ceeeecCcccCCCCHHH-----HHHHHHHHHHHHHHHHhhcC Confidence 0 00124555555544221 22333333333333333333 No 79 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=39.21 E-value=1 Score=20.52 Aligned_cols=129 Identities=13% Similarity=0.144 Sum_probs=51.6 Q ss_pred CeEE------eeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcE Q lcl|NC_021560. 1 MRLE------MNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNT 73 (184) Q Consensus 1 m~i~------id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~ 73 (184) |=++ -++.+++++++.|.. +......+...+||+.++..+.....+++. +++ .++. 
T Consensus 1 ~~~~~~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~--------------~fk-DTGa-- 63 (138) T protein:vir:98 1 MLLEVSMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAIS--------------IYK-RTGE-- 63 (138) T ss_pred CeeeecccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhh--------------hhh-hccc-- Confidence 3333 356688899888877 554445566677777766665444433322 111 1111 Q ss_pred EEEEEeecceeeeecCCCCCCCcceE--ecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhh Q lcl|NC_021560. 74 SKVVVESGWIPLQRLGAVQNATGVYA--KLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAI 151 (184) Q Consensus 74 a~i~~~g~~i~l~~f~~r~~~~gv~~--~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~ 151 (184) .+.-..++.+.+..|+.. .+|. |. ...+ =|..-|-. |.... |.=+ T Consensus 64 --------t~dev~~s~p~~~~G~r~V~igW~---Gp-R~~i--vHLNE~Gy-Gk~i~------------------PrG~ 110 (138) T protein:vir:98 64 --------TTESAVVSGVRREDGIPKVKLGFT---TP-RWNI--VHLQELEY-GWKHN------------------RRGV 110 (138) T ss_pred --------eeeeeeecCeeecCCceEEEEeee---cC-eeeE--Eeeecccc-cCCcC------------------CCcc Confidence 111111122222222211 1221 10 0000 00000000 11000 1111 Q ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 152 TNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) | +.+...+..+..+.+-+..|+.+.|.- T Consensus 111 G-----~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 111 G-----VIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred h-----HHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 2 123333344555666666666666666 No 80 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=30.34 E-value=1.6 Score=19.49 Aligned_cols=136 Identities=10% Similarity=0.136 Sum_probs=58.2 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhh-hhheecccCCcEEEEEEe Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAA-LTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~-~~~~ka~~~~~~a~i~~~ 79 (184) =+.++|..+++++.+.+..+|++.+ +++..+|..-+... +...|..-.-|....-.. ..-.+|..++. +. T Consensus 3 ~~~sld~s~~e~L~~~i~r~P~ksE-~~IN~~L~tkg~~~---~~~~I~~~iPvS~~~k~~~RnK~HAK~s~p---l~-- 73 (140) T protein:vir:40 3 AKWSLEFSDVERLSNLISQIPNKSE-AIINKTLETKAVPL---VKLNIEKRINLSKNWKGQLLNKNHAQSSGP---FN-- 73 (140) T ss_pred cceecchhhHHHHHHHHHhccchHH-HHHHHHHHhhhhHH---HHhhhhhccCcCccchhhhccccchhhhhh---hh-- Confidence 5678999999999999999998886 35544444222211 222233322332110000 01122222210 00 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) .+.-+=|..-+.++.| | +.||-=.|. +.+ -|+. ++.|...- T Consensus 74 ----------~~~~NLgf~i~~k~kf----------~-YLvfPD~G~------G~s-n~~~-------q~FmerGl---- 114 (140) T protein:vir:40 74 ----------VKMGNLGFELLTKPKF----------N-YLIFPDQGI------GKH-NKTK-------QDFMQLGV---- 114 (140) T ss_pred ----------hhhhhcceeEeecCcc----------c-ccccccccC------CCC-Ccch-------HHHHHhcc---- Confidence 0111111111111111 0 122211111 111 1221 22221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 
160 DVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) +.-.+.+-+.|.+.|++||+.+||= T Consensus 115 ~~~t~~i~E~L~~~l~k~in~~Lgg 139 (140) T protein:vir:40 115 EESSQEIVEMLEQAVFKEINDTLGG 139 (140) T ss_pred ccchhHHHHHHHHHHHHHHHHhhcC Confidence 1112344566778889999999998 No 81 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=27.82 E-value=1.8 Score=19.18 Aligned_cols=122 Identities=8% Similarity=-0.003 Sum_probs=40.8 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |.+++|.++|+...+.|+.. . .+....|+...+.-+ ...+.+.+-..- ....+.-.|.+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~---~-~k~~~~Al~aga~~~----~e~l~~~aP~~~-----------~~~hl~d~I~v-- 59 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLK---M-NLNSNVIVKAGAMSL----VPLLKSNTPFAN-----------TKKHARDHIAV-- 59 (125) T ss_pred CeeEeeHHHHHHHHHHHHHH---H-HHHHHHHHHHHHHHH----HHHHHHhCCCCC-----------CCchhhhheee-- Confidence 99999988877766666532 2 234444554443332 222232221100 00001111111 Q ss_pred cceeeeecCCCCCCC---cceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 81 GWIPLQRLGAVQNAT---GVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~---gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +. .++.. .+.+..+..-+.+|.+.+- -.|- ...|=. |.+... 
T Consensus 60 -----s~---~k~~~~~g~~~v~VG~~k~~~~~a~F~--------E~GT--------~k~~a~----pF~~~a------- 104 (125) T protein:vir:81 60 -----SN---VKTDRHTSEKIVTIGYAKGVSHRIHAT--------EFGT--------MYQKPQ----LFITKT------- 104 (125) T ss_pred -----cc---cccccccceEEEEeccCCCCceEEEec--------cCCc--------cCCCCC----chhhHH------- Confidence 11 11111 1222221111234543321 1111 111111 222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) ++.-.+++.+ .+..||.+++- T Consensus 105 -~~~~~~ev~~----~~~~~lrk~~k 125 (125) T protein:vir:81 105 -EKQGKNKVLK----TMLDTAKRLQK 125 (125) T ss_pred -HHHhHHHHHH----HHHHHHHHHhC Confidence 2222222222 22334433333 No 82 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=27.82 E-value=1.8 Score=19.18 Aligned_cols=122 Identities=8% Similarity=-0.003 Sum_probs=40.8 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |.+++|.++|+...+.|+.. . .+....|+...+.-+ ...+.+.+-..- ....+.-.|.+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~---~-~k~~~~Al~aga~~~----~e~l~~~aP~~~-----------~~~hl~d~I~v-- 59 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLK---M-NLNSNVIVKAGAMSL----VPLLKSNTPFAN-----------TKKHARDHIAV-- 59 (125) T ss_pred CeeEeeHHHHHHHHHHHHHH---H-HHHHHHHHHHHHHHH----HHHHHHhCCCCC-----------CCchhhhheee-- Confidence 99999988877766666532 2 234444554443332 222232221100 00001111111 Q ss_pred cceeeeecCCCCCCC---cceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 
81 GWIPLQRLGAVQNAT---GVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~---gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +. .++.. .+.+..+..-+.+|.+.+- -.|- ...|=. |.+... T Consensus 60 -----s~---~k~~~~~g~~~v~VG~~k~~~~~a~F~--------E~GT--------~k~~a~----pF~~~a------- 104 (125) T protein:vir:94 60 -----SN---VKTDRHTSEKIVTIGYAKGVSHRIHAT--------EFGT--------MYQKPQ----LFITKT------- 104 (125) T ss_pred -----cc---cccccccceEEEEeccCCCCceEEEec--------cCCc--------cCCCCC----chhhHH------- Confidence 11 11111 1222221111234543321 1111 111111 222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) ++.-.+++.+ .+..||.+++- T Consensus 105 -~~~~~~ev~~----~~~~~lrk~~k 125 (125) T protein:vir:94 105 -EKQGKNKVLK----TMLDTAKRLQK 125 (125) T ss_pred -HHHhHHHHHH----HHHHHHHHHhC Confidence 2222222222 22334433333 No 83 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=27.82 E-value=1.8 Score=19.18 Aligned_cols=122 Identities=8% Similarity=-0.003 Sum_probs=40.8 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |.+++|.++|+...+.|+.. . 
.+....|+...+.-+ ...+.+.+-..- ....+.-.|.+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~---~-~k~~~~Al~aga~~~----~e~l~~~aP~~~-----------~~~hl~d~I~v-- 59 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLK---M-NLNSNVIVKAGAMSL----VPLLKSNTPFAN-----------TKKHARDHIAV-- 59 (125) T ss_pred CeeEeeHHHHHHHHHHHHHH---H-HHHHHHHHHHHHHHH----HHHHHHhCCCCC-----------CCchhhhheee-- Confidence 99999988877766666532 2 234444554443332 222232221100 00001111111 Q ss_pred cceeeeecCCCCCCC---cceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 81 GWIPLQRLGAVQNAT---GVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~---gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +. .++.. .+.+..+..-+.+|.+.+- -.|- ...|=. |.+... T Consensus 60 -----s~---~k~~~~~g~~~v~VG~~k~~~~~a~F~--------E~GT--------~k~~a~----pF~~~a------- 104 (125) T protein:vir:47 60 -----SN---VKTDRHTSEKIVTIGYAKGVSHRIHAT--------EFGT--------MYQKPQ----LFITKT------- 104 (125) T ss_pred -----cc---cccccccceEEEEeccCCCCceEEEec--------cCCc--------cCCCCC----chhhHH------- Confidence 11 11111 1222221111234543321 1111 111111 222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) ++.-.+++.+ .+..||.+++- T Consensus 105 -~~~~~~ev~~----~~~~~lrk~~k 125 (125) T protein:vir:47 105 -EKQGKNKVLK----TMLDTAKRLQK 125 (125) T ss_pred -HHHhHHHHHH----HHHHHHHHHhC Confidence 2222222222 22334433333 No 84 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=27.82 E-value=1.8 Score=19.18 Aligned_cols=122 Identities=8% Similarity=-0.003 Sum_probs=40.8 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |.+++|.++|+...+.|+.. . .+....|+...+.-+ ...+.+.+-..- ....+.-.|.+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~---~-~k~~~~Al~aga~~~----~e~l~~~aP~~~-----------~~~hl~d~I~v-- 59 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLK---M-NLNSNVIVKAGAMSL----VPLLKSNTPFAN-----------TKKHARDHIAV-- 59 (125) T ss_pred CeeEeeHHHHHHHHHHHHHH---H-HHHHHHHHHHHHHHH----HHHHHHhCCCCC-----------CCchhhhheee-- Confidence 99999988877766666532 2 234444554443332 222232221100 00001111111 Q ss_pred cceeeeecCCCCCCC---cceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 81 GWIPLQRLGAVQNAT---GVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~---gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +. .++.. .+.+..+..-+.+|.+.+- -.|- ...|=. |.+... T Consensus 60 -----s~---~k~~~~~g~~~v~VG~~k~~~~~a~F~--------E~GT--------~k~~a~----pF~~~a------- 104 (125) T protein:vir:98 60 -----SN---VKTDRHTSEKIVTIGYAKGVSHRIHAT--------EFGT--------MYQKPQ----LFITKT------- 104 (125) T ss_pred -----cc---cccccccceEEEEeccCCCCceEEEec--------cCCc--------cCCCCC----chhhHH------- Confidence 11 11111 1222221111234543321 1111 111111 222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 
158 YLDVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) ++.-.+++.+ .+..||.+++- T Consensus 105 -~~~~~~ev~~----~~~~~lrk~~k 125 (125) T protein:vir:98 105 -EKQGKNKVLK----TMLDTAKRLQK 125 (125) T ss_pred -HHHhHHHHHH----HHHHHHHHHhC Confidence 2222222222 22334433333 No 85 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=27.82 E-value=1.8 Score=19.18 Aligned_cols=122 Identities=8% Similarity=-0.003 Sum_probs=40.8 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |.+++|.++|+...+.|+.. . .+....|+...+.-+ ...+.+.+-..- ....+.-.|.+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~---~-~k~~~~Al~aga~~~----~e~l~~~aP~~~-----------~~~hl~d~I~v-- 59 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLK---M-NLNSNVIVKAGAMSL----VPLLKSNTPFAN-----------TKKHARDHIAV-- 59 (125) T ss_pred CeeEeeHHHHHHHHHHHHHH---H-HHHHHHHHHHHHHHH----HHHHHHhCCCCC-----------CCchhhhheee-- Confidence 99999988877766666532 2 234444554443332 222232221100 00001111111 Q ss_pred cceeeeecCCCCCCC---cceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 81 GWIPLQRLGAVQNAT---GVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~---gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +. .++.. .+.+..+..-+.+|.+.+- -.|- ...|=. |.+... 
T Consensus 60 -----s~---~k~~~~~g~~~v~VG~~k~~~~~a~F~--------E~GT--------~k~~a~----pF~~~a------- 104 (125) T protein:vir:79 60 -----SN---VKTDRHTSEKIVTIGYAKGVSHRIHAT--------EFGT--------MYQKPQ----LFITKT------- 104 (125) T ss_pred -----cc---cccccccceEEEEeccCCCCceEEEec--------cCCc--------cCCCCC----chhhHH------- Confidence 11 11111 1222221111234543321 1111 111111 222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLLP 183 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L~ 183 (184) ++.-.+++.+ .+..||.+++- T Consensus 105 -~~~~~~ev~~----~~~~~lrk~~k 125 (125) T protein:vir:79 105 -EKQGKNKVLK----TMLDTAKRLQK 125 (125) T ss_pred -HHHhHHHHHH----HHHHHHHHHhC Confidence 2222222222 22334433333 No 86 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=26.42 E-value=1.9 Score=19.00 Aligned_cols=135 Identities=13% Similarity=0.176 Sum_probs=62.2 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) .++. .+++++.+.|..++.++ .+++.+||.+++..+...+. ...-+.-..++ ++.+ ....+++++.|..+ T Consensus 2 a~~~---~G~~~l~~~L~~~~~~~-~~~~~~al~~~a~~v~~~ak----~~aPvdTG~Lr~SI~~-~~~~~~~~~~V~~~ 72 (137) T protein:vir:94 2 AKVK---YGNWDLVKELENYERDI-ERWVKRGIAKTTVKIHNTII----SLMPVDTGYLRESVTM-DFKDGGFTGVINIG 72 (137) T ss_pred chhH---HhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH----HhCCcCcchhhcCcee-EeecCcEEEEEecC Confidence 2222 38999999999998776 46778888777776655444 33444455565 3433 23445677777776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 
80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||+........ +.....+.+......| .|.+. ++ .| ||-+ + T Consensus 73 ~~YA~~vE~GT~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~t-------~g---~~---------a~PF-------l 121 (137) T protein:vir:94 73 SEYAIYVNYGTGIYATGAG--GSRAKKIPWSYKDANG---KWHTT-------KG---QH---------AQPF-------W 121 (137) T ss_pred CCcccccccCccccccCCC--cccccccccceeccCC---ceeec-------CC---cC---------CCcc-------h Confidence 6555666777654321111 0000111111111111 11111 00 11 1111 1 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) ..-.+...+.|.++|. T Consensus 122 ~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 122 EPAIDAGRVFFNKYFS 137 (137) T ss_pred HHHHHHHHHHHHHhhC Confidence 1111122233333333 No 87 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=25.65 E-value=2 Score=18.89 Aligned_cols=134 Identities=11% Similarity=0.071 Sum_probs=59.1 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |-. + ..+++++.+.|..++.++ .+++.+|+..++..+...+ ....-+.-..++ ++.. 
..+.+++++.|..+ T Consensus 1 Ma~-~-~~Gl~~l~~~l~~~~~~~-~~~~~~al~~~a~~v~~~a----k~~apvdTG~Lr~SI~~-~~~~~g~~~~V~~~ 72 (135) T protein:vir:96 1 MAK-V-KYGADSIVVDLEKYSKDM-EKWVKKGITKTTLKIYNTA----IHLMPVDTGFLRQSTTV-DFENGGFTGVVKIG 72 (135) T ss_pred Cch-h-hhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH----HHhCCccchhhhcceeE-EeecCcEEEEEecC Confidence 443 2 238999999999998666 4677777777776665544 333444555555 3433 23455567777655 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..--+...||+.+....... ++ .+. .|. .... +.|-+.. +.| |+-+ + T Consensus 73 ~~YA~~ve~GT~~~~~~~~~-~~-~~~-~~~-~~~~---g~~~~~~-----------------~~~--a~pf-------l 119 (135) T protein:vir:96 73 SNYAVYVNYGTGIYATKGSR-AH-KIP-WTY-KDPN---GKWHTTY-----------------GQM--PQPF-------W 119 (135) T ss_pred CCccchhhcccccccCCCcc-cc-ccc-ccc-ccCC---cceeecC-----------------CcC--CCcc-------h Confidence 55455556665332111100 00 000 010 0011 1111110 111 1111 1 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) ..-.+...+.|.+.|. T Consensus 120 ~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 120 EPAIDAGRQTFEQYFS 135 (135) T ss_pred hHHHHHHHHHHHHhcC Confidence 1111122233333333 No 88 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=24.54 E-value=2.2 Score=18.75 Aligned_cols=169 Identities=13% Similarity=0.106 Sum_probs=78.0 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc------HHH-----Hhh----hhhe Q lcl|NC_021560. 
1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMP------REL-----VAA----LTTA 65 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik------~k~-----ik~----~~~~ 65 (184) |++++|.+++..+...|.-+ +..|+.=.+-|.+.+..++....+.|+.|-+.. .+. ++. +.+. T Consensus 5 i~~~ln~~~~~~l~~~L~ll--~L~p~kRrrll~~iak~lr~~~k~rIr~Q~~PDGs~w~pRKr~k~KMl~~L~k~l~~~ 82 (230) T protein:vir:98 5 IKMGVNPDDLRDFLKDLELL--KIPPKKKKEILIRTLQEMKKRSVKSASNQRTPTGSGWKPRKNGNAKMLRRIAKTLKFT 82 (230) T ss_pred CcccCCHHHHHHHHHHHHHh--cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCChhhhhhhHHHHhhhHHHHHHh Confidence 88899999999999999866 566778888899999999999999999998772 221 211 1222 Q ss_pred ecccCCcEEEEEEeecc--ee-eeecCCCCCCCcceEeccc---cc-------------Ccceeeec---CCCCeeeEEe Q lcl|NC_021560. 66 HFNAGGNTSKVVVESGW--IP-LQRLGAVQNATGVYAKLRG---SY-------------RHAFIAAM---KSGHVGAFRR 123 (184) Q Consensus 66 ka~~~~~~a~i~~~g~~--i~-l~~f~~r~~~~gv~~~~~~---~~-------------~gaFia~~---~~g~~~vf~R 123 (184) ....+.....+|.++.. |. .++||.+.+-.......+. .+ .=+|.... +.|+ +-+++ T Consensus 83 ~~~~~~~~v~~~~~~~~~rIA~vHq~G~~~~~~~~~~~~r~~~~~~~~paTr~QAk~Lr~lGy~v~~g~~~~~~-k~~kk 161 (230) T protein:vir:98 83 SADREIKRVCTISRNAQRRSQKEHQRGAKITNLKSVILRKSRAGTAKDPATMRQAKKLRDLGYTVPNGTTKSGK-KRYRR 161 (230) T ss_pred hcccccceeeeecccchhhhhhhhhccchhhhhhhhhhhhhcCCCCcccccHHHHHHHHHcCCccCCCCCCcCC-CCCCC Confidence 22222334445665543 33 3456654322111111100 00 01132211 0000 00111 Q ss_pred cCCe---eecccCccccceeeeec-------------Cchh--HhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 124 VPGT---QMSSATGKREQIRELFA-------------ANPA--HAITNNPDVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 124 ~~~~---~~~~~~~~R~PI~~l~g-------------psi~--~m~~~~~~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) ..-. .....+++-+=|..|-+ ..+| +.+|.. 
++.|.+.|..++..+=++ T Consensus 162 ps~kwI~~nls~~qAgliIR~L~~k~~k~~~~~t~W~I~~PaR~FLG~~------------~~e~~~~l~~~l~~i~~~ 228 (230) T protein:vir:98 162 PSAREIVATLSRAKASLLIRYFQEKEERQGKRLTKWIIPTEKRPFLDER------------DKENAEILKEFILKFSGI 228 (230) T ss_pred CCHHHHHHhhhHHHHHHHHHHHhccccccccCccceeeecCcccccCCC------------hHHHHHHHHHHHHHhccc Confidence 0000 00000000011111111 0000 011211 334555566666666555 No 89 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=24.45 E-value=2.2 Score=18.73 Aligned_cols=136 Identities=13% Similarity=0.105 Sum_probs=62.7 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHh-hhhheecccCCcEEEEEEe Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVA-ALTTAHFNAGGNTSKVVVE 79 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik-~~~~~ka~~~~~~a~i~~~ 79 (184) |---+ .+++++.+.|..++..+ .+++.+||.+++..+...+...+ -+.-..++ ++.+ +...+++++.|.++ T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~-~~~~~~~l~~~a~~~~~~ak~~~----pvdTG~L~~Si~~-~~~~~g~~~~V~~~ 72 (137) T protein:vir:96 1 MAKVK--YGNWDLVAELEDYRDEM-EEWVKKGILKTTLAIYNTAVALA----PVDLGFLKESIDF-KVTDGGFSSVISVG 72 (137) T ss_pred CchhH--hhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC----CcCccchhcCcee-EeecCceEEEEecC Confidence 33222 48999999999998666 57888888888877766554443 34444555 3433 33455677777776 Q ss_pred ecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHHHH Q lcl|NC_021560. 80 SGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDVYL 159 (184) Q Consensus 80 g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~~~ 159 (184) ..-=+...||++....... +.......+......| .+.+..| .| |+-+- . 
T Consensus 73 ~~YA~yvE~GT~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~t~g----------~~---------a~pFl------~ 122 (137) T protein:vir:96 73 AEYAIYVEFGTGIYATGPG--GSRARKLPWTYKGDDG---EWHTTYG----------QQ---------AQPFW------N 122 (137) T ss_pred CCcccccccCccccccCCC--ccccccccceeeccCc---ceeecCC----------CC---------CCcch------h Confidence 5555566677754321110 0000001111111111 1111111 11 11110 0 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021560. 160 DVLAGVIEDYFFPRIV 175 (184) Q Consensus 160 ~~~~~~~~~~~~~rl~ 175 (184) ..+ +.....|.++|. T Consensus 123 pA~-~~~~~~i~k~i~ 137 (137) T protein:vir:96 123 PAI-DEGRKVFNRYFS 137 (137) T ss_pred HHH-HHHHHHHHHhhC Confidence 111 122222333333 No 90 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=24.02 E-value=2.2 Score=18.67 Aligned_cols=130 Identities=18% Similarity=0.172 Sum_probs=50.2 Q ss_pred CeEEeeHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHhhhhheecccCCcEEEEEEee Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRRLPGEIRTKAMRRAMTRLRQTARSRIVARLGPHTQMPRELVAALTTAHFNAGGNTSKVVVES 80 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~~~~~ik~k~ik~~~~~ka~~~~~~a~i~~~g 80 (184) |. ++.+++++++.|..-=+ ++++.|..|+++..+=..+...+.+. +.+++-+... .-++. T Consensus 1 m~---evkGv~eilk~lE~k~G---~~~m~ri~dkAL~~~g~~v~~~lK~~----------~~~fkDTGat-idev~--- 60 (133) T protein:vir:96 1 MR---LIYDTKKLERELEKRLS---KRALMRITDRALTEAGEVVLEAIRTN----------LKYFRDTGAE-YGEVK--- 60 (133) T ss_pred Cc---cccCHHHHHHHHHHhcC---HHHHHHHhhHHHHHHHHHHHHHHHHh----------hHHHhhccce-eeeEE--- Confidence 33 45677777777744222 23444444444444433333333222 2222222111 11111 Q ss_pred cceeeeecCCCCCCCcce-E-ecccc-cCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCchHH Q lcl|NC_021560. 
81 GWIPLQRLGAVQNATGVY-A-KLRGS-YRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNPDV 157 (184) Q Consensus 81 ~~i~l~~f~~r~~~~gv~-~-~~~~~-~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~~~ 157 (184) +..+....|+. + .+|.. -...=|--+ .-+|-|.|-|.... |.=+| T Consensus 61 -------~s~p~~~~g~rtV~i~W~gp~~R~~iVHL--NE~G~ytr~Gk~i~------------------PrG~G----- 108 (133) T protein:vir:96 61 -------LSKPTWENGKRTIRVYWEGEKHRYSIVHL--NEKGFYAKDGKFIR------------------PKGMG----- 108 (133) T ss_pred -------ecCceecCCceEEEEEeecCCCceeeEee--ecccceecCCceec------------------cchhh----- Confidence 11111111211 1 12211 000000000 11122333222111 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021560. 158 YLDVLAGVIEDYFFPRIVHEIERLL 182 (184) Q Consensus 158 ~~~~~~~~~~~~~~~rl~~Ei~r~L 182 (184) +.+...+..+..+.+-+..||+++| T Consensus 109 ~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 109 AIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred HHHHHHHhhhHHHHHHHHHHHHHhC Confidence 2344445667778888889999999 No 91 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=23.61 E-value=2.3 Score=18.62 Aligned_cols=127 Identities=11% Similarity=0.092 Sum_probs=50.2 Q ss_pred CeEEeeHHHHHHHHHHHhh-ccchhHHHHHHHHHHHHHHHHHHHHHHHHH--HHhcccHHHHhhhhheecccCC--cEEE Q lcl|NC_021560. 1 MRLEMNSKDFEELERAFRR-LPGEIRTKAMRRAMTRLRQTARSRIVARLG--PHTQMPRELVAALTTAHFNAGG--NTSK 75 (184) Q Consensus 1 m~i~id~~~l~~~~~~L~~-l~~~~~~kA~~rAlnrt~~~~rt~~~r~i~--~~~~ik~k~ik~~~~~ka~~~~--~~a~ 75 (184) |.=--++.+++++.+.|.. +....+.+...+||+.++..+...+.+++. +.||- .+..+.+.++...+ -... 
T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~---t~dev~~s~~~~~~G~r~V~ 77 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGE---TTESAVVSGVRREDGIPKVK 77 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcch---hhcceeecCeeecCCceEEE Confidence 6666677799999999988 665445677777777777766655555543 12221 12222222222111 1111 Q ss_pred EEEeecceeeeecCCCCCCCcceEecccccCcceeeecCCCCeeeEEecCCeeecccCccccceeeeecCchhHhhcCch Q lcl|NC_021560. 76 VVVESGWIPLQRLGAVQNATGVYAKLRGSYRHAFIAAMKSGHVGAFRRVPGTQMSSATGKREQIRELFAANPAHAITNNP 155 (184) Q Consensus 76 i~~~g~~i~l~~f~~r~~~~gv~~~~~~~~~gaFia~~~~g~~~vf~R~~~~~~~~~~~~R~PI~~l~gpsi~~m~~~~~ 155 (184) |--.|..-.|+|++- .|+ |.... |.=+| T Consensus 78 VgW~GpR~~ivHLNE------------------------~Gy-------Gk~~~------------------PrG~G--- 105 (132) T protein:vir:96 78 LGFTTPRWNIVHLQE------------------------LEY-------GWKHN------------------RRGVG--- 105 (132) T ss_pred ecccCCceeEEeeec------------------------ccc-------cCCcC------------------CCcch--- Confidence 111111111111100 010 11000 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021560. 156 DVYLDVLAGVIEDYFFPRIVHEIERLLPR 184 (184) Q Consensus 156 ~~~~~~~~~~~~~~~~~rl~~Ei~r~L~k 184 (184) +.+...+..+..+..-+..||.+.|.- T Consensus 106 --~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 106 --VIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred --HHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 011111122222222333333333333 Done!