Query lcl|NC_010392.1_cdsid_YP_001700604.1 [gene=STM2600.Gifsy1] [protein=bacteriophage head-tail assembly protein; Lambda gpZ homolog] [protein_id=YP_001700604.1] [location=complement(20925..21503)] Match_columns 192 No_of_seqs 56 out of 93 Neff 4.9 Searched_HMMs 1612 Date Thu Nov 7 13:08:30 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_19 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_19_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:79555 Length: 192 100.0 3.2E-78 2E-81 445.5 18.4 192 1-192 1-192 (192) 2 protein:vir:3427 Length: 192 # 100.0 7.2E-70 4.4E-73 399.7 18.7 183 1-192 3-192 (192) 3 protein:vir:396 Length: 184 # 100.0 6.6E-59 4.1E-62 339.5 18.8 182 1-192 3-184 (184) 4 protein:vir:96763 Length: 177 100.0 7.1E-47 4.4E-50 273.6 16.0 164 1-186 11-177 (177) 5 protein:vir:6375 Length: 205 # 99.6 7.2E-19 4.5E-22 120.0 10.6 176 1-192 7-202 (205) 6 protein:vir:10326 Length: 62 # 99.1 3.1E-14 1.9E-17 94.6 4.1 59 128-187 1-62 (62) 7 protein:vir:79555 Length: 192 97.4 7E-06 4.3E-09 48.9 10.6 157 1-188 4-192 (192) 8 protein:vir:106570 Length: 182 96.6 0.00042 2.6E-07 39.1 14.6 165 1-188 6-182 (182) 9 protein:vir:101594 Length: 173 96.4 0.00011 6.6E-08 42.4 10.2 160 1-189 3-173 (173) 10 protein:vir:79034 Length: 141 96.2 0.00024 1.5E-07 40.4 10.8 124 1-192 9-141 (141) 11 protein:vir:107568 Length: 146 96.0 0.00012 7.3E-08 42.1 8.4 135 1-182 9-146 (146) 12 protein:vir:105007 Length: 146 96.0 0.00012 7.3E-08 42.1 8.4 135 1-182 9-146 (146) 13 protein:vir:102085 Length: 146 96.0 0.00012 7.3E-08 42.1 8.4 135 1-182 9-146 (146) 14 protein:vir:102875 Length: 146 96.0 0.00012 7.3E-08 42.1 8.4 135 1-182 9-146 (146) 15 protein:vir:194 Length: 149 # 95.7 0.00052 3.2E-07 38.6 10.7 140 1-184 8-149 (149) 16 protein:vir:5745 Length: 135 # 95.5 0.00076 4.7E-07 37.7 10.8 128 1-183 7-135 (135) 17 protein:vir:93617 Length: 148 95.4 0.00092 5.7E-07 37.2 11.0 138 1-184 8-148 (148) 18 protein:vir:1386 Length: 149 # 95.3 0.00052 3.2E-07 38.6 9.2 140 1-184 9-149 (149) 19 protein:vir:1273 Length: 127 # 95.1 0.00098 6.1E-07 37.1 10.1 122 1-179 6-127 (127) 20 protein:vir:100243 Length: 140 94.8 0.0013 8.1E-07 36.4 10.2 134 1-186 6-140 (140) 21 protein:vir:105089 Length: 133 94.6 0.0013 8.1E-07 36.4 9.5 128 1-181 6-133 (133) 22 protein:vir:1437 Length: 140 # 94.3 0.0031 1.9E-06 34.3 11.0 134 1-186 6-140 (140) 23 protein:vir:95894 Length: 137 93.9 0.0017 1.1E-06 35.8 8.8 132 1-175 5-137 (137) 24 protein:vir:4347 Length: 164 # 93.8 0.0062 3.8E-06 32.7 11.9 148 1-192 9-160 (164) 25 protein:vir:1891 Length: 179 # 93.8 0.0063 3.9E-06 32.7 12.0 163 1-192 9-175 (179) 26 protein:vir:107099 Length: 137 93.6 0.0018 1.1E-06 35.6 8.4 132 1-175 5-137 (137) 27 protein:vir:93738 Length: 137 93.4 0.0024 1.5E-06 34.9 8.8 132 1-175 5-137 (137) 28 protein:vir:97427 Length: 137 93.4 0.0024 1.5E-06 34.9 8.8 132 1-175 5-137 (137) 29 protein:vir:94490 Length: 137 93.4 0.0024 1.5E-06 34.9 8.8 132 1-175 5-137 (137) 30 protein:vir:94538 Length: 125 93.4 0.0024 1.5E-06 35.0 8.7 115 1-192 9-124 (125) 31 protein:vir:80362 Length: 140 93.3 0.0074 4.6E-06 32.3 11.2 134 1-186 6-140 (140) 32 protein:vir:100075 Length: 140 93.0 0.007 4.3E-06 32.4 10.7 134 1-186 6-140 (140) 33 protein:vir:9708 Length: 125 # 92.9 0.005 3.1E-06 33.2 9.8 124 1-180 2-125 (125) 34 protein:vir:94796 Length: 137 91.9 0.0058 3.6E-06 32.8 8.8 132 1-175 5-137 (137) 35 protein:vir:3873 Length: 128 # 91.7 0.0062 3.8E-06 32.7 8.7 124 1-179 5-128 (128) 36 protein:vir:95789 Length: 114 91.4 0.0082 5.1E-06 32.0 9.1 110 1-183 5-114 (114) 37 protein:vir:98636 Length: 138 91.1 0.015 9.1E-06 30.6 10.2 122 1-184 13-138 (138) 38 protein:vir:105330 Length: 137 89.4 0.015 9.6E-06 30.5 8.8 132 1-175 5-137 (137) 39 protein:vir:96121 Length: 137 89.4 0.013 8.3E-06 30.9 8.5 132 1-175 5-137 (137) 40 protein:vir:5978 Length: 144 # 89.3 0.018 1.1E-05 30.1 9.1 134 1-179 10-144 (144) 41 protein:vir:9930 Length: 108 # 89.2 0.016 9.9E-06 30.4 8.7 108 1-184 1-108 (108) 42 protein:vir:78335 Length: 133 89.0 0.014 8.9E-06 30.7 8.4 125 1-182 5-133 (133) 43 protein:vir:96829 Length: 135 88.9 0.015 9.2E-06 30.6 8.3 130 1-179 5-135 (135) 44 protein:vir:94108 Length: 149 88.7 0.017 1.1E-05 30.3 8.5 132 1-179 17-149 (149) 45 protein:vir:396 Length: 184 # 88.5 0.031 1.9E-05 28.8 9.8 166 1-188 6-184 (184) 46 protein:vir:1332 Length: 143 # 87.7 0.011 7.1E-06 31.2 6.9 128 1-176 11-143 (143) 47 protein:vir:105916 Length: 149 87.3 0.024 1.5E-05 29.4 8.5 132 1-179 17-149 (149) 48 protein:vir:96973 Length: 133 85.4 0.027 1.7E-05 29.2 7.8 124 1-180 5-133 (133) 49 protein:vir:9363 Length: 133 # 85.4 0.027 1.7E-05 29.2 7.8 124 1-180 5-133 (133) 50 protein:vir:94419 Length: 133 85.4 0.027 1.7E-05 29.2 7.8 124 1-180 5-133 (133) 51 protein:vir:78644 Length: 133 85.4 0.027 1.7E-05 29.2 7.8 124 1-180 5-133 (133) 52 protein:vir:9647 Length: 132 # 84.3 0.05 3.1E-05 27.7 8.6 123 1-184 7-132 (132) 53 protein:vir:6246 Length: 143 # 84.1 0.041 2.5E-05 28.2 8.1 128 1-176 11-143 (143) 54 protein:vir:97088 Length: 157 83.4 0.069 4.3E-05 27.0 10.1 129 1-192 5-155 (157) 55 protein:vir:105467 Length: 144 81.7 0.083 5.2E-05 26.5 9.5 135 1-190 8-144 (144) 56 protein:vir:743 Length: 108 # 80.9 0.083 5.1E-05 26.5 8.5 106 1-183 3-108 (108) 57 protein:vir:103917 Length: 115 80.1 0.098 6.1E-05 26.1 10.0 113 1-183 3-115 (115) 58 protein:vir:96358 Length: 115 80.1 0.098 6.1E-05 26.1 10.0 113 1-183 3-115 (115) 59 protein:vir:97144 Length: 115 80.1 0.098 6.1E-05 26.1 10.0 113 1-183 3-115 (115) 60 protein:vir:96225 Length: 115 80.1 0.098 6.1E-05 26.1 10.0 113 1-183 3-115 (115) 61 protein:vir:9312 Length: 115 # 80.1 0.098 6.1E-05 26.1 10.0 113 1-183 3-115 (115) 62 protein:vir:78858 Length: 115 80.1 0.098 6.1E-05 26.1 10.0 113 1-183 3-115 (115) 63 protein:vir:3617 Length: 112 # 78.2 0.12 7.2E-05 25.7 9.2 106 1-183 7-112 (112) 64 protein:vir:4906 Length: 114 # 78.2 0.058 3.6E-05 27.4 6.8 108 1-184 6-114 (114) 65 protein:vir:2740 Length: 114 # 78.2 0.058 3.6E-05 27.4 6.8 108 1-184 6-114 (114) 66 protein:vir:106623 Length: 115 73.9 0.16 0.0001 24.9 10.4 113 1-183 3-115 (115) 67 protein:vir:93898 Length: 133 66.6 0.26 0.00016 23.7 8.0 124 1-180 5-133 (133) 68 protein:vir:3427 Length: 192 # 63.9 0.31 0.00019 23.4 8.5 165 6-192 1-188 (192) 69 protein:vir:94654 Length: 142 63.7 0.12 7.6E-05 25.6 5.1 133 1-179 6-142 (142) 70 protein:vir:99744 Length: 115 61.5 0.35 0.00022 23.1 10.2 113 1-183 3-115 (115) 71 protein:vir:98409 Length: 108 55.3 0.48 0.0003 22.3 9.1 105 1-183 3-108 (108) 72 protein:vir:96486 Length: 112 55.0 0.45 0.00028 22.5 6.6 106 1-183 6-112 (112) 73 protein:vir:96012 Length: 133 53.8 0.52 0.00032 22.1 9.2 127 1-182 4-133 (133) 74 protein:vir:100652 Length: 134 43.1 0.86 0.00053 20.9 7.7 125 1-181 5-134 (134) 75 protein:vir:6071 Length: 150 # 41.0 0.95 0.00059 20.7 6.4 139 1-176 1-150 (150) 76 protein:vir:99528 Length: 92 # 36.2 0.78 0.00049 21.2 4.8 82 1-123 8-92 (92) 77 protein:vir:6375 Length: 205 # 31.1 1.5 0.00094 19.6 9.7 178 1-183 10-205 (205) 78 protein:vir:1838 Length: 149 # 30.4 1.3 0.0008 20.0 5.0 143 1-176 1-149 (149) 79 protein:vir:105773 Length: 131 30.2 1.6 0.00099 19.5 6.8 119 1-180 3-131 (131) 80 protein:vir:101302 Length: 134 29.7 1.6 0.001 19.4 8.4 126 1-181 5-134 (134) 81 protein:vir:9513 Length: 134 # 29.7 1.6 0.001 19.4 8.4 126 1-181 5-134 (134) 82 protein:vir:95062 Length: 116 29.4 1.7 0.001 19.4 6.9 115 19-179 1-116 (116) No 1 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=100.00 E-value=3.2e-78 Score=445.46 Aligned_cols=192 Identities=80% Similarity=1.233 Sum_probs=190.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |||||++|+||++|++++||+|+++|||+||.|++++|+++|++|+..++++.++||+|+||+|++++++++++.++++| T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I 80 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARI 80 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999998888999999 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) |||++|||+|+||+++.+++++.++.+.++|+++||+|+|||||||+|+||+||||+|++||+||||||||||||+|||+ T Consensus 81 ~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~R~~gk~R~PIevvkIpis~~l~~ 160 (192) T protein:vir:79 81 RVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVVKIPLSGPLTQ 160 (192) T ss_pred EEecCceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceEecCCCccCCeeeEeechHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999998899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) +||+|+++.|+++||+||+|+|++||+++||| T Consensus 161 af~~e~~r~~~~~~~~el~~~L~~qlr~~~~r 192 (192) T protein:vir:79 161 AFEDARDRIIAAEMPKQLGYALKQQLRLWLTR 192 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhCC Confidence 99999999999999999999999999999999 No 2 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=100.00 E-value=7.2e-70 Score=399.66 Aligned_cols=183 Identities=53% Similarity=0.830 Sum_probs=174.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |||||++++||++|++++||+|+++|||+||.|++++++++||+|+ +||+|+||+|++++||+ .+++++.| T Consensus 3 ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~--------~I~~k~Ir~r~r~~kAs-~~~l~a~I 73 (192) T protein:vir:34 3 IKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARET--------KVRRKLVKERARLKRAT-VKNPQARI 73 (192) T ss_pred chhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHh--------CCCHHHHHhhheecccc-CCCceEEE Confidence 9999999999999999999999999999999999999999999998 69999999999999985 56799999 Q ss_pred EEeccCcCceecCCcceeeccc-------ccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecC Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKR-------RGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIP 153 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~r-------r~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkip 153 (192) ++|++|||+|+||+++.+.+++ ++..+.+++++++|+|+|+|||+++|+||+||||+|++||+|||||+|+|| T Consensus 74 ~~~~~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIp 153 (192) T protein:vir:34 74 KVNRGDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIP 153 (192) T ss_pred EEeccceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEec Confidence 9999999999999999887765 334467789999999999999999999999999999879999999999999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 154 LSGPLTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 154 is~plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) ||+|++++|++|+++.|+++||+||+|+|++||++++|| T Consensus 154 is~~l~~af~~~~~~~~~~~~~~El~~~L~~~lr~~~k~ 192 (192) T protein:vir:34 154 MAVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR 192 (192) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 999999999999999999999999999999999999999 No 3 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=100.00 E-value=6.6e-59 Score=339.54 Aligned_cols=182 Identities=55% Similarity=0.855 Sum_probs=174.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++||+.|++++||+|+++|||+||.|++++++++|++|+ +||++.|++|+++++++ .+++++.| T Consensus 3 v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~--------~i~~~~ir~r~~~~kas-~~~l~a~I 73 (184) T protein:vir:39 3 LKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDT--------RVPRKLVKQRARVKRAT-VNKPRALI 73 (184) T ss_pred hHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------cCCHHHHHhhheecccC-CCCeEEEE Confidence 9999999999999999999999999999999999999999999998 69999999999999985 57899999 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) |++++|||++++|+++....+.+....+.++++++|+|.|+||||++|+|||||||+| .|++||||+++++|++.|++| T Consensus 74 ~~~~~~i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R-~gk~R~PI~~~~~~i~~~~~e 152 (184) T protein:vir:39 74 RVNRGNLPAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRR-TSKPRYPIEVVSIPLAAPLTT 152 (184) T ss_pred EEeccceeeeeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEE-ecCcccceeEEEcCchHHHHH Confidence 9999999999999998887777777777888899999999999999999999999999 599999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) +|++|+++.|+++|++||.++|+|||+++||| T Consensus 153 ~~~~~~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 153 AFKEELPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 99999999999999999999999999999999 No 4 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=100.00 E-value=7.1e-47 Score=273.59 Aligned_cols=164 Identities=18% Similarity=0.279 Sum_probs=146.6 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) +++..+.+++|-...++.+|+|+++|||+||.|++++++++|++|+ +||++.|++|+++++++ .+++++| T Consensus 11 v~~~l~~i~~~l~~~~~~~~~A~~rAlNrta~~~rt~~~r~v~~~~--------~i~~k~ir~r~~~~~a~--~~~~~~i 80 (177) T protein:vir:96 11 VSREAEDIAAMVAATTKQLELAAQRAMTKAGQWLRTHSVRELGQQL--------GIKQEPLKKRFRVYPQR--QKGEVRF 80 (177) T ss_pred hhHHHHHHHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------cCCHHHHHhhheeeccC--CCcEEEE Confidence 6777788889888899999999999999999999999999999998 69999999999999985 3678999 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) |++.+|||+++||+++++. +++++|+|.|+||||++|+||+||||+| .||+||||+++++|++.|+++ T Consensus 81 ~~~~~~i~l~~~~~~r~t~-----------~Gv~~g~~~~~gaFia~~~~g~~~Vf~R-~gk~R~PI~~~~~pi~~~~~~ 148 (177) T protein:vir:96 81 WVGLDPIGVYRLGTPKVTQ-----------KGVKVNRNEYDGAFISPMKSNYPLVFKR-RGKERLPIDLVDEDIDEPAME 148 (177) T ss_pred EEeccceehhhcccCCCCc-----------cceEEeeEEcCCceeccCCCCCceEEEE-ecCCccceEEEEcCchHHHHH Confidence 9999999999999877653 3379999999999999999999999999 599999999999999999988 Q ss_pred HHHHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESA---TQSLIDEEIPKQLGYALKQQL 186 (192) Q Consensus 161 afe~e---~~~~~~~~~~kEl~~~L~~ql 186 (192) .||+. +...|+++|++|++++|+.+= T Consensus 149 ~~e~~~~~~~~~~~~~l~~Ei~~~L~g~~ 177 (177) T protein:vir:96 149 VVERWERRVFQRFKELFEQEARAIINGHA 177 (177) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 88854 445566788888888888777 No 5 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=99.64 E-value=7.2e-19 Score=120.05 Aligned_cols=176 Identities=17% Similarity=0.205 Sum_probs=123.1 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcccchhhhhcchhhhh--hceeeeecCCCCCeE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRK-VARETVAGDNQVRGLPLKLVR--QRVRLFKAGTDGKRS 77 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~-va~e~~~~~~~~~~I~~k~vr--~R~r~~ka~~~~~~~ 77 (192) +.|+++..+.|+.+++. .++|+++|||+||.|+.+.++++ +.+++. +|..-|+ .|+++.|.++.++++ T Consensus 7 ~~G~~~~~~~l~~l~~~-~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn--------~k~~yv~~~~Rlti~k~As~~~L~ 77 (205) T protein:vir:63 7 AEGLGEFRDYVDRLPDI-SQQAAMIAINQTAQRTALPLARTEIGEQVN--------FPDNYLKDDSRLGVTKKATRNDLE 77 (205) T ss_pred hhhHHHHHHHHHhcchh-hhHHHHHHHHHHHHHhhHHHHHHhhhhccc--------cchhhhccceeeEEEeecCCCCee Confidence 99999999999999887 77999999999999999998886 788875 6666666 388898855889999 Q ss_pred EEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCC--------CCchhheeecccccccccee Q lcl|NC_010392. 78 ARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLA--------NGRWHVMRRVNGKNRYPIDV 149 (192) Q Consensus 78 a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~--------nGr~~V~~R~~Gk~R~PIev 149 (192) |.|.-...|.-+.++++++......|.. +..=-|.+-+-..|+|||+..|+ ||+.+|+.|+ +..+.|+.- T Consensus 78 A~I~ar~rpt~LsRF~~p~~~~~~~r~~-GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~-~~g~~~~~~ 155 (205) T protein:vir:63 78 AVIGARQRPTSLARFAEPGQTTKSTRKG-GVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRL-SPGETLHAT 155 (205) T ss_pred EEEecCCCcceeeeccCCCccccccccC-CeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeee-cCccccccc Confidence 9999999999999999988664433211 11112334344599999999997 9999999996 568888744 Q ss_pred e---ecCc-----hHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 150 V---KIPL-----SGP-LTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 150 v---kipi-----s~p-lt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) . |+|- =+| +-|-|.+.. ++...++...|..++...|-| T Consensus 156 ~g~~k~~~~~k~LYGPSV~Qvf~~~~-----e~I~~~i~~~l~~~f~r~~~~ 202 (205) T protein:vir:63 156 DGATKLSNNVYLLYGPSVDQVFRTVA-----DDITTEVLDALADEFLRQFTR 202 (205) T ss_pred cCceecCCceEEEEcCcHHHHHhhhh-----hhhhHHHHHHHHHHHHHhhhh Confidence 4 3332 133 334443211 233333333333333333333 No 6 >protein:vir:10326 Length: 62 # NCBI annotation: ORF28 # Family: family:all:1091 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758921;genbank:gi:27311195;genbank:GeneID:956157 Probab=99.15 E-value=3.1e-14 Score=94.61 Aligned_cols=59 Identities=29% Similarity=0.608 Sum_probs=56.7 Q ss_pred CCCCchhheeeccccccccceeeecCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 128 LANGRWHVMRRVNGKNRYPIDVVKIPL---SGPLTQAFESATQSLIDEEIPKQLGYALKQQLR 187 (192) Q Consensus 128 ~~nGr~~V~~R~~Gk~R~PIevvkipi---s~plt~afe~e~~~~~~~~~~kEl~~~L~~qlr 187 (192) |++.+..||+| .||+|+|||+|.+|| +.++.+.|+..+++.|++.|++|++|+|+|+=- T Consensus 1 M~S~~llVfRR-~gkeRlpIe~V~~dI~e~~~~ivery~~r~~~rF~elf~qE~~yvLs~~~~ 62 (62) T protein:vir:10 1 MKSEHLNVFRR-KGRERLPIEVVRLPIEEQSNPIFERYYQRAQGRFTELLRQELNFALNHEGA 62 (62) T ss_pred CCCCccchhhc-cCccccchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 99999999999 899999999999999 477999999999999999999999999999876 No 7 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=97.36 E-value=7e-06 Score=48.86 Aligned_cols=157 Identities=13% Similarity=0.185 Sum_probs=89.8 Q ss_pred ChhHHHHHHH-----HhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhh----cchh---hhhhceeee Q lcl|NC_010392. 1 MKGLENAIRN-----LNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRG----LPLK---LVRQRVRLF 68 (192) Q Consensus 1 ~~gl~~~i~n-----L~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~----I~~k---~vr~R~r~~ 68 (192) |+-+..-+.. +...+..++.+++.+|+++++.|+.++++++++.|.++....+++ .+.. ...-++++. T Consensus 4 l~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I~v~ 83 (192) T protein:vir:79 4 LENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARIRVN 83 (192) T ss_pred HHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEEEEe Confidence 4444444444 334467788899999999999999999999999999988766554 1111 112223332 Q ss_pred ecCCCCCeEEEEEEeccCc-------------CceecCC------cceeecccccccccccceeee-cceecCcceeecC Q lcl|NC_010392. 69 KAGTDGKRSARIRINRGNL-------------PAIKLGA------AQVRMSKRRGKLLYRGSVLKI-GPYLFRDAFIQQL 128 (192) Q Consensus 69 ka~~~~~~~a~i~v~r~~l-------------paikLg~------~~~~~~~rr~~~~~~~s~lkv-Gk~~f~gaFi~~~ 128 (192) .- +..-++++...+ ..+++|. --.++.+.++..+- ++ |+..|| T Consensus 84 ~~-----~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~-----R~~gk~R~P------- 146 (192) T protein:vir:79 84 RG-----NLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMR-----RIDGKNRYP------- 146 (192) T ss_pred cC-----ceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceE-----ecCCCccCC------- Confidence 21 222233322111 1112211 11121221111111 11 333333 Q ss_pred CCCchhheeeccccccccceeeecCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 129 ANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQAFESATQSLIDEEIPKQLGYALKQQLRL 188 (192) Q Consensus 129 ~nGr~~V~~R~~Gk~R~PIevvkipis~plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~ 188 (192) .. ..+.|| .-|+.+...+..++.++..|+++|-.||.+.|.--|.+ T Consensus 147 -------Ie----vvkIpi---s~~l~~af~~e~~r~~~~~~~~el~~~L~~qlr~~~~r 192 (192) T protein:vir:79 147 -------ID----VVKIPL---SGPLTQAFEDARDRIIAAEMPKQLGYALKQQLRLWLTR 192 (192) T ss_pred -------ee----eEeech---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhCC Confidence 22 345788 44887777777777777888888888888888888888 No 8 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=96.64 E-value=0.00042 Score=39.09 Aligned_cols=165 Identities=13% Similarity=0.107 Sum_probs=66.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhh--hhhceeeeecCCCCCeEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKL--VRQRVRLFKAGTDGKRSA 78 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~--vr~R~r~~ka~~~~~~~a 78 (192) |+||+++.+.|++++.. +.+|+..|+.+++..+-.....+.-. .+|.++ ||.-++..-....+...+ T Consensus 6 i~Gld~L~~kl~~~~~~-~~~~v~~a~~~~~~~~a~~v~~~ak~----------~~PvdtG~Lr~SI~~~~~~~~~~~~g 74 (182) T protein:vir:10 6 LKGVNELRAKLKKLPDI-MAKATANAQENAIEQAEAYAVDELQS----------SIKYSTGELTRSFKHEVKVDGDEVIG 74 (182) T ss_pred EecHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh----------hCCCCchhhhhceeeeeeecCCeEEE Confidence 99999999999998654 44555555544443333322222211 133322 333332211112345677 Q ss_pred EEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCc---------hhheeecccccccccee Q lcl|NC_010392. 79 RIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGR---------WHVMRRVNGKNRYPIDV 149 (192) Q Consensus 79 ~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr---------~~V~~R~~Gk~R~PIev 149 (192) .|+.+...=+.+-+||...-.....+-.-...-+.+=+.-.|+..++.....+. -+.|.+++|.. T Consensus 75 ~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~------ 148 (182) T protein:vir:10 75 RWWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQP------ 148 (182) T ss_pred EeecCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCC------ Confidence 788777666667777643221111100000000000011112222222222221 23334555532 Q ss_pred eecCchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 150 VKIPLSGP-LTQAFESATQSLIDEEIPKQLGYALKQQLRL 188 (192) Q Consensus 150 vkipis~p-lt~afe~e~~~~~~~~~~kEl~~~L~~qlr~ 188 (192) +.| +.-||++-.++ +++.|-+.+..+|+.+|== T Consensus 149 -----aqPFl~pA~~~~~~~-i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 149 -----ARQFMTPAANKMAKE-APEIIKRSIDQELHDKLGG 182 (182) T ss_pred -----CCcchHHHHHHhHHH-HHHHHHHHHHHHHHHhhcC Confidence 223 44444322221 2222222222222222222 No 9 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=96.45 E-value=0.00011 Score=42.38 Aligned_cols=160 Identities=13% Similarity=0.017 Sum_probs=75.1 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||++|++.|+++++. +.+++..|+...|.-+...+-..+...|+ .++.-+.+......+...+.+ T Consensus 3 i~Gld~L~~~L~~l~~~-~~~~~~~a~~~~a~~i~~~ak~~aPv~TG------------~Lr~sI~~~~~~~~~~~~~~v 69 (173) T protein:vir:10 3 VKGVAEVIAELRKIGKD-IDKNINATTEEAANFIEDRAKTLAPKNFG------------KLAQSISTSDLKAKDLISKKI 69 (173) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcCch------------hhhhcceeeeeccCceeEEee Confidence 99999999999998765 57888888888887666666554444332 234444443332233334444 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecC----------CCCchhheeeccccccccceee Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQL----------ANGRWHVMRRVNGKNRYPIDVV 150 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~----------~nGr~~V~~R~~Gk~R~PIevv 150 (192) +-+..--+-+-+||..+...+........+..=.++.-..+|.+-... .++-...|.+++|. T Consensus 70 ~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~-------- 141 (173) T protein:vir:10 70 TVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGI-------- 141 (173) T ss_pred CCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCC-------- Confidence 333333344466765543322110000000000011111111100000 00001113333442 Q ss_pred ecCchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 151 KIPLSGP-LTQAFESATQSLIDEEIPKQLGYALKQQLRLY 189 (192) Q Consensus 151 kipis~p-lt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~ 189 (192) | +.| |.=||+..- +++++.|...|+.+|+-+ T Consensus 142 --~-aqPFl~PA~~~~~-----~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 142 --N-PQPFLYPAWIEGK-----KQYLKDLENLLKTYNKKI 173 (173) T ss_pred --C-CCccchhHHHHhH-----HHHHHHHHHHHHHHhhcC Confidence 2 445 555555333 356666666666666666 No 10 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=96.18 E-value=0.00024 Score=40.42 Aligned_cols=124 Identities=11% Similarity=0.123 Sum_probs=54.3 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhh--hhhceeeeec-------C Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKL--VRQRVRLFKA-------G 71 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~--vr~R~r~~ka-------~ 71 (192) ++||++++++|..+....+|+....+++.+|..+...+.+ . .|+++ +|+=.....+ . T Consensus 9 ~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~----~----------tPVdTG~Lr~sw~~~~~~~~~~~~~ 74 (141) T protein:vir:79 9 FREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIR----R----------TPVDTGFLRQGWNGVAYARSLPVYK 74 (141) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH----h----------CCCcchhhcccccccccccccceee Confidence 8899999999998888889999999999999776654433 2 23222 1110000000 0 Q ss_pred CCCCeEEEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeee Q lcl|NC_010392. 72 TDGKRSARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVK 151 (192) Q Consensus 72 ~~~~~~a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvk 151 (192) +.+.....|.=+..--+.+-.|+- .+.|+.++||+| T Consensus 75 ~g~~~~v~v~n~~~YA~~VE~Ghr-----------------~~~~~gfV~G~f--------------------------- 110 (141) T protein:vir:79 75 QGNNYIIEVVNPTEYASYVNFGHR-----------------TKDGKGWVKGQH--------------------------- 110 (141) T ss_pred cCCeeEEEEecCCcchhhhhccee-----------------ecCCcceeCCch--------------------------- Confidence 011111111111111111222220 011122333332 Q ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 152 IPLSGPLTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 152 ipis~plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) |-+.....|+..|++.|...|+.-|+..|-= T Consensus 111 ----------ml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 111 ----------FLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred ----------hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1111122333444444444444444444433 No 11 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=96.03 E-value=0.00012 Score=42.12 Aligned_cols=135 Identities=13% Similarity=0.136 Sum_probs=65.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeec-CCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKA-GTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka-~~~~~~~a~ 79 (192) |+||+++++.|+.|+.. +.+++..|+..-|.-+...+-. .+|...--.+--.... .+.. T Consensus 9 i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~ak~--------------~ap~~~~~~~~~~~~~~~~~~----- 68 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAIAE--------------RAPRSPSPKKRSKSEPWRTGQ----- 68 (146) T ss_pred ehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHH--------------hCCCccccccccccccccccc----- Confidence 99999999999999876 6677777776655333322222 1332110000000000 0000 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeeccee--cCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYL--FRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~--f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) -.+. .|..+.++.. .++..+.||-.. -.++| +||..-- |....|= =|.-.| T Consensus 69 --~~~~---~i~~~~~~~~---------~g~~~~~vg~~~~~~~~~~-------y~~f~E~--GT~~~~a----~PFl~p 121 (146) T protein:vir:10 69 --HGAD---QIKVTKAKLE---------GGIKTVKIGLNKADRSPWF-------YLKFHEW--GTSKMPA----HPFIEP 121 (146) T ss_pred --cccc---cceecccccc---------ccceeEEeeeccCCCCCcc-------eeeeecc--CCCCCCC----CcchhH Confidence 0011 1111111111 112223343211 12233 2332222 4433331 266788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYAL 182 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L 182 (192) -.++-++++.+.|.+++.++|..+| T Consensus 122 a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 8888888888888888888888888 No 12 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=96.03 E-value=0.00012 Score=42.12 Aligned_cols=135 Identities=13% Similarity=0.136 Sum_probs=65.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeec-CCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKA-GTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka-~~~~~~~a~ 79 (192) |+||+++++.|+.|+.. +.+++..|+..-|.-+...+-. .+|...--.+--.... .+.. T Consensus 9 i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~ak~--------------~ap~~~~~~~~~~~~~~~~~~----- 68 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAIAE--------------RAPRSPSPKKRSKSEPWRTGQ----- 68 (146) T ss_pred ehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHH--------------hCCCccccccccccccccccc----- Confidence 99999999999999876 6677777776655333322222 1332110000000000 0000 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeeccee--cCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYL--FRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~--f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) -.+. .|..+.++.. .++..+.||-.. -.++| +||..-- |....|= =|.-.| T Consensus 69 --~~~~---~i~~~~~~~~---------~g~~~~~vg~~~~~~~~~~-------y~~f~E~--GT~~~~a----~PFl~p 121 (146) T protein:vir:10 69 --HGAD---QIKVTKAKLE---------GGIKTVKIGLNKADRSPWF-------YLKFHEW--GTSKMPA----HPFIEP 121 (146) T ss_pred --cccc---cceecccccc---------ccceeEEeeeccCCCCCcc-------eeeeecc--CCCCCCC----CcchhH Confidence 0011 1111111111 112223343211 12233 2332222 4433331 266788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYAL 182 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L 182 (192) -.++-++++.+.|.+++.++|..+| T Consensus 122 a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 8888888888888888888888888 No 13 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=96.03 E-value=0.00012 Score=42.12 Aligned_cols=135 Identities=13% Similarity=0.136 Sum_probs=65.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeec-CCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKA-GTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka-~~~~~~~a~ 79 (192) |+||+++++.|+.|+.. +.+++..|+..-|.-+...+-. .+|...--.+--.... .+.. T Consensus 9 i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~ak~--------------~ap~~~~~~~~~~~~~~~~~~----- 68 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAIAE--------------RAPRSPSPKKRSKSEPWRTGQ----- 68 (146) T ss_pred ehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHH--------------hCCCccccccccccccccccc----- Confidence 99999999999999876 6677777776655333322222 1332110000000000 0000 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeeccee--cCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYL--FRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~--f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) -.+. .|..+.++.. .++..+.||-.. -.++| +||..-- |....|= =|.-.| T Consensus 69 --~~~~---~i~~~~~~~~---------~g~~~~~vg~~~~~~~~~~-------y~~f~E~--GT~~~~a----~PFl~p 121 (146) T protein:vir:10 69 --HGAD---QIKVTKAKLE---------GGIKTVKIGLNKADRSPWF-------YLKFHEW--GTSKMPA----HPFIEP 121 (146) T ss_pred --cccc---cceecccccc---------ccceeEEeeeccCCCCCcc-------eeeeecc--CCCCCCC----CcchhH Confidence 0011 1111111111 112223343211 12233 2332222 4433331 266788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYAL 182 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L 182 (192) -.++-++++.+.|.+++.++|..+| T Consensus 122 a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 8888888888888888888888888 No 14 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=96.03 E-value=0.00012 Score=42.12 Aligned_cols=135 Identities=13% Similarity=0.136 Sum_probs=65.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeec-CCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKA-GTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka-~~~~~~~a~ 79 (192) |+||+++++.|+.|+.. +.+++..|+..-|.-+...+-. .+|...--.+--.... .+.. T Consensus 9 i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~ak~--------------~ap~~~~~~~~~~~~~~~~~~----- 68 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAIAE--------------RAPRSPSPKKRSKSEPWRTGQ----- 68 (146) T ss_pred ehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHH--------------hCCCccccccccccccccccc----- Confidence 99999999999999876 6677777776655333322222 1332110000000000 0000 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeeccee--cCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYL--FRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~--f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) -.+. .|..+.++.. .++..+.||-.. -.++| +||..-- |....|= =|.-.| T Consensus 69 --~~~~---~i~~~~~~~~---------~g~~~~~vg~~~~~~~~~~-------y~~f~E~--GT~~~~a----~PFl~p 121 (146) T protein:vir:10 69 --HGAD---QIKVTKAKLE---------GGIKTVKIGLNKADRSPWF-------YLKFHEW--GTSKMPA----HPFIEP 121 (146) T ss_pred --cccc---cceecccccc---------ccceeEEeeeccCCCCCcc-------eeeeecc--CCCCCCC----CcchhH Confidence 0011 1111111111 112223343211 12233 2332222 4433331 266788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYAL 182 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L 182 (192) -.++-++++.+.|.+++.++|..+| T Consensus 122 a~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 122 GFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred HHHHhHHHHHHHHHHHHHHHHhhcC Confidence 8888888888888888888888888 No 15 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=95.73 E-value=0.00052 Score=38.61 Aligned_cols=140 Identities=19% Similarity=0.224 Sum_probs=61.1 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeee--ecCCCCCeEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLF--KAGTDGKRSA 78 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~--ka~~~~~~~a 78 (192) |+||+++++.|+.|+.....+++..|+...|.-+...+-..+...+. .++.-++.. +......... T Consensus 8 i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g------------~l~~si~~~~~~~~~~~~~~~ 75 (149) T protein:vir:19 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTG------------KLKKNVVVVTQKSRRRGEISS 75 (149) T ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCch------------hhhhhccccccccccccceee Confidence 99999999999999888666777777777775555544443322211 111111110 0100111111 Q ss_pred EEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHH Q lcl|NC_010392. 79 RIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPL 158 (192) Q Consensus 79 ~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~pl 158 (192) .+.+.... .. .+.+...++...-.++| +|| |.= -|...-| -=|.-.|- T Consensus 76 ~v~~~~~~-----------------~~--~~~~~~~~~~~~~~~~~-------y~~-f~E-~GT~~~~----a~PF~~pA 123 (149) T protein:vir:19 76 GVHIRGVN-----------------PR--TGNSDNTMKANNPRNAF-------YWR-FVE-LGTANMP----AHPFVRPA 123 (149) T ss_pred cccccccc-----------------cc--cccccceeecCCCCccc-------eee-eec-cCCCCCC----CCcchhHH Confidence 11111000 00 00000111111111122 133 111 1322111 01444555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 159 TQAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 159 t~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) .++=+.++.+.|.++|.++|..+|+. T Consensus 124 ~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 124 YDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 55555666666666666666666555 No 16 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=95.49 E-value=0.00076 Score=37.70 Aligned_cols=128 Identities=15% Similarity=0.181 Sum_probs=64.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+.|+...-.++...|+...|.-+...+ + . .+|... ..+.+.....| T Consensus 7 i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~-k----~---------~ap~~~---------~~~~g~l~~~I 63 (135) T protein:vir:57 7 ISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADM-K----Q---------NAGYDN---------SSTNAHMRDSI 63 (135) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-H----H---------hCCCCC---------CCchhhHHhhc Confidence 99999999999999877656666677766664333222 1 1 233211 01111111111 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCch-hheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRW-HVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~-~V~~R~~Gk~R~PIevvkipis~plt 159 (192) .+. +.+ ..++. +.-.+.||.. +..+| .+|.= -|...-|=. |.-.|-. T Consensus 64 ~i~----------~~k----~~~~~---~~v~v~vg~~----------~~~~~~~~f~E-~GT~~~~a~----PF~~pa~ 111 (135) T protein:vir:57 64 KIR----------SSR----GKAGS---TVVVLRVGPT----------RSHYMKALAQE-FGTIKQVAK----PFIRPAL 111 (135) T ss_pred ccc----------ccc----ccccc---eeEEEEecCC----------CCcceeEeecc-cCCCCCCCC----cchhHhH Confidence 111 110 00111 1112344431 11223 33333 265444422 5667777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ++-..++.+.|.++|.++|..+.. T Consensus 112 ~~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 112 DYNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred HHhHHHHHHHHHHHHHHHHHHhcC Confidence 777777777777777777777766 No 17 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=95.41 E-value=0.00092 Score=37.24 Aligned_cols=138 Identities=17% Similarity=0.216 Sum_probs=58.5 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchh--hhhhceeeeec-CCCCCeE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLK--LVRQRVRLFKA-GTDGKRS 77 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k--~vr~R~r~~ka-~~~~~~~ 77 (192) |+||+++++.|+.|+.....+++..|+...|.-+...+-.. .|.. .++.=+.+... ...+. T Consensus 8 i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~--------------aP~~~g~l~~~i~~~~~~~~~g~-- 71 (148) T protein:vir:93 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR--------------APVRRGKLRRNVVVLSRRSRDGG-- 71 (148) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh--------------CCCCcchhhhhceeccccccCCc-- Confidence 99999999999999876555666666666554333332222 2211 11111111111 01111 Q ss_pred EEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 78 ARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 78 a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) ..+.+...... ...+. +...++...-..+| +||..- -|...-|=. |.=.| T Consensus 72 ~~~~v~~~~~~---------------~~~~~--~~~~~~~~~~~~~~-------y~~f~E--~GT~~~pa~----PFl~p 121 (148) T protein:vir:93 72 MESGVHIRGVN---------------PDTGN--SDNTMKADNPRNAF-------YWRFVE--MGTVNMPPH----PFVRP 121 (148) T ss_pred eeeeeeecccc---------------ccccc--ccceeecCCCCCcc-------eeeeec--cCCCCCCCC----cchhH Confidence 11111111100 00000 00111111111122 132211 143322211 44566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) -.++-.+++.+.|.++|.++|..+|+. T Consensus 122 A~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 122 AFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 666666667766666666666666655 No 18 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=95.29 E-value=0.00052 Score=38.61 Aligned_cols=140 Identities=12% Similarity=0.098 Sum_probs=68.4 Q ss_pred ChhHHHHHHHHhhcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLD-RQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~-~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) |+||++++++|+.|. ...+.++...||...|.-+...+-..+... .-+.+..+...+ +.+ T Consensus 9 i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~---------~~~~~~~~~~~~-----~~~----- 69 (149) T protein:vir:13 9 FEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHIS---------DDNSKSGRKGSR-----PPG----- 69 (149) T ss_pred eecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc---------CCcccccccccc-----ccc----- Confidence 999999999999995 456778888888776644433322211110 001111100000 000 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) --+.++++-++ +. ..+...+.||-..=++. ..=+||..-- |....|=. |.-.|-. T Consensus 70 --~~~d~i~~~~~---~~---------~~g~~~~~VG~~~~~~~-----~~~y~~f~E~--GT~k~~a~----pF~~pa~ 124 (149) T protein:vir:13 70 --HAANNIPEPKI---RK---------KKGNLQCVVGWEKSDNT-----PFYYMKMEEW--GTSERPPH----HAFGKTN 124 (149) T ss_pred --hhhhcceeccc---cc---------ccceeEEEeeccCCCCC-----ccceeeeecc--CccCCCCC----ccchHHH Confidence 11222222111 11 11223356663321110 0113543332 66555522 6667767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) ++-++++.+.|.++|.+++..+|=. T Consensus 125 ~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 125 KILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC Confidence 7777777777777666666666666 No 19 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=95.06 E-value=0.00098 Score=37.08 Aligned_cols=122 Identities=10% Similarity=0.164 Sum_probs=54.6 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||++++++|+.|+.. +++++..|+...|.-+...+-. .+|... .+.+... T Consensus 6 i~Gl~el~~~l~~l~~~-~~~~~~~al~~~a~~v~~~~k~--------------~ap~~~----------~~tg~l~--- 57 (127) T protein:vir:12 6 FDGIDDLTQYFEKIGGD-IEKVEPVALKAGGEIIAERQRS--------------HVNRSD----------KKQPHMQ--- 57 (127) T ss_pred ehhHHHHHHHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHH--------------hCCCCC----------CChhHHH--- Confidence 99999999999998765 5677777777666443333221 133110 0011111 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+| ..+..+.. ..+...+.||-.. +.+| +||..-= |...-|= =|.-.|..+ T Consensus 58 ----~~I---~~~~~k~~--------~~g~~~v~Vg~~~-~~~~-------y~~f~E~--GT~~~~a----~Pf~~pa~~ 108 (127) T protein:vir:12 58 ----DNI---TVSNVRES--------KDGVRFVAVGPNK-KVAY-------RGRFLEW--GTSKMPP----QPFIEKGGK 108 (127) T ss_pred ----Hhh---hccccccc--------cCceeEEEEeeCC-CCcc-------eeeeecc--CccCCCC----CccchHhHH Confidence 111 11111110 0112223454211 1122 2332211 4332221 155566666 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLG 179 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~ 179 (192) +-..++.+.|++.|.++|. T Consensus 109 ~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 109 EGEGPAVELMERILTAPIK 127 (127) T ss_pred HHHHHHHHHHHHHHHHhcC Confidence 6666666555555555554 No 20 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=94.82 E-value=0.0013 Score=36.38 Aligned_cols=134 Identities=16% Similarity=0.156 Sum_probs=65.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCC-CCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGT-DGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~-~~~~~a~ 79 (192) |+||++++++|+.|+.....+++..|+...|.-+...+-..+...+. .++.=+.+..... .+..... T Consensus 6 i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG------------~l~~sI~~~~~~~~~~~~~~~ 73 (140) T protein:vir:10 6 ILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTG------------KLKRNIVTAALKQKDSPGIAT 73 (140) T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChh------------hHHHhceecccccccccceeE Confidence 99999999999999877655666677766665444443333222211 1111111110000 0111111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) +.++.+ ..+. . ......|-+-|+.- |...-| -=|.-.|-. T Consensus 74 ~~~~~~----------------~~~~---~----~~~~~~~y~~f~E~-------------GT~~~~----a~PFl~pA~ 113 (140) T protein:vir:10 74 AGVRVR----------------TKGK---A----DSPNNAFYWRFVEL-------------GTQFMK----AEPFMRPAF 113 (140) T ss_pred Eeeccc----------------cccc---c----CCCCcccccceecc-------------CcCCCC----CCcchhhhH Confidence 111100 0000 0 00011222223321 311111 115556777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQQL 186 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ql 186 (192) ++=++++.+.|.+.+.+||..++..+| T Consensus 114 ~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 114 DASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 777788888888888888888888888 No 21 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=94.55 E-value=0.0013 Score=36.40 Aligned_cols=128 Identities=16% Similarity=0.174 Sum_probs=55.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||++++++|+.|+.++-.++...|+...|.- |.++.+ ..+|... ..++... T Consensus 6 i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~--------i~~~ak------~~ap~~~---------~~~~~~~---- 58 (133) T protein:vir:10 6 VKGLDELERQLTALGEKVATKVLRDAGREALKV--------VEEDMK------QHAGFDE---------TSTGQHM---- 58 (133) T ss_pred eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH--------HHHHHH------HhCCCCC---------Ccchhhh---- Confidence 999999999999997665445555666555533 333322 1133211 0011100 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) +.+|.+ +...+.+. ..+.-.+.||..- +.+ .+|| |.= -|...-|= =|.-.|-.+ T Consensus 59 ---~~~I~v--------~~~~~~~~-~~~~~~v~vg~~~-~~~-------~y~~-f~E-~GT~k~~a----~PF~~pA~~ 112 (133) T protein:vir:10 59 ---RDSIKI--------RSSTRKAQ-GNAVVTLRVGPSK-QHH-------MKVL-AQE-FGTVKQVA----DPFIRPALD 112 (133) T ss_pred ---hhcccc--------cccccccC-ccceEEEEecCCC-Ccc-------ceEe-eec-cCCCCCCC----CccchHHHH Confidence 111111 00000000 0111123444210 001 1222 333 25544431 155566666 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYA 181 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~ 181 (192) +-++++.+.|.+++.++|..- T Consensus 113 ~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 113 YNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred HhHHHHHHHHHHHHHHHhhcC Confidence 666666665555555544443 No 22 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=94.31 E-value=0.0031 Score=34.35 Aligned_cols=134 Identities=16% Similarity=0.159 Sum_probs=62.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCC-CCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGT-DGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~-~~~~~a~ 79 (192) |+||+++++.|+.|+..+..+++..|+...|.-+...+-..+...+. .++.=+......+ .+..... T Consensus 6 i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG------------~l~~sI~~~~~~~~~~~~~~~ 73 (140) T protein:vir:14 6 IIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTG------------KLRRNIVSAALRQKDAPGLAT 73 (140) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChh------------hHHhhcccccccccccceeEE Confidence 99999999999999888666677777777765554443332222211 1111111110000 0101111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) +.+..+. .+. ...+...|-..|+.- |....|= =|.=.|.. T Consensus 74 vg~~~~~----------------~~~-------~~~~~~~~y~~f~E~-------------GT~~~~a----~pFl~pa~ 113 (140) T protein:vir:14 74 AGVRVRT----------------KGK-------ADSPNNAFYWRFDEF-------------GTQHMKA----QPFMRPAF 113 (140) T ss_pred eeeeecc----------------ccc-------cCCCCccceeeeecc-------------ccCCCCC----CcchhHHH Confidence 1100000 000 000111122222221 3222221 14556666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQQL 186 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ql 186 (192) ++-+.++.+.|.+++.++|..+|..|= T Consensus 114 ~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 114 DASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 666677777777777777777776654 No 23 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=93.91 E-value=0.0017 Score=35.77 Aligned_cols=132 Identities=16% Similarity=0.243 Sum_probs=69.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++|+++++++|+.+++. +.++..+++.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~G~~~l~~~l~~~~~~-~~~~~~~~~~~~a~~v~~~ak~~aPv~TG------------~L~~Si~~~~--~~~~~~~~V 69 (137) T protein:vir:95 5 KYGNWDLVKELENYERD-MERWVKRGIAKTTAKIHNTIISLMPVDTG------------YLRESVTMDF--KDGGFTGVI 69 (137) T ss_pred HHhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCccch------------hhhcCeeeEe--eCCceEEEE Confidence 88999999999998775 67899999999887777766555444432 1222222211 123356666 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) .-+..-=+.+-.||......+.... .-.+.|.....+|.||- ++|. |= .| |. T Consensus 70 ~~~~~YA~~vE~GT~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~---t~g~----------~a-~PFl~ 122 (137) T protein:vir:95 70 NIGSEYAIYVNYGTGIYATGAGGSR-------------AKKIPWSYKDANGKWHT---TKGQ----------HA-QPFWE 122 (137) T ss_pred ecCCCcccccccCccccccCCCccc-------------ccccccceeccCcceee---cCCC----------CC-CcchH Confidence 6665555555667654432221111 11111222233455552 2231 11 23 66 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) .||+.+-. .|+++|. T Consensus 123 pA~~~~~~-~i~k~l~ 137 (137) T protein:vir:95 123 PAIDAGRA-FFNKYFS 137 (137) T ss_pred HHHHHHHH-HHHHhhC Confidence 66664433 2223333 No 24 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=93.84 E-value=0.0062 Score=32.69 Aligned_cols=148 Identities=17% Similarity=0.175 Sum_probs=61.1 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeee----cCCCCCe Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFK----AGTDGKR 76 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~k----a~~~~~~ 76 (192) |+||+++++.|+.|+.+.-.++...|+..-|.-+...+ ++.+-.... ..+-..++..+.+.. ....+.. T Consensus 9 i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~a-k~~ap~~~~------~~~~~~l~~~i~~~~~~~~~~~~~~~ 81 (164) T protein:vir:43 9 ITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAA-KQGAEKVDD------PGTGRSISDNIALRWNGRLFKRTGDL 81 (164) T ss_pred eecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-HHhCCcccC------CCccchhhhhhhhhcccCccccccce Confidence 99999999999999877555667777766664333322 222111000 000011121111110 0011111 Q ss_pred EEEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchH Q lcl|NC_010392. 77 SARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSG 156 (192) Q Consensus 77 ~a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~ 156 (192) ...+.+. .|+.....+. . ...-.|...| +||..-= |....|= =|.-. T Consensus 82 ~~~vg~~--------~~~~~~~~~~---~-----~~~~~~~~~~-----------y~~f~Ef--GT~km~a----~PFlr 128 (164) T protein:vir:43 82 GFRIGVL--------HGAVLPKKGE---R-----SDKTANAPTP-----------HWRLLEF--GTEDMRA----QPFMR 128 (164) T ss_pred eEEeccc--------cccccccccc---c-----cccCCCCCcc-----------eEEEeec--CCCCCCC----Ccchh Confidence 1111111 0110000000 0 0000111122 2332221 4332221 15556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 157 PLTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 157 plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) |-.++-.+++.+.|.++|.+||..+|+. .-++ T Consensus 129 PA~~~~k~~~~~~~~~~l~~~i~ka~~k----~~~~ 160 (164) T protein:vir:43 129 SALADNIAEVTSTFVSEYEKGIDRAIKR----AAKK 160 (164) T ss_pred hhHHHhHHHHHHHHHHHHHHHHHHHHHH----HHhh Confidence 6666666676666666666665555554 4444 No 25 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=93.83 E-value=0.0063 Score=32.67 Aligned_cols=163 Identities=14% Similarity=0.218 Sum_probs=60.4 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCC----CCCe Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGT----DGKR 76 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~----~~~~ 76 (192) |+||+++++.|+.|+.++--++...|+.+-|.-+. ..+++.|-... .....-.++..+.+...+. .+.. T Consensus 9 i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~-~~ak~~ap~~~------~~~~~~~l~~~i~~~~~~~~~~~~g~~ 81 (179) T protein:vir:18 9 LTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIR-DRARSNASRVD------DPLTKEAIHKNIVASFSSKQFRRTGDL 81 (179) T ss_pred eecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH-HHHHHhCCccc------cccchhhhhhheeecccccccccccce Confidence 88999999999999877556677777766663332 22332221100 0011122333333322111 1222 Q ss_pred EEEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchH Q lcl|NC_010392. 77 SARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSG 156 (192) Q Consensus 77 ~a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~ 156 (192) ...+.+..+..+-... .....++..+. .....|...-+ ...--+||.--- |....|= =|.=. T Consensus 82 ~~~vgv~~~~~~~~~~-----~~~~~~~~~~~--~~~~~g~~~~~-----~~~~~y~~fvEf--GT~kmpa----~PFlr 143 (179) T protein:vir:18 82 AFRVGVMGGARQYANT-----KANVRKGRAGK--TYKTSGDKGNP-----GGDTWYWRFLEF--GTEHTSA----RPILR 143 (179) T ss_pred eEeeeccccccccccc-----ccccccCcccc--cccccccccCC-----CCccceeEEecc--CCCCCCC----Cccch Confidence 2222222222221100 00011111000 00011110000 001112332222 4321111 13334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 157 PLTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 157 plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) |-.++=.+++.+. |..+|..+|...|+..-++ T Consensus 144 PA~~~~~~~a~~~----i~~~l~~~i~k~lk~~~~~ 175 (179) T protein:vir:18 144 PAMNGVDNDVINV----FSTEMGKAIDRAIRLAMKK 175 (179) T ss_pred hhHHhhHHHHHHH----HHHHHHHHHHHHHHhhccc Confidence 4444334444433 4444444444445555555 No 26 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=93.59 E-value=0.0018 Score=35.57 Aligned_cols=132 Identities=13% Similarity=0.205 Sum_probs=66.5 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++||++++++|+++++. +..++.+|++++|..+.+.+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~Gl~~l~~~l~~~~~~-~~~~~~~al~~~a~~i~~~ak~~aPvdTG------------~Lr~SI~~~~--~~~~~~~~V 69 (137) T protein:vir:10 5 KYGNWELVKELEDFEKE-TIRWAKKGIAKTTTIIHNSIVSNMPVDTG------------YLRESVSMDF--KKGGLTGVI 69 (137) T ss_pred HhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcCcc------------hhhcCeeEEe--eCCcEEEEE Confidence 67999999999998775 67888999999998887777665554442 1222222211 123345566 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+...=+.+-+||......+.. +.+ ....+......|.|+. ++|. | +.| |. T Consensus 70 ~~~~~Ya~~vE~GT~~~~~~~~~----------~~~---~~~~~~~~~~~~~~~~---t~g~----------~-a~PFl~ 122 (137) T protein:vir:10 70 NIGSEYAVYVNYGTGIYAVGPGG----------SRA---KNIPWCYKDADGHWHT---TKGQ----------H-AQPFWE 122 (137) T ss_pred ecCCCcccccccCccccccCCCc----------ccc---ccccceeeccccceec---cCCC----------C-CCcchh Confidence 65555555556665433211100 000 0111111112333332 2221 1 123 66 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) .||+++-. .|++++. T Consensus 123 pA~~~~~~-~i~k~i~ 137 (137) T protein:vir:10 123 PAIDEGRA-FFNKYFS 137 (137) T ss_pred HHHHHHHH-HHHHhcC Confidence 67765443 2223333 No 27 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=93.39 E-value=0.0024 Score=34.91 Aligned_cols=132 Identities=15% Similarity=0.207 Sum_probs=69.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++|++++++.|+.+++. +.++...++.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~g~~~l~~~l~~~~~~-~~~~~~~~~~~~a~~i~~~ak~~aPvdTG------------~Lr~SI~~~~--~~~~~~~~V 69 (137) T protein:vir:93 5 KYGNWDLVKELENYERD-MERWVKRGIAKTTAKIHNTIISLMPVDTG------------YLRESVTMDF--KDSGFTGVI 69 (137) T ss_pred HHhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcccc------------chhccceeEe--ecCceEEEE Confidence 88999999999998775 67899999999888777777665544442 1122222211 223456777 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+..-=+.+-.||......+.... .-.+.|.....+|.||- ++| .|= .| |. T Consensus 70 ~~~~~YA~~vE~GT~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~---t~g----------~~a-~PFl~ 122 (137) T protein:vir:93 70 NIGSEYAIYVNYGTGIYATGAGGSR-------------AKKIPWSYKDANGKWHT---TKG----------QHA-QPFWE 122 (137) T ss_pred ecCCCcccccccCccccccCCCccc-------------ccccccceeccCcceee---cCC----------CCC-CcchH Confidence 7766666666777754332221100 00111112223455542 222 111 23 55 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) -||+..... |+++|. T Consensus 123 pA~~~~~~~-~~~~l~ 137 (137) T protein:vir:93 123 PAIDAGRAF-FNKYFS 137 (137) T ss_pred HHHHHHHHH-HHHhhC Confidence 666544432 223333 No 28 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=93.39 E-value=0.0024 Score=34.91 Aligned_cols=132 Identities=15% Similarity=0.207 Sum_probs=69.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++|++++++.|+.+++. +.++...++.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~g~~~l~~~l~~~~~~-~~~~~~~~~~~~a~~i~~~ak~~aPvdTG------------~Lr~SI~~~~--~~~~~~~~V 69 (137) T protein:vir:97 5 KYGNWDLVKELENYERD-MERWVKRGIAKTTAKIHNTIISLMPVDTG------------YLRESVTMDF--KDSGFTGVI 69 (137) T ss_pred HHhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcccc------------chhccceeEe--ecCceEEEE Confidence 88999999999998775 67899999999888777777665544442 1122222211 223456777 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+..-=+.+-.||......+.... .-.+.|.....+|.||- ++| .|= .| |. T Consensus 70 ~~~~~YA~~vE~GT~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~---t~g----------~~a-~PFl~ 122 (137) T protein:vir:97 70 NIGSEYAIYVNYGTGIYATGAGGSR-------------AKKIPWSYKDANGKWHT---TKG----------QHA-QPFWE 122 (137) T ss_pred ecCCCcccccccCccccccCCCccc-------------ccccccceeccCcceee---cCC----------CCC-CcchH Confidence 7766666666777754332221100 00111112223455542 222 111 23 55 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) -||+..... |+++|. T Consensus 123 pA~~~~~~~-~~~~l~ 137 (137) T protein:vir:97 123 PAIDAGRAF-FNKYFS 137 (137) T ss_pred HHHHHHHHH-HHHhhC Confidence 666544432 223333 No 29 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=93.39 E-value=0.0024 Score=34.91 Aligned_cols=132 Identities=15% Similarity=0.207 Sum_probs=69.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++|++++++.|+.+++. +.++...++.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~g~~~l~~~l~~~~~~-~~~~~~~~~~~~a~~i~~~ak~~aPvdTG------------~Lr~SI~~~~--~~~~~~~~V 69 (137) T protein:vir:94 5 KYGNWDLVKELENYERD-MERWVKRGIAKTTAKIHNTIISLMPVDTG------------YLRESVTMDF--KDSGFTGVI 69 (137) T ss_pred HHhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcccc------------chhccceeEe--ecCceEEEE Confidence 88999999999998775 67899999999888777777665544442 1122222211 223456777 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+..-=+.+-.||......+.... .-.+.|.....+|.||- ++| .|= .| |. T Consensus 70 ~~~~~YA~~vE~GT~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~---t~g----------~~a-~PFl~ 122 (137) T protein:vir:94 70 NIGSEYAIYVNYGTGIYATGAGGSR-------------AKKIPWSYKDANGKWHT---TKG----------QHA-QPFWE 122 (137) T ss_pred ecCCCcccccccCccccccCCCccc-------------ccccccceeccCcceee---cCC----------CCC-CcchH Confidence 7766666666777754332221100 00111112223455542 222 111 23 55 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) -||+..... |+++|. T Consensus 123 pA~~~~~~~-~~~~l~ 137 (137) T protein:vir:94 123 PAIDAGRAF-FNKYFS 137 (137) T ss_pred HHHHHHHHH-HHHhhC Confidence 666544432 223333 No 30 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=93.39 E-value=0.0024 Score=34.97 Aligned_cols=115 Identities=13% Similarity=0.157 Sum_probs=50.5 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecC-CCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAG-TDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~-~~~~~~a~ 79 (192) |+||++++++|+.+++.+ .++...|+...+..+...+...+...|+ .++.=+++.... ..+... T Consensus 9 ~~Gld~l~~~L~~~~~~~-~~~v~~al~~~a~~i~~~ak~~ap~~tG------------~L~~sI~~~~~~~~~~~~~-- 73 (125) T protein:vir:94 9 FKGVDKLLDEFDISRKEL-VPYSVEAMKTSLSRAVEKSKGLARVDTG------------YMRNNIQQDEVKEEHGVVT-- 73 (125) T ss_pred ehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHhhCCCCCh------------hhhhhceecceeccCCcEE-- Confidence 899999999999987764 4666778877776655544332222211 011111110000 001111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) ..||....-.-|+-- |...-|=. |.-.| T Consensus 74 ---------------------------------~~v~~~~~Ya~~vEf-------------GT~~~~a~----Pfl~p-- 101 (125) T protein:vir:94 74 ---------------------------------GRYVARADYSSYNEY-------------GTYRMSAQ----PFMAP-- 101 (125) T ss_pred ---------------------------------EEeeCCCCccceeec-------------ccccCCCC----cccch-- Confidence 223322211222211 32222211 44333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) +|+.. ..++...|+..|+..+|| T Consensus 102 -a~~~~---------~~~~~~~l~~~l~~a~k~ 124 (125) T protein:vir:94 102 -SVAAM---------TPFFYKAVRDALNKAAKF 124 (125) T ss_pred -hHHHH---------HHHHHHHHHHHHHHHhcc Confidence 44422 223444445555555556 No 31 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=93.26 E-value=0.0074 Score=32.26 Aligned_cols=134 Identities=15% Similarity=0.140 Sum_probs=60.4 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCC-CCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTD-GKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~-~~~~a~ 79 (192) |+||+++++.|+.|+...-.++...|+...|.-+...+-..+...+. .++.-+......+. ...... T Consensus 6 i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG------------~l~~~i~~~~~~~~~~~~~~~ 73 (140) T protein:vir:80 6 IVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTG------------KLRRNIVSAALRQKDAPGLAT 73 (140) T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc------------hhhhceeeeccccccccceee Confidence 99999999999999877666666677776665444433322211110 01111111100000 000111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) +.+..+.-. . ...+...|-..|+.- |....| -=|.=.|-. T Consensus 74 ~~~~~~~~~-------------~----------~~~~~~~~y~~f~E~-------------GT~~~~----a~PFl~pA~ 113 (140) T protein:vir:80 74 AGVRVRTKG-------------K----------ADSPSNAFYWRFDEF-------------GTQHMK----AQPFMRPAF 113 (140) T ss_pred eeeeccccc-------------c----------cCCCCCcceeeeecc-------------CCCCCC----CCcchhhhH Confidence 111100000 0 000111122223221 322222 115556666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQQL 186 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ql 186 (192) ++-++++.+.|++.+.++|..+|..+= T Consensus 114 ~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 114 DASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 666777777777777776666666553 No 32 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=93.05 E-value=0.007 Score=32.41 Aligned_cols=134 Identities=16% Similarity=0.130 Sum_probs=59.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCC-CCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGT-DGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~-~~~~~a~ 79 (192) |+||+++++.|++|+...-.++...|+...|.-+...+-..+...++ .++.-+....... ....... T Consensus 6 i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG------------~l~~sI~~~~~~~~~~~~~~~ 73 (140) T protein:vir:10 6 IIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTG------------KLRRNIVSAALRQKDAPGLAT 73 (140) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChh------------hHHHhccccccccccccceEE Confidence 99999999999999876555666666666554443333222212111 1111111111000 0111111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) +.+..+.-+ . ...+...|-..|+.- |...-| -=|.=.|-. T Consensus 74 ~g~~~~~~~----------------~-------~~~~~~~~y~~f~E~-------------GT~~~~----a~PFl~pA~ 113 (140) T protein:vir:10 74 AGVRVRTKG----------------K-------ADSPNNAFYWRFDEF-------------GTQHMK----AQPFMRPAF 113 (140) T ss_pred eeeeecccc----------------c-------cCCCCccceeeeecc-------------CCCCCC----CCcchhhhH Confidence 111100000 0 000111222223221 322122 114456666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQQL 186 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ql 186 (192) ++=++++.+.|+++|.++|..+|.-|= T Consensus 114 ~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 114 DASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 666677777777777777777666654 No 33 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=92.93 E-value=0.005 Score=33.21 Aligned_cols=124 Identities=18% Similarity=0.189 Sum_probs=60.9 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||++++++|++|... +.++...|+...|.- +.++.+ ..+|... ..+... T Consensus 2 v~Gl~el~~~l~~l~~~-~~~~~~~al~~ga~~--------~~~~~k------~~ap~~~---------~~~~~h----- 52 (125) T protein:vir:97 2 TKGLDEILANLTKLEVK-APKTAKAAVTEVAKE--------FEKALK------ANTPVYE---------VETDER----- 52 (125) T ss_pred chhHHHHHHHHHHhhHH-HHHHHHHHHHHHHHH--------HHHHHH------HhCCcCC---------CCchhh----- Confidence 99999999999998764 556666666655433 333221 1133211 100000 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) -+.+| ..++++.. ..+...+.||-.. ..+|. ||..-= |..+-|=+ |.-.|-.+ T Consensus 53 --l~d~I---~~~~~k~~--------~~g~~~~~VG~~k-~~~~y-------~~f~E~--GT~k~~~~----pF~~pa~~ 105 (125) T protein:vir:97 53 --LQEDT---VISGFKGA--------NVGIVSKEIGYGK-ATGWR-------AHYPND--GTIYQRGQ----DFKERTIN 105 (125) T ss_pred --HHhhh---hccccccc--------ccCceEEEEeecC-CCcee-------Eeeecc--CccCCCcC----ccchHhHH Confidence 01111 12111111 0122234555321 12333 332222 54444322 67778778 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~ 180 (192) +...++.+.|.+.|.++|+= T Consensus 106 ~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 106 QMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred HhHHHHHHHHHHHHHHHhcC Confidence 87888777777777666654 No 34 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=91.85 E-value=0.0058 Score=32.85 Aligned_cols=132 Identities=16% Similarity=0.234 Sum_probs=67.9 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) .+|+++++++|+++++. +.++..+++.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~G~~~l~~~L~~~~~~-~~~~~~~al~~~a~~v~~~ak~~aPvdTG------------~Lr~SI~~~~--~~~~~~~~V 69 (137) T protein:vir:94 5 KYGNWDLVKELENYERD-IERWVKRGIAKTTVKIHNTIISLMPVDTG------------YLRESVTMDF--KDGGFTGVI 69 (137) T ss_pred HHhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcCcc------------hhhcCceeEe--ecCcEEEEE Confidence 46999999999998876 67888999988888777766655544442 1222222221 123356666 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+..-=+.+-.||......+.... .-.+.+...-..|.|| ++.|- | +.| |. T Consensus 70 ~~~~~YA~~vE~GT~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~---~t~g~----------~-a~PFl~ 122 (137) T protein:vir:94 70 NIGSEYAIYVNYGTGIYATGAGGSR-------------AKKIPWSYKDANGKWH---TTKGQ----------H-AQPFWE 122 (137) T ss_pred ecCCCcccccccCccccccCCCccc-------------ccccccceeccCCcee---ecCCc----------C-CCcchH Confidence 6666555566677654332221111 0111111122234443 22231 1 233 56 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) .||+.+-.. |+++|. T Consensus 123 pA~~~~~~~-~~~~l~ 137 (137) T protein:vir:94 123 PAIDAGRVF-FNKYFS 137 (137) T ss_pred HHHHHHHHH-HHHhhC Confidence 666544432 222222 No 35 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=91.69 E-value=0.0062 Score=32.70 Aligned_cols=124 Identities=16% Similarity=0.086 Sum_probs=61.9 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||++++++|+.|... +.++...|+...|.- +.++.+ ..+|... . ....+. T Consensus 5 i~Gl~el~~~l~~l~~~-~~k~~~~al~~ga~~--------~~~~~k------~~ap~~~---------~--~~~~~~-- 56 (128) T protein:vir:38 5 VTGDAELLANLNKLQFG-VAKEARAAVRDGAQK--------FADKLK------SNTPEWD---------G--ETDMSG-- 56 (128) T ss_pred hhhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHH--------HHHHHH------HhCCCcC---------C--CCcccc-- Confidence 99999999999999765 567776666665433 322221 1233211 0 000000 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) --..+| .++.++. ..+...+.||-.. +.+|. ||..-= |....|=+ |.-.|-.+ T Consensus 57 -h~~d~I---~~~~~k~---------~~g~~~~~VG~~k-~~~~y-------~~f~E~--GT~k~~a~----pF~~pa~~ 109 (128) T protein:vir:38 57 -HLRDDI---KLSSVRE---------TSGLTEVDVGYGK-DTGWR-------AHFPNS--GTSMQDPQ----HFIEETQE 109 (128) T ss_pred -hhhhhh---ccccccc---------cCceeEEEeeecC-CCceE-------Eeeecc--CccCCCCC----cchhHHHH Confidence 001111 1111110 0122335666321 12332 332222 44433322 66778777 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLG 179 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~ 179 (192) +-+.|+.+.|.++|-+++. T Consensus 110 ~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 110 IMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred HhHHHHHHHHHHHHHhhcC Confidence 7788888877777777777 No 36 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=91.38 E-value=0.0082 Score=32.01 Aligned_cols=110 Identities=15% Similarity=0.190 Sum_probs=48.6 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+.+...+ .....+|+...|..+...+-. ..|.. .+.+...| T Consensus 5 i~Gld~l~~~l~~~~~~~-~~~v~~al~~~a~~i~~~ak~--------------~aPv~-------------TG~Lr~sI 56 (114) T protein:vir:95 5 WQGIEKLVATISNAQPKA-VEQSLQVLKNNGEKGKRIAKQ--------------LAPKD-------------TEFLKDHI 56 (114) T ss_pred eehHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH--------------hCCcC-------------chhhhhce Confidence 999999999999988664 466677776666444332222 13321 12222222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+.. .+....||...+-+.|+.- |...-|- =|.-.|-.+ T Consensus 57 ~~~~------------------------~g~~~~V~~~~~Ya~yvE~-------------GT~~~~a----qPfl~pa~~ 95 (114) T protein:vir:95 57 TTSY------------------------PGMEAHIHGEAGYDGYQEY-------------GTRFQPG----TPHFRPMME 95 (114) T ss_pred eeec------------------------CceEEEeecCCCccceeec-------------CccccCC----CccchhhHH Confidence 2210 0111234433333444433 3211111 144444433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ....++. ++|.++|...|+ T Consensus 96 ~~~~~~~----~~l~~~l~~~~k 114 (114) T protein:vir:95 96 QIQPQFQ----KDMTDVMKGAFK 114 (114) T ss_pred HHHHHHH----HHHHHHHHhhcC Confidence 3333333 333333333333 No 37 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=91.15 E-value=0.015 Score=30.63 Aligned_cols=122 Identities=16% Similarity=0.342 Sum_probs=58.0 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeE-- Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRS-- 77 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~-- 77 (192) ++|++++++||++ +.++.+.+.+.+||+..|..+...--+ .+.+|+.....-.+ T Consensus 13 vkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~-----------------------~~~~fkDTGat~dev~ 69 (138) T protein:vir:98 13 LKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKS-----------------------AISIYKRTGETTESAV 69 (138) T ss_pred ccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHh-----------------------hhhhhhhccceeeeee Confidence 9999999999999 889999999999999888655443333 33333321111111 Q ss_pred -EEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchH Q lcl|NC_010392. 78 -ARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSG 156 (192) Q Consensus 78 -a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~ 156 (192) +.++ ..+..+-|++|=. +.|.+ .+-..-+-| |+| -+-.|. + T Consensus 70 ~s~p~-~~~G~r~V~igW~-----GpR~~------ivHLNE~Gy----------Gk~---i~PrG~-------------G 111 (138) T protein:vir:98 70 VSGVR-REDGIPKVKLGFT-----TPRWN------IVHLQELEY----------GWK---HNRRGV-------------G 111 (138) T ss_pred ecCee-ecCCceEEEEeee-----cCeee------EEeeecccc----------cCC---cCCCcc-------------h Confidence 1111 1122344444311 00111 011111111 111 010121 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 157 PLTQAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 157 plt~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) .| +.+-++.+..|-+.+..||..+|.- T Consensus 112 ~I-~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 112 VI-RRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred HH-HHHHHhhhHHHHHHHHHHHHHHhcC Confidence 12 2233344555555566666666665 No 38 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=89.39 E-value=0.015 Score=30.51 Aligned_cols=132 Identities=14% Similarity=0.237 Sum_probs=65.3 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) .+|++++++.|+++++. +.+++.+|+.+.|.-+.+.+-..+...|+ .++.=+++.. ..+...+.| T Consensus 5 ~~G~~~l~~~l~~~~~~-~~~~~~~al~~~a~~i~~~ak~~aPv~TG------------~Lr~SI~~~~--~~~~~~~~V 69 (137) T protein:vir:10 5 KYGNWDLVKELEEFEKE-TIRWAKKGIAKTTTIIHNSIVSNMPVDTG------------YLRESVSMDF--KKGGLTGVI 69 (137) T ss_pred hhCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcCcc------------hhhcCeeeEe--cCCcEEEEE Confidence 45999999999998774 66788888888886665554444333331 1222222221 223356666 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+...-+.+-+||....-.+.... .++. + |.-.-.+|.|| +++|. | +.| |. T Consensus 70 ~~~~~YA~~vE~GT~~~~~~~~~~~------~~~~-----~--~~~~~~~~~~~---~t~g~----------~-a~Pfl~ 122 (137) T protein:vir:10 70 NIGSEYAVYVNYGTGIYAVGPGGSR------AKNI-----P--WRYKDADGHWH---TTKGQ----------H-AQPFWE 122 (137) T ss_pred ecCCccccccccCccccccCCCccc------cccc-----c--eeeeccccccc---cCCCC----------C-CCcchh Confidence 6666666666667644332111000 0011 1 11111234332 22231 1 123 66 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) -||+++.. .|+.+|. T Consensus 123 pA~~~~~~-~i~k~i~ 137 (137) T protein:vir:10 123 PAIDEGRA-FFNKYFS 137 (137) T ss_pred HHHHHHHH-HHHHhhC Confidence 67765443 2333333 No 39 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=89.39 E-value=0.013 Score=30.86 Aligned_cols=132 Identities=12% Similarity=0.218 Sum_probs=70.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++|++++++.|+.+.+. +.+++.+++.+.|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~G~~~l~~~l~~~~~~-~~~~~~~~l~~~a~~~~~~ak~~~pvdTG------------~L~~Si~~~~--~~~g~~~~V 69 (137) T protein:vir:96 5 KYGNWDLVAELEDYRDE-MEEWVKKGILKTTLAIYNTAVALAPVDLG------------FLKESIDFKV--TDGGFSSVI 69 (137) T ss_pred HhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcCcc------------chhcCceeEe--ecCceEEEE Confidence 77999999999998655 67888999999988777766655544442 1122222211 123466777 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+..-=+-+-.||......+.... -+.+.-.| .-.+|.|+ +++|. | +.| |. T Consensus 70 ~~~~~YA~yvE~GT~~~~~~~~~~~-------~~~~~~~~------~~~~~~~~---~t~g~----------~-a~pFl~ 122 (137) T protein:vir:96 70 SVGAEYAIYVEFGTGIYATGPGGSR-------ARKLPWTY------KGDDGEWH---TTYGQ----------Q-AQPFWN 122 (137) T ss_pred ecCCCcccccccCccccccCCCccc-------ccccccee------eccCccee---ecCCC----------C-CCcchh Confidence 7776666666777754432221100 00111111 11233332 22331 2 233 77 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIP 175 (192) Q Consensus 160 ~afe~e~~~~~~~~~~ 175 (192) .||+++-. .|+.+|. T Consensus 123 pA~~~~~~-~i~k~i~ 137 (137) T protein:vir:96 123 PAIDEGRK-VFNRYFS 137 (137) T ss_pred HHHHHHHH-HHHHhhC Confidence 77775554 2333333 No 40 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=89.29 E-value=0.018 Score=30.14 Aligned_cols=134 Identities=16% Similarity=0.231 Sum_probs=69.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++|++++.++|+.+++. +.+++.+++.++|..+.+.+-..+...|+ .++.=+.... ..+..++.| T Consensus 10 ~~g~~~l~~~l~~~~~~-~~~~v~~~l~~~a~~i~~~ak~~apv~TG------------~Lr~SI~~~~--~~~g~~~~V 74 (144) T protein:vir:59 10 PSWRRIMSRNVRTFSGH-VLTQVEQVIIKTAEKIAGLAASLAPVDEG------------NLKNSIQIDY--KNNGLTAEI 74 (144) T ss_pred hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCccch------------hhhcCeeEEe--ecCcEEEEE Confidence 78999999999998777 67888899999887776665544433332 1222222221 223467777 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+...-+.+-.||......+.... +...|-+. ..|.|+ ++.| +| +.| +. T Consensus 75 ~~~~~YA~~vE~GT~~~~~~~~~~~----------~~~~~~~~-----~~g~~~---~t~g----------~~-a~Pfl~ 125 (144) T protein:vir:59 75 TVGAEYAIYVEYGTGIYAVDGNGRK----------TPWTYYSP-----KLGRYV---RTQG----------AP-AQPFFW 125 (144) T ss_pred ecCCCccchhhcCccccccCCCccc----------cccccccc-----ccccee---cCCC----------CC-CCcchh Confidence 7777766677778755432221101 01111111 123322 3223 12 223 55 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLG 179 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~ 179 (192) .||+++-+ .|.++|.+=.+ T Consensus 126 pA~~~~~~-~~~~~i~~~~g 144 (144) T protein:vir:59 126 PAVEEGGE-YFEREMRRLRG 144 (144) T ss_pred HHHHHHHH-HHHHHHHHhcC Confidence 66665432 22233333333 No 41 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=89.17 E-value=0.016 Score=30.43 Aligned_cols=108 Identities=15% Similarity=0.193 Sum_probs=45.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+.+... +..+...|+.+.|..+...+-. ..|.. .+.+...| T Consensus 1 i~Gld~l~~~l~~~~~~-~~~~v~~al~~~a~~i~~~ak~--------------~aPv~-------------TG~Lr~sI 52 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKS-VRIAVDKELSKSAARIERQAKI--------------LAPVD-------------TGWLRAQI 52 (108) T ss_pred CchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh--------------cCCcC-------------chhhhcce Confidence 99999999999997764 6677777777776554443222 13321 12222222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+.. . ++.-..||....=..|+--.-. .++++ |.-.| T Consensus 53 ~~~~---------------~--------~~~~~~v~~~~~Ya~~vE~GT~-------~m~a~----------Pf~~p--- 89 (108) T protein:vir:99 53 YSEQ---------------Q--------RLLHYRVVSPALYSIYLELGTR-------KMEAQ----------SFLDP--- 89 (108) T ss_pred eeee---------------c--------CcEEEEeecCcccchhcccCcc-------ccCCC----------cchhh--- Confidence 1110 0 0001122222221222222211 11111 44344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) ||+..-. .|.++|...|+. T Consensus 90 a~~~~~~-----~~~~~i~~~lrk 108 (108) T protein:vir:99 90 ALRKEWP-----VLMANIKKMFKR 108 (108) T ss_pred hHHHHHH-----HHHHHHHHHhcC Confidence 4443221 223333322222 No 42 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=89.02 E-value=0.014 Score=30.68 Aligned_cols=125 Identities=19% Similarity=0.240 Sum_probs=57.2 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+.+||+..+..+ .+.++..+.+|+.....-.++. T Consensus 5 vkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v-----------------------~~~lK~~~~~fkDTGati~ev~ 61 (133) T protein:vir:78 5 VTGVEELERQLVSLFGRENLPQLVDPALIAGATLV-----------------------AKTLKSEFVQFKDTGASIDEIN 61 (133) T ss_pred EecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHH-----------------------HHHHHHhhcchhcccceeeeEE Confidence 9999999999998 888888888888888877332 3345566667765322211211 Q ss_pred ---EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchH Q lcl|NC_010392. 80 ---IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSG 156 (192) Q Consensus 80 ---i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~ 156 (192) .. ..+..+.|++|= .+.. .....+-.. . |+ |.| .||-=-|=. -+ T Consensus 62 ~s~p~-~~~G~r~V~i~W-----~gp~----~R~~iVHLN----E-----------~G-Ytr-~Gk~i~PrG------~G 108 (133) T protein:vir:78 62 IEKPS-YDKGVRSIKIDW-----KGPK----DRYKIIHLN----E-----------YG-YTR-NGKKITPAG------TG 108 (133) T ss_pred ecCee-eeCCceEEEEEE-----ecCC----CceeEEEee----c-----------cc-eec-CCCeEccch------hh Confidence 11 122234444421 0000 000011111 1 11 233 232100100 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 157 PLTQAFESATQSLIDEEIPKQLGYAL 182 (192) Q Consensus 157 plt~afe~e~~~~~~~~~~kEl~~~L 182 (192) .|.. +-++.+..|.+.+..||...| T Consensus 109 ~i~~-a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 109 SVAR-SLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred HHHH-HHHhhhHHHHHHHHHHHHhhC Confidence 2222 223344444444445555444 No 43 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=88.87 E-value=0.015 Score=30.61 Aligned_cols=130 Identities=15% Similarity=0.277 Sum_probs=61.9 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ..||++++++|+.+++. +.+++.+|+..+|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 5 ~~Gl~~l~~~l~~~~~~-~~~~~~~al~~~a~~v~~~ak~~apvdTG------------~Lr~SI~~~~--~~~g~~~~V 69 (135) T protein:vir:96 5 KYGADSIVVDLEKYSKD-MEKWVKKGITKTTLKIYNTAIHLMPVDTG------------FLRQSTTVDF--ENGGFTGVV 69 (135) T ss_pred hhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCccch------------hhhcceeEEe--ecCcEEEEE Confidence 34999999999998775 57888888888877766655443332221 1222222221 123345566 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) .-+..--+.+-.||......+..+. -+...| ..++|.|+ +++|. | +.| |. T Consensus 70 ~~~~~YA~~ve~GT~~~~~~~~~~~---------~~~~~~------~~~~g~~~---~~~~~----------~-a~pfl~ 120 (135) T protein:vir:96 70 KIGSNYAVYVNYGTGIYATKGSRAH---------KIPWTY------KDPNGKWH---TTYGQ----------M-PQPFWE 120 (135) T ss_pred ecCCCccchhhcccccccCCCcccc---------cccccc------ccCCccee---ecCCc----------C-CCcchh Confidence 5555555555667654433322211 001111 11223322 22232 1 223 55 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLG 179 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~ 179 (192) .||+..-. . |++.|. T Consensus 121 ~A~~~~~~-~----~~~~i~ 135 (135) T protein:vir:96 121 PAIDAGRQ-T----FEQYFS 135 (135) T ss_pred HHHHHHHH-H----HHHhcC Confidence 56653222 2 233333 No 44 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=88.71 E-value=0.017 Score=30.29 Aligned_cols=132 Identities=12% Similarity=0.196 Sum_probs=68.3 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ..||+++++.|+++++. +.+++.+|+.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 17 ~~Gld~l~~~L~~~~~~-~~~~~~~al~~~a~~v~~~ak~~aPvdTG------------~Lr~SI~~~~--~~~g~~~~V 81 (149) T protein:vir:94 17 KYGADSMVVELDKFDKK-IEEWVKKGIAKTTTKIYNTAVALAPVDLG------------FLEESIDFKY--FDGGLSSVI 81 (149) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcccc------------hhhcCeeEEe--eCCcEEEEE Confidence 56999999999998776 67899999999888877776555544432 1222222221 223356666 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) .-+..-=+.+-.||....-.+. +... .-....|.+. + +.+.+++|. | +.| |. T Consensus 82 ~~~~~YA~~VE~GT~~~~~~~~-~~~~------~~~~~~~~~~------~---~~~~~~~g~----------~-a~PFl~ 134 (149) T protein:vir:94 82 SVGADYAIYVEYGTGIYATGPG-GSRA------TKIPWSFKGD------D---GEWYTTYGQ----------A-PQPFWN 134 (149) T ss_pred ecCCCcccccccCccccccCCC-cccc------ccccceeecC------c---cceecCCCC----------C-CCcchH Confidence 6666555556667644322111 1100 0000112222 1 233344442 1 123 66 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLG 179 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~ 179 (192) .||+.+.. ++.+.|+ T Consensus 135 pA~~~~~~-----~i~~~i~ 149 (149) T protein:vir:94 135 PAIDAGRK-----TFEQYFS 149 (149) T ss_pred HHHHHHHH-----HHHHhhC Confidence 67765433 3444444 No 45 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=88.51 E-value=0.031 Score=28.84 Aligned_cols=166 Identities=12% Similarity=0.089 Sum_probs=70.0 Q ss_pred ChhHHHHHHHH-----hhcchhhHHHHHHHHHHHHHHHHHHH---HHHHHHHHhcccchhhhhcchhhhhhceeeeecCC Q lcl|NC_010392. 1 MKGLENAIRNL-----NSLDRQMVPRASIWAVNRVAQKAVSV---ATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGT 72 (192) Q Consensus 1 ~~gl~~~i~nL-----~~i~~~~Vp~A~arAiNrva~~a~s~---s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~ 72 (192) ++-+...+..| ......++-+++.++-..++..+..+ -.+.|.+-+++- .-....+ .+.+.- + T Consensus 6 l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~-----kas~~~l--~a~I~~--~ 76 (184) T protein:vir:39 6 LEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVK-----RATVNKP--RALIRV--N 76 (184) T ss_pred HHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheec-----ccCCCCe--EEEEEE--e Confidence 33333333332 22334456666666666666555544 233344433210 0000001 111111 1 Q ss_pred CCCeEEEEEEeccCcCceecCCcceeecccccccccccceeee-----cceecCcceeecCCCCchhheeeccccccccc Q lcl|NC_010392. 73 DGKRSARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKI-----GPYLFRDAFIQQLANGRWHVMRRVNGKNRYPI 147 (192) Q Consensus 73 ~~~~~a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkv-----Gk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PI 147 (192) +.+...++.+ +.+. +.+.++....+..+..+.++..+.- .+.-..+-|.- ..+++..| .. .+.|| T Consensus 77 -~~~i~l~~~g--~~~~-k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R-~gk~R~PI-~~----~~~~i 146 (184) T protein:vir:39 77 -RGNLPAIKLG--TASV-RLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRR-TSKPRYPI-EV----VSIPL 146 (184) T ss_pred -ccceeeeecc--cccc-ccCccccccccccceeeecceecCcceeeecCCCceEEEEE-ecCcccce-eE----EEcCc Confidence 1122222222 1111 2222222222222222222111110 01111123333 23445554 22 33564 Q ss_pred eeeecCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 148 DVVKIPLSGPLTQAFESATQSLIDEEIPKQLGYALKQQLRL 188 (192) Q Consensus 148 evvkipis~plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~ 188 (192) . .|+++.+.+..++.++..|+.+|++||.+.|..-|+. T Consensus 147 ~---~~~~e~~~~~~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 147 A---APLTTAFKEELPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred h---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 3 6777777777777777777788888888888888888 No 46 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=87.67 E-value=0.011 Score=31.23 Aligned_cols=128 Identities=19% Similarity=0.239 Sum_probs=63.5 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+|+.++..||.++-..-++++...|.+..|.- .+..+..++-+|..|-. ...|.+ ++.+.+.| T Consensus 11 V~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v----~~~~ar~~tP~g~~~p~------~srr~r------~G~L~~Si 74 (143) T protein:vir:13 11 VDGLRQFQRNVRALRDKELNKAVREANKASGEV----LIPQAKHESPDGHRDPK------SSKRYR------PGKLDKSI 74 (143) T ss_pred hHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHH----HHHHHHhhcCCcccccc------cccccc------cchhhccc Confidence 999999999999986666899999988888743 33444444433311111 111111 11122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeeccee-cC-cceeecCCCCch---hheeeccccccccceeeecCch Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYL-FR-DAFIQQLANGRW---HVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~-f~-gaFi~~~~nGr~---~V~~R~~Gk~R~PIevvkipis 155 (192) ++. -+. ...++++|+.. .| -+||+-.--++- .-|-. .|.+ .-+ T Consensus 75 r~a---------------aT~-------raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~-~a~a---------~te 122 (143) T protein:vir:13 75 KVT---------------ASA-------KGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLY-RAMA---------RKS 122 (143) T ss_pred ccc---------------ccc-------cceeeeecCcCCCCcccccccCCcccccchhhhhh-hhhh---------ccC Confidence 221 111 12335556442 33 234443311110 01111 1222 224 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPK 176 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~k 176 (192) .+....||.++++.++.+++. T Consensus 123 ~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 123 DVVAATYERRIAAVVEKYLES 143 (143) T ss_pred HHHHHHHHHHHHHHHHHHhcC Confidence 556667888888777766665 No 47 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=87.32 E-value=0.024 Score=29.43 Aligned_cols=132 Identities=14% Similarity=0.221 Sum_probs=68.6 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ..||+++++.|++++++ +.+++.+++.++|..+...+-..+...|+ .++.=+.+.. ..+...+.| T Consensus 17 ~~Gld~l~~~l~~~~~~-~~~~~~~~l~~~a~~v~~~ak~~aPvdTG------------~L~~SI~~~~--~~~g~~~~V 81 (149) T protein:vir:10 17 KYGADSMVVELDKFDKK-IEEWVKKGIAKTTTKIYNTAVALAPVDLG------------FLEESIDFKY--FDGGLSSVI 81 (149) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCcccc------------hhhccceEEe--cCCcEEEEE Confidence 46999999999998776 67899999999888777776555444432 1222222221 223356666 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ..+...=+.+-.||....-.+. +... .+. ...|.+ .. +.+.+++|. |= .| |. T Consensus 82 ~~~~~YA~~vE~GT~~~~~~~~-~~~~-----~~~-~~~~~~------~~---~~~~~t~g~----------~a-~PFl~ 134 (149) T protein:vir:10 82 SVGADYAIYVEYGTGIYATGPG-GSRA-----TKI-PWSFKG------DD---GEWYTTYGQ----------AP-QPFWN 134 (149) T ss_pred ecCCCcccccccCccccccCCc-cccc-----ccc-cceeec------cc---cceecCCCC----------CC-Ccchh Confidence 6666655666667644322111 0000 000 011111 11 233444442 21 23 66 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLG 179 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~ 179 (192) .||+++-. ++.+.|+ T Consensus 135 pA~~~~k~-----~i~~~i~ 149 (149) T protein:vir:10 135 PAIDAGRK-----TFEQYFS 149 (149) T ss_pred HHHHHHHH-----HHHHhhC Confidence 66665543 3444444 No 48 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=85.43 E-value=0.027 Score=29.16 Aligned_cols=124 Identities=16% Similarity=0.209 Sum_probs=53.8 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+..||+..+..+.. .++..+.+|+.....-.++. T Consensus 5 vkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~-----------------------~lK~~~~~fkDTGati~ev~ 61 (133) T protein:vir:96 5 IKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIK-----------------------ALKKEFESFKDTGASIEEMT 61 (133) T ss_pred EecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHH-----------------------HHHhhhhhhhcccceeeeEE Confidence 9999999999998 88888899999998887744433 34445555554222211111 Q ss_pred E----EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCch Q lcl|NC_010392. 80 I----RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 80 i----~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis 155 (192) + |.+..+.+.|++|= .+.. .....+-.. .-+|. +||+|+ +-.|. T Consensus 62 ~s~p~~~~g~~~rtV~i~W-----~gp~----~R~~iVHLN----E~Gyt---r~Gk~i---~PrG~------------- 109 (133) T protein:vir:96 62 KSKPYTKVGSQERAVLIEW-----VGPM----NRKNIIHLN----EHGYT---RDGKKY---TPRGF------------- 109 (133) T ss_pred ecCeeeccCCcceeEEEEe-----ecCC----CceeEEEee----cccee---cCCCeE---ccchh------------- Confidence 0 11222233333321 0000 000011111 11111 133221 00111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~kEl~~ 180 (192) +.|.. +-++.+..|.+.+..||.. T Consensus 110 G~i~~-a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 110 GVIAK-TLAASERKYREIIKKELAR 133 (133) T ss_pred hHHHH-HHHhhhHHHHHHHHHHhcC Confidence 11222 2233344443334444433 No 49 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=85.43 E-value=0.027 Score=29.16 Aligned_cols=124 Identities=16% Similarity=0.209 Sum_probs=53.8 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+..||+..+..+.. .++..+.+|+.....-.++. T Consensus 5 vkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~-----------------------~lK~~~~~fkDTGati~ev~ 61 (133) T protein:vir:93 5 IKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIK-----------------------ALKKEFESFKDTGASIEEMT 61 (133) T ss_pred EecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHH-----------------------HHHhhhhhhhcccceeeeEE Confidence 9999999999998 88888899999998887744433 34445555554222211111 Q ss_pred E----EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCch Q lcl|NC_010392. 80 I----RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 80 i----~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis 155 (192) + |.+..+.+.|++|= .+.. .....+-.. .-+|. +||+|+ +-.|. T Consensus 62 ~s~p~~~~g~~~rtV~i~W-----~gp~----~R~~iVHLN----E~Gyt---r~Gk~i---~PrG~------------- 109 (133) T protein:vir:93 62 KSKPYTKVGSQERAVLIEW-----VGPM----NRKNIIHLN----EHGYT---RDGKKY---TPRGF------------- 109 (133) T ss_pred ecCeeeccCCcceeEEEEe-----ecCC----CceeEEEee----cccee---cCCCeE---ccchh------------- Confidence 0 11222233333321 0000 000011111 11111 133221 00111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~kEl~~ 180 (192) +.|.. +-++.+..|.+.+..||.. T Consensus 110 G~i~~-a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 110 GVIAK-TLAASERKYREIIKKELAR 133 (133) T ss_pred hHHHH-HHHhhhHHHHHHHHHHhcC Confidence 11222 2233344443334444433 No 50 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=85.43 E-value=0.027 Score=29.16 Aligned_cols=124 Identities=16% Similarity=0.209 Sum_probs=53.8 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+..||+..+..+.. .++..+.+|+.....-.++. T Consensus 5 vkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~-----------------------~lK~~~~~fkDTGati~ev~ 61 (133) T protein:vir:94 5 IKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIK-----------------------ALKKEFESFKDTGASIEEMT 61 (133) T ss_pred EecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHH-----------------------HHHhhhhhhhcccceeeeEE Confidence 9999999999998 88888899999998887744433 34445555554222211111 Q ss_pred E----EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCch Q lcl|NC_010392. 80 I----RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 80 i----~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis 155 (192) + |.+..+.+.|++|= .+.. .....+-.. .-+|. +||+|+ +-.|. T Consensus 62 ~s~p~~~~g~~~rtV~i~W-----~gp~----~R~~iVHLN----E~Gyt---r~Gk~i---~PrG~------------- 109 (133) T protein:vir:94 62 KSKPYTKVGSQERAVLIEW-----VGPM----NRKNIIHLN----EHGYT---RDGKKY---TPRGF------------- 109 (133) T ss_pred ecCeeeccCCcceeEEEEe-----ecCC----CceeEEEee----cccee---cCCCeE---ccchh------------- Confidence 0 11222233333321 0000 000011111 11111 133221 00111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~kEl~~ 180 (192) +.|.. +-++.+..|.+.+..||.. T Consensus 110 G~i~~-a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 110 GVIAK-TLAASERKYREIIKKELAR 133 (133) T ss_pred hHHHH-HHHhhhHHHHHHHHHHhcC Confidence 11222 2233344443334444433 No 51 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=85.43 E-value=0.027 Score=29.16 Aligned_cols=124 Identities=16% Similarity=0.209 Sum_probs=53.8 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+..||+..+..+.. .++..+.+|+.....-.++. T Consensus 5 vkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~-----------------------~lK~~~~~fkDTGati~ev~ 61 (133) T protein:vir:78 5 IKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIK-----------------------ALKKEFESFKDTGASIEEMT 61 (133) T ss_pred EecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHH-----------------------HHHhhhhhhhcccceeeeEE Confidence 9999999999998 88888899999998887744433 34445555554222211111 Q ss_pred E----EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCch Q lcl|NC_010392. 80 I----RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 80 i----~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis 155 (192) + |.+..+.+.|++|= .+.. .....+-.. .-+|. +||+|+ +-.|. T Consensus 62 ~s~p~~~~g~~~rtV~i~W-----~gp~----~R~~iVHLN----E~Gyt---r~Gk~i---~PrG~------------- 109 (133) T protein:vir:78 62 KSKPYTKVGSQERAVLIEW-----VGPM----NRKNIIHLN----EHGYT---RDGKKY---TPRGF------------- 109 (133) T ss_pred ecCeeeccCCcceeEEEEe-----ecCC----CceeEEEee----cccee---cCCCeE---ccchh------------- Confidence 0 11222233333321 0000 000011111 11111 133221 00111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~kEl~~ 180 (192) +.|.. +-++.+..|.+.+..||.. T Consensus 110 G~i~~-a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 110 GVIAK-TLAASERKYREIIKKELAR 133 (133) T ss_pred hHHHH-HHHhhhHHHHHHHHHHhcC Confidence 11222 2233344443334444433 No 52 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=84.29 E-value=0.05 Score=27.73 Aligned_cols=123 Identities=15% Similarity=0.224 Sum_probs=55.4 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHH--HHhcccchhhhhcchhhhhhceeeeecCCCCCeE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVA--RETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRS 77 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va--~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~ 77 (192) ++|++++++||++ +.++.+.+.+.+||+..|..+....-++++ ++|+. .+. . +.+.++ T Consensus 7 vkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~------t~d--e----v~~s~~------- 67 (132) T protein:vir:96 7 LKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGE------TTE--S----AVVSGV------- 67 (132) T ss_pred ccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcch------hhc--c----eeecCe------- Confidence 9999999999999 999899999999999999777665555554 33320 010 0 111111 Q ss_pred EEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 78 ARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 78 a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) + +.+-.+.|++|=.. .|.+ .+-..-|-| |-|+.+=- -++.+| T Consensus 68 ---~-~~~G~r~V~VgW~G-----pR~~------ivHLNE~Gy-Gk~~~PrG---~G~I~~------------------- 109 (132) T protein:vir:96 68 ---R-REDGIPKVKLGFTT-----PRWN------IVHLQELEY-GWKHNRRG---VGVIRR------------------- 109 (132) T ss_pred ---e-ecCCceEEEecccC-----Ccee------EEeeecccc-cCCcCCCc---chHHHH------------------- Confidence 1 12223455554211 1221 111111222 11111100 122222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) ++ ++.+..+-..+..||...|.- T Consensus 110 ---a~-~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 110 ---YS-DILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred ---HH-HhhhhHHHHHHHHHHHHHhcC Confidence 11 122222222222333333332 No 53 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=84.06 E-value=0.041 Score=28.19 Aligned_cols=128 Identities=17% Similarity=0.198 Sum_probs=65.4 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||.++..||..+-..-++++...|.+..|.- .+..+..++-.|.+|-++-. .-.++.+.+.| T Consensus 11 V~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v----~~~~ar~~tP~g~r~~~~s~------------~~r~G~L~~Si 74 (143) T protein:vir:62 11 VDGLREFQRNVRTLRDKELNKAVREANKASGEV----LIPQAKHESPDGKRDAKSSK------------KYRPGKLDKSI 74 (143) T ss_pred hHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHH----HHHHHHhhcCCccccccccc------------ccCcchhhccc Confidence 999999999999986666899999998888743 34444444433322222111 00112222333 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecc-eecCc-ceeecCCCCch---hheeeccccccccceeeecCch Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGP-YLFRD-AFIQQLANGRW---HVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk-~~f~g-aFi~~~~nGr~---~V~~R~~Gk~R~PIevvkipis 155 (192) ++. -+. ...++++|+ ...|= +||+-.--++- .-|-. .|.+ .-+ T Consensus 75 r~a---------------aT~-------raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~-~a~a---------~te 122 (143) T protein:vir:62 75 KVT---------------ASA-------KGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLF-RAMA---------RKS 122 (143) T ss_pred ccc---------------ccc-------cceeeeeCCcCCCCcccccccCcccccccchhhhh-hhhh---------ccC Confidence 222 111 123356665 34442 34544311110 00111 1221 224 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPK 176 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~k 176 (192) .+....||.++++.++.+++. T Consensus 123 ~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 123 DVVAATYERRIAAVVEKYLES 143 (143) T ss_pred HHHHHHHHHHHHHHHHHHhcC Confidence 556667888888777766665 No 54 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=83.38 E-value=0.069 Score=26.95 Aligned_cols=129 Identities=13% Similarity=0.210 Sum_probs=52.7 Q ss_pred Chh--HHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchh--hhhhceeeeec---CCC Q lcl|NC_010392. 1 MKG--LENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLK--LVRQRVRLFKA---GTD 73 (192) Q Consensus 1 ~~g--l~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k--~vr~R~r~~ka---~~~ 73 (192) |.+ |.++++.|+.|. +... ++++.+...++..|-+|.+.. +|.+ .|+.=+.+.-. +.. T Consensus 5 ~~~~d~s~l~~~l~~l~-~~~~--------~v~R~A~~~ga~vv~dear~~------aP~~tG~LkksI~~~~~~~~s~~ 69 (157) T protein:vir:97 5 IRSVDITGILAGLETVV-EHSS--------DVVRTMTYESAVAVRESAKAF------VNDETGKLRNNLYVAYSPEESVE 69 (157) T ss_pred eecccHHHHHHHHHHhH-HHHH--------HHHHHHHHHHHHHHHHHHHHh------CCCCcchhhhheeeeeccccCCC Confidence 544 558888887763 2232 333344445555555554321 3332 12222222110 011 Q ss_pred CCeEEEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhh--------------eeec Q lcl|NC_010392. 74 GKRSARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHV--------------MRRV 139 (192) Q Consensus 74 ~~~~a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V--------------~~R~ 139 (192) +..+..|.++.++.|- |-|+. +|+|.. |.+ T Consensus 70 g~~~~~Vg~~~~~a~~--------------------------------g~~vE---fG~~~~~~~~~~~~~~~~~~~~~- 113 (157) T protein:vir:97 70 GIQTYAVSWRKKAAPH--------------------------------GHLLE---FGHWQTHAAYRDKDGQWYSSKVK- 113 (157) T ss_pred ceEEEEEeecCCccce--------------------------------eeeee---cCcccccccccCCcccccccccc- Confidence 2122223333333321 11111 222221 111 Q ss_pred cccccccceeeecCchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 140 NGKNRYPIDVVKIPLSGP-LTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 140 ~Gk~R~PIevvkipis~p-lt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) .|.+ +++| +.| |.-||+...+ ...+.|...|...|+.+++= T Consensus 114 ~~t~------~~~P-a~PFlRPA~d~~k~-----~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 114 LVNP------KWIP-AKPFLRPGYDSVAM-----QIPDIARAAGAKKYAELQRG 155 (157) T ss_pred cCCC------CcCC-CCcccchHHHHhHH-----HHHHHHHHHHHHHHHHHhcC Confidence 1211 2344 444 6666766554 34555555566666555555 No 55 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=81.71 E-value=0.083 Score=26.50 Aligned_cols=135 Identities=11% Similarity=0.064 Sum_probs=59.2 Q ss_pred ChhHHHHHHHHhhcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeec-CCCCCeEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLD-RQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKA-GTDGKRSA 78 (192) Q Consensus 1 ~~gl~~~i~nL~~i~-~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka-~~~~~~~a 78 (192) ++||++++++|.... ...+|+....+++.+|..+...+-+.+...| -.+|+=.+.... .+.+.... T Consensus 8 ~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdT------------G~Lr~S~~~~~~~~~~~~~~~ 75 (144) T protein:vir:10 8 DAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQ------------GNLRRSWTAEGPTYGCGGWTI 75 (144) T ss_pred HHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCc------------chhccceeecceeeecCeeEE Confidence 899999999998864 4468899999999999666544333222111 122222221111 11222233 Q ss_pred EEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHH Q lcl|NC_010392. 79 RIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPL 158 (192) Q Consensus 79 ~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~pl 158 (192) .|.-+..-=+.+-.||-. -+|.|+-...+| .....++|+ ++ T Consensus 76 ~V~n~~~YA~~VE~Ghr~-----------------------~~G~~v~~~~~~--~~~g~V~G~----------~~---- 116 (144) T protein:vir:10 76 KLINNAEYASYVESGHRQ-----------------------TPGRYVPVLKKR--LVRDWVPGQ----------FY---- 116 (144) T ss_pred EEecCCCcccccccceee-----------------------cCCcccccCCCc--cccceecCc----------cc---- Confidence 333333333333344411 111122111111 111222332 12 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010392. 159 TQAFESATQSLIDEEIPKQLGYALKQQLRLYL 190 (192) Q Consensus 159 t~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~ 190 (192) -+.....+++.|+++|...|..=+.++= T Consensus 117 ----~~~a~~~~~~~~~~~l~k~l~~l~d~~~ 144 (144) T protein:vir:10 117 ----MKKSIPQIQRQLPQLVTEGLWGLKDLFE 144 (144) T ss_pred ----hHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 1222223444455555555544444444 No 56 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=80.87 E-value=0.083 Score=26.51 Aligned_cols=106 Identities=14% Similarity=0.157 Sum_probs=43.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+... -+.+..+|+...|..+...+-.. .|.. .+.+...| T Consensus 3 i~Gld~l~~~l~~~~---~~~~~~~al~~~a~~i~~~ak~~--------------aPv~-------------TG~Lr~si 52 (108) T protein:vir:74 3 ITGIDALQKKLRKNA---TLDDVKHVVKSNTASMNKNMQNL--------------APVD-------------TGNMKRSI 52 (108) T ss_pred chhHHHHHHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHh--------------CCCC-------------chhhhccc Confidence 999999999998642 34556666666665444433221 2221 12222222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+.- . .++....||....=+.|+-- |....| -=|. +.- T Consensus 53 ~~~~------~----------------~~~~~~~V~~~~~Ya~~vE~-------------GT~km~----aqpf---~~p 90 (108) T protein:vir:74 53 TSEF------T----------------DGGLSGTTGPHTDYAGYVEY-------------GTRFQS----AQPF---VKP 90 (108) T ss_pred eeee------e----------------cCceEEEeecCCCcccceec-------------cccccC----CCcc---hhh Confidence 2110 0 00111334433322333332 311111 1133 333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+..- ..|.++|...|+ T Consensus 91 a~~~~~-----~~~~~~i~~~~k 108 (108) T protein:vir:74 91 AFNIQK-----KVFTNDLERLTK 108 (108) T ss_pred HHHHHH-----HHHHHHHHHHcC Confidence 444322 223334433333 No 57 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=80.14 E-value=0.098 Score=26.12 Aligned_cols=113 Identities=16% Similarity=0.143 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++..|+.+.|..+...+-+. |..+ -..|. ..+.+...| T Consensus 3 ~~Gld~l~~~l~~~~~~-~~~~v~~a~~~~~~~i~~~a~~~-a~~~-------~~~p~-------------~TG~Lr~sI 60 (115) T protein:vir:10 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLK-AREV-------MNKGY-------------WTGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-cccc-------CCCCC-------------Cchhhhhcc Confidence 99999999999998655 45667777777766555433222 2110 01111 112122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .. ++....||...+ +|+ |-= -|....| -=|. +.- T Consensus 61 ~~~---------------~~--------g~~~~~v~~~~~-----------Ya~-~vE-~GT~km~----a~Pf---l~P 97 (115) T protein:vir:10 61 RYK---------------KT--------GDLQYTITSHAA-----------YSG-FLE-FGTRYME----AEPF---MWP 97 (115) T ss_pred eee---------------ec--------CceEEEeecCcc-----------chh-hhc-ccccccC----CCCc---hhh Confidence 211 00 011123333222 232 222 1422221 1133 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+- ..|.++|..+|+ T Consensus 98 A~~~~~-----~~~~~~i~~~~k 115 (115) T protein:vir:10 98 VYEVIR-----KSTVEELKALFE 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 554332 234444555555 No 58 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=80.14 E-value=0.098 Score=26.12 Aligned_cols=113 Identities=16% Similarity=0.143 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++..|+.+.|..+...+-+. |..+ -..|. ..+.+...| T Consensus 3 ~~Gld~l~~~l~~~~~~-~~~~v~~a~~~~~~~i~~~a~~~-a~~~-------~~~p~-------------~TG~Lr~sI 60 (115) T protein:vir:96 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLK-AREV-------MNKGY-------------WTGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-cccc-------CCCCC-------------Cchhhhhcc Confidence 99999999999998655 45667777777766555433222 2110 01111 112122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .. ++....||...+ +|+ |-= -|....| -=|. +.- T Consensus 61 ~~~---------------~~--------g~~~~~v~~~~~-----------Ya~-~vE-~GT~km~----a~Pf---l~P 97 (115) T protein:vir:96 61 RYK---------------KT--------GDLQYTITSHAA-----------YSG-FLE-FGTRYME----AEPF---MWP 97 (115) T ss_pred eee---------------ec--------CceEEEeecCcc-----------chh-hhc-ccccccC----CCCc---hhh Confidence 211 00 011123333222 232 222 1422221 1133 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+- ..|.++|..+|+ T Consensus 98 A~~~~~-----~~~~~~i~~~~k 115 (115) T protein:vir:96 98 VYEVIR-----KSTVEELKALFE 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 554332 234444555555 No 59 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=80.14 E-value=0.098 Score=26.12 Aligned_cols=113 Identities=16% Similarity=0.143 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++..|+.+.|..+...+-+. |..+ -..|. ..+.+...| T Consensus 3 ~~Gld~l~~~l~~~~~~-~~~~v~~a~~~~~~~i~~~a~~~-a~~~-------~~~p~-------------~TG~Lr~sI 60 (115) T protein:vir:97 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLK-AREV-------MNKGY-------------WTGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-cccc-------CCCCC-------------Cchhhhhcc Confidence 99999999999998655 45667777777766555433222 2110 01111 112122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .. ++....||...+ +|+ |-= -|....| -=|. +.- T Consensus 61 ~~~---------------~~--------g~~~~~v~~~~~-----------Ya~-~vE-~GT~km~----a~Pf---l~P 97 (115) T protein:vir:97 61 RYK---------------KT--------GDLQYTITSHAA-----------YSG-FLE-FGTRYME----AEPF---MWP 97 (115) T ss_pred eee---------------ec--------CceEEEeecCcc-----------chh-hhc-ccccccC----CCCc---hhh Confidence 211 00 011123333222 232 222 1422221 1133 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+- ..|.++|..+|+ T Consensus 98 A~~~~~-----~~~~~~i~~~~k 115 (115) T protein:vir:97 98 VYEVIR-----KSTVEELKALFE 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 554332 234444555555 No 60 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=80.14 E-value=0.098 Score=26.12 Aligned_cols=113 Identities=16% Similarity=0.143 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++..|+.+.|..+...+-+. |..+ -..|. ..+.+...| T Consensus 3 ~~Gld~l~~~l~~~~~~-~~~~v~~a~~~~~~~i~~~a~~~-a~~~-------~~~p~-------------~TG~Lr~sI 60 (115) T protein:vir:96 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLK-AREV-------MNKGY-------------WTGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-cccc-------CCCCC-------------Cchhhhhcc Confidence 99999999999998655 45667777777766555433222 2110 01111 112122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .. ++....||...+ +|+ |-= -|....| -=|. +.- T Consensus 61 ~~~---------------~~--------g~~~~~v~~~~~-----------Ya~-~vE-~GT~km~----a~Pf---l~P 97 (115) T protein:vir:96 61 RYK---------------KT--------GDLQYTITSHAA-----------YSG-FLE-FGTRYME----AEPF---MWP 97 (115) T ss_pred eee---------------ec--------CceEEEeecCcc-----------chh-hhc-ccccccC----CCCc---hhh Confidence 211 00 011123333222 232 222 1422221 1133 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+- ..|.++|..+|+ T Consensus 98 A~~~~~-----~~~~~~i~~~~k 115 (115) T protein:vir:96 98 VYEVIR-----KSTVEELKALFE 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 554332 234444555555 No 61 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=80.14 E-value=0.098 Score=26.12 Aligned_cols=113 Identities=16% Similarity=0.143 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++..|+.+.|..+...+-+. |..+ -..|. ..+.+...| T Consensus 3 ~~Gld~l~~~l~~~~~~-~~~~v~~a~~~~~~~i~~~a~~~-a~~~-------~~~p~-------------~TG~Lr~sI 60 (115) T protein:vir:93 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLK-AREV-------MNKGY-------------WTGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-cccc-------CCCCC-------------Cchhhhhcc Confidence 99999999999998655 45667777777766555433222 2110 01111 112122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .. ++....||...+ +|+ |-= -|....| -=|. +.- T Consensus 61 ~~~---------------~~--------g~~~~~v~~~~~-----------Ya~-~vE-~GT~km~----a~Pf---l~P 97 (115) T protein:vir:93 61 RYK---------------KT--------GDLQYTITSHAA-----------YSG-FLE-FGTRYME----AEPF---MWP 97 (115) T ss_pred eee---------------ec--------CceEEEeecCcc-----------chh-hhc-ccccccC----CCCc---hhh Confidence 211 00 011123333222 232 222 1422221 1133 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+- ..|.++|..+|+ T Consensus 98 A~~~~~-----~~~~~~i~~~~k 115 (115) T protein:vir:93 98 VYEVIR-----KSTVEELKALFE 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 554332 234444555555 No 62 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=80.14 E-value=0.098 Score=26.12 Aligned_cols=113 Identities=16% Similarity=0.143 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++..|+.+.|..+...+-+. |..+ -..|. ..+.+...| T Consensus 3 ~~Gld~l~~~l~~~~~~-~~~~v~~a~~~~~~~i~~~a~~~-a~~~-------~~~p~-------------~TG~Lr~sI 60 (115) T protein:vir:78 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLK-AREV-------MNKGY-------------WTGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-cccc-------CCCCC-------------Cchhhhhcc Confidence 99999999999998655 45667777777766555433222 2110 01111 112122222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .. ++....||...+ +|+ |-= -|....| -=|. +.- T Consensus 61 ~~~---------------~~--------g~~~~~v~~~~~-----------Ya~-~vE-~GT~km~----a~Pf---l~P 97 (115) T protein:vir:78 61 RYK---------------KT--------GDLQYTITSHAA-----------YSG-FLE-FGTRYME----AEPF---MWP 97 (115) T ss_pred eee---------------ec--------CceEEEeecCcc-----------chh-hhc-ccccccC----CCCc---hhh Confidence 211 00 011123333222 232 222 1422221 1133 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+- ..|.++|..+|+ T Consensus 98 A~~~~~-----~~~~~~i~~~~k 115 (115) T protein:vir:78 98 VYEVIR-----KSTVEELKALFE 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 554332 234444555555 No 63 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=78.21 E-value=0.12 Score=25.70 Aligned_cols=106 Identities=16% Similarity=0.254 Sum_probs=43.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||++++++|+.+.. ++++..|+...|..+...+-.. .|.. .+.+...| T Consensus 7 i~Gld~l~~~L~~~~~---~~~~~~al~~~~~~i~~~ak~~--------------aPvd-------------TG~Lr~si 56 (112) T protein:vir:36 7 FKGIDQLVKHLDKAAS---LKGVQQVVKSNTSNMTANMQKL--------------VPVD-------------TGYMKRSI 56 (112) T ss_pred ehhHHHHHHHHHhhhh---HHHHHHHHHHHHHHHHHHHHHh--------------CCCC-------------chhhhhce Confidence 9999999999987533 3555556666554444433221 2221 11111112 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. .+ .++....||....-+.|+.-.- .+.|= =|. +.- T Consensus 57 ~~~---------------~~-------~~~~~~~V~~~~~Ya~~vE~GT-------------~k~~a----~Pf---l~p 94 (112) T protein:vir:36 57 KME---------------LT-------EGGFSGQAGPHTDYSAYVEYGT-------------RFQSA----QPF---VKP 94 (112) T ss_pred eee---------------ec-------CCceEEEeecCCCccceeeccc-------------cccCC----Ccc---hhh Confidence 111 00 0112234554433333433332 21111 133 333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) +|+..-. .|.++|...|+ T Consensus 95 a~~~~~~-----~~~~~i~~~lr 112 (112) T protein:vir:36 95 AYNEQKG-----VFIKDLERLLK 112 (112) T ss_pred hHHHHHH-----HHHHHHHHHcC Confidence 4433222 23333433333 No 64 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=78.21 E-value=0.058 Score=27.35 Aligned_cols=108 Identities=11% Similarity=0.146 Sum_probs=40.7 Q ss_pred ChhHHHHHHHHhhcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLD-RQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~-~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++||+++++.|+++. ...+.+|...++..++..+...+ +.. .|. ..++.... T Consensus 6 ~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a-----~~~---------~p~-------------~TG~Lr~s 58 (114) T protein:vir:49 6 FEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRA-----QFN---------KGY-------------STGATRRS 58 (114) T ss_pred eehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhc-----ccC---------CCC-------------Cchhhhhc Confidence 999999999999874 44455555555555443332221 110 121 11112111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) |.+...+ +...||...+=..|+.- |....|= =|.=.|-. T Consensus 59 I~~~~~~------------------------~~~~V~~~~~Ya~~vEf-------------GT~km~a----~Pfl~PA~ 97 (114) T protein:vir:49 59 ITLQVES------------------------DKATVEALTSYSGYLEV-------------GTRKMEA----QPFMKPAL 97 (114) T ss_pred eeeeecC------------------------CeeEecCCCCccceecc-------------cccccCC----CCchhhhH Confidence 2111000 11233332222223322 3111111 13333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) +.... .|.++|...++. T Consensus 98 ~~~~~--------~~~~~l~~l~k~ 114 (114) T protein:vir:49 98 DEVAP--------KMVEELAKWDET 114 (114) T ss_pred HHHHH--------HHHHHHHHHhcC Confidence 22222 223333333333 No 65 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=78.21 E-value=0.058 Score=27.35 Aligned_cols=108 Identities=11% Similarity=0.146 Sum_probs=40.7 Q ss_pred ChhHHHHHHHHhhcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLD-RQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~-~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++||+++++.|+++. ...+.+|...++..++..+...+ +.. .|. ..++.... T Consensus 6 ~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a-----~~~---------~p~-------------~TG~Lr~s 58 (114) T protein:vir:27 6 FEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRA-----QFN---------KGY-------------STGATRRS 58 (114) T ss_pred eehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhc-----ccC---------CCC-------------Cchhhhhc Confidence 999999999999874 44455555555555443332221 110 121 11112111 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) |.+...+ +...||...+=..|+.- |....|= =|.=.|-. T Consensus 59 I~~~~~~------------------------~~~~V~~~~~Ya~~vEf-------------GT~km~a----~Pfl~PA~ 97 (114) T protein:vir:27 59 ITLQVES------------------------DKATVEALTSYSGYLEV-------------GTRKMEA----QPFMKPAL 97 (114) T ss_pred eeeeecC------------------------CeeEecCCCCccceecc-------------cccccCC----CCchhhhH Confidence 2111000 11233332222223322 3111111 13333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALKQ 184 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~~ 184 (192) +.... .|.++|...++. T Consensus 98 ~~~~~--------~~~~~l~~l~k~ 114 (114) T protein:vir:27 98 DEVAP--------KMVEELAKWDET 114 (114) T ss_pred HHHHH--------HHHHHHHHHhcC Confidence 22222 223333333333 No 66 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=73.92 E-value=0.16 Score=24.88 Aligned_cols=113 Identities=13% Similarity=0.184 Sum_probs=49.3 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. +..++.+|+.+.|..+...+-+...+.. ..|.. | +++...| T Consensus 3 i~Gld~L~~~l~~~~~~-~~~~~~~al~~~~~~i~~~a~~~a~~~~--------~~pv~------------T-G~Lr~sI 60 (115) T protein:vir:10 3 SKGLKKLMNHLKVMHDD-IEDDVDDILKNNAKEGVGIAVSNAKEVM--------NKGYW------------T-GNLASLI 60 (115) T ss_pred ehhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcccc--------CCCCc------------c-hhhhhce Confidence 99999999999997655 4566777777766555544433221110 11210 1 1111112 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+. . .++....||...+=+.|+.- |....|=+ |. +.- T Consensus 61 ~~~---------------~--------~g~~~~~v~~~~~Ya~~vEf-------------GT~km~a~----PF---l~P 97 (115) T protein:vir:10 61 EVK---------------K--------IGDLHYRVISTAHYSGFLEF-------------GTRYMEPA----PF---MFP 97 (115) T ss_pred eee---------------e--------cCcEEEEeeCCCccchheec-------------ccccCCCC----Cc---hhh Confidence 111 0 01111233332222222222 42222211 33 334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+. ..|.++|..+++ T Consensus 98 A~~~~k-----~~~~~~i~~~i~ 115 (115) T protein:vir:10 98 TYQTLK-----KSTINDLKRLLS 115 (115) T ss_pred hHHHHH-----HHHHHHHHHHhC Confidence 454332 235555555555 No 67 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=66.61 E-value=0.26 Score=23.75 Aligned_cols=124 Identities=17% Similarity=0.224 Sum_probs=51.7 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+.+||+..+..+. +.++..+.+|+.....-.++. T Consensus 5 vkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~-----------------------~~lK~~~~~fkDTGati~ev~ 61 (133) T protein:vir:93 5 IKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFI-----------------------KALKKEFESFKDTGASIEEMT 61 (133) T ss_pred EecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHH-----------------------HHHHhhhhhhhcccceeeeEE Confidence 9999999999986 4577788888888877764433 334455555654322212211 Q ss_pred E----EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCch Q lcl|NC_010392. 80 I----RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 80 i----~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis 155 (192) + |.+..+.+.|++|= .+.. .....+-.. .-+|. +||+|+ +-.|. T Consensus 62 ~s~p~~~~g~~~rtV~i~W-----~gp~----~R~~iVHLN----E~Gyt---r~Gk~i---~PrG~------------- 109 (133) T protein:vir:93 62 KSKPYTKVGSQERAVLIEW-----VGPM----NRKNIIHLN----EHGYT---RDGKKY---TPRGF------------- 109 (133) T ss_pred ecCeeeccCCcceEEEEEe-----ecCC----CceeEEEee----cccee---cCCCeE---ccchh------------- Confidence 1 11222233333321 0000 000011111 11111 123221 00111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~kEl~~ 180 (192) +.|.. +-++.+..|.+.+..||.. T Consensus 110 G~i~~-a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 110 GVIAK-TLAANERKYREIIKKELAR 133 (133) T ss_pred hHHHH-HHHhhhHHHHHHHHHHhcC Confidence 11222 2233343333334444433 No 68 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=63.95 E-value=0.31 Score=23.38 Aligned_cols=165 Identities=16% Similarity=0.186 Sum_probs=70.7 Q ss_pred HHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEEEEecc Q lcl|NC_010392. 6 NAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARIRINRG 85 (192) Q Consensus 6 ~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i~v~r~ 85 (192) =.|.+|+.+... +..-...++.+.+.+++++++..+=..+ .+.|..-..+-. ......+++... T Consensus 1 ~~ik~l~~~~~~-L~~i~~~~vp~A~~rAiNrta~~a~t~~-----------~r~v~~e~~I~~----k~Ir~r~r~~kA 64 (192) T protein:vir:34 1 MAIKGLEQAVEN-LSRISKTAVPGAAAMAINRVASSAISQS-----------ASQVARETKVRR----KLVKERARLKRA 64 (192) T ss_pred CcchhHHHHHHH-HhhcCchhhHHHHHHHHHHHHHHHHHHH-----------HHHHHHHhCCCH----HHHHhhheeccc Confidence 445677664443 4444455566666666666665543332 112222222211 112223333221 Q ss_pred CcCceecCCcceeecccccccc---cccceeeecceec-----------CcceeecCCCCchhhe-eeccccccccc-e- Q lcl|NC_010392. 86 NLPAIKLGAAQVRMSKRRGKLL---YRGSVLKIGPYLF-----------RDAFIQQLANGRWHVM-RRVNGKNRYPI-D- 148 (192) Q Consensus 86 ~lpaikLg~~~~~~~~rr~~~~---~~~s~lkvGk~~f-----------~gaFi~~~~nGr~~V~-~R~~Gk~R~PI-e- 148 (192) .- +.+..+..-.+++.. -+......+++.+ .|.++-..+.---+-| .++. ..+..| . T Consensus 65 s~-----~~l~a~I~~~~~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~-ng~~~Vf~R 138 (192) T protein:vir:34 65 TV-----KNPQARIKVNRGDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLK-NGRWHVMQR 138 (192) T ss_pred cC-----CCceEEEEEeccceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCC-CCCceeEEE Confidence 11 111111111111110 0000011111111 1222221111111111 2221 112111 1 Q ss_pred ---eeecCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010392. 149 ---VVKIPL---SGPLTQAFESATQSLIDEEIPKQLGYALKQQLRLYLSR 192 (192) Q Consensus 149 ---vvkipi---s~plt~afe~e~~~~~~~~~~kEl~~~L~~qlr~~~kr 192 (192) --..|| .-||-+...+.++.-.+..|+++|.++|.++|...|+. T Consensus 139 ~~gk~R~PIe~vkIpis~~l~~af~~~~~~~~~~~~~~El~~~L~~~lr~ 188 (192) T protein:vir:34 139 VAGKNRYPIDVVKIPMAVPLTTAFKQNIERIRRERLPKELGYALQHQLRM 188 (192) T ss_pred ccCCCccceeEEEechhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 124677 36777877777877777888888888888888888888 No 69 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=63.68 E-value=0.12 Score=25.57 Aligned_cols=133 Identities=19% Similarity=0.177 Sum_probs=65.2 Q ss_pred Ch-hHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MK-GLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~-gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++ |++++..+|+.+... +..++.+++..+|.-+...+-..+...|+ .++.=++.....+.....+. T Consensus 6 ~~~~~~~l~~~l~~~~~~-~~~~~~~~l~~~a~~i~~~ak~~aPv~TG------------~Lr~SI~~~~~~~g~~~~~~ 72 (142) T protein:vir:94 6 YRVNSTEFQGALRAALDR-LTGAAREATEAAANDMVNMAKGLCPVDTG------------RLRSSIQAVPSGGRFSFSVT 72 (142) T ss_pred EEecHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCccch------------hhhccceeeeccCCceEEEE Confidence 44 899999999998776 66888888888887777766554443332 23333333333233333444 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecC-c-hHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIP-L-SGP 157 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkip-i-s~p 157 (192) |.-+..--+.+-.||.+-.-.++..... .|+++ .| | ..-|+.| + +.| T Consensus 73 v~~~~~YA~~vE~Gt~~~~i~pk~~k~l-----------~~~~~---------~~-~----------~~~v~~pG~~~~p 121 (142) T protein:vir:94 73 IGTNVTYAADVEYGTAPHVIVPKDKKAL-----------YWPGA---------AH-P----------VAKVNHPGTRAQP 121 (142) T ss_pred EecCcccchhhhccCCCceeccCCCccc-----------eeccc---------ce-e----------eeeeeecCCCCCc Confidence 4444444445556765433222111100 11111 01 1 1112222 1 233 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 -LTQAFESATQSLIDEEIPKQLG 179 (192) Q Consensus 158 -lt~afe~e~~~~~~~~~~kEl~ 179 (192) |.-||++.-.++ .++-+++. T Consensus 122 fl~~A~~~~~~~i--~~~~~~~~ 142 (142) T protein:vir:94 122 FMRPAIAAASTFL--RNHAKGIR 142 (142) T ss_pred chhHHHHHHHHHH--HHHHHhcC Confidence 777776544332 33444555 No 70 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=61.45 E-value=0.35 Score=23.06 Aligned_cols=113 Identities=16% Similarity=0.204 Sum_probs=52.4 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+++++. ++.++..|+.+.|..+...+-... ... ...|.. .+.+...| T Consensus 3 i~Gld~L~~~l~~~~~~-~~~~v~~av~~~~~~i~~~a~~~a-~~~-------~~~p~~-------------TG~Lr~SI 60 (115) T protein:vir:99 3 IDGLDALLNQFHDMKTN-IDDDVDDILQENAKEYVVRAKLKA-REV-------MNKGYW-------------TGNLSRNI 60 (115) T ss_pred chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhh-ccc-------cCCCCc-------------chhhhhce Confidence 99999999999998754 668888888888777666654322 110 012221 12222222 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLTQ 160 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt~ 160 (192) .+.- .++-...||...+=..|+.- |....|=+ |. +.- T Consensus 61 ~~~~-----------------------~g~~~~~V~~~~~Ya~~vE~-------------GT~~m~a~----PF---l~P 97 (115) T protein:vir:99 61 RYKK-----------------------TVDLQYTITSHAAYSGFLEF-------------GTRYMEAE----PF---MWP 97 (115) T ss_pred eeee-----------------------cCcEEEEecCCccccccccc-------------cccccCCC----Cc---chh Confidence 2110 01111234433322223222 32111111 33 334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 161 AFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 161 afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+.. .|.++|..+++ T Consensus 98 A~~~~k~-----~~~~~l~~~~k 115 (115) T protein:vir:99 98 VYEVIRK-----STVEELKTLFE 115 (115) T ss_pred hHHHHHH-----HHHHHHHHHhC Confidence 5553322 34444444444 No 71 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=55.32 E-value=0.48 Score=22.32 Aligned_cols=105 Identities=14% Similarity=0.143 Sum_probs=42.9 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |+||+++++.|+.+. .+.+...|+.+.|..+...+... .|.. .+.+...| T Consensus 3 i~Gld~l~~~l~~~~---~~~~~~~al~~~a~~i~~~ak~~--------------apvd-------------TG~Lr~si 52 (108) T protein:vir:98 3 ITGIDALQKKLRKNA---TLNDVKHVVKRNTVSMNKNMQNL--------------APVD-------------TGNMKRSI 52 (108) T ss_pred chhHHHHHHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHh--------------CCCC-------------chhhHhhc Confidence 999999999998643 34555556666665444433321 2211 11111122 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LT 159 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt 159 (192) ++.. +- ++-...||....=+.|+.- |.... | +.| +. T Consensus 53 ~~~~------~~----------------~~~~~~V~~~~~Ya~~vE~-------------GT~~m-------~-aqPFl~ 89 (108) T protein:vir:98 53 TSEF------TD----------------GGLTGTTIPHTDYAGYVEY-------------GTRFQ-------A-AQPFVK 89 (108) T ss_pred eeee------ec----------------CceEEEeecCCCccceeec-------------ccccc-------C-CCcchh Confidence 1110 00 0011223322211222222 32111 1 233 45 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~ 183 (192) -||+.+.. .+.++|...|+ T Consensus 90 pa~~~~~~-----~~~~~i~~~lr 108 (108) T protein:vir:98 90 PAFDVQKK-----IFTNDLERLTK 108 (108) T ss_pred hHHHHHHH-----HHHHHHHHHcC Confidence 55554433 24444444444 No 72 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=55.02 E-value=0.45 Score=22.49 Aligned_cols=106 Identities=13% Similarity=0.189 Sum_probs=41.1 Q ss_pred ChhHHHHHHHHhhcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLD-RQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~-~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) |+||+++++.|+.+. ...|.+++..++-..+..+...+ +.+ -|. ..+++.-. T Consensus 6 i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a-----~~~---------apv-------------dTG~Lr~s 58 (112) T protein:vir:96 6 FEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKA-----QFK---------KGY-------------STGATRRS 58 (112) T ss_pred ehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHh-----hhc---------CCC-------------Cchhhhhc Confidence 999999999999874 34555555554444443332222 111 121 11222222 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHHHH Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGPLT 159 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~plt 159 (192) |.+.-+ +-...||...+=..|+.-.. ...| -=|.=.| T Consensus 59 I~~~~~------------------------~~~~~v~~~~~Ya~~vE~GT-------------r~m~----AqPF~~P-- 95 (112) T protein:vir:96 59 ITLEAG------------------------SDRAVVEALTNYSGYLEVGT-------------RKME----AQPFMRP-- 95 (112) T ss_pred eeeecC------------------------ceEEEecCCCCccceeccCc-------------cccC----CCCchhh-- Confidence 222211 11123443333333433321 1111 1133334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 160 QAFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 160 ~afe~e~~~~~~~~~~kEl~~~L~ 183 (192) ||+.+-. .|.++|.. |+ T Consensus 96 -A~~~~~~-----~~~~~l~~-L~ 112 (112) T protein:vir:96 96 -ALDQVVP-----EMVEEMAK-WE 112 (112) T ss_pred -hHHHHHH-----HHHHHHHh-cC Confidence 4433222 12222221 11 No 73 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=53.81 E-value=0.52 Score=22.14 Aligned_cols=127 Identities=13% Similarity=0.171 Sum_probs=57.5 Q ss_pred ChhHHHHHHHHhh-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNS-LDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~-i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++ +..+.+.+.+.+||+..+..+. +.++..+.+|+.....-.++. T Consensus 4 vkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~-----------------------~~lK~~~~~fkDTGatidev~ 60 (133) T protein:vir:96 4 IYDTKKLERELEKRLSKRALMRITDRALTEAGEVVL-----------------------EAIRTNLKYFRDTGAEYGEVK 60 (133) T ss_pred ccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHH-----------------------HHHHHhhHHHhhccceeeeEE Confidence 9999999999975 6777777778888777664332 344555556665322211111 Q ss_pred EE--EeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 80 IR--INRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 80 i~--v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) +. ...+..+.|++|= .+.. .....+ |.-. |+-|.| .||-=-|=.. +. T Consensus 61 ~s~p~~~~g~rtV~i~W-----~gp~----~R~~iV----HLNE-----------~G~ytr-~Gk~i~PrG~------G~ 109 (133) T protein:vir:96 61 LSKPTWENGKRTIRVYW-----EGEK----HRYSIV----HLNE-----------KGFYAK-DGKFIRPKGM------GA 109 (133) T ss_pred ecCceecCCceEEEEEe-----ecCC----CceeeE----eeec-----------ccceec-CCceeccchh------hH Confidence 10 0111233344421 0000 000000 1111 222444 3431111110 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 LTQAFESATQSLIDEEIPKQLGYAL 182 (192) Q Consensus 158 lt~afe~e~~~~~~~~~~kEl~~~L 182 (192) |. .+-++.+..|.+.+..||...| T Consensus 110 I~-~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 110 ID-KALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred HH-HHHHhhhHHHHHHHHHHHHHhC Confidence 22 2334455555556666666666 No 74 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=43.07 E-value=0.86 Score=20.95 Aligned_cols=125 Identities=16% Similarity=0.123 Sum_probs=47.2 Q ss_pred ChhHHHHHHHHhhc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSL-DRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i-~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|+++++++|++. ..+.+.+.+.+||+..|..+. +.++..+.+|+.+...-.+.. T Consensus 5 vkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~-----------------------~~~K~~~~~fkDTGati~ev~ 61 (134) T protein:vir:10 5 VTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIV-----------------------EEIKKQLKPSEDSGALISEIG 61 (134) T ss_pred eecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHH-----------------------HHHHhhcCccccccceeccEe Confidence 99999999999874 466677888888877764332 233445555554221111100 Q ss_pred EEEe---ccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhhee-eccccccccceeeecCch Q lcl|NC_010392. 80 IRIN---RGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMR-RVNGKNRYPIDVVKIPLS 155 (192) Q Consensus 80 i~v~---r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~-R~~Gk~R~PIevvkipis 155 (192) .-+ .+-.+-|.+|= +| ....-||--|.- |+-.+ | .|+-=-|=. - T Consensus 62 -~s~p~~~~G~r~V~vgW--------~G--------------~~~R~~ivHLnE--~Gyt~~r-~Gk~i~PrG------~ 109 (134) T protein:vir:10 62 -RTEPEWIKGKRTVTIRW--------RG--------------PFERFRIVHLIE--NGHVEKK-SGKFVKPKA------M 109 (134) T ss_pred -ecCeeecCCceEEEEEE--------Ec--------------CCceeeEEEeee--cceeecC-CCCeeccch------h Confidence 000 01112222210 00 011111111110 11110 2 121000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 156 GPLTQAFESATQSLIDEEIPKQLGYA 181 (192) Q Consensus 156 ~plt~afe~e~~~~~~~~~~kEl~~~ 181 (192) +.+..++ +..+..|.+.+..||... T Consensus 110 G~i~~a~-~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 110 GGINRAI-RQGQNKYFETLKRELKKL 134 (134) T ss_pred hHHHHHH-HhhhHHHHHHHHHHHhcC Confidence 1122222 222333333333333322 No 75 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=40.98 E-value=0.95 Score=20.71 Aligned_cols=139 Identities=11% Similarity=0.058 Sum_probs=55.8 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchh-hhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQ-VRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~-~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) |++++++...|..+-.+.-|..-..-+..+|..+.+.+-.....+.. +|+. -.......++.+-. ..+ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~-PdG~~W~p~~~~~~~~k~~-----~~~----- 69 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKA-PDGTPYAPRQQQSARKKTG-----RVK----- 69 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCCcccChHHHHHhhc-----CCC----- Confidence 99999999988887666555555556777788888887777776642 2110 00111111111100 000 Q ss_pred EEEeccCcCceecCCcceeecccccccccccceeee----cceecCcceeecCCCCchhhee---eccccccccceeeec Q lcl|NC_010392. 80 IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKI----GPYLFRDAFIQQLANGRWHVMR---RVNGKNRYPIDVVKI 152 (192) Q Consensus 80 i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkv----Gk~~f~gaFi~~~~nGr~~V~~---R~~Gk~R~PIevvki 152 (192) -+.+.-|.-...+. .......+.| |.-..- +++|=|= |..+ ..-.+.| T Consensus 70 -------~~l~~~~~l~~sl~-----~~~~~~~a~vg~~~Gt~~~y---------AaiHQfG~~~~~~~----~~~~~~i 124 (150) T protein:vir:60 70 -------RKMFAKLITSRFLH-----IRASPEQASMEFYGGKSPKI---------ASVHQFGLSEENRK----DGKKIDY 124 (150) T ss_pred -------ccchhhhhhcceee-----eeeeCcEEEEEeeCCCchhh---------hhhhhccccccccC----CCCceec Confidence 00000000000000 0000111111 110000 0111110 1001 1112344 Q ss_pred CchHH---HHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 153 PLSGP---LTQAFESATQSLIDEEIPK 176 (192) Q Consensus 153 pis~p---lt~afe~e~~~~~~~~~~k 176 (192) |= .| +++.=+.|+...+.+.|.+ T Consensus 125 Pa-Rp~LG~s~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 125 PA-RPLLGFTGEDVQMIEEIILAHLDR 150 (150) T ss_pred CC-cccCCCCHHHHHHHHHHHHHHHhC Confidence 42 22 3344456666666666666 No 76 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=36.16 E-value=0.78 Score=21.16 Aligned_cols=82 Identities=12% Similarity=0.162 Sum_probs=37.1 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) ++||++|++.|+.... ..++.+.+.+.+..+...+.......|+ -+|.=+++.-. .+...+.| T Consensus 8 ~~Gld~L~~~L~~~~~---~~~v~~vv~~~~~~l~~~ak~~ap~dTG------------~lrrSI~~~~~--~~g~~~~v 70 (92) T protein:vir:99 8 WDGLDALDEALANQQN---MNTVKKVVKKHTANLMTATQQAVPVDTG------------HLKQSAQIQIS--RDGFTGSV 70 (92) T ss_pred eehHHHHHHHHHhhcc---HHHHHHHHHHHHHHHHHHHHHhCCCCcc------------ccceeeeEEee--cCCeeEEE Confidence 9999999999976443 2556677777777776666554433331 11111111111 11123333 Q ss_pred EEe---ccCcCceecCCcceeecccccccccccceeeecceecCcc Q lcl|NC_010392. 81 RIN---RGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDA 123 (192) Q Consensus 81 ~v~---r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~ga 123 (192) .+. ..==|-+-.|| +|-.| T Consensus 71 ~~~gp~a~Ya~YvE~GT------------------------R~M~A 92 (92) T protein:vir:99 71 TYGGGLVNYAAYVEFGT------------------------RFMDS 92 (92) T ss_pred EeccCccccccccccce------------------------eecCC Confidence 221 11111112222 12222 No 77 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=31.10 E-value=1.5 Score=19.58 Aligned_cols=178 Identities=13% Similarity=0.179 Sum_probs=78.0 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHH----HHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCe Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVA----QKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKR 76 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva----~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~ 76 (192) ++-+.+..+.|..+...++-.|..++.-+.+ ...+...+.==+.-++ .|. ...|..+.=+.++.-. -++...| T Consensus 10 ~~~~~~~l~~l~~~~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~-~~~-Rlti~k~As~~~L~A~-I~ar~rp 86 (205) T protein:vir:63 10 LGEFRDYVDRLPDISQQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLK-DDS-RLGVTKKATRNDLEAV-IGARQRP 86 (205) T ss_pred HHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhc-cce-eeEEEeecCCCCeeEE-EecCCCc Confidence 7777777777777777777677666665554 3332222221111111 000 0011111111111000 0012223 Q ss_pred EEEEEE--eccCcCceecCCcceeecccccccccccceeee--cce----ecCcceeecCCCCchhheeeccccccc--c Q lcl|NC_010392. 77 SARIRI--NRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKI--GPY----LFRDAFIQQLANGRWHVMRRVNGKNRY--P 146 (192) Q Consensus 77 ~a~i~v--~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkv--Gk~----~f~gaFi~~~~nGr~~V~~R~~Gk~R~--P 146 (192) .+..+. ...-.-+++-|.+.++-.+.+.....++..++. |.- .+.=+--.....|+|.++++ |..-+ + T Consensus 87 t~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~--g~~k~~~~ 164 (205) T protein:vir:63 87 TSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATD--GATKLSNN 164 (205) T ss_pred ceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCcccccccc--CceecCCc Confidence 333333 112222344556666655544444444443332 222 11223445678889999988 44322 4 Q ss_pred ceeeecCc-hHH---HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 147 IDVVKIPL-SGP---LTQAFESATQSLIDEEIPKQLGYALK 183 (192) Q Consensus 147 Ievvkipi-s~p---lt~afe~e~~~~~~~~~~kEl~~~L~ 183 (192) |-+.--|= +.. ..|..++++...+.++|.+|+...++ T Consensus 165 ~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~r~~~~~~~ 205 (205) T protein:vir:63 165 VYLLYGPSVDQVFRTVADDITTEVLDALADEFLRQFTRLSE 205 (205) T ss_pred eEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHHHhhhhhcC Confidence 55555554 221 22344444444455555555555555 No 78 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=30.45 E-value=1.3 Score=19.97 Aligned_cols=143 Identities=20% Similarity=0.204 Sum_probs=50.7 Q ss_pred ChhHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARI 80 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i 80 (192) |++|+++...|+.+-...=|.+-...+..+|..+.+.+-.....+.. +| |-|=...+. ....... +. T Consensus 1 m~~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~-Pd----G~~W~p~~~-~~~~~~~--g~----- 67 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQA-PD----GTPYAARKR-QPVRSKK--GR----- 67 (149) T ss_pred CchHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcC-CC----CCCCcccch-hhhhhcc--Cc----- Confidence 99999999998887665445555556677777777777776666532 11 111111000 0000000 00 Q ss_pred EEeccCcCceecCCcceeecccccccccccceeee---cceecCcceeecCCCCchhheeeccccccccceeeecCchHH Q lcl|NC_010392. 81 RINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKI---GPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP 157 (192) Q Consensus 81 ~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkv---Gk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p 157 (192) ...++- -+|...+. + +.. .....+.| |.-..=.+.-|-...+ |..++ .-.|+||= .| T Consensus 68 --~~~~~~-~~l~~~~~-l---~~~--~~~~~~~v~~~Gtn~~yAaiHQfG~~~------r~~~~----~~~v~iPa-Rp 127 (149) T protein:vir:18 68 --IKREMF-AKLRTSRF-M---KAK--GSDSAAVVEFTGKVQRMARVHQYGLKD------RPNRN----SRDVQYEA-RP 127 (149) T ss_pred --ccchhh-hhhhhhhh-h---hee--ecCceeEEEecccchhhhhhhhccccc------cccCC----Cccccccc-cc Confidence 000000 00000000 0 000 00000111 1111000000000000 00011 11234442 11 Q ss_pred ---HHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 158 ---LTQAFESATQSLIDEEIPK 176 (192) Q Consensus 158 ---lt~afe~e~~~~~~~~~~k 176 (192) +++.=+.|+..++.+.|.+ T Consensus 128 ~LG~s~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 128 LLGFTRDDEQMIEDVIISHLGK 149 (149) T ss_pred cCCCCHHHHHHHHHHHHHHHhC Confidence 2333345555555555554 No 79 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=30.22 E-value=1.6 Score=19.47 Aligned_cols=119 Identities=17% Similarity=0.144 Sum_probs=49.2 Q ss_pred ChhHHHHHHHHhhcchh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchh---hhhhceeeeecCCC Q lcl|NC_010392. 1 MKGLENAIRNLNSLDRQ----MVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLK---LVRQRVRLFKAGTD 73 (192) Q Consensus 1 ~~gl~~~i~nL~~i~~~----~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k---~vr~R~r~~ka~~~ 73 (192) ++||+++..||+.+-.. .+++|.-+|+.-.| ++|+. . .|.+ ||=. T Consensus 3 V~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~----~~AA~----~----------TPIDTSTLiNS---------- 54 (131) T protein:vir:10 3 VKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGA----NHAAV----I----------TPVKSSTLINS---------- 54 (131) T ss_pred cchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHH----hhhhh----c----------cccchhhhccc---------- Confidence 99999999999876554 44555555555444 44443 2 2222 1111 Q ss_pred CCeEEEEEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecC-CCCchhheeeccccccccceeeec Q lcl|NC_010392. 74 GKRSARIRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQL-ANGRWHVMRRVNGKNRYPIDVVKI 152 (192) Q Consensus 74 ~~~~a~i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~-~nGr~~V~~R~~Gk~R~PIevvki 152 (192) .+-.|.+|. .+..+|=|=. ++ .-+==|--||-..+|- ++|. +=| += T Consensus 55 --Qfrei~~ng------------tritGRVGYS--An--YA~yVHda~Gklkgqprp~gk-gn~--------------w~ 101 (131) T protein:vir:10 55 --QYKKLEPIP------------SGMIGRVGYT--AN--YAAAVNAAKGKLKGKPRPDGS-GNY--------------WD 101 (131) T ss_pred --cceeeeccC------------ceeEEeeccc--ee--eeeeeecCccccCCCcCCCCC-cce--------------ec Confidence 122222221 1111111000 00 0000111112211111 1111 112 33 Q ss_pred CchHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 153 PLSGP--LTQAFESATQSLIDEEIPKQLGY 180 (192) Q Consensus 153 pis~p--lt~afe~e~~~~~~~~~~kEl~~ 180 (192) |-++| |+..||+.-.+.++..+.+|..- T Consensus 102 p~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 102 PNGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred CCCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 55556 89999864344444444444433 No 80 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=29.65 E-value=1.6 Score=19.40 Aligned_cols=126 Identities=14% Similarity=0.038 Sum_probs=47.5 Q ss_pred ChhHHHHHHHHhhc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSL-DRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i-~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++. ..+.+.+.+.+||+..|..+....-+ .+.+|+.....-.+.. T Consensus 5 vkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~-----------------------~~~~fkDTG~t~~ev~ 61 (134) T protein:vir:10 5 VIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKK-----------------------QLKPSKDTGALINEVS 61 (134) T ss_pred EecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHh-----------------------hhhhhhhccceeccEE Confidence 99999999999874 46778888888888877555443333 3334443111111100 Q ss_pred ---EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchH Q lcl|NC_010392. 80 ---IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSG 156 (192) Q Consensus 80 ---i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~ 156 (192) ++ ..+..+.|++|= .+ . -..-||--|.- |+-+++-.|+-=-|=. -+ T Consensus 62 ~s~p~-~~~G~r~V~vgW-----~G---~--------------~~R~~iiHLNE--~Gytr~~~Gk~i~PrG------~G 110 (134) T protein:vir:10 62 FSKPE-WINGKRTITVHW-----RG---S--------------KDRYKIVHLIE--YGHVQKGTGKFIKPKA------MG 110 (134) T ss_pred ecCee-ecCCceEEEEEE-----Ec---C--------------CceeEEEEeec--ccceecccCCccCcch------hh Confidence 00 001112222220 00 0 00011111110 1112210121000000 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 157 PLTQAFESATQSLIDEEIPKQLGYA 181 (192) Q Consensus 157 plt~afe~e~~~~~~~~~~kEl~~~ 181 (192) .+..++ +..+..|.+.+..||... T Consensus 111 ~i~~a~-~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 111 GVNRAI-RQGQNKYFETLKRELKKL 134 (134) T ss_pred HHHHHH-HhhhHHHHHHHHHHHhcC Confidence 122222 223333333333333322 No 81 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=29.65 E-value=1.6 Score=19.40 Aligned_cols=126 Identities=14% Similarity=0.038 Sum_probs=47.5 Q ss_pred ChhHHHHHHHHhhc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEE Q lcl|NC_010392. 1 MKGLENAIRNLNSL-DRQMVPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSAR 79 (192) Q Consensus 1 ~~gl~~~i~nL~~i-~~~~Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~ 79 (192) ++|++++++||++. ..+.+.+.+.+||+..|..+....-+ .+.+|+.....-.+.. T Consensus 5 vkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~-----------------------~~~~fkDTG~t~~ev~ 61 (134) T protein:vir:95 5 VIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKK-----------------------QLKPSKDTGALINEVS 61 (134) T ss_pred EecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHh-----------------------hhhhhhhccceeccEE Confidence 99999999999874 46778888888888877555443333 3334443111111100 Q ss_pred ---EEEeccCcCceecCCcceeecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchH Q lcl|NC_010392. 80 ---IRINRGNLPAIKLGAAQVRMSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSG 156 (192) Q Consensus 80 ---i~v~r~~lpaikLg~~~~~~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~ 156 (192) ++ ..+..+.|++|= .+ . -..-||--|.- |+-+++-.|+-=-|=. -+ T Consensus 62 ~s~p~-~~~G~r~V~vgW-----~G---~--------------~~R~~iiHLNE--~Gytr~~~Gk~i~PrG------~G 110 (134) T protein:vir:95 62 FSKPE-WINGKRTITVHW-----RG---S--------------KDRYKIVHLIE--YGHVQKGTGKFIKPKA------MG 110 (134) T ss_pred ecCee-ecCCceEEEEEE-----Ec---C--------------CceeEEEEeec--ccceecccCCccCcch------hh Confidence 00 001112222220 00 0 00011111110 1112210121000000 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 157 PLTQAFESATQSLIDEEIPKQLGYA 181 (192) Q Consensus 157 plt~afe~e~~~~~~~~~~kEl~~~ 181 (192) .+..++ +..+..|.+.+..||... T Consensus 111 ~i~~a~-~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 111 GVNRAI-RQGQNKYFETLKRELKKL 134 (134) T ss_pred HHHHHH-HhhhHHHHHHHHHHHhcC Confidence 122222 223333333333333322 No 82 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=29.36 E-value=1.7 Score=19.37 Aligned_cols=115 Identities=17% Similarity=0.237 Sum_probs=56.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhcchhhhhhceeeeecCCCCCeEEEEEEeccCcCceecCCccee Q lcl|NC_010392. 19 VPRASIWAVNRVAQKAVSVATRKVARETVAGDNQVRGLPLKLVRQRVRLFKAGTDGKRSARIRINRGNLPAIKLGAAQVR 98 (192) Q Consensus 19 Vp~A~arAiNrva~~a~s~s~k~va~e~~~~~~~~~~I~~k~vr~R~r~~ka~~~~~~~a~i~v~r~~lpaikLg~~~~~ 98 (192) |.+++.+++.+++..+.+.+-..+...|+ .++.=+.... ..+...+.|+.+...=+-+-+||..-- T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG------------~Lr~SI~~~~--~~~~~~~~V~~~~~Ya~yvE~GTg~~~ 66 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTG------------YLRESVTMDF--KDGGFTGVINIGSEYAIYVNYGTGIYA 66 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCcccc------------ccccceeEEe--ecCcEEEEEecCCCccceeecCccccc Confidence 88888888888888877776655544442 1222222211 224466777776665566667765433 Q ss_pred ecccccccccccceeeecceecCcceeecCCCCchhheeeccccccccceeeecCchHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010392. 99 MSKRRGKLLYRGSVLKIGPYLFRDAFIQQLANGRWHVMRRVNGKNRYPIDVVKIPLSGP-LTQAFESATQSLIDEEIPKQ 177 (192) Q Consensus 99 ~~~rr~~~~~~~s~lkvGk~~f~gaFi~~~~nGr~~V~~R~~Gk~R~PIevvkipis~p-lt~afe~e~~~~~~~~~~kE 177 (192) -.+. +. .+ + .++ |.-.-.+|.|| ++.|.. +.| |..||+++... |.+. T Consensus 67 ~~~~-~~--~~---~-----~~~--~~~~~~~g~~~---~t~g~~-----------a~Pfl~pA~~~~~~~-----i~k~ 114 (116) T protein:vir:95 67 TGAG-GS--RA---K-----NIP--WSYKDANGKWH---TTKGQH-----------AQPFWEPAIDAGRAF-----FNKY 114 (116) T ss_pred cCCC-cc--cc---c-----ccc--ceeecCcccee---eCCCCC-----------CCcchHHHHHHHHHH-----HHHh Confidence 2111 00 00 0 111 11122345554 333421 334 77788776642 3333 Q ss_pred HH Q lcl|NC_010392. 178 LG 179 (192) Q Consensus 178 l~ 179 (192) |. T Consensus 115 is 116 (116) T protein:vir:95 115 FS 116 (116) T ss_pred hC Confidence 33 Done!