Query         lcl|Aclame:protein:vir:7651|NCBI_annot:gp122|genbank:acc:NP_818195;genbank:gi:29566629;genbank:GeneID:1259879
Match_columns 151
No_of_seqs    3 out of 6
Neff          1.7
Searched_HMMs 1612
Date          Sat Nov 30 21:36:24 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_119 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_119_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:106041 Length: 137  96.0 2.9E-05 1.8E-08   45.5   5.1  115     1-141     5-137 (137)
  2 protein:vir:97982 Length: 140   95.1 6.8E-05 4.2E-08   43.4   3.8  115     1-141     8-140 (140)
  3 protein:vir:107545 Length: 140  95.1 6.8E-05 4.2E-08   43.4   3.8  115     1-141     8-140 (140)
  4 protein:vir:102441 Length: 137  94.8 0.00023 1.4E-07   40.5   5.9  112     1-144     5-137 (137)
  5 protein:vir:94654 Length: 142   93.5 0.00064 3.9E-07   38.1   5.8  115     1-146     6-142 (142)
  6 protein:vir:106506 Length: 137  92.7 0.00091 5.6E-07   37.3   5.4  116     1-148     6-137 (137)
  7 protein:vir:8669 Length: 142 #  91.5  0.0015 9.3E-07   36.1   5.2  114     1-139     6-142 (142)
  8 protein:vir:99101 Length: 142   91.5  0.0015 9.3E-07   36.1   5.2  114     1-139     6-142 (142)
  9 protein:vir:107099 Length: 137  91.5   0.001 6.5E-07   36.9   4.3  108     1-147    15-137 (137)
 10 protein:vir:105330 Length: 137  89.6  0.0024 1.5E-06   34.9   4.6  107     1-147     2-137 (137)
 11 protein:vir:105916 Length: 149  86.9  0.0052 3.3E-06   33.1   4.6  107     1-143    27-149 (149)
 12 protein:vir:96121 Length: 137   83.7  0.0066 4.1E-06   32.5   3.6  108     1-147     4-137 (137)
 13 protein:vir:96829 Length: 135   81.2    0.01 6.5E-06   31.4   3.7  110     1-143     1-135 (135)
 14 protein:vir:94108 Length: 149   79.2   0.026 1.6E-05   29.2   5.2  111     1-143    27-149 (149)
 15 protein:vir:106570 Length: 182  72.5    0.13 8.2E-05   25.4   7.2  138     1-149     2-182 (182)
 16 protein:vir:94796 Length: 137   72.3   0.044 2.7E-05   28.0   4.5  112     1-147     4-137 (137)
 17 protein:vir:5978 Length: 144 #  59.5    0.18 0.00011   24.7   5.2  114     1-147     4-144 (144)
 18 protein:vir:95062 Length: 116   57.0     0.2 0.00012   24.4   5.0  107     1-147     1-116 (116)
 19 protein:vir:95894 Length: 137   52.0    0.23 0.00014   24.1   4.5  107     1-147    15-137 (137)
 20 protein:vir:1243 Length: 116 #  48.8    0.54 0.00034   22.0   6.0  104     5-147     1-116 (116)
 21 protein:vir:97327 Length: 116   48.8    0.54 0.00034   22.0   6.0  104     5-147     1-116 (116)
 22 protein:vir:93738 Length: 137   42.6    0.55 0.00034   22.0   5.0  108     1-147     1-137 (137)
 23 protein:vir:97427 Length: 137   42.6    0.55 0.00034   22.0   5.0  108     1-147     1-137 (137)
 24 protein:vir:94490 Length: 137   42.6    0.55 0.00034   22.0   5.0  108     1-147     1-137 (137)
 25 protein:vir:78077 Length: 141   35.8     1.2 0.00075   20.1   6.0  115     1-150    11-141 (141)
 26 protein:vir:101594 Length: 173  35.2       1 0.00062   20.6   5.2  136     1-145    13-173 (173)
 27 protein:vir:79225 Length: 155   34.1    0.42 0.00026   22.7   3.0  116     1-150    23-155 (155)
 28 protein:vir:99196 Length: 155   21.5     1.2 0.00077   20.1   3.2  124     1-150    23-155 (155)

No 1
>protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069
Probab=96.04 E-value=2.9e-05 Score=45.47 Aligned_cols=115 Identities=17% Similarity=0.248 Sum_probs=56.7

Q ss_pred             CcCC-----chHHHHHHHHHHHHHHHHHhhhccccc----cccceeec-----cccc----ceehhhhheeeeeecCCCc
Q lcl|Aclame:pro    1 MRVG-----APVELNRVIARRAVQYAREDMRGRGWT----STGALQPY-----SDTG----AVGISSTMKHLLIQNKGFD   62 (151)
Q Consensus         1 ~rvg-----lP~~l~rvIs~~A~~~ar~d~rgRGWr----Sagalqp~-----s~~G----~VGirstmkhllyQn~G~~   62 (151)
.|+- |..++ ..+.+.++..+-.++....+. .+|+|.-.
++.+ .+.+-+++.|-.|.+-|++ T Consensus 5 ~~i~i~~~~l~~~v-~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~GT~ 83 (137) T protein:vir:10 5 ARIHINEPELERQT-GAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHEGSR 83 (137) T ss_pred EEEeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeeecCC Confidence 1111 11111 122233333333222221111 23444321 1111 2335678999999999999 Q ss_pred ceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHH Q lcl|Aclame:pro 63 PFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDA 141 (151) Q Consensus 63 pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~ 141 (151) |+.-. |-..+. ++.. .+|+ .+-.. +=+|||-+|+.||+-|+.++ .+++|+|+-. T Consensus 84 ph~I~-------pk~~k~------l~f~---------~~G~-~v~~k-~v~hpG~~a~Pfl~~A~~~~-~~~~~ri~~~ 137 (137) T protein:vir:10 84 PHRIT-------ARHANA------LHFF---------WHGR-EVFRK-SVWHPGVRPRPFLRNAARRV-VAADPDIHMT 137 (137) T ss_pred Cceee-------cccCce------eeee---------eCCc-eEEee-eeecCCCCCCchHHHHHHHH-hhccccccCC Confidence 97522 221111 1110 1111 11111 23499999999999999886 6789999865 No 2 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=95.07 E-value=6.8e-05 Score=43.44 Aligned_cols=115 Identities=17% Similarity=0.229 Sum_probs=57.5 Q ss_pred CcCCchHHH----HHHHHHHHHHHHHHhhhcc-----cccccccee------e--cccccc-eehhhhheeeeeecCCCc Q lcl|Aclame:pro 1 MRVGAPVEL----NRVIARRAVQYAREDMRGR-----GWTSTGALQ------P--YSDTGA-VGISSTMKHLLIQNKGFD 62 (151) Q Consensus 1 ~rvglP~~l----~rvIs~~A~~~ar~d~rgR-----GWrSagalq------p--~s~~G~-VGirstmkhllyQn~G~~ 62 (151) .++.+.... ...+.+.+++++-..+... -|+ +|.|. . 
.+..+- ..+.+++.|-.|..-|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvd-tG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ 86 (140) T protein:vir:97 8 ARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVR-TGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSR 86 (140) T ss_pred eeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-chhhhccceeeeeeCCCceEEEEecCCccchhhhccCCC Confidence 333333222 2222333333222222111 111 22221 1 111111 234577999999999999 Q ss_pred ceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHH Q lcl|Aclame:pro 63 PFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDA 141 (151) Q Consensus 63 pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~ 141 (151) |+. |..+.....+| +..|+ -+--.+=.|||-+|+.||+-|+... .+++++|+.- T Consensus 87 ph~----------I~pk~~k~L~~------------~~~G~--~~~~k~V~hpG~~a~Pfl~~A~~~~-~~~~~~i~~~ 140 (140) T protein:vir:97 87 PHA----------IRARNAQYLHF------------WWHGR--EMFRKSVWHPGTRARPFMRNSAQRV-VTNDPRVRMT 140 (140) T ss_pred Cce----------eecCCCcccee------------ecCCC--EEEeeeeecCCCCCChhHHHHHHHH-hhhhhhccCC Confidence 984 33332111111 11121 1111223699999999999999877 5688999865 No 3 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=95.07 E-value=6.8e-05 Score=43.44 Aligned_cols=115 Identities=17% Similarity=0.229 Sum_probs=57.5 Q ss_pred CcCCchHHH----HHHHHHHHHHHHHHhhhcc-----cccccccee------e--cccccc-eehhhhheeeeeecCCCc Q lcl|Aclame:pro 1 MRVGAPVEL----NRVIARRAVQYAREDMRGR-----GWTSTGALQ------P--YSDTGA-VGISSTMKHLLIQNKGFD 62 (151) Q Consensus 1 ~rvglP~~l----~rvIs~~A~~~ar~d~rgR-----GWrSagalq------p--~s~~G~-VGirstmkhllyQn~G~~ 62 (151) .++.+.... ...+.+.+++++-..+... -|+ +|.|. . 
.+..+- ..+.+++.|-.|..-|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvd-tG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ 86 (140) T protein:vir:10 8 ARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVR-TGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSR 86 (140) T ss_pred eeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-chhhhccceeeeeeCCCceEEEEecCCccchhhhccCCC Confidence 333333222 2222333333222222111 111 22221 1 111111 234577999999999999 Q ss_pred ceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHH Q lcl|Aclame:pro 63 PFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDA 141 (151) Q Consensus 63 pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~ 141 (151) |+. |..+.....+| +..|+ -+--.+=.|||-+|+.||+-|+... .+++++|+.- T Consensus 87 ph~----------I~pk~~k~L~~------------~~~G~--~~~~k~V~hpG~~a~Pfl~~A~~~~-~~~~~~i~~~ 140 (140) T protein:vir:10 87 PHA----------IRARNAQYLHF------------WWHGR--EMFRKSVWHPGTRARPFMRNSAQRV-VTNDPRVRMT 140 (140) T ss_pred Cce----------eecCCCcccee------------ecCCC--EEEeeeeecCCCCCChhHHHHHHHH-hhhhhhccCC Confidence 984 33332111111 11121 1111223699999999999999877 5688999865 No 4 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=94.77 E-value=0.00023 Score=40.52 Aligned_cols=112 Identities=19% Similarity=0.303 Sum_probs=58.2 Q ss_pred CcCCc-----hHHHHHHHHHHHHHHHHHhhhcc-----ccccccceeec------ccccc----eehhhhheeeeeecCC Q lcl|Aclame:pro 1 MRVGA-----PVELNRVIARRAVQYAREDMRGR-----GWTSTGALQPY------SDTGA----VGISSTMKHLLIQNKG 60 (151) Q Consensus 1 ~rvgl-----P~~l~rvIs~~A~~~ar~d~rgR-----GWrSagalqp~------s~~G~----VGirstmkhllyQn~G 60 (151) +|+-. ..++.. +.+++++.+-.++... -|+ +|.|... 
.+.++ .++-+++.|-.|..-| T Consensus 5 ~~~~~~~~~~~~~~~~-v~r~~l~~~a~~v~~~Ak~~aPv~-tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~G 82 (137) T protein:vir:10 5 ARYERNPVGEARQFQV-IARRRLSRITRGTANQARADVPVK-TGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDG 82 (137) T ss_pred EEeccCchhHHHHHHH-HHHHHHHHHHHHHHHHHHhcCCcc-chhhhcCceeeeeeccccceEEEEecCCCccceeeecC Confidence 44432 233222 4455665544333221 122 2333322 23332 3455779999999999 Q ss_pred CcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCcee-cccccccCCCCchhHHHHHHHHHHHHhhhHHHH Q lcl|Aclame:pro 61 FDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKI-WRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIR 139 (151) Q Consensus 61 ~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~-WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR 139 (151) ++|.. |..+ -+.|+++.+++++. .+. +=+|||-+|+.||+-|+.+++- +- T Consensus 83 T~ph~----------I~Pk-------------~~k~~l~~~~~g~~vf~k-~V~hPG~~a~PfL~~A~~~~~~----~~- 133 (137) T protein:vir:10 83 TRAHV----------IRPR-------------RPGGVLRFTVGGRVVYAR-RVNHPGTRARPFLRNAAERVVA----RE- 133 (137) T ss_pred CCCce----------eecc-------------ccceeeeEeeCCeeEecc-eeecCCCCCCchHHHHHHHhhh----hh- Confidence 98853 2111 13455555554432 222 3459999999999987766543 21 Q ss_pred HHHHH Q lcl|Aclame:pro 140 DAAMT 144 (151) Q Consensus 140 ~~~~t 144 (151) -+.| T Consensus 134 -~~~~ 137 (137) T protein:vir:10 134 -TATS 137 (137) T ss_pred -cccC Confidence 1222 No 5 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=93.54 E-value=0.00064 Score=38.11 Aligned_cols=115 Identities=30% Similarity=0.347 Sum_probs=56.6 Q ss_pred CcCCchHHHHH----------HHHHHHHHHHHHhh----hccccccccceee--------cccccceehhhhheeeeeec Q lcl|Aclame:pro 1 MRVGAPVELNR----------VIARRAVQYAREDM----RGRGWTSTGALQP--------YSDTGAVGISSTMKHLLIQN 58 (151) Q Consensus 1 ~rvglP~~l~r----------vIs~~A~~~ar~d~----rgRGWrSagalqp--------~s~~G~VGirstmkhllyQn 58 (151) .||++.+ |.+ .....++..+-.++ 
+-.-=..+|+|.- .+.+..+.+-+++.|..|+. T Consensus 6 ~~~~~~~-l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~~vE 84 (142) T protein:vir:94 6 YRVNSTE-FQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAADVE 84 (142) T ss_pred EEecHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccchhhh Confidence 4566531 221 11222222222221 1111112333322 22233466778899999999 Q ss_pred CCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHH Q lcl|Aclame:pro 59 KGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDI 138 (151) Q Consensus 59 ~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~I 138 (151) -|++|+. |+.+. +++. ..++.+. +. .+=+|||.+|+.||+-|+.+ +++.| T Consensus 85 ~Gt~~~~----------i~pk~-------------~k~l-~~~~~~~-~~-~~v~~pG~~~~pfl~~A~~~----~~~~i 134 (142) T protein:vir:94 85 YGTAPHV----------IVPKD-------------KKAL-YWPGAAH-PV-AKVNHPGTRAQPFMRPAIAA----ASTFL 134 (142) T ss_pred ccCCCce----------eccCC-------------Cccc-eecccce-ee-eeeeecCCCCCcchhHHHHH----HHHHH Confidence 9999974 22221 1111 1222221 11 11259999999999988765 55666 Q ss_pred HHHHHHHh Q lcl|Aclame:pro 139 RDAAMTIL 146 (151) Q Consensus 139 R~~~~ti~ 146 (151) .+.+.-|= T Consensus 135 ~~~~~~~~ 142 (142) T protein:vir:94 135 RNHAKGIR 142 (142) T ss_pred HHHHHhcC Confidence 55555443 No 6 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=92.74 E-value=0.00091 Score=37.26 Aligned_cols=116 Identities=16% Similarity=0.179 Sum_probs=55.5 Q ss_pred CcCCchH--HHHHHHHHHHH--------HHHHHhhh---ccccccccceeeccccc---ceehhhhheeeeeecCCCcce Q lcl|Aclame:pro 1 MRVGAPV--ELNRVIARRAV--------QYAREDMR---GRGWTSTGALQPYSDTG---AVGISSTMKHLLIQNKGFDPF 64 (151) Q Consensus 1 ~rvglP~--~l~rvIs~~A~--------~~ar~d~r---gRGWrSagalqp~s~~G---~VGirstmkhllyQn~G~~pf 64 (151) .++-.++ 
++.-.+.+.++ ..|+++.- |.=.+|-. .++..+.| ..++-++++|-.|..-|++|+ T Consensus 6 ~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~-~~~~~~~g~~v~~~V~~~~~YA~~ve~GT~ph 84 (137) T protein:vir:10 6 LRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGR-LVLGRERGAVVIGSVEYTARYAAAVHNGRRAL 84 (137) T ss_pred cccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccce-eeeeeccccEEEEEecCCcccceeeecCCCCc Confidence 2332221 11222222322 23333321 11111111 11211212 234558899999999999996 Q ss_pred eeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|Aclame:pro 65 VMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDAAMT 144 (151) Q Consensus 65 lM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~~~t 144 (151) . |..|... .+..+.+|+..--.+=+|||.+|+.||+-|+.+.+.. +-.- T Consensus 85 ~----------I~pk~~k--------------aL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~-------~~~~ 133 (137) T protein:vir:10 85 T----------IRAKGNG--------------RLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQ-------EGFR 133 (137) T ss_pred e----------eecCCCc--------------cceeecCCeeEeccceecCCCCCChhhHHHHHHhhcc-------ccee Confidence 3 4444311 2222222333323355899999999999888755432 2333 Q ss_pred Hhcc Q lcl|Aclame:pro 145 ILSG 148 (151) Q Consensus 145 i~~~ 148 (151) ++-| T Consensus 134 ~~~~ 137 (137) T protein:vir:10 134 VTIG 137 (137) T ss_pred EeeC Confidence 4444 No 7 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=91.53 E-value=0.0015 Score=36.08 Aligned_cols=114 Identities=20% Similarity=0.306 Sum_probs=51.5 Q ss_pred CcC----CchHHHHHH---HHHHHHHHHHHhhhccc-----cccccceeec------cccc----ceehhhhheeeeeec Q lcl|Aclame:pro 1 MRV----GAPVELNRV---IARRAVQYAREDMRGRG-----WTSTGALQPY------SDTG----AVGISSTMKHLLIQN 58 (151) Q Consensus 1 ~rv----glP~~l~rv---Is~~A~~~ar~d~rgRG-----WrSagalqp~------s~~G----~VGirstmkhllyQn 58 (151) .|+ -.+..+.++ 
+.+.++..+-.++...- |+ +|.|.-. .+.. ..|+-++..|-.|.+ T Consensus 6 ~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~-tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve 84 (142) T protein:vir:86 6 VRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVL-TGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVH 84 (142) T ss_pred EEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc-chhhhcceeeeeccccccceEEEEeccCccccceec Confidence 111 123333332 33444443333332211 11 2333211 0111 234557889999999 Q ss_pred CCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHh-hhHH Q lcl|Aclame:pro 59 KGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKE-SKRD 137 (151) Q Consensus 59 ~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike-~k~~ 137 (151) .|++|+. |..+.....+| +.+|+ .+-.. +=.|||-+|+.||+-|+.....+ -+-. T Consensus 85 ~GT~ph~----------i~pk~~~al~f------------~~~g~-~~~~k-~v~hpG~~a~Pfl~~A~~~~~~~~~~~~ 140 (142) T protein:vir:86 85 EGTRPHV----------IRAKHAQALHF------------WWRGR-EVFVR-QVNHPGTRARPYLRNAGEAVVRRDRRIR 140 (142) T ss_pred cCCccce----------eccccCceeeE------------ecCCc-eeeee-eeecCCCCCCchhHHHHHHHHhhhhhhc Confidence 9999963 33332111111 11111 11111 23599999999999888654433 2222 Q ss_pred HH Q lcl|Aclame:pro 138 IR 139 (151) Q Consensus 138 IR 139 (151) +| T Consensus 141 ~r 142 (142) T protein:vir:86 141 VR 142 (142) T ss_pred cC Confidence 33 No 8 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=91.53 E-value=0.0015 Score=36.08 Aligned_cols=114 Identities=20% Similarity=0.306 Sum_probs=51.5 Q ss_pred CcC----CchHHHHHH---HHHHHHHHHHHhhhccc-----cccccceeec------cccc----ceehhhhheeeeeec Q lcl|Aclame:pro 1 MRV----GAPVELNRV---IARRAVQYAREDMRGRG-----WTSTGALQPY------SDTG----AVGISSTMKHLLIQN 58 (151) Q Consensus 1 ~rv----glP~~l~rv---Is~~A~~~ar~d~rgRG-----WrSagalqp~------s~~G----~VGirstmkhllyQn 58 (151) .|+ -.+..+.++ +.+.++..+-.++...- |+ 
+|.|.-. .+.. ..|+-++..|-.|.+ T Consensus 6 ~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~-tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve 84 (142) T protein:vir:99 6 VRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVL-TGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVH 84 (142) T ss_pred EEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc-chhhhcceeeeeccccccceEEEEeccCccccceec Confidence 111 123333332 33444443333332211 11 2333211 0111 234557889999999 Q ss_pred CCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHh-hhHH Q lcl|Aclame:pro 59 KGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKE-SKRD 137 (151) Q Consensus 59 ~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike-~k~~ 137 (151) .|++|+. |..+.....+| +.+|+ .+-.. +=.|||-+|+.||+-|+.....+ -+-. T Consensus 85 ~GT~ph~----------i~pk~~~al~f------------~~~g~-~~~~k-~v~hpG~~a~Pfl~~A~~~~~~~~~~~~ 140 (142) T protein:vir:99 85 EGTRPHV----------IRAKHAQALHF------------WWRGR-EVFVR-QVNHPGTRARPYLRNAGEAVVRRDRRIR 140 (142) T ss_pred cCCccce----------eccccCceeeE------------ecCCc-eeeee-eeecCCCCCCchhHHHHHHHHhhhhhhc Confidence 9999963 33332111111 11111 11111 23599999999999888654433 2222 Q ss_pred HH Q lcl|Aclame:pro 138 IR 139 (151) Q Consensus 138 IR 139 (151) +| T Consensus 141 ~r 142 (142) T protein:vir:99 141 VR 142 (142) T ss_pred cC Confidence 33 No 9 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=91.47 E-value=0.001 Score=36.92 Aligned_cols=108 Identities=16% Similarity=0.170 Sum_probs=50.0 Q ss_pred CcCCchHHHHHHHHHHHHHH--------HHHhhhc------cccccccceeecccccceehhhhheeeeeecCCCcceee Q lcl|Aclame:pro 1 MRVGAPVELNRVIARRAVQY--------AREDMRG------RGWTSTGALQPYSDTGAVGISSTMKHLLIQNKGFDPFVM 66 (151) Q Consensus 1 ~rvglP~~l~rvIs~~A~~~--------ar~d~rg------RGWrSagalqp~s~~G~VGirstmkhllyQn~G~~pflM 66 (151) ++ -+|.++.+.+ +.|++. |++..-- +.|.. 
.+..-+.++.| -+++.|-.|.+-|+.|+-. T Consensus 15 l~-~~~~~~~~~~-~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~--~~~~~~~~~~V--~~~~~Ya~~vE~GT~~~~~ 88 (137) T protein:vir:10 15 LE-DFEKETIRWA-KKGIAKTTTIIHNSIVSNMPVDTGYLRESVSM--DFKKGGLTGVI--NIGSEYAVYVNYGTGIYAV 88 (137) T ss_pred HH-HHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCcCcchhhcCeeE--EeeCCcEEEEE--ecCCCcccccccCcccccc Confidence 11 1333333322 333333 3333321 23322 11222233444 4778999999999988731 Q ss_pred eeecceeeeccCCcccceeeEEeeeCCC-CceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHHHHHH Q lcl|Aclame:pro 67 WWVEGRMVPITDKQTGKTRRIRGREPGK-PGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDAAMTI 145 (151) Q Consensus 67 ~wvEGR~vpItdk~~~~t~~irgrevGK-PGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~~~ti 145 (151) .+ . ++...+ +.+-..|.+. | -.|||..|+.||+-| +.+++++|..- T Consensus 89 ---~~-------~---------~~~~~~~~~~~~~~~~~--~----~~t~g~~a~PFl~pA----~~~~~~~i~k~---- 135 (137) T protein:vir:10 89 ---GP-------G---------GSRAKNIPWCYKDADGH--W----HTTKGQHAQPFWEPA----IDEGRAFFNKY---- 135 (137) T ss_pred ---CC-------C---------ccccccccceeeccccc--e----eccCCCCCCcchhHH----HHHHHHHHHHh---- Confidence 11 0 011111 1111222211 1 237999999999866 55566665543 Q ss_pred hc Q lcl|Aclame:pro 146 LS 147 (151) Q Consensus 146 ~~ 147 (151) |+ T Consensus 136 i~ 137 (137) T protein:vir:10 136 FS 137 (137) T ss_pred cC Confidence 33 No 10 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=89.59 E-value=0.0024 Score=34.90 Aligned_cols=107 Identities=21% Similarity=0.216 Sum_probs=51.1 Q ss_pred Cc--CC----------chHHHHHHHHHHHHHH--------HHHhhhcccccccccee--------ecccccceehhhhhe Q lcl|Aclame:pro 1 MR--VG----------APVELNRVIARRAVQY--------AREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMK 52 (151) Q Consensus 1 ~r--vg----------lP~~l~rvIs~~A~~~--------ar~d~rgRGWrSagalq--------p~s~~G~VGirstmk 52 (151) -. .| +|.++.+.+ +.|++. |+++.-- .+|+|. .-+.++.| -+++. 
T Consensus 2 a~~~~G~~~l~~~l~~~~~~~~~~~-~~al~~~a~~i~~~ak~~aPv----~TG~Lr~SI~~~~~~~~~~~~V--~~~~~ 74 (137) T protein:vir:10 2 AKVKYGNWDLVKELEEFEKETIRWA-KKGIAKTTTIIHNSIVSNMPV----DTGYLRESVSMDFKKGGLTGVI--NIGSE 74 (137) T ss_pred ccchhCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCc----CcchhhcCeeeEecCCcEEEEE--ecCCc Confidence 00 01 355554433 344433 3333321 233332 11223333 46788 Q ss_pred eeeeecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHH Q lcl|Aclame:pro 53 HLLIQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAI 131 (151) Q Consensus 53 hllyQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Ai 131 (151) |..|.+.|+.|+. +..+- .+-...|.+...+ +.+| +|||..|+.||+-|+ T Consensus 75 YA~~vE~GT~~~~----------~~~~~--------~~~~~~~~~~~~~-------~~~~~~t~g~~a~Pfl~pA~---- 125 (137) T protein:vir:10 75 YAVYVNYGTGIYA----------VGPGG--------SRAKNIPWRYKDA-------DGHWHTTKGQHAQPFWEPAI---- 125 (137) T ss_pred cccccccCccccc----------cCCCc--------ccccccceeeecc-------ccccccCCCCCCCcchhHHH---- Confidence 9999999998763 11110 0001112222222 2233 589999999999765 Q ss_pred HhhhHHHHHHHHHHhc Q lcl|Aclame:pro 132 KESKRDIRDAAMTILS 147 (151) Q Consensus 132 ke~k~~IR~~~~ti~~ 147 (151) .+++++|.+-+ + T Consensus 126 ~~~~~~i~k~i----~ 137 (137) T protein:vir:10 126 DEGRAFFNKYF----S 137 (137) T ss_pred HHHHHHHHHhh----C Confidence 45666655433 3 No 11 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=86.93 E-value=0.0052 Score=33.08 Aligned_cols=107 Identities=12% Similarity=0.171 Sum_probs=50.7 Q ss_pred CcCCchHHHHHHHHHHHH--------HHHHHhhhcccccccccee------e--cccccceehhhhheeeeeecCCCcce Q lcl|Aclame:pro 1 MRVGAPVELNRVIARRAV--------QYAREDMRGRGWTSTGALQ------P--YSDTGAVGISSTMKHLLIQNKGFDPF 64 (151) Q Consensus 1 
~rvglP~~l~rvIs~~A~--------~~ar~d~rgRGWrSagalq------p--~s~~G~VGirstmkhllyQn~G~~pf 64 (151) ++ -+|.++.. ..+.|+ ..|++..-- .+|+|. . -+.+|.|| ++..|..|.+-|+.++ T Consensus 27 l~-~~~~~~~~-~~~~~l~~~a~~v~~~ak~~aPv----dTG~L~~SI~~~~~~~g~~~~V~--~~~~YA~~vE~GT~~~ 98 (149) T protein:vir:10 27 LD-KFDKKIEE-WVKKGIAKTTTKIYNTAVALAPV----DLGFLEESIDFKYFDGGLSSVIS--VGADYAIYVEYGTGIY 98 (149) T ss_pred HH-HHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCc----ccchhhccceEEecCCcEEEEEe--cCCCcccccccCcccc Confidence 11 12333332 222222 333333321 123221 1 12344444 6788999999999876 Q ss_pred eeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHHHH Q lcl|Aclame:pro 65 VMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDAAM 143 (151) Q Consensus 65 lM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~~~ 143 (151) --. + + ++...+....+-.+.+ +| -+|||.+|+.||+-| +.+++++|.+.++ T Consensus 99 ~~~---~-------~---------~~~~~~~~~~~~~~~~-~~----~~t~g~~a~PFl~pA----~~~~k~~i~~~i~ 149 (149) T protein:vir:10 99 ATG---P-------G---------GSRATKIPWSFKGDDG-EW----YTTYGQAPQPFWNPA----IDAGRKTFEQYFS 149 (149) T ss_pred ccC---C-------c---------ccccccccceeecccc-ce----ecCCCCCCCcchhHH----HHHHHHHHHHhhC Confidence 211 1 0 0111111111111111 11 158999999999865 5667777777776 No 12 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=83.75 E-value=0.0066 Score=32.54 Aligned_cols=108 Identities=15% Similarity=0.131 Sum_probs=47.7 Q ss_pred CcCC----------chHHHHHHH-------HHHHHHHHHHhhhcccccccccee------ec--ccccceehhhhheeee Q lcl|Aclame:pro 1 MRVG----------APVELNRVI-------ARRAVQYAREDMRGRGWTSTGALQ------PY--SDTGAVGISSTMKHLL 55 (151) Q Consensus 1 ~rvg----------lP~~l~rvI-------s~~A~~~ar~d~rgRGWrSagalq------p~--s~~G~VGirstmkhll 55 (151) ...| +|.++.+++ +.+....|++..-. .+|+|. .. +.++.|| ++..|-. 
T Consensus 4 ~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pv----dTG~L~~Si~~~~~~~g~~~~V~--~~~~YA~ 77 (137) T protein:vir:96 4 VKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPV----DLGFLKESIDFKVTDGGFSSVIS--VGAEYAI 77 (137) T ss_pred hHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----CccchhcCceeEeecCceEEEEe--cCCCccc Confidence 0111 233333222 12222233333321 123221 11 2234444 5678999 Q ss_pred eecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHHHhh Q lcl|Aclame:pro 56 IQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAIKES 134 (151) Q Consensus 56 yQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Aike~ 134 (151) |.+-|++++. .+|+. +... ++ +.+ ..+-+.+| .|||.+|+.||.-|+ .++ T Consensus 78 yvE~GT~~~~---~~~~~--~~~~---~~----------~~~-------~~~~~~~~~~t~g~~a~pFl~pA~----~~~ 128 (137) T protein:vir:96 78 YVEFGTGIYA---TGPGG--SRAR---KL----------PWT-------YKGDDGEWHTTYGQQAQPFWNPAI----DEG 128 (137) T ss_pred ccccCccccc---cCCCc--cccc---cc----------cce-------eeccCcceeecCCCCCCcchhHHH----HHH Confidence 9999998773 11111 0000 00 000 11222333 589999999998765 455 Q ss_pred hHHHHHHHHHHhc Q lcl|Aclame:pro 135 KRDIRDAAMTILS 147 (151) Q Consensus 135 k~~IR~~~~ti~~ 147 (151) ++.|. 
.+|| T Consensus 129 ~~~i~----k~i~ 137 (137) T protein:vir:96 129 RKVFN----RYFS 137 (137) T ss_pred HHHHH----HhhC Confidence 55544 3444 No 13 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=81.20 E-value=0.01 Score=31.44 Aligned_cols=110 Identities=17% Similarity=0.151 Sum_probs=50.9 Q ss_pred Cc---CC----------chHHHHHHHHHHHHHHHHHhh----hcccccccccee--------ecccccceehhhhheeee Q lcl|Aclame:pro 1 MR---VG----------APVELNRVIARRAVQYAREDM----RGRGWTSTGALQ--------PYSDTGAVGISSTMKHLL 55 (151) Q Consensus 1 ~r---vg----------lP~~l~rvIs~~A~~~ar~d~----rgRGWrSagalq--------p~s~~G~VGirstmkhll 55 (151) |= .| +|.++.+.+ +.|++.+-+++ +-.-=..+|+|. .-+.++.|| +++.|.. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~-~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~--~~~~YA~ 77 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWV-KKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVK--IGSNYAV 77 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEe--cCCCccc Confidence 11 01 244443332 33333332221 111111233322 223345555 8899999 Q ss_pred eecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhh Q lcl|Aclame:pro 56 IQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESK 135 (151) Q Consensus 56 yQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k 135 (151) |.+.|+.|+.. ......++ +.+...|.+.. -+|||..|+.||.-|+.. 
++ T Consensus 78 ~ve~GT~~~~~----------~~~~~~~~----------~~~~~~~~g~~------~~~~~~~a~pfl~~A~~~----~~ 127 (135) T protein:vir:96 78 YVNYGTGIYAT----------KGSRAHKI----------PWTYKDPNGKW------HTTYGQMPQPFWEPAIDA----GR 127 (135) T ss_pred hhhcccccccC----------CCcccccc----------ccccccCCcce------eecCCcCCCcchhHHHHH----HH Confidence 99999977732 11110000 00111122222 248999999999876654 55 Q ss_pred HHHHHHHH Q lcl|Aclame:pro 136 RDIRDAAM 143 (151) Q Consensus 136 ~~IR~~~~ 143 (151) ++|-..+. T Consensus 128 ~~~~~~i~ 135 (135) T protein:vir:96 128 QTFEQYFS 135 (135) T ss_pred HHHHHhcC Confidence 55554444 No 14 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=79.20 E-value=0.026 Score=29.25 Aligned_cols=111 Identities=11% Similarity=0.170 Sum_probs=50.1 Q ss_pred CcCCchHHHHHHHHHHHHHHH----HHhhhcccccccccee------ec--ccccceehhhhheeeeeecCCCcceeeee Q lcl|Aclame:pro 1 MRVGAPVELNRVIARRAVQYA----REDMRGRGWTSTGALQ------PY--SDTGAVGISSTMKHLLIQNKGFDPFVMWW 68 (151) Q Consensus 1 ~rvglP~~l~rvIs~~A~~~a----r~d~rgRGWrSagalq------p~--s~~G~VGirstmkhllyQn~G~~pflM~w 68 (151) ++ -+|.++.. ..++|++.+ +++.+..-=..+|+|. .. +.+|.| -++..|..|.+-|+.|+-- T Consensus 27 L~-~~~~~~~~-~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V--~~~~~YA~~VE~GT~~~~~-- 100 (149) T protein:vir:94 27 LD-KFDKKIEE-WVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVI--SVGADYAIYVEYGTGIYAT-- 100 (149) T ss_pred HH-HHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEE--ecCCCcccccccCcccccc-- Confidence 11 12333322 222333222 1222211111233322 11 234444 4678899999999987621 Q ss_pred ecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHHHH Q lcl|Aclame:pro 69 VEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDAAM 143 (151) Q Consensus 69 vEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~~~ 143 (151) .++ +.. 
.++-...+-.+.+ +| -+|||.+|+.||+-|+ .+++++|.+.++ T Consensus 101 -~~~--~~~--------------~~~~~~~~~~~~~-~~----~~~~g~~a~PFl~pA~----~~~~~~i~~~i~ 149 (149) T protein:vir:94 101 -GPG--GSR--------------ATKIPWSFKGDDG-EW----YTTYGQAPQPFWNPAI----DAGRKTFEQYFS 149 (149) T ss_pred -CCC--ccc--------------cccccceeecCcc-ce----ecCCCCCCCcchHHHH----HHHHHHHHHhhC Confidence 110 111 1111111111111 11 2389999999998655 557777777666 No 15 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=72.53 E-value=0.13 Score=25.41 Aligned_cols=138 Identities=21% Similarity=0.214 Sum_probs=62.1 Q ss_pred CcC---C----------chHHHHHHHHHHHH----HHH----HHhhhccccccccceeec----------ccccceehhh Q lcl|Aclame:pro 1 MRV---G----------APVELNRVIARRAV----QYA----REDMRGRGWTSTGALQPY----------SDTGAVGISS 49 (151) Q Consensus 1 ~rv---g----------lP~~l~rvIs~~A~----~~a----r~d~rgRGWrSagalqp~----------s~~G~VGirs 49 (151) |+| | +|..+...+ .+|+ ..+ +.+++---=-.+|+|.-. 
..+|.|+ + T Consensus 2 ~~v~i~Gld~L~~kl~~~~~~~~~~v-~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~--~ 78 (182) T protein:vir:10 2 IEVELKGVNELRAKLKKLPDIMAKAT-ANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWW--N 78 (182) T ss_pred eEEEEecHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEee--c Confidence 221 1 233222222 1122 111 222221111223333211 1122333 5 Q ss_pred hheeeeeecCCCcceeeeeecceee----eccCCcccceeeEEeeeCCCCce-----EeeCCCceecccccccCCCCchh Q lcl|Aclame:pro 50 TMKHLLIQNKGFDPFVMWWVEGRMV----PITDKQTGKTRRIRGREPGKPGY-----VYIPGRGKIWRDQKWRHPGLKPK 120 (151) Q Consensus 50 tmkhllyQn~G~~pflM~wvEGR~v----pItdk~~~~t~~irgrevGKPGY-----V~iPGrgr~WRdQKWRhPGL~Pk 120 (151) +..|-.|-+-||.|+.- ..+..+ .+.-.++.-.......|++.-++ ++++| -.| ++.||..|+ T Consensus 79 ~~~ya~yvE~GTG~~~~--~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~--~~~----~~t~G~~aq 150 (182) T protein:vir:10 79 SSMVAVFREFGTGLVGE--RSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKING--KYF----YRTTGQPAR 150 (182) T ss_pred CCCccceeecCcccccc--cCccccCccceeeeecCCceeeccccccccccccccceeeecC--ceE----eecCCCCCC Confidence 56788888888887641 111111 11111112212222334443333 33332 223 567999999 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHh---ccC Q lcl|Aclame:pro 121 RFMEAAIAKAIKESKRDIRDAAMTIL---SGG 149 (151) Q Consensus 121 RF~E~aIa~Aike~k~~IR~~~~ti~---~~~ 149 (151) -||.-|+.+..++..+.|..++..-+ .|| T Consensus 151 PFl~pA~~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 151 QFMTPAANKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred cchHHHHHHhHHHHHHHHHHHHHHHHHHhhcC Confidence 99999998888888777775554432 345 No 16 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=72.32 E-value=0.044 Score=28.05 Aligned_cols=112 Identities=16% Similarity=0.130 Sum_probs=47.9 Q ss_pred CcCC----------chHHHHHHHHHHHHHHH----HHhhhcccccccccee--------ecccccceehhhhheeeeeec Q lcl|Aclame:pro 1 
MRVG----------APVELNRVIARRAVQYA----REDMRGRGWTSTGALQ--------PYSDTGAVGISSTMKHLLIQN 58 (151) Q Consensus 1 ~rvg----------lP~~l~rvIs~~A~~~a----r~d~rgRGWrSagalq--------p~s~~G~VGirstmkhllyQn 58 (151) ...| +|.++.+. .+.|++.+ +++.+-.-=..+|+|. ..+.++.|| ++..|-.|.+ T Consensus 4 ~~~G~~~l~~~L~~~~~~~~~~-~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~--~~~~YA~~vE 80 (137) T protein:vir:94 4 VKYGNWDLVKELENYERDIERW-VKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVIN--IGSEYAIYVN 80 (137) T ss_pred hHHhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEe--cCCCcccccc Confidence 0111 13333221 22222222 1222211111233332 223344555 6789999999 Q ss_pred CCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHH Q lcl|Aclame:pro 59 KGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDI 138 (151) Q Consensus 59 ~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~I 138 (151) -|+.|+. +.++. ...+. .+.+...|.+... +++|..|+.||+-|+. ++++.| T Consensus 81 ~GT~~~~----------~~~~~------~~~~~--~~~~~~~~~~~~~------~t~g~~a~PFl~pA~~----~~~~~~ 132 (137) T protein:vir:94 81 YGTGIYA----------TGAGG------SRAKK--IPWSYKDANGKWH------TTKGQHAQPFWEPAID----AGRVFF 132 (137) T ss_pred cCccccc----------cCCCc------ccccc--cccceeccCCcee------ecCCcCCCcchHHHHH----HHHHHH Confidence 9998862 11110 00010 1112222222222 2679999999997754 444443 Q ss_pred HHHHHHHhc Q lcl|Aclame:pro 139 RDAAMTILS 147 (151) Q Consensus 139 R~~~~ti~~ 147 (151) +.+|| T Consensus 133 ----~~~l~ 137 (137) T protein:vir:94 133 ----NKYFS 137 (137) T ss_pred ----HHhhC Confidence 44455 No 17 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=59.47 E-value=0.18 Score=24.66 Aligned_cols=114 Identities=16% Similarity=0.180 Sum_probs=57.0 Q ss_pred 
CcCC---------------chHHHHHHHHHHHHHHHHHhh----hccccccccceee--------cccccceehhhhhee Q lcl|Aclame:pro 1 MRVG---------------APVELNRVIARRAVQYAREDM----RGRGWTSTGALQP--------YSDTGAVGISSTMKH 53 (151) Q Consensus 1 ~rvg---------------lP~~l~rvIs~~A~~~ar~d~----rgRGWrSagalqp--------~s~~G~VGirstmkh 53 (151) |-+. +|.++...|. .|++.+.+++ +..-=..+|+|.- -+.++.| -++..| T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~-~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V--~~~~~Y 80 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVE-QVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEI--TVGAEY 80 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEE--ecCCCc Confidence 1111 2444433332 3444433333 2222112444322 1223444 456789 Q ss_pred eeeecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHh Q lcl|Aclame:pro 54 LLIQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKE 133 (151) Q Consensus 54 llyQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike 133 (151) ..|.+-|+.|+. +..+. .++ |..-+.|+.++ -++++|..|+-||+-|+. 
+ T Consensus 81 A~~vE~GT~~~~----------~~~~~-~~~----------~~~~~~~~~g~-----~~~t~g~~a~Pfl~pA~~----~ 130 (144) T protein:vir:59 81 AIYVEYGTGIYA----------VDGNG-RKT----------PWTYYSPKLGR-----YVRTQGAPAQPFFWPAVE----E 130 (144) T ss_pred cchhhcCccccc----------cCCCc-ccc----------ccccccccccc-----eecCCCCCCCcchhHHHH----H Confidence 999999998862 11111 000 11111222222 225889999999987654 5 Q ss_pred hhHHHHHHHHHHhc Q lcl|Aclame:pro 134 SKRDIRDAAMTILS 147 (151) Q Consensus 134 ~k~~IR~~~~ti~~ 147 (151) +++.|.+.+.-++- T Consensus 131 ~~~~~~~~i~~~~g 144 (144) T protein:vir:59 131 GGEYFEREMRRLRG 144 (144) T ss_pred HHHHHHHHHHHhcC Confidence 77888877777775 No 18 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=57.00 E-value=0.2 Score=24.42 Aligned_cols=107 Identities=18% Similarity=0.173 Sum_probs=48.0 Q ss_pred CcCCchHHHHHHHHHHHHHHHHHhhhcccccccccee--------ecccccceehhhhheeeeeecCCCcceeeeeecce Q lcl|Aclame:pro 1 MRVGAPVELNRVIARRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMKHLLIQNKGFDPFVMWWVEGR 72 (151) Q Consensus 1 ~rvglP~~l~rvIs~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmkhllyQn~G~~pflM~wvEGR 72 (151) |.-.+-..+. ..+.+....|++..- ..+|+|. .-+.+|.|| ++..|-.|.+-|+.|+.- . 
T Consensus 1 v~~~v~~~~~-~~~~~i~~~ak~~ap----v~TG~Lr~SI~~~~~~~~~~~~V~--~~~~Ya~yvE~GTg~~~~---~-- 68 (116) T protein:vir:95 1 MERWVKRGIA-KTTAKIHNTIISLMP----VDTGYLRESVTMDFKDGGFTGVIN--IGSEYAIYVNYGTGIYAT---G-- 68 (116) T ss_pred ChHHHHHHHH-HHHHHHHHHHHhhCC----ccccccccceeEEeecCcEEEEEe--cCCCccceeecCcccccc---C-- Confidence 1111111010 111122223333222 1233332 112234444 677888888888887642 1 Q ss_pred eeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHHHhhhHHHHHHHHHHhc Q lcl|Aclame:pro 73 MVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAIKESKRDIRDAAMTILS 147 (151) Q Consensus 73 ~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Aike~k~~IR~~~~ti~~ 147 (151) .+ ++...+..+.+-+.. .+| +|+|.+|+.||+-|+ .++++. ++.+|| T Consensus 69 -----~~---------~~~~~~~~~~~~~~~------g~~~~t~g~~a~Pfl~pA~----~~~~~~----i~k~is 116 (116) T protein:vir:95 69 -----AG---------GSRAKNIPWSYKDAN------GKWHTTKGQHAQPFWEPAI----DAGRAF----FNKYFS 116 (116) T ss_pred -----CC---------ccccccccceeecCc------cceeeCCCCCCCcchHHHH----HHHHHH----HHHhhC Confidence 11 111122223332222 233 589999999998764 444444 445555 No 19 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=52.02 E-value=0.23 Score=24.12 Aligned_cols=107 Identities=18% Similarity=0.220 Sum_probs=47.2 Q ss_pred CcCCchHHHHHHH-------HHHHHHHHHHhhhcccccccccee--------ecccccceehhhhheeeeeecCCCccee Q lcl|Aclame:pro 1 MRVGAPVELNRVI-------ARRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMKHLLIQNKGFDPFV 65 (151) Q Consensus 1 ~rvglP~~l~rvI-------s~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmkhllyQn~G~~pfl 65 (151) ++ -++.++.+++ +......|++..-- .+|+|. .-+.+|.|| ++..|..|.+-|+.|+. 
T Consensus 15 l~-~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv----~TG~L~~Si~~~~~~~~~~~~V~--~~~~YA~~vE~GT~~~~ 87 (137) T protein:vir:95 15 LE-NYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDGGFTGVIN--IGSEYAIYVNYGTGIYA 87 (137) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cchhhhcCeeeEeeCCceEEEEe--cCCCcccccccCccccc Confidence 11 1233333222 11222223332221 233322 223345555 67889999999998762 Q ss_pred eeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|Aclame:pro 66 MWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAIKESKRDIRDAAMT 144 (151) Q Consensus 66 M~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Aike~k~~IR~~~~t 144 (151) .+| +. +..-+.++++..+ +.+| +++|..|+.||+-|+. ++++.|.. T Consensus 88 ---~~~-------~~---------~~~~~~~~~~~~~------~~~~~~t~g~~a~PFl~pA~~----~~~~~i~k---- 134 (137) T protein:vir:95 88 ---TGA-------GG---------SRAKKIPWSYKDA------NGKWHTTKGQHAQPFWEPAID----AGRAFFNK---- 134 (137) T ss_pred ---cCC-------Cc---------ccccccccceecc------CcceeecCCCCCCcchHHHHH----HHHHHHHH---- Confidence 111 10 0000111111111 1222 3679999999997755 44554443 Q ss_pred Hhc Q lcl|Aclame:pro 145 ILS 147 (151) Q Consensus 145 i~~ 147 (151) +|| T Consensus 135 ~l~ 137 (137) T protein:vir:95 135 YFS 137 (137) T ss_pred hhC Confidence 344 No 20 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=48.80 E-value=0.54 Score=22.04 Aligned_cols=104 Identities=20% Similarity=0.181 Sum_probs=48.2 Q ss_pred chHHHHHHHH---HHHHHHHHHhhhcccccccccee--------ecccccceehhhhheeeeeecCCCcceeeeeeccee Q lcl|Aclame:pro 5 APVELNRVIA---RRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMKHLLIQNKGFDPFVMWWVEGRM 73 (151) Q Consensus 5 lP~~l~rvIs---~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmkhllyQn~G~~pflM~wvEGR~ 73 (151) +-..+-+.+. .+....|++..- ..+|+|. .-+.+|.|| ++..|-.|.+-|+.|+. 
..|.- T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP----v~TG~Lr~SI~~~~~~~~~~~~V~--~~~~YA~yvE~GTg~~~---~~~~~ 71 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMP----VDTGYLRESVTMDFKDGGFTGVIN--IGSEYAIYVNYGTGIYA---TGAGG 71 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC----cCcccccccceEEeecCcEEEEEe--cCCCcccccccCCcccc---cCCCc Confidence 2222222221 122233333222 1233332 112234444 67889999999998874 11210 Q ss_pred eeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHHHhhhHHHHHHHHHHhc Q lcl|Aclame:pro 74 VPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAIKESKRDIRDAAMTILS 147 (151) Q Consensus 74 vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Aike~k~~IR~~~~ti~~ 147 (151) +.. -|.+.++.... ++| +|+|.+|+.||.-|+ .++++.| +.+|| T Consensus 72 --~~~--------------~~~~~~~~~~~------g~~~~t~g~~a~Pfl~pA~----~~~~~~i----~k~i~ 116 (116) T protein:vir:12 72 --SRA--------------KKIPWSYKDAN------GKWHTTKGQHAQPFWEPAI----DAGRAFF----NKYFS 116 (116) T ss_pred --ccc--------------cccceeeecCC------ceeeecCCcCCCcchHHHH----HHHHHHH----HHhhC Confidence 001 11222222222 233 488999999998764 4455544 44455 No 21 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=48.80 E-value=0.54 Score=22.04 Aligned_cols=104 Identities=20% Similarity=0.181 Sum_probs=48.2 Q ss_pred chHHHHHHHH---HHHHHHHHHhhhcccccccccee--------ecccccceehhhhheeeeeecCCCcceeeeeeccee Q lcl|Aclame:pro 5 APVELNRVIA---RRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMKHLLIQNKGFDPFVMWWVEGRM 73 (151) Q Consensus 5 lP~~l~rvIs---~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmkhllyQn~G~~pflM~wvEGR~ 73 (151) +-..+-+.+. .+....|++..- ..+|+|. .-+.+|.|| ++..|-.|.+-|+.|+. 
..|.- T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP----v~TG~Lr~SI~~~~~~~~~~~~V~--~~~~YA~yvE~GTg~~~---~~~~~ 71 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMP----VDTGYLRESVTMDFKDGGFTGVIN--IGSEYAIYVNYGTGIYA---TGAGG 71 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC----cCcccccccceEEeecCcEEEEEe--cCCCcccccccCCcccc---cCCCc Confidence 2222222221 122233333222 1233332 112234444 67889999999998874 11210 Q ss_pred eeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHHHhhhHHHHHHHHHHhc Q lcl|Aclame:pro 74 VPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAIKESKRDIRDAAMTILS 147 (151) Q Consensus 74 vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Aike~k~~IR~~~~ti~~ 147 (151) +.. -|.+.++.... ++| +|+|.+|+.||.-|+ .++++.| +.+|| T Consensus 72 --~~~--------------~~~~~~~~~~~------g~~~~t~g~~a~Pfl~pA~----~~~~~~i----~k~i~ 116 (116) T protein:vir:97 72 --SRA--------------KKIPWSYKDAN------GKWHTTKGQHAQPFWEPAI----DAGRAFF----NKYFS 116 (116) T ss_pred --ccc--------------cccceeeecCC------ceeeecCCcCCCcchHHHH----HHHHHHH----HHhhC Confidence 001 11222222222 233 488999999998764 4455544 44455 No 22 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=42.58 E-value=0.55 Score=22.02 Aligned_cols=108 Identities=19% Similarity=0.221 Sum_probs=45.4 Q ss_pred Cc---CC----------chHHHHHHH-------HHHHHHHHHHhhhcccccccccee--------ecccccceehhhhhe Q lcl|Aclame:pro 1 MR---VG----------APVELNRVI-------ARRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMK 52 (151) Q Consensus 1 ~r---vg----------lP~~l~rvI-------s~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmk 52 (151) |= .| ++.++.+.+ +......|++..-- .+|+|. ..+.+|.| -++.. 
T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V--~~~~~ 74 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVI--NIGSE 74 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cccchhccceeEeecCceEEEE--ecCCC Confidence 10 01 233333222 11222223333221 223321 11223334 46778 Q ss_pred eeeeecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHH Q lcl|Aclame:pro 53 HLLIQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAI 131 (151) Q Consensus 53 hllyQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Ai 131 (151) |-.|.+-|+.|+. .+| +. ...+. -+.+...+ +.+| +++|..|+.||.-|+. T Consensus 75 YA~~vE~GT~~~~---~~~-------~~------~~~~~--~~~~~~~~-------~~~~~~t~g~~a~PFl~pA~~--- 126 (137) T protein:vir:93 75 YAIYVNYGTGIYA---TGA-------GG------SRAKK--IPWSYKDA-------NGKWHTTKGQHAQPFWEPAID--- 126 (137) T ss_pred cccccccCccccc---cCC-------Cc------ccccc--cccceecc-------CcceeecCCCCCCcchHHHHH--- Confidence 9999999988761 111 10 00000 01111122 2223 2479999999987654 Q ss_pred HhhhHHHHHHHHHHhc Q lcl|Aclame:pro 132 KESKRDIRDAAMTILS 147 (151) Q Consensus 132 ke~k~~IR~~~~ti~~ 147 (151) ++++.|. 
.+|+ T Consensus 127 -~~~~~~~----~~l~ 137 (137) T protein:vir:93 127 -AGRAFFN----KYFS 137 (137) T ss_pred -HHHHHHH----HhhC Confidence 4444443 3344 No 23 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=42.58 E-value=0.55 Score=22.02 Aligned_cols=108 Identities=19% Similarity=0.221 Sum_probs=45.4 Q ss_pred Cc---CC----------chHHHHHHH-------HHHHHHHHHHhhhcccccccccee--------ecccccceehhhhhe Q lcl|Aclame:pro 1 MR---VG----------APVELNRVI-------ARRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMK 52 (151) Q Consensus 1 ~r---vg----------lP~~l~rvI-------s~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmk 52 (151) |= .| ++.++.+.+ +......|++..-- .+|+|. ..+.+|.| -++.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V--~~~~~ 74 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVI--NIGSE 74 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cccchhccceeEeecCceEEEE--ecCCC Confidence 10 01 233333222 11222223333221 223321 11223334 46778 Q ss_pred eeeeecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHH Q lcl|Aclame:pro 53 HLLIQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAI 131 (151) Q Consensus 53 hllyQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Ai 131 (151) |-.|.+-|+.|+. .+| +. ...+. -+.+...+ +.+| +++|..|+.||.-|+. 
T Consensus 75 YA~~vE~GT~~~~---~~~-------~~------~~~~~--~~~~~~~~-------~~~~~~t~g~~a~PFl~pA~~--- 126 (137) T protein:vir:97 75 YAIYVNYGTGIYA---TGA-------GG------SRAKK--IPWSYKDA-------NGKWHTTKGQHAQPFWEPAID--- 126 (137) T ss_pred cccccccCccccc---cCC-------Cc------ccccc--cccceecc-------CcceeecCCCCCCcchHHHHH--- Confidence 9999999988761 111 10 00000 01111122 2223 2479999999987654 Q ss_pred HhhhHHHHHHHHHHhc Q lcl|Aclame:pro 132 KESKRDIRDAAMTILS 147 (151) Q Consensus 132 ke~k~~IR~~~~ti~~ 147 (151) ++++.|. .+|+ T Consensus 127 -~~~~~~~----~~l~ 137 (137) T protein:vir:97 127 -AGRAFFN----KYFS 137 (137) T ss_pred -HHHHHHH----HhhC Confidence 4444443 3344 No 24 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=42.58 E-value=0.55 Score=22.02 Aligned_cols=108 Identities=19% Similarity=0.221 Sum_probs=45.4 Q ss_pred Cc---CC----------chHHHHHHH-------HHHHHHHHHHhhhcccccccccee--------ecccccceehhhhhe Q lcl|Aclame:pro 1 MR---VG----------APVELNRVI-------ARRAVQYAREDMRGRGWTSTGALQ--------PYSDTGAVGISSTMK 52 (151) Q Consensus 1 ~r---vg----------lP~~l~rvI-------s~~A~~~ar~d~rgRGWrSagalq--------p~s~~G~VGirstmk 52 (151) |= .| ++.++.+.+ +......|++..-- .+|+|. ..+.+|.| -++.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V--~~~~~ 74 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVI--NIGSE 74 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cccchhccceeEeecCceEEEE--ecCCC Confidence 10 01 233333222 11222223333221 223321 11223334 46778 Q ss_pred eeeeecCCCcceeeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccc-cCCCCchhHHHHHHHHHHH Q lcl|Aclame:pro 53 HLLIQNKGFDPFVMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKW-RHPGLKPKRFMEAAIAKAI 131 (151) Q Consensus 53 hllyQn~G~~pflM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKW-RhPGL~PkRF~E~aIa~Ai 131 (151) |-.|.+-|+.|+. 
.+| +. ...+. -+.+...+ +.+| +++|..|+.||.-|+. T Consensus 75 YA~~vE~GT~~~~---~~~-------~~------~~~~~--~~~~~~~~-------~~~~~~t~g~~a~PFl~pA~~--- 126 (137) T protein:vir:94 75 YAIYVNYGTGIYA---TGA-------GG------SRAKK--IPWSYKDA-------NGKWHTTKGQHAQPFWEPAID--- 126 (137) T ss_pred cccccccCccccc---cCC-------Cc------ccccc--cccceecc-------CcceeecCCCCCCcchHHHHH--- Confidence 9999999988761 111 10 00000 01111122 2223 2479999999987654 Q ss_pred HhhhHHHHHHHHHHhc Q lcl|Aclame:pro 132 KESKRDIRDAAMTILS 147 (151) Q Consensus 132 ke~k~~IR~~~~ti~~ 147 (151) ++++.|. .+|+ T Consensus 127 -~~~~~~~----~~l~ 137 (137) T protein:vir:94 127 -AGRAFFN----KYFS 137 (137) T ss_pred -HHHHHHH----HhhC Confidence 4444443 3344 No 25 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=35.80 E-value=1.2 Score=20.13 Aligned_cols=115 Identities=13% Similarity=0.166 Sum_probs=45.0 Q ss_pred CcC--CchHHHHHHHHHHHHHHH----HHhhh----------ccccccccceeecccccceehhhhheeeeeecCCCcce Q lcl|Aclame:pro 1 MRV--GAPVELNRVIARRAVQYA----REDMR----------GRGWTSTGALQPYSDTGAVGISSTMKHLLIQNKGFDPF 64 (151) Q Consensus 1 ~rv--glP~~l~rvIs~~A~~~a----r~d~r----------gRGWrSagalqp~s~~G~VGirstmkhllyQn~G~~pf 64 (151) +++ -+-...-+.+.+.|+..+ ++... 
-|-|.+ .+...+.++.|| ++..|-.|-.-|+-+| T Consensus 11 ~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~--~v~~~g~~~~V~--~~~~YA~yVE~GTG~~ 86 (141) T protein:vir:78 11 PKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGY--KVRKSSKEVIVG--NSSDYAIYYEFGTGEK 86 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceee--eeecCCcEEEEe--cCCCccceeecCCccc Confidence 111 111111111111122111 11100 011110 011122233332 4566666666665443 Q ss_pred eeeeecceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|Aclame:pro 65 VMWWVEGRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAIKESKRDIRDAAMT 144 (151) Q Consensus 65 lM~wvEGR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Aike~k~~IR~~~~t 144 (151) - .+| + | .|.++.|-...|+ |. ++.|.+|+.||.-|+... +++|..-+.. T Consensus 87 ~---~~~-------~---------g---rk~~w~y~~~~g~-~~----~t~G~~aqpFl~~A~~~~----~~~i~~~i~~ 135 (141) T protein:vir:78 87 S---ERG-------G---------G---KAGGWFYMDKKGH-WH----FTRGSQASKRMRYTFRDE----QDKVRVFTER 135 (141) T ss_pred c---cCC-------C---------C---CcCcceeecCCCe-eE----eccCCCCchhhhhhHHhh----HHHHHHHHHH Confidence 1 111 0 0 2344444333332 42 467999999998766554 5555444444 Q ss_pred HhccCC Q lcl|Aclame:pro 145 ILSGGR 150 (151) Q Consensus 145 i~~~~~ 150 (151) .|.|=- T Consensus 136 ~~~~l~ 141 (141) T protein:vir:78 136 ALRGIN 141 (141) T ss_pred HhhccC Confidence 443322 No 26 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=35.17 E-value=1 Score=20.58 Aligned_cols=136 Identities=14% Similarity=0.077 Sum_probs=60.7 Q ss_pred CcCCchHHHHHHHHHHHHHHH----HHhhhcccccccccee------ecccccc--eehhhhheeeeeecCCCccee--- Q lcl|Aclame:pro 1 MRVGAPVELNRVIARRAVQYA----REDMRGRGWTSTGALQ------PYSDTGA--VGISSTMKHLLIQNKGFDPFV--- 65 (151) Q Consensus 1 
~rvglP~~l~rvIs~~A~~~a----r~d~rgRGWrSagalq------p~s~~G~--VGirstmkhllyQn~G~~pfl--- 65 (151) ++- +|..+. .+.+.|+..+ +++++-+==.++|.|. .....|. +++.+...|..|..-||...- T Consensus 13 L~~-l~~~~~-~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvEfGT~~m~a~P 90 (173) T protein:vir:10 13 LRK-IGKDID-KNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYMEFGTGAKVSVP 90 (173) T ss_pred HHH-HHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhhhcccccccCCC Confidence 221 344443 3445554443 3334333223344443 2223333 345677788888888886321 Q ss_pred ----eeeec--ceeeeccCCcccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHH----Hhhh Q lcl|Aclame:pro 66 ----MWWVE--GRMVPITDKQTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAI----KESK 135 (151) Q Consensus 66 ----M~wvE--GR~vpItdk~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Ai----ke~k 135 (151) ....+ |+.-|-. .+........-++.|.+=..+.+ +. -.-+|||-.||=||--|+.+.= +.-+ T Consensus 91 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~~~~~---~~---~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~ 163 (173) T protein:vir:10 91 KEFADMAASFKGQKTGSF-KDGLESIKAWCRAKGIDEKAAYP---IF---AKILGAGINPQPFLYPAWIEGKKQYLKDLE 163 (173) T ss_pred chhhhhhccccccccccc-ccccccccccccccccchhcccc---ee---eEeecCCCCCCccchhHHHHhHHHHHHHHH Confidence 11111 1111111 11111122222334433211111 11 2467999999999966655443 3334 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 136 RDIRDAAMTI 145 (151) Q Consensus 136 ~~IR~~~~ti 145 (151) ..|+.++-.| T Consensus 164 ~~i~~~lrk~ 173 (173) T protein:vir:10 164 NLLKTYNKKI 173 (173) T ss_pred HHHHHHhhcC Confidence 4445555555 No 27 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=34.14 E-value=0.42 Score=22.68 Aligned_cols=116 Identities=20% Similarity=0.314 Sum_probs=57.0 Q ss_pred CcCCchHHHHHHHHHHHHHHHHHhhh--ccccccccc-------------eeecccccceehhhhheeeeeecCCCccee Q lcl|Aclame:pro 1 
MRVGAPVELNRVIARRAVQYAREDMR--GRGWTSTGA-------------LQPYSDTGAVGISSTMKHLLIQNKGFDPFV 65 (151) Q Consensus 1 ~rvglP~~l~rvIs~~A~~~ar~d~r--gRGWrSaga-------------lqp~s~~G~VGirstmkhllyQn~G~~pfl 65 (151) -++.-+..+.+.|+.....+.+++.. |..|..-.. -.+-.++|. |+.++ T Consensus 23 ~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG~--L~~Si-------------- 86 (155) T protein:vir:79 23 RSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTNA--LARSV-------------- 86 (155) T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccchh--hhhhh-------------- Confidence 22334677778888888888887764 556632100 011122221 22221 Q ss_pred eeeecceeeeccCC-cccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHH-HhhhHHHHHHHH Q lcl|Aclame:pro 66 MWWVEGRMVPITDK-QTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAI-KESKRDIRDAAM 143 (151) Q Consensus 66 M~wvEGR~vpItdk-~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Ai-ke~k~~IR~~~~ 143 (151) -+-+++-.|-|--+ .-...|-. |-+.|+++-|+||-| .||=.+-.+-+ .|..+.|.+.+. T Consensus 87 ~~~~~~~~v~vGt~~~YA~iHqf-Gg~~~~~~~v~iPaR-----------------pfLG~s~~~~l~~~~~~~I~~~i~ 148 (155) T protein:vir:79 87 TTWADRNEAGIGSNLVYAAIHQF-GGDAGRGHQVEIPAR-----------------RYLPFDENGQLAAGARQSILEVVL 148 (155) T ss_pred hceecCCEEEEecCchhhhhhhc-ccccCCCCccccCCc-----------------cccCCCCccccchHHHHHHHHHHH Confidence 12222222222111 00111111 223466777888754 35533332222 466778888899 Q ss_pred HHhccCC Q lcl|Aclame:pro 144 TILSGGR 150 (151) Q Consensus 144 ti~~~~~ 150 (151) ..|+-|| T Consensus 149 ~~l~r~r 155 (155) T protein:vir:79 149 TALSRNR 155 (155) T ss_pred HHHHhcC Confidence 9999999 No 28 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=21.52 E-value=1.2 Score=20.05 Aligned_cols=124 Identities=21% Similarity=0.332 Sum_probs=55.8 Q ss_pred 
CcCCchHHHHHHHHHHHHHHHHHhhh--ccccccccceeecc-----cccceehhhhheeeeeecCCCcceeeeeeccee Q lcl|Aclame:pro 1 MRVGAPVELNRVIARRAVQYAREDMR--GRGWTSTGALQPYS-----DTGAVGISSTMKHLLIQNKGFDPFVMWWVEGRM 73 (151) Q Consensus 1 ~rvglP~~l~rvIs~~A~~~ar~d~r--gRGWrSagalqp~s-----~~G~VGirstmkhllyQn~G~~pflM~wvEGR~ 73 (151) -.+.-+..+.+.|+....++.+++.. |..|.. +.|.. ..|--+ .-+|..+.-..-.+-+-+++-. T Consensus 23 ~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~p---ls~~t~~~r~~~g~~~-----~~iL~~tg~L~~Si~~~~~~~~ 94 (155) T protein:vir:99 23 RSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQ---LSPVTVAAREAKGRGP-----HPILQVTNALARSVTTWADRNE 94 (155) T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCC---CChHHHHHHhccCCCC-----CCcchhchhhhhhhhceecCCE Confidence 12223667888888888888888764 555642 11100 000000 0011111111111122223222 Q ss_pred eeccCC-cccceeeEEeeeCCCCceEeeCCCceecccccccCCCCchhHHHHHHHHHHH-HhhhHHHHHHHHHHhccCC Q lcl|Aclame:pro 74 VPITDK-QTGKTRRIRGREPGKPGYVYIPGRGKIWRDQKWRHPGLKPKRFMEAAIAKAI-KESKRDIRDAAMTILSGGR 150 (151) Q Consensus 74 vpItdk-~~~~t~~irgrevGKPGYV~iPGrgr~WRdQKWRhPGL~PkRF~E~aIa~Ai-ke~k~~IR~~~~ti~~~~~ 150 (151) |-|.-+ .-...|-. |-..|.++-|+||-| .||=.+-.+.+ .|.++.|.+.+..-|+-+| T Consensus 95 v~vGtn~~YA~iHqf-Gg~~~~~~~v~iPaR-----------------pfLG~s~~~~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 95 AGIGSNLVYAAIHQF-GGDAGRGHQVEIPAR-----------------RYLPFDENGQLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred EEEecCccchhhhhc-ccccCCCCccccCCc-----------------cccCCCCccccchHHHHHHHHHHHHHHhccC Confidence 222111 11111111 222344555666653 35533333222 4667788888999999999 Done!