Query lcl|NC_020878.1_cdsid_YP_007677709.1 [gene=PRQG_00045] [protein=tail tube A] [protein_id=YP_007677709.1] [location=complement(38139..38795)] Match_columns 218 No_of_seqs 52 out of 57 Neff 4.6 Searched_HMMs 1612 Date Thu Nov 7 17:45:09 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_41 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_41_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78741 Length: 197 100.0 7.2E-96 4.5E-99 542.2 21.3 196 1-218 1-196 (197) 2 protein:vir:100027 Length: 204 100.0 1.3E-94 7.9E-98 535.3 21.8 202 1-218 2-204 (204) 3 protein:vir:1542 Length: 196 # 100.0 9.2E-87 5.7E-90 492.3 20.8 191 1-210 4-196 (196) 4 protein:vir:3365 Length: 196 # 100.0 1.1E-86 7.1E-90 491.7 20.8 191 1-210 4-196 (196) 5 protein:vir:8886 Length: 195 # 100.0 2.6E-86 1.6E-89 489.8 21.3 191 1-211 4-195 (195) 6 protein:vir:94565 Length: 196 100.0 5.2E-86 3.2E-89 488.1 20.9 191 1-218 4-196 (196) 7 protein:vir:94712 Length: 188 100.0 5.2E-86 3.2E-89 488.1 20.6 185 1-205 1-188 (188) 8 protein:vir:80215 Length: 211 100.0 2.6E-85 1.6E-88 484.3 20.6 192 6-218 1-202 (211) 9 protein:vir:99676 Length: 197 100.0 3.8E-85 2.4E-88 483.4 20.5 190 1-210 6-197 (197) 10 protein:vir:10451 Length: 196 100.0 4.2E-85 2.6E-88 483.1 20.5 191 1-210 4-196 (196) 11 protein:vir:2202 Length: 196 # 100.0 1.1E-84 6.6E-88 481.0 20.2 191 1-210 4-196 (196) 12 protein:vir:105646 Length: 245 100.0 7.3E-85 4.6E-88 481.8 18.9 205 1-218 14-239 (245) 13 protein:vir:97033 Length: 245 100.0 9.5E-85 5.9E-88 481.2 18.9 205 1-218 14-239 (245) 14 protein:vir:78929 Length: 184 100.0 4.3E-84 2.7E-87 477.6 18.9 184 8-218 1-184 (184) 15 protein:vir:7020 Length: 246 # 100.0 1.4E-83 8.6E-87 474.8 19.6 202 1-218 15-246 (246) 16 protein:vir:6325 Length: 184 # 100.0 1.2E-83 7.7E-87 475.1 18.5 184 8-213 1-184 (184) 17 protein:vir:103305 Length: 245 100.0 1.7E-83 1.1E-86 474.3 18.8 202 1-218 13-241 (245) 18 protein:vir:1780 Length: 67 # 100.0 7.4E-34 4.6E-37 202.2 5.9 67 1-84 1-67 (67) 19 protein:vir:103760 Length: 207 99.1 6.3E-12 3.9E-15 82.0 15.0 189 1-218 1-206 (207) 20 protein:vir:95323 Length: 201 99.0 2.2E-11 1.4E-14 79.0 12.6 179 1-209 1-201 (201) 21 protein:vir:7328 Length: 201 # 98.8 2E-10 1.2E-13 73.7 13.8 180 1-210 1-201 (201) 22 protein:vir:107429 Length: 223 98.3 6E-08 3.7E-11 60.2 13.4 197 1-218 1-222 (223) 23 protein:vir:107803 Length: 223 98.3 6E-08 3.7E-11 60.2 13.4 197 1-218 1-222 (223) 24 protein:vir:98502 Length: 223 98.3 6E-08 3.7E-11 60.2 13.4 197 1-218 1-222 (223) 25 protein:vir:94601 Length: 211 97.3 3.9E-06 2.4E-09 50.3 8.4 176 1-203 1-211 (211) 26 protein:vir:80185 Length: 221 93.6 0.002 1.3E-06 35.3 8.7 179 1-218 1-221 (221) No 1 >protein:vir:78741 Length: 197 # NCBI annotation: tail tube A # Family: family:all:824 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285449;genbank:gi:148724483;genbank:GeneID:5220212 Probab=100.00 E-value=7.2e-96 Score=542.18 Aligned_cols=196 Identities=35% Similarity=0.586 Sum_probs=192.2 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) |++++ |||||||+||++|||+||+|||+ +||+||+|+++|++++++||+||||||||++++ T Consensus 1 m~~~~---teLdAVN~~L~aIGEspV~sld~----------------~npdva~a~~iL~~v~~~vqseGW~FNte~~~~ 61 (197) T protein:vir:78 1 MASKL---TKLGAVNIVLTNIGMAPVTLIDS----------------NNPMVATAQTILDEVSGSVQSEGWSYNTERAYP 61 (197) T ss_pred Cccch---hHHHHHHHHHHhhCCcccceeeC----------------CCccHHHHHHHHHHHHHHHhhCCceEeecCCce Confidence 99999 99999999999999999999974 899999999999999999999999999999999 Q ss_pred EeecCCCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDANGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASVR 160 (218) Q Consensus 81 l~Pd~~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~~ 160 (218) |+||++|+|++|+|+|+|+++++ ++++||+|||||||+.||||+|++|++++||+++||||||++||+||++|||++ T Consensus 62 l~pd~~g~I~~P~n~L~vd~~~~---~~~~~v~Rgg~LYD~~n~T~~F~~pi~~~iv~~~~FedlPe~~~~yI~~rAa~~ 138 (197) T protein:vir:78 62 FIKDNTGRIAIPSNVLSLDCAST---SKYDLIIRGGFLYDKAGHTDVFTENLELDVVWCFEFDDLPEAVKNYITIRAANL 138 (197) T ss_pred ecCCCCCeEecCccceEEecCCC---ceeeEEEeCCeEEeccCCcEEeCCceEEEEEeecChhhhhHHHHHHHHHHHHHH Confidence 99999999999999999999874 468899999999999999999999999999999999999999999999999999 Q ss_pred HHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceecccCchhhhcC Q lcl|NC_020878. 161 AATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSYQPYKALIR 218 (218) Q Consensus 161 f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~rP~~~l~r 218 (218) |+.+++|+++++|+++++|++|+++|++|||+||+||||++|+++.|++|||||+|+| T Consensus 139 f~~~~~G~~~~~q~l~~~e~~a~~~~~~~e~~q~~~Nml~~~~~~~~~~yrp~~~l~r 196 (197) T protein:vir:78 139 FAGRAVGSAEAVKYSQREEAAARAAIIEYETQQGDYNMLESESGRDIYTYRPFDAVYR 196 (197) T ss_pred HHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCcCcccCccccCcCcccchhhhhc Confidence 9999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:100027 Length: 204 # NCBI annotation: T7-like tail tubular protein A # Family: family:all:824 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214207;genbank:gi:61806430;genbank:GeneID:3294707 Probab=100.00 E-value=1.3e-94 Score=535.34 Aligned_cols=202 Identities=61% Similarity=1.002 Sum_probs=198.1 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) -+|+|+++|||||||+||++|||+||+|||+ +||+|++|+++|++++++|||||||||||++++ T Consensus 2 ~~~~~~~~teL~AVN~~L~aIGespV~sld~----------------~npdva~a~~iL~~v~~~vqs~GW~FNte~~~~ 65 (204) T protein:vir:10 2 ATTTIQPDTELSAVNSILGSIGQSPLTTLNY----------------NNPETAFVYNLLVEANKDVQGEGWHFNTEDHVL 65 (204) T ss_pred ceecccccchhHHHHHHHHhhCccccccccC----------------CCccHHHHHHHHHHHHHHHhcCCceeeccCCee Confidence 4678999999999999999999999999974 899999999999999999999999999999999 Q ss_pred EeecC-CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDA-NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~-~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||+ +|+|++|+|+|+|+++++.++++.++|+|||||||+.|||++|++|++|+||+++||||||++||+||++|||+ T Consensus 66 ltPd~~~g~I~~P~n~L~v~~~~~~~d~~~~~v~Rgg~LYD~~~~t~~f~~~i~v~iv~~~~FeelPe~~~~~I~~rAa~ 145 (204) T protein:vir:10 66 VTPDATTKYINVPSNYLRYDLHSGHVDKSMDLVKRNGRLYDKVGHTDQFDDDLYLDIVTLYPFEDVPPIFQRYIISKAAV 145 (204) T ss_pred eeeeCCCCeEEcCcceeeeeecCCcccccceeEEeCCeEEecccCceeecCcceEEEEeecChhhccHHHHHHHHHHHHH Confidence 99996 79999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceecccCchhhhcC Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSYQPYKALIR 218 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~rP~~~l~r 218 (218) +|+.+++|+++++|+|+++|++|++.|++||++||+||||++|+++.|+||+|||+|+| T Consensus 146 ~f~~~~~g~~~~~q~l~~~e~~ar~~~~~~e~~q~~~N~~~~~~~~~~~~~~p~~~l~r 204 (204) T protein:vir:10 146 RAATQLVANRELVALLQVQEQSARANVLEYECNQGDHSFMGWPHESSYRPYQPYKALQR 204 (204) T ss_pred HHHhhcCCchhHHHHHHHHHHHHHHHHHHHhHhhcCcccccCCCCCCccCcCchhhhcC Confidence 99999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:1542 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052110;swissprot:trembl:q9t106;genbank:gi:9634036;uniprot:Q9T106;genbank:GeneID:1262371 Probab=100.00 E-value=9.2e-87 Score=492.26 Aligned_cols=191 Identities=30% Similarity=0.494 Sum_probs=179.5 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) --++|+++|||||||+||++|||+||+||| +.+||+||+|+++|++++++||+||||||||++++ T Consensus 4 ~~~~~~~~teLdAVN~~L~aIGEspV~sld---------------~~~npdva~a~~iL~~v~~~vqseGW~FNte~~~~ 68 (196) T protein:vir:15 4 YEMNIETAEELSAVNDILASIGEPPVSTLE---------------GDANADVANARRVLNKINRQIQSRGWTFNIEEGVT 68 (196) T ss_pred cccchhhhhhhHHHHHHHHhcCcccccccc---------------CCCCccHHHHHHHHHHHHHHHhhCCceEeecCCce Confidence 357899999999999999999999999996 35899999999999999999999999999999999 Q ss_pred EeecC-CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDA-NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~-~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||+ +|+|++|+|+|+|+++++ ++++++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 69 ltPD~~~g~I~vP~n~L~v~~~~~----~~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~~~FedlPe~~~~yI~~rAa~ 144 (196) T protein:vir:15 69 LLPDAFSGMIPFSSDYLSVMATSG----QTQYINRGGYLYDRSAKTDRFPSGVQVNLIRLREFDEMPECFRNYIVTKASR 144 (196) T ss_pred eeecCCCCeEecCcceeEEecCCC----ceeEEEcCCeEEeccCCcEEeCCceEEEEEeecChhhhhHHHHHHHHHHHHH Confidence 99998 899999999999999875 4789999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceeccc Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYRSY 210 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~s~ 210 (218) +|+.+++|+++++|+++++|++|++.|++|||+||+|||| ++|..+..=+. T Consensus 145 ~f~~~~~G~~~~~q~l~~~e~~a~~~l~~~e~~q~~~Nml~~~~~~~~~~~r 196 (196) T protein:vir:15 145 QFNNRFFGAPEVDGVLQEEEQEAWRACFEYELDYGNYNMLDGDAFTSGLLNR 196 (196) T ss_pred HHHHhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeecCchhhccccC Confidence 9999999999999999999999999999999999999999 56655554443 No 4 >protein:vir:3365 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523336;swissprot:trembl:q8w5u4;genbank:gi:17570827;uniprot:Q8W5U4;genbank:GeneID:927450 Probab=100.00 E-value=1.1e-86 Score=491.74 Aligned_cols=191 Identities=30% Similarity=0.498 Sum_probs=179.4 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) --++|+++|||||||+||++|||+||+|||+ +.||+||+|+++|++++++||+||||||||++++ T Consensus 4 ~~~~~~~~teLdAVN~~L~aIGEspV~sld~---------------~~npdva~a~~iL~~v~~~vqseGW~FNte~~~~ 68 (196) T protein:vir:33 4 YEMNIETAEELSAVNDILASIGEPPVSTLEG---------------DANADVANARRVLNKINRQIQSRGWTFNIEEGVT 68 (196) T ss_pred cccchhhhhhhHHHHHHHHhcCccccccccC---------------CCCccHHHHHHHHHHHHHHHhhCCceEeecCcee Confidence 3578999999999999999999999999964 5799999999999999999999999999999999 Q ss_pred EeecC-CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDA-NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~-~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||+ +|+|++|+|+|+|+++++ .+++++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 69 ltPD~~~g~I~vP~n~L~v~~~~~----~~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~~~FedlPe~~~~yI~~rAa~ 144 (196) T protein:vir:33 69 LLPDAFSGMIPFSSDYLSVMATSG----QTQYVNRGGYLYDRSAKTDRFPSGVQVNLIRLREFDEMPECFRNYIVTKASR 144 (196) T ss_pred EeeeCCCCeEecCcceeEEecCCC----ceeEEEcCCeEEeccCCcEEeCCceEEEEEeecChhhhhHHHHHHHHHHHHH Confidence 99997 899999999999999875 4789999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceeccc Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYRSY 210 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~s~ 210 (218) +|+.+++|+++++|+++++|++|++.|++|||+||+|||| ++|..+..=+. T Consensus 145 ~f~~~~~G~~~~~q~l~~ee~~a~~~~~~~e~~q~~~Nml~~~~~~~~~~~r 196 (196) T protein:vir:33 145 QFNNRFFGAPEVDGVLQEEEQEAWSACFEYELDYGNYNMLDGDAFTSGLLNR 196 (196) T ss_pred HHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeecCchhhccccC Confidence 9999999999999999999999999999999999999999 56655554443 No 5 >protein:vir:8886 Length: 195 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813775;genbank:gi:29366730;genbank:GeneID:1258838 Probab=100.00 E-value=2.6e-86 Score=489.80 Aligned_cols=191 Identities=28% Similarity=0.470 Sum_probs=179.0 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) --++++++|||||||+||++|||+||+|||+ ..||+||+|+++|++++++||+||||||||++++ T Consensus 4 ~~~~~~~~teLdAVN~~L~aIGEspV~sld~---------------~~npdva~a~~iL~~v~~~vqseGW~FNte~~~~ 68 (195) T protein:vir:88 4 YEATLETDDELAAINDMLAAIGESPVSSLEG---------------DPNADVANARRILNQVNREVQSRGWTFNIEEGAV 68 (195) T ss_pred ccccccccchhHHHHHHHHhccccccccccC---------------CCCccHHHHHHHHHHHHHHHhhCCceEeecCCee Confidence 4578999999999999999999999999964 5799999999999999999999999999999999 Q ss_pred EeecC-CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDA-NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~-~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||+ +|+|++|+|+|+|+++++ ++|++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 69 ltpD~~~g~I~~P~n~L~v~~~~~-----~~~v~Rgg~lYD~~n~T~~F~~pi~~~iv~~~~FedlPe~~~~yI~~rAa~ 143 (195) T protein:vir:88 69 LSPDSFSGLIEYLSDYLRITTSGG-----TVYVNRGGYVYDRSTKTDVYTNDITVDLIRFKTFSEMPECFRSYIVAKASR 143 (195) T ss_pred eeeeCCCCeEecCcceeEEeecCC-----eeEEEeCCEEEeccCCceEeCCceEEEEEeecChhhhhHHHHHHHHHHHHH Confidence 99996 799999999999999875 469999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceecccC Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSYQ 211 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~r 211 (218) +|+.+++|+++++|+++++|++|++.|++|||+||+||||.+.++...-..| T Consensus 144 ~f~~~~~G~~~~~~~l~~~e~~A~~~~~e~e~~qg~~Nm~~~~~~~~~~~~r 195 (195) T protein:vir:88 144 RFNIRFFGAGEIEGSLQEQESEAWQQCQEYELDYGGFNMIDGDSYVGGIASR 195 (195) T ss_pred HHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHHhhCCcceeecCcccchhccC Confidence 9999999999999999999999999999999999999999765554443333 No 6 >protein:vir:94565 Length: 196 # NCBI annotation: Tubular tail protein A # Family: family:all:824 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919013;genbank:gi:119637777;genbank:GeneID:5179325 Probab=100.00 E-value=5.2e-86 Score=488.14 Aligned_cols=191 Identities=28% Similarity=0.467 Sum_probs=178.3 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) --++|+++|||||||+||++|||+||+||| +.+||+||+|+++|++++++||+||||||||++++ T Consensus 4 ~~~~~~~~teLdAVN~~L~aIGEspV~sld---------------~~~npdva~a~~iL~~v~r~vqseGW~FNte~~~~ 68 (196) T protein:vir:94 4 YETTLETGEELAAVNDILASIGEPPVSTLE---------------GDTNADVDNARRVLNKINRQIQSKGWTFNIEGGQQ 68 (196) T ss_pred cccchhhhhhhHHHHHHHHhcccccccccc---------------CCCCccHHHHHHHHHHHHHHHhhCCceeeecCCee Confidence 357889999999999999999999999996 35899999999999999999999999999999999 Q ss_pred EeecC-CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDA-NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~-~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||+ +|+|++|+|+|+|+++++ ++++++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 69 ltPD~~~g~I~vP~n~L~v~~~~~----~~~~v~Rgg~lYD~~n~T~~F~~pi~~~iv~~~~FedlPe~~~~yI~~rAa~ 144 (196) T protein:vir:94 69 LLPDVFNGLIPYMSDYLSVLSEGG----ATAYVNRGGYVFDRTTGTDIFEGPVTVTIIKLREFYEMPECFRSWIVTKAAR 144 (196) T ss_pred eeeeCCCCeEecCcceeEEeeCCC----ceeeEEcCceEEeccCCceEeCCceEEEEEeecChhhhhHHHHHHHHHHHHH Confidence 99996 799999999999999875 4789999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceecccCchhhhcC Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYRSYQPYKALIR 218 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~s~rP~~~l~r 218 (218) +|+.+++|+++++|+++++|++|+++|++|||+||+|||| ++|.....-+ | T Consensus 145 ~f~~~~~G~~~~~q~l~~~e~~a~~~~~e~e~~q~~~Nm~~~~~~~~~~~~--------r 196 (196) T protein:vir:94 145 QFNNRFFGAPEIDAVLAEEEQEAKMQCHEYELDFGNFNMLDGDAFTGGLLS--------R 196 (196) T ss_pred HHHHhhcCchHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCcccchhcc--------C Confidence 9999999999999999999999999999999999999999 4444443333 3 No 7 >protein:vir:94712 Length: 188 # NCBI annotation: tail tube # Family: family:all:824 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338121;genbank:gi:77118199;genbank:GeneID:3707735 Probab=100.00 E-value=5.2e-86 Score=488.12 Aligned_cols=185 Identities=22% Similarity=0.416 Sum_probs=176.1 Q ss_pred Cee--eccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCc Q lcl|NC_020878. 1 MTT--QIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDH 78 (218) Q Consensus 1 ~~~--~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~ 78 (218) |+| +|+++|||||||+||++|||+||+|||+ +||+||+|+++|++++++|||||||||||++ T Consensus 1 ~~~~~~~~~~teL~AVN~~L~aIGespV~sld~----------------~npdva~a~~iL~~v~~~vqs~GW~FNte~~ 64 (188) T protein:vir:94 1 MAQYIPLNANDDLDAINDMLAAIGEPAVLQLDE----------------GNADVSNAQRILHRVNRQVQAKGWNFNINEA 64 (188) T ss_pred CCccccccccchhHHHHHHHHhhCccccccccC----------------CCccHHHHHHHHHHHHHHHhcCCceeeecCC Confidence 886 8999999999999999999999999974 8999999999999999999999999999999 Q ss_pred eeEeecC-CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHH Q lcl|NC_020878. 79 VKISPDA-NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARA 157 (218) Q Consensus 79 ~~l~Pd~-~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~A 157 (218) ++|+||+ +|+|++|+|+|+|+++++ ++++++|||||||+.||||+|++|++|+||+++||||||++||+||++|| T Consensus 65 ~~ltPd~~~g~I~~P~n~L~v~~~~~----~~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~~~FeelPe~~~~~I~~rA 140 (188) T protein:vir:94 65 AVLTPDVQDNRIRFLPSYLRVMTAGA----TSYYSNMGGYLYDLSTQSTTFTDPITVELVEMKPFSEMPVVFRDYIVTKA 140 (188) T ss_pred eeeeeeCCCCeEecCcceeeeecCCC----ceeEeecCCeeEeccCCcEeeCCceeEEEEeecChhhccHHHHHHHHHHH Confidence 9999996 799999999999999876 46799999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccc Q lcl|NC_020878. 158 SVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHES 205 (218) Q Consensus 158 a~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~ 205 (218) |++|+.+++|+++++|+|+++|++|++.|++|||+||+||||++-.-- T Consensus 141 a~~f~~~~~G~~~~~q~l~~~e~~a~~~~~e~e~~q~~~Nml~~~~~~ 188 (188) T protein:vir:94 141 SREFNAKFFGSPESELYLREQEAELYQQVMEYEMDTGRYNMMSDIGRD 188 (188) T ss_pred HHHHHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCccccccccCC Confidence 999999999999999999999999999999999999999999532222 No 8 >protein:vir:80215 Length: 211 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522885;genbank:gi:158345178;genbank:GeneID:5687478 Probab=100.00 E-value=2.6e-85 Score=484.32 Aligned_cols=192 Identities=16% Similarity=0.227 Sum_probs=182.8 Q ss_pred cccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCceeEeecC Q lcl|NC_020878. 6 ATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVKISPDA 85 (218) Q Consensus 6 ~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~l~Pd~ 85 (218) -..|||||||+||++|||+||+|||+ +||+||+|+++|++++++|||||||||||++++|+||+ T Consensus 1 ~~~teLdAVN~~L~aIGEsPV~sld~----------------~npdva~a~~iL~~v~r~vqseGW~FNte~~~~ltPd~ 64 (211) T protein:vir:80 1 MQLTFLEAVNLVLRELGETPVTSVDE----------------TYPTLAQILPAMEDARRNTLAEGWWFNSFDDFTASPSP 64 (211) T ss_pred CcchHHHHHHHHHHhhCccccccccC----------------CchhHHHHHHHHHHHHHHHccCCeeEeecCCceeccCC Confidence 23489999999999999999999974 89999999999999999999999999999999999999 Q ss_pred CCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHHHHHhhc Q lcl|NC_020878. 86 NGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASVRAATQL 165 (218) Q Consensus 86 ~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~~f~~~~ 165 (218) +|+|++|+|+|+|++.++ .++++|||||||+.||||+|++|++++||+++||||||++||+||++|||++|+.++ T Consensus 65 ~g~I~iP~n~L~v~~~~~-----~~~~~Rgg~LYD~~n~T~~F~~pi~v~iv~~~~FeeLPe~~~~yI~~rAa~~f~~~~ 139 (211) T protein:vir:80 65 AGEVLLSEDTLAFYPDDV-----EKFTWAGRYVRVTGTGSKVVGAPVKGRVVLDIPYDELPEGMRYLVVYRCAYEVYVAD 139 (211) T ss_pred CCeEecCccceEEeeCCC-----eeeeeeCceEEeccCCcEeeCCceEEEEEeecChhhccHHHHHHHHHHHHHHHHhhc Confidence 999999999999999874 468999999999999999999999999999999999999999999999999999999 Q ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCC-ccceec---------ccCchhhhcC Q lcl|NC_020878. 166 VANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFP-HESNYR---------SYQPYKALIR 218 (218) Q Consensus 166 ~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p-~~~~~~---------s~rP~~~l~r 218 (218) +|+++++|+|+++|++|++.|++||++||+|||+..+ ..+..+ .|-|-+-||| T Consensus 140 ~G~d~~~q~l~~ee~~a~~~l~~~e~~q~~~Nm~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (211) T protein:vir:80 140 FGADSTAQVIANKMSAAYVEVRAVHIRQRKLTLRKRTPATSGVKRGTTNELLCRIVPAAPVWR 202 (211) T ss_pred CCchhHHHHHHHHHHHHHHHHHHHHHhhcCccccccCcccccccccccchhhhccccCccccc Confidence 9999999999999999999999999999999999555 556688 8999999999 No 9 >protein:vir:99676 Length: 197 # NCBI annotation: Tail tubular protein A # Family: family:all:824 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249590;genbank:gi:68299741;genbank:GeneID:3799991 Probab=100.00 E-value=3.8e-85 Score=483.41 Aligned_cols=190 Identities=27% Similarity=0.437 Sum_probs=178.0 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) -+++|+++|||||||+||++|||+||+|||+ ..||+||+|+++|++++++|||||||||||++++ T Consensus 6 ~~~~~~~~teLdAVN~~L~aIGesPV~sld~---------------~~npdva~a~~iL~~v~~~vqs~GW~FNte~~~~ 70 (197) T protein:vir:99 6 YESNWQYQAELDAINDILASIGESPVNTLES---------------DANADVVNARRILHKINRQEQSKGWTFNIEEGAT 70 (197) T ss_pred eeccccccchhHHHHHHHHhhcccccccccC---------------CCCccHHHHHHHHHHHHHHHhcCCceeeecCCee Confidence 3578999999999999999999999999964 5799999999999999999999999999999999 Q ss_pred EeecCCCe-EecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDANGH-YIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~~g~-I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||++|+ |++|+|+|+|++.++ ++|++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 71 ltPd~~~~~I~~P~n~L~v~~~~~-----~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~~~FeelPe~~~~yI~~rAa~ 145 (197) T protein:vir:99 71 LVPDVYSQLIPYMPNYLSVTTTGG-----TPYVNRGGYVYDRINKTDRFTSPITVNLISLRTFDEMPEQFKSYIVTKASK 145 (197) T ss_pred eeecCCCCeEEcCcceeeeecCcC-----ceeEEeCCeeEeccCCcEeeCCceEEEEEEecChhhccHHHHHHHHHHHHH Confidence 99999986 889999999998765 479999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceeccc Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYRSY 210 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~s~ 210 (218) +|+.+++|+++++|+++++|++|++.|++||++||+|||| ++|..+..-|. T Consensus 146 ~f~~~~~G~~~~~q~l~~~e~~a~~~~~e~e~~qg~~Nml~~~~~~~~~~~r 197 (197) T protein:vir:99 146 EFNIRFFGAPEIDTVLGNELIDLERAVNEYELDYGAFNIFNSDPYVSGAISR 197 (197) T ss_pred HHHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCChhhhccccC Confidence 9999999999999999999999999999999999999999 55555554443 No 10 >protein:vir:10451 Length: 196 # NCBI annotation: tail protein # Family: family:all:824 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848298;genbank:gi:30387489;genbank:GeneID:1733943 Probab=100.00 E-value=4.2e-85 Score=483.14 Aligned_cols=191 Identities=27% Similarity=0.483 Sum_probs=177.4 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) --++|+++|||||||+||++|||+||+|||+ +.||+||+|+++|++++++||+||||||||+++| T Consensus 4 ~~~~~~~~teLdAVN~~L~aIGEspV~sld~---------------~~npdva~a~~iL~~v~~~vqseGW~FNtE~~~~ 68 (196) T protein:vir:10 4 YDMNVETAAELSAVNDILASIGEPPVSTLEG---------------DSNADVANARRILNKINRQIQSRGWTFNIEEGIT 68 (196) T ss_pred ccccccccchhHHHHHHHHhccccccccccC---------------CCCccHHHHHHHHHHHHHHHhhCCceeeecCcee Confidence 3578999999999999999999999999963 5799999999999999999999999999999999 Q ss_pred EeecCCCeEec-CCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDANGHYII-PTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~~g~I~~-P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||++|+|++ |+|+|+|++.++ ++++++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 69 ltpD~~~~~i~~p~n~L~~~~~~~----~~~~v~Rgg~LYD~~n~T~~F~~pi~~~iv~~~~FedlPe~~~~yI~~rAa~ 144 (196) T protein:vir:10 69 LLPDVYSNLIVYSDDYLSLMATSG----QSIYVNRGGYVYDRTSQSDRFDSGITVNIIRLRDYDEMPECFRYWIVTKASR 144 (196) T ss_pred ecccCCCCeeeCCcceeeeecCCC----ceeeeeeCCeEEeccCCcEeeCCeeEEEEEeecChhhhhHHHHHHHHHHHHH Confidence 99999886555 889999998765 3679999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceeccc Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYRSY 210 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~s~ 210 (218) +|+.+++|+++++|+++++|++|++.|++|||+||+|||| ++|..+..-|. T Consensus 145 ~f~~~~~G~~~~~q~l~~~e~~a~~~l~e~e~~q~~~Nml~~~p~~~~~~~r 196 (196) T protein:vir:10 145 QFNNRFFGAPEVEGVLQEEEDEARRLCMEYEVDYGGYNMLDGDAFTSGLLTR 196 (196) T ss_pred HHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCcceeecCchhhccccC Confidence 9999999999999999999999999999999999999999 66655544443 No 11 >protein:vir:2202 Length: 196 # NCBI annotation: tail tubular protein # Family: family:all:824 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041999;swissprot:sw:p03746;genbank:gi:9627471;uniprot:P03746;genbank:GeneID:1261030 Probab=100.00 E-value=1.1e-84 Score=480.95 Aligned_cols=191 Identities=27% Similarity=0.475 Sum_probs=177.3 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) --++|+++|||||||+||++|||+||+|||+ ..||+||+|+++|++++++||+||||||||+++| T Consensus 4 ~~~~~~~~teLdAVN~~L~aIGEspV~sld~---------------~~npdva~a~~iL~~v~~~vqseGW~FNtE~~~~ 68 (196) T protein:vir:22 4 YDMNVETAAELSAVNDILASIGEPPVSTLEG---------------DANADAANARRILNKINRQIQSRGWTFNIEEGIT 68 (196) T ss_pred cccchhhhhhhHHHHHHHHhcCccccccccC---------------CCCccHHHHHHHHHHHHHHHhhCCceeeecCcee Confidence 3578999999999999999999999999964 5799999999999999999999999999999999 Q ss_pred EeecCCCeEec-CCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHH Q lcl|NC_020878. 81 ISPDANGHYII-PTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASV 159 (218) Q Consensus 81 l~Pd~~g~I~~-P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~ 159 (218) |+||++|+|++ |+|+|+|+..++ ++++++|||||||+.||||+|++|++++||+++||||||++||+||++|||+ T Consensus 69 ltpD~~~~~i~p~~~~L~~~~~~~----~~~~v~Rgg~LYD~~n~T~~F~~pi~~~iv~~~~FedlPe~~~~yI~~rAa~ 144 (196) T protein:vir:22 69 LLPDVYSNLIVYSDDYLSLMSTSG----QSIYVNRGGYVYDRTSQSDRFDSGITVNIIRLRDYDEMPECFRYWIVTKASR 144 (196) T ss_pred ecccCCCCeEeCccceeeeecCCC----ceeeeeeCCeEEeccCCcEeeCCceEEEEEeecChhhhhHHHHHHHHHHHHH Confidence 99999987666 579999998876 3689999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceeccc Q lcl|NC_020878. 160 RAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYRSY 210 (218) Q Consensus 160 ~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~s~ 210 (218) +|+.+++|+++++|+++++|++|++.|++|||+||+|||| ++|..+..-|. T Consensus 145 ~f~~~~~G~~~~~q~l~~~e~~a~~~l~e~e~~q~~~Nml~~~p~~~~~~~r 196 (196) T protein:vir:22 145 QFNNRFFGAPEVEGVLQEEEDEARRLCMEYEMDYGGYNMLDGDAFTSGLLTR 196 (196) T ss_pred HHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCcceeecCchhhccccC Confidence 9999999999999999999999999999999999999999 66655544443 No 12 >protein:vir:105646 Length: 245 # NCBI annotation: putative tail tubular A protein # Family: family:all:824 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425010;genbank:gi:83571758;uniprot:Q2WC42;genbank:GeneID:3837287 Probab=100.00 E-value=7.3e-85 Score=481.84 Aligned_cols=205 Identities=22% Similarity=0.330 Sum_probs=188.5 Q ss_pred Ce-eecc-ccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCc Q lcl|NC_020878. 1 MT-TQIA-TDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDH 78 (218) Q Consensus 1 ~~-~~~~-~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~ 78 (218) |+ +... .+|||||||+||++|||+||+|||++. .+||+|++|+++|++++|+||+||||||||++ T Consensus 14 ~~~~~~~~~~TeLdAVN~~L~aIGEsPV~sld~~~-------------~~~~~va~al~~l~~~~r~vqseGW~FNte~~ 80 (245) T protein:vir:10 14 MEDVAFQIIDSKLEAVNLCMRAIGREGVDSLDSGD-------------LDAEDASKMIDIVSQRFQYNKGGGWWFNREPN 80 (245) T ss_pred hhhhhhhhhhhhHHHHHHHHHhhCccccceecCCC-------------cchHHHHHHHHHHHHHHHHHccCCeeEeecCC Confidence 43 3444 479999999999999999999998742 37899999999999999999999999999999 Q ss_pred eeEeecCCCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeE------EEEEEEecChhhhhHHHHHH Q lcl|NC_020878. 79 VKISPDANGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDH------YFDITYLLAFNDVPPAIQRY 152 (218) Q Consensus 79 ~~l~Pd~~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v------~~~iv~~l~FedLP~~aq~y 152 (218) ++|+||++|+|++|+|+|+|+++++.++++.++++|||||||+.||||+|++|| +++||+++||||||++||+| T Consensus 81 ~~ltPD~~g~I~iP~n~L~v~~~~~~~~~~~~~v~RggrLYD~~nhT~~F~~pi~~~~~~~v~Iv~~~pFEdLPe~~q~y 160 (245) T protein:vir:10 81 WQIAPDTNGEVNLPNNCLAVLQCYALGEKKVPMTMRAGKLYSTWSHTFDMRKHVNANGMIRLTLLTLLPYEHLPTSVMQA 160 (245) T ss_pred eeeccCCCCeEecCccceeeeccCccccccceeEeccceEEeccccceecccccccCcceEEEEEeeCChhhhhHHHHHH Confidence 999999999999999999999999988888999999999999999999999986 68999999999999999999 Q ss_pred HHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceec------------ccCchhhhcC Q lcl|NC_020878. 153 IIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYR------------SYQPYKALIR 218 (218) Q Consensus 153 I~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~------------s~rP~~~l~r 218 (218) |++|||++|+.+++|+++++|+++++|++|++.|++|||+||+|||| ++|..+.+| |+.||++--- T Consensus 161 I~~rAA~~f~~~~~G~~~~~q~l~qee~~a~~~l~e~e~~q~~~Nml~~~~~~r~~r~m~~~~~~~~~~~~~~~~~~~~ 239 (245) T protein:vir:10 161 IAYQAAVEFIVSKDADQTKLATAQQIATQLLMDVQSEQMSQKRLNMLVHNPTQRQFGIMAGGSQNVPAYSHSPYDSWAL 239 (245) T ss_pred HHHHHHHHHHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCchhhhhhhhhhcccccccccccCCccccCc Confidence 99999999999999999999999999999999999999999999999 555445555 8899987422 No 13 >protein:vir:97033 Length: 245 # NCBI annotation: 32 # Family: family:all:824 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654133;genbank:gi:108862017;genbank:GeneID:5075982 Probab=100.00 E-value=9.5e-85 Score=481.23 Aligned_cols=205 Identities=21% Similarity=0.329 Sum_probs=188.3 Q ss_pred Ce-eecc-ccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCc Q lcl|NC_020878. 1 MT-TQIA-TDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDH 78 (218) Q Consensus 1 ~~-~~~~-~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~ 78 (218) |+ +... .+|||||||+||++|||+||+|||++. .+||+|++|+++|++++|+||+||||||||++ T Consensus 14 ~~~~~~~~~~TeLdAVN~~L~aIGEsPV~sld~~~-------------~~~~~va~al~~l~~~~r~vqseGW~FNte~~ 80 (245) T protein:vir:97 14 MEDVAFQIIDSKLEAVNLCMRAIGREGVDSLDSGD-------------LDAEDASKMIDIVSQRFQYNKGGGWWFNREPN 80 (245) T ss_pred hhhhhhhhhhhhHHHHHHHHHhhCccccceecCCC-------------cchHHHHHHHHHHHHHHHHHccCCeeEeecCC Confidence 43 3444 579999999999999999999998742 37899999999999999999999999999999 Q ss_pred eeEeecCCCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeE------EEEEEEecChhhhhHHHHHH Q lcl|NC_020878. 79 VKISPDANGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDH------YFDITYLLAFNDVPPAIQRY 152 (218) Q Consensus 79 ~~l~Pd~~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v------~~~iv~~l~FedLP~~aq~y 152 (218) ++|+||++|+|++|+|+|+|+++++.++++.++++|||||||+.||||+|++|| +++||+++||||||++||+| T Consensus 81 ~~ltPD~~g~I~iP~n~L~v~~~~~~~~~~~~~v~RggrLYD~~nhT~~F~~pi~~~~~~~v~Iv~~~pFEdLPe~~q~y 160 (245) T protein:vir:97 81 WQLAPDTNGEVNLPNNCLAVLQCYALGEKKVPMTMRAGKLYSTWSHTFDMRKHVNANGMIRLTLLTLLPYEHLPTSVMQA 160 (245) T ss_pred eeeccCCCCeEecCccceeeeccCccccccceeEeccceEEeccccceecccccccCcceEEEEEeeCChhhhhHHHHHH Confidence 999999999999999999999999988888999999999999999999999986 68999999999999999999 Q ss_pred HHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceec------------ccCchhhhcC Q lcl|NC_020878. 153 IIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYR------------SYQPYKALIR 218 (218) Q Consensus 153 I~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~------------s~rP~~~l~r 218 (218) |++|||++|+.+++|+++++|+++++|++|++.|++|||+||+|||| ++|..+.+| |+.||.+--- T Consensus 161 I~~rAA~~f~~~~~G~~~~~q~l~qee~~a~~~l~e~e~~q~~~Nml~~~~~~r~~r~m~~~~~~~~~~~~~~~~~~~~ 239 (245) T protein:vir:97 161 IAYQAAVEFIVSKDADQTKLATAQQIATQLLMDVQSEQMSQKRLNMLVHNPTQRQFGIMAGGSQNVPAYSHSPYESWAL 239 (245) T ss_pred HHHHHHHHHHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCchhhhhhhhhhcccccccccccCcccccCc Confidence 99999999999999999999999999999999999999999999999 555445555 8889986422 No 14 >protein:vir:78929 Length: 184 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522825;genbank:gi:158345060;genbank:GeneID:5687419 Probab=100.00 E-value=4.3e-84 Score=477.60 Aligned_cols=184 Identities=18% Similarity=0.208 Sum_probs=174.4 Q ss_pred cchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCceeEeecCCC Q lcl|NC_020878. 8 DTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVKISPDANG 87 (218) Q Consensus 8 ~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~l~Pd~~g 87 (218) =|||||||+||++|||+||+|||+ +||+||+|+++|++++++||+||||||||++++|+||++| T Consensus 1 ~teLdAVN~~L~aIGEspV~sld~----------------~npdva~a~~iL~~v~~~vqseGW~FNte~~~~ltPd~~g 64 (184) T protein:vir:78 1 MLLLDAVNVILRKIGELPIPSMDE----------------TYPTMAIALPELEDQRIQLLTQGWWFNTWWKHKLTPDPQG 64 (184) T ss_pred CchHHHHHHHHHhhCCcccccccC----------------CCccHHHHHHHHHHHHHHHhhCCceEeecCCeeeeecCCC Confidence 499999999999999999999974 8999999999999999999999999999999999999999 Q ss_pred eEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHHHHHhhccC Q lcl|NC_020878. 88 HYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASVRAATQLVA 167 (218) Q Consensus 88 ~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~~f~~~~~g 167 (218) +|++|+|+|+|++.+ .|+++|||||||+.||||+|++|++|+||+++||||||++||+||++|||++|+.+++| T Consensus 65 ~I~~P~n~L~i~~~~------~d~~~Rgg~lYD~~n~T~~F~~~i~~~iv~~~~FedlPe~~~~yI~~rAa~~f~~~~~G 138 (184) T protein:vir:78 65 RINLPKDTLAFYPDS------PDLQWDGLGVRDANTGDDRIGKSVEGRLVLSREWDRIPEIAQRVIAHQAALAVYTHEIG 138 (184) T ss_pred eEEcCccceEeecCC------ceeEEcCcEEEeccCCcEEeCCeeEEEEEeecChhhhhHHHHHHHHHHHHHHHHHhhcC Confidence 999999999998755 46999999999999999999999999999999999999999999999999999999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceecccCchhhhcC Q lcl|NC_020878. 168 NADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSYQPYKALIR 218 (218) Q Consensus 168 d~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~rP~~~l~r 218 (218) +++++|+++++|++|+++|++|||+||+||||.+-.++.+|++ |+- T Consensus 139 ~~~~~q~l~~ee~~a~~~~~~~e~~q~~~N~~~~~~~~r~r~~-----~~~ 184 (184) T protein:vir:78 139 PDETAQVIAQELQGYQNELSRMHTRSRPLNTQAKRSFSRWRRS-----LRT 184 (184) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhcCcchHHhhhhhHHHhh-----hcC Confidence 9999999999999999999999999999999988888777643 222 No 15 >protein:vir:7020 Length: 246 # NCBI annotation: tail protein # Family: family:all:824 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853593;genbank:gi:31711675;genbank:GeneID:1481801 Probab=100.00 E-value=1.4e-83 Score=474.84 Aligned_cols=202 Identities=25% Similarity=0.366 Sum_probs=188.3 Q ss_pred Ceeeccc-cchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHh---hCCceEeec Q lcl|NC_020878. 1 MTTQIAT-DTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ---NEGWHFNKE 76 (218) Q Consensus 1 ~~~~~~~-~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq---seGWwFNtE 76 (218) -|+++.+ +|||||||+||++|||+||+|||+ +||+++.|+++|+++++++| +|||||||| T Consensus 15 ~~~~~~~~~TeLdAVN~~L~aIGEsPV~sld~----------------~n~d~~~a~~iL~~v~~~vq~~lseGW~FNte 78 (246) T protein:vir:70 15 SDASFSIIDSKLEAVNLCMRAIGREGVDSLDS----------------GDLDAEDASKMLDIVSQRFQYNKGGGWWFNRE 78 (246) T ss_pred eccccccchhhHHHHHHHHHhhCccccccccC----------------CCccHHHHHHHHHHHHHHHHHhccCCeeEeec Confidence 3567777 999999999999999999999974 79999999999999999988 999999999 Q ss_pred CceeEeecCCCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeC------CeEEEEEEEecChhhhhHHHH Q lcl|NC_020878. 77 DHVKISPDANGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFS------GDHYFDITYLLAFNDVPPAIQ 150 (218) Q Consensus 77 ~~~~l~Pd~~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~------~~v~~~iv~~l~FedLP~~aq 150 (218) ++++|+||++|+|++|+|+|+|++++++.+++.++++|||||||+.||||+|+ +||+++||+++||||||++|| T Consensus 79 ~~~~ltPD~~g~I~iP~n~L~v~~~~~~~~~~~~vv~RGgkLYD~~n~T~~F~~~~~~D~pv~v~IV~~~~FedLPe~~q 158 (246) T protein:vir:70 79 PNWRIVPDTNGEVNLPNNCLAVLQCYALGERKVPMTMRAGKLYSTWNHTFDMRSHVNKDGAIRLTLLTYLPFEHLPTSVM 158 (246) T ss_pred CceeeccCCCCeEecCccceeeeeccCcccCceeeEEcCCeeEeecccceecccccccCcceEEEEEecCChhhhhHHHH Confidence 99999999999999999999999999998889999999999999999999995 699999999999999999999 Q ss_pred HHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceec------------ccCchhhh- Q lcl|NC_020878. 151 RYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYR------------SYQPYKAL- 216 (218) Q Consensus 151 ~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~------------s~rP~~~l- 216 (218) +||++|||++|+.+++||+++.++++++|++|+++|++|||+||+|||| ++|..+.+| |+.||++- T Consensus 159 ~yI~~rAA~~f~~~~~gd~~~~~~~~~~e~~a~~~~~~~e~~q~~~Nml~~~~~~r~~r~m~~~~~~~~~~~~~~~~~~~ 238 (246) T protein:vir:70 159 QAIAYQAAVEFIVSKDADKTKLTTHQQIAAQLFVDVQSEQMSQKRLNMLVHNPTQRQFGIMAGGSQNVPAYSHSPYDGHP 238 (246) T ss_pred HHHHHHHHHHHHhhccCchHHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCchhhhhhhhhhcccccccccccCCcCCcc Confidence 9999999999999999999999999999999999999999999999999 555555555 78888874 Q ss_pred ------cC Q lcl|NC_020878. 217 ------IR 218 (218) Q Consensus 217 ------~r 218 (218) +| T Consensus 239 ~~~~~~~~ 246 (246) T protein:vir:70 239 LKPWESYR 246 (246) T ss_pred CCccccCC Confidence 34 No 16 >protein:vir:6325 Length: 184 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877472;genbank:gi:33300844;uniprot:Q7Y2D2;genbank:GeneID:1482614 Probab=100.00 E-value=1.2e-83 Score=475.11 Aligned_cols=184 Identities=18% Similarity=0.184 Sum_probs=175.6 Q ss_pred cchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCceeEeecCCC Q lcl|NC_020878. 8 DTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVKISPDANG 87 (218) Q Consensus 8 ~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~l~Pd~~g 87 (218) =|||||||+||++|||+||+|||+ +||+||+|+++|++++++|||||||||||++++|+||++| T Consensus 1 ~teL~AVN~~L~aIGespV~sld~----------------~npdva~a~~iL~~v~~~vqs~GW~FNte~~~~ltPd~~g 64 (184) T protein:vir:63 1 MLLLDAVNVILRKIGELPTLSMDE----------------TYPTMAIALPELEDQRIQLLTQGWWFNTWWRHKLTPDPTG 64 (184) T ss_pred CchHHHHHHHHHhhCccccceecC----------------CCccHHHHHHHHHHHHHHHhcCCceEeecCCceeeecCCC Confidence 499999999999999999999984 8999999999999999999999999999999999999999 Q ss_pred eEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhhhhHHHHHHHHHHHHHHHHhhccC Q lcl|NC_020878. 88 HYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFNDVPPAIQRYIIARASVRAATQLVA 167 (218) Q Consensus 88 ~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~FedLP~~aq~yI~~~Aa~~f~~~~~g 167 (218) +|++|+|+|+|+++++ ++++|||||||+.|||++|++|++|+||+++||||||++||+||++|||++|+.+++| T Consensus 65 ~I~~P~n~L~v~~~~~------d~~~Rgg~LyD~~n~t~~F~~~i~v~iv~~~~FeelPe~~~~~I~~rAa~~f~~~~~G 138 (184) T protein:vir:63 65 RINLPKGTLAFYPDSP------DLQWDGLGVRDANTGDDRIGKPVEGRLVLSREWDHIPEIAQRVIAHQAALAVYTHEIG 138 (184) T ss_pred eEEcCcceeeeecCCC------ceEEcCCEEEeccCCcEEeCCceEEEEEeecChhhccHHHHHHHHHHHHHHHHhhccC Confidence 9999999999998753 6999999999999999999999999999999999999999999999999999999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceecccCch Q lcl|NC_020878. 168 NADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSYQPY 213 (218) Q Consensus 168 d~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~rP~ 213 (218) +++++|+++++|++|++.|++|||+||+|||+..-+++.+|+|--- T Consensus 139 ~~~~~q~l~~~e~~a~~~~~~~e~~q~~~Nm~~~~~~~~~~~~l~~ 184 (184) T protein:vir:63 139 PDETAQVIAQELQAYQNELSRMHTRSRPLNTQAKRSFSRWRRSLRT 184 (184) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhcCcchhhhhhhHHHHHhhcC Confidence 9999999999999999999999999999999998888887765322 No 17 >protein:vir:103305 Length: 245 # NCBI annotation: tail-like protein # Family: family:all:824 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039669;genbank:gi:125999998;genbank:GeneID:4818381 Probab=100.00 E-value=1.7e-83 Score=474.29 Aligned_cols=202 Identities=24% Similarity=0.346 Sum_probs=189.8 Q ss_pred Ce--eeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHh---hCCceEee Q lcl|NC_020878. 1 MT--TQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ---NEGWHFNK 75 (218) Q Consensus 1 ~~--~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq---seGWwFNt 75 (218) |+ |++..+|||||||+||++|||+||+|||+ +||+|++|++||++++++|| +||||||| T Consensus 13 ~~~~~~~~~dteLdAVN~~L~aIGEsPV~sld~----------------~npdva~A~~IL~~v~~~vQ~llseGW~FNt 76 (245) T protein:vir:10 13 LASVNLDTVDTRLEAINLCLRAVGYASIESEDS----------------GDLDAADASKILATVGQRVQYNGGKGWWFNV 76 (245) T ss_pred CcccccccccchHHHHHHHHHhhcccccccccC----------------CchhHHHHHHHHHHHHHHHHhhcCCCeeEee Confidence 54 45556999999999999999999999974 89999999999999999999 99999999 Q ss_pred cCceeEeecCCCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCceEeCCeE------EEEEEEecChhhhhHHH Q lcl|NC_020878. 76 EDHVKISPDANGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHTFVFSGDH------YFDITYLLAFNDVPPAI 149 (218) Q Consensus 76 E~~~~l~Pd~~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T~~F~~~v------~~~iv~~l~FedLP~~a 149 (218) |++++|+||++|+|++|+|+|+|+++..+.+++.++++|||||||+.||||+|++|+ +++||+++||||||++| T Consensus 77 e~~~~ltPd~~g~i~iP~n~L~v~~~~~~~~~~~~~v~RGgkLYD~~n~T~~F~~pv~~~~~~~v~iV~~~pFedlPe~~ 156 (245) T protein:vir:10 77 EPNWQMTPDANGEILIPNNAIAAWQDVRYDDKKVLISIRGRKVYNMNTHSTDFSNSLNREGFFRMTFMLNLPFEHMPVSA 156 (245) T ss_pred cCCceeccCCCCceecCccchhhhcccccCCCccceEEcCCeeEecccCceeccCccccccceeEEEEeeCChhhccHHH Confidence 999999999999999999999999988888888899999999999999999999998 69999999999999999 Q ss_pred HHHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-cCCccceec------------ccCchhhh Q lcl|NC_020878. 150 QRYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFF-GFPHESNYR------------SYQPYKAL 216 (218) Q Consensus 150 q~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l-~~p~~~~~~------------s~rP~~~l 216 (218) |+||++|||++|+.+++|+++++|+|+++|++|++.|++||++|+++||| ++|..+.+| |+.||++- T Consensus 157 q~yI~~rAA~~f~~~~~G~~~~~q~l~q~e~~a~~~~~~~~~~q~~~Nm~~~~p~~~~~r~~v~~~~~~~~~~~~~~~~~ 236 (245) T protein:vir:10 157 RQAIAYQAAVEFMVSKEFDAQKVQIWQQLAQQMQIDMGQESANQQSLNMFVNNPTQAHFGSMVGGPNANATFSRNPYNAY 236 (245) T ss_pred HHHHHHHHHHHHHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCchhhhcchhccccccccccccCCcccc Confidence 99999999999999999999999999999999999999999999999999 778778888 67788764 Q ss_pred ---cC Q lcl|NC_020878. 217 ---IR 218 (218) Q Consensus 217 ---~r 218 (218) .| T Consensus 237 ~~~~~ 241 (245) T protein:vir:10 237 GGYSR 241 (245) T ss_pred ccccc Confidence 33 No 18 >protein:vir:1780 Length: 67 # NCBI annotation: tail protein B # Family: family:all:824 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570346;genbank:gi:18640505;genbank:GeneID:932718 Probab=99.96 E-value=7.4e-34 Score=202.23 Aligned_cols=67 Identities=39% Similarity=0.699 Sum_probs=63.9 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhhCCceEeecCcee Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQNEGWHFNKEDHVK 80 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqseGWwFNtE~~~~ 80 (218) |. +|+++|||||||+||++|||+||+|||+ +||+||+|+++|++++|+||+||||||||++++ T Consensus 1 ~~-~~~~~teLdAVN~~L~aIGesPV~sld~----------------~npdva~a~~iL~~v~~~vqseGW~FNte~~~~ 63 (67) T protein:vir:17 1 MA-PIKRTSELDALNVKMTNIGQQPIVNINN----------------TNPQVALAKTVLNQVTSDVLTEGWIFNRELDYP 63 (67) T ss_pred CC-CccccchhhHHHHHHHhhCccccccccC----------------CCccHHHHHHHHHHHHHHHhhCCceeeccCcee Confidence 53 5889999999999999999999999974 899999999999999999999999999999999 Q ss_pred Eeec Q lcl|NC_020878. 81 ISPD 84 (218) Q Consensus 81 l~Pd 84 (218) |+|| T Consensus 64 ltPd 67 (67) T protein:vir:17 64 LTPQ 67 (67) T ss_pred ecCC Confidence 9999 No 19 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=99.11 E-value=6.3e-12 Score=82.01 Aligned_cols=189 Identities=17% Similarity=0.247 Sum_probs=142.7 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHh-hCCceEeecCce Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ-NEGWHFNKEDHV 79 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq-seGWwFNtE~~~ 79 (218) |+ |++|-.|.-|..||+.||+|+++ +.+....++...+.+++.++ ...|.|=+- .+ T Consensus 1 M~------S~v~IcN~AL~~lGa~~I~s~~e----------------~s~~A~~c~~~Y~~~r~~~L~~~pW~FA~~-r~ 57 (207) T protein:vir:10 1 MA------SQVGICNRALTKIGDKRITSLDE----------------DSKAAATLNSMYDDVLDACLRAHVWSFTKA-RA 57 (207) T ss_pred CC------CHHHHHHHHHHhhchhhhccccc----------------CCHHHHHHHHhhHHHHHHHHhccChhhHhh-hh Confidence 76 56689999999999999999986 55677789999999999999 679999985 68 Q ss_pred eEeecCC----C---eEecCCceEEEEecCCCcc-----cceeeEEeCCeEEeccCCceEeCCeEEEEEEEecChhh-hh Q lcl|NC_020878. 80 KISPDAN----G---HYIIPTNYLRYDIYEGLSD-----RTKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLLAFND-VP 146 (218) Q Consensus 80 ~l~Pd~~----g---~I~~P~n~L~v~~~~~~~d-----~~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l~Fed-LP 146 (218) .|.|++. | ...+|.|||+|........ ....+.+.||+|+=.. ..|+.++.|...+=++ .| T Consensus 58 ~La~~~~~P~~~~~yaY~LP~Dclrv~~v~~~~~~~~~~~~~~~~v~g~~ll~~~------~~~~~l~Y~~~v~d~~~fd 131 (207) T protein:vir:10 58 QLAALAEAPLFGFSYQYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDM------QAPLYIRYAKRVTDPNAMD 131 (207) T ss_pred hhcccccCCCCCCcccccCcccceEeeeecCCCCccccccccceEecCCeEEecC------CCcEEEEEeecCCChhhhh Confidence 9988754 3 3789999999976553211 0013555666653111 1368888888877444 69 Q ss_pred HHHHHHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceecccCchhh---hcC Q lcl|NC_020878. 147 PAIQRYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSYQPYKA---LIR 218 (218) Q Consensus 147 ~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~rP~~~---l~r 218 (218) ..|...++.+=|-+.....-|+..+.+.+.++++.+..+.+..+..++.---+..+.+=..|++.-|-+ ++| T Consensus 132 ~~F~~ala~~LAa~lA~pLt~~~~~~~~~~q~~~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~ 206 (207) T protein:vir:10 132 ALFREAFACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIR 206 (207) T ss_pred HHHHHHHHHHHHHHhhHhhcCChHHHHHHHHHHHHHHHHHHhcccccCcccccCCcchhhhcccccccccCCccc Confidence 999999999999999999999999999999999999999988888887755554444434443333322 344 No 20 >protein:vir:95323 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512268;genbank:gi:89152435;genbank:GeneID:3952992 Probab=98.96 E-value=2.2e-11 Score=78.97 Aligned_cols=179 Identities=17% Similarity=0.164 Sum_probs=136.7 Q ss_pred CeeeccccchHHHHHHHHHhhccc-cccccccccccccccccccccccccchHHHHHHHHHHHHHHHh-hCCceEeecCc Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQS-PVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ-NEGWHFNKEDH 78 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEs-PV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq-seGWwFNtE~~ 78 (218) |+ |+.|-.|.-|..||++ ||+|+++ +.+....++...+.+++.++ ...|.|=+- . T Consensus 1 M~------S~v~IcN~AL~~iG~a~~I~s~~e----------------~s~~A~~C~~~Y~~~r~~~L~~~pW~FA~~-r 57 (201) T protein:vir:95 1 MA------SVVEICNRALSNIGNSRSINSLTE----------------ASKEAGECSLHFEACRDAVLSDFDWNFATK-R 57 (201) T ss_pred CC------CHHHHHHHHHHHhCCccccccccc----------------CCHHHHHHHHhhHHHHHHHHhhcCchhhhh-h Confidence 76 5568999999999975 8999986 55677789999999999999 678999985 5 Q ss_pred eeEeecCC---C---eEecCCceEEEEecCCCccc------------ceeeEEeCCeEEeccCCceEeCCeEEEEEEEec Q lcl|NC_020878. 79 VKISPDAN---G---HYIIPTNYLRYDIYEGLSDR------------TKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLL 140 (218) Q Consensus 79 ~~l~Pd~~---g---~I~~P~n~L~v~~~~~~~d~------------~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l 140 (218) +.|.|.++ | -..+|.|||+|......+++ ..++...|+.||-. ..|+.++-|... T Consensus 58 ~~La~~a~~~~~~~yay~LP~Dclrv~~v~~~g~~~~~~~~~~~f~v~~~~~~~g~~l~td-------~~~~~l~Yv~~v 130 (201) T protein:vir:95 58 VALADTSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTD-------QPQAWLKYVSRV 130 (201) T ss_pred hhcccccCCCCCCcccccccchhhhhhhhccCCccccccccchhhhccccccccCceeeec-------CCceEEEEeecC Confidence 78877542 3 38899999999655332211 12345556666652 246777777666 Q ss_pred C-hhhhhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceec-c Q lcl|NC_020878. 141 A-FNDVPPAIQRYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYR-S 209 (218) Q Consensus 141 ~-FedLP~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~-s 209 (218) + =+..|..|...++.+=|-+.....-|+..+.+.+.++++.+....+..+..++.-.-+..+.+-..| | T Consensus 131 ~d~~~fd~~F~~ala~~LAa~la~plt~~~~~~~~~~q~~~~~l~~A~~~da~e~~~~~~~~~~~l~aRl~ 201 (201) T protein:vir:95 131 TDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSAGSHSQNESQEPQPPVDEFTIARLS 201 (201) T ss_pred CChhhccHHHHHHHHHHHHHHhhHhhcCChHHHHHHHHHHHHHHHHHHhcccccCcccCCCcchhhhhhcC Confidence 5 3457999999999999999999999999999999999999999999999888865444444443333 3 No 21 >protein:vir:7328 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848219;genbank:gi:30387390;genbank:GeneID:2641866 Probab=98.84 E-value=2e-10 Score=73.75 Aligned_cols=180 Identities=14% Similarity=0.117 Sum_probs=136.3 Q ss_pred CeeeccccchHHHHHHHHHhhccc-cccccccccccccccccccccccccchHHHHHHHHHHHHHHHh-hCCceEeecCc Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQS-PVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ-NEGWHFNKEDH 78 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEs-PV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq-seGWwFNtE~~ 78 (218) |+ |+.|-.|.-|..||++ ||+|+++ +.+....++...+.+++.++ ...|.|=+- . T Consensus 1 M~------S~v~IcN~AL~~iG~a~~I~s~~e----------------~s~~A~~c~~~Y~~~r~~~Lr~~pW~FA~~-r 57 (201) T protein:vir:73 1 MA------SVIEICNRALSNIGNSRSINSLIE----------------ASKEAGQCSLHFDACRDAALADFDWNFATK-R 57 (201) T ss_pred CC------CHHHHHHHHHHhhcCccccccccc----------------CCHHHHHHHHhhHHHHHHHHhhcCchhHhh-h Confidence 76 5668999999999985 8999986 55677789999999999999 778999985 5 Q ss_pred eeEeecCC---C---eEecCCceEEEEecCCCccc------------ceeeEEeCCeEEeccCCceEeCCeEEEEEEEec Q lcl|NC_020878. 79 VKISPDAN---G---HYIIPTNYLRYDIYEGLSDR------------TKDVVRKDGKLYDNVHHTFVFSGDHYFDITYLL 140 (218) Q Consensus 79 ~~l~Pd~~---g---~I~~P~n~L~v~~~~~~~d~------------~~~~v~RggkLYD~~~~T~~F~~~v~~~iv~~l 140 (218) +.|.|.++ | -..+|.|||+|....+.... ..++.+.|++||-. ..|+.++.|... T Consensus 58 ~~La~~a~~p~~~~yaY~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~~~~ieg~~i~td-------~~~~~l~Y~~~v 130 (201) T protein:vir:73 58 VALADTNNPPPDWQYAYQYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEDLTGKLIYTD-------QPKAWLKYMARV 130 (201) T ss_pred hhhhhcccCCCCCcccccccccceeeeeeccccccccccccccchhccccccccCCEeeec-------CCceeEEEeecC Confidence 78876542 3 38899999998765432110 12355667777642 246677777665 Q ss_pred C-hhhhhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCccceeccc Q lcl|NC_020878. 141 A-FNDVPPAIQRYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPHESNYRSY 210 (218) Q Consensus 141 ~-FedLP~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~~~s~ 210 (218) . =+-.|..|...++.|=|-++....-|+..+.+.+.++++.+.......+..++.-..+..+.+=..|=. T Consensus 131 ~d~~~fd~lF~~ala~~LAa~lA~plt~~~~~~~~~~q~~~~~~~~A~~~d~~e~~~~~~~~~~~l~aR~~ 201 (201) T protein:vir:73 131 TDVNMYDAIFMEALSWRLAAAINMALTGSADLGNNALTMYNRVILSAGSHSQNESQEPQPPVDEFTAARLS 201 (201) T ss_pred CCcccccHHHHHHHHHHHHHHhhHhhcCChHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCchHHHhhcC Confidence 5 344699999999999999999999999999999999999999999988888887665544444332211 No 22 >protein:vir:107429 Length: 223 # NCBI annotation: Bbp14 # Family: family:all:1524 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958683;genbank:gi:41179375;genbank:GeneID:2717223 Probab=98.30 E-value=6e-08 Score=60.17 Aligned_cols=197 Identities=11% Similarity=0.055 Sum_probs=128.2 Q ss_pred CeeeccccchHHHHHHHHHhhcccccc-ccccccccccccccccccccccchHHHHHHHHHHHHHHHh-hCCceEeecCc Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVT-TLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ-NEGWHFNKEDH 78 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~-sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq-seGWwFNtE~~ 78 (218) |+ |+.|-.|.-|.-||..|+. +++ .+++.+..-.++...+.+++.++ +..|.|=+- . T Consensus 1 M~------S~v~IcN~AL~~lG~~~i~~~~s--------------~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~-r 59 (223) T protein:vir:10 1 MA------SEVDICNLALAYLGDEATVAGIN--------------PPEGSVQAEYCARFYPFARDSLLELHTWGFATK-C 59 (223) T ss_pred CC------CHHHHHHHHHHhcccchhhcccC--------------CCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhh-h Confidence 66 5568999999999987765 332 11355666789999999999999 779999985 5 Q ss_pred eeEeecCC---C---eEecCCceEEEEecCCCcccceeeEEeCCe--E--EeccCCceEeC--CeEEEEEEEecC-hhhh Q lcl|NC_020878. 79 VKISPDAN---G---HYIIPTNYLRYDIYEGLSDRTKDVVRKDGK--L--YDNVHHTFVFS--GDHYFDITYLLA-FNDV 145 (218) Q Consensus 79 ~~l~Pd~~---g---~I~~P~n~L~v~~~~~~~d~~~~~v~Rggk--L--YD~~~~T~~F~--~~v~~~iv~~l~-FedL 145 (218) +.|.|.++ | ...+|.|||+|....+.....-+--.+.+. . +|.......++ .|+.++.|...+ =+-. T Consensus 60 ~~La~~a~p~~~~~yaY~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~td~~~~~l~Y~~~v~d~~~f 139 (223) T protein:vir:10 60 AQLAAMGISRPEWRFAYAQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQVNAVARYISLVKDTTKF 139 (223) T ss_pred hhhhhcccCCCCccccccccccceeeeeeccccccccccccccccceEEeeccccceeeeecCCceEEEEeecCCChhcc Confidence 78877642 3 489999999996654311000000000011 1 12222233333 367777777765 4556 Q ss_pred hHHHHHHHHHHHHHHHHhhccCcHHHHHHHH---HHHHHHHHHHHHHHhhhcccccccCCccce-------ecccCchhh Q lcl|NC_020878. 146 PPAIQRYIIARASVRAATQLVANADLVKLLQ---LEEAQTKATALEYDCEQGDHTFFGFPHESN-------YRSYQPYKA 215 (218) Q Consensus 146 P~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~---~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~-------~~s~rP~~~ 215 (218) |..|...++.|=|-+++.-.-|+..+.|.+. ++++.+..+.+..+..++.-.-+..+++-. .+.+-|--. T Consensus 140 d~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 219 (223) T protein:vir:10 140 SPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPN 219 (223) T ss_pred cHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCc Confidence 9999999999999999999999998887664 445567777777777777655554444422 223333334 Q ss_pred hcC Q lcl|NC_020878. 216 LIR 218 (218) Q Consensus 216 l~r 218 (218) .|| T Consensus 220 ~~~ 222 (223) T protein:vir:10 220 GWR 222 (223) T ss_pred ccc Confidence 566 No 23 >protein:vir:107803 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996624;genbank:gi:45580758;genbank:GeneID:2767879 Probab=98.30 E-value=6e-08 Score=60.17 Aligned_cols=197 Identities=11% Similarity=0.055 Sum_probs=128.2 Q ss_pred CeeeccccchHHHHHHHHHhhcccccc-ccccccccccccccccccccccchHHHHHHHHHHHHHHHh-hCCceEeecCc Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVT-TLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ-NEGWHFNKEDH 78 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~-sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq-seGWwFNtE~~ 78 (218) |+ |+.|-.|.-|.-||..|+. +++ .+++.+..-.++...+.+++.++ +..|.|=+- . T Consensus 1 M~------S~v~IcN~AL~~lG~~~i~~~~s--------------~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~-r 59 (223) T protein:vir:10 1 MA------SEVDICNLALAYLGDEATVAGIN--------------PPEGSVQAEYCARFYPFARDSLLELHTWGFATK-C 59 (223) T ss_pred CC------CHHHHHHHHHHhcccchhhcccC--------------CCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhh-h Confidence 66 5568999999999987765 332 11355666789999999999999 779999985 5 Q ss_pred eeEeecCC---C---eEecCCceEEEEecCCCcccceeeEEeCCe--E--EeccCCceEeC--CeEEEEEEEecC-hhhh Q lcl|NC_020878. 79 VKISPDAN---G---HYIIPTNYLRYDIYEGLSDRTKDVVRKDGK--L--YDNVHHTFVFS--GDHYFDITYLLA-FNDV 145 (218) Q Consensus 79 ~~l~Pd~~---g---~I~~P~n~L~v~~~~~~~d~~~~~v~Rggk--L--YD~~~~T~~F~--~~v~~~iv~~l~-FedL 145 (218) +.|.|.++ | ...+|.|||+|....+.....-+--.+.+. . +|.......++ .|+.++.|...+ =+-. T Consensus 60 ~~La~~a~p~~~~~yaY~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~td~~~~~l~Y~~~v~d~~~f 139 (223) T protein:vir:10 60 AQLAAMGISRPEWRFAYAQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQVNAVARYISLVKDTTKF 139 (223) T ss_pred hhhhhcccCCCCccccccccccceeeeeeccccccccccccccccceEEeeccccceeeeecCCceEEEEeecCCChhcc Confidence 78877642 3 489999999996654311000000000011 1 12222233333 367777777765 4556 Q ss_pred hHHHHHHHHHHHHHHHHhhccCcHHHHHHHH---HHHHHHHHHHHHHHhhhcccccccCCccce-------ecccCchhh Q lcl|NC_020878. 146 PPAIQRYIIARASVRAATQLVANADLVKLLQ---LEEAQTKATALEYDCEQGDHTFFGFPHESN-------YRSYQPYKA 215 (218) Q Consensus 146 P~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~---~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~-------~~s~rP~~~ 215 (218) |..|...++.|=|-+++.-.-|+..+.|.+. ++++.+..+.+..+..++.-.-+..+++-. .+.+-|--. T Consensus 140 d~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 219 (223) T protein:vir:10 140 SPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPN 219 (223) T ss_pred cHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCc Confidence 9999999999999999999999998887664 445567777777777777655554444422 223333334 Q ss_pred hcC Q lcl|NC_020878. 216 LIR 218 (218) Q Consensus 216 l~r 218 (218) .|| T Consensus 220 ~~~ 222 (223) T protein:vir:10 220 GWR 222 (223) T ss_pred ccc Confidence 566 No 24 >protein:vir:98502 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996576;genbank:gi:45569507;genbank:GeneID:2767830 Probab=98.30 E-value=6e-08 Score=60.17 Aligned_cols=197 Identities=11% Similarity=0.055 Sum_probs=128.2 Q ss_pred CeeeccccchHHHHHHHHHhhcccccc-ccccccccccccccccccccccchHHHHHHHHHHHHHHHh-hCCceEeecCc Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVT-TLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQ-NEGWHFNKEDH 78 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~-sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vq-seGWwFNtE~~ 78 (218) |+ |+.|-.|.-|.-||..|+. +++ .+++.+..-.++...+.+++.++ +..|.|=+- . T Consensus 1 M~------S~v~IcN~AL~~lG~~~i~~~~s--------------~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~-r 59 (223) T protein:vir:98 1 MA------SEVDICNLALAYLGDEATVAGIN--------------PPEGSVQAEYCARFYPFARDSLLELHTWGFATK-C 59 (223) T ss_pred CC------CHHHHHHHHHHhcccchhhcccC--------------CCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhh-h Confidence 66 5568999999999987765 332 11355666789999999999999 779999985 5 Q ss_pred eeEeecCC---C---eEecCCceEEEEecCCCcccceeeEEeCCe--E--EeccCCceEeC--CeEEEEEEEecC-hhhh Q lcl|NC_020878. 79 VKISPDAN---G---HYIIPTNYLRYDIYEGLSDRTKDVVRKDGK--L--YDNVHHTFVFS--GDHYFDITYLLA-FNDV 145 (218) Q Consensus 79 ~~l~Pd~~---g---~I~~P~n~L~v~~~~~~~d~~~~~v~Rggk--L--YD~~~~T~~F~--~~v~~~iv~~l~-FedL 145 (218) +.|.|.++ | ...+|.|||+|....+.....-+--.+.+. . +|.......++ .|+.++.|...+ =+-. T Consensus 60 ~~La~~a~p~~~~~yaY~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~td~~~~~l~Y~~~v~d~~~f 139 (223) T protein:vir:98 60 AQLAAMGISRPEWRFAYAQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQVNAVARYISLVKDTTKF 139 (223) T ss_pred hhhhhcccCCCCccccccccccceeeeeeccccccccccccccccceEEeeccccceeeeecCCceEEEEeecCCChhcc Confidence 78877642 3 489999999996654311000000000011 1 12222233333 367777777765 4556 Q ss_pred hHHHHHHHHHHHHHHHHhhccCcHHHHHHHH---HHHHHHHHHHHHHHhhhcccccccCCccce-------ecccCchhh Q lcl|NC_020878. 146 PPAIQRYIIARASVRAATQLVANADLVKLLQ---LEEAQTKATALEYDCEQGDHTFFGFPHESN-------YRSYQPYKA 215 (218) Q Consensus 146 P~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~---~~e~~a~~~l~~~e~~q~~~N~l~~p~~~~-------~~s~rP~~~ 215 (218) |..|...++.|=|-+++.-.-|+..+.|.+. ++++.+..+.+..+..++.-.-+..+++-. .+.+-|--. T Consensus 140 d~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 219 (223) T protein:vir:98 140 SPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPN 219 (223) T ss_pred cHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCc Confidence 9999999999999999999999998887664 445567777777777777655554444422 223333334 Q ss_pred hcC Q lcl|NC_020878. 216 LIR 218 (218) Q Consensus 216 l~r 218 (218) .|| T Consensus 220 ~~~ 222 (223) T protein:vir:98 220 GWR 222 (223) T ss_pred ccc Confidence 566 No 25 >protein:vir:94601 Length: 211 # NCBI annotation: PfWMP4_36 # Family: family:all:12085 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762666;genbank:gi:115304374;genbank:GeneID:5142301 Probab=97.27 E-value=3.9e-06 Score=50.26 Aligned_cols=176 Identities=16% Similarity=0.087 Sum_probs=113.0 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhh---CCceEeecC Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQN---EGWHFNKED 77 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqs---eGWwFNtE~ 77 (218) |. .+|-|.|-|.+|.+|||-||..+.+ |.=-.|+...+.+-|.|-+ -.|-+|+-. T Consensus 1 M~----~TsLL~A~NEVL~~vGE~~~~~~t~------------------~~G~k~r~~~~~AiR~V~slH~W~~L~~~v~ 58 (211) T protein:vir:94 1 MP----TTSLLIACNEVLLNVGELEVADFTT------------------PVGKKARLAYNSAIRAVSSLHAWQHLQATVS 58 (211) T ss_pred CC----cchhHhhhHHHHhhccchhhhhhcc------------------chhHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 54 4589999999999999999988743 3444578888888888764 345555533 Q ss_pred ceeEeecCCCeEecCCceEEEEecCCCcccceeeEEeCCeEEeccCCc---------------eEe---------CCeEE Q lcl|NC_020878. 78 HVKISPDANGHYIIPTNYLRYDIYEGLSDRTKDVVRKDGKLYDNVHHT---------------FVF---------SGDHY 133 (218) Q Consensus 78 ~~~l~Pd~~g~I~~P~n~L~v~~~~~~~d~~~~~v~RggkLYD~~~~T---------------~~F---------~~~v~ 133 (218) . |+=...|.|-...+.-+....++..|.-+-+-. +-|||+.--+ -+. .-.|+ T Consensus 59 A--lSW~~~~D~A~L~~IQ~L~sVS~G~~~~rs~G~--~ELYer~~~~a~T~T~l~Y~~~~~~~V~L~P~P~~a~~~~Ik 134 (211) T protein:vir:94 59 A--LSWNVEGDIATLTPIQELYSVSLGTDVLRSVGF--DELYERDIRIAATATPLYYARAEQNSVLLYPTPSVADRPNIK 134 (211) T ss_pred h--heeccccceehhhhhhhhhhhhccchhhhhcch--hhhhhcccceeeccchhheeeccCCeeEeccCccccccccee Confidence 1 222233444333333333333322111111100 1234332111 111 12256 Q ss_pred EEEEEec--------ChhhhhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCc Q lcl|NC_020878. 134 FDITYLL--------AFNDVPPAIQRYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTFFGFPH 203 (218) Q Consensus 134 ~~iv~~l--------~FedLP~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~l~~p~ 203 (218) +.|...+ .|. ||+-+-.++..+|+.....+-.-|-+-.|..+.|..----..+.-++.|---||=+.|. T Consensus 135 F~VL~~~T~PS~~t~~Ft-LPd~~~~LV~~~A~~LM~~~H~~D~~~A~~~~~E~El~t~~~R~~q~~~~~~~~~~~~~ 211 (211) T protein:vir:94 135 FRVLLQPTVPSLPTDNFT-LPDDFYDLVHIYAQMLMHRNHTTDLQAAQACQSEFELRTHMVRTRQTSQVVGNMGGYPT 211 (211) T ss_pred EEEeeccccCCCCCCccc-CchhHHHHHHHHHHHHHHHhhcchhhHHHHhhhHHHHHHHHHhcchhhhhhhccCCCCC Confidence 6654332 377 99999999999999999999999999999999988887778888888888888888887 No 26 >protein:vir:80185 Length: 221 # NCBI annotation: tail tubular protein A # Family: family:all:12085 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285796;genbank:gi:148747830;genbank:GeneID:5220461 Probab=93.62 E-value=0.002 Score=35.33 Aligned_cols=179 Identities=20% Similarity=0.228 Sum_probs=93.5 Q ss_pred CeeeccccchHHHHHHHHHhhccccccccccccccccccccccccccccchHHHHHHHHHHHHHHHhh---CCceEeecC Q lcl|NC_020878. 1 MTTQIATDTELSAVNSILGSIGQSPVTTLGTVTTDATNTGQEIVNTFANPQIAMIHGLLMEVTKDVQN---EGWHFNKED 77 (218) Q Consensus 1 ~~~~~~~~TeLdAVN~~L~aIGEsPV~sLd~~~~~~~~~~~~~ve~~~nP~Va~A~~~L~~v~~~vqs---eGWwFNtE~ 77 (218) |+ .+|-|.|-|.+|.+|||-||..+.+ |.=--++...+.+-|.|-+ =.|-+|+-. T Consensus 1 M~----~TtLL~A~NEVL~~iGE~~~~~~s~------------------~~G~r~k~~~~~AlR~V~aiH~W~~L~~~i~ 58 (221) T protein:vir:80 1 MS----DTTLLQASNEVLRSIGERPLLQLSG------------------TTGDRLKDVFRQALRDVEAIHTWDWLYNQIP 58 (221) T ss_pred CC----cchhHhhhHHHHhhccchhhhhhcc------------------chhHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 54 4589999999999999999998844 3334577788888887764 345555533 Q ss_pred ceeEeecCCCeEecCCceEEEEecCCCcccc---eeeEE-----------------eCCeEEeccC-CceEe-------- Q lcl|NC_020878. 78 HVKISPDANGHYIIPTNYLRYDIYEGLSDRT---KDVVR-----------------KDGKLYDNVH-HTFVF-------- 128 (218) Q Consensus 78 ~~~l~Pd~~g~I~~P~n~L~v~~~~~~~d~~---~~~v~-----------------RggkLYD~~~-~T~~F-------- 128 (218) ...- +|.|-...+.-+....++. +++ .++.+ .++-||=.++ +.-+- T Consensus 59 AiSW----~~D~A~L~~IQ~L~tVS~G-~kt~G~~ELqwvdftdYdk~~~~s~T~Tddn~m~Y~~~~~~~V~L~P~P~~s 133 (221) T protein:vir:80 59 AISW----TQDEAYLGDIQRLFTVSCG-DKTTGYRELQWVDFTDYDKQPITSYTGTDDNAMWYTMTSNGRVKLNPYPEDS 133 (221) T ss_pred ceee----ccccchhhhhhhhhhhccc-ccccchhhhhccccccccccceeeeecccCcceeeeeccCCeeEeccCcccc Confidence 2211 2333332222222222211 111 00100 1112222111 11111 Q ss_pred --CCeEEEEEEEec--------ChhhhhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_020878. 129 --SGDHYFDITYLL--------AFNDVPPAIQRYIIARASVRAATQLVANADLVKLLQLEEAQTKATALEYDCEQGDHTF 198 (218) Q Consensus 129 --~~~v~~~iv~~l--------~FedLP~~aq~yI~~~Aa~~f~~~~~gd~~~~q~l~~~e~~a~~~l~~~e~~q~~~N~ 198 (218) .-.|.+.|...+ .|.-||+-+..++..+|+.....+-.-|-+-.+..+.|..---. .+-.+|++.-. T Consensus 134 ~a~~~IrF~VL~~~T~PS~~s~~FsvLPe~~~~LV~~~A~~LM~~~H~~D~~~A~~~~~E~Ei~s~---~~R~~erkapt 210 (221) T protein:vir:80 134 QAQQRIRFYVLQTLTMPSQDSSTFSVLPERYMPLVIKRASYLMALRHLDDTSGAAYFNNEYEILSQ---QYRNNERKAPT 210 (221) T ss_pred ccccceeEEEeeccccCCCCCccccccchhhhHHHHHHHHHHHHHhhcchhhHHHHhhhHHHHHHH---HHhhhhhcchh Confidence 112555554332 38889999999999999999999988888888888776543221 22223322211 Q ss_pred ccCCccceecccCchhhhcC Q lcl|NC_020878. 199 FGFPHESNYRSYQPYKALIR 218 (218) Q Consensus 199 l~~p~~~~~~s~rP~~~l~r 218 (218) -+ .+-||-.| | T Consensus 211 qq---lsmyrrrr------r 221 (221) T protein:vir:80 211 QQ---LSMYRRRR------R 221 (221) T ss_pred HH---HHHHHhhc------C Confidence 00 01111111 1 Done!