Query lcl|NC_019540.1_cdsid_YP_007010556.1 [gene=F490_gp26] [protein=hypothetical protein] [protein_id=YP_007010556.1] [location=complement(30512..31267)] Match_columns 251 No_of_seqs 5 out of 8 Neff 2.5 Searched_HMMs 1612 Date Thu Nov 7 16:02:24 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_45 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_45_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95463 Length: 267 100.0 3E-147 2E-150 823.9 23.1 251 1-251 1-251 (267) 2 protein:vir:3130 Length: 250 # 100.0 1E-127 7E-131 716.5 21.9 240 1-247 2-250 (250) 3 protein:vir:2626 Length: 206 # 100.0 1E-117 8E-121 661.5 14.5 204 44-248 1-206 (206) 4 protein:vir:80215 Length: 211 94.7 0.0038 2.4E-06 33.8 13.6 177 1-251 1-183 (211) 5 protein:vir:6325 Length: 184 # 86.6 0.044 2.8E-05 28.0 11.7 163 3-251 1-170 (184) 6 protein:vir:95323 Length: 201 82.1 0.08 4.9E-05 26.6 13.0 188 1-251 1-196 (201) 7 protein:vir:103760 Length: 207 76.6 0.13 8.3E-05 25.4 11.1 180 1-251 1-197 (207) 8 protein:vir:7328 Length: 201 # 70.7 0.21 0.00013 24.4 10.5 181 1-249 1-201 (201) 9 protein:vir:78929 Length: 184 66.5 0.27 0.00017 23.7 12.5 163 3-251 1-170 (184) 10 protein:vir:98502 Length: 223 54.3 0.51 0.00031 22.2 11.2 188 1-249 1-223 (223) 11 protein:vir:107803 Length: 223 54.3 0.51 0.00031 22.2 11.2 188 1-249 1-223 (223) 12 protein:vir:107429 Length: 223 54.3 0.51 0.00031 22.2 11.2 188 1-249 1-223 (223) 13 protein:vir:1780 Length: 67 # 51.8 0.2 0.00012 24.5 4.1 61 1-62 1-67 (67) 14 protein:vir:8886 Length: 195 # 30.6 1.6 0.00097 19.5 12.8 177 1-251 1-195 (195) 15 protein:vir:99676 Length: 197 30.0 1.6 0.001 19.4 12.4 165 1-251 3-185 (197) 16 protein:vir:79050 Length: 133 26.1 0.3 0.00019 23.4 0.7 99 1-102 1-133 (133) 17 protein:vir:103305 Length: 245 25.6 2 0.0013 18.9 11.8 185 1-251 20-245 (245) 18 protein:vir:78741 Length: 197 23.8 2.3 0.0014 18.6 13.1 180 1-251 3-193 (197) 19 protein:vir:3365 Length: 196 # 23.4 2.3 0.0014 18.6 11.5 168 1-251 1-184 (196) No 1 >protein:vir:95463 Length: 267 # NCBI annotation: hypothetical protein ORF040 # Family: family:all:7212 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294633;genbank:gi:149408199;genbank:GeneID:5237030 Probab=100.00 E-value=3e-147 Score=823.90 Aligned_cols=251 Identities=60% Similarity=1.001 Sum_probs=250.4 Q ss_pred CchhHHHHHHHHHhhcCCccccchhhhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCCccee Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSINDTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKSFTEI 80 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~v~ei 80 (251) |.+|||+|||+|||+|+||||||||||||++|||+|+|+||++||+||+||||++.+||+|++++|+||||+||++|||| T Consensus 1 ~~~tll~iv~~~~~~~~sdev~s~~dtie~~~~~~~~~~v~~~mi~~r~wp~~~~~lkl~p~~~~A~pth~~l~tpVk~i 80 (267) T protein:vir:95 1 MIKTLLDIVQDILSEMSSDEVNSINDTIESMQVAQIVKSVYMSMMSNRNWPHQRKLIQLEPSGDDAYPTHMKLQTPIKEM 80 (267) T ss_pred ChhHHHHHHHHHHHhcccchhhhhhhhHHHHHHHHHHHHHHHHHHhhcccchhhhheeeccccccccceeeeecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEeccccc Q lcl|NC_019540. 81 VFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDSYDKE 160 (251) Q Consensus 81 ~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDSYdk~ 160 (251) |||+|||+|+||++||||+||||+|||||++++++|+++||+++|+|+||||||||||+||+|||||||++||||||||+ T Consensus 81 ~fl~Y~~~kda~~~~~yRtlky~~PdeF~~~~~~rn~~~dn~~~~~d~sgveLlirnd~~P~YyTSFDd~tvVlDSYDas 160 (267) T protein:vir:95 81 CFINYDCVKDGETRKRYRTMKWAEPDDFLRSISKRNNDQDNIDVIIDPSGVELLIRNDLAPTYYTSFDDTTLIFDSYDKA 160 (267) T ss_pred eeeeeeeeeccccceeeeeeeccChHHHHHhhhccCCCCCCceeEEccCceEEEeecCCCcceecccCCceeeeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCChhHHHHHHHHHHHHhhccccccCCC Q lcl|NC_019540. 161 VDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQASQKDEQESVKQQRWLSRKSWVVNGGI 240 (251) Q Consensus 161 vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~~K~Eq~a~rq~~~~srk~~~vn~G~ 240 (251) ||+|++++|++||||+||.|++.|+|||+|||+|||+|++||||+||++||||+|||+||+|||||+|++||+|+||||+ T Consensus 161 vd~tl~~ak~~A~~~~~p~ff~~D~fVp~Ipd~~f~~ll~EAks~Af~~fkq~anpkaEq~arRq~v~~~~k~~~vn~G~ 240 (267) T protein:vir:95 161 VDDTLQKSKIQAMAYVMPVFFMDDDFIPEIPDEARAALLEEAKSRAFITIKQMANQKAEQEAQRQQAWLSRKAWRVNGGI 240 (267) T ss_pred cccccccccceeEEEEeeeeecCCccCCCCcHHHHHHHHHHhhHHHhhhhhccCCchhHHHHHHHHHHHHhhhhhhccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcCCcccCC Q lcl|NC_019540. 241 KYPNYGRRGRK 251 (251) Q Consensus 241 ~~~NYGR~~~~ 251 (251) ++||||||+|| T Consensus 241 ~~~NYGRn~~~ 251 (267) T protein:vir:95 241 KYPNYGRNSMK 251 (267) T ss_pred cccccCccccc Confidence 99999999999 No 2 >protein:vir:3130 Length: 250 # NCBI annotation: hypothetical protein # Family: family:all:7212 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640312;genbank:gi:21234411;genbank:GeneID:956053 Probab=100.00 E-value=1.2e-127 Score=716.45 Aligned_cols=240 Identities=25% Similarity=0.433 Sum_probs=222.5 Q ss_pred CchhHHHHHHHHHhhcCCccccchh-hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceE--EecCCCc Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSIN-DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTH--MRVDKSF 77 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~-Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~--l~~p~~v 77 (251) =|+|||+|||+|||+|+|||||||| ||||++|||+|+++||++||+||+| +.+||+|++++|+||| |++|+|| T Consensus 2 ~~~tll~iv~~~~~~~~sdev~s~~~dsie~~~~~~~~~~v~~~~~~~~~w----~~lkl~p~~~~A~pth~~ls~P~~v 77 (250) T protein:vir:31 2 PKRTLLQIVKKMAQKTGSDEVTSLSEDSIEIQDMVDCALEVLEDIIYRNDW----EFLKDRPAQLEAGTNAIELSIPDNV 77 (250) T ss_pred chhhHHHHHHHHHHhcccchhcccchhhhHHHHHHHHHHHHHHHHhhcCCc----ceeeecccccccccceeeeeccccc Confidence 5999999999999999999999999 9999999999999999999999999 5678999999999999 8899999 Q ss_pred ceeeEEeeeecccCCcceeeeeeeccCchHHHHHh-hccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEec Q lcl|NC_019540. 78 TEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRV-NKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDS 156 (251) Q Consensus 78 ~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~-~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDS 156 (251) ||||||+||++|+| ++||||+||||+|||||+|+ +|+++++|+.++ .+||||||||||+||+|||||||++||||| T Consensus 78 k~i~fl~Y~v~da~-~~~~yRtlky~~PdeF~~sl~sn~~~d~D~~~v--~I~gveLlirnd~~P~YyTSFDd~tvvlDS 154 (250) T protein:vir:31 78 RKIQTLRYRYEDAG-VQNCFRTLRYMYPHEFMERLQNNKPTDPDTTTV--TINGVELYPKTNRHPRYWTSFDEQNVVLDS 154 (250) T ss_pred cceeeeeeeecccc-cceeeeeeeccChHHHHHHHhhcCCCCccccee--EeeeeeeeeecCCCcceecccCCceeeeec Confidence 99999999988555 99999999999999999999 455555555554 555999999999999999999999999999 Q ss_pred cccccchhcccccceeEEEEeeeeeee---cccCCCCcHHHHHHHHHHHHHHHHHHHhccCChhHHHHHHHHHHHHhhc- Q lcl|NC_019540. 157 YDKEVDDTLQSSKTQCVAYVLPEFYVI---DDHKPDLPEEAMAAYISEVKSQAFIKLKQQASQKDEQESVKQQRWLSRK- 232 (251) Q Consensus 157 Ydk~vd~Tl~~sktq~ia~~~p~F~~~---dd~v~dip~~~f~~l~~EAks~af~~~kq~~~~K~Eq~a~rq~~~~srk- 232 (251) |||+||+|++++|++||++++|.|++. |+|||+|||+|||+|++||||+||++||||+|||+||+|||||+|+++| T Consensus 155 YDasvd~tl~~ak~~A~~~v~y~~F~ds~~D~fvp~Ipd~~f~~ll~EAks~Af~~fkq~anpkaEq~arR~~~q~~~k~ 234 (250) T protein:vir:31 155 YDATQNPTGVDATDSAIIATLYLDFTGSDADSWVAPIPESLFTLWEQEAVAEAFVQFRQTENPRAERRSRRTYVQQIKKE 234 (250) T ss_pred cccccccccccchheeeeeeecceecCCcccccCCCCcHHHHHHHHHHhhHHHhhhhhcccCchhHHHHHHHHHHHhhhc Confidence 999999999999999966665555554 5699999999999999999999999999999999999999999999999 Q ss_pred -cccccCCCCCCcCCc Q lcl|NC_019540. 233 -SWVVNGGIKYPNYGR 247 (251) Q Consensus 233 -~~~vn~G~~~~NYGR 247 (251) .|.+|||+++||||| T Consensus 235 ~~~hkn~G~~~~NYGR 250 (250) T protein:vir:31 235 PVTHKDEGSDEVNYGR 250 (250) T ss_pred cccccccCcCCCCCCC Confidence 888999999999999 No 3 >protein:vir:2626 Length: 206 # NCBI annotation: gp28 # Family: family:all:7212 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064767;genbank:gi:9964637;genbank:GeneID:1263042 Probab=100.00 E-value=1.2e-117 Score=661.51 Aligned_cols=204 Identities=33% Similarity=0.525 Sum_probs=201.2 Q ss_pred HhccCChhHhhcceeeecccCCccceEEecCCCcceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCce Q lcl|NC_019540. 44 MTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKSFTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVD 123 (251) Q Consensus 44 mi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~ 123 (251) ||+||+||||++++||+|+|+++||||||||++|||||||+|||+| ||++||||+||||+|||||+++++||++|+|++ T Consensus 1 m~~~r~~p~~~~~~~l~~~~~~a~pth~s~pt~vk~i~fl~Y~~~k-a~~~~~yRtlky~~PdeF~~~~~~r~~lq~N~~ 79 (206) T protein:vir:26 1 MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSK-AVGGVEYRRLKYLSPDEFFGLSDGRDSLASNVK 79 (206) T ss_pred CCccccccccceeeEeeccCCcccceeeecCccccceeeeeeeeee-cccceeeeeeeccCchhhhhhcccccchhccCc Confidence 9999999999999999999999999999999999999999999998 669999999999999999999999999999999 Q ss_pred EEEe-cCceEEEEecCCCCeeeeccCCceEEEeccccccchhcccccceeEEEEeeee-eeecccCCCCcHHHHHHHHHH Q lcl|NC_019540. 124 VILD-MSGVEVAVRNDRAPEYYTSFDDDVIVFDSYDKEVDDTLQSSKTQCVAYVLPEF-YVIDDHKPDLPEEAMAAYISE 201 (251) Q Consensus 124 ~v~d-~~gvel~irnd~aP~y~TSFDd~~vVlDSYdk~vd~Tl~~sktq~ia~~~p~F-~~~dd~v~dip~~~f~~l~~E 201 (251) +++| .||||||||||+||+|||||||++||||||||+||+||+++|++||||+||+| .++|+|||+|||+|||+|++| T Consensus 80 ~~~D~~sgveLlirnd~~P~YyTSFDd~tvvlDSYDasvd~tl~~ak~~A~a~~yp~F~s~~D~fvp~Ipd~~f~~ll~E 159 (206) T protein:vir:26 80 QVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDAILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAE 159 (206) T ss_pred ccccCCCceEEEEecCCCcceecccCCceeeeeccccccccccccccceeeEEeeeeccccccccCCCCcHHHHHHHHHH Confidence 9999 89999999999999999999999999999999999999999999999999999 667779999999999999999 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHHHHhhccccccCCCCCCcCCcc Q lcl|NC_019540. 202 VKSQAFIKLKQQASQKDEQESVKQQRWLSRKSWVVNGGIKYPNYGRR 248 (251) Q Consensus 202 Aks~af~~~kq~~~~K~Eq~a~rq~~~~srk~~~vn~G~~~~NYGR~ 248 (251) |||+||++||||+|||+||+|||||+|+++++|+||||+++|||||+ T Consensus 160 Aks~Af~~fkq~anpkaEq~arRQq~~~~~~~hkvn~G~~~~NYGRn 206 (206) T protein:vir:26 160 AKSTAMSLFKSGADPKIEQTARRQKVYVQNDMHKVNTGRAKNNYGRN 206 (206) T ss_pred hhhhhhhhhhccCCchhhHHhhhhheeccccceecccCcccCCCCCC Confidence 99999999999999999999999999999999999999999999999 No 4 >protein:vir:80215 Length: 211 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522885;genbank:gi:158345178;genbank:GeneID:5687478 Probab=94.66 E-value=0.0038 Score=33.84 Aligned_cols=177 Identities=16% Similarity=0.160 Sum_probs=93.7 Q ss_pred CchhHHHHHHHHHhhcCCccccchh-hhhHHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccceEEecCCCcc Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSIN-DTFESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPTHMRVDKSFT 78 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~-Dt~Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt~l~~p~~v~ 78 (251) |++|.|+-|-.||..++--.|+||+ +.-+...+..|+..+-....+ ..|=|=. +.+.|.|-.| -+..+|.++- T Consensus 1 ~~~teLdAVN~~L~aIGEsPV~sld~~npdva~a~~iL~~v~r~vqs-eGW~FNte~~~~ltPd~~----g~I~iP~n~L 75 (211) T protein:vir:80 1 MQLTFLEAVNLVLRELGETPVTSVDETYPTLAQILPAMEDARRNTLA-EGWWFNSFDDFTASPSPA----GEVLLSEDTL 75 (211) T ss_pred CcchHHHHHHHHHHhhCccccccccCCchhHHHHHHHHHHHHHHHcc-CCeeEeecCCceeccCCC----CeEecCccce Confidence 9999999999999999999999999 445666666777777777777 5555554 6777887764 3666787765 Q ss_pred eeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEeccc Q lcl|NC_019540. 79 EIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDSYD 158 (251) Q Consensus 79 ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDSYd 158 (251) .| ++. |+.. | ..-|--||-|.+. +-+|| T Consensus 76 ~v-----~~~--~~~~--~-----------------------------~~Rgg~LYD~~n~-----------T~~F~--- 103 (211) T protein:vir:80 76 AF-----YPD--DVEK--F-----------------------------TWAGRYVRVTGTG-----------SKVVG--- 103 (211) T ss_pred EE-----eeC--CCee--e-----------------------------eeeCceEEeccCC-----------cEeeC--- Confidence 44 221 1000 1 0112233333221 12222 Q ss_pred cccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCC--hhHHHHHHHHHHHHhhccccc Q lcl|NC_019540. 159 KEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQAS--QKDEQESVKQQRWLSRKSWVV 236 (251) Q Consensus 159 k~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~--~K~Eq~a~rq~~~~srk~~~v 236 (251) ++-+-.|-+.+ .| | +||+.+.-+....|-.+|...+--.++ +...|+..+.++.+. ..+-- T Consensus 104 --------~pi~v~iv~~~-~F---e----eLPe~~~~yI~~rAa~~f~~~~~G~d~~~q~l~~ee~~a~~~l~-~~e~~ 166 (211) T protein:vir:80 104 --------APVKGRVVLDI-PY---D----ELPEGMRYLVVYRCAYEVYVADFGADSTAQVIANKMSAAYVEVR-AVHIR 166 (211) T ss_pred --------CceEEEEEeec-Ch---h----hccHHHHHHHHHHHHHHHHhhcCCchhHHHHHHHHHHHHHHHHH-HHHHh Confidence 11122222222 23 2 499988888887777777666433222 122222222222221 11211 Q ss_pred cC--CCCCCcCCcccCC Q lcl|NC_019540. 237 NG--GIKYPNYGRRGRK 251 (251) Q Consensus 237 n~--G~~~~NYGR~~~~ 251 (251) +| -...-|+.-+|++ T Consensus 167 q~~~Nm~~~~~~~~~~~ 183 (211) T protein:vir:80 167 QRKLTLRKRTPATSGVK 183 (211) T ss_pred hcCccccccCccccccc Confidence 11 1222233323333 No 5 >protein:vir:6325 Length: 184 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877472;genbank:gi:33300844;uniprot:Q7Y2D2;genbank:GeneID:1482614 Probab=86.60 E-value=0.044 Score=28.00 Aligned_cols=163 Identities=13% Similarity=0.094 Sum_probs=90.5 Q ss_pred hhHHHHHHHHHhhcCCccccchh-hhhHHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccceEEecCCCccee Q lcl|NC_019540. 3 PTLLEIVQEVLNDMDSDEVNSIN-DTFESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPTHMRVDKSFTEI 80 (251) Q Consensus 3 ~TlL~IVq~~lsdmdsDeVnSI~-Dt~Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt~l~~p~~v~ei 80 (251) +|.|+-|-.||..+|--.|+||+ +.-+.+.+..|+..+=....+ ..|=|=. +.+.|+|-.+ -+..+|.++-.| T Consensus 1 ~teL~AVN~~L~aIGespV~sld~~npdva~a~~iL~~v~~~vqs-~GW~FNte~~~~ltPd~~----g~I~~P~n~L~v 75 (184) T protein:vir:63 1 MLLLDAVNVILRKIGELPTLSMDETYPTMAIALPELEDQRIQLLT-QGWWFNTWWRHKLTPDPT----GRINLPKGTLAF 75 (184) T ss_pred CchHHHHHHHHHhhCccccceecCCCccHHHHHHHHHHHHHHHhc-CCceEeecCCceeeecCC----CeEEcCcceeee Confidence 99999999999999999999999 555677777777777777776 5665654 6677777764 366688876555 Q ss_pred eEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEeccccc Q lcl|NC_019540. 81 VFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDSYDKE 160 (251) Q Consensus 81 ~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDSYdk~ 160 (251) |.+ | . +++. -|--|+-+.+. +-+|| T Consensus 76 ----~~~---~-~-----------------------------d~~~--Rgg~LyD~~n~-----------t~~F~----- 100 (184) T protein:vir:63 76 ----YPD---S-P-----------------------------DLQW--DGLGVRDANTG-----------DDRIG----- 100 (184) T ss_pred ----ecC---C-C-----------------------------ceEE--cCCEEEeccCC-----------cEEeC----- Confidence 111 1 0 1111 12223333221 11222 Q ss_pred cchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCC-----hhHHHHHHHHHHHHhhcccc Q lcl|NC_019540. 161 VDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQAS-----QKDEQESVKQQRWLSRKSWV 235 (251) Q Consensus 161 vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~-----~K~Eq~a~rq~~~~srk~~~ 235 (251) ++-+-.|-+.+ .| | +||+.+.-+....|-.+|-..+=-.++ +..||+|+++-.. .+ T Consensus 101 ------~~i~v~iv~~~-~F---e----elPe~~~~~I~~rAa~~f~~~~~G~~~~~q~l~~~e~~a~~~~~~----~e- 161 (184) T protein:vir:63 101 ------KPVEGRLVLSR-EW---D----HIPEIAQRVIAHQAALAVYTHEIGPDETAQVIAQELQAYQNELSR----MH- 161 (184) T ss_pred ------CceEEEEEeec-Ch---h----hccHHHHHHHHHHHHHHHHhhccCchhHHHHHHHHHHHHHHHHHH----HH- Confidence 11122232322 23 2 499888888777776666555432221 2344444433221 11 Q ss_pred ccCCCCCCcCCcccCC Q lcl|NC_019540. 236 VNGGIKYPNYGRRGRK 251 (251) Q Consensus 236 vn~G~~~~NYGR~~~~ 251 (251) -+.|++.|- T Consensus 162 -------~~q~~~Nm~ 170 (184) T protein:vir:63 162 -------TRSRPLNTQ 170 (184) T ss_pred -------HhhcCcchh Confidence 122333332 No 6 >protein:vir:95323 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512268;genbank:gi:89152435;genbank:GeneID:3952992 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=188 Identities=13% Similarity=0.148 Sum_probs=98.6 Q ss_pred CchhHHHHHHHHHhhcC-Cccccchh-hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCCcc Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMD-SDEVNSIN-DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKSFT 78 (251) Q Consensus 1 mk~TlL~IVq~~lsdmd-sDeVnSI~-Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~v~ 78 (251) |- |.++|-..-|+.++ ++.|||++ +|-++.+....--.+-+.......|-|-+|-++|.+.+. | |. T Consensus 1 M~-S~v~IcN~AL~~iG~a~~I~s~~e~s~~A~~C~~~Y~~~r~~~L~~~pW~FA~~r~~La~~a~---~-----~~--- 68 (201) T protein:vir:95 1 MA-SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALADTSN---P-----PP--- 68 (201) T ss_pred CC-CHHHHHHHHHHHhCCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhhhhhhhcccccC---C-----CC--- Confidence 65 99999999999998 78899999 566666665555566677788999999999999977655 3 10 Q ss_pred eeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeec--cCCceEEEec Q lcl|NC_019540. 79 EIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTS--FDDDVIVFDS 156 (251) Q Consensus 79 ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TS--FDd~~vVlDS 156 (251) .|. -.|..|.++++-++=++.+..+..- ...+.|-+. |+-+. T Consensus 69 -----~~~-------------yay~LP~Dclrv~~v~~~g~~~~~~-------------~~~~~f~v~~~~~~~g----- 112 (201) T protein:vir:95 69 -----DWE-------------YAYQYPSDCLRITEIMLPGVRNPTA-------------AMRVQYEVGADTNGTG----- 112 (201) T ss_pred -----CCc-------------ccccccchhhhhhhhccCCcccccc-------------ccchhhhccccccccC----- Confidence 122 3478899999877665554333221 111222111 11100 Q ss_pred cccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHH---HHHhccCChhHHHHHHHHHHHHhhcc Q lcl|NC_019540. 157 YDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAF---IKLKQQASQKDEQESVKQQRWLSRKS 233 (251) Q Consensus 157 Ydk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af---~~~kq~~~~K~Eq~a~rq~~~~srk~ 233 (251) ..|..+-..+ ++-=.+.+. ++++|+.+..||-+.-. +-..=+.|..-.+.....+....+.. T Consensus 113 ------~~l~td~~~~--~l~Yv~~v~-------d~~~fd~~F~~ala~~LAa~la~plt~~~~~~~~~~q~~~~~l~~A 177 (201) T protein:vir:95 113 ------KLIYTDQPQA--WLKYVSRVT-------DVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA 177 (201) T ss_pred ------ceeeecCCce--EEEEeecCC-------ChhhccHHHHHHHHHHHHHHhhHhhcCChHHHHHHHHHHHHHHHHH Confidence 0011111000 000011111 13445555555543321 22222333333444555566666666 Q ss_pred ccccCC-CCCCcCCcccCC Q lcl|NC_019540. 234 WVVNGG-IKYPNYGRRGRK 251 (251) Q Consensus 234 ~~vn~G-~~~~NYGR~~~~ 251 (251) -.+++- .+.+..+--+.- T Consensus 178 ~~~da~e~~~~~~~~~~~l 196 (201) T protein:vir:95 178 GSHSQNESQEPQPPVDEFT 196 (201) T ss_pred HhcccccCcccCCCcchhh Confidence 666662 222233221111 No 7 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=76.63 E-value=0.13 Score=25.38 Aligned_cols=180 Identities=16% Similarity=0.230 Sum_probs=89.0 Q ss_pred CchhHHHHHHHHHhhcCCccccchh-hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCCcce Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSIN-DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKSFTE 79 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~-Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~v~e 79 (251) |- |.++|...-|+.++.+.|+|++ +|.++.+....--.+-+.......|-|-+|-++|.+.+.. |.. T Consensus 1 M~-S~v~IcN~AL~~lGa~~I~s~~e~s~~A~~c~~~Y~~~r~~~L~~~pW~FA~~r~~La~~~~~--P~~--------- 68 (207) T protein:vir:10 1 MA-SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEA--PLF--------- 68 (207) T ss_pred CC-CHHHHHHHHHHhhchhhhcccccCCHHHHHHHHhhHHHHHHHHhccChhhHhhhhhhcccccC--CCC--------- Confidence 54 9999999999999999999999 6666666666666666778888999999999999765541 110 Q ss_pred eeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCce---EEEecCceEEEEecCCCCeeeeccCCceEEEec Q lcl|NC_019540. 80 IVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVD---VILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDS 156 (251) Q Consensus 80 i~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~---~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDS 156 (251) .|.+ .|..|.++++-+.-.+.+..+-+ ..--+.|-. T Consensus 69 ----~~~y-------------aY~LP~Dclrv~~v~~~~~~~~~~~~~~~~v~g~~------------------------ 107 (207) T protein:vir:10 69 ----GFSY-------------QYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGN------------------------ 107 (207) T ss_pred ----CCcc-------------cccCcccceEeeeecCCCCccccccccceEecCCe------------------------ Confidence 1111 24557777665444332211100 000111111 Q ss_pred cccccchhcccccceeEEEEeeeeeeecccCCCCc-HHHHHHHHHHHHHHHH---HHHh-ccCChhHHHHHHHHHHHHhh Q lcl|NC_019540. 157 YDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLP-EEAMAAYISEVKSQAF---IKLK-QQASQKDEQESVKQQRWLSR 231 (251) Q Consensus 157 Ydk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip-~~~f~~l~~EAks~af---~~~k-q~~~~K~Eq~a~rq~~~~sr 231 (251) |..+... =+++. ++-+++ ++.|+-+..||-+.-+ +-.. .+++.+.+. ....+....+ T Consensus 108 --------ll~~~~~-~~~l~--------Y~~~v~d~~~fd~~F~~ala~~LAa~lA~pLt~~~~~~~~-~~q~~~~~l~ 169 (207) T protein:vir:10 108 --------ILTDMQA-PLYIR--------YAKRVTDPNAMDALFREAFACRLAAEACESLTQSATKRQG-AWAEHDQAIA 169 (207) T ss_pred --------EEecCCC-cEEEE--------EeecCCChhhhhHHHHHHHHHHHHHHhhHhhcCChHHHHH-HHHHHHHHHH Confidence 1111110 00010 111111 3345555555543322 1112 233334443 3333444445 Q ss_pred ccccccC--CCCCCcC------CcccCC Q lcl|NC_019540. 232 KSWVVNG--GIKYPNY------GRRGRK 251 (251) Q Consensus 232 k~~~vn~--G~~~~NY------GR~~~~ 251 (251) ..-.+++ +..++-+ .|.|-- T Consensus 170 ~A~~~da~e~~~~~~~~~~~l~aR~~~~ 197 (207) T protein:vir:10 170 AAIRVNAIERPAQPLGDDTWLESRNGVA 197 (207) T ss_pred HHHhcccccCcccccCCcchhhhccccc Confidence 5555665 2222211 122222 No 8 >protein:vir:7328 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848219;genbank:gi:30387390;genbank:GeneID:2641866 Probab=70.73 E-value=0.21 Score=24.36 Aligned_cols=181 Identities=15% Similarity=0.201 Sum_probs=94.1 Q ss_pred CchhHHHHHHHHHhhcC-Cccccchh-hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCCcc Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMD-SDEVNSIN-DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKSFT 78 (251) Q Consensus 1 mk~TlL~IVq~~lsdmd-sDeVnSI~-Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~v~ 78 (251) |- |.++|--.-|+.++ ++.|+|++ +|-++.+....--.+-+.....-.|-|-+|-++|.+.+. | | . T Consensus 1 M~-S~v~IcN~AL~~iG~a~~I~s~~e~s~~A~~c~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~---~-----p--~- 68 (201) T protein:vir:73 1 MA-SVIEICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVALADTNN---P-----P--P- 68 (201) T ss_pred CC-CHHHHHHHHHHhhcCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc---C-----C--C- Confidence 64 99999999999998 78899998 676777666666666677888999999999999977654 2 1 0 Q ss_pred eeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEE----------EecCceEEEEecCCCCeeeeccC Q lcl|NC_019540. 79 EIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVI----------LDMSGVEVAVRNDRAPEYYTSFD 148 (251) Q Consensus 79 ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v----------~d~~gvel~irnd~aP~y~TSFD 148 (251) .|.+ -|..|.++++-++-.+.+..+..-. -++.|..|+ +|.+|- T Consensus 69 -----~~~y-------------aY~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~~~~ieg~~i~--td~~~~------ 122 (201) T protein:vir:73 69 -----DWQY-------------AYQYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEDLTGKLIY--TDQPKA------ 122 (201) T ss_pred -----CCcc-------------cccccccceeeeeeccccccccccccccchhccccccccCCEee--ecCCce------ Confidence 2222 2788999997776655554433211 223333222 222221 Q ss_pred CceEEEeccccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCChhHHHHHHHHHHH Q lcl|NC_019540. 149 DDVIVFDSYDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQASQKDEQESVKQQRW 228 (251) Q Consensus 149 d~~vVlDSYdk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~~K~Eq~a~rq~~~ 228 (251) +| .|=..|.++ .-.|++=.+|+++.|+-.-+ ..=+.|..-.|....++.. T Consensus 123 ----~l-~Y~~~v~d~--------------------~~fd~lF~~ala~~LAa~lA-----~plt~~~~~~~~~~q~~~~ 172 (201) T protein:vir:73 123 ----WL-KYMARVTDV--------------------NMYDAIFMEALSWRLAAAIN-----MALTGSADLGNNALTMYNR 172 (201) T ss_pred ----eE-EEeecCCCc--------------------ccccHHHHHHHHHHHHHHhh-----HhhcCChHHHHHHHHHHHH Confidence 11 111111110 01122223344443332222 2222233333333344444 Q ss_pred HhhccccccCCCCC-CcC-------Cccc Q lcl|NC_019540. 229 LSRKSWVVNGGIKY-PNY-------GRRG 249 (251) Q Consensus 229 ~srk~~~vn~G~~~-~NY-------GR~~ 249 (251) ..+..-.+++-... +.. .|.| T Consensus 173 ~~~~A~~~d~~e~~~~~~~~~~~l~aR~~ 201 (201) T protein:vir:73 173 VILSAGSHSQNESQEPQPPVDEFTAARLS 201 (201) T ss_pred HHHHHHHhhhccccCCCCCCchHHHhhcC Confidence 44444444442111 111 2334 No 9 >protein:vir:78929 Length: 184 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522825;genbank:gi:158345060;genbank:GeneID:5687419 Probab=66.50 E-value=0.27 Score=23.73 Aligned_cols=163 Identities=15% Similarity=0.141 Sum_probs=86.3 Q ss_pred hhHHHHHHHHHhhcCCccccchhhhh-HHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccceEEecCCCccee Q lcl|NC_019540. 3 PTLLEIVQEVLNDMDSDEVNSINDTF-ESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPTHMRVDKSFTEI 80 (251) Q Consensus 3 ~TlL~IVq~~lsdmdsDeVnSI~Dt~-Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt~l~~p~~v~ei 80 (251) +|.|+-|-.||..+|--.|+||++++ +...+-.|+.++=...-+ ..|=|=. +..+|.|-.+ -+..+|.++-.| T Consensus 1 ~teLdAVN~~L~aIGEspV~sld~~npdva~a~~iL~~v~~~vqs-eGW~FNte~~~~ltPd~~----g~I~~P~n~L~i 75 (184) T protein:vir:78 1 MLLLDAVNVILRKIGELPIPSMDETYPTMAIALPELEDQRIQLLT-QGWWFNTWWKHKLTPDPQ----GRINLPKDTLAF 75 (184) T ss_pred CchHHHHHHHHHhhCCcccccccCCCccHHHHHHHHHHHHHHHhh-CCceEeecCCeeeeecCC----CeEEcCccceEe Confidence 99999999999999999999999554 444445556666565554 4455554 6678887764 577888885444 Q ss_pred eEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEeccccc Q lcl|NC_019540. 81 VFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDSYDKE 160 (251) Q Consensus 81 ~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDSYdk~ 160 (251) .+ .| . +++. -|--|+-+.+. +-+|| T Consensus 76 -----~~--~~-~-----------------------------d~~~--Rgg~lYD~~n~-----------T~~F~----- 100 (184) T protein:vir:78 76 -----YP--DS-P-----------------------------DLQW--DGLGVRDANTG-----------DDRIG----- 100 (184) T ss_pred -----ec--CC-c-----------------------------eeEE--cCcEEEeccCC-----------cEEeC----- Confidence 21 11 0 1111 12233333221 11232 Q ss_pred cchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCC-----hhHHHHHHHHHHHHhhcccc Q lcl|NC_019540. 161 VDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQAS-----QKDEQESVKQQRWLSRKSWV 235 (251) Q Consensus 161 vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~-----~K~Eq~a~rq~~~~srk~~~ 235 (251) ++-+-.|-+.+ .| -+||+.+.-+....|-.+|-..+=-.++ +..||+|+.+ +.+ .+ T Consensus 101 ------~~i~~~iv~~~-~F-------edlPe~~~~yI~~rAa~~f~~~~~G~~~~~q~l~~ee~~a~~~---~~~-~e- 161 (184) T protein:vir:78 101 ------KSVEGRLVLSR-EW-------DRIPEIAQRVIAHQAALAVYTHEIGPDETAQVIAQELQGYQNE---LSR-MH- 161 (184) T ss_pred ------CeeEEEEEeec-Ch-------hhhhHHHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHH---HHH-HH- Confidence 12222232222 33 2789888887777766666554432221 2334444332 221 11 Q ss_pred ccCCCCCCcCCcccCC Q lcl|NC_019540. 236 VNGGIKYPNYGRRGRK 251 (251) Q Consensus 236 vn~G~~~~NYGR~~~~ 251 (251) -+.|++.|- T Consensus 162 -------~~q~~~N~~ 170 (184) T protein:vir:78 162 -------TRSRPLNTQ 170 (184) T ss_pred -------HhhcCcchH Confidence 122332222 No 10 >protein:vir:98502 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996576;genbank:gi:45569507;genbank:GeneID:2767830 Probab=54.34 E-value=0.51 Score=22.21 Aligned_cols=188 Identities=13% Similarity=0.196 Sum_probs=85.2 Q ss_pred CchhHHHHHHHHHhhcCCccc-cchh---hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCC Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEV-NSIN---DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKS 76 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeV-nSI~---Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~ 76 (251) |- |.++|--.-|+.++.+.| ||+| +|-++.+....--.+-+.....-.|-|-+|-++|-+.+. -|.. T Consensus 1 M~-S~v~IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~--p~~~------ 71 (223) T protein:vir:98 1 MA-SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQLAAMGI--SRPE------ 71 (223) T ss_pred CC-CHHHHHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc--CCCC------ Confidence 65 999999999999998886 4554 688888887777777788888999999999999976553 1211 Q ss_pred cceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEec Q lcl|NC_019540. 77 FTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDS 156 (251) Q Consensus 77 v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDS 156 (251) |.+ .|..|.++++-+.=.+.+..+++--.+ .-+.|-...|+ T Consensus 72 --------~~y-------------aY~LP~Dclrv~~v~~~~~~~~~~~~~-----------~~~~~~~e~~~------- 112 (223) T protein:vir:98 72 --------WRF-------------AYAQPADAIKIVAVLPHDAANIEAGID-----------NAQPFSCEIDN------- 112 (223) T ss_pred --------ccc-------------cccccccceeeeeeccccccccccccc-----------cccceEEeecc------- Confidence 222 577888888866654544433221111 11111111111 Q ss_pred cccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHH-H---hccCC---hhHHHHHHHHHHHH Q lcl|NC_019540. 157 YDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIK-L---KQQAS---QKDEQESVKQQRWL 229 (251) Q Consensus 157 Ydk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~-~---kq~~~---~K~Eq~a~rq~~~~ 229 (251) +-...|..+-. =+++-=.+...| +++|+.+..||-+.-++- | -.+++ ++++|..+.-+. . T Consensus 113 ---~g~~~i~td~~--~~~l~Y~~~v~d-------~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~-~ 179 (223) T protein:vir:98 113 ---TGADIILTNQV--NAVARYISLVKD-------TTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQA-Y 179 (223) T ss_pred ---ccceeeeecCC--ceEEEEeecCCC-------hhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHH-H Confidence 00001111000 000000111111 333444444443322210 0 11111 222222211111 1 Q ss_pred hhccccccCCCCCCc----------------CCc--------cc Q lcl|NC_019540. 230 SRKSWVVNGGIKYPN----------------YGR--------RG 249 (251) Q Consensus 230 srk~~~vn~G~~~~N----------------YGR--------~~ 249 (251) .++.-.+++--+.+. +|| +| T Consensus 180 l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~~~~~ 223 (223) T protein:vir:98 180 LSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPNGWRG 223 (223) T ss_pred HHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCccccC Confidence 111111111000000 222 22 No 11 >protein:vir:107803 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996624;genbank:gi:45580758;genbank:GeneID:2767879 Probab=54.34 E-value=0.51 Score=22.21 Aligned_cols=188 Identities=13% Similarity=0.196 Sum_probs=85.2 Q ss_pred CchhHHHHHHHHHhhcCCccc-cchh---hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCC Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEV-NSIN---DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKS 76 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeV-nSI~---Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~ 76 (251) |- |.++|--.-|+.++.+.| ||+| +|-++.+....--.+-+.....-.|-|-+|-++|-+.+. -|.. T Consensus 1 M~-S~v~IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~--p~~~------ 71 (223) T protein:vir:10 1 MA-SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQLAAMGI--SRPE------ 71 (223) T ss_pred CC-CHHHHHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc--CCCC------ Confidence 65 999999999999998886 4554 688888887777777788888999999999999976553 1211 Q ss_pred cceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEec Q lcl|NC_019540. 77 FTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDS 156 (251) Q Consensus 77 v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDS 156 (251) |.+ .|..|.++++-+.=.+.+..+++--.+ .-+.|-...|+ T Consensus 72 --------~~y-------------aY~LP~Dclrv~~v~~~~~~~~~~~~~-----------~~~~~~~e~~~------- 112 (223) T protein:vir:10 72 --------WRF-------------AYAQPADAIKIVAVLPHDAANIEAGID-----------NAQPFSCEIDN------- 112 (223) T ss_pred --------ccc-------------cccccccceeeeeeccccccccccccc-----------cccceEEeecc------- Confidence 222 577888888866654544433221111 11111111111 Q ss_pred cccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHH-H---hccCC---hhHHHHHHHHHHHH Q lcl|NC_019540. 157 YDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIK-L---KQQAS---QKDEQESVKQQRWL 229 (251) Q Consensus 157 Ydk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~-~---kq~~~---~K~Eq~a~rq~~~~ 229 (251) +-...|..+-. =+++-=.+...| +++|+.+..||-+.-++- | -.+++ ++++|..+.-+. . T Consensus 113 ---~g~~~i~td~~--~~~l~Y~~~v~d-------~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~-~ 179 (223) T protein:vir:10 113 ---TGADIILTNQV--NAVARYISLVKD-------TTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQA-Y 179 (223) T ss_pred ---ccceeeeecCC--ceEEEEeecCCC-------hhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHH-H Confidence 00001111000 000000111111 333444444443322210 0 11111 222222211111 1 Q ss_pred hhccccccCCCCCCc----------------CCc--------cc Q lcl|NC_019540. 230 SRKSWVVNGGIKYPN----------------YGR--------RG 249 (251) Q Consensus 230 srk~~~vn~G~~~~N----------------YGR--------~~ 249 (251) .++.-.+++--+.+. +|| +| T Consensus 180 l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~~~~~ 223 (223) T protein:vir:10 180 LSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPNGWRG 223 (223) T ss_pred HHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCccccC Confidence 111111111000000 222 22 No 12 >protein:vir:107429 Length: 223 # NCBI annotation: Bbp14 # Family: family:all:1524 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958683;genbank:gi:41179375;genbank:GeneID:2717223 Probab=54.34 E-value=0.51 Score=22.21 Aligned_cols=188 Identities=13% Similarity=0.196 Sum_probs=85.2 Q ss_pred CchhHHHHHHHHHhhcCCccc-cchh---hhhHHHHHHHHHHHHHHHHhccCChhHhhcceeeecccCCccceEEecCCC Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEV-NSIN---DTFESAQVASICKTVFRNMTSNRNWPHMKRAVNLVPFSDNNYPTHMRVDKS 76 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeV-nSI~---Dt~Es~q~a~i~k~vy~nmi~nr~w~~~kr~iql~~~sd~~~pt~l~~p~~ 76 (251) |- |.++|--.-|+.++.+.| ||+| +|-++.+....--.+-+.....-.|-|-+|-++|-+.+. -|.. T Consensus 1 M~-S~v~IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~--p~~~------ 71 (223) T protein:vir:10 1 MA-SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQLAAMGI--SRPE------ 71 (223) T ss_pred CC-CHHHHHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc--CCCC------ Confidence 65 999999999999998886 4554 688888887777777788888999999999999976553 1211 Q ss_pred cceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEec Q lcl|NC_019540. 77 FTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDS 156 (251) Q Consensus 77 v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDS 156 (251) |.+ .|..|.++++-+.=.+.+..+++--.+ .-+.|-...|+ T Consensus 72 --------~~y-------------aY~LP~Dclrv~~v~~~~~~~~~~~~~-----------~~~~~~~e~~~------- 112 (223) T protein:vir:10 72 --------WRF-------------AYAQPADAIKIVAVLPHDAANIEAGID-----------NAQPFSCEIDN------- 112 (223) T ss_pred --------ccc-------------cccccccceeeeeeccccccccccccc-----------cccceEEeecc------- Confidence 222 577888888866654544433221111 11111111111 Q ss_pred cccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHH-H---hccCC---hhHHHHHHHHHHHH Q lcl|NC_019540. 157 YDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIK-L---KQQAS---QKDEQESVKQQRWL 229 (251) Q Consensus 157 Ydk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~-~---kq~~~---~K~Eq~a~rq~~~~ 229 (251) +-...|..+-. =+++-=.+...| +++|+.+..||-+.-++- | -.+++ ++++|..+.-+. . T Consensus 113 ---~g~~~i~td~~--~~~l~Y~~~v~d-------~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~-~ 179 (223) T protein:vir:10 113 ---TGADIILTNQV--NAVARYISLVKD-------TTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQA-Y 179 (223) T ss_pred ---ccceeeeecCC--ceEEEEeecCCC-------hhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHH-H Confidence 00001111000 000000111111 333444444443322210 0 11111 222222211111 1 Q ss_pred hhccccccCCCCCCc----------------CCc--------cc Q lcl|NC_019540. 230 SRKSWVVNGGIKYPN----------------YGR--------RG 249 (251) Q Consensus 230 srk~~~vn~G~~~~N----------------YGR--------~~ 249 (251) .++.-.+++--+.+. +|| +| T Consensus 180 l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~~~~~ 223 (223) T protein:vir:10 180 LSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPNGWRG 223 (223) T ss_pred HHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCccccC Confidence 111111111000000 222 22 No 13 >protein:vir:1780 Length: 67 # NCBI annotation: tail protein B # Family: family:all:824 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570346;genbank:gi:18640505;genbank:GeneID:932718 Probab=51.80 E-value=0.2 Score=24.47 Aligned_cols=61 Identities=16% Similarity=0.284 Sum_probs=40.5 Q ss_pred Cc----hhHHHHHHHHHhhcCCccccchhhhhH-HHHHHHHHHHHHHHHhccCChhHhh-cceeeecc Q lcl|NC_019540. 1 MK----PTLLEIVQEVLNDMDSDEVNSINDTFE-SAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPF 62 (251) Q Consensus 1 mk----~TlL~IVq~~lsdmdsDeVnSI~Dt~E-s~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~ 62 (251) |+ -|.|+-|-.||..++--.|+||+++.- ...+-.++..+-+..- ..-|=|=. +.++|+|- T Consensus 1 ~~~~~~~teLdAVN~~L~aIGesPV~sld~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~ltPd 67 (67) T protein:vir:17 1 MAPIKRTSELDALNVKMTNIGQQPIVNINNTNPQVALAKTVLNQVTSDVL-TEGWIFNRELDYPLTPQ 67 (67) T ss_pred CCCccccchhhHHHHHHHhhCccccccccCCCccHHHHHHHHHHHHHHHh-hCCceeeccCceeecCC Confidence 54 589999999999999999999997654 3333334444444444 33444444 56666665 No 14 >protein:vir:8886 Length: 195 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813775;genbank:gi:29366730;genbank:GeneID:1258838 Probab=30.61 E-value=1.6 Score=19.52 Aligned_cols=177 Identities=19% Similarity=0.186 Sum_probs=86.5 Q ss_pred Cc--------hhHHHHHHHHHhhcCCccccchhhh--hHHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccce Q lcl|NC_019540. 1 MK--------PTLLEIVQEVLNDMDSDEVNSINDT--FESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPT 69 (251) Q Consensus 1 mk--------~TlL~IVq~~lsdmdsDeVnSI~Dt--~Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt 69 (251) |+ -|-|+-|-.||..+|--.|+||+++ -+...+-.|+.++=...-+. .|=|=. +.++|+|-++ .. T Consensus 1 ~~~~~~~~~~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vqse-GW~FNte~~~~ltpD~~---~g 76 (195) T protein:vir:88 1 MRSYEATLETDDELAAINDMLAAIGESPVSSLEGDPNADVANARRILNQVNREVQSR-GWTFNIEEGAVLSPDSF---SG 76 (195) T ss_pred CCcccccccccchhHHHHHHHHhccccccccccCCCCccHHHHHHHHHHHHHHHhhC-CceEeecCCeeeeeeCC---CC Confidence 44 4889999999999999999999864 57777777777777777754 454544 7788888655 45 Q ss_pred EEecCCCcceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCC Q lcl|NC_019540. 70 HMRVDKSFTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDD 149 (251) Q Consensus 70 ~l~~p~~v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd 149 (251) +..+|.+.-.| ++. |.+ +++ .-|--|+-|.+. T Consensus 77 ~I~~P~n~L~v-----~~~--~~~------------------------------~~v-~Rgg~lYD~~n~---------- 108 (195) T protein:vir:88 77 LIEYLSDYLRI-----TTS--GGT------------------------------VYV-NRGGYVYDRSTK---------- 108 (195) T ss_pred eEecCcceeEE-----eec--CCe------------------------------eEE-EeCCEEEeccCC---------- Confidence 77788876433 222 100 111 111122222211 Q ss_pred ceEEEeccccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCCh-------hHHHHH Q lcl|NC_019540. 150 DVIVFDSYDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQASQ-------KDEQES 222 (251) Q Consensus 150 ~~vVlDSYdk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~~-------K~Eq~a 222 (251) +-+| .++-+-.|-+. ..| -+||+.+.-+....|--+|...+ -.++ +.||+| T Consensus 109 -T~~F-----------~~pi~~~iv~~-~~F-------edlPe~~~~yI~~rAa~~f~~~~--~G~~~~~~~l~~~e~~A 166 (195) T protein:vir:88 109 -TDVY-----------TNDITVDLIRF-KTF-------SEMPECFRSYIVAKASRRFNIRF--FGAGEIEGSLQEQESEA 166 (195) T ss_pred -ceEe-----------CCceEEEEEee-cCh-------hhhhHHHHHHHHHHHHHHHHHhc--CCcHHHHHHHHHHHHHH Confidence 1122 12222222222 233 26888777666644444443333 3333 334443 Q ss_pred HHHHHHHhhccccccCCCCCCcCCcccCC Q lcl|NC_019540. 223 VKQQRWLSRKSWVVNGGIKYPNYGRRGRK 251 (251) Q Consensus 223 ~rq~~~~srk~~~vn~G~~~~NYGR~~~~ 251 (251) ++.-.-..-.-=..|==...|+=|+--.. T Consensus 167 ~~~~~e~e~~qg~~Nm~~~~~~~~~~~~r 195 (195) T protein:vir:88 167 WQQCQEYELDYGGFNMIDGDSYVGGIASR 195 (195) T ss_pred HHHHHHHHHhhCCcceeecCcccchhccC Confidence 33222110000000000011222222111 No 15 >protein:vir:99676 Length: 197 # NCBI annotation: Tail tubular protein A # Family: family:all:824 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249590;genbank:gi:68299741;genbank:GeneID:3799991 Probab=29.95 E-value=1.6 Score=19.44 Aligned_cols=165 Identities=16% Similarity=0.233 Sum_probs=85.2 Q ss_pred Cc--------hhHHHHHHHHHhhcCCccccchhhh--hHHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccce Q lcl|NC_019540. 1 MK--------PTLLEIVQEVLNDMDSDEVNSINDT--FESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPT 69 (251) Q Consensus 1 mk--------~TlL~IVq~~lsdmdsDeVnSI~Dt--~Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt 69 (251) |. -|-|+-|-.||..+|--.|+||+++ -+...+..|+..+=....+ ..|=|=. +.++|+|-.+..+ T Consensus 3 ~~~~~~~~~~~teLdAVN~~L~aIGesPV~sld~~~npdva~a~~iL~~v~~~vqs-~GW~FNte~~~~ltPd~~~~~-- 79 (197) T protein:vir:99 3 ISIYESNWQYQAELDAINDILASIGESPVNTLESDANADVVNARRILHKINRQEQS-KGWTFNIEEGATLVPDVYSQL-- 79 (197) T ss_pred cceeeccccccchhHHHHHHHHhhcccccccccCCCCccHHHHHHHHHHHHHHHhc-CCceeeecCCeeeeecCCCCe-- Confidence 22 4789999999999999999999963 6788888888888877776 5565554 6677777666322 Q ss_pred EEecCCCcceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCC Q lcl|NC_019540. 70 HMRVDKSFTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDD 149 (251) Q Consensus 70 ~l~~p~~v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd 149 (251) ..+|.+.-.| ... |.+ +.+ .-|--||-|.+. T Consensus 80 -I~~P~n~L~v-----~~~--~~~-----------------------------~~v--~Rgg~LYD~~n~---------- 110 (197) T protein:vir:99 80 -IPYMPNYLSV-----TTT--GGT-----------------------------PYV--NRGGYVYDRINK---------- 110 (197) T ss_pred -EEcCcceeee-----ecC--cCc-----------------------------eeE--EeCCeeEeccCC---------- Confidence 3355554332 111 100 111 111123322211 Q ss_pred ceEEEeccccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCCh-------hHHHHH Q lcl|NC_019540. 150 DVIVFDSYDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQASQ-------KDEQES 222 (251) Q Consensus 150 ~~vVlDSYdk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~~-------K~Eq~a 222 (251) +-+|+ ++-+-.|-+.+ .| | +||+.+.-+....|-.+|-. +.-.++ +.||+| T Consensus 111 -T~~F~-----------~pi~v~iv~~~-~F---e----elPe~~~~yI~~rAa~~f~~--~~~G~~~~~q~l~~~e~~a 168 (197) T protein:vir:99 111 -TDRFT-----------SPITVNLISLR-TF---D----EMPEQFKSYIVTKASKEFNI--RFFGAPEIDTVLGNELIDL 168 (197) T ss_pred -cEeeC-----------CceEEEEEEec-Ch---h----hccHHHHHHHHHHHHHHHHh--hccCchhHHHHHHHHHHHH Confidence 11221 22222232322 23 2 48887776666444444433 333333 344444 Q ss_pred HHHHHHHhhccccccCCCCCCcCCcccCC Q lcl|NC_019540. 223 VKQQRWLSRKSWVVNGGIKYPNYGRRGRK 251 (251) Q Consensus 223 ~rq~~~~srk~~~vn~G~~~~NYGR~~~~ 251 (251) +++-+-. .-+.|++.|- T Consensus 169 ~~~~~e~------------e~~qg~~Nml 185 (197) T protein:vir:99 169 ERAVNEY------------ELDYGAFNIF 185 (197) T ss_pred HHHHHHH------------HHhhCCccee Confidence 4332211 1233444443 No 16 >protein:vir:79050 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:6416 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110727;genbank:gi:134287344;genbank:GeneID:4955224 Probab=26.08 E-value=0.3 Score=23.45 Aligned_cols=99 Identities=14% Similarity=0.090 Sum_probs=40.0 Q ss_pred CchhHHHHHHHHHhhcCCccccchhhhhHHHH-HHHHHH-----------------HHHHHHhccCChhHhhcceeeecc Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSINDTFESAQ-VASICK-----------------TVFRNMTSNRNWPHMKRAVNLVPF 62 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~Dt~Es~q-~a~i~k-----------------~vy~nmi~nr~w~~~kr~iql~~~ 62 (251) |--.++|.|.+.|...+...-.+ ....+| +-+.++ ++.-.|+--.+.-+.+...++.-. T Consensus 1 ~~~~i~e~i~~~Lk~~~~~~~~~---d~~iL~fa~e~~~n~I~N~cNi~eiP~~L~~v~~~mai~~fl~~kk~~~~~~l~ 77 (133) T protein:vir:79 1 MGNNIIDDIEKRLESFGYILKDG---DKWLIDFVREKIENIIKLDCNIKTMPIELKEIEADMIVGEFLFTKKNMGQLDIE 77 (133) T ss_pred CCchHHHHHHHHHHHhCCCCCcc---chHHHHHHHHHHHHHHhhhcChhhcchhHHHHHHHHHHHHHHhcccccCCCCcc Confidence 99999999999998655433221 111122 112222 222333333222223322222211 Q ss_pred cCCccceEEecCCCccee----------------eEEeeeecccCCcceeeeeeec Q lcl|NC_019540. 63 SDNNYPTHMRVDKSFTEI----------------VFINYNTAKAGETRKYYKSMTW 102 (251) Q Consensus 63 sd~~~pt~l~~p~~v~ei----------------~fl~Y~~~k~ge~~~~yr~lky 102 (251) +-+--|.-=+|..-=++| .|++|=.+.-+.--+|||+|+| T Consensus 78 ~~D~~~~v~sIkeGDTsv~f~~~~~s~t~eq~l~s~i~~L~~~~k~~l~~yRkLrW 133 (133) T protein:vir:79 78 SINFEAVEKSISEGDTKVDFAIGSGSQTPEQRFDSLIAYLTAYGKNKILTFRCLRW 133 (133) T ss_pred cccchhhhhheecccceeecccCCCccchhHHHHHHHHHHhhcccchhhccccccC Confidence 110001000111110111 2333333321224579999999 No 17 >protein:vir:103305 Length: 245 # NCBI annotation: tail-like protein # Family: family:all:824 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039669;genbank:gi:125999998;genbank:GeneID:4818381 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=185 Identities=17% Similarity=0.180 Sum_probs=87.2 Q ss_pred CchhHHHHHHHHHhhcCCccccchhhhh-HHHHHHHHHHHHHHHHh--ccCChhHhh-cceeeecccCCccceEEecCCC Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSINDTF-ESAQVASICKTVFRNMT--SNRNWPHMK-RAVNLVPFSDNNYPTHMRVDKS 76 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~Dt~-Es~q~a~i~k~vy~nmi--~nr~w~~~k-r~iql~~~sd~~~pt~l~~p~~ 76 (251) -.-|.|+-|-.||..++--.|+||++++ |++-+..|+.++=+... ...-|=|=. +.+.|.|..+ .++-+|.+ T Consensus 20 ~~dteLdAVN~~L~aIGEsPV~sld~~npdva~A~~IL~~v~~~vQ~llseGW~FNte~~~~ltPd~~----g~i~iP~n 95 (245) T protein:vir:10 20 TVDTRLEAINLCLRAVGYASIESEDSGDLDAADASKILATVGQRVQYNGGKGWWFNVEPNWQMTPDAN----GEILIPNN 95 (245) T ss_pred cccchHHHHHHHHHhhcccccccccCCchhHHHHHHHHHHHHHHHHhhcCCCeeEeecCCceeccCCC----CceecCcc Confidence 3478999999999999999999999554 44444456666655533 345555544 5566766654 35666666 Q ss_pred cceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEec Q lcl|NC_019540. 77 FTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDS 156 (251) Q Consensus 77 v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDS 156 (251) +-.+ |.+.+.|+.. .+.+. -+| -|+-|.+. +-+ T Consensus 96 ~L~v----~~~~~~~~~~---------------------------~~~v~-RGg-kLYD~~n~-----------T~~--- 128 (245) T protein:vir:10 96 AIAA----WQDVRYDDKK---------------------------VLISI-RGR-KVYNMNTH-----------STD--- 128 (245) T ss_pred chhh----hcccccCCCc---------------------------cceEE-cCC-eeEecccC-----------cee--- Confidence 4322 2222122111 11111 111 12222110 111 Q ss_pred cccccchhcccccc-----eeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCC-----hhHHHHHHHH- Q lcl|NC_019540. 157 YDKEVDDTLQSSKT-----QCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQAS-----QKDEQESVKQ- 225 (251) Q Consensus 157 Ydk~vd~Tl~~skt-----q~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~-----~K~Eq~a~rq- 225 (251) .+++-+ +.-=+.+..| -+||+.+.-++...|-.+|-+.+=-.++ ++.||+|+++ T Consensus 129 --------F~~pv~~~~~~~v~iV~~~pF-------edlPe~~q~yI~~rAA~~f~~~~~G~~~~~q~l~q~e~~a~~~~ 193 (245) T protein:vir:10 129 --------FSNSLNREGFFRMTFMLNLPF-------EHMPVSARQAIAYQAAVEFMVSKEFDAQKVQIWQQLAQQMQIDM 193 (245) T ss_pred --------ccCccccccceeEEEEeeCCh-------hhccHHHHHHHHHHHHHHHHhhccCchhHHHHHHHHHHHHHHHH Confidence 111111 1111222334 2588888888887777666655432221 2344444432 Q ss_pred -------HHHHhhccccc-------cCCC---------CCCcCC---cccCC Q lcl|NC_019540. 226 -------QRWLSRKSWVV-------NGGI---------KYPNYG---RRGRK 251 (251) Q Consensus 226 -------~~~~srk~~~v-------n~G~---------~~~NYG---R~~~~ 251 (251) +.+=....|.. -||. +|.-|| |-|+. T Consensus 194 ~~~~~~q~~~Nm~~~~p~~~~~r~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (245) T protein:vir:10 194 GQESANQQSLNMFVNNPTQAHFGSMVGGPNANATFSRNPYNAYGGYSRYGRS 245 (245) T ss_pred HHHHHhhcCcccccCCchhhhcchhccccccccccccCCcccccccccccCC Confidence 22222223311 1222 233232 33333 No 18 >protein:vir:78741 Length: 197 # NCBI annotation: tail tube A # Family: family:all:824 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285449;genbank:gi:148724483;genbank:GeneID:5220212 Probab=23.79 E-value=2.3 Score=18.64 Aligned_cols=180 Identities=17% Similarity=0.134 Sum_probs=82.1 Q ss_pred CchhHHHHHHHHHhhcCCccccchhhhh-HHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccceEEecCCCcc Q lcl|NC_019540. 1 MKPTLLEIVQEVLNDMDSDEVNSINDTF-ESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPTHMRVDKSFT 78 (251) Q Consensus 1 mk~TlL~IVq~~lsdmdsDeVnSI~Dt~-Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt~l~~p~~v~ 78 (251) -++|.|+-|-.||..+|--.|+||++++ +...+-.|+.++=...-+ ..|=|=. +.+.|.|-++ -+..+|.+.- T Consensus 3 ~~~teLdAVN~~L~aIGEspV~sld~~npdva~a~~iL~~v~~~vqs-eGW~FNte~~~~l~pd~~----g~I~~P~n~L 77 (197) T protein:vir:78 3 SKLTKLGAVNIVLTNIGMAPVTLIDSNNPMVATAQTILDEVSGSVQS-EGWSYNTERAYPFIKDNT----GRIAIPSNVL 77 (197) T ss_pred cchhHHHHHHHHHHhhCCcccceeeCCCccHHHHHHHHHHHHHHHhh-CCceEeecCCceecCCCC----CeEecCccce Confidence 4679999999999999999999999654 344444455555555554 4444443 6777777443 5677777765 Q ss_pred eeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCCceEEEeccc Q lcl|NC_019540. 79 EIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDDDVIVFDSYD 158 (251) Q Consensus 79 ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd~~vVlDSYd 158 (251) .| ++... ..+ + ++- -|--||-+.+. +-+|+ T Consensus 78 ~v-----d~~~~--~~~---------------------------~-~v~-Rgg~LYD~~n~-----------T~~F~--- 107 (197) T protein:vir:78 78 SL-----DCAST--SKY---------------------------D-LII-RGGFLYDKAGH-----------TDVFT--- 107 (197) T ss_pred EE-----ecCCC--cee---------------------------e-EEE-eCCeEEeccCC-----------cEEeC--- Confidence 44 22211 110 0 001 11122222211 11221 Q ss_pred cccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhc-----cCChhHHHHHHHHHHHHh--h Q lcl|NC_019540. 159 KEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQ-----QASQKDEQESVKQQRWLS--R 231 (251) Q Consensus 159 k~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq-----~~~~K~Eq~a~rq~~~~s--r 231 (251) ++-+-.|-+. ..| -+||+.+.-+....|--++...+=- +--+..||+|+.+-.... . T Consensus 108 --------~pi~~~iv~~-~~F-------edlPe~~~~yI~~rAa~~f~~~~~G~~~~~q~l~~~e~~a~~~~~~~e~~q 171 (197) T protein:vir:78 108 --------ENLELDVVWC-FEF-------DDLPEAVKNYITIRAANLFAGRAVGSAEAVKYSQREEAAARAAIIEYETQQ 171 (197) T ss_pred --------CceEEEEEee-cCh-------hhhhHHHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhh Confidence 1112222222 233 2688877766665554444444311 111334444443322211 1 Q ss_pred ccccccCCCCCCcCCccc--CC Q lcl|NC_019540. 232 KSWVVNGGIKYPNYGRRG--RK 251 (251) Q Consensus 232 k~~~vn~G~~~~NYGR~~--~~ 251 (251) ..|-.-+|-..-.|-|.- .- T Consensus 172 ~~~Nml~~~~~~~~~~yrp~~~ 193 (197) T protein:vir:78 172 GDYNMLESESGRDIYTYRPFDA 193 (197) T ss_pred cCcCcccCccccCcCcccchhh Confidence 111111111111111100 00 No 19 >protein:vir:3365 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523336;swissprot:trembl:q8w5u4;genbank:gi:17570827;uniprot:Q8W5U4;genbank:GeneID:927450 Probab=23.44 E-value=2.3 Score=18.59 Aligned_cols=168 Identities=18% Similarity=0.247 Sum_probs=89.6 Q ss_pred Cc--------hhHHHHHHHHHhhcCCccccchhhh--hHHHHHHHHHHHHHHHHhccCChhHhh-cceeeecccCCccce Q lcl|NC_019540. 1 MK--------PTLLEIVQEVLNDMDSDEVNSINDT--FESAQVASICKTVFRNMTSNRNWPHMK-RAVNLVPFSDNNYPT 69 (251) Q Consensus 1 mk--------~TlL~IVq~~lsdmdsDeVnSI~Dt--~Es~q~a~i~k~vy~nmi~nr~w~~~k-r~iql~~~sd~~~pt 69 (251) |+ -|-|+-|-.||..+|--.|+||+++ -+...+-.|+.++=...-+. .|=|=. +.++|.|-++ .- T Consensus 1 ~~~~~~~~~~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vqse-GW~FNte~~~~ltPD~~---~g 76 (196) T protein:vir:33 1 MRSYEMNIETAEELSAVNDILASIGEPPVSTLEGDANADVANARRVLNKINRQIQSR-GWTFNIEEGVTLLPDAF---SG 76 (196) T ss_pred CCccccchhhhhhhHHHHHHHHhcCccccccccCCCCccHHHHHHHHHHHHHHHhhC-CceEeecCceeEeeeCC---CC Confidence 33 5889999999999999999999964 57777777777777777654 454544 7778888654 34 Q ss_pred EEecCCCcceeeEEeeeecccCCcceeeeeeeccCchHHHHHhhccCCCCCCceEEEecCceEEEEecCCCCeeeeccCC Q lcl|NC_019540. 70 HMRVDKSFTEIVFINYNTAKAGETRKYYKSMTWLEPDDFLRRVNKWDTDSDSVDVILDMSGVEVAVRNDRAPEYYTSFDD 149 (251) Q Consensus 70 ~l~~p~~v~ei~fl~Y~~~k~ge~~~~yr~lkyl~PdeFl~r~~~~n~~~d~v~~v~d~~gvel~irnd~aP~y~TSFDd 149 (251) +..+|.+.-.| +.. .|.+ .++--+| -||-+.+ T Consensus 77 ~I~vP~n~L~v-----~~~-~~~~------------------------------~~v~Rgg-~LYD~~n----------- 108 (196) T protein:vir:33 77 MIPFSSDYLSV-----MAT-SGQT------------------------------QYVNRGG-YLYDRSA----------- 108 (196) T ss_pred eEecCcceeEE-----ecC-CCce------------------------------eEEEcCC-eEEeccC----------- Confidence 67777775443 221 1100 0111111 2222221 Q ss_pred ceEEEeccccccchhcccccceeEEEEeeeeeeecccCCCCcHHHHHHHHHHHHHHHHHHHhccCC-----hhHHHHHHH Q lcl|NC_019540. 150 DVIVFDSYDKEVDDTLQSSKTQCVAYVLPEFYVIDDHKPDLPEEAMAAYISEVKSQAFIKLKQQAS-----QKDEQESVK 224 (251) Q Consensus 150 ~~vVlDSYdk~vd~Tl~~sktq~ia~~~p~F~~~dd~v~dip~~~f~~l~~EAks~af~~~kq~~~-----~K~Eq~a~r 224 (251) .+-+|++ +-+-.|-+ +..| -+||+.+.-+....|-.+|-..+=-.++ +..||+|+. T Consensus 109 ~T~~F~~-----------pi~v~iv~-~~~F-------edlPe~~~~yI~~rAa~~f~~~~~G~~~~~q~l~~ee~~a~~ 169 (196) T protein:vir:33 109 KTDRFPS-----------GVQVNLIR-LREF-------DEMPECFRNYIVTKASRQFNNRFFGAPEVDGVLQEEEQEAWS 169 (196) T ss_pred CcEEeCC-----------ceEEEEEe-ecCh-------hhhhHHHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHH Confidence 1112221 11122222 2233 2689988888777776666555432222 334444443 Q ss_pred HHHHHhhccccccCCCCCCcCCcccCC Q lcl|NC_019540. 225 QQRWLSRKSWVVNGGIKYPNYGRRGRK 251 (251) Q Consensus 225 q~~~~srk~~~vn~G~~~~NYGR~~~~ 251 (251) .-... .-++|++.|- T Consensus 170 ~~~~~------------e~~q~~~Nml 184 (196) T protein:vir:33 170 ACFEY------------ELDYGNYNML 184 (196) T ss_pred HHHHH------------HHhhCCccee Confidence 32211 1123333333 Done!