Query         lcl|NC_018276.1_cdsid_YP_006560633.1 [gene=B620_gp13] [protein=putative phage capsid protein] [protein_id=YP_006560633.1] [location=9737..10765]
Match_columns 342
No_of_seqs    4 out of 7
Neff          2.4
Searched_HMMs 1612
Date          Thu Nov 7 13:28:11 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:94494 Length: 274   92.9  0.0069 4.3E-06   32.4  10.5  258     1-300 1-274 (274)
  2 protein:vir:97433 Length: 274   92.9  0.0069 4.3E-06   32.4  10.5  258     1-300 1-274 (274)
  3 protein:vir:105334 Length: 276  92.3  0.0084 5.2E-06   32.0  10.1  242     1-330 1-276 (276)
  4 protein:vir:95898 Length: 274   91.9    0.01 6.4E-06   31.5  10.2  258     1-298 1-274 (274)
  5 protein:vir:96262 Length: 274   91.9    0.01 6.4E-06   31.5  10.2  258     1-298 1-274 (274)
  6 protein:vir:93742 Length: 274   91.7   0.012 7.1E-06   31.2  10.2  257     1-298 1-274 (274)
  7 protein:vir:3033 Length: 272 #  88.7   0.031 1.9E-05   28.9  11.3  258     1-328 1-272 (272)
  8 protein:vir:9820 Length: 272 #  88.7   0.031 1.9E-05   28.9  11.3  258     1-328 1-272 (272)
  9 protein:vir:1239 Length: 274 #  87.4   0.039 2.4E-05   28.3  11.0  259     1-329 1-274 (274)
 10 protein:vir:80930 Length: 278   87.1   0.041 2.5E-05   28.2  10.2  258     1-295 1-278 (278)
 11 protein:vir:96123 Length: 274   85.2   0.055 3.4E-05   27.5  12.5  261     1-332 1-274 (274)
 12 protein:vir:96833 Length: 275   84.1   0.063 3.9E-05   27.2  10.6  259     1-329 3-275 (275)
 13 protein:vir:94622 Length: 341   81.7   0.083 5.2E-05   26.5  12.8  303     1-327 3-341 (341)
 14 protein:vir:7990 Length: 273 #  77.6    0.12 7.6E-05   25.6  13.3  254     1-295 1-273 (273)
 15 protein:vir:3613 Length: 272 #  60.4    0.37 0.00023   22.9  10.2  258     1-324 1-272 (272)
 16 protein:vir:102605 Length: 273  60.2    0.38 0.00023   22.9  12.7  247     1-325 1-273 (273)
 17 protein:vir:105822 Length: 273  60.2    0.38 0.00023   22.9  12.7  247     1-325 1-273 (273)
 18 protein:vir:103955 Length: 324  49.8    0.63 0.00039   21.7  11.3  279     1-342 1-318 (324)
 19 protein:vir:9309 Length: 324 #  31.6     1.5 0.00092   19.6  10.1  263     1-342 1-324 (324)
 20 protein:vir:99749 Length: 324   20.1     2.8  0.0018   18.1  15.3  278     1-342 4-318 (324)

No 1
>protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758
Probab=92.93 E-value=0.0069 Score=32.45 Aligned_cols=258 Identities=14% Similarity=0.114 Sum_probs=118.7

Q ss_pred Ccccccc-CCcchhhhhhhhhhhHHHhhhcccchhhhccchhH--HHHHhhcCCceeEEEEeecccEeeccceeEEeccc
Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALDLFMQQTADPAGIISPELT--DKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342)
Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d~f~~Qt~~~~~mls~Es~--~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342)
|-+---| .++ -=|.-|+.+= ..|. ......++=.+ ..+....|..+++|++++-+....=+-..-+.++.
T Consensus 1 ma~~~T~~~d~----iiPev~~~~v--~~~~-~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~ 73 (274)
T protein:vir:94 1 MPQGLTKTSDQ----IIPEVLAPMM--QAQL-EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274)
T ss_pred CCccceehhhe----echHHHHHHH--HHhh-hhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccc
Confidence 3331111 111 0122233221 0110 01111111111 22344568899999987644333222223344444

Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc
Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342)
Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342)
-++... +.+-+...+||.++=-...- ++.+-+....+..-+.+-..+|+++++.|...++.|-.+.+.
| T Consensus 74 lt~~~~-~~~i~~~~~~~~i~D~~~~~---~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~-~------ 142 (274) T protein:vir:94 74 LETKKR-EAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK-L------ 142 (274) T ss_pred ccccee-EEEeeeecceecccHHHHHh---ccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccC-H------ Confidence 333332 22223344555544322111 222223344455557778899999999999888776554431 1 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-Eeeeeeeeeeccccc Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNE--TNKQI-QYADKVWHWSNEVAN 228 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~--tNlq~-Q~~N~~~~~sn~vat 228 (342) + .+-+....|-..+..+.. ++=+..+++.|+|.... ++++. +|-++ .|+++.++.||.+. T Consensus 143 --------d-~i~dA~~~l~d~~~~~~~-ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p- 211 (274) T protein:vir:94 143 --------N-GLQSAIDKFNDEDLEPMV-LFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE- 211 (274) T ss_pred --------H-HHHHHHHHhhccCCCceE-EEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCC- Confidence 1 222333445555666654 66688899999986422 22222 12223 68999999999985 Q ss_pred hhhhchhhhccccccchhhhhh--hhhhhccccccCcccccceecccccceeeeeeeee--hhcchhcchhhhhhe Q lcl|NC_018276. 229 AADRYATAFAVPQGTVGMLTRF--EREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYES--VGDFSGIAGAATADL 300 (342) Q Consensus 229 ~A~~~~~~Y~m~sG~vGm~~wi--dr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~--IaD~S~~ageatad~ 300 (342) ..+.|-.-+|++|+...- .-|...+-...+-. ...+ .-||.++++. 
+.=..+..++- .| T Consensus 212 ----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~---i~~~----~~y~~~~~~~~~vv~~t~~~~~~--~~ 274 (274) T protein:vir:94 212 ----AGTAILAKKGAVKLILKRDFFLEVARDASTKTTA---LYSD----KHYVAYLYDESKAVKITKGSGSL--EM 274 (274) T ss_pred ----cceEEEEeCcceEeeecCCceeccccchhhcccE---EEEE----EEEEEEEEcCCceEEEecCcccc--cC Confidence 356788889999976552 11211111111100 0000 1123333221 11112222222 22 No 2 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=92.93 E-value=0.0069 Score=32.45 Aligned_cols=258 Identities=14% Similarity=0.114 Sum_probs=118.7 Q ss_pred Ccccccc-CCcchhhhhhhhhhhHHHhhhcccchhhhccchhH--HHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALDLFMQQTADPAGIISPELT--DKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d~f~~Qt~~~~~mls~Es~--~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+---| .++ -=|.-|+.+= ..|. ......++=.+ ..+....|..+++|++++-+....=+-..-+.++. T Consensus 1 ma~~~T~~~d~----iiPev~~~~v--~~~~-~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~ 73 (274) T protein:vir:97 1 MPQGLTKTSDQ----IIPEVLAPMM--QAQL-EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCccceehhhe----echHHHHHHH--HHhh-hhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccc Confidence 3331111 111 0122233221 0110 01111111111 22344568899999987644333222223344444 Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342) Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342) -++... +.+-+...+||.++=-...- ++.+-+....+..-+.+-..+|+++++.|...++.|-.+.+. 
| T Consensus 74 lt~~~~-~~~i~~~~~~~~i~D~~~~~---~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~-~------ 142 (274) T protein:vir:97 74 LETKKR-EAKIRKIAKGTSITDEALLS---GYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK-L------ 142 (274) T ss_pred ccccee-EEEeeeecceecccHHHHHh---ccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccC-H------ Confidence 333332 22223344555544322111 222223344455557778899999999999888776554431 1 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-Eeeeeeeeeeccccc Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNE--TNKQI-QYADKVWHWSNEVAN 228 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~--tNlq~-Q~~N~~~~~sn~vat 228 (342) + .+-+....|-..+..+.. ++=+..+++.|+|.... ++++. +|-++ .|+++.++.||.+. T Consensus 143 --------d-~i~dA~~~l~d~~~~~~~-ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p- 211 (274) T protein:vir:97 143 --------N-GLQSAIDKFNDEDLEPMV-LFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE- 211 (274) T ss_pred --------H-HHHHHHHHhhccCCCceE-EEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCC- Confidence 1 222333445555666654 66688899999986422 22222 12223 68999999999985 Q ss_pred hhhhchhhhccccccchhhhhh--hhhhhccccccCcccccceecccccceeeeeeeee--hhcchhcchhhhhhe Q lcl|NC_018276. 229 AADRYATAFAVPQGTVGMLTRF--EREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYES--VGDFSGIAGAATADL 300 (342) Q Consensus 229 ~A~~~~~~Y~m~sG~vGm~~wi--dr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~--IaD~S~~ageatad~ 300 (342) ..+.|-.-+|++|+...- .-|...+-...+-. ...+ .-||.++++. 
+.=..+..++- .| T Consensus 212 ----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~---i~~~----~~y~~~~~~~~~vv~~t~~~~~~--~~ 274 (274) T protein:vir:97 212 ----AGTAILAKKGAVKLILKRDFFLEVARDASTKTTA---LYSD----KHYVAYLYDESKAVKITKGSGSL--EM 274 (274) T ss_pred ----cceEEEEeCcceEeeecCCceeccccchhhcccE---EEEE----EEEEEEEEcCCceEEEecCcccc--cC Confidence 356788889999976552 11211111111100 0000 1123333221 11112222222 22 No 3 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=92.27 E-value=0.0084 Score=31.97 Aligned_cols=242 Identities=18% Similarity=0.156 Sum_probs=111.5 Q ss_pred CccccccCCcchhhhhhhhhhhHHHhhhcccchhhhccchhH---------------------HHHHhhcCCceeEEEEe Q lcl|NC_018276. 1 MQNLRAKANLDKWELRPSRYGALDLFMQQTADPAGIISPELT---------------------DKAEASIGSTLQVPVID 59 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga~d~f~~Qt~~~~~mls~Es~---------------------~kl~aS~~~plkipVln 59 (342) |-+- +| .++.|+-||.- ..|....|..+++|..+ T Consensus 1 Ma~~------------------------~T-~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~ 55 (276) T protein:vir:10 1 MAQG------------------------TT-TKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFV 55 (276) T ss_pred CCcc------------------------ee-ehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeec Confidence 2110 22 22333333321 22334568889999987 Q ss_pred ecccEeeccceeEEeccccccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018276. 60 YDGTITIGNTRTVTIPDSENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDAN 139 (342) Q Consensus 60 y~a~~ti~na~~~~~~dsen~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~Lean 139 (342) +-+....-.-..-+.++. -...-.+.+-+.+.+||.++--...-..-+......+++-. .+-..+|.++++.|... 
T Consensus 56 ~igda~~~~eg~~i~~~~-lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~---~~a~~~d~~~~~~l~~~ 131 (276) T protein:vir:10 56 YSGDATVVPEGQKIPVDK-IETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGL---AIANKVDNDVLEALRGT 131 (276) T ss_pred CCCccccccCCCccCccc-cccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHH---HHHHHHHHHHHHHHhcc Confidence 754332111111222322 22222233334456677766433322222333333344433 35568899999888765 Q ss_pred cccccccchhhhcccCcceeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhcC------ccc--ccc Q lcl|NC_018276. 140 KTQVLNDTLGLYTFAGNVVTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEKGL------YNE--TNK 211 (342) Q Consensus 140 KTqV~~d~L~~yt~~~n~~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaqG~------~N~--tNl 211 (342) +-.+-.+.+ . -+.+-+.-.+|...++.+ ..++=+..++.+|+|+..... ++. .|- T Consensus 132 ~~~~~~~~~----------t------~d~i~~A~~~lgd~~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (276) T protein:vir:10 132 KLTVSADIG----------T------LAGLEAAIDTFDDEDLEP-MVLFINPKDAGKLRSSASDNFTRATELGDNIIVKG 194 (276) T ss_pred ccccccccc----------C------HHHHHHHHHHhccccCcc-cEEEEcHHHHHHHHHhccccccccccccccceecc Confidence 544433322 1 123345555666666655 466778999999999853332 111 122 Q ss_pred ee-Eeeeeeeeeeccccchhhhchhhhccccccchhhhh--hhhhhhccccccCcccccceecccccceeeeeeeee--h Q lcl|NC_018276. 212 QI-QYADKVWHWSNEVANAADRYATAFAVPQGTVGMLTR--FEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYES--V 286 (342) Q Consensus 212 q~-Q~~N~~~~~sn~vat~A~~~~~~Y~m~sG~vGm~~w--idr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~--I 286 (342) +| .|+++.+..++.+. ..+.|-.-.|++|+..- +.-|...+...++ . ....+ ..||+.+++. | T Consensus 195 ~ig~~~G~~Vi~s~~~p-----~~t~~l~~~gAi~~~~~~~~~vE~dRd~~~~~-d--~i~~~----~~y~~~~~~~~~v 262 (276) T protein:vir:10 195 AFGEALGAVIVRSKKLD-----EGEAILAKRGAVKLITKRDFFLETDRDPSTKT-T--ALYSD----KHYVAYLYDESKA 262 (276) T ss_pred ccceecceeEEEcCCCC-----cceEEEEeccceeeeecCCceeecccchhhcc-c--EEEEe----eEEEEEEEcCcce Confidence 22 57889999999874 35667777888886433 1111111111111 0 00000 1122333221 1 Q ss_pred hcchhcchhhhhheeeehhccccceeeeeEEEeecCCcccCCCc Q lcl|NC_018276. 
287 GDFSGIAGAATADLTRAKKEHYGFAVDVAFVVAENSDAANNPGA 330 (342) Q Consensus 287 aD~S~~ageatad~~~a~~~~Y~~~vd~a~vVAfnsd~a~~~~~ 330 (342) .=.....|+.++- + T Consensus 263 v~~t~~~~~~~~~------------------------------~ 276 (276) T protein:vir:10 263 VKVTKGAGTTDSG------------------------------A 276 (276) T ss_pred EEEecCCcCCcCC------------------------------C Confidence 1112222222211 1 No 4 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=91.92 E-value=0.01 Score=31.46 Aligned_cols=258 Identities=16% Similarity=0.105 Sum_probs=113.1 Q ss_pred Ccccccc-CCcchhhhhhhhhhhHHHhhhcccch--hhhccchhHHHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALDLFMQQTADP--AGIISPELTDKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d~f~~Qt~~~--~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+---| .++= =|.-|+.+=. ++-++.+ +.+.... ..+....|..++||++++-+....-....-+.++. T Consensus 1 m~~~~T~l~d~i----~Pev~~~~v~-~~~~~~l~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~ 73 (274) T protein:vir:95 1 MAQGMTKLTNQI----VPEVLAPMMQ-AELEKKLRFASFAEID--NTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDI 73 (274) T ss_pred CCcceeehhhee----chHHHHHHHH-HHHHhhhhccccceec--ccccCCCCCEEEeeeecCCCccccccCCCccchhh Confidence 2221111 1110 0112222100 0000110 0110000 22444568899999998754433222233444444 Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342) Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342) -+.+.. +..-+.+..+|.++=-.. 
..++.+-+....+..-+.+-..+|.+++++|...+-.|-.+.+ -|+ T Consensus 74 lt~~~~-~~~i~~~~~a~~i~D~~~---~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~-~~d----- 143 (274) T protein:vir:95 74 LETKKR-EAKIRKIAKGTSISDEAL---LSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADIT-KLT----- 143 (274) T ss_pred ccccee-EEEeeeeecceeehHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-CHH----- Confidence 333332 222233444555442221 1222223333344444556678999999988765544443333 111 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-Eeeeeeeeeeccccc Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNE--TNKQI-QYADKVWHWSNEVAN 228 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~--tNlq~-Q~~N~~~~~sn~vat 228 (342) .+-+....|-..+..++ .++=+..++..|+|.... .+++. .|-+| .|++..++.||.+. T Consensus 144 ----------~i~~A~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~- 211 (274) T protein:vir:95 144 ----------GLQTAIDKFNDEDLEPM-VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLE- 211 (274) T ss_pred ----------HHHHHHHHhcccccccc-EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCC- Confidence 12233344444555665 466788999999996421 11111 12233 58899999999874 Q ss_pred hhhhchhhhccccccchhhhh--hhhhhhccccccCcccccceecccccceeeeeeeee--hhcchhcchhhhh Q lcl|NC_018276. 229 AADRYATAFAVPQGTVGMLTR--FEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYES--VGDFSGIAGAATA 298 (342) Q Consensus 229 ~A~~~~~~Y~m~sG~vGm~~w--idr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~--IaD~S~~ageata 298 (342) ..+.|-.-+|++|.... +.-|...+...++ . ..+.+ ..||.++++. 
+-=..=-+|+-+- T Consensus 212 ----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~-d--~i~~~----~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 212 ----AGTAILAKKGAVKLITKRDFFLETDRDPSTKT-T--ALYSD----KHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----CceEEEEeccceeeeecCCccccccccccccc-C--EEEEe----EEEEEEEEcCCcEEEEEcCCccccC Confidence 35667777888887655 1222222211111 0 00011 2234444332 1111111222221 No 5 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=91.92 E-value=0.01 Score=31.46 Aligned_cols=258 Identities=16% Similarity=0.105 Sum_probs=113.1 Q ss_pred Ccccccc-CCcchhhhhhhhhhhHHHhhhcccch--hhhccchhHHHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALDLFMQQTADP--AGIISPELTDKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d~f~~Qt~~~--~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+---| .++= =|.-|+.+=. ++-++.+ +.+.... ..+....|..++||++++-+....-....-+.++. T Consensus 1 m~~~~T~l~d~i----~Pev~~~~v~-~~~~~~l~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQGMTKLTNQI----VPEVLAPMMQ-AELEKKLRFASFAEID--NTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDI 73 (274) T ss_pred CCcceeehhhee----chHHHHHHHH-HHHHhhhhccccceec--ccccCCCCCEEEeeeecCCCccccccCCCccchhh Confidence 2221111 1110 0112222100 0000110 0110000 22444568899999998754433222233444444 Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342) Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342) -+.+.. +..-+.+..+|.++=-.. 
..++.+-+....+..-+.+-..+|.+++++|...+-.|-.+.+ -|+ T Consensus 74 lt~~~~-~~~i~~~~~a~~i~D~~~---~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~-~~d----- 143 (274) T protein:vir:96 74 LETKKR-EAKIRKIAKGTSISDEAL---LSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADIT-KLT----- 143 (274) T ss_pred ccccee-EEEeeeeecceeehHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-CHH----- Confidence 333332 222233444555442221 1222223333344444556678999999988765544443333 111 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-Eeeeeeeeeeccccc Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNE--TNKQI-QYADKVWHWSNEVAN 228 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~--tNlq~-Q~~N~~~~~sn~vat 228 (342) .+-+....|-..+..++ .++=+..++..|+|.... .+++. .|-+| .|++..++.||.+. T Consensus 144 ----------~i~~A~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~- 211 (274) T protein:vir:96 144 ----------GLQTAIDKFNDEDLEPM-VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLE- 211 (274) T ss_pred ----------HHHHHHHHhcccccccc-EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCC- Confidence 12233344444555665 466788999999996421 11111 12233 58899999999874 Q ss_pred hhhhchhhhccccccchhhhh--hhhhhhccccccCcccccceecccccceeeeeeeee--hhcchhcchhhhh Q lcl|NC_018276. 229 AADRYATAFAVPQGTVGMLTR--FEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYES--VGDFSGIAGAATA 298 (342) Q Consensus 229 ~A~~~~~~Y~m~sG~vGm~~w--idr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~--IaD~S~~ageata 298 (342) ..+.|-.-+|++|.... +.-|...+...++ . ..+.+ ..||.++++. 
+-=..=-+|+-+- T Consensus 212 ----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~-d--~i~~~----~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 212 ----AGTAILAKKGAVKLITKRDFFLETDRDPSTKT-T--ALYSD----KHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----CceEEEEeccceeeeecCCccccccccccccc-C--EEEEe----EEEEEEEEcCCcEEEEEcCCccccC Confidence 35667777888887655 1222222211111 0 00011 2234444332 1111111222221 No 6 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=91.69 E-value=0.012 Score=31.21 Aligned_cols=257 Identities=14% Similarity=0.106 Sum_probs=118.3 Q ss_pred Ccccccc-CCcchhhhhhhhhhhHHHhhhcccchhhhccchhH--HHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALDLFMQQTADPAGIISPELT--DKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d~f~~Qt~~~~~mls~Es~--~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+---| .++ -=|.-|+.+= ..|.. ....+++=.+ ..+....|.+++||++++-+...--+...-+.++. T Consensus 1 ma~~~T~~~~~----iiPev~~~~v--~~~~~-~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~ 73 (274) T protein:vir:93 1 MPQGITKTSNQ----IIPEVLAPMM--QAQLE-KKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCccceehhhe----echHHHHHHH--HHHHH-hhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccc Confidence 3221111 111 1122233221 01100 0001111111 12344468899999987643322212222233333 Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342) Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342) -++ .-.+.+.+.+.++|.++=-...=...+..+ ...+..-+.+-..+|++++++|...+..|-.+.+. 
T Consensus 74 it~-~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~---~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~-------- 141 (274) T protein:vir:93 74 LET-KKREAKIRKIAKGTSITDEALLSGYGDPQG---EQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK-------- 141 (274) T ss_pred ccc-ceeEEEeeeecccccccHHHHHhhccchHH---HHHHHHHHHHHHHHHHHHHHHHhcccccccccccC-------- Confidence 232 223444455666666554322222222333 34455557777899999999998777655443331 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCcccc--ccee-Eeeeeeeeeeccccc Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNET--NKQI-QYADKVWHWSNEVAN 228 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~t--Nlq~-Q~~N~~~~~sn~vat 228 (342) -+. +-+....|-.++..++ .++=+..++..|+|.... ++++.. |-++ .|+++.++.||.+. T Consensus 142 -------~d~-i~dA~~~l~d~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p- 211 (274) T protein:vir:93 142 -------LNG-LQSAIDKFNDEDLEPM-VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE- 211 (274) T ss_pred -------HHH-HHHHHHHhhhccCCcc-EEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCC- Confidence 112 2233344445666666 467788999999986422 222221 2222 68999999999975 Q ss_pred hhhhchhhhccccccchhhhhh--hhhhhccccccCcccccceeccccc-ceeeeeeee--ehhcchhcchhhhh Q lcl|NC_018276. 229 AADRYATAFAVPQGTVGMLTRF--EREALLRSRSRTGHEWDIDTLPMLE-MPVGTYYYE--SVGDFSGIAGAATA 298 (342) Q Consensus 229 ~A~~~~~~Y~m~sG~vGm~~wi--dr~a~~~~~~~tG~~~t~~~dP~~g-~~~g~~~~~--~IaD~S~~ageata 298 (342) ..+.|..-.|++|+...- .-|.+.+...++ .. 
+.+ .-||..+.+ .+.=....+|+-+- T Consensus 212 ----~~t~~l~~~gai~~~~~~~~~vE~~Rd~~~~~-----d~---i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 212 ----AGTAILAKKGAVKLILKRDFFLEVARDASTKT-----TA---LYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ----cceEEEEeCCeEEEEecCCcccccccchhhcc-----cE---EEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 357788889999976541 112221111111 10 000 112222221 22222222333222 No 7 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=258 Identities=17% Similarity=0.135 Sum_probs=115.3 Q ss_pred CccccccCCcchhhhhhhhhhhHHHhhhcccchhhhccchhH--HHHHhhcCCceeEEEEeecccEeeccceeEEecccc Q lcl|NC_018276. 1 MQNLRAKANLDKWELRPSRYGALDLFMQQTADPAGIISPELT--DKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDSE 78 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga~d~f~~Qt~~~~~mls~Es~--~kl~aS~~~plkipVlny~a~~ti~na~~~~~~dse 78 (342) |-+.--+ ..-..=|..|+.+ +..+... .+.+++=.+ ..+....|..++||+++..+....-+--. .+|.++ T Consensus 1 MA~~~T~---~~~~~iPev~s~~--v~~~~~~-~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~-~i~~~~ 73 (272) T protein:vir:30 1 MAVGTTK---MAQMLDPEVLADM--IDAEVGK-AIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGE-AIPMTQ 73 (272) T ss_pred CCCcccc---chheechHHHHHH--HHHHHHH-HhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCC-cccccc Confidence 2211000 0001122233321 0000000 000000000 11223346679999997654443222222 233333 Q ss_pred ccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcce Q lcl|NC_018276. 79 NNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNVV 158 (342) Q Consensus 79 n~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~~ 158 (342) -...-++...+..+..|.++--...-.-.+..+...+++-.+ +...+|++.++.|..-.+.+- ... 
T Consensus 74 ~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~---~a~~~d~~i~~~~~~a~~~~~-~~~---------- 139 (272) T protein:vir:30 74 LGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEA---IDHKVDADVLDALSKSTQTVE-ATA---------- 139 (272) T ss_pred cccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHH---HHHHHHHHHHHHhcccccccc-ccc---------- Confidence 334445666666666677665544444445555555555444 445788888887754332221 111 Q ss_pred eecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhc--Ccccc------ccee-Eeeeeeeeeeccccch Q lcl|NC_018276. 159 TADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEKG--LYNET------NKQI-QYADKVWHWSNEVANA 229 (342) Q Consensus 159 q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaqG--~~N~t------Nlq~-Q~~N~~~~~sn~vat~ 229 (342) . -+.+-+....|-.++... -.++-+..++.+|+|.+..- .+++. |-++ .+++..++.||.|. T Consensus 140 ----t--~d~i~da~~~l~~~~~~~-~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p-- 210 (272) T protein:vir:30 140 ----T--VDGVSKALDIFNDEDDAE-TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCP-- 210 (272) T ss_pred ----C--HHHHHHHHHHHhccCCCc-cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCC-- Confidence 1 122344445565565554 46788889999999875321 11111 1111 57889999999985 Q ss_pred hhhchhhhccccccchhhhhhh--hhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhhhheeeehhcc Q lcl|NC_018276. 230 ADRYATAFAVPQGTVGMLTRFE--REALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAATADLTRAKKEH 307 (342) Q Consensus 230 A~~~~~~Y~m~sG~vGm~~wid--r~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageatad~~~a~~~~ 307 (342) -.+.|..-.|.+|+..+-+ -|...+...+ +| .+-..++ T Consensus 211 ---~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~------------------------------------~~-~i~~~~~ 250 (272) T protein:vir:30 211 ---KGTAYMVRKGALRIMLKRNTMVETDRDITKA------------------------------------IN-QIVANKH 250 (272) T ss_pred ---cceEEEEcCCeEEEEecCCceeeeccccccc------------------------------------ee-EEEEEEE Confidence 3456777888888765521 1111110000 01 1111222 Q ss_pred cccee-eeeEEEeecCCcccCC Q lcl|NC_018276. 
308 YGFAV-DVAFVVAENSDAANNP 328 (342) Q Consensus 308 Y~~~v-d~a~vVAfnsd~a~~~ 328 (342) ||+.+ +..=||...-+||--. T Consensus 251 ~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 251 YGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEEEEEcCCceEEEEecccccC Confidence 33222 1111222233333222 No 8 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=258 Identities=17% Similarity=0.135 Sum_probs=115.3 Q ss_pred CccccccCCcchhhhhhhhhhhHHHhhhcccchhhhccchhH--HHHHhhcCCceeEEEEeecccEeeccceeEEecccc Q lcl|NC_018276. 1 MQNLRAKANLDKWELRPSRYGALDLFMQQTADPAGIISPELT--DKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDSE 78 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga~d~f~~Qt~~~~~mls~Es~--~kl~aS~~~plkipVlny~a~~ti~na~~~~~~dse 78 (342) |-+.--+ ..-..=|..|+.+ +..+... .+.+++=.+ ..+....|..++||+++..+....-+--. .+|.++ T Consensus 1 MA~~~T~---~~~~~iPev~s~~--v~~~~~~-~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~-~i~~~~ 73 (272) T protein:vir:98 1 MAVGTTK---MAQMLDPEVLADM--IDAEVGK-AIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGE-AIPMTQ 73 (272) T ss_pred CCCcccc---chheechHHHHHH--HHHHHHH-HhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCC-cccccc Confidence 2211000 0001122233321 0000000 000000000 11223346679999997654443222222 233333 Q ss_pred ccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcce Q lcl|NC_018276. 79 NNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNVV 158 (342) Q Consensus 79 n~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~~ 158 (342) -...-++...+..+..|.++--...-.-.+..+...+++-.+ +...+|++.++.|..-.+.+- ... 
T Consensus 74 ~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~---~a~~~d~~i~~~~~~a~~~~~-~~~---------- 139 (272) T protein:vir:98 74 LGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEA---IDHKVDADVLDALSKSTQTVE-ATA---------- 139 (272) T ss_pred cccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHH---HHHHHHHHHHHHhcccccccc-ccc---------- Confidence 334445666666666677665544444445555555555444 445788888887754332221 111 Q ss_pred eecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhc--Ccccc------ccee-Eeeeeeeeeeccccch Q lcl|NC_018276. 159 TADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEKG--LYNET------NKQI-QYADKVWHWSNEVANA 229 (342) Q Consensus 159 q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaqG--~~N~t------Nlq~-Q~~N~~~~~sn~vat~ 229 (342) . -+.+-+....|-.++... -.++-+..++.+|+|.+..- .+++. |-++ .+++..++.||.|. T Consensus 140 ----t--~d~i~da~~~l~~~~~~~-~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p-- 210 (272) T protein:vir:98 140 ----T--VDGVSKALDIFNDEDDAE-TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCP-- 210 (272) T ss_pred ----C--HHHHHHHHHHHhccCCCc-cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCC-- Confidence 1 122344445565565554 46788889999999875321 11111 1111 57889999999985 Q ss_pred hhhchhhhccccccchhhhhhh--hhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhhhheeeehhcc Q lcl|NC_018276. 230 ADRYATAFAVPQGTVGMLTRFE--REALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAATADLTRAKKEH 307 (342) Q Consensus 230 A~~~~~~Y~m~sG~vGm~~wid--r~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageatad~~~a~~~~ 307 (342) -.+.|..-.|.+|+..+-+ -|...+...+ +| .+-..++ T Consensus 211 ---~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~------------------------------------~~-~i~~~~~ 250 (272) T protein:vir:98 211 ---KGTAYMVRKGALRIMLKRNTMVETDRDITKA------------------------------------IN-QIVANKH 250 (272) T ss_pred ---cceEEEEcCCeEEEEecCCceeeeccccccc------------------------------------ee-EEEEEEE Confidence 3456777888888765521 1111110000 01 1111222 Q ss_pred cccee-eeeEEEeecCCcccCC Q lcl|NC_018276. 
308 YGFAV-DVAFVVAENSDAANNP 328 (342) Q Consensus 308 Y~~~v-d~a~vVAfnsd~a~~~ 328 (342) ||+.+ +..=||...-+||--. T Consensus 251 ~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 251 YGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEEEEEcCCceEEEEecccccC Confidence 33222 1111222233333222 No 9 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=87.40 E-value=0.039 Score=28.31 Aligned_cols=259 Identities=15% Similarity=0.109 Sum_probs=106.8 Q ss_pred Ccccccc-CCcchhhhhhhhhhhHHHhhhcccch--hhhccchhHHHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALDLFMQQTADP--AGIISPELTDKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d~f~~Qt~~~--~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+---| .++ -=|.-|+.+=. ++-.+.+ +.+.... ..+....|..++||.+++-+....-....-+.++. T Consensus 1 ma~~~T~l~d~----iiPev~~~~v~-~~~~~~l~~~~~~~~d--~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~ 73 (274) T protein:vir:12 1 MAQGLTKTSNQ----IIPEVLAPMMQ-AQLEKKLRFASFAEVD--STLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCcceeehhhh----hchHHHHHHHH-HHHHhhhhhcccceec--ccccCCCCCEEEEeeecCCCccccccCCCccchhh Confidence 2221100 110 01112222210 0000000 0111000 22334468889999987643322222222333333 Q ss_pred cccce-eeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCc Q lcl|NC_018276. 78 ENNSR-LVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGN 156 (342) Q Consensus 78 en~~r-~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n 156 (342) =+.+. -.++-.+ ..+|.++=-... 
.++.+-+....+..-+.+-..+|.+.++.|.+.+..+-.+.+ - T Consensus 74 lt~~~~~~~i~~~--~~~~~i~D~~~~---~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~-~------ 141 (274) T protein:vir:12 74 LETKKREAKIRKI--AKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADIT-K------ 141 (274) T ss_pred cccceeeEEeeee--cceeeecHHHHH---hcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-C------ Confidence 22222 2233233 334444331111 111122222333333556678999999998776555543332 1 Q ss_pred ceeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-Eeeeeeeeeecccc Q lcl|NC_018276. 157 VVTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNE--TNKQI-QYADKVWHWSNEVA 227 (342) Q Consensus 157 ~~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~--tNlq~-Q~~N~~~~~sn~va 227 (342) -+.+-+...+|-.++..++ .++=+..++..|+|.... .++++ .|-+| .|++..+..||.+. T Consensus 142 ---------~d~i~dA~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p 211 (274) T protein:vir:12 142 ---------LNGLQSAIDKFNDEDLEPM-VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLE 211 (274) T ss_pred ---------HHHHHHHHHHhcccccccc-EEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCC Confidence 1223334444555556665 467788899999986421 11111 12222 48999999999875 Q ss_pred chhhhchhhhccccccchhhhh--hhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhhhheeeehh Q lcl|NC_018276. 228 NAADRYATAFAVPQGTVGMLTR--FEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAATADLTRAKK 305 (342) Q Consensus 228 t~A~~~~~~Y~m~sG~vGm~~w--idr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageatad~~~a~~ 305 (342) ..+.|-.-.|++|+..- +.-|...+...++ . ..+.+ ..||.++++ T Consensus 212 -----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~-d--~i~~~----~~y~~~~~~--------------------- 258 (274) T protein:vir:12 212 -----AGTAILAKKGAVKLILKRDFFLEVARDASTKT-T--ALYSD----KHYVAYLYD--------------------- 258 (274) T ss_pred -----cceEEEEeccceeeeecCCceeccccchhhcc-c--EEEee----eEEEEEEEc--------------------- Confidence 34667777888886543 1111111111111 0 00000 112333322 Q ss_pred ccccceeeeeEEEeecCCcccCCC Q lcl|NC_018276. 
306 EHYGFAVDVAFVVAENSDAANNPG 329 (342) Q Consensus 306 ~~Y~~~vd~a~vVAfnsd~a~~~~ 329 (342) ..=||..+-..+...- T Consensus 259 --------~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 259 --------ESKAVKITKGSGSLEM 274 (274) T ss_pred --------CCceEEEEcCCccccC Confidence 2222222221111110 No 10 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=87.12 E-value=0.041 Score=28.20 Aligned_cols=258 Identities=13% Similarity=0.087 Sum_probs=107.5 Q ss_pred Ccccccc-CCcchhhhhhhhhhhHH--HhhhcccchhhhccchhH--HHHHhhcCCceeEEEEeecccEe-eccceeEEe Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGALD--LFMQQTADPAGIISPELT--DKAEASIGSTLQVPVIDYDGTIT-IGNTRTVTI 74 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~d--~f~~Qt~~~~~mls~Es~--~kl~aS~~~plkipVlny~a~~t-i~na~~~~~ 74 (342) |-+.--| .++ -=|..|+.+= -|. ..+.+++=.. ..+....|..++||.+++-+... +.. ..-+. T Consensus 1 Ma~~~T~~~~~----iiPev~s~~v~~~~~-----~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~-g~~i~ 70 (278) T protein:vir:80 1 MADLTTKLANL----IDPEVMGPMISAKLP-----KAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE-GAAID 70 (278) T ss_pred CCCcceehhhe----ecHHHHHHHHHHHHH-----HhhhhcccceecccccCCCCCEEEEeeeccCCcceeecC-CCcCc Confidence 3321000 010 0111222211 010 0000000000 22334468889999987644322 221 12233 Q ss_pred ccccccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhccc Q lcl|NC_018276. 75 PDSENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFA 154 (342) Q Consensus 75 ~dsen~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~ 154 (342) ++.-++. -.+...+....+|.+.=-...=...+..++. .+..=+.+...+|+++++.|...+..+-+..- .+.. 
T Consensus 71 ~~~lt~~-~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~---~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t--~~~~ 144 (278) T protein:vir:80 71 YSALETE-SVKHGIKKAGKGVKLTDESVLSGYGDPVEEA---QKQIRMAIASKVDNDILEEALTTTLEVKGAIN--IGLI 144 (278) T ss_pred ccccccc-eeeEeeehhhccccccHHHHhhccccHHHHH---HHHHHHHHHHHHHHHHHHHHhccccccccccc--cchh Confidence 3322222 2222223222333433222111222333333 33344455678889999999876655433221 0111 Q ss_pred CcceeecccchhHHHhcchHHHh-ccCcccceEEeecchhHHHHHHHHHhcC------ccc--cccee-Eeeeeeeeeec Q lcl|NC_018276. 155 GNVVTADFAKREEIIGDINPMQA-SNDVYAPLHVVGNTGVESAVRKLAEKGL------YNE--TNKQI-QYADKVWHWSN 224 (342) Q Consensus 155 ~n~~q~p~aqR~e~~~~l~a~~r-~ndy~g~l~~iadt~~~s~l~kleaqG~------~N~--tNlq~-Q~~N~~~~~sn 224 (342) .+ -.+.+.+.--.|- .+.....+ ++=+..++..|+|...... +++ +|-++ .|++..++.|| T Consensus 145 ~~--------~~~~~~da~~~l~~~~~~~~~~-ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~ 215 (278) T protein:vir:80 145 DK--------IENTFTDAPDAIEDESITTTGV-LFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTK 215 (278) T ss_pred hh--------HHHHHHHHHHhhcccCCCcccE-EEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcC Confidence 11 1123333332333 33333333 4447889999998864322 222 12233 68999999999 Q ss_pred cccchhhhchhhhccccccchhhhhh--hhhhhccccccCcccccceecccccceeeeeeeee--hhcchhcchh Q lcl|NC_018276. 225 EVANAADRYATAFAVPQGTVGMLTRF--EREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYES--VGDFSGIAGA 295 (342) Q Consensus 225 ~vat~A~~~~~~Y~m~sG~vGm~~wi--dr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~--IaD~S~~age 295 (342) .+.. ++.|-+-.|++|....- .-|...+-...+ ...+.+ .-||.++.+. 
+--.+--||- T Consensus 216 ~~p~-----~t~~l~~~gAi~~~~~~~~~vE~~Rd~~~~~---d~i~~~----~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 216 KLAD-----GNALAVKAGALKTFLKRNLLAESGRDMDHKL---TKFNAD----QHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred CCCc-----ceEEEEeccceeeeecCCcccccccchhhcc---ceeeee----eEEEEEEEcCcceEEEeeccCC Confidence 9853 57788889999865441 112111111111 011111 1234444322 2222222222 No 11 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=85.16 E-value=0.055 Score=27.49 Aligned_cols=261 Identities=16% Similarity=0.112 Sum_probs=120.4 Q ss_pred Ccccccc-CCcchhhhhhhhhhhH--HHhhhcccchhhhccchhHHHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGAL--DLFMQQTADPAGIISPELTDKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~--d~f~~Qt~~~~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+.--| +++= =|.-|+.+ +-+. +..-.+++.... ..+..+.|..++||++++.+....-+..+-+.+++ T Consensus 1 ma~~~T~~~d~i----~Pev~s~~v~~~~~-~~~~~~~~~~~~--~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQGTTKVSNLI----VPEVLAPMMQAELD-KKLRFAQFADID--STLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) T ss_pred CCccccchhhhh----hhHHHHHHHHHHHH-hhhhhccccccc--ccccCCCCCEEEEEeeccCCCccccCCCCcCchhh Confidence 3322111 1111 12222221 1110 000011111111 12334458889999997654433222233344443 Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342) Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342) -++.. .+.+.+.+..+|.++=-.. 
..+..+-+....+..-+.+...+|+++++.|..-+..+-.+.+ T Consensus 74 it~~~-~~~~i~~~~~~~~i~D~~~---~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~--------- 140 (274) T protein:vir:96 74 IGTSK-REAKVRKIGKGTELTDEAV---LSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADIT--------- 140 (274) T ss_pred cccce-eEEEEEeeeceeeecHHHH---HhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccc--------- Confidence 33332 3333344445555542221 1122233334445555667788999999998765554433332 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCcccc--ccee-Eeeeeeeeeeccccc Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK------GLYNET--NKQI-QYADKVWHWSNEVAN 228 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq------G~~N~t--Nlq~-Q~~N~~~~~sn~vat 228 (342) .=+ .+-+....|-.+++.++. ++=+..++..|+|.... +.+|+. |-++ .|+++.+..||.+. T Consensus 141 ------~~d-~i~dA~~~l~d~~~~~~~-ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p- 211 (274) T protein:vir:96 141 ------KLD-GLQTAIDKFNDEDLEPMV-LFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLN- 211 (274) T ss_pred ------cHH-HHHHHHHHhcccCCCceE-EEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCC- Confidence 112 222333445556666665 67788999999997532 222221 1122 47889999999974 Q ss_pred hhhhchhhhccccccchhhhhhhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhhhheeeehhccc Q lcl|NC_018276. 229 AADRYATAFAVPQGTVGMLTRFEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAATADLTRAKKEHY 308 (342) Q Consensus 229 ~A~~~~~~Y~m~sG~vGm~~widr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageatad~~~a~~~~Y 308 (342) ..+.|-+-.|++|....-+=.-+ +.-||.. -+|.+.. .++| T Consensus 212 ----~~t~~l~~~gA~~~~~~~~~~vE------------~~Rd~~~----------------------~~d~i~~-~~~y 252 (274) T protein:vir:96 212 ----KGEALLAKKGAVKLITKRDFFLE------------KDRDASR----------------------KSTALYS-DKHY 252 (274) T ss_pred ----cceEEEEeCcceeeeecCCcccc------------cccchhh----------------------cccEEEE-eeEE Confidence 34667777888876444210000 0011110 1122222 2345 Q ss_pred ccee-eeeEEEeecCCcccCCCceE Q lcl|NC_018276. 
309 GFAV-DVAFVVAENSDAANNPGAIL 332 (342) Q Consensus 309 ~~~v-d~a~vVAfnsd~a~~~~~i~ 332 (342) |..+ |-.=||......|+- +| T Consensus 253 g~~~~~~~~vv~~t~~~~~~---~~ 274 (274) T protein:vir:96 253 VAYLYDESKVVKITKGAGDE---VM 274 (274) T ss_pred EEEEEcCccEEEEEcCcccc---cC Confidence 5443 444455555555442 22 No 12 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=84.15 E-value=0.063 Score=27.18 Aligned_cols=259 Identities=14% Similarity=0.117 Sum_probs=111.7 Q ss_pred CccccccCCcchhhhhhhhhhhH--HHhhhcccchhhhccchhHHHHHhhcCCceeEEEEeecccEeeccceeEEecccc Q lcl|NC_018276. 1 MQNLRAKANLDKWELRPSRYGAL--DLFMQQTADPAGIISPELTDKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDSE 78 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga~--d~f~~Qt~~~~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~na~~~~~~dse 78 (342) |-+---.+++ -=|.-|+.+ +-+ .+..-.+.+.... ..+....|..++||++++-+....-+-..-+.++.- T Consensus 3 ~~~~T~l~d~----i~PEv~~~~v~~~~-~~~~~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~l 75 (275) T protein:vir:96 3 LENMTKLANM----VNPEVLAPMMQAEL-DKKLKFAQFADID--NTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLI 75 (275) T ss_pred Ccccchhhhh----hchHHHHHHHHHHH-HHhhhhcccceec--ccccCCCCCEEEeeeeccCCccccccCCCCcchhhc Confidence 2111000111 112222222 111 0111111111100 223445688899999987443332222233334332 Q ss_pred ccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcce Q lcl|NC_018276. 79 NNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNVV 158 (342) Q Consensus 79 n~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~~ 158 (342) ++.. 
.+..-+.+.++|.++--...=...+--++..++ .=+.+-..+|.++++.|..-.-.+-.+.+ T Consensus 76 t~~~-~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~---~a~~~a~~~d~~ll~~l~~a~~~~~~~~~---------- 141 (275) T protein:vir:96 76 ETKK-RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQ---HGLAIANKVDNDVLEALQGATLKVEADIT---------- 141 (275) T ss_pred ccce-eeEEeehhcccccccHHHHHhhccchHHHHHHH---HHHHHHHHHHHHHHHHHhccccccccccc---------- Confidence 2222 233334456666654322211111222222333 33345578899999888654322211111 Q ss_pred eecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHH----hcC--ccc--cccee-Eeeeeeeeeeccccch Q lcl|NC_018276. 159 TADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAE----KGL--YNE--TNKQI-QYADKVWHWSNEVANA 229 (342) Q Consensus 159 q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~klea----qG~--~N~--tNlq~-Q~~N~~~~~sn~vat~ 229 (342) .- +.+-+....|-..+..++ .++=+..+++.|+|... +.. ++. +|-+| .|+++.+..||.+. T Consensus 142 -----~~-d~i~dA~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p-- 212 (275) T protein:vir:96 142 -----KL-AGLQTAIDKFNDEDLEPM-VLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIK-- 212 (275) T ss_pred -----CH-HHHHHHHHHhccccCCcc-EEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCC-- Confidence 11 223344455555556665 46778999999999852 122 221 12233 58999999999874 Q ss_pred hhhchhhhccccccchhhhhh--hhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhhhheeeehhcc Q lcl|NC_018276. 230 ADRYATAFAVPQGTVGMLTRF--EREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAATADLTRAKKEH 307 (342) Q Consensus 230 A~~~~~~Y~m~sG~vGm~~wi--dr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageatad~~~a~~~~ 307 (342) ..+.|-.-.|++|+...- .-|...+... -+|.+.. .+| T Consensus 213 ---~~t~~i~~~gA~~~~~~~~~~vE~~Rd~~~------------------------------------~~d~i~~-~~~ 252 (275) T protein:vir:96 213 ---EGEAILAKRGAVKLITKRDFFLETERHASH------------------------------------KSTALFS-DKH 252 (275) T ss_pred ---cceEEEEeccceeeeecCCcccccccchhh------------------------------------cCcEEEE-eEE Confidence 246677777777765441 1111111111 1122222 233 Q ss_pred ccce-eeeeEEEeecCCcccCCC Q lcl|NC_018276. 
308 YGFA-VDVAFVVAENSDAANNPG 329 (342) Q Consensus 308 Y~~~-vd~a~vVAfnsd~a~~~~ 329 (342) ||.. ++-.=||...-+|++.-- T Consensus 253 y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 253 YVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred EEEEEEcCccEEEEEecccccCC Confidence 4422 122222222223333221 No 13 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=81.70 E-value=0.083 Score=26.50 Aligned_cols=303 Identities=13% Similarity=0.047 Sum_probs=134.1 Q ss_pred Cccccc-----cCCcchhhhhhhhhhh--HHHhhhcccchhhhccchhHHH-HHhhcCCceeEEEEeecccEeeccceeE Q lcl|NC_018276. 1 MQNLRA-----KANLDKWELRPSRYGA--LDLFMQQTADPAGIISPELTDK-AEASIGSTLQVPVIDYDGTITIGNTRTV 72 (342) Q Consensus 1 ~q~~r~-----k~n~dk~~~r~s~~ga--~d~f~~Qt~~~~~mls~Es~~k-l~aS~~~plkipVlny~a~~ti~na~~~ 72 (342) |-|=-- -++..-|- |..|+. .+.|.....-... . ++. .+...|.+++||++. +.+..-.++..- T Consensus 3 ~~~~~~~~~~~t~~v~~fi--pei~s~~i~~~l~~~~v~~~~-~----~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~ 74 (341) T protein:vir:94 3 LGNTITGPSINTQRGQQFI--PEQWLSEVQMFRKAKMLDTSV-V----KTWGAQVKKGDTFHVPRIS-ELGVEDKATDVP 74 (341) T ss_pred chhhhccccccchhHHHHH--HHHHHHHHHHHHHhhcchhhc-c----ccccccccCCceEEEeccC-cceeeeecCCCc Confidence 111000 02222232 445553 3455333222221 1 111 122338899999984 222222233333 Q ss_pred Eeccccccceeeeeeeeee-eeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhh Q lcl|NC_018276. 73 TIPDSENNSRLVTIVFATY-SWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLY 151 (342) Q Consensus 73 ~~~dsen~~r~it~v~kT~-awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~y 151 (342) +.++.-+.. -++++.-+. .|+|.|..-=-.=...+. +..-.+.+-+++...+|.+.++.+.+-+.+..+.++.-. 
T Consensus 75 i~~~~~~~~-~~~itiD~~~~~~~~i~d~d~~~~~~d~---~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~ 150 (341) T protein:vir:94 75 VGVQPVNDT-DFVITVDTDRTTAVALDDLLEIQASYDL---RAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSS 150 (341) T ss_pred cccccccCc-eEEEEEeeeeecceeechHHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCc Confidence 334332322 234544333 466666531110111122 334455666777888898888877665554444433111 Q ss_pred cccCcceeecccchhHHHhcchHHHhccCc--ccceEEeecchhHHHHHHHHH---hcCccc---ccce-eEeeeeeeee Q lcl|NC_018276. 152 TFAGNVVTADFAKREEIIGDINPMQASNDV--YAPLHVVGNTGVESAVRKLAE---KGLYNE---TNKQ-IQYADKVWHW 222 (342) Q Consensus 152 t~~~n~~q~p~aqR~e~~~~l~a~~r~ndy--~g~l~~iadt~~~s~l~klea---qG~~N~---tNlq-~Q~~N~~~~~ 222 (342) +... .-.+-.--.+.+-++...|.++++ .||+.+| +...+++|.+..+ ....++ +|=+ -.+++..|+. T Consensus 151 ~~~~--t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv-~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~ 227 (341) T protein:vir:94 151 NGAI--TGNGQAFSFAVFLAARRLLLEADVPEEKIVLLI-SPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIR 227 (341) T ss_pred cccc--cCchhhhhHHHHHHHHHHHhhcCCCccCCEEEe-CHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEE Confidence 1100 000111123556677888888887 6888766 7899998876321 111111 1111 2678999999 Q ss_pred eccccchhhhchhhhcccccc----------chhhhhhhhhhhccccccCc-------ccccceecccccceeeeeeeee Q lcl|NC_018276. 223 SNEVANAADRYATAFAVPQGT----------VGMLTRFEREALLRSRSRTG-------HEWDIDTLPMLEMPVGTYYYES 285 (342) Q Consensus 223 sn~vat~A~~~~~~Y~m~sG~----------vGm~~widr~a~~~~~~~tG-------~~~t~~~dP~~g~~~g~~~~~~ 285 (342) ||++...+. +.|..-.|. .|.-.+-. .......-.| -+--+++||...-.+..-..+. 
T Consensus 228 Sn~lp~~~~---~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~ 302 (341) T protein:vir:94 228 TSLIGNNSA---TGWRNGAPTIAPAEATPGFTGSRYLPK--QDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRV 302 (341) T ss_pred ecccccccc---ccccccccceecccccccccccccccc--cccccccEEEEEEecccccceeeecchhhhccccccccc Confidence 999976542 222111111 11111100 0000000001 1112344444444444444444 Q ss_pred hhcchhcchhhhhheeeehhcccccee-eeeEEEeecCCcccC Q lcl|NC_018276. 286 VGDFSGIAGAATADLTRAKKEHYGFAV-DVAFVVAENSDAANN 327 (342) Q Consensus 286 IaD~S~~ageatad~~~a~~~~Y~~~v-d~a~vVAfnsd~a~~ 327 (342) .++++. .+-+|.+.++- -||-.+ +-..+|-+..++++. T Consensus 303 ~~~~~~---~~~~~~i~~~~-~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 303 TQSFEN---REQVWLMVGRQ-AYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred cccchh---hhhhhhhhhhh-hhcccccCcceeEEEecCcCCC Confidence 444432 34567666654 445433 222344555555554 No 14 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=77.58 E-value=0.12 Score=25.57 Aligned_cols=254 Identities=15% Similarity=0.111 Sum_probs=117.4 Q ss_pred CccccccCCcchhhhhhhhhhh--HHHhhhcccchhhhccchhHHHHHhhcCCceeEEEEeecccEeecc--ceeEEecc Q lcl|NC_018276. 1 MQNLRAKANLDKWELRPSRYGA--LDLFMQQTADPAGIISPELTDKAEASIGSTLQVPVIDYDGTITIGN--TRTVTIPD 76 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga--~d~f~~Qt~~~~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~n--a~~~~~~d 76 (342) |-+-.++ |.-|+. .+.|..+.. +..+..... ......|.+++||++-+ +++.+ +....++. 
T Consensus 1 MA~~~~~---------pei~~~~v~~~~~~~lv-~~~l~~~~~--~~~~~~GdTv~ip~~~~---~~~~d~~~~~~~~~~ 65 (273) T protein:vir:79 1 MAFNNFI---------PELWSDMLLEEWTAQTV-FANLVNREY--EGIASKGNVVHIAGVVA---PTVKDYKAAGRQTSA 65 (273) T ss_pred Ccchhhh---------HHHHHHHHHHHHHhhcc-chhhhhccc--cccccCCcEEEEeecCc---ccccccccCCCccCc Confidence 3221111 344543 345544322 122211111 01234588999999732 11111 11112222 Q ss_pred ccccceeeeeeeee-eeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccC Q lcl|NC_018276. 77 SENNSRLVTIVFAT-YSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAG 155 (342) Q Consensus 77 sen~~r~it~v~kT-~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~ 155 (342) ++-...-++++... ..|+|.|.-.=.. +-++ |++.-++.+-+.+...+|.+.++.+.+..|.+-...- T Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~--~~~~--~~~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~------- 134 (273) T protein:vir:79 66 DAISDTGVDLLIDQEKSIDFLVDDIDRV--QVAG--SLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAP------- 134 (273) T ss_pred cccccceEEEEEeeecccceeeccHHHH--hhcc--cHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc------- Confidence 23233345666644 3677776531111 1122 3444556677788899999999888776655422111 Q ss_pred cceeecccchhHHHhcchHHHhccCc--ccceEEeecchhHHHHHHHHH----hcCccccc----ce-eEeeeeeeeeec Q lcl|NC_018276. 156 NVVTADFAKREEIIGDINPMQASNDV--YAPLHVVGNTGVESAVRKLAE----KGLYNETN----KQ-IQYADKVWHWSN 224 (342) Q Consensus 156 n~~q~p~aqR~e~~~~l~a~~r~ndy--~g~l~~iadt~~~s~l~klea----qG~~N~tN----lq-~Q~~N~~~~~sn 224 (342) ..... =-+.+-++...|..+++ .||+.+|. 
...+++|.++.+ ....++.+ -+ -.+.+..|++|| T Consensus 135 --~~~~~--~~~~i~~a~~~ld~~~vP~~~R~lvv~-p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~ 209 (273) T protein:vir:79 135 --SDADD--AFDLIASALKELTKANVPNVGRVVVVN-AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) T ss_pred --cchhh--HHHHHHHHHHHhhhccCCccCcEEEEC-HHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecc Confidence 00000 01345556667778887 67776665 888888877543 22222221 11 147789999999 Q ss_pred cccchhhhchhhhccccccchhhhhhh-hhhhccccccCcccccceecccccceeeeeee--eehhcchhcchh Q lcl|NC_018276. 225 EVANAADRYATAFAVPQGTVGMLTRFE-REALLRSRSRTGHEWDIDTLPMLEMPVGTYYY--ESVGDFSGIAGA 295 (342) Q Consensus 225 ~vat~A~~~~~~Y~m~sG~vGm~~wid-r~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~--~~IaD~S~~age 295 (342) ++..... .+.++.-.+++|+...++ -|.. +..+.|...+.=+ +-||.... ++|.=... .|+ T Consensus 210 ~lp~~~~--~~~~a~~~~A~~~a~~~~~~e~~-----r~~~~~~~~v~~~--~~yg~~v~~p~~vv~~~~-~g~ 273 (273) T protein:vir:79 210 NLRDTDD--EQFVAFHPSAAAYVSQIDTVEAL-----RDQDSFSDRIRAL--HVYGGKVVRPTGVVVFNK-TGS 273 (273) T ss_pred cccccCc--eEEEEEeccceeeeeehhhhhcc-----cCcccceeeeeee--eeeeeEEecCceEEEEec-cCC Confidence 9965443 223333445556665554 1221 1112233332211 11222221 11111000 111 No 15 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=60.41 E-value=0.37 Score=22.93 Aligned_cols=258 Identities=12% Similarity=0.107 Sum_probs=104.8 Q ss_pred Ccccccc-CCcchhhhhhhhhhhH--HHhhhcccchhhhccchhHHHHHhhcCCceeEEEEeecccEeeccceeEEeccc Q lcl|NC_018276. 1 MQNLRAK-ANLDKWELRPSRYGAL--DLFMQQTADPAGIISPELTDKAEASIGSTLQVPVIDYDGTITIGNTRTVTIPDS 77 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~r~s~~ga~--d~f~~Qt~~~~~mls~Es~~kl~aS~~~plkipVlny~a~~ti~na~~~~~~ds 77 (342) |-+---| +++ -=|.-|+.+ +-+.+. .-.+.+.... ..+....|..+++|.+++-+..+.-.--.-+.++. 
T Consensus 1 ma~~~T~~~d~----iiPev~~~~v~~~~~~~-~~~~~~~~~~--~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~ 73 (272) T protein:vir:36 1 MSKQKTTLADL----VNPEVLAPIVSYELNKA-LRFAPLAQVD--TTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDK 73 (272) T ss_pred CCCcceehhhh----hchHHHHHHHHHHHHhh-hhhccccccc--cccccCCCCEEEEeeeccCccccccCCCCccChhh Confidence 3321111 111 012223222 111000 0000110000 22344468889999987643222111112233333 Q ss_pred cccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhhcccCcc Q lcl|NC_018276. 78 ENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLGLYTFAGNV 157 (342) Q Consensus 78 en~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~~yt~~~n~ 157 (342) -+.+. .+...+.+..+|.++=-...=...+-.+...+++-. .+...+|++++++|...+..+-+. T Consensus 74 lt~~~-~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~---~~a~~~d~~i~~~l~~~~~~~~~~----------- 138 (272) T protein:vir:36 74 IGTTT-KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL---SLANKVDDDLLSAAKTTSQTVSTK----------- 138 (272) T ss_pred cCCcc-eeEeeehhhccccccHHHHhhccchHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccc----------- Confidence 32222 222223334444543311111122233333344443 345688888888886554443211 Q ss_pred eeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh---cCcccc----ccee-Eeeeeeeeeeccccch Q lcl|NC_018276. 158 VTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEK---GLYNET----NKQI-QYADKVWHWSNEVANA 229 (342) Q Consensus 158 ~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaq---G~~N~t----Nlq~-Q~~N~~~~~sn~vat~ 229 (342) .. -+ -+-+....|...+.... .++=+..++..|+|...- +.+++. |-++ .|+++.+..||.+... 
T Consensus 139 ~~-----~d-~i~~A~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~ 211 (272) T protein:vir:36 139 AN-----VD-GVQAALDIFNDEDAQAY-VLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEG 211 (272) T ss_pred cc-----HH-HHHHHHHHhhhcCCCce-EEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCC Confidence 11 11 23344445556666655 466788899999986531 222222 2223 5889999999998644 Q ss_pred hhhchhhhccccccchhhhh--hhhhhhccccccCcccccceeccccc-ceeeeeeeeehhcchhcchhhhhheeeehhc Q lcl|NC_018276. 230 ADRYATAFAVPQGTVGMLTR--FEREALLRSRSRTGHEWDIDTLPMLE-MPVGTYYYESVGDFSGIAGAATADLTRAKKE 306 (342) Q Consensus 230 A~~~~~~Y~m~sG~vGm~~w--idr~a~~~~~~~tG~~~t~~~dP~~g-~~~g~~~~~~IaD~S~~ageatad~~~a~~~ 306 (342) .. +.+.|..-.|.+|+..- +.=|..-+-.. +..+ +.+ .-||+++++. ++ T Consensus 212 ~~-~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~-----~~d~---i~~~~~y~~~v~~~----~~--------------- 263 (272) T protein:vir:36 212 SA-LMFKIVSNSPALKLVLKRGVQVETDRDIVT-----KTTV---ITADEHYAAYLYDL----TK--------------- 263 (272) T ss_pred ce-eEEEEEecccceeeeecCCcccccccchhh-----cCcE---EEEEEEEEEEEEcC----cc--------------- Confidence 32 34445555677775432 11111111111 1111 000 1133333221 11 Q ss_pred cccceeeeeEEEeecCCc Q lcl|NC_018276. 307 HYGFAVDVAFVVAENSDA 324 (342) Q Consensus 307 ~Y~~~vd~a~vVAfnsd~ 324 (342) +|+.=+--- T Consensus 264 ---------vv~~t~~g~ 272 (272) T protein:vir:36 264 ---------VVNITFTGV 272 (272) T ss_pred ---------EEEEeecCC Confidence 111100000 No 16 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=60.24 E-value=0.38 Score=22.91 Aligned_cols=247 Identities=14% Similarity=0.082 Sum_probs=111.5 Q ss_pred CccccccCCcchhhhhhhhhhh--HHHhhhcccchhhhccchhHHH--HHhhcCCceeEEEEe------ecccEeeccce Q lcl|NC_018276. 
1 MQNLRAKANLDKWELRPSRYGA--LDLFMQQTADPAGIISPELTDK--AEASIGSTLQVPVID------YDGTITIGNTR 70 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga--~d~f~~Qt~~~~~mls~Es~~k--l~aS~~~plkipVln------y~a~~ti~na~ 70 (342) |-+-.++ +..|+. .+.|..+. .+++-..+. ..-..|.+++||++. |.+.++ T Consensus 1 MA~~~~~---------pe~~~~~v~~~~~~~l-----v~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~----- 61 (273) T protein:vir:10 1 MAFNNFI---------PELWSDMLLEEWTAQT-----VFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR----- 61 (273) T ss_pred Ccchhhh---------HHHHHHHHHHHHHhhh-----ccchhhccccccccccCceEEEeecccccccccccCCC----- Confidence 3221111 445643 45553322 222211110 112357889999873 332222 Q ss_pred eEEeccccccceeeeeeeeee-eeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Q lcl|NC_018276. 71 TVTIPDSENNSRLVTIVFATY-SWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLG 149 (342) Q Consensus 71 ~~~~~dsen~~r~it~v~kT~-awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~ 149 (342) -+.+ .+-.-.-++++.... .|+|.|.=.=....-. |++.-.+.+-+.+...+|.+..+.+.+..+.+-... T Consensus 62 -~~~~-~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~----~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~-- 133 (273) T protein:vir:10 62 -QTSA-DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAG----SLEAYTRAGATALATDTDKFIADMLVDNGTALTGSA-- 133 (273) T ss_pred -ccCc-cccccceEEEEEeeeeecceEeecHHHhhhhc----cHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-- Confidence 1222 222333466666543 6777765211111111 233344456677888999999888876655543221 Q ss_pred hhcccCcceeecccchhHHHhc---chHHHhccCc--ccceEEeecchhHHHHHHHHH----hcCccccc-c---e-eEe Q lcl|NC_018276. 150 LYTFAGNVVTADFAKREEIIGD---INPMQASNDV--YAPLHVVGNTGVESAVRKLAE----KGLYNETN-K---Q-IQY 215 (342) Q Consensus 150 ~yt~~~n~~q~p~aqR~e~~~~---l~a~~r~ndy--~g~l~~iadt~~~s~l~klea----qG~~N~tN-l---q-~Q~ 215 (342) +. ...+++.. +...|..+++ .||+. 
+=+...+.+|.+..+ ...+++.+ + + -.+ T Consensus 134 -----------~~-~~~~~~~~i~~a~~~ld~~~vP~~~R~l-vv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i 200 (273) T protein:vir:10 134 -----------PT-DADDAFDLIAKALKELTKANVPNVGRVV-VVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL 200 (273) T ss_pred -----------cc-chhHHHHHHHHHHHHhhhcCCCcCCCEE-EECHHHHHHHhcchhhhhhhhccccccceeeeeeeEE Confidence 11 22334444 4555677777 56654 557788888877543 12222211 1 1 346 Q ss_pred eeeeeeeeccccchhhhchhhhccccccchhhhhhhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchh Q lcl|NC_018276. 216 ADKVWHWSNEVANAADRYATAFAVPQGTVGMLTRFEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGA 295 (342) Q Consensus 216 ~N~~~~~sn~vat~A~~~~~~Y~m~sG~vGm~~widr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~age 295 (342) .+..|++||++..... .+.++.-.+++|+...++----.|...+ |... ..|.|+| T Consensus 201 ~G~~v~~s~~lp~~~~--~~~~~~~~~A~~~a~q~~~~e~~r~~~~----~~~~-------v~~~~~y------------ 255 (273) T protein:vir:10 201 LGARIVESNNLRDTDD--EQFVAFHPSAAAYVSQIDTVEALRDQDS----FSDR-------IRALHVY------------ 255 (273) T ss_pred eceEEEEecccccCCc--cEEEEEeccceeeeeeeehhhcccCCCc----ceee-------eeeeeee------------ Confidence 7899999999965433 3334444555666555432111111111 2222 2222221 Q ss_pred hhhheeeehhcccccee-eeeEEEeecCCcc Q lcl|NC_018276. 296 ATADLTRAKKEHYGFAV-DVAFVVAENSDAA 325 (342) Q Consensus 296 atad~~~a~~~~Y~~~v-d~a~vVAfnsd~a 325 (342) |..+ +--=||-.+.-.+ T Consensus 256 -------------g~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 256 -------------GGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -------------eeeEeccceEEEEeccCC Confidence 1100 0000000000000 No 17 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=60.24 E-value=0.38 Score=22.91 Aligned_cols=247 Identities=14% Similarity=0.082 Sum_probs=111.5 Q ss_pred CccccccCCcchhhhhhhhhhh--HHHhhhcccchhhhccchhHHH--HHhhcCCceeEEEEe------ecccEeeccce Q lcl|NC_018276. 
1 MQNLRAKANLDKWELRPSRYGA--LDLFMQQTADPAGIISPELTDK--AEASIGSTLQVPVID------YDGTITIGNTR 70 (342) Q Consensus 1 ~q~~r~k~n~dk~~~r~s~~ga--~d~f~~Qt~~~~~mls~Es~~k--l~aS~~~plkipVln------y~a~~ti~na~ 70 (342) |-+-.++ +..|+. .+.|..+. .+++-..+. ..-..|.+++||++. |.+.++ T Consensus 1 MA~~~~~---------pe~~~~~v~~~~~~~l-----v~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~----- 61 (273) T protein:vir:10 1 MAFNNFI---------PELWSDMLLEEWTAQT-----VFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR----- 61 (273) T ss_pred Ccchhhh---------HHHHHHHHHHHHHhhh-----ccchhhccccccccccCceEEEeecccccccccccCCC----- Confidence 3221111 445643 45553322 222211110 112357889999873 332222 Q ss_pred eEEeccccccceeeeeeeeee-eeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Q lcl|NC_018276. 71 TVTIPDSENNSRLVTIVFATY-SWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKTQVLNDTLG 149 (342) Q Consensus 71 ~~~~~dsen~~r~it~v~kT~-awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKTqV~~d~L~ 149 (342) -+.+ .+-.-.-++++.... .|+|.|.=.=....-. |++.-.+.+-+.+...+|.+..+.+.+..+.+-... T Consensus 62 -~~~~-~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~----~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~-- 133 (273) T protein:vir:10 62 -QTSA-DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAG----SLEAYTRAGATALATDTDKFIADMLVDNGTALTGSA-- 133 (273) T ss_pred -ccCc-cccccceEEEEEeeeeecceEeecHHHhhhhc----cHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-- Confidence 1222 222333466666543 6777765211111111 233344456677888999999888876655543221 Q ss_pred hhcccCcceeecccchhHHHhc---chHHHhccCc--ccceEEeecchhHHHHHHHHH----hcCccccc-c---e-eEe Q lcl|NC_018276. 150 LYTFAGNVVTADFAKREEIIGD---INPMQASNDV--YAPLHVVGNTGVESAVRKLAE----KGLYNETN-K---Q-IQY 215 (342) Q Consensus 150 ~yt~~~n~~q~p~aqR~e~~~~---l~a~~r~ndy--~g~l~~iadt~~~s~l~klea----qG~~N~tN-l---q-~Q~ 215 (342) +. ...+++.. +...|..+++ .||+. 
+=+...+.+|.+..+ ...+++.+ + + -.+ T Consensus 134 -----------~~-~~~~~~~~i~~a~~~ld~~~vP~~~R~l-vv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i 200 (273) T protein:vir:10 134 -----------PT-DADDAFDLIAKALKELTKANVPNVGRVV-VVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL 200 (273) T ss_pred -----------cc-chhHHHHHHHHHHHHhhhcCCCcCCCEE-EECHHHHHHHhcchhhhhhhhccccccceeeeeeeEE Confidence 11 22334444 4555677777 56654 557788888877543 12222211 1 1 346 Q ss_pred eeeeeeeeccccchhhhchhhhccccccchhhhhhhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchh Q lcl|NC_018276. 216 ADKVWHWSNEVANAADRYATAFAVPQGTVGMLTRFEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGA 295 (342) Q Consensus 216 ~N~~~~~sn~vat~A~~~~~~Y~m~sG~vGm~~widr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~age 295 (342) .+..|++||++..... .+.++.-.+++|+...++----.|...+ |... ..|.|+| T Consensus 201 ~G~~v~~s~~lp~~~~--~~~~~~~~~A~~~a~q~~~~e~~r~~~~----~~~~-------v~~~~~y------------ 255 (273) T protein:vir:10 201 LGARIVESNNLRDTDD--EQFVAFHPSAAAYVSQIDTVEALRDQDS----FSDR-------IRALHVY------------ 255 (273) T ss_pred eceEEEEecccccCCc--cEEEEEeccceeeeeeeehhhcccCCCc----ceee-------eeeeeee------------ Confidence 7899999999965433 3334444555666555432111111111 2222 2222221 Q ss_pred hhhheeeehhcccccee-eeeEEEeecCCcc Q lcl|NC_018276. 296 ATADLTRAKKEHYGFAV-DVAFVVAENSDAA 325 (342) Q Consensus 296 atad~~~a~~~~Y~~~v-d~a~vVAfnsd~a 325 (342) |..+ +--=||-.+.-.+ T Consensus 256 -------------g~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 256 -------------GGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -------------eeeEeccceEEEEeccCC Confidence 1100 0000000000000 No 18 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=49.78 E-value=0.63 Score=21.69 Aligned_cols=279 Identities=14% Similarity=0.117 Sum_probs=106.2 Q ss_pred Ccccccc-CCcchhh---hhhhhhhhHHHhhhcccchhhhccchhHHH----HHh------------hcCCceeEEEEee Q lcl|NC_018276. 
1 MQNLRAK-ANLDKWE---LRPSRYGALDLFMQQTADPAGIISPELTDK----AEA------------SIGSTLQVPVIDY 60 (342) Q Consensus 1 ~q~~r~k-~n~dk~~---~r~s~~ga~d~f~~Qt~~~~~mls~Es~~k----l~a------------S~~~plkipVlny 60 (342) |+.-... -++-+|. .|....++.-....++. .+++.++..+. ++. .-+..+++|+++- T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~--~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~ 78 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKK--DGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCC--cceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC Confidence 3322211 2344333 23333444333322222 12333333222 211 1134578888864 Q ss_pred cccEeeccceeEEeccccccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018276. 61 DGTITIGNTRTVTIPDSENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANK 140 (342) Q Consensus 61 ~a~~ti~na~~~~~~dsen~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanK 140 (342) ......- +..-.+|+++..-.-|++..++.+=-..|+-.+...+.++.+.-+...+. +++...+|...+.---+++ T Consensus 79 ~~~a~~v-~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~---~ai~~~~d~a~l~G~g~~~ 154 (324) T protein:vir:10 79 KPGAYWV-GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIA---EAFYKKFDEAGILNQGNNP 154 (324) T ss_pred CcceeEe-ccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHH---HHHHHHHHHHhhhcCCCCc Confidence 3222211 12233455544444455555554422224444555555544444444443 4456778887765433322 Q ss_pred ccccccchhhhc--ccCcceeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHH-hcCcccccceeEeee Q lcl|NC_018276. 141 TQVLNDTLGLYT--FAGNVVTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAE-KGLYNETNKQIQYAD 217 (342) Q Consensus 141 TqV~~d~L~~yt--~~~n~~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~klea-qG~~N~tNlq~Q~~N 217 (342) .+.++++ ..++.....-.--++ +.++-..+..+++... .++-|......|+++.. +|- +.+. 
T Consensus 155 -----~~~~i~~~~~~~~~~~~~~~t~~~-i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~d~~g~-------~~~~- 219 (324) T protein:vir:10 155 -----FGKSIAQSIEKTNKVIKGDFTQDN-IIDLEALLEDDELEAN-AFISKTQNRSLLRKIVDPETK-------ERIY- 219 (324) T ss_pred -----cCccccccccccceeccccCCHHH-HHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhccCCc-------eeec- Confidence 1222222 222222111111122 2233333434433333 57889999999999853 221 1110 Q ss_pred eeeeeeccccchhhhchhhhccccccchhhhhhhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhh Q lcl|NC_018276. 218 KVWHWSNEVANAADRYATAFAVPQGTVGMLTRFEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAAT 297 (342) Q Consensus 218 ~~~~~sn~vat~A~~~~~~Y~m~sG~vGm~~widr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageat 297 (342) .....+-.|.| .++.|.....-|..+ +||||.+.---- T Consensus 220 ------------~~~~~~l~G~P---------------------------V~~~~~~~~~~~~~~---~gd~~~~~~~~~ 257 (324) T protein:vir:10 220 ------------DRNSDTLDGLP---------------------------VVNLKSSNLKRGELI---TGDFDKLIYGIP 257 (324) T ss_pred ------------CCCCcccccee---------------------------EEeecCCCCCcceEE---EEecccEEEEEe Confidence 00111112222 122222222222211 344444311001 Q ss_pred hheeeehhcccc--------------ceee-eeEEEeecCCc-ccCCCceEEEEeccCCCC Q lcl|NC_018276. 298 ADLTRAKKEHYG--------------FAVD-VAFVVAENSDA-ANNPGAILKLDIKTEGAV 342 (342) Q Consensus 298 ad~~~a~~~~Y~--------------~~vd-~a~vVAfnsd~-a~~~~~i~k~~~~~e~~~ 342 (342) -++.+..+.+.. |.-| +++.+-.--|. 
-..|.++.+|...+.+.- T Consensus 258 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~ 318 (324) T protein:vir:10 258 QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC Confidence 122222222211 1111 12211111121 124677777776666553 No 19 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=31.56 E-value=1.5 Score=19.64 Aligned_cols=263 Identities=14% Similarity=0.128 Sum_probs=92.7 Q ss_pred Ccccccc-CCcchhhh---hhhhhhhHHHhhhcccchhhhccchhHH----HHHh-h----------c-CCceeEEEEee Q lcl|NC_018276. 1 MQNLRAK-ANLDKWEL---RPSRYGALDLFMQQTADPAGIISPELTD----KAEA-S----------I-GSTLQVPVIDY 60 (342) Q Consensus 1 ~q~~r~k-~n~dk~~~---r~s~~ga~d~f~~Qt~~~~~mls~Es~~----kl~a-S----------~-~~plkipVlny 60 (342) |++-+.. .|+-+|+. ++.+++|.-.... .+-.+++.++..+ .++. | + +..+++|+++- T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~--~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMH--EKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhccccccccc--CCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEec Confidence 4332221 23333322 2233333222111 1222344333332 2221 1 2 23478888765 Q ss_pred cccEeeccceeEEeccccccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018276. 61 DGTITIGNTRTVTIPDSENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANK 140 (342) Q Consensus 61 ~a~~ti~na~~~~~~dsen~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanK 140 (342) ......-. -.-.+|.++-.-.-|++..++.+=-+.|+=.+...+.+++++-+...+. 
+++...+|...+.---+++ T Consensus 79 ~~~a~~v~-Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~---~aia~~~d~a~l~G~g~~~ 154 (324) T protein:vir:93 79 KPGAYWVG-EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIA---EAFYKKFDEAGILNQGNNP 154 (324) T ss_pred Ccceeeec-CCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHH---HHHHHHHHHHHhcCCCCCC Confidence 43332211 1222344433333455555554422223334444444444443333333 4566678877764322222 Q ss_pred ccccccchhhhcc--cCccee---ecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHH-HhcCcccccceeE Q lcl|NC_018276. 141 TQVLNDTLGLYTF--AGNVVT---ADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLA-EKGLYNETNKQIQ 214 (342) Q Consensus 141 TqV~~d~L~~yt~--~~n~~q---~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kle-aqG~~N~tNlq~Q 214 (342) .+. +..+. .++... ..+.+-.+++..|++ +++.. =.++-+......|+++. .+|-+ - T Consensus 155 ---~~~--~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~----~~~~~-~~~v~n~~~~~~L~~l~d~~G~~-------~ 217 (324) T protein:vir:93 155 ---FGK--SIAQSIEKTNKVIKGDFTQDNIIDLEALLED----DELEA-NAFISKTQNRSLLRKIVDPETKE-------R 217 (324) T ss_pred ---cCc--cccccccccceeccccccHHHHHHHHHhhhh----ccCCC-CEEEEcHHHHHHHHHhhCCCCCe-------e Confidence 111 11211 111111 112223333444443 33333 36899999999999884 22211 1 Q ss_pred eeeeeeeeeccccchhhhchhhhccccccchhhhhhhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcch Q lcl|NC_018276. 215 YADKVWHWSNEVANAADRYATAFAVPQGTVGMLTRFEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAG 294 (342) Q Consensus 215 ~~N~~~~~sn~vat~A~~~~~~Y~m~sG~vGm~~widr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ag 294 (342) +.+ ....+-.|.|- ++.|.....-|.- .+||||-+.- T Consensus 218 ~~~-------------~~~~~l~G~PV---------------------------v~~~~~~~~~~~i---~~gdfs~~~~ 254 (324) T protein:vir:93 218 IYD-------------RNSDSLDGLPV---------------------------VNLKSSNLKRGEL---ITGDFDKLIY 254 (324) T ss_pred ecC-------------CCCCcccceee---------------------------EeecCCCCCcceE---EEEecceEEE Confidence 100 00111122221 0001100111110 1233332210 Q ss_pred ----h------------------------hhhhee-eehhccccceeeeeEEEeecCCcccCCCceEEEEeccCCC---- Q lcl|NC_018276. 
295 ----A------------------------ATADLT-RAKKEHYGFAVDVAFVVAENSDAANNPGAILKLDIKTEGA---- 341 (342) Q Consensus 295 ----e------------------------atad~~-~a~~~~Y~~~vd~a~vVAfnsd~a~~~~~i~k~~~~~e~~---- 341 (342) . .+.|++ +-..+.+|+.+ ..|.+|.+|..-..++ T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v-------------~~~~a~~~l~~a~~~~~~~~ 321 (324) T protein:vir:93 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI-------------ADDKAFAKLVPADKRTDSVP 321 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEE-------------ecccceEEEecccccCCCCC Confidence 0 111111 11112222221 2234444443322222 Q ss_pred --C Q lcl|NC_018276. 342 --V 342 (342) Q Consensus 342 --~ 342 (342) | T Consensus 322 ~~~ 324 (324) T protein:vir:93 322 GEV 324 (324) T ss_pred CCC Confidence 2 No 20 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=20.06 E-value=2.8 Score=18.10 Aligned_cols=278 Identities=15% Similarity=0.131 Sum_probs=108.4 Q ss_pred CccccccCCcchhh---hhhhhhhhHHHhhhcccchhhhccchhHHH----HHh------------hcCCceeEEEEeec Q lcl|NC_018276. 1 MQNLRAKANLDKWE---LRPSRYGALDLFMQQTADPAGIISPELTDK----AEA------------SIGSTLQVPVIDYD 61 (342) Q Consensus 1 ~q~~r~k~n~dk~~---~r~s~~ga~d~f~~Qt~~~~~mls~Es~~k----l~a------------S~~~plkipVlny~ 61 (342) ||+++. ++-||. .+...+++.-....++.. +++.++..++ ++. .-+..+++|+++-. T Consensus 4 ~~~~~~--~~~~~~~~~~~~~~~~a~~~~~~~~~~--~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~ 79 (324) T protein:vir:99 4 TQKLKL--NLQHFASNNVKPQVFNPDNVMMHEKKD--GTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred chHhhH--HHHHHHHHhhhhhhccccceeccCCCc--ceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecC Confidence 444432 333333 344455554444333222 2333322222 211 11345788887643 Q ss_pred ccEeeccceeEEeccccccceeeeeeeeeeeeceeeeeeeccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018276. 
62 GTITIGNTRTVTIPDSENNSRLVTIVFATYSWGFTIVPAAYMNNEVSIQADFERKFNKYLYKFGKTLDGAAVAALDANKT 141 (342) Q Consensus 62 a~~ti~na~~~~~~dsen~~r~it~v~kT~awgf~i~p~i~~NNei~~q~d~~~~~q~~L~~~~~~vd~~~~a~LeanKT 141 (342) .....- +..-.+|+++-.-.-|++..++.+=-..|+-.+...+.++.++-+...+.+. +..++|...+.---++ T Consensus 80 ~~a~~v-~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a---i~~~~d~~~l~G~g~~-- 153 (324) T protein:vir:99 80 PGAYWV-GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEA---FYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeEe-ccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHH---HHHHHHHHhhhcCCCC-- Confidence 222211 1233455555444455555555543333555555556665555555555443 4567777766432222 Q ss_pred cccccchhhhcc--cCcceeecccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhcCcccccceeEeeeee Q lcl|NC_018276. 142 QVLNDTLGLYTF--AGNVVTADFAKREEIIGDINPMQASNDVYAPLHVVGNTGVESAVRKLAEKGLYNETNKQIQYADKV 219 (342) Q Consensus 142 qV~~d~L~~yt~--~~n~~q~p~aqR~e~~~~l~a~~r~ndy~g~l~~iadt~~~s~l~kleaqG~~N~tNlq~Q~~N~~ 219 (342) +.+.+.++- .++.....-.--++ +-++-..+..+++... .++.+......|+++..- .+ ++.+. T Consensus 154 ---~~~~~~~~~~~~~~~~~~~~~~~~~-i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~d~-~g-----~~~~~--- 219 (324) T protein:vir:99 154 ---PFGKSIAQSIEKTNKVIKGDFTQDN-IIDLEALLEDDELEAN-AFISKTQNRSLLRKIVDP-ET-----KERIY--- 219 (324) T ss_pred ---ccCccccccccccceeccccCCHHH-HHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhcC-CC-----ceeec--- Confidence 122222322 11111111111122 3333444444444444 588899999999988531 11 11110 Q ss_pred eeeeccccchhhhchhhhccccccchhhhhhhhhhhccccccCcccccceecccccceeeeeeeeehhcchhcchhhhhh Q lcl|NC_018276. 
220 WHWSNEVANAADRYATAFAVPQGTVGMLTRFEREALLRSRSRTGHEWDIDTLPMLEMPVGTYYYESVGDFSGIAGAATAD 299 (342) Q Consensus 220 ~~~sn~vat~A~~~~~~Y~m~sG~vGm~~widr~a~~~~~~~tG~~~t~~~dP~~g~~~g~~~~~~IaD~S~~ageatad 299 (342) + ....+-.|.| .++.|.....-|..+ +||||-+.-----+ T Consensus 220 ---------~-~~~~~l~G~P---------------------------Vv~~~~~~~~~~~~i---~gd~~~~~~~~~~~ 259 (324) T protein:vir:99 220 ---------D-RNSDTLDGLP---------------------------VVNLKSSNLKRGELI---TGDFDKLIYGIPQL 259 (324) T ss_pred ---------C-CCCcccccee---------------------------EEeecCCCCCcceEE---EEecccEEEEEecC Confidence 0 0011111111 112222221112111 23444331100011 Q ss_pred eeeehhcccc--------------ceee-eeEEEeecCCc-ccCCCceEEEEeccCCCC Q lcl|NC_018276. 300 LTRAKKEHYG--------------FAVD-VAFVVAENSDA-ANNPGAILKLDIKTEGAV 342 (342) Q Consensus 300 ~~~a~~~~Y~--------------~~vd-~a~vVAfnsd~-a~~~~~i~k~~~~~e~~~ 342 (342) +.+..+.+.. |.-| +++.+-.--|. -..|.++.+|...+.+.- T Consensus 260 ~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~ 318 (324) T protein:vir:99 260 IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred cEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC Confidence 2221111111 1111 11111111111 124666777766555443 Done!