Query         lcl|NC_017732.1_cdsid_YP_006218736.1 [gene=EF62_phi0041] [protein=bacteriophage head-tail connector protein] [protein_id=YP_006218736.1] [location=21584..22585]
Match_columns 333
No_of_seqs    12 out of 16
Neff          3.5
Searched_HMMs 1612
Date          Thu Nov 7 12:53:14 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_41 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_41_vs_rec_db.hhr

 No Hit                             Prob  E-value  P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:5205 Length: 309 # 100.0   8E-122   5E-125  684.6  24.9  292    34-333 1-297 (309)
  2 protein:vir:4732 Length: 337 # 100.0   1E-120   6E-124  678.5  24.3  297    18-333 1-331 (337)
  3 protein:vir:9605 Length: 317 # 100.0   3E-118   2E-121  665.2  24.5  303    10-333 1-311 (317)
  4 protein:vir:17 Length: 306 # N 100.0   2E-117   9E-121  661.0  23.5  290    36-333 1-298 (306)
  5 protein:vir:121 Length: 308 #  100.0   1E-115   6E-119  651.0  25.5  291    35-333 1-296 (308)
  6 protein:vir:79400 Length: 323  100.0   8E-113   5E-116  635.2  24.1  293    16-333 1-308 (323)
  7 protein:vir:97362 Length: 327  100.0   3E-108   2E-111  610.3  24.4  291    20-333 1-319 (327)
  8 protein:vir:9445 Length: 249 # 100.0  2.4E-84  1.5E-87  479.0  17.2  214   108-333 1-241 (249)
  9 protein:vir:9466 Length: 249 # 100.0  2.4E-84  1.5E-87  479.0  17.2  214   108-333 1-241 (249)
 10 protein:vir:102950 Length: 471   3.0       22    0.014   13.2  19.7  303     1-333 38-423 (471)

No 1
>protein:vir:5205 Length: 309 # NCBI annotation: upper collar protein # Family: family:all:2168 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040728;genbank:gi:9626399;genbank:GeneID:1260970
Probab=100.00 E-value=7.7e-122 Score=684.57 Aligned_cols=292 Identities=24% Similarity=0.371 Sum_probs=275.7

Q ss_pred cccccccccchhhhHHHHHHHHHHHHHHHHHHhhhhhhheecCCCCCcCHHHHHHHHHhcCcEEEEecCCcceeEEecCC
Q lcl|NC_017732.
34 IETSSTTTGDNLGNNYVQFEAIQTFILIRQIKDMLINLFKYENMPPTLNTAQLETMLRQMGGGVCVGKDELGDLVILGRA 113 (333) Q Consensus 34 ~~~~~~t~~~~~~~~~v~~~~~~~~~l~~~y~~~~~n~F~wenLP~~ids~~IE~~L~~~G~~vf~~~~~~G~~~~lg~~ 113 (333) .----||+.++||+..++|+++|++++.||++++++++|+||||||||||+|||++|||+|+|+|++|++.| ||++|++ T Consensus 1 m~~k~~~~~~~i~~i~~~k~N~~~~~y~ryl~~l~~~lf~wenlp~~id~~flE~~L~q~g~V~f~kd~~~g-y~~~~~~ 79 (309) T protein:vir:52 1 MGRKRSNSYRSINEIQRQKRNRWFIHYLNYLQSLAYQLFEWENLPPTINPSFLEKSIHQFGYVGFYKDPVIS-YIACNGA 79 (309) T ss_pred CCccccchhhhHHHHHhhhchhHHHHHHHHHHhhhhhhhcccCCCCCcCHHHHHHHHHhCCCeEEEEeccee-eeeeccc Confidence 222246889999999999999999999999999999999999999999999999999999999999999999 9999999 Q ss_pred CccCCCcCcccCCceeeeeccccccc-ccccccCC--CcceEEEeecccccccccCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017732. 114 DELGYNLYGNVIPSLFDGNNNFLQSK-KVITNRNL--KGDYVVFYNKQSFNDFYATDYDIVEHYAKQLATIKATERMNIM 190 (333) Q Consensus 114 ~~~~~NiYg~~~P~~f~~~~~~~q~~-~~~~~~~~--~g~~VV~~Nk~~~N~~~~p~~~~ie~Ya~eLAeIe~ti~~n~~ 190 (333) .++++|+||| |++|+++++++|++ ++++.|++ |+.+||+|| |++++||++|||||||+||+|++||++|++ T Consensus 80 ~~g~~~ly~q--p~~f~an~~~~~~k~~~~~~r~~~ed~~~VViyN----N~~~~p~~~ile~ya~~LAei~~t~~~n~~ 153 (309) T protein:vir:52 80 LSGQRDVYNQ--ATVFRASSPVYQKEFKLYNYRDMKEDDMGVVIYN----NDMSFPTTPTLELFAAELAELKEIISVNQN 153 (309) T ss_pred cccccccccC--CceeeccccccceeeeeeechhhccCccEEEeeC----CccccCcccHHHHHHHHHHHHHHHHHHHHh Confidence 9999999999 67999999999999 66665554 555599999 777999999999999999999999999999 Q ss_pred hhhCcEEEEecCCC-hhHHHHHHHHhCCCceEeecCCcChhhhhheeccCchhhhHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_017732. 
191 QMRSPYIVKGKKNG-QLLQVLQSKIQNGDLFVGVEEGSDITEKIEKLDLNVTDRTPSLQNAYRNTFNEMLTLFGIYNNPE 269 (333) Q Consensus 191 qqk~P~iIa~~~N~-~S~k~l~n~i~ngepvV~~~e~~d~~d~I~vl~t~a~~~ldkL~~e~~~~~sEllT~LGInN~n~ 269 (333) |||+|+||++++|+ +|+++|+|+|++|||||+++|++| +|+|+||+|+||+++|+|+++++..|+||||||||+|+++ T Consensus 154 ~~k~P~iI~a~~n~~~s~~~L~nki~~g~Pvi~~~~~~d-~D~I~V~~t~a~~v~d~l~~~k~~~~NE~~t~LGI~n~~~ 232 (309) T protein:vir:52 154 AQKTPVLIRANDNNQLSLKQVYNQYEGNAPVIFAHEALD-SDSIEVFKTDAPYVVDKLNAQKNAVWNEMMTFLGIKNANL 232 (309) T ss_pred hhhCCeEEEecCCCcchHHHHHHHHhCCcceEEecCCCC-cccceeeecCcchhcchhHHHHHHHHHHHHHhhccccccc Confidence 99999999887666 599999999999999999999999 6999999999999999999999999999999999999999 Q ss_pred chhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehh-HHHHHHHHHHHhhcC Q lcl|NC_017732. 270 QKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWN-STVASMFRDLGQKQG 333 (333) Q Consensus 270 DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~-~~v~~~~~~l~~~~~ 333 (333) |||||||++||+|||+||++|++||||+|+|||++|||+|||||||+|+ +.|.+||++|++++. T Consensus 233 dKkERvv~~Ea~SNn~~i~a~~~iylk~R~ea~elIN~~yGL~I~v~~~~~~V~~~~~~l~~~~~ 297 (309) T protein:vir:52 233 EKKERMVTDEVSSNDEQIESSGTVFLKSREEACEKINELYGLDVKVRFRYDIVEQMRRELQQIEN 297 (309) T ss_pred hhhhcchhhhhhhhhhhhhhhhHhhhhhhHHHHHHHHhHhccceeEeeccchHhhhhhhhHhHHH Confidence 9999999999999999999999999999999999999999999999999 999999999999987 No 2 >protein:vir:4732 Length: 337 # NCBI annotation: connector protein # Family: family:all:2168 # MgeID: mge:103 # MgeName: Cp-1 # Cross-refs: genbank:acc:NP_044823;swissprot:trembl:q37995;genbank:gi:9629535;uniprot:Q37995;genbank:GeneID:1261247 Probab=100.00 E-value=1e-120 Score=678.46 Aligned_cols=297 Identities=25% Similarity=0.379 Sum_probs=269.3 Q ss_pred hhhhhhhcchHHHhhhcccccccccchhhhHHHHHHHHHHHHHHHHHHhhhhhhheecCCCCCcCHHHHHHHHHhcCcEE Q lcl|NC_017732. 
18 SSYNNVIGNRRDLLQGIETSSTTTGDNLGNNYVQFEAIQTFILIRQIKDMLINLFKYENMPPTLNTAQLETMLRQMGGGV 97 (333) Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~l~~~y~~~~~n~F~wenLP~~ids~~IE~~L~~~G~~v 97 (333) .||.|-- | -|--||-+..| +..+|++|+.++++|++++|+|+|+|||||+||||+|||++|||+|||| T Consensus 1 ~~~~n~~---r-~~~~i~l~k~~--------~~r~r~~~~~~y~ny~~~l~~n~f~wE~lP~~iD~~flE~~L~q~G~v~ 68 (337) T protein:vir:47 1 MSYKNYK---R-HLGKIELNKET--------VERNRLAFFEFYFNYFYNIVVNYFTWEGLPNDIDELFIEKKLIENGHVA 68 (337) T ss_pred CCccccc---c-ccceeeeccch--------hhhHHHHHHHHHHHHHHHHHHHhhhhcCCCCCccHHHHHHHHHhcCCeE Confidence 4554421 1 12224332211 2349999999999999999999999999999999999999999999999 Q ss_pred EEecCCcceeEEecCCCccCCCcCcccCCceeee----eccccccccccccc---------------CCCcceEEEeecc Q lcl|NC_017732. 98 CVGKDELGDLVILGRADELGYNLYGNVIPSLFDG----NNNFLQSKKVITNR---------------NLKGDYVVFYNKQ 158 (333) Q Consensus 98 f~~~~~~G~~~~lg~~~~~~~NiYg~~~P~~f~~----~~~~~q~~~~~~~~---------------~~~g~~VV~~Nk~ 158 (333) ||+|+++| |+++|+|.++++|+||+ |+.|.+ +.++.|+.+++.++ +++|.+|||+| T Consensus 69 f~~D~~~~-yia~g~t~~g~idiy~~--pt~y~~vna~s~n~~ksf~i~~~~n~f~~~~~l~k~~~~~~~~~~VVi~N-- 143 (337) T protein:vir:47 69 FFHDDTFG-YIAQGGTRGERLNHYDQ--PLTYQPVNASSMNYFKQMEIAYTENDFRVIEELHKDNPDKIKRPCIVIPN-- 143 (337) T ss_pred Eecccccc-eeeccccccCceeeccc--cccccccCcccccceeeeeEeecccchhhhhhhcccccchhccCeEEecC-- Confidence 99999999 99999999999999999 788873 33667776666644 36899999999 Q ss_pred cccccccCchhHHHHHHHHHHHHHHHHHHHHHhhhCcEEEE-ecCCChhHHHHHHHHhCCCceEeecCCcC--------- Q lcl|NC_017732. 
159 SFNDFYATDYDIVEHYAKQLATIKATERMNIMQMRSPYIVK-GKKNGQLLQVLQSKIQNGDLFVGVEEGSD--------- 228 (333) Q Consensus 159 ~~N~~~~p~~~~ie~Ya~eLAeIe~ti~~n~~qqk~P~iIa-~~~N~~S~k~l~n~i~ngepvV~~~e~~d--------- 228 (333) |++++||++|||||||+||+|++||++|++|||+|+||+ +++|++|+|+|+|+|++|+|||+++|++| T Consensus 144 --N~~~~~~i~ilelya~eLAeI~~t~~~n~n~~K~P~~I~a~e~n~~s~q~l~nki~ng~Pvv~~~~~~d~~~~~~~~~ 221 (337) T protein:vir:47 144 --NNFYEPYIGYLELFCEKLADIELTIQLNRNAQITPYFIFADNTNVLSMKNIFNKIANFEPVVYLNKQKDQDGQDSFKQ 221 (337) T ss_pred --cccccccccHHHHHHHHHHHHHHHHHHHHHhhhCCeEEEEcCCccchHHHHHHHHhCCcceEEeccccCchhhhhhhh Confidence 888999999999999999999999999999999998775 68888999999999999999999999988 Q ss_pred hhhhhheeccCchhhhHHHHHHHHHHHHHHHHhhcccccccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhh Q lcl|NC_017732. 229 ITEKIEKLDLNVTDRTPSLQNAYRNTFNEMLTLFGIYNNPEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLA 308 (333) Q Consensus 229 ~~d~I~vl~t~a~~~ldkL~~e~~~~~sEllT~LGInN~n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~ 308 (333) ++|+|+||+|+||+++|+|++|++++||||||||||+|+++|||||||++||+|||+||++|++||||+|+|||++|||+ T Consensus 222 l~D~I~v~~t~A~~~ld~l~~e~~~~~nEl~t~LGI~n~~~DKkERvv~~Ea~SNn~~i~a~~~iylk~R~eaielINk~ 301 (337) T protein:vir:47 222 LSDYIQVFRTDAPFLLDKLHDEKLRVMNQLLTFIGINNNPSDKKERLVVSEAISNNGVISANIEVGWKSRRKFVELINKC 301 (337) T ss_pred hhhhhhhhccCchhhhHHHHHHHHHHHHHHHHhhhhccccchhhhhhhhhhhhhhhhhhhhhHHhhhhhhHHHHHHHHHH Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceeeehhHHHH-----HHHHHHHhhcC Q lcl|NC_017732. 
309 FGTDIKVQWNSTVA-----SMFRDLGQKQG 333 (333) Q Consensus 309 fGlnIkv~~~~~v~-----~~~~~l~~~~~ 333 (333) |||||||+|++||+ +|++||+|||| T Consensus 302 yGLnIsv~~~~t~~~~~~~~~~~~~~~~~~ 331 (337) T protein:vir:47 302 YGLEISVKPAETIQQFNLDKVALDLAEKEG 331 (337) T ss_pred hcCCceeehhhhhhhhhhHHHHHHhhhhcC Confidence 99999999999999 79999999999 No 3 >protein:vir:9605 Length: 317 # NCBI annotation: head-tail connector protein # Family: family:all:2168 # MgeID: mge:172 # MgeName: C1 # Cross-refs: genbank:acc:NP_852021;genbank:gi:31072023;genbank:GeneID:1489940 Probab=100.00 E-value=2.7e-118 Score=665.18 Aligned_cols=303 Identities=21% Similarity=0.315 Sum_probs=276.3 Q ss_pred cchhhcchhhhhhhhcchHHHhhhcccccccccchhhhHHHHHHHHHHHHHHHHHHhhhhhhheecCCCCCcCHHHHHHH Q lcl|NC_017732. 10 WGYQTGLASSYNNVIGNRRDLLQGIETSSTTTGDNLGNNYVQFEAIQTFILIRQIKDMLINLFKYENMPPTLNTAQLETM 89 (333) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~l~~~y~~~~~n~F~wenLP~~ids~~IE~~ 89 (333) .--.+|+..||-|- + -||+.+++.+++||||+||++++|++|+++++++|+|||| +|||+|||++ T Consensus 1 ~~~~~~~~~s~~~~----k---------~~~~~~~i~k~~~e~rnr~f~i~~~~y~~~L~~Lf~wEnl--~ID~~flE~~ 65 (317) T protein:vir:96 1 MQITSGIKPSEMNY----K---------MSTFTDDIAERVKLHKQNYFNIIYSRYVEFLPLLISYENY--DLDSLLIESY 65 (317) T ss_pred CcccccCchhhhhh----h---------hhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--cccHHHHHHH Confidence 22234555555442 1 3688999999999999999999999999999999999999 6999999999 Q ss_pred HHhcCcEEEEecCCcceeEEecCCC-------ccCCCcCcccCCceeeeecccccccccccccCCCcceEEEeecccccc Q lcl|NC_017732. 90 LRQMGGGVCVGKDELGDLVILGRAD-------ELGYNLYGNVIPSLFDGNNNFLQSKKVITNRNLKGDYVVFYNKQSFND 162 (333) Q Consensus 90 L~~~G~~vf~~~~~~G~~~~lg~~~-------~~~~NiYg~~~P~~f~~~~~~~q~~~~~~~~~~~g~~VV~~Nk~~~N~ 162 (333) |||+|+|+|+ +|+.|+++++|++. ..+.++|++|+ .|+++++++|++++.+++..+|++||++|++.. 
T Consensus 66 L~q~g~V~f~-~d~~g~i~ilgy~a~~~~~~~~~~~~L~~~~i--~f~~~n~~~q~~~~~~~~~~~gn~Vv~~Nk~~~-- 140 (317) T protein:vir:96 66 LRAGYGVAIG-ETKTGKIDVLGYCSVNTNYLQPIKEPLQGKDI--TFIHNNILPKGKYKELTRYSDGNFVVLRNKRAS-- 140 (317) T ss_pred HHcCCceEEE-eccccceeEeeccccchhhccccccCcccccc--ccccccccCccceeeeeeeccCCeEEecCcccc-- Confidence 9999999777 55558899999986 56679999965 699999999999999999999999999999887 Q ss_pred cccCchhHHHHHHHHHHHHHHHHHHHHHhhhCcEEEEecCCChhHHHHHHHHhCCCceEeecCCcChhhhhheec-cCch Q lcl|NC_017732. 163 FYATDYDIVEHYAKQLATIKATERMNIMQMRSPYIVKGKKNGQLLQVLQSKIQNGDLFVGVEEGSDITEKIEKLD-LNVT 241 (333) Q Consensus 163 ~~~p~~~~ie~Ya~eLAeIe~ti~~n~~qqk~P~iIa~~~N~~S~k~l~n~i~ngepvV~~~e~~d~~d~I~vl~-t~a~ 241 (333) |+||+++|||||++||+|++||++|++|||+|+|||+++|++|+|+|+|+|++|||||+++|++|.+|+|..++ +.|+ T Consensus 141 -y~~didiie~Ya~eLAeI~~ti~~~~~q~k~p~Iik~~~n~~s~q~l~nki~ng~Pvv~~~~~~d~dd~i~~~~~~~v~ 219 (317) T protein:vir:96 141 -FLCDYNIITHYVMEMSEIANSRYSISIQAKVNTFIRNEGGSKDGQVMANNLFNGVPYTATTPKFDPEEHILTFNNASAV 219 (317) T ss_pred -cCCchhHHHHHHHHHHHHHHHHHHHHhhccCCEEEEeCCCcchHHHHHHHHhCCcceEEeccCCCCccccccccccccc Confidence 89999999999999999999999999999999999999999999999999999999999999999888653333 4555 Q ss_pred hhhHHHHHHHHHHHHHHHHhhcccccccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehhHHH Q lcl|NC_017732. 
242 DRTPSLQNAYRNTFNEMLTLFGIYNNPEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWNSTV 321 (333) Q Consensus 242 ~~ldkL~~e~~~~~sEllT~LGInN~n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~~~v 321 (333) +.+++||+|++++||||||||||+|+|+|||||||++||+||++||.||++||||+|+|||++|||+|||||||+|||++ T Consensus 220 ~~l~~lk~e~~nk~nEllT~lGi~n~~~dKkermv~~EA~SN~~~i~an~~iylk~R~ea~elIN~~YGLnI~v~~~d~~ 299 (317) T protein:vir:96 220 SFLPELKREQQNKISELNAMLGLNTLGVDKESGVSEIEAQSNTAFKKANENIYLGIRNEALNLINNKYGLNIHAEYRDNM 299 (317) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhccccchhhhcchhhhhhccchhhhcchhhhhhhhHHHHHHHHHhhCcceeEeecchh Confidence 66677789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcC Q lcl|NC_017732. 322 ASMFRDLGQKQG 333 (333) Q Consensus 322 ~~~~~~l~~~~~ 333 (333) ++++|||+|+|- T Consensus 300 ~~~~s~l~~~q~ 311 (317) T protein:vir:96 300 VAELSSIEKLQI 311 (317) T ss_pred hhhHhhHHHHHH Confidence 999999999999 No 4 >protein:vir:17 Length: 306 # NCBI annotation: head-tail connector (portal protein) # Family: family:all:2168 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073693;swissprot:trembl:q9fzw5;genbank:gi:12248117;hssp:P04332;interpro:IPR008016;uniprot:Q9FZW5;genbank:GeneID:919912 Probab=100.00 E-value=1.5e-117 Score=661.03 Aligned_cols=290 Identities=23% Similarity=0.311 Sum_probs=265.2 Q ss_pred cccccccchhhhHHHHHHHHHHHHHHHHHH-hhhhhhheecCCCCCcCHHHHHHHHHhcCcEEEEecCCcceeEEecCCC Q lcl|NC_017732. 36 TSSTTTGDNLGNNYVQFEAIQTFILIRQIK-DMLINLFKYENMPPTLNTAQLETMLRQMGGGVCVGKDELGDLVILGRAD 114 (333) Q Consensus 36 ~~~~t~~~~~~~~~v~~~~~~~~~l~~~y~-~~~~n~F~wenLP~~ids~~IE~~L~~~G~~vf~~~~~~G~~~~lg~~~ 114 (333) --...+++++.+++++||++|++|+|++|. .||+++|+||||||||||+|||++|||+|+|+|++|+++| +++++++. 
T Consensus 1 ~~~~~~s~~~~~~v~~~~~n~~~~~y~Ryl~~l~l~lf~wenlp~~id~~flE~~L~q~g~V~f~kd~~~g-yia~~G~~ 79 (306) T protein:vir:17 1 MLEDGFSYKTIGEIQRRRGNLWFRTYQRYLFSLAYQMFEWQGLPKTVDPIFLEKQLHQRGFVAFYKDEMYG-YLGVQGTL 79 (306) T ss_pred CCcccccHHHHHHHHHHhhhhhhhHHHHHHHhhhhhhhcccCCCCCcCHHHHHHHHHhCCCeEEEeechhh-hheecccc Confidence 223457899999999999999999997765 5556699999999999999999999999999999999999 66666688 Q ss_pred ccCCCcCcccCCceeeeecccccccccc------cccCCCcceEEEeecccccccccCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_017732. 115 ELGYNLYGNVIPSLFDGNNNFLQSKKVI------TNRNLKGDYVVFYNKQSFNDFYATDYDIVEHYAKQLATIKATERMN 188 (333) Q Consensus 115 ~~~~NiYg~~~P~~f~~~~~~~q~~~~~------~~~~~~g~~VV~~Nk~~~N~~~~p~~~~ie~Ya~eLAeIe~ti~~n 188 (333) ++++|+||| |++|+++++++|++++. +++.++|++||||| |++++||++|||||||+||+|++||++| T Consensus 80 ~~~~dly~q--p~~f~an~n~~~k~~~~~~y~~~~d~~~k~~~VviyN----N~~~~p~i~ilelya~~LAeI~~t~~~n 153 (306) T protein:vir:17 80 SGQINLYNQ--PNFYTASAPTYQKSFPLYWYDMGEDLNEKGQGIVIYN----NLERMPTLDILNLYAMNLAELKETIYVN 153 (306) T ss_pred CCccccccC--CcceecccccccceeEeeeeeccccccCCCCeEEecC----CccccccccHHHHHHHHHHHHHHHHHHH Confidence 889999999 67999999999988744 57889999999999 7779999999999999999999999999 Q ss_pred HHhhhCcEEEEecCCC-hhHHHHHHHHhCCCceEeecCCcChhhhhheeccCchhhhHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_017732. 
189 IMQMRSPYIVKGKKNG-QLLQVLQSKIQNGDLFVGVEEGSDITEKIEKLDLNVTDRTPSLQNAYRNTFNEMLTLFGIYNN 267 (333) Q Consensus 189 ~~qqk~P~iIa~~~N~-~S~k~l~n~i~ngepvV~~~e~~d~~d~I~vl~t~a~~~ldkL~~e~~~~~sEllT~LGInN~ 267 (333) ++|||+|+||++++|+ +|+++|+|+|++|+|||+++|+++ .|+|+||+|+||+++|+|++|++++||||||||||+|+ T Consensus 154 ~n~~K~P~iI~a~~n~~~s~~~l~n~~~~~~Pvv~~~~~~~-~D~i~V~~t~A~~v~d~L~~e~~~~~nEl~t~lGI~n~ 232 (306) T protein:vir:17 154 QNAQKTPVIIKAGDNDLFSMKQVYNKYEGNEPVIFAGKKFN-TDDIEVLKTDAPYVADKLTMLFKDQWNEAMTFLGLSNA 232 (306) T ss_pred HhHhhCCeEEEecCCccchHHHHhHhhhCCcceEeecCCcC-hhhceeeecCchhhhHHHHHHHHHHHHHHHHhhhhccc Confidence 9999999999887766 599999999999999999999999 88899999999999999999999999999999999999 Q ss_pred ccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehhHHHHHHHHHHHhhcC Q lcl|NC_017732. 268 PEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWNSTVASMFRDLGQKQG 333 (333) Q Consensus 268 n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~~~v~~~~~~l~~~~~ 333 (333) ++|||||||++||+|||+||++|++||||+|+|||++|||+|||||||+||+.|+.--.--..++| T Consensus 233 ~~DKkERvv~~Ea~SNn~~i~a~~~iylk~R~eaielIN~~yGLnI~v~~~~~~v~~~~~~~~~~~ 298 (306) T protein:vir:17 233 NTDKKERLIQSEVESNNDQIQGSANIYLAPRQEACRLINEYYGLNVSVKLRKELVGNGELHNAIEG 298 (306) T ss_pred cchhhhhhhhhhhhhhhhhhhhhhHhhhhhhHHHHHHHHhHhccceeEEeccccccccchhhcccc Confidence 999999999999999999999999999999999999999999999999999888873222234555 No 5 >protein:vir:121 Length: 308 # NCBI annotation: upper collar protein # Family: family:all:2168 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690644;swissprot:sw:q37891;genbank:gi:22855158;interpro:IPR008016;uniprot:Q37891;genbank:GeneID:955371 Probab=100.00 E-value=1e-115 Score=651.04 Aligned_cols=291 Identities=21% Similarity=0.315 Sum_probs=261.9 Q ss_pred ccccccccchhhhHHHHHHHHHHHHHHHHHHhhhhhhheecCCCCCcCHHHHHHHHHhcCcEEEEecCCcceeEEecCCC Q lcl|NC_017732. 
35 ETSSTTTGDNLGNNYVQFEAIQTFILIRQIKDMLINLFKYENMPPTLNTAQLETMLRQMGGGVCVGKDELGDLVILGRAD 114 (333) Q Consensus 35 ~~~~~t~~~~~~~~~v~~~~~~~~~l~~~y~~~~~n~F~wenLP~~ids~~IE~~L~~~G~~vf~~~~~~G~~~~lg~~~ 114 (333) -+---.+--+|++..++++++|++++.||+++||+++|+||||||||||+|||++|||+||||||||+++|+++|+| +. T Consensus 1 ~~~~~~~~k~i~~~~~~~~n~~~~~y~ryl~~l~l~lf~wE~lp~~iD~~~lE~~l~q~G~vvf~~d~~~gyi~~~G-~~ 79 (308) T protein:vir:12 1 MARKRNSYKSINDIQRMRGNRWYYHYYQYLCSLAYQLFEWERLPPSVDPSYLEKSIHQFGYVGFYKDPRIGYIACQG-AL 79 (308) T ss_pred CCcccchhhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCcccHHHHHHHHHhcCCeEEEeccccceeeecc-ee Confidence 11111234568999999999999999999999999999999999999999999999999999999999999555555 78 Q ss_pred ccCCCcCcccCCceeeeecccccccc---cccccCCCcceEEEeecccccccccCchhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_017732. 115 ELGYNLYGNVIPSLFDGNNNFLQSKK---VITNRNLKGDYVVFYNKQSFNDFYATDYDIVEHYAKQLATIKATERMNIMQ 191 (333) Q Consensus 115 ~~~~NiYg~~~P~~f~~~~~~~q~~~---~~~~~~~~g~~VV~~Nk~~~N~~~~p~~~~ie~Ya~eLAeIe~ti~~n~~q 191 (333) |+.+|+|++ |+.|.++...+|..+ .+....++++.||||| |++++||++|||||||+||+|++||++|++| T Consensus 80 s~~~d~yn~--p~~~~~S~f~~~n~f~l~~y~d~~~~~~~VviyN----N~~~~~~~~ile~ya~~LAei~~ti~~n~n~ 153 (308) T protein:vir:12 80 SGTVDHYNL--PDRFHASSVGYQNTFKLYNYSDMKEKNMGVAIYN----NDLKCSTLPALEMFAQDLAELKEIIAVNQNA 153 (308) T ss_pred ccccccccC--CCceeeccccccccccccccccccccCCeeEecC----CccccCchhHHHHHHHHHHHHHHHHHHHHhh Confidence 899999998 567765555554432 2445667888899999 8889999999999999999999999999999 Q ss_pred hhCcEEEEecCCC-hhHHHHHHHHhCCCceEeecCCcChhhhhheeccCchhhhHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_017732. 
192 MRSPYIVKGKKNG-QLLQVLQSKIQNGDLFVGVEEGSDITEKIEKLDLNVTDRTPSLQNAYRNTFNEMLTLFGIYNNPEQ 270 (333) Q Consensus 192 qk~P~iIa~~~N~-~S~k~l~n~i~ngepvV~~~e~~d~~d~I~vl~t~a~~~ldkL~~e~~~~~sEllT~LGInN~n~D 270 (333) ||+|+||++++|+ +|+++|+|+|++|+|||+++|++|+ |+|+||+|+||+++|+|+++++++||||||||||+|+++| T Consensus 154 ~K~P~iI~a~~n~qls~~nL~n~i~~g~Pvi~~~~~~d~-d~l~V~~tda~~v~d~l~~~k~~~~nE~~t~LGI~n~~~d 232 (308) T protein:vir:12 154 QKTPVLIAANDNNQLSLKNIYNQYEGNAPVIFVHESLDL-DNLKVFKTDAPYVVDKLNAQKNAVWNEVMTYLGIKNANLE 232 (308) T ss_pred hhCCeEEEecCCCcchHHHHHHHHhcCcceEEecCCCCc-ccceeeecCchhhhhHHHHHHHHHHHHHHHhhhhccccch Confidence 9999999886666 6999999999999999999999996 8899999999999999999999999999999999999999 Q ss_pred hhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehhHHH-HHHHHHHHhhcC Q lcl|NC_017732. 271 KKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWNSTV-ASMFRDLGQKQG 333 (333) Q Consensus 271 KKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~~~v-~~~~~~l~~~~~ 333 (333) ||||||++||+||++||++|++||||+|+|||++|||+|||||||+||+++ +++.+++++++. T Consensus 233 KkERmv~~Ea~SNn~qi~a~~niylk~R~ea~e~In~~yGLnI~v~~~~~~~~~~~~~~~~~~~ 296 (308) T protein:vir:12 233 KKERMVTSEVDSNDEQIESSGNIYLKARQEACNKISELYGLNLKVKFRYDIVEQMRLNAQGIEN 296 (308) T ss_pred hhhcchhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhcCCceeeechhhHHhhccChhhhhh Confidence 999999999999999999999999999999999999999999999999555 668889999887 No 6 >protein:vir:79400 Length: 323 # NCBI annotation: upper collar # Family: family:all:2168 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333664;genbank:gi:151266301;genbank:GeneID:5329882 Probab=100.00 E-value=7.6e-113 Score=635.25 Aligned_cols=293 Identities=24% Similarity=0.338 Sum_probs=254.0 Q ss_pred chhh-h-hhhhcchHHHhhhcccccccccchhhhHHHHHHHHHHHHHHHH-HHhhhhhhheecCCCCCcCHHHHHHHHHh Q lcl|NC_017732. 
16 LASS-Y-NNVIGNRRDLLQGIETSSTTTGDNLGNNYVQFEAIQTFILIRQ-IKDMLINLFKYENMPPTLNTAQLETMLRQ 92 (333) Q Consensus 16 ~~~~-~-~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~l~~~-y~~~~~n~F~wenLP~~ids~~IE~~L~~ 92 (333) .|-| | .|-|-|+--| ...|| -+-|+.|+.|||++ ++++|+|+|+||||||||||+|||++||| T Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~----------~~~~q~~~~~my~Ryl~~l~~n~f~wE~lP~~iD~~flE~~L~q 66 (323) T protein:vir:79 1 MAKSDYVKNGIYNKIML----KPPSS----------SEARQAQLEHMYRRQLMGKCLSRFTWEGLPNGIDPRFIEATIFN 66 (323) T ss_pred CCccchhhccccceeee----cCCCc----------hhhHHHHHHHHHHHHHHhHhhhhhhccCCCCCccHHHHHHHHHh Confidence 2222 2 1122221111 11122 25688899998865 47888899999999999999999999999 Q ss_pred cCcEEEEecCCcceeEEecCCCccCCCcCcccCCceeeeecccccccccccccCCCcceEEEeecccccccccCchhHHH Q lcl|NC_017732. 93 MGGGVCVGKDELGDLVILGRADELGYNLYGNVIPSLFDGNNNFLQSKKVITNRNLKGDYVVFYNKQSFNDFYATDYDIVE 172 (333) Q Consensus 93 ~G~~vf~~~~~~G~~~~lg~~~~~~~NiYg~~~P~~f~~~~~~~q~~~~~~~~~~~g~~VV~~Nk~~~N~~~~p~~~~ie 172 (333) +||||||||+++|+||++|++.++++++||+ |++|++++++.+++.+ .++++|||+| |++|+||++||| T Consensus 67 ~G~vvf~~d~~~g~yia~~~~~~g~~dly~q--pt~f~~n~N~~yqk~~-----~~~d~VVI~n----N~~~~p~idile 135 (323) T protein:vir:79 67 NGYSVFYFDTMFEMFMAMPATISGPLDIQDN--PTGYRVSRNGIYSREV-----SASDSVCIWG----NQVREPEIDLVL 135 (323) T ss_pred cCCeEEEecccccceeeeccccccccccccC--CcceeeccCceeEEEE-----EEeceeEecc----CccccccccHHH Confidence 9999999999999999999999999999999 7899999988877666 3588999999 999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhCcEEEEecCCC-hhHHHHHHHHhCCCceEeecCC---cChhhhhheeccCchhhhHHHH Q lcl|NC_017732. 
173 HYAKQLATIKATERMNIMQMRSPYIVKGKKNG-QLLQVLQSKIQNGDLFVGVEEG---SDITEKIEKLDLNVTDRTPSLQ 248 (333) Q Consensus 173 ~Ya~eLAeIe~ti~~n~~qqk~P~iIa~~~N~-~S~k~l~n~i~ngepvV~~~e~---~d~~d~I~vl~t~a~~~ldkL~ 248 (333) ||||+||+|++||++|++|||+|+||++++|+ +|+++|+|+|++|+|||+++|+ +|++|+|+||+|++++++|+|+ T Consensus 136 lya~eLAeI~~ti~~n~n~~K~P~iI~a~~n~~~s~~~L~nki~~g~Pvi~~~~~~~~~dl~d~IeV~~t~~n~~a~~l~ 215 (323) T protein:vir:79 136 SYAARLAQIDRTIEIDLLNERNPMIVACSQDQRLTIQNLISRIYDGEPVVWGTENLSMENLANTIGVFPLNQNAGAGAVS 215 (323) T ss_pred HHHHHHHHHHHHHHHHHhHhhCCeEEEecCCCcchHHHHHHHHhcCcceEEeccCCChhhhhhhhceeecCccchhhhHH Confidence 99999999999999999999999999887766 5999999999999999999998 5688899999999666666554 Q ss_pred ----H-HHHHHHHHHHHhhcccccccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehh--HHH Q lcl|NC_017732. 249 ----N-AYRNTFNEMLTLFGIYNNPEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWN--STV 321 (333) Q Consensus 249 ----~-e~~~~~sEllT~LGInN~n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~--~~v 321 (333) + +++++||||||||||+|+++|||||||++||+|||+||++|++||||+|+|||++|||+|||||||+|+ |+. T Consensus 216 ~~l~~e~k~~i~nEl~t~LGI~n~~~DKKERvv~~EA~SNn~~i~a~~~iylk~R~eaielIN~~yGLnIsvk~~~dd~~ 295 (323) T protein:vir:79 216 SIKHMESKSKIWGEALTMLGIMNVNSEKRERMVVEEASANSGQVLASRESFMKPRLLACEQINEKFGLNISCSWAVDDNA 295 (323) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccchhhhhhhhhhhhhhhhhhhhhhHhhhhhhHHHHHHHHhHhcCcceEEeeecccc Confidence 4 488889999999999999999999999999999999999999999999999999999999999999999 999 Q ss_pred HHHHHHH-HhhcC Q lcl|NC_017732. 
322 ASMFRDL-GQKQG 333 (333) Q Consensus 322 ~~~~~~l-~~~~~ 333 (333) ++.++|- .++.- T Consensus 296 ~~~ls~~~~~~~~ 308 (323) T protein:vir:79 296 APNLSDYLTELNT 308 (323) T ss_pred hhHHHHHHHhhcc Confidence 9988884 34332 No 7 >protein:vir:97362 Length: 327 # NCBI annotation: ORF007 # Family: family:all:2168 # MgeID: mge:1669 # MgeName: 66 # Cross-refs: genbank:acc:YP_239465;genbank:gi:66395194;genbank:GeneID:5130537 Probab=100.00 E-value=2.7e-108 Score=610.28 Aligned_cols=291 Identities=27% Similarity=0.400 Sum_probs=257.1 Q ss_pred hhhhhcchHHHhhhcccccccccchhhhHHHHHHHHHHHHHHHHHHhhhhhhheecCC-CCCcCHHHHHHHHHhcCcEEE Q lcl|NC_017732. 20 YNNVIGNRRDLLQGIETSSTTTGDNLGNNYVQFEAIQTFILIRQIKDMLINLFKYENM-PPTLNTAQLETMLRQMGGGVC 98 (333) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~l~~~y~~~~~n~F~wenL-P~~ids~~IE~~L~~~G~~vf 98 (333) .| ..+|.| .-.++++++.++++|+++|++|+|++|++|++.+++|++. |+||||+|||++|||+|+|+| T Consensus 1 ~~---~~~~~~-------~~~~~~~~~~~v~e~~~~~~n~~f~ry~~~L~~~l~Y~~~e~~~iD~~flE~~L~q~g~V~f 70 (327) T protein:vir:97 1 MN---NDKRGL-------NVELSKEISKRVVEHRNRFKRLMFNRYLEFLPLLINYTNRDTVGIDFIQLESALRQNINVVV 70 (327) T ss_pred CC---Cccccc-------ceehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCcCHHHHHHHHHcCCceEE Confidence 11 112222 2357899999999999999999999999999877777777 569999999999999999977 Q ss_pred EecCCcceeEEecCCCccCCCcCcccCCceeeeeccc------------------------ccccccccccCCCcceEEE Q lcl|NC_017732. 99 VGKDELGDLVILGRADELGYNLYGNVIPSLFDGNNNF------------------------LQSKKVITNRNLKGDYVVF 154 (333) Q Consensus 99 ~~~~~~G~~~~lg~~~~~~~NiYg~~~P~~f~~~~~~------------------------~q~~~~~~~~~~~g~~VV~ 154 (333) ++ |+.|+|++||+|+ ++|++++|. 
|++++.+ .|.|+..++ |++|++||| T Consensus 71 ~~-d~~~~~~~lGy~~----~~~~~~~p~-~n~n~~~~~~~pl~~kd~~~i~~~~~~~~~~~e~~k~~d~-~~~gn~VVi 143 (327) T protein:vir:97 71 GE-ARNKQIMILGYVN----NTYFNQAPN-FSSNFNFQFQKRLTKEDIYFIVPDYLIPDDCLQIHKLYDN-CMSGNFVVM 143 (327) T ss_pred EE-ecceeEEEEeeec----ccccccccc-cccccceeecccccccceeeEeecccCccchhhccccccc-cccCCeeEe Confidence 75 5555599999998 899999888 6555555 367777776 999999999 Q ss_pred eecccccccccCchhHHHHHHHHHHHHHHHHHHHHHhhhCcEEEEecCCChhHHHHHHHHhCCCceEeecCCcChhhhhh Q lcl|NC_017732. 155 YNKQSFNDFYATDYDIVEHYAKQLATIKATERMNIMQMRSPYIVKGKKNGQLLQVLQSKIQNGDLFVGVEEGSDITEKIE 234 (333) Q Consensus 155 ~Nk~~~N~~~~p~~~~ie~Ya~eLAeIe~ti~~n~~qqk~P~iIa~~~N~~S~k~l~n~i~ngepvV~~~e~~d~~d~I~ 234 (333) ||++++ |.||+++|||||++||+|++||++|++|||+|+|||+++|++|+|+|+|+|++|||||+++|++|.+|+| T Consensus 144 ~Nk~~~---~~~di~iie~Ya~eLAeI~~ti~~n~~q~k~p~Iik~~~n~~s~q~l~nki~ng~Pvv~~~~~~d~dd~i- 219 (327) T protein:vir:97 144 QNKPIQ---YNSDIEIIEHYTDELAEVALSRFSLIMQAKFSKIFKSEINDESINQLVSEIYNGAPFVKMSPMFNADDDI- 219 (327) T ss_pred ccCccc---cccchhhHHHHHHHHHHHHHHHHHHHhhhhCCEEEEcCCCchhHHHHHHHHhCCcceEEeccCCCCcccc- Confidence 999887 9999999999999999999999999999999999999999999999999999999999999999988865 Q ss_pred eeccCch---hhhHHHHHHHHHHHHHHHHhhcccccccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCC Q lcl|NC_017732. 
235 KLDLNVT---DRTPSLQNAYRNTFNEMLTLFGIYNNPEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGT 311 (333) Q Consensus 235 vl~t~a~---~~ldkL~~e~~~~~sEllT~LGInN~n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGl 311 (333) |+++++ +.+++||+|++++||||||||||+|+|+|||||||++||+||++||.+|++||||+| +||++|||+||| T Consensus 220 -~~~~~~~v~~~L~~lk~e~~nk~nEllT~lGinn~~~dKkermv~~EA~SN~~~i~an~~Iylk~R-ea~elIN~~YGL 297 (327) T protein:vir:97 220 -IDLTSNSVIPALTEMKREYQNKISELSNYLGINSLAVDKESGVSDEEAKSNRGFTTSNSNIYLKGR-EPITFLSKRYGL 297 (327) T ss_pred -cccccccccchhHHHHHHHHHHHHHHHHHhhhccccchhhhcchhhhhhccchhhccchhhcccch-hhHHHHHHhhCc Confidence 555544 666667889999999999999999999999999999999999999999999999999 899999999999 Q ss_pred ceeeehhHHHHHHHHHHHhhcC Q lcl|NC_017732. 312 DIKVQWNSTVASMFRDLGQKQG 333 (333) Q Consensus 312 nIkv~~~~~v~~~~~~l~~~~~ 333 (333) ||||+|||++++.++.++.+-- T Consensus 298 nI~v~~~d~~~~~ls~i~~~~~ 319 (327) T protein:vir:97 298 DIKPYYDDETTSKISMVDTLFK 319 (327) T ss_pred ceeEeecchhhhhHHHHHHhhc Confidence 9999999999988888877644 No 8 >protein:vir:9445 Length: 249 # NCBI annotation: upper collar protein # Family: family:all:2168 # MgeID: mge:168 # MgeName: phiP68 # Cross-refs: genbank:acc:NP_817335;genbank:gi:29565762;genbank:GeneID:1258937 Probab=100.00 E-value=2.4e-84 Score=478.98 Aligned_cols=214 Identities=30% Similarity=0.459 Sum_probs=193.5 Q ss_pred EEecCCCccCCCcCcccCCceeeeeccc------------------------ccccccccccCCCcceEEEeeccccccc Q lcl|NC_017732. 108 VILGRADELGYNLYGNVIPSLFDGNNNF------------------------LQSKKVITNRNLKGDYVVFYNKQSFNDF 163 (333) Q Consensus 108 ~~lg~~~~~~~NiYg~~~P~~f~~~~~~------------------------~q~~~~~~~~~~~g~~VV~~Nk~~~N~~ 163 (333) |+||+.. 
|.|.++.|+ |.+++++ .|.|+++++ +++|++|||+|++.+ T Consensus 1 ~~~~~~~----~~~~~~~~~-~~~n~~~~~~~~l~~~di~fi~~~~l~~~~~~ql~k~~d~-~~kg~~VVi~Nn~~~--- 71 (249) T protein:vir:94 1 MILGYVN----NTYFNQAPN-FSSNFNFQFQKRLTKEDIYFIVPDYLIPDDCLQIHKLYDN-CMSGNFVVMQNKPIQ--- 71 (249) T ss_pred Ceeeeec----ccccccCCC-cccccchhhhhccccCceeEeecccccCccceeeeccccc-cccCCeeEecCCccc--- Confidence 8999988 788877776 5555443 677888877 499999999998876 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHhhhCcEEEEecCCChhHHHHHHHHhCCCceEeecCCcChhhhhheeccCch-- Q lcl|NC_017732. 164 YATDYDIVEHYAKQLATIKATERMNIMQMRSPYIVKGKKNGQLLQVLQSKIQNGDLFVGVEEGSDITEKIEKLDLNVT-- 241 (333) Q Consensus 164 ~~p~~~~ie~Ya~eLAeIe~ti~~n~~qqk~P~iIa~~~N~~S~k~l~n~i~ngepvV~~~e~~d~~d~I~vl~t~a~-- 241 (333) |+||+++|||||++||+|++||++|++|||+|+|||+++|++|+|+|+|+|++|+|||+++|++|.+|+| |+++++ T Consensus 72 y~~didiie~Ya~eLAeI~~Ti~~n~~q~ktp~Iik~~~N~~s~q~l~nki~ngePvv~~~e~~d~dd~i--~~l~~n~v 149 (249) T protein:vir:94 72 YNSDIEIIEHYTDELAEVALSRFSLIMQAKFSKIFKSEINDESINQLVSEIYNGAPFVKMSPMFNADDDI--IDLTSNSV 149 (249) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHhhhhcCEEEEcCCCchhHHHHHHHHhCCcceEEeccCCCCcccc--eecccchh Confidence 8999999999999999999999999999999999999999999999999999999999999999988865 555554 Q ss_pred -hhhHHHHHHHHHHHHHHHHhhcccccccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehhHH Q lcl|NC_017732. 
242 -DRTPSLQNAYRNTFNEMLTLFGIYNNPEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWNST 320 (333) Q Consensus 242 -~~ldkL~~e~~~~~sEllT~LGInN~n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~~~ 320 (333) +.+++||+|++++||||||||||+|+|+|||||||++||+||++||.+|++||||+| +||++||++|||||||+|||+ T Consensus 150 ~~~L~~lK~e~~nk~nEllT~lGinN~~~dKkErmv~~EAsSNn~~i~an~nIylk~R-eA~elIN~~YGLnI~v~~~d~ 228 (249) T protein:vir:94 150 IPALTEMKREYQNKISELSNYLGINSLAVDKESGVSDEEAKSNRGFTTSNSNIYLKGR-EPITFLSKRYGLDIKPYYDDE 228 (249) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhccccchhhhcchhhhhhccchhhccchhhcccch-hhHHHHHHhhCcceeEeecch Confidence 566667889999999999999999999999999999999999999999999999999 899999999999999999999 Q ss_pred HHHHHHHHHhhcC Q lcl|NC_017732. 321 VASMFRDLGQKQG 333 (333) Q Consensus 321 v~~~~~~l~~~~~ 333 (333) +++.++.++.+-- T Consensus 229 ~~~~lS~i~~l~r 241 (249) T protein:vir:94 229 TTSKISMVDTLFK 241 (249) T ss_pred hhhhHHHHHHhhc Confidence 8888888776644 No 9 >protein:vir:9466 Length: 249 # NCBI annotation: upper collar protein # Family: family:all:2168 # MgeID: mge:169 # MgeName: 44AHJD # Cross-refs: genbank:acc:NP_817313;genbank:gi:29565739;genbank:GeneID:1258929 Probab=100.00 E-value=2.4e-84 Score=478.98 Aligned_cols=214 Identities=30% Similarity=0.459 Sum_probs=193.5 Q ss_pred EEecCCCccCCCcCcccCCceeeeeccc------------------------ccccccccccCCCcceEEEeeccccccc Q lcl|NC_017732. 108 VILGRADELGYNLYGNVIPSLFDGNNNF------------------------LQSKKVITNRNLKGDYVVFYNKQSFNDF 163 (333) Q Consensus 108 ~~lg~~~~~~~NiYg~~~P~~f~~~~~~------------------------~q~~~~~~~~~~~g~~VV~~Nk~~~N~~ 163 (333) |+||+.. 
|.|.++.|+ |.+++++ .|.|+++++ +++|++|||+|++.+ T Consensus 1 ~~~~~~~----~~~~~~~~~-~~~n~~~~~~~~l~~~di~fi~~~~l~~~~~~ql~k~~d~-~~kg~~VVi~Nn~~~--- 71 (249) T protein:vir:94 1 MILGYVN----NTYFNQAPN-FSSNFNFQFQKRLTKEDIYFIVPDYLIPDDCLQIHKLYDN-CMSGNFVVMQNKPIQ--- 71 (249) T ss_pred Ceeeeec----ccccccCCC-cccccchhhhhccccCceeEeecccccCccceeeeccccc-cccCCeeEecCCccc--- Confidence 8999988 788877776 5555443 677888877 499999999998876 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHhhhCcEEEEecCCChhHHHHHHHHhCCCceEeecCCcChhhhhheeccCch-- Q lcl|NC_017732. 164 YATDYDIVEHYAKQLATIKATERMNIMQMRSPYIVKGKKNGQLLQVLQSKIQNGDLFVGVEEGSDITEKIEKLDLNVT-- 241 (333) Q Consensus 164 ~~p~~~~ie~Ya~eLAeIe~ti~~n~~qqk~P~iIa~~~N~~S~k~l~n~i~ngepvV~~~e~~d~~d~I~vl~t~a~-- 241 (333) |+||+++|||||++||+|++||++|++|||+|+|||+++|++|+|+|+|+|++|+|||+++|++|.+|+| |+++++ T Consensus 72 y~~didiie~Ya~eLAeI~~Ti~~n~~q~ktp~Iik~~~N~~s~q~l~nki~ngePvv~~~e~~d~dd~i--~~l~~n~v 149 (249) T protein:vir:94 72 YNSDIEIIEHYTDELAEVALSRFSLIMQAKFSKIFKSEINDESINQLVSEIYNGAPFVKMSPMFNADDDI--IDLTSNSV 149 (249) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHhhhhcCEEEEcCCCchhHHHHHHHHhCCcceEEeccCCCCcccc--eecccchh Confidence 8999999999999999999999999999999999999999999999999999999999999999988865 555554 Q ss_pred -hhhHHHHHHHHHHHHHHHHhhcccccccchhhhhhhHhhhcchhhhhhhhhhhhcchhHHHHHHHhhcCCceeeehhHH Q lcl|NC_017732. 
242 -DRTPSLQNAYRNTFNEMLTLFGIYNNPEQKKERMIDREASSNNHVIEGMGDIYFNARQHAVDLLNLAFGTDIKVQWNST 320 (333) Q Consensus 242 -~~ldkL~~e~~~~~sEllT~LGInN~n~DKKERlv~~Ea~sNn~~i~~n~~iylK~RleA~elIN~~fGlnIkv~~~~~ 320 (333) +.+++||+|++++||||||||||+|+|+|||||||++||+||++||.+|++||||+| +||++||++|||||||+|||+ T Consensus 150 ~~~L~~lK~e~~nk~nEllT~lGinN~~~dKkErmv~~EAsSNn~~i~an~nIylk~R-eA~elIN~~YGLnI~v~~~d~ 228 (249) T protein:vir:94 150 IPALTEMKREYQNKISELSNYLGINSLAVDKESGVSDEEAKSNRGFTTSNSNIYLKGR-EPITFLSKRYGLDIKPYYDDE 228 (249) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhccccchhhhcchhhhhhccchhhccchhhcccch-hhHHHHHHhhCcceeEeecch Confidence 566667889999999999999999999999999999999999999999999999999 899999999999999999999 Q ss_pred HHHHHHHHHhhcC Q lcl|NC_017732. 321 VASMFRDLGQKQG 333 (333) Q Consensus 321 v~~~~~~l~~~~~ 333 (333) +++.++.++.+-- T Consensus 229 ~~~~lS~i~~l~r 241 (249) T protein:vir:94 229 TTSKISMVDTLFK 241 (249) T ss_pred hhhhHHHHHHhhc Confidence 8888888776644 No 10 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=3.01 E-value=22 Score=13.16 Aligned_cols=303 Identities=10% Similarity=0.025 Sum_probs=104.2 Q ss_pred CCcchhhhhcchhh--------------cchhhhhhhhcc-hHHHhhhcccccccccchhhhHHHHH-HHHHHHHHHHHH Q lcl|NC_017732. 1 MENQLFEDTWGYQT--------------GLASSYNNVIGN-RRDLLQGIETSSTTTGDNLGNNYVQF-EAIQTFILIRQI 64 (333) Q Consensus 1 ~~~~~~~~~~~~~~--------------~~~~~~~~~~~~-~~~~~~~~~~~~~t~~~~~~~~~v~~-~~~~~~~l~~~y 64 (333) +..+...++.|-+. -++..|-..|-+ .-..+-|..-+-+...+..-+....| .+++..++.... 
T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~~~ 117 (471) T protein:vir:10 38 KRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKKAYALTYPPTFDVDDKKVNDMIVDVLGDDYERISKQLC 117 (471) T ss_pred ccccchhhhhcccccccccccccccccceeccchhHHHHHhhhhhhcccCceeccCChHHHHHHHHHHhcCHHHHHHHHH Confidence 32222222222111 123333333322 22344443322222222111111111 223332222222 Q ss_pred HhhhhhhheecCCCCCcCHHHHHHHHHhcCcEEEEecCCcceeEEecCCCccCCCcCcccC---Cceeeeeccc------ Q lcl|NC_017732. 65 KDMLINLFKYENMPPTLNTAQLETMLRQMGGGVCVGKDELGDLVILGRADELGYNLYGNVI---PSLFDGNNNF------ 135 (333) Q Consensus 65 ~~~~~n~F~wenLP~~ids~~IE~~L~~~G~~vf~~~~~~G~~~~lg~~~~~~~NiYg~~~---P~~f~~~~~~------ 135 (333) +....-- .+|...+.+..-|.+.+---.+...+.+|.++. |..+..-... T Consensus 118 ~~~~~~G---------------------~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~ 176 (471) T protein:vir:10 118 VNAGNAG---------------------IAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDG 176 (471) T ss_pred HHHhhCC---------------------eEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccCCC Confidence 2222111 112222223233333222222233344443322 1111100000 Q ss_pred --------cccccccccc---------------------------------CCCcc--eEEEeecccccccccCchhHHH Q lcl|NC_017732. 136 --------LQSKKVITNR---------------------------------NLKGD--YVVFYNKQSFNDFYATDYDIVE 172 (333) Q Consensus 136 --------~q~~~~~~~~---------------------------------~~~g~--~VV~~Nk~~~N~~~~p~~~~ie 172 (333) +-...++..+ ..-|. +|-+.| |....|+.+-|. T Consensus 177 ~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~~~sd~e~v~ 252 (471) T protein:vir:10 177 KNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN----NEIETNDLKPIK 252 (471) T ss_pred ceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc----CCCCCCchHHHH Confidence 0000000000 01111 123334 334455555444 Q ss_pred HHHHHHHHHHHHHHHHHHhhhCcEEEEecCCChhHHHHHHHHhCCCceEeecC-CcChhhhhheeccCchhhhHHHHHHH Q lcl|NC_017732. 
173 HYAKQLATIKATERMNIMQMRSPYIVKGKKNGQLLQVLQSKIQNGDLFVGVEE-GSDITEKIEKLDLNVTDRTPSLQNAY 251 (333) Q Consensus 173 ~Ya~eLAeIe~ti~~n~~qqk~P~iIa~~~N~~S~k~l~n~i~ngepvV~~~e-~~d~~d~I~vl~t~a~~~ldkL~~e~ 251 (333) -....+..+-+-....+.....|+++......-..+........+ ..+.+.. +....-+++.+.-+.+ ...++... T Consensus 253 ~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~ 329 (471) T protein:vir:10 253 DLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRY-KMIKMDNDGMGDQSGVTTIAIDIP--TEARNLIL 329 (471) T ss_pred HHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcC-CeEEecCCCCccCccceEEeecCC--hHHHHHHH Confidence 444444444333344445667897765543322223333333322 2344432 2221223555543332 12233333 Q ss_pred HHHHHHHHHhhcccccccch---------hhhhhhHhhhcchhhhhhhhhhhhcchhHHH-HHHHhhcCCceeeehhHH- Q lcl|NC_017732. 252 RNTFNEMLTLFGIYNNPEQK---------KERMIDREASSNNHVIEGMGDIYFNARQHAV-DLLNLAFGTDIKVQWNST- 320 (333) Q Consensus 252 ~~~~sEllT~LGInN~n~DK---------KERlv~~Ea~sNn~~i~~n~~iylK~RleA~-elIN~~fGlnIkv~~~~~- 320 (333) ..-.+.++.+-+.-+...++ |.++..-++..+...-. ..-.++-|+..| +.++..=..+|.+.|++. T Consensus 330 ~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~--~~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~ 407 (471) T protein:vir:10 330 ERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQ--FRSGYATLVKMILKHLGLSDKLKIKQTWTRNS 407 (471) T ss_pred HHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhccCCCceeEEEeCCCC Confidence 33344445555544444333 22222223223222211 112233333222 334433356678888733 Q ss_pred ---HHHHHHHHHhhcC Q lcl|NC_017732. 321 ---VASMFRDLGQKQG 333 (333) Q Consensus 321 ---v~~~~~~l~~~~~ 333 (333) ..+...-+.+..| T Consensus 408 p~n~~e~~~~~~kl~g 423 (471) T protein:vir:10 408 INNDTEMAQVVSTLAT 423 (471) T ss_pred CCCHHHHHHHHHHHhc Confidence 3345555566666 Done!