Query lcl|NC_010363.1_cdsid_YP_001687528.1 [gene=ASCCphi28_gp15] [protein=lower collar protein] [protein_id=YP_001687528.1] [location=complement(9820..10527)] Match_columns 235 No_of_seqs 1 out of 4 Neff 1.0 Searched_HMMs 1612 Date Thu Nov 7 12:37:27 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_15 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_15_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:18 Length: 287 # N 72.7 0.15 9.3E-05 25.1 7.5 188 1-235 1-286 (287) 2 protein:vir:79392 Length: 240 52.2 0.56 0.00035 22.0 9.0 161 64-235 1-212 (240) 3 protein:vir:122 Length: 293 # 51.3 0.58 0.00036 21.9 6.7 162 1-235 1-168 (293) 4 protein:vir:5206 Length: 293 # 43.4 0.69 0.00043 21.5 5.7 150 1-235 1-154 (293) 5 protein:vir:4733 Length: 194 # 27.8 1.8 0.0011 19.2 7.2 173 49-235 1-194 (194) 6 protein:vir:106985 Length: 276 8.3 5.7 0.0035 16.4 2.8 132 86-235 1-162 (276) 7 protein:vir:80835 Length: 464 1.4 31 0.019 12.4 0.5 54 1-79 405-464 (464) 8 protein:vir:3989 Length: 392 # 1.3 51 0.032 11.2 4.4 176 1-235 127-327 (392) 9 protein:vir:1023 Length: 392 # 1.3 51 0.032 11.2 4.4 176 1-235 127-327 (392) 10 protein:vir:78191 Length: 351 1.2 37 0.023 12.0 0.5 84 1-97 264-351 (351) No 1 >protein:vir:18 Length: 287 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073694;swissprot:trembl:q9fzw4;genbank:gi:12248118;uniprot:Q9FZW4;genbank:GeneID:919889 Probab=72.71 E-value=0.15 Score=25.11 Aligned_cols=188 Identities=22% Similarity=0.358 Sum_probs=88.8 Q ss_pred CCCccchhhhHHhhhhhhhhhccchhccchhhhhhhhhhhccCcccccccccccccccchhhccchHHHHhhhhhhhhcc Q lcl|NC_010363. 1 MTPKTMKLSEYASKLVYQREVNTGYKSGFMGRVVHDPIKSFSTPKNSKFDSGLEGLDLSFDDHISQKEMIEKSMPDIFNN 80 (235) Q Consensus 1 mtpktmklseyasklvyqrevntgyksgfmgrvvhdpiksfstpknskfdsglegldlsfddhisqkemieksmpdifnn 80 (235) |..-||+|++|..-. |.|+.| ++-.|-|||+-|.+|.- T Consensus 1 ~~~~~~~~~~~~~~~-------------------------------~~~~~~-----------~~~~~~~~~~~~~~~~~ 38 (287) T protein:vir:18 1 MASFTMPLREIVEWA-------------------------------TQFDNK-----------LTRNEKIEEGRKKLFDF 38 (287) T ss_pred CcceeehHHHHHHHH-------------------------------hhhccc-----------chhhHHHhhhhhhhhhh Confidence 999999999885321 122222 34556699999998863 Q ss_pred eeeEeccchhhHHHHHHHHHHHHhhhhcCCcchhhhhhhhhhhhHHHH--HHHHHHhHhhhh-------cchhhhccccc Q lcl|NC_010363. 81 ITLTFNNTELITDEIKSMFIKRFWNKRLGMPTIAQWQGELEYYFMEEC--KNALIQNYLLST-------INIDDFLDGGS 151 (235) Q Consensus 81 itltfnntelitdeiksmfikrfwnkrlgmptiaqwqgeleyyfmeec--knaliqnyllst-------iniddfldggs 151 (235) ..-+ -|-...|+..+|||-||-|+.|.-||+.+.=.||-|.+--. -|+++..-++.+ -+-+.-.|+.. T Consensus 39 -~y~~--~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~mp~~n~~~ese~vntN~~~nTdant~tNtD~nT 115 (287) T protein:vir:18 39 -FYPI--ETDYKKEFETKFIKHFYFREIGFETEGRFKFALEEWLNLNMPYWNKIIESTHLDYNPLYNVDYKKDSDLIRNL 115 (287) T ss_pred -cCcc--chHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHhhcchhhhhHhhhhccCCccccccccCCCCcccCC Confidence 1112 55667788999999999999999999887766666554221 133333222210 00000111100 Q ss_pred cc-------CCCccc-ccccCCC-----------------------------CCCCceeeecchhHHHHHHHHH------ Q lcl|NC_010363. 152 VS-------GSNNTD-SMQGNKN-----------------------------TPDGSTLNITHDQKMLLDRATS------ 188 (235) Q Consensus 152 vs-------gsnntd-smqgnkn-----------------------------tpdgstlnithdqkmlldrats------ 188 (235) -+ ++.+++ ....+.+ .|++.+--.+.+. --+.++. T Consensus 116 t~ndn~dtdsnt~ad~ntntdtnTntd~nTtantdtntD~NTt~n~~tnt~dN~d~ntd~ntd~n--t~d~~T~~s~tns 193 (287) T protein:vir:18 116 DQVDNRVTDSKIENNGKASSESNVITSEKGEANSIQDADRNSTAKKKRMFEDTPDGRLDIVNDNN--IIQYATDLTQEDS 193 (287) T ss_pred CCCCCcccccCcccCCCcCCCCCCCCCcCCCCCCccccccccccCCCcCcccCCCCccccccCCC--CccccccCCCccc Confidence 00 000000 0000000 0010000000000 0000000 Q ss_pred ------------------HHHHhhcC--------------Cccccc----------hh--hHHHHHHHHhhhhH--HHHH Q lcl|NC_010363. 189 ------------------VVRAIGQG--------------SSDQNT----------KI--AKYQAFLMAQRTSL--IDDV 222 (235) Q Consensus 189 ------------------vvraigqg--------------ssdqnt----------ki--akyqaflmaqrtsl--iddv 222 (235) -......+ .++.|+ |. ..|-..+|--|..| |++. T Consensus 194 n~~~nTd~ntnsntdtnsd~nttsn~~~nstsn~nsn~d~nSd~N~~~n~n~~s~~~~gt~s~~~~~~~~~~~~~~~~~~ 273 (287) T protein:vir:18 194 TDSVKDKIKNDSSSKNDSTGNTTAEGKTNNITEGNNVDDFKSEKDEKQKLNDHIYGKQGNVSYPQLIKEHREAILNVERM 273 (287) T ss_pred cCCCCCCcccCCCccccCCCccccCCCCCcccCCccCCCCCCCCCccccccccccccccceeHHHHHHHHHHHHHhHHHH Confidence 00000000 000000 01 13555666666665 6888 Q ss_pred HHhcchheeeecC Q lcl|NC_010363. 223 ILQNARKLFMLIG 235 (235) Q Consensus 223 ilqnarklfmlig 235 (235) |..-...||||+= T Consensus 274 ~~~~~~~~~~~~~ 286 (287) T protein:vir:18 274 IFDQMEELFMFVY 286 (287) T ss_pred HHHHHHHHHhhhc Confidence 9989999999999 No 2 >protein:vir:79392 Length: 240 # NCBI annotation: lower collar # Family: family:all:5217 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333665;genbank:gi:151266302;genbank:GeneID:5329873 Probab=52.20 E-value=0.56 Score=21.96 Aligned_cols=161 Identities=20% Similarity=0.299 Sum_probs=72.1 Q ss_pred cchHH-HHhhhhhhhhcceeeEecc----chhhHHHHHHHHHHHHhhhhcCCcchhhhhhhhhhhhHHHHHHHHHHhHhh Q lcl|NC_010363. 64 ISQKE-MIEKSMPDIFNNITLTFNN----TELITDEIKSMFIKRFWNKRLGMPTIAQWQGELEYYFMEECKNALIQNYLL 138 (235) Q Consensus 64 isqke-mieksmpdifnnitltfnn----telitdeiksmfikrfwnkrlgmptiaqwqgeleyyfmeecknaliqnyll 138 (235) .|-.. |+.. .-.+ .|=.+-..| -|-...|...|||+.||-+..|.-|++.+.=+||-+.+--.- ....+++ T Consensus 1 ~~~~~~~~~~-~~~~-~~~~~~~~~YPIfDesYrk~FEt~Fir~FYmrEIGFETeg~Fkf~Le~wL~lnMP--yyNk~~e 76 (240) T protein:vir:79 1 MSVTTIMLRD-VVKL-TNDHIGLDNYPIFDESYRKTLNDRIKREYWLQEIAHETIDIFIWRMSLRMDLIMP--RYNRMYL 76 (240) T ss_pred CchhhhHHHH-HHHh-hcccccccccCccchHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHhhcc--hhHHHHH Confidence 11111 1110 0000 011111222 233456788899999999999999999887777765542100 0012333 Q ss_pred hhc---------------chhhh-cccccccCC---Ccccc---cccCCCCCCCceeeecchhHHHHHHHHHHHHHhhcC Q lcl|NC_010363. 139 STI---------------NIDDF-LDGGSVSGS---NNTDS---MQGNKNTPDGSTLNITHDQKMLLDRATSVVRAIGQG 196 (235) Q Consensus 139 sti---------------niddf-ldggsvsgs---nntds---mqgnkntpdgstlnithdqkmlldratsvvraigqg 196 (235) |+. +-|+= .+-|.-+-+ +-||+ -.-|..|||+.--+ |- |-|+++-.+---| T Consensus 77 sEl~~YdPLen~r~~s~T~~d~r~~~sG~~~etG~gs~tddn~kr~~~sDtPDtRL~~---Dg----dYAS~isd~~t~~ 149 (240) T protein:vir:79 77 AELQNTDPLEGNRHYSRTGQDGRSQNSGINHQTGSGSGTNESKGRTVGSDTPQTRLAG---DG----DYATSISDASTGG 149 (240) T ss_pred HHhhccccccccccccccCCccceeecCccccccccccccccccccccCCCcchhhhc---cc----hhhhhhhhhhcCC Confidence 332 11111 011111000 01111 11133467664332 32 5555554443222 Q ss_pred Ccc----------------ccchhhHH-----HHH-HHHhhhhH--HHHHHHhcchheeeecC Q lcl|NC_010363. 197 SSD----------------QNTKIAKY-----QAF-LMAQRTSL--IDDVILQNARKLFMLIG 235 (235) Q Consensus 197 ssd----------------qntkiaky-----qaf-lmaqrtsl--iddvilqnarklfmlig 235 (235) +++ .|.+...| |++ .+.-|..| |++.|..-...|||+|= T Consensus 150 ~s~s~~dSdS~t~st~n~snNq~~~s~Gk~G~~syai~eyR~alL~ve~~if~em~eLFM~vy 212 (240) T protein:vir:79 150 SSTSRNESDSTSSSTSNYSNNQNSESWGYSGSKARAIAEYRSTLLNVDDLVIRELSDLFMGIW 212 (240) T ss_pred ccccccccccccccccccccccchhhhcccchHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhc Confidence 221 11121111 222 33345444 68889999999999874 No 3 >protein:vir:122 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690645;swissprot:sw:q37892;genbank:gi:22855159;uniprot:Q37892;genbank:GeneID:955372 Probab=51.35 E-value=0.58 Score=21.86 Aligned_cols=162 Identities=24% Similarity=0.368 Sum_probs=69.1 Q ss_pred CCCccchhhhHHhhhhhhhhhccchhccchhhhhhhhhhhccCcccccccccccccccchhhccchHHHHhhhhhhhhcc Q lcl|NC_010363. 1 MTPKTMKLSEYASKLVYQREVNTGYKSGFMGRVVHDPIKSFSTPKNSKFDSGLEGLDLSFDDHISQKEMIEKSMPDIFNN 80 (235) Q Consensus 1 mtpktmklseyasklvyqrevntgyksgfmgrvvhdpiksfstpknskfdsglegldlsfddhisqkemieksmpdifnn 80 (235) |..-||+|++|..-.- .|++| +|-.|-|||+-|.+|.- T Consensus 1 ~~~~~~~~~~~~~~~~-------------------------------~~~~~-----------~~~~~~~~~~~~~~~~~ 38 (293) T protein:vir:12 1 MASYTMKLSTYIEMWS-------------------------------QYETG-----------LSMAEKIEKGRPKLFDF 38 (293) T ss_pred CcceeehHHHHHHHHh-------------------------------hccCc-----------cchhhhhhhhhhhhhhc Confidence 9999999998854211 33333 46778899999999863 Q ss_pred eeeEeccchhhHHHHHHHHHHHHhhhhcCCcchhhhhhhhhhhhHHHH--HHHHHHhHhhhhcchhhhcccccccCC--- Q lcl|NC_010363. 81 ITLTFNNTELITDEIKSMFIKRFWNKRLGMPTIAQWQGELEYYFMEEC--KNALIQNYLLSTINIDDFLDGGSVSGS--- 155 (235) Q Consensus 81 itltfnntelitdeiksmfikrfwnkrlgmptiaqwqgeleyyfmeec--knaliqnyllstiniddfldggsvsgs--- 155 (235) ..-| --|-...|...|||+.||-+..|.-|++.+.=+||-+.+--. -|.|+.-=++.+ +-++.-...++ T Consensus 39 -~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~s~~~~t----~~~nnT~~~~tsns 112 (293) T protein:vir:12 39 -QYPI-FDESYRKVFETHFIRNFYMREIGFETEGLFKFNLETWLIINMPYFNKLFESELIKY----DPLENTRLNTTGNK 112 (293) T ss_pred -cCCc-ccchHHHHHHHHHHHHHHHHHhhccchhHHHHHHHHHHhhhcchhcchhhcccccc----CCCcccccccccCc Confidence 1111 134556788899999999999999999988777776654321 111221111111 11111111111 Q ss_pred -CcccccccCCCCCCCceeeecchhHHHHHHHHHHHHHhhcCCccccchhhHHHHHHHHhhhhHHHHHHHhcchheeeec Q lcl|NC_010363. 156 -NNTDSMQGNKNTPDGSTLNITHDQKMLLDRATSVVRAIGQGSSDQNTKIAKYQAFLMAQRTSLIDDVILQNARKLFMLI 234 (235) Q Consensus 156 -nntdsmqgnkntpdgstlnithdqkmlldratsvvraigqgssdqntkiakyqaflmaqrtsliddvilqnarklfmli 234 (235) ++|+. +...+.+|++-.=. .+-.-.=+.+..+.++.-.. +.-.+.-++ +--. -. T Consensus 113 ~~sTd~--~~nt~sd~st~~ng----------~s~ttts~~t~~s~~t~s~~------ntt~s~t~~----stnt---t~ 167 (293) T protein:vir:12 113 KNDTER--NDNRDTTGSMKADG----------KSNTKTSDKTNATGSSKEDG------KTTGSVTDD----NFNR---KI 167 (293) T ss_pred ccccCC--CCCcCccccccccc----------ccccCCCCCccccccccCcc------ccCCcccCC----CCCC---CC Confidence 11111 11111222221100 00000001111111110000 000000000 0000 00 Q ss_pred C Q lcl|NC_010363. 235 G 235 (235) Q Consensus 235 g 235 (235) + T Consensus 168 s 168 (293) T protein:vir:12 168 D 168 (293) T ss_pred C Confidence 1 No 4 >protein:vir:5206 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040729;genbank:gi:9626400;genbank:GeneID:1260971 Probab=43.38 E-value=0.69 Score=21.45 Aligned_cols=150 Identities=23% Similarity=0.304 Sum_probs=65.8 Q ss_pred CCCccchhhhHHhhhhhhhhhccchhccchhhhhhhhhhhccCcccccccccccccccchhhccchHHHHhhhhhhhhcc Q lcl|NC_010363. 1 MTPKTMKLSEYASKLVYQREVNTGYKSGFMGRVVHDPIKSFSTPKNSKFDSGLEGLDLSFDDHISQKEMIEKSMPDIFNN 80 (235) Q Consensus 1 mtpktmklseyasklvyqrevntgyksgfmgrvvhdpiksfstpknskfdsglegldlsfddhisqkemieksmpdifnn 80 (235) |..-||+|++|..-. +.|+-| ++-.|-|||.-|.+|. T Consensus 1 ~~~~~~~~~~~~~~~-------------------------------~~~~~~-----------~~~~~~~~~~~~~~~~- 37 (293) T protein:vir:52 1 MSSYTMQLRTYIEMW-------------------------------SQGETG-----------LSTAEKIEKGRPKLFD- 37 (293) T ss_pred CcceeehHhHHHhhh-------------------------------hhcCCc-----------ccccchhhhhhhhhhc- Confidence 999999998875321 112222 3445669999999887 Q ss_pred eeeEecc-chhhHHHHHHHHHHHHhhhhcCCcchhhhhhhhhhhhH---HHHHHHHHHhHhhhhcchhhhcccccccCCC Q lcl|NC_010363. 81 ITLTFNN-TELITDEIKSMFIKRFWNKRLGMPTIAQWQGELEYYFM---EECKNALIQNYLLSTINIDDFLDGGSVSGSN 156 (235) Q Consensus 81 itltfnn-telitdeiksmfikrfwnkrlgmptiaqwqgeleyyfm---eecknaliqnyllstiniddfldggsvsgsn 156 (235) +-|.= -|-...|...+|||.||-+..|.-|++.+.=+||-+.+ .-|-+. |.-.-+| .+-++.-....+. T Consensus 38 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~----~~s~~~n-t~p~dNt~~~~tt 110 (293) T protein:vir:52 38 --FNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKL----FESELIK-YDPLENTRVGVKS 110 (293) T ss_pred --cCCCccchhHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHhhhccccccc----ccccccc-cCCcccccccccc Confidence 22211 45567788999999999999999999887777776653 222211 1111122 1222221111111 Q ss_pred cccccccCCCCCCCceeeecchhHHHHHHHHHHHHHhhcCCccccchhhHHHHHHHHhhhhHHHHHHHhcchheeeecC Q lcl|NC_010363. 157 NTDSMQGNKNTPDGSTLNITHDQKMLLDRATSVVRAIGQGSSDQNTKIAKYQAFLMAQRTSLIDDVILQNARKLFMLIG 235 (235) Q Consensus 157 ntdsmqgnkntpdgstlnithdqkmlldratsvvraigqgssdqntkiakyqaflmaqrtsliddvilqnarklfmlig 235 (235) +. . +-.++++++-.-... -+=|....+.++.- + -.+.- -=---+ T Consensus 111 n~---~-~sTs~d~st~tdss~------------t~d~~stTds~t~~----------------~--s~~ta-nst~ta 154 (293) T protein:vir:52 111 NT---K-NDTDRNDNRDVKQDL------------TSNGTSSTDAKQND----------------T--SKTTG-NEKSSG 154 (293) T ss_pred cc---c-ccccccCCcccCCcc------------ccCCccCCCCCcCC----------------C--ccccc-CcccCC Confidence 10 0 001111111000000 00011111111000 0 00000 000011 No 5 >protein:vir:4733 Length: 194 # NCBI annotation: collar protein # Family: family:all:28811 # MgeID: mge:103 # MgeName: Cp-1 # Cross-refs: genbank:acc:NP_044824;swissprot:trembl:q37996;genbank:gi:9629536;uniprot:Q37996;genbank:GeneID:1261240 Probab=27.76 E-value=1.8 Score=19.17 Aligned_cols=173 Identities=27% Similarity=0.503 Sum_probs=83.1 Q ss_pred ccccccccccchh-hccchHHHHhhhhhhhhcceeeEeccchhhHHHHHHHHHHHHhhhhcCCcchhhhhhhhhhhhHHH Q lcl|NC_010363. 49 FDSGLEGLDLSFD-DHISQKEMIEKSMPDIFNNITLTFNNTELITDEIKSMFIKRFWNKRLGMPTIAQWQGELEYYFMEE 127 (235) Q Consensus 49 fdsglegldlsfd-dhisqkemieksmpdifnnitltfnntelitdeiksmfikrfwnkrlgmptiaqwqgeleyyfmee 127 (235) ...-|+||...=. ..+--...|.....++|.+..|. .-+.|..|.-|.|.|+|...|+.|.|.+|-.||-..-.| T Consensus 1 mtgrldglavdengeflhyntiidqtynelfkdmelv----ngvsdnfkkefckhfynreigletfarfqialeevlnne 76 (194) T protein:vir:47 1 MTGRLDGLAVDENGEFLHYNTIIDQTYNELFKDMELV----NGVSDNFKKEFCKHFYNREIGLETFARFQIALEEVLNNE 76 (194) T ss_pred CcccccceeecCCCceeehhhHHHhhHHHHHHHHHHh----hhhhhhHHHHHHHHHhcchhhhhHHHHHHHHHHHHhhhh Confidence 2223444432211 11122234444445555443331 125688999999999999999999999999999999999 Q ss_pred HHHHHHHhHhhhhc--chhhhcccccccCCCcccccccCCCCCCCceeeecc--hh--HH--------HHHHHHHHHHHh Q lcl|NC_010363. 128 CKNALIQNYLLSTI--NIDDFLDGGSVSGSNNTDSMQGNKNTPDGSTLNITH--DQ--KM--------LLDRATSVVRAI 193 (235) Q Consensus 128 cknaliqnyllsti--niddfldggsvsgsnntdsmqgnkntpdgstlnith--dq--km--------lldratsvvrai 193 (235) |-|.. .||...- -|.|. .-|-|.|.. ||. ..||..|.|.. -| |. .+.-|..+|..- T Consensus 77 cfnlf--kylaeirnkaikdl------nqsmnidtv-gnq-kadgqalqianttpqerkeivfterygvieyadnlvenh 146 (194) T protein:vir:47 77 CFNLF--KYLAEIRNKAIKDL------NQSMNIDTV-GNQ-KADGQALQIANTTPQERKEIVFTERYGVIEYADNLVENH 146 (194) T ss_pred HHHHH--HHHHHHHhHHHHhh------hhhcccccc-ccc-ccccceeeeccCChhhhhhhhhhhhcchhHHHHHHHhhh Confidence 98754 2332211 12232 122333332 221 34666666532 11 11 234444555433 Q ss_pred hcCCccccchhhHHHHHHH---HhhhhHHHHHHHh---cchheeeecC Q lcl|NC_010363. 194 GQGSSDQNTKIAKYQAFLM---AQRTSLIDDVILQ---NARKLFMLIG 235 (235) Q Consensus 194 gqgssdqntkiakyqaflm---aqrtsliddvilq---narklfmlig 235 (235) -....|......-+..--. -||..-+.|+-.| -..|||.-+- T Consensus 147 qknnadtksnvsgwsgsslaerlqrnaelkdiqfqifnicdklflqvf 194 (194) T protein:vir:47 147 QKNNADTKSNVSGWSGSSLAERLQRNAELKDIQFQIFNICDKLFLQVF 194 (194) T ss_pred hccccccccccccccchhHHHHHhhccchhhhHHHHHHHHHHHHHhcC Confidence 2222222222222221111 2344444444222 2344443333 No 6 >protein:vir:106985 Length: 276 # NCBI annotation: neck protein gp13 # Family: family:all:1103 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195127;genbank:gi:58532904;uniprot:Q5GQW0;genbank:GeneID:3260482 Probab=8.26 E-value=5.7 Score=16.44 Aligned_cols=132 Identities=18% Similarity=0.186 Sum_probs=68.9 Q ss_pred ccchhhHHHHHHHHHHHHhhhhcCCcchh------hh----hhhhhhhhHHHHHHHHHHhHhhhhcchhhhcccccc--- Q lcl|NC_010363. 86 NNTELITDEIKSMFIKRFWNKRLGMPTIA------QW----QGELEYYFMEECKNALIQNYLLSTINIDDFLDGGSV--- 152 (235) Q Consensus 86 nntelitdeiksmfikrfwnkrlgmptia------qw----qgeleyyfmeecknaliqnyllstiniddfldggsv--- 152 (235) =++..-..|+|.. --+|||-|.|- |- +..|| +|-|-+-+.+.+-|+.-.++-||.-.|-+. T Consensus 1 ma~p~s~~eLkdy-----iLRrLGaPii~InVtd~Qi~DcId~Ale-ly~EyHydG~~k~y~~~~i~~dd~~~~~~~~~~ 74 (276) T protein:vir:10 1 MAKPSSKQELIDY-----CKRQLGAPVLQINTDSAQDDDIIDQAIQ-YYHEYHFDGIERMYLKHQFTADDVTRFTSSDQL 74 (276) T ss_pred CCCCCCHHHHHHH-----HHHhcCCceEEEEcCHHHHHHHHHHHHH-HhhhhcccccccceEEEEechHHhhhhhhhhhc Confidence 1111122333332 23589999873 22 33444 457888899999999888888887554111 Q ss_pred cCCCcccccccCCCCCCCceeeecchhHHHHHHHHHHHHHhhcCCccc--cchhhHHHHHHHHhhhhHHHH-HHHhcchh Q lcl|NC_010363. 153 SGSNNTDSMQGNKNTPDGSTLNITHDQKMLLDRATSVVRAIGQGSSDQ--NTKIAKYQAFLMAQRTSLIDD-VILQNARK 229 (235) Q Consensus 153 sgsnntdsmqgnkntpdgstlnithdqkmlldratsvvraigqgssdq--ntkiakyqaflmaqrtslidd-vilqnark 229 (235) .....+.+|.|+.|-|- +-|-...|...+|.+|+.- |--=|+||-|||.-....-+- +++-|-|. T Consensus 75 ~~~~s~~~~~g~~~Y~~------------~pd~v~gi~~v~~g~s~~~g~n~f~~~~q~fl~d~~~~~~~ql~~~g~~~~ 142 (276) T protein:vir:10 75 TTAPNNDDWENRNNYIQ------------VPDAVIGISKVFGVSSNFLRNNLFGLSNQYYLMDLFSFSTGSAFSFGNFDL 142 (276) T ss_pred ccccceeeeeccCccee------------cCcchhhhhhhcccCccccCCccccccHHHHHHHHhcccccceeeccceee Confidence 12223567888888653 1233344555566555443 333489999998654221000 11123332 Q ss_pred ee-------------eec-C Q lcl|NC_010363. 230 LF-------------MLI-G 235 (235) Q Consensus 230 lf-------------mli-g 235 (235) += ++- | T Consensus 143 ~dy~~ve~~~~~id~~~~~~ 162 (276) T protein:vir:10 143 TNYYMIKQHFETIDMIINTG 162 (276) T ss_pred eeEEEEEEEecccceeecCC Confidence 20 110 1 No 7 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=1.36 E-value=31 Score=12.40 Aligned_cols=54 Identities=22% Similarity=0.300 Sum_probs=15.6 Q ss_pred CCCccchhhhHHhhhhhhhhhccchhccchhhhhhhhhhhccCcccccccccc-cccccchhhccchHHHHhhh-----h Q lcl|NC_010363. 1 MTPKTMKLSEYASKLVYQREVNTGYKSGFMGRVVHDPIKSFSTPKNSKFDSGL-EGLDLSFDDHISQKEMIEKS-----M 74 (235) Q Consensus 1 mtpktmklseyasklvyqrevntgyksgfmgrvvhdpiksfstpknskfdsgl-egldlsfddhisqkemieks-----m 74 (235) |+|.++.|-|.-.-+-. |.-...+ .. .|---. -.|-|- ..++-+.-|. . T Consensus 405 ms~~ti~l~ellPm~rl-------------------plA~~n~-~~-~waVl~YGaLal~----aPk~~~~ikNv~~~~~ 459 (464) T protein:vir:80 405 LTPSVVHLFELLPMMRL-------------------PLAQVNA-SV-TFAVLWYGALALR----APKKWARIKNVKYIAT 459 (464) T ss_pred CCchHHHHHHHHHhhhC-------------------Cchhccc-ch-hhhhhhhhHHhhh----ccccceEEEEEEEeec Confidence 99988888776543221 1111110 00 000000 001110 0111110000 0 Q ss_pred hhhhc Q lcl|NC_010363. 75 PDIFN 79 (235) Q Consensus 75 pdifn 79 (235) --||| T Consensus 460 ~~~~~ 464 (464) T protein:vir:80 460 GNVFN 464 (464) T ss_pred ccCCC Confidence 12233 No 8 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=1.28 E-value=51 Score=11.18 Aligned_cols=176 Identities=19% Similarity=0.258 Sum_probs=56.4 Q ss_pred CCCccchh--hhHHhhhhhhhhhccchh----ccchhhhhhhhhhhccCccccccccccccccc--chhhccchHHHHhh Q lcl|NC_010363. 1 MTPKTMKL--SEYASKLVYQREVNTGYK----SGFMGRVVHDPIKSFSTPKNSKFDSGLEGLDL--SFDDHISQKEMIEK 72 (235) Q Consensus 1 mtpktmkl--seyasklvyqrevntgyk----sgfmgrvvhdpiksfstpknskfdsglegldl--sfddhisqkemiek 72 (235) +-|..+.. .+....+.|+-..+.+.. .-....|+|-+. ++ .|+.+-|+.. +.-+.|......++ T Consensus 127 l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~--~~------~~~~~~G~s~i~~~~~~i~~~~~~~~ 198 (392) T protein:vir:39 127 LRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMKL--LS------IDGGKTGISPLYSLRRESKIQRASDR 198 (392) T ss_pred EcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccEEEecC--CC------CCCccccccHHHHHHHHHHHHHHHHH Confidence 44554432 223334455433322211 111355677432 21 2222333321 22333444444455 Q ss_pred hhhhhhcc-----eeeEeccchhhHHHHHHHHHHHHhhhhc--CCcchhhhhhhhhhhhHHHHHHHHHHhHhhhhcchhh Q lcl|NC_010363. 73 SMPDIFNN-----ITLTFNNTELITDEIKSMFIKRFWNKRL--GMPTIAQWQGELEYYFMEECKNALIQNYLLSTINIDD 145 (235) Q Consensus 73 smpdifnn-----itltfnntelitdeiksmfikrfwnkrl--gmptiaqwqgeleyyfmeecknaliqnyllstinidd 145 (235) ..-..|.| --+++......+++.+..+.+.|++..- |.+.+ T Consensus 199 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl-------------------------------- 246 (392) T protein:vir:39 199 LTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVL-------------------------------- 246 (392) T ss_pred HHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeec-------------------------------- Confidence 55555555 2345555555666666666665554211 11111 Q ss_pred hcccccccCCCcccccccCCCCCCCceeeecchhHHHHH----HHHHHHHHhh------cCCccccchhhHHHHHHHHhh Q lcl|NC_010363. 146 FLDGGSVSGSNNTDSMQGNKNTPDGSTLNITHDQKMLLD----RATSVVRAIG------QGSSDQNTKIAKYQAFLMAQR 215 (235) Q Consensus 146 fldggsvsgsnntdsmqgnkntpdgstlnithdqkmlld----ratsvvraig------qgssdqntkiakyqaflmaqr 215 (235) ++| -+-+| +.++-+-..+++ -...|.++.| .++++.+.-...+++|+.--- T Consensus 247 --~~g-------------~~~~~----l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l 307 (392) T protein:vir:39 247 --DDL-------------EEFTA----LEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASAL 307 (392) T ss_pred --CCC-------------ceEEE----ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHH Confidence 111 00000 001000000000 0122333322 122332322333333332110 Q ss_pred hhHHHHHHHhcchheeeecC Q lcl|NC_010363. 216 TSLIDDVILQNARKLFMLIG 235 (235) Q Consensus 216 tsliddvilqnarklfmlig 235 (235) ..++.-+.-.=.++|+--++ T Consensus 308 ~P~~~~ie~~l~~~L~~~~~ 327 (392) T protein:vir:39 308 NRYLRPAISELEYKLSDHIS 327 (392) T ss_pred HHHHHHHHHHHHHhcccccc Confidence 01111000000122222222 No 9 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=1.28 E-value=51 Score=11.18 Aligned_cols=176 Identities=19% Similarity=0.258 Sum_probs=56.4 Q ss_pred CCCccchh--hhHHhhhhhhhhhccchh----ccchhhhhhhhhhhccCccccccccccccccc--chhhccchHHHHhh Q lcl|NC_010363. 1 MTPKTMKL--SEYASKLVYQREVNTGYK----SGFMGRVVHDPIKSFSTPKNSKFDSGLEGLDL--SFDDHISQKEMIEK 72 (235) Q Consensus 1 mtpktmkl--seyasklvyqrevntgyk----sgfmgrvvhdpiksfstpknskfdsglegldl--sfddhisqkemiek 72 (235) +-|..+.. .+....+.|+-..+.+.. .-....|+|-+. ++ .|+.+-|+.. +.-+.|......++ T Consensus 127 l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~--~~------~~~~~~G~s~i~~~~~~i~~~~~~~~ 198 (392) T protein:vir:10 127 LRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMKL--LS------IDGGKTGISPLYSLRRESKIQRASDR 198 (392) T ss_pred EcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccEEEecC--CC------CCCccccccHHHHHHHHHHHHHHHHH Confidence 44554432 223334455433322211 111355677432 21 2222333321 22333444444455 Q ss_pred hhhhhhcc-----eeeEeccchhhHHHHHHHHHHHHhhhhc--CCcchhhhhhhhhhhhHHHHHHHHHHhHhhhhcchhh Q lcl|NC_010363. 73 SMPDIFNN-----ITLTFNNTELITDEIKSMFIKRFWNKRL--GMPTIAQWQGELEYYFMEECKNALIQNYLLSTINIDD 145 (235) Q Consensus 73 smpdifnn-----itltfnntelitdeiksmfikrfwnkrl--gmptiaqwqgeleyyfmeecknaliqnyllstinidd 145 (235) ..-..|.| --+++......+++.+..+.+.|++..- |.+.+ T Consensus 199 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl-------------------------------- 246 (392) T protein:vir:10 199 LTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVL-------------------------------- 246 (392) T ss_pred HHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeec-------------------------------- Confidence 55555555 2345555555666666666665554211 11111 Q ss_pred hcccccccCCCcccccccCCCCCCCceeeecchhHHHHH----HHHHHHHHhh------cCCccccchhhHHHHHHHHhh Q lcl|NC_010363. 146 FLDGGSVSGSNNTDSMQGNKNTPDGSTLNITHDQKMLLD----RATSVVRAIG------QGSSDQNTKIAKYQAFLMAQR 215 (235) Q Consensus 146 fldggsvsgsnntdsmqgnkntpdgstlnithdqkmlld----ratsvvraig------qgssdqntkiakyqaflmaqr 215 (235) ++| -+-+| +.++-+-..+++ -...|.++.| .++++.+.-...+++|+.--- T Consensus 247 --~~g-------------~~~~~----l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l 307 (392) T protein:vir:10 247 --DDL-------------EEFTA----LEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASAL 307 (392) T ss_pred --CCC-------------ceEEE----ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHH Confidence 111 00000 001000000000 0122333322 122332322333333332110 Q ss_pred hhHHHHHHHhcchheeeecC Q lcl|NC_010363. 216 TSLIDDVILQNARKLFMLIG 235 (235) Q Consensus 216 tsliddvilqnarklfmlig 235 (235) ..++.-+.-.=.++|+--++ T Consensus 308 ~P~~~~ie~~l~~~L~~~~~ 327 (392) T protein:vir:10 308 NRYLRPAISELEYKLSDHIS 327 (392) T ss_pred HHHHHHHHHHHHHhcccccc Confidence 01111000000122222222 No 10 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=1.20 E-value=37 Score=11.98 Aligned_cols=84 Identities=10% Similarity=0.077 Sum_probs=36.2 Q ss_pred CCCccchhhhHHhhhhhhhhhccch--hccchhhhhhhhhhhccCcccccccccccccccchhhc--cchHHHHhhhhhh Q lcl|NC_010363. 1 MTPKTMKLSEYASKLVYQREVNTGY--KSGFMGRVVHDPIKSFSTPKNSKFDSGLEGLDLSFDDH--ISQKEMIEKSMPD 76 (235) Q Consensus 1 mtpktmklseyasklvyqrevntgy--ksgfmgrvvhdpiksfstpknskfdsglegldlsfddh--isqkemieksmpd 76 (235) ++|+.+.+.|.-. +-.+||-.-| -..++|. .|+++.--+..|.....|-.. +..-++||.-+-. T Consensus 264 ~~~~d~qf~e~k~--~~~~eIa~a~~VPp~llGi----------~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee~n~~ 331 (351) T protein:vir:78 264 EVAAKDEFFNIKN--VTRDDLLAAHRVPPQLLGI----------VPSNSGGFGTPDTAARVFGRNEIRPLQARFAELNDW 331 (351) T ss_pred CChhHHHHHHHHH--HhHHHHHHHhCCCHHHhcc----------cCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 4444444444321 2222332222 1222222 244443334556666666422 1222333321111 Q ss_pred hhcceeeEeccchhhHHHHHH Q lcl|NC_010363. 77 IFNNITLTFNNTELITDEIKS 97 (235) Q Consensus 77 ifnnitltfnntelitdeiks 97 (235) +-.. -+.||..+|..-..|| T Consensus 332 l~~~-~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 332 LGDE-VVRFDDYEIPPAPVAA 351 (351) T ss_pred cCcc-ceecChhhhccccccC Confidence 1111 1578998888888888 Done!