Query lcl|NC_015281.1_cdsid_YP_004322770.1 [gene=gp14] [protein=neck protein] [protein_id=YP_004322770.1] [location=103514..104677] Match_columns 387 No_of_seqs 167 out of 409 Neff 6.3 Searched_HMMs 1612 Date Thu Nov 7 14:48:16 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_104 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_104_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:103005 Length: 390 100.0 4E-142 3E-145 795.7 34.6 387 1-387 1-389 (390) 2 protein:vir:104476 Length: 308 100.0 3E-131 2E-134 736.2 25.1 307 1-387 1-307 (308) 3 protein:vir:106986 Length: 292 100.0 5E-113 3E-116 636.4 22.7 290 16-387 1-292 (292) 4 protein:vir:104739 Length: 470 100.0 1E-107 9E-111 606.4 30.4 361 16-380 1-470 (470) 5 protein:vir:7199 Length: 256 # 100.0 6.8E-95 4.2E-98 536.8 15.9 246 1-256 1-256 (256) 6 protein:vir:5658 Length: 278 # 100.0 4.2E-95 2.6E-98 538.0 14.3 261 1-278 13-278 (278) 7 protein:vir:103452 Length: 256 100.0 9.8E-95 6.1E-98 536.0 15.7 242 1-256 1-256 (256) 8 protein:vir:80995 Length: 246 100.0 1.3E-92 8.3E-96 524.3 12.8 238 1-253 1-246 (246) 9 protein:vir:6590 Length: 246 # 100.0 2.1E-92 1.3E-95 523.2 12.6 238 1-253 1-246 (246) 10 protein:vir:98258 Length: 248 100.0 2.2E-92 1.4E-95 523.1 12.4 238 1-251 1-248 (248) 11 protein:vir:100535 Length: 253 100.0 1E-91 6.5E-95 519.4 14.2 244 1-274 1-253 (253) 12 protein:vir:6890 Length: 254 # 100.0 3.2E-91 2E-94 516.7 13.3 244 1-261 1-254 (254) 13 protein:vir:101154 Length: 252 100.0 5.2E-91 3.2E-94 515.5 14.5 244 1-257 1-252 (252) 14 protein:vir:101800 Length: 252 100.0 5.2E-91 3.2E-94 515.5 14.5 244 1-257 1-252 (252) 15 protein:vir:106285 Length: 262 100.0 6.3E-91 3.9E-94 515.1 13.8 243 1-262 1-262 (262) 16 protein:vir:107937 Length: 257 100.0 4.4E-90 2.8E-93 510.4 12.1 247 1-257 1-257 (257) 17 protein:vir:103005 Length: 390 99.6 4.7E-16 2.9E-19 104.6 19.6 310 1-353 61-390 (390) 18 protein:vir:104739 Length: 470 99.0 3.4E-11 2.1E-14 78.0 15.3 331 44-387 1-400 (470) 19 protein:vir:97237 Length: 122 45.5 0.77 0.00048 21.2 9.2 121 22-171 1-122 (122) 20 protein:vir:1385 Length: 107 # 30.9 1.5 0.00095 19.6 9.2 106 44-173 1-107 (107) 21 protein:vir:80941 Length: 135 27.5 1.7 0.0011 19.3 5.1 86 1-86 37-135 (135) No 1 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=100.00 E-value=4.1e-142 Score=795.70 Aligned_cols=387 Identities=73% Similarity=1.174 Sum_probs=360.3 Q ss_pred CcccCCChhhhhhccccceEEEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccceeeEeec Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSACRININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDSAYLCRAY 80 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (387) |+|||+|++||+|+||.+..|||||||.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+|||+|||| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (390) T protein:vir:10 1 MTYSNDPPNNCIQSDYTSSCRLNLNGSAQEQTFMENLIVESIELYGQNVYYLPRIYVNRDTILNEVETSRFEQALSVRAY 80 (390) T ss_pred CeecCCCcccceecceeeccEEEEeccCchhHHHHHHHHHHhHhcCceEEEechheeccccccccccccccccceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheec Q lcl|NC_015281. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLG 160 (387) Q Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g 160 (387) |+|||||+|++|||||||||++|||||+|||+||+|++++...+.+++||||||||||||+|+||||+||||++||||+| T Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~~~~~~~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~p~yq~G 160 (390) T protein:vir:10 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRKKFTTAVDDNAVLNVEGRPNEGDLIWFPATRHLFEIKFVEAERPFYQLG 160 (390) T ss_pred eechhccCCccceeeecCceecceEEEEECCcchhhhhCCcccccccCCCCCCceEEecCCCCEEEEEecCCCCCceEcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeEEEEEEee Q lcl|NC_015281. 161 KGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAVTSLAASN 240 (387) Q Consensus 161 ~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~VtsItVtn 240 (387) |+|+|+|+|++|+|+++++++..+.++.........+.+.....+.++...+..+...+.++++.+++.+++|++|+|+| T Consensus 161 ~nyt~~i~a~lf~ySge~iat~~seid~I~~~~~~~v~~~~~t~g~~~~t~~~~v~~~g~ga~~~a~v~~g~Vt~vtItn 240 (390) T protein:vir:10 161 KGYVWECQCELFEYSDEDLDTGVAEIDAIETAFANAIKLVMDAGGTGAFTVGEEIVGDLYLATATATISGDAVDAVTVTD 240 (390) T ss_pred CceeeeeEEeeeccCCccccccccccccccccccceeeeeeccCCcccccccceeeecCcceeEEEEecCCeEEEEEEee Confidence 99999999999999999999999999888888888888777777777777777777788889999999999999999999 Q ss_pred cCCCcccCceeEEEeCCCCCcceEEEEeeccccceeEEEEecCCCCcccCcEEEEeCCCCCceEEEEEecccceeEEEEc Q lcl|NC_015281. 241 GGQYYKAALPPTVTLTGGGGTGATATATVSDAGLVTGFTVTAGGSGYTSAPTVVIQESPKDIHAEVKSWNNATRELQIIN 320 (387) Q Consensus 241 gGsGYts~~~ptVTisgg~GtgAtatatV~~~G~Vt~ItItn~GsGYts~ptVtI~g~g~ga~atv~~~~~~~~~~~i~n 320 (387) +|+||+.+++|+|++++++|++|+++++++.+|.|++|+|+++|+||+.+|+|+|.+++.++++.+..+++....+.+.+ T Consensus 241 ~GsGYt~~~~ptVtisgg~gtgAt~tatv~~~G~VtsItItn~GsGYt~~PtVtI~g~g~~~~a~~~~~~g~v~~i~Itn 320 (390) T protein:vir:10 241 GGEHYKSALPPTVTITGGGGSGATATATVSSAGIVTGITITSGGTGYTSAPTVTIDYSPKDNRAEVKSWNASTRELQVIN 320 (390) T ss_pred CCCCcccCceeEEEecCCCCccceeeeeecccceEEEEEEecCCccccCCCEEEEeCCCCCceeEEEEeccEEEEEEEec Confidence 99999999999999999999999999999989999999999999999999999999999999999999999999999999 Q ss_pred ccceEEecCceecccccceeEeeccceE--eeccccccccceEEecCCceEEecccCccccccCCCCCC Q lcl|NC_015281. 321 RTGTFNVAEYLKGETSGALWSPESYNTL--NNTNSTYDQNSLFETLDDDIIDWTEGNPFGYTGNDSDTF 387 (387) Q Consensus 321 ~~~~~tv~~~~tG~tsgat~tv~t~~s~--~~t~~~~s~~~~i~t~~D~iidftegNp~g~~g~~~~~~ 387 (387) .+++|+.++.+++..+++..++...... .......+.+..+++.++.+|+|+++||||++|+.+.|= T Consensus 321 ~GsgYtt~p~vt~~~~G~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~~ii~~t~gn~~g~v~n~T~t~ 389 (390) T protein:vir:10 321 RTGTFNTAEVITGLTSGAKWSPESYNTLNNTNTADTIDQNYSFETADDDIIDFTEVNPFGNIGSTTDTT 389 (390) T ss_pred CCcceeeccEEEEecCCcceEEEEEEecccceeeeeecccceeEeCCCceEeecccCcccccccceecc Confidence 9999999888887766655544433222 234445677788999999999999999999999999988 No 2 >protein:vir:104476 Length: 308 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214650;genbank:gi:61806291;genbank:GeneID:3294531 Probab=100.00 E-value=2.9e-131 Score=736.22 Aligned_cols=307 Identities=62% Similarity=1.005 Sum_probs=277.5 Q ss_pred CcccCCChhhhhhccccceEEEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccceeeEeec Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSACRININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDSAYLCRAY 80 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (387) |+|+|+|+++|+||||.+++|||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+|||+|||| T Consensus 1 ~~~~~~~~~py~~~~~~~~~~~n~~~~~~eQ~L~e~LV~EsIqm~G~dvyYlpRe~v~~D~i~~Ed~~skF~~a~~ieaY 80 (308) T protein:vir:10 1 MAIQNSPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNRDSVFEEDSDGKFESAKAIRAY 80 (308) T ss_pred CccccCCCCCcccccccccceEEEeccCchhHHHHHHHHHHHHhcCceEEEechhhcccccccccccccccccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheec Q lcl|NC_015281. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLG 160 (387) Q Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g 160 (387) |+|||||+|+++||||||||++|||||+|||+||+||++++.++.+++||||||||||||+|+||||||||+++||||+| T Consensus 81 ~~~~egy~g~~~~~SKFG~~~~DE~t~~is~~rF~~~v~~~~~~~~~~rP~EGDLIYfPl~~~lFEI~~VE~~~PFyQ~G 160 (308) T protein:vir:10 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) T ss_pred eechhccCCCcceeeecCceecceEEEEEccchhhhhcCCccccccCCCCccccEEEecCCCceEEEEcccCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeEEEEEEee Q lcl|NC_015281. 161 KGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAVTSLAASN 240 (387) Q Consensus 161 ~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~VtsItVtn 240 (387) |+|+|+++|+||+|++|+++++.+.++++.....+...+....++.|.+............+++.+..+++.+..++|+| T Consensus 161 k~~~~~l~ce~F~Ys~E~~~~~i~~iD~i~~~~~~~LdL~pIs~l~G~fdInE~v~gest~itAEv~~wds~v~~ItV~N 240 (308) T protein:vir:10 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTVGETITGGTSNVTAEVKSFDASTRTLIVIN 240 (308) T ss_pred CceEEEEEEEEEeeCCcccccCCccccccccccccceeeeeeccCCccccccceecccccceEEEEEEecCCceEEEEEe Confidence 99999999999999999999999999999998888888888888888887777777666666666666666666666666 Q ss_pred cCCCcccCceeEEEeCCCCCcceEEEEeeccccceeEEEEecCCCCcccCcEEEEeCCCCCceEEEEEecccceeEEEEc Q lcl|NC_015281. 241 GGQYYKAALPPTVTLTGGGGTGATATATVSDAGLVTGFTVTAGGSGYTSAPTVVIQESPKDIHAEVKSWNNATRELQIIN 320 (387) Q Consensus 241 gGsGYts~~~ptVTisgg~GtgAtatatV~~~G~Vt~ItItn~GsGYts~ptVtI~g~g~ga~atv~~~~~~~~~~~i~n 320 (387) +|++|++ +|+|+ T Consensus 241 ~gGsfts--pptIt------------------------------------------------------------------ 252 (308) T protein:vir:10 241 RSGTFTV--PETVT------------------------------------------------------------------ 252 (308) T ss_pred CCCceee--CcEEE------------------------------------------------------------------ Confidence 6665553 23433 Q ss_pred ccceEEecCceecccccceeEeeccceEeeccccccccceEEecCCceEEecccCccccccCCCCCC Q lcl|NC_015281. 321 RTGTFNVAEYLKGETSGALWSPESYNTLNNTNSTYDQNSLFETLDDDIIDWTEGNPFGYTGNDSDTF 387 (387) Q Consensus 321 ~~~~~tv~~~~tG~tsgat~tv~t~~s~~~t~~~~s~~~~i~t~~D~iidftegNp~g~~g~~~~~~ 387 (387) |.++++..++.+++.+.+++..++.+..+|+.+|+||||||+||||++|+.+||- T Consensus 253 ------------Gsts~a~~t~~s~~~~~nt~~~~~~n~~fet~~D~iiDftE~NPFG~~g~~t~~~ 307 (308) T protein:vir:10 253 ------------GGTSSASWTTATYNTIDNQNLDYDQNNDFETLDNQIIDFTEANPFGSVGSITDNT 307 (308) T ss_pred ------------eccCCceeEEEeeeecccCCCcccCCcceeeccCcEEeeccCCCCcccccccccc Confidence 2333344444455556667788999999999999999999999999999999999 No 3 >protein:vir:106986 Length: 292 # NCBI annotation: neck protein gp14 # Family: family:all:1104 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195128;genbank:gi:58532905;uniprot:Q5GQV9;genbank:GeneID:3260483 Probab=100.00 E-value=4.7e-113 Score=636.38 Aligned_cols=290 Identities=41% Similarity=0.776 Sum_probs=256.0 Q ss_pred ccceEEEE--eccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccceeeEeeccchhhccCCcchh Q lcl|NC_015281. 16 YDSACRIN--INGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDSAYLCRAYVNNVEGWEGQGEL 93 (387) Q Consensus 16 ~~~~~~~~--~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y~~~~~~~~~~~~~ 93 (387) --..-||| +|||.+||+|+|+||+|||||||+||||||||+|+ |++|+||++|||+|||+|||||+|||||+|+++| T Consensus 1 m~~npyfn~~~~~~~~eQ~L~~~LV~Esiq~~G~dvyYlpRe~~~-d~~~~E~~~skF~~a~~ieaY~~~~eg~~g~~~~ 79 (292) T protein:vir:10 1 MPTSPYFPSYYSGYSGEQNLVQDLVDEQIKLFGTDIYYLPRTILR-DNTLDDVIYNKFERQFQVEMLLQNVEGFGSPSEF 79 (292) T ss_pred CCcCccccccccCcCchhHHHHHHHHHHHHhcCceEEEechhhhc-ccccccccccccccceeEEEEeechhccCCCcce Confidence 33455888 69999999999999999999999999999999999 9999999999999999999999999999999999 Q ss_pred hhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheeccceEEEEEEEEEe Q lcl|NC_015281. 94 LSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLGKGYVWEMQCELFE 173 (387) Q Consensus 94 ~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g~~yv~~~~~~~F~ 173 (387) |||||||++|||||+|||+||+|++++. ++.+++||||||||||||+|+||||||||+++||||+||+|+|+++|+||+ T Consensus 80 ~sKFG~~~~De~t~~is~~~f~~~~~~~-~~~~~~~P~eGDLIYfPl~~~lFEI~~ve~~~PfyQ~gk~~~~~l~~~~F~ 158 (292) T protein:vir:10 80 ISKFGLRITDEVRFIVSQRRWDEEAVNY-DLNVNGRPNEGDLLYFPLTQDIYEIKFVEREDPFYQLGKNYFYIMTAEIYE 158 (292) T ss_pred eeecCceecceEEEEEccchhhhhcCcc-cccccCCCccccEEEEcCCCcEEEEEcccCCCchhhhCCceEEEEEEEEEe Confidence 9999999999999999999999999974 889999999999999999999999999999999999999999999999999 Q ss_pred cCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeEEEEEEeecCCCcccCceeEE Q lcl|NC_015281. 174 YSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAVTSLAASNGGQYYKAALPPTV 253 (387) Q Consensus 174 ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~VtsItVtngGsGYts~~~ptV 253 (387) |++|+++++.+.++++....... T Consensus 159 Ys~E~idtgl~eiD~i~~~~sse--------------------------------------------------------- 181 (292) T protein:vir:10 159 YGSDNISTGVEEIDELETLFSSA--------------------------------------------------------- 181 (292) T ss_pred ecCceecCCCCcccccccccccc--------------------------------------------------------- Confidence 99999999999876322211100 Q ss_pred EeCCCCCcceEEEEeeccccceeEEEEecCCCCcccCcEEEEeCCCCCceEEEEEecccceeEEEEcccceEEecCceec Q lcl|NC_015281. 254 TLTGGGGTGATATATVSDAGLVTGFTVTAGGSGYTSAPTVVIQESPKDIHAEVKSWNNATRELQIINRTGTFNVAEYLKG 333 (387) Q Consensus 254 Tisgg~GtgAtatatV~~~G~Vt~ItItn~GsGYts~ptVtI~g~g~ga~atv~~~~~~~~~~~i~n~~~~~tv~~~~tG 333 (387) ++-+.+.+.+..|...++|+ +..+++.+++..|..++..+++.+..+.+++++.+.| T Consensus 182 ---------------------LdL~pi~~g~G~f~inE~vt--ge~sg~~AEv~sw~~~t~~L~V~n~~GsF~T~e~i~G 238 (292) T protein:vir:10 182 ---------------------IAIALSIGGTGDFDLGEIVT--GGISGTEAEVKSWDSSSRILQVINRTGTFEEGESVTG 238 (292) T ss_pred ---------------------cceeecccCCccccCCceee--ecccceEEEEEEccCCCceEEEEeCccccccCceeeE Confidence 11111222233366666555 4445567788899999999999999999999999999 Q ss_pred ccccceeEeeccceEeeccccccccceEEecCCceEEecccCccccccCCCCCC Q lcl|NC_015281. 334 ETSGALWSPESYNTLNNTNSTYDQNSLFETLDDDIIDWTEGNPFGYTGNDSDTF 387 (387) Q Consensus 334 ~tsgat~tv~t~~s~~~t~~~~s~~~~i~t~~D~iidftegNp~g~~g~~~~~~ 387 (387) .+|++...+.++.....++.+++.+.+||+.+|+||||||+||||++|+++-.. T Consensus 239 ~~Sga~~~v~si~~~~g~~~t~a~~~~iEt~~d~i~df~e~npfg~~~~~~~~~ 292 (292) T protein:vir:10 239 NDSGSVWVVDSFDTLNNTNSEYDQNREIESTADTIIDWSESNPFGEYGNFTGSI 292 (292) T ss_pred eecCCeEEeeEEEEeCCCCCcccccceeccccCcEEeeccCCcCcccccccccC Confidence 999999999998888889999999999999999999999999999999999888 No 4 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=100.00 E-value=1.4e-107 Score=606.36 Aligned_cols=361 Identities=35% Similarity=0.612 Sum_probs=269.6 Q ss_pred ccceEEEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccceeeEeeccchhhccCCcchhhh Q lcl|NC_015281. 16 YDSACRININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDSAYLCRAYVNNVEGWEGQGELLS 95 (387) Q Consensus 16 ~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y~~~~~~~~~~~~~~s 95 (387) --..-||| |+|.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+|||+|||||+|||||+|+++||| T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~e~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~ 79 (470) T protein:vir:10 1 MALNPFFL-QGTSSEQRLTQDLINEHLKIYGVEVTYIPRKYVNTKSIIEEVQSSKFDDNFAIEAYVNTYEGYGGQGDVLT 79 (470) T ss_pred CcccceeE-cCCCchhHHHHHHHHHHhHhccceEEEechhhcccccccccccccccccceeEEEEeecccCcCCcceeee Confidence 33445886 99999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCceeeeeEEEEEccchhhhhcCCc-------c----cccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheeccceE Q lcl|NC_015281. 96 KFGIRIEDKTTFVISRKKFTEKVDDN-------V----TLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLGKGYV 164 (387) Q Consensus 96 kfg~~~~de~~~~~s~~~f~~~~~~~-------~----~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g~~yv 164 (387) |||||++|||+|+|||+||++.|+.. . .+.+..||+|||||||||+|+||||+|||+++||||+||+|+ T Consensus 80 ~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~p~~~~G~~~~ 159 (470) T protein:vir:10 80 KFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNEEVILATRPREGDLVFFPLGSRLFEVKFVEHEDPFYQLGKNYV 159 (470) T ss_pred ecCcccceEEEEEECCccccccccchhhcccCcccccccccccCCcccccEEEecCCCCEEEEEecCCCCcchhcCccee Confidence 99999999999999999999877722 1 233346999999999999999999999999999999999999 Q ss_pred EEEEEEEEecCccccccccccccceeeee-----------------------eeeEEEEeccCCccccccceeeeccCcC Q lcl|NC_015281. 165 WEMQCELFEYSDESIDTGVADIDAVETTF-----------------------ANSIKLVMDPGGSGDFSVGETITGNLYT 221 (387) Q Consensus 165 ~~~~~~~F~ysgE~~dtg~~ai~~~~~~~-----------------------~s~~~~ti~~~Gsgy~~~~~~~~~~g~g 221 (387) |+++|++|+|++++++.....+....... .....+++.+.|++|...|.+....+.. T Consensus 160 ~~it~~~f~ysge~~s~~v~~~~~~~~~~g~~~t~t~~~~g~~~~~t~~~~~g~vt~ititn~Gsgyt~~ptVti~~~~~ 239 (470) T protein:vir:10 160 YQLKCELFEYEDEVIDTSIDAIDTVVQDDGYISKLQLVGIGRTAEVAASIGVGYVREIFLNNDGSGFTSPPTITFSASPA 239 (470) T ss_pred EEeeeceeEecCCccccceecccccccccccceeeeecCCCccceeeeeecceeeeEeEeeccccceeccCEEEEccCCC Confidence 99999999999999987766544333111 1234577888899998888766543221 Q ss_pred -------eEEEEEEeCCeEEEEEEeecCCCcccCceeEEEeCCCCCcceEEEEeec-cccceeEEEEecCCCCcccCcEE Q lcl|NC_015281. 222 -------AVAAATITGDAVTSLAASNGGQYYKAALPPTVTLTGGGGTGATATATVS-DAGLVTGFTVTAGGSGYTSAPTV 293 (387) Q Consensus 222 -------atata~v~~G~VtsItVtngGsGYts~~~ptVTisgg~GtgAtatatV~-~~G~Vt~ItItn~GsGYts~ptV 293 (387) .........+.++.++++++|+||+.. |+|++.++++.++.+++.+. ..+.++.++++++|+||+.+|+| T Consensus 240 ~~~~~a~~~~~t~~~~g~vt~ititn~Gsgytt~--ptvt~~~~~g~ga~at~~~~~~~~g~~~itit~~GsgYtt~ptv 317 (470) T protein:vir:10 240 FTDARAVGILTTRANVTSIEKILMTSAGAGYITP--PTITISGGGGTGAAATCSIETVYQGVVNFNVVDGGVGYGTEPSI 317 (470) T ss_pred CCCccceeeEeecceeeEEEEEEEecCccccccc--ceEEEccCCCccceeeeeecccccceeeEEEccCCccccccceE Confidence 112233455789999999999999876 77888887777777665543 35567899999999999999999 Q ss_pred EEeCCCCCceEE--E----EEecccceeEEEEcccceEEecCc--------------------eeccccc---------- Q lcl|NC_015281. 294 VIQESPKDIHAE--V----KSWNNATRELQIINRTGTFNVAEY--------------------LKGETSG---------- 337 (387) Q Consensus 294 tI~g~g~ga~at--v----~~~~~~~~~~~i~n~~~~~tv~~~--------------------~tG~tsg---------- 337 (387) +|..++.+..+. . ....+......+.+.+.++...+. .++.+++ T Consensus 318 tit~~~sg~~a~~~a~~~~~~~~g~itsititn~Gsgyts~ptv~i~~~~~~~~~~t~~~~~~~tg~tsgt~~~~~~~~~ 397 (470) T protein:vir:10 318 AVTQPGAGTTAVGIASIGMAGSDQVLKSVYIGNPGRGYTATPNVIVADPPSMSGIGTFTFNEVIKGSRSGTEARVKSWDD 397 (470) T ss_pred EEecCCCCCcccceeEEEeecccceeeeEEeccCCcceeccceeEeecCccccccceeeeeeeeeccccceeeeeeeecc Confidence 998764332111 1 111112222223332222221110 0000000 Q ss_pred -------------------------------ceeEeeccceEeeccccccccceEEecCCceEEecccCccccc Q lcl|NC_015281. 338 -------------------------------ALWSPESYNTLNNTNSTYDQNSLFETLDDDIIDWTEGNPFGYT 380 (387) Q Consensus 338 -------------------------------at~tv~t~~s~~~t~~~~s~~~~i~t~~D~iidftegNp~g~~ 380 (387) +...+.+ .....++..+..+.++++.+|+||||+++||||.| T Consensus 398 ~t~~~~v~~~~~~~~~~~~~~g~tvt~~~~~a~~~~~s-~t~~~~~~~~ts~~~i~t~~~~i~~~~~~np~~~~ 470 (470) T protein:vir:10 398 DTKILLVSNVGIGSTVSGFYTGESIVGQESGASYALGS-YNSDDANDKYNDGDEFEFNADQILDFTESNPFGNF 470 (470) T ss_pred cceeeeecccceecccceeeeeeeEEeccccceeeEEE-ecccccCceeeccceeeccCCcEEeeeecCCCCCC Confidence 0000000 01112444667778899999999999999999999 No 5 >protein:vir:7199 Length: 256 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049773;genbank:gi:9632588;genbank:GeneID:1258695 Probab=100.00 E-value=6.8e-95 Score=536.83 Aligned_cols=246 Identities=27% Similarity=0.464 Sum_probs=192.2 Q ss_pred CcccCCC--hhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccc Q lcl|NC_015281. 1 MAYSNTP--ANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSF 71 (387) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f 71 (387) |.=+++- |+---|+.|..+. |||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++||| T Consensus 1 m~~~d~~lfa~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~skF 80 (256) T protein:vir:71 1 MATYDKNLFAKLENRTGYSQTNETEILNPYVNFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKF 80 (256) T ss_pred CcccccceeeeecCCcchhhhhhhcccceeeeeeccCchhhHHHHHHHHHHHHcCceEEEechhhccccccccccccccc Confidence 6544432 2333356666554 999999999999999999999999999999999999999999999999999 Q ss_pred cceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeee Q lcl|NC_015281. 72 DSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVE 151 (387) Q Consensus 72 ~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve 151 (387) +|||+|||||+|||||+|+++||||||||++|||||+|||+||+|+++++ ||||||||||||+|+||||+||| T Consensus 81 ~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE 153 (256) T protein:vir:71 81 TKAWKFAAYLNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGK-------EPKEGDLIYFPMDNSLFEINWVE 153 (256) T ss_pred ccceeEEEEeehhhccCCccccceecCceecceEEEEEccchhhhhhcCC-------CCccccEEEEcCCCcEEEEEccc Confidence 99999999999999999999999999999999999999999999999987 99999999999999999999999 Q ss_pred eCChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCC Q lcl|NC_015281. 152 AERPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGD 231 (387) Q Consensus 152 ~~~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G 231 (387) +++||||+||+|+|+++|++|+||+|+++++.+.++++.....+...+....+..|-........ ............ T Consensus 154 ~~~PFYQ~Gkn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~e~~eldl~~i~~ldg~~di~~~~~---~e~~~i~~e~~~ 230 (256) T protein:vir:71 154 PYDPFYQLGQNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNAVRNLNGIHDINIDQY---AEVDQINSEAKE 230 (256) T ss_pred CCCchhhhCCCeEEEEEEEEEecCCceecccCCCCCcccCCcccccccccccccCcccccccccc---cchhhhhhcccc Confidence 99999999999999999999999999999999999988765544433322223332222211110 000000000111 Q ss_pred eEEE-EEEeecCCCcccCceeEEEeC Q lcl|NC_015281. 232 AVTS-LAASNGGQYYKAALPPTVTLT 256 (387) Q Consensus 232 ~Vts-ItVtngGsGYts~~~ptVTis 256 (387) -|.. +.|++.|+|+++.+-..--.. T Consensus 231 ~ve~~~~in~~G~~~~~~pfd~~~~~ 256 (256) T protein:vir:71 231 YVEPYVVVNNRGKSFESSPFDNDFMD 256 (256) T ss_pred ccccceeecCCCCCCcCCCccccccC Confidence 1233 667777888765433221111 No 6 >protein:vir:5658 Length: 278 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899597;genbank:gi:34419584;genbank:GeneID:2545699 Probab=100.00 E-value=4.2e-95 Score=538.00 Aligned_cols=261 Identities=26% Similarity=0.379 Sum_probs=187.2 Q ss_pred CcccCCChhhhhhccccceEEEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccceeeEeec Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSACRININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDSAYLCRAY 80 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (387) ....|.---||+|+||....|||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||| T Consensus 13 a~l~~~~gy~~~~~~~~~NpYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 92 (278) T protein:vir:56 13 AKLESQKGYDQIYNNHLVNPYFNWVNHTNEQNLTDMLVAESIINRGVECVYLRREMEKVDLVFGEDPMSKFTQNFRMSLY 92 (278) T ss_pred EEecCCcccchhcccccccceeeccCCCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEE Confidence 22345566799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheec Q lcl|NC_015281. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLG 160 (387) Q Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g 160 (387) |+|||||+|+++||||||||++|||||+|||+||+|++++. ||||||||||||+|+||||+|||+++||||+| T Consensus 93 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 165 (278) T protein:vir:56 93 VESFEGWDGDGDWYSKFGFQVNDEMNVCINPKLFAQQGDGK-------QPLMGDLIYFPLANSLFEISWIEREDPWYMNG 165 (278) T ss_pred eehhhccCCCceeeeecCceecceEEEEEccchhhhcCCCC-------CCccccEEEEcCCCcEEEEEccCCCCchhhhC Confidence 99999999999999999999999999999999999999986 99999999999999999999999999999999 Q ss_pred cceEEEEEEEEEecCccccccc-----cccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeEEE Q lcl|NC_015281. 161 KGYVWEMQCELFEYSDESIDTG-----VADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAVTS 235 (387) Q Consensus 161 ~~yv~~~~~~~F~ysgE~~dtg-----~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~Vts 235 (387) |+|+|+++|++|+|++|+++++ .++++.+ ...+..+...+.-.+.+.. .+......-..+...++ ...+.. T Consensus 166 k~~~~~l~ce~F~Ys~E~~dtg~pe~~~d~i~~i-~ef~e~~~~~ld~~~~~~l-~G~~di~~~~~~e~~~~--~~e~~~ 241 (278) T protein:vir:56 166 VLPMRKMKMTKFVYSGEEINLEKPEAVIDSIDDI-LNFGEDTDEMIDIDKINAL-DGRWDIGIEQGAEITQI--EDEVDK 241 (278) T ss_pred CceEEEEEEEEeeecCceecccCCccccccccch-hhcccchhhcccccccccc-cchhcccchhhhhhhhh--hhccce Confidence 9999999999999999999887 3333322 1122222333333333322 11111111111111111 112222 Q ss_pred EEEeecCCCcccCceeEEEeCCCCCcceEEEEeeccccceeEE Q lcl|NC_015281. 236 LAASNGGQYYKAALPPTVTLTGGGGTGATATATVSDAGLVTGF 278 (387) Q Consensus 236 ItVtngGsGYts~~~ptVTisgg~GtgAtatatV~~~G~Vt~I 278 (387) ++....+-++.+..+|+-.-..+-| .. |...-.+.+. T Consensus 242 f~~~~~v~~~~~~~~~t~~~n~~~g--~~----v~~~~~~D~f 278 (278) T protein:vir:56 242 FYESEQVVPSGSDVQPTDPRNATIG--FN----VNNSNPFDSF 278 (278) T ss_pred eeecCceecCCCCccccCcccccCC--Cc----CccccccccC Confidence 2222222222222222211111111 00 1001111112 No 7 >protein:vir:103452 Length: 256 # NCBI annotation: head completion # Family: family:all:1104 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803104;genbank:gi:116326384;genbank:GeneID:4405481 Probab=100.00 E-value=9.8e-95 Score=535.98 Aligned_cols=242 Identities=28% Similarity=0.484 Sum_probs=191.6 Q ss_pred CcccCCC--hhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccc Q lcl|NC_015281. 1 MAYSNTP--ANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSF 71 (387) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f 71 (387) |.=+++- |+---|+.|..+. |||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++||| T Consensus 1 m~~~d~~lfa~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (256) T protein:vir:10 1 MATYDKNLFAKLENHTGYSQTNETEILNPYVNFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKF 80 (256) T ss_pred CcccccceeeeecCCcchhhhhhhcccceeeeeeccCchhhHHHHHHHHHHHHcCceEEEechhhccccccccccccccc Confidence 6544432 2333356666554 999999999999999999999999999999999999999999999999999 Q ss_pred cceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeee Q lcl|NC_015281. 72 DSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVE 151 (387) Q Consensus 72 ~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve 151 (387) +|||+|||||+|||||+|+++||||||||++|||||+|||+||+|+++++ ||||||||||||+|+||||+||| T Consensus 81 ~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE 153 (256) T protein:vir:10 81 TKAWKFAAYLNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGK-------EPKEGDLIYFPMDNSLFEINWVE 153 (256) T ss_pred ccceeEEEEeehhhccCCccccceecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEEecc Confidence 99999999999999999999999999999999999999999999999987 99999999999999999999999 Q ss_pred eCChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEE--- Q lcl|NC_015281. 152 AERPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATI--- 228 (387) Q Consensus 152 ~~~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v--- 228 (387) +++||||+||+|+|+++|++|+||+|+++++.+.++++.....+...+--.....|...... .+.+.... T Consensus 154 ~~~PFYQ~Gkn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~eldl~pi~~ldG~~di~~-------~~~~e~~~~~~ 226 (256) T protein:vir:10 154 PYDPFYQLGQNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNPVRNLNGIHDINI-------DQYAEVDQINS 226 (256) T ss_pred CCCchhhhCCCeEEEEEEEEEeeCCceecccCCCCccccCCccchhhhccccccccccccCc-------cccccchhhhh Confidence 99999999999999999999999999999999999988765544322211111222221111 11111111 Q ss_pred -eCCeEEE-EEEeecCCCcccCceeEEEeC Q lcl|NC_015281. 229 -TGDAVTS-LAASNGGQYYKAALPPTVTLT 256 (387) Q Consensus 229 -~~G~Vts-ItVtngGsGYts~~~ptVTis 256 (387) ...-|.. +.|++.|+|+++.+-..--.. T Consensus 227 e~~~~v~~~~~in~~G~~~~~~pfd~~~~~ 256 (256) T protein:vir:10 227 EAKEYVEPYVVVNNRGKSFESSPFDNDFMD 256 (256) T ss_pred ccccccccceEecCCCCCCcCCCccccccC Confidence 1112333 667777888876533221111 No 8 >protein:vir:80995 Length: 246 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469495;genbank:gi:157311452;genbank:GeneID:5602161 Probab=100.00 E-value=1.3e-92 Score=524.27 Aligned_cols=238 Identities=28% Similarity=0.466 Sum_probs=186.8 Q ss_pred CcccCCChhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccc Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDS 73 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~ 73 (387) |.=+.-=|+---|++|++++ |||+|+|.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+| T Consensus 1 ~~~~~lfa~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~k 80 (246) T protein:vir:80 1 MFDSTLFARLESQKDYENTRQTEILNPYVNFNSHTNTQTLADIMVAESIQMRGVEMYYIPREFVKPDMIFGEDVQSKFTK 80 (246) T ss_pred CCcccceeeecCCcchhhhcccccccceeeeCCCCchhhHHHHHHHHHHHHcCceEEEechhhhcccccccccccccccc Confidence 66666667777799999999 69999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeC Q lcl|NC_015281. 74 AYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAE 153 (387) Q Consensus 74 ~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~ 153 (387) ||+|||||+|||||+|+++||||||||++|||||+|||+||+|++++. ||||||||||||+|+||||+|||++ T Consensus 81 a~~ieaYl~s~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~~~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~ 153 (246) T protein:vir:80 81 AWKFAAYINSFDGYEGAGNFFQSFGYTANDELTITINPNLFKHQVDNK-------EPKSGDLFYIPMSNDLFEISYVEPY 153 (246) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEccchHhhhhCCC-------CCccccEEEEcCCCceEEEecccCC Confidence 999999999999999999999999999999999999999999999986 9999999999999999999999999 Q ss_pred ChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeE Q lcl|NC_015281. 154 RPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAV 233 (387) Q Consensus 154 ~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~V 233 (387) +||||+||+|+|+++|++|+||+|+++++.+.++++.........+-....-.|......-.. .-...--.-..--| T Consensus 154 ~PFYQ~GKn~v~~l~ce~F~Ys~E~~~t~i~~~d~I~~de~~~ldl~~i~nldg~~Din~~~~---~e~~~~~~e~~~f~ 230 (246) T protein:vir:80 154 QPFFQAGKNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQY---KEDKQFRSEGQDFI 230 (246) T ss_pred CchhhhCCCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCcccc---hhhhhhhcchhhhc Confidence 999999999999999999999999999999998887765443221100000011111000000 00000000011123 Q ss_pred EEEEEeec-CCCcccCceeEE Q lcl|NC_015281. 234 TSLAASNG-GQYYKAALPPTV 253 (387) Q Consensus 234 tsItVtng-GsGYts~~~ptV 253 (387) ....++|+ ||=+.. . T Consensus 231 ~~~~~~~~~gspf~~-----~ 246 (246) T protein:vir:80 231 DPFDPINGKGSPFAD-----F 246 (246) T ss_pred ccceeecCCCCcccc-----C Confidence 33333332 322211 0 No 9 >protein:vir:6590 Length: 246 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891721;genbank:gi:33620667;genbank:GeneID:1725307 Probab=100.00 E-value=2.1e-92 Score=523.18 Aligned_cols=238 Identities=28% Similarity=0.478 Sum_probs=186.0 Q ss_pred CcccCCChhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccc Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDS 73 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~ 73 (387) |.=+.-=|+---|++|++++ |||+|+|.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+| T Consensus 1 ~~~~~lfa~l~~~~gy~~~~~~~~lNPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~k 80 (246) T protein:vir:65 1 MFNNTLFARLESQKDYENTRQTEILNPYVNFHKYTNTQTLADVMVAEAIQMRGVELYYIPREFVKPDMIFGEDVQSKFTK 80 (246) T ss_pred CCcccceeeecCCcchhhhcccccccceeecCCCcchhhHHHHHHHHHHHHcCceEEEechhhccccccccccccccccc Confidence 65555556667799999998 69999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeC Q lcl|NC_015281. 74 AYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAE 153 (387) Q Consensus 74 ~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~ 153 (387) ||+|||||+|||||+|+++||||||||++|||||+|||+||+|++++. ||||||||||||+|+||||+|||++ T Consensus 81 a~~ieaYl~~~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~ 153 (246) T protein:vir:65 81 AWKFAAYINSFDGYEGAGNFFSSFGYQANDELTFTVNPNLFKHQVDDQ-------EPKSGDLIYIPMSNDLFEINYVEPY 153 (246) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhcCCC-------CCccccEEEEcCCCcEEEEecccCC Confidence 999999999999999999999999999999999999999999999975 9999999999999999999999999 Q ss_pred ChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeE Q lcl|NC_015281. 154 RPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAV 233 (387) Q Consensus 154 ~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~V 233 (387) +||||+||+|+|+++|++|+||+|+++++.+.++++.........+-....-.|......-.. .-...--.-..--| T Consensus 154 ~PFYQ~GKn~v~~l~ce~F~Ys~E~~~~~i~~~d~I~~de~~~ldl~~i~~ldg~~Din~~~~---~e~~~~~~e~~~f~ 230 (246) T protein:vir:65 154 QPFFQAGKNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQY---KEDKQFRSEGQDFI 230 (246) T ss_pred CchhhhCCCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCccch---hhhhhhhcchhhhc Confidence 999999999999999999999999999999998887765443221100000011111000000 00000000011123 Q ss_pred EEEEEeec-CCCcccCceeEE Q lcl|NC_015281. 234 TSLAASNG-GQYYKAALPPTV 253 (387) Q Consensus 234 tsItVtng-GsGYts~~~ptV 253 (387) ....++|+ |+=+.. . T Consensus 231 ~~~~~~~~~gspf~~-----~ 246 (246) T protein:vir:65 231 DPFDPINGKGSPFAD-----F 246 (246) T ss_pred ccceeecCCCCcccc-----C Confidence 33333332 322211 0 No 10 >protein:vir:98258 Length: 248 # NCBI annotation: gp14 head completion # Family: family:all:1104 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239191;genbank:gi:66391666;genbank:GeneID:3416360 Probab=100.00 E-value=2.2e-92 Score=523.05 Aligned_cols=238 Identities=25% Similarity=0.409 Sum_probs=197.0 Q ss_pred CcccCCC--hhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccc Q lcl|NC_015281. 1 MAYSNTP--ANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSF 71 (387) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f 71 (387) |.=+++- |+---|+.|..+. |||+|+|.+||+|+|+||+|||||||+||||||||+|++|+||+||++||| T Consensus 1 m~~~d~~lfa~l~~~~gy~~t~~~~~lnPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (248) T protein:vir:98 1 MQNWDESLFAQLSTGEGVDRNLKDQVTNPYVNWYKYNPTQQLHDSLTAESIQMKSPDMYYVRREFVNIDKILGEDRESKF 80 (248) T ss_pred CcccccceeeEecCCcchhhhhhcccccceeeccccCchhHHHHHHHHHHHHHcCceEEEcchhhhcccccccccccccc Confidence 6554442 2222355555443 999999999999999999999999999999999999999999999999999 Q ss_pred cceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeee Q lcl|NC_015281. 72 DSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVE 151 (387) Q Consensus 72 ~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve 151 (387) +|||+|||||+|||||+|+++||||||||++|||||+|||+||+|++++. ||||||||||||+|+||||+||| T Consensus 81 ~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~~-------rP~EGDLIYfPm~n~LFEI~~VE 153 (248) T protein:vir:98 81 TKSWKIAAYIESYANYEGQRDFFSKFGLSSNDEMTLVLNPRLFAHQTDGG-------IPVLGDLVYFPMDNSLFEITWVE 153 (248) T ss_pred ccceeEEEeeehhhccCCccceeeecCceecceEEEEEccchhhhcCCCC-------CCccccEEEEcCCCceEEEEecC Confidence 99999999999999999999999999999999999999999999999986 99999999999999999999999 Q ss_pred eCChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCC Q lcl|NC_015281. 152 AERPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGD 231 (387) Q Consensus 152 ~~~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G 231 (387) + +||||+||+|+|+++|++|+||+|+++++.+.++++.......+.+--..+..|...........-....+.+ .. T Consensus 154 ~-dPFYQ~Gkn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~~iDl~~i~~ldg~~Di~~~~~~e~~~~~~E~---~~ 229 (248) T protein:vir:98 154 A-DPFYQFGDRPQRKINLAKFIYTGEELAPELQRNEGIHIEPDAELDLEPIRNLDGLADINIEQYEEDKEFEREG---DE 229 (248) T ss_pred C-CchhhhCCCeEEEEEEEEeeeCCccccccCCCccccCCCCCcchhHHHhhcCccccccCcccccchhhhhhhh---hh Confidence 8 5999999999999999999999999999999999999888877666555566666654443332222221111 12 Q ss_pred eEEEEEEee-cCCCcccCcee Q lcl|NC_015281. 232 AVTSLAASN-GGQYYKAALPP 251 (387) Q Consensus 232 ~VtsItVtn-gGsGYts~~~p 251 (387) -|....|+| .|++..+. | T Consensus 230 ~~~~~~vin~~g~~~~~~--~ 248 (248) T protein:vir:98 230 FIESFDVVNGRGSPFATL--P 248 (248) T ss_pred hhcccceecCcCCCccCC--C Confidence 234444444 48877654 3 No 11 >protein:vir:100535 Length: 253 # NCBI annotation: gp14 head completion protein # Family: family:all:1104 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656376;genbank:gi:109290127;genbank:GeneID:4156513 Probab=100.00 E-value=1e-91 Score=519.36 Aligned_cols=244 Identities=27% Similarity=0.403 Sum_probs=186.1 Q ss_pred CcccCCChhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccc Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDS 73 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~ 73 (387) |.=+.-=|+---|+.|..+. |||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+| T Consensus 1 ~~~~~lfa~l~~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~k 80 (253) T protein:vir:10 1 MMDKSLFATLENRSGYQQTNEQNILNPYVKFNRYEGSQALHDTLVAESIQMRGLEFYYLEREYTNLDLLFGEDPNSRFEK 80 (253) T ss_pred CcCccceeEecCCcchhhhhhhccccceeeccccCchhHHHHHHHHHHHHHcCceEEEcchhhccccCcccccccccccc Confidence 54444445444566666554 99999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeC Q lcl|NC_015281. 74 AYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAE 153 (387) Q Consensus 74 ~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~ 153 (387) ||+|||||+|||||+|+++||||||||++|||||+|||+||+|+++++ ||||||||||||+|+||||+|||++ T Consensus 81 a~~ieaYl~~~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~ 153 (253) T protein:vir:10 81 AWKFAAWLNSFESYEGQQSFFSKFGHQQNDEVRISINPGLFKYQVNGK-------EPKLGDLIYMPMDNSLFEITWVEPY 153 (253) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCC Confidence 999999999999999999999999999999999999999999999987 9999999999999999999999999 Q ss_pred ChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeE Q lcl|NC_015281. 154 RPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAV 233 (387) Q Consensus 154 ~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~V 233 (387) +||||+||+|+|+++|++|+||+|+++++.+.++.+.... +...+.......|...........-....+.+ .--| T Consensus 154 ~PFYQ~Gkn~~~~l~ce~F~Ys~E~i~tgi~~id~Ie~~~-~~ldl~~i~~l~G~~Di~~~~~~e~~~~~~e~---~~~v 229 (253) T protein:vir:10 154 TPFYQMGKNPIRVIVAQKFIYSGEKLAPQFQEKPEIEDQY-NGLDLEPILNLDGFIDQKINEFGENVQAQNEA---RPFV 229 (253) T ss_pred CchhhhCCceEEEEEEEEEecCCccccccCcccccccchh-hhhhhhhhhcCCCccccccccccccchhhhcc---cccc Confidence 9999999999999999999999999999999999887543 33333333333333332222211111111110 1112 Q ss_pred EEEEEee--cCCCcccCceeEEEeCCCCCcceEEEEeeccccc Q lcl|NC_015281. 234 TSLAASN--GGQYYKAALPPTVTLTGGGGTGATATATVSDAGL 274 (387) Q Consensus 234 tsItVtn--gGsGYts~~~ptVTisgg~GtgAtatatV~~~G~ 274 (387) ....+++ +..|-++ |-..- .| . T Consensus 230 ~~~~~~~~~~~~g~~s---pf~~~---~~-------------~ 253 (253) T protein:vir:10 230 EPFDPISTNPVNSFNS---PFGRH---EG-------------Q 253 (253) T ss_pred ccceeccCCCCccccC---ccccc---CC-------------C Confidence 2222221 1112211 10000 00 0 No 12 >protein:vir:6890 Length: 254 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861866;genbank:gi:32453657;genbank:GeneID:1494292 Probab=100.00 E-value=3.2e-91 Score=516.72 Aligned_cols=244 Identities=28% Similarity=0.466 Sum_probs=182.3 Q ss_pred CcccCCC--hhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccc Q lcl|NC_015281. 1 MAYSNTP--ANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSF 71 (387) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f 71 (387) |.=+++- |+---|+.|..+. |||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++||| T Consensus 1 m~~~d~~lfa~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (254) T protein:vir:68 1 MATYDKNLFAKLENRGGYSQTNETEILNPFVNFNNYENSQTLADVLVAESIQMRGIECFYVPREYVAVDLIFGEDLKNKF 80 (254) T ss_pred CcccccceeeeecCCcchhhhhhccccceeEEeeccCchhHHHHHHHHHHHHHcCceEEEechhhhcccccccccccccc Confidence 6544432 2233355665544 999999999999999999999999999999999999999999999999999 Q ss_pred cceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeee Q lcl|NC_015281. 72 DSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVE 151 (387) Q Consensus 72 ~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve 151 (387) +|||+|||||+|||||+|+++||||||||++|||||+|||+||+|++++. ||||||||||||+|+||||+||| T Consensus 81 ~ka~~ieaYl~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE 153 (254) T protein:vir:68 81 TKAWKFAAYLNSFEGYEGAKSFFSNFGMQVQDEVTLSINPGLFKHQVNNQ-------EPKEGDLIYFPMDNSLFEINWVE 153 (254) T ss_pred ccceeEEEeeehhhccCCcccchhhcCceecceEEEEEcCchhhhhcCCC-------CCccccEEEEcCCCceEEEeccC Confidence 99999999999999999999999999999999999999999999999986 99999999999999999999999 Q ss_pred eCChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCC Q lcl|NC_015281. 152 AERPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGD 231 (387) Q Consensus 152 ~~~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G 231 (387) +++||||+||+|+|+++|++|+||+|+++++.+.++++.........+.--..-.|-.....-- ..-...--.-..- T Consensus 154 ~~~PFYQ~GKn~~~~l~ce~F~Ys~E~idt~i~~id~I~~~e~~~ldl~~i~~ldG~~di~~~~---~~E~~~~~~e~~~ 230 (254) T protein:vir:68 154 PYDPFYQVGKNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSDLELNPVRNLDGIHDINIDE---YSEVEQINSEASE 230 (254) T ss_pred CCCchhhhCCceEEEEEEEEEeeCCccccCCCcccCCccCcccCCcchhhHhhhcchhhccccc---hhhHHHHHhhhhh Confidence 9999999999999999999999999999999999988865443332211001111111111000 0000000000112 Q ss_pred eEEEEEEee-cCCCcccCceeEEEeCCCCCc Q lcl|NC_015281. 232 AVTSLAASN-GGQYYKAALPPTVTLTGGGGT 261 (387) Q Consensus 232 ~VtsItVtn-gGsGYts~~~ptVTisgg~Gt 261 (387) -|....|+| .|+. ++ | +..+-=+ T Consensus 231 f~e~~~~vn~~g~~---~~-p---f~~~~~~ 254 (254) T protein:vir:68 231 YVEPYVVVNNRGRQ---NS-P---FDDGFMN 254 (254) T ss_pred hcccceeecCCCCC---CC-c---ccccccC Confidence 234444443 3431 11 1 1111000 No 13 >protein:vir:101154 Length: 252 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932505;genbank:gi:37651631;genbank:GeneID:2610647 Probab=100.00 E-value=5.2e-91 Score=515.54 Aligned_cols=244 Identities=25% Similarity=0.343 Sum_probs=181.7 Q ss_pred CcccCCChhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccc Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDS 73 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~ 73 (387) |.=+.-=|.---|+.|..+. |||+++|++||+|+|+||+|||||||+||||||||+|++|+||+||++|||+| T Consensus 1 m~d~~lfa~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~k 80 (252) T protein:vir:10 1 MMDKSLFATLENRGGYMRTNEKNILNPYVKFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEK 80 (252) T ss_pred CCCccceeEecCCcchhhhhhhccccceeeecCccchhHHHHHHHHHHHHHcCceEEEechhhccccccccccccccccc Confidence 44333334444455565443 99999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeC Q lcl|NC_015281. 74 AYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAE 153 (387) Q Consensus 74 ~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~ 153 (387) ||+|||||+|||||+|+++||||||||++|||||+|||+||+|+++++ ||+|||||||||+|+||||+|||++ T Consensus 81 a~~ieaYl~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~ 153 (252) T protein:vir:10 81 AWKFAAWLNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGK-------EPALGDLIYMPMDNSLFEITWVEPY 153 (252) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCC Confidence 999999999999999999999999999999999999999999999987 9999999999999999999999999 Q ss_pred ChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeE Q lcl|NC_015281. 154 RPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAV 233 (387) Q Consensus 154 ~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~V 233 (387) +||||+||+|+|+++|++|+||+|+++++.+.++.+........ +..-..-.|....... ...-..+--....--| T Consensus 154 ~PFYQ~Gkn~~~~l~ce~F~Ys~E~idt~~~~id~Ie~~~s~ld-l~~i~~ldG~~Di~~~---~~~e~~~~~~e~~~~~ 229 (252) T protein:vir:10 154 SPFYQNGKNPIRVIVAQKFIYSGEKITPVVQEKPEIEDMYNGLD-LAPLLNLDGMIDQKID---QFAENIAVQQKVKQYA 229 (252) T ss_pred CchhhhCCCeEEEEEEEEEeeCCceecCcccccCccchhhhccc-hHHHhhcCCeeccccc---cchhhHHHHHhhhhhh Confidence 99999999999999999999999999999999988775444221 1000011111110000 0000000000011123 Q ss_pred EEEEEeec-CCCcccCceeEEEeCC Q lcl|NC_015281. 234 TSLAASNG-GQYYKAALPPTVTLTG 257 (387) Q Consensus 234 tsItVtng-GsGYts~~~ptVTisg 257 (387) ....++|+ |.+--++ |----.+ T Consensus 230 e~~~~i~~~~~~~~~~--pf~~~~~ 252 (252) T protein:vir:10 230 EPFDPISTNSFGNFDS--PFGKHEA 252 (252) T ss_pred ccceeecCCCCCCcCC--cccccCC Confidence 33333333 3221111 1111111 No 14 >protein:vir:101800 Length: 252 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238877;genbank:gi:66391952;genbank:GeneID:3416627 Probab=100.00 E-value=5.2e-91 Score=515.54 Aligned_cols=244 Identities=25% Similarity=0.343 Sum_probs=181.7 Q ss_pred CcccCCChhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccc Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDS 73 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~ 73 (387) |.=+.-=|.---|+.|..+. |||+++|++||+|+|+||+|||||||+||||||||+|++|+||+||++|||+| T Consensus 1 m~d~~lfa~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~k 80 (252) T protein:vir:10 1 MMDKSLFATLENRGGYMRTNEKNILNPYVKFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEK 80 (252) T ss_pred CCCccceeEecCCcchhhhhhhccccceeeecCccchhHHHHHHHHHHHHHcCceEEEechhhccccccccccccccccc Confidence 44333334444455565443 99999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeC Q lcl|NC_015281. 74 AYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAE 153 (387) Q Consensus 74 ~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~ 153 (387) ||+|||||+|||||+|+++||||||||++|||||+|||+||+|+++++ ||+|||||||||+|+||||+|||++ T Consensus 81 a~~ieaYl~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~ 153 (252) T protein:vir:10 81 AWKFAAWLNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGK-------EPALGDLIYMPMDNSLFEITWVEPY 153 (252) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCC Confidence 999999999999999999999999999999999999999999999987 9999999999999999999999999 Q ss_pred ChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeE Q lcl|NC_015281. 154 RPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAV 233 (387) Q Consensus 154 ~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~V 233 (387) +||||+||+|+|+++|++|+||+|+++++.+.++.+........ +..-..-.|....... ...-..+--....--| T Consensus 154 ~PFYQ~Gkn~~~~l~ce~F~Ys~E~idt~~~~id~Ie~~~s~ld-l~~i~~ldG~~Di~~~---~~~e~~~~~~e~~~~~ 229 (252) T protein:vir:10 154 SPFYQNGKNPIRVIVAQKFIYSGEKITPVVQEKPEIEDMYNGLD-LAPLLNLDGMIDQKID---QFAENIAVQQKVKQYA 229 (252) T ss_pred CchhhhCCCeEEEEEEEEEeeCCceecCcccccCccchhhhccc-hHHHhhcCCeeccccc---cchhhHHHHHhhhhhh Confidence 99999999999999999999999999999999988775444221 1000011111110000 0000000000011123 Q ss_pred EEEEEeec-CCCcccCceeEEEeCC Q lcl|NC_015281. 234 TSLAASNG-GQYYKAALPPTVTLTG 257 (387) Q Consensus 234 tsItVtng-GsGYts~~~ptVTisg 257 (387) ....++|+ |.+--++ |----.+ T Consensus 230 e~~~~i~~~~~~~~~~--pf~~~~~ 252 (252) T protein:vir:10 230 EPFDPISTNSFGNFDS--PFGKHEA 252 (252) T ss_pred ccceeecCCCCCCcCC--cccccCC Confidence 33333333 3221111 1111111 No 15 >protein:vir:106285 Length: 262 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944101;genbank:gi:38640145;genbank:GeneID:2658033 Probab=100.00 E-value=6.3e-91 Score=515.09 Aligned_cols=243 Identities=25% Similarity=0.364 Sum_probs=182.3 Q ss_pred Cc--------------ccCCChhhhhhccccceEEEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCccccc Q lcl|NC_015281. 1 MA--------------YSNTPANNCIQSDYDSACRININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEI 66 (387) Q Consensus 1 ~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~ 66 (387) |. =-|.|.++||++.| ||+++|.+||+|+|+||+|||||||+||||||||+|++|+||+|+ T Consensus 1 m~~~d~~lfa~le~~~gy~~~~~~~vlNPY-----fN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~gEd 75 (262) T protein:vir:10 1 MFARNQSMFAQLETGAGYNKTYQTNVLNPY-----VNKHEYEPTLSLHEMLVAESIQMTGVEMYYIRREFVNFDRIFGED 75 (262) T ss_pred CcccccceeeEecCCCcccCcchhccccce-----eccCCcCchhhHHHHHHHHHHHHcCceEEEcchhhhccccccccc Confidence 32 23678899998876 999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEE Q lcl|NC_015281. 67 QESSFDSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFE 146 (387) Q Consensus 67 ~~~~f~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfe 146 (387) ++|||+|||+|||||+|||||+|+++||||||||++|||||+|||+||++++++. ||||||||||||+|+||| T Consensus 76 ~~SkF~ka~~ieaYl~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~~-------rP~EGDLIYfPl~nsLFE 148 (262) T protein:vir:10 76 MQSKFKKTYKVAMYLESFDEYSGQRDFFSKFGMQVNDEITMSVSPKLFETQADGD-------RVKEGDLIYFPLNNSLFE 148 (262) T ss_pred cccccccceeEEEeeehhhccCCccceeeecCceecceEEEEEccchhhhhhcCC-------CCccccEEEEcCCCceEE Confidence 9999999999999999999999999999999999999999999999999999986 999999999999999999 Q ss_pred EEeeeeCChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEE Q lcl|NC_015281. 147 IKFVEAERPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAA 226 (387) Q Consensus 147 i~~ve~~~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata 226 (387) |+|||+++||||+||+|+|+++|++|+||+|+++++.+.++.+.....-...+... .|......--. ....+-- T Consensus 149 I~~VE~~~PFYQ~Gkn~~~~l~ce~F~Ys~E~i~~~i~~id~i~~e~~~l~~i~~l---Dg~~di~~~q~---~e~~~~~ 222 (262) T protein:vir:10 149 VTWVEPSSPVVKREQLAKYKVTAQKFIYSGEEIKPEFDPNRYVLGEDDPLSQIKAL---DGRADISLDEF---AEDDAFN 222 (262) T ss_pred EeeccCCCchhhhCCceEEEEEEEEEeeCCccccccCccccccccccccccccccc---cceeecccccc---cchhHHh Confidence 99999999999999999999999999999999999999998886543322222211 11111111000 0000000 Q ss_pred EEeCCeEEEEEEee-cCCCcccCce----eEEEeCCCCCcc Q lcl|NC_015281. 227 TITGDAVTSLAASN-GGQYYKAALP----PTVTLTGGGGTG 262 (387) Q Consensus 227 ~v~~G~VtsItVtn-gGsGYts~~~----ptVTisgg~Gtg 262 (387) .-..--|....++| .||=...-.+ |.-.+..-.. - T Consensus 223 ~e~~~fv~~~d~v~~~gsp~~~~~~~~~~~~~~fdd~~~-~ 262 (262) T protein:vir:10 223 EEAEDFVVEFDNIIGNGTPIAEHKPTKPAPVSAFDDLES-F 262 (262) T ss_pred hhhhhhcchhcccCCCCCcccccCCCCCCCCChhhhhhc-C Confidence 00001122222333 2332211111 1000110000 0 No 16 >protein:vir:107937 Length: 257 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595290;genbank:gi:161622596;genbank:GeneID:5783656 Probab=100.00 E-value=4.4e-90 Score=510.44 Aligned_cols=247 Identities=24% Similarity=0.421 Sum_probs=179.3 Q ss_pred CcccCCC--hhhhhhccccceE-------EEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccc Q lcl|NC_015281. 1 MAYSNTP--ANNCIQSDYDSAC-------RININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSF 71 (387) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~-------~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f 71 (387) |.=+++- |+---|+.|..+. |||||+|.+||+|+|+||+|||||||+||||||||+|++|+||+|+++||| T Consensus 1 m~~~d~~lfa~le~~~gy~~t~~~~~lnPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (257) T protein:vir:10 1 MATFDSSLFAKLENNTGYANTNETEIMNPFVNFYRHENTQTLADALVAESIQMRGIELYYIPREYVNPDQLFGEDLQNKF 80 (257) T ss_pred CcccccceeeeecCCcchhhhhhhccccceeecccCCchhHHHHHHHHHHHHHcCceEEEcchhhhcccccccccccccc Confidence 5544432 2223355665544 999999999999999999999999999999999999999999999999999 Q ss_pred cceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeee Q lcl|NC_015281. 72 DSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVE 151 (387) Q Consensus 72 ~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve 151 (387) ++||+|||||+|||||+|+++||||||||++|||||+|||+||+|++++. ||||||||||||+|+||||+||| T Consensus 81 ~ka~~ieaYl~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE 153 (257) T protein:vir:10 81 TKAWKFAGYLDSFEGYSGDNTYFSKFGMMVNDEVTITINPNLFKHQCNGT-------EPVSGDLIYFPMDNSLFEINWVQ 153 (257) T ss_pred ccceeEEEeeehhhccCCCcceeeecCceecceEEEEEccchhhhhccCC-------CCccccEEEEcCCCceEEEeccc Confidence 99999999999999999999999999999999999999999999999987 99999999999999999999999 Q ss_pred eCChhheeccceEEEEEEEEEecCccccccccccccceeeeeeeeEEEEeccCCccccccceeeeccCcCeEEEEEEeCC Q lcl|NC_015281. 152 AERPFYQLGKGYVWEMQCELFEYSDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGD 231 (387) Q Consensus 152 ~~~pf~q~g~~yv~~~~~~~F~ysgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G 231 (387) +++||||+||+|+|+++|++|+||+|++.++.+...+....-.....+--...-.|-.....-.... ...--..... T Consensus 154 ~~~PFYQ~Gkn~~~~l~ce~F~Ys~E~l~pel~~n~~~~V~e~~eldl~~~~~ldG~~di~~~~~~E---~~~~~~e~~~ 230 (257) T protein:vir:10 154 PYDPFYQVGTNVQRRITATKFIYNGEELRPELQRNEGINIPEFSELDLMPVKNIDGLADISDIQYEE---VNEINAEAAE 230 (257) T ss_pred CCCchhhhCCceEEEEEEEEeeeCCcccccccCCcccCCCCCccchhhhhhhhccchhhcCCchhhh---HHHHHHhhhh Confidence 9999999999999999999999999999888887765554433322111111111111100000000 0000000011 Q ss_pred eEEEE-EEeecCCCcccCceeEEEeCC Q lcl|NC_015281. 232 AVTSL-AASNGGQYYKAALPPTVTLTG 257 (387) Q Consensus 232 ~VtsI-tVtngGsGYts~~~ptVTisg 257 (387) -|..- .|.+.|+.-..++-..--.+. T Consensus 231 fi~p~~~~n~~g~~~~~~pf~~~~~~~ 257 (257) T protein:vir:10 231 FVHPYVVINGRGEDAPPTAFDDAFLDD 257 (257) T ss_pred hhccccccCCCCCCCCCCcccchhccC Confidence 12222 233445432211111101111 No 17 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=99.58 E-value=4.7e-16 Score=104.63 Aligned_cols=310 Identities=16% Similarity=0.146 Sum_probs=135.1 Q ss_pred CcccCCChhhhhhccccceEEEE-eccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCcccccccccccceeeEee Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSACRIN-INGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQESSFDSAYLCRA 79 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~ 79 (387) ++|+-++ +.-.|+-|.=+.|++ |.|+.+..+|+. -+|+-+ .|.+-==....+|+.. T Consensus 61 ~~~~e~~-~~~f~~~~~~~~y~~~~~~~~~~~~~~s--------kfg~~~---------~de~~~~~~~~~~~~~----- 117 (390) T protein:vir:10 61 TILNEVE-TSRFEQALSVRAYVNNVEGWEGQGDLLS--------KFGVRI---------EDKTTFIFSRKKFTTA----- 117 (390) T ss_pred ccccccc-ccccccceEEEEEeechhccCCccceee--------ecCcee---------cceEEEEECCcchhhh----- Confidence 2232221 122222333333432 333333333321 223111 0000000111222211 Q ss_pred ccchhhccCCcchhhhhCC-ceeeeeEEEEEccchhh-hhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeCChhh Q lcl|NC_015281. 80 YVNNVEGWEGQGELLSKFG-IRIEDKTTFVISRKKFT-EKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFY 157 (387) Q Consensus 80 y~~~~~~~~~~~~~~skfg-~~~~de~~~~~s~~~f~-~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~ 157 (387) ..++ ..+-+=+ ...-|.+=|-+.+++|+ ..|..+ +++=+-|++=-..+.=.||+-....-+.... T Consensus 118 ----~~~~----~~~~~~~~p~egdliy~p~~~~lfei~~ve~~-----~p~yq~G~nyt~~i~a~lf~ySge~iat~~s 184 (390) T protein:vir:10 118 ----VDDN----AVLNVEGRPNEGDLIWFPATRHLFEIKFVEAE-----RPFYQLGKGYVWECQCELFEYSDEDLDTGVA 184 (390) T ss_pred ----hCCc----ccccccCCCCCCceEEecCCCCEEEEEecCCC-----CCceEccCceeeeeEEeeeccCCcccccccc Confidence 1111 0111112 45567777878888886 233322 1111234443333333344443333222211 Q ss_pred eecc---ceEEEEEEEEEec-------CccccccccccccceeeeeeeeEEEEeccCCcccccccee--ee--ccCcCeE Q lcl|NC_015281. 158 QLGK---GYVWEMQCELFEY-------SDESIDTGVADIDAVETTFANSIKLVMDPGGSGDFSVGET--IT--GNLYTAV 223 (387) Q Consensus 158 q~g~---~yv~~~~~~~F~y-------sgE~~dtg~~ai~~~~~~~~s~~~~ti~~~Gsgy~~~~~~--~~--~~g~gat 223 (387) +... .+.-......-.. .......+..+........+....++++++|++|...+.+ .. .++.+++ T Consensus 185 eid~I~~~~~~~v~~~~~t~g~~~~t~~~~v~~~g~ga~~~a~v~~g~Vt~vtItn~GsGYt~~~~ptVtisgg~gtgAt 264 (390) T protein:vir:10 185 EIDAIETAFANAIKLVMDAGGTGAFTVGEEIVGDLYLATATATISGDAVDAVTVTDGGEHYKSALPPTVTITGGGGSGAT 264 (390) T ss_pred ccccccccccceeeeeeccCCcccccccceeeecCcceeEEEEecCCeEEEEEEeeCCCCcccCceeEEEecCCCCccce Confidence 1110 0000111000000 0011111111111122233455678999999999987543 22 2355666 Q ss_pred EEEEE-eCCeEEEEEEeecCCCcccCceeEEEeCCCCCcceEEEEeeccccceeEEEEecCCCCcccCcEEEEeCCCCCc Q lcl|NC_015281. 224 AAATI-TGDAVTSLAASNGGQYYKAALPPTVTLTGGGGTGATATATVSDAGLVTGFTVTAGGSGYTSAPTVVIQESPKDI 302 (387) Q Consensus 224 ata~v-~~G~VtsItVtngGsGYts~~~ptVTisgg~GtgAtatatV~~~G~Vt~ItItn~GsGYts~ptVtI~g~g~ga 302 (387) +++++ .+|.|++|+|+|+|+||+.+ |+|++++++ .++++.+.+. ++.|+.|+|+++|+||+.+|+|++.+++.++ T Consensus 265 ~tatv~~~G~VtsItItn~GsGYt~~--PtVtI~g~g-~~~~a~~~~~-~g~v~~i~Itn~GsgYtt~p~vt~~~~G~~~ 340 (390) T protein:vir:10 265 ATATVSSAGIVTGITITSGGTGYTSA--PTVTIDYSP-KDNRAEVKSW-NASTRELQVINRTGTFNTAEVITGLTSGAKW 340 (390) T ss_pred eeeeecccceEEEEEEecCCccccCC--CEEEEeCCC-CCceeEEEEe-ccEEEEEEEecCCcceeeccEEEEecCCcce Confidence 66655 47899999999999999876 678887554 4455666554 6899999999999999999999998776655 Q ss_pred eEEEEEecccceeEEEEcccceEEecCc-e-ecccccceeEeeccceEeeccc Q lcl|NC_015281. 303 HAEVKSWNNATRELQIINRTGTFNVAEY-L-KGETSGALWSPESYNTLNNTNS 353 (387) Q Consensus 303 ~atv~~~~~~~~~~~i~n~~~~~tv~~~-~-tG~tsgat~tv~t~~s~~~t~~ 353 (387) .+....................+..... . +.....+...+++. +++.. T Consensus 341 ~~~~~~t~~~~~~~~~~~~~~~~~t~~~~ii~~t~gn~~g~v~n~---T~t~v 390 (390) T protein:vir:10 341 SPESYNTLNNTNTADTIDQNYSFETADDDIIDFTEVNPFGNIGST---TDTTI 390 (390) T ss_pred EEEEEEecccceeeeeecccceeEeCCCceEeecccCcccccccc---eeccC Confidence 4433222222222221222222222111 1 11111111111111 11000 No 18 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=99.01 E-value=3.4e-11 Score=77.99 Aligned_cols=331 Identities=15% Similarity=0.125 Sum_probs=126.5 Q ss_pred hcCccEEEeeeeeec----cCCcccccccccccceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcC Q lcl|NC_015281. 44 IYGQAIYYIPRTRVT----TDDVLNEIQESSFDSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVD 119 (387) Q Consensus 44 ~~g~~~~y~~r~~~~----~d~~~~e~~~~~f~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~ 119 (387) |- .--|+. +..-+ .|.|..|..|-+=.++|=|..=+-+-+--= .++..|||--.-. +.+++. .|+.+-. T Consensus 1 ~~-~~~~~~-~~~~~~~~~~~~~~~e~~~~~g~~~~~~~~~~~~~~~~~-~~~~~~~f~~~~~--~~~~~~--~~~~~~~ 73 (470) T protein:vir:10 1 MA-LNPFFL-QGTSSEQRLTQDLINEHLKIYGVEVTYIPRKYVNTKSII-EEVQSSKFDDNFA--IEAYVN--TYEGYGG 73 (470) T ss_pred Cc-ccceeE-cCCCchhHHHHHHHHHHhHhccceEEEechhhccccccc-cccccccccccee--EEEEee--cccCcCC Confidence 10 000111 00000 122223333322223333322111111111 1445666652221 122221 1111111 Q ss_pred CcccccCCCCCccccEEEEcCCCcEEE-----------------EEeeeeCCh-----hheeccceEEEEEEEEEecCcc Q lcl|NC_015281. 120 DNVTLAVEGRPNEGDLIWFPVTKHLFE-----------------IKFVEAERP-----FYQLGKGYVWEMQCELFEYSDE 177 (387) Q Consensus 120 ~~~~~~~~~~p~egdliy~p~~~~lfe-----------------i~~ve~~~p-----f~q~g~~yv~~~~~~~F~ysgE 177 (387) + .++....-.+.=|-+=|=..+++|| .+.+-...| +|---.+.+|+++ ..+...+ T Consensus 74 ~-~~~~~~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~--~ve~~~p 150 (470) T protein:vir:10 74 Q-GDVLTKFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNEEVILATRPREGDLVFFPLGSRLFEVK--FVEHEDP 150 (470) T ss_pred c-ceeeeecCcccceEEEEEECCccccccccchhhcccCcccccccccccCCcccccEEEecCCCCEEEEE--ecCCCCc Confidence 1 1111111222234444555555554 111111112 1212234566666 3344444 Q ss_pred ccccccccccceeeee----e-----eeEEEEeccCCccccccceeeeccCcCeEEEEEEeCCeEEEEEEeecCCCcccC Q lcl|NC_015281. 178 SIDTGVADIDAVETTF----A-----NSIKLVMDPGGSGDFSVGETITGNLYTAVAAATITGDAVTSLAASNGGQYYKAA 248 (387) Q Consensus 178 ~~dtg~~ai~~~~~~~----~-----s~~~~ti~~~Gsgy~~~~~~~~~~g~gatata~v~~G~VtsItVtngGsGYts~ 248 (387) -+..+........... . ..........+.++...... ...+..+..++....+.|+.++++++|++|+.. T Consensus 151 ~~~~G~~~~~~it~~~f~ysge~~s~~v~~~~~~~~~~g~~~t~t~-~~~g~~~~~t~~~~~g~vt~ititn~Gsgyt~~ 229 (470) T protein:vir:10 151 FYQLGKNYVYQLKCELFEYEDEVIDTSIDAIDTVVQDDGYISKLQL-VGIGRTAEVAASIGVGYVREIFLNNDGSGFTSP 229 (470) T ss_pred chhcCcceeEEeeeceeEecCCccccceecccccccccccceeeee-cCCCccceeeeeecceeeeEeEeeccccceecc Confidence 4444443332111110 0 01111222223333322222 234445566677788999999999999999865 Q ss_pred ceeEEEeCCCCCc----ceEEEEeeccccceeEEEEecCCCCcccCcEEEEeCCCC-CceEEE--EEecccceeEEEEcc Q lcl|NC_015281. 249 LPPTVTLTGGGGT----GATATATVSDAGLVTGFTVTAGGSGYTSAPTVVIQESPK-DIHAEV--KSWNNATRELQIINR 321 (387) Q Consensus 249 ~~ptVTisgg~Gt----gAtatatV~~~G~Vt~ItItn~GsGYts~ptVtI~g~g~-ga~atv--~~~~~~~~~~~i~n~ 321 (387) |+|++.++... .+....++...+.++.++++++|+||+.+|+|++.++.. ++.+.. ...........+.+. T Consensus 230 --ptVti~~~~~~~~~~a~~~~~t~~~~g~vt~ititn~Gsgytt~ptvt~~~~~g~ga~at~~~~~~~~g~~~itit~~ 307 (470) T protein:vir:10 230 --PTITFSASPAFTDARAVGILTTRANVTSIEKILMTSAGAGYITPPTITISGGGGTGAAATCSIETVYQGVVNFNVVDG 307 (470) T ss_pred --CEEEEccCCCCCCccceeeEeecceeeEEEEEEEecCcccccccceEEEccCCCccceeeeeecccccceeeEEEccC Confidence 66776654321 122223455568899999999999999999999986543 222222 223334445566666 Q ss_pred cceEEecCceec------ccccceeE---ee---ccceEee--ccccccccceEEecCCceEEecccC------ccc--- Q lcl|NC_015281. 322 TGTFNVAEYLKG------ETSGALWS---PE---SYNTLNN--TNSTYDQNSLFETLDDDIIDWTEGN------PFG--- 378 (387) Q Consensus 322 ~~~~tv~~~~tG------~tsgat~t---v~---t~~s~~~--t~~~~s~~~~i~t~~D~iidftegN------p~g--- 378 (387) +.+|+..+.++. ........ +. ...++++ .+..|...+.+....-.....+... -.+ T Consensus 308 GsgYtt~ptvtit~~~sg~~a~~~a~~~~~~~~g~itsititn~Gsgyts~ptv~i~~~~~~~~~~t~~~~~~~tg~tsg 387 (470) T protein:vir:10 308 GVGYGTEPSIAVTQPGAGTTAVGIASIGMAGSDQVLKSVYIGNPGRGYTATPNVIVADPPSMSGIGTFTFNEVIKGSRSG 387 (470) T ss_pred CccccccceEEEecCCCCCcccceeEEEeecccceeeeEEeccCCcceeccceeEeecCccccccceeeeeeeeeccccc Confidence 666554433221 11111111 00 1111111 1111222222211110000000000 000 Q ss_pred ----cccCCCCCC Q lcl|NC_015281. 379 ----YTGNDSDTF 387 (387) Q Consensus 379 ----~~g~~~~~~ 387 (387) .......+. T Consensus 388 t~~~~~~~~~~t~ 400 (470) T protein:vir:10 388 TEARVKSWDDDTK 400 (470) T ss_pred eeeeeeeecccce Confidence 000000011 No 19 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=45.49 E-value=0.77 Score=21.21 Aligned_cols=121 Identities=17% Similarity=0.155 Sum_probs=72.6 Q ss_pred EEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCCccccccc-ccccceeeEeeccchhhccCCcchhhhhCCce Q lcl|NC_015281. 22 ININGSEQEQVFFENLIVESIEIYGQAIYYIPRTRVTTDDVLNEIQE-SSFDSAYLCRAYVNNVEGWEGQGELLSKFGIR 100 (387) Q Consensus 22 ~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~-~~f~~~~~~~~y~~~~~~~~~~~~~~skfg~~ 100 (387) .++ +..-|. +..++|+-+|++|.+.++.-..-|+-.++... ..=...|...+.+.+|+--.=++.+ +| T Consensus 1 M~~--y~~~~~----~a~~Li~kfG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~~~idGtl-----I~ 69 (122) T protein:vir:97 1 MAR--FDSAIA----LAKKLIKKNGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQRYIDGQT-----IR 69 (122) T ss_pred Ccc--chHHHH----HHHHHHHHhCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccchhhccCcE-----Ee Confidence 111 222233 45566667999999888877666766665322 2224678899999888764433332 45 Q ss_pred eeeeEEEEEccchhhhhcCCcccccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheeccceEEEEEEEE Q lcl|NC_015281. 101 IEDKTTFVISRKKFTEKVDDNVTLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLGKGYVWEMQCEL 171 (387) Q Consensus 101 ~~de~~~~~s~~~f~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g~~yv~~~~~~~ 171 (387) ..|..-+...+.. ...|+.||+|-+ +..-|.|-.|++..|-= .--.|++.+-+ T Consensus 70 ~GD~~l~~~a~~~-------------~~~P~~gD~v~~--~g~~~~Vi~v~~i~pa~---~~v~y~lqlRk 122 (122) T protein:vir:97 70 MGDQRVFMPAEGL-------------TAPPEVEGLVLR--GLEVWKVIAVKPLNPNG---QAIMYELQVRQ 122 (122) T ss_pred ecCEEEEEeeCCC-------------ccccccCCEEEe--CCEEEEEEeccccCCCC---ceEEEEEEeeC Confidence 5554444332221 237999999976 66689999999876652 22334444333 No 20 >protein:vir:1385 Length: 107 # NCBI annotation: Gp8 protein # Family: family:all:3858 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612837;genbank:gi:20065971;genbank:GeneID:935786 Probab=30.92 E-value=1.5 Score=19.56 Aligned_cols=106 Identities=16% Similarity=0.214 Sum_probs=70.6 Q ss_pred hcCc-cEEEeeeeeeccCCcccccccccccceeeEeeccchhhccCCcchhhhhCCceeeeeEEEEEccchhhhhcCCcc Q lcl|NC_015281. 44 IYGQ-AIYYIPRTRVTTDDVLNEIQESSFDSAYLCRAYVNNVEGWEGQGELLSKFGIRIEDKTTFVISRKKFTEKVDDNV 122 (387) Q Consensus 44 ~~g~-~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y~~~~~~~~~~~~~~skfg~~~~de~~~~~s~~~f~~~~~~~~ 122 (387) |.+- -|....++- .+|. .|... ..+...+++-|+++..-| .++++-=+++....+.|.|. |..-++.. T Consensus 1 ~~~~hRI~i~~~~~-~~D~-~G~~~-~~w~~~~~~WA~v~~~~g----~E~~~a~~~~~~~~~~f~iR---y~~~i~~~- 69 (107) T protein:vir:13 1 MARYERISIKKLEE-KNIK-GRRQE-ECLIPFYDCWAEILDLYG----QELYGALQMKLENTIIFKIR---YCKKVEEL- 69 (107) T ss_pred CCcceEEEEEeeee-eeCC-CCCee-cceEeEEEEEEEEecCCc----hheeecceeheeeeEEEEEE---ecCCcccc- Confidence 6654 455554444 4774 45443 468999999999998754 56666666777777777773 33333322 Q ss_pred cccCCCCCccccEEEEcCCCcEEEEEeeeeCChhheeccceEEEEEEEEEe Q lcl|NC_015281. 123 TLAVEGRPNEGDLIWFPVTKHLFEIKFVEAERPFYQLGKGYVWEMQCELFE 173 (387) Q Consensus 123 ~~~~~~~p~egdliy~p~~~~lfei~~ve~~~pf~q~g~~yv~~~~~~~F~ 173 (387) ++..++-|.+ .+++|+|..|.+. ..++=..++.|+... T Consensus 70 ------~~t~~~Ri~~--~g~~y~I~~v~~~-----~~~~~~l~i~c~eV~ 107 (107) T protein:vir:13 70 ------RNKENFIVEW--QGRKYEIYYPDFL-----GYNKQFVKLKCKEVL 107 (107) T ss_pred ------ccCcCcEEEE--CCeEEEEEecCCc-----ccCCeEEEEEEEEeC Confidence 4556677766 7889999999862 334456789999887 No 21 >protein:vir:80941 Length: 135 # NCBI annotation: gp11 # Family: family:all:4899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468397;genbank:gi:157324971;genbank:GeneID:5601374 Probab=27.48 E-value=1.7 Score=19.32 Aligned_cols=86 Identities=15% Similarity=0.301 Sum_probs=48.8 Q ss_pred CcccCCChhhhhhccccceEEEEeccccc-hhHHHHHH----------HHHHHHhcCccEEEeeeeeeccCCcc--cccc Q lcl|NC_015281. 1 MAYSNTPANNCIQSDYDSACRININGSEQ-EQVFFENL----------IVESIEIYGQAIYYIPRTRVTTDDVL--NEIQ 67 (387) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~q~l~~~l----------v~e~i~~~g~~~~y~~r~~~~~d~~~--~e~~ 67 (387) +.-.|...|-|-.-.|+-++-||+++++- |-..++-| -.|+||-..-....--+|--..-.+. .+|- T Consensus 37 lltpnndkqgyqdgsyersfsfnln~sskqemk~~~vlnai~ayfdn~e~~siqs~n~sfvledkettsv~n~vs~sddg 116 (135) T protein:vir:80 37 LLTPNNDKQGYQDGSYERSFSFNLNGSSKQEMKVLNVLNAITAYFDNTELESIQSLNNSFVLEDKETTSVANLVSASDDG 116 (135) T ss_pred EEccCCccccccCCccceeeeeeccCcchhhhHHHHHHHhHHhhcccchhhhhhhcCCcEEeecccccceeeeeeecCCc Confidence 33345555667777899999999999764 33333322 34666654433332222221111111 1233 Q ss_pred cccccceeeEeeccchhhc Q lcl|NC_015281. 68 ESSFDSAYLCRAYVNNVEG 86 (387) Q Consensus 68 ~~~f~~~~~~~~y~~~~~~ 86 (387) ..-+...|+|..|+++-|- T Consensus 117 tfiysa~fkiklyieseek 135 (135) T protein:vir:80 117 TFIYSASFKIKLYIESEEK 135 (135) T ss_pred cEEEecceEEEEEEeccCC Confidence 3345688999999998876 Done!