Query         lcl|NC_021559.1_cdsid_YP_008129936.1 [gene=PRAG_00003] [protein=hypothetical protein] [protein_id=YP_008129936.1] [location=3308..4234]
Match_columns 308
No_of_seqs    45 out of 49
Neff          4.4
Searched_HMMs 1612
Date          Thu Nov 7 16:31:45 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr

 No Hit                               Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:104476 Length: 308   100.0  2E-167  1E-170  934.5  27.5  308    1-308     1-308 (308)
  2 protein:vir:106986 Length: 292   100.0  1E-142  9E-146  798.2  25.2  292    6-307     1-292 (292)
  3 protein:vir:103005 Length: 390   100.0  3E-118  2E-121  665.2  25.5  308    1-308     1-390 (390)
  4 protein:vir:103452 Length: 256   100.0  3E-111  2E-114  626.4  16.0  235    1-255    20-256 (256)
  5 protein:vir:7199 Length: 256 #   100.0  5E-111  3E-114  625.2  16.3  235    1-255    20-256 (256)
  6 protein:vir:98258 Length: 248    100.0  6E-111  4E-114  625.0  13.2  228    1-249    20-248 (248)
  7 protein:vir:6890 Length: 254 #   100.0  4E-109  3E-112  614.8  13.2  235    1-255    20-254 (254)
  8 protein:vir:100535 Length: 253   100.0  3E-108  2E-111  610.3  14.5  233    1-254    18-253 (253)
  9 protein:vir:80995 Length: 246    100.0  4E-108  2E-111  609.5  13.9  228    1-248    18-246 (246)
 10 protein:vir:6590 Length: 246 #   100.0  8E-108  5E-111  607.8  13.7  228    1-248    18-246 (246)
 11 protein:vir:101154 Length: 252   100.0  1E-107  7E-111  606.8  12.6  233    1-258    18-252 (252)
 12 protein:vir:101800 Length: 252   100.0  1E-107  7E-111  606.8  12.6  233    1-258    18-252 (252)
 13 protein:vir:5658 Length: 278 #   100.0  9E-106  6E-109  596.5  14.3  244    1-265    23-278 (278)
 14 protein:vir:107937 Length: 257   100.0  1E-105  9E-109  595.3  11.5  237    1-273    20-257 (257)
 15 protein:vir:106285 Length: 262   100.0  5E-104  3E-107  587.1  13.4  242    1-284    20-262 (262)
 16 protein:vir:104739 Length: 470   100.0 1.3E-97  8E-101  551.7  21.4  283    6-300     1-470 (470)
 17 protein:vir:97237 Length: 122     26.6     1.9  0.0012   19.0  10.6  121    8-171     1-122 (122)

No 1
>protein:vir:104476 Length: 308 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214650;genbank:gi:61806291;genbank:GeneID:3294531 Probab=100.00 E-value=2e-167 Score=934.50 Aligned_cols=308 Identities=97% Similarity=1.414 Sum_probs=307.4 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) |+|+|+|+||||||||.+++|||+|||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 1 ~~~~~~~~~py~~~~~~~~~~~n~~~~~~eQ~L~e~LV~EsIqm~G~dvyYlpRe~v~~D~i~~Ed~~skF~~a~~ieaY 80 (308) T protein:vir:10 1 MAIQNSPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNRDSVFEEDSDGKFESAKAIRAY 80 (308) T ss_pred CccccCCCCCcccccccccceEEEeccCchhHHHHHHHHHHHHhcCceEEEechhhcccccccccccccccccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+|||||+|+++||||||||++|||||+|||+||+++++++.++.+++||||||||||||+|+|||||||||++||||+| T Consensus 81 ~~~~egy~g~~~~~SKFG~~~~DE~t~~is~~rF~~~v~~~~~~~~~~rP~EGDLIYfPl~~~lFEI~~VE~~~PFyQ~G 160 (308) T protein:vir:10 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) T ss_pred eechhccCCCcceeeecCceecceEEEEEccchhhhhcCCccccccCCCCccccEEEecCCCceEEEEcccCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||||+|+||+||+|+|+|||++||+|+++++++|+|+|+++++|+++++|+|+|++++++|||++|++++++|+|+| T Consensus 161 k~~~~~l~ce~F~Ys~E~~~~~i~~iD~i~~~~~~~LdL~pIs~l~G~fdInE~v~gest~itAEv~~wds~v~~ItV~N 240 (308) T protein:vir:10 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTVGETITGGTSNVTAEVKSFDASTRTLIVIN 240 (308) T ss_pred CceEEEEEEEEEeeCCcccccCCccccccccccccceeeeeeccCCccccccceecccccceEEEEEEecCCceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCeecCceeeecCCCceeEEEEeeeeecCCCcccCcceeecCCCceEecccCCcceeecccccccC Q lcl|NC_021559. 241 RSGTFTVPETITGGTSSASWTTATYNTIDNQNTEFDQNNDFETLDNQIIDFTESNPFGSVGSITDNTI 308 (308) Q Consensus 241 ~gg~fts~eTItgsts~As~~~~s~~tl~~~~~~~~~n~~~e~~~D~i~dfte~Npfg~~g~~~~~~~ 308 (308) +||+|++||+|||++|||+|++++++++++++++|++|++|||+||+||||||+||||++||+||||| T Consensus 241 ~gGsftspptItGsts~a~~t~~s~~~~~nt~~~~~~n~~fet~~D~iiDftE~NPFG~~g~~t~~~~ 308 (308) T protein:vir:10 241 RSGTFTVPETVTGGTSSASWTTATYNTIDNQNLDYDQNNDFETLDNQIIDFTEANPFGSVGSITDNTI 308 (308) T ss_pred CCCceeeCcEEEeccCCceeEEEeeeecccCCCcccCCcceeeccCcEEeeccCCCCccccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:106986 Length: 292 # NCBI annotation: neck protein gp14 # Family: family:all:1104 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195128;genbank:gi:58532905;uniprot:Q5GQV9;genbank:GeneID:3260483 Probab=100.00 E-value=1.5e-142 Score=798.15 Aligned_cols=292 Identities=46% Similarity=0.851 Sum_probs=286.4 Q ss_pred ccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeeccchhc Q lcl|NC_021559. 
6 TPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAYVNNVE 85 (308) Q Consensus 6 ~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY~~n~e 85 (308) -|+||||+++| |||.+||+|+|+||+|||||||+|||||||++|+ |++|+|+++|||++||+|||||+||| T Consensus 1 m~~npyfn~~~--------~~~~~eQ~L~~~LV~Esiq~~G~dvyYlpRe~~~-d~~~~E~~~skF~~a~~ieaY~~~~e 71 (292) T protein:vir:10 1 MPTSPYFPSYY--------SGYSGEQNLVQDLVDEQIKLFGTDIYYLPRTILR-DNTLDDVIYNKFERQFQVEMLLQNVE 71 (292) T ss_pred CCcCccccccc--------cCcCchhHHHHHHHHHHHHhcCceEEEechhhhc-ccccccccccccccceeEEEEeechh Confidence 69999997766 9999999999999999999999999999999999 99999999999999999999999999 Q ss_pred ccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhCCceEE Q lcl|NC_021559. 86 GWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLGRNYVW 165 (308) Q Consensus 86 g~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~gk~~v~ 165 (308) ||+|+++||||||||++|||||+|||+||++++++. ++.+++||||||||||||+|+|||||||||++||||+||+||| T Consensus 72 g~~g~~~~~sKFG~~~~De~t~~is~~~f~~~~~~~-~~~~~~~P~eGDLIYfPl~~~lFEI~~ve~~~PfyQ~gk~~~~ 150 (292) T protein:vir:10 72 GFGSPSEFISKFGLRITDEVRFIVSQRRWDEEAVNY-DLNVNGRPNEGDLLYFPLTQDIYEIKFVEREDPFYQLGKNYFY 150 (292) T ss_pred ccCCCcceeeecCceecceEEEEEccchhhhhcCcc-cccccCCCccccEEEEcCCCcEEEEEcccCCCchhhhCCceEE Confidence 999999999999999999999999999999999975 8889999999999999999999999999999999999999999 Q ss_pred EEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEeCCCCe Q lcl|NC_021559. 
166 ECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVINRSGTF 245 (308) Q Consensus 166 ~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n~gg~f 245 (308) ||+|+||+||+|+|+||++++|+|+++++++|+|+|+++++|+++++|+|+|+.++++|||++|+++++.|+|+|++|.| T Consensus 151 ~l~~~~F~Ys~E~idtgl~eiD~i~~~~sseLdL~pi~~g~G~f~inE~vtge~sg~~AEv~sw~~~t~~L~V~n~~GsF 230 (292) T protein:vir:10 151 IMTAEIYEYGSDNISTGVEEIDELETLFSSAIAIALSIGGTGDFDLGEIVTGGISGTEAEVKSWDSSSRILQVINRTGTF 230 (292) T ss_pred EEEEEEEeecCceecCCCCcccccccccccccceeecccCCccccCCceeeecccceEEEEEEccCCCceEEEEeCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCceeeecCCCceeEEEEeeeeecCCCcccCcceeecCCCceEecccCCcceeeccccccc Q lcl|NC_021559. 246 TVPETITGGTSSASWTTATYNTIDNQNTEFDQNNDFETLDNQIIDFTESNPFGSVGSITDNT 307 (308) Q Consensus 246 ts~eTItgsts~As~~~~s~~tl~~~~~~~~~n~~~e~~~D~i~dfte~Npfg~~g~~~~~~ 307 (308) +++++|+|.+|||+|+++++++++.++.++++|.+||+.+|+||||||+||||++||+|-.- T Consensus 231 ~T~e~i~G~~Sga~~~v~si~~~~g~~~t~a~~~~iEt~~d~i~df~e~npfg~~~~~~~~~ 292 (292) T protein:vir:10 231 EEGESVTGNDSGSVWVVDSFDTLNNTNSEYDQNREIESTADTIIDWSESNPFGEYGNFTGSI 292 (292) T ss_pred ccCceeeEeecCCeEEeeEEEEeCCCCCcccccceeccccCcEEeeccCCcCcccccccccC Confidence 99999999999999999999999999999999999999999999999999999999998754 No 3 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=100.00 E-value=2.7e-118 Score=665.17 Aligned_cols=308 Identities=63% Similarity=1.056 Sum_probs=259.9 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) |+|||+|.+.|.|++|.+..|||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (390) T protein:vir:10 1 MTYSNDPPNNCIQSDYTSSCRLNLNGSAQEQTFMENLIVESIELYGQNVYYLPRIYVNRDTILNEVETSRFEQALSVRAY 80 (390) T ss_pred CeecCCCcccceecceeeccEEEEeccCchhHHHHHHHHHHhHhcCceEEEechheeccccccccccccccccceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+|||||+|+++||||||||++|||||+|||+||+++++++..+.+++||+|||||||||+|+||||+||||++||||+| T Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~~~~~~~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~p~yq~G 160 (390) T protein:vir:10 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRKKFTTAVDDNAVLNVEGRPNEGDLIWFPATRHLFEIKFVEAERPFYQLG 160 (390) T ss_pred eechhccCCccceeeecCceecceEEEEECCcchhhhhCCcccccccCCCCCCceEEecCCCCEEEEEecCCCCCceEcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEE- Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVI- 239 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~- 239 (308) |+|+|+|+|++|+||+|+++++++++|.|.......+.+.+..++.+.++....+.+.+..+++.++.+++.+..++|+ T Consensus 161 ~nyt~~i~a~lf~ySge~iat~~seid~I~~~~~~~v~~~~~t~g~~~~t~~~~v~~~g~ga~~~a~v~~g~Vt~vtItn 240 (390) T protein:vir:10 161 KGYVWECQCELFEYSDEDLDTGVAEIDAIETAFANAIKLVMDAGGTGAFTVGEEIVGDLYLATATATISGDAVDAVTVTD 240 (390) T ss_pred CceeeeeEEeeeccCCccccccccccccccccccceeeeeeccCCcccccccceeeecCcceeEEEEecCCeEEEEEEee Confidence 9999999999999999999999999999977766666655555555544444444444333444443333333433333 Q ss_pred -------------------------------------------------------------------------------e Q lcl|NC_021559. 240 -------------------------------------------------------------------------------N 240 (308) Q Consensus 240 -------------------------------------------------------------------------------n 240 (308) | T Consensus 241 ~GsGYt~~~~ptVtisgg~gtgAt~tatv~~~G~VtsItItn~GsGYt~~PtVtI~g~g~~~~a~~~~~~g~v~~i~Itn 320 (390) T protein:vir:10 241 GGEHYKSALPPTVTITGGGGSGATATATVSSAGIVTGITITSGGTGYTSAPTVTIDYSPKDNRAEVKSWNASTRELQVIN 320 (390) T ss_pred CCCCcccCceeEEEecCCCCccceeeeeecccceEEEEEEecCCccccCCCEEEEeCCCCCceeEEEEeccEEEEEEEec Confidence 3 Q ss_pred CCCCeecCceeeecCCCceeEEEEeeeeecCCC--cccCcceeecCCCceEecccCCcceeecccccccC Q lcl|NC_021559. 241 RSGTFTVPETITGGTSSASWTTATYNTIDNQNT--EFDQNNDFETLDNQIIDFTESNPFGSVGSITDNTI 308 (308) Q Consensus 241 ~gg~fts~eTItgsts~As~~~~s~~tl~~~~~--~~~~n~~~e~~~D~i~dfte~Npfg~~g~~~~~~~ 308 (308) .|++|+++|+|+++.+|+.+.+++..++.+... 
.++++..+++.++.||||+++||||.+||.|++|| T Consensus 321 ~GsgYtt~p~vt~~~~G~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~~ii~~t~gn~~g~v~n~T~t~v 390 (390) T protein:vir:10 321 RTGTFNTAEVITGLTSGAKWSPESYNTLNNTNTADTIDQNYSFETADDDIIDFTEVNPFGNIGSTTDTTI 390 (390) T ss_pred CCcceeeccEEEEecCCcceEEEEEEecccceeeeeecccceeEeCCCceEeecccCcccccccceeccC Confidence 333333333333444444444444444444443 36888999999999999999999999999999999 No 4 >protein:vir:103452 Length: 256 # NCBI annotation: head completion # Family: family:all:1104 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803104;genbank:gi:116326384;genbank:GeneID:4405481 Probab=100.00 E-value=3.2e-111 Score=626.36 Aligned_cols=235 Identities=32% Similarity=0.519 Sum_probs=223.2 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |++||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 20 ~~~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (256) T protein:vir:10 20 QTNETEILNPYV----------NFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKFTKAWKFAAY 89 (256) T ss_pred hhhhhcccceee----------eeeccCchhhHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEE Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+|||||+|+++||||||||++|||||+|||+||++++++. 
||||||||||||+|+|||||||||++||||+| T Consensus 90 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (256) T protein:vir:10 90 LNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGK-------EPKEGDLIYFPMDNSLFEINWVEPYDPFYQLG 162 (256) T ss_pred eehhhccCCccccceecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEEeccCCCchhhhC Confidence 99999999999999999999999999999999999999985 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeE-EEEE Q lcl|NC_021559. 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRT-LIVI 239 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~-l~V~ 239 (308) |+|||+|+|+||+||+|+|+||++++|+|++++++.|+|+|+++++|++|+++.++++.+.+.+|++.| +++ ++|+ T Consensus 163 kn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~eldl~pi~~ldG~~di~~~~~~e~~~~~~e~~~~---v~~~~~in 239 (256) T protein:vir:10 163 QNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNPVRNLNGIHDINIDQYAEVDQINSEAKEY---VEPYVVVN 239 (256) T ss_pred CCeEEEEEEEEEeeCCceecccCCCCccccCCccchhhhccccccccccccCccccccchhhhhccccc---cccceEec Confidence 999999999999999999999999999999999999999999999999999999999999999999998 555 5667 Q ss_pred eCCCCeecCceee-ecC Q lcl|NC_021559. 240 NRSGTFTVPETIT-GGT 255 (308) Q Consensus 240 n~gg~fts~eTIt-gst 255 (308) ++|++++++|.=- +-. T Consensus 240 ~~G~~~~~~pfd~~~~~ 256 (256) T protein:vir:10 240 NRGKSFESSPFDNDFMD 256 (256) T ss_pred CCCCCCcCCCccccccC Confidence 8888888777321 111 No 5 >protein:vir:7199 Length: 256 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049773;genbank:gi:9632588;genbank:GeneID:1258695 Probab=100.00 E-value=5.2e-111 Score=625.21 Aligned_cols=235 Identities=32% Similarity=0.520 Sum_probs=223.2 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |+|||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 20 ~~~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (256) T protein:vir:71 20 QTNETEILNPYV----------NFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKFTKAWKFAAY 89 (256) T ss_pred hhhhhcccceee----------eeeccCchhhHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEE Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+|||||+|+++||||||||++|||||+|||+||++++++. ||||||||||||+|+|||||||||++||||+| T Consensus 90 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (256) T protein:vir:71 90 LNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGK-------EPKEGDLIYFPMDNSLFEINWVEPYDPFYQLG 162 (256) T ss_pred eehhhccCCccccceecCceecceEEEEEccchhhhhhcCC-------CCccccEEEEcCCCcEEEEEcccCCCchhhhC Confidence 99999999999999999999999999999999999999985 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeE-EEEE Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRT-LIVI 239 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~-l~V~ 239 (308) |+|||+|+|+||+||+|+|+||++++|+|++++++.|+|+++++++|.+|++++++++.+.+++|++.| +++ ++|+ T Consensus 163 kn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~e~~eldl~~i~~ldg~~di~~~~~~e~~~i~~e~~~~---ve~~~~in 239 (256) T protein:vir:71 163 QNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNAVRNLNGIHDINIDQYAEVDQINSEAKEY---VEPYVVVN 239 (256) T ss_pred CCeEEEEEEEEEecCCceecccCCCCCcccCCcccccccccccccCcccccccccccchhhhhhccccc---cccceeec Confidence 999999999999999999999999999999999999999999999999999999999999999999998 445 5667 Q ss_pred eCCCCeecCceee-ecC Q lcl|NC_021559. 240 NRSGTFTVPETIT-GGT 255 (308) Q Consensus 240 n~gg~fts~eTIt-gst 255 (308) ++|++++++|.=- +-. T Consensus 240 ~~G~~~~~~pfd~~~~~ 256 (256) T protein:vir:71 240 NRGKSFESSPFDNDFMD 256 (256) T ss_pred CCCCCCcCCCccccccC Confidence 8888888777321 111 No 6 >protein:vir:98258 Length: 248 # NCBI annotation: gp14 head completion # Family: family:all:1104 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239191;genbank:gi:66391666;genbank:GeneID:3416360 Probab=100.00 E-value=5.7e-111 Score=624.98 Aligned_cols=228 Identities=28% Similarity=0.472 Sum_probs=219.8 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |+|+|++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 20 ~t~~~~~lnPYf----------N~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (248) T protein:vir:98 20 RNLKDQVTNPYV----------NWYKYNPTQQLHDSLTAESIQMKSPDMYYVRREFVNIDKILGEDRESKFTKSWKIAAY 89 (248) T ss_pred hhhhccccccee----------eccccCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+|+||||+|||+ +||||+| T Consensus 90 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~-dPFYQ~G 161 (248) T protein:vir:98 90 IESYANYEGQRDFFSKFGLSSNDEMTLVLNPRLFAHQTDG-------GIPVLGDLVYFPMDNSLFEITWVEA-DPFYQFG 161 (248) T ss_pred eehhhccCCccceeeecCceecceEEEEEccchhhhcCCC-------CCCccccEEEEcCCCceEEEEecCC-CchhhhC Confidence 9999999999999999999999999999999999999987 5999999999999999999999999 5999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEE-E Q lcl|NC_021559. 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIV-I 239 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V-~ 239 (308) |+|||+|+|+||+||+|+|+||++++|+|+++++++|+|+|+++++|++|+++.++++.+++.+||++|. 
.+++| + T Consensus 162 kn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~~iDl~~i~~ldg~~Di~~~~~~e~~~~~~E~~~~~---~~~~vin 238 (248) T protein:vir:98 162 DRPQRKINLAKFIYTGEELAPELQRNEGIHIEPDAELDLEPIRNLDGLADINIEQYEEDKEFEREGDEFI---ESFDVVN 238 (248) T ss_pred CCeEEEEEEEEeeeCCccccccCCCccccCCCCCcchhHHHhhcCccccccCcccccchhhhhhhhhhhh---cccceec Confidence 9999999999999999999999999999999999999999999999999999999999999999999994 44555 5 Q ss_pred eCCCCeecCc Q lcl|NC_021559. 240 NRSGTFTVPE 249 (308) Q Consensus 240 n~gg~fts~e 249 (308) ++|+.++++| T Consensus 239 ~~g~~~~~~~ 248 (248) T protein:vir:98 239 GRGSPFATLP 248 (248) T ss_pred CcCCCccCCC Confidence 6677899988 No 7 >protein:vir:6890 Length: 254 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861866;genbank:gi:32453657;genbank:GeneID:1494292 Probab=100.00 E-value=4.1e-109 Score=614.79 Aligned_cols=235 Identities=30% Similarity=0.503 Sum_probs=222.2 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |+|||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 20 ~t~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (254) T protein:vir:68 20 QTNETEILNPFV----------NFNNYENSQTLADVLVAESIQMRGIECFYVPREYVAVDLIFGEDLKNKFTKAWKFAAY 89 (254) T ss_pred hhhhccccceeE----------EeeccCchhHHHHHHHHHHHHHcCceEEEechhhhccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 
81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+|+|||||||||++||||+| T Consensus 90 l~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (254) T protein:vir:68 90 LNSFEGYEGAKSFFSNFGMQVQDEVTLSINPGLFKHQVNN-------QEPKEGDLIYFPMDNSLFEINWVEPYDPFYQVG 162 (254) T ss_pred eehhhccCCcccchhhcCceecceEEEEEcCchhhhhcCC-------CCCccccEEEEcCCCceEEEeccCCCCchhhhC Confidence 9999999999999999999999999999999999999987 499999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|+||+++||+|+.+...+|+|+|+++++|..|+++.++++.+.+.+||..+ +++..|+| T Consensus 163 Kn~~~~l~ce~F~Ys~E~idt~i~~id~I~~~e~~~ldl~~i~~ldG~~di~~~~~~E~~~~~~e~~~f---~e~~~~vn 239 (254) T protein:vir:68 163 KNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSDLELNPVRNLDGIHDINIDEYSEVEQINSEASEY---VEPYVVVN 239 (254) T ss_pred CceEEEEEEEEEeeCCccccCCCcccCCccCcccCCcchhhHhhhcchhhccccchhhHHHHHhhhhhh---cccceeec Confidence 999999999999999999999999999999999999999999999999999999999999999999876 99999988 Q ss_pred CCCCeecCceeeecC Q lcl|NC_021559. 241 RSGTFTVPETITGGT 255 (308) Q Consensus 241 ~gg~fts~eTItgst 255 (308) +.|+.+||=-=.+-. 
T Consensus 240 ~~g~~~~pf~~~~~~ 254 (254) T protein:vir:68 240 NRGRQNSPFDDGFMN 254 (254) T ss_pred CCCCCCCcccccccC Confidence 877665542111111 No 8 >protein:vir:100535 Length: 253 # NCBI annotation: gp14 head completion protein # Family: family:all:1104 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656376;genbank:gi:109290127;genbank:GeneID:4156513 Probab=100.00 E-value=2.7e-108 Score=610.28 Aligned_cols=233 Identities=30% Similarity=0.477 Sum_probs=217.1 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |++||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 18 ~t~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 87 (253) T protein:vir:10 18 QTNEQNILNPYV----------KFNRYEGSQALHDTLVAESIQMRGLEFYYLEREYTNLDLLFGEDPNSRFEKAWKFAAW 87 (253) T ss_pred hhhhhcccccee----------eccccCchhHHHHHHHHHHHHHcCceEEEcchhhccccCccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++. 
||+|||||||||+|+||||+|||+++||||+| T Consensus 88 l~~~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (253) T protein:vir:10 88 LNSFESYEGQQSFFSKFGHQQNDEVRISINPGLFKYQVNGK-------EPKLGDLIYMPMDNSLFEITWVEPYTPFYQMG 160 (253) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999985 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|+||+++||+|+ +.+++|+|+|+++++|.+|+++.++++.+++.+||+.+ +.+..|++ T Consensus 161 kn~~~~l~ce~F~Ys~E~i~tgi~~id~Ie-~~~~~ldl~~i~~l~G~~Di~~~~~~e~~~~~~e~~~~---v~~~~~~~ 236 (253) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKLAPQFQEKPEIE-DQYNGLDLEPILNLDGFIDQKINEFGENVQAQNEARPF---VEPFDPIS 236 (253) T ss_pred CceEEEEEEEEEecCCccccccCccccccc-chhhhhhhhhhhcCCCccccccccccccchhhhccccc---cccceecc Confidence 999999999999999999999999999999 55689999999999999999999999999999999876 88777754 Q ss_pred CCC--CeecCc-eeeec Q lcl|NC_021559. 241 RSG--TFTVPE-TITGG 254 (308) Q Consensus 241 ~gg--~fts~e-TItgs 254 (308) +.+ +++||= .--|+ T Consensus 237 ~~~~~g~~spf~~~~~~ 253 (253) T protein:vir:10 237 TNPVNSFNSPFGRHEGQ 253 (253) T ss_pred CCCCccccCcccccCCC Confidence 432 444432 11222 No 9 >protein:vir:80995 Length: 246 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469495;genbank:gi:157311452;genbank:GeneID:5602161 Probab=100.00 E-value=3.8e-108 Score=609.47 Aligned_cols=228 Identities=29% Similarity=0.514 Sum_probs=218.0 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -+++++.+|||| |++||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 18 ~~~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 87 (246) T protein:vir:80 18 NTRQTEILNPYV----------NFNSHTNTQTLADIMVAESIQMRGVEMYYIPREFVKPDMIFGEDVQSKFTKAWKFAAY 87 (246) T ss_pred hhccccccccee----------eeCCCCchhhHHHHHHHHHHHHcCceEEEechhhhccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+|+||||+||||++||||+| T Consensus 88 l~s~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (246) T protein:vir:80 88 INSFDGYEGAGNFFQSFGYTANDELTITINPNLFKHQVDN-------KEPKSGDLFYIPMSNDLFEISYVEPYQPFFQAG 160 (246) T ss_pred eehhhccCCccceeeecCceecceEEEEEccchHhhhhCC-------CCCccccEEEEcCCCceEEEecccCCCchhhhC Confidence 9999999999999999999999999999999999999987 499999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|+||+++||+|++++..+|+|+|+++++|+.|+++.++++.+.+.+|++.+ +++..|+| T Consensus 161 Kn~v~~l~ce~F~Ys~E~~~t~i~~~d~I~~de~~~ldl~~i~nldg~~Din~~~~~e~~~~~~e~~~f---~~~~~~~~ 237 (246) T protein:vir:80 161 KNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQYKEDKQFRSEGQDF---IDPFDPIN 237 (246) T ss_pred CCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCcccchhhhhhhcchhhh---cccceeec Confidence 999999999999999999999999999999999999999999999999999999999999999999876 78777766 Q ss_pred CCC-CeecC Q lcl|NC_021559. 241 RSG-TFTVP 248 (308) Q Consensus 241 ~gg-~fts~ 248 (308) +.| .|.-= T Consensus 238 ~~gspf~~~ 246 (246) T protein:vir:80 238 GKGSPFADF 246 (246) T ss_pred CCCCccccC Confidence 554 33322 No 10 >protein:vir:6590 Length: 246 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891721;genbank:gi:33620667;genbank:GeneID:1725307 Probab=100.00 E-value=7.7e-108 Score=607.81 Aligned_cols=228 Identities=30% Similarity=0.527 Sum_probs=217.9 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -+++++.+|||| |+|||.+||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 18 ~~~~~~~lNPYf----------N~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 87 (246) T protein:vir:65 18 NTRQTEILNPYV----------NFHKYTNTQTLADVMVAEAIQMRGVELYYIPREFVKPDMIFGEDVQSKFTKAWKFAAY 87 (246) T ss_pred hhccccccccee----------ecCCCcchhhHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+|+||||||||+++||||+| T Consensus 88 l~~~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (246) T protein:vir:65 88 INSFDGYEGAGNFFSSFGYQANDELTFTVNPNLFKHQVDD-------QEPKSGDLIYIPMSNDLFEINYVEPYQPFFQAG 160 (246) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhcCC-------CCCccccEEEEcCCCcEEEEecccCCCchhhhC Confidence 9999999999999999999999999999999999999986 599999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|+||+++||+|++++..+|+|+|+++++|+.|+++.++++.+.+.+|++.+ +++..|+| T Consensus 161 Kn~v~~l~ce~F~Ys~E~~~~~i~~~d~I~~de~~~ldl~~i~~ldg~~Din~~~~~e~~~~~~e~~~f---~~~~~~~~ 237 (246) T protein:vir:65 161 KNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQYKEDKQFRSEGQDF---IDPFDPIN 237 (246) T ss_pred CCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCccchhhhhhhhcchhhh---cccceeec Confidence 999999999999999999999999999999999999999999999999999999999999999999876 78777766 Q ss_pred CCC-CeecC Q lcl|NC_021559. 241 RSG-TFTVP 248 (308) Q Consensus 241 ~gg-~fts~ 248 (308) +.| .|.-= T Consensus 238 ~~gspf~~~ 246 (246) T protein:vir:65 238 GKGSPFADF 246 (246) T ss_pred CCCCccccC Confidence 554 33322 No 11 >protein:vir:101154 Length: 252 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932505;genbank:gi:37651631;genbank:GeneID:2610647 Probab=100.00 E-value=1.2e-107 Score=606.82 Aligned_cols=233 Identities=30% Similarity=0.461 Sum_probs=218.7 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |++||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 18 ~t~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~ka~~ieaY 87 (252) T protein:vir:10 18 RTNEKNILNPYV----------KFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEKAWKFAAW 87 (252) T ss_pred hhhhhcccccee----------eecCccchhHHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++. ||+|||||||||+|+||||+|||+++||||+| T Consensus 88 l~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (252) T protein:vir:10 88 LNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGK-------EPALGDLIYMPMDNSLFEITWVEPYSPFYQNG 160 (252) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999985 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|+|++++||+|+ +..+.|+|+|+++++|+.|+++.++++.+.+.+||+.+ +++..|++ T Consensus 161 kn~~~~l~ce~F~Ys~E~idt~~~~id~Ie-~~~s~ldl~~i~~ldG~~Di~~~~~~e~~~~~~e~~~~---~e~~~~i~ 236 (252) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKITPVVQEKPEIE-DMYNGLDLAPLLNLDGMIDQKIDQFAENIAVQQKVKQY---AEPFDPIS 236 (252) T ss_pred CCeEEEEEEEEEeeCCceecCcccccCccc-hhhhccchHHHhhcCCeeccccccchhhHHHHHhhhhh---hccceeec Confidence 999999999999999999999999999999 66799999999999999999999999999999999876 88888888 Q ss_pred CCCC--eecCceeeecCCCc Q lcl|NC_021559. 241 RSGT--FTVPETITGGTSSA 258 (308) Q Consensus 241 ~gg~--fts~eTItgsts~A 258 (308) ++|. ++||= .- --| T Consensus 237 ~~~~~~~~~pf--~~--~~~ 252 (252) T protein:vir:10 237 TNSFGNFDSPF--GK--HEA 252 (252) T ss_pred CCCCCCcCCcc--cc--cCC Confidence 8773 33321 10 000 No 12 >protein:vir:101800 Length: 252 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238877;genbank:gi:66391952;genbank:GeneID:3416627 Probab=100.00 E-value=1.2e-107 Score=606.82 Aligned_cols=233 Identities=30% Similarity=0.461 Sum_probs=218.7 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |++||++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 18 ~t~~~~~lNPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~ka~~ieaY 87 (252) T protein:vir:10 18 RTNEKNILNPYV----------KFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEKAWKFAAW 87 (252) T ss_pred hhhhhcccccee----------eecCccchhHHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++. ||+|||||||||+|+||||+|||+++||||+| T Consensus 88 l~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (252) T protein:vir:10 88 LNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGK-------EPALGDLIYMPMDNSLFEITWVEPYSPFYQNG 160 (252) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999985 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|+|++++||+|+ +..+.|+|+|+++++|+.|+++.++++.+.+.+||+.+ +++..|++ T Consensus 161 kn~~~~l~ce~F~Ys~E~idt~~~~id~Ie-~~~s~ldl~~i~~ldG~~Di~~~~~~e~~~~~~e~~~~---~e~~~~i~ 236 (252) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKITPVVQEKPEIE-DMYNGLDLAPLLNLDGMIDQKIDQFAENIAVQQKVKQY---AEPFDPIS 236 (252) T ss_pred CCeEEEEEEEEEeeCCceecCcccccCccc-hhhhccchHHHhhcCCeeccccccchhhHHHHHhhhhh---hccceeec Confidence 999999999999999999999999999999 66799999999999999999999999999999999876 88888888 Q ss_pred CCCC--eecCceeeecCCCc Q lcl|NC_021559. 241 RSGT--FTVPETITGGTSSA 258 (308) Q Consensus 241 ~gg~--fts~eTItgsts~A 258 (308) ++|. ++||= .- --| T Consensus 237 ~~~~~~~~~pf--~~--~~~ 252 (252) T protein:vir:10 237 TNSFGNFDSPF--GK--HEA 252 (252) T ss_pred CCCCCCcCCcc--cc--cCC Confidence 8773 33321 10 000 No 13 >protein:vir:5658 Length: 278 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899597;genbank:gi:34419584;genbank:GeneID:2545699 Probab=100.00 E-value=8.9e-106 Score=596.51 Aligned_cols=244 Identities=28% Similarity=0.463 Sum_probs=213.5 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -+++++.+|||| |++||.+||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 23 ~~~~~~~~NpYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 92 (278) T protein:vir:56 23 QIYNNHLVNPYF----------NWVNHTNEQNLTDMLVAESIINRGVECVYLRREMEKVDLVFGEDPMSKFTQNFRMSLY 92 (278) T ss_pred hhccccccccee----------eccCCCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEE Confidence 667777777777 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+|||||+|+++||||||||++|||||+|||+||++++++. ||||||||||||+|+||||||||+++||||+| T Consensus 93 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 165 (278) T protein:vir:56 93 VESFEGWDGDGDWYSKFGFQVNDEMNVCINPKLFAQQGDGK-------QPLMGDLIYFPLANSLFEISWIEREDPWYMNG 165 (278) T ss_pred eehhhccCCCceeeeecCceecceEEEEEccchhhhcCCCC-------CCccccEEEEcCCCcEEEEEccCCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCC-----CCcccceecCcc----ceeEEeeecccccceecceEeecCCcceeEEEEEecC Q lcl|NC_021559. 161 RNYVWECQCELFEYSDEEINTG-----ITELDAIETAFA----NAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDA 231 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~-----~~~~d~i~~~~~----~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~ 231 (308) |+|||+|+|+||+||+|+|+|| ++++|.|. 
.++ ..|++.++.+++|.+|+++.++++.+.+++||+.| T Consensus 166 k~~~~~l~ce~F~Ys~E~~dtg~pe~~~d~i~~i~-ef~e~~~~~ld~~~~~~l~G~~di~~~~~~e~~~~~~e~~~f-- 242 (278) T protein:vir:56 166 VLPMRKMKMTKFVYSGEEINLEKPEAVIDSIDDIL-NFGEDTDEMIDIDKINALDGRWDIGIEQGAEITQIEDEVDKF-- 242 (278) T ss_pred CceEEEEEEEEeeecCceecccCCccccccccchh-hcccchhhcccccccccccchhcccchhhhhhhhhhhcccee-- Confidence 9999999999999999999988 66666652 232 34899999999999999999999999999999988 Q ss_pred ceeEEEEEeCCCCee---cCceeeecCCCceeEEEEe Q lcl|NC_021559. 232 ATRTLIVINRSGTFT---VPETITGGTSSASWTTATY 265 (308) Q Consensus 232 ~~~~l~V~n~gg~ft---s~eTItgsts~As~~~~s~ 265 (308) +.+.+|++.+|.-. +.....|.-.+.+|.+.+. T Consensus 243 -~~~~~v~~~~~~~~~t~~~n~~~g~~v~~~~~~D~f 278 (278) T protein:vir:56 243 -YESEQVVPSGSDVQPTDPRNATIGFNVNNSNPFDSF 278 (278) T ss_pred -eecCceecCCCCccccCcccccCCCcCccccccccC Confidence 66666766655322 2223344444444444443 No 14 >protein:vir:107937 Length: 257 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595290;genbank:gi:161622596;genbank:GeneID:5783656 Probab=100.00 E-value=1.5e-105 Score=595.30 Aligned_cols=237 Identities=27% Similarity=0.475 Sum_probs=218.4 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -|++++.+|||| |++||.+||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 20 ~t~~~~~lnPYf----------n~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (257) T protein:vir:10 20 NTNETEIMNPFV----------NFYRHENTQTLADALVAESIQMRGIELYYIPREYVNPDQLFGEDLQNKFTKAWKFAGY 89 (257) T ss_pred hhhhhcccccee----------ecccCCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEe Confidence 789999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++. ||+|||||||||+|+||||+|||+++||||+| T Consensus 90 l~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (257) T protein:vir:10 90 LDSFEGYSGDNTYFSKFGMMVNDEVTITINPNLFKHQCNGT-------EPVSGDLIYFPMDNSLFEINWVQPYDPFYQVG 162 (257) T ss_pred eehhhccCCCcceeeecCceecceEEEEEccchhhhhccCC-------CCccccEEEEcCCCceEEEecccCCCchhhhC Confidence 99999999999999999999999999999999999999985 99999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEEEEEe Q lcl|NC_021559. 
161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTLIVIN 240 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l~V~n 240 (308) |+|||+|+|+||+||+|+|.|+++++++|.+++..+|+|+++++++|+.|+++.++++.+++.+|+..+ +.+..|+| T Consensus 163 kn~~~~l~ce~F~Ys~E~l~pel~~n~~~~V~e~~eldl~~~~~ldG~~di~~~~~~E~~~~~~e~~~f---i~p~~~~n 239 (257) T protein:vir:10 163 TNVQRRITATKFIYNGEELRPELQRNEGINIPEFSELDLMPVKNIDGLADISDIQYEEVNEINAEAAEF---VHPYVVIN 239 (257) T ss_pred CceEEEEEEEEeeeCCcccccccCCcccCCCCCccchhhhhhhhccchhhcCCchhhhHHHHHHhhhhh---hccccccC Confidence 999999999999999999999999999999999999999999999999999999999999999999765 88888875 Q ss_pred CCCC-eecCceeeecCCCceeEEEEeeeeecCCC Q lcl|NC_021559. 241 RSGT-FTVPETITGGTSSASWTTATYNTIDNQNT 273 (308) Q Consensus 241 ~gg~-fts~eTItgsts~As~~~~s~~tl~~~~~ 273 (308) +.|. ..+.|-= . +.++. T Consensus 240 ~~g~~~~~~pf~---------------~-~~~~~ 257 (257) T protein:vir:10 240 GRGEDAPPTAFD---------------D-AFLDD 257 (257) T ss_pred CCCCCCCCCccc---------------c-hhccC Confidence 5543 3221100 0 00000 No 15 >protein:vir:106285 Length: 262 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944101;genbank:gi:38640145;genbank:GeneID:2658033 Probab=100.00 E-value=4.6e-104 Score=587.11 Aligned_cols=242 Identities=24% Similarity=0.428 Sum_probs=213.0 Q ss_pred CccccccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeec Q lcl|NC_021559. 
1 MAIKNTPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAY 80 (308) Q Consensus 1 ~~~~~~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY 80 (308) -+++++.+|||| |+++|++||+|+|+||+|||||||+|||||||++|++|+||+|+++|||++||+|||| T Consensus 20 ~~~~~~vlNPYf----------N~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~gEd~~SkF~ka~~ieaY 89 (262) T protein:vir:10 20 KTYQTNVLNPYV----------NKHEYEPTLSLHEMLVAESIQMTGVEMYYIRREFVNFDRIFGEDMQSKFKKTYKVAMY 89 (262) T ss_pred Ccchhcccccee----------ccCCcCchhhHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEe Confidence 788999999999 8999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhcccCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhC Q lcl|NC_021559. 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) Q Consensus 81 ~~n~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~g 160 (308) |+||+||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+|+||||+|||+++||||+| T Consensus 90 l~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPl~nsLFEI~~VE~~~PFYQ~G 162 (262) T protein:vir:10 90 LESFDEYSGQRDFFSKFGMQVNDEITMSVSPKLFETQADG-------DRVKEGDLIYFPLNNSLFEVTWVEPSSPVVKRE 162 (262) T ss_pred eehhhccCCccceeeecCceecceEEEEEccchhhhhhcC-------CCCccccEEEEcCCCceEEEeeccCCCchhhhC Confidence 9999999999999999999999999999999999999997 499999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeecccccceecceEeecCCcceeEEEEEecCceeEE-EEE Q lcl|NC_021559. 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTAGETITGGTSNVTAEVKSFDAATRTL-IVI 239 (308) Q Consensus 161 k~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~~g~G~fdi~E~vtGe~s~atAev~~~d~~~~~l-~V~ 239 (308) |+|||+|+|+||+||+|+|+|++++||+|....+ +|.+++++||+.|+++.++++...+.+|+..+ +... 
+|+ T Consensus 163 kn~~~~l~ce~F~Ys~E~i~~~i~~id~i~~e~~---~l~~i~~lDg~~di~~~q~~e~~~~~~e~~~f---v~~~d~v~ 236 (262) T protein:vir:10 163 QLAKYKVTAQKFIYSGEEIKPEFDPNRYVLGEDD---PLSQIKALDGRADISLDEFAEDDAFNEEAEDF---VVEFDNII 236 (262) T ss_pred CceEEEEEEEEEeeCCccccccCccccccccccc---cccccccccceeecccccccchhHHhhhhhhh---cchhcccC Confidence 9999999999999999999999999999986543 79999999999999999999999999999876 4444 445 Q ss_pred eCCCCeecCceeeecCCCceeEEEEeeeeecCCCcccCcceeecC Q lcl|NC_021559. 240 NRSGTFTVPETITGGTSSASWTTATYNTIDNQNTEFDQNNDFETL 284 (308) Q Consensus 240 n~gg~fts~eTItgsts~As~~~~s~~tl~~~~~~~~~n~~~e~~ 284 (308) ++|+.|.-- -..+-+.+..++ ++|+. T Consensus 237 ~~gsp~~~~------~~~~~~~~~~fd-------------d~~~~ 262 (262) T protein:vir:10 237 GNGTPIAEH------KPTKPAPVSAFD-------------DLESF 262 (262) T ss_pred CCCCccccc------CCCCCCCCChhh-------------hhhcC Confidence 555555320 001100011111 22222 No 16 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=100.00 E-value=1.3e-97 Score=551.70 Aligned_cols=283 Identities=36% Similarity=0.668 Sum_probs=214.6 Q ss_pred ccCCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCccccccccccccceeeeeeccchhc Q lcl|NC_021559. 
6 TPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSDGKFESAKAIRAYVNNVE 85 (308) Q Consensus 6 ~~~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~skF~~a~~i~aY~~n~e 85 (308) -|+|||| | |||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||||+||| T Consensus 1 ~~~~~~~----------~-~~~~~~~~~~~~~~~e~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~ 69 (470) T protein:vir:10 1 MALNPFF----------L-QGTSSEQRLTQDLINEHLKIYGVEVTYIPRKYVNTKSIIEEVQSSKFDDNFAIEAYVNTYE 69 (470) T ss_pred Cccccee----------E-cCCCchhHHHHHHHHHHhHhccceEEEechhhcccccccccccccccccceeEEEEeeccc Confidence 6899999 3 9999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCccchhhhhcCceeeeeEEEEEccchhhhhhCccc-------cC----CCCCCCccccEEEEcCCCcEEEEEeeecCC Q lcl|NC_021559. 86 GWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSV-------TL----NVEGRPNEGDLIWFPITKHLFEIKFVEVER 154 (308) Q Consensus 86 g~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~-------~~----~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~ 154 (308) ||+|+++||||||||++|||+|+|||+||+..|+... .+ ....||+|||||||||+|+||||+|||++. T Consensus 70 ~~~~~~~~~~~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~ 149 (470) T protein:vir:10 70 GYGGQGDVLTKFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNEEVILATRPREGDLVFFPLGSRLFEVKFVEHED 149 (470) T ss_pred CcCCcceeeeecCcccceEEEEEECCccccccccchhhcccCcccccccccccCCcccccEEEecCCCCEEEEEecCCCC Confidence 9999999999999999999999999999998887432 22 233699999999999999999999999999 Q ss_pred hhHhhCCceEEEEEEEEEecCCceecCCCCcccceecCccceeEEeeec-----------------------ccccceec Q lcl|NC_021559. 155 PFYQLGRNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVA-----------------------GGTGTFTA 211 (308) Q Consensus 155 pFyQ~gk~~v~~l~c~~F~ys~E~i~t~~~~~d~i~~~~~~~L~L~~i~-----------------------~g~G~fdi 211 (308) ||||+||+++|+|+|++|+|++|.+++++..++......++.....+.. .++|+... 
T Consensus 150 p~~~~G~~~~~~it~~~f~ysge~~s~~v~~~~~~~~~~g~~~t~t~~~~g~~~~~t~~~~~g~vt~ititn~Gsgyt~~ 229 (470) T protein:vir:10 150 PFYQLGKNYVYQLKCELFEYEDEVIDTSIDAIDTVVQDDGYISKLQLVGIGRTAEVAASIGVGYVREIFLNNDGSGFTSP 229 (470) T ss_pred cchhcCcceeEEeeeceeEecCCccccceecccccccccccceeeeecCCCccceeeeeecceeeeEeEeeccccceecc Confidence 9999999999999999999999999999999998876655554433322 22222111 Q ss_pred ceEee-cC---------------------------------------------C--cceeEEEEEecCceeEEEEEeCCC Q lcl|NC_021559. 212 GETIT-GG---------------------------------------------T--SNVTAEVKSFDAATRTLIVINRSG 243 (308) Q Consensus 212 ~E~vt-Ge---------------------------------------------~--s~atAev~~~d~~~~~l~V~n~gg 243 (308) +.... +. . ..+++.+.........+.+.+.|+ T Consensus 230 ptVti~~~~~~~~~~a~~~~~t~~~~g~vt~ititn~Gsgytt~ptvt~~~~~g~ga~at~~~~~~~~g~~~itit~~Gs 309 (470) T protein:vir:10 230 PTITFSASPAFTDARAVGILTTRANVTSIEKILMTSAGAGYITPPTITISGGGGTGAAATCSIETVYQGVVNFNVVDGGV 309 (470) T ss_pred CEEEEccCCCCCCccceeeEeecceeeEEEEEEEecCcccccccceEEEccCCCccceeeeeecccccceeeEEEccCCc Confidence 10000 00 0 000111111122222344444444 Q ss_pred CeecCceeeec--------------------------------------------------------------------- Q lcl|NC_021559. 244 TFTVPETITGG--------------------------------------------------------------------- 254 (308) Q Consensus 244 ~fts~eTItgs--------------------------------------------------------------------- 254 (308) +|+++|+|+.. T Consensus 310 gYtt~ptvtit~~~sg~~a~~~a~~~~~~~~g~itsititn~Gsgyts~ptv~i~~~~~~~~~~t~~~~~~~tg~tsgt~ 389 (470) T protein:vir:10 310 GYGTEPSIAVTQPGAGTTAVGIASIGMAGSDQVLKSVYIGNPGRGYTATPNVIVADPPSMSGIGTFTFNEVIKGSRSGTE 389 (470) T ss_pred cccccceEEEecCCCCCcccceeEEEeecccceeeeEEeccCCcceeccceeEeecCccccccceeeeeeeeecccccee Confidence 55544433211 Q ss_pred ------------------------------------CCCceeEEEEeeeeecCCCcccCcceeecCCCceEecccCCcce Q lcl|NC_021559. 
255 ------------------------------------TSSASWTTATYNTIDNQNTEFDQNNDFETLDNQIIDFTESNPFG 298 (308) Q Consensus 255 ------------------------------------ts~As~~~~s~~tl~~~~~~~~~n~~~e~~~D~i~dfte~Npfg 298 (308) ++++...+.+.+ ...+++++.++..|++.+|+||||+|+|||| T Consensus 390 ~~~~~~~~~t~~~~v~~~~~~~~~~~~~~g~tvt~~~~~a~~~~~s~t-~~~~~~~~ts~~~i~t~~~~i~~~~~~np~~ 468 (470) T protein:vir:10 390 ARVKSWDDDTKILLVSNVGIGSTVSGFYTGESIVGQESGASYALGSYN-SDDANDKYNDGDEFEFNADQILDFTESNPFG 468 (470) T ss_pred eeeeeecccceeeeecccceecccceeeeeeeEEeccccceeeEEEec-ccccCceeeccceeeccCCcEEeeeecCCCC Confidence 111111111111 1444557899999999999999999999999 Q ss_pred ee Q lcl|NC_021559. 299 SV 300 (308) Q Consensus 299 ~~ 300 (308) .+ T Consensus 469 ~~ 470 (470) T protein:vir:10 469 NF 470 (470) T ss_pred CC Confidence 88 No 17 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=26.56 E-value=1.9 Score=19.01 Aligned_cols=121 Identities=18% Similarity=0.194 Sum_probs=76.5 Q ss_pred CCCccccccccceeEEeccccchhHHHHHHHHHHHHhcCccEEEeeeeeeccCcccccccc-ccccceeeeeeccchhcc Q lcl|NC_021559. 8 AQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNKDSVFEEDSD-GKFESAKAIRAYVNNVEG 86 (308) Q Consensus 8 ~~py~~~~y~~~~~~n~~~~~~eQ~L~~~Lv~E~Iqm~G~dv~YlpR~~v~~D~v~~E~~~-skF~~a~~i~aY~~n~eg 86 (308) -+ .| . .|. ....++|+-+|++|.+.+..-..-|+-.++... ..-...|...+.+.++.- T Consensus 1 M~-----~y-----------~---~~~-~~a~~Li~kfG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~ 60 (122) T protein:vir:97 1 MA-----RF-----------D---SAI-ALAKKLIKKNGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQ 60 (122) T ss_pred Cc-----cc-----------h---HHH-HHHHHHHHHhCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccch Confidence 11 11 1 222 255667777999999888776666766665322 333577999999988875 Q ss_pred cCccchhhhhcCceeeeeEEEEEccchhhhhhCccccCCCCCCCccccEEEEcCCCcEEEEEeeecCChhHhhCCceEEE Q lcl|NC_021559. 
87 WEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLGRNYVWE 166 (308) Q Consensus 87 ~~g~~~~~sKFG~~~~De~t~~is~~~f~~~v~~~~~~~~~~~P~eGDLIyfP~~~~lfEI~~Ve~~~pFyQ~gk~~v~~ 166 (308) -.=+|.+ |+..|..-+...... ...|+-||+|-+ +..-|.|-.|++-+| .|-.-.|+ T Consensus 61 ~~idGtl-----I~~GD~~l~~~a~~~-------------~~~P~~gD~v~~--~g~~~~Vi~v~~i~p---a~~~v~y~ 117 (122) T protein:vir:97 61 RYIDGQT-----IRMGDQRVFMPAEGL-------------TAPPEVEGLVLR--GLEVWKVIAVKPLNP---NGQAIMYE 117 (122) T ss_pred hhccCcE-----EeecCEEEEEeeCCC-------------ccccccCCEEEe--CCEEEEEEeccccCC---CCceEEEE Confidence 4433222 455565444332211 248889999966 556789999977655 55566777 Q ss_pred EEEEE Q lcl|NC_021559. 167 CQCEL 171 (308) Q Consensus 167 l~c~~ 171 (308) |...+ T Consensus 118 lqlRk 122 (122) T protein:vir:97 118 LQVRQ 122 (122) T ss_pred EEeeC Confidence 77777 Done!