Query lcl|NC_019444.1_cdsid_YP_007001815.1 [protein=gp14 neck protein] [protein_id=YP_007001815.1] [location=93108..94568] Match_columns 486 No_of_seqs 181 out of 521 Neff 8.1 Searched_HMMs 1612 Date Thu Nov 7 17:48:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_89 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_89_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:103005 Length: 390 100.0 3E-121 2E-124 681.1 28.7 390 1-486 1-390 (390) 2 protein:vir:104476 Length: 308 100.0 2E-111 1E-114 627.0 20.8 308 1-486 1-308 (308) 3 protein:vir:104739 Length: 470 100.0 4E-102 2E-105 576.5 31.0 429 16-478 1-470 (470) 4 protein:vir:106986 Length: 292 100.0 5.6E-98 3E-101 553.8 20.3 290 16-485 1-292 (292) 5 protein:vir:103452 Length: 256 100.0 1.5E-93 9.1E-97 529.5 15.6 246 1-256 10-256 (256) 6 protein:vir:7199 Length: 256 # 100.0 1.7E-93 1E-96 529.2 15.2 246 1-256 10-256 (256) 7 protein:vir:5658 Length: 278 # 100.0 4E-92 2.5E-95 521.7 14.8 262 1-276 13-278 (278) 8 protein:vir:100535 Length: 253 100.0 1.9E-91 1.2E-94 517.9 14.5 246 1-258 8-253 (253) 9 protein:vir:98258 Length: 248 100.0 2.1E-91 1.3E-94 517.7 13.1 239 1-251 10-248 (248) 10 protein:vir:6890 Length: 254 # 100.0 5.5E-91 3.4E-94 515.4 12.7 244 1-268 10-254 (254) 11 protein:vir:101154 Length: 252 100.0 2.1E-90 1.3E-93 512.3 13.4 245 1-257 8-252 (252) 12 protein:vir:101800 Length: 252 100.0 2.1E-90 1.3E-93 512.3 13.4 245 1-257 8-252 (252) 13 protein:vir:80995 Length: 246 100.0 4E-90 2.5E-93 510.7 13.6 238 1-255 8-246 (246) 14 protein:vir:6590 Length: 246 # 100.0 3.9E-90 2.4E-93 510.7 13.4 238 1-255 8-246 (246) 15 protein:vir:107937 Length: 257 100.0 6.6E-90 4.1E-93 509.5 11.9 247 1-269 10-257 (257) 16 protein:vir:106285 Length: 262 100.0 6.8E-89 4.2E-92 504.0 13.3 241 1-255 10-262 (262) 17 protein:vir:104739 Length: 470 98.0 6.5E-06 4E-09 49.0 20.6 427 1-459 1-470 (470) 18 protein:vir:103005 Length: 390 95.9 0.0013 8.4E-07 36.3 14.7 341 1-374 14-390 (390) 19 protein:vir:97237 Length: 122 72.7 0.18 0.00011 24.7 11.2 121 24-171 1-122 (122) 20 protein:vir:81177 Length: 109 61.9 0.34 0.00021 23.1 8.2 105 44-177 1-109 (109) 21 protein:vir:1385 Length: 107 # 45.3 0.78 0.00048 21.2 9.0 106 44-173 1-107 (107) 22 protein:vir:4343 Length: 118 # 40.1 0.98 0.00061 20.6 7.7 110 44-177 1-118 (118) 23 protein:vir:1890 Length: 110 # 39.6 1 0.00063 20.6 8.1 106 44-176 1-110 (110) 24 protein:vir:7411 Length: 116 # 30.2 1.6 0.00099 19.5 6.9 101 52-173 1-116 (116) 25 protein:vir:100244 Length: 109 30.1 1.6 0.00099 19.5 8.4 107 41-176 1-109 (109) No 1 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=100.00 E-value=3.3e-121 Score=681.09 Aligned_cols=390 Identities=88% Similarity=1.317 Sum_probs=308.4 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) |+|||+|+|||+|++|++|||||||||.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+|||+|||| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (390) T protein:vir:10 1 MTYSNDPPNNCIQSDYTSSCRLNLNGSAQEQTFMENLIVESIELYGQNVYYLPRIYVNRDTILNEVETSRFEQALSVRAY 80 (390) T ss_pred CeecCCCcccceecceeeccEEEEeccCchhHHHHHHHHHHhHhcCceEEEechheeccccccccccccccccceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||+|+||||||||++++++++.||+||||||||||||+++||||+||||++||||+| T Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~~~~~~~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~p~yq~G 160 (390) T protein:vir:10 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRKKFTTAVDDNAVLNVEGRPNEGDLIWFPATRHLFEIKFVEAERPFYQLG 160 (390) T ss_pred eechhccCCccceeeecCceecceEEEEECCcchhhhhCCcccccccCCCCCCceEEecCCCCEEEEEecCCCCCceEcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEec Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTD 240 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~ 240 (486) ++|+|+++|++|+|+++.++.....++.+...............+................+.+...+..+.+..+++++ T Consensus 161 ~nyt~~i~a~lf~ySge~iat~~seid~I~~~~~~~v~~~~~t~g~~~~t~~~~v~~~g~ga~~~a~v~~g~Vt~vtItn 240 (390) T protein:vir:10 161 KGYVWECQCELFEYSDEDLDTGVAEIDAIETAFANAIKLVMDAGGTGAFTVGEEIVGDLYLATATATISGDAVDAVTVTD 240 (390) T ss_pred CceeeeeEEeeeccCCccccccccccccccccccceeeeeeccCCcccccccceeeecCcceeEEEEecCCeEEEEEEee Confidence 99999999999999999999999888777666555555555554444444444444444455555555566666777777 Q ss_pred CCccccccccceeecCCCCccceeeeeeecccceeeecCCCCcccccceeEcCCCCCcccceeeeeccCCcceeeeccce Q lcl|NC_019444. 241 GGEYYKSALPPTVTISDPPASGGITAFSTSTDTGSYSNPTGTGYTVGTYTTTNLTGTGSGAIATVSGVNGSGGLTVSGGD 320 (486) Q Consensus 241 ~Gsg~t~t~~~tVtis~~~~~~~~~~~~t~t~~~~~~~~~gsg~t~~~~~t~~~~~~~~~~t~~~~~~~~~~~~t~t~gt 320 (486) .|++|+.+.+++|++.+++..+ T Consensus 241 ~GsGYt~~~~ptVtisgg~gtg---------------------------------------------------------- 262 (390) T protein:vir:10 241 GGEHYKSALPPTVTITGGGGSG---------------------------------------------------------- 262 (390) T ss_pred CCCCcccCceeEEEecCCCCcc---------------------------------------------------------- Confidence 7777766655555543221111 Q ss_pred eeeeccCCccceecceeEecCCCCcCccceeecCccccceeeeEecCCCccceeeecCCCccccccceeEeecCCCCcee Q lcl|NC_019444. 321 LSLTNLTGTGYSTGDVLRINGGDDNAHIRVVSVGITSPASATATVSSAGIVTDITITSGGTGYTSVPTVTIDYSPKDSRA 400 (486) Q Consensus 321 ~t~t~~~gsg~t~~~tv~~~g~~~~~~~~~~~~~~~~~at~t~t~~~~g~~~~vtv~~~Gsg~t~~~~vtvt~~~~gtt~ 400 (486) +.+.+++...|.+..+++.++|+||+..|.+++.++..+..+ T Consensus 263 --------------------------------------At~tatv~~~G~VtsItItn~GsGYt~~PtVtI~g~g~~~~a 304 (390) T protein:vir:10 263 --------------------------------------ATATATVSSAGIVTGITITSGGTGYTSAPTVTIDYSPKDNRA 304 (390) T ss_pred --------------------------------------ceeeeeecccceEEEEEEecCCccccCCCEEEEeCCCCCcee Confidence 111122223344555566666666666666666666555555 Q ss_pred EEEeeccCCceeEEEecCcccccCceeccCCccccccccccccccccccccccccCceeeecCCceEEeeeccccceecc Q lcl|NC_019444. 401 EVKSWNASTRELQVINRTGTFNTAETITGLTSGARWSPESYNTLNNTNTADSIDQNYSFETADDDIIDFTEGNPFGSIGS 480 (486) Q Consensus 401 ~~~~~~~~~~~~~v~~~tgt~t~~~~~tg~ts~a~~~~~~~~t~~~t~t~~~~~~~~~~~t~~~~~~~~t~~np~G~~g~ 480 (486) .....++....+.+.+...+++.+..+++..+++.........+.+.......+......+.++.+++++.++|+|.+++ T Consensus 305 ~~~~~~g~v~~i~Itn~GsgYtt~p~vt~~~~G~~~~~~~~~t~~~~~~~~~~~~~~~~~t~~~~ii~~t~gn~~g~v~n 384 (390) T protein:vir:10 305 EVKSWNASTRELQVINRTGTFNTAEVITGLTSGAKWSPESYNTLNNTNTADTIDQNYSFETADDDIIDFTEVNPFGNIGS 384 (390) T ss_pred EEEEeccEEEEEEEecCCcceeeccEEEEecCCcceEEEEEEecccceeeeeecccceeEeCCCceEeecccCccccccc Confidence 55555555555555566556655555555555554444444555666666666667778889999999999999999999 Q ss_pred cccccC Q lcl|NC_019444. 481 VTDTTI 486 (486) Q Consensus 481 ~~~~t~ 486 (486) .+.++| T Consensus 385 ~T~t~v 390 (390) T protein:vir:10 385 TTDTTI 390 (390) T ss_pred ceeccC Confidence 999999 No 2 >protein:vir:104476 Length: 308 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214650;genbank:gi:61806291;genbank:GeneID:3294531 Probab=100.00 E-value=2.4e-111 Score=627.04 Aligned_cols=308 Identities=66% Similarity=1.093 Sum_probs=264.2 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) |+|+|+|...|.|++|.+|||||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+|||+|||| T Consensus 1 ~~~~~~~~~py~~~~~~~~~~~n~~~~~~eQ~L~e~LV~EsIqm~G~dvyYlpRe~v~~D~i~~Ed~~skF~~a~~ieaY 80 (308) T protein:vir:10 1 MAIQNSPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNRDSVFEEDSDGKFESAKAIRAY 80 (308) T ss_pred CccccCCCCCcccccccccceEEEeccCchhHHHHHHHHHHHHhcCceEEEechhhcccccccccccccccccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||+|+|||+|||+++++.++|.+++||||||||||||+++||||+|||++.||||+| T Consensus 81 ~~~~egy~g~~~~~SKFG~~~~DE~t~~is~~rF~~~v~~~~~~~~~~rP~EGDLIYfPl~~~lFEI~~VE~~~PFyQ~G 160 (308) T protein:vir:10 81 VNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFEIKFVEVERPFYQLG 160 (308) T ss_pred eechhccCCCcceeeecCceecceEEEEEccchhhhhcCCccccccCCCCccccEEEecCCCceEEEEcccCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEec Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTD 240 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~ 240 (486) |+|+|+|+|++|+|++|.+++++++++.+.......+...+..++.+....+....+.... T Consensus 161 k~~~~~l~ce~F~Ys~E~~~~~i~~iD~i~~~~~~~LdL~pIs~l~G~fdInE~v~gest~------------------- 221 (308) T protein:vir:10 161 RNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTVGETITGGTSN------------------- 221 (308) T ss_pred CceEEEEEEEEEeeCCcccccCCccccccccccccceeeeeeccCCccccccceecccccc------------------- Confidence 9999999999999999999999999998887665554444444333322111111110000 Q ss_pred CCccccccccceeecCCCCccceeeeeeecccceeeecCCCCcccccceeEcCCCCCcccceeeeeccCCcceeeeccce Q lcl|NC_019444. 241 GGEYYKSALPPTVTISDPPASGGITAFSTSTDTGSYSNPTGTGYTVGTYTTTNLTGTGSGAIATVSGVNGSGGLTVSGGD 320 (486) Q Consensus 241 ~Gsg~t~t~~~tVtis~~~~~~~~~~~~t~t~~~~~~~~~gsg~t~~~~~t~~~~~~~~~~t~~~~~~~~~~~~t~t~gt 320 (486) T Consensus 222 -------------------------------------------------------------------------------- 221 (308) T protein:vir:10 222 -------------------------------------------------------------------------------- 221 (308) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeeeccCCccceecceeEecCCCCcCccceeecCccccceeeeEecCCCccceeeecCCCccccccceeEeecCCCCcee Q lcl|NC_019444. 321 LSLTNLTGTGYSTGDVLRINGGDDNAHIRVVSVGITSPASATATVSSAGIVTDITITSGGTGYTSVPTVTIDYSPKDSRA 400 (486) Q Consensus 321 ~t~t~~~gsg~t~~~tv~~~g~~~~~~~~~~~~~~~~~at~t~t~~~~g~~~~vtv~~~Gsg~t~~~~vtvt~~~~gtt~ 400 (486) ..+ T Consensus 222 -----------------------------------------------------------------------------itA 224 (308) T protein:vir:10 222 -----------------------------------------------------------------------------VTA 224 (308) T ss_pred -----------------------------------------------------------------------------eEE Confidence 023 Q ss_pred EEEeeccCCceeEEEecCcccccCceeccCCccccccccccccccccccccccccCceeeecCCceEEeeeccccceecc Q lcl|NC_019444. 401 EVKSWNASTRELQVINRTGTFNTAETITGLTSGARWSPESYNTLNNTNTADSIDQNYSFETADDDIIDFTEGNPFGSIGS 480 (486) Q Consensus 401 ~~~~~~~~~~~~~v~~~tgt~t~~~~~tg~ts~a~~~~~~~~t~~~t~t~~~~~~~~~~~t~~~~~~~~t~~np~G~~g~ 480 (486) ++..|+.....+.+.+.+++++++..++|+++++.++..+....+++. ..++.+..+++.++.++||+++||||..|+ T Consensus 225 Ev~~wds~v~~ItV~N~gGsftspptItGsts~a~~t~~s~~~~~nt~--~~~~~n~~fet~~D~iiDftE~NPFG~~g~ 302 (308) T protein:vir:10 225 EVKSFDASTRTLIVINRSGTFTVPETVTGGTSSASWTTATYNTIDNQN--LDYDQNNDFETLDNQIIDFTEANPFGSVGS 302 (308) T ss_pred EEEEecCCceEEEEEeCCCceeeCcEEEeccCCceeEEEeeeecccCC--CcccCCcceeeccCcEEeeccCCCCccccc Confidence 333444444455566666666667777777777777777777666655 357788899999999999999999999999 Q ss_pred cccccC Q lcl|NC_019444. 481 VTDTTI 486 (486) Q Consensus 481 ~~~~t~ 486 (486) ++++|| T Consensus 303 ~t~~~~ 308 (308) T protein:vir:10 303 ITDNTI 308 (308) T ss_pred cccccC Confidence 999999 No 3 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=100.00 E-value=3.9e-102 Score=576.52 Aligned_cols=429 Identities=34% Similarity=0.538 Sum_probs=290.4 Q ss_pred ccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeeeecchhhcCCcccchh Q lcl|NC_019444. 16 YTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAYVNNVEGWEGQGDLLS 95 (486) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~ 95 (486) -.+||||| |+|.+||+|+|+||+|||||||+||||||||+|++|+||+||++|||+|||||||||+|||||+|++|||+ T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~e~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~ 79 (470) T protein:vir:10 1 MALNPFFL-QGTSSEQRLTQDLINEHLKIYGVEVTYIPRKYVNTKSIIEEVQSSKFDDNFAIEAYVNTYEGYGGQGDVLT 79 (470) T ss_pred CcccceeE-cCCCchhHHHHHHHHHHhHhccceEEEechhhcccccccccccccccccceeEEEEeecccCcCCcceeee Confidence 77899996 99999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCceecceEEEEEcccceehhhccceeeccc-----------CCCCcCCEEEEccCCceEEEEEeecCCCceecCceeE Q lcl|NC_019444. 96 KFGVRIEDKTTFIFSRSKFTEKVDDNAALNVE-----------GRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYV 164 (486) Q Consensus 96 ~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-----------~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~ 164 (486) |||||.+|||+|+|+|+|||+.++++++..|| .||||||||||||++|||||||||+++||||+||+++ T Consensus 80 ~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~p~~~~G~~~~ 159 (470) T protein:vir:10 80 KFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNEEVILATRPREGDLVFFPLGSRLFEVKFVEHEDPFYQLGKNYV 159 (470) T ss_pred ecCcccceEEEEEECCccccccccchhhcccCcccccccccccCCcccccEEEecCCCCEEEEEecCCCCcchhcCccee Confidence 99999999999999999999999998866654 5799999999999999999999999999999999999 Q ss_pred EEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEecCCcc Q lcl|NC_019444. 165 WECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTDGGEY 244 (486) Q Consensus 165 ~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~~Gsg 244 (486) |+++|++|+|+++..+.....+................. .............+.+..+.+.+.|.+ T Consensus 160 ~~it~~~f~ysge~~s~~v~~~~~~~~~~g~~~t~t~~~--------------~g~~~~~t~~~~~g~vt~ititn~Gsg 225 (470) T protein:vir:10 160 YQLKCELFEYEDEVIDTSIDAIDTVVQDDGYISKLQLVG--------------IGRTAEVAASIGVGYVREIFLNNDGSG 225 (470) T ss_pred EEeeeceeEecCCccccceecccccccccccceeeeecC--------------CCccceeeeeecceeeeEeEeeccccc Confidence 999999999999999998877665543332222111111 111122233344566788899999998 Q ss_pred ccccccceeecCCCCccceeeeeee-------cccceeeecCCCCcccccceeEcCCCCCcccceeeeeccCCcceeeec Q lcl|NC_019444. 245 YKSALPPTVTISDPPASGGITAFST-------STDTGSYSNPTGTGYTVGTYTTTNLTGTGSGAIATVSGVNGSGGLTVS 317 (486) Q Consensus 245 ~t~t~~~tVtis~~~~~~~~~~~~t-------~t~~~~~~~~~gsg~t~~~~~t~~~~~~~~~~t~~~~~~~~~~~~t~t 317 (486) |.. +|+|++.++.......+... .........+.+++|+..+.+.....................+ T Consensus 226 yt~--~ptVti~~~~~~~~~~a~~~~~t~~~~g~vt~ititn~Gsgytt~ptvt~~~~~g~ga~at~~~~~~~~g----- 298 (470) T protein:vir:10 226 FTS--PPTITFSASPAFTDARAVGILTTRANVTSIEKILMTSAGAGYITPPTITISGGGGTGAAATCSIETVYQG----- 298 (470) T ss_pred eec--cCEEEEccCCCCCCccceeeEeecceeeEEEEEEEecCcccccccceEEEccCCCccceeeeeecccccc----- Confidence 855 57888776554332222111 1223344567788998888777665443222111111111100 Q ss_pred cceeeeeccCCccceecceeEecCCCCcCccceeecCccccceeeeEecCCCccceeeecCCCccccccceeEeecCCCC Q lcl|NC_019444. 318 GGDLSLTNLTGTGYSTGDVLRINGGDDNAHIRVVSVGITSPASATATVSSAGIVTDITITSGGTGYTSVPTVTIDYSPKD 397 (486) Q Consensus 318 ~gt~t~t~~~gsg~t~~~tv~~~g~~~~~~~~~~~~~~~~~at~t~t~~~~g~~~~vtv~~~Gsg~t~~~~vtvt~~~~g 397 (486) .........+.+|+..|++.+.......... ...........+.+..+++.++|++++..+.+.+...... T Consensus 299 -~~~itit~~GsgYtt~ptvtit~~~sg~~a~--------~~a~~~~~~~~g~itsititn~Gsgyts~ptv~i~~~~~~ 369 (470) T protein:vir:10 299 -VVNFNVVDGGVGYGTEPSIAVTQPGAGTTAV--------GIASIGMAGSDQVLKSVYIGNPGRGYTATPNVIVADPPSM 369 (470) T ss_pred -eeeEEEccCCccccccceEEEecCCCCCccc--------ceeEEEeecccceeeeEEeccCCcceeccceeEeecCccc Confidence 0011222356788888888776543322111 1222334445677888899999999998888776543322 Q ss_pred ceeEE------------------EeeccCCceeEEEec-----CcccccCceeccCCccccccccccccccccccccccc Q lcl|NC_019444. 398 SRAEV------------------KSWNASTRELQVINR-----TGTFNTAETITGLTSGARWSPESYNTLNNTNTADSID 454 (486) Q Consensus 398 tt~~~------------------~~~~~~~~~~~v~~~-----tgt~t~~~~~tg~ts~a~~~~~~~~t~~~t~t~~~~~ 454 (486) ..... ..+........+... ...+..+..+++..+++. ....+.........+. T Consensus 370 ~~~~t~~~~~~~tg~tsgt~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~g~tvt~~~~~a~---~~~~s~t~~~~~~~~t 446 (470) T protein:vir:10 370 SGIGTFTFNEVIKGSRSGTEARVKSWDDDTKILLVSNVGIGSTVSGFYTGESIVGQESGAS---YALGSYNSDDANDKYN 446 (470) T ss_pred cccceeeeeeeeeccccceeeeeeeecccceeeeecccceecccceeeeeeeEEeccccce---eeEEEecccccCceee Confidence 11100 000100000000000 000111111111111111 1122233334444556 Q ss_pred cCceeeecCCceEEeeecccccee Q lcl|NC_019444. 455 QNYSFETADDDIIDFTEGNPFGSI 478 (486) Q Consensus 455 ~~~~~~t~~~~~~~~t~~np~G~~ 478 (486) ....+.+.++++++++++||+|+. T Consensus 447 s~~~i~t~~~~i~~~~~~np~~~~ 470 (470) T protein:vir:10 447 DGDEFEFNADQILDFTESNPFGNF 470 (470) T ss_pred ccceeeccCCcEEeeeecCCCCCC Confidence 677789999999999999999999 No 4 >protein:vir:106986 Length: 292 # NCBI annotation: neck protein gp14 # Family: family:all:1104 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195128;genbank:gi:58532905;uniprot:Q5GQV9;genbank:GeneID:3260483 Probab=100.00 E-value=5.6e-98 Score=553.77 Aligned_cols=290 Identities=44% Similarity=0.777 Sum_probs=245.1 Q ss_pred ccccceEe--ecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeeeecchhhcCCcccc Q lcl|NC_019444. 16 YTSSCRLN--LNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAYVNNVEGWEGQGDL 93 (486) Q Consensus 16 ~~~~~~~~--~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~ 93 (486) --+||||| +|||.+||+|+|+||+|||||||+||||||||+|+ |.+|+||++|||+|||+|||||+|||||+|++|| T Consensus 1 m~~npyfn~~~~~~~~eQ~L~~~LV~Esiq~~G~dvyYlpRe~~~-d~~~~E~~~skF~~a~~ieaY~~~~eg~~g~~~~ 79 (292) T protein:vir:10 1 MPTSPYFPSYYSGYSGEQNLVQDLVDEQIKLFGTDIYYLPRTILR-DNTLDDVIYNKFERQFQVEMLLQNVEGFGSPSEF 79 (292) T ss_pred CCcCccccccccCcCchhHHHHHHHHHHHHhcCceEEEechhhhc-ccccccccccccccceeEEEEeechhccCCCcce Confidence 56899999 79999999999999999999999999999999999 9999999999999999999999999999999999 Q ss_pred hhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEeccee Q lcl|NC_019444. 94 LSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFE 173 (486) Q Consensus 94 ~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~ 173 (486) |||||||++|||+|+|||+||++++++. .+.+++||||||||||||+++||||+|||++.||||+||+|+||++|++|+ T Consensus 80 ~sKFG~~~~De~t~~is~~~f~~~~~~~-~~~~~~~P~eGDLIYfPl~~~lFEI~~ve~~~PfyQ~gk~~~~~l~~~~F~ 158 (292) T protein:vir:10 80 ISKFGLRITDEVRFIVSQRRWDEEAVNY-DLNVNGRPNEGDLLYFPLTQDIYEIKFVEREDPFYQLGKNYFYIMTAEIYE 158 (292) T ss_pred eeecCceecceEEEEEccchhhhhcCcc-cccccCCCccccEEEEcCCCcEEEEEcccCCCchhhhCCceEEEEEEEEEe Confidence 9999999999999999999999999975 777889999999999999999999999999999999999999999999999 Q ss_pred ecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEecCCcccccccccee Q lcl|NC_019444. 174 YSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTDGGEYYKSALPPTV 253 (486) Q Consensus 174 ~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~~Gsg~t~t~~~tV 253 (486) |+++.+++++++++.+....... +.+. T Consensus 159 Ys~E~idtgl~eiD~i~~~~sse---------------------------------------LdL~-------------- 185 (292) T protein:vir:10 159 YGSDNISTGVEEIDELETLFSSA---------------------------------------IAIA-------------- 185 (292) T ss_pred ecCceecCCCCcccccccccccc---------------------------------------ccee-------------- Confidence 99999999999886332211000 0000 Q ss_pred ecCCCCccceeeeeeecccceeeecCCCCcccccceeEcCCCCCcccceeeeeccCCcceeeeccceeeeeccCCcccee Q lcl|NC_019444. 254 TISDPPASGGITAFSTSTDTGSYSNPTGTGYTVGTYTTTNLTGTGSGAIATVSGVNGSGGLTVSGGDLSLTNLTGTGYST 333 (486) Q Consensus 254 tis~~~~~~~~~~~~t~t~~~~~~~~~gsg~t~~~~~t~~~~~~~~~~t~~~~~~~~~~~~t~t~gt~t~t~~~gsg~t~ 333 (486) T Consensus 186 -------------------------------------------------------------------------------- 185 (292) T protein:vir:10 186 -------------------------------------------------------------------------------- 185 (292) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cceeEecCCCCcCccceeecCccccceeeeEecCCCccceeeecCCCccccccceeEeecCCCCceeEEEeeccCCceeE Q lcl|NC_019444. 334 GDVLRINGGDDNAHIRVVSVGITSPASATATVSSAGIVTDITITSGGTGYTSVPTVTIDYSPKDSRAEVKSWNASTRELQ 413 (486) Q Consensus 334 ~~tv~~~g~~~~~~~~~~~~~~~~~at~t~t~~~~g~~~~vtv~~~Gsg~t~~~~vtvt~~~~gtt~~~~~~~~~~~~~~ 413 (486) ....+.|.. .....+++...+..+++.+|+.+..... T Consensus 186 ------------------------------------------pi~~g~G~f-~inE~vtge~sg~~AEv~sw~~~t~~L~ 222 (292) T protein:vir:10 186 ------------------------------------------LSIGGTGDF-DLGEIVTGGISGTEAEVKSWDSSSRILQ 222 (292) T ss_pred ------------------------------------------ecccCCccc-cCCceeeecccceEEEEEEccCCCceEE Confidence 000000000 0001122333444567788888888888 Q ss_pred EEecCcccccCceeccCCccccccccccccccccccccccccCceeeecCCceEEeeeccccceeccccccc Q lcl|NC_019444. 414 VINRTGTFNTAETITGLTSGARWSPESYNTLNNTNTADSIDQNYSFETADDDIIDFTEGNPFGSIGSVTDTT 485 (486) Q Consensus 414 v~~~tgt~t~~~~~tg~ts~a~~~~~~~~t~~~t~t~~~~~~~~~~~t~~~~~~~~t~~np~G~~g~~~~~t 485 (486) +.+..+.+.++..++|..+++.+...+.....++. .+++.+..+++.+++++||+++||||..|+.+.+. T Consensus 223 V~n~~GsF~T~e~i~G~~Sga~~~v~si~~~~g~~--~t~a~~~~iEt~~d~i~df~e~npfg~~~~~~~~~ 292 (292) T protein:vir:10 223 VINRTGTFEEGESVTGNDSGSVWVVDSFDTLNNTN--SEYDQNREIESTADTIIDWSESNPFGEYGNFTGSI 292 (292) T ss_pred EEeCccccccCceeeEeecCCeEEeeEEEEeCCCC--CcccccceeccccCcEEeeccCCcCcccccccccC Confidence 88888899999999999999998777775555554 36888999999999999999999999999999998 No 5 >protein:vir:103452 Length: 256 # NCBI annotation: head completion # Family: family:all:1104 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803104;genbank:gi:116326384;genbank:GeneID:4405481 Probab=100.00 E-value=1.5e-93 Score=529.53 Aligned_cols=246 Identities=27% Similarity=0.465 Sum_probs=202.0 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|..+|+++++.+++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|.||+|+++|||+|||+|||| T Consensus 10 a~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (256) T protein:vir:10 10 AKLENHTGYSQTNETEILNPYVNFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKFTKAWKFAAY 89 (256) T ss_pred eeecCCcchhhhhhhcccceeeeeeccCchhhHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEE Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||++++++. ||||||||||||+++||||+|||++.||||+| T Consensus 90 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (256) T protein:vir:10 90 LNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGK-------EPKEGDLIYFPMDNSLFEINWVEPYDPFYQLG 162 (256) T ss_pred eehhhccCCccccceecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEEeccCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccce-eEEEe Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVS-AITLT 239 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~t-sitvt 239 (486) |+|+|+++|++|+|++|.+++++++++.+..............+..+....+.............+. .-+. -+.+. T Consensus 163 kn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~eldl~pi~~ldG~~di~~~~~~e~~~~~~e~~---~~v~~~~~in 239 (256) T protein:vir:10 163 QNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNPVRNLNGIHDINIDQYAEVDQINSEAK---EYVEPYVVVN 239 (256) T ss_pred CCeEEEEEEEEEeeCCceecccCCCCccccCCccchhhhccccccccccccCccccccchhhhhccc---cccccceEec Confidence 9999999999999999999999999999987666665555555555444444332222211111000 0011 23444 Q ss_pred cCCccccccccceeecC Q lcl|NC_019444. 240 DGGEYYKSALPPTVTIS 256 (486) Q Consensus 240 ~~Gsg~t~t~~~tVtis 256 (486) +.|+++.+..-..--.. T Consensus 240 ~~G~~~~~~pfd~~~~~ 256 (256) T protein:vir:10 240 NRGKSFESSPFDNDFMD 256 (256) T ss_pred CCCCCCcCCCccccccC Confidence 44444333211000000 No 6 >protein:vir:7199 Length: 256 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049773;genbank:gi:9632588;genbank:GeneID:1258695 Probab=100.00 E-value=1.7e-93 Score=529.25 Aligned_cols=246 Identities=27% Similarity=0.466 Sum_probs=199.5 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.+|+++++.+++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|.||+|+++|||+|||+|||| T Consensus 10 a~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (256) T protein:vir:71 10 AKLENRTGYSQTNETEILNPYVNFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKFTKAWKFAAY 89 (256) T ss_pred eeecCCcchhhhhhhcccceeeeeeccCchhhHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEE Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||++++++. ||||||||||||+++||||+|||++.||||+| T Consensus 90 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (256) T protein:vir:71 90 LNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGK-------EPKEGDLIYFPMDNSLFEINWVEPYDPFYQLG 162 (256) T ss_pred eehhhccCCccccceecCceecceEEEEEccchhhhhhcCC-------CCccccEEEEcCCCcEEEEEcccCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccce-eEEEe Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVS-AITLT 239 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~t-sitvt 239 (486) |+|+|+++|++|+|++|.+++++++++.+.................+....+.............+. .-+. -+.+. T Consensus 163 kn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~e~~eldl~~i~~ldg~~di~~~~~~e~~~i~~e~~---~~ve~~~~in 239 (256) T protein:vir:71 163 QNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNAVRNLNGIHDINIDQYAEVDQINSEAK---EYVEPYVVVN 239 (256) T ss_pred CCeEEEEEEEEEecCCceecccCCCCCcccCCcccccccccccccCcccccccccccchhhhhhccc---cccccceeec Confidence 9999999999999999999999999998876655554444444444433333222221111100000 0001 23444 Q ss_pred cCCccccccccceeecC Q lcl|NC_019444. 240 DGGEYYKSALPPTVTIS 256 (486) Q Consensus 240 ~~Gsg~t~t~~~tVtis 256 (486) +.|+++.+..-..--.. T Consensus 240 ~~G~~~~~~pfd~~~~~ 256 (256) T protein:vir:71 240 NRGKSFESSPFDNDFMD 256 (256) T ss_pred CCCCCCcCCCccccccC Confidence 44444332211000000 No 7 >protein:vir:5658 Length: 278 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899597;genbank:gi:34419584;genbank:GeneID:2545699 Probab=100.00 E-value=4e-92 Score=521.66 Aligned_cols=262 Identities=26% Similarity=0.371 Sum_probs=187.1 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.|+|+.+|||+|++|++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||| T Consensus 13 a~l~~~~gy~~~~~~~~~NpYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 92 (278) T protein:vir:56 13 AKLESQKGYDQIYNNHLVNPYFNWVNHTNEQNLTDMLVAESIINRGVECVYLRREMEKVDLVFGEDPMSKFTQNFRMSLY 92 (278) T ss_pred EEecCCcccchhcccccccceeeccCCCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEE Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||+|+|||+||+++++. .||||||||||||+++||||+|||++.||||+| T Consensus 93 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 165 (278) T protein:vir:56 93 VESFEGWDGDGDWYSKFGFQVNDEMNVCINPKLFAQQGDG-------KQPLMGDLIYFPLANSLFEISWIEREDPWYMNG 165 (278) T ss_pred eehhhccCCCceeeeecCceecceEEEEEccchhhhcCCC-------CCCccccEEEEcCCCcEEEEEccCCCCchhhhC Confidence 9999999999999999999999999999999999999997 499999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceecccccee--eeeeecccc--ccccccccCCccceeeeecceeeeccccceeeeeccccceeE Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAA--IDAIETAFA--NSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAI 236 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~--~~~i~~~~~--~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsi 236 (486) |+|+|+++|++|+|++|.++.+.+. ++.+..... ........-.+... ..+............... ....... T Consensus 166 k~~~~~l~ce~F~Ys~E~~dtg~pe~~~d~i~~i~ef~e~~~~~ld~~~~~~-l~G~~di~~~~~~e~~~~--~~e~~~f 242 (278) T protein:vir:56 166 VLPMRKMKMTKFVYSGEEINLEKPEAVIDSIDDILNFGEDTDEMIDIDKINA-LDGRWDIGIEQGAEITQI--EDEVDKF 242 (278) T ss_pred CceEEEEEEEEeeecCceecccCCccccccccchhhcccchhhccccccccc-ccchhcccchhhhhhhhh--hhcccee Confidence 9999999999999999999877444 444443322 11111222222211 112222221111111110 0011111 Q ss_pred EEecCCccccccccceeecCCCCccceeeeeeecccceee Q lcl|NC_019444. 237 TLTDGGEYYKSALPPTVTISDPPASGGITAFSTSTDTGSY 276 (486) Q Consensus 237 tvt~~Gsg~t~t~~~tVtis~~~~~~~~~~~~t~t~~~~~ 276 (486) +.....-++.+..+|+-.-..+.+.. + .....-.+. T Consensus 243 ~~~~~v~~~~~~~~~t~~~n~~~g~~-v---~~~~~~D~f 278 (278) T protein:vir:56 243 YESEQVVPSGSDVQPTDPRNATIGFN-V---NNSNPFDSF 278 (278) T ss_pred eecCceecCCCCccccCcccccCCCc-C---ccccccccC Confidence 11111111111111110000000000 0 000000000 No 8 >protein:vir:100535 Length: 253 # NCBI annotation: gp14 head completion protein # Family: family:all:1104 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656376;genbank:gi:109290127;genbank:GeneID:4156513 Probab=100.00 E-value=1.9e-91 Score=517.92 Aligned_cols=246 Identities=28% Similarity=0.388 Sum_probs=195.7 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.+|+++++++++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|.||+||++|||+|||+|||| T Consensus 8 a~l~~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 87 (253) T protein:vir:10 8 ATLENRSGYQQTNEQNILNPYVKFNRYEGSQALHDTLVAESIQMRGLEFYYLEREYTNLDLLFGEDPNSRFEKAWKFAAW 87 (253) T ss_pred eEecCCcchhhhhhhccccceeeccccCchhHHHHHHHHHHHHHcCceEEEcchhhccccCccccccccccccceeEEEe Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||+|+|||+||++++++. ||||||||||||+++||||+|||++.||||+| T Consensus 88 l~~~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (253) T protein:vir:10 88 LNSFESYEGQQSFFSKFGHQQNDEVRISINPGLFKYQVNGK-------EPKLGDLIYMPMDNSLFEITWVEPYTPFYQMG 160 (253) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEec Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTD 240 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~ 240 (486) |+|+|+++|++|+|+++.+++++++++.+...+ ......+.....+....+..........-.-+... +....+. T Consensus 161 kn~~~~l~ce~F~Ys~E~i~tgi~~id~Ie~~~-~~ldl~~i~~l~G~~Di~~~~~~e~~~~~~e~~~~---v~~~~~~- 235 (253) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKLAPQFQEKPEIEDQY-NGLDLEPILNLDGFIDQKINEFGENVQAQNEARPF---VEPFDPI- 235 (253) T ss_pred CceEEEEEEEEEecCCccccccCcccccccchh-hhhhhhhhhcCCCccccccccccccchhhhccccc---cccceec- Confidence 999999999999999999999999999998443 45555555555544444333222111111000000 0000000 Q ss_pred CCccccccccceeecCCC Q lcl|NC_019444. 241 GGEYYKSALPPTVTISDP 258 (486) Q Consensus 241 ~Gsg~t~t~~~tVtis~~ 258 (486) .+.+...-++|-..-.+- T Consensus 236 ~~~~~~g~~spf~~~~~~ 253 (253) T protein:vir:10 236 STNPVNSFNSPFGRHEGQ 253 (253) T ss_pred cCCCCccccCcccccCCC Confidence 000000000110000000 No 9 >protein:vir:98258 Length: 248 # NCBI annotation: gp14 head completion # Family: family:all:1104 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239191;genbank:gi:66391666;genbank:GeneID:3416360 Probab=100.00 E-value=2.1e-91 Score=517.66 Aligned_cols=239 Identities=28% Similarity=0.409 Sum_probs=205.8 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.+|+++++++++|||||+|+|.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||| T Consensus 10 a~l~~~~gy~~t~~~~~lnPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (248) T protein:vir:98 10 AQLSTGEGVDRNLKDQVTNPYVNWYKYNPTQQLHDSLTAESIQMKSPDMYYVRREFVNIDKILGEDRESKFTKSWKIAAY 89 (248) T ss_pred eEecCCcchhhhhhcccccceeeccccCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEe Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||+|+|||+||++++++ +||||||||||||+++||||+|||+ .||||+| T Consensus 90 l~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~-dPFYQ~G 161 (248) T protein:vir:98 90 IESYANYEGQRDFFSKFGLSSNDEMTLVLNPRLFAHQTDG-------GIPVLGDLVYFPMDNSLFEITWVEA-DPFYQFG 161 (248) T ss_pred eehhhccCCccceeeecCceecceEEEEEccchhhhcCCC-------CCCccccEEEEcCCCceEEEEecCC-CchhhhC Confidence 9999999999999999999999999999999999999986 5999999999999999999999998 6999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEec Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTD 240 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~ 240 (486) |+|+|+++|++|+|+++.+++++.+++.+.......+......+..+....+..........-.-+... +....+.+ T Consensus 162 kn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~~iDl~~i~~ldg~~Di~~~~~~e~~~~~~E~~~~---~~~~~vin 238 (248) T protein:vir:98 162 DRPQRKINLAKFIYTGEELAPELQRNEGIHIEPDAELDLEPIRNLDGLADINIEQYEEDKEFEREGDEF---IESFDVVN 238 (248) T ss_pred CCeEEEEEEEEeeeCCccccccCCCccccCCCCCcchhHHHhhcCccccccCcccccchhhhhhhhhhh---hcccceec Confidence 999999999999999999999999999999888888888888888777766655444333222211111 11122222 Q ss_pred CCccccccccc Q lcl|NC_019444. 241 GGEYYKSALPP 251 (486) Q Consensus 241 ~Gsg~t~t~~~ 251 (486) . .|...+.+| T Consensus 239 ~-~g~~~~~~~ 248 (248) T protein:vir:98 239 G-RGSPFATLP 248 (248) T ss_pred C-cCCCccCCC Confidence 1 111112222 No 10 >protein:vir:6890 Length: 254 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861866;genbank:gi:32453657;genbank:GeneID:1494292 Probab=100.00 E-value=5.5e-91 Score=515.41 Aligned_cols=244 Identities=27% Similarity=0.454 Sum_probs=194.1 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.||+++++.+++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|.||+|+++|||+|||+|||| T Consensus 10 a~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (254) T protein:vir:68 10 AKLENRGGYSQTNETEILNPFVNFNNYENSQTLADVLVAESIQMRGIECFYVPREYVAVDLIFGEDLKNKFTKAWKFAAY 89 (254) T ss_pred eeecCCcchhhhhhccccceeEEeeccCchhHHHHHHHHHHHHHcCceEEEechhhhccccccccccccccccceeEEEe Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|+++||||||||++|||||+|||+||+++++. .||||||||||||+++||||+|||++.||||+| T Consensus 90 l~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (254) T protein:vir:68 90 LNSFEGYEGAKSFFSNFGMQVQDEVTLSINPGLFKHQVNN-------QEPKEGDLIYFPMDNSLFEINWVEPYDPFYQVG 162 (254) T ss_pred eehhhccCCcccchhhcCceecceEEEEEcCchhhhhcCC-------CCCccccEEEEcCCCceEEEeccCCCCchhhhC Confidence 9999999999999999999999999999999999999986 599999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEe- Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLT- 239 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt- 239 (486) |+|+|+++|++|+|+++.+++++++++.+...-...........-.+...............-.-+. --+....+. T Consensus 163 Kn~~~~l~ce~F~Ys~E~idt~i~~id~I~~~e~~~ldl~~i~~ldG~~di~~~~~~E~~~~~~e~~---~f~e~~~~vn 239 (254) T protein:vir:68 163 KNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSDLELNPVRNLDGIHDINIDEYSEVEQINSEAS---EYVEPYVVVN 239 (254) T ss_pred CceEEEEEEEEEeeCCccccCCCcccCCccCcccCCcchhhHhhhcchhhccccchhhHHHHHhhhh---hhcccceeec Confidence 9999999999999999999999999999976655555444444433333332221111110000000 000111111 Q ss_pred cCCccccccccceeecCCCCccceeeeee Q lcl|NC_019444. 240 DGGEYYKSALPPTVTISDPPASGGITAFS 268 (486) Q Consensus 240 ~~Gsg~t~t~~~tVtis~~~~~~~~~~~~ 268 (486) +.|+. ++|- ... - .. T Consensus 240 ~~g~~----~~pf---~~~----~---~~ 254 (254) T protein:vir:68 240 NRGRQ----NSPF---DDG----F---MN 254 (254) T ss_pred CCCCC----CCcc---ccc----c---cC Confidence 11110 0010 000 0 00 No 11 >protein:vir:101154 Length: 252 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932505;genbank:gi:37651631;genbank:GeneID:2610647 Probab=100.00 E-value=2.1e-90 Score=512.25 Aligned_cols=245 Identities=24% Similarity=0.360 Sum_probs=193.6 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.+|+++++++++|||||+++|.+||+|+|+||+|||||||+||||||||+|++|.||+||++|||+|||+|||| T Consensus 8 a~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~ka~~ieaY 87 (252) T protein:vir:10 8 ATLENRGGYMRTNEKNILNPYVKFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEKAWKFAAW 87 (252) T ss_pred eEecCCcchhhhhhhccccceeeecCccchhHHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEe Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||++++++. ||||||||||||+++||||+|||++.||||+| T Consensus 88 l~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (252) T protein:vir:10 88 LNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGK-------EPALGDLIYMPMDNSLFEITWVEPYSPFYQNG 160 (252) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEec Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTD 240 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~ 240 (486) |+|+|+++|++|+|+++.++++.++++.+...+. .....+...-.+...............-.-+. .-+....+.+ T Consensus 161 kn~~~~l~ce~F~Ys~E~idt~~~~id~Ie~~~s-~ldl~~i~~ldG~~Di~~~~~~e~~~~~~e~~---~~~e~~~~i~ 236 (252) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKITPVVQEKPEIEDMYN-GLDLAPLLNLDGMIDQKIDQFAENIAVQQKVK---QYAEPFDPIS 236 (252) T ss_pred CCeEEEEEEEEEeeCCceecCcccccCccchhhh-ccchHHHhhcCCeeccccccchhhHHHHHhhh---hhhccceeec Confidence 9999999999999999999999999999885543 33333333333333333222111110000000 0011111111 Q ss_pred CCccccccccceeecCC Q lcl|NC_019444. 241 GGEYYKSALPPTVTISD 257 (486) Q Consensus 241 ~Gsg~t~t~~~tVtis~ 257 (486) ..+-.... +|-..-.+ T Consensus 237 ~~~~~~~~-~pf~~~~~ 252 (252) T protein:vir:10 237 TNSFGNFD-SPFGKHEA 252 (252) T ss_pred CCCCCCcC-CcccccCC Confidence 11100000 01000011 No 12 >protein:vir:101800 Length: 252 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238877;genbank:gi:66391952;genbank:GeneID:3416627 Probab=100.00 E-value=2.1e-90 Score=512.25 Aligned_cols=245 Identities=24% Similarity=0.360 Sum_probs=193.6 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.+|+++++++++|||||+++|.+||+|+|+||+|||||||+||||||||+|++|.||+||++|||+|||+|||| T Consensus 8 a~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~ka~~ieaY 87 (252) T protein:vir:10 8 ATLENRGGYMRTNEKNILNPYVKFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEKAWKFAAW 87 (252) T ss_pred eEecCCcchhhhhhhccccceeeecCccchhHHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEe Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||++++++. ||||||||||||+++||||+|||++.||||+| T Consensus 88 l~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (252) T protein:vir:10 88 LNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGK-------EPALGDLIYMPMDNSLFEITWVEPYSPFYQNG 160 (252) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhccCC-------CCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEEec Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITLTD 240 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitvt~ 240 (486) |+|+|+++|++|+|+++.++++.++++.+...+. .....+...-.+...............-.-+. .-+....+.+ T Consensus 161 kn~~~~l~ce~F~Ys~E~idt~~~~id~Ie~~~s-~ldl~~i~~ldG~~Di~~~~~~e~~~~~~e~~---~~~e~~~~i~ 236 (252) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKITPVVQEKPEIEDMYN-GLDLAPLLNLDGMIDQKIDQFAENIAVQQKVK---QYAEPFDPIS 236 (252) T ss_pred CCeEEEEEEEEEeeCCceecCcccccCccchhhh-ccchHHHhhcCCeeccccccchhhHHHHHhhh---hhhccceeec Confidence 9999999999999999999999999999885543 33333333333333333222111110000000 0011111111 Q ss_pred CCccccccccceeecCC Q lcl|NC_019444. 241 GGEYYKSALPPTVTISD 257 (486) Q Consensus 241 ~Gsg~t~t~~~tVtis~ 257 (486) ..+-.... +|-..-.+ T Consensus 237 ~~~~~~~~-~pf~~~~~ 252 (252) T protein:vir:10 237 TNSFGNFD-SPFGKHEA 252 (252) T ss_pred CCCCCCcC-CcccccCC Confidence 11100000 01000011 No 13 >protein:vir:80995 Length: 246 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469495;genbank:gi:157311452;genbank:GeneID:5602161 Probab=100.00 E-value=4e-90 Score=510.68 Aligned_cols=238 Identities=25% Similarity=0.447 Sum_probs=194.2 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|.++|+++++++++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|.||+|+++|||+|||+|||| T Consensus 8 a~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 87 (246) T protein:vir:80 8 ARLESQKDYENTRQTEILNPYVNFNSHTNTQTLADIMVAESIQMRGVEMYYIPREFVKPDMIFGEDVQSKFTKAWKFAAY 87 (246) T ss_pred eeecCCcchhhhcccccccceeeeCCCCchhhHHHHHHHHHHHHcCceEEEechhhhccccccccccccccccceeEEEe Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||+++++. .||||||||||||+++||||+|||++.||||+| T Consensus 88 l~s~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (246) T protein:vir:80 88 INSFDGYEGAGNFFQSFGYTANDELTITINPNLFKHQVDN-------KEPKSGDLFYIPMSNDLFEISYVEPYQPFFQAG 160 (246) T ss_pred eehhhccCCccceeeecCceecceEEEEEccchHhhhhCC-------CCCccccEEEEcCCCceEEEecccCCCchhhhC Confidence 9999999999999999999999999999999999999996 599999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEE-e Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITL-T 239 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitv-t 239 (486) |+|+|+++|++|+|+++.+++++.+++++...............-.+....+..........-.-+.. -+....+ . T Consensus 161 Kn~v~~l~ce~F~Ys~E~~~t~i~~~d~I~~de~~~ldl~~i~nldg~~Din~~~~~e~~~~~~e~~~---f~~~~~~~~ 237 (246) T protein:vir:80 161 KNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQYKEDKQFRSEGQD---FIDPFDPIN 237 (246) T ss_pred CCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCcccchhhhhhhcchhh---hcccceeec Confidence 99999999999999999999999999998876655554444443333333322221111110000000 0000111 1 Q ss_pred cCCccccccccceeec Q lcl|NC_019444. 240 DGGEYYKSALPPTVTI 255 (486) Q Consensus 240 ~~Gsg~t~t~~~tVti 255 (486) +-|+.... + T Consensus 238 ~~gspf~~-------~ 246 (246) T protein:vir:80 238 GKGSPFAD-------F 246 (246) T ss_pred CCCCcccc-------C Confidence 11110000 0 No 14 >protein:vir:6590 Length: 246 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891721;genbank:gi:33620667;genbank:GeneID:1725307 Probab=100.00 E-value=3.9e-90 Score=510.73 Aligned_cols=238 Identities=26% Similarity=0.460 Sum_probs=194.2 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|.++|+++++++++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|.||+|+++|||+|||+|||| T Consensus 8 a~l~~~~gy~~~~~~~~lNPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 87 (246) T protein:vir:65 8 ARLESQKDYENTRQTEILNPYVNFHKYTNTQTLADVMVAEAIQMRGVELYYIPREFVKPDMIFGEDVQSKFTKAWKFAAY 87 (246) T ss_pred eeecCCcchhhhcccccccceeecCCCcchhhHHHHHHHHHHHHcCceEEEechhhcccccccccccccccccceeEEEe Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||+++++. +||||||||||||+++||||+|||++.||||+| T Consensus 88 l~~~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (246) T protein:vir:65 88 INSFDGYEGAGNFFSSFGYQANDELTFTVNPNLFKHQVDD-------QEPKSGDLIYIPMSNDLFEINYVEPYQPFFQAG 160 (246) T ss_pred eehhhccCCccceeeecCceecceEEEEEcCchhhhhcCC-------CCCccccEEEEcCCCcEEEEecccCCCchhhhC Confidence 9999999999999999999999999999999999999996 699999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeEEE-e Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAITL-T 239 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsitv-t 239 (486) |+|+|+++|++|+|+++.+++++.+++++...............-.+....+..........-.-+.. -+....+ . T Consensus 161 Kn~v~~l~ce~F~Ys~E~~~~~i~~~d~I~~de~~~ldl~~i~~ldg~~Din~~~~~e~~~~~~e~~~---f~~~~~~~~ 237 (246) T protein:vir:65 161 KNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQYKEDKQFRSEGQD---FIDPFDPIN 237 (246) T ss_pred CCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCccchhhhhhhhcchhh---hcccceeec Confidence 99999999999999999999999999998876655554444443333333322221111110000000 0000111 1 Q ss_pred cCCccccccccceeec Q lcl|NC_019444. 240 DGGEYYKSALPPTVTI 255 (486) Q Consensus 240 ~~Gsg~t~t~~~tVti 255 (486) +-|+.... + T Consensus 238 ~~gspf~~-------~ 246 (246) T protein:vir:65 238 GKGSPFAD-------F 246 (246) T ss_pred CCCCcccc-------C Confidence 11110000 0 No 15 >protein:vir:107937 Length: 257 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595290;genbank:gi:161622596;genbank:GeneID:5783656 Probab=100.00 E-value=6.6e-90 Score=509.49 Aligned_cols=247 Identities=25% Similarity=0.435 Sum_probs=188.3 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.+|+++++++++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||| T Consensus 10 a~le~~~gy~~t~~~~~lnPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~ka~~ieaY 89 (257) T protein:vir:10 10 AKLENNTGYANTNETEIMNPFVNFYRHENTQTLADALVAESIQMRGIELYYIPREYVNPDQLFGEDLQNKFTKAWKFAGY 89 (257) T ss_pred eeecCCcchhhhhhhccccceeecccCCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEe Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|+++||||||||++|||||+|||+||++++++. ||||||||||||+++||||+|||++.||||+| T Consensus 90 l~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~~-------rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 162 (257) T protein:vir:10 90 LDSFEGYSGDNTYFSKFGMMVNDEVTITINPNLFKHQCNGT-------EPVSGDLIYFPMDNSLFEINWVQPYDPFYQVG 162 (257) T ss_pred eehhhccCCCcceeeecCceecceEEEEEccchhhhhccCC-------CCccccEEEEcCCCceEEEecccCCCchhhhC Confidence 99999999999999999999999999999999999999974 99999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeeccccceeeeeccccceeE-EEe Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLYLAKATATLTSGVVSAI-TLT 239 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~~~t~t~~~~~~~~tsi-tvt 239 (486) |+|+|+++|++|+|+++.+.+.+.+..++...-...........-.+....+..........-..+ ..-+..- .+. T Consensus 163 kn~~~~l~ce~F~Ys~E~l~pel~~n~~~~V~e~~eldl~~~~~ldG~~di~~~~~~E~~~~~~e~---~~fi~p~~~~n 239 (257) T protein:vir:10 163 TNVQRRITATKFIYNGEELRPELQRNEGINIPEFSELDLMPVKNIDGLADISDIQYEEVNEINAEA---AEFVHPYVVIN 239 (257) T ss_pred CceEEEEEEEEeeeCCcccccccCCcccCCCCCccchhhhhhhhccchhhcCCchhhhHHHHHHhh---hhhhccccccC Confidence 999999999999999999988888776664433333333333333222222211111110000000 0000000 011 Q ss_pred cCCccccccccceeecCCCCccceeeeeee Q lcl|NC_019444. 240 DGGEYYKSALPPTVTISDPPASGGITAFST 269 (486) Q Consensus 240 ~~Gsg~t~t~~~tVtis~~~~~~~~~~~~t 269 (486) +.|+....+ | +...- ... T Consensus 240 ~~g~~~~~~--p---f~~~~-------~~~ 257 (257) T protein:vir:10 240 GRGEDAPPT--A---FDDAF-------LDD 257 (257) T ss_pred CCCCCCCCC--c---ccchh-------ccC Confidence 111110000 0 00000 000 No 16 >protein:vir:106285 Length: 262 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944101;genbank:gi:38640145;genbank:GeneID:2658033 Probab=100.00 E-value=6.8e-89 Score=503.96 Aligned_cols=241 Identities=22% Similarity=0.319 Sum_probs=187.4 Q ss_pred CcccCCCcccceeccccccceEeecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCccccccccccccceeeeeee Q lcl|NC_019444. 1 MTYRNDPPENCIQSDYTSSCRLNLNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAY 80 (486) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~ 80 (486) -.++|+.|||++++++++|||||+|+|.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||+|||+|||| T Consensus 10 a~le~~~gy~~~~~~~vlNPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~gEd~~SkF~ka~~ieaY 89 (262) T protein:vir:10 10 AQLETGAGYNKTYQTNVLNPYVNKHEYEPTLSLHEMLVAESIQMTGVEMYYIRREFVNFDRIFGEDMQSKFKKTYKVAMY 89 (262) T ss_pred eEecCCCcccCcchhccccceeccCCcCchhhHHHHHHHHHHHHcCceEEEcchhhhccccccccccccccccceeEEEe Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhhcCCcccchhhcCceecceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecC Q lcl|NC_019444. 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLG 160 (486) Q Consensus 81 ~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G 160 (486) |+|||||+|++|||||||||++|||||+|||+||++++++ +||||||||||||+++||||+|||++.||||+| T Consensus 90 l~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~-------~rP~EGDLIYfPl~nsLFEI~~VE~~~PFYQ~G 162 (262) T protein:vir:10 90 LESFDEYSGQRDFFSKFGMQVNDEITMSVSPKLFETQADG-------DRVKEGDLIYFPLNNSLFEVTWVEPSSPVVKRE 162 (262) T ss_pred eehhhccCCccceeeecCceecceEEEEEccchhhhhhcC-------CCCccccEEEEcCCCceEEEeeccCCCchhhhC Confidence 9999999999999999999999999999999999999997 599999999999999999999999999999999 Q ss_pred ceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeeeecceeeecc--------ccceeeeecccc Q lcl|NC_019444. 161 KGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTVGEEIVGDLY--------LAKATATLTSGV 232 (486) Q Consensus 161 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~~~~~~~~~~--------~~t~t~~~~~~~ 232 (486) |+|+|+++|++|+|+++.+++++.+++.+..... .........+....+........ -.-.--.+.+.+ T Consensus 163 kn~~~~l~ce~F~Ys~E~i~~~i~~id~i~~e~~---~l~~i~~lDg~~di~~~q~~e~~~~~~e~~~fv~~~d~v~~~g 239 (262) T protein:vir:10 163 QLAKYKVTAQKFIYSGEEIKPEFDPNRYVLGEDD---PLSQIKALDGRADISLDEFAEDDAFNEEAEDFVVEFDNIIGNG 239 (262) T ss_pred CceEEEEEEEEEeeCCccccccCccccccccccc---cccccccccceeecccccccchhHHhhhhhhhcchhcccCCCC Confidence 9999999999999999999999999998864432 11222222222222111100000 000000000111 Q ss_pred ce----eEEEecCCccccccccceeec Q lcl|NC_019444. 233 VS----AITLTDGGEYYKSALPPTVTI 255 (486) Q Consensus 233 ~t----sitvt~~Gsg~t~t~~~tVti 255 (486) .. .-+-....+..-.- -++ T Consensus 240 sp~~~~~~~~~~~~~~fdd~----~~~ 262 (262) T protein:vir:10 240 TPIAEHKPTKPAPVSAFDDL----ESF 262 (262) T ss_pred CcccccCCCCCCCCChhhhh----hcC Confidence 00 00000000000000 000 No 17 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=98.03 E-value=6.5e-06 Score=49.03 Aligned_cols=427 Identities=15% Similarity=0.117 Sum_probs=131.1 Q ss_pred Ccc-----cCCCcccceeccccccceEeecc-----cchhhHHHHHHHHHhhhhcCeeEEEeeeeccc-cCccccc-ccc Q lcl|NC_019444. 1 MTY-----RNDPPENCIQSDYTSSCRLNLNG-----SSQEQMFMGNLIIESIELYGQDIYYLPRTYVN-RDTILNE-VET 68 (486) Q Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~-~d~~~~~-~~~ 68 (486) |.+ ++...+++-.++ +.+=..+++| ..-|+-..|.|..|-+|-+=.|-|=+ +-||+ -|.--++ ++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~e~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~-~~~~~~~~~~~~~~~~~ 78 (470) T protein:vir:10 1 MALNPFFLQGTSSEQRLTQD-LINEHLKIYGVEVTYIPRKYVNTKSIIEEVQSSKFDDNFAI-EAYVNTYEGYGGQGDVL 78 (470) T ss_pred CcccceeEcCCCchhHHHHH-HHHHHhHhccceEEEechhhcccccccccccccccccceeE-EEEeecccCcCCcceee Confidence 432 222222222111 1111111111 12233333334444443322222212 11121 1222222 344 Q ss_pred ccccce----eeeeeeecchhhcCC-------cccchhhc---C--ceecceEEEEEccccee-hhhccceeecccCCCC Q lcl|NC_019444. 69 SNFTQA----LSIRAYVNNVEGWEG-------QGDLLSKF---G--VRIEDKTTFIFSRSKFT-EKVDDNAALNVEGRPN 131 (486) Q Consensus 69 ~~f~~~----~~~~~~~~~~~~~~~-------~~~~~~~f---g--~~~~~~~~~~i~~~~~~-~~~~~~~~~~~~~~p~ 131 (486) +||... +.|--=.+.||.+.. +..+++|- + ..-.|-+=|.+.+++|| ..|..+ +.+=+ T Consensus 79 ~~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~-----~p~~~ 153 (470) T protein:vir:10 79 TKFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNEEVILATRPREGDLVFFPLGSRLFEVKFVEHE-----DPFYQ 153 (470) T ss_pred eecCcccceEEEEEECCccccccccchhhcccCcccccccccccCCcccccEEEecCCCCEEEEEecCCC-----Ccchh Confidence 444322 222222233332222 11222211 1 22346677788888888 223221 12223 Q ss_pred cCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEecceeecceeccccceeeeeeeccccccccccccCCccceeee Q lcl|NC_019444. 132 EGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFEYSDEQLDTGVAAIDAIETAFANSIKLVMDAGGTGAFTV 211 (486) Q Consensus 132 egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~~s~~~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~~ 211 (486) -|+..++.+..-+|++.-.+...++-..+... +...+ .........+......................+++.... T Consensus 154 ~G~~~~~~it~~~f~ysge~~s~~v~~~~~~~-~~~g~---~~t~t~~~~g~~~~~t~~~~~g~vt~ititn~Gsgyt~~ 229 (470) T protein:vir:10 154 LGKNYVYQLKCELFEYEDEVIDTSIDAIDTVV-QDDGY---ISKLQLVGIGRTAEVAASIGVGYVREIFLNNDGSGFTSP 229 (470) T ss_pred cCcceeEEeeeceeEecCCccccceecccccc-ccccc---ceeeeecCCCccceeeeeecceeeeEeEeeccccceecc Confidence 68888888888888876655554432222211 10000 000000011100000000000000111112222222211 Q ss_pred ecceeeec-------cccceeeeeccccceeEEEecCCccccccccceeecCCCCccceeeeeee----cccceeeecCC Q lcl|NC_019444. 212 GEEIVGDL-------YLAKATATLTSGVVSAITLTDGGEYYKSALPPTVTISDPPASGGITAFST----STDTGSYSNPT 280 (486) Q Consensus 212 ~~~~~~~~-------~~~t~t~~~~~~~~tsitvt~~Gsg~t~t~~~tVtis~~~~~~~~~~~~t----~t~~~~~~~~~ 280 (486) ........ ...........+.+..+.+.+.|++|+. .|+|++.++...+....... ........... T Consensus 230 ptVti~~~~~~~~~~a~~~~~t~~~~g~vt~ititn~Gsgytt--~ptvt~~~~~g~ga~at~~~~~~~~g~~~itit~~ 307 (470) T protein:vir:10 230 PTITFSASPAFTDARAVGILTTRANVTSIEKILMTSAGAGYIT--PPTITISGGGGTGAAATCSIETVYQGVVNFNVVDG 307 (470) T ss_pred CEEEEccCCCCCCccceeeEeecceeeEEEEEEEecCcccccc--cceEEEccCCCccceeeeeecccccceeeEEEccC Confidence 11110000 0011112223346677889999999875 46777766543332221111 11234456678 Q ss_pred CCcccccceeEcCCCCCcccceeeeeccCCcceeeeccceee--eeccCCccceecceeEecCCCCcCccceeecCcccc Q lcl|NC_019444. 281 GTGYTVGTYTTTNLTGTGSGAIATVSGVNGSGGLTVSGGDLS--LTNLTGTGYSTGDVLRINGGDDNAHIRVVSVGITSP 358 (486) Q Consensus 281 gsg~t~~~~~t~~~~~~~~~~t~~~~~~~~~~~~t~t~gt~t--~t~~~gsg~t~~~tv~~~g~~~~~~~~~~~~~~~~~ 358 (486) +++|+..+.+.................... ...+.+. .....+.+++..+.+.+................ T Consensus 308 GsgYtt~ptvtit~~~sg~~a~~~a~~~~~-----~~~g~itsititn~Gsgyts~ptv~i~~~~~~~~~~t~~~~~--- 379 (470) T protein:vir:10 308 GVGYGTEPSIAVTQPGAGTTAVGIASIGMA-----GSDQVLKSVYIGNPGRGYTATPNVIVADPPSMSGIGTFTFNE--- 379 (470) T ss_pred CccccccceEEEecCCCCCcccceeEEEee-----cccceeeeEEeccCCcceeccceeEeecCccccccceeeeee--- Confidence 889988877766554433222211111100 0111111 122346677777776665433322211110000 Q ss_pred ceeeeEecCCCccceeeecCCCccccccceeEeecCCCC-ceeEEEeeccCCceeEEEecCcccccCceeccCCcccccc Q lcl|NC_019444. 359 ASATATVSSAGIVTDITITSGGTGYTSVPTVTIDYSPKD-SRAEVKSWNASTRELQVINRTGTFNTAETITGLTSGARWS 437 (486) Q Consensus 359 at~t~t~~~~g~~~~vtv~~~Gsg~t~~~~vtvt~~~~g-tt~~~~~~~~~~~~~~v~~~tgt~t~~~~~tg~ts~a~~~ 437 (486) .......+....+.....+................. ..+.............+........ ..+........ T Consensus 380 ---~~tg~tsgt~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~g~tvt~~~~~a~~~~~s~t~~~~----~~~~ts~~~i~ 452 (470) T protein:vir:10 380 ---VIKGSRSGTEARVKSWDDDTKILLVSNVGIGSTVSGFYTGESIVGQESGASYALGSYNSDDA----NDKYNDGDEFE 452 (470) T ss_pred ---eeeccccceeeeeeeecccceeeeecccceecccceeeeeeeEEeccccceeeEEEeccccc----Cceeeccceee Confidence 000000000000000000000000000000000000 0000000000000000000000000 00000000000 Q ss_pred ccccccccccccccccccCcee Q lcl|NC_019444. 438 PESYNTLNNTNTADSIDQNYSF 459 (486) Q Consensus 438 ~~~~~t~~~t~t~~~~~~~~~~ 459 (486) .. .+........++-..+ T Consensus 453 t~----~~~i~~~~~~np~~~~ 470 (470) T protein:vir:10 453 FN----ADQILDFTESNPFGNF 470 (470) T ss_pred cc----CCcEEeeeecCCCCCC Confidence 00 0000001111111111 No 18 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=95.86 E-value=0.0013 Score=36.32 Aligned_cols=341 Identities=17% Similarity=0.066 Sum_probs=97.6 Q ss_pred CcccCCC-----cccceecc--ccccceEeecc-----cchhhHHHHHHHHHhhhhcCeeEEEeeeeccc-cCccccc-c Q lcl|NC_019444. 1 MTYRNDP-----PENCIQSD--YTSSCRLNLNG-----SSQEQMFMGNLIIESIELYGQDIYYLPRTYVN-RDTILNE-V 66 (486) Q Consensus 1 ~~~~~~~-----~~~~~~~~--~~~~~~~~~~~-----~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~-~d~~~~~-~ 66 (486) -.|.|.| ++...|.- .+.+=...+.| -.-|+-..|.|..|-+|-+=.+-|=+ +-||+ .|.--+. + T Consensus 14 ~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~-~~y~~~~~~~~~~~~ 92 (390) T protein:vir:10 14 SDYTSSCRLNLNGSAQEQTFMENLIVESIELYGQNVYYLPRIYVNRDTILNEVETSRFEQALSV-RAYVNNVEGWEGQGD 92 (390) T ss_pred cceeeccEEEEeccCchhHHHHHHHHHHhHhcCceEEEechheeccccccccccccccccceEE-EEEeechhccCCccc Confidence 2222222 11111110 01111111111 11222223333444333221111111 11111 1111111 3 Q ss_pred ccccccceee--eee--eecchhhcCCcccchhhcC-ceecceEEEEEcccceeh-hhccceeecccCCCCcCCEEEEcc Q lcl|NC_019444. 67 ETSNFTQALS--IRA--YVNNVEGWEGQGDLLSKFG-VRIEDKTTFIFSRSKFTE-KVDDNAALNVEGRPNEGDLIWFPT 140 (486) Q Consensus 67 ~~~~f~~~~~--~~~--~~~~~~~~~~~~~~~~~fg-~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~~p~egdl~~~p~ 140 (486) +.|||--.-. +.. =.+-||-..+...++..=+ ...-|.+=|.+.+++||- .+..+ +.+=+-|++..+.| T Consensus 93 ~~skfg~~~~de~~~~~~~~~~~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~-----~p~yq~G~nyt~~i 167 (390) T protein:vir:10 93 LLSKFGVRIEDKTTFIFSRKKFTTAVDDNAVLNVEGRPNEGDLIWFPATRHLFEIKFVEAE-----RPFYQLGKGYVWEC 167 (390) T ss_pred eeeecCceecceEEEEECCcchhhhhCCcccccccCCCCCCceEEecCCCCEEEEEecCCC-----CCceEccCceeeee Confidence 4445431100 000 0011111111000000000 111244555566666651 11111 00111233333322 Q ss_pred CCceEEEEE-----eecCCCceecCceeEEEEEecc---eeecce--eccccceeeeeeeccccccccccccCCccceee Q lcl|NC_019444. 141 TNHLFEIQF-----VEAERPFYQLGKGYVWECQCEL---FEYSDE--QLDTGVAAIDAIETAFANSIKLVMDAGGTGAFT 210 (486) Q Consensus 141 ~~~~~~i~~-----ve~~~Pf~~~G~~~~~~~~~~~---~~~s~~--~~~~~~~~~~~i~~~~~~t~t~~~~~g~t~~~~ 210 (486) .-.+|+--. ..++++-.-........+.... ..+... ....+.............-.......++++... T Consensus 168 ~a~lf~ySge~iat~~seid~I~~~~~~~v~~~~~t~g~~~~t~~~~v~~~g~ga~~~a~v~~g~Vt~vtItn~GsGYt~ 247 (390) T protein:vir:10 168 QCELFEYSDEDLDTGVAEIDAIETAFANAIKLVMDAGGTGAFTVGEEIVGDLYLATATATISGDAVDAVTVTDGGEHYKS 247 (390) T ss_pred EEeeeccCCccccccccccccccccccceeeeeeccCCcccccccceeeecCcceeEEEEecCCeEEEEEEeeCCCCccc Confidence 222221111 1111110000111111111000 000000 000000000000000000111223333343332 Q ss_pred eecce----eeeccccceeeee-ccccceeEEEecCCccccccccceeecCCCCccceeeee-eecccceeeecCCCCcc Q lcl|NC_019444. 211 VGEEI----VGDLYLAKATATL-TSGVVSAITLTDGGEYYKSALPPTVTISDPPASGGITAF-STSTDTGSYSNPTGTGY 284 (486) Q Consensus 211 ~~~~~----~~~~~~~t~t~~~-~~~~~tsitvt~~Gsg~t~t~~~tVtis~~~~~~~~~~~-~t~t~~~~~~~~~gsg~ 284 (486) ..... .+.+..+.+.+.+ ..+.+..+++++.|++|+. .|+|++.+++........ ....+......+.+++| T Consensus 248 ~~~ptVtisgg~gtgAt~tatv~~~G~VtsItItn~GsGYt~--~PtVtI~g~g~~~~a~~~~~~g~v~~i~Itn~GsgY 325 (390) T protein:vir:10 248 ALPPTVTITGGGGSGATATATVSSAGIVTGITITSGGTGYTS--APTVTIDYSPKDNRAEVKSWNASTRELQVINRTGTF 325 (390) T ss_pred CceeEEEecCCCCccceeeeeecccceEEEEEEecCCccccC--CCEEEEeCCCCCceeEEEEeccEEEEEEEecCCcce Confidence 11111 1122333333333 3567889999999999976 478888877655544332 22344455566788999 Q ss_pred cccceeEcCCCCCcccceeeeeccCCcceeeeccceeeeeccCCccceecceeEecCCCCcCccceeecCccccceeeeE Q lcl|NC_019444. 285 TVGTYTTTNLTGTGSGAIATVSGVNGSGGLTVSGGDLSLTNLTGTGYSTGDVLRINGGDDNAHIRVVSVGITSPASATAT 364 (486) Q Consensus 285 t~~~~~t~~~~~~~~~~t~~~~~~~~~~~~t~t~gt~t~t~~~gsg~t~~~tv~~~g~~~~~~~~~~~~~~~~~at~t~t 364 (486) +..+.++...++...... ......... .......+..+...+...++.... ++.. ..+..+ T Consensus 326 tt~p~vt~~~~G~~~~~~-~~~t~~~~~--------~~~~~~~~~~~~t~~~~ii~~t~g-n~~g-------~v~n~T-- 386 (390) T protein:vir:10 326 NTAEVITGLTSGAKWSPE-SYNTLNNTN--------TADTIDQNYSFETADDDIIDFTEV-NPFG-------NIGSTT-- 386 (390) T ss_pred eeccEEEEecCCcceEEE-EEEecccce--------eeeeecccceeEeCCCceEeeccc-Cccc-------ccccce-- Confidence 888877655443222111 100000000 000001111111111111111100 0000 000000 Q ss_pred ecCCCcccee Q lcl|NC_019444. 365 VSSAGIVTDI 374 (486) Q Consensus 365 ~~~~g~~~~v 374 (486) ..++ T Consensus 387 ------~t~v 390 (390) T protein:vir:10 387 ------DTTI 390 (390) T ss_pred ------eccC Confidence 0000 No 19 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=72.71 E-value=0.18 Score=24.68 Aligned_cols=121 Identities=17% Similarity=0.183 Sum_probs=70.8 Q ss_pred ecccchhhHHHHHHHHHhhhhcCeeEEEeeeeccccCcccccccc-ccccceeeeeeeecchhhcCCcccchhhcCceec Q lcl|NC_019444. 24 LNGSSQEQMFMGNLIIESIELYGQDIYYLPRTYVNRDTILNEVET-SNFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIE 102 (486) Q Consensus 24 ~~~~~~~~~~~~~l~~e~~~~~g~~~~y~~r~~~~~d~~~~~~~~-~~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~ 102 (486) -.-+..-|+..+.|++| +|.+|.+.+++-..-|+-.++... ..-...|.+.+.+.+|+--.=++.+ ++.- T Consensus 1 M~~y~~~~~~a~~Li~k----fG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~~~idGtl-----I~~G 71 (122) T protein:vir:97 1 MARFDSAIALAKKLIKK----NGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQRYIDGQT-----IRMG 71 (122) T ss_pred CccchHHHHHHHHHHHH----hCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccchhhccCcE-----Eeec Confidence 11244455555666555 999999877776666655554322 2234568899999987643322222 2223 Q ss_pred ceEEEEEcccceehhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEecc Q lcl|NC_019444. 103 DKTTFIFSRSKFTEKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCEL 171 (486) Q Consensus 103 ~~~~~~i~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~ 171 (486) |.+-+....+ + .-+|+-||++-+ ++..|.|-.+++-.| .+....|++...+ T Consensus 72 D~~l~~~a~~-----~--------~~~P~~gD~v~~--~g~~~~Vi~v~~i~p---a~~~v~y~lqlRk 122 (122) T protein:vir:97 72 DQRVFMPAEG-----L--------TAPPEVEGLVLR--GLEVWKVIAVKPLNP---NGQAIMYELQVRQ 122 (122) T ss_pred CEEEEEeeCC-----C--------ccccccCCEEEe--CCEEEEEEeccccCC---CCceEEEEEEeeC Confidence 3332221111 1 126788998865 777888888876655 5555667666666 No 20 >protein:vir:81177 Length: 109 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285814;genbank:gi:148747735;genbank:GeneID:5247220 Probab=61.89 E-value=0.34 Score=23.12 Aligned_cols=105 Identities=17% Similarity=0.212 Sum_probs=64.6 Q ss_pred hcC----eeEEEeeeeccccCccccccccccccceeeeeeeecchhhcCCcccchhhcCceecceEEEEEcccceehhhc Q lcl|NC_019444. 44 LYG----QDIYYLPRTYVNRDTILNEVETSNFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVD 119 (486) Q Consensus 44 ~~g----~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~ 119 (486) |+= .-|.+..++.. .|. +++.+ .+|.+-+++-|.+....| .+.+..-+.+.....+|.|+ |.+.++ T Consensus 1 M~~g~L~~rI~i~~~~~~-~d~-~G~~~-~~w~~~~~~wA~v~~~s~----~e~~~a~~~~~~~~~~f~iR---~~~~i~ 70 (109) T protein:vir:81 1 MNPGQFRHKITLMKLVTT-QDE-IGNTI-EEWQPVRTCWAAIKTVNG----REYFAAASVQAERTYRFIIR---YTPGIN 70 (109) T ss_pred CCccccCccEEEEeeeee-eCC-CCCee-cceeeEEEEEEEEEecCc----hheeeccceeeeeeEEEEEE---eCCCCC Confidence 432 23445555444 466 56555 458899999999998744 45666666777888888886 333333 Q ss_pred cceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEecceeecce Q lcl|NC_019444. 120 DNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFEYSDE 177 (486) Q Consensus 120 ~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~~s~~ 177 (486) ..+.|.| .++.|+|.+|.+..+ +.-...+.|+...-.+- T Consensus 71 ------------~~~ri~~--~g~~y~I~~v~~~~~-----~~~~l~i~~~e~~~~~g 109 (109) T protein:vir:81 71 ------------ETMKIDY--QGRLFDIQSVLNDDE-----GKKTLTIIATERVAADG 109 (109) T ss_pred ------------cccEEEE--CCeEEEEEeecCCcc-----CCcEEEEEEEEeecCCC Confidence 3556666 689999999976533 32334667764221111 No 21 >protein:vir:1385 Length: 107 # NCBI annotation: Gp8 protein # Family: family:all:3858 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612837;genbank:gi:20065971;genbank:GeneID:935786 Probab=45.27 E-value=0.78 Score=21.19 Aligned_cols=106 Identities=12% Similarity=0.097 Sum_probs=68.4 Q ss_pred hcC-eeEEEeeeeccccCccccccccccccceeeeeeeecchhhcCCcccchhhcCceecceEEEEEcccceehhhccce Q lcl|NC_019444. 44 LYG-QDIYYLPRTYVNRDTILNEVETSNFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVDDNA 122 (486) Q Consensus 44 ~~g-~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~~~~ 122 (486) |.+ +-|.+..++-. .|..-+ ... .+.+-+++-|+++.. ...+++..=.++.+..+.|.|+-. +-+++ T Consensus 1 ~~~~hRI~i~~~~~~-~D~~G~-~~~-~w~~~~~~WA~v~~~----~g~E~~~a~~~~~~~~~~f~iRy~---~~i~~-- 68 (107) T protein:vir:13 1 MARYERISIKKLEEK-NIKGRR-QEE-CLIPFYDCWAEILDL----YGQELYGALQMKLENTIIFKIRYC---KKVEE-- 68 (107) T ss_pred CCcceEEEEEeeeee-eCCCCC-eec-ceEeEEEEEEEEecC----CchheeecceeheeeeEEEEEEec---CCccc-- Confidence 665 45677666655 575554 333 588999999999986 445777777777788888888543 22332 Q ss_pred eecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEeccee Q lcl|NC_019444. 123 ALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFE 173 (486) Q Consensus 123 ~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~ 173 (486) .++..++.|.| .++.|+|.+|.++.. +.-..++.|+... T Consensus 69 -----~~~t~~~Ri~~--~g~~y~I~~v~~~~~-----~~~~l~i~c~eV~ 107 (107) T protein:vir:13 69 -----LRNKENFIVEW--QGRKYEIYYPDFLGY-----NKQFVKLKCKEVL 107 (107) T ss_pred -----cccCcCcEEEE--CCeEEEEEecCCccc-----CCeEEEEEEEEeC Confidence 12333444444 789999999864432 3333577887765 No 22 >protein:vir:4343 Length: 118 # NCBI annotation: Orf10 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061506;genbank:gi:9635594;genbank:GeneID:1262867 Probab=40.13 E-value=0.98 Score=20.62 Aligned_cols=110 Identities=12% Similarity=0.087 Sum_probs=66.6 Q ss_pred hcC----eeEEEeeeeccccCccccccccc----cccceeeeeeeecchhhcCCcccchhhcCceecceEEEEEccccee Q lcl|NC_019444. 44 LYG----QDIYYLPRTYVNRDTILNEVETS----NFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFT 115 (486) Q Consensus 44 ~~g----~~~~y~~r~~~~~d~~~~~~~~~----~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~ 115 (486) |+- .-|.+..|..+ .|...++.... .+....++-|.++.. ...+++..-..+..-.+.|+|+-..+. T Consensus 1 M~~G~l~~rI~i~~~~~~-~d~~~G~~~~~w~~~~~~~~~~~WA~v~~~----sg~e~~~a~~~~~~~~~~f~iRy~~~~ 75 (118) T protein:vir:43 1 MLAYRMRHRIQFQRQVHT-QDPDTGEETTTWETVLFSGHADLPAEVLTG----PGRELIAADATQAETTARINCRWFPVE 75 (118) T ss_pred CCccccCccEEEEeeeee-cCCCCCcccCceeeeeecccceEEEEEEec----CccceeecccchheeeEEEEEEecccc Confidence 432 34566667666 47776654433 223334677888775 334566555667778888888755443 Q ss_pred hhhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEecceeecce Q lcl|NC_019444. 116 EKVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFEYSDE 177 (486) Q Consensus 116 ~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~~s~~ 177 (486) ..++. .+.|.+ .++.|+|++|..+. + +.-...+.|+...-.+. T Consensus 76 ~~It~------------~~Ri~~--~g~~y~I~~v~~~~---~--~~~~l~i~~~e~v~~g~ 118 (118) T protein:vir:43 76 RLELY------------TWRVLW--DGRVYNITSAETDV---T--ARREWRLRCSDGLTDGR 118 (118) T ss_pred cCCCc------------ccEEEE--CCeEEEEEecCCcc---c--CCeEEEEEEEEeccCCC Confidence 22332 455554 68999999996432 2 32336888887666665 No 23 >protein:vir:1890 Length: 110 # NCBI annotation: gp9 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037670;genbank:gi:9634128;genbank:GeneID:1262503 Probab=39.64 E-value=1 Score=20.56 Aligned_cols=106 Identities=13% Similarity=0.139 Sum_probs=64.4 Q ss_pred hcC----eeEEEeeeeccccCccccccccccccceeeeeeeecchhhcCCcccchhhcCceecceEEEEEcccceehhhc Q lcl|NC_019444. 44 LYG----QDIYYLPRTYVNRDTILNEVETSNFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKVD 119 (486) Q Consensus 44 ~~g----~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~~ 119 (486) |+= .-|.+..+..+ .|...++.... |..-+++-|.+... ...+++..=+.+....+.|+|+.++ .++ T Consensus 1 M~~G~L~~rI~i~~~~~~-~d~~~G~~~~~-~~~~~~~wA~v~~~----~~~e~~~a~~~~~~~~~~~~iR~~~---~I~ 71 (110) T protein:vir:18 1 MQAGKLRHRITLQEPVKV-QNPTTGAVINT-WRDVATVRAEVSPL----SAREFIAAQASQGEITTRIVIRYRA---GVT 71 (110) T ss_pred CCccccCccEEEEeeeee-ecCCCCccccc-eeeeEEEEEEEEec----CchheeecceeeeeeeEEEEEEecC---CCC Confidence 432 34555555555 47777755544 88888888888775 3345666666777888888887542 233 Q ss_pred cceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEecceeecc Q lcl|NC_019444. 120 DNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFEYSD 176 (486) Q Consensus 120 ~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~~s~ 176 (486) .++.|.| .++.|+|..|.+... + +...-.+.|+...-.+ T Consensus 72 ------------~~~ri~~--~g~~y~I~~v~~d~~--~--~~~~l~i~~~e~~~~G 110 (110) T protein:vir:18 72 ------------RKHRILF--RGAVYNIHGVLPDPK--S--GREYLTLPCSEGVNDG 110 (110) T ss_pred ------------cccEEEE--CCeEEEEEeccCCcc--c--CCeEEEEEEEEeccCC Confidence 2555655 689999999954321 1 2112367777544333 No 24 >protein:vir:7411 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:1030 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839928;genbank:gi:30089898;genbank:GeneID:1260685 Probab=30.15 E-value=1.6 Score=19.47 Aligned_cols=101 Identities=12% Similarity=0.114 Sum_probs=51.0 Q ss_pred eeeeccccCc-----------ccccc----ccccccceeeeeeeecchhhcCCcccchhhcCceecceEEEEEcccceeh Q lcl|NC_019444. 52 LPRTYVNRDT-----------ILNEV----ETSNFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTE 116 (486) Q Consensus 52 ~~r~~~~~d~-----------~~~~~----~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~ 116 (486) |||.+-+.|. ..+|+ ...+|-.-|.+-+-...- -..+.++-.|...+|.++|.|+-++ T Consensus 1 M~~~~~p~~ln~ri~Fg~~~~~~n~~g~~~~~~~~~~~f~~w~a~~t~----t~~q~~~~~Gt~~edT~~~vIRh~~--- 73 (116) T protein:vir:74 1 MAKTYKPNDFNRKCKIGVTKTVTTPTGGKIEKIDPATVLNVRFAAKMR----SLALQFQIIGTTTADTFDIAIRHNK--- 73 (116) T ss_pred CcccccccccceeEEeeeeeeeeCCCCCcccceEEeeeEEEEEEEeec----chheeeeeccccccCcEEEEEEeCC--- Confidence 6665443321 11111 111233333332222221 2345677788888999999998752 Q ss_pred hhccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEeccee Q lcl|NC_019444. 117 KVDDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFE 173 (486) Q Consensus 117 ~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~ 173 (486) -+++. ...-+++.+|+|..|.+.........-++--...+++- T Consensus 74 ~i~~~--------------m~v~~~g~~Y~Iv~Is~Dd~~~~~~yD~iTlk~~~~ga 116 (116) T protein:vir:74 74 LVTKK--------------MFVQIDDVLYNIINISSDESAKLIKFDILTLQAKKKGA 116 (116) T ss_pred CCCcC--------------cEEEECCeEEEEEEeCCCCccCcceeeeEEEEEEeecC Confidence 23332 23556889999999988765444433332211111111 No 25 >protein:vir:100244 Length: 109 # NCBI annotation: gp73 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355409;genbank:gi:77864699;genbank:GeneID:3725966 Probab=30.09 E-value=1.6 Score=19.46 Aligned_cols=107 Identities=13% Similarity=0.018 Sum_probs=65.8 Q ss_pred hhhhc--CeeEEEeeeeccccCccccccccccccceeeeeeeecchhhcCCcccchhhcCceecceEEEEEcccceehhh Q lcl|NC_019444. 41 SIELY--GQDIYYLPRTYVNRDTILNEVETSNFTQALSIRAYVNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRSKFTEKV 118 (486) Q Consensus 41 ~~~~~--g~~~~y~~r~~~~~d~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~fg~~~~~~~~~~i~~~~~~~~~ 118 (486) +++.. -.-|.+..++.+ .|..- +.+...|.+-+++-|.+...- ..+.+..=+++......|.|+.+ +-+ T Consensus 1 mm~~g~L~~rI~i~~~~~~-~d~~G-~~~~~~w~~~~~~wA~i~~~~----g~e~~~a~~~~~~~~~~i~iR~~---~~I 71 (109) T protein:vir:10 1 MLRSSDLTEFIVIERKGGR-TNENG-EPLPDDWVTHDEVWASVRFVS----GKEHVISGAVRSSAIASIRIRFR---EDI 71 (109) T ss_pred CCCccccCccEEEEeeeec-cCCCC-CeeccceeeEEEEEEEEEecC----chheeeccceeeeeeEEEEEEec---CCC Confidence 22221 234666655544 45443 345556888889999998863 34566665677788888888743 222 Q ss_pred ccceeecccCCCCcCCEEEEccCCceEEEEEeecCCCceecCceeEEEEEecceeecc Q lcl|NC_019444. 119 DDNAALNVEGRPNEGDLIWFPTTNHLFEIQFVEAERPFYQLGKGYVWECQCELFEYSD 176 (486) Q Consensus 119 ~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~Pf~~~G~~~~~~~~~~~~~~s~ 176 (486) ..++.|.| .++.|+|..|. |-.. .-...+.|+..+-.. T Consensus 72 ------------~~~~ri~~--~g~~y~I~~v~---~~~~---~~~l~i~c~egv~~~ 109 (109) T protein:vir:10 72 ------------DSEMRIRY--GDQLYDIVAVL---PNRR---KGSLDLPVKVGEKYV 109 (109) T ss_pred ------------CcccEEEE--CCeEEEEEeec---cCCC---CcEEEEEEEeeeccC Confidence 33666665 78999999984 3222 234578888754443 Done!