Query lcl|NC_014636.1_cdsid_YP_003969296.1 [gene=phiAS5_ORF0007] [protein=neck protein] [protein_id=YP_003969296.1] [location=6339..7136] Match_columns 265 No_of_seqs 31 out of 35 Neff 3.7 Searched_HMMs 1612 Date Thu Nov 7 15:34:01 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_7 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_7_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106285 Length: 262 100.0 2E-140 1E-143 786.5 18.2 262 1-265 1-262 (262) 2 protein:vir:6890 Length: 254 # 100.0 3E-132 2E-135 741.5 18.6 249 1-264 1-254 (254) 3 protein:vir:98258 Length: 248 100.0 2E-131 1E-134 737.7 17.4 245 1-246 1-248 (248) 4 protein:vir:80995 Length: 246 100.0 9E-131 5E-134 733.6 17.5 243 3-245 1-246 (246) 5 protein:vir:6590 Length: 246 # 100.0 2E-130 1E-133 732.1 17.5 243 3-245 1-246 (246) 6 protein:vir:101800 Length: 252 100.0 2E-130 1E-133 731.6 17.3 245 3-247 1-252 (252) 7 protein:vir:101154 Length: 252 100.0 2E-130 1E-133 731.6 17.3 245 3-247 1-252 (252) 8 protein:vir:7199 Length: 256 # 100.0 2E-130 1E-133 731.3 17.4 251 1-265 1-254 (256) 9 protein:vir:107937 Length: 257 100.0 4E-130 3E-133 729.9 18.1 253 1-265 1-257 (257) 10 protein:vir:103452 Length: 256 100.0 4E-130 2E-133 730.1 17.4 251 1-265 1-254 (256) 11 protein:vir:5658 Length: 278 # 100.0 5E-130 3E-133 729.3 17.0 262 1-262 1-278 (278) 12 protein:vir:100535 Length: 253 100.0 5E-128 3E-131 718.6 17.4 251 3-255 1-253 (253) 13 protein:vir:104476 Length: 308 100.0 5E-107 3E-110 603.5 13.5 251 1-265 1-303 (308) 14 protein:vir:106986 Length: 292 100.0 3E-102 2E-105 577.1 14.7 239 25-265 1-288 (292) 15 protein:vir:103005 Length: 390 100.0 7.6E-92 4.7E-95 520.1 14.4 229 10-265 1-236 (390) 16 protein:vir:104739 Length: 470 100.0 3.4E-81 2.1E-84 461.8 11.8 231 25-265 1-261 (470) 17 protein:vir:97237 Length: 122 29.4 1.7 0.001 19.4 10.1 121 27-173 1-122 (122) 18 protein:vir:1385 Length: 107 # 24.0 2.2 0.0014 18.7 8.3 106 53-175 1-107 (107) No 1 >protein:vir:106285 Length: 262 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944101;genbank:gi:38640145;genbank:GeneID:2658033 Probab=100.00 E-value=2e-140 Score=786.51 Aligned_cols=262 Identities=78% Similarity=1.297 Sum_probs=259.3 Q ss_pred CcccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKF 80 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF 80 (265) |+|||+|||||||+|+||++|++++|+|||||+++|++||+|+++||+|||||||+|||||||++|++|+||||+++||| T Consensus 1 m~~~d~~lfa~le~~~gy~~~~~~~vlNPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~gEd~~SkF 80 (262) T protein:vir:10 1 MFARNQSMFAQLETGAGYNKTYQTNVLNPYVNKHEYEPTLSLHEMLVAESIQMTGVEMYYIRREFVNFDRIFGEDMQSKF 80 (262) T ss_pred CcccccceeeEecCCCcccCcchhccccceeccCCcCchhhHHHHHHHHHHHHcCceEEEcchhhhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhh Q lcl|NC_014636. 81 KKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIK 160 (265) Q Consensus 81 ~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ 160 (265) ++||+|||||+||+||+|+|+||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++|||| T Consensus 81 ~ka~~ieaYl~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~~rP~EGDLIYfPl~nsLFEI~~VE~~~PFYQ 160 (262) T protein:vir:10 81 KKTYKVAMYLESFDEYSGQRDFFSKFGMQVNDEITMSVSPKLFETQADGDRVKEGDLIYFPLNNSLFEVTWVEPSSPVVK 160 (262) T ss_pred ccceeEEEeeehhhccCCccceeeecCceecceEEEEEccchhhhhhcCCCCccccEEEEcCCCceEEEeeccCCCchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCceEEEEEEEEeecCCccccCCccccccccccchhhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCCC Q lcl|NC_014636. 161 RERLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTDPFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTGT 240 (265) Q Consensus 161 ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g~ 240 (265) +||+|||+|+|+||+||||+|+|++++||+|+.+.+++++|++||||+||+++|++|.++++.||++||+||+||||+|| T Consensus 161 ~Gkn~~~~l~ce~F~Ys~E~i~~~i~~id~i~~e~~~l~~i~~lDg~~di~~~q~~e~~~~~~e~~~fv~~~d~v~~~gs 240 (262) T protein:vir:10 161 REQLAKYKVTAQKFIYSGEEIKPEFDPNRYVLGEDDPLSQIKALDGRADISLDEFAEDDAFNEEAEDFVVEFDNIIGNGT 240 (262) T ss_pred hCCceEEEEEEEEEeeCCccccccCccccccccccccccccccccceeecccccccchhHHhhhhhhhcchhcccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCcccCCCCCccccchhhhhcC Q lcl|NC_014636. 241 PIAEHNKPAPAPVDMKNVFDDLESF 265 (265) Q Consensus 241 p~~~~~~~~~~~~~~~~~f~~~~~~ 265 (265) |+++|..|.|+++ +|||||+|| T Consensus 241 p~~~~~~~~~~~~---~~fdd~~~~ 262 (262) T protein:vir:10 241 PIAEHKPTKPAPV---SAFDDLESF 262 (262) T ss_pred cccccCCCCCCCC---ChhhhhhcC Confidence 9999998877765 999999999 No 2 >protein:vir:6890 Length: 254 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861866;genbank:gi:32453657;genbank:GeneID:1494292 Probab=100.00 E-value=3.1e-132 Score=741.53 Aligned_cols=249 Identities=47% Similarity=0.853 Sum_probs=238.5 Q ss_pred CcccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKF 80 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF 80 (265) |+|||+|||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++||| T Consensus 1 m~~~d~~lfa~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (254) T protein:vir:68 1 MATYDKNLFAKLENRGGYSQTNETEILNPFVNFNNYENSQTLADVLVAESIQMRGIECFYVPREYVAVDLIFGEDLKNKF 80 (254) T ss_pred CcccccceeeeecCCcchhhhhhccccceeEEeeccCchhHHHHHHHHHHHHHcCceEEEechhhhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhh Q lcl|NC_014636. 81 KKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIK 160 (265) Q Consensus 81 ~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ 160 (265) ++||+||||++||+||+|+++||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+||||++|||| T Consensus 81 ~ka~~ieaYl~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ 160 (254) T protein:vir:68 81 TKAWKFAAYLNSFEGYEGAKSFFSNFGMQVQDEVTLSINPGLFKHQVNNQEPKEGDLIYFPMDNSLFEINWVEPYDPFYQ 160 (254) T ss_pred ccceeEEEeeehhhccCCcccchhhcCceecceEEEEEcCchhhhhcCCCCCccccEEEEcCCCceEEEeccCCCCchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCceEEEEEEEEeecCCccccCCcccccccc----ccchhhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccC Q lcl|NC_014636. 161 RERLAKYRITAQKFIYSGEEIKPEFDPNRYVL----DDTDPFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENII 236 (265) Q Consensus 161 ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i~----~~~~~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~ 236 (265) +||+|||+|+|+||+||||+|+|++++||+|. ++. +++||++|||++||+++|++|.++|++||++||+||+||| T Consensus 161 ~GKn~~~~l~ce~F~Ys~E~idt~i~~id~I~~~e~~~l-dl~~i~~ldG~~di~~~~~~E~~~~~~e~~~f~e~~~~vn 239 (254) T protein:vir:68 161 VGKNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSDL-ELNPVRNLDGIHDINIDEYSEVEQINSEASEYVEPYVVVN 239 (254) T ss_pred hCCceEEEEEEEEEeeCCccccCCCcccCCccCcccCCc-chhhHhhhcchhhccccchhhHHHHHhhhhhhcccceeec Confidence 99999999999999999999999999999993 233 4789999999999999999999999999999999999999 Q ss_pred CCCCCCCCCcccCCCCCccccchhh-hhc Q lcl|NC_014636. 237 GTGTPIAEHNKPAPAPVDMKNVFDD-LES 264 (265) Q Consensus 237 ~~g~p~~~~~~~~~~~~~~~~~f~~-~~~ 264 (265) |||+|. +|||| |-| T Consensus 240 ~~g~~~--------------~pf~~~~~~ 254 (254) T protein:vir:68 240 NRGRQN--------------SPFDDGFMN 254 (254) T ss_pred CCCCCC--------------CcccccccC Confidence 999652 44443 223 No 3 >protein:vir:98258 Length: 248 # NCBI annotation: gp14 head completion # Family: family:all:1104 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239191;genbank:gi:66391666;genbank:GeneID:3416360 Probab=100.00 E-value=1.6e-131 Score=737.71 Aligned_cols=245 Identities=47% Similarity=0.867 Sum_probs=241.6 Q ss_pred CcccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKF 80 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF 80 (265) |+|||+|||||||+++||++|+++++||||||+|+|++||+|+++||+|||||||+|||||||++|++|+||||+++||| T Consensus 1 m~~~d~~lfa~l~~~~gy~~t~~~~~lnPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (248) T protein:vir:98 1 MQNWDESLFAQLSTGEGVDRNLKDQVTNPYVNWYKYNPTQQLHDSLTAESIQMKSPDMYYVRREFVNIDKILGEDRESKF 80 (248) T ss_pred CcccccceeeEecCCcchhhhhhcccccceeeccccCchhHHHHHHHHHHHHHcCceEEEcchhhhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhh Q lcl|NC_014636. 81 KKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIK 160 (265) Q Consensus 81 ~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ 160 (265) ++||+||||++||+||+|+|+||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+ +|||| T Consensus 81 ~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~~rP~EGDLIYfPm~n~LFEI~~VE~-dPFYQ 159 (248) T protein:vir:98 81 TKSWKIAAYIESYANYEGQRDFFSKFGLSSNDEMTLVLNPRLFAHQTDGGIPVLGDLVYFPMDNSLFEITWVEA-DPFYQ 159 (248) T ss_pred ccceeEEEeeehhhccCCccceeeecCceecceEEEEEccchhhhcCCCCCCccccEEEEcCCCceEEEEecCC-Cchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998 59999 Q ss_pred hCCceEEEEEEEEeecCCccccCCccccccccccch---hhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCC Q lcl|NC_014636. 161 RERLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTD---PFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIG 237 (265) Q Consensus 161 ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~---~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~ 237 (265) +||+|||+|+|+||+||||+|+|+++++|+|+.+.+ +++++++|||++||+++|++|.+++++||++|++||+|||+ T Consensus 160 ~Gkn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~~iDl~~i~~ldg~~Di~~~~~~e~~~~~~E~~~~~~~~~vin~ 239 (248) T protein:vir:98 160 FGDRPQRKINLAKFIYTGEELAPELQRNEGIHIEPDAELDLEPIRNLDGLADINIEQYEEDKEFEREGDEFIESFDVVNG 239 (248) T ss_pred hCCCeEEEEEEEEeeeCCccccccCCCccccCCCCCcchhHHHhhcCccccccCcccccchhhhhhhhhhhhcccceecC Confidence 999999999999999999999999999999988777 47899999999999999999999999999999999999999 Q ss_pred CCCCCCCCc Q lcl|NC_014636. 238 TGTPIAEHN 246 (265) Q Consensus 238 ~g~p~~~~~ 246 (265) ||+|++..| T Consensus 240 ~g~~~~~~~ 248 (248) T protein:vir:98 240 RGSPFATLP 248 (248) T ss_pred cCCCccCCC Confidence 999999887 No 4 >protein:vir:80995 Length: 246 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469495;genbank:gi:157311452;genbank:GeneID:5602161 Probab=100.00 E-value=8.7e-131 Score=733.62 Aligned_cols=243 Identities=47% Similarity=0.831 Sum_probs=237.4 Q ss_pred ccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccccc Q lcl|NC_014636. 3 ARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKK 82 (265) Q Consensus 3 ~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~ 82 (265) -||+|||||||+|+||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++|||++ T Consensus 1 ~~~~~lfa~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~k 80 (246) T protein:vir:80 1 MFDSTLFARLESQKDYENTRQTEILNPYVNFNSHTNTQTLADIMVAESIQMRGVEMYYIPREFVKPDMIFGEDVQSKFTK 80 (246) T ss_pred CCcccceeeecCCcchhhhcccccccceeeeCCCCchhhHHHHHHHHHHHHcCceEEEechhhhcccccccccccccccc Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhC Q lcl|NC_014636. 83 AYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRE 162 (265) Q Consensus 83 a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~G 162 (265) ||+||||++||+||+|+|+||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++||||+| T Consensus 81 a~~ieaYl~s~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~~~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (246) T protein:vir:80 81 AWKFAAYINSFDGYEGAGNFFQSFGYTANDELTITINPNLFKHQVDNKEPKSGDLFYIPMSNDLFEISYVEPYQPFFQAG 160 (246) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEccchHhhhhCCCCCccccEEEEcCCCceEEEecccCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEeecCCccccCCccccccccccch---hhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCC Q lcl|NC_014636. 163 RLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTD---PFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTG 239 (265) Q Consensus 163 k~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~---~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g 239 (265) |+|||+|+|+||+||||+|+|+++++|+|..+.. +++++++|||++||+++|++|.+++++||++||+||+|||||| T Consensus 161 Kn~v~~l~ce~F~Ys~E~~~t~i~~~d~I~~de~~~ldl~~i~nldg~~Din~~~~~e~~~~~~e~~~f~~~~~~~~~~g 240 (246) T protein:vir:80 161 KNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQYKEDKQFRSEGQDFIDPFDPINGKG 240 (246) T ss_pred CCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCcccchhhhhhhcchhhhcccceeecCCC Confidence 9999999999999999999999999999964333 3679999999999999999999999999999999999999999 Q ss_pred CCCCCC Q lcl|NC_014636. 240 TPIAEH 245 (265) Q Consensus 240 ~p~~~~ 245 (265) ||++++ T Consensus 241 spf~~~ 246 (246) T protein:vir:80 241 SPFADF 246 (246) T ss_pred CccccC Confidence 888887 No 5 >protein:vir:6590 Length: 246 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891721;genbank:gi:33620667;genbank:GeneID:1725307 Probab=100.00 E-value=1.6e-130 Score=732.12 Aligned_cols=243 Identities=50% Similarity=0.869 Sum_probs=237.4 Q ss_pred ccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccccc Q lcl|NC_014636. 3 ARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKK 82 (265) Q Consensus 3 ~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~ 82 (265) -||+|||||||+|+||++|++++|||||||+|||.+||+|+++||+|||||||+|||||||++|++|+||||+++|||++ T Consensus 1 ~~~~~lfa~l~~~~gy~~~~~~~~lNPYfN~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~k 80 (246) T protein:vir:65 1 MFNNTLFARLESQKDYENTRQTEILNPYVNFHKYTNTQTLADVMVAEAIQMRGVELYYIPREFVKPDMIFGEDVQSKFTK 80 (246) T ss_pred CCcccceeeecCCcchhhhcccccccceeecCCCcchhhHHHHHHHHHHHHcCceEEEechhhccccccccccccccccc Confidence 69999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhC Q lcl|NC_014636. 83 AYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRE 162 (265) Q Consensus 83 a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~G 162 (265) ||+||||++||+||+|+|+||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++||||+| T Consensus 81 a~~ieaYl~~~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~~~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (246) T protein:vir:65 81 AWKFAAYINSFDGYEGAGNFFSSFGYQANDELTFTVNPNLFKHQVDDQEPKSGDLIYIPMSNDLFEINYVEPYQPFFQAG 160 (246) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhcCCCCCccccEEEEcCCCcEEEEecccCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEeecCCccccCCccccccccccch---hhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCC Q lcl|NC_014636. 163 RLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTD---PFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTG 239 (265) Q Consensus 163 k~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~---~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g 239 (265) |+|||+|+|+||+||||+|+|+++++|+|..+.. +++++++|||++||+++|++|.+++++||++||+||+|||||| T Consensus 161 Kn~v~~l~ce~F~Ys~E~~~~~i~~~d~I~~de~~~ldl~~i~~ldg~~Din~~~~~e~~~~~~e~~~f~~~~~~~~~~g 240 (246) T protein:vir:65 161 KNAMRKIIAEKFVYSGEELRPELQRNEGINVDEFADLDLAPITNLDGLTDINLDQYKEDKQFRSEGQDFIDPFDPINGKG 240 (246) T ss_pred CCeEEEEEEEEEeecCcccccCCCCccccCCCCcccccHHHHhhcccccccCccchhhhhhhhcchhhhcccceeecCCC Confidence 9999999999999999999999999999964333 3679999999999999999999999999999999999999999 Q ss_pred CCCCCC Q lcl|NC_014636. 240 TPIAEH 245 (265) Q Consensus 240 ~p~~~~ 245 (265) ||++++ T Consensus 241 spf~~~ 246 (246) T protein:vir:65 241 SPFADF 246 (246) T ss_pred CccccC Confidence 888887 No 6 >protein:vir:101800 Length: 252 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238877;genbank:gi:66391952;genbank:GeneID:3416627 Probab=100.00 E-value=2e-130 Score=731.64 Aligned_cols=245 Identities=45% Similarity=0.784 Sum_probs=239.0 Q ss_pred ccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccccc Q lcl|NC_014636. 3 ARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKK 82 (265) Q Consensus 3 ~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~ 82 (265) -+|+|||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++|||++ T Consensus 1 m~d~~lfa~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~k 80 (252) T protein:vir:10 1 MMDKSLFATLENRGGYMRTNEKNILNPYVKFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEK 80 (252) T ss_pred CCCccceeEecCCcchhhhhhhccccceeeecCccchhHHHHHHHHHHHHHcCceEEEechhhccccccccccccccccc Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhC Q lcl|NC_014636. 83 AYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRE 162 (265) Q Consensus 83 a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~G 162 (265) ||+||||++||+||+|+|+||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++||||+| T Consensus 81 a~~ieaYl~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (252) T protein:vir:10 81 AWKFAAWLNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGKEPALGDLIYMPMDNSLFEITWVEPYSPFYQNG 160 (252) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhccCCCCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEeecCCccccCCccccccccccchh--hhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCC- Q lcl|NC_014636. 163 RLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTDP--FAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTG- 239 (265) Q Consensus 163 k~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~~--l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g- 239 (265) |+|||+|+|+||+||||+|+|+++++|+|++.+.. +++|++||||+||+++|++|++++++||++||+||+|||+|| T Consensus 161 kn~~~~l~ce~F~Ys~E~idt~~~~id~Ie~~~s~ldl~~i~~ldG~~Di~~~~~~e~~~~~~e~~~~~e~~~~i~~~~~ 240 (252) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKITPVVQEKPEIEDMYNGLDLAPLLNLDGMIDQKIDQFAENIAVQQKVKQYAEPFDPISTNSF 240 (252) T ss_pred CCeEEEEEEEEEeeCCceecCcccccCccchhhhccchHHHhhcCCeeccccccchhhHHHHHhhhhhhccceeecCCCC Confidence 99999999999999999999999999999887774 689999999999999999999999999999999999999999 Q ss_pred ----CCCCCCcc Q lcl|NC_014636. 240 ----TPIAEHNK 247 (265) Q Consensus 240 ----~p~~~~~~ 247 (265) ||++.|-. T Consensus 241 ~~~~~pf~~~~~ 252 (252) T protein:vir:10 241 GNFDSPFGKHEA 252 (252) T ss_pred CCcCCcccccCC Confidence 56666655 No 7 >protein:vir:101154 Length: 252 # NCBI annotation: neck protein # Family: family:all:1104 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932505;genbank:gi:37651631;genbank:GeneID:2610647 Probab=100.00 E-value=2e-130 Score=731.64 Aligned_cols=245 Identities=45% Similarity=0.784 Sum_probs=239.0 Q ss_pred ccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccccc Q lcl|NC_014636. 3 ARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKK 82 (265) Q Consensus 3 ~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~ 82 (265) -+|+|||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++|||++ T Consensus 1 m~d~~lfa~le~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~SkF~k 80 (252) T protein:vir:10 1 MMDKSLFATLENRGGYMRTNEKNILNPYVKFNKHEGTQALQDTLVAESIQMRGIEFYYLEREFTDLDLLFGEDVNSRFEK 80 (252) T ss_pred CCCccceeEecCCcchhhhhhhccccceeeecCccchhHHHHHHHHHHHHHcCceEEEechhhccccccccccccccccc Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhC Q lcl|NC_014636. 83 AYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRE 162 (265) Q Consensus 83 a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~G 162 (265) ||+||||++||+||+|+|+||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++||||+| T Consensus 81 a~~ieaYl~s~egy~G~gd~~SKFG~~~~DEvt~~Is~~rF~~qv~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (252) T protein:vir:10 81 AWKFAAWLNSFESYEGQQSFFSKFGHTQNDEIRISINPGLFKYQVNGKEPALGDLIYMPMDNSLFEITWVEPYSPFYQNG 160 (252) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhccCCCCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEeecCCccccCCccccccccccchh--hhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCC- Q lcl|NC_014636. 163 RLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTDP--FAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTG- 239 (265) Q Consensus 163 k~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~~--l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g- 239 (265) |+|||+|+|+||+||||+|+|+++++|+|++.+.. +++|++||||+||+++|++|++++++||++||+||+|||+|| T Consensus 161 kn~~~~l~ce~F~Ys~E~idt~~~~id~Ie~~~s~ldl~~i~~ldG~~Di~~~~~~e~~~~~~e~~~~~e~~~~i~~~~~ 240 (252) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKITPVVQEKPEIEDMYNGLDLAPLLNLDGMIDQKIDQFAENIAVQQKVKQYAEPFDPISTNSF 240 (252) T ss_pred CCeEEEEEEEEEeeCCceecCcccccCccchhhhccchHHHhhcCCeeccccccchhhHHHHHhhhhhhccceeecCCCC Confidence 99999999999999999999999999999887774 689999999999999999999999999999999999999999 Q ss_pred ----CCCCCCcc Q lcl|NC_014636. 240 ----TPIAEHNK 247 (265) Q Consensus 240 ----~p~~~~~~ 247 (265) ||++.|-. T Consensus 241 ~~~~~pf~~~~~ 252 (252) T protein:vir:10 241 GNFDSPFGKHEA 252 (252) T ss_pred CCcCCcccccCC Confidence 56666655 No 8 >protein:vir:7199 Length: 256 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049773;genbank:gi:9632588;genbank:GeneID:1258695 Probab=100.00 E-value=2.3e-130 Score=731.34 Aligned_cols=251 Identities=47% Similarity=0.852 Sum_probs=241.1 Q ss_pred CcccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKF 80 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF 80 (265) |+|||+|||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++||| T Consensus 1 m~~~d~~lfa~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlpRe~v~~D~i~~Ed~~skF 80 (256) T protein:vir:71 1 MATYDKNLFAKLENRTGYSQTNETEILNPYVNFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKF 80 (256) T ss_pred CcccccceeeeecCCcchhhhhhhcccceeeeeeccCchhhHHHHHHHHHHHHcCceEEEechhhccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhh Q lcl|NC_014636. 81 KKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIK 160 (265) Q Consensus 81 ~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ 160 (265) ++||+||||++||+||+|+++||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+||||++|||| T Consensus 81 ~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ 160 (256) T protein:vir:71 81 TKAWKFAAYLNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGKEPKEGDLIYFPMDNSLFEINWVEPYDPFYQ 160 (256) T ss_pred ccceeEEEEeehhhccCCccccceecCceecceEEEEEccchhhhhhcCCCCccccEEEEcCCCcEEEEEcccCCCchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCceEEEEEEEEeecCCccccCCcccccccc-ccchh--hhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCC Q lcl|NC_014636. 161 RERLAKYRITAQKFIYSGEEIKPEFDPNRYVL-DDTDP--FAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIG 237 (265) Q Consensus 161 ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i~-~~~~~--l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~ 237 (265) +||+|||+|+|+||+||||+|+|+++++|+|. ++.+. +++++++|||+|++++|++|+++|++||+.|++||++||+ T Consensus 161 ~Gkn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~e~~eldl~~i~~ldg~~di~~~~~~e~~~i~~e~~~~ve~~~~in~ 240 (256) T protein:vir:71 161 LGQNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNAVRNLNGIHDINIDQYAEVDQINSEAKEYVEPYVVVNN 240 (256) T ss_pred hCCCeEEEEEEEEEecCCceecccCCCCCcccCCcccccccccccccCcccccccccccchhhhhhccccccccceeecC Confidence 99999999999999999999999999999995 34443 5789999999999999999999999999999999999999 Q ss_pred CCCCCCCCcccCCCCCccccchhhhhcC Q lcl|NC_014636. 238 TGTPIAEHNKPAPAPVDMKNVFDDLESF 265 (265) Q Consensus 238 ~g~p~~~~~~~~~~~~~~~~~f~~~~~~ 265 (265) ||+|.. ++|||| .| T Consensus 241 ~G~~~~------------~~pfd~--~~ 254 (256) T protein:vir:71 241 RGKSFE------------SSPFDN--DF 254 (256) T ss_pred CCCCCc------------CCCccc--cc Confidence 999865 356664 23 No 9 >protein:vir:107937 Length: 257 # NCBI annotation: gp14 neck protein # Family: family:all:1104 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595290;genbank:gi:161622596;genbank:GeneID:5783656 Probab=100.00 E-value=4.2e-130 Score=729.90 Aligned_cols=253 Identities=48% Similarity=0.810 Sum_probs=238.9 Q ss_pred CcccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKF 80 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF 80 (265) |+|||+|||||||+++||++|++++|||||||++||.+||+|+++||+|||||||+|||||||++|++|+||||+++||| T Consensus 1 m~~~d~~lfa~le~~~gy~~t~~~~~lnPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (257) T protein:vir:10 1 MATFDSSLFAKLENNTGYANTNETEIMNPFVNFYRHENTQTLADALVAESIQMRGIELYYIPREYVNPDQLFGEDLQNKF 80 (257) T ss_pred CcccccceeeeecCCcchhhhhhhccccceeecccCCchhHHHHHHHHHHHHHcCceEEEcchhhhcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhh Q lcl|NC_014636. 81 KKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIK 160 (265) Q Consensus 81 ~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ 160 (265) ++||+||||++||+||+|+++||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++|||| T Consensus 81 ~ka~~ieaYl~s~egy~G~~d~~SKFG~~v~DEvt~~Is~~rF~~~v~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ 160 (257) T protein:vir:10 81 TKAWKFAGYLDSFEGYSGDNTYFSKFGMMVNDEVTITINPNLFKHQCNGTEPVSGDLIYFPMDNSLFEINWVQPYDPFYQ 160 (257) T ss_pred ccceeEEEeeehhhccCCCcceeeecCceecceEEEEEccchhhhhccCCCCccccEEEEcCCCceEEEecccCCCchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCceEEEEEEEEeecCCccccCCccccccc-cccch--hhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCC Q lcl|NC_014636. 161 RERLAKYRITAQKFIYSGEEIKPEFDPNRYV-LDDTD--PFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIG 237 (265) Q Consensus 161 ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i-~~~~~--~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~ 237 (265) +||+|||+|+|+||+||||+|.+++++++++ .++.+ +|+++++||||+||+++|++|.+++++||++||+||+|||| T Consensus 161 ~Gkn~~~~l~ce~F~Ys~E~l~pel~~n~~~~V~e~~eldl~~~~~ldG~~di~~~~~~E~~~~~~e~~~fi~p~~~~n~ 240 (257) T protein:vir:10 161 VGTNVQRRITATKFIYNGEELRPELQRNEGINIPEFSELDLMPVKNIDGLADISDIQYEEVNEINAEAAEFVHPYVVING 240 (257) T ss_pred hCCceEEEEEEEEeeeCCcccccccCCcccCCCCCccchhhhhhhhccchhhcCCchhhhHHHHHHhhhhhhccccccCC Confidence 9999999999999999999999999998777 33343 37899999999999999999999999999999999999999 Q ss_pred CCCCCCCCcccCCCCCccccchhh-hhcC Q lcl|NC_014636. 238 TGTPIAEHNKPAPAPVDMKNVFDD-LESF 265 (265) Q Consensus 238 ~g~p~~~~~~~~~~~~~~~~~f~~-~~~~ 265 (265) ||++.. | +|||| |-|= T Consensus 241 ~g~~~~--~----------~pf~~~~~~~ 257 (257) T protein:vir:10 241 RGEDAP--P----------TAFDDAFLDD 257 (257) T ss_pred CCCCCC--C----------CcccchhccC Confidence 997543 2 45553 1111 No 10 >protein:vir:103452 Length: 256 # NCBI annotation: head completion # Family: family:all:1104 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803104;genbank:gi:116326384;genbank:GeneID:4405481 Probab=100.00 E-value=3.9e-130 Score=730.08 Aligned_cols=251 Identities=47% Similarity=0.847 Sum_probs=241.2 Q ss_pred CcccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKF 80 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF 80 (265) |+|||+|||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++||| T Consensus 1 m~~~d~~lfa~l~~~~gy~~~~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF 80 (256) T protein:vir:10 1 MATYDKNLFAKLENHTGYSQTNETEILNPYVNFNHYKNSQILADVLVAESIQMRGVECYYVPREYVSPDLIFGEDLKNKF 80 (256) T ss_pred CcccccceeeeecCCcchhhhhhhcccceeeeeeccCchhhHHHHHHHHHHHHcCceEEEechhhccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhh Q lcl|NC_014636. 81 KKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIK 160 (265) Q Consensus 81 ~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ 160 (265) ++||+||||++||+||+|+++||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+||||++|||| T Consensus 81 ~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~v~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ 160 (256) T protein:vir:10 81 TKAWKFAAYLNSFEGYEGAKSFFSNFGMQVQDEVTLSINPNLFKHQVNGKEPKEGDLIYFPMDNSLFEINWVEPYDPFYQ 160 (256) T ss_pred ccceeEEEEeehhhccCCccccceecCceecceEEEEEcCchhhhhccCCCCccccEEEEcCCCcEEEEEeccCCCchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCceEEEEEEEEeecCCccccCCccccccccc-cchh--hhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCC Q lcl|NC_014636. 161 RERLAKYRITAQKFIYSGEEIKPEFDPNRYVLD-DTDP--FAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIG 237 (265) Q Consensus 161 ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i~~-~~~~--l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~ 237 (265) +||+|||+|+|+||+||||+|+|+++++|+|.. +... +.++++|||++|++++|++|++++++||+.|++||++||+ T Consensus 161 ~Gkn~~~~l~ce~F~Ys~E~~~~~l~~~d~i~~~~~~eldl~pi~~ldG~~di~~~~~~e~~~~~~e~~~~v~~~~~in~ 240 (256) T protein:vir:10 161 LGQNAIRKITAGKFIYSGEEINPVLQKNEGINIPEFSELELNPVRNLNGIHDINIDQYAEVDQINSEAKEYVEPYVVVNN 240 (256) T ss_pred hCCCeEEEEEEEEEeeCCceecccCCCCccccCCccchhhhccccccccccccCccccccchhhhhccccccccceEecC Confidence 999999999999999999999999999999963 4443 4789999999999999999999999999999999999999 Q ss_pred CCCCCCCCcccCCCCCccccchhhhhcC Q lcl|NC_014636. 238 TGTPIAEHNKPAPAPVDMKNVFDDLESF 265 (265) Q Consensus 238 ~g~p~~~~~~~~~~~~~~~~~f~~~~~~ 265 (265) ||+|.. ++|||| .| T Consensus 241 ~G~~~~------------~~pfd~--~~ 254 (256) T protein:vir:10 241 RGKSFE------------SSPFDN--DF 254 (256) T ss_pred CCCCCc------------CCCccc--cc Confidence 999865 356664 23 No 11 >protein:vir:5658 Length: 278 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899597;genbank:gi:34419584;genbank:GeneID:2545699 Probab=100.00 E-value=5.3e-130 Score=729.34 Aligned_cols=262 Identities=42% Similarity=0.711 Sum_probs=251.8 Q ss_pred Cccccc---ceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheecccccccccccc Q lcl|NC_014636. 1 MFARNQ---SMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQ 77 (265) Q Consensus 1 m~~~~~---~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~ 77 (265) |+|||+ |||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||+|+++ T Consensus 1 m~~~d~~~~~lfa~l~~~~gy~~~~~~~~~NpYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~ 80 (278) T protein:vir:56 1 MGAYDTNTGGMFAKLESQKGYDQIYNNHLVNPYFNWVNHTNEQNLTDMLVAESIINRGVECVYLRREMEKVDLVFGEDPM 80 (278) T ss_pred CccccccCceEEEEecCCcccchhcccccccceeeccCCCchhHHHHHHHHHHHHHcCceEEEcchhhhccccccccccc Confidence 999999 79999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCCh Q lcl|NC_014636. 78 SKFKKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSP 157 (265) Q Consensus 78 skF~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~P 157 (265) |||++||+|||||+||+||+|+++||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++| T Consensus 81 skF~ka~~ieaYl~~~egy~G~~d~~SKFG~~~~DEvt~~Is~~rF~~~~~~~rP~EGDLIYfPm~n~LFEI~~VE~~~P 160 (278) T protein:vir:56 81 SKFTQNFRMSLYVESFEGWDGDGDWYSKFGFQVNDEMNVCINPKLFAQQGDGKQPLMGDLIYFPLANSLFEISWIEREDP 160 (278) T ss_pred cccccceeEEEEeehhhccCCCceeeeecCceecceEEEEEccchhhhcCCCCCCccccEEEEcCCCcEEEEEccCCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCceEEEEEEEEeecCCccccCC-----ccccccccc---cch---hhhhhhhhhhhhhcccchhcchhhhhhccc Q lcl|NC_014636. 158 MIKRERLAKYRITAQKFIYSGEEIKPE-----FDPNRYVLD---DTD---PFAQVKALDGRMDINLDEFEEDDAIDVEAD 226 (265) Q Consensus 158 FyQ~Gk~~vy~l~ce~F~YS~E~i~t~-----~~eid~i~~---~~~---~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~ 226 (265) |||+||+|||+|+|+||+||||+|+|+ +++++.|.+ +.+ ++.++++|||++|++++|++|.+++++||+ T Consensus 161 FYQ~Gk~~~~~l~ce~F~Ys~E~~dtg~pe~~~d~i~~i~ef~e~~~~~ld~~~~~~l~G~~di~~~~~~e~~~~~~e~~ 240 (278) T protein:vir:56 161 WYMNGVLPMRKMKMTKFVYSGEEINLEKPEAVIDSIDDILNFGEDTDEMIDIDKINALDGRWDIGIEQGAEITQIEDEVD 240 (278) T ss_pred hhhhCCceEEEEEEEEeeecCceecccCCccccccccchhhcccchhhcccccccccccchhcccchhhhhhhhhhhccc Confidence 999999999999999999999999987 677777633 111 356899999999999999999999999999 Q ss_pred chhhcccccCCCCC--CCCCCcccCCCCCccccchhhh Q lcl|NC_014636. 227 TFIEEFENIIGTGT--PIAEHNKPAPAPVDMKNVFDDL 262 (265) Q Consensus 227 ~~~~~f~~~~~~g~--p~~~~~~~~~~~~~~~~~f~~~ 262 (265) .|++||++++++|+ |++++++|+|.+|+++|||||| T Consensus 241 ~f~~~~~v~~~~~~~~~t~~~n~~~g~~v~~~~~~D~f 278 (278) T protein:vir:56 241 KFYESEQVVPSGSDVQPTDPRNATIGFNVNNSNPFDSF 278 (278) T ss_pred eeeecCceecCCCCccccCcccccCCCcCccccccccC Confidence 99999999998775 9999999999999999999999 No 12 >protein:vir:100535 Length: 253 # NCBI annotation: gp14 head completion protein # Family: family:all:1104 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656376;genbank:gi:109290127;genbank:GeneID:4156513 Probab=100.00 E-value=4.7e-128 Score=718.65 Aligned_cols=251 Identities=44% Similarity=0.772 Sum_probs=241.2 Q ss_pred ccccceeeeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheeccccccccccccccccc Q lcl|NC_014636. 3 ARNQSMFAQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKK 82 (265) Q Consensus 3 ~~~~~lfa~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~ 82 (265) -+|+|||||||+++||++|++++|||||||++||++||+|+++||+|||||||+|||||||++|++|+||||+++|||++ T Consensus 1 ~~~~~lfa~l~~~~gy~~t~~~~~lNPYfn~~~~~~eQ~L~e~LV~EsIqm~GvdvyYlPRe~v~~D~i~~Ed~~skF~k 80 (253) T protein:vir:10 1 MMDKSLFATLENRSGYQQTNEQNILNPYVKFNRYEGSQALHDTLVAESIQMRGLEFYYLEREYTNLDLLFGEDPNSRFEK 80 (253) T ss_pred CcCccceeEecCCcchhhhhhhccccceeeccccCchhHHHHHHHHHHHHHcCceEEEcchhhccccCcccccccccccc Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhC Q lcl|NC_014636. 83 AYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRE 162 (265) Q Consensus 83 a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~G 162 (265) ||+||||++||+||+|+++||||||||++|||||+|||+||+++++++||+|||||||||+|+||||+|||+++||||+| T Consensus 81 a~~ieaYl~~~egy~G~gd~~SKFG~~v~DEvt~~Is~~rF~~~v~~~rP~EGDLIYfPm~n~LFEI~~VE~~~PFYQ~G 160 (253) T protein:vir:10 81 AWKFAAWLNSFESYEGQQSFFSKFGHQQNDEVRISINPGLFKYQVNGKEPKLGDLIYMPMDNSLFEITWVEPYTPFYQMG 160 (253) T ss_pred ceeEEEeeehhhccCCccceeeecCceecceEEEEEcCchhhhhccCCCCccccEEEEcCCCcEEEEeccCCCCchhhhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEeecCCccccCCccccccccccch--hhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCCC Q lcl|NC_014636. 163 RLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTD--PFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTGT 240 (265) Q Consensus 163 k~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~--~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g~ 240 (265) |+|||+|+|+||+||||+|+|++++||+|++.++ ++++|++|||++||+++|++|+++++.||++||+||+|++++| T Consensus 161 kn~~~~l~ce~F~Ys~E~i~tgi~~id~Ie~~~~~ldl~~i~~l~G~~Di~~~~~~e~~~~~~e~~~~v~~~~~~~~~~- 239 (253) T protein:vir:10 161 KNPIRVIVAQKFIYSGEKLAPQFQEKPEIEDQYNGLDLEPILNLDGFIDQKINEFGENVQAQNEARPFVEPFDPISTNP- 239 (253) T ss_pred CceEEEEEEEEEecCCccccccCcccccccchhhhhhhhhhhcCCCccccccccccccchhhhccccccccceeccCCC- Confidence 9999999999999999999999999999998766 4579999999999999999999999999999999999997744 Q ss_pred CCCCCcccCCCCCcc Q lcl|NC_014636. 241 PIAEHNKPAPAPVDM 255 (265) Q Consensus 241 p~~~~~~~~~~~~~~ 255 (265) ++.+++||+..-.- T Consensus 240 -~~g~~spf~~~~~~ 253 (253) T protein:vir:10 240 -VNSFNSPFGRHEGQ 253 (253) T ss_pred -CccccCcccccCCC Confidence 77777777654433 No 13 >protein:vir:104476 Length: 308 # NCBI annotation: gp14 # Family: family:all:1104 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214650;genbank:gi:61806291;genbank:GeneID:3294531 Probab=100.00 E-value=4.7e-107 Score=603.51 Aligned_cols=251 Identities=26% Similarity=0.414 Sum_probs=215.6 Q ss_pred CcccccceeeeecCCCCccCcchhc-----cccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheecccccccccc Q lcl|NC_014636. 1 MFARNQSMFAQLETGAGYNKTYQES-----VLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGED 75 (265) Q Consensus 1 m~~~~~~lfa~le~~~gy~~~~~~~-----~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~ 75 (265) |. |. +++.+.+ .+|||||+|||++||+|+|+||+|||||||+|||||||++|++|+||+|+ T Consensus 1 ~~-~~-------------~~~~~py~~~~~~~~~~~n~~~~~~eQ~L~e~LV~EsIqm~G~dvyYlpRe~v~~D~i~~Ed 66 (308) T protein:vir:10 1 MA-IQ-------------NSPAQDYVQSDYSNAGRLKANASSQEQKFIENLVVESIEIYGQDIYYVPRTIVNRDSVFEED 66 (308) T ss_pred Cc-cc-------------cCCCCCcccccccccceEEEeccCchhHHHHHHHHHHHHhcCceEEEechhhcccccccccc Confidence 21 11 2222222 34666799999999999999999999999999999999999999999999 Q ss_pred cccccccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcC-------CCCccccEEEEcCCCcEEE Q lcl|NC_014636. 76 LQSKFKKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDG-------ERAKEGDLIYFPMNNSLFE 148 (265) Q Consensus 76 ~~skF~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~-------~~P~EGDLIYfPm~n~LFE 148 (265) ++|||++||+|||||+|||||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+|+||| T Consensus 67 ~~skF~~a~~ieaY~~~~egy~g~~~~~SKFG~~~~DE~t~~is~~rF~~~v~~~~~~~~~~rP~EGDLIYfPl~~~lFE 146 (308) T protein:vir:10 67 SDGKFESAKAIRAYVNNVEGWEGQGELLSKFGIRIEDKTTFIFSREKFKEHVDDSVTLNVEGRPNEGDLIWFPITKHLFE 146 (308) T ss_pred cccccccceeEEEEeechhccCCCcceeeecCceecceEEEEEccchhhhhcCCccccccCCCCccccEEEecCCCceEE Confidence 999999999999999999999999999999999999999999999999999988 4999999999999999999 Q ss_pred EeeeccCChhhhhCCceEEEEEEEEeecCCccccCCccccccccccch---hhhhhhhhhhhhhcccchhcchhhhhhcc Q lcl|NC_014636. 149 ITWVEPTSPMIKRERLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTD---PFAQVKALDGRMDINLDEFEEDDAIDVEA 225 (265) Q Consensus 149 I~~VE~~~PFyQ~Gk~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~---~l~~i~~lDg~~d~~~~q~~E~~~~~~e~ 225 (265) ||||||++||||+||+|||+|+|+||+||||+|+|++++||+|+.+.+ .+.++++++|++++++.++++.+++..|+ T Consensus 147 I~~VE~~~PFyQ~Gk~~~~~l~ce~F~Ys~E~~~~~i~~iD~i~~~~~~~LdL~pIs~l~G~fdInE~v~gest~itAEv 226 (308) T protein:vir:10 147 IKFVEVERPFYQLGRNYVWECQCELFEYSDEEINTGITELDAIETAFANAITVGLVAGGTGTFTVGETITGGTSNVTAEV 226 (308) T ss_pred EEcccCCCchhhhCCceEEEEEEEEEeeCCcccccCCccccccccccccceeeeeeccCCccccccceecccccceEEEE Confidence 999999999999999999999999999999999999999999987765 35689999999999999999999999999 Q ss_pred cch---hhcccccCCCCCCCCCCc-----ccCCC-------CCcc----------------------ccchhhhhcC Q lcl|NC_014636. 226 DTF---IEEFENIIGTGTPIAEHN-----KPAPA-------PVDM----------------------KNVFDDLESF 265 (265) Q Consensus 226 ~~~---~~~f~~~~~~g~p~~~~~-----~~~~~-------~~~~----------------------~~~f~~~~~~ 265 (265) ..+ +++..|+|++|++..+.. +|... ..+. .|||-.+-|| T Consensus 227 ~~wds~v~~ItV~N~gGsftspptItGsts~a~~t~~s~~~~~nt~~~~~~n~~fet~~D~iiDftE~NPFG~~g~~ 303 (308) T protein:vir:10 227 KSFDASTRTLIVINRSGTFTVPETVTGGTSSASWTTATYNTIDNQNLDYDQNNDFETLDNQIIDFTEANPFGSVGSI 303 (308) T ss_pred EEecCCceEEEEEeCCCceeeCcEEEeccCCceeEEEeeeecccCCCcccCCcceeeccCcEEeeccCCCCcccccc Confidence 988 677888899996443222 11111 0111 2344444444 No 14 >protein:vir:106986 Length: 292 # NCBI annotation: neck protein gp14 # Family: family:all:1104 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195128;genbank:gi:58532905;uniprot:Q5GQV9;genbank:GeneID:3260483 Probab=100.00 E-value=3.1e-102 Score=577.12 Aligned_cols=239 Identities=24% Similarity=0.435 Sum_probs=212.5 Q ss_pred cccccccc--ccccchhHHHHHHHHHHHHHhcCcceEeeeheecccccccccccccccccceeeeeccchhhccCccchh Q lcl|NC_014636. 25 SVLNPYVN--KHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKKAYKVAMYLESFEEYSGQRDF 102 (265) Q Consensus 25 ~~~npYfN--~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~a~~IeaY~~n~egf~g~gd~ 102 (265) --|||||| +|||.+||+|+|+||+|||||||+|||||||++|+ |+||+|+++|||++||+|||||+|||||+|+++| T Consensus 1 m~~npyfn~~~~~~~~eQ~L~~~LV~Esiq~~G~dvyYlpRe~~~-d~~~~E~~~skF~~a~~ieaY~~~~eg~~g~~~~ 79 (292) T protein:vir:10 1 MPTSPYFPSYYSGYSGEQNLVQDLVDEQIKLFGTDIYYLPRTILR-DNTLDDVIYNKFERQFQVEMLLQNVEGFGSPSEF 79 (292) T ss_pred CCcCccccccccCcCchhHHHHHHHHHHHHhcCceEEEechhhhc-ccccccccccccccceeEEEEeechhccCCCcce Confidence 56999999 68999999999999999999999999999999999 9999999999999999999999999999999999 Q ss_pred hhhcCceeeceEEEEEcchhhhhhhcC------CCCccccEEEEcCCCcEEEEeeeccCChhhhhCCceEEEEEEEEeec Q lcl|NC_014636. 103 FSKFGMQVNDEVSFTVSPKLFEHQTDG------ERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRERLAKYRITAQKFIY 176 (265) Q Consensus 103 ~SKFG~~~~DE~tf~IS~~~F~~~~~~------~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~Gk~~vy~l~ce~F~Y 176 (265) |||||||++|||||+|||+||++++++ +||||||||||||+|+||||+||||++||||+||+|||+|+|+||+| T Consensus 80 ~sKFG~~~~De~t~~is~~~f~~~~~~~~~~~~~~P~eGDLIYfPl~~~lFEI~~ve~~~PfyQ~gk~~~~~l~~~~F~Y 159 (292) T protein:vir:10 80 ISKFGLRITDEVRFIVSQRRWDEEAVNYDLNVNGRPNEGDLLYFPLTQDIYEIKFVEREDPFYQLGKNYFYIMTAEIYEY 159 (292) T ss_pred eeecCceecceEEEEEccchhhhhcCcccccccCCCccccEEEEcCCCcEEEEEcccCCCchhhhCCceEEEEEEEEEee Confidence 999999999999999999999999987 89999999999999999999999999999999999999999999999 Q ss_pred CCccccCCccccccccccch---hhhhhhhhhhhhhcccchhcchhhhhhcccchhh---cccccCCCCCCCCCCcccC- Q lcl|NC_014636. 177 SGEEIKPEFDPNRYVLDDTD---PFAQVKALDGRMDINLDEFEEDDAIDVEADTFIE---EFENIIGTGTPIAEHNKPA- 249 (265) Q Consensus 177 S~E~i~t~~~eid~i~~~~~---~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~---~f~~~~~~g~p~~~~~~~~- 249 (265) |+|+|+|+++++|+|..+.. .++++++++|.++++..+..+.+.+..+|..+.+ +..|+|++|+.-. ..... T Consensus 160 s~E~idtgl~eiD~i~~~~sseLdL~pi~~g~G~f~inE~vtge~sg~~AEv~sw~~~t~~L~V~n~~GsF~T-~e~i~G 238 (292) T protein:vir:10 160 GSDNISTGVEEIDELETLFSSAIAIALSIGGTGDFDLGEIVTGGISGTEAEVKSWDSSSRILQVINRTGTFEE-GESVTG 238 (292) T ss_pred cCceecCCCCcccccccccccccceeecccCCccccCCceeeecccceEEEEEEccCCCceEEEEeCcccccc-CceeeE Confidence 99999999999999977664 3467899999999999999999999999999866 5668999986311 00000 Q ss_pred ---------------CCCC-------------------ccccchhhhhcC Q lcl|NC_014636. 250 ---------------PAPV-------------------DMKNVFDDLESF 265 (265) Q Consensus 250 ---------------~~~~-------------------~~~~~f~~~~~~ 265 (265) +.+. +-.|||-.+-|| T Consensus 239 ~~Sga~~~v~si~~~~g~~~t~a~~~~iEt~~d~i~df~e~npfg~~~~~ 288 (292) T protein:vir:10 239 NDSGSVWVVDSFDTLNNTNSEYDQNREIESTADTIIDWSESNPFGEYGNF 288 (292) T ss_pred eecCCeEEeeEEEEeCCCCCcccccceeccccCcEEeeccCCcCcccccc Confidence 0111 123455555555 No 15 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=100.00 E-value=7.6e-92 Score=520.14 Aligned_cols=229 Identities=26% Similarity=0.408 Sum_probs=186.7 Q ss_pred eeecCCCCccCcchhccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheecccccccccccccccccceeeeec Q lcl|NC_014636. 10 AQLETGAGYNKTYQESVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKKAYKVAMY 89 (265) Q Consensus 10 a~le~~~gy~~~~~~~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~a~~IeaY 89 (265) -...+.+|||++++++++|||||+|||.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||++||+|||| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y~~r~~~~~d~~~~e~~~~~f~~~~~~~~y 80 (390) T protein:vir:10 1 MTYSNDPPNNCIQSDYTSSCRLNLNGSAQEQTFMENLIVESIELYGQNVYYLPRIYVNRDTILNEVETSRFEQALSVRAY 80 (390) T ss_pred CeecCCCcccceecceeeccEEEEeccCchhHHHHHHHHHHhHhcCceEEEechheeccccccccccccccccceEEEEE Confidence 34568899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcC-------CCCccccEEEEcCCCcEEEEeeeccCChhhhhC Q lcl|NC_014636. 90 LESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDG-------ERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRE 162 (265) Q Consensus 90 ~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~-------~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~G 162 (265) |+|||||+|+++||||||||++|||||+|||+||++++++ +||+|||||||||+++||||+||||++||||+| T Consensus 81 ~~~~~~~~~~~~~~skfg~~~~de~~~~~~~~~~~~~~~~~~~~~~~~~p~egdliy~p~~~~lfei~~ve~~~p~yq~G 160 (390) T protein:vir:10 81 VNNVEGWEGQGDLLSKFGVRIEDKTTFIFSRKKFTTAVDDNAVLNVEGRPNEGDLIWFPATRHLFEIKFVEAERPFYQLG 160 (390) T ss_pred eechhccCCccceeeecCceecceEEEEECCcchhhhhCCcccccccCCCCCCceEEecCCCCEEEEEecCCCCCceEcc Confidence 9999999999999999999999999999999999999997 599999999999999999999999999999999 Q ss_pred CceEEEEEEEEeecCCccccCCccccccccccchhhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCCCCC Q lcl|NC_014636. 163 RLAKYRITAQKFIYSGEEIKPEFDPNRYVLDDTDPFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTGTPI 242 (265) Q Consensus 163 k~~vy~l~ce~F~YS~E~i~t~~~eid~i~~~~~~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g~p~ 242 (265) |+|+|+|+|++|+||+|.+++++++++.+.........+....+-. -.+..+ ..+.+.|+.. T Consensus 161 ~nyt~~i~a~lf~ySge~iat~~seid~I~~~~~~~v~~~~~t~g~-----------------~~~t~~-~~v~~~g~ga 222 (390) T protein:vir:10 161 KGYVWECQCELFEYSDEDLDTGVAEIDAIETAFANAIKLVMDAGGT-----------------GAFTVG-EEIVGDLYLA 222 (390) T ss_pred CceeeeeEEeeeccCCccccccccccccccccccceeeeeeccCCc-----------------cccccc-ceeeecCcce Confidence 9999999999999999999999999988865554222221111100 000000 1111111100 Q ss_pred CCCcccCCCCCccccchhhhhcC Q lcl|NC_014636. 243 AEHNKPAPAPVDMKNVFDDLESF 265 (265) Q Consensus 243 ~~~~~~~~~~~~~~~~f~~~~~~ 265 (265) . ..... . -....+. T Consensus 223 ~-~~a~v------~--~g~Vt~v 236 (390) T protein:vir:10 223 T-ATATI------S--GDAVDAV 236 (390) T ss_pred e-EEEEe------c--CCeEEEE Confidence 0 00000 0 0112222 No 16 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=100.00 E-value=3.4e-81 Score=461.76 Aligned_cols=231 Identities=27% Similarity=0.493 Sum_probs=172.3 Q ss_pred ccccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheecccccccccccccccccceeeeeccchhhccCccchhhh Q lcl|NC_014636. 25 SVLNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQSKFKKAYKVAMYLESFEEYSGQRDFFS 104 (265) Q Consensus 25 ~~~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~skF~~a~~IeaY~~n~egf~g~gd~~S 104 (265) -.+||||| |+|.+||+|+|+||+|||||||+||||||||+|++|+||+|+++|||++||+|||||+|||||+|+++||| T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~e~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~ 79 (470) T protein:vir:10 1 MALNPFFL-QGTSSEQRLTQDLINEHLKIYGVEVTYIPRKYVNTKSIIEEVQSSKFDDNFAIEAYVNTYEGYGGQGDVLT 79 (470) T ss_pred CcccceeE-cCCCchhHHHHHHHHHHhHhccceEEEechhhcccccccccccccccccceeEEEEeecccCcCCcceeee Confidence 78999997 99999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCceeeceEEEEEcchhhh------------------hhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhCCceE Q lcl|NC_014636. 105 KFGMQVNDEVSFTVSPKLFE------------------HQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRERLAK 166 (265) Q Consensus 105 KFG~~~~DE~tf~IS~~~F~------------------~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~Gk~~v 166 (265) ||||+++|||+|+|||+||+ +|+.+.||+|||||||||+|+||||+|||++.||||+||+|+ T Consensus 80 ~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~egdl~~~p~~~~~~~i~~ve~~~p~~~~G~~~~ 159 (470) T protein:vir:10 80 KFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNEEVILATRPREGDLVFFPLGSRLFEVKFVEHEDPFYQLGKNYV 159 (470) T ss_pred ecCcccceEEEEEECCccccccccchhhcccCcccccccccccCCcccccEEEecCCCCEEEEEecCCCCcchhcCccee Confidence 99999999999999999996 455678999999999999999999999999999999999999 Q ss_pred EEEEEEEeecCCccccCCccccccccccchhhhhhhhhhhhhhcccchhcchhhhhhcccchhhcccccCCCCC-----C Q lcl|NC_014636. 167 YRITAQKFIYSGEEIKPEFDPNRYVLDDTDPFAQVKALDGRMDINLDEFEEDDAIDVEADTFIEEFENIIGTGT-----P 241 (265) Q Consensus 167 y~l~ce~F~YS~E~i~t~~~eid~i~~~~~~l~~i~~lDg~~d~~~~q~~E~~~~~~e~~~~~~~f~~~~~~g~-----p 241 (265) |+|+|++|+|++|.+++.+..++......++...+. +++.--. ..... .-+.-.+-..- +...|+ | T Consensus 160 ~~it~~~f~ysge~~s~~v~~~~~~~~~~g~~~t~t-~~~~g~~----~~~t~---~~~~g~vt~it-itn~Gsgyt~~p 230 (470) T protein:vir:10 160 YQLKCELFEYEDEVIDTSIDAIDTVVQDDGYISKLQ-LVGIGRT----AEVAA---SIGVGYVREIF-LNNDGSGFTSPP 230 (470) T ss_pred EEeeeceeEecCCccccceecccccccccccceeee-ecCCCcc----ceeee---eecceeeeEeE-eeccccceeccC Confidence 999999999999999999998887765554322111 1111000 00000 00111122211 222232 1 Q ss_pred CCCCcccCCCC-Cccc------cchhhhhcC Q lcl|NC_014636. 242 IAEHNKPAPAP-VDMK------NVFDDLESF 265 (265) Q Consensus 242 ~~~~~~~~~~~-~~~~------~~f~~~~~~ 265 (265) ..-...+-..+ .... ...-...+. T Consensus 231 tVti~~~~~~~~~~a~~~~~t~~~~g~vt~i 261 (470) T protein:vir:10 231 TITFSASPAFTDARAVGILTTRANVTSIEKI 261 (470) T ss_pred EEEEccCCCCCCccceeeEeecceeeEEEEE Confidence 11111110000 0000 000001111 No 17 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=29.41 E-value=1.7 Score=19.37 Aligned_cols=121 Identities=11% Similarity=0.140 Sum_probs=76.8 Q ss_pred ccccccccccchhHHHHHHHHHHHHHhcCcceEeeeheecccccccccccc-cccccceeeeeccchhhccCccchhhhh Q lcl|NC_014636. 27 LNPYVNKHDYDPTLTLQEMLVAESVQMNGVEVYYIRRQFVKLDQLFGEDLQ-SKFKKAYKVAMYLESFEEYSGQRDFFSK 105 (265) Q Consensus 27 ~npYfN~~g~~~eQ~L~~~LV~E~Iqm~G~dv~YlpRe~v~~D~v~~E~~~-skF~~a~~IeaY~~n~egf~g~gd~~SK 105 (265) .+.|- .-+..+ .++|+-+|..|.+.+..-..-|+-.|+... +.-...|.+.+.+.++.--.=+|.+ T Consensus 1 M~~y~------~~~~~a----~~Li~kfG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~~~idGtl--- 67 (122) T protein:vir:97 1 MARFD------SAIALA----KKLIKKNGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQRYIDGQT--- 67 (122) T ss_pred Cccch------HHHHHH----HHHHHHhCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccchhhccCcE--- Confidence 22332 234444 455556999999888887777777675322 3335678999999888754333222 Q ss_pred cCceeeceEEEEEcchhhhhhhcCCCCccccEEEEcCCCcEEEEeeeccCChhhhhCCceEEEEEEEE Q lcl|NC_014636. 106 FGMQVNDEVSFTVSPKLFEHQTDGERAKEGDLIYFPMNNSLFEITWVEPTSPMIKRERLAKYRITAQK 173 (265) Q Consensus 106 FG~~~~DE~tf~IS~~~F~~~~~~~~P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~Gk~~vy~l~ce~ 173 (265) |+..|..-+...... .-.|+-||+|-+ +..-|.|-.|++-+| .|--..|+|..-+ T Consensus 68 --I~~GD~~l~~~a~~~------~~~P~~gD~v~~--~g~~~~Vi~v~~i~p---a~~~v~y~lqlRk 122 (122) T protein:vir:97 68 --IRMGDQRVFMPAEGL------TAPPEVEGLVLR--GLEVWKVIAVKPLNP---NGQAIMYELQVRQ 122 (122) T ss_pred --EeecCEEEEEeeCCC------ccccccCCEEEe--CCEEEEEEeccccCC---CCceEEEEEEeeC Confidence 455565444332221 347888999976 556789999987665 4555667777777 No 18 >protein:vir:1385 Length: 107 # NCBI annotation: Gp8 protein # Family: family:all:3858 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612837;genbank:gi:20065971;genbank:GeneID:935786 Probab=24.01 E-value=2.2 Score=18.67 Aligned_cols=106 Identities=10% Similarity=0.162 Sum_probs=72.7 Q ss_pred hcCcc-eEeeeheecccccccccccccccccceeeeeccchhhccCccchhhhhcCceeeceEEEEEcchhhhhhhcCCC Q lcl|NC_014636. 53 MNGVE-VYYIRRQFVKLDQLFGEDLQSKFKKAYKVAMYLESFEEYSGQRDFFSKFGMQVNDEVSFTVSPKLFEHQTDGER 131 (265) Q Consensus 53 m~G~d-v~YlpRe~v~~D~v~~E~~~skF~~a~~IeaY~~n~egf~g~gd~~SKFG~~~~DE~tf~IS~~~F~~~~~~~~ 131 (265) |.+-. |....++- .+|. .|... ..+...+++-|.+...-| .++++-=+++..-.++|+|. |..-++..+ T Consensus 1 ~~~~hRI~i~~~~~-~~D~-~G~~~-~~w~~~~~~WA~v~~~~g----~E~~~a~~~~~~~~~~f~iR---y~~~i~~~~ 70 (107) T protein:vir:13 1 MARYERISIKKLEE-KNIK-GRRQE-ECLIPFYDCWAEILDLYG----QELYGALQMKLENTIIFKIR---YCKKVEELR 70 (107) T ss_pred CCcceEEEEEeeee-eeCC-CCCee-cceEeEEEEEEEEecCCc----hheeecceeheeeeEEEEEE---ecCCccccc Confidence 66644 55554444 4574 45443 468889999999998754 56666667788888888883 334445555 Q ss_pred CccccEEEEcCCCcEEEEeeeccCChhhhhCCceEEEEEEEEee Q lcl|NC_014636. 132 AKEGDLIYFPMNNSLFEITWVEPTSPMIKRERLAKYRITAQKFI 175 (265) Q Consensus 132 P~EGDLIYfPm~n~LFEI~~VE~~~PFyQ~Gk~~vy~l~ce~F~ 175 (265) +..++-|.+ ++++|+|+.|.+.+ +++-..+|.|+.=. T Consensus 71 ~t~~~Ri~~--~g~~y~I~~v~~~~-----~~~~~l~i~c~eV~ 107 (107) T protein:vir:13 71 NKENFIVEW--QGRKYEIYYPDFLG-----YNKQFVKLKCKEVL 107 (107) T ss_pred cCcCcEEEE--CCeEEEEEecCCcc-----cCCeEEEEEEEEeC Confidence 666777766 68899999997542 25557899999877 Done!