Query lcl|NC_019713.1_cdsid_YP_007111884.1 [gene=F862_gp038] [protein=hypothetical protein] [protein_id=YP_007111884.1] [location=22468..22887] Match_columns 139 No_of_seqs 31 out of 34 Neff 5.5 Searched_HMMs 1612 Date Thu Nov 7 16:16:23 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95155 Length: 151 100.0 1.2E-51 7.3E-55 299.8 15.0 139 1-139 2-151 (151) 2 protein:vir:97211 Length: 150 100.0 1.1E-50 7E-54 294.4 14.3 139 1-139 4-150 (150) 3 protein:vir:80429 Length: 150 100.0 2.4E-48 1.5E-51 281.7 14.0 139 1-139 1-150 (150) 4 protein:vir:94921 Length: 125 100.0 1.3E-43 7.8E-47 255.8 14.2 125 1-137 1-125 (125) 5 protein:vir:107704 Length: 132 99.3 2.1E-14 1.3E-17 95.6 12.6 129 2-139 1-131 (132) 6 protein:vir:103278 Length: 169 99.3 4E-14 2.5E-17 94.0 12.5 131 1-136 37-169 (169) 7 protein:vir:104348 Length: 129 99.3 3.6E-14 2.2E-17 94.3 12.2 127 1-136 1-129 (129) 8 protein:vir:79637 Length: 130 99.2 2E-13 1.3E-16 90.2 11.5 125 1-137 1-130 (130) 9 protein:vir:397 Length: 132 # 91.0 0.018 1.1E-05 30.1 11.7 124 1-137 1-132 (132) 10 protein:vir:3428 Length: 131 # 90.9 0.018 1.1E-05 30.1 11.0 118 1-135 1-131 (131) 11 protein:vir:79571 Length: 137 73.7 0.17 0.0001 24.8 10.2 124 1-137 6-137 (137) 12 protein:vir:96125 Length: 140 47.9 0.69 0.00043 21.5 12.4 126 1-139 3-135 (140) 13 protein:vir:97325 Length: 145 45.5 0.77 0.00048 21.2 11.1 126 1-139 1-135 (145) 14 protein:vir:96894 Length: 140 42.9 0.87 0.00054 20.9 11.1 124 1-139 1-135 (140) 15 protein:vir:93736 Length: 145 40.7 0.96 0.00059 20.7 11.3 126 1-139 1-135 (145) 16 protein:vir:97421 Length: 145 40.7 0.96 0.00059 20.7 11.3 126 1-139 1-135 (145) 17 protein:vir:94488 Length: 145 40.7 0.96 0.00059 20.7 11.3 126 1-139 1-135 (145) 18 protein:vir:95111 Length: 145 39.5 1 0.00063 20.5 11.3 126 1-139 1-135 (145) 19 protein:vir:94794 Length: 145 38.5 1.1 0.00066 20.4 11.2 124 1-139 1-135 (145) 20 protein:vir:95961 Length: 145 38.2 1.1 0.00067 20.4 11.2 124 1-139 1-135 (145) 21 protein:vir:105337 Length: 145 36.4 1.2 0.00073 20.2 12.1 124 1-139 1-135 (145) 22 protein:vir:107096 Length: 145 36.3 1.2 0.00073 20.2 12.1 124 1-139 1-135 (145) 23 protein:vir:98426 Length: 131 29.0 1.7 0.0011 19.3 10.7 119 1-138 4-131 (131) 24 protein:vir:5979 Length: 134 # 27.0 1.9 0.0012 19.1 10.5 125 1-138 1-134 (134) 25 protein:vir:107857 Length: 154 26.5 1.9 0.0012 19.0 11.0 131 1-139 1-135 (154) 26 protein:vir:78379 Length: 139 26.1 1.9 0.0011 19.1 5.1 131 1-139 1-139 (139) 27 protein:vir:79065 Length: 154 24.6 2.2 0.0013 18.8 11.1 131 1-139 1-135 (154) 28 protein:vir:1244 Length: 145 # 24.4 2.2 0.0014 18.7 12.6 125 1-139 1-137 (145) 29 protein:vir:105892 Length: 141 22.2 2.5 0.0015 18.4 12.2 124 1-139 1-137 (141) 30 protein:vir:96260 Length: 141 22.2 2.5 0.0015 18.4 12.2 124 1-139 1-137 (141) 31 protein:vir:94096 Length: 141 22.2 2.5 0.0015 18.4 12.2 124 1-139 1-137 (141) 32 protein:vir:79247 Length: 157 20.3 2.8 0.0017 18.1 10.9 134 1-139 5-153 (157) No 1 >protein:vir:95155 Length: 151 # NCBI annotation: hypothetical protein ORF015 # Family: family:all:5248 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293422;genbank:gi:148912843;genbank:GeneID:5228230 Probab=100.00 E-value=1.2e-51 Score=299.79 Aligned_cols=139 Identities=19% Similarity=0.324 Sum_probs=128.0 Q ss_pred CChHHHHHHHHHHHHhhhccC-----CcccccccCCCC-CC-CCCCCcEEEEEEecCCcceeee----eCCCCCeEeecc Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAA-----GITIDIAQEGYP-YK-GAANKPYLEVSFSVESTSSQSL----GDAGNRSFVRDG 69 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~-----~~~~~I~~~n~~-f~-pp~~~~W~rv~i~~~~~~qasi----G~~g~~~~rr~G 69 (139) |||+|||++|+++|.++|.++ ....+|+|||.+ ++ |+++++||||+|+|+++.|+++ +.+|+|||||+| T Consensus 2 mtf~q~R~~i~~~~~~~w~~~~~~~a~~~p~v~~~~~~~~d~P~g~~~WaRLti~h~~~~qA~ls~~~eigggp~~~rtG 81 (151) T protein:vir:95 2 IEFDQVNDEVNALFLATWNAGSAAIAGYVPEIRWQGVQYRDLPDGSKFWVRLSKQTVFEEQATLSTCEGVPGQRKYTASG 81 (151) T ss_pred ccHHHHHHHHHHHhhhhcccCchhhhccccccccCCCCCCCCCCCCCceEEEEeecCCCccccccccccCCCCceEeeCc Confidence 999999999999999999653 455678888876 55 4468999999999999999988 556789999999 Q ss_pred eEEEEEEeecCCCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 70 VFHVNVYTPFTAGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 70 ~i~iq~F~p~~~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +|+||||+|+++|+++.++.++|..|+++||+++++|+|||+++|++++|++++|||+||+|||+||||- T Consensus 82 li~VQiF~p~~~G~~Le~Adkla~~a~eaFe~~~t~g~i~f~~~s~~eiG~~~gWyQ~Nv~i~f~y~e~~ 151 (151) T protein:vir:95 82 LVFVQIFCPKSNTQAFELGQKLAKLARNAFRGKSTPGKVWFRNTRINELPPEELYERFNVVTEFEYDEIG 151 (151) T ss_pred EEEEEEeeeccCchhhHHHHHHHHHHHHHhhccCCCCCceeeeeeecccCCCCCeEEEEeeeeecccccC Confidence 9999999999999999889999999999999999999999999999999999999999999999999999 No 2 >protein:vir:97211 Length: 150 # NCBI annotation: hypothetical protein ORF026 # Family: family:all:5248 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294534;genbank:gi:149408255;genbank:GeneID:5237076 Probab=100.00 E-value=1.1e-50 Score=294.41 Aligned_cols=139 Identities=24% Similarity=0.461 Sum_probs=126.0 Q ss_pred CChHHHHHHHHHHHHhhhcc-----CCccc-ccccCCCCCCCC--CCCcEEEEEEecCCcceeeeeCCCCCeEeecceEE Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKA-----AGITI-DIAQEGYPYKGA--ANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFH 72 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a-----~~~~~-~I~~~n~~f~pp--~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~ 72 (139) -||+|||++|+++|.++|.+ +|..+ .++|||..+.+| ++++||||+++|++++|+|+|.+++|||||+|+|+ T Consensus 4 ~tF~qaR~ei~t~f~~~W~a~~~a~~g~~p~~~~w~~~~~~~~P~g~~~WaRLti~~~~~~~as~G~~~gr~~~r~Gli~ 83 (150) T protein:vir:97 4 PTFDSARDEILGLFNTKWITDTPALNGGAPIRVEWPGVDAGDPPPADKPYARITLRHTTSRQATFGPTGGRRFTRPGLIT 83 (150) T ss_pred CcHHHHHHHHHhhhhhhccccchhhcCCcceeeccCCcccCCCcCCCCceEEEEeeccccccccccCCCCcEEeeCcEEE Confidence 47899999999999999953 23333 599999986544 45889999999999999999999999999999999 Q ss_pred EEEEeecCCCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 73 VNVYTPFTAGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 73 iq~F~p~~~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) ||||+|+.+|+++..+.+.|..|+++||+++++|+|||+++|++++|++++|||+||+||||||||| T Consensus 84 VQiF~p~~~G~G~~la~~~Ad~a~eaFe~~~t~g~i~f~~a~~~eig~~~gWyQ~Nv~i~Feyde~r 150 (150) T protein:vir:97 84 VQVFTPLSGGQGLSLAEKCAIIARDAFEGRGTASGIWFRNARIQEIGPDGAWYQMNVVVEFEYDELR 150 (150) T ss_pred EEEeeeccCCchhhHHHHHHHHHHHHHhccCCcCCeecccccccccCCCCceEEEEeEeeeeccccC Confidence 9999998878888888888888899999999999999999999999999999999999999999999 No 3 >protein:vir:80429 Length: 150 # NCBI annotation: BcepGomrgp11 # Family: family:all:5248 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210231;genbank:gi:146329923;genbank:GeneID:5123538 Probab=100.00 E-value=2.4e-48 Score=281.68 Aligned_cols=139 Identities=24% Similarity=0.370 Sum_probs=125.9 Q ss_pred CC--hHHHHHHHHHHHHhhhcc------CCcccccccCCCCCCCCC--CCcEEEEEEecCCcceeeeeCCC-CCeEeecc Q lcl|NC_019713. 1 MR--DDKAREVIGEALVAGLKA------AGITIDIAQEGYPYKGAA--NKPYLEVSFSVESTSSQSLGDAG-NRSFVRDG 69 (139) Q Consensus 1 Mt--~~qar~aI~~~~~a~~~a------~~~~~~I~~~n~~f~pp~--~~~W~rv~i~~~~~~qasiG~~g-~~~~rr~G 69 (139) |- --|||++|++++.+.|.+ .|+.+.|.|||..+.+|+ +++||||+++|++++|+++|.|+ +|||||+| T Consensus 1 ~~~~~~~ar~ei~~~f~~~W~~~~~~~~~g~~~~~~w~~~~~~~pP~g~~~WaRLti~h~~~~qA~~~~~~~gr~~~r~G 80 (150) T protein:vir:80 1 MIQDALQARSDINTMLFDQWSVADWSKVKGGKPNIAWEGRESARPPDGSAPYVAIFIKHVDGQQASLTDPDMLRRWSRDG 80 (150) T ss_pred CcchhhhhHHHHHHHHhhhhccCcchhhcCCcceeeecCcccCCcCCCCCceEEEEEecCCcccccccCCCCcceEeeCc Confidence 53 459999999999999966 366777999999984444 47899999999999999999765 79999999 Q ss_pred eEEEEEEeecCCCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 70 VFHVNVYTPFTAGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 70 ~i~iq~F~p~~~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +|+||||+|+.+|+++..+.+.|..|+++||+++++|.|||+++|++++|++++|||+||+||||||||| T Consensus 81 lI~VQiF~p~~~G~G~~la~k~Ad~a~eaFe~~~t~g~i~f~~as~~eiG~d~gWYQ~NV~ipF~yde~r 150 (150) T protein:vir:80 81 LITVQCFGMLSAGQGLEDATYQATIAMRAFEGKQSANGIWFRNARIKEIGSDRGWYQVNMIVEFEYDEVR 150 (150) T ss_pred EEEEEEeeeccCCchhhHHHHHHHHHHHHHhccCCCCCcccccccccccCCCCceEEEEeEeeeeccccC Confidence 9999999998878877888888888899999999999999999999999999999999999999999999 No 4 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=100.00 E-value=1.3e-43 Score=255.77 Aligned_cols=125 Identities=15% Similarity=0.137 Sum_probs=116.9 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeecC Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPFT 80 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~~ 80 (139) ||++|||++|++||.+.| ..+||.|||.| ||++++||||+++|++++++|+|. ||+||+|+|+||||+|++ T Consensus 1 Mt~~q~r~~I~~r~~a~~----~~~~I~~~N~p--p~~~~~W~Rlti~~g~~~~a~iG~---~~~~rtGli~iqiF~p~~ 71 (125) T protein:vir:94 1 MSYFQEKLDIENYFKANW----PDTPIFYENRT--ANSTGTWVRLTIQNGDAFQASNGE---VSYRHPGVVFVQIFTKKE 71 (125) T ss_pred CCHHHHHHHHHHHHHhCC----CccceeeCCCC--CCCCCceEEEEeccCcccccccCC---ceeeeeeEEEEEeeecCC Confidence 999999999999998755 46799999986 777899999999999999999993 899999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccCCCCCEEEEEEEeeeeeec Q lcl|NC_019713. 81 AGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAGSNGNYNVAYANCAFRFQE 137 (139) Q Consensus 81 ~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de 137 (139) +|+++.+.++|++. ++|+++ ++|+|+|++++++++|+|++|||+||+|+||+-. T Consensus 72 ~G~~~~~~~ad~~~--~~f~~~-~~g~i~f~~~~~~~~g~~~gwyQ~Nv~I~f~~~~ 125 (125) T protein:vir:94 72 VGSGEALKLADKVD--ALFRSK-TLGNIQFKVPQVQKVPSTTEWYQVNVSTEFYRGS 125 (125) T ss_pred cChHHHHHHHHHHH--HHHccC-CCCceEEeeceecCCCCCCCEEEEEEEEeeecCC Confidence 99999999999995 569998 5699999999999999999999999999999998 No 5 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=99.34 E-value=2.1e-14 Score=95.60 Aligned_cols=129 Identities=9% Similarity=0.050 Sum_probs=99.4 Q ss_pred ChHHHHHHHHHHHHhhhccCCcccccccCCCCCCCCCC-CcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeecC Q lcl|NC_019713. 2 RDDKAREVIGEALVAGLKAAGITIDIAQEGYPYKGAAN-KPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPFT 80 (139) Q Consensus 2 t~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~-~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~~ 80 (139) -|-|++.||..++.+.+. ..||+|||..|+||++ ++|+|+.+.++++...+++ +.|=...|++-|.|+.|.| T Consensus 1 ~hyE~~~a~r~~la~~~~----~lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L~---~d~r~y~Gv~QI~Vv~paG 73 (132) T protein:vir:10 1 MHYELSAAARAAFLSKYR----DFPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSID---RKCKSYIAIVQIGVVFPPG 73 (132) T ss_pred CchHHHHHHHHHHHhhhc----CCcEeecCCCcCCCCCCceEEEEEEccCCceeeecc---CcCcEEEEEEEEEEEecCC Confidence 688999999999976443 4899999999999987 6999999999999999887 4566667999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHhccccCCCce-eeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 81 AGSSAMTAHIYALAVRDIFEGKHFDKDL-WFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 81 ~G~~~~~a~~~a~~a~~~F~g~~~~g~l-~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +|+.....+++.+ .++|.-......- -+-..+..+.=.++.=|-+.|+.-||+|--. T Consensus 74 ~G~~~a~~iAd~i--~~~F~~g~~l~~Gyi~~~~~~~p~i~~~s~~~iPvrf~yR~Dt~~ 131 (132) T protein:vir:10 74 SGVDEARLKAKEI--ADFFKDGKMLNVGYIFEGAIVHQIVKHESGWMIPVRFTVRVDTKE 131 (132) T ss_pred CCcchhHHHHHHH--HHhccCcceeecceecCCCccCCceeCCcceEEEEEEEEEecccC Confidence 9998877777777 4779743332211 1222333344455666779999999999766 No 6 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=99.31 E-value=4e-14 Score=94.04 Aligned_cols=131 Identities=11% Similarity=0.124 Sum_probs=91.3 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCCCCCCCC-CcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeec Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYPYKGAAN-KPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPF 79 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~-~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~ 79 (139) --|-|+-.++.+++.+-..+=+...||+|||..|+||++ ++|+|+.+.++++...++|.. .|.|+ |++-|.|+.|. T Consensus 37 ~~h~ei~~a~rk~l~~~a~a~~~~LpVA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L~gd-~R~y~--GVfQIsVV~Pa 113 (169) T protein:vir:10 37 NVHYEMMVAARKLVSDAAVDIAGSLPVAYENCGFTPPKNGSSWLKFDYTEVDSVTWGLQRT-CRYYV--GMVQVSIFFSP 113 (169) T ss_pred chHHHHHHHHHHHHHHHHhhcccCCcEeeCCCCcCCCCCCccEEEEEEecCCceeeeccCC-CceEE--EEEEEEEEecC Confidence 124445455555553322222345889999999999988 589999999999999999852 24444 99999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHhccccC-CCceeeeeccccccCCCCCEEEEEEEeeeeee Q lcl|NC_019713. 80 TAGSSAMTAHIYALAVRDIFEGKHF-DKDLWFWECKASPAGSNGNYNVAYANCAFRFQ 136 (139) Q Consensus 80 ~~G~~~~~a~~~a~~a~~~F~g~~~-~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~d 136 (139) |+|+......++-+ .++|.-..+ +.+--+-+.+..++-.++.=|-+.|+.-||+| T Consensus 114 GtG~~ka~qiAdei--adlF~~gt~L~~Gyi~~~~~~~p~i~~~s~~~iPvr~~~R~D 169 (169) T protein:vir:10 114 GEGTDRPRQLAGRL--SEAFADGTMLDSGYIYEGGSVFPPVKSQSGWFIPVRFYVRMD 169 (169) T ss_pred CCCcchhHHHHHHH--HHhhhCCceeeceeecCCCeECCeeecCCceEEeEEEEEEeC Confidence 99988766666655 588984332 22322455666555566644558999999999 No 7 >protein:vir:104348 Length: 129 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398976;genbank:gi:81343960;genbank:GeneID:3778880 Probab=99.31 E-value=3.6e-14 Score=94.30 Aligned_cols=127 Identities=15% Similarity=0.186 Sum_probs=97.3 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCCCCCCCC-CcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeec Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYPYKGAAN-KPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPF 79 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~-~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~ 79 (139) |+ ..||..+.+++.+-+.. ..||+|||..|+||++ +.|+|+.+.++++...++|.. .|.|+ |++-|.|+.|. T Consensus 1 ~s-~aar~~v~d~~~~~~~~---~lpVA~eNv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d-~r~y~--Gv~QI~Vv~p~ 73 (129) T protein:vir:10 1 MS-LAARKFVNDLLVNEFPV---RYPVAWENAAFTPPADGSIWLKYDYTEVDTVTYGLSRK-CKYYV--GMVQISVFFSP 73 (129) T ss_pred Cc-hHHHHHHHHHHHHhhcC---CCcEeecCCCcCCCCCCceEEEEEecCCCceeeeccCC-CceEE--EEEEEEEEecC Confidence 65 57999999999776653 5789999999999998 479999999999999999852 24444 99999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHhccccC-CCceeeeeccccccCCCCCEEEEEEEeeeeee Q lcl|NC_019713. 80 TAGSSAMTAHIYALAVRDIFEGKHF-DKDLWFWECKASPAGSNGNYNVAYANCAFRFQ 136 (139) Q Consensus 80 ~~G~~~~~a~~~a~~a~~~F~g~~~-~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~d 136 (139) |+|+......++-+ .++|.-..+ ..+--+-+.+..++-.++.=|-+.|+.-||+| T Consensus 74 G~G~~~a~~iA~ei--~d~F~~g~~L~~Gyi~~~~~~~p~i~~~~~~~ipvr~~~r~d 129 (129) T protein:vir:10 74 GTGIDKPRQIANQL--AESIVDGTMLDSGTIYESGVVNPVIKSKSGWFIPVRFYVRLD 129 (129) T ss_pred CCCcchhhHHHHHH--HHhccCCceeeceeecCCCeECCeeecCCceEEeEEEEEEeC Confidence 99988866666666 478984333 22322455566555566644558899999999 No 8 >protein:vir:79637 Length: 130 # NCBI annotation: gp41 # Family: family:all:5121 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285530;genbank:gi:148734513;genbank:GeneID:5219995 Probab=99.21 E-value=2e-13 Score=90.15 Aligned_cols=125 Identities=14% Similarity=0.167 Sum_probs=90.0 Q ss_pred CChH---HHHHHHHHHHHhhhccCCcccccccCCCCCCCCCC-CcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAAGITIDIAQEGYPYKGAAN-KPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~-~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |-.| +||.++..++.. .-||+|||..|+||++ +.|+|+.+.++++...++|.. .|.|+ |++-|.|+ T Consensus 1 ~~~e~~~aaR~~~~~~~~~-------~lpVA~ENv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d-~r~y~--Gv~QI~VV 70 (130) T protein:vir:79 1 MHYELSVAARMALAQEYES-------EYMIAYENVEFTPPKGGGIWLKYDYKEADTIIHDLKRK-CISYI--GMVQIGIE 70 (130) T ss_pred CcchhhHHHHHHHHhhhhh-------hCceeecCCCcCCCCCCceEEEEEecCCCceeeeccCC-CceEE--EEEEEEEE Confidence 4332 455555333322 4699999999999998 479999999999999999852 24444 99999999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccccC-CCceeeeeccccccCCCCCEEEEEEEeeeeeec Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGKHF-DKDLWFWECKASPAGSNGNYNVAYANCAFRFQE 137 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~~~-~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de 137 (139) .|.|+|+......++-+ .++|.-..+ ..+--+-+.+..++-.++.=|-+.|+.-+|+|- T Consensus 71 ~paG~G~~~a~~iA~ei--~dlF~~g~~L~~Gyi~~~~~~~p~i~~~~~~~iPvr~~~R~d~ 130 (130) T protein:vir:79 71 FPPGSGIDKARKLAKNI--ADFFEDGKMLSNGYISEGAKVHQVQKSESGWFYPVRFYVRYDG 130 (130) T ss_pred ecCCCCcchhhHHHHHH--HHhccCCceeeceeecCCCeECCeeecCCceEEeEEEEEEecC Confidence 99999988866666666 478984333 223224556665555666445689999999999 No 9 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=90.99 E-value=0.018 Score=30.14 Aligned_cols=124 Identities=13% Similarity=0.187 Sum_probs=76.0 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCC-CCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeec Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYP-YKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPF 79 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~-f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~ 79 (139) |.++|+|++|+.++.....- .--.|.|.| |--+.+.|=+-|.+.-......++.. ..|+ ..++|-||-|. T Consensus 1 ~~ht~IR~~Vid~L~~~l~~----~~~ffdGrP~fiDe~elPAVAV~l~d~~~~~~~ld~---~~w~--A~LhI~iyLka 71 (132) T protein:vir:39 1 MKHRDIRKVIIDALESAIGT----DAIYFDGRPAVLEEGDFPAVAVYLTDAEYTGEELDA---DTWQ--AILHIEVFLEA 71 (132) T ss_pred CchHHHHHHHHHHHHhhCCC----ceEEecCcceeeccccCcEEEEEeecCCCCcceecC---CeeE--EEEEEEEEeec Confidence 99999999999999887632 223477787 43377888888988877777777763 4565 57999999998 Q ss_pred CCCHHHHHHHHHHHHHHHHhc----cccCCCceeeeec---cccccCCCCCEEEEEEEeeeeeec Q lcl|NC_019713. 80 TAGSSAMTAHIYALAVRDIFE----GKHFDKDLWFWEC---KASPAGSNGNYNVAYANCAFRFQE 137 (139) Q Consensus 80 ~~G~~~~~a~~~a~~a~~~F~----g~~~~g~l~f~~~---~~~~~g~dg~wyq~nv~i~fr~de 137 (139) .++-+++..++-.. +|- ...+.+-+..... +=+-=.+...|.-..+.-.=.|.= T Consensus 72 ~~~ds~LD~~aE~~----i~p~i~~~~~l~~l~~~~~~~gy~Y~rD~~~atW~sadL~y~ItY~~ 132 (132) T protein:vir:39 72 QVPDSELDDWMETR----VYPVLAEVPGLESLITTMVQQGYDYQRDDDMALWSSADLKYSITYDM 132 (132) T ss_pred CCCHHHHHHHHHHH----hHhhhcccchhhhHhhhhhhcCCCcccccccceEEEEEEEEEEEEeC Confidence 88766655555433 331 1111111222111 111111234688766654444443 No 10 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=90.94 E-value=0.018 Score=30.11 Aligned_cols=118 Identities=17% Similarity=0.249 Sum_probs=73.5 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCC-CCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeec Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYP-YKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPF 79 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~-f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~ 79 (139) |.++|+|++|+.++.+... .. ..|.|.| |--+...|=+-|.+.-......++.. ..|+ ..++|-||-|. T Consensus 1 ~~ht~IR~~Vid~L~~~l~----~v-~~fdG~P~fide~ElPAVAV~l~d~~~~~~~ld~---~~w~--A~LhI~iyLka 70 (131) T protein:vir:34 1 MKHTELRAAVLDALEKHDT----GA-TFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDS---DTWQ--AELHIEVFLPA 70 (131) T ss_pred CchHHHHHHHHHHHhccCC----ce-EEecCCceeeccccCcEEEEEeecCCCCcceecC---CeeE--EEEEEEEEeec Confidence 9999999999999977431 12 2577777 33377888899998877777777774 4565 57999999998 Q ss_pred CCCHHHHHHHHHHHHHHHHhccccCCCceeeee--c-cccccC-------CCCCEEEEEEE--eeeee Q lcl|NC_019713. 80 TAGSSAMTAHIYALAVRDIFEGKHFDKDLWFWE--C-KASPAG-------SNGNYNVAYAN--CAFRF 135 (139) Q Consensus 80 ~~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~--~-~~~~~g-------~dg~wyq~nv~--i~fr~ 135 (139) .++-+++..++-.. ++. ...++..+. + .+...| +...|.-..+. |.|+- T Consensus 71 ~~~ds~LD~~~E~~----i~~---v~~~~~~l~~l~~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) T protein:vir:34 71 QVPDSELDAWMESR----IYP---VMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) T ss_pred CCCHHHHHHHHHHH----hHH---HhhcchhhhhHhhhhhhccCCcccccccceEEEEEEEEEEEEeC Confidence 88777665554332 221 111111111 0 112222 33468876654 44444 No 11 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=73.70 E-value=0.17 Score=24.84 Aligned_cols=124 Identities=15% Similarity=0.177 Sum_probs=76.6 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCC-CCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeec Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYP-YKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPF 79 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~-f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~ 79 (139) --++|+|++|+.++.....- .--.|.+.| |......|=+-|.+.-......++.. ..|+ ..++|-||-|. T Consensus 6 ~iht~IR~~Vid~L~~~l~~----~~~ffdGrP~fiDe~ElPAVAV~l~da~~~~~~ld~---~~W~--A~LhI~iyLka 76 (137) T protein:vir:79 6 NRHTQIRQVVLARLREQCGD----SATFFDGLPAFVDAQELPAVSVWLSDAQYTGKMTDE---DDWQ--AVLHIAVFIRA 76 (137) T ss_pred HHHHHHHHHHHHHHHhhcCC----cEEEeCCccceechhhCcEEEEEeecCCCCcceecC---CeeE--EEEEEEEEeec Confidence 24899999999999886632 122577787 65556788888888877777777765 3465 57999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHhcc----ccCCCceeeeeccc---cccCCCCCEEEEEEEeeeeeec Q lcl|NC_019713. 80 TAGSSAMTAHIYALAVRDIFEG----KHFDKDLWFWECKA---SPAGSNGNYNVAYANCAFRFQE 137 (139) Q Consensus 80 ~~G~~~~~a~~~a~~a~~~F~g----~~~~g~l~f~~~~~---~~~g~dg~wyq~nv~i~fr~de 137 (139) +++-+++..++-- .+|.- ..+.+-+.....+- +-=.+...|.-..+.-.=.|.. T Consensus 77 ~~~ds~LD~~~E~----~I~~v~~~~~~l~~l~~~~~~~gY~Y~rD~e~~tW~sadL~y~ItYe~ 137 (137) T protein:vir:79 77 QAPDSELDMWMES----TIFPALNDVPALSGLIDTLIPLGFNYQRDNEMATWAMAEITYQITYTN 137 (137) T ss_pred CCCHHHHHHHHHH----HHHHhhcchhhhhhHhhhhhcccCCcccccccceeEEEEEEEEEEEcC Confidence 8887765554332 23420 00111122222211 1111334799888776666666 No 12 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=47.86 E-value=0.69 Score=21.47 Aligned_cols=126 Identities=7% Similarity=0.011 Sum_probs=60.6 Q ss_pred CChH-HHHHHHHHHHHhhhccCCcc-cccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEee Q lcl|NC_019713. 1 MRDD-KAREVIGEALVAGLKAAGIT-IDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTP 78 (139) Q Consensus 1 Mt~~-qar~aI~~~~~a~~~a~~~~-~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p 78 (139) |+-+ +-+.||+++|.+......-- .+ -|++.|-++ .-||+-| =....... +. +-....+=.+.|+|+.. T Consensus 3 msa~~aLq~Ai~~~L~ad~~l~alvggr-VyD~~P~~~--~~PYV~l--G~~~~~~~--~~--~~~~g~~~~~tl~Vws~ 73 (140) T protein:vir:96 3 VTAEPLLYNKIMNNLIENPITDKLVGGR-VFDCVQKDV--VYPYIVV--GESNVTES--ER--SPGMREIIAITFHVYSQ 73 (140) T ss_pred cchhHHHHHHHHHHhccChhHHhhcCcc-cccCCccCC--CCCEEEe--CCceeeec--CC--CcccceEEEEEEEEEEc Confidence 8777 66788888887654321000 12 255554222 3466544 11112222 11 11233455678889876 Q ss_pred cCCCHHHHHHHHHHHHHHHHhcccc-CCC----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 79 FTAGSSAMTAHIYALAVRDIFEGKH-FDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 79 ~~~G~~~~~a~~~a~~a~~~F~g~~-~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) . .|-.+..+.+.+. .++..... ++| .+.|...+... .+||..++--+.+.|+...=+ T Consensus 74 ~-~g~~ea~~ia~ai--~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~t~hgvl~~ra~ve~~~ 135 (140) T protein:vir:96 74 Y-ENGAEARELLKYL--NYACRLNINFKDYELEWIKKDNSQVFT-DIDQYTKHGVLRLLYKVRHKT 135 (140) T ss_pred C-CCHHHHHHHHHHH--HHHhcCCccCCCceEEEEEEeeeEEee-cCCCceEEEEEEEEEEEeecc Confidence 3 4544444444443 33333322 223 13333333222 256766776677777765544 No 13 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=45.45 E-value=0.77 Score=21.21 Aligned_cols=126 Identities=11% Similarity=0.074 Sum_probs=61.7 Q ss_pred CChH---HHHHHHHHHHHhhhccCCc-ccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAAGI-TIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~~~-~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+.. +-+.+|+++|.+......- .-+| |++.|-++ ..||+ ++-.......+-. -...++-.+.|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV-~D~~P~~a--~~PYv--~lG~~~~~d~~~~----~~~g~~~~~ti~Vw 71 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRV-FDCVQKDA--VYPYI--VVGETNVTNKETT----TSMVEDVGITLHVY 71 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCce-ecCCccCC--CCCEE--EeCcceeeecCCC----cccceEEEEEEEEE Confidence 9954 4456666666553322100 0022 45444322 24554 4332222222211 12456777899999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +.. .|-.+..+.+.+. .++-.+. .++| .+.|...+.. -.+||.+++..+.+.++..+=+ T Consensus 72 s~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 72 SQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVI-TDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred EcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEe-ecCCCceEEEEEEEEEEEecCc Confidence 863 3433333322222 3333322 1222 2333333333 2467888899999999988877 No 14 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=42.90 E-value=0.87 Score=20.93 Aligned_cols=124 Identities=15% Similarity=0.107 Sum_probs=59.6 Q ss_pred CC--h-HHHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MR--D-DKAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt--~-~qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+ . -+-+.+|+++|.+..... ++ +| |++.|-.+ .-||+-| -.......+-. -....+-.+.|+ T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~--~V-yD~~P~~~--~~Pyv~l--G~~~~~~~~~~----~~~g~~~~~~i~ 69 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGD--RV-FDVVQEDA--VYPYIVV--GESNVTNNESS----TMMRETVGIVIH 69 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCC--cc-ccCCccCC--CCCEEEe--cCceeeecCCC----cccceEEEEEEE Confidence 55 3 345677777776543221 11 33 55544222 2355544 22222222111 113456678888 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) |+... .|-.+..+.+.+ +.++-.+. .++| .+.|...+... .+||..++.-++++|+..-.+ T Consensus 70 Vws~~-~g~~ea~~ia~a--v~~AL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~~r~~v~~~~ 135 (140) T protein:vir:96 70 VYSQF-ATQYEAKQIISA--IGYVLNRPIDIENYEFQFSRIDSQSVFP-DIDRFTKHGTIRLLFKYRHIK 135 (140) T ss_pred EEEcC-CCHHHHHHHHHH--HHHHhCCCccCCCCeEEEEEEeeeEEEe-cCCCceEEEEEEEEEEEEeec Confidence 89863 343433333333 23333321 2223 12233333332 257778887788888877777 No 15 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=40.75 E-value=0.96 Score=20.69 Aligned_cols=126 Identities=12% Similarity=0.081 Sum_probs=60.4 Q ss_pred CChH---HHHHHHHHHHHhhhccCCc-ccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAAGI-TIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~~~-~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+.. +-+.+|+++|.+......- .-+| |++.|-++ ..||+-| -.......+-+ -...++-.+.|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI-~D~~P~~a--~~PYV~l--G~~~~~d~~~~----~~~g~~~~~ti~Vw 71 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRV-FDCVQKDA--VYPYIVV--GETNVTNKETT----TSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCce-ecCCcCCC--CCCEEEe--CCceeeecCCC----cccceEEEEEEEEE Confidence 9954 3456666666553322100 0022 55544332 2466443 32222222211 12456677889999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccc-cCCC-c---eeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGK-HFDK-D---LWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g-~---l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +.. .|-.+.-+.+.+. .++-.+. .++| . +.|...+.. -.+||..++..+.+.++..+=. T Consensus 72 s~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:93 72 SQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVI-TDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred EcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEe-ecCCcceEEEEEEEEEEEEecc Confidence 863 4434333333332 3333322 1222 1 234334333 2467788888888888876655 No 16 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=40.75 E-value=0.96 Score=20.69 Aligned_cols=126 Identities=12% Similarity=0.081 Sum_probs=60.4 Q ss_pred CChH---HHHHHHHHHHHhhhccCCc-ccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAAGI-TIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~~~-~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+.. +-+.+|+++|.+......- .-+| |++.|-++ ..||+-| -.......+-+ -...++-.+.|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI-~D~~P~~a--~~PYV~l--G~~~~~d~~~~----~~~g~~~~~ti~Vw 71 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRV-FDCVQKDA--VYPYIVV--GETNVTNKETT----TSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCce-ecCCcCCC--CCCEEEe--CCceeeecCCC----cccceEEEEEEEEE Confidence 9954 3456666666553322100 0022 55544332 2466443 32222222211 12456677889999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccc-cCCC-c---eeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGK-HFDK-D---LWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g-~---l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +.. .|-.+.-+.+.+. .++-.+. .++| . +.|...+.. -.+||..++..+.+.++..+=. T Consensus 72 s~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 72 SQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVI-TDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred EcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEe-ecCCcceEEEEEEEEEEEEecc Confidence 863 4434333333332 3333322 1222 1 234334333 2467788888888888876655 No 17 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=40.75 E-value=0.96 Score=20.69 Aligned_cols=126 Identities=12% Similarity=0.081 Sum_probs=60.4 Q ss_pred CChH---HHHHHHHHHHHhhhccCCc-ccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAAGI-TIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~~~-~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+.. +-+.+|+++|.+......- .-+| |++.|-++ ..||+-| -.......+-+ -...++-.+.|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI-~D~~P~~a--~~PYV~l--G~~~~~d~~~~----~~~g~~~~~ti~Vw 71 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRV-FDCVQKDA--VYPYIVV--GETNVTNKETT----TSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCce-ecCCcCCC--CCCEEEe--CCceeeecCCC----cccceEEEEEEEEE Confidence 9954 3456666666553322100 0022 55544332 2466443 32222222211 12456677889999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccc-cCCC-c---eeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGK-HFDK-D---LWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g-~---l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +.. .|-.+.-+.+.+. .++-.+. .++| . +.|...+.. -.+||..++..+.+.++..+=. T Consensus 72 s~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:94 72 SQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVI-TDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred EcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEe-ecCCcceEEEEEEEEEEEEecc Confidence 863 4434333333332 3333322 1222 1 234334333 2467788888888888876655 No 18 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=39.49 E-value=1 Score=20.55 Aligned_cols=126 Identities=11% Similarity=0.064 Sum_probs=60.3 Q ss_pred CChH---HHHHHHHHHHHhhhccCCc-ccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAAGI-TIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~~~-~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+.. +-+.+|+++|.+......- .-+| |++.|-++ ..||+-| -.......+-. -...++-.+.|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV-~D~~P~~a--~~PYV~l--G~~~~~~~~~~----~~~g~~~~~ti~Vw 71 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRV-FDCVQKDA--VYPYIVV--GETNVTNKETT----TSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCce-ecCCcCCC--CCCEEEe--cCceeeecCCC----cccceEEEEEEEEE Confidence 9954 3456666666543322100 0022 55544332 2466443 32222222211 12456677889999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) +.. .|-.+.-+.+.+. .++-.+. .++| .+.|...+... .+||.+++..+.+.|+..+=. T Consensus 72 s~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:95 72 SQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVIT-DIDRYTKHGIIRLVFKYRHNT 135 (145) T ss_pred EcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEee-cCCCceEEEEEEEEEEEEecc Confidence 863 3433333333222 3333321 1222 23333333332 367888888888888876655 No 19 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=38.47 E-value=1.1 Score=20.43 Aligned_cols=124 Identities=10% Similarity=0.066 Sum_probs=59.7 Q ss_pred CChH---HHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+.. +-+.+|+++|.+..... ++ +| |++.|-++ ..||+ ++-.......+-. -...++-.+.|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg--rV-~D~~P~~~--~~PYv--~lG~~~~~d~~~~----~~~g~~~~~ti~ 69 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDG--RV-FDCVQKDA--VYPYI--VVGETNVTNKETT----TSMVEDVGITLH 69 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhcc--cc-ccCCcCCC--CCCEE--EecCceeeecCCC----cccceEEEEEEE Confidence 9854 34556666664433221 11 22 55544322 23564 4332222222211 124566778899 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) |++.. .|-.+.-+.+.+. .++-.+. .++| .+.|...+... .+||..++..+.+.++..+=+ T Consensus 70 Vws~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:94 70 VYSQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVIT-DIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred EEEcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEee-cCCCceEEEEEEEEEEEEecc Confidence 99863 3433333322222 3333321 1222 23333333332 367788888888888877665 No 20 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=38.15 E-value=1.1 Score=20.40 Aligned_cols=124 Identities=10% Similarity=0.065 Sum_probs=59.7 Q ss_pred CChH---HHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+.. +-+.+|+++|.+..... ++ +| |++.|-++ ..||+ ++-.......+-. -...++-.+.|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg--rV-~D~~P~~~--~~PYv--~lG~~~~~d~~~~----~~~g~~~~~ti~ 69 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDG--RV-FDCVQKDA--VYPYI--VVGETNVTNKETT----TSMVEDVGITLH 69 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhcc--cc-ccCCcCCC--CCCEE--EecCceeeecCCC----cccceEEEEEEE Confidence 9854 34556666664433221 11 22 55544322 23564 4332222222211 124566778899 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) |++.. .|-.+.-+.+.+. .++-.+. .++| .+.|...+... .+||..++.-+.+.++..+=+ T Consensus 70 Vws~~-~g~~eak~ia~av--~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:95 70 VYSQA-RNRDEASQIIQFL--GFVLNNEIEIDYYSFIKSRIDTQEVIT-DIDQYTKHGVIRLVFKYRHNT 135 (145) T ss_pred EEEcC-CCHHHHHHHHHHH--HHHhccccCCCCCeEEEeEEeeeeEee-cCCCceEEEEEEEEEEEEecc Confidence 99863 3433333322222 3333321 1222 23333333332 367788888888888877665 No 21 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=36.38 E-value=1.2 Score=20.20 Aligned_cols=124 Identities=10% Similarity=0.042 Sum_probs=59.2 Q ss_pred CChH---HHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+.. +-+.+|+++|.+..... ++ + -|++.|-.+ ..||+-| -....... +. .-.....-.+.|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~--r-VyD~~P~~a--~~PyV~l--G~~~~~~~--~~--~~~~g~~~~~ti~ 69 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGG--R-VFDCVQKDA--VYPYIVV--GETNVTNK--ET--TTSMFEDVGVTLH 69 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcc--c-cccCCccCC--CCCEEEe--Ccceeeec--CC--CcccceEEEEEEE Confidence 9854 33556666655433211 11 2 255544222 3466443 22222222 22 1123556678899 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccccCC-C----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGKHFD-K----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~~~~-g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) |+... .|-.+.-+.+.+ +.++-.+.-.. | .+.|...+.. -.+||..++..+.+.|+..+=+ T Consensus 70 Vws~~-~g~~ea~~ia~a--v~~aL~a~l~l~~~~lv~l~~~~~~~~-rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:10 70 VYSQA-RNRDEASQIIQY--LGFVLNSEIEINNYSFIKSRIDTQEVI-TDIDQYTKHGIIRLIFKYRHNT 135 (145) T ss_pred EEEcC-CCHHHHHHHHHH--HHHHhCCCcCCCCCeEEEEEEeeeeEe-ecCCCceEEEEEEEEEEEeecc Confidence 99863 343333333333 23444432111 2 1233333332 2367788888888888877665 No 22 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=36.34 E-value=1.2 Score=20.19 Aligned_cols=124 Identities=10% Similarity=0.036 Sum_probs=59.2 Q ss_pred CChH---HHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+.. +-+.+|+++|.+..... ++ + -|++.|-.+ ..||+-| -....... +. .-.....-.+.|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~--r-VyD~~P~~a--~~PyV~l--G~~~~~~~--~~--~~~~g~~~~~ti~ 69 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGG--R-VFDCVQKDA--VYPYIVV--GETNVTNK--ET--TTSMFEDVGVTLH 69 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcc--c-cccCCccCC--CCCEEEe--Ccceeeec--CC--CcccceEEEEEEE Confidence 9854 33556666655433211 11 2 255544222 3466443 22222222 22 1123556678899 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccccCC-C----ceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGKHFD-K----DLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~~~~-g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) |+... .|-.+.-+.+.+ +.++-.+.-.. | .+.|...+.. -.+||..++..+.+.|+..+=+ T Consensus 70 Vws~~-~g~~ea~~ia~a--v~~aL~a~l~l~~~~lv~l~~~~~~~~-rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:10 70 VYSQA-RNRDEASQIIQY--LGFVLNSEIEINNYSFIKSRIDTQEVI-TDIDQYTKHGIIRLIFKYRHNT 135 (145) T ss_pred EEEcC-CCHHHHHHHHHH--HHHHhCCCcCCCCCeEEEEEEeeeeEe-ecCCCceEEEEEEEEEEEeecc Confidence 99863 343333333333 23444432111 2 1233333332 2367788888888888877665 No 23 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=29.00 E-value=1.7 Score=19.32 Aligned_cols=119 Identities=16% Similarity=0.117 Sum_probs=61.7 Q ss_pred CChHHHHHHHHHHHHhhhccCCcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEeecC Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTPFT 80 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p~~ 80 (139) .+|..|-.-+-+-+.....+++...++ +.-.|.+.| +..++|+=. |.+.+......=.+.|||+.| T Consensus 4 i~~pda~~v~~~~lr~~l~a~~~~V~V-~t~vP~~RP--~rfV~Vert---------gG~~~~~~~Dr~~L~Vq~W~~-- 69 (131) T protein:vir:98 4 ILMPDAVAVIAGYLRAVLVARGVTVPV-GSRVPSPRP--ARFVRIERI---------GGPANTVVTDRPRLDVHCWGS-- 69 (131) T ss_pred ccCCchhHHHHHHHHHHHHhcCCceEe-cccCCCCCC--ceEEEEEec---------CCCcCCccccceEEEEEecCC-- Confidence 566677666655554444343333332 222333333 245555522 211111112233689999988 Q ss_pred CCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccC---------CCCCEEEEEEEeeeeeecc Q lcl|NC_019713. 81 AGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAG---------SNGNYNVAYANCAFRFQEI 138 (139) Q Consensus 81 ~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g---------~dg~wyq~nv~i~fr~de~ 138 (139) ++..|.-+++.++++.. ...|-..+..+.-.+.+ +...-||+++.+--|---| T Consensus 70 ---t~~~A~~La~~vr~~ll--~~~~~~g~~~~~~~e~~gpy~~PD~es~~~Ryq~tv~l~~r~~~~ 131 (131) T protein:vir:98 70 ---SEEDAHDLMQLCRALLG--AARGSHGDTVLARPATGGPQFLPDAETGAARWAFTLDITMRGHAL 131 (131) T ss_pred ---CHHHHHHHHHHHHHHHh--hcccccchheeccccCCCCCcCCCCCCCCceeEEEEEEEeeeccC Confidence 55667777777766443 12332333222222221 2237999999998888777 No 24 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=26.97 E-value=1.9 Score=19.07 Aligned_cols=125 Identities=13% Similarity=0.114 Sum_probs=59.8 Q ss_pred CChHHHHHHHHHHHHhhhccCCccccc---ccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEe Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKAAGITIDI---AQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYT 77 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a~~~~~~I---~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~ 77 (139) ||..-+.-++-+++.+...++.+-.-+ -|++.|-.+ ..||+ ++-.......+-+ -...+.-.+.|+|+. T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg~I~D~~P~~~--~~PYV--~lG~~~~~d~~~~----~~~g~~~~~ti~Vws 72 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVNQVTESPGKDD--PYPYV--VIGDQSSTPFETK----SSFGENITMDFHVWG 72 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhhhhhcCCCCCC--CCCEE--EeCCceeeecCCC----cccceEEEEEEEEEE Confidence 998766555555544444443222222 345443222 24554 4332222222211 123456678899998 Q ss_pred ecCCCHHHHHHHHHHHHHHHHhccccC--CC----ceeeeeccccccCCCCCEEEEEEEeeeeeecc Q lcl|NC_019713. 78 PFTAGSSAMTAHIYALAVRDIFEGKHF--DK----DLWFWECKASPAGSNGNYNVAYANCAFRFQEI 138 (139) Q Consensus 78 p~~~G~~~~~a~~~a~~a~~~F~g~~~--~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~ 138 (139) . .|.. .. ...+-.+.++-.+..+ +| .+.|...+.. -.+||..++.-+.+.++.++= T Consensus 73 ~--~g~~-ea-~~ia~av~~aL~~~~L~l~~~~lv~l~~~~~~~~-rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 73 G--TTRA-EA-QDISSRVLEALTYKPLMFEGFTFVAKKLVLAQVI-TDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred C--CChH-HH-HHHHHHHHHHhcCCCcccCCceEEEeEEeeeeEE-ecCCCceEEEEEEEEEEEecC Confidence 6 4533 22 2223233455544332 22 1333333333 237777777777777666555 No 25 >protein:vir:107857 Length: 154 # NCBI annotation: gp37 # Family: family:all:1532 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024710;genbank:gi:48696947;genbank:GeneID:2845945 Probab=26.52 E-value=1.9 Score=19.01 Aligned_cols=131 Identities=11% Similarity=0.002 Sum_probs=72.1 Q ss_pred CChH-HHHHHHHHHHHhhhccCCcccccc-cCCCC--CCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD-KAREVIGEALVAGLKAAGITIDIA-QEGYP--YKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~-qar~aI~~~~~a~~~a~~~~~~I~-~~n~~--f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+-+ .+-++|++|+....- .-.|+ +|.++ |... .+=..|-|++.+|.-.+.=+.|.-.=.|+=.+.+=|+ T Consensus 1 m~~t~~ii~aiv~rL~~~lP----~~~ve~fP~~p~ey~l~--h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi 74 (154) T protein:vir:10 1 MATTLEMVDAIVARLRVKLP----ALVTEYFPERPDEYRLN--HAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIV 74 (154) T ss_pred CchhHHHHHHHHHHHHHhCC----cceEeeCCCChhHcCCC--CCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEE Confidence 7655 466778888877652 22333 45544 3332 3446788888777655443333322344445666666 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) .+.-.|-....+..|+. |.+--|+..+|-=.+|-++=+-+|+++|.||+-+.+.=..--|- T Consensus 75 ~r~l~g~~gal~~LD~v--R~aL~Gf~ppdc~~~~lv~d~f~ge~~G~W~Y~l~~at~t~~Ve 135 (154) T protein:vir:10 75 LRQLNGRGGAIDVLDHV--RTALVGFRPPDCKKLAAVSDKFLGESAGLWQYVIEFSAGAVIVE 135 (154) T ss_pred eeccCCcchhhHHHHHH--HHHHhccccCCCceeehhhhcccccccceeeeeeeeccchhhhh Confidence 66544433344455554 66777776665223444555778888887777554332222221 No 26 >protein:vir:78379 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110845;genbank:gi:134288606;genbank:GeneID:5179642 Probab=26.15 E-value=1.9 Score=19.11 Aligned_cols=131 Identities=19% Similarity=0.182 Sum_probs=79.8 Q ss_pred CC--hHHHHHHHHHHHHhhhccCCcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEEee Q lcl|NC_019713. 1 MR--DDKAREVIGEALVAGLKAAGITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVYTP 78 (139) Q Consensus 1 Mt--~~qar~aI~~~~~a~~~a~~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F~p 78 (139) |. |+.--.+.-+++.+|-.-+| ..+.-+|-...-.+|-||+.--+.-.+..|+.+==. -.|-|+--|.+=+- T Consensus 1 matyfedltkafdtalvafgtnng--ikvalenidaptstdtpylasymllsdteqadlfwt----eqragvyqvdinvg 74 (139) T protein:vir:78 1 MATYFEDLTKAFDTALVAFGTNNG--IKVALENIDAPTSTDTPYLASYMLLSDTEQADLFWT----EQRAGVYQVDINVG 74 (139) T ss_pred CchhHHHHHHhhhheeeeeccCCc--eeEeeeccCCCccCCcchhhheeeeccCcccceeee----cccCceEEEeeecc Confidence 42 22222233333444443344 456677765555567888877777778888866421 23678888888777 Q ss_pred cCCCHHHHHHHHHHHHHHHHhccccCCCc-eeeeeccccccCC---CCCEEEEEEEeeeeeecc--C Q lcl|NC_019713. 79 FTAGSSAMTAHIYALAVRDIFEGKHFDKD-LWFWECKASPAGS---NGNYNVAYANCAFRFQEI--K 139 (139) Q Consensus 79 ~~~G~~~~~a~~~a~~a~~~F~g~~~~g~-l~f~~~~~~~~g~---dg~wyq~nv~i~fr~de~--~ 139 (139) ..-|+.-+.+++|++-| +|..-.+-+. --|-+++.+..|+ +.||-.-.++|.|-+=-- | T Consensus 75 salgsapinrladklna--afaagncfsrneicaevqsvslgplivengwakrplsinfiaftarir 139 (139) T protein:vir:78 75 SALGSAPINRLADKLNA--AFAAGNCFSRNEICAEVQSVSLGPLIVENGWAKRPLSINFIAFTARIR 139 (139) T ss_pred cccccchhHHHHhhhhh--hhhccccccchhhhhhhhhccccceeeccCcccCceeeeeeeeeeecC Confidence 77788888999999854 4653222221 1245666666664 468988888888766444 4 No 27 >protein:vir:79065 Length: 154 # NCBI annotation: gp11 # Family: family:all:1532 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111211;genbank:gi:134288825;genbank:GeneID:4960739 Probab=24.58 E-value=2.2 Score=18.75 Aligned_cols=131 Identities=11% Similarity=-0.003 Sum_probs=71.9 Q ss_pred CChH-HHHHHHHHHHHhhhccCCcccccc-cCCCC--CCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEEEE Q lcl|NC_019713. 1 MRDD-KAREVIGEALVAGLKAAGITIDIA-QEGYP--YKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVNVY 76 (139) Q Consensus 1 Mt~~-qar~aI~~~~~a~~~a~~~~~~I~-~~n~~--f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq~F 76 (139) |+-+ .+-++|++|+....- .-.|+ +|.++ |... .+=..|-|++.+|.-.+.=+.|.-.=.|+=.+.+=|+ T Consensus 1 m~~t~~ii~~iv~rL~~~lP----~~~ve~fP~~p~ey~l~--h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi 74 (154) T protein:vir:79 1 MATTLEMVDSVVARLRVKLP----ALVTEYFPERPDEYRLN--HAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIV 74 (154) T ss_pred CchhHHHHHHHHHHHHHhCC----cceEeeCCCChhHcCCC--CCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEE Confidence 7655 466778888877652 22333 45554 3332 3446788888777655443333322344445666666 Q ss_pred eecCCCHHHHHHHHHHHHHHHHhccccCCCceeeeeccccccCCCCCEEEEEEEeeeeeeccC Q lcl|NC_019713. 77 TPFTAGSSAMTAHIYALAVRDIFEGKHFDKDLWFWECKASPAGSNGNYNVAYANCAFRFQEIK 139 (139) Q Consensus 77 ~p~~~G~~~~~a~~~a~~a~~~F~g~~~~g~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de~~ 139 (139) .+.=.|-....+..|+. |.+--|+..+|-=.+|-++=+-+|+++|.|||-+.+.=..--|- T Consensus 75 ~r~l~g~~gal~~LD~v--R~aL~Gf~ppdc~~~~lv~d~f~ge~~G~W~Y~l~~at~t~~Ve 135 (154) T protein:vir:79 75 LRQLNGRGGAIDVLDHV--RTALVGFRPPDCKKLAAVSDKFLGESAGLWQYVIEFSAGAVIVE 135 (154) T ss_pred eeccCCcchhhHHHHHH--HHHHhccccCCCceeehhhhcccccccceeeeeeeeccchhhhc Confidence 66544433344455554 66777776655223444555778888887776554332222221 No 28 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=24.35 E-value=2.2 Score=18.72 Aligned_cols=125 Identities=12% Similarity=0.082 Sum_probs=55.4 Q ss_pred CChH---HHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MRDD---KAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt~~---qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+.. +-+++|+.++.+..... |. + .|++.|-++ .-||+-|- ...+.-...+-....+=.+.|+ T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~--~-vyD~~P~~~--~~PyV~lG------~~~~~~~~t~~~~~~~~~lti~ 69 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGG--R-VFDCVQKDA--VYPYIVVG------ETNVTNKETTTSMVEDVGITLH 69 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCc--c-cccCCccCC--CCCEEEec------cceeeecCCCcccceEEEEEEE Confidence 7742 23566666654322110 11 2 266555332 34675541 2222221111224456678889 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccccCCC-c---eeeeeccccccCCCCCEEEEEEEeeeeee--ccC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGKHFDK-D---LWFWECKASPAGSNGNYNVAYANCAFRFQ--EIK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~~~~g-~---l~f~~~~~~~~g~dg~wyq~nv~i~fr~d--e~~ 139 (139) |+... .|-.+..+.+.++.+ -|...-.++| . +.+...++. -.+|+..|+-.++++|+.. -++ T Consensus 70 Vws~~-~gr~ea~~ia~ai~~-aL~~~l~l~~~~lv~l~~~~~~~~-rd~d~~~~hgvl~~ra~i~~~~~~ 137 (145) T protein:vir:12 70 VYSQA-RNRDEASQIIQFLGF-VLNNEIEIDYYSFIKSRIDTQEVI-TDIDQYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred EEEcC-ccHHHHHHHHHHHHH-HhccccCCCCceEEEEEEeeEEEE-ecCCCceEEEEEEEEEEEEeCCcc Confidence 99864 354444444444421 2222222323 1 222222222 1356777877666666653 333 No 29 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=22.15 E-value=2.5 Score=18.41 Aligned_cols=124 Identities=10% Similarity=0.091 Sum_probs=57.8 Q ss_pred CC--h-HHHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MR--D-DKAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt--~-~qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+ . -+-+.+|+++|.+-.... ++ +| |++.|-++| -|| |++-.......+- .-....+-.+.|+ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~--rI-~D~~P~~~~--~PY--v~lG~~~~~~~~~----~~~~g~~~~~ti~ 69 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDD--RV-FDVVQDDAV--YPY--IVVGESNVTNNES----SATMRETVGIVIH 69 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCC--cc-ccCCccCCC--CCE--EEeCCceeeecCC----CcccceEEEEEEE Confidence 55 3 455677877776644321 21 33 565543322 355 4433323322221 1124566778889 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeec--cC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQE--IK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de--~~ 139 (139) |++.. .|-.+ +...+..+.++-... .++| .+.|...+.. -.+||..++-.+.+.|+..+ ++ T Consensus 70 Vws~~-~g~~e--ak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~t~hgvl~~ra~v~~~~~~ 137 (141) T protein:vir:10 70 VYSQF-ATQYE--AKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVF-PDIDRFTKHGTIRLLFKYRHKKKN 137 (141) T ss_pred EEEcC-CCHHH--HHHHHHHHHHHhcccccCCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEecccc Confidence 99863 34333 222222233443221 1223 1223333332 23577777766777776443 33 No 30 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=22.15 E-value=2.5 Score=18.41 Aligned_cols=124 Identities=10% Similarity=0.091 Sum_probs=57.8 Q ss_pred CC--h-HHHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MR--D-DKAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt--~-~qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+ . -+-+.+|+++|.+-.... ++ +| |++.|-++| -|| |++-.......+- .-....+-.+.|+ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~--rI-~D~~P~~~~--~PY--v~lG~~~~~~~~~----~~~~g~~~~~ti~ 69 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDD--RV-FDVVQDDAV--YPY--IVVGESNVTNNES----SATMRETVGIVIH 69 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCC--cc-ccCCccCCC--CCE--EEeCCceeeecCC----CcccceEEEEEEE Confidence 55 3 455677877776644321 21 33 565543322 355 4433323322221 1124566778889 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeec--cC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQE--IK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de--~~ 139 (139) |++.. .|-.+ +...+..+.++-... .++| .+.|...+.. -.+||..++-.+.+.|+..+ ++ T Consensus 70 Vws~~-~g~~e--ak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~t~hgvl~~ra~v~~~~~~ 137 (141) T protein:vir:96 70 VYSQF-ATQYE--AKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVF-PDIDRFTKHGTIRLLFKYRHKKKN 137 (141) T ss_pred EEEcC-CCHHH--HHHHHHHHHHHhcccccCCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEecccc Confidence 99863 34333 222222233443221 1223 1223333332 23577777766777776443 33 No 31 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=22.15 E-value=2.5 Score=18.41 Aligned_cols=124 Identities=10% Similarity=0.091 Sum_probs=57.8 Q ss_pred CC--h-HHHHHHHHHHHHhhhccC---CcccccccCCCCCCCCCCCcEEEEEEecCCcceeeeeCCCCCeEeecceEEEE Q lcl|NC_019713. 1 MR--D-DKAREVIGEALVAGLKAA---GITIDIAQEGYPYKGAANKPYLEVSFSVESTSSQSLGDAGNRSFVRDGVFHVN 74 (139) Q Consensus 1 Mt--~-~qar~aI~~~~~a~~~a~---~~~~~I~~~n~~f~pp~~~~W~rv~i~~~~~~qasiG~~g~~~~rr~G~i~iq 74 (139) |+ . -+-+.+|+++|.+-.... ++ +| |++.|-++| -|| |++-.......+- .-....+-.+.|+ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~--rI-~D~~P~~~~--~PY--v~lG~~~~~~~~~----~~~~g~~~~~ti~ 69 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDD--RV-FDVVQDDAV--YPY--IVVGESNVTNNES----SATMRETVGIVIH 69 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCC--cc-ccCCccCCC--CCE--EEeCCceeeecCC----CcccceEEEEEEE Confidence 55 3 455677877776644321 21 33 565543322 355 4433323322221 1124566778889 Q ss_pred EEeecCCCHHHHHHHHHHHHHHHHhccc-cCCC----ceeeeeccccccCCCCCEEEEEEEeeeeeec--cC Q lcl|NC_019713. 75 VYTPFTAGSSAMTAHIYALAVRDIFEGK-HFDK----DLWFWECKASPAGSNGNYNVAYANCAFRFQE--IK 139 (139) Q Consensus 75 ~F~p~~~G~~~~~a~~~a~~a~~~F~g~-~~~g----~l~f~~~~~~~~g~dg~wyq~nv~i~fr~de--~~ 139 (139) |++.. .|-.+ +...+..+.++-... .++| .+.|...+.. -.+||..++-.+.+.|+..+ ++ T Consensus 70 Vws~~-~g~~e--ak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~t~hgvl~~ra~v~~~~~~ 137 (141) T protein:vir:94 70 VYSQF-ATQYE--AKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVF-PDIDRFTKHGTIRLLFKYRHKKKN 137 (141) T ss_pred EEEcC-CCHHH--HHHHHHHHHHHhcccccCCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEecccc Confidence 99863 34333 222222233443221 1223 1223333332 23577777766777776443 33 No 32 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=20.26 E-value=2.8 Score=18.13 Aligned_cols=134 Identities=13% Similarity=0.002 Sum_probs=65.2 Q ss_pred CChHHHHHHHHHHHHhhhcc-CCcccccccCCCC---CCCCCCCcEEEEEEecCCc----ceeeeeCCCCCeEeecceEE Q lcl|NC_019713. 1 MRDDKAREVIGEALVAGLKA-AGITIDIAQEGYP---YKGAANKPYLEVSFSVEST----SSQSLGDAGNRSFVRDGVFH 72 (139) Q Consensus 1 Mt~~qar~aI~~~~~a~~~a-~~~~~~I~~~n~~---f~pp~~~~W~rv~i~~~~~----~qasiG~~g~~~~rr~G~i~ 72 (139) |.+=.++.+|++|+.+.-.. .--.+..++-+.+ ...|. .|+ ....... .+.+.+..=+...-+.++|. T Consensus 5 ~d~~a~~~~IierLka~v~~l~~V~~aadla~i~e~~q~tPa--ayV--v~~gd~~~~~~~~~~~~~~~Q~vtq~f~Vvl 80 (157) T protein:vir:79 5 FDYLFLEPLLIERIRSEVPGLAIVSGVPDLAALSEQDQPAPS--VYV--VYLGDEIGTGADYQGGRRAIQAIGQQWAVVL 80 (157) T ss_pred hhhhhhhHHHHHHHHhhhhhhhhhccccchhhhhhhcCCCcE--EEE--EecccccCCCcccccCcceeeeeeeeEEEEE Confidence 78889999999999753211 0001111222222 22222 233 2222111 11111110023344555555 Q ss_pred E-EEEeecCCCHHHHH-HHHHHHHHHHHhccccCC---CceeeeeccccccC-CCC-CEEEEEEEeeeeeeccC Q lcl|NC_019713. 73 V-NVYTPFTAGSSAMT-AHIYALAVRDIFEGKHFD---KDLWFWECKASPAG-SNG-NYNVAYANCAFRFQEIK 139 (139) Q Consensus 73 i-q~F~p~~~G~~~~~-a~~~a~~a~~~F~g~~~~---g~l~f~~~~~~~~g-~dg-~wyq~nv~i~fr~de~~ 139 (139) + +-.-....|..+.. +.......+.+..|+.-+ +.|.+-.. ..+.+ .+| .+|-+-++|.|.+-.+| T Consensus 81 avrn~~~~~~~~a~~d~ag~ll~~v~~AL~GW~P~~~~~pl~~~~~-~~~~~y~~gf~yypl~F~~~~~~~~~~ 153 (157) T protein:vir:79 81 VVHYADSSNSGEGARREAGPLLGRLVKALTGWAPAIDVAPLARSAR-QSPVTYASGYFYFPLVFTARFVYPRVK 153 (157) T ss_pred EEeccccccccchhHHHHHHHHHHHHHHhcCccccccCCceeeeec-CCcccccCCeEEEEEEEEEeeeccccc Confidence 4 33232234443332 343444457888897543 33554322 23233 445 57889999999999999 Done!