Query lcl|NC_020844.1_cdsid_YP_007673704.1 [gene=SLPG_00022] [protein=hypothetical protein] [protein_id=YP_007673704.1] [location=16483..16845] Match_columns 120 No_of_seqs 51 out of 54 Neff 6.1 Searched_HMMs 1612 Date Thu Nov 7 16:03:27 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_22 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_22_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80382 Length: 122 100.0 4.8E-37 2.9E-40 219.7 13.0 118 1-120 1-122 (122) 2 protein:vir:97237 Length: 122 100.0 2.2E-35 1.3E-38 210.6 12.9 117 4-120 1-122 (122) 3 protein:vir:104346 Length: 123 100.0 5.2E-33 3.3E-36 197.6 12.1 115 1-120 1-118 (123) 4 protein:vir:95145 Length: 126 99.9 9.8E-32 6.1E-35 190.6 11.7 117 4-120 1-126 (126) 5 protein:vir:107669 Length: 123 99.9 4.3E-27 2.6E-30 165.2 9.6 115 1-120 1-118 (123) 6 protein:vir:79639 Length: 123 99.9 1.6E-26 9.8E-30 162.0 9.4 115 1-120 1-118 (123) 7 protein:vir:78381 Length: 119 99.8 1.1E-22 7.1E-26 140.9 5.5 114 2-120 1-119 (119) 8 protein:vir:94992 Length: 78 # 99.1 2.8E-13 1.8E-16 89.4 6.0 78 38-120 1-78 (78) 9 protein:vir:103282 Length: 104 98.1 4.3E-08 2.7E-11 61.0 8.3 100 1-104 2-104 (104) 10 protein:vir:4199 Length: 113 # 94.5 0.00028 1.7E-07 40.1 5.8 106 7-120 1-109 (113) 11 protein:vir:4161 Length: 112 # 94.2 0.00092 5.7E-07 37.2 8.0 105 1-120 2-109 (112) 12 protein:vir:96807 Length: 132 93.5 0.00031 1.9E-07 39.8 4.1 116 1-120 1-132 (132) 13 protein:vir:3426 Length: 117 # 91.1 0.01 6.4E-06 31.5 9.4 101 1-120 1-108 (117) 14 protein:vir:7858 Length: 111 # 88.4 0.012 7.3E-06 31.2 7.5 103 18-120 1-108 (111) 15 protein:vir:101653 Length: 111 88.4 0.012 7.3E-06 31.2 7.5 103 18-120 1-108 (111) 16 protein:vir:95261 Length: 133 87.9 0.015 9.6E-06 30.5 7.7 104 15-120 1-122 (133) 17 protein:vir:5258 Length: 123 # 87.0 0.015 9E-06 30.7 7.1 105 1-120 1-117 (123) 18 protein:vir:79686 Length: 118 83.8 0.066 4.1E-05 27.1 9.3 105 1-119 2-118 (118) 19 protein:vir:395 Length: 117 # 80.4 0.096 5.9E-05 26.2 10.7 101 1-120 1-108 (117) 20 protein:vir:80934 Length: 120 78.3 0.12 7.2E-05 25.7 10.1 111 1-119 1-120 (120) 21 protein:vir:1582 Length: 117 # 76.9 0.13 8.1E-05 25.4 9.7 104 1-119 2-117 (117) 22 protein:vir:44 Length: 120 # N 75.6 0.15 9E-05 25.2 10.2 111 1-119 1-120 (120) 23 protein:vir:9822 Length: 120 # 74.7 0.15 9.2E-05 25.1 8.0 105 10-119 1-120 (120) 24 protein:vir:3035 Length: 110 # 67.2 0.26 0.00016 23.8 9.2 98 17-119 1-110 (110) 25 protein:vir:5977 Length: 109 # 64.1 0.31 0.00019 23.4 9.0 102 17-120 1-105 (109) 26 protein:vir:1385 Length: 107 # 45.0 0.78 0.00049 21.2 8.5 99 18-120 1-105 (107) 27 protein:vir:98924 Length: 113 44.8 0.79 0.00049 21.1 9.5 102 1-119 3-113 (113) 28 protein:vir:106729 Length: 152 38.7 1.1 0.00065 20.5 7.9 108 8-120 1-121 (152) 29 protein:vir:94034 Length: 141 37.5 1.1 0.00069 20.3 7.7 114 1-120 1-125 (141) 30 protein:vir:8431 Length: 114 # 33.2 1.4 0.00085 19.8 7.6 102 19-120 1-111 (114) 31 protein:vir:78609 Length: 152 33.0 1.4 0.00086 19.8 7.3 108 8-120 1-121 (152) 32 protein:vir:77651 Length: 152 32.9 1.2 0.00075 20.1 5.3 108 8-120 1-121 (152) 33 protein:vir:102144 Length: 113 31.9 1.5 0.00091 19.7 10.2 101 19-120 1-113 (113) 34 protein:vir:94767 Length: 104 30.7 1.4 0.00087 19.8 5.2 93 26-120 1-100 (104) 35 protein:vir:101560 Length: 152 29.5 1.6 0.001 19.4 5.4 108 8-120 1-121 (152) 36 protein:vir:81177 Length: 109 28.6 1.7 0.0011 19.3 9.2 100 1-120 1-105 (109) 37 protein:vir:9762 Length: 112 # 27.5 1.8 0.0011 19.1 6.0 102 17-120 1-108 (112) 38 protein:vir:9577 Length: 112 # 26.1 2 0.0012 18.9 5.9 102 17-120 1-108 (112) 39 protein:vir:4459 Length: 134 # 23.3 2.3 0.0014 18.6 8.2 115 1-120 1-119 (134) 40 protein:vir:100134 Length: 109 21.2 2.6 0.0016 18.3 9.3 102 1-120 1-108 (109) 41 protein:vir:4789 Length: 123 # 20.9 2.7 0.0017 18.2 8.9 101 12-119 1-123 (123) No 1 >protein:vir:80382 Length: 122 # NCBI annotation: BcepGomrgp10 # Family: family:all:704 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210230;genbank:gi:146329922;genbank:GeneID:5123491 Probab=100.00 E-value=4.8e-37 Score=219.72 Aligned_cols=118 Identities=14% Similarity=0.138 Sum_probs=104.6 Q ss_pred CCchhH-HHHHHHHHHHHhhhcCCe-EEEEEeCCccCCCCcCCCccCCcccceeEEEeeccccccCCceEEEcCeEEEEe Q lcl|NC_020844. 1 MPSAEI-YGELQGVASELMAEFQQG-TARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQYVQEGYIAASDLQVTAS 78 (120) Q Consensus 1 ~~~a~~-Y~r~~atA~rLi~kfG~~-~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~~a 78 (120) |- .| |+||+++|+|||+|||++ +++|.....-+...|+|.++++++++++|++.+|++++||||||+.||++++++ T Consensus 1 m~--~f~Y~rl~~~A~~Li~kfG~~~tv~~~~~~~~~~~~~~P~t~~~~~~~v~gv~~~y~~r~idGtlIq~GD~~~~~~ 78 (122) T protein:vir:80 1 MK--SFNYPRLLKTVDRLIEKFGEECFIVEYIDAVDPTRPFDPPVRTEVRTPVKGVFVKATEKHADGTLIHIGDQLVLIS 78 (122) T ss_pred CC--CcchhHHHHHHHHHHHHhCCCeEEEEeeccCCCCCccccCCCceeeccceEEEecccccccCCcEEeeCCEEEEEe Confidence 76 77 999999999999999996 455443222223459999999999999999999999999999999999999988 Q ss_pred c--CCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 79 V--FGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 79 ~--~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) + +..+|+++|+|++||++|+||+++|++|||++++|++|+|= T Consensus 79 a~~~~~~P~~~D~v~~~g~~~~Vi~v~p~~pag~~v~y~~q~Rk 122 (122) T protein:vir:80 79 GSLKREAANIKGDLFRGAEKWKMWNVVPLKPGPVNMLFKIKVSQ 122 (122) T ss_pred cccccccCCcCCEEEeCCeeEEEEeccccCCCCceEEEEEEEeC Confidence 7 55579999999999999999999999999999999999999 No 2 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=100.00 E-value=2.2e-35 Score=210.63 Aligned_cols=117 Identities=20% Similarity=0.314 Sum_probs=103.3 Q ss_pred hhHHHHHHHHHHHHhhhcCCe-EEEEEeCCccCCC--CcCCCccCCcccceeEEEeeccccccCCceEEEcCeEEEEec- Q lcl|NC_020844. 4 AEIYGELQGVASELMAEFQQG-TARYIHPGEQTGP--DYDPQPGEPTPYTLDATVRGVAAQYVQEGYIAASDLQVTASV- 79 (120) Q Consensus 4 a~~Y~r~~atA~rLi~kfG~~-~~~r~~~g~~~~p--~~dp~~~~~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~~a~- 79 (120) -.||+||+++|+|||+|||++ +++|..+|+..++ .|+|.+.++++|+++|++.+|++++|||++|++||+++++++ T Consensus 1 M~~y~~~~~~a~~Li~kfG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~~~idGtlI~~GD~~l~~~a~ 80 (122) T protein:vir:97 1 MARFDSAIALAKKLIKKNGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQRYIDGQTIRMGDQRVFMPAE 80 (122) T ss_pred CccchHHHHHHHHHHHHhCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccchhhccCcEEeecCEEEEEeeC Confidence 149999999999999999995 6777766543222 466666677889999999999999999999999999999976 Q ss_pred -CCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 80 -FGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 80 -~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +.++|++||+|++||++|+||+++|++|||++++|++|+|= T Consensus 81 ~~~~~P~~gD~v~~~g~~~~Vi~v~~i~pa~~~v~y~lqlRk 122 (122) T protein:vir:97 81 GLTAPPEVEGLVLRGLEVWKVIAVKPLNPNGQAIMYELQVRQ 122 (122) T ss_pred CCccccccCCEEEeCCEEEEEEeccccCCCCceEEEEEEeeC Confidence 46789999999999999999999999999999999999999 No 3 >protein:vir:104346 Length: 123 # NCBI annotation: conserved phage-related protein # Family: family:all:704 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398974;genbank:gi:81343958;genbank:GeneID:3778878 Probab=99.96 E-value=5.2e-33 Score=197.55 Aligned_cols=115 Identities=17% Similarity=0.208 Sum_probs=102.5 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCe---EEEEEeCCccCCCCcCCCccCCcccceeEEEeeccccccCCceEEEcCeEEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQG---TARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQYVQEGYIAASDLQVTA 77 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~---~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~~ 77 (120) |- |.+|+++|++||++||++ +...++++.+++.+..+....+..|+++|+.+.|++++|||++|++||+++++ T Consensus 1 mn----Y~~l~~~a~~lI~~f~~~~gv~~~~t~~~~v~~v~G~ev~~p~~~~~~~Gv~~~y~~r~IDG~lIq~gD~~~i~ 76 (123) T protein:vir:10 1 MN----YNEIESITRDGINFFSDANGVYEMSTGAGYVEIVNGVEVEVPAQTFQLKGLVREIKTRDIDGEFIQFGDKRGIF 76 (123) T ss_pred CC----hHHHHHHHHHHHHHhCCCCCceEEecCCCeeeCCCCceeeccceeEeeEEEeccCChhhccceeeeeccEEEEE Confidence 54 999999999999999874 45555566667666666555667899999999999999999999999999998 Q ss_pred ecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 78 SVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 78 a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ++ +++++.||+|++||+.|+||+++||+|++++|+|++|+|. T Consensus 77 ~a-~~eik~Gd~i~vdGe~~rVV~~~pikPa~~~v~y~~qlRr 118 (123) T protein:vir:10 77 TA-QVEIKQGYQIKVDGETFVVVDPRPVKPTGTTVGYRPILRR 118 (123) T ss_pred ec-CceeccCCEEEECCeEEEEecCCccCccceeEEEeeeeee Confidence 76 8899999999999999999999999999999999999999 No 4 >protein:vir:95145 Length: 126 # NCBI annotation: hypothetical protein ORF014 # Family: family:all:704 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293421;genbank:gi:148912842;genbank:GeneID:5228220 Probab=99.95 E-value=9.8e-32 Score=190.60 Aligned_cols=117 Identities=19% Similarity=0.277 Sum_probs=107.8 Q ss_pred hhHHHHHHHHHHHHhhhcCCe-EEEEEeCCccCCCCc--CCCccCCcccceeEEEeecccc------ccCCceEEEcCeE Q lcl|NC_020844. 4 AEIYGELQGVASELMAEFQQG-TARYIHPGEQTGPDY--DPQPGEPTPYTLDATVRGVAAQ------YVQEGYIAASDLQ 74 (120) Q Consensus 4 a~~Y~r~~atA~rLi~kfG~~-~~~r~~~g~~~~p~~--dp~~~~~~~~~~~~~~~~v~~~------~idGtlI~~gD~~ 74 (120) -+-|+|++++|+|||+||||+ +++++..+..+||.. +|.+.+++++|+++++.+|+++ |||||+|+.||++ T Consensus 1 ma~y~rl~ata~rLIaK~Gq~~~~r~~~~~~~~DP~~p~~p~~~t~~d~p~t~v~l~yd~r~~~~~~~idGt~I~~GD~~ 80 (126) T protein:vir:95 1 MAQFDRAIKTAQRLIAKNGEKVKWRVIEDVTPTDPNKPWEPGPALPEDKDVTICFLPVDRQTQETFNFIKGTEVPKGSVM 80 (126) T ss_pred CcchHHHHHHHHHHHHHhCCceEEEEEeeccCCCCccCCCCCCCceeecccceEEecccccccchhhhcccceeccCcEE Confidence 358999999999999999995 788988888777764 4555577789999999999999 8999999999999 Q ss_pred EEEecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 75 VTASVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 75 v~~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ++++.+.++|++||.|+.||+.|+|++++|++|+|+.++|++...+ T Consensus 81 i~i~gl~~ap~vgd~V~~~g~~~~ivav~pl~P~gvavLy~l~~~~ 126 (126) T protein:vir:95 81 GLMGNVPFAPNLKDVVIRNGVELRLAYIDVLSPNGQKVLYTMVFQA 126 (126) T ss_pred EEecccccccccCceEEECcEEEEEEeeeeeCCCcceeeeeeeecC Confidence 9999999999999999999999999999999999999999999999 No 5 >protein:vir:107669 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003901;genbank:gi:45686317;genbank:GeneID:2773009 Probab=99.89 E-value=4.3e-27 Score=165.15 Aligned_cols=115 Identities=22% Similarity=0.279 Sum_probs=98.4 Q ss_pred CCchhHHHHHHHHHHHHhhhcCC--eEEEEEeCCccCCCCcCCCccC-CcccceeEEEeeccccccCCceEEEcCeEEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQ--GTARYIHPGEQTGPDYDPQPGE-PTPYTLDATVRGVAAQYVQEGYIAASDLQVTA 77 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~--~~~~r~~~g~~~~p~~dp~~~~-~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~~ 77 (120) |- |.+|+++|+++|++|.. +.+..++.++.+....+-+..+ +..|+++|+.+.|+.++|||++|++||+++++ T Consensus 1 mN----Y~~i~~~a~~~I~~fsd~~g~~~l~t~~~~~~~v~G~Ev~~p~~~~~~~G~~~~y~~reIDG~lI~~gDvk~if 76 (123) T protein:vir:10 1 MN----YSQIERMARKGVAFFTDPSRPMNLIKQGEYGYDENGFEIPPMEQVIPISGATRRPNAREIDGETIRASDILGIF 76 (123) T ss_pred CC----hHHHHHHHHHHHhhhcCCCCeEEeeeCCcccccCCCeecccCCeeeeeEEEEeeccccccccceeeeccEEEee Confidence 65 99999999999999943 4777766655333333334444 45699999999999999999999999999999 Q ss_pred ecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 78 SVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 78 a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ++ .++.+.||+|++||+.|+||+++||+|++++|||++|+|. T Consensus 77 ~a-~veik~Gd~I~vDg~~~rVV~~~pvkPa~~~I~y~~qLRr 118 (123) T protein:vir:10 77 NN-DHEINEGDYIEIDGIRHVVVDARPVQASLEPVAYRPVLRR 118 (123) T ss_pred cc-ceeeccCCEEEECCeEEEEecCcccchhhhhhhhhhhhce Confidence 86 6788999999999999999999999999999999999999 No 6 >protein:vir:79639 Length: 123 # NCBI annotation: gp39 # Family: family:all:704 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285528;genbank:gi:148734511;genbank:GeneID:5219997 Probab=99.88 E-value=1.6e-26 Score=162.05 Aligned_cols=115 Identities=17% Similarity=0.173 Sum_probs=99.8 Q ss_pred CCchhHHHHHHHHHHHHhhhcCC---eEEEEEeCCccCCCCcCCCccCCcccceeEEEeeccccccCCceEEEcCeEEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQ---GTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQYVQEGYIAASDLQVTA 77 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~---~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~~ 77 (120) |- |..++.+|..+|+.|.. .....+++|..++-...+.+..+..|+++|+.+.|+.++|||++|++||+++++ T Consensus 1 mN----y~~l~~~~~~~I~~fsd~~G~~~~~t~~g~~~dv~G~ev~~p~~~~~~~Gv~~~~~~reIDg~lIq~gDvk~i~ 76 (123) T protein:vir:79 1 MN----YEQIRSMASAGINFFSDGTGEFDCITQPGSVEIVGGIEVEKPEIKVKIKGLVRAPRTREVDGEVIRVTDKLGVF 76 (123) T ss_pred Cc----hHHHHHHHHHHHHhhccCCCceeeeecCcceeecCCeeccccceEEeEEEEeecCCccccCCeeEEeccEEEEE Confidence 76 77777888889999932 267777788877655554444455799999999999999999999999999998 Q ss_pred ecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 78 SVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 78 a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ++ +++.+.||+|++||+.|+||+++||+|++++|+|++|+|. T Consensus 77 ~a-~veik~Gd~i~vDge~~rVV~~~pvkPa~~~I~y~~qLRr 118 (123) T protein:vir:79 77 NA-DVELKNGYQIDIDGERYVMVETRPIRPTSITVAYRPIMRR 118 (123) T ss_pred ec-ceeeccCCEEEECCeEEEEecCccccchhhhhhhhhhhcc Confidence 76 7788999999999999999999999999999999999999 No 7 >protein:vir:78381 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110843;genbank:gi:134288604;genbank:GeneID:5179644 Probab=99.77 E-value=1.1e-22 Score=140.89 Aligned_cols=114 Identities=23% Similarity=0.368 Sum_probs=106.3 Q ss_pred CchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCCccC-----CcccceeEEEeeccccccCCceEEEcCeEEE Q lcl|NC_020844. 2 PSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQPGE-----PTPYTLDATVRGVAAQYVQEGYIAASDLQVT 76 (120) Q Consensus 2 ~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~~-----~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~ 76 (120) -|++|..|||...+|||+|||+ ++.++|+|+.. |||..|| ++..|++++...++...++||.|++||.++. T Consensus 1 m~t~fskrmqgvgtrll~k~gs-tv~lvr~g~k~---wd~vlgeyiw~~d~vlplkavpvpvn~glvngttiqagdm~vk 76 (119) T protein:vir:78 1 MGTSFSKRMQGVGTRLLSKYGS-TVNLVRKGQKT---WDPVLGEYVWGPDVVLPLKAVPVPVNAGLVNGTTIQAGDMMVK 76 (119) T ss_pred CCcchhhHHhhhhHHHHHhhcc-hhhhhhccchh---hhhhhhhhccCCceeeecccccccccccccccceeeccceeee Confidence 4789999999999999999997 89999999876 9999886 5668999999999999999999999998886 Q ss_pred EecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 77 ASVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 77 ~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +..++.|+..|+|.++|+.|+|+.+..-.-++.+++|.+|+|- T Consensus 77 -ad~svvpkm~dkv~f~geqwsvvaiekkmvnddvvayfiqvrk 119 (119) T protein:vir:78 77 -ADYSVVPKMDDKVRFSGEQWSVVAIEKKMVNDDVVAYFIQVRK 119 (119) T ss_pred -ccceeccccccceeecCceeeeeeeehhhcchhheeheeeecC Confidence 4679999999999999999999999999999999999999999 No 8 >protein:vir:94992 Length: 78 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224023;genbank:gi:62327310;genbank:GeneID:5176820 Probab=99.06 E-value=2.8e-13 Score=89.37 Aligned_cols=78 Identities=21% Similarity=0.277 Sum_probs=71.1 Q ss_pred CcCCCccCCcccceeEEEeeccccccCCceEEEcCeEEEEecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEE Q lcl|NC_020844. 38 DYDPQPGEPTPYTLDATVRGVAAQYVQEGYIAASDLQVTASVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIF 117 (120) Q Consensus 38 ~~dp~~~~~~~~~~~~~~~~v~~~~idGtlI~~gD~~v~~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~q 117 (120) -| +.++..|++++...++...++||.|++||.++. +..++.|+..|++.++|+.|+|+.+..-.-++.+++|.+| T Consensus 1 ~w----~~d~vlplkavpvpvn~glvngttiqagdm~vk-ad~svvpkm~dkvqf~geqwsvvaiekkmvnddvvayfiq 75 (78) T protein:vir:94 1 MW----GPDVVLPLKAVPVPVNAGLVNGTTIQAGDMMVK-ADYSVVPKMDDKVQFSGEQWSVVAIEKKMVNDDVVAYFIQ 75 (78) T ss_pred CC----CCceeeecccccccccccccccceeeccceEEe-ccceeccccccceeecCceeEEEeeeeccccccceeeeee Confidence 23 345667899999999999999999999998886 4679999999999999999999999999999999999999 Q ss_pred EeC Q lcl|NC_020844. 118 VKG 120 (120) Q Consensus 118 vR~ 120 (120) +|- T Consensus 76 vrk 78 (78) T protein:vir:94 76 VRK 78 (78) T ss_pred ecC Confidence 999 No 9 >protein:vir:103282 Length: 104 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277461;genbank:gi:71834104;genbank:GeneID:3562393 Probab=98.07 E-value=4.3e-08 Score=60.96 Aligned_cols=100 Identities=15% Similarity=0.103 Sum_probs=79.0 Q ss_pred CCchhHHHHHHHHHHHHhhhcCC--eEEEEEeCCccCCCCcCCCccCCc-ccceeEEEeeccccccCCceEEEcCeEEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQ--GTARYIHPGEQTGPDYDPQPGEPT-PYTLDATVRGVAAQYVQEGYIAASDLQVTA 77 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~--~~~~r~~~g~~~~p~~dp~~~~~~-~~~~~~~~~~v~~~~idGtlI~~gD~~v~~ 77 (120) +| ==|..++.+|+.-|+-|.. +.+..++.++ +.-..+-+...|. .++++|+++..++|+|||..|+.||++=++ T Consensus 2 ~~--MNy~~ie~~~r~GInffSD~~g~~e~~tq~~-~~ivnG~EV~~p~~~~~ikG~vR~~kaReIDGe~Ir~~D~~GIF 78 (104) T protein:vir:10 2 LP--MNHALLQQQIKAGINLLSDGDGVFEATTQPA-ISIVNGYEVRTPGTSYTVRGVIREFKARDIDGDIIKFGDRRGIF 78 (104) T ss_pred Cc--cCHHHHHHHHhhhhhhhcCCCceeeeecCCc-eeeecCeecCCCCeEEEeeeeeecccccccCccEEEeeceecee Confidence 33 2399999999999999975 5677776544 3333333444444 489999999999999999999999999888 Q ss_pred ecCCceeccCCEEEeCCeEEEEEeccc Q lcl|NC_020844. 78 SVFGREPTLEGVVEIDGVEKQIIRVDP 104 (120) Q Consensus 78 a~~~~~p~~~D~v~~dg~~~~Vv~v~p 104 (120) .+ ..|.+-||.|.+||++-+-...+- T Consensus 79 ~a-d~ei~eG~~I~IDge~~~~~~~~~ 104 (104) T protein:vir:10 79 TA-DSVVSEGDRIYIDQEATQSLTLDQ 104 (104) T ss_pred ec-ceeecCCcEEEEcccccceeeecC Confidence 65 678899999999999987777665 No 10 >protein:vir:4199 Length: 113 # NCBI annotation: unknown # Family: family:all:11763 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071824;genbank:gi:11863107;genbank:GeneID:1257609 Probab=94.54 E-value=0.00028 Score=40.06 Aligned_cols=106 Identities=12% Similarity=0.066 Sum_probs=67.7 Q ss_pred HHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEeec---cccccCCceEEEcCeEEEEecCCce Q lcl|NC_020844. 7 YGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGV---AAQYVQEGYIAASDLQVTASVFGRE 83 (120) Q Consensus 7 Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v---~~~~idGtlI~~gD~~v~~a~~~~~ 83 (120) ..|...--.+||.+||+. .+++...-....+-+.+..|+...+++.. +.+ +.+.|.|..|..=|--+++ ++.+. T Consensus 1 mkrikggF~~Li~rYG~~-~~LiH~~~~~RD~RGQPi~ED~~~~iK~F-FHVN~G~ERiI~GQ~i~DYDAY~LI-~~~~~ 77 (113) T protein:vir:41 1 MKRIKGGFQRLLRRYGQE-VTLVHHSLKERDQRGQPVYEDDTRPLKAF-FHVNRGGERIINGQRISDYDAYVLI-PLGVS 77 (113) T ss_pred CccchhHHHHHHHHhCCc-hhhhhhhhccccccCCCccccCCceEEEE-EEeeCCCceeeecceeecccceEEE-eeeee Confidence 457777888999999984 22332222223333444445555555543 444 6678999999888777665 45566 Q ss_pred eccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 84 PTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 84 p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ..-+|.|++||+.|+|-++-|-.-- -++.+|- T Consensus 78 i~~~D~I~i~~~rY~V~aiI~~RTH-----~E~~L~R 109 (113) T protein:vir:41 78 IMRGDHILIGEDKYTVTSIIKNRTH-----IEATLQR 109 (113) T ss_pred eccCCeEEECCceEEEeeecccccc-----hhhheee Confidence 6679999999999999888664321 2223332 No 11 >protein:vir:4161 Length: 112 # NCBI annotation: unknown # Family: family:all:11763 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046970;genbank:gi:9630540;genbank:GeneID:1261714 Probab=94.25 E-value=0.00092 Score=37.23 Aligned_cols=105 Identities=15% Similarity=0.160 Sum_probs=66.0 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEeec---cccccCCceEEEcCeEEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGV---AAQYVQEGYIAASDLQVTA 77 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v---~~~~idGtlI~~gD~~v~~ 77 (120) -||---| .+||.+||+. .+++.+.-....+-+.+..|+...+++.. +.+ +.+.|.|..|..=|--+++ T Consensus 2 kpsvtrF-------~~Li~~YG~~-~~LiH~~~~~RD~RGQPi~ED~~~~iK~F-FHVN~G~ERiI~GQ~i~DYDAY~LI 72 (112) T protein:vir:41 2 KPSVTRF-------NQLIYKYGMD-AKLIHKVMNGRDERGQPITEDTETSIKVF-FHVNTGNERLILGQQVVDYDAYALI 72 (112) T ss_pred CcchHHH-------HHHHHHhCCc-hhhhhhhhccccccCCCccccCCceEEEE-EEeeCCCceeeecceeecccceEEE Confidence 5664333 4789999984 22232222223333444445555555543 444 6678999999888777665 Q ss_pred ecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 78 SVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 78 a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ++.+...-||.|++||+.|+|-++-|-.-- -++.+|- T Consensus 73 -~~~~~i~~~D~I~i~~~~Y~V~aiI~~RTH-----~E~~L~R 109 (112) T protein:vir:41 73 -VKTLNVNDDDEIEVDGKRYRVGAVIPQRTH-----QELHLRR 109 (112) T ss_pred -eeeeeeccCCeEEECCceEEEeeecccccc-----ceeeeee Confidence 455666679999999999999888765432 3444444 No 12 >protein:vir:96807 Length: 132 # NCBI annotation: hypothetical phage protein # Family: family:all:32158 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224252;genbank:gi:62362387;genbank:GeneID:3345746 Probab=93.53 E-value=0.00031 Score=39.79 Aligned_cols=116 Identities=20% Similarity=0.184 Sum_probs=79.4 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeC---CccCCCCcCCCccCCc-ccceeEEEeeccccccCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHP---GEQTGPDYDPQPGEPT-PYTLDATVRGVAAQYVQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~---g~~~~p~~dp~~~~~~-~~~~~~~~~~v~~~~idGtlI~~gD~~v~ 76 (120) |- - |=+--+-|++-|++-|- .+.++++ |++++...|-...+|- .||.-+.-..++.-+.-...-++||++++ T Consensus 1 mq--t-yfedyqdaretlkedgf-avtlikkglpgggydengdiqaaepdieypgygittsfsswhlkegiaqagdvkli 76 (132) T protein:vir:96 1 MQ--T-YFEDYQDARETLKEDGF-AVTLIKKGLPGGGYDENGDIQAAEPDIEYPGYGITTSFSSWHLKEGIAQAGDVKLI 76 (132) T ss_pred Cc--c-hhhhhHHHHHHHhhcCc-eEeeeeccCCCCCcCCCCceeccCCCccCCCcceecccchhhhhhhhccccceeEE Confidence 65 3 33344578888998886 3445554 4445444444444553 36655544455555555567899999999 Q ss_pred EecCCc--e--------eccCCE--EEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 77 ASVFGR--E--------PTLEGV--VEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 77 ~a~~~~--~--------p~~~D~--v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +++.-+ | -.-||+ -++||+-|+||--.-++|..+-|.-++.+|- T Consensus 77 fapevmsdeyitfynqlrnggdrmyaevdgelwrvvmgeevkptstqiiaklhlrr 132 (132) T protein:vir:96 77 FAPEVMSDEYITFYNQLRNGGDRMYAEVDGELWRVVMGEEVKPTSTQIIAKLHLRR 132 (132) T ss_pred eccccccchhhhhHHhhhcchhhhhhhhcchheehccccccccchhhhhhhhcccC Confidence 986322 1 233777 5789999999999999999999999999999 No 13 >protein:vir:3426 Length: 117 # NCBI annotation: head-tail joining protein # Family: family:all:1908 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040589;genbank:gi:9626253;genbank:GeneID:2703484 Probab=91.12 E-value=0.01 Score=31.46 Aligned_cols=101 Identities=10% Similarity=0.026 Sum_probs=62.2 Q ss_pred CCchhH---HHHHHHHH-HHHhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEee-cccccc-CCceEEEcCeE Q lcl|NC_020844. 1 MPSAEI---YGELQGVA-SELMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRG-VAAQYV-QEGYIAASDLQ 74 (120) Q Consensus 1 ~~~a~~---Y~r~~atA-~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~-v~~~~i-dGtlI~~gD~~ 74 (120) |+ +| |++.-+.| ..++..||...+.....|. ..++++++-+ ++..++ .|..|-.+.=. T Consensus 1 m~--~~dNlfd~a~~~aD~~i~~~fg~~a~i~~~~g~--------------~~~i~gVFDdP~~~~~~~gG~~i~~s~P~ 64 (117) T protein:vir:34 1 MA--DFDNLFDAAIARADETIRGYMGTSATITSGEQS--------------GAVIRGVFDDPENISYAGQGVRVEGSSPS 64 (117) T ss_pred CC--cccchhHHHHhhcchhhHhhcCeeEEEEeCCCc--------------ceEEEEEecCccchhhccCCEEeecCCcE Confidence 88 65 55444444 4566789975333332221 1235565533 434443 34566655555 Q ss_pred EEEecC-CceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 75 VTASVF-GREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 75 v~~a~~-~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +.+... -+..+-+|.|+|+|+.|.|+. +.|.|.=.+|-.-.|| T Consensus 65 L~vk~aDv~~l~r~D~v~I~G~~y~V~~---~~PD~~G~~~l~L~rg 108 (117) T protein:vir:34 65 LFVRTDEVRQLRRGDTLTIGEENFWVDR---VSPDDGGSCHLWLGRG 108 (117) T ss_pred EEeeechhhccCCCCEEEECCCeeEeee---cccCCCceEEEEeecC Confidence 655432 335777899999999999997 5667777777777899 No 14 >protein:vir:7858 Length: 111 # NCBI annotation: gp15 # Family: family:all:3991 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817465;genbank:gi:29565894;genbank:GeneID:1259087 Probab=88.45 E-value=0.012 Score=31.17 Aligned_cols=103 Identities=13% Similarity=0.090 Sum_probs=60.1 Q ss_pred hhhcCCeEEEEEeCCccCCCCcCCCccC-CcccceeEEEe---eccccccCCceEEEcCeEEE-EecCCceeccCCEEEe Q lcl|NC_020844. 18 MAEFQQGTARYIHPGEQTGPDYDPQPGE-PTPYTLDATVR---GVAAQYVQEGYIAASDLQVT-ASVFGREPTLEGVVEI 92 (120) Q Consensus 18 i~kfG~~~~~r~~~g~~~~p~~dp~~~~-~~~~~~~~~~~---~v~~~~idGtlI~~gD~~v~-~a~~~~~p~~~D~v~~ 92 (120) +.+||..++..+.-++...|+|+-.... .+..++.++.+ .++..+-+=+++-..|+-+. -.+.-+.-+.+|.|.+ T Consensus 1 ~e~fG~~tVtfV~i~e~~~~~~G~~~~v~~T~~~~pgc~frPl~~~~~~~~va~~~~t~~~~app~pa~lA~k~~~~li~ 80 (111) T protein:vir:78 1 MERIGEDTVTFVQIGKGAKSDRGIPQAVEESSTDVQWCSFQPLGVHYHVTDVSLPDATDRCLAPPVPAAVACKAGDKLIF 80 (111) T ss_pred CccccCceEEEEEeccCCCCCCCCccchheeecCCCceeeccccccccccccccccccccccCCCcceEEEeCCCCeEEE Confidence 8999976666665555556677654432 22233333332 23333334444544444332 1122336788999999 Q ss_pred CCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 93 DGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 93 dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ||.+|+|+-..-..+.|.+----+-+.= T Consensus 81 dGv~~~i~G~~~~f~dg~~~~~TIi~kR 108 (111) T protein:vir:78 81 HGVSHLVLGVRDHWDKGKLHHLTVITKR 108 (111) T ss_pred cceeeEEeeeeeeccCCCceEEEEEeee Confidence 9999999999998888765433333333 No 15 >protein:vir:101653 Length: 111 # NCBI annotation: gp16 # Family: family:all:3991 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654771;genbank:gi:109302769;genbank:GeneID:4156087 Probab=88.45 E-value=0.012 Score=31.17 Aligned_cols=103 Identities=13% Similarity=0.090 Sum_probs=60.1 Q ss_pred hhhcCCeEEEEEeCCccCCCCcCCCccC-CcccceeEEEe---eccccccCCceEEEcCeEEE-EecCCceeccCCEEEe Q lcl|NC_020844. 18 MAEFQQGTARYIHPGEQTGPDYDPQPGE-PTPYTLDATVR---GVAAQYVQEGYIAASDLQVT-ASVFGREPTLEGVVEI 92 (120) Q Consensus 18 i~kfG~~~~~r~~~g~~~~p~~dp~~~~-~~~~~~~~~~~---~v~~~~idGtlI~~gD~~v~-~a~~~~~p~~~D~v~~ 92 (120) +.+||..++..+.-++...|+|+-.... .+..++.++.+ .++..+-+=+++-..|+-+. -.+.-+.-+.+|.|.+ T Consensus 1 ~e~fG~~tVtfV~i~e~~~~~~G~~~~v~~T~~~~pgc~frPl~~~~~~~~va~~~~t~~~~app~pa~lA~k~~~~li~ 80 (111) T protein:vir:10 1 MERIGEDTVTFVQIGKGAKSDRGIPQAVEESSTDVQWCSFQPLGVHYHVTDVSLPDATDRCLAPPVPAAVACKAGDKLIF 80 (111) T ss_pred CccccCceEEEEEeccCCCCCCCCccchheeecCCCceeeccccccccccccccccccccccCCCcceEEEeCCCCeEEE Confidence 8999976666665555556677654432 22233333332 23333334444544444332 1122336788999999 Q ss_pred CCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 93 DGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 93 dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ||.+|+|+-..-..+.|.+----+-+.= T Consensus 81 dGv~~~i~G~~~~f~dg~~~~~TIi~kR 108 (111) T protein:vir:10 81 HGVSHLVLGVRDHWDKGKLHHLTVITKR 108 (111) T ss_pred cceeeEEeeeeeeccCCCceEEEEEeee Confidence 9999999999998888765433333333 No 16 >protein:vir:95261 Length: 133 # NCBI annotation: Phage hypothetical protein # Family: family:all:31736 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944894;genbank:gi:38707834;genbank:GeneID:2744047 Probab=87.85 E-value=0.015 Score=30.52 Aligned_cols=104 Identities=12% Similarity=0.083 Sum_probs=57.8 Q ss_pred HHHhhhcCCeEEEEEeCCccC-CCCcCCCccCCc-ccceeEEEeeccccccCC-ce------EEEcCeE-EEEec-C--- Q lcl|NC_020844. 15 SELMAEFQQGTARYIHPGEQT-GPDYDPQPGEPT-PYTLDATVRGVAAQYVQE-GY------IAASDLQ-VTASV-F--- 80 (120) Q Consensus 15 ~rLi~kfG~~~~~r~~~g~~~-~p~~dp~~~~~~-~~~~~~~~~~v~~~~idG-tl------I~~gD~~-v~~a~-~--- 80 (120) -+|+..-. =++.|.+.-++| ..+..++.|++. ..++.+.+.+++...++. +. +..+|-. |+..+ + T Consensus 1 M~~~~rhs-~~~~R~~seg~Y~~~~GrWV~g~~~v~~~i~asIQP~~~ss~~~~q~~~lpeGrrit~avrIYTda~L~va 79 (133) T protein:vir:95 1 MRLLNRHS-FVVKRKVSEDGYYNDDGDWVASQDIVEVNCKGNIQPYIKGSVKNGTQIALPEGIRLTDTRILYTTYKLRTS 79 (133) T ss_pred CCccccce-eEEEEeecCCceEccCCcccCCCCccceeeeeeecccccccccccchhcccCCeeeeeEEEEEeeeeeeee Confidence 23333222 133444333334 334556776544 588999999976664432 21 1222332 22211 2 Q ss_pred -CceeccCCEEEeCCeEEEEEeccccCCCCc--EEEEEEE-EeC Q lcl|NC_020844. 81 -GREPTLEGVVEIDGVEKQIIRVDPVPAAGT--PVVWRIF-VKG 120 (120) Q Consensus 81 -~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~--~v~y~~q-vR~ 120 (120) ++.=..||+|++||.+|.|++..||.-+ + +-.|+-+ +|- T Consensus 80 ge~~~~~gDvvl~dg~eYev~~r~~w~~G-v~~isHyrY~aVR~ 122 (133) T protein:vir:95 80 DDVEWNESDIVMIDGHEYEVFMTMDWSQQ-LSHTSHYEYIIIRR 122 (133) T ss_pred cccccCCCcEEEEcCCceEEEEecchhhc-cccCCceeEEEEee Confidence 2244569999999999999999998754 4 4455432 333 No 17 >protein:vir:5258 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4880 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852763;genbank:gi:31544038;uniprot:Q776V7;genbank:GeneID:2777139 Probab=87.02 E-value=0.015 Score=30.66 Aligned_cols=105 Identities=14% Similarity=0.159 Sum_probs=61.5 Q ss_pred CCchhHHHHHHHHHHHHhh-hcCCe-EEEEEeCCccCCCCcCCCccCCcccceeEEEeeccccc----cCCceEEEcCeE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMA-EFQQG-TARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQY----VQEGYIAASDLQ 74 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~-kfG~~-~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~----idGtlI~~gD~~ 74 (120) || |.-....|+. +|-+. +++| ..|+.....| .+. ++..++.|++...+... .+|..|.. -++ T Consensus 1 m~-------~ldvs~v~ldpdF~~titv~R-~~g~~~~~g~-~~~--t~~~t~~avVqP~~~~dlq~LpeG~ri~~-sIk 68 (123) T protein:vir:52 1 MS-------LINQSGRFLNSRFRQQITVQK-QSGSHSASGF-DVR--YEKQQITAIVIPTSPNDVLLLPEGERYLP-SIK 68 (123) T ss_pred CC-------cccccccccCcccCceEEEEc-cCccEeCCcc-ccc--cccceEEEEEeeCChhhcccccccccccc-eEE Confidence 87 4556667777 66653 3443 2444333334 222 34566889988876554 46777743 355 Q ss_pred EEEecCCceeccCCEEEeCCeEEEEEeccccCCCC----cEEEEE--EEEeC Q lcl|NC_020844. 75 VTASVFGREPTLEGVVEIDGVEKQIIRVDPVPAAG----TPVVWR--IFVKG 120 (120) Q Consensus 75 v~~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag----~~v~y~--~qvR~ 120 (120) |+.-. ....||.|..+|+.|+|+++.++.--| ..+-|. .|.++ T Consensus 69 I~Tq~---~L~vGD~vlw~G~~YrVi~~~d~s~YGYy~~i~~~~~~t~~~~~ 117 (123) T protein:vir:52 69 VYTQQ---QLNIGDLVDYRGQTYKIKTAANWGDYGYYNNIGVRHSQTAKVDS 117 (123) T ss_pred EEecc---ccccccEEEeCCcEEEEEEcCCccccceecceeecccccCcccc Confidence 54322 334599999999999999999985433 222222 12222 No 18 >protein:vir:79686 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:2747 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285885;genbank:gi:148750842;genbank:GeneID:5220385 Probab=83.76 E-value=0.066 Score=27.06 Aligned_cols=105 Identities=10% Similarity=0.140 Sum_probs=57.8 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCCcc-CCcccceeEEEeeccccc---cCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQPG-EPTPYTLDATVRGVAAQY---VQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~-~~~~~~~~~~~~~v~~~~---idGtlI~~gD~~v~ 76 (120) || -+ -.+|+-. +-+++++.+-.. .++|+.++- +|+ +++-+.++-...+ -++..++.....++ T Consensus 2 m~--~i-------pk~~l~~--sit~k~~~~~~~-~D~yg~~~y~~p~--~I~nvrvd~~t~ySgt~n~rq~~~navif~ 67 (118) T protein:vir:79 2 KL--PI-------PYQMAVS--TVHLKLTDQSAK-KDRYGRTVPTWEG--DITKCVVNMQTTYSGTNNDRQIVANGLIVM 67 (118) T ss_pred CC--cc-------chhhccc--eEEEEEeccccC-cCCCCCeeccCCe--eeeeeEecccceecccCCCCeEEeceEEEE Confidence 44 11 1333321 124554432111 233665433 332 2344444322222 24555666556555 Q ss_pred EecCCc------eeccCCEEEeCCeEEEEEeccccC--CCCcEEEEEEEEe Q lcl|NC_020844. 77 ASVFGR------EPTLEGVVEIDGVEKQIIRVDPVP--AAGTPVVWRIFVK 119 (120) Q Consensus 77 ~a~~~~------~p~~~D~v~~dg~~~~Vv~v~pi~--pag~~v~y~~qvR 119 (120) .+..+- +=..|++|.+||++|+|.++.|.- ..+.+-+|+|-|= T Consensus 68 y~~~s~p~~~~~~~s~g~kivfdG~eYtI~~i~~~~ep~sn~vy~yElEVi 118 (118) T protein:vir:79 68 YAGYSNPIPTLTKENLGSKLTYQGLDYTVTSLNRFDQPGTEDLYCYELEVI 118 (118) T ss_pred ecccCccccEEeccccccceeeCCeeEEeeeeeecccCcCCcEEEEEEEeC Confidence 554331 123388999999999999999976 6778899999998 No 19 >protein:vir:395 Length: 117 # NCBI annotation: gp10 # Family: family:all:1908 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046905;genbank:gi:9630475;genbank:GeneID:1261649 Probab=80.36 E-value=0.096 Score=26.17 Aligned_cols=101 Identities=11% Similarity=0.008 Sum_probs=55.2 Q ss_pred CCchhH---HHHHHHHH-HHHhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEeecccc--ccCCceEEEcCeE Q lcl|NC_020844. 1 MPSAEI---YGELQGVA-SELMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQ--YVQEGYIAASDLQ 74 (120) Q Consensus 1 ~~~a~~---Y~r~~atA-~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~--~idGtlI~~gD~~ 74 (120) |+ +| |+.+-+.| ...+..||...+.....+. + -++++++-+=+.- ...|..|..+-=. T Consensus 1 m~--~~dNlFd~ama~aD~aI~~~~g~~a~i~~g~~~----------~----rti~gVFDdP~~~~~~aggg~ie~saP~ 64 (117) T protein:vir:39 1 MA--DFDNLFDEAMSRADGAIRGVMGTEAKVMSGTLS----------G----ATLVGVFDDPENIGYAGAGIRVEGTSPT 64 (117) T ss_pred CC--cccchHHHHHHhhhHHHHHhcCceEEEEeCCCC----------c----eEEEEEecCccccccccCceEEeccCcE Confidence 88 65 55444444 4566789974333322211 0 1133333222111 1123334322223 Q ss_pred EEEe-cCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 75 VTAS-VFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 75 v~~a-~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +.+. +.-+.++-+|.|+|+|+.|.|++ +.|.|+=.+|-.-.|| T Consensus 65 LfvktaDv~gl~r~D~vtI~g~~y~V~~---~~pDg~G~~~l~L~rg 108 (117) T protein:vir:39 65 LFVKTSTVSQLQRMDTLTINGRQFWVDR---VGPDDCGSCHIWLGNG 108 (117) T ss_pred EEEeeccccccCCCCEEEECCCceEEee---eccCCCceEEEEeecC Confidence 3332 33345788999999999999888 4567777777777899 No 20 >protein:vir:80934 Length: 120 # NCBI annotation: gp9 # Family: family:all:2747 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468395;genbank:gi:157324969;genbank:GeneID:5601367 Probab=78.28 E-value=0.12 Score=25.71 Aligned_cols=111 Identities=9% Similarity=-0.007 Sum_probs=59.1 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCC-ccCCcccceeEEEeecc---ccccCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQ-PGEPTPYTLDATVRGVA---AQYVQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~-~~~~~~~~~~~~~~~v~---~~~idGtlI~~gD~~v~ 76 (120) |---.=--||-..-.|||-. +-+++... |+ .+|+.. ..+|+. ++-+.++-+ ...-++..++.....++ T Consensus 1 ~~~~~~~~~m~~iPk~~l~~--sit~k~~~-~~---d~~g~~~y~~pv~--I~nvRvd~~~~ysg~~n~rq~~~naviFi 72 (120) T protein:vir:80 1 MKVVKPVTNAPPLPLDWLIH--NISYEAYK-EE---GRHNQVVYEKGFE--IEHVRVDFSKSNQIAGLSDSDRYDAVIFI 72 (120) T ss_pred CcchhhhhhcCCcChhhccc--eEEEEEec-CC---CCCCCccccCcee--ccCeEEecceeeecCCCCceeeeeeEEEE Confidence 10000001111122344321 12444432 22 225443 234432 233333222 22345667777777776 Q ss_pred EecCCc----eeccCCEEEeCCeEEEEEeccccC-CCCcEEEEEEEEe Q lcl|NC_020844. 77 ASVFGR----EPTLEGVVEIDGVEKQIIRVDPVP-AAGTPVVWRIFVK 119 (120) Q Consensus 77 ~a~~~~----~p~~~D~v~~dg~~~~Vv~v~pi~-pag~~v~y~~qvR 119 (120) -+..+- -.+.|.+|.++|++|.|.++.|.- ..+.+=+|+|+|= T Consensus 73 ~a~~S~p~~~~~~~gskI~f~G~eytI~~i~~~~~~s~~vh~yEleVi 120 (120) T protein:vir:80 73 DAVNSMNVPDDFISRSRIFFSGKAYKIVKVIPCYATSENVHHWEIEVI 120 (120) T ss_pred ecccCCccceecccCCEEEeCCceEEEEEeeeccCCCCceeEEEEEeC Confidence 553222 235599999999999999999975 5678999999999 No 21 >protein:vir:1582 Length: 117 # NCBI annotation: minor capsid protein # Family: family:all:2747 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695164;swissprot:trembl:o03932;genbank:gi:23455805;uniprot:O03932;genbank:GeneID:955538 Probab=76.87 E-value=0.13 Score=25.43 Aligned_cols=104 Identities=13% Similarity=0.124 Sum_probs=57.6 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCCcc-CCcccceeEEEeeccccc---cCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQPG-EPTPYTLDATVRGVAAQY---VQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~-~~~~~~~~~~~~~v~~~~---idGtlI~~gD~~v~ 76 (120) || -+ -.+|+-. +-++++....+ .++|+.+.- +|+ +++-+.++-...+ -++..++.....++ T Consensus 2 m~--~i-------pk~~l~~--sitlk~~~~~~--~d~yg~~~y~~pi--~I~nvrvd~~t~ysgt~n~Rq~~~navif~ 66 (117) T protein:vir:15 2 MM--KP-------PKWMCQQ--TITLTLTDPTK--TDEWGQLLTGEPV--TIEHCVVQPQTIYSGSNNDRTIVANAVVFV 66 (117) T ss_pred CC--cc-------chhhccc--eEEEEEeccCC--cCCCCCeeecCce--eeeeeEecccceecccCCCCeEEeceEEEE Confidence 33 00 1333321 11344443322 334655443 432 2344443322222 24455666556565 Q ss_pred EecCCc------eeccCCEEEeCCeEEEEEeccccC--CCCcEEEEEEEEe Q lcl|NC_020844. 77 ASVFGR------EPTLEGVVEIDGVEKQIIRVDPVP--AAGTPVVWRIFVK 119 (120) Q Consensus 77 ~a~~~~------~p~~~D~v~~dg~~~~Vv~v~pi~--pag~~v~y~~qvR 119 (120) .+..+- +=..|++|.++|++|+|.++.|.- ..+.+-+|+|-|= T Consensus 67 y~~~s~P~~~~~~~~~g~ki~f~G~eYtI~~i~~~~ep~sn~vy~yElEVi 117 (117) T protein:vir:15 67 YAGISNPLLTVTKNNVGSKLVFEGEEYTVQKIIDNREPFSNELHSYELEVL 117 (117) T ss_pred ecccCCcceEEecccccceeeeCCeeEEeeeeeecccCcCCcEEEEEEEeC Confidence 554331 123388999999999999999976 6778899999998 No 22 >protein:vir:44 Length: 120 # NCBI annotation: gp9 # Family: family:all:2747 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463470;swissprot:trembl:q9t1b4;genbank:gi:16798792;uniprot:Q9T1B4;genbank:GeneID:922373 Probab=75.56 E-value=0.15 Score=25.18 Aligned_cols=111 Identities=9% Similarity=0.006 Sum_probs=58.7 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCC-ccCCcccceeEEEeecc---ccccCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQ-PGEPTPYTLDATVRGVA---AQYVQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~-~~~~~~~~~~~~~~~v~---~~~idGtlI~~gD~~v~ 76 (120) |---.=--||-..-.|||-. +-+++... |+ .+|+.. ..+|+. ++-+.++-+ ...-++..++.....++ T Consensus 1 ~~~~~~~~~m~~iPk~~l~~--sit~k~~~-~~---d~~g~~~y~~pv~--I~nvRvd~~~~ysg~~n~rq~~~naviFi 72 (120) T protein:vir:44 1 MKVLKPITNAPPLPLDWLIH--NISYEAYK-EE---DRHNQVVYEKGIE--IEHVRVDFSKSNQIAGLSDSDRYDAVIFI 72 (120) T ss_pred CcchhhhhhcCCcChhhccc--eEEEEEec-CC---CCCCCCcccCcee--ccCeEEecceeeecCCCCceeeeeeEEEE Confidence 10000000111111333321 12444432 22 235443 234432 223333222 22345667777777776 Q ss_pred EecCCc----eeccCCEEEeCCeEEEEEeccccC-CCCcEEEEEEEEe Q lcl|NC_020844. 77 ASVFGR----EPTLEGVVEIDGVEKQIIRVDPVP-AAGTPVVWRIFVK 119 (120) Q Consensus 77 ~a~~~~----~p~~~D~v~~dg~~~~Vv~v~pi~-pag~~v~y~~qvR 119 (120) -+..+- ..+.|.+|.++|++|.|.++.|.- ..+.+=+|+++|= T Consensus 73 ~a~~S~p~~~~~~~gskI~f~G~eytI~~i~~~~~~sn~vh~yEleVi 120 (120) T protein:vir:44 73 DAVNSMNVPSDFVSRSRIFFSGKAYKIVKVIPCYATSNSVHHWEIEVI 120 (120) T ss_pred ecccCCccceecCcCCEEEeCCceEEEEeeeeccCCCCceEEEEEEeC Confidence 554331 245599999999999999999965 5678999999999 No 23 >protein:vir:9822 Length: 120 # NCBI annotation: putative minor capsid protein # Family: family:all:1526 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795584;genbank:gi:28876337;genbank:GeneID:1257899 Probab=74.74 E-value=0.15 Score=25.13 Aligned_cols=105 Identities=14% Similarity=0.072 Sum_probs=58.4 Q ss_pred HHHHHHHHhhhcC----CeEEEEEeCCccCCCCcCCCc-cCCcccceeEEEeeccccc---cCCceEEEcCeEEEEecCC Q lcl|NC_020844. 10 LQGVASELMAEFQ----QGTARYIHPGEQTGPDYDPQP-GEPTPYTLDATVRGVAAQY---VQEGYIAASDLQVTASVFG 81 (120) Q Consensus 10 ~~atA~rLi~kfG----~~~~~r~~~g~~~~p~~dp~~-~~~~~~~~~~~~~~v~~~~---idGtlI~~gD~~v~~a~~~ 81 (120) |..--.|||.+=- .-++..+...+ .++|+.+. .+|+ +++-+.++-...+ -++..++.....++.+..+ T Consensus 1 ~~~~~~~~ip~ipk~~l~dsitlk~~~~--~d~yg~~~y~~pi--~I~nvrvd~~t~ySgt~N~Rqi~~navif~y~~~S 76 (120) T protein:vir:98 1 MLGWDIRVLAMIDKRLLIDELQVKLVKD--KGDYGGFVYDEPF--TLSPVRFDRNLATAGKDNARQETKPSVIFIYPKYC 76 (120) T ss_pred CccccccccCcCChhhccceEEEEEecC--cCCCCCeeecCce--eeeeeEecccceecccCCCCeEEeeeEEEEeccCc Confidence 3333344444211 11333333322 23466544 3442 2344444322222 2444555555555555433 Q ss_pred cee-----ccCCEEEeCCeEEEEEeccccCCC--CcEEEEEEEEe Q lcl|NC_020844. 82 REP-----TLEGVVEIDGVEKQIIRVDPVPAA--GTPVVWRIFVK 119 (120) Q Consensus 82 ~~p-----~~~D~v~~dg~~~~Vv~v~pi~pa--g~~v~y~~qvR 119 (120) .| ..|.+|.++|++|+|.++.|.-.+ +.+-+|+|-|= T Consensus 77 -~p~~~~~~~g~ki~~~g~eYtI~kI~~~y~p~sn~vysyElEVi 120 (120) T protein:vir:98 77 -KTVADRSWVDAVVIDGDTEYTVDKVIPVYHPLTNKIFCFEVEVI 120 (120) T ss_pred -ceeeeecccccEEEeCCeeEEeeeeeecccCcCCcEEEEEEEEC Confidence 23 347889999999999999998755 78899999998 No 24 >protein:vir:3035 Length: 110 # NCBI annotation: minor capsid protein # Family: family:all:1526 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438148;genbank:gi:16271811;genbank:GeneID:929234 Probab=67.19 E-value=0.26 Score=23.83 Aligned_cols=98 Identities=13% Similarity=0.058 Sum_probs=55.9 Q ss_pred HhhhcC-CeEEEEEeCCccCCCCcCCCc-cCCcccceeEEEeeccccc---cCCceEEEcCeEEEEecCCceec-----c Q lcl|NC_020844. 17 LMAEFQ-QGTARYIHPGEQTGPDYDPQP-GEPTPYTLDATVRGVAAQY---VQEGYIAASDLQVTASVFGREPT-----L 86 (120) Q Consensus 17 Li~kfG-~~~~~r~~~g~~~~p~~dp~~-~~~~~~~~~~~~~~v~~~~---idGtlI~~gD~~v~~a~~~~~p~-----~ 86 (120) ||.|.= .-++..+...+ .++|+.+. .+|+ +++-+.++-...+ -++..++.....++.+..+ .|. . T Consensus 1 ~ipk~~l~~sit~k~~~~--~d~yg~~~y~~pi--~I~nvrvd~~t~ysgt~n~rq~~~navif~y~~~s-~p~~~~~~~ 75 (110) T protein:vir:30 1 MIDKRLLIDELQVKLVKD--KGDYGGFVYDEPF--TLSPVRFDRNLATAGKDNARQETKPSVIFIYPKYC-KTVADRSWV 75 (110) T ss_pred CcChhhcceeEEEEEecC--cCCCCCeeecCce--eeeeeEecccceecccCCCCeEEeeeEEEEeccCc-ceeeeeccc Confidence 444321 11333333222 33466544 3442 2344443322222 2444555555555555433 233 3 Q ss_pred CCEEEeCCeEEEEEeccccCCC--CcEEEEEEEEe Q lcl|NC_020844. 87 EGVVEIDGVEKQIIRVDPVPAA--GTPVVWRIFVK 119 (120) Q Consensus 87 ~D~v~~dg~~~~Vv~v~pi~pa--g~~v~y~~qvR 119 (120) |.+|.++|++|+|.++.|.--+ +.+-+|+|-|= T Consensus 76 g~ki~~~g~eYtI~kii~~~~p~sn~vy~yElEVi 110 (110) T protein:vir:30 76 DAVVIDGDTEYTVDKVIPVYHPLTNKIFCFEVEVI 110 (110) T ss_pred ccEEEeCCeeEEeeeeeeccCCcCCcEEEEEEEEC Confidence 7889999999999999998755 78899999998 No 25 >protein:vir:5977 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:788 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690677;genbank:geneid:6329133;genbank:gi:22855071;interpro:IPR013045;uniprot:O48446;genbank:GeneID:955315 Probab=64.07 E-value=0.31 Score=23.40 Aligned_cols=102 Identities=11% Similarity=-0.013 Sum_probs=52.7 Q ss_pred HhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEeecccccc--CCceEEEcCeEEEEecCCceeccCCEEEeCC Q lcl|NC_020844. 17 LMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQYV--QEGYIAASDLQVTASVFGREPTLEGVVEIDG 94 (120) Q Consensus 17 Li~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~i--dGtlI~~gD~~v~~a~~~~~p~~~D~v~~dg 94 (120) |+.|+-. -+...++....++..++...-...+++-|.+...+.+.. -+........+|.+=-.. ....+++|.++| T Consensus 1 ~~~~L~~-RI~i~~~~~~~D~~G~~~~~w~~~~~~WA~v~~~sg~E~~~a~~~~~~~~~~i~iRy~~-~I~~~~Ri~~~g 78 (109) T protein:vir:59 1 MYEEFPD-VITFQSYVEQSNGEGGKTYKWVDEFTAAAHVQPISQEEYYKAQQLQTPIGYNIYTPYDD-RIDKKMRVIYRG 78 (109) T ss_pred CccccCc-cEEEEeeeeeeCCCCCeeeeeEeeEEEEEEEecCChhheeeccccceeeEEEEEEeeCC-CCCcccEEEECC Confidence 8999986 333333333333323232221223456777776655532 233332333344432111 235689999999 Q ss_pred eEEEEEec-cccCCCCcEEEEEEEEeC Q lcl|NC_020844. 95 VEKQIIRV-DPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 95 ~~~~Vv~v-~pi~pag~~v~y~~qvR~ 120 (120) ..|.|+++ .+.+-....++..++--| T Consensus 79 r~y~I~~v~~d~~~~~~~l~~~~~e~g 105 (109) T protein:vir:59 79 KIVTFIGDPVDLSGLQEITRIKGKEDG 105 (109) T ss_pred eEEEEEeccCCCCCCeEEEEEEEEEee Confidence 99999986 334433334444444444 No 26 >protein:vir:1385 Length: 107 # NCBI annotation: Gp8 protein # Family: family:all:3858 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612837;genbank:gi:20065971;genbank:GeneID:935786 Probab=45.02 E-value=0.78 Score=21.16 Aligned_cols=99 Identities=13% Similarity=0.005 Sum_probs=55.6 Q ss_pred hhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEeeccccc--cCCceEEEcCeEEEEec-CCce---eccCCEEE Q lcl|NC_020844. 18 MAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQY--VQEGYIAASDLQVTASV-FGRE---PTLEGVVE 91 (120) Q Consensus 18 i~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~--idGtlI~~gD~~v~~a~-~~~~---p~~~D~v~ 91 (120) +++|-+=++.+.. ...++..++...-...+++-+.+...+-++ ..|........++.+-- ..++ +..+++|. T Consensus 1 ~~~~hRI~i~~~~--~~~D~~G~~~~~w~~~~~~WA~v~~~~g~E~~~a~~~~~~~~~~f~iRy~~~i~~~~~t~~~Ri~ 78 (107) T protein:vir:13 1 MARYERISIKKLE--EKNIKGRRQEECLIPFYDCWAEILDLYGQELYGALQMKLENTIIFKIRYCKKVEELRNKENFIVE 78 (107) T ss_pred CCcceEEEEEeee--eeeCCCCCeecceEeEEEEEEEEecCCchheeecceeheeeeEEEEEEecCCccccccCcCcEEE Confidence 5556543443332 223333333332222367777777765553 33444333444555421 2333 46689999 Q ss_pred eCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 92 IDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 92 ~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) ++|..|.|.++.+...... ..++.|+. T Consensus 79 ~~g~~y~I~~v~~~~~~~~--~l~i~c~e 105 (107) T protein:vir:13 79 WQGRKYEIYYPDFLGYNKQ--FVKLKCKE 105 (107) T ss_pred ECCeEEEEEecCCcccCCe--EEEEEEEE Confidence 9999999999988877764 33444444 No 27 >protein:vir:98924 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:2747 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164421;genbank:gi:56694911;genbank:GeneID:3197315 Probab=44.78 E-value=0.79 Score=21.13 Aligned_cols=102 Identities=14% Similarity=0.157 Sum_probs=53.6 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCC-ccCCcccceeEEEeeccccc---cCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQ-PGEPTPYTLDATVRGVAAQY---VQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~-~~~~~~~~~~~~~~~v~~~~---idGtlI~~gD~~v~ 76 (120) || ..-.|||-. +-+++.. .| ..+|+.. ..+|+ +++-+..+-+..+ -++-.+......++ T Consensus 3 m~---------~ipk~~l~~--sit~k~~-~~---~dd~g~~~y~~pv--~I~nvrv~~~~~ysgt~n~rq~~~naviF~ 65 (113) T protein:vir:98 3 MP---------KPPIDFLVD--SFMYKEY-MG---ENSWSEPEYARPV--LISNCRIDRGAEYTSTTSGRQLLYNAVVFC 65 (113) T ss_pred cC---------cCChhhccc--eEEEEEe-cc---cCCCCCcccCCcE--eecceEecccceeeccCCCceeeeeeEEEE Confidence 22 011444431 1234433 22 2235553 33332 2233332211111 12334444445555 Q ss_pred EecCC---ceeccCCEEEeCCeEEEEEeccccC--CCCcEEEEEEEEe Q lcl|NC_020844. 77 ASVFG---REPTLEGVVEIDGVEKQIIRVDPVP--AAGTPVVWRIFVK 119 (120) Q Consensus 77 ~a~~~---~~p~~~D~v~~dg~~~~Vv~v~pi~--pag~~v~y~~qvR 119 (120) .+..+ .+-.-+++|.+||++|+|.++.+.- ..+.+-+|+|-|= T Consensus 66 ya~~S~p~~~~~~~skivfdG~eytI~~i~~~~e~~sn~v~~yELEVi 113 (113) T protein:vir:98 66 YEGMTTPLPQFKAQSVLHFDGRDHVITKVIPNHEAYSKTLYSYELEVV 113 (113) T ss_pred ecccCccceEecCCCeEEeCCcceEEeeeccCcCCCCCceeEEEEEEC Confidence 44322 2344579999999999999999976 5788999999999 No 28 >protein:vir:106729 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944314;genbank:gi:38638613;genbank:GeneID:2657358 Probab=38.72 E-value=1.1 Score=20.46 Aligned_cols=108 Identities=12% Similarity=-0.060 Sum_probs=64.6 Q ss_pred HHHHHHHHHHhhhcC---CeEEEEEeCCccCCCCcCCCccCCc--ccceeEEEeec---cccccCCceEEEcCeEEEEec Q lcl|NC_020844. 8 GELQGVASELMAEFQ---QGTARYIHPGEQTGPDYDPQPGEPT--PYTLDATVRGV---AAQYVQEGYIAASDLQVTASV 79 (120) Q Consensus 8 ~r~~atA~rLi~kfG---~~~~~r~~~g~~~~p~~dp~~~~~~--~~~~~~~~~~v---~~~~idGtlI~~gD~~v~~a~ 79 (120) =++.+.|.+-|.--. .+++++- .|... .+-..+|. +.++..-+..+ +.++.||-.|+.-=+.+++.+ T Consensus 1 MNLh~Ia~~aI~aVNP~~pA~l~~s-tG~t~----~~G~r~p~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~G 75 (152) T protein:vir:10 1 MNLHDIVRGAITQVNPDEPGTMFVS-TGRNN----VRGILTPTFSSVDAQLQIQAQKHTPLQHERGALYTNSFLTVYAYG 75 (152) T ss_pred CchHHhhhhhhhccCCCCceEEEEe-cccee----cCceecceeccceeEEEEeecCchHHHHhhcccccceeeEEEecc Confidence 346666666666443 3455553 33211 11122221 22333333344 344678877754334455544 Q ss_pred --CCc---eeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 80 --FGR---EPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 80 --~~~---~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) .++ .=+=||++.|+|++|.|+.+-=.+|.=.-++-.+|+-+ T Consensus 76 n~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:10 76 KFDDLSRPLGKGGDFAAFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred chhheechhhcCccEEEECCceEEEEEcccccccceeeeeeeccCh Confidence 122 12349999999999999999999999888888888888 No 29 >protein:vir:94034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:3177 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453621;genbank:gi:84662657;genbank:GeneID:5142544 Probab=37.53 E-value=1.1 Score=20.33 Aligned_cols=114 Identities=17% Similarity=-0.013 Sum_probs=62.3 Q ss_pred CCchhHHHHHHHHHHHHhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEE---Eeec---cccccCCceEEEcCeE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDAT---VRGV---AAQYVQEGYIAASDLQ 74 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~---~~~v---~~~~idGtlI~~gD~~ 74 (120) |.|--+. .+..-|.+-+.++-.+++++- .|... .+...+|.-.+...+ +..+ +.++.||-.|+.-=+. T Consensus 1 ~~~MNLh-~Ia~~aI~aVNP~~pa~l~~s-tG~t~----~~G~~~P~y~~~~a~~iQ~QalS~~dL~h~dgln~QG~~~~ 74 (141) T protein:vir:94 1 MSGLNLH-RIVRGPIQVVNPDVPGDVYIS-TGHTT----LRGIVTPTFQRLPAQRLQVQAVTTNDLYQLNGLGYAKDTQK 74 (141) T ss_pred CCcchhH-HHhhhhhcccCCCCceEEEEe-eccEe----cCCceEeeeecccceEEEeeccChhHHHHhhcccccceeeE Confidence 5543322 333333333335544555553 33221 111112221111111 2222 4456788777543334 Q ss_pred EEEec--CCc---eeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 75 VTASV--FGR---EPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 75 v~~a~--~~~---~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +++.+ .++ .=+=||++.|+|++|.|+.+-=.+|.=.-++-.+|+-+ T Consensus 75 iY~~G~~~gv~R~~~~GGD~~vf~g~~WLV~~v~E~WpDWc~v~v~lQ~~~ 125 (141) T protein:vir:94 75 LYAYGTLSGIVRPEGKGGDLVNLANTWWAIQGVIEWWPQWCSVAITRQVDA 125 (141) T ss_pred EEeccchhheechhhcCccEEEECCceEEEEEcccccccceeEeeeeccCh Confidence 55544 112 12349999999999999999999999777888888877 No 30 >protein:vir:8431 Length: 114 # NCBI annotation: gp26 # Family: family:all:3991 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818327;genbank:gi:29566763;genbank:GeneID:1260060 Probab=33.23 E-value=1.4 Score=19.83 Aligned_cols=102 Identities=20% Similarity=0.177 Sum_probs=51.4 Q ss_pred hhcCCeEEEEEeC--CccCCCCcCCCccCCcccceeEEEee-ccccc-cCCceEEEcCe-EEEEec--CCceeccCCEEE Q lcl|NC_020844. 19 AEFQQGTARYIHP--GEQTGPDYDPQPGEPTPYTLDATVRG-VAAQY-VQEGYIAASDL-QVTASV--FGREPTLEGVVE 91 (120) Q Consensus 19 ~kfG~~~~~r~~~--g~~~~p~~dp~~~~~~~~~~~~~~~~-v~~~~-idGtlI~~gD~-~v~~a~--~~~~p~~~D~v~ 91 (120) -+||..++..+.- +.-..++|+-.....+..++.++.++ ...++ .++..=..++. +.++-+ .-+.-+.+|.+. T Consensus 1 m~fG~~tVtfV~i~e~~~~~~~~G~~~~vrT~~~~pgc~frPlt~~e~~~~~~~~~t~~wk~taPp~~av~A~k~~~~li 80 (114) T protein:vir:84 1 MKFGGQTVTFVTITEDLDDRDDYGNPREIRTEVPVPGCRFRPLTAKEKVEFGYNTVADPWRCTAPPVPAVMAAGATGELI 80 (114) T ss_pred CccCCceEEEEEeccCCCCCCCCCCchhheeecCCCceeecccccccccccceeecccchhhccCcchhheeeCCCCeEE Confidence 6798655555443 33334666654433233344444332 22221 22222112222 332211 123567899999 Q ss_pred eCCeEEEEEeccccCCC--CcEEEEEEEEeC Q lcl|NC_020844. 92 IDGVEKQIIRVDPVPAA--GTPVVWRIFVKG 120 (120) Q Consensus 92 ~dg~~~~Vv~v~pi~pa--g~~v~y~~qvR~ 120 (120) +||.+|+|+-..-..+. |.+----+-+.= T Consensus 81 ~dGv~~~i~G~~~~f~~~~Gk~~~~TIi~kR 111 (114) T protein:vir:84 81 YDGVTYEITGGARTFPNFAGKPFKVTIICER 111 (114) T ss_pred EcceeeEEecchhhccccCCceeEEEEEeee Confidence 99999999999888876 733222222211 No 31 >protein:vir:78609 Length: 152 # NCBI annotation: BcepNY3gp05 # Family: family:all:3177 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294842;genbank:gi:149882905;genbank:GeneID:5291080 Probab=32.99 E-value=1.4 Score=19.81 Aligned_cols=108 Identities=13% Similarity=-0.047 Sum_probs=64.1 Q ss_pred HHHHHHHHHHhhhc---CCeEEEEEeCCccCCCCcCCCccCCc--ccceeEEEeec---cccccCCceEEEcCeEEEEec Q lcl|NC_020844. 8 GELQGVASELMAEF---QQGTARYIHPGEQTGPDYDPQPGEPT--PYTLDATVRGV---AAQYVQEGYIAASDLQVTASV 79 (120) Q Consensus 8 ~r~~atA~rLi~kf---G~~~~~r~~~g~~~~p~~dp~~~~~~--~~~~~~~~~~v---~~~~idGtlI~~gD~~v~~a~ 79 (120) =++.+.|.+-|.-- -.+++++- .|. +.+ -+ ..+|. +.++..-+..+ +.++.||-.|+.-=+.+++.+ T Consensus 1 MNLh~Ia~~aI~aVNP~~~A~l~~s-tG~-T~~-~G--~r~P~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~G 75 (152) T protein:vir:78 1 MNLHDIVRGAITQVNPDEAGTMFVS-TGR-TNV-RG--ILTPTFSSIDAQLQIQAQKHTPLQHERGALYTNSFLTVYAYG 75 (152) T ss_pred CchHHhhhhhhhccCCCCceEEEEe-ece-EcC-CC--cccceecceeeEEEEeecCchHHHHhhcccccceeeEEEecc Confidence 34666666666644 33455553 332 111 11 12221 12233333334 344678877754334455544 Q ss_pred --CCc---eeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 80 --FGR---EPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 80 --~~~---~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) .++ .=+=||++.|+|++|.|+.+-=.+|.=.-++-.+|+-+ T Consensus 76 n~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:78 76 KFDDLSRPLGKGGDFAAFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred chhheechhhcCccEEEECCceEEEEEcccccccceeeeeeeccCh Confidence 122 12349999999999999999999999888888888888 No 32 >protein:vir:77651 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022740;genbank:gi:47835021;genbank:GeneID:2821448 Probab=32.86 E-value=1.2 Score=20.12 Aligned_cols=108 Identities=12% Similarity=-0.066 Sum_probs=64.0 Q ss_pred HHHHHHHHHHhhhcC---CeEEEEEeCCccCCCCcCCCccCCc--ccceeEEEeec---cccccCCceEEEcCeEEEEec Q lcl|NC_020844. 8 GELQGVASELMAEFQ---QGTARYIHPGEQTGPDYDPQPGEPT--PYTLDATVRGV---AAQYVQEGYIAASDLQVTASV 79 (120) Q Consensus 8 ~r~~atA~rLi~kfG---~~~~~r~~~g~~~~p~~dp~~~~~~--~~~~~~~~~~v---~~~~idGtlI~~gD~~v~~a~ 79 (120) =++.+.|.+-|.--. .+++++- .|. +.++. ..+|. +.++..-+..+ +.++.||-.|+.-=+.+++.+ T Consensus 1 MNLh~Ia~~aI~aVNP~~pA~l~~s-tG~-T~~~G---~r~P~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~G 75 (152) T protein:vir:77 1 MNLHDIVRGAITQVNPDEPGTMFVS-TGR-TNVRG---ILTPMFSSVNAQLQIQAQKHTPLQHERGALYTNSFLTVYAYG 75 (152) T ss_pred CchHHhhhhhhhccCCCCceEEEEe-ece-EcCCC---cccceecceeeEEEEeecCchHHHHhhcccccceeeEEEecc Confidence 346666766666443 3455553 332 22211 12221 12233333333 344678877754334455544 Q ss_pred --CCc---eeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 80 --FGR---EPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 80 --~~~---~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) .++ .=+=||++.|+|++|.|+.+-=.+|.=.-+.-.+|+-+ T Consensus 76 n~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:77 76 KFDDLSRPLGKGGDFASFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred chhheechhhcCccEEEECCceEEEEEcccccchhhhhhhhhhhch Confidence 122 12349999999999999999999999888888888887 No 33 >protein:vir:102144 Length: 113 # NCBI annotation: phage head-tail adaptor, putative # Family: family:all:3858 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699939;genbank:gi:110804032;genbank:GeneID:4206688 Probab=31.89 E-value=1.5 Score=19.68 Aligned_cols=101 Identities=9% Similarity=-0.078 Sum_probs=53.6 Q ss_pred hhcCC--eEEEEEeCCccCCCCcCCCccCCc-ccceeEEEeeccccc--cCCceEEEcCeEEEEe-------cCCceecc Q lcl|NC_020844. 19 AEFQQ--GTARYIHPGEQTGPDYDPQPGEPT-PYTLDATVRGVAAQY--VQEGYIAASDLQVTAS-------VFGREPTL 86 (120) Q Consensus 19 ~kfG~--~~~~r~~~g~~~~p~~dp~~~~~~-~~~~~~~~~~v~~~~--idGtlI~~gD~~v~~a-------~~~~~p~~ 86 (120) =+.|. ..+...++....+. ++....+-+ .+++-+.+...+.+. ..+.+.......+.+- ........ T Consensus 1 M~~G~L~~rI~i~~~~~~~d~-~G~~~~~w~~~~~~wA~v~~~~g~E~~~a~~~~~~~~~~f~iRy~~~i~~~~~~~it~ 79 (113) T protein:vir:10 1 MAECRLNERIIIEELAIIQNS-NGFEEEKWHEYYRCWSSFKKVKGSKFIAAKADNAENIVTFTIRYCNKVKILLDIEAIN 79 (113) T ss_pred CCccccCceEEEEeeeeccCC-CCCeecceEeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecCCCcccccccCCC Confidence 12232 12333333333333 333332322 356677776665553 2344433333444432 11224567 Q ss_pred CCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 87 EGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 87 ~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +++|..+|..|.|.++.+.+..+.-+...+..-. T Consensus 80 ~~ri~~~g~~y~I~~i~~~~~~~~~l~i~~~~v~ 113 (113) T protein:vir:10 80 KFRINFKGHYYKLEYVDDYDQGHEWVDLKAKIIS 113 (113) T ss_pred CCeEEECCeEEEEEecCCcccCCeEEEEEEEEeC Confidence 8999999999999999998888765544444444 No 34 >protein:vir:94767 Length: 104 # NCBI annotation: unknown # Family: family:all:1270 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996709;genbank:gi:45597424;genbank:GeneID:2769039 Probab=30.75 E-value=1.4 Score=19.78 Aligned_cols=93 Identities=11% Similarity=0.108 Sum_probs=46.0 Q ss_pred EEEEeCCcc-CCCCcCCCccCCcccceeEEEee-ccccccCCceEEEcCeEEEE-e-cCC-ceeccCCEEEeCCeEEEEE Q lcl|NC_020844. 26 ARYIHPGEQ-TGPDYDPQPGEPTPYTLDATVRG-VAAQYVQEGYIAASDLQVTA-S-VFG-REPTLEGVVEIDGVEKQII 100 (120) Q Consensus 26 ~~r~~~g~~-~~p~~dp~~~~~~~~~~~~~~~~-v~~~~idGtlI~~gD~~v~~-a-~~~-~~p~~~D~v~~dg~~~~Vv 100 (120) +.++.+... .++ ++-+..+.+...++.+... ....+++..+-..|-+-.+. + +-. ...--|..|.+.|++|+|+ T Consensus 1 Vtl~~~~~~G~D~-~g~pi~~~~~e~V~nVLV~P~s~~d~~~~~~p~G~~v~~tla~PK~~~~~l~g~~V~~~G~~~~vv 79 (104) T protein:vir:94 1 MTLIDKVETGKDP-FGNPIYEDKEIVVNNVLVSPTSSDDIVNQLTLTGKKAIYTLAIPKKDTHDWENKKVRFFGKTWRTF 79 (104) T ss_pred CEeccceecCcCC-CCCCcccccceEcCceeeCCCChhhcccccCcCceEEEEEEecCCCCCCcccCceEEEeCcEEEEe Confidence 333333222 244 5555555555666666654 34445544333333222221 1 111 1244588899999999987 Q ss_pred eccccCCCC--cEEEEEEEEeC Q lcl|NC_020844. 101 RVDPVPAAG--TPVVWRIFVKG 120 (120) Q Consensus 101 ~v~pi~pag--~~v~y~~qvR~ 120 (120) - +|+...+ ....|+..+-- T Consensus 80 G-dP~~~~~~~~P~~WN~~V~v 100 (104) T protein:vir:94 80 G-EPLEGIEELIPLDWNKKVTV 100 (104) T ss_pred c-CCccccCCcCCcccCCeEEE Confidence 4 3433333 44556555544 No 35 >protein:vir:101560 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958110;genbank:gi:41057656;genbank:GeneID:2716817 Probab=29.46 E-value=1.6 Score=19.45 Aligned_cols=108 Identities=12% Similarity=-0.067 Sum_probs=63.3 Q ss_pred HHHHHHHHHHhhhcC---CeEEEEEeCCccCCCCcCCCccCCc--ccceeEEEeec---cccccCCceEEEcCeEEEEec Q lcl|NC_020844. 8 GELQGVASELMAEFQ---QGTARYIHPGEQTGPDYDPQPGEPT--PYTLDATVRGV---AAQYVQEGYIAASDLQVTASV 79 (120) Q Consensus 8 ~r~~atA~rLi~kfG---~~~~~r~~~g~~~~p~~dp~~~~~~--~~~~~~~~~~v---~~~~idGtlI~~gD~~v~~a~ 79 (120) =++.+.|.+-|.--. .+++++- .|. +.++. ..+|. +.++..-+..+ +.++.||-.|+.-=+.+++.+ T Consensus 1 MNLh~Ia~~aI~aVNP~~pA~l~~s-tG~-T~~~G---~r~P~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~G 75 (152) T protein:vir:10 1 MNLHDIVRGAITQVNPDEPGTMFVS-TGR-TNVRG---ILTPMFSSVNAQLQIQAQKHTPLQHERGALYTNSFLTVYAYG 75 (152) T ss_pred CchHHhhhhhhcccCCCCceEEEEe-ece-EcCCC---cccceecceeeEEEEeecCchHHHHhhcccccceeeEeEecc Confidence 346666666666443 3455553 332 22211 12221 12233333334 344678877754334455544 Q ss_pred --CCc---eeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 80 --FGR---EPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 80 --~~~---~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) .++ .=+=||++.|+|++|.|+.+-=.+|.=.-++-.+|+-+ T Consensus 76 n~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:10 76 KFDDLSRPLGKGGDFAAFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred chhheechhhcCccEEEECCceEEEEEcccccchhhhhhhhhhhch Confidence 122 12349999999999999999999999777777888877 No 36 >protein:vir:81177 Length: 109 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285814;genbank:gi:148747735;genbank:GeneID:5247220 Probab=28.59 E-value=1.7 Score=19.27 Aligned_cols=100 Identities=10% Similarity=-0.103 Sum_probs=52.9 Q ss_pred CCchhHHHHHHHHHHHHhhhcCC--eEEEEEeCCccCCCCcCCCccCCc-ccceeEEEeecccccc--CCceEEEcCeEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQ--GTARYIHPGEQTGPDYDPQPGEPT-PYTLDATVRGVAAQYV--QEGYIAASDLQV 75 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~--~~~~r~~~g~~~~p~~dp~~~~~~-~~~~~~~~~~v~~~~i--dGtlI~~gD~~v 75 (120) |- .|+ --+...++....++ ++....+.+ .+++-+.+...+.+.. .+.+...-...+ T Consensus 1 M~------------------~g~L~~rI~i~~~~~~~d~-~G~~~~~w~~~~~~wA~v~~~s~~e~~~a~~~~~~~~~~f 61 (109) T protein:vir:81 1 MN------------------PGQFRHKITLMKLVTTQDE-IGNTIEEWQPVRTCWAAIKTVNGREYFAAASVQAERTYRF 61 (109) T ss_pred CC------------------ccccCccEEEEeeeeeeCC-CCCeecceeeEEEEEEEEEecCchheeeccceeeeeeEEE Confidence 11 222 12222222222222 333333322 2567777777655533 232222222334 Q ss_pred EEecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 76 TASVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 76 ~~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) .+-... ..+..++|.++|+.|.|.++.|......-+...+.-.+ T Consensus 62 ~iR~~~-~i~~~~ri~~~g~~y~I~~v~~~~~~~~~l~i~~~e~~ 105 (109) T protein:vir:81 62 IIRYTP-GINETMKIDYQGRLFDIQSVLNDDEGKKTLTIIATERV 105 (109) T ss_pred EEEeCC-CCCcccEEEECCeEEEEEeecCCccCCcEEEEEEEEee Confidence 433211 35779999999999999999999888866666555444 No 37 >protein:vir:9762 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:1270 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795524;genbank:gi:28876280;genbank:GeneID:1257821 Probab=27.49 E-value=1.8 Score=19.13 Aligned_cols=102 Identities=9% Similarity=-0.009 Sum_probs=51.2 Q ss_pred HhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEee-ccccccCCceEEEcCeEEEEecCC---ceeccCCEEEe Q lcl|NC_020844. 17 LMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRG-VAAQYVQEGYIAASDLQVTASVFG---REPTLEGVVEI 92 (120) Q Consensus 17 Li~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~-v~~~~idGtlI~~gD~~v~~a~~~---~~p~~~D~v~~ 92 (120) |=.-.| .++..+++......+++-+..+.+..+++.+... -...+++..+-..|-+-.+..+.. -..--|..|.+ T Consensus 1 m~~ikG-etVtvi~~~~tG~D~~g~p~~~~~~e~V~nVLV~P~s~~d~~~~~~p~G~~v~~tl~fPK~~~~~lrg~~V~~ 79 (112) T protein:vir:97 1 MGKLRG-ITITLIDKVTIDIDPFGNPIKKDKEISVDNVLVSPATSDDITSQLSLSGKKAVYTLAIPKGDNHDWGDKEVRF 79 (112) T ss_pred Cccccc-eeEEEeccccccccCCCCceecccceecCcEEeCCCChhhcccccCcCceEEEEEEecCCCCCCcccCcEEEE Confidence 212234 3555554433222335555555555667777654 344455444433433322211110 02344888999 Q ss_pred CCeEEEEEeccccCCCC--cEEEEEEEEeC Q lcl|NC_020844. 93 DGVEKQIIRVDPVPAAG--TPVVWRIFVKG 120 (120) Q Consensus 93 dg~~~~Vv~v~pi~pag--~~v~y~~qvR~ 120 (120) .|++|+||- .|+...+ ....|+..|-- T Consensus 80 ~G~~~~vvG-~P~~~~~~~~P~~WN~~V~V 108 (112) T protein:vir:97 80 FGEKWRTVG-LALEGIEELIPLEWNKKVMV 108 (112) T ss_pred eCCeeEEec-CCccccCCCCCCccCCeEEE Confidence 999999885 3433333 45566665555 No 38 >protein:vir:9577 Length: 112 # NCBI annotation: gp43 # Family: family:all:1270 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862882;genbank:gi:32469474;genbank:GeneID:1461319 Probab=26.07 E-value=2 Score=18.95 Aligned_cols=102 Identities=15% Similarity=0.057 Sum_probs=52.0 Q ss_pred HhhhcCCeEEEEEeCCccCCCCcCCCccCCcccceeEEEee-ccccccCCceEEEcCeEEEE-e-cCC-ceeccCCEEEe Q lcl|NC_020844. 17 LMAEFQQGTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRG-VAAQYVQEGYIAASDLQVTA-S-VFG-REPTLEGVVEI 92 (120) Q Consensus 17 Li~kfG~~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~-v~~~~idGtlI~~gD~~v~~-a-~~~-~~p~~~D~v~~ 92 (120) |=.-.| .++..+++......+++-+..+.+...++.+... ....+++..+-..|-+-.+. + +-. ...--|..|.+ T Consensus 1 m~~i~G-etVtvi~~~~tG~D~~G~p~~e~~~e~V~nVLV~P~s~~d~~~~~~p~G~~v~~tla~PK~~~~~l~g~~V~~ 79 (112) T protein:vir:95 1 MGRIKG-ITVTLIGKTKTGKDDFGHPIYENTEIQVDNVLVVPASTEDVTNQLNLTGKKASYTLGIPKGDQNEWKDREVRF 79 (112) T ss_pred Cccccc-eeEEEecceeccccCCCCCeeeccceecCceEeCCCChhhcccccCcceeEEEEEEecCCCCCCcccCcEEEE Confidence 112234 3555554433222335555556666667777654 44445554433333222221 1 111 13445888999 Q ss_pred CCeEEEEEeccccCCCC--cEEEEEEEEeC Q lcl|NC_020844. 93 DGVEKQIIRVDPVPAAG--TPVVWRIFVKG 120 (120) Q Consensus 93 dg~~~~Vv~v~pi~pag--~~v~y~~qvR~ 120 (120) .|++|+|+- +|+...+ ....|+..|-- T Consensus 80 ~G~~~~vvG-~P~~~~~~~~P~~WN~~V~v 108 (112) T protein:vir:95 80 FGRKWRTIG-IPLEGIEAMMPLDWNKKVMV 108 (112) T ss_pred eCcEEEEec-CCccccCCCCCCccCCeEEE Confidence 999999874 3443333 55666666555 No 39 >protein:vir:4459 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700382;genbank:gi:23505454;genbank:GeneID:955661 Probab=23.30 E-value=2.3 Score=18.58 Aligned_cols=115 Identities=13% Similarity=-0.007 Sum_probs=55.7 Q ss_pred CCchhHHHHHHHHHHHHhhhcCC--eEEEEEeCCccCCCCcCCCccCCcccceeEEEeeccccc--cCCceEEEcCeEEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQ--GTARYIHPGEQTGPDYDPQPGEPTPYTLDATVRGVAAQY--VQEGYIAASDLQVT 76 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~--~~~~r~~~g~~~~p~~dp~~~~~~~~~~~~~~~~v~~~~--idGtlI~~gD~~v~ 76 (120) |- -.|=.+ .|..++=+.|+ --+...++....++..++...-...+.+-+.+...+.+. ..+.+.......+. T Consensus 1 ~~-~~~~~~---~~~~~~M~aG~L~~RI~i~~~~~~~D~~G~~~~~w~~~~~vwA~v~~~sg~E~~~a~~~~~~~t~~i~ 76 (134) T protein:vir:44 1 MK-IRQAQT---SATYLLPDPGELDQRIVIRRRVDVPADDFGVTPTYPEQIRTWAKKAQPGAAAYQGSVQIENRVTHYFT 76 (134) T ss_pred Cc-cccccc---eeeEeccCccccCccEEEEeeeeeeCCCCCeecceEeeEEEEEEEEecCchheeeccceeeeeeEEEE Confidence 21 111111 12222223353 133333333333333333222222356777777766553 23333222233444 Q ss_pred EecCCceeccCCEEEeCCeEEEEEeccccCCCCcEEEEEEEEeC Q lcl|NC_020844. 77 ASVFGREPTLEGVVEIDGVEKQIIRVDPVPAAGTPVVWRIFVKG 120 (120) Q Consensus 77 ~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi~pag~~v~y~~qvR~ 120 (120) +-... ....+++|.++|..|.|..+.+....+.-+...+.-=| T Consensus 77 IR~~~-~It~~~RI~~~g~~y~I~~I~~~~~~~~~L~i~c~evg 119 (134) T protein:vir:44 77 IRFRR-GITADHEVLHDDISYRVKRVRDLNGKRRFLLIECEALG 119 (134) T ss_pred EEeCC-CCCcccEEEECCeEEEEEEecCCCcCCcEEEEEEEEee Confidence 43211 24568999999999999999988887764443333222 No 40 >protein:vir:100134 Length: 109 # NCBI annotation: gp8 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945038;genbank:gi:38707898;genbank:GeneID:2744181 Probab=21.23 E-value=2.6 Score=18.28 Aligned_cols=102 Identities=16% Similarity=0.063 Sum_probs=52.3 Q ss_pred CCchhHHHHHHHHHHHHhhhcCC--eEEEEEeCCccCCCCcCCCccCCc-ccceeEEEeeccccccC--CceEEEcCeEE Q lcl|NC_020844. 1 MPSAEIYGELQGVASELMAEFQQ--GTARYIHPGEQTGPDYDPQPGEPT-PYTLDATVRGVAAQYVQ--EGYIAASDLQV 75 (120) Q Consensus 1 ~~~a~~Y~r~~atA~rLi~kfG~--~~~~r~~~g~~~~p~~dp~~~~~~-~~~~~~~~~~v~~~~id--GtlI~~gD~~v 75 (120) |. +.|+ --+...++....++..++..++-+ .+++-+.+...+.+... +.+-..-...+ T Consensus 1 mm-----------------~~G~L~~rI~i~~~~~~~d~~G~~~~~~w~~~~~~wA~v~~~s~~e~~~a~~~~~~~~~~~ 63 (109) T protein:vir:10 1 ML-----------------KAGELTERITIEKRGGGVNENGEPLPGDWVEHASVWANVRFLSGKEYVVSGAIHSSAIASM 63 (109) T ss_pred CC-----------------CccccCccEEEEeeeeeeCCCCCeeccceEEEEEEEEEEEecCchheeeccceeeeeEEEE Confidence 33 2232 123333333333332333343323 35677777776655432 22211111233 Q ss_pred EEecCCceeccCCEEEeCCeEEEEEecccc-CCCCcEEEEEEEEeC Q lcl|NC_020844. 76 TASVFGREPTLEGVVEIDGVEKQIIRVDPV-PAAGTPVVWRIFVKG 120 (120) Q Consensus 76 ~~a~~~~~p~~~D~v~~dg~~~~Vv~v~pi-~pag~~v~y~~qvR~ 120 (120) .+-.. -..+.+|+|..+|..|.|..+.|. +.....+.-++-.|. T Consensus 64 ~iR~~-~~I~~~~ri~~~g~~y~I~~v~~d~~~~~~~l~~~~~e~~ 108 (109) T protein:vir:10 64 RIRFR-RDVDSEMRIRHDGRLYDIAAVLPNRRQGYVDLSVKVGEKY 108 (109) T ss_pred EEEeC-CCCCcccEEEECCeEEEEeecCCCCCCCeEEEEEEEEEee Confidence 33221 135679999999999999999774 333455666666677 No 41 >protein:vir:4789 Length: 123 # NCBI annotation: putative minor capsid protein 2 # Family: family:all:1526 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150169;swissprot:trembl:q94m42;genbank:gi:15088780;uniprot:Q94M42;genbank:GeneID:955991 Probab=20.86 E-value=2.7 Score=18.22 Aligned_cols=101 Identities=14% Similarity=0.066 Sum_probs=50.6 Q ss_pred HHHHHHhhh----c-CCe-EEEEEeCCccCCCCcCCCc-cCCcccceeEEEeec---cccccCCceEEEc-------CeE Q lcl|NC_020844. 12 GVASELMAE----F-QQG-TARYIHPGEQTGPDYDPQP-GEPTPYTLDATVRGV---AAQYVQEGYIAAS-------DLQ 74 (120) Q Consensus 12 atA~rLi~k----f-G~~-~~~r~~~g~~~~p~~dp~~-~~~~~~~~~~~~~~v---~~~~idGtlI~~g-------D~~ 74 (120) -.-.|||.+ + -+. ++++. .| .++|+.+. ++|+. ++-+.++- -....+...+.+- ... T Consensus 1 ~~~~~~~p~ipK~~l~dsit~k~~-~~---kddyg~~~y~epvt--I~nvr~dr~t~ysG~~N~r~~taN~~~~~k~aVi 74 (123) T protein:vir:47 1 MIDKRLLKGIDKRLLKDVLTIKKV-AD---KNDYGDEVYSEPLT--IKNVRFDRSVGGSGNRNSKTGTGNSKSRQKQGVI 74 (123) T ss_pred CccccccCcCChhhcceeEEEEEe-cC---CCCcCCceeccceE--eeeeEEeeccccCCcccCcceecccccccCceEE Confidence 011222221 1 221 34433 23 24577655 45542 22222221 1112222223222 333 Q ss_pred EEEecCCceecc----CCEEEeCC-eEEEEEeccccCCCCcEEEEEEEEe Q lcl|NC_020844. 75 VTASVFGREPTL----EGVVEIDG-VEKQIIRVDPVPAAGTPVVWRIFVK 119 (120) Q Consensus 75 v~~a~~~~~p~~----~D~v~~dg-~~~~Vv~v~pi~pag~~v~y~~qvR 119 (120) ++.+..+ +|.+ -|.+.+|| .+|+|-++.|....+.+-+|+|.|= T Consensus 75 flY~~~s-~p~~d~~~~~~kv~dg~~EYtI~kIi~n~ysn~vysYELEVi 123 (123) T protein:vir:47 75 YLYPSLS-FVTVDNSWMGAKVNDGIGDYTINGFQTNYYDGEIFSQEIEVI 123 (123) T ss_pred EEecccc-ccceeccccceEEEcCCccEEecceeccccCCcEEEEEEEeC Confidence 4444332 3433 23466666 6999999999999999999999999 Done!