Query lcl|NC_021302.1_cdsid_YP_008051238.1 [gene=13] [protein=hypothetical protein] [protein_id=YP_008051238.1] [location=8306..8743] Match_columns 145 No_of_seqs 14 out of 17 Neff 4.0 Searched_HMMs 1612 Date Thu Nov 7 17:41:46 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8107 Length: 138 # 100.0 1.6E-56 1E-59 326.4 10.9 136 1-144 1-138 (138) 2 protein:vir:7994 Length: 134 # 100.0 1.3E-52 8E-56 305.1 10.0 131 4-141 1-134 (134) 3 protein:vir:105826 Length: 134 100.0 1.6E-52 1E-55 304.5 10.0 131 4-141 1-134 (134) 4 protein:vir:102609 Length: 134 100.0 1.6E-52 1E-55 304.5 10.0 131 4-141 1-134 (134) 5 protein:vir:8331 Length: 150 # 100.0 8.7E-50 5.4E-53 289.6 8.1 132 1-144 17-150 (150) 6 protein:vir:8433 Length: 140 # 100.0 6.2E-35 3.8E-38 208.1 7.5 133 4-143 1-140 (140) 7 protein:vir:101655 Length: 134 100.0 6.2E-33 3.8E-36 197.2 9.2 134 1-143 1-134 (134) 8 protein:vir:7860 Length: 134 # 100.0 6.2E-33 3.8E-36 197.2 9.2 134 1-143 1-134 (134) 9 protein:vir:1643 Length: 111 # 97.1 1.2E-05 7.7E-09 47.5 9.9 108 11-139 1-111 (111) 10 protein:vir:94768 Length: 111 97.0 2.1E-05 1.3E-08 46.3 9.7 108 11-139 1-111 (111) 11 protein:vir:9764 Length: 111 # 96.9 1.9E-05 1.2E-08 46.5 9.2 108 1-139 1-111 (111) 12 protein:vir:100242 Length: 114 96.9 1.3E-05 8.2E-09 47.3 8.3 105 1-140 1-114 (114) 13 protein:vir:9579 Length: 111 # 96.9 2.3E-05 1.4E-08 46.0 9.3 108 11-139 1-111 (111) 14 protein:vir:1438 Length: 115 # 95.4 0.00049 3E-07 38.8 9.4 107 1-140 1-115 (115) 15 protein:vir:100116 Length: 115 95.1 0.00065 4E-07 38.1 9.3 107 1-140 1-115 (115) 16 protein:vir:80371 Length: 115 94.9 0.00062 3.9E-07 38.2 8.4 105 1-140 1-115 (115) 17 protein:vir:78124 Length: 139 94.2 0.0027 1.7E-06 34.6 10.6 126 5-145 1-138 (139) 18 protein:vir:98426 Length: 131 93.9 0.003 1.9E-06 34.4 10.2 123 4-143 1-131 (131) 19 protein:vir:4348 Length: 121 # 91.7 0.0074 4.6E-06 32.3 9.2 105 8-145 1-120 (121) 20 protein:vir:1892 Length: 121 # 87.2 0.029 1.8E-05 29.0 8.9 105 8-139 1-121 (121) 21 protein:vir:81066 Length: 118 85.6 0.034 2.1E-05 28.7 8.4 108 10-143 1-118 (118) 22 protein:vir:10368 Length: 118 83.4 0.054 3.4E-05 27.5 8.5 108 10-143 1-118 (118) 23 protein:vir:93602 Length: 114 83.3 0.051 3.1E-05 27.7 8.3 102 4-140 1-114 (114) 24 protein:vir:99005 Length: 170 81.0 0.028 1.7E-05 29.1 6.0 136 1-145 1-159 (170) 25 protein:vir:97070 Length: 118 79.6 0.09 5.6E-05 26.3 8.3 108 10-143 1-118 (118) 26 protein:vir:96260 Length: 141 77.3 0.13 7.8E-05 25.5 9.3 129 1-144 1-141 (141) 27 protein:vir:105892 Length: 141 77.3 0.13 7.8E-05 25.5 9.3 129 1-144 1-141 (141) 28 protein:vir:94096 Length: 141 77.3 0.13 7.8E-05 25.5 9.3 129 1-144 1-141 (141) 29 protein:vir:94794 Length: 145 73.9 0.17 0.0001 24.9 10.0 130 1-145 1-142 (145) 30 protein:vir:95961 Length: 145 73.8 0.17 0.0001 24.9 10.0 130 1-145 1-142 (145) 31 protein:vir:93736 Length: 145 70.8 0.2 0.00013 24.4 9.9 130 5-145 1-142 (145) 32 protein:vir:97421 Length: 145 70.8 0.2 0.00013 24.4 9.9 130 5-145 1-142 (145) 33 protein:vir:94488 Length: 145 70.8 0.2 0.00013 24.4 9.9 130 5-145 1-142 (145) 34 protein:vir:95111 Length: 145 70.7 0.21 0.00013 24.4 9.9 130 5-145 1-142 (145) 35 protein:vir:97325 Length: 145 70.4 0.21 0.00013 24.3 9.9 130 1-145 1-142 (145) 36 protein:vir:1387 Length: 116 # 70.0 0.12 7.4E-05 25.7 6.3 106 7-139 1-116 (116) 37 protein:vir:96125 Length: 140 66.7 0.26 0.00016 23.8 9.5 127 6-143 1-140 (140) 38 protein:vir:98343 Length: 126 65.8 0.28 0.00017 23.6 8.0 117 1-145 1-125 (126) 39 protein:vir:9415 Length: 126 # 65.8 0.28 0.00017 23.6 8.0 117 1-145 1-125 (126) 40 protein:vir:5979 Length: 134 # 57.4 0.44 0.00027 22.6 11.2 121 5-137 1-134 (134) 41 protein:vir:195 Length: 115 # 56.3 0.46 0.00028 22.4 8.6 102 1-140 1-115 (115) 42 protein:vir:107096 Length: 145 55.9 0.47 0.00029 22.4 9.6 120 1-145 1-132 (145) 43 protein:vir:105337 Length: 145 55.9 0.47 0.00029 22.4 9.6 120 1-145 1-132 (145) 44 protein:vir:1244 Length: 145 # 55.0 0.49 0.0003 22.3 9.4 119 1-145 1-132 (145) 45 protein:vir:2508 Length: 139 # 50.6 0.46 0.00029 22.4 5.9 131 2-144 1-139 (139) 46 protein:vir:96894 Length: 140 48.3 0.67 0.00042 21.5 8.8 127 1-143 1-140 (140) 47 protein:vir:99925 Length: 147 42.9 0.79 0.00049 21.1 5.9 131 1-145 1-146 (147) No 1 >protein:vir:8107 Length: 138 # NCBI annotation: gp11 # Family: family:all:2795 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817688;genbank:gi:29566119;genbank:GeneID:1259313 Probab=100.00 E-value=1.6e-56 Score=326.45 Aligned_cols=136 Identities=26% Similarity=0.455 Sum_probs=130.7 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCc--ceeccEEEEEEEeeccCChhhHHHHHH Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENS--HTQYGFYQFDHLAVAADGKSAYTACED 78 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~--~~~~~~~~v~~~~~~~d~~~~~eaa~d 78 (145) |++|+||++||+|+||||||+|+++ +..+|++|||||||+|+||+|+||+ +|+++++||++|+ ++.|||++ T Consensus 1 ~~~~~~~~aP~~e~~vv~WLspv~~-va~~R~~d~pLPF~~V~Rv~G~d~~e~~tD~avv~~~~fg------~g~eaA~d 73 (138) T protein:vir:81 1 MADLHDQDAPDEEDFVVCWMQPVMR-TAVERDIDAELPFCEVTRIDGADDPEAGTDNPVIQLDFYA------LGAEAAKA 73 (138) T ss_pred CcccccCCCCchheeeeeeccchhc-cccccCCCCCCCeEEEEEeCCCCCccccccCceEEEEEee------cCHHHHHH Confidence 9999999999999999999999998 7889999999999999999999988 4999999999995 45599999 Q ss_pred HHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEee Q lcl|NC_021302. 79 YARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVA 144 (145) Q Consensus 79 aA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~ 144 (145) +|+++||||++|. +.+++||+|||+++++||++++++|+|++|.||+|.||+|||++|++|+||+ T Consensus 74 ~a~~vHrRM~kL~-~~~~~vTl~dGt~~~ld~~~~~~~P~~~~y~dD~ivRYtaRY~~g~~y~~~~ 138 (138) T protein:vir:81 74 AAKQGHRRMLFLF-RNFPTVTLSDGTLADLDFGETLIKPFRMAFEHDQIVRYTARYQLGTSYVAVS 138 (138) T ss_pred HHHhHHHHHHHHh-hcccceecCCCceEecchhhhhccccccccCCCeeeEeeeeeeccceeeecC Confidence 9999999999997 6699999999999999999999999999999999999999999999999999 No 2 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=100.00 E-value=1.3e-52 Score=305.05 Aligned_cols=131 Identities=30% Similarity=0.503 Sum_probs=126.0 Q ss_pred cccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCc--ceeccEEEEEEEeeccCChhhHHHHHHHHH Q lcl|NC_021302. 4 LLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENS--HTQYGFYQFDHLAVAADGKSAYTACEDYAR 81 (145) Q Consensus 4 l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~--~~~~~~~~v~~~~~~~d~~~~~eaa~daA~ 81 (145) |+++|+||+|+||+||||||++ |.++|++||||||++|+||+|+||+ +|+++++||++|+ ++.|||+|+|+ T Consensus 1 m~~~saP~~e~~vv~WLsp~~~-va~~R~~~~PLPf~~V~Rv~G~d~~e~~tD~avvsv~~fg------~~~eaA~d~ad 73 (134) T protein:vir:79 1 MATDSAPSIHRVLVAWLSPLGK-VSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFA------ASDEAAENEAE 73 (134) T ss_pred CCcccCCChheeeeeecccchh-ceeccCCCCCCCeEEEEEeCCCCCccccccCceeEEEEee------CCHHHhhHHHH Confidence 9999999999999999999998 8889999999999999999999977 5999999999997 45599999999 Q ss_pred HHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceEE Q lcl|NC_021302. 82 TIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRLV 141 (145) Q Consensus 82 ~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~v 141 (145) ++||||+||..++..+|+++||+++++||++++++|+|++|+|| ++.||+|||++|++|+ T Consensus 74 ~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~vl~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:79 74 LTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 99999999998999999999999999999999999999999888 9999999999999999 No 3 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=100.00 E-value=1.6e-52 Score=304.49 Aligned_cols=131 Identities=30% Similarity=0.505 Sum_probs=126.0 Q ss_pred cccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCc--ceeccEEEEEEEeeccCChhhHHHHHHHHH Q lcl|NC_021302. 4 LLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENS--HTQYGFYQFDHLAVAADGKSAYTACEDYAR 81 (145) Q Consensus 4 l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~--~~~~~~~~v~~~~~~~d~~~~~eaa~daA~ 81 (145) |+++|+||+|+||+||||||++ |.++|++||||||++|+||+|+||+ +|+++++||++|+ ++.|||+|+|+ T Consensus 1 m~~~saP~~e~~vv~WLsp~~~-va~~R~~~~PLPf~~V~Rv~G~d~~e~~tD~avvsv~~fg------~~~eaA~d~ad 73 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPLGK-VSTRRLSGDPLPHRVVRRVDGRDVPEEGSDVAVVSVHTFA------ASDEAAENEAE 73 (134) T ss_pred CCcccCCChheeeeeecccchh-ceeccCCCCCCCeEEEEEeCCCCCcccccccceEEEEEee------CCHHHhhHHHH Confidence 9999999999999999999998 8889999999999999999999977 5899999999997 45599999999 Q ss_pred HHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceEE Q lcl|NC_021302. 82 TIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRLV 141 (145) Q Consensus 82 ~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~v 141 (145) ++||||+||..++..+|+++||+++++||++++++|+|++|+|| ++.||+|||++|++|+ T Consensus 74 ~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:10 74 LTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 99999999998999999999999999999999999999999888 9999999999999999 No 4 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=100.00 E-value=1.6e-52 Score=304.49 Aligned_cols=131 Identities=30% Similarity=0.505 Sum_probs=126.0 Q ss_pred cccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCc--ceeccEEEEEEEeeccCChhhHHHHHHHHH Q lcl|NC_021302. 4 LLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENS--HTQYGFYQFDHLAVAADGKSAYTACEDYAR 81 (145) Q Consensus 4 l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~--~~~~~~~~v~~~~~~~d~~~~~eaa~daA~ 81 (145) |+++|+||+|+||+||||||++ |.++|++||||||++|+||+|+||+ +|+++++||++|+ ++.|||+|+|+ T Consensus 1 m~~~saP~~e~~vv~WLsp~~~-va~~R~~~~PLPf~~V~Rv~G~d~~e~~tD~avvsv~~fg------~~~eaA~d~ad 73 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPLGK-VSTRRLSGDPLPHRVVRRVDGRDVPEEGSDVAVVSVHTFA------ASDEAAENEAE 73 (134) T ss_pred CCcccCCChheeeeeecccchh-ceeccCCCCCCCeEEEEEeCCCCCcccccccceEEEEEee------CCHHHhhHHHH Confidence 9999999999999999999998 8889999999999999999999977 5899999999997 45599999999 Q ss_pred HHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceEE Q lcl|NC_021302. 82 TIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRLV 141 (145) Q Consensus 82 ~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~v 141 (145) ++||||+||..++..+|+++||+++++||++++++|+|++|+|| ++.||+|||++|++|+ T Consensus 74 ~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:10 74 LTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 99999999998999999999999999999999999999999888 9999999999999999 No 5 >protein:vir:8331 Length: 150 # NCBI annotation: gp48 # Family: family:all:2795 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817899;genbank:gi:29566332;genbank:GeneID:1259527 Probab=100.00 E-value=8.7e-50 Score=289.56 Aligned_cols=132 Identities=33% Similarity=0.438 Sum_probs=125.9 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCc--ceeccEEEEEEEeeccCChhhHHHHHH Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENS--HTQYGFYQFDHLAVAADGKSAYTACED 78 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~--~~~~~~~~v~~~~~~~d~~~~~eaa~d 78 (145) --+|+++++||+|+|++|||+|+++ +.++|+.||||||++|+||+|++++ +++.+++||++|+++.+|+ +||++ T Consensus 17 ~~~~~~~sapdae~~vv~wLsp~~r-vA~~R~~~dplPf~lv~rv~G~d~pde~td~avvsv~~fg~~v~G~---daA~~ 92 (150) T protein:vir:83 17 EPEILNEGPADAETFVVKWLGEVYR-AANTRRPGDPLPFLLIQQVAGKENLDESTADPVVQVDILCDKVDGE---DAARD 92 (150) T ss_pred CcccccCCCccHHHHHHHHhhHHhh-hhhcccCCCCCCeEEEEecCCCCCcccccccceeeeeeccccccch---hhhhh Confidence 6789999999999999999999998 8888888999999999999999964 6999999999999999988 99999 Q ss_pred HHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEee Q lcl|NC_021302. 79 YARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVA 144 (145) Q Consensus 79 aA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~ 144 (145) +|+++||||++|. .+++||| ++||++++++|+|++|.||+++||+|||++|++|++|| T Consensus 93 ~ad~vH~RM~~l~-----r~tl~~G---tld~~~v~~aP~~leY~dD~vvrYt~RY~~G~~Y~~~~ 150 (150) T protein:vir:83 93 IKDRVHRRMLLLG-----RYLEMDG---TLDWMKVFESPRRLEYTNDKVIRYTARYQFGQTYEQIA 150 (150) T ss_pred hhhhHHHHHHHHh-----hhhccCC---cchhhhhhccccccccCCCeEEEeeeeeeccCchhhcC Confidence 9999999999996 4999999 59999999999999999999999999999999999999 No 6 >protein:vir:8433 Length: 140 # NCBI annotation: gp28 # Family: family:all:30886 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818329;genbank:gi:29566765;genbank:GeneID:1260029 Probab=100.00 E-value=6.2e-35 Score=208.14 Aligned_cols=133 Identities=28% Similarity=0.464 Sum_probs=129.1 Q ss_pred cccCCCCCHHhhhhhhhhhhccc-cccCCCCcCCCceEEEEeecCCCCcc--eeccEEEEEEEeeccCChhhHHHHHHHH Q lcl|NC_021302. 4 LLDREAPPDIRFLRAWLLPIGGG-VGAKRETGDPFPFTLIQKFDGWENSH--TQYGFYQFDHLAVAADGKSAYTACEDYA 80 (145) Q Consensus 4 l~d~eaP~~~~~lia~L~plg~~-v~~~R~~gdPlPf~~V~rV~G~dd~~--~~~~~~~v~~~~~~~d~~~~~eaa~daA 80 (145) |+++|+|++.++||+||+.+.+. |.++||+++||||+.|+||+|++|++ |+++.|.+.+| +|++|+.+|.. T Consensus 1 mteyeyppgvkvlikwlsgiegvdvrherppnsplpfisvhrigggedencitdqgryafmvf------gssqemvddtv 74 (140) T protein:vir:84 1 MTEYEYPPGVKVLIKWLSGIEGVDVRHERPPNSPLPFISVHRIGGGEDENCITDQGRYAFMVF------GSSQEMVDDTV 74 (140) T ss_pred CCcccCCccHHHHHHHhcccccccccccCCCCCCCceeeeeeccCCCcccccccCCcEEEEEe------cCchhhhHHHH Confidence 99999999999999999999985 99999999999999999999999997 99999999999 58999999999 Q ss_pred HHHHHHhhhhc-CCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEE---eeecceEEEe Q lcl|NC_021302. 81 RTIKRRMLYLR-DRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIAR---YSVHLRLVSV 143 (145) Q Consensus 81 ~~~hrRMl~L~-~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aR---Y~v~Lr~va~ 143 (145) +++.|||++|. .+.|++|+++|.+++.....+.+++|+ ++.+|.++++|++. |+||||+||- T Consensus 75 rlvtrrmkklvgygsqekvtvgdksyyadeahkreerpi-dnlddaiprkffgtslmydvhmrivaa 140 (140) T protein:vir:84 75 RLVTRRMKKLVGYGSQEKVTVGDKSYYADEAHKREERPI-DNLDDAIPRKFFGTSLMYDVHMRIVAA 140 (140) T ss_pred HHHHHHHHHHhcCCCcceeeecccchhcchhhhhhcccc-chhhhhhhhhhhhhhhhheeeeeeecC Confidence 99999999998 799999999999999999999999999 99999999999999 9999999988 No 7 >protein:vir:101655 Length: 134 # NCBI annotation: gp18 # Family: family:all:2795 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654773;genbank:gi:109302771;genbank:GeneID:4156089 Probab=99.96 E-value=6.2e-33 Score=197.16 Aligned_cols=134 Identities=23% Similarity=0.320 Sum_probs=125.5 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHHHHH Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACEDYA 80 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~daA 80 (145) |.- |++++||+++.+++||+|+..+|.++|+.+.|.||++|+|++|+ .++..++++.+++..+|+| .++|.|.| T Consensus 1 mlp-lsrpnpnaeklvcaylspffenvashrwvdaptpfilvkrlpgg-gqgevsdcalmsikvfgkd----vdeagdla 74 (134) T protein:vir:10 1 MLP-LSRPNPNAEKLVCAYLSPFFENVASHRWVDAPTPFILVKRLPGG-GQGEVSDCALMSIKVFGKD----VDEAGDLA 74 (134) T ss_pred CCC-CCCCCCchhhhhhhhhhhHHhhhhccccccCCCceEEEeeCCCC-CCccccceeeeeeeeeccc----cccccchH Confidence 443 58899999999999999999999999999999999999999999 8889999999998888888 47889999 Q ss_pred HHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 81 RTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 81 ~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) +++|+||.+|.+++.+++ ||+.++++.++++..|+|..|+||..+||+|||+||||---| T Consensus 75 devhermrkwkpkdtvsy---gghsfginllevedapfwldygddteecytarywvhlrvdyv 134 (134) T protein:vir:10 75 DEVHERMRKWKPKDTVSY---GGHSFGINLLEVEDAPFWLDYGDDTEECYTARYWVHLRVDYV 134 (134) T ss_pred HHHHHHHhccCccccccc---CchhhcceeEeecCCceeeecCCCccceeeeeEEEEEEEecC Confidence 999999999999999888 999999999999999999999999999999999999996666 No 8 >protein:vir:7860 Length: 134 # NCBI annotation: gp17 # Family: family:all:2795 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817467;genbank:gi:29565896;genbank:GeneID:1259089 Probab=99.96 E-value=6.2e-33 Score=197.16 Aligned_cols=134 Identities=23% Similarity=0.320 Sum_probs=125.5 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHHHHH Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACEDYA 80 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~daA 80 (145) |.- |++++||+++.+++||+|+..+|.++|+.+.|.||++|+|++|+ .++..++++.+++..+|+| .++|.|.| T Consensus 1 mlp-lsrpnpnaeklvcaylspffenvashrwvdaptpfilvkrlpgg-gqgevsdcalmsikvfgkd----vdeagdla 74 (134) T protein:vir:78 1 MLP-LSRPNPNAEKLVCAYLSPFFENVASHRWVDAPTPFILVKRLPGG-GQGEVSDCALMSIKVFGKD----VDEAGDLA 74 (134) T ss_pred CCC-CCCCCCchhhhhhhhhhhHHhhhhccccccCCCceEEEeeCCCC-CCccccceeeeeeeeeccc----cccccchH Confidence 443 58899999999999999999999999999999999999999999 8889999999998888888 47889999 Q ss_pred HHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 81 RTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 81 ~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) +++|+||.+|.+++.+++ ||+.++++.++++..|+|..|+||..+||+|||+||||---| T Consensus 75 devhermrkwkpkdtvsy---gghsfginllevedapfwldygddteecytarywvhlrvdyv 134 (134) T protein:vir:78 75 DEVHERMRKWKPKDTVSY---GGHSFGINLLEVEDAPFWLDYGDDTEECYTARYWVHLRVDYV 134 (134) T ss_pred HHHHHHHhccCccccccc---CchhhcceeEeecCCceeeecCCCccceeeeeEEEEEEEecC Confidence 999999999999999888 999999999999999999999999999999999999996666 No 9 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=97.14 E-value=1.2e-05 Score=47.49 Aligned_cols=108 Identities=12% Similarity=0.172 Sum_probs=78.4 Q ss_pred CHHhhhhhhhhhh-ccccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 11 PDIRFLRAWLLPI-GGGVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACEDYARTIKRRMLY 89 (145) Q Consensus 11 ~~~~~lia~L~pl-g~~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~daA~~~hrRMl~ 89 (145) =.|..++.||..- +-+|..+.|.+-|-+|..|.|++|..+...+.++..|..++ ++..+|..+|.++...|.. T Consensus 1 miE~~i~~~L~~~l~Vpv~~e~p~~~P~~FV~vErtGG~~~~~~~~~~lAVq~w~------~S~~eAa~La~~v~~~l~~ 74 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYILFEKTGSSKSNHLLSSTFAFQSYA------PSMYEAAKLNEQLKEVVER 74 (111) T ss_pred ChHHhHHHHHhhcCCceeEeecCCCCCCceEEEEecCCccccccccceEEEEecc------hhHHHHHHHHHHHHHHHhh Confidence 3466889999876 66799999999999999999999999999999999999994 7778888899999999988 Q ss_pred hcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCc--eEEEEEEeeecce Q lcl|NC_021302. 90 LRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTD--VERFIARYSVHLR 139 (145) Q Consensus 90 L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~--v~Ry~aRY~v~Lr 139 (145) |. .+..| ..+ +.-.. -+|.|+. =-||-+-|++--= T Consensus 75 l~--~~~~I-------~av-----~~~s~-ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:16 75 LI--ELNEI-------SNV-----SLNSD-YNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred cc--ccccc-------eee-----ecCCC-CcCCCCCCCCceEEEEEEEeeC Confidence 84 23334 111 11111 3444443 4455555554322 No 10 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=96.96 E-value=2.1e-05 Score=46.26 Aligned_cols=108 Identities=12% Similarity=0.174 Sum_probs=78.5 Q ss_pred CHHhhhhhhhhhh-ccccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 11 PDIRFLRAWLLPI-GGGVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACEDYARTIKRRMLY 89 (145) Q Consensus 11 ~~~~~lia~L~pl-g~~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~daA~~~hrRMl~ 89 (145) =.|..++.||..= +-+|..+.|.+-|-+|..|.|++|..+...+.++..|..++ ++..+|..+|.++...|.. T Consensus 1 miE~~v~~~L~~~l~vpv~~e~p~~~p~~FV~vErtGG~~~~~~~~~~lAVQ~~~------~S~~eAa~La~~v~~~~~~ 74 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYVLFEKTGSSKSNHLLSSTFAFQSYA------PSMYEAAKLNEQLKEVVER 74 (111) T ss_pred ChHHhHHHHHhhcCCcceEeecCCCCCCceEEEEecCCccccccccceEEEEecc------hhHHHHHHHHHHHHHHHhh Confidence 3466889999765 66799999999999999999999999999999999999995 6677888899999999988 Q ss_pred hcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCc--eEEEEEEeeecce Q lcl|NC_021302. 90 LRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTD--VERFIARYSVHLR 139 (145) Q Consensus 90 L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~--v~Ry~aRY~v~Lr 139 (145) |. .+..| +.+ +.- -.-+|.|+. =-||-+-|++--= T Consensus 75 l~--~~~~i-------~~v-----~~~-s~Ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:94 75 LI--ELNEI-------SNV-----SLN-SDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred cc--ccccc-------cee-----ecC-CCcccCCCcCCCceEEEEEEEeeC Confidence 83 24344 121 111 113454443 4455555554322 No 11 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=96.93 E-value=1.9e-05 Score=46.46 Aligned_cols=108 Identities=17% Similarity=0.197 Sum_probs=79.2 Q ss_pred CcccccCCCCCHHhhhhhhhhhh-ccccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHHHH Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPI-GGGVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACEDY 79 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~pl-g~~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~da 79 (145) |+ |..++.+|..- +-+|..+.|.+-|-+|..|.|.+|..+...+.++..|..++ ++..+|..+ T Consensus 1 mI----------E~~i~~yL~~~l~vpv~~e~p~~~P~~FV~vEkTGG~~~~~~~~a~lAvQsyg------~S~~~AA~L 64 (111) T protein:vir:97 1 MI----------EVIIKKYLDEHLDVPSFFEHQKDEPARFIILEKTSGAKQNHLLSSTFAFQSYA------ESLYEAALL 64 (111) T ss_pred Ch----------hhhhhHHHhhhcCceEEEeecCCCCCceEEEEeeCCccccccccceEEEEecc------hhHHHHHHH Confidence 54 45788888774 66799999999999999999999999999999999999995 777888889 Q ss_pred HHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCc--eEEEEEEeeecce Q lcl|NC_021302. 80 ARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTD--VERFIARYSVHLR 139 (145) Q Consensus 80 A~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~--v~Ry~aRY~v~Lr 139 (145) |.++...|..|. .+++| .+.-.+-| =+|.|+. =-||-|-|++--= T Consensus 65 a~~V~~a~~~l~--~l~~i---~~v~lns~----------Ynf~d~~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 65 NDKVKQVIEQLD--VLPQV---SGVHLNAD----------YNFTDTATKRYRYQAVFDINHY 111 (111) T ss_pred HHHHHHHhhhhc--cCccc---eeeeeccc----------ccCCCCCCCCccEEEEEEEeeC Confidence 999999998884 34445 12222222 2455543 3456666554332 No 12 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=96.92 E-value=1.3e-05 Score=47.32 Aligned_cols=105 Identities=21% Similarity=0.268 Sum_probs=67.4 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccc--cccCCCCcCCCceEEEEeecCCC------CcceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGG--VGAKRETGDPFPFTLIQKFDGWE------NSHTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~--v~~~R~~gdPlPf~~V~rV~G~d------d~~~~~~~~~v~~~~~~~d~~~~ 72 (145) |++|+=+ +-|.-|++. --..+|+++|+||.+.+||.|.- +.+...+++||+.++ ++ T Consensus 1 ~~~~~i~----------~~l~~~~g~~~~~~~aP~~~~~Py~vy~rvsg~p~~tL~G~~g~~~~r~QiD~yA------~T 64 (114) T protein:vir:10 1 MSALTIR----------DAIGIVGGAKGYVSVASSAAQSPYYVVSRVSGTRDMALGGATGGKSGMFQIDVYA------KT 64 (114) T ss_pred Cceeeee----------hhhcccccccccCCCCCCCCCCceEEEEeccCcccccccCCCCcceEEEEEEeee------CC Confidence 8777544 445556642 34567899999999999999997 336889999999997 77 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceE Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRL 140 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~ 140 (145) +++|+..|+++.++.+.-. + +.... ..+ ..+.|+.+ .+.|-.-=+.|. | T Consensus 65 ~~eA~~La~~~~~~l~~~~--~-----------f~~~~--l~~--~~d~ye~dT~l~Rvsld~si~--f 114 (114) T protein:vir:10 65 YTEADSLADQIIDRVESTG--M-----------FSVGG--VSD--LPDDYSSDTGVFRVSLEISVQ--F 114 (114) T ss_pred HHHHHHHHHHHHhhccccc--C-----------eeeec--ccc--CCCCCCcccCceEEEEEEEEe--C Confidence 8999999988877543221 0 11111 111 12456544 355544433333 3 No 13 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=96.88 E-value=2.3e-05 Score=46.02 Aligned_cols=108 Identities=17% Similarity=0.194 Sum_probs=76.6 Q ss_pred CHHhhhhhhhhh-hccccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 11 PDIRFLRAWLLP-IGGGVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACEDYARTIKRRMLY 89 (145) Q Consensus 11 ~~~~~lia~L~p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~daA~~~hrRMl~ 89 (145) =.|..++.||.. ++-+|..+=|.+-|-+|..|.|++|..+...+.++..|+.++ ++..+|.++|.++...|.. T Consensus 1 miE~~v~~~L~~~l~vpv~~~vp~~~P~~FV~vErtGG~~~~~~~~p~laVq~wg------~S~~~Aa~La~~v~~a~~~ 74 (111) T protein:vir:95 1 MIEIIINKYLDGHLDVPSFFEHEAEAPDSFVIIQKTGGKERNHSGSATFAFQSYA------PTMQKAAELNVKVKSAVKG 74 (111) T ss_pred ChHHhHHHHhhhhcCeeEEeecCCCCCCceEEEEeeCCccccccccceEEEEecc------ccHHHHHHHHHHHHHHHhh Confidence 346789999964 455799999999999999999999999999999999999994 6778888899999999888 Q ss_pred hcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCc--eEEEEEEeeecce Q lcl|NC_021302. 90 LRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTD--VERFIARYSVHLR 139 (145) Q Consensus 90 L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~--v~Ry~aRY~v~Lr 139 (145) |. ...+| ..+ + ..++ -+|.|+. --||-+-|++-.= T Consensus 75 l~--~~~~i-------~~v---~-~~s~--ynf~d~~tk~~RYQ~~~~i~~~ 111 (111) T protein:vir:95 75 LI--ELDSI-------CGV---H-LNSD--YNFTDTETKQYRYQAVFDINYF 111 (111) T ss_pred hh--ccccc-------ccc---c-cCCc--cccCCCCCCCceEEEEEEEEeC Confidence 83 23333 111 1 1122 3444443 4556555554322 No 14 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=95.38 E-value=0.00049 Score=38.75 Aligned_cols=107 Identities=20% Similarity=0.294 Sum_probs=67.2 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccc-c-ccCCCCcCCCceEEEEeecCCCC------cceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGG-V-GAKRETGDPFPFTLIQKFDGWEN------SHTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~-v-~~~R~~gdPlPf~~V~rV~G~dd------~~~~~~~~~v~~~~~~~d~~~~ 72 (145) |+-|+ +-+-|+|+..+ | --.-|.+.|+||.+-++|.|..+ ++.+..++||++++ .+ T Consensus 1 ~~~~~----------i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA------~t 64 (115) T protein:vir:14 1 MSVIV----------IRDALQGIGGAKGYLGVAPAKAPAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYA------PT 64 (115) T ss_pred CeeEe----------eehhhccccccccccccCCCCCCCCEEEEEeecCcccccccCCCCCcceEEEEEEee------CC Confidence 66654 44556666543 2 22346788999999999999863 35789999999997 56 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceE Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRL 140 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~ 140 (145) +++|+.+++.+..+|..+. +...+. .+ +...+.|+.+ .+-|-...++-+=| T Consensus 65 ~~~A~~l~~~v~~~~~~~~--~~~~~~-------~~-------~~~~d~ye~d-t~lyR~s~D~~vWf 115 (115) T protein:vir:14 65 FTDADRLADLAVDRAMSVQ--DRFSVG-------GV-------DELPDDYSED-TGLFRISLELSVEF 115 (115) T ss_pred HHHHHHHHHHHHHHHhcCc--cceeee-------ee-------cCCCCCCccc-ccceeeEEEEEEeC Confidence 6889999999888877764 222221 11 1111445433 23344444555455 No 15 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=95.13 E-value=0.00065 Score=38.06 Aligned_cols=107 Identities=20% Similarity=0.292 Sum_probs=67.0 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccc-c-ccCCCCcCCCceEEEEeecCCCC------cceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGG-V-GAKRETGDPFPFTLIQKFDGWEN------SHTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~-v-~~~R~~gdPlPf~~V~rV~G~dd------~~~~~~~~~v~~~~~~~d~~~~ 72 (145) |+-|+ +-+-|+|+..+ | --.-|.+.|+||.+-++|.|..+ ++.+..++||++++ .+ T Consensus 1 ~~~~~----------i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA------~t 64 (115) T protein:vir:10 1 MSVIV----------IRDALQGIGGAKGYLGVAPEKAPAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYA------PT 64 (115) T ss_pred CeeEE----------eehhhcccCCceeecccCCCCCCCCEEEEEeecCccccccCCCCCCcceEEEEEEee------CC Confidence 76664 44556666542 1 12345788999999999999863 35789999999997 56 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceE Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRL 140 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~ 140 (145) +++|+.+++.+..++..+. +...+. .+ +...+.|+.+ .+-|-.+.++-+=| T Consensus 65 ~~~A~~l~~~v~~~~~~~~--~~~~~~-------~~-------~~~~d~ye~d-t~lyR~s~D~~vWf 115 (115) T protein:vir:10 65 FTDADRLADLAVDRAMSVQ--DRFSVG-------GV-------DELPDDYSED-TGLFRISLELSVEF 115 (115) T ss_pred HHHHHHHHHHHHHHHhcCc--cceeEe-------ee-------cCCCCCCccc-ccceeeEEEEEEeC Confidence 6889999999888877764 232221 11 1111445433 23344455555555 No 16 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=94.85 E-value=0.00062 Score=38.16 Aligned_cols=105 Identities=20% Similarity=0.274 Sum_probs=67.4 Q ss_pred CcccccCCCCCHHhhhhhhhhhhcccc--ccCCCCcCCCceEEEEeecCCCCc------ceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGGV--GAKRETGDPFPFTLIQKFDGWENS------HTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~v--~~~R~~gdPlPf~~V~rV~G~dd~------~~~~~~~~v~~~~~~~d~~~~ 72 (145) |+-+ |+.+-|..|+..= ---=|.+.|.||.+++||.|.-|- +...+++||+.++ ++ T Consensus 1 ~~~~----------vir~al~~i~~~~~~~~vAp~~~~~pyivy~rvsga~e~~L~G~ag~~~~~~QID~yA------~T 64 (115) T protein:vir:80 1 MSVI----------VVRDALQGIGGAKGYLGVAPEKAPARYFVVTRVHGALDMALAGPTGGRSGSYQIDCYA------PT 64 (115) T ss_pred Ceee----------eeechhhhccccccceeeccccCcCCeEEEeecCCCccccccCCCCCceeEEEEeeec------CC Confidence 5444 5566677776521 112256899999999999998754 4678899999996 67 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEE-cCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceE Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVT-VPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRL 140 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~-~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~ 140 (145) +.+|+..|+++.-||..+ -++.++- ++++ ++.|..| .+.| .+-++-.-| T Consensus 65 ~~ea~~La~~v~d~~~~~--~~~~~vg~l~e~---------------pd~Ye~DT~l~R--vs~dv~i~f 115 (115) T protein:vir:80 65 FTDADRLADLAVDRAMSV--QDRFSVGGVDEL---------------PDDYSADTGLFR--VSLELSVEF 115 (115) T ss_pred HHHHHHHHHHHHHhhhCC--ccccceecccCC---------------CcccccccceEE--EEEEEEEeC Confidence 799999999999988876 3333331 2221 2455444 2433 333444444 No 17 >protein:vir:78124 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:29862 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294806;genbank:gi:149882827;genbank:GeneID:5309152 Probab=94.25 E-value=0.0027 Score=34.63 Aligned_cols=126 Identities=18% Similarity=0.156 Sum_probs=76.6 Q ss_pred ccCCCCCHHhhhhhhh------hhhccccccCCCCcCC--C--ceEEEEeecCCC-CcceeccEEEEEEEeeccCChhhH Q lcl|NC_021302. 5 LDREAPPDIRFLRAWL------LPIGGGVGAKRETGDP--F--PFTLIQKFDGWE-NSHTQYGFYQFDHLAVAADGKSAY 73 (145) Q Consensus 5 ~d~eaP~~~~~lia~L------~plg~~v~~~R~~gdP--l--Pf~~V~rV~G~d-d~~~~~~~~~v~~~~~~~d~~~~~ 73 (145) ..--.|+.|.|+++|| +.+.+.||++-|++-- + |..+|+-=.|+- |-.+++-.+-|.+++.-..-+ T Consensus 1 ~~v~PPDlE~fl~~~LRa~i~~adVDgqvGnk~Pd~y~g~y~~PLvvVRDDgG~~~d~~tFDRSiGvnVlgwtrqd~--- 77 (139) T protein:vir:78 1 MRVAPPDLEEWFTALLRAEVRAAGVDAEVGNKEPDNLRVPLRRPLIVVRDDSGDRRDWTTFDRSVGFTVLAGTKQND--- 77 (139) T ss_pred CccCCccHHHHHHHHHHhhccccCccccccCcCCCCccccccCCeEEEEcCCCCcccceeeecccceeeeeccccCc--- Confidence 2233557799999998 4556679999987654 4 556665544443 446888888888886544333 Q ss_pred HHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeE-eeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 74 TACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVV-RCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 74 eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~-~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) .-|+|+|..+.. .|..+++ +-+-|..++.++.- |.---|+ .|+. -+|||-+-..|+++.| T Consensus 78 KPc~dLArrVy~---~lt~hp~--~LiegSpi~aVv~dgCnGPYpV----sdd~---d~aryYltveYst~G~ 138 (139) T protein:vir:78 78 KPANDLARVVAS---IVHDHEL--PLIEGSPIAAVVFDGCRGPYAV----PDTI---DVARRYLTGQYVASGS 138 (139) T ss_pred hhhHHHHHHHHH---HhccCcc--eeecCCceEEeecccCCCCCCC----Ccch---hheeeeeEEEEeeecc Confidence 567777766554 2222443 33356666655332 2222222 1221 2578889999999999 No 18 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=93.93 E-value=0.003 Score=34.40 Aligned_cols=123 Identities=16% Similarity=0.154 Sum_probs=79.6 Q ss_pred cccCCCCCHHhhh----hhhh-hhhcc-ccccCCCCcCCCceEEEEeecCC-CCcceeccEEEEEEEeeccCChhhHHHH Q lcl|NC_021302. 4 LLDREAPPDIRFL----RAWL-LPIGG-GVGAKRETGDPFPFTLIQKFDGW-ENSHTQYGFYQFDHLAVAADGKSAYTAC 76 (145) Q Consensus 4 l~d~eaP~~~~~l----ia~L-~plg~-~v~~~R~~gdPlPf~~V~rV~G~-dd~~~~~~~~~v~~~~~~~d~~~~~eaa 76 (145) |-----|+++.++ ..|| +-..+ .|+++=|.+-|=.|..|.|.+|. .++.++.+...|..++ ++.++| T Consensus 1 ~~~i~~pda~~v~~~~lr~~l~a~~~~V~V~t~vP~~RP~rfV~VertgG~~~~~~~Dr~~L~Vq~W~------~t~~~A 74 (131) T protein:vir:98 1 MPPILMPDAVAVIAGYLRAVLVARGVTVPVGSRVPSPRPARFVRIERIGGPANTVVTDRPRLDVHCWG------SSEEDA 74 (131) T ss_pred CCCccCCchhHHHHHHHHHHHHhcCCceEecccCCCCCCceEEEEEecCCCcCCccccceEEEEEecC------CCHHHH Confidence 2211234665554 4566 33333 48999999999999999999994 4777898888888884 556899 Q ss_pred HHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeecc-CccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 77 EDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTA-SPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 77 ~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~-~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) .+.|..+-..|+.+- + .. .-..+-+++. .|.+.+=.+....||...=++-+|=-+. T Consensus 75 ~~La~~vr~~ll~~~-~-~~---------g~~~~~~~e~~gpy~~PD~es~~~Ryq~tv~l~~r~~~~ 131 (131) T protein:vir:98 75 HDLMQLCRALLGAAR-G-SH---------GDTVLARPATGGPQFLPDAETGAARWAFTLDITMRGHAL 131 (131) T ss_pred HHHHHHHHHHHhhcc-c-cc---------chheeccccCCCCCcCCCCCCCCceeEEEEEEEeeeccC Confidence 999999988777552 1 11 1222334444 4553333334578888777777776555 No 19 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=91.75 E-value=0.0074 Score=32.28 Aligned_cols=105 Identities=14% Similarity=0.070 Sum_probs=54.5 Q ss_pred CCCCHHhhhhh------hhhhh-ccccc-cCCCCcCCCceEEEEeecCCCC------cceeccEEEEEEEeeccCChhhH Q lcl|NC_021302. 8 EAPPDIRFLRA------WLLPI-GGGVG-AKRETGDPFPFTLIQKFDGWEN------SHTQYGFYQFDHLAVAADGKSAY 73 (145) Q Consensus 8 eaP~~~~~lia------~L~pl-g~~v~-~~R~~gdPlPf~~V~rV~G~dd------~~~~~~~~~v~~~~~~~d~~~~~ 73 (145) +.|+.-+.+.+ -|+.- ++..- ..-|.+.|+||.+-++|.|... ++.+..++||++++ .++ T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~g~~~~~~~~vQIDvyA------~t~ 74 (121) T protein:vir:43 1 MYPPIFKVCSSSPAVTAILGASPLRMYQFGLAPQLVVKPYATWQTISGSPENYLWGRPDADGFTIQVDIFS------ATA 74 (121) T ss_pred CChHHHHHHhhChhhhhhhcCCCceeeccCCCCCCCcCCeEEEEEecCcccceecCCCCcceeEEEEEeee------CCH Confidence 55655444433 22110 01111 1347789999999999999863 25778999999997 445 Q ss_pred HHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCc-eEEEEEEeeecceEEEeeC Q lcl|NC_021302. 74 TACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTD-VERFIARYSVHLRLVSVAS 145 (145) Q Consensus 74 eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~-v~Ry~aRY~v~Lr~va~~~ 145 (145) ++|+.+++.+..-|-.+++ +...+ ...|+.|. +-|+. .++. =+-. T Consensus 75 ~~A~~l~~av~~Al~~~~~-----~~~~~----------------~~~ye~dT~lyR~s--~Dv~----w~~~ 120 (121) T protein:vir:43 75 AEARDAAKAIRDAIELSAY-----VVRWG----------------GESVDPDTKTYRVS--FDVD----WIVQ 120 (121) T ss_pred HHHHHHHHHHHHHhhhcCC-----cccCC----------------CCCCcccccceeee--eEEE----Eeec Confidence 7788777777554443321 11111 13343332 22221 1111 1111 No 20 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=87.23 E-value=0.029 Score=29.02 Aligned_cols=105 Identities=11% Similarity=0.083 Sum_probs=52.4 Q ss_pred CCCCHHhhhhh---hhhhhcc---ccc--cCCCCcCCCceEEEEeecCCCC------cceeccEEEEEEEeeccCChhhH Q lcl|NC_021302. 8 EAPPDIRFLRA---WLLPIGG---GVG--AKRETGDPFPFTLIQKFDGWEN------SHTQYGFYQFDHLAVAADGKSAY 73 (145) Q Consensus 8 eaP~~~~~lia---~L~plg~---~v~--~~R~~gdPlPf~~V~rV~G~dd------~~~~~~~~~v~~~~~~~d~~~~~ 73 (145) +.|+.-++|.+ -.+=+|. -+= ..-|.+.|+||.+-++|.|... ++.+..++||++++ .+. T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~vQIDvyA------~t~ 74 (121) T protein:vir:18 1 MIAPIFSVCASSPEVTDLLGSNPVRIYPFGIQDDNVVYPYVVWQNITGSPENYIAQRPDADFFTLQVDAYA------DTV 74 (121) T ss_pred CchHHHHHHhcChhhhhhhcCCCceeeeccCCCCcCcCCeEEEEEecCcccceecCCCCcceeEEEEEeec------CCH Confidence 55555444421 1111111 021 2457789999999999999862 25778899999997 444 Q ss_pred HHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEE-eeecce Q lcl|NC_021302. 74 TACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIAR-YSVHLR 139 (145) Q Consensus 74 eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aR-Y~v~Lr 139 (145) ++|+.+++.+-.-|-.+.+ ...+. ...|+.+ .+.|+.-. =|+-.| T Consensus 75 ~~A~~l~~avr~Ale~~~~--~~~~~-------------------~~~ye~dT~lyR~s~Dv~~~~~r 121 (121) T protein:vir:18 75 DEVIAVATALRDAIEPHAH--ITRWG-------------------GQERDPETKRYRYSFDVDWIVTR 121 (121) T ss_pred HHHHHHHHHHHHHhhhcCc--ccCCC-------------------CCCCcccccceeeeeEEEEeecC Confidence 6677666655554433321 11010 0223322 23222211 011111 No 21 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=85.63 E-value=0.034 Score=28.66 Aligned_cols=108 Identities=16% Similarity=0.113 Sum_probs=59.3 Q ss_pred CCHHhhhhhhhhhhccc-cccCC-CCcCCC-ceEEEEeecCCCCc-------ceeccEEEEEEEeeccCChhhHHHHHHH Q lcl|NC_021302. 10 PPDIRFLRAWLLPIGGG-VGAKR-ETGDPF-PFTLIQKFDGWENS-------HTQYGFYQFDHLAVAADGKSAYTACEDY 79 (145) Q Consensus 10 P~~~~~lia~L~plg~~-v~~~R-~~gdPl-Pf~~V~rV~G~dd~-------~~~~~~~~v~~~~~~~d~~~~~eaa~da 79 (145) =+.++-|.+-|+|+-.+ |-+-. |.+.|+ ||.+-++|+|.... +.++.++||++++ .++.+|+.. T Consensus 1 Ms~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA------~t~~~A~~l 74 (118) T protein:vir:81 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWS------RSKQEAYLA 74 (118) T ss_pred CchHHHHHHHHHhhcCCccccccCCCCCccCceEEEEecCCcccccccCCCCCccceeEEEEEee------CCHHHHHHH Confidence 23455677778888542 44443 445685 99999999996422 2455789999997 556788888 Q ss_pred HHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 80 ARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 80 A~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) ++.+..-|-... .. ...-.|+ +.|+.+. +-|-.+-++-+=|--- T Consensus 75 ~~av~~al~~~~--~~----------------~~~~~~~-d~ye~dt-~l~r~~~Df~iw~~~~ 118 (118) T protein:vir:81 75 TVQVLRLVSEAP--DM----------------QVLSQPI-DDYVREI-KLYGSRVDVSMWYPIT 118 (118) T ss_pred HHHHHHHhhhcc--ce----------------eeccCCc-ccccccc-CceeEEEEEEEEecCC Confidence 887776553331 11 1111222 4555442 2222232222111111 No 22 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=83.39 E-value=0.054 Score=27.53 Aligned_cols=108 Identities=15% Similarity=0.108 Sum_probs=57.0 Q ss_pred CCHHhhhhhhhhhhccc-cccCC-CCcCCC-ceEEEEeecCCCCc-------ceeccEEEEEEEeeccCChhhHHHHHHH Q lcl|NC_021302. 10 PPDIRFLRAWLLPIGGG-VGAKR-ETGDPF-PFTLIQKFDGWENS-------HTQYGFYQFDHLAVAADGKSAYTACEDY 79 (145) Q Consensus 10 P~~~~~lia~L~plg~~-v~~~R-~~gdPl-Pf~~V~rV~G~dd~-------~~~~~~~~v~~~~~~~d~~~~~eaa~da 79 (145) =+.++-|.+-|+|+-.+ |-+-. |.+.|+ ||.+-++|+|...- ..++.++||++++ .++.+|+.. T Consensus 1 Ms~e~~l~a~L~~~~~~RVyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA------~t~~~A~~l 74 (118) T protein:vir:10 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWS------RSKQEAYLA 74 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCccceeEEEEEEee------CCHHHHHHH Confidence 12345666778888653 44433 456685 99999999997422 2444689999997 455778877 Q ss_pred HHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 80 ARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 80 A~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) ++.+..=| .- .. .......|+ +.|+.+. +-|-.+-++-+=|--- T Consensus 75 ~~av~~al-~~----~~-------------~~~~~~~~~-d~ye~dt-~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 75 TVQVLRLV-SE----AN-------------DMQVLSQPI-DDYVREI-KLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHh-hh----cc-------------cceeccCCC-ccccccC-CceEEEEEEEEeeecC Confidence 77663322 11 10 011122233 4554442 3333332222211111 No 23 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=83.35 E-value=0.051 Score=27.69 Aligned_cols=102 Identities=15% Similarity=0.202 Sum_probs=56.2 Q ss_pred cccCCCCCHHhhhhhhhhhhccc-----cccCC--CCcCCCceEEEEeecCCC-----CcceeccEEEEEEEeeccCChh Q lcl|NC_021302. 4 LLDREAPPDIRFLRAWLLPIGGG-----VGAKR--ETGDPFPFTLIQKFDGWE-----NSHTQYGFYQFDHLAVAADGKS 71 (145) Q Consensus 4 l~d~eaP~~~~~lia~L~plg~~-----v~~~R--~~gdPlPf~~V~rV~G~d-----d~~~~~~~~~v~~~~~~~d~~~ 71 (145) |+| .=+-.-|+|+-.+ +.++- .++-++||.+-++|.|.. ++..+..++||++++ . T Consensus 1 M~e-------~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~~~l~gp~~~~~~vQIDvyA------~ 67 (114) T protein:vir:93 1 MTE-------ADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVMGGQAESSVSVQIDVYA------G 67 (114) T ss_pred Cch-------HHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCcccccccCccccceEEEEEeee------C Confidence 332 3455666766331 33332 246789999999999854 334678899999997 4 Q ss_pred hHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceE Q lcl|NC_021302. 72 AYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRL 140 (145) Q Consensus 72 ~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~ 140 (145) ++++|+.++..+-.-|-.++ +. ...+ . ..|+ +..+-|-.+.++...- T Consensus 68 t~~~A~~l~~~v~~Al~~~~--~~---~~~~---------------~-~~ye-~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 68 TVTQARQIRQDAREAIMLLA--PG---SVSE---------------M-QDYI-PENRCYRATLEFQVTV 114 (114) T ss_pred CHHHHHHHHHHHHHHHhhcC--cE---eecC---------------C-Cccc-ccccceeeEEEEEEeC Confidence 45677777666655554443 11 1101 1 3343 3334443443333222 No 24 >protein:vir:99005 Length: 170 # NCBI annotation: gp34 # Family: family:all:32655 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655899;genbank:gi:109521471;genbank:GeneID:4157970 Probab=81.04 E-value=0.028 Score=29.10 Aligned_cols=136 Identities=13% Similarity=0.062 Sum_probs=79.6 Q ss_pred Cccc-ccCCCCCHHhh----hhhhhhhhcc--ccccCCC----------CcCCC----ceEEEEeecCCCCc--ceeccE Q lcl|NC_021302. 1 MIEL-LDREAPPDIRF----LRAWLLPIGG--GVGAKRE----------TGDPF----PFTLIQKFDGWENS--HTQYGF 57 (145) Q Consensus 1 ~~~l-~d~eaP~~~~~----lia~L~plg~--~v~~~R~----------~gdPl----Pf~~V~rV~G~dd~--~~~~~~ 57 (145) |++. =||=-|+++.- +..+++-|-- .|++==| .-+|. |-..|.|-+|.-|- -+|-+. T Consensus 1 Ma~~lPDW~egda~l~v~dl~~q~~qkl~Pn~~v~~WipdDw~~~~~~~da~pt~~~~Ptl~~~R~~Gq~D~d~~~Da~~ 80 (170) T protein:vir:99 1 MADFLPDWWEGPEYLDVEDLFAQHFQKLLPNVRVCHWIQPDWYIPTGFVDATPTYGTEPTLRLWRQPGQRDDESTTDAPL 80 (170) T ss_pred CccccCCccCCcHHHHHHHHHHHHHHHhCCCceeEeecCcccccccccccccccccccceEEEEecCCccchhhccchhh Confidence 7766 45544444333 3333222221 1332222 12444 66999999996543 488899 Q ss_pred EEEEEEeeccCChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeec Q lcl|NC_021302. 58 YQFDHLAVAADGKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVH 137 (145) Q Consensus 58 ~~v~~~~~~~d~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~ 137 (145) +|+.+.. .+++.+-+.-.-+|..|+.+..++ ++.- .+.+..+-.+.---.|.-.+-.+..-+=-.|.|++- T Consensus 81 lq~~vvt------~Sr~DS~~l~~fvr~im~a~~~g~--~~~~-~~qvv~i~sv~e~~Gp~~iP~~~~D~r~V~atyevt 151 (170) T protein:vir:99 81 LQFAAVT------RSHGDSIQLIEFVHTVMRALNNGH--KIKY-NGQLVGIKNVGLWLGPQTIPEGPIDEFFVPVTYKFT 151 (170) T ss_pred hhhhhhc------cChHHHHHHHHHHHHHHHhhhcCC--eeee-CCceEEEEEeccccccccCCCCCccceEeeeEEEEE Confidence 9998764 566777777788888888774333 3432 333455544443346666666677666667889886 Q ss_pred ceEEEeeC Q lcl|NC_021302. 138 LRLVSVAS 145 (145) Q Consensus 138 Lr~va~~~ 145 (145) .|+=-.-- T Consensus 152 v~~~r~~p 159 (170) T protein:vir:99 152 VAGKKLQP 159 (170) T ss_pred eecccCCc Confidence 65533322 No 25 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=79.62 E-value=0.09 Score=26.30 Aligned_cols=108 Identities=15% Similarity=0.114 Sum_probs=58.3 Q ss_pred CCHHhhhhhhhhhhccc-cccCC-CCcCCC-ceEEEEeecCCCCc-------ceeccEEEEEEEeeccCChhhHHHHHHH Q lcl|NC_021302. 10 PPDIRFLRAWLLPIGGG-VGAKR-ETGDPF-PFTLIQKFDGWENS-------HTQYGFYQFDHLAVAADGKSAYTACEDY 79 (145) Q Consensus 10 P~~~~~lia~L~plg~~-v~~~R-~~gdPl-Pf~~V~rV~G~dd~-------~~~~~~~~v~~~~~~~d~~~~~eaa~da 79 (145) =+.+.-|-+-|+|+-.+ |-+.- |.|.|+ ||.+-++|+|...- ..+..++||++++ .++..|+.. T Consensus 1 M~~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~ldG~~~~~~~~rvQIdvyA------~t~~~A~~l 74 (118) T protein:vir:97 1 MSYGRMLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWKEGGMPDKVNARVQVQIWS------RSKQEAYLA 74 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCCccceeEEEEEee------CCHHHHHHH Confidence 23466777888888653 54444 445685 99999999996422 2444679999997 455778777 Q ss_pred HHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 80 ARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 80 A~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) ++.+..=|-.. ... .....|+ +.|+.+. +.|-++-++-+=|--- T Consensus 75 ~~av~~al~~~--~~~----------------~~~~~~~-~~ye~dt-~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 75 TVQVLRIVSEA--NDM----------------QVLSQPI-DDYVREL-KLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHhhcc--ccc----------------ccccCCc-ccccccC-CceEEEEEEEEEeecC Confidence 77664322111 101 1111233 4465442 3333332222222111 No 26 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=77.28 E-value=0.13 Score=25.51 Aligned_cols=129 Identities=10% Similarity=0.122 Sum_probs=63.8 Q ss_pred CcccccCCCCCH-Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEeecCCCCcc----eeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQKFDGWENSH----TQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~rV~G~dd~~----~~~~~~~v~~~~~~~d 68 (145) |+ .++-.+ -+.+++.|.- +|+.+=-.-|.+.|+||.++-...-.++.. -..-..+||++. T Consensus 1 Ms----ms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws---- 72 (141) T protein:vir:96 1 MW----VSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYS---- 72 (141) T ss_pred Cc----cchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEE---- Confidence 32 222122 5667777754 343343455667789998876655554332 334456666664 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEee Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVA 144 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~ 144 (145) .+.++++|+++|..+-.-+ .. ...+++++.+...+....-..-.+....=-+.||-+|..=---=..|- T Consensus 73 ~~~g~~eak~ia~av~~AL----~~---~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~~~~ 141 (141) T protein:vir:96 73 QFATQYEAKLILSAIGYVL----NR---PIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNEGVY 141 (141) T ss_pred cCCCHHHHHHHHHHHHHHh----cc---cccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEeccccccCC Confidence 2456788888888777632 22 267789888877554333222111111111344444411000000000 No 27 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=77.28 E-value=0.13 Score=25.51 Aligned_cols=129 Identities=10% Similarity=0.122 Sum_probs=63.8 Q ss_pred CcccccCCCCCH-Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEeecCCCCcc----eeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQKFDGWENSH----TQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~rV~G~dd~~----~~~~~~~v~~~~~~~d 68 (145) |+ .++-.+ -+.+++.|.- +|+.+=-.-|.+.|+||.++-...-.++.. -..-..+||++. T Consensus 1 Ms----ms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws---- 72 (141) T protein:vir:10 1 MW----VSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYS---- 72 (141) T ss_pred Cc----cchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEE---- Confidence 32 222122 5667777754 343343455667789998876655554332 334456666664 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEee Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVA 144 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~ 144 (145) .+.++++|+++|..+-.-+ .. ...+++++.+...+....-..-.+....=-+.||-+|..=---=..|- T Consensus 73 ~~~g~~eak~ia~av~~AL----~~---~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~~~~ 141 (141) T protein:vir:10 73 QFATQYEAKLILSAIGYVL----NR---PIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNEGVY 141 (141) T ss_pred cCCCHHHHHHHHHHHHHHh----cc---cccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEeccccccCC Confidence 2456788888888777632 22 267789888877554333222111111111344444411000000000 No 28 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=77.28 E-value=0.13 Score=25.51 Aligned_cols=129 Identities=10% Similarity=0.122 Sum_probs=63.8 Q ss_pred CcccccCCCCCH-Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEeecCCCCcc----eeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQKFDGWENSH----TQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~rV~G~dd~~----~~~~~~~v~~~~~~~d 68 (145) |+ .++-.+ -+.+++.|.- +|+.+=-.-|.+.|+||.++-...-.++.. -..-..+||++. T Consensus 1 Ms----ms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws---- 72 (141) T protein:vir:94 1 MW----VSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYS---- 72 (141) T ss_pred Cc----cchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEE---- Confidence 32 222122 5667777754 343343455667789998876655554332 334456666664 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEee Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVA 144 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~ 144 (145) .+.++++|+++|..+-.-+ .. ...+++++.+...+....-..-.+....=-+.||-+|..=---=..|- T Consensus 73 ~~~g~~eak~ia~av~~AL----~~---~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~~~~ 141 (141) T protein:vir:94 73 QFATQYEAKLILSAIGYVL----NR---PIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNEGVY 141 (141) T ss_pred cCCCHHHHHHHHHHHHHHh----cc---cccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEeccccccCC Confidence 2456788888888777632 22 267789888877554333222111111111344444411000000000 No 29 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=73.87 E-value=0.17 Score=24.87 Aligned_cols=130 Identities=10% Similarity=0.088 Sum_probs=67.6 Q ss_pred CcccccCCCCCH-Hhhhhhhh------hh-hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWL------LP-IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L------~p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d 68 (145) |+ +++-.+ -+.++++| .. +++.+--+-|.+.|.||.++-...-.++. .-..-..+||++.. T Consensus 1 Ms----~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~--- 73 (145) T protein:vir:94 1 MW----VSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ--- 73 (145) T ss_pred Cc----hhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEc--- Confidence 32 232222 45666665 22 23334445667789999888444333322 23445566666642 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.++++|++++..+-.- | .. .+.+++++.+++.+.+..-..-.+....=-+.||-+|++=---=..|-. T Consensus 74 -~~g~~eak~ia~av~~a---L-~~---~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~~~~ 142 (145) T protein:vir:94 74 -ARNRDEASQIIQFLGFV---L-NN---EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred -CCCHHHHHHHHHHHHHH---h-cc---ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEeccccccccc Confidence 35678888888877653 2 12 2788899998886655443332222211123444444433322222222 No 30 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=73.83 E-value=0.17 Score=24.87 Aligned_cols=130 Identities=11% Similarity=0.089 Sum_probs=67.5 Q ss_pred CcccccCCCCCH-Hhhhhhhh------hh-hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWL------LP-IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L------~p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d 68 (145) |+ +++-.+ -+.++++| .. +++.+--+-|.+.|.||.++-...-.++. .-..-..+||++.. T Consensus 1 Ms----~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~--- 73 (145) T protein:vir:95 1 MW----VSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ--- 73 (145) T ss_pred Cc----hhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEc--- Confidence 32 232222 45666665 22 23334445667789999888444333322 23445566666642 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.++++|++++..+-.- | .. .+.+++++.+++.+.+..-..-.+....=-+.||-+|++=---=..|-. T Consensus 74 -~~g~~eak~ia~av~~a---L-~~---~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~~~~ 142 (145) T protein:vir:95 74 -ARNRDEASQIIQFLGFV---L-NN---EIEIDYYSFIKSRIDTQEVITDIDQYTKHGVIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred -CCCHHHHHHHHHHHHHH---h-cc---ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEeccccccccc Confidence 35678888888877653 2 12 2788899998876655443332222211123444444433322222222 No 31 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=70.77 E-value=0.2 Score=24.36 Aligned_cols=130 Identities=8% Similarity=0.054 Sum_probs=66.7 Q ss_pred ccCCCCCH-Hhhhhhhhh------h-hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 5 LDREAPPD-IRFLRAWLL------P-IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 5 ~d~eaP~~-~~~lia~L~------p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d~~~~ 72 (145) ..+++-.+ -+.++++|. . +++.+--+-|.+.|+||.++-...-.++. .-..-..+||++.. +++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~----~~g 76 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ----ARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEc----CCC Confidence 22333333 456667762 1 23234445667789999888444333322 23344566666642 456 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +++|++++..+-.- | .. . +.+++++.+++.+.+..-..-.+....=-+.||-+|++=---=..|-. T Consensus 77 ~~eak~ia~av~~a---L-~~-~--l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~~~~ 142 (145) T protein:vir:93 77 RDEASQIIQFLGFV---L-NN-E--IEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred HHHHHHHHHHHHHH---h-cc-c--cCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEeccccccccc Confidence 78889888877553 2 22 2 788899988876654433322111111113444444433322222322 No 32 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=70.77 E-value=0.2 Score=24.36 Aligned_cols=130 Identities=8% Similarity=0.054 Sum_probs=66.7 Q ss_pred ccCCCCCH-Hhhhhhhhh------h-hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 5 LDREAPPD-IRFLRAWLL------P-IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 5 ~d~eaP~~-~~~lia~L~------p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d~~~~ 72 (145) ..+++-.+ -+.++++|. . +++.+--+-|.+.|+||.++-...-.++. .-..-..+||++.. +++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~----~~g 76 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ----ARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEc----CCC Confidence 22333333 456667762 1 23234445667789999888444333322 23344566666642 456 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +++|++++..+-.- | .. . +.+++++.+++.+.+..-..-.+....=-+.||-+|++=---=..|-. T Consensus 77 ~~eak~ia~av~~a---L-~~-~--l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~~~~ 142 (145) T protein:vir:97 77 RDEASQIIQFLGFV---L-NN-E--IEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred HHHHHHHHHHHHHH---h-cc-c--cCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEeccccccccc Confidence 78889888877553 2 22 2 788899988876654433322111111113444444433322222322 No 33 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=70.77 E-value=0.2 Score=24.36 Aligned_cols=130 Identities=8% Similarity=0.054 Sum_probs=66.7 Q ss_pred ccCCCCCH-Hhhhhhhhh------h-hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 5 LDREAPPD-IRFLRAWLL------P-IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 5 ~d~eaP~~-~~~lia~L~------p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d~~~~ 72 (145) ..+++-.+ -+.++++|. . +++.+--+-|.+.|+||.++-...-.++. .-..-..+||++.. +++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~----~~g 76 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ----ARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEc----CCC Confidence 22333333 456667762 1 23234445667789999888444333322 23344566666642 456 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +++|++++..+-.- | .. . +.+++++.+++.+.+..-..-.+....=-+.||-+|++=---=..|-. T Consensus 77 ~~eak~ia~av~~a---L-~~-~--l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~~~~ 142 (145) T protein:vir:94 77 RDEASQIIQFLGFV---L-NN-E--IEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred HHHHHHHHHHHHHH---h-cc-c--cCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEeccccccccc Confidence 78889888877553 2 22 2 788899988876654433322111111113444444433322222322 No 34 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=70.70 E-value=0.21 Score=24.35 Aligned_cols=130 Identities=9% Similarity=0.072 Sum_probs=67.8 Q ss_pred ccCCCCCH-Hhhhhhhhh------h-hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 5 LDREAPPD-IRFLRAWLL------P-IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 5 ~d~eaP~~-~~~lia~L~------p-lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d~~~~ 72 (145) ..+++-.+ -+.++++|. . +++.+--+-|.+.|+||.++-...-.++. .-..-..+||++.. +.+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~a~~PYV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~----~~g 76 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ----ARN 76 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEc----CCC Confidence 22333333 456666662 1 23234445667789999888444433322 23345566666642 456 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +++|++++..+-.- | .. .+.+++++.+++.+.+..-..-.+....=-+.||-+|++=---=..|-. T Consensus 77 ~~eak~ia~av~~a---L-~~---~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~~~~~~ 142 (145) T protein:vir:95 77 RDEASQIIQFLGFV---L-NN---EIEIDYYSFIKSRIDTQEVITDIDRYTKHGIIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred HHHHHHHHHHHHHH---h-cc---ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEeccccccccc Confidence 78889888877653 2 12 2788899998886665443332222111123444444433322223322 No 35 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=70.36 E-value=0.21 Score=24.30 Aligned_cols=130 Identities=10% Similarity=0.089 Sum_probs=66.0 Q ss_pred CcccccCCCCCH-Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d 68 (145) |+ +++-.+ -+.++++|.- +++.+--+-|.+.|+||..+-...-.++. .-..-..+||++.. T Consensus 1 Ms----~s~~~aLq~Ai~~~L~ad~~l~alvggrV~D~~P~~a~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~--- 73 (145) T protein:vir:97 1 MW----VSVERYLFNKVYNKLKSNLIIRKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ--- 73 (145) T ss_pred Cc----chHhHHHHHHHHHHhhcChhHHHhhcCceecCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEc--- Confidence 32 232222 4566666651 23334445667889999888444333322 23445566666642 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.++++|++++..+-.- | .. .+.+++++.+++.+.+..-..-.+....=-+.||-+|++=.--=..|-. T Consensus 74 -~~g~~eak~ia~av~~a---L-~~---~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~~~~ 142 (145) T protein:vir:97 74 -ARNRDEASQIIQFLGFV---L-NN---EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRSVTN 142 (145) T ss_pred -CCCHHHHHHHHHHHHHH---h-cc---ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEecCceeccccc Confidence 35678888888877653 2 22 2788899988886554443332111111113333333332211111211 No 36 >protein:vir:1387 Length: 116 # NCBI annotation: Gp10 protein # Family: family:all:517 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612839;genbank:gi:20065973;genbank:GeneID:935788 Probab=70.05 E-value=0.12 Score=25.65 Aligned_cols=106 Identities=11% Similarity=-0.015 Sum_probs=69.1 Q ss_pred CCCCCHHhhhhhhhhhhccccccCCCCcC-CCceEEEEeecCCC-----Ccc-eeccEEEEEEEeeccCChhhHHHHHHH Q lcl|NC_021302. 7 REAPPDIRFLRAWLLPIGGGVGAKRETGD-PFPFTLIQKFDGWE-----NSH-TQYGFYQFDHLAVAADGKSAYTACEDY 79 (145) Q Consensus 7 ~eaP~~~~~lia~L~plg~~v~~~R~~gd-PlPf~~V~rV~G~d-----d~~-~~~~~~~v~~~~~~~d~~~~~eaa~da 79 (145) -|.=+..+.+..-|.||+.+|.....+|+ -.||.++.-..-.. |+. .-.=.+||+++..++. .+.++ T Consensus 1 ~~~m~I~~~i~~~Lk~i~ipV~~~~y~~~~~~~~Itf~~y~e~~~~yaDd~e~~t~~~iQVDI~sk~~~------~~~~l 74 (116) T protein:vir:13 1 MEDFDIIALVYECLECLNVPVIEGWYDEELNKTHITVHEYLEQDESFEDDEAREEEHNIQIDVWSKDSL------EAFKL 74 (116) T ss_pred CCccchhHHHHHHHhhcCCeeeecccCCCCccceEEEEeeecCCCcccCCeeeeEEEEEEEEEeecCCc------cHHHH Confidence 33347888999999999998888776777 47988887775432 332 4455789999986663 34568 Q ss_pred HHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEE--eeecce Q lcl|NC_021302. 80 ARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIAR--YSVHLR 139 (145) Q Consensus 80 A~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aR--Y~v~Lr 139 (145) +.++.+-|++..++. ... .+.|..| .+..+.=| |-..|. T Consensus 75 ~~~V~~lMk~~GF~r------------------~~~---~d~ye~dt~iyhk~~RF~y~~el~ 116 (116) T protein:vir:13 75 KKAIKKLLKKNNFYF------------------DSS---EDFYETKTRIYHKGLRFSYISEIS 116 (116) T ss_pred HHHHHHHHHHcCCEe------------------eec---CCCccchhhhhhhhhhheeeeecC Confidence 888999998886432 111 1446554 34433333 333344 No 37 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=66.72 E-value=0.26 Score=23.76 Aligned_cols=127 Identities=9% Similarity=0.053 Sum_probs=67.9 Q ss_pred cCCCCC-H-Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 6 DREAPP-D-IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 6 d~eaP~-~-~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~d~~~~ 72 (145) =|+.++ + -+.+.+.|.- +|+.+=-.-|.+.|.||..+-...-.++. .-..-..+||+.-. +.+ T Consensus 1 ~~msa~~aLq~Ai~~~L~ad~~l~alvggrVyD~~P~~~~~PYV~lG~~~~~~~~~~~~~g~~~~~tl~Vws~----~~g 76 (140) T protein:vir:96 1 MWVTAEPLLYNKIMNNLIENPITDKLVGGRVFDCVQKDVVYPYIVVGESNVTESERSPGMREIIAITFHVYSQ----YEN 76 (140) T ss_pred CccchhHHHHHHHHHHhccChhHHhhcCcccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEc----CCC Confidence 123332 2 4566666651 22222223355668999977433333322 22334566666632 345 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) +.+|+++|..+..-+ . . .+.+++++.+++.+....-..-.+....=-+.||-+|++----=..| T Consensus 77 ~~ea~~ia~ai~~aL-~---~---~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~ve~~~~~~~~ 140 (140) T protein:vir:96 77 GAEARELLKYLNYAC-R---L---NINFKDYELEWIKKDNSQVFTDIDQYTKHGVLRLLYKVRHKTLQERV 140 (140) T ss_pred HHHHHHHHHHHHHHh-c---C---CccCCCceEEEEEEeeeEEeecCCCceEEEEEEEEEEEeecccccCC Confidence 788888888777643 1 2 26778999888766654433332322222256677776654444444 No 38 >protein:vir:98343 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918935;genbank:gi:119443697;genbank:GeneID:4594505 Probab=65.79 E-value=0.28 Score=23.63 Aligned_cols=117 Identities=11% Similarity=-0.003 Sum_probs=75.8 Q ss_pred CcccccCCC-CCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCC-----Ccc-eeccEEEEEEEeeccCChhhH Q lcl|NC_021302. 1 MIELLDREA-PPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWE-----NSH-TQYGFYQFDHLAVAADGKSAY 73 (145) Q Consensus 1 ~~~l~d~ea-P~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~d-----d~~-~~~~~~~v~~~~~~~d~~~~~ 73 (145) |.+++-.=. |-..+.+..-|.|++-+|.-..=.|..-||..+.-..-.. |+. .-.=.+||++++.++|.. T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk~d~~--- 77 (126) T protein:vir:98 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEPN--- 77 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecCCCHH--- Confidence 777765433 3357788888999999888887788889999988775442 333 445579999977777633 Q ss_pred HHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 74 TACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 74 eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.+.++.+-|++..+. +.... +.|+.| .+.++.=||+-.+-=.-.-. T Consensus 78 ----~l~~~V~~lMk~~GF~------------------r~~~~---dlYE~DtklyHk~~RF~~~~~~~~~~~ 125 (126) T protein:vir:98 78 ----EQAEKIVELLKVINFQ------------------CYYRE---PLYESDVMSFRHIIRAKGSILSMKLEE 125 (126) T ss_pred ----HHHHHHHHHHHHcCCe------------------eeecC---CCccchhhhheeeeeeeeeecceeecc Confidence 3567788888887533 22222 468766 48777777753221110001 No 39 >protein:vir:9415 Length: 126 # NCBI annotation: phi PVL orf 12-like protein # Family: family:all:517 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803393;genbank:gi:29028705;genbank:GeneID:1258143 Probab=65.79 E-value=0.28 Score=23.63 Aligned_cols=117 Identities=11% Similarity=-0.003 Sum_probs=75.8 Q ss_pred CcccccCCC-CCHHhhhhhhhhhhccccccCCCCcCCCceEEEEeecCCC-----Ccc-eeccEEEEEEEeeccCChhhH Q lcl|NC_021302. 1 MIELLDREA-PPDIRFLRAWLLPIGGGVGAKRETGDPFPFTLIQKFDGWE-----NSH-TQYGFYQFDHLAVAADGKSAY 73 (145) Q Consensus 1 ~~~l~d~ea-P~~~~~lia~L~plg~~v~~~R~~gdPlPf~~V~rV~G~d-----d~~-~~~~~~~v~~~~~~~d~~~~~ 73 (145) |.+++-.=. |-..+.+..-|.|++-+|.-..=.|..-||..+.-..-.. |+. .-.=.+||++++.++|.. T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk~d~~--- 77 (126) T protein:vir:94 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEPN--- 77 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecCCCHH--- Confidence 777765433 3357788888999999888887788889999988775442 333 445579999977777633 Q ss_pred HHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCC-ceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 74 TACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNT-DVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 74 eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd-~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.+.++.+-|++..+. +.... +.|+.| .+.++.=||+-.+-=.-.-. T Consensus 78 ----~l~~~V~~lMk~~GF~------------------r~~~~---dlYE~DtklyHk~~RF~~~~~~~~~~~ 125 (126) T protein:vir:94 78 ----EQAEKIVELLKVINFQ------------------CYYRE---PLYESDVMSFRHIIRAKGSILSMKLEE 125 (126) T ss_pred ----HHHHHHHHHHHHcCCe------------------eeecC---CCccchhhhheeeeeeeeeecceeecc Confidence 3567788888887533 22222 468766 48777777753221110001 No 40 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=57.36 E-value=0.44 Score=22.56 Aligned_cols=121 Identities=11% Similarity=0.167 Sum_probs=67.4 Q ss_pred ccCCCCCH--Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEe--ecCCCCc--ceeccEEEEEEEeeccCChh Q lcl|NC_021302. 5 LDREAPPD--IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQK--FDGWENS--HTQYGFYQFDHLAVAADGKS 71 (145) Q Consensus 5 ~d~eaP~~--~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~r--V~G~dd~--~~~~~~~~v~~~~~~~d~~~ 71 (145) ..|.-|+. -+.++++|.- +| .+--+-|.+.|.||.++-. +--.+++ .-..-..+||++.. . T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg-~I~D~~P~~~~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~-~---- 74 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVN-QVTESPGKDDPYPYVVIGDQSSTPFETKSSFGENITMDFHVWGG-T---- 74 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhh-hhhcCCCCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEC-C---- Confidence 34554532 5566666643 22 2444556677899988732 3222222 23344556776642 1 Q ss_pred hHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeec Q lcl|NC_021302. 72 AYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVH 137 (145) Q Consensus 72 ~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~ 137 (145) ++.+|++++..+-.- |. .-.+++++++.+++.+.+..-....+....=-+.||.++.+=+ T Consensus 75 g~~ea~~ia~av~~a---L~---~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 75 TRAEAQDISSRVLEA---LT---YKPLMFEGFTFVAKKLVLAQVITDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred ChHHHHHHHHHHHHH---hc---CCCcccCCceEEEeEEeeeeEEecCCCceEEEEEEEEEEEecC Confidence 235678887776553 32 1236788999998877665554443333222356666666666 No 41 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=56.32 E-value=0.46 Score=22.44 Aligned_cols=102 Identities=17% Similarity=0.189 Sum_probs=51.6 Q ss_pred CcccccCCCCCHHhhhhhhhhhhccc-c-------ccCCCCcCCCceEEEEeecCCC-----CcceeccEEEEEEEeecc Q lcl|NC_021302. 1 MIELLDREAPPDIRFLRAWLLPIGGG-V-------GAKRETGDPFPFTLIQKFDGWE-----NSHTQYGFYQFDHLAVAA 67 (145) Q Consensus 1 ~~~l~d~eaP~~~~~lia~L~plg~~-v-------~~~R~~gdPlPf~~V~rV~G~d-----d~~~~~~~~~v~~~~~~~ 67 (145) |. | .=|-+-|+||-.+ | +..=.+.-++||.+-++|.|.. ++..+.-++||++++... T Consensus 1 M~---e-------~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~vQIDvyA~t~ 70 (115) T protein:vir:19 1 MN---E-------DNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSADVLCGQAESRVSVQVDVYSTSI 70 (115) T ss_pred Cc---h-------hHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCcccccccCCCccceEEEEEEeeCCh Confidence 33 2 3455666765321 2 2222235589999999998854 234577899999997444 Q ss_pred CChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceE Q lcl|NC_021302. 68 DGKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRL 140 (145) Q Consensus 68 d~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~ 140 (145) +.|+..+..+-.=|-.+ .+.... +. ..|+ +..+-|-++.++..+= T Consensus 71 ------~~A~~l~~~i~~Al~~~--~p~~~~------------------~~-~~ye-~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 71 ------AESRSLRDLVLASLEPL--TPTEVV------------------KI-PGYE-PDYRLYRATLDFKVTP 115 (115) T ss_pred ------HHHHHHHHHHHHHhhhc--CCEEec------------------CC-CCcc-cchhceeeEEEEEecC Confidence 55555554444433222 222111 11 4453 2333333333333222 No 42 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=55.93 E-value=0.47 Score=22.39 Aligned_cols=120 Identities=8% Similarity=0.034 Sum_probs=64.3 Q ss_pred CcccccCCCCCH-Hhhhhhhhh-------hhccccccCCCCcCCCceEEEEeecCCC--Cc--ceeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWLL-------PIGGGVGAKRETGDPFPFTLIQKFDGWE--NS--HTQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L~-------plg~~v~~~R~~gdPlPf~~V~rV~G~d--d~--~~~~~~~~v~~~~~~~d 68 (145) |+ +++=.+ -+.++++|. =+|+.+--+-|.+.+.||.++-...-.+ ++ .-..-..+||+... T Consensus 1 Ms----~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~--- 73 (145) T protein:vir:10 1 MW----VSVERYLFNKIYNKLKSNPIIKKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMFEDVGVTLHVYSQ--- 73 (145) T ss_pred Cc----hhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEc--- Confidence 43 221111 455666654 2333355566678899999884443333 22 23445566666643 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.++++|++++..+-.- | .. .+.+++++.+++.+.+..-.+-. |.. ++..-|+|-++.- T Consensus 74 -~~g~~ea~~ia~av~~a---L-~a---~l~l~~~~lv~l~~~~~~~~rd~----dg~------~~hgvl~~ra~ve 132 (145) T protein:vir:10 74 -ARNRDEASQIIQYLGFV---L-NS---EIEINNYSFIKSRIDTQEVITDI----DQY------TKHGIIRLIFKYR 132 (145) T ss_pred -CCCHHHHHHHHHHHHHH---h-CC---CcCCCCCeEEEEEEeeeeEeecC----CCc------eEEEEEEEEEEEe Confidence 34567788888877653 2 22 26788999888866544333221 111 2233345555555 No 43 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=55.92 E-value=0.47 Score=22.39 Aligned_cols=120 Identities=8% Similarity=0.034 Sum_probs=64.3 Q ss_pred CcccccCCCCCH-Hhhhhhhhh-------hhccccccCCCCcCCCceEEEEeecCCC--Cc--ceeccEEEEEEEeeccC Q lcl|NC_021302. 1 MIELLDREAPPD-IRFLRAWLL-------PIGGGVGAKRETGDPFPFTLIQKFDGWE--NS--HTQYGFYQFDHLAVAAD 68 (145) Q Consensus 1 ~~~l~d~eaP~~-~~~lia~L~-------plg~~v~~~R~~gdPlPf~~V~rV~G~d--d~--~~~~~~~~v~~~~~~~d 68 (145) |+ +++=.+ -+.++++|. =+|+.+--+-|.+.+.||.++-...-.+ ++ .-..-..+||+... T Consensus 1 Ms----~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~--- 73 (145) T protein:vir:10 1 MW----VSVERYLFNKIYNKLKSNPIVSKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMFEDVGVTLHVYSQ--- 73 (145) T ss_pred Cc----hhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEc--- Confidence 43 221111 455666654 2333355566678899999884443333 22 23445566666643 Q ss_pred ChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 69 GKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 69 ~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.++++|++++..+-.- | .. .+.+++++.+++.+.+..-.+-. |.. ++..-|+|-++.- T Consensus 74 -~~g~~ea~~ia~av~~a---L-~a---~l~l~~~~lv~l~~~~~~~~rd~----dg~------~~hgvl~~ra~ve 132 (145) T protein:vir:10 74 -ARNRDEASQIIQYLGFV---L-NS---EIEINNYSFIKSRIDTQEVITDI----DQY------TKHGIIRLIFKYR 132 (145) T ss_pred -CCCHHHHHHHHHHHHHH---h-CC---CcCCCCCeEEEEEEeeeeEeecC----CCc------eEEEEEEEEEEEe Confidence 34567788888877653 2 22 26788999888866544333221 111 2233355555555 No 44 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=55.04 E-value=0.49 Score=22.29 Aligned_cols=119 Identities=10% Similarity=0.049 Sum_probs=62.6 Q ss_pred CcccccCCCCCH--Hhhhhhhh-------hhhccccccCCCCcCCCceEEEEeecCCC--Cc--ceeccEEEEEEEeecc Q lcl|NC_021302. 1 MIELLDREAPPD--IRFLRAWL-------LPIGGGVGAKRETGDPFPFTLIQKFDGWE--NS--HTQYGFYQFDHLAVAA 67 (145) Q Consensus 1 ~~~l~d~eaP~~--~~~lia~L-------~plg~~v~~~R~~gdPlPf~~V~rV~G~d--d~--~~~~~~~~v~~~~~~~ 67 (145) |+- .|+. -+.++++| +=+|+.+-.+-|.+.+.||..+-...-.+ .+ .-..-..+||++.. T Consensus 1 M~~-----s~~~aLq~ai~~~L~ad~~l~~lvg~~vyD~~P~~~~~PyV~lG~~~~~~~~t~~~~~~~~~lti~Vws~-- 73 (145) T protein:vir:12 1 MWV-----SVERYLFNKVYNKLKSNPIIQKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQ-- 73 (145) T ss_pred Ccc-----cHHHHHHHHHHHHhhcChhHHHhcCcccccCCccCCCCCEEEeccceeeecCCCcccceEEEEEEEEEEc-- Confidence 432 2332 56778877 33344455566667889999874333332 22 24445566777743 Q ss_pred CChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEeeC Q lcl|NC_021302. 68 DGKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSVAS 145 (145) Q Consensus 68 d~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~~~ 145 (145) +.++.+|+++++.+.+- | .. .+.+++++.+++......-.. + .|... |.--|+|.+... T Consensus 74 --~~gr~ea~~ia~ai~~a---L-~~---~l~l~~~~lv~l~~~~~~~~r--d--~d~~~------~hgvl~~ra~i~ 132 (145) T protein:vir:12 74 --ARNRDEASQIIQFLGFV---L-NN---EIEIDYYSFIKSRIDTQEVIT--D--IDQYT------KHGIIRLVFKYR 132 (145) T ss_pred --CccHHHHHHHHHHHHHH---h-cc---ccCCCCceEEEEEEeeEEEEe--c--CCCce------EEEEEEEEEEEE Confidence 34567888888877542 2 12 366778887766444322111 1 12211 222245555444 No 45 >protein:vir:2508 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:11707 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569749;genbank:gi:18496899;genbank:GeneID:932297 Probab=50.60 E-value=0.46 Score=22.42 Aligned_cols=131 Identities=18% Similarity=0.190 Sum_probs=70.6 Q ss_pred cccccCCCCC--HHhhhhhhhhhhcc--ccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhhHHHHH Q lcl|NC_021302. 2 IELLDREAPP--DIRFLRAWLLPIGG--GVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSAYTACE 77 (145) Q Consensus 2 ~~l~d~eaP~--~~~~lia~L~plg~--~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~~eaa~ 77 (145) .-|.--..|- +-..|+.-|.-=|- .|+..-|.|.|.-|.+++|++-.-+--...-.+.+.+|- +---.++ T Consensus 1 ~~lvP~v~P~~A~RaYLl~~L~~Rg~~L~Vga~pPeG~Pt~Yallsr~~s~r~~~l~~~LIRvRVyd------~D~~~~~ 74 (139) T protein:vir:25 1 MTLVPSVGPLVAARAYLLDELAARANPLPVGANPPEGEPSSYALLSRPGSDRDVFLGHFLIRVRVFD------SDVVRLE 74 (139) T ss_pred CcccCccchHHHHHHHHHHHHhhcCCcccccccCCCCCcceeEEEecCCCCceeehhheeEEEEeec------chhhhhc Confidence 1111111121 11222222222111 255556899999999999998776655677777777773 2234677 Q ss_pred HHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEE---EEE-eeecceEEEee Q lcl|NC_021302. 78 DYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERF---IAR-YSVHLRLVSVA 144 (145) Q Consensus 78 daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry---~aR-Y~v~Lr~va~~ 144 (145) .-|++.|+ ++|. ..+.+|.+|+|.+.-- .-.-+-.|. ++||+.+-=| .+= +.++|+=.--. T Consensus 75 r~A~LLHa--~Llg-A~h~kvv~PeG~vWiT-Ga~H~~GPa--~~DD~~v~LfG~q~aVFWTi~LkP~r~~ 139 (139) T protein:vir:25 75 RNADLLHA--LLCG-ANHRKVHTPEGDVWIT-GAAHHYGPA--DLDDPDVPLFGMQAAVFWTIGLKPARRS 139 (139) T ss_pred cchhHHHH--HHhh-hhcceeeccCCceEee-ccccccCCc--ccCCCccccccchhheeeeecccccccC Confidence 78899999 5554 5588899999877332 223344554 7777753221 111 22333211111 No 46 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=48.31 E-value=0.67 Score=21.52 Aligned_cols=127 Identities=11% Similarity=0.101 Sum_probs=61.8 Q ss_pred CcccccCCCCCH--Hhhhhhhhhh-------hccccccCCCCcCCCceEEEEeecCCCCc----ceeccEEEEEEEeecc Q lcl|NC_021302. 1 MIELLDREAPPD--IRFLRAWLLP-------IGGGVGAKRETGDPFPFTLIQKFDGWENS----HTQYGFYQFDHLAVAA 67 (145) Q Consensus 1 ~~~l~d~eaP~~--~~~lia~L~p-------lg~~v~~~R~~gdPlPf~~V~rV~G~dd~----~~~~~~~~v~~~~~~~ 67 (145) |+ +.|+. -+.+.+.|.- +++.+--+-|.+.+.||..+-...-.++. .-..-..+||++.. T Consensus 1 Ms-----ms~~~aLq~Ai~a~L~ada~l~alvg~~VyD~~P~~~~~Pyv~lG~~~~~~~~~~~~~g~~~~~~i~Vws~-- 73 (140) T protein:vir:96 1 MW-----VSVEPELTVQIYKRLKASPIINKFVGDRVFDVVQEDAVYPYIVVGESNVTNNESSTMMRETVGIVIHVYSQ-- 73 (140) T ss_pred CC-----ccHHHHHHHHHHHHhhcChhHHHhcCCccccCCccCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEc-- Confidence 32 22322 4556666542 23334445667789999988544443322 23344566666642 Q ss_pred CChhhHHHHHHHHHHHHHHhhhhcCCCceEEEcCCCeEEeeeeEeeccCccccccCCCceEEEEEEeeecceEEEe Q lcl|NC_021302. 68 DGKSAYTACEDYARTIKRRMLYLRDRPWTEVTVPGWGVATADVVRCTASPRHDPYNNTDVERFIARYSVHLRLVSV 143 (145) Q Consensus 68 d~~~~~eaa~daA~~~hrRMl~L~~~~~~~v~~~dg~~a~~d~~~~~~~P~~~~Y~dd~v~Ry~aRY~v~Lr~va~ 143 (145) +.++.+|+++|..+-.- | .. .+.+++++.+++.+....-..-.+....=-+.+|-+|+.=.--=..| T Consensus 74 --~~g~~ea~~ia~av~~A---L-~~---~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~r~~v~~~~~~~~~ 140 (140) T protein:vir:96 74 --FATQYEAKQIISAIGYV---L-NR---PIDIENYEFQFSRIDSQSVFPDIDRFTKHGTIRLLFKYRHIKKGEGV 140 (140) T ss_pred --CCCHHHHHHHHHHHHHH---h-CC---CccCCCCeEEEEEEeeeEEEecCCCceEEEEEEEEEEEEeeccccCC Confidence 34578888888876653 2 12 37888999988755544333221111000122222221100000111 No 47 >protein:vir:99925 Length: 147 # NCBI annotation: gp12 # Family: family:all:11707 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655529;genbank:gi:109392299;genbank:GeneID:4157094 Probab=42.88 E-value=0.79 Score=21.15 Aligned_cols=131 Identities=15% Similarity=0.207 Sum_probs=72.1 Q ss_pred CcccccCCCCCH------Hhhhhhhhhhhcc--ccccCCCCcCCCceEEEEeecCCCCcceeccEEEEEEEeeccCChhh Q lcl|NC_021302. 1 MIELLDREAPPD------IRFLRAWLLPIGG--GVGAKRETGDPFPFTLIQKFDGWENSHTQYGFYQFDHLAVAADGKSA 72 (145) Q Consensus 1 ~~~l~d~eaP~~------~~~lia~L~plg~--~v~~~R~~gdPlPf~~V~rV~G~dd~~~~~~~~~v~~~~~~~d~~~~ 72 (145) |.. -|--.|+- -..|+.-|.-=|- .|+.-=|.|.|.-|.+++|++-.-+--...-.+.+.+|- +- T Consensus 1 ~~~-~~~~~P~v~P~~A~RaYLl~~L~~Rg~~L~VgatpPeG~Pt~Yallsr~~s~r~~~l~~~LIRvRVyd------~D 73 (147) T protein:vir:99 1 MTA-PEMVGPTMEPAIACRAYLMRRLDDRGIDLSVGATPPDGKPTRYVLVNQVDSRRRGPVADYLIRTRVYN------AD 73 (147) T ss_pred CCC-ccccCCcchhHHHHHHHHHHHHhhcCCcccccccCCCCCCcceEEEecCCCCceeehhheeEEEEeec------ch Confidence 110 01112221 1222222222221 367777889999999999998776655677777777773 22 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCceEEEcCC-CeEEeeeeEeeccCcccccc-CCCceEEEEEE-----eeecceEEEeeC Q lcl|NC_021302. 73 YTACEDYARTIKRRMLYLRDRPWTEVTVPG-WGVATADVVRCTASPRHDPY-NNTDVERFIAR-----YSVHLRLVSVAS 145 (145) Q Consensus 73 ~eaa~daA~~~hrRMl~L~~~~~~~v~~~d-g~~a~~d~~~~~~~P~~~~Y-~dd~v~Ry~aR-----Y~v~Lr~va~~~ 145 (145) --.++.-|++.|+ ++|. ..+.+|.+|| |.+. +-.-.-+-.| .++ +|+.+ -.++- +.++|+=+-=-| T Consensus 74 ~~~~~r~A~LLHa--~Llg-A~h~kvv~Pd~G~vW-iTGa~H~~GP--ad~~DD~~v-~LfG~q~aVFWTi~LkP~~~~~ 146 (147) T protein:vir:99 74 AYECGQHATLLHA--ALLG-AAQARIVFPDVGQLW-VTGTEHVSGP--SDITDDDTT-TLFGQAISVFWTVALKPIEGNS 146 (147) T ss_pred hhhhccchhHHHH--HHhh-hhcceeeecCCCceE-eecccccccc--cccCCCCCc-cccchhhheeeeeeeeeccCCC Confidence 3467778899999 5553 5578899999 5542 1122333344 466 44432 22221 445665554444 Done!