Query lcl|NC_021334.1_cdsid_YP_008059975.1 [gene=M184_gp74] [protein=hypothetical protein] [protein_id=YP_008059975.1] [location=complement(46008..46412)] Match_columns 134 No_of_seqs 14 out of 17 Neff 3.9 Searched_HMMs 1612 Date Thu Nov 7 18:25:44 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_74 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_74_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:7994 Length: 134 # 100.0 2.6E-61 1.6E-64 352.7 10.8 134 1-134 1-134 (134) 2 protein:vir:102609 Length: 134 100.0 3E-61 1.9E-64 352.4 10.9 134 1-134 1-134 (134) 3 protein:vir:105826 Length: 134 100.0 3E-61 1.9E-64 352.4 10.9 134 1-134 1-134 (134) 4 protein:vir:8107 Length: 138 # 100.0 2.1E-57 1.3E-60 331.3 10.3 132 1-134 1-135 (138) 5 protein:vir:8331 Length: 150 # 100.0 1.8E-52 1.1E-55 304.2 8.5 125 1-134 20-147 (150) 6 protein:vir:101655 Length: 134 100.0 1.5E-34 9E-38 206.1 9.7 129 1-134 1-134 (134) 7 protein:vir:7860 Length: 134 # 100.0 1.5E-34 9E-38 206.1 9.7 129 1-134 1-134 (134) 8 protein:vir:8433 Length: 140 # 99.8 5.1E-24 3.1E-27 148.3 7.3 130 1-134 1-138 (140) 9 protein:vir:1643 Length: 111 # 97.6 1.7E-06 1.1E-09 52.2 10.2 107 8-134 1-109 (111) 10 protein:vir:94768 Length: 111 97.5 6.2E-07 3.8E-10 54.6 6.8 109 8-132 1-111 (111) 11 protein:vir:9579 Length: 111 # 97.5 6.2E-07 3.9E-10 54.6 6.5 109 8-134 1-111 (111) 12 protein:vir:9764 Length: 111 # 97.1 2.6E-06 1.6E-09 51.2 6.3 107 8-132 1-111 (111) 13 protein:vir:100242 Length: 114 97.0 1E-05 6.2E-09 48.0 8.3 107 8-133 1-114 (114) 14 protein:vir:1438 Length: 115 # 96.8 2.3E-05 1.4E-08 46.0 9.0 108 1-133 1-115 (115) 15 protein:vir:80371 Length: 115 96.8 2.2E-05 1.3E-08 46.2 8.4 107 8-133 1-115 (115) 16 protein:vir:100116 Length: 115 96.6 3.6E-05 2.2E-08 45.0 8.6 108 1-133 1-115 (115) 17 protein:vir:4348 Length: 121 # 95.9 0.00032 2E-07 39.7 10.4 105 5-134 1-118 (121) 18 protein:vir:81066 Length: 118 95.7 0.00069 4.3E-07 37.9 11.4 107 7-134 1-116 (118) 19 protein:vir:98426 Length: 131 95.6 0.00099 6.1E-07 37.1 11.8 117 1-134 1-125 (131) 20 protein:vir:10368 Length: 118 95.1 0.0016 1E-06 35.8 11.4 107 1-134 1-116 (118) 21 protein:vir:1892 Length: 121 # 95.0 0.00076 4.7E-07 37.7 9.3 105 5-134 1-118 (121) 22 protein:vir:97070 Length: 118 94.8 0.0022 1.4E-06 35.2 11.4 107 7-134 1-116 (118) 23 protein:vir:93602 Length: 114 94.6 0.0028 1.7E-06 34.6 11.3 101 1-131 1-114 (114) 24 protein:vir:78124 Length: 139 94.4 0.0027 1.6E-06 34.7 10.8 120 2-134 1-134 (139) 25 protein:vir:195 Length: 115 # 93.2 0.0078 4.8E-06 32.2 11.2 101 1-133 1-115 (115) 26 protein:vir:1387 Length: 116 # 90.4 0.0045 2.8E-06 33.5 6.7 106 4-132 1-116 (116) 27 protein:vir:99005 Length: 170 88.8 0.0084 5.2E-06 32.0 6.9 129 1-134 1-155 (170) 28 protein:vir:5979 Length: 134 # 81.4 0.086 5.3E-05 26.4 11.8 123 1-130 1-134 (134) 29 protein:vir:96894 Length: 140 74.9 0.15 9.5E-05 25.0 12.8 122 1-134 1-134 (140) 30 protein:vir:1244 Length: 145 # 64.7 0.3 0.00018 23.5 13.0 118 1-134 1-130 (145) 31 protein:vir:96260 Length: 141 63.1 0.32 0.0002 23.3 12.4 122 1-134 1-134 (141) 32 protein:vir:94096 Length: 141 63.1 0.32 0.0002 23.3 12.4 122 1-134 1-134 (141) 33 protein:vir:105892 Length: 141 63.1 0.32 0.0002 23.3 12.4 122 1-134 1-134 (141) 34 protein:vir:95111 Length: 145 57.5 0.43 0.00027 22.6 12.9 122 1-134 1-140 (145) 35 protein:vir:96125 Length: 140 55.9 0.47 0.00029 22.4 12.1 126 1-134 1-140 (140) 36 protein:vir:9415 Length: 126 # 54.3 0.51 0.00031 22.2 8.3 109 1-134 1-121 (126) 37 protein:vir:98343 Length: 126 54.3 0.51 0.00031 22.2 8.3 109 1-134 1-121 (126) 38 protein:vir:95961 Length: 145 53.6 0.52 0.00032 22.1 12.9 122 1-134 1-140 (145) 39 protein:vir:94794 Length: 145 53.6 0.52 0.00033 22.1 12.9 122 1-134 1-140 (145) 40 protein:vir:93736 Length: 145 47.8 0.69 0.00043 21.5 12.8 122 1-134 1-140 (145) 41 protein:vir:94488 Length: 145 47.8 0.69 0.00043 21.5 12.8 122 1-134 1-140 (145) 42 protein:vir:97421 Length: 145 47.8 0.69 0.00043 21.5 12.8 122 1-134 1-140 (145) 43 protein:vir:105337 Length: 145 46.6 0.73 0.00045 21.3 12.6 118 1-134 1-130 (145) 44 protein:vir:107096 Length: 145 46.5 0.73 0.00045 21.3 12.6 118 1-134 1-130 (145) 45 protein:vir:1274 Length: 162 # 45.5 0.77 0.00048 21.2 6.4 109 1-132 33-162 (162) 46 protein:vir:97325 Length: 145 41.8 0.91 0.00057 20.8 12.8 122 1-134 1-140 (145) 47 protein:vir:81093 Length: 126 25.8 2 0.0013 18.9 7.4 109 1-134 1-121 (126) 48 protein:vir:80001 Length: 126 25.8 2 0.0013 18.9 7.4 109 1-134 1-121 (126) 49 protein:vir:105008 Length: 119 23.7 2.3 0.0014 18.6 10.4 103 6-133 1-119 (119) 50 protein:vir:102888 Length: 119 23.7 2.3 0.0014 18.6 10.4 103 6-133 1-119 (119) 51 protein:vir:107581 Length: 119 23.7 2.3 0.0014 18.6 10.4 103 6-133 1-119 (119) 52 protein:vir:102086 Length: 119 23.7 2.3 0.0014 18.6 10.4 103 6-133 1-119 (119) 53 protein:vir:2508 Length: 139 # 22.6 2.4 0.0015 18.5 8.0 125 1-134 1-136 (139) 54 protein:vir:2689 Length: 131 # 20.9 2.3 0.0014 18.6 4.5 109 3-134 1-122 (131) 55 protein:vir:9364 Length: 131 # 20.9 2.3 0.0014 18.6 4.5 109 3-134 1-122 (131) 56 protein:vir:96972 Length: 131 20.9 2.3 0.0014 18.6 4.5 109 3-134 1-122 (131) 57 protein:vir:78648 Length: 131 20.9 2.3 0.0014 18.6 4.5 109 3-134 1-122 (131) 58 protein:vir:99925 Length: 147 20.7 2.7 0.0017 18.2 7.9 125 1-134 2-142 (147) No 1 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=100.00 E-value=2.6e-61 Score=352.73 Aligned_cols=134 Identities=97% Similarity=1.428 Sum_probs=133.1 Q ss_pred CCCCCCCcHHHhhhhhhhhhcccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRML 80 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl 80 (134) |++.++||+|+||+||||||++|+++|++||||||++|+||+|.|++++++|+|+|||||||+|+|||+++|+++||||+ T Consensus 1 m~~~saP~~e~~vv~WLsp~~~va~~R~~~~PLPf~~V~Rv~G~d~~e~~tD~avvsv~~fg~~~eaA~d~ad~vHrRM~ 80 (134) T protein:vir:79 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRML 80 (134) T ss_pred CCcccCCChheeeeeecccchhceeccCCCCCCCeEEEEEeCCCCCccccccCceeEEEEeeCCHHHhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 81 ELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 81 ~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ||..++..+++++||+++++||++++++|+|++|+||+++|||+|||++|++|| T Consensus 81 kL~~~~~~~~~~~gG~~~~id~~~vl~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:79 81 ELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 999898899999999999999999999999999999999999999999999999 No 2 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=100.00 E-value=3e-61 Score=352.36 Aligned_cols=134 Identities=96% Similarity=1.420 Sum_probs=133.1 Q ss_pred CCCCCCCcHHHhhhhhhhhhcccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRML 80 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl 80 (134) |++.++||+|+||+||||||++|+++|++||||||++||||+|.|++++++|+|+|||||||+|+|||+++|+++||||+ T Consensus 1 m~~~saP~~e~~vv~WLsp~~~va~~R~~~~PLPf~~V~Rv~G~d~~e~~tD~avvsv~~fg~~~eaA~d~ad~vHrRM~ 80 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDVAVVSVHTFAASDEAAENEAELTHQRML 80 (134) T ss_pred CCcccCCChheeeeeecccchhceeccCCCCCCCeEEEEEeCCCCCcccccccceEEEEEeeCCHHHhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 81 ELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 81 ~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ||..++..+++++||+++++||++++++|+|++|+||+++|||+|||++|++|| T Consensus 81 kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:10 81 ELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 999898899999999999999999999999999999999999999999999999 No 3 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=100.00 E-value=3e-61 Score=352.36 Aligned_cols=134 Identities=96% Similarity=1.420 Sum_probs=133.1 Q ss_pred CCCCCCCcHHHhhhhhhhhhcccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRML 80 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl 80 (134) |++.++||+|+||+||||||++|+++|++||||||++||||+|.|++++++|+|+|||||||+|+|||+++|+++||||+ T Consensus 1 m~~~saP~~e~~vv~WLsp~~~va~~R~~~~PLPf~~V~Rv~G~d~~e~~tD~avvsv~~fg~~~eaA~d~ad~vHrRM~ 80 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDVAVVSVHTFAASDEAAENEAELTHQRML 80 (134) T ss_pred CCcccCCChheeeeeecccchhceeccCCCCCCCeEEEEEeCCCCCcccccccceEEEEEeeCCHHHhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 81 ELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 81 ~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ||..++..+++++||+++++||++++++|+|++|+||+++|||+|||++|++|| T Consensus 81 kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:10 81 ELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 999898899999999999999999999999999999999999999999999999 No 4 >protein:vir:8107 Length: 138 # NCBI annotation: gp11 # Family: family:all:2795 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817688;genbank:gi:29566119;genbank:GeneID:1259313 Probab=100.00 E-value=2.1e-57 Score=331.34 Aligned_cols=132 Identities=33% Similarity=0.647 Sum_probs=129.4 Q ss_pred CC---CCCCCcHHHhhhhhhhhhcccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHH Q lcl|NC_021334. 1 MA---TDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQ 77 (134) Q Consensus 1 ~~---~~~~P~~~~~lia~L~plg~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hr 77 (134) || -|+|||+|+||||||+|++++++||++|||||||+|+||+|+|+++++||+|+||||+||+|.+||+++|+++|| T Consensus 1 ~~~~~~~~aP~~e~~vv~WLspv~~va~~R~~d~pLPF~~V~Rv~G~d~~e~~tD~avv~~~~fg~g~eaA~d~a~~vHr 80 (138) T protein:vir:81 1 MADLHDQDAPDEEDFVVCWMQPVMRTAVERDIDAELPFCEVTRIDGADDPEAGTDNPVIQLDFYALGAEAAKAAAKQGHR 80 (138) T ss_pred CcccccCCCCchheeeeeeccchhccccccCCCCCCCeEEEEEeCCCCCccccccCceEEEEEeecCHHHHHHHHHhHHH Confidence 55 699999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 78 RMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 78 RMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ||++|.++ +++||+|||+.+++||++++++|+|++|.|| +|.||+|||++|++|+ T Consensus 81 RM~kL~~~-~~~vTl~dGt~~~ld~~~~~~~P~~~~y~dD-~ivRYtaRY~~g~~y~ 135 (138) T protein:vir:81 81 RMLFLFRN-FPTVTLSDGTLADLDFGETLIKPFRMAFEHD-QIVRYTARYQLGTSYV 135 (138) T ss_pred HHHHHhhc-ccceecCCCceEecchhhhhccccccccCCC-eeeEeeeeeeccceee Confidence 99999988 8999999999999999999999999999999 8999999999999999 No 5 >protein:vir:8331 Length: 150 # NCBI annotation: gp48 # Family: family:all:2795 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817899;genbank:gi:29566332;genbank:GeneID:1259527 Probab=100.00 E-value=1.8e-52 Score=304.21 Aligned_cols=125 Identities=27% Similarity=0.538 Sum_probs=121.1 Q ss_pred CCCCCCCcHHHhhhhhhhhhcccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecC---CHHHHHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLGKVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAA---SDEAAENEAELTHQ 77 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~---g~~aa~d~a~~~hr 77 (134) .--|||||+|+|++|||+|+++++++|++||||||++|+||+|.+++++++|+|+||||+||+ |.+||+++|+++|| T Consensus 20 ~~~~sapdae~~vv~wLsp~~rvA~~R~~~dplPf~lv~rv~G~d~pde~td~avvsv~~fg~~v~G~daA~~~ad~vH~ 99 (150) T protein:vir:83 20 ILNEGPADAETFVVKWLGEVYRAANTRRPGDPLPFLLIQQVAGKENLDESTADPVVQVDILCDKVDGEDAARDIKDRVHR 99 (150) T ss_pred cccCCCccHHHHHHHHhhHHhhhhhcccCCCCCCeEEEEecCCCCCcccccccceeeeeeccccccchhhhhhhhhhHHH Confidence 778999999999999999999999999999999999999999999999999999999999985 88999999999999 Q ss_pred HHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 78 RMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 78 RMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ||++|++ +++||| ++||++++++|+|++|+|| +++||+|||++|++|+ T Consensus 100 RM~~l~r-----~tl~~G---tld~~~v~~aP~~leY~dD-~vvrYt~RY~~G~~Y~ 147 (150) T protein:vir:83 100 RMLLLGR-----YLEMDG---TLDWMKVFESPRRLEYTND-KVIRYTARYQFGQTYE 147 (150) T ss_pred HHHHHhh-----hhccCC---cchhhhhhccccccccCCC-eEEEeeeeeeccCchh Confidence 9999993 899999 6999999999999999999 8889999999999999 No 6 >protein:vir:101655 Length: 134 # NCBI annotation: gp18 # Family: family:all:2795 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654773;genbank:gi:109302771;genbank:GeneID:4156089 Probab=100.00 E-value=1.5e-34 Score=206.09 Aligned_cols=129 Identities=28% Similarity=0.449 Sum_probs=121.8 Q ss_pred CCC--CCCCcHHHhhhhhhhhhcc-cCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHH Q lcl|NC_021334. 1 MAT--DSAPSIHRVLVAWLSPLGK-VSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQ 77 (134) Q Consensus 1 ~~~--~~~P~~~~~lia~L~plg~-v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hr 77 (134) |.. ..-||+|+.++++|+|+.+ |+.+|+.+.|.||++|+|++|+ .+.+.+|++++||++||++.++|.|.|+++|. T Consensus 1 mlplsrpnpnaeklvcaylspffenvashrwvdaptpfilvkrlpgg-gqgevsdcalmsikvfgkdvdeagdladevhe 79 (134) T protein:vir:10 1 MLPLSRPNPNAEKLVCAYLSPFFENVASHRWVDAPTPFILVKRLPGG-GQGEVSDCALMSIKVFGKDVDEAGDLADEVHE 79 (134) T ss_pred CCCCCCCCCchhhhhhhhhhhHHhhhhccccccCCCceEEEeeCCCC-CCccccceeeeeeeeeccccccccchHHHHHH Confidence 544 4457899999999999985 9999999999999999999999 79999999999999999999999999999999 Q ss_pred HHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeee--eC Q lcl|NC_021334. 78 RMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQ--YI 134 (134) Q Consensus 78 RMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~--y~ 134 (134) ||.+|.+++.+++ ||+.++++.+++...|+|+.|+||++.| |+||||++|| || T Consensus 80 rmrkwkpkdtvsy---gghsfginllevedapfwldygddteec-ytarywvhlrvdyv 134 (134) T protein:vir:10 80 RMRKWKPKDTVSY---GGHSFGINLLEVEDAPFWLDYGDDTEEC-YTARYWVHLRVDYV 134 (134) T ss_pred HHhccCccccccc---CchhhcceeEeecCCceeeecCCCccce-eeeeEEEEEEEecC Confidence 9999999998888 9999999999999999999999999998 9999999998 55 No 7 >protein:vir:7860 Length: 134 # NCBI annotation: gp17 # Family: family:all:2795 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817467;genbank:gi:29565896;genbank:GeneID:1259089 Probab=100.00 E-value=1.5e-34 Score=206.09 Aligned_cols=129 Identities=28% Similarity=0.449 Sum_probs=121.8 Q ss_pred CCC--CCCCcHHHhhhhhhhhhcc-cCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHH Q lcl|NC_021334. 1 MAT--DSAPSIHRVLVAWLSPLGK-VSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQ 77 (134) Q Consensus 1 ~~~--~~~P~~~~~lia~L~plg~-v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hr 77 (134) |.. ..-||+|+.++++|+|+.+ |+.+|+.+.|.||++|+|++|+ .+.+.+|++++||++||++.++|.|.|+++|. T Consensus 1 mlplsrpnpnaeklvcaylspffenvashrwvdaptpfilvkrlpgg-gqgevsdcalmsikvfgkdvdeagdladevhe 79 (134) T protein:vir:78 1 MLPLSRPNPNAEKLVCAYLSPFFENVASHRWVDAPTPFILVKRLPGG-GQGEVSDCALMSIKVFGKDVDEAGDLADEVHE 79 (134) T ss_pred CCCCCCCCCchhhhhhhhhhhHHhhhhccccccCCCceEEEeeCCCC-CCccccceeeeeeeeeccccccccchHHHHHH Confidence 544 4457899999999999985 9999999999999999999999 79999999999999999999999999999999 Q ss_pred HHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeee--eC Q lcl|NC_021334. 78 RMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQ--YI 134 (134) Q Consensus 78 RMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~--y~ 134 (134) ||.+|.+++.+++ ||+.++++.+++...|+|+.|+||++.| |+||||++|| || T Consensus 80 rmrkwkpkdtvsy---gghsfginllevedapfwldygddteec-ytarywvhlrvdyv 134 (134) T protein:vir:78 80 RMRKWKPKDTVSY---GGHSFGINLLEVEDAPFWLDYGDDTEEC-YTARYWVHLRVDYV 134 (134) T ss_pred HHhccCccccccc---CchhhcceeEeecCCceeeecCCCccce-eeeeEEEEEEEecC Confidence 9999999998888 9999999999999999999999999998 9999999998 55 No 8 >protein:vir:8433 Length: 140 # NCBI annotation: gp28 # Family: family:all:30886 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818329;genbank:gi:29566765;genbank:GeneID:1260029 Probab=99.82 E-value=5.1e-24 Score=148.30 Aligned_cols=130 Identities=24% Similarity=0.366 Sum_probs=100.9 Q ss_pred CCCCCCCcHHHhhhhhhhhhcc--cCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLGK--VSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQR 78 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg~--v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrR 78 (134) |..-.-|++.++||+||+.+.+ |.-||+|++||||+.|+||+|+|++++++|++.+.+++||+++|+.+|..+.+.|| T Consensus 1 mteyeyppgvkvlikwlsgiegvdvrherppnsplpfisvhrigggedencitdqgryafmvfgssqemvddtvrlvtrr 80 (140) T protein:vir:84 1 MTEYEYPPGVKVLIKWLSGIEGVDVRHERPPNSPLPFISVHRIGGGEDENCITDQGRYAFMVFGSSQEMVDDTVRLVTRR 80 (140) T ss_pred CCcccCCccHHHHHHHhcccccccccccCCCCCCCceeeeeeccCCCcccccccCCcEEEEEecCchhhhHHHHHHHHHH Confidence 7777789999999999999975 66699999999999999999999999999999999999999999999999999999 Q ss_pred HHhc-cCCCccEEEcCCCeEEEeeEE-eeccCccccccCCCCeEEE----EEEEEEeeeeeC Q lcl|NC_021334. 79 MLEL-VSDPLVEIPLGGGVVARIDYA-RVLMKPVLVEYDDDGHLVR----HVGRYEIGVQYI 134 (134) Q Consensus 79 Ml~L-~~~~~~~v~~~gG~va~~D~~-~~~~~P~~~~Y~~D~~i~R----y~ARY~~gL~y~ 134 (134) |++| +.++|++|++.|-.- -+|-. +-.+.|. .--|| .|-| ..--|.++.|.| T Consensus 81 mkklvgygsqekvtvgdksy-yadeahkreerpi--dnldd-aiprkffgtslmydvhmriv 138 (140) T protein:vir:84 81 MKKLVGYGSQEKVTVGDKSY-YADEAHKREERPI--DNLDD-AIPRKFFGTSLMYDVHMRIV 138 (140) T ss_pred HHHHhcCCCcceeeecccch-hcchhhhhhcccc--chhhh-hhhhhhhhhhhhheeeeeee Confidence 9999 889999999998332 12111 2222222 11122 2211 122366666666 No 9 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=97.61 E-value=1.7e-06 Score=52.17 Aligned_cols=107 Identities=16% Similarity=0.146 Sum_probs=77.2 Q ss_pred cHHHhhhhhhhhh-c-ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHHhccCC Q lcl|NC_021334. 8 SIHRVLVAWLSPL-G-KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRMLELVSD 85 (134) Q Consensus 8 ~~~~~lia~L~pl-g-~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl~L~~~ 85 (134) =.|..++.||..- | +|..+.+.+-|-+|..|.|++|.. +...+.|.+.|+.+|.|..+|...|.++...|..|.-. T Consensus 1 miE~~i~~~L~~~l~Vpv~~e~p~~~P~~FV~vErtGG~~--~~~~~~~~lAVq~w~~S~~eAa~La~~v~~~l~~l~~~ 78 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYILFEKTGSSK--SNHLLSSTFAFQSYAPSMYEAAKLNEQLKEVVERLIEL 78 (111) T ss_pred ChHHhHHHHHhhcCCceeEeecCCCCCCceEEEEecCCcc--ccccccceEEEEecchhHHHHHHHHHHHHHHHhhcccc Confidence 6899999999887 6 688899999999999999999987 55779999999999999999999999999999888432 Q ss_pred CccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 86 PLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 86 ~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ..| .+.-.|-| -.|. |+.=++ .||++-.... T Consensus 79 --~~I---~av~~~s~----------ynf~-d~~tk~--~RYQav~~i~ 109 (111) T protein:vir:16 79 --NEI---SNVSLNSD----------YNFT-DTETKE--YRYQAVFDIN 109 (111) T ss_pred --ccc---eeeecCCC----------CcCC-CCCCCC--ceEEEEEEEe Confidence 233 11111111 1222 122222 3444444433 No 10 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=97.54 E-value=6.2e-07 Score=54.62 Aligned_cols=109 Identities=15% Similarity=0.137 Sum_probs=76.2 Q ss_pred cHHHhhhhhhhhh-c-ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHHhccCC Q lcl|NC_021334. 8 SIHRVLVAWLSPL-G-KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRMLELVSD 85 (134) Q Consensus 8 ~~~~~lia~L~pl-g-~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl~L~~~ 85 (134) =.|..++.||..- | +|..+.+.+-|-+|..|.|++|.. +...+.|.+.|+.||.+..+|...|.++...|..|.- T Consensus 1 miE~~v~~~L~~~l~vpv~~e~p~~~p~~FV~vErtGG~~--~~~~~~~~lAVQ~~~~S~~eAa~La~~v~~~~~~l~~- 77 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYVLFEKTGSSK--SNHLLSSTFAFQSYAPSMYEAAKLNEQLKEVVERLIE- 77 (111) T ss_pred ChHHhHHHHHhhcCCcceEeecCCCCCCceEEEEecCCcc--ccccccceEEEEecchhHHHHHHHHHHHHHHHhhccc- Confidence 6899999999876 6 688899999999999999999987 6677999999999999999999999999999988842 Q ss_pred CccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeee Q lcl|NC_021334. 86 PLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQ 132 (134) Q Consensus 86 ~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~ 132 (134) +..| .+ + -+.+.---++.+. .=-||-+=|++--= T Consensus 78 -~~~i---~~----v----~~~s~Ynf~d~~t-k~~RYQav~~i~~~ 111 (111) T protein:vir:94 78 -LNEI---SN----V----SLNSDYNFTDTET-KEYRYQAVFDINHY 111 (111) T ss_pred -cccc---ce----e----ecCCCcccCCCcC-CCceEEEEEEEeeC Confidence 2233 11 1 1111111122211 11134443333211 No 11 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=97.51 E-value=6.2e-07 Score=54.60 Aligned_cols=109 Identities=14% Similarity=0.165 Sum_probs=74.3 Q ss_pred cHHHhhhhhhhhh-c-ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHHhccCC Q lcl|NC_021334. 8 SIHRVLVAWLSPL-G-KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRMLELVSD 85 (134) Q Consensus 8 ~~~~~lia~L~pl-g-~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl~L~~~ 85 (134) =.|..++.||..- + +|+.+=+.+-|-+|..|.|++|.. +...+.|.+.|+.+|.+..+|.+++.++...|..|.-- T Consensus 1 miE~~v~~~L~~~l~vpv~~~vp~~~P~~FV~vErtGG~~--~~~~~~p~laVq~wg~S~~~Aa~La~~v~~a~~~l~~~ 78 (111) T protein:vir:95 1 MIEIIINKYLDGHLDVPSFFEHEAEAPDSFVIIQKTGGKE--RNHSGSATFAFQSYAPTMQKAAELNVKVKSAVKGLIEL 78 (111) T ss_pred ChHHhHHHHhhhhcCeeEEeecCCCCCCceEEEEeeCCcc--ccccccceEEEEeccccHHHHHHHHHHHHHHHhhhhcc Confidence 6899999999654 5 577788888899999999999987 66669999999999999999999999999999888322 Q ss_pred CccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 86 PLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 86 ~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ..| + -++ +.++-..+..+. .--||-+=|++- |. T Consensus 79 --~~i---~-------~v~-~~s~ynf~d~~t-k~~RYQ~~~~i~--~~ 111 (111) T protein:vir:95 79 --DSI---C-------GVH-LNSDYNFTDTET-KQYRYQAVFDIN--YF 111 (111) T ss_pred --ccc---c-------ccc-cCCccccCCCCC-CCceEEEEEEEE--eC Confidence 222 1 111 112222233221 122333333332 11 No 12 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=97.15 E-value=2.6e-06 Score=51.20 Aligned_cols=107 Identities=17% Similarity=0.219 Sum_probs=78.2 Q ss_pred cHHHhhhhhhhhh-c-ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHHhccCC Q lcl|NC_021334. 8 SIHRVLVAWLSPL-G-KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAELTHQRMLELVSD 85 (134) Q Consensus 8 ~~~~~lia~L~pl-g-~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl~L~~~ 85 (134) =.|..++.+|..- | +|..|.+.+.|-+|..|.|.+|.. +...+.|.+.|+.+|.+..+|...+.++...|..|.- T Consensus 1 mIE~~i~~yL~~~l~vpv~~e~p~~~P~~FV~vEkTGG~~--~~~~~~a~lAvQsyg~S~~~AA~La~~V~~a~~~l~~- 77 (111) T protein:vir:97 1 MIEVIIKKYLDEHLDVPSFFEHQKDEPARFIILEKTSGAK--QNHLLSSTFAFQSYAESLYEAALLNDKVKQVIEQLDV- 77 (111) T ss_pred ChhhhhhHHHhhhcCceEEEeecCCCCCceEEEEeeCCcc--ccccccceEEEEecchhHHHHHHHHHHHHHHhhhhcc- Confidence 5788999999886 5 588899999999999999999965 8888999999999999999999999999999988853 Q ss_pred CccEEEcCCCeEEEeeEEeeccCccccccCCCCe--EEEEEEEEEeeee Q lcl|NC_021334. 86 PLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGH--LVRHVGRYEIGVQ 132 (134) Q Consensus 86 ~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~--i~Ry~ARY~~gL~ 132 (134) +++| .+.-.|-|| .|.| +. =-||-|=|++--= T Consensus 78 -l~~i---~~v~lns~Y----------nf~d-~~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 78 -LPQV---SGVHLNADY----------NFTD-TATKRYRYQAVFDINHY 111 (111) T ss_pred -Cccc---eeeeecccc----------cCCC-CCCCCccEEEEEEEeeC Confidence 2344 333333333 2322 22 2233333333211 No 13 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=97.01 E-value=1e-05 Score=47.99 Aligned_cols=107 Identities=20% Similarity=0.223 Sum_probs=70.7 Q ss_pred cHHHhhhhhhhhhcc---cCCcCCCCCCCceEEEEeecCCCC----CccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_021334. 8 SIHRVLVAWLSPLGK---VSTRRLSGDPLPHRVVRRVDGRDV----PEEGSDSAVVSVHTFAASDEAAENEAELTHQRML 80 (134) Q Consensus 8 ~~~~~lia~L~plg~---v~~eR~~~dPlPf~~V~rV~G~d~----~d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl 80 (134) =.+-++.+-|.-|++ -=.-+++++|+||.+.+||.|... -..+-...+|||+.||++.++|+..|+++.++.+ T Consensus 1 ~~~~~i~~~l~~~~g~~~~~~~aP~~~~~Py~vy~rvsg~p~~tL~G~~g~~~~r~QiD~yA~T~~eA~~La~~~~~~l~ 80 (114) T protein:vir:10 1 MSALTIRDAIGIVGGAKGYVSVASSAAQSPYYVVSRVSGTRDMALGGATGGKSGMFQIDVYAKTYTEADSLADQIIDRVE 80 (114) T ss_pred CceeeeehhhcccccccccCCCCCCCCCCceEEEEeccCcccccccCCCCcceEEEEEEeeeCCHHHHHHHHHHHHhhcc Confidence 123445566666652 223678999999999999999862 2234467899999999999999999998877743 Q ss_pred hccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 81 ELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 81 ~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) .-.+ . .... -.=....|..|+.+.| ...++-.-| T Consensus 81 ~~~~-----f--------~~~~----l~~~~d~ye~dT~l~R--vsld~si~f 114 (114) T protein:vir:10 81 STGM-----F--------SVGG----VSDLPDDYSSDTGVFR--VSLEISVQF 114 (114) T ss_pred cccC-----e--------eeec----cccCCCCCCcccCceE--EEEEEEEeC Confidence 3221 0 1101 1123467888888876 344555555 No 14 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=96.83 E-value=2.3e-05 Score=45.99 Aligned_cols=108 Identities=24% Similarity=0.318 Sum_probs=71.2 Q ss_pred CCCCCCCcHHHhhhhhhhhhc--ccCC-cCCCCCCCceEEEEeecCCCCCc----cEEeeeEEEEEeecCCHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLG--KVST-RRLSGDPLPHRVVRRVDGRDVPE----EGSDSAVVSVHTFAASDEAAENEAE 73 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg--~v~~-eR~~~dPlPf~~V~rV~G~d~~d----~~~d~a~vsvhtf~~g~~aa~d~a~ 73 (134) || -=++-+-|+|+. +|-+ .-|.+.|+||.+-++|.|..+.. -..+..+|||++||++.++|++.++ T Consensus 1 ~~-------~~~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA~t~~~A~~l~~ 73 (115) T protein:vir:14 1 MS-------VIVIRDALQGIGGAKGYLGVAPAKAPAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYAPTFTDADRLAD 73 (115) T ss_pred Ce-------eEeeehhhccccccccccccCCCCCCCCEEEEEeecCcccccccCCCCCcceEEEEEEeeCCHHHHHHHHH Confidence 32 123456788885 3433 45567799999999999975432 2346899999999999999999999 Q ss_pred HHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 74 LTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 74 ~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) .+..+|..+.. +..+ + .+ +.....|..|+.+.| ...++..=| T Consensus 74 ~v~~~~~~~~~--~~~~---~----~~-------~~~~d~ye~dt~lyR--~s~D~~vWf 115 (115) T protein:vir:14 74 LAVDRAMSVQD--RFSV---G----GV-------DELPDDYSEDTGLFR--ISLELSVEF 115 (115) T ss_pred HHHHHHhcCcc--ceee---e----ee-------cCCCCCCccccccee--eEEEEEEeC Confidence 99888777653 2122 1 11 112255766766643 445555555 No 15 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=96.77 E-value=2.2e-05 Score=46.17 Aligned_cols=107 Identities=21% Similarity=0.230 Sum_probs=72.9 Q ss_pred cHHHhhhhhhhhhcccC---CcCCCCCCCceEEEEeecCCCCC----ccEEeeeEEEEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_021334. 8 SIHRVLVAWLSPLGKVS---TRRLSGDPLPHRVVRRVDGRDVP----EEGSDSAVVSVHTFAASDEAAENEAELTHQRML 80 (134) Q Consensus 8 ~~~~~lia~L~plg~v~---~eR~~~dPlPf~~V~rV~G~d~~----d~~~d~a~vsvhtf~~g~~aa~d~a~~~hrRMl 80 (134) --.-++.+-|..|+... --=+.+.|.||.++|||.|.-+- +.+-.-..+||+.+|++..+|++.++++.-||. T Consensus 1 ~~~~vir~al~~i~~~~~~~~vAp~~~~~pyivy~rvsga~e~~L~G~ag~~~~~~QID~yA~T~~ea~~La~~v~d~~~ 80 (115) T protein:vir:80 1 MSVIVVRDALQGIGGAKGYLGVAPEKAPARYFVVTRVHGALDMALAGPTGGRSGSYQIDCYAPTFTDADRLADLAVDRAM 80 (115) T ss_pred CeeeeeechhhhccccccceeeccccCcCCeEEEeecCCCccccccCCCCCceeEEEEeeecCCHHHHHHHHHHHHHhhh Confidence 12235556676665211 12346789999999999997532 222345689999999999999999999999887 Q ss_pred hccCCCccEEE-cCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 81 ELVSDPLVEIP-LGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 81 ~L~~~~~~~v~-~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) .+-. +..+- +++ -+..|..|+.++| ++.++-.-| T Consensus 81 ~~~~--~~~vg~l~e---------------~pd~Ye~DT~l~R--vs~dv~i~f 115 (115) T protein:vir:80 81 SVQD--RFSVGGVDE---------------LPDDYSADTGLFR--VSLELSVEF 115 (115) T ss_pred CCcc--ccceecccC---------------CCcccccccceEE--EEEEEEEeC Confidence 6653 22331 333 2467888999977 566666666 No 16 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=96.63 E-value=3.6e-05 Score=44.97 Aligned_cols=108 Identities=24% Similarity=0.315 Sum_probs=70.4 Q ss_pred CCCCCCCcHHHhhhhhhhhhc--ccCC-cCCCCCCCceEEEEeecCCCCCc----cEEeeeEEEEEeecCCHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLG--KVST-RRLSGDPLPHRVVRRVDGRDVPE----EGSDSAVVSVHTFAASDEAAENEAE 73 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg--~v~~-eR~~~dPlPf~~V~rV~G~d~~d----~~~d~a~vsvhtf~~g~~aa~d~a~ 73 (134) || -=++-+-|+++. +|=+ .-|.+.|+||.+-++|.|..+.. -..+..++||++||++.++|++.++ T Consensus 1 ~~-------~~~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA~t~~~A~~l~~ 73 (115) T protein:vir:10 1 MS-------VIVIRDALQGIGGAKGYLGVAPEKAPAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYAPTFTDADRLAD 73 (115) T ss_pred Ce-------eEEeehhhcccCCceeecccCCCCCCCCEEEEEeecCccccccCCCCCCcceEEEEEEeeCCHHHHHHHHH Confidence 32 123446778874 3422 44567799999999999975432 2346899999999999999999999 Q ss_pred HHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 74 LTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 74 ~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) .+..++..+.. +..+ + .+ +.....|..|+.+.| ...++..=| T Consensus 74 ~v~~~~~~~~~--~~~~---~----~~-------~~~~d~ye~dt~lyR--~s~D~~vWf 115 (115) T protein:vir:10 74 LAVDRAMSVQD--RFSV---G----GV-------DELPDDYSEDTGLFR--ISLELSVEF 115 (115) T ss_pred HHHHHHhcCcc--ceeE---e----ee-------cCCCCCCccccccee--eEEEEEEeC Confidence 99888777653 2222 1 11 112255766766643 455555555 No 17 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=95.93 E-value=0.00032 Score=39.74 Aligned_cols=105 Identities=16% Similarity=0.110 Sum_probs=66.5 Q ss_pred CCCcHHHhhhh------hhhhh-cccC--CcCCCCCCCceEEEEeecCCCCC----ccEEeeeEEEEEeecCCHHHHHHH Q lcl|NC_021334. 5 SAPSIHRVLVA------WLSPL-GKVS--TRRLSGDPLPHRVVRRVDGRDVP----EEGSDSAVVSVHTFAASDEAAENE 71 (134) Q Consensus 5 ~~P~~~~~lia------~L~pl-g~v~--~eR~~~dPlPf~~V~rV~G~d~~----d~~~d~a~vsvhtf~~g~~aa~d~ 71 (134) |-|+.-+.+.+ -|+.- +++= ..-+.+.|+||.+-++|+|...- ....+..+|||++|+++.++|+.. T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~g~~~~~~~~vQIDvyA~t~~~A~~l 80 (121) T protein:vir:43 1 MYPPIFKVCSSSPAVTAILGASPLRMYQFGLAPQLVVKPYATWQTISGSPENYLWGRPDADGFTIQVDIFSATAAEARDA 80 (121) T ss_pred CChHHHHHHhhChhhhhhhcCCCceeeccCCCCCCCcCCeEEEEEecCcccceecCCCCcceeEEEEEeeeCCHHHHHHH Confidence 66666666554 22110 1121 13456789999999999986411 134567899999999999999999 Q ss_pred HHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 72 AELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 72 a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ++.+..-|-.++. +.. .....|..|+.+.| +.+..+++ T Consensus 81 ~~av~~Al~~~~~-----~~~----------------~~~~~ye~dT~lyR----~s~Dv~w~ 118 (121) T protein:vir:43 81 AKAIRDAIELSAY-----VVR----------------WGGESVDPDTKTYR----VSFDVDWI 118 (121) T ss_pred HHHHHHHhhhcCC-----ccc----------------CCCCCCccccccee----eeeEEEEe Confidence 9988766644332 111 11256877877754 33445555 No 18 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=95.72 E-value=0.00069 Score=37.90 Aligned_cols=107 Identities=23% Similarity=0.248 Sum_probs=68.1 Q ss_pred CcHHHhhhhhhhhhc--ccCCcCCCC-CCC-ceEEEEeecCCCCC-----ccEEeeeEEEEEeecCCHHHHHHHHHHHHH Q lcl|NC_021334. 7 PSIHRVLVAWLSPLG--KVSTRRLSG-DPL-PHRVVRRVDGRDVP-----EEGSDSAVVSVHTFAASDEAAENEAELTHQ 77 (134) Q Consensus 7 P~~~~~lia~L~plg--~v~~eR~~~-dPl-Pf~~V~rV~G~d~~-----d~~~d~a~vsvhtf~~g~~aa~d~a~~~hr 77 (134) =+.|+-|.+-|+|+- +|-+..-|+ .|+ ||.+-++|+|...- ....+..+|||++|+++..+|+..++.+.. T Consensus 1 Ms~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t~~~A~~l~~av~~ 80 (118) T protein:vir:81 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRSKQEAYLATVQVLR 80 (118) T ss_pred CchHHHHHHHHHhhcCCccccccCCCCCccCceEEEEecCCcccccccCCCCCccceeEEEEEeeCCHHHHHHHHHHHHH Confidence 245677888899993 476655555 475 99999999996311 122345789999999999999999999887 Q ss_pred HHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 78 RMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 78 RMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) -|-..... ...-.| ...|..|+.+.| +.-++..=|= T Consensus 81 al~~~~~~------------------~~~~~~-~d~ye~dt~l~r--~~~Df~iw~~ 116 (118) T protein:vir:81 81 LVSEAPDM------------------QVLSQP-IDDYVREIKLYG--SRVDVSMWYP 116 (118) T ss_pred Hhhhccce------------------eeccCC-ccccccccCcee--EEEEEEEEec Confidence 77544321 111111 235666666543 3333333333 No 19 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=95.59 E-value=0.00099 Score=37.07 Aligned_cols=117 Identities=19% Similarity=0.200 Sum_probs=81.4 Q ss_pred CCCCCCCcHHHhhhh----hhh-hhc--ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVA----WLS-PLG--KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAE 73 (134) Q Consensus 1 ~~~~~~P~~~~~lia----~L~-plg--~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~ 73 (134) |-.-..|+++.++.. ||. -.. .|+.+=+.+-|=.|..|.|.+|. .++.++|.+.+.|+.+|++.++|.+.|. T Consensus 1 ~~~i~~pda~~v~~~~lr~~l~a~~~~V~V~t~vP~~RP~rfV~VertgG~-~~~~~~Dr~~L~Vq~W~~t~~~A~~La~ 79 (131) T protein:vir:98 1 MPPILMPDAVAVIAGYLRAVLVARGVTVPVGSRVPSPRPARFVRIERIGGP-ANTVVTDRPRLDVHCWGSSEEDAHDLMQ 79 (131) T ss_pred CCCccCCchhHHHHHHHHHHHHhcCCceEecccCCCCCCceEEEEEecCCC-cCCccccceEEEEEecCCCHHHHHHHHH Confidence 887788999887665 552 222 36666666669999999999887 6788999999999999999999999999 Q ss_pred HHHHHHHhccCCCccEEEcCCCeEEEeeEEeecc-CccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 74 LTHQRMLELVSDPLVEIPLGGGVVARIDYARVLM-KPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 74 ~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~-~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) .+-..|+.+-.- . + -..+-++.+ +|.+.+=. |+. ..||.+....+ T Consensus 80 ~vr~~ll~~~~~--~-----g----~~~~~~~e~~gpy~~PD~-es~----~~Ryq~tv~l~ 125 (131) T protein:vir:98 80 LCRALLGAARGS--H-----G----DTVLARPATGGPQFLPDA-ETG----AARWAFTLDIT 125 (131) T ss_pred HHHHHHhhcccc--c-----c----hheeccccCCCCCcCCCC-CCC----CceeEEEEEEE Confidence 998888865321 0 0 111223333 45544432 233 45677776666 No 20 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=95.10 E-value=0.0016 Score=35.84 Aligned_cols=107 Identities=24% Similarity=0.306 Sum_probs=64.8 Q ss_pred CCCCCCCcHHHhhhhhhhhhc--ccCCcCCCC-CCC-ceEEEEeecCCCCC-c----cEEeeeEEEEEeecCCHHHHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPLG--KVSTRRLSG-DPL-PHRVVRRVDGRDVP-E----EGSDSAVVSVHTFAASDEAAENE 71 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~plg--~v~~eR~~~-dPl-Pf~~V~rV~G~d~~-d----~~~d~a~vsvhtf~~g~~aa~d~ 71 (134) |. .|+-|.+-|+++- +|-+..-|. .|+ ||.+-++|+|...- - ...+...|||++|+++..+|+.. T Consensus 1 Ms------~e~~l~a~L~~~~~~RVyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t~~~A~~l 74 (118) T protein:vir:10 1 MS------YGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRSKQEAYLA 74 (118) T ss_pred Cc------hHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCccceeEEEEEEeeCCHHHHHHH Confidence 54 4667778899984 466655544 585 99999999996311 1 11334689999999999999999 Q ss_pred HHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 72 AELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 72 a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ++.+..=|-.. ... .....|+ ..|..|+.+.| +.-++..=|= T Consensus 75 ~~av~~al~~~--~~~----------------~~~~~~~-d~ye~dt~l~r--~~~Df~vw~~ 116 (118) T protein:vir:10 75 TVQVLRLVSEA--NDM----------------QVLSQPI-DDYVREIKLYG--SRVDISMWYN 116 (118) T ss_pred HHHHHHHhhhc--ccc----------------eeccCCC-ccccccCCceE--EEEEEEEeee Confidence 99884433222 101 1111121 45666666543 3333333222 No 21 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=94.99 E-value=0.00076 Score=37.69 Aligned_cols=105 Identities=16% Similarity=0.134 Sum_probs=66.3 Q ss_pred CCCcHHHhhhhh---hhhhc----ccC--CcCCCCCCCceEEEEeecCCCC----CccEEeeeEEEEEeecCCHHHHHHH Q lcl|NC_021334. 5 SAPSIHRVLVAW---LSPLG----KVS--TRRLSGDPLPHRVVRRVDGRDV----PEEGSDSAVVSVHTFAASDEAAENE 71 (134) Q Consensus 5 ~~P~~~~~lia~---L~plg----~v~--~eR~~~dPlPf~~V~rV~G~d~----~d~~~d~a~vsvhtf~~g~~aa~d~ 71 (134) |-|+.-+++.+= .+=+| ++= ..-+.+.|+||.+-++|+|... .....+..+|||++++++.++|+.. T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~vQIDvyA~t~~~A~~l 80 (121) T protein:vir:18 1 MIAPIFSVCASSPEVTDLLGSNPVRIYPFGIQDDNVVYPYVVWQNITGSPENYIAQRPDADFFTLQVDAYADTVDEVIAV 80 (121) T ss_pred CchHHHHHHhcChhhhhhhcCCCceeeeccCCCCcCcCCeEEEEEecCcccceecCCCCcceeEEEEEeecCCHHHHHHH Confidence 555555555221 11111 221 2446678999999999998741 1135567899999999999999999 Q ss_pred HHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 72 AELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 72 a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) ++.+-.-|-.++.. ..+ ....|..|+.+. |+.+-.+|+ T Consensus 81 ~~avr~Ale~~~~~--~~~-------------------~~~~ye~dT~ly----R~s~Dv~~~ 118 (121) T protein:vir:18 81 ATALRDAIEPHAHI--TRW-------------------GGQERDPETKRY----RYSFDVDWI 118 (121) T ss_pred HHHHHHHhhhcCcc--cCC-------------------CCCCCcccccce----eeeeEEEEe Confidence 99887766543321 111 112477777775 455667777 No 22 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=94.83 E-value=0.0022 Score=35.17 Aligned_cols=107 Identities=22% Similarity=0.224 Sum_probs=65.8 Q ss_pred CcHHHhhhhhhhhhc--ccCCcCCCC-CCC-ceEEEEeecCCCCCc-----cEEeeeEEEEEeecCCHHHHHHHHHHHHH Q lcl|NC_021334. 7 PSIHRVLVAWLSPLG--KVSTRRLSG-DPL-PHRVVRRVDGRDVPE-----EGSDSAVVSVHTFAASDEAAENEAELTHQ 77 (134) Q Consensus 7 P~~~~~lia~L~plg--~v~~eR~~~-dPl-Pf~~V~rV~G~d~~d-----~~~d~a~vsvhtf~~g~~aa~d~a~~~hr 77 (134) =+.|+-|.+-|+++. +|-+.--|. .|+ ||.+-++|+|...-- ...+..+|||.+|+++..+|...++.+.+ T Consensus 1 M~~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~ldG~~~~~~~~rvQIdvyA~t~~~A~~l~~av~~ 80 (118) T protein:vir:97 1 MSYGRMLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWKEGGMPDKVNARVQVQIWSRSKQEAYLATVQVLR 80 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCCccceeEEEEEeeCCHHHHHHHHHHHHH Confidence 245677888899994 476655555 485 999999999963211 11334679999999999999999998854 Q ss_pred HHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 78 RMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 78 RMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) =|-... .+..+ ..| ...|..|+.+.| ++-++..=|= T Consensus 81 al~~~~--~~~~~----------------~~~-~~~ye~dt~lyr--~~~Df~iw~~ 116 (118) T protein:vir:97 81 IVSEAN--DMQVL----------------SQP-IDDYVRELKLYG--SRVDISMWYN 116 (118) T ss_pred Hhhccc--ccccc----------------cCC-cccccccCCceE--EEEEEEEEee Confidence 432221 11001 111 144777776644 3333333333 No 23 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=94.58 E-value=0.0028 Score=34.61 Aligned_cols=101 Identities=17% Similarity=0.199 Sum_probs=66.1 Q ss_pred CCCCCCCcHHHhhhhhhhhh--c----ccCCcC--CCCCCCceEEEEeecCC-----CCCccEEeeeEEEEEeecCCHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPL--G----KVSTRR--LSGDPLPHRVVRRVDGR-----DVPEEGSDSAVVSVHTFAASDEA 67 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~pl--g----~v~~eR--~~~dPlPf~~V~rV~G~-----d~~d~~~d~a~vsvhtf~~g~~a 67 (134) |- |.=+-+-|+|+ | .+.++. .++.++||.+-++|.|. ++++ .+...|||++||++.++ T Consensus 1 M~-------e~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~~~l~gp~--~~~~~vQIDvyA~t~~~ 71 (114) T protein:vir:93 1 MT-------EADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVMGGQA--ESSVSVQIDVYAGTVTQ 71 (114) T ss_pred Cc-------hHHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCcccccccCcc--ccceEEEEEeeeCCHHH Confidence 32 45566777776 3 255553 35679999999999884 2233 36789999999999999 Q ss_pred HHHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeee Q lcl|NC_021334. 68 AENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGV 131 (134) Q Consensus 68 a~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL 131 (134) |+..+..+-.-|-.+++ +...+ ...|..|+.+.|.+-=|.+-. T Consensus 72 A~~l~~~v~~Al~~~~~-----~~~~~----------------~~~ye~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 72 ARQIRQDAREAIMLLAP-----GSVSE----------------MQDYIPENRCYRATLEFQVTV 114 (114) T ss_pred HHHHHHHHHHHHhhcCc-----EeecC----------------CCcccccccceeeEEEEEEeC Confidence 99999888877766654 21111 134766666654333333222 No 24 >protein:vir:78124 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:29862 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294806;genbank:gi:149882827;genbank:GeneID:5309152 Probab=94.39 E-value=0.0027 Score=34.70 Aligned_cols=120 Identities=23% Similarity=0.279 Sum_probs=75.6 Q ss_pred CCCCCCcHHHhhhhhhhh------h-cccCCcCCCCC--CC--ceEEEEeecCCCCCccEEeeeEEEEEeecC---CHHH Q lcl|NC_021334. 2 ATDSAPSIHRVLVAWLSP------L-GKVSTRRLSGD--PL--PHRVVRRVDGRDVPEEGSDSAVVSVHTFAA---SDEA 67 (134) Q Consensus 2 ~~~~~P~~~~~lia~L~p------l-g~v~~eR~~~d--Pl--Pf~~V~rV~G~d~~d~~~d~a~vsvhtf~~---g~~a 67 (134) .---||+.|.|+++||-. + |.||++.+++- |+ |..+|+ =+|++..|..+-+--+-|.++|- ++.- T Consensus 1 ~~v~PPDlE~fl~~~LRa~i~~adVDgqvGnk~Pd~y~g~y~~PLvvVR-DDgG~~~d~~tFDRSiGvnVlgwtrqd~KP 79 (139) T protein:vir:78 1 MRVAPPDLEEWFTALLRAEVRAAGVDAEVGNKEPDNLRVPLRRPLIVVR-DDSGDRRDWTTFDRSVGFTVLAGTKQNDKP 79 (139) T ss_pred CccCCccHHHHHHHHHHhhccccCccccccCcCCCCccccccCCeEEEE-cCCCCcccceeeecccceeeeeccccCchh Confidence 334689999999999943 3 35777777765 55 655555 55666779999999999999993 3456 Q ss_pred HHHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 68 AENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 68 a~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) |++.|+.+-. -|..+++ +-+-|+.|+-++.=-| -.|-..+ +|-. +|||-+-..|+ T Consensus 80 c~dLArrVy~---~lt~hp~--~LiegSpi~aVv~dgC-nGPYpVs--dd~d----~aryYltveYs 134 (139) T protein:vir:78 80 ANDLARVVAS---IVHDHEL--PLIEGSPIAAVVFDGC-RGPYAVP--DTID----VARRYLTGQYV 134 (139) T ss_pred hHHHHHHHHH---HhccCcc--eeecCCceEEeecccC-CCCCCCC--cchh----heeeeeEEEEe Confidence 7777766543 2233343 3335666666643222 2333322 3322 67777888888 No 25 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=93.20 E-value=0.0078 Score=32.15 Aligned_cols=101 Identities=21% Similarity=0.249 Sum_probs=64.0 Q ss_pred CCCCCCCcHHHhhhhhhhhh--ccc----CC---cCCCCCCCceEEEEeecCCC-----CCccEEeeeEEEEEeecCCHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSPL--GKV----ST---RRLSGDPLPHRVVRRVDGRD-----VPEEGSDSAVVSVHTFAASDE 66 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~pl--g~v----~~---eR~~~dPlPf~~V~rV~G~d-----~~d~~~d~a~vsvhtf~~g~~ 66 (134) |- |.=|-+-|+|+ |+| .+ +-.|..++||.+-++|.|.. .+++ +...|||++|+++.+ T Consensus 1 M~-------e~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~~~L~G~~~--~~~~vQIDvyA~t~~ 71 (115) T protein:vir:19 1 MN-------EDNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSADVLCGQAE--SRVSVQVDVYSTSIA 71 (115) T ss_pred Cc-------hhHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCcccccccCCCc--cceEEEEEEeeCChH Confidence 32 45566777776 222 22 34566799999999998742 2332 678999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 67 AAENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 67 aa~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) +|++.++.+-.=|-.+.+ +... ....|..|+.+.|.+-=|.+. = T Consensus 72 ~A~~l~~~i~~Al~~~~p-----~~~~----------------~~~~ye~dt~lyR~s~d~~V~--~ 115 (115) T protein:vir:19 72 ESRSLRDLVLASLEPLTP-----TEVV----------------KIPGYEPDYRLYRATLDFKVT--P 115 (115) T ss_pred HHHHHHHHHHHHhhhcCC-----EEec----------------CCCCcccchhceeeEEEEEec--C Confidence 999888877766644432 1111 125677777775543333333 3 No 26 >protein:vir:1387 Length: 116 # NCBI annotation: Gp10 protein # Family: family:all:517 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612839;genbank:gi:20065973;genbank:GeneID:935788 Probab=90.42 E-value=0.0045 Score=33.47 Aligned_cols=106 Identities=9% Similarity=-0.006 Sum_probs=74.0 Q ss_pred CCCCcHHHhhhhhhhhhc-ccCCcCCCCC-CCceEEEEeecCCCCCc------cEEeeeEEEEEeecCCHHHHHHHHHHH Q lcl|NC_021334. 4 DSAPSIHRVLVAWLSPLG-KVSTRRLSGD-PLPHRVVRRVDGRDVPE------EGSDSAVVSVHTFAASDEAAENEAELT 75 (134) Q Consensus 4 ~~~P~~~~~lia~L~plg-~v~~eR~~~d-PlPf~~V~rV~G~d~~d------~~~d~a~vsvhtf~~g~~aa~d~a~~~ 75 (134) ---=+..+.+..-|.|++ +|.-....++ ..||.++.-.. +.+. +..-.-.+||++|+++...+...+.++ T Consensus 1 ~~~m~I~~~i~~~Lk~i~ipV~~~~y~~~~~~~~Itf~~y~--e~~~~yaDd~e~~t~~~iQVDI~sk~~~~~~~l~~~V 78 (116) T protein:vir:13 1 MEDFDIIALVYECLECLNVPVIEGWYDEELNKTHITVHEYL--EQDESFEDDEAREEEHNIQIDVWSKDSLEAFKLKKAI 78 (116) T ss_pred CCccchhHHHHHHHhhcCCeeeecccCCCCccceEEEEeee--cCCCcccCCeeeeEEEEEEEEEeecCCccHHHHHHHH Confidence 111267889999999998 6887766666 57988888773 3333 333456899999999999999999999 Q ss_pred HHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEE--eeee Q lcl|NC_021334. 76 HQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYE--IGVQ 132 (134) Q Consensus 76 hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~--~gL~ 132 (134) .+-|++.+..- . .. ...|..|+.+-++.-||. ..|+ T Consensus 79 ~~lMk~~GF~r---~---------------~~---~d~ye~dt~iyhk~~RF~y~~el~ 116 (116) T protein:vir:13 79 KKLLKKNNFYF---D---------------SS---EDFYETKTRIYHKGLRFSYISEIS 116 (116) T ss_pred HHHHHHcCCEe---e---------------ec---CCCccchhhhhhhhhhheeeeecC Confidence 99999998741 1 11 134667777766566652 1222 No 27 >protein:vir:99005 Length: 170 # NCBI annotation: gp34 # Family: family:all:32655 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655899;genbank:gi:109521471;genbank:GeneID:4157970 Probab=88.75 E-value=0.0084 Score=31.97 Aligned_cols=129 Identities=18% Similarity=0.153 Sum_probs=84.4 Q ss_pred CCCCCC----CcHHH----hhhhhhhhh---cccCCcCC--------CCC--CC----ceEEEEeecCCCCCccEEeeeE Q lcl|NC_021334. 1 MATDSA----PSIHR----VLVAWLSPL---GKVSTRRL--------SGD--PL----PHRVVRRVDGRDVPEEGSDSAV 55 (134) Q Consensus 1 ~~~~~~----P~~~~----~lia~L~pl---g~v~~eR~--------~~d--Pl----Pf~~V~rV~G~d~~d~~~d~a~ 55 (134) ||.--| ++++. .+..+++-+ +.++.==+ -+| |. |-..|.|-+|.-|-|...|.++ T Consensus 1 Ma~~lPDW~egda~l~v~dl~~q~~qkl~Pn~~v~~WipdDw~~~~~~~da~pt~~~~Ptl~~~R~~Gq~D~d~~~Da~~ 80 (170) T protein:vir:99 1 MADFLPDWWEGPEYLDVEDLFAQHFQKLLPNVRVCHWIQPDWYIPTGFVDATPTYGTEPTLRLWRQPGQRDDESTTDAPL 80 (170) T ss_pred CccccCCccCCcHHHHHHHHHHHHHHHhCCCceeEeecCcccccccccccccccccccceEEEEecCCccchhhccchhh Confidence 654322 33333 333333333 22332111 234 44 5599999999999999999999 Q ss_pred EEEEeecCCHHHHHHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCC-CCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 56 VSVHTFAASDEAAENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDD-DGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 56 vsvhtf~~g~~aa~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~-D~~i~Ry~ARY~~gL~y~ 134 (134) +++.+.+-|-+.+-+.-.-+|..|+.+.+++. +.- .+.+.+|-.+.---.|+-.+-.+ |..+ ..|=|++-.|+= T Consensus 81 lq~~vvt~Sr~DS~~l~~fvr~im~a~~~g~~--~~~-~~qvv~i~sv~e~~Gp~~iP~~~~D~r~--V~atyevtv~~~ 155 (170) T protein:vir:99 81 LQFAAVTRSHGDSIQLIEFVHTVMRALNNGHK--IKY-NGQLVGIKNVGLWLGPQTIPEGPIDEFF--VPVTYKFTVAGK 155 (170) T ss_pred hhhhhhccChHHHHHHHHHHHHHHHhhhcCCe--eee-CCceEEEEEeccccccccCCCCCccceE--eeeEEEEEeecc Confidence 99999999999999999999999999876643 332 44456655554333555554443 3333 379999999987 No 28 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=81.43 E-value=0.086 Score=26.43 Aligned_cols=123 Identities=11% Similarity=0.092 Sum_probs=70.0 Q ss_pred CCCCCCC-cHHHhhhhhhhh-------hcccCCcCCCCCCCceEEEE--eecCCCCCccEEeeeEEEEEeec-CCHHHHH Q lcl|NC_021334. 1 MATDSAP-SIHRVLVAWLSP-------LGKVSTRRLSGDPLPHRVVR--RVDGRDVPEEGSDSAVVSVHTFA-ASDEAAE 69 (134) Q Consensus 1 ~~~~~~P-~~~~~lia~L~p-------lg~v~~eR~~~dPlPf~~V~--rV~G~d~~d~~~d~a~vsvhtf~-~g~~aa~ 69 (134) |.=-+|- .--+.++++|.- ||++=-.-|.+.+.||.++- .+--+++....-..-.++||++. .|-.+|+ T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg~I~D~~P~~~~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~g~~ea~ 80 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVNQVTESPGKDDPYPYVVIGDQSSTPFETKSSFGENITMDFHVWGGTTRAEAQ 80 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhhhhhcCCCCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEECCChHHHH Confidence 6666553 345566777753 23222233345599998873 33222223334456677889987 4446677 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEee Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIG 130 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~g 130 (134) +++..+..- |... .+++++++++++.+.+............ --+.||.|+.+=. T Consensus 81 ~ia~av~~a---L~~~---~L~l~~~~lv~l~~~~~~~~rd~dg~~~-hg~l~fra~ve~~ 134 (134) T protein:vir:59 81 DISSRVLEA---LTYK---PLMFEGFTFVAKKLVLAQVITDTDGVTK-HGIIKVRFTINNN 134 (134) T ss_pred HHHHHHHHH---hcCC---CcccCCceEEEeEEeeeeEEecCCCceE-EEEEEEEEEEecC Confidence 777766444 3222 3788999999887765444333222211 1345677777777 No 29 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=74.87 E-value=0.15 Score=25.05 Aligned_cols=122 Identities=13% Similarity=0.056 Sum_probs=70.0 Q ss_pred CCCCCCCcHHHhhhhhhhh-------hcc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP-------LGK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p-------lg~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.-.-...--+.+.+.|.- +|+ +=-.-|.+.++||..+-...-.+... ..-..-.++||++. .|-.+| T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~~VyD~~P~~~~~Pyv~lG~~~~~~~~~~~~~g~~~~~~i~Vws~~~g~~ea 80 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGDRVFDVVQEDAVYPYIVVGESNVTNNESSTMMRETVGIVIHVYSQFATQYEA 80 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCCccccCCccCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 5444444455667777652 222 22234455699999884443333322 23456688999998 467888 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+..-+ . . .+.+++++++++.+.....-.. .|+...+-+-||++-.+-. T Consensus 81 ~~ia~av~~AL-~---~---~l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~~r~~v~~~ 134 (140) T protein:vir:96 81 KQIISAIGYVL-N---R---PIDIENYEFQFSRIDSQSVFPD-----IDRFTKHGTIRLLFKYRHI 134 (140) T ss_pred HHHHHHHHHHh-C---C---CccCCCCeEEEEEEeeeEEEec-----CCCceEEEEEEEEEEEEee Confidence 88888876653 2 2 3788999999875543322211 2344444445555444444 No 30 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=64.73 E-value=0.3 Score=23.49 Aligned_cols=118 Identities=13% Similarity=0.071 Sum_probs=62.4 Q ss_pred CCCCCCCcHHHhhhhhhh-------hhcc-cCCcCCCCCCCceEEEEeecCCCC--CccEEeeeEEEEEeecC--CHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLS-------PLGK-VSTRRLSGDPLPHRVVRRVDGRDV--PEEGSDSAVVSVHTFAA--SDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~-------plg~-v~~eR~~~dPlPf~~V~rV~G~d~--~d~~~d~a~vsvhtf~~--g~~aa 68 (134) |.-.-.-.--+.++++|. -||. +=-.-|.+.+.||..+-...-.+. .......-.++||+++. |-.+| T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~vyD~~P~~~~~PyV~lG~~~~~~~~t~~~~~~~~~lti~Vws~~~gr~ea 80 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCcccccCCccCCCCCEEEeccceeeecCCCcccceEEEEEEEEEEcCccHHHH Confidence 655322333567788773 2232 222333355999998743322222 22345566788999984 45788 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+.+-+ +. .+.+++++.+++......... +.|+...+ --+-++|. T Consensus 81 ~~ia~ai~~aL----~~---~l~l~~~~lv~l~~~~~~~~r-----d~d~~~~h----gvl~~ra~ 130 (145) T protein:vir:12 81 SQIIQFLGFVL----NN---EIEIDYYSFIKSRIDTQEVIT-----DIDQYTKH----GIIRLVFK 130 (145) T ss_pred HHHHHHHHHHh----cc---ccCCCCceEEEEEEeeEEEEe-----cCCCceEE----EEEEEEEE Confidence 88888774332 22 277888888776544322111 12333322 22333333 No 31 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=63.13 E-value=0.32 Score=23.28 Aligned_cols=122 Identities=12% Similarity=0.046 Sum_probs=72.0 Q ss_pred CCCCCCCcHHHhhhhhhhh-------hcc-cCCcCCCCCCCceEEEEeecCCCCCcc--EEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP-------LGK-VSTRRLSGDPLPHRVVRRVDGRDVPEE--GSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p-------lg~-v~~eR~~~dPlPf~~V~rV~G~d~~d~--~~d~a~vsvhtf~--~g~~aa 68 (134) |.-.....--+.+++.|.- +|+ +=-.-|.+.|+||.++-...-.+.+++ .-..-.++||++. .|..+| T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYSQFATQYEA 80 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 5544555566778888765 232 211233455999988866655554443 2345678888884 777899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+..-+ ..+ +++++++++++.+....--.. .|+...+-+-||++-.+.= T Consensus 81 k~ia~av~~AL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~t~hgvl~~ra~v~~~ 134 (141) T protein:vir:96 81 KLILSAIGYVL----NRP---IEIDNYEFQFSRIDSQAVFPD-----IDRFTKHGTIRLLFKYRHK 134 (141) T ss_pred HHHHHHHHHHh----ccc---ccCCCceEEEEEEeeeeeeec-----CCCceEEEEEEEEEEEEec Confidence 99998887764 222 678999998875543322111 2334433344444444433 No 32 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=63.13 E-value=0.32 Score=23.28 Aligned_cols=122 Identities=12% Similarity=0.046 Sum_probs=72.0 Q ss_pred CCCCCCCcHHHhhhhhhhh-------hcc-cCCcCCCCCCCceEEEEeecCCCCCcc--EEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP-------LGK-VSTRRLSGDPLPHRVVRRVDGRDVPEE--GSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p-------lg~-v~~eR~~~dPlPf~~V~rV~G~d~~d~--~~d~a~vsvhtf~--~g~~aa 68 (134) |.-.....--+.+++.|.- +|+ +=-.-|.+.|+||.++-...-.+.+++ .-..-.++||++. .|..+| T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYSQFATQYEA 80 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 5544555566778888765 232 211233455999988866655554443 2345678888884 777899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+..-+ ..+ +++++++++++.+....--.. .|+...+-+-||++-.+.= T Consensus 81 k~ia~av~~AL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~t~hgvl~~ra~v~~~ 134 (141) T protein:vir:94 81 KLILSAIGYVL----NRP---IEIDNYEFQFSRIDSQAVFPD-----IDRFTKHGTIRLLFKYRHK 134 (141) T ss_pred HHHHHHHHHHh----ccc---ccCCCceEEEEEEeeeeeeec-----CCCceEEEEEEEEEEEEec Confidence 99998887764 222 678999998875543322111 2334433344444444433 No 33 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=63.13 E-value=0.32 Score=23.28 Aligned_cols=122 Identities=12% Similarity=0.046 Sum_probs=72.0 Q ss_pred CCCCCCCcHHHhhhhhhhh-------hcc-cCCcCCCCCCCceEEEEeecCCCCCcc--EEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP-------LGK-VSTRRLSGDPLPHRVVRRVDGRDVPEE--GSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p-------lg~-v~~eR~~~dPlPf~~V~rV~G~d~~d~--~~d~a~vsvhtf~--~g~~aa 68 (134) |.-.....--+.+++.|.- +|+ +=-.-|.+.|+||.++-...-.+.+++ .-..-.++||++. .|..+| T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYSQFATQYEA 80 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 5544555566778888765 232 211233455999988866655554443 2345678888884 777899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+..-+ ..+ +++++++++++.+....--.. .|+...+-+-||++-.+.= T Consensus 81 k~ia~av~~AL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~t~hgvl~~ra~v~~~ 134 (141) T protein:vir:10 81 KLILSAIGYVL----NRP---IEIDNYEFQFSRIDSQAVFPD-----IDRFTKHGTIRLLFKYRHK 134 (141) T ss_pred HHHHHHHHHHh----ccc---ccCCCceEEEEEEeeeeeeec-----CCCceEEEEEEEEEEEEec Confidence 99998887764 222 678999998875543322111 2334433344444444433 No 34 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=57.46 E-value=0.43 Score=22.57 Aligned_cols=122 Identities=11% Similarity=0.062 Sum_probs=68.5 Q ss_pred CCCCCCCcHHHhhhhhhhh---h----cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP---L----GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p---l----g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.--....--+.++++|.- | |+ +=-.-|.+.|+||.++-...-.+... ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~a~~PYV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 6544455556677777732 2 21 22233445699999884443333332 34556788899996 577899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+..-+ ..+ +.+++++++++-+.+..-... .|+.. .||.||++=-. .=| T Consensus 81 k~ia~av~~aL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~~ra~ve~~~~~~~~ 140 (145) T protein:vir:95 81 SQIIQFLGFVL----NNE---IEIDYYSFIKSRIDTQEVITD-----IDRYTKHGIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----ccc---cCCCCCeEEEeEEeeeeEeec-----CCCceEEEEEEEEEEEEeccccccc Confidence 99998887654 222 788999999876654332221 22222 23333332110 001 No 35 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=55.91 E-value=0.47 Score=22.39 Aligned_cols=126 Identities=11% Similarity=0.102 Sum_probs=67.9 Q ss_pred CCCCCCCcHHHhhhhhhhh---hcc-cCC---cCCC-CCCCceEEEEeecCCCC--CccEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP---LGK-VST---RRLS-GDPLPHRVVRRVDGRDV--PEEGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p---lg~-v~~---eR~~-~dPlPf~~V~rV~G~d~--~d~~~d~a~vsvhtf~--~g~~aa 68 (134) |--...+.--+.+.+.|.- |.. ++. .+.| +.|.||..+-...-.+. ....-..-.++||++. .|..+| T Consensus 1 ~~msa~~aLq~Ai~~~L~ad~~l~alvggrVyD~~P~~~~~PYV~lG~~~~~~~~~~~~~g~~~~~tl~Vws~~~g~~ea 80 (140) T protein:vir:96 1 MWVTAEPLLYNKIMNNLIENPITDKLVGGRVFDCVQKDVVYPYIVVGESNVTESERSPGMREIIAITFHVYSQYENGAEA 80 (140) T ss_pred CccchhHHHHHHHHHHhccChhHHhhcCcccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 4433344566777777762 211 221 3444 44899997733332322 2234456678889995 788999 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL--~y~ 134 (134) +++++.+..-+ . + .+++++++.+++.+.....-.-..-... --+.||.+|++-.- .=| T Consensus 81 ~~ia~ai~~aL-~-~-----~l~l~~~~lv~l~~~~~~~~rd~dg~t~-hgvl~~ra~ve~~~~~~~~ 140 (140) T protein:vir:96 81 RELLKYLNYAC-R-L-----NINFKDYELEWIKKDNSQVFTDIDQYTK-HGVLRLLYKVRHKTLQERV 140 (140) T ss_pred HHHHHHHHHHh-c-C-----CccCCCceEEEEEEeeeEEeecCCCceE-EEEEEEEEEEeecccccCC Confidence 99999887666 2 3 2788999998886664433222111110 02234444433211 001 No 36 >protein:vir:9415 Length: 126 # NCBI annotation: phi PVL orf 12-like protein # Family: family:all:517 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803393;genbank:gi:29028705;genbank:GeneID:1258143 Probab=54.27 E-value=0.51 Score=22.20 Aligned_cols=109 Identities=12% Similarity=0.054 Sum_probs=73.9 Q ss_pred CC----CCCCCcHHHhhhhhhhhhc-ccCCcCCCCCCCceEEEEeecCCCCCc------cEEeeeEEEEEe-ecCCHHHH Q lcl|NC_021334. 1 MA----TDSAPSIHRVLVAWLSPLG-KVSTRRLSGDPLPHRVVRRVDGRDVPE------EGSDSAVVSVHT-FAASDEAA 68 (134) Q Consensus 1 ~~----~~~~P~~~~~lia~L~plg-~v~~eR~~~dPlPf~~V~rV~G~d~~d------~~~d~a~vsvht-f~~g~~aa 68 (134) |- -=+-|=..+.+..-|+|++ +|+-+.-.|..-||..+.-. .+.++ +.+-.-.+||++ +-+++- T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey--~~~~~~yaDD~e~~t~~~iQVDIw~sk~d~-- 76 (126) T protein:vir:94 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPL--PFNPDTYADDNEISREYHYQIDVWWSQDEP-- 76 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeecCCCceEEEEEee--cCCCCcccccceeeeEEEEEEEEeecCCCH-- Confidence 11 0112237788888899998 78888888889999998887 33333 344456799999 555543 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) .+.+.++.+-|++.+.. |... ...|..|+.+-+++=||+-.+-=. T Consensus 77 ~~l~~~V~~lMk~~GF~------------------r~~~---~dlYE~DtklyHk~~RF~~~~~~~ 121 (126) T protein:vir:94 77 NEQAEKIVELLKVINFQ------------------CYYR---EPLYESDVMSFRHIIRAKGSILSM 121 (126) T ss_pred HHHHHHHHHHHHHcCCe------------------eeec---CCCccchhhhheeeeeeeeeecce Confidence 34788899999998764 1111 246888998988888886544333 No 37 >protein:vir:98343 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918935;genbank:gi:119443697;genbank:GeneID:4594505 Probab=54.27 E-value=0.51 Score=22.20 Aligned_cols=109 Identities=12% Similarity=0.054 Sum_probs=73.9 Q ss_pred CC----CCCCCcHHHhhhhhhhhhc-ccCCcCCCCCCCceEEEEeecCCCCCc------cEEeeeEEEEEe-ecCCHHHH Q lcl|NC_021334. 1 MA----TDSAPSIHRVLVAWLSPLG-KVSTRRLSGDPLPHRVVRRVDGRDVPE------EGSDSAVVSVHT-FAASDEAA 68 (134) Q Consensus 1 ~~----~~~~P~~~~~lia~L~plg-~v~~eR~~~dPlPf~~V~rV~G~d~~d------~~~d~a~vsvht-f~~g~~aa 68 (134) |- -=+-|=..+.+..-|+|++ +|+-+.-.|..-||..+.-. .+.++ +.+-.-.+||++ +-+++- T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey--~~~~~~yaDD~e~~t~~~iQVDIw~sk~d~-- 76 (126) T protein:vir:98 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPL--PFNPDTYADDNEISREYHYQIDVWWSQDEP-- 76 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeecCCCceEEEEEee--cCCCCcccccceeeeEEEEEEEEeecCCCH-- Confidence 11 0112237788888899998 78888888889999998887 33333 344456799999 555543 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) .+.+.++.+-|++.+.. |... ...|..|+.+-+++=||+-.+-=. T Consensus 77 ~~l~~~V~~lMk~~GF~------------------r~~~---~dlYE~DtklyHk~~RF~~~~~~~ 121 (126) T protein:vir:98 77 NEQAEKIVELLKVINFQ------------------CYYR---EPLYESDVMSFRHIIRAKGSILSM 121 (126) T ss_pred HHHHHHHHHHHHHcCCe------------------eeec---CCCccchhhhheeeeeeeeeecce Confidence 34788899999998764 1111 246888998988888886544333 No 38 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=53.63 E-value=0.52 Score=22.12 Aligned_cols=122 Identities=11% Similarity=0.059 Sum_probs=67.5 Q ss_pred CCCCCCCcHHHhhhhhhh------hh-cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLS------PL-GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~------pl-g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.--....--+.++++|. .+ |+ +=-.-|.+.++||.++-...-.+.+. ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 654444445566777762 22 22 22234455699999884443333332 34556788999996 577899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+-.-+ .. .+.+++++++++-+.+..--.. .|+.. .||.||++=.- .=| T Consensus 81 k~ia~av~~aL----~~---~l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:95 81 SQIIQFLGFVL----NN---EIEIDYYSFIKSRIDTQEVITD-----IDQYTKHGVIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----cc---ccCCCCCeEEEeEEeeeeEeec-----CCCceEEEEEEEEEEEEeccccccc Confidence 99988877654 22 2788999999876654332221 22222 23333332110 001 No 39 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=53.60 E-value=0.52 Score=22.12 Aligned_cols=122 Identities=11% Similarity=0.059 Sum_probs=67.5 Q ss_pred CCCCCCCcHHHhhhhhhh------hh-cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLS------PL-GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~------pl-g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.--....--+.++++|. .+ |+ +=-.-|.+.++||.++-...-.+.+. ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 654444445566777762 22 22 22234455699999884443333332 34556788999996 577899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+-.-+ .. .+.+++++++++-+.+..--.. .|+.. .||.||++=.- .=| T Consensus 81 k~ia~av~~aL----~~---~l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:94 81 SQIIQFLGFVL----NN---EIEIDYYSFIKSRIDTQEVITD-----IDQYTKHGIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----cc---ccCCCCCeEEEeEEeeeeEeec-----CCCceEEEEEEEEEEEEeccccccc Confidence 99988877654 22 2788999999876654332221 22222 23333332110 001 No 40 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=47.80 E-value=0.69 Score=21.47 Aligned_cols=122 Identities=11% Similarity=0.062 Sum_probs=67.2 Q ss_pred CCCCCCCcHHHhhhhhhhh---h----cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP---L----GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p---l----g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.--....--+.++++|.- | |+ +=-.-|.+.++||.++-...-.+... ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 6544555556677777732 2 21 22233445699999884443333332 24556778889987 466889 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+..-+ ..+ +.+++++++++.+.+..-..- .|+.. .||.||++=-. .=| T Consensus 81 k~ia~av~~aL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:93 81 SQIIQFLGFVL----NNE---IEIDYYSFIKSRIDTQEVITD-----IDQYTKHGIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----ccc---cCCCCCeEEEeEEeeeeEeec-----CCcceEEEEEEEEEEEEeccccccc Confidence 99888885543 222 788999998875554322221 12222 23333332110 001 No 41 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=47.80 E-value=0.69 Score=21.47 Aligned_cols=122 Identities=11% Similarity=0.062 Sum_probs=67.2 Q ss_pred CCCCCCCcHHHhhhhhhhh---h----cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP---L----GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p---l----g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.--....--+.++++|.- | |+ +=-.-|.+.++||.++-...-.+... ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 6544555556677777732 2 21 22233445699999884443333332 24556778889987 466889 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+..-+ ..+ +.+++++++++.+.+..-..- .|+.. .||.||++=-. .=| T Consensus 81 k~ia~av~~aL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:94 81 SQIIQFLGFVL----NNE---IEIDYYSFIKSRIDTQEVITD-----IDQYTKHGIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----ccc---cCCCCCeEEEeEEeeeeEeec-----CCcceEEEEEEEEEEEEeccccccc Confidence 99888885543 222 788999998875554322221 12222 23333332110 001 No 42 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=47.80 E-value=0.69 Score=21.47 Aligned_cols=122 Identities=11% Similarity=0.062 Sum_probs=67.2 Q ss_pred CCCCCCCcHHHhhhhhhhh---h----cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP---L----GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p---l----g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.--....--+.++++|.- | |+ +=-.-|.+.++||.++-...-.+... ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 6544555556677777732 2 21 22233445699999884443333332 24556778889987 466889 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+..-+ ..+ +.+++++++++.+.+..-..- .|+.. .||.||++=-. .=| T Consensus 81 k~ia~av~~aL----~~~---l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:97 81 SQIIQFLGFVL----NNE---IEIDYYSFIKSRIDTQEVITD-----IDQYTKHGIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----ccc---cCCCCCeEEEeEEeeeeEeec-----CCcceEEEEEEEEEEEEeccccccc Confidence 99888885543 222 788999998875554322221 12222 23333332110 001 No 43 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=46.57 E-value=0.73 Score=21.33 Aligned_cols=118 Identities=13% Similarity=0.045 Sum_probs=66.2 Q ss_pred CCCCCCCcHHHhhhhhhhh-------hcc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP-------LGK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p-------lg~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.---...--+.++++|.- ||+ +=-.-|.+.+.||.++-...-.+.+. ..-..-.++||++. .|..+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMFEDVGVTLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 5433333344566667642 222 32244455699999884443333333 33456788899996 678899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+.+-+ . + .+.+++++++++-+.+..-... .|+... +--+-++|+ T Consensus 81 ~~ia~av~~aL-~-a-----~l~l~~~~lv~l~~~~~~~~rd-----~dg~~~----hgvl~~ra~ 130 (145) T protein:vir:10 81 SQIIQYLGFVL-N-S-----EIEINNYSFIKSRIDTQEVITD-----IDQYTK----HGIIRLIFK 130 (145) T ss_pred HHHHHHHHHHh-C-C-----CcCCCCCeEEEEEEeeeeEeec-----CCCceE----EEEEEEEEE Confidence 99988887653 2 2 2788999998886654322221 122221 222333333 No 44 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=46.54 E-value=0.73 Score=21.33 Aligned_cols=118 Identities=13% Similarity=0.045 Sum_probs=66.2 Q ss_pred CCCCCCCcHHHhhhhhhhh-------hcc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP-------LGK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p-------lg~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.---...--+.++++|.- ||+ +=-.-|.+.+.||.++-...-.+.+. ..-..-.++||++. .|..+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMFEDVGVTLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 5433333344566666642 222 32244455699999884443333333 33456788899996 678899 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +++++.+.+-+ . + .+.+++++++++-+.+..-... .|+... +--+-++|+ T Consensus 81 ~~ia~av~~aL-~-a-----~l~l~~~~lv~l~~~~~~~~rd-----~dg~~~----hgvl~~ra~ 130 (145) T protein:vir:10 81 SQIIQYLGFVL-N-S-----EIEINNYSFIKSRIDTQEVITD-----IDQYTK----HGIIRLIFK 130 (145) T ss_pred HHHHHHHHHHh-C-C-----CcCCCCCeEEEEEEeeeeEeec-----CCCceE----EEEEEEEEE Confidence 99988887653 2 2 2788999998886654322221 122221 222333333 No 45 >protein:vir:1274 Length: 162 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690766;genbank:gi:22855006;genbank:GeneID:955217 Probab=45.53 E-value=0.77 Score=21.22 Aligned_cols=109 Identities=17% Similarity=0.122 Sum_probs=68.0 Q ss_pred CCCCCCCcHHHhhhhhhhh------h-cc-cCC-cCCCCCCCceEEEEeecCCCCCccEE------eeeEEEEEeecCCH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP------L-GK-VST-RRLSGDPLPHRVVRRVDGRDVPEEGS------DSAVVSVHTFAASD 65 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p------l-g~-v~~-eR~~~dPlPf~~V~rV~G~d~~d~~~------d~a~vsvhtf~~g~ 65 (134) -|-|++=|..+-|.+-|.- + ++ +-. .-..+...||.+..-+ .+.+..+. -...|||++|.++. T Consensus 33 ~~~~~~mn~~k~v~q~L~n~~~L~~l~~~~i~~l~~~~~~~~p~Itf~e~--~~~p~~yADD~e~ss~~~iQIDIwsk~s 110 (162) T protein:vir:12 33 SADQMTYSPKIELVSTLNSSAFLKGLTSGGIHNLVANDVSAFPRVVFSEI--QDADADFADNEVYSFEVRYQISIFTQAS 110 (162) T ss_pred chhhhhhhHHHHHHHHhcChhHHHhhCCCceEEEeecCCCCceEEEEEee--cCCCCcccccceeeEEEEEEEEEeecCC Confidence 5667777777777766632 1 11 211 2235668899999988 44554444 45689999999876 Q ss_pred H--HHHHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEE----Eeeee Q lcl|NC_021334. 66 E--AAENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRY----EIGVQ 132 (134) Q Consensus 66 ~--aa~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY----~~gL~ 132 (134) + ...+.+.++.+-|+.++..- ... -..|..|+.+-+..-|| ...++ T Consensus 111 t~~d~~~l~~~I~~lMk~~GF~R------------------~s~---~d~YE~DTklyHK~~RF~~~y~~E~~ 162 (162) T protein:vir:12 111 TRGKETAIASEIDRLMREIGYSR------------------YDS---QDLYETDTKVFHKARRYKKTYYQEVN 162 (162) T ss_pred cchhHHHHHHHHHHHHHHcCCEe------------------ecC---CCCCCChhhhhhhhheeccceeeecC Confidence 5 44578999999999988741 111 13566666655544444 33334 No 46 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=41.79 E-value=0.91 Score=20.80 Aligned_cols=122 Identities=11% Similarity=0.074 Sum_probs=67.5 Q ss_pred CCCCCCCcHHHhhhhhhhh---h----cc-cCCcCCCCCCCceEEEEeecCCCCCc--cEEeeeEEEEEeec--CCHHHH Q lcl|NC_021334. 1 MATDSAPSIHRVLVAWLSP---L----GK-VSTRRLSGDPLPHRVVRRVDGRDVPE--EGSDSAVVSVHTFA--ASDEAA 68 (134) Q Consensus 1 ~~~~~~P~~~~~lia~L~p---l----g~-v~~eR~~~dPlPf~~V~rV~G~d~~d--~~~d~a~vsvhtf~--~g~~aa 68 (134) |.---...--+.++++|.- | |+ +=-.-|.+.++||.++-...-.+.+. ..-..-.++||++. .|-.+| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV~D~~P~~a~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCceecCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 6544444455667777752 2 21 22234455699998884443333332 23567788899998 466889 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeE----EEEEEEEEeee--eeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHL----VRHVGRYEIGV--QYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i----~Ry~ARY~~gL--~y~ 134 (134) +++++.+..-+ .. .+.+++++++++-+.+..-..- .|+.. .||.||++=.- .=| T Consensus 81 k~ia~av~~aL----~~---~l~l~~~~lv~l~~~~~~~~rd-----~dg~~~hgvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:97 81 SQIIQFLGFVL----NN---EIEIDYYSFIKSRIDTQEVITD-----IDQYTKHGIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHh----cc---ccCCCCCeEEEeEEeeeeEeec-----CCCceEEEEEEEEEEEecCceeccc Confidence 99888877654 22 2788999998885553332221 12222 23333332110 001 No 47 >protein:vir:81093 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429879;genbank:gi:156603932;genbank:GeneID:5525313 Probab=25.76 E-value=2 Score=18.91 Aligned_cols=109 Identities=11% Similarity=0.050 Sum_probs=72.5 Q ss_pred CCC----CCCCcHHHhhhhhhhhhc-ccCCcCCCCCCCceEEEEeecCCCCCcc------EEeeeEEEEEe-ecCCHHHH Q lcl|NC_021334. 1 MAT----DSAPSIHRVLVAWLSPLG-KVSTRRLSGDPLPHRVVRRVDGRDVPEE------GSDSAVVSVHT-FAASDEAA 68 (134) Q Consensus 1 ~~~----~~~P~~~~~lia~L~plg-~v~~eR~~~dPlPf~~V~rV~G~d~~d~------~~d~a~vsvht-f~~g~~aa 68 (134) |.- =+-|-..+.+..-|.|++ +|+-..-.|..=||...--. .+.++. .+-.-.+||++ |-++. - T Consensus 1 ~~~~~~~i~n~~I~~li~~~Lk~~nvPV~~~~y~~~~ktyItf~ey--~~~~~~yADd~e~~t~~~iQIDIW~sk~~--~ 76 (126) T protein:vir:81 1 MINVTELIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPL--PFNPDTYADDNEISREYHYQIDVWWSQDE--P 76 (126) T ss_pred CcchHHhhhhhHHHhhhhhceeeccceeccccccCCCCcEEEEEee--cCCCCccccCeeeeeEEEEEEEEeeCCCC--H Confidence 211 012236777888899998 68887778888899988877 333333 33345789999 56655 4 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) .+.+.++.+-|++.+.. |.. ....|..|+.+-+++-||+-.+-=. T Consensus 77 ~~l~~~V~~~Mk~~GF~------------------R~~---~~d~YE~DtklyHk~~Rf~~~~~~~ 121 (126) T protein:vir:81 77 NEQAEKIVELLKVINFQ------------------CYY---REPLYESDVMSFRHIIRAKGSILSM 121 (126) T ss_pred HHHHHHHHHHHHHcCCe------------------eee---cCCCccchhhhhheeeeeeeeccce Confidence 56888899999998764 111 1246888888888887875443322 No 48 >protein:vir:80001 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430007;genbank:gi:156604062;genbank:GeneID:5525461 Probab=25.76 E-value=2 Score=18.91 Aligned_cols=109 Identities=11% Similarity=0.050 Sum_probs=72.5 Q ss_pred CCC----CCCCcHHHhhhhhhhhhc-ccCCcCCCCCCCceEEEEeecCCCCCcc------EEeeeEEEEEe-ecCCHHHH Q lcl|NC_021334. 1 MAT----DSAPSIHRVLVAWLSPLG-KVSTRRLSGDPLPHRVVRRVDGRDVPEE------GSDSAVVSVHT-FAASDEAA 68 (134) Q Consensus 1 ~~~----~~~P~~~~~lia~L~plg-~v~~eR~~~dPlPf~~V~rV~G~d~~d~------~~d~a~vsvht-f~~g~~aa 68 (134) |.- =+-|-..+.+..-|.|++ +|+-..-.|..=||...--. .+.++. .+-.-.+||++ |-++. - T Consensus 1 ~~~~~~~i~n~~I~~li~~~Lk~~nvPV~~~~y~~~~ktyItf~ey--~~~~~~yADd~e~~t~~~iQIDIW~sk~~--~ 76 (126) T protein:vir:80 1 MINVTELIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPL--PFNPDTYADDNEISREYHYQIDVWWSQDE--P 76 (126) T ss_pred CcchHHhhhhhHHHhhhhhceeeccceeccccccCCCCcEEEEEee--cCCCCccccCeeeeeEEEEEEEEeeCCCC--H Confidence 211 012236777888899998 68887778888899988877 333333 33345789999 56655 4 Q ss_pred HHHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 69 ENEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 69 ~d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) .+.+.++.+-|++.+.. |.. ....|..|+.+-+++-||+-.+-=. T Consensus 77 ~~l~~~V~~~Mk~~GF~------------------R~~---~~d~YE~DtklyHk~~Rf~~~~~~~ 121 (126) T protein:vir:80 77 NEQAEKIVELLKVINFQ------------------CYY---REPLYESDVMSFRHIIRAKGSILSM 121 (126) T ss_pred HHHHHHHHHHHHHcCCe------------------eee---cCCCccchhhhhheeeeeeeeccce Confidence 56888899999998764 111 1246888888888887875443322 No 49 >protein:vir:105008 Length: 119 # NCBI annotation: conserved structural protein # Family: family:all:517 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459973;genbank:gi:85701388;genbank:GeneID:3882149 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=103 Identities=13% Similarity=0.173 Sum_probs=65.5 Q ss_pred CCcHHHhhhhhhhhhc---c-cCC------cCCCCCCCceEEEEeecCCCCCc------cEEeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 6 APSIHRVLVAWLSPLG---K-VST------RRLSGDPLPHRVVRRVDGRDVPE------EGSDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 6 ~P~~~~~lia~L~plg---~-v~~------eR~~~dPlPf~~V~rV~G~d~~d------~~~d~a~vsvhtf~~g~~aa~ 69 (134) ==|.-..+..-|.+.- . ++. ..+-+...||.++.-+ .+.+. +.+....+||++|.++. .. T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~--~~~p~~~add~e~~~~~~~QIDVwsk~~--~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFEL--DNRPDGFADNQEIESEILFQVDVWAKSS--TT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEec--CCCCCcccCCceeeeEEEEEEEEeeCCC--HH Confidence 1255566666665421 1 222 2233446899999877 34444 34456789999999874 57 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) +.+.++.+.|+.++..- ... -..|..|+.+-+..-||+-.... T Consensus 77 ~i~~~I~~~m~~~gf~r------------------~~~---~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKRIGFSR------------------YAV---ADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHHcCCee------------------ecc---CCCcCChhhhheeeeeeeeeeeC Confidence 88999999999987641 111 13677777777777777655444 No 50 >protein:vir:102888 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338141;genbank:gi:77020213;genbank:GeneID:3703797 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=103 Identities=13% Similarity=0.173 Sum_probs=65.5 Q ss_pred CCcHHHhhhhhhhhhc---c-cCC------cCCCCCCCceEEEEeecCCCCCc------cEEeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 6 APSIHRVLVAWLSPLG---K-VST------RRLSGDPLPHRVVRRVDGRDVPE------EGSDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 6 ~P~~~~~lia~L~plg---~-v~~------eR~~~dPlPf~~V~rV~G~d~~d------~~~d~a~vsvhtf~~g~~aa~ 69 (134) ==|.-..+..-|.+.- . ++. ..+-+...||.++.-+ .+.+. +.+....+||++|.++. .. T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~--~~~p~~~add~e~~~~~~~QIDVwsk~~--~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFEL--DNRPDGFADNQEIESEILFQVDVWAKSS--TT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEec--CCCCCcccCCceeeeEEEEEEEEeeCCC--HH Confidence 1255566666665421 1 222 2233446899999877 34444 34456789999999874 57 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) +.+.++.+.|+.++..- ... -..|..|+.+-+..-||+-.... T Consensus 77 ~i~~~I~~~m~~~gf~r------------------~~~---~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKRIGFSR------------------YAV---ADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHHcCCee------------------ecc---CCCcCChhhhheeeeeeeeeeeC Confidence 88999999999987641 111 13677777777777777655444 No 51 >protein:vir:107581 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338192;genbank:gi:77020160;genbank:GeneID:3703712 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=103 Identities=13% Similarity=0.173 Sum_probs=65.5 Q ss_pred CCcHHHhhhhhhhhhc---c-cCC------cCCCCCCCceEEEEeecCCCCCc------cEEeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 6 APSIHRVLVAWLSPLG---K-VST------RRLSGDPLPHRVVRRVDGRDVPE------EGSDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 6 ~P~~~~~lia~L~plg---~-v~~------eR~~~dPlPf~~V~rV~G~d~~d------~~~d~a~vsvhtf~~g~~aa~ 69 (134) ==|.-..+..-|.+.- . ++. ..+-+...||.++.-+ .+.+. +.+....+||++|.++. .. T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~--~~~p~~~add~e~~~~~~~QIDVwsk~~--~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFEL--DNRPDGFADNQEIESEILFQVDVWAKSS--TT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEec--CCCCCcccCCceeeeEEEEEEEEeeCCC--HH Confidence 1255566666665421 1 222 2233446899999877 34444 34456789999999874 57 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) +.+.++.+.|+.++..- ... -..|..|+.+-+..-||+-.... T Consensus 77 ~i~~~I~~~m~~~gf~r------------------~~~---~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKRIGFSR------------------YAV---ADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHHcCCee------------------ecc---CCCcCChhhhheeeeeeeeeeeC Confidence 88999999999987641 111 13677777777777777655444 No 52 >protein:vir:102086 Length: 119 # NCBI annotation: structural protein # Family: family:all:517 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512319;genbank:gi:89152488;genbank:GeneID:3953079 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=103 Identities=13% Similarity=0.173 Sum_probs=65.5 Q ss_pred CCcHHHhhhhhhhhhc---c-cCC------cCCCCCCCceEEEEeecCCCCCc------cEEeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 6 APSIHRVLVAWLSPLG---K-VST------RRLSGDPLPHRVVRRVDGRDVPE------EGSDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 6 ~P~~~~~lia~L~plg---~-v~~------eR~~~dPlPf~~V~rV~G~d~~d------~~~d~a~vsvhtf~~g~~aa~ 69 (134) ==|.-..+..-|.+.- . ++. ..+-+...||.++.-+ .+.+. +.+....+||++|.++. .. T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~--~~~p~~~add~e~~~~~~~QIDVwsk~~--~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFEL--DNRPDGFADNQEIESEILFQVDVWAKSS--TT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEec--CCCCCcccCCceeeeEEEEEEEEeeCCC--HH Confidence 1255566666665421 1 222 2233446899999877 34444 34456789999999874 57 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeee Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQY 133 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y 133 (134) +.+.++.+.|+.++..- ... -..|..|+.+-+..-||+-.... T Consensus 77 ~i~~~I~~~m~~~gf~r------------------~~~---~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKRIGFSR------------------YAV---ADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHHcCCee------------------ecc---CCCcCChhhhheeeeeeeeeeeC Confidence 88999999999987641 111 13677777777777777655444 No 53 >protein:vir:2508 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:11707 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569749;genbank:gi:18496899;genbank:GeneID:932297 Probab=22.57 E-value=2.4 Score=18.47 Aligned_cols=125 Identities=18% Similarity=0.175 Sum_probs=76.1 Q ss_pred CCCCCC--C--cHHHhhhhhhhhhc---ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHHHHH Q lcl|NC_021334. 1 MATDSA--P--SIHRVLVAWLSPLG---KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAENEAE 73 (134) Q Consensus 1 ~~~~~~--P--~~~~~lia~L~plg---~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d~a~ 73 (134) |.--.+ | ..-..|..-|.-=| .|+..-+-|.|.-|.+++|++-. -+......++-+.+|-++--+++.-|+ T Consensus 1 ~~lvP~v~P~~A~RaYLl~~L~~Rg~~L~Vga~pPeG~Pt~Yallsr~~s~--r~~~l~~~LIRvRVyd~D~~~~~r~A~ 78 (139) T protein:vir:25 1 MTLVPSVGPLVAARAYLLDELAARANPLPVGANPPEGEPSSYALLSRPGSD--RDVFLGHFLIRVRVFDSDVVRLERNAD 78 (139) T ss_pred CcccCccchHHHHHHHHHHHHhhcCCcccccccCCCCCcceeEEEecCCCC--ceeehhheeEEEEeecchhhhhccchh Confidence 321111 1 12233333344334 26666688889999999999554 478888999999999999999999999 Q ss_pred HHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEE---EEEEEEE-eeeeeC Q lcl|NC_021334. 74 LTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLV---RHVGRYE-IGVQYI 134 (134) Q Consensus 74 ~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~---Ry~ARY~-~gL~y~ 134 (134) ..|+-++.-+ +.+|..|+|.+--- .-.-+-.|. +++|+ .+- -.+|=|| +||+=- T Consensus 79 LLHa~LlgA~---h~kvv~PeG~vWiT-Ga~H~~GPa--~~DD~-~v~LfG~q~aVFWTi~LkP~ 136 (139) T protein:vir:25 79 LLHALLCGAN---HRKVHTPEGDVWIT-GAAHHYGPA--DLDDP-DVPLFGMQAAVFWTIGLKPA 136 (139) T ss_pred HHHHHHhhhh---cceeeccCCceEee-ccccccCCc--ccCCC-ccccccchhheeeeeccccc Confidence 9999776665 45899999876211 223344454 66543 321 1112111 222222 No 54 >protein:vir:2689 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075508;genbank:gi:12719437;genbank:GeneID:920159 Probab=20.86 E-value=2.3 Score=18.58 Aligned_cols=109 Identities=12% Similarity=0.133 Sum_probs=59.3 Q ss_pred CCCCCcHHHhhhh--hhhhh--cccCC-cC--CCCCCCceEEEEeecCCCCCccE------EeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 3 TDSAPSIHRVLVA--WLSPL--GKVST-RR--LSGDPLPHRVVRRVDGRDVPEEG------SDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 3 ~~~~P~~~~~lia--~L~pl--g~v~~-eR--~~~dPlPf~~V~rV~G~d~~d~~------~d~a~vsvhtf~~g~~aa~ 69 (134) .+.=.-.-++|.. -|.-+ +.+-. +. -.++-.||.++.-++.. |..+ +-...+||++++++...|+ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~--p~~yadn~~l~~~~~~QIDVws~~~~~~~ 78 (131) T protein:vir:26 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDL--PSDFMSDKYLSEEYLIQIDVESSNNQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCC--CccccCCceeeeEEEEEEEEEecCccchH Confidence 1110011111100 00000 11111 11 12345699999999643 3333 3346899999999999999 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +...++.+=|++++.. + + ++ -..+|..|+.+-|+.-||+ |++|= T Consensus 79 ~i~~~I~~~M~~~gf~-q--~--s~---------------~~d~Yd~dtk~y~~arRYr-g~~~~ 122 (131) T protein:vir:26 79 DITKRIRYLLYQQNLI-Q--A--SS---------------QLDAYFEETKRYVMSRRYQ-GIPKN 122 (131) T ss_pred HHHHHHHHHHHHcCce-e--c--cC---------------CCCccchhhHHhhhhhhcc-ccchh Confidence 9999999999998875 2 1 11 1345766655555555664 33332 No 55 >protein:vir:9364 Length: 131 # NCBI annotation: SLT orf 131b-like protein # Family: family:all:508 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803342;genbank:gi:29028653;genbank:GeneID:1258094 Probab=20.86 E-value=2.3 Score=18.58 Aligned_cols=109 Identities=12% Similarity=0.133 Sum_probs=59.3 Q ss_pred CCCCCcHHHhhhh--hhhhh--cccCC-cC--CCCCCCceEEEEeecCCCCCccE------EeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 3 TDSAPSIHRVLVA--WLSPL--GKVST-RR--LSGDPLPHRVVRRVDGRDVPEEG------SDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 3 ~~~~P~~~~~lia--~L~pl--g~v~~-eR--~~~dPlPf~~V~rV~G~d~~d~~------~d~a~vsvhtf~~g~~aa~ 69 (134) .+.=.-.-++|.. -|.-+ +.+-. +. -.++-.||.++.-++.. |..+ +-...+||++++++...|+ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~--p~~yadn~~l~~~~~~QIDVws~~~~~~~ 78 (131) T protein:vir:93 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDL--PSDFMSDKYLSEEYLIQIDVESSNNQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCC--CccccCCceeeeEEEEEEEEEecCccchH Confidence 1110011111100 00000 11111 11 12345699999999643 3333 3346899999999999999 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +...++.+=|++++.. + + ++ -..+|..|+.+-|+.-||+ |++|= T Consensus 79 ~i~~~I~~~M~~~gf~-q--~--s~---------------~~d~Yd~dtk~y~~arRYr-g~~~~ 122 (131) T protein:vir:93 79 DITKRIRYLLYQQNLI-Q--A--SS---------------QLDAYFEETKRYVMSRRYQ-GIPKN 122 (131) T ss_pred HHHHHHHHHHHHcCce-e--c--cC---------------CCCccchhhHHhhhhhhcc-ccchh Confidence 9999999999998875 2 1 11 1345766655555555664 33332 No 56 >protein:vir:96972 Length: 131 # NCBI annotation: ORF035 # Family: family:all:508 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239865;genbank:gi:66395543;genbank:GeneID:5133005 Probab=20.86 E-value=2.3 Score=18.58 Aligned_cols=109 Identities=12% Similarity=0.133 Sum_probs=59.3 Q ss_pred CCCCCcHHHhhhh--hhhhh--cccCC-cC--CCCCCCceEEEEeecCCCCCccE------EeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 3 TDSAPSIHRVLVA--WLSPL--GKVST-RR--LSGDPLPHRVVRRVDGRDVPEEG------SDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 3 ~~~~P~~~~~lia--~L~pl--g~v~~-eR--~~~dPlPf~~V~rV~G~d~~d~~------~d~a~vsvhtf~~g~~aa~ 69 (134) .+.=.-.-++|.. -|.-+ +.+-. +. -.++-.||.++.-++.. |..+ +-...+||++++++...|+ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~--p~~yadn~~l~~~~~~QIDVws~~~~~~~ 78 (131) T protein:vir:96 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDL--PSDFMSDKYLSEEYLIQIDVESSNNQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCC--CccccCCceeeeEEEEEEEEEecCccchH Confidence 1110011111100 00000 11111 11 12345699999999643 3333 3346899999999999999 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +...++.+=|++++.. + + ++ -..+|..|+.+-|+.-||+ |++|= T Consensus 79 ~i~~~I~~~M~~~gf~-q--~--s~---------------~~d~Yd~dtk~y~~arRYr-g~~~~ 122 (131) T protein:vir:96 79 DITKRIRYLLYQQNLI-Q--A--SS---------------QLDAYFEETKRYVMSRRYQ-GIPKN 122 (131) T ss_pred HHHHHHHHHHHHcCce-e--c--cC---------------CCCccchhhHHhhhhhhcc-ccchh Confidence 9999999999998875 2 1 11 1345766655555555664 33332 No 57 >protein:vir:78648 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429947;genbank:gi:156604001;genbank:GeneID:5525394 Probab=20.86 E-value=2.3 Score=18.58 Aligned_cols=109 Identities=12% Similarity=0.133 Sum_probs=59.3 Q ss_pred CCCCCcHHHhhhh--hhhhh--cccCC-cC--CCCCCCceEEEEeecCCCCCccE------EeeeEEEEEeecCCHHHHH Q lcl|NC_021334. 3 TDSAPSIHRVLVA--WLSPL--GKVST-RR--LSGDPLPHRVVRRVDGRDVPEEG------SDSAVVSVHTFAASDEAAE 69 (134) Q Consensus 3 ~~~~P~~~~~lia--~L~pl--g~v~~-eR--~~~dPlPf~~V~rV~G~d~~d~~------~d~a~vsvhtf~~g~~aa~ 69 (134) .+.=.-.-++|.. -|.-+ +.+-. +. -.++-.||.++.-++.. |..+ +-...+||++++++...|+ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~--p~~yadn~~l~~~~~~QIDVws~~~~~~~ 78 (131) T protein:vir:78 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDL--PSDFMSDKYLSEEYLIQIDVESSNNQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCC--CccccCCceeeeEEEEEEEEEecCccchH Confidence 1110011111100 00000 11111 11 12345699999999643 3333 3346899999999999999 Q ss_pred HHHHHHHHHHHhccCCCccEEEcCCCeEEEeeEEeeccCccccccCCCCeEEEEEEEEEeeeeeC Q lcl|NC_021334. 70 NEAELTHQRMLELVSDPLVEIPLGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) Q Consensus 70 d~a~~~hrRMl~L~~~~~~~v~~~gG~va~~D~~~~~~~P~~~~Y~~D~~i~Ry~ARY~~gL~y~ 134 (134) +...++.+=|++++.. + + ++ -..+|..|+.+-|+.-||+ |++|= T Consensus 79 ~i~~~I~~~M~~~gf~-q--~--s~---------------~~d~Yd~dtk~y~~arRYr-g~~~~ 122 (131) T protein:vir:78 79 DITKRIRYLLYQQNLI-Q--A--SS---------------QLDAYFEETKRYVMSRRYQ-GIPKN 122 (131) T ss_pred HHHHHHHHHHHHcCce-e--c--cC---------------CCCccchhhHHhhhhhhcc-ccchh Confidence 9999999999998875 2 1 11 1345766655555555664 33332 No 58 >protein:vir:99925 Length: 147 # NCBI annotation: gp12 # Family: family:all:11707 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655529;genbank:gi:109392299;genbank:GeneID:4157094 Probab=20.73 E-value=2.7 Score=18.20 Aligned_cols=125 Identities=17% Similarity=0.233 Sum_probs=79.6 Q ss_pred CCCCCCCcH-------HHhhhhhhhhhc---ccCCcCCCCCCCceEEEEeecCCCCCccEEeeeEEEEEeecCCHHHHHH Q lcl|NC_021334. 1 MATDSAPSI-------HRVLVAWLSPLG---KVSTRRLSGDPLPHRVVRRVDGRDVPEEGSDSAVVSVHTFAASDEAAEN 70 (134) Q Consensus 1 ~~~~~~P~~-------~~~lia~L~plg---~v~~eR~~~dPlPf~~V~rV~G~d~~d~~~d~a~vsvhtf~~g~~aa~d 70 (134) .|.|+--+. -..|..-|.-=| .|+..=+-|.|.-|.+++|++-.- +......++-+.+|-++--+++. T Consensus 2 ~~~~~~~P~v~P~~A~RaYLl~~L~~Rg~~L~VgatpPeG~Pt~Yallsr~~s~r--~~~l~~~LIRvRVyd~D~~~~~r 79 (147) T protein:vir:99 2 TAPEMVGPTMEPAIACRAYLMRRLDDRGIDLSVGATPPDGKPTRYVLVNQVDSRR--RGPVADYLIRTRVYNADAYECGQ 79 (147) T ss_pred CCccccCCcchhHHHHHHHHHHHHhhcCCcccccccCCCCCCcceEEEecCCCCc--eeehhheeEEEEeecchhhhhcc Confidence 233332222 223333333334 367777888999999999995544 78888999999999999999999 Q ss_pred HHHHHHHHHHhccCCCccEEEcCC-CeEEEeeEEee-ccCccccccCCCCeEE---EEEEEE-EeeeeeC Q lcl|NC_021334. 71 EAELTHQRMLELVSDPLVEIPLGG-GVVARIDYARV-LMKPVLVEYDDDGHLV---RHVGRY-EIGVQYI 134 (134) Q Consensus 71 ~a~~~hrRMl~L~~~~~~~v~~~g-G~va~~D~~~~-~~~P~~~~Y~~D~~i~---Ry~ARY-~~gL~y~ 134 (134) -|+..|+-++.-++ .+|..|| |.+ ||+= .----+.++.||+.+- -.+|=| -+||+=+ T Consensus 80 ~A~LLHa~LlgA~h---~kvv~Pd~G~v----WiTGa~H~~GPad~~DD~~v~LfG~q~aVFWTi~LkP~ 142 (147) T protein:vir:99 80 HATLLHAALLGAAQ---ARIVFPDVGQL----WVTGTEHVSGPSDITDDDTTTLFGQAISVFWTVALKPI 142 (147) T ss_pred chhHHHHHHhhhhc---ceeeecCCCce----EeecccccccccccCCCCCccccchhhheeeeeeeeec Confidence 99999997766664 5799998 654 4421 1112234565554332 122333 3667666 Done!