Query lcl|NC_014229.1_cdsid_YP_003714740.1 [gene=33] [protein=gp33] [protein_id=YP_003714740.1] [location=8110..8547] Match_columns 145 No_of_seqs 104 out of 205 Neff 7.2 Searched_HMMs 1612 Date Thu Nov 7 12:39:55 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95961 Length: 145 100.0 1.1E-44 6.6E-48 261.7 18.0 133 12-145 1-135 (145) 2 protein:vir:94794 Length: 145 100.0 1.1E-44 6.6E-48 261.6 18.0 133 12-145 1-135 (145) 3 protein:vir:107096 Length: 145 100.0 1.2E-44 7.3E-48 261.4 18.0 133 12-145 1-135 (145) 4 protein:vir:105337 Length: 145 100.0 1.2E-44 7.3E-48 261.4 18.0 133 12-145 1-135 (145) 5 protein:vir:95111 Length: 145 100.0 1.3E-44 7.9E-48 261.2 18.0 133 12-145 1-135 (145) 6 protein:vir:94488 Length: 145 100.0 1.5E-44 9E-48 260.9 18.0 133 12-145 1-135 (145) 7 protein:vir:97421 Length: 145 100.0 1.5E-44 9E-48 260.9 18.0 133 12-145 1-135 (145) 8 protein:vir:93736 Length: 145 100.0 1.5E-44 9E-48 260.9 18.0 133 12-145 1-135 (145) 9 protein:vir:97325 Length: 145 100.0 1.5E-44 9.1E-48 260.9 17.7 133 12-145 1-135 (145) 10 protein:vir:5979 Length: 134 # 100.0 2.7E-44 1.7E-47 259.4 17.8 134 10-144 1-134 (134) 11 protein:vir:94096 Length: 141 100.0 2.8E-44 1.8E-47 259.3 17.8 133 12-145 1-135 (141) 12 protein:vir:96260 Length: 141 100.0 2.8E-44 1.8E-47 259.3 17.8 133 12-145 1-135 (141) 13 protein:vir:105892 Length: 141 100.0 2.8E-44 1.8E-47 259.3 17.8 133 12-145 1-135 (141) 14 protein:vir:96894 Length: 140 100.0 1.4E-43 8.4E-47 255.6 18.0 133 12-145 1-135 (140) 15 protein:vir:96125 Length: 140 100.0 2.2E-43 1.3E-46 254.5 18.0 134 10-145 1-135 (140) 16 protein:vir:1244 Length: 145 # 100.0 8.5E-43 5.3E-46 251.2 18.0 134 11-145 1-135 (145) 17 protein:vir:2741 Length: 128 # 99.9 5.8E-27 3.6E-30 164.4 15.2 124 12-142 1-128 (128) 18 protein:vir:4907 Length: 128 # 99.9 2.2E-26 1.4E-29 161.3 14.8 124 12-142 1-128 (128) 19 protein:vir:96485 Length: 128 99.9 2.4E-26 1.5E-29 161.0 15.0 124 12-142 1-128 (128) 20 protein:vir:744 Length: 129 # 99.9 5.6E-26 3.5E-29 159.0 14.4 125 11-142 1-129 (129) 21 protein:vir:3972 Length: 129 # 99.9 7.1E-26 4.4E-29 158.5 14.5 125 11-142 1-129 (129) 22 protein:vir:3618 Length: 129 # 99.9 1.3E-25 7.9E-29 157.1 14.8 124 11-142 1-129 (129) 23 protein:vir:99537 Length: 125 99.8 9.3E-23 5.8E-26 141.4 14.3 122 13-142 1-125 (125) 24 protein:vir:106593 Length: 131 99.6 5.7E-18 3.5E-21 115.1 13.9 125 10-142 1-131 (131) 25 protein:vir:95765 Length: 127 99.6 9.1E-18 5.6E-21 114.0 13.9 122 13-145 1-126 (127) 26 protein:vir:9313 Length: 127 # 99.5 2.9E-16 1.8E-19 105.7 13.5 122 13-142 1-127 (127) 27 protein:vir:96355 Length: 127 99.5 3.3E-16 2E-19 105.5 13.4 122 13-142 1-127 (127) 28 protein:vir:78854 Length: 127 99.5 3.3E-16 2E-19 105.5 13.4 122 13-142 1-127 (127) 29 protein:vir:103918 Length: 127 99.5 3.6E-16 2.2E-19 105.2 13.4 122 13-142 1-127 (127) 30 protein:vir:97143 Length: 127 99.5 3.6E-16 2.2E-19 105.2 13.4 122 13-142 1-127 (127) 31 protein:vir:99769 Length: 127 99.5 3.6E-16 2.2E-19 105.2 13.4 122 13-142 1-127 (127) 32 protein:vir:96217 Length: 127 99.5 3.6E-16 2.2E-19 105.2 13.4 122 13-142 1-127 (127) 33 protein:vir:4348 Length: 121 # 99.3 1.1E-14 6.9E-18 97.1 9.8 113 19-143 1-121 (121) 34 protein:vir:1892 Length: 121 # 99.2 5.9E-14 3.7E-17 93.1 9.6 113 12-143 1-121 (121) 35 protein:vir:9880 Length: 136 # 99.2 5.1E-13 3.1E-16 88.0 11.8 129 11-145 1-132 (136) 36 protein:vir:102888 Length: 119 99.0 1.7E-12 1E-15 85.2 8.5 114 12-143 1-119 (119) 37 protein:vir:102086 Length: 119 99.0 1.7E-12 1E-15 85.2 8.5 114 12-143 1-119 (119) 38 protein:vir:105008 Length: 119 99.0 1.7E-12 1E-15 85.2 8.5 114 12-143 1-119 (119) 39 protein:vir:107581 Length: 119 99.0 1.7E-12 1E-15 85.2 8.5 114 12-143 1-119 (119) 40 protein:vir:100116 Length: 115 98.8 1.8E-11 1.1E-14 79.5 8.0 111 24-141 1-115 (115) 41 protein:vir:10368 Length: 118 98.8 5.7E-11 3.5E-14 76.8 10.4 112 12-144 1-118 (118) 42 protein:vir:97070 Length: 118 98.8 5.5E-11 3.4E-14 76.9 10.2 112 12-144 1-118 (118) 43 protein:vir:1438 Length: 115 # 98.8 2.5E-11 1.6E-14 78.7 8.2 111 24-141 1-115 (115) 44 protein:vir:81066 Length: 118 98.8 6.4E-11 4E-14 76.5 10.1 111 12-145 1-117 (118) 45 protein:vir:93602 Length: 114 98.7 2E-10 1.2E-13 73.8 10.2 106 12-141 1-114 (114) 46 protein:vir:195 Length: 115 # 98.7 1.8E-10 1.1E-13 74.0 8.9 106 12-141 1-115 (115) 47 protein:vir:100242 Length: 114 98.6 2.3E-10 1.4E-13 73.5 8.2 110 12-141 1-114 (114) 48 protein:vir:1274 Length: 162 # 98.5 9.9E-10 6.1E-13 70.0 7.7 134 1-144 21-162 (162) 49 protein:vir:2689 Length: 131 # 98.0 1.5E-08 9E-12 63.6 4.3 117 17-145 1-123 (131) 50 protein:vir:78648 Length: 131 98.0 1.5E-08 9E-12 63.6 4.3 117 17-145 1-123 (131) 51 protein:vir:96972 Length: 131 98.0 1.5E-08 9E-12 63.6 4.3 117 17-145 1-123 (131) 52 protein:vir:9364 Length: 131 # 98.0 1.5E-08 9E-12 63.6 4.3 117 17-145 1-123 (131) 53 protein:vir:80371 Length: 115 97.9 9.1E-08 5.7E-11 59.2 7.5 111 12-141 1-115 (115) 54 protein:vir:93902 Length: 131 97.9 2.3E-08 1.4E-11 62.5 3.9 118 17-145 1-124 (131) 55 protein:vir:94418 Length: 131 97.9 2.3E-08 1.4E-11 62.5 3.8 118 17-145 1-124 (131) 56 protein:vir:78349 Length: 127 97.8 1.3E-07 8.1E-11 58.3 6.8 119 12-145 1-126 (127) 57 protein:vir:98426 Length: 131 97.7 1.6E-06 9.7E-10 52.4 10.9 129 1-145 1-130 (131) 58 protein:vir:96002 Length: 133 97.6 3.8E-07 2.3E-10 55.8 6.5 121 12-145 1-133 (133) 59 protein:vir:98343 Length: 126 97.3 2.5E-06 1.5E-09 51.3 7.8 119 12-144 1-126 (126) 60 protein:vir:9415 Length: 126 # 97.3 2.5E-06 1.5E-09 51.3 7.8 119 12-144 1-126 (126) 61 protein:vir:101303 Length: 135 97.3 1.5E-06 9.2E-10 52.5 6.4 120 12-145 1-135 (135) 62 protein:vir:9514 Length: 135 # 97.3 1.5E-06 9.2E-10 52.5 6.4 120 12-145 1-135 (135) 63 protein:vir:100675 Length: 135 97.3 1.5E-06 9.2E-10 52.5 6.4 120 12-145 1-135 (135) 64 protein:vir:1387 Length: 116 # 97.2 1.3E-06 8.1E-10 52.9 5.2 113 12-144 1-116 (116) 65 protein:vir:81093 Length: 126 97.1 4.1E-06 2.5E-09 50.1 7.3 119 12-144 1-126 (126) 66 protein:vir:80001 Length: 126 97.1 4.1E-06 2.5E-09 50.1 7.3 119 12-144 1-126 (126) 67 protein:vir:1643 Length: 111 # 96.8 3.3E-05 2.1E-08 45.1 9.4 109 18-140 1-111 (111) 68 protein:vir:9709 Length: 141 # 96.7 1.3E-05 8.1E-09 47.4 6.6 118 13-143 1-141 (141) 69 protein:vir:94768 Length: 111 96.6 4.6E-05 2.8E-08 44.4 9.3 109 18-140 1-111 (111) 70 protein:vir:79571 Length: 137 96.6 5.6E-05 3.5E-08 43.9 9.4 127 12-145 1-137 (137) 71 protein:vir:9579 Length: 111 # 96.0 0.00012 7.8E-08 42.0 8.6 109 18-140 1-111 (111) 72 protein:vir:108220 Length: 133 95.8 0.00052 3.2E-07 38.6 11.0 118 12-145 1-130 (133) 73 protein:vir:9764 Length: 111 # 95.7 0.0003 1.8E-07 39.9 9.2 109 12-140 1-111 (111) 74 protein:vir:9931 Length: 119 # 95.0 0.00076 4.7E-07 37.7 9.2 113 12-142 1-119 (119) 75 protein:vir:78057 Length: 154 94.1 0.0019 1.2E-06 35.5 9.5 139 1-145 1-153 (154) 76 protein:vir:106554 Length: 122 93.4 0.0042 2.6E-06 33.6 10.1 112 12-145 1-115 (122) 77 protein:vir:7450 Length: 141 # 93.2 0.0032 2E-06 34.3 9.2 126 12-145 1-139 (141) 78 protein:vir:9648 Length: 126 # 92.7 0.0005 3.1E-07 38.7 3.9 117 11-144 1-126 (126) 79 protein:vir:3428 Length: 131 # 92.6 0.0052 3.2E-06 33.1 9.4 123 12-143 1-131 (131) 80 protein:vir:107857 Length: 154 91.0 0.0063 3.9E-06 32.6 8.1 126 13-145 1-154 (154) 81 protein:vir:94921 Length: 125 90.9 0.019 1.2E-05 30.1 12.5 120 12-145 1-125 (125) 82 protein:vir:79065 Length: 154 90.8 0.0068 4.2E-06 32.5 8.1 126 13-145 1-154 (154) 83 protein:vir:102955 Length: 138 90.2 0.022 1.4E-05 29.6 11.9 118 12-145 1-121 (138) 84 protein:vir:101509 Length: 139 88.5 0.0065 4E-06 32.6 6.1 124 12-145 1-138 (139) 85 protein:vir:102191 Length: 139 87.4 0.0087 5.4E-06 31.9 6.1 124 12-145 1-138 (139) 86 protein:vir:80105 Length: 162 86.6 0.044 2.7E-05 28.0 10.5 129 1-145 1-143 (162) 87 protein:vir:96764 Length: 177 86.5 0.045 2.8E-05 28.0 12.7 130 12-145 1-143 (177) 88 protein:vir:397 Length: 132 # 85.9 0.049 3.1E-05 27.8 9.7 125 12-143 1-132 (132) 89 protein:vir:10327 Length: 182 85.5 0.053 3.3E-05 27.6 12.0 130 12-145 1-142 (182) 90 protein:vir:7994 Length: 134 # 84.8 0.02 1.2E-05 29.9 6.7 119 12-145 1-131 (134) 91 protein:vir:102609 Length: 134 84.7 0.021 1.3E-05 29.8 6.8 119 12-145 1-131 (134) 92 protein:vir:105826 Length: 134 84.7 0.021 1.3E-05 29.8 6.8 119 12-145 1-131 (134) 93 protein:vir:79247 Length: 157 84.1 0.063 3.9E-05 27.1 11.8 129 12-145 1-151 (157) 94 protein:vir:98629 Length: 126 81.3 0.01 6.5E-06 31.5 3.7 117 11-145 1-126 (126) 95 protein:vir:8107 Length: 138 # 79.0 0.093 5.8E-05 26.2 8.1 120 12-145 1-132 (138) 96 protein:vir:3874 Length: 114 # 79.0 0.032 2E-05 28.7 5.6 106 12-134 1-114 (114) 97 protein:vir:79047 Length: 145 75.6 0.14 9E-05 25.2 12.6 119 12-145 1-124 (145) 98 protein:vir:99226 Length: 157 73.4 0.17 0.00011 24.8 12.1 129 12-145 1-151 (157) 99 protein:vir:107704 Length: 132 70.7 0.21 0.00013 24.3 9.7 121 15-145 1-129 (132) 100 protein:vir:105468 Length: 135 58.3 0.42 0.00026 22.7 11.5 118 17-145 1-122 (135) 101 protein:vir:78124 Length: 139 54.3 0.51 0.00031 22.2 8.8 126 12-145 1-138 (139) 102 protein:vir:6215 Length: 109 # 52.9 0.54 0.00034 22.0 8.0 106 12-142 1-109 (109) 103 protein:vir:95155 Length: 151 49.4 0.64 0.0004 21.6 11.8 127 12-145 1-151 (151) 104 protein:vir:1580 Length: 134 # 43.7 0.83 0.00052 21.0 9.7 125 17-145 1-131 (134) 105 protein:vir:95371 Length: 104 41.8 0.91 0.00057 20.8 9.9 102 12-140 1-104 (104) 106 protein:vir:80429 Length: 150 39.4 1 0.00063 20.5 8.7 129 12-145 1-150 (150) 107 protein:vir:4461 Length: 186 # 38.6 1.1 0.00066 20.5 12.1 125 12-145 1-143 (186) 108 protein:vir:1994 Length: 182 # 38.1 1.1 0.00067 20.4 8.4 120 12-145 1-140 (182) 109 protein:vir:101606 Length: 142 37.9 0.7 0.00043 21.4 4.8 129 12-142 1-142 (142) 110 protein:vir:103883 Length: 159 37.3 1.1 0.0007 20.3 10.8 131 10-145 1-153 (159) 111 protein:vir:98890 Length: 131 36.4 1.2 0.00073 20.2 9.5 124 17-145 1-129 (131) 112 protein:vir:104348 Length: 129 36.1 1.2 0.00074 20.2 10.5 117 12-144 1-129 (129) 113 protein:vir:8331 Length: 150 # 32.7 1.4 0.00087 19.8 7.1 118 12-145 1-144 (150) 114 protein:vir:103278 Length: 169 31.9 1.5 0.00091 19.7 10.3 133 1-144 27-169 (169) 115 protein:vir:80109 Length: 104 30.0 1.6 0.001 19.4 10.0 102 12-140 1-104 (104) 116 protein:vir:4515 Length: 186 # 30.0 1.6 0.001 19.4 11.4 125 12-145 1-143 (186) 117 protein:vir:9824 Length: 132 # 29.1 1.7 0.001 19.3 11.4 123 12-145 1-132 (132) 118 protein:vir:3037 Length: 132 # 29.1 1.7 0.001 19.3 11.4 123 12-145 1-132 (132) 119 protein:vir:488 Length: 187 # 28.1 1.8 0.0011 19.2 11.9 122 12-145 1-139 (187) 120 protein:vir:4705 Length: 126 # 25.3 2.1 0.0013 18.8 6.5 116 12-144 1-126 (126) No 1 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=100.00 E-value=1.1e-44 Score=261.67 Aligned_cols=133 Identities=19% Similarity=0.267 Sum_probs=130.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||++|+++++|||+||+.+++|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999997 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ +|++++|+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:95 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGVIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecc Confidence 9999999999985 799999999999999999999999999999999999999999 No 2 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=100.00 E-value=1.1e-44 Score=261.65 Aligned_cols=133 Identities=19% Similarity=0.266 Sum_probs=130.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||++|+++++|||+||+.+++|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999997 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ +|++++|+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:94 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecc Confidence 9999999999985 799999999999999999999999999999999999999999 No 3 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=100.00 E-value=1.2e-44 Score=261.42 Aligned_cols=133 Identities=19% Similarity=0.268 Sum_probs=130.8 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ ||+.+||+|||++|++|++|++++| +|||.+|++++||||+||+.+++|++++|..+.+|+++|||||+++|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMFEDVGVTLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999997 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|+ .+++++||+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 ~~ia~av~~aL~-a~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:10 81 SQIIQYLGFVLN-SEIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhC-CCcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEeecc Confidence 999999999996 7999999999999999999999999999999999999999999 No 4 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=100.00 E-value=1.2e-44 Score=261.41 Aligned_cols=133 Identities=19% Similarity=0.269 Sum_probs=130.8 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ ||+.+||+|||++|++|++|++++| +|||.+|++++||||+||+.+++|++++|..+.+|+++|||||+++|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMFEDVGVTLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999997 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|+ .+++++||+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 ~~ia~av~~aL~-a~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:10 81 SQIIQYLGFVLN-SEIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhC-CCcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEeecc Confidence 999999999996 7999999999999999999999999999999999999999999 No 5 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=100.00 E-value=1.3e-44 Score=261.23 Aligned_cols=133 Identities=19% Similarity=0.266 Sum_probs=130.6 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||.+|+++++|||+||+.+++|++++|..+.+++++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~a~~PYV~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ +|++++|+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:95 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDRYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecc Confidence 9999999999985 799999999999999999999999999999999999999999 No 6 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=100.00 E-value=1.5e-44 Score=260.92 Aligned_cols=133 Identities=19% Similarity=0.262 Sum_probs=130.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||.+|+++++|||+||+.+++|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ ++++++|+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:94 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEecc Confidence 9999999999984 799999999999999999999999999999999999999999 No 7 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=100.00 E-value=1.5e-44 Score=260.92 Aligned_cols=133 Identities=19% Similarity=0.262 Sum_probs=130.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||.+|+++++|||+||+.+++|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ ++++++|+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEecc Confidence 9999999999984 799999999999999999999999999999999999999999 No 8 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=100.00 E-value=1.5e-44 Score=260.92 Aligned_cols=133 Identities=19% Similarity=0.262 Sum_probs=130.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||.+|+++++|||+||+.+++|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ ++++++|+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:93 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEecc Confidence 9999999999984 799999999999999999999999999999999999999999 No 9 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=100.00 E-value=1.5e-44 Score=260.91 Aligned_cols=133 Identities=19% Similarity=0.262 Sum_probs=130.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ |++.+||+|||++|++|++|+++++ +|||.+|+++++|||+||+.+++|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV~D~~P~~a~~PYv~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCceecCCccCCCCCEEEeCcceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 99 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ +|++++|+++++++.+++.++||||.++||+++|||++|+|. T Consensus 81 k~ia~av~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEecCc Confidence 9999999999985 899999999999999999999999999999999999999999 No 10 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=100.00 E-value=2.7e-44 Score=259.41 Aligned_cols=134 Identities=28% Similarity=0.344 Sum_probs=129.5 Q ss_pred CcccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 10 GDMATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 10 ~~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) =+++||+.+||+|||++|++|++|++++|+|||++|+++++|||+||+.+.+|++++|..+.+|+++|||||++ |+.++ T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg~I~D~~P~~~~~PYV~lG~~~~~d~~~~~~~g~~~~~ti~Vws~~-g~~ea 79 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVNQVTESPGKDDPYPYVVIGDQSSTPFETKSSFGENITMDFHVWGGT-TRAEA 79 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhhhhhcCCCCCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEECC-ChHHH Confidence 34556789999999999999999999999999999999999999999999999999999999999999999987 78999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEec Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLD 144 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~ 144 (145) ++|+++|+++|++.+|++++++++++++.+++.++||||.++||+++||+++|+| T Consensus 80 ~~ia~av~~aL~~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 80 QDISSRVLEALTYKPLMFEGFTFVAKKLVLAQVITDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred HHHHHHHHHHhcCCCcccCCceEEEeEEeeeeEEecCCCceEEEEEEEEEEEecC Confidence 9999999999999999999999999999999999999999999999999999999 No 11 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=100.00 E-value=2.8e-44 Score=259.34 Aligned_cols=133 Identities=22% Similarity=0.339 Sum_probs=130.4 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ +++.+||+|||++|++|++|++++| +|||.+|+++++|||+||+.+.+|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYSQFATQYEA 80 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 88 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|+ .++.++|++++++++.++++++||||.++||+++|||++++|+ T Consensus 81 k~ia~av~~AL~-~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~ 135 (141) T protein:vir:94 81 KLILSAIGYVLN-RPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKK 135 (141) T ss_pred HHHHHHHHHHhc-ccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecc Confidence 999999999997 5799999999999999999999999999999999999999999 No 12 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=100.00 E-value=2.8e-44 Score=259.34 Aligned_cols=133 Identities=22% Similarity=0.339 Sum_probs=130.4 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ +++.+||+|||++|++|++|++++| +|||.+|+++++|||+||+.+.+|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYSQFATQYEA 80 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 88 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|+ .++.++|++++++++.++++++||||.++||+++|||++++|+ T Consensus 81 k~ia~av~~AL~-~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~ 135 (141) T protein:vir:96 81 KLILSAIGYVLN-RPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKK 135 (141) T ss_pred HHHHHHHHHHhc-ccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecc Confidence 999999999997 5799999999999999999999999999999999999999999 No 13 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=100.00 E-value=2.8e-44 Score=259.34 Aligned_cols=133 Identities=22% Similarity=0.339 Sum_probs=130.4 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ +++.+||+|||++|++|++|++++| +|||.+|+++++|||+||+.+.+|++++|..+.+|+++|||||++.|+.++ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vws~~~g~~ea 80 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNESSATMRETVGIVIHVYSQFATQYEA 80 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 88 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|+ .++.++|++++++++.++++++||||.++||+++|||++++|+ T Consensus 81 k~ia~av~~AL~-~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~ 135 (141) T protein:vir:10 81 KLILSAIGYVLN-RPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKK 135 (141) T ss_pred HHHHHHHHHHhc-ccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecc Confidence 999999999997 5799999999999999999999999999999999999999999 No 14 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=100.00 E-value=1.4e-43 Score=255.59 Aligned_cols=133 Identities=22% Similarity=0.315 Sum_probs=130.6 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ ||+.|||+|||++|++|++|+++++ +|||++|++++||||+||+.+.+|++++|..+.+++++|||||+++|+.++ T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~~VyD~~P~~~~~Pyv~lG~~~~~~~~~~~~~g~~~~~~i~Vws~~~g~~ea 80 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGDRVFDVVQEDAVYPYIVVGESNVTNNESSTMMRETVGIVIHVYSQFATQYEA 80 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCCccccCCccCCCCCEEEecCceeeecCCCcccceEEEEEEEEEEcCCCHHHH Confidence 88 9999999999999999999999998 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|+ ++|+++||+++++++.++++++||||.++||+++|||+++|-+ T Consensus 81 ~~ia~av~~AL~-~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~r~~v~~~~ 135 (140) T protein:vir:96 81 KQIISAIGYVLN-RPIDIENYEFQFSRIDSQSVFPDIDRFTKHGTIRLLFKYRHIK 135 (140) T ss_pred HHHHHHHHHHhC-CCccCCCCeEEEEEEeeeEEEecCCCceEEEEEEEEEEEEeec Confidence 999999999996 7899999999999999999999999999999999999999999 No 15 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=100.00 E-value=2.2e-43 Score=254.50 Aligned_cols=134 Identities=22% Similarity=0.260 Sum_probs=128.8 Q ss_pred CcccchHHHHHHHHHHHhhcChhhhhhhhc-cccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHH Q lcl|NC_014229. 10 GDMATALPALQASVYAKLVGHAPLTALVSG-VYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAE 88 (145) Q Consensus 10 ~~M~~~~~aLq~Ai~~~L~~da~l~alv~~-IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~e 88 (145) =+ +|++.|||+|||++|++|++|++++|+ |||++|++++||||+||+.+.+|+++||..+.+++++|||||++.|+.| T Consensus 1 ~~-msa~~aLq~Ai~~~L~ad~~l~alvggrVyD~~P~~~~~PYV~lG~~~~~~~~~~~~~g~~~~~tl~Vws~~~g~~e 79 (140) T protein:vir:96 1 MW-VTAEPLLYNKIMNNLIENPITDKLVGGRVFDCVQKDVVYPYIVVGESNVTESERSPGMREIIAITFHVYSQYENGAE 79 (140) T ss_pred Cc-cchhHHHHHHHHHHhccChhHHhhcCcccccCCccCCCCCEEEeCCceeeecCCCcccceEEEEEEEEEEcCCCHHH Confidence 12 366789999999999999999999984 9999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 89 AHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 89 a~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +++|+++|+++|+ .+++++||+++++++.+++.++||||.++||+++|||++|+|+ T Consensus 80 a~~ia~ai~~aL~-~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~ve~~~ 135 (140) T protein:vir:96 80 ARELLKYLNYACR-LNINFKDYELEWIKKDNSQVFTDIDQYTKHGVLRLLYKVRHKT 135 (140) T ss_pred HHHHHHHHHHHhc-CCccCCCceEEEEEEeeeEEeecCCCceEEEEEEEEEEEeecc Confidence 9999999999996 7999999999999999999999999999999999999999999 No 16 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=100.00 E-value=8.5e-43 Score=251.24 Aligned_cols=134 Identities=18% Similarity=0.252 Sum_probs=129.8 Q ss_pred cccchHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 11 DMATALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 11 ~M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) =|+||+.+||+|||++|++|++|++++| +|||++|++++||||+||+.+.+|+++||..+.+++++|||||+.+|+.++ T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~vyD~~P~~~~~PyV~lG~~~~~~~~t~~~~~~~~~lti~Vws~~~gr~ea 80 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGRVFDCVQKDAVYPYIVVGETNVTNKETTTSMVEDVGITLHVYSQARNRDEA 80 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCcccccCCccCCCCCEEEeccceeeecCCCcccceEEEEEEEEEEcCccHHHH Confidence 3349999999999999999999999997 599999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ++|+++|+++|++ ++.++||+++++++.++++++||||.++||+++|||++|+|+ T Consensus 81 ~~ia~ai~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~d~~~~hgvl~~ra~i~~~~ 135 (145) T protein:vir:12 81 SQIIQFLGFVLNN-EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHhcc-ccCCCCceEEEEEEeeEEEEecCCCceEEEEEEEEEEEEeCC Confidence 9999999999985 899999999999999999999999999999999999999999 No 17 >protein:vir:2741 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695114;genbank:gi:23455883;genbank:GeneID:955650 Probab=99.91 E-value=5.8e-27 Score=164.43 Aligned_cols=124 Identities=16% Similarity=0.261 Sum_probs=111.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) |++|..+|++++|++|++ +..+|||.+| ++++||||++|+.+..++++|+..+++++++||||+..++|.++. T Consensus 1 M~sp~qeL~~~lf~~l~~------~g~~vyD~lP~~~~~YPfV~ig~~~~~~~~tkt~~~g~~~l~i~vW~~~~~R~~v~ 74 (128) T protein:vir:27 1 MKQPDQLLHDEMYRISCE------LGYNTYTYLPPDDAAYPFVVMGETMVLPQSTKSHLIGRLSSTVHVWGHVDDRKTLS 74 (128) T ss_pred CCCHHHHHHHHHHHHHHh------cCCceeccCCCCCCCcCEEEeccceecCCccccccccEEEEEEEEEECCcchhHHH Confidence 999999999999999985 3447999988 689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCCccCCceEE-EEEEeeeeeeec--CCCceEEEEEEEEEEEE Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDV-SIKHSNHQALKD--PEPGVRHINAEYRVRLT 142 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v-~~~~~~~~~~~d--~d~~~~hg~l~fra~~~ 142 (145) +|+++|..++.+. ...+||++. .....+.+.+.| .++.++||+++|++.+- T Consensus 75 ~i~~~i~~~~~~~-~~t~~y~~~~~~~~~~~qil~Dtst~~~l~Hgii~l~f~~~ 128 (128) T protein:vir:27 75 DMAGQLMSSFFAI-KKIGGKQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHHHHHHHhccc-cccCCeeEEEEeecceEEEeeecCCCceeeEEEEEEEEEeC Confidence 9999999999765 677888864 456677788876 67889999999999999 No 18 >protein:vir:4907 Length: 128 # NCBI annotation: gp128 # Family: family:all:504 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056685;genbank:gi:9635020;genbank:GeneID:1262660 Probab=99.90 E-value=2.2e-26 Score=161.25 Aligned_cols=124 Identities=17% Similarity=0.275 Sum_probs=109.0 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) |++|..+|++++|++|++ ++..|||.+| ++++||||++|+++..++++|+..+++++++||||++.++|.++. T Consensus 1 m~sp~q~L~~~~f~~l~~------~g~~vyD~lP~~~v~YPfV~ig~~~~~~~~tKt~~~g~v~ltihVW~~~~~R~ev~ 74 (128) T protein:vir:49 1 MKQPDQLLHDEMYRISCE------LGYNTYTYLPPDDAAYPFVVMGETMVLPQSTKSHLIGRLSSTVHVWGRVDDRKTLS 74 (128) T ss_pred CCchHHHHHHHHHHHHHh------cCCceecccCCCCCCCCEEEeeeeeecCCccccccccEEEEEEEEEeCCCCchhHH Confidence 999999999999999975 3347999988 578999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCCccCCceEE-EEEEeeeeeeec--CCCceEEEEEEEEEEEE Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDV-SIKHSNHQALKD--PEPGVRHINAEYRVRLT 142 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v-~~~~~~~~~~~d--~d~~~~hg~l~fra~~~ 142 (145) +|+++|.+++.+. ...+++.+. ++...+.+.+.| .++.++||++++++.+- T Consensus 75 ~i~~~i~~~l~~~-~~t~~y~f~~~i~~s~~~~~~D~st~~~L~Hgvl~l~f~~~ 128 (128) T protein:vir:49 75 DMAGQLMSSFFAI-KNIGGKQFSAEINQSSIDSNRDNSTDEVLYHFVIYTYFKFV 128 (128) T ss_pred HHHHHHHHHhhcc-cccCCeEEEEEeccceEEEEeecCCCcceeeEEEEEEEEeC Confidence 9999999999764 577887653 566666777766 56678999999888888 No 19 >protein:vir:96485 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238497;genbank:gi:66391773;genbank:GeneID:5176907 Probab=99.90 E-value=2.4e-26 Score=161.05 Aligned_cols=124 Identities=18% Similarity=0.288 Sum_probs=111.8 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) |++|..+|++++|++|++ +...|||.+| ++++||||++|+++..++++|+..+++++++||||++.++|.++. T Consensus 1 m~sp~qeL~d~~f~~l~~------~g~~vyd~lP~~~v~YPfV~ig~~~~~~~~tKt~~~g~v~ltihVW~~~~~R~~v~ 74 (128) T protein:vir:96 1 MKQPDQLLHDEMYRISSG------LGYDTYTYLPPEGAAYPFVVMGETMVLPQSTKSHLIGRLSSTVHVWGRVDDRKTLS 74 (128) T ss_pred CCCHHHHHHHHHHHHHHh------cCCeeecccCCCCCCCCEEEEeeeeecCCccccccccEEEEEEEEEECCCCchhHH Confidence 999999999999999985 3347999987 788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCCccCCceEE-EEEEeeeeeeec--CCCceEEEEEEEEEEEE Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDV-SIKHSNHQALKD--PEPGVRHINAEYRVRLT 142 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v-~~~~~~~~~~~d--~d~~~~hg~l~fra~~~ 142 (145) +|+++|..+|.+. ...+||.+. +....+.+.+.| .++.++||++++++.+- T Consensus 75 ~i~~~i~~~l~~~-~~t~~y~~~~~~~~~~~qii~D~st~~~l~Hgil~l~f~~~ 128 (128) T protein:vir:96 75 DMAGQLMSSFFTI-KNIDGMQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHHHHHHHhhhh-hccCCeEEEEEEeeeeEEEeeecCCCceeeEEEEEEEEEeC Confidence 9999999999765 688999885 456677788888 56789999999999888 No 20 >protein:vir:744 Length: 129 # NCBI annotation: major structural protein 2 # Family: family:all:504 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108721;genbank:gi:13487843;genbank:GeneID:920879 Probab=99.89 E-value=5.6e-26 Score=159.03 Aligned_cols=125 Identities=18% Similarity=0.246 Sum_probs=109.5 Q ss_pred cccchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 11 DMATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 11 ~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) =|++|..+|++++|++|++ +..+|||.+| ++++||||++|+++..++++|+..+++++++||||+..++|.++ T Consensus 1 mmksp~qeL~d~~~~~l~~------lG~~vyD~lP~~~v~YPfV~ig~~~~~~~~tKt~~~g~v~ltihVW~~~~~R~~v 74 (129) T protein:vir:74 1 MIKTRDQSIFDELFKRIQA------LGYTVYDYKPMNEVGYPFVELENTQTIHEANKTDIKGTVSLSLSVWGLQKKRKEV 74 (129) T ss_pred CCcChhHHHHHHHHHHHHh------cCCeeeeccCCCCCCcCEEEeeeeeecCCccccccccEEEEEEEEeeCCccchhH Confidence 7999999999999999974 4457999977 67889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEE-EEEEeeeeeeec--CCCceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDV-SIKHSNHQALKD--PEPGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v-~~~~~~~~~~~d--~d~~~~hg~l~fra~~~ 142 (145) .+|+++|..++.+...+ +||.+. .....+.+.+.| +|+.++||++++++.++ T Consensus 75 ~~i~~~i~~~~~~~~~t-~~y~~~~~~~~~~~q~~~Dtst~~~L~Hgvi~l~f~~r 129 (129) T protein:vir:74 75 SDMASNIFNQALNISAT-DGYSWALNSQASTIQMLDDTTTHTPLKRALINLEFRLR 129 (129) T ss_pred HHHHHHHHHHhcccccc-CCcEEEEeecceeEEEcccCCCCceeeeEEEEEEEEeC Confidence 99999999999865544 777653 223355577777 88999999999999999 No 21 >protein:vir:3972 Length: 129 # NCBI annotation: structural protein # Family: family:all:504 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663680;genbank:gi:21716117;genbank:GeneID:951217 Probab=99.89 E-value=7.1e-26 Score=158.47 Aligned_cols=125 Identities=15% Similarity=0.205 Sum_probs=110.9 Q ss_pred cccchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 11 DMATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 11 ~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) =|++|..+|++++|++|++ +..+|||.+| ++++||||++|+++..++++|+..+++++++||||+..++|.++ T Consensus 1 mmksp~qeL~d~~f~~l~~------lG~~vyD~lP~~~v~YPfV~ig~~~~~~~~tKt~~~g~v~ltihVW~~~~~R~~v 74 (129) T protein:vir:39 1 MIKTRDQSIFDELFKRIQA------LGYTVYDYKQMNEVGYPFVEMENTQTIHEPNKTDIKGTVSLSLSVWGLQKKRKEV 74 (129) T ss_pred CCcChhHHHHHHHHHHHHh------cCCeeeeccCCCCCCcCEEEeeeeeecCCccccccccEEEEEEEEEeCCcCchhH Confidence 7999999999999999974 4457999977 67899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEE-EEEEeeeeeeec--CCCceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDV-SIKHSNHQALKD--PEPGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v-~~~~~~~~~~~d--~d~~~~hg~l~fra~~~ 142 (145) .+|+++|..++.+ ....+||++. .......|.+.| +|+.++||++++++.++ T Consensus 75 ~~i~~~i~~~~~~-~~~t~~y~~~~~~~~~~~q~~~Dts~~~~L~Hgvi~l~f~~r 129 (129) T protein:vir:39 75 SDMASNIFNQALN-ISATDGYSWALNLQASTIQMMDDTTTGTPLKRAFINLEFRLR 129 (129) T ss_pred HHHHHHHHHHhcc-cccCCCeeEEEeecceeEEEecccCCCceeeeEEEEEEEEeC Confidence 9999999998865 4566888765 344556678877 78899999999999999 No 22 >protein:vir:3618 Length: 129 # NCBI annotation: ORF41 # Family: family:all:504 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112704;genbank:gi:13786572;genbank:GeneID:921070 Probab=99.88 E-value=1.3e-25 Score=157.05 Aligned_cols=124 Identities=17% Similarity=0.251 Sum_probs=107.7 Q ss_pred cccchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 11 DMATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 11 ~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) =|++|..+|++++|++|++ +..+|||.+| ++++||||++|+++..++++|+..+++++++||||+..++|.++ T Consensus 1 mmksp~qeL~d~~f~~l~~------lG~~vyD~lP~~~v~YPfV~ig~~~~~~~~tKt~~~g~v~ltihVW~~~~~R~~v 74 (129) T protein:vir:36 1 MIKTRDQSIFDELFKRIQA------LGYTVYDYKPMNEVGYPFVELENTQTIHEANKTDIKGTVSLSLSVWGLQKKRKEV 74 (129) T ss_pred CCcChhHHHHHHHHHHHHh------cCCeeeeccCCCCCCcCEEEeeeeeecCCccccccccEEEEEEEEEeCCcCchhH Confidence 7999999999999999974 4457999988 67889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEee--eeeeec--CCCceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSN--HQALKD--PEPGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~--~~~~~d--~d~~~~hg~l~fra~~~ 142 (145) .+|++.|..++.+...+ +||++. .+... .+...| +++..+||++++++.++ T Consensus 75 ~~i~~~i~~~~~~~~~t-~~y~~~-~~~~~~~~q~~~D~st~~~L~Hgii~l~f~~r 129 (129) T protein:vir:36 75 SDMASNIFNQALNISAT-DGYSWA-LNSQASTIQMLDDTTTNTPLKRALINLEFRLR 129 (129) T ss_pred HHHHHHHHHHhcccccC-CCeEEE-EEeeeeeEEEeccCCCCceeeEEEEEEEEEeC Confidence 99999999999866544 888753 24444 455666 56778999999999999 No 23 >protein:vir:99537 Length: 125 # NCBI annotation: putative protein # Family: family:all:504 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958542;genbank:gi:41179324;genbank:GeneID:2717175 Probab=99.82 E-value=9.3e-23 Score=141.37 Aligned_cols=122 Identities=17% Similarity=0.159 Sum_probs=108.3 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHR 91 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~ 91 (145) +||..+|..++|+.++. +...|||.+| ++++||||++|+.+..+..+|+..+++++++||||+..++|.++.+ T Consensus 1 m~P~q~Lfd~~f~~~~~------lG~~vyD~lP~~~v~YPFVvig~~~~~~~~tKt~~~g~i~lti~VWg~~~~R~~v~~ 74 (125) T protein:vir:99 1 MNPYEELFKTVIEYCKK------TGYPTFDYLPDESQGYPFIMVGDQINNDIYAKDFVTGTSNLTIHVFAEYNYRAEVAT 74 (125) T ss_pred CchhHHHHHHHHHHHHh------cCCceeeecCCCCCCcCEEEEeeeeecCCCCccccceEEEEEEEEeeCcccchhHHH Confidence 78999999999998874 4447999988 5688999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCce--EEEEEEEEEEEE Q lcl|NC_014229. 92 IFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGV--RHINAEYRVRLT 142 (145) Q Consensus 92 I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~--~hg~l~fra~~~ 142 (145) |+++|..++. .....+||.+. +.-.+.+.+.|.++.+ +||++++++.++ T Consensus 75 i~~~i~~~~~-~~~~t~~y~~~-~~~~~~qii~D~s~~t~L~Hg~l~l~F~ir 125 (125) T protein:vir:99 75 IMEQIQQLIP-KFITTNHYLFG-LTGSSSNILGETADSIQLQHGRLILDFNLR 125 (125) T ss_pred HHHHHHHHhc-cceeccCcEEE-eeeeeEEEeecCCCCceeeEEEEEEEEeeC Confidence 9999999774 55788998874 5567788898876655 999999999999 No 24 >protein:vir:106593 Length: 131 # NCBI annotation: ORF039 # Family: family:all:504 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239498;genbank:gi:66395251;genbank:GeneID:4555747 Probab=99.63 E-value=5.7e-18 Score=115.12 Aligned_cols=125 Identities=22% Similarity=0.258 Sum_probs=105.5 Q ss_pred CcccchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCH Q lcl|NC_014229. 10 GDMATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGF 86 (145) Q Consensus 10 ~~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~ 86 (145) =-|++|..+|...+|+.+++ +...+||..|. .++||||++|+++. .+..+|+..+.+++++||||+..+.| T Consensus 1 mm~ksp~qeLfd~~f~~~~~------lGy~vyd~lP~~~ev~YPFVvig~~~~~~~~~tKt~~~g~v~lti~VWg~~~~R 74 (131) T protein:vir:10 1 MLKTTPQQALFDSIYAQLLG------YGIDVIDFKELNSQLTYPFFVLRDVEANKSKYTMESVGGELTVIIDLWNYAEDR 74 (131) T ss_pred CCccChhHHHHHHHHHHHHh------cCCceeeccCCCCCCCCCEEEEeeeeccCCCCcccccceEEEEEEEEeecchhh Confidence 35799999999999998884 44479999884 47899999999987 58999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeec---CCCceEEEEEEEEEEEE Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKD---PEPGVRHINAEYRVRLT 142 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d---~d~~~~hg~l~fra~~~ 142 (145) .++.+|++.+...+.+. ...+||.+ .+...+++.+.| ++....||++++++.+= T Consensus 75 ~~vs~i~~~i~~~~~~~-~~td~y~~-~~~~~~~~~i~D~sttn~~L~Hg~i~lef~~~ 131 (131) T protein:vir:10 75 GQHDSIVGATEWMLTGI-ESVEGYQL-MIDDINIKTLNDVENSDRQLLHTVIIAIYKLF 131 (131) T ss_pred hhHHHHHHHHHHHhhcc-eecccceE-EecceEEEEEeccCCCCceeeeEEEEEEEEeC Confidence 99999999999998543 36788887 445555555554 56679999999999999 No 25 >protein:vir:95765 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950594;genbank:gi:119953789;genbank:GeneID:5076835 Probab=99.62 E-value=9.1e-18 Score=114.01 Aligned_cols=122 Identities=20% Similarity=0.263 Sum_probs=97.7 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCc-ccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVP-EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHR 91 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP-~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~ 91 (145) ++|..+|..++|+ +. .+...+||..| ++++||||++|+++..+..+|+.. +++.++||||+...+|.++.+ T Consensus 1 m~P~qeLfd~~f~-~~------~~Gy~vYD~lP~~~v~YPFVvig~~~~~~~~tKt~~-G~i~l~i~VWg~~~~R~~vs~ 72 (127) T protein:vir:95 1 MTPNHALFRRLFA-IS------NIRVDTYDFLPDAKSAYPFVYIGENNGSDIPNKDLL-GRLRQTVHLYGLRTDRANLDD 72 (127) T ss_pred CchhHHHHHHHHH-HH------hcCCccccccCcCCCCcCEEEEeeeeecccccceee-eEEEEEEEeecCchhhhhHHH Confidence 7899999999995 55 23237999987 789999999999999999999965 589999999999999999999 Q ss_pred HHHHHHHHhcCCCCccCCceE-EEEEEeeeeeeecC--CCceEEEEEEEEEEEEecC Q lcl|NC_014229. 92 IFAALDAALDRVPLTVAGCTD-VSIKHSNHQALKDP--EPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 92 I~~aV~~aL~~~~l~l~g~~~-v~~~~~~~~~~~d~--d~~~~hg~l~fra~~~~~~ 145 (145) |+++|++++... ..++.. ....-.+.+.+.|. +...+||++++++....-. T Consensus 73 i~~~i~~~~~~~---~~~~~y~~~~~~s~~qil~Dtstnt~L~Hgil~l~f~f~~~~ 126 (127) T protein:vir:95 73 ISAYLESEVKRA---HDGYDYHLYHVETSKQIIPDNTDVQPLLHIVLDFTFDYTKKE 126 (127) T ss_pred HHHHHHHHhhhh---cccceeEEEEecceeEEecccCCcceeEEEEEEEEEEeeccC Confidence 999999988543 233222 23344566777775 5667999999999887655 No 26 >protein:vir:9313 Length: 127 # NCBI annotation: phi Mu50B-like protein # Family: family:all:504 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803291;genbank:gi:29028601;genbank:GeneID:1258049 Probab=99.51 E-value=2.9e-16 Score=105.73 Aligned_cols=122 Identities=16% Similarity=0.175 Sum_probs=101.7 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:93 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecC--CCceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDP--EPGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~--d~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|. +....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~ttn~~L~Hgvl~lef~~~ 127 (127) T protein:vir:93 75 DGLVKRCIDDLTP-SVKTNDYDF-EEEDTNITQLVDDTTNQELLHTSVTISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeEEcccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 3455566666654 3368999999999888 No 27 >protein:vir:96355 Length: 127 # NCBI annotation: ORF038 # Family: family:all:504 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239652;genbank:gi:66395404;genbank:GeneID:5132831 Probab=99.51 E-value=3.3e-16 Score=105.49 Aligned_cols=122 Identities=16% Similarity=0.168 Sum_probs=101.6 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:96 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|.. ....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~ttn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:96 75 DGLVKRCIDDLTP-SVKTNDYDF-EEDDTNITQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeEEcccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 44555666666543 367899999999888 No 28 >protein:vir:78854 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285366;genbank:gi:148717894;genbank:GeneID:5246985 Probab=99.51 E-value=3.3e-16 Score=105.49 Aligned_cols=122 Identities=16% Similarity=0.168 Sum_probs=101.6 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:78 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|.. ....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~ttn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:78 75 DGLVKRCIDDLTP-SVKTNDYDF-EEDDTNITQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeEEcccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 44555666666543 367899999999888 No 29 >protein:vir:103918 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873997;genbank:gi:118430772;genbank:GeneID:4525410 Probab=99.50 E-value=3.6e-16 Score=105.24 Aligned_cols=122 Identities=16% Similarity=0.168 Sum_probs=100.8 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:10 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|.. ....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:10 75 DGLVKRCIDDLTP-SVKTNDYDF-EEDDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 34445556665533 358999999999888 No 30 >protein:vir:97143 Length: 127 # NCBI annotation: ORF041 # Family: family:all:504 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239730;genbank:gi:66394906;genbank:GeneID:5130876 Probab=99.50 E-value=3.6e-16 Score=105.24 Aligned_cols=122 Identities=16% Similarity=0.168 Sum_probs=100.8 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:97 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|.. ....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:97 75 DGLVKRCIDDLTP-SVKTNDYDF-EEDDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 34445556665533 358999999999888 No 31 >protein:vir:99769 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004312;genbank:gi:122891766;genbank:GeneID:4712324 Probab=99.50 E-value=3.6e-16 Score=105.24 Aligned_cols=122 Identities=16% Similarity=0.168 Sum_probs=100.8 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:99 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|.. ....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:99 75 DGLVKRCIDDLTP-SVKTNDYDF-EEDDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 34445556665533 358999999999888 No 32 >protein:vir:96217 Length: 127 # NCBI annotation: ORF036 # Family: family:all:504 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239575;genbank:gi:66395326;genbank:GeneID:5132763 Probab=99.50 E-value=3.6e-16 Score=105.24 Aligned_cols=122 Identities=16% Similarity=0.168 Sum_probs=100.8 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcc--cCCCCEEEecccee-eecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPE--PAPYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~--~a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) ++|..+|...+|+.++. +.-.+||..|. ..+||||++|+++. .+..+|+..+.+++++||||+..+.|.+. T Consensus 1 mtp~qeLfd~~f~~~~~------lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:96 1 MTPNLQLYNKAYETLQG------YGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHh------cCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchH Confidence 78999999999998873 33479999885 36899999999987 58899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+|++.+...+.+ ....+||.+ .++..+.+.+.|.. ....||++++++..= T Consensus 75 s~i~~~i~~~~~~-~~~t~~y~~-~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:96 75 DGLVKRCIDDLTP-SVKTNDYDF-EEDDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHHHHHHHhcc-ceeccceeE-EeeeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 9999999988863 346688876 34445556665533 358999999999888 No 33 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=99.32 E-value=1.1e-14 Score=97.09 Aligned_cols=113 Identities=15% Similarity=0.232 Sum_probs=91.7 Q ss_pred HHHHHHHHhhcChhhhhhhh----cccc--CCcccCCCCEEEeccceeeecCCCccc--ceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 19 LQASVYAKLVGHAPLTALVS----GVYD--EVPEPAPYPYVSFGSMTEFPEDAHDRQ--GLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 19 Lq~Ai~~~L~~da~l~alv~----~IyD--~vP~~a~~Pyv~iG~~~~~~~~~~~~~--~~~~~~~I~vws~~~g~~ea~ 90 (145) .-..||+.|++|++|++++| |||+ ..|+++++|||++-.....+.+..+.. ....++||+||++. +.+|+ T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~g~~~~~~~~vQIDvyA~t--~~~A~ 78 (121) T protein:vir:43 1 MYPPIFKVCSSSPAVTAILGASPLRMYQFGLAPQLVVKPYATWQTISGSPENYLWGRPDADGFTIQVDIFSAT--AAEAR 78 (121) T ss_pred CChHHHHHHhhChhhhhhhcCCCceeeccCCCCCCCcCCeEEEEEecCcccceecCCCCcceeEEEEEeeeCC--HHHHH Confidence 45679999999999999997 5986 469999999999999998888777654 35689999999886 88999 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) +++++|+.+|+... +.. ......+|+|++.+|..+++...+.+ T Consensus 79 ~l~~av~~Al~~~~-----~~~-----~~~~~~ye~dT~lyR~s~Dv~w~~~r 121 (121) T protein:vir:43 79 DAAKAIRDAIELSA-----YVV-----RWGGESVDPDTKTYRVSFDVDWIVQR 121 (121) T ss_pred HHHHHHHHHhhhcC-----Ccc-----cCCCCCCcccccceeeeeEEEEeecC Confidence 99999999996432 211 11224478999999999999888888 No 34 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=99.24 E-value=5.9e-14 Score=93.12 Aligned_cols=113 Identities=18% Similarity=0.261 Sum_probs=91.1 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh----cccc--CCcccCCCCEEEeccceeeecCCCccc--ceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS----GVYD--EVPEPAPYPYVSFGSMTEFPEDAHDRQ--GLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~----~IyD--~vP~~a~~Pyv~iG~~~~~~~~~~~~~--~~~~~~~I~vws~~ 83 (145) |.- =||+.|++|++|++++| |||+ ..|+++++|||++-.....+..+.++. ....++||+||++. T Consensus 1 m~~-------~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~vQIDvyA~t 73 (121) T protein:vir:18 1 MIA-------PIFSVCASSPEVTDLLGSNPVRIYPFGIQDDNVVYPYVVWQNITGSPENYIAQRPDADFFTLQVDAYADT 73 (121) T ss_pred Cch-------HHHHHHhcChhhhhhhcCCCceeeeccCCCCcCcCCeEEEEEecCcccceecCCCCcceeEEEEEeecCC Confidence 444 38999999999999996 6987 479999999999999998888776653 35689999999986 Q ss_pred CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 84 PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 84 ~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) +.+|++++++|+.+|+.. ++. .+ .....+|+|++.+|..++....+.+ T Consensus 74 --~~~A~~l~~avr~Ale~~-----~~~-~~----~~~~~ye~dT~lyR~s~Dv~~~~~r 121 (121) T protein:vir:18 74 --VDEVIAVATALRDAIEPH-----AHI-TR----WGGQERDPETKRYRYSFDVDWIVTR 121 (121) T ss_pred --HHHHHHHHHHHHHHhhhc-----Ccc-cC----CCCCCCcccccceeeeeEEEEeecC Confidence 889999999999999642 221 11 1123478999999999999999988 No 35 >protein:vir:9880 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:1887 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795642;genbank:gi:28876399;genbank:GeneID:1257930 Probab=99.17 E-value=5.1e-13 Score=88.00 Aligned_cols=129 Identities=23% Similarity=0.299 Sum_probs=115.0 Q ss_pred cccc-hHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCC-CHHH Q lcl|NC_014229. 11 DMAT-ALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSP-GFAE 88 (145) Q Consensus 11 ~M~~-~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~-g~~e 88 (145) -|+. +...|..||...+....+| +.||.+|++.+.||..|+-.+.+|.++|+..-+..++.||.+|+.+ ++.. T Consensus 1 mLKkLsl~~l~~aV~~~iee~tgL-----~c~d~~p~~ep~Pfyfie~I~~rpe~sKtmw~e~y~~~IHais~~g~t~~~ 75 (136) T protein:vir:98 1 MLKKLGLVDLHASIKQKIEDKTGL-----MAYDHVPEDMPSPFYFIEVVDKRPEDTKVMWCEVFTVWIHAIAEAGKSKIA 75 (136) T ss_pred CccccchHHHHHHHHHHhhccCCc-----eEEEecccCCCCCEEEEEeecCCccccceeeeeEEEEEEEEEcCCCCccch Confidence 3333 3578999999999999988 5899999999999999999999999999999999999999999965 7888 Q ss_pred HHHHHHHHHHHhcCCCCcc-CCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 89 AHRIFAALDAALDRVPLTV-AGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 89 a~~I~~aV~~aL~~~~l~l-~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .-++.+++..||... +.| .||++...+....+.+..++++.+|+++.|++++--.= T Consensus 76 ~~~mI~~l~EAlte~-i~Lpe~y~l~~q~~~G~q~~~~~etge~HAi~~fei~vsygf 132 (136) T protein:vir:98 76 IYDMIEKLEEALTEE-LVLPEEIDILRQSEVGMQSLQEDETGEMHAIVAYEIKVSYGF 132 (136) T ss_pred HHHHHHHHHhhhhce-eecCCCeEEEEEechhhhheecccCCceeeeeeEEEEEeeeE Confidence 999999999999654 455 67999999999999999999999999999999987766 No 36 >protein:vir:102888 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338141;genbank:gi:77020213;genbank:GeneID:3703797 Probab=99.02 E-value=1.7e-12 Score=85.16 Aligned_cols=114 Identities=16% Similarity=0.355 Sum_probs=90.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--ccccC-CcccCCCCEEEeccceeeec--CCCcccceEEEEEEEEEECCCCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYDE-VPEPAPYPYVSFGSMTEFPE--DAHDRQGLSVTVVIHVWSKSPGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD~-vP~~a~~Pyv~iG~~~~~~~--~~~~~~~~~~~~~I~vws~~~g~ 86 (145) |.+ +.+-|+.+|++|+.++++++ .||+. +|++...|||++-+....|. ..+.....++++||||||..+ T Consensus 1 M~~----i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~~-- 74 (119) T protein:vir:10 1 MIN----LRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKSS-- 74 (119) T ss_pred CCc----hHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCCC-- Confidence 776 45778899999999999997 48885 78888899999976666554 344556789999999999853 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) ..+|..+|..+|.. .||. ..++....++|...+|-++||+..++- T Consensus 75 --~~~i~~~I~~~m~~-----~gf~-----r~~~~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 75 --TTAIHQKVNEIMKR-----IGFS-----RYAVADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred --HHHHHHHHHHHHHH-----cCCe-----eeccCCCcCChhhhheeeeeeeeeeeC Confidence 56889999999843 2443 233445778999999999999999999 No 37 >protein:vir:102086 Length: 119 # NCBI annotation: structural protein # Family: family:all:517 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512319;genbank:gi:89152488;genbank:GeneID:3953079 Probab=99.02 E-value=1.7e-12 Score=85.16 Aligned_cols=114 Identities=16% Similarity=0.355 Sum_probs=90.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--ccccC-CcccCCCCEEEeccceeeec--CCCcccceEEEEEEEEEECCCCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYDE-VPEPAPYPYVSFGSMTEFPE--DAHDRQGLSVTVVIHVWSKSPGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD~-vP~~a~~Pyv~iG~~~~~~~--~~~~~~~~~~~~~I~vws~~~g~ 86 (145) |.+ +.+-|+.+|++|+.++++++ .||+. +|++...|||++-+....|. ..+.....++++||||||..+ T Consensus 1 M~~----i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~~-- 74 (119) T protein:vir:10 1 MIN----LRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKSS-- 74 (119) T ss_pred CCc----hHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCCC-- Confidence 776 45778899999999999997 48885 78888899999976666554 344556789999999999853 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) ..+|..+|..+|.. .||. ..++....++|...+|-++||+..++- T Consensus 75 --~~~i~~~I~~~m~~-----~gf~-----r~~~~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 75 --TTAIHQKVNEIMKR-----IGFS-----RYAVADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred --HHHHHHHHHHHHHH-----cCCe-----eeccCCCcCChhhhheeeeeeeeeeeC Confidence 56889999999843 2443 233445778999999999999999999 No 38 >protein:vir:105008 Length: 119 # NCBI annotation: conserved structural protein # Family: family:all:517 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459973;genbank:gi:85701388;genbank:GeneID:3882149 Probab=99.02 E-value=1.7e-12 Score=85.16 Aligned_cols=114 Identities=16% Similarity=0.355 Sum_probs=90.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--ccccC-CcccCCCCEEEeccceeeec--CCCcccceEEEEEEEEEECCCCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYDE-VPEPAPYPYVSFGSMTEFPE--DAHDRQGLSVTVVIHVWSKSPGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD~-vP~~a~~Pyv~iG~~~~~~~--~~~~~~~~~~~~~I~vws~~~g~ 86 (145) |.+ +.+-|+.+|++|+.++++++ .||+. +|++...|||++-+....|. ..+.....++++||||||..+ T Consensus 1 M~~----i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~~-- 74 (119) T protein:vir:10 1 MIN----LRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKSS-- 74 (119) T ss_pred CCc----hHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCCC-- Confidence 776 45778899999999999997 48885 78888899999976666554 344556789999999999853 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) ..+|..+|..+|.. .||. ..++....++|...+|-++||+..++- T Consensus 75 --~~~i~~~I~~~m~~-----~gf~-----r~~~~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 75 --TTAIHQKVNEIMKR-----IGFS-----RYAVADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred --HHHHHHHHHHHHHH-----cCCe-----eeccCCCcCChhhhheeeeeeeeeeeC Confidence 56889999999843 2443 233445778999999999999999999 No 39 >protein:vir:107581 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338192;genbank:gi:77020160;genbank:GeneID:3703712 Probab=99.02 E-value=1.7e-12 Score=85.16 Aligned_cols=114 Identities=16% Similarity=0.355 Sum_probs=90.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--ccccC-CcccCCCCEEEeccceeeec--CCCcccceEEEEEEEEEECCCCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYDE-VPEPAPYPYVSFGSMTEFPE--DAHDRQGLSVTVVIHVWSKSPGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD~-vP~~a~~Pyv~iG~~~~~~~--~~~~~~~~~~~~~I~vws~~~g~ 86 (145) |.+ +.+-|+.+|++|+.++++++ .||+. +|++...|||++-+....|. ..+.....++++||||||..+ T Consensus 1 M~~----i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~~-- 74 (119) T protein:vir:10 1 MIN----LRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKSS-- 74 (119) T ss_pred CCc----hHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCCC-- Confidence 776 45778899999999999997 48885 78888899999976666554 344556789999999999853 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) ..+|..+|..+|.. .||. ..++....++|...+|-++||+..++- T Consensus 75 --~~~i~~~I~~~m~~-----~gf~-----r~~~~d~ye~dt~lyhk~~Rf~~~~el 119 (119) T protein:vir:10 75 --TTAIHQKVNEIMKR-----IGFS-----RYAVADLYEEDTQIFHYAMRFAKGVEL 119 (119) T ss_pred --HHHHHHHHHHHHHH-----cCCe-----eeccCCCcCChhhhheeeeeeeeeeeC Confidence 56889999999843 2443 233445778999999999999999999 No 40 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=98.84 E-value=1.8e-11 Score=79.52 Aligned_cols=111 Identities=19% Similarity=0.165 Sum_probs=84.6 Q ss_pred HHHhhcChhhhhhhh-ccccC-CcccCCCCEEEeccceeeecCCCcc--cceEEEEEEEEEECCCCHHHHHHHHHHHHHH Q lcl|NC_014229. 24 YAKLVGHAPLTALVS-GVYDE-VPEPAPYPYVSFGSMTEFPEDAHDR--QGLSVTVVIHVWSKSPGFAEAHRIFAALDAA 99 (145) Q Consensus 24 ~~~L~~da~l~alv~-~IyD~-vP~~a~~Pyv~iG~~~~~~~~~~~~--~~~~~~~~I~vws~~~g~~ea~~I~~aV~~a 99 (145) .+.|.-+++|.++.+ |||.. .|++++.||+++-.....|.++-++ .....++||+||++. +.+|++++++|+.+ T Consensus 1 ~~~~~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA~t--~~~A~~l~~~v~~~ 78 (115) T protein:vir:10 1 MSVIVIRDALQGIGGAKGYLGVAPEKAPAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYAPT--FTDADRLADLAVDR 78 (115) T ss_pred CeeEEeehhhcccCCceeecccCCCCCCCCEEEEEeecCccccccCCCCCCcceEEEEEEeeCC--HHHHHHHHHHHHHH Confidence 455666777778886 79864 7999999999999999999887764 336899999999986 88999999999988 Q ss_pred hcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Q lcl|NC_014229. 100 LDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL 141 (145) Q Consensus 100 L~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~ 141 (145) +...+. ..++. .+..-+...|+|++.+|..++|.+-. T Consensus 79 ~~~~~~---~~~~~--~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 79 AMSVQD---RFSVG--GVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HhcCcc---ceeEe--eecCCCCCCcccccceeeEEEEEEeC Confidence 754331 12211 12223345779999999999999888 No 41 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=98.83 E-value=5.7e-11 Score=76.75 Aligned_cols=112 Identities=17% Similarity=0.151 Sum_probs=80.0 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-ccccC-CcccCC-CCEEEeccceeeecCCCccc---ceEEEEEEEEEECCCC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVYDE-VPEPAP-YPYVSFGSMTEFPEDAHDRQ---GLSVTVVIHVWSKSPG 85 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~-vP~~a~-~Pyv~iG~~~~~~~~~~~~~---~~~~~~~I~vws~~~g 85 (145) |+ +++.|++.|. ++.+ |||.. .|++++ +|||++-.....|.+.-+.. ...+++||+||+.. T Consensus 1 Ms-----~e~~l~a~L~------~~~~~RVyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t-- 67 (118) T protein:vir:10 1 MS-----YGRVLKDLLD------PVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRS-- 67 (118) T ss_pred Cc-----hHHHHHHHHh------hhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCccceeEEEEEEeeCC-- Confidence 43 4444555444 4443 79975 788877 59999999999988877664 34479999999986 Q ss_pred HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEec Q lcl|NC_014229. 86 FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLD 144 (145) Q Consensus 86 ~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~ 144 (145) +.+|++++.+|+.+|+...- + ..+..-....|+|++.++..++|.+--..- T Consensus 68 ~~~A~~l~~av~~al~~~~~------~--~~~~~~~d~ye~dt~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 68 KQEAYLATVQVLRLVSEAND------M--QVLSQPIDDYVREIKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHHHHHHHHhhhccc------c--eeccCCCccccccCCceEEEEEEEEeeecC Confidence 89999999999999975421 1 112223356788999998888887644443 No 42 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=98.83 E-value=5.5e-11 Score=76.85 Aligned_cols=112 Identities=16% Similarity=0.159 Sum_probs=80.0 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-ccccC-CcccCC-CCEEEeccceeeecCCCccc---ceEEEEEEEEEECCCC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVYDE-VPEPAP-YPYVSFGSMTEFPEDAHDRQ---GLSVTVVIHVWSKSPG 85 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~-vP~~a~-~Pyv~iG~~~~~~~~~~~~~---~~~~~~~I~vws~~~g 85 (145) |+ ++..|++.|. ++++ |||.. .|++++ +|||++-.....|....+.. ....++||+||+.. T Consensus 1 M~-----~e~~l~a~L~------~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~ldG~~~~~~~~rvQIdvyA~t-- 67 (118) T protein:vir:97 1 MS-----YGRMLKDLLD------PVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWKEGGMPDKVNARVQVQIWSRS-- 67 (118) T ss_pred Cc-----hHHHHHHHHh------hhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCCccceeEEEEEeeCC-- Confidence 43 4444555443 4444 79985 788877 69999999999999887664 34478999999986 Q ss_pred HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEec Q lcl|NC_014229. 86 FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLD 144 (145) Q Consensus 86 ~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~ 144 (145) +.+|++++.+|+.+|++.+.. . .+.......|+|++.++..++|.+-...- T Consensus 68 ~~~A~~l~~av~~al~~~~~~-~-------~~~~~~~~ye~dt~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 68 KQEAYLATVQVLRIVSEANDM-Q-------VLSQPIDDYVRELKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHHHHHHHHhhccccc-c-------cccCCcccccccCCceEEEEEEEEEeecC Confidence 899999999999999765311 1 12223345789999998887776644444 No 43 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=98.82 E-value=2.5e-11 Score=78.69 Aligned_cols=111 Identities=18% Similarity=0.156 Sum_probs=83.0 Q ss_pred HHHhhcChhhhhhhh-ccccC-CcccCCCCEEEeccceeeecCCCcc--cceEEEEEEEEEECCCCHHHHHHHHHHHHHH Q lcl|NC_014229. 24 YAKLVGHAPLTALVS-GVYDE-VPEPAPYPYVSFGSMTEFPEDAHDR--QGLSVTVVIHVWSKSPGFAEAHRIFAALDAA 99 (145) Q Consensus 24 ~~~L~~da~l~alv~-~IyD~-vP~~a~~Pyv~iG~~~~~~~~~~~~--~~~~~~~~I~vws~~~g~~ea~~I~~aV~~a 99 (145) .+.|.-+++|.++.+ |||.. .|++++.||+++-.....|.++-++ .....++||+||++. +.+|++++++|+.+ T Consensus 1 ~~~~~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA~t--~~~A~~l~~~v~~~ 78 (115) T protein:vir:14 1 MSVIVIRDALQGIGGAKGYLGVAPAKAPAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYAPT--FTDADRLADLAVDR 78 (115) T ss_pred CeeEeeehhhccccccccccccCCCCCCCCEEEEEeecCcccccccCCCCCcceEEEEEEeeCC--HHHHHHHHHHHHHH Confidence 444555566666764 79864 7999999999999999999887764 336899999999986 88999999999988 Q ss_pred hcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Q lcl|NC_014229. 100 LDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL 141 (145) Q Consensus 100 L~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~ 141 (145) +...+.. .+.. .+..-+...|+|++.+|..++|.+-. T Consensus 79 ~~~~~~~---~~~~--~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 79 AMSVQDR---FSVG--GVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HhcCccc---eeee--eecCCCCCCcccccceeeEEEEEEeC Confidence 8543311 1211 12223355779999999999999888 No 44 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=98.81 E-value=6.4e-11 Score=76.47 Aligned_cols=111 Identities=17% Similarity=0.167 Sum_probs=78.0 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhh-hccccC-CcccCC-CCEEEeccceeeecCCCccc-c--eEEEEEEEEEECCCC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALV-SGVYDE-VPEPAP-YPYVSFGSMTEFPEDAHDRQ-G--LSVTVVIHVWSKSPG 85 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv-~~IyD~-vP~~a~-~Pyv~iG~~~~~~~~~~~~~-~--~~~~~~I~vws~~~g 85 (145) |+ ++..|++.|. +++ +|||.. .|.+++ +|||++-.....|...-|.. . ...++||+||+.. T Consensus 1 Ms-----~e~~l~a~L~------~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t-- 67 (118) T protein:vir:81 1 MS-----YGRVLKDLLD------PVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRS-- 67 (118) T ss_pred Cc-----hHHHHHHHHH------hhcCCccccccCCCCCccCceEEEEecCCcccccccCCCCCccceeEEEEEeeCC-- Confidence 43 3444555443 444 479975 788877 59999999999998887664 2 3479999999986 Q ss_pred HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 86 FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 86 ~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +.+|++++.+|+.+|...+-. ..+.......|+|++.++..++ |.|-+++ T Consensus 68 ~~~A~~l~~av~~al~~~~~~--------~~~~~~~d~ye~dt~l~r~~~D--f~iw~~~ 117 (118) T protein:vir:81 68 KQEAYLATVQVLRLVSEAPDM--------QVLSQPIDDYVREIKLYGSRVD--VSMWYPI 117 (118) T ss_pred HHHHHHHHHHHHHHhhhccce--------eeccCCccccccccCceeEEEE--EEEEecC Confidence 899999999999999654311 1122233457889988876554 5555555 No 45 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=98.72 E-value=2e-10 Score=73.76 Aligned_cols=106 Identities=25% Similarity=0.343 Sum_probs=78.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhh-hccccC-Ccc-----cCCCCEEEeccceeeecCCCcc-cceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALV-SGVYDE-VPE-----PAPYPYVSFGSMTEFPEDAHDR-QGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv-~~IyD~-vP~-----~a~~Pyv~iG~~~~~~~~~~~~-~~~~~~~~I~vws~~ 83 (145) |+. ..|++.|. .++ ||||.. .|. ++++||+++-.....|.++-|+ .....++||+||++. T Consensus 1 M~e------~~i~~lL~------~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~~~l~gp~~~~~~vQIDvyA~t 68 (114) T protein:vir:93 1 MTE------ADLYPHLA------HLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVMGGQAESSVSVQIDVYAGT 68 (114) T ss_pred Cch------HHHHHHHH------hhcCcccccccCCcccCcCCccCceEEEEeccCcccccccCccccceEEEEEeeeCC Confidence 774 34666554 333 479975 565 3578999999999888877655 346679999999986 Q ss_pred CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Q lcl|NC_014229. 84 PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL 141 (145) Q Consensus 84 ~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~ 141 (145) +++|++++++|+.||.... +. . ....+. +|+|++.++..++|.+.+ T Consensus 69 --~~~A~~l~~~v~~Al~~~~-----~~-~---~~~~~~-ye~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 69 --VTQARQIRQDAREAIMLLA-----PG-S---VSEMQD-YIPENRCYRATLEFQVTV 114 (114) T ss_pred --HHHHHHHHHHHHHHHhhcC-----cE-e---ecCCCc-ccccccceeeEEEEEEeC Confidence 8899999999999995321 11 1 112222 689999999999999999 No 46 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=98.69 E-value=1.8e-10 Score=74.05 Aligned_cols=106 Identities=23% Similarity=0.310 Sum_probs=77.4 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhh-hccccC-CcccC------CCCEEEeccceeeecCCCccc-ceEEEEEEEEEEC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALV-SGVYDE-VPEPA------PYPYVSFGSMTEFPEDAHDRQ-GLSVTVVIHVWSK 82 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv-~~IyD~-vP~~a------~~Pyv~iG~~~~~~~~~~~~~-~~~~~~~I~vws~ 82 (145) |+.+ .|++.|. .++ ||||.. +|.+. ++|||++-.....|.++-|+. ....++||+||++ T Consensus 1 M~e~------~i~~lL~------~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~vQIDvyA~ 68 (115) T protein:vir:19 1 MNED------NIYALLS------PLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSADVLCGQAESRVSVQVDVYST 68 (115) T ss_pred Cchh------HHHHHHh------hhcCcccceeeccCCCCCCccccCCeEEEEeccCcccccccCCCccceEEEEEEeeC Confidence 8763 3565554 333 479985 67743 799999999988888876653 4678999999998 Q ss_pred CCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Q lcl|NC_014229. 83 SPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL 141 (145) Q Consensus 83 ~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~ 141 (145) . +++|++++++|+.||+... +. . .. -...+|+|+..|+..++|.+.- T Consensus 69 t--~~~A~~l~~~i~~Al~~~~-----p~-~---~~-~~~~ye~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 69 S--IAESRSLRDLVLASLEPLT-----PT-E---VV-KIPGYEPDYRLYRATLDFKVTP 115 (115) T ss_pred C--hHHHHHHHHHHHHHhhhcC-----CE-E---ec-CCCCcccchhceeeEEEEEecC Confidence 6 8899999999999995221 11 1 11 2244689999998888887776 No 47 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=98.64 E-value=2.3e-10 Score=73.46 Aligned_cols=110 Identities=15% Similarity=0.137 Sum_probs=79.7 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-cc-ccCCcccCCCCEEEeccceeeecCCCcccc--eEEEEEEEEEECCCCHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GV-YDEVPEPAPYPYVSFGSMTEFPEDAHDRQG--LSVTVVIHVWSKSPGFA 87 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~I-yD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~--~~~~~~I~vws~~~g~~ 87 (145) |+... |+ ++|.++.| ++ |...|++++.||+++-....+|..+-|+.. ...++||+||++. +. T Consensus 1 ~~~~~------i~------~~l~~~~g~~~~~~~aP~~~~~Py~vy~rvsg~p~~tL~G~~g~~~~r~QiD~yA~T--~~ 66 (114) T protein:vir:10 1 MSALT------IR------DAIGIVGGAKGYVSVASSAAQSPYYVVSRVSGTRDMALGGATGGKSGMFQIDVYAKT--YT 66 (114) T ss_pred Cceee------ee------hhhcccccccccCCCCCCCCCCceEEEEeccCcccccccCCCCcceEEEEEEeeeCC--HH Confidence 33221 22 34445555 45 567899999999999999999988876643 6889999999986 88 Q ss_pred HHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Q lcl|NC_014229. 88 EAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL 141 (145) Q Consensus 88 ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~ 141 (145) ||+++++++...+... +.+++. .+.+-+...|+|++.+|..++|.+-. T Consensus 67 eA~~La~~~~~~l~~~----~~f~~~--~l~~~~d~ye~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 67 EADSLADQIIDRVEST----GMFSVG--GVSDLPDDYSSDTGVFRVSLEISVQF 114 (114) T ss_pred HHHHHHHHHHhhcccc----cCeeee--ccccCCCCCCcccCceEEEEEEEEeC Confidence 9999998777666322 123322 24555667789999999888888777 No 48 >protein:vir:1274 Length: 162 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690766;genbank:gi:22855006;genbank:GeneID:955217 Probab=98.48 E-value=9.9e-10 Score=69.95 Aligned_cols=134 Identities=16% Similarity=0.242 Sum_probs=96.1 Q ss_pred CceEEEeeCCcccch--HHHHHHHHHHHhhcChhhhhhhh-ccccC-CcccCCCCEEEeccceeeecC--CCcccceEEE Q lcl|NC_014229. 1 MPLLAIWAGGDMATA--LPALQASVYAKLVGHAPLTALVS-GVYDE-VPEPAPYPYVSFGSMTEFPED--AHDRQGLSVT 74 (145) Q Consensus 1 ~~~~~~~~~~~M~~~--~~aLq~Ai~~~L~~da~l~alv~-~IyD~-vP~~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~ 74 (145) -....|-+..+-+.- ..-+.+.|++.|.+++.|..+++ +||.. +|++...|||++-+....|.+ -+.....+++ T Consensus 21 ~~~~~~~~~~~~~~~~~~mn~~k~v~q~L~n~~~L~~l~~~~i~~l~~~~~~~~p~Itf~e~~~~p~~yADD~e~ss~~~ 100 (162) T protein:vir:12 21 GACCGINGANTYSADQMTYSPKIELVSTLNSSAFLKGLTSGGIHNLVANDVSAFPRVVFSEIQDADADFADNEVYSFEVR 100 (162) T ss_pred CceeccccccccchhhhhhhHHHHHHHHhcChhHHHhhCCCceEEEeecCCCCceEEEEEeecCCCCcccccceeeEEEE Confidence 111122111111111 23578899999999999999997 57775 567889999999998888754 3345668999 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEE--EEEEec Q lcl|NC_014229. 75 VVIHVWSKSPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYR--VRLTLD 144 (145) Q Consensus 75 ~~I~vws~~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fr--a~~~~~ 144 (145) +||||||+.+-.....+|..+|..+|.. .|| ....+....++|...+|=+++|+ .-.|-| T Consensus 101 iQIDIwsk~st~~d~~~l~~~I~~lMk~-----~GF-----~R~s~~d~YE~DTklyHK~~RF~~~y~~E~~ 162 (162) T protein:vir:12 101 YQISIFTQASTRGKETAIASEIDRLMRE-----IGY-----SRYDSQDLYETDTKVFHKARRYKKTYYQEVN 162 (162) T ss_pred EEEEEeecCCcchhHHHHHHHHHHHHHH-----cCC-----EeecCCCCCCChhhhhhhhheeccceeeecC Confidence 9999999876566788999999999843 233 24445567889999999999997 444555 No 49 >protein:vir:2689 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075508;genbank:gi:12719437;genbank:GeneID:920159 Probab=97.99 E-value=1.5e-08 Score=63.56 Aligned_cols=117 Identities=15% Similarity=0.156 Sum_probs=86.5 Q ss_pred HHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) ..+-+-|++.|++|..|+.+++ +||- .+|+ +...|||+|-+....|.+ .......++.+||+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:26 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 5677889999999999999997 6755 4776 456899999988766544 2234457889999999987 66889 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +|..+|..+|-. -||. +....-..-|+|+..+|=..+||... -++ T Consensus 79 ~i~~~I~~~M~~-----~gf~----q~s~~~d~Yd~dtk~y~~arRYrg~~-~~~ 123 (131) T protein:vir:26 79 DITKRIRYLLYQ-----QNLI----QASSQLDAYFEETKRYVMSRRYQGIP-KNI 123 (131) T ss_pred HHHHHHHHHHHH-----cCce----eccCCCCccchhhHHhhhhhhccccc-hhh Confidence 999999999942 2232 22222233478999999999998866 222 No 50 >protein:vir:78648 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429947;genbank:gi:156604001;genbank:GeneID:5525394 Probab=97.99 E-value=1.5e-08 Score=63.56 Aligned_cols=117 Identities=15% Similarity=0.156 Sum_probs=86.5 Q ss_pred HHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) ..+-+-|++.|++|..|+.+++ +||- .+|+ +...|||+|-+....|.+ .......++.+||+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:78 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 5677889999999999999997 6755 4776 456899999988766544 2234457889999999987 66889 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +|..+|..+|-. -||. +....-..-|+|+..+|=..+||... -++ T Consensus 79 ~i~~~I~~~M~~-----~gf~----q~s~~~d~Yd~dtk~y~~arRYrg~~-~~~ 123 (131) T protein:vir:78 79 DITKRIRYLLYQ-----QNLI----QASSQLDAYFEETKRYVMSRRYQGIP-KNI 123 (131) T ss_pred HHHHHHHHHHHH-----cCce----eccCCCCccchhhHHhhhhhhccccc-hhh Confidence 999999999942 2232 22222233478999999999998866 222 No 51 >protein:vir:96972 Length: 131 # NCBI annotation: ORF035 # Family: family:all:508 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239865;genbank:gi:66395543;genbank:GeneID:5133005 Probab=97.99 E-value=1.5e-08 Score=63.56 Aligned_cols=117 Identities=15% Similarity=0.156 Sum_probs=86.5 Q ss_pred HHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) ..+-+-|++.|++|..|+.+++ +||- .+|+ +...|||+|-+....|.+ .......++.+||+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:96 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 5677889999999999999997 6755 4776 456899999988766544 2234457889999999987 66889 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +|..+|..+|-. -||. +....-..-|+|+..+|=..+||... -++ T Consensus 79 ~i~~~I~~~M~~-----~gf~----q~s~~~d~Yd~dtk~y~~arRYrg~~-~~~ 123 (131) T protein:vir:96 79 DITKRIRYLLYQ-----QNLI----QASSQLDAYFEETKRYVMSRRYQGIP-KNI 123 (131) T ss_pred HHHHHHHHHHHH-----cCce----eccCCCCccchhhHHhhhhhhccccc-hhh Confidence 999999999942 2232 22222233478999999999998866 222 No 52 >protein:vir:9364 Length: 131 # NCBI annotation: SLT orf 131b-like protein # Family: family:all:508 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803342;genbank:gi:29028653;genbank:GeneID:1258094 Probab=97.99 E-value=1.5e-08 Score=63.56 Aligned_cols=117 Identities=15% Similarity=0.156 Sum_probs=86.5 Q ss_pred HHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) ..+-+-|++.|++|..|+.+++ +||- .+|+ +...|||+|-+....|.+ .......++.+||+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:93 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 5677889999999999999997 6755 4776 456899999988766544 2234457889999999987 66889 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +|..+|..+|-. -||. +....-..-|+|+..+|=..+||... -++ T Consensus 79 ~i~~~I~~~M~~-----~gf~----q~s~~~d~Yd~dtk~y~~arRYrg~~-~~~ 123 (131) T protein:vir:93 79 DITKRIRYLLYQ-----QNLI----QASSQLDAYFEETKRYVMSRRYQGIP-KNI 123 (131) T ss_pred HHHHHHHHHHHH-----cCce----eccCCCCccchhhHHhhhhhhccccc-hhh Confidence 999999999942 2232 22222233478999999999998866 222 No 53 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=97.92 E-value=9.1e-08 Score=59.17 Aligned_cols=111 Identities=18% Similarity=0.161 Sum_probs=74.2 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-ccccC-CcccCCCCEEEeccceeeecCCCccc--ceEEEEEEEEEECCCCHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVYDE-VPEPAPYPYVSFGSMTEFPEDAHDRQ--GLSVTVVIHVWSKSPGFA 87 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~-vP~~a~~Pyv~iG~~~~~~~~~~~~~--~~~~~~~I~vws~~~g~~ 87 (145) |+-- - |+++| .++-+ +.|-. .|++++.|||++-....-+..+-|+. ++..++||+||.+. +. T Consensus 1 ~~~~--v----ir~al------~~i~~~~~~~~vAp~~~~~pyivy~rvsga~e~~L~G~ag~~~~~~QID~yA~T--~~ 66 (115) T protein:vir:80 1 MSVI--V----VRDAL------QGIGGAKGYLGVAPEKAPARYFVVTRVHGALDMALAGPTGGRSGSYQIDCYAPT--FT 66 (115) T ss_pred Ceee--e----eechh------hhccccccceeeccccCcCCeEEEeecCCCccccccCCCCCceeEEEEeeecCC--HH Confidence 2211 1 12222 22322 46654 68999999999999988888877664 58899999999986 88 Q ss_pred HHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Q lcl|NC_014229. 88 EAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL 141 (145) Q Consensus 88 ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~ 141 (145) ++++++++++.++-..+-. ++... +..-+...++|+..++..++|.+-. T Consensus 67 ea~~La~~v~d~~~~~~~~---~~vg~--l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 67 DADRLADLAVDRAMSVQDR---FSVGG--VDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred HHHHHHHHHHHhhhCCccc---cceec--ccCCCcccccccceEEEEEEEEEeC Confidence 9999999999977543322 22111 2233456778998887777665555 No 54 >protein:vir:93902 Length: 131 # NCBI annotation: ORF029 # Family: family:all:508 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239943;genbank:gi:66395617;genbank:GeneID:5130968 Probab=97.90 E-value=2.3e-08 Score=62.49 Aligned_cols=118 Identities=15% Similarity=0.137 Sum_probs=86.9 Q ss_pred HHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) ..+-+-|++.|++|..|+.+++ +||- .+|+ +...|||+|-+....|.+ .......+..+||+|||.. +..++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDV~s~~--~~~~~ 78 (131) T protein:vir:93 1 MNILNTIKEILLSDAELQTYINSRIYYYKVTENAETSKPFVVITPIYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEEecCCccccccceEEEeeCCCCcccccCCceeeeEEEEEEEEEecC--ccchH Confidence 5677889999999999999997 5654 4776 456899999988877654 2334557889999999976 77888 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +|..+|..+|-.. ||. +....-..-|+|+..+|=..+||...-..= T Consensus 79 ~i~~~I~~~M~~~-----gf~----q~s~~~d~Yd~dtk~y~~arRYrg~~~~~y 124 (131) T protein:vir:93 79 DITKRIRYLLYQQ-----NLI----QASSQLDAYFEETKRYVMSRRYQGIPKNIY 124 (131) T ss_pred HHHHHHHHHHHHc-----Cce----eccCCCCccchhHHHhhhhhhhccchhhhh Confidence 9999999999422 332 222222334789999999999987443322 No 55 >protein:vir:94418 Length: 131 # NCBI annotation: ORF029 # Family: family:all:508 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240011;genbank:gi:66395684;genbank:GeneID:5133078 Probab=97.89 E-value=2.3e-08 Score=62.49 Aligned_cols=118 Identities=15% Similarity=0.137 Sum_probs=86.6 Q ss_pred HHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) ..+-+-|++.|++|..|+.+++ +||- .+|+ +...|||+|-+....|.+ .......+..+||+|||.. +..++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDV~s~~--~~~~~ 78 (131) T protein:vir:94 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPIYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEEecCCccccccceEEEeeCCCCcccccCCceeeeEEEEEEEEEecC--ccchH Confidence 5677889999999999999997 6654 4776 456899999988876654 2334457889999999987 77888 Q ss_pred HHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +|..+|...|-. -||. +....-..-|+|+..+|=..+||...-..= T Consensus 79 ~i~~~I~~~M~~-----~gf~----q~s~~~d~Yd~dtk~y~~arRYrg~~~~~y 124 (131) T protein:vir:94 79 DITKRIRYLLYQ-----QNLI----QASSQLDAYFEETKRYVMSRRYQGIPKNIY 124 (131) T ss_pred HHHHHHHHHHHH-----cCce----eccCCCCccchhHHHhhhhhhhccchhhhh Confidence 999999999942 2222 222222334789999999999987443322 No 56 >protein:vir:78349 Length: 127 # NCBI annotation: gp10 # Family: family:all:508 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468649;genbank:gi:157325227;genbank:GeneID:5601695 Probab=97.81 E-value=1.3e-07 Score=58.32 Aligned_cols=119 Identities=13% Similarity=0.110 Sum_probs=82.9 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-cccc-CCcc--cCCCCEEEecccee-eec--CCCcccceEEEEEEEEEECCC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVYD-EVPE--PAPYPYVSFGSMTE-FPE--DAHDRQGLSVTVVIHVWSKSP 84 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD-~vP~--~a~~Pyv~iG~~~~-~~~--~~~~~~~~~~~~~I~vws~~~ 84 (145) |.++. +.|+.+|.+|..|..+++ +||= .+|+ +...|||+|-+... .|. .....-..+..+||+||+.. T Consensus 1 M~d~l----~~iy~~L~~d~~l~~~~~~~I~~~~~Pe~~d~~~p~I~I~~i~~p~p~~yadn~~l~~~~~~QIDV~s~~- 75 (127) T protein:vir:78 1 MIDIL----NVIYTTLSKNDIIHTTCEERIKYYDFPGTGDSTKTFLLIIPLDVPIPTNFSSNESRMEDFLVQIDVQSND- 75 (127) T ss_pred CcchH----HHHHHHhhcchhhhhhcCCceEEEecCCCccccCcEEEEeeCCCCCCCcccCCccceeEEEEEEEEEEcC- Confidence 88765 457788889988877665 6765 4776 56789999998864 233 33444568899999999876 Q ss_pred CHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 85 GFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 85 g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +.+.++|..+|..+|-. -||. +....-...++|+..+|=.-+||...-..= T Consensus 76 -r~~~~~i~~~I~~~M~~-----~gf~----q~s~~~d~Y~~dtk~y~~arRYrg~~~~~y 126 (127) T protein:vir:78 76 -RLIVKKIQDEVRKEMKQ-----IGFG----QLAGGLDEYFPETGRFVDARKYSGLPYKLY 126 (127) T ss_pred -CCchHHHHHHHHHHHHH-----cCce----eccCCCCccchhhhhhhheeeeeecccccc Confidence 77899999999999942 1221 122111234788888998899988433222 No 57 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=97.68 E-value=1.6e-06 Score=52.41 Aligned_cols=129 Identities=21% Similarity=0.244 Sum_probs=85.9 Q ss_pred CceEEEeeCCcccchHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEE Q lcl|NC_014229. 1 MPLLAIWAGGDMATALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHV 79 (145) Q Consensus 1 ~~~~~~~~~~~M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~v 79 (145) ||-+. |-|+..-+-.-|.++|.+ +..+ .|+..+|.+-+..||++..+- -...+..-...++.|+| T Consensus 1 ~~~i~------~pda~~v~~~~lr~~l~a-----~~~~V~V~t~vP~~RP~rfV~Vertg---G~~~~~~~Dr~~L~Vq~ 66 (131) T protein:vir:98 1 MPPIL------MPDAVAVIAGYLRAVLVA-----RGVTVPVGSRVPSPRPARFVRIERIG---GPANTVVTDRPRLDVHC 66 (131) T ss_pred CCCcc------CCchhHHHHHHHHHHHHh-----cCCceEecccCCCCCCceEEEEEecC---CCcCCccccceEEEEEe Confidence 88442 456554444455556653 1222 699999999999999996551 11223345788999999 Q ss_pred EECCCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 80 WSKSPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 80 ws~~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) |... ..+|.+++..+++.|-..+....-..+.......-..++|||++..+=..+.+++++-++ T Consensus 67 W~~t--~~~A~~La~~vr~~ll~~~~~~g~~~~~~~e~~gpy~~PD~es~~~Ryq~tv~l~~r~~~ 130 (131) T protein:vir:98 67 WGSS--EEDAHDLMQLCRALLGAARGSHGDTVLARPATGGPQFLPDAETGAARWAFTLDITMRGHA 130 (131) T ss_pred cCCC--HHHHHHHHHHHHHHHhhcccccchheeccccCCCCCcCCCCCCCCceeEEEEEEEeeecc Confidence 9875 889999999999987533322211222222333445678888877777888889998888 No 58 >protein:vir:96002 Length: 133 # NCBI annotation: ORF024 # Family: family:all:508 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239806;genbank:gi:66395472;genbank:GeneID:5132919 Probab=97.61 E-value=3.8e-07 Score=55.80 Aligned_cols=121 Identities=17% Similarity=0.171 Sum_probs=85.5 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--cccc-CCcc--cCCCCEEEecccee-e--ecCCCcccceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYD-EVPE--PAPYPYVSFGSMTE-F--PEDAHDRQGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD-~vP~--~a~~Pyv~iG~~~~-~--~~~~~~~~~~~~~~~I~vws~~ 83 (145) |.+.. +-|+..|++|..|+.+++ +|+= ..|+ +..-|||+|-+... . .......-..+..+||+||+.. T Consensus 1 m~diL----~eIy~~L~~d~~L~~~v~~~~Ik~~~~Pe~~d~~~p~IvI~pi~~p~p~~f~sn~~ls~~~~~QIDV~sk~ 76 (133) T protein:vir:96 1 MIDIL----MEVYNILKSDDDLMRLIDKKNIKFNQYPDVKDKMAPYIVIDDYDDPIPEWHSDGDRIAYNYAFQIDVMVKA 76 (133) T ss_pred CcchH----HHHHHHhhcchHHHHhcCccceEEeecCCccccccceEEEecCCCCCcccccCcceeeeEEEEEEeeeeec Confidence 77755 457888999999999997 3643 4665 45679999998777 3 3345555678899999999976 Q ss_pred ----CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 84 ----PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 84 ----~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .+|...++|..+|...|-.. |+. +..+.-..-|+|+..+|-.=+||...=.+= T Consensus 77 ~~~~~~R~~~~~i~~rI~~~m~~~-----gf~----Q~~~~~deYd~et~~y~~aRRYrg~~Y~~y 133 (133) T protein:vir:96 77 SDAYNARKRRNEISNRISELLWKN-----QMK----QIRNLGNEYDKNLALYRSTRRYEAIFYENY 133 (133) T ss_pred cccccchhhhHHHHHHHHHHHHHc-----Cce----ecCCCccccchhhhhhhhhheeeccccccC Confidence 57899999999999999321 322 122222334677777777777777644444 No 59 >protein:vir:98343 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918935;genbank:gi:119443697;genbank:GeneID:4594505 Probab=97.32 E-value=2.5e-06 Score=51.31 Aligned_cols=119 Identities=10% Similarity=0.077 Sum_probs=79.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecC-C-CcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPED-A-HDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~-~-~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |.+-...|.+++...|..+. |..+.-+|.+.-=.+..-|||++-+-...|.. + +.....++++||+||...+... T Consensus 1 ~~~~~k~l~~~~I~~li~~~-L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk~d~~-- 77 (126) T protein:vir:98 1 MINVTKLIRNAIIANNITDE-VNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEPN-- 77 (126) T ss_pred CccchhhhhhhHHHHhhhhh-hhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecCCCHH-- Confidence 88877778777777666532 22222234443335677899999888777654 2 3345688999999954333333 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEE-----ec Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLT-----LD 144 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~-----~~ 144 (145) ++...|..+|-. .|| ....+....++|+..||=++|||..+- +| T Consensus 78 -~l~~~V~~lMk~-----~GF-----~r~~~~dlYE~DtklyHk~~RF~~~~~~~~~~~~ 126 (126) T protein:vir:98 78 -EQAEKIVELLKV-----INF-----QCYYREPLYESDVMSFRHIIRAKGSILSMKLEEN 126 (126) T ss_pred -HHHHHHHHHHHH-----cCC-----eeeecCCCccchhhhheeeeeeeeeecceeeccC Confidence 367778888732 344 355566789999999999999987653 34 No 60 >protein:vir:9415 Length: 126 # NCBI annotation: phi PVL orf 12-like protein # Family: family:all:517 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803393;genbank:gi:29028705;genbank:GeneID:1258143 Probab=97.32 E-value=2.5e-06 Score=51.31 Aligned_cols=119 Identities=10% Similarity=0.077 Sum_probs=79.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecC-C-CcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPED-A-HDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~-~-~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |.+-...|.+++...|..+. |..+.-+|.+.-=.+..-|||++-+-...|.. + +.....++++||+||...+... T Consensus 1 ~~~~~k~l~~~~I~~li~~~-L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk~d~~-- 77 (126) T protein:vir:94 1 MINVTKLIRNAIIANNITDE-VNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEPN-- 77 (126) T ss_pred CccchhhhhhhHHHHhhhhh-hhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecCCCHH-- Confidence 88877778777777666532 22222234443335677899999888777654 2 3345688999999954333333 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEE-----ec Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLT-----LD 144 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~-----~~ 144 (145) ++...|..+|-. .|| ....+....++|+..||=++|||..+- +| T Consensus 78 -~l~~~V~~lMk~-----~GF-----~r~~~~dlYE~DtklyHk~~RF~~~~~~~~~~~~ 126 (126) T protein:vir:94 78 -EQAEKIVELLKV-----INF-----QCYYREPLYESDVMSFRHIIRAKGSILSMKLEEN 126 (126) T ss_pred -HHHHHHHHHHHH-----cCC-----eeeecCCCccchhhhheeeeeeeeeecceeeccC Confidence 367778888732 344 355566789999999999999987653 34 No 61 >protein:vir:101303 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908836;genbank:gi:118725100;genbank:GeneID:4555874 Probab=97.30 E-value=1.5e-06 Score=52.54 Aligned_cols=120 Identities=15% Similarity=0.117 Sum_probs=78.9 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--cccc-CCcc--cCCCCEEEecccee-ee--cCCCcccceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYD-EVPE--PAPYPYVSFGSMTE-FP--EDAHDRQGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD-~vP~--~a~~Pyv~iG~~~~-~~--~~~~~~~~~~~~~~I~vws~~ 83 (145) |.+.. +-|+..|++|..|+.+++ +|+= ..|+ +..-|||+|-+... .| ......-..+..+||+||+.. T Consensus 1 m~diL----~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~~~p~IvI~pl~~p~p~~~~sn~~ls~~~~~QIDV~~k~ 76 (135) T protein:vir:10 1 MIDIL----YKVHEVISQDRIIREHVNINNIKFNKYPNVKDTDVPFIVIDDIDDPIPTTYTDGDECAYSYIVQIDVFVKY 76 (135) T ss_pred CcchH----HHHHHHhhcchHHHhhcCccceEEEecCCccccccceEEEecCCCCCCccccCchhceeeeeEEEeeeeec Confidence 77755 457888999999999998 6754 4665 45679999988775 34 334455668899999999976 Q ss_pred C----CHHHHHHHHHHHHHHh-cC-CCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEE-EEEecC Q lcl|NC_014229. 84 P----GFAEAHRIFAALDAAL-DR-VPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRV-RLTLDS 145 (145) Q Consensus 84 ~----g~~ea~~I~~aV~~aL-~~-~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra-~~~~~~ 145 (145) . +|.++++|..+|+..| .. .--.++|+ -...++|+..++-.-+||. ..+++| T Consensus 77 ~~~~~~R~~~~~i~~~I~~~l~~~~~f~q~s~~----------ldeY~~et~~y~~aRRYrG~~Y~~e~ 135 (135) T protein:vir:10 77 NDEYNARIIRNKISNRIQKLLWSELKMGNVSNG----------KPEYIEEFKTYRSSRVYEGIFYKEEN 135 (135) T ss_pred ccccchhhHHHHHHHHHHHHHHHHcCccccCCC----------CccchhhhhhhhhhheeeeecccCCC Confidence 3 4889999999999999 22 22222221 0112344444444444443 356666 No 62 >protein:vir:9514 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835561;genbank:gi:30043946;genbank:GeneID:1260543 Probab=97.30 E-value=1.5e-06 Score=52.54 Aligned_cols=120 Identities=15% Similarity=0.117 Sum_probs=78.9 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--cccc-CCcc--cCCCCEEEecccee-ee--cCCCcccceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYD-EVPE--PAPYPYVSFGSMTE-FP--EDAHDRQGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD-~vP~--~a~~Pyv~iG~~~~-~~--~~~~~~~~~~~~~~I~vws~~ 83 (145) |.+.. +-|+..|++|..|+.+++ +|+= ..|+ +..-|||+|-+... .| ......-..+..+||+||+.. T Consensus 1 m~diL----~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~~~p~IvI~pl~~p~p~~~~sn~~ls~~~~~QIDV~~k~ 76 (135) T protein:vir:95 1 MIDIL----YKVHEVISQDRIIREHVNINNIKFNKYPNVKDTDVPFIVIDDIDDPIPTTYTDGDECAYSYIVQIDVFVKY 76 (135) T ss_pred CcchH----HHHHHHhhcchHHHhhcCccceEEEecCCccccccceEEEecCCCCCCccccCchhceeeeeEEEeeeeec Confidence 77755 457888999999999998 6754 4665 45679999988775 34 334455668899999999976 Q ss_pred C----CHHHHHHHHHHHHHHh-cC-CCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEE-EEEecC Q lcl|NC_014229. 84 P----GFAEAHRIFAALDAAL-DR-VPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRV-RLTLDS 145 (145) Q Consensus 84 ~----g~~ea~~I~~aV~~aL-~~-~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra-~~~~~~ 145 (145) . +|.++++|..+|+..| .. .--.++|+ -...++|+..++-.-+||. ..+++| T Consensus 77 ~~~~~~R~~~~~i~~~I~~~l~~~~~f~q~s~~----------ldeY~~et~~y~~aRRYrG~~Y~~e~ 135 (135) T protein:vir:95 77 NDEYNARIIRNKISNRIQKLLWSELKMGNVSNG----------KPEYIEEFKTYRSSRVYEGIFYKEEN 135 (135) T ss_pred ccccchhhHHHHHHHHHHHHHHHHcCccccCCC----------CccchhhhhhhhhhheeeeecccCCC Confidence 3 4889999999999999 22 22222221 0112344444444444443 356666 No 63 >protein:vir:100675 Length: 135 # NCBI annotation: 77ORF027 # Family: family:all:508 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958611;genbank:gi:41189540;genbank:GeneID:2743821 Probab=97.30 E-value=1.5e-06 Score=52.54 Aligned_cols=120 Identities=15% Similarity=0.117 Sum_probs=78.9 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--cccc-CCcc--cCCCCEEEecccee-ee--cCCCcccceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYD-EVPE--PAPYPYVSFGSMTE-FP--EDAHDRQGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD-~vP~--~a~~Pyv~iG~~~~-~~--~~~~~~~~~~~~~~I~vws~~ 83 (145) |.+.. +-|+..|++|..|+.+++ +|+= ..|+ +..-|||+|-+... .| ......-..+..+||+||+.. T Consensus 1 m~diL----~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~~~p~IvI~pl~~p~p~~~~sn~~ls~~~~~QIDV~~k~ 76 (135) T protein:vir:10 1 MIDIL----YKVHEVISQDRIIREHVNINNIKFNKYPNVKDTDVPFIVIDDIDDPIPTTYTDGDECAYSYIVQIDVFVKY 76 (135) T ss_pred CcchH----HHHHHHhhcchHHHhhcCccceEEEecCCccccccceEEEecCCCCCCccccCchhceeeeeEEEeeeeec Confidence 77755 457888999999999998 6754 4665 45679999988775 34 334455668899999999976 Q ss_pred C----CHHHHHHHHHHHHHHh-cC-CCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEE-EEEecC Q lcl|NC_014229. 84 P----GFAEAHRIFAALDAAL-DR-VPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRV-RLTLDS 145 (145) Q Consensus 84 ~----g~~ea~~I~~aV~~aL-~~-~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra-~~~~~~ 145 (145) . +|.++++|..+|+..| .. .--.++|+ -...++|+..++-.-+||. ..+++| T Consensus 77 ~~~~~~R~~~~~i~~~I~~~l~~~~~f~q~s~~----------ldeY~~et~~y~~aRRYrG~~Y~~e~ 135 (135) T protein:vir:10 77 NDEYNARIIRNKISNRIQKLLWSELKMGNVSNG----------KPEYIEEFKTYRSSRVYEGIFYKEEN 135 (135) T ss_pred ccccchhhHHHHHHHHHHHHHHHHcCccccCCC----------CccchhhhhhhhhhheeeeecccCCC Confidence 3 4889999999999999 22 22222221 0112344444444444443 356666 No 64 >protein:vir:1387 Length: 116 # NCBI annotation: Gp10 protein # Family: family:all:517 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612839;genbank:gi:20065973;genbank:GeneID:935788 Probab=97.21 E-value=1.3e-06 Score=52.85 Aligned_cols=113 Identities=17% Similarity=0.186 Sum_probs=84.2 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCccc-CCCCEEEeccceeeecC--CCcccceEEEEEEEEEECCCCHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEP-APYPYVSFGSMTEFPED--AHDRQGLSVTVVIHVWSKSPGFAE 88 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~-a~~Pyv~iG~~~~~~~~--~~~~~~~~~~~~I~vws~~~g~~e 88 (145) |- ...+.+-|.+.|+ .+-..|+.....+ ...|||++-+-...|.. -+.....++++||+|||.. ..+ T Consensus 1 ~~--~m~I~~~i~~~Lk------~i~ipV~~~~y~~~~~~~~Itf~~y~e~~~~yaDd~e~~t~~~iQVDI~sk~--~~~ 70 (116) T protein:vir:13 1 ME--DFDIIALVYECLE------CLNVPVIEGWYDEELNKTHITVHEYLEQDESFEDDEAREEEHNIQIDVWSKD--SLE 70 (116) T ss_pred CC--ccchhHHHHHHHh------hcCCeeeecccCCCCccceEEEEeeecCCCcccCCeeeeEEEEEEEEEeecC--Ccc Confidence 32 2357777888886 2223677775554 46899999888888774 2344668899999999986 557 Q ss_pred HHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEec Q lcl|NC_014229. 89 AHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLD 144 (145) Q Consensus 89 a~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~ 144 (145) +.+|..+|..+|.. .||. ...+....++|+..+|=++||....|-| T Consensus 71 ~~~l~~~V~~lMk~-----~GF~-----r~~~~d~ye~dt~iyhk~~RF~y~~el~ 116 (116) T protein:vir:13 71 AFKLKKAIKKLLKK-----NNFY-----FDSSEDFYETKTRIYHKGLRFSYISEIS 116 (116) T ss_pred HHHHHHHHHHHHHH-----cCCE-----eeecCCCccchhhhhhhhhhheeeeecC Confidence 77899999999943 3433 4445667899999999999999999999 No 65 >protein:vir:81093 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429879;genbank:gi:156603932;genbank:GeneID:5525313 Probab=97.14 E-value=4.1e-06 Score=50.14 Aligned_cols=119 Identities=9% Similarity=0.072 Sum_probs=79.2 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCC--CcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDA--HDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~--~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |.+....|.+++.+.|..+. |..+--+|.+.-=.+..-|||++-+-...|..- +.....++++||+||+..+.. T Consensus 1 ~~~~~~~i~n~~I~~li~~~-Lk~~nvPV~~~~y~~~~ktyItf~ey~~~~~~yADd~e~~t~~~iQIDIW~sk~~~--- 76 (126) T protein:vir:81 1 MINVTELIRNAIIANNITDE-VNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEP--- 76 (126) T ss_pred CcchHHhhhhhHHHhhhhhc-eeeccceeccccccCCCCcEEEEEeecCCCCccccCeeeeeEEEEEEEEeeCCCCH--- Confidence 88888888888877666522 221111333332356777999998887776542 234568899999999544443 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE-----Eec Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL-----TLD 144 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~-----~~~ 144 (145) .++...|...|-. .|| ....+....++|+..+|=++|||..+ ++| T Consensus 77 ~~l~~~V~~~Mk~-----~GF-----~R~~~~d~YE~DtklyHk~~Rf~~~~~~~~~~~~ 126 (126) T protein:vir:81 77 NEQAEKIVELLKV-----INF-----QCYYREPLYESDVMSFRHIIRAKGSILSMKLEEN 126 (126) T ss_pred HHHHHHHHHHHHH-----cCC-----eeeecCCCccchhhhhheeeeeeeeccceeeccC Confidence 4566678777732 244 34556677899999999999998765 344 No 66 >protein:vir:80001 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430007;genbank:gi:156604062;genbank:GeneID:5525461 Probab=97.14 E-value=4.1e-06 Score=50.14 Aligned_cols=119 Identities=9% Similarity=0.072 Sum_probs=79.2 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCC--CcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDA--HDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~--~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |.+....|.+++.+.|..+. |..+--+|.+.-=.+..-|||++-+-...|..- +.....++++||+||+..+.. T Consensus 1 ~~~~~~~i~n~~I~~li~~~-Lk~~nvPV~~~~y~~~~ktyItf~ey~~~~~~yADd~e~~t~~~iQIDIW~sk~~~--- 76 (126) T protein:vir:80 1 MINVTELIRNAIIANNITDE-VNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEP--- 76 (126) T ss_pred CcchHHhhhhhHHHhhhhhc-eeeccceeccccccCCCCcEEEEEeecCCCCccccCeeeeeEEEEEEEEeeCCCCH--- Confidence 88888888888877666522 221111333332356777999998887776542 234568899999999544443 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE-----Eec Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRL-----TLD 144 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~-----~~~ 144 (145) .++...|...|-. .|| ....+....++|+..+|=++|||..+ ++| T Consensus 77 ~~l~~~V~~~Mk~-----~GF-----~R~~~~d~YE~DtklyHk~~Rf~~~~~~~~~~~~ 126 (126) T protein:vir:80 77 NEQAEKIVELLKV-----INF-----QCYYREPLYESDVMSFRHIIRAKGSILSMKLEEN 126 (126) T ss_pred HHHHHHHHHHHHH-----cCC-----eeeecCCCccchhhhhheeeeeeeeccceeeccC Confidence 4566678777732 244 34556677899999999999998765 344 No 67 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=96.77 E-value=3.3e-05 Score=45.13 Aligned_cols=109 Identities=15% Similarity=0.193 Sum_probs=75.4 Q ss_pred HHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHHHHHHHH Q lcl|NC_014229. 18 ALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHRIFAALD 97 (145) Q Consensus 18 aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~I~~aV~ 97 (145) =+..-|...|.++.++ ++|-.+|++.+-+||++..+-.. +.......++.++||+.. +.+|.+++..|+ T Consensus 1 miE~~i~~~L~~~l~V-----pv~~e~p~~~P~~FV~vErtGG~----~~~~~~~~~lAVq~w~~S--~~eAa~La~~v~ 69 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSV-----SSFLEKKGEMPLSYILFEKTGSS----KSNHLLSSTFAFQSYAPS--MYEAAKLNEQLK 69 (111) T ss_pred ChHHhHHHHHhhcCCc-----eeEeecCCCCCCceEEEEecCCc----cccccccceEEEEecchh--HHHHHHHHHHHH Confidence 2334477778876654 58989999999999999655443 333557889999999875 779999999999 Q ss_pred HHhcCCCCccCCceEEEEEEeeeeeeecCCCce--EEEEEEEEEE Q lcl|NC_014229. 98 AALDRVPLTVAGCTDVSIKHSNHQALKDPEPGV--RHINAEYRVR 140 (145) Q Consensus 98 ~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~--~hg~l~fra~ 140 (145) .+|.+. ..++ ++..+.+.+-=.++|++++. |..++++..- T Consensus 70 ~~l~~l-~~~~--~I~av~~~s~ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:16 70 EVVERL-IELN--EISNVSLNSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred HHHhhc-cccc--cceeeecCCCCcCCCCCCCCceEEEEEEEeeC Confidence 999544 2344 45566666666667776543 4444433333 No 68 >protein:vir:9709 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:2110 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795471;genbank:gi:28876220;genbank:GeneID:1257764 Probab=96.69 E-value=1.3e-05 Score=47.36 Aligned_cols=118 Identities=19% Similarity=0.328 Sum_probs=82.9 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhc-------------ccc-CCcccC-------CCCEEEeccceeeecC-CC-ccc Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSG-------------VYD-EVPEPA-------PYPYVSFGSMTEFPED-AH-DRQ 69 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~-------------IyD-~vP~~a-------~~Pyv~iG~~~~~~~~-~~-~~~ 69 (145) +.|. +-|++.|.++..|.+|++. ||- .+|+.+ ..|+|.|-+....+.. ++ ... T Consensus 1 mlp~----~~vy~~L~~n~~L~~lm~~~r~~~~~~~~~~~If~~~vPE~~~~~qk~~~aP~IrI~~i~~~~~~yADn~~~ 76 (141) T protein:vir:97 1 MIAE----TTAYKLLSNDKTLNELLDKLRGGPFKNGFKQGIFTYDIPDNPIDLRKAELAPFMRIKTTLDGPADYADDEIL 76 (141) T ss_pred CchH----HHHHHHhcccHHHHHHHhhhccccccccccccccccccCCChhhhhhhccCCeEEEeccCCCcccccccccc Confidence 2333 2488999999999999862 665 578763 5899999888776554 33 345 Q ss_pred ceEEEEEEEEEECCCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEe Q lcl|NC_014229. 70 GLSVTVVIHVWSKSPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTL 143 (145) Q Consensus 70 ~~~~~~~I~vws~~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~ 143 (145) ..+.+++|++|++. ..+..+|...|..+|.. .||.. ......+...|||...+|...+||.---- T Consensus 77 ~~~~~vQIdiW~~~--~~~~e~i~~~Id~~M~~-----~gf~r--Y~~~~~~~~~dpD~d~~~~~rRYr~~~~~ 141 (141) T protein:vir:97 77 CNEQRITINFWCKT--ASEADQINKCIDNILKQ-----GGFER--YTANEKPRYKDSDIDLLMNVRKYRCFDFY 141 (141) T ss_pred eeeeeeEeeeeecC--hhHHHHHHHHHHHHHHh-----cCcee--ccccCCCCCCccchhhhhhhhheeeeccC Confidence 68899999999995 56888899999999953 23321 11223456788998888888877643222 No 69 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=96.64 E-value=4.6e-05 Score=44.37 Aligned_cols=109 Identities=16% Similarity=0.194 Sum_probs=76.0 Q ss_pred HHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHHHHHHHH Q lcl|NC_014229. 18 ALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHRIFAALD 97 (145) Q Consensus 18 aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~I~~aV~ 97 (145) =+..-|.+.|.++.++ ++|-.+|++.+-+||++..+-.. +.......++.|+||+.. +.+|.+++..|+ T Consensus 1 miE~~v~~~L~~~l~v-----pv~~e~p~~~p~~FV~vErtGG~----~~~~~~~~~lAVQ~~~~S--~~eAa~La~~v~ 69 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSV-----SSFLEKKGEMPLSYVLFEKTGSS----KSNHLLSSTFAFQSYAPS--MYEAAKLNEQLK 69 (111) T ss_pred ChHHhHHHHHhhcCCc-----ceEeecCCCCCCceEEEEecCCc----cccccccceEEEEecchh--HHHHHHHHHHHH Confidence 2334477788877654 58888999999999999554433 444557889999999875 779999999999 Q ss_pred HHhcCCCCccCCceEEEEEEeeeeeeecCCCce--EEEEEEEEEE Q lcl|NC_014229. 98 AALDRVPLTVAGCTDVSIKHSNHQALKDPEPGV--RHINAEYRVR 140 (145) Q Consensus 98 ~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~--~hg~l~fra~ 140 (145) .+|.+. ..++ ++..+.+.+-=.++|++++. |.++.++..- T Consensus 70 ~~~~~l-~~~~--~i~~v~~~s~Ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:94 70 EVVERL-IELN--EISNVSLNSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred HHHhhc-cccc--ccceeecCCCcccCCCcCCCceEEEEEEEeeC Confidence 999544 3444 45556666666677777543 4444433333 No 70 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=96.57 E-value=5.6e-05 Score=43.91 Aligned_cols=127 Identities=18% Similarity=0.219 Sum_probs=79.2 Q ss_pred ccchH---HHHHHHHHHHhhcChhhhhhhhccccCCcc---cCCCCEEE--eccceeeecCCCcccceEEEEEEEEEECC Q lcl|NC_014229. 12 MATAL---PALQASVYAKLVGHAPLTALVSGVYDEVPE---PAPYPYVS--FGSMTEFPEDAHDRQGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~---~aLq~Ai~~~L~~da~l~alv~~IyD~vP~---~a~~Pyv~--iG~~~~~~~~~~~~~~~~~~~~I~vws~~ 83 (145) |.+|+ .++|++|.++|+. .|...+ -+||+-|. ....|-|. +.+.+.++.. -|..-++..+.|.|+=+. T Consensus 1 ~~~~M~iht~IR~~Vid~L~~--~l~~~~-~ffdGrP~fiDe~ElPAVAV~l~da~~~~~~-ld~~~W~A~LhI~iyLka 76 (137) T protein:vir:79 1 MADPMNRHTQIRQVVLARLRE--QCGDSA-TFFDGLPAFVDAQELPAVSVWLSDAQYTGKM-TDEDDWQAVLHIAVFIRA 76 (137) T ss_pred CCchhHHHHHHHHHHHHHHHh--hcCCcE-EEeCCccceechhhCcEEEEEeecCCCCcce-ecCCeeEEEEEEEEEeec Confidence 88885 7899999999986 333322 27887763 24577644 4444444443 355568999999999776 Q ss_pred CCHHH-HHHHHHH-HHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 84 PGFAE-AHRIFAA-LDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 84 ~g~~e-a~~I~~a-V~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ..... .-++++. |..++...+ .+.+. +..+...+-+.-||.+-.+|+ +..+.+.++=+| T Consensus 77 ~~~ds~LD~~~E~~I~~v~~~~~-~l~~l-~~~~~~~gY~Y~rD~e~~tW~-sadL~y~ItYe~ 137 (137) T protein:vir:79 77 QAPDSELDMWMESTIFPALNDVP-ALSGL-IDTLIPLGFNYQRDNEMATWA-MAEITYQITYTN 137 (137) T ss_pred CCCHHHHHHHHHHHHHHhhcchh-hhhhH-hhhhhcccCCcccccccceeE-EEEEEEEEEEcC Confidence 54444 4457774 888886543 22321 334555666777888877774 455555555555 No 71 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=96.05 E-value=0.00012 Score=41.98 Aligned_cols=109 Identities=12% Similarity=0.149 Sum_probs=72.3 Q ss_pred HHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHHHHHHHH Q lcl|NC_014229. 18 ALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHRIFAALD 97 (145) Q Consensus 18 aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~I~~aV~ 97 (145) =+..-|...|...-++ .+|-.+|.+.+-+||++..+-.. +.......++.++||+.. +.+|.+++..|+ T Consensus 1 miE~~v~~~L~~~l~v-----pv~~~vp~~~P~~FV~vErtGG~----~~~~~~~p~laVq~wg~S--~~~Aa~La~~v~ 69 (111) T protein:vir:95 1 MIEIIINKYLDGHLDV-----PSFFEHEAEAPDSFVIIQKTGGK----ERNHSGSATFAFQSYAPT--MQKAAELNVKVK 69 (111) T ss_pred ChHHhHHHHhhhhcCe-----eEEeecCCCCCCceEEEEeeCCc----cccccccceEEEEecccc--HHHHHHHHHHHH Confidence 2233355566542222 57888999888999999555443 444457889999999875 889999999999 Q ss_pred HHhcCCCCccCCceEEEEEEeeeeeeecCCCc--eEEEEEEEEEE Q lcl|NC_014229. 98 AALDRVPLTVAGCTDVSIKHSNHQALKDPEPG--VRHINAEYRVR 140 (145) Q Consensus 98 ~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~--~~hg~l~fra~ 140 (145) .++.+. ..++ ++....+.+--.++|++++ .|..++++..- T Consensus 70 ~a~~~l-~~~~--~i~~v~~~s~ynf~d~~tk~~RYQ~~~~i~~~ 111 (111) T protein:vir:95 70 SAVKGL-IELD--SICGVHLNSDYNFTDTETKQYRYQAVFDINYF 111 (111) T ss_pred HHHhhh-hccc--cccccccCCccccCCCCCCCceEEEEEEEEeC Confidence 999544 2233 3556667766677787764 34444444444 No 72 >protein:vir:108220 Length: 133 # NCBI annotation: gp14 # Family: family:all:6424 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552343;genbank:gi:160700663;genbank:GeneID:5758940 Probab=95.81 E-value=0.00052 Score=38.60 Aligned_cols=118 Identities=14% Similarity=0.176 Sum_probs=73.2 Q ss_pred ccc------hHHHHHHHHHHHhhcChhhhhhhhccccCCccc----CCCCEEEeccc-eeeecCCCcccceEEEEEEEEE Q lcl|NC_014229. 12 MAT------ALPALQASVYAKLVGHAPLTALVSGVYDEVPEP----APYPYVSFGSM-TEFPEDAHDRQGLSVTVVIHVW 80 (145) Q Consensus 12 M~~------~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~----a~~Pyv~iG~~-~~~~~~~~~~~~~~~~~~I~vw 80 (145) |+. |..++..-+.+.|-+|. .+-+.+|++ +..|+|+++.- ...+|-.++ ...+.+.+| T Consensus 1 m~~~Rvp~D~~~~Ik~~L~~~l~a~v-------~~~~~lPddW~~~s~~P~vvV~dDggpv~wpv~t----~~~IRvtv~ 69 (133) T protein:vir:10 1 MSDVRVVGDPVPPVKAYLAAFWGARV-------RIADEVPDDWHVETDVPLIVVDDDGGPIDWPVKS----DPLVRCGIY 69 (133) T ss_pred CCCcccCCCChHHHHHHHHhhccccc-------eeeeecCCCccccCCceEEEEecCCCccccceec----cceEEEEEe Confidence 764 34444444555555442 356677764 56699777432 223333322 235667788 Q ss_pred ECCCCHHHHHHHHHHHHHHhcCCCCccCCceEE-EEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 81 SKSPGFAEAHRIFAALDAALDRVPLTVAGCTDV-SIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 81 s~~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v-~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +++ |.+|++|+.+...+|-.. .++|...+ +-.. .--.-||++++-+-+..++++..+.+- T Consensus 70 a~g--r~~Ar~l~~~~~g~LLa~--~i~Gva~ii~~g~-glL~aRD~~tgg~iAsfTV~A~~rt~~ 130 (133) T protein:vir:10 70 ANG--KQTAKNLRRITMGALLAE--PIPGIAHIQRTGI-GYVDARDPDTGADIASFTVTATVRTEV 130 (133) T ss_pred ecC--ChhHHHHHHHHHHHHhcC--CCCceeEEcCCCc-eEEecCCCCCCceEEEEEEEeeeeeeE Confidence 874 889999999999888443 45553322 1111 223448899888889999999888877 No 73 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=95.67 E-value=0.0003 Score=39.92 Aligned_cols=109 Identities=12% Similarity=0.141 Sum_probs=73.7 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHR 91 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~ 91 (145) |.. .-|...|...-++ .+|-.+|++.+-+||++..+-. .++......++.|++|+.. +.+|.+ T Consensus 1 mIE------~~i~~yL~~~l~v-----pv~~e~p~~~P~~FV~vEkTGG----~~~~~~~~a~lAvQsyg~S--~~~AA~ 63 (111) T protein:vir:97 1 MIE------VIIKKYLDEHLDV-----PSFFEHQKDEPARFIILEKTSG----AKQNHLLSSTFAFQSYAES--LYEAAL 63 (111) T ss_pred Chh------hhhhHHHhhhcCc-----eEEEeecCCCCCceEEEEeeCC----ccccccccceEEEEecchh--HHHHHH Confidence 322 1244455542222 5777778878889999955544 4555568889999999875 889999 Q ss_pred HHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCc--eEEEEEEEEEE Q lcl|NC_014229. 92 IFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPG--VRHINAEYRVR 140 (145) Q Consensus 92 I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~--~~hg~l~fra~ 140 (145) ++..|+.++.+.+ .++ ++.++.+.+.=.++|++++ .|.++.++..- T Consensus 64 La~~V~~a~~~l~-~l~--~i~~v~lns~Ynf~d~~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 64 LNDKVKQVIEQLD-VLP--QVSGVHLNADYNFTDTATKRYRYQAVFDINHY 111 (111) T ss_pred HHHHHHHHhhhhc-cCc--cceeeeecccccCCCCCCCCccEEEEEEEeeC Confidence 9999999996433 555 5667777777777888864 45555444444 No 74 >protein:vir:9931 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:2393 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795693;genbank:gi:28876455;genbank:GeneID:1258023 Probab=94.96 E-value=0.00076 Score=37.67 Aligned_cols=113 Identities=16% Similarity=0.110 Sum_probs=78.5 Q ss_pred cc-chHHHHHHHHHHHhhcChhhhhhhhccccCCcc-cCCCCEEEeccceeeecC-CC-cccceEEEEEEEEEECCCCHH Q lcl|NC_014229. 12 MA-TALPALQASVYAKLVGHAPLTALVSGVYDEVPE-PAPYPYVSFGSMTEFPED-AH-DRQGLSVTVVIHVWSKSPGFA 87 (145) Q Consensus 12 M~-~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~-~a~~Pyv~iG~~~~~~~~-~~-~~~~~~~~~~I~vws~~~g~~ 87 (145) |- ||+..+-+.|..+|..-. -+||=..|. +..-||+++|.....+.. ++ +....+..++|+++-..++|. T Consensus 1 md~sp~t~~Lk~i~~kL~~~~------IPiYfkLP~sdi~EPF~ViGsh~~DdsktA~~Ga~ivdt~lqIDlFyp~~sR~ 74 (119) T protein:vir:99 1 MDYSLETLYLKKVKNRLGVLD------IPIYFKLPKSDVLEPFIVVGTNISDLSKTAQTGAVIDDFSLNIDAFLPGDSRL 74 (119) T ss_pred CCcchhhHHHHHHHHhhcccC------cceEEeCCCCCcCCceEEEecccCccccccccceEEEeeeEEEEEeecCcccc Confidence 53 678888899999887522 168988886 677899999988765443 22 334467799999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCC--CceEEEEEEEEEEEE Q lcl|NC_014229. 88 EAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPE--PGVRHINAEYRVRLT 142 (145) Q Consensus 88 ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d--~~~~hg~l~fra~~~ 142 (145) .+.+|-.++.++|.... -..++.+.|-. ...||.+..+...+= T Consensus 75 d~eeiks~~~~~l~r~~------------~it~qil~DnSIGReVYhV~f~isd~i~ 119 (119) T protein:vir:99 75 DAEEIKSRMLRLLGRNN------------QIKAQILVDNSIGREVYRVAINITETLF 119 (119) T ss_pred cHHHHHHHHHHHhhhhh------------hhhhcccccccccceeeeeeeEeeeecC Confidence 99999999998884321 11233333321 136787777666665 No 75 >protein:vir:78057 Length: 154 # NCBI annotation: gp10 # Family: family:all:29813 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468794;genbank:gi:157325375;genbank:GeneID:5601819 Probab=94.12 E-value=0.0019 Score=35.50 Aligned_cols=139 Identities=14% Similarity=0.190 Sum_probs=90.8 Q ss_pred CceEEEeeCCcc---------cchHHHHHHHHHHHhhcChhhhhhhhccccC----CcccCCCCEEEeccceeeecCCCc Q lcl|NC_014229. 1 MPLLAIWAGGDM---------ATALPALQASVYAKLVGHAPLTALVSGVYDE----VPEPAPYPYVSFGSMTEFPEDAHD 67 (145) Q Consensus 1 ~~~~~~~~~~~M---------~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~----vP~~a~~Pyv~iG~~~~~~~~~~~ 67 (145) |-...=+-.|+. +----++-.-||.-|.. ++.++ ++|+-. |-..-.|||-++.-- ..+. .+. T Consensus 1 m~~~ir~~dg~~r~lydv~pnayn~ge~le~~y~ml~E--~i~s~-~~i~rn~nP~P~~si~YPy~tfe~D-~e~~-~dn 75 (154) T protein:vir:78 1 MAVNIRFPDGTVRPLYDVKPNAYNRGELLEIIYEMLNE--AVKSE-IDVFRNKNPKPVNSITYPYMTFEVD-NAKV-DDN 75 (154) T ss_pred CeeEeecCCCcccceeecCCCccchhHHHHHHHHHHHH--HHHHH-HHHHhhcCCCcceeEecceeeeeec-cccc-cCC Confidence 333333333332 22234556667776664 44443 345543 223457999999322 2221 145 Q ss_pred ccceEEEEEEEEEECCCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecC-CCceEEEEEEEEEEEEecC Q lcl|NC_014229. 68 RQGLSVTVVIHVWSKSPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDP-EPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 68 ~~~~~~~~~I~vws~~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~-d~~~~hg~l~fra~~~~~~ 145 (145) ..|.-+++.|+++.+..+..-...+.+.++.+|+.+....+.+. +.+.+..++.++|. |-..-+-.+..+|-++++. T Consensus 76 q~~~gvylDidlfDR~~s~~nl~~l~d~L~~~Ld~kR~lt~dy~-~~~~~e~snkIP~ET~reLLRR~va~~FYIer~~ 153 (154) T protein:vir:78 76 EHGTMVAVDCELFDRGTTSDMIDKYTDMLNNELDHKRHSYEDYW-VKTELERDRDIPDETDKELLRRMVALTFYIERND 153 (154) T ss_pred cccceEEEEEEEeecCCCchhHHHHHHHHHhhhhhhccccccee-EEEEEccCCCCCchhHHHHHhhhhheeEEEEecC Confidence 66788999999999999999999999999999988877777764 55788888888865 4333333456778888877 No 76 >protein:vir:106554 Length: 122 # NCBI annotation: putative protein # Family: family:all:6476 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958589;genbank:gi:41179248;genbank:GeneID:2717090 Probab=93.42 E-value=0.0042 Score=33.63 Aligned_cols=112 Identities=17% Similarity=0.187 Sum_probs=72.9 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCccc-CCCCEEEeccceeeec--CCCcccceEEEEEEEEEECCCCHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEP-APYPYVSFGSMTEFPE--DAHDRQGLSVTVVIHVWSKSPGFAE 88 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~-a~~Pyv~iG~~~~~~~--~~~~~~~~~~~~~I~vws~~~g~~e 88 (145) |.- .-+...+|+.|+.-+++. .|-|.-|.+ +.||-+.+-+.+.... +-+...-.+.+.+|++|++..+ T Consensus 1 m~~--INiK~~vy~~L~~v~e~k----~Vs~~YP~~w~~fP~~iY~t~~~~~~~~~~~~E~~t~w~itIDi~~~~~S--- 71 (122) T protein:vir:10 1 MEI--YNVKALVFKTLKSMPELK----LVSPSYPDKFTTFPAAIYSTSQSSYIRNAQQEETDTEWKITIDLYNDHGS--- 71 (122) T ss_pred Cce--eeccHHHHHHHhhccccc----ccCCCCCCCcccCcEEEEecCCCceeeecCcceeeEEEEEEEEEEcCCcc--- Confidence 432 346678899888766653 355665654 7899999865554322 2334445788999999997543 Q ss_pred HHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 89 AHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 89 a~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ..+|+.+|.+++... | |.......||.| ..|-+++|+.++.-++ T Consensus 72 tt~ia~~i~~~f~~l-----G-------ft~~~~~~d~sg-lkr~vmr~~gIVDn~t 115 (122) T protein:vir:10 72 LTNIKAKLIARFSAM-----G-------FSNSVGDQDLNG-VSRVVIVFAGIVDNTS 115 (122) T ss_pred HHHHHHHHHHHHhhc-----c-------ccccCCCCCcCC-CeEEEEEEEEEEEccc Confidence 455666676666322 1 233333445554 6799999999999888 No 77 >protein:vir:7450 Length: 141 # NCBI annotation: gp27 # Family: family:all:6926 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818565;genbank:gi:29567002;genbank:GeneID:1260235 Probab=93.24 E-value=0.0032 Score=34.28 Aligned_cols=126 Identities=14% Similarity=0.160 Sum_probs=73.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh---ccccCC----cccCCCCEEEeccceeeecCCCcccceEEEEEEEE-EECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS---GVYDEV----PEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHV-WSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~---~IyD~v----P~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~v-ws~~ 83 (145) |+ +.+++.+|++|++|.++++ .||+.- +.+-+-|||++-=...+=.++-...-+.+.+.+|. |... T Consensus 1 M~------~a~vl~~lr~D~~L~a~g~~~~~v~~~~~~d~rP~~~G~FiV~~W~~~~i~~~I~rgPr~~~iwvH~P~~~s 74 (141) T protein:vir:74 1 MH------PSILYDSIAHDPELNAMGITPSRIKELDSIDKRPFDSGYFIVTRWLDQDLHPTINRGPRDLMVWCHMPKDRG 74 (141) T ss_pred Cc------HHHHHHHHhccchhhhhccccceeeecccccCCCCCCCcEEEEeccCcccccccCCCCceEEEEEecchhcc Confidence 54 3579999999999999964 476642 22236788887333222223333334666666665 4556 Q ss_pred CCHHHHHHHHHHHHHHhcCCCCc--cCCceEEEEEEee-eeeeecC--CCceEEEEEEEEEEEEecC Q lcl|NC_014229. 84 PGFAEAHRIFAALDAALDRVPLT--VAGCTDVSIKHSN-HQALKDP--EPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 84 ~g~~ea~~I~~aV~~aL~~~~l~--l~g~~~v~~~~~~-~~~~~d~--d~~~~hg~l~fra~~~~~~ 145 (145) .++....+|.++|.+....-+-. .+|-++...++.. +..+.|+ .+..+|+ +|-++..+|- T Consensus 75 tdf~~id~il~Ri~eI~~svE~~~G~DG~~l~~v~~~g~s~dl~D~G~kTi~R~A--Ty~vL~d~nt 139 (141) T protein:vir:74 75 RNFLPIERILERINDIWASVEAQTGTDGVRVTSVKRRGQSGNLEDEGWKTLARNA--TFSVLYDRNT 139 (141) T ss_pred CCcchHHHHHHHHHHHHhhccccccCCceEEEEEeeeccCCCccccchhhhhhhc--eeeeeeccee Confidence 67888888888888776443322 3566766666642 3333433 1233443 3455555444 No 78 >protein:vir:9648 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795410;genbank:gi:28876183;genbank:GeneID:1257699 Probab=92.67 E-value=0.0005 Score=38.70 Aligned_cols=117 Identities=15% Similarity=0.072 Sum_probs=72.8 Q ss_pred cccchHHHHHHHHHHHhhcChhhhhhhhcccc-CCcc--cCCCCEEEecccee-ee--cCCCcccceEEEEEEEEEECCC Q lcl|NC_014229. 11 DMATALPALQASVYAKLVGHAPLTALVSGVYD-EVPE--PAPYPYVSFGSMTE-FP--EDAHDRQGLSVTVVIHVWSKSP 84 (145) Q Consensus 11 ~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD-~vP~--~a~~Pyv~iG~~~~-~~--~~~~~~~~~~~~~~I~vws~~~ 84 (145) =|. .+-+-|+..|.+|.-|... +|+= ..|+ +..-|||+|-+... .| ...+..-..+..+||+|||.. T Consensus 1 mm~----DiL~~Iy~~L~~d~~l~~~--rIk~~~~Pe~~d~~~p~IvI~pl~~P~p~~~~sd~~ls~~ylyQIDVes~~- 73 (126) T protein:vir:96 1 MVR----DMLAEVFDLLKADNVLKLV--KIKSFERPESLLDDQTSIVILPITAPKQSTFGSDTALSKKFLYQIEVESTS- 73 (126) T ss_pred Chh----HHHHHHHHHHhccceecce--eeeeeecCCCCCCCcceEEEeeCCCCCCccccCchhhhhhceeeEeeeecC- Confidence 333 4556778888888655432 5532 3444 67899999988755 33 334455568889999998875 Q ss_pred CHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEE---EEec Q lcl|NC_014229. 85 GFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVR---LTLD 144 (145) Q Consensus 85 g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~---~~~~ 144 (145) |.+.++|..+|+..|-. +- +. +....-..-++|+..+|-.-+||-. .++= T Consensus 74 -r~~~~~i~~rI~~~l~~--ig---f~----q~s~gldeY~~etkry~daRRYrg~~k~yeey 126 (126) T protein:vir:96 74 -RLECKDLQCRIEKQLEK--IG---FY----QNDAGFERFDRDTGRYLDARTFRGFSNIYEDY 126 (126) T ss_pred -ccchHHHHHHHHHHHHH--cC---cc----ccccCcchhhhhhhhhhhhheecccchhhhcC Confidence 88999999999999942 11 11 0111112234566666666666663 2222 No 79 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=92.61 E-value=0.0052 Score=33.11 Aligned_cols=123 Identities=19% Similarity=0.260 Sum_probs=79.4 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcc---cCCCCEEE--eccceeeecCCCcccceEEEEEEEEEECCCCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPE---PAPYPYVS--FGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~---~a~~Pyv~--iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~ 86 (145) |+ -.++|++|.++|++. +... ..||+-|. ....|-|. +.+.+.++.. -|..-++..+.|.|+=+.... T Consensus 1 ~~--ht~IR~~Vid~L~~~--l~~v--~~fdG~P~fide~ElPAVAV~l~d~~~~~~~-ld~~~w~A~LhI~iyLka~~~ 73 (131) T protein:vir:34 1 MK--HTELRAAVLDALEKH--DTGA--TFFDGRPAVFDEADFPAVAVYLTGAEYTGEE-LDSDTWQAELHIEVFLPAQVP 73 (131) T ss_pred Cc--hHHHHHHHHHHHhcc--CCce--EEecCCceeeccccCcEEEEEeecCCCCcce-ecCCeeEEEEEEEEEeecCCC Confidence 55 468999999999873 4332 27888773 45677754 4444444333 367788899999999776544 Q ss_pred H-HHHHHHHH-HHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEE-EEEEEEEEEEe Q lcl|NC_014229. 87 A-EAHRIFAA-LDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRH-INAEYRVRLTL 143 (145) Q Consensus 87 ~-ea~~I~~a-V~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~h-g~l~fra~~~~ 143 (145) . +.-++++. |..++.+.+ .+.+ .+..+....-+.-||.+-.+|+ +.+.|++..+= T Consensus 74 ds~LD~~~E~~i~~v~~~~~-~l~~-l~~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) T protein:vir:34 74 DSELDAWMESRIYPVMSDIP-ALSD-LITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) T ss_pred HHHHHHHHHHHhHHHhhcch-hhhh-HhhhhhhccCCcccccccceEEEEEEEEEEEEeC Confidence 4 44457776 778885322 1121 1445666667777888877665 45667777666 No 80 >protein:vir:107857 Length: 154 # NCBI annotation: gp37 # Family: family:all:1532 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024710;genbank:gi:48696947;genbank:GeneID:2845945 Probab=91.00 E-value=0.0063 Score=32.65 Aligned_cols=126 Identities=17% Similarity=0.183 Sum_probs=76.4 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcccC------CCCEEEeccceee-ecCCC-cccceEEEEEEEEEECC- Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPA------PYPYVSFGSMTEF-PEDAH-DRQGLSVTVVIHVWSKS- 83 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a------~~Pyv~iG~~~~~-~~~~~-~~~~~~~~~~I~vws~~- 83 (145) +++..++..+|.+||+..-+ .+---.|..-|++- ..=.|.++-+... +.+++ -.+-+++.+.+.|..+. T Consensus 1 m~~t~~ii~aiv~rL~~~lP--~~~ve~fP~~p~ey~l~h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi~r~l 78 (154) T protein:vir:10 1 MATTLEMVDAIVARLRVKLP--ALVTEYFPERPDEYRLNHAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIVLRQL 78 (154) T ss_pred CchhHHHHHHHHHHHHHhCC--cceEeeCCCChhHcCCCCCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEEeecc Confidence 56789999999999997322 22112454433321 1223555555443 44433 44668888888888765 Q ss_pred CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEE-------------------EEec Q lcl|NC_014229. 84 PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVR-------------------LTLD 144 (145) Q Consensus 84 ~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~-------------------~~~~ 144 (145) .|+..+..+.++|+.+|-+. .++|.. .|+..+-+...+.+| .||=.+.|-.- .|++ T Consensus 79 ~g~~gal~~LD~vR~aL~Gf--~ppdc~--~~~lv~d~f~ge~~G-~W~Y~l~~at~t~~Ve~~~~~d~pll~~v~yee~ 153 (154) T protein:vir:10 79 NGRGGAIDVLDHVRTALVGF--RPPDCK--KLAAVSDKFLGESAG-LWQYVIEFSAGAVIVEDAEPNDGPLLTQVTYEEE 153 (154) T ss_pred CCcchhhHHHHHHHHHHhcc--ccCCCc--eeehhhhcccccccc-eeeeeeeeccchhhhhccCCCCCceeeeeeeccc Confidence 68899999999999999554 455643 577776666665554 46544444321 1111 Q ss_pred C Q lcl|NC_014229. 145 S 145 (145) Q Consensus 145 ~ 145 (145) + T Consensus 154 ~ 154 (154) T protein:vir:10 154 S 154 (154) T ss_pred C Confidence 1 No 81 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=90.88 E-value=0.019 Score=30.08 Aligned_cols=120 Identities=13% Similarity=0.149 Sum_probs=71.8 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhc-cccCCcccCCCCEEEec--cceeeecC-CCcccceEEEEEEEEEECC-CCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSG-VYDEVPEPAPYPYVSFG--SMTEFPED-AHDRQGLSVTVVIHVWSKS-PGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~-IyD~vP~~a~~Pyv~iG--~~~~~~~~-~~~~~~~~~~~~I~vws~~-~g~ 86 (145) |+. .+++.+|.+++++. ... .+ +|++.|.+..-+|+.+- .....-.+ .+-...+.-.+.|.|++.. .|. T Consensus 1 Mt~--~q~r~~I~~r~~a~--~~~--~~I~~~N~pp~~~~~W~Rlti~~g~~~~a~iG~~~~~rtGli~iqiF~p~~~G~ 74 (125) T protein:vir:94 1 MSY--FQEKLDIENYFKAN--WPD--TPIFYENRTANSTGTWVRLTIQNGDAFQASNGEVSYRHPGVVFVQIFTKKEVGS 74 (125) T ss_pred CCH--HHHHHHHHHHHHhC--CCc--cceeeCCCCCCCCCceEEEEeccCcccccccCCceeeeeeEEEEEeeecCCcCh Confidence 875 57899999999852 211 13 68876655567776552 21111111 1122335568899999975 578 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) ..+.++++++++....+ +..+.. +.-.+..... +|++.|..-|.. ...+-| T Consensus 75 ~~~~~~ad~~~~~f~~~--~~g~i~---f~~~~~~~~g-~~~gwyQ~Nv~I--~f~~~~ 125 (125) T protein:vir:94 75 GEALKLADKVDALFRSK--TLGNIQ---FKVPQVQKVP-STTEWYQVNVST--EFYRGS 125 (125) T ss_pred HHHHHHHHHHHHHHccC--CCCceE---EeeceecCCC-CCCCEEEEEEEE--eeecCC Confidence 89999999999999766 334432 2222333333 356777555443 444555 No 82 >protein:vir:79065 Length: 154 # NCBI annotation: gp11 # Family: family:all:1532 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111211;genbank:gi:134288825;genbank:GeneID:4960739 Probab=90.85 E-value=0.0068 Score=32.45 Aligned_cols=126 Identities=18% Similarity=0.188 Sum_probs=76.4 Q ss_pred cchHHHHHHHHHHHhhcChhhhhhhhccccCCcccC------CCCEEEeccceee-ecCCC-cccceEEEEEEEEEECC- Q lcl|NC_014229. 13 ATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPA------PYPYVSFGSMTEF-PEDAH-DRQGLSVTVVIHVWSKS- 83 (145) Q Consensus 13 ~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a------~~Pyv~iG~~~~~-~~~~~-~~~~~~~~~~I~vws~~- 83 (145) +++..++..+|.+||+..-+ .+---.|..-|++- ..=.|.++-+... +.+++ -.+-+++.+.+.|..+. T Consensus 1 m~~t~~ii~~iv~rL~~~lP--~~~ve~fP~~p~ey~l~h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi~r~l 78 (154) T protein:vir:79 1 MATTLEMVDSVVARLRVKLP--ALVTEYFPERPDEYRLNHAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIVLRQL 78 (154) T ss_pred CchhHHHHHHHHHHHHHhCC--cceEeeCCCChhHcCCCCCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEEeecc Confidence 56789999999999997322 22112454433321 1223555555443 44433 44668888888888765 Q ss_pred CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEE-------------------EEEec Q lcl|NC_014229. 84 PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRV-------------------RLTLD 144 (145) Q Consensus 84 ~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra-------------------~~~~~ 144 (145) .|+..+..+.++|+.+|-+. .++|.. .|+..+-+...+.+| .||=.+.|-. ..|++ T Consensus 79 ~g~~gal~~LD~vR~aL~Gf--~ppdc~--~~~lv~d~f~ge~~G-~W~Y~l~~at~t~~Ve~~e~~d~pll~~v~yee~ 153 (154) T protein:vir:79 79 NGRGGAIDVLDHVRTALVGF--RPPDCK--KLAAVSDKFLGESAG-LWQYVIEFSAGAVIVEDAEPNDGPLLTQVTYEEE 153 (154) T ss_pred CCcchhhHHHHHHHHHHhcc--ccCCCc--eeehhhhcccccccc-eeeeeeeeccchhhhccCCCCCCceeeeEeeeec Confidence 68899999999999999554 455643 577776666665554 4654444432 11111 Q ss_pred C Q lcl|NC_014229. 145 S 145 (145) Q Consensus 145 ~ 145 (145) + T Consensus 154 ~ 154 (154) T protein:vir:79 154 S 154 (154) T ss_pred C Confidence 1 No 83 >protein:vir:102955 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945290;genbank:gi:39653725;uniprot:Q708M2;genbank:GeneID:2672869 Probab=90.17 E-value=0.022 Score=29.64 Aligned_cols=118 Identities=15% Similarity=0.096 Sum_probs=78.0 Q ss_pred ccchHHHHHHHHHHHhhcC-hhhhhhhhcccc-CCcccCCCCEEEeccceee-ecCCCcccceEEEEEEEEEECCCCHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGH-APLTALVSGVYD-EVPEPAPYPYVSFGSMTEF-PEDAHDRQGLSVTVVIHVWSKSPGFAE 88 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~d-a~l~alv~~IyD-~vP~~a~~Pyv~iG~~~~~-~~~~~~~~~~~~~~~I~vws~~~g~~e 88 (145) |+.--.+|+.||-..|+.. |. -.||+ .++++-..|+.-+--.+.. ........-..+.+.|+=+.+.+...+ T Consensus 1 ~~~~~~~I~~aI~~~Lk~~fpd-----~~Iy~e~i~Qgf~~PcFFI~ll~~~~~~~~~~r~~r~~~~dI~Yfp~~~~~~e 75 (138) T protein:vir:10 1 MANKGFRLVEELVSHIKGLYPD-----IRIYLDEVEQGFKEPCFFIHVVDTKYTPEANKYVKVRSKVDLSYFPPKKKRSE 75 (138) T ss_pred CCcchhhhHHHHHHHHHHhcCC-----ceeeecccccCCcCCeEEEEEecccCccccCceEEEEEEEEEEEecCcchhHH Confidence 9988999999999999963 12 15897 5899998888544333332 223445567889999998877777899 Q ss_pred HHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 89 AHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 89 a~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +.++++.+..+|...+ -++..+. +..+ -||. -|-.++++..+.... T Consensus 76 ~~~v~e~L~~~f~~~~----~i~~~~~---~~~I---~DgV-Lhf~f~~~~~~~k~~ 121 (138) T protein:vir:10 76 CLAMQEELSYKLLHLP----TIHLFDR---QYEV---VDNV-LHCIFNASTRLKLEE 121 (138) T ss_pred HHHHHHHHHHHHhhcC----eeeeecc---eeeE---EcCe-EEEEEEEEEEEeeec Confidence 9999999999995432 1221111 1222 2543 456666666555444 No 84 >protein:vir:101509 Length: 139 # NCBI annotation: gp22 # Family: family:all:6926 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655401;genbank:gi:109522589;genbank:GeneID:4157581 Probab=88.54 E-value=0.0065 Score=32.58 Aligned_cols=124 Identities=20% Similarity=0.253 Sum_probs=67.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--ccccC-----Cccc-CCCCEEEeccceeeecCCCcccceEEEEEEEE-EEC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYDE-----VPEP-APYPYVSFGSMTEFPEDAHDRQGLSVTVVIHV-WSK 82 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD~-----vP~~-a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~v-ws~ 82 (145) |+ +.+++.+|++|++|..++- .||.. .|.- .+-|||++-=...+=..+-...-+++.+.+|. |.. T Consensus 1 M~------~s~l~d~lr~D~~L~~~lvps~i~~~~~~d~rPnh~d~G~FiV~~W~~~~i~~~I~rgPr~~~iwvH~P~~~ 74 (139) T protein:vir:10 1 MS------RAAVLDALRADVALGQMLVPSNILTNYSKEGPPNHLAPGPFAVIRWGGKTIDPAVNRGPRDVNIWVHIPQRQ 74 (139) T ss_pred Cc------HHHHHHHHhcccccCeeeccchhhhcccccCCCCCCCCCceEEEeccccccccccCCCCceEEEEEecchhc Confidence 54 3579999999999988874 46643 3321 45688887322222223333344666666665 455 Q ss_pred CCCHHHHHHHHHHHHHHhcCCC--CccCCceEEEEEEee-eeeeecC--CCceEEEEEEEEEEEEecC Q lcl|NC_014229. 83 SPGFAEAHRIFAALDAALDRVP--LTVAGCTDVSIKHSN-HQALKDP--EPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 83 ~~g~~ea~~I~~aV~~aL~~~~--l~l~g~~~v~~~~~~-~~~~~d~--d~~~~hg~l~fra~~~~~~ 145 (145) ..++....+|.++|.+....-+ .-.+|-++...++.. +..+.|+ .+..+|+.. -+ --++ T Consensus 75 std~~~id~il~Ri~eI~~svE~~~G~DG~~v~~vr~~g~s~nl~D~G~kTi~R~AT~--~v--Ls~~ 138 (139) T protein:vir:10 75 STDYTRIDQILKRTKEIMLSLEDVAGADGAHLVSTRFLAESDDLVDPGFETITRYATF--SV--LSRS 138 (139) T ss_pred cCCcchHHHHHHHHHHHHHHhhhhccCCceEEEEEeeeccCCCccccchhhhhhhhhh--hh--eecC Confidence 6678888888887776653221 123566666666642 2333332 122334321 11 1222 No 85 >protein:vir:102191 Length: 139 # NCBI annotation: gp22 # Family: family:all:6926 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655218;genbank:gi:109522798;genbank:GeneID:4157430 Probab=87.42 E-value=0.0087 Score=31.87 Aligned_cols=124 Identities=21% Similarity=0.261 Sum_probs=67.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh--ccccC-----Cccc-CCCCEEEeccceeeecCCCcccceEEEEEEEE-EEC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS--GVYDE-----VPEP-APYPYVSFGSMTEFPEDAHDRQGLSVTVVIHV-WSK 82 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~--~IyD~-----vP~~-a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~v-ws~ 82 (145) |+ +.+++.+|++|++|..++- .||.. .|.. ++-|||++-=...+=..+-...-+++.+.+|. |.. T Consensus 1 M~------~s~l~d~lr~D~~L~~~lvps~i~~~~~~d~rP~h~~~G~FiV~~W~~~~i~~~I~rgPr~~~iwvH~P~~~ 74 (139) T protein:vir:10 1 MS------RAAVLDALRADVALGQMLVPSNILTNYSKEGPPNHLAPGPFAVIRWGGKTIDPAVNRGPRDVNIWVHIPQRQ 74 (139) T ss_pred Cc------HHHHHHHHhcccccCeeecchhhhhcccccCCCCCCCCCceEEEeccCcccccccCCCCceEEEEEecchhc Confidence 54 3579999999999988874 46643 3332 36688887322222223333344666666665 455 Q ss_pred CCCHHHHHHHHHHHHHHhcCCC--CccCCceEEEEEEee-eeeeecC--CCceEEEEEEEEEEEEecC Q lcl|NC_014229. 83 SPGFAEAHRIFAALDAALDRVP--LTVAGCTDVSIKHSN-HQALKDP--EPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 83 ~~g~~ea~~I~~aV~~aL~~~~--l~l~g~~~v~~~~~~-~~~~~d~--d~~~~hg~l~fra~~~~~~ 145 (145) ..++....+|.++|.+....-+ .-.+|-++...++.. +..+.|+ .+..+|+.. -+ --++ T Consensus 75 std~~~idril~Ri~eI~~svE~~~G~DG~~v~~vr~~g~s~nl~D~G~kTi~R~AT~--~v--Ls~~ 138 (139) T protein:vir:10 75 STDYTRIDRILKRTKEIMLSLEDVAGADGAHLVSTRFLAESDDLVDPGFETITRYATF--SV--LSRS 138 (139) T ss_pred cCCcchHHHHHHHHHHHHHHhhhhccCCceeEEEeeeeccCCCccccchhhhhhhhhh--hh--eecC Confidence 6678888888887776653221 123566666666642 2333332 122333321 11 1222 No 86 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=86.62 E-value=0.044 Score=28.01 Aligned_cols=129 Identities=16% Similarity=0.120 Sum_probs=69.2 Q ss_pred CceEEEeeCCcccchHHHHHHHHHHHhhcChhhhhhhh--c-ccc-CCcccCCCCEEEeccceeeec---CCCccc--ce Q lcl|NC_014229. 1 MPLLAIWAGGDMATALPALQASVYAKLVGHAPLTALVS--G-VYD-EVPEPAPYPYVSFGSMTEFPE---DAHDRQ--GL 71 (145) Q Consensus 1 ~~~~~~~~~~~M~~~~~aLq~Ai~~~L~~da~l~alv~--~-IyD-~vP~~a~~Pyv~iG~~~~~~~---~~~~~~--~~ 71 (145) ||-.- |+=+- .-|-+-|+..|.+ +.+ . |++ ...+.+.|||+++-= ++|+ +.+-.+ -- T Consensus 1 ~~~~~--~~~~~----~~lv~~ii~~i~~------~~~gl~vI~~~~~g~~p~yPF~TY~v--~~pyi~~~~~~~~~e~~ 66 (162) T protein:vir:80 1 MPNDT--AGYDY----GKLVKTLINAVNE------LSGGLQLIESSSGGEQPEYPFCQYTI--TSPYIAISPDIVEGEQF 66 (162) T ss_pred CCCcc--ccccH----HHHHHHHHHHHHh------hhcceeEEEccCCCCCCCCCeEEEEE--ecCccccCCcccCCcce Confidence 66321 11111 1133444444432 332 2 444 467889999999742 2232 222112 34 Q ss_pred EEEEEEEEEECCCCHHHHHHHHHHHHHHhcCCCCc----c-CCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 72 SVTVVIHVWSKSPGFAEAHRIFAALDAALDRVPLT----V-AGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 72 ~~~~~I~vws~~~g~~ea~~I~~aV~~aL~~~~l~----l-~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +.+++|+|.|.. ..||.++++.++..|...... - +|..+++..-...|...-..-..++-=-.++++++++- T Consensus 67 ~~~isi~~~S~~--~~eAl~la~~l~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~~~~~~yerR~GFD~~~Rv~r~~ 143 (162) T protein:vir:80 67 EIVISLTWRALS--GHQALNLANITNKYFRSQKGRFFMQENGGIVVVSVQNSGLRDTFISIEYERSAGIDLRLRVVDSY 143 (162) T ss_pred EEEEEEEEEeCC--HHHHHHHHHHHHHHhhcCCceeeeeecCcEEEEecCCCccceeEeeeeeeeeecceEEEEEeecc Confidence 567888888875 789999999999999643211 1 12334444444444433333334555566777777765 No 87 >protein:vir:96764 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1090 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039825;genbank:gi:126010857;genbank:GeneID:5076274 Probab=86.54 E-value=0.045 Score=27.98 Aligned_cols=130 Identities=15% Similarity=0.079 Sum_probs=70.4 Q ss_pred ccchH--HHHHHHHHHHhhc-ChhhhhhhhccccCCccc-CCCCEEEeccceeee--cCCCcccceEEEEEEEEEECC-- Q lcl|NC_014229. 12 MATAL--PALQASVYAKLVG-HAPLTALVSGVYDEVPEP-APYPYVSFGSMTEFP--EDAHDRQGLSVTVVIHVWSKS-- 83 (145) Q Consensus 12 M~~~~--~aLq~Ai~~~L~~-da~l~alv~~IyD~vP~~-a~~Pyv~iG~~~~~~--~~~~~~~~~~~~~~I~vws~~-- 83 (145) |+..+ .+|+.||.+.|++ -|.+... ..|+..++. -.-|-+.|+=....+ ....+......+++++|.-.. T Consensus 1 ~~~l~~~s~lh~AI~~~l~~~~P~l~tV--~~y~~~~~~~~~tPAv~iel~~~~~~~d~g~G~~~~~~r~~a~vvv~~~~ 78 (177) T protein:vir:96 1 MVTLKQPSDLYDAIQAELESRLADEVTV--ASYADFGDVQVVDAMVLIEFEQTSPATRGHDGRYCHQYDITLHAVVGRQR 78 (177) T ss_pred CCccchhHHHHHHHHHHHHHhCccceee--ccccccccccccCceeEEeeccCCcccCCCCCceEEEEEEEEEEEeCCCC Confidence 88765 3699999999997 3333211 457665542 234766665233222 233344456888999996432 Q ss_pred -CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEE---eeeeeeecCCCc-eEEEEEEEEEEEEecC Q lcl|NC_014229. 84 -PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKH---SNHQALKDPEPG-VRHINAEYRVRLTLDS 145 (145) Q Consensus 84 -~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~---~~~~~~~d~d~~-~~hg~l~fra~~~~~~ 145 (145) +-..++..++.++...+++..--+++-.+..-.+ .-....++-||. .| .|+|+=.+---+ T Consensus 79 ~~~~l~a~~lAa~l~~~v~~~~wGLp~~~v~~~~~i~a~pd~f~p~ldgy~vW--~Vew~Q~i~LG~ 143 (177) T protein:vir:96 79 QRAELEAINLAAAIERVTDENLWGLPYQQVDRPENIRSAPSMFKVGSDGYDAW--GVSFRQRIYLGA 143 (177) T ss_pred CChHHHHHHHHHHHHHHHhcccccCCccccccceeeeccccccccccCceeEE--EEEEEEEEecCC Confidence 3368899999999999988765554332221111 111112222332 22 333333332222 No 88 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=85.92 E-value=0.049 Score=27.75 Aligned_cols=125 Identities=12% Similarity=0.154 Sum_probs=74.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcc---cCCCCEEEeccceeeecC-CCcccceEEEEEEEEEECCC-CH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPE---PAPYPYVSFGSMTEFPED-AHDRQGLSVTVVIHVWSKSP-GF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~---~a~~Pyv~iG~~~~~~~~-~~~~~~~~~~~~I~vws~~~-g~ 86 (145) |+ -.++|++|.++|+.. +... .-+||+-|. ....|-|.+==.+..+.+ +-|..-++..+.|.|+=+.. +- T Consensus 1 ~~--ht~IR~~Vid~L~~~--l~~~-~~ffdGrP~fiDe~elPAVAV~l~d~~~~~~~ld~~~w~A~LhI~iyLka~~~d 75 (132) T protein:vir:39 1 MK--HRDIRKVIIDALESA--IGTD-AIYFDGRPAVLEEGDFPAVAVYLTDAEYTGEELDADTWQAILHIEVFLEAQVPD 75 (132) T ss_pred Cc--hHHHHHHHHHHHHhh--CCCc-eEEecCcceeeccccCcEEEEEeecCCCCcceecCCeeEEEEEEEEEeecCCCH Confidence 55 468999999999973 2221 137888773 556787554333333332 34677888899999996654 34 Q ss_pred HHHHHHHH-HHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEE-EEEEEEEEEEe Q lcl|NC_014229. 87 AEAHRIFA-ALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRH-INAEYRVRLTL 143 (145) Q Consensus 87 ~ea~~I~~-aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~h-g~l~fra~~~~ 143 (145) .+.-.+++ .|+.++.+.+ .+.+ -+..+....-+.-||.+..+|+ +.+.|++..+- T Consensus 76 s~LD~~aE~~i~p~i~~~~-~l~~-l~~~~~~~gy~Y~rD~~~atW~sadL~y~ItY~~ 132 (132) T protein:vir:39 76 SELDDWMETRVYPVLAEVP-GLES-LITTMVQQGYDYQRDDDMALWSSADLKYSITYDM 132 (132) T ss_pred HHHHHHHHHHhHhhhcccc-hhhh-HhhhhhhcCCCcccccccceEEEEEEEEEEEEeC Confidence 55557777 4556664321 1221 1122333445677888888775 34666666666 No 89 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=85.46 E-value=0.053 Score=27.59 Aligned_cols=130 Identities=16% Similarity=0.039 Sum_probs=75.8 Q ss_pred ccchH-HHHHHHHHHHhhc-ChhhhhhhhccccCCcccCCCCEEEeccceeeec--CCCcccceEEEEEEEEEECC---C Q lcl|NC_014229. 12 MATAL-PALQASVYAKLVG-HAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPE--DAHDRQGLSVTVVIHVWSKS---P 84 (145) Q Consensus 12 M~~~~-~aLq~Ai~~~L~~-da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~--~~~~~~~~~~~~~I~vws~~---~ 84 (145) ||.-. .+|+.||.+.|++ -|.|+ +...|+....+-..|-+.+.=....+. ...++.....++.++|.-.. . T Consensus 1 mt~~~l~~lh~AI~~~Lk~~~p~l~--~~~~y~~~~~~i~~PAv~vel~~~~~~~d~~tGq~~~~~~~~a~~vv~~~~~~ 78 (182) T protein:vir:10 1 MSQTTITEVHEAIKAKLRETFPKVT--VDDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVKS 78 (182) T ss_pred CCcCCHHHHHHHHHHHHHHhcCCce--eeecCccccCccccceeeeeeecCCcCCCCCCCcEEEEEEEEEEEEecccCCC Confidence 88764 5699999999994 55552 224676666655567666655554433 23455567788888888653 2 Q ss_pred CHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeee---eeec-CCCc-eEEEEEEEEEEEEecC Q lcl|NC_014229. 85 GFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQ---ALKD-PEPG-VRHINAEYRVRLTLDS 145 (145) Q Consensus 85 g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~---~~~d-~d~~-~~hg~l~fra~~~~~~ 145 (145) -..++..++.++...+++..--+++..+-..++..+. ..++ .||. .| .|+|.=.+---+ T Consensus 79 ~~~~~~~lAa~l~~~v~~~~wGL~~~~v~~a~~i~a~p~~f~~~~~dgy~vW--~VeW~Q~i~LG~ 142 (182) T protein:vir:10 79 LALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSR--VVTWNQTLYLGE 142 (182) T ss_pred chHHHHHHHHHHHHHHhcCcccCCccccCccceeeeccCccChhhcCceEEE--EEEEEEEEeeCC Confidence 3578999999999999887655543222222332221 1121 2432 22 344443332211 No 90 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=84.76 E-value=0.02 Score=29.88 Aligned_cols=119 Identities=16% Similarity=0.089 Sum_probs=69.3 Q ss_pred ccchHH-HHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 12 MATALP-ALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 12 M~~~~~-aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) |+.-+. ...+-+.++|.. + +++--+-+.+.+.||+++-+....+.. ......-.+++|++.. |..+|. T Consensus 1 m~~~saP~~e~~vv~WLsp---~----~~va~~R~~~~PLPf~~V~Rv~G~d~~--e~~tD~avvsv~~fg~--~~eaA~ 69 (134) T protein:vir:79 1 MATDSAPSIHRVLVAWLSP---L----GKVSTRRLSGDPLPHRVVRRVDGRDVP--EEGSDSAVVSVHTFAA--SDEAAE 69 (134) T ss_pred CCcccCCChheeeeeeccc---c----hhceeccCCCCCCCeEEEEEeCCCCCc--cccccCceeEEEEeeC--CHHHhh Confidence 544321 133444455542 1 122223467889999999665544332 2333556788999984 567788 Q ss_pred HHHHHHHHHh----cCC--CCccCCceEEEEEEeeeeee-----ecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAAL----DRV--PLTVAGCTDVSIKHSNHQAL-----KDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL----~~~--~l~l~g~~~v~~~~~~~~~~-----~d~d~~~~hg~l~fra~~~~~~ 145 (145) .+++.+-+.+ -+. ..++.|+.+..+.+...-.- .+.|+ | +++|.+++|--. T Consensus 70 d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~vl~~P~~~eY~dD~---~-~vrytgRY~~g~ 131 (134) T protein:vir:79 70 NEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDG---H-LVRHVGRYEIGV 131 (134) T ss_pred HHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCc---e-EEEEeeeeeecc Confidence 8887776655 233 35567888777776544222 22343 2 677777777777 No 91 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=84.70 E-value=0.021 Score=29.80 Aligned_cols=119 Identities=15% Similarity=0.079 Sum_probs=69.3 Q ss_pred ccchHH-HHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 12 MATALP-ALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 12 M~~~~~-aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) |+.-+. ...+-+.++|.. + +++--+-+.+.+.||+++-+....+. .......-.+++|++.. |..+|. T Consensus 1 m~~~saP~~e~~vv~WLsp---~----~~va~~R~~~~PLPf~~V~Rv~G~d~--~e~~tD~avvsv~~fg~--~~eaA~ 69 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSP---L----GKVSTRRLSGDPLPHRVVRRVDGRDV--PEEGSDVAVVSVHTFAA--SDEAAE 69 (134) T ss_pred CCcccCCChheeeeeeccc---c----hhceeccCCCCCCCeEEEEEeCCCCC--cccccccceEEEEEeeC--CHHHhh Confidence 544321 133444455542 1 12222346788999999966554433 23333556789999984 567788 Q ss_pred HHHHHHHHHh----cCC--CCccCCceEEEEEEeeeeee-----ecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAAL----DRV--PLTVAGCTDVSIKHSNHQAL-----KDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL----~~~--~l~l~g~~~v~~~~~~~~~~-----~d~d~~~~hg~l~fra~~~~~~ 145 (145) .+++.+-+.+ -+. ..++.|+.+..+.+...-.- .+.|+ | +++|.+++|--. T Consensus 70 d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~---~-~vrytgRY~~g~ 131 (134) T protein:vir:10 70 NEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDG---H-LVRHVGRYEIGV 131 (134) T ss_pred HHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCc---e-EEEEeeeeeecc Confidence 8887776655 233 35567888777776544222 22343 2 677777777777 No 92 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=84.70 E-value=0.021 Score=29.80 Aligned_cols=119 Identities=15% Similarity=0.079 Sum_probs=69.3 Q ss_pred ccchHH-HHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHH Q lcl|NC_014229. 12 MATALP-ALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAH 90 (145) Q Consensus 12 M~~~~~-aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~ 90 (145) |+.-+. ...+-+.++|.. + +++--+-+.+.+.||+++-+....+. .......-.+++|++.. |..+|. T Consensus 1 m~~~saP~~e~~vv~WLsp---~----~~va~~R~~~~PLPf~~V~Rv~G~d~--~e~~tD~avvsv~~fg~--~~eaA~ 69 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSP---L----GKVSTRRLSGDPLPHRVVRRVDGRDV--PEEGSDVAVVSVHTFAA--SDEAAE 69 (134) T ss_pred CCcccCCChheeeeeeccc---c----hhceeccCCCCCCCeEEEEEeCCCCC--cccccccceEEEEEeeC--CHHHhh Confidence 544321 133444455542 1 12222346788999999966554433 23333556789999984 567788 Q ss_pred HHHHHHHHHh----cCC--CCccCCceEEEEEEeeeeee-----ecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 91 RIFAALDAAL----DRV--PLTVAGCTDVSIKHSNHQAL-----KDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 91 ~I~~aV~~aL----~~~--~l~l~g~~~v~~~~~~~~~~-----~d~d~~~~hg~l~fra~~~~~~ 145 (145) .+++.+-+.+ -+. ..++.|+.+..+.+...-.- .+.|+ | +++|.+++|--. T Consensus 70 d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~---~-~vrytgRY~~g~ 131 (134) T protein:vir:10 70 NEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDG---H-LVRHVGRYEIGV 131 (134) T ss_pred HHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCc---e-EEEEeeeeeecc Confidence 8887776655 233 35567888777776544222 22343 2 677777777777 No 93 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=84.06 E-value=0.063 Score=27.15 Aligned_cols=129 Identities=16% Similarity=0.158 Sum_probs=68.1 Q ss_pred ccchH--HHHHHHHHHHhhc-Chhhhhhhhcc--ccCCcc-c--CCCCEEEeccceeeec-CCCccc----ceEEEEEEE Q lcl|NC_014229. 12 MATAL--PALQASVYAKLVG-HAPLTALVSGV--YDEVPE-P--APYPYVSFGSMTEFPE-DAHDRQ----GLSVTVVIH 78 (145) Q Consensus 12 M~~~~--~aLq~Ai~~~L~~-da~l~alv~~I--yD~vP~-~--a~~Pyv~iG~~~~~~~-~~~~~~----~~~~~~~I~ 78 (145) |++|. ++++++|.++|++ -+++. .|++. |..+++ + ++.=||+++..+..+. +..+.. --+.++.+- T Consensus 1 ~~~~~d~~a~~~~IierLka~v~~l~-~V~~aadla~i~e~~q~tPaayVv~~gd~~~~~~~~~~~~~~~Q~vtq~f~Vv 79 (157) T protein:vir:79 1 MSDPFDYLFLEPLLIERIRSEVPGLA-IVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADYQGGRRAIQAIGQQWAVV 79 (157) T ss_pred CCCchhhhhhhHHHHHHHHhhhhhhh-hhccccchhhhhhhcCCCcEEEEEecccccCCCcccccCcceeeeeeeeEEEE Confidence 99994 6889999999985 56664 55433 333443 2 3445676666544322 211111 122222222 Q ss_pred EE--EC---CC---CHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeee-eecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 79 VW--SK---SP---GFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQA-LKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 79 vw--s~---~~---g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~-~~d~d~~~~hg~l~fra~~~~~~ 145 (145) +- +. .+ ...++..+..+|+++|.++..+ .++.- +.+..+.. ..-.+|-.|+ .+.|++.+-+-. T Consensus 80 lavrn~~~~~~~~a~~d~ag~ll~~v~~AL~GW~P~-~~~~p--l~~~~~~~~~~y~~gf~yy-pl~F~~~~~~~~ 151 (157) T protein:vir:79 80 LVVHYADSSNSGEGARREAGPLLGRLVKALTGWAPA-IDVAP--LARSARQSPVTYASGYFYF-PLVFTARFVYPR 151 (157) T ss_pred EEEeccccccccchhHHHHHHHHHHHHHHhcCcccc-ccCCc--eeeeecCCcccccCCeEEE-EEEEEEeeeccc Confidence 22 11 12 2467999999999999998665 33322 22221211 1223443343 566666555544 No 94 >protein:vir:98629 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039928;genbank:gi:126011103;genbank:GeneID:4818465 Probab=81.32 E-value=0.01 Score=31.45 Aligned_cols=117 Identities=15% Similarity=0.094 Sum_probs=66.2 Q ss_pred cccchHHHHHHHHHHHhhcChhhhhhhhcccc-CCcc--cCCCCEEEecccee-e--ecCCCcccceEEEEEEEEEECCC Q lcl|NC_014229. 11 DMATALPALQASVYAKLVGHAPLTALVSGVYD-EVPE--PAPYPYVSFGSMTE-F--PEDAHDRQGLSVTVVIHVWSKSP 84 (145) Q Consensus 11 ~M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD-~vP~--~a~~Pyv~iG~~~~-~--~~~~~~~~~~~~~~~I~vws~~~ 84 (145) =|. .+-+-|+.+|++|+-+... +|+= ..|+ +..-|||+|-+... . .......-..+..+||+| +.. T Consensus 1 mm~----DiL~~Iy~~L~~d~~i~~~--~Ikfye~Pe~~d~~~p~IVI~Pl~~P~p~~~~sd~~ls~~y~yQIDV--es~ 72 (126) T protein:vir:98 1 MVR----DMLAEVFDLLKADNVLKLV--KIKSFERPESLLDDQTSIVILPITAPKQSTFGSDTALSKKFLYQIEV--EST 72 (126) T ss_pred Chh----HHHHHHHHHHhcCceecee--eeeeeecCCccccCcceEEEeeCCCCCcccccCChhhheeeeeeeec--ccc Confidence 333 4556678888888866553 5532 3454 67899999987755 3 334445556888999999 555 Q ss_pred CHHHHHHHHHHHHHHhcCC-CCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEE--EEecC Q lcl|NC_014229. 85 GFAEAHRIFAALDAALDRV-PLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVR--LTLDS 145 (145) Q Consensus 85 g~~ea~~I~~aV~~aL~~~-~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~--~~~~~ 145 (145) +|.+.++|..+|+..|-.. -..++|+ + -+ -++++.++.-.=+||-. +=+|= T Consensus 73 ~R~~~~~i~~rI~~~l~~~gf~q~~~g-l-------de--Y~~Et~ryvdaRrY~G~~k~y~~y 126 (126) T protein:vir:98 73 SRLECKDLQRRIEKQLEKIGFYQNDAG-F-------ER--FDRDTGRYLDARTFRGFSNIYEDY 126 (126) T ss_pred cccchHHHHHHHHHHHHHcCccccccC-c-------ch--hhhhhhhhhhhhhhccCchhhhcC Confidence 7999999999999999421 1122221 0 01 11222222222222221 00000 No 95 >protein:vir:8107 Length: 138 # NCBI annotation: gp11 # Family: family:all:2795 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817688;genbank:gi:29566119;genbank:GeneID:1259313 Probab=79.04 E-value=0.093 Score=26.24 Aligned_cols=120 Identities=13% Similarity=0.061 Sum_probs=66.6 Q ss_pred ccchH----HHHHHHHHHHhhcChhhhhhhhcccc---CCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCC Q lcl|NC_014229. 12 MATAL----PALQASVYAKLVGHAPLTALVSGVYD---EVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSP 84 (145) Q Consensus 12 M~~~~----~aLq~Ai~~~L~~da~l~alv~~IyD---~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~ 84 (145) |.+.- -...+-+..+|. +||- +-+.+.+.||+.+-+....+.. ......-.++|+++.. T Consensus 1 ~~~~~~~~aP~~e~~vv~WLs----------pv~~va~~R~~d~pLPF~~V~Rv~G~d~~--e~~tD~avv~~~~fg~-- 66 (138) T protein:vir:81 1 MADLHDQDAPDEEDFVVCWMQ----------PVMRTAVERDIDAELPFCEVTRIDGADDP--EAGTDNPVIQLDFYAL-- 66 (138) T ss_pred CcccccCCCCchheeeeeecc----------chhccccccCCCCCCCeEEEEEeCCCCCc--cccccCceEEEEEeec-- Confidence 43310 011122222222 2332 2344569999999655544322 2223456789999954 Q ss_pred CHHHHHHHHHHHHHHhcC----C-CCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 85 GFAEAHRIFAALDAALDR----V-PLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 85 g~~ea~~I~~aV~~aL~~----~-~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) |..+|+.+++.+-+.+-. . ..+++|+.+..+.+-.....+-+=--.--.+++|.+++|--. T Consensus 67 g~eaA~d~a~~vHrRM~kL~~~~~~vTl~dGt~~~ld~~~~~~~P~~~~y~dD~ivRYtaRY~~g~ 132 (138) T protein:vir:81 67 GAEAAKAAAKQGHRRMLFLFRNFPTVTLSDGTLADLDFGETLIKPFRMAFEHDQIVRYTARYQLGT 132 (138) T ss_pred CHHHHHHHHHhHHHHHHHHhhcccceecCCCceEecchhhhhccccccccCCCeeeEeeeeeeccc Confidence 577888888888666532 2 346788887777765554433211111124778888887776 No 96 >protein:vir:3874 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:28620 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680491;swissprot:trembl:p94215;genbank:gi:22296531;uniprot:P94215;genbank:GeneID:951676 Probab=78.97 E-value=0.032 Score=28.74 Aligned_cols=106 Identities=16% Similarity=0.153 Sum_probs=68.0 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccC-----CCCEEEeccceeeecC-CCcc-cceEEEEEEEEEECC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPA-----PYPYVSFGSMTEFPED-AHDR-QGLSVTVVIHVWSKS 83 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a-----~~Pyv~iG~~~~~~~~-~~~~-~~~~~~~~I~vws~~ 83 (145) |. | + +.|++-|.+|..|.+++. ++|...|+++ ..|||.+-..-..+.. +++. ..+=-+++++-|-+. T Consensus 1 ~~-P--E--~~vaDiLsad~~lv~~mYipift~tpdd~fik~SsAPWiRiTpiPGDda~yaDD~R~~EYPrVqVDfWvr~ 75 (114) T protein:vir:38 1 MA-P--E--KRVYDILSANLDIADKVYIGTPNFNNQTSATPESLAPWVRITYLPGDAADYADDSRILEYPKVQVDFWVGI 75 (114) T ss_pred CC-c--h--hhhhhhhccchhhhhheeccCCCCCCCCcccccccCCeeEeeecCCccccccccceeeecCceeEEEeecc Confidence 32 2 2 348899999999999986 7888777653 5899998665544332 2333 334458999999999 Q ss_pred CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEE Q lcl|NC_014229. 84 PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHIN 134 (145) Q Consensus 84 ~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~ 134 (145) .+-.+..+|-..|+++||... |. |+-.. +=+||.-..-. T Consensus 76 e~~d~~e~iqe~IY~~Lha~g-----we----RYY~n---sY~D~~~~~~~ 114 (114) T protein:vir:38 76 TDWDQQEKIETQIYQALHAAD-----WE----RYYRN---SYVDGIPQPFA 114 (114) T ss_pred CChhhHHHHHHHHHHHHHhcC-----cc----eeeec---cccCCCCCCCC Confidence 999999999999999997532 21 01000 01122110000 No 97 >protein:vir:79047 Length: 145 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110730;genbank:gi:134287347;genbank:GeneID:4955221 Probab=75.61 E-value=0.14 Score=25.19 Aligned_cols=119 Identities=16% Similarity=0.146 Sum_probs=77.4 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhh--hccccC-CcccCCCCEEEeccceee-ecCCCcccceEEEEEEEEEECC-CCH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALV--SGVYDE-VPEPAPYPYVSFGSMTEF-PEDAHDRQGLSVTVVIHVWSKS-PGF 86 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv--~~IyD~-vP~~a~~Pyv~iG~~~~~-~~~~~~~~~~~~~~~I~vws~~-~g~ 86 (145) |. .++..||-..|+.. .- -.||+. ++++-..|+-.+--.+.+ ........-..+++.|+=+.+. ... T Consensus 1 mi---~dI~~aI~~~Lk~~-----Fp~~~~IY~e~i~Qgf~~PcFFI~ll~~~~~~~~~~r~~r~~~~dI~Yfp~~~~~~ 72 (145) T protein:vir:79 1 ML---NNIIDGISVKLDKS-----FGEKYTIYSEDVEQGINEPCFFIVPLNPSKTPYPSGRELKKNSFDVHYFPRSEAKN 72 (145) T ss_pred Ch---HHHHHHHHHHHHHh-----cCCceEEEecccccCccCCeeEEEEeccccccccCceEEEEEEEEEEEeecCCCCc Confidence 65 68999999999852 21 158984 899988888655333333 2334455668888899888754 356 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .++.++++.+...|+. +.+.|.. +..+=.+.++. ||. -|-.++++..+.... T Consensus 73 ~e~~ev~e~L~~~le~--i~v~~~~-~~~~~~~~eiv---Dgv-Lhf~~~~~~~~~k~~ 124 (145) T protein:vir:79 73 FEINEIAEMLLEELEY--IEINGDL-VRGTNMNFEII---DNV-LHFFVDYNYFTIKSN 124 (145) T ss_pred hhHHHHHHHHHhhhcc--eeecCcE-EeeecceeEEe---ece-EEEEEEEEEEEeeec Confidence 7999999999999943 5665543 33333334443 543 466776666654444 No 98 >protein:vir:99226 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950461;genbank:gi:119953662;genbank:GeneID:4643086 Probab=73.35 E-value=0.17 Score=24.78 Aligned_cols=129 Identities=19% Similarity=0.202 Sum_probs=68.9 Q ss_pred ccchH--HHHHHHHHHHhhc-Chhhhhhhhcc--ccCCcc---cCCCCEEEeccceeeecC-CCcccc------eEEEEE Q lcl|NC_014229. 12 MATAL--PALQASVYAKLVG-HAPLTALVSGV--YDEVPE---PAPYPYVSFGSMTEFPED-AHDRQG------LSVTVV 76 (145) Q Consensus 12 M~~~~--~aLq~Ai~~~L~~-da~l~alv~~I--yD~vP~---~a~~Pyv~iG~~~~~~~~-~~~~~~------~~~~~~ 76 (145) |++|. ++++++|.++|++ -+++. .|+.. |..+++ .++-=||.++..+..+.. ..+..+ ....+. T Consensus 1 ~~~~~d~~a~~~~IierLka~vp~l~-~V~~aadla~i~~~~q~tPaayVi~~gd~~~~~~~~~~~~~~~Q~i~q~~~Vv 79 (157) T protein:vir:99 1 MSDPFDYLFLEPLLIERIRSEVPGLA-IVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADHQGGRRAIQAIGQQWAVV 79 (157) T ss_pred CCCchhhhhhhHHHHHHHHhhhhHHH-hhhcccchHHHhhccCCCcEEEEEecccccCCCcccccccceeeeeeeeEEEE Confidence 99994 6899999999984 56665 55532 334443 344456777666543211 111111 112222 Q ss_pred EEEEEC---CCC---HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 77 IHVWSK---SPG---FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQAL-KDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 77 I~vws~---~~g---~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~-~d~d~~~~hg~l~fra~~~~~~ 145 (145) +-|-+- .+| ..++.++.++|+++|.++..+- +. ..++...+... .=.+|-.|+ .+.|++.+-+-. T Consensus 80 lavr~~~~~~~g~~a~d~ag~ll~~v~~AL~GW~P~~-~~--~pl~~~~~~~~~~y~~gf~yy-pl~F~~~~~~~~ 151 (157) T protein:vir:99 80 LVVHYADSSNSGEGARREAGPLLGRLVKALTGWAPAI-DV--APLARSARQSPVTYASGYFYF-PLVFTARFVYPR 151 (157) T ss_pred EEEeccccccccchhHHHHHHHHHHHHHHhcCCcCcc-cC--CceeeeecCCcccccCceEEE-EEEEEEeeeccc Confidence 333211 122 3679999999999999986542 22 22222211111 113444443 666766666555 No 99 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=70.67 E-value=0.21 Score=24.35 Aligned_cols=121 Identities=14% Similarity=0.044 Sum_probs=61.4 Q ss_pred hHHHHHHHHHHHhhcChhhhhhhhccccCCcc---cCCCCEEEe----ccceeeecCCCcccceEEEEEEEEEEC-CCCH Q lcl|NC_014229. 15 ALPALQASVYAKLVGHAPLTALVSGVYDEVPE---PAPYPYVSF----GSMTEFPEDAHDRQGLSVTVVIHVWSK-SPGF 86 (145) Q Consensus 15 ~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~---~a~~Pyv~i----G~~~~~~~~~~~~~~~~~~~~I~vws~-~~g~ 86 (145) --.++..|+.++|.+-+. . +.=.|.++.- +.--+|+.+ +.+.....+-||. ...-.+||+|... +.|. T Consensus 1 ~hyE~~~a~r~~la~~~~--~-lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L~~d~r-~y~Gv~QI~Vv~paG~G~ 76 (132) T protein:vir:10 1 MHYELSAAARAAFLSKYR--D-FPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSIDRKCK-SYIAIVQIGVVFPPGSGV 76 (132) T ss_pred CchHHHHHHHHHHHhhhc--C-CcEeecCCCcCCCCCCceEEEEEEccCCceeeeccCcCc-EEEEEEEEEEEecCCCCc Confidence 234666666666653211 0 1112444322 111244433 3333344443333 4556788888775 4689 Q ss_pred HHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 87 AEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 87 ~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .++.+|++.|.+.+.+.- .++.+.+..--.++ ..+.+. ..|-..+ |+.++.|+ T Consensus 77 ~~a~~iAd~i~~~F~~g~-~l~~Gyi~~~~~~~-p~i~~~--s~~~iPv--rf~yR~Dt 129 (132) T protein:vir:10 77 DEARLKAKEIADFFKDGK-MLNVGYIFEGAIVH-QIVKHE--SGWMIPV--RFTVRVDT 129 (132) T ss_pred chhHHHHHHHHHhccCcc-eeecceecCCCccC-CceeCC--cceEEEE--EEEEEecc Confidence 999999999999996543 33333333222222 333332 3454444 55555565 No 100 >protein:vir:105468 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529878;genbank:gi:90592618;genbank:GeneID:3974532 Probab=58.32 E-value=0.42 Score=22.67 Aligned_cols=118 Identities=11% Similarity=0.138 Sum_probs=70.4 Q ss_pred HHHHHHHHHHhhcChhhhhhhhccccC-CcccCCCCEEEeccceeee-cCCCcccceEEEEEEEEEEC-CCCHHHHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVSGVYDE-VPEPAPYPYVSFGSMTEFP-EDAHDRQGLSVTVVIHVWSK-SPGFAEAHRIF 93 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~~IyD~-vP~~a~~Pyv~iG~~~~~~-~~~~~~~~~~~~~~I~vws~-~~g~~ea~~I~ 93 (145) .+|..||-++|+..=+ . ..||+. ++.+-..|+--|--.+.+. .......-+++++.|+=+.+ .+...++.+++ T Consensus 1 ~~ii~~I~~~L~~~fp--d--~~IY~e~i~Qg~~~PcFFI~~l~~~~~~~~~~ry~r~~~fdI~Yfp~~~~~~~e~~~va 76 (135) T protein:vir:10 1 MTIVERIAKRISEIFP--D--VTIYSEKQKSGFQVPSFYISKIMTVTKSRFFDIQDRSLSYSITYFANPDRPNADMEEVE 76 (135) T ss_pred ChhHHHHHHHHHHhcC--c--eeeecccccCCCcCCeeEEEEecCCccccccceEEEEeeEEEEEeecCCCchhhHHHHH Confidence 6888888888885211 0 259974 8999999985554443332 23445666888999998875 34588999999 Q ss_pred HHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCc-eEEEEEEEEEEEEecC Q lcl|NC_014229. 94 AALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPG-VRHINAEYRVRLTLDS 145 (145) Q Consensus 94 ~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~-~~hg~l~fra~~~~~~ 145 (145) +.+.+.|.-- ++ . ++..-.+.+... .|+. +.+-.++|+..-+.+. T Consensus 77 e~L~~~le~i----~~-~-~~~~~~~~~i~~-~D~VLhf~~~~~~~~~k~~~~ 122 (135) T protein:vir:10 77 QKLLNNFTRL----DD-Y-ATVRNRETTINQ-DDETLVMSFDLRLEMYPVQDG 122 (135) T ss_pred HHHHHhhhhc----Cc-e-eEEeCCceEEEe-ecCeEEEEEEEEEEEeecCCc Confidence 9998888432 11 1 222222233321 1432 3444444444444444 No 101 >protein:vir:78124 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:29862 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294806;genbank:gi:149882827;genbank:GeneID:5309152 Probab=54.32 E-value=0.51 Score=22.20 Aligned_cols=126 Identities=17% Similarity=0.155 Sum_probs=62.1 Q ss_pred ccchHHHHHHHHHHHhhcC---hhhhhhhhccccCCccc--CCC--CEEEeccceeeecCCCcccceEEEEEEEEEECCC Q lcl|NC_014229. 12 MATALPALQASVYAKLVGH---APLTALVSGVYDEVPEP--APY--PYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSP 84 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~d---a~l~alv~~IyD~vP~~--a~~--Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~ 84 (145) |.-.-..|..=+-+.|+++ +++-+. +-...|.+ -+| |.|++-.-..... .-...-+.+-+++--|++.+ T Consensus 1 ~~v~PPDlE~fl~~~LRa~i~~adVDgq---vGnk~Pd~y~g~y~~PLvvVRDDgG~~~-d~~tFDRSiGvnVlgwtrqd 76 (139) T protein:vir:78 1 MRVAPPDLEEWFTALLRAEVRAAGVDAE---VGNKEPDNLRVPLRRPLIVVRDDSGDRR-DWTTFDRSVGFTVLAGTKQN 76 (139) T ss_pred CccCCccHHHHHHHHHHhhccccCcccc---ccCcCCCCccccccCCeEEEEcCCCCcc-cceeeecccceeeeeccccC Confidence 5433222222233444432 233332 22333443 344 8888743332222 12223456678888999875 Q ss_pred CHHHHHHHHHHHHHHhcCCCCcc-CCceEEEEEEeeee----eeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 85 GFAEAHRIFAALDAALDRVPLTV-AGCTDVSIKHSNHQ----ALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 85 g~~ea~~I~~aV~~aL~~~~l~l-~g~~~v~~~~~~~~----~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) -+-+++++..|..+|++.++.| +|-.++...+..-+ +-.|.|...|.-+++|.. .-| T Consensus 77 -~KPc~dLArrVy~~lt~hp~~LiegSpi~aVv~dgCnGPYpVsdd~d~aryYltveYst---~G~ 138 (139) T protein:vir:78 77 -DKPANDLARVVASIVHDHELPLIEGSPIAAVVFDGCRGPYAVPDTIDVARRYLTGQYVA---SGS 138 (139) T ss_pred -chhhHHHHHHHHHHhccCcceeecCCceEEeecccCCCCCCCCcchhheeeeeEEEEee---ecc Confidence 4679999999999999999887 44444433332211 111223333333333322 222 No 102 >protein:vir:6215 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:10885 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852595;genbank:gi:31415855;genbank:GeneID:1489213 Probab=52.91 E-value=0.54 Score=22.04 Aligned_cols=106 Identities=15% Similarity=0.211 Sum_probs=54.8 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-ccc-cCCcccCCCCEEEeccceeeecCCCccc-ceEEEEEEEEEECCCCHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVY-DEVPEPAPYPYVSFGSMTEFPEDAHDRQ-GLSVTVVIHVWSKSPGFAE 88 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~Iy-D~vP~~a~~Pyv~iG~~~~~~~~~~~~~-~~~~~~~I~vws~~~g~~e 88 (145) |+---..|+.++ +. +| .|| |..|.++.|||+++..+...-.-+.+.. -....-||.+++.+.. .+ T Consensus 1 M~i~Fe~lr~~L----k~-------~g~~V~RD~ap~~t~YPyivYs~v~e~~k~AS~kv~~~~~~YQvSl~T~GtE-~d 68 (109) T protein:vir:62 1 MQINFEQLRSLM----KK-------SGIPVSRDNAPTGIDYPYIVYEFVNEQHKRASNKVLKDMPLYQIAVITNGTE-KD 68 (109) T ss_pred CcccHHHHHHHH----Hh-------cCCceeeccCCCCCCCceEEEEeecCceeeeccceEeecceeEEEEeeccch-hH Confidence 655444555433 32 12 577 7899999999999977766544333333 3344569999997632 22 Q ss_pred HHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEE Q lcl|NC_014229. 89 AHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLT 142 (145) Q Consensus 89 a~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~ 142 (145) ...+..++++..+...++. ..+--...|+.+. -.++-=.+| T Consensus 69 ----l~~l~k~f~~~~vpfs~f~-------gIqgDENDdTiTn--fyTyVrcie 109 (109) T protein:vir:62 69 ----YEPLKAVFNEVGVSYSQFD-------GMDYDENDDTITQ--FITYVRCIQ 109 (109) T ss_pred ----HHHHHHHHhhcCCcccccc-------ccCCCCCcchhee--eeeeeEEeC Confidence 3445666666554444432 1111112233332 111111222 No 103 >protein:vir:95155 Length: 151 # NCBI annotation: hypothetical protein ORF015 # Family: family:all:5248 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293422;genbank:gi:148912843;genbank:GeneID:5228230 Probab=49.39 E-value=0.64 Score=21.65 Aligned_cols=127 Identities=15% Similarity=0.124 Sum_probs=65.6 Q ss_pred ccchHHHHHHHHHHHh-----hcChhhhhhhhc-cccCCcc----cCCCCEEEec------cc----eeeecCCCcccce Q lcl|NC_014229. 12 MATALPALQASVYAKL-----VGHAPLTALVSG-VYDEVPE----PAPYPYVSFG------SM----TEFPEDAHDRQGL 71 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L-----~~da~l~alv~~-IyD~vP~----~a~~Pyv~iG------~~----~~~~~~~~~~~~~ 71 (145) |.. -.+++..|.+++ .+++++.-.... -|..+|. +..-+|+.|- .. ....-..+-...+ T Consensus 1 ~mt-f~q~R~~i~~~~~~~w~~~~~~~a~~~p~v~~~~~~~~d~P~g~~~WaRLti~h~~~~qA~ls~~~eigggp~~~r 79 (151) T protein:vir:95 1 MIE-FDQVNDEVNALFLATWNAGSAAIAGYVPEIRWQGVQYRDLPDGSKFWVRLSKQTVFEEQATLSTCEGVPGQRKYTA 79 (151) T ss_pred Ccc-HHHHHHHHHHHhhhhcccCchhhhccccccccCCCCCCCCCCCCCceEEEEeecCCCccccccccccCCCCceEee Confidence 555 457888898888 334443221112 2444332 3357888882 11 1111111222233 Q ss_pred EEEEEEEEEECCCCHHHHHHHHHH----HHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 72 SVTVVIHVWSKSPGFAEAHRIFAA----LDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 72 ~~~~~I~vws~~~g~~ea~~I~~a----V~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .-.+.|.|+... +..++.+++++ .+++...+. +..+ ++++-.+...+. +|+++|...|..++-+.+=. T Consensus 80 tGli~VQiF~p~-~~G~~Le~Adkla~~a~eaFe~~~-t~g~---i~f~~~s~~eiG-~~~gWyQ~Nv~i~f~y~e~~ 151 (151) T protein:vir:95 80 SGLVFVQIFCPK-SNTQAFELGQKLAKLARNAFRGKS-TPGK---VWFRNTRINELP-PEELYERFNVVTEFEYDEIG 151 (151) T ss_pred CcEEEEEEeeec-cCchhhHHHHHHHHHHHHHhhccC-CCCC---ceeeeeeecccC-CCCCeEEEEeeeeecccccC Confidence 445666666653 12244444544 577886653 2222 445555555555 46689988877777666555 No 104 >protein:vir:1580 Length: 134 # NCBI annotation: minor capsid protein # Family: family:all:1267 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695162;swissprot:trembl:o03934;genbank:gi:23455807;uniprot:O03934;genbank:GeneID:955515 Probab=43.74 E-value=0.83 Score=21.02 Aligned_cols=125 Identities=10% Similarity=0.085 Sum_probs=68.5 Q ss_pred HHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHHHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHRIFAAL 96 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~I~~aV 96 (145) ..|+..|.+.....++|. +.-..+..+.+...-.+-+--.......-++. .+..+...|--+.+++..|.+....| T Consensus 1 mDf~e~l~~~I~~~~~Lp--~k~~~~yL~~~~sl~lyp~PGs~~~~ey~dG~--~~~sl~fEIa~ktKd~~~a~~~Lw~I 76 (134) T protein:vir:15 1 MDLLERLAASINKVPNLP--MKCTLGYLTAADSLSLYPLPGSRVLDEDYAGN--QQWQMNYEVGMRTKNQQQANTTLWLV 76 (134) T ss_pred CChHHHHHHHhhcccCCC--ceeeecccCCCCcEEEEECCCCcccccccCCc--eeEEeeeeeecccchhHHHHHHHHHH Confidence 467777777776555542 11123444433331111111111112222233 45555555655666799999999999 Q ss_pred HHHhcCCCCc-c---C-CceEEEEEEeeeeeeecCCC-ceEEEEEEEEEEEEecC Q lcl|NC_014229. 97 DAALDRVPLT-V---A-GCTDVSIKHSNHQALKDPEP-GVRHINAEYRVRLTLDS 145 (145) Q Consensus 97 ~~aL~~~~l~-l---~-g~~~v~~~~~~~~~~~d~d~-~~~hg~l~fra~~~~~~ 145 (145) -..|+.-.+. + . .|.+..+.+.....+.+.|. +++-=.+.|.+.+.... T Consensus 77 s~~Ld~~~~~~l~S~NgSf~f~~levt~~P~~~~~D~qG~~~Ylld~~v~i~~~~ 131 (134) T protein:vir:15 77 SQALDVLTADDLVSSNGSFEFESLTINGQPSISEQDTQGYSTYQLSFSVIVNTFT 131 (134) T ss_pred HHHHhhcCcccceecCCCEEeecceecCCCceeeeccCceEEEEEeeEEEEEEee Confidence 9999855442 3 2 27777888877766665543 34444556665555544 No 105 >protein:vir:95371 Length: 104 # NCBI annotation: aminopeptidase # Family: family:all:1089 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764481;genbank:gi:115334635;genbank:GeneID:5179258 Probab=41.78 E-value=0.91 Score=20.80 Aligned_cols=102 Identities=18% Similarity=0.190 Sum_probs=64.8 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhc-cccCCcccCCCCEEEeccceeeecCCCcccc-eEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSG-VYDEVPEPAPYPYVSFGSMTEFPEDAHDRQG-LSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~-IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~-~~~~~~I~vws~~~g~~ea 89 (145) |+- +-|.+.|++ .+| + .|++--.++.+||+++=........+++..- ..-.++|..+++..++..= T Consensus 1 Mt~------~~l~~~Lk~-~gl-----Pvay~hF~~~p~pPyivy~~~~~~~~~ADn~~y~~~~~~~IELYT~~Kd~~~E 68 (104) T protein:vir:95 1 MKL------TELDDLLKA-TGL-----PVAYSHFSKPQKPPFITYMVAYSSNFTADDQVYQEIENVQIELYTLKKDFEAE 68 (104) T ss_pred CCH------HHHHHHHHh-cCC-----CeeeccccCCCCCceEEEEecCCcceeccceEEEeecceEEEEEeeccCHHHH Confidence 553 335666663 111 3 5777666677899999888888877776644 4457899999998764332 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVR 140 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~ 140 (145) ..|+++|++..+. +..++.--|.+ ..+...-+|++. T Consensus 69 ----~~iE~~Ld~~~i~----------y~k~et~IesE-klyq~~Y~~~l~ 104 (104) T protein:vir:95 69 ----EKVKAVLDANNLV----------YETSETYIPSE-KLYQKVYEVRLL 104 (104) T ss_pred ----HHHHHHHHhCCCc----------eeeEEEEecCc-ceEEEEEEEEeC Confidence 2677788655433 33444444333 566667777777 No 106 >protein:vir:80429 Length: 150 # NCBI annotation: BcepGomrgp11 # Family: family:all:5248 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210231;genbank:gi:146329923;genbank:GeneID:5123538 Probab=39.36 E-value=1 Score=20.53 Aligned_cols=129 Identities=10% Similarity=0.125 Sum_probs=66.6 Q ss_pred ccchHHHHHHHHHHHhhc-----C-hhhhhhhhcc-ccC----CcccCCCCEEEecccee--eecCCCc-ccc----eEE Q lcl|NC_014229. 12 MATALPALQASVYAKLVG-----H-APLTALVSGV-YDE----VPEPAPYPYVSFGSMTE--FPEDAHD-RQG----LSV 73 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~-----d-a~l~alv~~I-yD~----vP~~a~~Pyv~iG~~~~--~~~~~~~-~~~----~~~ 73 (145) |.--.+..+..|..++.+ + +++.+-+..| ++. .|.+...||+.|--... ...+..+ ..+ +.- T Consensus 1 ~~~~~~~ar~ei~~~f~~~W~~~~~~~~~g~~~~~~w~~~~~~~pP~g~~~WaRLti~h~~~~qA~~~~~~~gr~~~r~G 80 (150) T protein:vir:80 1 MIQDALQARSDINTMLFDQWSVADWSKVKGGKPNIAWEGRESARPPDGSAPYVAIFIKHVDGQQASLTDPDMLRRWSRDG 80 (150) T ss_pred CcchhhhhHHHHHHHHhhhhccCcchhhcCCcceeeecCcccCCcCCCCCceEEEEEecCCcccccccCCCCcceEeeCc Confidence 665555555555555443 2 3333333223 232 35556678877742222 2222111 111 223 Q ss_pred EEEEEEEEC---CCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 74 TVVIHVWSK---SPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 74 ~~~I~vws~---~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .+.|.|+.. +.|..-+.+.++..+++...++-. .+ +.++-.+...+. +|+++|...|..++...+=- T Consensus 81 lI~VQiF~p~~~G~G~~la~k~Ad~a~eaFe~~~t~-g~---i~f~~as~~eiG-~d~gWYQ~NV~ipF~yde~r 150 (150) T protein:vir:80 81 LITVQCFGMLSAGQGLEDATYQATIAMRAFEGKQSA-NG---IWFRNARIKEIG-SDRGWYQVNMIVEFEYDEVR 150 (150) T ss_pred EEEEEEeeeccCCchhhHHHHHHHHHHHHHhccCCC-CC---cccccccccccC-CCCceEEEEeEeeeeccccC Confidence 455666544 467778888999999999877522 22 233344444444 35588876666655443333 No 107 >protein:vir:4461 Length: 186 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700384;genbank:gi:23505456;genbank:GeneID:955663 Probab=38.62 E-value=1.1 Score=20.45 Aligned_cols=125 Identities=15% Similarity=0.140 Sum_probs=60.6 Q ss_pred ccchHHHHHHHHHHHhhcC-hhhhhhhhc--cccCCcc----cCCCCEEEeccceeeecCCCcccceEEEEEEEEE---- Q lcl|NC_014229. 12 MATALPALQASVYAKLVGH-APLTALVSG--VYDEVPE----PAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVW---- 80 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~d-a~l~alv~~--IyD~vP~----~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vw---- 80 (145) |+- ..|.+||++- |.+...|++ =|..+++ +++.=||..+.........-+.....++.++.|+ T Consensus 1 mkl------~~Vi~RLra~vP~l~~rV~gaad~aai~~~~~lp~PaAyVip~~d~~g~~~s~g~~~Q~i~~~f~Vvl~vr 74 (186) T protein:vir:44 1 MKL------TPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLRLPAAYVVPAEDVTGEQKSQTDYWQDLTEGFSVIVVLS 74 (186) T ss_pred CCh------hHHHHHHHHhcchhhhhhhhhhhhhhhhhhcCCCCceEEEEeccccCCCCCcccceeEeeeeeEEEEEEEe Confidence 664 3566666532 333333432 3444544 2233377776665543332232223333333333 Q ss_pred --ECCCC----HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCc-eEEEEEEEEEEEEecC Q lcl|NC_014229. 81 --SKSPG----FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPG-VRHINAEYRVRLTLDS 145 (145) Q Consensus 81 --s~~~g----~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~-~~hg~l~fra~~~~~~ 145 (145) .+..| ..+...+.++|+.||-++... +++. .+.+...+...=.+|. .|.=.-++.+.+..+. T Consensus 75 n~~d~~G~~aa~D~l~~lr~~v~~AL~GW~P~-~~~~--pi~~~gG~lvd~~~g~l~y~~~F~~~~~l~~~~ 143 (186) T protein:vir:44 75 NERDEKGQWASYDAVHDVRQEIWKALLGWEPD-SQVH--EIQYAGGMLLDLNRHELYYQFDFTVKYEITETD 143 (186) T ss_pred ccCCCCCCccchHHHHHHHHHHHHHHcCcCcC-CCCc--eEEEcCceEEeecCcEEEEEEEEEEeeccCCCC Confidence 12233 355778899999999988766 4544 4556566555433442 3322222222222222 No 108 >protein:vir:1994 Length: 182 # NCBI annotation: Hypothetical protein # Family: family:all:1387 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050641;genbank:gi:9633528;genbank:GeneID:2636286 Probab=38.07 E-value=1.1 Score=20.39 Aligned_cols=120 Identities=16% Similarity=0.162 Sum_probs=61.0 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh------cccc-C-Cc---ccCCCCEEEeccceeeecCCCcccceEEEEEEEEE Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS------GVYD-E-VP---EPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVW 80 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~------~IyD-~-vP---~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vw 80 (145) |.+ ++..||.++|++- +-..+. |=|| . +. .+.+.=||+++.... ........-++.+-|. T Consensus 1 mI~---~iEdAi~~rl~~~--~g~~v~~V~sy~Gefd~e~l~~~~~~~PAv~Va~~G~~~----~~~r~~~~~r~~v~V~ 71 (182) T protein:vir:19 1 MLE---ETEAALLARVREL--FGATLRQVEPLTGTWTNEDVHRLFLAPPSVFLAWMGCGE----GRTRREVESRWAFFVV 71 (182) T ss_pred ChH---HHHHHHHHHHHHH--hhhhhhhhccCCCCCChhhhhHhhhcCceeEEEeccccC----cCCceeeeeEEEEEEE Confidence 654 6888888888863 212221 2343 2 22 234555777753221 1122334445666665 Q ss_pred ECC-CC----HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeee----cCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 81 SKS-PG----FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALK----DPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 81 s~~-~g----~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~----d~d~~~~hg~l~fra~~~~~~ 145 (145) ++. .| +..+.+|.++|+..|+++.+...+ .++..+.+.+. +..|..- =.++|+....-++ T Consensus 72 a~~~~g~~~~rvG~y~lv~~v~~lL~~q~~g~~~----~l~p~~vrnL~s~~~~~~gvsv-yavef~~~~~lp~ 140 (182) T protein:vir:19 72 AELLNGEPVNRPGIYQIVERLIAGVNGQTFGPTT----GMRLTQVRNLCDDNRINAGVVL-YGVLFSGTTPLPS 140 (182) T ss_pred ecCCCChhhhhhhHHHHHHHHHHHHhccCCCCcc----ccccceeeeeechhhhhCceEE-EEEEeeccccCCC Confidence 543 22 345889999999999987766543 13333333332 2233221 2344443333332 No 109 >protein:vir:101606 Length: 142 # NCBI annotation: hypothetical protein # Family: family:all:26512 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112511;genbank:gi:53793611;uniprot:Q5ZGE2;genbank:GeneID:3101714 Probab=37.89 E-value=0.7 Score=21.44 Aligned_cols=129 Identities=13% Similarity=0.135 Sum_probs=73.2 Q ss_pred c--cchHHHHHHHHHHHhhcChhhhhhhhcccc-CCccc-CCCCEEEecccee-eecCCCcccceEEEEEEEEEECC--- Q lcl|NC_014229. 12 M--ATALPALQASVYAKLVGHAPLTALVSGVYD-EVPEP-APYPYVSFGSMTE-FPEDAHDRQGLSVTVVIHVWSKS--- 83 (145) Q Consensus 12 M--~~~~~aLq~Ai~~~L~~da~l~alv~~IyD-~vP~~-a~~Pyv~iG~~~~-~~~~~~~~~~~~~~~~I~vws~~--- 83 (145) | +.|..-+++|+|+.+.+ --+....-.-|| ++..+ +.--||.+-.... .+..+||...++.++-|.++++. T Consensus 1 migtnpdkyirkavfdlinn-ivvntktikcydtrvtgnaavneyvlltnqtkeidkatkcvynwetsllieiytktssn 79 (142) T protein:vir:10 1 MIGTNPDKYIRKAVFDLINN-IVVNTKTIKCYDTRVTGNAAVNEYVLLTNQTKEIDKATKCVYNWETSLLIEIYTKTSSN 79 (142) T ss_pred CCCCchhHHHHHHHHHHhhh-heeccceeEEeeeeeccccccceeEEeeccchhhhhhhheeeeccceeEEEEeeeccCC Confidence 4 46788899999997664 111221124688 45544 4567888766554 36679999999999999999763 Q ss_pred ---CCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecC--CCceEEEEEEEEEEEE Q lcl|NC_014229. 84 ---PGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDP--EPGVRHINAEYRVRLT 142 (145) Q Consensus 84 ---~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~--d~~~~hg~l~fra~~~ 142 (145) ++|.-+.+|-++|+.++ ...|+++.|--....+.+..-+... ....++.-+++...+- T Consensus 80 gnsgsrllvndieqaiytli-nptltienfinqtqnvtfetqletittteiifrsfirlnltli 142 (142) T protein:vir:10 80 GNSGSRLLVNDIEQAIYTLI-NPTLTIENFINQTQNVTFETQLETITTTEIIFRSFIRLNLTLI 142 (142) T ss_pred CCccceehhhhHHHHHHHHh-CcceehhhhhchhhcceeeeeeeehhhHHHHHhhhhheeeeeC Confidence 34677888888888887 3556776643221111111100000 0011112222221111 No 110 >protein:vir:103883 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938246;genbank:gi:38229151;genbank:GeneID:2648198 Probab=37.33 E-value=1.1 Score=20.30 Aligned_cols=131 Identities=14% Similarity=0.100 Sum_probs=62.8 Q ss_pred CcccchH--HHHHHHHHHHhhc-Chhhhhhhhcc--ccCCc---ccCCCCEEEeccceeeecC-CCcccc------eEEE Q lcl|NC_014229. 10 GDMATAL--PALQASVYAKLVG-HAPLTALVSGV--YDEVP---EPAPYPYVSFGSMTEFPED-AHDRQG------LSVT 74 (145) Q Consensus 10 ~~M~~~~--~aLq~Ai~~~L~~-da~l~alv~~I--yD~vP---~~a~~Pyv~iG~~~~~~~~-~~~~~~------~~~~ 74 (145) =+|+.|. ++++++|.+||++ -++|. .|+.. |..+. +.++.=||.+......+.. .....+ .... T Consensus 1 ~~~~~~~n~lav~~~IieRLka~v~~lr-~V~~aadla~i~el~q~tPaayV~~~g~~~~~~~~~~~~~~~~q~v~q~w~ 79 (159) T protein:vir:10 1 MSTAEPFDYLFLETLLVERIRAEVPGLQ-DVSGVPDLATLDEQRQGSPCVYVVYLGDEIGTGASHQGGSRAIQTVTQHWA 79 (159) T ss_pred CCcccchhhhhhhHHHHHHHHhhhhHHH-hhhcccchHHHHhhhCCCcEEEEEecccccCCCcccccccceeeeeeeEEE Confidence 4666774 6899999999985 56664 44432 22333 3344556776655432211 111112 1222 Q ss_pred EEEEEEECC---C---CHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeec-CCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 75 VVIHVWSKS---P---GFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKD-PEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 75 ~~I~vws~~---~---g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d-~d~~~~hg~l~fra~~~~~~ 145 (145) +.|-|-+-. + ...++.++.++|+++|.++..+ .+. -.++...+....+ .+|--|+ .+.|+..+---. T Consensus 80 Vvlavr~~~~q~~~~a~~d~aG~ll~~v~~AL~GW~P~-~~~--~Pl~r~~~~~~~~y~~gfayy-Pl~F~~~~~~~~ 153 (159) T protein:vir:10 80 AVLTLYYADAQGDGQGARREAGPLLGRLLKALTGWVPD-QGV--TPLARSPQASPVSYSNGFFYF-PLVFTANFVFPR 153 (159) T ss_pred EEEEEecccccCccchhhHHHHHHHHHHHHHhcCcccC-CcC--CCeeecccCCCccccCCEEEe-eeeEEeeeeccc Confidence 333333211 1 1357899999999999887643 222 1122111111111 1332232 444444332222 No 111 >protein:vir:98890 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:1267 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164423;genbank:gi:56694913;genbank:GeneID:3197280 Probab=36.38 E-value=1.2 Score=20.20 Aligned_cols=124 Identities=9% Similarity=0.048 Sum_probs=69.0 Q ss_pred HHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccceeeecCCCcccceEEEEEEEEEECCCCHHHHHHHHHHH Q lcl|NC_014229. 17 PALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVWSKSPGFAEAHRIFAAL 96 (145) Q Consensus 17 ~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea~~I~~aV 96 (145) ..|+..|.+....-++|. +.--.+..+.+...-.+-+--.......-++. .+..+...|--+.+++..|.+....| T Consensus 1 mDf~e~l~~~I~~~~~Lp--~k~~~~yL~~~~sl~lyp~PGs~~~~ey~dG~--~~~sl~fEIa~ktKd~~~a~~~Lw~I 76 (131) T protein:vir:98 1 MDFIERLTERVNSIPGLP--ISCKKGYLGTEESFVVYPLPGSRTVSQYMDGT--KDRRLNYEFAMKSKSQRKIDETLWLV 76 (131) T ss_pred CChHHHHHHHhhccCCCc--ceeeecccCCCCcEEEEECCCCcccccccCCc--eeEEeeeeeecccchhHHHHHHHHHH Confidence 567777777776655543 11123444443331111111111112222233 44555555555666788999999999 Q ss_pred HHHhcCCCCccC---C-ceEEEEEEeeeeeeecCCC-ceEEEEEEEEEEEEecC Q lcl|NC_014229. 97 DAALDRVPLTVA---G-CTDVSIKHSNHQALKDPEP-GVRHINAEYRVRLTLDS 145 (145) Q Consensus 97 ~~aL~~~~l~l~---g-~~~v~~~~~~~~~~~d~d~-~~~hg~l~fra~~~~~~ 145 (145) -..|+.-. .++ | |.+..+.+.+.....+.|. +++-=.+.|.+.+.... T Consensus 77 s~~L~~id-~l~S~NgSf~f~~levt~~P~~~~~D~qG~~~ylld~~v~i~~~~ 129 (131) T protein:vir:98 77 QNVLDDLG-ELESADGSFEFEGIDITNTPFINNADNQGWFVFLLDVQAKITVFE 129 (131) T ss_pred HHHHHhhc-ccccCCCCEEEccceecCCCceeeeccCceEEEEEeeEEEEEEEe Confidence 99987543 332 2 7777888887666665543 44444666666666555 No 112 >protein:vir:104348 Length: 129 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398976;genbank:gi:81343960;genbank:GeneID:3778880 Probab=36.09 E-value=1.2 Score=20.16 Aligned_cols=117 Identities=16% Similarity=0.134 Sum_probs=61.7 Q ss_pred ccchHHHHHHHHHHHhhc----ChhhhhhhhccccCCc-----ccCCC--CEEEeccceeeecCCCcccceEEEEEEEEE Q lcl|NC_014229. 12 MATALPALQASVYAKLVG----HAPLTALVSGVYDEVP-----EPAPY--PYVSFGSMTEFPEDAHDRQGLSVTVVIHVW 80 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~----da~l~alv~~IyD~vP-----~~a~~--Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vw 80 (145) |+ .+.++.+-+++.+ +-.+ .|.++. .+..| .|+.=+.+.....+.||..- .-.+||+|. T Consensus 1 ~s---~aar~~v~d~~~~~~~~~lpV------A~eNv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y-~Gv~QI~Vv 70 (129) T protein:vir:10 1 MS---LAARKFVNDLLVNEFPVRYPV------AWENAAFTPPADGSIWLKYDYTEVDTVTYGLSRKCKYY-VGMVQISVF 70 (129) T ss_pred Cc---hHHHHHHHHHHHHhhcCCCcE------eecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceE-EEEEEEEEE Confidence 54 4555555554443 2111 344432 22222 33334555555666666543 467888887 Q ss_pred EC-CCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEec Q lcl|NC_014229. 81 SK-SPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLD 144 (145) Q Consensus 81 s~-~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~ 144 (145) .. +.|..++.+|++.|.+++.+. +.|+.+.+..-- ..-+.+.+ ...|--.++|-.+. | T Consensus 71 ~p~G~G~~~a~~iA~ei~d~F~~g-~~L~~Gyi~~~~-~~~p~i~~--~~~~~ipvr~~~r~--d 129 (129) T protein:vir:10 71 FSPGTGIDKPRQIANQLAESIVDG-TMLDSGTIYESG-VVNPVIKS--KSGWFIPVRFYVRL--D 129 (129) T ss_pred ecCCCCcchhhHHHHHHHHhccCC-ceeeceeecCCC-eECCeeec--CCceEEeEEEEEEe--C Confidence 65 568999999999999999654 344444222111 11223333 33365555554444 4 No 113 >protein:vir:8331 Length: 150 # NCBI annotation: gp48 # Family: family:all:2795 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817899;genbank:gi:29566332;genbank:GeneID:1259527 Probab=32.70 E-value=1.4 Score=19.77 Aligned_cols=118 Identities=15% Similarity=0.118 Sum_probs=67.1 Q ss_pred ccchH--------------------HHHHHHHHHHhhcChhhhhhhhcccc----CCcccCCCCEEEeccceeeecCCCc Q lcl|NC_014229. 12 MATAL--------------------PALQASVYAKLVGHAPLTALVSGVYD----EVPEPAPYPYVSFGSMTEFPEDAHD 67 (145) Q Consensus 12 M~~~~--------------------~aLq~Ai~~~L~~da~l~alv~~IyD----~vP~~a~~Pyv~iG~~~~~~~~~~~ 67 (145) |+.|. ....+-+..+|. .+|- +.+ +.+.||+.+-+....+. -+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~sapdae~~vv~wLs----------p~~rvA~~R~~-~dplPf~lv~rv~G~d~--pd 67 (150) T protein:vir:83 1 MTEPLDDEEPETPEPPEPEILNEGPADAETFVVKWLG----------EVYRAANTRRP-GDPLPFLLIQQVAGKEN--LD 67 (150) T ss_pred CCCCCCCcCCCCcccCCcccccCCCccHHHHHHHHhh----------HHhhhhhcccC-CCCCCeEEEEecCCCCC--cc Confidence 44331 012223333333 2332 122 34599999966554432 23 Q ss_pred ccceEEEEEEEEEEC-CCCHHHHHHHHHHHHHHhcCC-CCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 68 RQGLSVTVVIHVWSK-SPGFAEAHRIFAALDAALDRV-PLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 68 ~~~~~~~~~I~vws~-~~g~~ea~~I~~aV~~aL~~~-~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) .....--++||++.. ..|-.+|+.+++.+-+.+-.. ..++.++.+..++...+-..-+=... -+++|.++++--+ T Consensus 68 e~td~avvsv~~fg~~v~G~daA~~~ad~vH~RM~~l~r~tl~~Gtld~~~v~~aP~~leY~dD---~vvrYt~RY~~G~ 144 (150) T protein:vir:83 68 ESTADPVVQVDILCDKVDGEDAARDIKDRVHRRMLLLGRYLEMDGTLDWMKVFESPRRLEYTND---KVIRYTARYQFGQ 144 (150) T ss_pred cccccceeeeeeccccccchhhhhhhhhhHHHHHHHHhhhhccCCcchhhhhhccccccccCCC---eEEEeeeeeeccC Confidence 334456789999987 458899999999997776432 45667776655554444332221111 3777888887777 No 114 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=31.91 E-value=1.5 Score=19.68 Aligned_cols=133 Identities=14% Similarity=0.085 Sum_probs=63.1 Q ss_pred CceEEEeeCCcccc-hHHHHHHHHHHHhhcC-hhhhhhhhccccCCcc---cCCCCEEE----eccceeeecCCCcccce Q lcl|NC_014229. 1 MPLLAIWAGGDMAT-ALPALQASVYAKLVGH-APLTALVSGVYDEVPE---PAPYPYVS----FGSMTEFPEDAHDRQGL 71 (145) Q Consensus 1 ~~~~~~~~~~~M~~-~~~aLq~Ai~~~L~~d-a~l~alv~~IyD~vP~---~a~~Pyv~----iG~~~~~~~~~~~~~~~ 71 (145) .-||--+--=.|-. -+.+.++++-+++++- +++. =.|.++.- +..-+|+. =+.+...+.+.||.. - T Consensus 27 ~~~~~~~~~~~~h~ei~~a~rk~l~~~a~a~~~~Lp----VA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~-y 101 (169) T protein:vir:10 27 VTLLRRYRRLNVHYEMMVAARKLVSDAAVDIAGSLP----VAYENCGFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRY-Y 101 (169) T ss_pred hhhhhhhhhcchHHHHHHHHHHHHHHHHhhcccCCc----EeeCCCCcCCCCCCccEEEEEEecCCceeeeccCCCce-E Confidence 11111111111111 1345666666666542 1111 13444321 11123433 344444555555543 3 Q ss_pred EEEEEEEEEEC-CCCHHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEec Q lcl|NC_014229. 72 SVTVVIHVWSK-SPGFAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLD 144 (145) Q Consensus 72 ~~~~~I~vws~-~~g~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~ 144 (145) .-.+||+|... +.|..++.+|++.|.+++.+. ..++.+.+..-- ..-+.+.+ ...|--.++|-.+. | T Consensus 102 ~GVfQIsVV~PaGtG~~ka~qiAdeiadlF~~g-t~L~~Gyi~~~~-~~~p~i~~--~s~~~iPvr~~~R~--D 169 (169) T protein:vir:10 102 VGMVQVSIFFSPGEGTDRPRQLAGRLSEAFADG-TMLDSGYIYEGG-SVFPPVKS--QSGWFIPVRFYVRM--D 169 (169) T ss_pred EEEEEEEEEecCCCCcchhHHHHHHHHHhhhCC-ceeeceeecCCC-eECCeeec--CCceEEeEEEEEEe--C Confidence 46788888765 568999999999999999644 344444222111 11223333 33354555554444 4 No 115 >protein:vir:80109 Length: 104 # NCBI annotation: Putative aminopeptidase # Family: family:all:1089 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425609;genbank:gi:155042942;genbank:GeneID:5469534 Probab=29.98 E-value=1.6 Score=19.45 Aligned_cols=102 Identities=16% Similarity=0.171 Sum_probs=64.9 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhc-cccCCcccCCCCEEEeccceeeecCCCcccc-eEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSG-VYDEVPEPAPYPYVSFGSMTEFPEDAHDRQG-LSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~-IyD~vP~~a~~Pyv~iG~~~~~~~~~~~~~~-~~~~~~I~vws~~~g~~ea 89 (145) |+- +-|.+.|++ .+| + .|++-...+.+||+++=........+++..- ..-.++|..+++..++..= T Consensus 1 Mt~------~~l~~~Lk~-~gl-----Pvay~~F~~~P~pPyivy~~~~~~~~~ADn~~y~~~~~~~IELYT~~Kd~~~E 68 (104) T protein:vir:80 1 MNL------DELNTILKQ-TGF-----PVAYSHFGKPQKPPFITYVVAYSSNFGADDKVYQDIENVQIELYTDKKDLEAE 68 (104) T ss_pred CCH------HHHHHHHHh-cCC-----CeeeecCCCcCCCCEEEEEecCCcceeccceEEEeecceEEEEEeeccCHHHH Confidence 553 346666665 222 3 4777666678899999888888777776644 4557899999998764332 Q ss_pred HHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEE Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVR 140 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~ 140 (145) ..|+++|++..+. +..++.--|.+ ..+...-+|++. T Consensus 69 ----~~iE~~Ld~~~i~----------y~k~et~IesE-klyq~~Y~~~l~ 104 (104) T protein:vir:80 69 ----ERIKAVLDANSLY----------YETTETYIPSE-RLYQKVYEVRLL 104 (104) T ss_pred ----HHHHHHHhhCCCc----------eeeEEEEecCc-ceEEEEEEEEeC Confidence 2677788665433 33444444333 456666677776 No 116 >protein:vir:4515 Length: 186 # NCBI annotation: unknown # Family: family:all:964 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599041;genbank:gi:19548999;genbank:GeneID:935225 Probab=29.97 E-value=1.6 Score=19.44 Aligned_cols=125 Identities=18% Similarity=0.116 Sum_probs=60.9 Q ss_pred ccchHHHHHHHHHHHhhc-Chhhhhhhhc--cccCCcc----cCCCCEEEeccceeeecCCCcccceEEEEEEEEE--E- Q lcl|NC_014229. 12 MATALPALQASVYAKLVG-HAPLTALVSG--VYDEVPE----PAPYPYVSFGSMTEFPEDAHDRQGLSVTVVIHVW--S- 81 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~-da~l~alv~~--IyD~vP~----~a~~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vw--s- 81 (145) |+- ..|.+||++ -|.+...+++ =|-.+++ .++.-||..+.........-+.....+..++.|+ - T Consensus 1 Mkl------~~Ii~RLra~vP~l~grV~gaad~a~l~~~~~lp~PaAyVip~~d~~~~~~sq~~~~Q~i~e~f~Vvl~vr 74 (186) T protein:vir:45 1 MKL------TPVIAALRARCPYFENRVAGAAQFKNLPEVGKLRLPAAYVVPGDDSPGENKSQTDYWQELKEGFSVVVILS 74 (186) T ss_pred CCh------HHHHHHHHHhcchhhchhhhhhhhhhhHhhcCCCCceEEEEecccccCCCccccceeeeeeeEEEEEEEEe Confidence 764 346666653 2333323322 2333433 2344588887766543332222223222222222 1 Q ss_pred ---CCCC----HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCC-ceEEEEEEEEEEEEecC Q lcl|NC_014229. 82 ---KSPG----FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEP-GVRHINAEYRVRLTLDS 145 (145) Q Consensus 82 ---~~~g----~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~-~~~hg~l~fra~~~~~~ 145 (145) ++.| ..+...+.++|++||-++... +++. -+.+...+...=.+| ..|.=.-++.+.+..|. T Consensus 75 n~~d~~G~~aa~D~l~~lr~~v~~AL~GW~P~-~~~~--pi~~~gG~lvd~~~g~l~y~~~F~~~~~l~~~~ 143 (186) T protein:vir:45 75 NGRDERGQFASYDVVDDVRQMLFKALLGWNPE-ACGN--PITYDGGTLLDLNRHELIYQFDFSVISELTEDD 143 (186) T ss_pred ccCCCCCcccchhHHHHHHHHHHHHHhCcccC-CCCc--eEEEcCceEEeecCcEEEEEEEEEEeeccCCCc Confidence 2233 356889999999999988766 5544 366666666543343 23322222223333333 No 117 >protein:vir:9824 Length: 132 # NCBI annotation: putative minor capsid protein # Family: family:all:1267 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795586;genbank:gi:28876335;genbank:GeneID:1257908 Probab=29.09 E-value=1.7 Score=19.34 Aligned_cols=123 Identities=12% Similarity=0.132 Sum_probs=66.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccc--eeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSM--TEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~--~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ +.|+..|.+...+ -+|. +.-..|..-.... .+.+-.+ .......++. .+..+...+--+.++...| T Consensus 1 ~~---~Df~e~L~~~In~-l~LP--~k~~~~yL~~~es--l~iyp~PGs~v~~ey~dG~--~e~~l~feIa~ktK~~~~a 70 (132) T protein:vir:98 1 MT---NDFATVLRQFVEG-LDLG--IKPRLDYLTRQED--LAIYPMPGGKVNNEYMDGT--REISLPFEIAIKTKNQELA 70 (132) T ss_pred Cc---hhHHHHHHHHhcc-cCCC--ceeeecccCCCcc--EEEeecCCCcccccccCce--eEEEEeeEeeccccchhHH Confidence 54 4777777777643 1221 0111232222122 2222111 1222222333 4455555555566678999 Q ss_pred HHHHHHHHHHhcCCCCccC---C-ceEEEEEEeeeeeeecCCC-c--eEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVA---G-CTDVSIKHSNHQALKDPEP-G--VRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~---g-~~~v~~~~~~~~~~~d~d~-~--~~hg~l~fra~~~~~~ 145 (145) .+....|-..|+.-.+.+. | |.+..+.+. +..+-+.|. + +|---+..++.+|.|| T Consensus 71 ~~tLw~Is~~Ld~~~~~l~S~n~Sf~F~~lev~-~P~i~~~D~QG~~iYlld~~v~i~ie~~~ 132 (132) T protein:vir:98 71 STVMWTINSALSNFDLKLPSLNHSYTFISLDVE-KPFLNDLSDQGFYIYVLDITAHLEIEGNN 132 (132) T ss_pred HHHHHHHHHHHhhcCCcCcccCCcEEecceeec-cceeeeeecCceEEEEEEEEEEEEEeeCC Confidence 9999999999987765553 2 677777774 555555443 2 4555555566667777 No 118 >protein:vir:3037 Length: 132 # NCBI annotation: minor capsid protein # Family: family:all:1267 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438150;genbank:gi:16271813;genbank:GeneID:929243 Probab=29.09 E-value=1.7 Score=19.34 Aligned_cols=123 Identities=12% Similarity=0.132 Sum_probs=66.6 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhhccccCCcccCCCCEEEeccc--eeeecCCCcccceEEEEEEEEEECCCCHHHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVSGVYDEVPEPAPYPYVSFGSM--TEFPEDAHDRQGLSVTVVIHVWSKSPGFAEA 89 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~~IyD~vP~~a~~Pyv~iG~~--~~~~~~~~~~~~~~~~~~I~vws~~~g~~ea 89 (145) |+ +.|+..|.+...+ -+|. +.-..|..-.... .+.+-.+ .......++. .+..+...+--+.++...| T Consensus 1 ~~---~Df~e~L~~~In~-l~LP--~k~~~~yL~~~es--l~iyp~PGs~v~~ey~dG~--~e~~l~feIa~ktK~~~~a 70 (132) T protein:vir:30 1 MT---NDFATVLRQFVEG-LDLG--IKPRLDYLTRQED--LAIYPMPGGKVNNEYMDGT--REISLPFEIAIKTKNQELA 70 (132) T ss_pred Cc---hhHHHHHHHHhcc-cCCC--ceeeecccCCCcc--EEEeecCCCcccccccCce--eEEEEeeEeeccccchhHH Confidence 54 4777777777643 1221 0111232222122 2222111 1222222333 4455555555566678999 Q ss_pred HHHHHHHHHHhcCCCCccC---C-ceEEEEEEeeeeeeecCCC-c--eEEEEEEEEEEEEecC Q lcl|NC_014229. 90 HRIFAALDAALDRVPLTVA---G-CTDVSIKHSNHQALKDPEP-G--VRHINAEYRVRLTLDS 145 (145) Q Consensus 90 ~~I~~aV~~aL~~~~l~l~---g-~~~v~~~~~~~~~~~d~d~-~--~~hg~l~fra~~~~~~ 145 (145) .+....|-..|+.-.+.+. | |.+..+.+. +..+-+.|. + +|---+..++.+|.|| T Consensus 71 ~~tLw~Is~~Ld~~~~~l~S~n~Sf~F~~lev~-~P~i~~~D~QG~~iYlld~~v~i~ie~~~ 132 (132) T protein:vir:30 71 STVMWTINSALSNFDLKLPSLNHSYTFISLDVE-KPFLNDLSDQGFYIYVLDITAHLEIEGNN 132 (132) T ss_pred HHHHHHHHHHHhhcCCcCcccCCcEEecceeec-cceeeeeecCceEEEEEEEEEEEEEeeCC Confidence 9999999999987765553 2 677777774 555555443 2 4555555566667777 No 119 >protein:vir:488 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543095;swissprot:trembl:q8w624;genbank:gi:18249907;uniprot:Q8W624;genbank:GeneID:929697 Probab=28.10 E-value=1.8 Score=19.21 Aligned_cols=122 Identities=14% Similarity=0.105 Sum_probs=60.6 Q ss_pred ccchHHHHHHHHHHHhhc-Chhhhhhhhc--cccCCcccCC----CCEEEeccceeeecCCCcccceEEEEEEEEE---- Q lcl|NC_014229. 12 MATALPALQASVYAKLVG-HAPLTALVSG--VYDEVPEPAP----YPYVSFGSMTEFPEDAHDRQGLSVTVVIHVW---- 80 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~-da~l~alv~~--IyD~vP~~a~----~Pyv~iG~~~~~~~~~~~~~~~~~~~~I~vw---- 80 (145) |+- ..|.+||++ -|.+...|++ =|..+++... .=||...........+-+.....++.++.|+ T Consensus 1 Mkl------~~Ii~rLra~vP~l~grV~gaad~aal~~~~~lp~PaAyVlp~~d~~~~~~sq~~~~Q~i~e~f~Vvl~vr 74 (187) T protein:vir:48 1 MKL------TTIIAALRERCPRFEDRVGGAAQFKAIPDAGKLRLPAAYVVPSDDAPGEQKSQTDYWQDLTEGFSVIVVLS 74 (187) T ss_pred Cch------hHHHHHHHHhcchhhhhhhhhhhhhhhhhhcCCCCceEEEEeccccCCCCCCCcceeeeeeeEEEEEEEEe Confidence 664 346666653 2333333432 3555554332 2377776665543332223223333333332 Q ss_pred E--CCCC----HHHHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecC Q lcl|NC_014229. 81 S--KSPG----FAEAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEPGVRHINAEYRVRLTLDS 145 (145) Q Consensus 81 s--~~~g----~~ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~~~~hg~l~fra~~~~~~ 145 (145) + ++.| ..++..+.++|.+||-++... +++. -+.+...+...=.+|.. .-.++|.++.+= T Consensus 75 n~~D~~G~~~a~D~l~~lr~~v~~AL~GW~P~-~~~~--pi~~~gG~lvd~~~g~l---~y~~~F~~~~ql 139 (187) T protein:vir:48 75 NERDEKGQWAAYDAVHDVRRELWKALLGWMPD-PQGG--EIVYAGGTLLDLNRYEL---YYQFDFTAKYEI 139 (187) T ss_pred ccCCCCCcchhhHHHHHHHHHHHHHHhCcCcC-CCCc--eEEEcCceEeeecCcEE---EEEEEEEeeccc Confidence 1 2333 345778899999999988766 5554 35666666654333321 223333333322 No 120 >protein:vir:4705 Length: 126 # NCBI annotation: phi PVL ORF 12 homologue # Family: family:all:517 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061637;genbank:gi:9635724;genbank:GeneID:1263007 Probab=25.29 E-value=2.1 Score=18.85 Aligned_cols=116 Identities=12% Similarity=0.161 Sum_probs=61.3 Q ss_pred ccchHHHHHHHHHHHhhcChhhhhhhh-ccccCCcccCCCCEEEeccceeeecCC--CcccceEEEEEEEEEECC-CCHH Q lcl|NC_014229. 12 MATALPALQASVYAKLVGHAPLTALVS-GVYDEVPEPAPYPYVSFGSMTEFPEDA--HDRQGLSVTVVIHVWSKS-PGFA 87 (145) Q Consensus 12 M~~~~~aLq~Ai~~~L~~da~l~alv~-~IyD~vP~~a~~Pyv~iG~~~~~~~~~--~~~~~~~~~~~I~vws~~-~g~~ 87 (145) |.....-+++||.+.=..|. + ...+ .|-|+.-+++.-|.|.+=+.-..|... +...+++...+|+||-.+ .-.. T Consensus 1 minvtklirnaiiannitde-v-nvfnytiddhfhektdkpiiriyplpfnpdtyaddneisreyhyqidvwwsqdepne 78 (126) T protein:vir:47 1 MINVTKLIRNAIIANNITDE-V-NVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDEPNE 78 (126) T ss_pred CcchHHHHhhhhhccccccc-e-eeeeeehhhhhhhhcCCceEEEeeccCCCccccCcccccceeeeEEEEEEcCCCcch Confidence 88877778888876433221 1 1111 355677788999999987776666542 345668889999999544 3345 Q ss_pred HHHHHHHHHHHHhcCCCCccCCceEEEEEEeeeeeeecCCC------ceEEEEEEEEEEEEec Q lcl|NC_014229. 88 EAHRIFAALDAALDRVPLTVAGCTDVSIKHSNHQALKDPEP------GVRHINAEYRVRLTLD 144 (145) Q Consensus 88 ea~~I~~aV~~aL~~~~l~l~g~~~v~~~~~~~~~~~d~d~------~~~hg~l~fra~~~~~ 144 (145) ++..|.+.+.- +. |.| .+. +-+.+.|- ....|.+ +...+|+| T Consensus 79 qaekivdllkv-in--------fqc---yyr--eplyesdvmsfrhiirakgsi-lsmkleen 126 (126) T protein:vir:47 79 QAEKIVDLLKV-IN--------FQC---YYR--EPLYESDVMSFRHIIRAKGSI-LSMKLEEN 126 (126) T ss_pred hHHHHHHHHHH-hc--------cee---eec--CccchhhhHHHHHHhhcccce-EEeEeccC Confidence 56555554431 11 111 111 00111110 0111111 24455666 Done!