Query lcl|NC_018086.1_cdsid_YP_006488746.1 [gene=efb19] [protein=phage tail protein] [protein_id=YP_006488746.1] [location=13391..13825] Match_columns 144 No_of_seqs 108 out of 141 Neff 6.6 Searched_HMMs 1612 Date Thu Nov 7 13:10:39 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_19 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_19_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:5979 Length: 134 # 100.0 1.9E-39 1.2E-42 232.9 16.1 128 7-141 1-134 (134) 2 protein:vir:96125 Length: 140 100.0 1E-38 6.4E-42 228.9 16.4 132 8-144 1-139 (140) 3 protein:vir:105892 Length: 141 100.0 7.2E-38 4.5E-41 224.2 16.3 132 1-144 1-139 (141) 4 protein:vir:94096 Length: 141 100.0 7.2E-38 4.5E-41 224.2 16.3 132 1-144 1-139 (141) 5 protein:vir:96260 Length: 141 100.0 7.2E-38 4.5E-41 224.2 16.3 132 1-144 1-139 (141) 6 protein:vir:97325 Length: 145 100.0 8.8E-38 5.5E-41 223.7 16.1 132 8-144 1-140 (145) 7 protein:vir:95111 Length: 145 100.0 1.1E-37 6.6E-41 223.3 16.2 132 8-144 1-139 (145) 8 protein:vir:94794 Length: 145 100.0 1.2E-37 7.1E-41 223.1 16.2 132 8-144 1-139 (145) 9 protein:vir:93736 Length: 145 100.0 1.2E-37 7.2E-41 223.1 16.2 132 8-144 1-139 (145) 10 protein:vir:94488 Length: 145 100.0 1.2E-37 7.2E-41 223.1 16.2 132 8-144 1-139 (145) 11 protein:vir:97421 Length: 145 100.0 1.2E-37 7.2E-41 223.1 16.2 132 8-144 1-139 (145) 12 protein:vir:95961 Length: 145 100.0 1.2E-37 7.2E-41 223.1 16.2 132 8-144 1-139 (145) 13 protein:vir:107096 Length: 145 100.0 1.5E-37 9.4E-41 222.4 16.2 132 8-144 1-139 (145) 14 protein:vir:105337 Length: 145 100.0 1.5E-37 9.5E-41 222.4 16.2 132 8-144 1-139 (145) 15 protein:vir:96894 Length: 140 100.0 3.3E-37 2.1E-40 220.6 16.6 132 1-144 1-139 (140) 16 protein:vir:1244 Length: 145 # 100.0 5.4E-37 3.3E-40 219.4 16.3 132 8-144 1-139 (145) 17 protein:vir:4907 Length: 128 # 99.9 6E-29 3.7E-32 175.3 14.4 126 9-137 1-128 (128) 18 protein:vir:2741 Length: 128 # 99.9 1.1E-28 7E-32 173.8 14.7 126 9-137 1-128 (128) 19 protein:vir:96485 Length: 128 99.9 5E-28 3.1E-31 170.2 14.4 126 9-137 1-128 (128) 20 protein:vir:3618 Length: 129 # 99.9 1.4E-26 8.8E-30 162.3 14.5 127 8-137 1-129 (129) 21 protein:vir:3972 Length: 129 # 99.9 2.2E-26 1.4E-29 161.2 14.3 127 8-137 1-129 (129) 22 protein:vir:744 Length: 129 # 99.9 2.6E-26 1.6E-29 160.8 14.2 125 8-137 1-129 (129) 23 protein:vir:99537 Length: 125 99.9 1.6E-24 9.8E-28 151.1 13.8 124 10-137 1-125 (125) 24 protein:vir:106593 Length: 131 99.8 5.3E-21 3.3E-24 131.8 13.8 128 7-137 1-131 (131) 25 protein:vir:95765 Length: 127 99.7 7E-20 4.4E-23 125.6 13.3 125 10-141 1-127 (127) 26 protein:vir:9313 Length: 127 # 99.7 2.4E-19 1.5E-22 122.7 13.3 125 10-137 1-127 (127) 27 protein:vir:96355 Length: 127 99.7 2.6E-19 1.6E-22 122.4 13.3 125 10-137 1-127 (127) 28 protein:vir:78854 Length: 127 99.7 2.6E-19 1.6E-22 122.4 13.3 125 10-137 1-127 (127) 29 protein:vir:97143 Length: 127 99.7 3.4E-19 2.1E-22 121.8 13.2 125 10-137 1-127 (127) 30 protein:vir:96217 Length: 127 99.7 3.4E-19 2.1E-22 121.8 13.2 125 10-137 1-127 (127) 31 protein:vir:99769 Length: 127 99.7 3.4E-19 2.1E-22 121.8 13.2 125 10-137 1-127 (127) 32 protein:vir:103918 Length: 127 99.7 3.4E-19 2.1E-22 121.8 13.2 125 10-137 1-127 (127) 33 protein:vir:9880 Length: 136 # 98.6 9.4E-10 5.8E-13 70.1 11.6 130 1-141 1-136 (136) 34 protein:vir:97070 Length: 118 98.5 3.2E-09 2E-12 67.2 11.0 112 14-141 1-118 (118) 35 protein:vir:10368 Length: 118 98.5 3.9E-09 2.4E-12 66.7 11.1 112 14-141 1-118 (118) 36 protein:vir:81066 Length: 118 98.4 5.3E-09 3.3E-12 65.9 10.8 111 14-141 1-118 (118) 37 protein:vir:107581 Length: 119 98.2 3E-08 1.8E-11 61.8 9.9 109 13-140 1-119 (119) 38 protein:vir:105008 Length: 119 98.2 3E-08 1.8E-11 61.8 9.9 109 13-140 1-119 (119) 39 protein:vir:102086 Length: 119 98.2 3E-08 1.8E-11 61.8 9.9 109 13-140 1-119 (119) 40 protein:vir:102888 Length: 119 98.2 3E-08 1.8E-11 61.8 9.9 109 13-140 1-119 (119) 41 protein:vir:93602 Length: 114 98.2 4E-08 2.5E-11 61.1 9.7 107 1-138 1-114 (114) 42 protein:vir:195 Length: 115 # 98.1 4.3E-08 2.7E-11 61.0 9.4 107 1-138 1-115 (115) 43 protein:vir:4348 Length: 121 # 97.9 1.9E-07 1.2E-10 57.5 9.5 107 1-140 1-121 (121) 44 protein:vir:1892 Length: 121 # 97.8 2.8E-07 1.7E-10 56.5 8.9 107 1-140 1-121 (121) 45 protein:vir:1274 Length: 162 # 97.7 5E-07 3.1E-10 55.1 8.3 127 1-142 26-162 (162) 46 protein:vir:100242 Length: 114 97.7 8.2E-07 5.1E-10 53.9 9.2 110 1-138 1-114 (114) 47 protein:vir:1438 Length: 115 # 97.5 1.7E-06 1.1E-09 52.2 8.7 111 13-138 1-115 (115) 48 protein:vir:100116 Length: 115 97.4 2.1E-06 1.3E-09 51.7 8.4 111 13-138 1-115 (115) 49 protein:vir:1387 Length: 116 # 96.3 3.3E-05 2E-08 45.2 6.8 114 1-142 1-116 (116) 50 protein:vir:80371 Length: 115 95.9 0.00017 1.1E-07 41.2 8.6 110 10-138 1-115 (115) 51 protein:vir:98426 Length: 131 95.5 0.0014 8.8E-07 36.2 12.3 127 4-141 1-131 (131) 52 protein:vir:98343 Length: 126 95.2 0.00039 2.4E-07 39.3 8.3 122 1-144 2-126 (126) 53 protein:vir:9415 Length: 126 # 95.2 0.00039 2.4E-07 39.3 8.3 122 1-144 2-126 (126) 54 protein:vir:79247 Length: 157 95.0 0.0014 8.5E-07 36.3 10.8 127 1-144 1-151 (157) 55 protein:vir:9931 Length: 119 # 95.0 0.00075 4.6E-07 37.7 9.3 114 8-141 1-119 (119) 56 protein:vir:99226 Length: 157 94.8 0.0017 1.1E-06 35.7 10.7 127 1-144 1-151 (157) 57 protein:vir:94768 Length: 111 94.6 0.0013 8.2E-07 36.4 9.6 110 15-139 1-111 (111) 58 protein:vir:1643 Length: 111 # 94.5 0.0015 9.1E-07 36.1 9.7 110 15-139 1-111 (111) 59 protein:vir:96002 Length: 133 93.7 0.0011 6.6E-07 36.9 7.3 110 13-144 1-130 (133) 60 protein:vir:103883 Length: 159 93.6 0.0026 1.6E-06 34.7 9.3 130 1-144 1-153 (159) 61 protein:vir:101303 Length: 135 93.5 0.0015 9.3E-07 36.1 7.9 111 13-144 1-131 (135) 62 protein:vir:100675 Length: 135 93.5 0.0015 9.3E-07 36.1 7.9 111 13-144 1-131 (135) 63 protein:vir:9514 Length: 135 # 93.5 0.0015 9.3E-07 36.1 7.9 111 13-144 1-131 (135) 64 protein:vir:9579 Length: 111 # 93.1 0.0036 2.3E-06 34.0 9.3 110 13-139 1-111 (111) 65 protein:vir:108220 Length: 133 92.7 0.01 6.5E-06 31.4 11.4 126 1-143 1-133 (133) 66 protein:vir:81093 Length: 126 92.7 0.0022 1.3E-06 35.2 7.4 118 1-144 1-126 (126) 67 protein:vir:80001 Length: 126 92.7 0.0022 1.3E-06 35.2 7.4 118 1-144 1-126 (126) 68 protein:vir:6374 Length: 179 # 92.5 0.0013 8.3E-07 36.3 6.1 135 1-144 1-172 (179) 69 protein:vir:78349 Length: 127 91.9 0.0028 1.7E-06 34.6 7.1 112 13-143 1-127 (127) 70 protein:vir:9764 Length: 111 # 91.9 0.0064 4E-06 32.6 9.0 110 1-139 1-111 (111) 71 protein:vir:96972 Length: 131 90.9 0.0038 2.3E-06 33.9 6.7 111 14-142 1-131 (131) 72 protein:vir:9364 Length: 131 # 90.9 0.0038 2.3E-06 33.9 6.7 111 14-142 1-131 (131) 73 protein:vir:78648 Length: 131 90.9 0.0038 2.3E-06 33.9 6.7 111 14-142 1-131 (131) 74 protein:vir:2689 Length: 131 # 90.9 0.0038 2.3E-06 33.9 6.7 111 14-142 1-131 (131) 75 protein:vir:94418 Length: 131 90.2 0.0036 2.2E-06 34.0 6.0 111 14-142 1-131 (131) 76 protein:vir:93902 Length: 131 89.9 0.0041 2.6E-06 33.6 6.0 111 14-142 1-131 (131) 77 protein:vir:107857 Length: 154 89.6 0.02 1.3E-05 29.8 9.6 125 10-144 1-140 (154) 78 protein:vir:79065 Length: 154 89.3 0.023 1.4E-05 29.6 9.7 125 10-144 1-140 (154) 79 protein:vir:78124 Length: 139 87.7 0.037 2.3E-05 28.5 10.5 133 1-140 1-139 (139) 80 protein:vir:106554 Length: 122 87.6 0.037 2.3E-05 28.4 9.9 113 1-144 1-119 (122) 81 protein:vir:102955 Length: 138 87.5 0.038 2.4E-05 28.3 11.2 120 7-144 1-125 (138) 82 protein:vir:79571 Length: 137 87.3 0.033 2.1E-05 28.7 9.3 127 3-140 1-137 (137) 83 protein:vir:9648 Length: 126 # 87.3 0.0091 5.6E-06 31.8 6.1 114 12-144 1-126 (126) 84 protein:vir:79047 Length: 145 86.0 0.049 3E-05 27.8 11.9 119 12-144 1-126 (145) 85 protein:vir:94921 Length: 125 85.3 0.054 3.3E-05 27.5 11.7 115 1-138 1-125 (125) 86 protein:vir:107704 Length: 132 81.2 0.088 5.4E-05 26.4 10.6 119 12-141 1-132 (132) 87 protein:vir:9709 Length: 141 # 80.4 0.05 3.1E-05 27.7 7.1 113 13-140 1-141 (141) 88 protein:vir:101509 Length: 139 80.3 0.042 2.6E-05 28.1 6.7 121 10-141 1-139 (139) 89 protein:vir:101606 Length: 142 79.2 0.037 2.3E-05 28.4 6.0 129 7-137 1-142 (142) 90 protein:vir:102191 Length: 139 78.9 0.05 3.1E-05 27.7 6.6 121 10-141 1-139 (139) 91 protein:vir:78057 Length: 154 77.7 0.12 7.6E-05 25.6 8.5 133 1-141 1-154 (154) 92 protein:vir:7450 Length: 141 # 76.7 0.13 8.3E-05 25.4 9.0 122 10-143 1-141 (141) 93 protein:vir:3428 Length: 131 # 75.2 0.15 9.3E-05 25.1 9.3 122 7-140 1-131 (131) 94 protein:vir:1994 Length: 182 # 72.7 0.18 0.00011 24.7 8.5 122 12-144 1-150 (182) 95 protein:vir:98629 Length: 126 71.4 0.089 5.5E-05 26.3 6.0 115 12-144 1-126 (126) 96 protein:vir:103278 Length: 169 67.2 0.26 0.00016 23.8 8.6 126 1-139 20-169 (169) 97 protein:vir:6215 Length: 109 # 63.8 0.31 0.00019 23.4 7.8 107 1-138 1-109 (109) 98 protein:vir:104348 Length: 129 62.2 0.34 0.00021 23.2 9.5 116 12-139 1-129 (129) 99 protein:vir:10327 Length: 182 57.6 0.43 0.00027 22.6 11.6 130 1-144 1-148 (182) 100 protein:vir:81158 Length: 109 57.2 0.35 0.00022 23.1 6.4 105 1-139 2-109 (109) 101 protein:vir:5259 Length: 213 # 47.6 0.33 0.0002 23.3 4.6 131 1-144 51-197 (213) 102 protein:vir:105468 Length: 135 46.3 0.74 0.00046 21.3 10.9 117 14-144 1-125 (135) 103 protein:vir:95371 Length: 104 40.2 0.98 0.00061 20.6 9.0 103 1-137 1-104 (104) 104 protein:vir:3874 Length: 114 # 37.3 1.1 0.0007 20.3 6.4 100 14-128 1-114 (114) 105 protein:vir:80105 Length: 162 36.4 1.2 0.00073 20.2 11.0 131 7-144 1-153 (162) 106 protein:vir:397 Length: 132 # 33.8 1.3 0.00083 19.9 9.3 121 7-140 1-132 (132) 107 protein:vir:80109 Length: 104 32.7 1.4 0.00087 19.8 9.1 103 1-137 1-104 (104) 108 protein:vir:7994 Length: 134 # 30.4 1.6 0.00098 19.5 9.2 123 1-141 1-134 (134) 109 protein:vir:102609 Length: 134 29.9 1.6 0.001 19.4 9.3 123 1-141 1-134 (134) 110 protein:vir:105826 Length: 134 29.9 1.6 0.001 19.4 9.3 123 1-141 1-134 (134) No 1 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=100.00 E-value=1.9e-39 Score=232.89 Aligned_cols=128 Identities=20% Similarity=0.353 Sum_probs=116.6 Q ss_pred CCCCChhhHHHHHHHHHHhhcC------CcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 7 FRARSSSVALQRAIVKEIRAQG------INIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 7 ~~~~S~~~aLQ~AI~~~L~a~~------~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) |+++||+.|||+|||+||+++. ++|||++|++++||||+||+.+++|+ +++|..|.+|+++|||||++ |++| T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg~I~D~~P~~~~~PYV~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~-g~~e 78 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVNQVTESPGKDDPYPYVVIGDQSSTPF-ETKSSFGENITMDFHVWGGT-TRAE 78 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhhhhhcCCCCCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEECC-ChHH Confidence 9999999999999999999873 38999999999999999999999997 58999999999999999986 7899 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeeccc Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTI 141 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~ 141 (144) +|+|+++|+++|++++|+|+ |+.++.+++.+.+.++|+||.+. |+.|+||+++++= T Consensus 79 a~~ia~av~~aL~~~~L~l~-~~~lv~l~~~~~~~~rd~dg~~~----hg~l~fra~ve~~ 134 (134) T protein:vir:59 79 AQDISSRVLEALTYKPLMFE-GFTFVAKKLVLAQVITDTDGVTK----HGIIKVRFTINNN 134 (134) T ss_pred HHHHHHHHHHHhcCCCcccC-CceEEEeEEeeeeEEecCCCceE----EEEEEEEEEEecC Confidence 99999999999999999995 78999999999999999999874 4577777775444 No 2 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=100.00 E-value=1e-38 Score=228.85 Aligned_cols=132 Identities=18% Similarity=0.282 Sum_probs=119.1 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) -..|++.|||+|||+||++| +.+|||++|++++||||+||+.+++++ +++|+.|.+|+++|||||++.|++| T Consensus 1 ~~msa~~aLq~Ai~~~L~ad~~l~alvggrVyD~~P~~~~~PYV~lG~~~~~~~-~~~~~~g~~~~~tl~Vws~~~g~~e 79 (140) T protein:vir:96 1 MWVTAEPLLYNKIMNNLIENPITDKLVGGRVFDCVQKDVVYPYIVVGESNVTES-ERSPGMREIIAITFHVYSQYENGAE 79 (140) T ss_pred CccchhHHHHHHHHHHhccChhHHhhcCcccccCCccCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 13478889999999999987 347999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|+ ++|+| +||+++.+++.+.++++|+||.+.| +.++++|++-+++..-- T Consensus 80 a~~ia~ai~~aL~-~~l~l-~~~~lv~l~~~~~~~~rd~dg~t~h--gvl~~ra~ve~~~~~~~ 139 (140) T protein:vir:96 80 ARELLKYLNYACR-LNINF-KDYELEWIKKDNSQVFTDIDQYTKH--GVLRLLYKVRHKTLQER 139 (140) T ss_pred HHHHHHHHHHHhc-CCccC-CCceEEEEEEeeeEEeecCCCceEE--EEEEEEEEEeecccccC Confidence 9999999999996 79999 5899999999999999999998754 67888888888877666 No 3 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=100.00 E-value=7.2e-38 Score=224.22 Aligned_cols=132 Identities=17% Similarity=0.279 Sum_probs=117.3 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs 73 (144) || -|++.|||+|||++|++| +.+|||++|+++++|||+||+.+++++ +++|..|++|+++||||| T Consensus 1 Ms-------ms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~-~~~~~~g~~~~~ti~Vws 72 (141) T protein:vir:10 1 MW-------VSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNN-ESSATMRETVGIVIHVYS 72 (141) T ss_pred Cc-------cchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEE Confidence 43 267899999999999997 448999999999999999999999997 588999999999999999 Q ss_pred CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 74 DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 74 ~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ++.|+.|||+|+++|+++|++ +|.| +||+++.+++.+.++++|+||.+.| +.++++||+.++++.-- T Consensus 73 ~~~g~~eak~ia~av~~AL~~-~l~l-~~~~lv~l~~~~~~~~rd~dg~t~h--gvl~~ra~v~~~~~~~~ 139 (141) T protein:vir:10 73 QFATQYEAKLILSAIGYVLNR-PIEI-DNYEFQFSRIDSQAVFPDIDRFTKH--GTIRLLFKYRHKKKNEG 139 (141) T ss_pred cCCCHHHHHHHHHHHHHHhcc-cccC-CCceEEEEEEeeeeeeecCCCceEE--EEEEEEEEEEecccccc Confidence 999999999999999999985 6888 4899999999999999999998754 66777777777777554 No 4 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=100.00 E-value=7.2e-38 Score=224.22 Aligned_cols=132 Identities=17% Similarity=0.279 Sum_probs=117.3 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs 73 (144) || -|++.|||+|||++|++| +.+|||++|+++++|||+||+.+++++ +++|..|++|+++||||| T Consensus 1 Ms-------ms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~-~~~~~~g~~~~~ti~Vws 72 (141) T protein:vir:94 1 MW-------VSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNN-ESSATMRETVGIVIHVYS 72 (141) T ss_pred Cc-------cchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEE Confidence 43 267899999999999997 448999999999999999999999997 588999999999999999 Q ss_pred CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 74 DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 74 ~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ++.|+.|||+|+++|+++|++ +|.| +||+++.+++.+.++++|+||.+.| +.++++||+.++++.-- T Consensus 73 ~~~g~~eak~ia~av~~AL~~-~l~l-~~~~lv~l~~~~~~~~rd~dg~t~h--gvl~~ra~v~~~~~~~~ 139 (141) T protein:vir:94 73 QFATQYEAKLILSAIGYVLNR-PIEI-DNYEFQFSRIDSQAVFPDIDRFTKH--GTIRLLFKYRHKKKNEG 139 (141) T ss_pred cCCCHHHHHHHHHHHHHHhcc-cccC-CCceEEEEEEeeeeeeecCCCceEE--EEEEEEEEEEecccccc Confidence 999999999999999999985 6888 4899999999999999999998754 66777777777777554 No 5 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=100.00 E-value=7.2e-38 Score=224.22 Aligned_cols=132 Identities=17% Similarity=0.279 Sum_probs=117.3 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs 73 (144) || -|++.|||+|||++|++| +.+|||++|+++++|||+||+.+++++ +++|..|++|+++||||| T Consensus 1 Ms-------ms~~~aLQ~Ai~~~L~adaal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~-~~~~~~g~~~~~ti~Vws 72 (141) T protein:vir:96 1 MW-------VSVEPELTNQIYKRLISDPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNN-ESSATMRETVGIVIHVYS 72 (141) T ss_pred Cc-------cchhHHHHHHHHHHhhcChhhHhhcCCccccCCccCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEE Confidence 43 267899999999999997 448999999999999999999999997 588999999999999999 Q ss_pred CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 74 DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 74 ~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ++.|+.|||+|+++|+++|++ +|.| +||+++.+++.+.++++|+||.+.| +.++++||+.++++.-- T Consensus 73 ~~~g~~eak~ia~av~~AL~~-~l~l-~~~~lv~l~~~~~~~~rd~dg~t~h--gvl~~ra~v~~~~~~~~ 139 (141) T protein:vir:96 73 QFATQYEAKLILSAIGYVLNR-PIEI-DNYEFQFSRIDSQAVFPDIDRFTKH--GTIRLLFKYRHKKKNEG 139 (141) T ss_pred cCCCHHHHHHHHHHHHHHhcc-cccC-CCceEEEEEEeeeeeeecCCCceEE--EEEEEEEEEEecccccc Confidence 999999999999999999985 6888 4899999999999999999998754 66777777777777554 No 6 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=100.00 E-value=8.8e-38 Score=223.73 Aligned_cols=132 Identities=18% Similarity=0.260 Sum_probs=115.1 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|++| +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|++| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV~D~~P~~a~~PYv~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCceecCCccCCCCCEEEeCcceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 348999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee-cccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID-STIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t-~~~~~~ 144 (144) +|+|+++|+++|++ +|+|+ ||+++.+++...+.++|+||.+.| +.++++|++-+ +-.+|. T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~fra~ve~~~~~~~~ 140 (145) T protein:vir:97 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTKH--GIIRLVFKYRHNTLQRSV 140 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCCceEE--EEEEEEEEEecCceeccc Confidence 99999999999985 79996 789999999999999999998643 45555555554 346777 No 7 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=100.00 E-value=1.1e-37 Score=223.29 Aligned_cols=132 Identities=19% Similarity=0.278 Sum_probs=116.0 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||+||+++ +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|+.| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~a~~PYV~lG~~~~~~~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEecCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 347999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ||+|+++|+++|++ +|+|+ ||.++.+++...+.++|+||.+.| +.++++|++-++|.--- T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~~ra~ve~~~~~~~ 139 (145) T protein:vir:95 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDRYTKH--GIIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCCceEE--EEEEEEEEEEecccccc Confidence 99999999999985 79995 789999999999999999998644 56666666666666544 No 8 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=100.00 E-value=1.2e-37 Score=223.10 Aligned_cols=132 Identities=19% Similarity=0.278 Sum_probs=115.9 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|+.| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999887 358999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|++ +|+|+ ||+++.+++...+.++|+||.+.| +.++++|++-+++.--- T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~fra~ve~~~~~~~ 139 (145) T protein:vir:94 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTKH--GIIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCCceEE--EEEEEEEEEEecccccc Confidence 99999999999985 79986 789999999999999999998643 56666666666665544 No 9 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=100.00 E-value=1.2e-37 Score=223.07 Aligned_cols=132 Identities=19% Similarity=0.278 Sum_probs=116.1 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|+.| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 347999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|++ +|.|+ ||+++.+++...+.++|+||.+.| +.++++|++-++|.--- T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~fra~ve~~~~~~~ 139 (145) T protein:vir:93 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTKH--GIIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCcceEE--EEEEEEEEEEecccccc Confidence 99999999999985 79885 789999999999999999998754 56666666666666544 No 10 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=100.00 E-value=1.2e-37 Score=223.07 Aligned_cols=132 Identities=19% Similarity=0.278 Sum_probs=116.1 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|+.| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 347999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|++ +|.|+ ||+++.+++...+.++|+||.+.| +.++++|++-++|.--- T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~fra~ve~~~~~~~ 139 (145) T protein:vir:94 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTKH--GIIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCcceEE--EEEEEEEEEEecccccc Confidence 99999999999985 79885 789999999999999999998754 56666666666666544 No 11 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=100.00 E-value=1.2e-37 Score=223.07 Aligned_cols=132 Identities=19% Similarity=0.278 Sum_probs=116.1 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|+.| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~D~~P~~a~~PYV~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCceecCCcCCCCCCEEEeCCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 347999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|++ +|.|+ ||+++.+++...+.++|+||.+.| +.++++|++-++|.--- T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~fra~ve~~~~~~~ 139 (145) T protein:vir:97 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTKH--GIIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCcceEE--EEEEEEEEEEecccccc Confidence 99999999999985 79885 789999999999999999998754 56666666666666544 No 12 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=100.00 E-value=1.2e-37 Score=223.08 Aligned_cols=132 Identities=19% Similarity=0.277 Sum_probs=115.9 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|+++++|||+||+.+++++ +++|..|.+|+++|||||++.|+.| T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~D~~P~~~~~PYv~lG~~~~~d~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccccCCcCCCCCCEEEecCceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999887 358999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|++ +|+|+ ||+++.+++...+.++|+||.+.| +.++++|++-+++.--- T Consensus 80 ak~ia~av~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~fra~ve~~~~~~~ 139 (145) T protein:vir:95 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTKH--GVIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CCeEEEeEEeeeeEeecCCCceEE--EEEEEEEEEEecccccc Confidence 99999999999985 79986 789999999999999999998643 56666666666665544 No 13 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=100.00 E-value=1.5e-37 Score=222.43 Aligned_cols=132 Identities=20% Similarity=0.285 Sum_probs=115.0 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|++++||||+||+.+++++ +++|..|.+|+++|||||++.|+.+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMFEDVGVTLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 458999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|+ ++|+|+ ||.++.+++.+.+.++|+||.+.| +.++++|++-+++.--- T Consensus 80 a~~ia~av~~aL~-a~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~~ra~ve~~~~~~~ 139 (145) T protein:vir:10 80 ASQIIQYLGFVLN-SEIEIN-NYSFIKSRIDTQEVITDIDQYTKH--GIIRLIFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhC-CCcCCC-CCeEEEEEEeeeeEeecCCCceEE--EEEEEEEEEeecccccc Confidence 9999999999996 889996 899999999999999999998643 45555555555555433 No 14 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=100.00 E-value=1.5e-37 Score=222.42 Aligned_cols=132 Identities=20% Similarity=0.285 Sum_probs=115.0 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =+.|++.|||+|||++|+++ +.+|||++|++++||||+||+.+++++ +++|..|.+|+++|||||++.|+.+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD~~P~~a~~PyV~lG~~~~~~~-~~~~~~g~~~~~ti~Vws~~~g~~e 79 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMFEDVGVTLHVYSQARNRDE 79 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccccccCCccCCCCCEEEeCcceeeec-CCCcccceEEEEEEEEEEcCCCHHH Confidence 14578999999999999987 458999999999999999999999997 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) +|+|+++|+++|+ ++|+|+ ||.++.+++.+.+.++|+||.+.| +.++++|++-+++.--- T Consensus 80 a~~ia~av~~aL~-a~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~~ra~ve~~~~~~~ 139 (145) T protein:vir:10 80 ASQIIQYLGFVLN-SEIEIN-NYSFIKSRIDTQEVITDIDQYTKH--GIIRLIFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhC-CCcCCC-CCeEEEEEEeeeeEeecCCCceEE--EEEEEEEEEeecccccc Confidence 9999999999996 889996 899999999999999999998643 45555555555555433 No 15 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=100.00 E-value=3.3e-37 Score=220.58 Aligned_cols=132 Identities=20% Similarity=0.304 Sum_probs=114.6 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs 73 (144) || -||+.|||+|||++|+++ +.+|||++|++++||||+||+.+++|+ +++|..|.+|+++||||| T Consensus 1 Ms-------ms~~~aLq~Ai~a~L~ada~l~alvg~~VyD~~P~~~~~Pyv~lG~~~~~~~-~~~~~~g~~~~~~i~Vws 72 (140) T protein:vir:96 1 MW-------VSVEPELTVQIYKRLKASPIINKFVGDRVFDVVQEDAVYPYIVVGESNVTNN-ESSTMMRETVGIVIHVYS 72 (140) T ss_pred CC-------ccHHHHHHHHHHHHhhcChhHHHhcCCccccCCccCCCCCEEEecCceeeec-CCCcccceEEEEEEEEEE Confidence 44 367899999999999987 447999999999999999999999997 589999999999999999 Q ss_pred CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 74 DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 74 ~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ++.|+.|||+|+++|++||+ ++|+|+ ||.++.+++.+.+.++|+||.+.| +.++++|++.+.+.--- T Consensus 73 ~~~g~~ea~~ia~av~~AL~-~~l~l~-~~~lv~l~~~~~~~~rd~dg~~~h--gvl~~r~~v~~~~~~~~ 139 (140) T protein:vir:96 73 QFATQYEAKQIISAIGYVLN-RPIDIE-NYEFQFSRIDSQSVFPDIDRFTKH--GTIRLLFKYRHIKKGEG 139 (140) T ss_pred cCCCHHHHHHHHHHHHHHhC-CCccCC-CCeEEEEEEeeeEEEecCCCceEE--EEEEEEEEEEeeccccC Confidence 99999999999999999996 789995 899999999999999999998754 45566666655444333 No 16 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=100.00 E-value=5.4e-37 Score=219.42 Aligned_cols=132 Identities=20% Similarity=0.293 Sum_probs=119.2 Q ss_pred CCCChhhHHHHHHHHHHhhc-------CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQ-------GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~e 80 (144) =.+||+.|||+|||++|+++ +.+|||++|++++||||+||+.+++++ ++||..|.+|+++|||||++.||++ T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~vyD~~P~~~~~PyV~lG~~~~~~~-~t~~~~~~~~~lti~Vws~~~gr~e 79 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGRVFDCVQKDAVYPYIVVGETNVTNK-ETTTSMVEDVGITLHVYSQARNRDE 79 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCcccccCCccCCCCCEEEeccceeeec-CCCcccceEEEEEEEEEEcCccHHH Confidence 14578999999999999887 458999999999999999999999997 6899999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) |++|+++|+++|++ +|.++ ||+++.++..+.++++|++|.+. |++++++|+|-+++.--- T Consensus 80 a~~ia~ai~~aL~~-~l~l~-~~~lv~l~~~~~~~~rd~d~~~~--hgvl~~ra~i~~~~~~~~ 139 (145) T protein:vir:12 80 ASQIIQFLGFVLNN-EIEID-YYSFIKSRIDTQEVITDIDQYTK--HGIIRLVFKYRHNTLQRS 139 (145) T ss_pred HHHHHHHHHHHhcc-ccCCC-CceEEEEEEeeEEEEecCCCceE--EEEEEEEEEEEeCCcccc Confidence 99999999999986 78885 88999999999999999999864 478889999988887655 No 17 >protein:vir:4907 Length: 128 # NCBI annotation: gp128 # Family: family:all:504 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056685;genbank:gi:9635020;genbank:GeneID:1262660 Probab=99.93 E-value=6e-29 Score=175.32 Aligned_cols=126 Identities=13% Similarity=0.213 Sum_probs=109.7 Q ss_pred CCChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 9 ARSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 9 ~~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) -+||..+||+++|++|++.|..|||.+| ++++||||+||+.+..++ +||++.|++++++||||+++.+|+|+++|+++ T Consensus 1 m~sp~q~L~~~~f~~l~~~g~~vyD~lP~~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~~~~R~ev~~i~~~ 79 (128) T protein:vir:49 1 MKQPDQLLHDEMYRISCELGYNTYTYLPPDDAAYPFVVMGETMVLPQ-STKSHLIGRLSSTVHVWGRVDDRKTLSDMAGQ 79 (128) T ss_pred CCchHHHHHHHHHHHHHhcCCceecccCCCCCCCCEEEeeeeeecCC-ccccccccEEEEEEEEEeCCCCchhHHHHHHH Confidence 3899999999999999999999999999 578999999999999986 68999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEE-EEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIG-KKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~-~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) |..+|++.. ..+||.+. ...-.+.+.+.|.+-.+.-.||+++++|++| T Consensus 80 i~~~l~~~~--~t~~y~f~~~i~~s~~~~~~D~st~~~L~Hgvl~l~f~~~ 128 (128) T protein:vir:49 80 LMSSFFAIK--NIGGKQFSAEINQSSIDSNRDNSTDEVLYHFVIYTYFKFV 128 (128) T ss_pred HHHHhhccc--ccCCeEEEEEeccceEEEEeecCCCcceeeEEEEEEEEeC Confidence 999997653 34788764 4555677777776554555678999999999 No 18 >protein:vir:2741 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695114;genbank:gi:23455883;genbank:GeneID:955650 Probab=99.93 E-value=1.1e-28 Score=173.82 Aligned_cols=126 Identities=13% Similarity=0.224 Sum_probs=109.3 Q ss_pred CCChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 9 ARSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 9 ~~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) -+||..|||.++|++|++.|.+|||++| ++++||||+||+.+..++ +||+..|++++++||||+.+.||+++++|+++ T Consensus 1 M~sp~qeL~~~lf~~l~~~g~~vyD~lP~~~~~YPfV~ig~~~~~~~-~tkt~~~g~~~l~i~vW~~~~~R~~v~~i~~~ 79 (128) T protein:vir:27 1 MKQPDQLLHDEMYRISCELGYNTYTYLPPDDAAYPFVVMGETMVLPQ-STKSHLIGRLSSTVHVWGHVDDRKTLSDMAGQ 79 (128) T ss_pred CCCHHHHHHHHHHHHHHhcCCceeccCCCCCCCcCEEEeccceecCC-ccccccccEEEEEEEEEECCcchhHHHHHHHH Confidence 3899999999999999999999999999 578999999999999986 69999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEE-EEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIG-KKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~-~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) |..++.+. +. .+||++. .....+.+.+.|.+-.+.-.|++++++|++| T Consensus 80 i~~~~~~~-~~-t~~y~~~~~~~~~~~qil~Dtst~~~l~Hgii~l~f~~~ 128 (128) T protein:vir:27 80 LMSSFFAI-KK-IGGKQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHHhccc-cc-cCCeeEEEEeecceEEEeeecCCCceeeEEEEEEEEEeC Confidence 99999775 33 3788875 4566788888774333344568999999999 No 19 >protein:vir:96485 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238497;genbank:gi:66391773;genbank:GeneID:5176907 Probab=99.92 E-value=5e-28 Score=170.24 Aligned_cols=126 Identities=13% Similarity=0.244 Sum_probs=111.0 Q ss_pred CCChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 9 ARSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 9 ~~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) -+||..+|+.++|++|+..|..|||.+| ++++||||+||+.+..++ +||+..|++++++||||+++.+|+++++|+++ T Consensus 1 m~sp~qeL~d~~f~~l~~~g~~vyd~lP~~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~~~~R~~v~~i~~~ 79 (128) T protein:vir:96 1 MKQPDQLLHDEMYRISSGLGYDTYTYLPPEGAAYPFVVMGETMVLPQ-STKSHLIGRLSSTVHVWGRVDDRKTLSDMAGQ 79 (128) T ss_pred CCCHHHHHHHHHHHHHHhcCCeeecccCCCCCCCCEEEEeeeeecCC-ccccccccEEEEEEEEEECCCCchhHHHHHHH Confidence 3899999999999999999999999998 788999999999999986 69999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEE-EEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIG-KKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~-~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) |..+|++. .. .+||.+. ...-.+.+++.|..-.+.-.||+++++|++| T Consensus 80 i~~~l~~~-~~-t~~y~~~~~~~~~~~qii~D~st~~~l~Hgil~l~f~~~ 128 (128) T protein:vir:96 80 LMSSFFTI-KN-IDGMQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHHhhhh-hc-cCCeEEEEEEeeeeEEEeeecCCCceeeEEEEEEEEEeC Confidence 99999876 33 4799885 4566788888885333445678999999999 No 20 >protein:vir:3618 Length: 129 # NCBI annotation: ORF41 # Family: family:all:504 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112704;genbank:gi:13786572;genbank:GeneID:921070 Probab=99.90 E-value=1.4e-26 Score=162.29 Aligned_cols=127 Identities=17% Similarity=0.332 Sum_probs=106.4 Q ss_pred CCCChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTD 86 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~ 86 (144) --+||..+|+.++|++|++.|.+|||++| ++++||||+||+.+..++ .||+..|++++++||||+.+.+|+++++|++ T Consensus 1 mmksp~qeL~d~~f~~l~~lG~~vyD~lP~~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~~~~R~~v~~i~~ 79 (129) T protein:vir:36 1 MIKTRDQSIFDELFKRIQALGYTVYDYKPMNEVGYPFVELENTQTIHE-ANKTDIKGTVSLSLSVWGLQKKRKEVSDMAS 79 (129) T ss_pred CCcChhHHHHHHHHHHHHhcCCeeeeccCCCCCCcCEEEeeeeeecCC-ccccccccEEEEEEEEEeCCcCchhHHHHHH Confidence 13577899999999999999999999999 668999999999999986 6899999999999999999999999999999 Q ss_pred HHHHHhcCCccccCCCceEE-EEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 87 FLVGLLINSPLQLEEGFCIG-KKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 87 ~V~~aL~~~~L~L~~g~~~~-~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) .|..+|.+..+ .+||.+. ...-.+.+...|....+.-.||++++.|++- T Consensus 80 ~i~~~~~~~~~--t~~y~~~~~~~~~~~q~~~D~st~~~L~Hgii~l~f~~r 129 (129) T protein:vir:36 80 NIFNQALNISA--TDGYSWALNSQASTIQMLDDTTTNTPLKRALINLEFRLR 129 (129) T ss_pred HHHHHhccccc--CCCeEEEEEeeeeeEEEeccCCCCceeeEEEEEEEEEeC Confidence 99999987654 4899875 3344556777776544445667877777766 No 21 >protein:vir:3972 Length: 129 # NCBI annotation: structural protein # Family: family:all:504 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663680;genbank:gi:21716117;genbank:GeneID:951217 Probab=99.90 E-value=2.2e-26 Score=161.24 Aligned_cols=127 Identities=17% Similarity=0.328 Sum_probs=103.9 Q ss_pred CCCChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTD 86 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~ 86 (144) =-+||..+|+.++|++|++.|.+|||.+| ++++||||+||+.+..++ .+|+..|++++++||||+.+.+|+++++|++ T Consensus 1 mmksp~qeL~d~~f~~l~~lG~~vyD~lP~~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~~~~R~~v~~i~~ 79 (129) T protein:vir:39 1 MIKTRDQSIFDELFKRIQALGYTVYDYKQMNEVGYPFVEMENTQTIHE-PNKTDIKGTVSLSLSVWGLQKKRKEVSDMAS 79 (129) T ss_pred CCcChhHHHHHHHHHHHHhcCCeeeeccCCCCCCcCEEEeeeeeecCC-ccccccccEEEEEEEEEeCCcCchhHHHHHH Confidence 13578899999999999999999999988 668999999999999986 6899999999999999999999999999999 Q ss_pred HHHHHhcCCccccCCCceEEE-EEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 87 FLVGLLINSPLQLEEGFCIGK-KELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 87 ~V~~aL~~~~L~L~~g~~~~~-~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) .|..++.+... .+||.+.. ..-.+.+.+.|.+-.+.-.||++.+.|++- T Consensus 80 ~i~~~~~~~~~--t~~y~~~~~~~~~~~q~~~Dts~~~~L~Hgvi~l~f~~r 129 (129) T protein:vir:39 80 NIFNQALNISA--TDGYSWALNLQASTIQMMDDTTTGTPLKRAFINLEFRLR 129 (129) T ss_pred HHHHHhccccc--CCCeeEEEeecceeEEEecccCCCceeeeEEEEEEEEeC Confidence 99998876433 37998753 445677788774322333456777766665 No 22 >protein:vir:744 Length: 129 # NCBI annotation: major structural protein 2 # Family: family:all:504 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108721;genbank:gi:13487843;genbank:GeneID:920879 Probab=99.89 E-value=2.6e-26 Score=160.81 Aligned_cols=125 Identities=18% Similarity=0.345 Sum_probs=103.9 Q ss_pred CCCChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTD 86 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~ 86 (144) =-+||..+|+.++|++|++.|.+|||++| ++++||||+||+.+..++ .||+..|++++++||||+.+.+|+++++|++ T Consensus 1 mmksp~qeL~d~~~~~l~~lG~~vyD~lP~~~v~YPfV~ig~~~~~~~-~tKt~~~g~v~ltihVW~~~~~R~~v~~i~~ 79 (129) T protein:vir:74 1 MIKTRDQSIFDELFKRIQALGYTVYDYKPMNEVGYPFVELENTQTIHE-ANKTDIKGTVSLSLSVWGLQKKRKEVSDMAS 79 (129) T ss_pred CCcChhHHHHHHHHHHHHhcCCeeeeccCCCCCCcCEEEeeeeeecCC-ccccccccEEEEEEEEeeCCccchhHHHHHH Confidence 13578899999999999999999999988 568899999999999986 6899999999999999999999999999999 Q ss_pred HHHHHhcCCccccCCCceEEE-EEEeeeeeeec--ccccccCccEEEEEEEEEe Q lcl|NC_018086. 87 FLVGLLINSPLQLEEGFCIGK-KELDHVRYTEA--ANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 87 ~V~~aL~~~~L~L~~g~~~~~-~~~~~~r~~~d--~dg~~~~~~~~~~l~fri~ 137 (144) .|..+|.+..+ .+||.+.. ..-.+.+++.| +++.. .||++.+.|.+- T Consensus 80 ~i~~~~~~~~~--t~~y~~~~~~~~~~~q~~~Dtst~~~L--~Hgvi~l~f~~r 129 (129) T protein:vir:74 80 NIFNQALNISA--TDGYSWALNSQASTIQMLDDTTTHTPL--KRALINLEFRLR 129 (129) T ss_pred HHHHHhccccc--cCCcEEEEeecceeEEEcccCCCCcee--eeEEEEEEEEeC Confidence 99999986554 48998753 33456778877 45544 446766666655 No 23 >protein:vir:99537 Length: 125 # NCBI annotation: putative protein # Family: family:all:504 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958542;genbank:gi:41179324;genbank:GeneID:2717175 Probab=99.86 E-value=1.6e-24 Score=151.08 Aligned_cols=124 Identities=19% Similarity=0.366 Sum_probs=109.1 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDFL 88 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~V 88 (144) -||..+|..++|++|+..|..|||.+| ++++||||++|+.+..+. .||+..|++++++||||+.+.+|+++++|++.| T Consensus 1 m~P~q~Lfd~~f~~~~~lG~~vyD~lP~~~v~YPFVvig~~~~~~~-~tKt~~~g~i~lti~VWg~~~~R~~v~~i~~~i 79 (125) T protein:vir:99 1 MNPYEELFKTVIEYCKKTGYPTFDYLPDESQGYPFIMVGDQINNDI-YAKDFVTGTSNLTIHVFAEYNYRAEVATIMEQI 79 (125) T ss_pred CchhHHHHHHHHHHHHhcCCceeeecCCCCCCcCEEEEeeeeecCC-CCccccceEEEEEEEEeeCcccchhHHHHHHHH Confidence 289999999999999999999999999 567899999999999986 699999999999999999999999999999999 Q ss_pred HHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 89 VGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 89 ~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) ..++.+. .. .+||.+. ..-.+.+++.|.+..+.-.||.+++.|++- T Consensus 80 ~~~~~~~-~~-t~~y~~~-~~~~~~qii~D~s~~t~L~Hg~l~l~F~ir 125 (125) T protein:vir:99 80 QQLIPKF-IT-TNHYLFG-LTGSSSNILGETADSIQLQHGRLILDFNLR 125 (125) T ss_pred HHHhccc-ee-ccCcEEE-eeeeeEEEeecCCCCceeeEEEEEEEEeeC Confidence 9877443 44 3798884 445667899999888777789999999988 No 24 >protein:vir:106593 Length: 131 # NCBI annotation: ORF039 # Family: family:all:504 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239498;genbank:gi:66395251;genbank:GeneID:4555747 Probab=99.76 E-value=5.3e-21 Score=131.76 Aligned_cols=128 Identities=12% Similarity=0.215 Sum_probs=107.1 Q ss_pred CCCCChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCcee-eecCCCcccCccEEEEEEEEEeCCcchHHHHH Q lcl|NC_018086. 7 FRARSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELT-SGRTISKDAIGKMHNLTLHIWSDYDSSFEVKN 83 (144) Q Consensus 7 ~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~-~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~ 83 (144) |--|||..+|-..+|+.++..|..|||..|. +++||||++|+.+. .+ ..||+..+.+++++||||+....|+++.+ T Consensus 1 mm~ksp~qeLfd~~f~~~~~lGy~vyd~lP~~~ev~YPFVvig~~~~~~~-~~tKt~~~g~v~lti~VWg~~~~R~~vs~ 79 (131) T protein:vir:10 1 MLKTTPQQALFDSIYAQLLGYGIDVIDFKELNSQLTYPFFVLRDVEANKS-KYTMESVGGELTVIIDLWNYAEDRGQHDS 79 (131) T ss_pred CCccChhHHHHHHHHHHHHhcCCceeeccCCCCCCCCCEEEEeeeeccCC-CCcccccceEEEEEEEEeecchhhhhHHH Confidence 5567788999999999999999999999995 47999999999886 35 46899999999999999999999999999 Q ss_pred HHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 84 LTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 84 Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) |++.+..++.+.. -.+||.+.--+.....+.++.+-.+.-.|+.+++.|++- T Consensus 80 i~~~i~~~~~~~~--~td~y~~~~~~~~~~~i~D~sttn~~L~Hg~i~lef~~~ 131 (131) T protein:vir:10 80 IVGATEWMLTGIE--SVEGYQLMIDDINIKTLNDVENSDRQLLHTVIIAIYKLF 131 (131) T ss_pred HHHHHHHHhhcce--ecccceEEecceEEEEEeccCCCCceeeeEEEEEEEEeC Confidence 9999999885544 348998854444444566655666667788999999988 No 25 >protein:vir:95765 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950594;genbank:gi:119953789;genbank:GeneID:5076835 Probab=99.72 E-value=7e-20 Score=125.58 Aligned_cols=125 Identities=18% Similarity=0.242 Sum_probs=102.3 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCc-CCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVN-KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDFL 88 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP-~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~V 88 (144) =||..+|-.++|+ |...|..|||++| ++++||||++|+.+..+. .+|+..| ++.++||||+....|+++.+|+++| T Consensus 1 m~P~qeLfd~~f~-~~~~Gy~vYD~lP~~~v~YPFVvig~~~~~~~-~tKt~~G-~i~l~i~VWg~~~~R~~vs~i~~~i 77 (127) T protein:vir:95 1 MTPNHALFRRLFA-ISNIRVDTYDFLPDAKSAYPFVYIGENNGSDI-PNKDLLG-RLRQTVHLYGLRTDRANLDDISAYL 77 (127) T ss_pred CchhHHHHHHHHH-HHhcCCccccccCcCCCCcCEEEEeeeeeccc-ccceeee-EEEEEEEeecCchhhhhHHHHHHHH Confidence 2888999999996 7778999999999 678999999999999886 6899665 8899999999999999999999999 Q ss_pred HHHhcCCccccCCCceEE-EEEEeeeeeeecccccccCccEEEEEEEEEeeccc Q lcl|NC_018086. 89 VGLLINSPLQLEEGFCIG-KKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTI 141 (144) Q Consensus 89 ~~aL~~~~L~L~~g~~~~-~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~ 141 (144) .+++... .+++... ...-.+.+++.|..-.+.-.||.+++.|+..--+- T Consensus 78 ~~~~~~~----~~~~~y~~~~~~s~~qil~Dtstnt~L~Hgil~l~f~f~~~~~ 127 (127) T protein:vir:95 78 ESEVKRA----HDGYDYHLYHVETSKQIIPDNTDVQPLLHIVLDFTFDYTKKEN 127 (127) T ss_pred HHHhhhh----cccceeEEEEecceeEEecccCCcceeEEEEEEEEEEeeccCC Confidence 9988543 2344432 34456788888876666667899999999884333 No 26 >protein:vir:9313 Length: 127 # NCBI annotation: phi Mu50B-like protein # Family: family:all:504 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803291;genbank:gi:29028601;genbank:GeneID:1258049 Probab=99.69 E-value=2.4e-19 Score=122.68 Aligned_cols=125 Identities=14% Similarity=0.159 Sum_probs=104.1 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:93 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47999999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+++++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~ttn~~L~Hgvl~lef~~~ 127 (127) T protein:vir:93 81 CIDDLTPSV--KTNDYDFEE-EDTNITQLVDDTTNQELLHTSVTISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeEEcccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998833 44566666666556667788999999977 No 27 >protein:vir:96355 Length: 127 # NCBI annotation: ORF038 # Family: family:all:504 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239652;genbank:gi:66395404;genbank:GeneID:5132831 Probab=99.69 E-value=2.6e-19 Score=122.44 Aligned_cols=125 Identities=13% Similarity=0.159 Sum_probs=103.6 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:96 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47899999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+.+++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~ttn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:96 81 CIDDLTPSV--KTNDYDFEE-DDTNITQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeEEcccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998733 34566666665555666788899999977 No 28 >protein:vir:78854 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285366;genbank:gi:148717894;genbank:GeneID:5246985 Probab=99.69 E-value=2.6e-19 Score=122.44 Aligned_cols=125 Identities=13% Similarity=0.159 Sum_probs=103.6 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:78 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47899999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+.+++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~ttn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:78 81 CIDDLTPSV--KTNDYDFEE-DDTNITQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeEEcccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998733 34566666665555666788899999977 No 29 >protein:vir:97143 Length: 127 # NCBI annotation: ORF041 # Family: family:all:504 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239730;genbank:gi:66394906;genbank:GeneID:5130876 Probab=99.68 E-value=3.4e-19 Score=121.83 Aligned_cols=125 Identities=13% Similarity=0.158 Sum_probs=103.1 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:97 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47899999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+.+++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:97 81 CIDDLTPSV--KTNDYDFEE-DDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998743 33466666665545566788899999977 No 30 >protein:vir:96217 Length: 127 # NCBI annotation: ORF036 # Family: family:all:504 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239575;genbank:gi:66395326;genbank:GeneID:5132763 Probab=99.68 E-value=3.4e-19 Score=121.83 Aligned_cols=125 Identities=13% Similarity=0.158 Sum_probs=103.1 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:96 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47899999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+.+++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:96 81 CIDDLTPSV--KTNDYDFEE-DDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998743 33466666665545566788899999977 No 31 >protein:vir:99769 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004312;genbank:gi:122891766;genbank:GeneID:4712324 Probab=99.68 E-value=3.4e-19 Score=121.83 Aligned_cols=125 Identities=13% Similarity=0.158 Sum_probs=103.1 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:99 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47899999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+.+++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:99 81 CIDDLTPSV--KTNDYDFEE-DDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998743 33466666665545566788899999977 No 32 >protein:vir:103918 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873997;genbank:gi:118430772;genbank:GeneID:4525410 Probab=99.68 E-value=3.4e-19 Score=121.83 Aligned_cols=125 Identities=13% Similarity=0.158 Sum_probs=103.1 Q ss_pred CChhhHHHHHHHHHHhhcCCcccccCcC--CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIWDGVNK--KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~VyD~vP~--~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =||..+|-..+|+.++..|..|||..|. +++||||++|+.+..-...+|+..+.+++++||||+....|+++.+|++. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~ev~YPFV~ig~~q~~~~~~tKt~~~G~v~ltIdVWg~~~~R~~vs~i~~~ 80 (127) T protein:vir:10 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQEIPYPFFVIKMPESNRSKYTFDSYSGDTNLVIDIWSVSDDLGHHDGLVKR 80 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCCCCCCCEEEEcceeccCCCCcccccceEEEEEEEEeecccccchHHHHHHH Confidence 2888999999999999999999999995 47899999999886422468999999999999999999999999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +..++.+.. -++||.+.. +-.+.+.+.|..-.+.-.|+.+++.|+.- T Consensus 81 i~~~~~~~~--~t~~y~~~~-~~~~~~~l~d~tTn~~L~Hgvi~lef~~~ 127 (127) T protein:vir:10 81 CIDDLTPSV--KTNDYDFEE-DDTNIAQLVDDTTNQELLHTSITISYKTF 127 (127) T ss_pred HHHHhccce--eccceeEEe-eeeeeeecccCCCcceeeeEEEEEEEeeC Confidence 998886443 358998743 33466666665545566788899999977 No 33 >protein:vir:9880 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:1887 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795642;genbank:gi:28876399;genbank:GeneID:1257930 Probab=98.64 E-value=9.4e-10 Score=70.08 Aligned_cols=130 Identities=18% Similarity=0.202 Sum_probs=102.2 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcC-CcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCc-ch Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQG-INIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYD-SS 78 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~-~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~-G~ 78 (144) |.++-++ .+|-.||.+++...+ -+.||..|.+.+.||-.|+-.+..|+ .+|+---...++.||.+|..+ ++ T Consensus 1 mLKkLsl------~~l~~aV~~~iee~tgL~c~d~~p~~ep~Pfyfie~I~~rpe-~sKtmw~e~y~~~IHais~~g~t~ 73 (136) T protein:vir:98 1 MLKKLGL------VDLHASIKQKIEDKTGLMAYDHVPEDMPSPFYFIEVVDKRPE-DTKVMWCEVFTVWIHAIAEAGKSK 73 (136) T ss_pred Cccccch------HHHHHHHHHHhhccCCceEEEecccCCCCCEEEEEeecCCcc-ccceeeeeEEEEEEEEEcCCCCcc Confidence 9999888 899999999998764 48999999999999999988888886 578988889999999999976 88 Q ss_pred HHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeee-eeecccccccCccEEEEEEEEEe---eccc Q lcl|NC_018086. 79 FEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVR-YTEAANGTYKNERAYLFLDFEVI---DSTI 141 (144) Q Consensus 79 ~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r-~~~d~dg~~~~~~~~~~l~fri~---t~~~ 141 (144) ...-++.+++.+||.+. +.||+||.+.+.+-.+.+ +.++.+|. . |++..-.|+|- ---| T Consensus 74 ~~~~~mI~~l~EAlte~-i~Lpe~y~l~~q~~~G~q~~~~~etge-~--HAi~~fei~vsygfkVKi 136 (136) T protein:vir:98 74 IAIYDMIEKLEEALTEE-LVLPEEIDILRQSEVGMQSLQEDETGE-M--HAIVAYEIKVSYGFKVKV 136 (136) T ss_pred chHHHHHHHHHhhhhce-eecCCCeEEEEEechhhhheecccCCc-e--eeeeeEEEEEeeeEEEeC Confidence 99999999999999654 889999999988776654 67777764 2 23322222210 0011 No 34 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=98.50 E-value=3.2e-09 Score=67.16 Aligned_cols=112 Identities=11% Similarity=-0.015 Sum_probs=72.7 Q ss_pred hHHHHHHHHHHhhc-CCccc-ccCcCCCC-CCEEEeCCceeeecCCCccc---CccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-GINIW-DGVNKKPE-YPFIKIGEELTSGRTISKDA---IGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-~~~Vy-D~vP~~a~-~PYV~lG~~~~~~~d~t~~~---~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) -.|+..|++.|.+. +.||| +..|++++ +|||++-.....+. ...++ ....++++|+||+. .+.+|++++.+ T Consensus 1 M~~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~-~~ldG~~~~~~~~rvQIdvyA~--t~~~A~~l~~a 77 (118) T protein:vir:97 1 MSYGRMLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPV-YWKEGGMPDKVNARVQVQIWSR--SKQEAYLATVQ 77 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCccc-ccccCCCCCccceeEEEEEeeC--CHHHHHHHHHH Confidence 15677778888876 45899 66788887 59999965544443 23333 23457899999986 68899999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeeccc Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTI 141 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~ 141 (144) |+.+|......-+ +.+-.-..+++. +....++.|.|--+++ T Consensus 78 v~~al~~~~~~~~---------~~~~~~~ye~dt----~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 78 VLRIVSEANDMQV---------LSQPIDDYVREL----KLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHhhccccccc---------ccCCcccccccC----CceEEEEEEEEEeecC Confidence 9999965432111 111111112221 2234689999987777 No 35 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=98.49 E-value=3.9e-09 Score=66.71 Aligned_cols=112 Identities=11% Similarity=0.006 Sum_probs=72.0 Q ss_pred hHHHHHHHHHHhhc-CCccc-ccCcCCCC-CCEEEeCCceeeecCCCccc---CccEEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-GINIW-DGVNKKPE-YPFIKIGEELTSGRTISKDA---IGKMHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-~~~Vy-D~vP~~a~-~PYV~lG~~~~~~~d~t~~~---~g~~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) =.|+..|++.|.+. +.||| +..|++++ +|||++-.....+. ..-++ ....++++|+||+. .+.+|++++.+ T Consensus 1 Ms~e~~l~a~L~~~~~~RVyp~~aP~~~~~~Pyiv~q~vsg~p~-~~l~G~~~~~~~~rvQIdvyA~--t~~~A~~l~~a 77 (118) T protein:vir:10 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPV-YWQEGGMPEKVNARVQIQIWSR--SKQEAYLATVQ 77 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCccc-ccccCCCCccceeEEEEEEeeC--CHHHHHHHHHH Confidence 05677788888876 45899 66788887 59999965444443 23332 23457899999986 67899999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeeccc Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTI 141 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~ 141 (144) |+.+|....- + . .+.+-.-..+++ .+....++.|.|--+++ T Consensus 78 v~~al~~~~~-~----~----~~~~~~d~ye~d----t~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 78 VLRLVSEAND-M----Q----VLSQPIDDYVRE----IKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHhhhccc-c----e----eccCCCcccccc----CCceEEEEEEEEeeecC Confidence 9999965421 1 1 111111111222 12345689999987777 No 36 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=98.44 E-value=5.3e-09 Score=65.95 Aligned_cols=111 Identities=13% Similarity=0.089 Sum_probs=71.1 Q ss_pred hHHHHHHHHHHhhc-CCccc-ccCcCCCC-CCEEEeCCceeeecCCCccc--Ccc-EEEEEEEEEeCCcchHHHHHHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-GINIW-DGVNKKPE-YPFIKIGEELTSGRTISKDA--IGK-MHNLTLHIWSDYDSSFEVKNLTDF 87 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-~~~Vy-D~vP~~a~-~PYV~lG~~~~~~~d~t~~~--~g~-~~~l~i~VWs~~~G~~eak~Ia~~ 87 (144) -.|+..|++.|.+. +.||| +..|++++ +|||++-.....+. ..-++ .+. .++++|+||+. .+.+|++++.+ T Consensus 1 Ms~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~-~~l~G~~~~~~~~rvQIdvyA~--t~~~A~~l~~a 77 (118) T protein:vir:81 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPV-YWQEGGMPEKVNARVQIQIWSR--SKQEAYLATVQ 77 (118) T ss_pred CchHHHHHHHHHhhcCCccccccCCCCCccCceEEEEecCCccc-ccccCCCCCccceeEEEEEeeC--CHHHHHHHHHH Confidence 05666777778775 55899 66788887 59999965544443 23333 222 47899999986 67899999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccC-ccEEEEEEEEEeeccc Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKN-ERAYLFLDFEVIDSTI 141 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~-~~~~~~l~fri~t~~~ 141 (144) |+.+|....- +. .+ -.+.+..... +....++.|.|--.++ T Consensus 78 v~~al~~~~~-----~~----~~-----~~~~d~ye~dt~l~r~~~Df~iw~~~~ 118 (118) T protein:vir:81 78 VLRLVSEAPD-----MQ----VL-----SQPIDDYVREIKLYGSRVDVSMWYPIT 118 (118) T ss_pred HHHHhhhccc-----ee----ec-----cCCccccccccCceeEEEEEEEEecCC Confidence 9999954321 11 11 1111222212 2234689999887777 No 37 >protein:vir:107581 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338192;genbank:gi:77020160;genbank:GeneID:3703712 Probab=98.21 E-value=3e-08 Score=61.84 Aligned_cols=109 Identities=14% Similarity=0.178 Sum_probs=72.6 Q ss_pred hhHHHHHHHHHHhhc--------CCccccc-CcCCCCCCEEEeCCceeeecC-CCcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ--------GINIWDG-VNKKPEYPFIKIGEELTSGRT-ISKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~--------~~~VyD~-vP~~a~~PYV~lG~~~~~~~d-~t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) |--+.+-||+.|+++ +.+||+. +|++...|||+|=+....|.+ +.......++.++|+|||.. ... T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~----~~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS----STT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC----CHH Confidence 467888999999875 3368876 667778999999444333322 23445678999999999985 367 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) +|+.+|.++|.. -||. +.......+.+.+. -|-.+|||.+++= T Consensus 77 ~i~~~I~~~m~~------~gf~----r~~~~d~ye~dt~l-----yhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKR------IGFS----RYAVADLYEEDTQI-----FHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHH------cCCe----eeccCCCcCChhhh-----heeeeeeeeeeeC Confidence 899999998842 3653 22222232333333 3457899977665 No 38 >protein:vir:105008 Length: 119 # NCBI annotation: conserved structural protein # Family: family:all:517 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459973;genbank:gi:85701388;genbank:GeneID:3882149 Probab=98.21 E-value=3e-08 Score=61.84 Aligned_cols=109 Identities=14% Similarity=0.178 Sum_probs=72.6 Q ss_pred hhHHHHHHHHHHhhc--------CCccccc-CcCCCCCCEEEeCCceeeecC-CCcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ--------GINIWDG-VNKKPEYPFIKIGEELTSGRT-ISKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~--------~~~VyD~-vP~~a~~PYV~lG~~~~~~~d-~t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) |--+.+-||+.|+++ +.+||+. +|++...|||+|=+....|.+ +.......++.++|+|||.. ... T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~----~~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS----STT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC----CHH Confidence 467888999999875 3368876 667778999999444333322 23445678999999999985 367 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) +|+.+|.++|.. -||. +.......+.+.+. -|-.+|||.+++= T Consensus 77 ~i~~~I~~~m~~------~gf~----r~~~~d~ye~dt~l-----yhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKR------IGFS----RYAVADLYEEDTQI-----FHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHH------cCCe----eeccCCCcCChhhh-----heeeeeeeeeeeC Confidence 899999998842 3653 22222232333333 3457899977665 No 39 >protein:vir:102086 Length: 119 # NCBI annotation: structural protein # Family: family:all:517 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512319;genbank:gi:89152488;genbank:GeneID:3953079 Probab=98.21 E-value=3e-08 Score=61.84 Aligned_cols=109 Identities=14% Similarity=0.178 Sum_probs=72.6 Q ss_pred hhHHHHHHHHHHhhc--------CCccccc-CcCCCCCCEEEeCCceeeecC-CCcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ--------GINIWDG-VNKKPEYPFIKIGEELTSGRT-ISKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~--------~~~VyD~-vP~~a~~PYV~lG~~~~~~~d-~t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) |--+.+-||+.|+++ +.+||+. +|++...|||+|=+....|.+ +.......++.++|+|||.. ... T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~----~~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS----STT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC----CHH Confidence 467888999999875 3368876 667778999999444333322 23445678999999999985 367 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) +|+.+|.++|.. -||. +.......+.+.+. -|-.+|||.+++= T Consensus 77 ~i~~~I~~~m~~------~gf~----r~~~~d~ye~dt~l-----yhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKR------IGFS----RYAVADLYEEDTQI-----FHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHH------cCCe----eeccCCCcCChhhh-----heeeeeeeeeeeC Confidence 899999998842 3653 22222232333333 3457899977665 No 40 >protein:vir:102888 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338141;genbank:gi:77020213;genbank:GeneID:3703797 Probab=98.21 E-value=3e-08 Score=61.84 Aligned_cols=109 Identities=14% Similarity=0.178 Sum_probs=72.6 Q ss_pred hhHHHHHHHHHHhhc--------CCccccc-CcCCCCCCEEEeCCceeeecC-CCcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ--------GINIWDG-VNKKPEYPFIKIGEELTSGRT-ISKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~--------~~~VyD~-vP~~a~~PYV~lG~~~~~~~d-~t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) |--+.+-||+.|+++ +.+||+. +|++...|||+|=+....|.+ +.......++.++|+|||.. ... T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~----~~~ 76 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS----STT 76 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCCCCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC----CHH Confidence 467888999999875 3368876 667778999999444333322 23445678999999999985 367 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) +|+.+|.++|.. -||. +.......+.+.+. -|-.+|||.+++= T Consensus 77 ~i~~~I~~~m~~------~gf~----r~~~~d~ye~dt~l-----yhk~~Rf~~~~el 119 (119) T protein:vir:10 77 AIHQKVNEIMKR------IGFS----RYAVADLYEEDTQI-----FHYAMRFAKGVEL 119 (119) T ss_pred HHHHHHHHHHHH------cCCe----eeccCCCcCChhhh-----heeeeeeeeeeeC Confidence 899999998842 3653 22222232333333 3457899977665 No 41 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=98.16 E-value=4e-08 Score=61.13 Aligned_cols=107 Identities=7% Similarity=0.079 Sum_probs=68.3 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-CCccc-ccCcCC-----CCCCEEEeCCceeeecCCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-GINIW-DGVNKK-----PEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-~~~Vy-D~vP~~-----a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs 73 (144) |+ +.-||+.|... ++||| +-.|+. .++|||++-.....+.+.-+.-.....+++|+||+ T Consensus 1 M~--------------e~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~~~l~gp~~~~~~vQIDvyA 66 (114) T protein:vir:93 1 MT--------------EADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVMGGQAESSVSVQIDVYA 66 (114) T ss_pred Cc--------------hHHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCcccccccCccccceEEEEEeee Confidence 44 35688888763 56999 556753 57899999665555554333344568999999998 Q ss_pred CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee Q lcl|NC_018086. 74 DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID 138 (144) Q Consensus 74 ~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t 138 (144) . .+.+|++++++|+.||... ++- ......- .|++- +....++.|.|+- T Consensus 67 ~--t~~~A~~l~~~v~~Al~~~------~~~----~~~~~~~-ye~dt----~lyR~~~d~~v~~ 114 (114) T protein:vir:93 67 G--TVTQARQIRQDAREAIMLL------APG----SVSEMQD-YIPEN----RCYRATLEFQVTV 114 (114) T ss_pred C--CHHHHHHHHHHHHHHHhhc------CcE----eecCCCc-ccccc----cceeeEEEEEEeC Confidence 6 6788999999999999421 110 1111111 23332 2234678888877 No 42 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=98.14 E-value=4.3e-08 Score=60.98 Aligned_cols=107 Identities=14% Similarity=0.088 Sum_probs=67.8 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhh-cCCccc-ccCcCCC------CCCEEEeCCceeeecCCCcccCccEEEEEEEEE Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRA-QGINIW-DGVNKKP------EYPFIKIGEELTSGRTISKDAIGKMHNLTLHIW 72 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a-~~~~Vy-D~vP~~a------~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VW 72 (144) |++ .-|++.|.. .++||| +..|.+. ++|||++-.....+.+.-+.-....++++|+|| T Consensus 1 M~e--------------~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~vQIDvy 66 (115) T protein:vir:19 1 MNE--------------DNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSADVLCGQAESRVSVQVDVY 66 (115) T ss_pred Cch--------------hHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCcccccccCCCccceEEEEEEe Confidence 554 457888875 467999 6678753 899999866655555433334467899999999 Q ss_pred eCCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee Q lcl|NC_018086. 73 SDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID 138 (144) Q Consensus 73 s~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t 138 (144) ++ ...+|++++++|+.||.. +. .+. .... -..++|-. ....++.|+|-- T Consensus 67 A~--t~~~A~~l~~~i~~Al~~----~~-p~~-----~~~~-~~ye~dt~----lyR~s~d~~V~~ 115 (115) T protein:vir:19 67 ST--SIAESRSLRDLVLASLEP----LT-PTE-----VVKI-PGYEPDYR----LYRATLDFKVTP 115 (115) T ss_pred eC--ChHHHHHHHHHHHHHhhh----cC-CEE-----ecCC-CCcccchh----ceeeEEEEEecC Confidence 86 678899999999999941 11 111 1111 11233221 123577777654 No 43 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=97.93 E-value=1.9e-07 Score=57.46 Aligned_cols=107 Identities=10% Similarity=0.059 Sum_probs=59.5 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc----------CCcccc--cCcCCCCCCEEEeCCceeeecCCCccc--CccEEE Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ----------GINIWD--GVNKKPEYPFIKIGEELTSGRTISKDA--IGKMHN 66 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~----------~~~VyD--~vP~~a~~PYV~lG~~~~~~~d~t~~~--~g~~~~ 66 (144) |- .+||+.|.++ ..|||. ..|.++++|||++-.....+.. +.++ .....+ T Consensus 1 m~---------------~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~-~l~g~~~~~~~~ 64 (121) T protein:vir:43 1 MY---------------PPIFKVCSSSPAVTAILGASPLRMYQFGLAPQLVVKPYATWQTISGSPEN-YLWGRPDADGFT 64 (121) T ss_pred CC---------------hHHHHHHhhChhhhhhhcCCCceeeccCCCCCCCcCCeEEEEEecCcccc-eecCCCCcceeE Confidence 22 2344444443 348984 5699999999998554444432 2222 235789 Q ss_pred EEEEEEeCCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 67 LTLHIWSDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 67 l~i~VWs~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) ++|+||+. .+.+|++++.+|++||.... |.+. .-.-..++| | +.-++++.+.-+-.- T Consensus 65 vQIDvyA~--t~~~A~~l~~av~~Al~~~~------~~~~-----~~~~~ye~d--T--~lyR~s~Dv~w~~~r 121 (121) T protein:vir:43 65 IQVDIFSA--TAAEARDAAKAIRDAIELSA------YVVR-----WGGESVDPD--T--KTYRVSFDVDWIVQR 121 (121) T ss_pred EEEEeeeC--CHHHHHHHHHHHHHHhhhcC------Cccc-----CCCCCCccc--c--cceeeeeEEEEeecC Confidence 99999975 67899999999999996322 1110 001111121 1 112334444433222 No 44 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=97.83 E-value=2.8e-07 Score=56.51 Aligned_cols=107 Identities=13% Similarity=0.107 Sum_probs=58.1 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc----------CCcccc--cCcCCCCCCEEEeCCceeeecCCCcc-c-CccEEE Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ----------GINIWD--GVNKKPEYPFIKIGEELTSGRTISKD-A-IGKMHN 66 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~----------~~~VyD--~vP~~a~~PYV~lG~~~~~~~d~t~~-~-~g~~~~ 66 (144) |.. . ||+.|.++ ..|||. ..|.++++|||++-.....+.. +.+ . .....+ T Consensus 1 m~~-----------~----i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~-~l~G~~~~~~~~ 64 (121) T protein:vir:18 1 MIA-----------P----IFSVCASSPEVTDLLGSNPVRIYPFGIQDDNVVYPYVVWQNITGSPEN-YIAQRPDADFFT 64 (121) T ss_pred Cch-----------H----HHHHHhcChhhhhhhcCCCceeeeccCCCCcCcCCeEEEEEecCcccc-eecCCCCcceeE Confidence 322 1 33333332 348985 6799999999999654444432 222 2 334689 Q ss_pred EEEEEEeCCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 67 LTLHIWSDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 67 l~i~VWs~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) ++|+||+. ...+|++++++|+++|... +|.. ... .-..+++ |.. -+.+++.+.+-.- T Consensus 65 vQIDvyA~--t~~~A~~l~~avr~Ale~~------~~~~-~~~----~~~ye~d--T~l--yR~s~Dv~~~~~r 121 (121) T protein:vir:18 65 LQVDAYAD--TVDEVIAVATALRDAIEPH------AHIT-RWG----GQERDPE--TKR--YRYSFDVDWIVTR 121 (121) T ss_pred EEEEeecC--CHHHHHHHHHHHHHHhhhc------Cccc-CCC----CCCCccc--ccc--eeeeeEEEEeecC Confidence 99999986 5678999999999999532 2211 000 0112222 211 1223333322222 No 45 >protein:vir:1274 Length: 162 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690766;genbank:gi:22855006;genbank:GeneID:955217 Probab=97.69 E-value=5e-07 Score=55.12 Aligned_cols=127 Identities=10% Similarity=0.163 Sum_probs=75.7 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-------CCcccccC-cCCCCCCEEEeCCceeeecCCCc-ccCccEEEEEEEE Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-------GINIWDGV-NKKPEYPFIKIGEELTSGRTISK-DAIGKMHNLTLHI 71 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-------~~~VyD~v-P~~a~~PYV~lG~~~~~~~d~t~-~~~g~~~~l~i~V 71 (144) ..----|++.-..--+-+.|++.|..+ +.+||... |++...|||+|=+-...|..-++ .....+++++|+| T Consensus 26 ~~~~~~~~~~~~~mn~~k~v~q~L~n~~~L~~l~~~~i~~l~~~~~~~~p~Itf~e~~~~p~~yADD~e~ss~~~iQIDI 105 (162) T protein:vir:12 26 INGANTYSADQMTYSPKIELVSTLNSSAFLKGLTSGGIHNLVANDVSAFPRVVFSEIQDADADFADNEVYSFEVRYQISI 105 (162) T ss_pred cccccccchhhhhhhHHHHHHHHhcChhHHHhhCCCceEEEeecCCCCceEEEEEeecCCCCcccccceeeEEEEEEEEE Confidence 111111111111113345566666544 45788765 56778999999776665543332 3457799999999 Q ss_pred EeCCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEE-eecccC Q lcl|NC_018086. 72 WSDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEV-IDSTID 142 (144) Q Consensus 72 Ws~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri-~t~~~~ 142 (144) ||..+-.....+|..+|..+|. .-||. +.......+++.+..+ =.+|||. .-++++ T Consensus 106 wsk~st~~d~~~l~~~I~~lMk------~~GF~----R~s~~d~YE~DTklyH-----K~~RF~~~y~~E~~ 162 (162) T protein:vir:12 106 FTQASTRGKETAIASEIDRLMR------EIGYS----RYDSQDLYETDTKVFH-----KARRYKKTYYQEVN 162 (162) T ss_pred eecCCcchhHHHHHHHHHHHHH------HcCCE----eecCCCCCCChhhhhh-----hhheeccceeeecC Confidence 9987777788899999999883 23653 3333344455455543 3578862 345555 No 46 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=97.67 E-value=8.2e-07 Score=53.94 Aligned_cols=110 Identities=14% Similarity=0.189 Sum_probs=68.2 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcC-Cccc-ccCcCCCCCCEEEeCCceeeecCCCccc--CccEEEEEEEEEeCCc Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQG-INIW-DGVNKKPEYPFIKIGEELTSGRTISKDA--IGKMHNLTLHIWSDYD 76 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~-~~Vy-D~vP~~a~~PYV~lG~~~~~~~d~t~~~--~g~~~~l~i~VWs~~~ 76 (144) ||.- -|+++|.+.+ .++| ...|++++.||+++-.....+. ++-++ .+...+++|+||.. T Consensus 1 ~~~~--------------~i~~~l~~~~g~~~~~~~aP~~~~~Py~vy~rvsg~p~-~tL~G~~g~~~~r~QiD~yA~-- 63 (114) T protein:vir:10 1 MSAL--------------TIRDAIGIVGGAKGYVSVASSAAQSPYYVVSRVSGTRD-MALGGATGGKSGMFQIDVYAK-- 63 (114) T ss_pred Ccee--------------eeehhhcccccccccCCCCCCCCCCceEEEEeccCccc-ccccCCCCcceEEEEEEeeeC-- Confidence 3332 3778888754 4555 8899999999999966655553 44443 35788999999975 Q ss_pred chHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee Q lcl|NC_018086. 77 SSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID 138 (144) Q Consensus 77 G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t 138 (144) .+.||+++++++.++|... +.|+++. +.+.+=.-+++. +..++++.|.|-= T Consensus 64 T~~eA~~La~~~~~~l~~~-----~~f~~~~--l~~~~d~ye~dT----~l~Rvsld~si~f 114 (114) T protein:vir:10 64 TYTEADSLADQIIDRVEST-----GMFSVGG--VSDLPDDYSSDT----GVFRVSLEISVQF 114 (114) T ss_pred CHHHHHHHHHHHHhhcccc-----cCeeeec--cccCCCCCCccc----CceEEEEEEEEeC Confidence 6789999998887766322 1233221 222221122221 2345677777655 No 47 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=97.49 E-value=1.7e-06 Score=52.22 Aligned_cols=111 Identities=12% Similarity=0.157 Sum_probs=62.5 Q ss_pred hhHHHHHHHHHHhhc-CCccc-ccCcCCCCCCEEEeCCceeeecCCCcc--cCccEEEEEEEEEeCCcchHHHHHHHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ-GINIW-DGVNKKPEYPFIKIGEELTSGRTISKD--AIGKMHNLTLHIWSDYDSSFEVKNLTDFL 88 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~-~~~Vy-D~vP~~a~~PYV~lG~~~~~~~d~t~~--~~g~~~~l~i~VWs~~~G~~eak~Ia~~V 88 (144) +..| -|++.|... +.||| +..|++++.|||++-.....+. ++-+ ......+++|+||+. .+.+|++++++| T Consensus 1 ~~~~--~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~-~~L~G~~~~~~~~vQIDvyA~--t~~~A~~l~~~v 75 (115) T protein:vir:14 1 MSVI--VIRDALQGIGGAKGYLGVAPAKAPAPYFVVTRVHGALD-MALAGLTGGRSGSYQIDCYAP--TFTDADRLADLA 75 (115) T ss_pred CeeE--eeehhhccccccccccccCCCCCCCCEEEEEeecCccc-ccccCCCCCcceEEEEEEeeC--CHHHHHHHHHHH Confidence 0011 134555543 56897 7889999999999854444443 2333 233689999999986 678999999999 Q ss_pred HHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee Q lcl|NC_018086. 89 VGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID 138 (144) Q Consensus 89 ~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t 138 (144) ++++...+. ..++.. +...+-..+++- +...+++.|.+-= T Consensus 76 ~~~~~~~~~----~~~~~~--~~~~~d~ye~dt----~lyR~s~D~~vWf 115 (115) T protein:vir:14 76 VDRAMSVQD----RFSVGG--VDELPDDYSEDT----GLFRISLELSVEF 115 (115) T ss_pred HHHHhcCcc----ceeeee--ecCCCCCCcccc----cceeeEEEEEEeC Confidence 887743321 111111 111111111221 1233566666544 No 48 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=97.41 E-value=2.1e-06 Score=51.73 Aligned_cols=111 Identities=12% Similarity=0.159 Sum_probs=62.7 Q ss_pred hhHHHHHHHHHHhhc-CCccc-ccCcCCCCCCEEEeCCceeeecCCCcc--cCccEEEEEEEEEeCCcchHHHHHHHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ-GINIW-DGVNKKPEYPFIKIGEELTSGRTISKD--AIGKMHNLTLHIWSDYDSSFEVKNLTDFL 88 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~-~~~Vy-D~vP~~a~~PYV~lG~~~~~~~d~t~~--~~g~~~~l~i~VWs~~~G~~eak~Ia~~V 88 (144) +..| -|++.|... +.||| +..|++++.|||++-.....+. ++-+ ......+++|+||+. ...+|++++++| T Consensus 1 ~~~~--~i~~aL~~l~~~RVyp~~aP~~~~~Pyiv~q~vsg~p~-~~L~G~~~~~~~~vQIDvyA~--t~~~A~~l~~~v 75 (115) T protein:vir:10 1 MSVI--VIRDALQGIGGAKGYLGVAPEKAPAPYFVVTRVHGALD-MALAGLTGGRSGSYQIDCYAP--TFTDADRLADLA 75 (115) T ss_pred CeeE--EeehhhcccCCceeecccCCCCCCCCEEEEEeecCccc-cccCCCCCCcceEEEEEEeeC--CHHHHHHHHHHH Confidence 0111 135566654 45897 7889999999999854444453 2333 233689999999986 678899999999 Q ss_pred HHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee Q lcl|NC_018086. 89 VGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID 138 (144) Q Consensus 89 ~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t 138 (144) ++++...+. .+++.. +...+-..+++ + +...+++.|.+-= T Consensus 76 ~~~~~~~~~----~~~~~~--~~~~~d~ye~d--t--~lyR~s~D~~vWf 115 (115) T protein:vir:10 76 VDRAMSVQD----RFSVGG--VDELPDDYSED--T--GLFRISLELSVEF 115 (115) T ss_pred HHHHhcCcc----ceeEee--ecCCCCCCccc--c--cceeeEEEEEEeC Confidence 887743221 111111 11111111122 1 1233566666544 No 49 >protein:vir:1387 Length: 116 # NCBI annotation: Gp10 protein # Family: family:all:517 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612839;genbank:gi:20065973;genbank:GeneID:935788 Probab=96.33 E-value=3.3e-05 Score=45.18 Aligned_cols=114 Identities=19% Similarity=0.290 Sum_probs=73.1 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccCcCCC-CCCEEEeCCceeeecCCCc-ccCccEEEEEEEEEeCCcch Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGVNKKP-EYPFIKIGEELTSGRTISK-DAIGKMHNLTLHIWSDYDSS 78 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~a-~~PYV~lG~~~~~~~d~t~-~~~g~~~~l~i~VWs~~~G~ 78 (144) |- - --+.+-|++.|+-.+.+|+.....+. ..|||+|=+-...|..-++ .....++.++|+|||.. . T Consensus 1 ~~---~-------m~I~~~i~~~Lk~i~ipV~~~~y~~~~~~~~Itf~~y~e~~~~yaDd~e~~t~~~iQVDI~sk~--~ 68 (116) T protein:vir:13 1 ME---D-------FDIIALVYECLECLNVPVIEGWYDEELNKTHITVHEYLEQDESFEDDEAREEEHNIQIDVWSKD--S 68 (116) T ss_pred CC---c-------cchhHHHHHHHhhcCCeeeecccCCCCccceEEEEeeecCCCcccCCeeeeEEEEEEEEEeecC--C Confidence 11 1 14566788899888888999876654 6899999776666543332 34577999999999963 4 Q ss_pred HHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccC Q lcl|NC_018086. 79 FEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTID 142 (144) Q Consensus 79 ~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~ 142 (144) .++++|..+|.++|. .-||. +.......+...+.. |=.+||.-+++ ++ T Consensus 69 ~~~~~l~~~V~~lMk------~~GF~----r~~~~d~ye~dt~iy-----hk~~RF~y~~e-l~ 116 (116) T protein:vir:13 69 LEAFKLKKAIKKLLK------KNNFY----FDSSEDFYETKTRIY-----HKGLRFSYISE-IS 116 (116) T ss_pred ccHHHHHHHHHHHHH------HcCCE----eeecCCCccchhhhh-----hhhhhheeeee-cC Confidence 456668888888873 34653 333333333333333 33588876643 44 No 50 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=95.87 E-value=0.00017 Score=41.22 Aligned_cols=110 Identities=11% Similarity=0.121 Sum_probs=60.7 Q ss_pred CChhhHHHHHHHHHHhhc-CCccc-ccCcCCCCCCEEEeCCceeeecCCCccc--CccEEEEEEEEEeCCcchHHHHHHH Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQ-GINIW-DGVNKKPEYPFIKIGEELTSGRTISKDA--IGKMHNLTLHIWSDYDSSFEVKNLT 85 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~-~~~Vy-D~vP~~a~~PYV~lG~~~~~~~d~t~~~--~g~~~~l~i~VWs~~~G~~eak~Ia 85 (144) -|.+ -|++.|.+- +.+.| -..|++++.|||++-....-++ .+-|+ .++..+++|+||.+ ..+++++++ T Consensus 1 ~~~~-----vir~al~~i~~~~~~~~vAp~~~~~pyivy~rvsga~e-~~L~G~ag~~~~~~QID~yA~--T~~ea~~La 72 (115) T protein:vir:80 1 MSVI-----VVRDALQGIGGAKGYLGVAPEKAPARYFVVTRVHGALD-MALAGPTGGRSGSYQIDCYAP--TFTDADRLA 72 (115) T ss_pred Ceee-----eeechhhhccccccceeeccccCcCCeEEEeecCCCcc-ccccCCCCCceeEEEEeeecC--CHHHHHHHH Confidence 0221 245556553 44565 3568999999999965544343 33333 36789999999975 678999999 Q ss_pred HHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCcc-EEEEEEEEEee Q lcl|NC_018086. 86 DFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNER-AYLFLDFEVID 138 (144) Q Consensus 86 ~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~-~~~~l~fri~t 138 (144) ++++.++...+- .|.+.+ +-+.||-.+.+.. ..+++.|.|-- T Consensus 73 ~~v~d~~~~~~~----~~~vg~-------l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 73 DLAVDRAMSVQD----RFSVGG-------VDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred HHHHHhhhCCcc----ccceec-------ccCCCcccccccceEEEEEEEEEeC Confidence 999997654332 222221 1122222222211 12333333221 No 51 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=95.50 E-value=0.0014 Score=36.20 Aligned_cols=127 Identities=16% Similarity=0.228 Sum_probs=77.5 Q ss_pred CCCCCCCChhhHHHHHHHHHHhhcC--CcccccCcCCCCCCEEEeCCceeeecCCCcccC-ccEEEEEEEEEeCCcchHH Q lcl|NC_018086. 4 RPPFRARSSSVALQRAIVKEIRAQG--INIWDGVNKKPEYPFIKIGEELTSGRTISKDAI-GKMHNLTLHIWSDYDSSFE 80 (144) Q Consensus 4 ~~~~~~~S~~~aLQ~AI~~~L~a~~--~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~-g~~~~l~i~VWs~~~G~~e 80 (144) -||..-.++..-+=+-+.++|.+.+ -.|+..+|.+-+..+|++-.. . +..... -...++.|+||.. ...+ T Consensus 1 ~~~i~~pda~~v~~~~lr~~l~a~~~~V~V~t~vP~~RP~rfV~Vert---g--G~~~~~~~Dr~~L~Vq~W~~--t~~~ 73 (131) T protein:vir:98 1 MPPILMPDAVAVIAGYLRAVLVARGVTVPVGSRVPSPRPARFVRIERI---G--GPANTVVTDRPRLDVHCWGS--SEED 73 (131) T ss_pred CCCccCCchhHHHHHHHHHHHHhcCCceEecccCCCCCCceEEEEEec---C--CCcCCccccceEEEEEecCC--CHHH Confidence 3577777887777777788886643 479999999999999999322 1 112223 3678899999974 6789 Q ss_pred HHHHHHHHHHHhcCCccccCCCceE-EEEEEeeeeeeecccccccCccEEEEEEEEEeeccc Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCI-GKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTI 141 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~-~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~ 141 (144) |.+++..+++.|-..+-.+ |... ...+..+...+.|++ ++..|-..+++..+--+.+ T Consensus 74 A~~La~~vr~~ll~~~~~~--g~~~~~~~e~~gpy~~PD~e--s~~~Ryq~tv~l~~r~~~~ 131 (131) T protein:vir:98 74 AHDLMQLCRALLGAARGSH--GDTVLARPATGGPQFLPDAE--TGAARWAFTLDITMRGHAL 131 (131) T ss_pred HHHHHHHHHHHHhhccccc--chheeccccCCCCCcCCCCC--CCCceeEEEEEEEeeeccC Confidence 9999999999774222111 2222 123333344444554 3333333344433333333 No 52 >protein:vir:98343 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918935;genbank:gi:119443697;genbank:GeneID:4594505 Probab=95.18 E-value=0.00039 Score=39.25 Aligned_cols=122 Identities=15% Similarity=0.222 Sum_probs=64.3 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccCcCCCCCCEEEeCCceeeecCCCc-ccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGVNKKPEYPFIKIGEELTSGRTISK-DAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~-~~~g~~~~l~i~VWs~~~G~~ 79 (144) |-..+ ..+++. ..+-|-..|+..+.+|.+.-=.+..-|||+|=+-...+.+-++ .....+|.++|+||...+. T Consensus 2 ~~~~k--~l~~~~--I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk~d-- 75 (126) T protein:vir:98 2 INVTK--LIRNAI--IANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDE-- 75 (126) T ss_pred ccchh--hhhhhH--HHHhhhhhhhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecCCC-- Confidence 21111 122222 3333334444445566665556778899999766554433332 3446799999999543333 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEE--EeecccCCC Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFE--VIDSTIDPY 144 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fr--i~t~~~~~~ 144 (144) -.+|+..|.++|. .-||. |.......+...+..+. .+||| +++.-++-- T Consensus 76 -~~~l~~~V~~lMk------~~GF~----r~~~~dlYE~DtklyHk-----~~RF~~~~~~~~~~~~ 126 (126) T protein:vir:98 76 -PNEQAEKIVELLK------VINFQ----CYYREPLYESDVMSFRH-----IIRAKGSILSMKLEEN 126 (126) T ss_pred -HHHHHHHHHHHHH------HcCCe----eeecCCCccchhhhhee-----eeeeeeeecceeeccC Confidence 2336677777772 23653 44455555544444433 57776 333333333 No 53 >protein:vir:9415 Length: 126 # NCBI annotation: phi PVL orf 12-like protein # Family: family:all:517 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803393;genbank:gi:29028705;genbank:GeneID:1258143 Probab=95.18 E-value=0.00039 Score=39.25 Aligned_cols=122 Identities=15% Similarity=0.222 Sum_probs=64.3 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccCcCCCCCCEEEeCCceeeecCCCc-ccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGVNKKPEYPFIKIGEELTSGRTISK-DAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~-~~~g~~~~l~i~VWs~~~G~~ 79 (144) |-..+ ..+++. ..+-|-..|+..+.+|.+.-=.+..-|||+|=+-...+.+-++ .....+|.++|+||...+. T Consensus 2 ~~~~k--~l~~~~--I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk~d-- 75 (126) T protein:vir:94 2 INVTK--LIRNAI--IANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQDE-- 75 (126) T ss_pred ccchh--hhhhhH--HHHhhhhhhhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecCCC-- Confidence 21111 122222 3333334444445566665556778899999766554433332 3446799999999543333 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEE--EeecccCCC Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFE--VIDSTIDPY 144 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fr--i~t~~~~~~ 144 (144) -.+|+..|.++|. .-||. |.......+...+..+. .+||| +++.-++-- T Consensus 76 -~~~l~~~V~~lMk------~~GF~----r~~~~dlYE~DtklyHk-----~~RF~~~~~~~~~~~~ 126 (126) T protein:vir:94 76 -PNEQAEKIVELLK------VINFQ----CYYREPLYESDVMSFRH-----IIRAKGSILSMKLEEN 126 (126) T ss_pred -HHHHHHHHHHHHH------HcCCe----eeecCCCccchhhhhee-----eeeeeeeecceeeccC Confidence 2336677777772 23653 44455555544444433 57776 333333333 No 54 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=95.03 E-value=0.0014 Score=36.28 Aligned_cols=127 Identities=13% Similarity=0.119 Sum_probs=69.2 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCC---cccc-----cCcC-CCCCC--EEEeCCceeeecCCCcccCccEEE--- Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGI---NIWD-----GVNK-KPEYP--FIKIGEELTSGRTISKDAIGKMHN--- 66 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~---~VyD-----~vP~-~a~~P--YV~lG~~~~~~~d~t~~~~g~~~~--- 66 (144) ||++=.| +|+.++|.+||++.+. .|+- .+++ +..-| ||+++..+..+...-....+.... T Consensus 1 ~~~~~d~------~a~~~~IierLka~v~~l~~V~~aadla~i~e~~q~tPaayVv~~gd~~~~~~~~~~~~~~~Q~vtq 74 (157) T protein:vir:79 1 MSDPFDY------LFLEPLLIERIRSEVPGLAIVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADYQGGRRAIQAIGQ 74 (157) T ss_pred CCCchhh------hhhhHHHHHHHHhhhhhhhhhccccchhhhhhhcCCCcEEEEEecccccCCCcccccCcceeeeeee Confidence 9998788 8999999999998743 2222 2232 12234 898876654332111111112111 Q ss_pred -EEEEEE----e-CCcc---hHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccc-cccCccEEEEEEEEE Q lcl|NC_018086. 67 -LTLHIW----S-DYDS---SFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANG-TYKNERAYLFLDFEV 136 (144) Q Consensus 67 -l~i~VW----s-~~~G---~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg-~~~~~~~~~~l~fri 136 (144) +.+-|- . ...| ..++.++.++|++||.++... .++.. +++.-.+.. ....|-.+..+.|.+ T Consensus 75 ~f~Vvlavrn~~~~~~~~a~~d~ag~ll~~v~~AL~GW~P~--~~~~p-------l~~~~~~~~~~y~~gf~yypl~F~~ 145 (157) T protein:vir:79 75 QWAVVLVVHYADSSNSGEGARREAGPLLGRLVKALTGWAPA--IDVAP-------LARSARQSPVTYASGYFYFPLVFTA 145 (157) T ss_pred eEEEEEEEeccccccccchhHHHHHHHHHHHHHHhcCcccc--ccCCc-------eeeeecCCcccccCCeEEEEEEEEE Confidence 111111 1 1122 357999999999999888653 33221 122211221 245566777888886 Q ss_pred eecccCCC Q lcl|NC_018086. 137 IDSTIDPY 144 (144) Q Consensus 137 ~t~~~~~~ 144 (144) ..+=|- T Consensus 146 --~~~~~~ 151 (157) T protein:vir:79 146 --RFVYPR 151 (157) T ss_pred --eeeccc Confidence 455555 No 55 >protein:vir:9931 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:2393 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795693;genbank:gi:28876455;genbank:GeneID:1258023 Probab=95.02 E-value=0.00075 Score=37.73 Aligned_cols=114 Identities=16% Similarity=0.226 Sum_probs=74.8 Q ss_pred CCCChhhHHHHHHHHHHhhcCCcccccCcC-CCCCCEEEeCCceeeecCCCccc-Ccc---EEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 8 RARSSSVALQRAIVKEIRAQGINIWDGVNK-KPEYPFIKIGEELTSGRTISKDA-IGK---MHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 8 ~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~-~a~~PYV~lG~~~~~~~d~t~~~-~g~---~~~l~i~VWs~~~G~~eak 82 (144) -..||+..+-+-|+.+|...+-+||-..|. +..-||+++|.... | + +++. .|+ ..+++|+++=.-.+|..+. T Consensus 1 md~sp~t~~Lk~i~~kL~~~~IPiYfkLP~sdi~EPF~ViGsh~~-D-d-sktA~~Ga~ivdt~lqIDlFyp~~sR~d~e 77 (119) T protein:vir:99 1 MDYSLETLYLKKVKNRLGVLDIPIYFKLPKSDVLEPFIVVGTNIS-D-L-SKTAQTGAVIDDFSLNIDAFLPGDSRLDAE 77 (119) T ss_pred CCcchhhHHHHHHHHhhcccCcceEEeCCCCCcCCceEEEecccC-c-c-ccccccceEEEeeeEEEEEeecCcccccHH Confidence 255889999999999999988899999997 46899999998653 2 2 3332 344 5679999999889999999 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeeccc Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTI 141 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~ 141 (144) +|-.++..+|... - ++...-..+..-|++ +--+-|++-..-- T Consensus 78 eiks~~~~~l~r~-~-----------~it~qil~DnSIGRe-----VYhV~f~isd~i~ 119 (119) T protein:vir:99 78 EIKSRMLRLLGRN-N-----------QIKAQILVDNSIGRE-----VYRVAINITETLF 119 (119) T ss_pred HHHHHHHHHhhhh-h-----------hhhhcccccccccce-----eeeeeeEeeeecC Confidence 9999998887321 1 111111222223332 2223333322111 No 56 >protein:vir:99226 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950461;genbank:gi:119953662;genbank:GeneID:4643086 Probab=94.80 E-value=0.0017 Score=35.73 Aligned_cols=127 Identities=13% Similarity=0.103 Sum_probs=70.0 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCc---cc-----ccCcC---CCCCCEEEeCCceeeecCCCcccCccEEEEE- Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGIN---IW-----DGVNK---KPEYPFIKIGEELTSGRTISKDAIGKMHNLT- 68 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~---Vy-----D~vP~---~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~- 68 (144) ||++=.| +|+.++|.+||++.+.. |+ ..+++ .++-=||+++..+..+..+.....+....++ T Consensus 1 ~~~~~d~------~a~~~~IierLka~vp~l~~V~~aadla~i~~~~q~tPaayVi~~gd~~~~~~~~~~~~~~~Q~i~q 74 (157) T protein:vir:99 1 MSDPFDY------LFLEPLLIERIRSEVPGLAIVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADHQGGRRAIQAIGQ 74 (157) T ss_pred CCCchhh------hhhhHHHHHHHHhhhhHHHhhhcccchHHHhhccCCCcEEEEEecccccCCCcccccccceeeeeee Confidence 9998788 89999999999987431 22 22332 1223389987665432111111222212222 Q ss_pred ----EEEEeCC----cc---hHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccc-cccCccEEEEEEEEE Q lcl|NC_018086. 69 ----LHIWSDY----DS---SFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANG-TYKNERAYLFLDFEV 136 (144) Q Consensus 69 ----i~VWs~~----~G---~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg-~~~~~~~~~~l~fri 136 (144) +-|-..+ .| ..++.++.++|++||.++... .+.. -++..-.+.. ....|-.+..+.|.+ T Consensus 75 ~~~Vvlavr~~~~~~~g~~a~d~ag~ll~~v~~AL~GW~P~--~~~~-------pl~~~~~~~~~~y~~gf~yypl~F~~ 145 (157) T protein:vir:99 75 QWAVVLVVHYADSSNSGEGARREAGPLLGRLVKALTGWAPA--IDVA-------PLARSARQSPVTYASGYFYFPLVFTA 145 (157) T ss_pred eEEEEEEEeccccccccchhHHHHHHHHHHHHHHhcCCcCc--ccCC-------ceeeeecCCcccccCceEEEEEEEEE Confidence 2222211 23 367999999999999888543 2211 1122111121 245666778888886 Q ss_pred eecccCCC Q lcl|NC_018086. 137 IDSTIDPY 144 (144) Q Consensus 137 ~t~~~~~~ 144 (144) ..+=|- T Consensus 146 --~~~~~~ 151 (157) T protein:vir:99 146 --RFVYPR 151 (157) T ss_pred --eeeccc Confidence 555565 No 57 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=94.57 E-value=0.0013 Score=36.36 Aligned_cols=110 Identities=13% Similarity=0.142 Sum_probs=70.9 Q ss_pred HHHHHHHHHHhhc-CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHHHHHHhc Q lcl|NC_018086. 15 ALQRAIVKEIRAQ-GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDFLVGLLI 93 (144) Q Consensus 15 aLQ~AI~~~L~a~-~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~V~~aL~ 93 (144) =+-.-|.+.|.++ +-++|=.+|++.+-+||++ |- + .+.+.......++-+++|.. ++.+|.+++..|+++|. T Consensus 1 miE~~v~~~L~~~l~vpv~~e~p~~~p~~FV~v-Er--t--GG~~~~~~~~~~lAVQ~~~~--S~~eAa~La~~v~~~~~ 73 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYVLF-EK--T--GSSKSNHLLSSTFAFQSYAP--SMYEAAKLNEQLKEVVE 73 (111) T ss_pred ChHHhHHHHHhhcCCcceEeecCCCCCCceEEE-Ee--c--CCccccccccceEEEEecch--hHHHHHHHHHHHHHHHh Confidence 1223455666665 6689989999999999999 21 1 23445566788899999964 56699999999999994 Q ss_pred CCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeec Q lcl|NC_018086. 94 NSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDS 139 (144) Q Consensus 94 ~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~ 139 (144) +- ..++ .+.++++.+.--+.|+ -+++-| -.+.|++.+. T Consensus 74 ~l-~~~~---~i~~v~~~s~Ynf~d~--~tk~~R--YQav~~i~~~ 111 (111) T protein:vir:94 74 RL-IELN---EISNVSLNSDYNFTDT--ETKEYR--YQAVFDINHY 111 (111) T ss_pred hc-cccc---ccceeecCCCcccCCC--cCCCce--EEEEEEEeeC Confidence 33 3343 3446666655444333 232222 2456666666 No 58 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=94.52 E-value=0.0015 Score=36.12 Aligned_cols=110 Identities=14% Similarity=0.149 Sum_probs=70.2 Q ss_pred HHHHHHHHHHhhc-CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHHHHHHhc Q lcl|NC_018086. 15 ALQRAIVKEIRAQ-GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDFLVGLLI 93 (144) Q Consensus 15 aLQ~AI~~~L~a~-~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~V~~aL~ 93 (144) =+-.-|.+-|.+. +-++|=.+|++.+-+||++ |- + .+.+.......+|.+++|.. ++.+|.+++..|+++|. T Consensus 1 miE~~i~~~L~~~l~Vpv~~e~p~~~P~~FV~v-Er--t--GG~~~~~~~~~~lAVq~w~~--S~~eAa~La~~v~~~l~ 73 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYILF-EK--T--GSSKSNHLLSSTFAFQSYAP--SMYEAAKLNEQLKEVVE 73 (111) T ss_pred ChHHhHHHHHhhcCCceeEeecCCCCCCceEEE-Ee--c--CCccccccccceEEEEecch--hHHHHHHHHHHHHHHHh Confidence 1223455666665 6689999999999999999 21 1 23345566788899999974 56699999999999994 Q ss_pred CCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeec Q lcl|NC_018086. 94 NSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDS 139 (144) Q Consensus 94 ~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~ 139 (144) +- ..++ .+.++++.+.=-+ +|+-+++-| -.+.|++.+. T Consensus 74 ~l-~~~~---~I~av~~~s~ynf--~d~~tk~~R--YQav~~i~~~ 111 (111) T protein:vir:16 74 RL-IELN---EISNVSLNSDYNF--TDTETKEYR--YQAVFDINHY 111 (111) T ss_pred hc-cccc---cceeeecCCCCcC--CCCCCCCce--EEEEEEEeeC Confidence 33 3333 3456666554333 333333322 2455666666 No 59 >protein:vir:96002 Length: 133 # NCBI annotation: ORF024 # Family: family:all:508 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239806;genbank:gi:66395472;genbank:GeneID:5132919 Probab=93.68 E-value=0.0011 Score=36.89 Aligned_cols=110 Identities=13% Similarity=0.201 Sum_probs=67.0 Q ss_pred hhHHHHHHHHHHhhcC--------Cccc-ccCcCC--CCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCC--- Q lcl|NC_018086. 13 SVALQRAIVKEIRAQG--------INIW-DGVNKK--PEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDY--- 75 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~~--------~~Vy-D~vP~~--a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~--- 75 (144) +-.+-.-||+.|+++. .+|+ =.+|+. ..-|||+|-+... ..+.+ ......++.++|+|||.+ T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~~Ik~~~~Pe~~d~~~p~IvI~pi~~p~p~~f~s-n~~ls~~~~~QIDV~sk~~~~ 79 (133) T protein:vir:96 1 MIDILMEVYNILKSDDDLMRLIDKKNIKFNQYPDVKDKMAPYIVIDDYDDPIPEWHSD-GDRIAYNYAFQIDVMVKASDA 79 (133) T ss_pred CcchHHHHHHHhhcchHHHHhcCccceEEeecCCccccccceEEEecCCCCCcccccC-cceeeeEEEEEEeeeeecccc Confidence 3556677888887762 2344 345653 3579999976543 23322 344578999999999976 Q ss_pred -cchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeee--eeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 76 -DSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHV--RYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 76 -~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~--r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) .+|..+++|..+|...|-. .||. +..+. ++..+.. +... +=|||-. || T Consensus 80 ~~~R~~~~~i~~rI~~~m~~------~gf~----Q~~~~~deYd~et~-~y~~-----aRRYrg~-----~Y 130 (133) T protein:vir:96 80 YNARKRRNEISNRISELLWK------NQMK----QIRNLGNEYDKNLA-LYRS-----TRRYEAI-----FY 130 (133) T ss_pred ccchhhhHHHHHHHHHHHHH------cCce----ecCCCccccchhhh-hhhh-----hheeecc-----cc Confidence 6899999999999999832 2442 12221 1222221 1222 3466654 66 No 60 >protein:vir:103883 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938246;genbank:gi:38229151;genbank:GeneID:2648198 Probab=93.62 E-value=0.0026 Score=34.75 Aligned_cols=130 Identities=19% Similarity=0.192 Sum_probs=68.0 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCc---ccc-----cCc---CCCCCCEEEeCCceeeecCCCcccCccEEEEEE Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGIN---IWD-----GVN---KKPEYPFIKIGEELTSGRTISKDAIGKMHNLTL 69 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~---VyD-----~vP---~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i 69 (144) |+-..||-. +|+..+|.+||++.+.. |+- .+. .-++-=||.+....-.+..+.....+....++. T Consensus 1 ~~~~~~~n~----lav~~~IieRLka~v~~lr~V~~aadla~i~el~q~tPaayV~~~g~~~~~~~~~~~~~~~~q~v~q 76 (159) T protein:vir:10 1 MSTAEPFDY----LFLETLLVERIRAEVPGLQDVSGVPDLATLDEQRQGSPCVYVVYLGDEIGTGASHQGGSRAIQTVTQ 76 (159) T ss_pred CCcccchhh----hhhhHHHHHHHHhhhhHHHhhhcccchHHHHhhhCCCcEEEEEecccccCCCcccccccceeeeeee Confidence 766666621 68899999999987542 221 112 122334888865543221111122333232222 Q ss_pred EE-----EeCCc-------chHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 70 HI-----WSDYD-------SSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 70 ~V-----Ws~~~-------G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) +- -..+. ...++.++.++|+++|.++.- +++... ..|...+..-.+..|..|..+.|.+ T Consensus 77 ~w~Vvlavr~~~~q~~~~a~~d~aG~ll~~v~~AL~GW~P--~~~~~P------l~r~~~~~~~~y~~gfayyPl~F~~- 147 (159) T protein:vir:10 77 HWAAVLTLYYADAQGDGQGARREAGPLLGRLLKALTGWVP--DQGVTP------LARSPQASPVSYSNGFFYFPLVFTA- 147 (159) T ss_pred EEEEEEEEecccccCccchhhHHHHHHHHHHHHHhcCccc--CCcCCC------eeecccCCCccccCCEEEeeeeEEe- Confidence 21 12111 134689999999999988754 233211 1112112122355777888888886 Q ss_pred ecccCCC Q lcl|NC_018086. 138 DSTIDPY 144 (144) Q Consensus 138 t~~~~~~ 144 (144) ..+=|- T Consensus 148 -~~~~~~ 153 (159) T protein:vir:10 148 -NFVFPR 153 (159) T ss_pred -eeeccc Confidence 444444 No 61 >protein:vir:101303 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908836;genbank:gi:118725100;genbank:GeneID:4555874 Probab=93.55 E-value=0.0015 Score=36.07 Aligned_cols=111 Identities=17% Similarity=0.243 Sum_probs=65.1 Q ss_pred hhHHHHHHHHHHhhcC--------Cccc-ccCcCC--CCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCC--- Q lcl|NC_018086. 13 SVALQRAIVKEIRAQG--------INIW-DGVNKK--PEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDY--- 75 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~~--------~~Vy-D~vP~~--a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~--- 75 (144) +-.+-.-||+.|+++. .+|+ =.+|+. ..-|||+|-+... ..+. +......+..++|+||+.+ T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~~~p~IvI~pl~~p~p~~~~-sn~~ls~~~~~QIDV~~k~~~~ 79 (135) T protein:vir:10 1 MIDILYKVHEVISQDRIIREHVNINNIKFNKYPNVKDTDVPFIVIDDIDDPIPTTYT-DGDECAYSYIVQIDVFVKYNDE 79 (135) T ss_pred CcchHHHHHHHhhcchHHHhhcCccceEEEecCCccccccceEEEecCCCCCCcccc-CchhceeeeeEEEeeeeecccc Confidence 3566677888887762 3554 345654 3579999965432 1232 2344578999999999965 Q ss_pred -cchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEee--eeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 76 -DSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDH--VRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 76 -~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~--~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) .+|.++++|..+|+..|-.. + ||. +..+ =++..+..+ ... +-|||-. || T Consensus 80 ~~~R~~~~~i~~~I~~~l~~~---~--~f~----q~s~~ldeY~~et~~-y~~-----aRRYrG~-----~Y 131 (135) T protein:vir:10 80 YNARIIRNKISNRIQKLLWSE---L--KMG----NVSNGKPEYIEEFKT-YRS-----SRVYEGI-----FY 131 (135) T ss_pred cchhhHHHHHHHHHHHHHHHH---c--Ccc----ccCCCCccchhhhhh-hhh-----hheeeee-----cc Confidence 45899999999999998211 1 221 1111 123333222 222 3455543 45 No 62 >protein:vir:100675 Length: 135 # NCBI annotation: 77ORF027 # Family: family:all:508 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958611;genbank:gi:41189540;genbank:GeneID:2743821 Probab=93.55 E-value=0.0015 Score=36.07 Aligned_cols=111 Identities=17% Similarity=0.243 Sum_probs=65.1 Q ss_pred hhHHHHHHHHHHhhcC--------Cccc-ccCcCC--CCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCC--- Q lcl|NC_018086. 13 SVALQRAIVKEIRAQG--------INIW-DGVNKK--PEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDY--- 75 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~~--------~~Vy-D~vP~~--a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~--- 75 (144) +-.+-.-||+.|+++. .+|+ =.+|+. ..-|||+|-+... ..+. +......+..++|+||+.+ T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~~~p~IvI~pl~~p~p~~~~-sn~~ls~~~~~QIDV~~k~~~~ 79 (135) T protein:vir:10 1 MIDILYKVHEVISQDRIIREHVNINNIKFNKYPNVKDTDVPFIVIDDIDDPIPTTYT-DGDECAYSYIVQIDVFVKYNDE 79 (135) T ss_pred CcchHHHHHHHhhcchHHHhhcCccceEEEecCCccccccceEEEecCCCCCCcccc-CchhceeeeeEEEeeeeecccc Confidence 3566677888887762 3554 345654 3579999965432 1232 2344578999999999965 Q ss_pred -cchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEee--eeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 76 -DSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDH--VRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 76 -~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~--~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) .+|.++++|..+|+..|-.. + ||. +..+ =++..+..+ ... +-|||-. || T Consensus 80 ~~~R~~~~~i~~~I~~~l~~~---~--~f~----q~s~~ldeY~~et~~-y~~-----aRRYrG~-----~Y 131 (135) T protein:vir:10 80 YNARIIRNKISNRIQKLLWSE---L--KMG----NVSNGKPEYIEEFKT-YRS-----SRVYEGI-----FY 131 (135) T ss_pred cchhhHHHHHHHHHHHHHHHH---c--Ccc----ccCCCCccchhhhhh-hhh-----hheeeee-----cc Confidence 45899999999999998211 1 221 1111 123333222 222 3455543 45 No 63 >protein:vir:9514 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835561;genbank:gi:30043946;genbank:GeneID:1260543 Probab=93.55 E-value=0.0015 Score=36.07 Aligned_cols=111 Identities=17% Similarity=0.243 Sum_probs=65.1 Q ss_pred hhHHHHHHHHHHhhcC--------Cccc-ccCcCC--CCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCC--- Q lcl|NC_018086. 13 SVALQRAIVKEIRAQG--------INIW-DGVNKK--PEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDY--- 75 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~~--------~~Vy-D~vP~~--a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~--- 75 (144) +-.+-.-||+.|+++. .+|+ =.+|+. ..-|||+|-+... ..+. +......+..++|+||+.+ T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~~~p~IvI~pl~~p~p~~~~-sn~~ls~~~~~QIDV~~k~~~~ 79 (135) T protein:vir:95 1 MIDILYKVHEVISQDRIIREHVNINNIKFNKYPNVKDTDVPFIVIDDIDDPIPTTYT-DGDECAYSYIVQIDVFVKYNDE 79 (135) T ss_pred CcchHHHHHHHhhcchHHHhhcCccceEEEecCCccccccceEEEecCCCCCCcccc-CchhceeeeeEEEeeeeecccc Confidence 3566677888887762 3554 345654 3579999965432 1232 2344578999999999965 Q ss_pred -cchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEee--eeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 76 -DSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDH--VRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 76 -~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~--~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) .+|.++++|..+|+..|-.. + ||. +..+ =++..+..+ ... +-|||-. || T Consensus 80 ~~~R~~~~~i~~~I~~~l~~~---~--~f~----q~s~~ldeY~~et~~-y~~-----aRRYrG~-----~Y 131 (135) T protein:vir:95 80 YNARIIRNKISNRIQKLLWSE---L--KMG----NVSNGKPEYIEEFKT-YRS-----SRVYEGI-----FY 131 (135) T ss_pred cchhhHHHHHHHHHHHHHHHH---c--Ccc----ccCCCCccchhhhhh-hhh-----hheeeee-----cc Confidence 45899999999999998211 1 221 1111 123333222 222 3455543 45 No 64 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=93.13 E-value=0.0036 Score=33.96 Aligned_cols=110 Identities=10% Similarity=0.061 Sum_probs=68.3 Q ss_pred hhHHHHHHHHHHhh-cCCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHHHHHHHHHHHH Q lcl|NC_018086. 13 SVALQRAIVKEIRA-QGINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEVKNLTDFLVGL 91 (144) Q Consensus 13 ~~aLQ~AI~~~L~a-~~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~eak~Ia~~V~~a 91 (144) |. -.-|..-|.. .+..+|=.+|++.|-+||++ |- + .+.+.......+|.+++|. ++..+|.+++..|+++ T Consensus 1 mi--E~~v~~~L~~~l~vpv~~~vp~~~P~~FV~v-Er--t--GG~~~~~~~~p~laVq~wg--~S~~~Aa~La~~v~~a 71 (111) T protein:vir:95 1 MI--EIIINKYLDGHLDVPSFFEHEAEAPDSFVII-QK--T--GGKERNHSGSATFAFQSYA--PTMQKAAELNVKVKSA 71 (111) T ss_pred Ch--HHhHHHHhhhhcCeeEEeecCCCCCCceEEE-Ee--e--CCccccccccceEEEEecc--ccHHHHHHHHHHHHHH Confidence 12 2234444533 35678889999999999999 21 1 2344556678889999996 4678899999999999 Q ss_pred hcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeec Q lcl|NC_018086. 92 LINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDS 139 (144) Q Consensus 92 L~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~ 139 (144) +.+. ..++ .+.+.++.+.-.+.|++ +++-|- .+.|++.+. T Consensus 72 ~~~l-~~~~---~i~~v~~~s~ynf~d~~--tk~~RY--Q~~~~i~~~ 111 (111) T protein:vir:95 72 VKGL-IELD---SICGVHLNSDYNFTDTE--TKQYRY--QAVFDINYF 111 (111) T ss_pred Hhhh-hccc---cccccccCCccccCCCC--CCCceE--EEEEEEEeC Confidence 8443 3343 34456666655554443 323222 345555555 No 65 >protein:vir:108220 Length: 133 # NCBI annotation: gp14 # Family: family:all:6424 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552343;genbank:gi:160700663;genbank:GeneID:5758940 Probab=92.67 E-value=0.01 Score=31.44 Aligned_cols=126 Identities=17% Similarity=0.158 Sum_probs=73.3 Q ss_pred CCCC-CCCCCCChhhHHHHHHHHHHhhcCCcccccCcCC----CCCCEEEeCC-ceeeecCCCcccCccEEEEEEEEEeC Q lcl|NC_018086. 1 MSKR-PPFRARSSSVALQRAIVKEIRAQGINIWDGVNKK----PEYPFIKIGE-ELTSGRTISKDAIGKMHNLTLHIWSD 74 (144) Q Consensus 1 m~~~-~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~----a~~PYV~lG~-~~~~~~d~t~~~~g~~~~l~i~VWs~ 74 (144) |+-. -|- .+..++-.-+.++|-++. .+=|.+|++ ..-|+|++++ ...++|. . .....+.+.||++ T Consensus 1 m~~~Rvp~---D~~~~Ik~~L~~~l~a~v-~~~~~lPddW~~~s~~P~vvV~dDggpv~wp---v--~t~~~IRvtv~a~ 71 (133) T protein:vir:10 1 MSDVRVVG---DPVPPVKAYLAAFWGARV-RIADEVPDDWHVETDVPLIVVDDDGGPIDWP---V--KSDPLVRCGIYAN 71 (133) T ss_pred CCCcccCC---CChHHHHHHHHhhccccc-eeeeecCCCccccCCceEEEEecCCCccccc---e--eccceEEEEEeec Confidence 6532 111 333344333444555554 477888875 4669887743 3333332 1 1233467778885 Q ss_pred CcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEE-eeeeeeecccccccCccEEEEEEEEEeecccCC Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKEL-DHVRYTEAANGTYKNERAYLFLDFEVIDSTIDP 143 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~-~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~ 143 (144) ||.+|++|+.+...+|-.... .|. ++.+ .++-++.+.|..+....+..+++-+.-|+.|-- T Consensus 72 --gr~~Ar~l~~~~~g~LLa~~i---~Gv---a~ii~~g~glL~aRD~~tgg~iAsfTV~A~~rt~~~~~ 133 (133) T protein:vir:10 72 --GKQTAKNLRRITMGALLAEPI---PGI---AHIQRTGIGYVDARDPDTGADIASFTVTATVRTEVITV 133 (133) T ss_pred --CChhHHHHHHHHHHHHhcCCC---Cce---eEEcCCCceEEecCCCCCCceEEEEEEEeeeeeeEeeC Confidence 889999999999999854443 143 2223 233466666655544445567888777776655 No 66 >protein:vir:81093 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429879;genbank:gi:156603932;genbank:GeneID:5525313 Probab=92.66 E-value=0.0022 Score=35.20 Aligned_cols=118 Identities=15% Similarity=0.219 Sum_probs=61.2 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-----CCcccccCcCCCCCCEEEeCCceeeecCCCc-ccCccEEEEEEEEEeC Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-----GINIWDGVNKKPEYPFIKIGEELTSGRTISK-DAIGKMHNLTLHIWSD 74 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-----~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~-~~~g~~~~l~i~VWs~ 74 (144) |--= ...|..++...|..+ ..+|.+.-=++..-|||+|-+-...+..-++ .....++.++|+||+. T Consensus 1 ~~~~--------~~~i~n~~I~~li~~~Lk~~nvPV~~~~y~~~~ktyItf~ey~~~~~~yADd~e~~t~~~iQIDIW~s 72 (126) T protein:vir:81 1 MINV--------TELIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWS 72 (126) T ss_pred Ccch--------HHhhhhhHHHhhhhhceeeccceeccccccCCCCcEEEEEeecCCCCccccCeeeeeEEEEEEEEeeC Confidence 2211 112333333333222 3344444446777899999766555433332 2446799999999954 Q ss_pred CcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEE--EeecccCCC Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFE--VIDSTIDPY 144 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fr--i~t~~~~~~ 144 (144) .+. ..++...|.++|. .-||. |.......+......+. .+||| +++.-++-- T Consensus 73 k~~---~~~l~~~V~~~Mk------~~GF~----R~~~~d~YE~DtklyHk-----~~Rf~~~~~~~~~~~~ 126 (126) T protein:vir:81 73 QDE---PNEQAEKIVELLK------VINFQ----CYYREPLYESDVMSFRH-----IIRAKGSILSMKLEEN 126 (126) T ss_pred CCC---HHHHHHHHHHHHH------HcCCe----eeecCCCccchhhhhhe-----eeeeeeeccceeeccC Confidence 433 3456666777772 23663 44444555544444433 46676 333333333 No 67 >protein:vir:80001 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430007;genbank:gi:156604062;genbank:GeneID:5525461 Probab=92.66 E-value=0.0022 Score=35.20 Aligned_cols=118 Identities=15% Similarity=0.219 Sum_probs=61.2 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-----CCcccccCcCCCCCCEEEeCCceeeecCCCc-ccCccEEEEEEEEEeC Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-----GINIWDGVNKKPEYPFIKIGEELTSGRTISK-DAIGKMHNLTLHIWSD 74 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-----~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~-~~~g~~~~l~i~VWs~ 74 (144) |--= ...|..++...|..+ ..+|.+.-=++..-|||+|-+-...+..-++ .....++.++|+||+. T Consensus 1 ~~~~--------~~~i~n~~I~~li~~~Lk~~nvPV~~~~y~~~~ktyItf~ey~~~~~~yADd~e~~t~~~iQIDIW~s 72 (126) T protein:vir:80 1 MINV--------TELIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWS 72 (126) T ss_pred Ccch--------HHhhhhhHHHhhhhhceeeccceeccccccCCCCcEEEEEeecCCCCccccCeeeeeEEEEEEEEeeC Confidence 2211 112333333333222 3344444446777899999766555433332 2446799999999954 Q ss_pred CcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEE--EeecccCCC Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFE--VIDSTIDPY 144 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fr--i~t~~~~~~ 144 (144) .+. ..++...|.++|. .-||. |.......+......+. .+||| +++.-++-- T Consensus 73 k~~---~~~l~~~V~~~Mk------~~GF~----R~~~~d~YE~DtklyHk-----~~Rf~~~~~~~~~~~~ 126 (126) T protein:vir:80 73 QDE---PNEQAEKIVELLK------VINFQ----CYYREPLYESDVMSFRH-----IIRAKGSILSMKLEEN 126 (126) T ss_pred CCC---HHHHHHHHHHHHH------HcCCe----eeecCCCccchhhhhhe-----eeeeeeeccceeeccC Confidence 433 3456666777772 23663 44444555544444433 46676 333333333 No 68 >protein:vir:6374 Length: 179 # NCBI annotation: hypothetical protein # Family: family:all:29418 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918987;genbank:gi:34610162;genbank:gi:91214208;genbank:GeneID:2559591 Probab=92.51 E-value=0.0013 Score=36.33 Aligned_cols=135 Identities=14% Similarity=0.186 Sum_probs=76.7 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhh----c------CCcccc-cC--cCCCCCCEEEeCCceee------ecCCCcccC Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRA----Q------GINIWD-GV--NKKPEYPFIKIGEELTS------GRTISKDAI 61 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a----~------~~~VyD-~v--P~~a~~PYV~lG~~~~~------~~d~t~~~~ 61 (144) |-++| + -+-+-|++-++|.. . ++.||- ++ -++.+-|.|+|=|.+-- |..+..++. T Consensus 1 ~~~~p-~-----~l~i~k~LTs~L~~iT~aNGy~fDl~~~vfRgR~~fg~~~p~P~vsilE~~~p~~~lg~d~ng~vq~~ 74 (179) T protein:vir:63 1 MQRDP-K-----KLVILKKLTAHLEGVTPTNGFQFDLSSGIYRNRVQFGAETPAPAVSILEAQRPDHGLDADENGQAQSE 74 (179) T ss_pred CCCCc-h-----hhhhhHHHHHHhhhcccccccccchhhhhhhcceeecCCCCCcEEEeecccCCccccCCCCCCccccc Confidence 66554 4 35666788888854 2 346774 33 35678899988774321 111222334 Q ss_pred ccEEEEEEEEEeC-----CcchHHHHHHHHHHHHHhcCCccccC-CCceEEEEE-------Eeee----eeeecccc-cc Q lcl|NC_018086. 62 GKMHNLTLHIWSD-----YDSSFEVKNLTDFLVGLLINSPLQLE-EGFCIGKKE-------LDHV----RYTEAANG-TY 123 (144) Q Consensus 62 g~~~~l~i~VWs~-----~~G~~eak~Ia~~V~~aL~~~~L~L~-~g~~~~~~~-------~~~~----r~~~d~dg-~~ 123 (144) +|.. -+.-|-. ..--.+|.+|++.|..+|. .-.+|+ .||.-.+.. +... -+.|.|.. .. T Consensus 75 ~w~~--l~Qg~V~~aed~~hPtD~Ah~lmADVkkrL~-~~~~~~~~~~np~~~~~~~~~n~i~~~~~gpgv~r~p~e~~s 151 (179) T protein:vir:63 75 DWLL--LVQGWVNHAEGDKNPTDEAYRLMADVQVRLG-ELIAIDSSSGNPQYPSVYMLENLIAGMRAGPGVCRAPAEGAS 151 (179) T ss_pred chhh--hhhhhhccccCCCCCccHHHHHHHHHHHHHh-hhhccccCCCCCCCcchHHHHHHHhhhccCCccccCchhhcc Confidence 4444 3334432 2335789999999999996 445565 233322211 1111 23444432 22 Q ss_pred cCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 124 KNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 124 ~~~~~~~~l~fri~t~~~~~~ 144 (144) .....++++...|+++++||| T Consensus 152 ~~~yf~l~l~l~i~~~~~dp~ 172 (179) T protein:vir:63 152 GRSYFYLPLNLKIANNTTDPY 172 (179) T ss_pred cceeEEEEeEEEEeccCCCcc Confidence 222345899999999999999 No 69 >protein:vir:78349 Length: 127 # NCBI annotation: gp10 # Family: family:all:508 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468649;genbank:gi:157325227;genbank:GeneID:5601695 Probab=91.95 E-value=0.0028 Score=34.60 Aligned_cols=112 Identities=13% Similarity=0.079 Sum_probs=67.5 Q ss_pred hhHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~~G~~ 79 (144) +--+-+.||+.|.++ +.+||= .+|+. ..-|||+|-+... ..+. .......++.++|+|||. ++. T Consensus 1 M~d~l~~iy~~L~~d~~l~~~~~~~I~~~~~Pe~~d~~~p~I~I~~i~~p~p~~ya-dn~~l~~~~~~QIDV~s~--~r~ 77 (127) T protein:vir:78 1 MIDILNVIYTTLSKNDIIHTTCEERIKYYDFPGTGDSTKTFLLIIPLDVPIPTNFS-SNESRMEDFLVQIDVQSN--DRL 77 (127) T ss_pred CcchHHHHHHHhhcchhhhhhcCCceEEEecCCCccccCcEEEEeeCCCCCCCccc-CCccceeEEEEEEEEEEc--CCC Confidence 355667888888876 336764 36764 4679999966532 1232 233456799999999975 478 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEeecccCC Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVIDSTIDP 143 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t~~~~~ 143 (144) ++++|..+|..+|-. -||. +..+. . +..|. .+ -+-.-|||-+-.++-- T Consensus 78 ~~~~i~~~I~~~M~~------~gf~----q~s~~~d~Y~~dt-k~-----y~~arRYrg~~~~~y~ 127 (127) T protein:vir:78 78 IVKKIQDEVRKEMKQ------IGFG----QLAGGLDEYFPET-GR-----FVDARKYSGLPYKLYQ 127 (127) T ss_pred chHHHHHHHHHHHHH------cCce----eccCCCCccchhh-hh-----hhheeeeeeccccccC Confidence 899999999999842 2442 23221 1 22221 22 2235677765433322 No 70 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=91.86 E-value=0.0064 Score=32.61 Aligned_cols=110 Identities=15% Similarity=0.107 Sum_probs=69.5 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhc-CCcccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQ-GINIWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~-~~~VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~ 79 (144) |-|. -|-+.|... +..+|=.+|++.|-+||++ |- + .+.++......++-+++|. +++. T Consensus 1 mIE~--------------~i~~yL~~~l~vpv~~e~p~~~P~~FV~v-Ek--T--GG~~~~~~~~a~lAvQsyg--~S~~ 59 (111) T protein:vir:97 1 MIEV--------------IIKKYLDEHLDVPSFFEHQKDEPARFIIL-EK--T--SGAKQNHLLSSTFAFQSYA--ESLY 59 (111) T ss_pred Chhh--------------hhhHHHhhhcCceEEEeecCCCCCceEEE-Ee--e--CCccccccccceEEEEecc--hhHH Confidence 2222 244556543 5578878888888899999 22 1 2344556677888888886 5789 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeec Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDS 139 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~ 139 (144) +|.++++.|++++. .-.+++ .+.+.++.+.=-+.|++. ++-|- ...|++... T Consensus 60 ~AA~La~~V~~a~~-~l~~l~---~i~~v~lns~Ynf~d~~t--k~yRY--Qa~~di~~~ 111 (111) T protein:vir:97 60 EAALLNDKVKQVIE-QLDVLP---QVSGVHLNADYNFTDTAT--KRYRY--QAVFDINHY 111 (111) T ss_pred HHHHHHHHHHHHhh-hhccCc---cceeeeecccccCCCCCC--CCccE--EEEEEEeeC Confidence 99999999999995 334555 456777766544555442 22222 234454444 No 71 >protein:vir:96972 Length: 131 # NCBI annotation: ORF035 # Family: family:all:508 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239865;genbank:gi:66395543;genbank:GeneID:5133005 Probab=90.86 E-value=0.0038 Score=33.87 Aligned_cols=111 Identities=14% Similarity=0.169 Sum_probs=65.7 Q ss_pred hHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCceeeecCC-CcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELTSGRTI-SKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~~~~d~-t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) -.+-.-||+.|.++ +.+||= .+|+. ...|||+|-+....|.+- .......++.++|+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:96 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 25567788888776 235553 46763 367999996543322211 1234467899999999976 45789 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEee-------cccC Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVID-------STID 142 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t-------~~~~ 142 (144) +|..+|..+|- ..||. +..+. . +..| ..+ -+-..|||-++ +.|+ T Consensus 79 ~i~~~I~~~M~------~~gf~----q~s~~~d~Yd~d-tk~-----y~~arRYrg~~~~~y~~~~~~~ 131 (131) T protein:vir:96 79 DITKRIRYLLY------QQNLI----QASSQLDAYFEE-TKR-----YVMSRRYQGIPKNIYYKNQRIE 131 (131) T ss_pred HHHHHHHHHHH------HcCce----eccCCCCccchh-hHH-----hhhhhhccccchhhhccccccC Confidence 99999999983 23552 33221 2 2222 122 23467887665 2233 No 72 >protein:vir:9364 Length: 131 # NCBI annotation: SLT orf 131b-like protein # Family: family:all:508 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803342;genbank:gi:29028653;genbank:GeneID:1258094 Probab=90.86 E-value=0.0038 Score=33.87 Aligned_cols=111 Identities=14% Similarity=0.169 Sum_probs=65.7 Q ss_pred hHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCceeeecCC-CcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELTSGRTI-SKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~~~~d~-t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) -.+-.-||+.|.++ +.+||= .+|+. ...|||+|-+....|.+- .......++.++|+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:93 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 25567788888776 235553 46763 367999996543322211 1234467899999999976 45789 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEee-------cccC Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVID-------STID 142 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t-------~~~~ 142 (144) +|..+|..+|- ..||. +..+. . +..| ..+ -+-..|||-++ +.|+ T Consensus 79 ~i~~~I~~~M~------~~gf~----q~s~~~d~Yd~d-tk~-----y~~arRYrg~~~~~y~~~~~~~ 131 (131) T protein:vir:93 79 DITKRIRYLLY------QQNLI----QASSQLDAYFEE-TKR-----YVMSRRYQGIPKNIYYKNQRIE 131 (131) T ss_pred HHHHHHHHHHH------HcCce----eccCCCCccchh-hHH-----hhhhhhccccchhhhccccccC Confidence 99999999983 23552 33221 2 2222 122 23467887665 2233 No 73 >protein:vir:78648 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429947;genbank:gi:156604001;genbank:GeneID:5525394 Probab=90.86 E-value=0.0038 Score=33.87 Aligned_cols=111 Identities=14% Similarity=0.169 Sum_probs=65.7 Q ss_pred hHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCceeeecCC-CcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELTSGRTI-SKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~~~~d~-t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) -.+-.-||+.|.++ +.+||= .+|+. ...|||+|-+....|.+- .......++.++|+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:78 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 25567788888776 235553 46763 367999996543322211 1234467899999999976 45789 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEee-------cccC Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVID-------STID 142 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t-------~~~~ 142 (144) +|..+|..+|- ..||. +..+. . +..| ..+ -+-..|||-++ +.|+ T Consensus 79 ~i~~~I~~~M~------~~gf~----q~s~~~d~Yd~d-tk~-----y~~arRYrg~~~~~y~~~~~~~ 131 (131) T protein:vir:78 79 DITKRIRYLLY------QQNLI----QASSQLDAYFEE-TKR-----YVMSRRYQGIPKNIYYKNQRIE 131 (131) T ss_pred HHHHHHHHHHH------HcCce----eccCCCCccchh-hHH-----hhhhhhccccchhhhccccccC Confidence 99999999983 23552 33221 2 2222 122 23467887665 2233 No 74 >protein:vir:2689 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075508;genbank:gi:12719437;genbank:GeneID:920159 Probab=90.86 E-value=0.0038 Score=33.87 Aligned_cols=111 Identities=14% Similarity=0.169 Sum_probs=65.7 Q ss_pred hHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCceeeecCC-CcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELTSGRTI-SKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~~~~d~-t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) -.+-.-||+.|.++ +.+||= .+|+. ...|||+|-+....|.+- .......++.++|+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~--~~~~~ 78 (131) T protein:vir:26 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCccccccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecC--ccchH Confidence 25567788888776 235553 46763 367999996543322211 1234467899999999976 45789 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEee-------cccC Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVID-------STID 142 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t-------~~~~ 142 (144) +|..+|..+|- ..||. +..+. . +..| ..+ -+-..|||-++ +.|+ T Consensus 79 ~i~~~I~~~M~------~~gf~----q~s~~~d~Yd~d-tk~-----y~~arRYrg~~~~~y~~~~~~~ 131 (131) T protein:vir:26 79 DITKRIRYLLY------QQNLI----QASSQLDAYFEE-TKR-----YVMSRRYQGIPKNIYYKNQRIE 131 (131) T ss_pred HHHHHHHHHHH------HcCce----eccCCCCccchh-hHH-----hhhhhhccccchhhhccccccC Confidence 99999999983 23552 33221 2 2222 122 23467887665 2233 No 75 >protein:vir:94418 Length: 131 # NCBI annotation: ORF029 # Family: family:all:508 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240011;genbank:gi:66395684;genbank:GeneID:5133078 Probab=90.18 E-value=0.0036 Score=33.98 Aligned_cols=111 Identities=14% Similarity=0.166 Sum_probs=65.7 Q ss_pred hHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCceeeecCC-CcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELTSGRTI-SKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~~~~d~-t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) -.+-.-||+.|.++ +.+||= .+|+. ...|||+|-+-...|.+- .......+..++|+|||.. +.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDV~s~~--~~~~~ 78 (131) T protein:vir:94 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAETSKPFVVITPIYDLPSDFMSDKYLSEEYLIQIDVESSN--NQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEEecCCccccccceEEEeeCCCCcccccCCceeeeEEEEEEEEEecC--ccchH Confidence 25567788888776 235553 47763 467999996544332211 1234567899999999876 67788 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEeeccc-------C Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVIDSTI-------D 142 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t~~~-------~ 142 (144) +|..+|..+|-+ .||. +..+. . +..| ..+ -+-+.|||-++.++ + T Consensus 79 ~i~~~I~~~M~~------~gf~----q~s~~~d~Yd~d-tk~-----y~~arRYrg~~~~~y~~~~~~~ 131 (131) T protein:vir:94 79 DITKRIRYLLYQ------QNLI----QASSQLDAYFEE-TKR-----YVMSRRYQGIPKNIYYKNQRIE 131 (131) T ss_pred HHHHHHHHHHHH------cCce----eccCCCCccchh-HHH-----hhhhhhhccchhhhhccccccC Confidence 999999998832 3542 23221 2 2222 112 23367777654332 2 No 76 >protein:vir:93902 Length: 131 # NCBI annotation: ORF029 # Family: family:all:508 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239943;genbank:gi:66395617;genbank:GeneID:5130968 Probab=89.85 E-value=0.0041 Score=33.65 Aligned_cols=111 Identities=14% Similarity=0.179 Sum_probs=65.8 Q ss_pred hHHHHHHHHHHhhc-------CCcccc-cCcCC--CCCCEEEeCCceeeecCC-CcccCccEEEEEEEEEeCCcchHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ-------GINIWD-GVNKK--PEYPFIKIGEELTSGRTI-SKDAIGKMHNLTLHIWSDYDSSFEVK 82 (144) Q Consensus 14 ~aLQ~AI~~~L~a~-------~~~VyD-~vP~~--a~~PYV~lG~~~~~~~d~-t~~~~g~~~~l~i~VWs~~~G~~eak 82 (144) -.+-.-||+.|.++ +.+||= .+|+. ...|||+|-+-...|.+- .......+..++|+|||. .+.+++ T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDV~s~--~~~~~~ 78 (131) T protein:vir:93 1 MNILNTIKEILLSDAELQTYINSRIYYYKVTENAETSKPFVVITPIYDLPSDFMSDKYLSEEYLIQIDVESS--NNQKTI 78 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEEecCCccccccceEEEeeCCCCcccccCCceeeeEEEEEEEEEec--CccchH Confidence 25567788888776 236553 47763 467999996544332211 123456789999999995 477888 Q ss_pred HHHHHHHHHhcCCccccCCCceEEEEEEeee-e-eeecccccccCccEEEEEEEEEeeccc-------C Q lcl|NC_018086. 83 NLTDFLVGLLINSPLQLEEGFCIGKKELDHV-R-YTEAANGTYKNERAYLFLDFEVIDSTI-------D 142 (144) Q Consensus 83 ~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~-r-~~~d~dg~~~~~~~~~~l~fri~t~~~-------~ 142 (144) +|..+|..+|-+ .||. +..+. . +..| ..+ -+-+.|||-++.++ + T Consensus 79 ~i~~~I~~~M~~------~gf~----q~s~~~d~Yd~d-tk~-----y~~arRYrg~~~~~y~~~~~~~ 131 (131) T protein:vir:93 79 DITKRIRYLLYQ------QNLI----QASSQLDAYFEE-TKR-----YVMSRRYQGIPKNIYYKNQRIE 131 (131) T ss_pred HHHHHHHHHHHH------cCce----eccCCCCccchh-HHH-----hhhhhhhccchhhhhccccccC Confidence 999999998842 2542 33321 2 2222 122 23357777654332 3 No 77 >protein:vir:107857 Length: 154 # NCBI annotation: gp37 # Family: family:all:1532 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024710;genbank:gi:48696947;genbank:GeneID:2845945 Probab=89.56 E-value=0.02 Score=29.85 Aligned_cols=125 Identities=10% Similarity=0.109 Sum_probs=70.7 Q ss_pred CChhhHHHHHHHHHHhhcCCccc-ccCcCCC-----CCC----EEEeCCce-eeecCCCcccCccEEEEEEEEEeCC-cc Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIW-DGVNKKP-----EYP----FIKIGEEL-TSGRTISKDAIGKMHNLTLHIWSDY-DS 77 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~Vy-D~vP~~a-----~~P----YV~lG~~~-~~~~d~t~~~~g~~~~l~i~VWs~~-~G 77 (144) -|+..+...||.+||++.-..+. ..-|+++ ..| .|.++-+. ..+.+......-..+.+.+.|..++ .| T Consensus 1 m~~t~~ii~aiv~rL~~~lP~~~ve~fP~~p~ey~l~h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi~r~l~g 80 (154) T protein:vir:10 1 MATTLEMVDAIVARLRVKLPALVTEYFPERPDEYRLNHAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIVLRQLNG 80 (154) T ss_pred CchhHHHHHHHHHHHHHhCCcceEeeCCCChhHcCCCCCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEEeeccCC Confidence 27788999999999998643222 2233321 122 45554332 2333333445567788888888765 78 Q ss_pred hHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEE---eecccCCC Q lcl|NC_018086. 78 SFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEV---IDSTIDPY 144 (144) Q Consensus 78 ~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri---~t~~~~~~ 144 (144) +..+.++.++|+.+|.+-.+ ++. ..+++++-+|....+|. ++- .+.|-. .-+.-||- T Consensus 81 ~~gal~~LD~vR~aL~Gf~p--pdc---~~~~lv~d~f~ge~~G~---W~Y--~l~~at~t~~Ve~~~~~ 140 (154) T protein:vir:10 81 RGGAIDVLDHVRTALVGFRP--PDC---KKLAAVSDKFLGESAGL---WQY--VIEFSAGAVIVEDAEPN 140 (154) T ss_pred cchhhHHHHHHHHHHhcccc--CCC---ceeehhhhcccccccce---eee--eeeeccchhhhhccCCC Confidence 99999999999999965443 322 24566665666555543 111 222221 11111221 No 78 >protein:vir:79065 Length: 154 # NCBI annotation: gp11 # Family: family:all:1532 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111211;genbank:gi:134288825;genbank:GeneID:4960739 Probab=89.33 E-value=0.023 Score=29.58 Aligned_cols=125 Identities=9% Similarity=0.104 Sum_probs=70.8 Q ss_pred CChhhHHHHHHHHHHhhcCCccc-ccCcCCC-----CCC----EEEeCCce-eeecCCCcccCccEEEEEEEEEeCC-cc Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGINIW-DGVNKKP-----EYP----FIKIGEEL-TSGRTISKDAIGKMHNLTLHIWSDY-DS 77 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~~Vy-D~vP~~a-----~~P----YV~lG~~~-~~~~d~t~~~~g~~~~l~i~VWs~~-~G 77 (144) -|+..+...+|.+||++.-..+. ..-|+++ ..| .|.++-+. ..+.+......-..+.+.+.|..++ .| T Consensus 1 m~~t~~ii~~iv~rL~~~lP~~~ve~fP~~p~ey~l~h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi~r~l~g 80 (154) T protein:vir:79 1 MATTLEMVDSVVARLRVKLPALVTEYFPERPDEYRLNHAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIVLRQLNG 80 (154) T ss_pred CchhHHHHHHHHHHHHHhCCcceEeeCCCChhHcCCCCCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEEeeccCC Confidence 27788999999999998643222 2233321 122 45554332 2233333345567788888888765 78 Q ss_pred hHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEE---eecccCCC Q lcl|NC_018086. 78 SFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEV---IDSTIDPY 144 (144) Q Consensus 78 ~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri---~t~~~~~~ 144 (144) +..+.++.++|+.+|.+-.+ ++. ..+++++-+|....+|. ++- .+.|-. .-+.-||- T Consensus 81 ~~gal~~LD~vR~aL~Gf~p--pdc---~~~~lv~d~f~ge~~G~---W~Y--~l~~at~t~~Ve~~e~~ 140 (154) T protein:vir:79 81 RGGAIDVLDHVRTALVGFRP--PDC---KKLAAVSDKFLGESAGL---WQY--VIEFSAGAVIVEDAEPN 140 (154) T ss_pred cchhhHHHHHHHHHHhcccc--CCC---ceeehhhhcccccccce---eee--eeeeccchhhhccCCCC Confidence 99999999999999965443 322 24566665666555543 111 222221 11122222 No 79 >protein:vir:78124 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:29862 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294806;genbank:gi:149882827;genbank:GeneID:5309152 Probab=87.75 E-value=0.037 Score=28.45 Aligned_cols=133 Identities=19% Similarity=0.199 Sum_probs=73.6 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcC--CcccccCcCC--CCC--CEEEeCCceeeecCCCcccCccEEEEEEEEEeC Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQG--INIWDGVNKK--PEY--PFIKIGEELTSGRTISKDAIGKMHNLTLHIWSD 74 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~--~~VyD~vP~~--a~~--PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~ 74 (144) |-.-||= .+.=|-.-+++.+.+.+ ..|=...|.+ .+| |.|+.-+..-...+ -+-.-..+-+++--|++ T Consensus 1 ~~v~PPD----lE~fl~~~LRa~i~~adVDgqvGnk~Pd~y~g~y~~PLvvVRDDgG~~~d--~~tFDRSiGvnVlgwtr 74 (139) T protein:vir:78 1 MRVAPPD----LEEWFTALLRAEVRAAGVDAEVGNKEPDNLRVPLRRPLIVVRDDSGDRRD--WTTFDRSVGFTVLAGTK 74 (139) T ss_pred CccCCcc----HHHHHHHHHHhhccccCccccccCcCCCCccccccCCeEEEEcCCCCccc--ceeeecccceeeeeccc Confidence 7766654 23333444555564421 1344455654 356 88888332111100 11122345577788988 Q ss_pred CcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) +. -+-++.+|..|..+|++.++.|.+|-.++......++=--.-...-+-.|.|+++.|...-+- T Consensus 75 qd-~KPc~dLArrVy~~lt~hp~~LiegSpi~aVv~dgCnGPYpVsdd~d~aryYltveYst~G~~ 139 (139) T protein:vir:78 75 QN-DKPANDLARVVASIVHDHELPLIEGSPIAAVVFDGCRGPYAVPDTIDVARRYLTGQYVASGSW 139 (139) T ss_pred cC-chhhHHHHHHHHHHhccCcceeecCCceEEeecccCCCCCCCCcchhheeeeeEEEEeeeccC Confidence 65 457999999999999999999988855544333333311000000112356788888766555 No 80 >protein:vir:106554 Length: 122 # NCBI annotation: putative protein # Family: family:all:6476 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958589;genbank:gi:41179248;genbank:GeneID:2717090 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=113 Identities=15% Similarity=0.239 Sum_probs=67.9 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCC--cccccCcCC-CCCCEEEeCCceeee-cCCCcccCccEEEEEEEEEeCCc Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGI--NIWDGVNKK-PEYPFIKIGEELTSG-RTISKDAIGKMHNLTLHIWSDYD 76 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~--~VyD~vP~~-a~~PYV~lG~~~~~~-~d~t~~~~g~~~~l~i~VWs~~~ 76 (144) |.+= -+-..||+.|+.-.. .|-|.-|.+ +.||-+..-+++-.. .++.+.+.-.+.+++|++|+++. T Consensus 1 m~~I----------NiK~~vy~~L~~v~e~k~Vs~~YP~~w~~fP~~iY~t~~~~~~~~~~~~E~~t~w~itIDi~~~~~ 70 (122) T protein:vir:10 1 MEIY----------NVKALVFKTLKSMPELKLVSPSYPDKFTTFPAAIYSTSQSSYIRNAQQEETDTEWKITIDLYNDHG 70 (122) T ss_pred Ccee----------eccHHHHHHHhhcccccccCCCCCCCcccCcEEEEecCCCceeeecCcceeeEEEEEEEEEEcCCc Confidence 5432 345578999987544 777877776 789998886554332 22333445578899999999864 Q ss_pred chHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe--ecccCCC Q lcl|NC_018086. 77 SSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI--DSTIDPY 144 (144) Q Consensus 77 G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~--t~~~~~~ 144 (144) + ..+|+.+|.+++.. | || .+..--.|+. |..|..+||.-+ +++-.=| T Consensus 71 S---tt~ia~~i~~~f~~----l--Gf-------t~~~~~~d~s-----glkr~vmr~~gIVDn~t~~VY 119 (122) T protein:vir:10 71 S---LTNIKAKLIARFSA----M--GF-------SNSVGDQDLN-----GVSRVVIVFAGIVDNTSHRVY 119 (122) T ss_pred c---HHHHHHHHHHHHhh----c--cc-------cccCCCCCcC-----CCeEEEEEEEEEEEcccceee Confidence 4 34555666555521 1 44 1112223333 346788999966 3333444 No 81 >protein:vir:102955 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945290;genbank:gi:39653725;uniprot:Q708M2;genbank:GeneID:2672869 Probab=87.48 E-value=0.038 Score=28.34 Aligned_cols=120 Identities=14% Similarity=0.156 Sum_probs=74.7 Q ss_pred CCCCChhhHHHHHHHHHHhhc--CCcccc-cCcCCCCCCE--EEeCCceeeecCCCcccCccEEEEEEEEEeCCcchHHH Q lcl|NC_018086. 7 FRARSSSVALQRAIVKEIRAQ--GINIWD-GVNKKPEYPF--IKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSFEV 81 (144) Q Consensus 7 ~~~~S~~~aLQ~AI~~~L~a~--~~~VyD-~vP~~a~~PY--V~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~ea 81 (144) ||+|- .++..||-++|+.. ...||+ .++.+-..|+ |.+=+....+. .....-..+.+.|+=++...+..++ T Consensus 1 ~~~~~--~~I~~aI~~~Lk~~fpd~~Iy~e~i~Qgf~~PcFFI~ll~~~~~~~--~~~r~~r~~~~dI~Yfp~~~~~~e~ 76 (138) T protein:vir:10 1 MANKG--FRLVEELVSHIKGLYPDIRIYLDEVEQGFKEPCFFIHVVDTKYTPE--ANKYVKVRSKVDLSYFPPKKKRSEC 76 (138) T ss_pred CCcch--hhhHHHHHHHHHHhcCCceeeecccccCCcCCeEEEEEecccCccc--cCceEEEEEEEEEEEecCcchhHHH Confidence 44443 58999999999986 578995 6899999995 66644444342 3344567888899977877788999 Q ss_pred HHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 82 KNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 82 k~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) .++++.+..+|..-++ + +..+.++ +-.||+- |-.++++|.+.-..-++. T Consensus 77 ~~v~e~L~~~f~~~~~-----i-----~~~~~~~-~I~DgVL---hf~f~~~~~~~k~~~~~~ 125 (138) T protein:vir:10 77 LAMQEELSYKLLHLPT-----I-----HLFDRQY-EVVDNVL---HCIFNASTRLKLEEEDIK 125 (138) T ss_pred HHHHHHHHHHHhhcCe-----e-----eeeccee-eEEcCeE---EEEEEEEEEEeeecCccc Confidence 9999999888843321 1 2222222 1123431 233455555543222222 No 82 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=87.34 E-value=0.033 Score=28.69 Aligned_cols=127 Identities=14% Similarity=0.234 Sum_probs=82.7 Q ss_pred CCCCCCCCChhhHHHHHHHHHHhhc---CCcccccCcC---CCCCCE--EEeCCceeeecCCCcccCccEEEEEEEEEeC Q lcl|NC_018086. 3 KRPPFRARSSSVALQRAIVKEIRAQ---GINIWDGVNK---KPEYPF--IKIGEELTSGRTISKDAIGKMHNLTLHIWSD 74 (144) Q Consensus 3 ~~~~~~~~S~~~aLQ~AI~~~L~a~---~~~VyD~vP~---~a~~PY--V~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~ 74 (144) --+|| +--.+++++|.++|++. ...+||..|. ....|- |-|-+.+.++. +-|..-|+-.|.|.|+=+ T Consensus 1 ~~~~M---~iht~IR~~Vid~L~~~l~~~~~ffdGrP~fiDe~ElPAVAV~l~da~~~~~--~ld~~~W~A~LhI~iyLk 75 (137) T protein:vir:79 1 MADPM---NRHTQIRQVVLARLREQCGDSATFFDGLPAFVDAQELPAVSVWLSDAQYTGK--MTDEDDWQAVLHIAVFIR 75 (137) T ss_pred CCchh---HHHHHHHHHHHHHHHhhcCCcEEEeCCccceechhhCcEEEEEeecCCCCcc--eecCCeeEEEEEEEEEee Confidence 23567 55578999999999875 3357898883 346774 45566666554 346667999999999966 Q ss_pred Cc-chHHHHHHHHH-HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 75 YD-SSFEVKNLTDF-LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 75 ~~-G~~eak~Ia~~-V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) .. +-.+.-++++. |..++.+.+. |. |+ +..+.+.+..+-+|..-.+ + +...|.|+|-=++ T Consensus 76 a~~~ds~LD~~~E~~I~~v~~~~~~-l~-~l-~~~~~~~gY~Y~rD~e~~t--W-~sadL~y~ItYe~ 137 (137) T protein:vir:79 76 AQAPDSELDMWMESTIFPALNDVPA-LS-GL-IDTLIPLGFNYQRDNEMAT--W-AMAEITYQITYTN 137 (137) T ss_pred cCCCHHHHHHHHHHHHHHhhcchhh-hh-hH-hhhhhcccCCcccccccce--e-EEEEEEEEEEEcC Confidence 53 44555668885 7777765532 22 21 1234556677888766543 1 3457889988666 No 83 >protein:vir:9648 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795410;genbank:gi:28876183;genbank:GeneID:1257699 Probab=87.25 E-value=0.0091 Score=31.79 Aligned_cols=114 Identities=18% Similarity=0.140 Sum_probs=71.2 Q ss_pred hhhHHHHHHHHHHhhcC----Cccc-ccCcC--CCCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCCcchHHH Q lcl|NC_018086. 12 SSVALQRAIVKEIRAQG----INIW-DGVNK--KPEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDYDSSFEV 81 (144) Q Consensus 12 ~~~aLQ~AI~~~L~a~~----~~Vy-D~vP~--~a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~~G~~ea 81 (144) .+..+-.-||..|.++. .+|+ =..|+ +..-|||+|-+... ..+. .......+...+|+|||. +|.++ T Consensus 1 mm~DiL~~Iy~~L~~d~~l~~~rIk~~~~Pe~~d~~~p~IvI~pl~~P~p~~~~-sd~~ls~~ylyQIDVes~--~r~~~ 77 (126) T protein:vir:96 1 MVRDMLAEVFDLLKADNVLKLVKIKSFERPESLLDDQTSIVILPITAPKQSTFG-SDTALSKKFLYQIEVEST--SRLEC 77 (126) T ss_pred ChhHHHHHHHHHHhccceecceeeeeeecCCCCCCCcceEEEeeCCCCCCcccc-CchhhhhhceeeEeeeec--Cccch Confidence 35677788999998872 3443 23443 45789999965532 2332 234456789999999774 78999 Q ss_pred HHHHHHHHHHhcCCccccCCCceEEEEEEee--eeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 82 KNLTDFLVGLLINSPLQLEEGFCIGKKELDH--VRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 82 k~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~--~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ++|+.+|+..|- .+ ||. ++.+ =+++.+..+ -+-+-|||-++.=.|-| T Consensus 78 ~~i~~rI~~~l~----~i--gf~----q~s~gldeY~~etkr------y~daRRYrg~~k~yeey 126 (126) T protein:vir:96 78 KDLQCRIEKQLE----KI--GFY----QNDAGFERFDRDTGR------YLDARTFRGFSNIYEDY 126 (126) T ss_pred HHHHHHHHHHHH----Hc--Ccc----ccccCcchhhhhhhh------hhhhheecccchhhhcC Confidence 999999999993 22 331 1111 123333221 12256788777667777 No 84 >protein:vir:79047 Length: 145 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110730;genbank:gi:134287347;genbank:GeneID:4955221 Probab=85.99 E-value=0.049 Score=27.78 Aligned_cols=119 Identities=10% Similarity=0.145 Sum_probs=72.6 Q ss_pred hhhHHHHHHHHHHhhc---CCcccc-cCcCCCCCCE--EEeCCceeeecCCCcccCccEEEEEEEEEeCC-cchHHHHHH Q lcl|NC_018086. 12 SSVALQRAIVKEIRAQ---GINIWD-GVNKKPEYPF--IKIGEELTSGRTISKDAIGKMHNLTLHIWSDY-DSSFEVKNL 84 (144) Q Consensus 12 ~~~aLQ~AI~~~L~a~---~~~VyD-~vP~~a~~PY--V~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~-~G~~eak~I 84 (144) ...++..||-++|++. ...||+ .++.+-..|+ |.+=+....+ ....+.-..+.+.|+=+.+. ....++.++ T Consensus 1 mi~dI~~aI~~~Lk~~Fp~~~~IY~e~i~Qgf~~PcFFI~ll~~~~~~--~~~~r~~r~~~~dI~Yfp~~~~~~~e~~ev 78 (145) T protein:vir:79 1 MLNNIIDGISVKLDKSFGEKYTIYSEDVEQGINEPCFFIVPLNPSKTP--YPSGRELKKNSFDVHYFPRSEAKNFEINEI 78 (145) T ss_pred ChHHHHHHHHHHHHHhcCCceEEEecccccCccCCeeEEEEecccccc--ccCceEEEEEEEEEEEeecCCCCchhHHHH Confidence 3789999999999975 348996 6889988895 5554433323 23344456778888888654 456899999 Q ss_pred HHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecccCCC Q lcl|NC_018086. 85 TDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDSTIDPY 144 (144) Q Consensus 85 a~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~~~~~ 144 (144) ++.+...|.. +.+.++ . .+..+.+..- -||+ .|-.++++|.+. -.+++ T Consensus 79 ~e~L~~~le~--i~v~~~-~---~~~~~~~~ei-vDgv---Lhf~~~~~~~~~--k~~~~ 126 (145) T protein:vir:79 79 AEMLLEELEY--IEINGD-L---VRGTNMNFEI-IDNV---LHFFVDYNYFTI--KSNNA 126 (145) T ss_pred HHHHHhhhcc--eeecCc-E---EeeecceeEE-eece---EEEEEEEEEEEe--eecCc Confidence 9999999943 555332 1 2222222221 1443 223345555543 33445 No 85 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=85.33 E-value=0.054 Score=27.55 Aligned_cols=115 Identities=13% Similarity=0.137 Sum_probs=69.1 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCC--c-ccccCcCCCCCCEEEe----CCceeeecCCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGI--N-IWDGVNKKPEYPFIKI----GEELTSGRTISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~--~-VyD~vP~~a~~PYV~l----G~~~~~~~d~t~~~~g~~~~l~i~VWs 73 (144) ||. .+...+|.+|++|.-. + +|++.|.+..-+++-| |..+.... +..| ....=.++|.|++ T Consensus 1 Mt~----------~q~r~~I~~r~~a~~~~~~I~~~N~pp~~~~~W~Rlti~~g~~~~a~i-G~~~-~~rtGli~iqiF~ 68 (125) T protein:vir:94 1 MSY----------FQEKLDIENYFKANWPDTPIFYENRTANSTGTWVRLTIQNGDAFQASN-GEVS-YRHPGVVFVQIFT 68 (125) T ss_pred CCH----------HHHHHHHHHHHHhCCCccceeeCCCCCCCCCceEEEEeccCccccccc-CCce-eeeeeEEEEEeee Confidence 443 5788999999997532 2 6777776666666544 33322111 1111 2244568999998 Q ss_pred CC-cchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCcc--EEEEEEEEEee Q lcl|NC_018086. 74 DY-DSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNER--AYLFLDFEVID 138 (144) Q Consensus 74 ~~-~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~--~~~~l~fri~t 138 (144) .. .|.....++|++++++...+. . | +....+.+..+.+ . +++- ..+.+.||.=. T Consensus 69 p~~~G~~~~~~~ad~~~~~f~~~~--~--g----~i~f~~~~~~~~g--~-~~gwyQ~Nv~I~f~~~~ 125 (125) T protein:vir:94 69 KKEVGSGEALKLADKVDALFRSKT--L--G----NIQFKVPQVQKVP--S-TTEWYQVNVSTEFYRGS 125 (125) T ss_pred cCCcChHHHHHHHHHHHHHHccCC--C--C----ceEEeeceecCCC--C-CCCEEEEEEEEeeecCC Confidence 65 689999999999999985552 2 2 3334334444432 2 2221 34677888655 No 86 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=81.23 E-value=0.088 Score=26.38 Aligned_cols=119 Identities=8% Similarity=0.014 Sum_probs=62.0 Q ss_pred hhhHHHHHHHHHHhhcCC--cc-cccCc---CCCCCCEEEe----CCceeeecCCCcccCccEEEEEEEEEeC-CcchHH Q lcl|NC_018086. 12 SSVALQRAIVKEIRAQGI--NI-WDGVN---KKPEYPFIKI----GEELTSGRTISKDAIGKMHNLTLHIWSD-YDSSFE 80 (144) Q Consensus 12 ~~~aLQ~AI~~~L~a~~~--~V-yD~vP---~~a~~PYV~l----G~~~~~~~d~t~~~~g~~~~l~i~VWs~-~~G~~e 80 (144) .-.|+..||.++|.+... +| |.++. ...--+|+-+ ++.+.... ....+...=.++|+|... ..|..+ T Consensus 1 ~hyE~~~a~r~~la~~~~~lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L--~~d~r~y~Gv~QI~Vv~paG~G~~~ 78 (132) T protein:vir:10 1 MHYELSAAARAAFLSKYRDFPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSI--DRKCKSYIAIVQIGVVFPPGSGVDE 78 (132) T ss_pred CchHHHHHHHHHHHhhhcCCcEeecCCCcCCCCCCceEEEEEEccCCceeeec--cCcCcEEEEEEEEEEEecCCCCcch Confidence 246888899988876432 11 22221 1111245433 33333232 222334444578887664 579999 Q ss_pred HHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEE--EEEEEEEeeccc Q lcl|NC_018086. 81 VKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAY--LFLDFEVIDSTI 141 (144) Q Consensus 81 ak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~--~~l~fri~t~~~ 141 (144) +++||+.|.+++.+. +.|..||-... -. .+ -.+++...-. +...+|+=|.++ T Consensus 79 a~~iAd~i~~~F~~g-~~l~~Gyi~~~-~~---~~----p~i~~~s~~~iPvrf~yR~Dt~~~ 132 (132) T protein:vir:10 79 ARLKAKEIADFFKDG-KMLNVGYIFEG-AI---VH----QIVKHESGWMIPVRFTVRVDTKET 132 (132) T ss_pred hHHHHHHHHHhccCc-ceeecceecCC-Cc---cC----CceeCCcceEEEEEEEEEecccCC Confidence 999999999988544 44566632211 11 11 1233222112 345556666666 No 87 >protein:vir:9709 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:2110 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795471;genbank:gi:28876220;genbank:GeneID:1257764 Probab=80.38 E-value=0.05 Score=27.73 Aligned_cols=113 Identities=13% Similarity=0.223 Sum_probs=62.8 Q ss_pred hhHHHHHHHHHHhhc-------------------CCcccc-cCcCCC-------CCCEEEeCCceeeecCCC-cccCccE Q lcl|NC_018086. 13 SVALQRAIVKEIRAQ-------------------GINIWD-GVNKKP-------EYPFIKIGEELTSGRTIS-KDAIGKM 64 (144) Q Consensus 13 ~~aLQ~AI~~~L~a~-------------------~~~VyD-~vP~~a-------~~PYV~lG~~~~~~~d~t-~~~~g~~ 64 (144) |++..+ ||+.|.++ ...||- .+|+.+ ..|+|.|-+..-.+.+.+ ......+ T Consensus 1 mlp~~~-vy~~L~~n~~L~~lm~~~r~~~~~~~~~~~If~~~vPE~~~~~qk~~~aP~IrI~~i~~~~~~yADn~~~~~~ 79 (141) T protein:vir:97 1 MIAETT-AYKLLSNDKTLNELLDKLRGGPFKNGFKQGIFTYDIPDNPIDLRKAELAPFMRIKTTLDGPADYADDEILCNE 79 (141) T ss_pred CchHHH-HHHHhcccHHHHHHHhhhccccccccccccccccccCCChhhhhhhccCCeEEEeccCCCcccccccccceee Confidence 244433 45555443 124774 688863 589999976644433333 2345789 Q ss_pred EEEEEEEEeCCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 65 HNLTLHIWSDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 65 ~~l~i~VWs~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) ..++|++|++.. ++..+|...|.++|++ .||.- .......-..|||- + .-+...|||--+.- T Consensus 80 ~~vQIdiW~~~~--~~~e~i~~~Id~~M~~------~gf~r--Y~~~~~~~~~dpD~--d--~~~~~rRYr~~~~~ 141 (141) T protein:vir:97 80 QRITINFWCKTA--SEADQINKCIDNILKQ------GGFER--YTANEKPRYKDSDI--D--LLMNVRKYRCFDFY 141 (141) T ss_pred eeeEeeeeecCh--hHHHHHHHHHHHHHHh------cCcee--ccccCCCCCCccch--h--hhhhhhheeeeccC Confidence 999999999954 5788899999888853 35531 11111123345542 1 12334455433222 No 88 >protein:vir:101509 Length: 139 # NCBI annotation: gp22 # Family: family:all:6926 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655401;genbank:gi:109522589;genbank:GeneID:4157581 Probab=80.25 E-value=0.042 Score=28.10 Aligned_cols=121 Identities=14% Similarity=0.155 Sum_probs=65.1 Q ss_pred CChhhHHHHHHHHHHhhcCC--------cccc-----cCcCC-CCCCEEEeCCceeeecCCCcccCccEEEEEEEE-EeC Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGI--------NIWD-----GVNKK-PEYPFIKIGEELTSGRTISKDAIGKMHNLTLHI-WSD 74 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~--------~VyD-----~vP~~-a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~V-Ws~ 74 (144) -+ ..|++.+|++|.. +||- ..|.- ++-|||+| ..+.++.+.+-...-+...+-+|. |.. T Consensus 1 M~-----~s~l~d~lr~D~~L~~~lvps~i~~~~~~d~rPnh~d~G~FiV~-~W~~~~i~~~I~rgPr~~~iwvH~P~~~ 74 (139) T protein:vir:10 1 MS-----RAAVLDALRADVALGQMLVPSNILTNYSKEGPPNHLAPGPFAVI-RWGGKTIDPAVNRGPRDVNIWVHIPQRQ 74 (139) T ss_pred Cc-----HHHHHHHHhcccccCeeeccchhhhcccccCCCCCCCCCceEEE-eccccccccccCCCCceEEEEEecchhc Confidence 02 2478899998842 4552 33332 46789998 333333333332222333333332 345 Q ss_pred CcchHHHHHHHHHHHHHhcCCc-cccCCCceEEEEEEeeeeeeecccccccCccEEE--EEEEEEeeccc Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSP-LQLEEGFCIGKKELDHVRYTEAANGTYKNERAYL--FLDFEVIDSTI 141 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~-L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~--~l~fri~t~~~ 141 (144) ...+....+|.++|.+....-+ .+=++ +.++++.++.--.+...+.|--.+ -.+|.+|...+ T Consensus 75 std~~~id~il~Ri~eI~~svE~~~G~D-----G~~v~~vr~~g~s~nl~D~G~kTi~R~AT~~vLs~~~ 139 (139) T protein:vir:10 75 STDYTRIDQILKRTKEIMLSLEDVAGAD-----GAHLVSTRFLAESDDLVDPGFETITRYATFSVLSRST 139 (139) T ss_pred cCCcchHHHHHHHHHHHHHHhhhhccCC-----ceEEEEEeeeccCCCccccchhhhhhhhhhhheecCC Confidence 5678888888888877653211 11123 456666676654444444333222 26788887766 No 89 >protein:vir:101606 Length: 142 # NCBI annotation: hypothetical protein # Family: family:all:26512 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112511;genbank:gi:53793611;uniprot:Q5ZGE2;genbank:GeneID:3101714 Probab=79.22 E-value=0.037 Score=28.44 Aligned_cols=129 Identities=16% Similarity=0.217 Sum_probs=76.0 Q ss_pred CCCCChhhHHHHHHHHHHhhcC-----Ccccc-cCcCCC-CCCEEEeCCceeeecCCCcccCccEEEEEEEEEeC----- Q lcl|NC_018086. 7 FRARSSSVALQRAIVKEIRAQG-----INIWD-GVNKKP-EYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSD----- 74 (144) Q Consensus 7 ~~~~S~~~aLQ~AI~~~L~a~~-----~~VyD-~vP~~a-~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~----- 74 (144) |--..|..-+++|+|..+..-. ..-|| ++..++ .--||.+-...-.-+-.+||...|+..+-|.++.. T Consensus 1 migtnpdkyirkavfdlinnivvntktikcydtrvtgnaavneyvlltnqtkeidkatkcvynwetsllieiytktssng 80 (142) T protein:vir:10 1 MIGTNPDKYIRKAVFDLINNIVVNTKTIKCYDTRVTGNAAVNEYVLLTNQTKEIDKATKCVYNWETSLLIEIYTKTSSNG 80 (142) T ss_pred CCCCchhHHHHHHHHHHhhhheeccceeEEeeeeeccccccceeEEeeccchhhhhhhheeeeccceeEEEEeeeccCCC Confidence 5556677889999999886542 25787 455544 56788886544333346899999999999999963 Q ss_pred -CcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 75 -YDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 75 -~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) .++|.-+.+|-.+|..++ +..|++ ++|--....+..-..++....-+-.-|..+.+....+ T Consensus 81 nsgsrllvndieqaiytli-nptlti-enfinqtqnvtfetqletittteiifrsfirlnltli 142 (142) T protein:vir:10 81 NSGSRLLVNDIEQAIYTLI-NPTLTI-ENFINQTQNVTFETQLETITTTEIIFRSFIRLNLTLI 142 (142) T ss_pred CccceehhhhHHHHHHHHh-Ccceeh-hhhhchhhcceeeeeeeehhhHHHHHhhhhheeeeeC Confidence 345677899999998866 566776 3442222222211111111111111112233333333 No 90 >protein:vir:102191 Length: 139 # NCBI annotation: gp22 # Family: family:all:6926 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655218;genbank:gi:109522798;genbank:GeneID:4157430 Probab=78.92 E-value=0.05 Score=27.72 Aligned_cols=121 Identities=14% Similarity=0.154 Sum_probs=65.0 Q ss_pred CChhhHHHHHHHHHHhhcCC--------cccc-----cCcCC-CCCCEEEeCCceeeecCCCcccCccEEEEEEEE-EeC Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQGI--------NIWD-----GVNKK-PEYPFIKIGEELTSGRTISKDAIGKMHNLTLHI-WSD 74 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~~--------~VyD-----~vP~~-a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~V-Ws~ 74 (144) -+ ..|++.+|++|.. +||- ..|.. ++-|||+| ..+.++.+.+-...-+...+-+|. |.. T Consensus 1 M~-----~s~l~d~lr~D~~L~~~lvps~i~~~~~~d~rP~h~~~G~FiV~-~W~~~~i~~~I~rgPr~~~iwvH~P~~~ 74 (139) T protein:vir:10 1 MS-----RAAVLDALRADVALGQMLVPSNILTNYSKEGPPNHLAPGPFAVI-RWGGKTIDPAVNRGPRDVNIWVHIPQRQ 74 (139) T ss_pred Cc-----HHHHHHHHhcccccCeeecchhhhhcccccCCCCCCCCCceEEE-eccCcccccccCCCCceEEEEEecchhc Confidence 02 2478899998842 4552 33332 46789998 333333333332222333333332 345 Q ss_pred CcchHHHHHHHHHHHHHhcCCc-cccCCCceEEEEEEeeeeeeecccccccCccEEE--EEEEEEeeccc Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSP-LQLEEGFCIGKKELDHVRYTEAANGTYKNERAYL--FLDFEVIDSTI 141 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~-L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~--~l~fri~t~~~ 141 (144) ...+....+|.++|.+....-+ .+=++ +.++++.++.--.+...+.|--.+ -.+|.+|...+ T Consensus 75 std~~~idril~Ri~eI~~svE~~~G~D-----G~~v~~vr~~g~s~nl~D~G~kTi~R~AT~~vLs~~~ 139 (139) T protein:vir:10 75 STDYTRIDRILKRTKEIMLSLEDVAGAD-----GAHLVSTRFLAESDDLVDPGFETITRYATFSVLSRST 139 (139) T ss_pred cCCcchHHHHHHHHHHHHHHhhhhccCC-----ceeEEEeeeeccCCCccccchhhhhhhhhhhheecCC Confidence 5677888888888877653211 11123 456666666654444443333222 26788887766 No 91 >protein:vir:78057 Length: 154 # NCBI annotation: gp10 # Family: family:all:29813 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468794;genbank:gi:157325375;genbank:GeneID:5601819 Probab=77.72 E-value=0.12 Score=25.60 Aligned_cols=133 Identities=18% Similarity=0.218 Sum_probs=81.0 Q ss_pred CCC---------C-----CCCCCCChhhHHHHHHHHHHhhc---CCccccc-CcCC---CCCCEEEeCCceeeecCCCcc Q lcl|NC_018086. 1 MSK---------R-----PPFRARSSSVALQRAIVKEIRAQ---GINIWDG-VNKK---PEYPFIKIGEELTSGRTISKD 59 (144) Q Consensus 1 m~~---------~-----~~~~~~S~~~aLQ~AI~~~L~a~---~~~VyD~-vP~~---a~~PYV~lG~~~~~~~d~t~~ 59 (144) |.- + -|-|++-. ++-.-||.-|... ..+||-. .|++ -.|||-+| +.+..+.+ +. T Consensus 1 m~~~ir~~dg~~r~lydv~pnayn~g--e~le~~y~ml~E~i~s~~~i~rn~nP~P~~si~YPy~tf-e~D~e~~~--dn 75 (154) T protein:vir:78 1 MAVNIRFPDGTVRPLYDVKPNAYNRG--ELLEIIYEMLNEAVKSEIDVFRNKNPKPVNSITYPYMTF-EVDNAKVD--DN 75 (154) T ss_pred CeeEeecCCCcccceeecCCCccchh--HHHHHHHHHHHHHHHHHHHHHhhcCCCcceeEecceeee-eecccccc--CC Confidence 221 1 13333322 5556677666443 3367743 3543 27999999 44443432 35 Q ss_pred cCccEEEEEEEEEeCCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeec Q lcl|NC_018086. 60 AIGKMHNLTLHIWSDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDS 139 (144) Q Consensus 60 ~~g~~~~l~i~VWs~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~ 139 (144) ..|.-+.++|+++.+..++.-.-.+.+.++.+|+..... .+.|.+. ..+...+=+.|.+-++ --++.+.++|=|=-. T Consensus 76 q~~~gvylDidlfDR~~s~~nl~~l~d~L~~~Ld~kR~l-t~dy~~~-~~~e~snkIP~ET~re-LLRR~va~~FYIer~ 152 (154) T protein:vir:78 76 EHGTMVAVDCELFDRGTTSDMIDKYTDMLNNELDHKRHS-YEDYWVK-TELERDRDIPDETDKE-LLRRMVALTFYIERN 152 (154) T ss_pred cccceEEEEEEEeecCCCchhHHHHHHHHHhhhhhhccc-ccceeEE-EEEccCCCCCchhHHH-HHhhhhheeEEEEec Confidence 678899999999999999999999999999999988763 4455442 3333333333321111 112346788887765 Q ss_pred cc Q lcl|NC_018086. 140 TI 141 (144) Q Consensus 140 ~~ 141 (144) .. T Consensus 153 ~s 154 (154) T protein:vir:78 153 DS 154 (154) T ss_pred CC Confidence 55 No 92 >protein:vir:7450 Length: 141 # NCBI annotation: gp27 # Family: family:all:6926 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818565;genbank:gi:29567002;genbank:GeneID:1260235 Probab=76.66 E-value=0.13 Score=25.39 Aligned_cols=122 Identities=9% Similarity=-0.016 Sum_probs=63.6 Q ss_pred CChhhHHHHHHHHHHhhcC---------Cccccc-----CcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEE-EeC Q lcl|NC_018086. 10 RSSSVALQRAIVKEIRAQG---------INIWDG-----VNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHI-WSD 74 (144) Q Consensus 10 ~S~~~aLQ~AI~~~L~a~~---------~~VyD~-----vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~V-Ws~ 74 (144) -+ ..|++.+|++|. +.|++. .|.. +-|||+| ..+.++...+-...-+...+-+|. |.. T Consensus 1 M~-----~a~vl~~lr~D~~L~a~g~~~~~v~~~~~~d~rP~~-~G~FiV~-~W~~~~i~~~I~rgPr~~~iwvH~P~~~ 73 (141) T protein:vir:74 1 MH-----PSILYDSIAHDPELNAMGITPSRIKELDSIDKRPFD-SGYFIVT-RWLDQDLHPTINRGPRDLMVWCHMPKDR 73 (141) T ss_pred Cc-----HHHHHHHHhccchhhhhccccceeeecccccCCCCC-CCcEEEE-eccCcccccccCCCCceEEEEEecchhc Confidence 02 246788888873 356653 3333 6789999 344444444432222333333333 355 Q ss_pred CcchHHHHHHHHHHHHHhcCCcc-ccCCCceEEEEEEeeeeeeecccccccCccEEE--EEEEEEe-ecccCC Q lcl|NC_018086. 75 YDSSFEVKNLTDFLVGLLINSPL-QLEEGFCIGKKELDHVRYTEAANGTYKNERAYL--FLDFEVI-DSTIDP 143 (144) Q Consensus 75 ~~G~~eak~Ia~~V~~aL~~~~L-~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~--~l~fri~-t~~~~~ 143 (144) ...+....+|.++|.+....-+- +=++ +.++.+.++.--.+...+.|--.+ -.+|.+| +.++=- T Consensus 74 stdf~~id~il~Ri~eI~~svE~~~G~D-----G~~l~~v~~~g~s~dl~D~G~kTi~R~ATy~vL~d~nt~~ 141 (141) T protein:vir:74 74 GRNFLPIERILERINDIWASVEAQTGTD-----GVRVTSVKRRGQSGNLEDEGWKTLARNATFSVLYDRNTVQ 141 (141) T ss_pred cCCcchHHHHHHHHHHHHhhccccccCC-----ceEEEEEeeeccCCCccccchhhhhhhceeeeeecceecC Confidence 66788899999998887643221 1123 356666666554443333332222 2566666 222222 No 93 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=75.21 E-value=0.15 Score=25.11 Aligned_cols=122 Identities=11% Similarity=0.211 Sum_probs=77.1 Q ss_pred CCCCChhhHHHHHHHHHHhhcCC--cccccCcC---CCCCCE--EEeCCceeeecCCCcccCccEEEEEEEEEeCCc-ch Q lcl|NC_018086. 7 FRARSSSVALQRAIVKEIRAQGI--NIWDGVNK---KPEYPF--IKIGEELTSGRTISKDAIGKMHNLTLHIWSDYD-SS 78 (144) Q Consensus 7 ~~~~S~~~aLQ~AI~~~L~a~~~--~VyD~vP~---~a~~PY--V~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~-G~ 78 (144) |.+ .+++++|.++|++... .+||+.|. +...|- |-|-+.+.++. +-|..-|+-.|.|.|+=+.. +- T Consensus 1 ~~h----t~IR~~Vid~L~~~l~~v~~fdG~P~fide~ElPAVAV~l~d~~~~~~--~ld~~~w~A~LhI~iyLka~~~d 74 (131) T protein:vir:34 1 MKH----TELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGE--ELDSDTWQAELHIEVFLPAQVPD 74 (131) T ss_pred Cch----HHHHHHHHHHHhccCCceEEecCCceeeccccCcEEEEEeecCCCCcc--eecCCeeEEEEEEEEEeecCCCH Confidence 422 5899999999988532 48898883 456785 45566666554 44677899999999996653 44 Q ss_pred HHHHHHHHH-HHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 79 FEVKNLTDF-LVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 79 ~eak~Ia~~-V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) .+.-++++. |..++.+.+- |. |. +..+.+.+..+-+|..-.+ + +...|.|+|-=+- T Consensus 75 s~LD~~~E~~i~~v~~~~~~-l~-~l-~~~~~~~gy~Y~rD~e~~t--W-~sadL~y~ItY~~ 131 (131) T protein:vir:34 75 SELDAWMESRIYPVMSDIPA-LS-DL-ITSMVASGYDYRRDDDAGL--W-SSADLTYVITYEM 131 (131) T ss_pred HHHHHHHHHHhHHHhhcchh-hh-hH-hhhhhhccCCcccccccce--E-EEEEEEEEEEEeC Confidence 556668886 6677754221 11 11 2356667777888776543 1 2345666654222 No 94 >protein:vir:1994 Length: 182 # NCBI annotation: Hypothetical protein # Family: family:all:1387 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050641;genbank:gi:9633528;genbank:GeneID:2636286 Probab=72.73 E-value=0.18 Score=24.68 Aligned_cols=122 Identities=11% Similarity=0.118 Sum_probs=63.0 Q ss_pred hhhHHHHHHHHHHhhc-CC---------cccc-cCcC----CCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEe--- Q lcl|NC_018086. 12 SSVALQRAIVKEIRAQ-GI---------NIWD-GVNK----KPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWS--- 73 (144) Q Consensus 12 ~~~aLQ~AI~~~L~a~-~~---------~VyD-~vP~----~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs--- 73 (144) .-.+.+.||.+||++. |. +=|| ..+. ..|-=||+++...... ......-++.+=|-. T Consensus 1 mI~~iEdAi~~rl~~~~g~~v~~V~sy~Gefd~e~l~~~~~~~PAv~Va~~G~~~~~-----~r~~~~~r~~v~V~a~~~ 75 (182) T protein:vir:19 1 MLEETEAALLARVRELFGATLRQVEPLTGTWTNEDVHRLFLAPPSVFLAWMGCGEGR-----TRREVESRWAFFVVAELL 75 (182) T ss_pred ChHHHHHHHHHHHHHHhhhhhhhhccCCCCCChhhhhHhhhcCceeEEEeccccCcC-----CceeeeeEEEEEEEecCC Confidence 3578999999999774 11 3444 2222 2233489985322111 111222233333333 Q ss_pred --CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee--------cccCC Q lcl|NC_018086. 74 --DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID--------STIDP 143 (144) Q Consensus 74 --~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t--------~~~~~ 143 (144) ....+..+.+|.++|+.+|+++.+.+...+ +....|.+-. ......|...-.+.|.... .++++ T Consensus 76 ~g~~~~rvG~y~lv~~v~~lL~~q~~g~~~~l-----~p~~vrnL~s-~~~~~~gvsvyavef~~~~~lp~~~d~~~l~d 149 (182) T protein:vir:19 76 NGEPVNRPGIYQIVERLIAGVNGQTFGPTTGM-----RLTQVRNLCD-DNRINAGVVLYGVLFSGTTPLPSVVDLDSLDD 149 (182) T ss_pred CChhhhhhhHHHHHHHHHHHHhccCCCCcccc-----ccceeeeeec-hhhhhCceEEEEEEeeccccCCCcCCCCCCcc Confidence 334445689999999999999888754322 2333333321 1122234455578887441 11222 Q ss_pred C Q lcl|NC_018086. 144 Y 144 (144) Q Consensus 144 ~ 144 (144) | T Consensus 150 f 150 (182) T protein:vir:19 150 Y 150 (182) T ss_pred h Confidence 2 No 95 >protein:vir:98629 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039928;genbank:gi:126011103;genbank:GeneID:4818465 Probab=71.35 E-value=0.089 Score=26.33 Aligned_cols=115 Identities=17% Similarity=0.193 Sum_probs=67.3 Q ss_pred hhhHHHHHHHHHHhhcCC----ccc-ccCcC--CCCCCEEEeCCcee---eecCCCcccCccEEEEEEEEEeCCcchHHH Q lcl|NC_018086. 12 SSVALQRAIVKEIRAQGI----NIW-DGVNK--KPEYPFIKIGEELT---SGRTISKDAIGKMHNLTLHIWSDYDSSFEV 81 (144) Q Consensus 12 ~~~aLQ~AI~~~L~a~~~----~Vy-D~vP~--~a~~PYV~lG~~~~---~~~d~t~~~~g~~~~l~i~VWs~~~G~~ea 81 (144) .+...-.-||..|+++.. +|+ =..|+ +..-|||+|-+-.. +.+. +......+...+|+| ...+|.++ T Consensus 1 mm~DiL~~Iy~~L~~d~~i~~~~Ikfye~Pe~~d~~~p~IVI~Pl~~P~p~~~~-sd~~ls~~y~yQIDV--es~~R~~~ 77 (126) T protein:vir:98 1 MVRDMLAEVFDLLKADNVLKLVKIKSFERPESLLDDQTSIVILPITAPKQSTFG-SDTALSKKFLYQIEV--ESTSRLEC 77 (126) T ss_pred ChhHHHHHHHHHHhcCceeceeeeeeeecCCccccCcceEEEeeCCCCCccccc-CChhhheeeeeeeec--ccccccch Confidence 245666789999988742 443 23443 56789999965422 2332 234556789999999 67889999 Q ss_pred HHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEE-EEEEEEeecccCCC Q lcl|NC_018086. 82 KNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYL-FLDFEVIDSTIDPY 144 (144) Q Consensus 82 k~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~-~l~fri~t~~~~~~ 144 (144) ++|+.+|+..|-. + ||.=.+-- .-+++.+.. .++ +=|||-++.==|-| T Consensus 78 ~~i~~rI~~~l~~----~--gf~q~~~g--ldeY~~Et~-------ryvdaRrY~G~~k~y~~y 126 (126) T protein:vir:98 78 KDLQRRIEKQLEK----I--GFYQNDAG--FERFDRDTG-------RYLDARTFRGFSNIYEDY 126 (126) T ss_pred HHHHHHHHHHHHH----c--CccccccC--cchhhhhhh-------hhhhhhhhccCchhhhcC Confidence 9999999999942 2 32110000 112333222 122 23566555444445 No 96 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=67.16 E-value=0.26 Score=23.82 Aligned_cols=126 Identities=13% Similarity=0.124 Sum_probs=59.0 Q ss_pred CCCC------------CCCCCCChhhHHHHHHHHHHhhcCC--c-ccccCc---CCCCCCEEEe----CCceeeecCCCc Q lcl|NC_018086. 1 MSKR------------PPFRARSSSVALQRAIVKEIRAQGI--N-IWDGVN---KKPEYPFIKI----GEELTSGRTISK 58 (144) Q Consensus 1 m~~~------------~~~~~~S~~~aLQ~AI~~~L~a~~~--~-VyD~vP---~~a~~PYV~l----G~~~~~~~d~t~ 58 (144) .|++ --| +.--+.+.++++-.++++... + .|.++. ....-+|+-+ ++.+..+. .. T Consensus 20 ~~~~~~~~~~~~~~~~~~~-h~ei~~a~rk~l~~~a~a~~~~LpVA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L--~g 96 (169) T protein:vir:10 20 LSKRGLAVTLLRRYRRLNV-HYEMMVAARKLVSDAAVDIAGSLPVAYENCGFTPPKNGSSWLKFDYTEVDSVTWGL--QR 96 (169) T ss_pred hcccceehhhhhhhhhcch-HHHHHHHHHHHHHHHHhhcccCCcEeeCCCCcCCCCCCccEEEEEEecCCceeeec--cC Confidence 1211 111 111244666666666665322 2 223322 1222244333 33333332 22 Q ss_pred ccCccEEEEEEEEEe-CCcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEE-EEEEEEE Q lcl|NC_018086. 59 DAIGKMHNLTLHIWS-DYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAY-LFLDFEV 136 (144) Q Consensus 59 ~~~g~~~~l~i~VWs-~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~-~~l~fri 136 (144) ......=.++|+|.. -..|..++++||+.|.+++.+. +.|++||--.. -. ....+.+.. ++ +-+||.+ T Consensus 97 d~R~y~GVfQIsVV~PaGtG~~ka~qiAdeiadlF~~g-t~L~~Gyi~~~-~~-~~p~i~~~s-------~~~iPvr~~~ 166 (169) T protein:vir:10 97 TCRYYVGMVQVSIFFSPGEGTDRPRQLAGRLSEAFADG-TMLDSGYIYEG-GS-VFPPVKSQS-------GWFIPVRFYV 166 (169) T ss_pred CCceEEEEEEEEEEecCCCCcchhHHHHHHHHHhhhCC-ceeeceeecCC-Ce-ECCeeecCC-------ceEEeEEEEE Confidence 223334457888765 5579999999999999998544 55777742111 01 111222222 33 3466664 Q ss_pred eec Q lcl|NC_018086. 137 IDS 139 (144) Q Consensus 137 ~t~ 139 (144) --- T Consensus 167 R~D 169 (169) T protein:vir:10 167 RMD 169 (169) T ss_pred EeC Confidence 422 No 97 >protein:vir:6215 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:10885 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852595;genbank:gi:31415855;genbank:GeneID:1489213 Probab=63.79 E-value=0.31 Score=23.36 Aligned_cols=107 Identities=13% Similarity=0.144 Sum_probs=56.1 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCccc-ccCcCCCCCCEEEeCCcee-eecCCCcccCccEEEEEEEEEeCCcch Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIW-DGVNKKPEYPFIKIGEELT-SGRTISKDAIGKMHNLTLHIWSDYDSS 78 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~Vy-D~vP~~a~~PYV~lG~~~~-~~~d~t~~~~g~~~~l~i~VWs~~~G~ 78 (144) |+..- .-+..+|+..|-.|| |..|.++.|||++..-... .-+.|.|...... .-+|++++...- T Consensus 1 M~i~F------------e~lr~~Lk~~g~~V~RD~ap~~t~YPyivYs~v~e~~k~AS~kv~~~~~-~YQvSl~T~GtE- 66 (109) T protein:vir:62 1 MQINF------------EQLRSLMKKSGIPVSRDNAPTGIDYPYIVYEFVNEQHKRASNKVLKDMP-LYQIAVITNGTE- 66 (109) T ss_pred CcccH------------HHHHHHHHhcCCceeeccCCCCCCCceEEEEeecCceeeeccceEeecc-eeEEEEeeccch- Confidence 44432 346678888888999 9999999999998742211 1234555544333 358999986432 Q ss_pred HHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEee Q lcl|NC_018086. 79 FEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVID 138 (144) Q Consensus 79 ~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t 138 (144) ++ ...+.+++.+.-+.-. + ..+++-=+..|-+|. .---.|.+. T Consensus 67 ~d----l~~l~k~f~~~~vpfs-~-------f~gIqgDENDdTiTn-----fyTyVrcie 109 (109) T protein:vir:62 67 KD----YEPLKAVFNEVGVSYS-Q-------FDGMDYDENDDTITQ-----FITYVRCIQ 109 (109) T ss_pred hH----HHHHHHHHhhcCCccc-c-------ccccCCCCCcchhee-----eeeeeEEeC Confidence 22 3345566655443221 2 222222222222221 112334444 No 98 >protein:vir:104348 Length: 129 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398976;genbank:gi:81343960;genbank:GeneID:3778880 Probab=62.20 E-value=0.34 Score=23.16 Aligned_cols=116 Identities=10% Similarity=0.102 Sum_probs=60.9 Q ss_pred hhhHHHHHHHHHHhhcCC---c-ccccCc---CCCCCCEEEe----CCceeeecCCCcccCccEEEEEEEEEe-CCcchH Q lcl|NC_018086. 12 SSVALQRAIVKEIRAQGI---N-IWDGVN---KKPEYPFIKI----GEELTSGRTISKDAIGKMHNLTLHIWS-DYDSSF 79 (144) Q Consensus 12 ~~~aLQ~AI~~~L~a~~~---~-VyD~vP---~~a~~PYV~l----G~~~~~~~d~t~~~~g~~~~l~i~VWs-~~~G~~ 79 (144) .|.|.++.+-+++.+.-. + .|.++. ....-+|+-+ ++.+..... -+| +...=.++|+|.. -..|.. T Consensus 1 ~s~aar~~v~d~~~~~~~~~lpVA~eNv~FtPp~~G~~YLr~~~lpa~T~~~~L~-~d~-r~y~Gv~QI~Vv~p~G~G~~ 78 (129) T protein:vir:10 1 MSLAARKFVNDLLVNEFPVRYPVAWENAAFTPPADGSIWLKYDYTEVDTVTYGLS-RKC-KYYVGMVQISVFFSPGTGID 78 (129) T ss_pred CchHHHHHHHHHHHHhhcCCCcEeecCCCcCCCCCCceEEEEEecCCCceeeecc-CCC-ceEEEEEEEEEEecCCCCcc Confidence 467999988888876311 2 223322 1111134332 444433332 223 3344457888765 457899 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEE-EEEEEEEeec Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAY-LFLDFEVIDS 139 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~-~~l~fri~t~ 139 (144) ++++||+.|.+++.+. +.|..||--.. -.+ ...+.+. -++ +-+||.+--- T Consensus 79 ~a~~iA~ei~d~F~~g-~~L~~Gyi~~~-~~~-~p~i~~~-------~~~~ipvr~~~r~d 129 (129) T protein:vir:10 79 KPRQIANQLAESIVDG-TMLDSGTIYES-GVV-NPVIKSK-------SGWFIPVRFYVRLD 129 (129) T ss_pred hhhHHHHHHHHhccCC-ceeeceeecCC-CeE-CCeeecC-------CceEEeEEEEEEeC Confidence 9999999999988554 55777742111 111 1122222 233 3466654422 No 99 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=57.58 E-value=0.43 Score=22.59 Aligned_cols=130 Identities=10% Similarity=0.068 Sum_probs=66.5 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCC-----cccccCcCCCCCCEEEeCCceeeec--CCCcccCccEEEEEEEEEe Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGI-----NIWDGVNKKPEYPFIKIGEELTSGR--TISKDAIGKMHNLTLHIWS 73 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~-----~VyD~vP~~a~~PYV~lG~~~~~~~--d~t~~~~g~~~~l~i~VWs 73 (144) ||. ..+ .+|+.||.+.|++.-. ..|+.....-.-|-|-+.=....+. .++. +.....++.++|-- T Consensus 1 mt~-~~l------~~lh~AI~~~Lk~~~p~l~~~~~y~~~~~~i~~PAv~vel~~~~~~~d~~tG-q~~~~~~~~a~~vv 72 (182) T protein:vir:10 1 MSQ-TTI------TEVHEAIKAKLRETFPKVTVDDYNPEPELSVLAPALLLELEEFPMGADVGDD-RYPAACRFSVHCVL 72 (182) T ss_pred CCc-CCH------HHHHHHHHHHHHHhcCCceeeecCccccCccccceeeeeeecCCcCCCCCCC-cEEEEEEEEEEEEe Confidence 777 445 8999999999997522 3444444444456444432222121 1222 22345666677665 Q ss_pred C---CcchHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeec-cccc---ccCccEEEEEEEE--E-eeccc-C Q lcl|NC_018086. 74 D---YDSSFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEA-ANGT---YKNERAYLFLDFE--V-IDSTI-D 142 (144) Q Consensus 74 ~---~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d-~dg~---~~~~~~~~~l~fr--i-~t~~~-~ 142 (144) . ..-..++..+|.+|...+++...-|+.+ ++. ..+++.. |+.. ..+|...=.|.|+ + |-+++ + T Consensus 73 ~~~~~~~~~~~~~lAa~l~~~v~~~~wGL~~~-~v~-----~a~~i~a~p~~f~~~~~dgy~vW~VeW~Q~i~LG~s~w~ 146 (182) T protein:vir:10 73 GWEVKSLALELWEFSAAVAQLIRKSGVWVKGG-VLT-----KPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWN 146 (182) T ss_pred cccCCCchHHHHHHHHHHHHHHhcCcccCCcc-ccC-----ccceeeeccCccChhhcCceEEEEEEEEEEEeeCCcccC Confidence 3 2235789999999999999888776511 111 1222221 1111 1133333355555 1 11111 1 Q ss_pred CC Q lcl|NC_018086. 143 PY 144 (144) Q Consensus 143 ~~ 144 (144) .= T Consensus 147 ~~ 148 (182) T protein:vir:10 147 AD 148 (182) T ss_pred CC Confidence 00 No 100 >protein:vir:81158 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:1089 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285817;genbank:gi:148747738;genbank:GeneID:5247201 Probab=57.17 E-value=0.35 Score=23.11 Aligned_cols=105 Identities=8% Similarity=0.152 Sum_probs=54.8 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCc-ccccCcCCCCCCEEEeCCceee--ecCCCcccCccEEEEEEEEEeCCcc Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGIN-IWDGVNKKPEYPFIKIGEELTS--GRTISKDAIGKMHNLTLHIWSDYDS 77 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~-VyD~vP~~a~~PYV~lG~~~~~--~~d~t~~~~g~~~~l~i~VWs~~~G 77 (144) |+ ==+.-|++.|++-|-+ -||+-..++.+||+++=..... -.|+ ..-...-.++|..+++... T Consensus 2 ~~------------mt~~~l~~~Lk~~GlPvay~~F~~gp~pPyivY~~~~~~~~~ADn--~vy~~~~~~~IELYT~~KD 67 (109) T protein:vir:81 2 VK------------MTQAELYQALKSIGFPVAYGSFTNPVTPPFITYQFAYSNDMMADN--INYVAIDDFQVELYTKKKD 67 (109) T ss_pred ee------------ecHHHHHHHHHhcCCCeeeccCCCCCCCceEEEEeccCcceeccc--eEEEeccceEEEEEeeccC Confidence 22 1245588999988766 4688888888899998332222 2222 2234455679999996553 Q ss_pred hHHHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeec Q lcl|NC_018086. 78 SFEVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDS 139 (144) Q Consensus 78 ~~eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~ 139 (144) ...= ..|.++|++..+. |.-... ++++..= -.+.=.|+++-- T Consensus 68 ~~~E----~~iE~~L~~~~i~----y~k~et------~IesEkl------yq~~Y~~~~~g~ 109 (109) T protein:vir:81 68 PVAE----QKVQDKLKELGLP----YRKFET------FIDTENL------FQILYEIQILGG 109 (109) T ss_pred hHHH----HHHHHHHHhcCCc----eeeeEE------EecCCce------EEEEEEEEEecC Confidence 3211 2667777655443 211111 2222110 112234444444 No 101 >protein:vir:5259 Length: 213 # NCBI annotation: hypothetical protein # Family: family:all:4879 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852764;genbank:gi:31544039;uniprot:Q7Y5T6;genbank:GeneID:2753558 Probab=47.59 E-value=0.33 Score=23.25 Aligned_cols=131 Identities=18% Similarity=0.227 Sum_probs=73.6 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccC-cCCCCCCEEEeCCceeeecCCCccc-Cc--------cEEEEEEE Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGV-NKKPEYPFIKIGEELTSGRTISKDA-IG--------KMHNLTLH 70 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~v-P~~a~~PYV~lG~~~~~~~d~t~~~-~g--------~~~~l~i~ 70 (144) |--+-+| .+|++-|.+.|.=-.+.|-|.- |.++--|||++-....++-...++. .| .+..+.+. T Consensus 51 TvS~lD~------~~LRq~ir~Ll~LPeg~Vid~~~da~p~~pFITV~~l~ss~lG~a~reFdg~rEvit~S~et~vs~t 124 (213) T protein:vir:52 51 TISGFDI------VRLRKLIQQALQLPDGVVIGGWLPENPLSAFITVDVLMSSETGIARRDFDGKRERITMSMQNTVSFS 124 (213) T ss_pred ccccccH------HHHHHHHHHHHhCCcceecCCcCCCCCCCCeEEEeecccchhhhhhhhccCchhhhhhhhccEEEEE Confidence 3334445 7999999888876555677764 5555559999855444332111110 12 23334444 Q ss_pred EEeCCcchHHHHHHHHHHHHHhcCCccccCCCceEE---EEEEeeeeeeecccccccCccEEEEEEEE---EeecccCCC Q lcl|NC_018086. 71 IWSDYDSSFEVKNLTDFLVGLLINSPLQLEEGFCIG---KKELDHVRYTEAANGTYKNERAYLFLDFE---VIDSTIDPY 144 (144) Q Consensus 71 VWs~~~G~~eak~Ia~~V~~aL~~~~L~L~~g~~~~---~~~~~~~r~~~d~dg~~~~~~~~~~l~fr---i~t~~~~~~ 144 (144) .+. .+|.+.....+..|+.. ..+ .++... -.+....+.+...-|....+|+.+.++|. .+.+++||- T Consensus 125 afG-----tnAy~ll~kl~a~L~ss-~al-~~LK~l~aGlVr~S~v~nLsa~iggg~e~RArfdltfsH~HrVet~l~~~ 197 (213) T protein:vir:52 125 CFG-----TNAMAQCYKLKAILQSS-VIL-QALKTMNVGIVSFSDVRNLTATIGSDYEERGQFDAVFSHHHIVDTPLDPI 197 (213) T ss_pred EeC-----hhHHHHHHHHHHHHhHH-HHH-HHHHHhccceeeeccccccceecCCCchhheeeeeeeeeeeeeccchHHH Confidence 443 34556666665555422 222 122221 23445556666655555677888888887 667888887 No 102 >protein:vir:105468 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529878;genbank:gi:90592618;genbank:GeneID:3974532 Probab=46.32 E-value=0.74 Score=21.30 Aligned_cols=117 Identities=12% Similarity=0.131 Sum_probs=71.5 Q ss_pred hHHHHHHHHHHhhc--CCcccc-cCcCCCCCC--EEEeCCceeeecCCCcccCccEEEEEEEEEeC-CcchHHHHHHHHH Q lcl|NC_018086. 14 VALQRAIVKEIRAQ--GINIWD-GVNKKPEYP--FIKIGEELTSGRTISKDAIGKMHNLTLHIWSD-YDSSFEVKNLTDF 87 (144) Q Consensus 14 ~aLQ~AI~~~L~a~--~~~VyD-~vP~~a~~P--YV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~-~~G~~eak~Ia~~ 87 (144) -+|-.||-++|+.. ...||+ .++.+-..| ||.+=+...... -...--.++.+.|+=+.+ .....++.++++. T Consensus 1 ~~ii~~I~~~L~~~fpd~~IY~e~i~Qg~~~PcFFI~~l~~~~~~~--~~~ry~r~~~fdI~Yfp~~~~~~~e~~~vae~ 78 (135) T protein:vir:10 1 MTIVERIAKRISEIFPDVTIYSEKQKSGFQVPSFYISKIMTVTKSR--FFDIQDRSLSYSITYFANPDRPNADMEEVEQK 78 (135) T ss_pred ChhHHHHHHHHHHhcCceeeecccccCCCcCCeeEEEEecCCcccc--ccceEEEEeeEEEEEeecCCCchhhHHHHHHH Confidence 48889999999874 357995 689999999 466544433332 233445677888887874 4568999999999 Q ss_pred HHHHhcCCccccCCCceEEEEEEeeeeeeecc-cccccCccEEEEEEEEEee-cccCCC Q lcl|NC_018086. 88 LVGLLINSPLQLEEGFCIGKKELDHVRYTEAA-NGTYKNERAYLFLDFEVID-STIDPY 144 (144) Q Consensus 88 V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~-dg~~~~~~~~~~l~fri~t-~~~~~~ 144 (144) +.+.|.-- ++ ..+..+.++.... ||+ .|-.+.++|++.- .+.+|. T Consensus 79 L~~~le~i-----~~----~~~~~~~~~~i~~~D~V---Lhf~~~~~~~~~k~~~~~~M 125 (135) T protein:vir:10 79 LLNNFTRL-----DD----YATVRNRETTINQDDET---LVMSFDLRLEMYPVQDGGKL 125 (135) T ss_pred HHHhhhhc-----Cc----eeEEeCCceEEEeecCe---EEEEEEEEEEEeecCCcchh Confidence 98877321 22 1233333333211 333 2345667777663 344444 No 103 >protein:vir:95371 Length: 104 # NCBI annotation: aminopeptidase # Family: family:all:1089 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764481;genbank:gi:115334635;genbank:GeneID:5179258 Probab=40.17 E-value=0.98 Score=20.62 Aligned_cols=103 Identities=13% Similarity=0.179 Sum_probs=56.6 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCc-ccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGIN-IWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~-VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~ 79 (144) |+. .-|.+.|++-|-+ -|++--.++..||+++=.....+.......-...-.++|..++...... T Consensus 1 Mt~--------------~~l~~~Lk~~glPvay~hF~~~p~pPyivy~~~~~~~~~ADn~~y~~~~~~~IELYT~~Kd~~ 66 (104) T protein:vir:95 1 MKL--------------TELDDLLKATGLPVAYSHFSKPQKPPFITYMVAYSSNFTADDQVYQEIENVQIELYTLKKDFE 66 (104) T ss_pred CCH--------------HHHHHHHHhcCCCeeeccccCCCCCceEEEEecCCcceeccceEEEeecceEEEEEeeccCHH Confidence 553 4488899987765 4677666667799998443333321111223445567999999877543 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) .=+ .|.++|++..+. |.... .++++..= -.+.=.|+++ T Consensus 67 ~E~----~iE~~Ld~~~i~----y~k~e------t~IesEkl------yq~~Y~~~l~ 104 (104) T protein:vir:95 67 AEE----KVKAVLDANNLV----YETSE------TYIPSEKL------YQKVYEVRLL 104 (104) T ss_pred HHH----HHHHHHHhCCCc----eeeEE------EEecCcce------EEEEEEEEeC Confidence 322 566677655442 22111 13332211 1234577777 No 104 >protein:vir:3874 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:28620 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680491;swissprot:trembl:p94215;genbank:gi:22296531;uniprot:P94215;genbank:GeneID:951676 Probab=37.32 E-value=1.1 Score=20.30 Aligned_cols=100 Identities=11% Similarity=0.113 Sum_probs=57.2 Q ss_pred hHHHHHHHHHHhhcC-------CcccccCcCCC-----CCCEEEeCC--ceeeecCCCcccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 14 VALQRAIVKEIRAQG-------INIWDGVNKKP-----EYPFIKIGE--ELTSGRTISKDAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 14 ~aLQ~AI~~~L~a~~-------~~VyD~vP~~a-----~~PYV~lG~--~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~ 79 (144) .|=-+-|+.-|.++. .++|-..|.+. ..|.|-+-+ ++.... +.+.+.-+--.++++-|-+..|-. T Consensus 1 ~~PE~~vaDiLsad~~lv~~mYipift~tpdd~fik~SsAPWiRiTpiPGDda~y-aDD~R~~EYPrVqVDfWvr~e~~d 79 (114) T protein:vir:38 1 MAPEKRVYDILSANLDIADKVYIGTPNFNNQTSATPESLAPWVRITYLPGDAADY-ADDSRILEYPKVQVDFWVGITDWD 79 (114) T ss_pred CCchhhhhhhhccchhhhhheeccCCCCCCCCcccccccCCeeEeeecCCccccc-cccceeeecCceeEEEeeccCChh Confidence 022234667777763 26777777543 568877632 222121 122334445578999999999999 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccE Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERA 128 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~ 128 (144) +..+|-..|.++||....+ -|. .-+=+||...-- + T Consensus 80 ~~e~iqe~IY~~Lha~gwe---RYY----------~nsY~D~~~~~~-~ 114 (114) T protein:vir:38 80 QQEKIETQIYQALHAADWE---RYY----------RNSYVDGIPQPF-A 114 (114) T ss_pred hHHHHHHHHHHHHHhcCcc---eee----------eccccCCCCCCC-C Confidence 9999999999999855332 111 111234432110 0 No 105 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=36.37 E-value=1.2 Score=20.20 Aligned_cols=131 Identities=17% Similarity=0.191 Sum_probs=64.3 Q ss_pred CCCCChh--h-HHHHHHHHHHhhcCC--cc-c-ccCcCCCCCCEEEeCCceeeecC--CCcccCcc--EEEEEEEEEeCC Q lcl|NC_018086. 7 FRARSSS--V-ALQRAIVKEIRAQGI--NI-W-DGVNKKPEYPFIKIGEELTSGRT--ISKDAIGK--MHNLTLHIWSDY 75 (144) Q Consensus 7 ~~~~S~~--~-aLQ~AI~~~L~a~~~--~V-y-D~vP~~a~~PYV~lG~~~~~~~d--~t~~~~g~--~~~l~i~VWs~~ 75 (144) |..-.+. + -|=+-|.++|....+ .| . |..-+.+.|||+++- .. .|+- ..+-..++ +..+++.|.|. T Consensus 1 ~~~~~~~~~~~~lv~~ii~~i~~~~~gl~vI~~~~~g~~p~yPF~TY~-v~-~pyi~~~~~~~~~e~~~~~isi~~~S~- 77 (162) T protein:vir:80 1 MPNDTAGYDYGKLVKTLINAVNELSGGLQLIESSSGGEQPEYPFCQYT-IT-SPYIAISPDIVEGEQFEIVISLTWRAL- 77 (162) T ss_pred CCCccccccHHHHHHHHHHHHHhhhcceeEEEccCCCCCCCCCeEEEE-Ee-cCccccCCcccCCcceEEEEEEEEEeC- Confidence 2222221 1 244666777766543 33 3 345567899999973 21 2211 11111333 45566667665 Q ss_pred cchHHHHHHHHHHHHHhcCC--ccccCCCceEEEEEEeeeeeeeccccc--ccCccEEE-EEEEEEee------cccCCC Q lcl|NC_018086. 76 DSSFEVKNLTDFLVGLLINS--PLQLEEGFCIGKKELDHVRYTEAANGT--YKNERAYL-FLDFEVID------STIDPY 144 (144) Q Consensus 76 ~G~~eak~Ia~~V~~aL~~~--~L~L~~g~~~~~~~~~~~r~~~d~dg~--~~~~~~~~-~l~fri~t------~~~~~~ 144 (144) ..-||.++|..+++.|... .-.+-.+ .++-+++..-..+.+-. ..-.+.++ .++||+.. ++|+-+ T Consensus 78 -~~~eAl~la~~l~~~f~~~~~~~~~~~~---~gIvvvdv~~~~~R~~~~~~~yerR~GFD~~~Rv~r~~e~~~~tIe~i 153 (162) T protein:vir:80 78 -SGHQALNLANITNKYFRSQKGRFFMQEN---GGIVVVSVQNSGLRDTFISIEYERSAGIDLRLRVVDSYSSEIQEIDNI 153 (162) T ss_pred -CHHHHHHHHHHHHHHhhcCCceeeeeec---CcEEEEecCCCccceeEeeeeeeeeecceEEEEEeeccccccceeeee Confidence 5599999999999999532 1111110 01112222211111111 11134554 68899874 345555 No 106 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=33.80 E-value=1.3 Score=19.90 Aligned_cols=121 Identities=12% Similarity=0.177 Sum_probs=73.9 Q ss_pred CCCCChhhHHHHHHHHHHhhcC---CcccccCcC---CCCCCE--EEeCCceeeecCCCcccCccEEEEEEEEEeCC-cc Q lcl|NC_018086. 7 FRARSSSVALQRAIVKEIRAQG---INIWDGVNK---KPEYPF--IKIGEELTSGRTISKDAIGKMHNLTLHIWSDY-DS 77 (144) Q Consensus 7 ~~~~S~~~aLQ~AI~~~L~a~~---~~VyD~vP~---~a~~PY--V~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~-~G 77 (144) |.+ .+++++|.++|++.. ..+||..|. +...|- |-|-+.+.++. +-|+.-|+-.|.|.|+=+. .+ T Consensus 1 ~~h----t~IR~~Vid~L~~~l~~~~~ffdGrP~fiDe~elPAVAV~l~d~~~~~~--~ld~~~w~A~LhI~iyLka~~~ 74 (132) T protein:vir:39 1 MKH----RDIRKVIIDALESAIGTDAIYFDGRPAVLEEGDFPAVAVYLTDAEYTGE--ELDADTWQAILHIEVFLEAQVP 74 (132) T ss_pred Cch----HHHHHHHHHHHHhhCCCceEEecCcceeeccccCcEEEEEeecCCCCcc--eecCCeeEEEEEEEEEeecCCC Confidence 422 589999999999853 357898883 556785 45566665553 4467788999999999654 35 Q ss_pred hHHHHHHHHH-HHHHhcCC-ccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEeecc Q lcl|NC_018086. 78 SFEVKNLTDF-LVGLLINS-PLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVIDST 140 (144) Q Consensus 78 ~~eak~Ia~~-V~~aL~~~-~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~t~~ 140 (144) -.+.-++++. |..++.+. .|. +. +..+...+..+-+|....+ + +...|.|+|-=+- T Consensus 75 ds~LD~~aE~~i~p~i~~~~~l~---~l-~~~~~~~gy~Y~rD~~~at--W-~sadL~y~ItY~~ 132 (132) T protein:vir:39 75 DSELDDWMETRVYPVLAEVPGLE---SL-ITTMVQQGYDYQRDDDMAL--W-SSADLKYSITYDM 132 (132) T ss_pred HHHHHHHHHHHhHhhhcccchhh---hH-hhhhhhcCCCcccccccce--E-EEEEEEEEEEEeC Confidence 5667778874 45566442 221 10 0112334456777776543 1 2346777754333 No 107 >protein:vir:80109 Length: 104 # NCBI annotation: Putative aminopeptidase # Family: family:all:1089 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425609;genbank:gi:155042942;genbank:GeneID:5469534 Probab=32.65 E-value=1.4 Score=19.77 Aligned_cols=103 Identities=12% Similarity=0.183 Sum_probs=56.0 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCc-ccccCcCCCCCCEEEeCCceeeecCCCcccCccEEEEEEEEEeCCcchH Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGIN-IWDGVNKKPEYPFIKIGEELTSGRTISKDAIGKMHNLTLHIWSDYDSSF 79 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~-VyD~vP~~a~~PYV~lG~~~~~~~d~t~~~~g~~~~l~i~VWs~~~G~~ 79 (144) ||. .-|.+.|++-|-+ -|++--..+..||+++=.....+.......-...-.++|..+++..... T Consensus 1 Mt~--------------~~l~~~Lk~~glPvay~~F~~~P~pPyivy~~~~~~~~~ADn~~y~~~~~~~IELYT~~Kd~~ 66 (104) T protein:vir:80 1 MNL--------------DELNTILKQTGFPVAYSHFGKPQKPPFITYVVAYSSNFGADDKVYQDIENVQIELYTDKKDLE 66 (104) T ss_pred CCH--------------HHHHHHHHhcCCCeeeecCCCcCCCCEEEEEecCCcceeccceEEEeecceEEEEEeeccCHH Confidence 554 4488899887765 4576655567799998443332221111223445567999999877543 Q ss_pred HHHHHHHHHHHHhcCCccccCCCceEEEEEEeeeeeeecccccccCccEEEEEEEEEe Q lcl|NC_018086. 80 EVKNLTDFLVGLLINSPLQLEEGFCIGKKELDHVRYTEAANGTYKNERAYLFLDFEVI 137 (144) Q Consensus 80 eak~Ia~~V~~aL~~~~L~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~~~l~fri~ 137 (144) .=+ .|.++|++..+. |.... .++++..= -.+.=.|+++ T Consensus 67 ~E~----~iE~~Ld~~~i~----y~k~e------t~IesEkl------yq~~Y~~~l~ 104 (104) T protein:vir:80 67 AEE----RIKAVLDANSLY----YETTE------TYIPSERL------YQKVYEVRLL 104 (104) T ss_pred HHH----HHHHHHhhCCCc----eeeEE------EEecCcce------EEEEEEEEeC Confidence 322 566677655442 22111 13332211 1234477777 No 108 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=30.35 E-value=1.6 Score=19.49 Aligned_cols=123 Identities=14% Similarity=0.064 Sum_probs=63.8 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccCcCCCCCCEEEeCCceee--ecCCCcccCccEEEEEEEEEeCCcch Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGVNKKPEYPFIKIGEELTS--GRTISKDAIGKMHNLTLHIWSDYDSS 78 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~a~~PYV~lG~~~~~--~~d~t~~~~g~~~~l~i~VWs~~~G~ 78 (144) |++.- +.|+. +-+.+=|..- .+|=-+-+.+.+.||+.+-...-. ++.+| ..-.+++|++. .|. T Consensus 1 m~~~s---aP~~e----~~vv~WLsp~-~~va~~R~~~~PLPf~~V~Rv~G~d~~e~~t-----D~avvsv~~fg--~~~ 65 (134) T protein:vir:79 1 MATDS---APSIH----RVLVAWLSPL-GKVSTRRLSGDPLPHRVVRRVDGRDVPEEGS-----DSAVVSVHTFA--ASD 65 (134) T ss_pred CCccc---CCChh----eeeeeecccc-hhceeccCCCCCCCeEEEEEeCCCCCccccc-----cCceeEEEEee--CCH Confidence 55432 12221 1111112111 112223477889999998432221 11122 23347889987 677 Q ss_pred HHHHHHHHHHHH----HhcCCcc--ccCCCceEEEEEEeeeeeeecccccccCccEE---EEEEEEEeeccc Q lcl|NC_018086. 79 FEVKNLTDFLVG----LLINSPL--QLEEGFCIGKKELDHVRYTEAANGTYKNERAY---LFLDFEVIDSTI 141 (144) Q Consensus 79 ~eak~Ia~~V~~----aL~~~~L--~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~---~~l~fri~t~~~ 141 (144) ..|+.+++.+-+ ++-+.+. ++.+|. -..+...+++..|...+....++ -+-||.+=++-+ T Consensus 66 eaA~d~ad~vHrRM~kL~~~~~~~~~~~gG~---~~~id~~~vl~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:79 66 EAAENEAELTHQRMLELVVNPLTEIPVGGGV---VARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHhhHHHHHHHHHHHHHhcccccceecCCce---EEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 788888887744 4444444 344563 34677788888888765332222 145666555444 No 109 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=29.87 E-value=1.6 Score=19.43 Aligned_cols=123 Identities=14% Similarity=0.072 Sum_probs=63.8 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccCcCCCCCCEEEeCCceee--ecCCCcccCccEEEEEEEEEeCCcch Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGVNKKPEYPFIKIGEELTS--GRTISKDAIGKMHNLTLHIWSDYDSS 78 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~a~~PYV~lG~~~~~--~~d~t~~~~g~~~~l~i~VWs~~~G~ 78 (144) |++.- +.|+. +-+.+=|..- .+|=-+-+.+.+.||+.+-...-. ++.+| ..-.+++|++. .|. T Consensus 1 m~~~s---aP~~e----~~vv~WLsp~-~~va~~R~~~~PLPf~~V~Rv~G~d~~e~~t-----D~avvsv~~fg--~~~ 65 (134) T protein:vir:10 1 MATDS---APSIH----RVLVAWLSPL-GKVSTRRLSGDPLPHRVVRRVDGRDVPEEGS-----DVAVVSVHTFA--ASD 65 (134) T ss_pred CCccc---CCChh----eeeeeecccc-hhceeccCCCCCCCeEEEEEeCCCCCccccc-----ccceEEEEEee--CCH Confidence 55432 12221 1111112110 112223477889999998332211 11122 23447889987 677 Q ss_pred HHHHHHHHHHHH----HhcCCcc--ccCCCceEEEEEEeeeeeeecccccccCccEE---EEEEEEEeeccc Q lcl|NC_018086. 79 FEVKNLTDFLVG----LLINSPL--QLEEGFCIGKKELDHVRYTEAANGTYKNERAY---LFLDFEVIDSTI 141 (144) Q Consensus 79 ~eak~Ia~~V~~----aL~~~~L--~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~---~~l~fri~t~~~ 141 (144) ..|+.+++.+-+ ++-+.+. ++.+|. -..+...+++..|...+....++ -+-||.+=++-+ T Consensus 66 eaA~d~ad~vHrRM~kL~~~~~~~~~~~gG~---~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:10 66 EAAENEAELTHQRMLELVVNPLTEIPVGGGV---VARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHhhHHHHHHHHHHHHHhcccccceecCCce---EEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 788888887744 4444444 344563 34677788888888765332222 145666555444 No 110 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=29.87 E-value=1.6 Score=19.43 Aligned_cols=123 Identities=14% Similarity=0.072 Sum_probs=63.8 Q ss_pred CCCCCCCCCCChhhHHHHHHHHHHhhcCCcccccCcCCCCCCEEEeCCceee--ecCCCcccCccEEEEEEEEEeCCcch Q lcl|NC_018086. 1 MSKRPPFRARSSSVALQRAIVKEIRAQGINIWDGVNKKPEYPFIKIGEELTS--GRTISKDAIGKMHNLTLHIWSDYDSS 78 (144) Q Consensus 1 m~~~~~~~~~S~~~aLQ~AI~~~L~a~~~~VyD~vP~~a~~PYV~lG~~~~~--~~d~t~~~~g~~~~l~i~VWs~~~G~ 78 (144) |++.- +.|+. +-+.+=|..- .+|=-+-+.+.+.||+.+-...-. ++.+| ..-.+++|++. .|. T Consensus 1 m~~~s---aP~~e----~~vv~WLsp~-~~va~~R~~~~PLPf~~V~Rv~G~d~~e~~t-----D~avvsv~~fg--~~~ 65 (134) T protein:vir:10 1 MATDS---APSIH----RVLVAWLSPL-GKVSTRRLSGDPLPHRVVRRVDGRDVPEEGS-----DVAVVSVHTFA--ASD 65 (134) T ss_pred CCccc---CCChh----eeeeeecccc-hhceeccCCCCCCCeEEEEEeCCCCCccccc-----ccceEEEEEee--CCH Confidence 55432 12221 1111112110 112223477889999998332211 11122 23447889987 677 Q ss_pred HHHHHHHHHHHH----HhcCCcc--ccCCCceEEEEEEeeeeeeecccccccCccEE---EEEEEEEeeccc Q lcl|NC_018086. 79 FEVKNLTDFLVG----LLINSPL--QLEEGFCIGKKELDHVRYTEAANGTYKNERAY---LFLDFEVIDSTI 141 (144) Q Consensus 79 ~eak~Ia~~V~~----aL~~~~L--~L~~g~~~~~~~~~~~r~~~d~dg~~~~~~~~---~~l~fri~t~~~ 141 (144) ..|+.+++.+-+ ++-+.+. ++.+|. -..+...+++..|...+....++ -+-||.+=++-+ T Consensus 66 eaA~d~ad~vHrRM~kL~~~~~~~~~~~gG~---~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~~y~ 134 (134) T protein:vir:10 66 EAAENEAELTHQRMLELVVNPLTEIPVGGGV---VARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGVQYI 134 (134) T ss_pred HHhhHHHHHHHHHHHHHhcccccceecCCce---EEEeehhhhhccceeeeeCCCceEEEEeeeeeeccccC Confidence 788888887744 4444444 344563 34677788888888765332222 145666555444 Done!