Query lcl|NC_021539.1_cdsid_YP_008126706.1 [gene=M615_gp10] [protein=hypothetical protein] [protein_id=YP_008126706.1] [location=6344..6766] Match_columns 140 No_of_seqs 12 out of 15 Neff 3.1 Searched_HMMs 1612 Date Thu Nov 7 17:07:08 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_10 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_10_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4096 Length: 140 # 100.0 5.6E-81 3.5E-84 460.5 12.2 140 1-140 1-140 (140) 2 protein:vir:1386 Length: 149 # 99.3 1.8E-14 1.1E-17 96.0 9.5 136 1-139 1-149 (149) 3 protein:vir:102875 Length: 146 99.0 1.4E-12 9E-16 85.5 8.6 134 1-137 1-146 (146) 4 protein:vir:102085 Length: 146 99.0 1.4E-12 9E-16 85.5 8.6 134 1-137 1-146 (146) 5 protein:vir:105007 Length: 146 99.0 1.4E-12 9E-16 85.5 8.6 134 1-137 1-146 (146) 6 protein:vir:107568 Length: 146 99.0 1.4E-12 9E-16 85.5 8.6 134 1-137 1-146 (146) 7 protein:vir:3873 Length: 128 # 98.8 4.7E-11 2.9E-14 77.2 8.6 124 5-134 1-128 (128) 8 protein:vir:5745 Length: 135 # 98.7 2.1E-10 1.3E-13 73.7 10.0 128 1-138 1-135 (135) 9 protein:vir:80362 Length: 140 98.5 1.4E-09 8.6E-13 69.1 9.1 125 2-140 1-139 (140) 10 protein:vir:100075 Length: 140 98.5 1.9E-09 1.2E-12 68.4 9.5 125 2-140 1-139 (140) 11 protein:vir:1273 Length: 127 # 98.4 4.4E-09 2.7E-12 66.4 8.5 122 2-134 1-127 (127) 12 protein:vir:1891 Length: 179 # 98.3 5.3E-09 3.3E-12 66.0 8.5 136 1-140 1-172 (179) 13 protein:vir:1437 Length: 140 # 98.3 9.8E-09 6.1E-12 64.5 9.5 125 2-140 1-139 (140) 14 protein:vir:4956 Length: 153 # 98.3 8.4E-09 5.2E-12 64.9 8.5 130 2-140 1-141 (153) 15 protein:vir:105089 Length: 133 98.3 7.8E-09 4.9E-12 65.0 8.3 125 2-136 1-133 (133) 16 protein:vir:100887 Length: 139 98.3 1.3E-08 8E-12 63.8 9.4 127 7-140 1-137 (139) 17 protein:vir:93617 Length: 148 98.3 1.4E-08 8.7E-12 63.6 9.3 135 1-139 1-148 (148) 18 protein:vir:4347 Length: 164 # 98.1 3.1E-08 1.9E-11 61.8 8.5 136 1-140 1-157 (164) 19 protein:vir:94538 Length: 125 98.1 4E-08 2.5E-11 61.2 8.5 125 1-136 1-125 (125) 20 protein:vir:100243 Length: 140 98.0 7.1E-08 4.4E-11 59.8 8.9 125 2-140 1-139 (140) 21 protein:vir:194 Length: 149 # 98.0 1.7E-07 1.1E-10 57.7 10.0 134 2-139 1-149 (149) 22 protein:vir:5000 Length: 141 # 98.0 1.1E-07 6.9E-11 58.7 8.8 129 2-140 1-140 (141) 23 protein:vir:81106 Length: 125 98.0 6.5E-08 4E-11 60.0 7.5 119 1-135 1-125 (125) 24 protein:vir:9414 Length: 125 # 98.0 6.5E-08 4E-11 60.0 7.5 119 1-135 1-125 (125) 25 protein:vir:79988 Length: 125 98.0 6.5E-08 4E-11 60.0 7.5 119 1-135 1-125 (125) 26 protein:vir:4704 Length: 125 # 98.0 6.5E-08 4E-11 60.0 7.5 119 1-135 1-125 (125) 27 protein:vir:98342 Length: 125 98.0 6.5E-08 4E-11 60.0 7.5 119 1-135 1-125 (125) 28 protein:vir:9708 Length: 125 # 97.9 1E-07 6.2E-11 59.0 8.2 120 8-135 1-125 (125) 29 protein:vir:100223 Length: 139 97.9 1.6E-07 1E-10 57.8 9.3 126 7-140 1-137 (139) 30 protein:vir:106570 Length: 182 97.8 3.1E-07 1.9E-10 56.2 8.6 123 1-139 1-182 (182) 31 protein:vir:102154 Length: 119 97.7 2.5E-07 1.5E-10 56.8 6.5 116 4-134 1-119 (119) 32 protein:vir:4859 Length: 140 # 97.6 1.1E-06 6.5E-10 53.4 8.6 129 2-140 1-140 (140) 33 protein:vir:4833 Length: 140 # 97.4 2.5E-06 1.5E-09 51.3 8.9 129 2-139 1-140 (140) 34 protein:vir:95789 Length: 114 97.3 2.1E-06 1.3E-09 51.7 6.9 114 5-134 1-114 (114) 35 protein:vir:79034 Length: 141 97.1 1.9E-05 1.2E-08 46.4 10.7 133 1-139 1-141 (141) 36 protein:vir:101594 Length: 173 97.0 1.1E-05 6.7E-09 47.8 8.4 118 7-136 1-173 (173) 37 protein:vir:9930 Length: 108 # 96.8 9.4E-06 5.8E-09 48.2 6.7 107 9-131 1-108 (108) 38 protein:vir:78858 Length: 115 96.6 1.9E-05 1.2E-08 46.4 7.1 112 7-130 1-115 (115) 39 protein:vir:97144 Length: 115 96.6 1.9E-05 1.2E-08 46.4 7.1 112 7-130 1-115 (115) 40 protein:vir:96225 Length: 115 96.6 1.9E-05 1.2E-08 46.4 7.1 112 7-130 1-115 (115) 41 protein:vir:96358 Length: 115 96.6 1.9E-05 1.2E-08 46.4 7.1 112 7-130 1-115 (115) 42 protein:vir:9312 Length: 115 # 96.6 1.9E-05 1.2E-08 46.4 7.1 112 7-130 1-115 (115) 43 protein:vir:103917 Length: 115 96.6 1.9E-05 1.2E-08 46.4 7.1 112 7-130 1-115 (115) 44 protein:vir:97088 Length: 157 96.6 3.2E-05 2E-08 45.3 7.9 129 1-140 1-156 (157) 45 protein:vir:106623 Length: 115 96.4 3E-05 1.9E-08 45.4 7.2 112 7-130 1-115 (115) 46 protein:vir:94108 Length: 149 96.4 2.3E-05 1.4E-08 46.1 6.2 109 1-126 12-149 (149) 47 protein:vir:105916 Length: 149 96.2 3.7E-05 2.3E-08 44.9 6.4 109 1-126 12-149 (149) 48 protein:vir:94796 Length: 137 96.0 6.3E-05 3.9E-08 43.6 6.9 110 2-126 1-137 (137) 49 protein:vir:3848 Length: 159 # 96.0 0.00015 9.1E-08 41.6 8.9 128 1-139 1-159 (159) 50 protein:vir:99744 Length: 115 95.8 9.5E-05 5.9E-08 42.6 6.9 108 7-130 1-115 (115) 51 protein:vir:105330 Length: 137 95.7 9.6E-05 6E-08 42.6 6.6 110 2-126 1-137 (137) 52 protein:vir:96829 Length: 135 95.5 0.00013 8.1E-08 41.9 6.8 110 2-126 1-135 (135) 53 protein:vir:94654 Length: 142 95.4 0.00017 1E-07 41.3 6.9 113 2-130 1-142 (142) 54 protein:vir:96121 Length: 137 95.3 0.0002 1.3E-07 40.8 7.0 108 2-126 1-137 (137) 55 protein:vir:107099 Length: 137 95.2 0.00022 1.3E-07 40.7 7.0 110 2-126 1-137 (137) 56 protein:vir:2740 Length: 114 # 95.1 0.00019 1.2E-07 41.0 6.3 110 1-131 1-114 (114) 57 protein:vir:4906 Length: 114 # 95.1 0.00019 1.2E-07 41.0 6.3 110 1-131 1-114 (114) 58 protein:vir:5978 Length: 144 # 95.1 0.00035 2.2E-07 39.5 7.7 115 2-138 1-144 (144) 59 protein:vir:743 Length: 108 # 95.0 0.00027 1.7E-07 40.1 6.9 106 7-130 1-108 (108) 60 protein:vir:3617 Length: 112 # 95.0 0.0003 1.8E-07 39.9 7.1 110 1-130 1-112 (112) 61 protein:vir:102963 Length: 163 94.8 0.0014 8.9E-07 36.2 10.4 135 1-140 1-161 (163) 62 protein:vir:95894 Length: 137 94.6 0.00045 2.8E-07 38.9 7.0 110 2-126 1-137 (137) 63 protein:vir:105467 Length: 144 94.5 0.0014 8.7E-07 36.2 9.5 122 2-140 1-139 (144) 64 protein:vir:97427 Length: 137 93.5 0.0012 7.4E-07 36.6 7.3 110 2-126 1-137 (137) 65 protein:vir:94490 Length: 137 93.5 0.0012 7.4E-07 36.6 7.3 110 2-126 1-137 (137) 66 protein:vir:93738 Length: 137 93.5 0.0012 7.4E-07 36.6 7.3 110 2-126 1-137 (137) 67 protein:vir:98409 Length: 108 93.1 0.0014 8.9E-07 36.2 7.0 106 7-130 1-108 (108) 68 protein:vir:6246 Length: 143 # 90.7 0.019 1.2E-05 30.0 10.6 138 1-139 1-143 (143) 69 protein:vir:1332 Length: 143 # 90.7 0.02 1.2E-05 29.9 10.6 138 1-139 1-143 (143) 70 protein:vir:96486 Length: 112 90.0 0.0052 3.2E-06 33.1 6.7 110 2-129 1-112 (112) 71 protein:vir:9513 Length: 134 # 86.4 0.0099 6.1E-06 31.6 5.8 123 5-132 1-134 (134) 72 protein:vir:101302 Length: 134 86.4 0.0099 6.1E-06 31.6 5.8 123 5-132 1-134 (134) 73 protein:vir:100652 Length: 134 85.9 0.012 7.2E-06 31.2 6.0 123 5-132 1-134 (134) 74 protein:vir:78077 Length: 141 85.8 0.023 1.4E-05 29.6 7.5 116 2-140 1-140 (141) 75 protein:vir:9647 Length: 132 # 79.6 0.072 4.5E-05 26.8 7.7 125 1-139 1-132 (132) 76 protein:vir:99546 Length: 200 78.4 0.012 7.2E-06 31.2 3.0 123 1-140 7-145 (200) 77 protein:vir:99528 Length: 92 # 78.2 0.018 1.1E-05 30.1 4.0 87 2-112 1-92 (92) 78 protein:vir:98636 Length: 138 76.3 0.073 4.5E-05 26.8 6.7 125 1-139 7-138 (138) 79 protein:vir:8669 Length: 142 # 59.8 0.083 5.1E-05 26.5 3.4 118 2-127 1-142 (142) 80 protein:vir:99101 Length: 142 59.8 0.083 5.1E-05 26.5 3.4 118 2-127 1-142 (142) 81 protein:vir:80970 Length: 112 52.9 0.54 0.00034 22.0 7.4 108 1-137 1-112 (112) 82 protein:vir:1243 Length: 116 # 52.0 0.25 0.00015 23.9 4.7 89 26-126 1-116 (116) 83 protein:vir:97327 Length: 116 52.0 0.25 0.00015 23.9 4.7 89 26-126 1-116 (116) 84 protein:vir:99833 Length: 190 51.7 0.34 0.00021 23.2 5.3 132 1-137 1-190 (190) 85 protein:vir:45 Length: 112 # N 51.2 0.59 0.00036 21.8 7.5 108 1-137 1-112 (112) 86 protein:vir:81147 Length: 126 50.9 0.6 0.00037 21.8 9.8 115 2-140 1-125 (126) 87 protein:vir:78335 Length: 133 40.3 0.98 0.00061 20.6 6.2 117 5-129 1-133 (133) 88 protein:vir:80037 Length: 199 36.0 0.34 0.00021 23.2 2.8 120 3-140 1-141 (199) 89 protein:vir:95062 Length: 116 35.0 0.74 0.00046 21.3 4.5 87 26-126 1-116 (116) 90 protein:vir:6375 Length: 205 # 34.7 1.3 0.00079 20.0 9.3 135 3-138 1-205 (205) 91 protein:vir:99196 Length: 155 34.3 0.95 0.00059 20.7 5.0 126 1-133 1-155 (155) 92 protein:vir:1988 Length: 156 # 33.1 0.89 0.00055 20.9 4.6 127 1-135 1-156 (156) 93 protein:vir:966 Length: 123 # 30.8 1.5 0.00096 19.5 7.3 118 1-135 1-123 (123) 94 protein:vir:93898 Length: 133 29.4 1.4 0.00084 19.9 4.9 115 5-131 1-133 (133) 95 protein:vir:94419 Length: 133 27.0 1.7 0.001 19.4 5.0 115 5-131 1-133 (133) 96 protein:vir:9363 Length: 133 # 27.0 1.7 0.001 19.4 5.0 115 5-131 1-133 (133) 97 protein:vir:78644 Length: 133 27.0 1.7 0.001 19.4 5.0 115 5-131 1-133 (133) 98 protein:vir:96973 Length: 133 27.0 1.7 0.001 19.4 5.0 115 5-131 1-133 (133) 99 protein:vir:79225 Length: 155 25.3 2.1 0.0013 18.8 6.2 126 1-133 1-155 (155) 100 protein:vir:102338 Length: 116 21.6 2.6 0.0016 18.3 5.7 103 22-134 1-116 (116) 101 protein:vir:79091 Length: 175 21.5 2.6 0.0016 18.3 6.7 131 1-136 1-175 (175) No 1 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=100.00 E-value=5.6e-81 Score=460.53 Aligned_cols=140 Identities=79% Similarity=1.234 Sum_probs=139.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLG 80 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLg 80 (140) |||+||||||++|+|+++|+|||++||++||++|||+|+|+|+++|+++||||++|||++|||+|||+|+||+++|+||| T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLg 80 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLG 80 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 81 FELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 81 f~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) |+|+||+||+||||||+|||+||+++|+||||||++++|+|+|+||++||+|||+||||- T Consensus 81 f~i~~k~kf~YLvfPD~G~G~sn~~~q~FmerGl~~~t~~i~E~L~~~l~k~in~~Lgg~ 140 (140) T protein:vir:40 81 FELLTKPKFNYLIFPDQGIGKHNKTKQDFMQLGVEESSQEIVEMLEQAVFKEINDTLGGK 140 (140) T ss_pred eeEeecCcccccccccccCCCCCcchHHHHHhccccchhHHHHHHHHHHHHHHHHhhcCC Confidence 999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.29 E-value=1.8e-14 Score=95.97 Aligned_cols=136 Identities=17% Similarity=0.229 Sum_probs=110.9 Q ss_pred CCccccccHHHHHHHHHHHHhcc--hHHHHHHHHHHhhhhhHHHHHhhhhcCCccccch----hhhcccchhhhhhhhh- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIP--NKSEEIINKTLETKAVPLAKQNIEKRINLSKNWK----GQLLNKNHAQTAGPFV- 73 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP--~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k----~~~rnK~HAk~s~pl~- 73 (140) |+..|++++..+++|+.+|++|. .+.+++.+++| .+|+.++.+.+...+|+|+... .+.+...|++++-+.. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al-~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~ 79 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSIL-KECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPK 79 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHH-HHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceecc Confidence 88899999999999999999995 47788999999 7899999999999999985322 2234455887765543 Q ss_pred --hchhccceEEeec----CcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 74 --AKMSNLGFELVSK----PKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 74 --~~~~NLgf~i~~k----~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) .+..+...+|-+. +.+-|.-|++ .||++-.||.||+...+...+++++...++|.++|.+.||- T Consensus 80 ~~~~~g~~~~~VG~~~~~~~~~~y~~f~E--~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 80 IRKKKGNLQCVVGWEKSDNTPFYYMKMEE--WGTSERPPHHAFGKTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred cccccceeEEEeeccCCCCCccceeeeec--cCccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 2344444555543 3455999998 46999999999999999999999999999999999999999 No 3 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.03 E-value=1.4e-12 Score=85.50 Aligned_cols=134 Identities=19% Similarity=0.258 Sum_probs=105.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccc-----hhhhcccchhhhhhhhhhc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNW-----KGQLLNKNHAQTAGPFVAK 75 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~-----k~~~rnK~HAk~s~pl~~~ 75 (140) |+..|++++..+++|++.+++||.+.+++.+++| ..|+..+.+.+...+|++.-. ++..+.-.|+++.-.+... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 8889999999999999999999999999999998 679999999999999998421 1222333566654433221 Q ss_pred h-------hccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 76 M-------SNLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 76 ~-------~NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) + ...||.-.....+-|..|+.. ||++-.||.||+..++...+.+++.+.+++.++|.+.| T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~--GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEW--GTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeecc--CCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 1 123333322445668999885 69999999999999999999999999999999999999 No 4 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.03 E-value=1.4e-12 Score=85.50 Aligned_cols=134 Identities=19% Similarity=0.258 Sum_probs=105.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccc-----hhhhcccchhhhhhhhhhc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNW-----KGQLLNKNHAQTAGPFVAK 75 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~-----k~~~rnK~HAk~s~pl~~~ 75 (140) |+..|++++..+++|++.+++||.+.+++.+++| ..|+..+.+.+...+|++.-. ++..+.-.|+++.-.+... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 8889999999999999999999999999999998 679999999999999998421 1222333566654433221 Q ss_pred h-------hccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 76 M-------SNLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 76 ~-------~NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) + ...||.-.....+-|..|+.. ||++-.||.||+..++...+.+++.+.+++.++|.+.| T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~--GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEW--GTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeecc--CCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 1 123333322445668999885 69999999999999999999999999999999999999 No 5 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.03 E-value=1.4e-12 Score=85.50 Aligned_cols=134 Identities=19% Similarity=0.258 Sum_probs=105.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccc-----hhhhcccchhhhhhhhhhc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNW-----KGQLLNKNHAQTAGPFVAK 75 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~-----k~~~rnK~HAk~s~pl~~~ 75 (140) |+..|++++..+++|++.+++||.+.+++.+++| ..|+..+.+.+...+|++.-. ++..+.-.|+++.-.+... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 8889999999999999999999999999999998 679999999999999998421 1222333566654433221 Q ss_pred h-------hccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 76 M-------SNLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 76 ~-------~NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) + ...||.-.....+-|..|+.. ||++-.||.||+..++...+.+++.+.+++.++|.+.| T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~--GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEW--GTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeecc--CCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 1 123333322445668999885 69999999999999999999999999999999999999 No 6 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.03 E-value=1.4e-12 Score=85.50 Aligned_cols=134 Identities=19% Similarity=0.258 Sum_probs=105.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccc-----hhhhcccchhhhhhhhhhc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNW-----KGQLLNKNHAQTAGPFVAK 75 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~-----k~~~rnK~HAk~s~pl~~~ 75 (140) |+..|++++..+++|++.+++||.+.+++.+++| ..|+..+.+.+...+|++.-. ++..+.-.|+++.-.+... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 8889999999999999999999999999999998 679999999999999998421 1222333566654433221 Q ss_pred h-------hccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 76 M-------SNLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 76 ~-------~NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) + ...||.-.....+-|..|+.. ||++-.||.||+..++...+.+++.+.+++.++|.+.| T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~--GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEW--GTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeecc--CCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 1 123333322445668999885 69999999999999999999999999999999999999 No 7 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=98.79 E-value=4.7e-11 Score=77.21 Aligned_cols=124 Identities=10% Similarity=0.030 Sum_probs=100.7 Q ss_pred ccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hchhccce Q lcl|NC_021539. 5 WSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AKMSNLGF 81 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~~~NLgf 81 (140) +|+++..+++|++++++++.+++++.+++|.. |+..+.+.+....|++. +..|...|.+++=... .+...... T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~-ga~~~~~~~k~~ap~~~---~~~~~~~h~~d~I~~~~~k~~~g~~~~ 76 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRD-GAQKFADKLKSNTPEWD---GETDMSGHLRDDIKLSSVRETSGLTEV 76 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHhCCCcC---CCCcccchhhhhhccccccccCceeEE Confidence 99999999999999999999999999888765 88888899999999984 3345567888775443 33444445 Q ss_pred EEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_021539. 82 ELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEIN 134 (140) Q Consensus 82 ~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in 134 (140) +|-+ ++..-|--|++-| |++-.||.||+...+...+++++.+.++|.+.|= T Consensus 77 ~VG~~k~~~~y~~f~E~G--T~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 77 DVGYGKDTGWRAHFPNSG--TSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred EeeecCCCceEEeeeccC--ccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 5544 4556688898876 8998999999999999999999888888888776 No 8 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=98.71 E-value=2.1e-10 Score=73.69 Aligned_cols=128 Identities=9% Similarity=0.208 Sum_probs=96.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hch Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AKM 76 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~~ 76 (140) ||.++ ++..+++|++.+++||.+. +++.+++| .+|+..+.+.+....|+++-+ ...|.+.|=... .+. T Consensus 1 M~~~~--~i~Gl~el~~~l~~L~~~~~~k~~~~Al-~~~a~~v~~~~k~~ap~~~~~-----~~g~l~~~I~i~~~k~~~ 72 (135) T protein:vir:57 1 MIPEI--EISGLQELERRLIAVGEEVGTKILRDAG-RAAMAVVEADMKQNAGYDNSS-----TNAHMRDSIKIRSSRGKA 72 (135) T ss_pred Cceee--eehhHHHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCCCCC-----chhhHHhhcccccccccc Confidence 87655 5789999999999999987 57778888 678888889999999998532 234666654333 333 Q ss_pred hccceEEe--e-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021539. 77 SNLGFELV--S-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILG 138 (140) Q Consensus 77 ~NLgf~i~--~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lg 138 (140) .+-+.++. + ++-|-|-.|+. .|||+-.||.||+..++...+.+++.+.++|.++|.+..= T Consensus 73 ~~~~v~v~vg~~~~~~~~~~f~E--~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 73 GSTVVVLRVGPTRSHYMKALAQE--FGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred cceeEEEEecCCCCcceeEeecc--cCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHhcC Confidence 34444443 3 55566788887 5699999999999999999998888888887777777655 No 9 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=98.50 E-value=1.4e-09 Score=69.14 Aligned_cols=125 Identities=14% Similarity=0.259 Sum_probs=96.3 Q ss_pred CccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhh-------h Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPF-------V 73 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl-------~ 73 (140) |+ ++++..+++|.+.+++++.++ ++++.++|.. |+..+.+.+....|++.- |.++|=.. . T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~-~a~~v~~~ak~~aP~~tG---------~l~~~i~~~~~~~~~~ 68 (140) T protein:vir:80 1 MS--SIQIVGLADLLADFERLAKSQSTKALRRATVA-GAKVIRDEARKRAPKKTG---------KLRRNIVSAALRQKDA 68 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCCCcc---------hhhhceeeeccccccc Confidence 55 577789999999999998775 5677777765 788889999999999842 22222111 1 Q ss_pred hchhccceEEee------cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 74 AKMSNLGFELVS------KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 74 ~~~~NLgf~i~~------k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ....-.|+.+.+ ++.+-|-.|.. .||++-.+|.||...++...+++++.+.+++-++|.+.|||- T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E--~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:80 69 PGLATAGVRVRTKGKADSPSNAFYWRFDE--FGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGR 139 (140) T ss_pred cceeeeeeecccccccCCCCCcceeeeec--cCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 112234454443 34466888876 679999999999999999999999999999999999999999 No 10 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=98.50 E-value=1.9e-09 Score=68.45 Aligned_cols=125 Identities=14% Similarity=0.261 Sum_probs=97.1 Q ss_pred CccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh------- Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV------- 73 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~------- 73 (140) |++ +++..+++|++.++.|+.++ ++++.++|.. |+..+.+.+....|++.- |.++|=... T Consensus 1 Ma~--~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~-~a~~v~~~ak~~aP~~tG---------~l~~sI~~~~~~~~~~ 68 (140) T protein:vir:10 1 MSS--IQIIGLADLRADFEKLAKSQSTKALRRATVA-GAKVIRDEARKRAPKKTG---------KLRRNIVSAALRQKDA 68 (140) T ss_pred Cce--eeehhHHHHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCCChh---------hHHHhccccccccccc Confidence 554 56779999999999999876 5688888765 778888889999999832 444432221 Q ss_pred hchhccceEEee------cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 74 AKMSNLGFELVS------KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 74 ~~~~NLgf~i~~------k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ....-.|+.+.. ++.+-|..|.. .||++-.+|.||...++...+++++.+.+++.++|.+.+||- T Consensus 69 ~~~~~~g~~~~~~~~~~~~~~~~y~~f~E--~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:10 69 PGLATAGVRVRTKGKADSPNNAFYWRFDE--FGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred cceEEeeeeeccccccCCCCccceeeeec--cCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 122334554443 34566888888 568998999999999999999999999999999999999999 No 11 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.36 E-value=4.4e-09 Score=66.42 Aligned_cols=122 Identities=20% Similarity=0.177 Sum_probs=96.7 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhch----h Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKM----S 77 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~----~ 77 (140) |++ +++..+++|++.++++|.+.+++++++|. +|+..+.+.+....|++.++.| |.+++=.....+ . T Consensus 1 M~~--~~i~Gl~el~~~l~~l~~~~~~~~~~al~-~~a~~v~~~~k~~ap~~~~~tg------~l~~~I~~~~~k~~~~g 71 (127) T protein:vir:12 1 MAD--MSFDGIDDLTQYFEKIGGDIEKVEPVALK-AGGEIIAERQRSHVNRSDKKQP------HMQDNITVSNVRESKDG 71 (127) T ss_pred Cee--eeehhHHHHHHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHhCCCCCCChh------HHHHhhhccccccccCc Confidence 444 67789999999999999999999888886 4677788899999999876544 777776554322 2 Q ss_pred ccceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEIN 134 (140) Q Consensus 78 NLgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in 134 (140) ....+|-+ ++..-|--|+.. ||++-.||.||+...+...+.+++.+.+.+.++|. T Consensus 72 ~~~v~Vg~~~~~~~y~~f~E~--GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 72 VRFVAVGPNKKVAYRGRFLEW--GTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred eeEEEEeeCCCCcceeeeecc--CccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 22333444 445667777765 58988999999999999999999999999999999 No 12 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.34 E-value=5.3e-09 Score=65.96 Aligned_cols=136 Identities=10% Similarity=0.140 Sum_probs=93.1 Q ss_pred CCccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccch--hhh-cccchhhhhhhhhhch Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWK--GQL-LNKNHAQTAGPFVAKM 76 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k--~~~-rnK~HAk~s~pl~~~~ 76 (140) |+..+++++..+++|+.+|++||.++ .++++++|+.-+ ..+.+.+..+-|+++.-. +.+ .+-.... +.....+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa-~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~-~~~~~~~~ 78 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAA-NIIRDRARSNASRVDDPLTKEAIHKNIVASF-SSKQFRRT 78 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCccccccchhhhhhheeecc-cccccccc Confidence 88899999999999999999999887 568899997654 777788888888764210 000 0000000 11111111 Q ss_pred hccceE--------------------------------EeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHH Q lcl|NC_021539. 77 SNLGFE--------------------------------LVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEM 124 (140) Q Consensus 77 ~NLgf~--------------------------------i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~ 124 (140) .++.+. -.+.+..-|.-|... |||+-.||-||.-.++...+++++. T Consensus 79 g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEf--GT~kmpa~PFlrPA~~~~~~~a~~~ 156 (179) T protein:vir:18 79 GDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEF--GTEHTSARPILRPAMNGVDNDVINV 156 (179) T ss_pred cceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEecc--CCCCCCCCccchhhHHhhHHHHHHH Confidence 222111 112334557777775 7999999999999999999988888 Q ss_pred HHHHHHHHHHHHhcCC Q lcl|NC_021539. 125 LEEDVLKEINNILGGN 140 (140) Q Consensus 125 L~~~l~k~in~~lgg~ 140 (140) +.+.+-++|.+.|--. T Consensus 157 i~~~l~~~i~k~lk~~ 172 (179) T protein:vir:18 157 FSTEMGKAIDRAIRLA 172 (179) T ss_pred HHHHHHHHHHHHHHhh Confidence 8888888888888655 No 13 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=98.32 E-value=9.8e-09 Score=64.50 Aligned_cols=125 Identities=14% Similarity=0.245 Sum_probs=96.3 Q ss_pred CccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh------- Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV------- 73 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~------- 73 (140) |++ +++..+++|.+.++++|.++ +++..++|. +|+..+.+.+....|++. + |.+.|=... T Consensus 1 M~~--~~i~Gld~l~~~l~~l~~~~~~~~~~~al~-~~a~~v~~~ak~~aP~~t-G--------~l~~sI~~~~~~~~~~ 68 (140) T protein:vir:14 1 MSS--IQIIGLADLRADFEKLAKSQSAKALRRATL-AGAKVIRDEARKRAPKKT-G--------KLRRNIVSAALRQKDA 68 (140) T ss_pred Cce--eeehhHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCCh-h--------hHHhhccccccccccc Confidence 554 66779999999999999876 567777776 566788888999999883 2 444442221 Q ss_pred hchhccceEEee------cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 74 AKMSNLGFELVS------KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 74 ~~~~NLgf~i~~------k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ....-.|++... .+.+-|..|.. .||++-.+|.||...++...+++++.+.+++.++|.+.+||- T Consensus 69 ~~~~~vg~~~~~~~~~~~~~~~~y~~f~E--~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:14 69 PGLATAGVRVRTKGKADSPNNAFYWRFDE--FGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred ceeEEeeeeeccccccCCCCccceeeeec--cccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 123344555444 34566888876 679999999999999999999999999999999999999999 No 14 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=98.29 E-value=8.4e-09 Score=64.86 Aligned_cols=130 Identities=12% Similarity=0.077 Sum_probs=100.6 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc------ Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK------ 75 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~------ 75 (140) |+.|+ ..++.|.+.++++-...++.=++++. .|+.+..+.+..--|.|.-...+-.+-+|.+++=..+.. T Consensus 1 M~~~~---~glee~~~~lekL~~~~~~~~~katk-AGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~ 76 (153) T protein:vir:49 1 MTGLD---EALEGWLKTVASIGDLTPAEQAKITT-AGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRK 76 (153) T ss_pred CccHH---HHHHHHHHHHHHhccCCHHHHHHHHH-HHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccc Confidence 55544 45899999999998888888888887 899999999998888875333344456799998776522 Q ss_pred --hhccceEEeecCcccce-eccccCCCCCCCchHHHHHhchhhh--hHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 76 --MSNLGFELVSKPKFNYL-IFPDQGVGKNNKTKQDFMLLGLEES--TAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 76 --~~NLgf~i~~k~kf~YL-vFPd~GiG~sn~~~q~FmerGl~~~--~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ..+.||. +...+|. -||+.| |++-.+++|.++-.+++ .+.|+++-.+++.+.|++.+|=- T Consensus 77 dG~s~VG~~---~~~~a~~a~f~n~G--T~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~~ 141 (153) T protein:vir:49 77 NGVSTVGWK---NNYHAQNARRLNDG--TKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGVY 141 (153) T ss_pred cceeeeccc---CCccceeeeecccC--cccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCee Confidence 3356665 3344565 999987 88888999999999886 36789999999999999987744 No 15 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=98.28 E-value=7.8e-09 Score=65.02 Aligned_cols=125 Identities=10% Similarity=0.242 Sum_probs=95.5 Q ss_pred CccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh--hc--- Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV--AK--- 75 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~--~~--- 75 (140) |+ ++++..+++|+..+++|+.++ +++.+++|. +|+..+.+.+....|+++. . -+.|.+.+=... .+ T Consensus 1 M~--~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~-~~a~~i~~~ak~~ap~~~~-~----~~~~~~~~I~v~~~~~~~~ 72 (133) T protein:vir:10 1 MI--RMEVKGLDELERQLTALGEKVATKVLRDAGR-EALKVVEEDMKQHAGFDET-S----TGQHMRDSIKIRSSTRKAQ 72 (133) T ss_pred Ce--eEeeehHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCCCC-c----chhhhhhcccccccccccC Confidence 44 456789999999999999886 567788875 5667777999999999942 2 245766554332 11 Q ss_pred -hhccceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 76 -MSNLGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNI 136 (140) Q Consensus 76 -~~NLgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~ 136 (140) -.-+.+++-+ ++.|-|..|+. .|||+-.||.||+..++...+.+++.+.++|.++|++. T Consensus 73 ~~~~~~v~vg~~~~~~~y~~f~E--~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 73 GNAVVTLRVGPSKQHHMKVLAQE--FGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred ccceEEEEecCCCCccceEeeec--cCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 1223344444 55667999998 46999999999999999999999999999999999999 No 16 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=98.28 E-value=1.3e-08 Score=63.83 Aligned_cols=127 Identities=13% Similarity=0.111 Sum_probs=99.2 Q ss_pred ccHH-HHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhh-cccchhhhhhhhhhch---h---- Q lcl|NC_021539. 7 VDFA-DVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQL-LNKNHAQTAGPFVAKM---S---- 77 (140) Q Consensus 7 ld~s-~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~-rnK~HAk~s~pl~~~~---~---- 77 (140) ||++ .+|.++++++++=+...+.-+++ -+.|+.++.+.+..-.|-|....+.. ..-+|++++=.+++.. . T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki-~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~ 79 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKI-TKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGS 79 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHH-HHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCccccccccee Confidence 8888 78888888888866666655554 46799999999999999775432221 2337999987776421 1 Q ss_pred -ccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 78 -NLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 78 -NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) -.||. | +|-+--||+.| |++-.||+|+++.+..+.+.+++...+++-+.|++..||- T Consensus 80 ~~VG~~---k-~~~~A~f~n~G--T~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~~~~ 137 (139) T protein:vir:10 80 STVGFH---N-KAHIARFLNDG--TKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGGG 137 (139) T ss_pred eeeCCC---C-CcceEeecccC--ccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 15664 3 35455899986 8999999999999999999999999999999999999999 No 17 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.26 E-value=1.4e-08 Score=63.62 Aligned_cols=135 Identities=10% Similarity=0.176 Sum_probs=88.0 Q ss_pred CCccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccc-------hhhhcccchhhhhhhh Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNW-------KGQLLNKNHAQTAGPF 72 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~-------k~~~rnK~HAk~s~pl 72 (140) || +.++++..+++|+..+++||.+. +++..++|+ +|+..+.+.+....|++.-. ....+...|....-.. T Consensus 1 mm-~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~-~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~ 78 (148) T protein:vir:93 1 MI-ETLLDFSGLEDISRDLQLLSGAENNRVLREATR-AGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHI 78 (148) T ss_pred Cc-ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeee Confidence 44 58999999999999999999775 566677775 56788889999999997320 0000001111111111 Q ss_pred h---hchhccce--EEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 73 V---AKMSNLGF--ELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 73 ~---~~~~NLgf--~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) . ....+.+- .-..++..-|-.|.. .||++-.||.||+..++...+.+++.+.+.+.++|.+.|.= T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E--~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 79 RGVNPDTGNSDNTMKADNPRNAFYWRFVE--MGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred cccccccccccceeecCCCCCcceeeeec--cCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 0 01111111 112245556777876 56999999999999999988888777777777777766666 No 18 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.13 E-value=3.1e-08 Score=61.76 Aligned_cols=136 Identities=15% Similarity=0.154 Sum_probs=94.0 Q ss_pred CCccccccHHHHHHHHHHHHhcchHH-HHHHHHHHhhhhhHHHHHhhhhcCCccccchh---hhcccchhhhhhhhhhch Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKS-EEIINKTLETKAVPLAKQNIEKRINLSKNWKG---QLLNKNHAQTAGPFVAKM 76 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~s-E~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~---~~rnK~HAk~s~pl~~~~ 76 (140) |+.+.++++..+++|++.+++||.+. +++++++|.. |+.++.+.+....|.++.... ...+..+.. +....... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~-aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~-~~~~~~~~ 78 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRK-AAMIVVQAAKQGAEKVDDPGTGRSISDNIALRW-NGRLFKRT 78 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcccCCCccchhhhhhhhhc-ccCccccc Confidence 99899999999999999999999886 5788999875 668888999999998753111 011111110 11112223 Q ss_pred hccceEEee-----------------cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 77 SNLGFELVS-----------------KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 77 ~NLgf~i~~-----------------k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) ..+.+.+-. .+..-|--|... ||++-.||-||+-.++...+++++.+.+.|.++|.+.|.= T Consensus 79 ~~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~Ef--GT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka~~k 156 (164) T protein:vir:43 79 GDLGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEF--GTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRAIKR 156 (164) T ss_pred cceeEEecccccccccccccccccCCCCCcceEEEeec--CCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 344444432 122336666664 6999999999999999999999888887777777777632 Q ss_pred C Q lcl|NC_021539. 140 N 140 (140) Q Consensus 140 ~ 140 (140) - T Consensus 157 ~ 157 (164) T protein:vir:43 157 A 157 (164) T ss_pred H Confidence 2 No 19 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=98.09 E-value=4e-08 Score=61.16 Aligned_cols=125 Identities=14% Similarity=0.172 Sum_probs=93.7 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLG 80 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLg 80 (140) |++.|++++..+++|.+.+++++..+.+.+.+.|.+-+- ...+.+...-|++ + |-+|+-=.. .+.+.+-.-+. T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~-~i~~~ak~~ap~~-t--G~L~~sI~~---~~~~~~~~~~~ 73 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLS-RAVEKSKGLARVD-T--GYMRNNIQQ---DEVKEEHGVVT 73 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHhhCCCC-C--hhhhhhcee---cceeccCCcEE Confidence 999999999999999999999999999999999987554 5566788888987 3 222211100 11222233455 Q ss_pred eEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 81 FELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNI 136 (140) Q Consensus 81 f~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~ 136 (140) .+|.+... |-.|.- .||++-.+|-||...++...+.+...|.++|.++|--- T Consensus 74 ~~v~~~~~--Ya~~vE--fGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 74 GRYVARAD--YSSYNE--YGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred EEeeCCCC--ccceee--cccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 55666554 555554 45888889999999999999999999999999988887 No 20 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=98.04 E-value=7.1e-08 Score=59.78 Aligned_cols=125 Identities=12% Similarity=0.160 Sum_probs=91.8 Q ss_pred CccccccHHHHHHHHHHHHhcchHHH-HHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhh-----hhc Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSE-EIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPF-----VAK 75 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE-~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl-----~~~ 75 (140) |+ ++++..+++|.+.+++++.+++ +++.++|.+ |+..+.+.+....|++. + |.+.|=.. ... T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~-~a~~v~~~ak~~ap~~t-G--------~l~~sI~~~~~~~~~~ 68 (140) T protein:vir:10 1 MS--SVQILGLADLQADFLKLAKAQSTKALRRATVA-GANVIRDEARARAPKKT-G--------KLKRNIVTAALKQKDS 68 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCCCh-h--------hHHHhceecccccccc Confidence 55 4667799999999999998875 577778765 88999999999999973 2 22222111 111 Q ss_pred hhccceEEe--e------cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 76 MSNLGFELV--S------KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 76 ~~NLgf~i~--~------k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ..++-..+. + ++..-|--|... ||++-.+|.||...++...+++++.+.+++.++|.+.++|- T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~--GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:10 69 PGIATAGVRVRTKGKADSPNNAFYWRFVEL--GTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGG 139 (140) T ss_pred cceeEEeeccccccccCCCCcccccceecc--CcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 122222222 1 223346667664 79999999999999999999999999999999999999998 No 21 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.98 E-value=1.7e-07 Score=57.66 Aligned_cols=134 Identities=11% Similarity=0.195 Sum_probs=87.8 Q ss_pred CccccccHHHHHHHHHHHHhcchHHH-HHHHHHHhhhhhHHHHHhhhhcCCccccchh--hh-----cccc--hhhhhhh Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSE-EIINKTLETKAVPLAKQNIEKRINLSKNWKG--QL-----LNKN--HAQTAGP 71 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE-~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~--~~-----rnK~--HAk~s~p 71 (140) |..+++++..+++|+..+++++.++. ++.+++|.+ |+..+.+.+....|++. ++- .. +.+. |....-. T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~-~a~~i~~~ak~~aP~~~-g~l~~si~~~~~~~~~~~~~~~~v~ 78 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRA-GAEVLKEEVIDRAPVRT-GKLKKNVVVVTQKSRRRGEISSGVH 78 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHHhhCCCCc-hhhhhhccccccccccccceeeccc Confidence 55599999999999999999998864 688888875 67788888999999863 210 00 0011 1111111 Q ss_pred h---hhchhccceE--EeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 72 F---VAKMSNLGFE--LVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 72 l---~~~~~NLgf~--i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) . .......+-. ...+...-|-.|.. .||++-.||.||+..++...+++++.+.+.|.++|.+.++= T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E--~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVE--LGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccccccccccccceeecCCCCccceeeeec--cCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 1 0111111112 22234455777877 45888889999999999888877777777666666666666 No 22 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=97.97 E-value=1.1e-07 Score=58.69 Aligned_cols=129 Identities=9% Similarity=0.085 Sum_probs=99.4 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc------ Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK------ 75 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~------ 75 (140) |++|+ ..++.|.+.++++-...++.=++++. .|+.++.+.+..--|.|.-.-.+-++.+|.+++=-.+.. T Consensus 1 M~~~~---~gl~e~~~~lekl~~~~~~~~~katk-AGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~ 76 (141) T protein:vir:50 1 MVGLA---EALDEWLKTVASIGNLTPAEQVEITT-AGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRK 76 (141) T ss_pred CccHH---HHHHHHHHHHHHhcCCCHHHHHHHHH-HHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCcccccc Confidence 66655 55899999999998788888888887 899999999998888775333344567898887655422 Q ss_pred --hhccceEEeecCcccce-eccccCCCCCCCchHHHHHhchhhh--hHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 76 --MSNLGFELVSKPKFNYL-IFPDQGVGKNNKTKQDFMLLGLEES--TAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 76 --~~NLgf~i~~k~kf~YL-vFPd~GiG~sn~~~q~FmerGl~~~--~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ..+.||. +.+-+|+ -||+.| |++-.+|+|.++-.+++ .+.|+++-.+.+-++|++. ||. T Consensus 77 dg~s~VG~~---~~~~~~~A~f~n~G--T~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~-~~~ 140 (141) T protein:vir:50 77 NGVSTVGWK---NNYHAQNARRLNDG--TKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEEK-EGC 140 (141) T ss_pred CCeeeeccC---CCccceeeeccccC--ccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhc-cCC Confidence 4467773 4444666 999987 88889999999999876 4678999888888888875 555 No 23 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.97 E-value=6.5e-08 Score=59.99 Aligned_cols=119 Identities=10% Similarity=0.064 Sum_probs=83.6 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhc-- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSN-- 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~N-- 78 (140) |... +|++.+++..++| -.+++++-|++| ..|+.++.+.+....|.|+ .++|.+++=.+.+.+.+ T Consensus 1 M~v~--v~~~~L~~~l~~l---~~~~~k~~~~Al-~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~ 67 (125) T protein:vir:81 1 MGAR--IESNNIEQGLKNA---VLKMNLNSNVIV-KAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRH 67 (125) T ss_pred CeeE--eeHHHHHHHHHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccc Confidence 6554 4455555555554 456666667776 6899999999999999985 35788887555432221 Q ss_pred ---cceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 79 ---LGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 79 ---Lgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) ...++=| |...=|.-|++. ||++-.||.||++.++...++++..+.++| +.|++ T Consensus 68 ~g~~~v~VG~~k~~~~~a~F~E~--GT~k~~a~pF~~~a~~~~~~ev~~~~~~~l-rk~~k 125 (125) T protein:vir:81 68 TSEKIVTIGYAKGVSHRIHATEF--GTMYQKPQLFITKTEKQGKNKVLKTMLDTA-KRLQK 125 (125) T ss_pred cceEEEEeccCCCCceEEEeccC--CccCCCCCchhhHHHHHhHHHHHHHHHHHH-HHHhC Confidence 1223333 444456779986 699999999999999999999998888887 55555 No 24 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.97 E-value=6.5e-08 Score=59.99 Aligned_cols=119 Identities=10% Similarity=0.064 Sum_probs=83.6 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhc-- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSN-- 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~N-- 78 (140) |... +|++.+++..++| -.+++++-|++| ..|+.++.+.+....|.|+ .++|.+++=.+.+.+.+ T Consensus 1 M~v~--v~~~~L~~~l~~l---~~~~~k~~~~Al-~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~ 67 (125) T protein:vir:94 1 MGAR--IESNNIEQGLKNA---VLKMNLNSNVIV-KAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRH 67 (125) T ss_pred CeeE--eeHHHHHHHHHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccc Confidence 6554 4455555555554 456666667776 6899999999999999985 35788887555432221 Q ss_pred ---cceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 79 ---LGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 79 ---Lgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) ...++=| |...=|.-|++. ||++-.||.||++.++...++++..+.++| +.|++ T Consensus 68 ~g~~~v~VG~~k~~~~~a~F~E~--GT~k~~a~pF~~~a~~~~~~ev~~~~~~~l-rk~~k 125 (125) T protein:vir:94 68 TSEKIVTIGYAKGVSHRIHATEF--GTMYQKPQLFITKTEKQGKNKVLKTMLDTA-KRLQK 125 (125) T ss_pred cceEEEEeccCCCCceEEEeccC--CccCCCCCchhhHHHHHhHHHHHHHHHHHH-HHHhC Confidence 1223333 444456779986 699999999999999999999998888887 55555 No 25 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.97 E-value=6.5e-08 Score=59.99 Aligned_cols=119 Identities=10% Similarity=0.064 Sum_probs=83.6 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhc-- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSN-- 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~N-- 78 (140) |... +|++.+++..++| -.+++++-|++| ..|+.++.+.+....|.|+ .++|.+++=.+.+.+.+ T Consensus 1 M~v~--v~~~~L~~~l~~l---~~~~~k~~~~Al-~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~ 67 (125) T protein:vir:79 1 MGAR--IESNNIEQGLKNA---VLKMNLNSNVIV-KAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRH 67 (125) T ss_pred CeeE--eeHHHHHHHHHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccc Confidence 6554 4455555555554 456666667776 6899999999999999985 35788887555432221 Q ss_pred ---cceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 79 ---LGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 79 ---Lgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) ...++=| |...=|.-|++. ||++-.||.||++.++...++++..+.++| +.|++ T Consensus 68 ~g~~~v~VG~~k~~~~~a~F~E~--GT~k~~a~pF~~~a~~~~~~ev~~~~~~~l-rk~~k 125 (125) T protein:vir:79 68 TSEKIVTIGYAKGVSHRIHATEF--GTMYQKPQLFITKTEKQGKNKVLKTMLDTA-KRLQK 125 (125) T ss_pred cceEEEEeccCCCCceEEEeccC--CccCCCCCchhhHHHHHhHHHHHHHHHHHH-HHHhC Confidence 1223333 444456779986 699999999999999999999998888887 55555 No 26 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.97 E-value=6.5e-08 Score=59.99 Aligned_cols=119 Identities=10% Similarity=0.064 Sum_probs=83.6 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhc-- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSN-- 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~N-- 78 (140) |... +|++.+++..++| -.+++++-|++| ..|+.++.+.+....|.|+ .++|.+++=.+.+.+.+ T Consensus 1 M~v~--v~~~~L~~~l~~l---~~~~~k~~~~Al-~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~ 67 (125) T protein:vir:47 1 MGAR--IESNNIEQGLKNA---VLKMNLNSNVIV-KAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRH 67 (125) T ss_pred CeeE--eeHHHHHHHHHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccc Confidence 6554 4455555555554 456666667776 6899999999999999985 35788887555432221 Q ss_pred ---cceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 79 ---LGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 79 ---Lgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) ...++=| |...=|.-|++. ||++-.||.||++.++...++++..+.++| +.|++ T Consensus 68 ~g~~~v~VG~~k~~~~~a~F~E~--GT~k~~a~pF~~~a~~~~~~ev~~~~~~~l-rk~~k 125 (125) T protein:vir:47 68 TSEKIVTIGYAKGVSHRIHATEF--GTMYQKPQLFITKTEKQGKNKVLKTMLDTA-KRLQK 125 (125) T ss_pred cceEEEEeccCCCCceEEEeccC--CccCCCCCchhhHHHHHhHHHHHHHHHHHH-HHHhC Confidence 1223333 444456779986 699999999999999999999998888887 55555 No 27 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.97 E-value=6.5e-08 Score=59.99 Aligned_cols=119 Identities=10% Similarity=0.064 Sum_probs=83.6 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhc-- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSN-- 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~N-- 78 (140) |... +|++.+++..++| -.+++++-|++| ..|+.++.+.+....|.|+ .++|.+++=.+.+.+.+ T Consensus 1 M~v~--v~~~~L~~~l~~l---~~~~~k~~~~Al-~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~ 67 (125) T protein:vir:98 1 MGAR--IESNNIEQGLKNA---VLKMNLNSNVIV-KAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRH 67 (125) T ss_pred CeeE--eeHHHHHHHHHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccc Confidence 6554 4455555555554 456666667776 6899999999999999985 35788887555432221 Q ss_pred ---cceEEee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 79 ---LGFELVS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 79 ---Lgf~i~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) ...++=| |...=|.-|++. ||++-.||.||++.++...++++..+.++| +.|++ T Consensus 68 ~g~~~v~VG~~k~~~~~a~F~E~--GT~k~~a~pF~~~a~~~~~~ev~~~~~~~l-rk~~k 125 (125) T protein:vir:98 68 TSEKIVTIGYAKGVSHRIHATEF--GTMYQKPQLFITKTEKQGKNKVLKTMLDTA-KRLQK 125 (125) T ss_pred cceEEEEeccCCCCceEEEeccC--CccCCCCCchhhHHHHHhHHHHHHHHHHHH-HHHhC Confidence 1223333 444456779986 699999999999999999999998888887 55555 No 28 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.94 E-value=1e-07 Score=58.95 Aligned_cols=120 Identities=12% Similarity=0.123 Sum_probs=92.6 Q ss_pred cHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc----hhccceEE Q lcl|NC_021539. 8 DFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK----MSNLGFEL 83 (140) Q Consensus 8 d~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~----~~NLgf~i 83 (140) =.+.++.|++.+++++.+.+++.+++|. +|+..+.+.+....|+++. + ...|.+++-..... ..+...+| T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~~al~-~ga~~~~~~~k~~ap~~~~-~----~~~hl~d~I~~~~~k~~~~g~~~~~V 74 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAKAAVT-EVAKEFEKALKANTPVYEV-E----TDERLQEDTVISGFKGANVGIVSKEI 74 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHhCCcCCC-C----chhhHHhhhhcccccccccCceEEEE Confidence 3578999999999999999999888875 5899999999999999953 2 35587776654322 12222233 Q ss_pred ee-cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 84 VS-KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 84 ~~-k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) -+ +...-|--|+.. |||+-.||.||+...+...+++++.+.+.+.++|.= T Consensus 75 G~~k~~~~y~~f~E~--GT~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 75 GYGKATGWRAHYPND--GTIYQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred eecCCCceeEeeecc--CccCCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 23 555678888887 589989999999999999999998888888777764 No 29 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=97.94 E-value=1.6e-07 Score=57.78 Aligned_cols=126 Identities=15% Similarity=0.150 Sum_probs=97.8 Q ss_pred ccHH-HHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchh-hhcccchhhhhhhhhhch-------- Q lcl|NC_021539. 7 VDFA-DVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKG-QLLNKNHAQTAGPFVAKM-------- 76 (140) Q Consensus 7 ld~s-~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~-~~rnK~HAk~s~pl~~~~-------- 76 (140) |||+ .++.++..++++=+++++.-++++ +.|+.++.+.+..-.|-+.-..+ .-++-+|.+++=..+..- T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~t-kaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~ 79 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKIT-KAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGS 79 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHH-HHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCcccccccccc Confidence 8888 788888889888777777666654 57999999999988885532111 123346988886665321 Q ss_pred hccceEEeecCcccce-eccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 77 SNLGFELVSKPKFNYL-IFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 77 ~NLgf~i~~k~kf~YL-vFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ...||. + + .|. -||+.| |++-.+|+|.++-..++.+.++++-.+++-+.|++..||- T Consensus 80 ~~VG~~---~-~-~~~Ahf~n~G--T~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~~~~~~ 137 (139) T protein:vir:10 80 STVGFH---N-K-AHIARFLNDG--TKNIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGGD 137 (139) T ss_pred ceeCCC---C-C-ceeeeeeccC--ccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 347775 3 3 355 899987 8899999999999999999999999999999999999998 No 30 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=97.79 E-value=3.1e-07 Score=56.24 Aligned_cols=123 Identities=20% Similarity=0.337 Sum_probs=80.9 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhh---hHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKA---VPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS 77 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg---~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~ 77 (140) || ++++..+++|+++|+++|..+++.+.+.+..-. .-.++.......||. + |.+ .++++.+.. T Consensus 1 m~---~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~Pvd-t--G~L--------r~SI~~~~~ 66 (182) T protein:vir:10 1 MI---EVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYS-T--GEL--------TRSFKHEVK 66 (182) T ss_pred Ce---EEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC-c--hhh--------hhceeeeee Confidence 54 458889999999999999999988877775433 334455666777886 3 222 222322211 Q ss_pred ccc----eEEeecCcccceeccccCC----------------------------------------------------CC Q lcl|NC_021539. 78 NLG----FELVSKPKFNYLIFPDQGV----------------------------------------------------GK 101 (140) Q Consensus 78 NLg----f~i~~k~kf~YLvFPd~Gi----------------------------------------------------G~ 101 (140) .=| -+|.+.. .|=+|...|- ++ T Consensus 67 ~~~~~~~g~V~~~~--~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t 144 (182) T protein:vir:10 67 VDGDEVIGRWWNSS--MVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRT 144 (182) T ss_pred ecCCeEEEEeecCC--CccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeec Confidence 111 1122211 1222222222 23 Q ss_pred CCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 102 NNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 102 sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) +.--+|-||.-.++...++|.+.+.+++.+++.+.+|| T Consensus 145 ~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 145 TGQPARQFMTPAANKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred CCCCCCcchHHHHHHhHHHHHHHHHHHHHHHHHHhhcC Confidence 44467779999999999999999999999999999999 No 31 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=97.68 E-value=2.5e-07 Score=56.82 Aligned_cols=116 Identities=17% Similarity=0.246 Sum_probs=91.0 Q ss_pred cccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccceEE Q lcl|NC_021539. 4 KWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGFEL 83 (140) Q Consensus 4 ~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf~i 83 (140) =+++++..+|+|...++++=...+.+-|++|.. |.....+.+.+..|++. .++.+-+... -.=||-. T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~-g~e~I~~~~~~n~P~~t--------g~lkkik~~~----kk~g~~~ 67 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKA-GITKIGKAIEKNSPIKS--------GRLSKVKIRV----KNTGLAT 67 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHH-HhHHHHHHHhhcCCccc--------CCcceeeeee----ecCceeE Confidence 246788899999999999999999999999975 55666678999999973 2233433333 3335644 Q ss_pred ee--cCcccceeccccCCCCCCCchH-HHHHhchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_021539. 84 VS--KPKFNYLIFPDQGVGKNNKTKQ-DFMLLGLEESTAEIVEMLEEDVLKEIN 134 (140) Q Consensus 84 ~~--k~kf~YLvFPd~GiG~sn~~~q-~FmerGl~~~~~~i~E~L~~~l~k~in 134 (140) +- |..+=|-+||.- |||+--++ -||+++++...+++++.+.++|.+... T Consensus 68 VG~~ks~~fy~kF~EF--GTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 68 EGTASSSEFYDIFQNF--GTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred eccCCcchhhhhhccc--cccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 43 777889999985 68877677 499999999999999999999888887 No 32 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=97.57 E-value=1.1e-06 Score=53.36 Aligned_cols=129 Identities=11% Similarity=0.088 Sum_probs=95.5 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc------ Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK------ 75 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~------ 75 (140) |++|+ ..++.|.+.++++-...++.=++++. .|+.+..+.+..--|.+...-..-.+-+|.+++=-.+.. T Consensus 1 M~~~~---d~l~e~~~~lekl~~~~~~~~~katk-AGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~ 76 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLTPAEQAKITT-AGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRK 76 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCCHHHHHHHHH-HHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeeccccccccc Confidence 66655 36999999999998878888888887 899999999999888764322233456788877655421 Q ss_pred --hhccceEEeecCcccce-eccccCCCCCCCchHHHHHhchhhh--hHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 76 --MSNLGFELVSKPKFNYL-IFPDQGVGKNNKTKQDFMLLGLEES--TAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 76 --~~NLgf~i~~k~kf~YL-vFPd~GiG~sn~~~q~FmerGl~~~--~~~i~E~L~~~l~k~in~~lgg~ 140 (140) ..+.||. +..-+|+ -||+.| |++-.+++|.++-.+++ .+.|+++-.+.+-++|++.=| - T Consensus 77 ~g~s~VG~~---kk~~a~~A~f~n~G--T~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~~~-~ 140 (140) T protein:vir:48 77 NGVSTVGWV---NRYHAQNARRLNDG--TKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRKKGG-E 140 (140) T ss_pred CceeeeccC---CCcceeeeeccccC--ccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhhcC-C Confidence 3355663 3323555 899987 78889999999999987 457888888888888887644 4 No 33 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=97.42 E-value=2.5e-06 Score=51.32 Aligned_cols=129 Identities=12% Similarity=0.090 Sum_probs=93.3 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc------ Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK------ 75 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~------ 75 (140) |++|+ ..++.|.+.++++-....+.=.+++ +.|+....+.+..--|.|....+.-++=+|.+++=..+.. T Consensus 1 M~~~~---d~l~e~~~~v~kl~~~~~~~~~kat-kAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~ 76 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLTPAEQAKIT-TAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRK 76 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCCHHHHHHHH-HHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceeccccccccc Confidence 66655 3588888888777665555555554 4689999999888888875433323344699988776521 Q ss_pred --hhccceEEeecCcccce-eccccCCCCCCCchHHHHHhchhhh--hHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 76 --MSNLGFELVSKPKFNYL-IFPDQGVGKNNKTKQDFMLLGLEES--TAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 76 --~~NLgf~i~~k~kf~YL-vFPd~GiG~sn~~~q~FmerGl~~~--~~~i~E~L~~~l~k~in~~lgg 139 (140) ..+.||. +..-+|+ -||+.| |++-.+++|.++-.+++ .+.|+++-.+++.+.|++..|- T Consensus 77 dG~s~VG~~---k~~~a~~a~f~NdG--T~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 77 NGVATVGWK---NNYHAQNARRLNDG--TKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ccceeeccc---CCCceeEEeecccC--ccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 3446666 2222444 899987 78888999999999976 5789999999999999998777 No 34 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=97.26 E-value=2.1e-06 Score=51.66 Aligned_cols=114 Identities=11% Similarity=0.124 Sum_probs=89.1 Q ss_pred ccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccceEEe Q lcl|NC_021539. 5 WSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGFELV 84 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf~i~ 84 (140) -|++++.+++|++.+++++..+.+.+.++|+.-+.+ ..+.+..+.|++ + |.+| ++++.+..-+..+|. T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~-i~~~ak~~aPv~-T--G~Lr--------~sI~~~~~g~~~~V~ 68 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEK-GKRIAKQLAPKD-T--EFLK--------DHITTSYPGMEAHIH 68 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCcC-c--hhhh--------hceeeecCceEEEee Confidence 466777999999999999999999999999876655 577889999998 3 3222 234444444555777 Q ss_pred ecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_021539. 85 SKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEIN 134 (140) Q Consensus 85 ~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in 134 (140) ++.. |-.|-. -||++-.+|-||.-+++...+++++.|.+.|-+.+. T Consensus 69 ~~~~--Ya~yvE--~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 69 GEAG--YDGYQE--YGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred cCCC--ccceee--cCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 7664 555543 468888899999999999999999999999999998 No 35 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=97.10 E-value=1.9e-05 Score=46.41 Aligned_cols=133 Identities=15% Similarity=0.223 Sum_probs=88.5 Q ss_pred CCccccccHHHHHHHHHHHHhc-chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhh-hhhhhhhchhc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKI-PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQ-TAGPFVAKMSN 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~i-P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk-~s~pl~~~~~N 78 (140) |++--++||+++++|++++.++ +...++.+-+.|..-+..+ ...|..+.||- + |.+|+-=++. ..+.+...+.. T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l-~~~vk~~tPVd-T--G~Lr~sw~~~~~~~~~~~~~~g 76 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARL-LGKVIRRTPVD-T--GFLRQGWNGVAYARSLPVYKQG 76 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH-HHHHHHhCCCc-c--hhhcccccccccccccceeecC Confidence 4444589999999999999765 6677888888777666554 55677889996 3 4444433332 23445555666 Q ss_pred cceEEeecCcccceeccccCCCCCCC-----chH-HHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 79 LGFELVSKPKFNYLIFPDQGVGKNNK-----TKQ-DFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 79 Lgf~i~~k~kf~YLvFPd~GiG~sn~-----~~q-~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) -|++|..-..-.|--|=..| .... .+. .++++.++.....+-..+++.|++.+++.+-| T Consensus 77 ~~~~v~v~n~~~YA~~VE~G--hr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 77 NNYIIEVVNPTEYASYVNFG--HRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred CeeEEEEecCCcchhhhhcc--eeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 66665554444555555555 2222 233 35678888777777888888888888888888 No 36 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=96.99 E-value=1.1e-05 Score=47.82 Aligned_cols=118 Identities=14% Similarity=0.093 Sum_probs=83.7 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc--hhccceEEe Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK--MSNLGFELV 84 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~--~~NLgf~i~ 84 (140) |++..+++|.+.++++|..+++++.++|. +++..+.+.+....|+. ++ |-+.| ++.+ ...-|.... T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~-~~a~~i~~~ak~~aPv~-TG--------~Lr~s--I~~~~~~~~~~~~~~ 68 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTE-EAANFIEDRAKTLAPKN-FG--------KLAQS--ISTSDLKAKDLISKK 68 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCcC-ch--------hhhhc--ceeeeeccCceeEEe Confidence 99999999999999999999999999987 67777788888899987 32 22222 2211 111122222 Q ss_pred ecCcccceeccccCCCC-----------------------------------------------------CCCchHHHHH Q lcl|NC_021539. 85 SKPKFNYLIFPDQGVGK-----------------------------------------------------NNKTKQDFML 111 (140) Q Consensus 85 ~k~kf~YLvFPd~GiG~-----------------------------------------------------sn~~~q~Fme 111 (140) ......|-.|-..|-|. +.-.+|-||. T Consensus 69 v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~ 148 (173) T protein:vir:10 69 ITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLY 148 (173) T ss_pred eCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccch Confidence 23333444444444332 2344666999 Q ss_pred hchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 112 LGLEESTAEIVEMLEEDVLKEINNI 136 (140) Q Consensus 112 rGl~~~~~~i~E~L~~~l~k~in~~ 136 (140) --++...+++++.+.+.+.++|-++ T Consensus 149 PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 149 PAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred hHHHHhHHHHHHHHHHHHHHHhhcC Confidence 9999999999999999999999999 No 37 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=96.82 E-value=9.4e-06 Score=48.15 Aligned_cols=107 Identities=12% Similarity=0.224 Sum_probs=83.0 Q ss_pred HHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhc-cceEEeecC Q lcl|NC_021539. 9 FADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSN-LGFELVSKP 87 (140) Q Consensus 9 ~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~N-Lgf~i~~k~ 87 (140) +..+++|++.++++|..+.+.+++.|.. ++..+.+.+...-|+. + |. +.++++.++.+ +..+|.+.. T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~-~a~~i~~~ak~~aPv~-T--G~--------Lr~sI~~~~~~~~~~~v~~~~ 68 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSK-SAARIERQAKILAPVD-T--GW--------LRAQIYSEQQRLLHYRVVSPA 68 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCcC-c--hh--------hhcceeeeecCcEEEEeecCc Confidence 9999999999999999999999999985 5555677788899997 3 32 22334433333 456677655 Q ss_pred cccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 88 KFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 88 kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) . |-.|-.- ||++-.+|-||+-.++...+.+.+.|.+.|.| T Consensus 69 ~--Ya~~vE~--GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 69 L--YSIYLEL--GTRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred c--cchhccc--CccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 4 5555554 58888899999999999999999999988888 No 38 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=96.62 E-value=1.9e-05 Score=46.43 Aligned_cols=112 Identities=16% Similarity=0.184 Sum_probs=78.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +++..+++|++.++++|.++++.+.++|..-|.+++. ......|-+.+ +++ =.+.++++.++. .+.++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~-~a~~~a~~~~~-------~p~~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVV-RAKLKAREVMN-------KGYWTGNLSRNIRYKKTGDLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhccccCC-------CCCCchhhhhcceeeecCceEEEe Confidence 9999999999999999999999999999987777654 44454432211 111 012333443333 355677 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+..- |-.|.-. ||++-.+|-||.-.++...+.+...|.+.+- T Consensus 73 ~~~~~--Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 73 TSHAA--YSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecCcc--chhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 77654 5555554 6888899999999999888888888776665 No 39 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=96.62 E-value=1.9e-05 Score=46.43 Aligned_cols=112 Identities=16% Similarity=0.184 Sum_probs=78.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +++..+++|++.++++|.++++.+.++|..-|.+++. ......|-+.+ +++ =.+.++++.++. .+.++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~-~a~~~a~~~~~-------~p~~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVV-RAKLKAREVMN-------KGYWTGNLSRNIRYKKTGDLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhccccCC-------CCCCchhhhhcceeeecCceEEEe Confidence 9999999999999999999999999999987777654 44454432211 111 012333443333 355677 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+..- |-.|.-. ||++-.+|-||.-.++...+.+...|.+.+- T Consensus 73 ~~~~~--Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 73 TSHAA--YSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecCcc--chhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 77654 5555554 6888899999999999888888888776665 No 40 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=96.62 E-value=1.9e-05 Score=46.43 Aligned_cols=112 Identities=16% Similarity=0.184 Sum_probs=78.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +++..+++|++.++++|.++++.+.++|..-|.+++. ......|-+.+ +++ =.+.++++.++. .+.++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~-~a~~~a~~~~~-------~p~~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVV-RAKLKAREVMN-------KGYWTGNLSRNIRYKKTGDLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhccccCC-------CCCCchhhhhcceeeecCceEEEe Confidence 9999999999999999999999999999987777654 44454432211 111 012333443333 355677 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+..- |-.|.-. ||++-.+|-||.-.++...+.+...|.+.+- T Consensus 73 ~~~~~--Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 73 TSHAA--YSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecCcc--chhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 77654 5555554 6888899999999999888888888776665 No 41 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=96.62 E-value=1.9e-05 Score=46.43 Aligned_cols=112 Identities=16% Similarity=0.184 Sum_probs=78.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +++..+++|++.++++|.++++.+.++|..-|.+++. ......|-+.+ +++ =.+.++++.++. .+.++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~-~a~~~a~~~~~-------~p~~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVV-RAKLKAREVMN-------KGYWTGNLSRNIRYKKTGDLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhccccCC-------CCCCchhhhhcceeeecCceEEEe Confidence 9999999999999999999999999999987777654 44454432211 111 012333443333 355677 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+..- |-.|.-. ||++-.+|-||.-.++...+.+...|.+.+- T Consensus 73 ~~~~~--Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 73 TSHAA--YSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecCcc--chhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 77654 5555554 6888899999999999888888888776665 No 42 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=96.62 E-value=1.9e-05 Score=46.43 Aligned_cols=112 Identities=16% Similarity=0.184 Sum_probs=78.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +++..+++|++.++++|.++++.+.++|..-|.+++. ......|-+.+ +++ =.+.++++.++. .+.++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~-~a~~~a~~~~~-------~p~~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVV-RAKLKAREVMN-------KGYWTGNLSRNIRYKKTGDLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhccccCC-------CCCCchhhhhcceeeecCceEEEe Confidence 9999999999999999999999999999987777654 44454432211 111 012333443333 355677 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+..- |-.|.-. ||++-.+|-||.-.++...+.+...|.+.+- T Consensus 73 ~~~~~--Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 73 TSHAA--YSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecCcc--chhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 77654 5555554 6888899999999999888888888776665 No 43 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=96.62 E-value=1.9e-05 Score=46.43 Aligned_cols=112 Identities=16% Similarity=0.184 Sum_probs=78.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +++..+++|++.++++|.++++.+.++|..-|.+++. ......|-+.+ +++ =.+.++++.++. .+.++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~-~a~~~a~~~~~-------~p~~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVV-RAKLKAREVMN-------KGYWTGNLSRNIRYKKTGDLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhccccCC-------CCCCchhhhhcceeeecCceEEEe Confidence 9999999999999999999999999999987777654 44454432211 111 012333443333 355677 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+..- |-.|.-. ||++-.+|-||.-.++...+.+...|.+.+- T Consensus 73 ~~~~~--Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 73 TSHAA--YSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecCcc--chhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 77654 5555554 6888899999999999888888888776665 No 44 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=96.56 E-value=3.2e-05 Score=45.26 Aligned_cols=129 Identities=11% Similarity=0.128 Sum_probs=80.5 Q ss_pred CCccc-cccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhh----hhhc Q lcl|NC_021539. 1 MCAKW-SVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGP----FVAK 75 (140) Q Consensus 1 m~a~~-sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~p----l~~~ 75 (140) |..+. ++||++ |...++++|..++++...++. +|+....+.+..+-|+- . |.++.-=..+.... ... T Consensus 1 m~~~~~~~d~s~---l~~~l~~l~~~~~~v~R~A~~-~ga~vv~dear~~aP~~-t--G~LkksI~~~~~~~~s~~g~~- 72 (157) T protein:vir:97 1 MKFSIRSVDITG---ILAGLETVVEHSSDVVRTMTY-ESAVAVRESAKAFVNDE-T--GKLRNNLYVAYSPEESVEGIQ- 72 (157) T ss_pred CeeEeecccHHH---HHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCC-c--chhhhheeeeeccccCCCceE- Confidence 65555 566665 666667778888887777765 67888888888999974 2 33332221111110 000 Q ss_pred hhccceEE---------------------eecCcccceeccccCCCCCCCch-HHHHHhchhhhhHHHHHHHHHHHHHHH Q lcl|NC_021539. 76 MSNLGFEL---------------------VSKPKFNYLIFPDQGVGKNNKTK-QDFMLLGLEESTAEIVEMLEEDVLKEI 133 (140) Q Consensus 76 ~~NLgf~i---------------------~~k~kf~YLvFPd~GiG~sn~~~-q~FmerGl~~~~~~i~E~L~~~l~k~i 133 (140) .-..|+.- .|++.|-+... -.|+++..| |-||.-+.+...++++++..+.+-+.| T Consensus 73 ~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~---~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I 149 (157) T protein:vir:97 73 TYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKV---KLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKY 149 (157) T ss_pred EEEEeecCCccceeeeeecCcccccccccCCccccccccc---ccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHH Confidence 00111111 11222322222 144555445 669999999999999999999999999 Q ss_pred HHHhcCC Q lcl|NC_021539. 134 NNILGGN 140 (140) Q Consensus 134 n~~lgg~ 140 (140) -+.|+|- T Consensus 150 ~e~l~g~ 156 (157) T protein:vir:97 150 AELQRGD 156 (157) T ss_pred HHHhcCC Confidence 9999999 No 45 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=96.45 E-value=3e-05 Score=45.39 Aligned_cols=112 Identities=15% Similarity=0.166 Sum_probs=77.9 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccch--hhhhhhhhhchh-ccceEE Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNH--AQTAGPFVAKMS-NLGFEL 83 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~H--Ak~s~pl~~~~~-NLgf~i 83 (140) +.+..+++|++.++.+|.++.+.+.++|++-|.++. +.+....|-+.+ ++| -.+.++++.++. -+..++ T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~-~~a~~~a~~~~~-------~pv~TG~Lr~sI~~~~~g~~~~~v 72 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGV-GIAVSNAKEVMN-------KGYWTGNLASLIEVKKIGDLHYRV 72 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhccccC-------CCCcchhhhhceeeeecCcEEEEe Confidence 999999999999999999999999999998777755 344444442211 111 012233443332 244566 Q ss_pred eecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 84 VSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 84 ~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .+... |-.|-- -||++-.+|-||.-.++...+.+.+.|.+.++ T Consensus 73 ~~~~~--Ya~~vE--fGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 73 ISTAH--YSGFLE--FGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred eCCCc--cchhee--cccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 66554 444544 46999899999999999988888888877777 No 46 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=96.40 E-value=2.3e-05 Score=46.06 Aligned_cols=109 Identities=17% Similarity=0.192 Sum_probs=75.4 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh--c Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS--N 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~--N 78 (140) -||++ .| .+++|.+.++++|..+.+.+++.|.. .+..+...+....|+. + |.+| ++++.+.. . T Consensus 12 ~Ma~~--~~-Gld~l~~~L~~~~~~~~~~~~~al~~-~a~~v~~~ak~~aPvd-T--G~Lr--------~SI~~~~~~~g 76 (149) T protein:vir:94 12 HMAKV--KY-GADSMVVELDKFDKKIEEWVKKGIAK-TTTKIYNTAVALAPVD-L--GFLE--------ESIDFKYFDGG 76 (149) T ss_pred hHHHH--HH-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcc-c--chhh--------cCeeEEeeCCc Confidence 23553 34 89999999999999999999999984 5556677788889985 3 3333 33332222 2 Q ss_pred cceEEeecCcccceeccccCCCCCC---------------------------CchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 79 LGFELVSKPKFNYLIFPDQGVGKNN---------------------------KTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 79 Lgf~i~~k~kf~YLvFPd~GiG~sn---------------------------~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +..+|.+.. .|-.|-..|=|.+. --||.||...+++..++|.+.|+ T Consensus 77 ~~~~V~~~~--~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 77 LSSVISVGA--DYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred EEEEEecCC--CcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 444555543 46666666654431 13667999999999999988888 No 47 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=96.20 E-value=3.7e-05 Score=44.90 Aligned_cols=109 Identities=17% Similarity=0.220 Sum_probs=77.2 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhch--hc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKM--SN 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~--~N 78 (140) -||+ +.| .+++|.+.++++|..+.+.+.++|.. ++..+...+....|+. + |.+| ++++.+. .. T Consensus 12 ~Ma~--v~~-Gld~l~~~l~~~~~~~~~~~~~~l~~-~a~~v~~~ak~~aPvd-T--G~L~--------~SI~~~~~~~g 76 (149) T protein:vir:10 12 HMAK--VKY-GADSMVVELDKFDKKIEEWVKKGIAK-TTTKIYNTAVALAPVD-L--GFLE--------ESIDFKYFDGG 76 (149) T ss_pred hhHH--HHH-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcc-c--chhh--------ccceEEecCCc Confidence 2344 444 89999999999999999999999985 6667777888889985 3 3333 3333221 22 Q ss_pred cceEEeecCcccceeccccCCCCCC---------------------------CchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 79 LGFELVSKPKFNYLIFPDQGVGKNN---------------------------KTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 79 Lgf~i~~k~kf~YLvFPd~GiG~sn---------------------------~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +..+|.+.. .|-.|-..|-|.+. --||.||+..+++..++|.+.|+ T Consensus 77 ~~~~V~~~~--~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 77 LSSVISVGA--DYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred EEEEEecCC--CcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 444555544 46677777765442 13677999999999999998888 No 48 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=96.02 E-value=6.3e-05 Score=43.61 Aligned_cols=110 Identities=13% Similarity=0.141 Sum_probs=76.6 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) ||+++ + .+++|.+.++++|.+.++.+.+.|+. ++..+...+....|+. + |.+|+-=+.+ .+...+.. T Consensus 1 Ma~~~--~-G~~~l~~~L~~~~~~~~~~~~~al~~-~a~~v~~~ak~~aPvd-T--G~Lr~SI~~~------~~~~~~~~ 67 (137) T protein:vir:94 1 MAKVK--Y-GNWDLVKELENYERDIERWVKRGIAK-TTVKIHNTIISLMPVD-T--GYLRESVTMD------FKDGGFTG 67 (137) T ss_pred CchhH--H-hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcC-c--chhhcCceeE------eecCcEEE Confidence 67665 2 88899999999999999999999966 4556677888889985 3 4444422211 12223445 Q ss_pred EEeecCcccceeccccCCCCC---------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKN---------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~s---------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+.. .|-.|-..|-|.| .--+|-||...++...+.|...|- T Consensus 68 ~V~~~~--~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 68 VINIGS--EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred EEecCC--CcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 555555 4557777776554 124566999999999999888887 No 49 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=96.02 E-value=0.00015 Score=41.61 Aligned_cols=128 Identities=11% Similarity=0.173 Sum_probs=83.7 Q ss_pred CCccccccHHHHHHHHHHHHh--cchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccc--h------------hhhcccc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISK--IPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNW--K------------GQLLNKN 64 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~--iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~--k------------~~~rnK~ 64 (140) ||+.|+ | .++.+.+.+++ -|+.+++. -.-..|+..+.+.+..--|.|... + ++-++-+ T Consensus 1 mm~~~~-~--~l~~~l~~v~k~~~~~~~~k~---kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~ 74 (159) T protein:vir:38 1 MANDMG-E--FYNNWVNEVEKGMKLSVEDKA---KITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTK 74 (159) T ss_pred CcchHH-H--HHHHHHHHHHHhcCCCHHHHH---HHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCC Confidence 998866 3 26666666644 26666654 234567777777776666665321 1 1223445 Q ss_pred hhhhhhhhhh---------chhccceEEeecCcccce-eccccCCCCCCCchH-----HHHHhchhhhhHHHHHHHHHHH Q lcl|NC_021539. 65 HAQTAGPFVA---------KMSNLGFELVSKPKFNYL-IFPDQGVGKNNKTKQ-----DFMLLGLEESTAEIVEMLEEDV 129 (140) Q Consensus 65 HAk~s~pl~~---------~~~NLgf~i~~k~kf~YL-vFPd~GiG~sn~~~q-----~FmerGl~~~~~~i~E~L~~~l 129 (140) |-.++=-+++ -..+.||. +...+|+ -|++.| |++--+| +|.++=..++.+.|+++-.+++ T Consensus 75 HlaD~I~~~~~~~iDg~~dG~s~VGw~---~~~~a~~a~f~NdG--T~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~ 149 (159) T protein:vir:38 75 HLQDSITYKPGYTADKLHTGDTDVGFE---GKYYDFLAKIVNNG--QHHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAY 149 (159) T ss_pred ccccceeeecCccccccccceeeeccc---CCccceEeeecccC--ccccCCCCccCChhHHHHHHHHHHHHHHHHHHHH Confidence 7555544432 26678997 3333455 899986 4443333 7999999999999999999999 Q ss_pred HHHHHHHhcC Q lcl|NC_021539. 130 LKEINNILGG 139 (140) Q Consensus 130 ~k~in~~lgg 139 (140) .+.|+..--- T Consensus 150 ~~il~~~~~~ 159 (159) T protein:vir:38 150 KEVMNHDSDK 159 (159) T ss_pred HHHhhcccCC Confidence 9999887666 No 50 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=95.78 E-value=9.5e-05 Score=42.64 Aligned_cols=108 Identities=16% Similarity=0.188 Sum_probs=78.5 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcC------Cccccchhhhcccchhhhhhhhhhchhc-c Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRI------NLSKNWKGQLLNKNHAQTAGPFVAKMSN-L 79 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~i------PvS~r~k~~~rnK~HAk~s~pl~~~~~N-L 79 (140) +.+..+++|++.++++|.++.+.++++|++-|.++..+ +.... |+. + | .+.+++..++.+ + T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~-a~~~a~~~~~~p~~-T--G--------~Lr~SI~~~~~g~~ 68 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVR-AKLKAREVMNKGYW-T--G--------NLSRNIRYKKTVDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhccccCCCCc-c--h--------hhhhceeeeecCcE Confidence 99999999999999999999999999998888776544 33333 443 1 2 122334433333 5 Q ss_pred ceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) ..+|.|... |-.|. =.||++-.+|-||.-.++...+.+...|.+.+- T Consensus 69 ~~~V~~~~~--Ya~~v--E~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 69 QYTITSHAA--YSGFL--EFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred EEEecCCcc--ccccc--cccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 556666654 44453 457999899999999999999988888876665 No 51 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=95.70 E-value=9.6e-05 Score=42.61 Aligned_cols=110 Identities=15% Similarity=0.160 Sum_probs=73.7 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) |++++. ..++|.+.++++|....+.++++|..-+. .+...+....|+. + |.+|+.=+. ..+-.++.. T Consensus 1 Ma~~~~---G~~~l~~~l~~~~~~~~~~~~~al~~~a~-~i~~~ak~~aPv~-T--G~Lr~SI~~------~~~~~~~~~ 67 (137) T protein:vir:10 1 MAKVKY---GNWDLVKELEEFEKETIRWAKKGIAKTTT-IIHNSIVSNMPVD-T--GYLRESVSM------DFKKGGLTG 67 (137) T ss_pred Cccchh---CHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCcC-c--chhhcCeee------EecCCcEEE Confidence 777653 67889999999999999999999988665 4577888889985 3 444433211 112234555 Q ss_pred EEeecCcccceeccccCCCCC---------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKN---------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~s---------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+... |-.|-..|-|.+ .--+|.||...++...++|...|- T Consensus 68 ~V~~~~~--YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 68 VINIGSE--YAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred EEecCCc--cccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 5555433 455555554332 234667999999988888877776 No 52 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=95.54 E-value=0.00013 Score=41.87 Aligned_cols=110 Identities=13% Similarity=0.175 Sum_probs=73.5 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) ||++ .+ .+++|.+.++++|..+.+.+.+.|.. ++..+...+....|+- + |.+|+-=+.. .+...+.. T Consensus 1 Ma~~--~~-Gl~~l~~~l~~~~~~~~~~~~~al~~-~a~~v~~~ak~~apvd-T--G~Lr~SI~~~------~~~~g~~~ 67 (135) T protein:vir:96 1 MAKV--KY-GADSIVVDLEKYSKDMEKWVKKGITK-TTLKIYNTAIHLMPVD-T--GFLRQSTTVD------FENGGFTG 67 (135) T ss_pred Cchh--hh-hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcc-c--hhhhcceeEE------eecCcEEE Confidence 6664 35 88999999999999999999999987 4555677788888985 3 4444332211 11222344 Q ss_pred EEeecCcccceeccccCCCCC-------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKN-------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~s-------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|-+.. .|-.|-..|-|.| .--+|.||...++...++|.+.+. T Consensus 68 ~V~~~~--~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 68 VVKIGS--NYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred EEecCC--CccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 455433 3556666665544 124666999999988888777776 No 53 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=95.41 E-value=0.00017 Score=41.29 Aligned_cols=113 Identities=12% Similarity=0.132 Sum_probs=71.6 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh---- Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS---- 77 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~---- 77 (140) |+++++.+ +.++|+++++.+++.+..+++++|..-+-.+ .......-|+- + |.+|+ +++.... T Consensus 1 Ma~~~~~~-~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i-~~~ak~~aPv~-T--G~Lr~--------SI~~~~~~~g~ 67 (142) T protein:vir:94 1 MAGLNYRV-NSTEFQGALRAALDRLTGAAREATEAAANDM-VNMAKGLCPVD-T--GRLRS--------SIQAVPSGGRF 67 (142) T ss_pred CceeEEEe-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhCCcc-c--hhhhc--------cceeeeccCCc Confidence 89998776 4678999999999999999999998776655 56677788874 2 43333 3332222 Q ss_pred ccceEEeecCcccceeccccCCCCC----------------------C---CchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFELVSKPKFNYLIFPDQGVGKN----------------------N---KTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 78 NLgf~i~~k~kf~YLvFPd~GiG~s----------------------n---~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) .++.+|.+ .-.|-.|-..|-|.| + --||.||..+++...++|.+.+ ++|- T Consensus 68 ~~~~~v~~--~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~-~~~~ 142 (142) T protein:vir:94 68 SFSVTIGT--NVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHA-KGIR 142 (142) T ss_pred eEEEEEec--CcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHH-HhcC Confidence 22233333 234555555554433 1 1267799999998877774433 2233 No 54 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=95.29 E-value=0.0002 Score=40.81 Aligned_cols=108 Identities=14% Similarity=0.117 Sum_probs=75.2 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhh--chhcc Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVA--KMSNL 79 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~--~~~NL 79 (140) ||++. .+.++|.+.++++|....+.+.++|+.-+. .+...+....|+. + |.+|+ ++.. ....+ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~-~~~~~ak~~~pvd-T--G~L~~--------Si~~~~~~~g~ 65 (137) T protein:vir:96 1 MAKVK---YGNWDLVAELEDYRDEMEEWVKKGILKTTL-AIYNTAVALAPVD-L--GFLKE--------SIDFKVTDGGF 65 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCcC-c--cchhc--------CceeEeecCce Confidence 66664 378999999999999999999999977665 4566788889985 3 43333 3332 22334 Q ss_pred ceEEeecCcccceeccccCCCCCC---------------------------CchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGVGKNN---------------------------KTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~GiG~sn---------------------------~~~q~FmerGl~~~~~~i~E~L~ 126 (140) ..+|.+.. .|-.|-..|=|.+- --+|.||...+++..++|...|- T Consensus 66 ~~~V~~~~--~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 66 SSVISVGA--EYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred EEEEecCC--CcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 45555544 46677777765541 23567999999988887766665 No 55 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=95.23 E-value=0.00022 Score=40.67 Aligned_cols=110 Identities=15% Similarity=0.169 Sum_probs=76.3 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) |++++ -..++|++.++++|.+..+.+.++|.. ++-.+...+....|+. + |.+|+-=+.+ .+...+.. T Consensus 1 Ma~~~---~Gl~~l~~~l~~~~~~~~~~~~~al~~-~a~~i~~~ak~~aPvd-T--G~Lr~SI~~~------~~~~~~~~ 67 (137) T protein:vir:10 1 MAKVK---YGNWELVKELEDFEKETIRWAKKGIAK-TTTIIHNSIVSNMPVD-T--GYLRESVSMD------FKKGGLTG 67 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcC-c--chhhcCeeEE------eeCCcEEE Confidence 66663 378899999999999999999999876 5566788889999995 3 4444432221 12234555 Q ss_pred EEeecCcccceeccccCCCCC---------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKN---------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~s---------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+... |-.|-..|-|.| .--+|-||+..+++..++|...|- T Consensus 68 ~V~~~~~--Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 68 VINIGSE--YAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred EEecCCC--cccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 6666543 556666665443 224677999999998888877776 No 56 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=95.11 E-value=0.00019 Score=41.02 Aligned_cols=110 Identities=19% Similarity=0.228 Sum_probs=74.0 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhh----cCCccccchhhhcccchhhhhhhhhhch Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEK----RINLSKNWKGQLLNKNHAQTAGPFVAKM 76 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~----~iPvS~r~k~~~rnK~HAk~s~pl~~~~ 76 (140) |+ +++++.+++|++.++++.. .+.++++|...+..++ +.+.. .-|+. + | + +.++++.+. T Consensus 1 Ma---~i~~~Gld~l~~~L~~~~~--~~~v~~~~~~~~~~~~-~~~~~~a~~~~p~~-T--G------~--Lr~sI~~~~ 63 (114) T protein:vir:27 1 MA---TIEFEGLDEMAQSLLKNAS--PEKRSKVLRKYGSKLK-EAAVNRAQFNKGYS-T--G------A--TRRSITLQV 63 (114) T ss_pred Ce---eeeeehHHHHHHHHHHhcC--HHHHHHHHHHHHHHHH-HHHHHhcccCCCCC-c--h------h--hhhceeeee Confidence 44 4888999999999998742 3456677766665544 23333 34664 2 2 2 233456667 Q ss_pred hccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 77 SNLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 77 ~NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) .+.|++|-+...|+ |=+=.||++-.+|-||.-.++...+.+.+.|.+-+-- T Consensus 64 ~~~~~~V~~~~~Ya----~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 64 ESDKATVEALTSYS----GYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred cCCeeEecCCCCcc----ceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 88999999887654 3344578998999999998888887777766543322 No 57 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=95.11 E-value=0.00019 Score=41.02 Aligned_cols=110 Identities=19% Similarity=0.228 Sum_probs=74.0 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhh----cCCccccchhhhcccchhhhhhhhhhch Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEK----RINLSKNWKGQLLNKNHAQTAGPFVAKM 76 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~----~iPvS~r~k~~~rnK~HAk~s~pl~~~~ 76 (140) |+ +++++.+++|++.++++.. .+.++++|...+..++ +.+.. .-|+. + | + +.++++.+. T Consensus 1 Ma---~i~~~Gld~l~~~L~~~~~--~~~v~~~~~~~~~~~~-~~~~~~a~~~~p~~-T--G------~--Lr~sI~~~~ 63 (114) T protein:vir:49 1 MA---TIEFEGLDEMAQSLLKNAS--PEKRSKVLRKYGSKLK-EAAVNRAQFNKGYS-T--G------A--TRRSITLQV 63 (114) T ss_pred Ce---eeeeehHHHHHHHHHHhcC--HHHHHHHHHHHHHHHH-HHHHHhcccCCCCC-c--h------h--hhhceeeee Confidence 44 4888999999999998742 3456677766665544 23333 34664 2 2 2 233456667 Q ss_pred hccceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 77 SNLGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 77 ~NLgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) .+.|++|-+...|+ |=+=.||++-.+|-||.-.++...+.+.+.|.+-+-- T Consensus 64 ~~~~~~V~~~~~Ya----~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 64 ESDKATVEALTSYS----GYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred cCCeeEecCCCCcc----ceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 88999999887654 3344578998999999998888887777766543322 No 58 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=95.08 E-value=0.00035 Score=39.52 Aligned_cols=115 Identities=16% Similarity=0.188 Sum_probs=73.5 Q ss_pred Cc--cccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh-- Q lcl|NC_021539. 2 CA--KWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS-- 77 (140) Q Consensus 2 ~a--~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~-- 77 (140) |+ +..+|.+.++.|++.++++|.++.+.|.+.|..-+. .+...+....||. + |.+|+ ++..+.. T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~-~i~~~ak~~apv~-T--G~Lr~--------SI~~~~~~~ 68 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAE-KIAGLAASLAPVD-E--GNLKN--------SIQIDYKNN 68 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCcc-c--hhhhc--------CeeEEeecC Confidence 33 345688999999999999999999999999876654 5667777888985 3 43333 3332222 Q ss_pred ccceEEeecCcccceeccccCCCCCC-------------------------CchHHHHHhchhhhhHHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFELVSKPKFNYLIFPDQGVGKNN-------------------------KTKQDFMLLGLEESTAEIVEMLEEDVLKE 132 (140) Q Consensus 78 NLgf~i~~k~kf~YLvFPd~GiG~sn-------------------------~~~q~FmerGl~~~~~~i~E~L~~~l~k~ 132 (140) .+..+|.+.. .|-.|-..|-|.|- --+|-||...++...+.+ .+. T Consensus 69 g~~~~V~~~~--~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~--------~~~ 138 (144) T protein:vir:59 69 GLTAEITVGA--EYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYF--------ERE 138 (144) T ss_pred cEEEEEecCC--CccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHH--------HHH Confidence 2334454433 46666666654431 124557777766655544 455 Q ss_pred HHHHhc Q lcl|NC_021539. 133 INNILG 138 (140) Q Consensus 133 in~~lg 138 (140) |++..| T Consensus 139 i~~~~g 144 (144) T protein:vir:59 139 MRRLRG 144 (144) T ss_pred HHHhcC Confidence 666667 No 59 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=95.02 E-value=0.00027 Score=40.13 Aligned_cols=106 Identities=10% Similarity=0.193 Sum_probs=77.8 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhc--hhccceEEe Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAK--MSNLGFELV 84 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~--~~NLgf~i~ 84 (140) ++++.+++|.+.+++.. +++.+.++|+.-+.++ .+.+...-|++ + |.+| ++++.+ ...+..+|. T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i-~~~ak~~aPv~-T--G~Lr--------~si~~~~~~~~~~~~V~ 66 (108) T protein:vir:74 1 MKITGIDALQKKLRKNA--TLDDVKHVVKSNTASM-NKNMQNLAPVD-T--GNMK--------RSITSEFTDGGLSGTTG 66 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHH-HHHHHHhCCCC-c--hhhh--------ccceeeeecCceEEEee Confidence 99999999999999864 5788999999887665 45788899997 3 3222 223322 345667888 Q ss_pred ecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 85 SKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 85 ~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) +...|+ .|-. -|+++-.+|-||.-.++...+++++.|++.|- T Consensus 67 ~~~~Ya--~~vE--~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 67 PHTDYA--GYVE--YGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred cCCCcc--ccee--ccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 777654 4444 46888888889999999988888877765554 No 60 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=94.99 E-value=0.0003 Score=39.93 Aligned_cols=110 Identities=15% Similarity=0.265 Sum_probs=74.3 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh--c Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS--N 78 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~--N 78 (140) |.. +++++.+++|.+.++++.. ++.+.++|+.-+.+ ..+.+....|+. + |.+| ++++.+.. . T Consensus 1 M~~--~i~i~Gld~l~~~L~~~~~--~~~~~~al~~~~~~-i~~~ak~~aPvd-T--G~Lr--------~si~~~~~~~~ 64 (112) T protein:vir:36 1 MKS--SLSFKGIDQLVKHLDKAAS--LKGVQQVVKSNTSN-MTANMQKLVPVD-T--GYMK--------RSIKMELTEGG 64 (112) T ss_pred Cce--eeeehhHHHHHHHHHhhhh--HHHHHHHHHHHHHH-HHHHHHHhCCCC-c--hhhh--------hceeeeecCCc Confidence 654 6777899999999998754 47788888776655 556777889997 3 2222 23332222 2 Q ss_pred cceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 79 LGFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 79 Lgf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) +..+|.|... |-.|-.- |+++-.+|-||+-.++...+++++.|.+.|- T Consensus 65 ~~~~V~~~~~--Ya~~vE~--GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 65 FSGQAGPHTD--YSAYVEY--GTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eEEEeecCCC--ccceeec--cccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 3444555443 6666554 5777788899999999998888877765555 No 61 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=94.81 E-value=0.0014 Score=36.16 Aligned_cols=135 Identities=15% Similarity=0.192 Sum_probs=87.1 Q ss_pred CCccccccHHHHHHHHHHHHhcch--HHHHHHHHHHhhhhhHHHHHhhhhcCCcccc----chhhhcccchhhhhhhhh- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPN--KSEEIINKTLETKAVPLAKQNIEKRINLSKN----WKGQLLNKNHAQTAGPFV- 73 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~--~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r----~k~~~rnK~HAk~s~pl~- 73 (140) |.. .+||++++++++++.+.-. ..+..|-++|..-|. .++..+.++-||-.. .+...++.+.++.+.+.+ T Consensus 1 m~~--~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~-~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~ 77 (163) T protein:vir:10 1 MSG--GFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGT-ELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHG 77 (163) T ss_pred CCC--ccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHH-HHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccc Confidence 544 6799999999999987633 345566666655444 447788899998431 123445666777766655 Q ss_pred hchhcc--ceEEee--------------cCcccceeccccCCCCCC--C-chHHHHHhchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_021539. 74 AKMSNL--GFELVS--------------KPKFNYLIFPDQGVGKNN--K-TKQDFMLLGLEESTAEIVEMLEEDVLKEIN 134 (140) Q Consensus 74 ~~~~NL--gf~i~~--------------k~kf~YLvFPd~GiG~sn--~-~~q~FmerGl~~~~~~i~E~L~~~l~k~in 134 (140) .+-.|| ||.+-+ +..| --|=..|==+-+ - --+.+|++.+++....+-+.|++.|++.+. T Consensus 78 k~tG~lr~swk~~~~~k~~~~~~v~v~N~~~Y--A~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~ 155 (163) T protein:vir:10 78 KQGGTLQKGWSKSRIEVSGRTYKQKVYNKVYY--APHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMR 155 (163) T ss_pred cccchhhccceecceeecCCceEEEEEecCCc--cchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234444 333322 2222 222233311111 1 245688999999999999999999999999 Q ss_pred HHhcCC Q lcl|NC_021539. 135 NILGGN 140 (140) Q Consensus 135 ~~lgg~ 140 (140) +.+-|+ T Consensus 156 k~~~~~ 161 (163) T protein:vir:10 156 KVVLGN 161 (163) T ss_pred HhhcCC Confidence 999999 No 62 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=94.56 E-value=0.00045 Score=38.94 Aligned_cols=110 Identities=14% Similarity=0.124 Sum_probs=73.5 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) ||+. +..+++|.+.++++|.++.+.+++.|.. ++..+...+....|+. + |.+|+.=+.+ .....+.. T Consensus 1 Ma~~---~~G~~~l~~~l~~~~~~~~~~~~~~~~~-~a~~v~~~ak~~aPv~-T--G~L~~Si~~~------~~~~~~~~ 67 (137) T protein:vir:95 1 MAKV---KYGNWDLVKELENYERDMERWVKRGIAK-TTAKIHNTIISLMPVD-T--GYLRESVTMD------FKDGGFTG 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcc-c--hhhhcCeeeE------eeCCceEE Confidence 4444 2488999999999999999999999976 4556777888889985 3 4444332211 11122334 Q ss_pred EEeecCcccceeccccCCCCC---------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKN---------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~s---------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+... |-.|-..|-|.| .--+|.||...++...+.|...|- T Consensus 68 ~V~~~~~--YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 68 VINIGSE--YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred EEecCCC--cccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 5554443 556666666554 124566999999998888887777 No 63 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=94.49 E-value=0.0014 Score=36.23 Aligned_cols=122 Identities=12% Similarity=0.112 Sum_probs=66.9 Q ss_pred CccccccHHHHHHHHHHHHhcch--HHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhcc Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPN--KSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNL 79 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~--~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NL 79 (140) |+..++|+++++++.+.+++.-+ ...+.+.+.|..-+..+ ...|...-||- + |.+|+-=.+. + ..+..- T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~-~~~vk~~tPVd-T--G~Lr~S~~~~---~--~~~~~~ 71 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQS-LRILEANTPVK-Q--GNLRRSWTAE---G--PTYGCG 71 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHH-HHHHHHhCCCC-c--chhccceeec---c--eeeecC Confidence 88889999999999999988744 34667777777666554 45578888986 2 3333211100 0 123333 Q ss_pred ceEEeecCcccceeccccC-------------CCCCCC--chHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQG-------------VGKNNK--TKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~G-------------iG~sn~--~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) |+++.--..=.|-.|=..| -|.... --|.|+++.++ ..+..+.+.+++.|++- T Consensus 72 ~~~~~V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~--------~~~~~~~~~l~k~l~~l 139 (144) T protein:vir:10 72 GWTIKLINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIP--------QIQRQLPQLVTEGLWGL 139 (144) T ss_pred eeEEEEecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHH--------HHHHHHHHHHHHHHHHH Confidence 4443332222333444444 333322 22345555444 45555555555555555 No 64 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=93.51 E-value=0.0012 Score=36.60 Aligned_cols=110 Identities=15% Similarity=0.115 Sum_probs=75.2 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) ||+. +...++|.+.++++|.++.+.+.+.|..- +..+...+....|+- + |.+|+-=+- ..+-..+.. T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~~~~~~~~~~~-a~~i~~~ak~~aPvd-T--G~Lr~SI~~------~~~~~~~~~ 67 (137) T protein:vir:97 1 MAKV---KYGNWDLVKELENYERDMERWVKRGIAKT-TAKIHNTIISLMPVD-T--GYLRESVTM------DFKDSGFTG 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCcc-c--cchhcccee------EeecCceEE Confidence 5544 24788999999999999999999998654 556777888889985 2 444432211 112223445 Q ss_pred EEeecCcccceeccccCCCCCC---------------------------CchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKNN---------------------------KTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~sn---------------------------~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+... |-.|-..|-|.|- --+|-||...++...+.|...|- T Consensus 68 ~V~~~~~--YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 68 VINIGSE--YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred EEecCCC--cccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 6666554 4567777766541 24566999999999998888877 No 65 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=93.51 E-value=0.0012 Score=36.60 Aligned_cols=110 Identities=15% Similarity=0.115 Sum_probs=75.2 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) ||+. +...++|.+.++++|.++.+.+.+.|..- +..+...+....|+- + |.+|+-=+- ..+-..+.. T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~~~~~~~~~~~-a~~i~~~ak~~aPvd-T--G~Lr~SI~~------~~~~~~~~~ 67 (137) T protein:vir:94 1 MAKV---KYGNWDLVKELENYERDMERWVKRGIAKT-TAKIHNTIISLMPVD-T--GYLRESVTM------DFKDSGFTG 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCcc-c--cchhcccee------EeecCceEE Confidence 5544 24788999999999999999999998654 556777888889985 2 444432211 112223445 Q ss_pred EEeecCcccceeccccCCCCCC---------------------------CchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKNN---------------------------KTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~sn---------------------------~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+... |-.|-..|-|.|- --+|-||...++...+.|...|- T Consensus 68 ~V~~~~~--YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 68 VINIGSE--YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred EEecCCC--cccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 6666554 4567777766541 24566999999999998888877 No 66 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=93.51 E-value=0.0012 Score=36.60 Aligned_cols=110 Identities=15% Similarity=0.115 Sum_probs=75.2 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) ||+. +...++|.+.++++|.++.+.+.+.|..- +..+...+....|+- + |.+|+-=+- ..+-..+.. T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~~~~~~~~~~~-a~~i~~~ak~~aPvd-T--G~Lr~SI~~------~~~~~~~~~ 67 (137) T protein:vir:93 1 MAKV---KYGNWDLVKELENYERDMERWVKRGIAKT-TAKIHNTIISLMPVD-T--GYLRESVTM------DFKDSGFTG 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCcc-c--cchhcccee------EeecCceEE Confidence 5544 24788999999999999999999998654 556777888889985 2 444432211 112223445 Q ss_pred EEeecCcccceeccccCCCCCC---------------------------CchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKNN---------------------------KTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~sn---------------------------~~~q~FmerGl~~~~~~i~E~L~ 126 (140) +|.+... |-.|-..|-|.|- --+|-||...++...+.|...|- T Consensus 68 ~V~~~~~--YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 68 VINIGSE--YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred EEecCCC--cccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 6666554 4567777766541 24566999999999998888877 No 67 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=93.07 E-value=0.0014 Score=36.18 Aligned_cols=106 Identities=12% Similarity=0.199 Sum_probs=75.4 Q ss_pred ccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh--hchhccceEEe Q lcl|NC_021539. 7 VDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV--AKMSNLGFELV 84 (140) Q Consensus 7 ld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~--~~~~NLgf~i~ 84 (140) ++++.+++|++.+++.. .+..+.++|+.-+.+++. .+...-||. + |.+| +++. .+...+..+|. T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~-~ak~~apvd-T--G~Lr--------~si~~~~~~~~~~~~V~ 66 (108) T protein:vir:98 1 MKITGIDALQKKLRKNA--TLNDVKHVVKRNTVSMNK-NMQNLAPVD-T--GNMK--------RSITSEFTDGGLTGTTI 66 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHH-HHHHhCCCC-c--hhhH--------hhceeeeecCceEEEee Confidence 99999999999999875 467788888887776554 567788996 3 2222 2233 22334566777 Q ss_pred ecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHH Q lcl|NC_021539. 85 SKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVL 130 (140) Q Consensus 85 ~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~ 130 (140) +...|+ .|-. .||++-.+|-||...++...+++.+.|.+.|- T Consensus 67 ~~~~Ya--~~vE--~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 67 PHTDYA--GYVE--YGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred cCCCcc--ceee--ccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 766543 4444 36787788999999999999888888766555 No 68 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=90.71 E-value=0.019 Score=29.96 Aligned_cols=138 Identities=7% Similarity=0.112 Sum_probs=93.6 Q ss_pred CCc--cccccHHHHHHHHHHHHhc-chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh Q lcl|NC_021539. 1 MCA--KWSVDFADVDKLTELISKI-PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS 77 (140) Q Consensus 1 m~a--~~sld~s~~e~L~~~m~~i-P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~ 77 (140) |++ -+.|..+..-+++.+|.++ -.+.-+.+-++ +.+.+.+++...--.-|+-.|-+..-|.+.--++..+++..-. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a-~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT 79 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREA-NKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTAS 79 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHH-HHHHHHHHHHHHHhhcCCcccccccccccCcchhhcccccccc Confidence 665 3788899999999999998 22221222111 2344555555555566775553333334444566667765555 Q ss_pred ccceEEeecC--cccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 78 NLGFELVSKP--KFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 78 NLgf~i~~k~--kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) -=+-.|++-+ .--|--|=+-|-=..+-.+++|..+|+-..+|+....-++.+.+++++.||- T Consensus 80 ~raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 80 AKGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred ccceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHHHHHHhcC Confidence 5567777743 5566555555644445569999999999999999999999999999999998 No 69 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=90.66 E-value=0.02 Score=29.94 Aligned_cols=138 Identities=7% Similarity=0.112 Sum_probs=91.7 Q ss_pred CCc--cccccHHHHHHHHHHHHhc-chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh Q lcl|NC_021539. 1 MCA--KWSVDFADVDKLTELISKI-PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS 77 (140) Q Consensus 1 m~a--~~sld~s~~e~L~~~m~~i-P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~ 77 (140) |++ -+.|..+..-++..+|.++ -.+.-+.+-++ +.+.+.+++...--.-|++-|--..-|...--++..+++..-. T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a-~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT 79 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREA-NKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTAS 79 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHH-HHHHHHHHHHHHHhhcCCcccccccccccccchhhcccccccc Confidence 665 3788899999999999998 22221222111 2344555555555556776431111112233456666765555 Q ss_pred ccceEEee--cCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 78 NLGFELVS--KPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 78 NLgf~i~~--k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) -=+-.|++ +..--|--|=+-|-=+.+-.+++|..+|+-..+|+....-++.+.+++++.||- T Consensus 80 ~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 80 AKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred ccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHHHHHHhcC Confidence 55677887 334667666666644445569999999999999999999999999999999998 No 70 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=90.02 E-value=0.0052 Score=33.10 Aligned_cols=110 Identities=18% Similarity=0.225 Sum_probs=69.4 Q ss_pred CccccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhcc Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNL 79 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NL 79 (140) |++ +++..+++|.+.++++ |.++++++-+.+. +++..+++.....-||. + |. +.++++.+..-+ T Consensus 1 Ma~--i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~-~~~~~~~~~a~~~apvd-T--G~--------Lr~sI~~~~~~~ 66 (112) T protein:vir:96 1 MAT--IEFEGLDEMAQSLLKNASSERRSKVLRKYGA-KLKEAAVSKAQFKKGYS-T--GA--------TRRSITLEAGSD 66 (112) T ss_pred Cce--eeehHHHHHHHHHHhhcCHHHHHHHHHHHHH-HHHHHHHHHhhhcCCCC-c--hh--------hhhceeeecCce Confidence 444 7889999999999998 4455554443322 22334455555566876 3 21 222344444455 Q ss_pred ceEEeecCcccceeccccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHH Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDV 129 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l 129 (140) ..+|.|... |=.|- =.||++-.+|-||.-.++...+++++.|.+-- T Consensus 67 ~~~v~~~~~--Ya~~v--E~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 67 RAVVEALTN--YSGYL--EVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred EEEecCCCC--cccee--ccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 666666554 43343 45799989999999999998888866665422 No 71 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=86.36 E-value=0.0099 Score=31.58 Aligned_cols=123 Identities=16% Similarity=0.180 Sum_probs=85.2 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hchhcc Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AKMSNL 79 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~~~NL 79 (140) -||+.-.++.|+++|++. |.++-++.|++|.+ +....++.+..-|++.++. | --..-.+.|+|-. .++.=+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~-age~v~~~~K~~~~~fkDT-G--~t~~ev~~s~p~~~~G~r~V~v 76 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIA-GAKVIVEEVKKQLKPSKDT-G--ALINEVSFSKPEWINGKRTITV 76 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHH-HHHHHHHHHHhhhhhhhhc-c--ceeccEEecCeeecCCceEEEE Confidence 689999999999999999 99999999999976 5566778888899987442 1 1233456777764 335556 Q ss_pred ceEEeecCcccceeccccCC---CCCCCchHH---HHHhchhhhhHHHHHHHHHHHHHH Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGV---GKNNKTKQD---FMLLGLEESTAEIVEMLEEDVLKE 132 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~Gi---G~sn~~~q~---FmerGl~~~~~~i~E~L~~~l~k~ 132 (140) || -=|+.+|.|.-+=-.|- |+.+...++ =+++=++...+..-+.+.++|-+. T Consensus 77 gW-~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 77 HW-RGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EE-EcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 66 34577899988888884 333322222 344555555666666666666665 No 72 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=86.36 E-value=0.0099 Score=31.58 Aligned_cols=123 Identities=16% Similarity=0.180 Sum_probs=85.2 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hchhcc Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AKMSNL 79 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~~~NL 79 (140) -||+.-.++.|+++|++. |.++-++.|++|.+ +....++.+..-|++.++. | --..-.+.|+|-. .++.=+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~-age~v~~~~K~~~~~fkDT-G--~t~~ev~~s~p~~~~G~r~V~v 76 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIA-GAKVIVEEVKKQLKPSKDT-G--ALINEVSFSKPEWINGKRTITV 76 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHH-HHHHHHHHHHhhhhhhhhc-c--ceeccEEecCeeecCCceEEEE Confidence 689999999999999999 99999999999976 5566778888899987442 1 1233456777764 335556 Q ss_pred ceEEeecCcccceeccccCC---CCCCCchHH---HHHhchhhhhHHHHHHHHHHHHHH Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGV---GKNNKTKQD---FMLLGLEESTAEIVEMLEEDVLKE 132 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~Gi---G~sn~~~q~---FmerGl~~~~~~i~E~L~~~l~k~ 132 (140) || -=|+.+|.|.-+=-.|- |+.+...++ =+++=++...+..-+.+.++|-+. T Consensus 77 gW-~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 77 HW-RGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EE-EcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 66 34577899988888884 333322222 344555555666666666666665 No 73 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=85.91 E-value=0.012 Score=31.19 Aligned_cols=123 Identities=15% Similarity=0.156 Sum_probs=84.8 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hchhcc Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AKMSNL 79 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~~~NL 79 (140) -|++.-++++|+++|++. |.++-++.|++|.+ +....++.+...|++.++. |. -..-.+.|+|-. .++.=+ T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~-age~v~~~~K~~~~~fkDT-Ga--ti~ev~~s~p~~~~G~r~V~v 76 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIA-GAKVIVEEIKKQLKPSEDS-GA--LISEIGRTEPEWIKGKRTVTI 76 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHH-HhHHHHHHHHhhcCccccc-cc--eeccEeecCeeecCCceEEEE Confidence 688999999999999999 99999999999975 5567788899999998542 21 234466677754 235555 Q ss_pred ceEEeecCcccceeccccCCC--C-CCCchHH---HHHhchhhhhHHHHHHHHHHHHHH Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGVG--K-NNKTKQD---FMLLGLEESTAEIVEMLEEDVLKE 132 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~GiG--~-sn~~~q~---FmerGl~~~~~~i~E~L~~~l~k~ 132 (140) || -=|+.+|.|.-+=-.|-= + .+...++ =+++=++...+..-+.+.++|-+. T Consensus 77 gW-~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 77 RW-RGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EE-EcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 66 345778888888777762 2 2222222 344555556666666666666665 No 74 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=85.81 E-value=0.023 Score=29.61 Aligned_cols=116 Identities=15% Similarity=0.119 Sum_probs=56.6 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) |++|..|-+.- +.+.++-.+.++.+-++=......++.......-||- -|.+| +++......-|+ T Consensus 1 ~~~~~f~~~~~----~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvd---tG~L~--------~SI~~~v~~~g~ 65 (141) T protein:vir:78 1 MNEFEFDSNIP----KARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNND---TGEYA--------QKSGYKVRKSSK 65 (141) T ss_pred CcchhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc---cchhh--------cceeeeeecCCc Confidence 77776663322 2233333344443333211222223333334456765 23333 344332222234 Q ss_pred EEeecCcccceeccccCCCCCC------------------------CchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 82 ELVSKPKFNYLIFPDQGVGKNN------------------------KTKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 82 ~i~~k~kf~YLvFPd~GiG~sn------------------------~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) +++=-..=.|-+|-..|=|.+- .-+|.||...+....++|.+ .|++.+ T Consensus 66 ~~~V~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~--------~i~~~~ 137 (141) T protein:vir:78 66 EVIVGNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRV--------FTERAL 137 (141) T ss_pred EEEEecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHH--------HHHHHh Confidence 4333334457777777766531 24778999888887766544 444455 Q ss_pred cCC Q lcl|NC_021539. 138 GGN 140 (140) Q Consensus 138 gg~ 140 (140) +|- T Consensus 138 ~~l 140 (141) T protein:vir:78 138 RGI 140 (141) T ss_pred hcc Confidence 555 No 75 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=79.61 E-value=0.072 Score=26.85 Aligned_cols=125 Identities=14% Similarity=0.128 Sum_probs=77.9 Q ss_pred CCccccccHHHHHHHHHHHHh-cch-HHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISK-IPN-KSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AK 75 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~-iP~-~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~ 75 (140) |..-.+|. .+++|+++|+. +-. ++-++.|++|. +|+..+++.+..-||++.+.-. -+.-.+.|+|-. .+ T Consensus 1 ~~~~aevk--Gv~Eilk~lE~klG~~~v~ri~nkAL~-~~ge~v~~~lK~~~~~f~DTG~---t~dev~~s~~~~~~G~r 74 (132) T protein:vir:96 1 MSGFANLK--GVEELLANMEKKLGPAKVNRVVNRSLK-EIGKELEPSFKSAISIYKRTGE---TTESAVVSGVRREDGIP 74 (132) T ss_pred CCcccccc--CHHHHHHHHHHhhCHHHHHHHhHHHHH-HHHHHHHHHHHHhhhhhhhcch---hhcceeecCeeecCCce Confidence 54444454 89999999998 544 68999999996 4678889999999999865322 333456667764 23 Q ss_pred hhccceEEeecCcccceeccc-cCCCCC-CCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 76 MSNLGFELVSKPKFNYLIFPD-QGVGKN-NKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 76 ~~NLgf~i~~k~kf~YLvFPd-~GiG~s-n~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) +.=+||. ++=-|||+=| .|.|+- ++---=++++=++...+.....+ -+++.+.|-| T Consensus 75 ~V~VgW~----GpR~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~----~~elkk~l~~ 132 (132) T protein:vir:96 75 KVKLGFT----TPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGI----RDKLKRGFDG 132 (132) T ss_pred EEEeccc----CCceeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHH----HHHHHHHhcC Confidence 5556664 3334677776 677665 44333355555555554333333 3334444445 No 76 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=78.45 E-value=0.012 Score=31.19 Aligned_cols=123 Identities=11% Similarity=-0.016 Sum_probs=67.2 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHH--------HHHHHhhhhhHHHH-Hhhhhc-----CCccccchhhhcccchh Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEI--------INKTLETKAVPLAK-QNIEKR-----INLSKNWKGQLLNKNHA 66 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~--------IN~~L~tkg~~~a~-~~I~~~-----iPvS~r~k~~~rnK~HA 66 (140) |..+.+ +.+.+.++.+.|+.+-++.=.+ =+..=..+|+|+|. -.+..+ +|--. T Consensus 7 ~~~k~~-~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~------------ 73 (200) T protein:vir:99 7 KSNSVA-APLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGT------------ 73 (200) T ss_pred eeeeee-cchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCc------------ Confidence 444444 3357777777776643221000 00001234555533 222221 22211 Q ss_pred hhhhhhhhchhccceEEeecCcccceeccccCCCCC-CCchHH-HHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 67 QTAGPFVAKMSNLGFELVSKPKFNYLIFPDQGVGKN-NKTKQD-FMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 67 k~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~GiG~s-n~~~q~-FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) +...+-...-.-.|.+-+-+..|+|.+| +|.+ ..+|.| ||+-++++..+++.+.+.+.+.+++.--+... T Consensus 74 ~~~~~~~~~g~~~g~rfv~k~~~~~~~~----~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~ 145 (200) T protein:vir:99 74 KYIKDAIVDGRYVGTRFVHKSFQGEHEV----TKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPE 145 (200) T ss_pred cccccccccccccccccccccccceeee----eccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Confidence 1111111112223555556778888888 7888 578877 99999999999999988888877765433222 No 77 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=78.18 E-value=0.018 Score=30.14 Aligned_cols=87 Identities=17% Similarity=0.290 Sum_probs=58.8 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccce Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGF 81 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf 81 (140) |+++++++...++|.+.+++-+.. +.+-+++.+-|+.+ ...+....|++ ++ .+.++++.+..|-|| T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~--~~v~~vv~~~~~~l-~~~ak~~ap~d-TG----------~lrrSI~~~~~~~g~ 66 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNM--NTVKKVVKKHTANL-MTATQQAVPVD-TG----------HLKQSAQIQISRDGF 66 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccH--HHHHHHHHHHHHHH-HHHHHHhCCCC-cc----------ccceeeeEEeecCCe Confidence 778999999999999999887654 44788888888887 67778889998 32 234455555666666 Q ss_pred EEe-----ecCcccceeccccCCCCCCCchHHHHHh Q lcl|NC_021539. 82 ELV-----SKPKFNYLIFPDQGVGKNNKTKQDFMLL 112 (140) Q Consensus 82 ~i~-----~k~kf~YLvFPd~GiG~sn~~~q~Fmer 112 (140) +-. |+.. |=.|-- .||. ||+- T Consensus 67 ~~~v~~~gp~a~--Ya~YvE--~GTR------~M~A 92 (92) T protein:vir:99 67 TGSVTYGGGLVN--YAAYVE--FGTR------FMDS 92 (92) T ss_pred eEEEEeccCccc--cccccc--ccee------ecCC Confidence 443 4444 333322 3554 4443 No 78 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=76.28 E-value=0.073 Score=26.82 Aligned_cols=125 Identities=15% Similarity=0.177 Sum_probs=79.6 Q ss_pred CCccccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AK 75 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~ 75 (140) |..=.+|. .+++++++|+.- |.++-++.|++|. ++....++.+..-|+++.+. |. -..-...|+|-. .+ T Consensus 7 ~~~~aevk--Gv~Eilk~lE~klG~~~~~ri~nkAL~-~~ge~v~~~lK~~~~~fkDT-Ga--t~dev~~s~p~~~~G~r 80 (138) T protein:vir:98 7 MSGFANLK--GVEELLANMEKKLGPAKVNRVVNRSLK-EIGKELEPSFKSAISIYKRT-GE--TTESAVVSGVRREDGIP 80 (138) T ss_pred cccccccc--CHHHHHHHHHHhhCHHhhhhhhhHHHH-HHHHHHHHHHHhhhhhhhhc-cc--eeeeeeecCeeecCCce Confidence 44433444 899999999985 6678999999996 46788899999999999653 21 333456666664 34 Q ss_pred hhccceEEeecCcccceeccc-cCCCCC-CCchHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021539. 76 MSNLGFELVSKPKFNYLIFPD-QGVGKN-NKTKQDFMLLGLEESTAEIVEMLEEDVLKEINNILGG 139 (140) Q Consensus 76 ~~NLgf~i~~k~kf~YLvFPd-~GiG~s-n~~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg 139 (140) +.=+||+ ++=-|||+=| .|.|+- ++----++++=++...+.-.+.+..+| .+-|-| T Consensus 81 ~V~igW~----GpR~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el----~k~l~~ 138 (138) T protein:vir:98 81 KVKLGFT----TPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKL----KRGFDG 138 (138) T ss_pred EEEEeee----cCeeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHH----HHHhcC Confidence 6667775 3333577777 677764 333323566655555555554444333 344445 No 79 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=59.76 E-value=0.083 Score=26.52 Aligned_cols=118 Identities=14% Similarity=0.074 Sum_probs=57.5 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhh---h--hh--- Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAG---P--FV--- 73 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~---p--l~--- 73 (140) |.+.++.++.++ +.++.++...++++-+.|..- +...........||- . |.+|+-=+.+... + +. T Consensus 1 m~~~~~~~~gl~---~~l~~~~~~~~~~~~~~i~~~-a~~v~~~Ak~~aPv~-t--G~Lr~SI~~~~~~~~~~~~~~~~v 73 (142) T protein:vir:86 1 MVQVSVRYEGFD---YNPVGAAAQVGPILRRTHSSL-TRQIANETRARVPVL-T--GHLGRSVREDPQVMVTPFHVSGGV 73 (142) T ss_pred CceeEEEeeecc---hhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCcc-c--hhhhcceeeeeccccccceEEEEe Confidence 556666666554 466677777888888887644 445556667778883 2 3232221111000 0 00 Q ss_pred ------hchhccce---EEeecCcccceeccccCCCCC----C---CchHHHHHhchhhhhHHHHHHHHH Q lcl|NC_021539. 74 ------AKMSNLGF---ELVSKPKFNYLIFPDQGVGKN----N---KTKQDFMLLGLEESTAEIVEMLEE 127 (140) Q Consensus 74 ------~~~~NLgf---~i~~k~kf~YLvFPd~GiG~s----n---~~~q~FmerGl~~~~~~i~E~L~~ 127 (140) .-.-+.|= .|+|+. -..|.|+--|.+-- + -.||.|++..++...++...-..+ T Consensus 74 ~~~a~YA~~ve~GT~ph~i~pk~-~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 74 TAHAKYAAAVHEGTRPHVIRAKH-AQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ccCccccceeccCCccceecccc-CceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 01111221 244442 35666665554221 1 136779988887543332221111 No 80 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=59.76 E-value=0.083 Score=26.52 Aligned_cols=118 Identities=14% Similarity=0.074 Sum_probs=57.5 Q ss_pred CccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhh---h--hh--- Q lcl|NC_021539. 2 CAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAG---P--FV--- 73 (140) Q Consensus 2 ~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~---p--l~--- 73 (140) |.+.++.++.++ +.++.++...++++-+.|..- +...........||- . |.+|+-=+.+... + +. T Consensus 1 m~~~~~~~~gl~---~~l~~~~~~~~~~~~~~i~~~-a~~v~~~Ak~~aPv~-t--G~Lr~SI~~~~~~~~~~~~~~~~v 73 (142) T protein:vir:99 1 MVQVSVRYEGFD---YNPVGAAAQVGPILRRTHSSL-TRQIANETRARVPVL-T--GHLGRSVREDPQVMVTPFHVSGGV 73 (142) T ss_pred CceeEEEeeecc---hhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCcc-c--hhhhcceeeeeccccccceEEEEe Confidence 556666666554 466677777888888887644 445556667778883 2 3232221111000 0 00 Q ss_pred ------hchhccce---EEeecCcccceeccccCCCCC----C---CchHHHHHhchhhhhHHHHHHHHH Q lcl|NC_021539. 74 ------AKMSNLGF---ELVSKPKFNYLIFPDQGVGKN----N---KTKQDFMLLGLEESTAEIVEMLEE 127 (140) Q Consensus 74 ------~~~~NLgf---~i~~k~kf~YLvFPd~GiG~s----n---~~~q~FmerGl~~~~~~i~E~L~~ 127 (140) .-.-+.|= .|+|+. -..|.|+--|.+-- + -.||.|++..++...++...-..+ T Consensus 74 ~~~a~YA~~ve~GT~ph~i~pk~-~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 74 TAHAKYAAAVHEGTRPHVIRAKH-AQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ccCccccceeccCCccceecccc-CceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 01111221 244442 35666665554221 1 136779988887543332221111 No 81 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=52.90 E-value=0.54 Score=22.04 Aligned_cols=108 Identities=15% Similarity=-0.004 Sum_probs=60.6 Q ss_pred CCccccccHHHHHH-HHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhcc Q lcl|NC_021539. 1 MCAKWSVDFADVDK-LTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNL 79 (140) Q Consensus 1 m~a~~sld~s~~e~-L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NL 79 (140) |..+..||++.+++ |++..+ ..-+.+-|+++ .-+.+.+|.-. |.+++ |.. -.+- T Consensus 1 M~vkV~id~~~~~~~l~~a~~---~aq~~~~~ev~---------~~~~~yVP~~t---G~L~~------s~~----~~~~ 55 (112) T protein:vir:80 1 MPIKVRVDLSKAKGSVKKAKE---RGQFALINQAA---------ADIALYVPFLS---GDLSN------QYV----IMND 55 (112) T ss_pred CceeEEeehHHHHHHHHHHHH---HHHHHHHHHHH---------HHhhcCCCccc---Ccccc------cee----eccC Confidence 99999999999875 443322 11223333333 33567888863 33322 221 1233 Q ss_pred ceEEeecCcccceeccccCCCCC--CC-chHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGVGKN--NK-TKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~GiG~s--n~-~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) |..+..+|-=.|+-|=..|-|.. ++ +-.+++||-....- +.+.+.+.+.+.+-| T Consensus 56 g~I~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~----~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 56 KEIMWTSIYARRLYNGINFNFTLTHHPLAGPKWDQRAKVDKL----ESWIEVAQKAVEEGL 112 (112) T ss_pred ceEEecCchhhHhhhcccCCCCcCCCCCcchhhHHHHHhhhh----HHHHHHHHHHHhhcC Confidence 66666665555555655555544 33 55679999666554 445555555566656 No 82 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=51.98 E-value=0.25 Score=23.89 Aligned_cols=89 Identities=16% Similarity=0.229 Sum_probs=52.8 Q ss_pred HHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccceEEeecCcccceeccccCCCCC--- Q lcl|NC_021539. 26 SEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGFELVSKPKFNYLIFPDQGVGKN--- 102 (140) Q Consensus 26 sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~GiG~s--- 102 (140) ++++|-++|+.-+ -.+...+....|+. + |.+ -++++.+..+=|++..=.+.=.|-.|-..|-|.| T Consensus 1 v~~~v~~~~~~~~-~~i~~~ak~~aPv~-T--G~L--------r~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~ 68 (116) T protein:vir:12 1 MERWVKRGIAKTT-AKIHNTIISLMPVD-T--GYL--------RESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATG 68 (116) T ss_pred ChHHHHHHHHHHH-HHHHHHHHHhCCcC-c--ccc--------cccceEEeecCcEEEEEecCCCcccccccCCcccccC Confidence 7888888887554 44677777788986 3 322 2334333332233333223334666666673333 Q ss_pred ------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 103 ------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 103 ------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) .--+|.||...++...+.|...|- T Consensus 69 ~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred CCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 235667888888888887766665 No 83 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=51.98 E-value=0.25 Score=23.89 Aligned_cols=89 Identities=16% Similarity=0.229 Sum_probs=52.8 Q ss_pred HHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccceEEeecCcccceeccccCCCCC--- Q lcl|NC_021539. 26 SEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLGFELVSKPKFNYLIFPDQGVGKN--- 102 (140) Q Consensus 26 sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~GiG~s--- 102 (140) ++++|-++|+.-+ -.+...+....|+. + |.+ -++++.+..+=|++..=.+.=.|-.|-..|-|.| T Consensus 1 v~~~v~~~~~~~~-~~i~~~ak~~aPv~-T--G~L--------r~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~ 68 (116) T protein:vir:97 1 MERWVKRGIAKTT-AKIHNTIISLMPVD-T--GYL--------RESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATG 68 (116) T ss_pred ChHHHHHHHHHHH-HHHHHHHHHhCCcC-c--ccc--------cccceEEeecCcEEEEEecCCCcccccccCCcccccC Confidence 7888888887554 44677777788986 3 322 2334333332233333223334666666673333 Q ss_pred ------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 103 ------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 103 ------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) .--+|.||...++...+.|...|- T Consensus 69 ~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred CCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 235667888888888887766665 No 84 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=51.71 E-value=0.34 Score=23.18 Aligned_cols=132 Identities=8% Similarity=0.125 Sum_probs=55.0 Q ss_pred CC-ccccccHHHHHHHHHHHHhcchHHHHH--HHHHHhhhhhHHHHHhhhh--------cCCccccchhhhccc------ Q lcl|NC_021539. 1 MC-AKWSVDFADVDKLTELISKIPNKSEEI--INKTLETKAVPLAKQNIEK--------RINLSKNWKGQLLNK------ 63 (140) Q Consensus 1 m~-a~~sld~s~~e~L~~~m~~iP~~sE~~--IN~~L~tkg~~~a~~~I~~--------~iPvS~r~k~~~rnK------ 63 (140) || .+.++|++. |...++++....+.. +-+.+...-.....++|.. ..|.|.....+.+.. T Consensus 1 M~~i~i~~d~~~---~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~ 77 (190) T protein:vir:99 1 MAGITLEWDGRR---ALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILT 77 (190) T ss_pred CceeEEEecHHH---HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccce Confidence 43 334555543 344444444333322 2223333333334444432 245543221111111 Q ss_pred chhhhhhhhhhch----------------hccceEEeecCcccceeccc-c---------------------CCCCC-CC Q lcl|NC_021539. 64 NHAQTAGPFVAKM----------------SNLGFELVSKPKFNYLIFPD-Q---------------------GVGKN-NK 104 (140) Q Consensus 64 ~HAk~s~pl~~~~----------------~NLgf~i~~k~kf~YLvFPd-~---------------------GiG~s-n~ 104 (140) ..-.+.++++..- -|.|-+|.+++...|..|.- . .++.+ -. T Consensus 78 ~tg~L~~Si~~~~~~~~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~ 157 (190) T protein:vir:99 78 LDGHLRNLLRYQLDGSELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQ 157 (190) T ss_pred ecHHHHHHHhheecCcEEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccceee Confidence 1123445555332 23444454443332222110 0 01111 23 Q ss_pred chHH-HHHhchhhh-hHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 105 TKQD-FMLLGLEES-TAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 105 ~~q~-FmerGl~~~-~~~i~E~L~~~l~k~in~~l 137 (140) +|++ || |+... ...|.+.+.+.|.+.+++-- T Consensus 158 IPaRpfL--G~s~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 158 MPARPWL--GTSSQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred ecCcccC--CCCHHHHHHHHHHHHHHHHHHHhhcC Confidence 5676 87 66543 44567777777777777766 No 85 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=51.20 E-value=0.59 Score=21.85 Aligned_cols=108 Identities=13% Similarity=0.014 Sum_probs=61.5 Q ss_pred CCccccccHHHHHH-HHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhcc Q lcl|NC_021539. 1 MCAKWSVDFADVDK-LTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNL 79 (140) Q Consensus 1 m~a~~sld~s~~e~-L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NL 79 (140) |..+..+|++.+++ |.+..++ .-..+.|++| .-+.+.+|.-. |.+++ |.. -.+- T Consensus 1 M~vkv~vn~~~~~~~l~~a~~r---~q~~~~~ev~---------~~~~~yVP~~~---G~L~~------S~~----~~~~ 55 (112) T protein:vir:45 1 MPIKVRVDLSKAKGSVKKAKER---GQFALINQAA---------ADIALYVPFLS---GDLSN------QYV----IMND 55 (112) T ss_pred CceeEEeehHHHHHHHHHHHHH---HHHHHHHHHH---------HHhhcCCcccc---Ccccc------cee----eccC Confidence 99999999999873 4433332 1222333333 33577889863 33333 222 2334 Q ss_pred ceEEeecCcccceeccccCCCC--CCC-chHHHHHhchhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021539. 80 GFELVSKPKFNYLIFPDQGVGK--NNK-TKQDFMLLGLEESTAEIVEMLEEDVLKEINNIL 137 (140) Q Consensus 80 gf~i~~k~kf~YLvFPd~GiG~--sn~-~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~l 137 (140) |..+..+|-=.|+-|=..+-|+ +++ +-.+++||.....-+.|++. +.+.+++-| T Consensus 56 g~I~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~----~~k~~~~gl 112 (112) T protein:vir:45 56 KEIMWTSIYARRLYKGINFNFTLTHHPLAGPEWDQRAKIDKMDVWEKV----AQKAVEEGL 112 (112) T ss_pred CeEEecChhhHHhhhccccCCCCCCCCCCchhhHHHHHHhhHHHHHHH----HHHHHhhcC Confidence 6667776644556565555553 344 55779999776665555544 444444444 No 86 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=50.91 E-value=0.6 Score=21.82 Aligned_cols=115 Identities=11% Similarity=0.118 Sum_probs=66.7 Q ss_pred CccccccHHHH-HHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhh-chhcc Q lcl|NC_021539. 2 CAKWSVDFADV-DKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVA-KMSNL 79 (140) Q Consensus 2 ~a~~sld~s~~-e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~-~~~NL 79 (140) |++ ++.+++ +++.+.++.++....+.+++.++ +++..+.+.|...-|+. ++ .-+|.-.-=+. +..+. T Consensus 1 Ma~--i~id~la~~I~~~L~~y~~~v~~~v~~~v~-~~a~~~~~~ik~~aP~r-TG-------~y~ksw~vk~~~~~g~~ 69 (126) T protein:vir:81 1 MAN--ITIDRLADELLQAVKEYTDDVAEGVRKKVD-ETARKVLKEAQALAPKR-TG-------EYARTFTITKEDGYGTT 69 (126) T ss_pred Ccc--cchhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhCCcc-cc-------hhhccccccccccCCcc Confidence 555 888886 56899999999999999988876 56778888999999974 32 11221110011 12333 Q ss_pred ceEEeecCcccc---eeccccCCCCCCC----ch-HHHHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 80 GFELVSKPKFNY---LIFPDQGVGKNNK----TK-QDFMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 80 gf~i~~k~kf~Y---LvFPd~GiG~sn~----~~-q~FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) .+.+.-++.|.. |=| |..++ .+ +-|++- .-|.+.++|.+.|.+.|.|- T Consensus 70 ~~vv~~~~~~~l~HLLEf-----Gha~r~gGrV~a~Phi~P--------a~e~~~~~~~~~i~~~l~~g 125 (126) T protein:vir:81 70 KRIIWNKKHYRRVHLLEF-----GHAKVNGGRVKEYPHLRP--------AYDKHGARLPDELKRVIENG 125 (126) T ss_pred eEEEeccCCCCceeeeec-----ceecCCCCccCCCcchHH--------HHHHHHHHHHHHHHHHhhcC Confidence 345555666665 434 33322 33 335432 23445555555555555333 No 87 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=40.28 E-value=0.98 Score=20.64 Aligned_cols=117 Identities=10% Similarity=0.218 Sum_probs=64.5 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh---hchhcc Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV---AKMSNL 79 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~---~~~~NL 79 (140) -||+.-++++|+++|+.- |-++-++.|++|.. +....++.+...|-+....=. -..-...|+|-. .++.=+ T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~-~g~~v~~~lK~~~~~fkDTGa---ti~ev~~s~p~~~~G~r~V~i 76 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIA-GATLVAKTLKSEFVQFKDTGA---SIDEINIEKPSYDKGVRSIKI 76 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHH-HHHHHHHHHHHhhcchhcccc---eeeeEEecCeeeeCCceEEEE Confidence 689999999999999983 55567899999975 557777888888877743211 223355666642 334555 Q ss_pred ceE-------EeecCcccceecc----ccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHH Q lcl|NC_021539. 80 GFE-------LVSKPKFNYLIFP----DQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDV 129 (140) Q Consensus 80 gf~-------i~~k~kf~YLvFP----d~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l 129 (140) ||+ |+-=--|||--++ --|.| .-|+.++.+=......+-++|.+.| T Consensus 77 ~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G----~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 77 DWKGPKDRYKIIHLNEYGYTRNGKKITPAGTG----SVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred EEecCCCceeEEEeeccceecCCCeEccchhh----HHHHHHHhhhHHHHHHHHHHHHhhC Confidence 553 2222335554432 11222 2233333333333333333333333 No 88 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=36.04 E-value=0.34 Score=23.17 Aligned_cols=120 Identities=13% Similarity=0.111 Sum_probs=60.4 Q ss_pred ccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhh-------------hhHHHHHhhhhcCCccccchhhhcccchhhhh Q lcl|NC_021539. 3 AKWSVDFADVDKLTELISKIPNKSEEIINKTLETK-------------AVPLAKQNIEKRINLSKNWKGQLLNKNHAQTA 69 (140) Q Consensus 3 a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tk-------------g~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s 69 (140) =+-+-|.+.++++.+.++.+-.++=.+ =++..+ |+.+-..+=.-.||.-+. .| T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~v~v--Gi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a--------~~---- 66 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYSLQI--GLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEA--------GD---- 66 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCEEEE--EEecCCCcchhheeehhhcCCeeecCCceeeecchhh--------hc---- Confidence 233478888999999988764332110 011122 222111111111222110 01 Q ss_pred hhhhhchhccceEEeecCcccceeccccCC------CCC-CCchHH-HHHhchhhhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021539. 70 GPFVAKMSNLGFELVSKPKFNYLIFPDQGV------GKN-NKTKQD-FMLLGLEESTAEIVEMLEEDVLKEINNILGGN 140 (140) Q Consensus 70 ~pl~~~~~NLgf~i~~k~kf~YLvFPd~Gi------G~s-n~~~q~-FmerGl~~~~~~i~E~L~~~l~k~in~~lgg~ 140 (140) .+-.+.+=...|++.+++.+.=+.|+ |.. ..+|.| ||+-++++..+++.+.+.+.+.+++.--.-.. T Consensus 67 ----~k~~~~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~ 141 (199) T protein:vir:80 67 ----RRARDIPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAE 141 (199) T ss_pred ----ccccccCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHH Confidence 11112221335777777665433332 333 466766 99999999999988888877776665311111 No 89 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=35.03 E-value=0.74 Score=21.29 Aligned_cols=87 Identities=13% Similarity=0.174 Sum_probs=51.5 Q ss_pred HHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchh--ccceEEeecCcccceeccccCCCCC- Q lcl|NC_021539. 26 SEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMS--NLGFELVSKPKFNYLIFPDQGVGKN- 102 (140) Q Consensus 26 sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~--NLgf~i~~k~kf~YLvFPd~GiG~s- 102 (140) ++++|-+.|+.- +..+...+...-|+. + |.+| ++++.+.. ++.-+|.+. =.|-.|-..|=|.| T Consensus 1 v~~~v~~~~~~~-~~~i~~~ak~~apv~-T--G~Lr--------~SI~~~~~~~~~~~~V~~~--~~Ya~yvE~GTg~~~ 66 (116) T protein:vir:95 1 MERWVKRGIAKT-TAKIHNTIISLMPVD-T--GYLR--------ESVTMDFKDGGFTGVINIG--SEYAIYVNYGTGIYA 66 (116) T ss_pred ChHHHHHHHHHH-HHHHHHHHHhhCCcc-c--cccc--------cceeEEeecCcEEEEEecC--CCccceeecCccccc Confidence 777777777754 444566777788986 3 3222 33332222 333444443 34666666663332 Q ss_pred --------------------------CCchHHHHHhchhhhhHHHHHHHH Q lcl|NC_021539. 103 --------------------------NKTKQDFMLLGLEESTAEIVEMLE 126 (140) Q Consensus 103 --------------------------n~~~q~FmerGl~~~~~~i~E~L~ 126 (140) .--+|-||...++...+.|...|- T Consensus 67 ~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 67 TGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 235677998888888887766666 No 90 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=34.69 E-value=1.3 Score=20.00 Aligned_cols=135 Identities=19% Similarity=0.315 Sum_probs=78.4 Q ss_pred ccccccHHHHHHHHHHHHhcchHHH----HHHHHHHhhhhhHHHHHhhhhcCCccccchh----------hhcccchhhh Q lcl|NC_021539. 3 AKWSVDFADVDKLTELISKIPNKSE----EIINKTLETKAVPLAKQNIEKRINLSKNWKG----------QLLNKNHAQT 68 (140) Q Consensus 3 a~~sld~s~~e~L~~~m~~iP~~sE----~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~----------~~rnK~HAk~ 68 (140) =.++||...+++..+..+++|+.+- .+||++..+.+..+|.++|...+.+-+.-=. --+++--|.- T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~I 80 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAVI 80 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEEE Confidence 4699999999999999999998765 5789999999999999998887765421100 0112222222 Q ss_pred hhhhh-hc-----------------------------hhccceEEeecC-------cccceeccccCCCCCC-------- Q lcl|NC_021539. 69 AGPFV-AK-----------------------------MSNLGFELVSKP-------KFNYLIFPDQGVGKNN-------- 103 (140) Q Consensus 69 s~pl~-~~-----------------------------~~NLgf~i~~k~-------kf~YLvFPd~GiG~sn-------- 103 (140) +.+.- .. .+-=||.+.=++ +|+-=+|=-.|-|.+. T Consensus 81 ~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~k 160 (205) T protein:vir:63 81 GARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGATK 160 (205) T ss_pred ecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCcee Confidence 11111 00 111123333222 2222222233333221 Q ss_pred ----------C-chHHHHHhchhhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021539. 104 ----------K-TKQDFMLLGLEESTAEIVEMLEEDVLKEINNILG 138 (140) Q Consensus 104 ----------~-~~q~FmerGl~~~~~~i~E~L~~~l~k~in~~lg 138 (140) | .+|=|-+.- +.-+++|.+.|-+++++..-+++- T Consensus 161 ~~~~~k~LYGPSV~Qvf~~~~-e~I~~~i~~~l~~~f~r~~~~~~~ 205 (205) T protein:vir:63 161 LSNNVYLLYGPSVDQVFRTVA-DDITTEVLDALADEFLRQFTRLSE 205 (205) T ss_pred cCCceEEEEcCcHHHHHhhhh-hhhhHHHHHHHHHHHHHhhhhhcC Confidence 1 456665432 566778888888888888777777 No 91 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=34.31 E-value=0.95 Score=20.70 Aligned_cols=126 Identities=11% Similarity=0.106 Sum_probs=58.6 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhc--------CCccccchhhhcccch------- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKR--------INLSKNWKGQLLNKNH------- 65 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~--------iPvS~r~k~~~rnK~H------- 65 (140) |....++.+++ +.+.+.++++....+.. +-|-.+....+..++..+ -|+|..+....+.+.+ T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d~--~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:99 1 MTTRIDVELDD-QEVRQRLALLMRSVTDT--LPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhhH--HHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcch Confidence 88777776665 45666677666555431 333333444444444433 3666443332222111 Q ss_pred --hhhhhhhhhchhccceEEeecCcccceeccccC----CCCCCCchHH-HHHhchh-------hhhHHHHHHHHHHHHH Q lcl|NC_021539. 66 --AQTAGPFVAKMSNLGFELVSKPKFNYLIFPDQG----VGKNNKTKQD-FMLLGLE-------ESTAEIVEMLEEDVLK 131 (140) Q Consensus 66 --Ak~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~G----iG~sn~~~q~-FmerGl~-------~~~~~i~E~L~~~l~k 131 (140) -.+.++++..-..-+.+|=. ...|=.+=..| .+..-..|.+ |+ |++ +....|.+.+++.|.+ T Consensus 78 ~tg~L~~Si~~~~~~~~v~vGt--n~~YA~iHqfGg~~~~~~~v~iPaRpfL--G~s~~~~l~~e~~~~I~~~i~~~l~~ 153 (155) T protein:vir:99 78 VTNALARSVTTWADRNEAGIGS--NLVYAAIHQFGGDAGRGHQVEIPARRYL--PFDENGQLAAGARQSILEIVLTALSR 153 (155) T ss_pred hchhhhhhhhceecCCEEEEec--CccchhhhhcccccCCCCccccCCcccc--CCCCccccchHHHHHHHHHHHHHHhc Confidence 11233333333333333322 12233333334 1112234555 76 443 3455677777777777 Q ss_pred HH Q lcl|NC_021539. 132 EI 133 (140) Q Consensus 132 ~i 133 (140) .- T Consensus 154 ~~ 155 (155) T protein:vir:99 154 NR 155 (155) T ss_pred cC Confidence 66 No 92 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=33.10 E-value=0.89 Score=20.87 Aligned_cols=127 Identities=13% Similarity=0.231 Sum_probs=51.9 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcC-----Cc-cccchh-----hhcccchh--- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRI-----NL-SKNWKG-----QLLNKNHA--- 66 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~i-----Pv-S~r~k~-----~~rnK~HA--- 66 (140) |...+++.++ .+.|+..++++-+..+. +.|-.+....+..++..++ |- ...|+. ..+.+++. T Consensus 1 ms~~i~~~~d-~~~l~~~L~~l~~~~~~---~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~ 76 (156) T protein:vir:19 1 MSLDMNVAVD-VRRIQLALDELGTVTRD---RAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVP 76 (156) T ss_pred CeEEEEEeec-HHHHHHHHHHHHhhhcc---HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCC Confidence 6655444432 23455555555444332 1222223333333333222 32 334421 11111121 Q ss_pred --------hhhhhhhhchhccceEEeecCcccceeccccC----CCCC-CCchHH-HHHhchhh-hhHHHHHHHHHHHHH Q lcl|NC_021539. 67 --------QTAGPFVAKMSNLGFELVSKPKFNYLIFPDQG----VGKN-NKTKQD-FMLLGLEE-STAEIVEMLEEDVLK 131 (140) Q Consensus 67 --------k~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~G----iG~s-n~~~q~-FmerGl~~-~~~~i~E~L~~~l~k 131 (140) .+.++++.+-.+-+.+|=.-..|+ .+=..| ++.. -..|++ |+ |+.. ....|.+.+.+.|.+ T Consensus 77 ~~~L~~tg~L~~Si~~~~~~~~v~vGt~~~yA--~vHqfG~~~~~~~~~~~iPaRpfL--G~s~~d~~~I~~~i~~~l~~ 152 (156) T protein:vir:19 77 GSILTLHGDLARSITTDYGQDYALIGSPKIYA--AIHQWGGTPDMAPRPAGVPARPYM--GLDKTGEQEIFDAIRKRVSA 152 (156) T ss_pred CcchhhhHHHHHHhhheecCCEEEEecchhhh--HHhhcCcccccCCCccccCCcccc--CCCHHHHHHHHHHHHHHHHH Confidence 122233322233333332222121 111122 1222 146666 77 6644 345678888888888 Q ss_pred HHHH Q lcl|NC_021539. 132 EINN 135 (140) Q Consensus 132 ~in~ 135 (140) ++.+ T Consensus 153 ~~~~ 156 (156) T protein:vir:19 153 ALRQ 156 (156) T ss_pred HhhC Confidence 8888 No 93 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=30.76 E-value=1.5 Score=19.54 Aligned_cols=118 Identities=13% Similarity=0.148 Sum_probs=63.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhhhchhccc Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFVAKMSNLG 80 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~~~~~NLg 80 (140) |+.+.++|- =.+++.+.++++++.+++.+.+.++. .+..+++.|...=|+. ++ .-||. .+.++.--| T Consensus 1 m~~~v~id~-L~~~i~~~L~~y~~~v~~~v~~~v~~-~a~~~~~~lk~~sP~~-TG-------~yaks---W~~k~~~~~ 67 (123) T protein:vir:96 1 MANKISIDD-LAKTIESEVRNWTKDVVDDIDDIKKD-ITKNGVKQLRESSPKR-TG-------DYAKN---WTSQKLKNG 67 (123) T ss_pred CCcccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhCCcc-cc-------ccccc---eeeeecCCe Confidence 888777652 23567888899999998888888765 4567788888877864 32 11221 121111112 Q ss_pred --eEEeecCcccceeccccCCCCCC--Cch-HHHHHhchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 81 --FELVSKPKFNYLIFPDQGVGKNN--KTK-QDFMLLGLEESTAEIVEMLEEDVLKEINN 135 (140) Q Consensus 81 --f~i~~k~kf~YLvFPd~GiG~sn--~~~-q~FmerGl~~~~~~i~E~L~~~l~k~in~ 135 (140) .++..+..|..--.=.-|=++-| +++ +.|+. .+.+++.+.|.+.+.+.|++ T Consensus 68 ~~~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~----paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 68 DQVIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIA----PVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred eEEEEEecCCcceEEeeecceeecCCceeCcchhhh----HHHHHHHHHHHHHHHHHhcC Confidence 22333444422111122322221 333 33544 45566666666666667766 No 94 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=29.37 E-value=1.4 Score=19.86 Aligned_cols=115 Identities=10% Similarity=0.212 Sum_probs=69.5 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh--h---chh Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV--A---KMS 77 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~--~---~~~ 77 (140) -||+.-.+++|+++|+.- |-++-++.|++|.. +....++.+..-|.+....=. -..-...|+|-. . ++. T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~-~g~~v~~~lK~~~~~fkDTGa---ti~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNE-ASEFFIKALKKEFESFKDTGA---SIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHH-HHHHHHHHHHhhhhhhhcccc---eeeeEEecCeeeccCCcceEE Confidence 689999999999999998 88889999999975 567788889999988754211 223356677743 2 344 Q ss_pred ccceE-------EeecCcccceecc----ccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFE-------LVSKPKFNYLIFP----DQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 78 NLgf~-------i~~k~kf~YLvFP----d~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) =+||+ |+-=--|||--++ --|.|. -|++|+ ...+.--+.+.++|-+ T Consensus 77 ~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~----i~~a~~----~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 77 LIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV----IAKTLA----ANERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCceeEEEeeccceecCCCeEccchhhH----HHHHHH----hhhHHHHHHHHHHhcC Confidence 56663 2223345665443 122221 223333 3333333333344433 No 95 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=26.97 E-value=1.7 Score=19.35 Aligned_cols=115 Identities=11% Similarity=0.217 Sum_probs=67.9 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh-----hchh Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV-----AKMS 77 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~-----~~~~ 77 (140) -||+.-.+++|+++|+.- |-++-++.|++|.. +....++.+..-|.+....=. -..-...|+|-. .++. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~-~g~~v~~~lK~~~~~fkDTGa---ti~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNE-ASEFFIKALKKEFESFKDTGA---SIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHH-HHHHHHHHHHhhhhhhhcccc---eeeeEEecCeeeccCCcceeE Confidence 689999999999999983 55567999999975 567788889999988754211 233456677743 2445 Q ss_pred ccceE-------EeecCcccceecc----ccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFE-------LVSKPKFNYLIFP----DQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 78 NLgf~-------i~~k~kf~YLvFP----d~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) =+||+ |+-=--|||--++ --|.|. -|++| +.+.+.--+.+.++|-+ T Consensus 77 ~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~----i~~a~----~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 77 LIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV----IAKTL----AASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCceeEEEeeccceecCCCeEccchhhH----HHHHH----HhhhHHHHHHHHHHhcC Confidence 66663 2223345665443 122221 22333 33333333334444444 No 96 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=26.97 E-value=1.7 Score=19.35 Aligned_cols=115 Identities=11% Similarity=0.217 Sum_probs=67.9 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh-----hchh Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV-----AKMS 77 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~-----~~~~ 77 (140) -||+.-.+++|+++|+.- |-++-++.|++|.. +....++.+..-|.+....=. -..-...|+|-. .++. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~-~g~~v~~~lK~~~~~fkDTGa---ti~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNE-ASEFFIKALKKEFESFKDTGA---SIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHH-HHHHHHHHHHhhhhhhhcccc---eeeeEEecCeeeccCCcceeE Confidence 689999999999999983 55567999999975 567788889999988754211 233456677743 2445 Q ss_pred ccceE-------EeecCcccceecc----ccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFE-------LVSKPKFNYLIFP----DQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 78 NLgf~-------i~~k~kf~YLvFP----d~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) =+||+ |+-=--|||--++ --|.|. -|++| +.+.+.--+.+.++|-+ T Consensus 77 ~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~----i~~a~----~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 77 LIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV----IAKTL----AASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCceeEEEeeccceecCCCeEccchhhH----HHHHH----HhhhHHHHHHHHHHhcC Confidence 66663 2223345665443 122221 22333 33333333334444444 No 97 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=26.97 E-value=1.7 Score=19.35 Aligned_cols=115 Identities=11% Similarity=0.217 Sum_probs=67.9 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh-----hchh Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV-----AKMS 77 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~-----~~~~ 77 (140) -||+.-.+++|+++|+.- |-++-++.|++|.. +....++.+..-|.+....=. -..-...|+|-. .++. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~-~g~~v~~~lK~~~~~fkDTGa---ti~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNE-ASEFFIKALKKEFESFKDTGA---SIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHH-HHHHHHHHHHhhhhhhhcccc---eeeeEEecCeeeccCCcceeE Confidence 689999999999999983 55567999999975 567788889999988754211 233456677743 2445 Q ss_pred ccceE-------EeecCcccceecc----ccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFE-------LVSKPKFNYLIFP----DQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 78 NLgf~-------i~~k~kf~YLvFP----d~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) =+||+ |+-=--|||--++ --|.|. -|++| +.+.+.--+.+.++|-+ T Consensus 77 ~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~----i~~a~----~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 77 LIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV----IAKTL----AASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCceeEEEeeccceecCCCeEccchhhH----HHHHH----HhhhHHHHHHHHHHhcC Confidence 66663 2223345665443 122221 22333 33333333334444444 No 98 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=26.97 E-value=1.7 Score=19.35 Aligned_cols=115 Identities=11% Similarity=0.217 Sum_probs=67.9 Q ss_pred ccccHHHHHHHHHHHHhc--chHHHHHHHHHHhhhhhHHHHHhhhhcCCccccchhhhcccchhhhhhhhh-----hchh Q lcl|NC_021539. 5 WSVDFADVDKLTELISKI--PNKSEEIINKTLETKAVPLAKQNIEKRINLSKNWKGQLLNKNHAQTAGPFV-----AKMS 77 (140) Q Consensus 5 ~sld~s~~e~L~~~m~~i--P~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~r~k~~~rnK~HAk~s~pl~-----~~~~ 77 (140) -||+.-.+++|+++|+.- |-++-++.|++|.. +....++.+..-|.+....=. -..-...|+|-. .++. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~-~g~~v~~~lK~~~~~fkDTGa---ti~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNE-ASEFFIKALKKEFESFKDTGA---SIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHH-HHHHHHHHHHhhhhhhhcccc---eeeeEEecCeeeccCCcceeE Confidence 689999999999999983 55567999999975 567788889999988754211 233456677743 2445 Q ss_pred ccceE-------EeecCcccceecc----ccCCCCCCCchHHHHHhchhhhhHHHHHHHHHHHHH Q lcl|NC_021539. 78 NLGFE-------LVSKPKFNYLIFP----DQGVGKNNKTKQDFMLLGLEESTAEIVEMLEEDVLK 131 (140) Q Consensus 78 NLgf~-------i~~k~kf~YLvFP----d~GiG~sn~~~q~FmerGl~~~~~~i~E~L~~~l~k 131 (140) =+||+ |+-=--|||--++ --|.|. -|++| +.+.+.--+.+.++|-+ T Consensus 77 ~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~----i~~a~----~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 77 LIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV----IAKTL----AASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCceeEEEeeccceecCCCeEccchhhH----HHHHH----HhhhHHHHHHHHHHhcC Confidence 66663 2223345665443 122221 22333 33333333334444444 No 99 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=25.29 E-value=2.1 Score=18.85 Aligned_cols=126 Identities=11% Similarity=0.110 Sum_probs=57.8 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHHHHHHHHHhhhhhHHHHHhhhhc--------CCccccchhhhcccch------- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSEEIINKTLETKAVPLAKQNIEKR--------INLSKNWKGQLLNKNH------- 65 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE~~IN~~L~tkg~~~a~~~I~~~--------iPvS~r~k~~~rnK~H------- 65 (140) |....++.+++ +.+.+.++++....+.. +.|-.+....+..++..+ .|+|..++...+.+.+ T Consensus 1 M~~~i~i~~d~-~~~~~~L~~l~~~~~d~--~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:79 1 MTTRIDVELDD-QEVRQRLAVLMRSVTDT--LPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhhH--HHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccc Confidence 88777777666 45666677666655521 233333333333333333 4666544333322221 Q ss_pred --hhhhhhhhhchhccceEEeecCcccceeccccC----CCCCCCchHH-HHHhchh-------hhhHHHHHHHHHHHHH Q lcl|NC_021539. 66 --AQTAGPFVAKMSNLGFELVSKPKFNYLIFPDQG----VGKNNKTKQD-FMLLGLE-------ESTAEIVEMLEEDVLK 131 (140) Q Consensus 66 --Ak~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~G----iG~sn~~~q~-FmerGl~-------~~~~~i~E~L~~~l~k 131 (140) -.+.++++.+...-+..|=. ..-|-.+=..| .+..-..|++ |+ |++ +..+.|++.+++.|.+ T Consensus 78 ~tG~L~~Si~~~~~~~~v~vGt--~~~YA~iHqfGg~~~~~~~v~iPaRpfL--G~s~~~~l~~~~~~~I~~~i~~~l~r 153 (155) T protein:vir:79 78 VTNALARSVTTWADRNEAGIGS--NLVYAAIHQFGGDAGRGHQVEIPARRYL--PFDENGQLAAGARQSILEVVLTALSR 153 (155) T ss_pred cchhhhhhhhceecCCEEEEec--CchhhhhhhcccccCCCCccccCCcccc--CCCCccccchHHHHHHHHHHHHHHHh Confidence 11333333333333333322 12233333334 1112235665 76 433 3345677777777766 Q ss_pred HH Q lcl|NC_021539. 132 EI 133 (140) Q Consensus 132 ~i 133 (140) .= T Consensus 154 ~r 155 (155) T protein:vir:79 154 NR 155 (155) T ss_pred cC Confidence 54 No 100 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=21.57 E-value=2.6 Score=18.33 Aligned_cols=103 Identities=13% Similarity=0.089 Sum_probs=58.1 Q ss_pred cchHHHHHHHHHHhhhhhHHHHHhhhhcCCccc--------cchhhhcccchhhhhhhhh-hchhccceEEeecCcccce Q lcl|NC_021539. 22 IPNKSEEIINKTLETKAVPLAKQNIEKRINLSK--------NWKGQLLNKNHAQTAGPFV-AKMSNLGFELVSKPKFNYL 92 (140) Q Consensus 22 iP~~sE~~IN~~L~tkg~~~a~~~I~~~iPvS~--------r~k~~~rnK~HAk~s~pl~-~~~~NLgf~i~~k~kf~YL 92 (140) +++..++++|++ +......+..+.||-+ .|+-.-.+|.--..+|+.. ...-+-|.++++.. +. T Consensus 1 l~~~~~~~~~~~-----a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~v~N~~eYA~~VE~GHRq~~g~-g~-- 72 (116) T protein:vir:10 1 MSKNLRRAKNNI-----GNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGVVSNNVEYIHHLEYGHRTRQGT-GT-- 72 (116) T ss_pred CchHHHHHHHHH-----HHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCceeecCCcccccccCCceeeCCc-ce-- Confidence 777777777754 3344556677888742 2222111222112334444 45667777777652 11 Q ss_pred eccccCCCCCC---Cch-HHHHHhchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_021539. 93 IFPDQGVGKNN---KTK-QDFMLLGLEESTAEIVEMLEEDVLKEIN 134 (140) Q Consensus 93 vFPd~GiG~sn---~~~-q~FmerGl~~~~~~i~E~L~~~l~k~in 134 (140) |.-+.|+-. -.+ +.||++.+.+....+-++|++.+++.+| T Consensus 73 --~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 73 --SENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred --ecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 112234332 233 4477788887777777778888888777 No 101 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=21.52 E-value=2.6 Score=18.32 Aligned_cols=131 Identities=11% Similarity=0.072 Sum_probs=51.9 Q ss_pred CCccccccHHHHHHHHHHHHhcchHHH------HHHHHHHhhhhhHHHHHhhh-hcCCccccchhhh------------- Q lcl|NC_021539. 1 MCAKWSVDFADVDKLTELISKIPNKSE------EIINKTLETKAVPLAKQNIE-KRINLSKNWKGQL------------- 60 (140) Q Consensus 1 m~a~~sld~s~~e~L~~~m~~iP~~sE------~~IN~~L~tkg~~~a~~~I~-~~iPvS~r~k~~~------------- 60 (140) |..-.++.+++ +.+...++++=...+ +.|=+.|.+.--.-+.++-- .--|+|..+..+. T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~ 79 (175) T protein:vir:79 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccc Confidence 77655665544 233333333322221 22223333222111111110 1134443321111 Q ss_pred ----cccc--------hhhhhhhhhhchhccceEEeecCcccceecccc----CCCCCCCchHH-HHHhchhhh------ Q lcl|NC_021539. 61 ----LNKN--------HAQTAGPFVAKMSNLGFELVSKPKFNYLIFPDQ----GVGKNNKTKQD-FMLLGLEES------ 117 (140) Q Consensus 61 ----rnK~--------HAk~s~pl~~~~~NLgf~i~~k~kf~YLvFPd~----GiG~sn~~~q~-FmerGl~~~------ 117 (140) |.+. .-.+.++++.+-.+-+.+|=..-. |=.+=.. |.|..-.+|.+ | .|+++. T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~~--YAaiHqfGg~~~~~~~v~IPARPf--LG~s~~de~~~~ 155 (175) T protein:vir:79 80 TAAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGSNKE--YAAIQHFGGQAGRGLKVTIPGRAW--LPVTADGELQPE 155 (175) T ss_pred hhhHhhhccCCCcceechhhhhhhhheecCCEEEEecCcc--hhhHhhcccccCCCcccccCcccc--cCCCcccchhHH Confidence 1000 112344455444555555532111 1112222 23333356776 7 466443 Q ss_pred -hHHHHHHHHHHHHHHHHHH Q lcl|NC_021539. 118 -TAEIVEMLEEDVLKEINNI 136 (140) Q Consensus 118 -~~~i~E~L~~~l~k~in~~ 136 (140) .+.|++.+.+-|.+++.+- T Consensus 156 ~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 156 AVEPVLNTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHHHHHHHhccC Confidence 3566777777776666666 Done!