Query         lcl|NC_013597.1_cdsid_YP_003344800.1 [gene=D11S_2227] [protein=hypothetical protein] [protein_id=YP_003344800.1] [location=complement(13448..13807)]
Match_columns 119
No_of_seqs    101 out of 119
Neff          6.5
Searched_HMMs 1612
Date          Thu Nov 7 13:23:44 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_20 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_20_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:5256 Length: 119 # 100.0 7.4E-51 4.6E-54 295.4 12.3 119 1-119 1-119 (119)
2 protein:vir:107756 Length: 147 100.0 5.8E-45 3.6E-48 263.1 12.1 118 1-119 4-129 (147)
3 protein:vir:96108 Length: 155 100.0 2.4E-44 1.5E-47 259.7 12.0 119 1-119 1-138 (155)
4 protein:vir:99570 Length: 153 100.0 1.1E-43 7.1E-47 256.0 12.2 119 1-119 5-136 (153)
5 protein:vir:94064 Length: 167 100.0 2.7E-43 1.7E-46 253.9 11.8 119 1-119 5-134 (167)
6 protein:vir:78595 Length: 158 100.0 9.9E-43 6.1E-46 250.9 11.9 119 1-119 1-135 (158)
7 protein:vir:106739 Length: 158 100.0 9.9E-43 6.1E-46 250.9 11.9 119 1-119 1-135 (158)
8 protein:vir:3639 Length: 158 # 100.0 3.3E-42 2E-45 248.0 11.3 119 1-119 1-135 (158)
9 protein:vir:101559 Length: 158 100.0 3.3E-42 2E-45 248.0 11.3 119 1-119 1-135 (158)
10 protein:vir:107702 Length: 136 100.0 1.1E-39 6.8E-43 234.2 9.0 118 1-119 7-128 (136)
11 protein:vir:79640 Length: 134 100.0 7.1E-39 4.4E-42 229.7 8.7 118 1-119 4-126 (134)
12 protein:vir:103283 Length: 125 99.9 2.1E-30 1.3E-33 183.2 7.2 112 7-119 1-117 (125)
13 protein:vir:104344 Length: 132 99.9 3.7E-30 2.3E-33 182.0 8.0 116 1-119 1-123 (132)
14 protein:vir:80036 Length: 111 99.6 1.9E-18 1.2E-21 117.8 7.4 105 1-119 2-107 (111)
15 protein:vir:43 Length: 131 # N 95.6 0.00019 1.2E-07 40.9 7.8 109 1-117 1-131 (131)
16 protein:vir:98900 Length: 132 95.1 0.0008 4.9E-07 37.6 9.7 111 1-119 1-131 (132)
17 protein:vir:80967 Length: 131 94.7 0.00055 3.4E-07 38.4 7.8 109 1-117 1-131 (131)
18 protein:vir:105776 Length: 133 94.0 0.0017 1.1E-06 35.8 9.0 101 1-119 1-109 (133)
19 protein:vir:102961 Length: 131 93.8 0.0012 7.5E-07 36.6 7.7 108 3-112 1-131 (131)
20 protein:vir:95176 Length: 172 93.5 0.0049 3E-06 33.3 10.5 110 1-119 19-166 (172)
21 protein:vir:80389 Length: 172 93.4 0.0049 3E-06 33.2 10.4 115 1-119 17-167 (172)
22 protein:vir:4788 Length: 130 # 93.1 0.0035 2.2E-06 34.0 9.2 109 1-119 1-127 (130)
23 protein:vir:94955 Length: 170 86.6 0.044 2.7E-05 28.0 10.4 106 1-119 16-162 (170)
24 protein:vir:9821 Length: 138 # 82.8 0.05 3.1E-05 27.7 8.0 110 1-119 6-135 (138)
25 protein:vir:99517 Length: 124 80.7 0.054 3.4E-05 27.5 7.4 99 1-119 6-110 (124)
26 protein:vir:3970 Length: 110 # 79.5 0.1 6.4E-05 26.0 9.0 100 1-119 1-108 (110)
27 protein:vir:5976 Length: 102 # 76.2 0.044 2.7E-05 28.0 5.5 91 2-110 1-102 (102)
28 protein:vir:3615 Length: 110 # 75.4 0.15 9.1E-05 25.1 8.6 100 1-119 1-105 (110)
29 protein:vir:79050 Length: 133 73.7 0.11 6.9E-05 25.8 7.1 113 1-113 3-133 (133)
30 protein:vir:106596 Length: 128 71.3 0.19 0.00011 24.6 7.7 103 1-119 19-126 (128)
31 protein:vir:10365 Length: 115 65.7 0.21 0.00013 24.3 6.8 85 1-118 1-115 (115)
32 protein:vir:81159 Length: 95 # 65.6 0.16 9.7E-05 25.0 6.0 87 1-100 3-95 (95)
33 protein:vir:1887 Length: 108 # 62.9 0.18 0.00011 24.7 5.8 90 1-95 8-108 (108)
34 protein:vir:192 Length: 108 # 62.9 0.18 0.00011 24.7 5.8 90 1-95 8-108 (108)
35 protein:vir:100103 Length: 120 62.7 0.33 0.0002 23.2 7.6 86 1-114 7-120 (120)
36 protein:vir:96128 Length: 98 # 61.9 0.096 6E-05 26.2 4.2 91 2-106 1-98 (98)
37 protein:vir:741 Length: 110 # 61.9 0.34 0.00021 23.1 8.5 99 1-119 1-108 (110)
38 protein:vir:81069 Length: 115 59.8 0.38 0.00024 22.9 8.1 84 1-115 1-115 (115)
39 protein:vir:4998 Length: 106 # 59.1 0.12 7.7E-05 25.6 4.3 93 1-94 1-106 (106)
40 protein:vir:94507 Length: 113 57.6 0.43 0.00027 22.6 9.1 104 1-119 1-111 (113)
41 protein:vir:9877 Length: 114 # 56.3 0.46 0.00029 22.4 7.1 99 1-119 1-112 (114)
42 protein:vir:95004 Length: 169 56.1 0.46 0.00029 22.4 10.6 109 1-119 17-163 (169)
43 protein:vir:97069 Length: 115 55.4 0.46 0.00029 22.4 6.7 84 1-115 1-115 (115)
44 protein:vir:5742 Length: 110 # 54.8 0.15 9.3E-05 25.1 3.9 83 1-114 3-110 (110)
45 protein:vir:96831 Length: 98 # 53.5 0.19 0.00012 24.5 4.3 92 2-106 1-98 (98)
46 protein:vir:93592 Length: 108 49.8 0.31 0.00019 23.4 4.8 87 1-118 4-108 (108)
47 protein:vir:102158 Length: 99 49.5 0.31 0.00019 23.4 4.8 91 1-117 2-99 (99)
48 protein:vir:106583 Length: 105 48.4 0.67 0.00042 21.5 7.7 99 1-119 1-104 (105)
49 protein:vir:1640 Length: 132 # 47.7 0.6 0.00037 21.8 6.1 101 1-119 1-125 (132)
50 protein:vir:4954 Length: 104 # 47.3 0.28 0.00018 23.6 4.2 90 1-92 2-104 (104)
51 protein:vir:1241 Length: 104 # 47.3 0.71 0.00044 21.4 6.8 97 2-118 1-104 (104)
52 protein:vir:79701 Length: 144 46.3 0.74 0.00046 21.3 8.5 114 1-119 4-143 (144)
53 protein:vir:93740 Length: 104 46.2 0.38 0.00023 22.9 4.7 97 2-118 1-104 (104)
54 protein:vir:97145 Length: 110 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
55 protein:vir:96390 Length: 110 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
56 protein:vir:96221 Length: 110 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
57 protein:vir:9311 Length: 110 # 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
58 protein:vir:99796 Length: 110 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
59 protein:vir:78849 Length: 110 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
60 protein:vir:103957 Length: 110 45.9 0.75 0.00047 21.3 8.8 102 1-119 1-108 (110)
61 protein:vir:94761 Length: 132 45.3 0.77 0.00048 21.2 8.8 101 1-119 1-123 (132)
62 protein:vir:7410 Length: 107 # 44.6 0.49 0.00031 22.3 5.1 83 1-100 2-107 (107)
63 protein:vir:97430 Length: 104 43.4 0.46 0.00028 22.4 4.7 97 2-118 1-104 (104)
64 protein:vir:94492 Length: 104 43.4 0.46 0.00028 22.4 4.7 97 2-118 1-104 (104)
65 protein:vir:4857 Length: 104 # 42.6 0.45 0.00028 22.5 4.5 85 1-116 1-104 (104)
66 protein:vir:105005 Length: 96 42.4 0.51 0.00032 22.2 4.8 87 1-93 2-96 (96)
67 protein:vir:107614 Length: 96 42.4 0.51 0.00032 22.2 4.8 87 1-93 2-96 (96)
68 protein:vir:102083 Length: 96 42.4 0.51 0.00032 22.2 4.8 87 1-93 2-96 (96)
69 protein:vir:102863 Length: 96 42.4 0.51 0.00032 22.2 4.8 87 1-93 2-96 (96)
70 protein:vir:95071 Length: 104 42.2 0.5 0.00031 22.2 4.7 97 2-118 1-104 (104)
71 protein:vir:7857 Length: 188 # 40.2 0.27 0.00017 23.7 2.9 76 1-118 100-188 (188)
72 protein:vir:101652 Length: 188 40.2 0.27 0.00017 23.7 2.9 76 1-118 100-188 (188)
73 protein:vir:4512 Length: 107 # 39.1 1 0.00063 20.5 5.9 83 1-114 2-107 (107)
74 protein:vir:99922 Length: 165 36.7 0.69 0.00043 21.5 4.6 105 1-119 3-122 (165)
75 protein:vir:486 Length: 107 # 34.4 1.3 0.0008 20.0 7.7 83 1-114 2-107 (107)
76 protein:vir:97267 Length: 172 32.8 1.4 0.00087 19.8 9.6 113 1-119 18-170 (172)
77 protein:vir:9706 Length: 100 # 32.7 0.82 0.00051 21.0 4.3 85 1-116 1-100 (100)
78 protein:vir:4831 Length: 105 # 32.3 0.94 0.00058 20.7 4.6 85 1-116 2-105 (105)
79 protein:vir:100245 Length: 113 31.9 1.5 0.00091 19.7 7.2 83 1-114 3-113 (113)
80 protein:vir:1384 Length: 92 # 31.6 1 0.00062 20.6 4.6 85 2-116 1-92 (92)
81 protein:vir:80668 Length: 153 30.6 1.1 0.00066 20.4 4.6 101 1-119 4-114 (153)
82 protein:vir:1026 Length: 107 # 28.9 1.5 0.0009 19.7 5.0 85 1-100 2-107 (107)
83 protein:vir:9576 Length: 131 # 24.5 2.2 0.0013 18.7 8.6 105 1-119 1-122 (131)
84 protein:vir:9761 Length: 140 # 21.8 2.5 0.0016 18.4 8.8 105 1-119 1-122 (140)
85 protein:vir:107119 Length: 104 21.7 2.6 0.0016 18.4 6.9 97 2-118 1-104 (104)
86 protein:vir:105327 Length: 104 21.7 2.6 0.0016 18.4 6.9 97 2-118 1-104 (104)
87 protein:vir:4702 Length: 113 # 20.2 2.8 0.0017 18.1 6.4 94 1-99 1-113 (113)

No 1
>protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553
Probab=100.00 E-value=7.4e-51 Score=295.43
Aligned_cols=119 Identities=100% Similarity=1.403 Sum_probs=116.9 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhcccccccccceeeeeeccee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASESAGELS 80 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~svG~vS 80 (119) |||+++||++||||++|||++|+.||++|++++|+++||++++++++|||||+|+|+.....++++.+|+|+|+|+|+|| T Consensus 1 m~t~~~Fr~~~PeF~~~pd~~i~~~l~~A~~~l~~~~~g~~~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v~S~s~G~vS 80 (119) T protein:vir:52 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASESAGELS 80 (119) T ss_pred CCcHHHHHHhhhhccCCCHHHHHHHHHHHHHhhCCcCCchHHHHHHHHHHHHHHHhhhhhhccccccccceeeeeeccee Confidence 99999999999999999999999999999999999999999999999999999999988888888899999999999999 Q ss_pred eeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 81 VSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 81 vs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) |||+++++.+++++||++||||||||+|+|++|+||+|| T Consensus 81 vS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g~Gg~Va 119 (119) T protein:vir:52 81 VSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) T ss_pred eeeeccccCCcchhhhhcCHHHHHHHHHHHHhcCCCcCC Confidence 999999999999999999999999999999999999999 No 2 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=100.00 E-value=5.8e-45 Score=263.09 Aligned_cols=118 Identities=24% Similarity=0.264 Sum_probs=110.6 Q ss_pred CCCHHHHHHhhhhhcC---CCHHHHHHHHHHHHHHhCCcCc-----hhHHHHHHHHHHHHHHHHhhhhccccccccccee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK---TDAKRIGLFLSDAQAEVSKVQW-----GKLYDRGVMALTAHLLKLSADAEISGGAANRNLA 72 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~---vpd~~i~~~~~~A~~~~~~~~~-----g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vt 72 (119) -.++++||++||||+| +||++|+.||++|++++|+++| |++++++++|||||+|+|+..... 
+++++|+|+ T Consensus 4 ~fd~~~Fr~~fPeFad~~~~pd~~i~~~l~~A~~~l~~~~~~~~~~g~~~~~~l~Ll~AHll~l~~~~~~-g~g~~G~v~ 82 (147) T protein:vir:10 4 TLDITKFRALFPEFNNDVKYPDALLEQWYAVAGEYLGLTDYACGLNGNTLDLALMQLTAHLMKSATILSS-NKGAPMVMT 82 (147) T ss_pred ecCHHHHHHhcccccCCccCCHHHHHHHHHHHHHhhccccCCcccChhhHHHHHHHHHHHHHHHHHhhcc-CCCccccee Confidence 4578999999999986 7999999999999999999999 899999999999999999876654 456789999 Q ss_pred eeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 73 SESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 73 S~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) |+|||+|||||+++++.+++++||++||||||||+|+|++|+||+|+ T Consensus 83 Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~~l~~~~~~Gg~vv 129 (147) T protein:vir:10 83 SATIDKVSISTLAPPIKNGWQYWLSTTPYGQMLWALLSMRSSGGFVY 129 (147) T ss_pred eeeecceeeeeecCCCCCcchhhhhcCHHHHHHHHHHHhhCccceec Confidence 99999999999999999999999999999999999999999999999 No 3 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=100.00 E-value=2.4e-44 Score=259.72 Aligned_cols=119 Identities=32% Similarity=0.385 Sum_probs=108.2 Q ss_pred CC--CHHHHHHhhhhhcC---CCHHHHHHHHHHHHHHhCC------cCchhHHHHHHHHHHHHHHHHhhhhcc------- Q lcl|NC_013597. 
1 MP--LTEDFLLRYTEFGK---TDAKRIGLFLSDAQAEVSK------VQWGKLYDRGVMALTAHLLKLSADAEI------- 62 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~---vpd~~i~~~~~~A~~~~~~------~~~g~~~~~~~~l~~AH~l~l~~~~~~------- 62 (119) || ++++||++||||+| +||++|+.||++|++++++ ++||++++++++|||||+|.|+....+ T Consensus 1 ~v~fd~~~FR~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~s~~~~g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~ 80 (155) T protein:vir:96 1 MVIFDEQKFRTLFPEFADPASYPAVRLQLYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) T ss_pred CcccCHHHHHHhCccccCcccCCHHHHHHHHHHHHHhhcCCCccccccChHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 66 78999999999985 7999999999999999974 578999999999999999999764322 Q ss_pred -cccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 63 -SGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 63 -~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) .+++.+|+|+|||||+||||||++++.+++++||++||||||||+|+|++|+||+|+ T Consensus 81 ~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~~~~~~Gg~~v 138 (155) T protein:vir:96 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) T ss_pred ccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHHHHHHhccccccc Confidence 235678999999999999999999999999999999999999999999999999998 No 4 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=100.00 E-value=1.1e-43 Score=255.99 Aligned_cols=119 Identities=26% Similarity=0.317 Sum_probs=107.4 Q ss_pred CCCHHHHHHhhhhhcC---CCHHHHHHHHHHHHHHhCCcC------chhHHHHHHHHHHHHHHHHhhhhc----cccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK---TDAKRIGLFLSDAQAEVSKVQ------WGKLYDRGVMALTAHLLKLSADAE----ISGGAA 67 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~---vpd~~i~~~~~~A~~~~~~~~------~g~~~~~~~~l~~AH~l~l~~~~~----~~~~~~ 67 (119) --++++||++||||+| +||++|+.||++|++++++.+ +|+.++++++||+||+|+|+...+ ..+++. 
T Consensus 5 ~fd~~~Fr~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~~~~~~g~~~~~~l~Ll~AH~l~L~~~~~~~~~~a~~~~ 84 (153) T protein:vir:99 5 VYNDGLFRIMYPEFADQEKYPPEVIEIYYDTATLFITGSMFPCAALSGKQLVGALNMLTAHLMSLSMQRSQTALGATNDQ 84 (153) T ss_pred cCChHHHHHhcccccCccccCHHHHHHHHHHHHHhhcCccccccccChHHHHHHHHHHHHHHHHHHhhhhcccccCCCcc Confidence 4488999999999985 799999999999999998654 589999999999999999975432 234567 Q ss_pred ccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 68 NRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 68 ~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +|+|+|||||+||||||++++.+++++||++||||||||+|+|++|+||.|+ T Consensus 85 ~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fw~l~~~~~~Gg~v~ 136 (153) T protein:vir:99 85 GGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPYGQALWALLKMLSVGGFAI 136 (153) T ss_pred ccceeeeeecceeeeeecCCCCCchhHhhhcCHHHHHHHHHHHHhccccccc Confidence 8999999999999999999999999999999999999999999999999999 No 5 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=100.00 E-value=2.7e-43 Score=253.95 Aligned_cols=119 Identities=19% Similarity=0.205 Sum_probs=104.9 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh-CCcC-----chhHHHHHHHHHHHHHHHHhhhh-----ccccccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEV-SKVQ-----WGKLYDRGVMALTAHLLKLSADA-----EISGGAANR 69 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~-~~~~-----~g~~~~~~~~l~~AH~l~l~~~~-----~~~~~~~~g 69 (119) --++++||++||||+++||++|+.||++|++++ ++++ +++.++++++|||||+|+|+... 
..++++.+| T Consensus 5 ~Fd~~~FR~~fPeFa~~Pd~~i~~~l~~A~~~~l~~~~~s~~~~~~~~~~~l~LltAHll~L~~~~~a~~~~~~~~g~~G 84 (167) T protein:vir:94 5 VFDPTAFKLVYPEFVAVPDARLTALFNTVGYTILDNTDASVIVDPLRRAPLLDLLVAHMLALFGYVNADGSITPGTGTVG 84 (167) T ss_pred cCChHHHHHhchhcccCCHHHHHHHHHHHHHhhcCCCCcccccchhhHHHHHHHHHHHHHHHhhhhhhhcccccccccch Confidence 448899999999999999999999999998765 4333 45778999999999999996432 223456779 Q ss_pred ceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 70 NLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 70 ~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +|+|||||+|||||+++++.+++++||++||||||||+|+|++|+||+|+ T Consensus 85 ~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fwaL~~~~g~Gg~v~ 134 (167) T protein:vir:94 85 RVANASEGSVSTSLAYSTPTGAGEAWFTQTPYGAMYWAMSAPFRSFHYVA 134 (167) T ss_pred heeeccccceeeeeecCCCCCchhhhhhcCHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999 No 6 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=100.00 E-value=9.9e-43 Score=250.87 Aligned_cols=119 Identities=19% Similarity=0.280 Sum_probs=104.7 Q ss_pred CC--------CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC------chhHHHHHHHHHHHHHHHHhhhh-ccccc Q lcl|NC_013597. 1 MP--------LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ------WGKLYDRGVMALTAHLLKLSADA-EISGG 65 (119) Q Consensus 1 m~--------t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~------~g~~~~~~~~l~~AH~l~l~~~~-~~~~~ 65 (119) |. .+++||++||||+++||++|+.|+++|++++.+++ .++.++++++|||||+|+|+... 
..+++ T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~~a~~ 80 (158) T protein:vir:78 1 MSTPPYRITFDPAGFIAEYPEFATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFGATPTSANS 80 (158) T ss_pred CCCCCceEEcChHHHHHhchhhccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHHHHhHhhhccccC Confidence 44 56999999999999999999999999998875432 35778999999999999997654 34466 Q ss_pred ccccceeeeeecceeeeeccCc-cCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 66 AANRNLASESAGELSVSYTAPI-SANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~vSvs~~~~~-~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +++|+|+|||||+||||||+++ +.+++++||++||||||||+|++++++||+|+ T Consensus 81 g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~ 135 (158) T protein:vir:78 81 RPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMV 135 (158) T ss_pred CcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhccccccc Confidence 7899999999999999999865 56678999999999999999999999999999 No 7 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=100.00 E-value=9.9e-43 Score=250.87 Aligned_cols=119 Identities=19% Similarity=0.280 Sum_probs=104.7 Q ss_pred CC--------CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC------chhHHHHHHHHHHHHHHHHhhhh-ccccc Q lcl|NC_013597. 1 MP--------LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ------WGKLYDRGVMALTAHLLKLSADA-EISGG 65 (119) Q Consensus 1 m~--------t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~------~g~~~~~~~~l~~AH~l~l~~~~-~~~~~ 65 (119) |. .+++||++||||+++||++|+.|+++|++++.+++ .++.++++++|||||+|+|+... 
..+++ T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~~a~~ 80 (158) T protein:vir:10 1 MSTPPYRITFDPAGFIAEYPEFATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFGATPTSANS 80 (158) T ss_pred CCCCCceEEcChHHHHHhchhhccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHHHHhHhhhccccC Confidence 44 56999999999999999999999999998875432 35778999999999999997654 34466 Q ss_pred ccccceeeeeecceeeeeccCc-cCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 66 AANRNLASESAGELSVSYTAPI-SANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~vSvs~~~~~-~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +++|+|+|||||+||||||+++ +.+++++||++||||||||+|++++++||+|+ T Consensus 81 g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~ 135 (158) T protein:vir:10 81 RPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMV 135 (158) T ss_pred CcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhccccccc Confidence 7899999999999999999865 56678999999999999999999999999999 No 8 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=100.00 E-value=3.3e-42 Score=248.02 Aligned_cols=119 Identities=22% Similarity=0.315 Sum_probs=102.9 Q ss_pred CC--------CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh-CCcCc-----hhHHHHHHHHHHHHHHHHhhhhccc-cc Q lcl|NC_013597. 1 MP--------LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEV-SKVQW-----GKLYDRGVMALTAHLLKLSADAEIS-GG 65 (119) Q Consensus 1 m~--------t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~-~~~~~-----g~~~~~~~~l~~AH~l~l~~~~~~~-~~ 65 (119) |. 
.+++||++||||+++||++|+.|+++|++++ ++++| ++.++++++|||||+|.|+.....+ ++ T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~g~~~ 80 (158) T protein:vir:36 1 MSTPPYRITFDPAGFIAEYPEFATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFSAAPTSANS 80 (158) T ss_pred CCCCCceEEcChHHHHHhCcccccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 44 5699999999999999999999999998754 44333 4678899999999999998654444 45 Q ss_pred ccccceeeeeecceeeeeccCc-cCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 66 AANRNLASESAGELSVSYTAPI-SANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~vSvs~~~~~-~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +.+|+|+|||||+||||||+++ +.+++++||++||||||||+|++++|+||+|+ T Consensus 81 g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~ 135 (158) T protein:vir:36 81 RPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMV 135 (158) T ss_pred CcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCcccccc Confidence 6679999999999999999754 56788999999999999999999999999999 No 9 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=100.00 E-value=3.3e-42 Score=248.02 Aligned_cols=119 Identities=22% Similarity=0.315 Sum_probs=102.9 Q ss_pred CC--------CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh-CCcCc-----hhHHHHHHHHHHHHHHHHhhhhccc-cc Q lcl|NC_013597. 1 MP--------LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEV-SKVQW-----GKLYDRGVMALTAHLLKLSADAEIS-GG 65 (119) Q Consensus 1 m~--------t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~-~~~~~-----g~~~~~~~~l~~AH~l~l~~~~~~~-~~ 65 (119) |. 
.+++||++||||+++||++|+.|+++|++++ ++++| ++.++++++|||||+|.|+.....+ ++ T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~g~~~ 80 (158) T protein:vir:10 1 MSTPPYRITFDPAGFIAEYPEFATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFSAAPTSANS 80 (158) T ss_pred CCCCCceEEcChHHHHHhCcccccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 44 5699999999999999999999999998754 44333 4678899999999999998654444 45 Q ss_pred ccccceeeeeecceeeeeccCc-cCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 66 AANRNLASESAGELSVSYTAPI-SANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~vSvs~~~~~-~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +.+|+|+|||||+||||||+++ +.+++++||++||||||||+|++++|+||+|+ T Consensus 81 g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~ 135 (158) T protein:vir:10 81 RPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMV 135 (158) T ss_pred CcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCcccccc Confidence 6679999999999999999754 56788999999999999999999999999999 No 10 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=100.00 E-value=1.1e-39 Score=234.19 Aligned_cols=118 Identities=15% Similarity=0.182 Sum_probs=100.0 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhcc----cccccccceeeeee Q lcl|NC_013597. 
1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEI----SGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~----~~~~~~g~vtS~sv 76 (119) ..++|+||++||||+||||++|+.|+++|+++||.++|||.++++++|||||+|.+++..++ +.+..++.++|+++ T Consensus 7 ~~~ve~fR~l~PeF~dvPde~i~~~~d~A~~~v~~~~~Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~rv~ssat~ 86 (136) T protein:vir:10 7 IAVVEQMRKLVPALRKVPDETLYAWVEMAELFVCQKTFKDAYVKALALYALHLAFLDGALKGEDEDLESYSRRVTSFSLS 86 (136) T ss_pred HHHHHHHHHhccccccCCHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHhcccccccccccccccccceehheec Confidence 34678899999999999999999999999999999999999999999999999988764333 23334555566889 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) |+|||||+. ++.++|+.||++|||||+||+|+|+++.|--+- T Consensus 87 GevSVS~a~-~s~~~s~~WL~~TpyGq~y~aL~k~~~gGf~l~ 128 (136) T protein:vir:10 87 GEFSQTFGE-VTKNQSGDMMLSTPWGKMFEQLKARRRGRFALM 128 (136) T ss_pred cceeEeecc-ccCchhhHhhhcCHHHHHHHHHHhhcccchhhh Confidence 999999985 567889999999999999999999876653333 No 11 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=100.00 E-value=7.1e-39 Score=229.73 Aligned_cols=118 Identities=21% Similarity=0.302 Sum_probs=98.3 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhccc---ccccccce-eeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEIS---GGAANRNL-ASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~---~~~~~g~v-tS~sv 76 (119) ..++|.||++||||++|||++|+.|+++|+++||.++|||.++++++|||||+|.|++..++. 
+++..|+| +|+++ T Consensus 4 ~~~ve~Fr~l~PeF~~vpde~l~~~~~~A~~~i~~~~~g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~grv~ssst~ 83 (134) T protein:vir:79 4 IEILEQIYKIAPAFKKVDPELIQAWIELAKDFVCEKHFKDKYFRAVALYTLHLMTLDGAMKQESESVESYSHRIASFSLT 83 (134) T ss_pred HHHHHHHHHhccccccCCHHHHHHHHHHhhhhhcCCCCChHHHHHHHHHHHHHHhhcccccccccccccccchhhhhhhh Confidence 345899999999999999999999999999999999999999999999999999997543222 22334445 55779 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC-CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG-VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G-g~va 119 (119) |+|||||+. ++.+++++||++|||||+||+|+|+++.| |+.+ T Consensus 84 G~vSvS~a~-ps~~~~~~Wl~~TpYGq~y~~L~k~~~GGf~~~t 126 (134) T protein:vir:79 84 GEFSQTFSK-VSDDTSGNTLRQTPWGKMYEVLNKKKGGGFGLTT 126 (134) T ss_pred cceeeeccC-cccchhHHHHhcCHHHHHHHHHHHhhccchHhhh Confidence 999999976 46678999999999999999999987664 2222 No 12 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=99.93 E-value=2.1e-30 Score=183.24 Aligned_cols=112 Identities=18% Similarity=0.255 Sum_probs=96.8 Q ss_pred HHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhccccc---ccccceeeee-ecceeee Q lcl|NC_013597. 7 FLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGG---AANRNLASES-AGELSVS 82 (119) Q Consensus 7 Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~---~~~g~vtS~s-vG~vSvs 82 (119) .|.+||+|++||||+|+.|++.|++|||.+.|||.+.+++.||++|+|.++++.++.+. 
...++|+|-+ .|++||| T Consensus 1 mR~l~P~f~~vpdevi~~wid~A~lFVC~~~fg~~~~~Al~lytlHLm~~dga~k~e~~~~~~~s~r~~s~slsGE~Sit 80 (125) T protein:vir:10 1 MRTLYPPLKSQPDDVLNAWIEVAKLFICLDKFGDKQVQALAFYTLHLLSQDIALKTENDSSQTSSERVKSYSLSGEYTIS 80 (125) T ss_pred CccccchhhccCHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccccccccccccceeeeeeccceEee Confidence 99999999999999999999999999999999999999999999999999987665553 3468888877 8999999 Q ss_pred eccCccCCcchhhhhcCHHHHHHHHHHHHhCCC-CccC Q lcl|NC_013597. 83 YTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG-VMVA 119 (119) Q Consensus 83 ~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G-g~va 119 (119) |+.++ .+.++.|+.+||||++||+|+|+.+.| |++. T Consensus 81 ~~~~s-~d~s~~~L~~T~wGk~~~~L~k~~~GgFaL~T 117 (125) T protein:vir:10 81 YDTST-AAASSSNLEESSWGKLYIDLMRLKVGRWGLIT 117 (125) T ss_pred ccccc-ccccccccccCchHHHHHHHHHhcCCceeeec Confidence 98775 467889999999999999998854433 2222 No 13 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=99.93 E-value=3.7e-30 Score=181.97 Aligned_cols=116 Identities=22% Similarity=0.263 Sum_probs=98.6 Q ss_pred CC--CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhccccc---ccccceeeee Q lcl|NC_013597. 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGG---AANRNLASES 75 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~---~~~g~vtS~s 75 (119) |- .+|.||..||+|++|||++|+.|++.|+++||.+.+||.+++++.|||||++++++..++.+. 
....+|+|.| T Consensus 1 ~~~~~~e~~R~l~P~f~kvpdevI~~wielA~lfVc~~~~g~~~~~AlaL~taHLm~~dga~k~en~~~~t~S~rvaS~S 80 (132) T protein:vir:10 1 MNDAILAFMRSLVPALKAVDDESINVWIDLARLYVCADKFGNDADRAVGLYALHLMLSDGAFKGENEGLETYSRRMASYS 80 (132) T ss_pred CchHHHHHHHHhcchhhcCChHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHhhccccccccccchhhhhhhhhhhc Confidence 65 579999999999999999999999999999999999999999999999999999876654443 3457899999 Q ss_pred -ecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC-CccC Q lcl|NC_013597. 76 -AGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG-VMVA 119 (119) Q Consensus 76 -vG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G-g~va 119 (119) +|++||||+.++ ++++|+.+||||+.|++|+|+.+.| |+.. T Consensus 81 l~Ge~Sisf~~~s---a~~s~L~~tp~Gkl~~~L~k~~~GgfgL~t 123 (132) T protein:vir:10 81 LSGEFSITYDNQS---AIQGDLSSSSWGRMYKALLRKKGGGFGLIT 123 (132) T ss_pred ccCceeeeccccc---ccccccccCcHHHHHHHHHHhccCcccccc Confidence 799999998765 5667999999999999998854432 3333 No 14 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=99.59 E-value=1.9e-18 Score=117.78 Aligned_cols=105 Identities=26% Similarity=0.333 Sum_probs=93.3 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCch-hHHHHHHHHHHHHHHHHhhhhcccccccccceeeeeecce Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWG-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASESAGEL 79 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~svG~v 79 (119) =.|++..|...|.++.++|+.|+.+|++|..++.++.|. 
..+|++..+|+||+++++++ +|+|+.||++ T Consensus 2 ~ttv~~vkl~a~~L~~~sDDsl~~~I~dA~~e~~a~gFp~~~~e~a~rYLa~HLat~~~~----------~v~sE~V~~L 71 (111) T protein:vir:80 2 KTDVSKLKLTASSLASVSDDSLQVHIDDSYLEVQEKGFPEKFEERANRYLAAHLATLANK----------NVKSEAVGSL 71 (111) T ss_pred chhHHHHHHhhHhhcCCChHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHHhcCC----------CCchhhhhhH Confidence 458999999999999999999999999999999999995 68899999999999999633 5899999999 Q ss_pred eeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 80 SVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 80 Svs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) .-.|... ....||..|+|||+||+|.+.++.|+-.. T Consensus 72 k~~Y~~~----~~~~~l~~s~wGq~Y~rL~k~~~~gs~~~ 107 (111) T protein:vir:80 72 KREYYEV----KGDSGLLSTEYGQEYARLLKEANGGSGIS 107 (111) T ss_pred HHHhhhc----ccccccccchhHHHHHHHHHHhcCCccce Confidence 9888532 12279999999999999999999998777 No 15 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=95.55 E-value=0.00019 Score=40.93 Aligned_cols=109 Identities=17% Similarity=0.127 Sum_probs=62.7 Q ss_pred CC--CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCch---------------hHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_013597. 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWG---------------KLYDRGVMALTAHLLKLSADAEIS 63 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g---------------~~~~~~~~l~~AH~l~l~~~~~~~ 63 (119) || |.+.|+..| ....+|++.+..++..|...||.--++ +.-+++++..+-++...... 
T Consensus 1 M~Y~d~~~Y~~~y-~g~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~g~~---- 75 (131) T protein:vir:43 1 MPYTTLEFYNDEY-AGEHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQIEYFKEAGGT---- 75 (131) T ss_pred CCCCCHHHHHHhh-CCCCCCHhHHHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHHHHHHHhHHH---- Confidence 88 999999988 556689999999999999999753221 11234555555555444321 Q ss_pred ccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHh-----CCCCc Q lcl|NC_013597. 64 GGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI-----GVGVM 117 (119) Q Consensus 64 ~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~-----g~Gg~ 117 (119) .....+.++|.|+|+.||||...+....... ....-+.-..+++.. |.+=+ T Consensus 76 s~~~~~~~~S~svG~~Svs~~~~~~~~~~~~---~~~~~~~a~~~L~~TGLlyrGV~~~ 131 (131) T protein:vir:43 76 SELAVSKPDNVSIGRTSISDSNFASTATSLN---SGLIGSDVRSYLAHTGLLYNGVGVR 131 (131) T ss_pred hhhhccccCeeecCceEEeecccccchhhhc---hhhhHHHHHHHHhccCCeecCCCCC Confidence 1233445899999999999976443322211 001111122222221 11222 No 16 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=95.08 E-value=0.0008 Score=37.57 Aligned_cols=111 Identities=19% Similarity=0.157 Sum_probs=59.3 Q ss_pred CC--CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCch---------------hHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_013597. 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWG---------------KLYDRGVMALTAHLLKLSADAEIS 63 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g---------------~~~~~~~~l~~AH~l~l~~~~~~~ 63 (119) || |.+.|++. -...+|++.++.++..|...||.--++ +.-++|+++.+-++...+.. . 
T Consensus 1 M~Y~t~~~Y~~~--~G~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qiey~~~~G~~---s 75 (132) T protein:vir:98 1 MPYLTYEEFMDL--NGRDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQIEYFDALGAT---T 75 (132) T ss_pred CCCCCHHHHHhh--cCCCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHHHHHHhccch---h Confidence 76 88888764 233689999999999999999752221 11234555444443332221 1 Q ss_pred ccccccceeeeeecceeeeeccCccCCcc-hhhhhcCHHHHHHHHHHHHhCC--CCccC Q lcl|NC_013597. 64 GGAANRNLASESAGELSVSYTAPISANGS-DDFYQLTAYGQEYLRLRRLIGV--GVMVA 119 (119) Q Consensus 64 ~~~~~g~vtS~svG~vSvs~~~~~~~~~~-~~w~~~T~YG~~y~~L~~~~g~--Gg~va 119 (119) +....+.++|.|+|..||||..+...... ..-.+ .-+.-..+++..|. .|+=. T Consensus 76 ae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~---~~~~a~~~L~~tGLLyrGV~~ 131 (132) T protein:vir:98 76 FEEINNSPQTFQAGRTSVSNASRYNPSGANESKPL---VAEDVYIYLQGTGLLFQGVKT 131 (132) T ss_pred hhhccCccceeeeCcEEEEeeccCCcccccccccc---hHHHHHHHHhhcCCccccCCC Confidence 23345669999999999999643222211 11111 11223333333322 01111 No 17 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=94.72 E-value=0.00055 Score=38.45 Aligned_cols=109 Identities=17% Similarity=0.139 Sum_probs=60.8 Q ss_pred CC--CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCch---------------hHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_013597. 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWG---------------KLYDRGVMALTAHLLKLSADAEIS 63 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g---------------~~~~~~~~l~~AH~l~l~~~~~~~ 63 (119) || |.+.|+..|.- ..+|++.+..++..|...||.--++ +.-+++++..+-++...... 
T Consensus 1 M~Y~d~~~Y~~~y~G-~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~g~~---- 75 (131) T protein:vir:80 1 MPYTTLEFYTNEYAG-EHLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQIEYFKEAGGT---- 75 (131) T ss_pred CCCCCHHHHHHhhCC-CCCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHHHHHHHhhhh---- Confidence 88 99999999832 3378888999999999999753222 11224555555544443322 Q ss_pred ccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHh-----CCCCc Q lcl|NC_013597. 64 GGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI-----GVGVM 117 (119) Q Consensus 64 ~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~-----g~Gg~ 117 (119) .....+.++|.|+|+.||||...+.......-. .--+.-..+++.. |.+=+ T Consensus 76 ~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~---~~~~~a~~~L~~TGLlyrGV~~~ 131 (131) T protein:vir:80 76 SELAVSKPDNVSIGRTSISDSNFASTATSLNSG---LVGSDVRSYLAHTGLLYNGVGVR 131 (131) T ss_pred hhhcccccCeeeeCceEEeeccccchhhhhhhh---hhHHHHHHHHhccCCeecCCCCC Confidence 122345589999999999997544332222100 0111222222222 11222 No 18 >protein:vir:105776 Length: 133 # NCBI annotation: gp11 # Family: family:all:10997 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224149;genbank:gi:62362224;genbank:GeneID:3342529 Probab=94.00 E-value=0.0017 Score=35.76 Aligned_cols=101 Identities=14% Similarity=0.114 Sum_probs=67.0 Q ss_pred CCCHHHHHHhhhhhc-CCCHHHHHHHHHHHHHH---hCCcCchhHHHHHHHHHHHHHHHHhhhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFG-KTDAKRIGLFLSDAQAE---VSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~-~vpd~~i~~~~~~A~~~---~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~sv 76 (119) |+|.++.|+..-+.. ++||..|+.++++++.. ++ ..+.+..+.++.+|++-++.+... ..+|+|.+. 
T Consensus 1 mIT~~qa~~~L~slG~svP~~iL~~~v~q~nsi~~cLd-agY~e~tq~LI~lya~~LlA~~~g--------~R~IsSQ~A 71 (133) T protein:vir:10 1 MITTEQAKEYLESVGITLPDFILQAIVEQANSIQECLD-AHYPPATALLIQSYLLGLMALGQG--------DRYISSQTA 71 (133) T ss_pred CCCHHHHHHHHHhcCCcchHHHHHHHHHHHhhHHHHHh-CCCCHHHHHHHHHHHHHHHhhccC--------CceeecccC Confidence 999999999888866 49999999999999644 44 467788999999999999987532 345666554 Q ss_pred -cceeeeeccCccCCcchhhhhcCHHHHHHHHH---HHHhCCCCccC Q lcl|NC_013597. 77 -GELSVSYTAPISANGSDDFYQLTAYGQEYLRL---RRLIGVGVMVA 119 (119) Q Consensus 77 -G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L---~~~~g~Gg~va 119 (119) -.-|.||+...... +|=+.|-+| =+.-..|++|- T Consensus 72 PSGASrSF~Y~~~~~---------~~~~l~~~L~~lD~~gCt~~Lip 109 (133) T protein:vir:10 72 PNGASRSFRYQSFAD---------RWKGALSLLRGADKFRCANGLIP 109 (133) T ss_pred CccccccccccCCCc---------cHHHHHHHHHhhhhccccccccC Confidence 23456665543221 233333333 23334455542 No 19 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=93.75 E-value=0.0012 Score=36.58 Aligned_cols=108 Identities=15% Similarity=0.131 Sum_probs=62.5 Q ss_pred CHHHHHHhhhh---------hcC--CCH-HHHHHHHHHHHHHh-CCcC---chhH-----HHHHHHHHHHHHHHHhhhhc Q lcl|NC_013597. 3 LTEDFLLRYTE---------FGK--TDA-KRIGLFLSDAQAEV-SKVQ---WGKL-----YDRGVMALTAHLLKLSADAE 61 (119) Q Consensus 3 t~~~Fr~~~P~---------F~~--vpd-~~i~~~~~~A~~~~-~~~~---~g~~-----~~~~~~l~~AH~l~l~~~~~ 61 (119) .++..++.--- +.| .-| ..+++.++.+...+ +-+. +-+- .++++-+|..|.+..-.. 
T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~~d~~~kD~~vl~faie~v~~~IlnycNikeiP~~Le~v~~~maiDll~~e~~~~~k~-- 78 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMRQDNYFKDMEVLHYALTQAENEILNYIHQDSVPGRLENVWIDMTNDLLDKVKEQSVLA-- 78 (131) T ss_pred ChhhhhhhhhhhhhhhhhccccccccchHHHHHHHHHHHHHHHhhhcCCcccchhhHHHHHHHHHHHHhhhccccccc-- Confidence 66666663211 111 224 47899999999865 2222 2222 234445555554322111 Q ss_pred ccccccccceeeeeecceeeeeccCccCCcc--hhhhhcCHHHHHHHHHHHHh Q lcl|NC_013597. 62 ISGGAANRNLASESAGELSVSYTAPISANGS--DDFYQLTAYGQEYLRLRRLI 112 (119) Q Consensus 62 ~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~--~~w~~~T~YG~~y~~L~~~~ 112 (119) ...+...|.|+|-++|+-||||..++..... +--=-.+.|++|+-..||++ T Consensus 79 ~~i~~~~g~VsSI~eGDTsIsf~s~t~~~qrl~~~~s~l~~Y~~qL~~yRRL~ 131 (131) T protein:vir:10 79 EKAGADDFSVKSIKMGDTTIEKVSPYEMIQRMKQVPSSLERYKRQLNRFRKLL 131 (131) T ss_pred ccccccccceeeeeecceeeeccCCccHHHHHHHHHHHHhhhHHHHhhhcccC Confidence 1123466789999999999999665422111 11123468999999999999 No 20 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=93.47 E-value=0.0049 Score=33.27 Aligned_cols=110 Identities=14% Similarity=0.129 Sum_probs=61.5 Q ss_pred CCCHHHHHHhhhhhcC---CCHHHHHHHHHHHHHHhCC--cCc-h-----------------------------hHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK---TDAKRIGLFLSDAQAEVSK--VQW-G-----------------------------KLYDRG 45 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~---vpd~~i~~~~~~A~~~~~~--~~~-g-----------------------------~~~~~~ 45 (119) ..|++++++.+-+... .+|+..+..|-.|..+|+. 
.+| | +..+++ T Consensus 19 Yvtv~ea~aY~~~rg~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v~~~~IP~~V~~A 98 (172) T protein:vir:95 19 YVSVADARIYASNRGVELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEVPSNVIPKSLIAA 98 (172) T ss_pred cccHHHHHHHHHhcCCcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCcccccccchhHHHHHH Confidence 7788888886655432 5788889999999999984 333 1 112333 Q ss_pred HHHHHHHHHHHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHh---CCCCccC Q lcl|NC_013597. 46 VMALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI---GVGVMVA 119 (119) Q Consensus 46 ~~l~~AH~l~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~---g~Gg~va 119 (119) .+.++. ..+.+..-.........|+|++||+|+|+|..+.... +.+.|- .--+|++-+ +.|+-.+ T Consensus 99 ~~elA~--~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~------~~~~~~-~v~~LL~p~l~~~~~~~~~ 166 (172) T protein:vir:95 99 QVQLTM--AINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSVG------IMPTFT-AANALLAPLFGECASNKFA 166 (172) T ss_pred HHHHHH--HHHcCccccccCCcccceeEEeccceEEeeccCCCCC------CcccHH-HHHHHHhhhhcccCCccee Confidence 344432 1111111111122345689999999999997654322 123342 333454444 4444444 No 21 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=93.39 E-value=0.0049 Score=33.25 Aligned_cols=115 Identities=15% Similarity=0.178 Sum_probs=66.0 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCC--cCc-hh-----------------------------HHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSK--VQW-GK-----------------------------LYDRGVM 47 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~--~~~-g~-----------------------------~~~~~~~ 47 (119) ..|++++++-+.+... +|++..+..|-.|..+|+. .+| |. 
..+++.+ T Consensus 17 Yvt~~~a~aY~~~rg~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~~~~~~IP~~v~~A~~ 96 (172) T protein:vir:80 17 YAGADFVIAYAQARGVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFVIPSDVIPKELQSAVA 96 (172) T ss_pred cccHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcccccccchhHHHHHHHH Confidence 7789999988877654 7988999999999999985 235 21 1133444 Q ss_pred HHHHHHHHHhhhhcccccccccceeeeeecceeeeeccCccCCcch-hhhhcCHHH--HHHHHHHHHhCCCCccC Q lcl|NC_013597. 48 ALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSD-DFYQLTAYG--QEYLRLRRLIGVGVMVA 119 (119) Q Consensus 48 l~~AH~l~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~-~w~~~T~YG--~~y~~L~~~~g~Gg~va 119 (119) .++.-+ +++. ..........|.|++||+++++|+.+....... .-=..+.|- ..+++=. ++|.||+-. T Consensus 97 elA~~~--~~g~-~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~LL~p~-l~~~gg~~~ 167 (172) T protein:vir:80 97 AAVIEQ--VNGF-ELQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFPKIDALLNPL-LVGDGGLFL 167 (172) T ss_pred HHHHHH--hcCC-ccCcCCCCceeeEEeccceEEeeecccCccccccccCCccchHHHHHHHhhh-hcCCCCeee Confidence 444321 1211 111122245689999999999998654322110 001123333 3333322 556677666 No 22 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=93.11 E-value=0.0035 Score=34.04 Aligned_cols=109 Identities=17% Similarity=0.199 Sum_probs=63.8 Q ss_pred CC--CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC---c--h-------hHHHHHH-HHHHHHHHHHhhhhccccc Q lcl|NC_013597. 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ---W--G-------KLYDRGV-MALTAHLLKLSADAEISGG 65 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~---~--g-------~~~~~~~-~l~~AH~l~l~~~~~~~~~ 65 (119) || |.++|++.-++ +++..+..+..|+..||.-. + + +.+..++ ..+++-+..+...+..... 
T Consensus 1 M~YlT~eey~el~~~----~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~a~QieY~~~~G~~s~~ 76 (130) T protein:vir:47 1 MTYLTQEEFDELDFD----EVTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQIAYLDASGIMSAD 76 (130) T ss_pred CCCCchhhHhhcCCC----ChhhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHHHHHHHHHHHhccccch Confidence 77 88999876444 34448889999998886411 1 1 2222222 2344444444432222222 Q ss_pred ccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC---CccC Q lcl|NC_013597. 66 AANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG---VMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G---g~va 119 (119) . .+-++|.++|..|||+...++.....+ + ....+-+.++...|.| |+ + T Consensus 77 ~-~~~~~S~svGrtSis~~~~~~~~~~~~-~---~vs~da~~~L~~tGL~Ly~GV-~ 127 (130) T protein:vir:47 77 D-KQLANSVSIGRTSISYSTSQSTLAGQR-F---NLSMDAENALRQAGFSLVVGV-A 127 (130) T ss_pred h-ccCcceeeecceeeecCcCccccccCC-c---cccHHHHHHHHhcccccccCC-C Confidence 2 566899999999999987665544433 2 2455666677777765 33 5 No 23 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=86.63 E-value=0.044 Score=28.01 Aligned_cols=106 Identities=13% Similarity=0.130 Sum_probs=58.6 Q ss_pred CCCHHHHHHhhhh------hcCCCHHHHHHHHHHHHHHhCCc-Cc-h-----------------------------hHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTE------FGKTDAKRIGLFLSDAQAEVSKV-QW-G-----------------------------KLYD 43 (119) Q Consensus 1 m~t~~~Fr~~~P~------F~~vpd~~i~~~~~~A~~~~~~~-~~-g-----------------------------~~~~ 43 (119) ..|++++++-+.. ....+|+..+..|-.|..+|+.. 
+| | +..+ T Consensus 16 Yvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~~~~IP~~V~ 95 (170) T protein:vir:94 16 YVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLSQVSIPVKVK 95 (170) T ss_pred eecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccccchhhHHHH Confidence 5666777665443 23578999999999999999852 44 1 1123 Q ss_pred HHHHHHHHHHHHHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC----CccC Q lcl|NC_013597. 44 RGVMALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG----VMVA 119 (119) Q Consensus 44 ~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G----g~va 119 (119) ++.+.++.-++ ++... .....+.|+|++||+|+|+|+.+... ++.|. ..++|++=+..+ +.=+ T Consensus 96 ~Aq~elA~~~~--~~~~~--~~~~~~~v~~~kVG~i~veY~~~~~~--------~~~~~-~v~~LL~p~l~~~~~g~~~~ 162 (170) T protein:vir:94 96 IAVFELAYFML--ESGAA--LSFADQTIDSVKVGTIRVEFTKNSTD--------AGLPT-FVEAMLSGFGSPVLYGSNAA 162 (170) T ss_pred HHHHHHHHHHH--hCccc--CcccccceeeEecceeEEEecCCCCC--------CccHH-HHHHHhhhhhcccccccccc Confidence 34444433222 11111 11223558999999999999744322 22333 335565444333 2222 No 24 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=82.78 E-value=0.05 Score=27.71 Aligned_cols=110 Identities=16% Similarity=0.182 Sum_probs=52.5 Q ss_pred CC--CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC---c--h------hHHHHHH-HHHHHHHHHHhhhhcccccc Q lcl|NC_013597. 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ---W--G------KLYDRGV-MALTAHLLKLSADAEISGGA 66 (119) Q Consensus 1 m~--t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~---~--g------~~~~~~~-~l~~AH~l~l~~~~~~~~~~ 66 (119) || |.++|.+..+. +++.++..+..|+..||.-. + + ++++..+ ..+++.+..++..+...... 
T Consensus 6 M~YlT~eey~~l~~~----~~~dF~kllk~As~~ID~~t~~~y~~~d~e~d~~~r~~~vKkA~a~QIeY~~~~G~ts~~d 81 (138) T protein:vir:98 6 IAFLTQKEFEDLGFD----DVEDFEKMEKRASHAVNLYCRNRYDYKDLKKEIALVQKAVKRAIAYQIAYLNDSGVMTAED 81 (138) T ss_pred ccccchHHHhccCCC----ChhhHHHHHHHHHHHhhhhhccccccccccchhHHHHHHHHHHHHHHHHHHHHcCCcchhh Confidence 55 77777654332 34459999999999887421 1 1 2222222 23444444444332222222 Q ss_pred cccceeeeeecceeeeeccCccCC----cchhhhhcCHHHHHHHHHHHHhCC--CCccC Q lcl|NC_013597. 67 ANRNLASESAGELSVSYTAPISAN----GSDDFYQLTAYGQEYLRLRRLIGV--GVMVA 119 (119) Q Consensus 67 ~~g~vtS~svG~vSvs~~~~~~~~----~~~~w~~~T~YG~~y~~L~~~~g~--Gg~va 119 (119) .+-.+|.++|..||||......+ ...+-++.+.=- ..++...|. .| |+ T Consensus 82 -~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~~A---~~~L~~tGLLY~G-V~ 135 (138) T protein:vir:98 82 -KQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLDA---ENELLVVGLGYTG-IS 135 (138) T ss_pred -ccCcCceEeeeeEeecccccccccccccccccccccHHH---HHHHhhcCccccc-Cc Confidence 56679999999999973221111 111112222211 112233332 11 13 No 25 >protein:vir:99517 Length: 124 # NCBI annotation: putative protein # Family: family:all:372 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958539;genbank:gi:41179321;genbank:GeneID:2717155 Probab=80.75 E-value=0.054 Score=27.53 Aligned_cols=99 Identities=11% Similarity=-0.028 Sum_probs=62.6 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC----cC-chhHHH-HHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK----VQ-WGKLYD-RGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~----~~-~g~~~~-~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) |.++++.+.+---=.+..|+.|+.+|+.|+..|+. +. .-+.++ ....+-+.|+=-+. .. -.+|. 
T Consensus 6 ~~~Le~vK~~LgI~d~~~D~lL~~lI~~a~~~i~~~l~~~e~iP~~L~~Iv~evavkryNR~g----~E------G~~S~ 75 (124) T protein:vir:99 6 DDQLKKLKTALQLTDTKHDDLLKLYLEDATDFLKLRLSITGVIPTEMLAIVRGAAVKKFNRFK----NE------GMASY 75 (124) T ss_pred HHHHHHHHHHhCCCCcchhHHHHHHHHHHHHHHHHhcCCcccchhHHHHHHHHHHHHHhcccC----Cc------cccee Confidence 77788888774322345699999999999988753 11 212222 23344444542222 11 26899 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) |++.+|+||..+- .-+|-..+-++++..+.-|.+. T Consensus 76 SeeG~S~sf~d~d----------~~~y~~~L~~y~~~~~~~g~~~ 110 (124) T protein:vir:99 76 SQDGESITFASSD----------FDEWEDEINQWRKDHTGMNKGM 110 (124) T ss_pred eeCceeeeecccC----------hhhHHHHHHHHhhccCcCCcee Confidence 9999999995421 1278888888877777666655 No 26 >protein:vir:3970 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663678;genbank:gi:21716115;genbank:GeneID:951203 Probab=79.54 E-value=0.1 Score=25.98 Aligned_cols=100 Identities=13% Similarity=0.097 Sum_probs=59.3 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCc--hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeeec Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQW--GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESAG 77 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~--g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~svG 77 (119) |.++++++.+-+. -.|+.|+.+++.|...|..--- -+.....+..++.-...-. ...+.. 
-.+|.|+| T Consensus 1 M~iL~~vK~~lgi---~~D~lL~~li~~a~~~i~~~l~~~~~~iP~~l~~iv~evav~ryNR~g~E------G~~S~See 71 (110) T protein:vir:39 1 MAITDDLKKLLGG---SSDERLEVIEKRTRERLLLILSSNIKEVPPELEYVVLDVSLKRFNRIGQE------GMQSYSQE 71 (110) T ss_pred CchHHHHHHhcCC---ChhHHHHHHHHHHHHHHHHHhCCChhhhhhHHHHHHHHHHHHHhcccccc------ccceeecC Confidence 9999999998664 4699999999999987742110 0112333333333332221 111111 16899999 Q ss_pred ceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhC-CC----CccC Q lcl|NC_013597. 78 ELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIG-VG----VMVA 119 (119) Q Consensus 78 ~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g-~G----g~va 119 (119) .+|+||..+- ..+|-...-++++... .| |+|- T Consensus 72 G~S~sf~~~d----------~~~y~~~l~~y~~~~~~~~~~~~g~~~ 108 (110) T protein:vir:39 72 GLSMTFSESD----------FDEYADEIESWRKSKETEGDKKIGRFR 108 (110) T ss_pred CeeeeecccC----------cchhHHHHHHHhhhccccccCcceeee Confidence 9999994321 1277777777765542 22 2333 No 27 >protein:vir:5976 Length: 102 # NCBI annotation: hypothetical protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690676;genbank:geneid:6329129;genbank:gi:22855070;uniprot:Q38584;genbank:GeneID:955305 Probab=76.15 E-value=0.044 Score=28.01 Aligned_cols=91 Identities=14% Similarity=0.148 Sum_probs=57.0 Q ss_pred CCHHHHHHhhhhhcCCCH----HHHHHHHHHHHHHhCCcCchh-----HHHHHHHHHHHHHHHHhhhhccccccccccee Q lcl|NC_013597. 
2 PLTEDFLLRYTEFGKTDA----KRIGLFLSDAQAEVSKVQWGK-----LYDRGVMALTAHLLKLSADAEISGGAANRNLA 72 (119) Q Consensus 2 ~t~~~Fr~~~P~F~~vpd----~~i~~~~~~A~~~~~~~~~g~-----~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vt 72 (119) -.+.+.+.+-|==.+-.| .+|..+++.|+.+-+ ..|.+ -+.-++.+++|-.+.-++ ..+.++ T Consensus 1 Md~~~VK~ll~i~~~s~d~~i~~lip~y~e~aedyCN-~~F~dkdg~~~lP~gVkkfvAe~ik~y~--------~~~nis 71 (102) T protein:vir:59 1 MDIQRVKRLLSITNDKHDEYLTEMVPLLVEFAKDECH-NPFIDKDGNESIPSGVLIFVAKAAQFYM--------TNAGLT 71 (102) T ss_pred CChHHhhhhhcCCCCccHHHHHHHHHHHHHHHHHHhC-CccccccccccCCccHHHHHHHHHHhcC--------CCCCcc Confidence 455666665321111223 567788888977654 46764 477888999999876543 347799 Q ss_pred eeeecceeeeeccCccCCcchhhhhcCHHH--HHHHHHHH Q lcl|NC_013597. 73 SESAGELSVSYTAPISANGSDDFYQLTAYG--QEYLRLRR 110 (119) Q Consensus 73 S~svG~vSvs~~~~~~~~~~~~w~~~T~YG--~~y~~L~~ 110 (119) |.|.|+||.+|.+- +-++.|+ .-|.+|.| T Consensus 72 sRsMgtVSYty~T~---------iP~~i~~~L~PyRrl~~ 102 (102) T protein:vir:59 72 GRSMDTVSYNFATE---------IPSTILKKLNPYRKMAR 102 (102) T ss_pred cccccceeeechhh---------hhHHHHHHhhHHHhhcC Confidence 99999999999542 2222333 23444444 No 28 >protein:vir:3615 Length: 110 # NCBI annotation: ORF38 # Family: family:all:372 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112701;genbank:gi:13786569;genbank:GeneID:921067 Probab=75.37 E-value=0.15 Score=25.14 Aligned_cols=100 Identities=11% Similarity=0.079 Sum_probs=58.5 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCchh--HHHHHHHHHHHHHHHHh-hhhcccccccccceeeeeec Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGK--LYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESAG 77 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g~--~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~svG 77 (119) |.+++++|.+-+. -.|+.|+.+++.|...|...---+ -....+..++.-.+.-. ...+.. 
-.+|.|++ T Consensus 1 M~~L~~vK~~lg~---~~D~lL~~li~~a~~~i~~~~~~~~~eiP~~l~~iv~evav~ryNR~g~E------G~~S~See 71 (110) T protein:vir:36 1 MAITDDLKMLLGG---SLDERLEVIEKRTRDRLLLILGSDIKEVPPELEYVVLDVSLKRFNRIGQE------GMQSYSQE 71 (110) T ss_pred ChhHHHHHhhcCC---ChhHHHHHHHHHHHHHHHHHhCCChhhhhhHHHHHHHHHHHHHhcccccc------ccceeecC Confidence 9999999998663 579999999999998874311111 12223333333322221 111111 15899999 Q ss_pred ceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhC--CCCccC Q lcl|NC_013597. 78 ELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIG--VGVMVA 119 (119) Q Consensus 78 ~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g--~Gg~va 119 (119) .+|+||..+-- -+|-..+-+.++.-. .+.-+. T Consensus 72 G~S~sf~~~d~----------~~y~~~l~~y~~~~~~~~~~~~g 105 (110) T protein:vir:36 72 GLSMTFSESDF----------DEYADEIESWRKSRETEGDKKIG 105 (110) T ss_pred CceeeecccCc----------chHHHHHHHHHhhhccccCCcce Confidence 99999954321 267777766665532 222222 No 29 >protein:vir:79050 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:6416 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110727;genbank:gi:134287344;genbank:GeneID:4955224 Probab=73.73 E-value=0.11 Score=25.82 Aligned_cols=113 Identities=14% Similarity=0.106 Sum_probs=64.0 Q ss_pred CCCHHHHHHhhhhhcC----CCHHHHHHHHHHHHHHhC-CcCc----hhHHHHHHHHHHHHHHHHhhhhcc---cccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK----TDAKRIGLFLSDAQAEVS-KVQW----GKLYDRGVMALTAHLLKLSADAEI---SGGAAN 68 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~----vpd~~i~~~~~~A~~~~~-~~~~----g~~~~~~~~l~~AH~l~l~~~~~~---~~~~~~ 68 (119) --..++++++.-+|.- -++..|++.++.+...+. -+.. ..+.......-+.-++........ 
++-..- T Consensus 3 ~~i~e~i~~~Lk~~~~~~~~~d~~iL~fa~e~~~n~I~N~cNi~eiP~~L~~v~~~mai~~fl~~kk~~~~~~l~~~D~~ 82 (133) T protein:vir:79 3 NNIIDDIEKRLESFGYILKDGDKWLIDFVREKIENIIKLDCNIKTMPIELKEIEADMIVGEFLFTKKNMGQLDIESINFE 82 (133) T ss_pred chHHHHHHHHHHHhCCCCCccchHHHHHHHHHHHHHHhhhcChhhcchhHHHHHHHHHHHHHHhcccccCCCCcccccch Confidence 2356778888877764 356788889999887552 2222 222222233333333322211110 111123 Q ss_pred cceeeeeecceeeeeccCccCCcch----hh--hhcCHHHHHHHHHHHHhC Q lcl|NC_013597. 69 RNLASESAGELSVSYTAPISANGSD----DF--YQLTAYGQEYLRLRRLIG 113 (119) Q Consensus 69 g~vtS~svG~vSvs~~~~~~~~~~~----~w--~~~T~YG~~y~~L~~~~g 113 (119) +.|+|-++|+-||+|..++++...+ .| +-.+-|..|+-..||+.= T Consensus 83 ~~v~sIkeGDTsv~f~~~~~s~t~eq~l~s~i~~L~~~~k~~l~~yRkLrW 133 (133) T protein:vir:79 83 AVEKSISEGDTKVDFAIGSGSQTPEQRFDSLIAYLTAYGKNKILTFRCLRW 133 (133) T ss_pred hhhhheecccceeecccCCCccchhHHHHHHHHHHhhcccchhhccccccC Confidence 4589999999999997665433332 34 224555678888888776 No 30 >protein:vir:106596 Length: 128 # NCBI annotation: ORF042 # Family: family:all:372 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239495;genbank:gi:66395254;genbank:GeneID:4555750 Probab=71.32 E-value=0.19 Score=24.60 Aligned_cols=103 Identities=11% Similarity=0.018 Sum_probs=58.1 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC--chhHHHHHHHHHHHHHHHHhh-hhcccccccccceeeeeec Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ--WGKLYDRGVMALTAHLLKLSA-DAEISGGAANRNLASESAG 77 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~--~g~~~~~~~~l~~AH~l~l~~-~~~~~~~~~~g~vtS~svG 77 (119) |.++++.+.+---=.+..|+.|+.+++.|...++..- -++-....+..++--...-.. ..+.. 
-.+|.|++ T Consensus 19 m~~Le~vK~~LgI~d~~~D~lL~~lI~~a~~~i~~~l~~~~~~iP~~L~~Iv~evaVkryNR~g~E------G~~S~See 92 (128) T protein:vir:10 19 MNYLDDVKSRIGLNDNEQDKQLNSIINNVAAELLSRLPVDTISIPDKLQFIVVEVSTKRYNRIGAE------GMSTDSQD 92 (128) T ss_pred HHHHHHHHHHhCCCCcchhhHHHHHHHHHHHHHHHHcCCChhhhhhhHHHHHHHHHHHHhcccCcc------CcceeeeC Confidence 8888888887533234569999999999998764211 011223333333333322211 11111 16899999 Q ss_pred ceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 78 ELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 78 ~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) .+|+||..+-- -+|-...-+.++.-+.. |.|- T Consensus 93 G~S~tf~dnd~----------~~Y~~~L~~y~~~~~~~~kG~v~ 126 (128) T protein:vir:10 93 GRSNTFERNDF----------EEYQSIIDALYPKLDSSERGSVN 126 (128) T ss_pred ceeeeeccCCc----------chhHHHHHHHHhhccCCCCCcee Confidence 99999944311 26766666666543332 2333 No 31 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=65.72 E-value=0.21 Score=24.30 Aligned_cols=85 Identities=6% Similarity=-0.075 Sum_probs=49.0 Q ss_pred CCCHHHHHHhh---hhhcCCCHHHHHHHHHHHHHHhCC----cC----------------------chhHHHHHHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRY---TEFGKTDAKRIGLFLSDAQAEVSK----VQ----------------------WGKLYDRGVMALTA 51 (119) Q Consensus 1 m~t~~~Fr~~~---P~F~~vpd~~i~~~~~~A~~~~~~----~~----------------------~g~~~~~~~~l~~A 51 (119) |+|+++.|.-. +...|..|+.|+.+++-|+.++.. 
.- .-.....++.|+++ T Consensus 1 mvtLe~~K~hLRid~~d~d~dD~li~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLLlvg 80 (115) T protein:vir:10 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLADQAAGVDPAGQLLITRTVEQAILLTVG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccccccccccccccccCCcccccCChHHHHHHHHHHH Confidence 99999999854 334457799999999999877621 10 01335779999999 Q ss_pred HHHHHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHh-CCCCcc Q lcl|NC_013597. 52 HLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI-GVGVMV 118 (119) Q Consensus 52 H~l~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~-g~Gg~v 118 (119) |+-..+-.. ++|. .+.-|+|.+. |+.-+ ..||+- T Consensus 81 ~~Y~nRe~~--------------~~~~-----------------~~elP~~v~~--LL~pyR~~~gv~ 115 (115) T protein:vir:10 81 EWYANREQV--------------WVKG-----------------VGLVTSSAQN--LLHPYRKFAGVR 115 (115) T ss_pred HHHhcchhc--------------ccch-----------------hhhcCHHHHH--HHHHHHhcCCCC Confidence 987654211 0111 1233666433 22221 122222 No 32 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=65.56 E-value=0.16 Score=25.01 Aligned_cols=87 Identities=16% Similarity=0.026 Sum_probs=46.4 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc------CchhHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV------QWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~------~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) |.|++++|.--===.+..|+.|+.+++-|+.++... 
......+.++.++++|+=.-+..........+-.|.| T Consensus 3 ~vtLee~K~~LRID~d~dD~lI~~li~aA~~~i~~~~g~~~~~~~~~~~~Avl~lv~~~YeNRe~~~~~~~~~p~~v~s- 81 (95) T protein:vir:81 3 IVTLEEVKNWLRVDFSDDDALITTLINAAEEYLKNATGTTFDATNHLAKIFCMTLIADWYENRELVGRASDQVRPILQS- 81 (95) T ss_pred cCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHhhccccccCchHHHHHHHHHHHHHHhhccccccccccccHHHHH- Confidence 779999986432222468999999999999998431 2235778899999999987654221111111100000 Q ss_pred eecceeeeeccCccCCcchhhhhcCH Q lcl|NC_013597. 75 SAGELSVSYTAPISANGSDDFYQLTA 100 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~ 100 (119) -+....-.| +..|. T Consensus 82 ll~~lr~~~------------~~~~~ 95 (95) T protein:vir:81 82 ILAQLTYAY------------GGETA 95 (95) T ss_pred HHHHhhhcc------------ccccC Confidence 000000000 11111 No 33 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=62.90 E-value=0.18 Score=24.72 Aligned_cols=90 Identities=10% Similarity=-0.055 Sum_probs=44.8 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh----CCcC------chhHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEV----SKVQ------WGKLYDRGVMALTAHLLKLSADAEISGGAANRN 70 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~----~~~~------~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~ 70 (119) |+|++++|+-.=-=.+..|+.|+.+++-|..++ +.+- .-...+.|+.|+++|+-.-+-......-..+- T Consensus 8 ~vtLee~K~hLRid~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~~~~p~~ik~AiLllv~~~YenRE~~~~~~~~~~~- 86 (108) T protein:vir:18 8 VISLSLFKQQIEFEEDDRDELITLYAQAAFDYCMRWCDEPAWKVAADIPAAVKGAVLLVFADMFEHRTAQSEVQLYENA- 86 (108) T ss_pred ccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchHHHHHHHHHHHHHHhcccccccchhhhhH- Confidence 889999998543324478999999999998776 2211 12345678999999987665321111100000 Q ss_pred eeeeeecceeeeeccCccCCcc-hhh Q lcl|NC_013597. 
71 LASESAGELSVSYTAPISANGS-DDF 95 (119) Q Consensus 71 vtS~svG~vSvs~~~~~~~~~~-~~w 95 (119) ++-.+=--|..-.....+ ++- T Consensus 87 ----~~~~LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:18 87 ----AAERMMFIHRNWRGKAESEEGS 108 (108) T ss_pred ----HHHHHHHHHHhcCCCCCcccCC Confidence 000000001100000000 000 No 34 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=62.90 E-value=0.18 Score=24.72 Aligned_cols=90 Identities=10% Similarity=-0.055 Sum_probs=44.8 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh----CCcC------chhHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEV----SKVQ------WGKLYDRGVMALTAHLLKLSADAEISGGAANRN 70 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~----~~~~------~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~ 70 (119) |+|++++|+-.=-=.+..|+.|+.+++-|..++ +.+- .-...+.|+.|+++|+-.-+-......-..+- T Consensus 8 ~vtLee~K~hLRid~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~~~~p~~ik~AiLllv~~~YenRE~~~~~~~~~~~- 86 (108) T protein:vir:19 8 VISLSLFKQQIEFEEDDRDELITLYAQAAFDYCMRWCDEPAWKVAADIPAAVKGAVLLVFADMFEHRTAQSEVQLYENA- 86 (108) T ss_pred ccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchHHHHHHHHHHHHHHhcccccccchhhhhH- Confidence 889999998543324478999999999998776 2211 12345678999999987665321111100000 Q ss_pred eeeeeecceeeeeccCccCCcc-hhh Q lcl|NC_013597. 
71 LASESAGELSVSYTAPISANGS-DDF 95 (119) Q Consensus 71 vtS~svG~vSvs~~~~~~~~~~-~~w 95 (119) ++-.+=--|..-.....+ ++- T Consensus 87 ----~~~~LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:19 87 ----AAERMMFIHRNWRGKAESEEGS 108 (108) T ss_pred ----HHHHHHHHHHhcCCCCCcccCC Confidence 000000001100000000 000 No 35 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=62.73 E-value=0.33 Score=23.23 Aligned_cols=86 Identities=16% Similarity=0.139 Sum_probs=47.5 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC----c----C------------------chhHHHHHHHHHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK----V----Q------------------WGKLYDRGVMALTAHLL 54 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~----~----~------------------~g~~~~~~~~l~~AH~l 54 (119) |+|++++|.-.=-=.+.+|+.|+.+++-|+.++.. + . .-...+.++.|+++|+- T Consensus 7 ~vtL~e~K~hLRvd~d~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~~~~i~~AvLllvg~~Y 86 (120) T protein:vir:10 7 IVSLEVALAHLREDAGVADDLIKIYIGAATQSASDYVDRKLYANDAEMQAAVADATAGADPIVANDAIRAAILLTIGKLY 86 (120) T ss_pred ccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchhhhccccccccccCCHHHHHHHHHHHHHHH Confidence 55899988744322347899999999999888732 0 0 01335678999999987 Q ss_pred HHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHH--HHHHHhCC Q lcl|NC_013597. 55 KLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYL--RLRRLIGV 114 (119) Q Consensus 55 ~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~--~L~~~~g~ 114 (119) ..+.....+.. ++ ...-|+|.+.+ .+|+..|. 
T Consensus 87 enRe~~~~~~~--------~~--------------------~~~lP~~v~~Ll~~yR~~~gv 120 (120) T protein:vir:10 87 AFREDVVSGAS--------AS--------------------VTELPSGAKSLLFPYRVGLGV 120 (120) T ss_pred hchhhhhhccc--------cc--------------------ccccCHHHHHHHHHhhhccCC Confidence 66532110000 00 11124553321 22333444 No 36 >protein:vir:96128 Length: 98 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240080;genbank:gi:66395776;genbank:GeneID:5133109 Probab=61.92 E-value=0.096 Score=26.15 Aligned_cols=91 Identities=14% Similarity=0.193 Sum_probs=56.4 Q ss_pred CCHHHHHHh--hhhhcCC-----CHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLR--YTEFGKT-----DAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~--~P~F~~v-----pd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+.+.+.+ -| ..+. =...|..+++.|+.+-+ ..|.+-+.-++..++|-.+.- +..+.++|. T Consensus 1 Md~~dVK~ln~~~-i~~~~~d~~~~~li~~y~e~aedyCN-~~F~k~lP~gVkkfiAe~iky---------~~~~nissR 69 (98) T protein:vir:96 1 MEPKEVKQLNLMP-IEDTSNDDVLGDLIKFYKGIAEEYCN-KTFEAPYPFGVRKFIAECIKY---------GTNSNVSSR 69 (98) T ss_pred CchHHhHHhhccc-CCCcchHHHHHHHHHHHHHHHHHHhC-CcccccCCccHHHHHHHHHhh---------CCCCCcccc Confidence 344555554 11 1111 13567888899977654 568777888999999987763 245689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHH Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYL 106 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~ 106 (119) |.|+||.+|.+-- -...-+||+ ||=+.=| T Consensus 70 sMgtVSYty~T~i-P~~i~~~L~--PyRrlrw 98 (98) T protein:vir:96 70 TMGTVSYTFVTDL-PKATYRHLK--PFRRLRW 98 (98) T ss_pred cccceeeechhhh-hHHHHHHhh--hhhhccC Confidence 9999999995421 112223333 4444333 No 37 >protein:vir:741 Length: 110 # NCBI annotation: unknown # Family: family:all:372 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108718;genbank:gi:13487840;genbank:GeneID:920873 Probab=61.92 E-value=0.34 Score=23.12 Aligned_cols=99 Identities=11% Similarity=0.072 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCch-h--HHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWG-K--LYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g-~--~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.+++++|.+-.. -.|+.++.+++.|+..+... .| + -....+..++--.+.-. .+.+.. -.+|.|+ T Consensus 1 M~~L~~vK~~lgi---~~D~lL~~li~~a~~~i~~~-l~~~~~~iP~~l~~iv~evav~ryNR~g~E------G~~S~Se 70 (110) T protein:vir:74 1 MAITYEIKKLLGG---SSDERLEIIEKRTRERLLLI-LGSDLKEVPPELEYVVLDVSLKRFNRIGQE------GMQSYSQ 70 (110) T ss_pred ChHHHHHHHHcCC---ChhHHHHHHHHHHHHHHHHH-hCCChhhhhHHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 9999999998654 46999999999999887532 11 1 12233333333322221 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHh-CC----CCccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI-GV----GVMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~-g~----Gg~va 119 (119) +.+|+||..+- ..+|-...-+.++.. +. 
.|.|- T Consensus 71 eG~S~sf~~~d----------~~~y~~~l~~y~~~~~~~~~~~~~~~~ 108 (110) T protein:vir:74 71 EGLSMTFSESD----------FDEYADEIESRRKSKETEGDKKIGRFR 108 (110) T ss_pred CCeeeeecccc----------hhhHHHHHHHHHhhccccccCcceeee Confidence 99999994321 226776666655432 11 12222 No 38 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=59.82 E-value=0.38 Score=22.86 Aligned_cols=84 Identities=10% Similarity=0.041 Sum_probs=49.4 Q ss_pred CCCHHHHHHhh---hhhcCCCHHHHHHHHHHHHHHh----CCcC----------------------chhHHHHHHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRY---TEFGKTDAKRIGLFLSDAQAEV----SKVQ----------------------WGKLYDRGVMALTA 51 (119) Q Consensus 1 m~t~~~Fr~~~---P~F~~vpd~~i~~~~~~A~~~~----~~~~----------------------~g~~~~~~~~l~~A 51 (119) |+|+++.|+-. +.+.+-+|+.|+.++.-|...+ +.+- .-+....++.|+++ T Consensus 1 ivtLee~K~HlRid~dd~deDD~li~~~i~AA~~~v~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLllvg 80 (115) T protein:vir:81 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLADQAAGVDPAGQLLITRTVEQAILLTLG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCccchHHHHHHHHHHHHHHHHHhCCccccccccccccccccCCCCcccccCHHHHHHHHHHHH Confidence 99999999854 3455578999999999998665 2111 11235679999999 Q ss_pred HHHHHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHH--HHHHHhCCC Q lcl|NC_013597. 52 HLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYL--RLRRLIGVG 115 (119) Q Consensus 52 H~l~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~--~L~~~~g~G 115 (119) |+-..+-... +|. 
...-|+|.+.+ ..|+..|.- T Consensus 81 ~~Y~NRE~v~--------------~~~-----------------~~elP~~~~~LL~pyR~~~g~~ 115 (115) T protein:vir:81 81 EWYSSREQVW--------------TKG-----------------AGLVTSSAQNLLHPYRKFAGVR 115 (115) T ss_pred HHHhccchhc--------------chh-----------------hhhcCHHHHHHHHHHHhhcCCC Confidence 9876642210 111 11224553221 234444444 No 39 >protein:vir:4998 Length: 106 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049972;genbank:gi:9632944;genbank:GeneID:1262107 Probab=59.09 E-value=0.12 Score=25.55 Aligned_cols=93 Identities=19% Similarity=0.205 Sum_probs=46.2 Q ss_pred CC-CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc-----Cc------hhHHHHHHHHHHHHHHHHhhhhccccc-cc Q lcl|NC_013597. 1 MP-LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV-----QW------GKLYDRGVMALTAHLLKLSADAEISGG-AA 67 (119) Q Consensus 1 m~-t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~-----~~------g~~~~~~~~l~~AH~l~l~~~~~~~~~-~~ 67 (119) |. +++++|.-.-==.+-.|+.|+.+++-|+.+|... .+ ..+++.++.++++|+-.-+.....+.- .. T Consensus 1 M~v~Le~iK~~LRID~ddDD~li~~~i~AA~~yi~~aig~~~~~~~~~~~~~~~~~Avl~Lv~~~YeNR~~~~~~~~~~v 80 (106) T protein:vir:49 1 MSVSKEIIMQTLNLDETDDTALIPAYIESAQQYIINAVGSDPKFYELENVKYLFDTAVIALTSTYFTYRVALNETLTYPI 80 (106) T ss_pred CcccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHhhcCCCCCCCCcCCCchHHHHHHHHHHHHHHhhcccccCcccccc Confidence 54 7888887443222358999999999999997421 11 235788999999999877643221110 00 Q ss_pred ccceeeeeecceeeeeccCccCCcchh Q lcl|NC_013597. 
68 NRNLASESAGELSVSYTAPISANGSDD 94 (119) Q Consensus 68 ~g~vtS~svG~vSvs~~~~~~~~~~~~ 94 (119) +-.|.| =+..+.-.|..-.-...+++ T Consensus 81 p~~v~s-lI~qLR~~y~~~~e~~~~~~ 106 (106) T protein:vir:49 81 NLTLNS-IIGQLRGLYATYSDGGVNNA 106 (106) T ss_pred cHHHHH-HHHHHHhhhhhhhhccccCC Confidence 000000 00111111111100011111 No 40 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=57.63 E-value=0.43 Score=22.59 Aligned_cols=104 Identities=17% Similarity=0.113 Sum_probs=58.2 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcCch-----hHHHHHHHHHHHHHHHHh-hhhcccccccccceeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWG-----KLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASE 74 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~~g-----~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~ 74 (119) |.++++.+.+-.-=.+-.|+.++.+|+.|+..++..--. ......+..++--...-. ...+.. ..+|. T Consensus 1 M~~L~~vK~~lgi~d~~~D~lL~~iI~~a~~~i~~~l~~~~~~~~~iP~~l~~Iv~evavkryNR~g~E------G~~S~ 74 (113) T protein:vir:94 1 MALLDSIKLRIGIEDTKQDDLLTDIISDVQARVLAYVNQDGLVQSELPNGLDFVIKDVTIRIYNKIGDE------GKESS 74 (113) T ss_pred CchHHHHHHHhCCCCCchhhHHHHHHHHHHHHHHHHhCCccchhhhhhhHHHHHHHHHHHHHhcccCCc------cceee Confidence 999999999865533456999999999999888532111 122233333333332221 111111 15899 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc-C Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV-A 119 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v-a 119 (119) |++.+|+||....- ..+|=-.+-++++....++.- - T Consensus 75 SeeG~S~sf~~~~d---------f~~y~~~l~~~~~~~~~~~~g~r 111 (113) T protein:vir:94 75 SEGNVSNTWDTPAD---------LSEYSDVLDVYRKSYKRRSAGMR 111 (113) T ss_pred ecCceeeeecCccc---------hhhHHHHHHHHHhhccCCCCCce Confidence 99999999943211 125655555555432222221 2 No 41 >protein:vir:9877 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:2716 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795639;genbank:gi:28876402;genbank:GeneID:1257933 Probab=56.27 E-value=0.46 Score=22.43 Aligned_cols=99 Identities=15% Similarity=0.237 Sum_probs=57.3 Q ss_pred CC-----CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC----cCchhHHH-HHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_013597. 1 MP-----LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK----VQWGKLYD-RGVMALTAHLLKLSADAEISGGAANRN 70 (119) Q Consensus 1 m~-----t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~----~~~g~~~~-~~~~l~~AH~l~l~~~~~~~~~~~~g~ 70 (119) |- ++++.|.+-+-=.+-.|+.++.+++.|+..++- ...-+..+ .....-++|+=.+.. . . T Consensus 1 m~~~~~~~L~~vK~~Lgi~d~~~D~lL~~ii~~~~~~i~~~l~~~~iP~~L~~Iv~ev~vkryNR~g~----E------G 70 (114) T protein:vir:98 1 MDETKQAIIDRVRVRLADETSLKEELLEELTQTAIDRINLKVGDVVFNPLFNSIAVDVVVKMYRRMYF----E------G 70 (114) T ss_pred CchhHHHHHHHHHHHhCCCCCchhhHHHHHHHHHHHHHHHhhCccccchHHHHHHHHHHHHHhcccCc----c------c Confidence 43 355666665543446799999999999988752 11222222 122333344332221 1 1 Q ss_pred eeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHH---HHhCCCCccC Q lcl|NC_013597. 
71 LASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLR---RLIGVGVMVA 119 (119) Q Consensus 71 vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~---~~~g~Gg~va 119 (119) .+|.|++.+|+||..+- ..+|--.+-+.+ +..+.|+.|- T Consensus 71 ~~S~S~eG~S~tf~dnd----------f~ey~~~l~~y~~~~~~~~~g~~v~ 112 (114) T protein:vir:98 71 IDTEKADTISTKFIENV----------LAEYGEELASYKKDRLAILNKKVVR 112 (114) T ss_pred cceeeccceeeeeeccc----------cchhHHHHHHHHhhhhhhhcCceee Confidence 58999999999995421 226775555544 4556777777 No 42 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=56.08 E-value=0.46 Score=22.41 Aligned_cols=109 Identities=15% Similarity=0.067 Sum_probs=56.6 Q ss_pred CCCHHHHHHhhhhhcC---CCHHHHHHHHHHHHHHhCC--cCch-h-----------------------------HHHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK---TDAKRIGLFLSDAQAEVSK--VQWG-K-----------------------------LYDRG 45 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~---vpd~~i~~~~~~A~~~~~~--~~~g-~-----------------------------~~~~~ 45 (119) ..|++++++.+-+... ..|+..+..|-.|..+|+. .+|. . ..+.+ T Consensus 17 Yvt~~ea~aY~~~rg~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~~~~IP~~V~~A 96 (169) T protein:vir:95 17 YVSLEDGRALAAKYGLELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFPQPSNVIPSLVIQA 96 (169) T ss_pred cccHHHHHHHHHHcCCcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceecccccccccchHHHHHH Confidence 6788888887665432 3578889999999999985 2442 1 11222 Q ss_pred HHHHHHHHHHHhhhhccccccccccee-eeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHh--CCCCccC Q lcl|NC_013597. 46 VMALTAHLLKLSADAEISGGAANRNLA-SESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI--GVGVMVA 119 (119) Q Consensus 46 ~~l~~AH~l~l~~~~~~~~~~~~g~vt-S~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~--g~Gg~va 119 (119) .+.++.-.+ ++...... ...+.|. ++.+|.++|+|..+...... ..-|. 
--+|++-+ |.||.-+ T Consensus 97 ~~elA~~~~--~g~~~~~~-~~~~~v~~e~v~G~i~veY~~~~~~~~~----~~~~a---~~~LL~p~l~g~~g~~~ 163 (169) T protein:vir:95 97 QVMAAVEYG--AGTDVRGS-TDGREVQTERVEGAVTVSYFKNGYSGGT----VSITA---ADDALRPLLCGSNNAYS 163 (169) T ss_pred HHHHHHHHH--cCccccCC-CCccceeeeeeccceeEeecCCCCcCcc----ccHHH---HHHhhhhhcccCCCcce Confidence 232222222 11111111 2234454 55669999999765433211 11111 22444433 6667444 No 43 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=55.42 E-value=0.46 Score=22.43 Aligned_cols=84 Identities=10% Similarity=0.050 Sum_probs=48.0 Q ss_pred CCCHHHHHHhh---hhhcCCCHHHHHHHHHHHHHHh----CCcC----------------------chhHHHHHHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRY---TEFGKTDAKRIGLFLSDAQAEV----SKVQ----------------------WGKLYDRGVMALTA 51 (119) Q Consensus 1 m~t~~~Fr~~~---P~F~~vpd~~i~~~~~~A~~~~----~~~~----------------------~g~~~~~~~~l~~A 51 (119) |+|+++.|.-. +.+.+-+|+.|+.++.-|+..+ +..- .-...+.++.|+++ T Consensus 1 mvtLee~K~hLRid~d~~d~DDali~~~i~AA~~~v~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLllvg 80 (115) T protein:vir:97 1 MITLAMMQRHLQAELYEDDERDYVMQQLLPAARESAELFLNRKLYDVQADMLADQVLGVDPSDQLLITRTVEQAILLTVG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccchhhcccccccccCCCcccccCCHHHHHHHHHHHH Confidence 99999999844 4555567889999999887654 2111 11234678899999 Q ss_pred HHHHHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHH--HHHHHhCCC Q lcl|NC_013597. 52 HLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYL--RLRRLIGVG 115 (119) Q Consensus 52 H~l~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~--~L~~~~g~G 115 (119) |+-..+-.. 
++|+ .+.-|+|.+.+ ..|+-.|.- T Consensus 81 ~~Y~NRE~v--------------~~~~-----------------~~elP~~~~~LL~pyR~~~Gv~ 115 (115) T protein:vir:97 81 EWYSSREQV--------------WIKG-----------------AGLVTSSAQNLLHPYRKFAGVR 115 (115) T ss_pred HHHhccccc--------------cccc-----------------ccccCHHHHHHHHHHHhhcCCC Confidence 987654211 0111 11225554321 123333333 No 44 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=54.77 E-value=0.15 Score=25.09 Aligned_cols=83 Identities=13% Similarity=0.067 Sum_probs=44.3 Q ss_pred CCCHHHHHHhh---hhhcCCCHHHHHHHHHHHHHHhC----CcC------------------chhHHHHHHHHHHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRY---TEFGKTDAKRIGLFLSDAQAEVS----KVQ------------------WGKLYDRGVMALTAHLLK 55 (119) Q Consensus 1 m~t~~~Fr~~~---P~F~~vpd~~i~~~~~~A~~~~~----~~~------------------~g~~~~~~~~l~~AH~l~ 55 (119) |+|++.+|+-. |.|. -.|+.|+.|++-|...+- .+- ..+.-+.++.|+++|+-. T Consensus 3 mitLeeiK~hlRid~D~~-~eD~lL~~y~~AA~~~~e~~~~rkLy~~~~~~~~~p~~~~gl~~~~di~~A~Lllv~hwYe 81 (110) T protein:vir:57 3 MTSLSNVKTQLRLEEDFT-EHDDFIESLIDAAQRSIERTYYCVLVDSQEALEKLPEGVRGFLIEPDTQLAARMMVAQWYL 81 (110) T ss_pred CCCHHHHHHHcCCCCCCC-hhHHHHHHHHHHHHHHHHHHhCCcccCCccccccCCCCCCccccCHHHHHHHHHHHHHHHh Confidence 88999999854 3442 469999999998886652 111 224456899999999987 Q ss_pred HhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCC Q lcl|NC_013597. 56 LSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGV 114 (119) Q Consensus 56 l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~ 114 (119) .+-....++ .-++ |. .-++..-|| +..-. 
T Consensus 82 NREav~~~~-------------~~~~----P~-----~v~~Ll~P~--------~~~~~ 110 (110) T protein:vir:57 82 NPKGTSPDG-------------DTPA----QL-----GVEYLLFPL--------MEHTV 110 (110) T ss_pred ccccccccc-------------ccch----hH-----HHHHHHHHH--------HhhcC Confidence 653211100 0000 00 001111122 11111 No 45 >protein:vir:96831 Length: 98 # NCBI annotation: ORF052 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240159;genbank:gi:66395852;genbank:GeneID:5133172 Probab=53.53 E-value=0.19 Score=24.49 Aligned_cols=92 Identities=18% Similarity=0.204 Sum_probs=55.5 Q ss_pred CCHHHHHHhhh-hhcC-----CCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhcccccccccceeeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGK-----TDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASES 75 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~-----vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~s 75 (119) -.+.+.+.+-- ...+ +=...|..+++.|+.+-+ ..|.+-+.-++..++|-.+.- +..+.++|.| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN-~~F~~~lP~gVkkfvAe~iky---------~~~~nissRs 70 (98) T protein:vir:96 1 MDALDVKMLNGTRIDDVSNDDVINKLILAYKQVAEEYCN-QVFGDPLPGGVKKFIAECIKY---------GVSGNIASRS 70 (98) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhC-CcccccCCccHHHHHHHHHhh---------cccCCccccc Confidence 23444444311 1111 113567888999977654 568887888999999987763 2335799999 Q ss_pred ecceeeeeccCccCCcchhhhhcCHHHHHHH Q lcl|NC_013597. 
76 AGELSVSYTAPISANGSDDFYQLTAYGQEYL 106 (119) Q Consensus 76 vG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~ 106 (119) .|+||.+|.+-- -...-+||+ ||=+.=| T Consensus 71 MgtVSYty~T~i-P~~i~~~L~--PyRrlrw 98 (98) T protein:vir:96 71 MGTVSYTYVTDV-PSSMYKYLK--PYRKLRW 98 (98) T ss_pred ccceeeechhhh-hHHHHHHhh--hhhhccC Confidence 999999995421 112223333 4444333 No 46 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=49.79 E-value=0.31 Score=23.38 Aligned_cols=87 Identities=15% Similarity=0.007 Sum_probs=46.3 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh----CCc--Cc------------hhHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEV----SKV--QW------------GKLYDRGVMALTAHLLKLSADAEI 62 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~----~~~--~~------------g~~~~~~~~l~~AH~l~l~~~~~~ 62 (119) |.|+++.|+-.==-.+..|+.|+.+++-|+.++ +.. ++ -...+.|+.|+++|+-.-+..... T Consensus 4 ~vtLeevK~hLRId~d~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~~AvLlLv~~~YenRe~~~~ 83 (108) T protein:vir:93 4 LLTLEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMKGAAMRLTGMLYRNPDLAER 83 (108) T ss_pred CCCHHHHHHHcCCCCCcChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHHHHHHHHHHHHHhccccccc Confidence 779999998554333568999999999997765 211 11 123678999999998766532111 Q ss_pred cccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 63 SGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 63 ~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) ++++. 
+..|+|.+ .|+..+..=.++ T Consensus 84 --------------~~~~~---------------~elP~~v~--~Ll~~~R~p~~~ 108 (108) T protein:vir:93 84 --------------EELLQ---------------GELPFSVS--VLIYDLRCPTVL 108 (108) T ss_pred --------------ccccc---------------ccCCHHHH--HHHHHccccccC Confidence 00000 01122211 111111111122 No 47 >protein:vir:102158 Length: 99 # NCBI annotation: uncharacterized phage protein (possible DNA packaging) # Family: family:all:316 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699940;genbank:gi:110804046;genbank:GeneID:4206702 Probab=49.50 E-value=0.31 Score=23.40 Aligned_cols=91 Identities=12% Similarity=-0.038 Sum_probs=46.8 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc------CchhHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV------QWGKLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~------~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) |.|++++|.--===.+..|+.|+.+++-|+.++... ......+.|+.++++|+=.-+.....+..... T Consensus 2 ~vtLee~K~~LRID~d~dD~lI~~~i~aA~~~i~~~~~~~~~~~~~~~k~Avl~lv~~~YenR~~~~~~~~~~~------ 75 (99) T protein:vir:10 2 ILSVDEVKNYLRVDYDEDDILIQDLIESAEDYLYNATGKKFTEKNKLAKRYCLALVYDWYKDKGMNIRATKNTT------ 75 (99) T ss_pred cCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHhhCCCCCCCChHHHHHHHHHHHHhHhcchhhhhhhhccc------ Confidence 889999997433223468999999999999987421 12356788999999999876543221110000 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHH-HHHHHHhCCCCc Q lcl|NC_013597. 75 SAGELSVSYTAPISANGSDDFYQLTAYGQEY-LRLRRLIGVGVM 117 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y-~~L~~~~g~Gg~ 117 (119) . 
-...|||..- +.-.|.+|--=- T Consensus 76 --~------------------~~~lp~~v~sli~qlr~~~~~~~ 99 (99) T protein:vir:10 76 --V------------------SEKVKYTLQSILLQLKFCKEEDT 99 (99) T ss_pred --h------------------hhhhhHHHHHHHHHHhhccCCCC Confidence 0 0001111000 000000000000 No 48 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=48.42 E-value=0.67 Score=21.54 Aligned_cols=99 Identities=10% Similarity=0.057 Sum_probs=51.1 Q ss_pred CCCHHHHHHhhhhhc----CCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhh-hhcccccccccceeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFG----KTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSA-DAEISGGAANRNLASES 75 (119) Q Consensus 1 m~t~~~Fr~~~P~F~----~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~-~~~~~~~~~~g~vtS~s 75 (119) |-..+..-...-.+. ...|+.|..+|++|...+-.-.-.+.....+..++.-++.-.. ..+..| .+|-| T Consensus 1 ~~~~~~~~e~ik~L~~~~d~~~DelL~~lieda~~~vl~y~nr~~ip~~l~~~v~evav~~fNR~G~EG------~tS~S 74 (105) T protein:vir:10 1 MLNVDQLTEIVSALSTRLENVNNALLTELVKESIAQVLDYTGQKKLVGSMDIYVKKLAVINYNRLGIEG------ETQRS 74 (105) T ss_pred CCchHHHHHHHHHHhccCCCchhHHHHHHHHHHHHHHHHHcCCcccchhHHHHHHHHHHHHhcccCCcc------cceee Confidence 655555444444433 3568999999999998772211112233344455544443322 111111 58999 Q ss_pred ecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 76 AGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 76 vG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +|.+|.||...-. 
..|.+-++.+..+...- T Consensus 75 egGvS~sy~~~~~--------------~~~~~~l~~yR~~~v~~ 104 (105) T protein:vir:10 75 EGGITNYLETGIP--------------KDIRQGLNSYRIAKVKK 104 (105) T ss_pred cCCeeeeeeccCc--------------HHHHHHHHHHhhhcccC Confidence 9999999965311 12333333333322222 No 49 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=47.73 E-value=0.6 Score=21.81 Aligned_cols=101 Identities=16% Similarity=0.129 Sum_probs=48.2 Q ss_pred CC---CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhC---CcC------c--------hhHHHHHHHHHHHHHHHHhhhh Q lcl|NC_013597. 1 MP---LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVS---KVQ------W--------GKLYDRGVMALTAHLLKLSADA 60 (119) Q Consensus 1 m~---t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~---~~~------~--------g~~~~~~~~l~~AH~l~l~~~~ 60 (119) |+ |++++.+++-++.+=..++++.+|++|..+|- +.. + ....++...-+++--+... T Consensus 1 m~~fAtv~Dv~~r~r~L~~~E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~~~V~Ral~~~--- 77 (132) T protein:vir:16 1 MNPFATVDDLTMLWRPLKGDEKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTVDIVARTLMTS--- 77 (132) T ss_pred CCccCCHHHHHHHhcCCCHhHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHHHHHHHHhcCC--- Confidence 55 88899988865554444699999999999982 211 0 1112222223333222111 Q ss_pred cccccccccceeeeeecceeee--eccCccCCcchhhhhcCHHHHHHHHHHHHhCCCC--ccC Q lcl|NC_013597. 61 EISGGAANRNLASESAGELSVS--YTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGV--MVA 119 (119) Q Consensus 61 ~~~~~~~~g~vtS~svG~vSvs--~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg--~va 119 (119) .+..+....|.+.|..|.| |.++. +-+-.|. 
..+++.|.++ +-+ T Consensus 78 ---~~~~G~tq~S~TaG~ys~S~t~~~p~------G~lylt~------~e~~~LG~~~~r~~~ 125 (132) T protein:vir:16 78 ---TDQEPMTQTTESALGYSVSGSYLVPG------GGLFIKN------SELSRLGLKKQRFGV 125 (132) T ss_pred ---CCCCCceeeeeeccchheeeeeecCC------CcceeCh------HHHHhhCCCCCceEE Confidence 1111223467888988555 54332 1122222 1122223221 111 No 50 >protein:vir:4954 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049930;genbank:gi:9632901;genbank:GeneID:1262077 Probab=47.32 E-value=0.28 Score=23.59 Aligned_cols=90 Identities=18% Similarity=0.247 Sum_probs=46.0 Q ss_pred CCCHHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCc-----Cc------hhHHHHHHHHHHHHHHHHhhhhccccc-cc Q lcl|NC_013597. 1 MPLTEDFLLRYT-EFGKTDAKRIGLFLSDAQAEVSKV-----QW------GKLYDRGVMALTAHLLKLSADAEISGG-AA 67 (119) Q Consensus 1 m~t~~~Fr~~~P-~F~~vpd~~i~~~~~~A~~~~~~~-----~~------g~~~~~~~~l~~AH~l~l~~~~~~~~~-~~ 67 (119) ..|++++|.-.= .|. ..|+.|+.+++-|+.+|... .+ ......++.++++|+-.-+.......- .. T Consensus 2 ~vtLeeiK~~LRID~d-ddD~li~~~i~aA~~yi~~aig~~~~~~~~~~~~~~~~~Avl~Lv~~~YeNR~~~~~~~~~~v 80 (104) T protein:vir:49 2 SVSKTSIMQTLNLDET-DDTALIPAYIESAKQYIINAVGSDSKFYDLDSVRALFDTAVIALTSSYFTYRVALTDTATYPV 80 (104) T ss_pred cccHHHHHHHcCCCCc-cchHHHHHHHHHHHHHHHHhhCCCCccccccCCChHHHHHHHHHHHHHHhhchhccccccchh Confidence 448999886433 233 58999999999999988421 11 246788999999999877643221110 00 Q ss_pred ccceeeeeecceeeeeccCccCCcc Q lcl|NC_013597. 
68 NRNLASESAGELSVSYTAPISANGS 92 (119) Q Consensus 68 ~g~vtS~svG~vSvs~~~~~~~~~~ 92 (119) +-.|.| -+..+.-.|..-.-..++ T Consensus 81 p~~v~s-li~qLr~~y~~~~e~~~~ 104 (104) T protein:vir:49 81 NLTLNS-IIGQLRGLYATYSEERGD 104 (104) T ss_pred hHHHHH-HHHHHHHhhhhhhhccCC Confidence 000000 001111112111111111 No 51 >protein:vir:1241 Length: 104 # NCBI annotation: similar to phage Spp1 gp15 (product required for head morphogenesis) # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510940;genbank:gi:17426274;genbank:GeneID:927373 Probab=47.31 E-value=0.71 Score=21.41 Aligned_cols=97 Identities=21% Similarity=0.238 Sum_probs=56.3 Q ss_pred CCHHHHHHhhh-hhcCC-C----HHHHHHHHHHHHHHhCCcCch-hHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGKT-D----AKRIGLFLSDAQAEVSKVQWG-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~v-p----d~~i~~~~~~A~~~~~~~~~g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+.+.+.+-- ...++ . ...|..+++.|+.+-+ ..|+ +.+.-++..++|-.+.- +..+.++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN-~~F~~~~lP~gVkkfvAe~iky---------~~~~NissR 70 (104) T protein:vir:12 1 MDAKDVKMINGLSLNDSSDDEQIEYLIEEYKSVAEDYCN-QKFDDKEVPSGVKKFIAECIKF---------GTTGNISAR 70 (104) T ss_pred CCHHHHHHHhCCCCCCCccHHHHHHHHHHHHHHHHHHhC-CCCCCccCCccHHHHHHHHHhh---------CCCCCcccc Confidence 33444444311 11111 1 3567888999977755 5676 46778889999987763 245689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+- +-++.|+- +...||+.=.|=-| T Consensus 71 sMgtVSYTy~T~---------iP~~i~~~-L~PYRrlrw~~~~~ 104 (104) T protein:vir:12 71 TMGTVSYTYVTD---------IPSSAYAY-LLPYRKLSWGKRYV 104 (104) T ss_pred cccceeeechhh---------hhHHHHHh-hhhhhhhcccccCC Confidence 999999999542 22223331 11223443334444 No 52 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=46.31 E-value=0.74 Score=21.30 Aligned_cols=114 Identities=18% Similarity=0.097 Sum_probs=54.5 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC----------------------chhHHHHH-HHHHHHHHHHHh Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ----------------------WGKLYDRG-VMALTAHLLKLS 57 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~----------------------~g~~~~~~-~~l~~AH~l~l~ 57 (119) ..|-++|...-++. +.++..+..+..|+..||.-. |-..+..+ -..+++-+..++ T Consensus 4 YLTy~ef~~lg~~~--~~~d~F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~~~r~~~vKkA~a~QIeY~~ 81 (144) T protein:vir:79 4 YLTTSDFEKLGYEL--KKPDNFGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYLFRQAMAFKKAVALEMLFLE 81 (144) T ss_pred ccchhhhhhhCCCC--cchhhhhhHHHHHHHHhhhhhhhhccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH Confidence 23555555443332 355668888888888876521 11111111 223444444443 Q ss_pred hhhccc-ccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCC--CCccC Q lcl|NC_013597. 58 ADAEIS-GGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGV--GVMVA 119 (119) Q Consensus 58 ~~~~~~-~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~--Gg~va 119 (119) ..+... .....+.++|.++|..|||+.+.+..+...+-.+-++ .=+.++...|. .|+-. 
T Consensus 82 ~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~~---~a~~yL~~tGLLYrGV~s 143 (144) T protein:vir:79 82 DSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVVK---SAYDLLGRYGLLFSGVAS 143 (144) T ss_pred HcCCcchhhhhcCccceeEecceEEeecCCCccccccccccccH---HHHHHHhhcCcccccccc Confidence 322211 2334678899999999999976544433321112112 22233333332 11111 No 53 >protein:vir:93740 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240461;genbank:gi:66396159;genbank:GeneID:5133509 Probab=46.19 E-value=0.38 Score=22.90 Aligned_cols=97 Identities=20% Similarity=0.203 Sum_probs=56.5 Q ss_pred CCHHHHHHhhh-hhcC-----CCHHHHHHHHHHHHHHhCCcCch-hHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGK-----TDAKRIGLFLSDAQAEVSKVQWG-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~-----vpd~~i~~~~~~A~~~~~~~~~g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+++.+.+-- ...+ +=+..|..+++.|+.+-+ ..|+ +.+.-++..++|-.+.- +..+.++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN-~~F~~~~lP~gVkkfvAe~iky---------~~~~NissR 70 (104) T protein:vir:93 1 MDAKDVKMINGLSLNDSSNDEQIDYLIEEYKSVAEDYCN-QKFDDKEVPSGVKKFIAECIKF---------GTTGNISAR 70 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhC-CCCCCccCCccHHHHHHHHHhh---------CCCCCcccc Confidence 33444444311 0111 124678889999987755 4676 46778889999987763 245689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+- +-++.|+- +...||+.=.|=-| T Consensus 71 sMgtVSYTy~T~---------iP~~i~~~-L~PYRrlrw~~~~~ 104 (104) T protein:vir:93 71 TMGTVSYTYVTD---------IPSSAYAY-LLPYRKLSWGKRYV 104 (104) T ss_pred cccceeeechhh---------hhHHHHHh-hhhhhhhcccccCC Confidence 999999999542 22223331 11223443334444 No 54 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:97 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:97 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 55 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:96 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:96 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 56 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:96 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:96 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 57 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:93 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:93 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 58 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:99 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:99 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 59 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:78 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:78 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 60 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=45.91 E-value=0.75 Score=21.26 Aligned_cols=102 Identities=8% Similarity=-0.010 Sum_probs=57.4 Q ss_pred CCCHHHHHHhhhhhcC-CCHHHHHHHHHHHHHHhCCcC-c-hhHHHHHHHHHHHHHHHHh-hhhcccccccccceeeeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK-TDAKRIGLFLSDAQAEVSKVQ-W-GKLYDRGVMALTAHLLKLS-ADAEISGGAANRNLASESA 76 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~-vpd~~i~~~~~~A~~~~~~~~-~-g~~~~~~~~l~~AH~l~l~-~~~~~~~~~~~g~vtS~sv 76 (119) |.++++.+.+- ...+ --|+.++.+|+.|...++.-- . -+.....+..++--...-. ...+.. -.+|.|+ T Consensus 1 M~~L~~vK~~l-gI~d~~~D~lL~~ii~~a~~~i~~~l~~~~~~iP~~l~~iv~ev~vkryNR~g~E------G~~S~S~ 73 (110) T protein:vir:10 1 MTTLADVKKRI-GLKDEKQDEQLEEIIKSCESQLLSMLPIEVEQIPERFSYMIKEVAVKRYNRIGAE------GMTSEAV 73 (110) T ss_pred CchHHHHHHHh-CCCCCchhHHHHHHHHHHHHHHHHHhccchhhhhhHHHHHHHHHHHHHhcccCcc------ccceeec Confidence 99999999885 2233 458999999999998875311 1 1112223333332222211 111111 1689999 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCC--CccC Q lcl|NC_013597. 77 GELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVG--VMVA 119 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~G--g~va 119 (119) +.+|+||..+-- -+|-..+-+.++.-+.. 
|+|- T Consensus 74 eG~S~sf~d~d~----------~~y~~~l~~y~~~~~~~~kG~v~ 108 (110) T protein:vir:10 74 DGRSNAYELNDF----------KEYEAIIDNYFNARTRTKKGRAV 108 (110) T ss_pred Cceeeeeccccc----------chHHHHHHHHHhhcCCCCCceee Confidence 999999943211 16666665555443322 3344 No 61 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=45.34 E-value=0.77 Score=21.20 Aligned_cols=101 Identities=15% Similarity=0.100 Sum_probs=52.7 Q ss_pred CC---CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhC---C----------cCchhHHH----HHHHHHHHHHHHHhhhh Q lcl|NC_013597. 1 MP---LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVS---K----------VQWGKLYD----RGVMALTAHLLKLSADA 60 (119) Q Consensus 1 m~---t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~---~----------~~~g~~~~----~~~~l~~AH~l~l~~~~ 60 (119) |+ |++++.+++.++.+=..++++..|++|..+|- + ..+.+..+ +...-+++--|.. T Consensus 1 m~~fAtv~Dl~~r~r~L~~dE~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~V~~~~V~Ral~~---- 76 (132) T protein:vir:94 1 MNPFATVDDLTMLWRPLKGDEKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKSVTVDIVARTLMT---- 76 (132) T ss_pred CCCcCCHHHHHHHhccCChhHHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHHHHHHHHHHHhcC---- Confidence 55 89999999987776667899999999999982 1 11333322 2222333322211 Q ss_pred cccccccccceeeeeeccee--eeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 61 EISGGAANRNLASESAGELS--VSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 61 ~~~~~~~~g~vtS~svG~vS--vs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) +.+..+--..|.+.|..| .+|.+|. +-+-.|.- .+++.|.++-=. 
T Consensus 77 --~~~~~g~tq~S~TaG~ys~S~T~~np~------G~lylt~~------e~~~LGl~~~r~ 123 (132) T protein:vir:94 77 --STDQEPMTQTTESALGYSVSGSYLVPG------GGLFIKNS------ELSRLGLKKQRF 123 (132) T ss_pred --CCCCCCceeeeeecccceeeeeeecCC------CCceeChH------HHHhhCCCCCce Confidence 111112234678889774 5554332 11333332 333334332111 No 62 >protein:vir:7410 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839927;genbank:gi:30089897;genbank:GeneID:1260684 Probab=44.62 E-value=0.49 Score=22.26 Aligned_cols=83 Identities=16% Similarity=0.168 Sum_probs=45.2 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc---Cc-----------hhHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV---QW-----------GKLYDRGVMALTAHLLKLSADAEISGGA 66 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~---~~-----------g~~~~~~~~l~~AH~l~l~~~~~~~~~~ 66 (119) -+|+++||.--- ..+-+|+.|+.++.-|+.+|... .+ .++++.++.+|++|+-.-+.......-+ T Consensus 2 ~v~LdeiK~~LR-IDddDD~ll~~~i~aAe~yI~~Aig~~~~~~~fy~~e~~~~l~~~Avl~La~~wYeNR~at~~vp~~ 80 (107) T protein:vir:74 2 SVTVDDLLDQLS-EDDDRKPQLQIYFDTATAYVKNAVSSDTVDAPFFNVENVSPIYDVAVLSYSMDLWINRSTTMPPTTA 80 (107) T ss_pred eecHHHHHHHcC-CCCChhHHHHHHHHHHHHHHhhhcCCcccccccccccCcchHHHHHHHHHHHHHHHhccccccccHH Confidence 448999886432 22448999999999999999421 11 2367889999999998765332111100 Q ss_pred c-------ccceeeeee--cceeeeeccCccCCcchhhhhcCH Q lcl|NC_013597. 67 A-------NRNLASESA--GELSVSYTAPISANGSDDFYQLTA 100 (119) Q Consensus 67 ~-------~g~vtS~sv--G~vSvs~~~~~~~~~~~~w~~~T~ 100 (119) . .|.-...++ ++. +. +|. 
T Consensus 81 v~siI~QLRg~y~~~~e~~~~~------~~----------~~~ 107 (107) T protein:vir:74 81 VDHMVGQLRGLYSSWKEEQGGQ------NL----------QTE 107 (107) T ss_pred HHHHHHHHhhcccchhhhcCCC------cc----------cCC Confidence 0 111111110 110 00 111 No 63 >protein:vir:97430 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240751;genbank:gi:66396455;genbank:GeneID:5133786 Probab=43.38 E-value=0.46 Score=22.44 Aligned_cols=97 Identities=19% Similarity=0.198 Sum_probs=56.2 Q ss_pred CCHHHHHHhhh-hhcC-----CCHHHHHHHHHHHHHHhCCcCch-hHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGK-----TDAKRIGLFLSDAQAEVSKVQWG-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~-----vpd~~i~~~~~~A~~~~~~~~~g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+++.+.+-- ...+ +=...|..+++.|+.+-+ ..|+ +.+.-++..++|-.+.- +..+.++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN-~~F~~~~lP~gVkkfvAe~iky---------~~~~NissR 70 (104) T protein:vir:97 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDYCN-QKFDDKEVPSGVKKFIAECIKF---------GTTGNISAR 70 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhC-CCCCCccCCccHHHHHHHHHhh---------CCCCCcccc Confidence 33444444311 0111 113567889999977755 5676 46778889999987763 245689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+- +-++.|+- +...+|+.=.|=-| T Consensus 71 sMgtVSYTy~T~---------iP~~i~~~-L~PYRrlrw~~~~~ 104 (104) T protein:vir:97 71 TMGTVSYTYVTD---------IPSSAYAY-LLPYRKLSWGKRYV 104 (104) T ss_pred cccceeeechhh---------hhHHHHHh-hhhhhhhcccccCC Confidence 999999999542 22223331 11223443334444 No 64 >protein:vir:94492 Length: 104 # NCBI annotation: ORF049 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240678;genbank:gi:66396380;genbank:GeneID:5133756 Probab=43.38 E-value=0.46 Score=22.44 Aligned_cols=97 Identities=19% Similarity=0.198 Sum_probs=56.2 Q ss_pred CCHHHHHHhhh-hhcC-----CCHHHHHHHHHHHHHHhCCcCch-hHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGK-----TDAKRIGLFLSDAQAEVSKVQWG-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~-----vpd~~i~~~~~~A~~~~~~~~~g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+++.+.+-- ...+ +=...|..+++.|+.+-+ ..|+ +.+.-++..++|-.+.- +..+.++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN-~~F~~~~lP~gVkkfvAe~iky---------~~~~NissR 70 (104) T protein:vir:94 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDYCN-QKFDDKEVPSGVKKFIAECIKF---------GTTGNISAR 70 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhC-CCCCCccCCccHHHHHHHHHhh---------CCCCCcccc Confidence 33444444311 0111 113567889999977755 5676 46778889999987763 245689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+- +-++.|+- +...+|+.=.|=-| T Consensus 71 sMgtVSYTy~T~---------iP~~i~~~-L~PYRrlrw~~~~~ 104 (104) T protein:vir:94 71 TMGTVSYTYVTD---------IPSSAYAY-LLPYRKLSWGKRYV 104 (104) T ss_pred cccceeeechhh---------hhHHHHHh-hhhhhhhcccccCC Confidence 999999999542 22223331 11223443334444 No 65 >protein:vir:4857 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049397;genbank:gi:9632425;genbank:GeneID:1258493 Probab=42.58 E-value=0.45 Score=22.50 Aligned_cols=85 Identities=19% Similarity=0.157 Sum_probs=45.3 Q ss_pred CC-CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc-----C------chhHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_013597. 1 MP-LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV-----Q------WGKLYDRGVMALTAHLLKLSADAEISGGAAN 68 (119) Q Consensus 1 m~-t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~-----~------~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~ 68 (119) |+ |++++|.-.===.+..|+.|+.+++-|+.+|... . .....+.|+.++++|+-.-+......... T Consensus 1 M~vtLeevK~~LRID~d~dD~li~~~i~aA~~~i~~~ig~~~~~~~~~~~~~~~~~Avl~lv~~~Y~NR~~~~~~~~~-- 78 (104) T protein:vir:48 1 MSVSKETIMQTLNLDETDDTALIPAYIESARQYVVNSVGDDPKFYNLDSVRALFDTAVIALTSSYFTYRVALTDTATY-- 78 (104) T ss_pred CcccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHHhhCCCCCcccccCCChhHHHHHHHHHHHHHhhhhhhcccccc-- Confidence 55 8888887433222358999999999999987421 1 12467889999999998776432111000 Q ss_pred cceeeeeecceeeeeccCccCCcchhhhhcCHHHH--HHHHHHHHh-----CCCC Q lcl|NC_013597. 69 RNLASESAGELSVSYTAPISANGSDDFYQLTAYGQ--EYLRLRRLI-----GVGV 116 (119) Q Consensus 69 g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~--~y~~L~~~~-----g~Gg 116 (119) ..|+|. 
...+|+..+ +-|= T Consensus 79 -----------------------------~ip~~v~sli~~lR~~y~~~~~~~~~ 104 (104) T protein:vir:48 79 -----------------------------PVNLTLNSIIGQLRGLYATYSEERGD 104 (104) T ss_pred -----------------------------hhhHHHHHHHHHHHHhhhhhcccCCC Confidence 011111 011111111 0000 No 66 >protein:vir:105005 Length: 96 # NCBI annotation: putative DNA packaging protein phage # Family: family:all:316 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459970;genbank:gi:85701385;genbank:GeneID:3882146 Probab=42.42 E-value=0.51 Score=22.18 Aligned_cols=87 Identities=10% Similarity=-0.019 Sum_probs=45.8 Q ss_pred CCCHHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCc---Cc---hhHHHHHHHHHHHHHHHHhhhhccccc-cccccee Q lcl|NC_013597. 1 MPLTEDFLLRYT-EFGKTDAKRIGLFLSDAQAEVSKV---QW---GKLYDRGVMALTAHLLKLSADAEISGG-AANRNLA 72 (119) Q Consensus 1 m~t~~~Fr~~~P-~F~~vpd~~i~~~~~~A~~~~~~~---~~---g~~~~~~~~l~~AH~l~l~~~~~~~~~-~~~g~vt 72 (119) |.|++++|+--= .+ | .|+.|+.+++-|+.++... .+ -.....++.++++|+-.-+.....+.. ..+-.|. T Consensus 2 ~vtLee~K~~LRID~-D-dD~lI~~~i~aA~~~i~~~~g~~~~e~~~~~k~Avl~lv~~~YenR~~~~~~~~~~ip~~v~ 79 (96) T protein:vir:10 2 LVTLEEAKEWIRVDG-D-DDPTITMLIKAAELYIYKATGKTFTQTNEDAKLLCLFLVADWYGNRLLVGEKASEKIRTIVQ 79 (96) T ss_pred cCCHHHHHHHcCCCC-c-hhHHHHHHHHHHHHHHHHhhCCCCCCCcchHHHHHHHHHHHHHhhhhhccccccchhhHHHH Confidence 889999987432 34 4 6889999999999998431 12 246678999999999876643221110 0110010 Q ss_pred eeeecceeeeeccCccCCcch Q lcl|NC_013597. 73 SESAGELSVSYTAPISANGSD 93 (119) Q Consensus 73 S~svG~vSvs~~~~~~~~~~~ 93 (119) | =+..+.-.|+...- +. 
T Consensus 80 s-li~qLr~~~~~~~e---~~ 96 (96) T protein:vir:10 80 S-MILQLQYASEPQEE---RK 96 (96) T ss_pred H-HHHHHhhcCCcccc---cC Confidence 0 00111111111100 00 No 67 >protein:vir:107614 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338189;genbank:gi:77020184;genbank:GeneID:3703745 Probab=42.42 E-value=0.51 Score=22.18 Aligned_cols=87 Identities=10% Similarity=-0.019 Sum_probs=45.8 Q ss_pred CCCHHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCc---Cc---hhHHHHHHHHHHHHHHHHhhhhccccc-cccccee Q lcl|NC_013597. 1 MPLTEDFLLRYT-EFGKTDAKRIGLFLSDAQAEVSKV---QW---GKLYDRGVMALTAHLLKLSADAEISGG-AANRNLA 72 (119) Q Consensus 1 m~t~~~Fr~~~P-~F~~vpd~~i~~~~~~A~~~~~~~---~~---g~~~~~~~~l~~AH~l~l~~~~~~~~~-~~~g~vt 72 (119) |.|++++|+--= .+ | .|+.|+.+++-|+.++... .+ -.....++.++++|+-.-+.....+.. ..+-.|. T Consensus 2 ~vtLee~K~~LRID~-D-dD~lI~~~i~aA~~~i~~~~g~~~~e~~~~~k~Avl~lv~~~YenR~~~~~~~~~~ip~~v~ 79 (96) T protein:vir:10 2 LVTLEEAKEWIRVDG-D-DDPTITMLIKAAELYIYKATGKTFTQTNEDAKLLCLFLVADWYGNRLLVGEKASEKIRTIVQ 79 (96) T ss_pred cCCHHHHHHHcCCCC-c-hhHHHHHHHHHHHHHHHHhhCCCCCCCcchHHHHHHHHHHHHHhhhhhccccccchhhHHHH Confidence 889999987432 34 4 6889999999999998431 12 246678999999999876643221110 0110010 Q ss_pred eeeecceeeeeccCccCCcch Q lcl|NC_013597. 73 SESAGELSVSYTAPISANGSD 93 (119) Q Consensus 73 S~svG~vSvs~~~~~~~~~~~ 93 (119) | =+..+.-.|+...- +. T Consensus 80 s-li~qLr~~~~~~~e---~~ 96 (96) T protein:vir:10 80 S-MILQLQYASEPQEE---RK 96 (96) T ss_pred H-HHHHHhhcCCcccc---cC Confidence 0 00111111111100 00 No 68 >protein:vir:102083 Length: 96 # NCBI annotation: DNA packaging protein # Family: family:all:316 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512316;genbank:gi:89152485;genbank:GeneID:3953076 Probab=42.42 E-value=0.51 Score=22.18 Aligned_cols=87 Identities=10% Similarity=-0.019 Sum_probs=45.8 Q ss_pred CCCHHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCc---Cc---hhHHHHHHHHHHHHHHHHhhhhccccc-cccccee Q lcl|NC_013597. 
1 MPLTEDFLLRYT-EFGKTDAKRIGLFLSDAQAEVSKV---QW---GKLYDRGVMALTAHLLKLSADAEISGG-AANRNLA 72 (119) Q Consensus 1 m~t~~~Fr~~~P-~F~~vpd~~i~~~~~~A~~~~~~~---~~---g~~~~~~~~l~~AH~l~l~~~~~~~~~-~~~g~vt 72 (119) |.|++++|+--= .+ | .|+.|+.+++-|+.++... .+ -.....++.++++|+-.-+.....+.. ..+-.|. T Consensus 2 ~vtLee~K~~LRID~-D-dD~lI~~~i~aA~~~i~~~~g~~~~e~~~~~k~Avl~lv~~~YenR~~~~~~~~~~ip~~v~ 79 (96) T protein:vir:10 2 LVTLEEAKEWIRVDG-D-DDPTITMLIKAAELYIYKATGKTFTQTNEDAKLLCLFLVADWYGNRLLVGEKASEKIRTIVQ 79 (96) T ss_pred cCCHHHHHHHcCCCC-c-hhHHHHHHHHHHHHHHHHhhCCCCCCCcchHHHHHHHHHHHHHhhhhhccccccchhhHHHH Confidence 889999987432 34 4 6889999999999998431 12 246678999999999876643221110 0110010 Q ss_pred eeeecceeeeeccCccCCcch Q lcl|NC_013597. 73 SESAGELSVSYTAPISANGSD 93 (119) Q Consensus 73 S~svG~vSvs~~~~~~~~~~~ 93 (119) | =+..+.-.|+...- +. T Consensus 80 s-li~qLr~~~~~~~e---~~ 96 (96) T protein:vir:10 80 S-MILQLQYASEPQEE---RK 96 (96) T ss_pred H-HHHHHhhcCCcccc---cC Confidence 0 00111111111100 00 No 69 >protein:vir:102863 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338138;genbank:gi:77020236;genbank:GeneID:3703772 Probab=42.42 E-value=0.51 Score=22.18 Aligned_cols=87 Identities=10% Similarity=-0.019 Sum_probs=45.8 Q ss_pred CCCHHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCc---Cc---hhHHHHHHHHHHHHHHHHhhhhccccc-cccccee Q lcl|NC_013597. 1 MPLTEDFLLRYT-EFGKTDAKRIGLFLSDAQAEVSKV---QW---GKLYDRGVMALTAHLLKLSADAEISGG-AANRNLA 72 (119) Q Consensus 1 m~t~~~Fr~~~P-~F~~vpd~~i~~~~~~A~~~~~~~---~~---g~~~~~~~~l~~AH~l~l~~~~~~~~~-~~~g~vt 72 (119) |.|++++|+--= .+ | .|+.|+.+++-|+.++... .+ -.....++.++++|+-.-+.....+.. ..+-.|. 
T Consensus 2 ~vtLee~K~~LRID~-D-dD~lI~~~i~aA~~~i~~~~g~~~~e~~~~~k~Avl~lv~~~YenR~~~~~~~~~~ip~~v~ 79 (96) T protein:vir:10 2 LVTLEEAKEWIRVDG-D-DDPTITMLIKAAELYIYKATGKTFTQTNEDAKLLCLFLVADWYGNRLLVGEKASEKIRTIVQ 79 (96) T ss_pred cCCHHHHHHHcCCCC-c-hhHHHHHHHHHHHHHHHHhhCCCCCCCcchHHHHHHHHHHHHHhhhhhccccccchhhHHHH Confidence 889999987432 34 4 6889999999999998431 12 246678999999999876643221110 0110010 Q ss_pred eeeecceeeeeccCccCCcch Q lcl|NC_013597. 73 SESAGELSVSYTAPISANGSD 93 (119) Q Consensus 73 S~svG~vSvs~~~~~~~~~~~ 93 (119) | =+..+.-.|+...- +. T Consensus 80 s-li~qLr~~~~~~~e---~~ 96 (96) T protein:vir:10 80 S-MILQLQYASEPQEE---RK 96 (96) T ss_pred H-HHHHHhhcCCcccc---cC Confidence 0 00111111111100 00 No 70 >protein:vir:95071 Length: 104 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240825;genbank:gi:66394717;genbank:GeneID:5133865 Probab=42.16 E-value=0.5 Score=22.24 Aligned_cols=97 Identities=19% Similarity=0.200 Sum_probs=56.0 Q ss_pred CCHHHHHHhhh-hhcC-----CCHHHHHHHHHHHHHHhCCcCch-hHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGK-----TDAKRIGLFLSDAQAEVSKVQWG-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~-----vpd~~i~~~~~~A~~~~~~~~~g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+++.+.+-- ...+ +=...|..+++.|+.+-+ ..|+ +.+.-++..++|-.+.- +..+.++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN-~~F~~~~lP~gVkkfvAe~iky---------~~~~NissR 70 (104) T protein:vir:95 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCN-QKFDDKEVPSGVKKFIAECIKF---------GTTGNISAR 70 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhC-CCCCCccCCccHHHHHHHHHhh---------CCCCCcccc Confidence 33444444311 0111 113567889999977755 5676 46778889999987763 245689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+- +-++.|+- +...+|+.=.|=-| T Consensus 71 sMgtVSYTy~T~---------iP~~i~~~-L~PYRrlrw~~~~~ 104 (104) T protein:vir:95 71 TMGTVSYTYVTD---------IPSSAYAY-LMPYRKLSWGKRYV 104 (104) T ss_pred cccceeeechhh---------hhHHHHHh-hhhhhhhcccccCC Confidence 999999999542 22223331 11223333333344 No 71 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=40.16 E-value=0.27 Score=23.72 Aligned_cols=76 Identities=13% Similarity=0.190 Sum_probs=47.0 Q ss_pred CCC------------HHHHHHhhhh-hcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_013597. 1 MPL------------TEDFLLRYTE-FGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAA 67 (119) Q Consensus 1 m~t------------~~~Fr~~~P~-F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~ 67 (119) +|- |...|-.|-. ..++|++.|+..+++|..++.+ T Consensus 100 ~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~n-------------------------------- 147 (188) T protein:vir:78 100 LPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSN-------------------------------- 147 (188) T ss_pred CcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcC-------------------------------- Confidence 221 1222222222 3469999999999999987754 Q ss_pred ccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
68 NRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 68 ~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) .....++++|+.|++|.+.+..+ -.+.=+++++++-.+.+- T Consensus 148 p~~L~q~~vG~~S~tfa~~~~~s----------l~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 148 PELLVSKQVGEIERRFGSVAGTS----------LSKADQAILDRYVIATLA 188 (188) T ss_pred cccceeeecCceeeecccccCCc----------ccchhHHhhccccccccC Confidence 23468899999999997543322 223335666666665554 No 72 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=40.16 E-value=0.27 Score=23.72 Aligned_cols=76 Identities=13% Similarity=0.190 Sum_probs=47.0 Q ss_pred CCC------------HHHHHHhhhh-hcCCCHHHHHHHHHHHHHHhCCcCchhHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_013597. 1 MPL------------TEDFLLRYTE-FGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAA 67 (119) Q Consensus 1 m~t------------~~~Fr~~~P~-F~~vpd~~i~~~~~~A~~~~~~~~~g~~~~~~~~l~~AH~l~l~~~~~~~~~~~ 67 (119) +|- |...|-.|-. ..++|++.|+..+++|..++.+ T Consensus 100 ~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~n-------------------------------- 147 (188) T protein:vir:10 100 LPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSN-------------------------------- 147 (188) T ss_pred CcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcC-------------------------------- Confidence 221 1222222222 3469999999999999987754 Q ss_pred ccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
68 NRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 68 ~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) .....++++|+.|++|.+.+..+ -.+.=+++++++-.+.+- T Consensus 148 p~~L~q~~vG~~S~tfa~~~~~s----------l~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 148 PELLVSKQVGEIERRFGSVAGTS----------LSKADQAILDRYVIATLA 188 (188) T ss_pred cccceeeecCceeeecccccCCc----------ccchhHHhhccccccccC Confidence 23468899999999997543322 223335666666665554 No 73 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=39.12 E-value=1 Score=20.55 Aligned_cols=83 Identities=12% Similarity=0.101 Sum_probs=46.2 Q ss_pred CCCHHHHHHhhhhhcC--CCHHHHHHHHHHHHHHhCC--------c--Cc----------hhHHHHHHHHHHHHHHHHhh Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGK--TDAKRIGLFLSDAQAEVSK--------V--QW----------GKLYDRGVMALTAHLLKLSA 58 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~--vpd~~i~~~~~~A~~~~~~--------~--~~----------g~~~~~~~~l~~AH~l~l~~ 58 (119) |+|++++|.-.=--.| -.|+.|+.+++-|+.++.. . ++ -...+.++.++++|+-..+. T Consensus 2 ~vtL~e~K~hLRId~D~~ddD~lI~~~i~AA~~~i~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~AvLllv~~~Y~NRe 81 (107) T protein:vir:45 2 LLKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRKLYATADDRPADDPDGLVISDDVKLALLLLVSHFYENRS 81 (107) T ss_pred CCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccccccccCChhHHHHHHHHHHHHHhhhh Confidence 8899999985432223 2588899999999887631 1 00 13356788999999875542 Q ss_pred hhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHH-HHHHHHHHhCC Q lcl|NC_013597. 59 DAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQ-EYLRLRRLIGV 114 (119) Q Consensus 59 ~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~-~y~~L~~~~g~ 114 (119) ... +.+ ...-|+|. 
..++-.|..++ T Consensus 82 ~~~----------------~~~---------------~~~lp~~v~~Ll~~~R~~~~ 107 (107) T protein:vir:45 82 TVT----------------DVE---------------KMELPMSFNWLVAPYRLIPL 107 (107) T ss_pred hcc----------------ccc---------------hhccchHHHHHHHHHhhcCC Confidence 110 000 01124552 22222344444 No 74 >protein:vir:99922 Length: 165 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655526;genbank:gi:109392296;genbank:GeneID:4157091 Probab=36.74 E-value=0.69 Score=21.45 Aligned_cols=105 Identities=13% Similarity=0.125 Sum_probs=55.8 Q ss_pred CCCHHH-----HHHhhhhhcCCCHHHHHHHHHHHHHHh---CCcCc--h-hHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_013597. 1 MPLTED-----FLLRYTEFGKTDAKRIGLFLSDAQAEV---SKVQW--G-KLYDRGVMALTAHLLKLSADAEISGGAANR 69 (119) Q Consensus 1 m~t~~~-----Fr~~~P~F~~vpd~~i~~~~~~A~~~~---~~~~~--g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g 69 (119) -||+.+ --+.--.|+++|.+.+..+++++...- -|+-- + +..+.+-..+.--+|--+. .-.| T Consensus 3 ~~~~~~p~~ii~~eDl~Pf~~i~~~ka~~mI~da~A~A~~vAPCi~~~~f~~~~aAKaIlrgAiLRW~e-------~GSG 75 (165) T protein:vir:99 3 EPTPTEPEPLLTAEDLAPFATIPKAKADEMIEDALGMAEVHAPCINDPGFAHRRAAKAILRGAILRWNE-------AGAG 75 (165) T ss_pred CCCCCCcceeeehhhccccccCCHHHHHHHHhhhhhhhhhhccccCCCCcccHHHHHHHHHHhhhhhhc-------ccCc Confidence 122211 111223478899988888888776542 23221 1 2223232333333332221 1157 Q ss_pred ceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhC-CC---CccC Q lcl|NC_013597. 70 NLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIG-VG---VMVA 119 (119) Q Consensus 70 ~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g-~G---g~va 119 (119) .++++|.|...+++|+.+-.... -|=-+--+|.|+|. 
-| +-.+ T Consensus 76 Ait~~TaGPf~qT~DtRs~r~~m-------fwPSEItqLqklC~~~g~~~~AFs 122 (165) T protein:vir:99 76 AATTKTAGIYGQTVDTRQPRKAM-------FFPSEIDQLRKLCRPDDDNGGAFS 122 (165) T ss_pred eeeecccccceeeeccccccccc-------cChhhHHHHHHHhcCCCCCCccee Confidence 78999999999999887543221 12246668999995 32 3333 No 75 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=34.37 E-value=1.3 Score=19.97 Aligned_cols=83 Identities=10% Similarity=0.010 Sum_probs=45.5 Q ss_pred CCCHHHHHHhhhhhcCC--CHHHHHHHHHHHHHHhCC----------cCc----------hhHHHHHHHHHHHHHHHHhh Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKT--DAKRIGLFLSDAQAEVSK----------VQW----------GKLYDRGVMALTAHLLKLSA 58 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~v--pd~~i~~~~~~A~~~~~~----------~~~----------g~~~~~~~~l~~AH~l~l~~ 58 (119) |+|+++.|.-.==-.|+ .|+.|+.+++-|..++.. +.. -...+.++.++++|+=..+. T Consensus 2 ~vtL~e~K~hLRid~D~~ddD~li~~~i~aA~~~i~~~~~r~l~~~~~~~~~~~~~~~~~~~~ik~Avlllv~~~Y~NRe 81 (107) T protein:vir:48 2 LLKEEEIKSHLRLDDGLYSDGDFLKLLAQAVQKRTETYLNRKLYAPEETIPEDDPDGMHLTDDVRLAMLMLVSHFYENRS 81 (107) T ss_pred CCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccCccccccchhHHHHHHHHHHHHHhhhh Confidence 88999999854332233 588899999999877621 111 13356799999999876542 Q ss_pred hhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHH-HHHHHHHHhCC Q lcl|NC_013597. 59 DAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQ-EYLRLRRLIGV 114 (119) Q Consensus 59 ~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~-~y~~L~~~~g~ 114 (119) .. +.+ ++ ..-|+|. ..++-.|.++. 
T Consensus 82 ~v-----------~~~-----~~---------------~~iP~~v~~LL~~yR~~~l 107 (107) T protein:vir:48 82 TI-----------TDV-----EK---------------LETPMSFRWLAGPYRIVPL 107 (107) T ss_pred hh-----------ccc-----cc---------------cccCHHHHHHHHHhhccCC Confidence 21 000 00 1114441 12222233333 No 76 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=32.82 E-value=1.4 Score=19.79 Aligned_cols=113 Identities=22% Similarity=0.144 Sum_probs=58.3 Q ss_pred CCCHHHHHHhhhhhc----CCCHHHHHHHHHHHHHHhCC-cCc-hhH---HHHHH--------------------HHHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTEFG----KTDAKRIGLFLSDAQAEVSK-VQW-GKL---YDRGV--------------------MALTA 51 (119) Q Consensus 1 m~t~~~Fr~~~P~F~----~vpd~~i~~~~~~A~~~~~~-~~~-g~~---~~~~~--------------------~l~~A 51 (119) ..|++++++.+-... +..|+..+..+..|..+||. .+| |+. .++++ .+..| T Consensus 18 Yvtv~~a~aY~~~rg~~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~~~~IP~~v~~A 97 (172) T protein:vir:97 18 YISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVKEA 97 (172) T ss_pred cccHHHHHHHHHhcCcccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccccccccHHHHHH Confidence 778888887665433 24467788899999999986 356 211 11111 11111 Q ss_pred ----HHHHHhhhhc--cccccccc--ceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHHHHHH---HhCCCCccC Q lcl|NC_013597. 52 ----HLLKLSADAE--ISGGAANR--NLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRR---LIGVGVMVA 119 (119) Q Consensus 52 ----H~l~l~~~~~--~~~~~~~g--~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~---~~g~Gg~va 119 (119) -+..|.+... .......+ .+.++++|+|++.|...+.... 
.++.|..- -+|++ ..+.||+.- T Consensus 98 ~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~~~~-----~~p~~~~v-~aLL~p~gl~~~~~~~~ 170 (172) T protein:vir:97 98 CAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVF-----QMPKYPAA-DQKLVRAGLVRSGGTLL 170 (172) T ss_pred HHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCCCCC-----ccccHHHH-HHHHhhhccccCcceec Confidence 1122322211 11122233 4567888999999865433211 23445432 44443 444555544 No 77 >protein:vir:9706 Length: 100 # NCBI annotation: hypothetical protein # Family: family:all:316 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795468;genbank:gi:28876223;genbank:GeneID:1257767 Probab=32.65 E-value=0.82 Score=21.05 Aligned_cols=85 Identities=14% Similarity=0.067 Sum_probs=44.0 Q ss_pred CCCHHHHHHhhhh-hc---CCCHHHHHHHHHHHHHHhCCc--------Cch--hHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTE-FG---KTDAKRIGLFLSDAQAEVSKV--------QWG--KLYDRGVMALTAHLLKLSADAEISGGA 66 (119) Q Consensus 1 m~t~~~Fr~~~P~-F~---~vpd~~i~~~~~~A~~~~~~~--------~~g--~~~~~~~~l~~AH~l~l~~~~~~~~~~ 66 (119) |+--+++...... +. +.+|+.|+.+++-|+.+|+.. -|. ....+|++++++|+---++...... T Consensus 1 m~~t~e~L~~lK~~lRID~d~DD~li~~~i~~Ae~~I~~AV~~~~t~~~~~~~~rF~~Av~~Lv~~~Y~nR~~t~d~~-- 78 (100) T protein:vir:97 1 MAVSKELLNSVKLYCKIDFDFENDIIKEMIESAQEQICFAIDDGSTPEMFEGHAKFALAVKKQVKEEYDHRGLSADSF-- 78 (100) T ss_pred CcccHHHHHHHHHHcCCCCCcchHHHHHHHHHHHHHHhhhccCCCCcchhhccchHHHHHHHHHHHHHHhccccchhh-- Confidence 8855554443333 22 478999999999999999632 122 3567999999999987653221110 Q ss_pred cccceeeeeecceeeeeccCccCCcchhhhhcCHHHH-HHHHHHHHhCCCC Q lcl|NC_013597. 67 ANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQ-EYLRLRRLIGVGV 116 (119) Q Consensus 67 ~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~-~y~~L~~~~g~Gg 116 (119) ...-|+|. 
-..+=+|.+|--- T Consensus 79 -----------------------------~~~ip~gv~~lI~QLR~~~~~~ 100 (100) T protein:vir:97 79 -----------------------------RYPLANGVLNIIHQLRLRGDDS 100 (100) T ss_pred -----------------------------cchhhhhHHHHHHHHHHhhcCC Confidence 00001110 0000011111111 No 78 >protein:vir:4831 Length: 105 # NCBI annotation: ORF27 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038328;genbank:gi:9634654;genbank:GeneID:1262588 Probab=32.31 E-value=0.94 Score=20.73 Aligned_cols=85 Identities=14% Similarity=0.106 Sum_probs=45.9 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc-----Cc------hhHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV-----QW------GKLYDRGVMALTAHLLKLSADAEISGGAANR 69 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~-----~~------g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g 69 (119) ..|++++|.-.===.+..|+.|+.+++-|+.++... .+ -.....|+.++++|+-.-+....... T Consensus 2 ~vtLee~K~~LRID~dddD~lI~~~i~aA~~yi~~~ig~~~~~~~~~~~~~~~~~Avl~lv~~~YeNR~~~~~~~----- 76 (105) T protein:vir:48 2 SVSKTSIMQTLNLDETDDTALIPAYIESAKQYIINAVGSDSKFYDLENVQPLFDTAVIALTSSYFTYRVALTDTV----- 76 (105) T ss_pred cccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHHhhCCCCccccccCCchHHHHHHHHHHHHHHhhhhhccCcc----- Confidence 448999987432112358999999999999997421 11 23678899999999987663211000 Q ss_pred ceeeeeecceeeeeccCccCCcchhhhhcCHHHH--HHHHHHHH------hCCCC Q lcl|NC_013597. 70 NLASESAGELSVSYTAPISANGSDDFYQLTAYGQ--EYLRLRRL------IGVGV 116 (119) Q Consensus 70 ~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~--~y~~L~~~------~g~Gg 116 (119) .+..|+|. ..-+|+.. 
..--| T Consensus 77 --------------------------~~~ip~~v~sli~~lR~~y~~~~e~~~~g 105 (105) T protein:vir:48 77 --------------------------TYPINLTLNSIIGQLRGLYATYSEVVANG 105 (105) T ss_pred --------------------------cchhhHHHHHHHHHHhhhhhhhhhcccCC Confidence 00112221 11111111 11111 No 79 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=31.94 E-value=1.5 Score=19.68 Aligned_cols=83 Identities=14% Similarity=0.088 Sum_probs=46.2 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC----c------C----------------chhHHHHHHHHHHHHHH Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK----V------Q----------------WGKLYDRGVMALTAHLL 54 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~----~------~----------------~g~~~~~~~~l~~AH~l 54 (119) |+|+++.|.-.=-=.|.+|+.|+.+++-|+..+.. + . .-...+.++.|+++|+- T Consensus 3 ~vtLee~K~hLRvd~d~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AvLllv~~~Y 82 (113) T protein:vir:10 3 LVELKLALGFVRANAGVEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPMVVNAAIRAAILKITAELY 82 (113) T ss_pred CCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCccccccccccccccccccccccccccChHHHHHHHHHHHHHH Confidence 67999998754322346899999999999866521 1 0 11335789999999987 Q ss_pred HHhhhhcccccccccceeeeeecceeeeeccCccCCcchhhhhcCHHHHHHH--HHHHHhCC Q lcl|NC_013597. 55 KLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYL--RLRRLIGV 114 (119) Q Consensus 55 ~l~~~~~~~~~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~--~L~~~~g~ 114 (119) ..+-... .|. ...-|+|.+.+ .+|+-.|. 
T Consensus 83 ~nRe~~~--------------~~~-----------------~~~lP~~v~~Ll~~yR~~~g~ 113 (113) T protein:vir:10 83 ANREDTA--------------FGP-----------------ITELPLNARALLRPHRIIPGV 113 (113) T ss_pred hhhhhhc--------------hhh-----------------hhccCHHHHHHHHHhhhhcCC Confidence 6542210 010 11225553322 12222333 No 80 >protein:vir:1384 Length: 92 # NCBI annotation: Gp7 protein # Family: family:all:316 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612836;genbank:gi:20065970;genbank:GeneID:935785 Probab=31.60 E-value=1 Score=20.58 Aligned_cols=85 Identities=13% Similarity=-0.022 Sum_probs=47.6 Q ss_pred CCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCcC---c--hhHHHHHHHHHHHHHHHHhhhhcccccccccceeeeee Q lcl|NC_013597. 2 PLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQ---W--GKLYDRGVMALTAHLLKLSADAEISGGAANRNLASESA 76 (119) Q Consensus 2 ~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~~---~--g~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~sv 76 (119) .|++++|+-.===.+..|+.|+.+++-|+.+|.... + -...+.++.++++|+-.-+.....+.. T Consensus 1 vtLeevK~~LRID~ddDD~lI~~~i~aA~~~i~~~~~~~~~~~~~~~~Avlllv~~~YenR~~~~~~~~----------- 69 (92) T protein:vir:13 1 MDLRELKEYLRIDFEEDDILLRSLLLAAEEYLYNAGIKRDYKKSLYSLAIKILVKHWYDNRDCVVAGNV----------- 69 (92) T ss_pred CCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHhhccccccchhHHHHHHHHHHHHhHhccccccccch----------- Confidence 899999875432224689999999999999984321 1 246678999999998766532110000 Q ss_pred cceeeeeccCccCCcchhhhhcCHHHHH-HHHHHHHhCC-CC Q lcl|NC_013597. 
77 GELSVSYTAPISANGSDDFYQLTAYGQE-YLRLRRLIGV-GV 116 (119) Q Consensus 77 G~vSvs~~~~~~~~~~~~w~~~T~YG~~-y~~L~~~~g~-Gg 116 (119) -..-|||.+ .++-.|.+|- -| T Consensus 70 -------------------~~~ip~~v~sll~~lR~~~~~~~ 92 (92) T protein:vir:13 70 -------------------NNKLEYSLNAILTQLRYCGDDNG 92 (92) T ss_pred -------------------hhhhhHHHHHHHHHhhhccCCCC Confidence 001133322 1222222222 11 No 81 >protein:vir:80668 Length: 153 # NCBI annotation: gp7 # Family: family:all:7267 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285583;genbank:gi:148727089;genbank:GeneID:5247039 Probab=30.57 E-value=1.1 Score=20.44 Aligned_cols=101 Identities=18% Similarity=0.192 Sum_probs=55.7 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHH---hCCcCc--h-hHHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAE---VSKVQW--G-KLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~---~~~~~~--g-~~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) .+++++ -+.|+++|.+.++..++++... +-|+-- + +..+.+-..+.--+|--+ ..+..|.++++ T Consensus 4 ~i~~~D----l~pF~dI~~~k~~ami~D~~a~A~~vAPCi~~~~f~~~~aAKaIlrgAiLRW~------e~G~SGait~~ 73 (153) T protein:vir:80 4 ILKPED----IEPFADIPREKLEAMIADVEAVAVSVAPCIAKPDFKYKDAAKAILRRALLRWN------DTGVSGQVQYE 73 (153) T ss_pred eechhh----ccccccCCHHHHHHHHHhhhhhhhhhccccCCCCcccHHHHHHHHHHHhhhhh------hcCcccceeee Confidence 234444 4778999988777777766543 223221 1 122222233333333222 23456789999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc----C Q lcl|NC_013597. 75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV----A 119 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v----a 119 (119) |.|...++.|+.+-. 
+-- |=-+--+|.|+|..-|.- + T Consensus 74 taGpf~qT~dtrs~r--~lf------wPSEItqLqklC~~~~~~g~Af~ 114 (153) T protein:vir:80 74 SAGPFAQTTRSNTPT--NLL------WPSEIAALKKLCEGDGGAGKAFT 114 (153) T ss_pred ccccceeeeccCCce--ecc------ChhhHHHHHHHhcCCCCCcceeE Confidence 999999998876532 111 234566899999422222 2 No 82 >protein:vir:1026 Length: 107 # NCBI annotation: Orf46 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076680;genbank:gi:13095789;genbank:GeneID:920344 Probab=28.90 E-value=1.5 Score=19.69 Aligned_cols=85 Identities=15% Similarity=0.156 Sum_probs=44.9 Q ss_pred CCCHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC----c----Cc------hhHHHHHHHHHHHHHHHHhhhhcccc-- Q lcl|NC_013597. 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK----V----QW------GKLYDRGVMALTAHLLKLSADAEISG-- 64 (119) Q Consensus 1 m~t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~----~----~~------g~~~~~~~~l~~AH~l~l~~~~~~~~-- 64 (119) ..|++++|..---=.+ +|+.|+.+|.-|+.+|.. + .| -.+++.|+.+|++|+-.-+....... T Consensus 2 ~vtld~iK~sLriD~d-Dd~~l~~~l~aA~~YIk~Aig~d~~~~~Fy~~e~~~~lfd~Avl~La~~~Y~nR~at~~vp~~ 80 (107) T protein:vir:10 2 SVTVDDLLDQLSEDDD-RKPQLQIYFDTATAYVKNAVSSDTVDAPFFNVENVSPIYDVAVLSYSMDLWINRSTTMPPTTA 80 (107) T ss_pred eecHHHHHHHhcCCCC-chHHHHHHHHHHHHHHhhhcCcccccCCccccccchhHHHHHHHHHHHHHhhcccceeecchH Confidence 4599998875422233 799999999999999821 1 12 23678899999999965543221100 Q ss_pred -----cccccceeeeeecceeeeeccCccCCcchhhhhcCH Q lcl|NC_013597. 65 -----GAANRNLASESAGELSVSYTAPISANGSDDFYQLTA 100 (119) Q Consensus 65 -----~~~~g~vtS~svG~vSvs~~~~~~~~~~~~w~~~T~ 100 (119) +...|.-....++.= +.+ .+|. 
T Consensus 81 v~siI~QLRg~y~~~~e~~~----~~~----------~~~~ 107 (107) T protein:vir:10 81 VDHMVGQLRGLYSSWKEAQD----GQN----------LQTE 107 (107) T ss_pred HHHHHHHHhhhhcccccccC----CCc----------ccCC Confidence 011121111111100 000 1111 No 83 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=24.49 E-value=2.2 Score=18.74 Aligned_cols=105 Identities=17% Similarity=0.079 Sum_probs=50.2 Q ss_pred CC---CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhC---Cc------Cch---hHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_013597. 1 MP---LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVS---KV------QWG---KLYDRGVMALTAHLLKLSADAEISGG 65 (119) Q Consensus 1 m~---t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~---~~------~~g---~~~~~~~~l~~AH~l~l~~~~~~~~~ 65 (119) |+ |++++.+++.++..=..++++..|++|..+|- |. .|- ...+..+...+|-...-... .+.+ T Consensus 1 m~~fAtv~D~~~rwr~Lt~~E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~~V~Ral~--~~~~ 78 (131) T protein:vir:95 1 MENFATVEDLKKLWRALKFDEEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVDVVARTLM--TSTD 78 (131) T ss_pred CCccCCHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHHHHHHHhc--CCCC Confidence 55 88999999987765556789999999999872 21 111 11122222222221111110 0111 Q ss_pred ccccceeeeeeccee--eeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 66 AANRNLASESAGELS--VSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~vS--vs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) ..+-...|.+.|..| .+|.+|.. -+-.|.- .+++.|.++-=. 
T Consensus 79 ~~G~tq~S~TaG~ys~S~t~~~p~g------~lylt~~------e~~~LGl~~~r~ 122 (131) T protein:vir:95 79 QEPMTQVAESALGYSFSGSYLVPGG------GLFIKDS------ELKRLGLKKQRY 122 (131) T ss_pred CCCceeeeeecccceeeeeeecCCC------CceeChH------HHHHhCCCCCce Confidence 122234678899884 45544422 1222221 222223322111 No 84 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=21.82 E-value=2.5 Score=18.36 Aligned_cols=105 Identities=17% Similarity=0.149 Sum_probs=51.4 Q ss_pred CC---CHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC--cCchh----------HHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_013597. 1 MP---LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK--VQWGK----------LYDRGVMALTAHLLKLSADAEISGG 65 (119) Q Consensus 1 m~---t~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~--~~~g~----------~~~~~~~l~~AH~l~l~~~~~~~~~ 65 (119) |+ |++++.+++.++.+=.+++++..|++|..+|-. .++|. ..+..+...++-...-... . +.+ T Consensus 1 m~~fATv~Dv~~rwr~Lt~dE~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V~~~mV~Ral~-~-~~d 78 (140) T protein:vir:97 1 MGNFATTDDVILLWRPLSVDELKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSVTVDIVARTLM-T-STQ 78 (140) T ss_pred CCcCCCHHHHHHHhcCCCHhHHHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHHHHHHHHHHhc-C-CCC Confidence 55 889999999777655567999999999998821 11331 1112222222222111110 0 111 Q ss_pred ccccceeeeeecce--eeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCccC Q lcl|NC_013597. 66 AANRNLASESAGEL--SVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) Q Consensus 66 ~~~g~vtS~svG~v--Svs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~va 119 (119) ..+-...|.+.|.. |.+|.+|.. -+-.|.- .+++.|.++-=. 
T Consensus 79 ~~G~tq~S~TaG~ys~S~T~~np~G------~lylt~~------e~~~LGl~~~r~ 122 (140) T protein:vir:97 79 GEPMSQESQSALGYTWSGTYLVPGG------GLFIKDN------ELKRLGLKKQRY 122 (140) T ss_pred CCcceeeeeeccchhheeeeecCCC------CceeChH------HHHHhCCCCCce Confidence 12223567888988 455544421 1222221 233333332211 No 85 >protein:vir:107119 Length: 104 # NCBI annotation: conserved phage protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950608;genbank:gi:119953688;genbank:GeneID:4643128 Probab=21.74 E-value=2.6 Score=18.35 Aligned_cols=97 Identities=19% Similarity=0.271 Sum_probs=57.2 Q ss_pred CCHHHHHHhhh-hhcCC-C----HHHHHHHHHHHHHHhCCcCchh-HHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGKT-D----AKRIGLFLSDAQAEVSKVQWGK-LYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~v-p----d~~i~~~~~~A~~~~~~~~~g~-~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+.+.+.+-- ...++ - ...|..+++.|+.+-+ ..|++ -+.-.+..++|-.+.- ++.++++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn-~~F~~~~lP~gV~~fvA~~iky---------~~~~NissR 70 (104) T protein:vir:10 1 MNAQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEYCN-QTFNRKSLPSNVEKFIANCIKQ---------GTTSNISSR 70 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcC-CCCCCCCCCccHHHHHHHHHhh---------cCCCCcccc Confidence 23444444311 11111 1 3567889999977655 56876 6778888999987763 235689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+..- ...-+||+ ..||+.-.|=-| T Consensus 71 SMGtVSyTy~t~iP-~~i~~~L~---------PYRklr~~~~~~ 104 (104) T protein:vir:10 71 TMGTVSYTFVTDLP-KETYGYLK---------PFRRLRWTGYHV 104 (104) T ss_pred cccceeecccchhH-HHHHHhhh---------hhhhhccccccC Confidence 99999999954211 11122322 234555555555 No 86 >protein:vir:105327 Length: 104 # NCBI annotation: putative head morphogenesis protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950671;genbank:gi:119967841;genbank:GeneID:4643206 Probab=21.74 E-value=2.6 Score=18.35 Aligned_cols=97 Identities=19% Similarity=0.271 Sum_probs=57.2 Q ss_pred CCHHHHHHhhh-hhcCC-C----HHHHHHHHHHHHHHhCCcCchh-HHHHHHHHHHHHHHHHhhhhcccccccccceeee Q lcl|NC_013597. 2 PLTEDFLLRYT-EFGKT-D----AKRIGLFLSDAQAEVSKVQWGK-LYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) Q Consensus 2 ~t~~~Fr~~~P-~F~~v-p----d~~i~~~~~~A~~~~~~~~~g~-~~~~~~~l~~AH~l~l~~~~~~~~~~~~g~vtS~ 74 (119) -.+.+.+.+-- ...++ - ...|..+++.|+.+-+ ..|++ -+.-.+..++|-.+.- ++.++++|. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn-~~F~~~~lP~gV~~fvA~~iky---------~~~~NissR 70 (104) T protein:vir:10 1 MNAQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEYCN-QTFNRKSLPSNVEKFIANCIKQ---------GTTSNISSR 70 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcC-CCCCCCCCCccHHHHHHHHHhh---------cCCCCcccc Confidence 23444444311 11111 1 3567889999977655 56876 6778888999987763 235689999 Q ss_pred eecceeeeeccCccCCcchhhhhcCHHHHHHHHHHHHhCCCCcc Q lcl|NC_013597. 
75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMV 118 (119) Q Consensus 75 svG~vSvs~~~~~~~~~~~~w~~~T~YG~~y~~L~~~~g~Gg~v 118 (119) |.|+||.+|.+..- ...-+||+ ..||+.-.|=-| T Consensus 71 SMGtVSyTy~t~iP-~~i~~~L~---------PYRklr~~~~~~ 104 (104) T protein:vir:10 71 TMGTVSYTFVTDLP-KETYGYLK---------PFRRLRWTGYHV 104 (104) T ss_pred cccceeecccchhH-HHHHHhhh---------hhhhhccccccC Confidence 99999999954211 11122322 234555555555 No 87 >protein:vir:4702 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061634;genbank:gi:9635721;genbank:GeneID:1263015 Probab=20.18 E-value=2.8 Score=18.12 Aligned_cols=94 Identities=15% Similarity=0.134 Sum_probs=42.3 Q ss_pred CCC----HHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCc------C---c---hhHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_013597. 1 MPL----TEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKV------Q---W---GKLYDRGVMALTAHLLKLSADAEISG 64 (119) Q Consensus 1 m~t----~~~Fr~~~P~F~~vpd~~i~~~~~~A~~~~~~~------~---~---g~~~~~~~~l~~AH~l~l~~~~~~~~ 64 (119) |.. ++++|.-.===.+..|+.|+.++.-|+.+|... + + -...+.++.++++|+-.-+......+ T Consensus 1 M~vt~~dLeeiK~~LRID~d~DD~li~~~i~AA~~~I~~ai~~~~~~~~~~~~~~~~~~~AvllLv~~~YeNR~a~~~~~ 80 (113) T protein:vir:47 1 MQLTAEELKLLKKHCKIDHNSEDDLLEIYYSWAFHEIASAVTDEPSKYIDWFKSHPLFARAIYPLASYYFENRIAYLDRD 80 (113) T ss_pred CcccHHHHHHHHHHhCCCCCcchHHHHHHHHHHHHHHHhhccccccccccccCCchHHHHHHHHHHHHHHhhhhhccccc Confidence 553 333444221112358999999999999988321 1 1 13678899999999987765322111 Q ss_pred c-ccccceeeeeecceeeeec--cCccCCcchhhhhcC Q lcl|NC_013597. 65 G-AANRNLASESAGELSVSYT--APISANGSDDFYQLT 99 (119) Q Consensus 65 ~-~~~g~vtS~svG~vSvs~~--~~~~~~~~~~w~~~T 99 (119) - ..+-.|.| =+....-.|. .......+ .+| T Consensus 81 ~~~vp~~v~s-li~qlR~~y~~~~~~~~~~~----~~~ 113 (113) T protein:vir:47 81 LSLAPHMVLS-TVHKLRGSFEQFLESENDEE----SGT 113 (113) T ss_pred cccccHHHHH-HHHHHHHHHHHHhhhcCCCC----CCC Confidence 0 01100000 0000000010 00000000 111 Done!