Query lcl|NC_019935.1_cdsid_YP_007238180.1 [gene=G182_gp25] [protein=putative structural protein] [protein_id=YP_007238180.1] [location=15847..16314] Match_columns 155 No_of_seqs 92 out of 105 Neff 6.0 Searched_HMMs 1612 Date Thu Nov 7 17:08:18 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_25 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_25_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:96108 Length: 155 100.0 4.6E-72 2.8E-75 411.7 14.9 155 1-155 1-155 (155) 2 protein:vir:99570 Length: 153 100.0 5.6E-67 3.5E-70 383.8 14.6 151 1-155 3-153 (153) 3 protein:vir:107756 Length: 147 100.0 1.2E-63 7.3E-67 365.6 14.2 145 1-155 2-146 (147) 4 protein:vir:106739 Length: 158 100.0 8.7E-61 5.4E-64 349.8 13.7 145 1-155 7-152 (158) 5 protein:vir:78595 Length: 158 100.0 8.7E-61 5.4E-64 349.8 13.7 145 1-155 7-152 (158) 6 protein:vir:94064 Length: 167 100.0 2.2E-60 1.4E-63 347.7 13.5 149 1-155 3-155 (167) 7 protein:vir:3639 Length: 158 # 100.0 6.2E-60 3.8E-63 345.2 13.4 145 1-155 7-152 (158) 8 protein:vir:101559 Length: 158 100.0 6.2E-60 3.8E-63 345.2 13.4 145 1-155 7-152 (158) 9 protein:vir:5256 Length: 119 # 100.0 8.4E-46 5.2E-49 267.7 12.0 119 3-138 1-119 (119) 10 protein:vir:79640 Length: 134 100.0 7E-41 4.3E-44 240.7 9.3 129 4-153 1-134 (134) 11 protein:vir:107702 Length: 136 100.0 1.8E-37 1.1E-40 222.0 10.5 128 4-146 1-136 (136) 12 protein:vir:104344 Length: 132 99.9 3.5E-31 2.2E-34 187.6 8.3 128 4-152 1-132 (132) 13 protein:vir:103283 Length: 125 99.9 2E-28 1.2E-31 172.5 7.9 124 9-148 1-125 (125) 14 protein:vir:80036 Length: 111 98.8 1.1E-11 7.1E-15 80.6 6.4 111 2-142 1-111 (111) 15 protein:vir:80967 Length: 131 95.0 0.00088 5.5E-07 37.3 9.7 122 1-144 1-131 (131) 16 protein:vir:43 Length: 131 # N 94.4 0.0021 1.3E-06 35.2 10.4 122 1-144 1-131 (131) 17 protein:vir:98900 Length: 132 94.1 0.0024 1.5E-06 35.0 10.0 123 1-144 1-132 (132) 18 protein:vir:94955 Length: 170 92.9 0.0077 4.8E-06 32.2 10.7 129 1-152 16-170 (170) 19 protein:vir:102961 Length: 131 92.3 0.0028 1.7E-06 34.6 7.6 115 5-131 1-131 (131) 20 protein:vir:95071 Length: 104 88.1 0.012 7.5E-06 31.1 7.3 101 4-138 1-104 (104) 21 protein:vir:1241 Length: 104 # 87.9 0.013 7.9E-06 31.0 7.3 101 4-138 1-104 (104) 22 protein:vir:93740 Length: 104 87.9 0.013 7.8E-06 31.0 7.3 101 4-138 1-104 (104) 23 protein:vir:94492 Length: 104 87.9 0.013 7.8E-06 31.0 7.3 101 4-138 1-104 (104) 24 protein:vir:97430 Length: 104 87.9 0.013 7.8E-06 31.0 7.3 101 4-138 1-104 (104) 25 protein:vir:95176 Length: 172 87.3 0.04 2.5E-05 28.3 11.6 130 1-149 6-172 (172) 26 protein:vir:97329 Length: 104 86.9 0.013 8.1E-06 30.9 6.8 101 4-138 1-104 (104) 27 protein:vir:96281 Length: 104 86.9 0.013 8.1E-06 30.9 6.8 101 4-138 1-104 (104) 28 protein:vir:95891 Length: 104 86.9 0.013 8.1E-06 30.9 6.8 101 4-138 1-104 (104) 29 protein:vir:94798 Length: 104 86.8 0.013 8.3E-06 30.9 6.8 101 4-138 1-104 (104) 30 protein:vir:107119 Length: 104 86.7 0.014 8.7E-06 30.7 6.8 101 4-138 1-104 (104) 31 protein:vir:105327 Length: 104 86.7 0.014 8.7E-06 30.7 6.8 101 4-138 1-104 (104) 32 protein:vir:80389 Length: 172 85.9 0.05 3.1E-05 27.7 11.2 131 1-149 4-172 (172) 33 protein:vir:96128 Length: 98 # 83.6 0.0081 5E-06 32.1 4.0 95 4-125 1-98 (98) 34 protein:vir:5976 Length: 102 # 80.2 0.018 1.1E-05 30.2 4.6 100 4-148 1-102 (102) 35 protein:vir:96831 Length: 98 # 73.7 0.052 3.2E-05 27.6 5.2 95 4-125 1-98 (98) 36 protein:vir:79701 Length: 144 69.9 0.22 0.00013 24.2 8.8 126 1-144 1-144 (144) 37 protein:vir:79050 Length: 133 61.6 0.32 0.0002 23.3 7.0 123 4-132 1-133 (133) 38 protein:vir:4788 Length: 130 # 56.4 0.46 0.00028 22.4 9.4 123 1-144 1-130 (130) 39 protein:vir:78383 Length: 169 50.6 0.6 0.00038 21.8 11.8 128 1-149 4-169 (169) 40 protein:vir:4702 Length: 113 # 36.1 1.2 0.00074 20.2 7.2 102 1-118 3-113 (113) 41 protein:vir:95004 Length: 169 35.3 1.2 0.00077 20.1 11.0 128 1-149 4-169 (169) 42 protein:vir:105899 Length: 116 33.4 1 0.00064 20.5 5.0 109 4-132 1-116 (116) 43 protein:vir:94126 Length: 116 33.4 1 0.00064 20.5 5.0 109 4-132 1-116 (116) 44 protein:vir:81159 Length: 95 # 29.1 1.3 0.00081 19.9 4.8 94 1-119 1-95 (95) 45 protein:vir:7857 Length: 188 # 28.0 1.4 0.00086 19.8 4.7 70 1-137 119-188 (188) 46 protein:vir:101652 Length: 188 28.0 1.4 0.00086 19.8 4.7 70 1-137 119-188 (188) 47 protein:vir:7410 Length: 107 # 27.7 1.8 0.0011 19.2 7.7 96 2-119 1-107 (107) 48 protein:vir:102158 Length: 99 25.4 2.1 0.0013 18.9 6.8 98 2-118 1-99 (99) 49 protein:vir:105776 Length: 133 24.9 2.1 0.0013 18.8 8.2 119 3-155 1-129 (133) 50 protein:vir:4602 Length: 110 # 23.7 2.2 0.0014 18.7 5.0 102 1-115 3-110 (110) 51 protein:vir:9928 Length: 118 # 22.4 2.4 0.0015 18.4 6.6 115 4-150 1-118 (118) 52 protein:vir:4954 Length: 104 # 22.3 2.5 0.0015 18.4 5.1 99 2-111 1-104 (104) 53 protein:vir:1384 Length: 92 # 20.7 2.7 0.0017 18.2 7.4 92 4-113 1-92 (92) No 1 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=100.00 E-value=4.6e-72 Score=411.69 Aligned_cols=155 Identities=97% Similarity=1.601 Sum_probs=152.6 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |||||+++||++||||+|+++|||++|+.||++|+++|+++++.|++++|++++++++|||||+|+|..+..+|++++++ T Consensus 1 ~v~fd~~~FR~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~s~~~~g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~ 80 (155) T protein:vir:96 1 MVIFDEQKFRTLFPEFADPASYPAVRLQLYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) T ss_pred CcccCHHHHHHhCccccCcccCCHHHHHHHHHHHHHhhcCCCccccccChHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) +.+++++|+|+|||||+||||||+|++++++++||||||||||||+|++++|+|++++||+|||+|||||||||| T Consensus 81 ~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~~~~~~Gg~~vgG~per~~~r~vgg~f~ 155 (155) T protein:vir:96 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) T ss_pred ccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHHHHHHhcccccccCCCCccccccccCcccC Confidence 888999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=100.00 E-value=5.6e-67 Score=383.80 Aligned_cols=151 Identities=54% Similarity=0.965 Sum_probs=142.1 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |||||+++||++||||+|+++|||++|+.||++|++++++++|.|++.+|+.++++++|||||+|+|+.+...++ . T Consensus 3 ~~~fd~~~Fr~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~~~~~~g~~~~~~l~Ll~AH~l~L~~~~~~~~----~ 78 (153) T protein:vir:99 3 DPVYNDGLFRIMYPEFADQEKYPPEVIEIYYDTATLFITGSMFPCAALSGKQLVGALNMLTAHLMSLSMQRSQTA----L 78 (153) T ss_pred cccCChHHHHHhcccccCccccCHHHHHHHHHHHHHhhcCccccccccChHHHHHHHHHHHHHHHHHHhhhhccc----c Confidence 999999999999999999999999999999999999999999999999999999999999999999976643332 2 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) ..++.++|+|+|||||+||||||+|++.+++++||||||||||||+|++++|+||+|+||+|||+|||||||||= T Consensus 79 ~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fw~l~~~~~~Gg~v~gg~pe~~~~r~vgg~f~ 153 (153) T protein:vir:99 79 GATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPYGQALWALLKMLSVGGFAIGGLPERTGFRKVGGVFL 153 (153) T ss_pred cCCCccccceeeeeecceeeeeecCCCCCchhHhhhcCHHHHHHHHHHHHhcccccccCCCCccccccccCcccC Confidence 234678999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=100.00 E-value=1.2e-63 Score=365.58 Aligned_cols=145 Identities=47% Similarity=0.826 Sum_probs=136.9 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) =||||+++||++||||+|+++|||++|+.||++|+++|++++| +++++|++++++++|||||+|+|..+..++ T Consensus 2 ~v~fd~~~Fr~~fPeFad~~~~pd~~i~~~l~~A~~~l~~~~~-~~~~~g~~~~~~l~Ll~AHll~l~~~~~~g------ 74 (147) T protein:vir:10 2 DHTLDITKFRALFPEFNNDVKYPDALLEQWYAVAGEYLGLTDY-ACGLNGNTLDLALMQLTAHLMKSATILSSN------ 74 (147) T ss_pred ceecCHHHHHHhcccccCCccCCHHHHHHHHHHHHHhhccccC-CcccChhhHHHHHHHHHHHHHHHHHhhccC------ Confidence 6799999999999999999999999999999999999999998 568999999999999999999998765432 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) ++++|+|+|||||+||||||++++.+++++||||||||||||+|++++|+|++|+||+|||+|||||||||= T Consensus 75 ---~g~~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~~l~~~~~~Gg~vvgG~p~r~a~r~vgg~f~ 146 (147) T protein:vir:10 75 ---KGAPMVMTSATIDKVSISTLAPPIKNGWQYWLSTTPYGQMLWALLSMRSSGGFVYGGSPELSGYRRIGGVFK 146 (147) T ss_pred ---CCcccceeeeeecceeeeeecCCCCCcchhhhhcCHHHHHHHHHHHhhCccceecCCCCccccccccCceeC Confidence 246799999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=100.00 E-value=8.7e-61 Score=349.85 Aligned_cols=145 Identities=24% Similarity=0.312 Sum_probs=135.5 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |||||+++||++||||+| +||++|+.||++|++++++++++|++.++++++++|+|||||+|+|+.+...++ T Consensus 7 ~v~Fd~a~FR~~fPeFa~---~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~~a----- 78 (158) T protein:vir:10 7 RITFDPAGFIAEYPEFAT---VATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFGATPTSA----- 78 (158) T ss_pred eEEcChHHHHHhchhhcc---CCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHHHHhHhhhccc----- Confidence 999999999999999987 899999999999999999999999999999999999999999999987765543 Q ss_pred ccccccccceeeeeecceEEEeecCC-CCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPP-AKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~-~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) ++.++|+|+|||||+||||||+++ +.+++++||||||||||||+|++++|+|+|++||+|||.+||+|||.|= T Consensus 79 --~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~gg~pe~~~~r~~g~~~~ 152 (158) T protein:vir:10 79 --NSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMVSGGSGIGTARAYGQPTI 152 (158) T ss_pred --cCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccccccccCCcccceeecCccee Confidence 234689999999999999999866 6678899999999999999999999999999999999999999999999 No 5 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=100.00 E-value=8.7e-61 Score=349.85 Aligned_cols=145 Identities=24% Similarity=0.312 Sum_probs=135.5 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |||||+++||++||||+| +||++|+.||++|++++++++++|++.++++++++|+|||||+|+|+.+...++ T Consensus 7 ~v~Fd~a~FR~~fPeFa~---~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~~a----- 78 (158) T protein:vir:78 7 RITFDPAGFIAEYPEFAT---VATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFGATPTSA----- 78 (158) T ss_pred eEEcChHHHHHhchhhcc---CCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHHHHhHhhhccc----- Confidence 999999999999999987 899999999999999999999999999999999999999999999987765543 Q ss_pred ccccccccceeeeeecceEEEeecCC-CCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPP-AKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~-~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) ++.++|+|+|||||+||||||+++ +.+++++||||||||||||+|++++|+|+|++||+|||.+||+|||.|= T Consensus 79 --~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~gg~pe~~~~r~~g~~~~ 152 (158) T protein:vir:78 79 --NSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMVSGGSGIGTARAYGQPTI 152 (158) T ss_pred --cCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccccccccCCcccceeecCccee Confidence 234689999999999999999866 6678899999999999999999999999999999999999999999999 No 6 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=100.00 E-value=2.2e-60 Score=347.66 Aligned_cols=149 Identities=26% Similarity=0.382 Sum_probs=136.7 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) +||||+++||++||||+| +||++|+.||++|++++.++++++.+.+++.++++|+|||||+|+|+.+....++. T Consensus 3 ~~~Fd~~~FR~~fPeFa~---~Pd~~i~~~l~~A~~~~l~~~~~s~~~~~~~~~~~l~LltAHll~L~~~~~a~~~~--- 76 (167) T protein:vir:94 3 VVVFDPTAFKLVYPEFVA---VPDARLTALFNTVGYTILDNTDASVIVDPLRRAPLLDLLVAHMLALFGYVNADGSI--- 76 (167) T ss_pred cccCChHHHHHhchhccc---CCHHHHHHHHHHHHHhhcCCCCcccccchhhHHHHHHHHHHHHHHHhhhhhhhccc--- Confidence 999999999999999987 89999999999999887777777899999999999999999999997665433222 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCc----ccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLP----ERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p----~r~~~r~vgg~~~ 155 (155) .+++.++|+|+|||||+||||||++++.+++++||+|||||||||+|++++|+|+|++|||| ||++||||||||= T Consensus 77 ~~~~g~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fwaL~~~~g~Gg~v~gG~~~~~~~~~~~r~vgg~f~ 155 (167) T protein:vir:94 77 TPGTGTVGRVANASEGSVSTSLAYSTPTGAGEAWFTQTPYGAMYWAMSAPFRSFHYVAAGLSGVGYSQDYLSTYAGVET 155 (167) T ss_pred ccccccchheeeccccceeeeeecCCCCCchhhhhhcCHHHHHHHHHHHHhcccccccCCCCCCCCCccccCccceeEe Confidence 23456789999999999999999999999999999999999999999999999999999999 9999999999999 No 7 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=100.00 E-value=6.2e-60 Score=345.18 Aligned_cols=145 Identities=26% Similarity=0.341 Sum_probs=133.8 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |||||+++||++||||+| +||++|+.||++|++++.+++++|.+.+++.++++|+|||||+|+|+.+...++ T Consensus 7 ~v~Fd~a~FR~~fPeFa~---~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~g~----- 78 (158) T protein:vir:36 7 RITFDPAGFIAEYPEFAT---VPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFSAAPTSA----- 78 (158) T ss_pred eEEcChHHHHHhCccccc---CCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHHHHhhhhhccc----- Confidence 999999999999999986 999999999999999877777779999999999999999999999987765543 Q ss_pred ccccccccceeeeeecceEEEeecCC-CCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPP-AKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~-~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) ++.++|+|+||||||||||||+++ +.+++++||||||||||||+|++++|+|+|++||+|||.+||+|||.|= T Consensus 79 --~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~Gg~pe~~~~r~~g~~~~ 152 (158) T protein:vir:36 79 --NSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMVSGGSGIGTARAYGQPTI 152 (158) T ss_pred --ccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccccccccCCcccceeccCceee Confidence 245679999999999999999864 6778899999999999999999999999999999999999999999999 No 8 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=100.00 E-value=6.2e-60 Score=345.18 Aligned_cols=145 Identities=26% Similarity=0.341 Sum_probs=133.8 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |||||+++||++||||+| +||++|+.||++|++++.+++++|.+.+++.++++|+|||||+|+|+.+...++ T Consensus 7 ~v~Fd~a~FR~~fPeFa~---~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~g~----- 78 (158) T protein:vir:10 7 RITFDPAGFIAEYPEFAT---VPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFSAAPTSA----- 78 (158) T ss_pred eEEcChHHHHHhCccccc---CCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHHHHhhhhhccc----- Confidence 999999999999999986 999999999999999877777779999999999999999999999987765543 Q ss_pred ccccccccceeeeeecceEEEeecCC-CCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcccC Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPP-AKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGTFW 155 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~-~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~~~ 155 (155) ++.++|+|+||||||||||||+++ +.+++++||||||||||||+|++++|+|+|++||+|||.+||+|||.|= T Consensus 79 --~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~Gg~pe~~~~r~~g~~~~ 152 (158) T protein:vir:10 79 --NSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMVSGGSGIGTARAYGQPTI 152 (158) T ss_pred --ccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccccccccCCcccceeccCceee Confidence 245679999999999999999864 6778899999999999999999999999999999999999999999999 No 9 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=100.00 E-value=8.4e-46 Score=267.72 Aligned_cols=119 Identities=29% Similarity=0.351 Sum_probs=110.1 Q ss_pred ecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_019935. 3 IFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGVTA 82 (155) Q Consensus 3 ~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~ 82 (155) --++++||++||||++ +||++|+.||++|++++++++| |++++++++|||||+|+|......+ T Consensus 1 m~t~~~Fr~~~PeF~~---~pd~~i~~~l~~A~~~l~~~~~------g~~~~~~~~L~~AH~l~l~~~~~~~-------- 63 (119) T protein:vir:52 1 MPLTEDFLLRYTEFGK---TDAKRIGLFLSDAQAEVSKVQW------GKLYDRGVMALTAHLLKLSADAEIS-------- 63 (119) T ss_pred CCcHHHHHHhhhhccC---CCHHHHHHHHHHHHHhhCCcCC------chHHHHHHHHHHHHHHHhhhhhhcc-------- Confidence 5568999999999986 8999999999999999999999 8999999999999999998665432 Q ss_pred ccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 83 GGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 83 ~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ++..+|+|+|||+|+||||||++++.+++++||+|||||||||+|+|++|+|++|. T Consensus 64 ~g~~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g~Gg~Va 119 (119) T protein:vir:52 64 GGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) T ss_pred ccccccceeeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhcCCCcCC Confidence 24568999999999999999999999999999999999999999999999999988 No 10 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=100.00 E-value=7e-41 Score=240.74 Aligned_cols=129 Identities=19% Similarity=0.215 Sum_probs=109.4 Q ss_pred cC----HHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_019935. 4 FD----EHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGG 79 (155) Q Consensus 4 fd----~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~ 79 (155) |+ +|.||++||||+| +||++|+.|+++|+++|++++| ||.++++++|||||+|.|++.....+. T Consensus 1 m~d~~~ve~Fr~l~PeF~~---vpde~l~~~~~~A~~~i~~~~~------g~~~~~al~lltAHLl~l~~~~~~~g~--- 68 (134) T protein:vir:79 1 MNDIEILEQIYKIAPAFKK---VDPELIQAWIELAKDFVCEKHF------KDKYFRAVALYTLHLMTLDGAMKQESE--- 68 (134) T ss_pred CchHHHHHHHHHhcccccc---CCHHHHHHHHHHhhhhhcCCCC------ChHHHHHHHHHHHHHHhhccccccccc--- Confidence 33 6999999999997 7999999999999999999887 899999999999999999754322111 Q ss_pred cccccccccceee-eeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCcc Q lcl|NC_019935. 80 VTAGGTQGGFITS-ATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGGT 153 (155) Q Consensus 80 ~~~~g~~~G~vtS-aS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg~ 153 (155) ++...+|+|+| +++|+|||||+. ++.+++++||++|||||+||+|+|+ ++||-.+|++||.+|++ T Consensus 69 --~~~~~~grv~ssst~G~vSvS~a~-ps~~~~~~Wl~~TpYGq~y~~L~k~------~~GGf~~~t~~~~~~~r 134 (134) T protein:vir:79 69 --SVESYSHRIASFSLTGEFSQTFSK-VSDDTSGNTLRQTPWGKMYEVLNKK------KGGGFGLTTAFHRRCSR 134 (134) T ss_pred --ccccccchhhhhhhhcceeeeccC-cccchhHHHHhcCHHHHHHHHHHHh------hccchHhhhhccccCCC Confidence 12335566666 669999999986 5777899999999999999999996 46788889999999999 No 11 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=100.00 E-value=1.8e-37 Score=221.99 Aligned_cols=128 Identities=21% Similarity=0.187 Sum_probs=103.2 Q ss_pred cC-------HHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_019935. 4 FD-------EHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAA 76 (155) Q Consensus 4 fd-------~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~ 76 (155) |+ +|+||.+||||+| +||++|+.|+++|++++|.++| ||.++++++|||||+|.|++.....+ T Consensus 1 ~~~~~~~~~ve~fR~l~PeF~d---vPde~i~~~~d~A~~~v~~~~~------Gk~y~~al~lltAHLl~l~~~~~~~~- 70 (136) T protein:vir:10 1 MNQETLIAVVEQMRKLVPALRK---VPDETLYAWVEMAELFVCQKTF------KDAYVKALALYALHLAFLDGALKGED- 70 (136) T ss_pred CCchHHHHHHHHHHHhcccccc---CCHHHHHHHHHHHHHhhcCCCC------hhHHHHHHHHHHHHHHhccccccccc- Confidence 43 5779999999987 7999999999999999998887 89999999999999998865533222 Q ss_pred ccccccccccccceee-eeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccccc Q lcl|NC_019935. 77 GGGVTAGGTQGGFITS-ATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRG 146 (155) Q Consensus 77 ~~~~~~~g~~~G~vtS-aS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~ 146 (155) .++++..|+|+| +++|+|||||+. ++++.++.||++||||||||+|+|+++.|.-+++|.-.|-- T Consensus 71 ----~~~~~~s~rv~ssat~GevSVS~a~-~s~~~s~~WL~~TpyGq~y~aL~k~~~gGf~l~t~~~~~c~ 136 (136) T protein:vir:10 71 ----EDLESYSRRVTSFSLSGEFSQTFGE-VTKNQSGDMMLSTPWGKMFEQLKARRRGRFALMTGLRGGCH 136 (136) T ss_pred ----ccccccccceehheeccceeEeecc-ccCchhhHhhhcCHHHHHHHHHHhhcccchhhhhcccccCC Confidence 223344566665 889999999985 57888999999999999999999998877666655533322 No 12 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=99.94 E-value=3.5e-31 Score=187.57 Aligned_cols=128 Identities=23% Similarity=0.245 Sum_probs=110.8 Q ss_pred cC---HHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FD---EHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd---~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) |+ +|.||..||+|+ ++||++|+.|+++|++||+.+.+ ||.++++|.|||||+|.++...+. +. T Consensus 1 ~~~~~~e~~R~l~P~f~---kvpdevI~~wielA~lfVc~~~~------g~~~~~AlaL~taHLm~~dga~k~-----en 66 (132) T protein:vir:10 1 MNDAILAFMRSLVPALK---AVDDESINVWIDLARLYVCADKF------GNDADRAVGLYALHLMLSDGAFKG-----EN 66 (132) T ss_pred CchHHHHHHHHhcchhh---cCChHHHHHHHHHHHHHHHhhcC------chhHHHHHHHHHHHHhhccccccc-----cc Confidence 55 689999999998 59999999999999999998877 899999999999999988665542 23 Q ss_pred ccccccccceeeee-ecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCcccccccccCc Q lcl|NC_019935. 81 TAGGTQGGFITSAT-VGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKVGG 152 (155) Q Consensus 81 ~~~g~~~G~vtSaS-~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~vgg 152 (155) .++...+-||+|.| +|++||||+.++ ++++|+.+||||++|++|.|+ ..||+ |+|+|-+.|.+|= T Consensus 67 ~~~~t~S~rvaS~Sl~Ge~Sisf~~~s---a~~s~L~~tp~Gkl~~~L~k~-~~Ggf---gL~t~~~~~~cgc 132 (132) T protein:vir:10 67 EGLETYSRRMASYSLSGEFSITYDNQS---AIQGDLSSSSWGRMYKALLRK-KGGGF---GLITSAAGGGCGC 132 (132) T ss_pred cchhhhhhhhhhhcccCceeeeccccc---ccccccccCcHHHHHHHHHHh-ccCcc---ccccccCcCCCCC Confidence 34456789999999 799999999755 557799999999999999994 45676 9999999999999 No 13 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=99.91 E-value=2e-28 Score=172.46 Aligned_cols=124 Identities=23% Similarity=0.263 Sum_probs=98.3 Q ss_pred HHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccccccccccc Q lcl|NC_019935. 9 FRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGG 88 (155) Q Consensus 9 Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~~g~~~G 88 (155) .|..||+|++ +|||+|+.|+++|+.|||.+.+ ||.+.++++|||+|+|.|+.+... ...+.....+ T Consensus 1 mR~l~P~f~~---vpdevi~~wid~A~lFVC~~~f------g~~~~~Al~lytlHLm~~dga~k~-----e~~~~~~~s~ 66 (125) T protein:vir:10 1 MRTLYPPLKS---QPDDVLNAWIEVAKLFICLDKF------GDKQVQALAFYTLHLLSQDIALKT-----ENDSSQTSSE 66 (125) T ss_pred Cccccchhhc---cCHHHHHHHHHHHHHHHHHhhh------hhHHHHHHHHHHHHHHhccccccc-----cccccccccc Confidence 8999999986 7999999999999999998876 999999999999999998765433 3334556789 Q ss_pred ceeeee-ecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccccccc Q lcl|NC_019935. 89 FITSAT-VGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFR 148 (155) Q Consensus 89 ~vtSaS-~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r 148 (155) ||+|-| +|++||||+.+ +.+.++.|+.+||||++||+|+|+.+ ||+-+.-..-+..-| T Consensus 67 r~~s~slsGE~Sit~~~~-s~d~s~~~L~~T~wGk~~~~L~k~~~-GgFaL~T~~~~~~cr 125 (125) T protein:vir:10 67 RVKSYSLSGEYTISYDTS-TAAASSSNLEESSWGKLYIDLMRLKV-GRWGLITSGGSRCCR 125 (125) T ss_pred ceeeeeeccceEeecccc-cccccccccccCchHHHHHHHHHhcC-CceeeeccccccCCC Confidence 999977 79999999875 45677889999999999999999554 554331111122222 No 14 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=98.83 E-value=1.1e-11 Score=80.56 Aligned_cols=111 Identities=25% Similarity=0.197 Sum_probs=86.6 Q ss_pred eecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_019935. 2 VIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGVT 81 (155) Q Consensus 2 v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~ 81 (155) .-=|++.-|..-|..+ .+||+.|+.+|+.|...+.+..|+ -+.++++.-+++||+++++. T Consensus 1 m~ttv~~vkl~a~~L~---~~sDDsl~~~I~dA~~e~~a~gFp-----~~~~e~a~rYLa~HLat~~~------------ 60 (111) T protein:vir:80 1 MKTDVSKLKLTASSLA---SVSDDSLQVHIDDSYLEVQEKGFP-----EKFEERANRYLAAHLATLAN------------ 60 (111) T ss_pred CchhHHHHHHhhHhhc---CCChHHHHHHHHHHHHHhhcCCCC-----hhHHHHHHHHHHHHHHHhcC------------ Confidence 2224899999999776 499999999999999999998774 46788999999999999841 Q ss_pred cccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCc Q lcl|NC_019935. 82 AGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLP 142 (155) Q Consensus 82 ~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p 142 (155) -+|+|+.||+.--.|..- .. -.|+..|+|||+||+|.+.++-|+-.-++-- T Consensus 61 ------~~v~sE~V~~Lk~~Y~~~--~~--~~~l~~s~wGq~Y~rL~k~~~~gs~~~~vVv 111 (111) T protein:vir:80 61 ------KNVKSEAVGSLKREYYEV--KG--DSGLLSTEYGQEYARLLKEANGGSGISMVVV 111 (111) T ss_pred ------CCCchhhhhhHHHHhhhc--cc--ccccccchhHHHHHHHHHHhcCCccceeeeC Confidence 137899999988887521 11 2589999999999999999987753221111 No 15 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=95.01 E-value=0.00088 Score=37.33 Aligned_cols=122 Identities=8% Similarity=-0.011 Sum_probs=68.4 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccC---------hhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILN---------GKALEACLYLLTAHLLSLSTMQ 71 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~---------g~~~~~~l~L~tAH~l~L~~~~ 71 (155) |-=.|.+.|+..|.-- .+|++....++..|...|+.-.+...+.. .+.-+.+++..+-++....... T Consensus 1 M~Y~d~~~Y~~~y~G~----~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~g~~~ 76 (131) T protein:vir:80 1 MPYTTLEFYTNEYAGE----HLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQIEYFKEAGGTS 76 (131) T ss_pred CCCCCHHHHHHhhCCC----CCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHHHHHHHhhhhh Confidence 5555788888888532 37899999999999999987655332211 1233355555555444432211 Q ss_pred hccccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccc Q lcl|NC_019935. 72 VQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPER 144 (155) Q Consensus 72 ~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r 144 (155) + ...+-++|.|+|+.||||...+....... ...--+.-..++++- |+...|.+=| T Consensus 77 ---~---------~~~~~~~S~svG~~Svs~~~~~~~~~~~~---~~~~~~~a~~~L~~T---GLlyrGV~~~ 131 (131) T protein:vir:80 77 ---E---------LAVSKPDNVSIGRTSISDSNFASTATSLN---SGLVGSDVRSYLAHT---GLLYNGVGVR 131 (131) T ss_pred ---h---------hcccccCeeeeCceEEeeccccchhhhhh---hhhhHHHHHHHHhcc---CCeecCCCCC Confidence 1 12244789999999999975443322221 111122233333332 4455666655 No 16 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=94.44 E-value=0.0021 Score=35.23 Aligned_cols=122 Identities=8% Similarity=0.002 Sum_probs=69.3 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccC---------hhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILN---------GKALEACLYLLTAHLLSLSTMQ 71 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~---------g~~~~~~l~L~tAH~l~L~~~~ 71 (155) |-=.|.+.|++.|.- ..+|++....++..|...|+.-.+...+.. .+.-+.+++..+-++....... T Consensus 1 M~Y~d~~~Y~~~y~g----~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~g~~s 76 (131) T protein:vir:43 1 MPYTTLEFYNDEYAG----EHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQIEYFKEAGGTS 76 (131) T ss_pred CCCCCHHHHHHhhCC----CCCCHhHHHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHHHHHHHhHHHh Confidence 555578888888842 237999999999999999987655322211 2334455666555555432211 Q ss_pred hccccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccc Q lcl|NC_019935. 72 VQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPER 144 (155) Q Consensus 72 ~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r 144 (155) + ...+.++|.|+|+.||||...+....... ....=+.-..++++ + |+...|.+=| T Consensus 77 ---~---------~~~~~~~S~svG~~Svs~~~~~~~~~~~~---~~~~~~~a~~~L~~--T-GLlyrGV~~~ 131 (131) T protein:vir:43 77 ---E---------LAVSKPDNVSIGRTSISDSNFASTATSLN---SGLIGSDVRSYLAH--T-GLLYNGVGVR 131 (131) T ss_pred ---h---------hhccccCeeecCceEEeecccccchhhhc---hhhhHHHHHHHHhc--c-CCeecCCCCC Confidence 1 12234789999999999976443332221 11112222233332 2 4455666655 No 17 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=94.10 E-value=0.0024 Score=34.97 Aligned_cols=123 Identities=12% Similarity=0.107 Sum_probs=64.0 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCccccc-----Ch----hHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRIL-----NG----KALEACLYLLTAHLLSLSTMQ 71 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~-----~g----~~~~~~l~L~tAH~l~L~~~~ 71 (155) |-=.|.+.|++.+. ..+||+.+..++..|...|+.-.+..... +- +..+.++++-+-++-..... T Consensus 1 M~Y~t~~~Y~~~~G-----~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qiey~~~~G~~- 74 (132) T protein:vir:98 1 MPYLTYEEFMDLNG-----RDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQIEYFDALGAT- 74 (132) T ss_pred CCCCCHHHHHhhcC-----CCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHHHHHHhccch- Confidence 55567788876333 24799999999999999998654432211 11 22344555444433322111 Q ss_pred hccccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccc Q lcl|NC_019935. 72 VQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPER 144 (155) Q Consensus 72 ~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r 144 (155) +.....+.++|.|+|..||||........... +.-..-+.-..++++.| +...|.+.= T Consensus 75 ----------sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~--~~~~~~~~a~~~L~~tG---LLyrGV~~~ 132 (132) T protein:vir:98 75 ----------TFEEINNSPQTFQAGRTSVSNASRYNPSGANE--SKPLVAEDVYIYLQGTG---LLFQGVKTW 132 (132) T ss_pred ----------hhhhccCccceeeeCcEEEEeeccCCcccccc--cccchHHHHHHHHhhcC---CccccCCCC Confidence 01112355899999999999964332221111 11111133444444433 233444422 No 18 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=92.87 E-value=0.0077 Score=32.16 Aligned_cols=129 Identities=12% Similarity=-0.004 Sum_probs=69.5 Q ss_pred CeecC-HHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCC-CCcc---------------cccC---------hhHHH Q lcl|NC_019935. 1 MVIFD-EHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDR-DSPY---------------RILN---------GKALE 54 (155) Q Consensus 1 ~v~fd-~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~-~~~~---------------~~~~---------g~~~~ 54 (155) -|+.+ ..+|.+..+........+|+..+..|-.|..+|+.. +|.. ...+ ++..+ T Consensus 16 Yvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~~~~IP~~V~ 95 (170) T protein:vir:94 16 YVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLSQVSIPVKVK 95 (170) T ss_pred eecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccccchhhHHHH Confidence 22221 333333334221223478999999999999999853 3311 0011 23444 Q ss_pred HHHHHHHHHHHHHHhhhhccccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccc Q lcl|NC_019935. 55 ACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVG 134 (155) Q Consensus 55 ~~l~L~tAH~l~L~~~~~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~G 134 (155) ++-+.++.-++. .... .....+-|+|+++|+++|+|+.++. +++.+. ..++|++.+..+ T Consensus 96 ~Aq~elA~~~~~--~~~~----------~~~~~~~v~~~kVG~i~veY~~~~~--------~~~~~~-~v~~LL~p~l~~ 154 (170) T protein:vir:94 96 IAVFELAYFMLE--SGAA----------LSFADQTIDSVKVGTIRVEFTKNST--------DAGLPT-FVEAMLSGFGSP 154 (170) T ss_pred HHHHHHHHHHHh--Cccc----------CcccccceeeEecceeEEEecCCCC--------CCccHH-HHHHHhhhhhcc Confidence 555444442221 1110 0112244899999999999974433 233444 447888887753 Q ss_pred ccccCCCcccccccccCc Q lcl|NC_019935. 135 GFYIGGLPERRGFRKVGG 152 (155) Q Consensus 135 g~~vgg~p~r~~~r~vgg 152 (155) .-.|+.+ ++.||-|=| T Consensus 155 -~~~g~~~-~~~~~~~r~ 170 (170) T protein:vir:94 155 -VLYGSNA-ARSIDLVRA 170 (170) T ss_pred -ccccccc-cceeeeecC Confidence 2244554 667777777 No 19 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=92.30 E-value=0.0028 Score=34.57 Aligned_cols=115 Identities=13% Similarity=0.053 Sum_probs=62.0 Q ss_pred CHHHHHHh---------ccccCCCCcCCHH-HHHHHHHHHHHhhcCCCCcccccChhH----HHHHHHHHHHHHHHHHhh Q lcl|NC_019935. 5 DEHKFRTL---------FPEFADPAAYPDV-RLQMYFDIACEFISDRDSPYRILNGKA----LEACLYLLTAHLLSLSTM 70 (155) Q Consensus 5 d~~~Fr~~---------fPeFad~~~~pD~-~i~~~~~~A~~~~~~~~~~~~~~~g~~----~~~~l~L~tAH~l~L~~~ 70 (155) -|+..+++ ---+ +.....|+ +|++.++.+...+.+. ++-......+ .+++..++..|.+.--.. T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~-~d~~~kD~~vl~faie~v~~~Ilny-cNikeiP~~Le~v~~~maiDll~~e~~~~~k~ 78 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMR-QDNYFKDMEVLHYALTQAENEILNY-IHQDSVPGRLENVWIDMTNDLLDKVKEQSVLA 78 (131) T ss_pred Chhhhhhhhhhhhhhhhhccc-cccccchHHHHHHHHHHHHHHHhhh-cCCcccchhhHHHHHHHHHHHHhhhccccccc Confidence 34444441 1112 22233454 7899999999976542 2111111222 334444444554321000 Q ss_pred hhccccccccccccccccceeeeeecceEEEeecCCCCCcc-hHh-hhcCHHHHHHHHHHHHh Q lcl|NC_019935. 71 QVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGW-QWW-LSGTPYGQELWALLSVK 131 (155) Q Consensus 71 ~~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~-~~w-~~~T~YG~~y~~l~~~~ 131 (155) ...++..|.|+|.++|+-||||..++..... ... =-.+.|++||-..+|++ T Consensus 79 ----------~~i~~~~g~VsSI~eGDTsIsf~s~t~~~qrl~~~~s~l~~Y~~qL~~yRRL~ 131 (131) T protein:vir:10 79 ----------EKAGADDFSVKSIKMGDTTIEKVSPYEMIQRMKQVPSSLERYKRQLNRFRKLL 131 (131) T ss_pred ----------ccccccccceeeeeecceeeeccCCccHHHHHHHHHHHHhhhHHHHhhhcccC Confidence 0123456789999999999999765432211 011 12578999999888888 No 20 >protein:vir:95071 Length: 104 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240825;genbank:gi:66394717;genbank:GeneID:5133865 Probab=88.06 E-value=0.012 Score=31.10 Aligned_cols=101 Identities=16% Similarity=0.242 Sum_probs=65.1 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++ +| +.+.+..++-+|+|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~-~F-----~~~~lP~gVkkfvAe~iky~------------ 62 (104) T protein:vir:95 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCNQ-KF-----DDKEVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCC-CC-----CCccCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 2234678899999988765 44 24568889999999888621 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++++|-|.|.||.+|.+. .-...-.|| -|| |+++=+++++ T Consensus 63 -----~~~NissRsMgtVSYTy~T~-iP~~i~~~L--~PY--------Rrlrw~~~~~ 104 (104) T protein:vir:95 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--MPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeeechhh-hhHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111222332 233 5566677777 No 21 >protein:vir:1241 Length: 104 # NCBI annotation: similar to phage Spp1 gp15 (product required for head morphogenesis) # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510940;genbank:gi:17426274;genbank:GeneID:927373 Probab=87.93 E-value=0.013 Score=30.98 Aligned_cols=101 Identities=16% Similarity=0.239 Sum_probs=64.9 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++ +| +.+.+..++-+|+|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~-~F-----~~~~lP~gVkkfvAe~iky~------------ 62 (104) T protein:vir:12 1 MDAKDVKMINGLSLNDSSDDEQIEYLIEEYKSVAEDYCNQ-KF-----DDKEVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCCccHHHHHHHHHHHHHHHHHHhCC-CC-----CCccCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 2234677889999988765 44 24568889999999888621 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++++|-|.|.||.+|.+. .-...-.|| -|| |+++=+++++ T Consensus 63 -----~~~NissRsMgtVSYTy~T~-iP~~i~~~L--~PY--------Rrlrw~~~~~ 104 (104) T protein:vir:12 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--LPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeeechhh-hhHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111222332 233 5566677777 No 22 >protein:vir:93740 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240461;genbank:gi:66396159;genbank:GeneID:5133509 Probab=87.92 E-value=0.013 Score=31.01 Aligned_cols=101 Identities=17% Similarity=0.246 Sum_probs=65.4 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++ +| +.+.+..++-+|+|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~-~F-----~~~~lP~gVkkfvAe~iky~------------ 62 (104) T protein:vir:93 1 MDAKDVKMINGLSLNDSSNDEQIDYLIEEYKSVAEDYCNQ-KF-----DDKEVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCC-CC-----CCccCCccHHHHHHHHHhhC------------ Confidence 999999987321122122 2245778899999988765 44 24568889999999888621 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++++|-|.|.||.+|.+. .-...-.|| -|| |+++=+++++ T Consensus 63 -----~~~NissRsMgtVSYTy~T~-iP~~i~~~L--~PY--------Rrlrw~~~~~ 104 (104) T protein:vir:93 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--LPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeeechhh-hhHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111222332 233 5566677777 No 23 >protein:vir:94492 Length: 104 # NCBI annotation: ORF049 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240678;genbank:gi:66396380;genbank:GeneID:5133756 Probab=87.92 E-value=0.013 Score=31.01 Aligned_cols=101 Identities=16% Similarity=0.238 Sum_probs=65.0 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++ +| +.+.+..++-+|+|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~-~F-----~~~~lP~gVkkfvAe~iky~------------ 62 (104) T protein:vir:94 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDYCNQ-KF-----DDKEVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCC-CC-----CCccCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 2234678899999988765 44 24568889999999888621 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++++|-|.|.||.+|.+. .-...-.|| -|| |+++=+++++ T Consensus 63 -----~~~NissRsMgtVSYTy~T~-iP~~i~~~L--~PY--------Rrlrw~~~~~ 104 (104) T protein:vir:94 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--LPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeeechhh-hhHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111222332 233 5566677777 No 24 >protein:vir:97430 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240751;genbank:gi:66396455;genbank:GeneID:5133786 Probab=87.92 E-value=0.013 Score=31.01 Aligned_cols=101 Identities=16% Similarity=0.238 Sum_probs=65.0 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++ +| +.+.+..++-+|+|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~-~F-----~~~~lP~gVkkfvAe~iky~------------ 62 (104) T protein:vir:97 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDYCNQ-KF-----DDKEVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCC-CC-----CCccCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 2234678899999988765 44 24568889999999888621 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++++|-|.|.||.+|.+. .-...-.|| -|| |+++=+++++ T Consensus 63 -----~~~NissRsMgtVSYTy~T~-iP~~i~~~L--~PY--------Rrlrw~~~~~ 104 (104) T protein:vir:97 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--LPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeeechhh-hhHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111222332 233 5566677777 No 25 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=87.27 E-value=0.04 Score=28.26 Aligned_cols=130 Identities=16% Similarity=0.095 Sum_probs=67.0 Q ss_pred Ce-------e----cCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcC--CCCccc------c---------cC--- Q lcl|NC_019935. 1 MV-------I----FDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISD--RDSPYR------I---------LN--- 49 (155) Q Consensus 1 ~v-------~----fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~--~~~~~~------~---------~~--- 49 (155) +| . .|++++++-+-+.......+|+..+..|-.|..+|+. -+|... . .+ T Consensus 6 ive~~~g~~~anSYvtv~ea~aY~~~rg~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~ 85 (172) T protein:vir:95 6 VVEDGSGVTNANSYVSVADARIYASNRGVELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDE 85 (172) T ss_pred EEeCCCCCCcccccccHHHHHHHHHhcCCcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCccc Confidence 22 1 2356666544333333345788999999999999985 233210 0 01 Q ss_pred ------hhHHHHHHHHHHHHHHHHHhhhhccccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHH Q lcl|NC_019935. 50 ------GKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQE 123 (155) Q Consensus 50 ------g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~ 123 (155) ++..+++-+.++.-++ . +. ...+.....+.|+|+++|+|||+|+.+.... +++.|- . T Consensus 86 v~~~~IP~~V~~A~~elA~~~~--~-----~~---~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~------~~~~~~-~ 148 (172) T protein:vir:95 86 VPSNVIPKSLIAAQVQLTMAIN--A-----GF---DLQPNVSPQDYVTREKVGPIETEYADPLSVG------IMPTFT-A 148 (172) T ss_pred ccccchhHHHHHHHHHHHHHHH--c-----Cc---cccccCCcccceeEEeccceEEeeccCCCCC------CcccHH-H Confidence 2334445444443111 1 10 0111122346689999999999997654332 234442 3 Q ss_pred HHHHHHHhcccccccCCCcccccccc Q lcl|NC_019935. 124 LWALLSVKAVGGFYIGGLPERRGFRK 149 (155) Q Consensus 124 y~~l~~~~g~Gg~~vgg~p~r~~~r~ 149 (155) --+|++.+..++ -|+.-.=+.||- T Consensus 149 v~~LL~p~l~~~--~~~~~~~r~~r~ 172 (172) T protein:vir:95 149 ANALLAPLFGEC--ASNKFALRTIRV 172 (172) T ss_pred HHHHHhhhhccc--CCcceeeEEEeC Confidence 345666664433 233333455665 No 26 >protein:vir:97329 Length: 104 # NCBI annotation: ORF048 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240613;genbank:gi:66396311;genbank:GeneID:5133685 Probab=86.92 E-value=0.013 Score=30.92 Aligned_cols=101 Identities=16% Similarity=0.221 Sum_probs=64.5 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++. |. -+-+...+-+++|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~-F~-----~~~lP~gV~~fvA~~iky~------------ 62 (104) T protein:vir:97 1 MDTKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCNQK-FD-----DKAVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCC-CC-----CCCCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 22346788999999887653 41 2367778889999888631 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++|+|-|.|+||.+|.+. .-...-.|| -| -|+++-+++++ T Consensus 63 -----~~~NissRSMGtVSYty~t~-iP~~i~~~L--kP--------YRklr~~~~~~ 104 (104) T protein:vir:97 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--MP--------YRKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeecccch-hHHHHHHhh--hh--------hhhhcccccCC Confidence 23568999999999999652 111122232 23 36667778888 No 27 >protein:vir:96281 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240313;genbank:gi:66396008;genbank:GeneID:5133358 Probab=86.90 E-value=0.013 Score=30.92 Aligned_cols=101 Identities=17% Similarity=0.238 Sum_probs=64.5 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=..+|..+++.|+.+-++. |. -+-+...+-+++|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~-F~-----~~~lP~gV~~fvA~~iky~------------ 62 (104) T protein:vir:96 1 MDAKDVKMINGLSLNDSSNDEQIKYLIEEYKSVAEDYCNQK-FD-----DKAVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCC-CC-----CCCCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 22347788999999887653 41 2367778889999888631 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++|+|-|.|+||.+|.+. .-...-.|| -|| |+++-+++++ T Consensus 63 -----~~~NissRSMGtVSYTy~t~-iP~~i~~~L--kPY--------Rklr~~~~~~ 104 (104) T protein:vir:96 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--LPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeecccch-hHHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111122332 233 6667778888 No 28 >protein:vir:95891 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240387;genbank:gi:66396087;genbank:GeneID:5133402 Probab=86.90 E-value=0.013 Score=30.92 Aligned_cols=101 Identities=17% Similarity=0.238 Sum_probs=64.5 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=..+|..+++.|+.+-++. |. -+-+...+-+++|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~-F~-----~~~lP~gV~~fvA~~iky~------------ 62 (104) T protein:vir:95 1 MDAKDVKMINGLSLNDSSNDEQIKYLIEEYKSVAEDYCNQK-FD-----DKAVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCC-CC-----CCCCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 22347788999999887653 41 2367778889999888631 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++|+|-|.|+||.+|.+. .-...-.|| -|| |+++-+++++ T Consensus 63 -----~~~NissRSMGtVSYTy~t~-iP~~i~~~L--kPY--------Rklr~~~~~~ 104 (104) T protein:vir:95 63 -----TTGNISARTMGTVSYTYVTD-IPSSAYAYL--LPY--------RKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeecccch-hHHHHHHhh--hhh--------hhhcccccCC Confidence 23568999999999999652 111122332 233 6667778888 No 29 >protein:vir:94798 Length: 104 # NCBI annotation: ORF043 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240538;genbank:gi:66396233;genbank:GeneID:5133578 Probab=86.83 E-value=0.013 Score=30.87 Aligned_cols=101 Identities=16% Similarity=0.225 Sum_probs=64.4 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++. |. -+-+...+-+++|-.+.-. T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~-F~-----~~~lP~gVk~fvA~~iky~------------ 62 (104) T protein:vir:94 1 MDTKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCNQK-FD-----DKAVPSGVKKFIAECIKFG------------ 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCC-CC-----CCCCCccHHHHHHHHHhhC------------ Confidence 999999987321122222 22346788999999887653 41 2367778889999888631 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++|+|-|.|+||.+|.+. .-...-.|| -| -|+++=+++++ T Consensus 63 -----~~~NissRSMGtVSYTy~T~-iP~~i~~~L--kP--------YRklr~~~~~~ 104 (104) T protein:vir:94 63 -----TTGNISARTMGTVSYTYITD-IPSSAYAYL--MP--------YRKLSWGKRYV 104 (104) T ss_pred -----CCCCcccccccceeecccch-hHHHHHHhh--hh--------hhhhcccccCC Confidence 23568999999999999652 111122232 23 36667778888 No 30 >protein:vir:107119 Length: 104 # NCBI annotation: conserved phage protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950608;genbank:gi:119953688;genbank:GeneID:4643128 Probab=86.68 E-value=0.014 Score=30.74 Aligned_cols=101 Identities=16% Similarity=0.172 Sum_probs=64.5 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++. |. -+-+...+-+++|-.+.- + T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~-F~-----~~~lP~gV~~fvA~~iky---~--------- 62 (104) T protein:vir:10 1 MNAQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEYCNQT-FN-----RKSLPSNVEKFIANCIKQ---G--------- 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCC-CC-----CCCCCccHHHHHHHHHhh---c--------- Confidence 999999987321122222 22346788899999887653 41 236777888999988862 1 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++|+|-|.|+||.+|.+. .-...-.|| .=-|+++-+++++ T Consensus 63 -----~~~NissRSMGtVSyTy~t~-iP~~i~~~L----------~PYRklr~~~~~~ 104 (104) T protein:vir:10 63 -----TTSNISSRTMGTVSYTFVTD-LPKETYGYL----------KPFRRLRWTGYHV 104 (104) T ss_pred -----CCCCcccccccceeecccch-hHHHHHHhh----------hhhhhhccccccC Confidence 23568999999999999652 111122232 2336667778888 No 31 >protein:vir:105327 Length: 104 # NCBI annotation: putative head morphogenesis protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950671;genbank:gi:119967841;genbank:GeneID:4643206 Probab=86.68 E-value=0.014 Score=30.74 Aligned_cols=101 Identities=16% Similarity=0.172 Sum_probs=64.5 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++. |. -+-+...+-+++|-.+.- + T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~-F~-----~~~lP~gV~~fvA~~iky---~--------- 62 (104) T protein:vir:10 1 MNAQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEYCNQT-FN-----RKSLPSNVEKFIANCIKQ---G--------- 62 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCC-CC-----CCCCCccHHHHHHHHHhh---c--------- Confidence 999999987321122222 22346788899999887653 41 236777888999988862 1 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYI 138 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~v 138 (155) ..++|+|-|.|+||.+|.+. .-...-.|| .=-|+++-+++++ T Consensus 63 -----~~~NissRSMGtVSyTy~t~-iP~~i~~~L----------~PYRklr~~~~~~ 104 (104) T protein:vir:10 63 -----TTSNISSRTMGTVSYTFVTD-LPKETYGYL----------KPFRRLRWTGYHV 104 (104) T ss_pred -----CCCCcccccccceeecccch-hHHHHHHhh----------hhhhhhccccccC Confidence 23568999999999999652 111122232 2336667778888 No 32 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=85.87 E-value=0.05 Score=27.73 Aligned_cols=131 Identities=14% Similarity=0.016 Sum_probs=67.9 Q ss_pred Ce-------e----cCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCC--CCcccc---------------cC--- Q lcl|NC_019935. 1 MV-------I----FDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDR--DSPYRI---------------LN--- 49 (155) Q Consensus 1 ~v-------~----fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~--~~~~~~---------------~~--- 49 (155) +| . -|++++.+-+..... .+|++..+..|-.|..+|+.- +|...- .+ T Consensus 4 ived~~g~~~anSYvt~~~a~aY~~~rg~--~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~ 81 (172) T protein:vir:80 4 IVEDGTGKPDANTYAGADFVIAYAQARGV--TVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFV 81 (172) T ss_pred EeeCCCCCccccccccHHHHHHHHHHcCC--CcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCccc Confidence 12 1 135677665555443 478889999999999999862 342110 01 Q ss_pred ------hhHHHHHHHHHHHHHHHHHhhhhccccccccccccccccceeeeeecceEEEeecCCCCCcchH-hhhcCHHHH Q lcl|NC_019935. 50 ------GKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQW-WLSGTPYGQ 122 (155) Q Consensus 50 ------g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~-w~~~T~YG~ 122 (155) ++..+++-+.++.-++ + +.. .........|+|+++|+++++|+.+........ -=.++.|- T Consensus 82 ~~~~~IP~~v~~A~~elA~~~~--~-----g~~----~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~- 149 (172) T protein:vir:80 82 IPSDVIPKELQSAVAAAVIEQV--N-----GFE----LQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFP- 149 (172) T ss_pred ccccchhHHHHHHHHHHHHHHh--c-----CCc----cCcCCCCceeeEEeccceEEeeecccCccccccccCCccchH- Confidence 2344455554443221 1 100 111122345899999999999987644321110 00123332 Q ss_pred HHHHHHHHhcccccccCCCcccccccc Q lcl|NC_019935. 123 ELWALLSVKAVGGFYIGGLPERRGFRK 149 (155) Q Consensus 123 ~y~~l~~~~g~Gg~~vgg~p~r~~~r~ 149 (155) ..-+|++++-.|+ ||+|-| -+|. T Consensus 150 ~v~~LL~p~l~~~---gg~~~~-~vrg 172 (172) T protein:vir:80 150 KIDALLNPLLVGD---GGLFLV-AVRG 172 (172) T ss_pred HHHHHHhhhhcCC---CCeeee-eecC Confidence 2334666663332 566644 4554 No 33 >protein:vir:96128 Length: 98 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240080;genbank:gi:66395776;genbank:GeneID:5133109 Probab=83.58 E-value=0.0081 Score=32.06 Aligned_cols=95 Identities=16% Similarity=0.215 Sum_probs=62.3 Q ss_pred cCHHHHHHh--ccccCCC-CcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTL--FPEFADP-AAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~--fPeFad~-~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+ -|-=.|. ..+=+.+|..+++.|+.+-++ +| .+-+..++-+|+|..+.-. T Consensus 1 Md~~dVK~ln~~~i~~~~~d~~~~~li~~y~e~aedyCN~-~F------~k~lP~gVkkfiAe~iky~------------ 61 (98) T protein:vir:96 1 MEPKEVKQLNLMPIEDTSNDDVLGDLIKFYKGIAEEYCNK-TF------EAPYPFGVRKFIAECIKYG------------ 61 (98) T ss_pred CchHHhHHhhcccCCCcchHHHHHHHHHHHHHHHHHHhCC-cc------cccCCccHHHHHHHHHhhC------------ Confidence 999999988 4432221 123345788899999988754 44 3668889999999888631 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHH Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELW 125 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~ 125 (155) ..+.++|-|.|.||.+|.+. .-...-.|| -||=..=| T Consensus 62 -----~~~nissRsMgtVSYty~T~-iP~~i~~~L--~PyRrlrw 98 (98) T protein:vir:96 62 -----TNSNVSSRTMGTVSYTFVTD-LPKATYRHL--KPFRRLRW 98 (98) T ss_pred -----CCCCcccccccceeeechhh-hhHHHHHHh--hhhhhccC Confidence 23568999999999999652 111223332 35544444 No 34 >protein:vir:5976 Length: 102 # NCBI annotation: hypothetical protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690676;genbank:geneid:6329129;genbank:gi:22855070;uniprot:Q38584;genbank:GeneID:955305 Probab=80.16 E-value=0.018 Score=30.19 Aligned_cols=100 Identities=15% Similarity=0.167 Sum_probs=60.2 Q ss_pred cCHHHHHHhccccCCC-CcCCHHHHHHHHHHHHHhhcCCCCcccccChh-HHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADP-AAYPDVRLQMYFDIACEFISDRDSPYRILNGK-ALEACLYLLTAHLLSLSTMQVQGAAGGGVT 81 (155) Q Consensus 4 fd~~~Fr~~fPeFad~-~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~-~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~ 81 (155) ||+++-+.+.|-=.+. ..+=.++|..+++.|+.+-++ +| .+.+|. -+..++-+|+|..+.-++. T Consensus 1 Md~~~VK~ll~i~~~s~d~~i~~lip~y~e~aedyCN~-~F--~dkdg~~~lP~gVkkfvAe~ik~y~~----------- 66 (102) T protein:vir:59 1 MDIQRVKRLLSITNDKHDEYLTEMVPLLVEFAKDECHN-PF--IDKDGNESIPSGVLIFVAKAAQFYMT----------- 66 (102) T ss_pred CChHHhhhhhcCCCCccHHHHHHHHHHHHHHHHHHhCC-cc--ccccccccCCccHHHHHHHHHHhcCC----------- Confidence 9999999986532110 012234677889999988765 34 122222 3778899999988864322 Q ss_pred cccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccccccc Q lcl|NC_019935. 82 AGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFR 148 (155) Q Consensus 82 ~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r 148 (155) .++++|-|.|.||.+|.+. .-...-.||+ || |+++| T Consensus 67 -----~~nissRsMgtVSYty~T~-iP~~i~~~L~--Py-----------------------Rrl~~ 102 (102) T protein:vir:59 67 -----NAGLTGRSMDTVSYNFATE-IPSTILKKLN--PY-----------------------RKMAR 102 (102) T ss_pred -----CCCcccccccceeeechhh-hhHHHHHHhh--HH-----------------------HhhcC Confidence 3668999999999999652 1112233331 22 22222 No 35 >protein:vir:96831 Length: 98 # NCBI annotation: ORF052 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240159;genbank:gi:66395852;genbank:GeneID:5133172 Probab=73.67 E-value=0.052 Score=27.64 Aligned_cols=95 Identities=20% Similarity=0.270 Sum_probs=60.8 Q ss_pred cCHHHHHHhccccCCCCc---CCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAA---YPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~---~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||+++-+.+---=-|.++ +=+.+|..+++.|+.+-++ +| .+-+..++-+|+|..+.- + T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~-~F------~~~lP~gVkkfvAe~iky---~--------- 61 (98) T protein:vir:96 1 MDALDVKMLNGTRIDDVSNDDVINKLILAYKQVAEEYCNQ-VF------GDPLPGGVKKFIAECIKY---G--------- 61 (98) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCC-cc------cccCCccHHHHHHHHHhh---c--------- Confidence 999999987321122222 2234678889999988754 45 366888999999988862 1 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHH Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELW 125 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~ 125 (155) ..++++|-|.|.||.+|.+. .-...-.|| -||=..=| T Consensus 62 -----~~~nissRsMgtVSYty~T~-iP~~i~~~L--~PyRrlrw 98 (98) T protein:vir:96 62 -----VSGNIASRSMGTVSYTYVTD-VPSSMYKYL--KPYRKLRW 98 (98) T ss_pred -----ccCCcccccccceeeechhh-hhHHHHHHh--hhhhhccC Confidence 22568999999999999652 111223333 35544444 No 36 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=69.89 E-value=0.22 Score=24.23 Aligned_cols=126 Identities=16% Similarity=0.061 Sum_probs=58.9 Q ss_pred Ceec-CHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCC--------cccccCh--------hHHHH-HHHHHHH Q lcl|NC_019935. 1 MVIF-DEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDS--------PYRILNG--------KALEA-CLYLLTA 62 (155) Q Consensus 1 ~v~f-d~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~--------~~~~~~g--------~~~~~-~l~L~tA 62 (155) +--+ |.++|...-++. ..|+..+..+..|+..|+.-.. +..+.++ ..+.. .-..+++ T Consensus 1 ~~pYLTy~ef~~lg~~~-----~~~d~F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~~~r~~~vKkA~a~ 75 (144) T protein:vir:79 1 MKPYLTTSDFEKLGYEL-----KKPDNFGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYLFRQAMAFKKAVAL 75 (144) T ss_pred CCcccchhhhhhhCCCC-----cchhhhhhHHHHHHHHhhhhhhhhccccccccccccccccchhhhhHHHHHHHHHHHH Confidence 2222 456665554443 3456688888889888886331 1222222 11221 1222233 Q ss_pred HHHHHHhhhhccccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCc Q lcl|NC_019935. 63 HLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLP 142 (155) Q Consensus 63 H~l~L~~~~~~g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p 142 (155) -+..+...+...+ ..-..+.++|.|+|..|||+...+..+...+=.+-+ ..=+.++.+.|. ...|-| T Consensus 76 QIeY~~~~G~~sa-------~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~---~~a~~yL~~tGL---LYrGV~ 142 (144) T protein:vir:79 76 EMLFLEDSGYSSA-------YDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVV---KSAYDLLGRYGL---LFSGVA 142 (144) T ss_pred HHHHHHHcCCcch-------hhhhcCccceeEecceEEeecCCCcccccccccccc---HHHHHHHhhcCc---cccccc Confidence 3333322211110 011246689999999999997655443221111111 233344444433 334555 Q ss_pred cc Q lcl|NC_019935. 143 ER 144 (155) Q Consensus 143 ~r 144 (155) +- T Consensus 143 s~ 144 (144) T protein:vir:79 143 SL 144 (144) T ss_pred cC Confidence 43 No 37 >protein:vir:79050 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:6416 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110727;genbank:gi:134287344;genbank:GeneID:4955224 Probab=61.58 E-value=0.32 Score=23.30 Aligned_cols=123 Identities=7% Similarity=-0.064 Sum_probs=64.1 Q ss_pred cC---HHHHHHhccccCCCC-cCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_019935. 4 FD---EHKFRTLFPEFADPA-AYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGG 79 (155) Q Consensus 4 fd---~~~Fr~~fPeFad~~-~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~ 79 (155) |+ .++...+..+|.-.. .-++..|++.++.+...|.+. +.--.....+...+.-+.++-++...... +...- T Consensus 1 ~~~~i~e~i~~~Lk~~~~~~~~~d~~iL~fa~e~~~n~I~N~-cNi~eiP~~L~~v~~~mai~~fl~~kk~~--~~~~l- 76 (133) T protein:vir:79 1 MGNNIIDDIEKRLESFGYILKDGDKWLIDFVREKIENIIKLD-CNIKTMPIELKEIEADMIVGEFLFTKKNM--GQLDI- 76 (133) T ss_pred CCchHHHHHHHHHHHhCCCCCccchHHHHHHHHHHHHHHhhh-cChhhcchhHHHHHHHHHHHHHHhccccc--CCCCc- Confidence 55 467777777775432 246778999999999877542 22111223334444444444444332211 11110 Q ss_pred cccccccccceeeeeecceEEEeecCCCCCcch----Hh--hhcCHHHHHHHHHHHHhc Q lcl|NC_019935. 80 VTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQ----WW--LSGTPYGQELWALLSVKA 132 (155) Q Consensus 80 ~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~----~w--~~~T~YG~~y~~l~~~~g 132 (155) ++-..-+.|+|.++|+-||++..++++...+ .| +-.+-|..++-..+|+.= T Consensus 77 --~~~D~~~~v~sIkeGDTsv~f~~~~~s~t~eq~l~s~i~~L~~~~k~~l~~yRkLrW 133 (133) T protein:vir:79 77 --ESINFEAVEKSISEGDTKVDFAIGSGSQTPEQRFDSLIAYLTAYGKNKILTFRCLRW 133 (133) T ss_pred --ccccchhhhhheecccceeecccCCCccchhHHHHHHHHHHhhcccchhhccccccC Confidence 1112345589999999999997665432222 22 113444455554444422 No 38 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=56.40 E-value=0.46 Score=22.45 Aligned_cols=123 Identities=11% Similarity=0.071 Sum_probs=64.8 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCccc-----ccC-hhHHHHHH-HHHHHHHHHHHhhhhc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYR-----ILN-GKALEACL-YLLTAHLLSLSTMQVQ 73 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~-----~~~-g~~~~~~l-~L~tAH~l~L~~~~~~ 73 (155) |-=.|.++|++.-.+ +++..+..+..|+..|+.-....- ..+ -+.++..+ ..+.+.+..+...+.. T Consensus 1 M~YlT~eey~el~~~-------~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~a~QieY~~~~G~~ 73 (130) T protein:vir:47 1 MTYLTQEEFDELDFD-------EVTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQIAYLDASGIM 73 (130) T ss_pred CCCCchhhHhhcCCC-------ChhhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHHHHHHHHHHHhccc Confidence 666678888865332 233488888888888764321111 111 12333222 2233333344322211 Q ss_pred cccccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccc Q lcl|NC_019935. 74 GAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPER 144 (155) Q Consensus 74 g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r 144 (155) .+ . -.+-++|.|+|..|||+...+.+....+. .....-+.++.+.|.| ++-|=...| T Consensus 74 s~--------~-~~~~~~S~svGrtSis~~~~~~~~~~~~~----~vs~da~~~L~~tGL~-Ly~GV~yd~ 130 (130) T protein:vir:47 74 SA--------D-DKQLANSVSIGRTSISYSTSQSTLAGQRF----NLSMDAENALRQAGFS-LVVGVAYDR 130 (130) T ss_pred cc--------h-hccCcceeeecceeeecCcCccccccCCc----cccHHHHHHHHhcccc-cccCCCccC Confidence 11 1 13557899999999999876665544331 2455566666666652 233334455 No 39 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=50.61 E-value=0.6 Score=21.78 Aligned_cols=128 Identities=13% Similarity=0.075 Sum_probs=66.4 Q ss_pred Ce-------e----cCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCC--CCccccc---------------Ch-- Q lcl|NC_019935. 1 MV-------I----FDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDR--DSPYRIL---------------NG-- 50 (155) Q Consensus 1 ~v-------~----fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~--~~~~~~~---------------~g-- 50 (155) +| . .|++++++-+-+......-+|+..+..|-.|..+|+.- +|...-+ +| T Consensus 4 iV~~~~g~~~anSYvtv~~a~aY~~~rg~~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~ 83 (169) T protein:vir:78 4 IVETGQGIPNADSYVSLEDGRALAAKYGLELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLHGFP 83 (169) T ss_pred EeeCCCCCccccccccHHHHHHHHHHcCCcCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceecccc Confidence 22 1 24666666544433333346889999999999999842 3422100 11 Q ss_pred -------hHHHHHHHHHHHHHHHHHhhhhccccccccccccccccceeeeee-cceEEEeecCCCCCcchHhhhcCHHHH Q lcl|NC_019935. 51 -------KALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATV-GEVSVAKLAPPAKNGWQWWLSGTPYGQ 122 (155) Q Consensus 51 -------~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~~g~~~G~vtSaS~-G~vSVS~d~~~~~~~~~~w~~~T~YG~ 122 (155) ...+.+-+.++.-++ .... .......+.|.++++ |.++|.|+.+.... ++..|- T Consensus 84 ~~~~~IP~~v~~A~~elA~~~~--~g~~---------~~~~~~~~~v~~e~v~G~i~veY~~~~~~~------~~~~~~- 145 (169) T protein:vir:78 84 QPSNVIPPLVIQAQVMAAVEYG--AGTD---------VRGSTDGREVQTERVEGAVTVSYFKNGYSG------GTVSIT- 145 (169) T ss_pred cccccchHHHHHHHHHHHHHHh--cCcc---------cCCCCCcceeEEEEecCceeEeecCCCCCC------CcccHH- Confidence 333344444333221 1110 111223466788777 99999998654332 122221 Q ss_pred HHHHHHHHhcccccccCCCcccccccc Q lcl|NC_019935. 123 ELWALLSVKAVGGFYIGGLPERRGFRK 149 (155) Q Consensus 123 ~y~~l~~~~g~Gg~~vgg~p~r~~~r~ 149 (155) .--+|++++-.|+ +|.++=..||. T Consensus 146 ~~~~LL~p~l~~~---~g~~~i~~~rg 169 (169) T protein:vir:78 146 TADDALRPLLCGS---NNAYSFNVFRG 169 (169) T ss_pred HHHHHhhhhcccC---CCcceeeeecC Confidence 2225677665443 45566666666 No 40 >protein:vir:4702 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061634;genbank:gi:9635721;genbank:GeneID:1263015 Probab=36.08 E-value=1.2 Score=20.16 Aligned_cols=102 Identities=13% Similarity=0.057 Sum_probs=46.2 Q ss_pred CeecCHHHHHHhcc-ccCCCCcCCHHHHHHHHHHHHHhhcCCC------CcccccChhHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_019935. 1 MVIFDEHKFRTLFP-EFADPAAYPDVRLQMYFDIACEFISDRD------SPYRILNGKALEACLYLLTAHLLSLSTMQVQ 73 (155) Q Consensus 1 ~v~fd~~~Fr~~fP-eFad~~~~pD~~i~~~~~~A~~~~~~~~------~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~ 73 (155) |++=|+++++.--- .|. ..|+.|+.+++-|+.+|.+.- ..-.......++.++.++++|+-.=+.... T Consensus 3 vt~~dLeeiK~~LRID~d----~DD~li~~~i~AA~~~I~~ai~~~~~~~~~~~~~~~~~~~AvllLv~~~YeNR~a~~- 77 (113) T protein:vir:47 3 LTAEELKLLKKHCKIDHN----SEDDLLEIYYSWAFHEIASAVTDEPSKYIDWFKSHPLFARAIYPLASYYFENRIAYL- 77 (113) T ss_pred ccHHHHHHHHHHhCCCCC----cchHHHHHHHHHHHHHHHhhccccccccccccCCchHHHHHHHHHHHHHHhhhhhcc- Confidence 44445777877443 232 479999999999999995421 111112235788999999999975332111 Q ss_pred cccccccccccccccceeeeeecceEEEee--cCCCCCcchHhhhcC Q lcl|NC_019935. 74 GAAGGGVTAGGTQGGFITSATVGEVSVAKL--APPAKNGWQWWLSGT 118 (155) Q Consensus 74 g~~~~~~~~~g~~~G~vtSaS~G~vSVS~d--~~~~~~~~~~w~~~T 118 (155) ...... ..--|.| =+....-.|. .....+.+ .+| T Consensus 78 ~~~~~~------vp~~v~s-li~qlR~~y~~~~~~~~~~~----~~~ 113 (113) T protein:vir:47 78 DRDLSL------APHMVLS-TVHKLRGSFEQFLESENDEE----SGT 113 (113) T ss_pred cccccc------ccHHHHH-HHHHHHHHHHHHhhhcCCCC----CCC Confidence 000000 0000000 0000000010 00000000 122 No 41 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=35.32 E-value=1.2 Score=20.08 Aligned_cols=128 Identities=14% Similarity=0.079 Sum_probs=62.7 Q ss_pred Ce-------e----cCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCC--CCccc---------------ccC--- Q lcl|NC_019935. 1 MV-------I----FDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDR--DSPYR---------------ILN--- 49 (155) Q Consensus 1 ~v-------~----fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~--~~~~~---------------~~~--- 49 (155) +| . .+++++++-+-+......-+|+..+..|-.|..+|+.- +|... +.+ T Consensus 4 iv~~~~g~~~anSYvt~~ea~aY~~~rg~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~ 83 (169) T protein:vir:95 4 IVETGQGLPNADSYVSLEDGRALAAKYGLELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFP 83 (169) T ss_pred EEeCCCCCCcccccccHHHHHHHHHHcCCcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceecccc Confidence 22 1 23566666544333333346788999999999999852 34211 001 Q ss_pred ------hhHHHHHHHHHHHHHHHHHhhhhcccccccccccccccccee-eeeecceEEEeecCCCCCcchHhhhcCHHHH Q lcl|NC_019935. 50 ------GKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFIT-SATVGEVSVAKLAPPAKNGWQWWLSGTPYGQ 122 (155) Q Consensus 50 ------g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~~g~~~G~vt-SaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~ 122 (155) ++..+++-+.++.-++. ..... +....++|. ++.+|.++|+|+.+..... +..|= T Consensus 84 ~~~~~IP~~V~~A~~elA~~~~~--g~~~~---------~~~~~~~v~~e~v~G~i~veY~~~~~~~~------~~~~~- 145 (169) T protein:vir:95 84 QPSNVIPSLVIQAQVMAAVEYGA--GTDVR---------GSTDGREVQTERVEGAVTVSYFKNGYSGG------TVSIT- 145 (169) T ss_pred cccccchHHHHHHHHHHHHHHHc--Ccccc---------CCCCccceeeeeeccceeEeecCCCCcCc------cccHH- Confidence 12333343333332221 11100 111223444 4667999999976544321 11111 Q ss_pred HHHHHHHHhcccccccCCCcccccccc Q lcl|NC_019935. 123 ELWALLSVKAVGGFYIGGLPERRGFRK 149 (155) Q Consensus 123 ~y~~l~~~~g~Gg~~vgg~p~r~~~r~ 149 (155) .--+|++++-.|+ +|.++=..||. T Consensus 146 a~~~LL~p~l~g~---~g~~~i~~~rg 169 (169) T protein:vir:95 146 AADDALRPLLCGS---NNAYSFNVFRG 169 (169) T ss_pred HHHHhhhhhcccC---CCcceeeeecC Confidence 1225677665443 45566666666 No 42 >protein:vir:105899 Length: 116 # NCBI annotation: head completion protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004377;genbank:gi:122891832;genbank:GeneID:4712370 Probab=33.40 E-value=1 Score=20.49 Aligned_cols=109 Identities=9% Similarity=-0.049 Sum_probs=58.7 Q ss_pred cC-HHHHHHhccccCCCCcCCHH------HHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_019935. 4 FD-EHKFRTLFPEFADPAAYPDV------RLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAA 76 (155) Q Consensus 4 fd-~~~Fr~~fPeFad~~~~pD~------~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~ 76 (155) |+ .++-+.+--.--...+ .|+ .+..++.+|+.+-++ +| .+.+.+-+..++-.|+|.-+.-+.+ T Consensus 1 ~~~~~DVk~ln~k~~~~~t-sD~~d~~l~ev~~~l~~A~dyCnn-~F--~~dg~~~lP~gVkkFVA~~iky~~~------ 70 (116) T protein:vir:10 1 MTLYEDVKLLLKKNGVEVK-SDEEEIFKMEVDGILEDVRDITNN-DF--MKDGQVIYPYSIKKYVADVLEYYQR------ 70 (116) T ss_pred CchHHHHHHHhcCCCCCcc-cchHHHHHHhhHHHHHHHHHHhcC-cc--cccCCccCcchhHHHHHHHHHhhcc------ Confidence 44 2444443211001111 222 334678888777654 44 2223566888999999988876543 Q ss_pred ccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhc Q lcl|NC_019935. 77 GGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKA 132 (155) Q Consensus 77 ~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g 132 (155) .+..+.++|-|.|.||-+|.+.- +..--=.+-||=..=|.-.++.+ T Consensus 71 -------p~t~~nlssRSMGTVSYty~Te~---P~~~~~~L~PyRklrw~~~~~~~ 116 (116) T protein:vir:10 71 -------PEVKKNLKSRSMGTVSYTYNDGV---PDYISGVLNRYKRAKFHPFKPIR 116 (116) T ss_pred -------cccccCcccccccceeeeccccc---hHHHHHhhhhhhhcccCCCCCCC Confidence 12456799999999999985421 11111123455444444444433 No 43 >protein:vir:94126 Length: 116 # NCBI annotation: ORF041 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240236;genbank:gi:66395926;genbank:GeneID:5133295 Probab=33.40 E-value=1 Score=20.49 Aligned_cols=109 Identities=9% Similarity=-0.049 Sum_probs=58.7 Q ss_pred cC-HHHHHHhccccCCCCcCCHH------HHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_019935. 4 FD-EHKFRTLFPEFADPAAYPDV------RLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAA 76 (155) Q Consensus 4 fd-~~~Fr~~fPeFad~~~~pD~------~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~ 76 (155) |+ .++-+.+--.--...+ .|+ .+..++.+|+.+-++ +| .+.+.+-+..++-.|+|.-+.-+.+ T Consensus 1 ~~~~~DVk~ln~k~~~~~t-sD~~d~~l~ev~~~l~~A~dyCnn-~F--~~dg~~~lP~gVkkFVA~~iky~~~------ 70 (116) T protein:vir:94 1 MTLYEDVKLLLKKNGVEVK-SDEEEIFKMEVDGILEDVRDITNN-DF--MKDGQVIYPYSIKKYVADVLEYYQR------ 70 (116) T ss_pred CchHHHHHHHhcCCCCCcc-cchHHHHHHhhHHHHHHHHHHhcC-cc--cccCCccCcchhHHHHHHHHHhhcc------ Confidence 44 2444443211001111 222 334678888777654 44 2223566888999999988876543 Q ss_pred ccccccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhc Q lcl|NC_019935. 77 GGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKA 132 (155) Q Consensus 77 ~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g 132 (155) .+..+.++|-|.|.||-+|.+.- +..--=.+-||=..=|.-.++.+ T Consensus 71 -------p~t~~nlssRSMGTVSYty~Te~---P~~~~~~L~PyRklrw~~~~~~~ 116 (116) T protein:vir:94 71 -------PEVKKNLKSRSMGTVSYTYNDGV---PDYISGVLNRYKRAKFHPFKPIR 116 (116) T ss_pred -------cccccCcccccccceeeeccccc---hHHHHHhhhhhhhcccCCCCCCC Confidence 12456799999999999985421 11111123455444444444433 No 44 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=29.08 E-value=1.3 Score=19.95 Aligned_cols=94 Identities=11% Similarity=0.045 Sum_probs=50.9 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) +-.+|+++++.-----.| ..|+.|+.+++-|+.++.+.--..........+.++.++++|+=.=+.. .+. T Consensus 1 Mm~vtLee~K~~LRID~d---~dD~lI~~li~aA~~~i~~~~g~~~~~~~~~~~~Avl~lv~~~YeNRe~--~~~----- 70 (95) T protein:vir:81 1 MMIVTLEEVKNWLRVDFS---DDDALITTLINAAEEYLKNATGTTFDATNHLAKIFCMTLIADWYENREL--VGR----- 70 (95) T ss_pred CCcCCHHHHHHHcCCCCC---cchHHHHHHHHHHHHHHHHhhccccccCchHHHHHHHHHHHHHHhhccc--ccc----- Confidence 777788888874432222 4799999999999999976533233445678999999999998742111 000 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcch-HhhhcCH Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQ-WWLSGTP 119 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~-~w~~~T~ 119 (155) ++..+.+........-. ..+.+|. T Consensus 71 ---------------~~~~~p~~v~sll~~lr~~~~~~~~ 95 (95) T protein:vir:81 71 ---------------ASDQVRPILQSILAQLTYAYGGETA 95 (95) T ss_pred ---------------ccccccHHHHHHHHHhhhccccccC Confidence 00011111110000000 0122222 No 45 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=28.01 E-value=1.4 Score=19.80 Aligned_cols=70 Identities=17% Similarity=0.228 Sum_probs=45.2 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) =|||+= =|| .+|++.|+..+++|..++.+. T Consensus 119 tVTytH-----Gy~------evP~eiv~lv~d~A~~~~~np--------------------------------------- 148 (188) T protein:vir:78 119 RVTYTH-----GYN------PVPDELIDVAIRLAREYQSNP--------------------------------------- 148 (188) T ss_pred EEEEec-----CCC------cccHHHHHHHHHHHHHHhcCc--------------------------------------- Confidence 233321 122 279999999999999988642 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFY 137 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~ 137 (155) ....|.++|+.|++|...+..+ -.+.=+.++++|..+... T Consensus 149 -------~~L~q~~vG~~S~tfa~~~~~s----------l~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 149 -------ELLVSKQVGEIERRFGSVAGTS----------LSKADQAILDRYVIATLA 188 (188) T ss_pred -------ccceeeecCceeeecccccCCc----------ccchhHHhhccccccccC Confidence 1246899999999998543322 223335677777766554 No 46 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=28.01 E-value=1.4 Score=19.80 Aligned_cols=70 Identities=17% Similarity=0.228 Sum_probs=45.2 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) =|||+= =|| .+|++.|+..+++|..++.+. T Consensus 119 tVTytH-----Gy~------evP~eiv~lv~d~A~~~~~np--------------------------------------- 148 (188) T protein:vir:10 119 RVTYTH-----GYN------PVPDELIDVAIRLAREYQSNP--------------------------------------- 148 (188) T ss_pred EEEEec-----CCC------cccHHHHHHHHHHHHHHhcCc--------------------------------------- Confidence 233321 122 279999999999999988642 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFY 137 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~ 137 (155) ....|.++|+.|++|...+..+ -.+.=+.++++|..+... T Consensus 149 -------~~L~q~~vG~~S~tfa~~~~~s----------l~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 149 -------ELLVSKQVGEIERRFGSVAGTS----------LSKADQAILDRYVIATLA 188 (188) T ss_pred -------ccceeeecCceeeecccccCCc----------ccchhHHhhccccccccC Confidence 1246899999999998543322 223335677777766554 No 47 >protein:vir:7410 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839927;genbank:gi:30089897;genbank:GeneID:1260684 Probab=27.73 E-value=1.8 Score=19.16 Aligned_cols=96 Identities=17% Similarity=0.146 Sum_probs=49.5 Q ss_pred eecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCC--------cccccChhHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_019935. 2 VIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDS--------PYRILNGKALEACLYLLTAHLLSLSTMQVQ 73 (155) Q Consensus 2 v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~--------~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~ 73 (155) .+.|+++|+.----- ++ .|+.|+.++.-|+.+|.+.-- -....+-.+++.+..+|++|+-.=+... T Consensus 1 M~v~LdeiK~~LRID-dd---DD~ll~~~i~aAe~yI~~Aig~~~~~~~fy~~e~~~~l~~~Avl~La~~wYeNR~at-- 74 (107) T protein:vir:74 1 MSVTVDDLLDQLSED-DD---RKPQLQIYFDTATAYVKNAVSSDTVDAPFFNVENVSPIYDVAVLSYSMDLWINRSTT-- 74 (107) T ss_pred CeecHHHHHHHcCCC-CC---hhHHHHHHHHHHHHHHhhhcCCcccccccccccCcchHHHHHHHHHHHHHHHhcccc-- Confidence 777888888855432 32 799999999999999975421 0111234578899999999997422111 Q ss_pred cccccccc-cccccccceeeeee--cceEEEeecCCCCCcchHhhhcCH Q lcl|NC_019935. 74 GAAGGGVT-AGGTQGGFITSATV--GEVSVAKLAPPAKNGWQWWLSGTP 119 (155) Q Consensus 74 g~~~~~~~-~~g~~~G~vtSaS~--G~vSVS~d~~~~~~~~~~w~~~T~ 119 (155) ...+-+.. --.++.|.-.+.++ ++.-- +|. T Consensus 75 ~~vp~~v~siI~QLRg~y~~~~e~~~~~~~----------------~~~ 107 (107) T protein:vir:74 75 MPPTTAVDHMVGQLRGLYSSWKEEQGGQNL----------------QTE 107 (107) T ss_pred ccccHHHHHHHHHHhhcccchhhhcCCCcc----------------cCC Confidence 00000000 00112222222111 11111 111 No 48 >protein:vir:102158 Length: 99 # NCBI annotation: uncharacterized phage protein (possible DNA packaging) # Family: family:all:316 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699940;genbank:gi:110804046;genbank:GeneID:4206702 Probab=25.42 E-value=2.1 Score=18.86 Aligned_cols=98 Identities=10% Similarity=0.052 Sum_probs=50.7 Q ss_pred eecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_019935. 2 VIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGVT 81 (155) Q Consensus 2 v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~ 81 (155) -.+|++++++-----.| ..|+.|+.+++-|+.++.+.--..........+.++.++++|+=.=+.....+... T Consensus 1 M~vtLee~K~~LRID~d---~dD~lI~~~i~aA~~~i~~~~~~~~~~~~~~~k~Avl~lv~~~YenR~~~~~~~~~---- 73 (99) T protein:vir:10 1 MILSVDEVKNYLRVDYD---EDDILIQDLIESAEDYLYNATGKKFTEKNKLAKRYCLALVYDWYKDKGMNIRATKN---- 73 (99) T ss_pred CcCCHHHHHHHcCCCCC---cchHHHHHHHHHHHHHHHHhhCCCCCCCChHHHHHHHHHHHHhHhcchhhhhhhhc---- Confidence 66778888874432222 47999999999999999765333344556778899999999997533221111100 Q ss_pred cccccccceeeeeecceEEEeecCCCCCcchHh-hhcC Q lcl|NC_019935. 82 AGGTQGGFITSATVGEVSVAKLAPPAKNGWQWW-LSGT 118 (155) Q Consensus 82 ~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w-~~~T 118 (155) +. ...-+.|........-... +..| T Consensus 74 ----------~~--~~~~lp~~v~sli~qlr~~~~~~~ 99 (99) T protein:vir:10 74 ----------TT--VSEKVKYTLQSILLQLKFCKEEDT 99 (99) T ss_pred ----------cc--hhhhhhHHHHHHHHHHhhccCCCC Confidence 00 0000111111111111000 1112 No 49 >protein:vir:105776 Length: 133 # NCBI annotation: gp11 # Family: family:all:10997 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224149;genbank:gi:62362224;genbank:GeneID:3342529 Probab=24.90 E-value=2.1 Score=18.79 Aligned_cols=119 Identities=17% Similarity=0.163 Sum_probs=61.0 Q ss_pred ecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHh---hcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_019935. 3 IFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEF---ISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGG 79 (155) Q Consensus 3 ~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~---~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~ 79 (155) -.|.++-|+..-+.. ..+||.+|+.++++++.. ++. .+ .+..+.++.+|.+-+|.+... T Consensus 1 mIT~~qa~~~L~slG--~svP~~iL~~~v~q~nsi~~cLda-gY------~e~tq~LI~lya~~LlA~~~g--------- 62 (133) T protein:vir:10 1 MITTEQAKEYLESVG--ITLPDFILQAIVEQANSIQECLDA-HY------PPATALLIQSYLLGLMALGQG--------- 62 (133) T ss_pred CCCHHHHHHHHHhcC--CcchHHHHHHHHHHHhhHHHHHhC-CC------CHHHHHHHHHHHHHHHhhccC--------- Confidence 344555555444433 248999999999998543 332 22 477888999999999876322 Q ss_pred cccccccccceeeeee-cceEEEeecCCCCCcchHhhhcCHHHHHHHHHH--HHhcccccccCCCcccccc----cccCc Q lcl|NC_019935. 80 VTAGGTQGGFITSATV-GEVSVAKLAPPAKNGWQWWLSGTPYGQELWALL--SVKAVGGFYIGGLPERRGF----RKVGG 152 (155) Q Consensus 80 ~~~~g~~~G~vtSaS~-G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~--~~~g~Gg~~vgg~p~r~~~----r~vgg 152 (155) ..+|+|.+. -.-|-||++......| =+.|=+|+ -+.|--+=.++--|.-++| =-||| T Consensus 63 -------~R~IsSQ~APSGASrSF~Y~~~~~~~---------~~l~~~L~~lD~~gCt~~Lip~d~~~~a~vG~f~vvgg 126 (133) T protein:vir:10 63 -------DRYISSQTAPNGASRSFRYQSFADRW---------KGALSLLRGADKFRCANGLIPPDPTNTAFAGIWIGKGG 126 (133) T ss_pred -------CceeecccCCccccccccccCCCccH---------HHHHHHHHhhhhccccccccCCCccccccceeeeeccc Confidence 245666555 3456677664333222 12222222 2222211112222443332 22333 Q ss_pred ccC Q lcl|NC_019935. 153 TFW 155 (155) Q Consensus 153 ~~~ 155 (155) .-= T Consensus 127 c~c 129 (133) T protein:vir:10 127 CMC 129 (133) T ss_pred ccc Confidence 333 No 50 >protein:vir:4602 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058447;genbank:gi:9635173;genbank:GeneID:1262723 Probab=23.75 E-value=2.2 Score=18.68 Aligned_cols=102 Identities=10% Similarity=-0.015 Sum_probs=46.4 Q ss_pred CeecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCC------CcccccChhHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_019935. 1 MVIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRD------SPYRILNGKALEACLYLLTAHLLSLSTMQVQG 74 (155) Q Consensus 1 ~v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~------~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g 74 (155) |.+=|+++++.-.---.| ..|++|+.+++-|..+|.+.- ..-...+-..++.++.++++|+-.=+.....+ T Consensus 3 ~t~~dL~~iK~~lRID~d---~DD~li~~yi~AA~~yI~~aig~~~~~~~~~~~~~~~~~~Avl~Lv~~~YeNR~a~~~~ 79 (110) T protein:vir:46 3 LTAEELKLLKKHCKIDHN---SEDDLLEIYYSWAFHEIASAVTDEPSKYIDWFKSHPLFARATYPLASYYFENRIAYLDR 79 (110) T ss_pred ccHHHHHHHHHHhCCCCC---chHHHHHHHHHHHHHHHHhhccCCcccccCccCcchHHHHHHHHHHHHHHHhccccccc Confidence 444457777774443222 589999999999999996521 10011224578899999999998532111110 Q ss_pred ccccccccccccccceeeeeecceEEEeecCCCCCcchHhh Q lcl|NC_019935. 75 AAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWL 115 (155) Q Consensus 75 ~~~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~ 115 (155) . .....--|.| =+....-.|..-.-...++ + T Consensus 80 ~-------~~~vp~~v~s-lI~qLRg~y~~~~e~e~~~--~ 110 (110) T protein:vir:46 80 D-------LSLAPHMVLS-TVHKLRGSFEQFLESENDE--I 110 (110) T ss_pred c-------cccccHHHHH-HHHHHHHhHhHhhcccccC--C Confidence 0 0000000000 0000001110000000000 0 No 51 >protein:vir:9928 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795690;genbank:gi:28876458;genbank:GeneID:1258013 Probab=22.42 E-value=2.4 Score=18.45 Aligned_cols=115 Identities=9% Similarity=0.030 Sum_probs=50.3 Q ss_pred cCHHHHHHhcccc---CCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEF---ADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) Q Consensus 4 fd~~~Fr~~fPeF---ad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~ 80 (155) ||-+.-.+.-... .+...--|++|+.+++.|+..|...--.....+.+....-|..++--+..- .-...|+.| T Consensus 1 md~~~~L~~vK~~lgI~~~D~~~D~lL~~~i~~a~~~i~~~l~~~~~~~~~eiP~~l~~iv~evav~-ryNR~g~EG--- 76 (118) T protein:vir:99 1 MGDKQLIDDIKLFIGISKGDGAQDELITLAIYESKERVLAKLNEYSETEITKIPDRLRFIVRDVAIK-RFNRINSEG--- 76 (118) T ss_pred CchhhHHHHHHHHhCCCCCchhhHHHHHHHHHHHHHHHHHHhccccccchhhhhHHHHHHHHHHHHH-HhcCcCCcc--- Confidence 8866555444332 211123488999999999998853210000011122233333333222211 111222221 Q ss_pred ccccccccceeeeeecceEEEeecCCCCCcchHhhhcCHHHHHHHHHHHHhcccccccCCCccccccccc Q lcl|NC_019935. 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFYIGGLPERRGFRKV 150 (155) Q Consensus 81 ~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~w~~~T~YG~~y~~l~~~~g~Gg~~vgg~p~r~~~r~v 150 (155) ++|.|++.+|+||+.. -.....++ ..++-+.- ...|..+|-+ T Consensus 77 ---------~~S~SeeG~S~sf~~d--~~ey~~~l-------------~~~~~~~~----~~~~g~v~Fi 118 (118) T protein:vir:99 77 ---------AVEDSEEGKTFKWDSY--LKEYESTL-------------RSAAIGKV----YSGKGVARFI 118 (118) T ss_pred ---------cceeecCCeeeeeccC--chhHHHHH-------------HHHhhhcc----cCcCcceeeC Confidence 5899999999999631 22223333 22222111 0112122222 No 52 >protein:vir:4954 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049930;genbank:gi:9632901;genbank:GeneID:1262077 Probab=22.34 E-value=2.5 Score=18.44 Aligned_cols=99 Identities=9% Similarity=0.029 Sum_probs=48.0 Q ss_pred eecCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCc-----ccccChhHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_019935. 2 VIFDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSP-----YRILNGKALEACLYLLTAHLLSLSTMQVQGAA 76 (155) Q Consensus 2 v~fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~-----~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~ 76 (155) -.+|+++++.-----.| -.|+.|+.+++-|+.+|.+.--. .........+.++.++++|+-.=+.....++. T Consensus 1 M~vtLeeiK~~LRID~d---ddD~li~~~i~aA~~yi~~aig~~~~~~~~~~~~~~~~~Avl~Lv~~~YeNR~~~~~~~~ 77 (104) T protein:vir:49 1 MSVSKTSIMQTLNLDET---DDTALIPAYIESAKQYIINAVGSDSKFYDLDSVRALFDTAVIALTSSYFTYRVALTDTAT 77 (104) T ss_pred CcccHHHHHHHcCCCCc---cchHHHHHHHHHHHHHHHHhhCCCCccccccCCChHHHHHHHHHHHHHHhhchhcccccc Confidence 55578888874432112 37999999999999998753210 11112357889999999999753222111100 Q ss_pred ccccccccccccceeeeeecceEEEeecCCCCCcc Q lcl|NC_019935. 77 GGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGW 111 (155) Q Consensus 77 ~~~~~~~g~~~G~vtSaS~G~vSVS~d~~~~~~~~ 111 (155) . ...--|.| =+....-.|+.-.-.+.+ T Consensus 78 ~-------~vp~~v~s-li~qLr~~y~~~~e~~~~ 104 (104) T protein:vir:49 78 Y-------PVNLTLNS-IIGQLRGLYATYSEERGD 104 (104) T ss_pred c-------hhhHHHHH-HHHHHHHhhhhhhhccCC Confidence 0 00000000 001111122221111111 No 53 >protein:vir:1384 Length: 92 # NCBI annotation: Gp7 protein # Family: family:all:316 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612836;genbank:gi:20065970;genbank:GeneID:935785 Probab=20.74 E-value=2.7 Score=18.20 Aligned_cols=92 Identities=11% Similarity=0.027 Sum_probs=48.5 Q ss_pred cCHHHHHHhccccCCCCcCCHHHHHHHHHHHHHhhcCCCCcccccChhHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019935. 4 FDEHKFRTLFPEFADPAAYPDVRLQMYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAG 83 (155) Q Consensus 4 fd~~~Fr~~fPeFad~~~~pD~~i~~~~~~A~~~~~~~~~~~~~~~g~~~~~~l~L~tAH~l~L~~~~~~g~~~~~~~~~ 83 (155) +|++++++-----.| ..|+.|+.+++-|+.+|.+.. ..........+.++.|+++|+-.=+.....++.. T Consensus 1 vtLeevK~~LRID~d---dDD~lI~~~i~aA~~~i~~~~-~~~~~~~~~~~~Avlllv~~~YenR~~~~~~~~~------ 70 (92) T protein:vir:13 1 MDLRELKEYLRIDFE---EDDILLRSLLLAAEEYLYNAG-IKRDYKKSLYSLAIKILVKHWYDNRDCVVAGNVN------ 70 (92) T ss_pred CCHHHHHHHcCCCCC---cchHHHHHHHHHHHHHHHhhc-cccccchhHHHHHHHHHHHHhHhccccccccchh------ Confidence 999999985443222 479999999999999997653 2333445678899999999997532111111100 Q ss_pred cccccceeeeeecceEEEeecCCCCCcchH Q lcl|NC_019935. 84 GTQGGFITSATVGEVSVAKLAPPAKNGWQW 113 (155) Q Consensus 84 g~~~G~vtSaS~G~vSVS~d~~~~~~~~~~ 113 (155) ....--|.| +=-.+.... ..++ T Consensus 71 ~~ip~~v~s-----ll~~lR~~~---~~~~ 92 (92) T protein:vir:13 71 NKLEYSLNA-----ILTQLRYCG---DDNG 92 (92) T ss_pred hhhhHHHHH-----HHHHhhhcc---CCCC Confidence 000000100 000111111 1111 Done!