Query         lcl|NC_019509.1_cdsid_YP_007005390.1 [gene=F414_gp09] [protein=hypothetical protein] [protein_id=YP_007005390.1] [location=complement(6851..7246)]
Match_columns 131
No_of_seqs    100 out of 120
Neff          6.5
Searched_HMMs 1612
Date          Thu Nov 7 17:15:02 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_11 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_11_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:79640 Length: 134  100.0 8.8E-48 5.5E-51  278.6  10.4  131     1-131  1-134 (134)
  2 protein:vir:107702 Length: 136 100.0 6.4E-47   4E-50  273.8  10.7  130     1-130  1-136 (136)
  3 protein:vir:5256 Length: 119 # 100.0 9.4E-43 5.9E-46  251.0  11.8  116     1-122  1-119 (119)
  4 protein:vir:104344 Length: 132 100.0   4E-43 2.5E-46  253.0   9.5  131     1-131  1-131 (132)
  5 protein:vir:107756 Length: 147 100.0 3.5E-39 2.2E-42  231.4  12.0  125     1-131  1-146 (147)
  6 protein:vir:103283 Length: 125 100.0 1.1E-39 6.5E-43  234.3   8.5  123     9-131  1-125 (125)
  7 protein:vir:99570 Length: 153  100.0 6.1E-39 3.8E-42  230.1  12.3  129     1-131  1-144 (153)
  8 protein:vir:96108 Length: 155  100.0 1.1E-38 6.9E-42  228.6  12.0  129     1-131  1-146 (155)
  9 protein:vir:94064 Length: 167  100.0 2.1E-38 1.3E-41  227.2  12.0  130     1-131  1-143 (167)
 10 protein:vir:78595 Length: 158  100.0 1.4E-36 8.4E-40  217.2  12.5  127     1-131  1-144 (158)
 11 protein:vir:106739 Length: 158 100.0 1.4E-36 8.4E-40  217.2  12.5  127     1-131  1-144 (158)
 12 protein:vir:3639 Length: 158 # 100.0 2.2E-36 1.4E-39  216.0  11.9  127     1-131  1-144 (158)
 13 protein:vir:101559 Length: 158 100.0 2.2E-36 1.4E-39  216.0  11.9  127     1-131  1-144 (158)
 14 protein:vir:80036 Length: 111   99.7 1.3E-21 8.2E-25  135.1   7.9  110     1-126  1-111 (111)
 15 protein:vir:43 Length: 131 # N  96.5 4.6E-05 2.9E-08   44.4   8.7  115     1-131  1-130 (131)
 16 protein:vir:80967 Length: 131   95.8 0.00019 1.2E-07   41.0   8.8  115     1-131  1-130 (131)
 17 protein:vir:102961 Length: 131  95.7 0.00017   1E-07   41.3   7.9  105     5-115  1-131 (131)
 18 protein:vir:98900 Length: 132   92.8  0.0041 2.5E-06   33.7   9.0  116     1-129  1-132 (132)
 19 protein:vir:80389 Length: 172   89.4   0.026 1.6E-05   29.2  11.1  119     1-128  1-172 (172)
 20 protein:vir:79050 Length: 133   88.2   0.012 7.2E-06   31.2   7.3  113     1-116  1-133 (133)
 21 protein:vir:94955 Length: 170   85.0   0.056 3.5E-05   27.4  11.0  114     1-128  1-170 (170)
 22 protein:vir:95004 Length: 169   83.9   0.065   4E-05   27.1  10.4  118     1-128  1-169 (169)
 23 protein:vir:78383 Length: 169   83.5   0.068 4.2E-05   27.0  10.5  118     1-128  1-169 (169)
 24 protein:vir:95176 Length: 172   83.3    0.07 4.3E-05   26.9   9.9  118     1-128  17-172 (172)
 25 protein:vir:4788 Length: 130 #  76.8    0.13 8.2E-05   25.4   8.4  114     1-128  1-130 (130)
 26 protein:vir:96128 Length: 98 #  76.5   0.032   2E-05   28.8   4.8   89     1-109  1-98 (98)
 27 protein:vir:5976 Length: 102 #  73.0   0.056 3.4E-05   27.5   5.2   91     1-113  1-102 (102)
 28 protein:vir:79253 Length: 138   68.8    0.22 0.00013   24.2   7.5   96      1-98  1-138 (138)
 29 protein:vir:99222 Length: 138   68.8    0.22 0.00013   24.2   7.5   96      1-98  1-138 (138)
 30 protein:vir:107119 Length: 104  67.9   0.017 1.1E-05   30.2   1.3   95     1-122  1-104 (104)
 31 protein:vir:105327 Length: 104  67.9   0.017 1.1E-05   30.2   1.3   95     1-122  1-104 (104)
 32 protein:vir:105776 Length: 133  67.3    0.25 0.00016   23.9   7.8  110     1-131  1-130 (133)
 33 protein:vir:97329 Length: 104   66.6   0.019 1.2E-05   30.0   1.2   95     1-122  1-104 (104)
 34 protein:vir:94798 Length: 104   66.5   0.019 1.2E-05   30.0   1.2   95     1-122  1-104 (104)
 35 protein:vir:95891 Length: 104   66.2    0.02 1.2E-05   30.0   1.2   95     1-122  1-104 (104)
 36 protein:vir:96281 Length: 104   66.2    0.02 1.2E-05   30.0   1.2   95     1-122  1-104 (104)
 37 protein:vir:96831 Length: 98 #  65.4   0.013 8.1E-06   30.9   0.1   90     1-109  1-98 (98)
 38 protein:vir:103846 Length: 138  62.1    0.34 0.00021   23.2   7.5   96      1-98  1-138 (138)
 39 protein:vir:9706 Length: 100 #  61.2    0.08   5E-05   26.6   3.6   85      1-86  3-100 (100)
 40 protein:vir:1241 Length: 104 #  58.9     0.3 0.00019   23.4   6.4   97     1-122  1-104 (104)
 41 protein:vir:100885 Length: 110  58.8    0.22 0.00014   24.2   5.6   95      1-97  1-110 (110)
 42 protein:vir:93740 Length: 104   57.5   0.049   3E-05   27.8   1.7   97     1-122  1-104 (104)
 43 protein:vir:94492 Length: 104   54.5    0.06 3.7E-05   27.3   1.7   97     1-122  1-104 (104)
 44 protein:vir:97430 Length: 104   54.5    0.06 3.7E-05   27.3   1.7   97     1-122  1-104 (104)
 45 protein:vir:106583 Length: 105  53.9    0.52 0.00032   22.2   9.1   99     1-119  2-105 (105)
 46 protein:vir:95071 Length: 104   53.8   0.063 3.9E-05   27.2   1.7   97     1-122  1-104 (104)
 47 protein:vir:9821 Length: 138 #  51.9    0.57 0.00035   21.9   8.1  111     1-128  6-138 (138)
 48 protein:vir:81159 Length: 95 #  48.0    0.33  0.0002   23.3   4.7   89     1-103  1-95 (95)
 49 protein:vir:99922 Length: 165   45.1    0.78 0.00048   21.2   6.8  114     1-131  1-146 (165)
 50 protein:vir:192 Length: 108 #   42.8    0.61 0.00038   21.8   5.3   88     1-119  6-108 (108)
 51 protein:vir:1887 Length: 108 #  42.8    0.61 0.00038   21.8   5.3   88     1-119  6-108 (108)
 52 protein:vir:1640 Length: 132 #  41.9    0.91 0.00056   20.8   8.1  110     1-127  1-132 (132)
 53 protein:vir:100103 Length: 120  40.6    0.96  0.0006   20.7   6.6   90     1-121  5-120 (120)
 54 protein:vir:107864 Length: 150  40.6    0.96  0.0006   20.7   7.3   96      1-98  1-150 (150)
 55 protein:vir:4702 Length: 113 #  39.8       1 0.00062   20.6   7.2   97     1-102  1-113 (113)
 56 protein:vir:100211 Length: 114  38.7    0.98  0.0006   20.6   5.8   91      1-97  1-114 (114)
 57 protein:vir:79074 Length: 150   36.2     1.2 0.00074   20.2   7.1   96      1-98  1-150 (150)
 58 protein:vir:93592 Length: 108   34.0     1.1 0.00069   20.3   5.3   89     1-122  2-108 (108)
 59 protein:vir:94761 Length: 132   33.1     1.4 0.00085   19.8   8.3  114     1-127  1-132 (132)
 60 protein:vir:97267 Length: 172   31.7     1.5 0.00092   19.6   9.8  117     1-125  16-172 (172)
 61 protein:vir:1993 Length: 141 #  28.4     1.8  0.0011   19.2   7.6   97      1-99  1-141 (141)
 62 protein:vir:79701 Length: 144   26.4     1.9  0.0012   19.0   8.0  117     1-129  1-144 (144)
 63 protein:vir:80668 Length: 153   25.9       2  0.0012   18.9   8.2  112     1-131  1-138 (153)
 64 protein:vir:9928 Length: 118 #  23.5     2.3  0.0014   18.6   8.5   99     1-122  1-118 (118)
 65 protein:vir:9576 Length: 131 #  21.5     2.6  0.0016   18.3   7.9  113     1-126  1-131 (131)
 66 protein:vir:9877 Length: 114 #  20.9     2.7  0.0017   18.2   9.3  101     1-122  1-114 (114)

No
1 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=100.00 E-value=8.8e-48 Score=278.55 Aligned_cols=131 Identities=59% Similarity=1.019 Sum_probs=124.4 Q ss_pred CCH-HHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccceeee Q lcl|NC_019509. 1 MNE-NILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASF 79 (131) Q Consensus 1 m~~-~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~ 79 (131) ||+ +++|+||++||||++|||++|+.|+++|+++|++++|||.++++++|||||+|.|++..++++.+..+++++|+|+ T Consensus 1 m~d~~~ve~Fr~l~PeF~~vpde~l~~~~~~A~~~i~~~~~g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~grv~ss 80 (134) T protein:vir:79 1 MNDIEILEQIYKIAPAFKKVDPELIQAWIELAKDFVCEKHFKDKYFRAVALYTLHLMTLDGAMKQESESVESYSHRIASF 80 (134) T ss_pred CchHHHHHHHHHhccccccCCHHHHHHHHHHhhhhhcCCCCChHHHHHHHHHHHHHHhhcccccccccccccccchhhhh Confidence 999 7899999999999999999999999999999999999999999999999999999988888888888889999998 Q ss_pred eeeceeEEeeecCcc--chhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 80 SLSGEFSQTFQSTTG--GDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 80 s~~G~vSvsy~~~~~--~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) +++|+|||||+.+++ ++.||++|||||+||+|+|++++||++++++++|||. 
T Consensus 81 st~G~vSvS~a~ps~~~~~~Wl~~TpYGq~y~~L~k~~~GGf~~~t~~~~~~~r 134 (134) T protein:vir:79 81 SLTGEFSQTFSKVSDDTSGNTLRQTPWGKMYEVLNKKKGGGFGLTTAFHRRCSR 134 (134) T ss_pred hhhcceeeeccCcccchhHHHHhcCHHHHHHHHHHHhhccchHhhhhccccCCC Confidence 889999999987764 5679999999999999999999999999999999999 No 2 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=100.00 E-value=6.4e-47 Score=273.83 Aligned_cols=130 Identities=54% Similarity=0.970 Sum_probs=121.0 Q ss_pred CCHHH----HHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccce Q lcl|NC_019509. 1 MNENI----LLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRV 76 (131) Q Consensus 1 m~~~t----i~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v 76 (131) ||++| +|+||++||||+++||++|+.|+++|+++|+.++|||.+++++++||||||++++..++++.+..+.+++| T Consensus 1 ~~~~~~~~~ve~fR~l~PeF~dvPde~i~~~~d~A~~~v~~~~~Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~rv 80 (136) T protein:vir:10 1 MNQETLIAVVEQMRKLVPALRKVPDETLYAWVEMAELFVCQKTFKDAYVKALALYALHLAFLDGALKGEDEDLESYSRRV 80 (136) T ss_pred CCchHHHHHHHHHHHhccccccCCHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHhcccccccccccccccccce Confidence 99996 79999999999999999999999999999999999999999999999999999998888888888889999 Q ss_pred eeeeeeceeEEeeecCcc--chhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCC Q lcl|NC_019509. 
77 ASFSLSGEFSQTFQSTTG--GDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCC 130 (131) Q Consensus 77 ~S~s~~G~vSvsy~~~~~--~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~ 130 (131) +|++.+|+|||||+.+++ |+.||++|||||+||+|+|++++||++++|.+|||- T Consensus 81 ~ssat~GevSVS~a~~s~~~s~~WL~~TpyGq~y~aL~k~~~gGf~l~t~~~~~c~ 136 (136) T protein:vir:10 81 TSFSLSGEFSQTFGEVTKNQSGDMMLSTPWGKMFEQLKARRRGRFALMTGLRGGCH 136 (136) T ss_pred ehheeccceeEeeccccCchhhHhhhcCHHHHHHHHHHhhcccchhhhhcccccCC Confidence 998888999999987654 567999999999999999999999999997777766 No 3 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=100.00 E-value=9.4e-43 Score=250.98 Aligned_cols=116 Identities=24% Similarity=0.318 Sum_probs=105.2 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccceeeee Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFS 80 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s 80 (131) || ||++||++||||+++||++|+.||++|+++|++++||++++++++|||||+|+|++....++ +..+|+|+|++ T Consensus 1 m~--t~~~Fr~~~PeF~~~pd~~i~~~l~~A~~~l~~~~~g~~~~~~~~L~~AH~l~l~~~~~~~~---g~~~g~v~S~s 75 (119) T protein:vir:52 1 MP--LTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSKVQWGKLYDRGVMALTAHLLKLSADAEISG---GAANRNLASES 75 (119) T ss_pred CC--cHHHHHHhhhhccCCCHHHHHHHHHHHHHhhCCcCCchHHHHHHHHHHHHHHHhhhhhhccc---cccccceeeee Confidence 88 78999999999999999999999999999999999999999999999999999976554433 35679999999 Q ss_pred eeceeEEeeecC---ccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 
81 LSGEFSQTFQST---TGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 81 ~~G~vSvsy~~~---~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) +|+|||||+++ +.+++||++||||||||+|+|++|+|++|+ T Consensus 76 -~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g~Gg~Va 119 (119) T protein:vir:52 76 -AGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) T ss_pred -ecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhcCCCcCC Confidence 69999999865 457889999999999999999999998887 No 4 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=100.00 E-value=4e-43 Score=253.02 Aligned_cols=131 Identities=51% Similarity=0.972 Sum_probs=128.6 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccceeeee Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFS 80 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s 80 (131) ||++++|.||.+||+|+++||++|+.|+|.|+++|+.+.|||.+++|+.||||||+++++...+++.+....+.+|+|.| T Consensus 1 ~~~~~~e~~R~l~P~f~kvpdevI~~wielA~lfVc~~~~g~~~~~AlaL~taHLm~~dga~k~en~~~~t~S~rvaS~S 80 (132) T protein:vir:10 1 MNDAILAFMRSLVPALKAVDDESINVWIDLARLYVCADKFGNDADRAVGLYALHLMLSDGAFKGENEGLETYSRRMASYS 80 (132) T ss_pred CchHHHHHHHHhcchhhcCChHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHhhccccccccccchhhhhhhhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999988888999999999 Q ss_pred eeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 81 LSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 81 ~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) .+|++|+||++++++++|+.+|||||.|+.|+|+.++||+|+|++++|||. 
T Consensus 81 l~Ge~Sisf~~~sa~~s~L~~tp~Gkl~~~L~k~~~GgfgL~t~~~~~~cg 131 (132) T protein:vir:10 81 LSGEFSITYDNQSAIQGDLSSSSWGRMYKALLRKKGGGFGLITSAAGGGCG 131 (132) T ss_pred ccCceeeecccccccccccccCcHHHHHHHHHHhccCccccccccCcCCCC Confidence 999999999999999999999999999999999999999999999999999 No 5 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=100.00 E-value=3.5e-39 Score=231.39 Aligned_cols=125 Identities=27% Similarity=0.368 Sum_probs=103.9 Q ss_pred CCHH-HHHHHHHhhhhhc---CCCHHHHHHHHHHHHHHhCCCCC-----chHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_019509. 1 MNEN-ILLIIRQLAPPMK---KIPDETIEAWVEMAKLFVCESKF-----GDDYDRALALYTLHLMTLEGALKTEKDSVES 71 (131) Q Consensus 1 m~~~-ti~~Fr~~~P~F~---~~pD~~i~~~l~~A~~~v~~~~~-----g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~ 71 (131) |.-+ ++++||++||||+ ++||++|+.||++|+.+|++++| |++++++++|||||+|+|+....+++ + T Consensus 1 m~v~fd~~~Fr~~fPeFad~~~~pd~~i~~~l~~A~~~l~~~~~~~~~~g~~~~~~l~Ll~AHll~l~~~~~~g~----g 76 (147) T protein:vir:10 1 MDHTLDITKFRALFPEFNNDVKYPDALLEQWYAVAGEYLGLTDYACGLNGNTLDLALMQLTAHLMKSATILSSNK----G 76 (147) T ss_pred CceecCHHHHHHhcccccCCccCCHHHHHHHHHHHHHhhccccCCcccChhhHHHHHHHHHHHHHHHHHhhccCC----C Confidence 6554 3899999999998 48999999999999999999999 89999999999999999986554332 3 Q ss_pred cccceeeeeeeceeEEeeecC---ccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCC---------CCCC Q lcl|NC_019509. 72 YTQRVASFSLSGEFSQTFQST---TGGDKSLSATPWGEMYRALNRKKGGGFGLITGLRR---------GCCE 131 (131) Q Consensus 72 ~~g~v~S~s~~G~vSvsy~~~---~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~---------~~~~ 131 (131) ++|+|+|++ +|+|||||+.+ +.+++||++||||||||+|+|+++.|.+++ ||-. |=-. 
T Consensus 77 ~~G~v~Sas-~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~~l~~~~~~Gg~vv-gG~p~r~a~r~vgg~f~ 146 (147) T protein:vir:10 77 APMVMTSAT-IDKVSISTLAPPIKNGWQYWLSTTPYGQMLWALLSMRSSGGFVY-GGSPELSGYRRIGGVFK 146 (147) T ss_pred cccceeeee-ecceeeeeecCCCCCcchhhhhcCHHHHHHHHHHHhhCccceec-CCCCccccccccCceeC Confidence 578999999 59999999965 346789999999999999999999985554 4322 2222 No 6 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=100.00 E-value=1.1e-39 Score=234.28 Aligned_cols=123 Identities=46% Similarity=0.865 Sum_probs=119.9 Q ss_pred HHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccceeeeeeeceeEEe Q lcl|NC_019509. 9 IRQLAPPMKKIPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQT 88 (131) Q Consensus 9 Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvs 88 (131) +|.+||+|+++|||+|+.|+|.|++|||.+.|||.+.+|+.|||+|||.++++.++++.+....+++|+|.+.+|++|+| T Consensus 1 mR~l~P~f~~vpdevi~~wid~A~lFVC~~~fg~~~~~Al~lytlHLm~~dga~k~e~~~~~~~s~r~~s~slsGE~Sit 80 (125) T protein:vir:10 1 MRTLYPPLKSQPDDVLNAWIEVAKLFICLDKFGDKQVQALAFYTLHLLSQDIALKTENDSSQTSSERVKSYSLSGEYTIS 80 (125) T ss_pred CccccchhhccCHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccccccccccccceeeeeeccceEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCccch--hhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 89 FQSTTGGD--KSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 89 y~~~~~~~--~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) |++++++. .||.+||||+.||+|+|+.++||+|+|++++|||. 
T Consensus 81 ~~~~s~d~s~~~L~~T~wGk~~~~L~k~~~GgFaL~T~~~~~~cr 125 (125) T protein:vir:10 81 YDTSTAAASSSNLEESSWGKLYIDLMRLKVGRWGLITSGGSRCCR 125 (125) T ss_pred cccccccccccccccCchHHHHHHHHHhcCCceeeeccccccCCC Confidence 99988765 59999999999999999999999999999999999 No 7 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=100.00 E-value=6.1e-39 Score=230.08 Aligned_cols=129 Identities=27% Similarity=0.335 Sum_probs=105.4 Q ss_pred CCHHH--HHHHHHhhhhhc---CCCHHHHHHHHHHHHHHhCCCCC------chHHHHHHHHHHHHHHHHhhhhcccc-cc Q lcl|NC_019509. 1 MNENI--LLIIRQLAPPMK---KIPDETIEAWVEMAKLFVCESKF------GDDYDRALALYTLHLMTLEGALKTEK-DS 68 (131) Q Consensus 1 m~~~t--i~~Fr~~~P~F~---~~pD~~i~~~l~~A~~~v~~~~~------g~~~~~a~~l~~AHll~l~~~~~~~~-~~ 68 (131) |...| +++||++||||+ ++||++|+.||++|+.++++++| |+.++++++|||||+|+|+.....++ +. T Consensus 1 m~~~~fd~~~Fr~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~~~~~~g~~~~~~l~Ll~AH~l~L~~~~~~~~~~a 80 (153) T protein:vir:99 1 MADPVYNDGLFRIMYPEFADQEKYPPEVIEIYYDTATLFITGSMFPCAALSGKQLVGALNMLTAHLMSLSMQRSQTALGA 80 (153) T ss_pred CCcccCChHHHHHhcccccCccccCHHHHHHHHHHHHHhhcCccccccccChHHHHHHHHHHHHHHHHHHhhhhcccccC Confidence 88775 789999999998 58999999999999999997654 79999999999999999965444333 33 Q ss_pred ccccccceeeeeeeceeEEeeecCc---cchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 
69 VESYTQRVASFSLSGEFSQTFQSTT---GGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 69 ~~~~~g~v~S~s~~G~vSvsy~~~~---~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) .++.+|+|+|++ +|+|||||+.++ .+++||++||||||||+|+|++++| ++++||-.=.-- T Consensus 81 ~~~~~G~vsSas-~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fw~l~~~~~~G-g~v~gg~pe~~~ 144 (153) T protein:vir:99 81 TNDQGGYTLSAT-IGEVSVSKMAPPAKDGWEFWLAQTPYGQALWALLKMLSVG-GFAIGGLPERTG 144 (153) T ss_pred CCccccceeeee-ecceeeeeecCCCCCchhHhhhcCHHHHHHHHHHHHhccc-ccccCCCCcccc Confidence 345789999999 599999999653 4678999999999999999999998 666655331111 No 8 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=100.00 E-value=1.1e-38 Score=228.64 Aligned_cols=129 Identities=22% Similarity=0.175 Sum_probs=103.3 Q ss_pred CCHHHHHHHHHhhhhhc---CCCHHHHHHHHHHHHHHhCC------CCCchHHHHHHHHHHHHHHHHhhhhcccc----- Q lcl|NC_019509. 1 MNENILLIIRQLAPPMK---KIPDETIEAWVEMAKLFVCE------SKFGDDYDRALALYTLHLMTLEGALKTEK----- 66 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~---~~pD~~i~~~l~~A~~~v~~------~~~g~~~~~a~~l~~AHll~l~~~~~~~~----- 66 (131) |+--.+++||++||||+ ++||++|+.|+++|+++|++ .+||++++++++|||||+|+|.....+++ T Consensus 1 ~v~fd~~~FR~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~s~~~~g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~ 80 (155) T protein:vir:96 1 MVIFDEQKFRTLFPEFADPASYPAVRLQLYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) T ss_pred CcccCHHHHHHhCccccCcccCCHHHHHHHHHHHHHhhcCCCccccccChHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 55555999999999998 58999999999999999975 46799999999999999999976543322 Q ss_pred ccccccccceeeeeeeceeEEeeecCc---cchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 
67 DSVESYTQRVASFSLSGEFSQTFQSTT---GGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 67 ~~~~~~~g~v~S~s~~G~vSvsy~~~~---~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) +..+..+|+|+|++ +|+|||||+.++ .+++||++||||||||+|+|++++|.++ +||..=.-- T Consensus 81 ~~~g~~~G~vsSas-~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~~~~~~Gg~~-vgG~per~~ 146 (155) T protein:vir:96 81 TAGGTQGGFITSAT-VGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGGFY-IGGLPERRG 146 (155) T ss_pred ccccccccceeece-ecceeeeeecCCCCCchhHHhhcCHHHHHHHHHHHHhcccccc-cCCCCcccc Confidence 22346789999999 599999998754 4678999999999999999999998444 443321111 No 9 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=100.00 E-value=2.1e-38 Score=227.15 Aligned_cols=130 Identities=20% Similarity=0.257 Sum_probs=100.9 Q ss_pred CCHHH--HHHHHHhhhhhcCCCHHHHHHHHHHHHHHh-CCCCC-----chHHHHHHHHHHHHHHHHhhhhccc--ccccc Q lcl|NC_019509. 1 MNENI--LLIIRQLAPPMKKIPDETIEAWVEMAKLFV-CESKF-----GDDYDRALALYTLHLMTLEGALKTE--KDSVE 70 (131) Q Consensus 1 m~~~t--i~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v-~~~~~-----g~~~~~a~~l~~AHll~l~~~~~~~--~~~~~ 70 (131) |...+ +++||++||||+++||++|+.||++|+.++ ++++| ++.++++++|||||+|+|++...+. .+..+ T Consensus 1 M~~~~Fd~~~FR~~fPeFa~~Pd~~i~~~l~~A~~~~l~~~~~s~~~~~~~~~~~l~LltAHll~L~~~~~a~~~~~~~~ 80 (167) T protein:vir:94 1 MAVVVFDPTAFKLVYPEFVAVPDARLTALFNTVGYTILDNTDASVIVDPLRRAPLLDLLVAHMLALFGYVNADGSITPGT 80 (167) T ss_pred CCcccCChHHHHHhchhcccCCHHHHHHHHHHHHHhhcCCCCcccccchhhHHHHHHHHHHHHHHHhhhhhhhccccccc Confidence 88775 789999999999999999999999998654 54443 4678899999999999997654332 23344 Q ss_pred ccccceeeeeeeceeEEeeecCc---cchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 71 SYTQRVASFSLSGEFSQTFQSTT---GGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 71 ~~~g~v~S~s~~G~vSvsy~~~~---~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) +.+|+|+|++ +|+|||||+.+. 
.+++||++||||||||+|+|++++|..++.|.-+-.-| T Consensus 81 g~~G~vsSas-~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fwaL~~~~g~Gg~v~gG~~~~~~~ 143 (167) T protein:vir:94 81 GTVGRVANAS-EGSVSTSLAYSTPTGAGEAWFTQTPYGAMYWAMSAPFRSFHYVAAGLSGVGYS 143 (167) T ss_pred ccchheeecc-ccceeeeeecCCCCCchhhhhhcCHHHHHHHHHHHHhcccccccCCCCCCCCC Confidence 5678999999 599999998654 46789999999999999999999983332222211112 No 10 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=100.00 E-value=1.4e-36 Score=217.23 Aligned_cols=127 Identities=15% Similarity=0.137 Sum_probs=98.3 Q ss_pred CCHH------HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCC------CchHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_019509. 1 MNEN------ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESK------FGDDYDRALALYTLHLMTLEGALKTEKDS 68 (131) Q Consensus 1 m~~~------ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~------~g~~~~~a~~l~~AHll~l~~~~~~~~~~ 68 (131) |..- -+++||++||||+++||++|+.|+++|+.++.+++ .++.++++++|||||+|+|......+++ T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~~a~- 79 (158) T protein:vir:78 1 MSTPPYRITFDPAGFIAEYPEFATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFGATPTSAN- 79 (158) T ss_pred CCCCCceEEcChHHHHHhchhhccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHHHHhHhhhcccc- Confidence 5432 37999999999999999999999999998775432 2567899999999999999754433322 Q ss_pred ccccccceeeeeeeceeEEeeecCc----cchhhhhcCHHHHHHHHHHHHhCCCCeEeecCC-CCCCC Q lcl|NC_019509. 
69 VESYTQRVASFSLSGEFSQTFQSTT----GGDKSLSATPWGEMYRALNRKKGGGFGLITGLR-RGCCE 131 (131) Q Consensus 69 ~~~~~g~v~S~s~~G~vSvsy~~~~----~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~-~~~~~ 131 (131) ++.+|+|+|++ +|+||||||.++ .+++||++||||||||+|++++++| ++++||- .+.-- T Consensus 80 -~g~~G~isSas-~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~G-gy~~gg~pe~~~~ 144 (158) T protein:vir:78 80 -SRPPGRLSSAA-EGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSA-RYMVSGGSGIGTA 144 (158) T ss_pred -CCcccceeeee-ecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhccc-ccccccCCcccce Confidence 35689999988 599999998532 3568999999999999999999998 5555443 11111 No 11 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=100.00 E-value=1.4e-36 Score=217.23 Aligned_cols=127 Identities=15% Similarity=0.137 Sum_probs=98.3 Q ss_pred CCHH------HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCC------CchHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_019509. 1 MNEN------ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESK------FGDDYDRALALYTLHLMTLEGALKTEKDS 68 (131) Q Consensus 1 m~~~------ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~------~g~~~~~a~~l~~AHll~l~~~~~~~~~~ 68 (131) |..- -+++||++||||+++||++|+.|+++|+.++.+++ .++.++++++|||||+|+|......+++ T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~~a~- 79 (158) T protein:vir:10 1 MSTPPYRITFDPAGFIAEYPEFATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFGATPTSAN- 79 (158) T ss_pred CCCCCceEEcChHHHHHhchhhccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHHHHhHhhhcccc- Confidence 5432 37999999999999999999999999998775432 2567899999999999999754433322 Q ss_pred ccccccceeeeeeeceeEEeeecCc----cchhhhhcCHHHHHHHHHHHHhCCCCeEeecCC-CCCCC Q lcl|NC_019509. 
69 VESYTQRVASFSLSGEFSQTFQSTT----GGDKSLSATPWGEMYRALNRKKGGGFGLITGLR-RGCCE 131 (131) Q Consensus 69 ~~~~~g~v~S~s~~G~vSvsy~~~~----~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~-~~~~~ 131 (131) ++.+|+|+|++ +|+||||||.++ .+++||++||||||||+|++++++| ++++||- .+.-- T Consensus 80 -~g~~G~isSas-~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~G-gy~~gg~pe~~~~ 144 (158) T protein:vir:10 80 -SRPPGRLSSAA-EGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSA-RYMVSGGSGIGTA 144 (158) T ss_pred -CCcccceeeee-ecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhccc-ccccccCCcccce Confidence 35689999988 599999998532 3568999999999999999999998 5555443 11111 No 12 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=100.00 E-value=2.2e-36 Score=216.02 Aligned_cols=127 Identities=15% Similarity=0.127 Sum_probs=98.5 Q ss_pred CCHH------HHHHHHHhhhhhcCCCHHHHHHHHHHHHHH-hCCCCC-----chHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_019509. 1 MNEN------ILLIIRQLAPPMKKIPDETIEAWVEMAKLF-VCESKF-----GDDYDRALALYTLHLMTLEGALKTEKDS 68 (131) Q Consensus 1 m~~~------ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~-v~~~~~-----g~~~~~a~~l~~AHll~l~~~~~~~~~~ 68 (131) |..- -+++||++||||+++||++|+.|+++|+.+ +++.++ ++.++++++|||||+|+|+....++ + T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~g--~ 78 (158) T protein:vir:36 1 MSTPPYRITFDPAGFIAEYPEFATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFSAAPTS--A 78 (158) T ss_pred CCCCCceEEcChHHHHHhCcccccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHHHHhhhhhcc--c Confidence 5432 379999999999999999999999999875 555443 4678899999999999997654333 3 Q ss_pred ccccccceeeeeeeceeEEeeecC----ccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCC-CCCCC Q lcl|NC_019509. 69 VESYTQRVASFSLSGEFSQTFQST----TGGDKSLSATPWGEMYRALNRKKGGGFGLITGLR-RGCCE 131 (131) Q Consensus 69 ~~~~~g~v~S~s~~G~vSvsy~~~----~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~-~~~~~ 131 (131) .++.+|+|+|++ +|+|||||+.. 
+.+++||++||||||||+|++++++| ++++||- .+.-- T Consensus 79 ~~g~vG~vsSas-~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~G-g~v~Gg~pe~~~~ 144 (158) T protein:vir:36 79 NSRPPGRLSSAT-EGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSA-RYMVSGGSGIGTA 144 (158) T ss_pred ccCcccceeeee-eCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCcc-ccccccCCcccce Confidence 345679999999 59999999842 34578999999999999999999998 5555443 11111 No 13 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=100.00 E-value=2.2e-36 Score=216.02 Aligned_cols=127 Identities=15% Similarity=0.127 Sum_probs=98.5 Q ss_pred CCHH------HHHHHHHhhhhhcCCCHHHHHHHHHHHHHH-hCCCCC-----chHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_019509. 1 MNEN------ILLIIRQLAPPMKKIPDETIEAWVEMAKLF-VCESKF-----GDDYDRALALYTLHLMTLEGALKTEKDS 68 (131) Q Consensus 1 m~~~------ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~-v~~~~~-----g~~~~~a~~l~~AHll~l~~~~~~~~~~ 68 (131) |..- -+++||++||||+++||++|+.|+++|+.+ +++.++ ++.++++++|||||+|+|+....++ + T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll~L~~~~~~g--~ 78 (158) T protein:vir:10 1 MSTPPYRITFDPAGFIAEYPEFATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLLTLFSAAPTS--A 78 (158) T ss_pred CCCCCceEEcChHHHHHhCcccccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHHHHhhhhhcc--c Confidence 5432 379999999999999999999999999875 555443 4678899999999999997654333 3 Q ss_pred ccccccceeeeeeeceeEEeeecC----ccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCC-CCCCC Q lcl|NC_019509. 69 VESYTQRVASFSLSGEFSQTFQST----TGGDKSLSATPWGEMYRALNRKKGGGFGLITGLR-RGCCE 131 (131) Q Consensus 69 ~~~~~g~v~S~s~~G~vSvsy~~~----~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~-~~~~~ 131 (131) .++.+|+|+|++ +|+|||||+.. 
+.+++||++||||||||+|++++++| ++++||- .+.-- T Consensus 79 ~~g~vG~vsSas-~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~G-g~v~Gg~pe~~~~ 144 (158) T protein:vir:10 79 NSRPPGRLSSAT-EGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSA-RYMVSGGSGIGTA 144 (158) T ss_pred ccCcccceeeee-eCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCcc-ccccccCCcccce Confidence 345679999999 59999999842 34578999999999999999999998 5555443 11111 No 14 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=99.75 E-value=1.3e-21 Score=135.06 Aligned_cols=110 Identities=23% Similarity=0.365 Sum_probs=94.5 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccccceeee Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASF 79 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~ 79 (131) |+ +||+..|.+.|+++.+||+.|+.+|++|..++.++.|++ .+|++.++|+|||+++++ .+|+|+ T Consensus 1 m~-ttv~~vkl~a~~L~~~sDDsl~~~I~dA~~e~~a~gFp~~~~e~a~rYLa~HLat~~~-------------~~v~sE 66 (111) T protein:vir:80 1 MK-TDVSKLKLTASSLASVSDDSLQVHIDDSYLEVQEKGFPEKFEERANRYLAAHLATLAN-------------KNVKSE 66 (111) T ss_pred Cc-hhHHHHHHhhHhhcCCChHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHHhcC-------------CCCchh Confidence 76 669999999999999999999999999999999999985 679999999999999952 357888 Q ss_pred eeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCC Q lcl|NC_019509. 80 SLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLITGLR 126 (131) Q Consensus 80 s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~ 126 (131) +| |++...|...++. 
.||..|+|||+||+|++.++.|..+...-+ T Consensus 67 ~V-~~Lk~~Y~~~~~~-~~l~~s~wGq~Y~rL~k~~~~gs~~~~vVv 111 (111) T protein:vir:80 67 AV-GSLKREYYEVKGD-SGLLSTEYGQEYARLLKEANGGSGISMVVV 111 (111) T ss_pred hh-hhHHHHhhhcccc-cccccchhHHHHHHHHHHhcCCccceeeeC Confidence 86 7888898855553 899999999999999999998865543333 No 15 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=96.54 E-value=4.6e-05 Score=44.35 Aligned_cols=115 Identities=15% Similarity=0.096 Sum_probs=67.2 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCc---------------hHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFG---------------DDYDRALALYTLHLMTLEGALKTE 65 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g---------------~~~~~a~~l~~AHll~l~~~~~~~ 65 (131) |+.-|.+.|+..| ....+|++.+..++..|...||.--++ +.-++|+...+-++... +... T Consensus 1 M~Y~d~~~Y~~~y-~g~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~-g~~s-- 76 (131) T protein:vir:43 1 MPYTTLEFYNDEY-AGEHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQIEYFKEA-GGTS-- 76 (131) T ss_pred CCCCCHHHHHHhh-CCCCCCHhHHHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHHHHHHHh-HHHh-- Confidence 9999999999988 445799999999999999999743221 11234555555555433 2211 Q ss_pred cccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 66 KDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 66 ~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) ....+.++|.++ |..||||.+.+....--.....=+.-..+++.. 
|+ ..+|+|- T Consensus 77 ----~~~~~~~~S~sv-G~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~T--GL-----lyrGV~~ 130 (131) T protein:vir:43 77 ----ELAVSKPDNVSI-GRTSISDSNFASTATSLNSGLIGSDVRSYLAHT--GL-----LYNGVGV 130 (131) T ss_pred ----hhhccccCeeec-CceEEeecccccchhhhchhhhHHHHHHHHhcc--CC-----eecCCCC Confidence 122345678885 999999976443221111111222333344333 32 2345555 No 16 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=95.84 E-value=0.00019 Score=40.96 Aligned_cols=115 Identities=15% Similarity=0.089 Sum_probs=66.0 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCc---------------hHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFG---------------DDYDRALALYTLHLMTLEGALKTE 65 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g---------------~~~~~a~~l~~AHll~l~~~~~~~ 65 (131) |+.-|.+.|+..|.- ..+|++.+..++..|...||.--++ +.-++|+...+-++... +... T Consensus 1 M~Y~d~~~Y~~~y~G-~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~-g~~~-- 76 (131) T protein:vir:80 1 MPYTTLEFYTNEYAG-EHLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQIEYFKEA-GGTS-- 76 (131) T ss_pred CCCCCHHHHHHhhCC-CCCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHHHHHHHh-hhhh-- Confidence 999999999998833 3589999999999999999743221 11234555555544433 2211 Q ss_pred cccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCCCC Q lcl|NC_019509. 66 KDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGCCE 131 (131) Q Consensus 66 ~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~~~ 131 (131) ....+.++|.++ |..||||.+.+....--....--+.-..+++.. 
|+ ..+|+|- T Consensus 77 ----~~~~~~~~S~sv-G~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~T--GL-----lyrGV~~ 130 (131) T protein:vir:80 77 ----ELAVSKPDNVSI-GRTSISDSNFASTATSLNSGLVGSDVRSYLAHT--GL-----LYNGVGV 130 (131) T ss_pred ----hhcccccCeeee-CceEEeeccccchhhhhhhhhhHHHHHHHHhcc--CC-----eecCCCC Confidence 122345678886 999999976433211111111222333444433 32 2345555 No 17 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=95.72 E-value=0.00017 Score=41.32 Aligned_cols=105 Identities=10% Similarity=0.003 Sum_probs=60.9 Q ss_pred HHHHHHHhhhhh-----cC------CCHH-HHHHHHHHHHHHhC----CCCCchHHHH-----HHHHHHHHHHHHhhhhc Q lcl|NC_019509. 5 ILLIIRQLAPPM-----KK------IPDE-TIEAWVEMAKLFVC----ESKFGDDYDR-----ALALYTLHLMTLEGALK 63 (131) Q Consensus 5 ti~~Fr~~~P~F-----~~------~pD~-~i~~~l~~A~~~v~----~~~~g~~~~~-----a~~l~~AHll~l~~~~~ 63 (131) +|+..++.-.-. -+ ..|+ .|++.++.+...|- -..+++-.+. ++.+|..|.+. . T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~~d~~~kD~~vl~faie~v~~~IlnycNikeiP~~Le~v~~~maiDll~~e~~~-----~ 75 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMRQDNYFKDMEVLHYALTQAENEILNYIHQDSVPGRLENVWIDMTNDLLDKVKEQ-----S 75 (131) T ss_pred ChhhhhhhhhhhhhhhhhccccccccchHHHHHHHHHHHHHHHhhhcCCcccchhhHHHHHHHHHHHHhhhccc-----c Confidence 566666622211 11 2344 68999999988663 2334544443 44444444321 1 Q ss_pred cccccccccccceeeeeeeceeEEeeecCccchhh-----hhcCHHHHHHHHHHHHh Q lcl|NC_019509. 64 TEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKS-----LSATPWGEMYRALNRKK 115 (131) Q Consensus 64 ~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~-----l~~T~YGq~y~~L~~~~ 115 (131) ......+...+.|+|-+ +||.||||.+++....- =-.+.|.+||.+.+|+. 
T Consensus 76 ~k~~~i~~~~g~VsSI~-eGDTsIsf~s~t~~~qrl~~~~s~l~~Y~~qL~~yRRL~ 131 (131) T protein:vir:10 76 VLAEKAGADDFSVKSIK-MGDTTIEKVSPYEMIQRMKQVPSSLERYKRQLNRFRKLL 131 (131) T ss_pred cccccccccccceeeee-ecceeeeccCCccHHHHHHHHHHHHhhhHHHHhhhcccC Confidence 11222334556677777 79999999755432111 23567999999999998 No 18 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=92.75 E-value=0.0041 Score=33.68 Aligned_cols=116 Identities=17% Similarity=0.077 Sum_probs=63.4 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCCc-----------h----HHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKFG-----------D----DYDRALALYTLHLMTLEGALKTE 65 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~g-----------~----~~~~a~~l~~AHll~l~~~~~~~ 65 (131) |+.-|.+.|++.+. ..+|++.++.++..|...||.--++ + .-.+|+++.+-++-.. +... T Consensus 1 M~Y~t~~~Y~~~~G--~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qiey~~~~-G~~s-- 75 (132) T protein:vir:98 1 MPYLTYEEFMDLNG--RDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQIEYFDAL-GATT-- 75 (132) T ss_pred CCCCCHHHHHhhcC--CCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHHHHHHhc-cchh-- Confidence 99999999986433 3689999999999999999742221 1 1234555544444322 2111 Q ss_pred cccccccccceeeeeeeceeEEeeecCcc-chhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCC Q lcl|NC_019509. 66 KDSVESYTQRVASFSLSGEFSQTFQSTTG-GDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGC 129 (131) Q Consensus 66 ~~~~~~~~g~v~S~s~~G~vSvsy~~~~~-~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~ 129 (131) .....+.++|.++ |..||||..... 
...-.+.-..-+.-..+++..| |.-.||+.= T Consensus 76 ---ae~~~~~~~S~sv-G~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tG----LLyrGV~~~ 132 (132) T protein:vir:98 76 ---FEEINNSPQTFQA-GRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTG----LLFQGVKTW 132 (132) T ss_pred ---hhhccCccceeee-CcEEEEeeccCCcccccccccchHHHHHHHHhhcC----CccccCCCC Confidence 1122345788886 999999963221 1111111111234445555443 222333333 No 19 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=89.44 E-value=0.026 Score=29.24 Aligned_cols=119 Identities=14% Similarity=0.146 Sum_probs=61.4 Q ss_pred CC--------------HHHHHHHHHhhhhhc-CCCHHHHHHHHHHHHHHhCC--CCC-c--------------------- Q lcl|NC_019509. 1 MN--------------ENILLIIRQLAPPMK-KIPDETIEAWVEMAKLFVCE--SKF-G--------------------- 41 (131) Q Consensus 1 m~--------------~~ti~~Fr~~~P~F~-~~pD~~i~~~l~~A~~~v~~--~~~-g--------------------- 41 (131) |. .-+++++++.+.+.. .+|++..+..|-.|..+|+. .+| | T Consensus 1 Malived~~g~~~anSYvt~~~a~aY~~~rg~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~ 80 (172) T protein:vir:80 1 MALIVEDGTGKPDANTYAGADFVIAYAQARGVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGF 80 (172) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcc Confidence 21 126888887766644 58999999999999999985 223 2 Q ss_pred --------hHHHHHHHHHHHHHHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhh-h---hcCHHHHHHH Q lcl|NC_019509. 42 --------DDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKS-L---SATPWGEMYR 109 (131) Q Consensus 42 --------~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~-l---~~T~YGq~y~ 109 (131) +..++|.+.++.-+ +++. ..-.......|+|+++ |+++++|+.......- . ..+.|-. .. 
T Consensus 81 ~~~~~~IP~~v~~A~~elA~~~--~~g~----~~~~~~~~~~v~~ekV-G~i~~eY~~~~~~~~~~~~~~~~~~~~~-v~ 152 (172) T protein:vir:80 81 VIPSDVIPKELQSAVAAAVIEQ--VNGF----ELQQSQDQWAVRIEKV-DVIEVQYAAGGGGQSASANAPMKPTFPK-ID 152 (172) T ss_pred cccccchhHHHHHHHHHHHHHH--hcCC----ccCcCCCCceeeEEec-cceEEeeecccCccccccccCCccchHH-HH Confidence 12244555555422 2211 1111122335889997 9999999854332111 1 1222322 22 Q ss_pred HHHHH--hCCCCeEeecCCCC Q lcl|NC_019509. 110 ALNRK--KGGGFGLITGLRRG 128 (131) Q Consensus 110 ~L~~~--~g~G~~l~~g~~~~ 128 (131) +|++- .+.|++.+. -|+| T Consensus 153 ~LL~p~l~~~gg~~~~-~vrg 172 (172) T protein:vir:80 153 ALLNPLLVGDGGLFLV-AVRG 172 (172) T ss_pred HHHhhhhcCCCCeeee-eecC Confidence 23332 233333332 3344 No 20 >protein:vir:79050 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:6416 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110727;genbank:gi:134287344;genbank:GeneID:4955224 Probab=88.22 E-value=0.012 Score=31.18 Aligned_cols=113 Identities=12% Similarity=0.003 Sum_probs=64.8 Q ss_pred CCHHHHHHHHHhhhhhcC----CCHHHHHHHHHHHHHHhCCC----CCch-HHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKK----IPDETIEAWVEMAKLFVCES----KFGD-DYDRALALYTLHLMTLEGALKTEKDSVES 71 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~----~pD~~i~~~l~~A~~~v~~~----~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~ 71 (131) |--+.+++++++.-+|.- -++..|++.++.++..|.+. .++. +.......-+.-++...-.. +....++ T Consensus 1 ~~~~i~e~i~~~Lk~~~~~~~~~d~~iL~fa~e~~~n~I~N~cNi~eiP~~L~~v~~~mai~~fl~~kk~~--~~~~l~~ 78 (133) T protein:vir:79 1 MGNNIIDDIEKRLESFGYILKDGDKWLIDFVREKIENIIKLDCNIKTMPIELKEIEADMIVGEFLFTKKNM--GQLDIES 78 (133) T ss_pred CCchHHHHHHHHHHHhCCCCCccchHHHHHHHHHHHHHHhhhcChhhcchhHHHHHHHHHHHHHHhccccc--CCCCccc Confidence 999999999999988863 46778888999998766432 2333 22222223333333321100 0000011 Q ss_pred c--ccceeeeeeeceeEEeeecCccc-------hhh--hhcCHHHHHHHHHHHHhC Q lcl|NC_019509. 
72 Y--TQRVASFSLSGEFSQTFQSTTGG-------DKS--LSATPWGEMYRALNRKKG 116 (131) Q Consensus 72 ~--~g~v~S~s~~G~vSvsy~~~~~~-------~~~--l~~T~YGq~y~~L~~~~g 116 (131) . -+.|.|-+ +||.||+|...++. ..| +-.+-|..++.+.||+.= T Consensus 79 ~D~~~~v~sIk-eGDTsv~f~~~~~s~t~eq~l~s~i~~L~~~~k~~l~~yRkLrW 133 (133) T protein:vir:79 79 INFEAVEKSIS-EGDTKVDFAIGSGSQTPEQRFDSLIAYLTAYGKNKILTFRCLRW 133 (133) T ss_pred ccchhhhhhee-cccceeecccCCCccchhHHHHHHHHHHhhcccchhhccccccC Confidence 1 12244545 79999999755443 234 335666778888888754 No 21 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=85.00 E-value=0.056 Score=27.44 Aligned_cols=114 Identities=12% Similarity=0.070 Sum_probs=60.3 Q ss_pred CC-------------HHHHHHHHHhhhh------hcCCCHHHHHHHHHHHHHHhCCC-CC-------------------- Q lcl|NC_019509. 1 MN-------------ENILLIIRQLAPP------MKKIPDETIEAWVEMAKLFVCES-KF-------------------- 40 (131) Q Consensus 1 m~-------------~~ti~~Fr~~~P~------F~~~pD~~i~~~l~~A~~~v~~~-~~-------------------- 40 (131) |+ .-+++++++-+.. ....+|+..+..|-.|..+|+.. +| T Consensus 1 m~~i~~~~g~~~AnSYvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~ 80 (170) T protein:vir:94 1 MPTVDATPGSITANSYVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVI 80 (170) T ss_pred CceeecCCCCCcccceecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCccc Confidence 21 1146665554332 23689999999999999999852 22 Q ss_pred ----------chHHHHHHHHHHHHHHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHH Q lcl|NC_019509. 41 ----------GDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRA 110 (131) Q Consensus 41 ----------g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~ 110 (131) ++..++|.+.++.-++ ++.. ......+.|+|+++ |+++++|+..+.. ++.|. 
..++ T Consensus 81 dg~~~~~~~IP~~V~~Aq~elA~~~~--~~~~-----~~~~~~~~v~~~kV-G~i~veY~~~~~~-----~~~~~-~v~~ 146 (170) T protein:vir:94 81 GGMTLSQVSIPVKVKIAVFELAYFML--ESGA-----ALSFADQTIDSVKV-GTIRVEFTKNSTD-----AGLPT-FVEA 146 (170) T ss_pred CccccccchhhHHHHHHHHHHHHHHH--hCcc-----cCcccccceeeEec-ceeEEEecCCCCC-----CccHH-HHHH Confidence 1223455555555333 2211 11222345889997 9999999854432 22333 3356 Q ss_pred HHHHhCCCC------eEeecCCCC Q lcl|NC_019509. 111 LNRKKGGGF------GLITGLRRG 128 (131) Q Consensus 111 L~~~~g~G~------~l~~g~~~~ 128 (131) |++=+..+. +-..--++| T Consensus 147 LL~p~l~~~~~g~~~~~~~~~~r~ 170 (170) T protein:vir:94 147 MLSGFGSPVLYGSNAARSIDLVRA 170 (170) T ss_pred HhhhhhccccccccccceeeeecC Confidence 766544321 111112233 No 22 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=83.89 E-value=0.065 Score=27.10 Aligned_cols=118 Identities=16% Similarity=0.074 Sum_probs=60.2 Q ss_pred CC--------------HHHHHHHHHhhhhhc---CCCHHHHHHHHHHHHHHhCCC--CC-ch------------------ Q lcl|NC_019509. 1 MN--------------ENILLIIRQLAPPMK---KIPDETIEAWVEMAKLFVCES--KF-GD------------------ 42 (131) Q Consensus 1 m~--------------~~ti~~Fr~~~P~F~---~~pD~~i~~~l~~A~~~v~~~--~~-g~------------------ 42 (131) |+ .-+++++++.+-+.. ..+|+..+..|-.|..+|+.- +| |+ T Consensus 1 M~liv~~~~g~~~anSYvt~~ea~aY~~~rg~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~ 80 (169) T protein:vir:95 1 MPLIVETGQGLPNADSYVSLEDGRALAAKYGLELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLH 80 (169) T ss_pred CeeEEeCCCCCCcccccccHHHHHHHHHHcCCcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceec Confidence 33 125777777655432 246788999999999999852 23 21 Q ss_pred -----------HHHHHHHHHHHHHHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHH Q lcl|NC_019509. 
43 -----------DYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRAL 111 (131) Q Consensus 43 -----------~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L 111 (131) ..+.|...++.-++. +. ..-.....+.|.+++++|.++++|+..+...+-.. -+ .-..| T Consensus 81 g~~~~~~~IP~~V~~A~~elA~~~~~--g~----~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~-~~---a~~~L 150 (169) T protein:vir:95 81 GFPQPSNVIPSLVIQAQVMAAVEYGA--GT----DVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVS-IT---AADDA 150 (169) T ss_pred ccccccccchHHHHHHHHHHHHHHHc--Cc----cccCCCCccceeeeeeccceeEeecCCCCcCcccc-HH---HHHHh Confidence 123444444443332 11 11111223467788778999999976443322111 11 12345 Q ss_pred HHHh--CCCCeEeecCCCC Q lcl|NC_019509. 112 NRKK--GGGFGLITGLRRG 128 (131) Q Consensus 112 ~~~~--g~G~~l~~g~~~~ 128 (131) ++-+ +.|.+..+--++| T Consensus 151 L~p~l~g~~g~~~i~~~rg 169 (169) T protein:vir:95 151 LRPLLCGSNNAYSFNVFRG 169 (169) T ss_pred hhhhcccCCCcceeeeecC Confidence 5543 4333333333344 No 23 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=83.51 E-value=0.068 Score=26.99 Aligned_cols=118 Identities=15% Similarity=0.059 Sum_probs=60.8 Q ss_pred CC--------------HHHHHHHHHhhhhhc---CCCHHHHHHHHHHHHHHhCCC--CC-c------------------- Q lcl|NC_019509. 1 MN--------------ENILLIIRQLAPPMK---KIPDETIEAWVEMAKLFVCES--KF-G------------------- 41 (131) Q Consensus 1 m~--------------~~ti~~Fr~~~P~F~---~~pD~~i~~~l~~A~~~v~~~--~~-g------------------- 41 (131) |+ .-+++++++.+-+.. 
..+|+..+..|-.|..+|+.- +| | T Consensus 1 MaliV~~~~g~~~anSYvtv~~a~aY~~~rg~~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~ 80 (169) T protein:vir:78 1 MPLIVETGQGIPNADSYVSLEDGRALAAKYGLELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLH 80 (169) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcCCcCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceec Confidence 33 126788777655433 235888999999999999841 23 2 Q ss_pred ----------hHHHHHHHHHHHHHHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHH Q lcl|NC_019509. 42 ----------DDYDRALALYTLHLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRAL 111 (131) Q Consensus 42 ----------~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L 111 (131) ...+.|...++.-++ ++ ...-.....+.|++++++|.++++|+.+....+ ...| ..-..| T Consensus 81 g~~~~~~~IP~~v~~A~~elA~~~~--~g----~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~---~~~~-~~~~~L 150 (169) T protein:vir:78 81 GFPQPSNVIPPLVIQAQVMAAVEYG--AG----TDVRGSTDGREVQTERVEGAVTVSYFKNGYSGG---TVSI-TTADDA 150 (169) T ss_pred ccccccccchHHHHHHHHHHHHHHh--cC----cccCCCCCcceeEEEEecCceeEeecCCCCCCC---cccH-HHHHHH Confidence 122344444444222 11 111112234568899988999999976443221 1111 122245 Q ss_pred HHHh--CCCCeEeecCCCC Q lcl|NC_019509. 112 NRKK--GGGFGLITGLRRG 128 (131) Q Consensus 112 ~~~~--g~G~~l~~g~~~~ 128 (131) ++-+ +.|++..+--++| T Consensus 151 L~p~l~~~~g~~~i~~~rg 169 (169) T protein:vir:78 151 LRPLLCGSNNAYSFNVFRG 169 (169) T ss_pred hhhhcccCCCcceeeeecC Confidence 5443 3333233333344 No 24 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=83.29 E-value=0.07 Score=26.93 Aligned_cols=118 Identities=13% Similarity=0.034 Sum_probs=58.7 Q ss_pred CCHHHHHHHHHhhhhhc---CCCHHHHHHHHHHHHHHhCC--CCC-c-----------------------------hHHH Q lcl|NC_019509. 
1 MNENILLIIRQLAPPMK---KIPDETIEAWVEMAKLFVCE--SKF-G-----------------------------DDYD 45 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~---~~pD~~i~~~l~~A~~~v~~--~~~-g-----------------------------~~~~ 45 (131) =..-|++++++.+-+.. ..+|+..+..|-.|..+|+. -+| | +..+ T Consensus 17 nSYvtv~ea~aY~~~rg~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v~~~~IP~~V~ 96 (172) T protein:vir:95 17 NSYVSVADARIYASNRGVELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEVPSNVIPKSLI 96 (172) T ss_pred cccccHHHHHHHHHhcCCcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCcccccccchhHHHH Confidence 11125777776554433 35888899999999999984 122 1 2234 Q ss_pred HHHHHHHHHHHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHh---CCCCeEe Q lcl|NC_019509. 46 RALALYTLHLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKK---GGGFGLI 122 (131) Q Consensus 46 ~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~---g~G~~l~ 122 (131) +|.+.++.-+ +++..- .......+.|+|+++ |+++++|+...... +.+.|- .-.+|++-+ ++|...+ T Consensus 97 ~A~~elA~~~--~~~~~~---~~~~~~~~~vk~~kV-G~I~veY~~~~~~~---~~~~~~-~v~~LL~p~l~~~~~~~~~ 166 (172) T protein:vir:95 97 AAQVQLTMAI--NAGFDL---QPNVSPQDYVTREKV-GPIETEYADPLSVG---IMPTFT-AANALLAPLFGECASNKFA 166 (172) T ss_pred HHHHHHHHHH--HcCccc---cccCCcccceeEEec-cceEEeeccCCCCC---CcccHH-HHHHHHhhhhcccCCccee Confidence 4555555311 111100 111122356889997 99999997543321 123332 334555543 1121222 Q ss_pred ecCCCC Q lcl|NC_019509. 
123 TGLRRG 128 (131) Q Consensus 123 ~g~~~~ 128 (131) +--.+= T Consensus 167 ~r~~r~ 172 (172) T protein:vir:95 167 LRTIRV 172 (172) T ss_pred eEEEeC Confidence 211111 No 25 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=76.78 E-value=0.13 Score=25.41 Aligned_cols=114 Identities=15% Similarity=0.106 Sum_probs=60.4 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCC---C--c-------hHHHHHHH-HHHHHHHHH--hhhhccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESK---F--G-------DDYDRALA-LYTLHLMTL--EGALKTE 65 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~---~--g-------~~~~~a~~-l~~AHll~l--~~~~~~~ 65 (131) |+.-|.+.|++.-++ +++..+..+..|+..||.-. + + +.+..++. .+++-+..+ .|..... T Consensus 1 M~YlT~eey~el~~~----~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~a~QieY~~~~G~~s~~ 76 (130) T protein:vir:47 1 MTYLTQEEFDELDFD----EVTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQIAYLDASGIMSAD 76 (130) T ss_pred CCCCchhhHhhcCCC----ChhhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHHHHHHHHHHHhccccch Confidence 999999999876444 34459999999998887321 1 1 22222222 223333333 2322221 Q ss_pred cccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEeecC-CCC Q lcl|NC_019509. 66 KDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLITGL-RRG 128 (131) Q Consensus 66 ~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~-~~~ 128 (131) ..+.++|.++ |..|||+.+.++...-- .......-+.++...|. 
+|+.|- +-+ T Consensus 77 ------~~~~~~S~sv-GrtSis~~~~~~~~~~~-~~~vs~da~~~L~~tGL--~Ly~GV~yd~ 130 (130) T protein:vir:47 77 ------DKQLANSVSI-GRTSISYSTSQSTLAGQ-RFNLSMDAENALRQAGF--SLVVGVAYDR 130 (130) T ss_pred ------hccCcceeee-cceeeecCcCccccccC-CccccHHHHHHHHhccc--ccccCCCccC Confidence 1344567786 89999997644321100 01244555566666553 333322 112 No 26 >protein:vir:96128 Length: 98 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240080;genbank:gi:66395776;genbank:GeneID:5133109 Probab=76.45 E-value=0.032 Score=28.78 Aligned_cols=89 Identities=19% Similarity=0.377 Sum_probs=54.8 Q ss_pred CCHHHHHHHHHh--hhhhcC-----CCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQL--APPMKK-----IPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~--~P~F~~-----~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+ -| ..+ +=...|..+++.|+.+ |+..|.+-+.-++..++|-.+.- .+. T Consensus 1 Md---~~dVK~ln~~~-i~~~~~d~~~~~li~~y~e~aedy-CN~~F~k~lP~gVkkfiAe~iky------------~~~ 63 (98) T protein:vir:96 1 ME---PKEVKQLNLMP-IEDTSNDDVLGDLIKFYKGIAEEY-CNKTFEAPYPFGVRKFIAECIKY------------GTN 63 (98) T ss_pred Cc---hHHhHHhhccc-CCCcchHHHHHHHHHHHHHHHHHH-hCCcccccCCccHHHHHHHHHhh------------CCC Confidence 77 5555554 11 111 2244577888999987 55568888888999999988763 134 Q ss_pred cceeeeeeeceeEEeeecC--ccchhhhhcCHHHHHHH Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQST--TGGDKSLSATPWGEMYR 109 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~--~~~~~~l~~T~YGq~y~ 109 (131) ++++|.|. 
|+||.+|-+- ...-.||+ ||=+.=| T Consensus 64 ~nissRsM-gtVSYty~T~iP~~i~~~L~--PyRrlrw 98 (98) T protein:vir:96 64 SNVSSRTM-GTVSYTFVTDLPKATYRHLK--PFRRLRW 98 (98) T ss_pred CCcccccc-cceeeechhhhhHHHHHHhh--hhhhccC Confidence 57788887 8999999652 22223321 2222212 No 27 >protein:vir:5976 Length: 102 # NCBI annotation: hypothetical protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690676;genbank:geneid:6329129;genbank:gi:22855070;uniprot:Q38584;genbank:GeneID:955305 Probab=72.98 E-value=0.056 Score=27.47 Aligned_cols=91 Identities=21% Similarity=0.281 Sum_probs=55.9 Q ss_pred CCHHHHHHHHHhhhhhcCCCH----HHHHHHHHHHHHHhCCCCCch-----HHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPD----ETIEAWVEMAKLFVCESKFGD-----DYDRALALYTLHLMTLEGALKTEKDSVES 71 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD----~~i~~~l~~A~~~v~~~~~g~-----~~~~a~~l~~AHll~l~~~~~~~~~~~~~ 71 (131) |. +.+.|.+-|==.+-.| +.|..+++.|+.+ |+..|.+ -+.-++..++|..+.-+ + T Consensus 1 Md---~~~VK~ll~i~~~s~d~~i~~lip~y~e~aedy-CN~~F~dkdg~~~lP~gVkkfvAe~ik~y-----------~ 65 (102) T protein:vir:59 1 MD---IQRVKRLLSITNDKHDEYLTEMVPLLVEFAKDE-CHNPFIDKDGNESIPSGVLIFVAKAAQFY-----------M 65 (102) T ss_pred CC---hHHhhhhhcCCCCccHHHHHHHHHHHHHHHHHH-hCCccccccccccCCccHHHHHHHHHHhc-----------C Confidence 88 5566655332112233 4467788899887 4556763 47788999999887643 2 Q ss_pred cccceeeeeeeceeEEeeecC--ccchhhhhcCHHHHHHHHHHH Q lcl|NC_019509. 72 YTQRVASFSLSGEFSQTFQST--TGGDKSLSATPWGEMYRALNR 113 (131) Q Consensus 72 ~~g~v~S~s~~G~vSvsy~~~--~~~~~~l~~T~YGq~y~~L~~ 113 (131) .+++++|.|. 
|+||.+|.+- ...-.||+ -|.+|.| T Consensus 66 ~~~nissRsM-gtVSYty~T~iP~~i~~~L~------PyRrl~~ 102 (102) T protein:vir:59 66 TNAGLTGRSM-DTVSYNFATEIPSTILKKLN------PYRKMAR 102 (102) T ss_pred CCCCcccccc-cceeeechhhhhHHHHHHhh------HHHhhcC Confidence 3467788887 8999999652 22223432 3334443 No 28 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=68.80 E-value=0.22 Score=24.23 Aligned_cols=96 Identities=9% Similarity=-0.022 Sum_probs=49.4 Q ss_pred CCHHHHHHHHHhhhhh-----c--------CCCHHHHHHHHHHHHHHhCC---CCC-------chHHHHHHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPPM-----K--------KIPDETIEAWVEMAKLFVCE---SKF-------GDDYDRALALYTLHLMT 57 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F-----~--------~~pD~~i~~~l~~A~~~v~~---~~~-------g~~~~~a~~l~~AHll~ 57 (131) |+.-|+++++++|++= . .++++.|+..|++|..+|+. .|+ +....+...-++.|+|. T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 9999999999999862 1 36788899999999999973 333 24445555555666654 Q ss_pred Hhhh------------------hccccccccccc-cceeeeeeeceeEEeeecCccchhh Q lcl|NC_019509. 58 LEGA------------------LKTEKDSVESYT-QRVASFSLSGEFSQTFQSTTGGDKS 98 (131) Q Consensus 58 l~~~------------------~~~~~~~~~~~~-g~v~S~s~~G~vSvsy~~~~~~~~~ 98 (131) .+.. ...+.-..+... 
+...+ + ++.+.++-....-+..| T Consensus 81 ~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~-~-~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:79 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAP-V-ANTVQISEGRNDWGADW 138 (138) T ss_pred cCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCC-C-CCceeeecCCCCCCCCC Confidence 2100 011111111010 11111 1 22333332211122334 No 29 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=68.80 E-value=0.22 Score=24.23 Aligned_cols=96 Identities=9% Similarity=-0.022 Sum_probs=49.4 Q ss_pred CCHHHHHHHHHhhhhh-----c--------CCCHHHHHHHHHHHHHHhCC---CCC-------chHHHHHHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPPM-----K--------KIPDETIEAWVEMAKLFVCE---SKF-------GDDYDRALALYTLHLMT 57 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F-----~--------~~pD~~i~~~l~~A~~~v~~---~~~-------g~~~~~a~~l~~AHll~ 57 (131) |+.-|+++++++|++= . .++++.|+..|++|..+|+. .|+ +....+...-++.|+|. T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 9999999999999862 1 36788899999999999973 333 24445555555666654 Q ss_pred Hhhh------------------hccccccccccc-cceeeeeeeceeEEeeecCccchhh Q lcl|NC_019509. 58 LEGA------------------LKTEKDSVESYT-QRVASFSLSGEFSQTFQSTTGGDKS 98 (131) Q Consensus 58 l~~~------------------~~~~~~~~~~~~-g~v~S~s~~G~vSvsy~~~~~~~~~ 98 (131) .+.. ...+.-..+... 
+...+ + ++.+.++-....-+..| T Consensus 81 ~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~-~-~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:99 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAP-V-ANTVQISEGRNDWGADW 138 (138) T ss_pred cCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCC-C-CCceeeecCCCCCCCCC Confidence 2100 011111111010 11111 1 22333332211122334 No 30 >protein:vir:107119 Length: 104 # NCBI annotation: conserved phage protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950608;genbank:gi:119953688;genbank:GeneID:4643128 Probab=67.90 E-value=0.017 Score=30.22 Aligned_cols=95 Identities=18% Similarity=0.268 Sum_probs=57.4 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|++ -+.-.+..++|-.+.- ++. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-Cn~~F~~~~lP~gV~~fvA~~iky------------~~~ 64 (104) T protein:vir:10 1 MN---AQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEY-CNQTFNRKSLPSNVEKFIANCIKQ------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHh-cCCCCCCCCCCccHHHHHHHHHhh------------cCC Confidence 77 44444331 1100 11245678889999987 5556875 6778889999988763 235 Q ss_pred cceeeeeeeceeEEeeecCc--cchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTT--GGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~--~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.||.+.- ..-.|| ...+|+.-.| ..+ T Consensus 65 ~NissRSM-GtVSyTy~t~iP~~i~~~L---------~PYRklr~~~-~~~ 104 (104) T protein:vir:10 65 SNISSRTM-GTVSYTFVTDLPKETYGYL---------KPFRRLRWTG-YHV 104 (104) T ss_pred CCcccccc-cceeecccchhHHHHHHhh---------hhhhhhcccc-ccC Confidence 57888897 89999996522 222332 3345555544 444 No 31 >protein:vir:105327 Length: 104 # NCBI annotation: putative head morphogenesis protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950671;genbank:gi:119967841;genbank:GeneID:4643206 Probab=67.90 E-value=0.017 Score=30.22 Aligned_cols=95 Identities=18% Similarity=0.268 Sum_probs=57.4 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|++ -+.-.+..++|-.+.- ++. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-Cn~~F~~~~lP~gV~~fvA~~iky------------~~~ 64 (104) T protein:vir:10 1 MN---AQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEY-CNQTFNRKSLPSNVEKFIANCIKQ------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHh-cCCCCCCCCCCccHHHHHHHHHhh------------cCC Confidence 77 44444331 1100 11245678889999987 5556875 6778889999988763 235 Q ss_pred cceeeeeeeceeEEeeecCc--cchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTT--GGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~--~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.||.+.- ..-.|| ...+|+.-.| ..+ T Consensus 65 ~NissRSM-GtVSyTy~t~iP~~i~~~L---------~PYRklr~~~-~~~ 104 (104) T protein:vir:10 65 SNISSRTM-GTVSYTFVTDLPKETYGYL---------KPFRRLRWTG-YHV 104 (104) T ss_pred CCcccccc-cceeecccchhHHHHHHhh---------hhhhhhcccc-ccC Confidence 57888897 89999996522 222332 3345555544 444 No 32 >protein:vir:105776 Length: 133 # NCBI annotation: gp11 # Family: family:all:10997 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224149;genbank:gi:62362224;genbank:GeneID:3342529 Probab=67.34 E-value=0.25 Score=23.85 Aligned_cols=110 Identities=22% Similarity=0.271 Sum_probs=61.8 Q ss_pred CCHHHHHHHHHhhhhhc-CCCHHHHHHHHHHHHH---HhCCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccce Q lcl|NC_019509. 1 MNENILLIIRQLAPPMK-KIPDETIEAWVEMAKL---FVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQRV 76 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~-~~pD~~i~~~l~~A~~---~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~v 76 (131) |= |.++.|+.--+.. .+||-.|+.+++++.. .++ ..+.+..+.++.+|++-+|.+.. ...+| T Consensus 1 mI--T~~qa~~~L~slG~svP~~iL~~~v~q~nsi~~cLd-agY~e~tq~LI~lya~~LlA~~~-----------g~R~I 66 (133) T protein:vir:10 1 MI--TTEQAKEYLESVGITLPDFILQAIVEQANSIQECLD-AHYPPATALLIQSYLLGLMALGQ-----------GDRYI 66 (133) T ss_pred CC--CHHHHHHHHHhcCCcchHHHHHHHHHHHhhHHHHHh-CCCCHHHHHHHHHHHHHHHhhcc-----------CCcee Confidence 32 3455555555554 6999999999999864 345 47889999999999999987632 23456 Q ss_pred eeeee-eceeEEeeecCccchhhhhcCHHHHHHHHHHHH--hCCCCeEe------------ecCCCCC-CC Q lcl|NC_019509. 77 ASFSL-SGEFSQTFQSTTGGDKSLSATPWGEMYRALNRK--KGGGFGLI------------TGLRRGC-CE 131 (131) Q Consensus 77 ~S~s~-~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~--~g~G~~l~------------~g~~~~~-~~ 131 (131) +|++. +| -|.||.-.... ++|=+.|-+|+.. .|=-..|+ .=.+||| |. 
T Consensus 67 sSQ~APSG-ASrSF~Y~~~~------~~~~~l~~~L~~lD~~gCt~~Lip~d~~~~a~vG~f~vvggc~c~ 130 (133) T protein:vir:10 67 SSQTAPNG-ASRSFRYQSFA------DRWKGALSLLRGADKFRCANGLIPPDPTNTAFAGIWIGKGGCMCN 130 (133) T ss_pred ecccCCcc-ccccccccCCC------ccHHHHHHHHHhhhhccccccccCCCccccccceeeeeccccccC Confidence 55543 33 36665432222 2233344444433 22111232 1234555 33 No 33 >protein:vir:97329 Length: 104 # NCBI annotation: ORF048 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240613;genbank:gi:66396311;genbank:GeneID:5133685 Probab=66.57 E-value=0.019 Score=30.02 Aligned_cols=95 Identities=15% Similarity=0.249 Sum_probs=57.1 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|++ -+.-.+..++|-.+.- ... T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-Cn~~F~~~~lP~gV~~fvA~~iky------------~~~ 64 (104) T protein:vir:97 1 MD---TKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDY-CNQKFDDKAVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHh-cCCCCCCCCCCccHHHHHHHHHhh------------CCC Confidence 77 44444331 1100 11245678889999997 5556875 6778899999988763 135 Q ss_pred cceeeeeeeceeEEeeecCc--cchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTT--GGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~--~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.||.+.- ..-.|| ...+|+.-.| ..+ T Consensus 65 ~NissRSM-GtVSYty~t~iP~~i~~~L---------kPYRklr~~~-~~~ 104 (104) T protein:vir:97 65 GNISARTM-GTVSYTYVTDIPSSAYAYL---------MPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeecccchhHHHHHHhh---------hhhhhhcccc-cCC Confidence 57888897 89999996522 222332 2344554444 443 No 34 >protein:vir:94798 Length: 104 # NCBI annotation: ORF043 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240538;genbank:gi:66396233;genbank:GeneID:5133578 Probab=66.46 E-value=0.019 Score=30.01 Aligned_cols=95 Identities=15% Similarity=0.248 Sum_probs=57.1 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|++ -+.-.+..++|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-Cn~~F~~~~lP~gVk~fvA~~iky------------~~~ 64 (104) T protein:vir:94 1 MD---TKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDY-CNQKFDDKAVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHh-cCCCCCCCCCCccHHHHHHHHHhh------------CCC Confidence 77 44444331 1100 11245678889999997 5556875 6778889999988763 135 Q ss_pred cceeeeeeeceeEEeeecCc--cchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTT--GGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~--~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.||.+.- ..-.|| ...+|+.-.| ..+ T Consensus 65 ~NissRSM-GtVSYTy~T~iP~~i~~~L---------kPYRklr~~~-~~~ 104 (104) T protein:vir:94 65 GNISARTM-GTVSYTYITDIPSSAYAYL---------MPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeecccchhHHHHHHhh---------hhhhhhcccc-cCC Confidence 57888897 89999996522 222332 2344554444 443 No 35 >protein:vir:95891 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240387;genbank:gi:66396087;genbank:GeneID:5133402 Probab=66.21 E-value=0.02 Score=29.96 Aligned_cols=95 Identities=15% Similarity=0.240 Sum_probs=57.2 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|++ -+.-.+..++|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-Cn~~F~~~~lP~gV~~fvA~~iky------------~~~ 64 (104) T protein:vir:95 1 MD---AKDVKMINGLSLNDSSNDEQIKYLIEEYKSVAEDY-CNQKFDDKAVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHh-cCCCCCCCCCCccHHHHHHHHHhh------------CCC Confidence 77 44444331 1100 11245678889999997 5556875 6778899999988763 135 Q ss_pred cceeeeeeeceeEEeeecCc--cchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTT--GGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~--~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.||.+.- ..-.|| ...+|+.-.| ..+ T Consensus 65 ~NissRSM-GtVSYTy~t~iP~~i~~~L---------kPYRklr~~~-~~~ 104 (104) T protein:vir:95 65 GNISARTM-GTVSYTYVTDIPSSAYAYL---------LPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeecccchhHHHHHHhh---------hhhhhhcccc-cCC Confidence 57888897 89999996522 222332 3345554444 443 No 36 >protein:vir:96281 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240313;genbank:gi:66396008;genbank:GeneID:5133358 Probab=66.21 E-value=0.02 Score=29.96 Aligned_cols=95 Identities=15% Similarity=0.240 Sum_probs=57.2 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCch-HHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGD-DYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~-~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|++ -+.-.+..++|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-Cn~~F~~~~lP~gV~~fvA~~iky------------~~~ 64 (104) T protein:vir:96 1 MD---AKDVKMINGLSLNDSSNDEQIKYLIEEYKSVAEDY-CNQKFDDKAVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHh-cCCCCCCCCCCccHHHHHHHHHhh------------CCC Confidence 77 44444331 1100 11245678889999997 5556875 6778899999988763 135 Q ss_pred cceeeeeeeceeEEeeecCc--cchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTT--GGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~--~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.||.+.- ..-.|| ...+|+.-.| ..+ T Consensus 65 ~NissRSM-GtVSYTy~t~iP~~i~~~L---------kPYRklr~~~-~~~ 104 (104) T protein:vir:96 65 GNISARTM-GTVSYTYVTDIPSSAYAYL---------LPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeecccchhHHHHHHhh---------hhhhhhcccc-cCC Confidence 57888897 89999996522 222332 3345554444 443 No 37 >protein:vir:96831 Length: 98 # NCBI annotation: ORF052 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240159;genbank:gi:66395852;genbank:GeneID:5133172 Probab=65.38 E-value=0.013 Score=30.91 Aligned_cols=90 Identities=21% Similarity=0.351 Sum_probs=53.8 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQ 74 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g 74 (131) |. +.+.|.+- |-=. ++=...|..+++.|+.+ |+..|.+-+.-++..++|-.+.- ...+ T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-CN~~F~~~lP~gVkkfvAe~iky------------~~~~ 64 (98) T protein:vir:96 1 MD---ALDVKMLNGTRIDDVSNDDVINKLILAYKQVAEEY-CNQVFGDPLPGGVKKFIAECIKY------------GVSG 64 (98) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHH-hCCcccccCCccHHHHHHHHHhh------------cccC Confidence 77 44444331 1100 11245678889999987 55568888888999999988763 1235 Q ss_pred ceeeeeeeceeEEeeecC--ccchhhhhcCHHHHHHH Q lcl|NC_019509. 75 RVASFSLSGEFSQTFQST--TGGDKSLSATPWGEMYR 109 (131) Q Consensus 75 ~v~S~s~~G~vSvsy~~~--~~~~~~l~~T~YGq~y~ 109 (131) +++|.|. 
|+||.+|.+- ...-.||+ ||=+.=| T Consensus 65 nissRsM-gtVSYty~T~iP~~i~~~L~--PyRrlrw 98 (98) T protein:vir:96 65 NIASRSM-GTVSYTYVTDVPSSMYKYLK--PYRKLRW 98 (98) T ss_pred Ccccccc-cceeeechhhhhHHHHHHhh--hhhhccC Confidence 6788887 8999999652 22223321 2222222 No 38 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=62.14 E-value=0.34 Score=23.15 Aligned_cols=96 Identities=9% Similarity=-0.007 Sum_probs=49.3 Q ss_pred CCHHHHHHHHHhhhhh-----c--------CCCHHHHHHHHHHHHHHhCC---CCC-------chHHHHHHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPPM-----K--------KIPDETIEAWVEMAKLFVCE---SKF-------GDDYDRALALYTLHLMT 57 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F-----~--------~~pD~~i~~~l~~A~~~v~~---~~~-------g~~~~~a~~l~~AHll~ 57 (131) |+.-|.++++++|++= . .++++.|+..|++|..+|+. .|+ +....+...-++.|+|. T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHLHARYQLPLAQVPVVLKRVACVLAFANLH 80 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 9999999999999863 1 36788899999999999973 333 34455555556666664 Q ss_pred Hhhh------------------hccccccccccc-cceeeeeeeceeEEeeecCccchhh Q lcl|NC_019509. 58 LEGA------------------LKTEKDSVESYT-QRVASFSLSGEFSQTFQSTTGGDKS 98 (131) Q Consensus 58 l~~~------------------~~~~~~~~~~~~-g~v~S~s~~G~vSvsy~~~~~~~~~ 98 (131) -+.. ...+.-..+-.. +...+. 
++.+.++-...--+..| T Consensus 81 ~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~--~~~~~~~s~~r~Fg~d~ 138 (138) T protein:vir:10 81 TQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPI--ANTVQISSQRNDFGGTW 138 (138) T ss_pred cCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCC--CCceeeecCCccCCCCC Confidence 3110 001111111000 011111 12233322111123345 No 39 >protein:vir:9706 Length: 100 # NCBI annotation: hypothetical protein # Family: family:all:316 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795468;genbank:gi:28876223;genbank:GeneID:1257767 Probab=61.17 E-value=0.08 Score=26.59 Aligned_cols=85 Identities=13% Similarity=0.165 Sum_probs=44.0 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCC--------CCc--hHHHHHHHHHHHHHHHHhhhhccccccc- Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCES--------KFG--DDYDRALALYTLHLMTLEGALKTEKDSV- 69 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~--------~~g--~~~~~a~~l~~AHll~l~~~~~~~~~~~- 69 (131) .+++-++.+|.---==-+..|+.|+.+++-|+.+|... .|. ..+.+|++++++|+---++.. ....-. T Consensus 3 ~t~e~L~~lK~~lRID~d~DD~li~~~i~~Ae~~I~~AV~~~~t~~~~~~~~rF~~Av~~Lv~~~Y~nR~~t-~d~~~~~ 81 (100) T protein:vir:97 3 VSKELLNSVKLYCKIDFDFENDIIKEMIESAQEQICFAIDDGSTPEMFEGHAKFALAVKKQVKEEYDHRGLS-ADSFRYP 81 (100) T ss_pred ccHHHHHHHHHHcCCCCCcchHHHHHHHHHHHHHHhhhccCCCCcchhhccchHHHHHHHHHHHHHHhcccc-chhhcch Confidence 34555667765322111689999999999999999632 221 355799999999986543211 110000 Q ss_pred --cccccceeeeeeeceeE Q lcl|NC_019509. 
70 --ESYTQRVASFSLSGEFS 86 (131) Q Consensus 70 --~~~~g~v~S~s~~G~vS 86 (131) -+....|..-...|+-| T Consensus 82 ip~gv~~lI~QLR~~~~~~ 100 (100) T protein:vir:97 82 LANGVLNIIHQLRLRGDDS 100 (100) T ss_pred hhhhHHHHHHHHHHhhcCC Confidence 00001111111112211 No 40 >protein:vir:1241 Length: 104 # NCBI annotation: similar to phage Spp1 gp15 (product required for head morphogenesis) # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510940;genbank:gi:17426274;genbank:GeneID:927373 Probab=58.90 E-value=0.3 Score=23.41 Aligned_cols=97 Identities=12% Similarity=0.269 Sum_probs=56.2 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCc-hHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFG-DDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g-~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|+ +.+.-++..|+|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-CN~~F~~~~lP~gVkkfvAe~iky------------~~~ 64 (104) T protein:vir:12 1 MD---AKDVKMINGLSLNDSSDDEQIEYLIEEYKSVAEDY-CNQKFDDKEVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCCccHHHHHHHHHHHHHHHHHH-hCCCCCCccCCccHHHHHHHHHhh------------CCC Confidence 77 44444431 1100 11245677889999987 555676 46788999999988763 134 Q ss_pred cceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.+|.+- +-++.|+- +...+|+.=.| ..+ T Consensus 65 ~NissRsM-gtVSYTy~T~------iP~~i~~~-L~PYRrlrw~~-~~~ 104 (104) T protein:vir:12 65 GNISARTM-GTVSYTYVTD------IPSSAYAY-LLPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeeechhh------hhHHHHHh-hhhhhhhcccc-cCC Confidence 57888887 8999999652 11222221 22334443333 443 No 41 >protein:vir:100885 Length: 110 # NCBI annotation: putative DNA packaging protein # Family: family:all:6491 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358765;genbank:gi:78000029;genbank:GeneID:3726156 Probab=58.79 E-value=0.22 Score=24.17 Aligned_cols=95 Identities=12% Similarity=0.142 Sum_probs=47.7 Q ss_pred CCHH---HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCC----C----Cc--hHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_019509. 1 MNEN---ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCES----K----FG--DDYDRALALYTLHLMTLEGALKTEKD 67 (131) Q Consensus 1 m~~~---ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~----~----~g--~~~~~a~~l~~AHll~l~~~~~~~~~ 67 (131) |+++ |++++|..--==.+-+|+.|+.+|..|+.+|... - +. ...+.|+.+|+=|+-.-+++...... T Consensus 1 ~~~s~gVT~ddiK~~LriD~~~DD~~L~~li~tAe~yI~~AI~~ti~~~~~~~~p~Fn~AV~lLvd~wY~nRga~s~~~~ 80 (110) T protein:vir:10 1 MTDGQGVTPEDMQQYLNLDTNGDASVLADMISTAEEAITGAIDDTIDVGIYRKYPLFNQAVRVLVDFMYYSRGTLSDQSK 80 (110) T ss_pred CCCCccccHHHHHHHhcCCCCchhHHHHHHHHHHHHHHhhccCCCcchhhhhhchhhhHHHHHHHHHhhhcccccchhhc Confidence 9988 7999887321112467889999999999998431 1 11 23467999999998765543221110 Q ss_pred c-ccccccceeeeeeeceeEEeee-cCccchh Q lcl|NC_019509. 68 S-VESYTQRVASFSLSGEFSQTFQ-STTGGDK 97 (131) Q Consensus 68 ~-~~~~~g~v~S~s~~G~vSvsy~-~~~~~~~ 97 (131) - +-+....+..-.. .+...-. 
...++++ T Consensus 81 ~~P~sv~smIqQlR~--~~~~~~~~~~~~~~~ 110 (110) T protein:vir:10 81 AYPPSYAYMINSIRW--KIQRDQAAKAGGTDG 110 (110) T ss_pred ccchHHHHHHHHHHh--hhhhhhhhccCCCCC Confidence 0 0000000100000 0010001 1123334 No 42 >protein:vir:93740 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240461;genbank:gi:66396159;genbank:GeneID:5133509 Probab=57.47 E-value=0.049 Score=27.78 Aligned_cols=97 Identities=13% Similarity=0.276 Sum_probs=56.6 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCc-hHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFG-DDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g-~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=+..|..+++.|+.+ |+..|+ +.+.-++..|+|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-CN~~F~~~~lP~gVkkfvAe~iky------------~~~ 64 (104) T protein:vir:93 1 MD---AKDVKMINGLSLNDSSNDEQIDYLIEEYKSVAEDY-CNQKFDDKEVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHH-hCCCCCCccCCccHHHHHHHHHhh------------CCC Confidence 77 44444431 1100 12256688899999997 555676 46788999999988763 134 Q ss_pred cceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.+|.+- +-++.|+- +...+|+.=.| ..+ T Consensus 65 ~NissRsM-gtVSYTy~T~------iP~~i~~~-L~PYRrlrw~~-~~~ 104 (104) T protein:vir:93 65 GNISARTM-GTVSYTYVTD------IPSSAYAY-LLPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeeechhh------hhHHHHHh-hhhhhhhcccc-cCC Confidence 57888887 8999999652 11222221 22334443333 443 No 43 >protein:vir:94492 Length: 104 # NCBI annotation: ORF049 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240678;genbank:gi:66396380;genbank:GeneID:5133756 Probab=54.54 E-value=0.06 Score=27.27 Aligned_cols=97 Identities=12% Similarity=0.266 Sum_probs=56.2 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCc-hHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFG-DDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g-~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|+ +.+.-++..|+|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-CN~~F~~~~lP~gVkkfvAe~iky------------~~~ 64 (104) T protein:vir:94 1 MD---AKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDY-CNQKFDDKEVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHH-hCCCCCCccCCccHHHHHHHHHhh------------CCC Confidence 77 44444431 1100 11245678889999997 555676 46788999999988763 134 Q ss_pred cceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.+|.+- +-++.|+- +...+|+.=.| ..+ T Consensus 65 ~NissRsM-gtVSYTy~T~------iP~~i~~~-L~PYRrlrw~~-~~~ 104 (104) T protein:vir:94 65 GNISARTM-GTVSYTYVTD------IPSSAYAY-LLPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeeechhh------hhHHHHHh-hhhhhhhcccc-cCC Confidence 57888887 8999999652 11222221 22334443333 443 No 44 >protein:vir:97430 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240751;genbank:gi:66396455;genbank:GeneID:5133786 Probab=54.54 E-value=0.06 Score=27.27 Aligned_cols=97 Identities=12% Similarity=0.266 Sum_probs=56.2 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCc-hHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFG-DDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g-~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|+ +.+.-++..|+|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-CN~~F~~~~lP~gVkkfvAe~iky------------~~~ 64 (104) T protein:vir:97 1 MD---AKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDY-CNQKFDDKEVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHH-hCCCCCCccCCccHHHHHHHHHhh------------CCC Confidence 77 44444431 1100 11245678889999997 555676 46788999999988763 134 Q ss_pred cceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.+|.+- +-++.|+- +...+|+.=.| ..+ T Consensus 65 ~NissRsM-gtVSYTy~T~------iP~~i~~~-L~PYRrlrw~~-~~~ 104 (104) T protein:vir:97 65 GNISARTM-GTVSYTYVTD------IPSSAYAY-LLPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeeechhh------hhHHHHHh-hhhhhhhcccc-cCC Confidence 57888887 8999999652 11222221 22334443333 443 No 45 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=53.87 E-value=0.52 Score=22.15 Aligned_cols=99 Identities=12% Similarity=0.113 Sum_probs=51.3 Q ss_pred CCHH-HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh----CCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccccc Q lcl|NC_019509. 1 MNEN-ILLIIRQLAPPMKKIPDETIEAWVEMAKLFV----CESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQR 75 (131) Q Consensus 1 m~~~-ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v----~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g~ 75 (131) |+.+ .++..+.+--.-....|+.|+.+|++|...+ +.+..++..+..+.=++.+.. +..+.+ . T Consensus 2 ~~~~~~~e~ik~L~~~~d~~~DelL~~lieda~~~vl~y~nr~~ip~~l~~~v~evav~~f---NR~G~E---------G 69 (105) T protein:vir:10 2 LNVDQLTEIVSALSTRLENVNNALLTELVKESIAQVLDYTGQKKLVGSMDIYVKKLAVINY---NRLGIE---------G 69 (105) T ss_pred CchHHHHHHHHHHhccCCCchhHHHHHHHHHHHHHHHHHcCCcccchhHHHHHHHHHHHHh---cccCCc---------c Confidence 7766 3444544333323567999999999998766 455555555555544444432 222222 2 Q ss_pred eeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCC Q lcl|NC_019509. 76 VASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGF 119 (131) Q Consensus 76 v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~ 119 (131) .+|.| +|-+|.||.+.-. +. 
|=...-..++...+-| T Consensus 70 ~tS~S-egGvS~sy~~~~~-~~------~~~~l~~yR~~~v~~~ 105 (105) T protein:vir:10 70 ETQRS-EGGITNYLETGIP-KD------IRQGLNSYRIAKVKKL 105 (105) T ss_pred cceee-cCCeeeeeeccCc-HH------HHHHHHHHhhhcccCC Confidence 35667 5789999975211 11 1122222222222222 No 46 >protein:vir:95071 Length: 104 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240825;genbank:gi:66394717;genbank:GeneID:5133865 Probab=53.84 E-value=0.063 Score=27.15 Aligned_cols=97 Identities=12% Similarity=0.271 Sum_probs=56.1 Q ss_pred CCHHHHHHHHHhh--hhhc----CCCHHHHHHHHHHHHHHhCCCCCc-hHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLA--PPMK----KIPDETIEAWVEMAKLFVCESKFG-DDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~--P~F~----~~pD~~i~~~l~~A~~~v~~~~~g-~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |. +.+.|.+- |-=. +.=...|..+++.|+.+ |+..|+ +.+.-++..|+|-.+.- .+. T Consensus 1 Md---~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedy-CN~~F~~~~lP~gVkkfvAe~iky------------~~~ 64 (104) T protein:vir:95 1 MD---AKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDY-CNQKFDDKEVPSGVKKFIAECIKF------------GTT 64 (104) T ss_pred CC---HHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHH-hCCCCCCccCCccHHHHHHHHHhh------------CCC Confidence 77 44444431 1100 11245678889999997 555676 46788999999988763 134 Q ss_pred cceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) ++++|.|. 
|+||.+|.+- +-++.|+- +...+|+.=.| ..+ T Consensus 65 ~NissRsM-gtVSYTy~T~------iP~~i~~~-L~PYRrlrw~~-~~~ 104 (104) T protein:vir:95 65 GNISARTM-GTVSYTYVTD------IPSSAYAY-LMPYRKLSWGK-RYV 104 (104) T ss_pred CCcccccc-cceeeechhh------hhHHHHHh-hhhhhhhcccc-cCC Confidence 57888887 8999999652 11122221 22334443333 343 No 47 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=51.94 E-value=0.57 Score=21.93 Aligned_cols=111 Identities=17% Similarity=0.092 Sum_probs=52.1 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCC---CC-----c---hHHHHHHH-HHHHHHHHH--hhhhcccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCES---KF-----G---DDYDRALA-LYTLHLMTL--EGALKTEK 66 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~---~~-----g---~~~~~a~~-l~~AHll~l--~~~~~~~~ 66 (131) ||.-|.+.|.+..++ +++.++..+..|+..||.- ++ . ++++.++. .+++.+..+ .|...... T Consensus 6 M~YlT~eey~~l~~~----~~~dF~kllk~As~~ID~~t~~~y~~~d~e~d~~~r~~~vKkA~a~QIeY~~~~G~ts~~d 81 (138) T protein:vir:98 6 IAFLTQKEFEDLGFD----DVEDFEKMEKRASHAVNLYCRNRYDYKDLKKEIALVQKAVKRAIAYQIAYLNDSGVMTAED 81 (138) T ss_pred ccccchHHHhccCCC----ChhhHHHHHHHHHHHhhhhhccccccccccchhHHHHHHHHHHHHHHHHHHHHcCCcchhh Confidence 999999988765333 4445999999999988742 11 1 22222222 333333333 23222211 Q ss_pred ccccccccceeeeeeeceeEEeeecCccc-------hhhhhcCHHHHHHHHHHHHhCCCCeEeecC-CCC Q lcl|NC_019509. 67 DSVESYTQRVASFSLSGEFSQTFQSTTGG-------DKSLSATPWGEMYRALNRKKGGGFGLITGL-RRG 128 (131) Q Consensus 67 ~~~~~~~g~v~S~s~~G~vSvsy~~~~~~-------~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~-~~~ 128 (131) .+.+.|.++ |..||||...+.+ .+-++.+. .=..++...|. 
++.|- +-+ T Consensus 82 ------~~~~~s~sv-GrTSiS~~~~~~~~s~~~~~~~~~~~s~---~A~~~L~~tGL---LY~GV~yd~ 138 (138) T protein:vir:98 82 ------KQSFAGISL-GRTSISYTVGHGQGSQQKTLADRFNLCL---DAENELLVVGL---GYTGISYDR 138 (138) T ss_pred ------ccCcCceEe-eeeEeecccccccccccccccccccccH---HHHHHHhhcCc---ccccCcccC Confidence 334567775 8999998322111 11122221 11123333332 22211 111 No 48 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=48.00 E-value=0.33 Score=23.26 Aligned_cols=89 Identities=9% Similarity=-0.010 Sum_probs=46.1 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC------CCCchHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCE------SKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYTQ 74 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~------~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~g 74 (131) |.--|++++|.--===.+-.|+.|+.+|+-|+.+|.. .......+.|+.++++|+=.-+........ ..+- T Consensus 1 Mm~vtLee~K~~LRID~d~dD~lI~~li~aA~~~i~~~~g~~~~~~~~~~~~Avl~lv~~~YeNRe~~~~~~~---~~p~ 77 (95) T protein:vir:81 1 MMIVTLEEVKNWLRVDFSDDDALITTLINAAEEYLKNATGTTFDATNHLAKIFCMTLIADWYENRELVGRASD---QVRP 77 (95) T ss_pred CCcCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHhhccccccCchHHHHHHHHHHHHHHhhccccccccc---cccH Confidence 8778899888622111246899999999999988843 123467889999999998653321110000 0000 Q ss_pred ceeeeeeeceeEEeeecCccchhhhhcCH Q lcl|NC_019509. 75 RVASFSLSGEFSQTFQSTTGGDKSLSATP 103 (131) Q Consensus 75 ~v~S~s~~G~vSvsy~~~~~~~~~l~~T~ 103 (131) .|.|- -..+.-.|. ..|. 
T Consensus 78 ~v~sl--l~~lr~~~~---------~~~~ 95 (95) T protein:vir:81 78 ILQSI--LAQLTYAYG---------GETA 95 (95) T ss_pred HHHHH--HHHhhhccc---------cccC Confidence 00000 001110111 1111 No 49 >protein:vir:99922 Length: 165 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655526;genbank:gi:109392296;genbank:GeneID:4157091 Probab=45.13 E-value=0.78 Score=21.17 Aligned_cols=114 Identities=15% Similarity=0.178 Sum_probs=60.7 Q ss_pred CCHHHHH-----HHHHhhhhhcCCCHHHHHHHHHHHHHHh-------CCCCCchHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_019509. 1 MNENILL-----IIRQLAPPMKKIPDETIEAWVEMAKLFV-------CESKFGDDYDRALALYTLHLMTLEGALKTEKDS 68 (131) Q Consensus 1 m~~~ti~-----~Fr~~~P~F~~~pD~~i~~~l~~A~~~v-------~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~ 68 (131) |++-|-. .--+.--.|++.|.+....+|++|.... ++..| +..+.|-..+.--+|..+. T Consensus 1 ~~~~~~~~p~~ii~~eDl~Pf~~i~~~ka~~mI~da~A~A~~vAPCi~~~~f-~~~~aAKaIlrgAiLRW~e-------- 71 (165) T protein:vir:99 1 MTEPTPTEPEPLLTAEDLAPFATIPKAKADEMIEDALGMAEVHAPCINDPGF-AHRRAAKAILRGAILRWNE-------- 71 (165) T ss_pred CCCCCCCCcceeeehhhccccccCCHHHHHHHHhhhhhhhhhhccccCCCCc-ccHHHHHHHHHHhhhhhhc-------- Confidence 7665321 1111122367889999999988776543 22222 2334444444444554421 Q ss_pred ccccccceeeeeeeceeEEeeecCccchh-hhhcCHHHHHHHHHHHHhC-CC-----CeEee---cCC----------CC Q lcl|NC_019509. 69 VESYTQRVASFSLSGEFSQTFQSTTGGDK-SLSATPWGEMYRALNRKKG-GG-----FGLIT---GLR----------RG 128 (131) Q Consensus 69 ~~~~~g~v~S~s~~G~vSvsy~~~~~~~~-~l~~T~YGq~y~~L~~~~g-~G-----~~l~~---g~~----------~~ 128 (131) ..+|.+++++ .|...+++|+-+.... + |=.+.-+|.|+|. 
-| |.+-+ |++ || T Consensus 72 --~GSGAit~~T-aGPf~qT~DtRs~r~~mf-----wPSEItqLqklC~~~g~~~~AFsIDt~p~g~v~Hs~~Cs~~fGg 143 (165) T protein:vir:99 72 --AGAGAATTKT-AGIYGQTVDTRQPRKAMF-----FPSEIDQLRKLCRPDDDNGGAFSIDLLPQETVTHAEICSIYFGG 143 (165) T ss_pred --ccCceeeecc-cccceeeecccccccccc-----ChhhHHHHHHHhcCCCCCCcceeeecccCCCcccccccceeecC Confidence 1145566666 4999999987554221 2 2236679999983 22 33333 333 44 Q ss_pred CCC Q lcl|NC_019509. 129 CCE 131 (131) Q Consensus 129 ~~~ 131 (131) +|. T Consensus 144 ~CS 146 (165) T protein:vir:99 144 GCS 146 (165) T ss_pred ccc Confidence 444 No 50 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=42.75 E-value=0.61 Score=21.75 Aligned_cols=88 Identities=15% Similarity=0.116 Sum_probs=43.6 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhC----C------CCCchHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVC----E------SKFGDDYDRALALYTLHLMTLEGALKTEKDSVE 70 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~----~------~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~ 70 (131) |.--|++.+|+--=-=.+..|+.|+.+|+-|..++. . .........|+.|+++|+-.=+-.... T Consensus 6 M~~vtLee~K~hLRid~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~~~~p~~ik~AiLllv~~~YenRE~~~~------ 79 (108) T protein:vir:19 6 LDVISLSLFKQQIEFEEDDRDELITLYAQAAFDYCMRWCDEPAWKVAADIPAAVKGAVLLVFADMFEHRTAQSE------ 79 (108) T ss_pred ccccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchHHHHHHHHHHHHHHhccccccc------ Confidence 777789988863222135799999999999987762 1 123355678999999998643211000 Q ss_pred ccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhC-----CCC Q lcl|NC_019509. 71 SYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKG-----GGF 119 (131) Q Consensus 71 ~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g-----~G~ 119 (131) ..+++- .+ ..+ ..+..++..| -|. 
T Consensus 80 ------------~~~~~~----~~-~~~--------LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:19 80 ------------VQLYEN----AA-AER--------MMFIHRNWRGKAESEEGS 108 (108) T ss_pred ------------chhhhh----HH-HHH--------HHHHHHhcCCCCCcccCC Confidence 000000 00 000 0000000000 000 No 51 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=42.75 E-value=0.61 Score=21.75 Aligned_cols=88 Identities=15% Similarity=0.116 Sum_probs=43.6 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhC----C------CCCchHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVC----E------SKFGDDYDRALALYTLHLMTLEGALKTEKDSVE 70 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~----~------~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~ 70 (131) |.--|++.+|+--=-=.+..|+.|+.+|+-|..++. . .........|+.|+++|+-.=+-.... T Consensus 6 M~~vtLee~K~hLRid~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~~~~p~~ik~AiLllv~~~YenRE~~~~------ 79 (108) T protein:vir:18 6 LDVISLSLFKQQIEFEEDDRDELITLYAQAAFDYCMRWCDEPAWKVAADIPAAVKGAVLLVFADMFEHRTAQSE------ 79 (108) T ss_pred ccccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchHHHHHHHHHHHHHHhccccccc------ Confidence 777789988863222135799999999999987762 1 123355678999999998643211000 Q ss_pred ccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhC-----CCC Q lcl|NC_019509. 71 SYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKG-----GGF 119 (131) Q Consensus 71 ~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g-----~G~ 119 (131) ..+++- .+ ..+ ..+..++..| -|. 
T Consensus 80 ------------~~~~~~----~~-~~~--------LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:18 80 ------------VQLYEN----AA-AER--------MMFIHRNWRGKAESEEGS 108 (108) T ss_pred ------------chhhhh----HH-HHH--------HHHHHHhcCCCCCcccCC Confidence 000000 00 000 0000000000 000 No 52 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=41.93 E-value=0.91 Score=20.82 Aligned_cols=110 Identities=17% Similarity=0.068 Sum_probs=50.8 Q ss_pred CCHH-HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCC--CC-----------ch----HHHHHHHHHHHHHHHHhhhh Q lcl|NC_019509. 1 MNEN-ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCES--KF-----------GD----DYDRALALYTLHLMTLEGAL 62 (131) Q Consensus 1 m~~~-ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~--~~-----------g~----~~~~a~~l~~AHll~l~~~~ 62 (131) |+.- |++++.+++-++..=-.+.++.+|++|..+|-.. .+ ++ ..++....+++-.|.. T Consensus 1 m~~fAtv~Dv~~r~r~L~~~E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~~~V~Ral~~---- 76 (132) T protein:vir:16 1 MNPFATVDDLTMLWRPLKGDEKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTVDIVARTLMT---- 76 (132) T ss_pred CCccCCHHHHHHHhcCCCHhHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHHHHHHHHhcC---- Confidence 7765 8999999886543333458999999999888211 01 11 1122222333322211 Q ss_pred ccccccccccccce-eeeeeecee--EEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEee-cCCC Q lcl|NC_019509. 63 KTEKDSVESYTQRV-ASFSLSGEF--SQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLIT-GLRR 127 (131) Q Consensus 63 ~~~~~~~~~~~g~v-~S~s~~G~v--Svsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~-g~~~ 127 (131) . .+. -|.. .|++ .|.. |.+|.++++.- -.|. .-|.++...+.+++.+. 
.|-- T Consensus 77 -~-~~~----~G~tq~S~T-aG~ys~S~t~~~p~G~l---ylt~---~e~~~LG~~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 77 -S-TDQ----EPMTQTTES-ALGYSVSGSYLVPGGGL---FIKN---SELSRLGLKKQRFGVIDFYGND 132 (132) T ss_pred -C-CCC----CCceeeeee-ccchheeeeeecCCCcc---eeCh---HHHHhhCCCCCceEEEeecCCC Confidence 1 010 1111 2333 4766 55676665421 1121 22333343444434432 3322 No 53 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=40.64 E-value=0.96 Score=20.68 Aligned_cols=90 Identities=18% Similarity=0.154 Sum_probs=51.2 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC----CC----------------------CchHHHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCE----SK----------------------FGDDYDRALALYTLH 54 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~----~~----------------------~g~~~~~a~~l~~AH 54 (131) ||=-|++.+|.--=-=.+.+|+.|+.+|+-|+.++.. +. .....+.|+.|+++| T Consensus 5 m~~vtL~e~K~hLRvd~d~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~~~~i~~AvLllvg~ 84 (120) T protein:vir:10 5 TPIVSLEVALAHLREDAGVADDLIKIYIGAATQSASDYVDRKLYANDAEMQAAVADATAGADPIVANDAIRAAILLTIGK 84 (120) T ss_pred CCccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchhhhccccccccccCCHHHHHHHHHHHHH Confidence 7766788888632211257899999999999887742 10 134567899999999 Q ss_pred HHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeE Q lcl|NC_019509. 55 LMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGL 121 (131) Q Consensus 55 ll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l 121 (131) +=.-+.... .|.. 
.-.+.-|+| +..|+..++.++++ T Consensus 85 ~YenRe~~~------------------~~~~-----------~~~~~lP~~--v~~Ll~~yR~~~gv 120 (120) T protein:vir:10 85 LYAFREDVV------------------SGAS-----------ASVTELPSG--AKSLLFPYRVGLGV 120 (120) T ss_pred HHhchhhhh------------------hccc-----------ccccccCHH--HHHHHHHhhhccCC Confidence 865321100 0100 001122344 34466655555555 No 54 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=40.62 E-value=0.96 Score=20.67 Aligned_cols=96 Identities=13% Similarity=0.118 Sum_probs=49.2 Q ss_pred CCHHHHHHHHHhhhh--h---c-------------CCCHHHHHHHHHHHHHHhCC---CCC-------chHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPP--M---K-------------KIPDETIEAWVEMAKLFVCE---SKF-------GDDYDRALALYT 52 (131) Q Consensus 1 m~~~ti~~Fr~~~P~--F---~-------------~~pD~~i~~~l~~A~~~v~~---~~~-------g~~~~~a~~l~~ 52 (131) |+.-|+++++++|++ + . .+.++.|+..|++|..+|+. .|+ +....+...-++ T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 999999999999985 2 1 25678899999999998873 333 345555555666 Q ss_pred HHHHHHhh----------------------hhccccccccccccceeeeeeeceeEEeeecCcc----chhh Q lcl|NC_019509. 53 LHLMTLEG----------------------ALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTG----GDKS 98 (131) Q Consensus 53 AHll~l~~----------------------~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~----~~~~ 98 (131) .|+|..+- ....+....+......++++ +.+.+.-..... 
-.+| T Consensus 81 rY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~--~~~~v~~~~r~f~r~~l~gf 150 (150) T protein:vir:10 81 RHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEP--GEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCCCCCCCC--ceeeeecCCCccChhhccCC Confidence 66664310 01111111111111222211 223332111000 1133 No 55 >protein:vir:4702 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061634;genbank:gi:9635721;genbank:GeneID:1263015 Probab=39.76 E-value=1 Score=20.58 Aligned_cols=97 Identities=14% Similarity=0.134 Sum_probs=45.6 Q ss_pred CCHHH--HHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCC------CC------chHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_019509. 1 MNENI--LLIIRQLAP-PMKKIPDETIEAWVEMAKLFVCES------KF------GDDYDRALALYTLHLMTLEGALKTE 65 (131) Q Consensus 1 m~~~t--i~~Fr~~~P-~F~~~pD~~i~~~l~~A~~~v~~~------~~------g~~~~~a~~l~~AHll~l~~~~~~~ 65 (131) |.-++ ++.+|.--= +| +..|+.|+.+|+-|..+|... +. ....+.|+.++++|+-.=+...... T Consensus 1 M~vt~~dLeeiK~~LRID~-d~DD~li~~~i~AA~~~I~~ai~~~~~~~~~~~~~~~~~~~AvllLv~~~YeNR~a~~~~ 79 (113) T protein:vir:47 1 MQLTAEELKLLKKHCKIDH-NSEDDLLEIYYSWAFHEIASAVTDEPSKYIDWFKSHPLFARAIYPLASYYFENRIAYLDR 79 (113) T ss_pred CcccHHHHHHHHHHhCCCC-CcchHHHHHHHHHHHHHHHhhccccccccccccCCchHHHHHHHHHHHHHHhhhhhcccc Confidence 77665 777776321 22 468999999999999888321 11 2367899999999987644321110 Q ss_pred cccccccccceeeeeeeceeEEeeec-CccchhhhhcC Q lcl|NC_019509. 66 KDSVESYTQRVASFSLSGEFSQTFQS-TTGGDKSLSAT 102 (131) Q Consensus 66 ~~~~~~~~g~v~S~s~~G~vSvsy~~-~~~~~~~l~~T 102 (131) .....+-.|.| +-..+.-.|.. 
....++-=..| T Consensus 80 --~~~~vp~~v~s--li~qlR~~y~~~~~~~~~~~~~~ 113 (113) T protein:vir:47 80 --DLSLAPHMVLS--TVHKLRGSFEQFLESENDEESGT 113 (113) T ss_pred --ccccccHHHHH--HHHHHHHHHHHHhhhcCCCCCCC Confidence 00000000000 00011111110 00000000122 No 56 >protein:vir:100211 Length: 114 # NCBI annotation: Hypothetical protein # Family: family:all:6491 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025032;genbank:gi:48697265;genbank:GeneID:2948309 Probab=38.73 E-value=0.98 Score=20.64 Aligned_cols=91 Identities=12% Similarity=0.165 Sum_probs=46.1 Q ss_pred CCHHH-------HHHHHHhhh-hhcCCCHHHHHHHHHHHHHHhCCC--------CCc--hHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019509. 1 MNENI-------LLIIRQLAP-PMKKIPDETIEAWVEMAKLFVCES--------KFG--DDYDRALALYTLHLMTLEGAL 62 (131) Q Consensus 1 m~~~t-------i~~Fr~~~P-~F~~~pD~~i~~~l~~A~~~v~~~--------~~g--~~~~~a~~l~~AHll~l~~~~ 62 (131) |+.+| ++.+|.--= .| +-+|+.|+.+|+-|+.+|... .+. .....|+.+|++|+-.-+... T Consensus 1 ~~~~~~~~~~vtLeevK~~LRID~-ddDD~lI~~lI~aA~~yI~~aig~~~~~~~~~~~~~~~~Avl~Lv~~~YeNR~~~ 79 (114) T protein:vir:10 1 MADETADTVGVTADDMQSYLNLDS-DGDASILEGLISTAESAVMNAIDDTIAVEVYRTYPLFNQAVRVLVDFMYYSRGTL 79 (114) T ss_pred CCCcccccccccHHHHHHHhCCCC-ccchHHHHHHHHHHHHHHHHhhCCCCCcccccCchhHHHHHHHHHHHHHhhhhhh Confidence 88774 566665211 12 469999999999999998421 111 356789999999987544322 Q ss_pred ccccccccccccceeeeeeeceeEEee----ecC-ccchh Q lcl|NC_019509. 63 KTEKDSVESYTQRVASFSLSGEFSQTF----QST-TGGDK 97 (131) Q Consensus 63 ~~~~~~~~~~~g~v~S~s~~G~vSvsy----~~~-~~~~~ 97 (131) ..+. 
...+-.|.|- --.+.-+| .+- ...++ T Consensus 80 ~~~~---~~vp~~v~sl--I~qLR~~~~~d~~~~~~~~d~ 114 (114) T protein:vir:10 80 SDQS---KAYPPSYAYM--INSIRWKIQRDQAAKAGGNDG 114 (114) T ss_pred cccc---ccccHHHHHH--HHHHHHHhhhhhhhhccCCCC Confidence 1111 1111111110 00111111 111 12344 No 57 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=36.20 E-value=1.2 Score=20.18 Aligned_cols=96 Identities=13% Similarity=0.135 Sum_probs=49.4 Q ss_pred CCHHHHHHHHHhhhh--h----------------cCCCHHHHHHHHHHHHHHhCC---CCC-------chHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPP--M----------------KKIPDETIEAWVEMAKLFVCE---SKF-------GDDYDRALALYT 52 (131) Q Consensus 1 m~~~ti~~Fr~~~P~--F----------------~~~pD~~i~~~l~~A~~~v~~---~~~-------g~~~~~a~~l~~ 52 (131) |+.-|+++++++|++ + ..+.++.|+..|++|..+|+. .|+ +....+...-++ T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 999999999999985 2 125678899999999999973 333 345555555666 Q ss_pred HHHHHHhhh----------------------hccccccccccccceeeeeeeceeEEeeecCccc----hhh Q lcl|NC_019509. 53 LHLMTLEGA----------------------LKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGG----DKS 98 (131) Q Consensus 53 AHll~l~~~----------------------~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~----~~~ 98 (131) .|+|.-+-. 
...+.-..+......++++ |.+.+.-....-+ .+| T Consensus 81 ~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~--~~~~v~~~~r~f~r~~l~g~ 150 (150) T protein:vir:79 81 RHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEP--GEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCccCCCCC--CceeeecCCCccChhhccCC Confidence 666543100 0111111111111222221 2333331111000 133 No 58 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=33.96 E-value=1.1 Score=20.32 Aligned_cols=89 Identities=15% Similarity=-0.026 Sum_probs=48.0 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh----CCC--------------CCchHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFV----CES--------------KFGDDYDRALALYTLHLMTLEGAL 62 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v----~~~--------------~~g~~~~~a~~l~~AHll~l~~~~ 62 (131) |.--|++++|+--==-.+..|+.|+.+|+-|+.++ +.. ......+.|+.++++|+-.=+... T Consensus 2 m~~vtLeevK~hLRId~d~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~~AvLlLv~~~YenRe~~ 81 (108) T protein:vir:93 2 TALLTLEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMKGAAMRLTGMLYRNPDLA 81 (108) T ss_pred CcCCCHHHHHHHcCCCCCcChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHHHHHHHHHHHHHhccccc Confidence 77778998887433222568999999999997766 211 112346889999999986421100 Q ss_pred ccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe Q lcl|NC_019509. 63 KTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI 122 (131) Q Consensus 63 ~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~ 122 (131) + +++++. +..|+|. ..|+..++. 
|.|+ T Consensus 82 ---------------~---~~~~~~------------~elP~~v--~~Ll~~~R~-p~~~ 108 (108) T protein:vir:93 82 ---------------E---REELLQ------------GELPFSV--SVLIYDLRC-PTVL 108 (108) T ss_pred ---------------c---cccccc------------ccCCHHH--HHHHHHccc-cccC Confidence 0 011110 1123332 223333332 4554 No 59 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=33.14 E-value=1.4 Score=19.82 Aligned_cols=114 Identities=20% Similarity=0.132 Sum_probs=52.7 Q ss_pred CCHH-HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCC-------------CCCchHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_019509. 1 MNEN-ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCE-------------SKFGDDYDRALALYTLHLMTLEGALKTEK 66 (131) Q Consensus 1 m~~~-ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~-------------~~~g~~~~~a~~l~~AHll~l~~~~~~~~ 66 (131) |+.- |++++.+++.++..=..++++..|++|..+|-. ..+++..+-.+.-.++-...- - +..+. T Consensus 1 m~~fAtv~Dl~~r~r~L~~dE~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~V~~~~V~R-a-l~~~~ 78 (132) T protein:vir:94 1 MNPFATVDDLTMLWRPLKGDEKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKSVTVDIVAR-T-LMTST 78 (132) T ss_pred CCCcCCHHHHHHHhccCChhHHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHHHHHHHHHH-H-hcCCC Confidence 7665 899999999777655568899999999988831 123333333232232222111 0 11110 Q ss_pred ccccccccce-eeeeeecee--EEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEe-ecCCC Q lcl|NC_019509. 67 DSVESYTQRV-ASFSLSGEF--SQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLI-TGLRR 127 (131) Q Consensus 67 ~~~~~~~g~v-~S~s~~G~v--Svsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~-~g~~~ 127 (131) +. .|.. .|++ .|.. 
|.+|.+++++ +-.|.- -|.++...+.+.+.+ ..|-- T Consensus 79 ~~----~g~tq~S~T-aG~ys~S~T~~np~G~---lylt~~---e~~~LGl~~~r~~~i~~~~~~ 132 (132) T protein:vir:94 79 DQ----EPMTQTTES-ALGYSVSGSYLVPGGG---LFIKNS---ELSRLGLKKQRFGVIDFYGND 132 (132) T ss_pred CC----CCceeeeee-cccceeeeeeecCCCC---ceeChH---HHHhhCCCCCceEEEeecCCC Confidence 10 0111 2333 4655 6667766553 222221 122222222222222 12222 No 60 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=31.67 E-value=1.5 Score=19.65 Aligned_cols=117 Identities=17% Similarity=0.121 Sum_probs=55.4 Q ss_pred CCHHHHHHHHHhhhhhc----CCCHHHHHHHHHHHHHHhCCC-CC-ch-----------------------------HHH Q lcl|NC_019509. 1 MNENILLIIRQLAPPMK----KIPDETIEAWVEMAKLFVCES-KF-GD-----------------------------DYD 45 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~----~~pD~~i~~~l~~A~~~v~~~-~~-g~-----------------------------~~~ 45 (131) =..-+++++++.+-... ...|+..+..|-.|..+|+.. +| |+ ... T Consensus 16 nSYvtv~~a~aY~~~rg~~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~~~~IP~~v~ 95 (172) T protein:vir:97 16 NAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVK 95 (172) T ss_pred cccccHHHHHHHHHhcCcccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccccccccHHHH Confidence 11125777776554432 234677888888999888853 44 21 112 Q ss_pred HHHHHHHHHHHHHhhhhcccccccccccc--ceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHH---hCCCCe Q lcl|NC_019509. 46 RALALYTLHLMTLEGALKTEKDSVESYTQ--RVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRK---KGGGFG 120 (131) Q Consensus 46 ~a~~l~~AHll~l~~~~~~~~~~~~~~~g--~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~---~g~G~~ 120 (131) .|..-++. ..|++...... ......+ .+.++++ |+|+++|...+...+. ++.|..- .+|++. 
.+.| + T Consensus 96 ~A~~elA~--~al~~~l~~d~-~~~~~~~~v~~kr~kv-g~i~~~y~~~~~~~~~--~p~~~~v-~aLL~p~gl~~~~-~ 167 (172) T protein:vir:97 96 EACAEYAL--RALAAELNPDP-ERNASGVAVLSKSEAV-GPISESVTFVGGAVFQ--MPKYPAA-DQKLVRAGLVRSG-G 167 (172) T ss_pred HHHHHHHH--HHHhccccccc-ccccccccceeeeeee-cceeeEeeccCCCCCc--cccHHHH-HHHHhhhccccCc-c Confidence 22222222 22322211110 0112223 3456664 9999999765544332 3444432 555543 3333 4 Q ss_pred EeecC Q lcl|NC_019509. 121 LITGL 125 (131) Q Consensus 121 l~~g~ 125 (131) .++.| T Consensus 168 ~~~r~ 172 (172) T protein:vir:97 168 TLLRG 172 (172) T ss_pred eeccC Confidence 44433 No 61 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=28.39 E-value=1.8 Score=19.25 Aligned_cols=97 Identities=9% Similarity=0.004 Sum_probs=47.6 Q ss_pred CCHHHHHHHHHhhhh-----hc-------CCCHHHHHHHHHHHHHHhCC---CCC-------chHHHHHHHHHHHHHHHH Q lcl|NC_019509. 1 MNENILLIIRQLAPP-----MK-------KIPDETIEAWVEMAKLFVCE---SKF-------GDDYDRALALYTLHLMTL 58 (131) Q Consensus 1 m~~~ti~~Fr~~~P~-----F~-------~~pD~~i~~~l~~A~~~v~~---~~~-------g~~~~~a~~l~~AHll~l 58 (131) |+.-|++++.++|++ +. .++++.|+..|++|..+|+. .|+ +....+...-++.|+|.- T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~~P~~L~~~a~dIA~Y~L~~ 80 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGYLAARFVLPLTVVPSLLKRQCCVVAWFYLNE 80 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHhc Confidence 999999999999985 22 26778999999999999973 333 233444444445555432 Q ss_pred h------------------hhhccccccccc-cccceeeeeeeceeEEeeecCc---cchhhh Q lcl|NC_019509. 59 E------------------GALKTEKDSVES-YTQRVASFSLSGEFSQTFQSTT---GGDKSL 99 (131) Q Consensus 59 ~------------------~~~~~~~~~~~~-~~g~v~S~s~~G~vSvsy~~~~---~~~~~l 99 (131) + ... .+....+- ..+..+..+ .+.+.++-.... 
..-+|+ T Consensus 81 ~~~~e~i~~rY~~Ai~~L~~Ia-~Gk~~Lg~~~~~~~~~~~-~~~~~~~~~~r~f~r~~~G~~ 141 (141) T protein:vir:19 81 SQPTEQITATYRDTVRWLEQVR-DGKTDPGVESRTAASPEG-EDLVQVQSDPPVFSRKQKGFI 141 (141) T ss_pred CCCChHHHHHHHHHHHHHHHHh-cCccccCCCCCCCCCCCC-CceeEeecCCcccCcccccCC Confidence 1 111 11111110 000011111 122333211110 113455 No 62 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=26.40 E-value=1.9 Score=18.99 Aligned_cols=117 Identities=13% Similarity=0.052 Sum_probs=52.4 Q ss_pred CCHH-HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCCCC--------------c--------hHH----HHHHHHHHH Q lcl|NC_019509. 1 MNEN-ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCESKF--------------G--------DDY----DRALALYTL 53 (131) Q Consensus 1 m~~~-ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~~~--------------g--------~~~----~~a~~l~~A 53 (131) |..= |-++|...-++ .+.++..+..+..|...||.-.. + ..+ .+|+++-+ T Consensus 1 ~~pYLTy~ef~~lg~~--~~~~d~F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~~~r~~~vKkA~a~QI- 77 (144) T protein:vir:79 1 MKPYLTTSDFEKLGYE--LKKPDNFGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYLFRQAMAFKKAVALEM- 77 (144) T ss_pred CCcccchhhhhhhCCC--CcchhhhhhHHHHHHHHhhhhhhhhccccccccccccccccchhhhhHHHHHHHHHHHHHH- Confidence 5433 45555444333 24567788899999888875221 0 111 23333322 Q ss_pred HHHHHhhhhccccccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEeecCCCCC Q lcl|NC_019509. 54 HLMTLEGALKTEKDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLITGLRRGC 129 (131) Q Consensus 54 Hll~l~~~~~~~~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~g~~~~~ 129 (131) +++...|..... ....+.++|.++ |..|||+.+.+....--+.+.--..=+.++...|.. 
-.||.-- T Consensus 78 eY~~~~G~~sa~----e~~~~~~~S~sv-Grtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLL----YrGV~s~ 144 (144) T protein:vir:79 78 LFLEDSGYSSAY----DVAQGALNSFTV-GHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLL----FSGVASL 144 (144) T ss_pred HHHHHcCCcchh----hhhcCccceeEe-cceEEeecCCCccccccccccccHHHHHHHhhcCcc----ccccccC Confidence 233332322211 112344667775 999999975443221111111122333444444322 1222222 No 63 >protein:vir:80668 Length: 153 # NCBI annotation: gp7 # Family: family:all:7267 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285583;genbank:gi:148727089;genbank:GeneID:5247039 Probab=25.93 E-value=2 Score=18.93 Aligned_cols=112 Identities=23% Similarity=0.297 Sum_probs=59.3 Q ss_pred CCHHHHHHHHHhhhhhcCCCHHHHHHHHHHHHHHh-------CCCCCchHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_019509. 1 MNENILLIIRQLAPPMKKIPDETIEAWVEMAKLFV-------CESKFGDDYDRALALYTLHLMTLEGALKTEKDSVESYT 73 (131) Q Consensus 1 m~~~ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v-------~~~~~g~~~~~a~~l~~AHll~l~~~~~~~~~~~~~~~ 73 (131) |..++- -+.-+.|+++|.+.++..|++|.... ++..| +..+.|-..+.--+|..+. .+.+ T Consensus 1 m~v~i~---~~Dl~pF~dI~~~k~~ami~D~~a~A~~vAPCi~~~~f-~~~~aAKaIlrgAiLRW~e---------~G~S 67 (153) T protein:vir:80 1 MGIILK---PEDIEPFADIPREKLEAMIADVEAVAVSVAPCIAKPDF-KYKDAAKAILRRALLRWND---------TGVS 67 (153) T ss_pred Cceeec---hhhccccccCCHHHHHHHHHhhhhhhhhhccccCCCCc-ccHHHHHHHHHHHhhhhhh---------cCcc Confidence 764421 13456788888888888877765432 22222 2334444444444554421 1235 Q ss_pred cceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHhC----CCCeEeec-CCCCC--------------CC Q lcl|NC_019509. 74 QRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKKG----GGFGLITG-LRRGC--------------CE 131 (131) Q Consensus 74 g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~g----~G~~l~~g-~~~~~--------------~~ 131 (131) |.+++++ .|...+++++-+.-.=+ |=.|--+|.|++. .|..+.+= -.+++ |. 
T Consensus 68 Gait~~t-aGpf~qT~dtrs~r~lf-----wPSEItqLqklC~~~~~~g~Af~id~t~~~~v~Hs~~Cs~~fGg~CS 138 (153) T protein:vir:80 68 GQVQYES-AGPFAQTTRSNTPTNLL-----WPSEIAALKKLCEGDGGAGKAFTITPTMRSSVNHSEVCSTVWGEGCS 138 (153) T ss_pred cceeeec-cccceeeeccCCceecc-----ChhhHHHHHHHhcCCCCCcceeEeecCCCCccccccccceeecCccc Confidence 5677776 48988888765432111 2236668999983 33334332 22333 33 No 64 >protein:vir:9928 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795690;genbank:gi:28876458;genbank:GeneID:1258013 Probab=23.54 E-value=2.3 Score=18.61 Aligned_cols=99 Identities=16% Similarity=0.091 Sum_probs=50.7 Q ss_pred CCHHH-HHHHHHhhhhh---cCCCHHHHHHHHHHHHHHhCC----------CCCchHHHHH-HHHHHHHHHHHhhhhccc Q lcl|NC_019509. 1 MNENI-LLIIRQLAPPM---KKIPDETIEAWVEMAKLFVCE----------SKFGDDYDRA-LALYTLHLMTLEGALKTE 65 (131) Q Consensus 1 m~~~t-i~~Fr~~~P~F---~~~pD~~i~~~l~~A~~~v~~----------~~~g~~~~~a-~~l~~AHll~l~~~~~~~ 65 (131) |.+++ +++.|.+ ... ...-|+.|+.+|+.|...|.. ...++.++.. ...-++++ +..+++ T Consensus 1 md~~~~L~~vK~~-lgI~~~D~~~D~lL~~~i~~a~~~i~~~l~~~~~~~~~eiP~~l~~iv~evav~ry----NR~g~E 75 (118) T protein:vir:99 1 MGDKQLIDDIKLF-IGISKGDGAQDELITLAIYESKERVLAKLNEYSETEITKIPDRLRFIVRDVAIKRF----NRINSE 75 (118) T ss_pred CchhhHHHHHHHH-hCCCCCchhhHHHHHHHHHHHHHHHHHHhccccccchhhhhHHHHHHHHHHHHHHh----cCcCCc Confidence 99985 8888764 222 234589999999999977632 1234333332 23333444 223332 Q ss_pred cccccccccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHH---hCCC-CeEe Q lcl|NC_019509. 66 KDSVESYTQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRK---KGGG-FGLI 122 (131) Q Consensus 66 ~~~~~~~~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~---~g~G-~~l~ 122 (131) ..+|.|. +-+|+||++. - .+|=-.+.+.++. .+.| .++. 
T Consensus 76 ---------G~~S~Se-eG~S~sf~~d-~-------~ey~~~l~~~~~~~~~~~~g~v~Fi 118 (118) T protein:vir:99 76 ---------GAVEDSE-EGKTFKWDSY-L-------KEYESTLRSAAIGKVYSGKGVARFI 118 (118) T ss_pred ---------ccceeec-CCeeeeeccC-c-------hhHHHHHHHHhhhcccCcCcceeeC Confidence 2367785 5689999632 1 1222233332221 1122 1233 No 65 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=21.47 E-value=2.6 Score=18.31 Aligned_cols=113 Identities=16% Similarity=0.123 Sum_probs=51.4 Q ss_pred CCHH-HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhCCC---------CCc---hHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_019509. 1 MNEN-ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVCES---------KFG---DDYDRALALYTLHLMTLEGALKTEKD 67 (131) Q Consensus 1 m~~~-ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~~~---------~~g---~~~~~a~~l~~AHll~l~~~~~~~~~ 67 (131) |+.- |++++.+++.++..=-.++++.+|++|..+|-.. .|- ...+..+...++-...- - +....+ T Consensus 1 m~~fAtv~D~~~rwr~Lt~~E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~~V~R-a-l~~~~~ 78 (131) T protein:vir:95 1 MENFATVEDLKKLWRALKFDEEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVDVVAR-T-LMTSTD 78 (131) T ss_pred CCccCCHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHHHHHH-H-hcCCCC Confidence 7765 8999999997665445568999999999887321 110 11122233333222111 0 111111 Q ss_pred cccccccce-eeeeeecee--EEeeecCccchhhhhcCHHHHHHHHHHHHhCCCCeEee--cCC Q lcl|NC_019509. 68 SVESYTQRV-ASFSLSGEF--SQTFQSTTGGDKSLSATPWGEMYRALNRKKGGGFGLIT--GLR 126 (131) Q Consensus 68 ~~~~~~g~v-~S~s~~G~v--Svsy~~~~~~~~~l~~T~YGq~y~~L~~~~g~G~~l~~--g~~ 126 (131) . .|.. .|++ .|.. |.+|.++++.-. .|. .-|+++...+-+.+.+. 
|.- T Consensus 79 ~----~G~tq~S~T-aG~ys~S~t~~~p~g~ly---lt~---~e~~~LGl~~~r~~~i~~~~~~ 131 (131) T protein:vir:95 79 Q----EPMTQVAES-ALGYSFSGSYLVPGGGLF---IKD---SELKRLGLKKQRYGVIDIYGTD 131 (131) T ss_pred C----CCceeeeee-cccceeeeeeecCCCCce---eCh---HHHHHhCCCCCceeEEeeccCC Confidence 1 1111 2344 4766 556766654311 111 12333333332222221 222 No 66 >protein:vir:9877 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:2716 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795639;genbank:gi:28876402;genbank:GeneID:1257933 Probab=20.91 E-value=2.7 Score=18.23 Aligned_cols=101 Identities=14% Similarity=0.184 Sum_probs=59.9 Q ss_pred CCHH---HHHHHHHhhhhhcCCCHHHHHHHHHHHHHHhC----CCCCchHHHH-HHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_019509. 1 MNEN---ILLIIRQLAPPMKKIPDETIEAWVEMAKLFVC----ESKFGDDYDR-ALALYTLHLMTLEGALKTEKDSVESY 72 (131) Q Consensus 1 m~~~---ti~~Fr~~~P~F~~~pD~~i~~~l~~A~~~v~----~~~~g~~~~~-a~~l~~AHll~l~~~~~~~~~~~~~~ 72 (131) |.++ ++++.|.+-.-=.+-.|+.|+.+|+.|...+. ...+++..+. ....-++|+=. ...+ T Consensus 1 m~~~~~~~L~~vK~~Lgi~d~~~D~lL~~ii~~~~~~i~~~l~~~~iP~~L~~Iv~ev~vkryNR----~g~E------- 69 (114) T protein:vir:98 1 MDETKQAIIDRVRVRLADETSLKEELLEELTQTAIDRINLKVGDVVFNPLFNSIAVDVVVKMYRR----MYFE------- 69 (114) T ss_pred CchhHHHHHHHHHHHhCCCCCchhhHHHHHHHHHHHHHHHhhCccccchHHHHHHHHHHHHHhcc----cCcc------- Confidence 8777 79999988776556789999999999987664 4445554443 33344455432 2222 Q ss_pred ccceeeeeeeceeEEeeecCccchhhhhcCHHHHHHHHHHHHh---CCC--CeEe Q lcl|NC_019509. 73 TQRVASFSLSGEFSQTFQSTTGGDKSLSATPWGEMYRALNRKK---GGG--FGLI 122 (131) Q Consensus 73 ~g~v~S~s~~G~vSvsy~~~~~~~~~l~~T~YGq~y~~L~~~~---g~G--~~l~ 122 (131) ..+|.|.+ -+|+||... -..+|--.+.+.++.. +.| .++. T Consensus 70 --G~~S~S~e-G~S~tf~dn-------df~ey~~~l~~y~~~~~~~~~g~~v~Fl 114 (114) T protein:vir:98 70 --GIDTEKAD-TISTKFIEN-------VLAEYGEELASYKKDRLAILNKKVVRFL 114 (114) T ss_pred --ccceeecc-ceeeeeecc-------ccchhHHHHHHHHhhhhhhhcCceeecC Confidence 24677854 579999643 1234555555554432 122 2222 Done!