Query lcl|NC_018848.1_cdsid_YP_006906951.1 [gene=SV1_9] [protein=hypothetical protein] [protein_id=YP_006906951.1] [location=7311..7727] Match_columns 138 No_of_seqs 25 out of 28 Neff 4.3 Searched_HMMs 1612 Date Thu Nov 7 13:03:46 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4788 Length: 130 # 99.9 4.9E-25 3E-28 153.9 10.9 128 4-134 1-130 (130) 2 protein:vir:98900 Length: 132 99.8 5.2E-24 3.3E-27 148.2 11.7 127 4-138 1-132 (132) 3 protein:vir:80967 Length: 131 99.8 1.2E-23 7.2E-27 146.3 11.4 127 4-138 1-130 (131) 4 protein:vir:43 Length: 131 # N 99.8 9.4E-24 5.8E-27 146.8 10.9 127 4-138 1-130 (131) 5 protein:vir:9821 Length: 138 # 99.8 2.2E-23 1.3E-26 144.8 10.3 130 1-134 3-138 (138) 6 protein:vir:79701 Length: 144 99.7 5.7E-21 3.5E-24 131.6 10.4 131 3-138 1-143 (144) 7 protein:vir:3034 Length: 111 # 99.3 3.1E-15 1.9E-18 100.1 6.6 105 27-134 1-111 (111) 8 protein:vir:9761 Length: 140 # 96.9 1.1E-05 6.6E-09 47.9 7.8 128 1-138 1-132 (140) 9 protein:vir:94761 Length: 132 96.7 2.6E-05 1.6E-08 45.7 8.4 122 1-137 1-132 (132) 10 protein:vir:1640 Length: 132 # 96.7 2.2E-05 1.4E-08 46.1 7.8 127 1-137 1-132 (132) 11 protein:vir:9576 Length: 131 # 96.5 3.7E-05 2.3E-08 44.9 8.2 122 1-137 1-131 (131) 12 protein:vir:94955 Length: 170 96.1 0.00011 7.1E-08 42.2 8.7 127 1-138 11-161 (170) 13 protein:vir:80389 Length: 172 95.9 0.00023 1.4E-07 40.5 9.4 133 1-135 12-172 (172) 14 protein:vir:95176 Length: 172 95.0 0.00036 2.3E-07 39.4 7.4 127 1-138 14-164 (172) 15 protein:vir:7773 Length: 123 # 94.7 0.00023 1.4E-07 40.5 5.7 118 4-138 1-122 (123) 16 protein:vir:78254 Length: 149 93.9 0.00082 5.1E-07 37.5 7.1 118 4-138 1-136 (149) 17 protein:vir:78478 Length: 149 93.9 0.00082 5.1E-07 37.5 7.1 118 4-138 1-136 (149) 18 protein:vir:104088 Length: 125 93.6 0.00066 4.1E-07 38.0 5.9 114 4-134 1-125 (125) 19 protein:vir:95004 Length: 169 93.1 0.0025 1.6E-06 34.8 8.3 126 1-138 12-162 (169) 20 protein:vir:98481 Length: 136 92.9 0.0019 1.1E-06 35.6 7.4 111 1-138 1-134 (136) 21 protein:vir:97267 Length: 172 92.9 0.006 3.8E-06 32.7 10.1 131 1-135 13-172 (172) 22 protein:vir:2432 Length: 124 # 92.5 0.001 6.3E-07 37.0 5.3 114 4-134 1-124 (124) 23 protein:vir:78383 Length: 169 92.2 0.004 2.5E-06 33.7 8.3 127 1-138 12-162 (169) 24 protein:vir:2505 Length: 128 # 92.2 0.00033 2.1E-07 39.7 2.3 114 1-133 2-128 (128) 25 protein:vir:78916 Length: 117 89.1 0.0043 2.7E-06 33.6 5.5 112 3-130 1-117 (117) 26 protein:vir:94064 Length: 167 88.4 0.017 1E-05 30.3 8.3 127 1-138 1-142 (167) 27 protein:vir:107756 Length: 147 87.3 0.035 2.2E-05 28.6 9.4 125 1-138 1-134 (147) 28 protein:vir:4228 Length: 125 # 84.0 0.013 8.3E-06 30.9 5.4 114 4-134 1-125 (125) 29 protein:vir:4458 Length: 107 # 75.1 0.1 6.2E-05 26.1 7.2 99 5-116 1-107 (107) 30 protein:vir:99002 Length: 158 73.4 0.17 0.00011 24.8 8.1 126 3-138 1-156 (158) 31 protein:vir:100245 Length: 113 72.2 0.12 7.2E-05 25.7 6.8 100 4-126 1-113 (113) 32 protein:vir:100103 Length: 120 70.6 0.2 0.00012 24.5 7.7 106 1-130 1-120 (120) 33 protein:vir:1887 Length: 108 # 70.4 0.15 9.4E-05 25.1 7.0 97 1-108 3-108 (108) 34 protein:vir:192 Length: 108 # 70.4 0.15 9.4E-05 25.1 7.0 97 1-108 3-108 (108) 35 protein:vir:102083 Length: 96 68.4 0.067 4.2E-05 27.0 4.6 93 5-108 1-96 (96) 36 protein:vir:105005 Length: 96 68.4 0.067 4.2E-05 27.0 4.6 93 5-108 1-96 (96) 37 protein:vir:107614 Length: 96 68.4 0.067 4.2E-05 27.0 4.6 93 5-108 1-96 (96) 38 protein:vir:102863 Length: 96 68.4 0.067 4.2E-05 27.0 4.6 93 5-108 1-96 (96) 39 protein:vir:81159 Length: 95 # 67.4 0.081 5E-05 26.6 4.8 88 4-103 1-95 (95) 40 protein:vir:102158 Length: 99 67.2 0.076 4.7E-05 26.7 4.7 87 5-102 1-99 (99) 41 protein:vir:4998 Length: 106 # 67.2 0.12 7.3E-05 25.7 5.7 102 5-125 1-106 (106) 42 protein:vir:4512 Length: 107 # 67.0 0.26 0.00016 23.8 7.9 99 5-126 1-107 (107) 43 protein:vir:486 Length: 107 # 64.4 0.25 0.00015 23.9 6.9 98 5-126 1-107 (107) 44 protein:vir:5256 Length: 119 # 62.8 0.33 0.0002 23.2 7.6 112 6-132 1-119 (119) 45 protein:vir:2345 Length: 125 # 59.4 0.21 0.00013 24.3 5.6 115 3-134 1-125 (125) 46 protein:vir:4857 Length: 104 # 59.1 0.22 0.00014 24.2 5.6 101 5-138 1-104 (104) 47 protein:vir:4954 Length: 104 # 58.7 0.17 0.0001 24.8 4.9 100 5-112 1-104 (104) 48 protein:vir:97069 Length: 115 58.1 0.36 0.00022 23.0 6.6 99 6-127 1-115 (115) 49 protein:vir:103846 Length: 138 57.3 0.44 0.00027 22.6 7.6 113 1-138 1-138 (138) 50 protein:vir:99570 Length: 153 56.4 0.46 0.00028 22.4 10.3 127 1-138 1-141 (153) 51 protein:vir:93592 Length: 108 55.9 0.47 0.00029 22.4 8.1 104 1-127 1-108 (108) 52 protein:vir:4831 Length: 105 # 55.5 0.26 0.00016 23.8 5.4 102 5-125 1-105 (105) 53 protein:vir:106739 Length: 158 47.5 0.7 0.00043 21.4 9.5 127 1-138 4-140 (158) 54 protein:vir:78595 Length: 158 47.5 0.7 0.00043 21.4 9.5 127 1-138 4-140 (158) 55 protein:vir:101559 Length: 158 47.3 0.7 0.00044 21.4 8.8 128 1-138 1-140 (158) 56 protein:vir:3639 Length: 158 # 47.3 0.7 0.00044 21.4 8.8 128 1-138 1-140 (158) 57 protein:vir:99222 Length: 138 47.1 0.71 0.00044 21.4 7.8 113 1-138 1-138 (138) 58 protein:vir:79253 Length: 138 47.1 0.71 0.00044 21.4 7.8 113 1-138 1-138 (138) 59 protein:vir:80668 Length: 153 43.2 0.85 0.00053 21.0 7.0 114 3-138 1-124 (153) 60 protein:vir:10365 Length: 115 41.3 0.93 0.00058 20.7 7.0 98 6-127 1-115 (115) 61 protein:vir:4602 Length: 110 # 40.7 0.37 0.00023 22.9 3.8 102 5-113 1-110 (110) 62 protein:vir:1384 Length: 92 # 40.1 0.6 0.00037 21.8 4.8 84 7-102 1-92 (92) 63 protein:vir:99922 Length: 165 37.0 1.1 0.00071 20.3 6.0 122 1-138 7-164 (165) 64 protein:vir:3871 Length: 99 # 35.9 0.37 0.00023 22.9 3.0 95 14-111 1-99 (99) 65 protein:vir:1271 Length: 115 # 34.5 1.3 0.0008 20.0 7.3 96 7-125 1-115 (115) 66 protein:vir:81255 Length: 180 32.9 1.4 0.00086 19.8 8.2 114 1-131 1-180 (180) 67 protein:vir:5742 Length: 110 # 32.5 1.4 0.00088 19.8 6.8 100 4-125 1-110 (110) 68 protein:vir:1993 Length: 141 # 29.7 1.6 0.001 19.4 7.5 116 1-127 1-141 (141) 69 protein:vir:81069 Length: 115 28.9 1.7 0.0011 19.3 7.6 99 6-127 1-115 (115) 70 protein:vir:96108 Length: 155 27.0 1.9 0.0012 19.1 9.8 125 4-138 1-155 (155) 71 protein:vir:100211 Length: 114 26.7 1.9 0.0012 19.0 7.6 96 1-106 1-114 (114) 72 protein:vir:79990 Length: 110 24.5 1.2 0.00074 20.1 3.7 99 5-113 1-110 (110) 73 protein:vir:98340 Length: 110 24.5 1.2 0.00074 20.1 3.7 99 5-113 1-110 (110) 74 protein:vir:99848 Length: 172 21.0 2.7 0.0017 18.2 7.0 119 3-138 1-172 (172) 75 protein:vir:107864 Length: 150 20.7 2.7 0.0017 18.2 7.0 116 4-138 1-150 (150) No 1 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=99.86 E-value=4.9e-25 Score=153.87 Aligned_cols=128 Identities=26% Similarity=0.216 Sum_probs=106.6 Q ss_pred eecccHHHHHHhcCCCCCcchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccccc Q lcl|NC_018848. 4 RVYATPAQLAQWTGEPAPADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTGAA 83 (138) Q Consensus 4 rvyAt~~~l~~~~g~~~p~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~~~ 83 (138) =-|+|-+++.++.+++ +++|++|++|||++||.+|+. +|+.+.+.-+.++.++.+||+|+|+||+|+-..|++..... T Consensus 1 M~YlT~eey~el~~~~-~~~F~kl~k~A~~~ID~~t~~-~y~~~~~~~~~~~~r~~~vK~A~a~QieY~~~~G~~s~~~~ 78 (130) T protein:vir:47 1 MTYLTQEEFDELDFDE-VTDFEKLAKRAKIAIDLYTNG-IYQKDIDFEKEIAYRKSAVKLAMAFQIAYLDASGIMSADDK 78 (130) T ss_pred CCCCchhhHhhcCCCC-hhhHHHHHHHHHHHHHHHhcc-cccccCCccCcchHHHHHHHHHHHHHHHHHHHhccccchhc Confidence 5799999999998886 678999999999999999984 47655555667788999999999999999999999867778 Q ss_pred cccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCC-C-CCccCCC Q lcl|NC_018848. 84 GRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGL-T-PGEIYPP 134 (138) Q Consensus 84 g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGL-l-~g~~~~~ 134 (138) ++.+|||||+.|+|.+....+...... .||..|+++|+.+|| | -||=|-. T Consensus 79 ~~~~S~svGrtSis~~~~~~~~~~~~~-~vs~da~~~L~~tGL~Ly~GV~yd~ 130 (130) T protein:vir:47 79 QLANSVSIGRTSISYSTSQSTLAGQRF-NLSMDAENALRQAGFSLVVGVAYDR 130 (130) T ss_pred cCcceeeecceeeecCcCccccccCCc-cccHHHHHHHHhcccccccCCCccC Confidence 999999999999997664444333333 489999999999999 4 5555555 No 2 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=99.84 E-value=5.2e-24 Score=148.22 Aligned_cols=127 Identities=23% Similarity=0.263 Sum_probs=103.1 Q ss_pred eecccHHHHHHhcCCCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCH-HHHHHHHHHHHHHHHHHHHhCCccc- Q lcl|NC_018848. 4 RVYATPAQLAQWTGEPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDP-AVAQALADAACAQVAYRQESGDTGT- 80 (138) Q Consensus 4 rvyAt~~~l~~~~g~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp-~v~~alr~AtcAQV~~~~~~G~~~t- 80 (138) =-|+|.+++++|.|+++|+ +|.||++|||++||++|. |.++..++++|+ .+++++|+|+|+||+|+.+.|...- T Consensus 1 M~Y~t~~~Y~~~~G~~i~e~~F~~l~~rAs~~ID~iT~---~ri~~~~~~~d~~~~~~~vk~A~c~qiey~~~~G~~sae 77 (132) T protein:vir:98 1 MPYLTYEEFMDLNGRDIDDKKFEKLLPKASAIIDGVTG---HFYQKVDMEKDNAWRVNQFKLALCAQIEYFDALGATTFE 77 (132) T ss_pred CCCCCHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHHhc---ccccCCCccccChHHHHHHHHHHHHHHHHHHhccchhhh Confidence 4799999999999988775 699999999999999999 667888999995 4888999999999999999998622 Q ss_pred ccccccceeeeCceeeccCcc-ccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCC-CC Q lcl|NC_018848. 81 GAAGRWSSVSIGPVSMSGPRQ-SAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGV-NW 138 (138) Q Consensus 81 ~~~g~~~s~sIG~~S~s~~~~-~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~-~~ 138 (138) ...+.++|+|+|+.|+|.+.. +..........+++.|+.||+++|||= -|| .| T Consensus 78 ~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGLLy-----rGV~~~ 132 (132) T protein:vir:98 78 EINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGLLF-----QGVKTW 132 (132) T ss_pred hccCccceeeeCcEEEEeeccCCcccccccccchHHHHHHHHhhcCCcc-----ccCCCC Confidence 357779999999999996432 223333334457899999999999975 344 35 No 3 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=99.83 E-value=1.2e-23 Score=146.33 Aligned_cols=127 Identities=26% Similarity=0.376 Sum_probs=102.8 Q ss_pred eecccHHHHHH-hcCCCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCH-HHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_018848. 4 RVYATPAQLAQ-WTGEPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDP-AVAQALADAACAQVAYRQESGDTGT 80 (138) Q Consensus 4 rvyAt~~~l~~-~~g~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp-~v~~alr~AtcAQV~~~~~~G~~~t 80 (138) =-|+|.+++++ |.|.++|+ +|.+|++|||++||++|. +.++..++.+++ .+++++|+|+|+|++|+.+.|+... T Consensus 1 M~Y~d~~~Y~~~y~G~~i~e~~F~~l~~rAs~~ID~~T~---~ri~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~g~~~~ 77 (131) T protein:vir:80 1 MPYTTLEFYTNEYAGEHLEQDEFAKLLKHAERKIDSVTF---YRIRKSGIEAFSEFIQHQIQLATCNQIEYFKEAGGTSE 77 (131) T ss_pred CCCCCHHHHHHhhCCCCCchhHHHHHHHHHHHHHHHHhc---ccccccccccCchhHHHHHHHHHHHHHHHHHHhhhhhh Confidence 47999999987 56777774 799999999999999999 667788888884 5889999999999999999998744 Q ss_pred ccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 81 GAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 81 ~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) ...++++|+|+|+.|+|....+.........++++.|..||+++|||= -||.. T Consensus 78 ~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLly-----rGV~~ 130 (131) T protein:vir:80 78 LAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAHTGLLY-----NGVGV 130 (131) T ss_pred hcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhccCCee-----cCCCC Confidence 457789999999999997544333333334458999999999999974 34444 No 4 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=99.83 E-value=9.4e-24 Score=146.83 Aligned_cols=127 Identities=26% Similarity=0.360 Sum_probs=103.4 Q ss_pred eecccHHHHHHh-cCCCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCC-HHHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_018848. 4 RVYATPAQLAQW-TGEPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTD-PAVAQALADAACAQVAYRQESGDTGT 80 (138) Q Consensus 4 rvyAt~~~l~~~-~g~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtd-p~v~~alr~AtcAQV~~~~~~G~~~t 80 (138) =-|+|.++++++ -|.++|+ +|.+|++|||++||++|. +.++..++.++ +.+++++|+|+|+|++|+.+.|+... T Consensus 1 M~Y~d~~~Y~~~y~g~~i~e~~F~~l~~rAs~~ID~~T~---~ri~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~g~~s~ 77 (131) T protein:vir:43 1 MPYTTLEFYNDEYAGEHLEQDEFDKLLKHAERKIDSVTF---YRIRKGGIESFSEFIQHQIQLATCNQIEYFKEAGGTSE 77 (131) T ss_pred CCCCCHHHHHHhhCCCCCCHhHHHHHHHHHHHHHHHHhc---ccccccCccccchhhHHHHHHHHHHHHHHHHHhHHHhh Confidence 479999999875 4687774 799999999999999999 66777788888 45889999999999999999998644 Q ss_pred ccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 81 GAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 81 ~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) ...++.+|+|+|+.|+|.+..+.........++++.|..+|+++|||= -||.. T Consensus 78 ~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLly-----rGV~~ 130 (131) T protein:vir:43 78 LAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAHTGLLY-----NGVGV 130 (131) T ss_pred hhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhccCCee-----cCCCC Confidence 456779999999999997554444444445668999999999999974 34444 No 5 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=99.82 E-value=2.2e-23 Score=144.84 Aligned_cols=130 Identities=21% Similarity=0.183 Sum_probs=102.7 Q ss_pred CCceecccHHHHHHhcCCCCCcchHHHHHHHHHHHHHHhcchheeeccCCCCCCH-HHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAPADAERLLTRASEDVDDALLTAVYDVDEAGMPTDP-AVAQALADAACAQVAYRQESGDTG 79 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp-~v~~alr~AtcAQV~~~~~~G~~~ 79 (138) |--=-|.|-+++.+..+++ |++|++|++|||++||.+|+ |-.+..+|.+|. ..+.+||+|+|+||+|+-+.|+.. T Consensus 3 ~~~M~YlT~eey~~l~~~~-~~dF~kllk~As~~ID~~t~---~~y~~~d~e~d~~~r~~~vKkA~a~QIeY~~~~G~ts 78 (138) T protein:vir:98 3 VVIIAFLTQKEFEDLGFDD-VEDFEKMEKRASHAVNLYCR---NRYDYKDLKKEIALVQKAVKRAIAYQIAYLNDSGVMT 78 (138) T ss_pred cccccccchHHHhccCCCC-hhhHHHHHHHHHHHhhhhhc---cccccccccchhHHHHHHHHHHHHHHHHHHHHcCCcc Confidence 5555699999999987776 56899999999999999999 445666888884 477789999999999999999875 Q ss_pred cccccccceeeeCceeecc--Cccccccchhhhhh--hHHHHHHHHhhcCCCC-CccCCC Q lcl|NC_018848. 80 TGAAGRWSSVSIGPVSMSG--PRQSAGGTGAGSVD--LGEQASRALARAGLTP-GEIYPP 134 (138) Q Consensus 80 t~~~g~~~s~sIG~~S~s~--~~~~as~~~~~a~~--ls~~a~~~L~~aGLl~-g~~~~~ 134 (138) ....+..+|||||+.|+|. +.++.+.......+ +|..|+++|+.+|||= ||=|-. T Consensus 79 ~~d~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~~A~~~L~~tGLLY~GV~yd~ 138 (138) T protein:vir:98 79 AEDKQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLDAENELLVVGLGYTGISYDR 138 (138) T ss_pred hhhccCcCceEeeeeEeecccccccccccccccccccccHHHHHHHhhcCcccccCcccC Confidence 5668889999999999983 22333333333333 8999999999999974 333333 No 6 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=99.74 E-value=5.7e-21 Score=131.58 Aligned_cols=131 Identities=21% Similarity=0.232 Sum_probs=99.6 Q ss_pred ceecccHHHHHHhcCCCCC-cchHHHHHHHHHHHHHHhcch--he---eeccCCCC---C-CHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 3 RRVYATPAQLAQWTGEPAP-ADAERLLTRASEDVDDALLTA--VY---DVDEAGMP---T-DPAVAQALADAACAQVAYR 72 (138) Q Consensus 3 ~rvyAt~~~l~~~~g~~~p-~~~~rLl~rAS~~VD~~t~~a--vY---dv~~~GlP---t-dp~v~~alr~AtcAQV~~~ 72 (138) -.-|.|-+++.+..|+... ++|++|++||+++||.+|+.. .| ++++++-- . .+..+++||+|+|+||+|+ T Consensus 1 ~~pYLTy~ef~~lg~~~~~~d~F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~~~r~~~vKkA~a~QIeY~ 80 (144) T protein:vir:79 1 MKPYLTTSDFEKLGYELKKPDNFGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYLFRQAMAFKKAVALEMLFL 80 (144) T ss_pred CCcccchhhhhhhCCCCcchhhhhhHHHHHHHHhhhhhhhhccccccccccccccccchhhhhHHHHHHHHHHHHHHHHH Confidence 4679999999888885544 569999999999999999952 23 44443211 1 1457799999999999999 Q ss_pred HHhCCccc--ccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 73 QESGDTGT--GAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 73 ~~~G~~~t--~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) -+.|+... .+.+.++|+|||+.|+|.+.++..+.......+++.|++||+.+|||=. ||.= T Consensus 81 ~~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLLYr-----GV~s 143 (144) T protein:vir:79 81 EDSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLLFS-----GVAS 143 (144) T ss_pred HHcCCcchhhhhcCccceeEecceEEeecCCCccccccccccccHHHHHHHhhcCcccc-----cccc Confidence 99998643 4588899999999999976655555555556799999999999999642 2222 No 7 >protein:vir:3034 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438147;genbank:gi:16271810;genbank:GeneID:929268 Probab=99.31 E-value=3.1e-15 Score=100.11 Aligned_cols=105 Identities=26% Similarity=0.306 Sum_probs=77.7 Q ss_pred HHHHHHHHHHHHhcchheeeccCCCCCCHH-HHHHHHHHHHHHHHHHHHhCCcccccccccceeeeCceeeccCccc-cc Q lcl|NC_018848. 27 LLTRASEDVDDALLTAVYDVDEAGMPTDPA-VAQALADAACAQVAYRQESGDTGTGAAGRWSSVSIGPVSMSGPRQS-AG 104 (138) Q Consensus 27 Ll~rAS~~VD~~t~~avYdv~~~GlPtdp~-v~~alr~AtcAQV~~~~~~G~~~t~~~g~~~s~sIG~~S~s~~~~~-as 104 (138) |+++|+.+||..|+. +|+.+ -|-+|.+ -+.+||+|+|.||+|+-..|+....-.+..+|||||..|+|...+. .+ T Consensus 1 L~k~A~~~Id~~t~~-fY~~~--dle~D~~~R~~~fK~Aia~QI~Yld~~G~~t~~d~~s~~SisvGrTsiS~~~~~~~~ 77 (111) T protein:vir:30 1 MEKRASHAVNLYCRN-RYDYK--DLKKEIALVQKAVKRAIAYQIAYLNDSGVMTAEDKQSFAGISLGRTSISYTVGHGQG 77 (111) T ss_pred CchhhHHHHhHhhch-hhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhccCcceeeecceeeeccCccCCC Confidence 999999999999973 67755 4445533 4578999999999999999997555588899999999999942221 11 Q ss_pred cchh---hhhhhHHHHHHHHhhcCCC-CCccCCC Q lcl|NC_018848. 105 GTGA---GSVDLGEQASRALARAGLT-PGEIYPP 134 (138) Q Consensus 105 ~~~~---~a~~ls~~a~~~L~~aGLl-~g~~~~~ 134 (138) .... .---||..|+++|..+||+ .||=|-. T Consensus 78 ~~~~~t~~~~~l~~da~n~L~~~Glly~GV~yd~ 111 (111) T protein:vir:30 78 SQQKTLADRFNLCLDAENELLVVGLGYTGISYDR 111 (111) T ss_pred CccccccccccchHHHHHHHHhhccccccccccC Confidence 1111 1123899999999999996 3444444 No 8 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=96.93 E-value=1.1e-05 Score=47.87 Aligned_cols=128 Identities=20% Similarity=0.158 Sum_probs=74.6 Q ss_pred CCceecccHHHHHH-hcCCCCCc---chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTGEPAPA---DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESG 76 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g~~~p~---~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G 76 (138) |+ .|||++||+. | +.--|+ -++.||..||++|+...-.+=++++ +-.|..++...++|.-+|+=|.-=...+ T Consensus 1 m~--~fATv~Dv~~rw-r~Lt~dE~~ra~~LL~dAS~~iR~~~p~~g~~~~-~~~~~~~~~~~~~k~V~~~mV~Ral~~~ 76 (140) T protein:vir:97 1 MG--NFATTDDVILLW-RPLSVDELKRANALLKVVSDTLRMEADKVGKDLD-KTMVDKPYFVNVIKSVTVDIVARTLMTS 76 (140) T ss_pred CC--cCCCHHHHHHHh-cCCCHhHHHHHHHHHHHHHHHHHHhhhhccCCcc-hhcccCccchhHHHHHHHHHHHHHhcCC Confidence 76 6999999999 6 633333 3667999999999976553223322 2345556777778888888777766667 Q ss_pred CcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) .++.+. ...|.+.|++|.|..-...+ .-..+.++-+..|.-.|-==|.|=+-|.+= T Consensus 77 ~d~~G~--tq~S~TaG~ys~S~T~~np~----G~lylt~~e~~~LGl~~~r~~~i~~~g~~~ 132 (140) T protein:vir:97 77 TQGEPM--SQESQSALGYTWSGTYLVPG----GGLFIKDNELKRLGLKKQRYGGIELYGEIK 132 (140) T ss_pred CCCCcc--eeeeeeccchhheeeeecCC----CCceeChHHHHHhCCCCCceeeecccCccc Confidence 665322 23456889998774222221 112355666666622221112222333322 No 9 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=96.70 E-value=2.6e-05 Score=45.72 Aligned_cols=122 Identities=20% Similarity=0.227 Sum_probs=69.5 Q ss_pred CCceecccHHHHHH-hcCCCCCc---chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHH-HHHHHHHHHHHHHh Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTGEPAPA---DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQAL-ADAACAQVAYRQES 75 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g~~~p~---~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~al-r~AtcAQV~~~~~~ 75 (138) |+ .|||++||+. | +.--|+ -+.-||..||++|+...-. ++...+...|.+|+..++. |.-+|+=|.-=... T Consensus 1 m~--~fAtv~Dl~~r~-r~L~~dE~~ra~~LL~dAs~~iR~~~~~-~~~~~~~~~~~~~d~~~~~~k~V~~~~V~Ral~~ 76 (132) T protein:vir:94 1 MN--PFATVDDLTMLW-RPLKGDEKERAEKLLEIVSDTLREEADK-VGRDLDVMISEKPSYFSSVVKSVTVDIVARTLMT 76 (132) T ss_pred CC--CcCCHHHHHHHh-ccCChhHHHHHHHHHHHHHHHHHHHHhh-hccccccccCCCCccchhHHHHHHHHHHHHHhcC Confidence 65 7999999999 7 522222 3667999999999854331 2344455667788876665 55556666555555 Q ss_pred CCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCC---c--cCCCCCC Q lcl|NC_018848. 76 GDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPG---E--IYPPGVN 137 (138) Q Consensus 76 G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g---~--~~~~~~~ 137 (138) |.++.+. ...|.+.|++|.|..-...+ .-..+.++-+..| ||-+- . +| |-| T Consensus 77 ~~~~~g~--tq~S~TaG~ys~S~T~~np~----G~lylt~~e~~~L---Gl~~~r~~~i~~~--~~~ 132 (132) T protein:vir:94 77 STDQEPM--TQTTESALGYSVSGSYLVPG----GGLFIKNSELSRL---GLKKQRFGVIDFY--GND 132 (132) T ss_pred CCCCCCc--eeeeeecccceeeeeeecCC----CCceeChHHHHhh---CCCCCceEEEeec--CCC Confidence 6554322 23456789887774222111 1133556656666 44210 0 12 112 No 10 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=96.67 E-value=2.2e-05 Score=46.08 Aligned_cols=127 Identities=19% Similarity=0.151 Sum_probs=72.7 Q ss_pred CCceecccHHHHHH-hcCCCCCc---chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHH-HHHHHHHHHHHHHHHh Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTGEPAPA---DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQ-ALADAACAQVAYRQES 75 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g~~~p~---~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~-alr~AtcAQV~~~~~~ 75 (138) |+ .|||++||+. | +.-.|+ -+..||.+||++|+..+-.....++. ....+|++.+ .+|.-+|+.|.-=..+ T Consensus 1 m~--~fAtv~Dv~~r~-r~L~~~E~~ra~~lL~dAs~~ir~~~p~~~~~l~a-~~~e~~~~~~~~~~~V~~~~V~Ral~~ 76 (132) T protein:vir:16 1 MN--PFATVDDLTMLW-RPLKGDEKERAEKLLEIVSDSLREEADKVGRDLYA-MIAEKPSYFASVVKSVTVDIVARTLMT 76 (132) T ss_pred CC--ccCCHHHHHHHh-cCCCHhHHHHHHHHHHHHHHHHHHhhhhhcccccc-ccccccccchhHHHHHHHHHHHHHhcC Confidence 65 6999999999 6 633333 36679999999999766433333332 2223466544 4688889999888887 Q ss_pred CCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCC Q lcl|NC_018848. 76 GDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVN 137 (138) Q Consensus 76 G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~ 137 (138) +.++.+ ....|.+.|++|.|..-...+ .-..+.+.-+..|.-.+===|.|==-|-| T Consensus 77 ~~~~~G--~tq~S~TaG~ys~S~t~~~p~----G~lylt~~e~~~LG~~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 77 STDQEP--MTQTTESALGYSVSGSYLVPG----GGLFIKNSELSRLGLKKQRFGVIDFYGND 132 (132) T ss_pred CCCCCC--ceeeeeeccchheeeeeecCC----CcceeChHHHHhhCCCCCceEEEeecCCC Confidence 766432 234467899998874222221 12335666666662211100011111222 No 11 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=96.54 E-value=3.7e-05 Score=44.89 Aligned_cols=122 Identities=25% Similarity=0.293 Sum_probs=73.6 Q ss_pred CCceecccHHHHHH-hcCCCCCc---chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTGEPAPA---DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESG 76 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g~~~p~---~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G 76 (138) |+ .|||++||+. | +.--|+ -++.||.+||.+|+...-..-.+++ .....+|.....+|+-+|+=|.-=...+ T Consensus 1 m~--~fAtv~D~~~rw-r~Lt~~E~~ra~~LL~~As~~ir~~~p~~~~~l~-~~~~~~~~~~~~~~~V~~~~V~Ral~~~ 76 (131) T protein:vir:95 1 ME--NFATVEDLKKLW-RALKFDEEKRAEALLEVVSHSLRVEAKKVGKDLD-GLVATDPSFTMVVKSVTVDVVARTLMTS 76 (131) T ss_pred CC--ccCCHHHHHHHh-cCCCHHHHHHHHHHHHHHHHHHHHhhhhccCCcc-ccccCCccchHHHHHHHHHHHHHHhcCC Confidence 65 6999999999 6 643343 3667999999999976542211111 2223347778888999998888877777 Q ss_pred CcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCC---Cc--cCCCCCC Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTP---GE--IYPPGVN 137 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~---g~--~~~~~~~ 137 (138) .++. +-...|.+.|++|.|..-...+ ....+.++-+..| ||-+ |. +| |-| T Consensus 77 ~~~~--G~tq~S~TaG~ys~S~t~~~p~----g~lylt~~e~~~L---Gl~~~r~~~i~~~--~~~ 131 (131) T protein:vir:95 77 TDQE--PMTQVAESALGYSFSGSYLVPG----GGLFIKDSELKRL---GLKKQRYGVIDIY--GTD 131 (131) T ss_pred CCCC--CceeeeeecccceeeeeeecCC----CCceeChHHHHHh---CCCCCceeEEeec--cCC Confidence 5543 2234578899998874222221 1233555666666 4321 01 22 222 No 12 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=96.12 E-value=0.00011 Score=42.19 Aligned_cols=127 Identities=17% Similarity=0.177 Sum_probs=74.8 Q ss_pred CCceecccHHHHHHh---cCC-----CCC-cchHHHHHHHHHHHHHHhc--c-------------hheeeccCCCCCCHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQW---TGE-----PAP-ADAERLLTRASEDVDDALL--T-------------AVYDVDEAGMPTDPA 56 (138) Q Consensus 1 ~~~rvyAt~~~l~~~---~g~-----~~p-~~~~rLl~rAS~~VD~~t~--~-------------avYdv~~~GlPtdp~ 56 (138) -|--.|+|.++.+.| .+. +.+ +..+++|.+|++.||+..+ + .=+.+++..+| +.. T Consensus 11 ~~AnSYvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~-~~~ 89 (170) T protein:vir:94 11 ITANSYVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLS-QVS 89 (170) T ss_pred CcccceecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccc-cch Confidence 455899999999998 331 223 2457899999999996421 0 00234555555 346 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCC Q lcl|NC_018848. 57 VAQALADAACAQVAYRQESGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGV 136 (138) Q Consensus 57 v~~alr~AtcAQV~~~~~~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~ 136 (138) |=..+|+|+|+=....+..++......+.+.+..||.++++-...+.... ....+..+| .||++++- +|- T Consensus 90 IP~~V~~Aq~elA~~~~~~~~~~~~~~~~v~~~kVG~i~veY~~~~~~~~------~~~~v~~LL--~p~l~~~~--~g~ 159 (170) T protein:vir:94 90 IPVKVKIAVFELAYFMLESGAALSFADQTIDSVKVGTIRVEFTKNSTDAG------LPTFVEAML--SGFGSPVL--YGS 159 (170) T ss_pred hhHHHHHHHHHHHHHHHhCcccCcccccceeeEecceeEEEecCCCCCCc------cHHHHHHHh--hhhhcccc--ccc Confidence 77779999998777677665543344566889999999988532221111 222223333 34554221 122 Q ss_pred CC Q lcl|NC_018848. 137 NW 138 (138) Q Consensus 137 ~~ 138 (138) |= T Consensus 160 ~~ 161 (170) T protein:vir:94 160 NA 161 (170) T ss_pred cc Confidence 22 No 13 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=95.90 E-value=0.00023 Score=40.54 Aligned_cols=133 Identities=14% Similarity=0.094 Sum_probs=75.3 Q ss_pred CCceecccHHHHHHhcC---CCCC-cchHHHHHHHHHHHHHHhc----c------------hheeeccCCCCCCHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTG---EPAP-ADAERLLTRASEDVDDALL----T------------AVYDVDEAGMPTDPAVAQA 60 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g---~~~p-~~~~rLl~rAS~~VD~~t~----~------------avYdv~~~GlPtdp~v~~a 60 (138) -|--.|+|.++++.|.. ...| ++.++.|++|++.||.+.. . .=+.+++..+|. ..|=.. T Consensus 12 ~~anSYvt~~~a~aY~~~rg~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~~~~-~~IP~~ 90 (172) T protein:vir:80 12 PDANTYAGADFVIAYAQARGVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFVIPS-DVIPKE 90 (172) T ss_pred ccccccccHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcccccc-cchhHH Confidence 46688999999998754 5666 4688899999999998421 0 001134444553 346667 Q ss_pred HHHHHHHHHHHHHHhCCccc-ccccccceeeeCceeeccCcccccc--chhh-hhhhHHHHHHHHhhcCCCCC----ccC Q lcl|NC_018848. 61 LADAACAQVAYRQESGDTGT-GAAGRWSSVSIGPVSMSGPRQSAGG--TGAG-SVDLGEQASRALARAGLTPG----EIY 132 (138) Q Consensus 61 lr~AtcAQV~~~~~~G~~~t-~~~g~~~s~sIG~~S~s~~~~~as~--~~~~-a~~ls~~a~~~L~~aGLl~g----~~~ 132 (138) ++.|+|+=.......++... ....++.+.+||.++++-..+.++. .... +....+.+-.+|++= |-|+ .+- T Consensus 91 v~~A~~elA~~~~~g~~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~LL~p~-l~~~gg~~~~~ 169 (172) T protein:vir:80 91 LQSAVAAAVIEQVNGFELQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFPKIDALLNPL-LVGDGGLFLVA 169 (172) T ss_pred HHHHHHHHHHHHhcCCccCcCCCCceeeEEeccceEEeeecccCccccccccCCccchHHHHHHHhhh-hcCCCCeeeee Confidence 89999986654444334322 3355688899999988742221111 1111 122344445555432 2221 122 Q ss_pred CCC Q lcl|NC_018848. 133 PPG 135 (138) Q Consensus 133 ~~~ 135 (138) +-| T Consensus 170 vrg 172 (172) T protein:vir:80 170 VRG 172 (172) T ss_pred ecC Confidence 333 No 14 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=94.95 E-value=0.00036 Score=39.44 Aligned_cols=127 Identities=14% Similarity=0.130 Sum_probs=72.5 Q ss_pred CCceecccHHHHHHhcC---CCCC---cchHHHHHHHHHHHHHH-hcch-------------h--eeeccCCCCCCHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTG---EPAP---ADAERLLTRASEDVDDA-LLTA-------------V--YDVDEAGMPTDPAVA 58 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g---~~~p---~~~~rLl~rAS~~VD~~-t~~a-------------v--Ydv~~~GlPtdp~v~ 58 (138) -|--.|+|.++++.|.. ...| +..+++|.+|++.||.. .+.. | +-++...+| +..|= T Consensus 14 ~~anSYvtv~ea~aY~~~rg~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v~-~~~IP 92 (172) T protein:vir:95 14 TNANSYVSVADARIYASNRGVELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEVP-SNVIP 92 (172) T ss_pred CcccccccHHHHHHHHHhcCCcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCccccc-ccchh Confidence 36678999999999865 3334 24578999999999951 2100 0 112222223 34566 Q ss_pred HHHHHHHHHHHHHHHHhCC-ccc-ccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCC Q lcl|NC_018848. 59 QALADAACAQVAYRQESGD-TGT-GAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGV 136 (138) Q Consensus 59 ~alr~AtcAQV~~~~~~G~-~~t-~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~ 136 (138) ..++.|+|.=.......++ .++ .....+.+..||.++++-..+.+.. ....-+.+-.+| .+|+. |-|- T Consensus 93 ~~V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~----~~~~~~~v~~LL--~p~l~----~~~~ 162 (172) T protein:vir:95 93 KSLIAAQVQLTMAINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSVG----IMPTFTAANALL--APLFG----ECAS 162 (172) T ss_pred HHHHHHHHHHHHHHHcCccccccCCcccceeEEeccceEEeeccCCCCC----CcccHHHHHHHH--hhhhc----ccCC Confidence 7789999987655555443 333 3345578999999998742222111 112234444455 34443 3333 Q ss_pred CC Q lcl|NC_018848. 137 NW 138 (138) Q Consensus 137 ~~ 138 (138) |= T Consensus 163 ~~ 164 (172) T protein:vir:95 163 NK 164 (172) T ss_pred cc Confidence 32 No 15 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=94.70 E-value=0.00023 Score=40.54 Aligned_cols=118 Identities=18% Similarity=0.185 Sum_probs=69.9 Q ss_pred eecccHHHHHH-hcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_018848. 4 RVYATPAQLAQ-WTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTG 79 (138) Q Consensus 4 rvyAt~~~l~~-~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~ 79 (138) --|||++|+.+ |....-|++ ++.||..||.+|.+.+- ++ +....||++-+-+++=+|+=|.--..+ T Consensus 1 ~~~At~~Dv~ar~~r~LT~~E~~~ve~lL~dAs~~ir~r~P----~l--~~~a~d~~~~~~~~~V~~~~V~R~~rn---- 70 (123) T protein:vir:77 1 MPYATASDVTSRWARQPTDEETALINVRLADVERMIKRRIP----DL--ATKVTDPDYLEDLKQVEADAVLRLVRN---- 70 (123) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhcc----Cc--ccccCCcchhHHHHHHHHHHHHHHhhC---- Confidence 57999999999 666544443 67899999999998544 22 133458888888888788877764433 Q ss_pred cccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 80 TGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 80 t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) ..+..|.+.|++|.|......+ .-..+-++=|..|....-==|+|-|--+-= T Consensus 71 ---peG~~s~T~G~ys~sl~~a~~~----g~Lylt~~E~~~Lg~~~~~~~~i~p~~~~~ 122 (123) T protein:vir:77 71 ---PEGYLSETDGNYTYMLRSDLAS----GKLEIFPEEWEILGYRRSRMTVIVPNPVMP 122 (123) T ss_pred ---CCCceecccchhhhhhcccCCC----CcceeCHHHHHhhcCCCCceeEEeeceecC Confidence 2235667779988885432111 123345555555544332111222211111 No 16 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=93.92 E-value=0.00082 Score=37.49 Aligned_cols=118 Identities=16% Similarity=0.294 Sum_probs=68.5 Q ss_pred eecccHHHHHH-hcCCCCC-cc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_018848. 4 RVYATPAQLAQ-WTGEPAP-AD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTG 79 (138) Q Consensus 4 rvyAt~~~l~~-~~g~~~p-~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~ 79 (138) --|||++|+.+ |....-| +. ++.||..||.+|.+.+- +. +++..||++-+-+|+=+|+=|.--.. + T Consensus 1 ~afAtv~Dve~rw~r~LT~eE~~~ae~lL~dAs~~IR~~iP----~L--a~~~~dp~~~a~v~~V~~~mV~R~~r-n--- 70 (149) T protein:vir:78 1 MAYAEPSDVVARLGRPLTDDEETQVETFLEDAEIEIRSRIP----DL--DDKAEDEDYLKRVIKVEASAVTRLIR-N--- 70 (149) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhcc----cc--ccccCCcchhhHHHHHHHHHHHHHhc-C--- Confidence 57999999999 6564434 32 77899999999998553 11 24455888888788888887776442 2 Q ss_pred cccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHh---hcCCC------C---CccCCC--CCCC Q lcl|NC_018848. 80 TGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALA---RAGLT------P---GEIYPP--GVNW 138 (138) Q Consensus 80 t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~---~aGLl------~---g~~~~~--~~~~ 138 (138) ..+..|.|+|++|.|......+. -..+-++=+..|. ..|-+ | ..-||. .|.| T Consensus 71 ---peG~~S~T~G~YS~slt~~np~G----~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~ 136 (149) T protein:vir:78 71 ---PDGYIGETDGNYSYQLNWRLNTG----AIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEW 136 (149) T ss_pred ---CCCeeeeecchhhhhhhccCCCC----ceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceee Confidence 22355788899988853322111 1112222222221 11221 1 113443 4667 No 17 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=93.92 E-value=0.00082 Score=37.49 Aligned_cols=118 Identities=16% Similarity=0.294 Sum_probs=68.5 Q ss_pred eecccHHHHHH-hcCCCCC-cc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_018848. 4 RVYATPAQLAQ-WTGEPAP-AD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTG 79 (138) Q Consensus 4 rvyAt~~~l~~-~~g~~~p-~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~ 79 (138) --|||++|+.+ |....-| +. ++.||..||.+|.+.+- +. +++..||++-+-+|+=+|+=|.--.. + T Consensus 1 ~afAtv~Dve~rw~r~LT~eE~~~ae~lL~dAs~~IR~~iP----~L--a~~~~dp~~~a~v~~V~~~mV~R~~r-n--- 70 (149) T protein:vir:78 1 MAYAEPSDVVARLGRPLTDDEETQVETFLEDAEIEIRSRIP----DL--DDKAEDEDYLKRVIKVEASAVTRLIR-N--- 70 (149) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhcc----cc--ccccCCcchhhHHHHHHHHHHHHHhc-C--- Confidence 57999999999 6564434 32 77899999999998553 11 24455888888788888887776442 2 Q ss_pred cccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHh---hcCCC------C---CccCCC--CCCC Q lcl|NC_018848. 80 TGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALA---RAGLT------P---GEIYPP--GVNW 138 (138) Q Consensus 80 t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~---~aGLl------~---g~~~~~--~~~~ 138 (138) ..+..|.|+|++|.|......+. -..+-++=+..|. ..|-+ | ..-||. .|.| T Consensus 71 ---peG~~S~T~G~YS~slt~~np~G----~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~ 136 (149) T protein:vir:78 71 ---PDGYIGETDGNYSYQLNWRLNTG----AIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEW 136 (149) T ss_pred ---CCCeeeeecchhhhhhhccCCCC----ceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceee Confidence 22355788899988853322111 1112222222221 11221 1 113443 4667 No 18 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=93.57 E-value=0.00066 Score=38.03 Aligned_cols=114 Identities=18% Similarity=0.235 Sum_probs=67.5 Q ss_pred eecccHHHHHH-hcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCH-HHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_018848. 4 RVYATPAQLAQ-WTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDP-AVAQALADAACAQVAYRQESGDT 78 (138) Q Consensus 4 rvyAt~~~l~~-~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp-~v~~alr~AtcAQV~~~~~~G~~ 78 (138) --|||++|+.+ |...+-|++ ++.+|.+||.||.+.+-.=.-.++. -+.|+ +|.+...+||.. + T Consensus 1 ma~A~~~Dv~~~w~r~lT~~E~~~v~~~L~~Ae~~Ir~riP~L~~r~~a--~~~~~~~v~~Vea~aV~R-----v----- 68 (125) T protein:vir:10 1 MAYANAQDVVTLWAKEPEPEVMELIERRLAQVERMIKRRIPNLDLKVAA--DATFQADLIDIEADAVLR-----L----- 68 (125) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhCCChhhhhhc--CCCccccHHHHHHHHHHH-----H----- Confidence 57999999999 555555543 5689999999998766421111221 13332 244544555544 1 Q ss_pred ccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcC---C---CCCccCCC Q lcl|NC_018848. 79 GTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAG---L---TPGEIYPP 134 (138) Q Consensus 79 ~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aG---L---l~g~~~~~ 134 (138) .-+.++..|.+.|.+|+|.....++ ....+.+.=|..|...- . .|-.+-|. T Consensus 69 -~rNPeGy~s~T~G~Ys~~l~~~~~~----g~L~it~~Ew~~Lg~~r~s~~~~i~p~~~~~~ 125 (125) T protein:vir:10 69 -VRNPEGYISETDGAYTYQLQTDLSQ----GRLTILDDEWTTLGVNRLSRMSVIAPNIVMPT 125 (125) T ss_pred -hcCCCcccccccchhHHhhhccccc----CceeeCHHHHHhhccccccceeeeecccccCC Confidence 2334446778889999986443222 23456777777776544 3 34444444 No 19 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=93.08 E-value=0.0025 Score=34.85 Aligned_cols=126 Identities=13% Similarity=0.061 Sum_probs=66.8 Q ss_pred CCceecccHHHHHHhcC---CCCCc---chHHHHHHHHHHHHHHhc---c-------------hheeeccCCCCCCHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTG---EPAPA---DAERLLTRASEDVDDALL---T-------------AVYDVDEAGMPTDPAVA 58 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g---~~~p~---~~~rLl~rAS~~VD~~t~---~-------------avYdv~~~GlPtdp~v~ 58 (138) -|--.|+|.++++.|.. ...|. ..+++|.+|++.||++.. + .=+.++.+.+|. ..|= T Consensus 12 ~~anSYvt~~ea~aY~~~rg~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~~-~~IP 90 (169) T protein:vir:95 12 PNADSYVSLEDGRALAAKYGLELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFPQPS-NVIP 90 (169) T ss_pred CcccccccHHHHHHHHHHcCCcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceeccccccc-ccch Confidence 46778999999999764 45553 357899999999998521 0 003344444453 3455 Q ss_pred HHHHHHHHHHHHHHHHhCCccccc--ccccc-eeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCC Q lcl|NC_018848. 59 QALADAACAQVAYRQESGDTGTGA--AGRWS-SVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPG 135 (138) Q Consensus 59 ~alr~AtcAQV~~~~~~G~~~t~~--~g~~~-s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~ 135 (138) ..++.|+|+=....+. |.+.... .+.+. ...+|.++++-..++.... .....+.+ +||.+.+.+-| T Consensus 91 ~~V~~A~~elA~~~~~-g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~-----~~~~~a~~-----~LL~p~l~g~~ 159 (169) T protein:vir:95 91 SLVIQAQVMAAVEYGA-GTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGG-----TVSITAAD-----DALRPLLCGSN 159 (169) T ss_pred HHHHHHHHHHHHHHHc-CccccCCCCccceeeeeeccceeEeecCCCCcCc-----cccHHHHH-----HhhhhhcccCC Confidence 6788888875444443 4322222 23333 3445888887322222211 11111221 44444444433 Q ss_pred CCC Q lcl|NC_018848. 136 VNW 138 (138) Q Consensus 136 ~~~ 138 (138) =+. T Consensus 160 g~~ 162 (169) T protein:vir:95 160 NAY 162 (169) T ss_pred Ccc Confidence 222 No 20 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=92.94 E-value=0.0019 Score=35.56 Aligned_cols=111 Identities=23% Similarity=0.300 Sum_probs=63.4 Q ss_pred CCceecccHHHHHHhcCCCCCc------chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAPA------DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQE 74 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p~------~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~ 74 (138) |. -|||++||+.-.+.+.++ .+..||..||++|..-+- +.-+.+| +.+|.=+|+=|.-... T Consensus 1 M~--~fAtv~Dl~~rw~~~~~dee~~ra~~~~lL~dAS~~ir~~~p--------~~~~~~~---~~~~~V~~~~V~R~~~ 67 (136) T protein:vir:98 1 MA--AYATVEDYQARAAVTLPDGSPRRAQVEAYLDDASALMARHIP--------TGHTPDP---GTLRAICVAVVRRVMA 67 (136) T ss_pred CC--ccCCHHHHHHHhccCCCCchhHHHHHHHHHHHHHHHHHHhCC--------CCCCCCh---hHHHHHHHHHHHHHhh Confidence 65 699999999955533332 256799999999997643 2223344 3466667777765553 Q ss_pred hCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhc-------------CCCC----CccCCCCCC Q lcl|NC_018848. 75 SGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARA-------------GLTP----GEIYPPGVN 137 (138) Q Consensus 75 ~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~a-------------GLl~----g~~~~~~~~ 137 (138) + | .+..|-|+|++|.|.+-.. ..-+.++=+..|... .+.+ ..==|-|-. T Consensus 68 n---p----~G~~s~TaG~ys~s~t~~G-------~Lylt~~E~~~Lg~~rqr~~~~d~a~si~~~~~~~~~~~dp~~~~ 133 (136) T protein:vir:98 68 N---P----GGYRQRTIGQYAETLGEDG-------GLYLTEDEKGQLQPPDQTAPDADAAYSLDLDPGTRAWVDDPAGCG 133 (136) T ss_pred C---C----CCcccccchhHHHhhhcCC-------CcccChHHHHHhCCCCCcccccccceecccCCCcCCcCCCCCCCC Confidence 3 2 2344577899888854311 122344444444222 1222 222355666 Q ss_pred C Q lcl|NC_018848. 138 W 138 (138) Q Consensus 138 ~ 138 (138) | T Consensus 134 ~ 134 (136) T protein:vir:98 134 W 134 (136) T ss_pred C Confidence 7 No 21 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=92.85 E-value=0.006 Score=32.75 Aligned_cols=131 Identities=16% Similarity=0.126 Sum_probs=74.2 Q ss_pred CCceecccHHHHHHhcC---CCCC----cchHHHHHHHHHHHHHHhcc--hhe-e------------eccCCCCCCHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTG---EPAP----ADAERLLTRASEDVDDALLT--AVY-D------------VDEAGMPTDPAVA 58 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g---~~~p----~~~~rLl~rAS~~VD~~t~~--avY-d------------v~~~GlPtdp~v~ 58 (138) -|--.|+|.+++..|.. ...| +..+++|.+|++.||+..+- .+- . +++...|.+ .|= T Consensus 13 ~~AnSYvtv~~a~aY~~~rg~~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~~~-~IP 91 (172) T protein:vir:97 13 AGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYIN-DIP 91 (172) T ss_pred CCccccccHHHHHHHHHhcCcccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccccc-ccc Confidence 35578999999998754 3334 23567999999999975331 010 0 233444432 255 Q ss_pred HHHHHHHHHHHHHHHHhCCc--ccc-c---ccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCC-cc Q lcl|NC_018848. 59 QALADAACAQVAYRQESGDT--GTG-A---AGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPG-EI 131 (138) Q Consensus 59 ~alr~AtcAQV~~~~~~G~~--~t~-~---~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g-~~ 131 (138) ..++.|+|+=...-+..+.- ... . .....++.+|.+++.....+.+... .--.+.+-.+|++-||.+| .. T Consensus 92 ~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~~~~~---~p~~~~v~aLL~p~gl~~~~~~ 168 (172) T protein:vir:97 92 PEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ---MPKYPAADQKLVRAGLVRSGGT 168 (172) T ss_pred HHHHHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCCCCCc---cccHHHHHHHHhhhccccCcce Confidence 67888998765544444332 111 1 1235588889988863221111110 1225667778999888653 33 Q ss_pred CCCC Q lcl|NC_018848. 132 YPPG 135 (138) Q Consensus 132 ~~~~ 135 (138) +=-| T Consensus 169 ~~r~ 172 (172) T protein:vir:97 169 LLRG 172 (172) T ss_pred eccC Confidence 4444 No 22 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=92.45 E-value=0.001 Score=36.98 Aligned_cols=114 Identities=18% Similarity=0.246 Sum_probs=64.3 Q ss_pred eecccHHHHHH-hcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_018848. 4 RVYATPAQLAQ-WTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTG 79 (138) Q Consensus 4 rvyAt~~~l~~-~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~ 79 (138) --|||++|+.+ |....-|++ ++.||..||+||.+..-- ++ +..+-|.++. -+++=+|+=|.--..+ T Consensus 1 ~~~At~~Dv~~rw~r~Lt~~E~~~ve~lL~dAs~~ir~r~P~--l~-~~~~~~~~~~---~v~~V~a~~V~R~~rn---- 70 (124) T protein:vir:24 1 MAYATADDVVTLWAKEPEPEVMALIERRLEQVERMIRRRIPD--LD-ARVSSDIFRA---DLIDIEADAVLRLVRN---- 70 (124) T ss_pred CCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHhcCCC--cc-hhcCCCCChh---hHHHHHHHHHHHHhhC---- Confidence 57999999999 556544432 678999999999964331 11 1122233332 2455555555543322 Q ss_pred cccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhc---CC---CCCccCCC Q lcl|NC_018848. 80 TGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARA---GL---TPGEIYPP 134 (138) Q Consensus 80 t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~a---GL---l~g~~~~~ 134 (138) ..+..|.+.|++|+|.....++ .-..+.++=|..|... |. .|-.+-|. T Consensus 71 ---P~G~~s~T~G~Ys~sl~~~~~~----g~Lylt~~E~~~Lg~~r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 71 ---PEGYLSETDGAYTYQLQADLSQ----GKLVILDEEWTTLGVNRLSRMSTLVPNIVMPT 124 (124) T ss_pred ---CCCceecccchhHHhhhhcccC----CceeeCHHHHHhhCcccccceeEeecceeeCC Confidence 2235677789999985432221 1233556666666554 22 34444444 No 23 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=92.22 E-value=0.004 Score=33.70 Aligned_cols=127 Identities=10% Similarity=0.035 Sum_probs=66.3 Q ss_pred CCceecccHHHHHHhcC---CCCC---cchHHHHHHHHHHHHHHhc---chh-------------eeeccCCCCCCHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTG---EPAP---ADAERLLTRASEDVDDALL---TAV-------------YDVDEAGMPTDPAVA 58 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g---~~~p---~~~~rLl~rAS~~VD~~t~---~av-------------Ydv~~~GlPtdp~v~ 58 (138) -|--.|+|.++++.|.. ...| +..+++|++|++.||.+.. +.+ +.++...+| +..|= T Consensus 12 ~~anSYvtv~~a~aY~~~rg~~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~-~~~IP 90 (169) T protein:vir:78 12 PNADSYVSLEDGRALAAKYGLELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLHGFPQP-SNVIP 90 (169) T ss_pred ccccccccHHHHHHHHHHcCCcCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceecccccc-cccch Confidence 36678999999999764 4455 2466899999999997421 100 113333444 23455 Q ss_pred HHHHHHHHHHHHHHHHhCCcccc-cccccceeee-CceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCC Q lcl|NC_018848. 59 QALADAACAQVAYRQESGDTGTG-AAGRWSSVSI-GPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGV 136 (138) Q Consensus 59 ~alr~AtcAQV~~~~~~G~~~t~-~~g~~~s~sI-G~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~ 136 (138) ..++.|+|+=....+..++.... ..+...+-.+ |.+++.-..++... ......+++ |||.+.+.+-|= T Consensus 91 ~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~-----~~~~~~~~~-----~LL~p~l~~~~g 160 (169) T protein:vir:78 91 PLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSG-----GTVSITTAD-----DALRPLLCGSNN 160 (169) T ss_pred HHHHHHHHHHHHHHhcCcccCCCCCcceeEEEEecCceeEeecCCCCCC-----CcccHHHHH-----HHhhhhcccCCC Confidence 67888988766655443443222 2344555455 66666532211111 111111222 344444444332 Q ss_pred CC Q lcl|NC_018848. 137 NW 138 (138) Q Consensus 137 ~~ 138 (138) +. T Consensus 161 ~~ 162 (169) T protein:vir:78 161 AY 162 (169) T ss_pred cc Confidence 22 No 24 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=92.17 E-value=0.00033 Score=39.66 Aligned_cols=114 Identities=17% Similarity=0.122 Sum_probs=61.2 Q ss_pred CCceecccHHHHHHhcC-CCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTG-EPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQ-ES 75 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g-~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~-~~ 75 (138) -.-.-+||.+|+++=++ +.-|++ +.-||..||++|+-.+.. |.|- +-+| + +++. +||||..-. .. T Consensus 2 ~~~~alAtvdDv~~~lrr~Lt~dE~~~a~~Ll~eAsdlI~g~l~~--~~vp-~~~p--~----~v~r-VvA~ivarAltr 71 (128) T protein:vir:25 2 TECKALATSQDVKRALRRDLTEAEQTDLSELLAEATDLVVGYLHP--YPVP-TPTP--G----PIKR-VVASMVAAVLTR 71 (128) T ss_pred ccchhccCHHHHHHHhcCCCCHHHHHHHHHHHhcchheeeeecCC--CCCC-CCCC--c----hHHH-HHHHHHHHHhhC Confidence 46788999999999554 454543 446999999999877662 3322 2222 2 2333 455555544 34 Q ss_pred CCcccccccccceeeeCceeeccCccccccch----hhhhhhHHHHHHHHhhcCC----CCCccCC Q lcl|NC_018848. 76 GDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTG----AGSVDLGEQASRALARAGL----TPGEIYP 133 (138) Q Consensus 76 G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~----~~a~~ls~~a~~~L~~aGL----l~g~~~~ 133 (138) |.++ .+..+|++-|+||.++..++.+..- .+-.+|=| - ..|- |+.+=|= T Consensus 72 ~~~~---~pe~~S~TAgpfs~~ft~~~~~~g~yLTaa~k~~Lrp-----~-R~~~~sV~l~sery~ 128 (128) T protein:vir:25 72 PTQI---LPETQSLTADGFGVTFTPGGNSPGPYLSAALKQRLRP-----Y-RTGMVAVEMGSERYC 128 (128) T ss_pred CCcc---CCCceeeecccccccccCCCCCCCceEcHHHHhhccc-----c-cceeeEeecccccCC Confidence 6554 4477888889999765433322211 11111100 0 1111 3333333 No 25 >protein:vir:78916 Length: 117 # NCBI annotation: gp8 # Family: family:all:10242 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468848;genbank:gi:157325483;genbank:GeneID:5601913 Probab=89.07 E-value=0.0043 Score=33.55 Aligned_cols=112 Identities=23% Similarity=0.360 Sum_probs=64.5 Q ss_pred ceecccHHHHHHhcCC-CCCcchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_018848. 3 RRVYATPAQLAQWTGE-PAPADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTG 81 (138) Q Consensus 3 ~rvyAt~~~l~~~~g~-~~p~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~ 81 (138) -+-|+||.+|+..+.- --|-++..||+.||-.||.-.-..+-|+|. .|.+ +|+|++-|-+-.-.-|.-- T Consensus 1 mktyitpselasltnlsiepteadnlikaasvaidkqimpnivdldn----vddd----ikqavawqcehikkygefi-- 70 (117) T protein:vir:78 1 MKTYITPSELASLTNLSIEPTEADNLIKAASVAIDKQIMPNIVDLDN----VDDD----IKQAVAWQCEHIKKYGEFI-- 70 (117) T ss_pred Cccccchhhhhhhhccccccccchhhhhhhhhhhhhhhccccccccc----cchh----hHHHHhhhHHHHHhhhhhc-- Confidence 5789999999998773 335678899999999999777766655442 2333 5667766666555555431 Q ss_pred cccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCC----CCc Q lcl|NC_018848. 82 AAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLT----PGE 130 (138) Q Consensus 82 ~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl----~g~ 130 (138) +-+.|++|+..+.+-...+. +---++-..+.++|-..|.| ||. T Consensus 71 ---gignftlgkltmggqsqnsn---nfipdvpdkvmdlllssgwlyagvggc 117 (117) T protein:vir:78 71 ---GIGNFTLGKLTMGGQSQNSN---NFIPDVPDKVMDLLLSSGWLYAGVGGC 117 (117) T ss_pred ---ccccceeeeeeecccccccC---CcCCccchHHHHHHhhccchhcccCCC Confidence 11223444433321111110 11223455567777776653 333 No 26 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=88.37 E-value=0.017 Score=30.31 Aligned_cols=127 Identities=23% Similarity=0.259 Sum_probs=66.6 Q ss_pred CCceecccHHHHHH-hcC-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHH----- Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTG-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYR----- 72 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~----- 72 (138) |+.-+| .+++++. |=. .++|+ .+...+..|..++=.-|+...+ .|...++.+-....|+.... T Consensus 1 M~~~~F-d~~~FR~~fPeFa~~Pd~~i~~~l~~A~~~~l~~~~~s~~--------~~~~~~~~~l~LltAHll~L~~~~~ 71 (167) T protein:vir:94 1 MAVVVF-DPTAFKLVYPEFVAVPDARLTALFNTVGYTILDNTDASVI--------VDPLRRAPLLDLLVAHMLALFGYVN 71 (167) T ss_pred CCcccC-ChHHHHHhchhcccCCHHHHHHHHHHHHHhhcCCCCcccc--------cchhhHHHHHHHHHHHHHHHhhhhh Confidence 999999 5555554 332 33454 4445555554443222332221 14556666666677777544 Q ss_pred HHhCCc-ccccccccceeeeCceeeccCccccccchhh---hhhhHHHHHHHHhhcCCCCCccCCCC---CCC Q lcl|NC_018848. 73 QESGDT-GTGAAGRWSSVSIGPVSMSGPRQSAGGTGAG---SVDLGEQASRALARAGLTPGEIYPPG---VNW 138 (138) Q Consensus 73 ~~~G~~-~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~---a~~ls~~a~~~L~~aGLl~g~~~~~~---~~~ 138 (138) ...++. ..+..|..+|.|+|++|+|-+.......... ....--+-|.+++..|.=+ .+++| ++. T Consensus 72 a~~~~~~~~g~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fwaL~~~~g~Gg--~v~gG~~~~~~ 142 (167) T protein:vir:94 72 ADGSITPGTGTVGRVANASEGSVSTSLAYSTPTGAGEAWFTQTPYGAMYWAMSAPFRSFH--YVAAGLSGVGY 142 (167) T ss_pred hhcccccccccchheeeccccceeeeeecCCCCCchhhhhhcCHHHHHHHHHHHHhcccc--cccCCCCCCCC Confidence 222222 2233566899999999999655543322211 1123344466677666644 44433 333 No 27 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=87.33 E-value=0.035 Score=28.57 Aligned_cols=125 Identities=15% Similarity=0.019 Sum_probs=69.2 Q ss_pred CCceecccHHHHHH-hcC----CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTG----EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQE 74 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g----~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~ 74 (138) ||. ..++++++. |=. +.+|+ .+...+..|-..|+..- |. ...+.+..+.+-....|+..+... T Consensus 1 m~v--~fd~~~Fr~~fPeFad~~~~pd~~i~~~l~~A~~~l~~~~----~~-----~~~~g~~~~~~l~Ll~AHll~l~~ 69 (147) T protein:vir:10 1 MDH--TLDITKFRALFPEFNNDVKYPDALLEQWYAVAGEYLGLTD----YA-----CGLNGNTLDLALMQLTAHLMKSAT 69 (147) T ss_pred Cce--ecCHHHHHHhcccccCCccCCHHHHHHHHHHHHHhhcccc----CC-----cccChhhHHHHHHHHHHHHHHHHH Confidence 886 678888887 322 23554 45567888866554322 21 112356666666666677655543 Q ss_pred hCCcccccccccceeeeCceeeccCccccccchhh--h-hhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 75 SGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAG--S-VDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 75 ~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~--a-~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) .--.+.+..|..+|.|+|++|+|-+.+........ . ...--+=|.+++..|.=| .|.+|.-= T Consensus 70 ~~~~g~g~~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~~l~~~~~~Gg--~vvgG~p~ 134 (147) T protein:vir:10 70 ILSSNKGAPMVMTSATIDKVSISTLAPPIKNGWQYWLSTTPYGQMLWALLSMRSSGG--FVYGGSPE 134 (147) T ss_pred hhccCCCcccceeeeeecceeeeeecCCCCCcchhhhhcCHHHHHHHHHHHhhCccc--eecCCCCc Confidence 32233345677899999999999765533222211 1 112334466666666533 44444332 No 28 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=83.97 E-value=0.013 Score=30.87 Aligned_cols=114 Identities=18% Similarity=0.174 Sum_probs=66.0 Q ss_pred eecccHHHHHHhc-CCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHH-HHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_018848. 4 RVYATPAQLAQWT-GEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPA-VAQALADAACAQVAYRQESGDT 78 (138) Q Consensus 4 rvyAt~~~l~~~~-g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~-v~~alr~AtcAQV~~~~~~G~~ 78 (138) --|||.+|+.+.+ ..+-|++ .+.+|.+|+.||.+..-.=.=.+. +-|.|+. +.+.--.||. - + T Consensus 1 m~~A~~eDV~a~w~r~lt~~e~~~v~~~L~~Ae~~Ir~riPdL~~r~~--~~~~~~~~v~~Vea~aV~----R-v----- 68 (125) T protein:vir:42 1 MAYATAEDVVTLWAKEPEPEVMALIERRLQQIERMIKRRIPDLDVKAA--ASATFRADLIDIEADAVL----R-L----- 68 (125) T ss_pred CCcccHhHHHHHhCCCCChHHHHHHHHHHHHHHHHHHHhCCCchhhhc--ccCcchhhHHHHHHHHHH----H-H----- Confidence 5799999999944 5554543 568999999999876541100111 1233433 3332233322 2 1 Q ss_pred ccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhh---cCC---CCCccCCC Q lcl|NC_018848. 79 GTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALAR---AGL---TPGEIYPP 134 (138) Q Consensus 79 ~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~---aGL---l~g~~~~~ 134 (138) .-+.++-.|-|.|+.|++....-++ ....+.+.=|..|.. +|. .|-++-|. T Consensus 69 -~RNpeGy~s~T~G~Ys~~l~~~~~~----g~L~it~eEw~~L~p~~~~g~~~i~P~~~~~~ 125 (125) T protein:vir:42 69 -VRNPEGYLSETDGAYTYQLQADLSQ----GKLTILDEEWEILGVNSQKRMAVIVPNVVMPT 125 (125) T ss_pred -HhCCCccccccchhHHHhhhccccc----CceeeCHHHHHhhCccccccceeecccceeCC Confidence 1233445667779999986543222 334577777888876 454 55555665 No 29 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=75.08 E-value=0.1 Score=26.07 Aligned_cols=99 Identities=11% Similarity=0.009 Sum_probs=61.1 Q ss_pred ecccHHHHHHhcCC--CCCc-c--hHHHHHHHHHHHHHHhcchheeeccCCCCCC---HHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 5 VYATPAQLAQWTGE--PAPA-D--AERLLTRASEDVDDALLTAVYDVDEAGMPTD---PAVAQALADAACAQVAYRQESG 76 (138) Q Consensus 5 vyAt~~~l~~~~g~--~~p~-~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtd---p~v~~alr~AtcAQV~~~~~~G 76 (138) .+.|.+++|+++.- ++++ | +..||.-|.+.|...|++.+|+.+..--+++ ..+-..+|.|+.-=|.+|.++= T Consensus 1 M~vtLee~K~hLRId~D~~dDD~lI~~~i~AA~~~i~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~AiLllv~~~Y~NR 80 (107) T protein:vir:44 1 MLLSVEEIKAQLRLDEDFEADERYLQLLARAVQKRTETYLNRKLYAPDETIPDSDPDGLLLQDDIRLGMLMLISHFYENR 80 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHhhcCccccccccccccccccccchhhHHHHHHHHHHHHHhhh Confidence 79999999999995 3333 3 6679999999999999999887554311222 2345668999999999999986 Q ss_pred CcccccccccceeeeCceeeccCccccccchhhhhhhHHH Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQ 116 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~ 116 (138) ...+.. +...+..| +. + =-.--|.+|+ T Consensus 81 e~~~~~--~~~~lP~~---v~----~----Ll~~yR~~p~ 107 (107) T protein:vir:44 81 SSVTEV--EKLDMPQS---FG----W----LVGPYRYFPQ 107 (107) T ss_pred hhhccc--cccccCHH---HH----H----HHHHhhhcCC Confidence 442211 11111111 10 0 0111233444 No 30 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=73.41 E-value=0.17 Score=24.79 Aligned_cols=126 Identities=17% Similarity=0.135 Sum_probs=62.5 Q ss_pred ceecccHHHHHHhcCCCCCcchHHHHHHHHHHHHHHhcchhee-eccCCCCCC-HHHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_018848. 3 RRVYATPAQLAQWTGEPAPADAERLLTRASEDVDDALLTAVYD-VDEAGMPTD-PAVAQALADAACAQVAYRQESGDTGT 80 (138) Q Consensus 3 ~rvyAt~~~l~~~~g~~~p~~~~rLl~rAS~~VD~~t~~avYd-v~~~GlPtd-p~v~~alr~AtcAQV~~~~~~G~~~t 80 (138) .--|||.++|+..+..|.|++.+|-..+|--++|.+-.-+++- =-+=.+|+| |+...+ +|-++.--. . T Consensus 1 ~~alasvee~~trl~~~lp~~~~r~~a~a~~vLd~~S~~ar~~~gr~W~~~~daP~~vr~----ivL~aa~R~------~ 70 (158) T protein:vir:99 1 MAALVSVEEFTTFLRVPLPEEGSEKYTQMEFLLTLASDWARELSCKPWLLPADAPVTARG----IILAASRRE------W 70 (158) T ss_pred CcceeeHhhhhhhhcccCChhhhHHHHHHHHHHHHHHHHHHHhcCccCCCCCcchhHHHH----HHHHHHHHH------H Confidence 6789999999999999999877765555544444443322221 112245677 775443 343333222 2 Q ss_pred ccccccceeeeCceeeccCccc-c-----ccchhhhhhhHHHH----------HHHHhhcCCCC----CccCCC------ Q lcl|NC_018848. 81 GAAGRWSSVSIGPVSMSGPRQS-A-----GGTGAGSVDLGEQA----------SRALARAGLTP----GEIYPP------ 134 (138) Q Consensus 81 ~~~g~~~s~sIG~~S~s~~~~~-a-----s~~~~~a~~ls~~a----------~~~L~~aGLl~----g~~~~~------ 134 (138) .+..+..+..+|.+++..+... . ...-+...|+..+. -+-++.+|-++ |-.||- T Consensus 71 ~NP~g~~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~GG~~~~~ttR~d~~~~~~yv~v~~~GdpfP~~~~~d~ 150 (158) T protein:vir:99 71 NNPKRVSYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRSTGNWGVIETYRDDEEQLNGYLEVYPHGGLMPVYHPDDI 150 (158) T ss_pred hcCCceEEeeecchhhhcccccCCCcccCHHHHHHHHHhhcccCceeEEEeecCccccCCceecccCCCCcccccCcccc Confidence 3455677888888888743321 1 11112222221110 01223333333 222331 Q ss_pred --CCCC Q lcl|NC_018848. 135 --GVNW 138 (138) Q Consensus 135 --~~~~ 138 (138) |-.| T Consensus 151 g~g~~~ 156 (158) T protein:vir:99 151 GYGGSI 156 (158) T ss_pred CCCccc Confidence 1122 No 31 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=72.16 E-value=0.12 Score=25.72 Aligned_cols=100 Identities=14% Similarity=0.111 Sum_probs=59.7 Q ss_pred eecccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccC---------CCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 4 RVYATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEA---------GMPTDPAVAQALADAACAQVAY 71 (138) Q Consensus 4 rvyAt~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~---------GlPtdp~v~~alr~AtcAQV~~ 71 (138) =-+.|.+++|+|+.-+.+++ +..||.-|.+.|.+.|++.+|+.... .-+....+-..||.|+.--|.+ T Consensus 1 M~~vtLee~K~hLRvd~d~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AvLllv~~ 80 (113) T protein:vir:10 1 MALVELKLALGFVRANAGVEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPMVVNAAIRAAILKITAE 80 (113) T ss_pred CCCCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCccccccccccccccccccccccccccChHHHHHHHHHHHH Confidence 24899999999999665532 66899999999999999988875321 1111111445578899999999 Q ss_pred HHHhCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhh-cCC Q lcl|NC_018848. 72 RQESGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALAR-AGL 126 (138) Q Consensus 72 ~~~~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~-aGL 126 (138) |.++-...+.. +.+-+..+ + ..-+.-+|. .|+ T Consensus 81 ~Y~nRe~~~~~--~~~~lP~~-----------------v----~~Ll~~yR~~~g~ 113 (113) T protein:vir:10 81 LYANREDTAFG--PITELPLN-----------------A----RALLRPHRIIPGV 113 (113) T ss_pred HHhhhhhhchh--hhhccCHH-----------------H----HHHHHHhhhhcCC Confidence 99986542111 11100000 0 011222222 333 No 32 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=70.62 E-value=0.2 Score=24.46 Aligned_cols=106 Identities=15% Similarity=0.104 Sum_probs=61.9 Q ss_pred CCce-ecccHHHHHHhcCCCCCc-c--hHHHHHHHHHHHHHHhcchheeeccC------CCCCCH---HHHHHHHHHHHH Q lcl|NC_018848. 1 MDRR-VYATPAQLAQWTGEPAPA-D--AERLLTRASEDVDDALLTAVYDVDEA------GMPTDP---AVAQALADAACA 67 (138) Q Consensus 1 ~~~r-vyAt~~~l~~~~g~~~p~-~--~~rLl~rAS~~VD~~t~~avYdv~~~------GlPtdp---~v~~alr~AtcA 67 (138) |-.+ -..|.+++|+++.-+.++ | +..||.-|.+.|.+.|++++|+.+.. ..+.++ .+-..||.|+.- T Consensus 1 ~~~~m~~vtL~e~K~hLRvd~d~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~~~~i~~AvLl 80 (120) T protein:vir:10 1 MADQTPIVSLEVALAHLREDAGVADDLIKIYIGAATQSASDYVDRKLYANDAEMQAAVADATAGADPIVANDAIRAAILL 80 (120) T ss_pred CCCCCCccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccccccchhhhccccccccccCCHHHHHHHHH Confidence 5444 458999999999966553 2 56899999999999999988864321 111111 144668889999 Q ss_pred HHHHHHHhCCccc-ccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCc Q lcl|NC_018848. 68 QVAYRQESGDTGT-GAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGE 130 (138) Q Consensus 68 QV~~~~~~G~~~t-~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~ 130 (138) -|.+|.++=...+ +...+. +++-..+..+|.+==..+|+ T Consensus 81 lvg~~YenRe~~~~~~~~~~------------------------~~lP~~v~~Ll~~yR~~~gv 120 (120) T protein:vir:10 81 TIGKLYAFREDVVSGASASV------------------------TELPSGAKSLLFPYRVGLGV 120 (120) T ss_pred HHHHHHhchhhhhhcccccc------------------------cccCHHHHHHHHHhhhccCC Confidence 9999999854311 111000 11111122222222222222 No 33 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=70.36 E-value=0.15 Score=25.08 Aligned_cols=97 Identities=11% Similarity=-0.026 Sum_probs=59.1 Q ss_pred CCceecccHHHHHHhcCCCCCc-c--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAPA-D--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGD 77 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p~-~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~ 77 (138) ||+=-+.|.+++|+++.-+.++ | +..||.-|.+.|...++..+|+.. .++-..||.|+.-=|.+|.++-. T Consensus 3 ~~~M~~vtLee~K~hLRid~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-------~~~p~~ik~AiLllv~~~YenRE 75 (108) T protein:vir:18 3 IDVLDVISLSLFKQQIEFEEDDRDELITLYAQAAFDYCMRWCDEPAWKVA-------ADIPAAVKGAVLLVFADMFEHRT 75 (108) T ss_pred CCcccccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccc-------cccchHHHHHHHHHHHHHHhccc Confidence 9999999999999999966553 2 567999999999999997776643 12334578888888999999875 Q ss_pred cccccccc-cceeeeCceeec----c-Cccccccchh Q lcl|NC_018848. 78 TGTGAAGR-WSSVSIGPVSMS----G-PRQSAGGTGA 108 (138) Q Consensus 78 ~~t~~~g~-~~s~sIG~~S~s----~-~~~~as~~~~ 108 (138) ..+..... ..++ -++= . .+..-+-.+. T Consensus 76 ~~~~~~~~~~~~~----~~LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:18 76 AQSEVQLYENAAA----ERMMFIHRNWRGKAESEEGS 108 (108) T ss_pred ccccchhhhhHHH----HHHHHHHHhcCCCCCcccCC Confidence 32111100 0000 0010 0 0000000000 No 34 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=70.36 E-value=0.15 Score=25.08 Aligned_cols=97 Identities=11% Similarity=-0.026 Sum_probs=59.1 Q ss_pred CCceecccHHHHHHhcCCCCCc-c--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAPA-D--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGD 77 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p~-~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~ 77 (138) ||+=-+.|.+++|+++.-+.++ | +..||.-|.+.|...++..+|+.. .++-..||.|+.-=|.+|.++-. T Consensus 3 ~~~M~~vtLee~K~hLRid~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-------~~~p~~ik~AiLllv~~~YenRE 75 (108) T protein:vir:19 3 IDVLDVISLSLFKQQIEFEEDDRDELITLYAQAAFDYCMRWCDEPAWKVA-------ADIPAAVKGAVLLVFADMFEHRT 75 (108) T ss_pred CCcccccCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHHhCCcccccc-------cccchHHHHHHHHHHHHHHhccc Confidence 9999999999999999966553 2 567999999999999997776643 12334578888888999999875 Q ss_pred cccccccc-cceeeeCceeec----c-Cccccccchh Q lcl|NC_018848. 78 TGTGAAGR-WSSVSIGPVSMS----G-PRQSAGGTGA 108 (138) Q Consensus 78 ~~t~~~g~-~~s~sIG~~S~s----~-~~~~as~~~~ 108 (138) ..+..... ..++ -++= . .+..-+-.+. T Consensus 76 ~~~~~~~~~~~~~----~~LL~pYR~~~g~~~~~~~~ 108 (108) T protein:vir:19 76 AQSEVQLYENAAA----ERMMFIHRNWRGKAESEEGS 108 (108) T ss_pred ccccchhhhhHHH----HHHHHHHHhcCCCCCcccCC Confidence 32111100 0000 0010 0 0000000000 No 35 >protein:vir:102083 Length: 96 # NCBI annotation: DNA packaging protein # Family: family:all:316 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512316;genbank:gi:89152485;genbank:GeneID:3953076 Probab=68.42 E-value=0.067 Score=27.01 Aligned_cols=93 Identities=10% Similarity=-0.042 Sum_probs=53.7 Q ss_pred ecccHHHHHHhcCCCCCcc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTGA 82 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~~ 82 (138) ...|.+++++|+.-+.++| +..||.-|...|..+++..+++. .| .+|-|++.-|..|.++=...+.. T Consensus 1 M~vtLee~K~~LRID~DdD~lI~~~i~aA~~~i~~~~g~~~~e~-------~~----~~k~Avl~lv~~~YenR~~~~~~ 69 (96) T protein:vir:10 1 MLVTLEEAKEWIRVDGDDDPTITMLIKAAELYIYKATGKTFTQT-------NE----DAKLLCLFLVADWYGNRLLVGEK 69 (96) T ss_pred CcCCHHHHHHHcCCCCchhHHHHHHHHHHHHHHHHhhCCCCCCC-------cc----hHHHHHHHHHHHHHhhhhhcccc Confidence 6789999999999655644 45799999999999998554432 12 47889999999999995421111 Q ss_pred ccccceeeeCceeec-cCccccccchh Q lcl|NC_018848. 83 AGRWSSVSIGPVSMS-GPRQSAGGTGA 108 (138) Q Consensus 83 ~g~~~s~sIG~~S~s-~~~~~as~~~~ 108 (138) ....-.+++.+.=.. .+.+......+ T Consensus 70 ~~~~ip~~v~sli~qLr~~~~~~~e~~ 96 (96) T protein:vir:10 70 ASEKIRTIVQSMILQLQYASEPQEERK 96 (96) T ss_pred ccchhhHHHHHHHHHHhhcCCcccccC Confidence 111111111110000 11111111222 No 36 >protein:vir:105005 Length: 96 # NCBI annotation: putative DNA packaging protein phage # Family: family:all:316 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459970;genbank:gi:85701385;genbank:GeneID:3882146 Probab=68.42 E-value=0.067 Score=27.01 Aligned_cols=93 Identities=10% Similarity=-0.042 Sum_probs=53.7 Q ss_pred ecccHHHHHHhcCCCCCcc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTGA 82 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~~ 82 (138) ...|.+++++|+.-+.++| +..||.-|...|..+++..+++. .| .+|-|++.-|..|.++=...+.. T Consensus 1 M~vtLee~K~~LRID~DdD~lI~~~i~aA~~~i~~~~g~~~~e~-------~~----~~k~Avl~lv~~~YenR~~~~~~ 69 (96) T protein:vir:10 1 MLVTLEEAKEWIRVDGDDDPTITMLIKAAELYIYKATGKTFTQT-------NE----DAKLLCLFLVADWYGNRLLVGEK 69 (96) T ss_pred CcCCHHHHHHHcCCCCchhHHHHHHHHHHHHHHHHhhCCCCCCC-------cc----hHHHHHHHHHHHHHhhhhhcccc Confidence 6789999999999655644 45799999999999998554432 12 47889999999999995421111 Q ss_pred ccccceeeeCceeec-cCccccccchh Q lcl|NC_018848. 83 AGRWSSVSIGPVSMS-GPRQSAGGTGA 108 (138) Q Consensus 83 ~g~~~s~sIG~~S~s-~~~~~as~~~~ 108 (138) ....-.+++.+.=.. .+.+......+ T Consensus 70 ~~~~ip~~v~sli~qLr~~~~~~~e~~ 96 (96) T protein:vir:10 70 ASEKIRTIVQSMILQLQYASEPQEERK 96 (96) T ss_pred ccchhhHHHHHHHHHHhhcCCcccccC Confidence 111111111110000 11111111222 No 37 >protein:vir:107614 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338189;genbank:gi:77020184;genbank:GeneID:3703745 Probab=68.42 E-value=0.067 Score=27.01 Aligned_cols=93 Identities=10% Similarity=-0.042 Sum_probs=53.7 Q ss_pred ecccHHHHHHhcCCCCCcc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTGA 82 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~~ 82 (138) ...|.+++++|+.-+.++| +..||.-|...|..+++..+++. .| .+|-|++.-|..|.++=...+.. T Consensus 1 M~vtLee~K~~LRID~DdD~lI~~~i~aA~~~i~~~~g~~~~e~-------~~----~~k~Avl~lv~~~YenR~~~~~~ 69 (96) T protein:vir:10 1 MLVTLEEAKEWIRVDGDDDPTITMLIKAAELYIYKATGKTFTQT-------NE----DAKLLCLFLVADWYGNRLLVGEK 69 (96) T ss_pred CcCCHHHHHHHcCCCCchhHHHHHHHHHHHHHHHHhhCCCCCCC-------cc----hHHHHHHHHHHHHHhhhhhcccc Confidence 6789999999999655644 45799999999999998554432 12 47889999999999995421111 Q ss_pred ccccceeeeCceeec-cCccccccchh Q lcl|NC_018848. 83 AGRWSSVSIGPVSMS-GPRQSAGGTGA 108 (138) Q Consensus 83 ~g~~~s~sIG~~S~s-~~~~~as~~~~ 108 (138) ....-.+++.+.=.. .+.+......+ T Consensus 70 ~~~~ip~~v~sli~qLr~~~~~~~e~~ 96 (96) T protein:vir:10 70 ASEKIRTIVQSMILQLQYASEPQEERK 96 (96) T ss_pred ccchhhHHHHHHHHHHhhcCCcccccC Confidence 111111111110000 11111111222 No 38 >protein:vir:102863 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338138;genbank:gi:77020236;genbank:GeneID:3703772 Probab=68.42 E-value=0.067 Score=27.01 Aligned_cols=93 Identities=10% Similarity=-0.042 Sum_probs=53.7 Q ss_pred ecccHHHHHHhcCCCCCcc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTGA 82 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~~ 82 (138) ...|.+++++|+.-+.++| +..||.-|...|..+++..+++. .| .+|-|++.-|..|.++=...+.. T Consensus 1 M~vtLee~K~~LRID~DdD~lI~~~i~aA~~~i~~~~g~~~~e~-------~~----~~k~Avl~lv~~~YenR~~~~~~ 69 (96) T protein:vir:10 1 MLVTLEEAKEWIRVDGDDDPTITMLIKAAELYIYKATGKTFTQT-------NE----DAKLLCLFLVADWYGNRLLVGEK 69 (96) T ss_pred CcCCHHHHHHHcCCCCchhHHHHHHHHHHHHHHHHhhCCCCCCC-------cc----hHHHHHHHHHHHHHhhhhhcccc Confidence 6789999999999655644 45799999999999998554432 12 47889999999999995421111 Q ss_pred ccccceeeeCceeec-cCccccccchh Q lcl|NC_018848. 83 AGRWSSVSIGPVSMS-GPRQSAGGTGA 108 (138) Q Consensus 83 ~g~~~s~sIG~~S~s-~~~~~as~~~~ 108 (138) ....-.+++.+.=.. .+.+......+ T Consensus 70 ~~~~ip~~v~sli~qLr~~~~~~~e~~ 96 (96) T protein:vir:10 70 ASEKIRTIVQSMILQLQYASEPQEERK 96 (96) T ss_pred ccchhhHHHHHHHHHHhhcCCcccccC Confidence 111111111110000 11111111222 No 39 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=67.37 E-value=0.081 Score=26.58 Aligned_cols=88 Identities=14% Similarity=0.068 Sum_probs=53.1 Q ss_pred eecccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_018848. 4 RVYATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGT 80 (138) Q Consensus 4 rvyAt~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t 80 (138) =-+.|.+++++|+.-+.+++ +..||.-|...|...++...++.. ..+|.|++-.|..|.++-...+ T Consensus 1 Mm~vtLee~K~~LRID~d~dD~lI~~li~aA~~~i~~~~g~~~~~~~-----------~~~~~Avl~lv~~~YeNRe~~~ 69 (95) T protein:vir:81 1 MMIVTLEEVKNWLRVDFSDDDALITTLINAAEEYLKNATGTTFDATN-----------HLAKIFCMTLIADWYENRELVG 69 (95) T ss_pred CCcCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHhhccccccCc-----------hHHHHHHHHHHHHHHhhccccc Confidence 35789999999999665532 557999999999999986544321 3578899999999999965311 Q ss_pred ccccccceeeeC----ceeeccCcccc Q lcl|NC_018848. 81 GAAGRWSSVSIG----PVSMSGPRQSA 103 (138) Q Consensus 81 ~~~g~~~s~sIG----~~S~s~~~~~a 103 (138) .. ...-.+++. .-.......++ T Consensus 70 ~~-~~~~p~~v~sll~~lr~~~~~~~~ 95 (95) T protein:vir:81 70 RA-SDQVRPILQSILAQLTYAYGGETA 95 (95) T ss_pred cc-cccccHHHHHHHHHhhhccccccC Confidence 11 111111110 00000111111 No 40 >protein:vir:102158 Length: 99 # NCBI annotation: uncharacterized phage protein (possible DNA packaging) # Family: family:all:316 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699940;genbank:gi:110804046;genbank:GeneID:4206702 Probab=67.23 E-value=0.076 Score=26.72 Aligned_cols=87 Identities=8% Similarity=0.022 Sum_probs=53.7 Q ss_pred ecccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTG 81 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~ 81 (138) ...|.+++++|+.-+.+++ +..||.-|...|..+++...++. .| .+|.|++.-|.+|.++-..... T Consensus 1 M~vtLee~K~~LRID~d~dD~lI~~~i~aA~~~i~~~~~~~~~~~-------~~----~~k~Avl~lv~~~YenR~~~~~ 69 (99) T protein:vir:10 1 MILSVDEVKNYLRVDYDEDDILIQDLIESAEDYLYNATGKKFTEK-------NK----LAKRYCLALVYDWYKDKGMNIR 69 (99) T ss_pred CcCCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHHhhCCCCCCC-------Ch----HHHHHHHHHHHHhHhcchhhhh Confidence 7889999999999666642 66799999999999998665442 23 4788999999999999653211 Q ss_pred ccccc-----ceeeeC----ceeeccCccc Q lcl|NC_018848. 82 AAGRW-----SSVSIG----PVSMSGPRQS 102 (138) Q Consensus 82 ~~g~~-----~s~sIG----~~S~s~~~~~ 102 (138) .+... -.+++. .....+...+ T Consensus 70 ~~~~~~~~~~lp~~v~sli~qlr~~~~~~~ 99 (99) T protein:vir:10 70 ATKNTTVSEKVKYTLQSILLQLKFCKEEDT 99 (99) T ss_pred hhhccchhhhhhHHHHHHHHHHhhccCCCC Confidence 11000 011111 1000111111 No 41 >protein:vir:4998 Length: 106 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049972;genbank:gi:9632944;genbank:GeneID:1262107 Probab=67.18 E-value=0.12 Score=25.67 Aligned_cols=102 Identities=10% Similarity=-0.053 Sum_probs=55.7 Q ss_pred ecccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTG 81 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~ 81 (138) .-.|.+++++|+.-+.+++ +..||.-|..-|..+++. + .+++.+..+-..++-|++..|..|.++=...+. T Consensus 1 M~v~Le~iK~~LRID~ddDD~li~~~i~AA~~yi~~aig~---~---~~~~~~~~~~~~~~~Avl~Lv~~~YeNR~~~~~ 74 (106) T protein:vir:49 1 MSVSKEIIMQTLNLDETDDTALIPAYIESAQQYIINAVGS---D---PKFYELENVKYLFDTAVIALTSTYFTYRVALNE 74 (106) T ss_pred CcccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHhhcCC---C---CCCCCcCCCchHHHHHHHHHHHHHHhhcccccC Confidence 5568999999999666642 557999999999999872 2 234445556666899999999999999643221 Q ss_pred cccccceeeeCceeeccCccccccchhhhhhhHHHHH-HHHhhcC Q lcl|NC_018848. 82 AAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQAS-RALARAG 125 (138) Q Consensus 82 ~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~-~~L~~aG 125 (138) .+.. .+..|=-|+=. + .|-+-..+ +--..-| T Consensus 75 ~~~~--~vp~~v~slI~----------q-LR~~y~~~~e~~~~~~ 106 (106) T protein:vir:49 75 TLTY--PINLTLNSIIG----------Q-LRGLYATYSDGGVNNA 106 (106) T ss_pred cccc--cccHHHHHHHH----------H-HHhhhhhhhhccccCC Confidence 1111 11111101100 0 00000000 0000011 No 42 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=67.02 E-value=0.26 Score=23.80 Aligned_cols=99 Identities=14% Similarity=0.014 Sum_probs=63.1 Q ss_pred ecccHHHHHHhcCC--CCC-cc--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHH---HHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 5 VYATPAQLAQWTGE--PAP-AD--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPA---VAQALADAACAQVAYRQESG 76 (138) Q Consensus 5 vyAt~~~l~~~~g~--~~p-~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~---v~~alr~AtcAQV~~~~~~G 76 (138) -+.|.+++|+|+.- ++. +| +..||.-|.+-|...|++++|+.....-+.+|. +-..+|.|+.-=|.+|.++= T Consensus 1 M~vtL~e~K~hLRId~D~~ddD~lI~~~i~AA~~~i~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~AvLllv~~~Y~NR 80 (107) T protein:vir:45 1 MLLKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRKLYATADDRPADDPDGLVISDDVKLALLLLVSHFYENR 80 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccccccccCChhHHHHHHHHHHHHHhhh Confidence 79999999999995 332 34 667999999999999999988865443222222 34557889998999999875 Q ss_pred CcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCC Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGL 126 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGL 126 (138) ...+. ...+ .. ... ...-+.-+|.=|| T Consensus 81 e~~~~---~~~~----~l------------p~~----v~~Ll~~~R~~~~ 107 (107) T protein:vir:45 81 STVTD---VEKM----EL------------PMS----FNWLVAPYRLIPL 107 (107) T ss_pred hhccc---cchh----cc------------chH----HHHHHHHHhhcCC Confidence 33111 0000 00 011 1222555666666 No 43 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=64.38 E-value=0.25 Score=23.89 Aligned_cols=98 Identities=14% Similarity=0.080 Sum_probs=63.7 Q ss_pred ecccHHHHHHhcCCC--CC-cc--hHHHHHHHHHHHHHHhcchheeeccCCCCCC-H---HHHHHHHHHHHHHHHHHHHh Q lcl|NC_018848. 5 VYATPAQLAQWTGEP--AP-AD--AERLLTRASEDVDDALLTAVYDVDEAGMPTD-P---AVAQALADAACAQVAYRQES 75 (138) Q Consensus 5 vyAt~~~l~~~~g~~--~p-~~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtd-p---~v~~alr~AtcAQV~~~~~~ 75 (138) .+.|.+++|+|+.-+ +. +| +..||.-|++.|...|++.+|+.... +|+. | .+-..||.|+--=|.+|.++ T Consensus 1 M~vtL~e~K~hLRid~D~~ddD~li~~~i~aA~~~i~~~~~r~l~~~~~~-~~~~~~~~~~~~~~ik~Avlllv~~~Y~N 79 (107) T protein:vir:48 1 MLLKEEEIKSHLRLDDGLYSDGDFLKLLAQAVQKRTETYLNRKLYAPEET-IPEDDPDGMHLTDDVRLAMLMLVSHFYEN 79 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhcccccccccc-ccccCccccccchhHHHHHHHHHHHHHhh Confidence 789999999999953 32 34 56799999999999999988775433 3322 2 35567899999999999997 Q ss_pred CCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCC Q lcl|NC_018848. 76 GDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGL 126 (138) Q Consensus 76 G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGL 126 (138) =...+.. +...+ -+ . ...-+.-+|.=|| T Consensus 80 Re~v~~~--~~~~i-----P~------------~----v~~LL~~yR~~~l 107 (107) T protein:vir:48 80 RSTITDV--EKLET-----PM------------S----FRWLAGPYRIVPL 107 (107) T ss_pred hhhhccc--ccccc-----CH------------H----HHHHHHHhhccCC Confidence 5432111 11110 00 0 1122455566666 No 44 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=62.76 E-value=0.33 Score=23.23 Aligned_cols=112 Identities=14% Similarity=0.121 Sum_probs=60.5 Q ss_pred cccHHHHHH-hcC-CCCC-cchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCc-ccc Q lcl|NC_018848. 6 YATPAQLAQ-WTG-EPAP-ADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDT-GTG 81 (138) Q Consensus 6 yAt~~~l~~-~~g-~~~p-~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~-~t~ 81 (138) -.|.++++. |=. .++| +.+...+..|...|+. .+ + .++.+-......|+...+...+.. ..+ T Consensus 1 m~t~~~Fr~~~PeF~~~pd~~i~~~l~~A~~~l~~-~~---~----------g~~~~~~~~L~~AH~l~l~~~~~~~~g~ 66 (119) T protein:vir:52 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAEVSK-VQ---W----------GKLYDRGVMALTAHLLKLSADAEISGGA 66 (119) T ss_pred CCcHHHHHHhhhhccCCCHHHHHHHHHHHHHhhCC-cC---C----------chHHHHHHHHHHHHHHHhhhhhhccccc Confidence 567777766 333 3345 4577788888777753 22 1 123333333455555544433322 123 Q ss_pred cccccceeeeCceeeccCccccccchhh--hh-hhHHHHHHHHhhcCCCCCccC Q lcl|NC_018848. 82 AAGRWSSVSIGPVSMSGPRQSAGGTGAG--SV-DLGEQASRALARAGLTPGEIY 132 (138) Q Consensus 82 ~~g~~~s~sIG~~S~s~~~~~as~~~~~--a~-~ls~~a~~~L~~aGLl~g~~~ 132 (138) ..|..+|.|+|++|+|-+.+........ .+ ..--+-|.+++..|. ||-|= T Consensus 67 ~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g~-Gg~Va 119 (119) T protein:vir:52 67 ANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGV-GVMVA 119 (119) T ss_pred cccceeeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhcC-CCcCC Confidence 4577999999999999655433322211 11 133444666777665 33333 No 45 >protein:vir:2345 Length: 125 # NCBI annotation: gp15 # Family: family:all:2817 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075282;genbank:gi:12657869;genbank:GeneID:920134 Probab=59.35 E-value=0.21 Score=24.26 Aligned_cols=115 Identities=17% Similarity=0.166 Sum_probs=59.0 Q ss_pred ceecccHHHHHHhc-CCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHH-HHHHHHHHHHHHHHHHHHhCC Q lcl|NC_018848. 3 RRVYATPAQLAQWT-GEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPA-VAQALADAACAQVAYRQESGD 77 (138) Q Consensus 3 ~rvyAt~~~l~~~~-g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~-v~~alr~AtcAQV~~~~~~G~ 77 (138) .-=||+.+|+.+.. ...-|++ .+.||..|+.||.+..--=.=.+. +-+.|+. +.+..-+||.- .. T Consensus 1 ma~~A~~eDV~a~w~R~lt~eE~~~V~~~L~~ae~~irrriPdL~~r~~--~~~~~~~~v~~V~a~~V~R----v~---- 70 (125) T protein:vir:23 1 MATLATHEDVTAFWARTPTAEEIVLINRRLAQAERMLLRAIPELLIKAS--SDPVFRAEVIDIEAEAVLR----LV---- 70 (125) T ss_pred CCcccCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhcCChhhhhc--CCCcchhhHHHHHHHHHHH----Hh---- Confidence 45699999999955 5444443 557999999999865431000011 1122322 33322333322 11 Q ss_pred cccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhc--CC---CCCccCCC Q lcl|NC_018848. 78 TGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARA--GL---TPGEIYPP 134 (138) Q Consensus 78 ~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~a--GL---l~g~~~~~ 134 (138) .+.++-.|-|.|+.|+|....-++ ....+.++=|..|... |. .|-..-|. T Consensus 71 ---rnPeGy~seT~g~Yt~~l~~~~~~----g~L~it~~E~a~Lg~~~s~~~vi~p~~~~p~ 125 (125) T protein:vir:23 71 ---RNHEGYLSETDGNYTYMLQAQDPN----RKLEILPEEWEVLGIVRSGLGILVPTVVLPS 125 (125) T ss_pred ---cCCCCccccccchhhhhhhccCCC----CceeecHHHHHhhccccccceEEeeceecCC Confidence 334446667789999986443222 2234555555555432 22 22223333 No 46 >protein:vir:4857 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049397;genbank:gi:9632425;genbank:GeneID:1258493 Probab=59.11 E-value=0.22 Score=24.18 Aligned_cols=101 Identities=14% Similarity=0.011 Sum_probs=54.5 Q ss_pred ecccHHHHHHhcCCCCCc--c-hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPA--D-AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTG 81 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~--~-~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~ 81 (138) -..|.+++++|+.-+.++ + +..||.-|..-|..+++. ++ +.+.+..+..-++.|++..|..|.++=...+. T Consensus 1 M~vtLeevK~~LRID~d~dD~li~~~i~aA~~~i~~~ig~---~~---~~~~~~~~~~~~~~Avl~lv~~~Y~NR~~~~~ 74 (104) T protein:vir:48 1 MSVSKETIMQTLNLDETDDTALIPAYIESARQYVVNSVGD---DP---KFYNLDSVRALFDTAVIALTSSYFTYRVALTD 74 (104) T ss_pred CcccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHHhhCC---CC---CcccccCCChhHHHHHHHHHHHHHhhhhhhcc Confidence 678999999999966664 2 556899999999998873 11 23333444456899999999999998643222 Q ss_pred cccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 82 AAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 82 ~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) .+...-.+++ -|+ +.=| .|+-. .|=.+.|= T Consensus 75 ~~~~~ip~~v--~sl---------------------i~~l--R~~y~--~~~~~~~~ 104 (104) T protein:vir:48 75 TATYPVNLTL--NSI---------------------IGQL--RGLYA--TYSEERGD 104 (104) T ss_pred cccchhhHHH--HHH---------------------HHHH--HHhhh--hhcccCCC Confidence 1111101111 000 0000 00000 11111111 No 47 >protein:vir:4954 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049930;genbank:gi:9632901;genbank:GeneID:1262077 Probab=58.71 E-value=0.17 Score=24.82 Aligned_cols=100 Identities=12% Similarity=-0.025 Sum_probs=53.6 Q ss_pred ecccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTG 81 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~ 81 (138) --.|.+++++|+.-+++++ +..||.-|..-|..+++. +. .+..+..+-..++-|++..|..|.++=...+. T Consensus 1 M~vtLeeiK~~LRID~dddD~li~~~i~aA~~yi~~aig~---~~---~~~~~~~~~~~~~~Avl~Lv~~~YeNR~~~~~ 74 (104) T protein:vir:49 1 MSVSKTSIMQTLNLDETDDTALIPAYIESAKQYIINAVGS---DS---KFYDLDSVRALFDTAVIALTSSYFTYRVALTD 74 (104) T ss_pred CcccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHHhhCC---CC---ccccccCCChHHHHHHHHHHHHHHhhchhccc Confidence 5668999999999666642 557899999999999983 21 22223334455788999999999999543222 Q ss_pred cccccceeeeCceeec-cCccccccchhhhhh Q lcl|NC_018848. 82 AAGRWSSVSIGPVSMS-GPRQSAGGTGAGSVD 112 (138) Q Consensus 82 ~~g~~~s~sIG~~S~s-~~~~~as~~~~~a~~ 112 (138) .+...-.+++...=.. .+.. . .-....++ T Consensus 75 ~~~~~vp~~v~sli~qLr~~y-~-~~~e~~~~ 104 (104) T protein:vir:49 75 TATYPVNLTLNSIIGQLRGLY-A-TYSEERGD 104 (104) T ss_pred cccchhhHHHHHHHHHHHHhh-h-hhhhccCC Confidence 1111111111000000 0000 0 00000011 No 48 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=58.12 E-value=0.36 Score=23.01 Aligned_cols=99 Identities=18% Similarity=0.111 Sum_probs=57.9 Q ss_pred cccHHHHHHhcCCCCCc----c--hHHHHHHHHHHHHHHhcchheeeccC------CCCCCHH---HHHHHHHHHHHHHH Q lcl|NC_018848. 6 YATPAQLAQWTGEPAPA----D--AERLLTRASEDVDDALLTAVYDVDEA------GMPTDPA---VAQALADAACAQVA 70 (138) Q Consensus 6 yAt~~~l~~~~g~~~p~----~--~~rLl~rAS~~VD~~t~~avYdv~~~------GlPtdp~---v~~alr~AtcAQV~ 70 (138) =.|.+++|.++.-+.++ | +..+|.-|++.|.+.|++.+|+...+ -.+.+++ +=.+||.|+.--|. T Consensus 1 mvtLee~K~hLRid~d~~d~DDali~~~i~AA~~~v~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLllvg 80 (115) T protein:vir:97 1 MITLAMMQRHLQAELYEDDERDYVMQQLLPAARESAELFLNRKLYDVQADMLADQVLGVDPSDQLLITRTVEQAILLTVG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccchhhcccccccccCCCcccccCCHHHHHHHHHHHH Confidence 68999999999954332 2 56799999999999999999974321 1111221 44557889999999 Q ss_pred HHHHhCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhh-cCCC Q lcl|NC_018848. 71 YRQESGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALAR-AGLT 127 (138) Q Consensus 71 ~~~~~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~-aGLl 127 (138) +|.++=...+. ++.+.+..| + ..| +.-+|. .|+- T Consensus 81 ~~Y~NRE~v~~--~~~~elP~~---~--------------~~L----L~pyR~~~Gv~ 115 (115) T protein:vir:97 81 EWYSSREQVWI--KGAGLVTSS---A--------------QNL----LHPYRKFAGVR 115 (115) T ss_pred HHHhccccccc--ccccccCHH---H--------------HHH----HHHHHhhcCCC Confidence 99998654211 111111100 0 111 111111 3333 No 49 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=57.30 E-value=0.44 Score=22.55 Aligned_cols=113 Identities=19% Similarity=0.261 Sum_probs=60.6 Q ss_pred CCceecccHHHHHHhcCCC---------------CC-cchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEP---------------AP-ADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADA 64 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~---------------~p-~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~A 64 (138) |. |||.+||.+..|+. ++ .-+.+-|.+||..||..++. +|.+.- + .+=..|++. T Consensus 1 M~---Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL~~-RY~lPl---~---~vP~~L~~~ 70 (138) T protein:vir:10 1 MS---YCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHLHA-RYQLPL---A---QVPVVLKRV 70 (138) T ss_pred CC---cCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHHhh-cccCCc---c---ccchHHHHH Confidence 75 99999999874421 11 22568899999999999874 698752 2 233458888 Q ss_pred HHHHHHHHHHhCCcccc-c-------ccccceeeeCceeeccCccc-cccchhhhhhhHHHHHHHHhhcCCCCCccCCCC Q lcl|NC_018848. 65 ACAQVAYRQESGDTGTG-A-------AGRWSSVSIGPVSMSGPRQS-AGGTGAGSVDLGEQASRALARAGLTPGEIYPPG 135 (138) Q Consensus 65 tcAQV~~~~~~G~~~t~-~-------~g~~~s~sIG~~S~s~~~~~-as~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~ 135 (138) +|.=..||+-.+...+. . -.-...+.=|++++...... .....+.+ .+ =. +.-+| | T Consensus 71 a~dIA~Y~L~~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~~~~~-~~-------~s-----~~r~F--g 135 (138) T protein:vir:10 71 ACVLAFANLHTQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPIANTV-QI-------SS-----QRNDF--G 135 (138) T ss_pred HHHHHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCCCCce-ee-------ec-----CCccC--C Confidence 88888888875543221 1 01144455566666432221 11111110 00 00 01111 1 Q ss_pred CCC Q lcl|NC_018848. 136 VNW 138 (138) Q Consensus 136 ~~~ 138 (138) =.| T Consensus 136 ~d~ 138 (138) T protein:vir:10 136 GTW 138 (138) T ss_pred CCC Confidence 234 No 50 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=56.37 E-value=0.46 Score=22.44 Aligned_cols=127 Identities=15% Similarity=0.040 Sum_probs=70.0 Q ss_pred CCceecccHHHHHH-hc---C-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WT---G-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQE 74 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~---g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~ 74 (138) |..-|| .+++++. |= . +.+|+ .+...+..|-..++. +.+ -+.+. +.+..+.+-....|+...... T Consensus 1 m~~~~f-d~~~Fr~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~-~~~-~~~~~------~g~~~~~~l~Ll~AH~l~L~~ 71 (153) T protein:vir:99 1 MADPVY-NDGLFRIMYPEFADQEKYPPEVIEIYYDTATLFITG-SMF-PCAAL------SGKQLVGALNMLTAHLMSLSM 71 (153) T ss_pred CCcccC-ChHHHHHhcccccCccccCHHHHHHHHHHHHHhhcC-ccc-ccccc------ChHHHHHHHHHHHHHHHHHHh Confidence 887777 4555554 32 2 23454 466788888777763 111 01111 244555555666666655432 Q ss_pred h---CC--cccccccccceeeeCceeeccCccccccchhh--h-hhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 75 S---GD--TGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAG--S-VDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 75 ~---G~--~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~--a-~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) . |. .+.+..|..+|.|+|++|+|-+.+........ . ...--+=|.+++..|.=+ .|++|..= T Consensus 72 ~~~~~~~~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fw~l~~~~~~Gg--~v~gg~pe 141 (153) T protein:vir:99 72 QRSQTALGATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPYGQALWALLKMLSVGG--FAIGGLPE 141 (153) T ss_pred hhhcccccCCCccccceeeeeecceeeeeecCCCCCchhHhhhcCHHHHHHHHHHHHhcccc--cccCCCCc Confidence 2 21 22233566899999999999766544332211 1 113335577777777655 56666554 No 51 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=55.87 E-value=0.47 Score=22.38 Aligned_cols=104 Identities=13% Similarity=0.011 Sum_probs=61.8 Q ss_pred CCceecccHHHHHHhcCCCCCc--c-hHHHHHHHHHHHHHHhcchheeeccC-CCCCCHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAPA--D-AERLLTRASEDVDDALLTAVYDVDEA-GMPTDPAVAQALADAACAQVAYRQESG 76 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p~--~-~~rLl~rAS~~VD~~t~~avYdv~~~-GlPtdp~v~~alr~AtcAQV~~~~~~G 76 (138) |- -+.|.+++|+|+.-+.++ + +..+|.-|+..|-..++...+.+..+ +-+....+-..+|.|++-=|.+|.++= T Consensus 1 mm--~~vtLeevK~hLRId~d~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~~AvLlLv~~~YenR 78 (108) T protein:vir:93 1 MT--ALLTLEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMKGAAMRLTGMLYRNP 78 (108) T ss_pred CC--cCCCHHHHHHHcCCCCCcChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHHHHHHHHHHHHHhcc Confidence 32 378999999999954442 2 66799999999999988766554333 333222233458999999999999986 Q ss_pred CcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcCCC Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAGLT 127 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl 127 (138) ...+... ++. +..-+ ....| +.=+|..-|+ T Consensus 79 e~~~~~~--~~~---~elP~------------~v~~L----l~~~R~p~~~ 108 (108) T protein:vir:93 79 DLAEREE--LLQ---GELPF------------SVSVL----IYDLRCPTVL 108 (108) T ss_pred ccccccc--ccc---ccCCH------------HHHHH----HHHccccccC Confidence 5421111 000 00000 01112 4445555555 No 52 >protein:vir:4831 Length: 105 # NCBI annotation: ORF27 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038328;genbank:gi:9634654;genbank:GeneID:1262588 Probab=55.47 E-value=0.26 Score=23.75 Aligned_cols=102 Identities=11% Similarity=-0.011 Sum_probs=55.6 Q ss_pred ecccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_018848. 5 VYATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTG 81 (138) Q Consensus 5 vyAt~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~ 81 (138) -..|.+++++|+.-+.+++ +..||.-|..-|..+++.. .....+..+...++.|+...|..|.++=...+. T Consensus 1 M~vtLee~K~~LRID~dddD~lI~~~i~aA~~yi~~~ig~~------~~~~~~~~~~~~~~~Avl~lv~~~YeNR~~~~~ 74 (105) T protein:vir:48 1 MSVSKTSIMQTLNLDETDDTALIPAYIESAKQYIINAVGSD------SKFYDLENVQPLFDTAVIALTSSYFTYRVALTD 74 (105) T ss_pred CcccHHHHHHHcCCCCccchHHHHHHHHHHHHHHHHhhCCC------CccccccCCchHHHHHHHHHHHHHHhhhhhccC Confidence 6789999999999655532 5679999999999998731 111222234455899999999999999643221 Q ss_pred cccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcC Q lcl|NC_018848. 82 AAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAG 125 (138) Q Consensus 82 ~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aG 125 (138) .+...-.+++ -|+= .-.|-+..-+.-...-| T Consensus 75 ~~~~~ip~~v--~sli-----------~~lR~~y~~~~e~~~~g 105 (105) T protein:vir:48 75 TVTYPINLTL--NSII-----------GQLRGLYATYSEVVANG 105 (105) T ss_pred cccchhhHHH--HHHH-----------HHHhhhhhhhhhcccCC Confidence 1111100000 0000 00011112222223333 No 53 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=47.55 E-value=0.7 Score=21.44 Aligned_cols=127 Identities=18% Similarity=0.134 Sum_probs=62.6 Q ss_pred CCceecccHHHHHH-hcC-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHH-H-- Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTG-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQ-E-- 74 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~-~-- 74 (138) --++|=-.+++++. |=. .++|+ .+...+.-|-.++-.-+.+..++ +...++.+-.-..|+..... . T Consensus 4 ~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~--------~~~~r~~ll~LltAHll~L~~~~~ 75 (158) T protein:vir:10 4 PPYRITFDPAGFIAEYPEFATVATPRLQAMFNQAQTALLDNTGGSPVT--------DDNVLRELFNMLVAHLLTLFGATP 75 (158) T ss_pred CCceEEcChHHHHHhchhhccCCHHHHHHHHHHhhhhhcCCCcccccc--------ChhHHHHHHHHHHHHHHHHhHhhh Confidence 23444445555554 222 22343 45556666655443333433221 34555555555556654432 1 Q ss_pred hCCcccccccccceeeeCceeeccCccccccchh----hhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 75 SGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGA----GSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 75 ~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~----~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) .+. ..+..|..+|.|+|++|+|-+.+..+.+.. .....--+-|.+++..|.=| .|+.|..= T Consensus 76 ~~a-~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Gg--y~~gg~pe 140 (158) T protein:vir:10 76 TSA-NSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSAR--YMVSGGSG 140 (158) T ss_pred ccc-cCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccc--cccccCCc Confidence 222 235577899999999999975544322221 11223445566666666533 33333211 No 54 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=47.55 E-value=0.7 Score=21.44 Aligned_cols=127 Identities=18% Similarity=0.134 Sum_probs=62.6 Q ss_pred CCceecccHHHHHH-hcC-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHH-H-- Q lcl|NC_018848. 1 MDRRVYATPAQLAQ-WTG-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQ-E-- 74 (138) Q Consensus 1 ~~~rvyAt~~~l~~-~~g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~-~-- 74 (138) --++|=-.+++++. |=. .++|+ .+...+.-|-.++-.-+.+..++ +...++.+-.-..|+..... . T Consensus 4 ~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~~~~~~~s~~~--------~~~~r~~ll~LltAHll~L~~~~~ 75 (158) T protein:vir:78 4 PPYRITFDPAGFIAEYPEFATVATPRLQAMFNQAQTALLDNTGGSPVT--------DDNVLRELFNMLVAHLLTLFGATP 75 (158) T ss_pred CCceEEcChHHHHHhchhhccCCHHHHHHHHHHhhhhhcCCCcccccc--------ChhHHHHHHHHHHHHHHHHhHhhh Confidence 23444445555554 222 22343 45556666655443333433221 34555555555556654432 1 Q ss_pred hCCcccccccccceeeeCceeeccCccccccchh----hhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 75 SGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGA----GSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 75 ~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~----~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) .+. ..+..|..+|.|+|++|+|-+.+..+.+.. .....--+-|.+++..|.=| .|+.|..= T Consensus 76 ~~a-~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Gg--y~~gg~pe 140 (158) T protein:vir:78 76 TSA-NSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSAR--YMVSGGSG 140 (158) T ss_pred ccc-cCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccc--cccccCCc Confidence 222 235577899999999999975544322221 11223445566666666533 33333211 No 55 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=47.34 E-value=0.7 Score=21.42 Aligned_cols=128 Identities=20% Similarity=0.166 Sum_probs=60.2 Q ss_pred CC---ceecccHHHHHH-hcC-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 1 MD---RRVYATPAQLAQ-WTG-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQE 74 (138) Q Consensus 1 ~~---~rvyAt~~~l~~-~~g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~ 74 (138) |. ++|=-.+++++. |=. .++|+ .+...+.-|-.++=.-+.+.. =.|...++.+-.-..|+...... T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~--------~~~~~~r~~ll~LltAHll~L~~ 72 (158) T protein:vir:10 1 MSTPPYRITFDPAGFIAEYPEFATVPTPRLQAMFNQAQAALLDNTGGSP--------VTDDNVLRELFNMLVAHLLTLFS 72 (158) T ss_pred CCCCCceEEcChHHHHHhCcccccCCHHHHHHHHHhhhheeeCCccccc--------ccChHHHHHHHHHHHHHHHHHhh Confidence 43 444445555554 222 33443 344444444322211122111 12456666665566666655533 Q ss_pred hCC--cccccccccceeeeCceeeccCccccccchh----hhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 75 SGD--TGTGAAGRWSSVSIGPVSMSGPRQSAGGTGA----GSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 75 ~G~--~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~----~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) ... ...+..|..+|-|+|++|+|-+.+..+.+.. .....--+-|.+++..|.=+ .++.|..= T Consensus 73 ~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg--~v~Gg~pe 140 (158) T protein:vir:10 73 AAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSAR--YMVSGGSG 140 (158) T ss_pred hhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccc--cccccCCc Confidence 322 2123457899999999999975433222221 11223444566666666533 33333211 No 56 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=47.34 E-value=0.7 Score=21.42 Aligned_cols=128 Identities=20% Similarity=0.166 Sum_probs=60.2 Q ss_pred CC---ceecccHHHHHH-hcC-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 1 MD---RRVYATPAQLAQ-WTG-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQE 74 (138) Q Consensus 1 ~~---~rvyAt~~~l~~-~~g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~ 74 (138) |. ++|=-.+++++. |=. .++|+ .+...+.-|-.++=.-+.+.. =.|...++.+-.-..|+...... T Consensus 1 ~~~~~~~v~Fd~a~FR~~fPeFa~~pd~~i~~~~~~A~~~~l~n~~~s~--------~~~~~~r~~ll~LltAHll~L~~ 72 (158) T protein:vir:36 1 MSTPPYRITFDPAGFIAEYPEFATVPTPRLQAMFNQAQAALLDNTGGSP--------VTDDNVLRELFNMLVAHLLTLFS 72 (158) T ss_pred CCCCCceEEcChHHHHHhCcccccCCHHHHHHHHHhhhheeeCCccccc--------ccChHHHHHHHHHHHHHHHHHhh Confidence 43 444445555554 222 33443 344444444322211122111 12456666665566666655533 Q ss_pred hCC--cccccccccceeeeCceeeccCccccccchh----hhhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 75 SGD--TGTGAAGRWSSVSIGPVSMSGPRQSAGGTGA----GSVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 75 ~G~--~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~----~a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) ... ...+..|..+|-|+|++|+|-+.+..+.+.. .....--+-|.+++..|.=+ .++.|..= T Consensus 73 ~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg--~v~Gg~pe 140 (158) T protein:vir:36 73 AAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSAR--YMVSGGSG 140 (158) T ss_pred hhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccc--cccccCCc Confidence 322 2123457899999999999975433222221 11223444566666666533 33333211 No 57 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=47.15 E-value=0.71 Score=21.40 Aligned_cols=113 Identities=19% Similarity=0.224 Sum_probs=62.0 Q ss_pred CCceecccHHHHHHhcCCC---------------CC-cchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEP---------------AP-ADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADA 64 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~---------------~p-~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~A 64 (138) |. |||.+||.+-.|+. ++ .-+.+-|.+||..||-.++. +|.+.- + .+=..|++. T Consensus 1 M~---YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~-RY~lPl---~---~vP~~L~~~ 70 (138) T protein:vir:99 1 MS---YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHG-RYQLPL---A---SVPTALKRI 70 (138) T ss_pred CC---CCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhh-cccCCc---c---ccchHHHHH Confidence 75 99999999876631 11 12557899999999999874 688652 2 233558888 Q ss_pred HHHHHHHHHHhCCcccc-cc-------cccceeeeCceeeccCccccc-cchhhhhhhHHHHHHHHhhcCCCCCccCCCC Q lcl|NC_018848. 65 ACAQVAYRQESGDTGTG-AA-------GRWSSVSIGPVSMSGPRQSAG-GTGAGSVDLGEQASRALARAGLTPGEIYPPG 135 (138) Q Consensus 65 tcAQV~~~~~~G~~~t~-~~-------g~~~s~sIG~~S~s~~~~~as-~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~ 135 (138) +|.=..|++-.+...+. .. .-...++=|++|+........ ...+.+ ... -+.-+| + T Consensus 71 a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~-~~~------------~~~r~F--~ 135 (138) T protein:vir:99 71 ACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTV-QIS------------EGRNDW--G 135 (138) T ss_pred HHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCce-eee------------cCCCCC--C Confidence 88888888776543221 11 114455667777653322111 000000 000 000011 2 Q ss_pred CCC Q lcl|NC_018848. 136 VNW 138 (138) Q Consensus 136 ~~~ 138 (138) =|| T Consensus 136 Rd~ 138 (138) T protein:vir:99 136 ADW 138 (138) T ss_pred CCC Confidence 356 No 58 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=47.15 E-value=0.71 Score=21.40 Aligned_cols=113 Identities=19% Similarity=0.224 Sum_probs=62.0 Q ss_pred CCceecccHHHHHHhcCCC---------------CC-cchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEP---------------AP-ADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADA 64 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~---------------~p-~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~A 64 (138) |. |||.+||.+-.|+. ++ .-+.+-|.+||..||-.++. +|.+.- + .+=..|++. T Consensus 1 M~---YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~-RY~lPl---~---~vP~~L~~~ 70 (138) T protein:vir:79 1 MS---YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHG-RYQLPL---A---SVPTALKRI 70 (138) T ss_pred CC---CCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhh-cccCCc---c---ccchHHHHH Confidence 75 99999999876631 11 12557899999999999874 688652 2 233558888 Q ss_pred HHHHHHHHHHhCCcccc-cc-------cccceeeeCceeeccCccccc-cchhhhhhhHHHHHHHHhhcCCCCCccCCCC Q lcl|NC_018848. 65 ACAQVAYRQESGDTGTG-AA-------GRWSSVSIGPVSMSGPRQSAG-GTGAGSVDLGEQASRALARAGLTPGEIYPPG 135 (138) Q Consensus 65 tcAQV~~~~~~G~~~t~-~~-------g~~~s~sIG~~S~s~~~~~as-~~~~~a~~ls~~a~~~L~~aGLl~g~~~~~~ 135 (138) +|.=..|++-.+...+. .. .-...++=|++|+........ ...+.+ ... -+.-+| + T Consensus 71 a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~-~~~------------~~~r~F--~ 135 (138) T protein:vir:79 71 ACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTV-QIS------------EGRNDW--G 135 (138) T ss_pred HHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCce-eee------------cCCCCC--C Confidence 88888888776543221 11 114455667777653322111 000000 000 000011 2 Q ss_pred CCC Q lcl|NC_018848. 136 VNW 138 (138) Q Consensus 136 ~~~ 138 (138) =|| T Consensus 136 Rd~ 138 (138) T protein:vir:79 136 ADW 138 (138) T ss_pred CCC Confidence 356 No 59 >protein:vir:80668 Length: 153 # NCBI annotation: gp7 # Family: family:all:7267 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285583;genbank:gi:148727089;genbank:GeneID:5247039 Probab=43.22 E-value=0.85 Score=20.96 Aligned_cols=114 Identities=16% Similarity=0.144 Sum_probs=65.8 Q ss_pred ceecccHHHHHHhcCCCCC-cchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHH-HHHHHHHHHHHHHHhCCccc Q lcl|NC_018848. 3 RRVYATPAQLAQWTGEPAP-ADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQA-LADAACAQVAYRQESGDTGT 80 (138) Q Consensus 3 ~rvyAt~~~l~~~~g~~~p-~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~a-lr~AtcAQV~~~~~~G~~~t 80 (138) --|-+|++||..|. +.| +.++-+|+.|--|-.+.--| +++.+++ .++-..+ ||.| |.-|.+.|.. T Consensus 1 m~v~i~~~Dl~pF~--dI~~~k~~ami~D~~a~A~~vAPC----i~~~~f~-~~~aAKaIlrgA----iLRW~e~G~S-- 67 (153) T protein:vir:80 1 MGIILKPEDIEPFA--DIPREKLEAMIADVEAVAVSVAPC----IAKPDFK-YKDAAKAILRRA----LLRWNDTGVS-- 67 (153) T ss_pred Cceeechhhccccc--cCCHHHHHHHHHhhhhhhhhhccc----cCCCCcc-cHHHHHHHHHHH----hhhhhhcCcc-- Confidence 35778999999996 334 45666777766665555444 5555555 3444444 5777 5678887754 Q ss_pred ccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHh----hcCCCCCc--c--CCCCCCC Q lcl|NC_018848. 81 GAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALA----RAGLTPGE--I--YPPGVNW 138 (138) Q Consensus 81 ~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~----~aGLl~g~--~--~~~~~~~ 138 (138) |+.++-|-|.|-++.+..+ .+ +.++|+=++-|+ ..|=-|+. | -|-+-+| T Consensus 68 ---Gait~~taGpf~qT~dtrs--~r----~lfwPSEItqLqklC~~~~~~g~Af~id~t~~~~v~ 124 (153) T protein:vir:80 68 ---GQVQYESAGPFAQTTRSNT--PT----NLLWPSEIAALKKLCEGDGGAGKAFTITPTMRSSVN 124 (153) T ss_pred ---cceeeeccccceeeeccCC--ce----eccChhhHHHHHHHhcCCCCCcceeEeecCCCCccc Confidence 5566667788777743322 11 234666555554 23433322 1 2345556 No 60 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=41.29 E-value=0.93 Score=20.75 Aligned_cols=98 Identities=15% Similarity=0.117 Sum_probs=56.0 Q ss_pred cccHHHHHHhcCCCCC---cc---hHHHHHHHHHHHHHHhcchheeeccCCCC---------CC-HHHHHHHHHHHHHHH Q lcl|NC_018848. 6 YATPAQLAQWTGEPAP---AD---AERLLTRASEDVDDALLTAVYDVDEAGMP---------TD-PAVAQALADAACAQV 69 (138) Q Consensus 6 yAt~~~l~~~~g~~~p---~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlP---------td-p~v~~alr~AtcAQV 69 (138) =.|.+++|.++.-+++ ++ +..||.-|.+.|.+.|++.+|+...+ ++ .+ ..+=..||.|+.--| T Consensus 1 mvtLe~~K~hLRid~~d~d~dD~li~~~i~AA~~~v~~~~~r~l~~~~~~-~~~~~~~~~~~~~~~~~p~~i~~AiLLlv 79 (115) T protein:vir:10 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQAD-MLADQAAGVDPAGQLLITRTVEQAILLTV 79 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccccccc-cccccccccCCcccccCChHHHHHHHHHH Confidence 6899999999985332 22 66899999999999999998863211 11 11 114466899999999 Q ss_pred HHHHHhCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhh-cCCC Q lcl|NC_018848. 70 AYRQESGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALAR-AGLT 127 (138) Q Consensus 70 ~~~~~~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~-aGLl 127 (138) .+|.++=...+. ++.+-+..| +. .| +.-+|. .|.- T Consensus 80 g~~Y~nRe~~~~--~~~~elP~~---v~--------------~L----L~pyR~~~gv~ 115 (115) T protein:vir:10 80 GEWYANREQVWV--KGVGLVTSS---AQ--------------NL----LHPYRKFAGVR 115 (115) T ss_pred HHHHhcchhccc--chhhhcCHH---HH--------------HH----HHHHHhcCCCC Confidence 999997543211 111111111 00 00 000111 1111 No 61 >protein:vir:4602 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058447;genbank:gi:9635173;genbank:GeneID:1262723 Probab=40.74 E-value=0.37 Score=22.94 Aligned_cols=102 Identities=12% Similarity=-0.063 Sum_probs=51.7 Q ss_pred ecccHH---HHHHhcCCCCCc--c-hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_018848. 5 VYATPA---QLAQWTGEPAPA--D-AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDT 78 (138) Q Consensus 5 vyAt~~---~l~~~~g~~~p~--~-~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~ 78 (138) .=.|.+ +++.|+.-+.++ + +..+|.-|.+-|..+++ .+.+ ..+.+.+.-..|+-|++.+|..|.++=.. T Consensus 1 M~~t~~dL~~iK~~lRID~d~DD~li~~yi~AA~~yI~~aig---~~~~--~~~~~~~~~~~~~~Avl~Lv~~~YeNR~a 75 (110) T protein:vir:46 1 MQLTAEELKLLKKHCKIDHNSEDDLLEIYYSWAFHEIASAVT---DEPS--KYIDWFKSHPLFARATYPLASYYFENRIA 75 (110) T ss_pred CcccHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHhhcc---CCcc--cccCccCcchHHHHHHHHHHHHHHHhccc Confidence 334544 478899966664 3 55799999999999887 2222 22333344456899999999999998643 Q ss_pred ccccccccceeeeCceeec--cCccccccchhhhhhh Q lcl|NC_018848. 79 GTGAAGRWSSVSIGPVSMS--GPRQSAGGTGAGSVDL 113 (138) Q Consensus 79 ~t~~~g~~~s~sIG~~S~s--~~~~~as~~~~~a~~l 113 (138) .+... ..-+..|=-|+= --+..+..-..+-.++ T Consensus 76 ~~~~~--~~~vp~~v~slI~qLRg~y~~~~e~e~~~~ 110 (110) T protein:vir:46 76 YLDRD--LSLAPHMVLSTVHKLRGSFEQFLESENDEI 110 (110) T ss_pred ccccc--cccccHHHHHHHHHHHHhHhHhhcccccCC Confidence 22211 111111111110 0000000000000111 No 62 >protein:vir:1384 Length: 92 # NCBI annotation: Gp7 protein # Family: family:all:316 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612836;genbank:gi:20065970;genbank:GeneID:935785 Probab=40.08 E-value=0.6 Score=21.79 Aligned_cols=84 Identities=15% Similarity=0.047 Sum_probs=49.3 Q ss_pred ccHHHHHHhcCCCCCc-c--hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCccc-cc Q lcl|NC_018848. 7 ATPAQLAQWTGEPAPA-D--AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGT-GA 82 (138) Q Consensus 7 At~~~l~~~~g~~~p~-~--~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t-~~ 82 (138) .|.+++++|+.-+.++ | +..||.-|...|..+++. .+ +.-..+|.|++..|..|.++=...+ +. T Consensus 1 vtLeevK~~LRID~ddDD~lI~~~i~aA~~~i~~~~~~-~~-----------~~~~~~~~Avlllv~~~YenR~~~~~~~ 68 (92) T protein:vir:13 1 MDLRELKEYLRIDFEEDDILLRSLLLAAEEYLYNAGIK-RD-----------YKKSLYSLAIKILVKHWYDNRDCVVAGN 68 (92) T ss_pred CCHHHHHHHcCCCCCcchHHHHHHHHHHHHHHHhhccc-cc-----------cchhHHHHHHHHHHHHhHhccccccccc Confidence 9999999999965553 2 667999999999988873 11 1123578999999999999864311 11 Q ss_pred ccccceee----eCceeeccCccc Q lcl|NC_018848. 83 AGRWSSVS----IGPVSMSGPRQS 102 (138) Q Consensus 83 ~g~~~s~s----IG~~S~s~~~~~ 102 (138) ....-.++ |..-.+.++... T Consensus 69 ~~~~ip~~v~sll~~lR~~~~~~~ 92 (92) T protein:vir:13 69 VNNKLEYSLNAILTQLRYCGDDNG 92 (92) T ss_pred hhhhhhHHHHHHHHHhhhccCCCC Confidence 11100000 111111111111 No 63 >protein:vir:99922 Length: 165 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655526;genbank:gi:109392296;genbank:GeneID:4157091 Probab=37.01 E-value=1.1 Score=20.27 Aligned_cols=122 Identities=18% Similarity=0.197 Sum_probs=61.3 Q ss_pred CCceecccHHHHHHhcCCCCCcchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHH-HHHHHHHHHHHHHHhCCcc Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAPADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQA-LADAACAQVAYRQESGDTG 79 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~a-lr~AtcAQV~~~~~~G~~~ 79 (138) -..-|-.|++||.-|..-+ ++.++-+|+.|--|-.+.--| |++.+++ .++-..+ ||.| |.-|.+.| T Consensus 7 ~~p~~ii~~eDl~Pf~~i~-~~ka~~mI~da~A~A~~vAPC----i~~~~f~-~~~aAKaIlrgA----iLRW~e~G--- 73 (165) T protein:vir:99 7 TEPEPLLTAEDLAPFATIP-KAKADEMIEDALGMAEVHAPC----INDPGFA-HRRAAKAILRGA----ILRWNEAG--- 73 (165) T ss_pred CCcceeeehhhccccccCC-HHHHHHHHhhhhhhhhhhccc----cCCCCcc-cHHHHHHHHHHh----hhhhhccc--- Confidence 5567788889987774222 345555555555554444444 4555555 2443344 5777 46788776 Q ss_pred cccccccceeeeCceeeccCcccccc------chhhhhhhHH--------------------------HHHHHHhhcCC- Q lcl|NC_018848. 80 TGAAGRWSSVSIGPVSMSGPRQSAGG------TGAGSVDLGE--------------------------QASRALARAGL- 126 (138) Q Consensus 80 t~~~g~~~s~sIG~~S~s~~~~~as~------~~~~a~~ls~--------------------------~a~~~L~~aGL- 126 (138) .|+.++.|.|.|-++.++.+... .-.+.++||. -.+--++.+|- T Consensus 74 ---SGAit~~TaGPf~qT~DtRs~r~~mfwPSEItqLqklC~~~g~~~~AFsIDt~p~g~v~Hs~~Cs~~fGg~CSCGav 150 (165) T protein:vir:99 74 ---AGAATTKTAGIYGQTVDTRQPRKAMFFPSEIDQLRKLCRPDDDNGGAFSIDLLPQETVTHAEICSIYFGGGCSCGAI 150 (165) T ss_pred ---CceeeecccccceeeeccccccccccChhhHHHHHHHhcCCCCCCcceeeecccCCCcccccccceeecCcccchhh Confidence 45666677788777765543221 1122333332 11111111111 Q ss_pred -C-CCccCCCCCCC Q lcl|NC_018848. 127 -T-PGEIYPPGVNW 138 (138) Q Consensus 127 -l-~g~~~~~~~~~ 138 (138) . +|-||-..-.| T Consensus 151 l~~~gplwe~~~~~ 164 (165) T protein:vir:99 151 LTQGLPLYEKNNGW 164 (165) T ss_pred hccCCccccccCCC Confidence 1 23344444444 No 64 >protein:vir:3871 Length: 99 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680488;swissprot:trembl:q8ltb9;genbank:gi:22296528;uniprot:Q8LTB9;genbank:GeneID:951686 Probab=35.95 E-value=0.37 Score=22.93 Aligned_cols=95 Identities=9% Similarity=0.083 Sum_probs=56.6 Q ss_pred HhcC--CCCCcchH-HHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCCcccccccccceee Q lcl|NC_018848. 14 QWTG--EPAPADAE-RLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGDTGTGAAGRWSSVS 90 (138) Q Consensus 14 ~~~g--~~~p~~~~-rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~~~t~~~g~~~s~s 90 (138) =|+. .++++++. .++--|...|-.+.. +.-.++++=.-|++..-|+-|+-.||..|.+.=...+......-.++ T Consensus 1 lylRID~d~DDelL~~~i~aAe~~Ik~AI~---~~s~~~~F~en~~~~~rF~~Av~~lv~~~Y~~R~~~sd~~~~~i~~g 77 (99) T protein:vir:38 1 MYLKVDQTIEDPMIMQLVNDACGEISSAIS---FGSNPEQFLSNPETRDRFFTALMKQVKEDYDYRGMGAEVMRFPLQTS 77 (99) T ss_pred CeeeccCCcchHHHHHHHhHHHHHHHHhhc---CCCCccchhccccchhHHHHHHHHHHHHHHhhhcccccceeccchhh Confidence 3444 44555644 599999999999987 66677777777889999999999999999888433222222111111 Q ss_pred e-CceeeccCccccccchhhhh Q lcl|NC_018848. 91 I-GPVSMSGPRQSAGGTGAGSV 111 (138) Q Consensus 91 I-G~~S~s~~~~~as~~~~~a~ 111 (138) | +-++.=.|.-..-.+...++ T Consensus 78 v~~iI~QLRge~~~~~~~~d~~ 99 (99) T protein:vir:38 78 TTNIINQLRSELPEEDGDSDAN 99 (99) T ss_pred HHHHHHHhhcccccccCCCCCC Confidence 1 11111123333333334444 No 65 >protein:vir:1271 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:316 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690763;genbank:gi:22855003;genbank:GeneID:955211 Probab=34.53 E-value=1.3 Score=19.99 Aligned_cols=96 Identities=13% Similarity=0.105 Sum_probs=50.8 Q ss_pred ccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhCC-ccccc Q lcl|NC_018848. 7 ATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQESGD-TGTGA 82 (138) Q Consensus 7 At~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G~-~~t~~ 82 (138) .|.+++++|+.-+.+++ +..||.-|...|..+++. .|+ ..| .++.|++..|.+|.++=. ...+. T Consensus 1 ltLeevK~~LRID~ddDD~lI~~li~AA~~yi~~~~g~-~~e-------~~~----~~~~Avl~Lv~~~YeNRe~~~~~~ 68 (115) T protein:vir:12 1 MDLEAIKNYLKVEHEEDDRQLLNQMAAAKSYIINGIGR-YIE-------GHP----QFELVLQMLVQHWYENKGIYETGG 68 (115) T ss_pred CCHHHHHHHcCCCCccchHHHHHHHHHHHHHHHHHhCC-CCC-------Cch----hHHHHHHHHHHHHHhccccccccc Confidence 99999999999665532 557999999999999984 221 122 478999999999999843 21121 Q ss_pred ccccceeee----Cce---eec--------cCccccccchhhhhhhHHHHHHHHhhcC Q lcl|NC_018848. 83 AGRWSSVSI----GPV---SMS--------GPRQSAGGTGAGSVDLGEQASRALARAG 125 (138) Q Consensus 83 ~g~~~s~sI----G~~---S~s--------~~~~~as~~~~~a~~ls~~a~~~L~~aG 125 (138) ....-.+++ ..- +.. ....+-+...+..++ || T Consensus 69 ~~~~lp~~v~sll~~lR~~~~~~~e~~~~~~~~~~~~~~~~~~~~-----------~~ 115 (115) T protein:vir:12 69 TGLSIPFTAENILTQLRYVSVEEQENEKKDQPTPAPSDLSKEKRD-----------TG 115 (115) T ss_pred chhhhhHHHHHHHHHhhcccCCcccccccccCCCCCCCccccccc-----------cC Confidence 111011111 000 000 000011111111111 11 No 66 >protein:vir:81255 Length: 180 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456738;genbank:gi:157168381;uniprot:Q9MBJ7;genbank:GeneID:5580378 Probab=32.88 E-value=1.4 Score=19.79 Aligned_cols=114 Identities=16% Similarity=0.126 Sum_probs=63.0 Q ss_pred CCceecccHHHHHHhcCCCCC--cchHHHHHHHHHHHHHHhcchhee-------eccCC-----CCCCHH--HHHH---- Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGEPAP--ADAERLLTRASEDVDDALLTAVYD-------VDEAG-----MPTDPA--VAQA---- 60 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~~~p--~~~~rLl~rAS~~VD~~t~~avYd-------v~~~G-----lPtdp~--v~~a---- 60 (138) ||---=.||+++.++.+...+ .+..++|..|+..|++.++..+.. ++..| ||+-|- |.+. T Consensus 1 ~~~p~~l~p~~~~~~~~g~~~~~~~~q~~l~aA~aavRr~cGwhv~pv~~~t~~ldg~G~~~l~LPt~~vvsV~sV~~dG 80 (180) T protein:vir:81 1 MQPPHGLTPEILRTYPGGHLLSKDLTQEHVDAVVATVRKLCGWHVFPVATTEYSFPWRGDPEFLVPTKRLVSVESVTCGD 80 (180) T ss_pred CCCCccCCcchhhhhhccccCCchhhHHHHHHHHHHHHHHhCCcccceeeeEEEEecCCCeeEeCCCCcceeeeeEEECC Confidence 998888999999999885444 467799999999999999866652 22222 333211 0000 Q ss_pred ------------------H-HH--------------------------HH-HHHHHHHHHhCCcccccccccceeeeCce Q lcl|NC_018848. 61 ------------------L-AD--------------------------AA-CAQVAYRQESGDTGTGAAGRWSSVSIGPV 94 (138) Q Consensus 61 ------------------l-r~--------------------------At-cAQV~~~~~~G~~~t~~~g~~~s~sIG~~ 94 (138) + |. |+ |.|...- + ....++.++.+.|++ T Consensus 81 ~~v~~~~~~~~~~~~~G~l~r~~G~~~rg~~~V~Vt~~hGye~vP~~~aVi~~~a~ra---~---~s~~~~v~~~tvG~~ 154 (180) T protein:vir:81 81 LSIPNEDIVFYPYGEVNLLRRVHGTPWRVARPMTVTMTHGYEDAPGLVGVIAQMLTRA---F---TSTGGGDGNLTVGNM 154 (180) T ss_pred eeeCCccceecccCCCCeeEecCCccccccceEEEEEEeCCCCCchHHHHHHHHHHHh---c---cccccccccceecce Confidence 0 00 11 1111100 1 122345677888888 Q ss_pred eeccCccccccchhhhhhhHHHHHHHHhhcCCCCCcc Q lcl|NC_018848. 95 SMSGPRQSAGGTGAGSVDLGEQASRALARAGLTPGEI 131 (138) Q Consensus 95 S~s~~~~~as~~~~~a~~ls~~a~~~L~~aGLl~g~~ 131 (138) ||+.. .+.+ +.++.+.+|..=-|=| + T Consensus 155 S~~~~-~~~~--------~~~~e~aiLdrYrl~~--~ 180 (180) T protein:vir:81 155 SYGLS-TGIT--------PKSSEWLIIDQYRLHP--V 180 (180) T ss_pred eeccc-cCCC--------ccHHHHHHHHhhhccC--C Confidence 88422 1122 3334466666554433 2 No 67 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=32.53 E-value=1.4 Score=19.75 Aligned_cols=100 Identities=13% Similarity=0.061 Sum_probs=53.5 Q ss_pred eecccHHHHHHhcCC--CCC-c-c-hHHHHHHHHHHHHHHhcchheeeccC--CCCCCHH---HHHHHHHHHHHHHHHHH Q lcl|NC_018848. 4 RVYATPAQLAQWTGE--PAP-A-D-AERLLTRASEDVDDALLTAVYDVDEA--GMPTDPA---VAQALADAACAQVAYRQ 73 (138) Q Consensus 4 rvyAt~~~l~~~~g~--~~p-~-~-~~rLl~rAS~~VD~~t~~avYdv~~~--GlPtdp~---v~~alr~AtcAQV~~~~ 73 (138) -=..|.++++.++.- ++. + + +..++.-|-..+.+.|++.+|+.+++ +.|++++ +-+.+|.|+--=|..|. T Consensus 1 m~mitLeeiK~hlRid~D~~~eD~lL~~y~~AA~~~~e~~~~rkLy~~~~~~~~~p~~~~gl~~~~di~~A~Lllv~hwY 80 (110) T protein:vir:57 1 MGMTSLSNVKTQLRLEEDFTEHDDFIESLIDAAQRSIERTYYCVLVDSQEALEKLPEGVRGFLIEPDTQLAARMMVAQWY 80 (110) T ss_pred CCCCCHHHHHHHcCCCCCCChhHHHHHHHHHHHHHHHHHHhCCcccCCccccccCCCCCCccccCHHHHHHHHHHHHHHH Confidence 246899999999994 332 2 3 44688889999999999999986543 3343332 22334555555678899 Q ss_pred HhCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHHHHhhcC Q lcl|NC_018848. 74 ESGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASRALARAG 125 (138) Q Consensus 74 ~~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~~L~~aG 125 (138) ++=...+..+. +.+ .+++. .. +.|- -+-+= T Consensus 81 eNREav~~~~~--~~~---P~~v~----------~L---l~P~----~~~~~ 110 (110) T protein:vir:57 81 LNPKGTSPDGD--TPA---QLGVE----------YL---LFPL----MEHTV 110 (110) T ss_pred hcccccccccc--cch---hHHHH----------HH---HHHH----HhhcC Confidence 98654211111 111 11111 00 0000 00000 No 68 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=29.71 E-value=1.6 Score=19.41 Aligned_cols=116 Identities=18% Similarity=0.183 Sum_probs=56.3 Q ss_pred CCceecccHHHHHHhcCC---------C-----CCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVYATPAQLAQWTGE---------P-----APA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAA 65 (138) Q Consensus 1 ~~~rvyAt~~~l~~~~g~---------~-----~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~At 65 (138) |- |||.+||.+..|+ . .++ -+.+-|.+||..||..++. +|.+.- + .+=..|++.+ T Consensus 1 M~---Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgyL~~-RY~lPl---~---~~P~~L~~~a 70 (141) T protein:vir:19 1 MN---YATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGYLAA-RFVLPL---T---VVPSLLKRQC 70 (141) T ss_pred CC---cCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHHHhh-cccCCc---c---ccchHHHHHH Confidence 65 9999999986652 1 111 2457899999999999875 688652 2 2334478888 Q ss_pred HHHHHHHHHhCCccccc-------ccccceeeeCceeeccCccc-cccchhhhhhh--HHHHHHHHhhcCCC Q lcl|NC_018848. 66 CAQVAYRQESGDTGTGA-------AGRWSSVSIGPVSMSGPRQS-AGGTGAGSVDL--GEQASRALARAGLT 127 (138) Q Consensus 66 cAQV~~~~~~G~~~t~~-------~g~~~s~sIG~~S~s~~~~~-as~~~~~a~~l--s~~a~~~L~~aGLl 127 (138) |.=..||+-..-.+... -.-...+.=|++++...... ....+...... .+-++.= ...|.| T Consensus 71 ~dIA~Y~L~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~r~f~r-~~~G~~ 141 (141) T protein:vir:19 71 CVVAWFYLNESQPTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDPPVFSR-KQKGFI 141 (141) T ss_pred HHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEeecCCcccCc-ccccCC Confidence 87777766543211110 01133444466555321111 00000000000 0111110 112333 No 69 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=28.92 E-value=1.7 Score=19.31 Aligned_cols=99 Identities=14% Similarity=0.071 Sum_probs=58.2 Q ss_pred cccHHHHHHhcCCCCCc---c---hHHHHHHHHHHHHHHhcchheeeccC------CCCCCHH---HHHHHHHHHHHHHH Q lcl|NC_018848. 6 YATPAQLAQWTGEPAPA---D---AERLLTRASEDVDDALLTAVYDVDEA------GMPTDPA---VAQALADAACAQVA 70 (138) Q Consensus 6 yAt~~~l~~~~g~~~p~---~---~~rLl~rAS~~VD~~t~~avYdv~~~------GlPtdp~---v~~alr~AtcAQV~ 70 (138) =.|.+++|.++.-+.++ + +..||.-|.+.|.+.|++.+|+...+ +-+.+++ +=..||.|+.--|. T Consensus 1 ivtLee~K~HlRid~dd~deDD~li~~~i~AA~~~v~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLllvg 80 (115) T protein:vir:81 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLADQAAGVDPAGQLLITRTVEQAILLTLG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCccchHHHHHHHHHHHHHHHHHhCCccccccccccccccccCCCCcccccCHHHHHHHHHHHH Confidence 68999999999854432 1 56799999999999999999975332 1122221 44557888999999 Q ss_pred HHHHhCCcccccccccceeeeCceeeccCccccccchhhhhhhHHHHHH-HHhhcCCC Q lcl|NC_018848. 71 YRQESGDTGTGAAGRWSSVSIGPVSMSGPRQSAGGTGAGSVDLGEQASR-ALARAGLT 127 (138) Q Consensus 71 ~~~~~G~~~t~~~g~~~s~sIG~~S~s~~~~~as~~~~~a~~ls~~a~~-~L~~aGLl 127 (138) .|.++=...+.+ ..+.+..|= ..| +. |=.-.|+- T Consensus 81 ~~Y~NRE~v~~~--~~~elP~~~-----------------~~L----L~pyR~~~g~~ 115 (115) T protein:vir:81 81 EWYSSREQVWTK--GAGLVTSSA-----------------QNL----LHPYRKFAGVR 115 (115) T ss_pred HHHhccchhcch--hhhhcCHHH-----------------HHH----HHHHHhhcCCC Confidence 999986642111 111111110 001 00 11122332 No 70 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=26.98 E-value=1.9 Score=19.07 Aligned_cols=125 Identities=18% Similarity=0.187 Sum_probs=66.4 Q ss_pred eecccHHHHHH-h---cC-CCCCc-chHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_018848. 4 RVYATPAQLAQ-W---TG-EPAPA-DAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQE--- 74 (138) Q Consensus 4 rvyAt~~~l~~-~---~g-~~~p~-~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~--- 74 (138) =|=-++++++. | .. +.+|+ .+...+..|-..|..-+. .+.+ .+.+..+.+-....|+..+... T Consensus 1 ~v~fd~~~FR~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~--~s~~------~~g~~~~~~l~Ll~AH~l~L~~~~~ 72 (155) T protein:vir:96 1 MVIFDEQKFRTLFPEFADPASYPAVRLQLYFDIACEFISDRDS--PYRI------LNGKALEACLYLLTAHLLSLSTMQV 72 (155) T ss_pred CcccCHHHHHHhCccccCcccCCHHHHHHHHHHHHHhhcCCCc--cccc------cChHHHHHHHHHHHHHHHHHHHHhh Confidence 12224555554 2 22 23453 455666666655532111 1211 1466777777777787776553 Q ss_pred hCCccc------ccccccceeeeCceeeccCccccccchhh---hhhhHHHHHHHHhhcCCCCCccCCC----------- Q lcl|NC_018848. 75 SGDTGT------GAAGRWSSVSIGPVSMSGPRQSAGGTGAG---SVDLGEQASRALARAGLTPGEIYPP----------- 134 (138) Q Consensus 75 ~G~~~t------~~~g~~~s~sIG~~S~s~~~~~as~~~~~---a~~ls~~a~~~L~~aGLl~g~~~~~----------- 134 (138) +|-... +..|-.+|.|+|++|+|-+.+........ ....--+=|.+++..|.=+ .|.+ T Consensus 73 ~gaa~~g~~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~~~~~~Gg--~~vgG~per~~~r~v 150 (155) T protein:vir:96 73 QGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKAVGG--FYIGGLPERRGFRKV 150 (155) T ss_pred hhccccccccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHHHHHHhcccc--cccCCCCcccccccc Confidence 343221 23455899999999999766543322221 1123334567777777644 4544 Q ss_pred -CCCC Q lcl|NC_018848. 135 -GVNW 138 (138) Q Consensus 135 -~~~~ 138 (138) ||-| T Consensus 151 gg~f~ 155 (155) T protein:vir:96 151 GGTFW 155 (155) T ss_pred CcccC Confidence 3556 No 71 >protein:vir:100211 Length: 114 # NCBI annotation: Hypothetical protein # Family: family:all:6491 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025032;genbank:gi:48697265;genbank:GeneID:2948309 Probab=26.70 E-value=1.9 Score=19.03 Aligned_cols=96 Identities=16% Similarity=0.114 Sum_probs=54.5 Q ss_pred CCceec----ccHHHHHHhcCCCCCcc---hHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018848. 1 MDRRVY----ATPAQLAQWTGEPAPAD---AERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALADAACAQVAYRQ 73 (138) Q Consensus 1 ~~~rvy----At~~~l~~~~g~~~p~~---~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~ 73 (138) |-.-++ .|.+++++|+.-+++++ +..||.-|..-|..+++ ..+++..+...| .|+.|+...|.+|. T Consensus 1 ~~~~~~~~~~vtLeevK~~LRID~ddDD~lI~~lI~aA~~yI~~aig---~~~~~~~~~~~~----~~~~Avl~Lv~~~Y 73 (114) T protein:vir:10 1 MADETADTVGVTADDMQSYLNLDSDGDASILEGLISTAESAVMNAID---DTIAVEVYRTYP----LFNQAVRVLVDFMY 73 (114) T ss_pred CCCcccccccccHHHHHHHhCCCCccchHHHHHHHHHHHHHHHHhhC---CCCCcccccCch----hHHHHHHHHHHHHH Confidence 322222 57899999999766643 56799999999999998 444444333333 47889999999999 Q ss_pred HhCCcccccccccceeeeCceee--------ccCc---cccccc Q lcl|NC_018848. 74 ESGDTGTGAAGRWSSVSIGPVSM--------SGPR---QSAGGT 106 (138) Q Consensus 74 ~~G~~~t~~~g~~~s~sIG~~S~--------s~~~---~~as~~ 106 (138) ++=....... ..+..|=.|+ ..+- ....++ T Consensus 74 eNR~~~~~~~---~~vp~~v~slI~qLR~~~~~d~~~~~~~~d~ 114 (114) T protein:vir:10 74 YSRGTLSDQS---KAYPPSYAYMINSIRWKIQRDQAAKAGGNDG 114 (114) T ss_pred hhhhhhcccc---ccccHHHHHHHHHHHHHhhhhhhhhccCCCC Confidence 9853211111 1121111111 1100 011111 No 72 >protein:vir:79990 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430004;genbank:gi:156604059;genbank:GeneID:5525450 Probab=24.52 E-value=1.2 Score=20.15 Aligned_cols=99 Identities=11% Similarity=-0.018 Sum_probs=50.5 Q ss_pred ecccHHH---HHHhcCCCCCc--c-hHHHHHHHHHHHHHHhcchheeec--cCCCCCCHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 5 VYATPAQ---LAQWTGEPAPA--D-AERLLTRASEDVDDALLTAVYDVD--EAGMPTDPAVAQALADAACAQVAYRQESG 76 (138) Q Consensus 5 vyAt~~~---l~~~~g~~~p~--~-~~rLl~rAS~~VD~~t~~avYdv~--~~GlPtdp~v~~alr~AtcAQV~~~~~~G 76 (138) +=.|.++ +++|+.-+.++ + +..||.-|..-|..+++. +.+ ++.+... ..|+-|+..+|..|.++= T Consensus 1 m~vt~~dL~~iK~~lRID~d~DD~lI~~~i~aA~~yI~~aig~---~~~~~~~~~~~~----~~~~~Avl~Lv~~~YeNR 73 (110) T protein:vir:79 1 MQLTAEELKLLKKHCKIDHNSEDDLLEIYYSWAFHEIASAVTD---EPSKYIDWFKSH----PLFARAIYPLASYYFENR 73 (110) T ss_pred CeecHHHHHHHHHHhCCCCCchhHHHHHHHHHHHHHHhhhccC---cccccccCCCCc----hHHHHHHHHHHHHHHHhh Confidence 4567666 78899966664 3 446999999999988873 322 1122222 247889999999999996 Q ss_pred CcccccccccceeeeCceeec---cCccccccchhhhhhh Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMS---GPRQSAGGTGAGSVDL 113 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s---~~~~~as~~~~~a~~l 113 (138) ...+..+... +..|=-|+= .+. .+....-+-.++ T Consensus 74 ~a~~~~~~~~--vp~~v~slI~qlR~~-y~~~~~~e~~~~ 110 (110) T protein:vir:79 74 IAYLDRDLSL--APHMVLSTVHKLRGS-FERFLESENDEI 110 (110) T ss_pred hhcccccccc--ccHHHHHHHHHHHhh-hhhhcccccccC Confidence 4322221111 111111110 000 000000000011 No 73 >protein:vir:98340 Length: 110 # NCBI annotation: DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918932;genbank:gi:119443694;genbank:GeneID:4594502 Probab=24.52 E-value=1.2 Score=20.15 Aligned_cols=99 Identities=11% Similarity=-0.018 Sum_probs=50.5 Q ss_pred ecccHHH---HHHhcCCCCCc--c-hHHHHHHHHHHHHHHhcchheeec--cCCCCCCHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018848. 5 VYATPAQ---LAQWTGEPAPA--D-AERLLTRASEDVDDALLTAVYDVD--EAGMPTDPAVAQALADAACAQVAYRQESG 76 (138) Q Consensus 5 vyAt~~~---l~~~~g~~~p~--~-~~rLl~rAS~~VD~~t~~avYdv~--~~GlPtdp~v~~alr~AtcAQV~~~~~~G 76 (138) +=.|.++ +++|+.-+.++ + +..||.-|..-|..+++. +.+ ++.+... ..|+-|+..+|..|.++= T Consensus 1 m~vt~~dL~~iK~~lRID~d~DD~lI~~~i~aA~~yI~~aig~---~~~~~~~~~~~~----~~~~~Avl~Lv~~~YeNR 73 (110) T protein:vir:98 1 MQLTAEELKLLKKHCKIDHNSEDDLLEIYYSWAFHEIASAVTD---EPSKYIDWFKSH----PLFARAIYPLASYYFENR 73 (110) T ss_pred CeecHHHHHHHHHHhCCCCCchhHHHHHHHHHHHHHHhhhccC---cccccccCCCCc----hHHHHHHHHHHHHHHHhh Confidence 4567666 78899966664 3 446999999999988873 322 1122222 247889999999999996 Q ss_pred CcccccccccceeeeCceeec---cCccccccchhhhhhh Q lcl|NC_018848. 77 DTGTGAAGRWSSVSIGPVSMS---GPRQSAGGTGAGSVDL 113 (138) Q Consensus 77 ~~~t~~~g~~~s~sIG~~S~s---~~~~~as~~~~~a~~l 113 (138) ...+..+... +..|=-|+= .+. .+....-+-.++ T Consensus 74 ~a~~~~~~~~--vp~~v~slI~qlR~~-y~~~~~~e~~~~ 110 (110) T protein:vir:98 74 IAYLDRDLSL--APHMVLSTVHKLRGS-FERFLESENDEI 110 (110) T ss_pred hhcccccccc--ccHHHHHHHHHHHhh-hhhhcccccccC Confidence 4322221111 111111110 000 000000000011 No 74 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=20.96 E-value=2.7 Score=18.23 Aligned_cols=119 Identities=13% Similarity=0.110 Sum_probs=55.0 Q ss_pred ceecccHHHHHHhcC-----------------------------------------CCCCcchHHHHHHHHHHHHHHhcc Q lcl|NC_018848. 3 RRVYATPAQLAQWTG-----------------------------------------EPAPADAERLLTRASEDVDDALLT 41 (138) Q Consensus 3 ~rvyAt~~~l~~~~g-----------------------------------------~~~p~~~~rLl~rAS~~VD~~t~~ 41 (138) --.|||.+||..-.| +.-+.-+.+=|..||..||-.+.. T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~ 80 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQR 80 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhc Confidence 245777777766543 211223567799999999999986 Q ss_pred hheeeccCCCCCCHHHHHHHHHHHHHHHHHHHHhC--Cc---cccc----c---cccceeeeCceeeccCccccccchhh Q lcl|NC_018848. 42 AVYDVDEAGMPTDPAVAQALADAACAQVAYRQESG--DT---GTGA----A---GRWSSVSIGPVSMSGPRQSAGGTGAG 109 (138) Q Consensus 42 avYdv~~~GlPtdp~v~~alr~AtcAQV~~~~~~G--~~---~t~~----~---g~~~s~sIG~~S~s~~~~~as~~~~~ 109 (138) ..|.+. |+ .+=..|+..+|.=..||+-.- .. +... . .-...+.=|++|+............. T Consensus 81 R~Y~lP---L~---~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~ 154 (172) T protein:vir:99 81 RGYSLP---LA---KRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGKFSLGPDDPLTPPGGGV 154 (172) T ss_pred ccccCC---Cc---ccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCc Confidence 547765 23 233568888888888887641 11 0000 0 11344445666654211110000000 Q ss_pred hhhhHHHHHHHHhhcCCCCCccCCCCCCC Q lcl|NC_018848. 110 SVDLGEQASRALARAGLTPGEIYPPGVNW 138 (138) Q Consensus 110 a~~ls~~a~~~L~~aGLl~g~~~~~~~~~ 138 (138) .++... -..+-..- |-| | T Consensus 155 -~~v~~~-~r~F~rd~-L~g--------f 172 (172) T protein:vir:99 155 -PQVLAP-ARTFSHDT-LKD--------Y 172 (172) T ss_pred -eeeecC-CCccChhh-ccC--------C Confidence 000000 00000000 000 0 No 75 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=20.68 E-value=2.7 Score=18.19 Aligned_cols=116 Identities=15% Similarity=0.077 Sum_probs=55.6 Q ss_pred eecccHHHHHHhcCC--------------------CCC-cchHHHHHHHHHHHHHHhcchheeeccCCCCCCHHHHHHHH Q lcl|NC_018848. 4 RVYATPAQLAQWTGE--------------------PAP-ADAERLLTRASEDVDDALLTAVYDVDEAGMPTDPAVAQALA 62 (138) Q Consensus 4 rvyAt~~~l~~~~g~--------------------~~p-~~~~rLl~rAS~~VD~~t~~avYdv~~~GlPtdp~v~~alr 62 (138) =-|||.+||.+..|+ ..+ .-+.+=|.+||..||-.++. +|.+. |++ +=..|+ T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~-RY~lP---l~~---vP~~L~ 73 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRG-RYNLP---LSP---VPTVIK 73 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhh-hccCC---ccc---ccHHHH Confidence 479999999987662 111 12457899999999998874 58865 232 334588 Q ss_pred HHHHHHHHHHHHhCC----ccc-cc-------ccccceeeeCceeeccCccccccchhhhh-hhHHHHHHHHhhcCCCCC Q lcl|NC_018848. 63 DAACAQVAYRQESGD----TGT-GA-------AGRWSSVSIGPVSMSGPRQSAGGTGAGSV-DLGEQASRALARAGLTPG 129 (138) Q Consensus 63 ~AtcAQV~~~~~~G~----~~t-~~-------~g~~~s~sIG~~S~s~~~~~as~~~~~a~-~ls~~a~~~L~~aGLl~g 129 (138) +.+|.=..||+-... +.+ .. -.-...+.=|++|+..+............ .-.+..+ -...|=| T Consensus 74 ~~a~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~~r~f---~r~~l~g- 149 (150) T protein:vir:10 74 DVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKVRARRRQF---DADLLER- 149 (150) T ss_pred HHHHHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCCCCCCCCceeeeecCCCcc---ChhhccC- Confidence 999988888876421 111 00 00133334455555322111000000000 0000000 0000001 Q ss_pred ccCCCCCCC Q lcl|NC_018848. 130 EIYPPGVNW 138 (138) Q Consensus 130 ~~~~~~~~~ 138 (138) | T Consensus 150 --------f 150 (150) T protein:vir:10 150 --------F 150 (150) T ss_pred --------C Confidence 1 Done!