Query lcl|NC_020866.1_cdsid_YP_007676424.1 [gene=RHVG_00045] [protein=hypothetical protein] [protein_id=YP_007676424.1] [location=26883..27317] Match_columns 144 No_of_seqs 66 out of 73 Neff 6.2 Searched_HMMs 1612 Date Thu Nov 7 17:26:03 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_45 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_45_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:488 Length: 187 # 100.0 1.1E-50 6.9E-54 294.4 15.8 140 1-144 2-143 (187) 2 protein:vir:4461 Length: 186 # 100.0 7E-50 4.3E-53 290.1 16.1 140 1-144 2-143 (186) 3 protein:vir:4515 Length: 186 # 100.0 1E-49 6.4E-53 289.2 15.3 140 1-144 2-143 (186) 4 protein:vir:99874 Length: 154 100.0 2.5E-48 1.6E-51 281.6 15.9 141 1-144 10-154 (154) 5 protein:vir:99226 Length: 157 100.0 8.1E-34 5E-37 202.0 14.3 139 1-144 10-153 (157) 6 protein:vir:79247 Length: 157 100.0 2.9E-33 1.8E-36 198.9 13.9 139 1-144 10-153 (157) 7 protein:vir:103883 Length: 159 99.9 6.8E-30 4.2E-33 180.5 14.0 139 1-144 12-155 (159) 8 protein:vir:107857 Length: 154 98.7 1.6E-10 9.8E-14 74.3 9.6 130 1-144 7-137 (154) 9 protein:vir:79065 Length: 154 98.7 1.7E-10 1.1E-13 74.1 9.8 130 1-144 7-137 (154) 10 protein:vir:1994 Length: 182 # 97.2 8.7E-06 5.4E-09 48.3 9.6 129 1-144 1-152 (182) 11 protein:vir:10327 Length: 182 95.5 0.00069 4.3E-07 37.9 10.5 131 1-144 9-144 (182) 12 protein:vir:96764 Length: 177 94.3 0.0035 2.2E-06 34.0 11.2 132 1-144 10-145 (177) 13 protein:vir:93736 Length: 145 77.2 0.13 7.9E-05 25.5 9.5 126 1-144 9-139 (145) 14 protein:vir:97421 Length: 145 77.2 0.13 7.9E-05 25.5 9.5 126 1-144 9-139 (145) 15 protein:vir:94488 Length: 145 77.2 0.13 7.9E-05 25.5 9.5 126 1-144 9-139 (145) 16 protein:vir:95111 Length: 145 74.1 0.16 0.0001 24.9 9.4 126 1-144 9-139 (145) 17 protein:vir:1892 Length: 121 # 72.1 0.15 9.3E-05 25.1 7.4 121 1-142 1-121 (121) 18 protein:vir:105337 Length: 145 68.4 0.24 0.00015 24.0 9.7 126 1-144 9-139 (145) 19 protein:vir:107096 Length: 145 68.4 0.24 0.00015 24.0 9.6 126 1-144 9-139 (145) 20 protein:vir:95961 Length: 145 65.7 0.28 0.00017 23.6 9.6 126 1-144 9-139 (145) 21 protein:vir:94794 Length: 145 65.7 0.28 0.00017 23.6 9.6 126 1-144 9-139 (145) 22 protein:vir:1244 Length: 145 # 63.9 0.31 0.00019 23.4 9.1 126 1-144 9-139 (145) 23 protein:vir:4348 Length: 121 # 55.4 0.48 0.0003 22.3 7.6 121 1-142 1-121 (121) 24 protein:vir:97211 Length: 150 50.9 0.6 0.00037 21.8 8.7 134 1-144 1-150 (150) 25 protein:vir:1643 Length: 111 # 49.4 0.64 0.0004 21.6 7.8 107 1-139 1-111 (111) 26 protein:vir:96894 Length: 140 44.4 0.81 0.0005 21.1 10.0 126 1-144 9-139 (140) 27 protein:vir:97325 Length: 145 42.2 0.89 0.00055 20.9 9.4 126 1-144 9-139 (145) 28 protein:vir:9579 Length: 111 # 42.2 0.89 0.00055 20.9 6.7 105 1-139 1-111 (111) 29 protein:vir:96260 Length: 141 39.1 1 0.00064 20.5 9.2 126 1-144 9-139 (141) 30 protein:vir:105892 Length: 141 39.1 1 0.00064 20.5 9.2 126 1-144 9-139 (141) 31 protein:vir:94096 Length: 141 39.1 1 0.00064 20.5 9.2 126 1-144 9-139 (141) 32 protein:vir:96125 Length: 140 35.5 1.2 0.00076 20.1 9.7 126 1-144 9-139 (140) 33 protein:vir:94768 Length: 111 35.3 1.2 0.00077 20.1 7.8 105 1-139 1-111 (111) 34 protein:vir:9764 Length: 111 # 31.9 1.5 0.00091 19.7 7.1 107 1-139 1-111 (111) 35 protein:vir:5979 Length: 134 # 22.7 2.4 0.0015 18.5 10.4 120 1-139 10-134 (134) 36 protein:vir:93602 Length: 114 20.4 2.8 0.0017 18.1 8.7 114 1-140 1-114 (114) No 1 >protein:vir:488 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543095;swissprot:trembl:q8w624;genbank:gi:18249907;uniprot:Q8W624;genbank:GeneID:929697 Probab=100.00 E-value=1.1e-50 Score=294.45 Aligned_cols=140 Identities=25% Similarity=0.456 Sum_probs=131.2 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC-CC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS-FD 79 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~-~d 79 (144) -|++||+|||++||+|++||++|+||+++++.+++| +|||||||.+|.++ ++.+||.|+|+++++|+|||+++| +| T Consensus 2 kl~~Ii~rLra~vP~l~grV~gaad~aal~~~~~lp--~PaAyVlp~~d~~~-~~~sq~~~~Q~i~e~f~Vvl~vrn~~D 78 (187) T protein:vir:48 2 KLTTIIAALRERCPRFEDRVGGAAQFKAIPDAGKLR--LPAAYVVPSDDAPG-EQKSQTDYWQDLTEGFSVIVVLSNERD 78 (187) T ss_pred chhHHHHHHHHhcchhhhhhhhhhhhhhhhhhcCCC--CceEEEEeccccCC-CCCCCcceeeeeeeEEEEEEEEeccCC Confidence 799999999999999999999999999999999986 59999999999986 577899999999999999999976 59 Q ss_pred CCchh-hhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 80 ASGKQ-ALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 80 ~~G~~-a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) ++|+. ++|+||++|++||+||+||+|++. .+||+|+||++++|++|+++|+++|++++||+++- T Consensus 79 ~~G~~~a~D~l~~lr~~v~~AL~GW~P~~~-~~pi~~~gG~lvd~~~g~l~y~~~F~~~~ql~~~~ 143 (187) T protein:vir:48 79 EKGQWAAYDAVHDVRRELWKALLGWMPDPQ-GGEIVYAGGTLLDLNRYELYYQFDFTAKYEITEED 143 (187) T ss_pred CCCcchhhHHHHHHHHHHHHHHhCcCcCCC-CceEEEcCceEeeecCcEEEEEEEEEeecccCCCC Confidence 99985 589999999999999999999855 57999999999999999999999999999999876 No 2 >protein:vir:4461 Length: 186 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700384;genbank:gi:23505456;genbank:GeneID:955663 Probab=100.00 E-value=7e-50 Score=290.08 Aligned_cols=140 Identities=24% Similarity=0.460 Sum_probs=130.9 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC-CC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS-FD 79 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~-~d 79 (144) -|++||+|||++||+|++||+||+||+++++.+++| +|||||||.+|.++ ++.+||.|+|+++++|+|||+++| +| T Consensus 2 kl~~Vi~RLra~vP~l~~rV~gaad~aai~~~~~lp--~PaAyVip~~d~~g-~~~s~g~~~Q~i~~~f~Vvl~vrn~~d 78 (186) T protein:vir:44 2 KLTPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLR--LPAAYVVPAEDVTG-EQKSQTDYWQDLTEGFSVIVVLSNERD 78 (186) T ss_pred ChhHHHHHHHHhcchhhhhhhhhhhhhhhhhhcCCC--CceEEEEeccccCC-CCCcccceeEeeeeeEEEEEEEeccCC Confidence 799999999999999999999999999999999986 59999999999886 578899999999999999999965 69 Q ss_pred CCchh-hhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 80 ASGKQ-ALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 80 ~~G~~-a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) ++|+. ++|+||++|++||+||+||+|++ ..+||+|.+|++++|++|+++|+++|+++++|+++= T Consensus 79 ~~G~~aa~D~l~~lr~~v~~AL~GW~P~~-~~~pi~~~gG~lvd~~~g~l~y~~~F~~~~~l~~~~ 143 (186) T protein:vir:44 79 EKGQWASYDAVHDVRQEIWKALLGWEPDS-QVHEIQYAGGMLLDLNRHELYYQFDFTVKYEITETD 143 (186) T ss_pred CCCCccchHHHHHHHHHHHHHHcCcCcCC-CCceEEEcCceEEeecCcEEEEEEEEEEeeccCCCC Confidence 99985 58999999999999999999985 468999999999999999999999999999999876 No 3 >protein:vir:4515 Length: 186 # NCBI annotation: unknown # Family: family:all:964 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599041;genbank:gi:19548999;genbank:GeneID:935225 Probab=100.00 E-value=1e-49 Score=289.15 Aligned_cols=140 Identities=25% Similarity=0.439 Sum_probs=131.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC-CC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS-FD 79 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~-~d 79 (144) -|++||+|||++||+|++||++|+||+++++.+++| +|||||||.+|.++ ++.+||.|+|+++++|+|||+++| +| T Consensus 2 kl~~Ii~RLra~vP~l~grV~gaad~a~l~~~~~lp--~PaAyVip~~d~~~-~~~sq~~~~Q~i~e~f~Vvl~vrn~~d 78 (186) T protein:vir:45 2 KLTPVIAALRARCPYFENRVAGAAQFKNLPEVGKLR--LPAAYVVPGDDSPG-ENKSQTDYWQELKEGFSVVVILSNGRD 78 (186) T ss_pred ChHHHHHHHHHhcchhhchhhhhhhhhhhHhhcCCC--CceEEEEecccccC-CCccccceeeeeeeEEEEEEEEeccCC Confidence 799999999999999999999999999999999986 59999999999985 577899999999999999999976 69 Q ss_pred CCchh-hhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 80 ASGKQ-ALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 80 ~~G~~-a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) ++|+. ++|++|++|++||+||+||+|+++ .+||+|.||++++|++|+++|+++|++++||+++= T Consensus 79 ~~G~~aa~D~l~~lr~~v~~AL~GW~P~~~-~~pi~~~gG~lvd~~~g~l~y~~~F~~~~~l~~~~ 143 (186) T protein:vir:45 79 ERGQFASYDVVDDVRQMLFKALLGWNPEAC-GNPITYDGGTLLDLNRHELIYQFDFSVISELTEDD 143 (186) T ss_pred CCCcccchhHHHHHHHHHHHHHhCcccCCC-CceEEEcCceEEeecCcEEEEEEEEEEeeccCCCc Confidence 99986 579999999999999999999855 57999999999999999999999999999999876 No 4 >protein:vir:99874 Length: 154 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164078;genbank:gi:56692610;genbank:GeneID:3192602 Probab=100.00 E-value=2.5e-48 Score=281.55 Aligned_cols=141 Identities=23% Similarity=0.345 Sum_probs=130.3 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcC--cccccceeEeeeeEEEEEEEecC- Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGV--YSTTGAFTQEFEEAASIVLAIRS- 77 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~--~~~~g~~~Q~v~~~f~Vvv~~~~- 77 (144) =|+|||+|||++||+|. ||++|+||+++++.+++| +|||||||.+|.++++ +.++|.++|+++++|+|||++++ T Consensus 10 dl~~Vi~RLra~~p~l~-~V~gaadlAal~~~~~~p--~PaAyVlp~~d~~~~~~~~~~~g~~~Q~i~~~f~Vvl~v~~~ 86 (154) T protein:vir:99 10 DHNLVIERLRDQVKVLK-HVGGAAELGTITQLRDFR--TPAAYVLLAQETLSPKPAGHAGGATRQMANVHFAITVAVRNY 86 (154) T ss_pred ccHHHHHHHHHhCcchh-hhhhhhhhhhhhhhcCCC--CceEEEEecccccCCCCCCccccceeeeeeeEEEEEEEeecc Confidence 57999999999999998 799999999999999986 6999999999876433 45678999999999999999987 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHcCCCCCC-ccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 78 FDASGKQALDPMRELIMEVFRSLGGWAPSE-DAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 78 ~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~-~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) +|.+|+++.|++|++|++||+||+||+|++ ++..||+|.+|++++|++|++||+++|+++++|+||- T Consensus 87 ~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~G~~pi~~~gG~l~d~~~g~l~y~~~F~~~~~lgr~~ 154 (154) T protein:vir:99 87 RDNKGVTAADDLRPVLGDVRKALIGWTPPGLAGARDCQLVQGQVVDYDASVLIWTDLYQTQHAIGRTS 154 (154) T ss_pred CcccchhhHHHHHHHHHHHHHHHhCCCCCcccCCceeeecCcceeeccCcEEEEeeeeeeeeecCCCC Confidence 689999999999999999999999999974 5568999999999999999999999999999999999 No 5 >protein:vir:99226 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950461;genbank:gi:119953662;genbank:GeneID:4643086 Probab=100.00 E-value=8.1e-34 Score=202.00 Aligned_cols=139 Identities=21% Similarity=0.272 Sum_probs=117.9 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCc--CcccccceeEeeeeEEEEEEEecCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRG--VYSTTGAFTQEFEEAASIVLAIRSF 78 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~--~~~~~g~~~Q~v~~~f~Vvv~~~~~ 78 (144) .-++||+|||++||+|+ +|.+++||++|.+.++ .+|||||+|.+|++.. .+.+++..+|+++++|+|||+++++ T Consensus 10 ~~~~IierLka~vp~l~-~V~~aadla~i~~~~q---~tPaayVi~~gd~~~~~~~~~~~~~~~Q~i~q~~~Vvlavr~~ 85 (157) T protein:vir:99 10 LEPLLIERIRSEVPGLA-IVSGVPDLAALSEQDQ---PAPSVYVVYLGDEIGTGADHQGGRRAIQAIGQQWAVVLVVHYA 85 (157) T ss_pred hhHHHHHHHHhhhhHHH-hhhcccchHHHhhccC---CCcEEEEEecccccCCCcccccccceeeeeeeeEEEEEEEecc Confidence 45779999999999998 7999999999987665 4699999999998753 3445667789999999999999876 Q ss_pred C-CC-chhhhHHHHHHHHHHHHHHcCCCCCCccCCceEec-CceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 79 D-AS-GKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLL-NGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 79 d-~~-G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~-~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) + .+ |+.+.|++++++.+||+||+||+|+++ ..|+++. .+....|++|..||++.|++++=.-+-. T Consensus 86 ~~~~~g~~a~d~ag~ll~~v~~AL~GW~P~~~-~~pl~~~~~~~~~~y~~gf~yypl~F~~~~~~~~~~ 153 (157) T protein:vir:99 86 DSSNSGEGARREAGPLLGRLVKALTGWAPAID-VAPLARSARQSPVTYASGYFYFPLVFTARFVYPRVK 153 (157) T ss_pred ccccccchhHHHHHHHHHHHHHHhcCCcCccc-CCceeeeecCCcccccCceEEEEEEEEEeeeccccc Confidence 4 44 556789999999999999999999865 5799875 5667899999999999999977665555 No 6 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=99.96 E-value=2.9e-33 Score=198.94 Aligned_cols=139 Identities=21% Similarity=0.274 Sum_probs=117.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcC--cccccceeEeeeeEEEEEEEecCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGV--YSTTGAFTQEFEEAASIVLAIRSF 78 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~--~~~~g~~~Q~v~~~f~Vvv~~~~~ 78 (144) .-++||+|||++||+|+ +|.+++||++|.+.++ .+|||||+|.+|++... ..+++.++|.++++|+|||+++|+ T Consensus 10 ~~~~IierLka~v~~l~-~V~~aadla~i~e~~q---~tPaayVv~~gd~~~~~~~~~~~~~~~Q~vtq~f~Vvlavrn~ 85 (157) T protein:vir:79 10 LEPLLIERIRSEVPGLA-IVSGVPDLAALSEQDQ---PAPSVYVVYLGDEIGTGADYQGGRRAIQAIGQQWAVVLVVHYA 85 (157) T ss_pred hhHHHHHHHHhhhhhhh-hhccccchhhhhhhcC---CCcEEEEEecccccCCCcccccCcceeeeeeeeEEEEEEEecc Confidence 34679999999999998 6999999999988765 36999999999987543 344566889999999999999876 Q ss_pred C-CCch-hhhHHHHHHHHHHHHHHcCCCCCCccCCceEec-CceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 79 D-ASGK-QALDPMRELIMEVFRSLGGWAPSEDAPDVLRLL-NGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 79 d-~~G~-~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~-~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) + .+++ .+.|++++++.+||+||+||+|+++ ..|+++. .+....|++|..||++.|++++=.-+-. T Consensus 86 ~~~~~~~a~~d~ag~ll~~v~~AL~GW~P~~~-~~pl~~~~~~~~~~y~~gf~yypl~F~~~~~~~~~~ 153 (157) T protein:vir:79 86 DSSNSGEGARREAGPLLGRLVKALTGWAPAID-VAPLARSARQSPVTYASGYFYFPLVFTARFVYPRVK 153 (157) T ss_pred ccccccchhHHHHHHHHHHHHHHhcCcccccc-CCceeeeecCCcccccCCeEEEEEEEEEeeeccccc Confidence 4 4555 4678899999999999999999865 5899997 5666899999999999999977665555 No 7 >protein:vir:103883 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938246;genbank:gi:38229151;genbank:GeneID:2648198 Probab=99.94 E-value=6.8e-30 Score=180.49 Aligned_cols=139 Identities=17% Similarity=0.231 Sum_probs=115.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCc--CcccccceeEeeeeEEEEEEEecCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRG--VYSTTGAFTQEFEEAASIVLAIRSF 78 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~--~~~~~g~~~Q~v~~~f~Vvv~~~~~ 78 (144) .-++||+|||++||+|+ +|.+++||++|.+.++ .+|||||++.++.+.. .+.+.++.+|.++++|+|||+++++ T Consensus 12 v~~~IieRLka~v~~lr-~V~~aadla~i~el~q---~tPaayV~~~g~~~~~~~~~~~~~~~~q~v~q~w~Vvlavr~~ 87 (159) T protein:vir:10 12 LETLLVERIRAEVPGLQ-DVSGVPDLATLDEQRQ---GSPCVYVVYLGDEIGTGASHQGGSRAIQTVTQHWAAVLTLYYA 87 (159) T ss_pred hhHHHHHHHHhhhhHHH-hhhcccchHHHHhhhC---CCcEEEEEecccccCCCcccccccceeeeeeeEEEEEEEEecc Confidence 56889999999999997 6999999999987664 5899999999998643 3445667889999999999999875 Q ss_pred -CCCchhhh-HHHHHHHHHHHHHHcCCCCCCccCCceEecCce-eEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 79 -DASGKQAL-DPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGR-LLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 79 -d~~G~~a~-d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~-l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) +++|+.+. |++++++.+|++||+||+||+. ..|++...-. -.+|++|..||.+.|++++=.-+-. T Consensus 88 ~~q~~~~a~~d~aG~ll~~v~~AL~GW~P~~~-~~Pl~r~~~~~~~~y~~gfayyPl~F~~~~~~~~~~ 155 (159) T protein:vir:10 88 DAQGDGQGARREAGPLLGRLLKALTGWVPDQG-VTPLARSPQASPVSYSNGFFYFPLVFTANFVFPRLK 155 (159) T ss_pred cccCccchhhHHHHHHHHHHHHHhcCcccCCc-CCCeeecccCCCccccCCEEEeeeeEEeeeeccccc Confidence 66776664 5699999999999999999865 5788744322 4679999999999999977665555 No 8 >protein:vir:107857 Length: 154 # NCBI annotation: gp37 # Family: family:all:1532 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024710;genbank:gi:48696947;genbank:GeneID:2845945 Probab=98.72 E-value=1.6e-10 Score=74.32 Aligned_cols=130 Identities=18% Similarity=0.288 Sum_probs=96.8 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHh-HHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVD-FGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFD 79 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAad-la~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d 79 (144) ||+.|++|||++.|+|. |+.-.+ -+.. .+....-|+-|.+.|..= +...+++...|.=+..|.|.|++++-+ T Consensus 7 ii~aiv~rL~~~lP~~~--ve~fP~~p~ey----~l~h~~GAvLV~Y~GS~f-~~~~~~~~i~Q~R~~~~~vTVi~r~l~ 79 (154) T protein:vir:10 7 MVDAIVARLRVKLPALV--TEYFPERPDEY----RLNHAIGALLVSYPGSQY-DTTVDTDMVVQPRRVKFAVAIVLRQLN 79 (154) T ss_pred HHHHHHHHHHHhCCcce--EeeCCCChhHc----CCCCCceeEEEEecCccc-CCcccCCceeeeeEEEEEEEEEeeccC Confidence 99999999999999874 444332 2222 223344677888887543 456778999999999999999888643 Q ss_pred CCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 80 ASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 80 ~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) .+. .|+ .+..+||.||.||+|+++ ++++..+-+.+.-++|.--|+.+|.+++..=-.. T Consensus 80 g~~-gal----~~LD~vR~aL~Gf~ppdc--~~~~lv~d~f~ge~~G~W~Y~l~~at~t~~Ve~~ 137 (154) T protein:vir:10 80 GRG-GAI----DVLDHVRTALVGFRPPDC--KKLAAVSDKFLGESAGLWQYVIEFSAGAVIVEDA 137 (154) T ss_pred Ccc-hhh----HHHHHHHHHHhccccCCC--ceeehhhhcccccccceeeeeeeeccchhhhhcc Confidence 322 222 345678899999999853 6899999999999999999999999877543333 No 9 >protein:vir:79065 Length: 154 # NCBI annotation: gp11 # Family: family:all:1532 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111211;genbank:gi:134288825;genbank:GeneID:4960739 Probab=98.72 E-value=1.7e-10 Score=74.15 Aligned_cols=130 Identities=18% Similarity=0.286 Sum_probs=97.0 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHh-HHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVD-FGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFD 79 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAad-la~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d 79 (144) ||+.|++|||++.|+|. |+.-.+ -+.. .+....-|+-|.+.|..= +...+++...|.=+..|.|.|++++-+ T Consensus 7 ii~~iv~rL~~~lP~~~--ve~fP~~p~ey----~l~h~~GAvLV~Y~GS~f-~~~~~~~~i~Q~R~~~~~vTVi~r~l~ 79 (154) T protein:vir:79 7 MVDSVVARLRVKLPALV--TEYFPERPDEY----RLNHAIGALLVSYPGSQY-DTTVDTDMVVQPRRVKFAVAIVLRQLN 79 (154) T ss_pred HHHHHHHHHHHhCCcce--EeeCCCChhHc----CCCCCceeEEEEecCccc-CCcccCCceeeeeEEEEEEEEEeeccC Confidence 99999999999999874 444332 2222 223344677888887543 456778999999999999999888643 Q ss_pred CCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 80 ASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 80 ~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) .+. .|+ .+..+||.||.||+|+++ ++++..+-+.+.-++|.--|+.+|.+++..=-.. T Consensus 80 g~~-gal----~~LD~vR~aL~Gf~ppdc--~~~~lv~d~f~ge~~G~W~Y~l~~at~t~~Ve~~ 137 (154) T protein:vir:79 80 GRG-GAI----DVLDHVRTALVGFRPPDC--KKLAAVSDKFLGESAGLWQYVIEFSAGAVIVEDA 137 (154) T ss_pred Ccc-hhh----HHHHHHHHHHhccccCCC--ceeehhhhcccccccceeeeeeeeccchhhhccC Confidence 322 222 345678899999999853 6899999999999999999999999877654433 No 10 >protein:vir:1994 Length: 182 # NCBI annotation: Hypothetical protein # Family: family:all:1387 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050641;genbank:gi:9633528;genbank:GeneID:2636286 Probab=97.19 E-value=8.7e-06 Score=48.33 Aligned_cols=129 Identities=16% Similarity=0.258 Sum_probs=76.5 Q ss_pred CchH----HHHHHHHhC-cchhhHHHHHH-hH--HHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEE Q lcl|NC_020866. 1 MIDE----IIQKLSDEI-PRLKERVQGAV-DF--GKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIV 72 (144) Q Consensus 1 ~l~~----vi~rLra~~-p~~~~rV~gAa-dl--a~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vv 72 (144) ||+. |++|||+.+ |.++. |+.=. +| +.+.. +..+.||+||-..|.... . + +=.++-+|.|. T Consensus 1 mI~~iEdAi~~rl~~~~g~~v~~-V~sy~Gefd~e~l~~---~~~~~PAv~Va~~G~~~~-~----~--r~~~~~r~~v~ 69 (182) T protein:vir:19 1 MLEETEAALLARVRELFGATLRQ-VEPLTGTWTNEDVHR---LFLAPPSVFLAWMGCGEG-R----T--RREVESRWAFF 69 (182) T ss_pred ChHHHHHHHHHHHHHHhhhhhhh-hccCCCCCChhhhhH---hhhcCceeEEEeccccCc-C----C--ceeeeeEEEEE Confidence 7765 667777765 55543 55433 22 22321 123679999999874331 1 2 22466788887 Q ss_pred EEecCCCCCchhhhH-HHHHHHHHHHHHHcCCCCCCccCCceEecCceeE----eecCcEEEEEEeeeeeeeeec----- Q lcl|NC_020866. 73 LAIRSFDASGKQALD-PMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLL----SMQAGLLVYQLDFALTDQMRI----- 142 (144) Q Consensus 73 v~~~~~d~~G~~a~d-~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~----~~~~g~l~y~~~F~~~~~l~~----- 142 (144) |+.++ .+|..+.+ -+-.+...|++.|.|-... + .++++..+-+-+ .-..|-..|..+|..++.+-- T Consensus 70 V~a~~--~~g~~~~rvG~y~lv~~v~~lL~~q~~g-~-~~~l~p~~vrnL~s~~~~~~gvsvyavef~~~~~lp~~~d~~ 145 (182) T protein:vir:19 70 VVAEL--LNGEPVNRPGIYQIVERLIAGVNGQTFG-P-TTGMRLTQVRNLCDDNRINAGVVLYGVLFSGTTPLPSVVDLD 145 (182) T ss_pred EEecC--CCChhhhhhhHHHHHHHHHHHHhccCCC-C-ccccccceeeeeechhhhhCceEEEEEEeeccccCCCcCCCC Confidence 77653 34544432 2567899999999884433 2 245554444444 234699999999998766541 Q ss_pred -----cC Q lcl|NC_020866. 143 -----AR 144 (144) Q Consensus 143 -----~r 144 (144) .| T Consensus 146 ~l~df~~ 152 (182) T protein:vir:19 146 SLDDYER 152 (182) T ss_pred CCcchhh Confidence 11 No 11 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=95.48 E-value=0.00069 Score=37.92 Aligned_cols=131 Identities=11% Similarity=-0.083 Sum_probs=82.6 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) +-++|++.||++.|.+. +.+.-+. . .+.+ .+||+||=-.+-.+ .....+|.+ .++-+|.+-+++..... T Consensus 9 lh~AI~~~Lk~~~p~l~--~~~~y~~---~-~~~i--~~PAv~vel~~~~~-~~d~~tGq~--~~~~~~~a~~vv~~~~~ 77 (182) T protein:vir:10 9 VHEAIKAKLRETFPKVT--VDDYNPE---P-ELSV--LAPALLLELEEFPM-GADVGDDRY--PAACRFSVHCVLGWEVK 77 (182) T ss_pred HHHHHHHHHHHhcCCce--eeecCcc---c-cCcc--ccceeeeeeecCCc-CCCCCCCcE--EEEEEEEEEEEecccCC Confidence 77889999999999984 4333322 1 2334 57999987765443 233456665 36788888777764433 Q ss_pred Cch-hhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEee----cCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGK-QALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSM----QAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~-~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~----~~g~l~y~~~F~~~~~l~~~r 144 (144) .-+ .+..-.-.+...|.....| -|.+. .+|-++.+++-..+ .+|...|.=+|+=+..||-+- T Consensus 78 ~~~~~~~~lAa~l~~~v~~~~wG-L~~~~-v~~a~~i~a~p~~f~~~~~dgy~vW~VeW~Q~i~LG~s~ 144 (182) T protein:vir:10 78 SLALELWEFSAAVAQLIRKSGVW-VKGGV-LTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESM 144 (182) T ss_pred CchHHHHHHHHHHHHHHhcCccc-CCccc-cCccceeeeccCccChhhcCceEEEEEEEEEEEeeCCcc Confidence 211 3333344555666666554 24322 23455555555544 378999999999999999887 No 12 >protein:vir:96764 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1090 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039825;genbank:gi:126010857;genbank:GeneID:5076274 Probab=94.26 E-value=0.0035 Score=34.03 Aligned_cols=132 Identities=11% Similarity=0.085 Sum_probs=75.9 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) +-++|++.||++.|.+.. |+.=.+ ..+ ....+||+||=-.+..+ +...++|.+. +.-+|.+-+++..... T Consensus 10 lh~AI~~~l~~~~P~l~t-V~~y~~---~~~---~~~~tPAv~iel~~~~~-~~d~g~G~~~--~~~r~~a~vvv~~~~~ 79 (177) T protein:vir:96 10 LYDAIQAELESRLADEVT-VASYAD---FGD---VQVVDAMVLIEFEQTSP-ATRGHDGRYC--HQYDITLHAVVGRQRQ 79 (177) T ss_pred HHHHHHHHHHHhCcccee-eccccc---ccc---ccccCceeEEeeccCCc-ccCCCCCceE--EEEEEEEEEEeCCCCC Confidence 677899999999999864 554332 211 12247999976544444 4456677665 5566776665543322 Q ss_pred Cc-hhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEee---cCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 81 SG-KQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSM---QAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G-~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~---~~g~l~y~~~F~~~~~l~~~r 144 (144) .- -.+.+..-.+...|+....|= |.+.. .+-++.++.-=.+ -+|...|.=+|+=+..||-+- T Consensus 80 ~~~l~a~~lAa~l~~~v~~~~wGL-p~~~v-~~~~~i~a~pd~f~p~ldgy~vW~Vew~Q~i~LG~s~ 145 (177) T protein:vir:96 80 RAELEAINLAAAIERVTDENLWGL-PYQQV-DRPENIRSAPSMFKVGSDGYDAWGVSFRQRIYLGASL 145 (177) T ss_pred ChHHHHHHHHHHHHHHHhcccccC-Ccccc-ccceeeeccccccccccCceeEEEEEEEEEEecCCCc Confidence 11 133333334444555544442 32222 2323333322222 358899999999999999988 No 13 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=77.19 E-value=0.13 Score=25.49 Aligned_cols=126 Identities=11% Similarity=0.101 Sum_probs=68.2 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-..|++||++. |.+..-|++. =|+.+++. +|.=||+-..++....+...+ ....++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-a~l~alvggr-I~D~~P~~------a~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:93 9 LFNKVYNKLKSN-LIIQKQLDGR-VFDCVQKD------AVYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcCc-eecCCcCC------CCCCEEEeCCceeeecCCCcc-cceEEEEEEEEEE-----cC Confidence 778899999874 3333335442 25555432 344488865444433232222 3334444444431 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+.+.|..||-++-.- +.+. -+++...+...-.+|..+ =+++|.+++--...| T Consensus 75 ~g---~~eak~ia~av~~aL~~~l~l-~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~ 139 (145) T protein:vir:93 75 RN---RDEASQIIQFLGFVLNNEIEI-DYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhccccCC-CCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEecccccc Confidence 23 234667788899999886332 3332 344566666654444433 467777777777777 No 14 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=77.19 E-value=0.13 Score=25.49 Aligned_cols=126 Identities=11% Similarity=0.101 Sum_probs=68.2 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-..|++||++. |.+..-|++. =|+.+++. +|.=||+-..++....+...+ ....++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-a~l~alvggr-I~D~~P~~------a~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:97 9 LFNKVYNKLKSN-LIIQKQLDGR-VFDCVQKD------AVYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcCc-eecCCcCC------CCCCEEEeCCceeeecCCCcc-cceEEEEEEEEEE-----cC Confidence 778899999874 3333335442 25555432 344488865444433232222 3334444444431 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+.+.|..||-++-.- +.+. -+++...+...-.+|..+ =+++|.+++--...| T Consensus 75 ~g---~~eak~ia~av~~aL~~~l~l-~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~ 139 (145) T protein:vir:97 75 RN---RDEASQIIQFLGFVLNNEIEI-DYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhccccCC-CCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEecccccc Confidence 23 234667788899999886332 3332 344566666654444433 467777777777777 No 15 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=77.19 E-value=0.13 Score=25.49 Aligned_cols=126 Identities=11% Similarity=0.101 Sum_probs=68.2 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-..|++||++. |.+..-|++. =|+.+++. +|.=||+-..++....+...+ ....++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-a~l~alvggr-I~D~~P~~------a~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:94 9 LFNKVYNKLKSN-LIIQKQLDGR-VFDCVQKD------AVYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcCc-eecCCcCC------CCCCEEEeCCceeeecCCCcc-cceEEEEEEEEEE-----cC Confidence 778899999874 3333335442 25555432 344488865444433232222 3334444444431 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+.+.|..||-++-.- +.+. -+++...+...-.+|..+ =+++|.+++--...| T Consensus 75 ~g---~~eak~ia~av~~aL~~~l~l-~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~ 139 (145) T protein:vir:94 75 RN---RDEASQIIQFLGFVLNNEIEI-DYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhccccCC-CCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEecccccc Confidence 23 234667788899999886332 3332 344566666654444433 467777777777777 No 16 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=74.07 E-value=0.16 Score=24.91 Aligned_cols=126 Identities=11% Similarity=0.107 Sum_probs=67.9 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-..|++||++- |.+..-|++. =|+.+++. +|.=||+-..++....+...+ ....++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-a~l~alvggr-V~D~~P~~------a~~PYV~lG~~~~~~~~~~~~-~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:95 9 LFNKVYNKLKSN-SIIQKQLDGR-VFDCVQKD------AVYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcCc-eecCCcCC------CCCCEEEecCceeeecCCCcc-cceEEEEEEEEEE-----cC Confidence 778899999874 3333335442 25555432 344488865444433232222 3334444444441 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+.+.|..||-++- .-+.+. -+++...+...-.+|..+ =+++|.+++--...| T Consensus 75 ~g---~~eak~ia~av~~aL~~~l-~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~~~ 139 (145) T protein:vir:95 75 RN---RDEASQIIQFLGFVLNNEI-EIDYYSFIKSRIDTQEVITDIDRYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhcccc-CCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecccccc Confidence 23 3446677888899997753 223222 345555566554455433 467777777777777 No 17 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=72.13 E-value=0.15 Score=25.09 Aligned_cols=121 Identities=9% Similarity=0.152 Sum_probs=65.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |.-||-.-|++- |.+.. +.|+.. ..+-.-+..|+.+|.-|++...-++.+.+...| +.-.+++.| =++.+.. T Consensus 1 m~~~i~~~l~~d-~~v~a-llg~~~-~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~G---~~~~~~~~v--QIDvyA~ 72 (121) T protein:vir:18 1 MIAPIFSVCASS-PEVTD-LLGSNP-VRIYPFGIQDDNVVYPYVVWQNITGSPENYIAQ---RPDADFFTL--QVDAYAD 72 (121) T ss_pred CchHHHHHHhcC-hhhhh-hhcCCC-ceeeeccCCCCcCcCCeEEEEEecCcccceecC---CCCcceeEE--EEEeecC Confidence 999988888764 44444 223321 112111223556666688886555444333333 222222222 2222332 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeec Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRI 142 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~ 142 (144) +... ...+.++|+.||-+ + .. +...++. +|+..+-+|+-.|.+++.+.| T Consensus 73 t~~~----A~~l~~avr~Ale~---~-~~---~~~~~~~--~ye~dT~lyR~s~Dv~~~~~r 121 (121) T protein:vir:18 73 TVDE----VIAVATALRDAIEP---H-AH---ITRWGGQ--ERDPETKRYRYSFDVDWIVTR 121 (121) T ss_pred CHHH----HHHHHHHHHHHhhh---c-Cc---ccCCCCC--CCcccccceeeeeEEEEeecC Confidence 2223 35677788888853 2 11 1112222 677888889999999999999 No 18 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=68.44 E-value=0.24 Score=24.01 Aligned_cols=126 Identities=11% Similarity=0.111 Sum_probs=65.8 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++- |.+..-|++ .=|..+++ .+|.=||+....+...... .+.....++.++.|+- +. T Consensus 9 Lq~Ai~~~L~ad-~al~alvg~-rVyD~~P~------~a~~PyV~lG~~~~~~~~~-~~~~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:10 9 LFNKIYNKLKSN-PIVSKQLGG-RVFDCVQK------DAVYPYIVVGETNVTNKET-TTSMFEDVGVTLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcc-ccccCCcc------CCCCCEEEeCcceeeecCC-CcccceEEEEEEEEEE-----cC Confidence 778899999874 334333544 22444443 2344488764444322222 2223334444444432 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+...+.+.|..||-+. +.-++.. -+++...+...-.+|..+ =+++|.++.--...| T Consensus 75 ~g~---~ea~~ia~av~~aL~a~-l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~~~ 139 (145) T protein:vir:10 75 RNR---DEASQIIQYLGFVLNSE-IEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRHNTLQRS 139 (145) T ss_pred CCH---HHHHHHHHHHHHHhCCC-cCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEeecccccc Confidence 232 23566778888888543 2222222 344555555554455433 577888888777777 No 19 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=68.35 E-value=0.24 Score=24.00 Aligned_cols=126 Identities=12% Similarity=0.122 Sum_probs=65.8 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++- |.+..-|++ .=|..+++ .+|.=||+....+...... .+.....++.++.|+- +. T Consensus 9 Lq~Ai~~~L~ad-~al~alvg~-rVyD~~P~------~a~~PyV~lG~~~~~~~~~-~~~~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:10 9 LFNKIYNKLKSN-PIIKKQLGG-RVFDCVQK------DAVYPYIVVGETNVTNKET-TTSMFEDVGVTLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcc-ccccCCcc------CCCCCEEEeCcceeeecCC-CcccceEEEEEEEEEE-----cC Confidence 778899999873 334433544 22444443 2344488764444322222 2223334444444432 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+...+.+.|..||-+. +.-++.. -+++...+...-.+|..+ =+++|.++.--...| T Consensus 75 ~g~---~ea~~ia~av~~aL~a~-l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~~~ 139 (145) T protein:vir:10 75 RNR---DEASQIIQYLGFVLNSE-IEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRHNTLQRS 139 (145) T ss_pred CCH---HHHHHHHHHHHHHhCCC-cCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEeecccccc Confidence 232 23566778888888543 2222222 344555555554455433 577888888777777 No 20 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=65.72 E-value=0.28 Score=23.62 Aligned_cols=126 Identities=12% Similarity=0.123 Sum_probs=66.6 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++- |.+..-|++ .=|+.+++. +|.=||+-..++....+..++ ....++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-a~l~alvgg-rV~D~~P~~------~~~PYv~lG~~~~~d~~~~~~-~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:95 9 LFNKVYNKLKSN-PIIQKQLDG-RVFDCVQKD------AVYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-HhHHHhhcc-ccccCCcCC------CCCCEEEecCceeeecCCCcc-cceEEEEEEEEEE-----cC Confidence 777899999874 334333544 225555432 344488764444433222222 3334444444431 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+.+.|..||-++- .-+.+. -+++...+...-.+|..+ =+++|.+++--...| T Consensus 75 ~g---~~eak~ia~av~~aL~~~l-~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~ 139 (145) T protein:vir:95 75 RN---RDEASQIIQFLGFVLNNEI-EIDYYSFIKSRIDTQEVITDIDQYTKHGVIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhcccc-CCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecccccc Confidence 23 3446677888888997742 222222 345555566554444433 467777777777777 No 21 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=65.68 E-value=0.28 Score=23.62 Aligned_cols=126 Identities=12% Similarity=0.123 Sum_probs=66.6 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++- |.+..-|++ .=|+.+++. +|.=||+-..++....+..++ ....++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-a~l~alvgg-rV~D~~P~~------~~~PYv~lG~~~~~d~~~~~~-~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:94 9 LFNKVYNKLKSN-PIIQKQLDG-RVFDCVQKD------AVYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-HhHHHhhcc-ccccCCcCC------CCCCEEEecCceeeecCCCcc-cceEEEEEEEEEE-----cC Confidence 777899999874 334333544 225555432 344488764444433222222 3334444444431 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+.+.|..||-++- .-+.+. -+++...+...-.+|..+ =+++|.+++--...| T Consensus 75 ~g---~~eak~ia~av~~aL~~~l-~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~ 139 (145) T protein:vir:94 75 RN---RDEASQIIQFLGFVLNNEI-EIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhcccc-CCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecccccc Confidence 23 3446677888888997742 222222 345555566554444433 467777777777777 No 22 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=63.87 E-value=0.31 Score=23.37 Aligned_cols=126 Identities=11% Similarity=0.107 Sum_probs=60.8 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|+++|++. |.+..-|++ .=|+.+++. .|.=||+-...+....+.. +.....++.++.|+= +. T Consensus 9 Lq~ai~~~L~ad-~~l~~lvg~-~vyD~~P~~------~~~PyV~lG~~~~~~~~t~-~~~~~~~~lti~Vws-----~~ 74 (145) T protein:vir:12 9 LFNKVYNKLKSN-PIIQKQLGG-RVFDCVQKD------AVYPYIVVGETNVTNKETT-TSMVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhcCc-ccccCCccC------CCCCEEEeccceeeecCCC-cccceEEEEEEEEEE-----cC Confidence 556688888763 334333544 225555543 2333777533332222222 223334444444431 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEEE---EEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLVY---QLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~y---~~~F~~~~~l~~~r 144 (144) .| ..++.++...|.+||.+.-.- +++. -+++..-+.+.-.++..+. .++|+++.-=..-| T Consensus 75 ~g---r~ea~~ia~ai~~aL~~~l~l-~~~~lv~l~~~~~~~~rd~d~~~~hgvl~~ra~i~~~~~~~~ 139 (145) T protein:vir:12 75 RN---RDEASQIIQFLGFVLNNEIEI-DYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred cc---HHHHHHHHHHHHHHhccccCC-CCceEEEEEEeeEEEEecCCCceEEEEEEEEEEEEeCCcccc Confidence 22 233567788888888874322 2222 2445555555444554332 56666655444444 No 23 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=55.41 E-value=0.48 Score=22.33 Aligned_cols=121 Identities=17% Similarity=0.170 Sum_probs=63.2 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |..+|-.-|++- |.+.. +.|+.. ..+=.-+..|+.+|.-||+...-++.+.+...|. .-.+++.| =++.+.. T Consensus 1 m~~~i~~~l~~d-~~v~a-llg~~~-~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~l~g~---~~~~~~~v--QIDvyA~ 72 (121) T protein:vir:43 1 MYPPIFKVCSSS-PAVTA-ILGASP-LRMYQFGLAPQLVVKPYATWQTISGSPENYLWGR---PDADGFTI--QVDIFSA 72 (121) T ss_pred CChHHHHHHhhC-hhhhh-hhcCCC-ceeeccCCCCCCCcCCeEEEEEecCcccceecCC---CCcceeEE--EEEeeeC Confidence 999888888752 22222 222211 1111111235556666888876555444443342 22222222 2222222 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeec Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRI 142 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~ 142 (144) + .++...+.++|+.||-+=.+ +.-.++ -+|+..+-+|+-.|.++..++| T Consensus 73 t----~~~A~~l~~av~~Al~~~~~-------~~~~~~--~~ye~dT~lyR~s~Dv~w~~~r 121 (121) T protein:vir:43 73 T----AAEARDAAKAIRDAIELSAY-------VVRWGG--ESVDPDTKTYRVSFDVDWIVQR 121 (121) T ss_pred C----HHHHHHHHHHHHHHhhhcCC-------cccCCC--CCCcccccceeeeeEEEEeecC Confidence 2 23345677888888865222 111122 3477777889999999988888 No 24 >protein:vir:97211 Length: 150 # NCBI annotation: hypothetical protein ORF026 # Family: family:all:5248 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294534;genbank:gi:149408255;genbank:GeneID:5237076 Probab=50.85 E-value=0.6 Score=21.81 Aligned_cols=134 Identities=15% Similarity=0.098 Sum_probs=78.4 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhcc------------CCCccCCeEEEEecccc-CCcCcccccceeEeeee Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGG------------QMPSSTFSAFVLPSGLS-GRGVYSTTGAFTQEFEE 67 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~------------~~p~~~PaAyVip~~d~-~~~~~~~~g~~~Q~v~~ 67 (144) |.-|--+.+|+++=++..-.. .|+..++.-.- .+| ..-+-||=.. .. +.++..+.|.-.|.... T Consensus 1 ~~~~tF~qaR~ei~t~f~~~W-~a~~~a~~g~~p~~~~w~~~~~~~~P-~g~~~WaRLt-i~~~~~~~as~G~~~gr~~~ 77 (150) T protein:vir:97 1 MTLPTFDSARDEILGLFNTKW-ITDTPALNGGAPIRVEWPGVDAGDPP-PADKPYARIT-LRHTTSRQATFGPTGGRRFT 77 (150) T ss_pred CCCCcHHHHHHHHHhhhhhhc-cccchhhcCCcceeeccCCcccCCCc-CCCCceEEEE-eeccccccccccCCCCcEEe Confidence 777777777776554433122 23333332111 111 0111132211 11 22233334545577778 Q ss_pred EEEEEEEe---cCCCCCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeeeeccC Q lcl|NC_020866. 68 AASIVLAI---RSFDASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQMRIAR 144 (144) Q Consensus 68 ~f~Vvv~~---~~~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l~~~r 144 (144) +.|+|+|= .-+.. +++..+.++-..++.|.-||.= + +-|.|..-..+..-....|||..-+++|+.---| T Consensus 78 r~Gli~VQiF~p~~~G---~G~~la~~~Ad~a~eaFe~~~t--~--g~i~f~~a~~~eig~~~gWyQ~Nv~i~Feyde~r 150 (150) T protein:vir:97 78 RPGLITVQVFTPLSGG---QGLSLAEKCAIIARDAFEGRGT--A--SGIWFRNARIQEIGPDGAWYQMNVVVEFEYDELR 150 (150) T ss_pred eCcEEEEEEeeeccCC---chhhHHHHHHHHHHHHHhccCC--c--CCeecccccccccCCCCceEEEEeEeeeeccccC Confidence 88887543 22233 3444466677888999999862 2 2378888888888888899999999999988888 No 25 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=49.40 E-value=0.64 Score=21.65 Aligned_cols=107 Identities=15% Similarity=0.121 Sum_probs=54.7 Q ss_pred CchHHH-HHHHHh--CcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC Q lcl|NC_020866. 1 MIDEII-QKLSDE--IPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS 77 (144) Q Consensus 1 ~l~~vi-~rLra~--~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~ 77 (144) ||..+| +.|.+. ||.+.+ .|...|.-||+- |..+++. ++ +. ..--+++.+ T Consensus 1 miE~~i~~~L~~~l~Vpv~~e----------------~p~~~P~~FV~v--ErtGG~~--~~-~~------~~~~lAVq~ 53 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFLE----------------KKGEMPLSYILF--EKTGSSK--SN-HL------LSSTFAFQS 53 (111) T ss_pred ChHHhHHHHHhhcCCceeEee----------------cCCCCCCceEEE--EecCCcc--cc-cc------ccceEEEEe Confidence 888754 466654 444332 345568889986 3432211 11 11 223344444 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCce-eEeecCcEEEEEEeeeeeee Q lcl|NC_020866. 78 FDASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGR-LLSMQAGLLVYQLDFALTDQ 139 (144) Q Consensus 78 ~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~-l~~~~~g~l~y~~~F~~~~~ 139 (144) +-.+=.. .+.|-.+|+.+|..+. ..+.-..+...+.- ..+-..++.=||..|.+++. T Consensus 54 w~~S~~e----Aa~La~~v~~~l~~l~-~~~~I~av~~~s~ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:16 54 YAPSMYE----AAKLNEQLKEVVERLI-ELNEISNVSLNSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred cchhHHH----HHHHHHHHHHHHhhcc-ccccceeeecCCCCcCCCCCCCCceEEEEEEEeeC Confidence 4332223 3444555555555552 22222234444441 12223578999999999999 No 26 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=44.40 E-value=0.81 Score=21.09 Aligned_cols=126 Identities=10% Similarity=0.056 Sum_probs=62.8 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++- |.+..-|++. =|+.+++ .+|.=||+-..++....+..++ ....++.++.|+ - +. T Consensus 9 Lq~Ai~a~L~ad-a~l~alvg~~-VyD~~P~------~~~~Pyv~lG~~~~~~~~~~~~-~g~~~~~~i~Vw--s---~~ 74 (140) T protein:vir:96 9 LTVQIYKRLKAS-PIINKFVGDR-VFDVVQE------DAVYPYIVVGESNVTNNESSTM-MRETVGIVIHVY--S---QF 74 (140) T ss_pred HHHHHHHHhhcC-hhHHHhcCCc-cccCCcc------CCCCCEEEecCceeeecCCCcc-cceEEEEEEEEE--E---cC Confidence 667789999874 4444435442 2555543 2344488864444333222222 233444444433 1 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCc--eEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDV--LRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~p--i~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+..++-+.|..||-+. +.-+++.. +++...+...-.+|..+ =+++|.+..--..-| T Consensus 75 ~g~---~ea~~ia~av~~AL~~~-l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~r~~v~~~~~~~~ 139 (140) T protein:vir:96 75 ATQ---YEAKQIISAIGYVLNRP-IDIENYEFQFSRIDSQSVFPDIDRFTKHGTIRLLFKYRHIKKGEG 139 (140) T ss_pred CCH---HHHHHHHHHHHHHhCCC-ccCCCCeEEEEEEeeeEEEecCCCceEEEEEEEEEEEEeeccccC Confidence 232 33567778888888543 22232222 33455555544455443 355666555555555 No 27 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=42.24 E-value=0.89 Score=20.85 Aligned_cols=126 Identities=10% Similarity=0.092 Sum_probs=61.5 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-..|++||++- |.+..-|++. =|+.+++ .+|.=||+-..++....+.. ......++.++.|+= +. T Consensus 9 Lq~Ai~~~L~ad-~~l~alvggr-V~D~~P~------~a~~PYv~lG~~~~~d~~~~-~~~g~~~~~ti~Vws-----~~ 74 (145) T protein:vir:97 9 LFNKVYNKLKSN-LIIRKQLDGR-VFDCVQK------DAVYPYIVVGETNVTNKETT-TSMVEDVGITLHVYS-----QA 74 (145) T ss_pred HHHHHHHHhhcC-hhHHHhhcCc-eecCCcc------CCCCCEEEeCcceeeecCCC-cccceEEEEEEEEEE-----cC Confidence 778899999874 3333335442 2445543 23444888644444332222 223344444444441 22 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .| ..+...+-+.|..||-++ +.-+++. -+++...+...-.+|..+ =+++|.+++---.-| T Consensus 75 ~g---~~eak~ia~av~~aL~~~-l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~~~ 139 (145) T protein:vir:97 75 RN---RDEASQIIQFLGFVLNNE-IEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQRS 139 (145) T ss_pred CC---HHHHHHHHHHHHHHhccc-cCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEecCceecc Confidence 23 234567788888899774 2223222 234455555544444422 234444444333333 No 28 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=42.22 E-value=0.89 Score=20.85 Aligned_cols=105 Identities=20% Similarity=0.268 Sum_probs=55.7 Q ss_pred CchHHHH-HHHH--hCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC Q lcl|NC_020866. 1 MIDEIIQ-KLSD--EIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS 77 (144) Q Consensus 1 ~l~~vi~-rLra--~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~ 77 (144) ||..+|- .|.. .||.+.. .|...|.-||+- |..+++ .+... ..--+++.+ T Consensus 1 miE~~v~~~L~~~l~vpv~~~----------------vp~~~P~~FV~v--ErtGG~-------~~~~~--~~p~laVq~ 53 (111) T protein:vir:95 1 MIEIIINKYLDGHLDVPSFFE----------------HEAEAPDSFVII--QKTGGK-------ERNHS--GSATFAFQS 53 (111) T ss_pred ChHHhHHHHhhhhcCeeEEee----------------cCCCCCCceEEE--EeeCCc-------ccccc--ccceEEEEe Confidence 8887655 4422 2444432 233457788886 333221 11111 222344444 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeec---CcEEEEEEeeeeeee Q lcl|NC_020866. 78 FDASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQ---AGLLVYQLDFALTDQ 139 (144) Q Consensus 78 ~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~---~g~l~y~~~F~~~~~ 139 (144) .-.+ ..+..+|-.+|+.++-+|.=.+.. ..+...+ ...|. .+..=||..|.+++. T Consensus 54 wg~S----~~~Aa~La~~v~~a~~~l~~~~~i-~~v~~~s--~ynf~d~~tk~~RYQ~~~~i~~~ 111 (111) T protein:vir:95 54 YAPT----MQKAAELNVKVKSAVKGLIELDSI-CGVHLNS--DYNFTDTETKQYRYQAVFDINYF 111 (111) T ss_pred cccc----HHHHHHHHHHHHHHHhhhhccccc-cccccCC--ccccCCCCCCCceEEEEEEEEeC Confidence 4322 223455667788888888532222 2233333 33333 578999999999999 No 29 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=39.11 E-value=1 Score=20.50 Aligned_cols=126 Identities=9% Similarity=0.094 Sum_probs=59.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++. |.+..-|++. =|+.+++. +|.=||+...++....+...+ ....++.++.|+ - +. T Consensus 9 LQ~Ai~~~L~ad-aal~alvg~r-I~D~~P~~------~~~PYv~lG~~~~~~~~~~~~-~g~~~~~ti~Vw--s---~~ 74 (141) T protein:vir:96 9 LTNQIYKRLISD-PNINKLVDDR-VFDVVQDD------AVYPYIVVGESNVTNNESSAT-MRETVGIVIHVY--S---QF 74 (141) T ss_pred HHHHHHHHhhcC-hhhHhhcCCc-cccCCccC------CCCCEEEeCCceeeecCCCcc-cceEEEEEEEEE--E---cC Confidence 777899999874 3333335442 25555443 333388765444433232222 223344444333 1 23 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCc--eEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDV--LRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~p--i~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+..++-+.|..||-++-+- +++.. +++...+...-.+|..+ =+++|.+++--..-+ T Consensus 75 ~g~---~eak~ia~av~~AL~~~l~l-~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~~ 139 (141) T protein:vir:96 75 ATQ---YEAKLILSAIGYVLNRPIEI-DNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNEG 139 (141) T ss_pred CCH---HHHHHHHHHHHHHhcccccC-CCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecccccc Confidence 332 33456678888888654222 22221 33334444433344332 345555444333333 No 30 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=39.11 E-value=1 Score=20.50 Aligned_cols=126 Identities=9% Similarity=0.094 Sum_probs=59.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++. |.+..-|++. =|+.+++. +|.=||+...++....+...+ ....++.++.|+ - +. T Consensus 9 LQ~Ai~~~L~ad-aal~alvg~r-I~D~~P~~------~~~PYv~lG~~~~~~~~~~~~-~g~~~~~ti~Vw--s---~~ 74 (141) T protein:vir:10 9 LTNQIYKRLISD-PNINKLVDDR-VFDVVQDD------AVYPYIVVGESNVTNNESSAT-MRETVGIVIHVY--S---QF 74 (141) T ss_pred HHHHHHHHhhcC-hhhHhhcCCc-cccCCccC------CCCCEEEeCCceeeecCCCcc-cceEEEEEEEEE--E---cC Confidence 777899999874 3333335442 25555443 333388765444433232222 223344444333 1 23 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCc--eEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDV--LRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~p--i~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+..++-+.|..||-++-+- +++.. +++...+...-.+|..+ =+++|.+++--..-+ T Consensus 75 ~g~---~eak~ia~av~~AL~~~l~l-~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~~ 139 (141) T protein:vir:10 75 ATQ---YEAKLILSAIGYVLNRPIEI-DNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNEG 139 (141) T ss_pred CCH---HHHHHHHHHHHHHhcccccC-CCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecccccc Confidence 332 33456678888888654222 22221 33334444433344332 345555444333333 No 31 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=39.11 E-value=1 Score=20.50 Aligned_cols=126 Identities=9% Similarity=0.094 Sum_probs=59.1 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-+.|++||++. |.+..-|++. =|+.+++. +|.=||+...++....+...+ ....++.++.|+ - +. T Consensus 9 LQ~Ai~~~L~ad-aal~alvg~r-I~D~~P~~------~~~PYv~lG~~~~~~~~~~~~-~g~~~~~ti~Vw--s---~~ 74 (141) T protein:vir:94 9 LTNQIYKRLISD-PNINKLVDDR-VFDVVQDD------AVYPYIVVGESNVTNNESSAT-MRETVGIVIHVY--S---QF 74 (141) T ss_pred HHHHHHHHhhcC-hhhHhhcCCc-cccCCccC------CCCCEEEeCCceeeecCCCcc-cceEEEEEEEEE--E---cC Confidence 777899999874 3333335442 25555443 333388765444433232222 223344444333 1 23 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCc--eEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDV--LRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~p--i~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+..++-+.|..||-++-+- +++.. +++...+...-.+|..+ =+++|.+++--..-+ T Consensus 75 ~g~---~eak~ia~av~~AL~~~l~l-~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~~ 139 (141) T protein:vir:94 75 ATQ---YEAKLILSAIGYVLNRPIEI-DNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNEG 139 (141) T ss_pred CCH---HHHHHHHHHHHHHhcccccC-CCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecccccc Confidence 332 33456678888888654222 22221 33334444433344332 345555444333333 No 32 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=35.47 E-value=1.2 Score=20.09 Aligned_cols=126 Identities=10% Similarity=0.072 Sum_probs=62.6 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |-..|++||++- |.+..-|+| .=|+.+++. +|.=||.-...+....... ......++.++.|+ - +. T Consensus 9 Lq~Ai~~~L~ad-~~l~alvgg-rVyD~~P~~------~~~PYV~lG~~~~~~~~~~-~~~g~~~~~tl~Vw--s---~~ 74 (140) T protein:vir:96 9 LYNKIMNNLIEN-PITDKLVGG-RVFDCVQKD------VVYPYIVVGESNVTESERS-PGMREIIAITFHVY--S---QY 74 (140) T ss_pred HHHHHHHHhccC-hhHHhhcCc-ccccCCccC------CCCCEEEeCCceeeecCCC-cccceEEEEEEEEE--E---cC Confidence 777899999874 333333544 225554432 3333887644333222222 22333444444433 1 23 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEEE---EEEeeeeeeeeeccC Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLLV---YQLDFALTDQMRIAR 144 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l~---y~~~F~~~~~l~~~r 144 (144) .|. .+..++-+.|..||.+ .+.-+++. -+++.+.+...-.+|..+ =+++|.++.--..-| T Consensus 75 ~g~---~ea~~ia~ai~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~ve~~~~~~~ 139 (140) T protein:vir:96 75 ENG---AEARELLKYLNYACRL-NINFKDYELEWIKKDNSQVFTDIDQYTKHGVLRLLYKVRHKTLQER 139 (140) T ss_pred CCH---HHHHHHHHHHHHHhcC-CccCCCceEEEEEEeeeEEeecCCCceEEEEEEEEEEEeecccccC Confidence 332 2356677888888863 33223222 245555555554455433 456666665544444 No 33 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=35.31 E-value=1.2 Score=20.07 Aligned_cols=105 Identities=17% Similarity=0.148 Sum_probs=52.4 Q ss_pred CchHHH-HHHHHh--CcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC Q lcl|NC_020866. 1 MIDEII-QKLSDE--IPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS 77 (144) Q Consensus 1 ~l~~vi-~rLra~--~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~ 77 (144) ||..+| +.|.++ ||.+.+ .|...|.-||+- |..+++. ++.. .+--+++.+ T Consensus 1 miE~~v~~~L~~~l~vpv~~e----------------~p~~~p~~FV~v--ErtGG~~--~~~~-------~~~~lAVQ~ 53 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFLE----------------KKGEMPLSYVLF--EKTGSSK--SNHL-------LSSTFAFQS 53 (111) T ss_pred ChHHhHHHHHhhcCCcceEee----------------cCCCCCCceEEE--EecCCcc--cccc-------ccceEEEEe Confidence 888744 466554 444332 345568889986 3332211 1111 223344544 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeec---CcEEEEEEeeeeeee Q lcl|NC_020866. 78 FDASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQ---AGLLVYQLDFALTDQ 139 (144) Q Consensus 78 ~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~---~g~l~y~~~F~~~~~ 139 (144) +-.+=..| ..|-.+|+.++..+. ..+.-..+...+. .+|. .++.=||..|.+++. T Consensus 54 ~~~S~~eA----a~La~~v~~~~~~l~-~~~~i~~v~~~s~--Ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:94 54 YAPSMYEA----AKLNEQLKEVVERLI-ELNEISNVSLNSD--YNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred cchhHHHH----HHHHHHHHHHHhhcc-cccccceeecCCC--cccCCCcCCCceEEEEEEEeeC Confidence 44322233 333444444444442 1121223444333 3333 578899999999998 No 34 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=31.94 E-value=1.5 Score=19.68 Aligned_cols=107 Identities=19% Similarity=0.202 Sum_probs=59.4 Q ss_pred CchHHH-HHHHH--hCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecC Q lcl|NC_020866. 1 MIDEII-QKLSD--EIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRS 77 (144) Q Consensus 1 ~l~~vi-~rLra--~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~ 77 (144) ||..|| ..|.. -||.+.+ + |...|.-||+- |..++ +.++ .. ....+++.+ T Consensus 1 mIE~~i~~yL~~~l~vpv~~e----------~------p~~~P~~FV~v--EkTGG--~~~~-----~~--~~a~lAvQs 53 (111) T protein:vir:97 1 MIEVIIKKYLDEHLDVPSFFE----------H------QKDEPARFIIL--EKTSG--AKQN-----HL--LSSTFAFQS 53 (111) T ss_pred ChhhhhhHHHhhhcCceEEEe----------e------cCCCCCceEEE--EeeCC--cccc-----cc--ccceEEEEe Confidence 888854 47766 4777654 1 23457889986 33322 1112 11 233455555 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCc-eeEeecCcEEEEEEeeeeeee Q lcl|NC_020866. 78 FDASGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNG-RLLSMQAGLLVYQLDFALTDQ 139 (144) Q Consensus 78 ~d~~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G-~l~~~~~g~l~y~~~F~~~~~ 139 (144) +..+ .-+...|-.+|+.|+.+|.--+. -..+...+. ...+-+.++.=||-.|.+++. T Consensus 54 yg~S----~~~AA~La~~V~~a~~~l~~l~~-i~~v~lns~Ynf~d~~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 54 YAES----LYEAALLNDKVKQVIEQLDVLPQ-VSGVHLNADYNFTDTATKRYRYQAVFDINHY 111 (111) T ss_pred cchh----HHHHHHHHHHHHHHhhhhccCcc-ceeeeecccccCCCCCCCCccEEEEEEEeeC Confidence 5432 22345666778888888752222 223444443 112223578889999999988 No 35 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=22.71 E-value=2.4 Score=18.49 Aligned_cols=120 Identities=11% Similarity=0.167 Sum_probs=61.8 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |=+.|++||++.-+ +..-|+ .|-|. .|..+|.=||+...++....+...+ ....+..++.|+ -+ . T Consensus 10 Lq~Ai~~~L~ad~~-l~alvg------~I~D~--~P~~~~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vw--s~---~ 74 (134) T protein:vir:59 10 LQKATVENLESYQP-LMEMVN------QVTES--PGKDDPYPYVVIGDQSSTPFETKSS-FGENITMDFHVW--GG---T 74 (134) T ss_pred HHHHHHHHhhcChh-HHHhhh------hhhcC--CCCCCCCCEEEeCCceeeecCCCcc-cceEEEEEEEEE--EC---C Confidence 77779999987433 333253 23332 2233445588864444332222222 223333333333 12 2 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCC--ceEecCceeEeecCcEE---EEEEeeeeeee Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPD--VLRLLNGRLLSMQAGLL---VYQLDFALTDQ 139 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~--pi~~~~G~l~~~~~g~l---~y~~~F~~~~~ 139 (144) |.. +...+-+.|..||.++...-+++. .+++...+...-.+|.. .-+++|.++.- T Consensus 75 -g~~---ea~~ia~av~~aL~~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 75 -TRA---EAQDISSRVLEALTYKPLMFEGFTFVAKKLVLAQVITDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred -ChH---HHHHHHHHHHHHhcCCCcccCCceEEEeEEeeeeEEecCCCceEEEEEEEEEEEecC Confidence 322 356778899999998875433332 34455556654445543 34555556665 No 36 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=20.39 E-value=2.8 Score=18.15 Aligned_cols=114 Identities=12% Similarity=0.061 Sum_probs=54.5 Q ss_pred CchHHHHHHHHhCcchhhHHHHHHhHHHHHhccCCCccCCeEEEEeccccCCcCcccccceeEeeeeEEEEEEEecCCCC Q lcl|NC_020866. 1 MIDEIIQKLSDEIPRLKERVQGAVDFGKLVAGGQMPSSTFSAFVLPSGLSGRGVYSTTGAFTQEFEEAASIVLAIRSFDA 80 (144) Q Consensus 1 ~l~~vi~rLra~~p~~~~rV~gAadla~l~~~~~~p~~~PaAyVip~~d~~~~~~~~~g~~~Q~v~~~f~Vvv~~~~~d~ 80 (144) |++..|-.|=. |-+.+||--- ..++. .-.+.+|.-|++...-++.+++...|. . .+++.| =++.+.. T Consensus 1 M~e~~i~~lL~--~~~~gRvyp~----~~P~~-~~~~~~~~Pyiv~q~vsg~p~~~l~gp---~-~~~~~v--QIDvyA~ 67 (114) T protein:vir:93 1 MTEADLYPHLA--HLAGGQVYPY----VVPLL-DGRPSVALPWVVFSLISSVSADVMGGQ---A-ESSVSV--QIDVYAG 67 (114) T ss_pred CchHHHHHHHH--hhcCcccccc----cCCcc-cCcCCccCceEEEEeccCcccccccCc---c-ccceEE--EEEeeeC Confidence 88875554421 2333455411 11111 111134556888876665556665552 2 233333 2222332 Q ss_pred CchhhhHHHHHHHHHHHHHHcCCCCCCccCCceEecCceeEeecCcEEEEEEeeeeeeee Q lcl|NC_020866. 81 SGKQALDPMRELIMEVFRSLGGWAPSEDAPDVLRLLNGRLLSMQAGLLVYQLDFALTDQM 140 (144) Q Consensus 81 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~~~pi~~~~G~l~~~~~g~l~y~~~F~~~~~l 140 (144) ..++...++++|+.||-.|.|. ...+++ +|+..+-+|+-.|.+.--. T Consensus 68 ----t~~~A~~l~~~v~~Al~~~~~~-------~~~~~~--~ye~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 68 ----TVTQARQIRQDAREAIMLLAPG-------SVSEMQ--DYIPENRCYRATLEFQVTV 114 (114) T ss_pred ----CHHHHHHHHHHHHHHHhhcCcE-------eecCCC--cccccccceeeEEEEEEeC Confidence 2334578899999999877653 222222 2555555555433322222 Done!