Query         lcl|NC_019410.1_cdsid_YP_006989468.1 [gene=D867_gp266] [protein=hypothetical protein] [protein_id=YP_006989468.1] [location=55122..55493]
Match_columns 123
No_of_seqs    68 out of 72
Neff          5.8
Searched_HMMs 1612
Date          Thu Nov 7 19:10:41 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_88 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_88_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:94955 Length: 170  100.0 1.1E-40   7E-44  239.6   9.7  122   1-123 47-170 (170)
  2 protein:vir:95176 Length: 172  100.0 3.4E-40 2.1E-43  237.0  10.1  121   1-123 47-172 (172)
  3 protein:vir:80389 Length: 172  100.0 4.4E-39 2.7E-42  230.9  10.3  119   1-123 43-172 (172)
  4 protein:vir:78383 Length: 169  100.0 4.3E-38 2.6E-41  225.5   9.9  120   1-123 45-169 (169)
  5 protein:vir:95004 Length: 169  100.0 1.1E-37 6.7E-41  223.2   9.8  120   1-123 45-169 (169)
  6 protein:vir:97267 Length: 172  100.0 5.1E-36 3.2E-39  214.0   9.2  116   1-123 47-172 (172)
  7 protein:vir:80967 Length: 131   97.1 2.4E-06 1.5E-09   51.4   5.6   95   1-119 27-131 (131)
  8 protein:vir:43 Length: 131 # N  97.0   3E-06 1.9E-09   50.8   5.5   95   1-119 27-131 (131)
  9 protein:vir:98900 Length: 132   95.6 0.00011 6.6E-08   42.4   6.5   97   1-123 26-132 (132)
 10 protein:vir:7857 Length: 188 #  90.7  0.0033 2.1E-06   34.2   6.2  100   1-112 79-188 (188)
 11 protein:vir:101652 Length: 188  90.7  0.0033 2.1E-06   34.2   6.2  100   1-112 79-188 (188)
 12 protein:vir:103283 Length: 125  74.3   0.077 4.8E-05   26.7   6.3   87  28-123 1-122 (125)
 13 protein:vir:2505 Length: 128 #  68.9    0.07 4.3E-05   26.9   4.8   92   1-123 33-127 (128)
 14 protein:vir:8104 Length: 170 #  58.6     0.3 0.00019   23.5   6.3  103   1-115 63-170 (170)
 15 protein:vir:4788 Length: 130 #  45.3     0.5 0.00031   22.2   5.2  101   1-119 24-130 (130)
 16 protein:vir:3160 Length: 198 #  45.1    0.44 0.00027   22.5   4.9  103   1-116 77-198 (198)
 17 protein:vir:9928 Length: 118 #  43.7    0.84 0.00052   21.0   6.5   88   1-121 31-118 (118)
 18 protein:vir:80036 Length: 111
42.8 0.17 0.00011 24.8 2.3 102 1-121 4-111 (111) 19 protein:vir:107702 Length: 136 41.2 0.75 0.00046 21.3 5.5 112 1-123 9-133 (136) 20 protein:vir:9821 Length: 138 # 39.5 0.83 0.00051 21.0 5.5 97 1-119 29-138 (138) 21 protein:vir:79701 Length: 144 39.3 1 0.00063 20.5 7.0 106 1-118 27-144 (144) 22 protein:vir:104344 Length: 132 38.9 0.26 0.00016 23.8 2.6 94 8-123 1-129 (132) 23 protein:vir:9576 Length: 131 # 32.3 1.4 0.00089 19.7 6.9 100 1-123 29-130 (131) 24 protein:vir:5256 Length: 119 # 31.6 1.5 0.00092 19.6 6.3 108 1-119 1-119 (119) 25 protein:vir:99570 Length: 153 31.2 1.5 0.00094 19.6 6.9 109 1-123 5-147 (153) 26 protein:vir:94507 Length: 113 30.2 1.6 0.00099 19.5 6.3 101 5-121 1-113 (113) 27 protein:vir:8430 Length: 189 # 30.0 1.6 0.001 19.5 5.5 105 1-118 76-189 (189) 28 protein:vir:9761 Length: 140 # 29.3 1.7 0.001 19.4 6.2 102 1-123 29-134 (140) 29 protein:vir:79640 Length: 134 28.7 1.2 0.00075 20.1 4.5 111 1-123 6-130 (134) 30 protein:vir:80320 Length: 188 26.7 1.9 0.0012 19.0 5.3 91 1-115 81-188 (188) 31 protein:vir:103957 Length: 110 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) 32 protein:vir:96390 Length: 110 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) 33 protein:vir:9311 Length: 110 # 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) 34 protein:vir:97145 Length: 110 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) 35 protein:vir:99796 Length: 110 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) 36 protein:vir:78849 Length: 110 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) 37 protein:vir:96221 Length: 110 22.3 2.5 0.0015 18.4 6.3 102 5-121 1-110 (110) No 1 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=100.00 E-value=1.1e-40 Score=239.58 Aligned_cols=122 Identities=21% Similarity=0.438 Sum_probs=111.7 Q ss_pred 
CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeeecC Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVD 80 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ekVG 80 (123) |++|++|||++|+|+|+|++++|+|+|||+|+. .||+.+++|.||.+||+||||||++++++++.++..++++|+|||| T Consensus 47 L~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~-~dg~~~~~~~IP~~V~~Aq~elA~~~~~~~~~~~~~~~~v~~~kVG 125 (170) T protein:vir:94 47 VISASRYLDQMMAWIGAPTNPEQSMWWPCKNAV-IGGMTLSQVSIPVKVKIAVFELAYFMLESGAALSFADQTIDSVKVG 125 (170) T ss_pred HHHHHHHhccccccccccCCcchhhcccccCcc-cCccccccchhhHHHHHHHHHHHHHHHhCcccCcccccceeeEecc Confidence 999999999989999999999999999999976 8999999999999999999999999999998888888889999999 Q ss_pred eeEEeecCCCCCCcchHHHHHHHhhhcccccCCcc--ceeeEeeC Q lcl|NC_019410. 81 VIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRP--AFKKIIRH 123 (123) Q Consensus 81 ~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~--~~~~v~R~ 123 (123) +|||||+.++++.++|++|++||+||+.....+++ -..+|+|- T Consensus 126 ~i~veY~~~~~~~~~~~~v~~LL~p~l~~~~~g~~~~~~~~~~r~ 170 (170) T protein:vir:94 126 TIRVEFTKNSTDAGLPTFVEAMLSGFGSPVLYGSNAARSIDLVRA 170 (170) T ss_pred eeEEEecCCCCCCccHHHHHHHhhhhhccccccccccceeeeecC Confidence 99999999888889999999999999976443333 34599999 No 2 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=100.00 E-value=3.4e-40 Score=236.97 Aligned_cols=121 Identities=21% Similarity=0.367 Sum_probs=107.6 Q ss_pred Cchhhhhhhh-hcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCC--Cccceeee Q lcl|NC_019410. 1 MVRASKYLDR-TIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQ--TTRGMKEI 77 (123) Q Consensus 1 Li~As~yld~-~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~--~~~~v~~e 77 (123) |++|++|||+ .++|+|+|++++|+|+|||+|+. .+|..+++|.||.+||+||||||+++++++++.+. 
....||+| T Consensus 47 L~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~-~~~~~v~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~ 125 (172) T protein:vir:95 47 LIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVF-LNEDEVPSNVIPKSLIAAQVQLTMAINAGFDLQPNVSPQDYVTRE 125 (172) T ss_pred HHHHHHHhhccCCceeeeecCCcccccCCcCCcc-cCcccccccchhHHHHHHHHHHHHHHHcCccccccCCcccceeEE Confidence 9999999997 37999999999999999999986 79999999999999999999999999999876554 34569999 Q ss_pred ecCeeEEeecCCCC--CCcchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 78 QVDVIELKFDSEIQ--RGSMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 78 kVG~I~v~Y~~~~~--~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) |||+|||||+.+.+ +.++|+++++||+||++..+|++|+ +|++|. T Consensus 126 kVG~I~veY~~~~~~~~~~~~~~v~~LL~p~l~~~~~~~~~-~r~~r~ 172 (172) T protein:vir:95 126 KVGPIETEYADPLSVGIMPTFTAANALLAPLFGECASNKFA-LRTIRV 172 (172) T ss_pred eccceEEeeccCCCCCCcccHHHHHHHHhhhhcccCCccee-eEEEeC Confidence 99999999987553 3478999999999999988777777 789999 No 3 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=100.00 E-value=4.4e-39 Score=230.86 Aligned_cols=119 Identities=29% Similarity=0.483 Sum_probs=103.2 Q ss_pred Cchhhhhhhhh-cccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCC-CCCccceeeee Q lcl|NC_019410. 1 MVRASKYLDRT-IAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTS-PQTTRGMKEIQ 78 (123) Q Consensus 1 Li~As~yld~~-~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~-~~~~~~v~~ek 78 (123) |++|+||||+. ++|+|+|++++|+|+|||+|+. +||+.+|+|.||.+||+||||||++++++..+. 
...+.+||+|| T Consensus 43 L~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~-~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~ek 121 (172) T protein:vir:80 43 ILEAMDYIESFRRRWKGERNTREQGLTWPRHDAV-VDGFVIPSDVIPKELQSAVAAAVIEQVNGFELQQSQDQWAVRIEK 121 (172) T ss_pred HHHHHHHHhhccCccccccCCccccccccccCcc-cCcccccccchhHHHHHHHHHHHHHHhcCCccCcCCCCceeeEEe Confidence 99999999984 3699999999999999999976 899999999999999999999999999985544 45566799999 Q ss_pred cCeeEEeecCCC---------CCCcchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 79 VDVIELKFDSEI---------QRGSMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 79 VG~I~v~Y~~~~---------~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) ||+||+||+.+. +..++|++|++||+||++ |.++.+++++|- T Consensus 122 VG~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~LL~p~l~---~~gg~~~~~vrg 172 (172) T protein:vir:80 122 VDVIEVQYAAGGGGQSASANAPMKPTFPKIDALLNPLLV---GDGGLFLVAVRG 172 (172) T ss_pred ccceEEeeecccCccccccccCCccchHHHHHHHhhhhc---CCCCeeeeeecC Confidence 999999998542 234679999999999987 445556789999 No 4 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=100.00 E-value=4.3e-38 Score=225.46 Aligned_cols=120 Identities=18% Similarity=0.255 Sum_probs=103.4 Q ss_pred Cchhhhhhhhh-cccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCC-CCCccceeeee Q lcl|NC_019410. 1 MVRASKYLDRT-IAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTS-PQTTRGMKEIQ 78 (123) Q Consensus 1 Li~As~yld~~-~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~-~~~~~~v~~ek 78 (123) |++|++|||+. ++|+|+|++++|+|+|||+|+. 
.||+++|+|.||.+||+||||||++++++++.+ +...+.+++|| T Consensus 45 L~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~-~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~ 123 (169) T protein:vir:78 45 LRNGAVYVGLFESQMCGRRVSANQALAFPRTGVT-LHGFPQPSNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREVQTER 123 (169) T ss_pred HHHHHHHhhhccccceeeeCCcccccccccCCce-ecccccccccchHHHHHHHHHHHHHHhcCcccCCCCCcceeEEEE Confidence 99999999983 3799999999999999999976 999999999999999999999999999997665 45667799888 Q ss_pred c-CeeEEeecCCCCCC--cchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 79 V-DVIELKFDSEIQRG--SMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 79 V-G~I~v~Y~~~~~~~--~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) | |+||+||+.+++.+ +.|+++++||+||++.+ |++++ .+++|- T Consensus 124 v~G~i~veY~~~~~~~~~~~~~~~~~LL~p~l~~~-~g~~~-i~~~rg 169 (169) T protein:vir:78 124 VEGAVTVSYFKNGYSGGTVSITTADDALRPLLCGS-NNAYS-FNVFRG 169 (169) T ss_pred ecCceeEeecCCCCCCCcccHHHHHHHhhhhcccC-CCcce-eeeecC Confidence 7 99999999766543 67899999999999732 33333 589999 No 5 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=100.00 E-value=1.1e-37 Score=223.24 Aligned_cols=120 Identities=18% Similarity=0.267 Sum_probs=102.5 Q ss_pred Cchhhhhhhhh-cccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCC-CCccceeeee Q lcl|NC_019410. 1 MVRASKYLDRT-IAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSP-QTTRGMKEIQ 78 (123) Q Consensus 1 Li~As~yld~~-~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~-~~~~~v~~ek 78 (123) |++|++|||+. 
++|+|+|++++|+|+|||+|+ +.||.++++|.||.+||+||||||++++++++..+ ...+.|++|| T Consensus 45 L~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~-~~~g~~~~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~v~~e~ 123 (169) T protein:vir:95 45 LRNGAVYVGLFESQMCGRRVSANQALAFPRTGI-DLHGFPQPSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGREVQTER 123 (169) T ss_pred HHHHHHHhhccccccccccCCcchhhccccCCc-eecccccccccchHHHHHHHHHHHHHHHcCccccCCCCccceeeee Confidence 99999999984 379999999999999999996 59999999999999999999999999999876544 5566788877 Q ss_pred c-CeeEEeecCCCCCC--cchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 79 V-DVIELKFDSEIQRG--SMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 79 V-G~I~v~Y~~~~~~~--~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) | |+||+||+.+++.+ +.|+++++||+|||+.+ |++|+ .+++|- T Consensus 124 v~G~i~veY~~~~~~~~~~~~~a~~~LL~p~l~g~-~g~~~-i~~~rg 169 (169) T protein:vir:95 124 VEGAVTVSYFKNGYSGGTVSITAADDALRPLLCGS-NNAYS-FNVFRG 169 (169) T ss_pred eccceeEeecCCCCcCccccHHHHHHhhhhhcccC-CCcce-eeeecC Confidence 6 99999999866554 67899999999999732 33333 589999 No 6 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=100.00 E-value=5.1e-36 Score=214.05 Aligned_cols=116 Identities=23% Similarity=0.316 Sum_probs=96.4 Q ss_pred CchhhhhhhhhcccCCcc-CCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCC----cc--c Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEK-VDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQT----TR--G 73 (123) Q Consensus 1 Li~As~yld~~~~~~G~r-~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~----~~--~ 73 (123) |++|++|||++|+|+|+| ++++|+|+|||+|+. ||..+|+|.||.+||+||||||++++++++.++.. .+ . 
T Consensus 47 Li~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~--d~~~~~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~ 124 (172) T protein:vir:97 47 VIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW--DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVL 124 (172) T ss_pred HHHHHHHHhhhhhcccCCCCCcchhhhcccCCCC--CCcccccccccHHHHHHHHHHHHHHHhcccccccccccccccce Confidence 999999999989999988 589999999999984 79999999999999999999999999998765322 12 4 Q ss_pred eeeeecCeeEEeecCCCC---CCcchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 74 MKEIQVDVIELKFDSEIQ---RGSMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 74 v~~ekVG~I~v~Y~~~~~---~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) +||+|||+|+++|+..+. ..++|++|++||+|++...+| + +++|. T Consensus 125 ~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~aLL~p~gl~~~~---~--~~~r~ 172 (172) T protein:vir:97 125 SKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSG---G--TLLRG 172 (172) T ss_pred eeeeeecceeeEeeccCCCCCccccHHHHHHHHhhhccccCc---c--eeccC Confidence 889999999999976443 347899999999997443222 2 56677 No 7 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=97.09 E-value=2.4e-06 Score=51.41 Aligned_cols=95 Identities=17% Similarity=0.104 Sum_probs=55.7 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeec--CCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeee Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIP--SDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQ 78 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~--~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ek 78 (123) |.||+++||.. .| ++ +++.-+. 
.+.+|.+||.|+|+.|-.+...+.......+.+++++ T Consensus 27 ~~rAs~~ID~~-T~-~r-----------------i~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~g~~~~~~~~~~~S~s 87 (131) T protein:vir:80 27 LKHAERKIDSV-TF-YR-----------------IRKSGIEAFSEFIQHQIQLATCNQIEYFKEAGGTSELAVSKPDNVS 87 (131) T ss_pred HHHHHHHHHHH-hc-cc-----------------ccccccccCchhHHHHHHHHHHHHHHHHHHhhhhhhhcccccCeee Confidence 88888888873 33 11 1111121 2468999999999999665554433333456789999 Q ss_pred cCeeEEeecCCCCCC---c---chHHHHHHHhh--hcccccCCccceee Q lcl|NC_019410. 79 VDVIELKFDSEIQRG---S---MPDIVMSILEG--LGVVKTGTRPAFKK 119 (123) Q Consensus 79 VG~I~v~Y~~~~~~~---~---~~~~v~~lL~~--ll~~~~G~~~~~~~ 119 (123) ||..+|+|...+..+ . ....+..+|.+ ||...-+- | T Consensus 88 vG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLlyrGV~~-----~ 131 (131) T protein:vir:80 88 IGRTSISDSNFASTATSLNSGLVGSDVRSYLAHTGLLYNGVGV-----R 131 (131) T ss_pred eCceEEeeccccchhhhhhhhhhHHHHHHHHhccCCeecCCCC-----C Confidence 999999998744332 1 22334444543 33311111 1 No 8 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=97.01 E-value=3e-06 Score=50.83 Aligned_cols=95 Identities=17% Similarity=0.101 Sum_probs=55.0 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeec--CCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeee Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIP--SDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQ 78 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~--~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ek 78 (123) |.||+++||.. .| ++ +++.-+. 
.+.+|.+||.|+|+.|-.+...+.......+.+++++ T Consensus 27 ~~rAs~~ID~~-T~-~r-----------------i~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~g~~s~~~~~~~~S~s 87 (131) T protein:vir:43 27 LKHAERKIDSV-TF-YR-----------------IRKGGIESFSEFIQHQIQLATCNQIEYFKEAGGTSELAVSKPDNVS 87 (131) T ss_pred HHHHHHHHHHH-hc-cc-----------------ccccCccccchhhHHHHHHHHHHHHHHHHHhHHHhhhhccccCeee Confidence 88888888873 33 11 1111111 2468999999999999655544333333445689999 Q ss_pred cCeeEEeecCCCCCCc------chHHHHHHHhh--hcccccCCccceee Q lcl|NC_019410. 79 VDVIELKFDSEIQRGS------MPDIVMSILEG--LGVVKTGTRPAFKK 119 (123) Q Consensus 79 VG~I~v~Y~~~~~~~~------~~~~v~~lL~~--ll~~~~G~~~~~~~ 119 (123) ||..+|+|...+..+. ....+..+|.+ ||...-+- | T Consensus 88 vG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLlyrGV~~-----~ 131 (131) T protein:vir:43 88 IGRTSISDSNFASTATSLNSGLIGSDVRSYLAHTGLLYNGVGV-----R 131 (131) T ss_pred cCceEEeecccccchhhhchhhhHHHHHHHHhccCCeecCCCC-----C Confidence 9999999986443221 23344444543 33211111 1 No 9 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=95.60 E-value=0.00011 Score=42.37 Aligned_cols=97 Identities=15% Similarity=0.069 Sum_probs=54.6 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCC-CCCccceeeeec Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTS-PQTTRGMKEIQV 79 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~-~~~~~~v~~ekV 79 (123) |.||+++||. +.| ++ ++..++.=.+..++.+||.|+|..+-.+.+.+... 
....+.+++++| T Consensus 26 ~~rAs~~ID~-iT~-~r---------------i~~~~~~~d~~~~~~~vk~A~c~qiey~~~~G~~sae~~~~~~~S~sv 88 (132) T protein:vir:98 26 LPKASAIIDG-VTG-HF---------------YQKVDMEKDNAWRVNQFKLALCAQIEYFDALGATTFEEINNSPQTFQA 88 (132) T ss_pred HHHHHHHHHH-Hhc-cc---------------ccCCCccccChHHHHHHHHHHHHHHHHHHhccchhhhhccCccceeee Confidence 8899999996 344 21 11111111123577889999999996655443322 234566999999 Q ss_pred CeeEEeecCCCCCC----c---chHHHHHHHhh--hcccccCCccceeeEeeC Q lcl|NC_019410. 80 DVIELKFDSEIQRG----S---MPDIVMSILEG--LGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 80 G~I~v~Y~~~~~~~----~---~~~~v~~lL~~--ll~~~~G~~~~~~~v~R~ 123 (123) |..+++|..+.+.. . ..+-+..+|.+ ||.. | |-|- T Consensus 89 G~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGLLyr--G-------V~~~ 132 (132) T protein:vir:98 89 GRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGLLFQ--G-------VKTW 132 (132) T ss_pred CcEEEEeeccCCcccccccccchHHHHHHHHhhcCCccc--c-------CCCC Confidence 99999997543211 1 12334445544 3321 1 1111 No 10 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=90.67 E-value=0.0033 Score=34.18 Aligned_cols=100 Identities=21% Similarity=0.294 Sum_probs=50.6 Q ss_pred Cchhhhhh-hh----hcccCCccCC-ccccccccc----CCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCC Q lcl|NC_019410. 1 MVRASKYL-DR----TIAWAGEKVD-EDSGLRWPR----AGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQT 70 (123) Q Consensus 1 Li~As~yl-d~----~~~~~G~r~~-~~Q~laWPR----~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~ 70 (123) +++-.+|- ++ .+.+.|.+-. 
.+-.-.||+ .-|.-..| -+.||.+|+...|++|-+++.++ T Consensus 79 ~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHG----y~evP~eiv~lv~d~A~~~~~np------ 148 (188) T protein:vir:78 79 WVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHG----YNPVPDELIDVAIRLAREYQSNP------ 148 (188) T ss_pred cccccccccccceeeecccccCcccccccccccccCcceEEEEEecC----CCcccHHHHHHHHHHHHHHhcCc------ Confidence 22222221 00 0112222210 122234663 22222233 25799999999999998876553 Q ss_pred ccceeeeecCeeEEeecCCCCCCcchHHHHHHHhhhcccccC Q lcl|NC_019410. 71 TRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 112 (123) Q Consensus 71 ~~~v~~ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G 112 (123) .....++||.+|++|+..+..+. -++=..+|+.+--...- T Consensus 149 -~~L~q~~vG~~S~tfa~~~~~sl-~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 149 -ELLVSKQVGEIERRFGSVAGTSL-SKADQAILDRYVIATLA 188 (188) T ss_pred -ccceeeecCceeeecccccCCcc-cchhHHhhccccccccC Confidence 33467999999999996433332 23334455554321111 No 11 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=90.67 E-value=0.0033 Score=34.18 Aligned_cols=100 Identities=21% Similarity=0.294 Sum_probs=50.6 Q ss_pred Cchhhhhh-hh----hcccCCccCC-ccccccccc----CCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCC Q lcl|NC_019410. 1 MVRASKYL-DR----TIAWAGEKVD-EDSGLRWPR----AGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQT 70 (123) Q Consensus 1 Li~As~yl-d~----~~~~~G~r~~-~~Q~laWPR----~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~ 70 (123) +++-.+|- ++ .+.+.|.+-. 
.+-.-.||+ .-|.-..| -+.||.+|+...|++|-+++.++ T Consensus 79 ~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHG----y~evP~eiv~lv~d~A~~~~~np------ 148 (188) T protein:vir:10 79 WVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHG----YNPVPDELIDVAIRLAREYQSNP------ 148 (188) T ss_pred cccccccccccceeeecccccCcccccccccccccCcceEEEEEecC----CCcccHHHHHHHHHHHHHHhcCc------ Confidence 22222221 00 0112222210 122234663 22222233 25799999999999998876553 Q ss_pred ccceeeeecCeeEEeecCCCCCCcchHHHHHHHhhhcccccC Q lcl|NC_019410. 71 TRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 112 (123) Q Consensus 71 ~~~v~~ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G 112 (123) .....++||.+|++|+..+..+. -++=..+|+.+--...- T Consensus 149 -~~L~q~~vG~~S~tfa~~~~~sl-~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 149 -ELLVSKQVGEIERRFGSVAGTSL-SKADQAILDRYVIATLA 188 (188) T ss_pred -ccceeeecCceeeecccccCCcc-cchhHHhhccccccccC Confidence 33467999999999996433332 23334455554321111 No 12 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=74.27 E-value=0.077 Score=26.68 Aligned_cols=87 Identities=15% Similarity=0.136 Sum_probs=49.0 Q ss_pred ccCCceeeCCeeecC-CCCcHHHHHHHHHHHHHHh-----------------------c----CCCC-CCCCccceeeee Q lcl|NC_019410. 28 PRAGVYDIDGFLIPS-DAIPQQLMEATAEMAAALM-----------------------N----NDWT-SPQTTRGMKEIQ 78 (123) Q Consensus 28 PR~g~~~~~G~~i~~-d~IP~~V~~A~~eLA~~~~-----------------------~----~~~~-~~~~~~~v~~ek 78 (123) =| ..+|. -.+|++|++|=+|+|-.++ + ++.. 
..+..+.+++-+ T Consensus 1 mR--------~l~P~f~~vpdevi~~wid~A~lFVC~~~fg~~~~~Al~lytlHLm~~dga~k~e~~~~~~~s~r~~s~s 72 (125) T protein:vir:10 1 MR--------TLYPPLKSQPDDVLNAWIEVAKLFICLDKFGDKQVQALAFYTLHLLSQDIALKTENDSSQTSSERVKSYS 72 (125) T ss_pred Cc--------cccchhhccCHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccccccccccccceeeee Confidence 12 12332 3479999999888872211 1 1111 123456788888 Q ss_pred c-CeeEEeecCCCCCCcc----hHHHHHHHhhhcccccCCcccee-eEeeC Q lcl|NC_019410. 79 V-DVIELKFDSEIQRGSM----PDIVMSILEGLGVVKTGTRPAFK-KIIRH 123 (123) Q Consensus 79 V-G~I~v~Y~~~~~~~~~----~~~v~~lL~~ll~~~~G~~~~~~-~v~R~ 123 (123) . |+++++|+..+....- -.-.-.|+.-|+. ..||+|+++ +..+. T Consensus 73 lsGE~Sit~~~~s~d~s~~~L~~T~wGk~~~~L~k-~~~GgFaL~T~~~~~ 122 (125) T protein:vir:10 73 LSGEYTISYDTSTAAASSSNLEESSWGKLYIDLMR-LKVGRWGLITSGGSR 122 (125) T ss_pred eccceEeecccccccccccccccCchHHHHHHHHH-hcCCceeeecccccc Confidence 4 9999999875543321 1123445556665 457788877 33333 No 13 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=68.90 E-value=0.07 Score=26.92 Aligned_cols=92 Identities=13% Similarity=0.079 Sum_probs=54.1 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeeecC Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVD 80 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ekVG 80 (123) |-+|+|-|++. -|.+ ++ .|.+|..|+.-+|..+..+++-+..... 
.-++..-| T Consensus 33 l~eAsdlI~g~-l~~~----------------------~v-p~~~p~~v~rVvA~ivarAltr~~~~~p---e~~S~TAg 85 (128) T protein:vir:25 33 LAEATDLVVGY-LHPY----------------------PV-PTPTPGPIKRVVASMVAAVLTRPTQILP---ETQSLTAD 85 (128) T ss_pred Hhcchheeeee-cCCC----------------------CC-CCCCCchHHHHHHHHHHHHhhCCCccCC---Cceeeecc Confidence 55555555542 2322 11 3678999999999999888766543322 23344679 Q ss_pred eeEEeecCCCCCCcch--HHHHHHHhhhcccccCCcccee-eEeeC Q lcl|NC_019410. 81 VIELKFDSEIQRGSMP--DIVMSILEGLGVVKTGTRPAFK-KIIRH 123 (123) Q Consensus 81 ~I~v~Y~~~~~~~~~~--~~v~~lL~~ll~~~~G~~~~~~-~v~R~ 123 (123) +.+.+|..+++++..| ..-..+|+|+=. | .+++. --.|- T Consensus 86 pfs~~ft~~~~~~g~yLTaa~k~~Lrp~R~---~-~~sV~l~sery 127 (128) T protein:vir:25 86 GFGVTFTPGGNSPGPYLSAALKQRLRPYRT---G-MVAVEMGSERY 127 (128) T ss_pred cccccccCCCCCCCceEcHHHHhhcccccc---e-eeEeecccccC Confidence 9998888877776543 555667888722 1 11111 11222 No 14 >protein:vir:8104 Length: 170 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817685;genbank:gi:29566116;genbank:GeneID:1259310 Probab=58.58 E-value=0.3 Score=23.45 Aligned_cols=103 Identities=10% Similarity=0.096 Sum_probs=54.3 Q ss_pred CchhhhhhhhhcccCCccCC-ccccccccc--CCce--eeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCcccee Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVD-EDSGLRWPR--AGVY--DIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMK 75 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~-~~Q~laWPR--~g~~--~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~ 75 (123) .++....-...|.|.++.-- .-..-.||| .++. -..| +..+++|..+..-.|.+|-++.. 
+......+ T Consensus 63 ~~~G~~l~~~~~~~~~~~glL~r~~G~~~~~~~~V~VT~tHG--y~~~~apd~~~~vi~~~a~r~~~-----s~~~~~l~ 135 (170) T protein:vir:81 63 VELGYALDVSTLDRSRRKGTLTKPYGRWTARDGAIVVTATHG--FTETEAADWRRAVVQLVGRRAQT-----SRPSADLK 135 (170) T ss_pred EECCeeecCccceeecCCceEEecCCccccccceEEEEEEeC--CCCCccchHHHHHHHHHHHHhhc-----cCCcccce Confidence 22222211112444333221 123336887 2211 0123 45568999998888888876543 23334578 Q ss_pred eeecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCcc Q lcl|NC_019410. 76 EIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRP 115 (123) Q Consensus 76 ~ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~ 115 (123) +.+++.++.+|.+.+.... +.-..+|+.+ .+|..+ T Consensus 136 ~~~~~~vs~~~~~~~~s~~--~~~~~iL~~Y---rl~~~p 170 (170) T protein:vir:81 136 RKKVDDVEYEWFETAVSVD--AELSAVFSPF---RILPSP 170 (170) T ss_pred eeeccceeeeecccccccC--HHHHHhhhhc---ccCCCC Confidence 8899999999975332222 3333466554 456666 No 15 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=45.27 E-value=0.5 Score=22.22 Aligned_cols=101 Identities=13% Similarity=0.030 Sum_probs=51.1 Q ss_pred Cchhhhhhhhhcc-cCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeeec Q lcl|NC_019410. 1 MVRASKYLDRTIA-WAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQV 79 (123) Q Consensus 1 Li~As~yld~~~~-~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ekV 79 (123) |-+|++-||...+ |.=. 
+ .++.=+...+=.+||.|.|.=...+-..+.....+...+++..| T Consensus 24 ~k~A~~~ID~~t~~~y~~-------------~----~~~~~~~~~r~~~vK~A~a~QieY~~~~G~~s~~~~~~~~S~sv 86 (130) T protein:vir:47 24 AKRAKIAIDLYTNGIYQK-------------D----IDFEKEIAYRKSAVKLAMAFQIAYLDASGIMSADDKQLANSVSI 86 (130) T ss_pred HHHHHHHHHHHhcccccc-------------c----CCccCcchHHHHHHHHHHHHHHHHHHHhccccchhccCcceeee Confidence 7777777776432 2100 0 01111223445677778777665555444444445777999999 Q ss_pred CeeEEeecCCCCCC--cchHHHHH---HHhhhcccccCCccceee Q lcl|NC_019410. 80 DVIELKFDSEIQRG--SMPDIVMS---ILEGLGVVKTGTRPAFKK 119 (123) Q Consensus 80 G~I~v~Y~~~~~~~--~~~~~v~~---lL~~ll~~~~G~~~~~~~ 119 (123) |--++.|...+.+. ..+..... +|.+-+-. +-+|=.+=| T Consensus 87 GrtSis~~~~~~~~~~~~~~vs~da~~~L~~tGL~-Ly~GV~yd~ 130 (130) T protein:vir:47 87 GRTSISYSTSQSTLAGQRFNLSMDAENALRQAGFS-LVVGVAYDR 130 (130) T ss_pred cceeeecCcCccccccCCccccHHHHHHHHhcccc-cccCCCccC Confidence 99999998744332 22322222 23222110 011223333 No 16 >protein:vir:3160 Length: 198 # NCBI annotation: unknown # Family: family:all:28414 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665931;genbank:gi:22091117;genbank:GeneID:951344 Probab=45.14 E-value=0.44 Score=22.55 Aligned_cols=103 Identities=15% Similarity=0.171 Sum_probs=43.2 Q ss_pred CchhhhhhhhhcccCCccCCc------------cc---cccccc--CCce--eeCCeeecCCCCcHHHHHHHHHHHHHHh Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDE------------DS---GLRWPR--AGVY--DIDGFLIPSDAIPQQLMEATAEMAAALM 61 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~------------~Q---~laWPR--~g~~--~~~G~~i~~d~IP~~V~~A~~eLA~~~~ 61 (123) +-=-+-.+|.. .-.|+..+. .+ -..||. +++. 
-.-|+ ..||..|++|+|+|+...+ T Consensus 77 ~sVsSV~iD~~-~~~g~~v~~~dy~l~~~~~~~~~G~~r~~~p~~~rnV~V~y~AGy----e~VPeDikeAVI~lv~~~~ 151 (198) T protein:vir:31 77 QDVVSVTIDTD-RAMGRDVDEDDYWVEETHLELKPGADRKSWPTDRRCITVEWEYGY----EEVPESPKKAIIRLVRARL 151 (198) T ss_pred ceeEEEEEecC-ccccccccchhhhhhhhhhhhcccccccccccccceEEEEeecCc----cccchHHHHHHHHHHHHHH Confidence 00011111110 000111100 00 012555 2321 11233 3599999999999996543 Q ss_pred cCCCCCCCCccceeeeecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccc Q lcl|NC_019410. 62 NNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPA 116 (123) Q Consensus 62 ~~~~~~~~~~~~v~~ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~ 116 (123) +. ...+++++++++-=++.|. +.+...+....=.++|-.++-.++-- T Consensus 152 ~e-----~~~~Gi~s~T~~gesvSy~---~~~e~~~~~~~~~~~~~~~~~~~~~~ 198 (198) T protein:vir:31 152 RA-----INAEGISSDTIMGDSISYD---PEDEVVLAARKDVAGFEAPSYYGGVE 198 (198) T ss_pred hh-----hhcccceeeeecCcceeec---CcccchhhhhhhhccccCcccccCCC Confidence 21 1223577788744455565 22223333444444554433222222 No 17 >protein:vir:9928 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795690;genbank:gi:28876458;genbank:GeneID:1258013 Probab=43.66 E-value=0.84 Score=21.01 Aligned_cols=88 Identities=17% Similarity=0.121 Sum_probs=51.2 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeeecC Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVD 80 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ekVG 80 (123) |-+|.+-|=. +.|. 
.+ ..-.+.||.++...++|+|+...+- ....+++++.++ T Consensus 31 i~~a~~~i~~---~l~~------------~~-------~~~~~eiP~~l~~iv~evav~ryNR-----~g~EG~~S~See 83 (118) T protein:vir:99 31 IYESKERVLA---KLNE------------YS-------ETEITKIPDRLRFIVRDVAIKRFNR-----INSEGAVEDSEE 83 (118) T ss_pred HHHHHHHHHH---Hhcc------------cc-------ccchhhhhHHHHHHHHHHHHHHhcC-----cCCcccceeecC Confidence 2222222211 1121 00 0012569999999999999876532 234578999999 Q ss_pred eeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 81 VIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 81 ~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) -.+++|.. -|+-.+..|.-+.......+.+.++++ T Consensus 84 G~S~sf~~------d~~ey~~~l~~~~~~~~~~~~g~v~Fi 118 (118) T protein:vir:99 84 GKTFKWDS------YLKEYESTLRSAAIGKVYSGKGVARFI 118 (118) T ss_pred Ceeeeecc------CchhHHHHHHHHhhhcccCcCcceeeC Confidence 99999953 133355555565544444555666677 No 18 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=42.77 E-value=0.17 Score=24.80 Aligned_cols=102 Identities=13% Similarity=0.055 Sum_probs=50.2 Q ss_pred CchhhhhhhhhcccCCccCCccccc---ccccCCceeeCCeeecCCCCcHHHH-HHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGL---RWPRAGVYDIDGFLIPSDAIPQQLM-EATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~l---aWPR~g~~~~~G~~i~~d~IP~~V~-~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) -+++..-+-. 
.-.|-..|.-|.+ ||= +.-+++-|++++ .||--||+.++.= ....|++ T Consensus 4 tv~~vkl~a~--~L~~~sDDsl~~~I~dA~~----------e~~a~gFp~~~~e~a~rYLa~HLat~------~~~~v~s 65 (111) T protein:vir:80 4 DVSKLKLTAS--SLASVSDDSLQVHIDDSYL----------EVQEKGFPEKFEERANRYLAAHLATL------ANKNVKS 65 (111) T ss_pred hHHHHHHhhH--hhcCCChHHHHHHHHHHHH----------HhhcCCCChhHHHHHHHHHHHHHHHh------cCCCCch Confidence 1111111111 1112222222211 121 122345556655 4666677765432 3667999 Q ss_pred eecCeeEEeecCCCCCC--cchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRG--SMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~--~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ||||.++-+|++.+.-. .+-+|-.-.+ .|+....|+|...+.|+ T Consensus 66 E~V~~Lk~~Y~~~~~~~~l~~s~wGq~Y~-rL~k~~~~gs~~~~vVv 111 (111) T protein:vir:80 66 EAVGSLKREYYEVKGDSGLLSTEYGQEYA-RLLKEANGGSGISMVVV 111 (111) T ss_pred hhhhhHHHHhhhcccccccccchhHHHHH-HHHHHhcCCccceeeeC Confidence 99999999999644322 2334433333 45555556777767777 No 19 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=41.21 E-value=0.75 Score=21.28 Aligned_cols=112 Identities=15% Similarity=0.018 Sum_probs=47.2 Q ss_pred CchhhhhhhhhcccCCccCCc-ccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCC------CCC-CCcc Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDE-DSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDW------TSP-QTTR 72 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~-~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~------~~~-~~~~ 72 (123) ++++.+-|.= .|.-.+.+. .+-..|=+-=+ .++.-.+...+|..-+++.++.-+. ... 
...+ T Consensus 9 ~ve~fR~l~P--eF~dvPde~i~~~~d~A~~~v--------~~~~~Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~ 78 (136) T protein:vir:10 9 VVEQMRKLVP--ALRKVPDETLYAWVEMAELFV--------CQKTFKDAYVKALALYALHLAFLDGALKGEDEDLESYSR 78 (136) T ss_pred HHHHHHHhcc--ccccCCHHHHHHHHHHHHHhh--------cCCCChhHHHHHHHHHHHHHHhccccccccccccccccc Confidence 2222222211 122111111 12222222111 1123344555555555554442111 111 1234 Q ss_pred ceee-eecCeeEEeecCCCCCCcc----hHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 73 GMKE-IQVDVIELKFDSEIQRGSM----PDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 73 ~v~~-ekVG~I~v~Y~~~~~~~~~----~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) .+++ ..+|+++|.|+..+..... -...=+++.-|+. ..|+||+++--.+. T Consensus 79 rv~ssat~GevSVS~a~~s~~~s~~WL~~TpyGq~y~aL~k-~~~gGf~l~t~~~~ 133 (136) T protein:vir:10 79 RVTSFSLSGEFSQTFGEVTKNQSGDMMLSTPWGKMFEQLKA-RRRGRFALMTGLRG 133 (136) T ss_pred ceehheeccceeEeeccccCchhhHhhhcCHHHHHHHHHHh-hcccchhhhhcccc Confidence 4554 6689999999864433321 1223345556666 46888888743332 No 20 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=39.49 E-value=0.83 Score=21.03 Aligned_cols=97 Identities=14% Similarity=0.088 Sum_probs=49.4 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCC--CCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeee Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSD--AIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQ 78 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d--~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ek 78 (123) |-+|++-||...++. .++.-|.+| .+=.+||.|.|.=...+...+.....+....++.. 
T Consensus 29 lk~As~~ID~~t~~~-------------------y~~~d~e~d~~~r~~~vKkA~a~QIeY~~~~G~ts~~d~~~~~s~s 89 (138) T protein:vir:98 29 EKRASHAVNLYCRNR-------------------YDYKDLKKEIALVQKAVKRAIAYQIAYLNDSGVMTAEDKQSFAGIS 89 (138) T ss_pred HHHHHHHhhhhhccc-------------------cccccccchhHHHHHHHHHHHHHHHHHHHHcCCcchhhccCcCceE Confidence 889999999854321 122222222 23345666666555444444444444567789999 Q ss_pred cCeeEEeecCCCCCC-------cch----HHHHHHHhhhcccccCCccceee Q lcl|NC_019410. 79 VDVIELKFDSEIQRG-------SMP----DIVMSILEGLGVVKTGTRPAFKK 119 (123) Q Consensus 79 VG~I~v~Y~~~~~~~-------~~~----~~v~~lL~~ll~~~~G~~~~~~~ 119 (123) ||-.++.|+....++ .++ .+.+ +|.+.+-.. +|=++=| T Consensus 90 vGrTSiS~~~~~~~~s~~~~~~~~~~~s~~A~~-~L~~tGLLY--~GV~yd~ 138 (138) T protein:vir:98 90 LGRTSISYTVGHGQGSQQKTLADRFNLCLDAEN-ELLVVGLGY--TGISYDR 138 (138) T ss_pred eeeeEeecccccccccccccccccccccHHHHH-HHhhcCccc--ccCcccC Confidence 999888885333221 111 2233 454432212 2223334 No 21 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=39.32 E-value=1 Score=20.53 Aligned_cols=106 Identities=10% Similarity=0.179 Sum_probs=52.7 Q ss_pred Cchhhhhhhhhccc-CCccCCcccccccccCCceeeCCeeecCCCCc---HHHHHHHHHHHHHHhcCCCCCC--CCccce Q lcl|NC_019410. 1 MVRASKYLDRTIAW-AGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP---QQLMEATAEMAAALMNNDWTSP--QTTRGM 74 (123) Q Consensus 1 Li~As~yld~~~~~-~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP---~~V~~A~~eLA~~~~~~~~~~~--~~~~~v 74 (123) |-+|++-||...++ .+. +=+.. .+.|+..=....|| .+||.|.|.=..++..-+.... 
...+.+ T Consensus 27 lk~A~~~ID~~T~y~~~~---------y~~~~-i~~d~~~d~~~~~~~r~~~vKkA~a~QIeY~~~~G~~sa~e~~~~~~ 96 (144) T protein:vir:79 27 LKSATVLINQICSYYDPA---------FAYHD-LEADSQADPDSYLFRQAMAFKKAVALEMLFLEDSGYSSAYDVAQGAL 96 (144) T ss_pred HHHHHHHhhhhhhhhccc---------ccccc-ccccccccchhhhhHHHHHHHHHHHHHHHHHHHcCCcchhhhhcCcc Confidence 88999999985432 111 00001 01111111122356 4467887776655444443332 246779 Q ss_pred eeeecCeeEEeecCCCCCC-c--ch---HHHHHHHhhhcccccCCcccee Q lcl|NC_019410. 75 KEIQVDVIELKFDSEIQRG-S--MP---DIVMSILEGLGVVKTGTRPAFK 118 (123) Q Consensus 75 ~~ekVG~I~v~Y~~~~~~~-~--~~---~~v~~lL~~ll~~~~G~~~~~~ 118 (123) ++..||-.+++|...+..+ . ++ +-+-..|.+.+-...|-+- + T Consensus 97 ~S~svGrtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLLYrGV~s--~ 144 (144) T protein:vir:79 97 NSFTVGHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLLFSGVAS--L 144 (144) T ss_pred ceeEecceEEeecCCCccccccccccccHHHHHHHhhcCcccccccc--C Confidence 9999999999997544321 1 12 3344445443322222211 1 No 22 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=38.90 E-value=0.26 Score=23.78 Aligned_cols=94 Identities=14% Similarity=0.168 Sum_probs=44.1 Q ss_pred hhhhcccCCccCCcccccccccCCceeeCCeeecC-CCCcHHHHHHHHHHHH---------------------HHhcCCC Q lcl|NC_019410. 8 LDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPS-DAIPQQLMEATAEMAA---------------------ALMNNDW 65 (123) Q Consensus 8 ld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~-d~IP~~V~~A~~eLA~---------------------~~~~~~~ 65 (123) +|- --++.=|+ .+|. ..+|++|+++=+|+|- .++.-|. 
T Consensus 1 ~~~------------~~~e~~R~--------l~P~f~kvpdevI~~wielA~lfVc~~~~g~~~~~AlaL~taHLm~~dg 60 (132) T protein:vir:10 1 MND------------AILAFMRS--------LVPALKAVDDESINVWIDLARLYVCADKFGNDADRAVGLYALHLMLSDG 60 (132) T ss_pred Cch------------HHHHHHHH--------hcchhhcCChHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHhhccc Confidence 000 00000011 1121 4567777777666662 2222221 Q ss_pred --CCCCCccceeeeec------CeeEEeecCCCCCCc---chHHHHHHHhhhcccccCCcccee-eEe-eC Q lcl|NC_019410. 66 --TSPQTTRGMKEIQV------DVIELKFDSEIQRGS---MPDIVMSILEGLGVVKTGTRPAFK-KII-RH 123 (123) Q Consensus 66 --~~~~~~~~v~~ekV------G~I~v~Y~~~~~~~~---~~~~v~~lL~~ll~~~~G~~~~~~-~v~-R~ 123 (123) -+.+.++..-+++| |+++++|++.+..+. .-||= .|+.-|+. ..||+|+++ +.. |- T Consensus 61 a~k~en~~~~t~S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~G-kl~~~L~k-~~~GgfgL~t~~~~~~ 129 (132) T protein:vir:10 61 AFKGENEGLETYSRRMASYSLSGEFSITYDNQSAIQGDLSSSSWG-RMYKALLR-KKGGGFGLITSAAGGG 129 (132) T ss_pred cccccccchhhhhhhhhhhcccCceeeecccccccccccccCcHH-HHHHHHHH-hccCccccccccCcCC Confidence 11222233345555 999999997553322 12443 67766666 447777776 333 33 No 23 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=32.27 E-value=1.4 Score=19.72 Aligned_cols=100 Identities=15% Similarity=0.068 Sum_probs=49.5 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceeeeecC Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVD 80 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ekVG 80 (123) |-.|+++|..+ +|+.+. 
+.+...-+++..+.-++.-+|+...+++..+.- ..+-.-.++..| T Consensus 29 L~~As~~ir~~---------------~p~~~~-~l~~~~~~~~~~~~~~~~V~~~~V~Ral~~~~~--~~G~tq~S~TaG 90 (131) T protein:vir:95 29 LEVVSHSLRVE---------------AKKVGK-DLDGLVATDPSFTMVVKSVTVDVVARTLMTSTD--QEPMTQVAESAL 90 (131) T ss_pred HHHHHHHHHHh---------------hhhccC-CccccccCCccchHHHHHHHHHHHHHHhcCCCC--CCCceeeeeecc Confidence 55555555443 354442 345555555667788999999999987744321 011223578899 Q ss_pred ee--EEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 81 VI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 81 ~I--~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) +. +.+|..++..-..-..-..+| |+- |...+-+.+-=. T Consensus 91 ~ys~S~t~~~p~g~lylt~~e~~~L-Gl~----~~r~~~i~~~~~ 130 (131) T protein:vir:95 91 GYSFSGSYLVPGGGLFIKDSELKRL-GLK----KQRYGVIDIYGT 130 (131) T ss_pred cceeeeeeecCCCCceeChHHHHHh-CCC----CCceeEEeeccC Confidence 88 455554332222223333344 221 222221222111 No 24 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=31.62 E-value=1.5 Score=19.64 Aligned_cols=108 Identities=12% Similarity=0.095 Sum_probs=41.4 Q ss_pred Cchhhhhhhhhc-ccCCccCCc-ccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHh--cCCCCC--CCCccce Q lcl|NC_019410. 1 MVRASKYLDRTI-AWAGEKVDE-DSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALM--NNDWTS--PQTTRGM 74 (123) Q Consensus 1 Li~As~yld~~~-~~~G~r~~~-~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~--~~~~~~--~~~~~~v 74 (123) |+...+|... | .|..-+.+. ++-++.=.. .+.++.-...-.++.+.++..++ .+.... 
....+.+ T Consensus 1 m~t~~~Fr~~-~PeF~~~pd~~i~~~l~~A~~--------~l~~~~~g~~~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v 71 (119) T protein:vir:52 1 MPLTEDFLLR-YTEFGKTDAKRIGLFLSDAQA--------EVSKVQWGKLYDRGVMALTAHLLKLSADAEISGGAANRNL 71 (119) T ss_pred CCcHHHHHHh-hhhccCCCHHHHHHHHHHHHH--------hhCCcCCchHHHHHHHHHHHHHHHhhhhhhccccccccce Confidence 7777666643 3 243211110 111111100 01112222333334444443322 111111 1223568 Q ss_pred eeeecCeeEEeecCCCCCCcchHHH-----HHHHhhhcccccCCccceee Q lcl|NC_019410. 75 KEIQVDVIELKFDSEIQRGSMPDIV-----MSILEGLGVVKTGTRPAFKK 119 (123) Q Consensus 75 ~~ekVG~I~v~Y~~~~~~~~~~~~v-----~~lL~~ll~~~~G~~~~~~~ 119 (123) ++.++|.++|.|+........-.|. =+.+.-|+... |.| +++- T Consensus 72 ~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~-g~G-g~Va 119 (119) T protein:vir:52 72 ASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLI-GVG-VMVA 119 (119) T ss_pred eeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHh-cCC-CcCC Confidence 9999999999998654432211111 11222232222 212 2122 No 25 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=31.24 E-value=1.5 Score=19.60 Aligned_cols=109 Identities=16% Similarity=0.120 Sum_probs=42.1 Q ss_pred Cchhhhhhhhhc-ccCCc-cC-------------CcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhc--- Q lcl|NC_019410. 1 MVRASKYLDRTI-AWAGE-KV-------------DEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMN--- 62 (123) Q Consensus 1 Li~As~yld~~~-~~~G~-r~-------------~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~--- 62 (123) ..--.+|... | .|.-. +. .-.....|+... .-.....++.+-++..++. 
T Consensus 5 ~fd~~~Fr~~-fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~~~~------------~~g~~~~~~l~Ll~AH~l~L~~ 71 (153) T protein:vir:99 5 VYNDGLFRIM-YPEFADQEKYPPEVIEIYYDTATLFITGSMFPCAA------------LSGKQLVGALNMLTAHLMSLSM 71 (153) T ss_pred cCChHHHHHh-cccccCccccCHHHHHHHHHHHHHhhcCccccccc------------cChHHHHHHHHHHHHHHHHHHh Confidence 1111111111 1 01000 00 000011122111 1134555666666543321 Q ss_pred ----CC-CCCCCCccceeeeecCeeEEeecCCCCCCcchHH---------HHHHHhhhcc--cccCCccceeeEeeC Q lcl|NC_019410. 63 ----ND-WTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDI---------VMSILEGLGV--VKTGTRPAFKKIIRH 123 (123) Q Consensus 63 ----~~-~~~~~~~~~v~~ekVG~I~v~Y~~~~~~~~~~~~---------v~~lL~~ll~--~~~G~~~~~~~v~R~ 123 (123) +. .......+.+++.++|.|+|.|+.+......-.| .=+|++.+.. ...|+.+- -.-+|. T Consensus 72 ~~~~~~~~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~fw~l~~~~~~Gg~v~gg~pe-~~~~r~ 147 (153) T protein:vir:99 72 QRSQTALGATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPYGQALWALLKMLSVGGFAIGGLPE-RTGFRK 147 (153) T ss_pred hhhcccccCCCccccceeeeeecceeeeeecCCCCCchhHhhhcCHHHHHHHHHHHHhcccccccCCCCc-cccccc Confidence 11 1112234668999999999999865543321112 2223333322 22344443 233444 No 26 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=30.15 E-value=1.6 Score=19.47 Aligned_cols=101 Identities=13% Similarity=0.134 Sum_probs=55.4 Q ss_pred hhhhhhhcccCCccCCc-c-----------cccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCcc Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDE-D-----------SGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTR 72 (123) Q Consensus 5 s~yld~~~~~~G~r~~~-~-----------Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~ 72 (123) |..|+..=...|-..+. + |.+-. +.|. ++ +-.+.||.++..-++|.|+...+- .... 
T Consensus 1 M~~L~~vK~~lgi~d~~~D~lL~~iI~~a~~~i~~-~l~~---~~--~~~~~iP~~l~~Iv~evavkryNR-----~g~E 69 (113) T protein:vir:94 1 MALLDSIKLRIGIEDTKQDDLLTDIISDVQARVLA-YVNQ---DG--LVQSELPNGLDFVIKDVTIRIYNK-----IGDE 69 (113) T ss_pred CchHHHHHHHhCCCCCchhhHHHHHHHHHHHHHHH-HhCC---cc--chhhhhhhHHHHHHHHHHHHHhcc-----cCCc Confidence 44444310123321100 0 00000 0110 11 123679999999999999976543 3456 Q ss_pred ceeeeecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 73 GMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 73 ~v~~ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) +++++.++-.+++|..... |.-.+.+|..+.....+++.+ +|++ T Consensus 70 G~~S~SeeG~S~sf~~~~d----f~~y~~~l~~~~~~~~~~~~g-~rF~ 113 (113) T protein:vir:94 70 GKESSSEGNVSNTWDTPAD----LSEYSDVLDVYRKSYKRRSAG-MRFI 113 (113) T ss_pred cceeeecCceeeeecCccc----hhhHHHHHHHHHhhccCCCCC-ceeC Confidence 7999999999999976332 444555565665544455555 3666 No 27 >protein:vir:8430 Length: 189 # NCBI annotation: gp25 # Family: family:all:3238 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818326;genbank:gi:29566762;genbank:GeneID:1260021 Probab=30.03 E-value=1.6 Score=19.45 Aligned_cols=105 Identities=13% Similarity=0.150 Sum_probs=50.4 Q ss_pred Cchhhhh-hhhhcccCCccCC--cccccccccCCcee--eCCeeecCCCCc-HHHHHHHHHHHHHHhcCCCCCCCCccce Q lcl|NC_019410. 1 MVRASKY-LDRTIAWAGEKVD--EDSGLRWPRAGVYD--IDGFLIPSDAIP-QQLMEATAEMAAALMNNDWTSPQTTRGM 74 (123) Q Consensus 1 Li~As~y-ld~~~~~~G~r~~--~~Q~laWPR~g~~~--~~G~~i~~d~IP-~~V~~A~~eLA~~~~~~~~~~~~~~~~v 74 (123) ...++.+ +|. ..|.-++.+ .-+.-.|||-.+.- ..| + +..| .++..+.|++|-++. -.. ..-..+-. 
T Consensus 76 t~dG~~~~~~~-v~~~~~~~Gll~r~~Gw~~~g~I~VT~tHG--y--~~~pa~di~~vv~~mA~rA~-~~~-~~~~~g~~ 148 (189) T protein:vir:84 76 TEDGEEVDLDE-VYFVSREPGVLYKKCGWWCRGPIEVTLTHG--F--TAEEAGDFREVVLQAVDVAN-LMV-GTGATGPI 148 (189) T ss_pred eecCeeccccc-ceeccCCcceeEeCCCcccCCeEEEEEEcC--C--CCCCchhHHHHHHHHHHhhh-ccC-CCcccccc Confidence 2222222 111 124323222 12344678644321 123 2 3456 489999999996541 000 11134558 Q ss_pred eeeecCeeEEeecCCCC---CCcchHHHHHHHhhhcccccCCcccee Q lcl|NC_019410. 75 KEIQVDVIELKFDSEIQ---RGSMPDIVMSILEGLGVVKTGTRPAFK 118 (123) Q Consensus 75 ~~ekVG~I~v~Y~~~~~---~~~~~~~v~~lL~~ll~~~~G~~~~~~ 118 (123) ++.|||.|+..|++-.. +-..-+..+++|..|--. ++. T Consensus 149 ~~~~v~dv~~r~~~~~~~~~~~~~~~~l~~~~~~~~~~------~~~ 189 (189) T protein:vir:84 149 TGLEVDDVNMRWSGLVDRSWGIAKNPMLESVLYQYRLV------AIA 189 (189) T ss_pred cceeecceeeehhhhcccccccccchHHHHHHhhhhhh------ccC Confidence 99999999999986222 122235556655443111 111 No 28 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=29.33 E-value=1.7 Score=19.36 Aligned_cols=102 Identities=14% Similarity=0.082 Sum_probs=43.1 Q ss_pred CchhhhhhhhhcccCCccCCcccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccc--eeeee Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRG--MKEIQ 78 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~--v~~ek 78 (123) |-.|+++|.. .+|+.|. +.+-.....+.-+.-++.-+|....+++.. +.+..+ -.++. 
T Consensus 29 L~dAS~~iR~---------------~~p~~g~-~~~~~~~~~~~~~~~~k~V~~~mV~Ral~~----~~d~~G~tq~S~T 88 (140) T protein:vir:97 29 LKVVSDTLRM---------------EADKVGK-DLDKTMVDKPYFVNVIKSVTVDIVARTLMT----STQGEPMSQESQS 88 (140) T ss_pred HHHHHHHHHH---------------hhhhccC-CcchhcccCccchhHHHHHHHHHHHHHhcC----CCCCCcceeeeee Confidence 4455555543 3455442 122111122233455667788877775422 112223 34678 Q ss_pred cCee--EEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 79 VDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 79 VG~I--~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) .|+. +.+|..++..-..-+.-..+| |+.....|.=.-.-...|+ T Consensus 89 aG~ys~S~T~~np~G~lylt~~e~~~L-Gl~~~r~~~i~~~g~~~~~ 134 (140) T protein:vir:97 89 ALGYTWSGTYLVPGGGLFIKDNELKRL-GLKKQRYGGIELYGEIKRD 134 (140) T ss_pred ccchhheeeeecCCCCceeChHHHHHh-CCCCCceeeecccCccccC Confidence 8988 566654433323323333344 2221111111111133343 No 29 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=28.72 E-value=1.2 Score=20.13 Aligned_cols=111 Identities=10% Similarity=0.057 Sum_probs=45.5 Q ss_pred CchhhhhhhhhcccCCccCCc-ccccccccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcC------CCCCC-CCcc Q lcl|NC_019410. 1 MVRASKYLDRTIAWAGEKVDE-DSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNN------DWTSP-QTTR 72 (123) Q Consensus 1 Li~As~yld~~~~~~G~r~~~-~Q~laWPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~------~~~~~-~~~~ 72 (123) +++...-+-= .|.-.+.+. .|-..+=+. -+..+........|..-+++.++.- +.... 
...+ T Consensus 6 ~ve~Fr~l~P--eF~~vpde~l~~~~~~A~~--------~i~~~~~g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~g 75 (134) T protein:vir:79 6 ILEQIYKIAP--AFKKVDPELIQAWIELAKD--------FVCEKHFKDKYFRAVALYTLHLMTLDGAMKQESESVESYSH 75 (134) T ss_pred HHHHHHHhcc--ccccCCHHHHHHHHHHhhh--------hhcCCCCChHHHHHHHHHHHHHHhhcccccccccccccccc Confidence 1111111100 122111110 111111111 1112334455666666666554421 21111 1233 Q ss_pred ceee-eecCeeEEeecCCCCCCcc-----hHHHHHHHhhhcccccCCccceeeEeeC Q lcl|NC_019410. 73 GMKE-IQVDVIELKFDSEIQRGSM-----PDIVMSILEGLGVVKTGTRPAFKKIIRH 123 (123) Q Consensus 73 ~v~~-ekVG~I~v~Y~~~~~~~~~-----~~~v~~lL~~ll~~~~G~~~~~~~v~R~ 123 (123) +|.+ ...|+++|.|+..+..+.. -|| =+|..-|... .++||+.+--.|. T Consensus 76 rv~ssst~G~vSvS~a~ps~~~~~~Wl~~TpY-Gq~y~~L~k~-~~GGf~~~t~~~~ 130 (134) T protein:vir:79 76 RIASFSLTGEFSQTFSKVSDDTSGNTLRQTPW-GKMYEVLNKK-KGGGFGLTTAFHR 130 (134) T ss_pred hhhhhhhhcceeeeccCcccchhHHHHhcCHH-HHHHHHHHHh-hccchHhhhhccc Confidence 4554 5589999999864433321 122 2455555553 4677776644444 No 30 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=26.66 E-value=1.9 Score=19.03 Aligned_cols=91 Identities=16% Similarity=0.065 Sum_probs=43.3 Q ss_pred Cchhhhhhhh----------hccc--CCcc--CCcccccccccCCceee---CCeeecCCCCcHHHHHHHHHHHHHHhcC Q lcl|NC_019410. 1 MVRASKYLDR----------TIAW--AGEK--VDEDSGLRWPRAGVYDI---DGFLIPSDAIPQQLMEATAEMAAALMNN 63 (123) Q Consensus 1 Li~As~yld~----------~~~~--~G~r--~~~~Q~laWPR~g~~~~---~G~~i~~d~IP~~V~~A~~eLA~~~~~~ 63 (123) =+.+..|+|. .|.. 
.|.+ .-..-...||+.+...+ .|+ .+.+|..+|+|...++..+.++ T Consensus 81 sV~sV~~~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~---~~~vP~~ik~aill~va~~Ye~ 157 (188) T protein:vir:80 81 RVDSIEIRDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGV---DLARYPSVRTWMLLAAAWAYDH 157 (188) T ss_pred eeeEEEEEcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecc---cccChHHHHHHHHHHHHHHHhc Confidence 1112223221 1111 1211 11222356887664322 343 2569999999999999766543 Q ss_pred CCCCCCCccceeeeecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCcc Q lcl|NC_019410. 64 DWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRP 115 (123) Q Consensus 64 ~~~~~~~~~~v~~ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~ 115 (123) -..-. +| .+.+..-+.++++||+++=.+. || T Consensus 158 Re~~~----------~g--------~~~~~~P~~~v~~Ll~pyRvp~---~~ 188 (188) T protein:vir:80 158 RELFS----------EG--------QPIGEMPGGYADVLLNPITVPP---RF 188 (188) T ss_pred ccccc----------cc--------cccccccHHHHHHHhhccCCCC---CC Confidence 21100 00 0111122355899999996642 23 No 31 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:10 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:10 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 No 32 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:96 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:96 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 No 33 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:93 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:93 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 No 34 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:97 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:97 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 No 35 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:99 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:99 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 No 36 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:78 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:78 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 No 37 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=53.3 Q ss_pred hhhhhhhcccCCccCCcccc--cc------cccCCceeeCCeeecCCCCcHHHHHHHHHHHHHHhcCCCCCCCCccceee Q lcl|NC_019410. 5 SKYLDRTIAWAGEKVDEDSG--LR------WPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKE 76 (123) Q Consensus 5 s~yld~~~~~~G~r~~~~Q~--la------WPR~g~~~~~G~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~v~~ 76 (123) |..|+..=.-.|-..+ .|- |. ==|.-.+ . 
| +-.+.||.++...+.|+|+...+- ....++++ T Consensus 1 M~~L~~vK~~lgI~d~-~~D~lL~~ii~~a~~~i~~~-l-~--~~~~~iP~~l~~iv~ev~vkryNR-----~g~EG~~S 70 (110) T protein:vir:96 1 MTTLADVKKRIGLKDE-KQDEQLEEIIKSCESQLLSM-L-P--IEVEQIPERFSYMIKEVAVKRYNR-----IGAEGMTS 70 (110) T ss_pred CchHHHHHHHhCCCCC-chhHHHHHHHHHHHHHHHHH-h-c--cchhhhhhHHHHHHHHHHHHHhcc-----cCccccce Confidence 4444431112332111 000 00 0000000 0 0 113679999999999999876543 23457999 Q ss_pred eecCeeEEeecCCCCCCcchHHHHHHHhhhcccccCCccceeeEe Q lcl|NC_019410. 77 IQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 121 (123) Q Consensus 77 ekVG~I~v~Y~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~~v~ 121 (123) ++++-.+++|.++ -|.-+...|.-+.......+.+.+|+. T Consensus 71 ~S~eG~S~sf~d~-----d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:96 71 EAVDGRSNAYELN-----DFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eecCceeeeeccc-----ccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999652 233344445455443444455656666 Done!