Query lcl|NC_018275.1_cdsid_YP_006560558.1 [gene=B606_gp08] [protein=putative structural protein] [protein_id=YP_006560558.1] [location=7438..8823] Match_columns 461 No_of_seqs 36 out of 46 Neff 4.9 Searched_HMMs 1612 Date Thu Nov 7 13:27:01 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:9268 Length: 472 # 100.0 4E-228 2E-231 1267.3 48.4 461 1-461 12-472 (472) 2 protein:vir:100960 Length: 472 100.0 6E-228 4E-231 1266.4 48.3 461 1-461 12-472 (472) 3 protein:vir:2109 Length: 472 # 100.0 4E-226 3E-229 1256.1 48.1 461 1-461 12-472 (472) 4 protein:vir:105428 Length: 472 100.0 2E-222 1E-225 1236.3 47.4 460 1-461 12-472 (472) 5 protein:vir:177 Length: 472 # 100.0 3E-222 2E-225 1234.8 48.0 460 1-461 12-472 (472) 6 protein:vir:3529 Length: 477 # 100.0 1E-217 7E-221 1210.0 48.4 459 1-461 18-477 (477) 7 protein:vir:105525 Length: 472 100.0 8E-215 5E-218 1194.2 47.5 458 1-461 12-472 (472) 8 protein:vir:108312 Length: 458 100.0 4E-185 2E-188 1031.8 42.8 434 1-461 8-458 (458) 9 protein:vir:8837 Length: 513 # 99.9 4.1E-25 2.5E-28 154.3 35.5 428 1-461 9-507 (513) 10 protein:vir:95475 Length: 771 99.0 5.8E-09 3.6E-12 65.7 25.4 413 1-461 230-766 (771) 11 protein:vir:3133 Length: 911 # 98.9 5.7E-09 3.5E-12 65.8 25.0 416 1-461 215-732 (911) 12 protein:vir:352 Length: 536 # 98.3 1.7E-06 1.1E-09 52.2 27.8 420 1-461 9-531 (536) 13 protein:vir:2625 Length: 715 # 97.9 1E-05 6.4E-09 47.9 27.3 441 1-461 101-710 (715) 14 protein:vir:1778 Length: 680 # 94.4 0.0046 2.9E-06 33.4 20.2 304 1-327 319-680 (680) 15 protein:vir:102644 Length: 594 93.9 0.0061 3.8E-06 32.7 34.9 429 1-461 18-590 (594) 16 protein:vir:2203 Length: 794 # 81.4 0.086 5.3E-05 26.4 35.4 426 1-461 209-739 (794) 17 protein:vir:7329 Length: 825 # 78.7 0.11 6.9E-05 25.8 32.0 405 1-461 193-681 (825) 18 protein:vir:105647 Length: 800 78.4 0.11 7.1E-05 25.7 32.8 435 1-461 205-753 (800) 19 protein:vir:95324 Length: 823 71.1 0.2 0.00012 24.4 31.7 405 1-461 193-665 (823) 20 protein:vir:80253 Length: 777 63.6 0.31 0.00019 23.3 31.7 417 1-461 195-693 (777) 21 protein:vir:103790 Length: 768 59.6 0.39 0.00024 22.8 34.4 435 1-461 199-763 (768) 22 protein:vir:107802 Length: 681 59.3 0.39 0.00024 22.8 31.9 420 1-461 136-678 (681) 23 protein:vir:107423 Length: 681 59.3 0.39 0.00024 22.8 31.9 420 1-461 136-678 (681) 24 protein:vir:98487 Length: 681 59.3 0.39 0.00024 22.8 31.9 420 1-461 136-678 (681) 25 protein:vir:78703 Length: 905 55.8 0.47 0.00029 22.4 31.5 427 1-461 325-846 (905) 26 protein:vir:827 Length: 567 # 52.9 0.54 0.00034 22.0 30.0 384 1-461 145-563 (567) 27 protein:vir:3306 Length: 567 # 49.2 0.65 0.0004 21.6 29.8 384 1-461 145-563 (567) 28 protein:vir:10145 Length: 567 49.2 0.65 0.0004 21.6 29.8 384 1-461 145-563 (567) 29 protein:vir:9979 Length: 567 # 49.2 0.65 0.0004 21.6 29.8 384 1-461 145-563 (567) 30 protein:vir:2792 Length: 567 # 49.2 0.65 0.0004 21.6 29.8 384 1-461 145-563 (567) 31 protein:vir:100022 Length: 976 48.4 0.67 0.00042 21.5 33.1 428 1-461 380-965 (976) 32 protein:vir:8887 Length: 808 # 43.7 0.83 0.00052 21.0 37.6 434 1-461 159-798 (808) 33 protein:vir:7021 Length: 803 # 37.4 1.1 0.00069 20.3 35.4 438 1-461 207-753 (803) 34 protein:vir:100244 Length: 109 25.0 1.1 0.00068 20.4 3.6 42 417-461 1-42 (109) 35 protein:vir:5977 Length: 109 # 21.2 1.5 0.00096 19.5 3.6 39 419-461 1-39 (109) 36 protein:vir:193 Length: 112 # 20.5 1.7 0.001 19.3 3.7 40 417-461 1-40 (112) 37 protein:vir:100134 Length: 109 20.3 1.7 0.001 19.4 3.6 42 417-461 1-42 (109) No 1 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=100.00 E-value=4e-228 Score=1267.26 Aligned_cols=461 Identities=97% Similarity=1.479 Sum_probs=458.8 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) |+|+.++|||+|+|||||||+||+++||+++||+|||++++++++|++||++||+++++||||+|++||||..++|+|+| T Consensus 12 ~~~~~~~a~~~~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~a~~~G~~RG~~~~~~~~~ly~V~G~~Ly~v~~~iG~i~g 91 (472) T protein:vir:92 12 MGKDFKNADYIDYLPINMLATPKEVLDSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVCGGKLYKGEAVVGDVAG 91 (472) T ss_pred ccccCccCcceeeeecccccccccccccccceeecccceeecCCCCcccceeeeeeCCeEEEEeCcceEEEEeeEeeccC Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+++|+|++++++++|+||++++++++||+|++||++||++++||||+||||||++||+++||||+|+|+++ T Consensus 92 sgrVsMa~n~~~~av~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~ 171 (472) T protein:vir:92 92 SGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 171 (472) T ss_pred cccEEEecCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEecceEEEccCCCceEEEeccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceEEeccccchhhhccCceEEEEe Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 240 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg 240 (461) |++|++||+||++||+||+++++|++|||||++|||||+|||++|..+|||+|+||+|||+||||++|||+++||+|||| T Consensus 172 ~~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~ 251 (472) T protein:vir:92 172 PDRYSAEYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 251 (472) T ss_pred cccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEe Confidence 99999999999999999999999999999999999999999999988999999999999999999999999999999999 Q ss_pred eccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeeeecC Q lcl|NC_018275. 241 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 320 (461) Q Consensus 241 ~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~~~t 320 (461) ||++|+++||+++||||||||||+||++|++|++||+++|++|+||||||+||+||||+||||||++|+|||||||++++ T Consensus 252 ~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~~e~~~a~~~s~~~eGH~fy~LtfP~~Tw~yD~at~~~~e~W~~~~s 331 (472) T protein:vir:92 252 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 331 (472) T ss_pred cCCCcccEEEEccCceeEEecCHHHHHHHHhcCcchhceeeEEEEEecCeeEEEEEcCCceEEEEcccCcCCceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchhhee Q lcl|NC_018275. 321 GLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 400 (461) Q Consensus 321 g~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~l 400 (461) |++++|||++|+|++||||||||++||+||+||+|.++|+|+|++|++++|++|+||+|+|++|||+++||+|++|+||| T Consensus 332 g~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~l~~~~~t~~~~~~~~~~~~P~~~~dn~R~~d~eve~~~Gv~q~~d~v~L 411 (472) T protein:vir:92 332 GLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESSTGVAQYADRLFL 411 (472) T ss_pred CCcccceeEEEEEeeCCeEEEEEcCCCeEEEEeccccccCCCcceEEEEeceEecCCCEEEEEeeeccCCCCCcCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 401 SATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 401 s~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) ||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|+||+|||| T Consensus 412 ~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:92 412 SATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred EeeccccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=100.00 E-value=5.8e-228 Score=1266.36 Aligned_cols=461 Identities=96% Similarity=1.468 Sum_probs=458.7 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) |+|+.++|||+|+|||||||+||+++||+++||+|||++++++++|++||++||+++++||||+|++||||..++|+|+| T Consensus 12 ~~~~~~~a~~~~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~a~~~G~~RG~~~~~~~~~ly~V~G~~Ly~v~~~iG~i~g 91 (472) T protein:vir:10 12 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVCGGKLYKGEAVVGDVAG 91 (472) T ss_pred cccCCCcCcceeeeeeccccccccccccccceeecccceeecCCCCcccceeeeeeCCeEEEEeCcceEEEEeeEeeccC Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+++|+||+++++++|+||++++++++||+|++||++||++++||||+||||||++||+++||||+|+|+++ T Consensus 92 sgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~ 171 (472) T protein:vir:10 92 SGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 171 (472) T ss_pred cccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEecceEEEccCCCceEEEeccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceEEeccccchhhhccCceEEEEe Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 240 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg 240 (461) |++|++||+||++||+||+++++|++|||||++|||||+|||++|..+|||+|+||+|||+||||++|||+++||+|||| T Consensus 172 ~~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~ 251 (472) T protein:vir:10 172 PDRYSAEYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 251 (472) T ss_pred cccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEe Confidence 99999999999999999999999999999999999999999999988999999999999999999999999999999999 Q ss_pred eccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeeeecC Q lcl|NC_018275. 241 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 320 (461) Q Consensus 241 ~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~~~t 320 (461) ||++|+++||+++||||||||||+||++|++|+++|+++|+||||+||||+||+||||+||||||++|+|||||||++++ T Consensus 252 ~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~~e~~~A~~~t~~~~GH~fy~LtfP~~Tw~yD~at~~w~erw~~~~~ 331 (472) T protein:vir:10 252 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTAEELATGVMETLRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 331 (472) T ss_pred cCCCcccEEEEccCceeEEecCHHHHHHHHhcCCccccceEEEEEEeCCeEEEEEEcCCeeEEEEcccCcccceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchhhee Q lcl|NC_018275. 321 GLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 400 (461) Q Consensus 321 g~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~l 400 (461) |++++|||++|+|++|||+||||++||+||+||+|+++|+|+|++|++++|++|+||+|+|++|||+++||+|++|+||| T Consensus 332 g~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~ld~~~~t~~g~~~~~~~~~p~l~~dn~R~~d~eve~~~Gv~~~~d~v~L 411 (472) T protein:vir:10 332 GLYDDVYRAVDFMYEGNQITCGDKSEALTGQLQFDISSQYGLQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 411 (472) T ss_pred CCcccceeEEEEEeeCCeEEEEEcCCCeEEEEecccCCCCCCcccceEEcccccCCCCEEEEEeeeccCCCCCcCcEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 401 SATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 401 s~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) ||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|+||+|||| T Consensus 412 ~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:10 412 SATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred EeeccccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=100.00 E-value=4.4e-226 Score=1256.05 Aligned_cols=461 Identities=96% Similarity=1.468 Sum_probs=458.4 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) |+||.+++||+|+|||||||+||++++|++|||+|||++++++++|++||++||++++.||+|||++||+|.+++|+|+| T Consensus 12 ~~~~~~~~d~~~~~pVN~~a~~~~~~~s~~~lr~tPG~~~~~~~~g~~RG~~~~t~~~~ly~V~G~~LY~v~~~~G~i~g 91 (472) T protein:vir:21 12 MGKDFKNADYIDYLPVNMLATPKEILNSSGYLRSFPGITKRYDMNGVSRGVEYNTAQNAVYRVCGGKLYKGESEVGDVAG 91 (472) T ss_pred ccccccccceeeeeeeeeeeeccCCcccceeeeecCCcceeccCCCceeeeeecccCCeEEEEeCCceEEEeeeeeeecc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+++|+||+++++++|+||++++++++||+|++||++||++++||||+||||||++||+++||||+|+|+++ T Consensus 92 sgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~f~is~l~d~~~ 171 (472) T protein:vir:21 92 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 171 (472) T ss_pred cccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEecceEEEccCCcceeEEecCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceEEeccccchhhhccCceEEEEe Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 240 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg 240 (461) |++|++|||||++||+||+++++|++|||||++|||||+|||++|..+|||+|+||+|||+||||++|||+++||+|||| T Consensus 172 ~~~y~~FatAE~~pD~Iv~i~~~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~ 251 (472) T protein:vir:21 172 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 251 (472) T ss_pred ccCCccceeeccCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEe Confidence 99999999999999999999999999999999999999999999988999999999999999999999999999999999 Q ss_pred eccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeeeecC Q lcl|NC_018275. 241 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 320 (461) Q Consensus 241 ~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~~~t 320 (461) ||++|+++||+++||||||||||+||++|++|+++|+++|+||||+||||+||+||||+||||||++|+|||||||++++ T Consensus 252 ~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~~e~~~A~~~t~~~eGH~fy~LtfP~~Tw~yD~at~~~~e~W~~~~s 331 (472) T protein:vir:21 252 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTAEEMATGVMETLRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 331 (472) T ss_pred cCCCcccEEEEccCceeEEecCHHHHHHHHhcCCccccceEEEEEEeCCeEEEEEEcCCeeEEEEcccCccCceeeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchhhee Q lcl|NC_018275. 321 GLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 400 (461) Q Consensus 321 g~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~l 400 (461) |++++|||++++|++||||||||++||+||+|+++..+++++|+|+++++|++++||+|+|++|||+++||+|++|+||| T Consensus 332 g~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~L~fd~~~~~d~~~~~~r~~p~~~~dn~R~fd~eve~~~Gv~q~~d~v~L 411 (472) T protein:vir:21 332 GLYDDVYRGVDFMYEGNQITCGDKSEAVVGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 411 (472) T ss_pred CCCcCceeEEEEEeeCCeEEEEEcCCCeEEEEEecccccCCCcCcEEEEccceeCCCCEEEEEeeeccCCCCCcCcEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 401 SATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 401 s~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) ||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|+||+|||| T Consensus 412 ~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:21 412 SATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred EeeccccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=100.00 E-value=1.7e-222 Score=1236.34 Aligned_cols=460 Identities=92% Similarity=1.423 Sum_probs=453.8 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) ++||.+++||+|+|||||||+||+.++|+++||+||||+++++|+|++||++||++|+.||+|||++||+|+++||+|+| T Consensus 12 ~~~~~~~~d~~~~~pVN~~a~~~~~~~s~~~l~~tPGl~~~a~v~G~~RG~~~~~~~g~lY~V~G~~LY~v~~~iGsiag 91 (472) T protein:vir:10 12 VGKDFRNADYIDYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGKLYKGESEVGDVAG 91 (472) T ss_pred ceeeccccchhheeeeeeeeeccCCCcccceeecCCCceeeccCCccccceEEEeeCCeEEEEecceEeeeecceecccC Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+++|+|++++++++|+||+++++..+||+|+.||++||++++||||+||||||++||+++||||+|+|+++ T Consensus 92 ~grVsMa~n~~~~av~~~g~~~~Y~yd~~v~t~~~~~~d~~~p~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~ 171 (472) T protein:vir:10 92 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 171 (472) T ss_pred cccEEEecCCcEEEEEECCceeEEEeeccchhhhccccccccccccccceeeeeeecceEEEeccCcceEEEeccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceEEeccccchhhhccCceEEEEe Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 240 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg 240 (461) |++|.+||+||++||+||+++++|++|||||++|||||+|||++|+.+|||+|+||+|||+||||++|||+++||+|||| T Consensus 172 ~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~ 251 (472) T protein:vir:10 172 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 251 (472) T ss_pred cccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcccCceeecccceeeecccCcchhhecCceEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeeeecC Q lcl|NC_018275. 241 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 320 (461) Q Consensus 241 ~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~~~t 320 (461) ||++|+++||+++||||||||||+||++|++|++||+++|+||+|+||||+||+||||+||||||++|+||||||+++++ T Consensus 252 ~d~~g~~~V~~~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~~Tw~yD~~t~~Wherw~~~~~ 331 (472) T protein:vir:10 252 NPATGAPSVYIIGSGQVSPIASASIEKILRSYTADELADGVMESLRFDAHELLIIHLPRHVLVYDASSSANGPQWCVLKT 331 (472) T ss_pred cCCccccEEEEccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCCceeEeecccccCceeeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchhhe- Q lcl|NC_018275. 321 GLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLF- 399 (461) Q Consensus 321 g~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~- 399 (461) |++++|||++|+|+++||++|||++||+||+||+|+++|+|+|++|++++|++|+||+|+|++|||+++|++|.+++++ T Consensus 332 g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~l~~~~~td~G~~i~~~~~~p~~~~d~~Rv~d~~ve~~~G~~~~adp~~~ 411 (472) T protein:vir:10 332 GLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYGLQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 411 (472) T ss_pred CCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCcCCCcceEEEeccceeCCCCeEEEEEEEeecCCCcccCceEE Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999877755 Q ss_pred eeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 400 LSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 400 ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |.||| |.+||+|+|+++|++|||++|++||||||||+||||||||++|+||+|+||||||| T Consensus 412 ~~~sD-g~~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~vgf~~r~~~~~~v~l~ga~~~~e 472 (472) T protein:vir:10 412 SATTD-GINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKSPVTLSGAQIRIE 472 (472) T ss_pred EeccC-CcccchhhhhhhccCcccccceeeeeeeeccccceEEEEEEeccccceeeeeEEeC Confidence 55555 99999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=100.00 E-value=3.4e-222 Score=1234.77 Aligned_cols=460 Identities=92% Similarity=1.432 Sum_probs=454.1 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) ++||.+++||+|+|||||||+||+.++|+++||+||||+++++|+|++||++||++|+.||+|||++||+|+++||+|+| T Consensus 12 ~~~~~~~~d~~~~~pVN~~a~~~~~~~s~~~l~~tPGl~~~a~v~G~~RG~~~~~~~g~lY~V~G~~LY~v~~~iGsiag 91 (472) T protein:vir:17 12 VGKDFRNADYIDYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGKLYKGESEVGDVAG 91 (472) T ss_pred ceeeccccchhheeeeeeeeeccCCCcccceeecCCCceeeccCCccccceEEEeeCCeEEEEecceEeeeecceecccC Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+++|+|++++++++|+||+++++..+||+|+.||++||++++||||+||||||++||+++||||+|+|+++ T Consensus 92 ~grVsMa~n~~~~av~~~g~~~~Y~y~~~v~t~~~~~~d~~~~~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~ 171 (472) T protein:vir:17 92 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 171 (472) T ss_pred cccEEEecCCcEEEEEECCceeEEEeeccchhhhccccccccccccccceeeeeeecceEEEeccCcceEEEeccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceEEeccccchhhhccCceEEEEe Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 240 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg 240 (461) |++|.+||+||++||+||+++++|++|||||++|||||+|||++|.++|||+|+||+|||+||||++|||+++||+|||| T Consensus 172 ~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~ 251 (472) T protein:vir:17 172 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFIS 251 (472) T ss_pred cccccccccccCCCCceEEEEeeccEEEEEeccceEEEEeeCCCCCCcCceeecCcceeeecccCcchhhecCceEEEEe Confidence 99999999999999999999999999999999999999999999988999999999999999999999999999999999 Q ss_pred eccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeeeecC Q lcl|NC_018275. 241 HPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLKT 320 (461) Q Consensus 241 ~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~~~t 320 (461) ||++|+++||+++||||||||||+||++|++|++||+++|+||+|+||||+||+||||+||||||++|+||||||+++++ T Consensus 252 ~d~~g~~~V~~~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~~Tw~yD~~t~~Wherw~~~~~ 331 (472) T protein:vir:17 252 NPATGAPSVYIIGSGQVSPISSASIEKILRSYTADELADGVMESLRFDAHELLIIHLPRHVLVYDASSSANGPQWCVLKT 331 (472) T ss_pred cCCccccEEEEccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCCceeEeecccccCceeeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchhhee Q lcl|NC_018275. 321 GLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLFL 400 (461) Q Consensus 321 g~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~l 400 (461) |++++|||++|+|+++||++|||++||+||+||+|++||+|+|++|++++|++|++|+|||++|||+++|++|.++++|| T Consensus 332 g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ld~~~~td~g~pi~~~~~~p~~~~~~~RV~d~el~~~tG~~~~adp~~l 411 (472) T protein:vir:17 332 GLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFTPLFKADNARVFDLEVESSTGVAQYADRLFL 411 (472) T ss_pred CCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeEEEEecceeeCCCceEEEEEEeeeCCcccCCCceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998877665 Q ss_pred -eeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 401 -SATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 401 -s~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) .||| |.+||+|+|+++|++|||++|++||||||||+||||||||++|+||+|++|||++| T Consensus 412 ~~~sD-g~~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~v~f~~~~~~~~~~~l~~a~~~~e 472 (472) T protein:vir:17 412 SATTD-GINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKSPVTLSGCQIRIE 472 (472) T ss_pred EcccC-CcccchhhhhhhccCcccccceeeeeeeeccccceEEEEEeecccceeeeeEEEeC Confidence 5555 99999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=100.00 E-value=1.1e-217 Score=1209.96 Aligned_cols=459 Identities=61% Similarity=1.047 Sum_probs=449.4 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) ++|+.++|||+++|||||||+||++++|+++|++|||++++++.+|++||++||+.++.||+|||++|||+.+++|+|+| T Consensus 18 ~~~~~~~~d~~~~~PVN~~a~p~~~~~s~~~L~~~pG~~~~~~~~G~~RG~~~~~~~g~lY~V~G~~LY~v~~~vG~I~g 97 (477) T protein:vir:35 18 LVKDIKTADYIDALPVNMLATPKEVLNASGYLRSFPGIEKKQDAKGVSRGVHFNTKNNALYRVCGNTLYRNDKEVADIAG 97 (477) T ss_pred cccccccccceeeeeeccceeeccccccccccccCCcceeeccCCccccceeEeecCCeEEEEecCeeEeeeeeeeeecc Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+++|+||++|++++|+||++++++.+++.+ .||++||+++++|||+||||||++||+++||||+|+|+++ T Consensus 98 sg~VsMa~n~~~~aIv~~g~~~gy~y~~t~~~~~~~~~~-~~p~~~l~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s~ 176 (477) T protein:vir:35 98 MSRVSMSHSSHSQAICFEGKVKLYRYDGTEKALSNWPKD-KYPQYDLGEVIDVCRNRGRYIWLQKGGERFGVTDLEDESK 176 (477) T ss_pred cccEEEeeCCcEEEEEECCcceeEEEecccceeeecCcc-ccCCccccceeEEEeeCceEEEeecCCCeEEEeecCCccc Confidence 999999999999999999999999999999999985544 6999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccc-cceEEeccccchhhhccCceEEEE Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQP-SLMVQKGIAGTYCKTPFADSYAFI 239 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~-~~~I~~Gca~~~sv~~~~~s~~wl 239 (461) +|+|+.||+||++||+||+++++|++|||||++|||||+|||+++ ++|||+|.+ ++|||+||||++|||+++||+||| T Consensus 177 ~d~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~-f~~p~~r~~~~~mIq~Gcaa~~sv~~~~~t~~~l 255 (477) T protein:vir:35 177 PDRYQPFYRAESQPDGIVSVDAWRDLIVCFGSSSIEYFTLTGSAD-TSQPLYIHQAAYMIQAGIAGRDCKCRYQDKYAIL 255 (477) T ss_pred cccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCC-CCcceeecCCceeeeecccCchhhhhhCceEEEE Confidence 999988999999999999999999999999999999999999997 556888875 556899999999999999999999 Q ss_pred eeccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeeeec Q lcl|NC_018275. 240 SHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCVLK 319 (461) Q Consensus 240 g~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~~~ 319 (461) |||++|+++||+++||||||||||+||++|++|+++|+++|+||+|+||||+||+||||+||||||++|++||||||+++ T Consensus 256 ~~d~~g~~~V~~~~g~q~~rIST~aIE~~i~ay~~~e~a~af~~t~~~eGH~fy~LtfP~~Tw~yD~at~~w~e~W~~~~ 335 (477) T protein:vir:35 256 SHQSTGQPAVYLIGAGEKNKISTATIDKIIRYYSADELAASFMESIRFDNHELLLLHLPKHTLCFDGSASHQYSQWSLLK 335 (477) T ss_pred ecCCCcccEEEEccCceeEEecCHHHHHHHHhcCCcchhceeEEEEEeCCeeEEEEEcCCceEEEecccccccceeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchhhe Q lcl|NC_018275. 320 TGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADRLF 399 (461) Q Consensus 320 tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~ 399 (461) +|++++|||++|+|+++||++|||++||+||+||+++++|+|+|++|++++|++|+||+|+|++|||+++||+|.+++|| T Consensus 336 ~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~~d~g~~i~~~~~~p~~~~d~~Rv~~~el~~~tGvgq~~d~v~ 415 (477) T protein:vir:35 336 SGFYDEPYRAIDFMFFDNQITVGDKKEGVLGHLIFNASNQYEQQTEHLLYTPMIKADNARLFDFELEASTGVAQIADKLF 415 (477) T ss_pred cCCccCceEEEEEEEeCCeEEEEEcCCCeEEEECCCCcccCCCccceEEecceeeCCCCeEEEEEEEEecCcCccCceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 400 LSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 400 ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|++|+++|| T Consensus 416 L~~sddG~~~~~~~~~~~g~~g~~~~r~~~~RlG~~r~~vgf~~r~~~~~pv~l~~~~~~~e 477 (477) T protein:vir:35 416 LSVTTDGINYSREQLIEQNSPFQYDKRILWRRIGRVRKNIGFKIRIITKSPVTLSDLSIRME 477 (477) T ss_pred EEEeccccccccceeecCCCccccccceeeeeeeeceeccceEEEEEecCCceeccceeEeC Confidence 99999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=100.00 E-value=8.4e-215 Score=1194.22 Aligned_cols=458 Identities=66% Similarity=1.123 Sum_probs=453.1 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEeccceEEeecC Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKGETVVGDVAG 80 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v~~~iG~i~g 80 (461) |+|+++++||++.|||||||+||+++|++++||++||++++++++|++||++||++++.||+|||++||++++++|+|+| T Consensus 12 ~~~~~~~~~~~~~lpvN~y~~p~~~~~ss~~lr~~PG~~~~~~~~g~~RG~~~~~~~~~lY~V~G~~Ly~v~~~vG~iag 91 (472) T protein:vir:10 12 LGKARDDADYIDALPVNMLATPKPVLNASGYLRSFPGITHKAEVAGVSRGVQYNTHEKTVYRGLGNQLYKGHKPIADLAG 91 (472) T ss_pred cccCccccCceeeeeeeeeeccccccccceeecccCCceeecCCCcccceeEeeeeCCeEEEEecceEEEEEeeeeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEeccCCccc Q lcl|NC_018275. 81 SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESH 160 (461) Q Consensus 81 sg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t~ 160 (461) +|||||+||+.+|+|+.++++++|+|++++++++||+.+..++++||+.+++|||+||||||++||+++||||+|+|+++ T Consensus 92 sg~VsMa~~~~~q~v~v~g~~~~y~y~g~~~t~~~~~~~~~it~~dl~~~~~v~~~dGyfV~~~~gt~~~~iS~L~d~s~ 171 (472) T protein:vir:10 92 KGRISMAFSRNSQAVVAAGKMTLYRYDGTVKTLENWPKEKKYTQYDIGNVRDMCHLRGRYVWCKDGSDIFGVTDLEDESH 171 (472) T ss_pred cccEEEEecCCceEEEEecceeEEEeccchhhhhhccccccCCccccCCceeEEEeCceEEEeecCCceEEEeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccc---cccceEEeccccchhhhccCceEE Q lcl|NC_018275. 161 PDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVA---QPSLMVQKGIAGTYCKTPFADSYA 237 (461) Q Consensus 161 ~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~---~~~~~I~~Gca~~~sv~~~~~s~~ 237 (461) |++|+.||+||++||+||+++++|++|||||++|||||+|||+++ |||+| +||+|||+||||++|||+++||+| T Consensus 172 ~~~~~~FatAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~---fpf~r~~~~pg~~iq~Gcaa~~sv~~~~~s~~ 248 (472) T protein:vir:10 172 PDRYRALYRAESQPDGIIGIDSWRDFIVCFGASTIEYFSLTGAAD---GQSAIYAAQPALMVEKGIAGTHCKTRLGDAHV 248 (472) T ss_pred CCcccceeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCC---cceeeeccCccceeeecccCchhhhhhCceEE Confidence 999988999999999999999999999999999999999999998 99998 889999999999999999999999 Q ss_pred EEeeccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcCCeEEEEEccccCCcceeee Q lcl|NC_018275. 238 FISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSSQNGPQWCV 317 (461) Q Consensus 238 wlg~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~~Tw~yD~~t~~w~e~w~~ 317 (461) |||||++|+++||+++||||||||||+||++|++|+++|+++|+||+|+||||+||+||||+||||||++|+|||||||. T Consensus 249 ~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i~~y~~~e~~dA~~~s~~~eGH~fy~LtfP~~Tw~yD~at~~~~~~w~~ 328 (472) T protein:vir:10 249 IISHQATGAPSVFLINQAQATSIATATIEKILRSYTHDELASAVMETVRFDSHELVLIHLSRQVLCYDAAANQNGLQWSL 328 (472) T ss_pred EEecCCCcceEEEEccCceEEEecCHHHHHHHHhCCcccccceeEEEEEeCCeEEEEEEcCCeeEEEeccCCccceeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCCCCchh Q lcl|NC_018275. 318 LKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVAQYADR 397 (461) Q Consensus 318 ~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~q~~~~ 397 (461) +++|++++|||++|+|+++||++|||++||+||+||++.++|+|+|++|++++|+++++|+|||++|||+++||++.+++ T Consensus 329 ~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~td~g~pi~~~~~tp~~~~~n~Rvfd~el~~~tGvg~~~~~ 408 (472) T protein:vir:10 329 LKTGFYHAPYRGIDFMFADHHLTCGDKNDSLLGQLDFASSAQYEKPQEHVLYTPLFKADNARVFDFELEASTGVAHIADR 408 (472) T ss_pred eecCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcCcCcCCCCceeEEEeeccceecCCCeEEEEEEEeeCCcCccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 398 LFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 398 ~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |||||||||.++++++++.++.++.|++|++||||||||+|||||||+++++||.+++++|+|| T Consensus 409 v~L~wSddg~~~~~~~~~~~~g~~~~~~r~~w~RlG~ar~~vgf~~rv~~s~pv~~~~~~a~~e 472 (472) T protein:vir:10 409 LFLSATADGLHFGREQMINQNAPFAYDRRILWRRMGRVRKNLGFKVRVITSSPVTLSGCQIRME 472 (472) T ss_pred EEEEEeccccccchhHHHhhcCccchhheeeeheeeccccccceEEEEEEecccccccceeeeC Confidence 9999999999988888999999999999999999999999999999999999999999999999 No 8 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=100.00 E-value=3.5e-185 Score=1031.81 Aligned_cols=434 Identities=19% Similarity=0.252 Sum_probs=392.0 Q ss_pred CCCccccCcee-EeeeecccccccccccccceeEeCCCceecccCCCcccceEEEEecCeEEEEeCCeEEec-----cce Q lcl|NC_018275. 1 MGKDFKNADYI-DYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVLGSKLYKG-----ETV 74 (461) Q Consensus 1 ~~~~~~~~d~~-~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~rG~~y~~~~~~lY~V~G~~Ly~v-----~~~ 74 (461) +|+.-.+.+.+ +.+||||||+|+|++||+++||++||+++|++++|++++.+|. +++.||+|+|++||+| .++ T Consensus 8 ~gsy~a~~~~~daq~~VN~yp~~~e~g~ss~~l~~tPGl~~f~~~~~~~~~g~~~-~~g~ly~v~g~~LY~V~~~~~~~~ 86 (458) T protein:vir:10 8 LVATTAEGDVSGQEILVNVYPRKSDGGKYPFTLRHTPGLAFFCELPTFPVMAMHQ-NGSRAFAVTPRDMYEISKDGTYKR 86 (458) T ss_pred eeeeecccccccceeeeeeeeecccccccccceEecCCceeeecCCCCceeeEEe-cCCEEEEeeCceEEEEeCCceEEE Confidence 88877777777 5599999999999999999999999999999998877533332 2444455555555542 249 Q ss_pred EEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEeeCCceEEEec Q lcl|NC_018275. 75 VGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITD 154 (461) Q Consensus 75 iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~ 154 (461) +|+|+|+|||||+||+++++|+++. ++|+||++++.++ +++|+.|++ +.+|||+||||||++||+++||||+ T Consensus 87 iG~i~gsg~VsMa~ng~q~vi~~G~--~gY~yd~at~~~~-~i~d~~~~~-----~~~v~~~dGy~V~~~~g~~~~~is~ 158 (458) T protein:vir:10 87 LGSVDFKGRVVMEDNGKQIVMVDGE--KGYYYDSETEIVQ-EIKAEGFYP-----ASTVTYQDGYFIFDRKGTGQFFISE 158 (458) T ss_pred EecccCceeEEEeeCCcEEEEEECC--eEEEEeecccEEE-eccCccccC-----cceEEEeCcEEEEEeeCCCEEEEEe Confidence 9999999999999999977777664 5999999888777 467776555 5699999999999999999999999 Q ss_pred cCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceEEeccccchhhhccCc Q lcl|NC_018275. 155 LEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFAD 234 (461) Q Consensus 155 L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~ 234 (461) |+|++ .|.++ ||+||++||+||+++++|++|||||++|||||+|||++| |||+|+||+|||+||||++|||+++| T Consensus 159 L~d~s-~d~l~-fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEvw~ntG~a~---fpy~r~~ga~i~~Gcaa~~sv~~~~~ 233 (458) T protein:vir:10 159 LLDVA-FDPLD-FATAEGQPDPLLAVLSDHREVFMFGQETIEVWYNSGAAD---FPFERNQGAFIEKGIGAPYSVAKTNN 233 (458) T ss_pred cCcce-eCcce-eeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCC---cceeecccceeeecccCcchhhhhCc Confidence 99975 55654 999999999999999999999999999999999999998 99999999999999999999999999 Q ss_pred eEEEEeeccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEEEEEEEcC--CeEEEEEccccCCc Q lcl|NC_018275. 235 SYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLP--RHVLVYDASSSQNG 312 (461) Q Consensus 235 s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P--~~Tw~yD~~t~~w~ 312 (461) |+||||||. +||+++||||+|||||+||++|++|+ +++|+||||++|||+||+|||| ++|||||++|+||| T Consensus 234 t~~~l~~d~----~Vy~l~g~~~~rIST~aIE~~i~sy~---~~da~a~t~~~eGH~fy~LtfP~a~~Tw~yD~~t~~Wh 306 (458) T protein:vir:10 234 TVYFIGSDL----MIYQITGYTPVRISTHAVEQTLKGVN---LSDAFAYTYQSEGHLFYVLTIPGKNLTWCYDISSGSWH 306 (458) T ss_pred eEEEEcCCe----EEEEecCceeEEeeCHHHHHHHhcCC---hhheEEEEEEecCeEEEEEECCCCCceeEEecccccce Confidence 999999986 69999999999999999999999994 7779999999999999999999 58999999999655 Q ss_pred ceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEE--EEEEEEEcC Q lcl|NC_018275. 313 PQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCF--DLEVESSTG 390 (461) Q Consensus 313 e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~--~~~le~~~G 390 (461) | |+||.+ +|||++|+|+++||++|||++||+||+||+++++|+|+|++|++++|++|++++|++ ++|||+++| T Consensus 307 e----r~Sg~~-~~~Ra~~~v~~~g~~~vGD~~ng~ly~ld~~~~td~g~~i~~~~~~p~~~~~~~rl~~~~~el~~~tG 381 (458) T protein:vir:10 307 V----RQSYQF-DRHVSNNSIYFDQKTLVGDFQNGRIYIMADNYYTDDGDPVVREFILPVVNNGREFLTVDSLELDLSSG 381 (458) T ss_pred e----eccCCC-CceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeeeeeeccceeCCCCeEEEEEEEEEEecc Confidence 5 667544 699999999999999999999999999999999999999999999999999999875 899999999 Q ss_pred CCCC-----chhheeeeccC-ccccCcceee-ccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 391 VAQY-----ADRLFLSATTD-GINYGREQMI-EQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 391 v~q~-----~~~~~ls~sdD-G~~~~~~~~~-~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |+|. +|++||.|||| |.|||+++++ ++|++|||+||++||||||+|+|| |||||++|+|++|+|||++|. T Consensus 382 vg~~~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~rv-f~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 382 VGLTVGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQFT-FKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred eeeeeCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhhhhhhhhccCcceE-EEEEEecchhhcceeeeEEeC Confidence 9953 68899999998 9999999999 689999999999999999999999 999999999999999999999 No 9 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=99.93 E-value=4.1e-25 Score=154.30 Aligned_cols=428 Identities=12% Similarity=0.114 Sum_probs=255.8 Q ss_pred CCCccccCceeEe-eeecccccccccccccceeEeCCCceec-ccCCCcccceE-EEEecCeEEEEeCCeEEeccc--eE Q lcl|NC_018275. 1 MGKDFKNADYIDY-LPINMLATPKEVLNSSGYLRSFPGIAKR-NDVNGVSRGVE-YNTAQNAVYRVLGSKLYKGET--VV 75 (461) Q Consensus 1 ~~~~~~~~d~~~~-~pvn~~a~~~~~~~s~~~L~~~PGl~~~-~~v~G~~rG~~-y~~~~~~lY~V~G~~Ly~v~~--~i 75 (461) |+|.=-=.|+.-+ ||.|-+-+-..+....+.....||.++. +.++.+++|+. |-.-++..+.++++++|.... -. T Consensus 9 ~~~~g~~~d~~p~~lp~~a~s~~~N~~~~~~~~~~~~g~~pv~a~~~~~~~g~~~~~~~g~~~~~~~~~~~~~~~~~~t~ 88 (513) T protein:vir:88 9 KNPTGIVTDIAPADLPLDKWSFGNNVRFKNGKAQKALGHSPIFDTAQAPILDMFPFIRNNIPYWLLCSEKRLYLADGTTI 88 (513) T ss_pred cccccceeccChhhcCCCcceeeeeeeEecceeeecCccceeeecCCCCceeeeeeecCCCeEEEEeeceEEEEecCcee Confidence 5544333444433 5665444455667778888999999877 67777888874 323333456678887775332 23 Q ss_pred EeecCc-------ccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEEEEee--- Q lcl|NC_018275. 76 GDVAGS-------GRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWSKD--- 145 (461) Q Consensus 76 G~i~gs-------g~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~~~--- 145 (461) .+|+++ .|.+++-=+..+ +..++.-.-..+|++-.+++- .+++|+ +.....+....++ +|... T Consensus 89 ~dvs~~~~~~~~~~~w~~~~f~~~i-~a~ng~~~~q~~~~~s~~f~d---l~g~p~--~~~a~~i~v~~~f-lv~~~~t~ 161 (513) T protein:vir:88 89 IDVSPGPYSASVTNRWSVGSFNGVI-FANDGVNPPHHLPPTESVFRV---LPNFPA--NTTFRRLKSFKNF-LIGLNVTS 161 (513) T ss_pred eeccccceeecccCceeeeeecCEE-EEEcCCCcceEEcCCCceeee---ccCCCc--ccceEEEEEEeeE-EEEeeccc Confidence 355542 244555433333 334443333446765555542 333332 2344455555665 44322 Q ss_pred C----CceEEEeccCCc----cccccCc--c---eeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCccc Q lcl|NC_018275. 146 G----TDSWFITDLEDE----SHPDRYS--A---QYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYV 212 (461) Q Consensus 146 g----t~~f~iS~L~d~----t~~d~~~--~---f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~ 212 (461) + -++...|+++|+ ++|+.-. . |--.-...+.||..++.++.+++|-+++|-.+..+|.+. .|.|. T Consensus 162 ~~~~~PnrV~wS~~~D~~~~P~~W~~t~~t~~a~~~~l~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~~~--if~~~ 239 (513) T protein:vir:88 162 NSIEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGLY--IFQFQ 239 (513) T ss_pred CcCCCCceEEEecccCCcccccccccccccCcccccccCCCccceeeeeecccceEEEecccEEEEEecCCCc--eEEEE Confidence 2 368999999996 4443210 0 001111346799999999999999999997777677653 34444 Q ss_pred ccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHHHHHHh-cCcccccceEEEEEEECCEE Q lcl|NC_018275. 213 AQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRS-YTADELATGVMEALRFDSHE 291 (461) Q Consensus 213 ~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~-y~~~el~~A~~~ty~~~GH~ 291 (461) . +.-..||.+|.|++.+++.+|||++++ ||+++|.++++|+-..|++.+-. .+...++.. +....+=++ T Consensus 240 ~---i~~~~G~~~p~SI~~~~~~~ffls~~G-----f~~~~G~~~~~Ig~ekVdk~f~~~~n~~~~~~~--~~~~d~~~~ 309 (513) T protein:vir:88 240 Q---LFNDVGILGPNCAIEFDGNHFVVGHGD-----VYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRT--FVLADHVNT 309 (513) T ss_pred e---ecccccccCCceeEEECCeEEEEeCCc-----eEEecCceeeecccchhhhhhhccCCcccceEE--EEEEcCccc Confidence 4 455899999999999999999999997 99999999999999889996533 443333332 223334444 Q ss_pred EEEEEcC----------CeEEEEEccccCCcceeeeecCC------c-----ccc-ce--------------EEEEEEec Q lcl|NC_018275. 292 LLIIHLP----------RHVLVYDASSSQNGPQWCVLKTG------L-----YDD-VY--------------RAIDFMYE 335 (461) Q Consensus 292 fyvlt~P----------~~Tw~yD~~t~~w~e~w~~~~tg------~-----~~~-~~--------------Ra~~~~~~ 335 (461) -+++.+| ++.++||-.+. +|+.++-. + ... -| ........ T Consensus 310 ~v~~~y~s~~~~~~~~~~~~lVYd~~~~----~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 385 (513) T protein:vir:88 310 EMWVCYSSTRSEPGKHCDRAIIWNWKEN----TWSIRDLPNVLSGAYGIIDPKTSNLWDDDSNPWDTDTSVWGEGSYNPA 385 (513) T ss_pred EEEEEecCCCCCCCcccceEEEEEccCC----eEEEEeccchhhcccccccccccceecccccccccchhhhhccccccc Confidence 4555555 24689998887 56554310 0 000 01 11111111 Q ss_pred CCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccC-CCceEEEEE--EEEEcCCCCCchhheeeecc--C-cccc Q lcl|NC_018275. 336 GNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKA-DNARCFDLE--VESSTGVAQYADRLFLSATT--D-GINY 409 (461) Q Consensus 336 ~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~-~~~rv~~~~--le~~~Gv~q~~~~~~ls~sd--D-G~~~ 409 (461) ....+.++++.|.++.+|.+. +..|.|++..+++|-+.. ++.++..+. +...++-+. -.+-|...+ + -.+| T Consensus 386 ~~sl~~~~~~~~~~~~fd~~~-~f~G~~lea~~~t~~~~~~~~~~~~~i~~v~~~~t~~g~--~t~~vg~~~~~~~~~~~ 462 (513) T protein:vir:88 386 KSSMIFTSFQDAKLFLFGETS-TFSGQSFTSTLERSDIYLGDDRMMKTVSAVIPHITGNGV--CNIWVGNAQVQGSGIRW 462 (513) T ss_pred cceeEeeeccCCceeeecccc-cccCCceEEEEEecCccccCchhheeeeeeeeeeecceE--EEEEEeeeccCcccccc Confidence 133456778888899888664 679999999999988774 344432211 001111111 111233333 3 6778 Q ss_pred CcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 410 GREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 410 ~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) +.+..-....-+. +..|+-| |- ..||||+....|-.+.|..+++- T Consensus 463 s~~~~~~~~~~~~----~~~r~~g--Ry-~~~ri~i~~~~~w~~~G~~ve~~ 507 (513) T protein:vir:88 463 KGPYPYRIGQDYK----IDTKHVG--RY-IALKFDFASAGDWYFNGYTLEMA 507 (513) T ss_pred ccceeeecccCce----EEeccCC--ce-EEEEEEccCCCceEEeeEEEEEe Confidence 8775555544443 3333333 22 34888888888899999888887 No 10 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=98.95 E-value=5.8e-09 Score=65.73 Aligned_cols=413 Identities=15% Similarity=0.102 Sum_probs=198.5 Q ss_pred CCCccccCceeE-eeeecccccccccccccceeEeCCCc--eec-ccCCCcccceE----EEEecCeE-EEEeCCeEEec Q lcl|NC_018275. 1 MGKDFKNADYID-YLPINMLATPKEVLNSSGYLRSFPGI--AKR-NDVNGVSRGVE----YNTAQNAV-YRVLGSKLYKG 71 (461) Q Consensus 1 ~~~~~~~~d~~~-~~pvn~~a~~~~~~~s~~~L~~~PGl--~~~-~~v~G~~rG~~----y~~~~~~l-Y~V~G~~Ly~v 71 (461) -||=+++.|-+| +|- |++-+-+..|-.|..- ++- -+-.-+++|.. |+.--+.| =.|-++.=|.. T Consensus 230 ~g~~pS~sd~~N~a~~-------k~~~~Ei~t~~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~s 302 (771) T protein:vir:95 230 SGKFPSNSDSVNLALS-------KRADVEPSTTDRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYPS 302 (771) T ss_pred CCCCcCCceeeccccc-------hhhccceeeecccchhhhhhcccCcccccCcceeeehhhhcccccceeeeccccchh Confidence 566566655544 221 1111111111111100 000 00000222210 00000000 01111111111 Q ss_pred ----------------cceEEeecCcccEEEEeCCeEEEEEECCc---EEEEEeecccccceeccccccccCccCCcccc Q lcl|NC_018275. 72 ----------------ETVVGDVAGSGRVSMAHGRTSQAVGVNGQ---LVEYRYDGTVKTVSNWPADSDYTQYELGSVRD 132 (461) Q Consensus 72 ----------------~~~iG~i~gsg~VsMa~N~~~~avv~~g~---~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~ 132 (461) +..|.. +.|||. =||+..+.|=.+.. ++.|.|= ..+.+.++ |++ T Consensus 303 ~~~~~~~l~~~~t~~~~~~vae--yagRvw-Yag~~~~~iD~dkng~~~~~~ilf---SqLv~s~~-------di~---- 365 (771) T protein:vir:95 303 LSFGVSSLPQDETPGGASVVCE--YAGRVW-YAGFSGQIIDGDDQSPRLVSYILF---SQLVDSPA-------DIV---- 365 (771) T ss_pred hhccccccccccCCCCceeEEe--eeeeEE-EecceeEEeeccccCCceeeeEee---ehhhcchh-------hcc---- Confidence 112222 444544 22323332221111 1122111 01111111 111 Q ss_pred ceeccceEEEEeeCCceEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCC----cc Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTA----GA 208 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~----~~ 208 (461) -||+|+-= + .=-+++|.| -|.. |.+-|+. -.|+.|..++..|++|++..+ |...|.++. .. T Consensus 366 nCyQd~DP---T----see~~dLid---TDGg--~iri~ga-h~ii~Lv~f~~sLlvfc~NGV--WAi~ggsd~g~tAtd 430 (771) T protein:vir:95 366 NCYQDGDP---T----STEEPELVD---TDGG--FIRIEGA-HDIINLVNVGSAVMVVAANGI--WMIQGGSDYGFTATN 430 (771) T ss_pred cccccCCC---c----hhhhhhhhh---cCCC--EEEecCC-CCceeEEEecceEEEEEecce--EEEEeccCCceeeee Confidence 26666531 1 002444442 2332 5555432 368999999999999999998 999666652 22 Q ss_pred CcccccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHHHHHHhcCcccccceEEEEE-EE Q lcl|NC_018275. 209 ALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEAL-RF 287 (461) Q Consensus 209 fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty-~~ 287 (461) |...+. =++||-+|.|+.-+|++++|.|.++-.+..+..+|.+.+|-|+...||+.+++.+. +...++.-.| .. T Consensus 431 Y~ltKI----s~vg~sspnSvVvvg~~i~ywsdtgIyal~~Ndfn~~tAqnLTekTIq~~~~~I~~-dk~knVtg~fd~~ 505 (771) T protein:vir:95 431 YLVTKI----SEHGCSSPNSVVVVDNSFMYWGDDGIYHLTRNQYGDYVANNLTEKTIQKYYEKIPS-DAILNATGFYDSY 505 (771) T ss_pred eEEEEe----eeeccCCCccEEEecceEEEeeCCceEEEeecccCcchhhccchHHHHHHHhhcch-hhhcceEEEEEcc Confidence 333333 25899999999999999999999999999999999999999999999999999985 4455555444 44 Q ss_pred CCEEEEEEEcCCe---------EEEEEccccCCcceeeeec-CCccccceEEEEEEecC------CeE------------ Q lcl|NC_018275. 288 DSHELLIIHLPRH---------VLVYDASSSQNGPQWCVLK-TGLYDDVYRAIDFMYEG------NQI------------ 339 (461) Q Consensus 288 ~GH~fyvlt~P~~---------Tw~yD~~t~~w~e~w~~~~-tg~~~~~~Ra~~~~~~~------g~~------------ 339 (461) |++ |.+.+|++ -+++|++++-..+ |.+.. ++.....--+.-++.+. .+. T Consensus 506 e~r--vyw~yPn~~D~~~e~~t~LV~dLalgaFYp-~~i~~~~ag~l~~~vg~~~~p~~~lv~T~~eV~v~~~~v~~tG~ 582 (771) T protein:vir:95 506 DKK--VKWLYNTVLDGRTEPVTELVFDLALGAFYP-SKIGSLTAGRLPIPVGSVKIPPYKLVETGEEVTVASEQVTATGE 582 (771) T ss_pred CCE--EEEEecceecCCCcceeeeeeeeccccccc-ccccccccCccceeeeeeecCccccccccceEEecceeeEecCC Confidence 777 66778832 3999999987765 43222 11111111111111100 000 Q ss_pred -----------------EEEEcCCCeEEEEcCCccCcCC-------------------------CEEEEEEeeccccCCC Q lcl|NC_018275. 340 -----------------ACGDKSEAVTGQLQFDISSQYD-------------------------KQQEHLLFTPLFKADN 377 (461) Q Consensus 340 -----------------~vGD~~~G~l~~ld~~~~td~g-------------------------~p~~~~~~tP~~~~~~ 377 (461) +.--...|.-+++.+..+++.+ .-+-+....|++++.= T Consensus 583 ~vtV~~~~r~~~~~~~~y~~~~~dg~~g~~~Fa~~~~~~f~DW~sv~~~~vdy~sy~~~gY~~~gd~~~~k~~PYit~y~ 662 (771) T protein:vir:95 583 LVTVKVSTRSPVIRETKYIIVEKLSSPMRISFGGYTDEEFVDWKSVDGIGVDAPAYLLTGYLAGGDYQREKFVPYITFHF 662 (771) T ss_pred ceEEEEEEeeccccceEEEEEEecCCCeeEEeccccCcceeecccCCCcccchHHHHHhhhhccchheeeeccceEEEEE Confidence 1111223333444444443322 2222233335444321 Q ss_pred ceEEEEEEEEEcCCCC-Cchh--he---eeeccCccc--cCcceeec--------c-CCCccccee----EEEEeeEecc Q lcl|NC_018275. 378 ARCFDLEVESSTGVAQ-YADR--LF---LSATTDGIN--YGREQMIE--------Q-NEPFVYDKR----VLWKRVGRIR 436 (461) Q Consensus 378 ~rv~~~~le~~~Gv~q-~~~~--~~---ls~sdDG~~--~~~~~~~~--------~-g~~g~y~~R----~~~~rlG~~r 436 (461) ..--|=-||-..|-=. +++- || .+||-++++ |++.+.+= . +.---|+.- -+.|--|+.| T Consensus 663 ~~tedg~v~~~~g~~~p~n~sSclm~~sw~ws~s~~t~k~~~~~eaYk~~~~~~p~~~~~~~yp~~~VV~TKsriRG~Gr 742 (771) T protein:vir:95 663 KKTEDGFVEDAEGDWTPTNQSSCMVQSQWSWTNSPASNKWGRTWQAYRFRRHFFPDNIDNQFDDGNSVVETKSRLRGSGK 742 (771) T ss_pred EeecccceecccccccccCCcceEEEEEeeeecCCCCCccccchheeeecceeccCCcchhcCCccceeeeeheeeecce Confidence 1111223555555111 1221 44 567777665 77765541 1 111112211 2344446544 Q ss_pred cceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 437 RLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 437 ~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) .- .|+|.--.+-..-|.|-+|=++ T Consensus 743 ~~-~~rf~s~~gKdlhl~Gysil~~ 766 (771) T protein:vir:95 743 VL-SLYITTEPKKNLHIYGWSMLVD 766 (771) T ss_pred EE-EEEEEecCCcceEEEeEEEEEe Confidence 44 4667666677788888888888 No 11 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=98.94 E-value=5.7e-09 Score=65.78 Aligned_cols=416 Identities=15% Similarity=0.160 Sum_probs=191.2 Q ss_pred CCCccccCceeEeeeecccccccccccccceeEeC------CCce---eccc---------CCCcccceEEEEecCeEEE Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSF------PGIA---KRND---------VNGVSRGVEYNTAQNAVYR 62 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~~~~s~~~L~~~------PGl~---~~~~---------v~G~~rG~~y~~~~~~lY~ 62 (461) --||-+-.-.+---||.+|=.++...+|-..|+.. .-++ .|.+ -.-++-|.. .-++.|- T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 291 (911) T protein:vir:31 215 ATKDKSGSGTVYVNPVQYYFDKRGVYPSHSVLYNSMKQESAKEIVALNVFSPWADEKINFGTTTPPLGRY---IHSAYYF 291 (911) T ss_pred ecccCCccceEEEchhheeecccCcCcchhhhhhhhhhhccceeEEEeeeccccccccccccCCCchhhh---hhhheee Confidence 23333333333334555555544444444444321 1110 0100 011222221 1111111 Q ss_pred EeCCeEEe-------ccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEe--ecccccceeccccccccCccCCccccc Q lcl|NC_018275. 63 VLGSKLYK-------GETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRY--DGTVKTVSNWPADSDYTQYELGSVRDI 133 (461) Q Consensus 63 V~G~~Ly~-------v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~y--dg~~~~~~~~~~d~~~~~~dl~~~~~v 133 (461) ...++.. --+.-|+..|+||--=+-. +-+-.-+-|.+..++| .|++ +|+.- | .++++ T Consensus 292 -~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~e~~-np~gl~~igt~~n~k~~a~~~~----~~~~~------~--r~r~~ 357 (911) T protein:vir:31 292 -DSAAILSLGIGNLTPPTSDGTTEGSGPAEEEIS-NPIGLDNIGTVNNLKLIAEGTV----RWTVK------D--RPRCS 357 (911) T ss_pred -ccceeeeecccccCCCCCCCccCCCCCchhhhc-CCCCcccccchhceeeeeccce----eeeec------c--cccce Confidence 1111111 0123345556665321110 0000000111111111 1111 12211 2 34567 Q ss_pred eeccce-EEEEe--eCCceEEEeccCCcc--ccccCcc-eeEEecCC---------------CceEEEEecCCEEEEEEc Q lcl|NC_018275. 134 TRLRGR-YAWSK--DGTDSWFITDLEDES--HPDRYSA-QYRAESQP---------------DGIIGIGTWRDFIVCFGS 192 (461) Q Consensus 134 ~~~dGy-fV~~~--~gt~~f~iS~L~d~t--~~d~~~~-f~tAE~~P---------------D~iv~v~~~~~~l~lfG~ 192 (461) .|..|| |+-.+ .|..+...|.|.+-. -++-|.+ =-+||..| ++|+.+..++.-|++|++ T Consensus 358 ~~yaGRVfyaD~dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~vri~gah~Ii~LV~~G~sLlVFca 437 (911) T protein:vir:31 358 GYHNGHVYFGDRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATDGFTMYPVGMGAPITMVEFNKRLLLLCT 437 (911) T ss_pred eeeccEEEEeeeccCcceeEEEEeeccccccccccccCCCccccccchhhhcCCcEEecCCCCCceEEEEecCeEEEEEe Confidence 788888 32233 235578889887522 2333321 11344444 479999999999999999 Q ss_pred ceEEEEEecCCCCCccC---cccccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHHHHH Q lcl|NC_018275. 193 STIEYFSLTGATTAGAA---LYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKII 269 (461) Q Consensus 193 ~T~Evw~ntG~~~~~~f---p~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE~~i 269 (461) .++ |.-.|.++ +.| -|.-+.= =++||+.|.|+.-+|+.++|+|+++-.++.+.+++-+.++-++...|+..+ T Consensus 438 NGV--WAI~G~d~-~g~TATdy~ItKI--sdvGcsspNSVVvVgn~i~fWSd~GIyaLganqfnD~tAnNLTesTIQ~y~ 512 (911) T protein:vir:31 438 NGV--WAIRGTSG-GGATATDFTLDKV--ASVEFNSPQSVVDIGTAIVFWSERGIIAIGVNDFGDLTSNNLTENTIDEYY 512 (911) T ss_pred CcE--EEEeccCC-CceeeeeeEEEEE--eeeeeCCCCeEEEecCceEEeeCCcEEEEeecccCccccccccHHHHHHHH Confidence 998 99987552 222 2332221 246999999999999999999999988888888888989988878899999 Q ss_pred HhcCcccccceEEEEEEECCEEEEEEEcCC-------------eEEEEEccccCCcceeeeecCCccc-cceEEEE---- Q lcl|NC_018275. 270 RSYTADELATGVMEALRFDSHELLIIHLPR-------------HVLVYDASSSQNGPQWCVLKTGLYD-DVYRAID---- 331 (461) Q Consensus 270 ~~y~~~el~~A~~~ty~~~GH~fyvlt~P~-------------~Tw~yD~~t~~w~e~w~~~~tg~~~-~~~Ra~~---- 331 (461) ++.+++-+..+-++....|++ |++.+|+ +.++||++|+.|++ |.. +++... -++|.-. T Consensus 513 d~I~~dkIkNVtgtyd~de~r--VyW~yPn~lDe~teykt~~~~ILVfdLatgaFYP-wtv-s~gpLl~~p~y~Lv~Tre 588 (911) T protein:vir:31 513 DSLDRDIIKNVKGTFINDENR--VYWVVPNKQDSNGEYKTDGELVLVLNLDTGGFYK-HTV-SGGPLLHAPFRRLVNTRA 588 (911) T ss_pred hhcChhhhceEEEEEEccCCE--EEEEecCccCCccceeecCceEEEEEeccCcccc-eee-ecceeecccccccccccc Confidence 999987777777766566777 7777884 36999999999985 654 333221 1111100 Q ss_pred ----EEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEEEEeeccccCCCceEEEEEEEEEcCCC-------CCchhhee Q lcl|NC_018275. 332 ----FMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESSTGVA-------QYADRLFL 400 (461) Q Consensus 332 ----~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~~~~tP~~~~~~~rv~~~~le~~~Gv~-------q~~~~~~l 400 (461) ++-.-++-++=++.+..+..--...++.-+.++-.+ -+--|+. .--|.-|+ T Consensus 589 EvtvPi~~etgaiIve~gsdPV~~tl~vdttGvDg~ayLl------------------~frdg~~g~~~f~a~~~~~~~~ 650 (911) T protein:vir:31 589 EVSIPITETDGTVITDTLGDPVTVTRTVTTTGVDGLAYFA------------------SFDDGVNGQFNFIAEHQPWGFA 650 (911) T ss_pred cceeeEEeecceEEEecCCCCeEEEEeeecccccceeEEE------------------eeccCCcceEEEEEeecCCeee Confidence 011111112222222222211111111111111100 0001111 11122233 Q ss_pred eecc-------------C-ccccCcceee-ccCCCc---ccceeEEEEeeEecccceeEEEEEE-ecCcceEEEe---EE Q lcl|NC_018275. 401 SATT-------------D-GINYGREQMI-EQNEPF---VYDKRVLWKRVGRIRRLIGFKLRVI-TKSPVTLSGC---QI 458 (461) Q Consensus 401 s~sd-------------D-G~~~~~~~~~-~~g~~g---~y~~R~~~~rlG~~r~~v~f~~r~~-~~~p~~l~ga---~~ 458 (461) .|-- | |..|- |.|+ ..+.|- -|.+-++.+--|++.+-.+++|.-. ..--++|.-. ++ T Consensus 651 dw~~~~~~~~~~y~s~~~~~y~~~-~~~~~~~~~pyi~sy~~~~~rv~~~~y~~~~a~~~f~~~~~~~~~~~~~~~~~~~ 729 (911) T protein:vir:31 651 DWANVPNMTRVNYSSYVDFAYEYP-EVMIGNISLPYIHSYYLTGIRVQTEQYTTETAHLSFHRVQAHQTTALGTVTFHKV 729 (911) T ss_pred ccccCccccccchhHHHHhhhhhh-hhhhhcccCceeeeeeeeeeEEeccceeeecccceeEeeecccceeeeeeeeeee Confidence 3321 0 11111 1111 112221 1344445555555555544444321 1112222111 11 Q ss_pred EeC Q lcl|NC_018275. 459 RLE 461 (461) Q Consensus 459 ~~e 461 (461) +|- T Consensus 730 ~~~ 732 (911) T protein:vir:31 730 DMM 732 (911) T ss_pred eeh Confidence 111 No 12 >protein:vir:352 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:3197 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203466;genbank:gi:15320622;genbank:GeneID:921729 Probab=98.28 E-value=1.7e-06 Score=52.19 Aligned_cols=420 Identities=14% Similarity=0.102 Sum_probs=204.9 Q ss_pred CCCccccCceeEeee-----------------------ecccccccccccccceeEeCCCceeccc-CCCcccceE-E-- Q lcl|NC_018275. 1 MGKDFKNADYIDYLP-----------------------INMLATPKEVLNSSGYLRSFPGIAKRND-VNGVSRGVE-Y-- 53 (461) Q Consensus 1 ~~~~~~~~d~~~~~p-----------------------vn~~a~~~~~~~s~~~L~~~PGl~~~~~-v~G~~rG~~-y-- 53 (461) |.|.. .-=...|| -|++|++.. +|-=-|-.+.+. +.++.|=.+ | T Consensus 9 ~~~~~--~~~~~~~pAPv~G~~t~~~~A~m~~~~A~vldN~fpt~~g-------~r~R~G~~~~at~~~~~v~s~~~~~~ 79 (536) T protein:vir:35 9 VPPPP--SIQEAHLPAPVGGLNTVSAGSAMPVSDCLQGFNLIASELG-------LRSRLGYREWCTGLGVPARSTLPFAG 79 (536) T ss_pred CCCCc--cceeeeeCccccceeccchhhcCCCCceEEEeecCCChhh-------hhhhccchhHhcCCccceEEeeeeee Confidence 22221 12223333 133333321 111122233333 233332221 2 Q ss_pred ---EEecCeEEEEeCCeEEeccc------eEE-eecCcc---c-EEEEeCC--eEEEEEECCcEEEEEeecccccceecc Q lcl|NC_018275. 54 ---NTAQNAVYRVLGSKLYKGET------VVG-DVAGSG---R-VSMAHGR--TSQAVGVNGQLVEYRYDGTVKTVSNWP 117 (461) Q Consensus 54 ---~~~~~~lY~V~G~~Ly~v~~------~iG-~i~gsg---~-VsMa~N~--~~~avv~~g~~~~Y~ydg~~~~~~~~~ 117 (461) .=-++.|+-+.++.+|.|.+ ++- +-..+| - ++..||- -...++.+|.-..-+|++++...+... T Consensus 80 ~~~~Ga~~klf~at~~~i~dvT~pa~p~~~~~~~g~~~g~~~~w~~v~~~~~gG~~l~~~nG~~~~~~~~gt~~~w~~v~ 159 (536) T protein:vir:35 80 SAKSGAANRLFQTTSEGIWDVSASSQTPTQVLTFGDQTGDAGFGVSHAFVTQRGHFLFYADETNGLFRYSESTDTWTAVA 159 (536) T ss_pred ccccCcceeEEEecccceeeeecCCCCcceEEEeccCCCceeeEEEEEecCCCceEEEEEEcCCCceEeecccCchhhcc Confidence 01234577777777777643 111 000122 1 3334432 234455555555667888874433222 Q ss_pred ccc--cc-cCccCCccccceeccceEEEEeeCCce-EEEeccCCc-c-ccccCcceeEEecCCCceEEEEecC------- Q lcl|NC_018275. 118 ADS--DY-TQYELGSVRDITRLRGRYAWSKDGTDS-WFITDLEDE-S-HPDRYSAQYRAESQPDGIIGIGTWR------- 184 (461) Q Consensus 118 ~d~--~~-~~~dl~~~~~v~~~dGyfV~~~~gt~~-f~iS~L~d~-t-~~d~~~~f~tAE~~PD~iv~v~~~~------- 184 (461) ..+ .+ .+-|-.++.-++-.+-|..|.+.++-+ ||.-...-. + .+..+.+...+ =--|+++.+|. T Consensus 160 ~~t~~~~i~Gv~~~~l~~i~~~knRLffvq~~s~~awYLp~~av~G~A~~f~lg~~~~~---GGsL~~~~sWS~~~G~Gl 236 (536) T protein:vir:35 160 QGTGVGEIDGVNPANIVFVAVFKQRVWLVERDTARAWYLPAGAIAGTAQPFEMGAQFRA---GGHLVGLWNWTYDGGAGM 236 (536) T ss_pred cCCcccccCCCCcccceeeeeEeeeEEEEEeCCceEEEeecccccceeeeeeccCcccc---CceEccceeeccccCCCc Confidence 111 11 112222333455556664455444433 544322210 0 11111100000 01133333322 Q ss_pred -CEEEEEEcceEEEEEecCCCCC-ccCcccccccceEEec---cccchhhhccCceEEEEeeccccceEEEEccCc-eee Q lcl|NC_018275. 185 -DFIVCFGSSTIEYFSLTGATTA-GAALYVAQPSLMVQKG---IAGTYCKTPFADSYAFISHPATGAPSVYIIGSG-QAS 258 (461) Q Consensus 185 -~~l~lfG~~T~Evw~ntG~~~~-~~fp~~~~~~~~I~~G---ca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~-q~~ 258 (461) +++. |=+...||+-=.| +++ .+=.+.-.. .-.+| -.++.|.-+++.-+.+++.|+- -.|+.- |-. T Consensus 237 ~d~~V-fvSs~GeVaVyqG-sdPs~s~~Wsl~g--iy~IG~~pp~G~r~~i~~G~Dl~iit~dGi-----vplsq~~q~d 307 (536) T protein:vir:35 237 DDSLV-AISGGGDVAIWQG-TDPASSATFGLRG--VWSLGGSPPAGRRIATDYGGDVLVLSRLGV-----RPLSRLVAGE 307 (536) T ss_pred ceeEE-EEecCCcEEEEec-CCCCcccceeEEE--EEEeccCCCCCceEEEeecCeeEeeecCCc-----cchhhhhhhh Confidence 2222 2223355554444 332 122233322 23456 4788899999999999999873 333332 222 Q ss_pred ecC----CHHHHHHHHhcCcccccceEEE-EEEECCEEEEEEEcC------CeEEEEEccccCCcceeeeecCCccccce Q lcl|NC_018275. 259 PIA----TASIEKIIRSYTADELATGVME-ALRFDSHELLIIHLP------RHVLVYDASSSQNGPQWCVLKTGLYDDVY 327 (461) Q Consensus 259 rIS----T~~IE~~i~~y~~~el~~A~~~-ty~~~GH~fyvlt~P------~~Tw~yD~~t~~w~e~w~~~~tg~~~~~~ 327 (461) +.| |.+||..++++-..+ ....+| .-..--..++++.+| ++|++++..|+ .||.- ..| T Consensus 308 ~~a~~~it~~I~~~~~~~v~~~-a~~~gWq~~~~P~~n~liV~~P~~~g~~~~~fV~N~~tg----aW~~f------tgw 376 (536) T protein:vir:35 308 VDKDTYVTAKVSNLFSALMLTR-ASLPGWSMQLHPEDNALLVTVPTYPGQPTEQLVMALAGR----AWFRY------RDL 376 (536) T ss_pred hhcccCCCccchhhHHHHHhhc-cCCCccEEEEccCCCeEEEEccCCCCCCceEEEeecccC----ceeee------cCC Confidence 222 567777666532111 111122 112222334777777 37999999999 55532 489 Q ss_pred EEEEEEecCCeEEEEEcCCCeEEEEcC------CccCcCCCEEEEEEeeccccCC--C------ce-------------E Q lcl|NC_018275. 328 RAIDFMYEGNQIACGDKSEAVTGQLQF------DISSQYDKQQEHLLFTPLFKAD--N------AR-------------C 380 (461) Q Consensus 328 Ra~~~~~~~g~~~vGD~~~G~l~~ld~------~~~td~g~p~~~~~~tP~~~~~--~------~r-------------v 380 (461) -+.|...+.++..+|.- +|++|+.|- ....+.|+||...+....-|.- . +| + T Consensus 377 ~a~C~~v~~~~LyFG~~-dG~v~~~da~v~g~D~~~~~ag~~I~~~~~~af~~~G~~~~K~~~~~r~~~~s~~~~p~l~l 455 (536) T protein:vir:35 377 PIYSSAVWGGKLYFGTV-DGRVCVNDGYVDGVLLSEPSAFTPVQWSLLSAFTNLGSARQKQVQLLRPTLLSESATPSYEV 455 (536) T ss_pred cceEEEEecCeEEEeec-CCEEEecccccCccccccCcCcceeeeccccchhhcCchHHHHHHHhhhhhhhccCCceEEE Confidence 99999999999999987 999999773 1233468888876655433221 0 12 1 Q ss_pred -EEEEEEEEcCCCCCchhheeeeccCccccCcceee--ccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeE Q lcl|NC_018275. 381 -FDLEVESSTGVAQYADRLFLSATTDGINYGREQMI--EQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQ 457 (461) Q Consensus 381 -~~~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~--~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~ 457 (461) .++++++ +..+|..-++ -.+.+|....+= ... ++|.-|=+|+-++..-..+...+|-.+..+..|-.+. T Consensus 456 ~~~~d~D~----~~p~~~~~~~--~~~~~Wd~s~Wd~~~Ws--~~~~v~~~~~s~~g~G~~is~~~~g~a~~~~~~~~~d 527 (536) T protein:vir:35 456 QARYRYDF----AELAPVSAMG--GGSGTWDGSTWDVDVWS--GEYQASQQVRGGTGVGVDLAIAIRGTAVARTVLVGID 527 (536) T ss_pred EEEEEecc----CCCCCcCCCC--CCcccCCcccCCceecC--CcceeEeeeeEeccceEEEEEEEeeccccceEEEEEE Confidence 1233332 2222211111 113344444332 222 4677777888888888888899998888899999999 Q ss_pred EEeC Q lcl|NC_018275. 458 IRLE 461 (461) Q Consensus 458 ~~~e 461 (461) +..| T Consensus 528 ~~~e 531 (536) T protein:vir:35 528 ILFT 531 (536) T ss_pred EEEe Confidence 9999 No 13 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=97.93 E-value=1e-05 Score=47.91 Aligned_cols=441 Identities=14% Similarity=0.159 Sum_probs=205.1 Q ss_pred CCCccccCce-------eEeeeecccccccccc--cccceeE-eCCCceec-ccC----------CCcccceEEEE---- Q lcl|NC_018275. 1 MGKDFKNADY-------IDYLPINMLATPKEVL--NSSGYLR-SFPGIAKR-NDV----------NGVSRGVEYNT---- 55 (461) Q Consensus 1 ~~~~~~~~d~-------~~~~pvn~~a~~~~~~--~s~~~L~-~~PGl~~~-~~v----------~G~~rG~~y~~---- 55 (461) -+...+++.| +++=-+||-|.....+ --.|+|. .-|-+..+ -++ +=..|-..|.+ T Consensus 101 t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~~t~s~t~~~ll~r~r~f~~qg~d 180 (715) T protein:vir:26 101 STDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD 180 (715) T ss_pred cCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEecCCcceeEeeEEEEEeeeheeeccc Confidence 5666777777 6666777777763222 2223433 33433211 011 00111111111 Q ss_pred -ecCeEEE-----EeCCeEEeccceEEeecCcccEEEEeCCeEEEEEECC------------------------------ Q lcl|NC_018275. 56 -AQNAVYR-----VLGSKLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNG------------------------------ 99 (461) Q Consensus 56 -~~~~lY~-----V~G~~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g------------------------------ 99 (461) .-+.+|- +.-..+|.-- -.-.+.++++-+-|--+--||-.. T Consensus 181 ~~~g~~y~~~gt~~tn~~iynly---N~gw~~p~gt~~~N~~~~yiVypa~s~~~~S~kd~n~afsk~ad~ei~tGt~~~ 257 (715) T protein:vir:26 181 VDVTSLYFGEGTSVSNQRIYDTY---NVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANGAFNKADWLEIYTGSSLA 257 (715) T ss_pred cccccccccCCcccCchhheecc---cceeecceeEEEEcCCCCceEecccccccCCCcccccccChhhccccccccccc Confidence 1112221 1111222200 001122333333332221111111 Q ss_pred cEEEEEeecccccceeccccccccCccCCccccceeccceEEEE----eeCCceEEEeccCCcc--ccccCcc-eeEEec Q lcl|NC_018275. 100 QLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYAWS----KDGTDSWFITDLEDES--HPDRYSA-QYRAES 172 (461) Q Consensus 100 ~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV~~----~~gt~~f~iS~L~d~t--~~d~~~~-f~tAE~ 172 (461) ..-.|..|+..+.-+.. .+.. .-+.+++||+-.|+-.|. -+...++..|.|-+-. -++-|.. =-|+|. T Consensus 258 ~~G~yi~D~~~~g~~~l--eeev---~k~R~rsv~~yaGrV~yagiD~dkng~rilfSqLv~s~~di~nCyQd~DPTsee 332 (715) T protein:vir:26 258 SNGHYVLDVFNKARTGL--TTEV---ETGRFRSVAAYAGRVFYAGIDSAKNGGKVYFSRLTERMSDVGNCYQVNDPTSEV 332 (715) T ss_pred cCceEEEeeeecCCccc--hhhh---hcCCCcceeeecceEEEeecccccCCCeEEEehhhcchhhcccccccCCCchhh Confidence 11234444444433211 1100 134567899999993232 2344578889886421 1111100 012333 Q ss_pred CC---------------CceEEEEecCCEEEEEEcceEEEEEecCCC---CCccCcccccccceEEeccccchhhhccCc Q lcl|NC_018275. 173 QP---------------DGIIGIGTWRDFIVCFGSSTIEYFSLTGAT---TAGAALYVAQPSLMVQKGIAGTYCKTPFAD 234 (461) Q Consensus 173 ~P---------------D~iv~v~~~~~~l~lfG~~T~Evw~ntG~~---~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~ 234 (461) -| -.|+.|..++..|++|++..+ |...|.. ++..|...+.. ++||-+|.|+.-+|+ T Consensus 333 ~~dLidTDGg~iri~gah~ii~Lv~f~~sLlvf~~NGV--WAi~G~d~g~tATdY~ltKIs----~vg~sspnSvVvv~~ 406 (715) T protein:vir:26 333 LSDLLDTDGGVVRIPDAHNIRKLHVLGASLLVFAENGV--WAVAGVDNVFRATEYAITRIS----DVGLSNENSFVVADG 406 (715) T ss_pred hhhhhhcCCCEEEecCCCCceeEEEecceEEEEEecce--EEEeccCCceeeeeeEEEEee----eeccCCCccEEEecc Confidence 33 368899999999999999998 9994322 12334444432 589999999999999 Q ss_pred eEEEEeeccccceEEEE-ccCceeeecCCHHHHHHHHhcCcccccceEEEEE-EECCEEEEEEEcCC-e---------EE Q lcl|NC_018275. 235 SYAFISHPATGAPSVYI-IGSGQASPIATASIEKIIRSYTADELATGVMEAL-RFDSHELLIIHLPR-H---------VL 302 (461) Q Consensus 235 s~~wlg~d~~g~~~Vy~-l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty-~~~GH~fyvlt~P~-~---------Tw 302 (461) +++|.|.++-.+..+.. +|.+.+|-|+...||+.+++.+. +...++.-.| ..|++ |.+.+|| . .+ T Consensus 407 ~i~~WsdtGIyal~~Nd~fn~~tAqNLTekTIq~~~~~I~~-dk~knVtg~fd~~e~r--VyW~yPn~dt~vdykyd~vL 483 (715) T protein:vir:26 407 IPIWWGKTGIYAVQQSENLNTPTAQNLSLSTIQTLWNNISN-AKKAQVTVEYDKINQR--VFWFYPDNDESVDYKYNNIL 483 (715) T ss_pred eEEEeeCCcEEEEEeccccCcchhhccchHHHHHHHhhcch-hhhcceEEEEEccCCE--EEEEEcCCceeeceeecCeE Confidence 99999999988888888 99999999999999999999985 4444454444 45777 5666773 2 57 Q ss_pred EEEccccCCcceeeeecCCccccceEEEEEEe------cCCeEEEEE--cCCC--------------------------e Q lcl|NC_018275. 303 VYDASSSQNGPQWCVLKTGLYDDVYRAIDFMY------EGNQIACGD--KSEA--------------------------V 348 (461) Q Consensus 303 ~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~------~~g~~~vGD--~~~G--------------------------~ 348 (461) |+|++++-..+ |.+..+-......-+..... -..|.+.|- ..+| . T Consensus 484 V~dLalgaFYp-~~v~~~a~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~~~r~~~~~~~~~~~~~~~~~ 562 (715) T protein:vir:26 484 VMDLALQAFYP-WRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVATLYRDYLEGDSEIKLLVRDGT 562 (715) T ss_pred EEEeccccccc-ccccccccccceeeeeeeeCCcccccchhheeccceEEEeccceEEEEeecccccccceEEEEEEcCC Confidence 89999987766 54443311111111111100 011111111 1122 2 Q ss_pred EEEEcCCccCcCCCEEEEE--------------------EeeccccCCCceEE-EEEEEEEcCCCCCchh--he---eee Q lcl|NC_018275. 349 TGQLQFDISSQYDKQQEHL--------------------LFTPLFKADNARCF-DLEVESSTGVAQYADR--LF---LSA 402 (461) Q Consensus 349 l~~ld~~~~td~g~p~~~~--------------------~~tP~~~~~~~rv~-~~~le~~~Gv~q~~~~--~~---ls~ 402 (461) -++|.+..++++-=.-+.. -..|++.+. .|+- |=-||-..|-.-+++- || .+| T Consensus 563 ~~~~~f~~~~~~~~~dw~s~d~~~~~~~gy~~~gd~~~~k~~pyvt~~-~~~tedg~v~~~~g~~p~n~sSclm~~sw~w 641 (715) T protein:vir:26 563 TGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITTFKNAPYVTTY-MRVTEDGYVASGAGYEFINPSSCLMSVSWNL 641 (715) T ss_pred ceeEEEecccCceeeeccccchhhHHHhhhhhcccceeeecCceEEEE-EEEecccceeccCCccccCCcceEEEEEeee Confidence 2233333333321111110 001111100 0000 1124444443222222 44 334 Q ss_pred ccCccc------cCcceeeccCCCcc--cc--eeE-EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 403 TTDGIN------YGREQMIEQNEPFV--YD--KRV-LWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 403 sdDG~~------~~~~~~~~~g~~g~--y~--~R~-~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |.-+++ +.+.+.+.-|.... |+ +-+ +.|--|+.|.- .|+|.--..-..-|.|-+|=-- T Consensus 642 s~s~st~~eaYk~~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~-~~rf~s~~gKdlhl~Gysilg~ 710 (715) T protein:vir:26 642 SKSGSTPREIYKLKDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSM-KFRFESVAGKDFHLVGYEVIGA 710 (715) T ss_pred ccCCCChhhhheecceeeeCCCccccccCCcceeEeeeeeeccceEE-EEEEEecCCcceEEEeEEEEec Confidence 433332 11111221122222 11 111 22233443332 3555544455556666655444 No 14 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=94.36 E-value=0.0046 Score=33.39 Aligned_cols=304 Identities=13% Similarity=0.074 Sum_probs=119.3 Q ss_pred CC-Cc-cccCceeEe-----e-eecccccccccccccceeEeCCC-ceecccCCC-cccceEEEEec---Ce--EEEE-- Q lcl|NC_018275. 1 MG-KD-FKNADYIDY-----L-PINMLATPKEVLNSSGYLRSFPG-IAKRNDVNG-VSRGVEYNTAQ---NA--VYRV-- 63 (461) Q Consensus 1 ~~-~~-~~~~d~~~~-----~-pvn~~a~~~~~~~s~~~L~~~PG-l~~~~~v~G-~~rG~~y~~~~---~~--lY~V-- 63 (461) ++ -. .+.+++|-- . +.+..-+....+.-. .++..-. +...++++. ++.|....+++ +. -|-| T Consensus 319 ~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~-~~~~~~~~v~~~~~Lp~~a~~g~~v~v~~~~~~~~~~Yyv~~ 397 (680) T protein:vir:17 319 LGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGT-GLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTEVDDYYVKF 397 (680) T ss_pred cCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCce-eeeeeeeeeccccccccccCCCcEEEEEeCCCCcccceEEEE Confidence 11 00 001111000 0 000000111100000 0000000 000011111 00110000100 00 0111 Q ss_pred e---------CCeEEe--cc--ceEEeecCcccEEEEeCCeE--EEEEECCcEEEEEee--cccccceeccccccccCc- Q lcl|NC_018275. 64 L---------GSKLYK--GE--TVVGDVAGSGRVSMAHGRTS--QAVGVNGQLVEYRYD--GTVKTVSNWPADSDYTQY- 125 (461) Q Consensus 64 ~---------G~~Ly~--v~--~~iG~i~gsg~VsMa~N~~~--~avv~~g~~~~Y~yd--g~~~~~~~~~~d~~~~~~- 125 (461) . +.+-+. ++ ...+-..++=|.-+.+++.- .....+++...-.|. ....+.+| .+|.+ T Consensus 398 ~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tn-----p~psF~ 472 (680) T protein:vir:17 398 ETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTN-----PHPTFT 472 (680) T ss_pred eccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccC-----CCcccc Confidence 0 001111 00 01111112223333332211 111112111000011 01111111 12222 Q ss_pred cCC-ccccceeccceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcc Q lcl|NC_018275. 126 ELG-SVRDITRLRGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSS 193 (461) Q Consensus 126 dl~-~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~ 193 (461) +-| -+.+|+|..+|++|..| +....|.-.|.. +-|+. ++.-+..+++.|.-++.+++.|+||... T Consensus 473 ~~G~~p~~v~f~q~RL~f~s~--~~v~~Srtgd~~nF~~~t~~~~~DdD~I-~~~~ss~~~~~i~~~v~~~~~L~l~t~g 549 (680) T protein:vir:17 473 ESGNGIYGMFMYKNRLGFLTQ--DAVIMSQVGDYFNFYATSGVTISDADPI-DMATSDTKPVKLEAAISSTSGAILFGNQ 549 (680) T ss_pred cCCCCceEEEEEcceEEEeeC--CeEEEEccCCcccccccccccCCCCccE-EEEEcCCcceeeeEEeecCCcEEEEecC Confidence 112 26679999999988743 234445433322 22333 2666677788888899999999999885 Q ss_pred eEEEEEecCCCCCccCcccccccceE-EeccccchhhhccCceEEEEeeccccceEE------EEccCceeeecCCHHHH Q lcl|NC_018275. 194 TIEYFSLTGATTAGAALYVAQPSLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSV------YIIGSGQASPIATASIE 266 (461) Q Consensus 194 T~Evw~ntG~~~~~~fp~~~~~~~~I-~~Gca~~~sv~~~~~s~~wlg~d~~g~~~V------y~l~g~q~~rIST~~IE 266 (461) -| |..+|..+ ++......--.+ ..+|+..-.-..++++++|++..+. --.| +.-++|+++.+ |.-++ T Consensus 550 -~q-~~ls~~~~--~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~-~s~vre~~y~~~~d~y~a~Dl-T~~a~ 623 (680) T protein:vir:17 550 -AQ-FRLSSPDE--SFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGT-YSSVYELSTESAKGTPVIEDS-SRVIP 623 (680) T ss_pred -eE-EEEecCCc--eecceeEEEEEEEeecccCCCCceEeCCeEEEeecCCC-cceEEEEeeeeccCceehhhH-HHHHH Confidence 33 66666433 233322111112 3578877777789999999998752 1223 44466777777 33456 Q ss_pred HHHHhcCcccccceEEEEEEECCEEEEEEEcCC-eEEEEEc---cccCCcceeeeecCCccccce Q lcl|NC_018275. 267 KIIRSYTADELATGVMEALRFDSHELLIIHLPR-HVLVYDA---SSSQNGPQWCVLKTGLYDDVY 327 (461) Q Consensus 267 ~~i~~y~~~el~~A~~~ty~~~GH~fyvlt~P~-~Tw~yD~---~t~~w~e~w~~~~tg~~~~~~ 327 (461) .+|+.- +.. ++.++.+.+.++....-+ .-++|-- .-.+...-||.-.=+ +-.| T Consensus 624 hl~~g~----v~~--~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~--~~d~ 680 (680) T protein:vir:17 624 RLIPSG----LTW--STASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYE--DQDH 680 (680) T ss_pred HhcCCc----eEE--EEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecC--CCCC Confidence 666542 222 455678887665554433 4445432 122211235532221 2233 No 15 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=93.88 E-value=0.0061 Score=32.73 Aligned_cols=429 Identities=12% Similarity=0.062 Sum_probs=179.2 Q ss_pred CCCccccCceeEee--eecccccccccccccceeEeCCCceecccCCCcc-cceE----EEEecCeEEEE---------- Q lcl|NC_018275. 1 MGKDFKNADYIDYL--PINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVS-RGVE----YNTAQNAVYRV---------- 63 (461) Q Consensus 1 ~~~~~~~~d~~~~~--pvn~~a~~~~~~~s~~~L~~~PGl~~~~~v~G~~-rG~~----y~~~~~~lY~V---------- 63 (461) |---.+.+=|-+.+ =-||++.| .|.+++=||+...+.+++.. +.+| |+..+.-+.+. T Consensus 18 l~~r~Dl~~y~~~~~~~~n~~~~~------~G~~~rR~G~~~~~~~~~~~~~~~lipF~~s~~~~~~le~g~~~~r~~~~ 91 (594) T protein:vir:10 18 LQFNEYESAYHHSIEDAVNFVVTE------QGSLITRCGSEEVGLCQDGEVRLFRLPAVDAPSNDVIVEVGNTNIAVWVN 91 (594) T ss_pred eccchhHHHHHHHHhhhhceEEEe------cCCeecCChhHhhhhccCCCCCEEEEEEEeCCCCeEEEEEcCCeEEEEec Confidence 11111111111111 13455444 34466677777766665543 2222 22111112111 Q ss_pred -------eCCeEEeccceEE-e-ecCcccEEEEeCCeEEEEEECCcE--EEEEeecccccceecc-ccccccCccCCccc Q lcl|NC_018275. 64 -------LGSKLYKGETVVG-D-VAGSGRVSMAHGRTSQAVGVNGQL--VEYRYDGTVKTVSNWP-ADSDYTQYELGSVR 131 (461) Q Consensus 64 -------~G~~Ly~v~~~iG-~-i~gsg~VsMa~N~~~~avv~~g~~--~~Y~ydg~~~~~~~~~-~d~~~~~~dl~~~~ 131 (461) .++..|...++.- + .+....+.++.++..+-++..... ..|+.......+..+. ....+...+-+-+. T Consensus 92 ~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~~~~~w~~~~~~~~~~p~~~~~~~~p~ 171 (594) T protein:vir:10 92 DVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNAWQFVNMHTGAVPAEWSPSNYPQ 171 (594) T ss_pred CcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEccCCCceEEecccCcccccccCCccce Confidence 1111111111110 0 011233455555554444433322 2333333333332221 11112222334455 Q ss_pred cceeccceEEEEeeCC--ceEEEeccCCc---------cccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEe Q lcl|NC_018275. 132 DITRLRGRYAWSKDGT--DSWFITDLEDE---------SHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSL 200 (461) Q Consensus 132 ~v~~~dGyfV~~~~gt--~~f~iS~L~d~---------t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~n 200 (461) .|+|...|.+|.-.-. +.+..|.-.|- .+-|+++ | +-++-+.++.++...+.|++|.+..- |.. T Consensus 172 ~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~~~ddd~i~-~--~~s~~~~~~~~v~~~~~L~i~t~~~e--~~l 246 (594) T protein:vir:10 172 TVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPIS-F--VGIMEGTPCWIIASSDVLTIGTTIND--YQL 246 (594) T ss_pred EEEEEeeeEEEEeCCCCCceEEEEecccccccccCCCCCCCccEE-E--EEecccceEEEEecCCceEEEecCce--EEE Confidence 7889999977753211 22333322221 1222321 4 44455788888889888887765554 666 Q ss_pred cCCCC----CccCcccccccceEEeccccchhhhccCceEEEEeeccccceEEEE------ccCceeeecCCHHHHHHHH Q lcl|NC_018275. 201 TGATT----AGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYI------IGSGQASPIATASIEKIIR 270 (461) Q Consensus 201 tG~~~----~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~------l~g~q~~rIST~~IE~~i~ 270 (461) +|.++ +...-..++ + ..||.+- --..++++++|++..+. .|+- .++|+++.+|-+ ++.++. T Consensus 247 ~~~~~~~lTp~~~~~~~~-s---~~g~~~~-~P~~vg~~~~fv~~~g~---~vre~~y~~~~d~y~~~dlt~~-a~hl~~ 317 (594) T protein:vir:10 247 AASTGVSVTAATAILRRS-S---VQGTAAV-QGIPAEEQVIFCSRNKS---KVYAMNYVREQDNWIPDEMSSQ-AQHLFT 317 (594) T ss_pred ecCCCcccccceEEEEEe-e---eeccCCC-cceeeCCeEEEEcCCCC---EEEEEEEeeccCceeccchhhh-hhhhcC Confidence 66543 111222222 1 2476432 33578999999987653 3333 457888888654 455543 Q ss_pred hcCcccccceEEEEEEECCEEEEEEEcCC---eEEEEEccccCCcceeeeecCCccccceEEEEEEecCC---e-EEE-- Q lcl|NC_018275. 271 SYTADELATGVMEALRFDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGN---Q-IAC-- 341 (461) Q Consensus 271 ~y~~~el~~A~~~ty~~~GH~fyvlt~P~---~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g---~-~~v-- 341 (461) .-...+-..-+..+|+++-+.++.+...| ..|.|+-...++ -||.-+.. +..-+..|.++... - .+| T Consensus 318 ~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~eq~v~--aWs~~~~t--~G~v~~va~i~~~~~d~l~~~V~R 393 (594) T protein:vir:10 318 PISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFDRTTDTK--AWTQLELS--GGKVIDIAAAFNPDSDYAYVAVVR 393 (594) T ss_pred ccccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEeccccee--eeEeeccC--CCcEEEEEEeecCCCCEEEEEEEE Confidence 21100112235566777777777777665 366777655444 57755421 12344444443211 0 111 Q ss_pred EEcCCCe------EEEEcCCccCcC------------------------------------------------------- Q lcl|NC_018275. 342 GDKSEAV------TGQLQFDISSQY------------------------------------------------------- 360 (461) Q Consensus 342 GD~~~G~------l~~ld~~~~td~------------------------------------------------------- 360 (461) .+..+|. |=+|+. .... T Consensus 394 ~~ti~g~~~~y~~lE~~~~--~~~~~~~~~~~~d~~~~~~~~vsgl~hLeg~tv~v~aDG~~~~~~~V~~g~itL~~~~~ 471 (594) T protein:vir:10 394 SKAINGVQKNYTVLEKISS--PRTDWKRADGWVVAQVNQNGDVLNLDRYIGRTAVIFSKYGLEAEVEVNNIGLTHRINGY 471 (594) T ss_pred CCccccceeeEEEeecCCC--ccccccccceeeeecccccceeecccccCCceEEEEeCCeecCCeEEcCCeeEeeccCC Confidence 1111110 111111 1111 Q ss_pred --------CCEEEEEEeeccccC---------CCceEEEEEE--EEEcC--CCCCchhheeeeccCccccCcceeec-cC Q lcl|NC_018275. 361 --------DKQQEHLLFTPLFKA---------DNARCFDLEV--ESSTG--VAQYADRLFLSATTDGINYGREQMIE-QN 418 (461) Q Consensus 361 --------g~p~~~~~~tP~~~~---------~~~rv~~~~l--e~~~G--v~q~~~~~~ls~sdDG~~~~~~~~~~-~g 418 (461) |.|+.+.+.++.+.. .+.||..+.| +-+.| ++.. +.. ++-.++ .+ +.+... .| T Consensus 472 ~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~r~ri~r~~v~~~~S~g~~vg~~-~~~-~r~~~~--~~-~~~~~~~~g 546 (594) T protein:vir:10 472 DPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGSKIRISKVQLALFDSIEPTVNGE-PAD-DRSTDD--IM-DARLLDFSS 546 (594) T ss_pred CCcceEEEeeeeeEEEEeecccccCCcccccCccEEEEEEEEEEEcceeeEECCc-ccc-cccchh--hc-cccCCcccC Confidence 111111111111100 0001111110 01101 0000 000 000000 00 000000 00 Q ss_pred --CCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 419 --EPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 419 --~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) ...-.+.++++..+|..++. -.+|+-+.|-|.+|.+..+++| T Consensus 547 ~~~~~tg~~~v~~~~~G~~~~~-~i~I~qd~PlPltvlai~~ev~ 590 (594) T protein:vir:10 547 NSGSSNGTRLVDYNPLGWENDG-KMVIAVEQPFLCEVVGVFSVVQ 590 (594) T ss_pred cccccCCceEEEEccCCcCccc-EEEEEECCCcCEEEEEEEEEEE Confidence 11122445667777875555 4678888888888888888888 No 16 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=81.41 E-value=0.086 Score=26.43 Aligned_cols=426 Identities=10% Similarity=0.087 Sum_probs=151.7 Q ss_pred CCCccccCc-----------eeEeeeecccccccccccccce--eEeCCCceecccCCCcc-cceEEEEec---C--eEE Q lcl|NC_018275. 1 MGKDFKNAD-----------YIDYLPINMLATPKEVLNSSGY--LRSFPGIAKRNDVNGVS-RGVEYNTAQ---N--AVY 61 (461) Q Consensus 1 ~~~~~~~~d-----------~~~~~pvn~~a~~~~~~~s~~~--L~~~PGl~~~~~v~G~~-rG~~y~~~~---~--~lY 61 (461) -+-.+...+ .++.+. .+.+...... ..... ..++.++.-+ .|....+.+ + .-| T Consensus 209 ~~~t~~~~~~~~~i~a~~~~~~~~~t------~~~g~~~t~~~~~~~~~--~~~~~lp~~~~~G~~v~i~~~~~~~~~~Y 280 (794) T protein:vir:22 209 SDWTVNVGQGFIHVTAPSGQQIDSFT------TKDGYADQLINPVTHYA--QSFSKLPPNAPNGYMVKIVGDASKSADQY 280 (794) T ss_pred ccceEEeCCceEEEEEcCCceEEEEe------eecccCcceeEEEEecc--ccceeccccCCCCeEEEEEeCCCCCccee Confidence 011111111 111110 0011000000 00000 0111222111 232211111 1 113 Q ss_pred EE---eCCeEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceecccccc--ccCccCCccccceec Q lcl|NC_018275. 62 RV---LGSKLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSD--YTQYELGSVRDITRL 136 (461) Q Consensus 62 ~V---~G~~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~--~~~~dl~~~~~v~~~ 136 (461) -| ..++.|+-....|.+.+-..-.|.+. ++-..++. +...-.......+..+.+ +|.+---.+.+|+|. T Consensus 281 ~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~---lv~~~~~~---~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v~f~ 354 (794) T protein:vir:22 281 YVRYDAERKVWTETLGWNTEDQVLWETMPHA---LVRAADGN---FDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFF 354 (794) T ss_pred EEEEeccceEEEEeeeccceeeecccceeeE---eeeccCCc---EEEeeccccccccCccccCCcceecCCCcceEEEE Confidence 22 12222321111111112222223221 11111221 111111111112222222 222222224679999 Q ss_pred cceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCC Q lcl|NC_018275. 137 RGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATT 205 (461) Q Consensus 137 dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~ 205 (461) .+|++|..+ +....|.-.|-. +-|+. ++..+..+++.|.-++.+++.|+||.+..- |..+|+. T Consensus 355 q~RL~f~~~--~~v~~Srtgd~~nF~~~t~~~~~DdD~i-~~~~ss~~~~~i~~~v~~~~~L~i~t~~~e--~~l~~~~- 428 (794) T protein:vir:22 355 RNRLGFLSG--ENIILSRTAKYFNFYPASIANLSDDDPI-DVAVSTNRIAILKYAVPFSEELLIWSDEAQ--FVLTASG- 428 (794) T ss_pred cceEEEecC--CeEEEEccCCccccccccCcCCCCCccE-EEEecCCcceeeEEEeecCCcEEEEecCcE--EEEeCCC- Confidence 999988642 233344333322 22332 256666677777778999999999966554 7777642 Q ss_pred CccCcccccccceE-EeccccchhhhccCceEEEEeeccccceEEEE-------ccCceeeecCCHHHHHHHHhcCcccc Q lcl|NC_018275. 206 AGAALYVAQPSLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI-------IGSGQASPIATASIEKIIRSYTADEL 277 (461) Q Consensus 206 ~~~fp~~~~~~~~I-~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~-------l~g~q~~rIST~~IE~~i~~y~~~el 277 (461) ++....-.--.+ ..+|+..-.-..++++++|++..+.. -.|+| .++|+++.+| .-++..|+.- + T Consensus 429 --~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~-~~~~r~~~~~~~~d~y~~~Dlt-~~~~~~~~~~----~ 500 (794) T protein:vir:22 429 --TLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSF-TSIHRYYAVQDVSSVKNAEDIT-SHVPNYIPNG----V 500 (794) T ss_pred --cccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCe-eEEEEeEeeecccCceehhhHH-HHHHHhcCCc----e Confidence 233322111111 35788877888999999999987632 22322 5577777773 3456666542 2 Q ss_pred cceEEEEEEECCEEEEEEEcCC-eEEEEE---ccccCCcceeeeecCCccccceEEEEEE-ecCCeEEEEEcCC-CeEEE Q lcl|NC_018275. 278 ATGVMEALRFDSHELLIIHLPR-HVLVYD---ASSSQNGPQWCVLKTGLYDDVYRAIDFM-YEGNQIACGDKSE-AVTGQ 351 (461) Q Consensus 278 ~~A~~~ty~~~GH~fyvlt~P~-~Tw~yD---~~t~~w~e~w~~~~tg~~~~~~Ra~~~~-~~~g~~~vGD~~~-G~l~~ 351 (461) .. +..++.+.+.++....-+ .-.+|- ....|..--||.-.++ ...+..|+. ..+-.+++-.... +.+-+ T Consensus 501 ~~--~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~---g~~~~~~~~~~~d~l~~iv~r~~~~~~~r 575 (794) T protein:vir:22 501 FS--ICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFG---ENVQVLACQSISSDMYVILRNEFNTFLAR 575 (794) T ss_pred EE--EEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcC---CCEEEEEEEecCCEEEEEEEeCCCEEEEE Confidence 22 334556666555555444 333433 2232222248877763 344444443 2344555555333 33444 Q ss_pred Ec--CCccCcCCCEEEEEEee------cc--ccCCCceEEEEEEEEEcCCCCCchhheeeeccC---------------- Q lcl|NC_018275. 352 LQ--FDISSQYDKQQEHLLFT------PL--FKADNARCFDLEVESSTGVAQYADRLFLSATTD---------------- 405 (461) Q Consensus 352 ld--~~~~td~g~p~~~~~~t------P~--~~~~~~rv~~~~le~~~Gv~q~~~~~~ls~sdD---------------- 405 (461) ++ .+..+..++|....+-. |- .+ +...+-.+.+....|...-.-+.. ..-.| T Consensus 576 ~~~~~~~~~~~~~~~~~~lD~~~~~~~~~g~~~-~~~~~t~~~~~~~~g~~~~~g~~v-~~~~dg~~~~~~~~~~~~~~~ 653 (794) T protein:vir:22 576 ISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYN-DDTFTTSIHIPTIYGANFGRGKIT-VLEPDGKITVFEQPTAGWNSD 653 (794) T ss_pred EEEeeccccCCCccceeeeeeeEEEeeccceee-cCCcceEEEcccccCcccccceEE-EEEcCCceeeceeeeeeeecc Confidence 33 12222223332111000 00 00 000000011111111100000000 01111 Q ss_pred ----------------ccccCcceeec------c-C-CCccc--ceeEEEEeeE-ecccceeEEEEEEecCc---ceEEE Q lcl|NC_018275. 406 ----------------GINYGREQMIE------Q-N-EPFVY--DKRVLWKRVG-RIRRLIGFKLRVITKSP---VTLSG 455 (461) Q Consensus 406 ----------------G~~~~~~~~~~------~-g-~~g~y--~~R~~~~rlG-~~r~~v~f~~r~~~~~p---~~l~g 455 (461) |-.|..+.... . | ..+.. .-|++.+|.= .+.+--+|++++..+.+ +.+.+ T Consensus 654 ~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~ 733 (794) T protein:vir:22 654 PWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAG 733 (794) T ss_pred ceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEEeccccceEEEEcCCCcccceeecC Confidence 22222211110 0 0 00000 1122222211 00111124444433221 11111 Q ss_pred eEEEeC Q lcl|NC_018275. 456 CQIRLE 461 (461) Q Consensus 456 a~~~~e 461 (461) ..+... T Consensus 734 ~~~g~~ 739 (794) T protein:vir:22 734 ARLGSN 739 (794) T ss_pred ceeccc Confidence 111100 No 17 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=78.74 E-value=0.11 Score=25.81 Aligned_cols=405 Identities=13% Similarity=0.107 Sum_probs=142.5 Q ss_pred CCCccccC-ceeEeeeecccccccccccccceeEeCCCceecc---cC-----CCcccceEEEEec-------CeEEEE- Q lcl|NC_018275. 1 MGKDFKNA-DYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRN---DV-----NGVSRGVEYNTAQ-------NAVYRV- 63 (461) Q Consensus 1 ~~~~~~~~-d~~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~~~---~v-----~G~~rG~~y~~~~-------~~lY~V- 63 (461) .|+-++-+ .-++.++ .... ...+..+..+.+.+....+ .. +..+.|..+..++ +..|++ T Consensus 193 vG~~i~~~~~~v~si~--~~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~ 268 (825) T protein:vir:73 193 VGKLFYLEQPAVDSVP--VWET--SKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYL 268 (825) T ss_pred cCeEEEEecccccccc--eeee--eeEEEeeeEEECCCceeeeecccccceeeccccCCceeEeeeeecccCCceEEEEE Confidence 11111100 0000000 0000 0001111111122111110 00 0111111110000 011111 Q ss_pred -eCCeEEeccceEE---eecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccce Q lcl|NC_018275. 64 -LGSKLYKGETVVG---DVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGR 139 (461) Q Consensus 64 -~G~~Ly~v~~~iG---~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGy 139 (461) .+.+..+-....+ ...+.....|.++ ++.+.. ..+ + .....|-...+|| ..|+|..+| T Consensus 269 ~~~~g~~~it~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~t~--~---~~~~~~~~~~gyP-------s~v~f~q~R 330 (825) T protein:vir:73 269 HSGFGIAKITAVAGDGLTATADVVSFIPSQ-----VVGSAN-ASY--K---WAKYAWNSVNGYP-------STVVYYQQR 330 (825) T ss_pred ecCCceEEEeeccccceeeccccceecccc-----cccCCC-CCc--c---cccCCcccCCCCc-------cEEEEEcce Confidence 1111111000000 0001111111111 111110 000 1 0111122222343 458898899 Q ss_pred EEEEe----------eCCceEEEeccCCc-cccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCC--- Q lcl|NC_018275. 140 YAWSK----------DGTDSWFITDLEDE-SHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATT--- 205 (461) Q Consensus 140 fV~~~----------~gt~~f~iS~L~d~-t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~--- 205 (461) ++|.- .-++.|+.=....+ .+-|+. ++.-+..+++.|.-++..+ .|++|.+. -| |..+|+.+ T Consensus 331 L~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I-~~~~s~~~~~~i~~~~~~~-~L~~~t~~-~e-~~l~~~~~~~l 406 (825) T protein:vir:73 331 LYFAASTAYPQTIWASRTGDYKDFGKNNPIQDDDRI-IYTYAGRQVNEIRHLIDVG-NLVALTSG-GE-YTISGDQNKVL 406 (825) T ss_pred EEEeecCCCCCEEEEEccCCccccccCCCCCCCccE-EEEEcCCcceeEEEEeecC-cEEEEecC-ce-EEEecCCCccc Confidence 88751 11222222212111 123333 3666777788887788875 56666655 55 45565432 Q ss_pred -CccCcccccccceEEeccccchhhhccCceEEEEeeccccc---eEEEEccCceeeecCCHHHHHHHHhcCcccccceE Q lcl|NC_018275. 206 -AGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGA---PSVYIIGSGQASPIATASIEKIIRSYTADELATGV 281 (461) Q Consensus 206 -~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~---~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~ 281 (461) +...-..++ + ..||.. -.-..++++++|++..+..- ..-+..++|+++.+|-++ +.+++.. .-+ T Consensus 407 TP~~~~~~~~-s---~~g~~~-~~Pv~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlt~~a-~hl~~~~------~~~ 474 (825) T protein:vir:73 407 TPSAFSFSSQ-G---NNGSSN-VPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTILA-NHLFQKH------SIV 474 (825) T ss_pred ceeeEEEEee-e---eecccc-ccceEeCCeEEEEeCCCCeEEEEEEeeecCceeccchhhhh-HhhccCC------ceE Confidence 112222222 2 568864 45567899999998765310 112345677888886333 5555442 133 Q ss_pred EEEEEECCEEEEEEEcCC---eEEEEEccccCCcceeeeecCCccccceEEEEEEecCCe---EEEEEc-CCC----eEE Q lcl|NC_018275. 282 MEALRFDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ---IACGDK-SEA----VTG 350 (461) Q Consensus 282 ~~ty~~~GH~fyvlt~P~---~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~---~~vGD~-~~G----~l~ 350 (461) .++|+++.+.++.+...| ..+.|+-....| -||.-.++ ...+..|++..+++ +++=.+ .+| .|= T Consensus 475 ~~a~~~~p~~~~~~v~~dg~l~~~ty~~~q~v~--aW~~~~~~---g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yiE 549 (825) T protein:vir:73 475 DWSFCIVPYSSAFCIRDDGKLLVLTYLRDQQVF--AWAPQSSA---GKYESTCSISEGSEDAVYFVVNRTINGQTVRYIE 549 (825) T ss_pred EEEEcCCCceEEEEEecCCeEEEEEEeccccce--eeEEEecC---CcEEEEEEecCCCccEEEEEEEEeeCCceEEEEE Confidence 466788888888777775 467787554433 58877773 57888888865331 221111 111 122 Q ss_pred EEcCCccCcCCCEEEEEE----------eeccccCCCceEEEEEEEEEcCCCCCchhheeeeccCccccCcceeeccCCC Q lcl|NC_018275. 351 QLQFDISSQYDKQQEHLL----------FTPLFKADNARCFDLEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEP 420 (461) Q Consensus 351 ~ld~~~~td~g~p~~~~~----------~tP~~~~~~~rv~~~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~~~g~~ 420 (461) +|+....++..+-. ++- .+.+.|-++..+. +.. +. .+-+-.++.-++...+.....|=+ T Consensus 550 ~~~~~~~~~~~~~~-~vD~g~~~~g~~~~~~l~~l~g~tv~---~~~-----~g--~~~~~v~~g~itl~~~~~~~i~l~ 618 (825) T protein:vir:73 550 RLSSRLFTNDEDAF-FVDCGLSYDGRNTSSRTMTISGGTGD---WSY-----QV--DYPVTVSGGAYFVNTDVGAQIQFP 618 (825) T ss_pred EecccccCCCccee-EEEEEeeecccceeeceeeeCCceEE---EEe-----CC--eEEEEEcCCeEEecccceEEEEec Confidence 23322222221100 000 0111111111110 000 00 000000111112222211111111 Q ss_pred ccc---------ceeEEEEeeEec-ccceeEEEEEEecCcceEEEeEEEe--------------C Q lcl|NC_018275. 421 FVY---------DKRVLWKRVGRI-RRLIGFKLRVITKSPVTLSGCQIRL--------------E 461 (461) Q Consensus 421 g~y---------~~R~~~~rlG~~-r~~v~f~~r~~~~~p~~l~ga~~~~--------------e 461 (461) ... ... +..+.+++ +.++ .+++.-...|-.+-+|.+-- | T Consensus 619 ~~~~~~~~~~~~~~~-~~~~i~~~~~~~~-v~v~~~~~~~a~~~~~~~t~~~~a~~~~~gL~hLe 681 (825) T protein:vir:73 619 YTGTDPDTNEPVAKE-LRGDIISVTSNTA-VVVRFNRNVPPVLRNVATTNWQMARQTFSGLAHLE 681 (825) T ss_pred ccCcccccccceece-eeEEEccccCceE-EEEEecccccceeeeecccCCCcchheeccccccC Confidence 100 000 01111111 1112 22333233333333333221 1 No 18 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=78.43 E-value=0.11 Score=25.75 Aligned_cols=435 Identities=11% Similarity=0.095 Sum_probs=153.0 Q ss_pred CCC---cc-ccCc--eeE-----eeeecccccccccccccceeEeCCCce-ecccCCCc-ccceEEE-----EecCeEEE Q lcl|NC_018275. 1 MGK---DF-KNAD--YID-----YLPINMLATPKEVLNSSGYLRSFPGIA-KRNDVNGV-SRGVEYN-----TAQNAVYR 62 (461) Q Consensus 1 ~~~---~~-~~~d--~~~-----~~pvn~~a~~~~~~~s~~~L~~~PGl~-~~~~v~G~-~rG~~y~-----~~~~~lY~ 62 (461) .|. ++ +... ||. .+.++. .++++-.++ ++.-+.+ ...+++.. ..|..-. .....-|. T Consensus 205 ~~~~~~t~~~~g~~i~i~~~~~~~~~~~~----~~~~~~~~~-~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y~ 279 (800) T protein:vir:10 205 SGVNDYEIQRDGTSIFIERRDGKSFTVTT----TDGAKGKDL-VAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYW 279 (800) T ss_pred CcccceEEEEcCcEEEEEEecCCceEEEE----eecCCcceE-EEEEeeccceeeccccCCCCceEEEEcCCCCCCceeE Confidence 111 00 0001 111 112221 111111111 1100000 00111110 1110000 01111121 Q ss_pred Ee------CCeEEeccceEEeecCcccEEEEeCCeEEEEEE-CCcEEEEEeecccccceec--cccccccCccC-Ccccc Q lcl|NC_018275. 63 VL------GSKLYKGETVVGDVAGSGRVSMAHGRTSQAVGV-NGQLVEYRYDGTVKTVSNW--PADSDYTQYEL-GSVRD 132 (461) Q Consensus 63 V~------G~~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~-~g~~~~Y~ydg~~~~~~~~--~~d~~~~~~dl-~~~~~ 132 (461) |- +.+.++-....|.+.+-..=.|.+=-....++. ++.-..+..+........- -....|.+..- .-+.+ T Consensus 280 ~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~ 359 (800) T protein:vir:10 280 LQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGG 359 (800) T ss_pred EEEEeccccceEEEeecccCceeeeecccccEEEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCcee Confidence 11 112222111111111100000111111111111 2222334444443332110 01112222111 12567 Q ss_pred ceeccceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEec Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLT 201 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~nt 201 (461) |+|..+|++|.-| +....|.-.|.. +-|+. +++-+..+++.|.-++.+++.|+||.+..- |..+ T Consensus 360 v~f~q~RL~f~~~--~~v~~Srtgd~~nF~~~t~~~~~DdD~I-~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q--~~l~ 434 (800) T protein:vir:10 360 MFMVQNRLCFTAG--EAVIASRTSYFFDFFRYTVISALATDPF-DIFSDASEVYQLKHAVTLDGATVLFSDKSQ--FILP 434 (800) T ss_pred EEEEeeeEEEeeC--CeEEEEccCCccccccccccCCCCCccE-EEEEcCCcceeeeeEeecCCcEEEEecCcE--EEEe Confidence 9999999988742 333344333322 22333 266667778888888999999999976655 7777 Q ss_pred CCCCCccCcccccccceE-EeccccchhhhccCceEEEEeeccccceEEE------EccCceeeecCCHHHHHHHHhcCc Q lcl|NC_018275. 202 GATTAGAALYVAQPSLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPIATASIEKIIRSYTA 274 (461) Q Consensus 202 G~~~~~~fp~~~~~~~~I-~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy------~l~g~q~~rIST~~IE~~i~~y~~ 274 (461) |+. ++....-.--.+ ..+|+..-.-..++++++|++..+. -..|+ ..++|+++.+| .-++..|+. T Consensus 435 g~~---~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~-~s~vre~~~~~~~d~~~a~DlT-~~~~hl~~~--- 506 (800) T protein:vir:10 435 GDK---PLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDGS-YSGVREFYTDSYSDTKKAQAIT-SHVNKLIEG--- 506 (800) T ss_pred CCC---cccceeEEEEEEEeeeccCCCCceEeCCeEEEecCCCC-eeEEEEEeeeecccceehhhHH-hHHHHhcCC--- Confidence 742 133222111111 3578888788899999999988752 12243 33677877774 345566654 Q ss_pred ccccceEEEEEEECCEEEEEEEcC-CeEEEEEc---cccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeE- Q lcl|NC_018275. 275 DELATGVMEALRFDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVT- 349 (461) Q Consensus 275 ~el~~A~~~ty~~~GH~fyvlt~P-~~Tw~yD~---~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l- 349 (461) .+... ...+.+.+.++....- +.-++|-- .-.+...-||.-.-+.+ ....+.++ -.+.-|++=...++.. T Consensus 507 -~v~~~--~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW~~w~~~~~-~~~~~~~~-~~d~l~~iv~r~~~~~i 581 (800) T protein:vir:10 507 -NITNM--AASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWEWPMG-TKVRGMFY-SGELLYLLLERGDGVYL 581 (800) T ss_pred -ceEEE--EEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEEEEEEcCCC-cEEEEEEE-eCCeEEEEEECCCcEEE Confidence 22221 1223344433333322 23334331 12222123775442211 13333333 3566677777655543 Q ss_pred EEEcCCccCcCCCEEEEEEe---ecccc---C--------------CCceEEEEEEEEEcCCCC-Cchhhe--eeeccCc Q lcl|NC_018275. 350 GQLQFDISSQYDKQQEHLLF---TPLFK---A--------------DNARCFDLEVESSTGVAQ-YADRLF--LSATTDG 406 (461) Q Consensus 350 ~~ld~~~~td~g~p~~~~~~---tP~~~---~--------------~~~rv~~~~le~~~Gv~q-~~~~~~--ls~sdDG 406 (461) -+|++....+.+.+....+- ++.+. + +...+..+.+ .|... .+-.++ ...++++ T Consensus 582 er~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~g~v~~~~~~~~g~ 658 (800) T protein:vir:10 582 EKMDMGDALTYGLNDRIRMDRQAELIFKHFKAEDEWISEPLPWTPTNPELLDCILI---EGWDSYIGGSFLFKYKPSDNT 658 (800) T ss_pred EEEecccCccccccceeeeecceeecccccccCcceEEEeccccccCCcceEEeee---ccceeecCceeEEEEEecCCc Confidence 33655544444444322111 11111 1 1111111111 11100 001111 1222223 Q ss_pred cccCcc--------eeeccCCC----------------cc--cceeEEEEeeE-ecccceeEEEEEEecCc----ceE-- Q lcl|NC_018275. 407 INYGRE--------QMIEQNEP----------------FV--YDKRVLWKRVG-RIRRLIGFKLRVITKSP----VTL-- 453 (461) Q Consensus 407 ~~~~~~--------~~~~~g~~----------------g~--y~~R~~~~rlG-~~r~~v~f~~r~~~~~p----~~l-- 453 (461) +++-.+ ..+-.|.+ |. ...|.+.+|+= +.++--+|++++..... +.+ T Consensus 659 ~~~~~~~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~ 738 (800) T protein:vir:10 659 LSTTFDMHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLAS 738 (800) T ss_pred eEeeeeecCCCcccceEEEeeeeeEEEeecceEEEcCCCcccccCCeEEEEEEEEeecCceEEEEeccCcccceeEEccC Confidence 222110 01112211 11 11222222210 00111124443322211 111 Q ss_pred ---EEeEEEe----C Q lcl|NC_018275. 454 ---SGCQIRL----E 461 (461) Q Consensus 454 ---~ga~~~~----e 461 (461) .|..... + T Consensus 739 ~~~~g~~~~~~g~~~ 753 (800) T protein:vir:10 739 NRIGGALNNTVGYVE 753 (800) T ss_pred CeeccccccccCccc Confidence 1111111 0 No 19 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=71.10 E-value=0.2 Score=24.42 Aligned_cols=405 Identities=14% Similarity=0.153 Sum_probs=151.5 Q ss_pred CCCccccCce-eEeeeecccccccccccccceeEeCCCcee---cccCCC-----cccceEEE----EecCe---EEEE- Q lcl|NC_018275. 1 MGKDFKNADY-IDYLPINMLATPKEVLNSSGYLRSFPGIAK---RNDVNG-----VSRGVEYN----TAQNA---VYRV- 63 (461) Q Consensus 1 ~~~~~~~~d~-~~~~pvn~~a~~~~~~~s~~~L~~~PGl~~---~~~v~G-----~~rG~~y~----~~~~~---lY~V- 63 (461) +|+-++...+ +...|+-.. +.....+.+.....-.- .+.-.| ...|..+. ...+. -|++ T Consensus 193 vg~~~~l~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (823) T protein:vir:95 193 VGKLFYLEQPAVDSVPVWET----SKSTSIGDIRRADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL 268 (823) T ss_pred ccceEEEeccccceeeecce----eeeecccceEEecccceeeeeccccceeecccCCcceEEeceecccccceeEEEEE Confidence 4444443222 222222100 00001111111110000 000000 00111000 00000 1111 Q ss_pred -eCCeEEeccceEEeec-CcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccceeccceEE Q lcl|NC_018275. 64 -LGSKLYKGETVVGDVA-GSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRGRYA 141 (461) Q Consensus 64 -~G~~Ly~v~~~iG~i~-gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dGyfV 141 (461) .+.++.+.....+.+. +.....|.++ ++.+.. ..+.+ ....|....+|| ..|+|..+|++ T Consensus 269 ~~~~g~~~~t~v~~~~~~~~~~~~~~~~-----~~~~~~-~t~~~-----~~~~~~~~~g~P-------s~v~f~q~RL~ 330 (823) T protein:vir:95 269 HSGFGIARITAVNGTTATAEVISYIPSQ-----VVGEDN-ASYKW-----AKYAWNSVNGYP-------GTVVYYQQRLY 330 (823) T ss_pred eCCcceEEEEeecceeeeceEeeeeccc-----cccCCc-CCccc-----cccccCcCCCCc-------cEEEEEeceEE Confidence 1112211110111111 1111112221 111110 01111 112233333444 46889999988 Q ss_pred EEee--CCceEEEeccCC--------cc-ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCC----CC Q lcl|NC_018275. 142 WSKD--GTDSWFITDLED--------ES-HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGAT----TA 206 (461) Q Consensus 142 ~~~~--gt~~f~iS~L~d--------~t-~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~----~~ 206 (461) |.-. .-+.+..|.-.| ++ +-|+. ++.-+..+++.|.-++..+ .|++|.+. -| |..+|+. ++ T Consensus 331 f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I-~~~~s~~~~~~i~~~v~~~-~Lli~t~~-~e-~~l~~~~~~~lTP 406 (823) T protein:vir:95 331 FAASTAFPQTIWASRTGDYKDFGKSNPTQDDDRI-IYTYAGRQVNEIRHLIDVG-SLVALTSG-GE-YVITGDQNKVLTP 406 (823) T ss_pred EEEcCCCCcEEEEeccCCccccccccCCCCCCcE-EEEEcCCcceEEEEEeecC-cEEEEecC-cE-EEEEcCCCcccce Confidence 7521 011222332221 11 33333 2677778888888898886 56666666 44 5555543 22 Q ss_pred ccCcccccccceEEeccccchhhhccCceEEEEeeccccc---eEEEEccCceeeecCCHHHHHHHHhcCcccccceEEE Q lcl|NC_018275. 207 GAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGA---PSVYIIGSGQASPIATASIEKIIRSYTADELATGVME 283 (461) Q Consensus 207 ~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~---~~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ 283 (461) ...-+.++ + ..||.. -.-..++++++|++..+..- ..-+..++|+++.+| --++.+++.. . -+.+ T Consensus 407 ~~~~~~~~-s---~~g~~~-~~Pv~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlT-~~a~hl~~~~---~---i~~~ 474 (823) T protein:vir:95 407 SSFAFSSQ-G---SNGSSN-VPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLT-ILANHLFQKH---S---IVDW 474 (823) T ss_pred eeEEEEEe-e---cccccc-ccceEeCCeEEEEecCCCEEEEEEEeeecCceecchhh-hhhhhhcCCC---c---eEEE Confidence 22333333 2 568853 45567899999998765310 111334778888885 2223444332 1 2345 Q ss_pred EEEECCEEEEEEEcCC---eEEEEEccccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcC Q lcl|NC_018275. 284 ALRFDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQY 360 (461) Q Consensus 284 ty~~~GH~fyvlt~P~---~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~ 360 (461) +|+.+.+....+.+-| ..+.|+-....+ -||.-.++ ...+..|++..+. ...||.+=.. +-+ T Consensus 475 a~~~~p~~~~~~v~~dG~l~~~ty~~~q~v~--aW~~~~~~---g~~~~~~~i~~~~--------~d~l~~~v~R--~i~ 539 (823) T protein:vir:95 475 CFSIVPYSSAFCIRDDGKLLVMTYLRDQQVF--AWAPQSST---GKYESTCSISEGN--------EDAVYFVVNR--TVN 539 (823) T ss_pred EEecCCCeEEEEEecCCcEEEEEEeccccee--eeEEEecC---CcEEEEEEecCCC--------CCEEEEEEEe--ccC Confidence 5667767666666664 356777543323 57777663 4566777765322 2355554332 234 Q ss_pred CCEEEEE--EeeccccCCCceE-EEEEEEEEcCCCCCchhhe---------------eeeccCccccCcc---eeeccCC Q lcl|NC_018275. 361 DKQQEHL--LFTPLFKADNARC-FDLEVESSTGVAQYADRLF---------------LSATTDGINYGRE---QMIEQNE 419 (461) Q Consensus 361 g~p~~~~--~~tP~~~~~~~rv-~~~~le~~~Gv~q~~~~~~---------------ls~sdDG~~~~~~---~~~~~g~ 419 (461) |+....+ +.+..+..+..++ .|..+... |.+-.+...- ++. .||...... -.+.++. T Consensus 540 g~~~~yiE~~~~~~~~~~~~~~~lD~~~s~~-g~~~~~~~~~l~~g~~~l~~l~g~~v~~-adg~~~~~~~v~g~i~l~~ 617 (823) T protein:vir:95 540 GQTVRYIERLSSRLFTSDEDAFFVDSGLSYD-GRNTSDRTMTITGGSGEWDYLAEYTISV-SGGAYFTSSDVGAQLQFPY 617 (823) T ss_pred CeEEEEEEeeccccCCCccceeEEEEEEEee-cCcccceeeEecCCCCcccccCceEEEe-cCcceECCccceeEEEeCc Confidence 4443322 2223333222222 23332221 1111111111 222 122221111 0011111 Q ss_pred C-------cccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 420 P-------FVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 420 ~-------g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) + -.|.++++...... .++-.-+++--..+|..+..+..--. T Consensus 618 ~~~~~~vGl~~~~~i~~~~~~v-~~~~a~~~~~~r~v~a~l~~~~t~~~ 665 (823) T protein:vir:95 618 TGADPDTGYEVSKELRCDIISV-TSNTAVVVRANRNVPPSLRNVATTNW 665 (823) T ss_pred CCCccccccceEEEEEEeecee-eCCceEEEccCCcccceeeeeecccc Confidence 0 01344444443332 22211222222223333333222211 No 20 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=63.63 E-value=0.31 Score=23.34 Aligned_cols=417 Identities=11% Similarity=0.015 Sum_probs=155.0 Q ss_pred CCCcccc---CceeEeee---ecccccccccccccc-eeEeCCCc--------ee---cccCCC-cccce-EE--E--Ee Q lcl|NC_018275. 1 MGKDFKN---ADYIDYLP---INMLATPKEVLNSSG-YLRSFPGI--------AK---RNDVNG-VSRGV-EY--N--TA 56 (461) Q Consensus 1 ~~~~~~~---~d~~~~~p---vn~~a~~~~~~~s~~-~L~~~PGl--------~~---~~~v~G-~~rG~-~y--~--~~ 56 (461) +++-... ..-++..| +.-.+.-.-..+..+ .+....|. .. ..+++. .+.+. .+ + .. T Consensus 195 a~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~~~t~~~g~~~~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~~ 274 (777) T protein:vir:80 195 ITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIAVSTDSGSNFLRASNAASIRDAAELPAKLPADADGFIIATGAA 274 (777) T ss_pred hhhhhhhhccccceeecCceEEEeCCcEEEEEecCceeEecCCcCccceeeeeEEEeeccccccccccccceEEEeCCCC Confidence 2222111 11111111 110000000000000 01111110 00 011111 00110 00 0 00 Q ss_pred cCeEEEE--eCCeEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccc--cccccCccCCcccc Q lcl|NC_018275. 57 QNAVYRV--LGSKLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPA--DSDYTQYELGSVRD 132 (461) Q Consensus 57 ~~~lY~V--~G~~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~--d~~~~~~dl~~~~~ 132 (461) +..-|+. .+++.++-....|.+.+- ..|-+ . ++..+.. |.+.-....-..... .+.+|.+.--.+.+ T Consensus 275 ~~~~y~~~~~~~~~w~e~~~~~~~~~~--~t~p~----~-l~~~~~~--~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~ 345 (777) T protein:vir:80 275 KNKTYFRWVDLERKWDEDASRGAQAEL--IDMPL----R-ITYSAPN--FSLTALNYERRASGDATSNPALKFTEQGISG 345 (777) T ss_pred CCceEEEEEccCcEEEEeecccccccc--cccce----E-EEecCCc--eEeeccCCccccccccccCCCceecCCceeE Confidence 0111111 111111111111221111 12211 1 2212211 221111111110011 12333332222346 Q ss_pred ceeccceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEec Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLT 201 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~nt 201 (461) |+|..+|++|..+ +....|.-.|.. +-|+. +++-+..+++.|.-++.+++.|+||.+..- |..+ T Consensus 346 v~f~q~RL~f~~~--~~v~~Srtgd~~nF~~~s~~~~~DdDpI-~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e--~~l~ 420 (777) T protein:vir:80 346 MTTMQGRLVLLAG--EYVCMSASGNPLRWFRASVSTQSDDDPI-EVAATAPVASPYEYAVAFNKDLVLFAKTHQ--GLVP 420 (777) T ss_pred EEEEcceeeeecC--CeEEEEeccCccccccccccCCCCCccE-EEEEcCCcceeeeeeeecCCcEEEEecCce--EEEe Confidence 9999999998652 233344333322 22232 256666777777778999999999976665 7777 Q ss_pred CCCCCccCcccccccceE--EeccccchhhhccCceEEEEeeccccceEEEE-------ccCceeeecCCHHHHHHHHhc Q lcl|NC_018275. 202 GATTAGAALYVAQPSLMV--QKGIAGTYCKTPFADSYAFISHPATGAPSVYI-------IGSGQASPIATASIEKIIRSY 272 (461) Q Consensus 202 G~~~~~~fp~~~~~~~~I--~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~-------l~g~q~~rIST~~IE~~i~~y 272 (461) |+. ++....-. +-. ..+|...-.-..++++++|+++.+..--.|++ .++|+++.+|- -++..|+. T Consensus 421 ~~~---~lTP~~~~-~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~~~~~~~~d~y~a~Dlt~-~~~hl~~~- 494 (777) T protein:vir:80 421 GAN---LLTSRNAT-AAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDSTS-HLPKYIAG- 494 (777) T ss_pred CCC---cccceeEE-EEEEEeeccCCCCCceEeCCeEEEEecCCCceeEEeeeeecccccCceehhHHHH-HHHHhcCC- Confidence 642 13332211 122 35788777778999999999864321112333 35678887844 35556654 Q ss_pred CcccccceEEEEEEECCEEEEEEEcCC-eEEEEEc---cccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCe Q lcl|NC_018275. 273 TADELATGVMEALRFDSHELLIIHLPR-HVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAV 348 (461) Q Consensus 273 ~~~el~~A~~~ty~~~GH~fyvlt~P~-~Tw~yD~---~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~ 348 (461) . ....+|+.+-+....+..-+ .-+||-- .-.|..--||.-.++ ...+..|++ .+--|++=..+++ T Consensus 495 ---~---v~~~a~s~~p~~v~~~~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~---g~v~~v~~i-~d~l~~iv~r~~~- 563 (777) T protein:vir:80 495 ---P---VRFLATSSTTSIVVVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFP---QDITGAYFR-GDRLILLFHVAGR- 563 (777) T ss_pred ---c---eEEEEEcCCCceEEEEEcCCCeEEEEEEeecCCceEEEeeEEeccC---CcEEEEEEE-CCEEEEEEEcCCe- Confidence 2 33345677777776666664 4444331 122221248777663 467766665 4445555554332 Q ss_pred EEEEcCCccCcCCC---EEEEEEee----ccccCCCceEEEEEEEEEcCCC----CCchhheeee------------ccC Q lcl|NC_018275. 349 TGQLQFDISSQYDK---QQEHLLFT----PLFKADNARCFDLEVESSTGVA----QYADRLFLSA------------TTD 405 (461) Q Consensus 349 l~~ld~~~~td~g~---p~~~~~~t----P~~~~~~~rv~~~~le~~~Gv~----q~~~~~~ls~------------sdD 405 (461) ++.--.+...+.+. +..++... -..+.... +- ..+.+ .....+.+.. .-+ T Consensus 564 ~~le~~~~~~~~d~~~~~~~~~D~~~~~~~~~~~~~~------~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~v~~~ 636 (777) T protein:vir:80 564 VILGELFMQRLGDAQSIPGGFLDLYRVGAANADEEVA------IP-AFAADLYPEDSTFAYKLSGEFQSLGQRCGDRRVD 636 (777) T ss_pred EEEEEEeeccCCCCcccceeeeeeeeeeeeeeCCccc------ee-EeeccccCCcceeEEEecCcccccceeeeeEEeC Confidence 22211121222121 11111100 00000000 00 00000 0000011111 112 Q ss_pred ccccCcceeeccCCCc-------ccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 406 GINYGREQMIEQNEPF-------VYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 406 G~~~~~~~~~~~g~~g-------~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |.++ +-.++++.++ .|..++.+-+.-. +..=| +-....|+-|..++++++ T Consensus 637 ~~~~--~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~-~~~~g---~~~~~~r~~i~r~~~~~~ 693 (777) T protein:vir:80 637 GATV--YIKVVGAQAGDQYRIGLRYLSKLGPTRPIL-RDPNG---VPITTERTQLHRLTWSLD 693 (777) T ss_pred Ccee--eEEEcCCCCCCEEEEeeeeEEEEEeCceEE-eCCCC---ceeeecCeEEEEEEEEee Confidence 2111 1112222221 1223333222110 10001 111224556777888887 No 21 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=59.64 E-value=0.39 Score=22.84 Aligned_cols=435 Identities=11% Similarity=0.031 Sum_probs=152.8 Q ss_pred CCCccccCceeEeeeecccccccc------cccccceeEeCCCceecccCCCcc-----cceEEEEecCeEEEEeCCe-- Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPKE------VLNSSGYLRSFPGIAKRNDVNGVS-----RGVEYNTAQNAVYRVLGSK-- 67 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~~------~~~s~~~L~~~PGl~~~~~v~G~~-----rG~~y~~~~~~lY~V~G~~-- 67 (461) .|+.++...-.+.++..-....+. .....-++..+.|-.. ....+.. .|..+....+....+.... T Consensus 199 v~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (768) T protein:vir:10 199 VGTLFYLEQEDNSFVKPWVVHQKIGPSELRRVGDRVYLCTAVGTAT-PQVTGTETPTHTSGSRWDGTGQDESATDEYGSI 277 (768) T ss_pred cceeeeeeeeccccccccEEEEeeeeEEEEecCCceEEeeeecccc-ccccceeccccccCceEEEecCccccccccccc Confidence 122211111111110000000000 0000000111111100 0000000 1111111111100000000 Q ss_pred -----EEeccceEEeecCcccEEEEeCCeEEEEEEC----CcEEEEEeecccccceeccccccccCccCCccccceeccc Q lcl|NC_018275. 68 -----LYKGETVVGDVAGSGRVSMAHGRTSQAVGVN----GQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITRLRG 138 (461) Q Consensus 68 -----Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~----g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~~dG 138 (461) .++.....+.|.+...-+|+++-........ .....+...........|....+| |..|+|..+ T Consensus 278 ~~~~~~~~~~~~~~~i~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~-------Ps~v~f~q~ 350 (768) T protein:vir:10 278 GAEWEYQHSGYGTVLITGYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGF-------PQMGTFWRN 350 (768) T ss_pred ceEEEEEEcCCceEEEEEecCCeeEEeeeeeecCcccccccccccccCCCcccccCCCcCCCCC-------ceEEEEEee Confidence 0000001111111111122222110000000 000000000000111112222233 346889999 Q ss_pred eEEEEeeCC------c---eEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCC----- Q lcl|NC_018275. 139 RYAWSKDGT------D---SWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGAT----- 204 (461) Q Consensus 139 yfV~~~~gt------~---~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~----- 204 (461) |++|.-|.+ + .|+.+.+...++.|+. ++.-+..+++.|.-++.++ .|+||.+..- |..+|++ T Consensus 351 RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I-~~~~ss~~~~~i~~~v~~~-~L~i~T~~~q--~~l~~~~~~~~l 426 (768) T protein:vir:10 351 RLCLMRDRWLAMSVSADFETFKTKDADQQTDDSAI-VQQLNARQLNKLAWMVESD-SLLIGMTGDE--WVIGPANASQPV 426 (768) T ss_pred eEEEeeCCEEEEEcccccccccccccccccCCccE-EEEecCCcceeEEEEeecC-cEEEEecCce--EEEecCCCCccc Confidence 988864211 1 1333344333345554 3666777778888888885 6888777755 7776643 Q ss_pred CCccCcccccccceEEeccccchhhhccCceEEEEeeccccce---EEEEccCceeeecCCHHHHHHHHhcCcccccceE Q lcl|NC_018275. 205 TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAP---SVYIIGSGQASPIATASIEKIIRSYTADELATGV 281 (461) Q Consensus 205 ~~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~---~Vy~l~g~q~~rIST~~IE~~i~~y~~~el~~A~ 281 (461) ++...-..++ + ..||.. -.-..++++++|++..+..-. .-+..++|+++.+|- -++.+++.... .-..-+ T Consensus 427 TP~~~~i~~~-s---~~g~~~-~~Pv~vG~~v~fv~~~g~~vre~~y~~~~d~y~a~DlT~-~a~hl~~~~~~-~~~~i~ 499 (768) T protein:vir:10 427 SAANLNAARR-T---SYGSKR-IQPVQVGGTIMFVQKAGRKLRDFKYDFSSDNYVSTDVTK-IADHITRGRAG-TNSGIM 499 (768) T ss_pred ccceEEEEEe-e---hhcccc-cccEEeCCeEEEEcCCCCEEEEEEeeeecCceecchhhh-hhhhhccccCc-ccccee Confidence 2222222222 1 368854 344678999999998874211 112357888888862 23445544321 112345 Q ss_pred EEEEEECCEEEEEEEcCC---eEEEEEcccc-CCcceeeeecCCccccceEEEEEEec-----CCeEEEEE-cCCCe--- Q lcl|NC_018275. 282 MEALRFDSHELLIIHLPR---HVLVYDASSS-QNGPQWCVLKTGLYDDVYRAIDFMYE-----GNQIACGD-KSEAV--- 348 (461) Q Consensus 282 ~~ty~~~GH~fyvlt~P~---~Tw~yD~~t~-~w~e~w~~~~tg~~~~~~Ra~~~~~~-----~g~~~vGD-~~~G~--- 348 (461) .+.|+.+.+.++.+-..| ..+.|+-..+ |..--||.-..+. ..-.+.|.+.. +--|++=. .-+|. T Consensus 500 ~~a~~~~p~~v~~~v~~dg~l~~~ty~~e~~~q~v~aW~~~~~~~--g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~ 577 (768) T protein:vir:10 500 SLCFQQEPHSVVWAARADGQLIGCTYDEEAGRSDVYGWHRHPDAN--GFVECVASMPAPDGASDDLWVIVRRQVNGQTVR 577 (768) T ss_pred eEEEeecCCeEEEEEecCCeEEEEEEecCCCceeEEeEEEEEcCC--CEEEEEEEEecCCCCccEEEEEEEecCCCeEEE Confidence 566778888777666664 3566665432 2222466554211 12223333211 00111111 11111 Q ss_pred -E----------------EEEcCCccCc-----------------------------------------------CCCEE Q lcl|NC_018275. 349 -T----------------GQLQFDISSQ-----------------------------------------------YDKQQ 364 (461) Q Consensus 349 -l----------------~~ld~~~~td-----------------------------------------------~g~p~ 364 (461) + +.||....-+ =|-++ T Consensus 578 ~ie~l~~~~~~~~~~~~~~~~D~~~~~~~~~~~~~~gl~~leg~~v~v~~dG~~~~~~~v~~g~itl~~~~~~v~vG~~y 657 (768) T protein:vir:10 578 YVEYLNPALQDDEPQSSAFYVDAGITYNGVPTSTIAGLGHLEGVTVAVLTDGAVHPSRTVTAGAITLDWSASIVHIGVPT 657 (768) T ss_pred EEEecCcccccccccccceEeccccccCCcceeeecCCCCcccceEEEEECCEeccCceecCCEEEeCCCCceEEEeEee Confidence 1 1222211100 01111 Q ss_pred EEEEeecccc--CC-------CceEEEEEEEEEcCCCCCchhheeeeccCc--cccCcceee--ccCCC-cccceeEEEE Q lcl|NC_018275. 365 EHLLFTPLFK--AD-------NARCFDLEVESSTGVAQYADRLFLSATTDG--INYGREQMI--EQNEP-FVYDKRVLWK 430 (461) Q Consensus 365 ~~~~~tP~~~--~~-------~~rv~~~~le~~~Gv~q~~~~~~ls~sdDG--~~~~~~~~~--~~g~~-g~y~~R~~~~ 430 (461) ++.+.++.+. .+ +.|+..+.|.+..=.+ +.+.-+++. ..+-..+.. .+|++ --+.-.+++. T Consensus 658 ~s~~~~~p~~~~~~~gs~~~~~~ri~r~~v~~~~S~~-----~~~~~~~~~~~~~~~~~r~~~~~~~~~~~l~TG~~~v~ 732 (768) T protein:vir:10 658 TCRIQTMQLNAGAANGTAQGKTKRVTNIATRFSRSLG-----GVVGPTFDDNDLEQLSFRKPSNAMDRAVPLFDGDMESD 732 (768) T ss_pred eEEEEecceEeecCCccccccceEEEEEEEEEecccc-----eEEEecCCCCCceeeeeEecCcccCccCCcccCEEEEE Confidence 1222221111 00 0011111111000000 000000000 000000110 01111 1122233333 Q ss_pred eeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 431 RVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 431 rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) -.|...++.-++|+-..|-|..|.++..+++ T Consensus 733 ~~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~ 763 (768) T protein:vir:10 733 WRGGYEGQSWICYQNDQPLPVTLLGFFPILD 763 (768) T ss_pred ecCCCCcceEEEEEECCCCCEEEEEEEEEEE Confidence 3454455555677777777777777777777 No 22 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=59.34 E-value=0.39 Score=22.80 Aligned_cols=420 Identities=12% Similarity=0.100 Sum_probs=151.0 Q ss_pred CCCcccc-CceeEe----------eeecccccccccc-cccc--eeEe--CCCc-eecccCCCcccceEEEEe-cCe-EE Q lcl|NC_018275. 1 MGKDFKN-ADYIDY----------LPINMLATPKEVL-NSSG--YLRS--FPGI-AKRNDVNGVSRGVEYNTA-QNA-VY 61 (461) Q Consensus 1 ~~~~~~~-~d~~~~----------~pvn~~a~~~~~~-~s~~--~L~~--~PGl-~~~~~v~G~~rG~~y~~~-~~~-lY 61 (461) .-|=.|. .+-|.. .|.|+.++....+ .... .+.. ..+. .......+...-..|... .++ .+ T Consensus 136 p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w 215 (681) T protein:vir:10 136 PRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAW 215 (681) T ss_pred ceEEEEccCCceEEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEE Confidence 1111111 111222 2333333322111 0000 0000 0000 000000000000000000 000 01 Q ss_pred -EEeCCeEEeccceEE---eecCcccEE--EEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccccee Q lcl|NC_018275. 62 -RVLGSKLYKGETVVG---DVAGSGRVS--MAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITR 135 (461) Q Consensus 62 -~V~G~~Ly~v~~~iG---~i~gsg~Vs--Ma~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~ 135 (461) -+.|..-|.+...-+ .+.|...+. ..+| +..+.. -.++. ....|....+| +..|+| T Consensus 216 ~a~~g~~~~~V~~~~~gi~g~ig~~~~~~~~~~~-----~~~~~~---~t~~~---~~~~~~~~~gy-------P~~v~f 277 (681) T protein:vir:10 216 SASSGASRYNVYKEQGGLYGYIGQTTGTSLVDDN-----IAPDLS---VTPPI---YDAVFNAAGDY-------PAAVSY 277 (681) T ss_pred EecCCceeeeecccceeEEEEeeccceeeeeecc-----cccCcc---ccccc---cccccccCCCc-------eEEEEE Confidence 112222111111101 011111111 0111 110100 00111 11112223333 346899 Q ss_pred ccceEEEEee-C-CceEEEeccCC--------cc-ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCC Q lcl|NC_018275. 136 LRGRYAWSKD-G-TDSWFITDLED--------ES-HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGAT 204 (461) Q Consensus 136 ~dGyfV~~~~-g-t~~f~iS~L~d--------~t-~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~ 204 (461) ..+|++|.-. . .+.+..|.-.| ++ +-|+. ++.-+-.+++.|.-++.+++ |++|.+. -|++-..++. T Consensus 278 ~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i-~~~~~~~~~~~i~~~v~~~~-lli~t~~-~e~~l~~~~~ 354 (681) T protein:vir:10 278 FEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRV-AFRVAAREANAIRHIVPLTE-LLLLTSS-GEWRVASVNS 354 (681) T ss_pred EcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccE-EEEEcCCcceeEEEEEecCc-EEEEEcC-cEEEEecCCC Confidence 9999888521 1 11222232222 11 22333 25666777888888888864 5555555 5554433322 Q ss_pred C---CccCcccccccceEEeccccchhhhccCceEEEEeeccccce-EE--EEccCceeeecCCHHHHHHHHhcCccccc Q lcl|NC_018275. 205 T---AGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAP-SV--YIIGSGQASPIATASIEKIIRSYTADELA 278 (461) Q Consensus 205 ~---~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~-~V--y~l~g~q~~rIST~~IE~~i~~y~~~el~ 278 (461) + +...-..++ =..||.. -.-..++++++|++..+..-. .. +..++|+++.+| --.+.+++.. T Consensus 355 ~~lTP~~~~~~~~----s~~g~~~-~~Pv~vg~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt-~~a~Hl~~~~------ 422 (681) T protein:vir:10 355 DAVTPTTISVRPQ----SYVGATD-VQPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLS-LRAAHLFDNL------ 422 (681) T ss_pred ccccceeEEEEEe----eeecccc-ccceeeCCeEEEEecCCCEEEEEEEeeecCceeccchh-hhhhhhcCCC------ Confidence 2 111112222 1468854 456778999999988874211 11 245677888875 1223333332 Q ss_pred ceEEEEEEECCEEEEEEEcCC---eEEEEEccccCCcceeeeecCCccccceEEEEEEecCCe---EEE-----EEcCCC Q lcl|NC_018275. 279 TGVMEALRFDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ---IAC-----GDKSEA 347 (461) Q Consensus 279 ~A~~~ty~~~GH~fyvlt~P~---~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~---~~v-----GD~~~G 347 (461) .-+..+|+++.+.+..+.+.| ..+.|+-....+ -||.-.++ ...+..|++..+++ |++ +....- T Consensus 423 ~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~--aW~~~~~~---g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~ 497 (681) T protein:vir:10 423 DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIG--AWHQHDTD---GVFESCAVVAEGNEDRLYAVVRRTIGGNEVR 497 (681) T ss_pred CeEEEEEecCCCEEEEEEecCCcEEEEEEeccccee--eEEEEecC---CcEEEEEEecCCCCcEEEEEEEecCCCCeEE Confidence 233466788899888888885 477787554433 58877763 34555555543222 111 100000 Q ss_pred eEEEEc--------------CCccCcCCCEEEEEEeeccccCCCceEE-------------------------------- Q lcl|NC_018275. 348 VTGQLQ--------------FDISSQYDKQQEHLLFTPLFKADNARCF-------------------------------- 381 (461) Q Consensus 348 ~l~~ld--------------~~~~td~g~p~~~~~~tP~~~~~~~rv~-------------------------------- 381 (461) .|=+|+ ..... .+.+...+--.+++ ++..+. T Consensus 498 yie~~~~~~~~~~~~~~~vD~~~t~-~~~~~~~~sgl~~l--eG~tv~i~aDG~~~~~~~V~~G~itl~~~~~~v~VGl~ 574 (681) T protein:vir:10 498 YVERMASRQFDAQADAFFVDSGLTY-SGEPVSHISGLEHL--EGKTVSILADGAVHPQRVVTDGAIDLDVEAGTVHIGLP 574 (681) T ss_pred EEEecCCccccccccceEeeccccc-cCcceeeeccccCC--CCcEEEEEeCCeecCcEeecCcEEEeCcCCceEEEeee Confidence 111121 11110 11111110000111 111000 Q ss_pred -EE-----EEEEEcCCC--CC----chhheeeeccC-ccccC--ccee--e------ccCC-CcccceeEEEEeeEeccc Q lcl|NC_018275. 382 -DL-----EVESSTGVA--QY----ADRLFLSATTD-GINYG--REQM--I------EQNE-PFVYDKRVLWKRVGRIRR 437 (461) Q Consensus 382 -~~-----~le~~~Gv~--q~----~~~~~ls~sdD-G~~~~--~~~~--~------~~g~-~g~y~~R~~~~rlG~~r~ 437 (461) .. .+++...-+ +. -.++-|+.-+. |...+ ..++ + .+|. +-.+.-.++.---|.-.+ T Consensus 575 Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~g~~~~l~TG~~~v~v~~~~~~ 654 (681) T protein:vir:10 575 ITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHADALTEVKQRTSEPYGSPPALKSEEIPLVLSPKWGD 654 (681) T ss_pred ceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCCceEEEEEeccccccccCCccCCeEEEEeCCCcCc Confidence 00 011100000 00 01112222221 11100 0000 0 0111 111222222221122223 Q ss_pred ceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 438 LIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 438 ~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) +.-++|+-..|-|..|.++..++| T Consensus 655 ~~~v~I~qd~PlP~tvlsi~~ev~ 678 (681) T protein:vir:10 655 SGQLFVRQADPLPLMIVSMSAEIA 678 (681) T ss_pred ceEEEEEECCCcCEEEEEeeEEEE Confidence 334667777777777777777777 No 23 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=59.34 E-value=0.39 Score=22.80 Aligned_cols=420 Identities=12% Similarity=0.100 Sum_probs=151.0 Q ss_pred CCCcccc-CceeEe----------eeecccccccccc-cccc--eeEe--CCCc-eecccCCCcccceEEEEe-cCe-EE Q lcl|NC_018275. 1 MGKDFKN-ADYIDY----------LPINMLATPKEVL-NSSG--YLRS--FPGI-AKRNDVNGVSRGVEYNTA-QNA-VY 61 (461) Q Consensus 1 ~~~~~~~-~d~~~~----------~pvn~~a~~~~~~-~s~~--~L~~--~PGl-~~~~~v~G~~rG~~y~~~-~~~-lY 61 (461) .-|=.|. .+-|.. .|.|+.++....+ .... .+.. ..+. .......+...-..|... .++ .+ T Consensus 136 p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w 215 (681) T protein:vir:10 136 PRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAW 215 (681) T ss_pred ceEEEEccCCceEEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEE Confidence 1111111 111222 2333333322111 0000 0000 0000 000000000000000000 000 01 Q ss_pred -EEeCCeEEeccceEE---eecCcccEE--EEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccccee Q lcl|NC_018275. 62 -RVLGSKLYKGETVVG---DVAGSGRVS--MAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITR 135 (461) Q Consensus 62 -~V~G~~Ly~v~~~iG---~i~gsg~Vs--Ma~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~ 135 (461) -+.|..-|.+...-+ .+.|...+. ..+| +..+.. -.++. ....|....+| +..|+| T Consensus 216 ~a~~g~~~~~V~~~~~gi~g~ig~~~~~~~~~~~-----~~~~~~---~t~~~---~~~~~~~~~gy-------P~~v~f 277 (681) T protein:vir:10 216 SASSGASRYNVYKEQGGLYGYIGQTTGTSLVDDN-----IAPDLS---VTPPI---YDAVFNAAGDY-------PAAVSY 277 (681) T ss_pred EecCCceeeeecccceeEEEEeeccceeeeeecc-----cccCcc---ccccc---cccccccCCCc-------eEEEEE Confidence 112222111111101 011111111 0111 110100 00111 11112223333 346899 Q ss_pred ccceEEEEee-C-CceEEEeccCC--------cc-ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCC Q lcl|NC_018275. 136 LRGRYAWSKD-G-TDSWFITDLED--------ES-HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGAT 204 (461) Q Consensus 136 ~dGyfV~~~~-g-t~~f~iS~L~d--------~t-~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~ 204 (461) ..+|++|.-. . .+.+..|.-.| ++ +-|+. ++.-+-.+++.|.-++.+++ |++|.+. -|++-..++. T Consensus 278 ~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i-~~~~~~~~~~~i~~~v~~~~-lli~t~~-~e~~l~~~~~ 354 (681) T protein:vir:10 278 FEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRV-AFRVAAREANAIRHIVPLTE-LLLLTSS-GEWRVASVNS 354 (681) T ss_pred EcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccE-EEEEcCCcceeEEEEEecCc-EEEEEcC-cEEEEecCCC Confidence 9999888521 1 11222232222 11 22333 25666777888888888864 5555555 5554433322 Q ss_pred C---CccCcccccccceEEeccccchhhhccCceEEEEeeccccce-EE--EEccCceeeecCCHHHHHHHHhcCccccc Q lcl|NC_018275. 205 T---AGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAP-SV--YIIGSGQASPIATASIEKIIRSYTADELA 278 (461) Q Consensus 205 ~---~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~-~V--y~l~g~q~~rIST~~IE~~i~~y~~~el~ 278 (461) + +...-..++ =..||.. -.-..++++++|++..+..-. .. +..++|+++.+| --.+.+++.. T Consensus 355 ~~lTP~~~~~~~~----s~~g~~~-~~Pv~vg~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt-~~a~Hl~~~~------ 422 (681) T protein:vir:10 355 DAVTPTTISVRPQ----SYVGATD-VQPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLS-LRAAHLFDNL------ 422 (681) T ss_pred ccccceeEEEEEe----eeecccc-ccceeeCCeEEEEecCCCEEEEEEEeeecCceeccchh-hhhhhhcCCC------ Confidence 2 111112222 1468854 456778999999988874211 11 245677888875 1223333332 Q ss_pred ceEEEEEEECCEEEEEEEcCC---eEEEEEccccCCcceeeeecCCccccceEEEEEEecCCe---EEE-----EEcCCC Q lcl|NC_018275. 279 TGVMEALRFDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ---IAC-----GDKSEA 347 (461) Q Consensus 279 ~A~~~ty~~~GH~fyvlt~P~---~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~---~~v-----GD~~~G 347 (461) .-+..+|+++.+.+..+.+.| ..+.|+-....+ -||.-.++ ...+..|++..+++ |++ +....- T Consensus 423 ~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~--aW~~~~~~---g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~ 497 (681) T protein:vir:10 423 DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIG--AWHQHDTD---GVFESCAVVAEGNEDRLYAVVRRTIGGNEVR 497 (681) T ss_pred CeEEEEEecCCCEEEEEEecCCcEEEEEEeccccee--eEEEEecC---CcEEEEEEecCCCCcEEEEEEEecCCCCeEE Confidence 233466788899888888885 477787554433 58877763 34555555543222 111 100000 Q ss_pred eEEEEc--------------CCccCcCCCEEEEEEeeccccCCCceEE-------------------------------- Q lcl|NC_018275. 348 VTGQLQ--------------FDISSQYDKQQEHLLFTPLFKADNARCF-------------------------------- 381 (461) Q Consensus 348 ~l~~ld--------------~~~~td~g~p~~~~~~tP~~~~~~~rv~-------------------------------- 381 (461) .|=+|+ ..... .+.+...+--.+++ ++..+. T Consensus 498 yie~~~~~~~~~~~~~~~vD~~~t~-~~~~~~~~sgl~~l--eG~tv~i~aDG~~~~~~~V~~G~itl~~~~~~v~VGl~ 574 (681) T protein:vir:10 498 YVERMASRQFDAQADAFFVDSGLTY-SGEPVSHISGLEHL--EGKTVSILADGAVHPQRVVTDGAIDLDVEAGTVHIGLP 574 (681) T ss_pred EEEecCCccccccccceEeeccccc-cCcceeeeccccCC--CCcEEEEEeCCeecCcEeecCcEEEeCcCCceEEEeee Confidence 111121 11110 11111110000111 111000 Q ss_pred -EE-----EEEEEcCCC--CC----chhheeeeccC-ccccC--ccee--e------ccCC-CcccceeEEEEeeEeccc Q lcl|NC_018275. 382 -DL-----EVESSTGVA--QY----ADRLFLSATTD-GINYG--REQM--I------EQNE-PFVYDKRVLWKRVGRIRR 437 (461) Q Consensus 382 -~~-----~le~~~Gv~--q~----~~~~~ls~sdD-G~~~~--~~~~--~------~~g~-~g~y~~R~~~~rlG~~r~ 437 (461) .. .+++...-+ +. -.++-|+.-+. |...+ ..++ + .+|. +-.+.-.++.---|.-.+ T Consensus 575 Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~g~~~~l~TG~~~v~v~~~~~~ 654 (681) T protein:vir:10 575 ITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHADALTEVKQRTSEPYGSPPALKSEEIPLVLSPKWGD 654 (681) T ss_pred ceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCCceEEEEEeccccccccCCccCCeEEEEeCCCcCc Confidence 00 011100000 00 01112222221 11100 0000 0 0111 111222222221122223 Q ss_pred ceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 438 LIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 438 ~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) +.-++|+-..|-|..|.++..++| T Consensus 655 ~~~v~I~qd~PlP~tvlsi~~ev~ 678 (681) T protein:vir:10 655 SGQLFVRQADPLPLMIVSMSAEIA 678 (681) T ss_pred ceEEEEEECCCcCEEEEEeeEEEE Confidence 334667777777777777777777 No 24 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=59.34 E-value=0.39 Score=22.80 Aligned_cols=420 Identities=12% Similarity=0.100 Sum_probs=151.0 Q ss_pred CCCcccc-CceeEe----------eeecccccccccc-cccc--eeEe--CCCc-eecccCCCcccceEEEEe-cCe-EE Q lcl|NC_018275. 1 MGKDFKN-ADYIDY----------LPINMLATPKEVL-NSSG--YLRS--FPGI-AKRNDVNGVSRGVEYNTA-QNA-VY 61 (461) Q Consensus 1 ~~~~~~~-~d~~~~----------~pvn~~a~~~~~~-~s~~--~L~~--~PGl-~~~~~v~G~~rG~~y~~~-~~~-lY 61 (461) .-|=.|. .+-|.. .|.|+.++....+ .... .+.. ..+. .......+...-..|... .++ .+ T Consensus 136 p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w 215 (681) T protein:vir:98 136 PRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAW 215 (681) T ss_pred ceEEEEccCCceEEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEE Confidence 1111111 111222 2333333322111 0000 0000 0000 000000000000000000 000 01 Q ss_pred -EEeCCeEEeccceEE---eecCcccEE--EEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccccee Q lcl|NC_018275. 62 -RVLGSKLYKGETVVG---DVAGSGRVS--MAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDITR 135 (461) Q Consensus 62 -~V~G~~Ly~v~~~iG---~i~gsg~Vs--Ma~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v~~ 135 (461) -+.|..-|.+...-+ .+.|...+. ..+| +..+.. -.++. ....|....+| +..|+| T Consensus 216 ~a~~g~~~~~V~~~~~gi~g~ig~~~~~~~~~~~-----~~~~~~---~t~~~---~~~~~~~~~gy-------P~~v~f 277 (681) T protein:vir:98 216 SASSGASRYNVYKEQGGLYGYIGQTTGTSLVDDN-----IAPDLS---VTPPI---YDAVFNAAGDY-------PAAVSY 277 (681) T ss_pred EecCCceeeeecccceeEEEEeeccceeeeeecc-----cccCcc---ccccc---cccccccCCCc-------eEEEEE Confidence 112222111111101 011111111 0111 110100 00111 11112223333 346899 Q ss_pred ccceEEEEee-C-CceEEEeccCC--------cc-ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCC Q lcl|NC_018275. 136 LRGRYAWSKD-G-TDSWFITDLED--------ES-HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGAT 204 (461) Q Consensus 136 ~dGyfV~~~~-g-t~~f~iS~L~d--------~t-~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~ 204 (461) ..+|++|.-. . .+.+..|.-.| ++ +-|+. ++.-+-.+++.|.-++.+++ |++|.+. -|++-..++. T Consensus 278 ~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i-~~~~~~~~~~~i~~~v~~~~-lli~t~~-~e~~l~~~~~ 354 (681) T protein:vir:98 278 FEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRV-AFRVAAREANAIRHIVPLTE-LLLLTSS-GEWRVASVNS 354 (681) T ss_pred EcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccE-EEEEcCCcceeEEEEEecCc-EEEEEcC-cEEEEecCCC Confidence 9999888521 1 11222232222 11 22333 25666777888888888864 5555555 5554433322 Q ss_pred C---CccCcccccccceEEeccccchhhhccCceEEEEeeccccce-EE--EEccCceeeecCCHHHHHHHHhcCccccc Q lcl|NC_018275. 205 T---AGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAP-SV--YIIGSGQASPIATASIEKIIRSYTADELA 278 (461) Q Consensus 205 ~---~~~fp~~~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~-~V--y~l~g~q~~rIST~~IE~~i~~y~~~el~ 278 (461) + +...-..++ =..||.. -.-..++++++|++..+..-. .. +..++|+++.+| --.+.+++.. T Consensus 355 ~~lTP~~~~~~~~----s~~g~~~-~~Pv~vg~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt-~~a~Hl~~~~------ 422 (681) T protein:vir:98 355 DAVTPTTISVRPQ----SYVGATD-VQPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLS-LRAAHLFDNL------ 422 (681) T ss_pred ccccceeEEEEEe----eeecccc-ccceeeCCeEEEEecCCCEEEEEEEeeecCceeccchh-hhhhhhcCCC------ Confidence 2 111112222 1468854 456778999999988874211 11 245677888875 1223333332 Q ss_pred ceEEEEEEECCEEEEEEEcCC---eEEEEEccccCCcceeeeecCCccccceEEEEEEecCCe---EEE-----EEcCCC Q lcl|NC_018275. 279 TGVMEALRFDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ---IAC-----GDKSEA 347 (461) Q Consensus 279 ~A~~~ty~~~GH~fyvlt~P~---~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~---~~v-----GD~~~G 347 (461) .-+..+|+++.+.+..+.+.| ..+.|+-....+ -||.-.++ ...+..|++..+++ |++ +....- T Consensus 423 ~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~--aW~~~~~~---g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~ 497 (681) T protein:vir:98 423 DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIG--AWHQHDTD---GVFESCAVVAEGNEDRLYAVVRRTIGGNEVR 497 (681) T ss_pred CeEEEEEecCCCEEEEEEecCCcEEEEEEeccccee--eEEEEecC---CcEEEEEEecCCCCcEEEEEEEecCCCCeEE Confidence 233466788899888888885 477787554433 58877763 34555555543222 111 100000 Q ss_pred eEEEEc--------------CCccCcCCCEEEEEEeeccccCCCceEE-------------------------------- Q lcl|NC_018275. 348 VTGQLQ--------------FDISSQYDKQQEHLLFTPLFKADNARCF-------------------------------- 381 (461) Q Consensus 348 ~l~~ld--------------~~~~td~g~p~~~~~~tP~~~~~~~rv~-------------------------------- 381 (461) .|=+|+ ..... .+.+...+--.+++ ++..+. T Consensus 498 yie~~~~~~~~~~~~~~~vD~~~t~-~~~~~~~~sgl~~l--eG~tv~i~aDG~~~~~~~V~~G~itl~~~~~~v~VGl~ 574 (681) T protein:vir:98 498 YVERMASRQFDAQADAFFVDSGLTY-SGEPVSHISGLEHL--EGKTVSILADGAVHPQRVVTDGAIDLDVEAGTVHIGLP 574 (681) T ss_pred EEEecCCccccccccceEeeccccc-cCcceeeeccccCC--CCcEEEEEeCCeecCcEeecCcEEEeCcCCceEEEeee Confidence 111121 11110 11111110000111 111000 Q ss_pred -EE-----EEEEEcCCC--CC----chhheeeeccC-ccccC--ccee--e------ccCC-CcccceeEEEEeeEeccc Q lcl|NC_018275. 382 -DL-----EVESSTGVA--QY----ADRLFLSATTD-GINYG--REQM--I------EQNE-PFVYDKRVLWKRVGRIRR 437 (461) Q Consensus 382 -~~-----~le~~~Gv~--q~----~~~~~ls~sdD-G~~~~--~~~~--~------~~g~-~g~y~~R~~~~rlG~~r~ 437 (461) .. .+++...-+ +. -.++-|+.-+. |...+ ..++ + .+|. +-.+.-.++.---|.-.+ T Consensus 575 Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~g~~~~l~TG~~~v~v~~~~~~ 654 (681) T protein:vir:98 575 ITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHADALTEVKQRTSEPYGSPPALKSEEIPLVLSPKWGD 654 (681) T ss_pred ceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCCceEEEEEeccccccccCCccCCeEEEEeCCCcCc Confidence 00 011100000 00 01112222221 11100 0000 0 0111 111222222221122223 Q ss_pred ceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 438 LIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 438 ~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) +.-++|+-..|-|..|.++..++| T Consensus 655 ~~~v~I~qd~PlP~tvlsi~~ev~ 678 (681) T protein:vir:98 655 SGQLFVRQADPLPLMIVSMSAEIA 678 (681) T ss_pred ceEEEEEECCCcCEEEEEeeEEEE Confidence 334667777777777777777777 No 25 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=55.78 E-value=0.47 Score=22.37 Aligned_cols=427 Identities=11% Similarity=0.073 Sum_probs=138.8 Q ss_pred CCCccccCceeEeeeeccccccc----------c--cccccceeEeCCCceec-ccCCCcccceEEEEecCeEEE---Ee Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLATPK----------E--VLNSSGYLRSFPGIAKR-NDVNGVSRGVEYNTAQNAVYR---VL 64 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a~~~----------~--~~~s~~~L~~~PGl~~~-~~v~G~~rG~~y~~~~~~lY~---V~ 64 (461) .|.... -..++.-.+|+-.+-. . ...+..+.++.+|.... ..-. . .....-|+ -. T Consensus 325 ~g~~i~-v~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~---~-----~~~d~yyv~~~~~ 395 (905) T protein:vir:78 325 VGNVIE-IERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTE---N-----AESDDYYVVFRSA 395 (905) T ss_pred cCcEEE-EEecCCCccEEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCC---C-----CCcceEEEEEEec Confidence 111100 0000110112100000 0 00011111222332110 0000 0 00011121 11 Q ss_pred -----CCeEEeccceEEeecCcccEEEEeC------CeEEEEEECCcEEEEEeecccccceeccccccccCccCCccccc Q lcl|NC_018275. 65 -----GSKLYKGETVVGDVAGSGRVSMAHG------RTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRDI 133 (461) Q Consensus 65 -----G~~Ly~v~~~iG~i~gsg~VsMa~N------~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~v 133 (461) |+.-++=....|.+.+-....|.+. |.-.+...++.. +..+........ ...+.+|.+.-..+.+| T Consensus 396 ~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~--~~~~~~~r~~Gd-~~Tnp~psf~g~~is~v 472 (905) T protein:vir:78 396 AEGIPGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEG--TITGWAQREVGD-DDTNPKPSFVGRGISDM 472 (905) T ss_pred ccCCcCceeEEEecccccccccccccccEEEEEecCceEEEEEecccc--ccccccccccCC-cccCCCCcccCCCcceE Confidence 1112210001122222222333332 111111111111 000111111100 01112333333345679 Q ss_pred eeccceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecC Q lcl|NC_018275. 134 TRLRGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTG 202 (461) Q Consensus 134 ~~~dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG 202 (461) +|..+|++|..| +....|.-.|.. +-|+. +++-+..+++.|.-++.+++.|+||.+..- |..+| T Consensus 473 ~f~q~RL~f~s~--~~v~~Srtgd~~nF~~~t~~~~~DdDpI-~~~~ss~~~~~i~~~v~~~~~L~ifT~g~e--f~lsg 547 (905) T protein:vir:78 473 FFYNNRLGFLSE--DAVIMSQPGDYFNFFVTSAITISDSDPI-DVTASSTKPAILRAAIGAPKGLILFAENSQ--FLLAS 547 (905) T ss_pred EEEcceEEEecC--CeEEEEccCCccccccccccCCCCCccE-EEEEcCCcceeeEEEeecCCcEEEEecCce--EEEec Confidence 999999988743 233344333322 22333 256666777788888999999999988776 77776 Q ss_pred CCCCccCcccccccceE-EeccccchhhhccCceEEEEeeccccceEEEEc------cCceeeecCCHHHHHHHHhcCcc Q lcl|NC_018275. 203 ATTAGAALYVAQPSLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYII------GSGQASPIATASIEKIIRSYTAD 275 (461) Q Consensus 203 ~~~~~~fp~~~~~~~~I-~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l------~g~q~~rIST~~IE~~i~~y~~~ 275 (461) +.+ .+....-.--.+ ..||...-.-..++++++|++..+. -..|+.+ ++|+++.+| .-++..|+. T Consensus 548 ~~~--~lTP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g~-~s~vre~~y~~~~d~y~a~DlT-~~a~hl~~g---- 619 (905) T protein:vir:78 548 QEV--VFSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEADT-YSKIFEMSIDSVDNRPQVADIT-RIVPEYVPT---- 619 (905) T ss_pred CCc--cccceeEEEEeEEeecccCCCCcEEeCCeEEEeecCCC-eeEEEEEEeeecccceehhHHH-HHHHHhcCC---- Confidence 542 233222110112 3588765555789999999998752 1224332 456666663 334555543 Q ss_pred cccceEEEEEEECCEEEEEEE-cCCeEEEEEc---cccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeE-- Q lcl|NC_018275. 276 ELATGVMEALRFDSHELLIIH-LPRHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVT-- 349 (461) Q Consensus 276 el~~A~~~ty~~~GH~fyvlt-~P~~Tw~yD~---~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l-- 349 (461) .+ .+.+..+-+.+++.. -.+.-+||-- .-.+..--||.-.++ ...+..|.+...-.+++=...+|.. T Consensus 620 ~v----~~~~~s~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~---G~~~~~a~i~d~~~~vV~r~~~G~~~~ 692 (905) T protein:vir:78 620 GL----TWSVSTPNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILP---GEQRMCGFFADTGYFVLYDSTTGSYVL 692 (905) T ss_pred ce----EEEEecCCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecC---CCeEEEEEEcCCEEEEEEEccCCeEEE Confidence 22 222333333333322 2245555432 222222248877663 3455555553333333322223332 Q ss_pred EEEcCCccCcCCCEEEE-EEeeccccCC----CceEEE-----EEEEEEcCCCCCchhheeeeccCccc----------c Q lcl|NC_018275. 350 GQLQFDISSQYDKQQEH-LLFTPLFKAD----NARCFD-----LEVESSTGVAQYADRLFLSATTDGIN----------Y 409 (461) Q Consensus 350 ~~ld~~~~td~g~p~~~-~~~tP~~~~~----~~rv~~-----~~le~~~Gv~q~~~~~~ls~sdDG~~----------~ 409 (461) +.++....-+....-.. ....|.+..- ..-.++ ..+.+..|.....-++-+.+.|-+.. | T Consensus 693 ~~~~l~~~~~~~~~d~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dG~~~~~~~~~~~~~~ 772 (905) T protein:vir:78 693 SAMELLDDPDSASIDTAFSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMTGATPVIMFTDGPSEFAFSQPTITAG 772 (905) T ss_pred EEEeeccccCccccccceeeeeeccceeeecccceecccCcceEeeeccCccccccceeEEEeeCCceeeeEEEEEeece Confidence 22221000000000000 0000100000 000000 00000011110000111111110000 0 Q ss_pred Cc----ceeeccCCCcccceeEEE----------------EeeEecccc----eeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 410 GR----EQMIEQNEPFVYDKRVLW----------------KRVGRIRRL----IGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 410 ~~----~~~~~~g~~g~y~~R~~~----------------~rlG~~r~~----v~f~~r~~~~~p~~l~ga~~~~e 461 (461) -. ...+-.|.+ |..++.. .|++++.-| -+|++.+....+-......-..+ T Consensus 773 ~~t~~~a~~v~VGl~--Y~s~v~~~p~~~~~~~~s~~~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~~~~~~~~ 846 (905) T protein:vir:78 773 QFTVDTTDDFVVGFK--YETKITLPGFFTSEENKADRVYAPIVEFLYLDLYYSGRYQIEVDRIGYDTINIDAGSID 846 (905) T ss_pred eeccccCCeEEEeee--eeEEEeecceEeccCCCcccccceEEEEEEEEeecceeEEEEEcCCCcceeccccccee Confidence 00 000111111 1111111 122222211 12333322211111100000000 No 26 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=52.87 E-value=0.54 Score=22.04 Aligned_cols=384 Identities=12% Similarity=0.119 Sum_probs=180.5 Q ss_pred CC-----------------------CccccCceeEeeeeccccccc-ccccccceeEeCCCcee-cccCCCcccceEEEE Q lcl|NC_018275. 1 MG-----------------------KDFKNADYIDYLPINMLATPK-EVLNSSGYLRSFPGIAK-RNDVNGVSRGVEYNT 55 (461) Q Consensus 1 ~~-----------------------~~~~~~d~~~~~pvn~~a~~~-~~~~s~~~L~~~PGl~~-~~~v~G~~rG~~y~~ 55 (461) || .|--+-=|+--+ |+-+-++. ++..|+..+...||-.. ....+-++.+. +. T Consensus 145 LgVpaps~aP~~a~~~~~~~~~~~p~d~etr~Yv~Tf-Vt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~--~i 221 (567) T protein:vir:82 145 LGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNA--SI 221 (567) T ss_pred cccCCccccceeeecCCCCCCCCCCccccceEEEEEE-EcCCCCcCCCcccccceeeecCCceEEEeeccCCcccc--cc Confidence 11 111111133322 33334444 23344445566677643 12222122221 23 Q ss_pred ecCeEEEE-eCC--eEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccc Q lcl|NC_018275. 56 AQNAVYRV-LGS--KLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRD 132 (461) Q Consensus 56 ~~~~lY~V-~G~--~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~ 132 (461) ....|||= .|+ .=|. .+++++ -+..++.||.-+- --+..+..+.|+.+.+... .+.. T Consensus 222 ~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m~--------------GL~~ 281 (567) T protein:vir:82 222 KRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENMT--------------GLCL 281 (567) T ss_pred ceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCcccc--------------eeee Confidence 55567774 221 1233 233333 1233455552111 0022233334443333332 1111 Q ss_pred ceeccceEEEEeeCCceEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCccc Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYV 212 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~ 212 (461) | ..|. .-.-.|+..+|-- +..|.++ +-..-..-.+.||+++++..-|+++-..-- +--+|.+ +.+-.-+ T Consensus 282 m--~NGi-mAgF~GneV~FsE----pylPyAW-P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P--Yl~sG~s-P~sms~~ 350 (567) T protein:vir:82 282 M--ANGI-AAGFAGNEVMFSE----AYLPYAW-PEVNRHTTAEDIVAICPLRTSLVVATKGEP--YLFSGVS-PSTISGS 350 (567) T ss_pred c--ccce-EEeecCCEEEEec----CCCCccc-chhhccCCCCCeEEEEecccEEEEEEcCce--EEEEcCC-hhhcccc Confidence 1 1233 2223355555433 3344444 222234457889999999999988876555 5556644 3333334 Q ss_pred ccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHH-HHHHhcCcccccceEEEEEEECCEE Q lcl|NC_018275. 213 AQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRSYTADELATGVMEALRFDSHE 291 (461) Q Consensus 213 ~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE-~~i~~y~~~el~~A~~~ty~~~GH~ 291 (461) +. -+.--|+.+.|+..++..+.|=|.|+- |...+.+++..++..=+. +.+++ ++.-+....++.||. T Consensus 351 kL---~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vvT~~l~t~~qW~a----~~~P~ti~A~~~eG~- 418 (567) T protein:vir:82 351 KI---PSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALATEQIVSPEQWQS----QFNPASIVAYPWRGE- 418 (567) T ss_pred cc---ccccccccccceeeecceEEeecCCcE----EEEecCCchhhhhhhccChHHHHh----cCCcceEEEEeecCe- Confidence 43 346689999999999999999999984 555555666655322221 23443 233334456789999 Q ss_pred EEEEEcC-----CeEEEEEccccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEE Q lcl|NC_018275. 292 LLIIHLP-----RHVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEH 366 (461) Q Consensus 292 fyvlt~P-----~~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~ 366 (461) |+.-.- +.+.-||.... .=..+++ +|-+.+.=...++..+ .+++.|++++... .|+.. T Consensus 419 -Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~l~~~~~g~-----~~~~~ 481 (567) T protein:vir:82 419 -YIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDKMSVLAGGA-----LPSTI 481 (567) T ss_pred -EEEEEeCCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCEEeeecCCC-----CceeE Confidence 444433 25788886532 2222333 2222222222233333 3445566654432 25555 Q ss_pred EEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEE Q lcl|NC_018275. 367 LLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRV 445 (461) Q Consensus 367 ~~~tP~~~~~~~rv~~-~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~ 445 (461) +-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+-..|. .||=-++-|. ++|.+ T Consensus 482 ~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~--~rlp~~~ar~-Wevei 547 (567) T protein:vir:82 482 RWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV--VRLPAATGQN-WQVMV 547 (567) T ss_pred EEecceEEecCccceeEEEEec--cCC-CceeEEEEEcCCce-------eec-CCcccCCce--eeccCcccce-EEEEE Confidence 6677887777653332 22322 111 11112222111111 221 334433443 3443334443 77888 Q ss_pred EecCcceEEEeEEEeC Q lcl|NC_018275. 446 ITKSPVTLSGCQIRLE 461 (461) Q Consensus 446 ~~~~p~~l~ga~~~~e 461 (461) +...+|.---+.-.|| T Consensus 548 sg~~~V~~v~LA~S~~ 563 (567) T protein:vir:82 548 SGFGQVERITLSTSMS 563 (567) T ss_pred EecccEEEEEEecchh Confidence 8888876555555555 No 27 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=49.16 E-value=0.65 Score=21.62 Aligned_cols=384 Identities=11% Similarity=0.114 Sum_probs=179.1 Q ss_pred CC-----------------------CccccCceeEeeeecccccccc-cccccceeEeCCCcee-cccCCCcccceEEEE Q lcl|NC_018275. 1 MG-----------------------KDFKNADYIDYLPINMLATPKE-VLNSSGYLRSFPGIAK-RNDVNGVSRGVEYNT 55 (461) Q Consensus 1 ~~-----------------------~~~~~~d~~~~~pvn~~a~~~~-~~~s~~~L~~~PGl~~-~~~v~G~~rG~~y~~ 55 (461) || .|--+-=|+--+ |+-+-++.. +..|+..+...||-.. ....+-++.+. +. T Consensus 145 LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~Tf-Vt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~--~i 221 (567) T protein:vir:33 145 LGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNA--SI 221 (567) T ss_pred cccCCccccceeeecCCCCCCCCCCcccceeEEEEEE-EcCCCCcCCCcccccceeeecCCceEEEeeccCCcccc--cc Confidence 11 111111132222 233334432 3344444555666643 11121122221 23 Q ss_pred ecCeEEEE-eCC--eEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccc Q lcl|NC_018275. 56 AQNAVYRV-LGS--KLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRD 132 (461) Q Consensus 56 ~~~~lY~V-~G~--~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~ 132 (461) ....|||= .|+ .=|. .+++++ -+..++.||.-+- --+..+..+.|+.+.+... .+.. T Consensus 222 ~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m~--------------GL~~ 281 (567) T protein:vir:33 222 KRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENMT--------------GLCL 281 (567) T ss_pred ceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCcccc--------------eeee Confidence 55567774 221 1233 233333 1233455542111 0022233334444333332 1111 Q ss_pred ceeccceEEEEeeCCceEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCccc Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYV 212 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~ 212 (461) | ..|. .-.-.|+..+|-- +..|.++ +-..-..-.+.||+++++..-|+++-..-- +--+|.+ +.+-.-+ T Consensus 282 m--~NGi-mAgF~GneV~FsE----pylPyAW-P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P--Yl~sG~s-P~sms~~ 350 (567) T protein:vir:33 282 M--ANGI-AAGFAGNEVMFSE----AYLPYAW-PEVNRHTTAEDIVAICPLGTSLVVATKGEP--YLFSGVS-PSTISGS 350 (567) T ss_pred c--ccce-EEeecCCEEEEec----CCCCccc-chhhccCCCCCeEEEeecccEEEEEEcCce--EEEEcCC-hhhcccc Confidence 1 1233 2223355555433 3344444 222234457889999999999988876555 5556644 3344334 Q ss_pred ccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHH-HHHHhcCcccccceEEEEEEECCEE Q lcl|NC_018275. 213 AQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRSYTADELATGVMEALRFDSHE 291 (461) Q Consensus 213 ~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE-~~i~~y~~~el~~A~~~ty~~~GH~ 291 (461) +. -+.--|+.+.|+..++..+.|=|.|+- |...+.+++..++..=+. +.+++ ++.-+....++.||. T Consensus 351 kL---~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vvT~~l~t~~qW~a----~~~P~ti~A~~~eG~- 418 (567) T protein:vir:33 351 KI---PSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALATEQIVSPEQWQS----QFNPASIVAYPWRGE- 418 (567) T ss_pred cc---ccccccccccceeEeccEEEeecCCcE----EEEecCCchhhhhhhccChHHHHh----cCCcceEEEEeecCe- Confidence 43 346689999999999999999999984 555455666655322221 23443 233334456789999 Q ss_pred EEEEEcC-----CeEEEEEccccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEE Q lcl|NC_018275. 292 LLIIHLP-----RHVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEH 366 (461) Q Consensus 292 fyvlt~P-----~~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~ 366 (461) |+.-.- +.+.-||.... .=..+++ +|-+.+.=...++..+ .+++.|++++... .|+.. T Consensus 419 -Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~l~~~~~g~-----~~~~~ 481 (567) T protein:vir:33 419 -YIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDKMSVLAGGA-----LPSTI 481 (567) T ss_pred -EEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCEEeeecCCC-----CceeE Confidence 444433 25788886532 2222333 2222222222233333 3445566654432 25555 Q ss_pred EEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEE Q lcl|NC_018275. 367 LLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRV 445 (461) Q Consensus 367 ~~~tP~~~~~~~rv~~-~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~ 445 (461) +-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+-..|. .||=-++-|. ++|.+ T Consensus 482 ~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~--~rlp~~~ar~-Wevei 547 (567) T protein:vir:33 482 RWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV--VRLPAATGQN-WQVMV 547 (567) T ss_pred EEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce--eecCCcccce-EEEEE Confidence 6677887777653332 22322 111 11112222111111 221 334433443 3443334443 77888 Q ss_pred EecCcceEEEeEEEeC Q lcl|NC_018275. 446 ITKSPVTLSGCQIRLE 461 (461) Q Consensus 446 ~~~~p~~l~ga~~~~e 461 (461) +...+|.---+.-.|| T Consensus 548 sg~~~V~~v~LA~S~~ 563 (567) T protein:vir:33 548 SGFGQVERITLSTSMS 563 (567) T ss_pred EecccEEEEEEecchh Confidence 8888876555555555 No 28 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=49.16 E-value=0.65 Score=21.62 Aligned_cols=384 Identities=11% Similarity=0.114 Sum_probs=179.1 Q ss_pred CC-----------------------CccccCceeEeeeecccccccc-cccccceeEeCCCcee-cccCCCcccceEEEE Q lcl|NC_018275. 1 MG-----------------------KDFKNADYIDYLPINMLATPKE-VLNSSGYLRSFPGIAK-RNDVNGVSRGVEYNT 55 (461) Q Consensus 1 ~~-----------------------~~~~~~d~~~~~pvn~~a~~~~-~~~s~~~L~~~PGl~~-~~~v~G~~rG~~y~~ 55 (461) || .|--+-=|+--+ |+-+-++.. +..|+..+...||-.. ....+-++.+. +. T Consensus 145 LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~Tf-Vt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~--~i 221 (567) T protein:vir:10 145 LGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNA--SI 221 (567) T ss_pred cccCCccccceeeecCCCCCCCCCCcccceeEEEEEE-EcCCCCcCCCcccccceeeecCCceEEEeeccCCcccc--cc Confidence 11 111111132222 233334432 3344444555666643 11121122221 23 Q ss_pred ecCeEEEE-eCC--eEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccc Q lcl|NC_018275. 56 AQNAVYRV-LGS--KLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRD 132 (461) Q Consensus 56 ~~~~lY~V-~G~--~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~ 132 (461) ....|||= .|+ .=|. .+++++ -+..++.||.-+- --+..+..+.|+.+.+... .+.. T Consensus 222 ~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m~--------------GL~~ 281 (567) T protein:vir:10 222 KRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENMT--------------GLCL 281 (567) T ss_pred ceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCcccc--------------eeee Confidence 55567774 221 1233 233333 1233455542111 0022233334444333332 1111 Q ss_pred ceeccceEEEEeeCCceEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCccc Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYV 212 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~ 212 (461) | ..|. .-.-.|+..+|-- +..|.++ +-..-..-.+.||+++++..-|+++-..-- +--+|.+ +.+-.-+ T Consensus 282 m--~NGi-mAgF~GneV~FsE----pylPyAW-P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P--Yl~sG~s-P~sms~~ 350 (567) T protein:vir:10 282 M--ANGI-AAGFAGNEVMFSE----AYLPYAW-PEVNRHTTAEDIVAICPLGTSLVVATKGEP--YLFSGVS-PSTISGS 350 (567) T ss_pred c--ccce-EEeecCCEEEEec----CCCCccc-chhhccCCCCCeEEEeecccEEEEEEcCce--EEEEcCC-hhhcccc Confidence 1 1233 2223355555433 3344444 222234457889999999999988876555 5556644 3344334 Q ss_pred ccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHH-HHHHhcCcccccceEEEEEEECCEE Q lcl|NC_018275. 213 AQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRSYTADELATGVMEALRFDSHE 291 (461) Q Consensus 213 ~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE-~~i~~y~~~el~~A~~~ty~~~GH~ 291 (461) +. -+.--|+.+.|+..++..+.|=|.|+- |...+.+++..++..=+. +.+++ ++.-+....++.||. T Consensus 351 kL---~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vvT~~l~t~~qW~a----~~~P~ti~A~~~eG~- 418 (567) T protein:vir:10 351 KI---PSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALATEQIVSPEQWQS----QFNPASIVAYPWRGE- 418 (567) T ss_pred cc---ccccccccccceeEeccEEEeecCCcE----EEEecCCchhhhhhhccChHHHHh----cCCcceEEEEeecCe- Confidence 43 346689999999999999999999984 555455666655322221 23443 233334456789999 Q ss_pred EEEEEcC-----CeEEEEEccccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEE Q lcl|NC_018275. 292 LLIIHLP-----RHVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEH 366 (461) Q Consensus 292 fyvlt~P-----~~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~ 366 (461) |+.-.- +.+.-||.... .=..+++ +|-+.+.=...++..+ .+++.|++++... .|+.. T Consensus 419 -Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~l~~~~~g~-----~~~~~ 481 (567) T protein:vir:10 419 -YIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDKMSVLAGGA-----LPSTI 481 (567) T ss_pred -EEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCEEeeecCCC-----CceeE Confidence 444433 25788886532 2222333 2222222222233333 3445566654432 25555 Q ss_pred EEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEE Q lcl|NC_018275. 367 LLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRV 445 (461) Q Consensus 367 ~~~tP~~~~~~~rv~~-~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~ 445 (461) +-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+-..|. .||=-++-|. ++|.+ T Consensus 482 ~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~--~rlp~~~ar~-Wevei 547 (567) T protein:vir:10 482 RWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV--VRLPAATGQN-WQVMV 547 (567) T ss_pred EEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce--eecCCcccce-EEEEE Confidence 6677887777653332 22322 111 11112222111111 221 334433443 3443334443 77888 Q ss_pred EecCcceEEEeEEEeC Q lcl|NC_018275. 446 ITKSPVTLSGCQIRLE 461 (461) Q Consensus 446 ~~~~p~~l~ga~~~~e 461 (461) +...+|.---+.-.|| T Consensus 548 sg~~~V~~v~LA~S~~ 563 (567) T protein:vir:10 548 SGFGQVERITLSTSMS 563 (567) T ss_pred EecccEEEEEEecchh Confidence 8888876555555555 No 29 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=49.16 E-value=0.65 Score=21.62 Aligned_cols=384 Identities=11% Similarity=0.114 Sum_probs=179.1 Q ss_pred CC-----------------------CccccCceeEeeeecccccccc-cccccceeEeCCCcee-cccCCCcccceEEEE Q lcl|NC_018275. 1 MG-----------------------KDFKNADYIDYLPINMLATPKE-VLNSSGYLRSFPGIAK-RNDVNGVSRGVEYNT 55 (461) Q Consensus 1 ~~-----------------------~~~~~~d~~~~~pvn~~a~~~~-~~~s~~~L~~~PGl~~-~~~v~G~~rG~~y~~ 55 (461) || .|--+-=|+--+ |+-+-++.. +..|+..+...||-.. ....+-++.+. +. T Consensus 145 LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~Tf-Vt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~--~i 221 (567) T protein:vir:99 145 LGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNA--SI 221 (567) T ss_pred cccCCccccceeeecCCCCCCCCCCcccceeEEEEEE-EcCCCCcCCCcccccceeeecCCceEEEeeccCCcccc--cc Confidence 11 111111132222 233334432 3344444555666643 11121122221 23 Q ss_pred ecCeEEEE-eCC--eEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccc Q lcl|NC_018275. 56 AQNAVYRV-LGS--KLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRD 132 (461) Q Consensus 56 ~~~~lY~V-~G~--~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~ 132 (461) ....|||= .|+ .=|. .+++++ -+..++.||.-+- --+..+..+.|+.+.+... .+.. T Consensus 222 ~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m~--------------GL~~ 281 (567) T protein:vir:99 222 KRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENMT--------------GLCL 281 (567) T ss_pred ceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCcccc--------------eeee Confidence 55567774 221 1233 233333 1233455542111 0022233334444333332 1111 Q ss_pred ceeccceEEEEeeCCceEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCccc Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYV 212 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~ 212 (461) | ..|. .-.-.|+..+|-- +..|.++ +-..-..-.+.||+++++..-|+++-..-- +--+|.+ +.+-.-+ T Consensus 282 m--~NGi-mAgF~GneV~FsE----pylPyAW-P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P--Yl~sG~s-P~sms~~ 350 (567) T protein:vir:99 282 M--ANGI-AAGFAGNEVMFSE----AYLPYAW-PEVNRHTTAEDIVAICPLGTSLVVATKGEP--YLFSGVS-PSTISGS 350 (567) T ss_pred c--ccce-EEeecCCEEEEec----CCCCccc-chhhccCCCCCeEEEeecccEEEEEEcCce--EEEEcCC-hhhcccc Confidence 1 1233 2223355555433 3344444 222234457889999999999988876555 5556644 3344334 Q ss_pred ccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHH-HHHHhcCcccccceEEEEEEECCEE Q lcl|NC_018275. 213 AQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRSYTADELATGVMEALRFDSHE 291 (461) Q Consensus 213 ~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE-~~i~~y~~~el~~A~~~ty~~~GH~ 291 (461) +. -+.--|+.+.|+..++..+.|=|.|+- |...+.+++..++..=+. +.+++ ++.-+....++.||. T Consensus 351 kL---~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vvT~~l~t~~qW~a----~~~P~ti~A~~~eG~- 418 (567) T protein:vir:99 351 KI---PSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALATEQIVSPEQWQS----QFNPASIVAYPWRGE- 418 (567) T ss_pred cc---ccccccccccceeEeccEEEeecCCcE----EEEecCCchhhhhhhccChHHHHh----cCCcceEEEEeecCe- Confidence 43 346689999999999999999999984 555455666655322221 23443 233334456789999 Q ss_pred EEEEEcC-----CeEEEEEccccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEE Q lcl|NC_018275. 292 LLIIHLP-----RHVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEH 366 (461) Q Consensus 292 fyvlt~P-----~~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~ 366 (461) |+.-.- +.+.-||.... .=..+++ +|-+.+.=...++..+ .+++.|++++... .|+.. T Consensus 419 -Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~l~~~~~g~-----~~~~~ 481 (567) T protein:vir:99 419 -YIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDKMSVLAGGA-----LPSTI 481 (567) T ss_pred -EEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCEEeeecCCC-----CceeE Confidence 444433 25788886532 2222333 2222222222233333 3445566654432 25555 Q ss_pred EEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEE Q lcl|NC_018275. 367 LLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRV 445 (461) Q Consensus 367 ~~~tP~~~~~~~rv~~-~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~ 445 (461) +-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+-..|. .||=-++-|. ++|.+ T Consensus 482 ~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~--~rlp~~~ar~-Wevei 547 (567) T protein:vir:99 482 RWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV--VRLPAATGQN-WQVMV 547 (567) T ss_pred EEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce--eecCCcccce-EEEEE Confidence 6677887777653332 22322 111 11112222111111 221 334433443 3443334443 77888 Q ss_pred EecCcceEEEeEEEeC Q lcl|NC_018275. 446 ITKSPVTLSGCQIRLE 461 (461) Q Consensus 446 ~~~~p~~l~ga~~~~e 461 (461) +...+|.---+.-.|| T Consensus 548 sg~~~V~~v~LA~S~~ 563 (567) T protein:vir:99 548 SGFGQVERITLSTSMS 563 (567) T ss_pred EecccEEEEEEecchh Confidence 8888876555555555 No 30 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=49.16 E-value=0.65 Score=21.62 Aligned_cols=384 Identities=11% Similarity=0.114 Sum_probs=179.1 Q ss_pred CC-----------------------CccccCceeEeeeecccccccc-cccccceeEeCCCcee-cccCCCcccceEEEE Q lcl|NC_018275. 1 MG-----------------------KDFKNADYIDYLPINMLATPKE-VLNSSGYLRSFPGIAK-RNDVNGVSRGVEYNT 55 (461) Q Consensus 1 ~~-----------------------~~~~~~d~~~~~pvn~~a~~~~-~~~s~~~L~~~PGl~~-~~~v~G~~rG~~y~~ 55 (461) || .|--+-=|+--+ |+-+-++.. +..|+..+...||-.. ....+-++.+. +. T Consensus 145 LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~Tf-Vt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~--~i 221 (567) T protein:vir:27 145 LGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNA--SI 221 (567) T ss_pred cccCCccccceeeecCCCCCCCCCCcccceeEEEEEE-EcCCCCcCCCcccccceeeecCCceEEEeeccCCcccc--cc Confidence 11 111111132222 233334432 3344444555666643 11121122221 23 Q ss_pred ecCeEEEE-eCC--eEEeccceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCcccc Q lcl|NC_018275. 56 AQNAVYRV-LGS--KLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGSVRD 132 (461) Q Consensus 56 ~~~~lY~V-~G~--~Ly~v~~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~~~~ 132 (461) ....|||= .|+ .=|. .+++++ -+..++.||.-+- --+..+..+.|+.+.+... .+.. T Consensus 222 ~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m~--------------GL~~ 281 (567) T protein:vir:27 222 KRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENMT--------------GLCL 281 (567) T ss_pred ceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCcccc--------------eeee Confidence 55567774 221 1233 233333 1233455542111 0022233334444333332 1111 Q ss_pred ceeccceEEEEeeCCceEEEeccCCccccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCccc Q lcl|NC_018275. 133 ITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYV 212 (461) Q Consensus 133 v~~~dGyfV~~~~gt~~f~iS~L~d~t~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~ 212 (461) | ..|. .-.-.|+..+|-- +..|.++ +-..-..-.+.||+++++..-|+++-..-- +--+|.+ +.+-.-+ T Consensus 282 m--~NGi-mAgF~GneV~FsE----pylPyAW-P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P--Yl~sG~s-P~sms~~ 350 (567) T protein:vir:27 282 M--ANGI-AAGFAGNEVMFSE----AYLPYAW-PEVNRHTTAEDIVAICPLGTSLVVATKGEP--YLFSGVS-PSTISGS 350 (567) T ss_pred c--ccce-EEeecCCEEEEec----CCCCccc-chhhccCCCCCeEEEeecccEEEEEEcCce--EEEEcCC-hhhcccc Confidence 1 1233 2223355555433 3344444 222234457889999999999988876555 5556644 3344334 Q ss_pred ccccceEEeccccchhhhccCceEEEEeeccccceEEEEccCceeeecCCHHHH-HHHHhcCcccccceEEEEEEECCEE Q lcl|NC_018275. 213 AQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRSYTADELATGVMEALRFDSHE 291 (461) Q Consensus 213 ~~~~~~I~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l~g~q~~rIST~~IE-~~i~~y~~~el~~A~~~ty~~~GH~ 291 (461) +. -+.--|+.+.|+..++..+.|=|.|+- |...+.+++..++..=+. +.+++ ++.-+....++.||. T Consensus 351 kL---~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vvT~~l~t~~qW~a----~~~P~ti~A~~~eG~- 418 (567) T protein:vir:27 351 KI---PSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALATEQIVSPEQWQS----QFNPASIVAYPWRGE- 418 (567) T ss_pred cc---ccccccccccceeEeccEEEeecCCcE----EEEecCCchhhhhhhccChHHHHh----cCCcceEEEEeecCe- Confidence 43 346689999999999999999999984 555455666655322221 23443 233334456789999 Q ss_pred EEEEEcC-----CeEEEEEccccCCcceeeeecCCccccceEEEEEEecCCeEEEEEcCCCeEEEEcCCccCcCCCEEEE Q lcl|NC_018275. 292 LLIIHLP-----RHVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDKSEAVTGQLQFDISSQYDKQQEH 366 (461) Q Consensus 292 fyvlt~P-----~~Tw~yD~~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~~~G~l~~ld~~~~td~g~p~~~ 366 (461) |+.-.- +.+.-||.... .=..+++ +|-+.+.=...++..+ .+++.|++++... .|+.. T Consensus 419 -Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~l~~~~~g~-----~~~~~ 481 (567) T protein:vir:27 419 -YIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDKMSVLAGGA-----LPSTI 481 (567) T ss_pred -EEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCEEeeecCCC-----CceeE Confidence 444433 25788886532 2222333 2222222222233333 3445566654432 25555 Q ss_pred EEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEE Q lcl|NC_018275. 367 LLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRV 445 (461) Q Consensus 367 ~~~tP~~~~~~~rv~~-~~le~~~Gv~q~~~~~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~~r~~v~f~~r~ 445 (461) +-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+-..|. .||=-++-|. ++|.+ T Consensus 482 ~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~--~rlp~~~ar~-Wevei 547 (567) T protein:vir:27 482 RWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV--VRLPAATGQN-WQVMV 547 (567) T ss_pred EEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce--eecCCcccce-EEEEE Confidence 6677887777653332 22322 111 11112222111111 221 334433443 3443334443 77888 Q ss_pred EecCcceEEEeEEEeC Q lcl|NC_018275. 446 ITKSPVTLSGCQIRLE 461 (461) Q Consensus 446 ~~~~p~~l~ga~~~~e 461 (461) +...+|.---+.-.|| T Consensus 548 sg~~~V~~v~LA~S~~ 563 (567) T protein:vir:27 548 SGFGQVERITLSTSMS 563 (567) T ss_pred EecccEEEEEEecchh Confidence 8888876555555555 No 31 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=48.40 E-value=0.67 Score=21.53 Aligned_cols=428 Identities=11% Similarity=0.131 Sum_probs=151.4 Q ss_pred CCCccccCc-eeEeeeeccccc-cc----ccccccceeEeCCC-ceecccCCCc-ccceEEEEecC--e---EEEE---- Q lcl|NC_018275. 1 MGKDFKNAD-YIDYLPINMLAT-PK----EVLNSSGYLRSFPG-IAKRNDVNGV-SRGVEYNTAQN--A---VYRV---- 63 (461) Q Consensus 1 ~~~~~~~~d-~~~~~pvn~~a~-~~----~~~~s~~~L~~~PG-l~~~~~v~G~-~rG~~y~~~~~--~---lY~V---- 63 (461) |.-+-.-.. .+.-.+-.++.+ |- .+...+-.++..-+ +....+++.. ..|....+-+. . -|+. T Consensus 380 l~a~~~~~g~tv~~~g~~~~i~~~~~~~~~s~~~~~~~~~~~~~V~~~~~LP~~~~~g~~v~V~~~~~~~d~yyv~~~~~ 459 (976) T protein:vir:10 380 IIATGNFTSANVQQIGTGLYVTRPSGTFNVTAPSSDLLRVMSGEVANVDDLPSQCKHGYVVKVANSEADADDYYVKFFGH 459 (976) T ss_pred hcccccccceEEEEcCcEEEEEecCcceEecCCCceeEEEEEeeecchhhhhhhccCCcEEEEecCCCCceeEEEEeecc Confidence 111000000 000011111111 00 01111112222222 1112333321 12222211111 1 1221 Q ss_pred ---eCCeEEecc----ceEEeecCcccEEEEeCCeEEEEEECCcEEEEEeecccccceeccccccccCccCCc-ccccee Q lcl|NC_018275. 64 ---LGSKLYKGE----TVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSDYTQYELGS-VRDITR 135 (461) Q Consensus 64 ---~G~~Ly~v~----~~iG~i~gsg~VsMa~N~~~~avv~~g~~~~Y~ydg~~~~~~~~~~d~~~~~~dl~~-~~~v~~ 135 (461) .|...|+=. ..+|--..+=|..+++.+.-...+..- .|.--....+.+| .+|.+ ++. +.+|+| T Consensus 460 ~~~~~~~~w~E~~~~g~~~g~~~~tmP~~l~~~~~g~f~~~~~---~w~~r~vGd~~tn-----p~psf-~g~~is~v~f 530 (976) T protein:vir:10 460 NNRDGDGVWEECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQA---TWQNAEVGDELTN-----PNPSF-VGKTINQLVF 530 (976) T ss_pred ccccccceEEEeeccccccccccccccEEEEecccCeEEeeec---cccccccCCcccC-----cCcee-cccccceEEE Confidence 111122200 011111111223333221111000000 0000000011111 12222 122 356899 Q ss_pred ccceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCC Q lcl|NC_018275. 136 LRGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGAT 204 (461) Q Consensus 136 ~dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~ 204 (461) ..+|++|..| +....|.-.|-. +-|+. +++.+..+++.|.-++.+++.|+||.+..- |..+|+. T Consensus 531 ~q~RL~f~s~--~~v~~Srtgd~~nF~~~t~~~~~DdD~I-~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e--~~lsg~~ 605 (976) T protein:vir:10 531 FRNRLVFLSD--ENVIMSRPGEFFNFWSKTATTFTPQDVI-DLSCSSTYPAIVYDGIQVNAGLLLFTKNQQ--FMLTTDS 605 (976) T ss_pred EcceEEEecC--CeEEEEecCCccccccccccCCCCCccE-EEEecCCcceeeEEEEecCCcEEEEecCce--EEEecCC Confidence 9999988742 344445433322 22333 266677788888889999999999887765 7777754 Q ss_pred CCccCcccccccceE--EeccccchhhhccCceEEEEeeccccceEEEEc------cCceeeecCCHHHHHHHHhcCccc Q lcl|NC_018275. 205 TAGAALYVAQPSLMV--QKGIAGTYCKTPFADSYAFISHPATGAPSVYII------GSGQASPIATASIEKIIRSYTADE 276 (461) Q Consensus 205 ~~~~fp~~~~~~~~I--~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~l------~g~q~~rIST~~IE~~i~~y~~~e 276 (461) + ++....-. +-. ..+|+..-.-..++++++|++..+. -..+|.+ .++.++.+ |.-+++.|+. . T Consensus 606 ~--~lTP~t~~-i~~~s~~~~~~~v~Pv~vG~~v~Fv~~~g~-~~r~~~~~~~~~~~~~~~~dl-t~~~~~l~~g----~ 676 (976) T protein:vir:10 606 D--ILSPETAK-INAVSSYNFNEKTHPVSLGTTVAFIDNANQ-FTRFFEMSNVVRQGEPDVVDQ-SKVISRLLDK----N 676 (976) T ss_pred c--eecceeEE-EEEEEeeeccCCCccEEeCCeEEEEecCCC-eEEEEEEeecccccccchhHH-HHHhhhhcCC----c Confidence 3 23322211 112 3578888888899999999987763 2334333 23333344 3333444432 2 Q ss_pred ccceEEEEEEECCEEEEEEEcC-CeEEEEEc---cccCCcceeeeecCCccccceEEEEEEecCCeEEEEEc-CCCeEEE Q lcl|NC_018275. 277 LATGVMEALRFDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQIACGDK-SEAVTGQ 351 (461) Q Consensus 277 l~~A~~~ty~~~GH~fyvlt~P-~~Tw~yD~---~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~~g~~~vGD~-~~G~l~~ 351 (461) ++ ..+|+.+-+.+.....- +.-.||-- .-.+..--||.-.++ ...+..|++ .+--|++=-+ .+|.+.+ T Consensus 677 ~~---~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~eq~v~aWsr~~~~---G~v~sv~~i-~D~ly~vV~r~~~g~~~r 749 (976) T protein:vir:10 677 IS---LVSVSRENSVVFFSQKDTDKIYCFRYFTSGEKRLLQAWTTWTIT---GNIQYHCML-DDALYVVTRNNNKDQIVK 749 (976) T ss_pred eE---EEEEcCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEecC---CcEEEEEEe-CCeEEEEEEecCCeEEEE Confidence 22 23456676655444433 34444321 222222247776663 356666665 3343333222 2222221 Q ss_pred E----cCC----------ccCcCCCE----------------------EEEEEeecc----------------------- Q lcl|NC_018275. 352 L----QFD----------ISSQYDKQ----------------------QEHLLFTPL----------------------- 372 (461) Q Consensus 352 l----d~~----------~~td~g~p----------------------~~~~~~tP~----------------------- 372 (461) + +.. ..++.+.+ ....+..|- T Consensus 750 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~t~~t~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 829 (976) T protein:vir:10 750 YSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSSVTAASNTYNTTTIKTTIPKPNGYESTKQLVAYDTDAGNDLGRYA 829 (976) T ss_pred EEEEECCccceeeeccCccccccCCcceeeeccceEEEeccccccCCceeEEeecCccccCceeEEEEecccCcccccce Confidence 1 100 00001111 000000000 Q ss_pred --------cc----CCCceE-----EEEEEEEE-------cCCCC--Cc-h-----hheee----------eccCccc-c Q lcl|NC_018275. 373 --------FK----ADNARC-----FDLEVESS-------TGVAQ--YA-D-----RLFLS----------ATTDGIN-Y 409 (461) Q Consensus 373 --------~~----~~~~rv-----~~~~le~~-------~Gv~q--~~-~-----~~~ls----------~sdDG~~-~ 409 (461) +. .+...| +..++|+. .|-+. .+ - ++-++ -..+|.. | T Consensus 830 ~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~ 909 (976) T protein:vir:10 830 LVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDF 909 (976) T ss_pred eeeecCCeeEecCCCCCCeEEEeeeeEEEEeecceeEEeCCCCcccccceeeEEEEEEEEEeecccceEEEEcCCCCccc Confidence 00 000011 01111110 01100 00 0 00010 0111110 1 Q ss_pred C-cceeeccCCCcc-----ccee-EEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 410 G-REQMIEQNEPFV-----YDKR-VLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 410 ~-~~~~~~~g~~g~-----y~~R-~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) - .++....+...- .+.. .++--.|. +--.++.+....|.++...++..| T Consensus 910 ~~~~~~~~~~~~~~~~~pl~~~~~~~vP~~~~---~~~~~v~i~~d~PlP~tilsi~~e 965 (976) T protein:vir:10 910 TETKELGLAGVVGASRLPIVPEVIETVPCYER---NTNLKVNVKSEHPAPATLYSLAWE 965 (976) T ss_pred cccccccccCcccccccceecCcEEEEEeccC---CceeEEEEEECCCCceEEEEEEEE Confidence 1 000000000000 0000 01111121 122455666677777777777777 No 32 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=43.75 E-value=0.83 Score=21.02 Aligned_cols=434 Identities=10% Similarity=0.043 Sum_probs=155.2 Q ss_pred CCCccccCceeEeeeecccc-----ccccccc-----ccceeEeCCCc----------eecccCCCcccceEEEEecCeE Q lcl|NC_018275. 1 MGKDFKNADYIDYLPINMLA-----TPKEVLN-----SSGYLRSFPGI----------AKRNDVNGVSRGVEYNTAQNAV 60 (461) Q Consensus 1 ~~~~~~~~d~~~~~pvn~~a-----~~~~~~~-----s~~~L~~~PGl----------~~~~~v~G~~rG~~y~~~~~~l 60 (461) -+|+++- +|+.-+-.... ++..+.. ++-.....+.. +......+...|..+....+.+ T Consensus 159 y~~~y~i--~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~~~~~~~~~~~~~~~ 236 (808) T protein:vir:88 159 YGRTLSI--TINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWI 236 (808) T ss_pred cCceEEE--EEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecccccceEEEeccceE Confidence 2222210 01110000000 0000000 00000000010 0000111111122222222333 Q ss_pred EEEe--CCeEEeccceEEeecCcccEEEE-------------eCCeEEEEEECC----cEEEEEeeccccc--------- Q lcl|NC_018275. 61 YRVL--GSKLYKGETVVGDVAGSGRVSMA-------------HGRTSQAVGVNG----QLVEYRYDGTVKT--------- 112 (461) Q Consensus 61 Y~V~--G~~Ly~v~~~iG~i~gsg~VsMa-------------~N~~~~avv~~g----~~~~Y~ydg~~~~--------- 112 (461) |.+. +.+.....+.-|. +++....+. .||..+.|...+ ...+++|+..... T Consensus 237 ~i~~~a~~~~~~~~t~~g~-~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~ 315 (808) T protein:vir:88 237 LINAPANDNVRQIATKDGY-ADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESARSGDNYWVQYDASGKVWKETAKPKI 315 (808) T ss_pred EEEeccCceeEEEcccCCc-CcceeeeeeeeccceeeccccCCCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeeeccc Confidence 3331 1111111110000 000111000 122222222111 1122233322111 Q ss_pred --------------------ce----ecc-------ccccccCccCCccccceeccceEEEEeeCCceEEEeccCCcc-- Q lcl|NC_018275. 113 --------------------VS----NWP-------ADSDYTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDES-- 159 (461) Q Consensus 113 --------------------~~----~~~-------~d~~~~~~dl~~~~~v~~~dGyfV~~~~gt~~f~iS~L~d~t-- 159 (461) +. .|. ..+.+|.+.-..+.+|+|..+|++|.-| +....|.-.|.. T Consensus 316 ~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~--~~v~~Srtgd~~nF 393 (808) T protein:vir:88 316 IAGFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLGFLSG--ENVVMSRTSKYFNF 393 (808) T ss_pred eeeecccceeEEEEecCCceEEEEecccccccccccccCccceecCCceeEEEEEcceEEEeeC--CeEEEEeccCcccc Confidence 00 010 0011333322345679999999998653 334444433322 Q ss_pred ---------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccCcccccccceE-Eeccccchhh Q lcl|NC_018275. 160 ---------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMV-QKGIAGTYCK 229 (461) Q Consensus 160 ---------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG~~~~~~fp~~~~~~~~I-~~Gca~~~sv 229 (461) +-|+. ++..+..+++.|.-++.+++.|+||.+..- |..+|+. ++....-.--.+ ..||+..-.- T Consensus 394 ~~~t~~~~~DdD~i-~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e--~~l~~~~---~lTP~~~~~~~~s~~~~~~~~~P 467 (808) T protein:vir:88 394 FPSSVATLSDDDPI-DVAISHNRISILKYAVPFSEQLLLWSDQAQ--FVLSSKT---ILSSKTIELDLTTEFDVSDGARP 467 (808) T ss_pred cCCcccCCCCCccE-EEEecCCccceeeEEeecCCcEEEEecCcE--EEEeCCC---cccceeEEEEEEEEecccCCCCc Confidence 22232 255566667777778999999999966554 7777642 233322211122 4699888888 Q ss_pred hccCceEEEEeeccccceEEEE-------ccCceeeecCCHHHHHHHHhcCcccccceEEEEEEECCEE----------E Q lcl|NC_018275. 230 TPFADSYAFISHPATGAPSVYI-------IGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHE----------L 292 (461) Q Consensus 230 ~~~~~s~~wlg~d~~g~~~Vy~-------l~g~q~~rIST~~IE~~i~~y~~~el~~A~~~ty~~~GH~----------f 292 (461) ..++++++|++..+..- .|+| .++|+++.+| .-++..|+.- +. .+++++.+.+. + T Consensus 468 v~vG~~v~f~~~~g~~~-~v~r~~~~~~~~d~y~~~dlt-~~~~h~~~~~----~~--~~~~~~~~~~~~v~~~~~~g~l 539 (808) T protein:vir:88 468 YGIGRGVYFAAPRASFT-SLKRYYAIQDVSDVKSAEDVS-AHVPSYITNT----VH--AIHGSGTENFVSILSDGSPNKV 539 (808) T ss_pred eEeCCeEEEEecCCCee-EEEEEEEeeeccCceehhhHH-HHHHHhcCCC----eE--EEEEeCCCCeEEEEEEcCCCEE Confidence 89999999999886321 2222 4567777774 3455565442 11 12223333333 4 Q ss_pred EEEEc----CC-----e-EEEEEcc--------ccCCcceeeeecC--Cccccce----------------EEEE----- Q lcl|NC_018275. 293 LIIHL----PR-----H-VLVYDAS--------SSQNGPQWCVLKT--GLYDDVY----------------RAID----- 331 (461) Q Consensus 293 yvlt~----P~-----~-Tw~yD~~--------t~~w~e~w~~~~t--g~~~~~~----------------Ra~~----- 331 (461) |++++ +. | .|-++.. ....-+-|.+-+- +.+.+|. ...+ T Consensus 540 ~~~~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~ 619 (808) T protein:vir:88 540 FIYKFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQHTIDYSIEPYRTYMDMKKTIV 619 (808) T ss_pred EEEEEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccCCCCCccccceeeeeeeeeec Confidence 44443 21 0 1222110 0000011221111 0111110 0000 Q ss_pred ------------------------------EEecCCeEEEEEcCCC---eEEEEcCCccC---cCCCEEEEEEeecc--c Q lcl|NC_018275. 332 ------------------------------FMYEGNQIACGDKSEA---VTGQLQFDISS---QYDKQQEHLLFTPL--F 373 (461) Q Consensus 332 ------------------------------~~~~~g~~~vGD~~~G---~l~~ld~~~~t---d~g~p~~~~~~tP~--~ 373 (461) .+-.+|..+.+|..+. .-..+..+... .=|-++++.++++. + T Consensus 620 ~g~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~ 699 (808) T protein:vir:88 620 LGAYNIDTNLTSFDVRTAYGGTPGPESTFYTIDQQGVLIEHEARDWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLI 699 (808) T ss_pred cccccCccccceeecccccccccccceeEEEEcCCceEEeeecccccCcceEEeCCCccCceEEEeeeeeEEEEecceEE Confidence 0111111111111110 00111111100 01555566655522 2 Q ss_pred cCC-C-----------ceEEEEEEEEE-cC-----CCC-CchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEe Q lcl|NC_018275. 374 KAD-N-----------ARCFDLEVESS-TG-----VAQ-YADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGR 434 (461) Q Consensus 374 ~~~-~-----------~rv~~~~le~~-~G-----v~q-~~~~~~ls~sdDG~~~~~~~~~~~g~~g~y~~R~~~~rlG~ 434 (461) ... + .||....+.+. +| |.. ..+.+ -...|..++.+.+. |.+-.+..-+++.-.|. T Consensus 700 ~~~~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~tg~~~vp~~~~ 774 (808) T protein:vir:88 700 KQTADDGSTSTEDIGRLQLRRAWLNYEESGAFEINVNNGSSEFV---YVMTGGRLGIQRVL--GELSVGTGQFKFPVTGN 774 (808) T ss_pred ecCCCCcceeecccceEEEEEEEEEeecccceEEEeCCCcccce---eeccCcccCccccc--CccccccceEEEEeccc Confidence 211 1 13332222221 11 111 11111 12246666655432 22222222223333354 Q ss_pred cccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 435 IRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 435 ~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) .++ .+|++....|.++...++..| T Consensus 775 ~~~---~~v~i~~d~P~P~tilsi~~e 798 (808) T protein:vir:88 775 AVN---QRVTITSSNPNPLNVIGCGWE 798 (808) T ss_pred Cce---eEEEEEECCCCceEEEEEEEE Confidence 332 345666777777777777777 No 33 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=37.44 E-value=1.1 Score=20.32 Aligned_cols=438 Identities=11% Similarity=0.052 Sum_probs=157.4 Q ss_pred CCCccccCc-----eeEeeeecccccccccccccc-eeEeCC-CceecccCCCcc-cceEEEE-----ecCeEEEE---- Q lcl|NC_018275. 1 MGKDFKNAD-----YIDYLPINMLATPKEVLNSSG-YLRSFP-GIAKRNDVNGVS-RGVEYNT-----AQNAVYRV---- 63 (461) Q Consensus 1 ~~~~~~~~d-----~~~~~pvn~~a~~~~~~~s~~-~L~~~P-Gl~~~~~v~G~~-rG~~y~~-----~~~~lY~V---- 63 (461) -..++...- ||...+.+-..+......... .+..+- .+....+++..+ .|..-.+ .+..-|-| T Consensus 207 s~a~~~~~~~g~~~~i~~~~~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~g~~~~d~y~v~~~~ 286 (803) T protein:vir:70 207 KIADYEIQLDGTSIYITRRDGSTTFDITTEDGAKGKDLVAIKYKVASTDLLPSRAPEGYKVQVWPTGSKPESRYWLQAEK 286 (803) T ss_pred cccceEEEECCcEEEEEEcCCCCeeEEEeecCcCCcEEEEEEecccceeeccccCCCCceEEEEcCCCCCCceeeEEEEe Confidence 112222111 233333221111111101111 111100 001111111111 1110001 11112221 Q ss_pred eC--CeEEeccceEEee----cCcccEEEEeCCeEEEEEE-CCcEEEEEeecccccceec--cccccccCcc-CCccccc Q lcl|NC_018275. 64 LG--SKLYKGETVVGDV----AGSGRVSMAHGRTSQAVGV-NGQLVEYRYDGTVKTVSNW--PADSDYTQYE-LGSVRDI 133 (461) Q Consensus 64 ~G--~~Ly~v~~~iG~i----~gsg~VsMa~N~~~~avv~-~g~~~~Y~ydg~~~~~~~~--~~d~~~~~~d-l~~~~~v 133 (461) .+ ..-++-....|.. ..+.|..+.+ ++.+. .+.......+........- .....|-+.. -.-+.+| T Consensus 287 ~~~~~~~w~e~a~~g~~~~~~~~t~p~~~v~----~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v 362 (803) T protein:vir:70 287 QNGNIVSWKETLAADVLIGFDKSTMPYIIER----TGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGM 362 (803) T ss_pred ccCCccceEeeeccceeeeeecccccEEEEE----EEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeE Confidence 00 0111100011111 1122222211 11110 1111111222222211100 0111222211 1236779 Q ss_pred eeccceEEEEeeCCceEEEeccCCcc-----------ccccCcceeEEecCCCceEEEEecCCEEEEEEcceEEEEEecC Q lcl|NC_018275. 134 TRLRGRYAWSKDGTDSWFITDLEDES-----------HPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTG 202 (461) Q Consensus 134 ~~~dGyfV~~~~gt~~f~iS~L~d~t-----------~~d~~~~f~tAE~~PD~iv~v~~~~~~l~lfG~~T~Evw~ntG 202 (461) +|..+|++|..| +....|.-.|.. +-|+. ++..+..+++.|.-++.+++.|+||.+..- |..+| T Consensus 363 ~f~q~RL~f~~~--~~v~~Srtgd~~nF~~~t~~~~~DdD~I-~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q--~~l~g 437 (803) T protein:vir:70 363 FMVQNRLCVTAG--EAVIATRTSYFFDFFRYTAVSAVATDPF-DVFSDASEVYQLKHAVTLDGSTVLFADKSQ--FILPG 437 (803) T ss_pred EEEeceEEEeeC--CeEEEEccCCccccccccccCCCCCccE-EEEecCCcceeeEEEeecCCcEEEEecCcE--EEEeC Confidence 999999998753 333344333322 22333 266667777888889999999999966554 77776 Q ss_pred CCCCccCcccccccceE-EeccccchhhhccCceEEEEeeccccceEEEE------ccCceeeecCCHHHHHHHHhcCcc Q lcl|NC_018275. 203 ATTAGAALYVAQPSLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI------IGSGQASPIATASIEKIIRSYTAD 275 (461) Q Consensus 203 ~~~~~~fp~~~~~~~~I-~~Gca~~~sv~~~~~s~~wlg~d~~g~~~Vy~------l~g~q~~rIST~~IE~~i~~y~~~ 275 (461) +. ++....-.--.+ ..+|+..-.-..++++++|++..+. --.|+. -++|+++.+|- -++.+|+. T Consensus 438 ~~---~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~-~s~vre~~~~~~~d~y~a~Dlt~-~a~hl~~~---- 508 (803) T protein:vir:70 438 DK---PLEKSNVLLKPVTTFEVNNNVKPVATGESVMFATSEGA-YSGIREFYTDSYSDTKKAQAITS-HVNKLLEG---- 508 (803) T ss_pred CC---cccceeEEEEEEEEeeccCCCccEEeCCeEEEeccCCC-eeEEEEEeccccccceehhhhhh-hhHhhcCC---- Confidence 42 233322111112 3578877777899999999998752 123433 36777777743 34556654 Q ss_pred cccceEEEEEEECCEEEEEEEcC-CeEEEEEc---cccCCcceeeeecCCccccceEEEEEEec-CCeEEEEEcC-CCe- Q lcl|NC_018275. 276 ELATGVMEALRFDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYE-GNQIACGDKS-EAV- 348 (461) Q Consensus 276 el~~A~~~ty~~~GH~fyvlt~P-~~Tw~yD~---~t~~w~e~w~~~~tg~~~~~~Ra~~~~~~-~g~~~vGD~~-~G~- 348 (461) .+... ...+.+.+.++....- +.-.||-- .-.|..--||.-.++ ...+..|+++. +.-|++-... +|. T Consensus 509 ~v~~~--~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v~aW~r~~~~---g~~~~~~~~~~~d~l~~vv~r~~~g~~ 583 (803) T protein:vir:70 509 NVIMM--SASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQAAWHKWEWP---LGTFIRGMFYSGEHLYLLIERGSTGVY 583 (803) T ss_pred ceEEE--EEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEEcC---CCEEEEEEEecCCEEEEEEEECCCeEE Confidence 22221 1123334333222222 23334432 222222248877773 35666666653 3446666654 343 Q ss_pred EEEEcCCccCcCCCEEEEEEe---e---ccccCCCc-----------eEEEEEEEEEcCCCCCchh-heeeeccCccccC Q lcl|NC_018275. 349 TGQLQFDISSQYDKQQEHLLF---T---PLFKADNA-----------RCFDLEVESSTGVAQYADR-LFLSATTDGINYG 410 (461) Q Consensus 349 l~~ld~~~~td~g~p~~~~~~---t---P~~~~~~~-----------rv~~~~le~~~Gv~q~~~~-~~ls~sdDG~~~~ 410 (461) |-+|+....++.+.+....+- + +....... .+-.++.-+..|....... +.....+.+.++. T Consensus 584 ier~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~t~~ 663 (803) T protein:vir:70 584 LERMDMGDALVYNLNDRIRMDRQAELIFRHIKAEDVWVSEPLPWQPTDVTLLDCVLIDGWDSYIGGSFLFSYNPGDNTLT 663 (803) T ss_pred EEEEecccccccCCcceeEeccceeEeeccccCCceeeeecccccCcccceeeEEEeeeeeeecCCeEEEEEcCCCccce Confidence 456776665555544332211 1 11111110 1111121122222221111 1111111122221 Q ss_pred cc-e---------eeccCCCcccceeEE---E------------EeeEecccc----eeEEEEEEecCcce-----EEEe Q lcl|NC_018275. 411 RE-Q---------MIEQNEPFVYDKRVL---W------------KRVGRIRRL----IGFKLRVITKSPVT-----LSGC 456 (461) Q Consensus 411 ~~-~---------~~~~g~~g~y~~R~~---~------------~rlG~~r~~----v~f~~r~~~~~p~~-----l~ga 456 (461) -+ - .+-.|.+-+..-+.. . .|+.++.-+ -.|++++....... .++. T Consensus 664 ~~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~~~~~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~ 743 (803) T protein:vir:70 664 TTFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERVSYIDVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNR 743 (803) T ss_pred eeeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCccccccccEEEEEEEEeecccceEEEEecCCccccceeeccch Confidence 11 0 111222211111000 0 011111111 11333322211110 0000 Q ss_pred EE-----EeC Q lcl|NC_018275. 457 QI-----RLE 461 (461) Q Consensus 457 ~~-----~~e 461 (461) .+ .++ T Consensus 744 ~~g~~~~~~g 753 (803) T protein:vir:70 744 VGGAINNIVG 753 (803) T ss_pred hccccccccC Confidence 00 000 No 34 >protein:vir:100244 Length: 109 # NCBI annotation: gp73 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355409;genbank:gi:77864699;genbank:GeneID:3725966 Probab=25.04 E-value=1.1 Score=20.37 Aligned_cols=42 Identities=14% Similarity=-0.009 Sum_probs=33.2 Q ss_pred cCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 417 QNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 417 ~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |=++|+.++|+.+++....++..|+.+ ......+.-|||.++ T Consensus 1 mm~~g~L~~rI~i~~~~~~~d~~G~~~---~~~w~~~~~~wA~i~ 42 (109) T protein:vir:10 1 MLRSSDLTEFIVIERKGGRTNENGEPL---PDDWVTHDEVWASVR 42 (109) T ss_pred CCCccccCccEEEEeeeeccCCCCCee---ccceeeEEEEEEEEE Confidence 557799999999999998888877543 334556778999998 No 35 >protein:vir:5977 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:788 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690677;genbank:geneid:6329133;genbank:gi:22855071;interpro:IPR013045;uniprot:O48446;genbank:GeneID:955315 Probab=21.25 E-value=1.5 Score=19.54 Aligned_cols=39 Identities=8% Similarity=-0.160 Sum_probs=31.8 Q ss_pred CCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 419 EPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 419 ~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) +..++..|+++++.-..++..|....- =+.+.-+|+.+| T Consensus 1 ~~~~L~~RI~i~~~~~~~D~~G~~~~~----w~~~~~~WA~v~ 39 (109) T protein:vir:59 1 MYEEFPDVITFQSYVEQSNGEGGKTYK----WVDEFTAAAHVQ 39 (109) T ss_pred CccccCccEEEEeeeeeeCCCCCeeee----eEeeEEEEEEEe Confidence 899999999999999999888877751 223456888888 No 36 >protein:vir:193 Length: 112 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037703;genbank:gi:9634168;genbank:GeneID:1262533 Probab=20.49 E-value=1.7 Score=19.34 Aligned_cols=40 Identities=15% Similarity=0.037 Sum_probs=32.2 Q ss_pred cCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 417 QNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 417 ~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |. +|+.++|+.++|....++..|.... .-..+.-|||.++ T Consensus 1 M~-~G~L~~rI~i~~~~~~~d~~G~~~~----~w~~~~~~wA~v~ 40 (112) T protein:vir:19 1 ME-PGRFRNRVKILTFTTSRDPSGQPVE----SWTGGNPVPAEVK 40 (112) T ss_pred CC-ccccCccEEEEeeeeeeCCCCCeec----ceEeEEEEEEEEE Confidence 44 8999999999999988887775543 4556778999998 No 37 >protein:vir:100134 Length: 109 # NCBI annotation: gp8 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945038;genbank:gi:38707898;genbank:GeneID:2744181 Probab=20.34 E-value=1.7 Score=19.37 Aligned_cols=42 Identities=12% Similarity=-0.041 Sum_probs=33.0 Q ss_pred cCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_018275. 417 QNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 461 (461) Q Consensus 417 ~g~~g~y~~R~~~~rlG~~r~~v~f~~r~~~~~p~~l~ga~~~~e 461 (461) |=++|+.++|+.++|.-..++-.|..+ .+.-+.+.-|||.++ T Consensus 1 mm~~G~L~~rI~i~~~~~~~d~~G~~~---~~~w~~~~~~wA~v~ 42 (109) T protein:vir:10 1 MLKAGELTERITIEKRGGGVNENGEPL---PGDWVEHASVWANVR 42 (109) T ss_pred CCCccccCccEEEEeeeeeeCCCCCee---ccceEEEEEEEEEEE Confidence 556788999999999998888877433 233567788999999 Done!