Query lcl|NC_011802.1_cdsid_YP_002455894.1 [gene=orf58] [protein=Gp10] [protein_id=YP_002455894.1] [location=31961..33379] Match_columns 472 No_of_seqs 38 out of 48 Neff 5.2 Searched_HMMs 1612 Date Thu Nov 7 13:22:58 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_58 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_58_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:9268 Length: 472 # 100.0 2E-227 1E-230 1263.4 49.5 472 1-472 1-472 (472) 2 protein:vir:100960 Length: 472 100.0 3E-227 2E-230 1262.5 49.2 472 1-472 1-472 (472) 3 protein:vir:2109 Length: 472 # 100.0 2E-224 1E-227 1247.0 49.8 472 1-472 1-472 (472) 4 protein:vir:105428 Length: 472 100.0 4E-221 3E-224 1228.8 48.9 471 1-472 1-472 (472) 5 protein:vir:177 Length: 472 # 100.0 1E-220 8E-224 1226.0 49.5 471 1-472 1-472 (472) 6 protein:vir:3529 Length: 477 # 100.0 1E-216 8E-220 1204.2 48.1 469 1-472 7-477 (477) 7 protein:vir:105525 Length: 472 100.0 3E-212 2E-215 1179.9 48.3 468 1-472 1-472 (472) 8 protein:vir:108312 Length: 458 100.0 3E-186 2E-189 1038.1 44.1 441 1-472 1-458 (458) 9 protein:vir:8837 Length: 513 # 99.9 3E-26 1.9E-29 160.5 35.6 436 1-472 1-507 (513) 10 protein:vir:352 Length: 536 # 99.2 4.3E-10 2.6E-13 72.0 28.5 431 1-472 2-531 (536) 11 protein:vir:95475 Length: 771 98.9 1.3E-08 8E-12 63.8 26.1 422 1-472 222-766 (771) 12 protein:vir:3133 Length: 911 # 98.9 2.1E-08 1.3E-11 62.7 28.1 432 1-472 177-732 (911) 13 protein:vir:2625 Length: 715 # 97.9 1.4E-05 8.6E-09 47.2 27.6 443 1-472 129-710 (715) 14 protein:vir:102644 Length: 594 95.0 0.003 1.8E-06 34.4 35.1 446 1-472 1-590 (594) 15 protein:vir:1778 Length: 680 # 93.3 0.0082 5.1E-06 32.0 24.6 314 1-338 319-680 (680) 16 protein:vir:95324 Length: 823 93.1 0.0086 5.4E-06 31.9 35.6 410 1-472 176-665 (823) 17 protein:vir:105647 Length: 800 92.1 0.013 7.9E-06 31.0 33.9 440 1-472 207-753 (800) 18 protein:vir:107802 Length: 681 89.4 0.026 1.6E-05 29.2 35.5 423 1-472 129-678 (681) 19 protein:vir:107423 Length: 681 89.4 0.026 1.6E-05 29.2 35.5 423 1-472 129-678 (681) 20 protein:vir:98487 Length: 681 89.4 0.026 1.6E-05 29.2 35.5 423 1-472 129-678 (681) 21 protein:vir:7329 Length: 825 # 87.0 0.042 2.6E-05 28.1 34.7 425 1-472 164-681 (825) 22 protein:vir:2203 Length: 794 # 86.3 0.046 2.9E-05 27.9 39.1 443 1-472 174-739 (794) 23 protein:vir:8887 Length: 808 # 85.2 0.055 3.4E-05 27.5 38.4 447 1-472 160-798 (808) 24 protein:vir:80253 Length: 777 80.8 0.091 5.7E-05 26.3 32.3 415 1-472 211-693 (777) 25 protein:vir:7021 Length: 803 # 76.5 0.13 8.3E-05 25.4 37.7 443 1-472 198-753 (803) 26 protein:vir:78703 Length: 905 75.5 0.15 9E-05 25.2 35.6 437 1-472 317-846 (905) 27 protein:vir:103790 Length: 768 62.5 0.33 0.00021 23.2 37.8 435 1-472 198-763 (768) 28 protein:vir:100022 Length: 976 58.5 0.41 0.00026 22.7 37.7 435 1-472 364-965 (976) 29 protein:vir:94583 Length: 792 47.1 0.71 0.00044 21.4 33.8 442 1-472 169-737 (792) 30 protein:vir:6326 Length: 826 # 44.5 0.81 0.0005 21.1 35.4 438 1-472 219-816 (826) 31 protein:vir:99677 Length: 794 41.1 0.94 0.00058 20.7 37.9 441 1-472 198-740 (794) 32 protein:vir:827 Length: 567 # 38.0 1.1 0.00067 20.4 30.3 397 1-472 125-563 (567) 33 protein:vir:3366 Length: 801 # 33.9 1.3 0.00082 19.9 36.1 438 1-472 198-791 (801) 34 protein:vir:78957 Length: 826 33.7 1.3 0.00083 19.9 34.9 428 1-472 262-776 (826) 35 protein:vir:105563 Length: 396 31.9 1.5 0.00091 19.7 15.3 251 1-307 117-396 (396) 36 protein:vir:10145 Length: 567 31.4 1.5 0.00093 19.6 30.5 397 1-472 125-563 (567) 37 protein:vir:3306 Length: 567 # 31.4 1.5 0.00093 19.6 30.5 397 1-472 125-563 (567) 38 protein:vir:2792 Length: 567 # 31.4 1.5 0.00093 19.6 30.5 397 1-472 125-563 (567) 39 protein:vir:9979 Length: 567 # 31.4 1.5 0.00093 19.6 30.5 397 1-472 125-563 (567) 40 protein:vir:100244 Length: 109 24.1 1.2 0.00073 20.2 3.6 42 428-472 1-42 (109) 41 protein:vir:5977 Length: 109 # 20.5 1.7 0.001 19.4 3.6 39 430-472 1-39 (109) 42 protein:vir:193 Length: 112 # 20.0 1.8 0.0011 19.2 3.7 40 428-472 1-40 (112) No 1 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=100.00 E-value=2e-227 Score=1263.41 Aligned_cols=472 Identities=97% Similarity=1.476 Sum_probs=468.8 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |+||||||++|++++++++|++|++||||||+||++++|+++||++||++++++++|++||+.||++++.||+|+|++|| T Consensus 1 m~~~~ipl~~g~~~~~~~a~~~~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~a~~~G~~RG~~~~~~~~~ly~V~G~~Ly 80 (472) T protein:vir:92 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLDSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVCGGKLY 80 (472) T ss_pred CceeeccccccccccCccCcceeeeecccccccccccccccceeecccceeecCCCCcccceeeeeeCCeEEEEeCcceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) |+++++|+|+|+|||||+||+++|+|++++++++|+||++++++++||+|++|+++|+++++||||+||||||++||+++ T Consensus 81 ~v~~~iG~i~gsgrVsMa~n~~~~av~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~ 160 (472) T protein:vir:92 81 KGEAVVGDVAGSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) T ss_pred EEEeeEeeccCcccEEEecCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEecceEEEccCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecccceeeeccccchhh Q lcl|NC_011802. 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCK 240 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv 240 (472) ||||+|+|+++++++..||+||++||+||+++++|++|||||++|||||+|||++|...|||+|+||+|||+||||++|| T Consensus 161 ~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv 240 (472) T protein:vir:92 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCK 240 (472) T ss_pred EEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchh Confidence 99999999999988778999999999999999999999999999999999999999889999999999999999999999 Q ss_pred eecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEecccc Q lcl|NC_011802. 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYDASSS 320 (472) Q Consensus 241 ~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD~~t~ 320 (472) |+++|++|||||+++|+++||+++||||||||||+||++|++|+++|+++|++|+||||||+||+||||+||||||++|+ T Consensus 241 ~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~~e~~~a~~~s~~~eGH~fy~LtfP~~Tw~yD~at~ 320 (472) T protein:vir:92 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRFDSHELLIIHLPRHVLVYDASSS 320 (472) T ss_pred hecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCcchhceeeEEEEEecCeeEEEEEcCCceEEEEcccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEEEc Q lcl|NC_011802. 321 QNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESST 400 (472) Q Consensus 321 ~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~~~ 400 (472) |||||||++++|++++|||++++|+++||+||||++||+||+||+|.++|+|+|++|++++|++|+||+|+|++|||+++ T Consensus 321 ~~~e~W~~~~sg~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~l~~~~~t~~~~~~~~~~~~P~~~~dn~R~~d~eve~~~ 400 (472) T protein:vir:92 321 QNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESST 400 (472) T ss_pred cCCceeeeecCCCcccceeEEEEEeeCCeEEEEEcCCCeEEEEeccccccCCCcceEEEEeceEecCCCEEEEEeeeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 401 GVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 401 Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) ||+|++|+|||||||||++||||+|+++|++|||+||++||||||||+||||||||++|+||+|+||+|||| T Consensus 401 Gv~q~~d~v~L~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:92 401 GVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred CCCCcCceEEEEeeccccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=100.00 E-value=2.9e-227 Score=1262.50 Aligned_cols=472 Identities=96% Similarity=1.466 Sum_probs=468.8 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |+||||||++|++++++++|++|++||||||+||+++||+++||++||++++++++|++||+.||+++++||+|+|++|| T Consensus 1 m~~~~ipl~~g~~~~~~~a~~~~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~a~~~G~~RG~~~~~~~~~ly~V~G~~Ly 80 (472) T protein:vir:10 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAVYRVCGGKLY 80 (472) T ss_pred CceeecccccccccCCCcCcceeeeeeccccccccccccccceeecccceeecCCCCcccceeeeeeCCeEEEEeCcceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) |+++++|+|+|+|||||+||+++|+||+++++++|+||++++++++||+|++|+++|+++++||||+||||||++||+++ T Consensus 81 ~v~~~iG~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~ 160 (472) T protein:vir:10 81 KGEAVVGDVAGSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) T ss_pred EEEeeEeeccCcccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEecceEEEccCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecccceeeeccccchhh Q lcl|NC_011802. 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCK 240 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv 240 (472) ||||+|+|+++++++..||+||++||+||+++++|++|||||++|||||+|||++|...|||+|+||+|||+||||++|| T Consensus 161 ~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~i~~Gcaa~~sv 240 (472) T protein:vir:10 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCK 240 (472) T ss_pred EEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchh Confidence 99999999999988778999999999999999999999999999999999999999889999999999999999999999 Q ss_pred eecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEecccc Q lcl|NC_011802. 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYDASSS 320 (472) Q Consensus 241 ~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD~~t~ 320 (472) |+++|++|||||+++|+++||+++||||||||||+||++|++|+++|+++|++|+||||||+||+||||+||||||++|+ T Consensus 241 ~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~~e~~~A~~~t~~~~GH~fy~LtfP~~Tw~yD~at~ 320 (472) T protein:vir:10 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTAEELATGVMETLRFDSHELLIIHLPRHVLVYDASSS 320 (472) T ss_pred hecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCCccccceEEEEEEeCCeEEEEEEcCCeeEEEEcccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEEEc Q lcl|NC_011802. 321 QNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESST 400 (472) Q Consensus 321 ~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~~~ 400 (472) |||||||++++|++++|||++++|+++||+||||++||+||+||+|+++|+|+|++|++++|++|+||+|+|++|||+++ T Consensus 321 ~w~erw~~~~~g~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~ld~~~~t~~g~~~~~~~~~p~l~~dn~R~~d~eve~~~ 400 (472) T protein:vir:10 321 QNGPQWCVLKTGLYDDVYRAVDFMYEGNQITCGDKSEALTGQLQFDISSQYGLQQEHLLFTPLFKADNARCFDLEVESST 400 (472) T ss_pred cccceeeeecCCCcccceeEEEEEeeCCeEEEEEcCCCeEEEEecccCCCCCCcccceEEcccccCCCCEEEEEeeeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 401 GVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 401 Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) ||+|++|+|||||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|+||+|||| T Consensus 401 Gv~~~~d~v~L~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:10 401 GVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred CCCCcCcEEEEEeeccccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=100.00 E-value=1.9e-224 Score=1247.04 Aligned_cols=472 Identities=95% Similarity=1.458 Sum_probs=468.6 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |+||||||++|++|+++++|+++.+||||||+||+++++++|||++||++++++++|++||+.++++++.||+|+|++|| T Consensus 1 m~~~q~Pl~~g~~~~~~~~d~~~~~pVN~~a~~~~~~~s~~~lr~tPG~~~~~~~~g~~RG~~~~t~~~~ly~V~G~~LY 80 (472) T protein:vir:21 1 MPIQQLPMMKGMGKDFKNADYIDYLPVNMLATPKEILNSSGYLRSFPGITKRYDMNGVSRGVEYNTAQNAVYRVCGGKLY 80 (472) T ss_pred CceEEeeccccccccccccceeeeeeeeeeeeccCCcccceeeeecCCcceeccCCCceeeeeecccCCeEEEEeCCceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) ++++++|+|+|+|||||+||+++|+||+++++++|+||++++++++||+|++|+++|+++++||||+||||||++||+++ T Consensus 81 ~v~~~~G~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~ 160 (472) T protein:vir:21 81 KGESEVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) T ss_pred EEeeeeeeecccccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEecceEEEccCCcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecccceeeeccccchhh Q lcl|NC_011802. 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCK 240 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv 240 (472) ||||+|+|+++++++.+|||||++||+||+++++|++|||||++|||||+|||++|...|||+|+||+|||+||||++|| T Consensus 161 f~is~l~d~~~~~~y~~FatAE~~pD~Iv~i~~~~~~l~lfG~~TiEvw~ntG~ad~~~fpy~r~~g~~iq~Gcaa~~sv 240 (472) T protein:vir:21 161 WFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCK 240 (472) T ss_pred eEEecCCCCccccCCccceeeccCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcCceEEcCcceeeecccCcchh Confidence 99999999999998778999999999999999999999999999999999999999889999999999999999999999 Q ss_pred eecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEecccc Q lcl|NC_011802. 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYDASSS 320 (472) Q Consensus 241 ~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD~~t~ 320 (472) |+++|++|||||+++|+++||+++||||||||||+||++|++|+++|+++|++|+||||||+||+||||+||||||++|+ T Consensus 241 ~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~~e~~~A~~~t~~~eGH~fy~LtfP~~Tw~yD~at~ 320 (472) T protein:vir:21 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTAEEMATGVMETLRFDSHELLIIHLPRHVLVYDASSS 320 (472) T ss_pred hecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCCccccceEEEEEEeCCeEEEEEEcCCeeEEEEcccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEEEc Q lcl|NC_011802. 321 QNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESST 400 (472) Q Consensus 321 ~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~~~ 400 (472) |||||||++++|+++++||++++|+++||+||||++||+||+|+|+..+++++|+|+++++|++|+||+|+|++|||+++ T Consensus 321 ~~~e~W~~~~sg~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~L~fd~~~~~d~~~~~~r~~p~~~~dn~R~fd~eve~~~ 400 (472) T protein:vir:21 321 QNGPQWCVLKTGLYDDVYRGVDFMYEGNQITCGDKSEAVVGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVESST 400 (472) T ss_pred ccCceeeeeccCCCcCceeEEEEEeeCCeEEEEEcCCCeEEEEEecccccCCCcCcEEEEccceeCCCCEEEEEeeeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 401 GVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 401 Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) ||+|++|+|||||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|+||+|||| T Consensus 401 Gv~q~~d~v~L~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:21 401 GVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred CCCCcCcEEEEEeeccccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=100.00 E-value=4.1e-221 Score=1228.80 Aligned_cols=471 Identities=91% Similarity=1.413 Sum_probs=464.8 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |+||||||++|++++++++|+++++||||||+|++.++++++|||+|||+++++|+|++||+.|++.++.||+|+|++|| T Consensus 1 m~~~~~pl~~G~~~~~~~~d~~~~~pVN~~a~~~~~~~s~~~l~~tPGl~~~a~v~G~~RG~~~~~~~g~lY~V~G~~LY 80 (472) T protein:vir:10 1 MPIQQLPLMKGVGKDFRNADYIDYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGKLY 80 (472) T ss_pred CCeeeeeeccCceeeccccchhheeeeeeeeeccCCCcccceeecCCCceeeccCCccccceEEEeeCCeEEEEecceEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) ++++++|+|+|+|||||+||+++|+|+++|++++|+||+++++..+||+|+.||++|+++++||||+||||||++||+++ T Consensus 81 ~v~~~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~yd~~v~t~~~~~~d~~~p~~dlg~~~dv~f~dGyfV~~~~Gt~~ 160 (472) T protein:vir:10 81 KGESEVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) T ss_pred eeecceecccCcccEEEecCCcEEEEEECCceeEEEeeccchhhhccccccccccccccceeeeeeecceEEEeccCcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecccceeeeccccchhh Q lcl|NC_011802. 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCK 240 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv 240 (472) ||||+|+|++++++++.||+||++||+||+++++|++|||||++|||||+|||++|+.+|||+|+||+|||+||||++|| T Consensus 161 ~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv 240 (472) T protein:vir:10 161 WFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQPSLMVQKGIAGTYCK 240 (472) T ss_pred EEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcccCceeecccceeeecccCcchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEecccc Q lcl|NC_011802. 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYDASSS 320 (472) Q Consensus 241 ~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD~~t~ 320 (472) |+++|++||||||++|+++||+++||||||||||+||++|++|+++|+++|++|+||||||+||+||||+||||||++|+ T Consensus 241 ~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~~Tw~yD~~t~ 320 (472) T protein:vir:10 241 TPFADSYAFISNPATGAPSVYIIGSGQVSPIASASIEKILRSYTADELADGVMESLRFDAHELLIIHLPRHVLVYDASSS 320 (472) T ss_pred hecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCCceeEeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEEEc Q lcl|NC_011802. 321 QNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESST 400 (472) Q Consensus 321 ~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~~~ 400 (472) +|||||+++++|++++|||++|+|+++||+||||++||+||+||+|+++|+|+|++|++++|++|+||+|+|++|||+++ T Consensus 321 ~Wherw~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~l~~~~~td~G~~i~~~~~~p~~~~d~~Rv~d~~ve~~~ 400 (472) T protein:vir:10 321 ANGPQWCVLKTGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYGLQQEHLLFTPLFKADNARCFDLEVESST 400 (472) T ss_pred cCceeeeeecCCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCcCCCcceEEEeccceeCCCCeEEEEEEEeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCchhhe-eeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 401 GVAQYADRLF-LSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 401 Gv~~~~~~~~-l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) |++|.+++++ |.||| |.+||+|+|+++|++|||++|++||||||||+||||||||++|+||+|+||||||| T Consensus 401 G~~~~adp~~~~~~sD-g~~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~vgf~~r~~~~~~v~l~ga~~~~e 472 (472) T protein:vir:10 401 GVAQYADRLFLSATTD-GINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKSPVTLSGAQIRIE 472 (472) T ss_pred CCCcccCceEEEeccC-CcccchhhhhhhccCcccccceeeeeeeeccccceEEEEEEeccccceeeeeEEeC Confidence 9999877755 55555 99999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=100.00 E-value=1.3e-220 Score=1226.02 Aligned_cols=471 Identities=91% Similarity=1.425 Sum_probs=464.9 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |+||||||++|++++++++|+++++||||||+|++.++++++|||+|||+++++|+|++||+.|++.++.||+|+|++|| T Consensus 1 m~~~~~Pl~~G~~~~~~~~d~~~~~pVN~~a~~~~~~~s~~~l~~tPGl~~~a~v~G~~RG~~~~~~~g~lY~V~G~~LY 80 (472) T protein:vir:17 1 MPIQQLPLMKGVGKDFRNADYIDYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGKLY 80 (472) T ss_pred CCeeeeeeccCceeeccccchhheeeeeeeeeccCCCcccceeecCCCceeeccCCccccceEEEeeCCeEEEEecceEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) ++++++|+|+|+|||||+||+++|+|+++|++++|+||+++++..+||+|+.||++|+++++||||+||||||++||+++ T Consensus 81 ~v~~~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~y~~~v~t~~~~~~d~~~~~~dlg~~~dv~f~dGyfV~~~~Gt~~ 160 (472) T protein:vir:17 81 KGESEVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) T ss_pred eeecceecccCcccEEEecCCcEEEEEECCceeEEEeeccchhhhccccccccccccccceeeeeeecceEEEeccCcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecccceeeeccccchhh Q lcl|NC_011802. 161 WFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCK 240 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv 240 (472) ||||+|+|++++++++.||+||++||+||+++++|++|||||++|||||+|||+++..+|||+|+||+|||+||||++|| T Consensus 161 ~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv 240 (472) T protein:vir:17 161 WFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCK 240 (472) T ss_pred EEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEeeCCCCCCcCceeecCcceeeecccCcchh Confidence 99999999999999999999999999999999999999999999999999999998888999999999999999999999 Q ss_pred eecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEecccc Q lcl|NC_011802. 241 TPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYDASSS 320 (472) Q Consensus 241 ~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD~~t~ 320 (472) |+++|++||||||++|+++||+++||||||||||+||++|++|+++|+++|++|+||||||+||+||||+||||||++|+ T Consensus 241 ~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~~Tw~yD~~t~ 320 (472) T protein:vir:17 241 TPFADSYAFISNPATGAPSVYIIGSGQVSPISSASIEKILRSYTADELADGVMESLRFDAHELLIIHLPRHVLVYDASSS 320 (472) T ss_pred hecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCCceeEeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEEEc Q lcl|NC_011802. 321 QNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESST 400 (472) Q Consensus 321 ~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~~~ 400 (472) +|||||+++++|++++|||++++|+++||+||||++||+||+||+|++||+|+|++|++++|++|++|+|||++|||+++ T Consensus 321 ~Wherw~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ld~~~~td~g~pi~~~~~~p~~~~~~~RV~d~el~~~t 400 (472) T protein:vir:17 321 ANGPQWCVLKTGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFTPLFKADNARVFDLEVESST 400 (472) T ss_pred cCceeeeeecCCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeEEEEecceeeCCCceEEEEEEeeeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCchhhee-eeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 401 GVAQYADRLFL-SATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 401 Gv~~~~~~~~l-~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) |++|.++++|| .||| |.+||+|+|+++|++|||++|++||||||||+||||||||++|+||+|++|||++| T Consensus 401 G~~~~adp~~l~~~sD-g~~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~v~f~~~~~~~~~~~l~~a~~~~e 472 (472) T protein:vir:17 401 GVAQYADRLFLSATTD-GINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKSPVTLSGCQIRIE 472 (472) T ss_pred CcccCCCceEEEcccC-CcccchhhhhhhccCcccccceeeeeeeeccccceEEEEEeecccceeeeeEEEeC Confidence 99999877665 5555 99999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=100.00 E-value=1.3e-216 Score=1204.22 Aligned_cols=469 Identities=59% Similarity=1.012 Sum_probs=457.7 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |++|||||++|++++++++|+++++||||||+||++++++++|+++||++++.+.+|++||+.|++.++.||+|+|++|| T Consensus 7 m~~~~ipl~~g~~~~~~~~d~~~~~PVN~~a~p~~~~~s~~~L~~~pG~~~~~~~~G~~RG~~~~~~~g~lY~V~G~~LY 86 (477) T protein:vir:35 7 MPKIQIPLAKGLVKDIKTADYIDALPVNMLATPKEVLNASGYLRSFPGIEKKQDAKGVSRGVHFNTKNNALYRVCGNTLY 86 (477) T ss_pred eeeeccccccccccccccccceeeeeeccceeeccccccccccccCCcceeeccCCccccceeEeecCCeEEEEecCeeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) |+++++|+|+|+|||||+||+++|+||++|++++|+||++++++.+ +.++.||++++++++||||+||||||++||+++ T Consensus 87 ~v~~~vG~I~gsg~VsMa~n~~~~aIv~~g~~~gy~y~~t~~~~~~-~~~~~~p~~~l~~~~~v~f~dGyfV~~~~gt~~ 165 (477) T protein:vir:35 87 RNDKEVADIAGMSRVSMSHSSHSQAICFEGKVKLYRYDGTEKALSN-WPKDKYPQYDLGEVIDVCRNRGRYIWLQKGGER 165 (477) T ss_pred eeeeeeeeecccccEEEeeCCcEEEEEECCcceeEEEecccceeee-cCccccCCccccceeEEEeeCceEEEeecCCCe Confidence 9999999999999999999999999999999999999999999987 455569999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccc-eeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecc-cceeeeccccch Q lcl|NC_011802. 161 WFITDLEDESHPDRYSA-EYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHA-SLMVQKGIAGTY 238 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~-fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~-~~~I~~Gca~~~ 238 (472) ||||+|+|+++++ +++ |||||++||+||+++++|++|||||++|||||+|||+++ ++|||+|++ ++|||+||||++ T Consensus 166 ~~iS~L~d~s~~d-~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~-f~~p~~r~~~~~mIq~Gcaa~~ 243 (477) T protein:vir:35 166 FGVTDLEDESKPD-RYQPFYRAESQPDGIVSVDAWRDLIVCFGSSSIEYFTLTGSAD-TSQPLYIHQAAYMIQAGIAGRD 243 (477) T ss_pred EEEeecCCccccc-cccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCC-CCcceeecCCceeeeecccCch Confidence 9999999999965 566 999999999999999999999999999999999999986 667999986 555899999999 Q ss_pred hheecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEecc Q lcl|NC_011802. 239 CKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYDAS 318 (472) Q Consensus 239 sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD~~ 318 (472) |||+++|++||||||++|+++||+++||||||||||+||++|++|+++|++.|++|+||||||+||+||||+||||||++ T Consensus 244 sv~~~~~t~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i~ay~~~e~a~af~~t~~~eGH~fy~LtfP~~Tw~yD~a 323 (477) T protein:vir:35 244 CKCRYQDKYAILSHQSTGQPAVYLIGAGEKNKISTATIDKIIRYYSADELAASFMESIRFDNHELLLLHLPKHTLCFDGS 323 (477) T ss_pred hhhhhCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCCcchhceeEEEEEeCCeeEEEEEcCCceEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEE Q lcl|NC_011802. 319 SSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVES 398 (472) Q Consensus 319 t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~ 398 (472) |++||||||.+++|++++|||++++|+++||+||||++||+||+||++.++|+|+|++|++++|++|+||+|+|++|||+ T Consensus 324 t~~w~e~W~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~~d~g~~i~~~~~~p~~~~d~~Rv~~~el~~ 403 (477) T protein:vir:35 324 ASHQYSQWSLLKSGFYDEPYRAIDFMFFDNQITVGDKKEGVLGHLIFNASNQYEQQTEHLLYTPMIKADNARLFDFELEA 403 (477) T ss_pred cccccceeeeeccCCccCceEEEEEEEeCCeEEEEEcCCCeEEEECCCCcccCCCccceEEecceeeCCCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 399 STGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 399 ~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) ++||+|.+++|||||||||++||||||+++|++|||+||++||||||||+||||||||++|+||+|++|+++|| T Consensus 404 ~tGvgq~~d~v~L~~sddG~~~~~~~~~~~g~~g~~~~r~~~~RlG~~r~~vgf~~r~~~~~pv~l~~~~~~~e 477 (477) T protein:vir:35 404 STGVAQIADKLFLSVTTDGINYSREQLIEQNSPFQYDKRILWRRIGRVRKNIGFKIRIITKSPVTLSDLSIRME 477 (477) T ss_pred ecCcCccCceEEEEEeccccccccceeecCCCccccccceeeeeeeeceeccceEEEEEecCCceeccceeEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=100.00 E-value=3.5e-212 Score=1179.87 Aligned_cols=468 Identities=63% Similarity=1.092 Sum_probs=460.9 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECccee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKLY 80 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~LY 80 (472) |+||||||++|++|+++++|+++++||||||+||++++++++||++||++++++++|++||+.|++++++||+|+|++|| T Consensus 1 m~~~q~pl~~g~~~~~~~~~~~~~lpvN~y~~p~~~~~ss~~lr~~PG~~~~~~~~g~~RG~~~~~~~~~lY~V~G~~Ly 80 (472) T protein:vir:10 1 MAIMQLPLLRGLGKARDDADYIDALPVNMLATPKPVLNASGYLRSFPGITHKAEVAGVSRGVQYNTHEKTVYRGLGNQLY 80 (472) T ss_pred CCceeeecccccccCccccCceeeeeeeeeeccccccccceeecccCCceeecCCCcccceeEeeeeCCeEEEEecceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEEecCCCe Q lcl|NC_011802. 81 KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDS 160 (472) Q Consensus 81 ~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~ 160 (472) |+.+++|+|+|+|||||+||+..++|+.+|++++|+||+++.++++|+.+..+++++++.++||||+||||||++||+++ T Consensus 81 ~v~~~vG~iagsg~VsMa~~~~~q~v~v~g~~~~y~y~g~~~t~~~~~~~~~it~~dl~~~~~v~~~dGyfV~~~~gt~~ 160 (472) T protein:vir:10 81 KGHKPIADLAGKGRISMAFSRNSQAVVAAGKMTLYRYDGTVKTLENWPKEKKYTQYDIGNVRDMCHLRGRYVWCKDGSDI 160 (472) T ss_pred EEEeeeeeecccccEEEEecCCceEEEEecceeEEEeccchhhhhhccccccCCccccCCceeEEEeCceEEEeecCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcccCCCCcCCccc-eeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCcccccee---cccceeeecccc Q lcl|NC_011802. 161 WFITDLEDESHPDRYSA-EYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVA---HASLMVQKGIAG 236 (472) Q Consensus 161 ~~iS~L~D~s~~~~~l~-fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r---~~~~~I~~Gca~ 236 (472) ||||+|+|+++++ +++ |||||++||+||+++++|++|||||++|||||+|||+++ |||+| +||+|||+|||| T Consensus 161 ~~iS~L~d~s~~~-~~~~FatAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~---fpf~r~~~~pg~~iq~Gcaa 236 (472) T protein:vir:10 161 FGVTDLEDESHPD-RYRALYRAESQPDGIIGIDSWRDFIVCFGASTIEYFSLTGAAD---GQSAIYAAQPALMVEKGIAG 236 (472) T ss_pred EEEeecCCcccCC-cccceeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCC---cceeeeccCccceeeecccC Confidence 9999999999975 566 999999999999999999999999999999999999997 99998 889999999999 Q ss_pred chhheecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCCeEEEEe Q lcl|NC_011802. 237 TYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPRHVLVYD 316 (472) Q Consensus 237 ~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~~Tw~yD 316 (472) ++|||+++|++||||||++|+++||+++||||||||||+||++|++|+++|+++|++|+||||||+||+||||+|||||| T Consensus 237 ~~sv~~~~~s~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i~~y~~~e~~dA~~~s~~~eGH~fy~LtfP~~Tw~yD 316 (472) T protein:vir:10 237 THCKTRLGDAHVIISHQATGAPSVFLINQAQATSIATATIEKILRSYTHDELASAVMETVRFDSHELVLIHLSRQVLCYD 316 (472) T ss_pred chhhhhhCceEEEEecCCCcceEEEEccCceEEEecCHHHHHHHHhCCcccccceeEEEEEeCCeEEEEEEcCCeeEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEE Q lcl|NC_011802. 317 ASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEV 396 (472) Q Consensus 317 ~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~l 396 (472) ++|++||||||.+++|++++|||++++|+++||++|||++||+||+||++.++|+|+|++|++++|++|++|+|||++|| T Consensus 317 ~at~~~~~~w~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~td~g~pi~~~~~tp~~~~~n~Rvfd~el 396 (472) T protein:vir:10 317 AAANQNGLQWSLLKTGFYHAPYRGIDFMFADHHLTCGDKNDSLLGQLDFASSAQYEKPQEHVLYTPLFKADNARVFDFEL 396 (472) T ss_pred ccCCccceeeeeeecCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcCcCcCCCCceeEEEeeccceecCCCeEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 397 ESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 397 e~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) |+++||++.+++|||||||||.+++++++++++.++.|++|++||||||||+|||||||+++++||.+++++|+|| T Consensus 397 ~~~tGvg~~~~~v~L~wSddg~~~~~~~~~~~~g~~~~~~r~~w~RlG~ar~~vgf~~rv~~s~pv~~~~~~a~~e 472 (472) T protein:vir:10 397 EASTGVAHIADRLFLSATADGLHFGREQMINQNAPFAYDRRILWRRMGRVRKNLGFKVRVITSSPVTLSGCQIRME 472 (472) T ss_pred EeeCCcCccCceEEEEEeccccccchhHHHhhcCccchhheeeeheeeccccccceEEEEEEecccccccceeeeC Confidence 9999999999999999999999988888999999999999999999999999999999999999999999999999 No 8 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=100.00 E-value=2.6e-186 Score=1038.05 Aligned_cols=441 Identities=20% Similarity=0.267 Sum_probs=406.3 Q ss_pred CceeeeeecccCccccccCC-eeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEEEECcce Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNAD-YIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYRVLGSKL 79 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d-~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~V~G~~L 79 (472) |++|+||+ |+|.+ +.. +.+|+||||||+|+|++|++++||++||+++|+++++++++-.| .+++.||+|+|++| T Consensus 1 m~~~~ip~--gsy~a--~~~~~daq~~VN~yp~~~e~g~ss~~l~~tPGl~~f~~~~~~~~~g~~-~~~g~ly~v~g~~L 75 (458) T protein:vir:10 1 MVQRQIPL--VATTA--EGDVSGQEILVNVYPRKSDGGKYPFTLRHTPGLAFFCELPTFPVMAMH-QNGSRAFAVTPRDM 75 (458) T ss_pred Cceeeece--eeeec--ccccccceeeeeeeeecccccccccceEecCCceeeecCCCCceeeEE-ecCCEEEEeeCceE Confidence 99999998 45642 222 22889999999999999999999999999999999776655444 67999999999999 Q ss_pred eee-----eeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeeeceeEEEE Q lcl|NC_011802. 80 YKG-----ETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRLRGRYAWS 154 (472) Q Consensus 80 Y~v-----~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~ 154 (472) |+| .+++|+|+|+|||||+||+++++|| +|+ .+|+||+++++++ +++|++|++. +||||+||||||+ T Consensus 76 Y~V~~~~~~~~iG~i~gsg~VsMa~ng~q~vi~-~G~-~gY~yd~at~~~~-~i~d~~~~~~-----~~v~~~dGy~V~~ 147 (458) T protein:vir:10 76 YEISKDGTYKRLGSVDFKGRVVMEDNGKQIVMV-DGE-KGYYYDSETEIVQ-EIKAEGFYPA-----STVTYQDGYFIFD 147 (458) T ss_pred EEEeCCceEEEEecccCceeEEEeeCCcEEEEE-ECC-eEEEEeecccEEE-eccCccccCc-----ceEEEeCcEEEEE Confidence 995 4689999999999999999877666 554 6999999888776 5788877665 8999999999999 Q ss_pred ecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceecccceeeecc Q lcl|NC_011802. 155 KDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMVQKGI 234 (472) Q Consensus 155 ~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gc 234 (472) +||+++||||+|+|++ +|+|+|||||++||+||+++++|++|||||++|||||+|||++| |||+|+|++|||+|| T Consensus 148 ~~g~~~~~is~L~d~s--~d~l~fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEvw~ntG~a~---fpy~r~~ga~i~~Gc 222 (458) T protein:vir:10 148 RKGTGQFFISELLDVA--FDPLDFATAEGQPDPLLAVLSDHREVFMFGQETIEVWYNSGAAD---FPFERNQGAFIEKGI 222 (458) T ss_pred eeCCCEEEEEecCcce--eCcceeeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCC---cceeecccceeeecc Confidence 9999999999999976 68999999999999999999999999999999999999999997 999999999999999 Q ss_pred ccchhheecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECC--CeE Q lcl|NC_011802. 235 AGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLP--RHV 312 (472) Q Consensus 235 a~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P--~~T 312 (472) ||++|||+++|++||||||. +||+++||||+|||||+||++|++| ++++|+||+|++|||+||+|||| ++| T Consensus 223 aa~~sv~~~~~t~~~l~~d~----~Vy~l~g~~~~rIST~aIE~~i~sy---~~~da~a~t~~~eGH~fy~LtfP~a~~T 295 (458) T protein:vir:10 223 GAPYSVAKTNNTVYFIGSDL----MIYQITGYTPVRISTHAVEQTLKGV---NLSDAFAYTYQSEGHLFYVLTIPGKNLT 295 (458) T ss_pred cCcchhhhhCceEEEEcCCe----EEEEecCceeEEeeCHHHHHHHhcC---ChhheEEEEEEecCeEEEEEECCCCCce Confidence 99999999999999999986 6999999999999999999999999 47789999999999999999999 589 Q ss_pred EEEecccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccCCCceEE Q lcl|NC_011802. 313 LVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKADNARCF 392 (472) Q Consensus 313 w~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~ 392 (472) ||||++|+|||| |+||.+ +|||++++|+++||++|||++||+||+||+++++|+|+|++|++++|++|++++|++ T Consensus 296 w~yD~~t~~Whe----r~Sg~~-~~~Ra~~~v~~~g~~~vGD~~ng~ly~ld~~~~td~g~~i~~~~~~p~~~~~~~rl~ 370 (458) T protein:vir:10 296 WCYDISSGSWHV----RQSYQF-DRHVSNNSIYFDQKTLVGDFQNGRIYIMADNYYTDDGDPVVREFILPVVNNGREFLT 370 (458) T ss_pred eEEeccccccee----eccCCC-CceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeeeeeeccceeCCCCeEE Confidence 999999996555 667544 699999999999999999999999999999999999999999999999999999875 Q ss_pred --EEEEEEEcCCCCC-----chhheeeeccC-ccccCcceee-ccCCCcccceeEEEEeeEecccceeEEEEEEecCcce Q lcl|NC_011802. 393 --DLEVESSTGVAQY-----ADRLFLSATTD-GINYGREQMI-EQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVT 463 (472) Q Consensus 393 --~~~le~~~Gv~~~-----~~~~~l~~sdD-G~~~~~~~~~-~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~ 463 (472) ++|||+++||++. +|++||.|||| |.|||+++++ ++|++|||+||++||||||+|+|| |||||++|+|++ T Consensus 371 ~~~~el~~~tGvg~~~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~rv-f~v~~s~p~~~~ 449 (458) T protein:vir:10 371 VDSLELDLSSGVGLTVGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQFT-FKVEISDPIPVD 449 (458) T ss_pred EEEEEEEEecceeeeeCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhhhhhhhhccCcceE-EEEEEecchhhc Confidence 8999999999953 68899999998 9999999999 689999999999999999999999 999999999999 Q ss_pred EEEeEEEeC Q lcl|NC_011802. 464 LSGCQIRLE 472 (472) Q Consensus 464 l~~~~~~~e 472 (472) |++||++|. T Consensus 450 l~ga~~~~r 458 (458) T protein:vir:10 450 IGGAWVEVR 458 (458) T ss_pred ceeeeEEeC Confidence 999999999 No 9 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=99.94 E-value=3e-26 Score=160.48 Aligned_cols=436 Identities=13% Similarity=0.131 Sum_probs=260.3 Q ss_pred Cceeeeeecc--cCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceee-eecCCCccceeeeec-cCeEEEEEC Q lcl|NC_011802. 1 MPIQQLPMMK--GMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKR-NDVNGISRGVEYNTA-QNAVYRVLG 76 (472) Q Consensus 1 M~~~~vPl~~--G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~-~~v~g~~rg~~~~~~-~~~lY~V~G 76 (472) |+..+.-+.. |.-+|..+.| +|.+..-+..-.....+.....||.++. +.+..+++|+..-.. ++....+++ T Consensus 1 ~~~~~~~~~~~~g~~~d~~p~~----lp~~a~s~~~N~~~~~~~~~~~~g~~pv~a~~~~~~~g~~~~~~~g~~~~~~~~ 76 (513) T protein:vir:88 1 MALERQEVKNPTGIVTDIAPAD----LPLDKWSFGNNVRFKNGKAQKALGHSPIFDTAQAPILDMFPFIRNNIPYWLLCS 76 (513) T ss_pred CCcCChhhcccccceeccChhh----cCCCcceeeeeeeEecceeeecCccceeeecCCCCceeeeeeecCCCeEEEEee Confidence 6655544422 1112222221 2333222223334455556777999998 678778888653222 345677788 Q ss_pred cceeeeeee--EEcccC-------ceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccccceeeee Q lcl|NC_011802. 77 SKLYKGETV--VGDVAG-------SGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGSVRDITRL 147 (472) Q Consensus 77 ~~LY~v~~~--iGtv~g-------sg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~~~dv~~~ 147 (472) +++|...+. ..+|++ +.|.+++-=++.++ ..+|..+-..+|++..+++- .+++| ....++.+... T Consensus 77 ~~~~~~~~~~t~~dvs~~~~~~~~~~~w~~~~f~~~i~-a~ng~~~~q~~~~~s~~f~d---l~g~p--~~~~a~~i~v~ 150 (513) T protein:vir:88 77 EKRLYLADGTTIIDVSPGPYSASVTNRWSVGSFNGVIF-ANDGVNPPHHLPPTESVFRV---LPNFP--ANTTFRRLKSF 150 (513) T ss_pred ceEEEEecCceeeeccccceeecccCceeeeeecCEEE-EEcCCCcceEEcCCCceeee---ccCCC--cccceEEEEEE Confidence 877753221 222332 23566776555544 45777776778876555542 22332 23456677767 Q ss_pred ceeEEEE-e-cC----CCeEEEEcccCC----CCcCCccc-----eeEeecCCCceEEEEecCCEEEEEEcceEEEEEec Q lcl|NC_011802. 148 RGRYAWS-K-DG----TDSWFITDLEDE----SHPDRYSA-----EYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLT 212 (472) Q Consensus 148 dGyfv~~-~-~g----~~~~~iS~L~D~----s~~~~~l~-----fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~t 212 (472) .++.|.. . .+ .++++.|+++|+ ++|+..-. |--.-...+.||..++.++.+++|-+++|-.+..+ T Consensus 151 ~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~~t~~a~~~~l~d~~g~~v~g~~~g~~liif~e~~i~~m~y~ 230 (513) T protein:vir:88 151 KNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYI 230 (513) T ss_pred eeEEEEeecccCcCCCCceEEEecccCCcccccccccccccCcccccccCCCccceeeeeecccceEEEecccEEEEEec Confidence 6664432 1 22 578999999996 66642211 10111134789999999999999999999888878 Q ss_pred CCCCCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHHHHHH-hhcCchhhccE Q lcl|NC_011802. 213 GATTVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKII-RSYTADELATG 291 (472) Q Consensus 213 Ga~~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i-~~y~~~e~~~A 291 (472) |.+. .|.+.. ..-.+||.+|.|++++++.+|||++++ ||+++|.++++|+-..|++.+ ...+..-++.. T Consensus 231 g~~~--if~~~~---i~~~~G~~~p~SI~~~~~~~ffls~~G-----f~~~~G~~~~~Ig~ekVdk~f~~~~n~~~~~~~ 300 (513) T protein:vir:88 231 GGLY--IFQFQQ---LFNDVGILGPNCAIEFDGNHFVVGHGD-----VYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRT 300 (513) T ss_pred CCCc--eEEEEe---ecccccccCCceeEEECCeEEEEeCCc-----eEEecCceeeecccchhhhhhhccCCcccceEE Confidence 7653 566776 445899999999999999999999998 999999999999988898854 44443333333 Q ss_pred EEEEEEeCCEEEEEEECC----------CeEEEEecccccCchheeeeccC------c------ccc------------- Q lcl|NC_011802. 292 VMEALRLDSHELLIIHLP----------RHVLVYDASSSQNGPQWCVLKTG------L------YDD------------- 336 (472) Q Consensus 292 ~~~~~~~~GH~fy~lt~P----------~~Tw~yD~~t~~w~~~w~~~~tg------~------~~~------------- 336 (472) .+..-. =++-+++.+| ++..+||-.+. +|+.++-. + ... T Consensus 301 ~~~~d~--~~~~v~~~y~s~~~~~~~~~~~~lVYd~~~~----~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~~~d~~~ 374 (513) T protein:vir:88 301 FVLADH--VNTEMWVCYSSTRSEPGKHCDRAIIWNWKEN----TWSIRDLPNVLSGAYGIIDPKTSNLWDDDSNPWDTDT 374 (513) T ss_pred EEEEcC--cccEEEEEecCCCCCCCcccceEEEEEccCC----eEEEEeccchhhcccccccccccceecccccccccch Confidence 333322 2333444444 35689998888 45443210 0 000 Q ss_pred -ceEeeeEeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeeccccC-CCceEEEEE--EEEEcCCCCCchhheee Q lcl|NC_011802. 337 -VYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFKA-DNARCFDLE--VESSTGVAQYADRLFLS 412 (472) Q Consensus 337 -~~R~~~~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~~-~~~r~~~~~--le~~~Gv~~~~~~~~l~ 412 (472) .|............+.++++.|.++.+|.+. +..|.+++..+++|-+.. ++.++..+. +...++-+. -.+-|. T Consensus 375 ~~~~~~~~~~~~~sl~~~~~~~~~~~~fd~~~-~f~G~~lea~~~t~~~~~~~~~~~~~i~~v~~~~t~~g~--~t~~vg 451 (513) T protein:vir:88 375 SVWGEGSYNPAKSSMIFTSFQDAKLFLFGETS-TFSGQSFTSTLERSDIYLGDDRMMKTVSAVIPHITGNGV--CNIWVG 451 (513) T ss_pred hhhhccccccccceeEeeeccCCceeeecccc-cccCCceEEEEEecCccccCchhheeeeeeeeeeecceE--EEEEEe Confidence 1111111111234566778888899888664 679999999999988774 344332211 001111111 111233 Q ss_pred ecc--C-ccccCcceeeccCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 413 ATT--D-GINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 413 ~sd--D-G~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) ..+ + -.+|+.+..-....-+. +..|+-| |- ..||||+....|-.+.|..+++- T Consensus 452 ~~~~~~~~~~~s~~~~~~~~~~~~----~~~r~~g--Ry-~~~ri~i~~~~~w~~~G~~ve~~ 507 (513) T protein:vir:88 452 NAQVQGSGIRWKGPYPYRIGQDYK----IDTKHVG--RY-IALKFDFASAGDWYFNGYTLEMA 507 (513) T ss_pred eeccCccccccccceeeecccCce----EEeccCC--ce-EEEEEEccCCCceEEeeEEEEEe Confidence 333 3 67788775555544343 3333333 22 34888888888999999888887 No 10 >protein:vir:352 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:3197 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203466;genbank:gi:15320622;genbank:GeneID:921729 Probab=99.19 E-value=4.3e-10 Score=71.96 Aligned_cols=431 Identities=14% Similarity=0.131 Sum_probs=226.6 Q ss_pred Cceee-------------eeeccc-Cccc-----cccCCeeEEeeeeeeecccccCcccceeEcCCCceeeee-cCCCcc Q lcl|NC_011802. 1 MPIQQ-------------LPMMKG-MGKD-----FKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRND-VNGISR 60 (472) Q Consensus 1 M~~~~-------------vPl~~G-~~~~-----~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~-v~g~~r 60 (472) ||.+. ||-..| .-++ ..+.| +-...|+-|++ .+ +|-=-|-.++++ +.++.+ T Consensus 2 ~~~~a~r~~~~~~~~~~~~pAPv~G~~t~~~~A~m~~~~--A~vldN~fpt~------~g-~r~R~G~~~~at~~~~~v~ 72 (536) T protein:vir:35 2 MPLRARRVPPPPSIQEAHLPAPVGGLNTVSAGSAMPVSD--CLQGFNLIASE------LG-LRSRLGYREWCTGLGVPAR 72 (536) T ss_pred CccccccCCCCccceeeeeCccccceeccchhhcCCCCc--eEEEeecCCCh------hh-hhhhccchhHhcCCccceE Confidence 32211 111111 1111 11112 33444444443 11 222246666665 444444 Q ss_pred ceeee------eccCeEEEEECcceeeeeee---------EEcccCc---e-eEEEEcCCcEEEEEECCceeEEEEeccc Q lcl|NC_011802. 61 GVEYN------TAQNAVYRVLGSKLYKGETV---------VGDVAGS---G-RVSMAHGRTSQAVGVNGQLVEYRYDGTV 121 (472) Q Consensus 61 g~~~~------~~~~~lY~V~G~~LY~v~~~---------iGtv~gs---g-~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~ 121 (472) ...-- --++.||.+.+..+|.+++. -|.=.|+ + -|-++.-+...++..||.-...+|++++ T Consensus 73 s~~~~~~~~~~Ga~~klf~at~~~i~dvT~pa~p~~~~~~~g~~~g~~~~w~~v~~~~~gG~~l~~~nG~~~~~~~~gt~ 152 (536) T protein:vir:35 73 STLPFAGSAKSGAANRLFQTTSEGIWDVSASSQTPTQVLTFGDQTGDAGFGVSHAFVTQRGHFLFYADETNGLFRYSEST 152 (536) T ss_pred EeeeeeeccccCcceeEEEecccceeeeecCCCCcceEEEeccCCCceeeEEEEEecCCCceEEEEEEcCCCceEeeccc Confidence 33211 02358999999999988752 1110121 2 1323333344466668888888999998 Q ss_pred hhhcc--cccc-ccccCCcccccceeeeeceeEEEEecCCC-eEEEEcccCCCCcCCccceeEeecCCCceEEEEec--- Q lcl|NC_011802. 122 KTVSN--WTAD-SGFTQYELGSVRDITRLRGRYAWSKDGTD-SWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTW--- 194 (472) Q Consensus 122 ~t~s~--~~~d-~~f~~~~~~~~~dv~~~dGyfv~~~~g~~-~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~--- 194 (472) ..-++ ..+- ..-+|-|..+..-++..+-|..|.+.++- -||.= .+..+-.-..++-..-=-.-.-|++..+| T Consensus 153 ~~w~~v~~~t~~~~i~Gv~~~~l~~i~~~knRLffvq~~s~~awYLp-~~av~G~A~~f~lg~~~~~GGsL~~~~sWS~~ 231 (536) T protein:vir:35 153 DTWTAVAQGTGVGEIDGVNPANIVFVAVFKQRVWLVERDTARAWYLP-AGAIAGTAQPFEMGAQFRAGGHLVGLWNWTYD 231 (536) T ss_pred CchhhcccCCcccccCCCCcccceeeeeEeeeEEEEEeCCceEEEee-cccccceeeeeeccCccccCceEccceeeccc Confidence 43332 1111 11334455666777777788666644443 34332 22111100011111000001124444443 Q ss_pred -----CCEEEEEEcceEEEEEecCCCCC-ccccceecccceeeec---cccchhheecCceEEEEEeccccccEEEEccC Q lcl|NC_011802. 195 -----RDFIVCFGSSTIEYFSLTGATTV-GAALYVAHASLMVQKG---IAGTYCKTPFADSYAFISHPATGAPSVYIIGS 265 (472) Q Consensus 195 -----~~~l~lfG~~T~Evw~~tGa~~~-~~fpy~r~~~~~I~~G---ca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g 265 (472) .+++. |=+...||+-..| ++| .+=.+.-.. .-.+| -.|+.|..+++.-+.+++.++ +-.|+. T Consensus 232 ~G~Gl~d~~V-fvSs~GeVaVyqG-sdPs~s~~Wsl~g--iy~IG~~pp~G~r~~i~~G~Dl~iit~dG-----ivplsq 302 (536) T protein:vir:35 232 GGAGMDDSLV-AISGGGDVAIWQG-TDPASSATFGLRG--VWSLGGSPPAGRRIATDYGGDVLVLSRLG-----VRPLSR 302 (536) T ss_pred cCCCcceeEE-EEecCCcEEEEec-CCCCcccceeEEE--EEEeccCCCCCceEEEeecCeeEeeecCC-----ccchhh Confidence 23333 3334467777776 332 111222221 23457 578899999999999999998 444444 Q ss_pred c-cceecC----CHHHHHHHhhcCchhhccEE---EEEEEeCCEEEEEEECC------CeEEEEecccccCchheeeecc Q lcl|NC_011802. 266 G-QASPIA----TASIEKIIRSYTADELATGV---MEALRLDSHELLIIHLP------RHVLVYDASSSQNGPQWCVLKT 331 (472) Q Consensus 266 ~-q~~rIS----T~~iE~~i~~y~~~e~~~A~---~~~~~~~GH~fy~lt~P------~~Tw~yD~~t~~w~~~w~~~~t 331 (472) - |-.+.+ |.+||..++.+-... .... +--|-.+.+ +++.+| ++|++++..|+ .||+- T Consensus 303 ~~q~d~~a~~~it~~I~~~~~~~v~~~-a~~~gWq~~~~P~~n~--liV~~P~~~g~~~~~fV~N~~tg----aW~~f-- 373 (536) T protein:vir:35 303 LVAGEVDKDTYVTAKVSNLFSALMLTR-ASLPGWSMQLHPEDNA--LLVTVPTYPGQPTEQLVMALAGR----AWFRY-- 373 (536) T ss_pred hhhhhhhcccCCCccchhhHHHHHhhc-cCCCccEEEEccCCCe--EEEEccCCCCCCceEEEeecccC----ceeee-- Confidence 3 322222 455776665432111 1111 223333444 556665 37999999999 55532 Q ss_pred CccccceEeeeEeecCCeEEEEEccCCeEEEEcC------CccCCCCCEEEEEEeeccccCC--C------ce------- Q lcl|NC_011802. 332 GLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQF------DISSQYDKQQEHLLFTPIFKAD--N------AR------- 390 (472) Q Consensus 332 g~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ld~------~~~~d~g~p~~~~~~tP~~~~~--~------~r------- 390 (472) ..|-+.|...+.++..+|.- +|++|+.|- ....+.|+||...+....-|.- . +| T Consensus 374 ----tgw~a~C~~v~~~~LyFG~~-dG~v~~~da~v~g~D~~~~~ag~~I~~~~~~af~~~G~~~~K~~~~~r~~~~s~~ 448 (536) T protein:vir:35 374 ----RDLPIYSSAVWGGKLYFGTV-DGRVCVNDGYVDGVLLSEPSAFTPVQWSLLSAFTNLGSARQKQVQLLRPTLLSES 448 (536) T ss_pred ----cCCcceEEEEecCeEEEeec-CCEEEecccccCccccccCcCcceeeeccccchhhcCchHHHHHHHhhhhhhhcc Confidence 48999999999999999987 999999773 2234468888877655443221 0 12 Q ss_pred ------E-EEEEEEEEcCCCCCchhheeeeccCccccCcceee--ccCCCcccceeEEEEeeEecccceeEEEEEEecCc Q lcl|NC_011802. 391 ------C-FDLEVESSTGVAQYADRLFLSATTDGINYGREQMI--EQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSP 461 (472) Q Consensus 391 ------~-~~~~le~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~--~~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~ 461 (472) + .++++++ +..+|..-++ -.+.+|....+= ... ++|.-|=+|+-++..-..+...+|-.+..+ T Consensus 449 ~~p~l~l~~~~d~D~----~~p~~~~~~~--~~~~~Wd~s~Wd~~~Ws--~~~~v~~~~~s~~g~G~~is~~~~g~a~~~ 520 (536) T protein:vir:35 449 ATPSYEVQARYRYDF----AELAPVSAMG--GGSGTWDGSTWDVDVWS--GEYQASQQVRGGTGVGVDLAIAIRGTAVAR 520 (536) T ss_pred CCceEEEEEEEEecc----CCCCCcCCCC--CCcccCCcccCCceecC--CcceeEeeeeEeccceEEEEEEEeeccccc Confidence 1 1233332 2222211111 113344444332 222 467777788888888888889999888889 Q ss_pred ceEEEeEEEeC Q lcl|NC_011802. 462 VTLSGCQIRLE 472 (472) Q Consensus 462 ~~l~~~~~~~e 472 (472) ..|-.+.+..| T Consensus 521 ~~~~~~d~~~e 531 (536) T protein:vir:35 521 TVLVGIDILFT 531 (536) T ss_pred eEEEEEEEEEe Confidence 99999999999 No 11 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=98.92 E-value=1.3e-08 Score=63.84 Aligned_cols=422 Identities=15% Similarity=0.111 Sum_probs=202.8 Q ss_pred Cc----e--eeeeecccCcc-ccccCCeeEEeeeeeeecccccC-------cccc------eeEcCCCceeeeecCCCcc Q lcl|NC_011802. 1 MP----I--QQLPMMKGMGK-DFKNADYIDYLPINMLATPKEVL-------NSSG------YLRSFPGIAKRNDVNGISR 60 (472) Q Consensus 1 M~----~--~~vPl~~G~~~-~~~~~d~~~~~pvn~~~~~~e~~-------~s~~------~Lrs~PGl~~~~~v~g~~r 60 (472) .. . =+.|--+-++- ..++.++.--...|.....-..+ .+.| |-+.--+|+.-.++.|+.| T Consensus 222 iV~~y~a~~g~~pS~sd~~N~a~~k~~~~Ei~t~~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~ 301 (771) T protein:vir:95 222 IVTFRSAASGKFPSNSDSVNLALSKRADVEPSTTDRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYP 301 (771) T ss_pred ceEeeeccCCCCcCCceeeccccchhhccceeeecccchhhhhhcccCcccccCcceeeehhhhcccccceeeeccccch Confidence 11 1 11111111111 12223222111111111111111 1111 1122223333333333333 Q ss_pred ceeeeeccCeEEEEECcceeeee-----eeEEcccCceeEEEEcCCcEEEEEECC---ceeEEEEeccchhhcccccccc Q lcl|NC_011802. 61 GVEYNTAQNAVYRVLGSKLYKGE-----TVVGDVAGSGRVSMAHGRTSQAVGVNG---QLVEYRYDGTVKTVSNWTADSG 132 (472) Q Consensus 61 g~~~~~~~~~lY~V~G~~LY~v~-----~~iGtv~gsg~VsMa~Ng~~~~iv~~g---~~~~Y~~d~~~~t~s~~~~d~~ 132 (472) -.-++.. .|=+.. ..+.. +.|||.- +|+.-+.|=.+. +++-|.+ ..+-++.- T Consensus 302 s~~~~~~----------~l~~~~t~~~~~~vae--yagRvwY-ag~~~~~iD~dkng~~~~~~il-------fSqLv~s~ 361 (771) T protein:vir:95 302 SLSFGVS----------SLPQDETPGGASVVCE--YAGRVWY-AGFSGQIIDGDDQSPRLVSYIL-------FSQLVDSP 361 (771) T ss_pred hhhcccc----------ccccccCCCCceeEEe--eeeeEEE-ecceeEEeeccccCCceeeeEe-------eehhhcch Confidence 3211110 011111 11222 5566652 333333322111 1112211 11112211 Q ss_pred ccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEec Q lcl|NC_011802. 133 FTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLT 212 (472) Q Consensus 133 f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~t 212 (472) .|+++ ||+|+ |-| .=-+++|.|. | .-|-+-|+. -.|+.++.++..|++|+++.+ |... T Consensus 362 ---~di~n----CyQd~------DPT-see~~dLidT---D--Gg~iri~ga-h~ii~Lv~f~~sLlvfc~NGV--WAi~ 419 (771) T protein:vir:95 362 ---ADIVN----CYQDG------DPT-STEEPELVDT---D--GGFIRIEGA-HDIINLVNVGSAVMVVAANGI--WMIQ 419 (771) T ss_pred ---hhccc----ccccC------CCc-hhhhhhhhhc---C--CCEEEecCC-CCceeEEEecceEEEEEecce--EEEE Confidence 23222 45543 122 1234555432 3 346666775 689999999999999999998 9995 Q ss_pred CCCC----CccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhh Q lcl|NC_011802. 213 GATT----VGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADEL 288 (472) Q Consensus 213 Ga~~----~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~ 288 (472) |.++ ...|...+. =++||-+|.|+.-+|++++|.|.++-.++.+..++-+.+|-|+...||+.+++...+-+ T Consensus 420 ggsd~g~tAtdY~ltKI----s~vg~sspnSvVvvg~~i~ywsdtgIyal~~Ndfn~~tAqnLTekTIq~~~~~I~~dk~ 495 (771) T protein:vir:95 420 GGSDYGFTATNYLVTKI----SEHGCSSPNSVVVVDNSFMYWGDDGIYHLTRNQYGDYVANNLTEKTIQKYYEKIPSDAI 495 (771) T ss_pred eccCCceeeeeeEEEEe----eeeccCCCccEEEecceEEEeeCCceEEEeecccCcchhhccchHHHHHHHhhcchhhh Confidence 5444 122333443 25899999999999999999999998888889999999999999999999999996544 Q ss_pred ccEEEEEEEeCCEEEEEEECCCe---------EEEEecccccCchheeeec-cCccccceEeeeEeecC------CeE-- Q lcl|NC_011802. 289 ATGVMEALRLDSHELLIIHLPRH---------VLVYDASSSQNGPQWCVLK-TGLYDDVYRAIDFMYEG------NQI-- 350 (472) Q Consensus 289 ~~A~~~~~~~~GH~fy~lt~P~~---------Tw~yD~~t~~w~~~w~~~~-tg~~~~~~R~~~~~~~~------g~~-- 350 (472) ..+-.+.-..|++ |.+.+|++ -+++|++++-..+ |.+.. ++..+..--+.-++.+. .+. T Consensus 496 knVtg~fd~~e~r--vyw~yPn~~D~~~e~~t~LV~dLalgaFYp-~~i~~~~ag~l~~~vg~~~~p~~~lv~T~~eV~v 572 (771) T protein:vir:95 496 LNATGFYDSYDKK--VKWLYNTVLDGRTEPVTELVFDLALGAFYP-SKIGSLTAGRLPIPVGSVKIPPYKLVETGEEVTV 572 (771) T ss_pred cceEEEEEccCCE--EEEEecceecCCCcceeeeeeeeccccccc-ccccccccCccceeeeeeecCccccccccceEEe Confidence 4444555566888 44556632 3999999987765 42222 11111111111111100 011 Q ss_pred ---------------------------EEEEccCCeEEEEcCCccCCCC-------------------------CEEEEE Q lcl|NC_011802. 351 ---------------------------TCGDKSEAVTGQLQFDISSQYD-------------------------KQQEHL 378 (472) Q Consensus 351 ---------------------------~vGD~~~g~l~~ld~~~~~d~g-------------------------~p~~~~ 378 (472) +.--...|.-+++.|..+++.+ .-+-+. T Consensus 573 ~~~~v~~tG~~vtV~~~~r~~~~~~~~y~~~~~dg~~g~~~Fa~~~~~~f~DW~sv~~~~vdy~sy~~~gY~~~gd~~~~ 652 (771) T protein:vir:95 573 ASEQVTATGELVTVKVSTRSPVIRETKYIIVEKLSSPMRISFGGYTDEEFVDWKSVDGIGVDAPAYLLTGYLAGGDYQRE 652 (771) T ss_pred cceeeEecCCceEEEEEEeeccccceEEEEEEecCCCeeEEeccccCcceeecccCCCcccchHHHHHhhhhccchheee Confidence 1111223333444444444322 222223 Q ss_pred EeeccccCCCceEEEEEEEEEcCCCC-Cchh--he---eeeccCccc--cCcceeec--------c-CCCccccee---- Q lcl|NC_011802. 379 LFTPIFKADNARCFDLEVESSTGVAQ-YADR--LF---LSATTDGIN--YGREQMIE--------Q-NEPFVYDKR---- 437 (472) Q Consensus 379 ~~tP~~~~~~~r~~~~~le~~~Gv~~-~~~~--~~---l~~sdDG~~--~~~~~~~~--------~-g~~g~~~~r---- 437 (472) ...|++++.=..--|=-||-..|-=. +++- || .+||-++++ |++.+.+= . +.---|+.- T Consensus 653 k~~PYit~y~~~tedg~v~~~~g~~~p~n~sSclm~~sw~ws~s~~t~k~~~~~eaYk~~~~~~p~~~~~~~yp~~~VV~ 732 (771) T protein:vir:95 653 KFVPYITFHFKKTEDGFVEDAEGDWTPTNQSSCMVQSQWSWTNSPASNKWGRTWQAYRFRRHFFPDNIDNQFDDGNSVVE 732 (771) T ss_pred eccceEEEEEEeecccceecccccccccCCcceEEEEEeeeecCCCCCccccchheeeecceeccCCcchhcCCccceee Confidence 33355443211111223555555111 1221 44 567778665 77765541 1 111112211 Q ss_pred EEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 438 VIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 438 ~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) -+.|--|+.|.- .|+|.--.+-..-|.|-+|=++ T Consensus 733 TKsriRG~Gr~~-~~rf~s~~gKdlhl~Gysil~~ 766 (771) T protein:vir:95 733 TKSRLRGSGKVL-SLYITTEPKKNLHIYGWSMLVD 766 (771) T ss_pred eeheeeecceEE-EEEEEecCCcceEEEeEEEEEe Confidence 234445654444 4677666777788889888888 No 12 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=98.88 E-value=2.1e-08 Score=62.71 Aligned_cols=432 Identities=15% Similarity=0.137 Sum_probs=178.0 Q ss_pred Ccee--eeeecccCccccccCCee-EEeeeeeee-----cccccC--cccceeEcCCCceee--eecCCCccceeeeecc Q lcl|NC_011802. 1 MPIQ--QLPMMKGMGKDFKNADYI-DYLPINMLA-----TPKEVL--NSSGYLRSFPGIAKR--NDVNGISRGVEYNTAQ 68 (472) Q Consensus 1 M~~~--~vPl~~G~~~~~~~~d~~-~~~pvn~~~-----~~~e~~--~s~~~Lrs~PGl~~~--~~v~g~~rg~~~~~~~ 68 (472) +... --|..+| ++-+|.. -..--|+|- ..+..+ .-+|..+--|=---| ++++ |+..+.|+.+. T Consensus 177 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 251 (911) T protein:vir:31 177 IRTRELLTPYTTG----TNYGDTLTPEEEWNLYNSGWATITRATKDKSGSGTVYVNPVQYYFDKRGVY-PSHSVLYNSMK 251 (911) T ss_pred eeehhhccccccc----cccCcccCchhhcccccccceeeeeecccCCccceEEEchhheeecccCcC-cchhhhhhhhh Confidence 1111 1122222 1111110 000011110 000011 111222211111111 2222 33333333222 Q ss_pred C----e-----EEEEE-CcceeeeeeeEEc-ccCcee-EEEEcCCcEEEEEE--CCceeEEEEeccchhh-------ccc Q lcl|NC_011802. 69 N----A-----VYRVL-GSKLYKGETVVGD-VAGSGR-VSMAHGRTSQAVGV--NGQLVEYRYDGTVKTV-------SNW 127 (472) Q Consensus 69 ~----~-----lY~V~-G~~LY~v~~~iGt-v~gsg~-VsMa~Ng~~~~iv~--~g~~~~Y~~d~~~~t~-------s~~ 127 (472) . . +|.-+ ..|+- .|+ .+--|| +--+...++-+|.+ -|.++-=.-|++++-- +|- T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~e~~np 326 (911) T protein:vir:31 252 QESAKEIVALNVFSPWADEKIN-----FGTTTPPLGRYIHSAYYFDSAAILSLGIGNLTPPTSDGTTEGSGPAEEEISNP 326 (911) T ss_pred hhccceeEEEeeeccccccccc-----cccCCCchhhhhhhheeeccceeeeecccccCCCCCCCccCCCCCchhhhcCC Confidence 1 0 00000 00000 111 011111 11111122212111 1111111111111100 000 Q ss_pred cccccccCCccc----------------ccceeeeeceeEEE-Ee--cCCCeEEEEcccCCCCc-------CCccceeEe Q lcl|NC_011802. 128 TADSGFTQYELG----------------SVRDITRLRGRYAW-SK--DGTDSWFITDLEDESHP-------DRYSAEYRA 181 (472) Q Consensus 128 ~~d~~f~~~~~~----------------~~~dv~~~dGyfv~-~~--~g~~~~~iS~L~D~s~~-------~~~l~fatA 181 (472) +..... +.+. .++.+.+..|+-.+ .+ .|..++++|.|.+--.. -|+..-..+ T Consensus 327 ~gl~~i--gt~~n~k~~a~~~~~~~~~~r~r~~~~yaGRVfyaD~dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~ 404 (911) T protein:vir:31 327 IGLDNI--GTVNNLKLIAEGTVRWTVKDRPRCSGYHNGHVYFGDRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEIN 404 (911) T ss_pred CCcccc--cchhceeeeeccceeeeecccccceeeeccEEEEeeeccCcceeEEEEeeccccccccccccCCCccccccc Confidence 000000 0011 12444577788333 33 34668999998632110 122222222 Q ss_pred e-----------cCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccc---cceecccceeeeccccchhheecCceE Q lcl|NC_011802. 182 E-----------SQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAA---LYVAHASLMVQKGIAGTYCKTPFADSY 247 (472) Q Consensus 182 E-----------~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~f---py~r~~~~~I~~Gca~~~sv~~~~~s~ 247 (472) | ...++|+.++.++.-|++|+++++ |...|.+ +..| -|.-+.= -++||..|.|+.-+++.+ T Consensus 405 DLIdTDGg~vri~gah~Ii~LV~~G~sLlVFcaNGV--WAI~G~d-~~g~TATdy~ItKI--sdvGcsspNSVVvVgn~i 479 (911) T protein:vir:31 405 DLIATDGFTMYPVGMGAPITMVEFNKRLLLLCTNGV--WAIRGTS-GGGATATDFTLDKV--ASVEFNSPQSVVDIGTAI 479 (911) T ss_pred hhhhcCCcEEecCCCCCceEEEEecCeEEEEEeCcE--EEEeccC-CCceeeeeeEEEEE--eeeeeCCCCeEEEecCce Confidence 2 113589999999999999999998 9998754 2233 3333322 346999999999999999 Q ss_pred EEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC-------------eEEE Q lcl|NC_011802. 248 AFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR-------------HVLV 314 (472) Q Consensus 248 ~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~-------------~Tw~ 314 (472) +|+|+.+-.++.+.+++-+.++-++-..|+..+.+.+++.+..+-++....|++.|| .+|+ +.++ T Consensus 480 ~fWSd~GIyaLganqfnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW--~yPn~lDe~teykt~~~~ILV 557 (911) T protein:vir:31 480 VFWSERGIIAIGVNDFGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYW--VVPNKQDSNGEYKTDGELVLV 557 (911) T ss_pred EEeeCCcEEEEeecccCccccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEE--EecCccCCccceeecCceEEE Confidence 999999977777777788888888768899999999998888888888888888555 4553 4699 Q ss_pred EecccccCchheeeeccCccc-cceEeee--------EeecCCeEEEEEccCCeEEEEcCCccCCCCCEEEEEEeecccc Q lcl|NC_011802. 315 YDASSSQNGPQWCVLKTGLYD-DVYRAID--------FMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFK 385 (472) Q Consensus 315 yD~~t~~w~~~w~~~~tg~~~-~~~R~~~--------~~~~~g~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~~~tP~~~ 385 (472) ||++++.|++ |.. ++++.. -++|.-. ++-.-.+-++=++.+..+..--...++.-+.++-.+ T Consensus 558 fdLatgaFYP-wtv-s~gpLl~~p~y~Lv~TreEvtvPi~~etgaiIve~gsdPV~~tl~vdttGvDg~ayLl------- 628 (911) T protein:vir:31 558 LNLDTGGFYK-HTV-SGGPLLHAPFRRLVNTRAEVSIPITETDGTVITDTLGDPVTVTRTVTTTGVDGLAYFA------- 628 (911) T ss_pred EEeccCcccc-eee-ecceeecccccccccccccceeeEEeecceEEEecCCCCeEEEEeeecccccceeEEE------- Confidence 9999999985 643 443222 1111100 000001112222222222211111111111111100 Q ss_pred CCCceEEEEEEEEEcCCC-------CCchhheeeecc-------------C-ccccCcceee-ccCCCc---ccceeEEE Q lcl|NC_011802. 386 ADNARCFDLEVESSTGVA-------QYADRLFLSATT-------------D-GINYGREQMI-EQNEPF---VYDKRVIW 440 (472) Q Consensus 386 ~~~~r~~~~~le~~~Gv~-------~~~~~~~l~~sd-------------D-G~~~~~~~~~-~~g~~g---~~~~r~~~ 440 (472) -+--|+. .--|.-|+.|-- | |..|- |.|+ ..+.|- -|.+-++. T Consensus 629 -----------~frdg~~g~~~f~a~~~~~~~~dw~~~~~~~~~~y~s~~~~~y~~~-~~~~~~~~~pyi~sy~~~~~rv 696 (911) T protein:vir:31 629 -----------SFDDGVNGQFNFIAEHQPWGFADWANVPNMTRVNYSSYVDFAYEYP-EVMIGNISLPYIHSYYLTGIRV 696 (911) T ss_pred -----------eeccCCcceEEEEEeecCCeeeccccCccccccchhHHHHhhhhhh-hhhhhcccCceeeeeeeeeeEE Confidence 0001111 111222333321 0 11111 1111 112221 23444455 Q ss_pred EeeEecccceeEEEEEE-ecCcceEEEe---EEEeC Q lcl|NC_011802. 441 KRVGRIRRLIGFKLRVI-TKSPVTLSGC---QIRLE 472 (472) Q Consensus 441 ~rlG~~r~~v~f~~r~~-~~~~~~l~~~---~~~~e 472 (472) +--|++.+-.+++|.-. ..--++|.-. +++|- T Consensus 697 ~~~~y~~~~a~~~f~~~~~~~~~~~~~~~~~~~~~~ 732 (911) T protein:vir:31 697 QTEQYTTETAHLSFHRVQAHQTTALGTVTFHKVDMM 732 (911) T ss_pred eccceeeecccceeEeeecccceeeeeeeeeeeeeh Confidence 55555555544444321 1112222111 11111 No 13 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=97.87 E-value=1.4e-05 Score=47.22 Aligned_cols=443 Identities=16% Similarity=0.177 Sum_probs=191.9 Q ss_pred CceeeeeecccCc-----------cccccCCeeEEeeeeeeecccccC-----cccceeEcCCCceee----eecC---- Q lcl|NC_011802. 1 MPIQQLPMMKGMG-----------KDFKNADYIDYLPINMLATPKEVL-----NSSGYLRSFPGIAKR----NDVN---- 56 (472) Q Consensus 1 M~~~~vPl~~G~~-----------~~~~~~d~~~~~pvn~~~~~~e~~-----~s~~~Lrs~PGl~~~----~~v~---- 56 (472) --.+++-++-|.- -..+. .....-|..++-+.+|-+ ...+.++-..|.+.- ..++ T Consensus 129 h~~v~v~~~~G~livanp~i~~~~~~~d~-~t~s~t~~~ll~r~r~f~~qg~d~~~g~~y~~~gt~~tn~~iynlyN~gw 207 (715) T protein:vir:26 129 EERVQVTSLNGYLIVASPAINTFYLGFNT-STEAFTATSISFKERDFEWQGSDVDVTSLYFGEGTSVSNQRIYDTYNVGW 207 (715) T ss_pred eeEEEEEEeeeEEEEecCCccEEEEEecC-CcceeEeeEEEEEeeeheeeccccccccccccCCcccCchhheeccccee Confidence 0011111111100 00000 111222222222222111 112222222222211 0000 Q ss_pred CCccce-eeeeccC-eEEEEECcceeeeeeeEEcccCce--eE----EEEcCCcEEEEEECCceeEEEEeccchhhcccc Q lcl|NC_011802. 57 GISRGV-EYNTAQN-AVYRVLGSKLYKGETVVGDVAGSG--RV----SMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWT 128 (472) Q Consensus 57 g~~rg~-~~~~~~~-~lY~V~G~~LY~v~~~iGtv~gsg--~V----sMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~ 128 (472) ++.+|- ..|..+. .||-..-.+-|..+.+-+-.+--. .. +.+.+| .|..|+..+.-+... T Consensus 208 ~~p~gt~~~N~~~~yiVypa~s~~~~S~kd~n~afsk~ad~ei~tGt~~~~~G------------~yi~D~~~~g~~~le 275 (715) T protein:vir:26 208 VGPKGSAALNTYGSYIVYPALTHPWYSGKDANGAFNKADWLEIYTGSSLASNG------------HYVLDVFNKARTGLT 275 (715) T ss_pred ecceeEEEEcCCCCceEecccccccCCCcccccccChhhccccccccccccCc------------eEEEeeeecCCccch Confidence 111111 1111111 122222222221111111000000 00 112222 233444333222110 Q ss_pred ccccccCCcccccceeeeeceeEEEE----ecCCCeEEEEccc-------------CCCCc------CCccceeEeecCC Q lcl|NC_011802. 129 ADSGFTQYELGSVRDITRLRGRYAWS----KDGTDSWFITDLE-------------DESHP------DRYSAEYRAESQP 185 (472) Q Consensus 129 ~d~~f~~~~~~~~~dv~~~dGyfv~~----~~g~~~~~iS~L~-------------D~s~~------~~~l~fatAE~~p 185 (472) . . -....++.+|.-.|+-.+. .+...++.+|.|- ||++. +--.-+-+-|+. T Consensus 276 e--e---v~k~R~rsv~~yaGrV~yagiD~dkng~rilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~ga- 349 (715) T protein:vir:26 276 T--E---VETGRFRSVAAYAGRVFYAGIDSAKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDA- 349 (715) T ss_pred h--h---hhcCCCcceeeecceEEEeecccccCCCeEEEehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCC- Confidence 0 1 1144567788888884443 3345589999886 23221 000234455553 Q ss_pred CceEEEEecCCEEEEEEcceEEEEEecCCC---CCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEE Q lcl|NC_011802. 186 DGIIGIGTWRDFIVCFGSSTIEYFSLTGAT---TVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYI 262 (472) Q Consensus 186 D~iv~~~~~~~~l~lfG~~T~Evw~~tGa~---~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~ 262 (472) -.|+.++.++..|++|+++.+ |...|.. +...|...+. =++||-+|.|+..+|++++|.|.++-.+..+.. T Consensus 350 h~ii~Lv~f~~sLlvf~~NGV--WAi~G~d~g~tATdY~ltKI----s~vg~sspnSvVvv~~~i~~WsdtGIyal~~Nd 423 (715) T protein:vir:26 350 HNIRKLHVLGASLLVFAENGV--WAVAGVDNVFRATEYAITRI----SDVGLSNENSFVVADGIPIWWGKTGIYAVQQSE 423 (715) T ss_pred CCceeEEEecceEEEEEecce--EEEeccCCceeeeeeEEEEe----eeeccCCCccEEEecceEEEeeCCcEEEEEecc Confidence 579999999999999999998 9984321 1223444443 258999999999999999999999977777777 Q ss_pred -ccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC----------eEEEEecccccCchheeeecc Q lcl|NC_011802. 263 -IGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR----------HVLVYDASSSQNGPQWCVLKT 331 (472) Q Consensus 263 -~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~----------~Tw~yD~~t~~w~~~w~~~~t 331 (472) ++-+.+|-|+...||+.+++...+-+..+-.+.-..|++.||+ +|| -.+++|++++-..+ |.+..+ T Consensus 424 ~fn~~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~rVyW~--yPn~dt~vdykyd~vLV~dLalgaFYp-~~v~~~ 500 (715) T protein:vir:26 424 NLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWF--YPDNDESVDYKYNNILVMDLALQAFYP-WRVEDE 500 (715) T ss_pred ccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEE--EcCCceeeceeecCeEEEEeccccccc-cccccc Confidence 8889999999999999999999655555555555678885554 552 25889999987766 444332 Q ss_pred CccccceEeeeEe------ecCCeEEEEE--ccCC--------------------------eEEEEcCCccCCCCCEEEE Q lcl|NC_011802. 332 GLYDDVYRAIDFM------YEGNQITCGD--KSEA--------------------------VTGQLQFDISSQYDKQQEH 377 (472) Q Consensus 332 g~~~~~~R~~~~~------~~~g~~~vGD--~~~g--------------------------~l~~ld~~~~~d~g~p~~~ 377 (472) -......-+.... --+.|.+.|- ..+| .-++|.+..++++-=.-+. T Consensus 501 a~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~~~r~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~dw~ 580 (715) T protein:vir:26 501 ASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWG 580 (715) T ss_pred ccccceeeeeeeeCCcccccchhheeccceEEEeccceEEEEeecccccccceEEEEEEcCCceeEEEecccCceeeecc Confidence 1111111111100 0011221111 1122 2223333333332111111 Q ss_pred EEeeccc-------cCC-----CceEEE--------EEEEEEcCCCCCchh--he---eeeccCccc------cCcceee Q lcl|NC_011802. 378 LLFTPIF-------KAD-----NARCFD--------LEVESSTGVAQYADR--LF---LSATTDGIN------YGREQMI 426 (472) Q Consensus 378 ~~~tP~~-------~~~-----~~r~~~--------~~le~~~Gv~~~~~~--~~---l~~sdDG~~------~~~~~~~ 426 (472) ..-.|-+ ..| ++-+.- =-||-..|-.-+++- || .+||.-+++ |.+.+.+ T Consensus 581 s~d~~~~~~~gy~~~gd~~~~k~~pyvt~~~~~tedg~v~~~~g~~p~n~sSclm~~sw~ws~s~st~~eaYk~~~~~~~ 660 (715) T protein:vir:26 581 SADYKSFAEAGYDFMGDITTFKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV 660 (715) T ss_pred ccchhhHHHhhhhhcccceeeecCceEEEEEEEecccceeccCCccccCCcceEEEEEeeeccCCCChhhhheecceeee Confidence 1000000 000 111111 124444443222222 44 334433332 1111122 Q ss_pred ccCCCcc--cc--eeE-EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 427 EQNEPFV--YD--KRV-IWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 427 ~~g~~g~--~~--~r~-~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .-|.... |+ +-+ +.|.-|+.|.- .|+|.--..-..-|.|-+|=-- T Consensus 661 ~p~~~s~~~yp~~~VvTKsriRG~Gr~~-~~rf~s~~gKdlhl~Gysilg~ 710 (715) T protein:vir:26 661 NPNDLSSINYPTDTVVTKSKVRGRGRSM-KFRFESVAGKDFHLVGYEVIGA 710 (715) T ss_pred CCCccccccCCcceeEeeeeeeccceEE-EEEEEecCCcceEEEeEEEEec Confidence 2222222 11 111 22333443333 3555544555566666665444 No 14 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=95.01 E-value=0.003 Score=34.44 Aligned_cols=446 Identities=11% Similarity=0.047 Sum_probs=187.9 Q ss_pred Cceee-eeeccc-----CccccccCCee--EEeeeeeeecccccCcccceeEcCCCceeeeecCCC---ccceeeeeccC Q lcl|NC_011802. 1 MPIQQ-LPMMKG-----MGKDFKNADYI--DYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGI---SRGVEYNTAQN 69 (472) Q Consensus 1 M~~~~-vPl~~G-----~~~~~~~~d~~--~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~---~rg~~~~~~~~ 69 (472) |+.+- --+..| +....+.+-|. .....||++.|. |-++.=||+...+.+.++ .|-.-+..-.. T Consensus 1 m~~~~~~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~------G~~~rR~G~~~~~~~~~~~~~~~lipF~~s~~ 74 (594) T protein:vir:10 1 MADFSQTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQ------GSLITRCGSEEVGLCQDGEVRLFRLPAVDAPS 74 (594) T ss_pred CceeeccccCcceecceeccchhHHHHHHHHhhhhceEEEec------CCeecCChhHhhhhccCCCCCEEEEEEEeCCC Confidence 77632 222222 11111111122 346667776652 223333565554444322 12211211111 Q ss_pred -------------------eEEEEECcceeeeeeeEEc--ccCceeEEEEcCCcEEEEEECCcee--EEEEeccchhhcc Q lcl|NC_011802. 70 -------------------AVYRVLGSKLYKGETVVGD--VAGSGRVSMAHGRTSQAVGVNGQLV--EYRYDGTVKTVSN 126 (472) Q Consensus 70 -------------------~lY~V~G~~LY~v~~~iGt--v~gsg~VsMa~Ng~~~~iv~~g~~~--~Y~~d~~~~t~s~ 126 (472) .+-.+.++..|..+++.-. .+....++++.+.+.+-++....+. .|+..-....+.. T Consensus 75 ~~~~le~g~~~~r~~~~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~~~~~w~~~~ 154 (594) T protein:vir:10 75 NDVIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNAWQFVN 154 (594) T ss_pred CeEEEEEcCCeEEEEecCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEccCCCceEEe Confidence 1122222333332221110 0112234455454444433322221 1111111111111 Q ss_pred c-cccccccCCcccccceeeeeceeEEEEecC--CCeEEEEccc--------CCCCcCCccceeEeecCCCceEEEEecC Q lcl|NC_011802. 127 W-TADSGFTQYELGSVRDITRLRGRYAWSKDG--TDSWFITDLE--------DESHPDRYSAEYRAESQPDGIIGIGTWR 195 (472) Q Consensus 127 ~-~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g--~~~~~iS~L~--------D~s~~~~~l~fatAE~~pD~iv~~~~~~ 195 (472) . .....+...+..-+..|+|...|.+|...- .+.++.|.-. .+...+|+.+++.+ +-++++.++... T Consensus 155 ~~~~~~p~~~~~~~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~~~ddd~i~~~~s--~~~~~~~~v~~~ 232 (594) T protein:vir:10 155 MHTGAVPAEWSPSNYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGI--MEGTPCWIIASS 232 (594) T ss_pred cccCcccccccCCccceEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCCCCCCccEEEEEe--cccceEEEEecC Confidence 0 011111122334456788888997775321 2344455322 22235778888655 347888999998 Q ss_pred CEEEEEEcceEEEEEecCCCC----CccccceecccceeeeccccchhheecCceEEEEEeccccccEEEE------ccC Q lcl|NC_011802. 196 DFIVCFGSSTIEYFSLTGATT----VGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYI------IGS 265 (472) Q Consensus 196 ~~l~lfG~~T~Evw~~tGa~~----~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~------~~g 265 (472) +.|++|.+..- |..+|.++ |......++ + ..||.+- --..++++++|+++.+. .|+- .++ T Consensus 233 ~~L~i~t~~~e--~~l~~~~~~~lTp~~~~~~~~-s---~~g~~~~-~P~~vg~~~~fv~~~g~---~vre~~y~~~~d~ 302 (594) T protein:vir:10 233 DVLTIGTTIND--YQLAASTGVSVTAATAILRRS-S---VQGTAAV-QGIPAEEQVIFCSRNKS---KVYAMNYVREQDN 302 (594) T ss_pred CceEEEecCce--EEEecCCCcccccceEEEEEe-e---eeccCCC-cceeeCCeEEEEcCCCC---EEEEEEEeeccCc Confidence 88888766555 66665432 222223332 1 2466432 34578999999987763 3332 457 Q ss_pred ccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeee Q lcl|NC_011802. 266 GQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAID 342 (472) Q Consensus 266 ~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~ 342 (472) |+++.+|-+ ++.++..-....-...+..+|+++-+.++.+...| ..|.|+-...++ -||.-+.. ++..+..| T Consensus 303 y~~~dlt~~-a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~eq~v~--aWs~~~~t--~G~v~~va 377 (594) T protein:vir:10 303 WIPDEMSSQ-AQHLFTPISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFDRTTDTK--AWTQLELS--GGKVIDIA 377 (594) T ss_pred eeccchhhh-hhhhcCccccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEeccccee--eeEeeccC--CCcEEEEE Confidence 888777543 45554321111234467778888888777777775 367777655444 57765421 22445555 Q ss_pred EeecCC---e-EEE--EEccCCe------EEEEcCCccCCCCCEEEEEEeec---------cccCCCceEE--------- Q lcl|NC_011802. 343 FMYEGN---Q-ITC--GDKSEAV------TGQLQFDISSQYDKQQEHLLFTP---------IFKADNARCF--------- 392 (472) Q Consensus 343 ~~~~~g---~-~~v--GD~~~g~------l~~ld~~~~~d~g~p~~~~~~tP---------~~~~~~~r~~--------- 392 (472) .++... - .+| .|..+|. |=+|+. ......+..+.+.++ +-|-++..+. T Consensus 378 ~i~~~~~d~l~~~V~R~~ti~g~~~~y~~lE~~~~--~~~~~~~~~~~~d~~~~~~~~vsgl~hLeg~tv~v~aDG~~~~ 455 (594) T protein:vir:10 378 AAFNPDSDYAYVAVVRSKAINGVQKNYTVLEKISS--PRTDWKRADGWVVAQVNQNGDVLNLDRYIGRTAVIFSKYGLEA 455 (594) T ss_pred EeecCCCCEEEEEEEECCccccceeeEEEeecCCC--ccccccccceeeeecccccceeecccccCCceEEEEeCCeecC Confidence 543211 0 111 1111111 111111 111111111111100 0000111110 Q ss_pred EE---------------------------------EEEEEcCCCCC--c----hhheeeeccC-ccccC-c-------c- Q lcl|NC_011802. 393 DL---------------------------------EVESSTGVAQY--A----DRLFLSATTD-GINYG-R-------E- 423 (472) Q Consensus 393 ~~---------------------------------~le~~~Gv~~~--~----~~~~l~~sdD-G~~~~-~-------~- 423 (472) +. .+|+..+-+.. . .++-|+--+= |..-+ + + T Consensus 456 ~~~V~~g~itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~r~ri~r~~v~~~~S~g~~vg~~~~~~r~~~~ 535 (594) T protein:vir:10 456 EVEVNNIGLTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGSKIRISKVQLALFDSIEPTVNGEPADDRSTDD 535 (594) T ss_pred CeEEcCCeeEeeccCCCCcceEEEeeeeeEEEEeecccccCCcccccCccEEEEEEEEEEEcceeeEECCcccccccchh Confidence 00 00000000000 0 0011111000 00000 0 0 Q ss_pred ----eeec-cC--CCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 424 ----QMIE-QN--EPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 424 ----~~~~-~g--~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) +... .| ...-.+.++++..+|..++. -.+|+-+.|-|.+|.+...++| T Consensus 536 ~~~~~~~~~~g~~~~~tg~~~v~~~~~G~~~~~-~i~I~qd~PlPltvlai~~ev~ 590 (594) T protein:vir:10 536 IMDARLLDFSSNSGSSNGTRLVDYNPLGWENDG-KMVIAVEQPFLCEVVGVFSVVQ 590 (594) T ss_pred hccccCCcccCcccccCCceEEEEccCCcCccc-EEEEEECCCcCEEEEEEEEEEE Confidence 0000 00 11122445667777865555 4678888888888888888888 No 15 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=93.25 E-value=0.0082 Score=32.01 Aligned_cols=314 Identities=11% Similarity=0.027 Sum_probs=135.0 Q ss_pred CceeeeeecccC-cc-------------c-cccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeee Q lcl|NC_011802. 1 MPIQQLPMMKGM-GK-------------D-FKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYN 65 (472) Q Consensus 1 M~~~~vPl~~G~-~~-------------~-~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~ 65 (472) ++..-+-...+. |- + ...+++....++...-. ..+..+..+++|.....--.+..+...|= T Consensus 319 ~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~----~~~~Lp~~a~~g~~v~v~~~~~~~~~~Yy 394 (680) T protein:vir:17 319 LGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVD----TLAELPTKCWNDYQVAVRNTQDTEVDDYY 394 (680) T ss_pred cCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeec----cccccccccCCCcEEEEEeCCCCcccceE Confidence 222211111110 00 0 01111111111111100 01111222334432221111111111110 Q ss_pred ecc---CeEEEEECcceeeee----eeEEcccCceeEEEEcCC--cEEEEEECCceeEEEEecc--chhhcccccccccc Q lcl|NC_011802. 66 TAQ---NAVYRVLGSKLYKGE----TVVGDVAGSGRVSMAHGR--TSQAVGVNGQLVEYRYDGT--VKTVSNWTADSGFT 134 (472) Q Consensus 66 ~~~---~~lY~V~G~~LY~v~----~~iGtv~gsg~VsMa~Ng--~~~~iv~~g~~~~Y~~d~~--~~t~s~~~~d~~f~ 134 (472) +.- +.-....+..-|+-. ...+-..++-|..+.+++ .......+++.....|... ..+.++ ..+.|. T Consensus 395 v~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tn--p~psF~ 472 (680) T protein:vir:17 395 VKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTN--PHPTFT 472 (680) T ss_pred EEEeccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccC--CCcccc Confidence 000 000000011112100 011222233344444333 2222222322211111111 011111 122342 Q ss_pred CCccc-ccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEc Q lcl|NC_011802. 135 QYELG-SVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGS 203 (472) Q Consensus 135 ~~~~~-~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~ 203 (472) +.+ .+.+|+|..+|++|.. .+.++.|.-.|. ...+|+.+++.+-.+++.|.-++++++.|+||.. T Consensus 473 --~~G~~p~~v~f~q~RL~f~s--~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~ 548 (680) T protein:vir:17 473 --ESGNGIYGMFMYKNRLGFLT--QDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGAILFGN 548 (680) T ss_pred --cCCCCceEEEEEcceEEEee--CCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcEEEEec Confidence 112 3788999999999875 334555644432 1247899999999999999999999999999988 Q ss_pred ceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEE------EEccCccceecCCHHH Q lcl|NC_011802. 204 STIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSV------YIIGSGQASPIATASI 276 (472) Q Consensus 204 ~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V------~~~~g~q~~rIST~~i 276 (472) .. | |..+|..+ ++......--.+ ..+|...-.-..++++++|+++.+. --.| +.-++|+++.+| .-+ T Consensus 549 g~-q-~~ls~~~~--~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~-~s~vre~~y~~~~d~y~a~DlT-~~a 622 (680) T protein:vir:17 549 QA-Q-FRLSSPDE--SFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGT-YSSVYELSTESAKGTPVIEDSS-RVI 622 (680) T ss_pred Ce-E-EEEecCCc--eecceeEEEEEEEeecccCCCCceEeCCeEEEeecCCC-cceEEEEeeeeccCceehhhHH-HHH Confidence 63 3 55666433 244333111112 3578777777889999999998752 1224 344566766663 345 Q ss_pred HHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC-eEEEEec---ccccCchheeeeccCccccce Q lcl|NC_011802. 277 EKIIRSYTADELATGVMEALRLDSHELLIIHLPR-HVLVYDA---SSSQNGPQWCVLKTGLYDDVY 338 (472) Q Consensus 277 E~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~-~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~ 338 (472) +.+|+.. + -.++.++.+.+.++....-+ .-++|-- .-.+...-||+-.=+ +..| T Consensus 623 ~hl~~g~----v--~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~--~~d~ 680 (680) T protein:vir:17 623 PRLIPSG----L--TWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYE--DQDH 680 (680) T ss_pred HHhcCCc----e--EEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecC--CCCC Confidence 6666533 2 22566777887655554333 4444431 122211135533221 2223 No 16 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=93.14 E-value=0.0086 Score=31.90 Aligned_cols=410 Identities=13% Similarity=0.111 Sum_probs=160.0 Q ss_pred CceeeeeecccCccccccC------------------------CeeEEeeeeeeecccccCccccee--EcCCCceeeee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNA------------------------DYIDYLPINMLATPKEVLNSSGYL--RSFPGIAKRND 54 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~------------------------d~~~~~pvn~~~~~~e~~~s~~~L--rs~PGl~~~~~ 54 (472) .+...++-..........+ .....-.-|++.... . ..++++ ..+||... .. T Consensus 176 ~~~~t~ta~~~~~~~d~vg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~g~~~~~~~~~~~~-~~ 252 (823) T protein:vir:95 176 TGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAVT-A-GKTGTLRPSHTEGTSW-DG 252 (823) T ss_pred CceeEEeecccccchhhccceEEEeccccceeeecceeeeecccceEEecccceeeee-c-cccceeecccCCcceE-Ee Confidence 1111111000000000000 000000001110000 0 001110 01111110 00 Q ss_pred cCCCccceeeeeccCeEEEE--ECcceeeeeeeEEccc-CceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccc Q lcl|NC_011802. 55 VNGISRGVEYNTAQNAVYRV--LGSKLYKGETVVGDVA-GSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADS 131 (472) Q Consensus 55 v~g~~rg~~~~~~~~~lY~V--~G~~LY~v~~~iGtv~-gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~ 131 (472) ..+... .....-|++ .+....+.+...+.+. +.-...|.++ ++.+. ...+.++. ..|.... T Consensus 253 ~~~~~~-----~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~~~~~~~-----~~~~~-~~t~~~~~-----~~~~~~~ 316 (823) T protein:vir:95 253 WGGSGD-----DDTGIEWEYLHSGFGIARITAVNGTTATAEVISYIPSQ-----VVGED-NASYKWAK-----YAWNSVN 316 (823) T ss_pred ceeccc-----ccceeEEEEEeCCcceEEEEeecceeeeceEeeeeccc-----cccCC-cCCccccc-----cccCcCC Confidence 000000 000011111 1122333222222221 1111223332 11111 11122221 1122223 Q ss_pred cccCCcccccceeeeeceeEEEEec--CCCeEEEEcccCC--------CCcCCccceeEeecCCCceEEEEecCCEEEEE Q lcl|NC_011802. 132 GFTQYELGSVRDITRLRGRYAWSKD--GTDSWFITDLEDE--------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCF 201 (472) Q Consensus 132 ~f~~~~~~~~~dv~~~dGyfv~~~~--g~~~~~iS~L~D~--------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lf 201 (472) +| ++-|+|..+|++|... ..+.++.|.-.|+ ...+|+.++.-+..+++.|.-+++.+ .|++| T Consensus 317 g~-------Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli~ 388 (823) T protein:vir:95 317 GY-------PGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQDDDRIIYTYAGRQVNEIRHLIDVG-SLVAL 388 (823) T ss_pred CC-------ccEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccCCCCCCcEEEEEcCCcceEEEEEeecC-cEEEE Confidence 33 3558888889887631 2344555543332 34578999999999999999999996 46666 Q ss_pred EcceEEEEEecCCC----CCccccceecccceeeeccccchhheecCceEEEEEeccccccEEE------EccCccceec Q lcl|NC_011802. 202 GSSTIEYFSLTGAT----TVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPI 271 (472) Q Consensus 202 G~~T~Evw~~tGa~----~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g~q~~rI 271 (472) .+. -| |..+|.+ +|......++ + ..||.. -.-..++++++|++..+. .|+ ..++|+++.+ T Consensus 389 t~~-~e-~~l~~~~~~~lTP~~~~~~~~-s---~~g~~~-~~Pv~vg~~~~Fv~~~g~---~vre~~~~~~~d~~~~~dl 458 (823) T protein:vir:95 389 TSG-GE-YVITGDQNKVLTPSSFAFSSQ-G---SNGSSN-VPPIAVANIALFVQEKGS---VVRDLAYSFDVDGYQGNDL 458 (823) T ss_pred ecC-cE-EEEEcCCCcccceeeEEEEEe-e---cccccc-ccceEeCCeEEEEecCCC---EEEEEEEeeecCceecchh Confidence 666 44 5555532 2222333332 2 568753 456679999999987652 332 3466777777 Q ss_pred CCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCC Q lcl|NC_011802. 272 ATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGN 348 (472) Q Consensus 272 ST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g 348 (472) |- -++.+++.. ..+.+.|+.+.+....+.+-| ..+.|+-....+ -||.-.++ +..+..+++..+. T Consensus 459 T~-~a~hl~~~~------~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~~q~v~--aW~~~~~~---g~~~~~~~i~~~~ 526 (823) T protein:vir:95 459 TI-LANHLFQKH------SIVDWCFSIVPYSSAFCIRDDGKLLVMTYLRDQQVF--AWAPQSST---GKYESTCSISEGN 526 (823) T ss_pred hh-hhhhhcCCC------ceEEEEEecCCCeEEEEEecCCcEEEEEEeccccee--eeEEEecC---CcEEEEEEecCCC Confidence 42 223344321 245666777777666666664 356777543333 58877663 4677777765322 Q ss_pred eEEEEEccCCeEEEEcCCccCCCCCEEEEE--EeeccccCCCceE-EEEEEEEEcCCCCCchhh---------------e Q lcl|NC_011802. 349 QITCGDKSEAVTGQLQFDISSQYDKQQEHL--LFTPIFKADNARC-FDLEVESSTGVAQYADRL---------------F 410 (472) Q Consensus 349 ~~~vGD~~~g~l~~ld~~~~~d~g~p~~~~--~~tP~~~~~~~r~-~~~~le~~~Gv~~~~~~~---------------~ 410 (472) ...||.+=.. +-+|+....+ +.+..+..+..++ .|..+... |.+-.+... . T Consensus 527 --------~d~l~~~v~R--~i~g~~~~yiE~~~~~~~~~~~~~~~lD~~~s~~-g~~~~~~~~~l~~g~~~l~~l~g~~ 595 (823) T protein:vir:95 527 --------EDAVYFVVNR--TVNGQTVRYIERLSSRLFTSDEDAFFVDSGLSYD-GRNTSDRTMTITGGSGEWDYLAEYT 595 (823) T ss_pred --------CCEEEEEEEe--ccCCeEEEEEEeeccccCCCccceeEEEEEEEee-cCcccceeeEecCCCCcccccCceE Confidence 2345554332 2344443322 2223333222222 23332221 111111111 1 Q ss_pred eeeccCccccCcc---eeeccCCC-------cccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 411 LSATTDGINYGRE---QMIEQNEP-------FVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 411 l~~sdDG~~~~~~---~~~~~g~~-------g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) ++. .||...... -.|.++.+ -.|.++++...... .++-.-+++--..+|..+..+..--. T Consensus 596 v~~-adg~~~~~~~v~g~i~l~~~~~~~~vGl~~~~~i~~~~~~v-~~~~a~~~~~~r~v~a~l~~~~t~~~ 665 (823) T protein:vir:95 596 ISV-SGGAYFTSSDVGAQLQFPYTGADPDTGYEVSKELRCDIISV-TSNTAVVVRANRNVPPSLRNVATTNW 665 (823) T ss_pred EEe-cCcceECCccceeEEEeCcCCCccccccceEEEEEEeecee-eCCceEEEccCCcccceeeeeecccc Confidence 222 222221111 00111110 01444444443332 22211222222223333333222211 No 17 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=92.11 E-value=0.013 Score=30.96 Aligned_cols=440 Identities=11% Similarity=0.068 Sum_probs=162.9 Q ss_pred Cceeeeeeccc-CccccccCCeeEEeeeeeeecccccCcccceeEcCCCc-eeeeecCCCcc-ceeee-----eccCeEE Q lcl|NC_011802. 1 MPIQQLPMMKG-MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGI-AKRNDVNGISR-GVEYN-----TAQNAVY 72 (472) Q Consensus 1 M~~~~vPl~~G-~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl-~~~~~v~g~~r-g~~~~-----~~~~~lY 72 (472) |+...+--..+ .+-... +....++.. .++..-.++ ++.-+. ....+|+.-++ |.-.. .....-| T Consensus 207 ~~~~t~~~~g~~i~i~~~-----~~~~~~~~~--~~~~~~~~~-~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y 278 (800) T protein:vir:10 207 VNDYEIQRDGTSIFIERR-----DGKSFTVTT--TDGAKGKDL-VAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRY 278 (800) T ss_pred ccceEEEEcCcEEEEEEe-----cCCceEEEE--eecCCcceE-EEEEeeccceeeccccCCCCceEEEEcCCCCCCcee Confidence 22211111100 010000 001111111 111111110 100000 00011211111 10000 0111111 Q ss_pred EEEC------cceeeeeeeEEccc----CceeEEEEcCCcEEEEE-ECCceeEEEEeccchhhcc--ccccccccCCcc- Q lcl|NC_011802. 73 RVLG------SKLYKGETVVGDVA----GSGRVSMAHGRTSQAVG-VNGQLVEYRYDGTVKTVSN--WTADSGFTQYEL- 138 (472) Q Consensus 73 ~V~G------~~LY~v~~~iGtv~----gsg~VsMa~Ng~~~~iv-~~g~~~~Y~~d~~~~t~s~--~~~d~~f~~~~~- 138 (472) .|.. ...|+-+...|.+. ++-|-.+..+ .++ .++.-..+..|.....+-. .-..+.|.+... T Consensus 279 ~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv~~----~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~ 354 (800) T protein:vir:10 279 WLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERT----GIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVP 354 (800) T ss_pred EEEEEeccccceEEEeecccCceeeeecccccEEEEEe----eeeecceeEEEEeccccccccCCCCCCCCchhcCCCCC Confidence 1111 11222111111100 1111111111 111 1233333333433322211 002234444322 Q ss_pred cccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEE Q lcl|NC_011802. 139 GSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEY 208 (472) Q Consensus 139 ~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Ev 208 (472) ..+.+|+|..+|++|.. .+.++.|.-.|. ...+|+.+++.+-.+++.|.-++++++.|+||.+..- T Consensus 355 ~~i~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q-- 430 (800) T protein:vir:10 355 QTIGGMFMVQNRLCFTA--GEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQ-- 430 (800) T ss_pred CCceeEEEEeeeEEEee--CCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcE-- Confidence 23678999999999875 344555644332 2347899999999999999999999999999976665 Q ss_pred EEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEE------EccCccceecCCHHHHHHHh Q lcl|NC_011802. 209 FSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPIATASIEKIIR 281 (472) Q Consensus 209 w~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g~q~~rIST~~iE~~i~ 281 (472) |..+|.. ++....-.--.+ ..+|...-.-..++++++|+++.+. -..|+ ..++|+++.+|-| ++.+|+ T Consensus 431 ~~l~g~~---~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~-~s~vre~~~~~~~d~~~a~DlT~~-~~hl~~ 505 (800) T protein:vir:10 431 FILPGDK---PLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDGS-YSGVREFYTDSYSDTKKAQAITSH-VNKLIE 505 (800) T ss_pred EEEeCCC---cccceeEEEEEEEeeeccCCCCceEeCCeEEEecCCCC-eeEEEEEeeeecccceehhhHHhH-HHHhcC Confidence 7777642 233322111112 3578888888899999999998752 11243 3366777777433 455664 Q ss_pred hcCchhhccEEEEEEEeCCEEEEEEECC-CeEEEEec---ccccCchheeeeccCccccceEeeeEeecCCeEEEEEccC Q lcl|NC_011802. 282 SYTADELATGVMEALRLDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSE 357 (472) Q Consensus 282 ~y~~~e~~~A~~~~~~~~GH~fy~lt~P-~~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~ 357 (472) .- +.. +.+.+.+.+.++....- +.-.+|-- ...+...-||.-.-+.+ ....+.++ -.+.-+++=...+ T Consensus 506 ~~----v~~--~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW~~w~~~~~-~~~~~~~~-~~d~l~~iv~r~~ 577 (800) T protein:vir:10 506 GN----ITN--MAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWEWPMG-TKVRGMFY-SGELLYLLLERGD 577 (800) T ss_pred Cc----eEE--EEEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEEEEEEcCCC-cEEEEEEE-eCCeEEEEEECCC Confidence 32 211 22233345544333333 33334331 11111123775442211 13333333 2556666766655 Q ss_pred CeE-EEEcCCccCCCCCEEEEEEe---ecccc---C--------------CCceEEEEEEEEEcCCCC-Cchhhe--eee Q lcl|NC_011802. 358 AVT-GQLQFDISSQYDKQQEHLLF---TPIFK---A--------------DNARCFDLEVESSTGVAQ-YADRLF--LSA 413 (472) Q Consensus 358 g~l-~~ld~~~~~d~g~p~~~~~~---tP~~~---~--------------~~~r~~~~~le~~~Gv~~-~~~~~~--l~~ 413 (472) +.. -+|++....+.+.+....+- ++.+. + +...+..+.+ .|... .+-.++ ... T Consensus 578 ~~~ier~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~g~v~~~~~~ 654 (800) T protein:vir:10 578 GVYLEKMDMGDALTYGLNDRIRMDRQAELIFKHFKAEDEWISEPLPWTPTNPELLDCILI---EGWDSYIGGSFLFKYKP 654 (800) T ss_pred cEEEEEEecccCccccccceeeeecceeecccccccCcceEEEeccccccCCcceEEeee---ccceeecCceeEEEEEe Confidence 533 33655544444444322111 11111 1 1111111111 11100 001111 122 Q ss_pred ccCccccCcc--------eeeccCCC----------------cc--cceeEEEEeeE-ecccceeEEEEEEecCc----c Q lcl|NC_011802. 414 TTDGINYGRE--------QMIEQNEP----------------FV--YDKRVIWKRVG-RIRRLIGFKLRVITKSP----V 462 (472) Q Consensus 414 sdDG~~~~~~--------~~~~~g~~----------------g~--~~~r~~~~rlG-~~r~~v~f~~r~~~~~~----~ 462 (472) +++++++-.+ ..+-.|.+ |. ...|.+.+|+= +..+--+|++++..... + T Consensus 655 ~~g~~~~~~~~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~ 734 (800) T protein:vir:10 655 SDNTLSTTFDMHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRR 734 (800) T ss_pred cCCceEeeeeecCCCcccceEEEeeeeeEEEeecceEEEcCCCcccccCCeEEEEEEEEeecCceEEEEeccCcccceeE Confidence 2223222110 01112211 11 11222222210 00111124443322211 1 Q ss_pred eE-----EEeEEEe----C Q lcl|NC_011802. 463 TL-----SGCQIRL----E 472 (472) Q Consensus 463 ~l-----~~~~~~~----e 472 (472) .+ .|..... + T Consensus 735 ~~~~~~~~g~~~~~~g~~~ 753 (800) T protein:vir:10 735 VLASNRIGGALNNTVGYVE 753 (800) T ss_pred EccCCeeccccccccCccc Confidence 11 1111111 0 No 18 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=423 Identities=12% Similarity=0.092 Sum_probs=160.6 Q ss_pred CceeeeeecccCccccccCCeeEEeeee----------eeecccccCc------------ccceeEcCCCceeeeecCCC Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPIN----------MLATPKEVLN------------SSGYLRSFPGIAKRNDVNGI 58 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn----------~~~~~~e~~~------------s~~~Lrs~PGl~~~~~v~g~ 58 (472) +..-..|..+ +.+ ....|| ..-++. +.++....+- +.+..-+.|.....+.-. . T Consensus 129 i~h~~~~p~~-L~r-~~~~~W-~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~-~ 204 (681) T protein:vir:10 129 LVHPNYAPRE-LRR-LGATNW-QLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNN-L 204 (681) T ss_pred EECCCCcceE-EEE-ccCCce-EEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeee-e Confidence 1111111111 000 001111 111111 1111100000 000000000000000000 0 Q ss_pred ccceeeeeccCeEEEEECcceeeee-------eeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccc Q lcl|NC_011802. 59 SRGVEYNTAQNAVYRVLGSKLYKGE-------TVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADS 131 (472) Q Consensus 59 ~rg~~~~~~~~~lY~V~G~~LY~v~-------~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~ 131 (472) ..... ...-..-.+.|..-|.+- ..+|. .++--..++| ++.+.. ..++.... .+.... T Consensus 205 ~~~~~--~~t~~w~a~~g~~~~~V~~~~~gi~g~ig~--~~~~~~~~~~-----~~~~~~---~t~~~~~~---~~~~~~ 269 (681) T protein:vir:10 205 FTNGG--ANTIAWSASSGASRYNVYKEQGGLYGYIGQ--TTGTSLVDDN-----IAPDLS---VTPPIYDA---VFNAAG 269 (681) T ss_pred ecCCc--ceeEEEEecCCceeeeecccceeEEEEeec--cceeeeeecc-----cccCcc---cccccccc---ccccCC Confidence 00000 000011222232222211 12222 1111111111 111111 01111111 111222 Q ss_pred cccCCcccccceeeeeceeEEEEec--CCCeEEEEccc--------CCCCcCCccceeEeecCCCceEEEEecCCEEEEE Q lcl|NC_011802. 132 GFTQYELGSVRDITRLRGRYAWSKD--GTDSWFITDLE--------DESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCF 201 (472) Q Consensus 132 ~f~~~~~~~~~dv~~~dGyfv~~~~--g~~~~~iS~L~--------D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lf 201 (472) +| +..|+|..+|.+|... ..+.++.|.-. .+...+|+.++.-+-.+++.|.-++++++ |++| T Consensus 270 gy-------P~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~-lli~ 341 (681) T protein:vir:10 270 DY-------PAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTE-LLLL 341 (681) T ss_pred Cc-------eEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCc-EEEE Confidence 32 3568899999888521 22334444322 23345789999999999999999999865 5555 Q ss_pred EcceEEEEEecCCCCCccccceec-ccceeeeccccchhheecCceEEEEEeccccccEEE------EccCccceecCCH Q lcl|NC_011802. 202 GSSTIEYFSLTGATTVGAALYVAH-ASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPIATA 274 (472) Q Consensus 202 G~~T~Evw~~tGa~~~~~fpy~r~-~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g~q~~rIST~ 274 (472) .+. -|+.-..+..+ ++....- -...-..||.. -.-..++++++|++..+. .|+ ..++|+++.+| - T Consensus 342 t~~-~e~~l~~~~~~--~lTP~~~~~~~~s~~g~~~-~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d~~~~~dlt-~ 413 (681) T protein:vir:10 342 TSS-GEWRVASVNSD--AVTPTTISVRPQSYVGATD-VQPVVVNNTTIYGAARGG---HVRELAYNWQANGFVTGDLS-L 413 (681) T ss_pred EcC-cEEEEecCCCc--cccceeEEEEEeeeecccc-ccceeeCCeEEEEecCCC---EEEEEEEeeecCceeccchh-h Confidence 555 45443333222 2332220 01112468854 556788999999998873 232 45667777775 1 Q ss_pred HHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCe-- Q lcl|NC_011802. 275 SIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ-- 349 (472) Q Consensus 275 ~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~-- 349 (472) -.+.+++.. ..+..+|+.+.+.+..+.+.| ..+.|+-...++ -||.-.++ +..+..|++..+++ T Consensus 414 ~a~Hl~~~~------~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~--aW~~~~~~---g~v~~v~~i~~~~~d~ 482 (681) T protein:vir:10 414 RAAHLFDNL------DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIG--AWHQHDTD---GVFESCAVVAEGNEDR 482 (681) T ss_pred hhhhhcCCC------CeEEEEEecCCCEEEEEEecCCcEEEEEEeccccee--eEEEEecC---CcEEEEEEecCCCCcE Confidence 113333322 355677889999888888875 577787554433 58877763 35566655543322 Q ss_pred -EEE-----EEccCCeEEEEc--------------CCccCCCCCEEEEEEeeccccCCCceEE----------------- Q lcl|NC_011802. 350 -ITC-----GDKSEAVTGQLQ--------------FDISSQYDKQQEHLLFTPIFKADNARCF----------------- 392 (472) Q Consensus 350 -~~v-----GD~~~g~l~~ld--------------~~~~~d~g~p~~~~~~tP~~~~~~~r~~----------------- 392 (472) |++ ++...-.|=+|+ ..... .+.+...+--.+++ ++..+. T Consensus 483 l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~-~~~~~~~~sgl~~l--eG~tv~i~aDG~~~~~~~V~~G~ 559 (681) T protein:vir:10 483 LYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTY-SGEPVSHISGLEHL--EGKTVSILADGAVHPQRVVTDGA 559 (681) T ss_pred EEEEEEecCCCCeEEEEEecCCccccccccceEeeccccc-cCcceeeeccccCC--CCcEEEEEeCCeecCcEeecCcE Confidence 111 100000111121 11110 11111110000100 111000 Q ss_pred ----------------EEE-----EEEEcCCC--CC----chhheeeeccC-ccccC--ccee--e------ccCC-Ccc Q lcl|NC_011802. 393 ----------------DLE-----VESSTGVA--QY----ADRLFLSATTD-GINYG--REQM--I------EQNE-PFV 433 (472) Q Consensus 393 ----------------~~~-----le~~~Gv~--~~----~~~~~l~~sdD-G~~~~--~~~~--~------~~g~-~g~ 433 (472) ..+ +++...-+ +. -.++-|+.-+. |...+ ..++ + .+|. +-. T Consensus 560 itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~g~~~~l 639 (681) T protein:vir:10 560 IDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHADALTEVKQRTSEPYGSPPAL 639 (681) T ss_pred EEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCCceEEEEEeccccccccCCc Confidence 000 11100000 00 01112222221 11100 0000 0 0111 111 Q ss_pred cceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 434 YDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 434 ~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) +.-.++.---|.-.++.-++|+-..|-|..|.++..++| T Consensus 640 ~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~ 678 (681) T protein:vir:10 640 KSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIA 678 (681) T ss_pred cCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEE Confidence 222222221122223334667777777777777777777 No 19 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=423 Identities=12% Similarity=0.092 Sum_probs=160.6 Q ss_pred CceeeeeecccCccccccCCeeEEeeee----------eeecccccCc------------ccceeEcCCCceeeeecCCC Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPIN----------MLATPKEVLN------------SSGYLRSFPGIAKRNDVNGI 58 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn----------~~~~~~e~~~------------s~~~Lrs~PGl~~~~~v~g~ 58 (472) +..-..|..+ +.+ ....|| ..-++. +.++....+- +.+..-+.|.....+.-. . T Consensus 129 i~h~~~~p~~-L~r-~~~~~W-~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~-~ 204 (681) T protein:vir:10 129 LVHPNYAPRE-LRR-LGATNW-QLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNN-L 204 (681) T ss_pred EECCCCcceE-EEE-ccCCce-EEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeee-e Confidence 1111111111 000 001111 111111 1111100000 000000000000000000 0 Q ss_pred ccceeeeeccCeEEEEECcceeeee-------eeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccc Q lcl|NC_011802. 59 SRGVEYNTAQNAVYRVLGSKLYKGE-------TVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADS 131 (472) Q Consensus 59 ~rg~~~~~~~~~lY~V~G~~LY~v~-------~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~ 131 (472) ..... ...-..-.+.|..-|.+- ..+|. .++--..++| ++.+.. ..++.... .+.... T Consensus 205 ~~~~~--~~t~~w~a~~g~~~~~V~~~~~gi~g~ig~--~~~~~~~~~~-----~~~~~~---~t~~~~~~---~~~~~~ 269 (681) T protein:vir:10 205 FTNGG--ANTIAWSASSGASRYNVYKEQGGLYGYIGQ--TTGTSLVDDN-----IAPDLS---VTPPIYDA---VFNAAG 269 (681) T ss_pred ecCCc--ceeEEEEecCCceeeeecccceeEEEEeec--cceeeeeecc-----cccCcc---cccccccc---ccccCC Confidence 00000 000011222232222211 12222 1111111111 111111 01111111 111222 Q ss_pred cccCCcccccceeeeeceeEEEEec--CCCeEEEEccc--------CCCCcCCccceeEeecCCCceEEEEecCCEEEEE Q lcl|NC_011802. 132 GFTQYELGSVRDITRLRGRYAWSKD--GTDSWFITDLE--------DESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCF 201 (472) Q Consensus 132 ~f~~~~~~~~~dv~~~dGyfv~~~~--g~~~~~iS~L~--------D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lf 201 (472) +| +..|+|..+|.+|... ..+.++.|.-. .+...+|+.++.-+-.+++.|.-++++++ |++| T Consensus 270 gy-------P~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~-lli~ 341 (681) T protein:vir:10 270 DY-------PAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTE-LLLL 341 (681) T ss_pred Cc-------eEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCc-EEEE Confidence 32 3568899999888521 22334444322 23345789999999999999999999865 5555 Q ss_pred EcceEEEEEecCCCCCccccceec-ccceeeeccccchhheecCceEEEEEeccccccEEE------EccCccceecCCH Q lcl|NC_011802. 202 GSSTIEYFSLTGATTVGAALYVAH-ASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPIATA 274 (472) Q Consensus 202 G~~T~Evw~~tGa~~~~~fpy~r~-~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g~q~~rIST~ 274 (472) .+. -|+.-..+..+ ++....- -...-..||.. -.-..++++++|++..+. .|+ ..++|+++.+| - T Consensus 342 t~~-~e~~l~~~~~~--~lTP~~~~~~~~s~~g~~~-~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d~~~~~dlt-~ 413 (681) T protein:vir:10 342 TSS-GEWRVASVNSD--AVTPTTISVRPQSYVGATD-VQPVVVNNTTIYGAARGG---HVRELAYNWQANGFVTGDLS-L 413 (681) T ss_pred EcC-cEEEEecCCCc--cccceeEEEEEeeeecccc-ccceeeCCeEEEEecCCC---EEEEEEEeeecCceeccchh-h Confidence 555 45443333222 2332220 01112468854 556788999999998873 232 45667777775 1 Q ss_pred HHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCe-- Q lcl|NC_011802. 275 SIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ-- 349 (472) Q Consensus 275 ~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~-- 349 (472) -.+.+++.. ..+..+|+.+.+.+..+.+.| ..+.|+-...++ -||.-.++ +..+..|++..+++ T Consensus 414 ~a~Hl~~~~------~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~--aW~~~~~~---g~v~~v~~i~~~~~d~ 482 (681) T protein:vir:10 414 RAAHLFDNL------DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIG--AWHQHDTD---GVFESCAVVAEGNEDR 482 (681) T ss_pred hhhhhcCCC------CeEEEEEecCCCEEEEEEecCCcEEEEEEeccccee--eEEEEecC---CcEEEEEEecCCCCcE Confidence 113333322 355677889999888888875 577787554433 58877763 35566655543322 Q ss_pred -EEE-----EEccCCeEEEEc--------------CCccCCCCCEEEEEEeeccccCCCceEE----------------- Q lcl|NC_011802. 350 -ITC-----GDKSEAVTGQLQ--------------FDISSQYDKQQEHLLFTPIFKADNARCF----------------- 392 (472) Q Consensus 350 -~~v-----GD~~~g~l~~ld--------------~~~~~d~g~p~~~~~~tP~~~~~~~r~~----------------- 392 (472) |++ ++...-.|=+|+ ..... .+.+...+--.+++ ++..+. T Consensus 483 l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~-~~~~~~~~sgl~~l--eG~tv~i~aDG~~~~~~~V~~G~ 559 (681) T protein:vir:10 483 LYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTY-SGEPVSHISGLEHL--EGKTVSILADGAVHPQRVVTDGA 559 (681) T ss_pred EEEEEEecCCCCeEEEEEecCCccccccccceEeeccccc-cCcceeeeccccCC--CCcEEEEEeCCeecCcEeecCcE Confidence 111 100000111121 11110 11111110000100 111000 Q ss_pred ----------------EEE-----EEEEcCCC--CC----chhheeeeccC-ccccC--ccee--e------ccCC-Ccc Q lcl|NC_011802. 393 ----------------DLE-----VESSTGVA--QY----ADRLFLSATTD-GINYG--REQM--I------EQNE-PFV 433 (472) Q Consensus 393 ----------------~~~-----le~~~Gv~--~~----~~~~~l~~sdD-G~~~~--~~~~--~------~~g~-~g~ 433 (472) ..+ +++...-+ +. -.++-|+.-+. |...+ ..++ + .+|. +-. T Consensus 560 itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~g~~~~l 639 (681) T protein:vir:10 560 IDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHADALTEVKQRTSEPYGSPPAL 639 (681) T ss_pred EEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCCceEEEEEeccccccccCCc Confidence 000 11100000 00 01112222221 11100 0000 0 0111 111 Q ss_pred cceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 434 YDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 434 ~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) +.-.++.---|.-.++.-++|+-..|-|..|.++..++| T Consensus 640 ~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~ 678 (681) T protein:vir:10 640 KSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIA 678 (681) T ss_pred cCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEE Confidence 222222221122223334667777777777777777777 No 20 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=423 Identities=12% Similarity=0.092 Sum_probs=160.6 Q ss_pred CceeeeeecccCccccccCCeeEEeeee----------eeecccccCc------------ccceeEcCCCceeeeecCCC Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPIN----------MLATPKEVLN------------SSGYLRSFPGIAKRNDVNGI 58 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn----------~~~~~~e~~~------------s~~~Lrs~PGl~~~~~v~g~ 58 (472) +..-..|..+ +.+ ....|| ..-++. +.++....+- +.+..-+.|.....+.-. . T Consensus 129 i~h~~~~p~~-L~r-~~~~~W-~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~-~ 204 (681) T protein:vir:98 129 LVHPNYAPRE-LRR-LGATNW-QLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNN-L 204 (681) T ss_pred EECCCCcceE-EEE-ccCCce-EEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeee-e Confidence 1111111111 000 001111 111111 1111100000 000000000000000000 0 Q ss_pred ccceeeeeccCeEEEEECcceeeee-------eeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccc Q lcl|NC_011802. 59 SRGVEYNTAQNAVYRVLGSKLYKGE-------TVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADS 131 (472) Q Consensus 59 ~rg~~~~~~~~~lY~V~G~~LY~v~-------~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~ 131 (472) ..... ...-..-.+.|..-|.+- ..+|. .++--..++| ++.+.. ..++.... .+.... T Consensus 205 ~~~~~--~~t~~w~a~~g~~~~~V~~~~~gi~g~ig~--~~~~~~~~~~-----~~~~~~---~t~~~~~~---~~~~~~ 269 (681) T protein:vir:98 205 FTNGG--ANTIAWSASSGASRYNVYKEQGGLYGYIGQ--TTGTSLVDDN-----IAPDLS---VTPPIYDA---VFNAAG 269 (681) T ss_pred ecCCc--ceeEEEEecCCceeeeecccceeEEEEeec--cceeeeeecc-----cccCcc---cccccccc---ccccCC Confidence 00000 000011222232222211 12222 1111111111 111111 01111111 111222 Q ss_pred cccCCcccccceeeeeceeEEEEec--CCCeEEEEccc--------CCCCcCCccceeEeecCCCceEEEEecCCEEEEE Q lcl|NC_011802. 132 GFTQYELGSVRDITRLRGRYAWSKD--GTDSWFITDLE--------DESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCF 201 (472) Q Consensus 132 ~f~~~~~~~~~dv~~~dGyfv~~~~--g~~~~~iS~L~--------D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lf 201 (472) +| +..|+|..+|.+|... ..+.++.|.-. .+...+|+.++.-+-.+++.|.-++++++ |++| T Consensus 270 gy-------P~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~-lli~ 341 (681) T protein:vir:98 270 DY-------PAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTE-LLLL 341 (681) T ss_pred Cc-------eEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCc-EEEE Confidence 32 3568899999888521 22334444322 23345789999999999999999999865 5555 Q ss_pred EcceEEEEEecCCCCCccccceec-ccceeeeccccchhheecCceEEEEEeccccccEEE------EccCccceecCCH Q lcl|NC_011802. 202 GSSTIEYFSLTGATTVGAALYVAH-ASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPIATA 274 (472) Q Consensus 202 G~~T~Evw~~tGa~~~~~fpy~r~-~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g~q~~rIST~ 274 (472) .+. -|+.-..+..+ ++....- -...-..||.. -.-..++++++|++..+. .|+ ..++|+++.+| - T Consensus 342 t~~-~e~~l~~~~~~--~lTP~~~~~~~~s~~g~~~-~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d~~~~~dlt-~ 413 (681) T protein:vir:98 342 TSS-GEWRVASVNSD--AVTPTTISVRPQSYVGATD-VQPVVVNNTTIYGAARGG---HVRELAYNWQANGFVTGDLS-L 413 (681) T ss_pred EcC-cEEEEecCCCc--cccceeEEEEEeeeecccc-ccceeeCCeEEEEecCCC---EEEEEEEeeecCceeccchh-h Confidence 555 45443333222 2332220 01112468854 556788999999998873 232 45667777775 1 Q ss_pred HHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCe-- Q lcl|NC_011802. 275 SIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ-- 349 (472) Q Consensus 275 ~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~-- 349 (472) -.+.+++.. ..+..+|+.+.+.+..+.+.| ..+.|+-...++ -||.-.++ +..+..|++..+++ T Consensus 414 ~a~Hl~~~~------~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~--aW~~~~~~---g~v~~v~~i~~~~~d~ 482 (681) T protein:vir:98 414 RAAHLFDNL------DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIG--AWHQHDTD---GVFESCAVVAEGNEDR 482 (681) T ss_pred hhhhhcCCC------CeEEEEEecCCCEEEEEEecCCcEEEEEEeccccee--eEEEEecC---CcEEEEEEecCCCCcE Confidence 113333322 355677889999888888875 577787554433 58877763 35566655543322 Q ss_pred -EEE-----EEccCCeEEEEc--------------CCccCCCCCEEEEEEeeccccCCCceEE----------------- Q lcl|NC_011802. 350 -ITC-----GDKSEAVTGQLQ--------------FDISSQYDKQQEHLLFTPIFKADNARCF----------------- 392 (472) Q Consensus 350 -~~v-----GD~~~g~l~~ld--------------~~~~~d~g~p~~~~~~tP~~~~~~~r~~----------------- 392 (472) |++ ++...-.|=+|+ ..... .+.+...+--.+++ ++..+. T Consensus 483 l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~-~~~~~~~~sgl~~l--eG~tv~i~aDG~~~~~~~V~~G~ 559 (681) T protein:vir:98 483 LYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTY-SGEPVSHISGLEHL--EGKTVSILADGAVHPQRVVTDGA 559 (681) T ss_pred EEEEEEecCCCCeEEEEEecCCccccccccceEeeccccc-cCcceeeeccccCC--CCcEEEEEeCCeecCcEeecCcE Confidence 111 100000111121 11110 11111110000100 111000 Q ss_pred ----------------EEE-----EEEEcCCC--CC----chhheeeeccC-ccccC--ccee--e------ccCC-Ccc Q lcl|NC_011802. 393 ----------------DLE-----VESSTGVA--QY----ADRLFLSATTD-GINYG--REQM--I------EQNE-PFV 433 (472) Q Consensus 393 ----------------~~~-----le~~~Gv~--~~----~~~~~l~~sdD-G~~~~--~~~~--~------~~g~-~g~ 433 (472) ..+ +++...-+ +. -.++-|+.-+. |...+ ..++ + .+|. +-. T Consensus 560 itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~g~~~~l 639 (681) T protein:vir:98 560 IDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHADALTEVKQRTSEPYGSPPAL 639 (681) T ss_pred EEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCCceEEEEEeccccccccCCc Confidence 000 11100000 00 01112222221 11100 0000 0 0111 111 Q ss_pred cceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 434 YDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 434 ~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) +.-.++.---|.-.++.-++|+-..|-|..|.++..++| T Consensus 640 ~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~ 678 (681) T protein:vir:98 640 KSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIA 678 (681) T ss_pred cCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEE Confidence 222222221122223334667777777777777777777 No 21 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=86.96 E-value=0.042 Score=28.13 Aligned_cols=425 Identities=11% Similarity=0.086 Sum_probs=152.5 Q ss_pred CceeeeeecccCccc---cccCCee--EEeeeeeeecccc--c--------CcccceeEcCCCceeeee-------c-CC Q lcl|NC_011802. 1 MPIQQLPMMKGMGKD---FKNADYI--DYLPINMLATPKE--V--------LNSSGYLRSFPGIAKRND-------V-NG 57 (472) Q Consensus 1 M~~~~vPl~~G~~~~---~~~~d~~--~~~pvn~~~~~~e--~--------~~s~~~Lrs~PGl~~~~~-------v-~g 57 (472) |..=..+-..+.... +.+.+.. ...+.-++.+... . ....+..+.+.+..-.+. + .. T Consensus 164 ~~~sv~v~asg~tg~~TiTaS~a~~~~~~vG~~i~~~~~~v~si~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~ 243 (825) T protein:vir:73 164 VDETVKVYASASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLRPS 243 (825) T ss_pred ccccceeeecccCceeEEEeeccccCchhcCeEEEEecccccccceeeeeeEEEeeeEEECCCceeeeecccccceeecc Confidence 111000111110000 0000000 0000000000000 0 000000111111111000 0 00 Q ss_pred Cccceeeeecc-------CeE--EEEECcceeeeeeeEEc---ccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhc Q lcl|NC_011802. 58 ISRGVEYNTAQ-------NAV--YRVLGSKLYKGETVVGD---VAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVS 125 (472) Q Consensus 58 ~~rg~~~~~~~-------~~l--Y~V~G~~LY~v~~~iGt---v~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s 125 (472) ...+..+..++ +.. |+-.+....+.+...++ ..++....|.++ ++.+.. ..+.+. .. T Consensus 244 a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~t~~~~-----~~ 312 (825) T protein:vir:73 244 HTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVVSFIPSQ-----VVGSAN-ASYKWA-----KY 312 (825) T ss_pred ccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccceecccc-----cccCCC-CCcccc-----cC Confidence 01111110000 111 11111112222221111 011112222221 111111 001111 11 Q ss_pred cccccccccCCcccccceeeeeceeEEEEec--CCCeEEEEccc--------CCCCcCCccceeEeecCCCceEEEEecC Q lcl|NC_011802. 126 NWTADSGFTQYELGSVRDITRLRGRYAWSKD--GTDSWFITDLE--------DESHPDRYSAEYRAESQPDGIIGIGTWR 195 (472) Q Consensus 126 ~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~--g~~~~~iS~L~--------D~s~~~~~l~fatAE~~pD~iv~~~~~~ 195 (472) .|-.-.+| ++-|+|...|.+|... -.+.++.|.-. .+...+|+.++.-+..+++.|.-+++.+ T Consensus 313 ~~~~~~gy-------Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~~~~~i~~~~~~~ 385 (825) T protein:vir:73 313 AWNSVNGY-------PSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQDDDRIIYTYAGRQVNEIRHLIDVG 385 (825) T ss_pred CcccCCCC-------ccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCCCCCCCccEEEEEcCCcceeEEEEeecC Confidence 12222233 3447888888887521 22333333322 2223578999999999999998899986 Q ss_pred CEEEEEEcceEEEEEecCCC----CCccccceecccceeeeccccchhheecCceEEEEEeccccccEEE------EccC Q lcl|NC_011802. 196 DFIVCFGSSTIEYFSLTGAT----TVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGS 265 (472) Q Consensus 196 ~~l~lfG~~T~Evw~~tGa~----~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g 265 (472) .|++|.+. -| |..+|.+ +|......++ -..||.. -.-..++++++|++..+. .|+ ..++ T Consensus 386 -~L~~~t~~-~e-~~l~~~~~~~lTP~~~~~~~~----s~~g~~~-~~Pv~vg~~~~Fv~~~g~---~vre~~~~~~~d~ 454 (825) T protein:vir:73 386 -NLVALTSG-GE-YTISGDQNKVLTPSAFSFSSQ----GNNGSSN-VPPIAVANIALFIQEKGS---VVRDLAYSFDVDG 454 (825) T ss_pred -cEEEEecC-ce-EEEecCCCcccceeeEEEEee----eeecccc-ccceEeCCeEEEEeCCCC---eEEEEEEeeecCc Confidence 56666665 44 4556542 2222222222 2568854 456688999999987663 333 3466 Q ss_pred ccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeee Q lcl|NC_011802. 266 GQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAID 342 (472) Q Consensus 266 ~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~ 342 (472) |+++.+|-++ +.+++.. ..+.++|+++.+.++.+...| ..+.|+-....| -||.-.+ .+..+..| T Consensus 455 ~~~~dlt~~a-~hl~~~~------~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~~~q~v~--aW~~~~~---~g~v~~~~ 522 (825) T protein:vir:73 455 YQGTDLTILA-NHLFQKH------SIVDWSFCIVPYSSAFCIRDDGKLLVLTYLRDQQVF--AWAPQSS---AGKYESTC 522 (825) T ss_pred eeccchhhhh-HhhccCC------ceEEEEEcCCCceEEEEEecCCeEEEEEEeccccce--eeEEEec---CCcEEEEE Confidence 7777775332 4455432 356777888888888877876 467788554444 5888777 35788888 Q ss_pred EeecCCe---EEEEEc-cCC----eEEEEcCCccCCCCCEEEEEEeeccccCCCceEEEEEEEEEcCCC-C-Cchh-hee Q lcl|NC_011802. 343 FMYEGNQ---ITCGDK-SEA----VTGQLQFDISSQYDKQQEHLLFTPIFKADNARCFDLEVESSTGVA-Q-YADR-LFL 411 (472) Q Consensus 343 ~~~~~g~---~~vGD~-~~g----~l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~~~le~~~Gv~-~-~~~~-~~l 411 (472) ++..++. +++=.+ .+| .|-+|+....++..+- +.+-+ -+.+++.-.. ..++-..|-. + .++- +-+ T Consensus 523 ~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~~~~~~~~~~--~~vD~-g~~~~g~~~~-~~l~~l~g~tv~~~~~g~~~~ 598 (825) T protein:vir:73 523 SISEGSEDAVYFVVNRTINGQTVRYIERLSSRLFTNDEDA--FFVDC-GLSYDGRNTS-SRTMTISGGTGDWSYQVDYPV 598 (825) T ss_pred EecCCCccEEEEEEEEeeCCceEEEEEEecccccCCCcce--eEEEE-Eeeeccccee-eceeeeCCceEEEEeCCeEEE Confidence 8865431 211111 111 1222333222222110 00000 0000000000 0011111100 0 0000 000 Q ss_pred eeccCccccCcceeeccCCCccc---------ceeEEEEeeEec-ccceeEEEEEEecCcceEEEeEEEe---------- Q lcl|NC_011802. 412 SATTDGINYGREQMIEQNEPFVY---------DKRVIWKRVGRI-RRLIGFKLRVITKSPVTLSGCQIRL---------- 471 (472) Q Consensus 412 ~~sdDG~~~~~~~~~~~g~~g~~---------~~r~~~~rlG~~-r~~v~f~~r~~~~~~~~l~~~~~~~---------- 471 (472) -.++.-++...+.....|=+... ... ...+.+++ +.++ .+++.-...|-.+-+|.+.- T Consensus 599 ~v~~g~itl~~~~~~~i~l~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~-v~v~~~~~~~a~~~~~~~t~~~~a~~~~~g 676 (825) T protein:vir:73 599 TVSGGAYFVNTDVGAQIQFPYTGTDPDTNEPVAKE-LRGDIISVTSNTA-VVVRFNRNVPPVLRNVATTNWQMARQTFSG 676 (825) T ss_pred EEcCCeEEecccceEEEEecccCcccccccceece-eeEEEccccCceE-EEEEecccccceeeeecccCCCcchheecc Confidence 00111112222211111111100 000 01111111 1111 22333233333333333221 Q ss_pred ----C Q lcl|NC_011802. 472 ----E 472 (472) Q Consensus 472 ----e 472 (472) | T Consensus 677 L~hLe 681 (825) T protein:vir:73 677 LAHLE 681 (825) T ss_pred ccccC Confidence 1 No 22 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=86.32 E-value=0.046 Score=27.89 Aligned_cols=443 Identities=10% Similarity=0.060 Sum_probs=162.3 Q ss_pred CceeeeeecccCcccc-----------------ccCCee-EEee--eeeee---------cccccCcccceeEcCCCcee Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDF-----------------KNADYI-DYLP--INMLA---------TPKEVLNSSGYLRSFPGIAK 51 (472) Q Consensus 1 M~~~~vPl~~G~~~~~-----------------~~~d~~-~~~p--vn~~~---------~~~e~~~s~~~Lrs~PGl~~ 51 (472) .+....|-........ ...++. +..+ +..++ +.+.+....+.....--... T Consensus 174 ~a~~~~p~gt~~~~~~~~~~~~ia~~L~~~l~~~~~~~t~~~~~~~~~i~a~~~~~~~~~t~~~g~~~t~~~~~~~~~~~ 253 (794) T protein:vir:22 174 VAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQS 253 (794) T ss_pred ceEEEEcCCCccccceeechhhhhhhhhhhheeccccceEEeCCceEEEEEcCCceEEEEeeecccCcceeEEEEecccc Confidence 2222222221111000 000000 0000 00000 00011111110000000011 Q ss_pred eeecCCCcc-ceeeeecc---C--eEEEEECc---ceeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccch Q lcl|NC_011802. 52 RNDVNGISR-GVEYNTAQ---N--AVYRVLGS---KLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVK 122 (472) Q Consensus 52 ~~~v~g~~r-g~~~~~~~---~--~lY~V~G~---~LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~ 122 (472) +++++.-++ |......+ . .-|.|... ..|+-...-|...+-..-.|.+. +.-..+|+ |...-... T Consensus 254 ~~~lp~~~~~G~~v~i~~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~---lv~~~~~~---~~~~~~~w 327 (794) T protein:vir:22 254 FSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHA---LVRAADGN---FDFKWLEW 327 (794) T ss_pred ceeccccCCCCeEEEEEeCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeE---eeeccCCc---EEEeeccc Confidence 122221111 11111111 0 11322222 22221111111112112223321 11112222 22221111 Q ss_pred hhccccc-----cccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCc Q lcl|NC_011802. 123 TVSNWTA-----DSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDG 187 (472) Q Consensus 123 t~s~~~~-----d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~ 187 (472) ....+.. .+.|.+. .+.+|+|..+|++|.. .+.++.|.-.|. ...+|+.+++.+-.+++. T Consensus 328 ~~r~~Gd~~tnp~psf~g~---~i~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ss~~~~~ 402 (794) T protein:vir:22 328 SPKSCGDVDTNPWPSFVGS---SINDVFFFRNRLGFLS--GENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAI 402 (794) T ss_pred cccccCccccCCcceecCC---CcceEEEEcceEEEec--CCeEEEEccCCccccccccCcCCCCCccEEEEecCCccee Confidence 1111111 1223232 2578899999998875 334555544322 224789999999999999 Q ss_pred eEEEEecCCEEEEEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEE---- Q lcl|NC_011802. 188 IIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI---- 262 (472) Q Consensus 188 iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~---- 262 (472) |.-++++++.|+||.+..- |..+|+. ++....-.--.+ ..+|...-.-..++++++|+++.+.. -++++ T Consensus 403 i~~~v~~~~~L~i~t~~~e--~~l~~~~---~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~-~~~~r~~~~ 476 (794) T protein:vir:22 403 LKYAVPFSEELLIWSDEAQ--FVLTASG---TLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSF-TSIHRYYAV 476 (794) T ss_pred eEEEeecCCcEEEEecCcE--EEEeCCC---cccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCe-eEEEEeEee Confidence 9999999999999976665 6667642 244433211112 35888888888999999999987632 22322 Q ss_pred ---ccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC-eEEEEe---cccccCchheeeeccCccc Q lcl|NC_011802. 263 ---IGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR-HVLVYD---ASSSQNGPQWCVLKTGLYD 335 (472) Q Consensus 263 ---~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~-~Tw~yD---~~t~~w~~~w~~~~tg~~~ 335 (472) .++|+++.+| .-++..|+.- + -.+.+++.+.+.++....-+ .-.+|- ....+..--||.-.++ T Consensus 477 ~~~~d~y~~~Dlt-~~~~~~~~~~----~--~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~--- 546 (794) T protein:vir:22 477 QDVSSVKNAEDIT-SHVPNYIPNG----V--FSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFG--- 546 (794) T ss_pred ecccCceehhhHH-HHHHHhcCCc----e--EEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcC--- Confidence 4567777773 3455666433 1 22455666666555555544 433433 2222222248877763 Q ss_pred cceEeeeEee-cCCeEEEEEcc-CCeEEEEc--CCccCCCCCEEEEEEee------c--cccCCCceEEEEEEEEEcCCC Q lcl|NC_011802. 336 DVYRAIDFMY-EGNQITCGDKS-EAVTGQLQ--FDISSQYDKQQEHLLFT------P--IFKADNARCFDLEVESSTGVA 403 (472) Q Consensus 336 ~~~R~~~~~~-~~g~~~vGD~~-~g~l~~ld--~~~~~d~g~p~~~~~~t------P--~~~~~~~r~~~~~le~~~Gv~ 403 (472) +..+..|+.. .+-.+++-... ++.+-+++ .+..+..++|....+-. | ..++ ...+-.+.+....|.. T Consensus 547 g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~g~~~~-~~~~t~~~~~~~~g~~ 625 (794) T protein:vir:22 547 ENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYND-DTFTTSIHIPTIYGAN 625 (794) T ss_pred CCEEEEEEEecCCEEEEEEEeCCCEEEEEEEEeeccccCCCccceeeeeeeEEEeeccceeec-CCcceEEEcccccCcc Confidence 3445444432 34445555433 33444433 12222223332211100 0 0000 0000011111111111 Q ss_pred CCchhheeeeccC--------------------------------ccccCcceeec------c-C-CCccc--ceeEEEE Q lcl|NC_011802. 404 QYADRLFLSATTD--------------------------------GINYGREQMIE------Q-N-EPFVY--DKRVIWK 441 (472) Q Consensus 404 ~~~~~~~l~~sdD--------------------------------G~~~~~~~~~~------~-g-~~g~~--~~r~~~~ 441 (472) .-.-+.. ..-.| |-.|..+.... . | ..+.. .-|++.+ T Consensus 626 ~~~g~~v-~~~~dg~~~~~~~~~~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~ 704 (794) T protein:vir:22 626 FGRGKIT-VLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLR 704 (794) T ss_pred cccceEE-EEEcCCceeeceeeeeeeeccceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCCccceeeecceEEEE Confidence 0000000 01111 22222211110 0 0 00000 1122222 Q ss_pred eeE-ecccceeEEEEEEecCc---ceEEEeEEEeC Q lcl|NC_011802. 442 RVG-RIRRLIGFKLRVITKSP---VTLSGCQIRLE 472 (472) Q Consensus 442 rlG-~~r~~v~f~~r~~~~~~---~~l~~~~~~~e 472 (472) |.= .+.+--+|++++..+.+ +.+.+..+... T Consensus 705 r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~g~~ 739 (794) T protein:vir:22 705 RAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSN 739 (794) T ss_pred EEEEEeccccceEEEEcCCCcccceeecCceeccc Confidence 211 00111134444433221 11111111100 No 23 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=85.21 E-value=0.055 Score=27.51 Aligned_cols=447 Identities=10% Similarity=0.033 Sum_probs=156.2 Q ss_pred CceeeeeecccCccc-------cccCCeeEEeeeeeeecccccCcccceeEcCCCceeeeecCCCccceeeeeccCeEEE Q lcl|NC_011802. 1 MPIQQLPMMKGMGKD-------FKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVEYNTAQNAVYR 73 (472) Q Consensus 1 M~~~~vPl~~G~~~~-------~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~~~~~~~~lY~ 73 (472) =....|.+.++.... +......+--.....+...........+.. ..++..+.......+-.+....+.+|. T Consensus 160 ~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia-~~l~~~~~~~~~~~~~~~~~~~~~~~i 238 (808) T protein:vir:88 160 GRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIA-AELARQLTVSLGGSGWSFQAGTGWILI 238 (808) T ss_pred CceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccch-hhheeeeeecccccceEEEeccceEEE Confidence 111111121110000 000000000000000000000000000000 000000000000000000000111111 Q ss_pred EECc--ceeeeeeeEEcccCceeEE-------------EEcCCcEEEEEEC----CceeEEEEeccch------------ Q lcl|NC_011802. 74 VLGS--KLYKGETVVGDVAGSGRVS-------------MAHGRTSQAVGVN----GQLVEYRYDGTVK------------ 122 (472) Q Consensus 74 V~G~--~LY~v~~~iGtv~gsg~Vs-------------Ma~Ng~~~~iv~~----g~~~~Y~~d~~~~------------ 122 (472) +... +.....+.-|. +++.... .+.+|..+.|... +..++++|+...+ T Consensus 239 ~~~a~~~~~~~~t~~g~-~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~ 317 (808) T protein:vir:88 239 NAPANDNVRQIATKDGY-ADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESARSGDNYWVQYDASGKVWKETAKPKIIA 317 (808) T ss_pred EeccCceeEEEcccCCc-CcceeeeeeeeccceeeccccCCCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeeecccee Confidence 1100 00000000000 0000000 0001111111100 0011111111110 Q ss_pred -----------------hhc----cc------cc-cccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCC----- Q lcl|NC_011802. 123 -----------------TVS----NW------TA-DSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----- 169 (472) Q Consensus 123 -----------------t~s----~~------~~-d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----- 169 (472) ++. .| .. .+.+|...-..+.+|+|..+|++|.. .+.++.|.-.|+ T Consensus 318 ~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~ 395 (808) T protein:vir:88 318 GFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLGFLS--GENVVMSRTSKYFNFFP 395 (808) T ss_pred eecccceeEEEEecCCceEEEEecccccccccccccCccceecCCceeEEEEEcceEEEee--CCeEEEEeccCcccccC Confidence 000 00 00 01121111234678999999999875 344655644432 Q ss_pred -----CCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheec Q lcl|NC_011802. 170 -----SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPF 243 (472) Q Consensus 170 -----s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~ 243 (472) ...+|+.+++.+-.+++.|.-++++++.|+||.+..- |..+|+. ++....-.--.+ ..||...-.-..+ T Consensus 396 ~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e--~~l~~~~---~lTP~~~~~~~~s~~~~~~~~~Pv~v 470 (808) T protein:vir:88 396 SSVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQ--FVLSSKT---ILSSKTIELDLTTEFDVSDGARPYGI 470 (808) T ss_pred CcccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcE--EEEeCCC---cccceeEEEEEEEEecccCCCCceEe Confidence 1247899999999999999999999999999976655 7777642 244333221222 4699888888899 Q ss_pred CceEEEEEeccccccEEEE-------ccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEE----------EEEE Q lcl|NC_011802. 244 ADSYAFISHPATGAPSVYI-------IGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHE----------LLII 306 (472) Q Consensus 244 ~~s~~wl~~d~~g~~~V~~-------~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~----------fy~l 306 (472) +++++|+++.+..- .|+| .++|+++.+|- -++..|+.-- ..+++++.+.+. +|.+ T Consensus 471 G~~v~f~~~~g~~~-~v~r~~~~~~~~d~y~~~dlt~-~~~h~~~~~~------~~~~~~~~~~~~~v~~~~~~g~l~~~ 542 (808) T protein:vir:88 471 GRGVYFAAPRASFT-SLKRYYAIQDVSDVKSAEDVSA-HVPSYITNTV------HAIHGSGTENFVSILSDGSPNKVFIY 542 (808) T ss_pred CCeEEEEecCCCee-EEEEEEEeeeccCceehhhHHH-HHHHhcCCCe------EEEEEeCCCCeEEEEEEcCCCEEEEE Confidence 99999999886321 2222 45667777643 3455554321 122333344433 3333 Q ss_pred EC----CCe------EEEEecc--------cccCchheeeecc--Cccccce----------------Eeee-------- Q lcl|NC_011802. 307 HL----PRH------VLVYDAS--------SSQNGPQWCVLKT--GLYDDVY----------------RAID-------- 342 (472) Q Consensus 307 t~----P~~------Tw~yD~~--------t~~w~~~w~~~~t--g~~~~~~----------------R~~~-------- 342 (472) ++ +.+ .|-++.+ ....-+-|..-+- +.+.+|. ...+ T Consensus 543 ~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~g~ 622 (808) T protein:vir:88 543 KFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQHTIDYSIEPYRTYMDMKKTIVLGA 622 (808) T ss_pred EEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccCCCCCccccceeeeeeeeeecccc Confidence 33 210 1211110 0000000211110 0111110 0000 Q ss_pred ---------------------------EeecCCeEEEEEccCC---eEEEEcCCccC---CCCCEEEEEEeec-c-ccCC Q lcl|NC_011802. 343 ---------------------------FMYEGNQITCGDKSEA---VTGQLQFDISS---QYDKQQEHLLFTP-I-FKAD 387 (472) Q Consensus 343 ---------------------------~~~~~g~~~vGD~~~g---~l~~ld~~~~~---d~g~p~~~~~~tP-~-~~~~ 387 (472) .+-.+|..+.+|..+. .-..+..+... .=|-++++.++++ + +... T Consensus 623 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~ 702 (808) T protein:vir:88 623 YNIDTNLTSFDVRTAYGGTPGPESTFYTIDQQGVLIEHEARDWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLIKQT 702 (808) T ss_pred ccCccccceeecccccccccccceeEEEEcCCceEEeeecccccCcceEEeCCCccCceEEEeeeeeEEEEecceEEecC Confidence 1111121222221111 01111111100 0155556666552 2 2211 Q ss_pred -C-----------ceEEEEEEEEE-cC-----CCC-CchhheeeeccCccccCcceeeccCCCcccceeEEEEeeEeccc Q lcl|NC_011802. 388 -N-----------ARCFDLEVESS-TG-----VAQ-YADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRR 448 (472) Q Consensus 388 -~-----------~r~~~~~le~~-~G-----v~~-~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~~~~rlG~~r~ 448 (472) + .||....+.+. +| |.. ..+.+ -...|..++.+.+. |.+-.+..-+++.-.|..++ T Consensus 703 ~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~tg~~~vp~~~~~~~ 777 (808) T protein:vir:88 703 ADDGSTSTEDIGRLQLRRAWLNYEESGAFEINVNNGSSEFV---YVMTGGRLGIQRVL--GELSVGTGQFKFPVTGNAVN 777 (808) T ss_pred CCCcceeecccceEEEEEEEEEeecccceEEEeCCCcccce---eeccCcccCccccc--CccccccceEEEEecccCce Confidence 1 13332222221 11 111 11111 12246666655432 22222222223333354332 Q ss_pred ceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 449 LIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 449 ~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .+|++....|.++...++..| T Consensus 778 ---~~v~i~~d~P~P~tilsi~~e 798 (808) T protein:vir:88 778 ---QRVTITSSNPNPLNVIGCGWE 798 (808) T ss_pred ---eEEEEEECCCCceEEEEEEEE Confidence 345666777777777777777 No 24 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=80.83 E-value=0.091 Score=26.28 Aligned_cols=415 Identities=12% Similarity=0.002 Sum_probs=164.7 Q ss_pred CceeeeeecccCcc-ccccCCeeEEeeeeeeecccccCcccceeEcCCCcee--eeecCCCcccee--eee-c---cCeE Q lcl|NC_011802. 1 MPIQQLPMMKGMGK-DFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAK--RNDVNGISRGVE--YNT-A---QNAV 71 (472) Q Consensus 1 M~~~~vPl~~G~~~-~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~--~~~v~g~~rg~~--~~~-~---~~~l 71 (472) .+...+=.. |.+- -..... ..++.. .+.. . +...=+... ..+++..++... +-. . ++.- T Consensus 211 ~~~~~~~~~-g~~~~i~~~~~----~~~t~~----~g~~--~-~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~~~~~~ 278 (777) T protein:vir:80 211 AAGFAYYQD-GAYLYVTAPEA----IAVSTD----SGSN--F-LRASNAASIRDAAELPAKLPADADGFIIATGAAKNKT 278 (777) T ss_pred cCceEEEeC-CcEEEEEecCc----eeEecC----CcCc--c-ceeeeeEEEeeccccccccccccceEEEeCCCCCCce Confidence 222111111 1110 000000 000000 0000 0 000000000 011111111100 000 0 0011 Q ss_pred EE--EECcceeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhhc--ccc---ccccccCCccccccee Q lcl|NC_011802. 72 YR--VLGSKLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVS--NWT---ADSGFTQYELGSVRDI 144 (472) Q Consensus 72 Y~--V~G~~LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s--~~~---~d~~f~~~~~~~~~dv 144 (472) |+ ..++..++-....|.+.+- ..|-+ .++..++ -|.++....... ... ..+.|.+.. +.+| T Consensus 279 y~~~~~~~~~w~e~~~~~~~~~~--~t~p~----~l~~~~~---~~~~~~~~w~~r~~gd~~tn~~Psf~g~~---i~~v 346 (777) T protein:vir:80 279 YFRWVDLERKWDEDASRGAQAEL--IDMPL----RITYSAP---NFSLTALNYERRASGDATSNPALKFTEQG---ISGM 346 (777) T ss_pred EEEEEccCcEEEEeecccccccc--cccce----EEEecCC---ceEeeccCCccccccccccCCCceecCCc---eeEE Confidence 11 1122222212223332221 11211 1112122 122221111110 000 122333332 4678 Q ss_pred eeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCC Q lcl|NC_011802. 145 TRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGA 214 (472) Q Consensus 145 ~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa 214 (472) +|..+|++|.. .+.++.|.-.|+ ...+|+.+++-+-.+++.|.-++++++.|+||.+..- |..+|. T Consensus 347 ~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e--~~l~~~ 422 (777) T protein:vir:80 347 TTMQGRLVLLA--GEYVCMSASGNPLRWFRASVSTQSDDDPIEVAATAPVASPYEYAVAFNKDLVLFAKTHQ--GLVPGA 422 (777) T ss_pred EEEcceeeeec--CCeEEEEeccCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCce--EEEeCC Confidence 99999999875 334555644332 2358899999999999999999999999999977766 677764 Q ss_pred CCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEE-------ccCccceecCCHHHHHHHhhcCch Q lcl|NC_011802. 215 TTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI-------IGSGQASPIATASIEKIIRSYTAD 286 (472) Q Consensus 215 ~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~-------~~g~q~~rIST~~iE~~i~~y~~~ 286 (472) + ++....-.--.+ ..+|...-.-..++++++|+++.+..--.|++ .++|+++.+|-| ++..|+. T Consensus 423 -~--~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~~~~~~~~d~y~a~Dlt~~-~~hl~~~---- 494 (777) T protein:vir:80 423 -N--LLTSRNATAAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDSTSH-LPKYIAG---- 494 (777) T ss_pred -C--cccceeEEEEEEEeeccCCCCCceEeCCeEEEEecCCCceeEEeeeeecccccCceehhHHHHH-HHHhcCC---- Confidence 2 233332211112 35788777778999999999864311112333 356777777433 4556643 Q ss_pred hhccEEEEEEEeCCEEEEEEECCC-eEEEEec---ccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCeEEE Q lcl|NC_011802. 287 ELATGVMEALRLDSHELLIIHLPR-HVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQ 362 (472) Q Consensus 287 e~~~A~~~~~~~~GH~fy~lt~P~-~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~l~~ 362 (472) .....+|+.+-+.+..+..-+ .-+||-- .-.+..--||.-.++ +..+..|++ .+--+++=..+++ ++. T Consensus 495 ---~v~~~a~s~~p~~v~~~~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~---g~v~~v~~i-~d~l~~iv~r~~~-~~l 566 (777) T protein:vir:80 495 ---PVRFLATSSTTSIVVVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFP---QDITGAYFR-GDRLILLFHVAGR-VIL 566 (777) T ss_pred ---ceEEEEEcCCCceEEEEEcCCCeEEEEEEeecCCceEEEeeEEeccC---CcEEEEEEE-CCEEEEEEEcCCe-EEE Confidence 255667777777777766664 4444321 111111148777663 467776665 4444555444322 222 Q ss_pred EcCCccCCCCC---EEEEEEee----ccccCCCceEEEEEEEEEcCCC----CCchhheeee------------ccCccc Q lcl|NC_011802. 363 LQFDISSQYDK---QQEHLLFT----PIFKADNARCFDLEVESSTGVA----QYADRLFLSA------------TTDGIN 419 (472) Q Consensus 363 ld~~~~~d~g~---p~~~~~~t----P~~~~~~~r~~~~~le~~~Gv~----~~~~~~~l~~------------sdDG~~ 419 (472) --.+...+.+. +..++... -..+.... +- ..+.+ .....+.+.. .-+|.+ T Consensus 567 e~~~~~~~~d~~~~~~~~~D~~~~~~~~~~~~~~------~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~v~~~~~~ 639 (777) T protein:vir:80 567 GELFMQRLGDAQSIPGGFLDLYRVGAANADEEVA------IP-AFAADLYPEDSTFAYKLSGEFQSLGQRCGDRRVDGAT 639 (777) T ss_pred EEEeeccCCCCcccceeeeeeeeeeeeeeCCccc------ee-EeeccccCCcceeEEEecCcccccceeeeeEEeCCce Confidence 11122222221 11111100 00000000 00 00000 0000011111 112211 Q ss_pred cCcceeeccCCCc-------ccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 420 YGREQMIEQNEPF-------VYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 420 ~~~~~~~~~g~~g-------~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) + +-.++++.++ .|..++.+-+.-. +..=| +-....|+-|..++++++ T Consensus 640 ~--~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~-~~~~g---~~~~~~r~~i~r~~~~~~ 693 (777) T protein:vir:80 640 V--YIKVVGAQAGDQYRIGLRYLSKLGPTRPIL-RDPNG---VPITTERTQLHRLTWSLD 693 (777) T ss_pred e--eEEEcCCCCCCEEEEeeeeEEEEEeCceEE-eCCCC---ceeeecCeEEEEEEEEee Confidence 1 1112222221 1222332222110 00001 111224556777888887 No 25 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=76.55 E-value=0.13 Score=25.36 Aligned_cols=443 Identities=11% Similarity=0.055 Sum_probs=166.2 Q ss_pred CceeeeeecccCcccc--cc-CC--eeEEeee----eeeecccccCcccceeEcCCCceeeeecCCCc-cceeee----- Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDF--KN-AD--YIDYLPI----NMLATPKEVLNSSGYLRSFPGIAKRNDVNGIS-RGVEYN----- 65 (472) Q Consensus 1 M~~~~vPl~~G~~~~~--~~-~d--~~~~~pv----n~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~-rg~~~~----- 65 (472) |..-.. ..+..++. .. +. ++.+.+- .+....+........... -...+.+|+.-+ +|.... T Consensus 198 l~~~~~--~~~s~a~~~~~~~g~~~~i~~~~~~~~~~~~t~~g~~~~~~~~~~~--~v~~~~~Lp~~~~~g~~v~v~~~g 273 (803) T protein:vir:70 198 LYQSLQ--SWDKIADYEIQLDGTSIYITRRDGSTTFDITTEDGAKGKDLVAIKY--KVASTDLLPSRAPEGYKVQVWPTG 273 (803) T ss_pred hhhhee--ccccccceEEEECCcEEEEEEcCCCCeeEEEeecCcCCcEEEEEEe--cccceeeccccCCCCceEEEEcCC Confidence 211110 00111111 11 11 1111110 010011001100000000 011112221111 111000 Q ss_pred eccCeEEEEECc------ceeeeeeeEEc----ccCceeEEEEcCCcEEEEE-ECCceeEEEEeccchh----hcccccc Q lcl|NC_011802. 66 TAQNAVYRVLGS------KLYKGETVVGD----VAGSGRVSMAHGRTSQAVG-VNGQLVEYRYDGTVKT----VSNWTAD 130 (472) Q Consensus 66 ~~~~~lY~V~G~------~LY~v~~~iGt----v~gsg~VsMa~Ng~~~~iv-~~g~~~~Y~~d~~~~t----~s~~~~d 130 (472) ..+..-|.|.-. .-++-....|. ...+.|..+.+- ..+ ..+.......+..... .+| .. T Consensus 274 ~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t~p~~~v~~----~~~~~~~~~~~~~~~~~~r~~gdd~tn--p~ 347 (803) T protein:vir:70 274 SKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKSTMPYIIERT----GFVNGIAQFKIRQGDWEDRKVGDDLTN--PM 347 (803) T ss_pred CCCCceeeEEEEeccCCccceEeeeccceeeeeecccccEEEEEE----EEeecceeEEEEeeccccccccccccC--cc Confidence 011112221100 11211111111 112222222110 000 0111111112221111 111 23 Q ss_pred ccccCCc-ccccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEE Q lcl|NC_011802. 131 SGFTQYE-LGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIV 199 (472) Q Consensus 131 ~~f~~~~-~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~ 199 (472) +.|.+.. ...+.+|+|..+|++|.. .+.++.|.-.|. ...+|+.+++.+-.+++.|.-++++++.|+ T Consensus 348 psf~~~~~~~~~~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~ 425 (803) T protein:vir:70 348 PSFIDEEVPQTLGGMFMVQNRLCVTA--GEAVIATRTSYFFDFFRYTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTV 425 (803) T ss_pred ccccCccCCCCceeEEEEeceEEEee--CCeEEEEccCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEE Confidence 3344332 234788999999999875 345555644432 124789999999999999999999999999 Q ss_pred EEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEE------ccCccceecC Q lcl|NC_011802. 200 CFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI------IGSGQASPIA 272 (472) Q Consensus 200 lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~------~~g~q~~rIS 272 (472) ||.+..- |..+|+. ++....-.--.+ ..+|...-.-..++++++|+++.+. --.|+. -++|+++.+| T Consensus 426 i~T~~~q--~~l~g~~---~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~-~s~vre~~~~~~~d~y~a~Dlt 499 (803) T protein:vir:70 426 LFADKSQ--FILPGDK---PLEKSNVLLKPVTTFEVNNNVKPVATGESVMFATSEGA-YSGIREFYTDSYSDTKKAQAIT 499 (803) T ss_pred EEecCcE--EEEeCCC---cccceeEEEEEEEEeeccCCCccEEeCCeEEEeccCCC-eeEEEEEeccccccceehhhhh Confidence 9976655 7777642 244332211112 3588887778899999999998862 122433 3667777774 Q ss_pred CHHHHHHHhhcCchhhccEEEEEE-EeCCEEEEEEECC-CeEEEEec---ccccCchheeeeccCccccceEeeeEeecC Q lcl|NC_011802. 273 TASIEKIIRSYTADELATGVMEAL-RLDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEG 347 (472) Q Consensus 273 T~~iE~~i~~y~~~e~~~A~~~~~-~~~GH~fy~lt~P-~~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~ 347 (472) -| ++.+|+. . .+..++ +.+.+.++....- +.-.||-- .-.+..--||.-.+ ....+..|+++.+ T Consensus 500 ~~-a~hl~~~----~---v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v~aW~r~~~---~g~~~~~~~~~~~ 568 (803) T protein:vir:70 500 SH-VNKLLEG----N---VIMMSASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQAAWHKWEW---PLGTFIRGMFYSG 568 (803) T ss_pred hh-hHhhcCC----c---eEEEEEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEEc---CCCEEEEEEEecC Confidence 33 4555543 1 222233 3344433333322 33334432 22222124887777 3456666766533 Q ss_pred -CeEEEEEcc-CCe-EEEEcCCccCCCCCEEEEEEe---ecc---ccCCCc-----------eEEEEEEEEEcCCCCCch Q lcl|NC_011802. 348 -NQITCGDKS-EAV-TGQLQFDISSQYDKQQEHLLF---TPI---FKADNA-----------RCFDLEVESSTGVAQYAD 407 (472) Q Consensus 348 -g~~~vGD~~-~g~-l~~ld~~~~~d~g~p~~~~~~---tP~---~~~~~~-----------r~~~~~le~~~Gv~~~~~ 407 (472) --|++-... +|. |-+|+....++.+.+....+- ++. ...... .+-.++.-+..|...... T Consensus 569 d~l~~vv~r~~~g~~ier~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 648 (803) T protein:vir:70 569 EHLYLLIERGSTGVYLERMDMGDALVYNLNDRIRMDRQAELIFRHIKAEDVWVSEPLPWQPTDVTLLDCVLIDGWDSYIG 648 (803) T ss_pred CEEEEEEEECCCeEEEEEEecccccccCCcceeEeccceeEeeccccCCceeeeecccccCcccceeeEEEeeeeeeecC Confidence 446666654 343 456776666555544332211 111 111110 111112112222222111 Q ss_pred h-heeeeccCccccCcc-e---------eeccCCCcccceeE---EE------------EeeEecccc----eeEEEEEE Q lcl|NC_011802. 408 R-LFLSATTDGINYGRE-Q---------MIEQNEPFVYDKRV---IW------------KRVGRIRRL----IGFKLRVI 457 (472) Q Consensus 408 ~-~~l~~sdDG~~~~~~-~---------~~~~g~~g~~~~r~---~~------------~rlG~~r~~----v~f~~r~~ 457 (472) . +.....+.+.++.-+ - .+-.|.+.+..-+. .. .|+.++.-+ -.|++++. T Consensus 649 ~~~~~~~~~g~~t~~~~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~~~~~~~rl~r~~~~~~~sg~~~v~v~ 728 (803) T protein:vir:70 649 GSFLFSYNPGDNTLTTTFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERVSYIDVPTVGLVHLNLDKYPDFKVEVK 728 (803) T ss_pred CeEEEEEcCCCccceeeeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCccccccccEEEEEEEEeecccceEEEEe Confidence 1 111111112222111 0 11122221111100 00 011111111 11333322 Q ss_pred ecCcce-----EEEeEE-----EeC Q lcl|NC_011802. 458 TKSPVT-----LSGCQI-----RLE 472 (472) Q Consensus 458 ~~~~~~-----l~~~~~-----~~e 472 (472) .....- .++..+ .++ T Consensus 729 ~~~~~~~~~~~~s~~~~g~~~~~~g 753 (803) T protein:vir:70 729 NLKSGKVRNVLASNRVGGAINNIVG 753 (803) T ss_pred cCCccccceeeccchhccccccccC Confidence 211110 000000 000 No 26 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=75.55 E-value=0.15 Score=25.17 Aligned_cols=437 Identities=10% Similarity=0.065 Sum_probs=146.0 Q ss_pred Cceeeeeeccc-CccccccCCeeEEeeeeeeecccccCcccceeEcC-CCceeeeecCCCccceeee--ec-----c-Ce Q lcl|NC_011802. 1 MPIQQLPMMKG-MGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSF-PGIAKRNDVNGISRGVEYN--TA-----Q-NA 70 (472) Q Consensus 1 M~~~~vPl~~G-~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~-PGl~~~~~v~g~~rg~~~~--~~-----~-~~ 70 (472) -+.+.+-..++ .+-....+ -.+++ +..++..... +.+. .=....++|++.++- +|. ++ . +. T Consensus 317 ~~~~~~~~~g~~i~v~~~~~-----~~~~~--~~~~g~~~~~-~~~~~~~v~~~~~Lp~~~~~-g~~v~v~~~~~~~~d~ 387 (905) T protein:vir:78 317 ISNYSAQAVGNVIEIERTDG-----RDFNL--GVRGGATNRA-MTAIKGTANSIVDLPGQCFD-GFELKVINTENAESDD 387 (905) T ss_pred cccEEEEecCcEEEEEecCC-----CccEE--EEeccCCcce-EEEEeccccccccCccccCC-CcEEEEEeCCCCCcce Confidence 01111111100 00000000 00011 0000000000 0000 001111222222110 010 00 0 11 Q ss_pred EEE---EECc-----ceeeeeeeEEcccCceeEEEEc------CCcEEEEEECCceeEEEEeccchhhcc--cccccccc Q lcl|NC_011802. 71 VYR---VLGS-----KLYKGETVVGDVAGSGRVSMAH------GRTSQAVGVNGQLVEYRYDGTVKTVSN--WTADSGFT 134 (472) Q Consensus 71 lY~---V~G~-----~LY~v~~~iGtv~gsg~VsMa~------Ng~~~~iv~~g~~~~Y~~d~~~~t~s~--~~~d~~f~ 134 (472) -|+ -.++ .-|+=...-|.+.+-....|.+ +|...+...++..... +....+.-. .-..+.|. T Consensus 388 yyv~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~--~~~~r~~Gd~~Tnp~psf~ 465 (905) T protein:vir:78 388 YYVVFRSAAEGIPGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTIT--GWAQREVGDDDTNPKPSFV 465 (905) T ss_pred EEEEEEecccCCcCceeEEEecccccccccccccccEEEEEecCceEEEEEeccccccc--cccccccCCcccCCCCccc Confidence 111 0001 1111000011111111122222 2222222222211111 111111100 01233444 Q ss_pred CCcccccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEcc Q lcl|NC_011802. 135 QYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSS 204 (472) Q Consensus 135 ~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~ 204 (472) +. .+.+|+|..+|++|..| +.++.|.-.|. ...+|+.+++-+-.+++.|.-++++++.|+||... T Consensus 466 g~---~is~v~f~q~RL~f~s~--~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g 540 (905) T protein:vir:78 466 GR---GISDMFFYNNRLGFLSE--DAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAEN 540 (905) T ss_pred CC---CcceEEEEcceEEEecC--CeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecC Confidence 32 35789999999998753 34555543322 12478999999999999999999999999999887 Q ss_pred eEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEEc------cCccceecCCHHHH Q lcl|NC_011802. 205 TIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYII------GSGQASPIATASIE 277 (472) Q Consensus 205 T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~------~g~q~~rIST~~iE 277 (472) .- |..+|..+ .+....-.--.+ ..||...-.=..++++++|+++.+. -..|+.+ ++|+++.+|- -++ T Consensus 541 ~e--f~lsg~~~--~lTP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g~-~s~vre~~y~~~~d~y~a~DlT~-~a~ 614 (905) T protein:vir:78 541 SQ--FLLASQEV--VFSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEADT-YSKIFEMSIDSVDNRPQVADITR-IVP 614 (905) T ss_pred ce--EEEecCCc--cccceeEEEEeEEeecccCCCCcEEeCCeEEEeecCCC-eeEEEEEEeeecccceehhHHHH-HHH Confidence 76 77776432 243333111112 3588665555789999999998752 1124332 4566666633 344 Q ss_pred HHHhhcCchhhccEEEEEEEeCCEEE-EEEECCCeEEEEec---ccccCchheeeeccCccccceEeeeEeecCCeEEEE Q lcl|NC_011802. 278 KIIRSYTADELATGVMEALRLDSHEL-LIIHLPRHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCG 353 (472) Q Consensus 278 ~~i~~y~~~e~~~A~~~~~~~~GH~f-y~lt~P~~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vG 353 (472) ..|+.- +.+.++.+-+.+ +...-.+.-+||-- .-.+..--||.-.++ +..+..|.+...-.+++= T Consensus 615 hl~~g~--------v~~~~~s~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~---G~~~~~a~i~d~~~~vV~ 683 (905) T protein:vir:78 615 EYVPTG--------LTWSVSTPNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILP---GEQRMCGFFADTGYFVLY 683 (905) T ss_pred HhcCCc--------eEEEEecCCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecC---CCeEEEEEEcCCEEEEEE Confidence 454322 223333333332 22222345555432 122221248877662 355665555433333332 Q ss_pred EccCCeE--EEEcCCccCCCCCEEEE-EEeeccccCC----CceEEE-----EEEEEEcCCCCCchhheeeeccCccc-- Q lcl|NC_011802. 354 DKSEAVT--GQLQFDISSQYDKQQEH-LLFTPIFKAD----NARCFD-----LEVESSTGVAQYADRLFLSATTDGIN-- 419 (472) Q Consensus 354 D~~~g~l--~~ld~~~~~d~g~p~~~-~~~tP~~~~~----~~r~~~-----~~le~~~Gv~~~~~~~~l~~sdDG~~-- 419 (472) ...+|.. +.++....-+....-.. ....|.+..- ..-.++ ..+.+..|.....-++-+.+.|-+.. T Consensus 684 r~~~G~~~~~~~~l~~~~~~~~~d~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dG~~~~~ 763 (905) T protein:vir:78 684 DSTTGSYVLSAMELLDDPDSASIDTAFSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMTGATPVIMFTDGPSEFA 763 (905) T ss_pred EccCCeEEEEEEeeccccCccccccceeeeeeccceeeecccceecccCcceEeeeccCccccccceeEEEeeCCceeee Confidence 2223332 22221000000000000 0000100000 000000 00000011110000111111110000 Q ss_pred --------cCc----ceeeccCCCcccceeEEE----------------EeeEecccc----eeEEEEEEecCcceEEEe Q lcl|NC_011802. 420 --------YGR----EQMIEQNEPFVYDKRVIW----------------KRVGRIRRL----IGFKLRVITKSPVTLSGC 467 (472) Q Consensus 420 --------~~~----~~~~~~g~~g~~~~r~~~----------------~rlG~~r~~----v~f~~r~~~~~~~~l~~~ 467 (472) |-. ...+-.|.+ |..++.. .|++++.-| -+|++.+....+-..... T Consensus 764 ~~~~~~~~~~~t~~~a~~v~VGl~--Y~s~v~~~p~~~~~~~~s~~~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~~~ 841 (905) T protein:vir:78 764 FSQPTITAGQFTVDTTDDFVVGFK--YETKITLPGFFTSEENKADRVYAPIVEFLYLDLYYSGRYQIEVDRIGYDTINID 841 (905) T ss_pred EEEEEeeceeeccccCCeEEEeee--eeEEEeecceEeccCCCcccccceEEEEEEEEeecceeEEEEEcCCCcceeccc Confidence 000 000111111 1111111 122222211 123333222111111000 Q ss_pred EEEeC Q lcl|NC_011802. 468 QIRLE 472 (472) Q Consensus 468 ~~~~e 472 (472) .-..+ T Consensus 842 ~~~~~ 846 (905) T protein:vir:78 842 AGSID 846 (905) T ss_pred cccee Confidence 00000 No 27 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=62.51 E-value=0.33 Score=23.20 Aligned_cols=435 Identities=10% Similarity=0.002 Sum_probs=157.0 Q ss_pred CceeeeeecccCccccccCCee-EE-eeeeeeecccccCcccceeEcCCCceee---eec-CCCccceeeeeccCeEEEE Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYI-DY-LPINMLATPKEVLNSSGYLRSFPGIAKR---NDV-NGISRGVEYNTAQNAVYRV 74 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~-~~-~pvn~~~~~~e~~~s~~~Lrs~PGl~~~---~~v-~g~~rg~~~~~~~~~lY~V 74 (472) +....+=+..- .+.....|. ++ .....+-.. .. ..++..+.|-... ... +-...+..+....+....+ T Consensus 198 ~v~~~~~l~~~--~~~~~~~~~~~~~~g~~~~~~~--~~--~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~ 271 (768) T protein:vir:10 198 DVGTLFYLEQE--DNSFVKPWVVHQKIGPSELRRV--GD--RVYLCTAVGTATPQVTGTETPTHTSGSRWDGTGQDESAT 271 (768) T ss_pred hcceeeeeeee--ccccccccEEEEeeeeEEEEec--CC--ceEEeeeeccccccccceeccccccCceEEEecCccccc Confidence 11111100000 000000000 00 000000000 00 0000000000000 000 0000010000001100000 Q ss_pred ECcc----e--e-eeeeeEEcccCceeEEEEcCCcEEEEEECCceeEE-----EEecc--chhhccccccccccCCcccc Q lcl|NC_011802. 75 LGSK----L--Y-KGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEY-----RYDGT--VKTVSNWTADSGFTQYELGS 140 (472) Q Consensus 75 ~G~~----L--Y-~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y-----~~d~~--~~t~s~~~~d~~f~~~~~~~ 140 (472) .... . | +...-.+.|.+..--.|+++-. +..++..... ..... ......|....+| T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~t~~~~~~---~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~------- 341 (768) T protein:vir:10 272 DEYGSIGAEWEYQHSGYGTVLITGYTNDQVVTGTV---ATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGF------- 341 (768) T ss_pred ccccccceEEEEEEcCCceEEEEEecCCeeEEeee---eeecCcccccccccccccCCCcccccCCCcCCCCC------- Confidence 0000 0 0 0000011111111111222110 0001100000 00000 0001112222223 Q ss_pred cceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEE Q lcl|NC_011802. 141 VRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFS 210 (472) Q Consensus 141 ~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~ 210 (472) ++-|+|..+|++|.. .+.++.|.-.|+ ...+|+.+++-+-.+++.|--+++++ .|+||.+..- |. T Consensus 342 Ps~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~-~L~i~T~~~q--~~ 416 (768) T protein:vir:10 342 PQMGTFWRNRLCLMR--DRWLAMSVSADFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVESD-SLLIGMTGDE--WV 416 (768) T ss_pred ceEEEEEeeeEEEee--CCEEEEEcccccccccccccccccCCccEEEEecCCcceeEEEEeecC-cEEEEecCce--EE Confidence 355788889988865 334555544331 11378999999999999999999996 5888777755 76 Q ss_pred ecCCC-----CCccccceecccceeeeccccchhheecCceEEEEEecccccc---EEEEccCccceecCCHHHHHHHhh Q lcl|NC_011802. 211 LTGAT-----TVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAP---SVYIIGSGQASPIATASIEKIIRS 282 (472) Q Consensus 211 ~tGa~-----~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~---~V~~~~g~q~~rIST~~iE~~i~~ 282 (472) .+|++ +|......++ -..||.. -.-..++++++|++..+..-. .-+..++|+++.+|-+ ++.+++. T Consensus 417 l~~~~~~~~lTP~~~~i~~~----s~~g~~~-~~Pv~vG~~v~fv~~~g~~vre~~y~~~~d~y~a~DlT~~-a~hl~~~ 490 (768) T protein:vir:10 417 IGPANASQPVSAANLNAARR----TSYGSKR-IQPVQVGGTIMFVQKAGRKLRDFKYDFSSDNYVSTDVTKI-ADHITRG 490 (768) T ss_pred EecCCCCcccccceEEEEEe----ehhcccc-cccEEeCCeEEEEcCCCCEEEEEEeeeecCceecchhhhh-hhhhccc Confidence 76643 2222222332 2368853 344689999999998873211 1123577888888622 3444443 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccc-cCchheeeeccCccccceEeeeEeec-----CCeEEEE Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSS-QNGPQWCVLKTGLYDDVYRAIDFMYE-----GNQITCG 353 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~-~w~~~w~~~~tg~~~~~~R~~~~~~~-----~g~~~vG 353 (472) .. ..-+..+.+.|+.+.+.++.+-..| ..+.|+-..+ |..--||.-..+ +......|.+.. +--+++= T Consensus 491 ~~-~~~~~i~~~a~~~~p~~v~~~v~~dg~l~~~ty~~e~~~q~v~aW~~~~~~--~g~v~~v~~i~~~~g~~d~l~~~v 567 (768) T protein:vir:10 491 RA-GTNSGIMSLCFQQEPHSVVWAARADGQLIGCTYDEEAGRSDVYGWHRHPDA--NGFVECVASMPAPDGASDDLWVIV 567 (768) T ss_pred cC-ccccceeeEEEeecCCeEEEEEecCCeEEEEEEecCCCceeEEeEEEEEcC--CCEEEEEEEEecCCCCccEEEEEE Confidence 22 1223466777888888776666654 3566665432 211246655421 122233333211 0011110 Q ss_pred E-ccCCe----E----------------EEEcCCccCC------------------------------------------ Q lcl|NC_011802. 354 D-KSEAV----T----------------GQLQFDISSQ------------------------------------------ 370 (472) Q Consensus 354 D-~~~g~----l----------------~~ld~~~~~d------------------------------------------ 370 (472) . .-+|. + +.||....-+ T Consensus 568 ~r~~~g~~~~~ie~l~~~~~~~~~~~~~~~~D~~~~~~~~~~~~~~gl~~leg~~v~v~~dG~~~~~~~v~~g~itl~~~ 647 (768) T protein:vir:10 568 RRQVNGQTVRYVEYLNPALQDDEPQSSAFYVDAGITYNGVPTSTIAGLGHLEGVTVAVLTDGAVHPSRTVTAGAITLDWS 647 (768) T ss_pred EecCCCeEEEEEEecCcccccccccccceEeccccccCCcceeeecCCCCcccceEEEEECCEeccCceecCCEEEeCCC Confidence 0 11111 1 1122111100 Q ss_pred -----CCCEEEEEEeecccc--CC-------CceEEEEEEE--EEcCCCCCchhheeeeccCc--cccCcceee--ccCC Q lcl|NC_011802. 371 -----YDKQQEHLLFTPIFK--AD-------NARCFDLEVE--SSTGVAQYADRLFLSATTDG--INYGREQMI--EQNE 430 (472) Q Consensus 371 -----~g~p~~~~~~tP~~~--~~-------~~r~~~~~le--~~~Gv~~~~~~~~l~~sdDG--~~~~~~~~~--~~g~ 430 (472) =|-++++.+.++.+. .+ +.|+..+.|. -+.| +.+.-+++. ..+-..+.. .+|+ T Consensus 648 ~~~v~vG~~y~s~~~~~p~~~~~~~gs~~~~~~ri~r~~v~~~~S~~-------~~~~~~~~~~~~~~~~~r~~~~~~~~ 720 (768) T protein:vir:10 648 ASIVHIGVPTTCRIQTMQLNAGAANGTAQGKTKRVTNIATRFSRSLG-------GVVGPTFDDNDLEQLSFRKPSNAMDR 720 (768) T ss_pred CceEEEeEeeeEEEEecceEeecCCccccccceEEEEEEEEEecccc-------eEEEecCCCCCceeeeeEecCcccCc Confidence 011112222221111 00 0022111111 0000 000000000 000000110 0111 Q ss_pred C-cccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 431 P-FVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 431 ~-g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) + --+.-.+++.-.|...++.-++|+-..|-|..|.++..+++ T Consensus 721 ~~~l~TG~~~v~~~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~ 763 (768) T protein:vir:10 721 AVPLFDGDMESDWRGGYEGQSWICYQNDQPLPVTLLGFFPILD 763 (768) T ss_pred cCCcccCEEEEEecCCCCcceEEEEEECCCCCEEEEEEEEEEE Confidence 1 11222333333454455555677777777777777777777 No 28 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=58.52 E-value=0.41 Score=22.70 Aligned_cols=435 Identities=10% Similarity=0.107 Sum_probs=158.7 Q ss_pred Cceee---e--eecccCccccccCC-eeEEeeeeeeeccccc----Ccccc-eeEcCCC-ceeeeecCCCccceeeeecc Q lcl|NC_011802. 1 MPIQQ---L--PMMKGMGKDFKNAD-YIDYLPINMLATPKEV----LNSSG-YLRSFPG-IAKRNDVNGISRGVEYNTAQ 68 (472) Q Consensus 1 M~~~~---v--Pl~~G~~~~~~~~d-~~~~~pvn~~~~~~e~----~~s~~-~Lrs~PG-l~~~~~v~g~~rg~~~~~~~ 68 (472) ++... + -|...+-+.-.-.+ .+...+-.++.+-.++ .-..+ .++..-+ ...+.+|+..++ + T Consensus 364 ~~~~~~~~ia~~L~~~l~a~~~~~g~tv~~~g~~~~i~~~~~~~~~s~~~~~~~~~~~~~V~~~~~LP~~~~-------~ 436 (976) T protein:vir:10 364 ETAVTAESIIGDIRTAIIATGNFTSANVQQIGTGLYVTRPSGTFNVTAPSSDLLRVMSGEVANVDDLPSQCK-------H 436 (976) T ss_pred cccccHHHHHHHHHHhhcccccccceEEEEcCcEEEEEecCcceEecCCCceeEEEEEeeecchhhhhhhcc-------C Confidence 11000 0 00000000000000 0000111111100000 00000 0111111 111122322211 1 Q ss_pred CeEEEEECc--------------------ceeeeee----eEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhh Q lcl|NC_011802. 69 NAVYRVLGS--------------------KLYKGET----VVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV 124 (472) Q Consensus 69 ~~lY~V~G~--------------------~LY~v~~----~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~ 124 (472) +.+..|..+ ..|+=.. .+|--..+=|..+++.++-......- .|.--..-.+. T Consensus 437 g~~v~V~~~~~~~d~yyv~~~~~~~~~~~~~w~E~~~~g~~~g~~~~tmP~~l~~~~~g~f~~~~~---~w~~r~vGd~~ 513 (976) T protein:vir:10 437 GYVVKVANSEADADDYYVKFFGHNNRDGDGVWEECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQA---TWQNAEVGDEL 513 (976) T ss_pred CcEEEEecCCCCceeEEEEeeccccccccceEEEeeccccccccccccccEEEEecccCeEEeeec---cccccccCCcc Confidence 222222111 1121000 01111122234444332111100000 00000000111 Q ss_pred ccccccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEec Q lcl|NC_011802. 125 SNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTW 194 (472) Q Consensus 125 s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~ 194 (472) +| ..+.|.+- .+.+|+|..+|++|.. .+.++.|.-.|. ...+|+.+++.+-.+++.|.-++++ T Consensus 514 tn--p~psf~g~---~is~v~f~q~RL~f~s--~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~ 586 (976) T protein:vir:10 514 TN--PNPSFVGK---TINQLVFFRNRLVFLS--DENVIMSRPGEFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQV 586 (976) T ss_pred cC--cCceeccc---ccceEEEEcceEEEec--CCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEEec Confidence 22 23334333 2567899999999875 345666654432 1247899999999999999999999 Q ss_pred CCEEEEEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEEc------cCcc Q lcl|NC_011802. 195 RDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYII------GSGQ 267 (472) Q Consensus 195 ~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~------~g~q 267 (472) ++.|+||.+..- |..+|..+ ++....-.--.. ..+|...-.-..++++++|+++.+ +-..+|.+ +++. T Consensus 587 ~~~L~l~T~g~e--~~lsg~~~--~lTP~t~~i~~~s~~~~~~~v~Pv~vG~~v~Fv~~~g-~~~r~~~~~~~~~~~~~~ 661 (976) T protein:vir:10 587 NAGLLLFTKNQQ--FMLTTDSD--ILSPETAKINAVSSYNFNEKTHPVSLGTTVAFIDNAN-QFTRFFEMSNVVRQGEPD 661 (976) T ss_pred CCcEEEEecCce--EEEecCCc--eecceeEEEEEEEeeeccCCCccEEeCCeEEEEecCC-CeEEEEEEeecccccccc Confidence 999999987766 66677543 244322111112 357888888889999999998876 23334433 2233 Q ss_pred ceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEEC-CCeEEEEec---ccccCchheeeeccCccccceEeeeE Q lcl|NC_011802. 268 ASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHL-PRHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDF 343 (472) Q Consensus 268 ~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~-P~~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~ 343 (472) ++.+ |.-++..|+. . ....+++.+-+.+..... ++.-.||-- .-.+..--||.-.++ +..+..|+ T Consensus 662 ~~dl-t~~~~~l~~g----~---~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~eq~v~aWsr~~~~---G~v~sv~~ 730 (976) T protein:vir:10 662 VVDQ-SKVISRLLDK----N---ISLVSVSRENSVVFFSQKDTDKIYCFRYFTSGEKRLLQAWTTWTIT---GNIQYHCM 730 (976) T ss_pred hhHH-HHHhhhhcCC----c---eEEEEEcCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEecC---CcEEEEEE Confidence 3333 2223444432 1 223456666665444333 334444321 122221247776663 46666666 Q ss_pred eecCCeEEEEEc-cCCeEEEE----cCC----------ccCCCCCE----------------------EEEEEeecc--- Q lcl|NC_011802. 344 MYEGNQITCGDK-SEAVTGQL----QFD----------ISSQYDKQ----------------------QEHLLFTPI--- 383 (472) Q Consensus 344 ~~~~g~~~vGD~-~~g~l~~l----d~~----------~~~d~g~p----------------------~~~~~~tP~--- 383 (472) + .+--|++=-+ .+|.+.++ +.. ..++.+.+ ....+..|- T Consensus 731 i-~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~t~~t~~~~~~~~~ 809 (976) T protein:vir:10 731 L-DDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSSVTAASNTYNTTTIKTTIPKPNGYE 809 (976) T ss_pred e-CCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeeccceEEEeccccccCCceeEEeecCcccc Confidence 5 3333333221 22222211 100 00001111 000000000 Q ss_pred ----------------------------cc----CCCceE-----EEEEEEEE-------cCCCC--Cc-h-----hhee Q lcl|NC_011802. 384 ----------------------------FK----ADNARC-----FDLEVESS-------TGVAQ--YA-D-----RLFL 411 (472) Q Consensus 384 ----------------------------~~----~~~~r~-----~~~~le~~-------~Gv~~--~~-~-----~~~l 411 (472) +. .+...| +..++|+. .|-+. .+ - ++-+ T Consensus 810 ~~~~~~~~~~d~~~~~~~~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~ 889 (976) T protein:vir:10 810 STKQLVAYDTDAGNDLGRYALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKF 889 (976) T ss_pred CceeEEEEecccCcccccceeeeecCCeeEecCCCCCCeEEEeeeeEEEEeecceeEEeCCCCcccccceeeEEEEEEEE Confidence 00 000011 01111110 01100 00 0 0001 Q ss_pred e----------eccCccc-cC-cceeeccCCCcc-----ccee-EEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 412 S----------ATTDGIN-YG-REQMIEQNEPFV-----YDKR-VIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 412 ~----------~sdDG~~-~~-~~~~~~~g~~g~-----~~~r-~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) + -..+|.. |- .++....+...- .+.. .++--.|. +--.++.+....|.++...++..| T Consensus 890 ~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~pl~~~~~~~vP~~~~---~~~~~v~i~~d~PlP~tilsi~~e 965 (976) T protein:vir:10 890 SFGPLGVYSTTIQRDGKPDFTETKELGLAGVVGASRLPIVPEVIETVPCYER---NTNLKVNVKSEHPAPATLYSLAWE 965 (976) T ss_pred EeecccceEEEEcCCCCccccccccccccCcccccccceecCcEEEEEeccC---CceeEEEEEECCCCceEEEEEEEE Confidence 0 0111110 11 000000000000 0000 11111121 122455666677777777777777 No 29 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=47.09 E-value=0.71 Score=21.39 Aligned_cols=442 Identities=10% Similarity=0.066 Sum_probs=153.6 Q ss_pred CceeeeeecccCcccc-ccCCeeE-----Eeee--------------eeeecccccCcccceeEcCCCcee--------- Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDF-KNADYID-----YLPI--------------NMLATPKEVLNSSGYLRSFPGIAK--------- 51 (472) Q Consensus 1 M~~~~vPl~~G~~~~~-~~~d~~~-----~~pv--------------n~~~~~~e~~~s~~~Lrs~PGl~~--------- 51 (472) +..-.+|....+.... ...++++ ..+. .++-+........ .+.+.+|... T Consensus 169 ~~~~~~~~~t~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~g~~~~~~~~~~~~ 247 (792) T protein:vir:94 169 KIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQIN-SLSTEDGYADQLMNAVMHT 247 (792) T ss_pred eeeeeeecCcccceecccchhhhhhhhhhhccccccccccEEEECCeEEEEEecCCceee-eeecccCcCcceeeeeeec Confidence 2222333322211100 0001110 0000 0000000000000 0111122211 Q ss_pred ---eeecCCCc-cceee-----eeccCeEEEE---ECcceeeeeeeEEcccCceeEEEEcCCcE-EEEEECCceeEEEEe Q lcl|NC_011802. 52 ---RNDVNGIS-RGVEY-----NTAQNAVYRV---LGSKLYKGETVVGDVAGSGRVSMAHGRTS-QAVGVNGQLVEYRYD 118 (472) Q Consensus 52 ---~~~v~g~~-rg~~~-----~~~~~~lY~V---~G~~LY~v~~~iGtv~gsg~VsMa~Ng~~-~~iv~~g~~~~Y~~d 118 (472) ++.++.-+ .|-.. ...+...|.| ..+..|+ | ..+-|.+..-++.+. ..++..+.. -|.+. T Consensus 248 v~~~~~lp~~~~~G~~v~i~~~~~~~~d~y~v~~~~~~~~w~---E---~~~~~~~~~~~~~tmp~~lv~~~~~-~~~~~ 320 (792) T protein:vir:94 248 SQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKKVWK---E---VAGWGVQKGLNGGTMPHALVRQADG-SFQMQ 320 (792) T ss_pred ccccccccccCCCCcEEEEEccCCCCccceEEEEEcCCceEE---E---ecccceeeeecccccCeeEEEcCCC-cEEEE Confidence 01111111 01000 0001111111 1111221 1 111122211111111 112212111 12111 Q ss_pred ccchhhcc-cc----ccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCC----------CcCCccceeEeec Q lcl|NC_011802. 119 GTVKTVSN-WT----ADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDES----------HPDRYSAEYRAES 183 (472) Q Consensus 119 ~~~~t~s~-~~----~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s----------~~~~~l~fatAE~ 183 (472) ........ +. ..+.|.+.. ..+|+|..+|++|.. .+.++.|.-.|+. ..+|+.+++.+-. T Consensus 321 ~~~w~~r~~gd~~tnp~psf~g~~---i~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~ 395 (792) T protein:vir:94 321 VLPWTQRTCGDMDTNPTPSIVDQK---INDVFFFRNRLGFLA--GENIVMSRTSKYFSLFPASVANLSDDDPIDVAVSHN 395 (792) T ss_pred eccccccccCccccCccceeccCC---cceEEEEcceEEEec--CCeEEEEccCCcccCccccccCCCCCccEEEEecCC Confidence 11111100 01 123344443 258999999999865 4456666544321 2478999999999 Q ss_pred CCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEE Q lcl|NC_011802. 184 QPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI 262 (472) Q Consensus 184 ~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~ 262 (472) +++.|.-++++++.|+||.+..- |..+|+. ++....-.--.+ ..+|...-.-..++++++|+++.+.. -.|++ T Consensus 396 ~~~~i~~~v~~~~~L~l~T~~~q--~~l~~~~---~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~-~~v~r 469 (792) T protein:vir:94 396 RISILKYAVPFSEELLLWSDQAQ--FVLSAQG---ILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRASY-TSLNR 469 (792) T ss_pred cceeeeEEeecCCcEEEEecCcE--EEEeCCC---cccceeEEEEEEEEeeccCCCCceEeCCeEEEeecCCCe-eEEEe Confidence 99999999999999999976655 7777642 244433211112 35787766677899999999987631 12333 Q ss_pred -------ccCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECC-CeEEEEec---ccccCchheeeecc Q lcl|NC_011802. 263 -------IGSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKT 331 (472) Q Consensus 263 -------~~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P-~~Tw~yD~---~t~~w~~~w~~~~t 331 (472) .++|+++.+|- -++..|+.. . -.+.+++.+.+.+.....- +.-.+|-- .-.+..--||.-.+ T Consensus 470 ~~~~~~~~d~y~a~DlT~-~~~hl~~~~----v--~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~ 542 (792) T protein:vir:94 470 YYAVQDVSSVKSAEDMSA-HVPNYIPNG----V--FSIRGSSTENFISVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWEL 542 (792) T ss_pred eeeeccccCceehhhHHH-HHHHhcCCc----e--EEEEEeCCCCcEEEEEEcCCCeEEEEEEeecCCceEEEeEEEEEc Confidence 35667776633 345565433 1 2245566766654444433 34344321 12221124777666 Q ss_pred CccccceEeeeEeecCC-eE-EEEEccCCeEEEEcCCcc--CCCCCEEE-EEE--eec-----cccCCCceEEEEEEEEE Q lcl|NC_011802. 332 GLYDDVYRAIDFMYEGN-QI-TCGDKSEAVTGQLQFDIS--SQYDKQQE-HLL--FTP-----IFKADNARCFDLEVESS 399 (472) Q Consensus 332 g~~~~~~R~~~~~~~~g-~~-~vGD~~~g~l~~ld~~~~--~d~g~p~~-~~~--~tP-----~~~~~~~r~~~~~le~~ 399 (472) + +..+..|+.+.+. -| +|=...++.+-+++...- +.-+.+.. ++. .+. ...+++. ...+..... T Consensus 543 ~---g~~~~~~~~~~~D~l~~~v~r~~~~~~~r~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~-~T~~~~~~~ 618 (792) T protein:vir:94 543 G---SNVTVLACDSIGSTMYLVLRNQSHTWMCRAHFTKNSIDFPDEPYRLYIDNKVKYVIPEGSYNDDTY-ATTVKPVDV 618 (792) T ss_pred C---CcEEEEEEeecCCEEEEEEEeCCCEEEEEEEEeecccccCCCcceeeeeeeeeEEecCcceecCce-eeeeccccc Confidence 3 3445444433222 23 233333444444432210 00111110 000 000 0000000 000000000 Q ss_pred cCCCC--------------------------Cchhheee--eccC----ccccCcceeec------c-CC---Cccccee Q lcl|NC_011802. 400 TGVAQ--------------------------YADRLFLS--ATTD----GINYGREQMIE------Q-NE---PFVYDKR 437 (472) Q Consensus 400 ~Gv~~--------------------------~~~~~~l~--~sdD----G~~~~~~~~~~------~-g~---~g~~~~r 437 (472) .|... ..+.+-|. ++.. |-.|..+.... . |. .-+-.-| T Consensus 619 ~gl~~l~G~~v~v~~dG~~~~~~~~~~~~~~~~~~i~~~g~~~a~~v~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr 698 (792) T protein:vir:94 619 YGMKYWTGKFYIVASDGLVSWFEPPRGGWPNGVPMLTMSGNREGETIYVGLAISFRYVFSKFLIKKTADDGSIATEDIGR 698 (792) T ss_pred cCcccccCcEEEEEecCceeEeecccceecCCccEEEecCCccCCeEEEeeeeeEEEEeccceeeccCCCcCccccceee Confidence 11110 00111110 1110 22222211110 0 00 0001112 Q ss_pred EEEEeeE-ecccceeEEEEEEecCcceE---EEeEEEeC Q lcl|NC_011802. 438 VIWKRVG-RIRRLIGFKLRVITKSPVTL---SGCQIRLE 472 (472) Q Consensus 438 ~~~~rlG-~~r~~v~f~~r~~~~~~~~l---~~~~~~~e 472 (472) ++.||+= .+.+--.|++++....+... .+..+.-. T Consensus 699 ~rl~r~~~~~~~tg~~~v~~~~~~~~~~~~~~~~~~~~~ 737 (792) T protein:vir:94 699 LQLRRAWVNYEDSGAFTVEVENTSRLFSYDMAGARLGSN 737 (792) T ss_pred EEEEEEEEeeeccceeEEEEcCCCcceeeeeccceeccc Confidence 3333210 00000123333222111110 00000000 No 30 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=44.47 E-value=0.81 Score=21.10 Aligned_cols=438 Identities=10% Similarity=0.028 Sum_probs=171.8 Q ss_pred CceeeeeecccCccc-------cccCCeeEEeee---eee----------ecccccCcccceeEc-CCCceeeeecCCCc Q lcl|NC_011802. 1 MPIQQLPMMKGMGKD-------FKNADYIDYLPI---NML----------ATPKEVLNSSGYLRS-FPGIAKRNDVNGIS 59 (472) Q Consensus 1 M~~~~vPl~~G~~~~-------~~~~d~~~~~pv---n~~----------~~~~e~~~s~~~Lrs-~PGl~~~~~v~g~~ 59 (472) .+...++...+.... ....++.++..+ +.+ .+. +.+..-. .++ .-......++.... T Consensus 219 ~~~~~~~~~t~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~-~~~~~~~~~~~~~l~~~~ 296 (826) T protein:vir:63 219 APEYTLPNSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVST-DMGNNYG-IASGGMSLNATADLPALL 296 (826) T ss_pred ccccccCCCccccceecCCcccceeecceeEecccccEEEEeeCCcccEEEcc-CCCCcce-EEEEEeeccceeeccccC Confidence 222222211111100 000111111110 000 000 0000000 000 00001111221111 Q ss_pred c-----ceeeeec----------cCeEEEEEC---cceeeeeeeEEc-c-cCceeEEEEcCCcEEEEEECCceeEEEEec Q lcl|NC_011802. 60 R-----GVEYNTA----------QNAVYRVLG---SKLYKGETVVGD-V-AGSGRVSMAHGRTSQAVGVNGQLVEYRYDG 119 (472) Q Consensus 60 r-----g~~~~~~----------~~~lY~V~G---~~LY~v~~~iGt-v-~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~ 119 (472) + +...+.. .+..| |-. +.-|+-...-|. + ..+.|+.++.|. .+| .|.+.. T Consensus 297 p~~~~~~~~~~~~~~~~~~~g~~~d~~y-~~~~~~~~~w~e~~~~~~~~~~~tmp~~l~~~~------~~~---~f~~~~ 366 (826) T protein:vir:63 297 PGVGAPGVGVQFMDGAVMATGSTKAPVY-FEWDSANRRWAERAAYGTDWVLKKMPLALRWDE------ATD---TYSLNE 366 (826) T ss_pred CCcccceEEEeeEEeEEecCCCcccceE-EEEEcCCceEEEEeecCcccccccceEEEEEec------cCC---eEEEec Confidence 1 1110000 01111 111 122221111111 1 123344443321 111 222222 Q ss_pred cchhhcc-cc----ccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecC Q lcl|NC_011802. 120 TVKTVSN-WT----ADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQ 184 (472) Q Consensus 120 ~~~t~s~-~~----~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~ 184 (472) ....... .. ..+.|.+ -.+.+|+|..+|++|.. .+.++.|.-.|. ...+|+.+++-+-.+ T Consensus 367 ~~w~~r~~Gd~~tnp~psf~g---~~~~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~ 441 (826) T protein:vir:63 367 LEYDRRGSGDEDTNPTFNFVT---RGITGMTTFQGRLVLLS--QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSL 441 (826) T ss_pred cccccccccccccCCCccccC---CCceEEEEEeceEEEee--CCeEEEEccCCccccccccccCCCCCccEEEEEcCCc Confidence 1111100 01 1222322 23678999999998875 344555544332 234889999999999 Q ss_pred CCceEEEEecCCEEEEEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEEc Q lcl|NC_011802. 185 PDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYII 263 (472) Q Consensus 185 pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~ 263 (472) ++.|.-++++++.|+||.+..- |..+|.. ++....-.--.+ ..+|...-.-..++++++|+++.+..-..|+.+ T Consensus 442 ~~~i~~~v~~~~~L~l~T~~~q--~~ls~~~---~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~ 516 (826) T protein:vir:63 442 TEPYEHAVTFNKDLIVFAKKYQ--AVVPGGG---IVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEM 516 (826) T ss_pred ceeeEEEeecCCcEEEEecCcE--EEEeCCC---cccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEE Confidence 9999999999999999976655 6677642 244332111111 357877777779999999998765221123322 Q ss_pred -------cCccceecCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECCC-eEEEEec---ccccCchheeeeccC Q lcl|NC_011802. 264 -------GSGQASPIATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLPR-HVLVYDA---SSSQNGPQWCVLKTG 332 (472) Q Consensus 264 -------~g~q~~rIST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P~-~Tw~yD~---~t~~w~~~w~~~~tg 332 (472) +.|+++.+|-| +.+.|+. .....+++.+-+.+.....-+ .-.||-- .-.+..--||.-.++ T Consensus 517 ~~~~d~~~~y~~~dlt~~-~~~l~~~-------~v~~~a~s~~~~~v~~~~~~dg~l~~~~y~~~~~e~~v~aW~~~~~~ 588 (826) T protein:vir:63 517 APSPSTDSHYVAEDVTSH-IPSYMPG-------PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR 588 (826) T ss_pred EeeeccccceehhHHHHH-HHHhcCC-------CeEEEEEcCCCCEEEEEEcCCCEEEEEEEeeCCCcEEEEeEEEEecC Confidence 23555556333 4445532 244455666666555444433 3333331 111211147777663 Q ss_pred ccccceEeeeEeecCCeEEEEE-ccCCeEEEEcCCc------cCCCCCEEEEEE---------------eeccccCCCc- Q lcl|NC_011802. 333 LYDDVYRAIDFMYEGNQITCGD-KSEAVTGQLQFDI------SSQYDKQQEHLL---------------FTPIFKADNA- 389 (472) Q Consensus 333 ~~~~~~R~~~~~~~~g~~~vGD-~~~g~l~~ld~~~------~~d~g~p~~~~~---------------~tP~~~~~~~- 389 (472) +.....|++ .+.-|++=. ..++.+-+++... .+...++..+.+ ..++-|.+.. T Consensus 589 ---g~v~~~~~i-~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~~~~d~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 664 (826) T protein:vir:63 589 ---HQIIGAYFT-GDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVAGELELTKQHWDLIKDASAVYQ 664 (826) T ss_pred ---CcEEEEEEE-CCeEEEEEEeCCCEEEEEEEEEecCCccccccCCccceEEEEEeeeeeeccCcceeecccCcccccE Confidence 466666666 444444422 2334344432211 010011111000 0010010000 Q ss_pred ----------------eEE---EEEEEEE---------cCCCC----Cchhheeeecc-CccccCcce----eeccCCCc Q lcl|NC_011802. 390 ----------------RCF---DLEVESS---------TGVAQ----YADRLFLSATT-DGINYGREQ----MIEQNEPF 432 (472) Q Consensus 390 ----------------r~~---~~~le~~---------~Gv~~----~~~~~~l~~sd-DG~~~~~~~----~~~~g~~g 432 (472) ++. ..+|++. .|.+= +.+.++++... ++.+=++-| ++++++.| T Consensus 665 ~~~~~~~~~~~~~~~~~~~~~g~v~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~gr~~l~r~~~~~~~tg 744 (826) T protein:vir:63 665 LQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTG 744 (826) T ss_pred EEEeeCccccCCccceEEecCCEEEEecCCCccccEEEEeeeeeEEEEecceEEEccCCCcceeccEEEEEEEEEeeccc Confidence 010 0112211 12221 12334444222 233222222 12233333 Q ss_pred cccee-------------EEEEeeEec-------c------------cceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 433 VYDKR-------------VIWKRVGRI-------R------------RLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 433 ~~~~r-------------~~~~rlG~~-------r------------~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .+..+ ..=+|+|-. . .+...+|.+....|.++....+..| T Consensus 745 ~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~p~~~t~~~~vP~~~~~~~~~i~i~~d~P~p~~il~i~~~ 816 (826) T protein:vir:63 745 EFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYN 816 (826) T ss_pred cEEEEecCccccceeEeecCCceecccccccccccccceEEEEEEeeccceEEEEEEeCCCCcEEEEEEEEE Confidence 22211 111233210 0 0123446677777888888877777 No 31 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=41.14 E-value=0.94 Score=20.73 Aligned_cols=441 Identities=10% Similarity=0.056 Sum_probs=158.2 Q ss_pred CceeeeeecccCccccccCCeeEE-eeee---eeecccccCcccceeEcCCCc-eeeeecCCCc-cceeeeec-----cC Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDY-LPIN---MLATPKEVLNSSGYLRSFPGI-AKRNDVNGIS-RGVEYNTA-----QN 69 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~-~pvn---~~~~~~e~~~s~~~Lrs~PGl-~~~~~v~g~~-rg~~~~~~-----~~ 69 (472) ..++..-+....+.-.....+... .|.. ...+-..+....+ +.....- ....+|+.-+ .|....+. +. T Consensus 198 a~~l~~~l~~~g~~v~~~~g~~~i~~~~~~~v~t~s~~~g~~~t~-~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~~~~~~ 276 (794) T protein:vir:99 198 IDQLAAGLINKGWAVTKGSGYFYFSKSGSVIINSLEVEDGYNGQL-AWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQ 276 (794) T ss_pred hhhhHhhhhcccceEEeCCeEEEEEecCCceeEEEEeecCCCCce-eeEEeeeccceeecccCCCCCeEEEEeccCCCCC Confidence 000000000000000000000000 0000 0000000000000 0000000 0011222111 11111100 00 Q ss_pred -e--EEEEECcceeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEE---eccchh----hccccccccccCCccc Q lcl|NC_011802. 70 -A--VYRVLGSKLYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRY---DGTVKT----VSNWTADSGFTQYELG 139 (472) Q Consensus 70 -~--lY~V~G~~LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~---d~~~~t----~s~~~~d~~f~~~~~~ 139 (472) . +.....+..|+-+.-.|.+.|-....|.+. ++-.+. ..|.+ +..... .+| ..+.|.+.. T Consensus 277 ~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~-----~v~~~~-~~~~~~~~~w~~r~~Gd~~tn--p~psf~g~~-- 346 (794) T protein:vir:99 277 DDYYVRFDASRNVWTECPAPNIKADYNKATMPHV-----LIREAD-GTFTFKQADWTHRAAGDDETN--PYPSFIGNS-- 346 (794) T ss_pred CceEEEEEcCCceEEeeccceeecceeccceEEE-----EeccCC-CceeEeeccccccccCCcccC--CCccccCcc-- Confidence 0 111122233321111121112111122222 111111 01111 111111 111 223343432 Q ss_pred ccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEE Q lcl|NC_011802. 140 SVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYF 209 (472) Q Consensus 140 ~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw 209 (472) +.+|+|..+|++|..+ +.++.|.-.|+ ...+|+.+++.+-.+++.|.-++++++.|+||.+..- | T Consensus 347 -is~v~f~q~RL~f~~~--~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q--~ 421 (794) T protein:vir:99 347 -INDIFFFRNRLGFLSG--ENVILSGSGNYFNFFPESVAVLTDTDPIDVAVSTNRISILKYAVPFSEELILWSDQAQ--F 421 (794) T ss_pred -eeEEEEEeeeEEEecC--CeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcE--E Confidence 3689999999998753 45555544332 2248899999999999999999999999999976655 7 Q ss_pred EecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEE------EccCccceecCCHHHHHHHhh Q lcl|NC_011802. 210 SLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVY------IIGSGQASPIATASIEKIIRS 282 (472) Q Consensus 210 ~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~------~~~g~q~~rIST~~iE~~i~~ 282 (472) ..+|+. ++....-.--.+ ..+|...-.-..++++++|+++.+..-..+. --++|+++.+| --++.+|+. T Consensus 422 ~l~~~~---~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~a~Dlt-~~~~hl~~~ 497 (794) T protein:vir:99 422 VLSSDG---GLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRAKFSSVRRFYAVQDVTQVKNAEDIS-AHVPYYVEN 497 (794) T ss_pred EEeCCC---cccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCeeEEEEeeeeccccCceehhhHH-HHHHHhcCC Confidence 777642 233322111112 3578888888899999999998874222222 22566666663 334555543 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC-eEE--EEecc-cccCchheeeeccCccccceEeeeEee-cCCeEEEEEccC Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR-HVL--VYDAS-SSQNGPQWCVLKTGLYDDVYRAIDFMY-EGNQITCGDKSE 357 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~-~Tw--~yD~~-t~~w~~~w~~~~tg~~~~~~R~~~~~~-~~g~~~vGD~~~ 357 (472) . . ..+++|+.+.+.+.....-+ .-. .|.-. -.+...-||.-.++ ..++..|+++ .+.-+++-...+ T Consensus 498 ~----~--~~~~a~~~~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~---g~~~~~~~~~~~d~l~~~v~r~~ 568 (794) T protein:vir:99 498 G----V--FKMSGSSTENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFG---VNCRVLCCDMIGAVMHLIIDSPS 568 (794) T ss_pred C----e--EEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcC---CCeEEEEEEEcCCEEEEEEEeCC Confidence 2 1 23566777776665555443 333 34321 12222258877763 3566555543 334456666544 Q ss_pred C-eEEEEcC--CccCCCCCEEEEEE---ee---c--cccCCCceEEEEEEEEEcCCCCCchhheeeeccC---------- Q lcl|NC_011802. 358 A-VTGQLQF--DISSQYDKQQEHLL---FT---P--IFKADNARCFDLEVESSTGVAQYADRLFLSATTD---------- 416 (472) Q Consensus 358 g-~l~~ld~--~~~~d~g~p~~~~~---~t---P--~~~~~~~r~~~~~le~~~Gv~~~~~~~~l~~sdD---------- 416 (472) + .|-+++. +..+..+.+....+ .+ | ..+.+.. .-.+.+....|...-+- -.+.+..| T Consensus 569 ~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~~l~g-~~v~~~~dg~~~~~~~~~ 646 (794) T protein:vir:99 569 GVLMEKIEFTQNTKDYPDEPYRLYVDRKIEYTFPEGSYNDDDF-KTRVKLKDIYGSTPANG-QYVFISLGGVTFTFDPPA 646 (794) T ss_pred CEEEEEEEeeeCCCCCCCcccceeeeeeeeeeecccccccCcc-eeEEeccccccccccCC-ceEEEEeCCceeeeeccc Confidence 4 3333431 11111111111000 00 0 0000000 00001111111100000 00111111 Q ss_pred -----------------------ccccCcce-----eec-cCCCccc----ceeEEEEeeE-ecccceeEEEEEEecCcc Q lcl|NC_011802. 417 -----------------------GINYGREQ-----MIE-QNEPFVY----DKRVIWKRVG-RIRRLIGFKLRVITKSPV 462 (472) Q Consensus 417 -----------------------G~~~~~~~-----~~~-~g~~g~~----~~r~~~~rlG-~~r~~v~f~~r~~~~~~~ 462 (472) |-.|-.+. .++ ....|.. ..|++.||+= +..+--+|++.+....+- T Consensus 647 ~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~ 726 (794) T protein:vir:99 647 GGWQANDGLIEFDGDLRGTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVNYDKSGNFRVEVNNQGRT 726 (794) T ss_pred ceEecCccEEEecCCCCCcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEEeecccceEEEECCCccc Confidence 22221111 000 0000100 1122222211 000011344433332211 Q ss_pred eE---EEeEEEe-C Q lcl|NC_011802. 463 TL---SGCQIRL-E 472 (472) Q Consensus 463 ~l---~~~~~~~-e 472 (472) .. .+..+.- | T Consensus 727 ~~~~~~~~~~~~~~ 740 (794) T protein:vir:99 727 FTYNMTGNRLSTNE 740 (794) T ss_pred eeeecccccccccc Confidence 11 0001100 0 No 32 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=38.04 E-value=1.1 Score=20.38 Aligned_cols=397 Identities=11% Similarity=0.067 Sum_probs=182.8 Q ss_pred Cceeee-----------------------eecccCc-cc---cccCCeeEE----eeeeeeecccc-cCcccceeEcCCC Q lcl|NC_011802. 1 MPIQQL-----------------------PMMKGMG-KD---FKNADYIDY----LPINMLATPKE-VLNSSGYLRSFPG 48 (472) Q Consensus 1 M~~~~v-----------------------Pl~~G~~-~~---~~~~d~~~~----~pvn~~~~~~e-~~~s~~~Lrs~PG 48 (472) |...++ |.....+ .+ ....|...- --|.-+-.+.. +..|+......|| T Consensus 125 ~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~p~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg 204 (567) T protein:vir:82 125 VTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPG 204 (567) T ss_pred eeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCccccceEEEEEEEcCCCCcCCCcccccceeeecCC Confidence 333332 1111111 11 111221111 11111111111 2334344444566 Q ss_pred cee-eeecCCCccceeeeeccCeEEEEECcc---eeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhh Q lcl|NC_011802. 49 IAK-RNDVNGISRGVEYNTAQNAVYRVLGSK---LYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV 124 (472) Q Consensus 49 l~~-~~~v~g~~rg~~~~~~~~~lY~V~G~~---LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~ 124 (472) -.. ...++-++.+.. ...-++||=..+. =|. .+++++ -+..++.||.-+- --+..+.-+.|+.+...+ T Consensus 205 ~~V~ls~~p~~~~~~~--i~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m 276 (567) T protein:vir:82 205 TAVQLTLAPVPLQNAS--IKRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENM 276 (567) T ss_pred ceEEEeeccCCccccc--cceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCccc Confidence 633 334433343433 3445677733221 122 133332 1224455553221 113445555556555544 Q ss_pred ccccccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcc Q lcl|NC_011802. 125 SNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSS 204 (472) Q Consensus 125 s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~ 204 (472) ...+.-+ .|.-. .-.|+ ....|..-=+=+| =.-| ...-.+.||+++++..-|+++-.- T Consensus 277 ~GL~~m~----------------NGimA-gF~Gn-eV~FsEpylPyAW---P~~Y-r~t~~~dIVaiA~~gt~LVV~TkG 334 (567) T protein:vir:82 277 TGLCLMA----------------NGIAA-GFAGN-EVMFSEAYLPYAW---PEVN-RHTTAEDIVAICPLRTSLVVATKG 334 (567) T ss_pred ceeeecc----------------cceEE-eecCC-EEEEecCCCCccc---chhh-ccCCCCCeEEEEecccEEEEEEcC Confidence 4333211 12211 11233 3444422111111 0111 122357899999999999998776 Q ss_pred eEEEEEecCCCCCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHH--HHHHhh Q lcl|NC_011802. 205 TIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI--EKIIRS 282 (472) Q Consensus 205 T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~i--E~~i~~ 282 (472) .- +--+|.+ |.+-.-++. -+.-=|+.+.|+..++..+.|=|.|+- |...+.+++..+ |..| -+.+++ T Consensus 335 ~P--Yl~sG~s-P~sms~~kL---~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vv-T~~l~t~~qW~a 403 (567) T protein:vir:82 335 EP--YLFSGVS-PSTISGSKI---PSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALA-TEQIVSPEQWQS 403 (567) T ss_pred ce--EEEEcCC-hhhcccccc---ccccccccccceeeecceEEeecCCcE----EEEecCCchhhh-hhhccChHHHHh Confidence 66 5556643 444555553 346689999999999999999999983 444444566555 4444 344543 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCe Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAV 359 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~ 359 (472) ++.-+-...++.||.=|-.-+=.+ .+.-||.... .=.++++ +|-+.+.=...++.++ .+++. T Consensus 404 ----~~~P~ti~A~~~eG~Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~ 468 (567) T protein:vir:82 404 ----QFNPASIVAYPWRGEYIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDK 468 (567) T ss_pred ----cCCcceEEEEeecCeEEEEEeCCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCE Confidence 233355666888999433333332 5788886532 2222222 2222222222233333 34455 Q ss_pred EEEEcCCccCCCCCEEEEEEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeE Q lcl|NC_011802. 360 TGQLQFDISSQYDKQQEHLLFTPIFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRV 438 (472) Q Consensus 360 l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~-~~le~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~ 438 (472) |++++... .|+..+-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+...|. T Consensus 469 l~~~~~g~-----~~~~~~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~ 532 (567) T protein:vir:82 469 MSVLAGGA-----LPSTIRWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV 532 (567) T ss_pred EeeecCCC-----CceeEEEecceEEecCccceeEEEEec--cCC-CceeEEEEEcCCce-------eec-CCcccCCce Confidence 66644422 255556678887777753332 22322 111 11112222111111 221 334433443 Q ss_pred EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 439 IWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 439 ~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .||=-++-|. |+|.++...+|.---+.-.+| T Consensus 533 --~rlp~~~ar~-Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:82 533 --VRLPAATGQN-WQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred --eeccCcccce-EEEEEEecccEEEEEEecchh Confidence 3443334443 778888888876555555555 No 33 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=33.87 E-value=1.3 Score=19.91 Aligned_cols=438 Identities=11% Similarity=0.059 Sum_probs=158.3 Q ss_pred Ccee-e--ee--ecccCccccccCCee-EEeeeeeee-cccccCcccceeEcCCCce------------eeeecCCCccc Q lcl|NC_011802. 1 MPIQ-Q--LP--MMKGMGKDFKNADYI-DYLPINMLA-TPKEVLNSSGYLRSFPGIA------------KRNDVNGISRG 61 (472) Q Consensus 1 M~~~-~--vP--l~~G~~~~~~~~d~~-~~~pvn~~~-~~~e~~~s~~~Lrs~PGl~------------~~~~v~g~~rg 61 (472) +... + +| +..+.+ -+|. +..+-.++- +| .+.....+-+..|.. ...+++..+ T Consensus 198 l~~~~~~~~~~~~~~~~~-----~~w~~~~~~g~~~i~~p--~~~~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~-- 268 (801) T protein:vir:33 198 LAAQLRNNLGNPNNDQDP-----NKWRFNVGPGFIHILAP--NNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINA-- 268 (801) T ss_pred hhhhhhccCccceeeecC-----ceEEEEecCeEEEEecC--CCcccccccccCCccceeEEEEeecccceeeeeeec-- Confidence 1110 0 00 000000 0111 111111110 11 000000011111111 011111111 Q ss_pred eeeeeccCeEEEEECc---ce--e--eeeeeEE---cccCceeEEEEcCCcE-EEEEECCceeEEEEeccchhhcc-ccc Q lcl|NC_011802. 62 VEYNTAQNAVYRVLGS---KL--Y--KGETVVG---DVAGSGRVSMAHGRTS-QAVGVNGQLVEYRYDGTVKTVSN-WTA 129 (472) Q Consensus 62 ~~~~~~~~~lY~V~G~---~L--Y--~v~~~iG---tv~gsg~VsMa~Ng~~-~~iv~~g~~~~Y~~d~~~~t~s~-~~~ 129 (472) .++.+..|.+. .. | +....-| +..+-|.+.--++.+. ..++..+. .-|.+.....+-.. .+. T Consensus 269 -----~~g~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~tmp~~l~~~~~-~tf~~~~~~w~~r~~gd~ 342 (801) T protein:vir:33 269 -----PDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHTMPWALVRASD-GNFDFKYLEWGARTVGDD 342 (801) T ss_pred -----CCCcEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeeeecccceEEEEccC-ceEEecccCccccccCCc Confidence 01112222211 00 0 1000000 0000011111111111 00111110 01221111111100 001 Q ss_pred -cccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEE Q lcl|NC_011802. 130 -DSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFI 198 (472) Q Consensus 130 -d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l 198 (472) .+.+|...=..+.+|+|..+|++|.. .+.++.|.-.|+ ...+|+.+++.+-.+++.|.-++++++.| T Consensus 343 ~tnp~psf~g~~~~~v~f~q~RL~f~~--~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L 420 (801) T protein:vir:33 343 TTNPYPSFTGQTINDIFFFRNRLGFLS--GENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHDRVSTLKYAVPFSEEL 420 (801) T ss_pred cccCcccccCCCceEEEEEcceEEEee--CCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcE Confidence 11122222233578999999999875 345666654432 12488999999999999999999999999 Q ss_pred EEEEcceEEEEEecCCCCCccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEE-------ccCcccee Q lcl|NC_011802. 199 VCFGSSTIEYFSLTGATTVGAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYI-------IGSGQASP 270 (472) Q Consensus 199 ~lfG~~T~Evw~~tGa~~~~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~-------~~g~q~~r 270 (472) +||.+..- |..+|+. ++....-.--.+ ..||...-.-..++++++|+++.+.. -.|++ .++|+++. T Consensus 421 ~l~t~~~q--~~l~~~~---~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~-~~v~r~~~~~~~~d~y~~~D 494 (801) T protein:vir:33 421 LLWSDQAQ--FVLTASD---ILSSRSVGLNLTTQFDVQDRARPHGVGRNVYFSSPRASF-TSINRYYAVQDVSSVKNAED 494 (801) T ss_pred EEEecCcE--EEEeCCC---cccceeEEEEEEEeecccCCCCceEecCeEEEEecCCCe-eEEEEEEeecccccceehhh Confidence 99976655 6667642 244433211112 46888777778999999999988632 12332 45666666 Q ss_pred cCCHHHHHHHhhcCchhhccEEEEEEEeCCEEEEEEECC--CeEEEEec---ccccCchheeeeccCccccceEeeeEee Q lcl|NC_011802. 271 IATASIEKIIRSYTADELATGVMEALRLDSHELLIIHLP--RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMY 345 (472) Q Consensus 271 IST~~iE~~i~~y~~~e~~~A~~~~~~~~GH~fy~lt~P--~~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~~~ 345 (472) +|- -++..|+. ..+...++.+-....++.-- +.-.+|-- .-.+...-||.-.++ +..+..|+-. T Consensus 495 lt~-~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~---g~~~~~~~~~ 563 (801) T protein:vir:33 495 MTA-HVPNYIPN-------GVFSISGTTAENFVAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFG---DNVTVFAAQV 563 (801) T ss_pred HHH-HHHHhcCC-------ceEEEEEcCCCCeEEEEEecCCCEEEEEEEecCCCceEEEeeEEEEcC---CCEEEEEEec Confidence 633 34555532 23333343332222333332 23333321 111111147766663 2344444322 Q ss_pred -cCCeEEEEEccCC-eEEEEcC--CccCCCCCEEEE-EE---------------------ee----ccccCCCceEE--- Q lcl|NC_011802. 346 -EGNQITCGDKSEA-VTGQLQF--DISSQYDKQQEH-LL---------------------FT----PIFKADNARCF--- 392 (472) Q Consensus 346 -~~g~~~vGD~~~g-~l~~ld~--~~~~d~g~p~~~-~~---------------------~t----P~~~~~~~r~~--- 392 (472) .+.-|++=...++ .|-+|+. ...+..+.+... +. .. ++-|.++.-+. T Consensus 564 ~~d~l~~vv~r~~~~~le~~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~eg~~v~~~~ 643 (801) T protein:vir:33 564 INSTMTVLMSNEHAVWMGRLHFTKDSIDLPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLSTIYGMNFTKGRVSVVFP 643 (801) T ss_pred CCCEEEEEEEcCCcEEEEEEEEeeccccCCCccceEEeecceEEEecccceecCccccccccccccCCccccceEEEEEe Confidence 2223334333333 2222221 111111111000 00 00 01111111100 Q ss_pred -----E-----------EEEE---------EEcCCCC----Cchhheeeecc-Cccc--cCcce------eeccCCCccc Q lcl|NC_011802. 393 -----D-----------LEVE---------SSTGVAQ----YADRLFLSATT-DGIN--YGREQ------MIEQNEPFVY 434 (472) Q Consensus 393 -----~-----------~~le---------~~~Gv~~----~~~~~~l~~sd-DG~~--~~~~~------~~~~g~~g~~ 434 (472) . ..++ +..|.+- +.+.++++-.+ ||.. =.+.| .++++..|.+ T Consensus 644 dG~v~~~~~~~~~~~~~~~l~i~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r~~l~r~~~~~~~tg~~ 723 (801) T protein:vir:33 644 DGKIVEIDQPINGWSSDPMLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAF 723 (801) T ss_pred CCceEeeeeccccccCceeEEecCCCCCCEEEEeeeeeEEEEeCceEEeccCCCCceeeeeeccEEEEEEEEEeecCcce Confidence 0 0000 0011110 01122232221 1111 11111 1112222222 Q ss_pred ce-----------eEEEEeeEec-------------------ccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 435 DK-----------RVIWKRVGRI-------------------RRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 435 ~~-----------r~~~~rlG~~-------------------r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .. +..-+|+|.. ..+--.+|++....|..+....+..| T Consensus 724 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvl~i~~e 791 (801) T protein:vir:33 724 IIRVNNLSREFIYTMAGARLGSDNLRVGGSNIGTGQYRFPVVGNAQTNTVTIESDASTPLNIIGCGWE 791 (801) T ss_pred EEEECCcccceeeeecccccccccccccccccccceEEEEeeccCceEEEEEEeCCCCCEEEEEEEEE Confidence 11 1112233221 00112356666777777777777777 No 34 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=33.70 E-value=1.3 Score=19.89 Aligned_cols=428 Identities=11% Similarity=0.074 Sum_probs=164.9 Q ss_pred CceeeeeecccCccccccCCeeEEeeeeeeecccccCcccceeEcCCCceeeee----c-CCCccceeeeeccCeEEEEE Q lcl|NC_011802. 1 MPIQQLPMMKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKRND----V-NGISRGVEYNTAQNAVYRVL 75 (472) Q Consensus 1 M~~~~vPl~~G~~~~~~~~d~~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~----v-~g~~rg~~~~~~~~~lY~V~ 75 (472) =.-+.+-+....+. .-........|+. +. .+.+...-.+++|-...+- + .|...-..| ++-. T Consensus 262 ~~~~~~~~~~~~g~--~~~~~~~~~~v~~--~~--~l~a~~p~~~~~~~~~~~~~~~~~~~g~~~~~~y-------~~~~ 328 (826) T protein:vir:78 262 DGDIVVEVSTDMGN--NYGIASGGMSLNA--TA--DLPALLPGAGTPGTGVQFMDGAIMATGSTKAPVY-------FAWD 328 (826) T ss_pred CCCeEEEeccCCCc--cceEEEeeEEEec--cc--ceeeeecccccceEEEEEEeeeEecCCCccccee-------EEEE Confidence 00111111111110 0011111222221 10 0000000011222221111 1 111111111 1111 Q ss_pred -CcceeeeeeeEEc-cc-CceeEEEEcCCcEEEEEECCceeEEEEeccchhhcc-cc----ccccccCCcccccceeeee Q lcl|NC_011802. 76 -GSKLYKGETVVGD-VA-GSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSN-WT----ADSGFTQYELGSVRDITRL 147 (472) Q Consensus 76 -G~~LY~v~~~iGt-v~-gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~-~~----~d~~f~~~~~~~~~dv~~~ 147 (472) .+..|+-...-|. +. .+.|.-+..+. .++ .|.+......-.. .. ..+.|.+ -.+.+|+|. T Consensus 329 ~~~~~w~e~a~~g~~~~~~tmp~~l~~~~------~~~---~f~~~~~~w~~r~~gd~~tnp~psf~g---~~i~~v~f~ 396 (826) T protein:vir:78 329 AANRRWAERAAYGTDWVLKKMPLALRWDE------STD---TYSLNELEYDRRGSGDEETNPTFNFVK---RGITGMTTF 396 (826) T ss_pred cCCceEEEeeccCcccccccccEEEEEec------CCC---eEEEeeccccccccCcccccCcccccC---CCceEEEEE Confidence 1122321111111 11 22233222210 111 1222211111100 00 1222322 236789999 Q ss_pred ceeEEEEecCCCeEEEEcccCC----------CCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCC Q lcl|NC_011802. 148 RGRYAWSKDGTDSWFITDLEDE----------SHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTV 217 (472) Q Consensus 148 dGyfv~~~~g~~~~~iS~L~D~----------s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~ 217 (472) .+|++|..+ +.++.|.-.|+ ...+|+.+++-+-.+++.|.-++++++.|+||.+..- |..+|.. T Consensus 397 q~RL~f~~~--~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e--~~l~~~~-- 470 (826) T protein:vir:78 397 QGRLVLLSQ--EYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ--AVVPGGG-- 470 (826) T ss_pred eceEEEeeC--CeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcE--EEEeCCC-- Confidence 999988753 44555543322 2357899999999999999999999999999977665 6777642 Q ss_pred ccccceeccccee-eeccccchhheecCceEEEEEeccccccEEEEc-------cCccceecCCHHHHHHHhhcCchhhc Q lcl|NC_011802. 218 GAALYVAHASLMV-QKGIAGTYCKTPFADSYAFISHPATGAPSVYII-------GSGQASPIATASIEKIIRSYTADELA 289 (472) Q Consensus 218 ~~fpy~r~~~~~I-~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~-------~g~q~~rIST~~iE~~i~~y~~~e~~ 289 (472) ++....-.--.. ..||...-.-..++++++|+++.+..-..|+.+ +.|+++.+|-|. .+.|+. T Consensus 471 -~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~-~~l~~~------- 541 (826) T protein:vir:78 471 -IVTPRTAVISITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHI-PSYMPG------- 541 (826) T ss_pred -cccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHH-HHhcCC------- Confidence 243332111111 358888777789999999998765221123222 235666664443 555532 Q ss_pred cEEEEEEEeCCEEEEEEECC-CeEEEEec---ccccCchheeeeccCccccceEeeeEeecCCeEEEEEcc-CCeEEEEc Q lcl|NC_011802. 290 TGVMEALRLDSHELLIIHLP-RHVLVYDA---SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKS-EAVTGQLQ 364 (472) Q Consensus 290 ~A~~~~~~~~GH~fy~lt~P-~~Tw~yD~---~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~-~g~l~~ld 364 (472) ..+..+|+.+-+.+.....- ..-+||-- .-.+..--||.-.++ +..+..|++ .+.-|++=... ++.+-+++ T Consensus 542 ~v~~~a~s~~~~~~v~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~---g~v~~v~~i-~d~l~~vv~r~~~~~~~r~~ 617 (826) T protein:vir:78 542 PAEYIQAAASSGYLVFGTSAADEMICHQYLWQGNEKVQNAYHRWTLR---HQIIGAYFT-GDNLMVLIQKGQEIALGRMH 617 (826) T ss_pred CeEEEEEeCCCCeEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEccC---CcEEEEEEE-CCeEEEEEEeCCCEEEEEEE Confidence 24555666666655444443 34444432 111211248777663 467776666 55666665543 34444443 Q ss_pred -----CCccCCCCCEEEEEE----eeccccCCCc---------eEEEEEEEEEcCCCC--------CchhheeeeccC-- Q lcl|NC_011802. 365 -----FDISSQYDKQQEHLL----FTPIFKADNA---------RCFDLEVESSTGVAQ--------YADRLFLSATTD-- 416 (472) Q Consensus 365 -----~~~~~d~g~p~~~~~----~tP~~~~~~~---------r~~~~~le~~~Gv~~--------~~~~~~l~~sdD-- 416 (472) .+...+......+.. ....++.... -+..++-+++..... ....+.|+-..+ T Consensus 618 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 697 (826) T protein:vir:78 618 LNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDGAAVYQLQPQVGAYMERYQLGVKRETSTKVFLDVPEAVV 697 (826) T ss_pred EEecCCCccccccccceeEEEEEEEcceeccccceeEEecCCceeeeeccceeeeccccceeccccCCCceEEEeCCCcc Confidence 222222221111110 1111221111 111222222111110 111122221111 Q ss_pred ------ccccCcceee------ccCCCcccceeEEEEeeE-ecccceeEEEEEEecCcc----------eEEEeEEEeC Q lcl|NC_011802. 417 ------GINYGREQMI------EQNEPFVYDKRVIWKRVG-RIRRLIGFKLRVITKSPV----------TLSGCQIRLE 472 (472) Q Consensus 417 ------G~~~~~~~~~------~~g~~g~~~~r~~~~rlG-~~r~~v~f~~r~~~~~~~----------~l~~~~~~~e 472 (472) |-.|-.+... .......-..|++.||+- .+++--.|++++....+- -+....+.+. T Consensus 698 ~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g 776 (826) T protein:vir:78 698 GSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAG 776 (826) T ss_pred ccEEEEeeceeEEEEeCceEEecCCCcceeecceEEEEEEEEeeccccEEEEeCCCccCcceeeeecccccccccccCC Confidence 2222222111 000111122234444321 001111244444322211 1111111111 No 35 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=31.88 E-value=1.5 Score=19.67 Aligned_cols=251 Identities=14% Similarity=0.093 Sum_probs=108.8 Q ss_pred Cceeeee---------------ecccCccccccCC-e-eEEeeeeeeecccccCcccceeEcCCCceeeeecCCCcccee Q lcl|NC_011802. 1 MPIQQLP---------------MMKGMGKDFKNAD-Y-IDYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGISRGVE 63 (472) Q Consensus 1 M~~~~vP---------------l~~G~~~~~~~~d-~-~~~~pvn~~~~~~e~~~s~~~Lrs~PGl~~~~~v~g~~rg~~ 63 (472) -+++..| ++...++ -..+ + ..+--|+.+-.+......+.-+..-+|+.--..+.... T Consensus 117 ~p~~~~~~~~y~L~vp~P~~a~~~a~~Gs--l~~~~~~Y~~t~V~~~gEEs~p~~~S~~v~~~gg~~vtl~~~~~~---- 190 (396) T protein:vir:10 117 GIFTYDGAQAERLTLDTPAPPLLVAGAGS--LSQGTYGAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDA---- 190 (396) T ss_pred CceeeeCCcceecCcCCCcccccccccCc--cCCceEEEEEEEEecCCCcCcccccccccCCCCCcEEEEEcccCC---- Confidence 1111111 1111111 0111 1 11112232222222223333232223332222211111 Q ss_pred eeeccCeEEEE--ECcceeeeeeeEEccc-CceeEEEEcCCcEEEEEECCceeEEEEeccchhhccccccccccCCcccc Q lcl|NC_011802. 64 YNTAQNAVYRV--LGSKLYKGETVVGDVA-GSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWTADSGFTQYELGS 140 (472) Q Consensus 64 ~~~~~~~lY~V--~G~~LY~v~~~iGtv~-gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~s~~~~d~~f~~~~~~~ 140 (472) +...-++|+= .|+-+| .+++++ +.... + +.-+-|++.... ..++... .+ T Consensus 191 -~i~~~RiYrS~~~G~~~~----l~aE~~a~~~s~-----------v----lPs~~w~gpP~~------~~gL~pm--P~ 242 (396) T protein:vir:10 191 -SVTGARLYLTRANGGELL----LAGDYPLGAATV-----------I----LPTLPELGRPAQ------FRHLSPM--PT 242 (396) T ss_pred -CcceEEEEEeCCChhhhh----heehhccceeee-----------e----eecCCCCCCCcc------ccccccC--ch Confidence 1112233321 011122 133333 11111 1 111222222221 1223222 22 Q ss_pred cceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcceEEEEEecCCCCCccc Q lcl|NC_011802. 141 VRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAA 220 (472) Q Consensus 141 ~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~T~Evw~~tGa~~~~~f 220 (472) ..=+++..||.++ ..|+.-+| |.-.-+ ++.++..-+. -.+..|+++.+...-|++.-+.- ++..+| ++|.+. T Consensus 243 G~~~A~faGRi~~-A~Gn~V~F-SEp~~P-h~~~~~~~~~--~~~~~Iv~lapv~~gL~Vgt~~~--~y~~~G-~dP~sm 314 (396) T protein:vir:10 243 GKHLAYWRGRLLI-ARANVLRF-SEALAY-HLHDERYGFV--QMPQRITFVQPVDGGIWVGQVDH--VAFLDG-ADPASL 314 (396) T ss_pred hHhhhhhcceEEE-EeCCEEEE-ecCCCC-ceecchhccC--CCCCceEEEEEecCeEEEEEcCc--EEEEEc-CChhHc Confidence 2335677888654 55776554 432222 2123322222 14568999999988888765544 477787 446666 Q ss_pred cceec--------ccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHHHHHHhhcCchhhccEE Q lcl|NC_011802. 221 LYVAH--------ASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTADELATGV 292 (472) Q Consensus 221 py~r~--------~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~iE~~i~~y~~~e~~~A~ 292 (472) .+++. +++.++-+|++..++...+..+.|.|.++ .|.-+.+++... .|.+ +|+- +... T Consensus 315 s~~~l~~~~pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~dG----l~~g~~~G~v~~-l~~~---~i~p------~~~~ 380 (396) T protein:vir:10 315 SVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG----YVMGTSSGAIAE-VHAG---VLAG------ITGR 380 (396) T ss_pred ceeecccCCCcccchhcccchhhhcccccccCcEEEEccCCc----EEEEcCCceeee-eccc---ccCC------Cccc Confidence 66665 23445666789999999999999999998 243444445444 3443 3321 1222 Q ss_pred E-EEEEeCCEEEEEEE Q lcl|NC_011802. 293 M-EALRLDSHELLIIH 307 (472) Q Consensus 293 ~-~~~~~~GH~fy~lt 307 (472) + -+.-++.+-+..++ T Consensus 381 A~~~~~~drRy~~~~~ 396 (396) T protein:vir:10 381 AGTSVVFDRRLLTAVS 396 (396) T ss_pred ceEEEeecCeEEEEeC Confidence 3 23345666444444 No 36 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=31.43 E-value=1.5 Score=19.62 Aligned_cols=397 Identities=11% Similarity=0.070 Sum_probs=181.8 Q ss_pred Cceeee-----------------------eecccCc-ccc---ccCCeeEE----eeeeeeecccc-cCcccceeEcCCC Q lcl|NC_011802. 1 MPIQQL-----------------------PMMKGMG-KDF---KNADYIDY----LPINMLATPKE-VLNSSGYLRSFPG 48 (472) Q Consensus 1 M~~~~v-----------------------Pl~~G~~-~~~---~~~d~~~~----~pvn~~~~~~e-~~~s~~~Lrs~PG 48 (472) |...++ |.....+ .+. +..|...- --|.-+-.+.. +..|+......|| T Consensus 125 ~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg 204 (567) T protein:vir:10 125 VTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPG 204 (567) T ss_pred eeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCC Confidence 333332 1111111 111 11221111 11111111111 2233333344455 Q ss_pred cee-eeecCCCccceeeeeccCeEEEEECcc---eeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhh Q lcl|NC_011802. 49 IAK-RNDVNGISRGVEYNTAQNAVYRVLGSK---LYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV 124 (472) Q Consensus 49 l~~-~~~v~g~~rg~~~~~~~~~lY~V~G~~---LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~ 124 (472) -.. ...++-++.+.. ...-++||=..+. =|. .+++++ -+..++.||.-+- --+..+.-+.|+.+...+ T Consensus 205 ~~V~ls~~p~~~~~~~--i~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m 276 (567) T protein:vir:10 205 TAVQLTLAPVPLQNAS--IKRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENM 276 (567) T ss_pred ceEEEeeccCCccccc--cceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCccc Confidence 533 333333333333 3445677733221 122 133332 1224455553221 113445555556555544 Q ss_pred ccccccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcc Q lcl|NC_011802. 125 SNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSS 204 (472) Q Consensus 125 s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~ 204 (472) ...+.-+ .|.-. .-.|+ ....|..-=+=+| =.-| ...-.+.||+++++..-|+++-.- T Consensus 277 ~GL~~m~----------------NGimA-gF~Gn-eV~FsEpylPyAW---P~~Y-r~t~~~dIVaiA~~gt~LVV~TkG 334 (567) T protein:vir:10 277 TGLCLMA----------------NGIAA-GFAGN-EVMFSEAYLPYAW---PEVN-RHTTAEDIVAICPLGTSLVVATKG 334 (567) T ss_pred ceeeecc----------------cceEE-eecCC-EEEEecCCCCccc---chhh-ccCCCCCeEEEeecccEEEEEEcC Confidence 4333211 12211 11233 3444422111111 0111 122357899999999999998776 Q ss_pred eEEEEEecCCCCCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHH--HHHHhh Q lcl|NC_011802. 205 TIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI--EKIIRS 282 (472) Q Consensus 205 T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~i--E~~i~~ 282 (472) .- +--+|.+ |.+-.-++ .-+.-=|+.+.|+..++..+.|=|.|+- |...+.+++..+ |..| -+.+++ T Consensus 335 ~P--Yl~sG~s-P~sms~~k---L~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vv-T~~l~t~~qW~a 403 (567) T protein:vir:10 335 EP--YLFSGVS-PSTISGSK---IPSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALA-TEQIVSPEQWQS 403 (567) T ss_pred ce--EEEEcCC-hhhccccc---cccccccccccceeEeccEEEeecCCcE----EEEecCCchhhh-hhhccChHHHHh Confidence 66 5556643 44555555 3346789999999999999999999983 444444566555 4444 344543 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCe Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAV 359 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~ 359 (472) ++.-+-...++.||.=|-.-+=.+ .+.-||.... .=.++++ +|-+.+.=...++.++ .+++. T Consensus 404 ----~~~P~ti~A~~~eG~Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~ 468 (567) T protein:vir:10 404 ----QFNPASIVAYPWRGEYIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDK 468 (567) T ss_pred ----cCCcceEEEEeecCeEEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCE Confidence 233355666888999433333332 5788886532 2222222 2222222222233333 34455 Q ss_pred EEEEcCCccCCCCCEEEEEEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeE Q lcl|NC_011802. 360 TGQLQFDISSQYDKQQEHLLFTPIFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRV 438 (472) Q Consensus 360 l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~-~~le~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~ 438 (472) |++++... .|+..+-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+...|. T Consensus 469 l~~~~~g~-----~~~~~~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~ 532 (567) T protein:vir:10 469 MSVLAGGA-----LPSTIRWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV 532 (567) T ss_pred EeeecCCC-----CceeEEEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce Confidence 66644422 255556677887777653332 22322 111 11112222111111 221 334433443 Q ss_pred EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 439 IWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 439 ~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .||=-++-|. |+|.++...+|.---+.-.+| T Consensus 533 --~rlp~~~ar~-Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:10 533 --VRLPAATGQN-WQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred --eecCCcccce-EEEEEEecccEEEEEEecchh Confidence 3443334443 778888888876555555555 No 37 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=31.43 E-value=1.5 Score=19.62 Aligned_cols=397 Identities=11% Similarity=0.070 Sum_probs=181.8 Q ss_pred Cceeee-----------------------eecccCc-ccc---ccCCeeEE----eeeeeeecccc-cCcccceeEcCCC Q lcl|NC_011802. 1 MPIQQL-----------------------PMMKGMG-KDF---KNADYIDY----LPINMLATPKE-VLNSSGYLRSFPG 48 (472) Q Consensus 1 M~~~~v-----------------------Pl~~G~~-~~~---~~~d~~~~----~pvn~~~~~~e-~~~s~~~Lrs~PG 48 (472) |...++ |.....+ .+. +..|...- --|.-+-.+.. +..|+......|| T Consensus 125 ~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg 204 (567) T protein:vir:33 125 VTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPG 204 (567) T ss_pred eeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCC Confidence 333332 1111111 111 11221111 11111111111 2233333344455 Q ss_pred cee-eeecCCCccceeeeeccCeEEEEECcc---eeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhh Q lcl|NC_011802. 49 IAK-RNDVNGISRGVEYNTAQNAVYRVLGSK---LYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV 124 (472) Q Consensus 49 l~~-~~~v~g~~rg~~~~~~~~~lY~V~G~~---LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~ 124 (472) -.. ...++-++.+.. ...-++||=..+. =|. .+++++ -+..++.||.-+- --+..+.-+.|+.+...+ T Consensus 205 ~~V~ls~~p~~~~~~~--i~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m 276 (567) T protein:vir:33 205 TAVQLTLAPVPLQNAS--IKRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENM 276 (567) T ss_pred ceEEEeeccCCccccc--cceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCccc Confidence 533 333333333333 3445677733221 122 133332 1224455553221 113445555556555544 Q ss_pred ccccccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcc Q lcl|NC_011802. 125 SNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSS 204 (472) Q Consensus 125 s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~ 204 (472) ...+.-+ .|.-. .-.|+ ....|..-=+=+| =.-| ...-.+.||+++++..-|+++-.- T Consensus 277 ~GL~~m~----------------NGimA-gF~Gn-eV~FsEpylPyAW---P~~Y-r~t~~~dIVaiA~~gt~LVV~TkG 334 (567) T protein:vir:33 277 TGLCLMA----------------NGIAA-GFAGN-EVMFSEAYLPYAW---PEVN-RHTTAEDIVAICPLGTSLVVATKG 334 (567) T ss_pred ceeeecc----------------cceEE-eecCC-EEEEecCCCCccc---chhh-ccCCCCCeEEEeecccEEEEEEcC Confidence 4333211 12211 11233 3444422111111 0111 122357899999999999998776 Q ss_pred eEEEEEecCCCCCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHH--HHHHhh Q lcl|NC_011802. 205 TIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI--EKIIRS 282 (472) Q Consensus 205 T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~i--E~~i~~ 282 (472) .- +--+|.+ |.+-.-++ .-+.-=|+.+.|+..++..+.|=|.|+- |...+.+++..+ |..| -+.+++ T Consensus 335 ~P--Yl~sG~s-P~sms~~k---L~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vv-T~~l~t~~qW~a 403 (567) T protein:vir:33 335 EP--YLFSGVS-PSTISGSK---IPSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALA-TEQIVSPEQWQS 403 (567) T ss_pred ce--EEEEcCC-hhhccccc---cccccccccccceeEeccEEEeecCCcE----EEEecCCchhhh-hhhccChHHHHh Confidence 66 5556643 44555555 3346789999999999999999999983 444444566555 4444 344543 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCe Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAV 359 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~ 359 (472) ++.-+-...++.||.=|-.-+=.+ .+.-||.... .=.++++ +|-+.+.=...++.++ .+++. T Consensus 404 ----~~~P~ti~A~~~eG~Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~ 468 (567) T protein:vir:33 404 ----QFNPASIVAYPWRGEYIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDK 468 (567) T ss_pred ----cCCcceEEEEeecCeEEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCE Confidence 233355666888999433333332 5788886532 2222222 2222222222233333 34455 Q ss_pred EEEEcCCccCCCCCEEEEEEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeE Q lcl|NC_011802. 360 TGQLQFDISSQYDKQQEHLLFTPIFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRV 438 (472) Q Consensus 360 l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~-~~le~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~ 438 (472) |++++... .|+..+-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+...|. T Consensus 469 l~~~~~g~-----~~~~~~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~ 532 (567) T protein:vir:33 469 MSVLAGGA-----LPSTIRWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV 532 (567) T ss_pred EeeecCCC-----CceeEEEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce Confidence 66644422 255556677887777653332 22322 111 11112222111111 221 334433443 Q ss_pred EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 439 IWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 439 ~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .||=-++-|. |+|.++...+|.---+.-.+| T Consensus 533 --~rlp~~~ar~-Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:33 533 --VRLPAATGQN-WQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred --eecCCcccce-EEEEEEecccEEEEEEecchh Confidence 3443334443 778888888876555555555 No 38 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=31.43 E-value=1.5 Score=19.62 Aligned_cols=397 Identities=11% Similarity=0.070 Sum_probs=181.8 Q ss_pred Cceeee-----------------------eecccCc-ccc---ccCCeeEE----eeeeeeecccc-cCcccceeEcCCC Q lcl|NC_011802. 1 MPIQQL-----------------------PMMKGMG-KDF---KNADYIDY----LPINMLATPKE-VLNSSGYLRSFPG 48 (472) Q Consensus 1 M~~~~v-----------------------Pl~~G~~-~~~---~~~d~~~~----~pvn~~~~~~e-~~~s~~~Lrs~PG 48 (472) |...++ |.....+ .+. +..|...- --|.-+-.+.. +..|+......|| T Consensus 125 ~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg 204 (567) T protein:vir:27 125 VTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPG 204 (567) T ss_pred eeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCC Confidence 333332 1111111 111 11221111 11111111111 2233333344455 Q ss_pred cee-eeecCCCccceeeeeccCeEEEEECcc---eeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhh Q lcl|NC_011802. 49 IAK-RNDVNGISRGVEYNTAQNAVYRVLGSK---LYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV 124 (472) Q Consensus 49 l~~-~~~v~g~~rg~~~~~~~~~lY~V~G~~---LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~ 124 (472) -.. ...++-++.+.. ...-++||=..+. =|. .+++++ -+..++.||.-+- --+..+.-+.|+.+...+ T Consensus 205 ~~V~ls~~p~~~~~~~--i~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m 276 (567) T protein:vir:27 205 TAVQLTLAPVPLQNAS--IKRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENM 276 (567) T ss_pred ceEEEeeccCCccccc--cceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCccc Confidence 533 333333333333 3445677733221 122 133332 1224455553221 113445555556555544 Q ss_pred ccccccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcc Q lcl|NC_011802. 125 SNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSS 204 (472) Q Consensus 125 s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~ 204 (472) ...+.-+ .|.-. .-.|+ ....|..-=+=+| =.-| ...-.+.||+++++..-|+++-.- T Consensus 277 ~GL~~m~----------------NGimA-gF~Gn-eV~FsEpylPyAW---P~~Y-r~t~~~dIVaiA~~gt~LVV~TkG 334 (567) T protein:vir:27 277 TGLCLMA----------------NGIAA-GFAGN-EVMFSEAYLPYAW---PEVN-RHTTAEDIVAICPLGTSLVVATKG 334 (567) T ss_pred ceeeecc----------------cceEE-eecCC-EEEEecCCCCccc---chhh-ccCCCCCeEEEeecccEEEEEEcC Confidence 4333211 12211 11233 3444422111111 0111 122357899999999999998776 Q ss_pred eEEEEEecCCCCCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHH--HHHHhh Q lcl|NC_011802. 205 TIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI--EKIIRS 282 (472) Q Consensus 205 T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~i--E~~i~~ 282 (472) .- +--+|.+ |.+-.-++ .-+.-=|+.+.|+..++..+.|=|.|+- |...+.+++..+ |..| -+.+++ T Consensus 335 ~P--Yl~sG~s-P~sms~~k---L~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vv-T~~l~t~~qW~a 403 (567) T protein:vir:27 335 EP--YLFSGVS-PSTISGSK---IPSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALA-TEQIVSPEQWQS 403 (567) T ss_pred ce--EEEEcCC-hhhccccc---cccccccccccceeEeccEEEeecCCcE----EEEecCCchhhh-hhhccChHHHHh Confidence 66 5556643 44555555 3346789999999999999999999983 444444566555 4444 344543 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCe Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAV 359 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~ 359 (472) ++.-+-...++.||.=|-.-+=.+ .+.-||.... .=.++++ +|-+.+.=...++.++ .+++. T Consensus 404 ----~~~P~ti~A~~~eG~Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~ 468 (567) T protein:vir:27 404 ----QFNPASIVAYPWRGEYIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDK 468 (567) T ss_pred ----cCCcceEEEEeecCeEEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCE Confidence 233355666888999433333332 5788886532 2222222 2222222222233333 34455 Q ss_pred EEEEcCCccCCCCCEEEEEEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeE Q lcl|NC_011802. 360 TGQLQFDISSQYDKQQEHLLFTPIFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRV 438 (472) Q Consensus 360 l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~-~~le~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~ 438 (472) |++++... .|+..+-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+...|. T Consensus 469 l~~~~~g~-----~~~~~~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~ 532 (567) T protein:vir:27 469 MSVLAGGA-----LPSTIRWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV 532 (567) T ss_pred EeeecCCC-----CceeEEEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce Confidence 66644422 255556677887777653332 22322 111 11112222111111 221 334433443 Q ss_pred EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 439 IWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 439 ~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .||=-++-|. |+|.++...+|.---+.-.+| T Consensus 533 --~rlp~~~ar~-Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:27 533 --VRLPAATGQN-WQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred --eecCCcccce-EEEEEEecccEEEEEEecchh Confidence 3443334443 778888888876555555555 No 39 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=31.43 E-value=1.5 Score=19.62 Aligned_cols=397 Identities=11% Similarity=0.070 Sum_probs=181.8 Q ss_pred Cceeee-----------------------eecccCc-ccc---ccCCeeEE----eeeeeeecccc-cCcccceeEcCCC Q lcl|NC_011802. 1 MPIQQL-----------------------PMMKGMG-KDF---KNADYIDY----LPINMLATPKE-VLNSSGYLRSFPG 48 (472) Q Consensus 1 M~~~~v-----------------------Pl~~G~~-~~~---~~~d~~~~----~pvn~~~~~~e-~~~s~~~Lrs~PG 48 (472) |...++ |.....+ .+. +..|...- --|.-+-.+.. +..|+......|| T Consensus 125 ~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg 204 (567) T protein:vir:99 125 VTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPG 204 (567) T ss_pred eeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCC Confidence 333332 1111111 111 11221111 11111111111 2233333344455 Q ss_pred cee-eeecCCCccceeeeeccCeEEEEECcc---eeeeeeeEEcccCceeEEEEcCCcEEEEEECCceeEEEEeccchhh Q lcl|NC_011802. 49 IAK-RNDVNGISRGVEYNTAQNAVYRVLGSK---LYKGETVVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV 124 (472) Q Consensus 49 l~~-~~~v~g~~rg~~~~~~~~~lY~V~G~~---LY~v~~~iGtv~gsg~VsMa~Ng~~~~iv~~g~~~~Y~~d~~~~t~ 124 (472) -.. ...++-++.+.. ...-++||=..+. =|. .+++++ -+..++.||.-+- --+..+.-+.|+.+...+ T Consensus 205 ~~V~ls~~p~~~~~~~--i~~~RIYRS~tg~~gtdy~---lVael~-as~~sf~D~~~~~--~lg~~Lps~~w~~PP~~m 276 (567) T protein:vir:99 205 TAVQLTLAPVPLQNAS--IKRRRIYRSASGGGEADFL---LVAELD-ASVLSYTDKIPAK--NLGPSLATWDYLPPPENM 276 (567) T ss_pred ceEEEeeccCCccccc--cceEEEEEecCCCCceeeE---EEEeec-cceeeeeeccchh--hcccccccccccCcCccc Confidence 533 333333333333 3445677733221 122 133332 1224455553221 113445555556555544 Q ss_pred ccccccccccCCcccccceeeeeceeEEEEecCCCeEEEEcccCCCCcCCccceeEeecCCCceEEEEecCCEEEEEEcc Q lcl|NC_011802. 125 SNWTADSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGTWRDFIVCFGSS 204 (472) Q Consensus 125 s~~~~d~~f~~~~~~~~~dv~~~dGyfv~~~~g~~~~~iS~L~D~s~~~~~l~fatAE~~pD~iv~~~~~~~~l~lfG~~ 204 (472) ...+.-+ .|.-. .-.|+ ....|..-=+=+| =.-| ...-.+.||+++++..-|+++-.- T Consensus 277 ~GL~~m~----------------NGimA-gF~Gn-eV~FsEpylPyAW---P~~Y-r~t~~~dIVaiA~~gt~LVV~TkG 334 (567) T protein:vir:99 277 TGLCLMA----------------NGIAA-GFAGN-EVMFSEAYLPYAW---PEVN-RHTTAEDIVAICPLGTSLVVATKG 334 (567) T ss_pred ceeeecc----------------cceEE-eecCC-EEEEecCCCCccc---chhh-ccCCCCCeEEEeecccEEEEEEcC Confidence 4333211 12211 11233 3444422111111 0111 122357899999999999998776 Q ss_pred eEEEEEecCCCCCccccceecccceeeeccccchhheecCceEEEEEeccccccEEEEccCccceecCCHHH--HHHHhh Q lcl|NC_011802. 205 TIEYFSLTGATTVGAALYVAHASLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASI--EKIIRS 282 (472) Q Consensus 205 T~Evw~~tGa~~~~~fpy~r~~~~~I~~Gca~~~sv~~~~~s~~wl~~d~~g~~~V~~~~g~q~~rIST~~i--E~~i~~ 282 (472) .- +--+|.+ |.+-.-++ .-+.-=|+.+.|+..++..+.|=|.|+- |...+.+++..+ |..| -+.+++ T Consensus 335 ~P--Yl~sG~s-P~sms~~k---L~~~qpCvS~rsiV~~~g~v~Yas~dGL----v~i~a~G~a~vv-T~~l~t~~qW~a 403 (567) T protein:vir:99 335 EP--YLFSGVS-PSTISGSK---IPSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDANGNVALA-TEQIVSPEQWQS 403 (567) T ss_pred ce--EEEEcCC-hhhccccc---cccccccccccceeEeccEEEeecCCcE----EEEecCCchhhh-hhhccChHHHHh Confidence 66 5556643 44555555 3346789999999999999999999983 444444566555 4444 344543 Q ss_pred cCchhhccEEEEEEEeCCEEEEEEECCC---eEEEEecccccCchheeeeccCccccceEeeeEeecCCeEEEEEccCCe Q lcl|NC_011802. 283 YTADELATGVMEALRLDSHELLIIHLPR---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAV 359 (472) Q Consensus 283 y~~~e~~~A~~~~~~~~GH~fy~lt~P~---~Tw~yD~~t~~w~~~w~~~~tg~~~~~~R~~~~~~~~g~~~vGD~~~g~ 359 (472) ++.-+-...++.||.=|-.-+=.+ .+.-||.... .=.++++ +|-+.+.=...++.++ .+++. T Consensus 404 ----~~~P~ti~A~~~eG~Y~a~Y~~~~g~~~~fifdp~~~----~~~~i~~-----~~~~~~~d~~~d~Ly~--~~~~~ 468 (567) T protein:vir:99 404 ----QFNPASIVAYPWRGEYIACYTKPDGKQDVFVFSPVNM----DIRYLST-----PFDCAWVDLAKDMMRV--VTGDK 468 (567) T ss_pred ----cCCcceEEEEeecCeEEEEEecCCCCcceEEEccccc----EEEEEec-----CceeEEEEeecCeEEE--eeCCE Confidence 233355666888999433333332 5788886532 2222222 2222222222233333 34455 Q ss_pred EEEEcCCccCCCCCEEEEEEeeccccCCCceEEE-EEEEEEcCCCCCchhheeeeccCccccCcceeeccCCCcccceeE Q lcl|NC_011802. 360 TGQLQFDISSQYDKQQEHLLFTPIFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRV 438 (472) Q Consensus 360 l~~ld~~~~~d~g~p~~~~~~tP~~~~~~~r~~~-~~le~~~Gv~~~~~~~~l~~sdDG~~~~~~~~~~~g~~g~~~~r~ 438 (472) |++++... .|+..+-.|+.+..+..--|. +.++. ..+ ..+.+-+-...|+. +++ .+|+...|. T Consensus 469 l~~~~~g~-----~~~~~~WrSK~f~~p~~~sf~~~rV~s--~~~-~~v~i~~~~dg~~v-------~~~-~~g~~~~~~ 532 (567) T protein:vir:99 469 MSVLAGGA-----LPSTIRWHSKIFSLPERTSFSCIRVKS--PAP-ERVGITIMADDVPV-------IHF-APGTFKGSV 532 (567) T ss_pred EeeecCCC-----CceeEEEecceEEecCccceeEEEEec--cCC-cceeEEEEEcCCce-------eec-CCccccCce Confidence 66644422 255556677887777653332 22322 111 11112222111111 221 334433443 Q ss_pred EEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 439 IWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 439 ~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) .||=-++-|. |+|.++...+|.---+.-.+| T Consensus 533 --~rlp~~~ar~-Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:99 533 --VRLPAATGQN-WQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred --eecCCcccce-EEEEEEecccEEEEEEecchh Confidence 3443334443 778888888876555555555 No 40 >protein:vir:100244 Length: 109 # NCBI annotation: gp73 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355409;genbank:gi:77864699;genbank:GeneID:3725966 Probab=24.10 E-value=1.2 Score=20.18 Aligned_cols=42 Identities=14% Similarity=0.001 Sum_probs=33.1 Q ss_pred cCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 428 QNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 428 ~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) |=++|+.++|+.+++....++..|+.+ ......+.-|||.++ T Consensus 1 mm~~g~L~~rI~i~~~~~~~d~~G~~~---~~~w~~~~~~wA~i~ 42 (109) T protein:vir:10 1 MLRSSDLTEFIVIERKGGRTNENGEPL---PDDWVTHDEVWASVR 42 (109) T ss_pred CCCccccCccEEEEeeeeccCCCCCee---ccceeeEEEEEEEEE Confidence 557799999999999998888777543 234556778999998 No 41 >protein:vir:5977 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:788 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690677;genbank:geneid:6329133;genbank:gi:22855071;interpro:IPR013045;uniprot:O48446;genbank:GeneID:955315 Probab=20.55 E-value=1.7 Score=19.39 Aligned_cols=39 Identities=8% Similarity=-0.154 Sum_probs=31.7 Q ss_pred CCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 430 EPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 430 ~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) +..++..|+++++.-..++..|....- =+.+.-+|+.+| T Consensus 1 ~~~~L~~RI~i~~~~~~~D~~G~~~~~----w~~~~~~WA~v~ 39 (109) T protein:vir:59 1 MYEEFPDVITFQSYVEQSNGEGGKTYK----WVDEFTAAAHVQ 39 (109) T ss_pred CccccCccEEEEeeeeeeCCCCCeeee----eEeeEEEEEEEe Confidence 889999999999999999888877751 223456888888 No 42 >protein:vir:193 Length: 112 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037703;genbank:gi:9634168;genbank:GeneID:1262533 Probab=20.02 E-value=1.8 Score=19.23 Aligned_cols=40 Identities=15% Similarity=0.037 Sum_probs=32.1 Q ss_pred cCCCcccceeEEEEeeEecccceeEEEEEEecCcceEEEeEEEeC Q lcl|NC_011802. 428 QNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) Q Consensus 428 ~g~~g~~~~r~~~~rlG~~r~~v~f~~r~~~~~~~~l~~~~~~~e 472 (472) |. +|+.++|+.++|....++..|.... .-..+.-|||.++ T Consensus 1 M~-~G~L~~rI~i~~~~~~~d~~G~~~~----~w~~~~~~wA~v~ 40 (112) T protein:vir:19 1 ME-PGRFRNRVKILTFTTSRDPSGQPVE----SWTGGNPVPAEVK 40 (112) T ss_pred CC-ccccCccEEEEeeeeeeCCCCCeec----ceEeEEEEEEEEE Confidence 44 8999999999999988887775543 4556778999998 Done!