Query lcl|NC_011811.1_cdsid_YP_002456075.1 [gene=Ea21-4_gp52] [protein=hypothetical protein] [protein_id=YP_002456075.1] [location=32281..33387] Match_columns 368 No_of_seqs 110 out of 160 Neff 7.9 Searched_HMMs 1612 Date Thu Nov 7 14:13:59 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_52 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_52_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95258 Length: 368 100.0 3E-132 2E-135 742.0 36.9 367 2-368 1-367 (368) 2 protein:vir:10324 Length: 320 100.0 2E-108 1E-111 611.0 31.3 318 20-368 1-318 (320) 3 protein:vir:6378 Length: 346 # 100.0 8.3E-66 5.1E-69 377.4 31.5 335 8-365 1-346 (346) 4 protein:vir:3424 Length: 341 # 100.0 3.9E-62 2.4E-65 357.2 30.5 324 8-365 1-341 (341) 5 protein:vir:393 Length: 341 # 100.0 2.1E-61 1.3E-64 353.2 28.7 327 8-365 1-341 (341) 6 protein:vir:96490 Length: 348 100.0 3.6E-52 2.3E-55 302.6 28.3 329 1-368 1-348 (348) 7 protein:vir:4902 Length: 348 # 100.0 1.2E-50 7.5E-54 294.2 28.1 328 1-368 1-348 (348) 8 protein:vir:2736 Length: 348 # 100.0 1.3E-49 8.1E-53 288.6 27.9 328 1-368 1-348 (348) 9 protein:vir:106590 Length: 349 100.0 3.8E-46 2.4E-49 269.6 26.0 325 1-365 6-349 (349) 10 protein:vir:98480 Length: 348 100.0 8.2E-43 5.1E-46 251.3 26.5 328 1-366 1-348 (348) 11 protein:vir:79503 Length: 409 99.9 7.6E-29 4.7E-32 174.7 23.1 347 1-368 18-393 (409) 12 protein:vir:78006 Length: 409 99.9 7.6E-29 4.7E-32 174.7 23.1 347 1-368 18-393 (409) 13 protein:vir:79078 Length: 307 99.2 5E-12 3.1E-15 82.6 16.6 298 1-365 2-307 (307) 14 protein:vir:107882 Length: 307 99.0 9.4E-11 5.8E-14 75.6 18.0 300 1-365 2-307 (307) 15 protein:vir:99888 Length: 309 98.6 2.7E-08 1.6E-11 62.1 17.3 302 1-366 1-309 (309) 16 protein:vir:98819 Length: 437 90.8 0.019 1.2E-05 30.0 12.4 347 1-368 1-420 (437) 17 protein:vir:103323 Length: 364 90.7 0.019 1.2E-05 30.0 22.6 315 1-368 1-340 (364) 18 protein:vir:94711 Length: 347 89.7 0.025 1.5E-05 29.4 17.7 317 1-368 1-347 (347) 19 protein:vir:105645 Length: 400 89.1 0.028 1.8E-05 29.1 15.2 301 1-368 21-334 (400) 20 protein:vir:108211 Length: 318 88.5 0.032 2E-05 28.8 17.3 288 1-368 16-318 (318) 21 protein:vir:9875 Length: 296 # 77.4 0.13 7.8E-05 25.5 11.3 274 1-368 1-296 (296) 22 protein:vir:8885 Length: 347 # 77.0 0.13 8.1E-05 25.4 16.5 312 1-368 1-347 (347) 23 protein:vir:99675 Length: 324 73.9 0.16 0.0001 24.9 11.6 270 37-368 1-297 (324) 24 protein:vir:7019 Length: 401 # 67.3 0.25 0.00016 23.8 15.9 307 1-368 21-334 (401) 25 protein:vir:100057 Length: 375 67.1 0.26 0.00016 23.8 19.5 319 1-368 9-371 (375) 26 protein:vir:9927 Length: 295 # 65.8 0.28 0.00017 23.6 13.2 271 1-368 1-289 (295) 27 protein:vir:105822 Length: 273 63.3 0.32 0.0002 23.3 19.0 269 1-367 1-273 (273) 28 protein:vir:102605 Length: 273 63.3 0.32 0.0002 23.3 19.0 269 1-367 1-273 (273) 29 protein:vir:1886 Length: 385 # 57.7 0.43 0.00027 22.6 20.4 277 1-368 105-385 (385) 30 protein:vir:191 Length: 385 # 57.7 0.43 0.00027 22.6 20.4 277 1-368 105-385 (385) 31 protein:vir:78935 Length: 335 53.9 0.52 0.00032 22.2 18.0 302 1-368 1-329 (335) 32 protein:vir:94622 Length: 341 52.6 0.55 0.00034 22.0 21.7 304 1-368 1-340 (341) 33 protein:vir:80213 Length: 334 51.5 0.58 0.00036 21.9 15.9 295 1-368 23-333 (334) 34 protein:vir:97031 Length: 402 50.6 0.61 0.00038 21.8 16.6 316 1-368 1-334 (402) 35 protein:vir:6324 Length: 335 # 43.6 0.84 0.00052 21.0 19.5 306 1-368 1-329 (335) 36 protein:vir:1541 Length: 347 # 39.8 1 0.00062 20.6 11.5 312 1-368 1-346 (347) 37 protein:vir:10450 Length: 344 36.4 1.2 0.00073 20.2 15.1 305 1-365 1-344 (344) 38 protein:vir:7990 Length: 273 # 22.8 2.4 0.0015 18.5 20.4 271 1-367 1-273 (273) 39 protein:vir:94576 Length: 347 22.2 2.5 0.0015 18.4 19.9 316 1-367 1-347 (347) No 1 >protein:vir:95258 Length: 368 # NCBI annotation: Phage conserved protein # Family: family:all:570 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944891;genbank:gi:38707831;genbank:GeneID:2744044 Probab=100.00 E-value=2.6e-132 Score=741.98 Aligned_cols=367 Identities=65% Similarity=1.078 Sum_probs=359.0 Q ss_pred cccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEEE Q lcl|NC_011811. 2 SLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVAF 81 (368) Q Consensus 2 ~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~f 81 (368) -|||||+|+||+++||++||++|++|++|++||||++++++|++|.||++++.++|+|+++||+++.++.++++|+++.| T Consensus 1 ~~d~f~~d~Fs~~~LT~ain~~p~~p~~l~~lglF~~~~v~t~~v~iE~~~~~l~Lvp~~~rg~~~~~~~~~~~r~~~~f 80 (368) T protein:vir:95 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISF 80 (368) T ss_pred CcccccCCcccHHHHHHHHHhcCCCcceecccccccCCCccceEEEEEEEcCeEEEccccCCCCCCcccccCCceeEEEE Confidence 78999999999999999999999999999999999999999999999999999999999999997667888888999999 Q ss_pred ecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhcCcc Q lcl|NC_011811. 82 PLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTFGVE 161 (368) Q Consensus 82 ~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~fG~~ 161 (368) ++|||+++|+|+|+||||+|+||+++++++++.++++||++||++|++|+||||+|||+|+|+|+||++++|||++||++ T Consensus 81 ~~ph~~~~d~I~a~eiQg~RafG~~~~l~~v~~~v~~kl~~~r~~~d~T~E~~r~gAL~G~ilDadGtvl~dly~eFGit 160 (368) T protein:vir:95 81 PMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQFDVE 160 (368) T ss_pred ecceeccccccchHHHccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCeeECCCCcEEecchhhhCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_011811. 162 KKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLITGSLK 241 (368) Q Consensus 162 ~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~ 241 (368) |++++|+|+++++||+++|++|+++|+++|+++++++.++++||||++||++|++||+|+++|+||++++.++.+++++| T Consensus 161 ~~~v~f~l~~~~tdv~~~~~~~~~~i~d~l~g~~~~~~~~v~alcg~~Ffd~L~~h~~Vkeay~~~~~a~~~~~lr~~~r 240 (368) T protein:vir:95 161 KKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLR 240 (368) T ss_pred cceEEEEeCCCCcCHHHHHHHHHHHHHHhhcccccccccceEEEEChHHHHHhhcChhHHHHHHHHHhhhhhhhhccccc Confidence 99999999999999999999999999999998888888899999999999999999999999999999999999999999 Q ss_pred cccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccchHhhc Q lcl|NC_011811. 242 TGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPANKMGYV 321 (368) Q Consensus 242 ~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~~~v 321 (368) .|...+....+++|+||||+|+||+|++++..+..+++++.+.+.|++|+||+||+|++.+.++|+|++||||||++++| T Consensus 241 ~g~~~~~~~~~~~F~fgGi~f~eYrg~~~~~~g~~~~~v~~d~v~I~~gea~~~P~G~~~~~~~~~F~~~~aPad~~e~v 320 (368) T protein:vir:95 241 TGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPKMGYA 320 (368) T ss_pred cccccccccccceeEecCEEEEEcceeecCCCcceeeeecCCceeeccCceEEEeecccccccCcceEEEecCCCcHhhc Confidence 99999999999999999999999999999999999999999999999999999999999899999999999999999999 Q ss_pred cCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 322 NTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 322 n~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) |+.|+|+|+|+|++++++|++|++||||||||+||++|+++|++|.- T Consensus 321 Nt~g~p~Ya~~~~~~~~~g~~le~qSnpLpic~RP~~lv~~~~~a~~ 367 (368) T protein:vir:95 321 NTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKG 367 (368) T ss_pred CCCcccccceeeeccCCCeeEEEEeecccchhcccceeEEEEecCCC Confidence 99999999999999999999999999999999999999999999888 No 2 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=100.00 E-value=2e-108 Score=611.03 Aligned_cols=318 Identities=25% Similarity=0.496 Sum_probs=294.4 Q ss_pred HHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEEEecceeccccccCHHHHhc Q lcl|NC_011811. 20 IANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVAFPLIYFKHIESITPEQVQG 99 (368) Q Consensus 20 i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~ 99 (368) ||.+|+.++++..| ||++++++|++|.||++++.++|+|+++||+++. +.+++++++++|++|||+++++|+|+|||| T Consensus 1 i~~~P~~~g~~~gl-ff~~~~v~T~~V~ie~~~~~l~lip~v~rg~~g~-~~~~~~~~~~~f~~p~~~~~d~i~a~eiq~ 78 (320) T protein:vir:10 1 MNLLPVNYGDSRAL-FAREKKVRTRTILVEEKNGVLTLIQSREPGSTEN-VAKRGKRKVRSFVIPHLPLEDVILPDEYEG 78 (320) T ss_pred CCcCCchhhhhhhh-ccCCCCcccceEEEEEecCceeeeeccCCCCCce-eecCCcceEEEEecceeccCCccCHHHHcC Confidence 99999998877555 4588899999999999999999999999999985 567889999999999999999999999999 Q ss_pred ccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhcCcccceEEEEcCCCCccHHHH Q lcl|NC_011811. 100 IRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTFGVEKKTVYFDLENPDADIDGA 179 (368) Q Consensus 100 ~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~ 179 (368) +|+||+ ++++++++++++++.+||++|++|+||||+|||+|+|+|+||+++||||++||++|++++|+|+++++|+.++ T Consensus 79 ~Ra~G~-~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~~l~~a~~dv~~~ 157 (320) T protein:vir:10 79 LRGFGT-TALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYFGLDNKDANVAES 157 (320) T ss_pred cccCCC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEEecCCCCccHHHH Confidence 999997 6889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCC Q lcl|NC_011811. 180 IDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRG 259 (368) Q Consensus 180 ~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~G 259 (368) |.+++++|+++++|..+ ++++|+||++||++|++||+|+++|+||+++... ++++ ...+|+||| T Consensus 158 ~~~~~~~i~~~l~g~~~---t~v~al~g~~f~~al~~h~~Vke~y~~~~~~~~~--l~~~-----------~~~~f~~gG 221 (320) T protein:vir:10 158 CRQVLRHVEDNLRGDVM---KDVSVDVSEEFFDKFIKHASVKEVFLNHEAAVNR--LGGD-----------TRKGFKFGG 221 (320) T ss_pred HHHHHHHHHHHhccCCC---CceEEEEChHHHHHHhcCHHHHHHHHhhhhhhhh--cccc-----------ccceEEecC Confidence 99999999999998755 4789999999999999999999999999876543 2222 234699999 Q ss_pred EEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccchHhhccCCCceeeEEEeeccCCC Q lcl|NC_011811. 260 VVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPANKMGYVNTLGQDLYVFEYAKDRDE 339 (368) Q Consensus 260 i~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~~~~~y~~~~~~~~~~ 339 (368) |+|++|+|+|.+.++..+.+ |++|+|++||.| ++|+|++||||+|+++++|+.|+|||+|+|++++++ T Consensus 222 i~~~~Y~g~~~d~~g~~~~~-------I~~~~~~~~p~g-----~~~~f~~~~apad~~e~vnt~g~p~y~k~~~~~~~~ 289 (320) T protein:vir:10 222 LIFNENRARHVDEEGKETRF-------IKAGKGHAFPTG-----TTNTFFTALAPADFNETAGTLGKRYYAKMEPRRMGR 289 (320) T ss_pred EEEEEcccEEEcCCCCeeEe-------ecCCeeEEEEec-----CchhheeeecccCcHhhcCCcccccccccccccCCC Confidence 99999999999988876655 578999999998 579999999999999999999999999999999999 Q ss_pred eeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 340 GTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 340 g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) |++|++||||||||+||++|+++|++|+- T Consensus 290 g~~l~~qS~PLpi~~rP~~lv~~~~~a~~ 318 (320) T protein:vir:10 290 GFDLHSQSNVLPMCCRPGVLVELDAAAQP 318 (320) T ss_pred eEEEEeeecccccccCcceEEEEEecCCC Confidence 99999999999999999999999999988 No 3 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=100.00 E-value=8.3e-66 Score=377.38 Aligned_cols=335 Identities=14% Similarity=0.062 Sum_probs=272.6 Q ss_pred CCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEEEecceec Q lcl|NC_011811. 8 GSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVAFPLIYFK 87 (368) Q Consensus 8 ~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~f~~p~~~ 87 (368) -|.|++.+|+++|+++|+. .+|.+++||+.+.+.|++|.||.+++.+.++|+++|+.++... +++++++..|++||++ T Consensus 1 ~d~f~~~~l~~~i~~~p~~-~~l~~~~fp~~~~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~-~~~g~~~~~~~~p~i~ 78 (346) T protein:vir:63 1 MEIFDTLTLAGVIQSGPAL-SMYWQGFYPNEITFDTDEILFDLVFKDKKLAPFVAPNVQGRVI-AARGYTTKTFRPAYVK 78 (346) T ss_pred CCccCHHHHHHHHHhcCCc-cchhhhcCccccccccceEEEEEecCceeeeeeecCCCCccee-cccceeeeEeecCccC Confidence 5778899999999999976 4578888888888999999999999999999999999998765 4557889999999999 Q ss_pred cccccCHHHHhcccC-----CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhcCccc Q lcl|NC_011811. 88 HIESITPEQVQGIRQ-----AGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTFGVEK 162 (368) Q Consensus 88 ~~~~i~a~dlq~~R~-----~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~fG~~~ 162 (368) +++.|+|+|++++|. +|+.+..+++.+.+++++.+|++.|++|+||||+|||+|..++.+|+...++..+||+.. T Consensus 79 ~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~~vdfg~~~ 158 (346) T protein:vir:63 79 PKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQRVDFGRDP 158 (346) T ss_pred ccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEEEEeeCCCc Confidence 999999999998664 566677888999999999999999999999999999986667777766677878899864 Q ss_pred -ceE----EEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhcccc Q lcl|NC_011811. 163 -KTV----YFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLIT 237 (368) Q Consensus 163 -~~v----~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~ 237 (368) +.+ +..|+++++||.+.|++|.++++++.++ ...+++||+++|++|++|++|+++|.+++......... T Consensus 159 ~~~~~lt~~~~W~~~~adp~~di~~~~~~~~~~~g~------~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~ 232 (346) T protein:vir:63 159 ALTVQLTGGAAWDQATSDPLGNIQTMRTTAWKKSNS------TITRLTMGLDAWSLFSQKPAVVELLNLFYKGSTSDFNR 232 (346) T ss_pred cceeeecccccCCCCCCCHHHHHHHHHHHHHHccCC------ceEEEEECHHHHHHHhcCHHHHHHHhhhccccccccch Confidence 233 3468899999999999999999886432 34589999999999999999999998765433221111 Q ss_pred ccccccccccc-ccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccc Q lcl|NC_011811. 238 GSLKTGRSDGV-ATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPAN 316 (368) Q Consensus 238 ~~~~~g~~~~~-~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~ 316 (368) ..+..+..... ......+.++|+.|+.|+++|.+.+|..+.+ ++++++.++|.|. .+.++|||.. T Consensus 233 ~~l~~~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d~~G~~~~~-------ip~~~v~~~p~~~-------~g~~~yg~~~ 298 (346) T protein:vir:63 233 SRLDDGSPVQYQGTIGGYNGMGTLELYTYHDTYTGDDNTEQEI-------LGSYDVVGTGPGL-------QGTQCFGAIM 298 (346) T ss_pred hhcccchhhhhhhhHhhhhccCCeEEEEeccEEEcCCCceecc-------ccCCeEEEEecCC-------cceEEEeecc Confidence 11111100000 0111235679999999999999988876665 4778888999763 3457899987 Q ss_pred hHhhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEee Q lcl|NC_011811. 317 KMGYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 317 ~~~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~ 365 (368) +.+ .|+.+.++|+++|..++|+++++++||+|||+|.+|++++.+|++ T Consensus 299 d~~-~~~~~~~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 299 DFK-NGLVPTRMFPKMWEEEDPSVAMLMTQSAPLMVPAQPNASFRMTVK 346 (346) T ss_pred ccc-cCcccceeeeEEEEecCCCEEEEEEeeeccceecCCCcEEEEEeC Confidence 766 488999999999999999999999999999999999999999999 No 4 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=100.00 E-value=3.9e-62 Score=357.25 Aligned_cols=324 Identities=14% Similarity=0.090 Sum_probs=251.5 Q ss_pred CCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEEEecceec Q lcl|NC_011811. 8 GSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVAFPLIYFK 87 (368) Q Consensus 8 ~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~f~~p~~~ 87 (368) -|+|++.+|+++++++|+.+++|++++|++++.+.|++|.||++++.+.++|+|+|+.++..+ +++++++++|++|||+ T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~d~~fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~-~~~~~~~~~~~~p~i~ 79 (341) T protein:vir:34 1 MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVI-RSRGGSTSEFTPGYVK 79 (341) T ss_pred CCCcCHHHHHHHHHhccCccchhHHhcCCcccccccceEEEEEeeCCeeEEEeecCCCCccee-ccCceeeeEEecCccC Confidence 456888999999999999999999998888888999999999999999999999999998655 5568899999999999 Q ss_pred cccccCHHHHhcccCCCC-----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE-EccCCC--EEEeeehhc Q lcl|NC_011811. 88 HIESITPEQVQGIRQAGT-----AAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKV-VDSKGF--LWADMYQTF 158 (368) Q Consensus 88 ~~~~i~a~dlq~~R~~G~-----~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i-~d~dG~--~~~d~~~~f 158 (368) +++.|+|+|++ .|.+|+ .+.+++..+.+.+++.+|++.|++|+||||+|||+ |+| ++++|. +.+| | T Consensus 80 ~~~~i~~~d~~-~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vD----f 154 (341) T protein:vir:34 80 PKHEVNPQMTL-RRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVD----M 154 (341) T ss_pred ccceeCHHHHH-HHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEE----e Confidence 99999999999 488874 35677788899999999999999999999999995 998 566663 4566 5 Q ss_pred Cccc-ceEE----EEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhh Q lcl|NC_011811. 159 GVEK-KTVY----FDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSY 233 (368) Q Consensus 159 G~~~-~~v~----~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~ 233 (368) |+.. +.++ -.|+++++++...+++|.+.+++ .+ .+..+++||+++|++|+.|++|+++|.++...... T Consensus 155 g~~~~~~~~~t~~~~W~~~~~~~~d~l~di~~~~~~----~g---~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~ 227 (341) T protein:vir:34 155 GRSEENNITQSGGTEWSKRDKSTYDPTDDIEAYALN----AS---GVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSE 227 (341) T ss_pred CCCCccceEecCCccCCcCCCchHHHHHHHHHHHHh----cC---CceEEEEeCHHHHHHHhcCHHHHHHHhhccccccc Confidence 6543 2333 34676666666666666554432 22 23568999999999999999999999875432211 Q ss_pred cccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEec Q lcl|NC_011811. 234 GLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYG 313 (368) Q Consensus 234 ~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~a 313 (368) . ..............+.++|+.|+.|+++|.+ +|..+++ +|+|++.++|.|. . ..++|| T Consensus 228 ~------~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~d-dG~~~~~-------ip~~~v~l~p~g~-----~--g~~~yg 286 (341) T protein:vir:34 228 L------ETAVKDLGKAVSYKGMYGDVAIVVYSGQYVE-NGVKKNF-------LPDNTMVLGNTQA-----R--GLRTYG 286 (341) T ss_pred c------cccccccccceeeeeecCCceEEEEcCEEEE-CCcEEee-------ecCCeEEEeeCCC-----c--ceEEEe Confidence 0 0001111122233346899999999999986 4655544 5788999999763 2 357888 Q ss_pred ccchHhhccC--CCceeeEEEee-ccCCCeeEEEeeecccccccCcceEEEEEee Q lcl|NC_011811. 314 PANKMGYVNT--LGQDLYVFEYA-KDRDEGTDFEAHSYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 314 pa~~~~~vn~--~~~~~y~~~~~-~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~ 365 (368) +..+.+..+. ...++|+++|. .++|+++++++||+|||+|.||++++++|+. T Consensus 287 ~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 287 CIQDADAQREGINASARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred ecccccccccceeeeeEeeeeeeecCCCcEEEEEEcccceeeeeCCCcEEEEEeC Confidence 7665554433 35689999985 4589999999999999999999999999999 No 5 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=100.00 E-value=2.1e-61 Score=353.20 Aligned_cols=327 Identities=13% Similarity=0.065 Sum_probs=248.9 Q ss_pred CCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEEEecceec Q lcl|NC_011811. 8 GSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVAFPLIYFK 87 (368) Q Consensus 8 ~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~f~~p~~~ 87 (368) -|+|++.+|+++|+++|+.+++|.+++|.+++.++|.+|.||.+++.+.++|+++|+.++.. .+++++++++|++|||+ T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~~~~Fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~-~~~~~~~~~~~~~p~i~ 79 (341) T protein:vir:39 1 MSVYTTAQLLAVNEKKFKFDPLFLRIFFRETYPFSTEKVYLSQIPGLVNMALYVSPIVSGKV-IRSRGGSTSEFTPGYVK 79 (341) T ss_pred CCccCHHHHHHHHHhhcCccchhHhhcCCcccccCcceEEEEEecCCceeeEEecCCCCcce-ecccceeeeeEeccccC Confidence 45678899999999999999999999544566778999999999999999999999999865 55677899999999999 Q ss_pred cccccCHHHHhcccCCCC-----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCcE-EccCCC--EEEeeehhc Q lcl|NC_011811. 88 HIESITPEQVQGIRQAGT-----AAELTTEAMVRARKLQKIRMTHDITKEFLLMQAL-KGKV-VDSKGF--LWADMYQTF 158 (368) Q Consensus 88 ~~~~i~a~dlq~~R~~G~-----~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL-~G~i-~d~dG~--~~~d~~~~f 158 (368) +++.|+++|++. |.+|+ .+.++...+.+.+++.+|+++|++|+||||+||| +||| ++++|. +.+||...+ T Consensus 80 ~~~~i~~~d~~~-r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDfg~~~ 158 (341) T protein:vir:39 80 PKHEVNPLMTLR-RLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDMGRSA 158 (341) T ss_pred cccccCHHHHHH-HhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEeccCCc Confidence 999999999984 66763 3567777888999999999999999999999999 5998 677774 456643332 Q ss_pred Cccc-ceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhh-hccc Q lcl|NC_011811. 159 GVEK-KTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSS-YGLI 236 (368) Q Consensus 159 G~~~-~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~-~~~~ 236 (368) +... .+.+..|+++++++...++++.+.++ ..+ ..+.+++||+++|++|++|++|+++|.++..... +... T Consensus 159 ~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~-~~g------~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~ 231 (341) T protein:vir:39 159 GNNIVQAGAAAWSSRDKETYDPTDDIEAYAL-NAS------GVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETA 231 (341) T ss_pred cceeEecCCccCCCCCCchHHHHHHHHHHHH-hcC------CceEEEEeChHHHHHHhcCHHHHHHHhhcccccccccch Confidence 2111 11234477777776666777654443 222 2356899999999999999999999987543221 1111 Q ss_pred ccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccc Q lcl|NC_011811. 237 TGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPAN 316 (368) Q Consensus 237 ~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~ 316 (368) ..++. ........++|+.|+.|+++|.+. |..++ .+++|++.++|.|. ++.++|||.. T Consensus 232 ~~~~~-------~~~~~~~~~~g~~i~~y~~~y~d~-g~~~~-------~ip~~~~~l~p~~~-------~g~~~yg~~~ 289 (341) T protein:vir:39 232 LKDLG-------KAVSYKGMYGDVAIVVYSGQYIEN-DVKKN-------YLPDLTMVLGNTQA-------RGLRTYGCIL 289 (341) T ss_pred hhhhh-------hHhhhhhhhcCceEEEEccEEEec-CcEEe-------eecCCeEEEeeCCC-------cceEEEeccc Confidence 11111 111122357999999999999873 44444 35788888998763 2357788766 Q ss_pred hHhhcc--CCCceeeEEEeecc-CCCeeEEEeeecccccccCcceEEEEEee Q lcl|NC_011811. 317 KMGYVN--TLGQDLYVFEYAKD-RDEGTDFEAHSYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 317 ~~~~vn--~~~~~~y~~~~~~~-~~~g~~l~~eS~pLpv~~rP~alv~~t~~ 365 (368) +++..+ ..+.++|+++|..+ +|+++++++||+|||+|+||++++++|++ T Consensus 290 d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 290 DADAQREGINASTRYPKNWVQTGDPAREFTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred chhhcccceeeeeeeeeeeeecCCCcEEEEEEeccccceeeCCCcEEEEEeC Confidence 555442 35678999998755 89999999999999999999999999999 No 6 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=100.00 E-value=3.6e-52 Score=302.59 Aligned_cols=329 Identities=14% Similarity=0.120 Sum_probs=250.0 Q ss_pred CcccccCCCcccHHHHHHHHHhcCC-CccchhhcCcccccCcccce-EEEEEEeCceeeeeccCCCCCccccccCCceeE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPN-TYGYVNQLDLFRSVPTSQTS-VLLDITDYGISLLDPVDRDTRNAESSAPESLRQ 78 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~-~~~~l~~l~lF~~~~~~t~~-v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~ 78 (368) |+. ++ |.|+..+|++.|+.+|+ ...+|.+. ||+.+++.+.+ +.++..++...++|++++++++ .+.++++.+. T Consensus 1 M~~-i~--d~f~~~~l~~~i~~~~~~~~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~-~~~~r~~~~~ 75 (348) T protein:vir:96 1 MGL-IY--DKVTASNIAGYFNTLQENVDSTLGES-IFPARKQLGTKLSYIKGASGQSVALKAAAFDTNV-TIRDRVSAEI 75 (348) T ss_pred Ccc-hh--hccCHHHHHHHHHhcccchhhhhhhh-cCCCccccceeEEEEeecCCceeEeeeecCCCCc-ceecccceee Confidence 664 43 57999999999999985 45677775 78877665444 3455566677789999999987 4567788899 Q ss_pred EEEecceeccccccCHHHHhccc---CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEccCCCEEEee Q lcl|NC_011811. 79 VAFPLIYFKHIESITPEQVQGIR---QAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKVVDSKGFLWADM 154 (368) Q Consensus 79 ~~f~~p~~~~~~~i~a~dlq~~R---~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i~d~dG~~~~d~ 154 (368) .+|++||++++..+++.|++.++ ..|+....+++.+.+++++.+|++.+++|.||||+|||+ |+|...+++. ++ T Consensus 76 ~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~--~~ 153 (348) T protein:vir:96 76 HDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGV--NK 153 (348) T ss_pred eeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCe--eE Confidence 99999999999999999987664 455666678888999999999999999999999999996 9985443332 34 Q ss_pred ehhcCcccc---eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhh Q lcl|NC_011811. 155 YQTFGVEKK---TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVS 231 (368) Q Consensus 155 ~~~fG~~~~---~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~ 231 (368) ..+||+... +.+..|+++++||.++|++|.+++++. | ...-+++||+++|++|++|++|+++|.++.... T Consensus 154 ~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~--G-----~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~ 226 (348) T protein:vir:96 154 DIDYGVKADHKKQVSKSWAEPGATPLADLEDAIETAREL--G-----LNPERAIMNAKTFGLIRKAASTVKAIKPLAGDG 226 (348) T ss_pred EEeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhc--C-----CcccEEEeCHHHHHHHhcCHHHHHHHhccCCcc Confidence 455887532 456789999999999999999888652 2 234589999999999999999999997653221 Q ss_pred hhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEE Q lcl|NC_011811. 232 SYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQIS 311 (368) Q Consensus 232 ~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~ 311 (368) .. ..+..+.. ....++|+.|+.|+++|.+.+|..++++ +++...++|.| .+..++ T Consensus 227 ~~-~~~~~~~~----------~~~~~~g~~i~~y~~~y~d~~G~~~~~~-------p~~~v~l~~~~-------~~G~~~ 281 (348) T protein:vir:96 227 SS-VTKAELQN----------YVADNYGVEIVLENGTYRNEKGEVSKFF-------PDGHLTLIPNG-------PLGNTV 281 (348) T ss_pred cc-ccHHHHHH----------HHhhhcCceEEEEccEEEecCCcEeccc-------cCCeEEEEcCC-------CceeEE Confidence 11 11111110 0114589999999999999888777665 55666677755 235688 Q ss_pred eccc-chHh--h-------ccCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 312 YGPA-NKMG--Y-------VNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 312 ~apa-~~~~--~-------vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) |||. ...+ . ++..+..+|.+.|.++||.+.++++||+|||++.+|++++.+|+-+|- T Consensus 282 yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 282 FGTTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred eccChhhhhhhhcccccccceecCCeeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 9874 2111 1 111233489999999999999999999999999999999999999999 No 7 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=100.00 E-value=1.2e-50 Score=294.24 Aligned_cols=328 Identities=15% Similarity=0.139 Sum_probs=250.3 Q ss_pred CcccccCCCcccHHHHHHHHHhcCC-CccchhhcCcccccCc-ccceEEEEEEeCceeeeeccCCCCCccccccCCceeE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPN-TYGYVNQLDLFRSVPT-SQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQ 78 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~-~~~~l~~l~lF~~~~~-~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~ 78 (368) |+- || |.|+..+|+..|+.+|. ...+|.++ ||+.+.+ .+..+.++..++...++|++++++++. +.++++.+. T Consensus 1 M~~-l~--d~f~~~~l~~~v~~~~~~~~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~~r~~~~~ 75 (348) T protein:vir:49 1 MGL-IY--DKVTASNIAGYFNALQENVDSTLGES-IFPARKQLGTKLSYITGASGQSVALKAAAFDTNVT-VRDRVSAEM 75 (348) T ss_pred Ccc-hh--hhcCHHHHHHHHHhccccchhhhHhh-cCCCccccCceeEEEEeecCceeeeeeecCCCCcc-eecccceee Confidence 553 44 56999999999999975 45678776 6766554 566778899999999999999999874 566788899 Q ss_pred EEEecceeccccccCHHHHhcccC---CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE-EccCCCEEEe Q lcl|NC_011811. 79 VAFPLIYFKHIESITPEQVQGIRQ---AGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKV-VDSKGFLWAD 153 (368) Q Consensus 79 ~~f~~p~~~~~~~i~a~dlq~~R~---~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i-~d~dG~~~~d 153 (368) .+|++||++++..+++.|+++++. .++..+.+.+.+.+.+++++|++.|++|.||||+|||+ |++ ++++|. + T Consensus 76 ~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~---~ 152 (348) T protein:vir:49 76 HDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGV---N 152 (348) T ss_pred eeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCc---e Confidence 999999999999999999776654 35545556677888999999999999999999999996 988 566663 3 Q ss_pred eehhcCcccc---eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhh Q lcl|NC_011811. 154 MYQTFGVEKK---TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAV 230 (368) Q Consensus 154 ~~~~fG~~~~---~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~ 230 (368) +..+||+... +.+..|+++++||.++|++|.+++++. |. ..-+++||+++|++|++|++|++++.++... T Consensus 153 ~~vdyg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~--G~-----~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~ 225 (348) T protein:vir:49 153 KDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETAREL--GL-----NPERAVMNAKTFGLIRKAASTVKVIKPLAGD 225 (348) T ss_pred EEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhc--CC-----cccEEEeCHHHHHHHhcCHHHHHHhhccCcc Confidence 3445787532 456789999999999999999988652 22 2347899999999999999999998664322 Q ss_pred hhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeE Q lcl|NC_011811. 231 SSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQI 310 (368) Q Consensus 231 ~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~ 310 (368) .. ...+..+..+ .-.++|+.|+.|+++|.+.+|..++++ |++...++|.| ....+ T Consensus 226 ~~-~i~~~~~~~~----------~~~~~g~~i~~y~~~y~d~dG~~~~~~-------p~~~v~l~~~~-------~~G~~ 280 (348) T protein:vir:49 226 GS-SVTKAELDNY----------IADNFGVTVVLENGTYRNEKGEVSKFF-------PDGHLTLIPNG-------PLGNT 280 (348) T ss_pred cc-cccHHHHHHH----------HHhhcCceEEEEeeEEEecCCcEeeee-------cCCeEEEecCC-------Cccee Confidence 11 1111111110 114689999999999999888777665 55666677755 23467 Q ss_pred Eecccch-Hhhc--cCC-------CceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 311 SYGPANK-MGYV--NTL-------GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 311 ~~apa~~-~~~v--n~~-------~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) +|||.-. .+.+ ++. +--+|.+.|..+||.++++++||+|||++.+|++++.+|+-+|- T Consensus 281 ~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 281 VFGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EEecChhhhhhccccccccceeecCCeEEEeeeecCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 8887422 1111 111 12388999999999999999999999999999999999999999 No 8 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=100.00 E-value=1.3e-49 Score=288.59 Aligned_cols=328 Identities=15% Similarity=0.138 Sum_probs=246.6 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCC-ccchhhcCcccccCcccce-EEEEEEeCceeeeeccCCCCCccccccCCceeE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNT-YGYVNQLDLFRSVPTSQTS-VLLDITDYGISLLDPVDRDTRNAESSAPESLRQ 78 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~-~~~l~~l~lF~~~~~~t~~-v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~ 78 (368) |+- + .|.|+..+|++.|+++|+. ..+|.+. ||+.+.+.+.+ +.++..++...++|++++++++ .+.++++.+. T Consensus 1 M~~-i--~d~f~~~~l~~~v~~~~~~~~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~-~~~~r~~~~~ 75 (348) T protein:vir:27 1 MGL-I--YDKVTASNIAGYFNALQENVSSTLGES-IFPARKQLGTKLSYIKGASGQSVALKAAAFDTNV-TIRDRVSAEM 75 (348) T ss_pred Ccc-h--hhhcCHHHHHHHHHhccchhhhhhHhh-cCCCccccceeEEEEeeccCceeEeeeecCCCCc-ceecccceee Confidence 543 3 3689999999999999754 5678775 77776655444 4445666667789999999987 5667788899 Q ss_pred EEEecceeccccccCHHHHhcc---cCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE-EccCCCEEEe Q lcl|NC_011811. 79 VAFPLIYFKHIESITPEQVQGI---RQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKV-VDSKGFLWAD 153 (368) Q Consensus 79 ~~f~~p~~~~~~~i~a~dlq~~---R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i-~d~dG~~~~d 153 (368) .+|++||++++..|+++|++++ +..++..+.+++.+.+.+++++|++.|++|.||||+|||+ |++ ++.+|. + T Consensus 76 ~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~---~ 152 (348) T protein:vir:27 76 HDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGV---N 152 (348) T ss_pred eeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCe---e Confidence 9999999999999999998765 5555555666778889999999999999999999999996 998 455552 2 Q ss_pred eehhcCcccc---eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhh Q lcl|NC_011811. 154 MYQTFGVEKK---TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAV 230 (368) Q Consensus 154 ~~~~fG~~~~---~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~ 230 (368) +-.+||+... +.+..|+++++||.++|++|.+++++. | ...-+++||+++|++|++|++|++++.+.... T Consensus 153 ~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~--G-----~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~ 225 (348) T protein:vir:27 153 KDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETAREL--G-----LNPERAVMNAKTFGLIRKAASTVKVIKPLAGD 225 (348) T ss_pred EEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhc--C-----CcccEEEECHHHHHHHhcCHHHHHHhcccCcc Confidence 2234676432 445679999999999999999988642 2 23457899999999999999999998764322 Q ss_pred hhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeE Q lcl|NC_011811. 231 SSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQI 310 (368) Q Consensus 231 ~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~ 310 (368) ... .....+. ...-.++|+.|+.|+++|.+.+|..++++ |++...++|.| .+..+ T Consensus 226 ~~~-i~~~~~~----------~~~~~~~g~~i~~yd~~y~d~~G~~~~~~-------p~~~vvl~~~~-------~~G~~ 280 (348) T protein:vir:27 226 GSA-VTKAELE----------NYIADNFGVSIVLENGTYRNDKGEVSKFY-------PDGHLTLIPNG-------PLGNT 280 (348) T ss_pred ccc-cCHHHHH----------HHHHhhcCceEEEEeeEEEcCCCcCcccc-------cCCeEEEEcCC-------cceeE Confidence 111 0011110 00114689999999999999888877765 55666677755 23457 Q ss_pred Eeccc-chHhhcc--C-------CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 311 SYGPA-NKMGYVN--T-------LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 311 ~~apa-~~~~~vn--~-------~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) .|||. +..+.+. + .+..+|.+.|..+||.+.++++||+|||++.+|++++.+|+-+|- T Consensus 281 ~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 281 VFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EeccCcchhhhhhccccccceeeeCCeeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 78864 3222221 1 123388999999999999999999999999999999999999999 No 9 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=100.00 E-value=3.8e-46 Score=269.58 Aligned_cols=325 Identities=17% Similarity=0.156 Sum_probs=238.4 Q ss_pred CcccccC-----CCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceE-EEEEEeCceeeeeccCCCCCccccccCC Q lcl|NC_011811. 1 MSLTLAN-----GSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSV-LLDITDYGISLLDPVDRDTRNAESSAPE 74 (368) Q Consensus 1 m~~~~f~-----~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v-~ie~~~~~~~l~p~v~rg~~~~~~~~~~ 74 (368) |.|++-+ .|.|+...|++.++.+|. +++|.+. ||+.+.+....+ .++..++...++|++++++++. +.+++ T Consensus 6 ~~~~~~~~~~~~~d~~~~~~l~~~~~~~~~-~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~~r~ 82 (349) T protein:vir:10 6 LQLDLQRFATPILDMFSQNTVLDYTRNRQY-PEMLGDT-LFPAVKVPTLEVDILKAGSRVPTIASVSAFDAEAE-IGTRE 82 (349) T ss_pred hhHHHHHHHHHhhcccCHHHHHHHHHhcCc-chhhHhh-cCCccccccceeEEEeeccCcceeeeeecCCCCcc-eeccc Confidence 6666555 678999999999999997 5788887 666665554443 4455667778899999999864 55655 Q ss_pred ceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEccCCCEEEe Q lcl|NC_011811. 75 SLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKVVDSKGFLWAD 153 (368) Q Consensus 75 ~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i~d~dG~~~~d 153 (368) + ....+++|+++++..+++.|++.+|.+++..+.+.+.+.+++.+.+|++.+++|+||||+|||+ |+|...++++.+| T Consensus 83 ~-~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~g~~vD 161 (349) T protein:vir:10 83 A-SKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKNGIAID 161 (349) T ss_pred c-eeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCCcEEEe Confidence 5 4568999999999999999999999999888888889999999999999999999999999996 9986655555566 Q ss_pred eehhcCcccc-eE----EEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHH Q lcl|NC_011811. 154 MYQTFGVEKK-TV----YFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQ 228 (368) Q Consensus 154 ~~~~fG~~~~-~v----~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~ 228 (368) ||+.+. .+ +-.|+++++||.+.|++|++.+ |. ..-++++|+++|++|+.|++|++++.+.. T Consensus 162 ----~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~~~----g~------~p~~~vm~~~~~~~l~~~~~i~~~~~~~~ 227 (349) T protein:vir:10 162 ----YGVPKKHQETLSGTKTWDKSDASIIDNLQDWSDSL----DV------TPTRALTSKKVLRILMRSTEIKEAIFGKD 227 (349) T ss_pred ----cccCccceeEecCcccCCCCCCCHHHHHHHHHHHh----CC------CccEEEeCHHHHHHHhcCHHHHHHhcccc Confidence 676542 22 3468999999999998886543 22 23478999999999999999999986543 Q ss_pred hhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccce Q lcl|NC_011811. 229 AVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLY 308 (368) Q Consensus 229 ~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f 308 (368) ..... ....+. ...-.++|+.++.|+++|.+.++....- ....+|++...++|.+ .+. T Consensus 228 ~~~~~--~~~~~~----------~~l~~~~~~~i~~yd~~y~d~~~~~~~t---~~~~~p~~~v~l~~~~-------~~G 285 (349) T protein:vir:10 228 TGRVV--GQADLD----------QWMTAQGLPIIRAYDGKYRDEDSRGNLT---TNSYFPEDRIVLFNDE-------VPG 285 (349) T ss_pred ccccc--CHHHHH----------HHHHhcCCceEEEEeeEEEeecCCCcee---ecccccCCeEEEecCC-------Cce Confidence 21110 000000 0011468999999999999876643211 1123466666666644 345 Q ss_pred eEEecccchHhhcc-C-C----CceeeEEE-eeccCCCeeEEEeeecccccccCcceEEEEEee Q lcl|NC_011811. 309 QISYGPANKMGYVN-T-L----GQDLYVFE-YAKDRDEGTDFEAHSYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 309 ~~~~apa~~~~~vn-~-~----~~~~y~~~-~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~ 365 (368) .++||+........ + . ..+++.+. +.+++|.+.++++||+|||++.+|++++.+|+- T Consensus 286 ~~~yG~~~e~~~~~~g~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 286 QKIYGPTPEENRLISSNAQVSNVGNIMAKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred eEEeeccchhhhhcccccceeeccceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 78899863322111 1 1 12345554 467899999999999999999999999999999 No 10 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=100.00 E-value=8.2e-43 Score=251.33 Aligned_cols=328 Identities=14% Similarity=0.064 Sum_probs=238.6 Q ss_pred CcccccCCCcccHHHHHHHHHhcC---CCccchhhcCcccccCcccceEEEEEEeC---ceeeeeccCCCCCccccccCC Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIP---NTYGYVNQLDLFRSVPTSQTSVLLDITDY---GISLLDPVDRDTRNAESSAPE 74 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p---~~~~~l~~l~lF~~~~~~t~~v~ie~~~~---~~~l~p~v~rg~~~~~~~~~~ 74 (368) |+..+. .|.|+..+|++.|+..| +.+++|.+. ||+.+. +..+.++..++ ....++++++++++ .+.+++ T Consensus 1 M~~~~~-~d~~~~~~l~~~i~~~~~~~~~~~~l~~~-~fp~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~~-~~~~r~ 75 (348) T protein:vir:98 1 MSWTLD-TEFIEPTQLTGLIREALRDLQVNRFRLAR-WLPNVD--VDDITFEFLRGGGGLAETASYRSWDTES-KIGRRE 75 (348) T ss_pred Ccchhh-hhccCHHHHHHHHHHHhhccCcchhhHHh-cCCCcc--ccceEEEEEeccCCceeeeeeecCCCcc-ceeecc Confidence 999886 56999999999999886 456788886 676654 44566666543 34567999998876 566667 Q ss_pred ceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE-EccCCCEEE Q lcl|NC_011811. 75 SLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKV-VDSKGFLWA 152 (368) Q Consensus 75 ~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i-~d~dG~~~~ 152 (368) +.+...+++|+++++..++++|++.+|. ...+.+.+.+.+.+.+|++.+++|.||||+|||+ |++ +++++. .+ T Consensus 76 g~~~~~~~~~~i~~~~~i~~~d~~~~~~----~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~-~v 150 (348) T protein:vir:98 76 GLAKVMGELPPISEKIPLNEYDRLRLRK----LSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQ-TV 150 (348) T ss_pred cceeeeeeccccccccccCHHHHHHhcC----ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCce-EE Confidence 7788899999999999999999987763 3446677888999999999999999999999996 988 444443 34 Q ss_pred eeehhcCcccc---eEEEEcC-CCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHH Q lcl|NC_011811. 153 DMYQTFGVEKK---TVYFDLE-NPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQ 228 (368) Q Consensus 153 d~~~~fG~~~~---~v~~~l~-~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~ 228 (368) | ||+... +.+..|+ .+++||.+.|++|++.+++..| ...-++++|+++|++|++|++|++.+.++. T Consensus 151 D----yg~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G------~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~ 220 (348) T protein:vir:98 151 D----FGRIGSHSVVAAVLWSVHATATPISDLESWVATYEDTNG------QSPGVILMPKAAVSHMRQCEEVIRQVFPLA 220 (348) T ss_pred c----cccCcccccccccccCCCCCCCHHHHHHHHHHHHHHccC------CcceEEEeCHHHHHHHhcCHHHHHHHhccC Confidence 4 677542 3456785 5789999999999999987632 234589999999999999999999987653 Q ss_pred hhhh-hcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeeccc--ccccc Q lcl|NC_011811. 229 AVSS-YGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAM--LGEAN 305 (368) Q Consensus 229 ~~~~-~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~--~~~~~ 305 (368) .... .......+. ...-.+|+..++.|+.+|.+. |...+++ |+|...++|.+.. +.... T Consensus 221 ~~~~~~~~~~~~~~----------~~~~~~g~~~i~~~d~~~~~~-g~~~~~~-------p~~~i~l~p~~~~~~~~~~~ 282 (348) T protein:vir:98 221 PSGTAPMVSVEQLN----------TVLSSMGLPPIEVYDAKVAVD-GVSTRIT-------PANAIALLPEPGATDAAQPT 282 (348) T ss_pred ccccccccCHHHHH----------HHHHhhCCeEEEEeeeEEEcC-Cceecee-------cCCeEEEEecCCcccccccc Confidence 2111 111111110 001136888999999988764 5555554 4555556665421 11122 Q ss_pred cceeEEecccchHhhccCC-----CceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeee Q lcl|NC_011811. 306 DLYQISYGPANKMGYVNTL-----GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADK 366 (368) Q Consensus 306 ~~f~~~~apa~~~~~vn~~-----~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~a 366 (368) .+-.++|||.......+.. .-.+|.+.|..++|.++++++||+|||++.+|++++.+|+-| T Consensus 283 ~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 283 ELGATLLGTTAESLEDDYALAPGEQPGIVAATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred cccceecccchhhhccccccceeccCceeeeeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 3446778874322211111 112799999999999999999999999999999999999999 No 11 >protein:vir:79503 Length: 409 # NCBI annotation: major head protein # Family: family:all:11999 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468058;genbank:gi:157265500;genbank:GeneID:5600620 Probab=99.94 E-value=7.6e-29 Score=174.73 Aligned_cols=347 Identities=11% Similarity=0.047 Sum_probs=208.5 Q ss_pred Cccccc-CCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEE--EEeCceeeee--ccCCCCCccccccCCc Q lcl|NC_011811. 1 MSLTLA-NGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLD--ITDYGISLLD--PVDRDTRNAESSAPES 75 (368) Q Consensus 1 m~~~~f-~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie--~~~~~~~l~p--~v~rg~~~~~~~~~~~ 75 (368) +++.-. ..+-+++..+...+.+..++.++|.+. ||+....-.+++.++ ..+|..++.. ..+.+..+.++..+.. T Consensus 18 ~~~~~~~~~~~~~~~~~ia~~~~~~p~~~~L~d~-~FP~~~~f~t~l~~~~~~~kg~kk~~~~~~~~~~d~~~pv~~r~~ 96 (409) T protein:vir:79 18 LSIGGLKFPTTKEIQEAVAAIADKFNQENDLVDR-FFPEDSTFASELELYLLRTQDAEQTGMTFVHQVGSTSLPVEARVA 96 (409) T ss_pred chhcceecCchHHHHHHHHHHHHhcCCccchhhc-cCCCCccccceEEEEeeeccCcccccceEeeecCCccccccccce Confidence 333211 145677888877777776666778886 676554444445544 4555444432 2233444444443332 Q ss_pred ---eeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEE-ccCC-- Q lcl|NC_011811. 76 ---LRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKVV-DSKG-- 148 (368) Q Consensus 76 ---~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i~-d~dG-- 148 (368) .+..++++|||+++..|++.|++.+++.++....+.+.+.+.+.+++|.+++.++.||||+|+|. |+|. .+++ T Consensus 97 ~~~~~~~t~epp~iK~k~~i~e~dl~~~~~~~n~~~~~~i~~~i~~D~~~L~~~I~~R~E~Ma~q~L~tGki~i~g~~~~ 176 (409) T protein:vir:79 97 KVDLAKATWSPLAFKESRVWDEKEILYLGRLADEVQAGVINEQIAESLTWLMARMRNRRRWLTWQVMRTGRITIQPNDPY 176 (409) T ss_pred eeeeeeecccccccccccccCHHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEEecCCC Confidence 34678999999999999999999766655555556677788899999999999999999999996 9884 3332 Q ss_pred C-EEEeeehhcCcccc-----eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHH-hcCHHHH Q lcl|NC_011811. 149 F-LWADMYQTFGVEKK-----TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKL-TGHAKVR 221 (368) Q Consensus 149 ~-~~~d~~~~fG~~~~-----~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al-~~h~~V~ 221 (368) + .-.++-.+||+... +.+-.|+++++||.++|++|.+++++.-+.. .+.-.++++...|++| ..++.|+ T Consensus 177 ~~~g~~~~vDyg~pa~hkvtlTgt~~W~~~~AdPi~DIe~w~~~i~~~~g~~----~t~~~~imt~~~~~~l~~~n~~ik 252 (409) T protein:vir:79 177 NPNGLKYVIDYGVTDIELPLPQKFDAKDGNGNSAVDPIQYFRDLIKAATYFP----DRRPVAIIVGPGFDEVLADNTFVQ 252 (409) T ss_pred ccccceEEEecCCCcccceeecccccCCCCCCChHHHHHHHHHHHHHhcCCC----CCccEEEEcHHHHHHHHhCcHHHH Confidence 1 11233344788642 2344699999999999999999998764321 1223456666666665 4667788 Q ss_pred HHHHHHHhh-hhhcccccccccccccccccccce-EEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecc Q lcl|NC_011811. 222 EAYMAQQAV-SSYGLITGSLKTGRSDGVATATNE-FPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTA 299 (368) Q Consensus 222 ~~y~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~-f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~ 299 (368) +++.+.... ......... . ........+. ....|+.++.|+++|.+.+|..+++++.+ ...+++.. T Consensus 253 ~~l~~~~~~~~~~~~~~~~--~--~l~~~~~ln~~~~~~GL~I~vYd~~Y~dedGt~k~~~Pd~-------~vvLl~ap- 320 (409) T protein:vir:79 253 KYVEYEKGWVVGQNTVQPP--R--EVYRQAALDIFKRYTGLEVMVYDKTYRDQDGSVKYWIPVG-------ELIVLNQS- 320 (409) T ss_pred Hhhhcccccccccccccch--h--hhcchhHhHhhhhhcCceEEEEeeEEEecCCcccceecCC-------eEEEEcCC- Confidence 876543211 110000000 0 0000000011 12347999999999999888877776544 33333211 Q ss_pred cccccccceeEEeccc-c---hHhhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcce----EEEEEeeecC Q lcl|NC_011811. 300 MLGEANDLYQISYGPA-N---KMGYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQL----LVDVRADKAS 368 (368) Q Consensus 300 ~~~~~~~~f~~~~apa-~---~~~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~a----lv~~t~~aa~ 368 (368) ..-+-.++||+. + ....++..+--+=.+.|..+||--++......-||+...++. ..+.+-..+| T Consensus 321 ----~g~LG~T~yGa~~~~~~~~~~v~~~g~~i~~~~~~~~dP~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~ 393 (409) T protein:vir:79 321 ----TGPVGRFVYTAHVAGQRNGKVVYATGPYLTVKDHLQDDPPYYAIIAGFHGLPQLSGYNTEDFSFHRFKWLKYA 393 (409) T ss_pred ----cccccceecccccccccchhhhccccceeEecccccCCcceeeeecceEEeeeeecCCccceeehhhhhhhhh Confidence 001235788872 1 111222222223336678899999999999999998886553 2233333333 No 12 >protein:vir:78006 Length: 409 # NCBI annotation: major head protein # Family: family:all:11999 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467942;genbank:gi:157265383;genbank:GeneID:5600496 Probab=99.94 E-value=7.6e-29 Score=174.73 Aligned_cols=347 Identities=11% Similarity=0.047 Sum_probs=208.5 Q ss_pred Cccccc-CCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEE--EEeCceeeee--ccCCCCCccccccCCc Q lcl|NC_011811. 1 MSLTLA-NGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLD--ITDYGISLLD--PVDRDTRNAESSAPES 75 (368) Q Consensus 1 m~~~~f-~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie--~~~~~~~l~p--~v~rg~~~~~~~~~~~ 75 (368) +++.-. ..+-+++..+...+.+..++.++|.+. ||+....-.+++.++ ..+|..++.. ..+.+..+.++..+.. T Consensus 18 ~~~~~~~~~~~~~~~~~ia~~~~~~p~~~~L~d~-~FP~~~~f~t~l~~~~~~~kg~kk~~~~~~~~~~d~~~pv~~r~~ 96 (409) T protein:vir:78 18 LSIGGLKFPTTKEIQEAVAAIADKFNQENDLVDR-FFPEDSTFASELELYLLRTQDAEQTGMTFVHQVGSTSLPVEARVA 96 (409) T ss_pred chhcceecCchHHHHHHHHHHHHhcCCccchhhc-cCCCCccccceEEEEeeeccCcccccceEeeecCCccccccccce Confidence 333211 145677888877777776666778886 676554444445544 4555444432 2233444444443332 Q ss_pred ---eeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEE-ccCC-- Q lcl|NC_011811. 76 ---LRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKVV-DSKG-- 148 (368) Q Consensus 76 ---~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i~-d~dG-- 148 (368) .+..++++|||+++..|++.|++.+++.++....+.+.+.+.+.+++|.+++.++.||||+|+|. |+|. .+++ T Consensus 97 ~~~~~~~t~epp~iK~k~~i~e~dl~~~~~~~n~~~~~~i~~~i~~D~~~L~~~I~~R~E~Ma~q~L~tGki~i~g~~~~ 176 (409) T protein:vir:78 97 KVDLAKATWSPLAFKESRVWDEKEILYLGRLADEVQAGVINEQIAESLTWLMARMRNRRRWLTWQVMRTGRITIQPNDPY 176 (409) T ss_pred eeeeeeecccccccccccccCHHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEEecCCC Confidence 34678999999999999999999766655555556677788899999999999999999999996 9884 3332 Q ss_pred C-EEEeeehhcCcccc-----eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHH-hcCHHHH Q lcl|NC_011811. 149 F-LWADMYQTFGVEKK-----TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKL-TGHAKVR 221 (368) Q Consensus 149 ~-~~~d~~~~fG~~~~-----~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al-~~h~~V~ 221 (368) + .-.++-.+||+... +.+-.|+++++||.++|++|.+++++.-+.. .+.-.++++...|++| ..++.|+ T Consensus 177 ~~~g~~~~vDyg~pa~hkvtlTgt~~W~~~~AdPi~DIe~w~~~i~~~~g~~----~t~~~~imt~~~~~~l~~~n~~ik 252 (409) T protein:vir:78 177 NPNGLKYVIDYGVTDIELPLPQKFDAKDGNGNSAVDPIQYFRDLIKAATYFP----DRRPVAIIVGPGFDEVLADNTFVQ 252 (409) T ss_pred ccccceEEEecCCCcccceeecccccCCCCCCChHHHHHHHHHHHHHhcCCC----CCccEEEEcHHHHHHHHhCcHHHH Confidence 1 11233344788642 2344699999999999999999998764321 1223456666666665 4667788 Q ss_pred HHHHHHHhh-hhhcccccccccccccccccccce-EEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecc Q lcl|NC_011811. 222 EAYMAQQAV-SSYGLITGSLKTGRSDGVATATNE-FPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTA 299 (368) Q Consensus 222 ~~y~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~-f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~ 299 (368) +++.+.... ......... . ........+. ....|+.++.|+++|.+.+|..+++++.+ ...+++.. T Consensus 253 ~~l~~~~~~~~~~~~~~~~--~--~l~~~~~ln~~~~~~GL~I~vYd~~Y~dedGt~k~~~Pd~-------~vvLl~ap- 320 (409) T protein:vir:78 253 KYVEYEKGWVVGQNTVQPP--R--EVYRQAALDIFKRYTGLEVMVYDKTYRDQDGSVKYWIPVG-------ELIVLNQS- 320 (409) T ss_pred Hhhhcccccccccccccch--h--hhcchhHhHhhhhhcCceEEEEeeEEEecCCcccceecCC-------eEEEEcCC- Confidence 876543211 110000000 0 0000000011 12347999999999999888877776544 33333211 Q ss_pred cccccccceeEEeccc-c---hHhhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcce----EEEEEeeecC Q lcl|NC_011811. 300 MLGEANDLYQISYGPA-N---KMGYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQL----LVDVRADKAS 368 (368) Q Consensus 300 ~~~~~~~~f~~~~apa-~---~~~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~a----lv~~t~~aa~ 368 (368) ..-+-.++||+. + ....++..+--+=.+.|..+||--++......-||+...++. ..+.+-..+| T Consensus 321 ----~g~LG~T~yGa~~~~~~~~~~v~~~g~~i~~~~~~~~dP~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~ 393 (409) T protein:vir:78 321 ----TGPVGRFVYTAHVAGQRNGKVVYATGPYLTVKDHLQDDPPYYAIIAGFHGLPQLSGYNTEDFSFHRFKWLKYA 393 (409) T ss_pred ----cccccceecccccccccchhhhccccceeEecccccCCcceeeeecceEEeeeeecCCccceeehhhhhhhhh Confidence 001235788872 1 111222222223336678899999999999999998886553 2233333333 No 13 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=99.16 E-value=5e-12 Score=82.55 Aligned_cols=298 Identities=13% Similarity=0.109 Sum_probs=138.4 Q ss_pred Cccc-ccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEE-EeCceeeeeccCC--CCCccccccCCce Q lcl|NC_011811. 1 MSLT-LANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDI-TDYGISLLDPVDR--DTRNAESSAPESL 76 (368) Q Consensus 1 m~~~-~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~-~~~~~~l~p~v~r--g~~~~~~~~~~~~ 76 (368) |.++ .|--| -.||..--... .+.++++. +|+..++......+-. .+... .+|.+.| ++....+ ...+. T Consensus 2 ~~~~~~~~~d----p~LT~~A~gy~-n~~~Iad~-lfP~vpV~~~~~k~~~f~~e~f-~~~~t~ra~~~~~~~v-~~~~~ 73 (307) T protein:vir:79 2 GRLSKLRIVD----PVLTNLAIGYT-NAEFIGQT-LMPVVEVEKEGGKIPKFGKESF-RLYQTERALRAKSNRM-NPEDI 73 (307) T ss_pred CCCCCCcccC----HHHHHHHhhcc-chhhhhhh-cCCcccccccccceeeeccccc-cccccccccCCCccee-eeecc Confidence 4443 34332 14655554444 45689885 8998887655443322 22233 2355443 2222222 21222 Q ss_pred eEEEEecceeccccccCHHHHhcccCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcE-EccCCCEEEee Q lcl|NC_011811. 77 RQVAFPLIYFKHIESITPEQVQGIRQAGTAA-ELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKV-VDSKGFLWADM 154 (368) Q Consensus 77 ~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~-~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i-~d~dG~~~~d~ 154 (368) +...+.+.-...... + ..|.-|... .++ .+.+..+.+.+.+++||||+++++... +.++-.+... T Consensus 74 ~~~~~~~~~~~l~~~-----i-d~r~~~~~~~~~~------~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLs- 140 (307) T protein:vir:79 74 DSVDVNLDEHDLEYP-----I-DYREDQESAFPLE------QAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLS- 140 (307) T ss_pred ccccccccccchhhc-----c-cchhcCCCCCCHH------HHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEc- Confidence 233333322222211 1 124444321 122 244666788999999999999998433 2222222111 Q ss_pred ehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhc Q lcl|NC_011811. 155 YQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYG 234 (368) Q Consensus 155 ~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~ 234 (368) |.+ .|+++++||.+.+++|++.|.+..+ ...-++++|..+|++|+.||+|.+.+++...+ . T Consensus 141 ----gt~------~Wsd~~sDPi~di~~~~~ai~~~~g------~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g---~ 201 (307) T protein:vir:79 141 ----ATE------KFTAANSDPVGVIEDGKEAIRTKIG------RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKG---I 201 (307) T ss_pred ----cCc------ccCCCCCCcHHHHHHHHHHHHHhhC------CccceEEeCHHHHHHHhcCHHHHHHhcCcccc---c Confidence 322 4899999999999999999988643 34568899999999999999999988765421 1 Q ss_pred ccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecc Q lcl|NC_011811. 235 LITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGP 314 (368) Q Consensus 235 ~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~ap 314 (368) ...+.+ + + .|+.-.+..+.++|.+..+..+++...+.... |. |.+. ....+.+|.--||. T Consensus 202 it~~~l-a----------~--l~~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~-----y~-~~~~-~~~~~~~~~ps~Gy 261 (307) T protein:vir:79 202 VTVDLL-K----------E--IFEVENIAVGEAIYADDKDRFTDIWGANIVLA-----YV-PLQR-GGQQRTPYEPSYGY 261 (307) T ss_pred cCHHHH-H----------H--HhCceeEEEeeeeeecccccchhcCCCceEEE-----ec-cccc-CCCCCcccccccce Confidence 111111 0 1 12222344455666555554444443222111 00 1110 00111222222222 Q ss_pred cchHhhccCCCceeeEEEeeccCCCeeEEE-ee-ecccccccCcceEEEEEee Q lcl|NC_011811. 315 ANKMGYVNTLGQDLYVFEYAKDRDEGTDFE-AH-SYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 315 a~~~~~vn~~~~~~y~~~~~~~~~~g~~l~-~e-S~pLpv~~rP~alv~~t~~ 365 (368) --..+ +.++-.+ ..+.++++-+. ++ ..|+-++..-..|++--+- T Consensus 262 t~~~~-----g~~~~d~--~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 262 TLRKK-----GNPVVDT--RIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred eEEec-----CceEEec--ccCCCceeEEeecccccceeeccccchhhccCCC Confidence 11111 1111000 11223333322 22 2222222111122222111 No 14 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=99.02 E-value=9.4e-11 Score=75.56 Aligned_cols=300 Identities=13% Similarity=0.097 Sum_probs=133.1 Q ss_pred Cccc-ccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEE-EEeCceeeeeccCCCCCc-cccccCCcee Q lcl|NC_011811. 1 MSLT-LANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLD-ITDYGISLLDPVDRDTRN-AESSAPESLR 77 (368) Q Consensus 1 m~~~-~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie-~~~~~~~l~p~v~rg~~~-~~~~~~~~~~ 77 (368) |.+. .|--|. .||..---.- .+.++.+. +|+..++......+- +.+.... +|...|+-.+ .+.......+ T Consensus 2 ~~~~~~~~~dp----~LT~~A~gy~-n~~~ia~~-l~P~vpv~~~~~k~~~f~~eaF~-~~~t~r~~~~~~~~v~~~~~~ 74 (307) T protein:vir:10 2 GRLSKLRIVDP----VLTNLAIGYT-NAEFIGQS-LMPVVEVEKEGGKIPKFGKESFR-LYKTERALRARSNRMNPEDLG 74 (307) T ss_pred CCCCCCcccCh----hHHHHHHhhc-chhhhhhh-cCCcccccccccceeeECccccc-chhhhcccCCCcceeeccccc Confidence 4443 333221 3443332222 34688885 899888776544332 2233332 3444332211 1111111111 Q ss_pred EEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCC-CEEEeeeh Q lcl|NC_011811. 78 QVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKG-FLWADMYQ 156 (368) Q Consensus 78 ~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG-~~~~d~~~ 156 (368) ...+.++ ..+-..|-| .|+-|.... ...++.+..+.+.+.+++||++++.++....-+.+ ++... T Consensus 75 ~~~~~~~---~~~L~~~id---~r~~~~~~~-----~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLs--- 140 (307) T protein:vir:10 75 SIDIVLD---EHDLEYPID---YREDQESAF-----PLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLS--- 140 (307) T ss_pred ccccccc---cccccccCC---hhhcCCCCC-----CHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEec--- Confidence 1122222 222222222 255443211 12335566677899999999999998743221222 21111 Q ss_pred hcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhccc Q lcl|NC_011811. 157 TFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLI 236 (368) Q Consensus 157 ~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~ 236 (368) |. =.|+++++||.+.+++|++.|.+..+ ...-.+++|.+.|++|+.||+|.+.+++.+.+ .+ T Consensus 141 --Gt------~~Wsd~~sDPi~di~~~~~ai~~~~g------~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g----~i 202 (307) T protein:vir:10 141 --AT------EKFTAAGSDPVGVIEDGKEAIRTKIG------RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKG----IV 202 (307) T ss_pred --cc------cccCCCCCCcHHHHHHHHHHHHhhhC------CccceEEeCHHHHHHHhcCHHHHHHhCCcccc----cc Confidence 22 15889999999999999999987643 34557899999999999999999988765421 11 Q ss_pred ccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccc Q lcl|NC_011811. 237 TGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPAN 316 (368) Q Consensus 237 ~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~ 316 (368) ....-+ + .|+.-.+.-..+.|....+..+++. .+..++. | +|... ....++++.--||--- T Consensus 203 t~~~la----------~--ll~v~~i~vg~a~~~~~~~~~~~iw-~~~~vl~----y-v~~~~-~~~~~~~~epsfGyT~ 263 (307) T protein:vir:10 203 TVDLLK----------E--IFEVENIAVGEAIYADDKDRFTDIW-GANIVLA----Y-VPLQR-GGQQRTPYEPSYGYTL 263 (307) T ss_pred CHHHHH----------H--HhCceeEEEeeeeeeccCCccceeC-CCceEEE----e-ccccc-CCCCCcccccccceeE Confidence 111111 1 1222222222344443333333332 2211110 0 01100 0001111111111100 Q ss_pred hHhhccCCCceeeEEEeeccCCCeeEE--EeeecccccccCcceEEEEEee Q lcl|NC_011811. 317 KMGYVNTLGQDLYVFEYAKDRDEGTDF--EAHSYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 317 ~~~~vn~~~~~~y~~~~~~~~~~g~~l--~~eS~pLpv~~rP~alv~~t~~ 365 (368) . ..+.++--+ ..+..+++-+ .-.-.|+-++..-..|++..+- T Consensus 264 ~-----~~g~~~~d~--~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 264 R-----KKGNPVVDT--RIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred E-----EcCCeEeec--eecCCceeEEeccccccceeecccccceeccCCC Confidence 0 011111111 1122333222 2223344444333334443333 No 15 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=98.56 E-value=2.7e-08 Score=62.11 Aligned_cols=302 Identities=10% Similarity=0.017 Sum_probs=143.2 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEE-EeC-ceeee-eccCCCCCccccccCCcee Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDI-TDY-GISLL-DPVDRDTRNAESSAPESLR 77 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~-~~~-~~~l~-p~v~rg~~~~~~~~~~~~~ 77 (368) |+--.|-.|. .||..---. +.+.++++. +|+..++......+-+ -+. ...+. --+.|++....+ ..+ .+ T Consensus 1 ~~~~~~~~dp----~LT~~A~gy-~n~~~Ia~~-l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v-~~~-~~ 72 (309) T protein:vir:99 1 MSNAPFPIDP----ELTAIAIAY-RNGRMISDE-VLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFS-AT 72 (309) T ss_pred CCCCCcCcCH----hHHHHHhhc-cChhhhhhh-cCCccccCccccceeeechhhcccccchhhccCCCcceE-eec-cc Confidence 5555554331 344433333 345588885 8999888766554432 232 33332 223444443332 222 34 Q ss_pred EEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCC-CEEEeeeh Q lcl|NC_011811. 78 QVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKG-FLWADMYQ 156 (368) Q Consensus 78 ~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG-~~~~d~~~ 156 (368) ...+.+.-...+.+|.-+|+++-+ +.-+ ..++....+.+.+.+++|+++++.++..-.-+.| .+... T Consensus 73 ~~~~~~~~~~L~~~i~~~~~~~a~--~~~d-------~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Ls--- 140 (309) T protein:vir:99 73 DETGSTEDHGLDAPVPQADIDNAP--TNYN-------PLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--- 140 (309) T ss_pred CceeeecccceeecCCchhhhhcc--CCCC-------HHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEec--- Confidence 456666666777778777776532 2112 1223445678899999999999987743322222 22222 Q ss_pred hcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhccc Q lcl|NC_011811. 157 TFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLI 236 (368) Q Consensus 157 ~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~ 236 (368) |.+ .|+++++||.+.+++|++.+ +. .+-.+++|...|++|+.||.|.+.+++...... ... T Consensus 141 --gt~------~wsd~~SDPi~~i~~~~~~~-------g~---~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g-~it 201 (309) T protein:vir:99 141 --GAD------QWSDPTSNPLPVITDALDSV-------IL---RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG-MVP 201 (309) T ss_pred --Ccc------ccCCCCCCcHHHHHHHHHhh-------CC---CcceEEechHHHHHHhhCHHHHHHhcCCCcccc-ccC Confidence 322 38889999999999997654 12 345788999999999999999999876432111 011 Q ss_pred ccccccccccccccccceEEeCCEEEE--EccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecc Q lcl|NC_011811. 237 TGSLKTGRSDGVATATNEFPYRGVVFR--QYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGP 314 (368) Q Consensus 237 ~~~~~~g~~~~~~~~~~~f~~~Gi~~~--~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~ap 314 (368) ...+ -+.|.+..|.+= .|.....+.+..-++... + ...+.+.+.. .+..+..-||. T Consensus 202 ~~~l-----------a~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg-~-------~~~L~y~~~~---~~~~~~ps~G~ 259 (309) T protein:vir:99 202 MAFL-----------QELLELDAIYIGEARLNIARPGQNPNLIRAWG-P-------HASFIYRDRL---ADTRNGTTFGL 259 (309) T ss_pred HHHH-----------HHHhCcceEEeecceeeccccccccccccccC-C-------cEEEEEcCCC---CCCcccccccc Confidence 1111 111222223221 122222222222222221 1 1222222211 11122222332 Q ss_pred -cchHhhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeee Q lcl|NC_011811. 315 -ANKMGYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADK 366 (368) Q Consensus 315 -a~~~~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~a 366 (368) +.+-.-. .+..+..+ +.++-++++..-..-.|+.++..-..+++..+++ T Consensus 260 t~~~~~r~--~g~~~d~~-~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va~ 309 (309) T protein:vir:99 260 TAQWGDRV--SGSIADPN-IGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) T ss_pred eeeccccc--CCceeeee-eccCCceEEEEeccccchhcchhcchhhhhcccC Confidence 1110111 11211111 1122233344333334444444444444443333 No 16 >protein:vir:98819 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:32561 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851100;genbank:gi:117530257;genbank:GeneID:4484483 Probab=90.77 E-value=0.019 Score=30.00 Aligned_cols=347 Identities=15% Similarity=0.155 Sum_probs=172.2 Q ss_pred Cc------------------cccc-CCCcccHHHHHHHH-HhcCCCccchhhcCcccccCcccceEEEE-EEeCceeeee Q lcl|NC_011811. 1 MS------------------LTLA-NGSRFLLADLTGDI-ANIPNTYGYVNQLDLFRSVPTSQTSVLLD-ITDYGISLLD 59 (368) Q Consensus 1 m~------------------~~~f-~~d~F~~~~Lt~~i-~~~p~~~~~l~~l~lF~~~~~~t~~v~ie-~~~~~~~l~p 59 (368) |+ -+-| ..+.| ++|...| .++|..|- + -+|+.+.+..+.|.-| .++|...+.| T Consensus 1 msdipspnlqalisspylvdnttfprepvy--telarsilaklpatpl--s--avfpdetiaeriviaehviegvntifp 74 (437) T protein:vir:98 1 MSDIPSPNLQALISSPYLVDNTTFPREPVY--TELARSILAKLPATPL--S--AVFPDETIAERIVIAEHVIEGVNTIFP 74 (437) T ss_pred CCCCCCcchHhhhcCceeeccccCCccchH--HHHHHHHHHhcCCccc--c--ccccchhhhhhhhhHHHHHhhhhhhhh Confidence 11 0111 12222 4555544 45566553 2 3688887766655444 4688888999 Q ss_pred ccCCCCCccccccCCceeE--EEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011811. 60 PVDRDTRNAESSAPESLRQ--VAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQ 137 (368) Q Consensus 60 ~v~rg~~~~~~~~~~~~~~--~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~ 137 (368) .|+.|+|.. ....+++++ .++++-.++++-..+-..++|--.-|+.++..+.++.+.+||.+...+|..||....+. T Consensus 75 vvewgapdl-fvdddgytvyrqsyqplpirqsmymsyaqlnntvregttnerataaeqiekkltrqmqkhqltwnvfqaa 153 (437) T protein:vir:98 75 VVEWGAPDL-FVDDDGYTVYRQSYQPLPIRQSMYMSYAQLNNTVREGTTNERATAAEQIEKKLTRQMQKHQLTWNVFQAA 153 (437) T ss_pred hhccCCcce-eecCCCceeeecccCCccchhhhhhhhhhhhhhhhccccchhhhhHHHHHHHHHHHHHhhhhhHHHHHHH Confidence 999999985 455566553 46777778888888888888776678888877778888899999889999999777666 Q ss_pred HhcCcE--EccCCCEE---------EeeehhcCcccc-----eE-----EEEcC--------CCCccHHHHHHHHHHHHH Q lcl|NC_011811. 138 ALKGKV--VDSKGFLW---------ADMYQTFGVEKK-----TV-----YFDLE--------NPDADIDGAIDELVEHME 188 (368) Q Consensus 138 AL~G~i--~d~dG~~~---------~d~~~~fG~~~~-----~v-----~~~l~--------~~~~d~~~~~~~~~~~i~ 188 (368) .+.|.| .|+...+- -|||. |+.+|. +. -+||. -+-+|+.=.+....|.+. T Consensus 154 mmlgginytdprsgvrvkapayiparnffn-fnttqgyrgrnearlfrnlidlnaggtpssgipitdpqfalsnftrrln 232 (437) T protein:vir:98 154 MMLGGINYTDPRSGVRVKAPAYIPARNFFN-FNTTQGYRGRNEARLFRNLIDLNAGGTPSSGIPITDPQFALSNFTRRLN 232 (437) T ss_pred HHhccccccCcccceeeecccccccccccc-cccccccccchHHHHHHHHhhccCCCCCcCCcccccchhhHHHHHHHHH Confidence 666665 34432221 13332 333321 00 11221 122344445555555554 Q ss_pred HHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHH----HHhhhhhcccccccccccc-------------cccccc Q lcl|NC_011811. 189 DTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMA----QQAVSSYGLITGSLKTGRS-------------DGVATA 251 (368) Q Consensus 189 ~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~----~~~~~~~~~~~~~~~~g~~-------------~~~~~~ 251 (368) +..+.-.-+..+ ...+|++.-|-+.-..+.+-+--+ -.+.-+-..+..+-..|.. ...... T Consensus 233 rwfkdtnksdit--dmymgpemrdvilmseearlaqggiiprlgavfgdstidsngsggsfgplppgglgtgmglvlgtr 310 (437) T protein:vir:98 233 RWFKDTNKSDIT--DMYMGPEMRDVILMSEEARLAQGGIIPRLGAVFGDSTIDSNGSGGSFGPLPPGGLGTGMGLVLGTR 310 (437) T ss_pred HHhhccccccch--hhhcCccceeeeeeccchhhhhcccchhhhhhhccccccCCCCCcccCCCCccccccccceeeecc Confidence 444432222221 233455544443322221110000 0000000000000000000 001112 Q ss_pred cceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeE-EecccchHhhccCCCceeeE Q lcl|NC_011811. 252 TNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQI-SYGPANKMGYVNTLGQDLYV 330 (368) Q Consensus 252 ~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~-~~apa~~~~~vn~~~~~~y~ 330 (368) -+...+.||.+...+-.|.++-..+...+.++...++. -|-.- -+.+...-++ ||.--+.+ ...+ ++. T Consensus 311 geilsiaginvhvvdtiykdpvdgvekrvwpknkivav----sfrds--dgnveapgrtqycssensi---dspg--lwt 379 (437) T protein:vir:98 311 GEILSIAGINVHVVDTIYKDPVDGVEKRVWPKNKIVAV----SFRDS--DGNVEAPGRTQYCSSENSI---DSPG--LWT 379 (437) T ss_pred cceeEeecceeeeehhhhhcchhhhhhhcCCccceEEE----EEecC--CCcccCCcccccccccccc---CCCc--cee Confidence 23456778888777777777766655555544433321 11000 0001111122 22211111 1122 233 Q ss_pred EEee---ccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 331 FEYA---KDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 331 ~~~~---~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) .... .+--.|+-+.+-..-||+-+-|--++.+|--+-. T Consensus 380 rtvtdvpppaapgiavqmgnaglpyfkypyrvchvtpctve 420 (437) T protein:vir:98 380 RTVTDVPPPAAPGIAVQMGNAGLPYFKYPYRVCHVTPCTVE 420 (437) T ss_pred eeeccCCCCCCCcceEeecCCCCcccccceeeeeecccchH Confidence 3322 2333466677777778887777777776654443 No 17 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=90.75 E-value=0.019 Score=29.99 Aligned_cols=315 Identities=10% Similarity=0.040 Sum_probs=125.1 Q ss_pred Ccc-cccCCCcccHH-------------HHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCCCC Q lcl|NC_011811. 1 MSL-TLANGSRFLLA-------------DLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDRDT 65 (368) Q Consensus 1 m~~-~~f~~d~F~~~-------------~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~rg~ 65 (368) |+. +....+.++-. ++-.++.. ...+. +++..+.++ ++++.|.+. |...+ ....+|. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~----~s~~~--~~~~~rti~~gkS~q~~~i-G~~~~-~~~~~G~ 72 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLK----GENLL--QWFDVQEVVGTNSVSNKYI-GETEL-QVLSPGK 72 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHH----HHhhc--CcceeeeecccceEEeeee-eeeEE-eeeccCc Confidence 222 11112222210 11122221 11221 334445443 577787776 33333 2222222 Q ss_pred Cc-cccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEE Q lcl|NC_011811. 66 RN-AESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVV 144 (368) Q Consensus 66 ~~-~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~ 144 (368) +- .+....++. ...+....+...-.-.=+|+|+ .+-. -...+.+.....|+++.++.- .+...++ |+...+. T Consensus 73 ~ld~~~~~~~k~-~itID~ll~a~~~V~diDe~q~--~~D~--vR~e~s~e~G~ALA~~~Dq~i-~~~v~~a-a~a~~~~ 145 (364) T protein:vir:10 73 SPDASPTEFDKN-RLVVDTTVIARNTVAHFHDVQN--DIDG--LKSKLSVNQAKKLKKMEDSMV-IQQLVLG-GISNTEA 145 (364) T ss_pred ccCCCCcccCcE-EEEecceeeechhhhhHHHHhc--Cccc--hhHHHHHHHHHHHHHHHHHHH-HHHHHhh-hhhcccc Confidence 11 011112221 2233333332222333344443 1110 012233444555555544432 2222222 2221111 Q ss_pred -ccCCCEEEeeehhcCcccceEEEEc--CCCCcc---HHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCH Q lcl|NC_011811. 145 -DSKGFLWADMYQTFGVEKKTVYFDL--ENPDAD---IDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHA 218 (368) Q Consensus 145 -d~dG~~~~d~~~~fG~~~~~v~~~l--~~~~~d---~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~ 218 (368) +.++.+. . =|. .+.... +....+ +...+.++...+.+ .-.|..+.+++++|.+|.+|++|+ T Consensus 146 ~~~~~~~~-~----~g~---~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdE-----kdVP~~~R~~vv~P~~y~~Ll~~~ 212 (364) T protein:vir:10 146 IRKNPRVA-G----HGF---SIHIVGLASSFLTSPQYMMAAIEMAMEQQTE-----QEVDTSELCGLMPWTAFNCLRDAD 212 (364) T ss_pred cccCCccc-C----Ccc---eeeecccCcchhhhHHHHHHHHHHHHHHHhh-----cCCCccccEEEeChHHHHHHhcCC Confidence 1111100 0 000 011110 111222 33333333333322 224567789999999999999998 Q ss_pred HHHH-HHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCcccc-ccccccceeecCCceeEEe Q lcl|NC_011811. 219 KVRE-AYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVH-KLVGINGVEDSVGVGHAFP 296 (368) Q Consensus 219 ~V~~-~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~-~~~~~~~~~i~~~~a~~~P 296 (368) .+.. .|.. .+ .+... .-......|+.+++= -.++...+... ......+..-.++.+..|. T Consensus 213 ~lvn~d~~~---~~-----~~~~~---------~G~v~~v~Gv~Vv~S-n~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~ 274 (364) T protein:vir:10 213 RIVDKSYTI---AA-----SDNTV---------DGFVLKSWNTPIVPS-NRFPKLSDNTEGTGNTKHHKLSNAGNGNRYD 274 (364) T ss_pred ccccccccc---cC-----CCccc---------cceeEEEeceEEEec-cccccccccccccccccccccccccCCcccc Confidence 7542 1100 00 01111 123346678876542 12222111000 0000001111222222332 Q ss_pred ecccccccccceeEEecccchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 297 NTAMLGEANDLYQISYGPANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 297 ~g~~~~~~~~~f~~~~apa~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) ..+ +-...--..|-| +++.+ ..++.=...|.+++..++.|.+--..=.-..||++.+.++..+|. T Consensus 275 v~~---d~~~~~~~~f~~----~Al~tv~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 275 VTA---GQTSAQAVLFTQ----DALLVGRTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred ccc---ccceeEEEEEec----ceEEEEEEecceeeeeeccceeeeeeeeehcccCcccCccceEEEEecCCC Confidence 111 000011122333 12322 233455677778888888888877777788999999999999988 No 18 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=89.74 E-value=0.025 Score=29.40 Aligned_cols=317 Identities=13% Similarity=0.024 Sum_probs=113.4 Q ss_pred CcccccC----CC---------------cccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeec Q lcl|NC_011811. 1 MSLTLAN----GS---------------RFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDP 60 (368) Q Consensus 1 m~~~~f~----~d---------------~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~ 60 (368) |++..-+ ++ .|.-.-++ ++.+ .+.+. ++...+.++ +.++.|... |...+ .. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~-~f~~----~s~~~--~~~~~r~i~~G~sv~i~~i-G~~tv-~~ 71 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLT-AFTR----RSVTA--DKHIVRTIQNGKSAQFPVM-GRTSG-VY 71 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHH-HHHH----HHhhh--cccccccccccceEEEecc-cceee-ee Confidence 4433211 00 11111111 1110 11222 234444433 566666554 22222 22 Q ss_pred cCCCCCcc-ccccCCceeEEEEecceecccc-cc-CHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011811. 61 VDRDTRNA-ESSAPESLRQVAFPLIYFKHIE-SI-TPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQ 137 (368) Q Consensus 61 v~rg~~~~-~~~~~~~~~~~~f~~p~~~~~~-~i-~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~ 137 (368) ..||.+-. ....... .-+.+.+=-.+..+ .| .-+++|. - -+++.+...++-..+.+...-.++. T Consensus 72 ~t~G~~l~~~~~~~~~-~e~~itID~~~~~~~~VddiD~~q~---~---------~D~~~~~~~~~g~aLa~~~D~~i~~ 138 (347) T protein:vir:94 72 LAPGERLSDKRKGIKH-TEKVITIDGLLTADVMIFDIEDAMN---H---------YDVAGEYSNQLGEALAIAADGAVLA 138 (347) T ss_pred ecCCCCcCCCCCCCCc-ceEEEEecchhhhhHHhhhHHHHhc---C---------cchHHHHHHHHHHHHHHHHHHHHHH Confidence 22333210 0000000 01111111010000 00 1122221 1 1223333333333344433333333 Q ss_pred Hh---cCcEEccCCCEEEeeehhcCcccceEEEEcCCCCccHHHHHHH---HHHHHHHHhccccccccccEEEEEChHHH Q lcl|NC_011811. 138 AL---KGKVVDSKGFLWADMYQTFGVEKKTVYFDLENPDADIDGAIDE---LVEHMEDTANTGGLTNGEQIIVLVDRAFF 211 (368) Q Consensus 138 AL---~G~i~d~dG~~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~---~~~~i~~~l~~~~~~~~~~v~al~g~~~~ 211 (368) .+ .+..-.+.+... -|| ...++.+.....+.++...... .++.+.+.|.-. -.|..+.+++++|++| T Consensus 139 ~~~~~aa~~~~~~~~~~-----g~~-~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~-~VP~~~R~~vv~P~~~ 211 (347) T protein:vir:94 139 EMAILCNLPAASNENIA-----GLG-TASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSN-YVPAGDRYFYTTPDNY 211 (347) T ss_pred HHHHHhccccccccccC-----CCc-ccceeeccccccccchhhhHHHHHHHHHHHHHHHhhc-CCCCCCcEEEeCHHHH Confidence 22 111111111100 011 1122222222222222211111 122222333322 2366678899999999 Q ss_pred HHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCc Q lcl|NC_011811. 212 RKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGV 291 (368) Q Consensus 212 ~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~ 291 (368) ..|++|+.+.... +... ..++.| ....+.|+.+++- ..++... ...........+.+|+ T Consensus 212 ~~Ll~~~~~~~~~--~~~~-------~~~~~G---------~Vg~i~G~~V~~S-n~lp~~~--~t~~~~~~~~~~~aG~ 270 (347) T protein:vir:94 212 SAILAALMPNAAN--YAAL-------IDPETG---------NIRNVMGFVVVEV-PHLVQGG--AGETRGDDGITIASGQ 270 (347) T ss_pred HHHhccchhhhhh--cccc-------cccccc---------ceEEEeceEEEec-Ccccccc--cccccccCcceecCcc Confidence 9999998876532 1110 111222 2346788888873 2222111 1111222233456666 Q ss_pred eeEEeecccccccccceeEEecccchHhhccCC-CceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 292 GHAFPNTAMLGEANDLYQISYGPANKMGYVNTL-GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 292 a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~-~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) .+.||... .....+-|..-.|-.=+-+.+.+. .++.=...+.+++-.+..|.+-...=.-..||++++.+++++|- T Consensus 271 ~~~~~~~~-~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 271 KHAFPATA-SSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred cccccccc-hhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 66666321 111111111111110011111111 01111122333333344444444444567899999999999888 No 19 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=89.08 E-value=0.028 Score=29.06 Aligned_cols=301 Identities=12% Similarity=0.130 Sum_probs=130.8 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCCCCCc--cccccCCcee Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDRDTRN--AESSAPESLR 77 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~rg~~~--~~~~~~~~~~ 77 (368) |=+..|.+.. ++ ++.. ...+ ++++..+.++ +.++.|.+. |. .-+....+|.+- .++ ..++ . T Consensus 21 L~Le~f~GeV-----~t-aF~~----~si~--~~~~~vRtI~~gkS~qf~~l-G~-s~a~y~~pG~~ldg~~~-~~dk-~ 84 (400) T protein:vir:10 21 LLIEKFNGKV-----NE-QYLK----GENI--MSYFDVQTVTGTNTVSNKYL-GE-TELQVLAPGQSPAATST-QADK-N 84 (400) T ss_pred hHHhHhcchH-----HH-HHHH----Hhhh--cccceeeeecccceEEEEEe-ee-eEEeeecCCCCcCCCCc-ccCc-E Confidence 2233333221 11 1111 1111 2456666654 677888877 22 222333333321 111 2222 2 Q ss_pred EEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCcEEcc---CCCEEEe Q lcl|NC_011811. 78 QVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQAL-KGKVVDS---KGFLWAD 153 (368) Q Consensus 78 ~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL-~G~i~d~---dG~~~~d 153 (368) +..+-...+...-.-.=+|+|+ .|-. -...+.+.+...|+++.++ ++++.+ .+++... .|.. . T Consensus 85 ~ItIDtLL~a~~~V~dlDd~q~--~yD~--vRse~s~e~G~ALA~~~Dq-------~iiq~i~~a~~a~t~~~~~~~--~ 151 (400) T protein:vir:10 85 QLVIDATVIARNTVAHLHDVQG--DIDS--LKPKLATNQAKQLKKMEDE-------MLIQQMLLGGIANTQAKRTNP--R 151 (400) T ss_pred EEEeCceeeecchhhhHHHHhh--cccc--ccHHHHHHHHHHHHHHHHH-------HHHHHHHHhcccccccccccC--C Confidence 2344444444444444455553 2210 0112333444555554443 344433 2433221 1110 0 Q ss_pred eehhcCcccceEEEEc--CCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHH-HHHHHHhh Q lcl|NC_011811. 154 MYQTFGVEKKTVYFDL--ENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVRE-AYMAQQAV 230 (368) Q Consensus 154 ~~~~fG~~~~~v~~~l--~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~-~y~~~~~~ 230 (368) - .+-|.+ +.+.- ..+..|+......+....++-.... .+..++++|+.+.+|+.|..|+.+.. .|.+.+ T Consensus 152 g-~~~g~s---~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkd--VP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~-- 223 (400) T protein:vir:10 152 V-KGHGFS---VNVEVNEGEALVNPQYVMAAVEFALEQQLEQE--VDISDVAILMPWRYFNVLRDADRIVDKSYTISQ-- 223 (400) T ss_pred c-cccccc---eeecccccccccCHHHHHHHHHHHHHHHHhcC--CCccceEEEcCHHHHHHHHhCCcccchhccccC-- Confidence 0 000111 22211 1122354343333333332211111 24567899999999999999873321 111100 Q ss_pred hhhcccccccccccccccccccceEEeCCEEEEEccccccCCC--ccccccccccceeecCCceeEEeecccccccccce Q lcl|NC_011811. 231 SSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKR--NTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLY 308 (368) Q Consensus 231 ~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~--~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f 308 (368) .+....-....+.||.+++-.- ++... ..... .-++|.++.|..- ++....- T Consensus 224 ---------------~g~~~~g~v~~v~Gv~Iv~Sn~-lP~~a~~~~~~~-------lS~a~~G~~y~~t---~d~s~~~ 277 (400) T protein:vir:10 224 ---------------SGATIQGFVLSSYNCPVIPSNR-FPKYSQGQKHHL-------LSNEDNGYRYDPI---AEMNGAI 277 (400) T ss_pred ---------------CCccccceEEEEeceEEEeeCc-CCcccCcccccc-------cccCCCCccCCcc---cccccee Confidence 1112223345678888876322 21111 00011 1122233333210 0011111 Q ss_pred eEEecccchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 309 QISYGPANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 309 ~~~~apa~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) -..|-| +.+++ ...++=...|.+++..++.|.+-...=..+.||++..-+|.+.-+ T Consensus 278 av~F~~----sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 278 AVLFTA----DALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred EEEEeh----hheEEEEeeccccccccchhhHHHHHHHHHHhCCcccchhheEEEEecCCc Confidence 112222 12333 234566777888888888898888888899999999999887655 No 20 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=88.53 E-value=0.032 Score=28.80 Aligned_cols=288 Identities=11% Similarity=0.035 Sum_probs=121.9 Q ss_pred Ccc-cccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCce--eeeeccCCCCCccccccCCce Q lcl|NC_011811. 1 MSL-TLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGI--SLLDPVDRDTRNAESSAPESL 76 (368) Q Consensus 1 m~~-~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~--~l~p~v~rg~~~~~~~~~~~~ 76 (368) +++ |+.+++-|=.+ .|..+= .+.+|.++ ||.....+ .-.+.+....... .=+.-|.+|++ -++...+.. T Consensus 16 itv~~ll~~P~~I~~----~i~e~~-~~~~iad~-lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggE-iP~~~~~~G 88 (318) T protein:vir:10 16 ITVRELVGNPLWIPT----ALKKMM-VNQFISES-LFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGE-IPVSAGARG 88 (318) T ss_pred eehHHhhCCchhHHH----HHHHHH-hccchhhh-hhhcccccccceeEEEecccccccCcHhhccCccc-ccccCCCCC Confidence 333 23333433222 222222 46778876 67654433 3344443322111 11223334442 122221111 Q ss_pred eEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeeh Q lcl|NC_011811. 77 RQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQ 156 (368) Q Consensus 77 ~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~ 156 (368) ..+.-..-....+-.|+-+.+. |.- -+.+.|.+.++.+.+.+-.+.+++.||+--++..- T Consensus 89 ~~~ia~~~K~G~~~~vS~Em~~--~n~---------~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~--------- 148 (318) T protein:vir:10 89 LPRTAFAVKKALGVRVSKEMID--ENR---------VGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTL--------- 148 (318) T ss_pred chhhhhhehhccceeccHHHHh--hcC---------hhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------- Confidence 1111111123333334333221 111 14577888889999999999999999962221100 Q ss_pred hcCcccceEEEEcCC---CCccHHHHHHHHHHHHHH-----HhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHH Q lcl|NC_011811. 157 TFGVEKKTVYFDLEN---PDADIDGAIDELVEHMED-----TANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQ 228 (368) Q Consensus 157 ~fG~~~~~v~~~l~~---~~~d~~~~~~~~~~~i~~-----~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~ 228 (368) .+.-.|++ ..+|+....+.+....-+ .....-.-|+..-.+++++..|..|.+|++++++|..-. T Consensus 149 -------~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a 221 (318) T protein:vir:10 149 -------AVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNA 221 (318) T ss_pred -------cCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccc Confidence 00011111 111222222211110000 000000123445577899999999999999999986322 Q ss_pred hhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccce Q lcl|NC_011811. 229 AVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLY 308 (368) Q Consensus 229 ~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f 308 (368) ..... ... ..|.. ...+-|..++. ...+|.|+++.+=.|.+ | T Consensus 222 ~~~~~--~~~--~tg~~--------~g~~lGl~vi~-------------------s~~~p~~~alvlq~g~v-----G-- 263 (318) T protein:vir:10 222 NYVST--APD--WTGNF--------PGSVMGLNVIR-------------------SRTFPIDRVLIMERGTV-----G-- 263 (318) T ss_pred hhhhh--ccc--ccccc--------cceeeceEEee-------------------cCccCCCeeEEEecCCc-----c-- Confidence 10000 000 00000 01234555542 12346666766654421 2 Q ss_pred eEEecccchHhhccCCCceeeEE---EeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 309 QISYGPANKMGYVNTLGQDLYVF---EYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 309 ~~~~apa~~~~~vn~~~~~~y~~---~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) +++ +..--...++|.- ..-.++.+ |.+..---.-+...+|.|++++|-=-.- T Consensus 264 --~~~-----d~~pl~~t~~~~egg~~~g~~~~s-~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 264 --FYS-----DTRPLQFTALYPEGNGPNGGPTES-YRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred --eee-----ccccceeeecccCCCCCCCCcchh-hheehheeeeeeeeCcceeEEEeeccCC Confidence 111 1111112223321 00001222 2233333345677899999999866555 No 21 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=77.39 E-value=0.13 Score=25.53 Aligned_cols=274 Identities=16% Similarity=0.122 Sum_probs=114.1 Q ss_pred Ccc--cccCCCcccHHHHHHH-----HHhc-CCCccchhhcCcccccCccc-ceEEEEEEe-CceeeeeccCCCCCcccc Q lcl|NC_011811. 1 MSL--TLANGSRFLLADLTGD-----IANI-PNTYGYVNQLDLFRSVPTSQ-TSVLLDITD-YGISLLDPVDRDTRNAES 70 (368) Q Consensus 1 m~~--~~f~~d~F~~~~Lt~~-----i~~~-p~~~~~l~~l~lF~~~~~~t-~~v~ie~~~-~~~~l~p~v~rg~~~~~~ 70 (368) |.. ..-....=...+|..+ +|+. .....++.-||+++..|... ++|.. +.. .-..-+..|+.|.. -+. T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt-~k~~~y~gda~dVaEGe~-Ipl 78 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKT-YAGYDVTLAEGNVPEGEV-IPL 78 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEee-ccceeeeeccccccCCcc-cch Confidence 110 1111111111122111 1111 34456677788888887764 33421 111 11222234555543 233 Q ss_pred ccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCE Q lcl|NC_011811. 71 SAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFL 150 (368) Q Consensus 71 ~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~ 150 (368) ++-.+......+...-+.-+.+|++.|| +..+|.. --++-. +.+..+.++++ .-+..+|++. .+ T Consensus 79 skvt~~~~~t~t~~ikK~rK~tTdEAIq-lsGyg~a-Vgetd~----qL~~~iq~kId----~d~~t~Lkta----T~-- 142 (296) T protein:vir:98 79 SKVERKIHSEKKIELKKYRKATTGEDIQ-MYGSNEA-VTNTDN----ALVRQLQKKIR----TDFVTALKTG----TG-- 142 (296) T ss_pred hhheeeecceEEEEeeccccccCHHHHH-hhcCCch-hHHHHH----HHHHHHHHhhh----HHHHHHHhcc----cc-- Confidence 3332222234445555556668999987 3456632 111111 11122222333 2344555411 01 Q ss_pred EEeeehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhh Q lcl|NC_011811. 151 WADMYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAV 230 (368) Q Consensus 151 ~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~ 230 (368) ++ + .....+.+.+....-...+.+.... ....++|+.|.=..++++++.|-. + T Consensus 143 -------------t~--~--~t~~~lQ~Ala~~~~~l~~~feded---~~~~V~FVnP~D~a~ylg~a~it~-----q-- 195 (296) T protein:vir:98 143 -------------TQ--D--ALGAGLQGALASAWGKLQVLFEDYG---SERAIVFANSLDVAEYIAKAGITT-----Q-- 195 (296) T ss_pred -------------ee--e--echhhHHHHHHHHhhhhhhhccccC---CCceEEEEehHHHHHHhcCCccch-----h-- Confidence 01 0 0111233333333333334443322 124567777766555555543321 0 Q ss_pred hhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeE Q lcl|NC_011811. 231 SSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQI 310 (368) Q Consensus 231 ~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~ 310 (368) -.||+.-.+++.|. . .+ ....++.|+.++.| .+.-.. T Consensus 196 ------------------------t~fG~tyl~nfLG~-~--------II--~S~kV~~G~~~~T~--------~~Ni~~ 232 (296) T protein:vir:98 196 ------------------------TAFGLTYLVDFTGT-V--------II--STNDVTKGEIWATV--------PENIIF 232 (296) T ss_pred ------------------------heechhhhhhcccc-E--------EE--EcCcCCCceEEEee--------ecceEE Confidence 01222222234442 0 01 11235666666654 444566 Q ss_pred EecccchHhhccCCCcee--eE------EEeeccCCCeeEEEeeec----ccccccCcceEEEEEeeecC Q lcl|NC_011811. 311 SYGPANKMGYVNTLGQDL--YV------FEYAKDRDEGTDFEAHSY----MMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 311 ~~apa~~~~~vn~~~~~~--y~------~~~~~~~~~g~~l~~eS~----pLpv~~rP~alv~~t~~aa~ 368 (368) +|.|.+.-+ .++.| |. -+-.... .-.+..|+. -.-.|-|++.+|++|.++|- T Consensus 233 ay~~~~~~~----l~~~f~~~~d~tglIGv~h~~~--~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 233 AYINPNNSE----LAKEFNLYGDPTGYIGMNHFQE--NTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred Eeecccccc----hhhhhccccccccceEEEeccc--cceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 777753111 11111 11 1111111 122444443 34467899999999999888 No 22 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=76.96 E-value=0.13 Score=25.44 Aligned_cols=312 Identities=13% Similarity=0.017 Sum_probs=107.4 Q ss_pred Cc---------------------ccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCcee-- Q lcl|NC_011811. 1 MS---------------------LTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGIS-- 56 (368) Q Consensus 1 m~---------------------~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~-- 56 (368) |. .++|= -.|+-.-+ .++.+ .+.+.+ +...+.++ +.++.|.+....-. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~-~~f~~----~s~~~~--~~~~r~i~~G~sv~~~~iG~~~~~~ 72 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVL-TAFVR----RSVTMD--KHMVRTIQNGKSASFPVMGRTKGYY 72 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHH-HHHHH----Hhhhhh--ccccccccCcceEEEeeecceeeee Confidence 11 11111 12221111 12221 133333 34444433 56666654433211 Q ss_pred eeeccCCCCCccccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011811. 57 LLDPVDRDTRNAESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLM 136 (368) Q Consensus 57 l~p~v~rg~~~~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~ 136 (368) ..|-.+..++...+ ...+.+...=+.-|+... .=.-+++|. -. +++.+...++-..+.+...-.++ T Consensus 73 ~~~g~~l~~~~~~~-~~~~~~i~ID~~~y~~~~-Vdd~D~~q~---~~---------D~r~~~~~~~g~aLA~~~D~~i~ 138 (347) T protein:vir:88 73 LAPGENLDDKRKDI-KHSEKVIQIDGLLTSDVL-IYDIEDAMN---HY---------DVRAEYSAQLGEALAIAADGAVL 138 (347) T ss_pred eccccCCCCCCCCC-ccceEEEEEechhhhhhh-hhhHHHHhh---cC---------CchHHHHHHHHHHHHHHHHHHHH Confidence 11222222111111 111111111111111100 001122221 11 12222222232222222222222 Q ss_pred HHh-cCc--------EEccCCC-EEEeeehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEE Q lcl|NC_011811. 137 QAL-KGK--------VVDSKGF-LWADMYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLV 206 (368) Q Consensus 137 ~AL-~G~--------i~d~dG~-~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~ 206 (368) ..| ++- ...+-++ ...+ .+.. + +...+..+.......+... ...|.-. ..|..+.++++ T Consensus 139 ~~l~~~a~~~~~~~~~~~g~~~~~~~~----~~~~-~----~~~~~~~~~~~~~~~i~~a-~~~Lde~-~VP~~gR~~vv 207 (347) T protein:vir:88 139 AEMAKLCNLPAASNENIAGLGQAVVLN----IGAA-A----DLVDVEARGKAILKGLTLA-RARLTKN-YVPAGDRRFYC 207 (347) T ss_pred HHHHHhhccccccccccCCcccccccc----cccc-c----cccchhhhHHHHHHHHHHH-HHHHhhc-CCCCCCCEEEe Confidence 222 111 1111111 1001 0100 0 0111222221122222222 2223222 24667888999 Q ss_pred ChHHHHHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCcccccccccccee Q lcl|NC_011811. 207 DRAFFRKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVE 286 (368) Q Consensus 207 g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~ 286 (368) +|++|..|++++.+-...- .. ..+++.| ....+.|+.+++...--....+ .... .... T Consensus 208 ~P~~y~~Ll~~~~~~~~~~--~~-------~~~~~~G---------~vg~i~G~~V~~s~nlp~~~~~-~~~~---~~~~ 265 (347) T protein:vir:88 208 APEDYSAILSALMPNAANY--AA-------LIDPETG---------NIRNVMGFEVIEVPHLTVGGAG-DNNP---ADGV 265 (347) T ss_pred CHHHHHHHhcchhhhhhhh--cc-------ccchhcc---------eeeeeccceEEEeecccccccc-cccc---cccc Confidence 9999999999886554221 11 1122222 1234678877764332111110 0000 0001 Q ss_pred ecCCceeEEeecccccccccceeEEecccchHhhccCC-CceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEee Q lcl|NC_011811. 287 DSVGVGHAFPNTAMLGEANDLYQISYGPANKMGYVNTL-GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRAD 365 (368) Q Consensus 287 i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~-~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~ 365 (368) ...+.++.++.+. .....+-|....+-.-+.+.+++. ..+.=...+.+++..++.|.+-...=.-..||++++.++.+ T Consensus 266 ~~t~~~~~~~~~~-~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~ 344 (347) T protein:vir:88 266 APTNQKHIFPATA-TGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFT 344 (347) T ss_pred ccccccccccccc-ccccccccCcEEEEEechhhhhheecccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeC Confidence 1223334443331 111111111111111112222222 11112333344444455555555555667899999999999 Q ss_pred ecC Q lcl|NC_011811. 366 KAS 368 (368) Q Consensus 366 aa~ 368 (368) +|+ T Consensus 345 ~a~ 347 (347) T protein:vir:88 345 PAA 347 (347) T ss_pred CCC Confidence 999 No 23 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=73.91 E-value=0.16 Score=24.88 Aligned_cols=270 Identities=11% Similarity=0.004 Sum_probs=94.8 Q ss_pred cccCc-ccceEEEEEEeCceeeeeccCCCCCcc---ccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHH Q lcl|NC_011811. 37 RSVPT-SQTSVLLDITDYGISLLDPVDRDTRNA---ESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTE 112 (368) Q Consensus 37 ~~~~~-~t~~v~ie~~~~~~~l~p~v~rg~~~~---~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~ 112 (368) =.+.+ .++++.|.+. |... +....+|.+-. +-....+. +..+--..+.. +.-+|+-...+-. +-.... T Consensus 1 ~vr~i~~g~s~~~~~i-G~~~-~~~~~~G~~l~~~~~~~~~~e~-~itID~~l~~~---~~VdDiD~~qa~~--Dlr~e~ 72 (324) T protein:vir:99 1 MTRTITSGKSAQFPVM-GRTK-ARYLKQGQSLDDGREDIKHTEK-VITIDGLLTTD---VLIYDIEDAMNHY--DVRSEY 72 (324) T ss_pred CeeeeecCceEEEeee-eeeE-eccccCCCCcCCCcCCcCcccE-EEEecchhhhh---hhhhhHHHHhcCc--cchhHH Confidence 11222 2567777766 3322 23333333210 00111111 11111110000 1111222111111 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh-cC------cEEccCCCEEEeeehhcCcccceEEEEcCCCCc---cHHHHHHH Q lcl|NC_011811. 113 AMVRARKLQKIRMTHDITKEFLLMQAL-KG------KVVDSKGFLWADMYQTFGVEKKTVYFDLENPDA---DIDGAIDE 182 (368) Q Consensus 113 ~~~v~~~l~~~~~~~~~t~E~m~~~AL-~G------~i~d~dG~~~~d~~~~fG~~~~~v~~~l~~~~~---d~~~~~~~ 182 (368) .+.....|++..++.-... ++.+. .. .+.-.+|..+++ +.-.-.++.. .+...+.+ T Consensus 73 s~~~G~aLA~~~Dq~i~~~---~a~~~~~~a~~~~~~~~~~g~~~~~~-----------~~~~~~~~~~~~~~~~dai~~ 138 (324) T protein:vir:99 73 STQMGEALAMAADVANYAE---MAKLVNSRKETTNENIEGLGAASLVK-----------ITGKKEDPAKYGTQVIQALTY 138 (324) T ss_pred HHHHHHHHHHHHHHHHHHH---HHHhhhcccccccCCcccCCccceec-----------ccccccccccCHHHHHHHHHH Confidence 2222233333222221111 11111 11 111111111111 0000011122 34444445 Q ss_pred HHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEE Q lcl|NC_011811. 183 LVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVF 262 (368) Q Consensus 183 ~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~ 262 (368) +...+-++ - .|..+.+++++|++|.+|++|+.+.... +.. .+.++.| ....+.|+.+ T Consensus 139 a~~~Lde~----~-VP~~gR~~vv~P~~y~~Ll~~~~~~~~~--~~~-------~~~~~~G---------~V~~i~Gf~V 195 (324) T protein:vir:99 139 ARAAFAKK----Y-IPAGDRTFYTDPDTYSAILAALMPNAAN--YAA-------LIDPETG---------NIRNVMGFEV 195 (324) T ss_pred HHHHHhhc----C-CCCCCCEEEeChHHHHHHhhcccccccc--ccc-------ccceecc---------eEEEEeceEE Confidence 44444332 2 3556778999999999999987765421 110 1122222 2345678887 Q ss_pred EEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccchHhhccCCCceee------------- Q lcl|NC_011811. 263 RQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPANKMGYVNTLGQDLY------------- 329 (368) Q Consensus 263 ~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~~~~~y------------- 329 (368) ++=.- ++...+.. .....++.+++++...-. .....|.+ +.-+..++=|+ T Consensus 196 ~~Sn~-lp~~~~t~-------~~~a~~~~~~~~~~~~~~----~~~~ky~~-----d~~~~~gl~~~~~a~~tv~~~~~~ 258 (324) T protein:vir:99 196 VETPH-MTAQMVTN-------PTDAFDGTGHIFPATGDS----TTTGKMTV-----GADNVVGLFVHRSAVATLKLKDMA 258 (324) T ss_pred EecCC-cccccccc-------cccccccccccccccccc----cccccccc-----ccCceeEEEEehhheEEEeeecce Confidence 75221 11111100 011123333443322100 00001110 00111121111 Q ss_pred EEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 330 VFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 330 ~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) ...+.+++..++.|..-...=....||+++.-+++.+-+ T Consensus 259 ~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 259 LERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred ecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCc Confidence 111223333445555555555667799988766665554 No 24 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=67.31 E-value=0.25 Score=23.85 Aligned_cols=307 Identities=10% Similarity=0.077 Sum_probs=124.9 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCCCCCc-cccccCCceeE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDRDTRN-AESSAPESLRQ 78 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~rg~~~-~~~~~~~~~~~ 78 (368) |-+..|.+.. + .++.. ...+ ++++..+.++ ++++.|.+. |. .-+....+|.+- .+-...++ .. T Consensus 21 l~Le~f~GeV-----~-taF~~----~si~--~~~~~vRti~~gkS~qf~~~-G~-s~~~~~~pG~~ld~~~~~~dK-~~ 85 (401) T protein:vir:70 21 LLIEKFNGKV-----N-EQYLK----GENI--MSYFDVQTVTGTNTVSNKYL-GE-TELQVLAPGQSPAATSTQADK-NQ 85 (401) T ss_pred hHHhHhcchH-----H-HHHHH----Hhhh--cccceeeeecccceEEEEEe-ee-eEeeeecCCCCcCCCCccccc-EE Confidence 3333333321 1 11111 1111 2446666654 677888777 22 223333344421 00111222 12 Q ss_pred EEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEccCCCEEEeeehh Q lcl|NC_011811. 79 VAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALK-GKVVDSKGFLWADMYQT 157 (368) Q Consensus 79 ~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~-G~i~d~dG~~~~d~~~~ 157 (368) ..+-...+...-.-.=+|+|+ .|... ...+.+.+.+.|+++.++ +++|.+. ..+.+.++.... . T Consensus 86 ItID~lL~a~~~V~dlDe~q~--~yD~v--Rse~s~e~G~ALA~~~Dq-------~iiq~i~~aa~ana~~~~~~----p 150 (401) T protein:vir:70 86 LVIDATVIARNTVAHLHDVQG--DIDSL--KPKLATNQAKQLKRMEDE-------MLIQQMMLGGIANTQAKRTN----P 150 (401) T ss_pred EEeCceeehhhhhhhHHHHHh--ccccc--chHHHHHHHHHHHHHHHH-------HHHHHHHHhccccccccccC----C Confidence 333344333333333445553 22210 111233344444444333 3344331 222221110000 0 Q ss_pred cCcc-cceEEEEcC--CCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhc Q lcl|NC_011811. 158 FGVE-KKTVYFDLE--NPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYG 234 (368) Q Consensus 158 fG~~-~~~v~~~l~--~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~ 234 (368) +|.. ...+++.-. +...++...+..+...++.-.... .|..++++|+.+.+|+.|..|+.+... .+...+ T Consensus 151 ~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkd--VP~~r~vvl~pp~~Ys~Ll~~d~L~nr--d~~~s~--- 223 (401) T protein:vir:70 151 RVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQE--VDISDVAILMPWRYFNVLRDADRIVDK--TYTISQ--- 223 (401) T ss_pred CcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcC--CCccceEEEcCHHHHHHHHhcCcccch--hhcccc--- Confidence 0100 011222211 122343333333333332211112 245689999999999999999854420 011000 Q ss_pred ccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecc Q lcl|NC_011811. 235 LITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGP 314 (368) Q Consensus 235 ~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~ap 314 (368) .+....-......|+.+++-.- ++.... .+. .+..-+++.+..|..- ++....--..|-| T Consensus 224 -----------~g~~~~G~v~~vaGv~Vv~Snn-lP~~a~----~it-~~~ls~a~~G~~y~~~---~d~s~~~~v~f~~ 283 (401) T protein:vir:70 224 -----------SGATIQGFTLSSYNCPVIPSNR-FPKYSQ----GQT-HHLLSNEDNGYRYDPL---PAMNGAIAVLFTA 283 (401) T ss_pred -----------CCccccceEEEEeceEEEeecc-cccccc----ccc-cccccccCCCccCCCC---ccccceeEEEEeh Confidence 0111112334568888776322 111100 000 0111122333333210 0011111112222 Q ss_pred cchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 315 ANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 315 a~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) +.+++ ...++=...|.+++...+.|.+-...=..+.||++..-+|.+--. T Consensus 284 ----~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 284 ----DALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred ----hheEEEEeeccccchhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCcc Confidence 12333 234555667888888888888888888889999998776554332 No 25 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=67.08 E-value=0.26 Score=23.81 Aligned_cols=319 Identities=10% Similarity=0.035 Sum_probs=118.3 Q ss_pred Ccc-cccCCC--------------cccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCCC Q lcl|NC_011811. 1 MSL-TLANGS--------------RFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDRD 64 (368) Q Consensus 1 m~~-~~f~~d--------------~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~rg 64 (368) |.. ++-+++ .|+ -++..++.+. +.+. ++...+.++ ++++.|.++ |..++-- ..|| T Consensus 9 ~~~~n~~t~~~~~~~~~~~al~le~f~-geV~~~f~~~----si~~--~~~~~rti~~Gksv~f~~i-G~~t~~~-~t~G 79 (375) T protein:vir:10 9 LGRSNLSTGTGYGGATDKYALYLKLFS-GEMFKGFQHE----TIAR--DLVTKRTLKNGKSLQFIYT-GRMTSSF-HTPG 79 (375) T ss_pred cCccccCCccccccccchHHHHHHHHh-HHHHHHHHHH----Hhhh--ccccccccccCceEEEEee-eeeEEee-ecCC Confidence 111 111111 122 1223333332 3333 345555554 688888877 4444333 3334 Q ss_pred CCc-cccccCCcee--EEEEecc-eeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-h Q lcl|NC_011811. 65 TRN-AESSAPESLR--QVAFPLI-YFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQA-L 139 (368) Q Consensus 65 ~~~-~~~~~~~~~~--~~~f~~p-~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~A-L 139 (368) .+- .+-....+.+ ...+--. |+. ..-+|+-...+- -+++.+..+++-..+.+...-++++. . T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~----~~VdDiD~aqa~---------~Dlr~e~s~~~G~aLA~~~D~~i~~~l~ 146 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISS----AFVYDLDETLAH---------YELRGEISKKIGYALAEKYDRLIFRSIT 146 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhh----hhHhhHHHHhcC---------chhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 0101111111 1111111 110 111222211111 12223333333333333333333322 2 Q ss_pred cCcEEcc--CCC-EEEeeehhcCcccceE-EEEcCCCC---ccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHH Q lcl|NC_011811. 140 KGKVVDS--KGF-LWADMYQTFGVEKKTV-YFDLENPD---ADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFR 212 (368) Q Consensus 140 ~G~i~d~--dG~-~~~d~~~~fG~~~~~v-~~~l~~~~---~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~ 212 (368) ++....+ .+. ... .|.++... +..-+++. ..+...+.++.+.+.++ - .|..+.+++++|++|. T Consensus 147 kaa~~~~p~~~~~~~~-----~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~----~-VP~~~R~~vv~P~~y~ 216 (375) T protein:vir:10 147 RGARSASPVSATNFVE-----PGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEK----G-VSSQGRCAVLNPRQYY 216 (375) T ss_pred Hhhhhccccccccccc-----cCcceeeeccccccccccCHHHHHHHHHHHHHHHhhc----C-CCCCCCEEEeChHHHH Confidence 2221110 010 000 12221111 11111111 23445555555444332 2 3566778999999999 Q ss_pred HHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCC-c Q lcl|NC_011811. 213 KLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVG-V 291 (368) Q Consensus 213 al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~-~ 291 (368) +|+.|.+..++. +..... .+.........+.|+.+++-.. ++...+....+...-...-+.. . T Consensus 217 ~Ll~~~d~~~~~-n~d~~~--------------~~~~~~g~v~~i~Gv~V~~Sn~-lP~~~~~~~~~g~~~~~~a~~~~~ 280 (375) T protein:vir:10 217 ALIQDIGSNGLV-NRDVQG--------------SALQSGNGVIEIAGIHIYKSMN-IPFLGKYGVKYGGTTGETSPGNLG 280 (375) T ss_pred HHHhcCCcccee-eecccc--------------cceeccceEEEEeceEEEEecc-ccccccccccccccccccchhhhh Confidence 999885433211 110000 0001111234667887766222 2322222111111100010110 0 Q ss_pred eeEEeecccccccccceeEEecccc----------hHhhccCC-----CceeeEEEeeccCCCeeEEEeeecccccccCc Q lcl|NC_011811. 292 GHAFPNTAMLGEANDLYQISYGPAN----------KMGYVNTL-----GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRP 356 (368) Q Consensus 292 a~~~P~g~~~~~~~~~f~~~~apa~----------~~~~vn~~-----~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP 356 (368) -+..|...-...+.|.+..|.+-.+ +-+++++. ..+. +....+..-.++.|.+-...=+-+.|| T Consensus 281 ~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~-~~~~~~~~~q~~~i~~~~a~G~~~lrp 359 (375) T protein:vir:10 281 SHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQV-TNGDVSVIYQGDVILGRMAMGADYLNP 359 (375) T ss_pred ccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeecccccc-ccchhhheeeeeeeeeeeeeccCccCc Confidence 1122222111123344333332111 11222211 1111 111123445667777777777889999 Q ss_pred ceEEEEEeeecC Q lcl|NC_011811. 357 QLLVDVRADKAS 368 (368) Q Consensus 357 ~alv~~t~~aa~ 368 (368) ++.+.++..+++ T Consensus 360 ~~av~l~~~~~~ 371 (375) T protein:vir:10 360 AAAVELYIGATA 371 (375) T ss_pred eeEEEEecCcCc Confidence 999999998766 No 26 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=65.77 E-value=0.28 Score=23.63 Aligned_cols=271 Identities=14% Similarity=0.118 Sum_probs=116.1 Q ss_pred CcccccCCCcccHHHHH-----HHHHhc-CCCccchhhcCcccccCccc-ceEEEEEEeCceeeeeccCCCCCccccccC Q lcl|NC_011811. 1 MSLTLANGSRFLLADLT-----GDIANI-PNTYGYVNQLDLFRSVPTSQ-TSVLLDITDYGISLLDPVDRDTRNAESSAP 73 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt-----~~i~~~-p~~~~~l~~l~lF~~~~~~t-~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~ 73 (368) |.- ...=...+|. .-+++. .....|+.-||+++..|... .+|.+=. -.-..-+..|..|.. -+.++- T Consensus 1 mAe----~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK-~~~tgda~dVaEGe~-Iplskv 74 (295) T protein:vir:99 1 MAE----KNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYK-WEVTLDQTDPGEGET-IPLSKV 74 (295) T ss_pred CCC----cccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeee-eeeecccccccCCcc-cchhhh Confidence 222 1111112222 111221 34446677788888887653 4455421 111122234555543 233333 Q ss_pred CceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEe Q lcl|NC_011811. 74 ESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWAD 153 (368) Q Consensus 74 ~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d 153 (368) .+.+....+...-+.-+.+|++.|| +..||.. .-|+-.+. +..+.++++. -++.+|+..-....| T Consensus 75 t~~~~~t~t~kikK~rK~tTdEAIq-lsGygdp-vgead~qL----~~~ia~kId~----D~~~~lktat~t~tg----- 139 (295) T protein:vir:99 75 TRTKDKDYTVKWFKKRRATTAEAIA-RHGAARA-ITEADKRI----MRELQNGIKD----AFFTFLKTKPTKVKG----- 139 (295) T ss_pred eeeeeeeeEEEeeeecccccHHHHH-hcCCCch-hHHHHHHH----HHHHHHhhhH----HHHHHhccCceeeeh----- Confidence 2222334555555656678999987 3456632 11221111 1222223332 234444311111111 Q ss_pred eehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhh Q lcl|NC_011811. 154 MYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSY 233 (368) Q Consensus 154 ~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~ 233 (368) .++...+..+...+........ ...++|+.|.=..++++++.+ +|+.+..+ T Consensus 140 --------------------~~lq~a~a~~~~al~~f~Ee~~----~~~V~FVnP~D~a~yl~~A~~-----~~~~a~~f 190 (295) T protein:vir:99 140 --------------------VGLQKALSASWAKLATFNEFEG----SPLVSFVSPLDVANYLGDTKV-----GADASNVF 190 (295) T ss_pred --------------------hhHHHHHHHhhhhhhhcccccC----CceEEEEehHHHHHHHhcccc-----ccchhhhh Confidence 1122222222222222111111 245788888888888776665 34433221 Q ss_pred cccccccccccccccccccceEEeCCEEE-EEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEe Q lcl|NC_011811. 234 GLITGSLKTGRSDGVATATNEFPYRGVVF-RQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISY 312 (368) Q Consensus 234 ~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~-~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~ 312 (368) |.++ .++.|. +..+ ....++.|++++-+ .+.-..+| T Consensus 191 -------------------------G~~~L~nfLG~--------q~II--~S~kv~~G~~~aT~--------~~Ni~~ay 227 (295) T protein:vir:99 191 -------------------------GMTLLKNFLGM--------QNVI--VMPSVPEGKIYSTA--------VENLVFAS 227 (295) T ss_pred -------------------------hhhhhhhhhcc--------ceEE--EcccCCCceEEEee--------ccceEEEE Confidence 1111 122221 0000 01235566665543 45556677 Q ss_pred cccchHhhccCCCceeeEEEe------eccCCCeeEEEeeec----ccccccCcceEEEEEeeecC Q lcl|NC_011811. 313 GPANKMGYVNTLGQDLYVFEY------AKDRDEGTDFEAHSY----MMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 313 apa~~~~~vn~~~~~~y~~~~------~~~~~~g~~l~~eS~----pLpv~~rP~alv~~t~~aa~ 368 (368) .|.+--+ .++-|+-... .-.+...-.+..|+. -.-.|-|++.+|++|.+++. T Consensus 228 ~~~~~g~----l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~ 289 (295) T protein:vir:99 228 LNVKGGD----LGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAA 289 (295) T ss_pred ecCCchh----hhhhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCc Confidence 7765222 1122221111 001111122444443 34467899999999998887 No 27 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=63.28 E-value=0.32 Score=23.30 Aligned_cols=269 Identities=11% Similarity=0.016 Sum_probs=103.3 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVA 80 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~ 80 (368) |+.+.|-...|+-.-+.. +.+.=....+. ..+ ++.+.....+|.|-. -+......+...+.+. ...... -+... T Consensus 1 MA~~~~~pe~~~~~v~~~-~~~~lv~~~l~-~~~-~~~~~~~Gdtv~ip~-~~~~~~~d~~~~~~~~-~~~~~~-~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEE-WTAQTVFANLV-NRE-YEGTASKGNVVHIAG-VVAPTVKDYKAAGRQT-SADAIS-DTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHH-HHhhhccchhh-ccc-cccccccCceEEEee-cccccccccccCCCcc-Cccccc-cceEE Confidence 888877655665333333 32221111111 111 122222334555433 2233333444333322 111111 11122 Q ss_pred Eecceecc-ccccCH-HHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhc Q lcl|NC_011811. 81 FPLIYFKH-IESITP-EQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTF 158 (368) Q Consensus 81 f~~p~~~~-~~~i~a-~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~f 158 (368) +.+-..+. .-.|+- ++.|.. + + .+ . .+.++...+....+..++..+.+......+ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~---~--~-~~---~----~~~~~~~alA~~vD~~i~~~~~~a~~~~~~---------- 131 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVA---G--S-LE---A----YTRAGATALATDTDKFIADMLVDNGTALTG---------- 131 (273) T ss_pred EEEeeeeecceEeecHHHhhhh---c--c-HH---H----HHHHHHHHHHHHHHHHHHHHHhcccccccc---------- Confidence 22221111 111221 222221 1 1 11 1 122233344444444455444321100000 Q ss_pred CcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHH-HHHHHHHHHhhhhhcccc Q lcl|NC_011811. 159 GVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAK-VREAYMAQQAVSSYGLIT 237 (368) Q Consensus 159 G~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~-V~~~y~~~~~~~~~~~~~ 237 (368) +. .....++...+.++.+.+.++ . .|..+-+++|+|+++.+|++.+. +++++... .. T Consensus 132 -----~~----~~~~~~~~~~i~~a~~~ld~~----~-vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~--------~~ 189 (273) T protein:vir:10 132 -----SA----PTDADDAFDLIAKALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG--------DA 189 (273) T ss_pred -----cc----ccchhHHHHHHHHHHHHhhhc----C-CCcCCCEEEECHHHHHHHhcchhhhhhhhccc--------cc Confidence 00 011234455555555555332 2 24456678999999999998764 55433211 01 Q ss_pred cccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecc-cc Q lcl|NC_011811. 238 GSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGP-AN 316 (368) Q Consensus 238 ~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~ap-a~ 316 (368) +.++.|. ...+.|+.|++...- |.+++.-+-.| .++.+. ++- .+ T Consensus 190 ~~l~~G~---------ig~i~G~~v~~s~~l-------------------p~~~~~~~~~~-----~~~A~~--~a~q~~ 234 (273) T protein:vir:10 190 AGLRAGT---------IGNLLGARIVESNNL-------------------RDTDDEQFVAF-----HPSAAA--YVSQID 234 (273) T ss_pred cceeeee---------eeEEeceEEEEeccc-------------------ccCCccEEEEE-----ecccee--eeeeee Confidence 1222221 235778888763321 11111100001 111110 110 01 Q ss_pred hHhhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeec Q lcl|NC_011811. 317 KMGYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKA 367 (368) Q Consensus 317 ~~~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa 367 (368) ..|. ..++...+-.+..-...=.-..||++++.++.+.+ T Consensus 235 ~~e~------------~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 235 TVEA------------LRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hhhc------------ccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1110 01111122223333333444579999999998888 No 28 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=63.28 E-value=0.32 Score=23.30 Aligned_cols=269 Identities=11% Similarity=0.016 Sum_probs=103.3 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVA 80 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~ 80 (368) |+.+.|-...|+-.-+.. +.+.=....+. ..+ ++.+.....+|.|-. -+......+...+.+. ...... -+... T Consensus 1 MA~~~~~pe~~~~~v~~~-~~~~lv~~~l~-~~~-~~~~~~~Gdtv~ip~-~~~~~~~d~~~~~~~~-~~~~~~-~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEE-WTAQTVFANLV-NRE-YEGTASKGNVVHIAG-VVAPTVKDYKAAGRQT-SADAIS-DTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHH-HHhhhccchhh-ccc-cccccccCceEEEee-cccccccccccCCCcc-Cccccc-cceEE Confidence 888877655665333333 32221111111 111 122222334555433 2233333444333322 111111 11122 Q ss_pred Eecceecc-ccccCH-HHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhc Q lcl|NC_011811. 81 FPLIYFKH-IESITP-EQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTF 158 (368) Q Consensus 81 f~~p~~~~-~~~i~a-~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~f 158 (368) +.+-..+. .-.|+- ++.|.. + + .+ . .+.++...+....+..++..+.+......+ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~---~--~-~~---~----~~~~~~~alA~~vD~~i~~~~~~a~~~~~~---------- 131 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVA---G--S-LE---A----YTRAGATALATDTDKFIADMLVDNGTALTG---------- 131 (273) T ss_pred EEEeeeeecceEeecHHHhhhh---c--c-HH---H----HHHHHHHHHHHHHHHHHHHHHhcccccccc---------- Confidence 22221111 111221 222221 1 1 11 1 122233344444444455444321100000 Q ss_pred CcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHH-HHHHHHHHHhhhhhcccc Q lcl|NC_011811. 159 GVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAK-VREAYMAQQAVSSYGLIT 237 (368) Q Consensus 159 G~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~-V~~~y~~~~~~~~~~~~~ 237 (368) +. .....++...+.++.+.+.++ . .|..+-+++|+|+++.+|++.+. +++++... .. T Consensus 132 -----~~----~~~~~~~~~~i~~a~~~ld~~----~-vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~--------~~ 189 (273) T protein:vir:10 132 -----SA----PTDADDAFDLIAKALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG--------DA 189 (273) T ss_pred -----cc----ccchhHHHHHHHHHHHHhhhc----C-CCcCCCEEEECHHHHHHHhcchhhhhhhhccc--------cc Confidence 00 011234455555555555332 2 24456678999999999998764 55433211 01 Q ss_pred cccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecc-cc Q lcl|NC_011811. 238 GSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGP-AN 316 (368) Q Consensus 238 ~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~ap-a~ 316 (368) +.++.|. ...+.|+.|++...- |.+++.-+-.| .++.+. ++- .+ T Consensus 190 ~~l~~G~---------ig~i~G~~v~~s~~l-------------------p~~~~~~~~~~-----~~~A~~--~a~q~~ 234 (273) T protein:vir:10 190 AGLRAGT---------IGNLLGARIVESNNL-------------------RDTDDEQFVAF-----HPSAAA--YVSQID 234 (273) T ss_pred cceeeee---------eeEEeceEEEEeccc-------------------ccCCccEEEEE-----ecccee--eeeeee Confidence 1222221 235778888763321 11111100001 111110 110 01 Q ss_pred hHhhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeec Q lcl|NC_011811. 317 KMGYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKA 367 (368) Q Consensus 317 ~~~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa 367 (368) ..|. ..++...+-.+..-...=.-..||++++.++.+.+ T Consensus 235 ~~e~------------~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 235 TVEA------------LRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hhhc------------ccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1110 01111122223333333444579999999998888 No 29 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=57.71 E-value=0.43 Score=22.60 Aligned_cols=277 Identities=9% Similarity=0.032 Sum_probs=109.2 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVA 80 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~ 80 (368) |....-+...+-..++...|-......+-|.++ .+..++.+..+.+-...+...-+-.+..++. .+..+ ..-.... T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~--~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~-~~~~~~~ 180 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDL--LAQGRTSSNALEYVREEVFTNNADVVAEKAL-KPESD-ITFSKQT 180 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhh--cceecccCcceEEEEEecCCcceeeeccCcc-ccccc-cceeEEE Confidence 333332222222233333344444445556553 5555555555554444333333344444443 22222 2333444 Q ss_pred EecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhcCc Q lcl|NC_011811. 81 FPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTFGV 160 (368) Q Consensus 81 f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~fG~ 160 (368) +.+-.+...-.|+- ++.+ +.. .++.++.+ .+.+.+..+.|..++ .| |+.|+-. =|+ T Consensus 181 ~~~~k~~~~~~is~-ell~-------d~~-~l~~~i~~---~la~a~~~~~d~~~l---~G---~g~~~~~------~Gi 236 (385) T protein:vir:18 181 ANVKTIAHWVQASR-QVMD-------DAP-MLQSYINN---RLMYGLALKEEGQLL---NG---DGTGDNL------EGL 236 (385) T ss_pred EeeeeEEEeehhhH-HHHh-------hHH-HHHHHHHH---HHHHHHHHHHHHHHH---hc---cCCCCcc------ccc Confidence 45444444444443 3321 111 23444433 344556666665433 34 1222110 011 Q ss_pred cc-c-eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhccccc Q lcl|NC_011811. 161 EK-K-TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLITG 238 (368) Q Consensus 161 ~~-~-~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~ 238 (368) -. . ......+..+......+.++...+.... .. .-..+|++..|.+|.. +++. .+..+. . T Consensus 237 ~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~----~~---~~~~~~~~~~~~~l~~---lkd~-----~G~~l~---~ 298 (385) T protein:vir:18 237 NKVATAYDTSLNATGDTRADIIAHAIYQVTESE----FS---ASGIVLNPRDWHNIAL---LKDN-----EGRYIF---G 298 (385) T ss_pred ccccccccccccccccchHHHHHHHHHhhcccc----CC---CCEEEEcHHHHHHHHH---hhcC-----CCceec---c Confidence 10 0 1111122223334444555444443321 11 2256889999998853 2221 111110 0 Q ss_pred ccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccchH Q lcl|NC_011811. 239 SLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPANKM 318 (368) Q Consensus 239 ~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~ 318 (368) +.. .+ ..-.+-|+.++.... +|++...+ |-|..+|--.+. T Consensus 299 ~~~----~~-----~~~~l~G~pV~~~~~-------------------~p~~~~~~-----------gd~~~~~~~~~~- 338 (385) T protein:vir:18 299 GPQ----AF-----TSNIMWGLPVVPTKA-------------------QAAGTFTV-----------GGFDMASQVWDR- 338 (385) T ss_pred Ccc----cC-----CCceecceeeEEcCc-------------------CCCCcEEE-----------eecccEEEEEEe- Confidence 000 00 001233444432111 12222111 111111111111 Q ss_pred hhccCCCceeeEEEeec-c-CCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 319 GYVNTLGQDLYVFEYAK-D-RDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 319 ~~vn~~~~~~y~~~~~~-~-~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) .+..+=...+.. . .-..+.+.++.+.=..+.+|++++++|++||| T Consensus 339 -----~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 339 -----MDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred -----cceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 011000000000 0 01234566767777777999999999999999 No 30 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=57.71 E-value=0.43 Score=22.60 Aligned_cols=277 Identities=9% Similarity=0.032 Sum_probs=109.2 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVA 80 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~ 80 (368) |....-+...+-..++...|-......+-|.++ .+..++.+..+.+-...+...-+-.+..++. .+..+ ..-.... T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~--~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~-~~~~~~~ 180 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDL--LAQGRTSSNALEYVREEVFTNNADVVAEKAL-KPESD-ITFSKQT 180 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhh--cceecccCcceEEEEEecCCcceeeeccCcc-ccccc-cceeEEE Confidence 333332222222233333344444445556553 5555555555554444333333344444443 22222 2333444 Q ss_pred EecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhcCc Q lcl|NC_011811. 81 FPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTFGV 160 (368) Q Consensus 81 f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~fG~ 160 (368) +.+-.+...-.|+- ++.+ +.. .++.++.+ .+.+.+..+.|..++ .| |+.|+-. =|+ T Consensus 181 ~~~~k~~~~~~is~-ell~-------d~~-~l~~~i~~---~la~a~~~~~d~~~l---~G---~g~~~~~------~Gi 236 (385) T protein:vir:19 181 ANVKTIAHWVQASR-QVMD-------DAP-MLQSYINN---RLMYGLALKEEGQLL---NG---DGTGDNL------EGL 236 (385) T ss_pred EeeeeEEEeehhhH-HHHh-------hHH-HHHHHHHH---HHHHHHHHHHHHHHH---hc---cCCCCcc------ccc Confidence 45444444444443 3321 111 23444433 344556666665433 34 1222110 011 Q ss_pred cc-c-eEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHHhhhhhccccc Q lcl|NC_011811. 161 EK-K-TVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQAVSSYGLITG 238 (368) Q Consensus 161 ~~-~-~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~ 238 (368) -. . ......+..+......+.++...+.... .. .-..+|++..|.+|.. +++. .+..+. . T Consensus 237 ~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~----~~---~~~~~~~~~~~~~l~~---lkd~-----~G~~l~---~ 298 (385) T protein:vir:19 237 NKVATAYDTSLNATGDTRADIIAHAIYQVTESE----FS---ASGIVLNPRDWHNIAL---LKDN-----EGRYIF---G 298 (385) T ss_pred ccccccccccccccccchHHHHHHHHHhhcccc----CC---CCEEEEcHHHHHHHHH---hhcC-----CCceec---c Confidence 10 0 1111122223334444555444443321 11 2256889999998853 2221 111110 0 Q ss_pred ccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccchH Q lcl|NC_011811. 239 SLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPANKM 318 (368) Q Consensus 239 ~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~ 318 (368) +.. .+ ..-.+-|+.++.... +|++...+ |-|..+|--.+. T Consensus 299 ~~~----~~-----~~~~l~G~pV~~~~~-------------------~p~~~~~~-----------gd~~~~~~~~~~- 338 (385) T protein:vir:19 299 GPQ----AF-----TSNIMWGLPVVPTKA-------------------QAAGTFTV-----------GGFDMASQVWDR- 338 (385) T ss_pred Ccc----cC-----CCceecceeeEEcCc-------------------CCCCcEEE-----------eecccEEEEEEe- Confidence 000 00 001233444432111 12222111 111111111111 Q ss_pred hhccCCCceeeEEEeec-c-CCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 319 GYVNTLGQDLYVFEYAK-D-RDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 319 ~~vn~~~~~~y~~~~~~-~-~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) .+..+=...+.. . .-..+.+.++.+.=..+.+|++++++|++||| T Consensus 339 -----~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 339 -----MDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred -----cceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 011000000000 0 01234566767777777999999999999999 No 31 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=53.92 E-value=0.52 Score=22.16 Aligned_cols=302 Identities=13% Similarity=0.018 Sum_probs=111.8 Q ss_pred Ccc----------------cccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCC Q lcl|NC_011811. 1 MSL----------------TLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDR 63 (368) Q Consensus 1 m~~----------------~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~r 63 (368) |+- ++|= -.|+-.-+++--+. ..+. +++..+.++ ++++.|.+. |...+ ....+ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~-----s~~~--~~~~~rti~~g~s~~~~~i-G~~~~-~~~~p 70 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYT-----SKFA--PLMNIRDLRGSNVVRLDRL-GNVEA-KGRRA 70 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHh-----hhhc--cccceeeeccceeEEEeee-eeeee-ccccc Confidence 221 1111 12332222222221 2221 345555443 577777765 33322 33333 Q ss_pred CCCcc-ccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_011811. 64 DTRNA-ESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGK 142 (368) Q Consensus 64 g~~~~-~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~ 142 (368) |.+-- +....++. ...+....+...-.=.=+|+|+- -+-.....+...+.|+++.++ ....+.+++- T Consensus 71 G~~l~~~~~~~~k~-~itID~ll~a~~~VddlDe~~~~-----yDvR~e~s~~~G~aLA~~~Dq------~~~~~l~~aa 138 (335) T protein:vir:78 71 GEELERSRVVNDKW-NLTVDTLLYLRHQFDHQDEWTQS-----FDMRKEVAELDGQELARKFDQ------ACLIQVIKAA 138 (335) T ss_pred CcccCCCCcccCCe-EEEecceeechhhHhhHHHhhcC-----chhHHHHHHHHHHHHHHHHHH------HHHHHHHhhc Confidence 33210 00111111 12222222211111122222220 011122233333333333332 2222333332 Q ss_pred EE----ccCCCEEEeeehhcCcccceEEEEcCCCCcc---HHHHHHHHHHHHHH-HhccccccccccEEEEEChHHHHHH Q lcl|NC_011811. 143 VV----DSKGFLWADMYQTFGVEKKTVYFDLENPDAD---IDGAIDELVEHMED-TANTGGLTNGEQIIVLVDRAFFRKL 214 (368) Q Consensus 143 i~----d~dG~~~~d~~~~fG~~~~~v~~~l~~~~~d---~~~~~~~~~~~i~~-~l~~~~~~~~~~v~al~g~~~~~al 214 (368) .. ..++. ..-|.+..+ .+.-+.+.++ +...+.++...+.+ ..... +..+.+++++|++|.+| T Consensus 139 ~~~a~~~~~~~------~~~G~~~~~-~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~---~~~~rv~vv~P~~y~~L 208 (335) T protein:vir:78 139 AMDAPVDLEDA------FSPGVLEKL-DLTGLTAKEAAEKIVRMHRRVVETFIERDLGDA---VYSEGLTPMSPRVFSLL 208 (335) T ss_pred ccccccccCCC------cCCCcceee-eeccccccccHHHHHHHHHHHHHHHHhccCCCC---CCCccEEEeChHHHHHH Confidence 21 11110 001322211 1111122223 33333333333322 22111 12346789999999999 Q ss_pred hcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeE Q lcl|NC_011811. 215 TGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHA 294 (368) Q Consensus 215 ~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~ 294 (368) +.|+.+... -++..... +. ...-+.....|+.+++- -.++...+....+... +..+. T Consensus 209 l~~~~l~n~--~~~~s~~~----~~---------~~~g~v~~v~Gv~V~~S-n~lP~~~~t~~~lg~a-------~n~~~ 265 (335) T protein:vir:78 209 LEHDKLMSV--EYQATGAT----ND---------YVKSRVAILNGVKVLET-PRFATKAISAHPLGRH-------FNVSA 265 (335) T ss_pred hcccccccc--cccccccc----cc---------cccceeEEeeceEEEee-ccCCCCCCcccccccc-------CCccc Confidence 999865431 01110000 01 11123456778876652 2222222111111100 00000 Q ss_pred EeecccccccccceeEEecccchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 295 FPNTAMLGEANDLYQISYGPANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 295 ~P~g~~~~~~~~~f~~~~apa~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) . +......++| +.+ ++.+ ...+.=...|.+++..++.|.+--..=.-..||++.+.++.+-.- T Consensus 266 ~-------d~~~~~~~~~-~~~---Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:78 266 E-------EAERQIALFL-PSK---TLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGARRPDTAGAIELKGIE 329 (335) T ss_pred c-------cccceEEEEE-ecc---eEEEEEEEecccceeeccchhhHhhhHHHHcCCcccCcceEEEEEecCCC Confidence 0 0001111111 111 1111 122333455666666666677766666778999999999988766 No 32 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=52.57 E-value=0.55 Score=22.00 Aligned_cols=304 Identities=9% Similarity=-0.028 Sum_probs=105.5 Q ss_pred Ccc-cccCCCcccHHHHHHHHHhc---------CCCccchhhcC-cccccCcccceEEEEEEeCceeeeeccCCCCCccc Q lcl|NC_011811. 1 MSL-TLANGSRFLLADLTGDIANI---------PNTYGYVNQLD-LFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAE 69 (368) Q Consensus 1 m~~-~~f~~d~F~~~~Lt~~i~~~---------p~~~~~l~~l~-lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~ 69 (368) |+| +=+..+++++...+..|.++ ...- .+.++- -++.+.....+|.|-... ....-. ..|+.+- . T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~-v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d-~~~~~~i-~ 76 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKM-LDTSVVKTWGAQVKKGDTFHVPRIS-ELGVED-KATDVPV-G 76 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhc-chhhccccccccccCCceEEEeccC-cceeee-ecCCCcc-c Confidence 666 44446667666666555221 1100 111110 001111113344443221 212111 1222221 1 Q ss_pred cccCCceeEEEEecceec-cccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCC Q lcl|NC_011811. 70 SSAPESLRQVAFPLIYFK-HIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKG 148 (368) Q Consensus 70 ~~~~~~~~~~~f~~p~~~-~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG 148 (368) ...... ....+.+=..+ -.-.|+..|. .+..-+.+.+.+.++...+.+..+..++..+.+.-..+.+ T Consensus 77 ~~~~~~-~~~~itiD~~~~~~~~i~d~d~-----------~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~ 144 (341) T protein:vir:94 77 VQPVND-TDFVITVDTDRTTAVALDDLLE-----------IQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQ 144 (341) T ss_pred cccccC-ceEEEEEeeeeecceeechHHH-----------HhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccC Confidence 111111 11122221111 1111222111 1112223344444454555554444444444311111111 Q ss_pred CEEEeeehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHHHHHHH Q lcl|NC_011811. 149 FLWADMYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREAYMAQQ 228 (368) Q Consensus 149 ~~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~y~~~~ 228 (368) ... +...... .+++.......+.++.+.+.++ . .|..+.+++|+|+++..|++++....+- T Consensus 145 ~~~---------~~~~~~~-t~~~~~~~~~~i~~a~~~Lde~----~-VP~~gR~lvv~P~~~~~Ll~~~~~~~~~---- 205 (341) T protein:vir:94 145 NVF---------SSSNGAI-TGNGQAFSFAVFLAARRLLLEA----D-VPEEKIVLLISPGQESALFTIPQFISKD---- 205 (341) T ss_pred ccc---------cCccccc-cCchhhhhHHHHHHHHHHHhhc----C-CCccCCEEEeCHHHHHHHhhchhhhhhh---- Confidence 110 0111111 1122222334455555555433 2 3556677899999999999987765431 Q ss_pred hhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccce Q lcl|NC_011811. 229 AVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLY 308 (368) Q Consensus 229 ~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f 308 (368) ... ...++.| ....+.|+.+++-.. ++...+... ..+.+...|.+.... ..+. T Consensus 206 ~~g-----~~~l~~G---------~ig~i~G~~V~~Sn~-lp~~~~~~~----------~~~~~~~~~~~~~~~-i~~~- 258 (341) T protein:vir:94 206 FIN-----NAPIAQG---------QIGSLMGVRVIRTSL-IGNNSATGW----------RNGAPTIAPAEATPG-FTGS- 258 (341) T ss_pred ccc-----cchhhee---------eeeeEeceEEEEecc-ccccccccc----------cccccceeccccccc-cccc- Confidence 111 1112222 123567888876322 222111100 011111111111000 0000 Q ss_pred eEEecccchHhhccCCCce------eeEEEeec-----------c-------CCCeeEEEeeecccccccCcceEEEEEe Q lcl|NC_011811. 309 QISYGPANKMGYVNTLGQD------LYVFEYAK-----------D-------RDEGTDFEAHSYMMPICTRPQLLVDVRA 364 (368) Q Consensus 309 ~~~~apa~~~~~vn~~~~~------~y~~~~~~-----------~-------~~~g~~l~~eS~pLpv~~rP~alv~~t~ 364 (368) ..+++-+. ..-+..++- +-.|+... . ...+..|..-...=+=..||+++|.++. T Consensus 259 -~~~~~~~~-~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~ 336 (341) T protein:vir:94 259 -RYLPKQDS-FTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHT 336 (341) T ss_pred -cccccccc-ccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEec Confidence 00110000 000011110 11111110 0 0111111111111233579999999999 Q ss_pred eecC Q lcl|NC_011811. 365 DKAS 368 (368) Q Consensus 365 ~aa~ 368 (368) .+++ T Consensus 337 ~~~~ 340 (341) T protein:vir:94 337 TGDT 340 (341) T ss_pred CcCC Confidence 9999 No 33 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=51.54 E-value=0.58 Score=21.89 Aligned_cols=295 Identities=12% Similarity=0.033 Sum_probs=102.6 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCCCCCcc-ccccCCceeE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDRDTRNA-ESSAPESLRQ 78 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~rg~~~~-~~~~~~~~~~ 78 (368) +-+.+|.. .-+++--+. ..+. ++...+.++ +.++.|.+. |...+ ....||.+-. +....++.+ T Consensus 23 l~le~~~g-----eV~~af~~~-----s~~~--~~~~~r~i~~G~s~~~~~i-G~~~~-~~~~~g~~l~~~~~~~~~~~- 87 (334) T protein:vir:80 23 LHIEEHLG-----LVDASFMYS-----SKFA--SWMNVRSLRGTNQLRVDRV-GASTI-AGRKAGEELVVQKNVSDKLN- 87 (334) T ss_pred ehhhhhhh-----HHHHHHHHh-----hhhh--ccceeeeccccceEEEeee-cceee-eeecCCCCCCCCCcccCceE- Confidence 22233322 222222221 2222 234555444 677777765 33333 3333443210 111112211 Q ss_pred EEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEc---------cCCC Q lcl|NC_011811. 79 VAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVD---------SKGF 149 (368) Q Consensus 79 ~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d---------~dG~ 149 (368) ..+-...+...-.=.-+|+|+ -+ +-.....+.....|++. .| +....+++++.... .+|. T Consensus 88 l~ID~~l~~~~~VddiD~~q~---~~--D~rse~~~~~G~aLA~~---~D---~~~~~~l~kaa~~~~~~~~~~~~~~G~ 156 (334) T protein:vir:80 88 LTVDTVLYARHFFDKFDEWTS---NL--DVRKETAREDGIALARQ---YD---QACIIQLQKCGDFLAPAHLKPAFHDGI 156 (334) T ss_pred EEEeeeeehhhhHhhHHHHhc---Cc--chHHHHHHHHHHHHHHH---HH---HHHHHHHHHhhhhcccccccccccCCc Confidence 122222111111111222332 11 11111222233333322 22 22223333332211 1221 Q ss_pred EEEeeehhcCcccceEEEEcCCCCcc---HHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHH-HH Q lcl|NC_011811. 150 LWADMYQTFGVEKKTVYFDLENPDAD---IDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREA-YM 225 (368) Q Consensus 150 ~~~d~~~~fG~~~~~v~~~l~~~~~d---~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~-y~ 225 (368) ...... -|.+ ..+.++ +...+.++.+.+-++- ..--+..+.+++++|.+|.+|+.|+.+... |- T Consensus 157 ~~~~~~--~g~~--------~~~~~~~~~l~~a~~~a~~~L~e~d--vp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~ 224 (334) T protein:vir:80 157 LLPSTI--SGLA--------ADAAADADVLVAAHRQGVEAMVFRD--LGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFG 224 (334) T ss_pred ceeecc--cccc--------cchhhhHHHHHHHHHHHHHHHHhcC--CCCCcCCceEEEeChHHHHHHhcccccccceec Confidence 111000 0110 111222 2233333333332221 000013467899999999999999876532 11 Q ss_pred HHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccc Q lcl|NC_011811. 226 AQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEAN 305 (368) Q Consensus 226 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~ 305 (368) +.+.. +. ........+.|+++++-.. ++....... +.|..+. ... T Consensus 225 ~s~~~-------~~---------~~~g~i~~v~G~~V~~Sn~-~P~~~~t~~----------~~g~~~~--------~~a 269 (334) T protein:vir:80 225 AKEGG-------NS---------FVGGRIAMLNGVRVVETPR-FPQSAITAN----------ALGADFN--------VTD 269 (334) T ss_pred ccccc-------cc---------ccceeEEEEeceEEEeecC-CCCcccccc----------ccccccc--------ccc Confidence 11100 01 1112345678888875221 221111000 0010000 011 Q ss_pred cceeEEecccchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 306 DLYQISYGPANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 306 ~~f~~~~apa~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) |-|....+-.-+-+++.+ ...+.=...|.+++..++.|.+--..=.-..||++++-++++--- T Consensus 270 gd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 270 AEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred ccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEeeec Confidence 111111111001111111 111222334444444444444444444566799887766666555 No 34 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=50.56 E-value=0.61 Score=21.78 Aligned_cols=316 Identities=10% Similarity=0.055 Sum_probs=123.1 Q ss_pred Ccc-cccCCCcccHH-------------HHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCCCC Q lcl|NC_011811. 1 MSL-TLANGSRFLLA-------------DLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDRDT 65 (368) Q Consensus 1 m~~-~~f~~d~F~~~-------------~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~rg~ 65 (368) |+. +....+.++-. ++-.++.. ...+. +++..+.++ ++++.|.+. |..++ ....+|. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~----~si~~--~~~~vrti~~GkS~qf~~i-G~~~a-~y~~~G~ 72 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLK----GENIL--SYFDVQTVTGTNTVSNKYL-GETEL-QVLAPGQ 72 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHH----HHhhc--CcceeeeecccceEEEEEE-eeeEE-eeecccc Confidence 322 11112222211 11122211 12221 335555443 677888777 33333 3333333 Q ss_pred Cc-cccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCcE Q lcl|NC_011811. 66 RN-AESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQAL-KGKV 143 (368) Q Consensus 66 ~~-~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL-~G~i 143 (368) +- .+....++. ...+....+...-.-.=+|+|+ .+-. -...+.+...+.|+++.+++- .....++++. ...+ T Consensus 73 ~ldg~~~~~~k~-~ItID~lL~a~~~V~diDeaq~--~yD~--vRse~s~e~G~ALA~~~Dq~i-i~~i~~aa~a~t~~~ 146 (402) T protein:vir:97 73 SPNATPTQADKN-QLVIDTTVIARNTVAHIHDVQG--DIDS--LKPKLAMNQAKQLKRLEDQMA-IQQMLLGGIANTKAE 146 (402) T ss_pred ccCCCCcccccE-EEEeCceeechhhhhhHHHHHh--cccc--hhHHHHHHHHHHHHHHHHHHH-HHHHHHhhccccccc Confidence 11 001112221 2233333333333333344443 1110 011233444445555444421 1122222211 1111 Q ss_pred EccCCCEEEeeehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHHHHHH Q lcl|NC_011811. 144 VDSKGFLWADMYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAKVREA 223 (368) Q Consensus 144 ~d~dG~~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~V~~~ 223 (368) +..+...-+ |.+.. +...-..+..++......+....+. |. ..-.|..+.+++++|++|..|++|+.+... T Consensus 147 -~~~~~~~~~-----g~s~~-~~~t~~~a~~~~~~l~~ai~~a~~~-Ld-EkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~ 217 (402) T protein:vir:97 147 -RNKPRVKGH-----GFSIN-VNVTESEALANPQYVMAAVEYALEQ-QL-EQEVDISDVAIMMPWKFFNALRDADRIVDK 217 (402) T ss_pred -cccCccccc-----ccccc-cccccchhhcCHHHHHHHHHHHHHH-HH-hcCCCccccEEEeChHHHHHHhhcccccch Confidence 111100000 11111 1111112223433333333222221 11 112456678899999999999999875421 Q ss_pred HHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccc Q lcl|NC_011811. 224 YMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGE 303 (368) Q Consensus 224 y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~ 303 (368) -+...+ .+... .-......|+.+++-.- ++...+ .+. .+..-.+|.+..|... ++ T Consensus 218 --d~~~~~-----~g~~~---------~G~v~~v~Gv~Vv~Snn-lP~~a~----~it-~~~ls~a~~G~~y~~t---~d 272 (402) T protein:vir:97 218 --TYTISQ-----SGATI---------NGFVLSSYNCPVIPSNR-FPTFAQ----DQA-HHLLSNEDNGYRYDPI---AE 272 (402) T ss_pred --hhcccc-----CCccc---------cceeEEEeceEEEecCc-cccccc----ccc-ccccccCCCCccCCcC---cc Confidence 000000 01111 11234567887765211 221100 000 0111123333433311 11 Q ss_pred cccceeEEecccchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 304 ANDLYQISYGPANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 304 ~~~~f~~~~apa~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) -....-..|-| +++++ ...++=...|.+++...+.|.+-...=..+.||++.--++++--. T Consensus 273 ~t~~~~~~f~~----~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (402) T protein:vir:97 273 MNGAVAVLFTS----DALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) T ss_pred cceeEEEEEec----ceEEEEEeeccccchhhchhHHHHHHHHHHHhCCcccCccceEEEEEeccc Confidence 11112223333 23433 234555677888887888888888887888999988777555411 No 35 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=43.60 E-value=0.84 Score=21.00 Aligned_cols=306 Identities=11% Similarity=0.010 Sum_probs=113.4 Q ss_pred Ccc----------------cccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeeeccCC Q lcl|NC_011811. 1 MSL----------------TLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLDPVDR 63 (368) Q Consensus 1 m~~----------------~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p~v~r 63 (368) |+- ++|= -.|+-.-+++--+ ...+. +++..+.++ +.++.|.+. |... +....+ T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~-----~s~~~--~~~~~rti~~g~s~~~~~i-G~~~-~~~~~p 70 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAY-----TSKFA--PLMNIRDLRGSNVVRLDRL-GNVE-AKGRRA 70 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHh-----hhhhc--cccceeeeccceeEEEeee-eeee-eecccC Confidence 221 1111 1222222222211 12221 345555443 566777766 2222 222222 Q ss_pred CCCc--cccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011811. 64 DTRN--AESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKG 141 (368) Q Consensus 64 g~~~--~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G 141 (368) |.+- .++ ..++. +..+....+...-.=.=+|+|+- + +-.....+.+.+.|+++.++ ....+++++ T Consensus 71 G~~l~~~~~-~~~k~-~itVD~ll~a~~~I~dlDe~~~~--y---DvRse~s~e~G~aLA~~~D~------~~~~~i~~a 137 (335) T protein:vir:63 71 GEELERSRV-VNDKW-NLTVDTLLYLRHQFDHQDEWTQS--F---DMRKEVAELDGQELARKFDQ------ACLIQVIKA 137 (335) T ss_pred CcCcCCCCc-cccce-EEEecceeechhhhhhHHHHhcC--c---hhHHHHHHHHHHHHHHHHHH------HHHHHHHhh Confidence 2211 011 11221 22333322222222222333321 0 11122233333333333222 222333343 Q ss_pred cEEccCCCEEEeeehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccc---cccEEEEEChHHHHHHhcCH Q lcl|NC_011811. 142 KVVDSKGFLWADMYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTN---GEQIIVLVDRAFFRKLTGHA 218 (368) Q Consensus 142 ~i~d~dG~~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~---~~~v~al~g~~~~~al~~h~ 218 (368) -...+..+ +.+-+ .-|+++.+ ...-+++.+++ .++.+....+.+.|...- .| ..+.+++++|++|.+|++|+ T Consensus 138 a~~~a~~~-~~~~~-~~G~~~~~-~~tg~~~~~~~-~~l~~a~~~a~~~L~e~d-VP~~~~~dr~~vv~P~~y~~Ll~~~ 212 (335) T protein:vir:63 138 AAMDAPVD-LEDAF-SPGVLEKL-DLTGLTAKQAA-DKIVRMHRRVVETFIDRD-LGDAVYSEGLTPMSPRVFSLLLEHD 212 (335) T ss_pred ccccCccc-cCCCc-CCCcceee-eeccCcccccH-HHHHHHHHHHHHHHHhcc-CCCcccCceEEEeChHHHHHHhccc Confidence 22211100 00000 01333221 11112222232 333332222222222111 22 23468999999999999998 Q ss_pred HHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeec Q lcl|NC_011811. 219 KVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNT 298 (368) Q Consensus 219 ~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g 298 (368) .+-.. -++..... + ....-+.....|+.+++- -.++...+....+ |.++. T Consensus 213 ~l~n~--~~~~s~~~----~---------~~~~g~v~~v~Gv~V~~s-n~lP~~~~t~~~l----------g~a~n---- 262 (335) T protein:vir:63 213 KLMNV--EYQATGAT----N---------DYVKSRVAILNGVKVLET-PRFATKAIAAHPL----------GRHFN---- 262 (335) T ss_pred ccccc--cccccccc----c---------cccCceeEEeeceEEEee-ccCCCCCcccccc----------cccCC---- Confidence 65321 01110000 0 111123456778876652 1222222111111 11110 Q ss_pred ccccccccceeEEecccchHhhccC-CCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeecC Q lcl|NC_011811. 299 AMLGEANDLYQISYGPANKMGYVNT-LGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKAS 368 (368) Q Consensus 299 ~~~~~~~~~f~~~~apa~~~~~vn~-~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa~ 368 (368) ...+-|....|-.-+-+++.+ ...+.=..+|.+++..++.|.+--..=.-..||++.+.++.+-.- T Consensus 263 ----~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:63 263 ----VSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGARRPDTAGAIELKGIG 329 (335) T ss_pred ----ccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCcccccceEEEEEEcCCC Confidence 011122111111111112222 122333556666666666777766666788999999999986554 No 36 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=39.80 E-value=1 Score=20.58 Aligned_cols=312 Identities=11% Similarity=0.014 Sum_probs=102.5 Q ss_pred Cccccc-------------CCCc-------ccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeeee Q lcl|NC_011811. 1 MSLTLA-------------NGSR-------FLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLLD 59 (368) Q Consensus 1 m~~~~f-------------~~d~-------F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~p 59 (368) |.=.+- +.|. |+ .++..++.+ .+.+.+ ++..+.+. +.++.|...... + +. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~-g~V~~~f~~----~s~~~~--~~~~~~~~~G~sv~i~~ig~~-t-~~ 71 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFG-GEVLTAFAR----TSVTMP--RHMLRSIASGKSAQFPVIGRT-K-AA 71 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHH-HHHHHHHHH----hhhhhh--ccccccccccceeEeeeccce-e-ee Confidence 211111 1110 11 011112221 223332 23333322 555666544321 1 12 Q ss_pred ccCCCCC-cccc--ccCCceeEEEEecceeccccccCHHHHhcccCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011811. 60 PVDRDTR-NAES--SAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGT-AAELTTEAMVRARKLQKIRMTHDITKEFLL 135 (368) Q Consensus 60 ~v~rg~~-~~~~--~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~-~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~ 135 (368) ...+|.+ .... ....+ +. ++-+++.-.+.+=. -++.+..-+++.+..+++-..+.+...-.+ T Consensus 72 ~~~~g~~l~~~~~~~~~~e---~~-----------ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i 137 (347) T protein:vir:15 72 YLKPGENLDDKRKDIKHTE---KV-----------IHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAV 137 (347) T ss_pred eeccCCCCCCCCCCCccce---EE-----------EEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHH Confidence 2222221 0000 00111 11 11111111111100 011111112333344444444444444444 Q ss_pred HHHhcCcEEccCCCEEEeeehhcCcccc-eEEEEcCCCCcc-------HHHHHHHHHHHHHHHhccccccccccEEEEEC Q lcl|NC_011811. 136 MQALKGKVVDSKGFLWADMYQTFGVEKK-TVYFDLENPDAD-------IDGAIDELVEHMEDTANTGGLTNGEQIIVLVD 207 (368) Q Consensus 136 ~~AL~G~i~d~dG~~~~d~~~~fG~~~~-~v~~~l~~~~~d-------~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g 207 (368) +..|.+..--+..+. .-...+|-..- .....-+...++ +...+.++.+.+- -.- .|..+.+++++ T Consensus 138 ~~~l~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Ld----e~~-VP~~gR~~vv~ 210 (347) T protein:vir:15 138 LAELAGLVNLPDASN--ENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLT----KNY-VPAADRTFYTT 210 (347) T ss_pred HHHHHHHhhcccccc--ccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHh----hcC-CCccCCEEEeC Confidence 433321100000000 00000111100 000000111112 2333333333332 222 35567889999 Q ss_pred hHHHHHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceee Q lcl|NC_011811. 208 RAFFRKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVED 287 (368) Q Consensus 208 ~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i 287 (368) |++|..|++|++... ...+. ...++.| ....+.|++|++-. .++...+.... ... T Consensus 211 P~~y~~LL~~~~~~~----~d~~~-----~~~~~~G---------~Vg~i~G~~V~~Sn-~lp~~~~t~~~------~~~ 265 (347) T protein:vir:15 211 PDNYSAILAALMPNA----ANYQA-----LIDHERG---------TIRNVMGFEVVEVP-HLTAGGAGDTR------EDA 265 (347) T ss_pred HHHHHHHhccccccc----ccccc-----cccccce---------EEEEEeceEEEecc-ccccccccccc------ccc Confidence 999999999986432 11100 1112222 22456888887622 22222111111 113 Q ss_pred cCCceeEEeecccccccccceeEEecccchHhhccCC-CceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeee Q lcl|NC_011811. 288 SVGVGHAFPNTAMLGEANDLYQISYGPANKMGYVNTL-GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADK 366 (368) Q Consensus 288 ~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~-~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~a 366 (368) +.|..+.++.+.-. .....|....+-.-+-..+++. .+..-.-...+++..+..|..-...=.-..||++++.++++- T Consensus 266 ~~g~~~~~~~~~~~-~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~ 344 (347) T protein:vir:15 266 PADQKHAFPATSST-TVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPK 344 (347) T ss_pred cccccccccccccc-eeeeccccceeeeeccceeeeeEeeceeeeecccchhhhhhhehhhhcCCceeccccEEEEecCC Confidence 44555555443210 0011111111100001111111 011000000122222222332222234457999999999998 Q ss_pred cC Q lcl|NC_011811. 367 AS 368 (368) Q Consensus 367 a~ 368 (368) -+ T Consensus 345 ~~ 346 (347) T protein:vir:15 345 VS 346 (347) T ss_pred CC Confidence 88 No 37 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=36.38 E-value=1.2 Score=20.20 Aligned_cols=305 Identities=13% Similarity=0.010 Sum_probs=105.8 Q ss_pred Cc----------------------ccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceee Q lcl|NC_011811. 1 MS----------------------LTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISL 57 (368) Q Consensus 1 m~----------------------~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l 57 (368) |. .++|= -.|+-. +..++.+. +.+. ++...+.++ +.++.|.+. |..++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~ge-V~~~f~~~----s~~~--~~~~~r~i~~g~s~~~~~i-G~~~~ 71 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGE-VLTAFART----SVTT--SRHMVRSISSGKSAQFPVL-GRTQA 71 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHH-HHHHHHHH----hhhc--ccceeeeecccceEEEEee-ceeEE Confidence 11 01110 012222 22222221 2332 345555554 677888776 33333 Q ss_pred eeccCCCCCccccc---cCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011811. 58 LDPVDRDTRNAESS---APESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFL 134 (368) Q Consensus 58 ~p~v~rg~~~~~~~---~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m 134 (368) ....||.+-.... ...+. ...+.-..+..--.=.-+++|+ -. +-.....+.....|+ +.+|. . T Consensus 72 -~~~~~G~~l~~t~~~~~~~e~-~l~ID~~~y~~~~VdDiD~~q~---~~--D~r~~~~~~~G~aLA---~~~D~----~ 137 (344) T protein:vir:10 72 -AYLAPGENLDDIRKDIKHTEK-VITIDGLLTADVLIYDIEDAMN---HY--DVRSEYTSQLGESLA---MAADG----A 137 (344) T ss_pred -EeeecCCCCCCCCCCcccceE-EEEEcchhhhhhhhhhHHHHhc---Cc--chHHHHHHHHHHHHH---HHHHH----H Confidence 3444554321110 11111 1121111111111112233332 11 111111222222222 22222 2 Q ss_pred HHHHh-cCcE-EccCCCEEEeeehhcCcccceEEE-----EcCCCC---ccHHHHHHHHHHHHHHHhccccccccccEEE Q lcl|NC_011811. 135 LMQAL-KGKV-VDSKGFLWADMYQTFGVEKKTVYF-----DLENPD---ADIDGAIDELVEHMEDTANTGGLTNGEQIIV 204 (368) Q Consensus 135 ~~~AL-~G~i-~d~dG~~~~d~~~~fG~~~~~v~~-----~l~~~~---~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~a 204 (368) +++.| .+.. .++. +.+...+-+...+.. ..+.+. ..+...+.++...+.++ - .|..+.++ T Consensus 138 i~~~la~~a~~~~~~-----~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~----~-VP~~gR~~ 207 (344) T protein:vir:10 138 VLAEIAGLCNVESQY-----NENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKN----Y-VPSSDRVF 207 (344) T ss_pred HHHHHHhhhcccccc-----ccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhc----C-CCccCCEE Confidence 22221 1110 0000 000000000111110 011111 12333344444333322 2 36677889 Q ss_pred EEChHHHHHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccc Q lcl|NC_011811. 205 LVDRAFFRKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGING 284 (368) Q Consensus 205 l~g~~~~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~ 284 (368) +++|++|..|++|+.+-.. .+.. .+..+.| ....+.|+.+++-.- ++..... .. T Consensus 208 vv~P~~y~~Ll~~~~~~~~--~~~~-------~~~~~~G---------~V~~v~G~~V~~Sn~-lp~~~~~-------~~ 261 (344) T protein:vir:10 208 YCDPDSYSAILAALMPNAA--NYAA-------LIDPEKG---------SIRNVMGFEVVEVPH-LTAGGAG-------TS 261 (344) T ss_pred EeChHHHHHHhhccccccc--cccc-------ccceeee---------EEEEEeceEEEeccc-cccccCC-------cc Confidence 9999999999999876431 1111 1111222 224567887776332 1111110 11 Q ss_pred eeecCCceeEEeecccccccccceeEEecccchHhhccCC-CceeeEEEeeccCCCeeEEEeeecccccccCcceE--EE Q lcl|NC_011811. 285 VEDSVGVGHAFPNTAMLGEANDLYQISYGPANKMGYVNTL-GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLL--VD 361 (368) Q Consensus 285 ~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~-~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~al--v~ 361 (368) .....|..+.+|.+.- ......|....|-.=+-+.+.+. .+++=...+.+++..++.|.+-...=.-..||+++ |+ T Consensus 262 ~~~~tg~~~~~~~~~~-~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~ 340 (344) T protein:vir:10 262 REGTTGQKHAFPATKS-GNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVV 340 (344) T ss_pred cccccCccccccCCcc-cceeeecceeEEEeechhhhhhhhhccceeecccchhHHHHHHHHHhhcccceecccceEEEE Confidence 1234555666654310 00000111111110011111111 11111222233333444444444444457899988 66 Q ss_pred EEee Q lcl|NC_011811. 362 VRAD 365 (368) Q Consensus 362 ~t~~ 365 (368) +|-+ T Consensus 341 ~~~~ 344 (344) T protein:vir:10 341 FKTK 344 (344) T ss_pred eecC Confidence 6666 No 38 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=22.82 E-value=2.4 Score=18.51 Aligned_cols=271 Identities=11% Similarity=-0.004 Sum_probs=104.5 Q ss_pred CcccccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcccceEEEEEEeCceeeeeccCCCCCccccccCCceeEEE Q lcl|NC_011811. 1 MSLTLANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTSQTSVLLDITDYGISLLDPVDRDTRNAESSAPESLRQVA 80 (368) Q Consensus 1 m~~~~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~t~~v~ie~~~~~~~l~p~v~rg~~~~~~~~~~~~~~~~ 80 (368) |+...|-...|+-.-+...-+.+.. ..+. +-. ++..+....+|.|=. -+......+..++.+. ...... -+... T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~-~~l~-~~~-~~~~~~~GdTv~ip~-~~~~~~~d~~~~~~~~-~~~~~~-~~~~~ 74 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVF-ANLV-NRE-YEGIASKGNVVHIAG-VVAPTVKDYKAAGRQT-SADAIS-DTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccc-hhhh-hcc-ccccccCCcEEEEee-cCcccccccccCCCcc-Cccccc-cceEE Confidence 8887775556653333332222221 1111 111 122222233454422 2233333444444432 111111 11223 Q ss_pred Eecceecc-ccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEEeeehhcC Q lcl|NC_011811. 81 FPLIYFKH-IESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFLLMQALKGKVVDSKGFLWADMYQTFG 159 (368) Q Consensus 81 f~~p~~~~-~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m~~~AL~G~i~d~dG~~~~d~~~~fG 159 (368) +.+-..+. .-.|+-.|. .-..+. . ..+ +.++...+.+..+...+..+.+. +.. T Consensus 75 ~tid~~~~~~~~i~d~d~--~~~~~~---~---~~~----~~~~~~ala~~vD~~i~~~~~~a-----~~~--------- 128 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDR--VQVAGS---L---EAY----TRAGATALATDTDKFIADMLVDN-----GTA--------- 128 (273) T ss_pred EEEeeecccceeeccHHH--Hhhccc---H---HHH----HHHHHHHHHHHHHHHHHHHHhhc-----ccc--------- Confidence 33322111 112332221 111121 1 111 12233333444444445555321 000 Q ss_pred cccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHHHHHHhcCHH-HHHHHHHHHhhhhhccccc Q lcl|NC_011811. 160 VEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAFFRKLTGHAK-VREAYMAQQAVSSYGLITG 238 (368) Q Consensus 160 ~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~~~al~~h~~-V~~~y~~~~~~~~~~~~~~ 238 (368) .+ ... ....+++.+.+.++.+.+.++ . .|..+-+++|+|+++..|++.+. +.++... + ..+ T Consensus 129 ~~---~~~--~~~~~~~~~~i~~a~~~ld~~----~-vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~---~-----~~~ 190 (273) T protein:vir:79 129 LT---GSA--PSDADDAFDLIASALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTS---G-----DAA 190 (273) T ss_pred cc---ccc--ccchhhHHHHHHHHHHHhhhc----c-CCccCcEEEECHHHHHHHhhchhhhhhhhhc---c-----ccc Confidence 00 000 001123445555555444332 2 24556678999999999998764 4443211 0 011 Q ss_pred ccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCCceeEEeecccccccccceeEEecccchH Q lcl|NC_011811. 239 SLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVGVGHAFPNTAMLGEANDLYQISYGPANKM 318 (368) Q Consensus 239 ~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~~a~~~P~g~~~~~~~~~f~~~~apa~~~ 318 (368) .++.| ....+.|+.|++...- |.+.+..+-.| .++.+. +.-..+.+ T Consensus 191 ~l~~G---------~ig~~~G~~i~~s~~l-------------------p~~~~~~~~a~-----~~~A~~-~a~~~~~~ 236 (273) T protein:vir:79 191 GLRAG---------TIGNLLGARIVESNNL-------------------RDTDDEQFVAF-----HPSAAA-YVSQIDTV 236 (273) T ss_pred ceeee---------EeeEEeceEEEecccc-------------------cccCceEEEEE-----ecccee-eeeehhhh Confidence 22222 1246788888864331 11111110001 112111 00001111 Q ss_pred hhccCCCceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeec Q lcl|NC_011811. 319 GYVNTLGQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKA 367 (368) Q Consensus 319 ~~vn~~~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa 367 (368) |. ..++...+-.+..-...=.-..||+.++.++.+.+ T Consensus 237 e~------------~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 237 EA------------LRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hc------------ccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 10 01122223334443444455679999999998888 No 39 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=22.18 E-value=2.5 Score=18.42 Aligned_cols=316 Identities=12% Similarity=-0.035 Sum_probs=112.1 Q ss_pred Cc---------cc------------ccCCCcccHHHHHHHHHhcCCCccchhhcCcccccCcc-cceEEEEEEeCceeee Q lcl|NC_011811. 1 MS---------LT------------LANGSRFLLADLTGDIANIPNTYGYVNQLDLFRSVPTS-QTSVLLDITDYGISLL 58 (368) Q Consensus 1 m~---------~~------------~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~lF~~~~~~-t~~v~ie~~~~~~~l~ 58 (368) |. .+ +|= -.|+-.-+ .++.+- +.+.+ +...+.++ +.++.|.+....- + T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~-~~f~~~----s~~~~--~~~~rti~~G~sv~~~~iG~~~-~- 70 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVL-TAFTRT----SVTMN--KHLVRSIQSGKSAQFPVLGRTK-A- 70 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHH-HHHHHH----Hhhhh--hhhheeccccceEEeeecccee-E- Confidence 11 00 111 11221111 112111 22322 24444333 5666766443221 1 Q ss_pred eccCCCCCc----cccccCCceeEEEEecceeccccccCHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011811. 59 DPVDRDTRN----AESSAPESLRQVAFPLIYFKHIESITPEQVQGIRQAGTAAELTTEAMVRARKLQKIRMTHDITKEFL 134 (368) Q Consensus 59 p~v~rg~~~----~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~~~~t~E~m 134 (368) ....+|.+. ..+ ...+.+...=+.-|+.. -.=.-+++|..- +......+.....|++..+++-.. +.. T Consensus 71 ~~~~~G~~l~~~~~~~-~~~e~~ltID~~~y~~~-~VddiD~~q~~~-----D~rs~~~~~~g~ALA~~~D~~i~~-~l~ 142 (347) T protein:vir:94 71 AYLQPGENLDDKRKDM-KHTEKTINIDGLLTADV-LIYDIEDAMNHY-----DVRSEYTAQLGESLAMAADGAVLA-EMA 142 (347) T ss_pred eeeecCcCCCCCcCCc-cccceEEEEcchhhhhh-hhhhHHHHhcCc-----chHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 222222221 011 11222211111112110 001223344211 111112222233333322221111 111 Q ss_pred HHHHhcC---cEEccCC-CEEEeeehhcCcccceEEEEcCCCCccHHHHHHHHHHHHHHHhccccccccccEEEEEChHH Q lcl|NC_011811. 135 LMQALKG---KVVDSKG-FLWADMYQTFGVEKKTVYFDLENPDADIDGAIDELVEHMEDTANTGGLTNGEQIIVLVDRAF 210 (368) Q Consensus 135 ~~~AL~G---~i~d~dG-~~~~d~~~~fG~~~~~v~~~l~~~~~d~~~~~~~~~~~i~~~l~~~~~~~~~~v~al~g~~~ 210 (368) .+.+... +...+.| ...+.... + .+..-+-......+...+.++. +.|...- .|..+.+++++|++ T Consensus 143 ~~a~~~~~~~~~~~g~~~~~~v~i~~--~---~~~~~~~~~~~~~~~d~i~~a~----~~Lde~d-VP~~~R~~vv~P~~ 212 (347) T protein:vir:94 143 KLCNLPTANNENIAGLGKAHVLEVGD--Q---ATLQGDQVKLGQAIIAQLTLAR----AKLTGNY-VPSSDRVFYTTPDN 212 (347) T ss_pred HhhccccccccccccCCcceeEeeec--c---ccccccccccHHHHHHHHHHHH----HHhhhcC-CCCCCCEEEeChHH Confidence 1111111 1111111 11111000 0 0000000001112233333333 3333322 36668899999999 Q ss_pred HHHHhcCHHHHHHHHHHHhhhhhcccccccccccccccccccceEEeCCEEEEEccccccCCCccccccccccceeecCC Q lcl|NC_011811. 211 FRKLTGHAKVREAYMAQQAVSSYGLITGSLKTGRSDGVATATNEFPYRGVVFRQYNGKFTDKRNTVHKLVGINGVEDSVG 290 (368) Q Consensus 211 ~~al~~h~~V~~~y~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~f~~~Gi~~~~y~g~~~~~~~~~~~~~~~~~~~i~~~ 290 (368) |..|+++..... .++. ..+.++.| ....+.|+.+++-.. ++...+...... .-....+ T Consensus 213 y~~LLk~~~~~~--~~~~-------~~~~~~~G---------~V~~v~G~~V~~Sn~-~p~~~~~~~~~~---~~~~~~~ 270 (347) T protein:vir:94 213 YSAILAALMPNA--ANYQ-------ALIDPSTG---------SIRNVMGFEVIEVPH-LTAGGAGDNRAE---EGVAPTN 270 (347) T ss_pred HHHHHHhhcccc--cccc-------cccccccc---------eeEEeeceEEEEcCc-cccccCcccccc---ccccccc Confidence 999986533221 1111 11112211 335678888875322 111111000000 0011223 Q ss_pred ceeEEeecccccccccceeEEecccchHhhccCC-CceeeEEEeeccCCCeeEEEeeecccccccCcceEEEEEeeec Q lcl|NC_011811. 291 VGHAFPNTAMLGEANDLYQISYGPANKMGYVNTL-GQDLYVFEYAKDRDEGTDFEAHSYMMPICTRPQLLVDVRADKA 367 (368) Q Consensus 291 ~a~~~P~g~~~~~~~~~f~~~~apa~~~~~vn~~-~~~~y~~~~~~~~~~g~~l~~eS~pLpv~~rP~alv~~t~~aa 367 (368) ..+.+|.+.- ....+-|....|-.-+-+.+.+. ..++=...|.+.+-.++.|.+--..=.-..||++.+.+++++| T Consensus 271 ~~~~~~~~~~-~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 271 QKHAFPDTAS-GDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred cccccccccc-ccccccccceEEEEechhhhhhhhhcccceeeeechhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 3444443310 00011111111111122233332 2333345566667777778777777778899999999999999 Done!