Query         lcl|Aclame:protein:vir:95258|NCBI_annot:Phage conserved protein|genbank:acc:NP_944891;genbank:gi:38707831;genbank:GeneID:2744044
Match_columns 368
No_of_seqs    112 out of 177
Neff          8.1
Searched_HMMs 1612
Date          Sun Dec 1 04:39:08 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_78 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_78_vs_rec_db.hhr

 No Hit                              Prob  E-value  P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:95258 Length: 368   100.0   3E-131   2E-134  735.9  36.2  368    1-368     1-368 (368)
  2 protein:vir:10324 Length: 320   100.0   3E-103   2E-106  582.9  31.4  319   19-368     1-319 (320)
  3 protein:vir:6378 Length: 346 #  100.0  3.7E-64  2.3E-67  368.3  29.3  335    7-364     1-346 (346)
  4 protein:vir:3424 Length: 341 #  100.0    7E-60  4.3E-63  344.9  29.4  324    2-364     1-341 (341)
  5 protein:vir:393 Length: 341 #   100.0  8.7E-59  5.4E-62  338.9  27.9  323    2-364     1-341 (341)
  6 protein:vir:96490 Length: 348   100.0  7.3E-51  4.5E-54  295.4  27.1  329    1-367     1-348 (348)
  7 protein:vir:4902 Length: 348 #  100.0    2E-49  1.3E-52  287.5  26.9  328    1-367     1-348 (348)
  8 protein:vir:2736 Length: 348 #  100.0    9E-49  5.6E-52  284.0  27.5  327    1-367     1-348 (348)
  9 protein:vir:106590 Length: 349  100.0  1.4E-44  8.4E-48  261.1  26.6  315    1-364    17-349 (349)
 10 protein:vir:98480 Length: 348   100.0  5.7E-43  3.5E-46  252.2  26.7  330    1-365     1-348 (348)
 11 protein:vir:78006 Length: 409   100.0  2.7E-31  1.7E-34  188.2  22.6  346    1-368    20-390 (409)
 12 protein:vir:79503 Length: 409   100.0  2.7E-31  1.7E-34  188.2  22.6  346    1-368    20-390 (409)
 13 protein:vir:79078 Length: 307    99.2  6.7E-13  4.2E-16   87.3  14.6  300    1-364     1-307 (307)
 14 protein:vir:107882 Length: 307   99.1  2.5E-11  1.6E-14   78.7  18.9  301    1-364     1-307 (307)
 15 protein:vir:99888 Length: 309    98.7  1.8E-09  1.1E-12   68.6  16.0  302    5-365     1-309 (309)
 16 protein:vir:108211 Length: 318   94.5   0.0044  2.7E-06   33.5  17.6  291    1-367     7-318 (318)
 17 protein:vir:103323 Length: 364   92.5    0.011  6.9E-06   31.3  21.7  310    1-368    18-341 (364)
 18 protein:vir:6324 Length: 335 #   91.4    0.016    1E-05   30.4  18.7  304    1-368    18-330 (335)
 19 protein:vir:98819 Length: 437    91.1    0.017  1.1E-05   30.2  11.1  349    1-368     1-436 (437)
 20 protein:vir:78935 Length: 335    90.3    0.021  1.3E-05   29.7  14.9  305    1-368    18-330 (335)
 21 protein:vir:100057 Length: 375   74.7     0.16  9.6E-05   25.0  16.3  320    1-368    11-372 (375)
 22 protein:vir:3969 Length: 287 #   65.9     0.28  0.00017   23.7  12.5  272    2-365     1-287 (287)
 23 protein:vir:97031 Length: 402    65.4     0.28  0.00018   23.6  14.3  309    1-368    22-337 (402)
 24 protein:vir:99675 Length: 324    57.4     0.44  0.00027   22.6  15.0  274   36-368     1-300 (324)
 25 protein:vir:80213 Length: 334    52.2     0.56  0.00035   22.0  15.3  294    1-368    20-334 (334)
 26 protein:vir:94711 Length: 347    43.1     0.86  0.00053   20.9  16.9  308    1-367     1-347 (347)
 27 protein:vir:98871 Length: 314    37.9      1.1  0.00068   20.4  13.6  269    1-367    27-314 (314)
 28 protein:vir:105645 Length: 400   37.0      1.1  0.00071   20.3  16.2  300    1-368    22-337 (400)
 29 protein:vir:94528 Length: 286    34.8      1.3  0.00079   20.0  11.9  272    2-365     1-286 (286)
 30 protein:vir:7019 Length: 401 #   27.9      1.8   0.0011   19.2  15.7  303    1-368    22-335 (401)
 31 protein:vir:3033 Length: 272 #   23.7      2.3   0.0014   18.6  13.8  267    1-368     1-271 (272)
 32 protein:vir:9820 Length: 272 #   23.7      2.3   0.0014   18.6  13.8  267    1-368     1-271 (272)
 33 protein:vir:3613 Length: 272 #   23.5      2.3   0.0014   18.6  12.6  258    1-366     1-272 (272)

No 1
>protein:vir:95258 Length: 368 # NCBI annotation: Phage conserved protein # Family: family:all:570 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944891;genbank:gi:38707831;genbank:GeneID:2744044
Probab=100.00 E-value=3.3e-131 Score=735.93 Aligned_cols=368 Identities=100% Similarity=1.460 Sum_probs=361.4

Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEEEEcCceeeeeccCCCCCccccccCCceeEEEE
Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISF 80 (368)
Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~~f 80 (368)
|||||++|+||+++||++||++|+.|++|++||||++++++|++|.||++++.++|+|+++||+++.++.++++++++.| T Consensus 1 ~~d~f~~d~Fs~~~LT~ain~~p~~p~~l~~lglF~~~~v~t~~v~iE~~~~~l~Lvp~~~rg~~~~~~~~~~~r~~~~f 80 (368) T protein:vir:95 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISF 80 (368) T ss_pred CcccccCCcccHHHHHHHHHhcCCCcceecccccccCCCccceEEEEEEEcCeEEEccccCCCCCCcccccCCceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999997777788888999999 Q ss_pred ecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhhcCCC Q lcl|Aclame:pro 81 PMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQFDVE 160 (368) Q Consensus 81 ~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~fG~~ 160 (368) ++|||+++|.|+|+||||+|+||+++++++++.+++++|++||++|++|+||||+|||+|+|+|+||++++|||++||++ T Consensus 81 ~~ph~~~~d~I~a~eiQg~RafG~~~~l~~v~~~v~~kl~~~r~~~d~T~E~~r~gAL~G~ilDadGtvl~dly~eFGit 160 (368) T protein:vir:95 81 PMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQFDVE 160 (368) T ss_pred ecceeccccccchHHHccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCeeECCCCcEEecchhhhCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhhcccc Q lcl|Aclame:pro 161 KKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLR 240 (368) Q Consensus 161 ~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~~~~~ 240 (368) |++++|+|+++++||+++|++|+++|+++++++.+++.++++||||++||++|++||+|+++|++|++++.+++++.++| T Consensus 161 ~~~v~f~l~~~~tdv~~~~~~~~~~i~d~l~g~~~~~~~~v~alcg~~Ffd~L~~h~~Vkeay~~~~~a~~~~~lr~~~r 240 (368) T protein:vir:95 161 KKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLR 240 (368) T ss_pred cceEEEEeCCCCcCHHHHHHHHHHHHHHhhcccccccccceEEEEChHHHHHhhcChhHHHHHHHHHhhhhhhhhccccc Confidence 
99999999999999999999999999999998888888899999999999999999999999999999999999999999 Q ss_pred cccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccchhhc Q lcl|Aclame:pro 241 TGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPKMGYA 320 (368) Q Consensus 241 ~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~~~~~ 320 (368) .+......+++++|+|+||+|++|+|++++.+|+.++++++++++|++++|++||.++..+.++|+|++||||||+++++ T Consensus 241 ~g~~~~~~~~~~~F~fgGi~f~eYrg~~~~~~g~~~~~v~~d~v~I~~gea~~~P~G~~~~~~~~~F~~~~aPad~~e~v 320 (368) T protein:vir:95 241 TGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPKMGYA 320 (368) T ss_pred cccccccccccceeEecCEEEEEcceeecCCCcceeeeecCCceeeccCceEEEeecccccccCcceEEEecCCCcHhhc Confidence 99999999999999999999999999999999999999999999999999999999988888999999999999999999 Q ss_pred ccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 321 NTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 321 n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) |+.|+|+|+|+|++++++|++|++||||||||+||++|+++|++|+|| T Consensus 321 Nt~g~p~Ya~~~~~~~~~g~~le~qSnpLpic~RP~~lv~~~~~a~~~ 368 (368) T protein:vir:95 321 NTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) T ss_pred CCCcccccceeeeccCCCeeEEEEeecccchhcccceeEEEEecCCCC Confidence 999999999999999999999999999999999999999999999999 No 2 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=100.00 E-value=2.7e-103 Score=582.88 Aligned_cols=319 Identities=29% Similarity=0.507 Sum_probs=292.9 Q ss_pred HHhcCCcccchhhcccccccccccceEEEEEEcCceeeeeccCCCCCccccccCCceeEEEEecccccccccccHHHHhc Q lcl|Aclame:pro 19 VQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISFPMMYFKEVESITPDEIQG 98 (368) Q Consensus 19 
i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~ 98 (368) ||.+|...+++..| ||++++++|++|.||++++.++|+|+++||+++. +.+++++++++|++|||+++++|+|+|+|+ T Consensus 1 i~~~P~~~g~~~gl-ff~~~~v~T~~V~ie~~~~~l~lip~v~rg~~g~-~~~~~~~~~~~f~~p~~~~~d~i~a~eiq~ 78 (320) T protein:vir:10 1 MNLLPVNYGDSRAL-FAREKKVRTRTILVEEKNGVLTLIQSREPGSTEN-VAKRGKRKVRSFVIPHLPLEDVILPDEYEG 78 (320) T ss_pred CCcCCchhhhhhhh-ccCCCCcccceEEEEEecCceeeeeccCCCCCce-eecCCcceEEEEecceeccCCccCHHHHcC Confidence 88888888876555 5588899999999999999999999999999875 566788899999999999999999999999 Q ss_pred ccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhhcCCCcceeEEecCCCCCcHHHH Q lcl|Aclame:pro 99 VRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQFDVEKKTIYFDLDNPNADIDAS 178 (368) Q Consensus 99 ~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~fG~~~~~~~~~l~~a~~di~~~ 178 (368) +|+||+ ++++++++++++++.+|+++|++|+||||+|||+|+|+|+||++++|||++||++|+++.|+|+++++|+.++ T Consensus 79 ~Ra~G~-~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~~l~~a~~dv~~~ 157 (320) T protein:vir:10 79 LRGFGT-TALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYFGLDNKDANVAES 157 (320) T ss_pred cccCCC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEEecCCCCccHHHH Confidence 999997 6889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhhcccccccccccccccceeeeCC Q lcl|Aclame:pro 179 IEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGG 258 (368) Q Consensus 179 l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~g 258 (368) |.+++++|+++++|.. .++++|+||++||++|++||+|+|+|++|+++... ++. 
.....|.|+| T Consensus 158 ~~~~~~~i~~~l~g~~---~t~v~al~g~~f~~al~~h~~Vke~y~~~~~~~~~--l~~-----------~~~~~f~~gG 221 (320) T protein:vir:10 158 CRQVLRHVEDNLRGDV---MKDVSVDVSEEFFDKFIKHASVKEVFLNHEAAVNR--LGG-----------DTRKGFKFGG 221 (320) T ss_pred HHHHHHHHHHHhccCC---CCceEEEEChHHHHHHhcCHHHHHHHHhhhhhhhh--ccc-----------cccceEEecC Confidence 9999999999998653 36789999999999999999999999999775432 222 2345689999 Q ss_pred EEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccchhhcccccceeeEeeeeccCCC Q lcl|Aclame:pro 259 VKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPKMGYANTLGQELYVFEYEKDRDE 338 (368) Q Consensus 259 i~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~~~~~n~~~~~~y~~~~~~~~~~ 338 (368) |+|++|+|+|.+.+|+.+.+ |+++++++||. ++++.|++||||+|+++++|+.++|||+|+|++++++ T Consensus 222 i~~~~Y~g~~~d~~g~~~~~-------I~~~~~~~~p~-----g~~~~f~~~~apad~~e~vnt~g~p~y~k~~~~~~~~ 289 (320) T protein:vir:10 222 LIFNENRARHVDEEGKETRF-------IKAGKGHAFPT-----GTTNTFFTALAPADFNETAGTLGKRYYAKMEPRRMGR 289 (320) T ss_pred EEEEEcccEEEcCCCCeeEe-------ecCCeeEEEEe-----cCchhheeeecccCcHhhcCCcccccccccccccCCC Confidence 99999999999999888865 45678999998 6789999999999999999999999999999999999 Q ss_pred eeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 339 GIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 339 ~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) |++|++||+|||+|+||++|+++|++|+|- T Consensus 290 g~~l~~qS~PLpi~~rP~~lv~~~~~a~~~ 319 (320) T protein:vir:10 290 GFDLHSQSNVLPMCCRPGVLVELDAAAQPA 319 (320) T ss_pred eEEEEeeecccccccCcceEEEEEecCCCC Confidence 999999999999999999999999999998 No 3 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=100.00 E-value=3.7e-64 Score=368.35 Aligned_cols=335 Identities=14% Similarity=0.085 Sum_probs=269.0 Q ss_pred CCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEEEEcCceeeeeccCCCCCccccccCCceeEEEEeccccc Q lcl|Aclame:pro 7 
KSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISFPMMYFK 86 (368) Q Consensus 7 ~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~~f~~p~~~ 86 (368) =|.|++.+|+++|+++|.++ +|.+++||++.++.|++|.||.+++.+.++|+++|+.++....++ ++++..|++||++ T Consensus 1 ~d~f~~~~l~~~i~~~p~~~-~l~~~~fp~~~~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~~~~-g~~~~~~~~p~i~ 78 (346) T protein:vir:63 1 MEIFDTLTLAGVIQSGPALS-MYWQGFYPNEITFDTDEILFDLVFKDKKLAPFVAPNVQGRVIAAR-GYTTKTFRPAYVK 78 (346) T ss_pred CCccCHHHHHHHHHhcCCcc-chhhhcCccccccccceEEEEEecCceeeeeeecCCCCcceeccc-ceeeeEeecCccC Confidence 24588999999999999765 578888888889999999999999999999999999988765554 5688899999999 Q ss_pred ccccccHHHHhcccC-----CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhhcCCCc Q lcl|Aclame:pro 87 EVESITPDEIQGVRQ-----PGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQFDVEK 161 (368) Q Consensus 87 ~~~~i~a~dlq~~R~-----~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~fG~~~ 161 (368) +++.++|+|++++|. +|+.+..+++.+.+++++.+|+++|++|+||||+|||++..++.+|+...+++.+||++. T Consensus 79 ~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~~vdfg~~~ 158 (346) T protein:vir:63 79 PKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQRVDFGRDP 158 (346) T ss_pred ccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEEEEeeCCCc Confidence 999999999998764 466677888999999999999999999999999999987667777776667777799864 Q ss_pred c-eeE----EecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 162 K-TIY----FDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQIT 236 (368) Q Consensus 162 ~-~~~----~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~ 236 (368) . .++ ..|+++++||.+.|++|.++++++.+ ..+.+++||+++|++|+.|++|+++|..+.......... 
T Consensus 159 ~~~~~lt~~~~W~~~~adp~~di~~~~~~~~~~~g------~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~ 232 (346) T protein:vir:63 159 ALTVQLTGGAAWDQATSDPLGNIQTMRTTAWKKSN------STITRLTMGLDAWSLFSQKPAVVELLNLFYKGSTSDFNR 232 (346) T ss_pred cceeeecccccCCCCCCCHHHHHHHHHHHHHHccC------CceEEEEECHHHHHHHhcCHHHHHHHhhhccccccccch Confidence 2 333 35778999999999999999987533 245689999999999999999999998765443221111 Q ss_pred ccccccccccccc-ccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeecccc Q lcl|Aclame:pro 237 GSLRTGGADGVQA-HMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCP 315 (368) Q Consensus 237 ~~~~~~~~~~~~~-~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~ 315 (368) ..+..+....... ....+.++|++|+.|+++|.+.+|+.+.++|++ ++.++|. +. ...++|||.. T Consensus 233 ~~l~~~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d~~G~~~~~ip~~-------~v~~~p~-----~~--~g~~~yg~~~ 298 (346) T protein:vir:63 233 SRLDDGSPVQYQGTIGGYNGMGTLELYTYHDTYTGDDNTEQEILGSY-------DVVGTGP-----GL--QGTQCFGAIM 298 (346) T ss_pred hhcccchhhhhhhhHhhhhccCCeEEEEeccEEEcCCCceeccccCC-------eEEEEec-----CC--cceEEEeecc Confidence 1111111111111 112356789999999999999999888776654 5556664 22 2357899988 Q ss_pred chhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEe Q lcl|Aclame:pro 316 KMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRAD 364 (368) Q Consensus 316 ~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~ 364 (368) +.+. 
|..+.++|+++|..++|++.++++||+|||+|.+|++++.+||+ T Consensus 299 d~~~-~~~~~~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 299 DFKN-GLVPTRMFPKMWEEEDPSVAMLMTQSAPLMVPAQPNASFRMTVK 346 (346) T ss_pred cccc-CcccceeeeEEEEecCCCEEEEEEeeeccceecCCCcEEEEEeC Confidence 7774 88999999999999999999999999999999999999999999 No 4 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=100.00 E-value=7e-60 Score=344.89 Aligned_cols=324 Identities=14% Similarity=0.081 Sum_probs=247.1 Q ss_pred cccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEEEEcCceeeeeccCCCCCccccccCCceeEEEEe Q lcl|Aclame:pro 2 LTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISFP 81 (368) Q Consensus 2 ~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~~f~ 81 (368) ||+ |+..+|+++++++|+++++|.+++|+++.+++|++|.||.+++.+.++|+++|+.++..... +++++++|+ T Consensus 1 ~d~-----f~~~~L~~~i~~~~~~~~~l~d~~fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~-~~~~~~~~~ 74 (341) T protein:vir:34 1 MSM-----YTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRS-RGGSTSEFT 74 (341) T ss_pred CCC-----cCHHHHHHHHHhccCccchhHHhcCCcccccccceEEEEEeeCCeeEEEeecCCCCcceecc-CceeeeEEe Confidence 554 88899999999999999999999888888999999999999999999999999998876554 557889999 Q ss_pred cccccccccccHHHHhcccCCC-----CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE-EccCCC--EEec Q lcl|Aclame:pro 82 MMYFKEVESITPDEIQGVRQPG-----TANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKV-VDARGT--LYAD 152 (368) Q Consensus 82 ~p~~~~~~~i~a~dlq~~R~~G-----~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i-~d~~G~--~~~d 152 (368) +|||++++.|+|+|++ .|.+| ..+..++..+.+.+++.+|+++|++|+||||+|||+ |+| ++++|. 
+.+| T Consensus 75 ~p~i~~~~~i~~~d~~-~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vD 153 (341) T protein:vir:34 75 PGYVKPKHEVNPQMTL-RRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVD 153 (341) T ss_pred cCccCccceeCHHHHH-HHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEE Confidence 9999999999999999 47776 335678888899999999999999999999999995 988 455653 4566 Q ss_pred cHhhcCCCc-ceeEE----ecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhh Q lcl|Aclame:pro 153 LYKQFDVEK-KTIYF----DLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQ 227 (368) Q Consensus 153 ~~~~fG~~~-~~~~~----~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~ 227 (368) ||+.. +++++ .|+++++++...+.++.+.+++ .+ .++.+++||+++|++|+.|++|+++|.++. T Consensus 154 ----fg~~~~~~~~~t~~~~W~~~~~~~~d~l~di~~~~~~-~g------~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~ 222 (341) T protein:vir:34 154 ----MGRSEENNITQSGGTEWSKRDKSTYDPTDDIEAYALN-AS------GVVNIIVFDPKGWALFRSFKAVKEKLDTRR 222 (341) T ss_pred ----eCCCCccceEecCCccCCcCCCchHHHHHHHHHHHHh-cC------CceEEEEeCHHHHHHHhcCHHHHHHHhhcc Confidence 56643 23333 2455555555556665554432 21 246789999999999999999999998654 Q ss_pred hhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhh Q lcl|Aclame:pro 228 TPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIF 307 (368) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f 307 (368) ....... .............+.++|++|+.|+++|.+ +|+.++++|+ +++.++|. +.. . 
T Consensus 223 ~~~~~~~------~~~~~~~~~~~~~~~~~g~~i~~y~~~y~d-dG~~~~~ip~-------~~v~l~p~-----g~~--g 281 (341) T protein:vir:34 223 GSNSELE------TAVKDLGKAVSYKGMYGDVAIVVYSGQYVE-NGVKKNFLPD-------NTMVLGNT-----QAR--G 281 (341) T ss_pred ccccccc------ccccccccceeeeeecCCceEEEEcCEEEE-CCcEEeeecC-------CeEEEeeC-----CCc--c Confidence 3322111 000011112223346899999999999986 6877776654 55666664 222 3 Q ss_pred heeeccccchhhccc--ccceeeEeeee-ccCCCeeEEEeeecccccccCCceEEEEEEe Q lcl|Aclame:pro 308 EVAYGPCPKMGYANT--LGQELYVFEYE-KDRDEGIDFEAHSYMLPYCTRPQLLVDVRAD 364 (368) Q Consensus 308 ~~~~apa~~~~~~n~--~~~~~y~~~~~-~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~ 364 (368) .++|||..+.+..+. ...++|+++|. +++|+++++++||+|||+|.||++++++||+ T Consensus 282 ~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 282 LRTYGCIQDADAQREGINASARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred eEEEeecccccccccceeeeeEeeeeeeecCCCcEEEEEEcccceeeeeCCCcEEEEEeC Confidence 578887766554433 34689999985 4589999999999999999999999999999 No 5 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=100.00 E-value=8.7e-59 Score=338.89 Aligned_cols=323 Identities=12% Similarity=0.051 Sum_probs=244.3 Q ss_pred cccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEEEEcCceeeeeccCCCCCccccccCCceeEEEEe Q lcl|Aclame:pro 2 LTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQISFP 81 (368) Q Consensus 2 ~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~~f~ 81 (368) ||+ |++.+|+++|+++|.++++|.+++|.++..++|.+|.+|.+++.+.++|+++|+.++.... 
++++++++|+ T Consensus 1 ~d~-----f~~~~L~~~i~~~~~~~~~l~~~~Fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~-~~~~~~~~~~ 74 (341) T protein:vir:39 1 MSV-----YTTAQLLAVNEKKFKFDPLFLRIFFRETYPFSTEKVYLSQIPGLVNMALYVSPIVSGKVIR-SRGGSTSEFT 74 (341) T ss_pred CCc-----cCHHHHHHHHHhhcCccchhHhhcCCcccccCcceEEEEEecCCceeeEEecCCCCcceec-ccceeeeeEe Confidence 554 8889999999999999999999955566788899999999999999999999999886654 4557889999 Q ss_pred cccccccccccHHHHhcccCCC-----CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCcE-EccCCC--EEec Q lcl|Aclame:pro 82 MMYFKEVESITPDEIQGVRQPG-----TANELTTEAVVRAKKLMKIRTKFDITREFLFMQAL-KGKV-VDARGT--LYAD 152 (368) Q Consensus 82 ~p~~~~~~~i~a~dlq~~R~~G-----~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL-~G~i-~d~~G~--~~~d 152 (368) +|||++++.|+|+|++. |.+| ..++.++..+.+.+++.+|+++|++|+||||+||| .|+| ++++|. +.+| T Consensus 75 ~p~i~~~~~i~~~d~~~-r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vD 153 (341) T protein:vir:39 75 PGYVKPKHEVNPLMTLR-RLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVD 153 (341) T ss_pred ccccCcccccCHHHHHH-HhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEe Confidence 99999999999999984 5554 44677778888999999999999999999999999 5998 577763 4566 Q ss_pred cHhhcCCCcc-eeE----EecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhh Q lcl|Aclame:pro 153 LYKQFDVEKK-TIY----FDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQ 227 (368) Q Consensus 153 ~~~~fG~~~~-~~~----~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~ 227 (368) | |+... .++ ..|+++++++...++++.+.++ ..+ .++.+++||+++|++|+.|++|+++|+++. 
T Consensus 154 f----g~~~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~-~~g------~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~ 222 (341) T protein:vir:39 154 M----GRSAGNNIVQAGAAAWSSRDKETYDPTDDIEAYAL-NAS------GVVNIIVFDPKGWALFRSFKAVKEKLDTRR 222 (341) T ss_pred c----cCCccceeEecCCccCCCCCCchHHHHHHHHHHHH-hcC------CceEEEEeChHHHHHHhcCHHHHHHHhhcc Confidence 4 55432 222 2355555556666666544443 222 245789999999999999999999998754 Q ss_pred hhhhhh-hhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhh Q lcl|Aclame:pro 228 TPLAWQ-QITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNI 306 (368) Q Consensus 228 ~~~~~~-~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~ 306 (368) ...... ..+.++.. .......++|++|+.|+++|.+ +|..+.++|+ +++.++|.+ .. T Consensus 223 ~~~~~~~~~~~~~~~-------~~~~~~~~~g~~i~~y~~~y~d-~g~~~~~ip~-------~~~~l~p~~-----~~-- 280 (341) T protein:vir:39 223 GSNSELETALKDLGK-------AVSYKGMYGDVAIVVYSGQYIE-NDVKKNYLPD-------LTMVLGNTQ-----AR-- 280 (341) T ss_pred cccccccchhhhhhh-------HhhhhhhhcCceEEEEccEEEe-cCcEEeeecC-------CeEEEeeCC-----Cc-- Confidence 332211 11111111 1111235789999999999987 5666666654 455556642 22 Q ss_pred hheeeccccchhhcc--cccceeeEeeeecc-CCCeeEEEeeecccccccCCceEEEEEEe Q lcl|Aclame:pro 307 FEVAYGPCPKMGYAN--TLGQELYVFEYEKD-RDEGIDFEAHSYMLPYCTRPQLLVDVRAD 364 (368) Q Consensus 307 f~~~~apa~~~~~~n--~~~~~~y~~~~~~~-~~~~~~l~~eS~pLpv~~rP~al~~~t~~ 364 (368) ..++|||..+++... 
..+.++|+++|..+ +|+++++++||+|||+|+||++++++||+ T Consensus 281 g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 281 GLRTYGCILDADAQREGINASTRYPKNWVQTGDPAREFTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred ceEEEecccchhhcccceeeeeeeeeeeeecCCCcEEEEEEeccccceeeCCCcEEEEEeC Confidence 357788866655443 35678999998665 89999999999999999999999999999 No 6 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=100.00 E-value=7.3e-51 Score=295.45 Aligned_cols=329 Identities=13% Similarity=0.110 Sum_probs=246.9 Q ss_pred CcccccCCcccHHHHHHHHHhcCCc-ccchhhcccccccccccce-EEEEEEcCceeeeeccCCCCCccccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNT-YGYISNLGLFRSAPITQTT-FLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~-~~~l~~l~~F~~~~~~t~~-i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~ 78 (368) |-.|+ |.|+..+|++.|+.+|.+ ..+|.+. ||+.++..+.+ +.++..++...++|++++++++. +..+++.+.. T Consensus 1 M~~i~--d~f~~~~l~~~i~~~~~~~~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~~r~~~~~~ 76 (348) T protein:vir:96 1 MGLIY--DKVTASNIAGYFNTLQENVDSTLGES-IFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVT-IRDRVSAEIH 76 (348) T ss_pred Ccchh--hccCHHHHHHHHHhcccchhhhhhhh-cCCCccccceeEEEEeecCCceeEeeeecCCCCcc-eecccceeee Confidence 77764 469999999999999864 4677775 77776555444 44455666777799999998875 5556667889 Q ss_pred EEecccccccccccHHHHhcccC---CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEccCCCEEeccH Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQ---PGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVVDARGTLYADLY 154 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~---~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~d~~G~~~~d~~ 154 (368) +|++||++++..++++|++.++. .|+....+++.+.+++++.+|++.|++|.||||+|||+ |+|...+++. ++. 
T Consensus 77 ~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~--~~~ 154 (348) T protein:vir:96 77 DEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGV--NKD 154 (348) T ss_pred eeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCe--eEE Confidence 99999999999999999876643 45556668888899999999999999999999999996 9886543322 333 Q ss_pred hhcCCCcc---eeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhh Q lcl|Aclame:pro 155 KQFDVEKK---TIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLA 231 (368) Q Consensus 155 ~~fG~~~~---~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~ 231 (368) .+||++.. +.+.+|+++++||.++|++|.+++++. | .++.+++||+++|++|+.|++|+++++++..... T Consensus 155 vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~---G----~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~ 227 (348) T protein:vir:96 155 IDYGVKADHKKQVSKSWAEPGATPLADLEDAIETAREL---G----LNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGS 227 (348) T ss_pred EeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhc---C----CcccEEEeCHHHHHHHhcCHHHHHHHhccCCccc Confidence 45787542 345678899999999999999988652 2 2445889999999999999999999986643322 Q ss_pred hhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheee Q lcl|Aclame:pro 232 WQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAY 311 (368) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~ 311 (368) ... ..++. ... -.++|++|+.|+++|.+.+|+.+.++|++.+.+ +|. 
+ ....++| T Consensus 228 ~~~-~~~~~--------~~~--~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l-------~~~-----~--~~G~~~y 282 (348) T protein:vir:96 228 SVT-KAELQ--------NYV--ADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTL-------IPN-----G--PLGNTVF 282 (348) T ss_pred ccc-HHHHH--------HHH--hhhcCceEEEEccEEEecCCcEeccccCCeEEE-------EcC-----C--CceeEEe Confidence 110 00110 011 145799999999999999999998887765443 332 1 1235778 Q ss_pred ccc-c--chhh-------cccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCC Q lcl|Aclame:pro 312 GPC-P--KMGY-------ANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKG 367 (368) Q Consensus 312 apa-~--~~~~-------~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~ 367 (368) ||. + .... ++..+..+|.+.|.+++|+++++++||+|||++.+|++++.+||-++= T Consensus 283 g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 283 GTTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred ccChhhhhhhhcccccccceecCCeeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 874 1 1111 122233488999999999999999999999999999999999999999 No 7 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=100.00 E-value=2e-49 Score=287.52 Aligned_cols=328 Identities=15% Similarity=0.150 Sum_probs=248.2 Q ss_pred CcccccCCcccHHHHHHHHHhcCCc-ccchhhccccccc-ccccceEEEEEEcCceeeeeccCCCCCccccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNT-YGYISNLGLFRSA-PITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~-~~~l~~l~~F~~~-~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~ 78 (368) |-.|| |.|+..+|+..|+.+|.+ ..+|.++ ||+.+ ..+++.+.++..++...++|++++++++. +..++..+.. 
T Consensus 1 M~~l~--d~f~~~~l~~~v~~~~~~~~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~~r~~~~~~ 76 (348) T protein:vir:49 1 MGLIY--DKVTASNIAGYFNALQENVDSTLGES-IFPARKQLGTKLSYITGASGQSVALKAAAFDTNVT-VRDRVSAEMH 76 (348) T ss_pred Ccchh--hhcCHHHHHHHHHhccccchhhhHhh-cCCCccccCceeEEEEeecCceeeeeeecCCCCcc-eecccceeee Confidence 88875 559999999999999854 5677776 56654 55577788899999999999999998875 4556667888 Q ss_pred EEecccccccccccHHHHhcccC---CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE-EccCCCEEecc Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQ---PGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKV-VDARGTLYADL 153 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~---~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i-~d~~G~~~~d~ 153 (368) ++++||++++..+++.|+++++. .++....+.+.+.+++++.+|++.|++|.||||+|||+ |+| ++.+|. ++ T Consensus 77 ~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~---~~ 153 (348) T protein:vir:49 77 DEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGV---NK 153 (348) T ss_pred eeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCc---eE Confidence 99999999999999999776644 35545556677889999999999999999999999996 988 455542 23 Q ss_pred HhhcCCCcc---eeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhh Q lcl|Aclame:pro 154 YKQFDVEKK---TIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPL 230 (368) Q Consensus 154 ~~~fG~~~~---~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~ 230 (368) ..+||+... +.+.+|+++++||.++|++|.+++++ . |. ++-+++||+++|++|+.|++|++++..+.... 
T Consensus 154 ~vdyg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~-~--G~----~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~ 226 (348) T protein:vir:49 154 DIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARE-L--GL----NPERAVMNAKTFGLIRKAASTVKVIKPLAGDG 226 (348) T ss_pred EEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHh-c--CC----cccEEEeCHHHHHHHhcCHHHHHHhhccCccc Confidence 335787532 34567899999999999999998865 2 21 34578999999999999999999987654332 Q ss_pred hhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhhee Q lcl|Aclame:pro 231 AWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVA 310 (368) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~ 310 (368) ... ...+++. ... .++|++|+.|+++|.+.+|+.+.++|++.+.+ +|. +. ...++ T Consensus 227 ~~i-~~~~~~~--------~~~--~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l-------~~~-----~~--~G~~~ 281 (348) T protein:vir:49 227 SSV-TKAELDN--------YIA--DNFGVTVVLENGTYRNEKGEVSKFFPDGHLTL-------IPN-----GP--LGNTV 281 (348) T ss_pred ccc-cHHHHHH--------HHH--hhcCceEEEEeeEEEecCCcEeeeecCCeEEE-------ecC-----CC--cceeE Confidence 110 0001111 111 35799999999999999999998888765443 332 11 23577 Q ss_pred eccccc---hhhcccc-------cceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCC Q lcl|Aclame:pro 311 YGPCPK---MGYANTL-------GQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKG 367 (368) Q Consensus 311 ~apa~~---~~~~n~~-------~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~ 367 (368) |||.-. ....++. 
+-.+|.+.|.+++|+++++++||+|||++.+|++++.+||-++= T Consensus 282 yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 282 FGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EecChhhhhhccccccccceeecCCeEEEeeeecCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 877421 1111211 22388999999999999999999999999999999999999998 No 8 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=100.00 E-value=9e-49 Score=283.98 Aligned_cols=327 Identities=15% Similarity=0.138 Sum_probs=244.1 Q ss_pred CcccccCCcccHHHHHHHHHhcCCc-ccchhhcccccccccccce-EEEEEEcCceeeeeccCCCCCccccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNT-YGYISNLGLFRSAPITQTT-FLMDLTDWDVSLLDAVDRDSRKAETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~-~~~l~~l~~F~~~~~~t~~-i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~ 78 (368) |-+|+ |.|+..+|++.|+++|.+ ..+|.+. ||+.+...+.+ +.++..++...++|++++++++. +.+++..+.. T Consensus 1 M~~i~--d~f~~~~l~~~v~~~~~~~~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~~r~~~~~~ 76 (348) T protein:vir:27 1 MGLIY--DKVTASNIAGYFNALQENVSSTLGES-IFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVT-IRDRVSAEMH 76 (348) T ss_pred Ccchh--hhcCHHHHHHHHHhccchhhhhhHhh-cCCCccccceeEEEEeeccCceeEeeeecCCCCcc-eecccceeee Confidence 88774 679999999999999865 4677776 67766555444 44466666777799999998874 5566667888 Q ss_pred EEecccccccccccHHHHhcc---cCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEE-ccCC-CEEec Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGV---RQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVV-DARG-TLYAD 152 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~---R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~-d~~G-~~~~d 152 (368) +|++||++++..|+++|++++ |..++..+.+++.+.+.+++.+|++.|++|.||||+|||+ |++. 
+.+| ...+| T Consensus 77 ~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vd 156 (348) T protein:vir:27 77 DEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDID 156 (348) T ss_pred eeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEe Confidence 999999999999999998765 5555555667778889999999999999999999999996 9885 4444 23344 Q ss_pred cHhhcCCCcc---eeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhh Q lcl|Aclame:pro 153 LYKQFDVEKK---TIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTP 229 (368) Q Consensus 153 ~~~~fG~~~~---~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~ 229 (368) ||+... +.+.+|+++++||.++|++|.+++++ . | .++.+++||+++|++|+.|++|++++.+.... T Consensus 157 ----fg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~-~--G----~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~ 225 (348) T protein:vir:27 157 ----YGVKPDHKKQVSKSWAEPGATPLADLEDAIETARE-L--G----LNPERAVMNAKTFGLIRKAASTVKVIKPLAGD 225 (348) T ss_pred ----ecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHh-c--C----CcccEEEECHHHHHHHhcCHHHHHHhcccCcc Confidence 677532 33467899999999999999998864 2 2 24457899999999999999999999765432 Q ss_pred hhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhhe Q lcl|Aclame:pro 230 LAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEV 309 (368) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~ 309 (368) ...... .+++ ... -.++|+.++.|+++|.+.+|+.+.++|++.+++ +|. 
+ ....+ T Consensus 226 ~~~i~~-~~~~--------~~~--~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl-------~~~-----~--~~G~~ 280 (348) T protein:vir:27 226 GSAVTK-AELE--------NYI--ADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTL-------IPN-----G--PLGNT 280 (348) T ss_pred ccccCH-HHHH--------HHH--HhhcCceEEEEeeEEEcCCCcCcccccCCeEEE-------EcC-----C--cceeE Confidence 211100 0000 011 145799999999999999999998888765443 332 1 12345 Q ss_pred eeccc-cchh--hccc-------ccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCC Q lcl|Aclame:pro 310 AYGPC-PKMG--YANT-------LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKG 367 (368) Q Consensus 310 ~~apa-~~~~--~~n~-------~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~ 367 (368) +|||. +-.+ ..+. .+..+|.+.|.+++|.++++++||.|||++.+|++++.+||-++= T Consensus 281 ~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 281 VFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EeccCcchhhhhhccccccceeeeCCeeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 67753 2221 1111 223388999999999999999999999999999999999999888 No 9 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=100.00 E-value=1.4e-44 Score=261.08 Aligned_cols=315 Identities=15% Similarity=0.195 Sum_probs=229.3 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceE-EEEEEcCceeeeeccCCCCCccccccCCceeEEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTF-LMDLTDWDVSLLDAVDRDSRKAETSAPERVRQIS 79 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i-~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~~~~ 79 (368) |+|+ |+...|++.++++|. +.+|.++ ||+.+.+...++ .++..++...++|++++++++. ..+++. .... 
T Consensus 17 ~~d~-----~~~~~l~~~~~~~~~-~~~l~~~-~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~~r~~-~~~~ 87 (349) T protein:vir:10 17 ILDM-----FSQNTVLDYTRNRQY-PEMLGDT-LFPAVKVPTLEVDILKAGSRVPTIASVSAFDAEAE-IGTREA-SKMT 87 (349) T ss_pred hhcc-----cCHHHHHHHHHhcCc-chhhHhh-cCCccccccceeEEEeeccCcceeeeeecCCCCcc-eecccc-eeEE Confidence 5554 667889999999997 4678887 566655544444 4455667778899999988765 455554 4558 Q ss_pred EecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEccCCCEEeccHhhcC Q lcl|Aclame:pro 80 FPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVVDARGTLYADLYKQFD 158 (368) Q Consensus 80 f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~d~~G~~~~d~~~~fG 158 (368) +++|+++++..+++.|++.+|.+++..+.+.+.+.+++.+.+|++.|++|+||||+|||+ |+|...++++.+| || T Consensus 88 ~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~g~~vD----~g 163 (349) T protein:vir:10 88 AELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKNGIAID----YG 163 (349) T ss_pred eeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCCcEEEe----cc Confidence 999999999999999999999999888888889999999999999999999999999996 9987665556666 67 Q ss_pred CCcc-eeE----EecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 159 VEKK-TIY----FDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQ 233 (368) Q Consensus 159 ~~~~-~~~----~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~ 233 (368) +... .++ -.|+++++||.++|++|.+.+ + ..+-++++|+++|++|+.|++|++++.+....... 
T Consensus 164 ~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~~~----g------~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~- 232 (349) T protein:vir:10 164 VPKKHQETLSGTKTWDKSDASIIDNLQDWSDSL----D------VTPTRALTSKKVLRILMRSTEIKEAIFGKDTGRVV- 232 (349) T ss_pred cCccceeEecCcccCCCCCCCHHHHHHHHHHHh----C------CCccEEEeCHHHHHHHhcCHHHHHHhccccccccc- Confidence 7542 233 347788999999999886543 2 13457899999999999999999998754322110 Q ss_pred hhhcccccccccccccccceeeeCCEEEEEccccccCCCcc----cccccccccceecCCceeeeeeeeccccchhhhhe Q lcl|Aclame:pro 234 QITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGK----VHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEV 309 (368) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~----~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~ 309 (368) ...++. .... .++|++++.|+++|.+.+|+ .+.++|++.++ ++|. + .+..+ T Consensus 233 -~~~~~~--------~~l~--~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~-------l~~~-----~--~~G~~ 287 (349) T protein:vir:10 233 -GQADLD--------QWMT--AQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIV-------LFND-----E--VPGQK 287 (349) T ss_pred -CHHHHH--------HHHH--hcCCceEEEEeeEEEeecCCCceeecccccCCeEE-------EecC-----C--CceeE Confidence 000000 0111 46899999999999887774 44566655433 3332 1 23467 Q ss_pred eeccccch---hhccc---ccceeeEee-eeccCCCeeEEEeeecccccccCCceEEEEEEe Q lcl|Aclame:pro 310 AYGPCPKM---GYANT---LGQELYVFE-YEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRAD 364 (368) Q Consensus 310 ~~apa~~~---~~~n~---~~~~~y~~~-~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~ 364 (368) +|||.... ...+. ...+++.+. 
+.+++|.+.++.+||+|||++.+|++++.+||- T Consensus 288 ~yG~~~e~~~~~~g~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 288 IYGPTPEENRLISSNAQVSNVGNIMAKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred EeeccchhhhhcccccceeeccceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 88885322 21111 112345554 567899999999999999999999999999999 No 10 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=100.00 E-value=5.7e-43 Score=252.20 Aligned_cols=330 Identities=9% Similarity=0.022 Sum_probs=236.2 Q ss_pred CcccccCCcccHHHHHHHHHhcC---CcccchhhcccccccccccceEEEEEEc---CceeeeeccCCCCCccccccCCc Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIP---NTYGYISNLGLFRSAPITQTTFLMDLTD---WDVSLLDAVDRDSRKAETSAPER 74 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p---~~~~~l~~l~~F~~~~~~t~~i~id~~~---~~~~lvp~v~rg~~~~~~~~~~~ 74 (368) |.-+.+.|.|+..+|++.|+..| ..+++|.+. ||+.+. +..+.++..+ +....+++++++++.. +.+++. T Consensus 1 M~~~~~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~-~fp~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~r~g 76 (348) T protein:vir:98 1 MSWTLDTEFIEPTQLTGLIREALRDLQVNRFRLAR-WLPNVD--VDDITFEFLRGGGGLAETASYRSWDTESK-IGRREG 76 (348) T ss_pred CcchhhhhccCHHHHHHHHHHHhhccCcchhhHHh-cCCCcc--ccceEEEEEeccCCceeeeeeecCCCccc-eeeccc Confidence 88878899999999999999886 456788887 666654 4445565543 3345678999887764 555555 Q ss_pred eeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEE-ccCCCEEec Q lcl|Aclame:pro 75 VRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVV-DARGTLYAD 152 (368) Q Consensus 75 ~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~-d~~G~~~~d 152 (368) .+...+++|+++++..++++|++.+|. ...+.+.+.+.+.+.+|++.+++|.||||+|||+ |+|. 
++.+ ..+| T Consensus 77 ~~~~~~~~~~i~~~~~i~~~d~~~~~~----~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~-~~vD 151 (348) T protein:vir:98 77 LAKVMGELPPISEKIPLNEYDRLRLRK----LSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQ-QTVD 151 (348) T ss_pred ceeeeeeccccccccccCHHHHHHhcC----ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCc-eEEc Confidence 677789999999999999999987763 3456778889999999999999999999999996 8884 4333 3344 Q ss_pred cHhhcCCCcc---eeEEecC-CCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhh Q lcl|Aclame:pro 153 LYKQFDVEKK---TIYFDLD-NPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQT 228 (368) Q Consensus 153 ~~~~fG~~~~---~~~~~l~-~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~ 228 (368) ||++.. +.+..|+ .+++||.++|++|.+.+++..+ ..+.++++|+++|++|+.|++|++.+.++.. T Consensus 152 ----yg~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G------~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~ 221 (348) T protein:vir:98 152 ----FGRIGSHSVVAAVLWSVHATATPISDLESWVATYEDTNG------QSPGVILMPKAAVSHMRQCEEVIRQVFPLAP 221 (348) T ss_pred ----cccCcccccccccccCCCCCCCHHHHHHHHHHHHHHccC------CcceEEEeCHHHHHHHhcCHHHHHHHhccCc Confidence 677543 2345665 4678999999999999986532 2456889999999999999999999876533 Q ss_pred hhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecC-Cceeeeeeeeccccchhhh Q lcl|Aclame:pro 229 PLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTV-GVGHAFPNVAMLGEANNIF 307 (368) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~-~~~~~~p~~~~~~~~~~~f 307 (368) ..... . +. .... ..+-. .+|+..++.|+.+|.+ +|+.+.++|++.+.+.. +...... 
+...+- T Consensus 222 ~~~~~--~--~~---~~~~-~~~~~-~~g~~~i~~~d~~~~~-~g~~~~~~p~~~i~l~p~~~~~~~~------~~~~~G 285 (348) T protein:vir:98 222 SGTAP--M--VS---VEQL-NTVLS-SMGLPPIEVYDAKVAV-DGVSTRITPANAIALLPEPGATDAA------QPTELG 285 (348) T ss_pred ccccc--c--cC---HHHH-HHHHH-hhCCeEEEEeeeEEEc-CCceeceecCCeEEEEecCCccccc------cccccc Confidence 21100 0 00 0000 01111 3578899999988876 67888888877655422 2111111 112223 Q ss_pred heeeccccchhhcc---cc--cceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEec Q lcl|Aclame:pro 308 EVAYGPCPKMGYAN---TL--GQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA 365 (368) Q Consensus 308 ~~~~apa~~~~~~n---~~--~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a 365 (368) .++|||.......+ .. .-.+|.++|.+++|+++++++||+|||++.+|++++.+||-| T Consensus 286 ~t~~G~~~e~~~~~~~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 286 ATLLGTTAESLEDDYALAPGEQPGIVAATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred ceecccchhhhccccccceeccCceeeeeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 57788743222111 11 112799999999999999999999999999999999999999 No 11 >protein:vir:78006 Length: 409 # NCBI annotation: major head protein # Family: family:all:11999 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467942;genbank:gi:157265383;genbank:GeneID:5600496 Probab=99.96 E-value=2.7e-31 Score=188.17 Aligned_cols=346 Identities=11% Similarity=0.025 Sum_probs=210.6 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEE--EEcCceeeee--ccCCCCCccccccCCc-- Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMD--LTDWDVSLLD--AVDRDSRKAETSAPER-- 74 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id--~~~~~~~lvp--~v~rg~~~~~~~~~~~-- 74 (368) |--+-.-+-+++..+.+.+.++.++.++|.+. ||+....-.+++.++ ..++..++.. ..+.+..+..+..+.. 
T Consensus 20 ~~~~~~~~~~~~~~~ia~~~~~~p~~~~L~d~-~FP~~~~f~t~l~~~~~~~kg~kk~~~~~~~~~~d~~~pv~~r~~~~ 98 (409) T protein:vir:78 20 IGGLKFPTTKEIQEAVAAIADKFNQENDLVDR-FFPEDSTFASELELYLLRTQDAEQTGMTFVHQVGSTSLPVEARVAKV 98 (409) T ss_pred hcceecCchHHHHHHHHHHHHhcCCccchhhc-cCCCCccccceEEEEeeeccCcccccceEeeecCCccccccccceee Confidence 33333345677777766666665555667786 566544433345554 4455444432 2233434433333321 Q ss_pred -eeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEc-cCC---C Q lcl|Aclame:pro 75 -VRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVVD-ARG---T 148 (368) Q Consensus 75 -~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~d-~~G---~ 148 (368) .+..++++|||+++..|++.|++.+++.++......+.+.+.+.+.+|.+++.+|.||||+|+|+ |+|.. .++ . T Consensus 99 ~~~~~t~epp~iK~k~~i~e~dl~~~~~~~n~~~~~~i~~~i~~D~~~L~~~I~~R~E~Ma~q~L~tGki~i~g~~~~~~ 178 (409) T protein:vir:78 99 DLAKATWSPLAFKESRVWDEKEILYLGRLADEVQAGVINEQIAESLTWLMARMRNRRRWLTWQVMRTGRITIQPNDPYNP 178 (409) T ss_pred eeeeecccccccccccccCHHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEEecCCCcc Confidence 24568999999999999999999877766655666777889999999999999999999999996 98852 221 1 Q ss_pred EEeccHhhcCCCcc-ee----EEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHH-hcCHHHHHH Q lcl|Aclame:pro 149 LYADLYKQFDVEKK-TI----YFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKL-TKHPKIRDA 222 (368) Q Consensus 149 ~~~d~~~~fG~~~~-~~----~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l-~~h~~v~~~ 222 (368) ...++-.+||++.. 
.+ +-.|+++++||.++|++|.+++++..+.+ .++-.+++++..|++| ..|+.|+++ T Consensus 179 ~g~~~~vDyg~pa~hkvtlTgt~~W~~~~AdPi~DIe~w~~~i~~~~g~~----~t~~~~imt~~~~~~l~~~n~~ik~~ 254 (409) T protein:vir:78 179 NGLKYVIDYGVTDIELPLPQKFDAKDGNGNSAVDPIQYFRDLIKAATYFP----DRRPVAIIVGPGFDEVLADNTFVQKY 254 (409) T ss_pred ccceEEEecCCCcccceeecccccCCCCCCChHHHHHHHHHHHHHhcCCC----CCccEEEEcHHHHHHHHhCcHHHHHh Confidence 12223334788542 22 33578889999999999999998653311 1333456666666654 567778887 Q ss_pred HHHhhhhhhhhhhhcccccccccccccccce-eeeCCEEEEEccccccCCCcccccccccccce-ecCCceeeeeeeecc Q lcl|Aclame:pro 223 YLAQQTPLAWQQITGSLRTGGADGVQAHMNT-FYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVA-DTVGVGHAFPNVAML 300 (368) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~-i~~~~~~~~p~~~~~ 300 (368) +..........+......... ....... ....|+.++.|+++|.+.+|+.++++|++.+. ++++.+. T Consensus 255 l~~~~~~~~~~~~~~~~~~l~---~~~~ln~~~~~~GL~I~vYd~~Y~dedGt~k~~~Pd~~vvLl~ap~g~-------- 323 (409) T protein:vir:78 255 VEYEKGWVVGQNTVQPPREVY---RQAALDIFKRYTGLEVMVYDKTYRDQDGSVKYWIPVGELIVLNQSTGP-------- 323 (409) T ss_pred hhcccccccccccccchhhhc---chhHhHhhhhhcCceEEEEeeEEEecCCcccceecCCeEEEEcCCccc-------- Confidence 765433222111000000000 0000011 12347999999999999999999999988764 4332211 Q ss_pred ccchhhhheeeccc--c--chhhcccccceee-EeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 301 GEANNIFEVAYGPC--P--KMGYANTLGQELY-VFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 301 ~~~~~~f~~~~apa--~--~~~~~n~~~~~~y-~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) +=.++||+. . ....++. .-+.. 
.+.|..++|...+......-||+...+|.=-..-..++-- T Consensus 324 -----LG~T~yGa~~~~~~~~~~v~~-~g~~i~~~~~~~~dP~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~ 390 (409) T protein:vir:78 324 -----VGRFVYTAHVAGQRNGKVVYA-TGPYLTVKDHLQDDPPYYAIIAGFHGLPQLSGYNTEDFSFHRFKWL 390 (409) T ss_pred -----ccceecccccccccchhhhcc-ccceeEecccccCCcceeeeecceEEeeeeecCCccceeehhhhhh Confidence 114677762 1 1122221 22333 3567889999999999999999998776532222222111 No 12 >protein:vir:79503 Length: 409 # NCBI annotation: major head protein # Family: family:all:11999 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468058;genbank:gi:157265500;genbank:GeneID:5600620 Probab=99.96 E-value=2.7e-31 Score=188.17 Aligned_cols=346 Identities=11% Similarity=0.025 Sum_probs=210.6 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEE--EEcCceeeee--ccCCCCCccccccCCc-- Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMD--LTDWDVSLLD--AVDRDSRKAETSAPER-- 74 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id--~~~~~~~lvp--~v~rg~~~~~~~~~~~-- 74 (368) |--+-.-+-+++..+.+.+.++.++.++|.+. ||+....-.+++.++ ..++..++.. ..+.+..+..+..+.. T Consensus 20 ~~~~~~~~~~~~~~~ia~~~~~~p~~~~L~d~-~FP~~~~f~t~l~~~~~~~kg~kk~~~~~~~~~~d~~~pv~~r~~~~ 98 (409) T protein:vir:79 20 IGGLKFPTTKEIQEAVAAIADKFNQENDLVDR-FFPEDSTFASELELYLLRTQDAEQTGMTFVHQVGSTSLPVEARVAKV 98 (409) T ss_pred hcceecCchHHHHHHHHHHHHhcCCccchhhc-cCCCCccccceEEEEeeeccCcccccceEeeecCCccccccccceee Confidence 33333345677777766666665555667786 566544433345554 4455444432 2233434433333321 Q ss_pred -eeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEc-cCC---C Q lcl|Aclame:pro 75 -VRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVVD-ARG---T 148 (368) Q Consensus 75 -~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~d-~~G---~ 148 (368) .+..++++|||+++..|++.|++.+++.++......+.+.+.+.+.+|.+++.+|.||||+|+|+ |+|.. .++ . 
T Consensus 99 ~~~~~t~epp~iK~k~~i~e~dl~~~~~~~n~~~~~~i~~~i~~D~~~L~~~I~~R~E~Ma~q~L~tGki~i~g~~~~~~ 178 (409) T protein:vir:79 99 DLAKATWSPLAFKESRVWDEKEILYLGRLADEVQAGVINEQIAESLTWLMARMRNRRRWLTWQVMRTGRITIQPNDPYNP 178 (409) T ss_pred eeeeecccccccccccccCHHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEEecCCCcc Confidence 24568999999999999999999877766655666777889999999999999999999999996 98852 221 1 Q ss_pred EEeccHhhcCCCcc-ee----EEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHH-hcCHHHHHH Q lcl|Aclame:pro 149 LYADLYKQFDVEKK-TI----YFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKL-TKHPKIRDA 222 (368) Q Consensus 149 ~~~d~~~~fG~~~~-~~----~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l-~~h~~v~~~ 222 (368) ...++-.+||++.. .+ +-.|+++++||.++|++|.+++++..+.+ .++-.+++++..|++| ..|+.|+++ T Consensus 179 ~g~~~~vDyg~pa~hkvtlTgt~~W~~~~AdPi~DIe~w~~~i~~~~g~~----~t~~~~imt~~~~~~l~~~n~~ik~~ 254 (409) T protein:vir:79 179 NGLKYVIDYGVTDIELPLPQKFDAKDGNGNSAVDPIQYFRDLIKAATYFP----DRRPVAIIVGPGFDEVLADNTFVQKY 254 (409) T ss_pred ccceEEEecCCCcccceeecccccCCCCCCChHHHHHHHHHHHHHhcCCC----CCccEEEEcHHHHHHHHhCcHHHHHh Confidence 12223334788542 22 33578889999999999999998653311 1333456666666654 567778887 Q ss_pred HHHhhhhhhhhhhhcccccccccccccccce-eeeCCEEEEEccccccCCCcccccccccccce-ecCCceeeeeeeecc Q lcl|Aclame:pro 223 YLAQQTPLAWQQITGSLRTGGADGVQAHMNT-FYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVA-DTVGVGHAFPNVAML 300 (368) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~-i~~~~~~~~p~~~~~ 300 (368) +..........+......... ....... ....|+.++.|+++|.+.+|+.++++|++.+. ++++.+. 
T Consensus 255 l~~~~~~~~~~~~~~~~~~l~---~~~~ln~~~~~~GL~I~vYd~~Y~dedGt~k~~~Pd~~vvLl~ap~g~-------- 323 (409) T protein:vir:79 255 VEYEKGWVVGQNTVQPPREVY---RQAALDIFKRYTGLEVMVYDKTYRDQDGSVKYWIPVGELIVLNQSTGP-------- 323 (409) T ss_pred hhcccccccccccccchhhhc---chhHhHhhhhhcCceEEEEeeEEEecCCcccceecCCeEEEEcCCccc-------- Confidence 765433222111000000000 0000011 12347999999999999999999999988764 4332211 Q ss_pred ccchhhhheeeccc--c--chhhcccccceee-EeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 301 GEANNIFEVAYGPC--P--KMGYANTLGQELY-VFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 301 ~~~~~~f~~~~apa--~--~~~~~n~~~~~~y-~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) +=.++||+. . ....++. .-+.. .+.|..++|...+......-||+...+|.=-..-..++-- T Consensus 324 -----LG~T~yGa~~~~~~~~~~v~~-~g~~i~~~~~~~~dP~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~ 390 (409) T protein:vir:79 324 -----VGRFVYTAHVAGQRNGKVVYA-TGPYLTVKDHLQDDPPYYAIIAGFHGLPQLSGYNTEDFSFHRFKWL 390 (409) T ss_pred -----ccceecccccccccchhhhcc-ccceeEecccccCCcceeeeecceEEeeeeecCCccceeehhhhhh Confidence 114677762 1 1122221 22333 3567889999999999999999998776532222222111 No 13 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=99.22 E-value=6.7e-13 Score=87.31 Aligned_cols=300 Identities=12% Similarity=0.106 Sum_probs=144.1 Q ss_pred CcccccCCcccHH-HHHHHHHhcCCcccchhhcccccccccccceEEEEE-EcCceeeeeccCC--CCCccccccCCcee Q lcl|Aclame:pro 1 MLTNSEKSRFFLA-DLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDL-TDWDVSLLDAVDR--DSRKAETSAPERVR 76 (368) Q Consensus 1 ~~d~f~~d~F~~~-~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~-~~~~~~lvp~v~r--g~~~~~~~~~~~~~ 76 (368) ||.. +..|-+. .||+.---.. .+.++++. +|+..++......+-. -+... .+|.+.| ++....+.. 
...+ T Consensus 1 m~~~--~~~~~~dp~LT~~A~gy~-n~~~Iad~-lfP~vpV~~~~~k~~~f~~e~f-~~~~t~ra~~~~~~~v~~-~~~~ 74 (307) T protein:vir:79 1 MGRL--SKLRIVDPVLTNLAIGYT-NAEFIGQT-LMPVVEVEKEGGKIPKFGKESF-RLYQTERALRAKSNRMNP-EDID 74 (307) T ss_pred CCCC--CCCcccCHHHHHHHhhcc-chhhhhhh-cCCcccccccccceeeeccccc-cccccccccCCCcceeee-eccc Confidence 7763 3345544 3555544444 46688885 8998877654443322 12222 2355544 222222211 1111 Q ss_pred EEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEE-ccCCCEEeccHh Q lcl|Aclame:pro 77 QISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVV-DARGTLYADLYK 155 (368) Q Consensus 77 ~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~-d~~G~~~~d~~~ 155 (368) ..++.+.-...... + ..|.-|.... ..-.+.+..+.+.|.+++||||+++++.... .++..+.+- T Consensus 75 ~~~~~~~~~~l~~~-----i-d~r~~~~~~~-----~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLs--- 140 (307) T protein:vir:79 75 SVDVNLDEHDLEYP-----I-DYREDQESAF-----PLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLS--- 140 (307) T ss_pred cccccccccchhhc-----c-cchhcCCCCC-----CHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEc--- Confidence 22222221111111 1 1244332111 1123445667889999999999999984332 222222111 Q ss_pred hcCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 156 QFDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQI 235 (368) Q Consensus 156 ~fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~ 235 (368) | +=.|+++++||...+++|++.|.+..+ ..+-++++|.++|++|+.||.|.+.+++...+.-.... 
T Consensus 141 --g------t~~Wsd~~sDPi~di~~~~~ai~~~~g------~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~~~ 206 (307) T protein:vir:79 141 --A------TEKFTAANSDPVGVIEDGKEAIRTKIG------RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDL 206 (307) T ss_pred --c------CcccCCCCCCcHHHHHHHHHHHHHhhC------CccceEEeCHHHHHHHhcCHHHHHHhcCccccccCHHH Confidence 1 113778999999999999999987543 36678999999999999999999999875432211110 Q ss_pred hcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeecccc Q lcl|Aclame:pro 236 TGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCP 315 (368) Q Consensus 236 ~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~ 315 (368) + ...| +.-.+.-+.++|.+.++..+.+.+.+..+.-.+... ..+..+++.--||.-- T Consensus 207 ---l-----------a~l~--~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~-------~~~~~~~~~ps~Gyt~ 263 (307) T protein:vir:79 207 ---L-----------KEIF--EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQR-------GGQQRTPYEPSYGYTL 263 (307) T ss_pred ---H-----------HHHh--CceeEEEeeeeeecccccchhcCCCceEEEeccccc-------CCCCCcccccccceeE Confidence 0 1112 222345556677777777776665443222111110 0122222222222211 Q ss_pred chhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCce--EEEEEEe Q lcl|Aclame:pro 316 KMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQL--LVDVRAD 364 (368) Q Consensus 316 ~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~a--l~~~t~~ 364 (368) ..+ +.++ .-. 
..+.++++-+.+--.-=|+..-|++ |++--++ T Consensus 264 ~~~-----g~~~-~d~-~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 264 RKK-----GNPV-VDT-RIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred Eec-----CceE-Eec-ccCCCceeEEeecccccceeeccccchhhccCCC Confidence 111 1111 000 1122333333222222222233443 3332222 No 14 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=99.12 E-value=2.5e-11 Score=78.67 Aligned_cols=301 Identities=11% Similarity=0.079 Sum_probs=137.3 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEE-EEcCceeeeeccCCCCCcc-ccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMD-LTDWDVSLLDAVDRDSRKA-ETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id-~~~~~~~lvp~v~rg~~~~-~~~~~~~~~~~ 78 (368) ||.. +..|-+.=.+..|..-=+.+.++.+. +|+..++......+= .-+... .+|.+.|+-.+. +.......+.. T Consensus 1 m~~~--~~~~~~dp~LT~~A~gy~n~~~ia~~-l~P~vpv~~~~~k~~~f~~eaF-~~~~t~r~~~~~~~~v~~~~~~~~ 76 (307) T protein:vir:10 1 MGRL--SKLRIVDPVLTNLAIGYTNAEFIGQS-LMPVVEVEKEGGKIPKFGKESF-RLYKTERALRARSNRMNPEDLGSI 76 (307) T ss_pred CCCC--CCCcccChhHHHHHHhhcchhhhhhh-cCCcccccccccceeeECcccc-cchhhhcccCCCcceeeccccccc Confidence 7763 34455443333333333345688885 899887765544332 222232 244444421111 11111111111 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEcc-CCCEEeccHhhc Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDA-RGTLYADLYKQF 157 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~-~G~~~~d~~~~f 157 (368) .+.++ ..+-..|- ..|+-|... 
-+.-++.+..+.+.|.+++|+++++.++....-+ +.++.+ T Consensus 77 ~~~~~---~~~L~~~i---d~r~~~~~~-----~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tL------ 139 (307) T protein:vir:10 77 DIVLD---EHDLEYPI---DYREDQESA-----FPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQL------ 139 (307) T ss_pred ccccc---cccccccC---ChhhcCCCC-----CCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEe------ Confidence 12222 11111221 234443211 1123455666778999999999999987422111 111111 Q ss_pred CCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 158 DVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITG 237 (368) Q Consensus 158 G~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~~ 237 (368) +.+=.|+++++||...+++|+++|.+..+ ..+-++++|.++|++|+.||+|.|.+++...+.-..+. T Consensus 140 -----sGt~~Wsd~~sDPi~di~~~~~ai~~~~g------~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~-- 206 (307) T protein:vir:10 140 -----SATEKFTAAGSDPVGVIEDGKEAIRTKIG------RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDL-- 206 (307) T ss_pred -----ccccccCCCCCCcHHHHHHHHHHHHhhhC------CccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHH-- Confidence 01125678899999999999999987543 36678999999999999999999998865432111111 Q ss_pred ccccccccccccccceeeeCCEEEEE-ccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccc Q lcl|Aclame:pro 238 SLRTGGADGVQAHMNTFYYGGVKFVQ-YNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPK 316 (368) Q Consensus 238 ~~~~~~~~~~~~~~~~f~~~gi~~~~-y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~ 316 (368) + ...| |++-+. ..+.|...+++.+.+.+.+..+.-.+.. 
...+..+++.--||-- T Consensus 207 -l-----------a~ll---~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~-------~~~~~~~~~epsfGyT-- 262 (307) T protein:vir:10 207 -L-----------KEIF---EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQ-------RGGQQRTPYEPSYGYT-- 262 (307) T ss_pred -H-----------HHHh---CceeEEEeeeeeeccCCccceeCCCceEEEecccc-------cCCCCCccccccccee-- Confidence 0 0112 233233 3455555566555555433221111000 0001111111111100 Q ss_pred hhhcccccceeeEeeeeccCCCeeEEEe--eecccccccCCceEEEEEEe Q lcl|Aclame:pro 317 MGYANTLGQELYVFEYEKDRDEGIDFEA--HSYMLPYCTRPQLLVDVRAD 364 (368) Q Consensus 317 ~~~~n~~~~~~y~~~~~~~~~~~~~l~~--eS~pLpv~~rP~al~~~t~~ 364 (368) +-..+.++--+ ..+..+++-+.+ .-.|+-++..-..|++-.++ T Consensus 263 ---~~~~g~~~~d~--~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 263 ---LRKKGNPVVDT--RIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred ---EEEcCCeEeec--eecCCceeEEeccccccceeecccccceeccCCC Confidence 00011111100 112233322322 22333333333334443333 No 15 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=98.75 E-value=1.8e-09 Score=68.58 Aligned_cols=302 Identities=10% Similarity=-0.030 Sum_probs=141.8 Q ss_pred ccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEEE-EcCceeeeecc--CCCCCccccccCCceeEEEEe Q lcl|Aclame:pro 5 SEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDL-TDWDVSLLDAV--DRDSRKAETSAPERVRQISFP 81 (368) Q Consensus 5 f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~-~~~~~~lvp~v--~rg~~~~~~~~~~~~~~~~f~ 81 (368) -++-.|-+.-.+..+..-=+.+.++++. +|+..++......+-. -+...-.++.. .|++....+. .+ .+...+. 
T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~-l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~-~~-~~~~~~~ 77 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDE-VLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVE-FS-ATDETGS 77 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhh-cCCccccCccccceeeechhhcccccchhhccCCCcceEe-ec-ccCceee Confidence 3445676553333333333456688885 8998887765544422 22222233333 3444333222 22 2234555 Q ss_pred cccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEE-ccCCCEEeccHhhcCCC Q lcl|Aclame:pro 82 MMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVV-DARGTLYADLYKQFDVE 160 (368) Q Consensus 82 ~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~-d~~G~~~~d~~~~fG~~ 160 (368) +.-......|.-+|+++-+ +.-+. .++....+.+.|.+++|+++++.++.--. .++..+.+. |. T Consensus 78 ~~~~~L~~~i~~~~~~~a~--~~~d~-------~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Ls-----gt- 142 (309) T protein:vir:99 78 TEDHGLDAPVPQADIDNAP--TNYNP-------LGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS-----GA- 142 (309) T ss_pred ecccceeecCCchhhhhcc--CCCCH-------HHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEec-----Cc- Confidence 6556666667666765432 11122 23334567889999999999998774322 222222221 11 Q ss_pred cceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhhcccc Q lcl|Aclame:pro 161 KKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLR 240 (368) Q Consensus 161 ~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~~~~~ 240 (368) =.|+++++||...+++|++.+ | ..+-.+++|.++|++|+.||+|.+.+++.......... 
.++ T Consensus 143 -----~~wsd~~SDPi~~i~~~~~~~------g----~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~-~~l- 205 (309) T protein:vir:99 143 -----DQWSDPTSNPLPVITDALDSV------I----LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM-AFL- 205 (309) T ss_pred -----cccCCCCCCcHHHHHHHHHhh------C----CCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCH-HHH- Confidence 136678999999999997554 1 25678899999999999999999999865332111110 011 Q ss_pred cccccccccccceeeeCCEEEE--EccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccc-cch Q lcl|Aclame:pro 241 TGGADGVQAHMNTFYYGGVKFV--QYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPC-PKM 317 (368) Q Consensus 241 ~~~~~~~~~~~~~f~~~gi~~~--~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa-~~~ 317 (368) ...|.+..|.+= .|.....+.++..+.....+..++-.+. ..++....-||.- .+- T Consensus 206 ----------a~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~-----------~~~~~~~ps~G~t~~~~ 264 (309) T protein:vir:99 206 ----------QELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDR-----------LADTRNGTTFGLTAQWG 264 (309) T ss_pred ----------HHHhCcceEEeecceeeccccccccccccccCCcEEEEEcCC-----------CCCCcccccccceeecc Confidence 112333333221 1222222333333333222222111110 1111111122210 100 Q ss_pred hhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEec Q lcl|Aclame:pro 318 GYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA 365 (368) Q Consensus 318 ~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a 365 (368) ... 
.+..+..+ +.++-++++..--.-.++.++..-..+++-.++| T Consensus 265 ~r~--~g~~~d~~-~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va~ 309 (309) T protein:vir:99 265 DRV--SGSIADPN-IGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) T ss_pred ccc--CCceeeee-eccCCceEEEEeccccchhcchhcchhhhhcccC Confidence 001 11111111 2222233333333334444443333444444444 No 16 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=291 Identities=10% Similarity=0.053 Sum_probs=120.9 Q ss_pred CcccccCCcccHHHHHH-------HHHhcCCcccchhhcccccccccc-cceEEEEEEcCce--eeeeccCCCCCccccc Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTG-------EVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDV--SLLDAVDRDSRKAETS 70 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~-------~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~--~lvp~v~rg~~~~~~~ 70 (368) |.-+.....-++.+|.. .|..+= .+.++.++ ||.....+ ...+.+....... .=..-|..|++=. +. T Consensus 7 i~s~~~~~~itv~~ll~~P~~I~~~i~e~~-~~~~iad~-lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP-~~ 83 (318) T protein:vir:10 7 IVSVSDGPAITVRELVGNPLWIPTALKKMM-VNQFISES-LFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEIP-VS 83 (318) T ss_pred ceeeecCCceehHHhhCCchhHHHHHHHHH-hccchhhh-hhhcccccccceeEEEecccccccCcHhhccCccccc-cc Confidence 44444444444444433 222222 46777776 67665443 3333333221111 0111222232211 11 Q ss_pred cCCceeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcE--EccCC Q lcl|Aclame:pro 71 APERVRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKV--VDARG 147 (368) Q Consensus 71 ~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i--~d~~G 147 (368) ..+....+.-..--...+-.|+-|-+. 
| .--+.+.+.+.++.+.+.+-.+.++..||+ ..+ +-+.+ T Consensus 84 ~~~~G~~~ia~~~K~G~~~~vS~Em~~--~---------n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~ 152 (318) T protein:vir:10 84 AGARGLPRTAFAVKKALGVRVSKEMID--E---------NRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPT 152 (318) T ss_pred CCCCCchhhhhhehhccceeccHHHHh--h---------cChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCc Confidence 111111110000011111222222110 1 112457888999999999999999999996 322 11111 Q ss_pred CEEeccHhhcCCCcceeEEecCCCCCcHHHHHHHHHHHH-----HHHhccccccccceEEEEEChHHHHHHhcCHHHHHH Q lcl|Aclame:pro 148 TLYADLYKQFDVEKKTIYFDLDNPNADIDASIEELRMHM-----EDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDA 222 (368) Q Consensus 148 ~~~~d~~~~fG~~~~~~~~~l~~a~~di~~~l~~~~~~i-----~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~ 222 (368) +.. + ++ +..+|+....+.+.... ++.+.-........-.+++++..|..|.+|+.++++ T Consensus 153 ~w~-~----~~-----------~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~ 216 (318) T protein:vir:10 153 AWD-N----GG-----------KVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKV 216 (318) T ss_pred CCC-C----cc-----------cccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhh Confidence 110 0 00 01112222222111000 000000111234556788999999999999999999 Q ss_pred HHHhhhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeecccc Q lcl|Aclame:pro 223 YLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGE 302 (368) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~ 302 (368) |....+...... ...+.+.. ++-|+.|+. . ..++.+++..+-. 
+ T Consensus 217 y~~~a~~~~~~~-----------~~tg~~~g-~~lGl~vi~------------s-------~~~p~~~alvlq~-----g 260 (318) T protein:vir:10 217 YERNANYVSTAP-----------DWTGNFPG-SVMGLNVIR------------S-------RTFPIDRVLIMER-----G 260 (318) T ss_pred hhccchhhhhcc-----------cccccccc-eeeceEEee------------c-------CccCCCeeEEEec-----C Confidence 975432211100 00111111 334666664 1 1233333333332 1 Q ss_pred chhhhheeeccccchhhcccccceeeEe---eeeccCCCeeEEEeeecccccccCCceEEEEEEecCC Q lcl|Aclame:pro 303 ANNIFEVAYGPCPKMGYANTLGQELYVF---EYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKG 367 (368) Q Consensus 303 ~~~~f~~~~apa~~~~~~n~~~~~~y~~---~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~ 367 (368) ..| +++ +..-....++|.- ..-.++.+ |.+..---.-+...+|.|++++|==--| T Consensus 261 ~vG----~~~-----d~~pl~~t~~~~egg~~~g~~~~s-~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 261 TVG----FYS-----DTRPLQFTALYPEGNGPNGGPTES-YRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred Ccc----eee-----ccccceeeecccCCCCCCCCcchh-hheehheeeeeeeeCcceeEEEeeccCC Confidence 112 111 0001111122210 00012223 2333333446778899999999977777 No 17 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=92.52 E-value=0.011 Score=31.31 Aligned_cols=310 Identities=10% Similarity=0.052 Sum_probs=125.2 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCcc-ccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKA-ETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~-~~~~~~~~~~~ 78 (368) -.++|-. .|+-. +..++.. ...+. +++..++++ ++++.|.++ |..++ ....+|.+.. +....++. .. 
T Consensus 18 ~~al~le-~f~ge-V~taf~~----~s~~~--~~~~~rti~~gkS~q~~~i-G~~~~-~~~~~G~~ld~~~~~~~k~-~i 86 (364) T protein:vir:10 18 VDSLLIE-KFNNR-VHEQYLK----GENLL--QWFDVQEVVGTNSVSNKYI-GETEL-QVLSPGKSPDASPTEFDKN-RL 86 (364) T ss_pred hhhhhhh-hhhhh-HHHHHHH----HHhhc--CcceeeeecccceEEeeee-eeeEE-eeeccCcccCCCCcccCcE-EE Confidence 2222211 01111 1222221 12222 345555554 677888777 33332 2222232210 01112221 22 Q ss_pred EEecccccccccccHHHHhcccCCCCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEE-ccCCCEEeccHhh Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTAN-ELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVV-DARGTLYADLYKQ 156 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~-~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~-d~~G~~~~d~~~~ 156 (368) .+....+...-.-.=+|+|+ .+ + -...+.+.....|+++.++.-. ....+ .|...... +..+.+. T Consensus 87 tID~ll~a~~~V~diDe~q~--~~---D~vR~e~s~e~G~ALA~~~Dq~i~-~~v~~-aa~a~~~~~~~~~~~~------ 153 (364) T protein:vir:10 87 VVDTTVIARNTVAHFHDVQN--DI---DGLKSKLSVNQAKKLKKMEDSMVI-QQLVL-GGISNTEAIRKNPRVA------ 153 (364) T ss_pred EecceeeechhhhhHHHHhc--Cc---cchhHHHHHHHHHHHHHHHHHHHH-HHHHh-hhhhcccccccCCccc------ Confidence 33333222222222233442 11 1 1123334445555554444321 11112 12221111 1111100 Q ss_pred cCCCcceeEEecC----CCCCc---HHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHH-HHHHhhh Q lcl|Aclame:pro 157 FDVEKKTIYFDLD----NPNAD---IDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRD-AYLAQQT 228 (368) Q Consensus 157 fG~~~~~~~~~l~----~a~~d---i~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~-~~~~~~~ 228 (368) +....++.+ ++.++ +...+.++.+++- ....+..+.++++.|.+|..|++|+.+.. -|... 
T Consensus 154 ----~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~Ld-----EkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~-- 222 (364) T protein:vir:10 154 ----GHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQT-----EQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIA-- 222 (364) T ss_pred ----CCcceeeecccCcchhhhHHHHHHHHHHHHHHHh-----hcCCCccccEEEeChHHHHHHhcCCcccccccccc-- Confidence 111111111 11222 3333334433332 22345577889999999999999988542 11100 Q ss_pred hhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccc-cccccceecCCceeeeeeeeccccchhhh Q lcl|Aclame:pro 229 PLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTL-VSIDSVADTVGVGHAFPNVAMLGEANNIF 307 (368) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~-~~~~~~~i~~~~~~~~p~~~~~~~~~~~f 307 (368) ..+.........+.|+.+++-. .++...+..... ...+...-..+.+..|....-..... T Consensus 223 ---------------~~~~~~~G~v~~v~Gv~Vv~Sn-~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~--- 283 (364) T protein:vir:10 223 ---------------ASDNTVDGFVLKSWNTPIVPSN-RFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQ--- 283 (364) T ss_pred ---------------CCCccccceeEEEeceEEEecc-ccccccccccccccccccccccccCCcccccccccceeE--- Confidence 0011112233456788776632 222211110000 00000111111222222111000000 Q ss_pred heeeccccchhhccc-ccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 308 EVAYGPCPKMGYANT-LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 308 ~~~~apa~~~~~~n~-~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) -..|.| +++.+ ..++.=...|.+++...+.|.+-...=.-..||++.+.++.++++| T Consensus 284 ~~~f~~----~Al~tv~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~~ 341 (364) T protein:vir:10 284 AVLFTQ----DALLVGRTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAADTAE 341 (364) T ss_pred EEEEec----ceEEEEEEecceeeeeeccceeeeeeeeehcccCcccCccceEEEEecCCCC Confidence 122333 12222 2234445667777778888888888888899999999999999999 No 18 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: 
genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=91.37 E-value=0.016 Score=30.40 Aligned_cols=304 Identities=12% Similarity=0.029 Sum_probs=120.7 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCccc-cccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKAE-TSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~~-~~~~~~~~~~ 78 (368) -.++|- -.|+-.-+++--+ ...+. +++..++++ ++++.|.++ |..+ +....+|.+-.. ....++. +. T Consensus 18 d~al~l-e~f~geV~~af~~-----~s~~~--~~~~~rti~~g~s~~~~~i-G~~~-~~~~~pG~~l~~~~~~~~k~-~i 86 (335) T protein:vir:63 18 DVDIHL-EEHLGIVDKHFAY-----TSKFA--PLMNIRDLRGSNVVRLDRL-GNVE-AKGRRAGEELERSRVVNDKW-NL 86 (335) T ss_pred hhheeh-hhhhhhHHHHHHh-----hhhhc--cccceeeeccceeEEEeee-eeee-eecccCCcCcCCCCccccce-EE Confidence 223333 1233333333222 22222 345555554 677888877 2222 222222322110 0111221 22 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhh-- Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQ-- 156 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~-- 156 (368) .+....+...-.=.=+|+|+ .-+-...+.+.....|+++.+..-. .+++.+-...+.. 
.+..- T Consensus 87 tVD~ll~a~~~I~dlDe~~~-----~yDvRse~s~e~G~aLA~~~D~~~~------~~i~~aa~~~a~~----~~~~~~~ 151 (335) T protein:vir:63 87 TVDTLLYLRHQFDHQDEWTQ-----SFDMRKEVAELDGQELARKFDQACL------IQVIKAAAMDAPV----DLEDAFS 151 (335) T ss_pred EecceeechhhhhhHHHHhc-----CchhHHHHHHHHHHHHHHHHHHHHH------HHHHhhccccCcc----ccCCCcC Confidence 33333222222222233332 1122233344455555544443222 2222322111100 00000 Q ss_pred cCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccc---cceEEEEEChHHHHHHhcCHHHHHH-HHHhhhhhhh Q lcl|Aclame:pro 157 FDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVIN---GEEIHVVVDRVFFSKLTKHPKIRDA-YLAQQTPLAW 232 (368) Q Consensus 157 fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~---~~~~~~l~g~~~~~~l~~h~~v~~~-~~~~~~~~~~ 232 (368) -|+++.+ .++-+++.++ ...|..+.....+.+. ....+ ..+.+++++|++|.+|++|+.+... |. ...+ T Consensus 152 ~G~~~~~-~~tg~~~~~~-~~~l~~a~~~a~~~L~-e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~---~s~~- 224 (335) T protein:vir:63 152 PGVLEKL-DLTGLTAKQA-ADKIVRMHRRVVETFI-DRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQ---ATGA- 224 (335) T ss_pred CCcceee-eeccCccccc-HHHHHHHHHHHHHHHH-hccCCCcccCceEEEeChHHHHHHhccccccccccc---cccc- Confidence 1332221 1122222223 2233322222222222 11122 2347889999999999999865321 11 0000 Q ss_pred hhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeec Q lcl|Aclame:pro 233 QQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYG 312 (368) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~a 312 (368) ...........+.|+.+++-. ..+...+.... 
.+.......+-|.+..+ T Consensus 225 ------------~~~~~~g~v~~v~Gv~V~~sn-~lP~~~~t~~~------------------lg~a~n~~~~d~~~~~~ 273 (335) T protein:vir:63 225 ------------TNDYVKSRVAILNGVKVLETP-RFATKAIAAHP------------------LGRHFNVSAEESERQIA 273 (335) T ss_pred ------------cccccCceeEEeeceEEEeec-cCCCCCccccc------------------ccccCCccccccceeEE Confidence 011223345577888877632 22222111111 00001111111211111 Q ss_pred cccchhhccc-ccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 313 PCPKMGYANT-LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 313 pa~~~~~~n~-~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) -.-+-+++.+ ...+.=..+|.+++...+.|.+-...=....||++.+.++.+.-|- T Consensus 274 ~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:63 274 LFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGARRPDTAGAIELKGIGA 330 (335) T ss_pred EEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCcccccceEEEEEEcCCCc Confidence 1111111211 1122334566666666677777777778889999999999977776 No 19 >protein:vir:98819 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:32561 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851100;genbank:gi:117530257;genbank:GeneID:4484483 Probab=91.10 E-value=0.017 Score=30.22 Aligned_cols=349 Identities=14% Similarity=0.159 Sum_probs=154.9 Q ss_pred CcccccCC--------------ccc----HHHHHHH-HHhcCCcccchhhcccccccccccceEEE-EEEcCceeeeecc Q lcl|Aclame:pro 1 MLTNSEKS--------------RFF----LADLTGE-VQSIPNTYGYISNLGLFRSAPITQTTFLM-DLTDWDVSLLDAV 60 (368) Q Consensus 1 ~~d~f~~d--------------~F~----~~~Lt~~-i~~~p~~~~~l~~l~~F~~~~~~t~~i~i-d~~~~~~~lvp~v 60 (368) |-||=+-+ .|- .++|... +.++|..| |+. 
+|+.+.+....|.- ..+.+...+.|.| T Consensus 1 msdipspnlqalisspylvdnttfprepvytelarsilaklpatp--lsa--vfpdetiaeriviaehviegvntifpvv 76 (437) T protein:vir:98 1 MSDIPSPNLQALISSPYLVDNTTFPREPVYTELARSILAKLPATP--LSA--VFPDETIAERIVIAEHVIEGVNTIFPVV 76 (437) T ss_pred CCCCCCcchHhhhcCceeeccccCCccchHHHHHHHHHHhcCCcc--ccc--cccchhhhhhhhhHHHHHhhhhhhhhhh Confidence 54543221 121 2334333 34555544 333 57777775544333 3467778888999 Q ss_pred CCCCCccccccCCceeE--EEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 61 DRDSRKAETSAPERVRQ--ISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQAL 138 (368) Q Consensus 61 ~rg~~~~~~~~~~~~~~--~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL 138 (368) +.|++..-+...+ .++ .++++--+..+-..+-..+++--.-|+..+..+.++.+.+||.++.++|..||....+..+ T Consensus 77 ewgapdlfvdddg-ytvyrqsyqplpirqsmymsyaqlnntvregttnerataaeqiekkltrqmqkhqltwnvfqaamm 155 (437) T protein:vir:98 77 EWGAPDLFVDDDG-YTVYRQSYQPLPIRQSMYMSYAQLNNTVREGTTNERATAAEQIEKKLTRQMQKHQLTWNVFQAAMM 155 (437) T ss_pred ccCCcceeecCCC-ceeeecccCCccchhhhhhhhhhhhhhhhccccchhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHH Confidence 9999986555443 344 3677767888888888888876556777777777888999999999999999988777766 Q ss_pred cCcE--EccCCCEE---------eccHhhcCCCcc-----e-----eEEecCCC--------CCcHHHHHHHHHHHHHHH Q lcl|Aclame:pro 139 KGKV--VDARGTLY---------ADLYKQFDVEKK-----T-----IYFDLDNP--------NADIDASIEELRMHMEDE 189 (368) Q Consensus 139 ~G~i--~d~~G~~~---------~d~~~~fG~~~~-----~-----~~~~l~~a--------~~di~~~l~~~~~~i~~~ 189 (368) .|.| .|+...+- -+||. |..+|. . .-+||+.. 
-+|+.=.|....|.+.+- T Consensus 156 lgginytdprsgvrvkapayiparnffn-fnttqgyrgrnearlfrnlidlnaggtpssgipitdpqfalsnftrrlnrw 234 (437) T protein:vir:98 156 LGGINYTDPRSGVRVKAPAYIPARNFFN-FNTTQGYRGRNEARLFRNLIDLNAGGTPSSGIPITDPQFALSNFTRRLNRW 234 (437) T ss_pred hccccccCcccceeeecccccccccccc-cccccccccchHHHHHHHHhhccCCCCCcCCcccccchhhHHHHHHHHHHH Confidence 7766 34432221 13322 333321 0 11333221 234444444444444333 Q ss_pred hccccccccceEEEEEChHHHHHHhcCHHHHHHHHH---hhhhh-hhhhhhcccccccc-------------cccccccc Q lcl|Aclame:pro 190 AKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLA---QQTPL-AWQQITGSLRTGGA-------------DGVQAHMN 252 (368) Q Consensus 190 ~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~---~~~~~-~~~~~~~~~~~~~~-------------~~~~~~~~ 252 (368) .+...-+.++. ..+|++.-|.+.-..+.+-+--. +..+. +...+...-.++.. ........ T Consensus 235 fkdtnksditd--mymgpemrdvilmseearlaqggiiprlgavfgdstidsngsggsfgplppgglgtgmglvlgtrge 312 (437) T protein:vir:98 235 FKDTNKSDITD--MYMGPEMRDVILMSEEARLAQGGIIPRLGAVFGDSTIDSNGSGGSFGPLPPGGLGTGMGLVLGTRGE 312 (437) T ss_pred hhccccccchh--hhcCccceeeeeeccchhhhhcccchhhhhhhccccccCCCCCcccCCCCccccccccceeeecccc Confidence 33222222222 22444443333322221100000 00000 00000000000000 00111223 Q ss_pred eeeeCCEEEEEccccccCCCcccc-cccccccceecCCceeeeeeeeccccchhhhheeeccccchhhcccccceeeEee Q lcl|Aclame:pro 253 TFYYGGVKFVQYNGKFKDKRGKVH-TLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPKMGYANTLGQELYVFE 331 (368) Q Consensus 253 ~f~~~gi~~~~y~~~~~~~~g~~~-~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~~~~~n~~~~~~y~~~ 331 (368) ...+.||.+...+..|+++-..+. ...|-++++ ++-.--.-+.+.-.=++-|.-.+ .....+++ +... 
T Consensus 313 ilsiaginvhvvdtiykdpvdgvekrvwpknkiv-------avsfrdsdgnveapgrtqycsse--nsidspgl--wtrt 381 (437) T protein:vir:98 313 ILSIAGINVHVVDTIYKDPVDGVEKRVWPKNKIV-------AVSFRDSDGNVEAPGRTQYCSSE--NSIDSPGL--WTRT 381 (437) T ss_pred eeEeecceeeeehhhhhcchhhhhhhcCCccceE-------EEEEecCCCcccCCccccccccc--cccCCCcc--eeee Confidence 345667766666655666543333 333333221 11000000111111122222110 00111111 1111 Q ss_pred ee---ccCCCeeEEEeeecccccccCCceEEEEEEec----------C-----CC Q lcl|Aclame:pro 332 YE---KDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA----------K-----GG 368 (368) Q Consensus 332 ~~---~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a----------~-----~~ 368 (368) .. -.--.|+-+..-..-||+-.-|--++.+|--. + || T Consensus 382 vtdvpppaapgiavqmgnaglpyfkypyrvchvtpctveqinerlgiqgelffpg 436 (437) T protein:vir:98 382 VTDVPPPAAPGIAVQMGNAGLPYFKYPYRVCHVTPCTVEQINERLGIQGELFFPG 436 (437) T ss_pred eccCCCCCCCcceEeecCCCCcccccceeeeeecccchHHhhhhhCcceeeecCC Confidence 11 11223445555555566655555555443211 0 01 No 20 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=90.32 E-value=0.021 Score=29.73 Aligned_cols=305 Identities=13% Similarity=0.053 Sum_probs=120.2 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCcccc-ccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKAET-SAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~~~-~~~~~~~~~ 78 (368) -.++|- -.|+-.-+++--+ ...+. +++..+.++ ++++.|.++ |..++ ....+|.+.... ...++. .. 
T Consensus 18 d~al~l-e~f~geV~~af~~-----~s~~~--~~~~~rti~~g~s~~~~~i-G~~~~-~~~~pG~~l~~~~~~~~k~-~i 86 (335) T protein:vir:78 18 DVDIHL-EEHLGIVDKHFAY-----TSKFA--PLMNIRDLRGSNVVRLDRL-GNVEA-KGRRAGEELERSRVVNDKW-NL 86 (335) T ss_pred hhhhhh-hhhhhHHHHHHHH-----hhhhc--cccceeeeccceeEEEeee-eeeee-cccccCcccCCCCcccCCe-EE Confidence 223333 2233333443333 22222 346666554 677888876 33332 333333322110 111211 12 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhh-- Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQ-- 156 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~-- 156 (368) .+....+...-.=.=+|+|+ .-+-.....+...+.|+++.++.-. .+++.+-...+- .++..- T Consensus 87 tID~ll~a~~~VddlDe~~~-----~yDvR~e~s~~~G~aLA~~~Dq~~~------~~l~~aa~~~a~----~~~~~~~~ 151 (335) T protein:vir:78 87 TVDTLLYLRHQFDHQDEWTQ-----SFDMRKEVAELDGQELARKFDQACL------IQVIKAAAMDAP----VDLEDAFS 151 (335) T ss_pred EecceeechhhHhhHHHhhc-----CchhHHHHHHHHHHHHHHHHHHHHH------HHHHhhcccccc----cccCCCcC Confidence 23222221111111122332 1122223344444555444332221 222332211110 000000 Q ss_pred cCCCcceeEEecCCCCCc---HHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHH-HHHhhhhhhh Q lcl|Aclame:pro 157 FDVEKKTIYFDLDNPNAD---IDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDA-YLAQQTPLAW 232 (368) Q Consensus 157 fG~~~~~~~~~l~~a~~d---i~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~-~~~~~~~~~~ 232 (368) -|.+..+ .+.-+++.++ +...+.++...+.+.---. ....+.+++++|++|.+|+.|+.+... |. 
...+ T Consensus 152 ~G~~~~~-~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~--~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~---~s~~- 224 (335) T protein:vir:78 152 PGVLEKL-DLTGLTAKEAAEKIVRMHRRVVETFIERDLGD--AVYSEGLTPMSPRVFSLLLEHDKLMSVEYQ---ATGA- 224 (335) T ss_pred CCcceee-eeccccccccHHHHHHHHHHHHHHHHhccCCC--CCCCccEEEeChHHHHHHhccccccccccc---cccc- Confidence 1222211 1111122223 3334444443333211000 012346789999999999999875321 11 0000 Q ss_pred hhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeec Q lcl|Aclame:pro 233 QQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYG 312 (368) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~a 312 (368) ...........+.|+.+++-. ..+...+....+= .++..+. .+......+++ T Consensus 225 ------------~~~~~~g~v~~v~Gv~V~~Sn-~lP~~~~t~~~lg------------~a~n~~~--~d~~~~~~~~~- 276 (335) T protein:vir:78 225 ------------TNDYVKSRVAILNGVKVLETP-RFATKAISAHPLG------------RHFNVSA--EEAERQIALFL- 276 (335) T ss_pred ------------ccccccceeEEeeceEEEeec-cCCCCCCcccccc------------ccCCccc--ccccceEEEEE- Confidence 011223344567888877622 2222221111100 0000000 00000001111 Q ss_pred cccchhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 313 PCPKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 313 pa~~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) +.+-..++ ..++.=...|.+++...+.|.+-...=....||++.+.++.+..|- T Consensus 277 ~~~Al~t~--~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:78 277 PSKTLITA--QVAPVQAKLWEDHDQFSWVLDTFQMYNIGARRPDTAGAIELKGIEA 330 (335) T ss_pred ecceEEEE--EEEecccceeeccchhhHhhhHHHHcCCcccCcceEEEEEecCCCc Confidence 11111111 1122334556666666667777777778889999999999988888 No 21 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=74.70 E-value=0.16 
Score=25.02 Aligned_cols=320 Identities=8% Similarity=-0.030 Sum_probs=115.4 Q ss_pred CcccccCC--------------cccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCC Q lcl|Aclame:pro 1 MLTNSEKS--------------RFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSR 65 (368) Q Consensus 1 ~~d~f~~d--------------~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~ 65 (368) .+++.++. .|+ -++..++++ ...+. ++...+.++ ++++.|.++ |..++-. ..||.+ T Consensus 11 ~~n~~t~~~~~~~~~~~al~le~f~-geV~~~f~~----~si~~--~~~~~rti~~Gksv~f~~i-G~~t~~~-~t~G~~ 81 (375) T protein:vir:10 11 RSNLSTGTGYGGATDKYALYLKLFS-GEMFKGFQH----ETIAR--DLVTKRTLKNGKSLQFIYT-GRMTSSF-HTPGTP 81 (375) T ss_pred ccccCCccccccccchHHHHHHHHh-HHHHHHHHH----HHhhh--ccccccccccCceEEEEee-eeeEEee-ecCCcC Confidence 11111111 122 122333332 13333 346666665 788888887 4444333 233432 Q ss_pred c-cccccCCcee--EEEEecc-cccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cC Q lcl|Aclame:pro 66 K-AETSAPERVR--QISFPMM-YFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQAL-KG 140 (368) Q Consensus 66 ~-~~~~~~~~~~--~~~f~~p-~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL-~G 140 (368) - .+.....+.+ ...+.-. |+. ..=+|+... +.--+++.+...++-..+.+....++++.| .+ T Consensus 82 i~~~~~~d~~~te~~l~ID~~~y~~----~~VdDiD~a---------qa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~ka 148 (375) T protein:vir:10 82 ILGNADKAPPVAEKTIVMDDLLISS----AFVYDLDET---------LAHYELRGEISKKIGYALAEKYDRLIFRSITRG 148 (375) T ss_pred cCCccccCCCCCceEEEecchhhhh----hhHhhHHHH---------hcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1 1101111111 1122111 111 111122211 111112222222222222233333333322 22 Q ss_pred cEEcc--CCCEEeccHhhcCCCcce-eEEecCCC---CCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHh Q lcl|Aclame:pro 141 KVVDA--RGTLYADLYKQFDVEKKT-IYFDLDNP---NADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLT 214 (368) Q Consensus 141 ~i~d~--~G~~~~d~~~~fG~~~~~-~~~~l~~a---~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~ 214 (368) ....+ .+...+ .-|.++-. 
.+..-+++ ...+...+.++.+++.++ ..+..+.+++++|++|.+|+ T Consensus 149 a~~~~p~~~~~~~----~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~-----~VP~~~R~~vv~P~~y~~Ll 219 (375) T protein:vir:10 149 ARSASPVSATNFV----EPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEK-----GVSSQGRCAVLNPRQYYALI 219 (375) T ss_pred hhhcccccccccc----ccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhc-----CCCCCCCEEEeChHHHHHHH Confidence 21111 010000 01111110 01111111 223556666666555432 23445667889999999999 Q ss_pred cCHHHHHHHHHhhhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCc-eee Q lcl|Aclame:pro 215 KHPKIRDAYLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGV-GHA 293 (368) Q Consensus 215 ~h~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~-~~~ 293 (368) .|.+..++...-..+ .+.........+.|+++++-. ..+...+.....-.+....-+... .+. T Consensus 220 ~~~d~~~~~n~d~~~---------------~~~~~~g~v~~i~Gv~V~~Sn-~lP~~~~~~~~~g~~~~~~a~~~~~~~~ 283 (375) T protein:vir:10 220 QDIGSNGLVNRDVQG---------------SALQSGNGVIEIAGIHIYKSM-NIPFLGKYGVKYGGTTGETSPGNLGSHI 283 (375) T ss_pred hcCCccceeeecccc---------------cceeccceEEEEeceEEEEec-cccccccccccccccccccchhhhhccc Confidence 885433222110000 011112233466788877633 233333222111100000000000 011 Q ss_pred eeeeeccccchhhhheeecccc----------chhhccc-----ccceeeEeeeeccCCCeeEEEeeecccccccCCceE Q lcl|Aclame:pro 294 FPNVAMLGEANNIFEVAYGPCP----------KMGYANT-----LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLL 358 (368) Q Consensus 294 ~p~~~~~~~~~~~f~~~~apa~----------~~~~~n~-----~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al 358 (368) .|...-...+.+.|-.|.+-.+ +-+++++ +..+.+.. .-+..-+++.|.+-...=.-+.||++. 
T Consensus 284 ~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~-~~~~~~q~~~i~~~~a~G~~~lrp~~a 362 (375) T protein:vir:10 284 GPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNG-DVSVIYQGDVILGRMAMGADYLNPAAA 362 (375) T ss_pred cccCCcceeeccccccccccccccCceEEEEEchhheeeeeeeccccccccc-hhhheeeeeeeeeeeeeccCccCceeE Confidence 1110000011121211111110 1112221 11111110 112233566788888888889999999 Q ss_pred EEEEEecCCC Q lcl|Aclame:pro 359 VDVRADAKGG 368 (368) Q Consensus 359 ~~~t~~a~~~ 368 (368) +.++..|.+- T Consensus 363 v~l~~~~~~~ 372 (375) T protein:vir:10 363 VELYIGATAP 372 (375) T ss_pred EEEecCcCcc Confidence 9999986333 No 22 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=65.94 E-value=0.28 Score=23.65 Aligned_cols=272 Identities=10% Similarity=0.062 Sum_probs=97.8 Q ss_pred cccccCCcccHHHHHHHHHhcCCcccchhhcccccc-----cccccceEEEEEEcCcee-e---------eeccCCCCCc Q lcl|Aclame:pro 2 LTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRS-----APITQTTFLMDLTDWDVS-L---------LDAVDRDSRK 66 (368) Q Consensus 2 ~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~-----~~~~t~~i~id~~~~~~~-l---------vp~v~rg~~~ 66 (368) |-+ -.|+ -.....+..+=.....+.+. |.. .++.-+...++.+..... 
+ +.|-+ +.+ T Consensus 1 ~av---r~y~-Kq~~glL~~vf~~qa~F~~~--FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGt--GTg 72 (287) T protein:vir:39 1 MAI---KYFT-KQYAGMLPDLFAKKSAFLRA--FGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGS--GTG 72 (287) T ss_pred CCc---cccc-HHHHHHHHHHHHHHHhhhhh--cccceeeecCCcccceEEEEEecCcceEEecccCCCCccccc--CCC Confidence 111 1111 11111122111111111111 111 223233333333322221 1 12211 111 Q ss_pred cccccCCceeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccC Q lcl|Aclame:pro 67 AETSAPERVRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDAR 146 (368) Q Consensus 67 ~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~ 146 (368) ..+.-+++.++++..++ ++.+....-.| |+-.+=-. .-+.+++++||.-|-..-.+......-.+|. .- T Consensus 73 ~ssRFG~rkEi~y~dt~-V~Y~~~~~ihE--GiD~~TVN---nd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls-----~~ 141 (287) T protein:vir:39 73 NTSRFGQRKEVKSVNKQ-VSYDAPLAINE--GIDDFTVN---DIKDQVVAERLALHGVAWAQHVDKLLGKLLS-----DS 141 (287) T ss_pred ccccccceeEEEEeccc-ccceecccccc--cccccccc---CChhHHHHHHHHhHHHHHHHHHHHHHHHHHH-----hh Confidence 11112222233322222 22222222222 11111101 1235678888886665444444433333332 11 Q ss_pred CCEEeccHhhcCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHh Q lcl|Aclame:pro 147 GTLYADLYKQFDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQ 226 (368) Q Consensus 147 G~~~~d~~~~fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~ 226 (368) . 
-++.++.+++ .++...++++...--.+ ......++++.+.+++|++|+.||.+..+- T Consensus 142 A-------------~~t~~~~~t~--d~V~~LF~~a~~~yvNn----~v~~~~~~~AyV~aevYnaiiD~~l~TsaK--- 199 (287) T protein:vir:39 142 A-------------SETLTVKLDE--DSVTKLFSDAHKKFVNN----NVSIAVPWVAYVNADIYDLLIDSKLATTAK--- 199 (287) T ss_pred c-------------chheeeeecc--cchHHHHHHHHHHhhcc----ceeeEEEEEEEEChhHHhHHhccccccccc--- Confidence 1 1233333443 34556666555333221 123346789999999999999999975321 Q ss_pred hhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhh Q lcl|Aclame:pro 227 QTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNI 306 (368) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~ 306 (368) ....|+.. ...+.|.|+.+.+--.+... .|+...+- +.+-+++|- |+. T Consensus 200 ---~SsaNiDe-------------n~i~kFkGf~l~e~P~~~~q-~g~~a~fs-------~dnig~af~------GI~-- 247 (287) T protein:vir:39 200 ---NSSANVDE-------------QTLYKFKGFILSELPDEKFQ-LNEGAYFA-------ADNVGVAGV------GIQ-- 247 (287) T ss_pred ---cceeeecc-------------CCcceecceEEEecchHhhc-cCcEEEEc-------cccceeecc------cce-- Confidence 11122211 12356778888874432211 11111111 111122111 110 Q ss_pred hheeeccccchhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEec Q lcl|Aclame:pro 307 FEVAYGPCPKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA 365 (368) Q Consensus 307 f~~~~apa~~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a 365 (368) ...-=-++++.-+..++.-=|-+--++++ -.|++|+|.+- T Consensus 248 -vaR~i~sEdF~GvalQgAgK~G~~i~e~N------------------k~Ai~k~t~~k 287 (287) T protein:vir:39 248 -VTRAMDSEDFAGTALQAAAKYGKYLPEKN------------------KKAILKATVTK 287 (287) T ss_pred -eEEeeecccccceeeeccccccccccccc------------------ceEEEEEecCC Confidence 00000022222233333322332222222 23455555444 No 23 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=65.35 E-value=0.28 
Score=23.57 Aligned_cols=309 Identities=11% Similarity=0.117 Sum_probs=120.9 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCcc-ccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKA-ETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~-~~~~~~~~~~~ 78 (368) -|-.|+.. +..++.. ...+. +++..+.++ ++++.|.+. |..++ ....+|.+.. .....++ ... T Consensus 22 ~le~f~ge------V~taF~~----~si~~--~~~~vrti~~GkS~qf~~i-G~~~a-~y~~~G~~ldg~~~~~~k-~~I 86 (402) T protein:vir:97 22 LIEKFNGK------VNEQYLK----GENIL--SYFDVQTVTGTNTVSNKYL-GETEL-QVLAPGQSPNATPTQADK-NQL 86 (402) T ss_pred hhhhhhhh------HHHHHHH----HHhhc--CcceeeeecccceEEEEEE-eeeEE-eeeccccccCCCCccccc-EEE Confidence 22333322 1222221 22222 345556554 677888887 33333 3333332210 0111222 122 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCcEEccCCCEEeccHhhc Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQAL-KGKVVDARGTLYADLYKQF 157 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL-~G~i~d~~G~~~~d~~~~f 157 (368) .+....+...-.-.=+|+|+ .+-. -...+.+.....|+++.+++-.. ...++++. ...+ +..+...-+ T Consensus 87 tID~lL~a~~~V~diDeaq~--~yD~--vRse~s~e~G~ALA~~~Dq~ii~-~i~~aa~a~t~~~-~~~~~~~~~----- 155 (402) T protein:vir:97 87 VIDTTVIARNTVAHIHDVQG--DIDS--LKPKLAMNQAKQLKRLEDQMAIQ-QMLLGGIANTKAE-RNKPRVKGH----- 155 (402) T ss_pred EeCceeechhhhhhHHHHHh--cccc--hhHHHHHHHHHHHHHHHHHHHHH-HHHHhhccccccc-cccCccccc----- Confidence 33333222222222233442 1100 11223334455555444432111 11111111 1111 111111100 Q ss_pred CCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHH-HHHHhhhhhhhhhhh Q lcl|Aclame:pro 158 DVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRD-AYLAQQTPLAWQQIT 236 (368) Q Consensus 158 G~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~-~~~~~~~~~~~~~~~ 236 (368) |-+.. ++.+-..+..++......+....+. +. ....+..+.+++++|++|..|++|+.+.. -|... 
+ T Consensus 156 g~s~~-~~~t~~~a~~~~~~l~~ai~~a~~~-Ld-EkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~--~------- 223 (402) T protein:vir:97 156 GFSIN-VNVTESEALANPQYVMAAVEYALEQ-QL-EQEVDISDVAIMMPWKFFNALRDADRIVDKTYTIS--Q------- 223 (402) T ss_pred ccccc-cccccchhhcCHHHHHHHHHHHHHH-HH-hcCCCccccEEEeChHHHHHHhhcccccchhhccc--c------- Confidence 11111 1111122233433333322222221 11 22245567889999999999999987542 11100 0 Q ss_pred cccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccc Q lcl|Aclame:pro 237 GSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPK 316 (368) Q Consensus 237 ~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~ 316 (368) .+.........+.|+.+++-. .++...+... ....-..+.+..|.... +-.......|.| T Consensus 224 --------~g~~~~G~v~~v~Gv~Vv~Sn-nlP~~a~~it-----~~~ls~a~~G~~y~~t~---d~t~~~~~~f~~--- 283 (402) T protein:vir:97 224 --------SGATINGFVLSSYNCPVIPSN-RFPTFAQDQA-----HHLLSNEDNGYRYDPIA---EMNGAVAVLFTS--- 283 (402) T ss_pred --------CCccccceeEEEeceEEEecC-cccccccccc-----ccccccCCCCccCCcCc---ccceeEEEEEec--- Confidence 011112233456788777632 2221110000 00111122233333211 111111223333 Q ss_pred hhhccc-ccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEec--CCC Q lcl|Aclame:pro 317 MGYANT-LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA--KGG 368 (368) Q Consensus 317 ~~~~n~-~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a--~~~ 368 (368) +++++ ...++-...|.+++...+.|.+..+.=..+.||++.--++.+- -+| T Consensus 284 -~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~ 337 (402) T protein:vir:97 284 -DALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) T ss_pred -ceEEEEEeeccccchhhchhHHHHHHHHHHHhCCcccCccceEEEEEecccccc Confidence 22333 2334555667777777778888888888899999886553333 333 No 24 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=57.39 E-value=0.44 Score=22.56 
Aligned_cols=274 Identities=11% Similarity=0.009 Sum_probs=99.1 Q ss_pred cccccc-cceEEEEEEcCceeeeeccCCCCCcc---ccccCCceeEEEEecccccccccccHHHHhcccCCCCcCHHHHH Q lcl|Aclame:pro 36 RSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKA---ETSAPERVRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTE 111 (368) Q Consensus 36 ~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~---~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~ 111 (368) =.++++ ++++.|.++ |..+ +....+|.+-. +.....+. ...+.-.-+.. +.=+|+....+-. +-.... T Consensus 1 ~vr~i~~g~s~~~~~i-G~~~-~~~~~~G~~l~~~~~~~~~~e~-~itID~~l~~~---~~VdDiD~~qa~~--Dlr~e~ 72 (324) T protein:vir:99 1 MTRTITSGKSAQFPVM-GRTK-ARYLKQGQSLDDGREDIKHTEK-VITIDGLLTTD---VLIYDIEDAMNHY--DVRSEY 72 (324) T ss_pred CeeeeecCceEEEeee-eeeE-eccccCCCCcCCCcCCcCcccE-EEEecchhhhh---hhhhhHHHHhcCc--cchhHH Confidence 122232 677777776 3332 23333343321 10111111 11111110000 0111222211111 111122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-hc------CcEEccCCCEEeccHhhcCCCcceeEEecCCCCCcHHHHHHHHHH Q lcl|Aclame:pro 112 AVVRAKKLMKIRTKFDITREFLFMQA-LK------GKVVDARGTLYADLYKQFDVEKKTIYFDLDNPNADIDASIEELRM 184 (368) Q Consensus 112 ~~~v~~~l~~~~~~i~~t~E~m~a~A-L~------G~i~d~~G~~~~d~~~~fG~~~~~~~~~l~~a~~di~~~l~~~~~ 184 (368) .+.....|++..++.-... ++.+ .. +.+.-.+|...++- ..-..+.......+...+.++.+ T Consensus 73 s~~~G~aLA~~~Dq~i~~~---~a~~~~~~a~~~~~~~~~~g~~~~~~~--------~~~~~~~~~~~~~~~dai~~a~~ 141 (324) T protein:vir:99 73 STQMGEALAMAADVANYAE---MAKLVNSRKETTNENIEGLGAASLVKI--------TGKKEDPAKYGTQVIQALTYARA 141 (324) T ss_pred HHHHHHHHHHHHHHHHHHH---HHHhhhcccccccCCcccCCccceecc--------cccccccccCHHHHHHHHHHHHH Confidence 2233333333333222211 1111 11 11111112222210 00000000111234555555555 Q ss_pred HHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhhcccccccccccccccceeeeCCEEEEEc Q lcl|Aclame:pro 185 HMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQY 264 (368) Q Consensus 185 ~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y 264 (368) .+.++ ..+..+.+++++|++|..|+.|+.+.....+ + ...++. 
.....+.|+++++- T Consensus 142 ~Lde~-----~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~---~------~~~~~~---------G~V~~i~Gf~V~~S 198 (324) T protein:vir:99 142 AFAKK-----YIPAGDRTFYTDPDTYSAILAALMPNAANYA---A------LIDPET---------GNIRNVMGFEVVET 198 (324) T ss_pred HHhhc-----CCCCCCCEEEeChHHHHHHhhcccccccccc---c------ccceec---------ceEEEEeceEEEec Confidence 55432 1344567789999999999998776532110 0 011111 12345678888863 Q ss_pred cccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccchhhcccccceeeE-------------ee Q lcl|Aclame:pro 265 NGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPKMGYANTLGQELYV-------------FE 331 (368) Q Consensus 265 ~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~~~~~n~~~~~~y~-------------~~ 331 (368) .. .+...+... .-...+.+++++.... .+....|.+ +..+..++=|+. .. T Consensus 199 n~-lp~~~~t~~-------~~a~~~~~~~~~~~~~----~~~~~ky~~-----d~~~~~gl~~~~~a~~tv~~~~~~~e~ 261 (324) T protein:vir:99 199 PH-MTAQMVTNP-------TDAFDGTGHIFPATGD----STTTGKMTV-----GADNVVGLFVHRSAVATLKLKDMALER 261 (324) T ss_pred CC-ccccccccc-------cccccccccccccccc----ccccccccc-----ccCceeEEEEehhheEEEeeecceecc Confidence 21 221111100 0011112222222110 000001111 111222221111 11 Q ss_pred eeccCCCeeEEEeeecccccccCCceE--EEEEEecCCC Q lcl|Aclame:pro 332 YEKDRDEGIDFEAHSYMLPYCTRPQLL--VDVRADAKGG 368 (368) Q Consensus 332 ~~~~~~~~~~l~~eS~pLpv~~rP~al--~~~t~~a~~~ 368 (368) +.+++-.++.|..-...=....||++. 
++++.+|.|| T Consensus 262 ~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~ 300 (324) T protein:vir:99 262 ARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPA 300 (324) T ss_pred eechhhHHHhhhhhhhhcCcccccceEEEEEEccCcccc Confidence 122222444555555555677899977 6777888887 No 25 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=52.19 E-value=0.56 Score=21.96 Aligned_cols=294 Identities=13% Similarity=0.068 Sum_probs=103.9 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCcccc-ccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKAET-SAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~~~-~~~~~~~~~ 78 (368) -.++|- -.|+-.-+++--++ ..+.+ +...+.++ +.++.|.++ +..++ ....||.+-... ...++. .. T Consensus 20 ~~~l~l-e~~~geV~~af~~~-----s~~~~--~~~~r~i~~G~s~~~~~i-G~~~~-~~~~~g~~l~~~~~~~~~~-~l 88 (334) T protein:vir:80 20 DVSLHI-EEHLGLVDASFMYS-----SKFAS--WMNVRSLRGTNQLRVDRV-GASTI-AGRKAGEELVVQKNVSDKL-NL 88 (334) T ss_pred hheehh-hhhhhHHHHHHHHh-----hhhhc--cceeeeccccceEEEeee-cceee-eeecCCCCCCCCCcccCce-EE Confidence 223332 11332223333221 23332 35555555 677888776 33332 333334332111 111211 11 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEcc---------CCCE Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDA---------RGTL 149 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~---------~G~~ 149 (368) .+.-..+..--.=.-+|+|. - -+-.....+.....|++..++ . ...+++++....+ +|.. 
T Consensus 89 ~ID~~l~~~~~VddiD~~q~---~--~D~rse~~~~~G~aLA~~~D~---~---~~~~l~kaa~~~~~~~~~~~~~~G~~ 157 (334) T protein:vir:80 89 TVDTVLYARHFFDKFDEWTS---N--LDVRKETAREDGIALARQYDQ---A---CIIQLQKCGDFLAPAHLKPAFHDGIL 157 (334) T ss_pred EEeeeeehhhhHhhHHHHhc---C--cchHHHHHHHHHHHHHHHHHH---H---HHHHHHHhhhhcccccccccccCCcc Confidence 22222111111111122222 1 122222233334444433332 1 2222233322111 1211 Q ss_pred EeccHhhcCCCcceeEEecCCCCCc---HHHHHHHHHHHHHHHhccccccc---cceEEEEEChHHHHHHhcCHHHHHH- Q lcl|Aclame:pro 150 YADLYKQFDVEKKTIYFDLDNPNAD---IDASIEELRMHMEDEAKTGTVIN---GEEIHVVVDRVFFSKLTKHPKIRDA- 222 (368) Q Consensus 150 ~~d~~~~fG~~~~~~~~~l~~a~~d---i~~~l~~~~~~i~~~~~~g~~~~---~~~~~~l~g~~~~~~l~~h~~v~~~- 222 (368) .. ...+-...++.++ +...+.++.+++-++ . .+ ..+.+++++|++|.+|+.|+.+... T Consensus 158 ~~----------~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~-d----vp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d 222 (334) T protein:vir:80 158 LP----------STISGLAADAAADADVLVAAHRQGVEAMVFR-D----LGDQLMSEGVTLLDPVIFSFLLEHDRLMNVE 222 (334) T ss_pred ee----------ecccccccchhhhHHHHHHHHHHHHHHHHhc-C----CCCCcCCceEEEeChHHHHHHhcccccccce Confidence 10 0000000111222 233333444333321 1 22 3457889999999999999876432 Q ss_pred HHHhhhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeecccc Q lcl|Aclame:pro 223 YLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGE 302 (368) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~ 302 (368) |..... ...........+.|+++++-. ..+...+.... .+..+.. 
T Consensus 223 ~~~s~~----------------~~~~~~g~i~~v~G~~V~~Sn-~~P~~~~t~~~------------~g~~~~~------ 267 (334) T protein:vir:80 223 FGAKEG----------------GNSFVGGRIAMLNGVRVVETP-RFPQSAITANA------------LGADFNV------ 267 (334) T ss_pred eccccc----------------cccccceeEEEEeceEEEeec-CCCCccccccc------------ccccccc------ Confidence 111000 011122334567888888732 12221111000 0000000 Q ss_pred chhhhhee---eccccchhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 303 ANNIFEVA---YGPCPKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 303 ~~~~f~~~---~apa~~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) ..+-|.+. |.+..-+.++ ...+.=...|.+++-.++.|.+-...=.-..||++++-+++.---- T Consensus 268 ~agd~t~~~~~~~~~~Al~t~--~~~~~~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 268 TDAEVRRKMITFIPSMALISA--QVHPVSAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred ccccccceEEEEEeCceEEEE--EEeecceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEeeecC Confidence 00000000 0011101111 1111123334444444444555445556678998775444433222 No 26 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=43.10 E-value=0.86 Score=20.95 Aligned_cols=308 Identities=11% Similarity=0.014 Sum_probs=109.5 Q ss_pred Cccc---------------------ccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeee Q lcl|Aclame:pro 1 MLTN---------------------SEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLD 58 (368) Q Consensus 1 ~~d~---------------------f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp 58 (368) |.|+ |-. .|.-.-+++ +. ...++. ++...+.++ +.++.|..+ +...+ . 
T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik-~f~~eV~~~-f~----~~s~~~--~~~~~r~i~~G~sv~i~~i-G~~tv-~ 70 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLK-VFAGEVLTA-FT----RRSVTA--DKHIVRTIQNGKSAQFPVM-GRTSG-V 70 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHH-HHhHHHHHH-HH----HHHhhh--cccccccccccceEEEecc-cceee-e Confidence 3333 110 122222221 11 112222 234444443 566666665 22222 2 Q ss_pred ccCCCCCccccccCCceeEEEEecccccccc-cc-cHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 59 AVDRDSRKAETSAPERVRQISFPMMYFKEVE-SI-TPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQ 136 (368) Q Consensus 59 ~v~rg~~~~~~~~~~~~~~~~f~~p~~~~~~-~i-~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~ 136 (368) ...||.+-......-.-..+.+.+=-.+..+ .| .-+++|. - -++..+...++-..+.+....+++. T Consensus 71 ~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~---~---------~D~~~~~~~~~g~aLa~~~D~~i~~ 138 (347) T protein:vir:94 71 YLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMN---H---------YDVAGEYSNQLGEALAIAADGAVLA 138 (347) T ss_pred eecCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhc---C---------cchHHHHHHHHHHHHHHHHHHHHHH Confidence 2223332110000000011112111011000 01 1112221 1 1122222223333333333333333 Q ss_pred Hh---cCcEEccCCCEEeccHhhcCCCcceeEEecCCCCC-------cHHHHHHHHHHHHHHHhccccccccceEEEEEC Q lcl|Aclame:pro 137 AL---KGKVVDARGTLYADLYKQFDVEKKTIYFDLDNPNA-------DIDASIEELRMHMEDEAKTGTVINGEEIHVVVD 206 (368) Q Consensus 137 AL---~G~i~d~~G~~~~d~~~~fG~~~~~~~~~l~~a~~-------di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g 206 (368) .+ .+..-.+.+... -|+ ....+.+.....+. 
.+...+.++.+.+.+ ...+..+.+++++ T Consensus 139 ~~~~~aa~~~~~~~~~~-----g~~-~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde-----~~VP~~~R~~vv~ 207 (347) T protein:vir:94 139 EMAILCNLPAASNENIA-----GLG-TASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTS-----NYVPAGDRYFYTT 207 (347) T ss_pred HHHHHhccccccccccC-----CCc-ccceeeccccccccchhhhHHHHHHHHHHHHHHHhh-----cCCCCCCcEEEeC Confidence 22 111111111000 011 11112221111111 223333333333322 2234456788899 Q ss_pred hHHHHHHhcCHHHHHHHHHhhhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCccccccccccccee Q lcl|Aclame:pro 207 RVFFSKLTKHPKIRDAYLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVAD 286 (368) Q Consensus 207 ~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i 286 (368) |++|..|..|+.+...... ... .++.+ ....+.|+.|++-. ..+... ...........+ T Consensus 208 P~~~~~Ll~~~~~~~~~~~--~~~-------~~~~G---------~Vg~i~G~~V~~Sn-~lp~~~--~t~~~~~~~~~~ 266 (347) T protein:vir:94 208 PDNYSAILAALMPNAANYA--ALI-------DPETG---------NIRNVMGFVVVEVP-HLVQGG--AGETRGDDGITI 266 (347) T ss_pred HHHHHHHhccchhhhhhcc--ccc-------ccccc---------ceEEEeceEEEecC-cccccc--cccccccCccee Confidence 9999999999887653211 100 11111 22466788888743 222111 111111223344 Q ss_pred cCCceeeeeeeeccccchhhh----heeeccccchhhccccc-ceeeEeeeeccCCCeeEEEeeecccccccCCceEEEE Q lcl|Aclame:pro 287 TVGVGHAFPNVAMLGEANNIF----EVAYGPCPKMGYANTLG-QELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDV 361 (368) Q Consensus 287 ~~~~~~~~p~~~~~~~~~~~f----~~~~apa~~~~~~n~~~-~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~ 361 (368) .+++.+.|+... 
+....+-| ...|.| +++.+.- ++.=...+.+++-.++.|.+-...=.-+.||++++.+ T Consensus 267 ~aG~~~~~~~~~-~~~~~~~~~~~~~l~~h~----~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~ 341 (347) T protein:vir:94 267 ASGQKHAFPATA-SSDVKVTMDNVVGLFSHR----SAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGAL 341 (347) T ss_pred cCcccccccccc-hhhhcccccceeEEEeeh----hhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEE Confidence 455555554311 00000000 001111 1111100 0001112233333344455555555667899999999 Q ss_pred EEecCC Q lcl|Aclame:pro 362 RADAKG 367 (368) Q Consensus 362 t~~a~~ 367 (368) +++++- T Consensus 342 ~~~~A~ 347 (347) T protein:vir:94 342 VFSPAE 347 (347) T ss_pred EecCCC Confidence 999888 No 27 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=37.91 E-value=1.1 Score=20.37 Aligned_cols=269 Identities=13% Similarity=0.045 Sum_probs=97.9 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhh---------------cccccccccccceEE-EEEEcCceeeeeccCCCC Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISN---------------LGLFRSAPITQTTFL-MDLTDWDVSLLDAVDRDS 64 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~---------------l~~F~~~~~~t~~i~-id~~~~~~~lvp~v~rg~ 64 (368) -.-.+.. ....|+++|=. ....+.+ --.|..+..+++.|. =++..+.- +.|-. 
+ T Consensus 27 avr~Y~K---qf~glL~~vf~---~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeN--vaFGt--G 96 (314) T protein:vir:98 27 AARSYQK---EFRQLLQAVFR---SQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDEN--VGFGE--G 96 (314) T ss_pred ceeeecH---HHHHHHHHHHh---hHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCC--ccccc--C Confidence 1111111 01122222211 1111111 112333333333322 12222211 22211 1 Q ss_pred CccccccCCceeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEc Q lcl|Aclame:pro 65 RKAETSAPERVRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVD 144 (368) Q Consensus 65 ~~~~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d 144 (368) .+..+.-+++.++++..++ ++.+....=.| |+-++--. .-+++++++||.-+...-.+......-.+|. T Consensus 97 Tg~SsRFGprkEi~y~dtd-VpY~~~~~iHE--GiD~~TVN---nd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS----- 165 (314) T protein:vir:98 97 TSRSTRFGPRREIIYQDTP-VPYTWEWVYHE--GIDKHTVN---NDFQAAVADRLDLQANAKIKQFNAQHSKFIS----- 165 (314) T ss_pred CccccccCceeEEEeeccc-ccccccchhhh--cccccccc---CChhHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Confidence 1111222233333332222 33333333232 22111101 1235678888886665544444433333332 Q ss_pred cCCCEEeccHhhcCCCcceeEE-ecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHH Q lcl|Aclame:pro 145 ARGTLYADLYKQFDVEKKTIYF-DLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAY 223 (368) Q Consensus 145 ~~G~~~~d~~~~fG~~~~~~~~-~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~ 223 (368) .-. -++..+ +++. .++...++++...--+. . 
...++++.+.+++|++|+.||.+..+- T Consensus 166 ~~A-------------s~te~ltd~~~--d~V~~LF~~as~~yvn~-e-----v~~~~~AyV~~evYnaiiD~~l~TsaK 224 (314) T protein:vir:98 166 SIA-------------EKTETLTDYSA--DNVLRLFNELSKYYVNI-E-----AIGTKAAKVSPELYNAIVDHPLTTSAK 224 (314) T ss_pred hhh-------------hhhhhhhhcch--hhHHHHHHHHHhhhhcc-e-----eeEEEEEEEchhHHhHhhccccccccc Confidence 111 111111 2332 35666666655443322 1 124689999999999999999975421 Q ss_pred HHhhhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecC--Cceeeeeeeeccc Q lcl|Aclame:pro 224 LAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTV--GVGHAFPNVAMLG 301 (368) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~--~~~~~~p~~~~~~ 301 (368) ....|+.. ...+.|.|+.+.+--..+-.. ..+++.+ +-+++|- T Consensus 225 ------~SsaNIDe-------------ngi~~FkGf~i~e~P~~~~q~----------g~ia~~s~dnig~aft------ 269 (314) T protein:vir:98 225 ------SSSANIDQ-------------NGIVNFKGFAIQEIPESMLQS----------GDVAYTYITNIGKAFT------ 269 (314) T ss_pred ------cceeeecc-------------CCcceecceEEEecchhhcCC----------CcEEEEccccceeecc------ Confidence 11122211 123567788888744333221 1111111 1111111 Q ss_pred cchhhhheeeccccchhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCC Q lcl|Aclame:pro 302 EANNIFEVAYGPCPKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKG 367 (368) Q Consensus 302 ~~~~~f~~~~apa~~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~ 367 (368) |+. 
...-=-+++++-+..+|.-=|-+- ..++++ .|++|.|..-++ T Consensus 270 GIn---~aR~IesEdF~GValQgAGK~G~~-I~edNk-----------------~Ai~k~t~tp~~ 314 (314) T protein:vir:98 270 GIN---TSRIIESEDFDGVALQGAGKAGEF-ILDDNK-----------------KAVAKVTSTPEG 314 (314) T ss_pred cce---eeeeeecccccceeeecccccccc-cccccc-----------------eeeEEEecCCCC Confidence 110 000000122222222333222222 222222 344455544444 No 28 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=37.02 E-value=1.1 Score=20.27 Aligned_cols=300 Identities=11% Similarity=0.128 Sum_probs=128.4 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCccc-cccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKAE-TSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~~-~~~~~~~~~~ 78 (368) -|-+|+... +++--+ ...+. +++..++++ ++++.+.+. |.. -+....+|.+... ....++ .+. T Consensus 22 ~Le~f~GeV-----~taF~~-----~si~~--~~~~vRtI~~gkS~qf~~l-G~s-~a~y~~pG~~ldg~~~~~dk-~~I 86 (400) T protein:vir:10 22 LIEKFNGKV-----NEQYLK-----GENIM--SYFDVQTVTGTNTVSNKYL-GET-ELQVLAPGQSPAATSTQADK-NQL 86 (400) T ss_pred HHhHhcchH-----HHHHHH-----Hhhhc--ccceeeeecccceEEEEEe-eee-EEeeecCCCCcCCCCcccCc-EEE Confidence 333333322 221111 11112 456667675 677888887 222 2233333333210 112222 223 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCcEEcc---CCCE-Eecc Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQAL-KGKVVDA---RGTL-YADL 153 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL-~G~i~d~---~G~~-~~d~ 153 (368) .+....+.+.-.=.=+|+|+ .|-+ -...+.+.+...|+++.++ ++++.+ .+.+... .|.. ... 
T Consensus 87 tIDtLL~a~~~V~dlDd~q~--~yD~--vRse~s~e~G~ALA~~~Dq-------~iiq~i~~a~~a~t~~~~~~~~g~~- 154 (400) T protein:vir:10 87 VIDATVIARNTVAHLHDVQG--DIDS--LKPKLATNQAKQLKKMEDE-------MLIQQMLLGGIANTQAKRTNPRVKG- 154 (400) T ss_pred EeCceeeecchhhhHHHHhh--cccc--ccHHHHHHHHHHHHHHHHH-------HHHHHHHHhcccccccccccCCccc- Confidence 44444444433333445543 1210 1122333445555554443 344433 3434221 1100 000 Q ss_pred HhhcCCCcceeEEecC--CCCCcHHHH---HHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHH-HHHHhh Q lcl|Aclame:pro 154 YKQFDVEKKTIYFDLD--NPNADIDAS---IEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRD-AYLAQQ 227 (368) Q Consensus 154 ~~~fG~~~~~~~~~l~--~a~~di~~~---l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~-~~~~~~ 227 (368) -|. ++.+.-. .+.+|+... +.++...+.+ ...+..++++++.+.+|+.|..|+.+.. -|... T Consensus 155 ---~g~---s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdE-----kdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s- 222 (400) T protein:vir:10 155 ---HGF---SVNVEVNEGEALVNPQYVMAAVEFALEQQLE-----QEVDISDVAILMPWRYFNVLRDADRIVDKSYTIS- 222 (400) T ss_pred ---ccc---ceeecccccccccCHHHHHHHHHHHHHHHHh-----cCCCccceEEEcCHHHHHHHHhCCcccchhcccc- Confidence 011 1222111 122344333 3333333221 1234567899999999999999974321 11100 Q ss_pred hhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhh Q lcl|Aclame:pro 228 TPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIF 307 (368) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f 307 (368) ..+.........+.||.+++-. .++...+... 
....-..+.+..|....-+....+ T Consensus 223 ----------------~~g~~~~g~v~~v~Gv~Iv~Sn-~lP~~a~~~~-----~~~lS~a~~G~~y~~t~d~s~~~a-- 278 (400) T protein:vir:10 223 ----------------QSGATIQGFVLSSYNCPVIPSN-RFPKYSQGQK-----HHLLSNEDNGYRYDPIAEMNGAIA-- 278 (400) T ss_pred ----------------CCCccccceEEEEeceEEEeeC-cCCcccCccc-----ccccccCCCCccCCccccccceeE-- Confidence 0011112223467888888743 2221110000 000111122333321111111111 Q ss_pred heeeccccchhhccc-ccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecC--CC Q lcl|Aclame:pro 308 EVAYGPCPKMGYANT-LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAK--GG 368 (368) Q Consensus 308 ~~~~apa~~~~~~n~-~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~--~~ 368 (368) ..|.| +.+++ ...++=...|.+++...+.|.+..+.=..+.||++..-+|.+-. |+ T Consensus 279 -v~F~~----sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~ 337 (400) T protein:vir:10 279 -VLFTA----DALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGA 337 (400) T ss_pred -EEEeh----hheEEEEeeccccccccchhhHHHHHHHHHHhCCcccchhheEEEEecCCcccc Confidence 12222 12333 23355566788888888888888888889999999988876532 22 No 29 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=34.81 E-value=1.3 Score=20.02 Aligned_cols=272 Identities=11% Similarity=0.069 Sum_probs=100.6 Q ss_pred cccccCCcccHHHHHH----HHHhcCCcccchhhc-c-cccccccccceEEEEEEcCceee-e-ec-----cCCC-CCcc Q lcl|Aclame:pro 2 LTNSEKSRFFLADLTG----EVQSIPNTYGYISNL-G-LFRSAPITQTTFLMDLTDWDVSL-L-DA-----VDRD-SRKA 67 (368) Q Consensus 2 ~d~f~~d~F~~~~Lt~----~i~~~p~~~~~l~~l-~-~F~~~~~~t~~i~id~~~~~~~l-v-p~-----v~rg-~~~~ 67 (368) |---|+|. .++.+++ .++.+=.....+.+. | |=-...+.-+...+..+.....+ + ++ +.=| +.+. 
T Consensus 1 m~t~N~n~-avr~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGtgTg~ 79 (286) T protein:vir:94 1 MATTNNDL-PVRVYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGTGTSN 79 (286) T ss_pred CCCCcccc-ceeehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCccccccCCcc Confidence 33334332 2332222 222221111111111 0 00012222233333333222211 1 11 1100 1111 Q ss_pred ccccCCceeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCC Q lcl|Aclame:pro 68 ETSAPERVRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARG 147 (368) Q Consensus 68 ~~~~~~~~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G 147 (368) .+.-+++.++++..++ ++.+....=.| ++-++--. .-+++.+++||.-+...-.+......-.+|. .-. T Consensus 80 SsRFG~rkEi~y~dtd-V~Y~~~~~iHE--GiD~~TVN---nd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls-----~~A 148 (286) T protein:vir:94 80 SSRFGEMKEVIYADTD-VPYTAGWAIHE--GLDQMTVN---NDLDAAVADRLNLQAQAKTRLFNVAMGEALA-----TAG 148 (286) T ss_pred ccccCceeeEEeeccc-ccccccchhhh--cccccccc---CChhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hhh Confidence 1122222333322222 33333332222 12111101 1235678888886655444444333333332 100 Q ss_pred CEEeccHhhcCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhh Q lcl|Aclame:pro 148 TLYADLYKQFDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQ 227 (368) Q Consensus 148 ~~~~d~~~~fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~ 227 (368) + ++.++ -++...++++...--+. . 
...++.+.+.+++|++|+.||.+..+- T Consensus 149 ~-------------~t~~~------D~V~~LF~~as~~yvn~-e-----v~~~~~ayV~~evYnaiiD~~l~TsaK---- 199 (286) T protein:vir:94 149 T-------------DLGAV------DDVNALFESAVEKYTDL-E-----VIAPVRAYVTASVYNAIIDLANVTTAK---- 199 (286) T ss_pred h-------------hhhhh------hhHHHHHHHHHHHhhhh-h-----eeeeeEEEEchhHHHHHhccccccccc---- Confidence 0 11112 14444455444333221 1 124567899999999999999975321 Q ss_pred hhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhh Q lcl|Aclame:pro 228 TPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIF 307 (368) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f 307 (368) ....|+.. ...+.|.|+.+.+--..|.. |...-+-+ .+-++.|- |+. T Consensus 200 --~SsaNiDe-------------ngi~~FkGf~i~e~P~~~~~--g~~aifs~-------dnig~aft------GIn--- 246 (286) T protein:vir:94 200 --NSAVNIDT-------------NGMLSFRGIAITKVPTQYMG--GKAVIFAP-------DNVARVFT------GIN--- 246 (286) T ss_pred --cceeeecc-------------CCcceecceEEeecchhhcc--CceEEEcc-------ccceeeec------cce--- Confidence 11122211 12346778888875544332 22221111 11222221 111 Q ss_pred heeeccccchhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEec Q lcl|Aclame:pro 308 EVAYGPCPKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA 365 (368) Q Consensus 308 ~~~~apa~~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a 365 (368) ...-=-+++++-++.+|.-=|-+--.+++ -.|+++.+-+| T Consensus 247 ~aR~IesEdF~GValQgAGK~G~~I~edN------------------k~Ai~~~~~k~ 286 (286) T protein:vir:94 247 IARTIQAIDFAGVELQGAGKYGTFILDDN------------------KKAIFTATPKA 286 (286) T ss_pred eeeeeeccccCceeeeccccccccccccC------------------ceeEEEeecCC Confidence 01111123444444444443333323332 23556666666 No 30 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=27.93 E-value=1.8 Score=19.19 Aligned_cols=303 
Identities=11% Similarity=0.110 Sum_probs=124.7 Q ss_pred CcccccCCcccHHHHHHHHHhcCCcccchhhcccccccccc-cceEEEEEEcCceeeeeccCCCCCcc-ccccCCceeEE Q lcl|Aclame:pro 1 MLTNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDRDSRKA-ETSAPERVRQI 78 (368) Q Consensus 1 ~~d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~-t~~i~id~~~~~~~lvp~v~rg~~~~-~~~~~~~~~~~ 78 (368) -|-.|+... +++--+ ...+ ++++..++++ ++++.+.+. |. .-+....+|.+.. +....++ .+. T Consensus 22 ~Le~f~GeV-----~taF~~-----~si~--~~~~~vRti~~gkS~qf~~~-G~-s~~~~~~pG~~ld~~~~~~dK-~~I 86 (401) T protein:vir:70 22 LIEKFNGKV-----NEQYLK-----GENI--MSYFDVQTVTGTNTVSNKYL-GE-TELQVLAPGQSPAATSTQADK-NQL 86 (401) T ss_pred HHhHhcchH-----HHHHHH-----Hhhh--cccceeeeecccceEEEEEe-ee-eEeeeecCCCCcCCCCccccc-EEE Confidence 333333322 221111 1111 2456667665 677888887 22 2233333343321 0111122 122 Q ss_pred EEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcEEccCCCEEeccHhhc Q lcl|Aclame:pro 79 SFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALK-GKVVDARGTLYADLYKQF 157 (368) Q Consensus 79 ~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~-G~i~d~~G~~~~d~~~~f 157 (368) .+....+.+--.=.=+|+|+ .|... ...+.+.+.+.|+++.++ +++|.+. ..+.+..+..-. .+ T Consensus 87 tID~lL~a~~~V~dlDe~q~--~yD~v--Rse~s~e~G~ALA~~~Dq-------~iiq~i~~aa~ana~~~~~~----p~ 151 (401) T protein:vir:70 87 VIDATVIARNTVAHLHDVQG--DIDSL--KPKLATNQAKQLKRMEDE-------MLIQQMMLGGIANTQAKRTN----PR 151 (401) T ss_pred EeCceeehhhhhhhHHHHHh--ccccc--chHHHHHHHHHHHHHHHH-------HHHHHHHHhccccccccccC----CC Confidence 33333333333323344443 12110 112233344444444332 3344431 222221110000 00 Q ss_pred CCC-cceeEEecC--CCCCc---HHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHH-HHHHhhhhh Q lcl|Aclame:pro 158 DVE-KKTIYFDLD--NPNAD---IDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRD-AYLAQQTPL 230 (368) Q Consensus 158 G~~-~~~~~~~l~--~a~~d---i~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~-~~~~~~~~~ 230 (368) |.. ...+++.-. ++..+ +...+.++...+.+ -..+..++++|+.+.+|..|..|+.+.. .|.+.. 
T Consensus 152 ~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdE-----kdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~--- 223 (401) T protein:vir:70 152 VKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLE-----QEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQ--- 223 (401) T ss_pred cCCCceEEeccccccccccCHHHHHHHHHHHHHHHHh-----cCCCccceEEEcCHHHHHHHHhcCcccchhhcccc--- Confidence 100 011222211 11234 44444444433322 1234568999999999999999985432 111000 Q ss_pred hhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhhee Q lcl|Aclame:pro 231 AWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVA 310 (368) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~ 310 (368) .+.........+.||.+++-.. ++...+... ....-..+.+..|....-+....+ .. T Consensus 224 --------------~g~~~~G~v~~vaGv~Vv~Snn-lP~~a~~it-----~~~ls~a~~G~~y~~~~d~s~~~~---v~ 280 (401) T protein:vir:70 224 --------------SGATIQGFTLSSYNCPVIPSNR-FPKYSQGQT-----HHLLSNEDNGYRYDPLPAMNGAIA---VL 280 (401) T ss_pred --------------CCccccceEEEEeceEEEeecc-ccccccccc-----cccccccCCCccCCCCccccceeE---EE Confidence 0111122334567888887432 211110000 001111222333321111111111 11 Q ss_pred eccccchhhccc-ccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 311 YGPCPKMGYANT-LGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 311 ~apa~~~~~~n~-~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) |.| +.+++ ...++=...|.+++...+.|.+....=..+.||++..-+|.+-.+- T Consensus 281 f~~----~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~~ 335 (401) T protein:vir:70 281 FTA----DALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNTT 335 (401) T ss_pred Eeh----hheEEEEeeccccchhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCccc Confidence 222 12332 2234555668888888888888888888899999996665543311 No 31 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=23.73 
E-value=2.3 Score=18.63 Aligned_cols=267 Identities=10% Similarity=-0.000 Sum_probs=94.1 Q ss_pred Ccc--cccCCcccHHHHHHHHHhcCCcccchhhcccccc--cccccceEEEEEEcCceeeeeccCCCCCccccccCCcee Q lcl|Aclame:pro 1 MLT--NSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRS--APITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVR 76 (368) Q Consensus 1 ~~d--~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~--~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~ 76 (368) |.+ =-..+.+-...+...|...-.....+..+..-.. .+....+|.|-..+. ..-+..+.-|..- ...+-. .. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i-~~~~~~-~~ 77 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAI-PMTQLG-FK 77 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcc-cccccc-cc Confidence 442 1112233333333333221111112222211111 122234454433321 1112222222211 111111 11 Q ss_pred EEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhh Q lcl|Aclame:pro 77 QISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQ 156 (368) Q Consensus 77 ~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~ 156 (368) ...+.+-++... +...|....++.+ +... .-..++-+.+.+..|..++.++.|...... T Consensus 78 ~~~~~~~~~~~~--~~itd~~~~~s~~--d~~~-------~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~---------- 136 (272) T protein:vir:30 78 KTTMTIKKAGKG--VEITDEAILSGYG--DPVG-------QAAKQIVEAIDHKVDADVLDALSKSTQTVE---------- 136 (272) T ss_pred eEEEEeeeeeee--eeecHHHHhhccc--cHHH-------HHHHHHHHHHHHHHHHHHHHHhcccccccc---------- Confidence 223333333322 2222322222221 2222 222233334444555556666654321110 Q ss_pred cCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 157 FDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQIT 236 (368) Q Consensus 157 fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~ 236 (368) +......+.++...+.+. . .....++|+|++|..|+.++... +...... .. 
T Consensus 137 ---------------~~~t~d~i~da~~~l~~~-----~--~~~~~~vv~p~~~~~L~k~~~~~-----~~~~~~~--~~ 187 (272) T protein:vir:30 137 ---------------ATATVDGVSKALDIFNDE-----D--DAETVIVMNPADASTLRLDAAKE-----WLGATEV--GA 187 (272) T ss_pred ---------------cccCHHHHHHHHHHHhcc-----C--CCccEEEEcHHHHHHHHHhcccc-----ccccccc--cc Confidence 011123344444444322 1 23346889999999988664432 1111000 00 Q ss_pred cccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccc Q lcl|Aclame:pro 237 GSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPK 316 (368) Q Consensus 237 ~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~ 316 (368) ..++.+ .. -.+.|+.++.-. .++.+++.++-. +.+..+.. T Consensus 188 ~~~~~g-------~i--g~i~G~~Vi~s~-------------------~~p~~t~~~~~~--------~a~~~~~~---- 227 (272) T protein:vir:30 188 NRVVSG-------VY--GEVLGVQIVRSR-------------------KCPKGTAYMVRK--------GALRIMLK---- 227 (272) T ss_pred cccccc-------cc--hhhcCeeEEEcC-------------------CCCcceEEEEcC--------CeEEEEec---- Confidence 001110 11 135677666521 122233332211 11111111 Q ss_pred hhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 317 MGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 317 ~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) .+... 
-.+.+..-.--.+.+-.....-..+|++++++|+++++- T Consensus 228 ------~~~~v--e~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~ 271 (272) T protein:vir:30 228 ------RNTMV--ETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAK 271 (272) T ss_pred ------CCcee--eeccccccceeEEEEEEEEEEEEEcCCceEEEEeccccc Confidence 11110 011111101122333333344566999999999999998 No 32 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=23.73 E-value=2.3 Score=18.63 Aligned_cols=267 Identities=10% Similarity=-0.000 Sum_probs=94.1 Q ss_pred Ccc--cccCCcccHHHHHHHHHhcCCcccchhhcccccc--cccccceEEEEEEcCceeeeeccCCCCCccccccCCcee Q lcl|Aclame:pro 1 MLT--NSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRS--APITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPERVR 76 (368) Q Consensus 1 ~~d--~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~--~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~~~~ 76 (368) |.+ =-..+.+-...+...|...-.....+..+..-.. .+....+|.|-..+. ..-+..+.-|..- ...+-. .. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i-~~~~~~-~~ 77 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAI-PMTQLG-FK 77 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcc-cccccc-cc Confidence 442 1112233333333333221111112222211111 122234454433321 1112222222211 111111 11 Q ss_pred EEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccCCCEEeccHhh Q lcl|Aclame:pro 77 QISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYKQ 156 (368) Q Consensus 77 ~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~G~~~~d~~~~ 156 (368) ...+.+-++... +...|....++.+ +... .-..++-+.+.+..|..++.++.|...... 
T Consensus 78 ~~~~~~~~~~~~--~~itd~~~~~s~~--d~~~-------~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~---------- 136 (272) T protein:vir:98 78 KTTMTIKKAGKG--VEITDEAILSGYG--DPVG-------QAAKQIVEAIDHKVDADVLDALSKSTQTVE---------- 136 (272) T ss_pred eEEEEeeeeeee--eeecHHHHhhccc--cHHH-------HHHHHHHHHHHHHHHHHHHHHhcccccccc---------- Confidence 223333333322 2222322222221 2222 222233334444555556666654321110 Q ss_pred cCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 157 FDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAWQQIT 236 (368) Q Consensus 157 fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~~~~~~~~~~~ 236 (368) +......+.++...+.+. . .....++|+|++|..|+.++... +...... .. T Consensus 137 ---------------~~~t~d~i~da~~~l~~~-----~--~~~~~~vv~p~~~~~L~k~~~~~-----~~~~~~~--~~ 187 (272) T protein:vir:98 137 ---------------ATATVDGVSKALDIFNDE-----D--DAETVIVMNPADASTLRLDAAKE-----WLGATEV--GA 187 (272) T ss_pred ---------------cccCHHHHHHHHHHHhcc-----C--CCccEEEEcHHHHHHHHHhcccc-----ccccccc--cc Confidence 011123344444444322 1 23346889999999988664432 1111000 00 Q ss_pred cccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhhhheeeccccc Q lcl|Aclame:pro 237 GSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIFEVAYGPCPK 316 (368) Q Consensus 237 ~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~f~~~~apa~~ 316 (368) ..++.+ .. -.+.|+.++.-. .++.+++.++-. +.+..+.. 
T Consensus 188 ~~~~~g-------~i--g~i~G~~Vi~s~-------------------~~p~~t~~~~~~--------~a~~~~~~---- 227 (272) T protein:vir:98 188 NRVVSG-------VY--GEVLGVQIVRSR-------------------KCPKGTAYMVRK--------GALRIMLK---- 227 (272) T ss_pred cccccc-------cc--hhhcCeeEEEcC-------------------CCCcceEEEEcC--------CeEEEEec---- Confidence 001110 11 135677666521 122233332211 11111111 Q ss_pred hhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecCCC Q lcl|Aclame:pro 317 MGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAKGG 368 (368) Q Consensus 317 ~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~~~ 368 (368) .+... -.+.+..-.--.+.+-.....-..+|++++++|+++++- T Consensus 228 ------~~~~v--e~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~ 271 (272) T protein:vir:98 228 ------RNTMV--ETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAK 271 (272) T ss_pred ------CCcee--eeccccccceeEEEEEEEEEEEEEcCCceEEEEeccccc Confidence 11110 011111101122333333344566999999999999998 No 33 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=23.52 E-value=2.3 Score=18.61 Aligned_cols=258 Identities=9% Similarity=-0.008 Sum_probs=89.7 Q ss_pred Cc-------ccccCCcccHHHHHHHHHhcCCcccchhhcccccccccccceEEEEEEcCceeeeeccCCCCCccccccCC Q lcl|Aclame:pro 1 ML-------TNSEKSRFFLADLTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTDWDVSLLDAVDRDSRKAETSAPE 73 (368) Q Consensus 1 ~~-------d~f~~d~F~~~~Lt~~i~~~p~~~~~l~~l~~F~~~~~~t~~i~id~~~~~~~lvp~v~rg~~~~~~~~~~ 73 (368) |. |++.=..|+.. +.+.+++ ...+..+..-... ++-..|...-+|+-..-+......++. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~-v~~~~~~----~~~~~~~~~~~~~--------l~g~~G~ti~iP~~~~~gda~~~~eg~ 67 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPI-VSYELNK----ALRFAPLAQVDTT--------LQGQPGNTLKFPAFTYIGDAADVAEGG 67 (272) T ss_pred CCCcceehhhhhchHHHHHH-HHHHHHh----hhhhccccccccc--------cccCCCCEEEEeeeccCccccccCCCC Confidence 33 33222333221 1111111 1111221111110 011112222233321111111111100 Q ss_pred c-------eeEEEEecccccccccccHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEccC Q lcl|Aclame:pro 74 R-------VRQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDAR 146 (368) Q Consensus 74 ~-------~~~~~f~~p~~~~~~~i~a~dlq~~R~~G~~~~~~~~~~~v~~~l~~~~~~i~~t~E~m~a~AL~G~i~d~~ 146 (368) . .+.....+-+. ...+...|+....+. .+.+..-..++-..+.+..+..++.+|.|...... T Consensus 68 ~i~~~~lt~~~~~~~i~~~--~k~~~vtD~~~~~~~---------~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~ 136 (272) T protein:vir:36 68 EISLDKIGTTTKSVTIKKA--AKGTEITDEAALSGY---------GDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVS 136 (272) T ss_pred ccChhhcCCcceeEeeehh--hccccccHHHHhhcc---------chHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 0 00011111111 112222222222221 12334444444555566677777777776432211 Q ss_pred CCEEeccHhhcCCCcceeEEecCCCCCcHHHHHHHHHHHHHHHhccccccccceEEEEEChHHHHHHhcCHHHHHHHHHh Q lcl|Aclame:pro 147 GTLYADLYKQFDVEKKTIYFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQ 226 (368) Q Consensus 147 G~~~~d~~~~fG~~~~~~~~~l~~a~~di~~~l~~~~~~i~~~~~~g~~~~~~~~~~l~g~~~~~~l~~h~~v~~~~~~~ 226 (368) + ..-...+.++...+.++- .....++|+|+.+..|++++......... 
T Consensus 137 ~-------------------------~~~~d~i~~A~~~lgd~~-------~~~~~ivv~p~~~~~L~k~~~~~~~~~~~ 184 (272) T protein:vir:36 137 T-------------------------KANVDGVQAALDIFNDED-------AQAYVLIVNPKDAAKIRKDANAKNIGSEV 184 (272) T ss_pred c-------------------------cccHHHHHHHHHHhhhcC-------CCceEEEEcHHHHHHHhcccccccccccc Confidence 1 111223444444443321 13456889999999998876654322110 Q ss_pred hhhhhhhhhhcccccccccccccccceeeeCCEEEEEccccccCCCcccccccccccceecCCceeeeeeeeccccchhh Q lcl|Aclame:pro 227 QTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNI 306 (368) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~gi~~~~y~~~~~~~~g~~~~~~~~~~~~i~~~~~~~~p~~~~~~~~~~~ 306 (368) .+ ..++.+ ....+.|+.++.-+. ++.+++..... ...++. T Consensus 185 ~~--------~~~~~G---------~ig~~~G~~Vv~s~~-------------------~p~~~~~~~~~----~~~~gA 224 (272) T protein:vir:36 185 GA--------NALING---------TYADVLGAQIVRSKK-------------------LAEGSALMFKI----VSNSPA 224 (272) T ss_pred cc--------cceeee---------ccceecCeeEEEeCC-------------------CCCCceeEEEE----Eecccc Confidence 00 001111 112467777775221 12222211110 011222 Q ss_pred hheeeccccchhhcccccceeeEeeeeccCCCeeEEEeeecccccccCCceEEEEEEecC Q lcl|Aclame:pro 307 FEVAYGPCPKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAK 366 (368) Q Consensus 307 f~~~~apa~~~~~~n~~~~~~y~~~~~~~~~~~~~l~~eS~pLpv~~rP~al~~~t~~a~ 366 (368) +..+-......| .. + ..++.+ -.|..-...-.-..+|+.++++|.+.- T Consensus 225 ~~~~~~~~~~vE-~~-----R-----~~~~~~-d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 225 LKLVLKRGVQVE-TD-----R-----DIVTKT-TVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeeeecCCcccc-cc-----c-----chhhcC-cEEEEEEEEEEEEEcCccEEEEeecCC Confidence 222111110111 00 0 000111 122222223444579999999999877 Done!